Abstract
Globus Online manages fire-and-forget file transfers for big-data, high-performance scientific collaborations.
- Allcock, B., Bresnahan, J., Kettimuthu, R., Link, M., Dumitrescu, C., Raicu, I., and Foster, I. The Globus striped GridFTP framework and server. In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing (Seattle, Nov. 12--18). ACM Press, New York, 2005. Google ScholarDigital Library
- Bell, G., Hey, T., and Szalay, A. Beyond the data deluge. Science 323, 5919 (Mar. 2009), 1297--1298.Google ScholarCross Ref
- Berriman, G.B. and Groom, S. How will astronomy archives survive the data tsunami? Commun. ACM 54, 12 (Dec. 2011), 52--56. Google ScholarDigital Library
- Chervenak, A., Schuler, R., Kesselman, C., Koranda, S. and Moe, B. Wide-area data replication for scientific collaborations. In Proceedings of the Sixth IEEE/ACM International Workshop on Grid Computing (Seattle, Nov. 13). IEEE Computer Society, Washington, D.C., 2005. Google ScholarDigital Library
- Childers, L., Liming, L., and Foster, I. Perspectives on Distributed Computing: 30 People, Four User Types, and the Distributed Computing User Experience, Technical Report ANL/MCS/CI-31. Argonne National Laboratory, Argonne, IL, 2008.Google Scholar
- Cho, B. and Gupta, I., Budget-constrained bulk data transfer via internet and shipping networks. In Proceedings of the Eighth ACM international conference on Autonomic Computing (Karlsruhe, Germany, June 14-16). ACM Press, New York, 2011, 71--80. Google ScholarDigital Library
- Cholia, S., Skinner, D., and Boverhof, J. NEWT: A RESTful service for building high-performance computing Web applications. In Proceedings of the 2010 Gateway Computing Environments Workshop (New Orleans, Nov. 14). IEEE Computer Society Press, 2010, 1--11.Google ScholarCross Ref
- Cohen, B. Incentives build robustness in BitTorrent. In Proceedings of the First International Workshop on Economics of P2P Systems (Berkeley, CA, June 5--6, 2003).Google Scholar
- Egeland, R., Wildishb, T., and Huang, C.-H. PhEDEx data service. Journal of Physics: Conference Series 219 (2010).Google Scholar
- Erdos, M. and Cantor, S. Shibboleth Architecture. Internet 2, May 2, 2002; http://shibboleth.internet2.edu/docs/draft-internet2-shibboleth-arch-v05.pdfGoogle Scholar
- Gray, J., Chong, W., Barclay, T., Szalay, A., and Vandenberg, J. TeraScale SneakerNet: Using Inexpensive Disks for Backup, Archiving, and Data Exchange Technical Report MSR-TR 2002-54. Microsoft Research, Redmond, WA, 2002.Google Scholar
- Hammer-Lahav, E. The OAuth 1.0 Protocol. Internet Engineering Task Force RFC 5849, 2010; http://tools.ietf.org/html/rfc5849Google Scholar
- Hanushevsky, A., Trunov, A., and Cottrell, L. Peer-to-peer computing for secure high-performance data copying. In Proceedings of the 2001 International Conference on Computing in High Energy and Nuclear Physics (Beijing, Sept. 3--7, 2001).Google Scholar
- Kosar, T. and Livny, M. A framework for reliable and efficient data placement in distributed computing systems. Journal of Parallel and Distributed Computing 65, 10 (Oct. 2005), 1146--1157. Google ScholarDigital Library
- Monti, H., Butt, A.R., and Vazhkudai, S.S. CATCH: A cloud-based adaptive data-transfer service for HPC. In Proceedings of the 25 th IEEE International Parallel & Distributed Processing Symposium (Anchorage, Alaska, May 16--20). IEEE Computer Society, 2011, 1242--1253. Google ScholarDigital Library
- Novotny, J., Tuecke, S., and Welch, V. An online credential repository for the grid: MyProxy. In Proceedings of the 10 th IEEE International Symposium on High-Performance Distributed Computing (San Francisco, Aug. 7--9). IEEE Computer Society Press, Washington, D.C., 2001, 104--111. Google ScholarDigital Library
- Rajasekar, A., Moore, R., Hou, C.-Y., Lee, C.A., Marciano, R., de Torcy, A., Wan, M., Schroeder, W., Chen, S.-Y., Gilbert, L., Tooby, P., and Zhu, B. iRODS Primer: Integrated Rule-Oriented Data System. Morgan and Claypool Publishers, 2010. Google ScholarDigital Library
- Sun, W., Zhang, K., Chen, S.-K., Zhang, X., and Liang, H. Software as a service: An integration perspective. In Proceedings of the Fifth International Conference on Service-Oriented Computing, B. Krämer, K.-J. Lin, and P. Narasimhan, Eds. (Vienna, Austria, Sept. 17--20). Springer, Berlin/Heidelberg, 2007, 558--569. Google ScholarDigital Library
- Thain, D., Basney, J., Son, S.-C., and Livny, M. The Kangaroo approach to data movement on the grid. In Proceedings of the 10 th IEEE International Symposium on High-Performance Distributed Computing (San Francisco, Aug. 7--9). IEEE Computer Society Press, Washington, D.C., 2001, 325--333. Google ScholarDigital Library
- Tridgell, A. and Mackerras, P. The Rsync Algorithm TR-CS-96-05. Department of Computer Science, Australian National University, Canberra, 1994.Google Scholar
- Wang, L., Park, K.S., Pang, R., Pai, V., and Peterson, L. Reliability and security in the CoDeeN content distribution network. In Proceedings of the USENIX Annual Technical Conference (Boston, June 27--July 2). USENIX Association, Berkeley, CA, 2004, 171--184. Google ScholarDigital Library
- Welch, V., Foster, I., Kesselman, C., Mulmo, O., Pearlman, L., Tuecke, S., Gawor, J., Meder, S., and Siebenlist, F. X.509 proxy certificates for dynamic delegation. In Proceedings of the Third Annual Public Key Infrastructure R&D Workshop (Gaithersburg, MD, Apr. 12--14), National Institute of Standards and Technology, Gaithersburg, MD, 2004.Google Scholar
Index Terms
- Software as a service for data scientists
Recommendations
Software Design for Empowering Scientists
Scientific research is increasingly digital. Some activities, such as data analysis, search, and simulation, can be accelerated by letting scientists write workflows and scripts that automate routine activities. These capture pieces of the scientific ...
Service-oriented middleware for distributed data mining on the grid
Distribution of data and computation allows for solving larger problems and executing applications that are distributed in nature. The grid is a distributed computing infrastructure that enables coordinated resource sharing within dynamic organizations ...
A Grid service broker for scheduling e-Science applications on global data Grids: Research Articles
Middleware for Grid ComputingThe next generation of scientific experiments and studies, popularly called e-Science, is carried out by large collaborations of researchers distributed around the world engaged in the analysis of huge collections of data generated by scientific ...
Comments