
Software as a service for data scientists

Published: 01 February 2012

Abstract

Globus Online manages fire-and-forget file transfers for big-data, high-performance scientific collaborations.


Published in

Communications of the ACM, Volume 55, Issue 2
February 2012
111 pages
ISSN: 0001-0782
EISSN: 1557-7317
DOI: 10.1145/2076450

                                Copyright © 2012 ACM

                                Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                                Publisher

                                Association for Computing Machinery

                                New York, NY, United States



                                Qualifiers

                                • research-article
                                • Popular
                                • Refereed
