skip to main content
10.1145/3093172.3093238acmconferencesArticle/Chapter ViewAbstractPublication PagespascConference Proceedingsconference-collections
research-article
Open Access

A Scalable Object Store for Meteorological and Climate Data

Authors Info & Claims
Published:26 June 2017Publication History

ABSTRACT

Numerical Weather Prediction (NWP) and Climate simulations sit in the intersection between classically understood High Performance Computing (HPC) and the Big Data / High Performance Data Analytics (HPDA) communities. Driven by ever more ambitious scientific goals, both the size and number of output data elements generated as part of NWP operations has grown by several orders of magnitude, and will continue to grow into the future. The total amount of data is expected to grow exponentially with time, and over the last 30 years this increase has been approximately 40% per year. This poses significant scalability challenges for the data processing pipeline, and the movement of data through and between stages is one of the most significant factors in this. At ECMWF, meteorological data within the HPC facility is stored in an indexed data store for retrieval according to a well defined schema of meteorological metadata. This paper discusses the design and implementation of the next version (5th) of this indexed data store, which aims to increase the range of contexts within the operational workflow in which it can be used, and to increase its tolerance to failure. Further, it aims to pre-emptively head off some upcoming scalability bottlenecks present in the previous versions.

References

  1. Amazon. 2016. AWS Storage Services Whitepaper. https://d0.awsstatic.com/whitepapers/Storage/AWS%20Storage%20Services%20Whitepaper-v9.pdf. (2016).Google ScholarGoogle Scholar
  2. Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, and Robert E. Gruber. 2006. Bigtable: A Distributed Storage System for Structured Data. In Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation - Volume 7 (OSDI '06). USENIX Association, Berkeley, CA, USA, 15--15. http://dl.acm.org/citation.cfm?id=1267308.1267323 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Cray Inc. 2014. Cray XC40 DataWarp applications I/O accelerator. http://www.cray.com/sites/default/files/resources/CrayXC40-DataWarp.pdf. (2014).Google ScholarGoogle Scholar
  4. Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels. 2007. Dynamo: amazonâĂŹs highly available key-value store. In IN PROC. SOSP. 205--220. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Brian Eaton, Jonathan Gregory, Bob Drach, Karl Taylor, Steve Hankin, John Caron, Rich Signell, Phil Bentley, Greg Rappa, Heinke HÃűck, Alison Pamment, and Martin Juckes. 2011. NetCDF Climate and Forecast (CF) Metadata Conventions. http://cfconventions.org/cf-conventions/v1.6.0/cf-conventions.pdf. (2011).Google ScholarGoogle Scholar
  6. ECMWF. 2015. ECMWF Strategy 2016--2015, The strength of a common goal. http://www.ecmwf.int/sites/default/files/ECMWF_Strategy_2016-2025.pdf. (2015).Google ScholarGoogle Scholar
  7. Brad Fitzpatrick. 2004. Distributed Caching with Memcached. Linux J. 2004, 124 (Aug. 2004), 5--. http://dl.acm.org/citation.cfm?id=1012889.1012894 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Erich Gamma, Richard Helm, Ralph Johnson, and John Vlissides. 1995. Design Patterns: Elements of Reusable Object-oriented Software. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Intel. 2014. DAOS API Design Document. https://wiki.hpdd.intel.com/download/attachments/12127153/DAOS%202.5%20DAOS%20API%20and%20DAOS%20POSIX%20Design%20Document.pdf. (2014).Google ScholarGoogle Scholar
  10. Intel Corporation. 2017. pmem.io Persistent Memory Programming. http://pmem.io. (2017).Google ScholarGoogle Scholar
  11. Avinash Lakshman and Prashant Malik. 2010. Cassandra: A Decentralized Structured Storage System. SIGOPS Oper. Syst. Rev. 44, 2 (April 2010), 35--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yann Meurdesoif, Ozdoba H., Caubel A., and Marti O. 2012. XIOS. http://forge.ipsl.jussieu.fr/ioserver/raw-attachment/wiki/WikiStart/XIOS_IO_Workshop_Hamburg.pdf. (2012).Google ScholarGoogle Scholar
  13. NextGenIO. 2016. NextGenIO website. http://www.nextgenio.eu/. (2016).Google ScholarGoogle Scholar
  14. University of Michigan. Department of Electrical Engineering, Computer Science. Computer Science, Engineering Division, D Thaler, and C Ravishankar. 1996. A name-based mapping scheme for rendezvous.Google ScholarGoogle Scholar
  15. B. Raoult. 2012. Architecture of the new MARS server. (2012).Google ScholarGoogle Scholar
  16. Redis MPI Forum. 2017. Redis Cluster Specification. https://redis.io/topics/cluster-spec. (2017).Google ScholarGoogle Scholar
  17. SAGE. 2016. Data Storage for Extreme Scale; The SAGE Project Technical White Paper. http://sagestorage.eu/sites/default/files/Sage%20White%20Paper%20v1.0.pdf. (2016).Google ScholarGoogle Scholar
  18. SAGE. 2016. Percipient StorAGe for Exascale. http://www.sagestorage.eu. (2016).Google ScholarGoogle Scholar
  19. The HDF Group. 2017. Parallel HDF5. https://support.hdfgroup.org/HDF5/PHDF5. (2017).Google ScholarGoogle Scholar
  20. The MPI Forum. 2012. MPI: A Message Passing Interface. http://mpi-forum.org/docs/mpi-3.0/mpi30-report.pdf. (2012).Google ScholarGoogle Scholar
  21. University Corporation for Atmospheric Research. 2016. NetCDF Format. www.unidata.ucar.edu/software/netcdf/docs. (2016).Google ScholarGoogle Scholar
  22. Richard W Watson and Robert A Coyne. 1995. The parallel I/O architecture of the high-performance storage system (HPSS). In Mass Storage Systems, 1995. 'Storage-At the Forefront of Information Infrastructures', Proceedings of the Fourteenth IEEE Symposium on. IEEE, 27--44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. World Meteorological Organization. 2013. GRIB Format. http://www.wmo.int/pages/prog/www/DPS/FM92-GRIB2-11-2003.pdf. (2013).Google ScholarGoogle Scholar

Index Terms

  1. A Scalable Object Store for Meteorological and Climate Data

              Recommendations

              Comments

              Login options

              Check if you have access through your login credentials or your institution to get full access on this article.

              Sign in
              • Published in

                cover image ACM Conferences
                PASC '17: Proceedings of the Platform for Advanced Scientific Computing Conference
                June 2017
                136 pages
                ISBN:9781450350624
                DOI:10.1145/3093172

                Copyright © 2017 Owner/Author

                This work is licensed under a Creative Commons Attribution International 4.0 License.

                Publisher

                Association for Computing Machinery

                New York, NY, United States

                Publication History

                • Published: 26 June 2017

                Permissions

                Request permissions about this article.

                Request Permissions

                Check for updates

                Qualifiers

                • research-article
                • Research
                • Refereed limited

                Acceptance Rates

                PASC '17 Paper Acceptance Rate13of33submissions,39%Overall Acceptance Rate83of185submissions,45%

                Upcoming Conference

                PASC '24
                Platform for Advanced Scientific Computing Conference
                June 3 - 5, 2024
                Zurich , Switzerland

              PDF Format

              View or Download as a PDF file.

              PDF

              eReader

              View online with eReader.

              eReader