ABSTRACT
There is an expectation that scientists will archive their experimental data online in public repositories to enable other investigators to verify their work and to re-explore their data in search of new discoveries. When left to their own devices, however, scientists do a poor job creating the metadata that describe their datasets. A lack of standardization makes it difficult for other investigators to find relevant datasets and to perform secondary analyses. The Center for Expanded Data Annotation and Retrieval (CEDAR) was founded with the goal of enhancing the authoring of experimental metadata to make online datasets more useful to the scientific community. CEDAR technology includes Web-based methods for creating and managing libraries of templates for representing metadata. CEDAR's templates interoperate with a repository of scientific ontologies to standardize the way in which the templates may be filled out. Collaborations with several major research projects are allowing us to explore how CEDAR may ease access to scientific data sets stored in public repositories.
- Rafael S Gonçalves, Martin J. OConnor, Marcos Martínez-Romero, et al. 2017. Metadata in the BioSample online repository are impaired by numerous anomalies. In Proceedings of the First Workshop on Enabling Open Semantic Science (SemSci). 39--46. http://ceur-ws.org/Vol-1931/#paper-06Google Scholar
- Marcos Martínez-Romero, Martin J OConnor, R Shankar, et al. 2017. Fast and accurate metadata authoring using ontology-based recommendations. In Proceedings of the AMIA Annual Symposium.Google Scholar
- Mark A Musen, Carol A Bean, Kei-Hoi Cheung, et al. 2015. The center for expanded data annotation and retrieval. Journal of the American Medical Informatics Association 22, 6 (2015), 1148--1152.Google ScholarCross Ref
- Martin J OConnor, Marcos Martínez-Romero, Attila L Egyedi, et al. 2016. An open repository model for acquiring knowledge about scientific experiments. In European Knowledge Acquisition Workshop. Springer, 762--777. Google ScholarDigital Library
- Florian Rubelt, Christian E Busse, Syed Ahmad Chan Bukhari, et al. 2017. Adaptive Immune Receptor Repertoire Community recommendations for sharing immunerepertoire sequencing data. Nature Immunology 18, 12 (2017), 1274.Google ScholarCross Ref
- Uma D Vempati, Caty Chung, Chris Mader, et al. 2014. Metadata standard and data exchange specifications to describe, model, and integrate complex and diverse high-throughput screening data from the Library of Integrated Network-based Cellular Signatures (LINCS). Journal of Biomolecular Screening 19, 5 (2014), 803-- 816.Google ScholarCross Ref
- Mark D Wilkinson, Michel Dumontier, IJsbrand Jan Aalbersberg, et al. 2016. The FAIR guiding principles for scientific data management and stewardship. Scientific Data 3 (2016).Google Scholar
Recommendations
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata that Describe Scientific Experiments
The Semantic Web – ISWC 2017AbstractThe Center for Expanded Data Annotation and Retrieval (CEDAR) aims to revolutionize the way that metadata describing scientific experiments are authored. The software we have developed—the CEDAR Workbench—is a suite of Web-based tools and REST ...
Metadata tensions: a case study of library principles vs. everyday scientific data practices
ASIS&T '10: Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem - Volume 47Data sharing requirements and mandates are becoming more common, and many institutions are investigating data curation methods. Metadata are a critical component to any institutional scientific data curation initiative. As libraries and other ...
CREAM: CREAting metadata for the Semantic Web
Special issue: The Semantic Web: an evolution for a revolutionRichly interlinked, machine-understandable data constitute the basis for the Semantic Web. We provide a framework, CREAM, that allows for creation of metadata. While the annotation mode of CREAM allows creation of metadata for existing Web pages, the ...
Comments