ABSTRACT
In this paper, we describe Berkeley Prosopography Services (BPS), a new set of tools for prosopography - the identification of individuals and study of their interactions - in support of humanities research. Prosopography is an example of "big data" in the humanities, characterized not by the size of the datasets, but by the way that computational and data-driven methods can transform scholarly workflows. BPS is based upon re-usable infrastructure, supporting generalized web services for corpus management, social network analysis, and visualization. The BPS disambiguation model is a formal implementation of the traditional heuristics used by humanists, and supports plug-in rules for adaptation to a wide range of domain corpora. A workspace model supports exploratory research and collaboration. We contrast the BPS model of configurable heuristic rules to other approaches for automated text analysis, and explain how our model facilitates interpretation by humanist researchers. We describe the significance of the BPS assertion model in which researchers assert conclusions or possibilities, allowing them to override automated inference, to explore ideas in what-if scenarios, and to formally publish and subscribe-to asserted annotations among colleagues, and/or with students. We present an initial evaluation of researchers' experience using the tools to study corpora of cuneiform tablets, and describe plans to expand the application of the tools to a broader range of corpora.
- Ahmed, A. 2011. The Religious Elite of the Early Islamic Hijaz : Five Prosopographical Case Studies. Occasional Publications of the Oxford Unit for Prosopographical Research 14. Oxford: Unit for Prosopographical Research Linacre College University of Oxford.Google Scholar
- Booth, A. 2013. Brief Overview of Curating Lives: Museums, Archives, Online Sites, Autobiography, Biography, and Life Writing session, MLA Commons, Jan 5 2013. Available at: http://commons.mla.org/docs/a-brief-synopsis-of-curating-lives-mla-paper-alison-booth/Google Scholar
- Bostock, M., et al. 2011. D3: Data-Driven Documents, IEEE Trans. Visualization & Comp. Graphics (Proc. InfoVis), 2011. Google ScholarDigital Library
- Brandes, U., et al. 2000. The GraphML file format.Google Scholar
- Csillag, K. 2013. Fuzzy anchoring (blog post, April 22, 2013). Available at: http://hypothes.is/blog/fuzzy-anchoring.Google Scholar
- Elson, D., Dames, N., and McKeown, K. 2010, Extracting Social Networks from Literary Fiction, Proc. of 48th Annual Meeting of the Association for Computational Linguistics, pages 138--147, Uppsala, Sweden, 11-16 July 2010 Google ScholarDigital Library
- Fielding, R., and Taylor, R. 2002. Principled design of the modern Web architecture. ACM Trans. Internet Technol. 2,2 (May 2002) DOI=10.1145/514183.514185 Google ScholarDigital Library
- Gerritsen, A. 2008. Prosopography and its Potential for Middle Period Research, Journal of Song-Yuan Studies Volume 38, 2008 pp. 161--201.Google Scholar
- Higgins, S. 2011. Digital Curation: The Emergence of a New Discipline, International Journal of Digital Curation, 2011, Vol. 6, No. 2, pp. 78--88, doi:10.2218/ijdc.v6i2.191. http://ijdc.net/index.php/ijdc/article/view/184Google Scholar
- Mueller, M. 2011. Collaboratively Curating Early Modern English Texts. Contributed essay to Project Bamboo wiki. https://wikihub.berkeley.edu/x/QAdRB. Accessed June 2013.Google Scholar
- O'Madadhain, J., et al. 2005. Analysis and visualization of network data using JUNG. Journal of Statistical Software 10.2 (2005): 1--35.Google Scholar
- Pasin, M. 2012, Exploring Prosopographical Resources Through Novel Tools and Visualizations: a Preliminary Investigation, Digital Humanities 2012, Hamburg.Google Scholar
- Sanderson, R, et al. (eds.) Open Annotation Data Model, W3C Community Draft, Accessed July 2014. http://www.openannotation.org/spec/core/.Google Scholar
- Schmitz, P., and Pearce, L., 2013. Berkeley Prosopography Services: Ancient Families, Modern Tools, DH-Case 2013 (workshop), ACM Document Engineering 2013, Florence, Italy. Google ScholarDigital Library
- Szaley, A., 2012. Data-intensive discoveries in science: the fourth paradigm. Data-Intensive Distributed Computing 2012 (DIDC '12). DOI=10.1145/2286996.2286998 Google ScholarDigital Library
- TEI P5: Guidelines for Electronic Text Encoding and Interchange, Ch. 13 Names, Dates, People, and Places, V 2.3.0. Available at: http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ND.html (accessed April 2014).Google Scholar
- Waerzeggers, C. 2003-2004. The Babylonian Revolts Against Xerxes and the 'End of Archives', Archiv für Orientforschung 50: 150--173.Google Scholar
- Waerzeggers, C. 2013. Social Network Analysis of Cuneiform Archives: A New Approach. Proc. of the Second START Conference in Vienna (17-19th July 2008) Too Much Data? Generalizations and Model-building in Ancient Economic History on the Basis of Large Corpora of documentary Evidence, edited by H. D. Baker and Michael Jursa.Google Scholar
Index Terms
- Humanist-centric tools for big data: berkeley prosopography services
Recommendations
Berkeley prosopography services: ancient families, modern tools
DH-CASE '13: Proceedings of the 1st International Workshop on Collaborative Annotations in Shared Environment: metadata, vocabularies and techniques in the Digital HumanitiesIn this paper, we describe Berkeley Prosopography Services (BPS), a new set of tools for prosopography - the identification of individuals and study of their interactions - in support of humanities research. The BPS tools include 1) functionality to ...
A Brief Survey on Big Data in Healthcare
This article presents a brief introduction to big data and big data analytics and also their roles in the healthcare system. A definite range of scientific researches about big data analytics in the healthcare system have been reviewed. The definition ...
Big data exploration through visual analytics
VAST '12: Proceedings of the 2012 IEEE Conference on Visual Analytics Science and Technology (VAST)SAS® Visual Analytics Explorer is an advanced data visualization and exploratory data analysis application that is a component of the SAS Visual Analytics solution. It excels at handling big data problems like the VAST challenge. With a wide range of ...
Comments