ABSTRACT
In this paper, we describe a novel approach of extracting models from natural language text sources. This requires linguistic analysis as well as techniques for interpreting and using the analysis results. Our linguistic analysis engine provides feature analysis for a rule-based model element detection. Furthermore, the presented approach enables users to generate domain- and application-specific model element detection rules based on natural language sample sentences. Detection rules also have to be connected to instantiation rules for the respective type of model element. This is done through a highly system-supported mapping step where users are able to choose elements from arbitrary meta models and to connect their properties with functions over natural language sentence parts. As both, the definition and application of detection rules is always a sensitive balancing act between precision and recall, these steps are highly interactive. That is why our current prototype also supports detection rule adaption and iterative rule set completion -- always to the level of current need.
- Baader, F. and Küsters, R. 1998. Computing the least common subsumer and the most specific concept in the presence of cyclic ALN-concept descriptions. In KI-98: Advances in Artificial Intelligence, O. Herzog and A. Günter, Eds. LNCS. Springer Berlin Heidelberg, 129--140. Google ScholarDigital Library
- Borman, L. 1985. Human factors in computing systems. CHI '85 conference proceedings April 14-18 San Francisco. CHI Conference, New York, NY. Google Scholar
- Friedrich, F., Mendling, J., and Puhlmann, F. 2011. Process Model Generation from Natural Language Text. In Advanced Information Systems Engineering, Eds. Lecture Notes in Computer Science. Springer Berlin Heidelberg, Berlin, Heidelberg, 482--496. Google ScholarDigital Library
- Ghose, A., Koliadis, G., and Chueng, A. Process Discovery from Model and Text Artefacts. In 2007 IEEE Congress on Services (Services 2007), 167--174.Google ScholarCross Ref
- Hendrix, G. G. 1982. Natural-language interface. Comput. Linguist. 8, 2, 56--61. Google ScholarDigital Library
- Jablonski, S., Volz, B., and Dornstauder, S. A Meta Modeling Framework for Domain Specific Process Management. In 2008 32nd Annual IEEE International COMPSAC, 1011--1016. Google ScholarDigital Library
- Jurafsky, D. and Martin, J. H. 2009. Speech and language processing. An introduction to natural language processing, computational linguistics, and speech recognition. Prentice-Hall-series in artificial intelligence. Pearson Education International Prentice Hall Google ScholarDigital Library
- L. Ratinov and D. Roth. 2009. Design Challenges and Misconceptions in Named Entity Recognition. In CoNLL. Google ScholarDigital Library
- Marneffe, M.-C. de and Manning, C. D. 2008. The Stanford typed dependencies representation. In Coling 2008: Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation. CrossParser '08. ACL, Stroudsburg, PA, USA, 1--8. Google ScholarDigital Library
- Marta Fernandez and Caroline Eastman. 1990. Basic Taxonomic Structures and Levels of Abstraction. Advances in Classification Research Online 1, 1.Google Scholar
- Mohri, M., Rostamizadeh, A., and Talwalkar, A. 2012. Foundations of machine learning. Adaptive computation and machine learning. MIT Press, Cambridge, Mass. Google ScholarDigital Library
- Pollard, C. and Sag, I. A. 1994. Head-driven phrase structure grammar. Univ. of Chicago Press, Chicago, Ill.Google Scholar
- Pooley, L. D. Object-Oriented Analysis Using Natural Language Processing. CiteSeerX, DOI=10.1.1.60.1836.Google Scholar
- Sinha, A., Paradkar, A., Kumanan, P., and Boguraev, B. A linguistic analysis engine for natural language use case description and its application to dependability analysis in industrial use cases. In Networks (DSN), 327--336.Google Scholar
Index Terms
model[NL]generation: natural language model extraction
Recommendations
Analyzing word frequencies in large text corpora using inter-arrival times and bootstrapping
ECML PKDD'11: Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part IIComparing frequency counts over texts or corpora is an important task in many applications and scientific disciplines. Given a text corpus, we want to test a hypothesis, such as "word X is frequent", "word X has become more frequent over time", or "word ...
Analyzing word frequencies in large text corpora using inter-arrival times and bootstrapping
ECMLPKDD'11: Proceedings of the 2011th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part IIComparing frequency counts over texts or corpora is an important task in many applications and scientific disciplines. Given a text corpus, we want to test a hypothesis, such as "word X is frequent", "word X has become more frequent over time", or "word ...
Extracting definitions from brazilian legal texts
ICCSA'12: Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part IIIIn order to avoid ambiguity and to ensure, as far as possible, a strict interpretation of law, legal texts usually define the specific lexical terms used within their discourse by means of normative rules. With an often large amount of rules in effect ...
Comments