ABSTRACT
Despite the growing interest in NLP focused on the Brazilian Portuguese language in recent years, its obvious counterpart -- Natural Language Generation (NLG) -- remains in that case a little-explored research field. In this paper we describe preliminary results of a first project of this kind, addressing the issue of surface realization for Brazilian Portuguese. Our approach, which may be particularly suitable to simpler NLG applications in which a domain corpus of the most likely output sentences happens to be available, is in principle adaptable to many closely-related languages, and paves the way to further NLG research focused on Romance languages in general.
- }}Bangalore, S. and O. Rambow (2000) Corpus-based lexical choice in natural language generation. Proceedings of the 38th Meeting of the ACL, Hong Kong, pp. 464--471. Google ScholarDigital Library
- }}Bateman, J. A. (1997) Enabling technology for multilingual natural language generation: the KPML development environment. Natural Language Engineering, 3(1):15--55. Google ScholarDigital Library
- }}Bick, E. (2000) The parsing system PALAVRAS: automatic grammatical analysis of Portuguese in a constraint grammar framework. PhD Thesis, Aarhus University.Google Scholar
- }}DeVault, David, David Traum and Ron Arstein (2008) Practical Grammar-Based NLG from Examples. Proceedings of the 5th International Natural Language Generation Conference (INLG-2008) Columbus, USA. Google ScholarDigital Library
- }}Gatt, Albert and Ehud Reiter (2009) SimpleNLG: A realization engine for practical applications. Proceedings of the European Natural Language Generation workshop (ENLG-2009.) Google ScholarDigital Library
- }}Langkilde, Irene (2000) Forest-based statistical sentence generation. Proceedings of the 6th Applied Natural Language Processing Conference and 1st Meeting of the North American Chapter of the Association of Computational Linguistics (ANLP-NAACL'00), pp. 170--177. Google ScholarDigital Library
- }}Marciniak, T. and M. Strube (2005) Using an Annotated Corpus As a Knowledge Source For Language Generation. Proceedings of the Corpus Linguistics'05 Workshop Using Corpora for NLG (UNNLG-2005), pp. 19--24.Google Scholar
- }}McRoy, Susan, Songsak Channarukul and Syed S. Ali (2003) An augmented template-based approach to text realization. Natural Language Engineering 9 (4) pp. 381--420. Cambridge University Press. Google ScholarDigital Library
- }}Muniz, M. C., Laporte, E., Nunes, M. G. V (2005) UNITEX-PB, a set of flexible language resources for Brazilian Portuguese. Proceedings of the III Information and Language Technology Workshop (TIL-2005).Google Scholar
- }}Oh, A. and A. Rudnicky (2000) Stochastic language generation for spoken dialogue systems. Proceedings of the ANLP-NAACL 2000 Workshop on Conversational Systems, pp. 27--32. Google ScholarDigital Library
- }}Ratnaparkhi, A. (2000) Trainable methods for surface natural language generation. Proceedings of ANLP-NAACL 2000, pp. 194--201. Google ScholarDigital Library
- }}Reiter, E. (2007) An Architecture for Data-to-Text Systems. Proceedings of the European Natural Language Generation workshop (ENLG-2007), pp. 97--104. Google ScholarDigital Library
- }}Smets, M., M. Gamon, S. Corston-Oliver and E. Ringger (2003) French Amalgam: A machine-learned sentence realization system. Proceedings of the TALN-2003 Conference, Batz sur-Mer.Google Scholar
- }}van Deemter, K., Emiel Krahmer and Mariët Theune (2005) Real versus template-based NLG: a false opposition? Computational Linguistics 31(1). Google ScholarDigital Library
- }}Zhong, Huayan and A. J. Stent (2005) Building Surface Realizers Automatically from Corpora. Proceedings of the Corpus Linguistics'05 Workshop Using Corpora for NLG, pp. 49--54.Google Scholar
Recommendations
Bootstrapping a Lexicon of Multiword Adverbs for Brazilian Portuguese
Computational and Corpus-Based PhraseologyAbstractThis paper presents the process for bootstrapping a computational lexicon of multiword adverbs for Brazilian Portuguese (PT-BR) from an already existing lexicon built for the European variety of the language (PT-PT). This ongoing work aims to ...
Groundwork for the Development of the Brazilian Portuguese Wordnet
PorTAL '02: Proceedings of the Third International Conference on Advances in Natural Language ProcessingConsidering the Princeton WordNet built for English as a reference, new Wordnets in other languages are being built, such as the ones for European Portuguese, Galician, Basque, Catalan, and Spanish, just to mention some Iberian languages. In this paper ...
Verb Clustering for Brazilian Portuguese
CICLing 2014: Proceedings of the 15th International Conference on Computational Linguistics and Intelligent Text Processing - Volume 8403Levin-style classes which capture the shared syntax and semantics of verbs have proven useful for many Natural Language Processing NLP tasks and applications. However, lexical resources which provide information about such classes are only available for ...
Comments