skip to main content
10.5555/1698381.1698392dlproceedingsArticle/Chapter ViewAbstractPublication Pagesacl-ijcnlpConference Proceedingsconference-collections
research-article
Free Access

Stand-off TEI annotation: the case of the National Corpus of Polish

Published:06 August 2009Publication History

ABSTRACT

We present the annotation architecture of the National Corpus of Polish and discuss problems identified in the TEI stand-off annotation system, which, in its current version, is still very much unfinished and untested, due to both technical reasons (lack of tools implementing the TEI-defined XPointer schemes) and certain problems concerning data representation. We concentrate on two features that a stand-off system should possess and that are conspicuously missing in the current TEI Guidelines.

References

  1. Ide, N. and L. Romary. (2007). Towards International Standards for Language Resources. In Dybkjaer, L., Hemsen, H., Minker, W. (eds.), Evaluation of Text and Speech Systems, Springer, 263--84.Google ScholarGoogle ScholarCross RefCross Ref
  2. Przepiórkowski, A., R. L. Górski, B. Lewandowska-Tomaszczyk and M. Laziński. (2008). Towards the National Corpus of Polish. In the proceedings of the 6th Language Resources and Evaluation Conference (LREC 2008), Marrakesh, Morocco.Google ScholarGoogle Scholar
  3. TEI Consortium, eds. 2007. TEI P5: Guidelines for Electronic Text Encoding and Interchange. Version 1.2.0. Last updated on February 1st 2009. TEI Consortium.Google ScholarGoogle Scholar

Index Terms

  1. Stand-off TEI annotation: the case of the National Corpus of Polish

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image DL Hosted proceedings
        ACL-IJCNLP '09: Proceedings of the Third Linguistic Annotation Workshop
        August 2009
        203 pages
        ISBN:9781932432527

        Publisher

        Association for Computational Linguistics

        United States

        Publication History

        • Published: 6 August 2009

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader