skip to main content
10.1145/2169095.2169102acmotherconferencesArticle/Chapter ViewAbstractPublication PagestempwebConference Proceedingsconference-collections
research-article

Identification of top relevant temporal expressions in documents

Published:17 April 2012Publication History

ABSTRACT

Temporal information is very common in textual documents, and thus, identifying, normalizing, and organizing temporal expressions is an important task in IR. Although there are some tools for temporal tagging, there is a lack in research focusing on the relevance of temporal expressions. Besides counting their frequency and verifying whether they satisfy a temporal search query, temporal expressions are often considered in isolation only. There are no methods to calculate the relevance of temporal expressions, neither in general nor with respect to a query.

In this paper, we present an approach to identify top relevant temporal expressions in documents using expression-, document-, corpus-, and query-based features. We present two relevance functions: one to calculate relevance scores for temporal expressions in general, and one with respect to a search query, which consists of a textual part, a temporal part, or both. Using two evaluation scenarios, we demonstrate the effectiveness of our approach.

References

  1. O. Alonso, M. Gertz, and R. A. Baeza-Yates. Temporal Analysis of Document Collections: Framework and Applications. In SPIRE'10, pages 290--296, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. O. Alonso, M. Gertz, and R. A. Baeza-Yates. Enhancing Document Snippets Using Temporal Information. In SPIRE'11, pages 26--31, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. O. Alonso, J. Strötgen, R. Baeza-Yates, and M. Gertz. Temporal Information Retrieval: Challenges and Opportunities. In TWAW'11, pages 1--8, 2011.Google ScholarGoogle Scholar
  4. I. Arikan, S. J. Bedathur, and K. Berberich. Time Will Tell: Leveraging Temporal Expressions in IR. In WSDM'09, 2009.Google ScholarGoogle Scholar
  5. K. Berberich, S. J. Bedathur, O. Alonso, and G. Weikum. A Language Modeling Approach for Temporal Information Needs. In ECIR'10, pages 13--25, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Y. Bestgen and W. Vonk. Temporal Adverbials as Segmentation Markers in Discourse Comprehension. Journal of Memory and Language, 42(1):74--87, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  7. H. Llorens, E. Saquete, and B. Navarro. TIPSem (English and Spanish): Evaluating CRFs and Semantic Roles in TempEval-2. In SemEval'10, pages 284--291, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Lucene. http://lucene.apache.org/.Google ScholarGoogle Scholar
  9. J. Makkonen, H. Ahonen-myka, and M. Salmenkivi. Topic Detection and Tracking with Spatio-Temporal Evidence. In ECIR'03, pages 251--265, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. C. man Au Yeung and A. Jatowt. Studying How the Past is Remembered: Towards Computational History through Large Scale Text Mining. In CIKM, pages 1231--1240, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. I. Mani, J. Pustejovsky, and R. Gaizauskas, editors. The Language of Time. Oxford University Press, 2005.Google ScholarGoogle Scholar
  12. C. D. Manning, P. Raghavan, and H. Schütze. Introduction to Information Retrieval. Cambridge University Press, New York, NY, USA, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. C. D. Manning and H. Schuetze. Foundations of Statistical Natural Language Processing. The MIT Press, 1 edition, June 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. Metzler, R. Jones, F. Peng, and R. Zhang. Improving Search Relevance for Implicitly Temporal Queries. In SIGIR '09, pages 700--701, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Nunes, C. Ribeiro, and G. David. Use of Temporal Expressions in Web Search. In ECIR'08, pages 580--584, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. OpenNLP. http://opennlp.sourceforge.net/.Google ScholarGoogle Scholar
  17. S.-T. Park, D. M. Pennock, C. L. Giles, and R. Krovetz. Analysis of Lexical Signatures for Improving Information Persistence on the World Wide Web. ACM Trans. Inf. Syst., 22(4):540--572, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. F. Schilder and C. Habel. From Temporal Expressions to Temporal Information: Semantic Tagging of News Messages. In Proceedings of the Workshop on Temporal and Spatial Information Processing, pages 65--72, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. M. Shokouhi. Detecting seasonal queries by time-series analysis. In SIGIR, pages 1171--1172, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Strötgen and M. Gertz. HeidelTime: High Quality Rule-based Extraction and Normalization of Temporal Expressions. In SemEval'10, 2010.Google ScholarGoogle Scholar
  21. J. Strötgen and M. Gertz. TimeTrails: A System for Exploring Spatio-Temporal Information in Documents. PVLDB, 3(2):1569--1572, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. J. Strötgen and M. Gertz. Multilingual and Cross-domain Temporal Tagging. Language Resources and Evaluation, accepted for journal publication, 2012.Google ScholarGoogle Scholar
  23. J. Strötgen, M. Gertz, and C. Junghans. An Event-centric Model for Multilingual Document Similarity. In SIGIR'11, pages 953--962, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. TimeML. http://www.timeml.org/.Google ScholarGoogle Scholar
  25. UIMA. http://uima.apache.org/.Google ScholarGoogle Scholar
  26. M. Verhagen, R. Gaizauskas, F. Schilder, M. Hepple, G. Katz, and J. Pustejovsky. SemEval-2007 Task 15: TempEval Temporal Relation Identification. In SemEval'07, pages 75--80, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Verhagen and J. Pustejovsky. Temporal Processing with the TARSQI Toolkit. In Coling 2008: Companion volume: Demonstrations, pages 189--192, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. M. Verhagen, R. Sauri, T. Caselli, and J. Pustejovsky. SemEval-2010 Task 13: TempEval-2. In SemEval'10, pages 57--62, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Wikipedia Featured Articles. http://en.wikipedia.org/wiki/Wikipedia: Featured_articles.Google ScholarGoogle Scholar

Index Terms

  1. Identification of top relevant temporal expressions in documents

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        TempWeb '12: Proceedings of the 2nd Temporal Web Analytics Workshop
        April 2012
        55 pages
        ISBN:9781450311885
        DOI:10.1145/2169095

        Copyright © 2012 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 17 April 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader