ABSTRACT
The paper introduces an open domain entity search system called ESearch, which aims at finding a list of relevant entities to an open domain entity search query (a natural language question). The system is built on top of a Wikipedia text corpus, as well as the structured DBPedia knowledge base. Entities are initially ranked by a model which effectively associates context matching (based on the contexts of entities in the unstructured text corpus) and category matching (based on the types of entities in the structured knowledge base). They are ranked further by a re-ranking component supported by blind feedback or user feedback on entities. We show that category matching is critical for the search performance and the re-ranking component can boost the performance largely. Category matching therefore needs some query entity types (especially specific entity types) as input. However, it is often hard for systems to detect specific entity types because users may not be familiar with how the types of desired entities are defined in the structured knowledge base. In ESearch, we design an effective ranking model of entity types to facilitate blind feedback and user feedback on desired entity types for category matching, so that users can effectively perform entity search without the need of explicitly providing any query entity types as inputs.
- K. Balog, M. Bron, and M. de Rijke. Category-based query modeling for entity search. In ECIR, pages 319--331, 2010. Google ScholarDigital Library
- C. Bizer, J. Lehmann, G. Kobilarov, S. Auer, C. Becker, R. Cyganiak, and S. Hellmann. Dbpedia - A crystallization point for the web of data. J. Web Sem., 7(3):154--165, 2009. Google ScholarDigital Library
- Y. Chen, L. Gao, S. Shi, X. Du, and J. Wen. Improving context and category matching for entity search. In AAAI, pages 16--22, 2014. Google ScholarDigital Library
- G. Demartini, T. Iofciu, and A. P. de Vries. Overview of the INEX 2009 entity ranking track. In Focused Retrieval and Evaluation, 8th International Workshop of INEX, pages 254--264, 2009. Google ScholarDigital Library
- Y. Fang, L. Si, Z. Yu, Y. Xian, and Y. Xu. Entity retrieval with hierarchical relevance model. In TREC, 2009.Google Scholar
- R. Kaptein and J. Kamps. Exploiting the category structure of wikipedia for entity ranking. Artif. Intell., 194, 2013. Google ScholarDigital Library
- S. Liang and M. de Rijke. Formal language models for finding groups of experts. Inf. Process. Manage., 52(4):529--549, 2016. Google ScholarDigital Library
- D. N. Milne and I. H. Witten. Learning to link with wikipedia. In CIKM, pages 509--518, 2008. Google ScholarDigital Library
- R. L. T. Santos, C. Macdonald, and I. Ounis. Voting for related entities. In RIAO, pages 1--8, 2010. Google ScholarDigital Library
- Z. Wang, H. Wang, and Z. Hu. Head, modifier, and constraint detection in short texts. In ICDE, pages 280--291, 2014. Google ScholarCross Ref
- G. Weikum. Search for knowledge. In SeCO Workshop, pages 24--39, 2009.Google Scholar
Index Terms
- ESearch: Incorporating Text Corpus and Structured Knowledge for Open Domain Entity Search
Recommendations
A search based approach to entity recognition: magnetic and IISAS team at ERD challenge
ERD '14: Proceedings of the first international workshop on Entity recognition & disambiguationERD 2014 was a research challenge focused on the task of recognition and disambiguation of knowledge base entities in short and long texts. This write-up describes Magnetic-IISAS team's approach to the entity recognition in search queries with which we ...
Concordance-based entity-oriented search
We consider the problem of finding relevant named entities in response to a search query over a given text corpus. Entity search can readily be used to augment conventional web search engines for a variety of applications. We use entity concordance ...
STICS: searching with strings, things, and cats
SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrievalThis paper describes an advanced search engine that supports users in querying documents by means of keywords, entities, and categories. Users simply type words, which are automatically mapped onto appropriate suggestions for entities and categories. ...
Comments