skip to main content
research-article

Aggregated search: A new information retrieval paradigm

Published:01 January 2014Publication History
Skip Abstract Section

Abstract

Traditional search engines return ranked lists of search results. It is up to the user to scroll this list, scan within different documents, and assemble information that fulfill his/her information need. Aggregated search represents a new class of approaches where the information is not only retrieved but also assembled. This is the current evolution in Web search, where diverse content (images, videos, etc.) and relational content (similar entities, features) are included in search results.

In this survey, we propose a simple analysis framework for aggregated search and an overview of existing work. We start with related work in related domains such as federated search, natural language generation, and question answering. Then we focus on more recent trends, namely cross vertical aggregated search and relational aggregated search, which are already present in current Web search.

References

  1. Pal Aditya and Kawale Jaya. 2008. Leveraging query association in federated search. In Proc. of SIGIR 2008 Workshop on Aggregated Search.Google ScholarGoogle Scholar
  2. Eugene Agichtein and Luis Gravano. 2000. Snowball: Extracting relations from large plain-text collections. In Proc. of the 5th ACM Conference on Digital Libraries. 85--94. http://dx.doi.org/10.1145/336597.336644 Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Enrique Alfonseca, Marius Pasca, and Enrique Robledo-Arnuncio. 2010. Acquisition of instance attributes via labeled and related instances. In Proc. of SIGIR 2010. 58--65. http://dx.doi.org/10.1145/1835449.1835462 Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Abdulrahman Almuhareb and Massimo Poesio. 2004. Attribute-based and value-based clustering: An evaluation. In In EMNLP 2004, ACL. 158--165.Google ScholarGoogle Scholar
  5. Jaime Arguello and Robert Capra. 2012. The effect of aggregated search coherence on search behavior. In Proc. of CIKM 2012. 1293--1302. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Jaime Arguello, Fernando Diaz, and Jamie Callan. 2011. Learning to aggregate vertical results into Web search results. In Proc. of CIKM 2011. 201--210. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jaime Arguello, Fernando Diaz, Jamie Callan, and Ben Carterette. 2011. A methodology for evaluating aggregated search results. In Proc. of ECIR 2011. 141--152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Jaime Arguello, Fernando Diaz, Jamie Callan, and Jean-Francois Crespo. 2009. Sources of evidence for vertical selection. In Proc. of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 315--322. DOI: http://dx.doi.org/10.1145/1571941.1571997 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Jaime Arguello, Fernando Diaz, and Jean-François Paiement. 2010. Vertical selection in the presence of unlabeled verticals. In Proc. of SIGIR 2010. 691--698. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Jaime Arguello, Fernando Diaz, and Milad Shokouhi. 2012. Integrating and ranking aggregated content on the Web. In Proc. WWW 2012.Google ScholarGoogle Scholar
  11. Jaime Arguello, Wan-Ching Wu, Diane Kelly, and Ashlee Edwards. 2012. Task complexity, vertical display and user interaction in aggregated search. In Proc. of SIGIR 2012. 435--444. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Yonatan Aumann, Ronen Feldman, Yair Liberzon, Benjamin Rosenfeld, and Jonathan Schler. 2006. Visual information extraction. Knowl. Inf. Syst. 10, 1 (July 2006), 1--15. DOI: http://dx.doi.org/10.1007/s10115-006-0014-x Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Thi Truong Avrahami, Lawrence Yau, Luo Si, and Jamie Callan. 2006. The FedLemur project: Federated search in the real world. JASIST 57, 3 (2006), 347--358. DOI: http://dx.doi.org/10.1002/asi.v57:3 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. K. Balog, A. de Vries, P. Serdyukov, P. Thomas, and T. Westerveld. 2009a. Overview of the TREC 2009 entity track. In Proc. of TREC 2009.Google ScholarGoogle Scholar
  15. K. Balog, A. P. de Vries, P. Serdyukov, P. Thomas, and T. Westerveld. 2009b. Overview of the TREC 2009 entity track. In TREC 2009 Working Notes. NIST.Google ScholarGoogle Scholar
  16. K. Balog, P. Serdyukov, and A. de Vries. 2010. Overview of the trec 2010 entity track. In Proc. of TREC 2010.Google ScholarGoogle Scholar
  17. K. Balog, P. Serdyukov, and A. de Vries. 2011. Overview of the trec 2010 entity track. In Proc. of TREC 2011.Google ScholarGoogle Scholar
  18. Michele Banko, Michael J. Cafarella, Stephen Soderland, Matt Broadhead, and Oren Etzioni. 2007. Open information extraction from the web. In Proc. of IJCAI 2007. 2670--2676. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Senjuti Basu Roy, Sihem Amer-Yahia, Ashish Chawla, Gautam Das, and Cong Yu. 2010. Constructing and exploring composite items. In Proc. of SIGMOD 2010. 843--854. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Mustapha Baziz, Mohand Boughanem, Yannick Loiseau, and Henri Prade. 2007. Fuzzy logic and ontology-based information retrieval. In Studies in Fuzziness and Soft Computing. Vol. 215/2007. 193--218.Google ScholarGoogle Scholar
  21. Kedar Bellare, Partha Pratim Talukdar, Giridhar Kumaran, O. Pereira, Mark Liberman, Andrew Mccallum, and Mark Dredze. 2007. Lightly-supervised attribute extraction for Web search. In Proc. of Machine Learning for Web Search Workshop, NIPS 2007.Google ScholarGoogle Scholar
  22. Ori Ben-Yitzhak, Nadav Golbandi, Nadav Har’El, Ronny Lempel, Andreas Neumann, Shila Ofek-Koifman, Dafna Sheinwald, Eugene Shekita, Benjamin Sznajder, and Sivan Yogev. 2008. Beyond basic faceted search. In Proc. of WSDM 2008. 33--44. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. C. Bizer, T. Heath, and T. Berners-Lee. 2009. Linked data—the story so far. International Journal Semantic Web and Information Systems 5, 3 (2009), 1--22.Google ScholarGoogle ScholarCross RefCross Ref
  24. Christian Bizer, Tom Heath, Kingsley Idehen, and Tim Berners-Lee. 2008. Linked data on the Web (LDOW2008). In Proc. of WWW 2008. 1265--1266. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Ourdia Bouidghaghen, Lynda Tamine, and Mohand Boughanem. 2009. Dynamically Personalizing Search Results for Mobile Users. In Proc. Flexible Query Answering (FQAS). 99--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Bert R. Boyce. 1982. Beyond topicality: A two stage view of relevance and the retrieval process. Inf. Process. Manage. 18, 3 (1982), 105--109.Google ScholarGoogle ScholarCross RefCross Ref
  27. Michael J. Cafarella, Michele Banko, and Oren Etzioni. 2006. Relational Web Search. Technical Report. University of Washington.Google ScholarGoogle Scholar
  28. Michael J. Cafarella, Alon Halevy, Daisy Zhe Wang, Eugene Wu, and Yang Zhang. 2008. WebTables: Exploring the power of tables on the Web. Proc. VLDB Endow. 1, 1 (2008), 538--549. DOI: http://dx.doi.org/10.1145/1453856.1453916 Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Michael J. Cafarella, Alon Y. Halevy, and Nodira Khoussainova. 2009. Data Integration for the Relational Web. PVLDB 2, 1 (2009), 1090--1101. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Jamie Callan. 2000. Distributed information retrieval. In Advances in Information Retrieval, W. Bruce Croft (Ed.). Kluwer Academic Publishers, Dordrecht, 235--266.Google ScholarGoogle Scholar
  31. S. Campinas, D. Ceccarelli, T. E. Perry, R. Delbru, K. Balog, and G. Tummarello. 2011. The Sindice-2011 dataset for entity-oriented search in the web of data. In 1st International Workshop on Entity-Oriented Search (EOS). 26--32.Google ScholarGoogle Scholar
  32. Chih-Chung Chang and Chih-Jen Lin. 2001. LIBSVM: A Library for Support Vector Machines. Technical Report.Google ScholarGoogle Scholar
  33. Hsin-Hsi Chen, Shih-Chung Tsai, and Jin-He Tsai. 2000. Mining tables from large scale HTML texts. In Proc. of COLING 2000. 166--172. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Kenneth Ward Church and Patrick Hanks. 1989. Word association norms, mutual information, and lexicography. In Proc. of ACL 1998. 76--83. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Charles L. A. Clarke, Maheedhar Kolla, Gordon V. Cormack, Olga Vechtomova, Azin Ashkan, Stefan Büttcher, and Ian MacKinnon. 2008. Novelty and diversity in information retrieval evaluation. In Proc. of SIGIR 2008. 659--666. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Valter Crescenzi, Giansalvatore Mecca, and Paolo Merialdo. 2001. RoadRunner: Towards Automatic Data Extraction from Large Web Sites. In Proc. of VLDB 2001. 109--118. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. H. T. Dang, D. Kelly, and J. Lin. 2007. Overview of the TREC 2007 Question Answering Track. In Proc. TREC 2007.Google ScholarGoogle Scholar
  38. Hoa Tang Dang. 2006. Overview of DUC 2006. In Proc. of the 2006 Document Understanding Conference.Google ScholarGoogle Scholar
  39. Gianluca Demartini, Tereza Iofciu, and Arjen P. Vries. 2010. Overview of the INEX 2009 Entity Ranking Track. In Focused Retrieval and Evaluation. 254--264. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Fernando Diaz. 2009a. Integration of news content into web results. In Proc. of WSDM 2009. 182--191. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Fernando Diaz. 2009b. Integration of news content into web results. In Proc. WSDM. 182--191. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Fernando Diaz and Jaime Arguello. 2009. Adaptation of offline vertical selection predictions in the presence of user feedback. In Proc. of SIGIR 2009. 323--330. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Anlei Dong, Yi Chang, Zhaohui Zheng, Gilad Mishne, Jing Bai, Ruiqiang Zhang, Karolina Buchner, Ciya Liao, and Fernando Diaz. 2010. Towards recency ranking in Web search. In Proc. of WSDM 2010. 11--20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Doug Downey, Oren Etzioni, and Stephen Soderland. 2005. A probabilistic model of redundancy in information extraction. In Proceedigns of IJCAI. 1034--1041. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Oren Etzioni, Michele Banko, Stephen Soderland, and Daniel S. Weld. 2008. Open information extraction from the web. Commun. ACM 51 (December 2008), 68--74. Issue 12. http://dx.doi.org/10.1145/1409360.1409378 Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Oren Etzioni, Michael Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, and Alexander Yates. 2005. Unsupervised named-entity extraction from the web: An experimental study. Artif. Intell. 165, 1 (2005), 91--134. DOI: http://dx.doi.org/10.1016/j.artint.2005.03.001 Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. O. Etzioni, A. Fader, J. Christensen, S. Soderland, and Mausam. 2011. Open information extraction: The second generation. In Proc. of IJCAI 2011, Barcelona, Spain. 3--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. John R. Frank, Max Kleiman-Weiner, Daniel A. Roberts, Feng Niu, Ce Zhang, Christopher Re, and Ian Soboroff. 2012. Building an entity-centric stream filtering test collection for TREC 2012. In Proc. of TREC 2012.Google ScholarGoogle Scholar
  49. Shlomo Geva, Jaap Kamps, and Andrew Trotman (Eds.). 2009. Proc. of INEX 2008.Google ScholarGoogle Scholar
  50. Jeremy Goecks. 2002. NuggetMine: Intelligent groupware for opportunistically sharing information nuggets. In Proc. of IUI 2002. 87--94. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Luis Gravano, Chen-Chuan K. Chang, Héctor García-Molina, and Andreas Paepcke. 1997. STARTS: Stanford proposal for Internet meta-searching. SIGMOD Rec. 26 (June 1997), 207--218. Issue 2. DOI: http://dx.doi.org/10.1145/253262.253299 Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. Ohad Greenshpan, Tova Milo, and Neoklis Polyzotis. 2009. Autocompletion for mashups. Proc. VLDB Endow. 2, 1 (2009), 538--549. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. H. P. Grice. 1975. Logic and conversation. In Syntax and Semantics: Vol. 3: Speech Acts, P. Cole and J. L. Morgan (Eds.). Academic Press, San Diego, CA, 41--58.Google ScholarGoogle Scholar
  54. Ralph Grishman and Beth Sundheim. 1996. Message Understanding Conference-6: A brief history. In Proc. of the 16th Conference on Computational Linguistics. 466--471. Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. A. Gulli and A. Signorini. 2005. Building an open source meta-search engine. In Proc. of WWW ’05: Special Interest Tracks and Posters. 1004--1005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Jiafeng Guo, Gu Xu, Xueqi Cheng, and Hang Li. 2009. Named entity recognition in query. In Proc. of SIGIR 2009. 267--274. Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. H. Halpin, D. M. Herzig, P. Mika, R. Blanco, J. Pound, H. S. Thompson, and D. T. Tran. 2010. Evaluating ad-hoc object retrieval. In Proc. of the International Workshop on Evaluation of Semantic Technologies (IWEST 2010)Google ScholarGoogle Scholar
  58. Marti A. Hearst. 1992. Automatic acquisition of hyponyms from large text corpora. In Proc. of the 14th Conference on Computational Linguistics. 539--545. Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Marti A. Hearst. 1998. Automated discovery of WordNet relations. In WordNet: An Electronic Lexical Database, C. Fellbaum (Ed.). MIT Press, 131--153. Retrieved from http://www.sims.berkeley.edu/hearst/papers/wordnet98.pdf.Google ScholarGoogle Scholar
  60. Marti A. Hearst and Jan O. Pedersen. 1996. Reexamining the cluster hypothesis: Scatter/gather on retrieval results. In SIGIR. 76--84. Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Sascha Hennig and Michael Wurst. 2006. Incremental clustering of newsgroup articles. In IEA/AIE. 332--341. Google ScholarGoogle ScholarDigital LibraryDigital Library
  62. L. Hirschman and R. Gaizauskas. 2001. Natural language question answering: The view from here. Natural Language Engineering 7, 4 (11 2001), 275--300. DOI: http://dx.doi.org/10.1017/S1351324901002807 Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Lei Ji, Jun Yan, Ning Liu, Wen Zhang, Weiguo Fan, and Zheng Chen. 2009. ExSearch: A novel vertical search engine for online barter business. In Proc. of CIKM 2009. 1357--1366. Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Christopher B. Jones and Ross S. Purves. 2009. Geographical information retrieval. In Encyclopedia of Database Systems. 1227--1231.Google ScholarGoogle Scholar
  65. Jaap Kamps, Shlomo Geva, and Andrew Trotman. 2008. Report on the SIGIR 2008 workshop on focused retrieval. SIGIR Forum 42, 2 (2008), 59--65. DOI: http://dx.doi.org/10.1145/1480506.1480517 Google ScholarGoogle ScholarDigital LibraryDigital Library
  66. Rianne Kaptein and Maarten Marx. 2010. Focused retrieval and result aggregation with political data. Inf. Retr. 13, 5 (October 2010), 412--433. DOI: http://dx.doi.org/10.1007/s10791-010-9130-z Google ScholarGoogle ScholarDigital LibraryDigital Library
  67. Makoto P. Kato, Hiroaki Ohshima, Satoshi Oyama, and Katsumi Tanaka. 2009. Query by analogical example: Relational search using Web search engine indices. In Proc. of CIKM 2009. 27--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  68. B. Katz, G. Borchardt, and S. Felshin. 2005. Syntactic and semantic decomposition strategies for question answering from multiple resources. In Proc. of the AAAI 2005 Workshop on Inference for Textual Question Answering. Pittsburgh, Pennsylvania, USA.Google ScholarGoogle Scholar
  69. Diane Kelly and Jimmy Lin. 2006. Overview of the TREC 2006 question answering task. In Proc. of the Text REtrieval Conference 2006.Google ScholarGoogle Scholar
  70. Diane Kelly and Jimmy Lin. 2007. Overview of the TREC 2007 question answering task. In Proc. of the Text REtrieval Conference 2007.Google ScholarGoogle Scholar
  71. Lyndon S. Kennedy and Mor Naaman. 2008. Generating diverse and representative image search results for landmarks. In Proc. of WWW 2008. 297--306. Google ScholarGoogle ScholarDigital LibraryDigital Library
  72. Arlind Kopliku. 2011. Approaches to Implement and Evaluate Aggregated Search. Thèse de doctorat. Université Paul Sabatier, Toulouse, France.Google ScholarGoogle Scholar
  73. Arlind Kopliku, Mohand Boughanem, and Karen Pinel-Sauvagnat. 2011a. Mining the Web for lists of named entities. In Proc. of CORIA 2011. 113--120.Google ScholarGoogle Scholar
  74. Arlind Kopliku, Mohand Boughanem, and Karen Pinel-Sauvagnat. 2011b. Towards a framework for attribute retrieval. In Proc. of CIKM 2011. 515--524. Google ScholarGoogle ScholarDigital LibraryDigital Library
  75. Arlind Kopliku, Firas Damak, Karen Pinel-Sauvagnat, and Mohand Boughanem. 2011c. Interest and evaluation of aggregated search. In Proc. of IEEE/WIC/ACM International Conference on Web Intelligence, Lyon, France. 154--161. Google ScholarGoogle ScholarDigital LibraryDigital Library
  76. Arlind Kopliku, Karen Pinel-Sauvagnat, and Mohand Boughanem. 2009. Aggregated Search: Potential, Issues and Evaluation. Technical Report. Institut de Recherche en Informatique de Toulouse, France.Google ScholarGoogle Scholar
  77. Arlind Kopliku, Karen Pinel-Sauvagnat, and Mohand Boughanem. 2011d. Attribute Retrieval from Relational Web tables. In Proc. of SPIRE 2011. 117--128. Google ScholarGoogle ScholarDigital LibraryDigital Library
  78. Arlind Kopliku, Karen Pinel-Sauvagnat, and Mohand Boughanem. 2011e. Retrieving attributes using Web tables. In Proc. of JDCL 2011. 397--398. Google ScholarGoogle ScholarDigital LibraryDigital Library
  79. Arlind Kopliku, Paul Thomas, Stephen Wan, and Cecile Paris. 2013. Filtering and ranking for social media monitoring. In Proc. of CORIA 2013.Google ScholarGoogle Scholar
  80. Ines Krichen, Arlind Kopliku, Karen Pinel-Sauvagnat, and Mohand Boughanem. 2011. Une approche de recherche d’attributs pertinents pour l’agrégation d’information. In Proc. of INFORSID 2009. 385--400.Google ScholarGoogle Scholar
  81. Mounia Lalmas. 2011. Advanced topics on information retrieval. Springer, Chapter Aggregated search.Google ScholarGoogle Scholar
  82. Maurizio Lenzerini. 2002. Data integration: A theoretical perspective. In Proc. of PODS 2002. 233--246. Google ScholarGoogle ScholarDigital LibraryDigital Library
  83. Xiao Li, Ye-Yi Wang, and Alex Acero. 2008. Learning query intent from regularized click graphs. In Proc. of SIGIR 2008. 339--346. Google ScholarGoogle ScholarDigital LibraryDigital Library
  84. Girija Limaye, Sunita Sarawagi, and Soumen Chakrabarti. 2010. Annotating and searching Web tables using entities, types and relationships. Proc. VLDB Endow. 3, 1--2 (September 2010), 1338--1347. Google ScholarGoogle ScholarDigital LibraryDigital Library
  85. C.-J. Lin and R.-R. Liu. 2008. An analysis of multi-focus questions. In Proc. of SIGIR Workshop on Focused Retrieval.Google ScholarGoogle Scholar
  86. Thomas Lin and Oren Etzioni. 2010. Identifying functional relations in web text. In Proc. of EMNLP 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  87. Ning Liu, Jun Yan, and Zheng Chen. 2009. A probabilistic model based approach for blended search. In Proc. of WWW 2009. 1075--1076. Google ScholarGoogle ScholarDigital LibraryDigital Library
  88. Craig Macdonald. 2009. The voting model for people search. SIGIR Forum 43, 1 (June 2009), 73. DOI: http://dx.doi.org/10.1145/1670598.1670616 Google ScholarGoogle ScholarDigital LibraryDigital Library
  89. Christopher Manning, Prabhakar Raghavan, and Heinrich Schütze. 2008. Introduction to Information Retrieval. Cambridge University Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  90. M. Manoj and Elisabeth Jacob. 2008. Information retrieval on Internet using meta-search engines: A review. Journal of Scientific & Industrial Research 67, 10 (2008), 739--746.Google ScholarGoogle Scholar
  91. Kevin S. McCurley. 2001. Geospatial mapping and navigation of the web. In Proc. of the 10th International Conference on WWW. ACM, New York, NY, 221--229. DOI: http://dx.doi.org/10.1145/371920.372056 Google ScholarGoogle ScholarDigital LibraryDigital Library
  92. Xiaofeng Meng, Haiyan Wang, Dongdong Hu, and Chen Li. 2003. A supervised visual wrapper generator for Web-data extraction. In Proc. of the 27th Annual International Conference on Computer Software and Applications(COMPSAC ’03). IEEE Computer Society, Washington, DC, 657. Google ScholarGoogle ScholarDigital LibraryDigital Library
  93. Véronique Moriceau and Xavier Tannier. 2010. FIDJI: Using syntax for validating answers in multiple documents. Inf. Retr. 13, 5 (October 2010), 507--533. DOI: http://dx.doi.org/10.1007/s10791-010-9131-y Google ScholarGoogle ScholarDigital LibraryDigital Library
  94. Vanessa Murdock and Mounia Lalmas. 2008. Workshop on aggregated search. SIGIR Forum 42, 2 (2008), 80--83. DOI: http://dx.doi.org/10.1145/1480506.1480520 Google ScholarGoogle ScholarDigital LibraryDigital Library
  95. Mor Naaman, Yee Jiun Song, Andreas Paepcke, and Hector Garcia-Molina. 2006. Assigning textual names to sets of geographic coordinates. Computers, Environment and Urban Systems 30, 4 (2006), 418--435.Google ScholarGoogle ScholarCross RefCross Ref
  96. Zaiqing Nie, Yunxiao Ma, Shuming Shi, Ji-Rong Wen, and Wei-Ying Ma. 2007a. Web object retrieval. In Proc. of WWW 2007. 81--90. Google ScholarGoogle ScholarDigital LibraryDigital Library
  97. Zaiqing Nie, Ji-Rong Wen, and Wei-Ying Ma. 2007b. Object-level vertical search. In Proc. of CIDR. 235--246.Google ScholarGoogle Scholar
  98. Shiyan Ou and Christopher S. G. Khoo. 2008. Aggregating search results for social science by extracting and organizing research concepts and relations. In SIGIR 2008 Workshop on Aggregated Search.Google ScholarGoogle Scholar
  99. C. Paris and N. Colineau. 2006. Scifly: Tailored Corporate Brochures on Demand. Technical Report. CSIRO ICT Centre.Google ScholarGoogle Scholar
  100. C. Paris, A. Lampert, S. Lu, and M. Wu. 2005. Enhancing Dynamic Knowledge Management Services—Tailored Documents. Technical Report 05/034, Commercial-in-Confidence. CSIRO ICT Centre.Google ScholarGoogle Scholar
  101. Cécile Paris, Stephen Wan, and Paul Thomas. 2010. Focused and aggregated search: A perspective from natural language generation. Information Retrieval Journal 44, 3 (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  102. Cécile Paris, Stephen Wan, Ross Wilkinson, and Mingfang Wu. 2001. Generating personal travel guides—and who wants them? In Proc. User Modeling 2001. 251--253. Google ScholarGoogle ScholarDigital LibraryDigital Library
  103. Cécile L. Paris. 1988. Tailoring object descriptions to a user’s level of expertise. Comput. Linguist. 14, 3 (1988), 64--78. Google ScholarGoogle ScholarDigital LibraryDigital Library
  104. Marius Pasca and Benjamin Van Durme. 2008. Weakly-supervised acquisition of open-domain classes and class attributes from Web documents and query logs. In ACL. 19--27.Google ScholarGoogle Scholar
  105. Ashok Kumar Ponnuswami, Kumaresh Pattabiraman, Qiang Wu, Ran Gilad-Bachrach, and Tapas Kanungo. 2011. On composition of a federated web search result page: Using online users to provide pairwise preference for heterogeneous verticals. In Proc. of WSDM 2011. 715--724. Google ScholarGoogle ScholarDigital LibraryDigital Library
  106. Jay M. Ponte and W. Bruce Croft. 1998. A language modeling approach to information retrieval. In Proc. of SIGIR 1998. 275--281. Google ScholarGoogle ScholarDigital LibraryDigital Library
  107. Ana-Maria Popescu and Oren Etzioni. 2005. Extracting product features and opinions from reviews. In Proc. of HLT 2008. 339--346. Google ScholarGoogle ScholarDigital LibraryDigital Library
  108. Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, and Grant Weddell. 2012. Interpreting keyword queries over web knowledge bases. In Proc. of CIKM 2012. 305--314. Google ScholarGoogle ScholarDigital LibraryDigital Library
  109. Anand Ranganathan, Anton Riabov, and Octavian Udrea. 2009. Mashup-based information retrieval for domain experts. In Proc. of CIKM 2009. 711--720. Google ScholarGoogle ScholarDigital LibraryDigital Library
  110. Stephen E. Robertson and Steve Walker. 1994. Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval. In Proc. of SIGIR 1994 (Special Issue of the SIGIR Forum), W. Bruce Croft and C. J. van Rijsbergen (Eds.). 232--241. Google ScholarGoogle ScholarDigital LibraryDigital Library
  111. Cyril Rohr and Dian Tjondronegoro. 2008. Aggregated cross-media news visualization and personalization. In Proc. of MIR 2008. 371--378. Google ScholarGoogle ScholarDigital LibraryDigital Library
  112. Nachiketa Sahoo, Jamie Callan, Ramayya Krishnan, George Duncan, and Rema Padman. 2006. Incremental hierarchical clustering of text documents. In Proc. of CIKM 2006. 357--366. Google ScholarGoogle ScholarDigital LibraryDigital Library
  113. G. Salton, A. Wong, and C. S. Yang. 1975. A vector space model for automatic indexing. Commun. ACM 18 (November 1975), 613--620. Issue 11. DOI: http://dx.doi.org/10.1145/361219.361220 Google ScholarGoogle ScholarDigital LibraryDigital Library
  114. Rodrygo L. T. Santos, Craig Macdonald, and Iadh Ounis. 2011. Aggregated search result diversification. In Proc. of the 3rd International Conference on the Theory of Information Retrieval. Springer, Bertinoro, Italy. Google ScholarGoogle ScholarDigital LibraryDigital Library
  115. Christina Sauper and Regina Barzilay. 2009. Automatically generating Wikipedia articles: A structure-aware approach. In Proc. of ACL-IJCNLP. 208--216. Google ScholarGoogle ScholarDigital LibraryDigital Library
  116. Karen Sauvagnat, Mohand Boughanem, and Claude Chrisment. 2006. Answering content-and-structure-based queries on XML documents using relevance propagation. Information Systems, Special Issue SPIRE 2004 31 (2006), 621--635. Google ScholarGoogle ScholarDigital LibraryDigital Library
  117. Satoshi Sekine, Kiyoshi Sudo, and Chikashi Nobata. 2002. Extended named entity hierarchy. In Proc. of LREC 2002.Google ScholarGoogle Scholar
  118. Erik Selberg and Oren Etzioni. 1995. Multi-service search and comparison using the MetaCrawler. In Proc. of the 4th International World Wide Web Conference. 195--208.Google ScholarGoogle Scholar
  119. Semantic. 2010. Semantic Search Challenge 2010. Retrieved April 2013 from http://km.aifb.kit.edu/ws/semsearch10/.Google ScholarGoogle Scholar
  120. Semantic. 2011. Semantic Search Challenge 2011. Retrieved April 2013 from http://semsearch.yahoo.com/.Google ScholarGoogle Scholar
  121. Milad Shokouhi, Justin Zobel, Saied Tahaghoghi, and Falk Scholer. 2007. Using query logs to establish vocabularies in distributed information retrieval. Inf. Process. Manage. 43, 1 (January 2007), 169--180. DOI: http://dx.doi.org/10.1016/j.ipm.2006.04.003 Google ScholarGoogle ScholarDigital LibraryDigital Library
  122. Amit Singhal. 2012. Introducing the Knowledge Graph: Things, Not Strings. Official Blog of Google Retrieved April 2013 from http://googleblog.blogspot.co.uk/2012/05/introducing-knowledge-graph-things-not.html.Google ScholarGoogle Scholar
  123. Karen Spärck-Jones, Stephen E. Robertson, and Mark Sanderson. 2007. Ambiguous requests: Implications for retrieval tests, systems and theories. SIGIR Forum 41, 2 (2007), 8--17. http://dx.doi.org/10.1145/1328964.1328965 Google ScholarGoogle ScholarDigital LibraryDigital Library
  124. K. Srinivas, P. V. S. Srinivas, and A. Govardhan. 2011. A survey on the performance evaluation of various meta search engines. Int. Journal of Computer Science Issues 8, 3 (2011), 359--364.Google ScholarGoogle Scholar
  125. Andreas Strotmann and Dangzhi Zhao. 2008. Bibliometric maps for aggregated visual browsing in digital libraries. In Proc. of the SIGIR 2008 Workshop on Aggregated Search.Google ScholarGoogle Scholar
  126. Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2007. YAGO: A core of semantic knowledge. In Proc. of WWW 2007. 697--706. Google ScholarGoogle ScholarDigital LibraryDigital Library
  127. Fabian M. Suchanek, Gjergji Kasneci, and Gerhard Weikum. 2008. YAGO: A large ontology from Wikipedia and WordNet. Web Semant. 6, 3 (2008), 203--217. DOI: http://dx.doi.org/10.1016/j.websem.2008.06.001 Google ScholarGoogle ScholarDigital LibraryDigital Library
  128. Fabian M. Suchanek, Mauro Sozio, and Gerhard Weikum. 2009. SOFIE: A self-organizing framework for information extraction. In Proc. of the WWW Conference. 631--640. Google ScholarGoogle ScholarDigital LibraryDigital Library
  129. Shanu Sushmita, Hideo Joho, and Mounia Lalmas. 2009. A task-based evaluation of an aggregated search interface. In Proc. of SPIRE 2009. 322--333. Google ScholarGoogle ScholarDigital LibraryDigital Library
  130. Shanu Sushmita, Hideo Joho, Mounia Lalmas, and Robert Villa. 2010. Factors affecting click-through behavior in aggregated search interfaces. In Proc. of CIKM 2010. 519--528. Google ScholarGoogle ScholarDigital LibraryDigital Library
  131. TAC. 2011. Proc. of the 4th Text Analysis Conference. National Institute of Standards and Technology, Gaithersburg, MD.Google ScholarGoogle Scholar
  132. Bilyana Taneva, Mouna Kacimi, and Gerhard Weikum. 2010. Gathering and ranking photos of named entities with high precision, high recall, and diversity. In Proc. of WSDM 2010. 431--440. Google ScholarGoogle ScholarDigital LibraryDigital Library
  133. Kosuke Tokunaga and Kentaro Torisawa. 2005. Automatic discovery of attribute words from web documents. In Proc. of the 2nd International Joint Conference on Natural Language Processing (IJCNLP-05). 106--118. Google ScholarGoogle ScholarDigital LibraryDigital Library
  134. T. Tran, H. Wang, S. Rudolph, and P. Cimiano. 2009. Top-k exploration of query candidates for efficient keyword search on graph-shaped (rdf) data. In Proc. of ICDE 2009. 405--416. Google ScholarGoogle ScholarDigital LibraryDigital Library
  135. Andrew Trotman, Shlomo Geva, Jaap Kamps, Mounia Lalmas, and Vanessa Murdock. 2010. Current research in focused retrieval and result aggregation. Journal of Information Retrieval (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  136. Peter D. Turney. 2001. Mining the web for synonyms: PMI-IR versus LSA on TOEFL. In Proc. of EMCL 2001. 491--502. Google ScholarGoogle ScholarDigital LibraryDigital Library
  137. Subodh Vaid, Christopher B. Jones, Hideo Joho, and Mark Sanderson. 2005. Spatio-textual indexing for geographical search on the web. In Proc. of SSTD. 218--235. Google ScholarGoogle ScholarDigital LibraryDigital Library
  138. David Vallet and Hugo Zaragoza. 2008. Inferring the most important types of a query: A semantic approach. In Proc. of SIGIR 2008e, Singapore. 857--858. Google ScholarGoogle ScholarDigital LibraryDigital Library
  139. Ellen M. Voorhees. 2003. Evaluating answers to definition questions. In Proceedigns of NAACL’03. 109--111. Google ScholarGoogle ScholarDigital LibraryDigital Library
  140. Tak-Lam Wong and Wai Lam. 2004. A probabilistic approach for adapting information extraction wrappers and discovering new attributes. In Proc. of ICDM 2004. 257--264. Google ScholarGoogle ScholarDigital LibraryDigital Library
  141. Tak-Lam Wong and Wai Lam. 2009. An unsupervised method for joint information extraction and feature mining across different Web sites. Data Knowl. Eng. 68, 1 (January 2009), 107--125. DOI: http://dx.doi.org/10.1016/j.datak.2008.08.009 Google ScholarGoogle ScholarDigital LibraryDigital Library
  142. Fei Wu, Raphael Hoffmann, and Daniel S. Weld. 2008. Information extraction from Wikipedia: Moving down the long tail. In Proc. of KDD 2008. 731--739. Google ScholarGoogle ScholarDigital LibraryDigital Library
  143. M. Wu and M. Fuller. 1997. Supporting the Answering Process. In Proc. of the 2nd Australian Document Computing Symposium. 65--73.Google ScholarGoogle Scholar
  144. Naoki Yoshinaga and Kentaro Torisawa. 2007. Open-domain attribute-value acquisition from semi-structured texts. In Proc. of the Workshop on Ontolex. 55--66.Google ScholarGoogle Scholar
  145. K. Zhou, R. Cummins, M. Lalmas, and J. M. Jose. 2011. Evaluating large-scale distributed vertical search. In Proc. of LSDS-IR Workshop in CIKM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  146. Ke Zhou, Ronan Cummins, Mounia Lalmas, and Joemon M. Jose. 2012. Evaluating aggregated search pages. In Proc. of SIGIR 2012. 115--124. Google ScholarGoogle ScholarDigital LibraryDigital Library
  147. J. Zhu, Z. Nie, X. Liu, B. Zhang, and J.-R. Wen. 2009. Statsnowball: a statistical approach to extracting entity relationships. In Proc. of WWW 2009, Madrid, Spain. 101--110. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Aggregated search: A new information retrieval paradigm

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Computing Surveys
      ACM Computing Surveys  Volume 46, Issue 3
      January 2014
      507 pages
      ISSN:0360-0300
      EISSN:1557-7341
      DOI:10.1145/2578702
      Issue’s Table of Contents

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 January 2014
      • Accepted: 22 August 2013
      • Revised: 27 May 2013
      • Received: 4 January 2013
      Published in csur Volume 46, Issue 3

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader