research-article

Aggregated search: A new information retrieval paradigm

Authors:
Arlind Kopliku, IRIT, University of Paul Sabatier, Toulouse, France
Karen Pinel-Sauvagnat, IRIT, University of Paul Sabatier, Toulouse, France
Mohand Boughanem, IRIT, University of Paul Sabatier, Toulouse, France

ACM Computing Surveys, Volume 46, Issue 3, Article No. 41, pp. 1–31. https://doi.org/10.1145/2523817

Published: 01 January 2014

Abstract

Traditional search engines return ranked lists of search results. It is up to the user to scroll through this list, scan within different documents, and assemble the information that fulfills his/her information need. Aggregated search represents a new class of approaches in which information is not only retrieved but also assembled. This is the current evolution in Web search, where diverse content (images, videos, etc.) and relational content (similar entities, features) are included in search results.

In this survey, we propose a simple analysis framework for aggregated search and an overview of existing work. We start with work in related domains such as federated search, natural language generation, and question answering. Then we focus on more recent trends, namely cross-vertical aggregated search and relational aggregated search, which are already present in current Web search.


Index Terms

1. Information systems
  1. Information retrieval

Published in

ACM Computing Surveys Volume 46, Issue 3
January 2014
507 pages
ISSN:0360-0300
EISSN:1557-7341
DOI:10.1145/2578702

Copyright © 2014 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 January 2014
- Accepted: 22 August 2013
- Revised: 27 May 2013
- Received: 4 January 2013
Author Tags
Information retrieval
aggregated search
focused retrieval
information extraction
relational search
result aggregation
vertical search
Qualifiers
- research-article
- Research
- Refereed