Abstract
Traditional search engines return ranked lists of search results. It is up to the user to scroll through this list, scan individual documents, and assemble the information that fulfills their information need. Aggregated search represents a new class of approaches in which information is not only retrieved but also assembled. This is the current evolution in Web search, where diverse content (images, videos, etc.) and relational content (similar entities, features) are included in search results.
In this survey, we propose a simple analysis framework for aggregated search and give an overview of existing work. We start with work in related domains such as federated search, natural language generation, and question answering. We then focus on more recent trends, namely cross-vertical aggregated search and relational aggregated search, which are already present in current Web search.