Abstract
Search engines are among the most useful and high-profile resources on the Internet. The problem of finding information on the Internet has been replaced with the problem of knowing where search engines are, what they are designed to retrieve, and how to use them. This article describes and evaluates SavvySearch, a metasearch engine designed to intelligently select and interface with multiple remote search engines. The primary metasearch issue examined is the importance of carefully selecting and ranking remote search engines for user queries. We studied the efficacy of SavvySearch's incrementally acquired metaindex approach to selecting search engines by analyzing the effect of time and experience on performance. We also compared the metaindex approach to the simpler categorical approach and showed how much experience is required to surpass the simple scheme.
- BOWMAN, C. M., DANZIG, P. B., MANBER, U., AND SCHWARTZ, M.F. 1994. Scalable internet resource discovery: Research problems and approaches. Commun. ACM 37, 8 (Aug.). Google Scholar
- BOWMAN, C. M., DANZIG, P. B., MANBER, U., SCHWARTZ, M. F., HARDY, D. R., AND WESSELS, D. P. 1995. Harvest: A scalable, customizable discovery and access system. Tech. Rep., Univ. of Colorado, Boulder, Colo.Google Scholar
- DREILINGER, D. 1996. Description and evaluation of a meta-search agent. Master's thesis, Computer Science Dept., Colorado State Univ., Fort Collins, Colo.Google Scholar
- EICHMANN, D. 1994. Ethical web agents. In Electronic Proceedings of the 2nd World Wide Web Conference '94: Mosaic and the Web. Elsevier, London. Available as http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/Agents/eichmann.ethical/ethics.html.Google Scholar
- GAUCH, S., WANG, G., AND GOMEZ, M. 1996. Profusion: Intelligent fusion from multiple, different search engines. J. Univ. Comput. Sci. 2, 9 (Sept.).Google Scholar
- GRAVANO, L., GARC#A-MOLINA, H., AND TOMASIC, A. 1994. Precision and recall of GLOSS estimators for database discovery. In Proceedings of the 3rd International Conference on Parallel and Distributed Information Systems (PDIS'94). IEEE Computer Society, Washington, D.C. Google Scholar
- SALTON, G. 1989. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, Reading, Mass. Google Scholar
- SELBERG, E. AND ETZIONI, O. 1995. Multi-service search and comparison using the MetaCrawler. In Proceedings of the 4th International World Wide Web Conference.Google Scholar
- SHELDON, M. A., DUDA, A., WEISS, R., AND GIFFORD, D.K. 1995. Discover: A resource discovery system based on content routing. In Proceedings of the 3rd International World Wide Web Conference. Elsevier, North Holland, Amsterdam. Google Scholar
- WITTEN, I. H., MOFFAT, A., AND BELL, T.C. 1994. Managing Gigabytes: Compressing and Indexing Documents and Images. Von Nostrand Reinhold, New York. Google Scholar
- YAN, T. W. AND GARCIA-MOLINA, H. 1995. SIFT--A tool for wide-area information dissemination. In Proceedings of the 1995 USENIX Technical Conference. USENIX Assoc., Berkeley, Calif., 177-186. Google Scholar
- ZILBERSTEIN, S. 1995. An anytime computation approach to information gathering. In Working Notes of the AAAI Spring Symposium Series on Information Gathering from Distributed, Heterogeneous Environments. AAAI, Menlo Park, Calif.Google Scholar
Index Terms
- Experiences with selecting search engines using metasearch
Recommendations
Building efficient and effective metasearch engines
Frequently a user's information needs are stored in the databases of multiple search engines. It is inconvenient and inefficient for an ordinary user to invoke multiple search engines and identify useful documents from the returned results. To support ...
Re-ranking search results using query logs
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge managementThis work addresses two common problems in search, frequently occurring with underspecified user queries: the top-ranked results for such queries may not contain documents relevant to the user's search intent, and fresh and relevant pages may not get ...
Search Engine Optimization by Re-Ranking the Product Search Result Based on User Click Data
AISS '21: Proceedings of the 3rd International Conference on Advanced Information Science and SystemBlibli.com provides a search engine for its customers. It used Solr search engine with only plain BM25 similarity function which is based on probability. In order to improve search engine performance, this research tried to implement an algorithm that ...
Comments