skip to main content
article

Reducing human interactions in Web directory searches

Published:01 October 2007Publication History
Skip Abstract Section

Abstract

Consider a website containing a collection of webpages with data such as in Yahoo or the Open Directory project. Each page is associated with a weight representing the frequency with which that page is accessed by users. In the tree hierarchy representation, accessing each page requires the user to travel along the path leading to it from the root. By enhancing the index tree with additional edges (hotlinks) one may reduce the access cost of the system. In other words, the hotlinks reduce the expected number of steps needed to reach a leaf page from the tree root, assuming that the user knows which hotlinks to take. The hotlink enhancement problem involves finding a set of hotlinks minimizing this cost.

This article proposes the first exact algorithm for the hotlink enhancement problem. This algorithm runs in polynomial time for trees with logarithmic depth. Experiments conducted with real data show that significant improvement in the expected number of accesses per search can be achieved in websites using this algorithm. These experiments also suggest that the simple and much faster heuristic proposed previously by Czyzowicz et al. [2003] creates hotlinks that are nearly optimal in the time savings they provide to the user.

The version of the hotlink enhancement problem in which the weight distribution on the leaves is unknown is discussed as well. We present a polynomial-time algorithm that is optimal for any tree for any depth.

References

  1. Armstrong, R., Freitag, D., Joachims, T., and Mitchell, T. 1995. WebWatcher: A learning apprentice for the World Wide Web. In Working Notes of the AAAI Spring Symposium: Information Gathering from Heterogeneous, Distributed Environments. Stanford, CA, AAAI Press, 6--12.Google ScholarGoogle Scholar
  2. Attardi, di Marco, S., and Salvi, D. 1998. Categorization by context. J. Universal Comput. Sci. 4, 9, 719--736.Google ScholarGoogle Scholar
  3. Bose, P., Krizanc, D., Langerman, S., and Morin, P. 2002. Asymmetric communication protocols via hotlink assignments. In Proceedings of the 9th Colloquium on Structural Information and Communication Complexity. 33--39.Google ScholarGoogle Scholar
  4. Bose, P., Czyzowicz, J., Gasieniec, L., Kranakis, E., Krizanc, D., Pelc, A., and Martin, M. V. 2000. Strategies for hotlink assignments. In Proceedings of the 11th International Symposium on Algorithms and Computation (ISAAC). 23--34. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Cyzyowicz, J., Kranakis, E., Krizanc, D., Pelc, A., and Vargas Martin, M. 2003. Enhancing hyperlink structure for improving web performance. J. Web Eng. 1, 2, 93--127.Google ScholarGoogle Scholar
  6. Dmoz. 2007. DMOZ website. www.dmoz.org.Google ScholarGoogle Scholar
  7. Fink, J., Kobasa, A., and Nill, A. 1996. User-oriented adaptivity and adaptability in the AVANTI project. In Designing for the Web: Empirical Studies. Microsoft Usability Group, Redmond, WA.Google ScholarGoogle Scholar
  8. Gerstel, O., Kutten, S., Matichin, R., and Peleg, D. 2003. Hotlink enhancement algorithms for web directories. In Proceedings of the International Symposium on Algorithms and Computation (ISAAC). 68--77.Google ScholarGoogle Scholar
  9. Glassman, S. 1994. A caching relay for the World Wide Web. In Proceedings of the 1st International World Wide Web Conference. 69--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Google. 2007. Google website. http://www.google.com/.Google ScholarGoogle Scholar
  11. Kranakis, E., Krizanc, D., and Shende, S. 2004. Approximating hotlink assignment. Inf. Proc. Lett. 90, 3, 121--128. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Matichin, R. and Peleg, D. 2003. Approximation algorithm for hotlink assignments in web directories. In Proceedings of the 8th Workshop on Algorithms and Data Structures. Ottawa, Canada, 271--280.Google ScholarGoogle Scholar
  13. Perkowitz, M. and Etzioni, O. 1999. Towards adaptive web sites: Conceptual framework and case study. In Proceedings of the 8th World Wide Web Conference. Google ScholarGoogle Scholar
  14. Pessoa, A., Laber, E., and Souza, C. 2004a. Efficient implementation of a hotlink assignment algorithm for web sites. In Proceedings of the Workshop on Algorithm Engineering and Experiments (ALENEX).Google ScholarGoogle Scholar
  15. Pessoa, A., Laber, E., and Souza, C. 2004b. Efficient algorithms for the hotlink assignment problem: The worst case search. In Proceedings of the International Symposium on Algorithms and Computation (ISAAC).Google ScholarGoogle Scholar
  16. Yahoo. 2007. Yahoo website. http://www.yahoo.com/.Google ScholarGoogle Scholar

Index Terms

  1. Reducing human interactions in Web directory searches

                Recommendations

                Comments

                Login options

                Check if you have access through your login credentials or your institution to get full access on this article.

                Sign in

                Full Access

                • Published in

                  cover image ACM Transactions on Information Systems
                  ACM Transactions on Information Systems  Volume 25, Issue 4
                  October 2007
                  159 pages
                  ISSN:1046-8188
                  EISSN:1558-2868
                  DOI:10.1145/1281485
                  Issue’s Table of Contents

                  Copyright © 2007 ACM

                  Publisher

                  Association for Computing Machinery

                  New York, NY, United States

                  Publication History

                  • Published: 1 October 2007
                  Published in tois Volume 25, Issue 4

                  Permissions

                  Request permissions about this article.

                  Request Permissions

                  Check for updates

                  Qualifiers

                  • article

                PDF Format

                View or Download as a PDF file.

                PDF

                eReader

                View online with eReader.

                eReader