Internet agents: spiders, wanderers, brokers, and bots (January 1996)
Publisher:
  • New Riders Publishing
  • Post Office Box 4846 Thousand Oaks, CA
  • United States
ISBN: 978-1-56205-463-2
Published: 01 January 1996
Pages: 413
Reviews

H. Van Dyke Parunak

The huge amount of information provided by the Internet is at once a blessing and a curse. The net can deliver information much more rapidly and at lower expense than older technologies, but the time needed to track it down in a jungle of possible sources can outweigh the benefits for a user who does not already know where to go. One solution to this problem is a computer program with the ability to search the Internet automatically on behalf of a human user. Such programs are commonly called “agents.” This use of the term, derived from expressions such as “travel agent” or “real-estate agent,” emphasizes the role of the program, which is to represent a human. It should not be confused with an alternative use, derived from the etymological root meaning “to act,” which denotes a software object with its own thread of control and its own initiative, but no necessary connection to the Internet or to a specific human user.
Cheong has distilled his extensive experience with Internet agents, both as a researcher and in the commercial world, to provide an accurate summary of this technology at a popular level. The book is organized into five parts. Part 1, an introduction, describes the foundation on which Internet agents are built. It defines agents as “personal software assistants with authority delegated from their users” and summarizes a number of pioneering and current research projects that are developing the underlying tools and demonstrating their potential benefits. This part also gives a brief history of the Internet and an overview of its operation, and outlines the structure of the World Wide Web, including a summary of its lingua franca: the Hypertext Transfer Protocol (HTTP) and the Hypertext Markup Language (HTML).
Part 2, “Web Robot Construction,” shows how a computer program can travel through the Web to gather information for its human master, describing several existing programs (or “spiders,” including Lycos, Harvest, and WebAnts) that crawl through the Web in order to index it. Automated programs can impose undesirable loads on Web servers that hinder access by their primary human audience, so this section lays down operational guidelines for Web robots, including four laws of Web robotics and six laws for robot operators. It provides a detailed summary of HTTP, showing how the protocol supports the recommended operational guidelines, and then outlines the architecture and operation of the WebWalker, an Internet agent that searches the Web for dangling pointers and reports them back to its master.
While much of the information available on the Web today is provided without charge to the user, the environment can only become self-sustaining if ways are found for people to pay for what they find valuable. Part 3, “Agents and Money on the Net,” discusses two important technologies to support commercial transactions on the Internet: security (which enables people to entrust proprietary information to the net), and electronic cash and payment services (which allow people to pay for a net-based service in the same environment in which they receive it).
Part 4, “Bots in Cyberspace,” describes several software denizens of the Internet that do not fall directly under the category of assistants to humans, but demonstrate some of the techniques and capabilities on which the earlier chapters draw. These include malicious applications that can travel over the network, such as worms and viruses, and programs such as Julia and Colin, which impersonate humans in online virtual worlds, also known as MUDs.
Part 5, “Appendices,” is the largest of the book's five parts. It includes the specifications for HTTP 1.0; Perl listings for two Web robots (the WebWalker from Part 2, and a WebShopper that compares prices for CDs and books offered by online stores); lists of online bookstores, CD shops, and MUD sites; and a list of Web spiders and robots. These appendices offer a wealth of detail, and for many readers will be the most frequently referenced section of the book. Ironically, the technology described in the book makes it less and less important to have lists such as these printed on paper and occupying space on a shelf, but users new to the Internet will find these lists an invaluable source of starting points. Unfortunately, but not surprisingly, some of the most important links printed in the book (such as that for Martijn Koster's list of active robots) are already out of date. The volume includes chapter-by-chapter bibliographies through 1995 and a comprehensive index.
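As a rough illustration of the dangling-pointer check that the review attributes to the WebWalker, the Python sketch below walks a set of pages breadth-first and reports links whose targets cannot be fetched. It is not the book's Perl listing. The `find_dangling_links` function, the `fetch` callable, and the in-memory `SITE` dictionary are all assumptions made for this example, with `SITE` standing in for real HTTP requests.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def find_dangling_links(start_url, fetch):
    """Walk pages breadth-first from start_url and return (page, target)
    pairs for every link whose target cannot be fetched.

    fetch(url) is a hypothetical callable: it returns the page's HTML as a
    string, or None when the URL does not resolve.
    """
    cache = {}  # url -> HTML or None, so each URL is fetched at most once

    def get(url):
        if url not in cache:
            cache[url] = fetch(url)
        return cache[url]

    dangling = []
    visited = {start_url}
    queue = deque([start_url])
    while queue:
        page = queue.popleft()
        html = get(page)
        if html is None:
            continue
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            # Resolve relative links and drop any #fragment.
            target, _fragment = urldefrag(urljoin(page, href))
            if get(target) is None:
                dangling.append((page, target))
            elif target not in visited:
                visited.add(target)
                queue.append(target)
    return dangling


# Toy in-memory "site" (an assumption for this sketch, not real data).
SITE = {
    "http://example.test/": '<a href="a.html">A</a> <a href="gone.html">Gone</a>',
    "http://example.test/a.html": '<a href="/">Home</a> <a href="missing.html">Missing</a>',
}

report = find_dangling_links("http://example.test/", SITE.get)
for page, target in report:
    print(f"{page} -> {target}")
```

Caching each lookup keeps the walker from requesting the same URL twice, in the spirit of the concern about server load that the review mentions. A real robot would replace `SITE.get` with an HTTP client and would also need to follow whatever operational guidelines apply to the sites it visits.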
