skip to main content
Skip header Section
Programming collective intelligenceAugust 2007
Publisher:
  • O'Reilly
ISBN:978-0-596-52932-1
Published:16 August 2007
Pages:
360
Skip Bibliometrics Section
Bibliometrics
Skip Abstract Section
Abstract

Want to tap the power behind search rankings, product recommendations, social bookmarking, and online matchmaking? This fascinating book demonstrates how you can build Web 2.0 applications to mine the enormous amount of data created by people on the Internet. With the sophisticated algorithms in this book, you can write smart programs to access interesting datasets from other web sites, collect data from users of your own applications, and analyze and understand the data once you've found it. Programming Collective Intelligence takes you into the world of machine learning and statistics, and explains how to draw conclusions about user experience, marketing, personal tastes, and human behavior in general -- all from information that you and others collect every day. Each algorithm is described clearly and concisely with code that can immediately be used on your web site, blog, Wiki, or specialized application. This book explains: Collaborative filtering techniques that enable online retailers to recommend products or mediaMethods of clustering to detect groups of similar items in a large datasetSearch engine features -- crawlers, indexers, query engines, and the PageRank algorithmOptimization algorithms that search millions of possible solutions to a problem and choose the best oneBayesian filtering, used in spam filters for classifying documents based on word types and other featuresUsing decision trees not only to make predictions, but to model the way decisions are madePredicting numerical values rather than classifications to build price modelsSupport vector machines to match people in online dating sitesNon-negative matrix factorization to find the independent features in a datasetEvolving intelligence for problem solving -- how a computer develops its skill by improving its own code the more it plays a game Each chapter includes exercises for extending the algorithms to make them more powerful. Go beyond simple database-backed applications and put the wealth of Internet data to work for you. "Bravo! I cannot think of a better way for a developer to first learn these algorithms and methods, nor can I think of a better way for me (an old AI dog) to reinvigorate my knowledge of the details." -- Dan Russell, Google "Toby's book does a great job of breaking down the complex subject matter of machine-learning algorithms into practical, easy-to-understand examples that can be directly applied to analysis of social interaction across the Web today. If I had this book two years ago, it would have saved precious time going down some fruitless paths." -- Tim Wolters, CTO, Collective Intellect

Cited By

  1. d’Hondt J, Minartz K and Papapetrou O (2024). Efficient detection of multivariate correlations with different correlation measures, The VLDB Journal — The International Journal on Very Large Data Bases, 33:2, (481-505), Online publication date: 1-Mar-2024.
  2. ACM
    Butt S, Suran S, Pappel I, Smærup M, Krimmer R and Draheim D (2023). A Digital Collaborative Platform for the Silver Economy: Functionalities Required by Stakeholders in a Multinational Baltic Sea Region Project, Digital Government: Research and Practice, 4:2, (1-20), Online publication date: 30-Jun-2023.
  3. ACM
    Alavizadeh H, Jang-Jaccard J, Enoch S, Al-Sahaf H, Welch I, Camtepe S and Kim D (2022). A Survey on Cyber Situation-awareness Systems: Framework, Techniques, and Insights, ACM Computing Surveys, 55:5, (1-37), Online publication date: 30-Jun-2023.
  4. Lo A and Zhang R (2022). The wisdom of crowds versus the madness of mobs: An evolutionary model of bias, polarization, and other challenges to collective intelligence, Collective Intelligence, 1:1, Online publication date: 1-Aug-2022.
  5. Yaghtin M, Sotudeh H, Mirzabeigi M, Fakhrahmad S and Mohammadi M (2019). In quest of new document relations, Scientometrics, 119:2, (987-1008), Online publication date: 1-May-2019.
  6. ACM
    Kim D, Lee J, Choi D, Choi J and Kang J Learning User Preferences and Understanding Calendar Contexts for Event Scheduling Proceedings of the 27th ACM International Conference on Information and Knowledge Management, (337-346)
  7. ACM
    Utku A, Aydogan E, Mutlu B and Akcayol M A New Recommender System Based on Multiple Parameters and Extended User Behavior Analysis Proceedings of the 9th International Conference on Information Management and Engineering, (133-138)
  8. ACM
    Yue Z, Jiang Y, Pan D and Luo Z An End-to-end Tag-based Recommendation System for Verbal Reasoning Questions Proceedings of the 10th EAI International Conference on Simulation Tools and Techniques, (131-135)
  9. ACM
    Shi X, Cui B, Dobbie G and Ooi B (2016). UniAD, ACM Transactions on Database Systems, 42:1, (1-42), Online publication date: 2-Mar-2017.
  10. Quintana-Amate S, Bermell-Garcia P, Tiwari A and Turner C (2017). A new knowledge sourcing framework for knowledge-based engineering, Computers and Industrial Engineering, 104:C, (35-50), Online publication date: 1-Feb-2017.
  11. Kolomvatsos K (2016). An intelligent, uncertainty driven aggregation scheme for streams of ordered sets, Applied Intelligence, 45:3, (713-735), Online publication date: 1-Oct-2016.
  12. ACM
    Harper F and Konstan J (2015). The MovieLens Datasets, ACM Transactions on Interactive Intelligent Systems, 5:4, (1-19), Online publication date: 7-Jan-2016.
  13. ACM
    Adams R, Sadasivam R, Balakrishnan K, Kinney R, Houston T and Marlin B PERSPeCT Proceedings of the 8th ACM Conference on Recommender systems, (329-332)
  14. ACM
    Marttila-Kontio M, Kontio M and Hotti V Advanced data analytics education for students and companies Proceedings of the 2014 conference on Innovation & technology in computer science education, (249-254)
  15. ACM
    Shi X, Cui B, Dobbie G and Ooi B Towards unified ad-hoc data processing Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, (1263-1274)
  16. Hayakawa K, Hayakawa T and Ito T Implementation of an Event Information Sharing Support System Utilizing OpenStreetMap Proceedings, Part II, of the 27th International Conference on Modern Advances in Applied Intelligence - Volume 8482, (328-337)
  17. Ito T and Iwama Y Implementation of Question Answering System Based on Reference-Based Ranking Algorithm Proceedings, Part I, of the 27th International Conference on Modern Advances in Applied Intelligence - Volume 8481, (160-169)
  18. ACM
    Li X, Paracha S, Wu J and Yoshie O Using Planning with Action Preference in Story Generation Proceedings of International Conference on Advances in Mobile Computing & Multimedia, (555-558)
  19. Ziesemer A, Müller L and Silveira M Gamification aware Proceedings of the 12th Brazilian Symposium on Human Factors in Computing Systems, (276-279)
  20. ACM
    Inzinger C, Hummer W, Satzger B, Leitner P and Dustdar S Identifying incompatible service implementations using pooled decision trees Proceedings of the 28th Annual ACM Symposium on Applied Computing, (485-492)
  21. Luz N, Silva N and Novais P Social Networked Multi-agent Negotiation in Ontology Alignment Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 02, (310-315)
  22. Rahman M, Ma W, Tran D and Campbell J A comprehensive survey of the feature extraction methods in the EEG research Proceedings of the 12th international conference on Algorithms and Architectures for Parallel Processing - Volume Part II, (274-283)
  23. Hunsicker S, Yu C and Federmann C Machine learning for hybrid machine translation Proceedings of the Seventh Workshop on Statistical Machine Translation, (312-316)
  24. Federmann C Can machine learning algorithms improve phrase selection in hybrid machine translation? Proceedings of the Joint Workshop on Exploiting Synergies between Information Retrieval and Machine Translation (ESIRMT) and Hybrid Approaches to Machine Translation (HyTra), (113-118)
  25. ACM
    Priedhorsky R, Pitchford D, Sen S and Terveen L Recommending routes in the context of bicycling Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work, (979-988)
  26. Ignatov D, Poelmans J, Dedene G and Viaene S A new cross-validation technique to evaluate quality of recommender systems Proceedings of the First Indo-Japan conference on Perception and Machine Intelligence, (195-202)
  27. Vivacqua A and Borges M (2012). Taking advantage of collective knowledge in emergency response systems, Journal of Network and Computer Applications, 35:1, (189-198), Online publication date: 1-Jan-2012.
  28. ACM
    Weichselbraun A, Gindl S and Scharl A Using games with a purpose and bootstrapping to create domain-specific sentiment lexicons Proceedings of the 20th ACM international conference on Information and knowledge management, (1053-1060)
  29. Chmielewski M and Stąpor P Protégé based environment for DL knowledge base structural analysis Proceedings of the Third international conference on Computational collective intelligence: technologies and applications - Volume Part I, (314-325)
  30. Li Z and Tang X Group polarization and non-positive social influence Proceedings of the 2011 international conference on Brain informatics, (295-303)
  31. ACM
    Jung S and Lawrance J (2011). Web information retrieval and filtering course to undergraduates using open source programming, ACM Inroads, 2:3, (47-50), Online publication date: 31-Aug-2011.
  32. Shamshurin I Data representation in machine learning-based sentiment analysis of customer reviews Proceedings of the 4th international conference on Pattern recognition and machine intelligence, (254-260)
  33. Gao M, Wu Z and Jiang F (2011). Userrank for item-based collaborative filtering recommendation, Information Processing Letters, 111:9, (440-446), Online publication date: 1-Apr-2011.
  34. ACM
    Santosa H, Milton J and Kennedy P HMXT-GP Proceedings of the 2011 ACM Symposium on Applied Computing, (1070-1075)
  35. ACM
    Berezovskiy A and Carr L A framework for dynamic data source identification and orchestration on the web Proceedings of the 3rd and 4th International Workshop on Web APIs and Services Mashups, (1-8)
  36. ACM
    Vergados D, Lykourentzou I and Kapetanios E A resource allocation framework for collective intelligence system engineering Proceedings of the International Conference on Management of Emergent Digital EcoSystems, (182-188)
  37. Bozo J, Alarcón R and Iribarra S Recommending learning objects according to a teachers' contex model Proceedings of the 5th European conference on Technology enhanced learning conference on Sustaining TEL: from innovation to learning and practice, (470-475)
  38. ACM
    De Pessemier T, Dooms S, Deryckere T and Martens L Time dependency of data quality for collaborative filtering algorithms Proceedings of the fourth ACM conference on Recommender systems, (281-284)
  39. ACM
    Withana A, Matsui R, Sugimoto M, Harada K and Inakage M Narrative image composition using objective and subjective tagging ACM SIGGRAPH 2010 Posters, (1-1)
  40. Mudge R The design of a proofreading software service Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics and Writing: Writing Processes and Authoring Aids, (24-32)
  41. Bidlack C and Wellman M (2010). Exceptional Data Quality Using Intelligent Matching and Retrieval, AI Magazine, 31:1, (65-73), Online publication date: 1-Mar-2010.
  42. Neal M and Speers d Context aware personal agent for spiritual exploration Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics, (1242-1249)
  43. ACM
    Rafelsberger W and Scharl A Games with a purpose for social networking platforms Proceedings of the 20th ACM conference on Hypertext and hypermedia, (193-198)
  44. Fessakis G and Dimitracopoulou A Proposing "collaborative filtering" to foster collaboration in ScratchR community Proceedings of the 9th international conference on Computer supported collaborative learning - Volume 2, (168-170)
  45. ACM
    Garcia-Perate G, Agarwal P and Wilson D HINTeractions Proceedings of the 3rd International Conference on Tangible and Embedded Interaction, (119-122)
  46. Arcoverde D, Melo C and Franco R Labbos Proceedings of the VIII Brazilian Symposium on Human Factors in Computing Systems, (326-327)
  47. Scharl A, Weichselbraun A and Wohlgenannt G A web-based user interaction framework for collaboratively building and validating ontologies Proceedings of the VIII Brazilian Symposium on Human Factors in Computing Systems, (244-247)
  48. ACM
    Ketter W, Batchu A, Berosik G and McCreary D A semantic web architecture for advocate agents to determine preferences and facilitate decision making Proceedings of the 10th international conference on Electronic commerce, (1-10)
Contributors

Recommendations