Individual and Collective Graph Mining: Principles, Algorithms, and Applications

October 2017

Publisher: Morgan & Claypool Publishers

ISBN: 978-1-68173-039-4

Published: 26 October 2017

Pages: 206

Abstract

Graphs naturally represent information ranging from links between web pages, to communication in email networks, to connections between neurons in our brains. These graphs often span billions of nodes and interactions between them. Within this deluge of interconnected data, how can we find the most important structures and summarize them? How can we efficiently visualize them? How can we detect anomalies that indicate critical events, such as an attack on a computer system, disease formation in the human brain, or the fall of a company? This book presents scalable, principled discovery algorithms that combine globality with locality to make sense of one or more graphs. In addition to fast algorithmic methodologies, we also contribute graph-theoretical ideas and models, and real-world applications in two main areas:

- Individual Graph Mining: We show how to interpretably summarize a single graph by identifying its important graph structures. We complement summarization with inference, which leverages information about a few entities (obtained via summarization or other methods) and the network structure to efficiently and effectively learn information about the unknown entities.
- Collective Graph Mining: We extend the idea of individual-graph summarization to time-evolving graphs, and show how to scalably discover temporal patterns. Apart from summarization, we claim that graph similarity is often the underlying problem in a host of applications where multiple graphs occur (e.g., temporal anomaly detection, discovery of behavioral patterns), and we present principled, scalable algorithms for aligning networks and measuring their similarity.

The methods that we present in this book leverage techniques from diverse areas, such as matrix algebra, graph theory, optimization, information theory, machine learning, finance, and social science, to solve real-world problems.

We present applications of our exploration algorithms to massive datasets, including a Web graph of 6.6 billion edges, a Twitter graph of 1.8 billion edges, brain graphs with up to 90 million edges, collaboration networks, peer-to-peer networks, and browser logs, all spanning millions of users and interactions.

Contributors

Danai Koutra
University of Michigan, Ann Arbor
- Publication Years: 2011 - 2024
- Publication counts: 66
- Citation count: 1,880
- Available for Download: 48
- Downloads (cumulative): 35,925
- Downloads (12 months): 7,393
- Downloads (6 weeks): 1,118
- Average Downloads per Article: 748
- Average Citation per Article: 28
Christos Faloutsos
Carnegie Mellon University
- Publication Years: 1983 - 2024
- Publication counts: 508
- Citation count: 38,866
- Available for Download: 266
- Downloads (cumulative): 319,847
- Downloads (12 months): 24,774
- Downloads (6 weeks): 3,607
- Average Downloads per Article: 1,202
- Average Citation per Article: 77
J. Han
University of Illinois Urbana-Champaign
- Publication Years: 1986 - 2024
- Publication counts: 702
- Citation count: 41,152
- Available for Download: 384
- Downloads (cumulative): 376,735
- Downloads (12 months): 31,883
- Downloads (6 weeks): 5,063
- Average Downloads per Article: 981
- Average Citation per Article: 59
Lise Carol Getoor
University of California, Santa Cruz
- Publication Years: 1995 - 2023
- Publication counts: 177
- Citation count: 5,444
- Available for Download: 91
- Downloads (cumulative): 75,419
- Downloads (12 months): 4,601
- Downloads (6 weeks): 675
- Average Downloads per Article: 829
- Average Citation per Article: 31
Wei Wang
- Publication Years: 2017 - 2017
- Publication counts: 2
- Citation count: 6
- Available for Download: 0
- Downloads (cumulative): 0
- Downloads (12 months): 0
- Downloads (6 weeks): 0
- Average Downloads per Article: 0
- Average Citation per Article: 3