ABSTRACT
We study the problem of determining the proper aggregation granularity for a stream of time-stamped edges. Such streams are used to build time-evolving networks, which are subsequently used to study topics such as network growth. Currently, aggregation lengths are chosen arbitrarily, based on intuition or convenience. We describe ADAGE, which detects the appropriate aggregation intervals from streaming edges and outputs a sequence of structurally mature graphs. We demonstrate the value of ADAGE in automatically finding the appropriate aggregation intervals on edge streams for belief propagation to detect malicious files and machines.
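To make the aggregation problem concrete, here is a minimal sketch of the fixed-interval baseline that the abstract describes as arbitrary. It is not ADAGE, whose procedure is not given here. The function name `fixed_window_snapshots`, the `(timestamp, source, target)` edge layout, and the `width` parameter are assumptions made for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical edge layout: (timestamp, source node, target node).
Edge = Tuple[float, str, str]


def fixed_window_snapshots(
    edges: Iterable[Edge], width: float
) -> List[Set[Tuple[str, str]]]:
    """Group time-stamped edges into consecutive windows of `width` time units.

    Each returned set holds the distinct (source, target) pairs seen in one
    window. Windows with no edges appear as empty sets.
    """
    if width <= 0:
        raise ValueError("width must be positive")

    ordered = sorted(edges)  # sort by timestamp first
    if not ordered:
        return []

    start = ordered[0][0]
    buckets: Dict[int, Set[Tuple[str, str]]] = defaultdict(set)
    for timestamp, source, target in ordered:
        index = int((timestamp - start) // width)
        buckets[index].add((source, target))

    last = max(buckets)
    return [buckets.get(i, set()) for i in range(last + 1)]


if __name__ == "__main__":
    stream = [(0, "a", "b"), (1, "b", "c"), (5, "a", "c"), (12, "c", "d")]
    for width in (5, 10):
        snapshots = fixed_window_snapshots(stream, width)
        print(width, [sorted(s) for s in snapshots])
```

The same stream yields three snapshots at `width=5` and two at `width=10`, so the sequence of graphs depends directly on a parameter picked by hand. That dependence is what an automatically detected interval is meant to remove.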
Index Terms
- Generating Graph Snapshots from Streaming Edge Data