ABSTRACT
The rapid growth in data science and technology, in particular in the complexity and volume of Big Data, has introduced new challenges for the research community. Several of these are related to the nature of data generation, since most of the data sources produce data continuously. Examples include sensor and wireless networks, radio frequency identification, customer click streams, telephone records, multimedia and scientific data, and sets of retail chain transactions, among others. These sources are called data streams, ordered sequences of instances that can typically be read only once or a small number of times due to its their high speed of flow and continuous nature. Data streams are characterized by being open-ended, and generated by non-stationary distributions. Thus, they are increasingly important in the research community, as new algorithms are needed to efficiently process this streaming data, to enable rapid and real-time updated understanding of the data. The goal of this track is to convene researchers who work with data streams, defining models, processing continuous queries, developing sampling, filtering and stream mining methods, machine learning, and visualization techniques and related issues.
Recommendations
Data Streams with Bounded Deletions
PODS '18: Proceedings of the 37th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database SystemsTwo prevalent models in the data stream literature are the insertion-only and turnstile models. Unfortunately, many important streaming problems require a Θ(log(n)) multiplicative factor more space for turnstile streams than for insertion-only streams. ...
Comments