Abstract
In the last decade, advances in computing have deeply transformed data processing. Increasingly systems aim to process massive amounts of data efficiently, often with fast response times that are typically characterised by the 4V's, i.e., Volume, Variety, Velocity, and Veracity. While fast data processing is desirable, it is also often the case that the outcomes of computationally expensive processes become obsolete over time, due to changes in inputs, reference datasets, tools, libraries, and deployment environment. Given massive data processing, such changes must be carefully accounted for, and their impact on original computation assessed, to determine how much re-computation is needed in response to changes.
- Anand, M. K., Bowers, S., McPhillips, T. M., and Lud¨ascher, B. Exploring Scientific Workflow Provenance Using Hybrid Queries over Nested Data and Lineage Graphs. In SSDBM (2009), pp. 237--254. Google ScholarDigital Library
- Cai, Y., Giarrusso, P. G., Rendel, T., and Ostermann, K. A theory of changes for higher-order languages: Incrementalizing γ-calculi by static differentiation. In ACM SIGPLAN Notices (2014), vol. 49, ACM, pp. 145--155. Google ScholarDigital Library
- Ca la, J., and Missier, P. Selective and Recurring Re-computation of Big Data Analytics Tasks: Insights from a Genomics Case Study. Big Data Research in press (aug 2018).Google Scholar
- Cheney, J., Acar, U. A., and Perera, R. Toward a Theory of Self-explaining Computation. Springer Berlin Heidelberg, Berlin, Heidelberg, 2013, pp. 193--216.Google ScholarCross Ref
- Cheney, J., Chiticariu, L., and Tan, W.-C. Provenance in databases: Why, how, and where. Foundations and Trends in Databases 1, 4 (2009), 379--474. Google ScholarDigital Library
- Griffin, T., Libkin, L., and Trickey, H. An improved algorithm for the incremental recomputation of active relational expressions. IEEE Transactions on Knowledge and Data Engineering (1997). Google ScholarDigital Library
- Horn, R., Perera, R., and Cheney, J. Incremental relational lenses. Proc. ACM Program. Lang. 2, ICFP (July 2018), 74:1--74:30. Google ScholarDigital Library
- K¨ohler, S., Riddle, S., Zinn, D., McPhillips, T., and Lud¨ascher, B. Improving workflow fault tolerance through provenance-based recovery. In SSDBM (2011), pp. 207--224. Google ScholarDigital Library
- Krishnan, D. R., Quoc, D. L., Bhatotia, P., Fetzer, C., and Rodrigues, R. Incapprox: A data analytics system for incremental approximate computing. In 25th International Conference on World Wide Web (2016), pp. 1133--1144. Google ScholarDigital Library
- Lud¨ascher, B., Altintas, I., Berkley, C., Higgins, D., Jaeger, E., Jones, M., Lee, E. A., Tao, J., and Zhao, Y. Scientific Workflow Management and the Kepler System. Concurrency and Computation: Practice and Experience 18, 10 (2005), 1039--1065. Google ScholarDigital Library
- McSherry, F., Murray, D. G., Isaacs, R., and Isard, M. Differential dataflow. In CIDR (2013).Google Scholar
- Murray, D. G., McSherry, F., Isaacs, R., Isard, M., Barham, P., and Abadi, M. Naiad: a timely dataflow system. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles (2013), ACM, pp. 439--455. Google ScholarDigital Library
- Pham, Q., Malik, T., and Foster, I. Using Provenance for Repeatability. In TaPP (2013). Google ScholarDigital Library
- Souza, R., and Mattoso, M. Provenance of dynamic adaptations in user-steered dataflows. In IPAW (2018), pp. 16--29.Google ScholarCross Ref
Recommendations
IWGS 2016 workshop report: The 7th ACM SIGSPATIAL International Workshop on GeoStreaming: San Francisco, CA, USA - October 31, 2016
The ACM SIGSPATIAL International Workshop on Geostreaming (IWGS) was held for the seventh time in conjunction with the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACMGIS 2016). The workshop has been a ...
Comments