Data Quality: The Accuracy Dimension is about assessing the quality of corporate data and improving its accuracy using the data profiling method. Corporate data is increasingly important as companies continue to find new ways to use it. Likewise, improving the accuracy of data in information systems is fast becoming a major goal as companies realize how much it affects their bottom line. Data profiling is a new technology that supports and enhances the accuracy of databases throughout major IT shops. Jack Olson explains data profiling and shows how it fits into the larger picture of data quality. * Provides an accessible, enjoyable introduction to the subject of data accuracy, peppered with real-world anecdotes. * Provides a framework for data profiling with a discussion of analytical tools appropriate for assessing data accuracy. * Is written by one of the original developers of data profiling technology. * Is a must-read for any data management staff, IT management staff, and CIOs of companies with data assets.
Cited By
- Francisco M, Alves-Souza S, Campos E and De Souza L Total Data Quality Management and Total Information Quality Management Applied to Costumer Relationship Management Proceedings of the 9th International Conference on Information Management and Engineering, (40-45)
- Song S, Zhu H and Wang J Constraint-Variance Tolerant Data Repairing Proceedings of the 2016 International Conference on Management of Data, (877-892)
- Xu H (2015). What Are the Most Important Factors for Accounting Information Quality and Their Impact on AIS Data Quality Outcomes?, Journal of Data and Information Quality, 5:4, (1-22), Online publication date: 3-Mar-2015.
- Liu S, Zhao Q and Wu X (2014). Feature selection based on partition clustering, International Journal of Knowledge-based and Intelligent Engineering Systems, 18:2, (135-142), Online publication date: 1-Apr-2014.
- Alpar P and Winkelsträter S (2014). Assessment of data quality in accounting data with association rules, Expert Systems with Applications: An International Journal, 41:5, (2259-2268), Online publication date: 1-Apr-2014.
- Pavlov I A QoX model for ETL subsystems Proceedings of the 14th International Conference on Computer Systems and Technologies, (15-21)
- Lóscio B, Batista M, Souza D and Salgado A Using information quality for the identification of relevant web data sources Proceedings of the 14th International Conference on Information Integration and Web-based Applications & Services, (36-44)
- Collins C and Janssens K (2012). Creating a General (Family) Practice Epidemiological Database in Ireland - Data Quality Issue Management, Journal of Data and Information Quality, 4:1, (1-9), Online publication date: 1-Oct-2012.
- Fürber C and Hepp M Using semantic web resources for data quality management Proceedings of the 17th international conference on Knowledge engineering and management by the masses, (211-225)
- Khatri V and Brown C (2010). Designing data governance, Communications of the ACM, 53:1, (148-152), Online publication date: 1-Jan-2010.
- Fisher C, Lauria E and Matheus C (2009). An Accuracy Metric, Journal of Data and Information Quality, 1:3, (1-21), Online publication date: 1-Dec-2009.
- Rodic J and Baranovic M Generating data quality rules and integration into ETL process Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP, (65-72)
- Hüner K, Ofner M and Otto B Towards a maturity model for corporate data quality management Proceedings of the 2009 ACM symposium on Applied Computing, (231-238)
- Jovanovic V and Cupic L Teaching agile validation of data models Proceedings of the 9th ACM SIGITE conference on Information technology education, (139-146)
- van Hooland S, Bontemps Y and Kaufman S Answering the call for more accountability Proceedings of the 2008 International Conference on Dublin Core and Metadata Applications, (93-103)
- Farinha J and Trigueiros M An extensible metadata framework for data quality assessment of composite structures Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery, (34-44)
- Cappiello C, Comuzzi M and Plebani P On automated generation of web service level agreements Proceedings of the 19th international conference on Advanced information systems engineering, (264-278)
- Gomes P, Farinha J and Trigueiros M A data quality metamodel extension to CWM Proceedings of the fourth Asia-Pacific conference on Comceptual modelling - Volume 67, (17-26)
- Ardagna D, Cappiello C, Francalanci C and Groppi A Brokering multisource data with quality constraints Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part I, (807-817)
- Chen Z and Narasayya V Efficient computation of multiple group by queries Proceedings of the 2005 ACM SIGMOD international conference on Management of data, (263-274)
- Leser U and Freytag J Mining for patterns in contradictory data Proceedings of the 2004 international workshop on Information quality in information systems, (51-58)
Index Terms
- Data Quality: The Accuracy Dimension
Recommendations
Towards Data Quality into the Data Warehouse Development
DASC '11: Proceedings of the 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure ComputingCommonly, DW development methodologies, paying little attention to the problem of data quality and completeness. One of the common mistakes made during the planning of a data warehousing project is to assume that data quality will be addressed during ...