Analysis of data quality issues in real-world industrial data. Hubauer, T. M., Lamparter, S., Roshchin, M., Solomakhina, N., & Watson, S. In Proceedings of the 2013 Annual Conference of the Prognostics and Health Management Society, 2013.
In large industries, the use of advanced technological methods and modern equipment brings the problem of storing, interpreting, and analyzing huge amounts of information. Handling information becomes more complicated and more important at the same time. Data quality is therefore one of the major challenges, given the rapid growth of information, the fragmentation of information systems, incorrect data formatting, and other issues. The aim of this paper is to describe industrial data processing and analytics on a real-world use case. The most crucial data quality issues are described, examined, and classified in terms of Data Quality Dimensions. Factual industrial information supports and illustrates each encountered data deficiency. In addition, we describe methods for eliminating data quality issues and the data analysis techniques that are applied after the data cleaning procedure. Finally, an approach to addressing data quality problems in large-scale industrial datasets is proposed. It comprises several well-known techniques from both mathematical logic and statistics, improving the data quality procedure and the cleaning results.
@inproceedings{ Hubauer2013,
abstract = {In large industries, the use of advanced technological methods and modern equipment brings the problem of storing, interpreting, and analyzing huge amounts of information. Handling information becomes more complicated and more important at the same time. Data quality is therefore one of the major challenges, given the rapid growth of information, the fragmentation of information systems, incorrect data formatting, and other issues. The aim of this paper is to describe industrial data processing and analytics on a real-world use case. The most crucial data quality issues are described, examined, and classified in terms of Data Quality Dimensions. Factual industrial information supports and illustrates each encountered data deficiency. In addition, we describe methods for eliminating data quality issues and the data analysis techniques that are applied after the data cleaning procedure. Finally, an approach to addressing data quality problems in large-scale industrial datasets is proposed. It comprises several well-known techniques from both mathematical logic and statistics, improving the data quality procedure and the cleaning results.},
added-at = {2013-09-17T09:27:19.000+0200},
audience = {industrial},
author = {Hubauer, Thomas M. and Lamparter, Steffen and Roshchin, Mikhail and Solomakhina, Nina and Watson, Stuart},
biburl = {http://www.bibsonomy.org/bibtex/2fbde32101039ca739d0e7fbd09b3f5d3/thubauer},
booktitle = {Proceedings of the 2013 Annual Conference of the Prognostics and Health Management Society},
interhash = {de6fdc2760c5843f4f93ddc0f4fed4ca},
intrahash = {fbde32101039ca739d0e7fbd09b3f5d3},
keywords = {analysis data myown optique-project quality},
partneroptique = {SIE},
title = {Analysis of data quality issues in real-world industrial data},
wpoptique = {WP8},
year = {2013},
yearoptique = {Y1}
}