Data mining and clustering in chemical process databases for monitoring and knowledge discovery

Data mining and clustering in chemical process databases for monitoring and knowledge discovery. Thomas, M. C., Zhu, W., & Romagnoli, J. A. Journal of Process Control, 67:160–175, July, 2018.

Modern chemical plants maintain large historical databases recording past sensor measurements which advanced process monitoring techniques analyze to help plant operators and engineers interpret the meaning of live trends in databases. However, many of the best process monitoring methods require data organized into groups before training is possible. In practice, such organization rarely exists and the time required to create classified training data is an obstacle to the use of advanced process monitoring strategies. Data mining and knowledge discovery techniques drawn from computer science literature can help engineers find fault states in historical databases and group them together with little detailed knowledge of the process. This study evaluates how several data clustering and feature extraction techniques work together to reveal useful trends in industrial chemical process data. Two studies on an industrial scale separation tower and the Tennessee Eastman process simulation demonstrate data clustering and feature extraction effectively revealing significant process trends from high dimensional, multivariate data. Process knowledge and supervised clustering metrics compare the cluster results against true labels in the data to compare performance of different combinations of dimensionality reduction and data clustering approaches.

@article{thomas_data_2018,
	series = {Big {Data}: {Data} {Science} for {Process} {Control} and {Operations}},
	title = {Data mining and clustering in chemical process databases for monitoring and knowledge discovery},
	volume = {67},
	issn = {0959-1524},
	url = {https://www.sciencedirect.com/science/article/pii/S095915241730032X},
	doi = {10.1016/j.jprocont.2017.02.006},
	abstract = {Modern chemical plants maintain large historical databases recording past sensor measurements which advanced process monitoring techniques analyze to help plant operators and engineers interpret the meaning of live trends in databases. However, many of the best process monitoring methods require data organized into groups before training is possible. In practice, such organization rarely exists and the time required to create classified training data is an obstacle to the use of advanced process monitoring strategies. Data mining and knowledge discovery techniques drawn from computer science literature can help engineers find fault states in historical databases and group them together with little detailed knowledge of the process. This study evaluates how several data clustering and feature extraction techniques work together to reveal useful trends in industrial chemical process data. Two studies on an industrial scale separation tower and the Tennessee Eastman process simulation demonstrate data clustering and feature extraction effectively revealing significant process trends from high dimensional, multivariate data. Process knowledge and supervised clustering metrics compare the cluster results against true labels in the data to compare performance of different combinations of dimensionality reduction and data clustering approaches.},
	language = {en},
	urldate = {2022-05-02},
	journal = {Journal of Process Control},
	author = {Thomas, Michael C. and Zhu, Wenbo and Romagnoli, Jose A.},
	month = jul,
	year = {2018},
	keywords = {Data clustering, Data mining, Dimensionality reduction, Knowledge discovery},
	pages = {160--175},
}
