Continuous Health Monitoring of Machinery using Online Clustering on Unlabeled Data Streams. Le-Nguyen, M., Turgis, F., Fayemi, P., & Bifet, A. In 2022 IEEE International Conference on Big Data (Big Data), pages 1866–1873, December 2022.
Maintenance is an important support function to ensure the reliability, safety, and availability in the railway. Lately, machine learning has become a major player and allows practitioners to build intricate learning models for machinery maintenance. Commonly, a model is trained on static data and is retrained on new data that exhibit novelties unknown to the model. On the contrary, online machine learning is a learning paradigm that adapts the models to new data, thus enabling adaptive, lifelong learning. Our goal is to leverage online learning on unlabeled data streams to enhance railway machinery maintenance. We propose Continuous Health Monitoring using Online Clustering (CheMoc) as an unsupervised method that learns the health profiles of the systems incrementally, assesses their working condition continuously via an adaptive health score, and works efficiently on streaming data. We evaluate CheMoc on a real-world data set from a national railway company. The results show that CheMoc discovered relevant health clusters, as confirmed by a domain expert, and processed the data of an entire year under two hours using only 600 MB of memory.
@inproceedings{le-nguyen_continuous_2022,
	title = {Continuous {Health} {Monitoring} of {Machinery} using {Online} {Clustering} on {Unlabeled} {Data} {Streams}},
	doi = {10.1109/BigData55660.2022.10021002},
	abstract = {Maintenance is an important support function to ensure the reliability, safety, and availability in the railway. Lately, machine learning has become a major player and allows practitioners to build intricate learning models for machinery maintenance. Commonly, a model is trained on static data and is retrained on new data that exhibit novelties unknown to the model. On the contrary, online machine learning is a learning paradigm that adapts the models to new data, thus enabling adaptive, lifelong learning. Our goal is to leverage online learning on unlabeled data streams to enhance railway machinery maintenance. We propose Continuous Health Monitoring using Online Clustering (CheMoc) as an unsupervised method that learns the health profiles of the systems incrementally, assesses their working condition continuously via an adaptive health score, and works efficiently on streaming data. We evaluate CheMoc on a real-world data set from a national railway company. The results show that CheMoc discovered relevant health clusters, as confirmed by a domain expert, and processed the data of an entire year under two hours using only 600 MB of memory.},
	booktitle = {2022 {IEEE} {International} {Conference} on {Big} {Data} ({Big} {Data})},
	author = {Le-Nguyen, Minh-Huong and Turgis, Fabien and Fayemi, Pierre-Emmanuel and Bifet, Albert},
	month = dec,
	year = {2022},
	keywords = {Adaptation models, Companies, Employee welfare, Machine learning, Maintenance engineering, Memory management, Rail transportation, maintenance, online clustering, railway},
	pages = {1866--1873},
}
