Maintenance optimisation of multicomponent systems using hierarchical coordinated reinforcement learning. Zhou, Y., Li, B., & Lin, T. R. Reliability Engineering & System Safety, 217:108078, January, 2022.
The Markov decision process (MDP) is a widely used method to optimise the maintenance of multicomponent systems, which can provide a system-level maintenance action at each decision point to address various dependences among components. However, MDP suffers from the “curse of dimensionality” and can only process small-scale systems. This paper develops a hierarchical coordinated reinforcement learning (HCRL) algorithm to optimise the maintenance of large-scale multicomponent systems. Both parameters of agents and the coordination relationship among agents are designed based on system characteristics. Furthermore, the hierarchical structure of agents is established according to the structural importance measures of components. The effectiveness of the proposed HCRL algorithm is validated using two maintenance optimisation problems, one on a natural gas plant system and the other using a 12-component series system under dependent competing risks. Results show that the proposed HCRL outperforms methods in two recently published papers and other benchmark approaches including the newly emerging deep reinforcement learning.
@article{zhou_maintenance_2022,
	title = {Maintenance optimisation of multicomponent systems using hierarchical coordinated reinforcement learning},
	volume = {217},
	issn = {0951-8320},
	url = {https://www.sciencedirect.com/science/article/pii/S0951832021005767},
	doi = {10.1016/j.ress.2021.108078},
	abstract = {The Markov decision process (MDP) is a widely used method to optimise the maintenance of multicomponent systems, which can provide a system-level maintenance action at each decision point to address various dependences among components. However, MDP suffers from the “curse of dimensionality” and can only process small-scale systems. This paper develops a hierarchical coordinated reinforcement learning (HCRL) algorithm to optimise the maintenance of large-scale multicomponent systems. Both parameters of agents and the coordination relationship among agents are designed based on system characteristics. Furthermore, the hierarchical structure of agents is established according to the structural importance measures of components. The effectiveness of the proposed HCRL algorithm is validated using two maintenance optimisation problems, one on a natural gas plant system and the other using a 12-component series system under dependent competing risks. Results show that the proposed HCRL outperforms methods in two recently published papers and other benchmark approaches including the newly emerging deep reinforcement learning.},
	language = {en},
	urldate = {2021-11-15},
	journal = {Reliability Engineering \& System Safety},
	author = {Zhou, Yifan and Li, Bangcheng and Lin, Tian Ran},
	month = jan,
	year = {2022},
	keywords = {Condition based maintenance, Coordinated reinforcement learning, Deep reinforcement learning, Hierarchical multiagent reinforcement learning},
	pages = {108078},
}
