Deep Reinforcement Learning of Autonomous Control Actions to Improve Bus-Service Regularity. Anonymous 2023.
abstract   bibtex   
Bus Bunching is caused by irregularities in demand across the bus route, together with other factors such as traffic. The effect of this problem is that buses operating on the same route start to catch up with each other, severely impacting the regularity and the quality of the service. Control actions such as Bus Holding and Stop Skipping can be introduced to regulate the service and adjust the headway, the temporal gap between the arrival times of two buses on the same bus stop. Traditionally, this phenomenon is mitigated either reactively online through simple rule-based control, or preemptively through analytical scheduling solutions such as mathematical opti- mization. Over time, both approaches degrade to an irregular service. In this work, we investigate the use of Deep Reinforcement Learn- ing algorithms to train a policy that determines which actions should take place at specific control points, in order to regularise the bus service. While prior studies are typically restricted to one control ac- tion, we consider both Bus Holding and Stop Skipping. We replicate benchmarks found in the latest literature, and also introduce traffic to increase the realism of the simulation. Furthermore we also con- sider scenarios where the service is already unstable and buses are already bunched together, a first of this kind of study. We compare the performance of the RL-based policies with a no-control policy and a rule-based policy, and the learnt policies not only keep a sig- nificantly lower headway variance and mean waiting time, but also recover from unstable scenarios and restore service regularity.
@misc{anonymous_deep_2023,
	title = {Deep {Reinforcement} {Learning} of {Autonomous} {Control} {Actions} to {Improve} {Bus}-{Service} {Regularity}},
	abstract = {Bus Bunching is caused by irregularities in demand 
across the bus route, together with other factors such as traffic. The 
effect of this problem is that buses operating on the same route start 
to catch up with each other, severely impacting the regularity and 
the quality of the service. Control actions such as Bus Holding and 
Stop Skipping can be introduced to regulate the service and adjust the 
headway, the temporal gap between the arrival times of two buses on 
the same bus stop. Traditionally, this phenomenon is mitigated either 
reactively online through simple rule-based control, or preemptively 
through analytical scheduling solutions such as mathematical opti- 
mization. Over time, both approaches degrade to an irregular service.
In this work, we investigate the use of Deep Reinforcement Learn- 
ing algorithms to train a policy that determines which actions should 
take place at specific control points, in order to regularise the bus 
service. While prior studies are typically restricted to one control ac- 
tion, we consider both Bus Holding and Stop Skipping. We replicate 
benchmarks found in the latest literature, and also introduce traffic 
to increase the realism of the simulation. Furthermore we also con- 
sider scenarios where the service is already unstable and buses are 
already bunched together, a first of this kind of study. We compare 
the performance of the RL-based policies with a no-control policy 
and a rule-based policy, and the learnt policies not only keep a sig- 
nificantly lower headway variance and mean waiting time, but also 
recover from unstable scenarios and restore service regularity.},
	author = {Anonymous},
	year = {2023},
}

Downloads: 0