Emergent collective behaviors in a multi-agent reinforcement learning based pedestrian simulation
| Content Provider | Semantic Scholar |
|---|---|
| Author | Martinez-Gil, Francisco; Fernández, Fernando; Lozano, Miguel |
| Copyright Year | 2012 |
| Abstract | In this work, a multi-agent reinforcement learning (RL) framework is used to obtain plausible simulations of pedestrian groups. In our framework, each virtual agent learns individually and independently to control its velocity inside a virtual environment. The case study consists of simulating the crossing of two groups of embodied virtual agents inside a narrow corridor. This scenario lets us test whether a collective behavior, specifically lane formation, emerges in our study as it does in corridors with real pedestrians. The paper studies the influence of different learning algorithms, function approximation approaches, and knowledge transfer mechanisms on the performance of the learned pedestrian behaviors. Specifically, two different RL-based schemas are analyzed. The first one, Iterative Vector Quantization with Q-Learning (ITVQQL), iteratively improves a state-space generalizer based on vector quantization. The second schema, named TS, uses tile coding as the generalization method with the Sarsa(λ) algorithm. The knowledge transfer approach is based on Probabilistic Policy Reuse, which incorporates previously acquired knowledge into current learning processes; additionally, value function transfer is used in the ITVQQL schema to transfer the value function between consecutive iterations. The results demonstrate empirically that our RL framework generates individual behaviors from which the expected collective behavior emerges, as occurs with real pedestrians. This collective behavior appears independently of the generalization method used, but depends strongly on whether knowledge transfer was applied. In addition, the use of transfer techniques has a notable influence on the final performance (measured as the number of times the task was solved) of the learned behaviors. A video of the simulation is available at http://www.uv.es/agentes/RL/index.htm |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://rldm.org/wp-content/uploads/2013/10/RLDM2013ExtendedAbstracts.pdf |
| Alternate Webpage(s) | https://www.uv.es/agentes/RL/publicaciones/rldm13.pdf |
| Alternate Webpage(s) | http://www.uv.es/agentes/RL/publicaciones/rldm13.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |
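
The abstract describes Probabilistic Policy Reuse combined with Q-learning but gives no implementation details. The following is a minimal sketch of the general idea only, not the authors' code: with probability `psi` the agent follows a previously learned policy, and otherwise it acts epsilon-greedily on its current Q-values. The names `pi_reuse_action`, `q_update`, `past_policy`, and all parameter values are assumptions made for illustration, and a plain tabular Q-function stands in for the vector quantization and tile coding generalizers used in the paper.

```python
import random
from collections import defaultdict


def pi_reuse_action(q, state, actions, past_policy, psi, epsilon, rng):
    """Choose an action (hypothetical helper, not from the paper).

    With probability psi, reuse the action suggested by a past policy;
    otherwise act epsilon-greedily on the current Q-values.
    """
    if rng.random() < psi:
        return past_policy(state)
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """Standard one-step tabular Q-learning update."""
    best_next = max(q[(next_state, a)] for a in actions)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * td_error


# Illustrative usage with made-up states and actions.
q = defaultdict(float)
rng = random.Random(0)
actions = [0, 1, 2]
action = pi_reuse_action(q, "s0", actions, lambda s: 2,
                         psi=1.0, epsilon=0.1, rng=rng)
q_update(q, "s0", action, reward=1.0, next_state="s1", actions=actions)
```

In policy reuse schemes, `psi` is typically decayed over the steps of an episode so that the learner relies less on the transferred policy as the episode progresses; the decay schedule is omitted here because the record does not specify one.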