Traffic signal optimization through discrete and continuous reinforcement learning with robustness analysis in downtown Tehran


Aslani, Mohammad; Högskolan i Gävle, Datavetenskap, Geospatial Informationsvetenskap (author)

Article/chapter, English, 2018

Publisher, publication year, extent ...

Elsevier BV, 2018
print (rdacarrier)

Identifiers

LIBRIS-ID: oai:DiVA.org:hig-28333
URI: https://urn.kb.se/resolve?urn=urn:nbn:se:hig:diva-28333
DOI: https://doi.org/10.1016/j.aei.2018.08.002
URI: https://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-373216

Supplementary language information

Language: English
Summary in: English

Included in sub-database

SwePub

Classification

Subject category: ref (swepub-contenttype)
Subject category: art (swepub-publicationtype)

Notes

Traffic signal control plays a pivotal role in reducing traffic congestion. Conventional methods cannot control traffic signals adequately because traffic environments are highly variable and complex. In recent years, reinforcement learning (RL) has shown great potential for traffic signal control because of its adaptability, flexibility, and scalability. However, designing RL-embedded traffic signal controllers (RLTSCs) for traffic systems with a high degree of realism faces several challenges; among them, system disturbances and large state-action spaces are considered in this research.

The contribution of the present work rests on three features: (a) evaluating the robustness of different RLTSCs against system disturbances including incidents, jaywalking, and sensor noise, (b) handling a high-dimensional state-action space both by employing different continuous-state RL algorithms and by reducing the state-action space, in order to improve the performance and learning speed of the system, and (c) presenting a detailed empirical study of traffic signal control in downtown Tehran through seven RL algorithms: discrete-state Q-learning(λ), SARSA(λ), actor-critic(λ), continuous-state Q-learning(λ), SARSA(λ), actor-critic(λ), and residual actor-critic(λ).

In this research, a real-world microscopic traffic simulation of downtown Tehran is first carried out; four experiments are then performed to find the best RLTSC with convincing robustness and strong performance. The results reveal that the RLTSC based on continuous-state actor-critic(λ) performs best. In addition, the best RLTSC reduces average travel time by 22% (in the presence of high system disturbances) compared with an optimized fixed-time controller.

Subject headings and genre terms

NATURAL SCIENCES: Computer and Information Sciences (hsv)
ENGINEERING AND TECHNOLOGY: Civil Engineering (hsv)
Reinforcement learning
System disturbances
Traffic signal control
Microscopic traffic simulation
Sustainable Urban Development

Added entries (persons, institutions, conferences, titles ...)

Seipel, Stefan; Uppsala universitet, Högskolan i Gävle, Datavetenskap, Faculty of Geodesy and Geomatics Engineering, K. N. Toosi University of Technology, Tehran, Iran, Geospatial Informationsvetenskap, Avdelningen för visuell information och interaktion, Bildanalys och människa-datorinteraktion (Swepub:uu) stefseip (author)
Mohammad Saadi, Mesgari; Division of Visual Information and Interaction, Department of Information Technology, Uppsala University, Uppsala, Sweden (author)
Wiering, Marco A.; Institute of Artificial Intelligence and Cognitive Engineering, University of Groningen, Groningen, Netherlands (author)
Högskolan i Gävle, Datavetenskap (creator_code:org_t)

Related titles

Part of: Advanced Engineering Informatics. Elsevier BV. 38, pp. 639-655. ISSN 1474-0346, 1873-5320



