SwePub

  • Aslani, Mohammad; Högskolan i Gävle, Computer Science, Geospatial Information Science (author)

Developing adaptive traffic signal control by actor-critic and direct exploration methods

  • Article/chapter, English, 2019

Publisher, year of publication, extent ...

  • Thomas Telford, 2019
  • print (rdacarrier)

Identifiers

  • LIBRIS-ID: oai:DiVA.org:hig-28332
  • URI: https://urn.kb.se/resolve?urn=urn:nbn:se:hig:diva-28332
  • DOI: https://doi.org/10.1680/jtran.17.00085
  • URI: https://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-373351

Additional language information

  • Language: English
  • Summary in: English

Classification

  • Subject category: ref swepub-contenttype
  • Subject category: art swepub-publicationtype

Notes

  • Designing efficient traffic signal controllers has always been an important concern in traffic engineering, owing to the complex and uncertain nature of traffic environments. Within this context, reinforcement learning has been one of the most successful methods because of its adaptability and its online learning ability. Reinforcement learning gives traffic signals the ability to determine automatically the ideal behaviour for achieving their objective (alleviating traffic congestion). In fact, traffic signals based on reinforcement learning can learn and react flexibly to different traffic situations without the need for a predefined model of the environment. In this research, the actor-critic method is used for adaptive traffic signal control (ATSC-AC). Actor-critic combines the advantages of actor-only and critic-only methods. One of the most important issues in reinforcement learning is the trade-off between exploration of the traffic environment and exploitation of the knowledge already obtained. To tackle this challenge, two direct exploration methods are adapted to traffic signal control and compared with two indirect exploration methods. The results reveal that ATSC-ACs based on direct exploration methods perform best and consistently outperform a fixed-time controller, reducing average travel time by 21%.

Added entries (persons, institutions, conferences, titles ...)

  • Mesgari, Mohammad Saadi; Division of Visual Information and Interaction, Department of Information Technology, Uppsala University, Uppsala, Sweden (author)
  • Seipel, Stefan; Uppsala universitet, Högskolan i Gävle, Computer Science, Faculty of Geodesy and Geomatics Engineering, K. N. Toosi University of Technology, Tehran, Iran, Division of Visual Information and Interaction, Image Analysis and Human-Computer Interaction, (Swepub:uu)stefseip (author)
  • Wiering, Marco; Institute of Artificial Intelligence and Cognitive Engineering, University of Groningen, Groningen, Netherlands (author)
  • Högskolan i Gävle, Computer Science (creator_code:org_t)

Related titles

  • Part of: Proceedings of the Institution of Civil Engineers: Thomas Telford, 172:5, pp. 289-298, ISSN 0965-092X, 1751-7710
