SwePub
Sök i LIBRIS databas

  Extended search

onr:"swepub:oai:DiVA.org:hig-27845"
 

Search: onr:"swepub:oai:DiVA.org:hig-27845" > Continuous residual...

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist
  • Aslani, MohammadHögskolan i Gävle,Datavetenskap (author)

Continuous residual reinforcement learning for traffic signal control optimization

  • Article/chapterEnglish2018

Publisher, publication year, extent ...

  • NRC Research Press,2018
  • printrdacarrier

Numbers

  • LIBRIS-ID:oai:DiVA.org:hig-27845
  • https://urn.kb.se/resolve?urn=urn:nbn:se:hig:diva-27845URI
  • https://doi.org/10.1139/cjce-2017-0408DOI
  • https://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-364927URI

Supplementary language notes

  • Language:English
  • Summary in:English

Part of subdatabase

Classification

  • Subject category:ref swepub-contenttype
  • Subject category:art swepub-publicationtype

Notes

  • Traffic signal control can be naturally regarded as a reinforcement learning problem. Unfortunately, it is one of the most difficult classes of reinforcement learning problems owing to its large state space. A straightforward approach to address this challenge is to control traffic signals based on continuous reinforcement learning. Although they have been successful in traffic signal control, they may become unstable and fail to converge to near-optimal solutions. We develop adaptive traffic signal controllers based on continuous residual reinforcement learning (CRL-TSC) that is more stable. The effect of three feature functions is empirically investigated in a microscopic traffic simulation. Furthermore, the effects of departing streets, more actions, and the use of the spatial distribution of the vehicles on the performance of CRL-TSCs are assessed. The results show that the best setup of the CRL-TSC leads to saving average travel time by 15% in comparison to an optimized fixed-time controller.

Subject headings and genre

Added entries (persons, corporate bodies, meetings, titles ...)

  • Seipel, StefanUppsala universitet,Högskolan i Gävle,Datavetenskap,Division of Visual Information and Interaction, Department of Information Technology, Uppsala University, Uppsala, Sweden,Avdelningen för visuell information och interaktion,Bildanalys och människa-datorinteraktion(Swepub:uu)stefseip (author)
  • Wiering, MarcoInstitute of Artificial Intelligence and Cognitive Engineering, University of Groningen, Groningen, the Netherlands (author)
  • Högskolan i GävleDatavetenskap (creator_code:org_t)

Related titles

  • In:Canadian journal of civil engineering (Print): NRC Research Press45:8, s. 690-7020315-14681208-6029

Internet link

Find in a library

To the university's database

  • 1 of 1
  • Previous record
  • Next record
  •    To hitlist

Find more in SwePub

By the author/editor
Aslani, Mohammad
Seipel, Stefan
Wiering, Marco
About the subject
ENGINEERING AND TECHNOLOGY
ENGINEERING AND ...
and Other Engineerin ...
ENGINEERING AND TECHNOLOGY
ENGINEERING AND ...
and Electrical Engin ...
and Control Engineer ...
Articles in the publication
Canadian journal ...
By the university
University of Gävle
Uppsala University

Search outside SwePub

Kungliga biblioteket hanterar dina personuppgifter i enlighet med EU:s dataskyddsförordning (2018), GDPR. Läs mer om hur det funkar här.
Så här hanterar KB dina uppgifter vid användning av denna tjänst.

 
pil uppåt Close

Copy and save the link in order to return to this view