SwePub
Multi-Agent Reinforcement Learning in Dynamic Industrial Context

Zhang, Hongyi (author)
Chalmers University of Technology, Gothenburg, Sweden
Li, Jingya (author)
Ericsson, Ericsson Research
Qi, Zhiqiang (author)
Ericsson, Ericsson Research
Aronsson, Anders (author)
Ericsson, Ericsson Research
Bosch, Jan (author)
Chalmers University of Technology, Gothenburg, Sweden
Olsson, Helena Holmström (author)
Malmö University, Department of Computer Science and Media Technology (DVMT)
Institute of Electrical and Electronics Engineers (IEEE), 2023
English.
In: 2023 IEEE 47th Annual Computers, Software, and Applications Conference (COMPSAC). Institute of Electrical and Electronics Engineers (IEEE). ISBN 9798350326970, 9798350326987
  • Conference paper (peer-reviewed)
Abstract
  • Deep reinforcement learning has advanced significantly in recent years and is now used in embedded systems as well as in simulators and games. Reinforcement learning (RL) algorithms are being used to improve device operation so that devices can learn on their own and offer clients better services, and RL has recently been studied in a variety of industrial applications. However, RL has been shown to be unstable and unable to adapt to realistic situations in real-world settings, especially when it controls a large number of agents in an industrial environment. To address this problem, this study aims to enable multiple RL agents to learn control policies independently in dynamic industrial contexts. We propose a dynamic multi-agent reinforcement learning (dynamic multi-RL) method, together with adaptive exploration (AE) and vector-based action selection (VAS) techniques, to accelerate model convergence and adapt to a complex industrial environment. The proposed algorithm is validated in an emergency scenario from the telecommunications industry, in which three unmanned aerial vehicle base stations (UAV-BSs) provide temporary coverage to mission-critical (MC) customers in disaster zones when the original serving base station (BS) has been destroyed by a natural disaster. The algorithm automatically directs the participating agents to enhance service quality. Our findings demonstrate that the proposed dynamic multi-RL algorithm can proficiently manage the learning of multiple agents and adjust to dynamic industrial environments. It also increases learning speed and improves quality of service.
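
The abstract does not specify how AE or VAS work, so the following is only a generic illustration of the setting it describes: several agents that each learn their own policy while sharing one environment. It is a minimal sketch of independent tabular Q-learning with a reward-driven exploration rate, not the authors' dynamic multi-RL algorithm. The one-dimensional grid, the coverage reward, the `adapt_epsilon` rule, and every name and constant are assumptions made for illustration.

```python
import random

N_AGENTS = 3          # the abstract mentions three UAV-BSs
N_POSITIONS = 5       # assumption: discrete positions on a line
ACTIONS = (-1, 0, 1)  # assumption: move left, stay, move right


def adapt_epsilon(eps, reward, avg_reward, lo=0.05, hi=1.0, rate=0.9):
    """Hypothetical rule: explore less while reward is at or above its
    running average, explore more otherwise. Not the paper's AE technique."""
    eps = eps * rate if reward >= avg_reward else eps / rate
    return min(hi, max(lo, eps))


def coverage_reward(positions, users):
    """Toy shared reward: number of users that have an agent at their position."""
    return sum(1 for u in users if u in positions)


def train(episodes=200, steps=20, alpha=0.1, gamma=0.9, seed=0):
    rng = random.Random(seed)
    # One independent Q-table per agent: q[agent][position][action_index].
    q = [[[0.0] * len(ACTIONS) for _ in range(N_POSITIONS)]
         for _ in range(N_AGENTS)]
    eps = [1.0] * N_AGENTS
    avg = 0.0  # running average of the shared reward
    for _ in range(episodes):
        # Users are re-placed every episode, so the environment keeps changing.
        users = [rng.randrange(N_POSITIONS) for _ in range(4)]
        pos = [rng.randrange(N_POSITIONS) for _ in range(N_AGENTS)]
        for _ in range(steps):
            acts = []
            for i in range(N_AGENTS):
                if rng.random() < eps[i]:          # explore
                    acts.append(rng.randrange(len(ACTIONS)))
                else:                              # exploit own Q-table
                    row = q[i][pos[i]]
                    acts.append(row.index(max(row)))
            new_pos = [min(N_POSITIONS - 1, max(0, p + ACTIONS[a]))
                       for p, a in zip(pos, acts)]
            r = coverage_reward(new_pos, users)
            for i in range(N_AGENTS):
                # Standard Q-learning update, applied per agent.
                td = r + gamma * max(q[i][new_pos[i]]) - q[i][pos[i]][acts[i]]
                q[i][pos[i]][acts[i]] += alpha * td
                eps[i] = adapt_epsilon(eps[i], r, avg)
            avg += 0.05 * (r - avg)
            pos = new_pos
    return q, eps
```

Calling `train()` returns the per-agent Q-tables and the final exploration rates. Each agent observes only its own position here, which keeps the sketch short but is far simpler than a realistic UAV-BS coverage problem.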

Subject headings

NATURAL SCIENCES -- Computer and Information Sciences -- Computer Sciences (hsv//eng)

Publication and content type

ref (subject category)
kon (subject category)
