RISS 학술연구정보서비스

검색
다국어 입력

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)
  • 中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
  • 北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.
닫기
    인기검색어 순위 펼치기

    RISS 인기검색어

      KCI등재 SCIE SCOPUS

      Explicit Dynamic Coordination Reinforcement Learning Based on Utility

      한글로보기

      https://www.riss.kr/link?id=A108078308

      • 0

        상세조회
      • 0

        다운로드
      서지정보 열기
      • 내보내기
      • 내책장담기
      • 공유하기
      • 오류접수

      부가정보

      다국어 초록 (Multilingual Abstract)

      Multi-agent systems often need to achieve the goal of learning more effectively for a task through coordination. Although the introduction of deep learning has addressed the state space problems, multi-agent learning remains infeasible because of the ...

      Multi-agent systems often need to achieve the goal of learning more effectively for a task through coordination. Although the introduction of deep learning has addressed the state space problems, multi-agent learning remains infeasible because of the joint action spaces. Large-scale joint action spaces can be sparse according to implicit or explicit coordination structure, which can ensure reasonable coordination action through the coordination structure. In general, the multi-agent system is dynamic, which makes the relations among agents and the coordination structure are dynamic. Therefore, the explicit coordination structure can better represent the coordinative relationship among agents and achieve better coordination between agents. Inspired by the maximization of social group utility, we dynamically construct a factor graph as an explicit coordination structure to express the coordinative relationship according to the utility among agents and estimate the joint action values based on the local utility transfer among factor graphs. We present the application of such techniques in the scenario of multiple intelligent vehicle systems, where state space and action space are a problem and have too many interactions among agents. The results on the multiple intelligent vehicle systems demonstrate the efficiency and effectiveness of our proposed methods.

      더보기

      참고문헌 (Reference)

      1 Sunehag P, "Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward" 2085-2087, 2018

      2 Sharma P K, "Survey of recent multi-agent reinforcement learning algorithms utilizing centralized training" III : 2021

      3 Foerster J, "Stabilising experience replay for deep multi-agent reinforcement learning" 1146-1155, 2017

      4 Claudine Badue, "Self-driving cars : A survey" 165 : 113816-, 2021

      5 X. Li, "Reinforcement learning based overtaking decision making for highway autonomous driving" IEEE 336-342, 2015

      6 Rashid, T., "QMIX : Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning" 4295-4304, 2018

      7 D. Koller, "Probabilistic Graphical Models: Principles and Techniques" MIT Press 2009

      8 Lin M, "Policy Gradient Adaptive Critic Designs for Model-Free Optimal Tracking Control With Experience Replay" 1-12, 2021

      9 Mnih, V., "Playing atari with deep reinforcement learning"

      10 Q. Wei, "Optimal elevator group control via deep asynchronous actor-critic learning" 31 (31): 5245-5256, 2020

      1 Sunehag P, "Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward" 2085-2087, 2018

      2 Sharma P K, "Survey of recent multi-agent reinforcement learning algorithms utilizing centralized training" III : 2021

      3 Foerster J, "Stabilising experience replay for deep multi-agent reinforcement learning" 1146-1155, 2017

      4 Claudine Badue, "Self-driving cars : A survey" 165 : 113816-, 2021

      5 X. Li, "Reinforcement learning based overtaking decision making for highway autonomous driving" IEEE 336-342, 2015

      6 Rashid, T., "QMIX : Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning" 4295-4304, 2018

      7 D. Koller, "Probabilistic Graphical Models: Principles and Techniques" MIT Press 2009

      8 Lin M, "Policy Gradient Adaptive Critic Designs for Model-Free Optimal Tracking Control With Experience Replay" 1-12, 2021

      9 Mnih, V., "Playing atari with deep reinforcement learning"

      10 Q. Wei, "Optimal elevator group control via deep asynchronous actor-critic learning" 31 (31): 5245-5256, 2020

      11 R. Lowe, "Multiagent actor-critic for mixed cooperative-competitive environments" 6382-6393, 2017

      12 Parunak, "Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence" MIT Press 377-421, 2000

      13 Stone P, "Multiagent Systems : A Survey from a Machine Learning Perspective" 8 : 345-383, 2000

      14 Kuyer, L, "Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs" Springer 2008

      15 M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents" 330-337, 1993

      16 Y. Yang, "Mean field multiagent reinforcement learning" 5571-5580, 2018

      17 Littman, M. L, "Markov games as a framework for multi-agent reinforcement learning" Morgan Kauffman Publishers 157-163, 1994

      18 M. L. Puterman, "Markov decision processes: discrete stochastic dynamic programming" John Wiley & Sons 2014

      19 Jiang, "Learning attentional communication for multi-agent cooperation" 7265-7275, 2018

      20 Z. Zhang, "Integrating independent and centralized multi-agent reinforcement learning for traffic signal network optimization" 2083-2085, 2020

      21 Zhang Y, "Human-like Autonomous Vehicle Speed Control by Deep Reinforcement Learning with Double Q-Learning" IV : 1251-1256, 2018

      22 Volodymyr, M., "Human-level control through deep reinforcement learning" 518 (518): 529-533, 2015

      23 D. Huang, "Ensemble clustering using factor graph" 50 : 131-142, 2016

      24 Zawadzki, E., "Empirically evaluating multiagent learning algorithms" 2014

      25 P. Kravets, "Dynamic coordination of strategies for multi-agent systems" Springer 653-670, 2020

      26 C. Yu, "Distributed multiagent coordinated learning for autonomous driving in highways based on dynamic coordination graphs" 21 (21): 735-748, 2020

      27 K. Shah, "Distributed independent reinforcement learning (dirl) approach to resource management in wireless sensor networks" 2007

      28 T. Hester, "Deep q-learning from demonstrations" 32 : 2018

      29 W. B¨ohmer, "Deep coordination graphs" 980-991, 2020

      30 Farinelli A, "Decentralised coordination of low-power embedded devices using the max-sum algorithm" International Foundation for Autonomous Agents and Multiagent Systems 2008

      31 Foerster J, "Counterfactual Multi-Agent Policy Gradients"

      32 C. Guestrin, "Coordinated reinforcement learning" 2 : 227-234, 2002

      33 Gupta, J. K., "Cooperative Multi-agent Control Using Deep Reinforcement Learning" Springer 2017

      34 Kok J R, "Collaborative multiagent reinforcement learning by payoffpropagation" 7 : 1789-1828, 2006

      35 R. Dechter, "Bucket elimination : A unifying framework for reasoning" 113 (113): 41-85, 1999

      36 N. A. Khalid, "An adaptive agent-based partner selection for routing packet in distributed wireless sensor network" 2016

      37 W. Du, "A survey on multi-agent deep reinforcement learning : from the perspective of challenges and applications" 54 : 3215-3238, 2021

      38 Grigorescu S, "A survey of deep learning techniques for autonomous driving" 37 (37): 362-386, 2020

      39 H. Liu, "A new hybrid ensemble deep reinforcement learning model for wind speed short term forecasting" 202 : 117794-, 2020

      40 D. Ye, "A multi-agent framework for packet routing in wireless sensor networks" 15 (15): 10026-10047, 2015

      41 Smirnov N, "A game theory-based approach for modeling autonomous vehicle behavior in congested, urban lane-changing scenarios" 21 (21): 1523-, 2021

      42 Meiyu Liu, "A cellular automata traffic flow model combined with a bp neural network based microscopic lane changing decision model" 23 (23): 309-318, 2019

      43 Zeyu Zhu, "A Survey of Deep RL and IL for Autonomous Driving Policy Learning"

      44 Zhu, Z., "A Survey of Deep RL and IL for Autonomous Driving Policy Learning" 2021

      더보기

      분석정보

      View

      상세정보조회

      0

      Usage

      원문다운로드

      0

      대출신청

      0

      복사신청

      0

      EDDS신청

      0

      동일 주제 내 활용도 TOP

      더보기

      주제

      연도별 연구동향

      연도별 활용동향

      연관논문

      연구자 네트워크맵

      공동연구자 (7)

      유사연구자 (20) 활용도상위20명

      인용정보 인용지수 설명보기

      학술지 이력

      학술지 이력
      연월일 이력구분 이력상세 등재구분
      학술지등록 한글명 : KSII Transactions on Internet and Information Systems
      외국어명 : KSII Transactions on Internet and Information Systems
      2023 평가예정 해외DB학술지평가 신청대상 (해외등재 학술지 평가)
      2020-01-01 평가 등재학술지 유지 (해외등재 학술지 평가) KCI등재
      2013-10-01 평가 등재학술지 선정 (기타) KCI등재
      2011-01-01 평가 등재후보학술지 유지 (기타) KCI등재후보
      2009-01-01 평가 SCOPUS 등재 (신규평가) KCI등재후보
      더보기

      학술지 인용정보

      학술지 인용정보
      기준연도 WOS-KCI 통합IF(2년) KCIF(2년) KCIF(3년)
      2016 0.45 0.21 0.37
      KCIF(4년) KCIF(5년) 중심성지수(3년) 즉시성지수
      0.32 0.29 0.244 0.03
      더보기

      이 자료와 함께 이용한 RISS 자료

      나만을 위한 추천자료

      해외이동버튼