RISS 검색 - 국내학술지논문 상세보기

다국어 입력

あぁかがさざただなはばぱまやゃらわゎんいぃきぎしじちぢにひびぴみりうぅくぐすずつづっぬふぶぷむゆゅるえぇけげせぜてでねへべぺめれおぉこごそぞとどのほぼぽもよょろを

アァカサザタダナハバパマヤャラワヮンイィキギシジチヂニヒビピミリウゥクグスズツヅッヌフブプムユュルエェケゲセゼテデヘベペメレオォコゴソゾトドノホボポモヨョロヲ ―

http://chineseinput.net/에서 pinyin(병음)방식으로 중국어를 변환할 수 있습니다.

변환된 중국어를 복사하여 사용하시면 됩니다.

예시)

中文 을 입력하시려면 zhongwen을 입력하시고 space를누르시면됩니다.
北京 을 입력하시려면 beijing을 입력하시고 space를 누르시면 됩니다.

ㅥ ㅦ ㅧ ㅨ ㅩ ㅪ ㅫ ㅬ ㅭ ㅮ ㅯ ㅰ ㅱ ㅲ ㅳ ㅴ ㅵ ㅶ ㅷ ㅸ ㅹ ㅺ ㅻ ㅼ ㅽ ㅾ ㅿ ㆀ ㆁ ㆂ ㆃ ㆄ ㆅ ㆆ ㆇ ㆈ ㆉ ㆊ ㆋ ㆌ ㆍ ㆎ

Α Β Γ Δ Ε Ζ Η Θ Ι Κ Λ Μ Ν Ξ Ο Π Ρ Σ Τ Υ Φ Χ Ψ Ω α β γ δ ε ζ η θ ι κ λ μ ν ξ ο π ρ σ τ υ φ χ ψ ω

á à Á À é è É È ç Ç ê

Ä Ö Ü ä ö ü ß

ְ ֳ ֲ ֱ ָ ַ ֵ ֶ ִ ֹ ּ ֻ ׂ ׁ ּ פ ם ן ו ט א ר ק ף ך ל ח י ע כ ג ד ש ץ ת צ מ נ ה ב

‘ ’ “ ” 〔〕〈〉「」『』【】＂（）［］｛｝

± × ÷ ≠ ≤ ≥ ∞ ∴ ♂ ♀ ∠ ⊥ ⌒ ∂ ∇ ≡ ≒ ≪ ≫ √ ∽ ∝ ∵ ∫ ∬ ∈ ∋ ⊆ ⊇ ⊂ ⊃ ∪ ∩ ∧ ∨ ￢ ⇒ ⇔ ∀ ∃ ∮ ∑ ∏ ＋－＜＝＞

、。 · ‥ … ¨ 〃 ― ∥ ＼ ∼ ´ ～ ˇ ˘ ˝ ˚ ˙ ¸ ˛ ¡ ¿ ː ！＇，．／：；？＾＿｀｜

½ ⅓ ⅔ ¼ ¾ ⅛ ⅜ ⅝ ⅞ ¹ ² ³ ⁴ ⁿ ₁ ₂ ₃ ₄

Æ Ð Ħ Ĳ Ł Ø Œ Þ Ŧ Ŋ æ đ ð ħ ı ĳ ĸ ŀ ł ø œ ß þ ŧ ŋ ŉ

А Б В Г Д Е Ё Ж З И Й К Л М Н О П Р С Т У Ф Х Ц Ч Ш Щ Ъ Ы Ь Э Ю Я а б в г д е ё ж з и й к л м н о п р с т у ф х ц ч ш щ ъ ы ь э ю я

′ ″ ℃ Å ￠￡￥ ¤ ℉ ‰ ＄％Ｆ￦㎕㎖㎗ ℓ ㎘㏄㎣㎤㎥㎦㎙㎚㎛㎜㎝㎞㎟㎠㎡㎢㏊㎍㎎㎏㏏㎈㎉㏈㎧㎨㎰㎱㎲㎳㎴㎵㎶㎷㎸㎹㎀㎁㎂㎃㎄㎺㎻㎽㎾㎿㎐㎑㎒㎓㎔ Ω ㏀㏁㎊㎋㎌㏖㏅㎭㎮㎯㏛㎩㎪㎫㎬㏝㏐㏓㏃㏉㏜㏆

§ ※ ☆ ★ ○ ● ◎ ◇ ◆ □ ■ △ ▽ → ← ↑ ↓ ↔ 〓 ◁ ◀ ▷ ▶ ♤ ♠ ♡ ♥ ♧ ♣ ⊙ ◈ ▣ ◐ ◑ ▒ ▤ ▥ ▨ ▧ ▦ ▩ ♨ ☏ ☎ ☜ ☞ ¶ † ‡ ↕ ↗ ↙ ↖ ↘ ♭ ♩ ♪ ♬ ㉿㈜ № ㏇ ™ ㏂㏘ ℡ ＃＆＊＠ ª º

ⅰ ⅱ ⅲ ⅳ ⅴ ⅵ ⅶ ⅷ ⅸ ⅹ Ⅰ Ⅱ Ⅲ Ⅳ Ⅴ Ⅵ Ⅶ Ⅷ Ⅸ Ⅹ

ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع غ ف ق ک ل م ن ه و ی

최근 검색 목록
전체삭제 닫기

RISS 인기검색어

Explicit Dynamic Coordination Reinforcement Learning Based on Utility

한글로보기

https://www.riss.kr/link?id=A108078308

저자

Huaiwei Si (Dalian University of Technology) ; Guozhen Tan (Dalian University of Technology) ; Yifu Yuan (Dalian University of Technology) ; Yanfei peng (Dalian University of Technology) ; Jianping Li (Dalian University of Technology)
발행기관
한국인터넷정보학회
학술지명
KSII Transactions on Internet and Information Systems(TIIS)
권호사항

Vol.16 No.3 [2022]
발행연도
2022
작성언어
English
주제어

Reinforcement Learning ; Multi-agent System ; Explicit Coordination Learning ; Utility Dependence ; Intelligent Vehicle
등재정보
KCI등재,SCIE,SCOPUS
자료형태
학술저널
발행기관 URL
http://www.itiis.org/
수록면

792-812(21쪽)
KCI 피인용횟수
0
DOI식별코드
http://dx.doi.org/10.3837/tiis.2022.03.003
제공처
ScienceON, KISS

0
상세조회
0
다운로드
0
내보내기

서지정보 열기

부가정보

다국어 초록 (Multilingual Abstract)

Multi-agent systems often need to achieve the goal of learning more effectively for a task through coordination. Although the introduction of deep learning has addressed the state space problems, multi-agent learning remains infeasible because of the ...

Multi-agent systems often need to achieve the goal of learning more effectively for a task through coordination. Although the introduction of deep learning has addressed the state space problems, multi-agent learning remains infeasible because of the joint action spaces. Large-scale joint action spaces can be sparse according to implicit or explicit coordination structure, which can ensure reasonable coordination action through the coordination structure. In general, the multi-agent system is dynamic, which makes the relations among agents and the coordination structure are dynamic. Therefore, the explicit coordination structure can better represent the coordinative relationship among agents and achieve better coordination between agents. Inspired by the maximization of social group utility, we dynamically construct a factor graph as an explicit coordination structure to express the coordinative relationship according to the utility among agents and estimate the joint action values based on the local utility transfer among factor graphs. We present the application of such techniques in the scenario of multiple intelligent vehicle systems, where state space and action space are a problem and have too many interactions among agents. The results on the multiple intelligent vehicle systems demonstrate the efficiency and effectiveness of our proposed methods.

더보기

참고문헌 (Reference)

1 Sunehag P, "Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward" 2085-2087, 2018

2 Sharma P K, "Survey of recent multi-agent reinforcement learning algorithms utilizing centralized training" III : 2021

3 Foerster J, "Stabilising experience replay for deep multi-agent reinforcement learning" 1146-1155, 2017

4 Claudine Badue, "Self-driving cars : A survey" 165 : 113816-, 2021

5 X. Li, "Reinforcement learning based overtaking decision making for highway autonomous driving" IEEE 336-342, 2015

6 Rashid, T., "QMIX : Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning" 4295-4304, 2018

7 D. Koller, "Probabilistic Graphical Models: Principles and Techniques" MIT Press 2009

8 Lin M, "Policy Gradient Adaptive Critic Designs for Model-Free Optimal Tracking Control With Experience Replay" 1-12, 2021

9 Mnih, V., "Playing atari with deep reinforcement learning"

10 Q. Wei, "Optimal elevator group control via deep asynchronous actor-critic learning" 31 (31): 5245-5256, 2020

1 Sunehag P, "Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward" 2085-2087, 2018

2 Sharma P K, "Survey of recent multi-agent reinforcement learning algorithms utilizing centralized training" III : 2021

3 Foerster J, "Stabilising experience replay for deep multi-agent reinforcement learning" 1146-1155, 2017

4 Claudine Badue, "Self-driving cars : A survey" 165 : 113816-, 2021

5 X. Li, "Reinforcement learning based overtaking decision making for highway autonomous driving" IEEE 336-342, 2015

6 Rashid, T., "QMIX : Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning" 4295-4304, 2018

7 D. Koller, "Probabilistic Graphical Models: Principles and Techniques" MIT Press 2009

8 Lin M, "Policy Gradient Adaptive Critic Designs for Model-Free Optimal Tracking Control With Experience Replay" 1-12, 2021

9 Mnih, V., "Playing atari with deep reinforcement learning"

10 Q. Wei, "Optimal elevator group control via deep asynchronous actor-critic learning" 31 (31): 5245-5256, 2020

11 R. Lowe, "Multiagent actor-critic for mixed cooperative-competitive environments" 6382-6393, 2017

12 Parunak, "Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence" MIT Press 377-421, 2000

13 Stone P, "Multiagent Systems : A Survey from a Machine Learning Perspective" 8 : 345-383, 2000

14 Kuyer, L, "Multiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs" Springer 2008

15 M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents" 330-337, 1993

16 Y. Yang, "Mean field multiagent reinforcement learning" 5571-5580, 2018

17 Littman, M. L, "Markov games as a framework for multi-agent reinforcement learning" Morgan Kauffman Publishers 157-163, 1994

18 M. L. Puterman, "Markov decision processes: discrete stochastic dynamic programming" John Wiley & Sons 2014

19 Jiang, "Learning attentional communication for multi-agent cooperation" 7265-7275, 2018

20 Z. Zhang, "Integrating independent and centralized multi-agent reinforcement learning for traffic signal network optimization" 2083-2085, 2020

21 Zhang Y, "Human-like Autonomous Vehicle Speed Control by Deep Reinforcement Learning with Double Q-Learning" IV : 1251-1256, 2018

22 Volodymyr, M., "Human-level control through deep reinforcement learning" 518 (518): 529-533, 2015

23 D. Huang, "Ensemble clustering using factor graph" 50 : 131-142, 2016

24 Zawadzki, E., "Empirically evaluating multiagent learning algorithms" 2014

25 P. Kravets, "Dynamic coordination of strategies for multi-agent systems" Springer 653-670, 2020

26 C. Yu, "Distributed multiagent coordinated learning for autonomous driving in highways based on dynamic coordination graphs" 21 (21): 735-748, 2020

27 K. Shah, "Distributed independent reinforcement learning (dirl) approach to resource management in wireless sensor networks" 2007

28 T. Hester, "Deep q-learning from demonstrations" 32 : 2018

29 W. B¨ohmer, "Deep coordination graphs" 980-991, 2020

30 Farinelli A, "Decentralised coordination of low-power embedded devices using the max-sum algorithm" International Foundation for Autonomous Agents and Multiagent Systems 2008

31 Foerster J, "Counterfactual Multi-Agent Policy Gradients"

32 C. Guestrin, "Coordinated reinforcement learning" 2 : 227-234, 2002

33 Gupta, J. K., "Cooperative Multi-agent Control Using Deep Reinforcement Learning" Springer 2017

34 Kok J R, "Collaborative multiagent reinforcement learning by payoffpropagation" 7 : 1789-1828, 2006

35 R. Dechter, "Bucket elimination : A unifying framework for reasoning" 113 (113): 41-85, 1999

36 N. A. Khalid, "An adaptive agent-based partner selection for routing packet in distributed wireless sensor network" 2016

37 W. Du, "A survey on multi-agent deep reinforcement learning : from the perspective of challenges and applications" 54 : 3215-3238, 2021

38 Grigorescu S, "A survey of deep learning techniques for autonomous driving" 37 (37): 362-386, 2020

39 H. Liu, "A new hybrid ensemble deep reinforcement learning model for wind speed short term forecasting" 202 : 117794-, 2020

40 D. Ye, "A multi-agent framework for packet routing in wireless sensor networks" 15 (15): 10026-10047, 2015

41 Smirnov N, "A game theory-based approach for modeling autonomous vehicle behavior in congested, urban lane-changing scenarios" 21 (21): 1523-, 2021

42 Meiyu Liu, "A cellular automata traffic flow model combined with a bp neural network based microscopic lane changing decision model" 23 (23): 309-318, 2019

43 Zeyu Zhu, "A Survey of Deep RL and IL for Autonomous Driving Policy Learning"

44 Zhu, Z., "A Survey of Deep RL and IL for Autonomous Driving Policy Learning" 2021

동일학술지(권/호) 다른 논문

Adaptive low-resolution palmprint image recognition based on channel attention mechanism and modified deep residual network
- 한국인터넷정보학회
- Xuebin Xu
- 2022
- KCI등재,SCIE,SCOPUS
A Protein-Protein Interaction Extraction Approach Based on Large Pre-trained Language Model and Adversarial Training
- 한국인터넷정보학회
- Zhan Tang
- 2022
- KCI등재,SCIE,SCOPUS
Multi-view Clustering by Spectral Structure Fusion and Novel Low-rank Approximation
- 한국인터넷정보학회
- Yin Lon
- 2022
- KCI등재,SCIE,SCOPUS
Sentiment Analysis of Product Reviews to Identify Deceptive Rating Information in Social Media: A SentiDeceptive Approach
- 한국인터넷정보학회
- M. Irfan Marwat
- 2022
- KCI등재,SCIE,SCOPUS

동일학술지 더보기

더보기

분석정보

View

상세정보조회

0

Usage

원문다운로드

0

대출신청

0

복사신청

0

EDDS신청

0

동일 주제 내 활용도 TOP

주제

연도별 연구동향

연도별 활용동향

연관논문

연구자 네트워크맵

공동연구자 (7)

더보기

유사연구자 (20) 활용도상위20명

더보기

인용정보 인용지수 설명보기

학술지 이력

학술지 이력
연월일	이력구분	이력상세	등재구분
	학술지등록	한글명 : KSII Transactions on Internet and Information Systems 외국어명 : KSII Transactions on Internet and Information Systems
2023	평가예정	해외DB학술지평가 신청대상 (해외등재 학술지 평가)
2020-01-01	평가	등재학술지 유지 (해외등재 학술지 평가)
2013-10-01	평가	등재학술지 선정 (기타)
2011-01-01	평가	등재후보학술지 유지 (기타)
2009-01-01	평가	SCOPUS 등재 (신규평가)

학술지 인용정보

학술지 인용정보
기준연도	WOS-KCI 통합IF(2년)	KCIF(2년)	KCIF(3년)
2016	0.45	0.21	0.37
KCIF(4년)	KCIF(5년)	중심성지수(3년)	즉시성지수
0.32	0.29	0.244	0.03

연관 공개강의(KOCW)

이 자료와 함께 이용한 RISS 자료

나만을 위한 추천자료

서지정보
부가정보
동일학술지(권/호) 다른 논문
분석정보
인용정보
연관 공개강의(KOCW)

해외이동버튼