2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

MLSP-27: Reinforcement Learning 3

Session Type: Poster
Time: Thursday, 10 June, 13:00 - 13:45
Location: Gather.Town
Session Chair: Seung-Jun Kim, University of Maryland, Baltimore County
 
MLSP-27.1: GAUSSIAN PROCESS TEMPORAL-DIFFERENCE LEARNING WITH SCALABILITY AND WORST-CASE PERFORMANCE GUARANTEES
         Qin Lu; University of Minnesota
         Georgios B. Giannakis; University of Minnesota
 
MLSP-27.2: SELF-INFERENCE OF OTHERS' POLICIES FOR HOMOGENEOUS AGENTS IN COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING
         Qifeng Lin; Sun Yat-sen University
         Qing Ling; Sun Yat-sen University
 
MLSP-27.3: SEMI-SUPERVISED BATCH ACTIVE LEARNING VIA BILEVEL OPTIMIZATION
         Zalán Borsos; ETH Zurich
         Marco Tagliasacchi; Google
         Andreas Krause; ETH Zurich
 
MLSP-27.4: KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING
         Rami Mowakeaa; University of Maryland, Baltimore County
         Seung-Jun Kim; University of Maryland, Baltimore County
         Darren Emge; Combat Capabilities Development Command
 
MLSP-27.5: POLICY AUGMENTATION: AN EXPLORATION STRATEGY FOR FASTER CONVERGENCE OF DEEP REINFORCEMENT LEARNING ALGORITHMS
         Arash Mahyari; Florida Institute For Human and Machine Cognition (IHMC)
 
MLSP-27.6: GRAPHCOMM: A GRAPH NEURAL NETWORK BASED METHOD FOR MULTI-AGENT REINFORCEMENT LEARNING
         Siqi Shen; Xiamen University
         Yongquan Fu; National University of Defense Technology
         Huayou Su; National University of Defense Technology
         Hengyue Pan; National University of Defense Technology
         Qiao Peng; National University of Defense Technology
         Yong Dou; National University of Defense Technology
         Cheng Wang; Xiamen University