MLSP-25: Reinforcement Learning 1 |
Session Type: Poster |
Time: Thursday, 10 June, 13:00 - 13:45 |
Location: Gather.Town |
Virtual Session: View on Virtual Platform |
Session Chair: Chang Yoo, Korea Advanced Institute of Science and Technology |
MLSP-25.1: COOPERATIVE SCENARIOS FOR MULTI-AGENT REINFORCEMENT LEARNING IN WIRELESS EDGE CACHING |
Navneet Garg; University of Edinburgh |
Tharmalingam Ratnarajah; University of Edinburgh |
MLSP-25.2: ROBUST DEEP REINFORCEMENT LEARNING FOR UNDERWATER NAVIGATION WITH UNKNOWN DISTURBANCES |
Juan Parras; Universidad Politécnica de Madrid |
Santiago Zazo; Universidad Politécnica de Madrid |
MLSP-25.3: ONLINE HYPER-PARAMETER TUNING FOR THE CONTEXTUAL BANDIT |
Djallel Bouneffouf; IBM Research |
Emmanuelle Claeys; Strasbourg University |
MLSP-25.4: DOUBLE-LINEAR THOMPSON SAMPLING FOR CONTEXT-ATTENTIVE BANDITS |
Djallel Bouneffouf; IBM Research |
Raphael Feraud; Orange |
Sohini Upadhyay; IBM Research |
Yasaman Khazaeni; Universite de montreal |
Irina Rish; Universite de montreal |
MLSP-25.5: ON THE MARGINAL BENEFIT OF ACTIVE LEARNING: DOES SELF-SUPERVISION EAT ITS CAKE? |
Yao-Chun Chan; University of California, Riverside |
Mingchen Li; University of California, Riverside |
Samet Oymak; University of California, Riverside |
MLSP-25.6: ROBUST MAML: PRIORITIZATION TASK BUFFER WITH ADAPTIVE LEARNING PROCESS FOR MODEL-AGNOSTIC META-LEARNING |
Thanh Nguyen; Korea Advanced Institute of Science and Technology (KAIST) |
Tung Luu; Korea Advanced Institute of Science and Technology (KAIST) |
Trung Pham; Korea Advanced Institute of Science and Technology (KAIST) |
Sanzhar Rakhimkul; Korea Advanced Institute of Science and Technology (KAIST) |
Chang Dong Yoo; Korea Advanced Institute of Science and Technology (KAIST) |