Blocker 128
Abstract: Multi-agent learning and decision-making in networked and dynamic environments are at the forefront of challenges that artificial intelligence faces. That is, how can a team of learners coordinate to do better when there are limitations and hurdles to their interactions in the form of privacy, communication, or security? How can agents learn, adapt, and decide in the presence of other agents who strive to make decisions to further their own objectives? This workshop on “Multi-Agent Learning in Dynamic Environments” aims to provide a landscape of recent research activities on multi-agent learning in dynamic environments, and foster potential collaborations among the researchers broadly working in the fields of reinforcement learning, game theory, optimization and control, and those that are interested in engineering and science applications of such methods. This workshop will consist of seminar-style talks followed by a discussion, and related short presentations.
Santiago Paternain
Assistant Professor, Department of Electrical, Computer and Systems Engineering
Rensselaer Polytechnic Institute
Anna Winnicki
PhD Candidate, Department of Electrical and Computer Engineering
University of Illinois Urbana-Champaign
Dongsheng Ding
Postdoctoral Researcher, Department of Electrical and Systems Engineering
University of Pennsylvania
Time | Presenter | Topic |
---|---|---|
8:50 AM | Ceyhun Eksin | Welcome and Introductions |
9:00 AM–9:50 AM | Santiago Paternain | Safe Learning for Dynamical Systems and Control |
9:50 AM–10:00 AM | Break | |
10:00 AM–10:30 AM | Research Highlights | |
Soham Das, Ceyhun Eksin | Learning Nash in Constrained Markov GAmes with an α-Potential | |
Steve Suh, Bin Wu | Trust-Driven Collaboration: Mechanisms to Facilitate Multi-Robot Cooperation in Constrained Environments | |
Ujwal Dinesha, Srinivas Shakkotai | A Multi-Agent View of Wireless Video Streaming with Delayed Client-Feedback | |
10:30 AM–11:20 AM | Anna Winnicki | The Role of Lookahead in Reinforcement Learning Algorithms |
11:20 AM–11:30 AM | Break | |
11:30 AM–12:00 PM | Research Highlights | |
Khaled Nakhleh, Ceyhun Eksin | Simulation-Based Optimistic Policy Iteration in Multi-Agent Games with Kullback-Leibler Control Cost | |
Jiachen Xi, Alfredo Garcia, Petar Momcilovic | Multi-Agent Reinforcement Learning for Multi-Area Power Exchange | |
Austin Carroll, Bharath Sivaram, Swaminathan Goppalswamy, Lucas Krakow | Cooperative Target Tracking: What did you see? | |
12:00 PM–2:00 PM | Lunch | |
2:00 PM–2:50 PM | Dongsheng Ding | Multi-Agent Reinforcement Learning for Large-Scale Markov Potential Games |
2:50 PM–3:00 PM | Break | |
3:00 PM–3:30 PM | Research Highlights | |
Ruida Zhou, Chao Tian | Provable Policy Gradient Methods for Average-Reward Markov Potential Games | |
Zhide Wang, Yanling Chang | Structural Estimation of Partially Observable Dynamic Games: an Application of Retail Investment |