Tata Institute of Fundamental Research

Speaker:	Siddharth Chandak (Stanford University)
Organiser:	Hariharan Narayanan
Date:	Wednesday, 17 Sep 2025, 16:00 to 17:00
Venue:	A-201 (STCS Seminar Room)

Abstract:

Large-scale multi-agent systems are often modeled as games, where each player's reward depends on the joint actions of all agents. In strongly monotone games, players converge to a Nash equilibrium (NE) by optimizing their local objectives, but such equilibria may not align with the global objective. We study two scenarios where a game manager, with access only to the global objective and limited control over utility parameters, seeks to steer the system toward better equilibria.

In the first scenario, the game manager adjusts linear coefficients in the players' utilities to impose linear constraints on the equilibrium. We design a simple two-time-scale stochastic approximation algorithm and show that it converges almost surely, with a mean square error rate of nearly $O(t^{-1/4})$.

In the second scenario, the game manager has to choose among $K$ discrete parameters. We propose a novel optimism-based bandit algorithm with additional terms to account for the distance from equilibrium, and prove that this algorithm achieves regret of $O(\log T)$.
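
As a rough illustration of the two-time-scale idea in the first scenario, the following is a minimal sketch on a toy problem; it is not the algorithm presented in the talk. Everything specific here is an assumption made for illustration: a two-player quadratic game with utilities u_i(x) = -x_i^2/2 - c x_i x_j + theta x_i, a single shared linear coefficient theta controlled by the manager, the constraint x_1 + x_2 = 1, a manager that observes the constraint violation directly, Gaussian gradient noise, and the step-size exponents 0.6 and 0.9. The function name steer_to_constraint and its parameters are hypothetical.

```python
import numpy as np


def steer_to_constraint(num_steps=20000, coupling=0.5, target_sum=1.0,
                        noise_std=0.1, seed=0):
    """Toy two-time-scale stochastic approximation (illustrative assumptions only).

    Fast time scale: two players run noisy gradient ascent on
        u_i(x) = -x_i^2 / 2 - coupling * x_i * x_j + theta * x_i.
    Slow time scale: the manager nudges the shared linear coefficient theta
    so that the equilibrium satisfies x_1 + x_2 = target_sum.
    """
    rng = np.random.default_rng(seed)
    x = np.zeros(2)   # players' actions
    theta = 0.0       # manager-controlled linear coefficient (assumed shared)
    for t in range(num_steps):
        alpha = 0.5 / (t + 1) ** 0.6  # fast step size (players)
        beta = 0.5 / (t + 1) ** 0.9   # slow step size (manager); beta / alpha -> 0
        # Each player's gradient of its own utility with respect to its own action.
        grad = -x - coupling * x[::-1] + theta
        x = x + alpha * (grad + noise_std * rng.standard_normal(2))
        # The manager moves theta against the observed constraint violation.
        theta = theta + beta * (target_sum - x.sum())
    return x, theta


x, theta = steer_to_constraint()
print("actions:", x, "constraint residual:", x.sum() - 1.0, "theta:", theta)
```

In this toy game, a fixed theta gives the symmetric equilibrium x_i = theta / (1 + c), so with c = 0.5 the constraint x_1 + x_2 = 1 holds at theta = 0.75, which gives a value to compare the slow iterate against. The players' step size decays more slowly than the manager's, so the players track the equilibrium of the current theta while theta drifts toward the value that satisfies the constraint.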

Short Bio:
Siddharth Chandak is currently a Ph.D. candidate in Electrical Engineering at Stanford University, USA. He received his B.Tech. from IIT Bombay, India, in 2021, where he was awarded the President of India Gold Medal, and his M.S. from Stanford University in 2023. His research interests include game theory, multi-agent learning, stochastic approximation, and its applications in reinforcement learning.

Learning to Control Unknown Multi-Agent Systems

STCS Seminar