BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//IORA - Institute of Operations Research and Analytics - ECPv6.15.11//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:IORA - Institute of Operations Research and Analytics
X-ORIGINAL-URL:https://iora.nus.edu.sg
X-WR-CALDESC:Events for IORA - Institute of Operations Research and Analytics
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:Asia/Singapore
BEGIN:STANDARD
TZOFFSETFROM:+0800
TZOFFSETTO:+0800
TZNAME:+08
DTSTART:20240101T000000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=Asia/Singapore:20250423T100000
DTEND;TZID=Asia/Singapore:20250423T113000
DTSTAMP:20260421T105154
CREATED:20250413T134048Z
LAST-MODIFIED:20250413T134048Z
UID:26138-1745402400-1745407800@iora.nus.edu.sg
SUMMARY:DAO-ISEM-IORA Seminar Series: Bernhard von Stengel
DESCRIPTION:  \n\n\n\nName of Speaker\nBernhard von Stengel\n\n\nSchedule\n23 April 2025\, 10am – 11.30am\n\n\nVenue \nBIZ1 03-02\n\n\nLink to Register\nhttps://nus-sg.zoom.us/meeting/register/AWc8AlsiQo67S7-kKnLwmQ\n\n\nTitle\nFinding an Empirical Equilibrium by Machine Learning in a Pricing Game\n\n\nAbstract\nWe apply machine learning to a classical multi-period pricing game. The game is a duopoly with demand inertia\, introduced by Selten in 1965 and studied around 1990 with economic experiments. Its strategies are too complex to represent explicitly. Unlike standard multi-agent reinforcement learning of training agents against each other\, our framework is that of a Policy-Space Response Oracle (PSRO) and the “double oracle” method. A few initial strategies that play against each other define a “meta-game”. A Nash equilibrium of this meta-game is computed and defines the training environment for a learning agent. Once the agent is sufficiently trained to get a higher payoff than in the current equilibrium\, it is added as a new strategy to the meta-game\, which expands in this way\, and the process repeats. Various learning methods such as SAC (Soft Actor Critic) and PPO (Proximal Policy Optimisation) lead to different limit equilibria in this setup\, namely either competitive pricing or cooperative price collusion in the duopoly. \n(Joint work with Sahar Jahani and Rahul Savani)\n\n\nAbout the Speaker\nBernhard von Stengel is Professor of Mathematics at the London School of Economics which he joined in 1998\, after studies in Germany and the USA. He has been communications officer of the Game Theory Society\, and was program chair of the GAMES 2016 congress (held jointly with EC’2016) in Maastricht. He is interested in mathematical questions of game theory. The geometry and computation of Nash equilibria is one of his research specialities.\n\n\n\n 
URL:https://iora.nus.edu.sg/events/dao-isem-iora-seminar-series-bernhard-von-stengel/
CATEGORIES:IORA Seminar Series
END:VEVENT
END:VCALENDAR