Rémy Degenne

Adaptive Testing: Bandits find correct answers fast

What
  • FDM-Seminar
When 16.06.2023
from 12:00 to 13:00

Abstract

Testing is the task of finding out which of several possible actions leads to the best outcome by repeatedly trying actions and observing their random effects. A company may want to find out which of two web pages, A or B, generates the most interaction with its clients. Clinical trials try to determine which drug dose has the best efficacy-toxicity trade-off.

In the sequential testing framework, an agent repeatedly selects one of the actions and observes a random outcome. The agent wants to find the action with the best mean outcome as quickly as possible and with high certainty. A simple strategy is to try each action in turn until enough information is gathered. Bandit algorithms instead select their future actions based on past observations: they adapt to the data as it comes. This adaptive behavior makes them stop faster.
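
The difference between trying each action in turn and adapting to the data can be illustrated with a small simulation. The following is a minimal sketch and not the speaker's method: it assumes Bernoulli outcomes with hypothetical means, a Hoeffding-style confidence radius, and successive elimination as one simple example of an adaptive strategy. The function name `find_best_arm` and all of its parameters are illustrative assumptions.

```python
import math
import random


def find_best_arm(means, delta=0.05, adaptive=True, seed=0, max_rounds=100_000):
    """Return (guessed best arm, total number of samples used).

    Assumptions for this sketch: arms are Bernoulli with the given
    (hypothetical) means, and delta is the allowed error probability.
    adaptive=False: every arm is sampled in every round until the leader
                    is confidently better than all other arms.
    adaptive=True:  arms that are confidently worse than the leader are
                    dropped and never sampled again (successive elimination).
    """
    k = len(means)
    # One random stream per arm, so both strategies see the same rewards
    # for the i-th pull of a given arm.
    rngs = [random.Random(seed * 1000 + a) for a in range(k)]
    sums = [0.0] * k          # cumulative reward per arm
    active = list(range(k))   # arms still being sampled
    samples = 0

    for t in range(1, max_rounds + 1):
        # Pull every active arm once; each active arm now has t pulls.
        for a in active:
            sums[a] += 1.0 if rngs[a].random() < means[a] else 0.0
            samples += 1

        # Hoeffding confidence radius with a union bound over arms and rounds.
        radius = math.sqrt(math.log(4 * k * t * t / delta) / (2 * t))
        leader = max(active, key=lambda a: sums[a])

        # An arm survives if its upper confidence bound still reaches
        # the leader's lower confidence bound.
        survivors = [a for a in active
                     if (sums[leader] - sums[a]) / t <= 2 * radius]
        if len(survivors) == 1:
            return leader, samples
        if adaptive:
            active = survivors

    # Sampling budget exhausted: return the empirically best remaining arm.
    return max(active, key=lambda a: sums[a]), samples


if __name__ == "__main__":
    example_means = [0.8, 0.6, 0.3, 0.2]  # hypothetical values
    print("uniform :", find_best_arm(example_means, adaptive=False))
    print("adaptive:", find_best_arm(example_means, adaptive=True))
```

In this sketch, the adaptive variant stops spending samples on clearly inferior actions, so it typically needs fewer observations in total to reach the same confidence level. The size of the saving depends on the assumed means.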
