Uni-Logo
Sie sind hier: Startseite Seminar Rémy Degenne

Rémy Degenne

— abgelegt unter:

Adaptive Testing: Bandits find correct answers fast

Was
  • FDM-Seminar
Wann 16.06.2023
von 12:00 bis 13:00
Termin übernehmen vCal
iCal

Abstract

Testing is the task of finding out which of several possible actions leads to the best outcome by repeatedly trying actions and observing their random effects. A company may want to find which web page A or B generates the most interaction with its clients. Clinical trials try to determine which drug quantity has the best efficiency-toxicity trade-off.

In the sequential testing framework, an agent repeatedly selects one of the actions and observes a random outcome. The agent wants to find the action with the best mean outcome as quickly as possible and with high certainty. A simple strategy is to try each action in turn until enough information is gathered. Bandit algorithms instead select their future actions based on past observations: they adapt to the data as it comes. This adaptive behavior makes them stop faster.

« Mai 2024 »
Mai
MoDiMiDoFrSaSo
12345
6789101112
13141516171819
20212223242526
2728293031
Benutzerspezifische Werkzeuge