Confidence intervals for policy evaluation in adaptive experiments
Confidence intervals for policy evaluation in adaptive experiments
Significance Randomized controlled trials are central to the scientific process, but they can be costly. For example, a clinical trial may assign patients to treatments that are detrimental to them. Adaptive experimental designs, such as multiarmed bandit algorithms, reduce costs by increasing the probability of assigning promising treatments over the …