Stable Experimentation? An Analysis of Multi-Armed Bandit Problems with Bundling

This paper introduces bundled experimentation problems and compares behavior in these problems to a classic (non-bundled) experimentation problem. We transform a two-decision four-armed bandit problem into two different bundling experimentation problems. The first bundling problem is a “stable” problem where each type of urn in the four-armed bandit problem becomes bundled with a known urn of the same quality. The second bundling problem is an “encouraged” problem where the urn types with larger exploration incentives in the four-armed bandit get bundled with urns with higher known reward rates. In our experiment, there should be no differences in the (expected) information generated from the first decision in each type of experimentation problem. However, we find that both types of bundling problems lead to more expected information generated than the non-bundled experimentation problem. Additionally, in contrast to theory, we find that subjects’ second decision in a bundled bandit problem is influenced by the balls drawn from both of the initially chosen options urns. These two results suggest that choice is not preserved when transforming a non-bundled experimentation problem into a bundled experimentation problem.

The contents above will be part of a list of publications, if the user clicks the link for the publication than the contents of section will be rendered as a full page, allowing you to provide more information about the paper for the reader. When publications are displayed as a single page, the contents of the above “citation” field will automatically be included below this section in a smaller font.

Authors: Hudja, Stanton and Woods, Daniel and Ralston, Jason, Stable Experimentation? An Analysis of Multi-Armed Bandit Problems with Bundling (November 30, 2025). Available at SSRN: https://ssrn.com/abstract=

Jason Ralston