Start / LessWrong (30+ Karma) / Linkpost new paper infra bayesian decision estimation theory by vanessa kosoy diffractor

[Linkpost] “New Paper: Infra-Bayesian Decision-Estimation Theory” by Vanessa Kosoy, Diffractor

3 min • 10 april 2025

This is a link post.

Diffractor is the first author of this paper.
Official title: "Regret Bounds for Robust Online Decision Making"

Abstract: We propose a framework which generalizes "decision making with structured observations" by allowing robust (i.e. multivalued) models. In this framework, each model associates each decision with a convex set of probability distributions over outcomes. Nature can choose distributions out of this set in an arbitrary (adversarial) manner, that can be nonoblivious and depend on past history. The resulting framework offers much greater generality than classical bandits and reinforcement learning, since the realizability assumption becomes much weaker and more realistic. We then derive a theory of regret bounds for this framework. Although our lower and upper bounds are not tight, they are sufficient to fully characterize power-law learnability. We demonstrate this theory in two special cases: robust linear bandits and tabular robust online reinforcement learning. In both cases [...]

The original text contained 2 footnotes which were omitted from this narration.

---

First published:
April 10th, 2025

Source:
https://www.lesswrong.com/posts/LgLez8aeK24PbyyQJ/new-paper-infra-bayesian-decision-estimation-theory

Linkpost URL:
https://arxiv.org/abs/2504.06820

---

Narrated by TYPE III AUDIO.

Kategorier

Filosofi Poddar Samhälle och kultur Teknologi

Förekommer på

Teknik

00:00 -00:00