Adaptive Estimation of Random Vectors with Bandit Feedback | Indian Institute of Technology Madras

Home
ताजा घटनाएं
कार्यक्रम
Adaptive Estimation of Random Vectors with Bandit Feedback

Adaptive Estimation of Random Vectors with Bandit Feedback

Date4th Mar 2022

Time03:00 PM

Venue Google Meet

PAST EVENT

Details

We consider the problem of sequentially learning to estimate, in the mean squared error (MSE) sense, a Gaussian K-vector of unknown covariance by observing only m K of its entries in each round. This reduces to learning an optimal subset for estimating the entire vector. Towards this, we first establish an exponential concentration bound for an estimate of the MSE for each observable subset. We then frame the estimation problem with bandit feedback in the best-subset identification setting. We propose a variant of the successive elimination algorithm to cater to the adaptive estimation problem, and we derive an upper bound on the sample complexity of this algorithm. In addition, to understand the fundamental limit on the sample complexity of this adaptive estimation bandit problem, we derive a minimax lower bound.

Speakers

Dipayan Sen (CS18S012)

CSE