In-Context Multi-Armed Bandits via Reward-Weighted Sampling

Date:

Thanks to Professor Bo Li for the invitation!