BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design Paper • 2508.21184 • Published Aug 28 • 2
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30 • 277