arxiv:2603.11784
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
authored a paper about 13 hours ago
Online Semi-Supervised Learning on Quantized Graphs
authored a paper about 13 hours ago
Derivative-Free & Order-Robust Optimisation
authored a paper about 13 hours ago
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption