ESSA: Evolutionary Strategies for Scalable Alignment Paper β’ 2507.04453 β’ Published Jul 6, 2025 β’ 4
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper β’ 2510.14528 β’ Published Oct 16, 2025 β’ 111
Group-in-Group Policy Optimization for LLM Agent Training Paper β’ 2505.10978 β’ Published May 16, 2025 β’ 19
G-CUT3R: Guided 3D Reconstruction with Camera and Depth Prior Integration Paper β’ 2508.11379 β’ Published Aug 15, 2025 β’ 12
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper β’ 2508.04280 β’ Published Aug 6, 2025 β’ 35
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success Paper β’ 2508.04280 β’ Published Aug 6, 2025 β’ 35
Reinforcement Learning for Long-Horizon Interactive LLM Agents Paper β’ 2502.01600 β’ Published Feb 3, 2025 β’ 1
view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! Mar 7, 2025 β’ 89