Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward Paper • 2510.03222 • Published Oct 3, 2025 • 75
StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification Paper • 2411.07076 • Published Nov 11, 2024
AGILE: A Novel Reinforcement Learning Framework of LLM Agents Paper • 2405.14751 • Published May 23, 2024
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory Paper • 2508.09736 • Published Aug 13, 2025 • 57
Memory Retrieval and Consolidation in Large Language Models through Function Tokens Paper • 2510.08203 • Published Oct 9, 2025 • 9
Memory Retrieval and Consolidation in Large Language Models through Function Tokens Paper • 2510.08203 • Published Oct 9, 2025 • 9 • 2
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory Paper • 2508.09736 • Published Aug 13, 2025 • 57
MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning Paper • 2506.05523 • Published Jun 5, 2025 • 34
Frac-Connections: Fractional Extension of Hyper-Connections Paper • 2503.14125 • Published Mar 18, 2025 • 22