Enabling Versatile Controls for Video Diffusion Models Paper • 2503.16983 • Published Mar 21, 2025 • 15
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks Paper • 2503.04065 • Published Mar 6, 2025
baidu/ERNIE-4.5-VL-28B-A3B-Thinking Image-Text-to-Text • 30B • Updated Dec 24, 2025 • 1.21k • 518
baidu/ERNIE-4.5-21B-A3B-Thinking Text Generation • 22B • Updated Nov 26, 2025 • 578 • • 772
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 229
baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle Image-Text-to-Text • 424B • Updated Aug 19, 2025 • 18 • 61
Running on Zero Featured 2.7k Whisper 📉 2.7k Transcribe audio and YouTube videos into text instantly