Wrapper-Aware Rate-Distortion Optimization in Feature Coding for Machines Paper • 2601.22070 • Published Jan 29 • 1
Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators Paper • 2605.22717 • Published 3 days ago • 2
WorldKV: Efficient World Memory with World Retrieval and Compression Paper • 2605.22718 • Published 3 days ago • 32
LiVeAction: a Lightweight, Versatile, and Asymmetric Neural Codec Design for Real-time Operation Paper • 2605.06628 • Published 17 days ago • 6
Standard compliant video coding using low complexity, switchable neural wrappers Paper • 2407.07395 • Published Jul 10, 2024 • 1
CPUBone: Efficient Vision Backbone Design for Devices with Low Parallelization Capabilities Paper • 2603.26425 • Published Mar 30 • 1
The Last Human-Written Paper: Agent-Native Research Artifacts Paper • 2604.24658 • Published 25 days ago • 21
Soft Anisotropic Diagrams for Differentiable Image Representation Paper • 2604.21984 • Published 27 days ago • 5
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer Paper • 2605.00503 • Published 23 days ago • 11
Let ViT Speak: Generative Language-Image Pre-training Paper • 2605.00809 • Published 23 days ago • 32
Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 22 days ago • 23
OceanPile: A Large-Scale Multimodal Ocean Corpus for Foundation Models Paper • 2605.00877 • Published 29 days ago • 15
Training Transformer Models by Wavelet Losses Improves Quantitative and Visual Performance in Single Image Super-Resolution Paper • 2404.11273 • Published Apr 17, 2024 • 1