Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models
Paper • 2604.26951 • Published • 46
nlu
From Skill Text to Skill Structure: The Scheduling-Structural-Logical Representation for Agent Skills