Translate, speak, and evaluate language sentences
Memory-Guided Diffusion for Expressive Talking Video Gen