CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought
Paper • 2409.19510 • Published • 1
How to use yxdu/llm-srt with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("yxdu/llm-srt", dtype="auto")This repository contains the model described in Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning.