🏢 Paris-Saclay University

From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM
·1953 words·10 mins
AI Generated 🤗 Daily Papers Multimodal Learning Vision-Language Models
SPIRE: Adds speech to text-only LLMs, maintaining text performance via discretized speech and continued pre-training.