🏢 Paris-Saclay University
From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM
·1953 words·10 mins·
loading
·
loading
AI Generated
🤗 Daily Papers
Multimodal Learning
Vision-Language Models
🏢 Paris-Saclay University
SPIRE: Adds speech to text-only LLMs, maintaining text performance via discretized speech and continued pre-training.