
Speech Recognition

FinAudio: A Benchmark for Audio Large Language Models in Financial Applications

26 March 2025·370 words·2 mins

AI Generated 🤗 Daily Papers Speech and Audio Speech Recognition 🏢 Stevens Institute of Technology

FinAudio: the first benchmark for audio LLMs in financial applications, intended to support financial audio analysis and investment decisions.

Quantization for OpenAI's Whisper Models: A Comparative Analysis

12 March 2025·1308 words·7 mins

AI Generated 🤗 Daily Papers Speech and Audio Speech Recognition 🏢 Independent Researcher

Quantization shrinks OpenAI’s Whisper models, trading off model size, speed, and accuracy to suit diverse applications.

Samba-ASR: State-of-the-Art Speech Recognition Leveraging Structured State-Space Models

6 January 2025·1451 words·7 mins

AI Generated 🤗 Daily Papers Natural Language Processing Speech Recognition 🏢 SandLogic Technologies Pvt Ltd

Samba-ASR, a novel speech recognition model built on the Mamba architecture, surpasses existing transformer models in accuracy and efficiency, setting a new benchmark for future ASR research.