Speech Recognition

FinAudio: A Benchmark for Audio Large Language Models in Financial Applications
·370 words·2 mins
AI Generated 🤗 Daily Papers Speech and Audio Speech Recognition 🏢 Stevens Institute of Technology
FinAudio: the first benchmark for audio LLMs in financial applications, aimed at improving financial audio analysis and supporting investment decisions.
Quantization for OpenAI's Whisper Models: A Comparative Analysis
·1308 words·7 mins
AI Generated 🤗 Daily Papers Speech and Audio Speech Recognition 🏢 Independent Researcher
Quantization optimizes OpenAI’s Whisper models, balancing model size, speed, and accuracy for diverse applications.
Samba-ASR: State-of-the-Art Speech Recognition Leveraging Structured State-Space Models
·1451 words·7 mins
AI Generated 🤗 Daily Papers Natural Language Processing Speech Recognition 🏢 SandLogic Technologies Pvt Ltd
Samba-ASR, a novel speech recognition model built on the Mamba architecture, surpasses existing transformer models in accuracy and efficiency, setting a new benchmark for future ASR research.