Skip to main content

Speaker Recognition

SLIM: Style-Linguistics Mismatch Model for Generalized Audio Deepfake Detection
·1964 words·10 mins· loading · loading
Speech and Audio Speaker Recognition 🏢 Reality Defender Inc.
SLIM: A novel audio deepfake detection model leverages style-linguistics mismatch for superior generalization and explainability.
Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing
·2129 words·10 mins· loading · loading
Speech and Audio Speaker Recognition 🏢 Telecom Paris
Annealed Multiple Choice Learning (aMCL) overcomes limitations of Winner-takes-all in multiple choice learning by using annealing, improving robustness and performance.