Skip to main content

🏢 Mondragon University

o3-mini vs DeepSeek-R1: Which One is Safer?
·578 words·3 mins· loading · loading
AI Generated 🤗 Daily Papers AI Theory Safety 🏢 Mondragon University
ASTRAL, a novel automated safety testing tool, reveals DeepSeek-R1’s significantly higher unsafe response rate compared to OpenAI’s o3-mini, highlighting critical safety concerns in advanced LLMs.
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation
·1678 words·8 mins· loading · loading
AI Generated 🤗 Daily Papers AI Theory Safety 🏢 Mondragon University
Researchers used ASTRAL to systematically test OpenAI’s 03-mini LLM’s safety, revealing key vulnerabilities and highlighting the need for continuous, robust safety mechanisms in large language models.