Skip to main content

🏢 Delft University of Technology

Benchmarking AI Models in Software Engineering: A Review, Search Tool, and Enhancement Protocol
·3624 words·18 mins· loading · loading
AI Generated 🤗 Daily Papers Machine Learning Deep Learning 🏢 Delft University of Technology
This paper reviews AI4SE benchmarks, introduces BenchScout for benchmark discovery, and proposes BenchFrame for benchmark enhancement, demonstrated via HumanEvalNext.