Skip to main content

Deep Learning

S*: Test Time Scaling for Code Generation
·2539 words·12 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข UC Berkeley
S*: Hybrid test-time scaling for code generation, boosting both coverage and selection accuracy.
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation
·6586 words·31 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข National University of Singapore
NExT-Mol: Combines 1D language models with 3D diffusion for molecule generation, achieving state-of-the-art performance and validity.
Thinking Preference Optimization
·5794 words·28 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข Case.edu
ThinkPO improves LLM reasoning by preferring longer CoT, boosting performance without new data.
Small Models Struggle to Learn from Strong Reasoners
·4149 words·20 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข University of Washington
Small language models struggle to learn complex reasoning from large models, but a novel ‘Mix Distillation’ method balances complexity for effective capability transfer.
AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting
·3650 words·18 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข Huawei Noah's Ark Lab, Paris, France
AdaPTS effectively adapts pre-trained univariate time series models to probabilistic multivariate forecasting, improving accuracy and uncertainty quantification.
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights
·3096 words·15 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข School of Computer Science and Engineering
ProbeLog: Zero-shot model search directly from weights, boosting efficiency and accuracy!
Weak-to-Strong Diffusion with Reflection
·4655 words·22 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข Hong Kong University of Science and Technology
W2SD: A novel framework boosts diffusion model quality by using the difference between weak and strong models to refine sampling trajectories, achieving state-of-the-art performance.
PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh Representations
·5378 words·26 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข Department of Artificial Intelligence, Sungkyunkwan University
Physics-Informed Gaussians (PIGs) revolutionize PDE solving by using adaptive, learnable Gaussian functions for superior accuracy and efficiency.
Best of Both Worlds: Advantages of Hybrid Graph Sequence Models
·3440 words·17 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข Google Research
Hybrid Graph Sequence Model (GSM++) outperforms existing models by using hierarchical sequences and a hybrid architecture of Transformers and recurrent models, effectively capturing both local and glo…
Improving the detection of technical debt in Java source code with an enriched dataset
·1778 words·9 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข Hanoi University of Science and Technology
Enriched dataset TESORO improves technical debt detection by combining self-admitted comments and Java source code, advancing state-of-the-art models.
SambaMixer: State of Health Prediction of Li-ion Batteries using Mamba State Space Models
·3912 words·19 mins· loading · loading
AI Generated ๐Ÿค— Daily Papers Machine Learning Deep Learning ๐Ÿข UNED - Universidad Nacional De Educaciรณn a Distancia, Madrid, Spain
SambaMixer: A novel state-space model accurately predicts Li-ion battery health using efficient Mamba architecture and innovative resampling techniques.