🏢 University of Illinois Urbana-Champaign
Validating Climate Models with Spherical Convolutional Wasserstein Distance
·2133 words·11 mins·
loading
·
loading
AI Theory
Optimization
🏢 University of Illinois Urbana-Champaign
Researchers developed Spherical Convolutional Wasserstein Distance (SCWD) to more accurately validate climate models by considering spatial variability and local distributional differences.
SnapKV: LLM Knows What You are Looking for Before Generation
·2730 words·13 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 University of Illinois Urbana-Champaign
SnapKV: Slashing LLM memory usage & boosting speed via smart KV cache compression!
Sketching for Distributed Deep Learning: A Sharper Analysis
·3663 words·18 mins·
loading
·
loading
AI Generated
Machine Learning
Federated Learning
🏢 University of Illinois Urbana-Champaign
This work presents a sharper analysis of sketching for distributed deep learning, eliminating the problematic dependence on ambient dimension in convergence analysis and proving ambient dimension-inde…
SelfCodeAlign: Self-Alignment for Code Generation
·1983 words·10 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 University of Illinois Urbana-Champaign
SelfCodeAlign is a novel self-alignment method for code generation LLMs that surpasses existing methods by avoiding reliance on expensive human annotation or proprietary LLMs. The method achieves thi…
Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes
·2551 words·12 mins·
loading
·
loading
AI Generated
AI Theory
Optimization
🏢 University of Illinois Urbana-Champaign
BICCOS: Scalable neural network verification via branch-and-bound inferred cutting planes.
Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks
·1538 words·8 mins·
loading
·
loading
Large Language Models
🏢 University of Illinois Urbana-Champaign
Robust Prompt Optimization (RPO) creates robust LLM defenses against jailbreaking attacks by optimizing a transferable suffix, achieving state-of-the-art robustness.
Relational Verification Leaps Forward with RABBit
·1822 words·9 mins·
loading
·
loading
AI Theory
Robustness
🏢 University of Illinois Urbana-Champaign
RABBit: A novel Branch-and-Bound verifier for precise relational verification of Deep Neural Networks, achieving substantial precision gains over current state-of-the-art baselines.
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
·4163 words·20 mins·
loading
·
loading
Reinforcement Learning
🏢 University of Illinois Urbana-Champaign
Boost online finetuning of Decision Transformers by adding TD3 gradients, especially when pretrained with low-reward data.
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs
·3598 words·17 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 University of Illinois Urbana-Champaign
Regularizing hidden states improves reward model generalization in RLHF for LLMs, boosting accuracy and mitigating over-optimization.
RAMP: Boosting Adversarial Robustness Against Multiple $l_p$ Perturbations for Universal Robustness
·3379 words·16 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 University of Illinois Urbana-Champaign
RAMP: A novel training framework significantly boosts DNN robustness against diverse adversarial attacks by mitigating accuracy-robustness tradeoffs and improving generalization.
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing
·1818 words·9 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Illinois Urbana-Champaign
ProEdit: High-quality 3D scene editing via progressive subtask decomposition.
PageRank Bandits for Link Prediction
·2009 words·10 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 University of Illinois Urbana-Champaign
PageRank Bandits (PRB) revolutionizes link prediction by framing it as a sequential decision-making problem, thus enabling the system to adapt to evolving data. Combining contextual bandits with PageR…
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model
·1619 words·8 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 University of Illinois Urbana-Champaign
This paper proposes a novel, reward-free RLHF framework using a general preference oracle, surpassing existing reward-based approaches in efficiency and generalizability.
On the Expressive Power of Tree-Structured Probabilistic Circuits
·1425 words·7 mins·
loading
·
loading
AI Theory
Optimization
🏢 University of Illinois Urbana-Champaign
Tree-structured probabilistic circuits are surprisingly efficient: this paper proves a quasi-polynomial upper bound on their size, showing they’re almost as expressive as more complex DAG structures.
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
·299 words·2 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 University of Illinois Urbana-Champaign
This paper tackles the ‘curse of horizon’ in off-policy evaluation for partially observable Markov decision processes (POMDPs) by proposing novel coverage assumptions, enabling polynomial estimation e…
Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality
·1532 words·8 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 University of Illinois Urbana-Champaign
Model-free policy gradient methods using occupancy functions are developed for online and offline RL, achieving computational efficiency and handling arbitrary data distributions.
Most Influential Subset Selection: Challenges, Promises, and Beyond
·1721 words·9 mins·
loading
·
loading
AI Theory
Interpretability
🏢 University of Illinois Urbana-Champaign
Adaptive greedy algorithms significantly improve the accuracy of identifying the most influential subset of training data, overcoming limitations of existing methods that fail to capture complex inter…
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
·2610 words·13 mins·
loading
·
loading
Computer Vision
Scene Understanding
🏢 University of Illinois Urbana-Champaign
Lexicon3D: a first comprehensive study probing diverse visual foundation models for superior 3D scene understanding, revealing that unsupervised image models outperform others across various tasks.
InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction
·2177 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 University of Illinois Urbana-Champaign
InterDreamer: Zero-shot text-guided 3D human-object interaction generation without paired data, achieved via decoupled semantic and dynamic modeling, using LLMs and a physics-based world model.
Fine-grained Control of Generative Data Augmentation in IoT Sensing
·2239 words·11 mins·
loading
·
loading
AI Applications
Healthcare
🏢 University of Illinois Urbana-Champaign
Fine-grained control is added to generative models for IoT sensing data augmentation, tailoring synthetic data to specific application needs by leveraging domain expertise and statistical metrics of s…