🏢 University of Illinois Urbana-Champaign

Validating Climate Models with Spherical Convolutional Wasserstein Distance

26 September 2024·2133 words·11 mins· loading · loading

AI Theory Optimization 🏢 University of Illinois Urbana-Champaign

Researchers developed Spherical Convolutional Wasserstein Distance (SCWD) to more accurately validate climate models by considering spatial variability and local distributional differences.

SnapKV: LLM Knows What You are Looking for Before Generation

26 September 2024·2730 words·13 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 University of Illinois Urbana-Champaign

SnapKV: Slashing LLM memory usage & boosting speed via smart KV cache compression!

Sketching for Distributed Deep Learning: A Sharper Analysis

26 September 2024·3663 words·18 mins· loading · loading

AI Generated Machine Learning Federated Learning 🏢 University of Illinois Urbana-Champaign

This work presents a sharper analysis of sketching for distributed deep learning, eliminating the problematic dependence on ambient dimension in convergence analysis and proving ambient dimension-inde…

SelfCodeAlign: Self-Alignment for Code Generation

26 September 2024·1983 words·10 mins· loading · loading

Natural Language Processing Large Language Models 🏢 University of Illinois Urbana-Champaign

SelfCodeAlign is a novel self-alignment method for code generation LLMs that surpasses existing methods by avoiding reliance on expensive human annotation or proprietary LLMs. The method achieves thi…

Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

26 September 2024·2551 words·12 mins· loading · loading

AI Generated AI Theory Optimization 🏢 University of Illinois Urbana-Champaign

BICCOS: Scalable neural network verification via branch-and-bound inferred cutting planes.

Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks

26 September 2024·1538 words·8 mins· loading · loading

Large Language Models 🏢 University of Illinois Urbana-Champaign

Robust Prompt Optimization (RPO) creates robust LLM defenses against jailbreaking attacks by optimizing a transferable suffix, achieving state-of-the-art robustness.

Relational Verification Leaps Forward with RABBit

26 September 2024·1822 words·9 mins· loading · loading

AI Theory Robustness 🏢 University of Illinois Urbana-Champaign

RABBit: A novel Branch-and-Bound verifier for precise relational verification of Deep Neural Networks, achieving substantial precision gains over current state-of-the-art baselines.

Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers

26 September 2024·4163 words·20 mins· loading · loading

Reinforcement Learning 🏢 University of Illinois Urbana-Champaign

Boost online finetuning of Decision Transformers by adding TD3 gradients, especially when pretrained with low-reward data.

Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs

26 September 2024·3598 words·17 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 University of Illinois Urbana-Champaign

Regularizing hidden states improves reward model generalization in RLHF for LLMs, boosting accuracy and mitigating over-optimization.

RAMP: Boosting Adversarial Robustness Against Multiple $l_p$ Perturbations for Universal Robustness

26 September 2024·3379 words·16 mins· loading · loading

Computer Vision Image Classification 🏢 University of Illinois Urbana-Champaign

RAMP: A novel training framework significantly boosts DNN robustness against diverse adversarial attacks by mitigating accuracy-robustness tradeoffs and improving generalization.

ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing

26 September 2024·1818 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Illinois Urbana-Champaign

ProEdit: High-quality 3D scene editing via progressive subtask decomposition.

PageRank Bandits for Link Prediction

26 September 2024·2009 words·10 mins· loading · loading

Machine Learning Deep Learning 🏢 University of Illinois Urbana-Champaign

PageRank Bandits (PRB) revolutionizes link prediction by framing it as a sequential decision-making problem, thus enabling the system to adapt to evolving data. Combining contextual bandits with PageR…

Online Iterative Reinforcement Learning from Human Feedback with General Preference Model

26 September 2024·1619 words·8 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 University of Illinois Urbana-Champaign

This paper proposes a novel, reward-free RLHF framework using a general preference oracle, surpassing existing reward-based approaches in efficiency and generalizability.

On the Expressive Power of Tree-Structured Probabilistic Circuits

26 September 2024·1425 words·7 mins· loading · loading

AI Theory Optimization 🏢 University of Illinois Urbana-Champaign

Tree-structured probabilistic circuits are surprisingly efficient: this paper proves a quasi-polynomial upper bound on their size, showing they’re almost as expressive as more complex DAG structures.

On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation

26 September 2024·299 words·2 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 University of Illinois Urbana-Champaign

This paper tackles the ‘curse of horizon’ in off-policy evaluation for partially observable Markov decision processes (POMDPs) by proposing novel coverage assumptions, enabling polynomial estimation e…

Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality

26 September 2024·1532 words·8 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Illinois Urbana-Champaign

Model-free policy gradient methods using occupancy functions are developed for online and offline RL, achieving computational efficiency and handling arbitrary data distributions.

Most Influential Subset Selection: Challenges, Promises, and Beyond

26 September 2024·1721 words·9 mins· loading · loading

AI Theory Interpretability 🏢 University of Illinois Urbana-Champaign

Adaptive greedy algorithms significantly improve the accuracy of identifying the most influential subset of training data, overcoming limitations of existing methods that fail to capture complex inter…

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

26 September 2024·2610 words·13 mins· loading · loading

Computer Vision Scene Understanding 🏢 University of Illinois Urbana-Champaign

Lexicon3D: a first comprehensive study probing diverse visual foundation models for superior 3D scene understanding, revealing that unsupervised image models outperform others across various tasks.

InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction

26 September 2024·2177 words·11 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 University of Illinois Urbana-Champaign

InterDreamer: Zero-shot text-guided 3D human-object interaction generation without paired data, achieved via decoupled semantic and dynamic modeling, using LLMs and a physics-based world model.

Fine-grained Control of Generative Data Augmentation in IoT Sensing

26 September 2024·2239 words·11 mins· loading · loading

AI Applications Healthcare 🏢 University of Illinois Urbana-Champaign

Fine-grained control is added to generative models for IoT sensing data augmentation, tailoring synthetic data to specific application needs by leveraging domain expertise and statistical metrics of s…