🏢 University of Oxford
No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery
·4811 words·23 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 University of Oxford
AI agents learn better with well-designed training environments. This paper reveals flaws in current environment-selection methods and introduces Sampling for Learnability (SFL), a new approach that …
No 'Zero-Shot' Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
·6344 words·30 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
🏢 University of Oxford
Multimodal models’ impressive ‘zero-shot’ performance hinges on the frequency of concepts in their training data, not inherent generalization ability; exponentially more data is needed for linear impr…
Metric Flow Matching for Smooth Interpolations on the Data Manifold
·2425 words·12 mins·
loading
·
loading
AI Generated
Machine Learning
Deep Learning
🏢 University of Oxford
METRIC FLOW MATCHING (MFM) generates smooth interpolations on data manifolds by minimizing kinetic energy, outperforming Euclidean methods and achieving state-of-the-art results in single-cell traject…
Marginal Causal Flows for Validation and Inference
·1827 words·9 mins·
loading
·
loading
AI Theory
Causality
🏢 University of Oxford
Frugal Flows: Generate realistic causal benchmarks with exact marginal causal effects, enabling robust causal method validation.
LoCo: Learning 3D Location-Consistent Image Features with a Memory-Efficient Ranking Loss
·1960 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Oxford
LoCo: Memory-efficient location-consistent image features learned via a novel ranking loss, enabling three orders of magnitude memory improvement and outperforming state-of-the-art.
Learning Segmentation from Point Trajectories
·2329 words·11 mins·
loading
·
loading
Image Segmentation
🏢 University of Oxford
This paper introduces a novel unsupervised video object segmentation method using long-term point trajectories and optical flow, outperforming prior art by effectively combining sparse, long-term moti…
Learning on Large Graphs using Intersecting Communities
·2286 words·11 mins·
loading
·
loading
Machine Learning
Semi-Supervised Learning
🏢 University of Oxford
Learn on massive graphs efficiently using Intersecting Community Graphs (ICGs)! This method approximates large graphs with ICGs, enabling linear time/memory complexity for node classification.
Language Models as Hierarchy Encoders
·2232 words·11 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 University of Oxford
Language models struggle with hierarchical information. This work introduces Hierarchy Transformer Encoders (HITs), a novel method to retrain transformer encoders using hyperbolic geometry and special…
Label Delay in Online Continual Learning
·4705 words·23 mins·
loading
·
loading
AI Generated
Machine Learning
Continual Learning
🏢 University of Oxford
Bridging the accuracy gap in online continual learning caused by label delays, a new framework with Importance Weighted Memory Sampling prioritizes relevant memory samples, significantly outperforming…
Interventionally Consistent Surrogates for Complex Simulation Models
·1862 words·9 mins·
loading
·
loading
AI Generated
AI Theory
Causality
🏢 University of Oxford
This paper introduces a novel framework for creating interventionally consistent surrogate models for complex simulations, addressing computational limitations and ensuring accurate policy evaluation.
Interpreting Learned Feedback Patterns in Large Language Models
·2900 words·14 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 University of Oxford
Researchers developed methods to measure and interpret the divergence between learned feedback patterns (LFPs) in LLMs and human preferences, helping minimize discrepancies between LLM behavior and tr…
Improved learning rates in multi-unit uniform price auctions
·442 words·3 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 University of Oxford
New modeling of bid space in multi-unit uniform price auctions achieves regret of Õ(K4/3T2/3) under bandit feedback, improving over prior work and closing the gap with discriminatory pricing.
G2D: From Global to Dense Radiography Representation Learning via Vision-Language Pre-training
·2099 words·10 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 University of Oxford
G2D: a novel medical VLP framework achieves superior performance in medical image analysis by simultaneously learning global and dense visual features using image-text pairs without extra annotations.
FilterNet: Harnessing Frequency Filters for Time Series Forecasting
·2439 words·12 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 University of Oxford
FilterNet: A novel deep learning architecture using learnable frequency filters for superior time series forecasting accuracy and efficiency.
Ensemble sampling for linear bandits: small ensembles suffice
·310 words·2 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 University of Oxford
Small ensembles in stochastic linear bandits achieve near-optimal regret; a rigorous analysis shows that ensemble size need only scale logarithmically with horizon.
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
·2139 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Oxford
Direct3D: Revolutionizing image-to-3D generation with a scalable, native 3D diffusion model achieving state-of-the-art quality.
Deep Bayesian Active Learning for Preference Modeling in Large Language Models
·2339 words·11 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 University of Oxford
BAL-PM, a novel active learning approach, drastically reduces human feedback in LLM preference modeling by leveraging both model uncertainty and prompt distribution diversity, achieving 33%-68% fewer …
CountGD: Multi-Modal Open-World Counting
·2520 words·12 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 University of Oxford
COUNTGD: A new multi-modal model counts objects in images using text or visual examples, significantly improving open-world counting accuracy.
Can Learned Optimization Make Reinforcement Learning Less Difficult?
·3614 words·17 mins·
loading
·
loading
Reinforcement Learning
🏢 University of Oxford
Learned optimizer OPEN tackles RL’s non-stationarity, plasticity loss, and exploration using meta-learning, significantly outperforming traditional and other learned optimizers.
Can Large Language Model Agents Simulate Human Trust Behavior?
·3567 words·17 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 University of Oxford
LLM agents surprisingly exhibit human-like trust behavior, especially GPT-4, paving the way for simulating complex human interactions in various applications.