Posters
2024
On Causal Discovery in the Presence of Deterministic Relations
·2932 words·14 mins·
loading
·
loading
AI Generated
AI Theory
Causality
🏢 Mohamed Bin Zayed University of Artificial Intelligence
DGES, a novel framework, efficiently detects & handles deterministic relations in causal discovery, enhancing accuracy and scalability for real-world applications.
On Affine Homotopy between Language Encoders
·2070 words·10 mins·
loading
·
loading
AI Generated
Natural Language Processing
Representation Learning
🏢 ETH Zurich
This paper introduces a novel notion of intrinsic similarity between language encoders, based on affine homotopy, and demonstrates its strong correlation with extrinsic similarity (downstream task per…
On $f$-Divergence Principled Domain Adaptation: An Improved Framework
·1963 words·10 mins·
loading
·
loading
Machine Learning
Transfer Learning
🏢 Tongji University
Improved unsupervised domain adaptation framework achieves superior performance via refined f-divergence and novel f-domain discrepancy, enabling faster algorithms and tighter generalization bounds.
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
·2191 words·11 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
🏢 Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University
OmniTokenizer: A transformer-based tokenizer achieving state-of-the-art image and video reconstruction by leveraging a novel spatial-temporal decoupled architecture and progressive training strategy.
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
·3479 words·17 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
🏢 Peking University
OmniJARVIS: Unified vision-language-action tokenization enables open-world instruction-following agents via unified multimodal interaction data.
Omnigrasp: Simulated Humanoid Grasping on Diverse Objects
·2619 words·13 mins·
loading
·
loading
AI Generated
AI Applications
Robotics
🏢 Carnegie Mellon University
Omnigrasp: A novel RL-based method enables simulated humanoids to grasp diverse objects and precisely follow complex trajectories, advancing realistic human-object interaction in virtual environments.
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
·3418 words·17 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Skywork AI
OMG-LLaVA: A single model elegantly bridges image, object, and pixel-level reasoning for superior visual understanding.
Oja's Algorithm for Streaming Sparse PCA
·382 words·2 mins·
loading
·
loading
Machine Learning
Unsupervised Learning
🏢 University of Texas at Austin
Oja’s algorithm achieves minimax optimal error rates for streaming sparse PCA using a simple single-pass thresholding method, requiring only O(d) space and O(nd) time.
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
·2487 words·12 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 Tsinghua University
Offline RL agents often fail in real-world scenarios due to unseen test states. SCAS, a novel method, simultaneously corrects OOD states to high-value, in-distribution states and suppresses risky OOD …
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
·592 words·3 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 MIT
LOLIPOP: A novel algorithm achieving near-optimal regret for offline contextual Markov Decision Processes (CMDPs) using only O(H log T) offline density estimation oracle calls.
Offline Behavior Distillation
·1729 words·9 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 School of Computer Science, University of Sydney
This paper introduces Offline Behavior Distillation (OBD) to synthesize compact expert behavioral data from massive sub-optimal RL data, enabling faster policy learning.
Off-Policy Selection for Initiating Human-Centric Experimental Design
·1760 words·9 mins·
loading
·
loading
AI Applications
Education
🏢 Stanford University
First-Glance Off-Policy Selection (FPS) revolutionizes human-centric AI by enabling personalized policy selection for new participants without prior data, improving learning and healthcare outcomes.
Off-policy estimation with adaptively collected data: the power of online learning
·240 words·2 mins·
loading
·
loading
AI Generated
AI Theory
Causality
🏢 University of Chicago
This paper develops novel finite-sample bounds for off-policy linear treatment effect estimation with adaptively collected data, proposing online learning algorithms to improve estimation accuracy and…
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
·6706 words·32 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 Johns Hopkins University
DARAIL, a novel algorithm, tackles off-dynamics reinforcement learning by combining reward modification with imitation learning to transfer a learned policy from a source to a target domain. This app…
ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings
·1959 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Dept. of ECE & ASRI
ODGS: Lightning-fast 3D scene reconstruction from single omnidirectional images using 3D Gaussian splatting, achieving 100x speedup over NeRF-based methods.
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
·4424 words·21 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Tsinghua University
ODGEN: Boosting object detection accuracy by generating high-quality synthetic images using diffusion models conditioned on bounding boxes and text descriptions.
OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries
·2593 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 ShanghaiTech University
OctreeOcc uses octree queries for efficient and multi-granularity 3D occupancy prediction, surpassing state-of-the-art methods with reduced computational costs.
Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding
·1696 words·8 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 Baidu
Octopus, a novel multi-modal LLM, uses parallel visual recognition and sequential understanding to achieve 5x speedup on visual grounding and improved accuracy on various MLLM tasks.
Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality
·1532 words·8 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
🏢 University of Illinois Urbana-Champaign
Model-free policy gradient methods using occupancy functions are developed for online and offline RL, achieving computational efficiency and handling arbitrary data distributions.
OccFusion: Rendering Occluded Humans with Generative Diffusion Priors
·2014 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Stanford University
OccFusion: High-fidelity human rendering from videos, even with occlusions, using 3D Gaussian splatting and 2D diffusion priors.