Posters

On Causal Discovery in the Presence of Deterministic Relations

26 September 2024·2932 words·14 mins· loading · loading

AI Generated AI Theory Causality 🏢 Mohamed Bin Zayed University of Artificial Intelligence

DGES, a novel framework, efficiently detects & handles deterministic relations in causal discovery, enhancing accuracy and scalability for real-world applications.

On Affine Homotopy between Language Encoders

26 September 2024·2070 words·10 mins· loading · loading

AI Generated Natural Language Processing Representation Learning 🏢 ETH Zurich

This paper introduces a novel notion of intrinsic similarity between language encoders, based on affine homotopy, and demonstrates its strong correlation with extrinsic similarity (downstream task per…

On $f$-Divergence Principled Domain Adaptation: An Improved Framework

26 September 2024·1963 words·10 mins· loading · loading

Machine Learning Transfer Learning 🏢 Tongji University

Improved unsupervised domain adaptation framework achieves superior performance via refined f-divergence and novel f-domain discrepancy, enabling faster algorithms and tighter generalization bounds.

OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation

26 September 2024·2191 words·11 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University

OmniTokenizer: A transformer-based tokenizer achieving state-of-the-art image and video reconstruction by leveraging a novel spatial-temporal decoupled architecture and progressive training strategy.

OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents

26 September 2024·3479 words·17 mins· loading · loading

AI Generated Multimodal Learning Vision-Language Models 🏢 Peking University

OmniJARVIS: Unified vision-language-action tokenization enables open-world instruction-following agents via unified multimodal interaction data.

Omnigrasp: Simulated Humanoid Grasping on Diverse Objects

26 September 2024·2619 words·13 mins· loading · loading

AI Generated AI Applications Robotics 🏢 Carnegie Mellon University

Omnigrasp: A novel RL-based method enables simulated humanoids to grasp diverse objects and precisely follow complex trajectories, advancing realistic human-object interaction in virtual environments.

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

26 September 2024·3418 words·17 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Skywork AI

OMG-LLaVA: A single model elegantly bridges image, object, and pixel-level reasoning for superior visual understanding.

Oja's Algorithm for Streaming Sparse PCA

26 September 2024·382 words·2 mins· loading · loading

Machine Learning Unsupervised Learning 🏢 University of Texas at Austin

Oja’s algorithm achieves minimax optimal error rates for streaming sparse PCA using a simple single-pass thresholding method, requiring only O(d) space and O(nd) time.

Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression

26 September 2024·2487 words·12 mins· loading · loading

Machine Learning Reinforcement Learning 🏢 Tsinghua University

Offline RL agents often fail in real-world scenarios due to unseen test states. SCAS, a novel method, simultaneously corrects OOD states to high-value, in-distribution states and suppresses risky OOD …

Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff

26 September 2024·592 words·3 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 MIT

LOLIPOP: A novel algorithm achieving near-optimal regret for offline contextual Markov Decision Processes (CMDPs) using only O(H log T) offline density estimation oracle calls.

Offline Behavior Distillation

26 September 2024·1729 words·9 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 School of Computer Science, University of Sydney

This paper introduces Offline Behavior Distillation (OBD) to synthesize compact expert behavioral data from massive sub-optimal RL data, enabling faster policy learning.

Off-Policy Selection for Initiating Human-Centric Experimental Design

26 September 2024·1760 words·9 mins· loading · loading

AI Applications Education 🏢 Stanford University

First-Glance Off-Policy Selection (FPS) revolutionizes human-centric AI by enabling personalized policy selection for new participants without prior data, improving learning and healthcare outcomes.

Off-policy estimation with adaptively collected data: the power of online learning

26 September 2024·240 words·2 mins· loading · loading

AI Generated AI Theory Causality 🏢 University of Chicago

This paper develops novel finite-sample bounds for off-policy linear treatment effect estimation with adaptively collected data, proposing online learning algorithms to improve estimation accuracy and…

Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation

26 September 2024·6706 words·32 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 Johns Hopkins University

DARAIL, a novel algorithm, tackles off-dynamics reinforcement learning by combining reward modification with imitation learning to transfer a learned policy from a source to a target domain. This app…

ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings

26 September 2024·1959 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Dept. of ECE & ASRI

ODGS: Lightning-fast 3D scene reconstruction from single omnidirectional images using 3D Gaussian splatting, achieving 100x speedup over NeRF-based methods.

ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models

26 September 2024·4424 words·21 mins· loading · loading

Computer Vision Object Detection 🏢 Tsinghua University

ODGEN: Boosting object detection accuracy by generating high-quality synthetic images using diffusion models conditioned on bounding boxes and text descriptions.

OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries

26 September 2024·2593 words·13 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 ShanghaiTech University

OctreeOcc uses octree queries for efficient and multi-granularity 3D occupancy prediction, surpassing state-of-the-art methods with reduced computational costs.

Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding

26 September 2024·1696 words·8 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 Baidu

Octopus, a novel multi-modal LLM, uses parallel visual recognition and sequential understanding to achieve 5x speedup on visual grounding and improved accuracy on various MLLM tasks.

Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality

26 September 2024·1532 words·8 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Illinois Urbana-Champaign

Model-free policy gradient methods using occupancy functions are developed for online and offline RL, achieving computational efficiency and handling arbitrary data distributions.

OccFusion: Rendering Occluded Humans with Generative Diffusion Priors

26 September 2024·2014 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Stanford University

OccFusion: High-fidelity human rendering from videos, even with occlusions, using 3D Gaussian splatting and 2D diffusion priors.