Skip to main content

Posters

2024

On Causal Discovery in the Presence of Deterministic Relations
·2932 words·14 mins· loading · loading
AI Generated AI Theory Causality 🏢 Mohamed Bin Zayed University of Artificial Intelligence
DGES, a novel framework, efficiently detects & handles deterministic relations in causal discovery, enhancing accuracy and scalability for real-world applications.
On Affine Homotopy between Language Encoders
·2070 words·10 mins· loading · loading
AI Generated Natural Language Processing Representation Learning 🏢 ETH Zurich
This paper introduces a novel notion of intrinsic similarity between language encoders, based on affine homotopy, and demonstrates its strong correlation with extrinsic similarity (downstream task per…
On $f$-Divergence Principled Domain Adaptation: An Improved Framework
·1963 words·10 mins· loading · loading
Machine Learning Transfer Learning 🏢 Tongji University
Improved unsupervised domain adaptation framework achieves superior performance via refined f-divergence and novel f-domain discrepancy, enabling faster algorithms and tighter generalization bounds.
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
·2191 words·11 mins· loading · loading
AI Generated Multimodal Learning Vision-Language Models 🏢 Shanghai Key Lab of Intell. Info. Processing, School of CS, Fudan University
OmniTokenizer: A transformer-based tokenizer achieving state-of-the-art image and video reconstruction by leveraging a novel spatial-temporal decoupled architecture and progressive training strategy.
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
·3479 words·17 mins· loading · loading
AI Generated Multimodal Learning Vision-Language Models 🏢 Peking University
OmniJARVIS: Unified vision-language-action tokenization enables open-world instruction-following agents via unified multimodal interaction data.
Omnigrasp: Simulated Humanoid Grasping on Diverse Objects
·2619 words·13 mins· loading · loading
AI Generated AI Applications Robotics 🏢 Carnegie Mellon University
Omnigrasp: A novel RL-based method enables simulated humanoids to grasp diverse objects and precisely follow complex trajectories, advancing realistic human-object interaction in virtual environments.
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding
·3418 words·17 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Skywork AI
OMG-LLaVA: A single model elegantly bridges image, object, and pixel-level reasoning for superior visual understanding.
Oja's Algorithm for Streaming Sparse PCA
·382 words·2 mins· loading · loading
Machine Learning Unsupervised Learning 🏢 University of Texas at Austin
Oja’s algorithm achieves minimax optimal error rates for streaming sparse PCA using a simple single-pass thresholding method, requiring only O(d) space and O(nd) time.
Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
·2487 words·12 mins· loading · loading
Machine Learning Reinforcement Learning 🏢 Tsinghua University
Offline RL agents often fail in real-world scenarios due to unseen test states. SCAS, a novel method, simultaneously corrects OOD states to high-value, in-distribution states and suppresses risky OOD …
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
·592 words·3 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 MIT
LOLIPOP: A novel algorithm achieving near-optimal regret for offline contextual Markov Decision Processes (CMDPs) using only O(H log T) offline density estimation oracle calls.
Offline Behavior Distillation
·1729 words·9 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 School of Computer Science, University of Sydney
This paper introduces Offline Behavior Distillation (OBD) to synthesize compact expert behavioral data from massive sub-optimal RL data, enabling faster policy learning.
Off-Policy Selection for Initiating Human-Centric Experimental Design
·1760 words·9 mins· loading · loading
AI Applications Education 🏢 Stanford University
First-Glance Off-Policy Selection (FPS) revolutionizes human-centric AI by enabling personalized policy selection for new participants without prior data, improving learning and healthcare outcomes.
Off-policy estimation with adaptively collected data: the power of online learning
·240 words·2 mins· loading · loading
AI Generated AI Theory Causality 🏢 University of Chicago
This paper develops novel finite-sample bounds for off-policy linear treatment effect estimation with adaptively collected data, proposing online learning algorithms to improve estimation accuracy and…
Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation
·6706 words·32 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 Johns Hopkins University
DARAIL, a novel algorithm, tackles off-dynamics reinforcement learning by combining reward modification with imitation learning to transfer a learned policy from a source to a target domain. This app…
ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings
·1959 words·10 mins· loading · loading
Computer Vision 3D Vision 🏢 Dept. of ECE & ASRI
ODGS: Lightning-fast 3D scene reconstruction from single omnidirectional images using 3D Gaussian splatting, achieving 100x speedup over NeRF-based methods.
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
·4424 words·21 mins· loading · loading
Computer Vision Object Detection 🏢 Tsinghua University
ODGEN: Boosting object detection accuracy by generating high-quality synthetic images using diffusion models conditioned on bounding boxes and text descriptions.
OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries
·2593 words·13 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 ShanghaiTech University
OctreeOcc uses octree queries for efficient and multi-granularity 3D occupancy prediction, surpassing state-of-the-art methods with reduced computational costs.
Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding
·1696 words·8 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 Baidu
Octopus, a novel multi-modal LLM, uses parallel visual recognition and sequential understanding to achieve 5x speedup on visual grounding and improved accuracy on various MLLM tasks.
Occupancy-based Policy Gradient: Estimation, Convergence, and Optimality
·1532 words·8 mins· loading · loading
AI Generated Machine Learning Reinforcement Learning 🏢 University of Illinois Urbana-Champaign
Model-free policy gradient methods using occupancy functions are developed for online and offline RL, achieving computational efficiency and handling arbitrary data distributions.
OccFusion: Rendering Occluded Humans with Generative Diffusion Priors
·2014 words·10 mins· loading · loading
Computer Vision 3D Vision 🏢 Stanford University
OccFusion: High-fidelity human rendering from videos, even with occlusions, using 3D Gaussian splatting and 2D diffusion priors.
Buy Me A Coffee