Posters

DisCEdit: Model Editing by Identifying Discriminative Components

26 September 2024·2619 words·13 mins

Machine Learning Deep Learning 🏢 Indian Institute of Science

DisCEdit efficiently identifies and edits discriminative neural network components for structured pruning and class unlearning, achieving high sparsity and forgetting rates without needing training da…

DisC-GS: Discontinuity-aware Gaussian Splatting

26 September 2024·2095 words·10 mins

Computer Vision 3D Vision 🏢 Lancaster University

DisC-GS enhances Gaussian Splatting for real-time novel view synthesis by accurately rendering image discontinuities and boundaries, improving visual quality.

Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text

26 September 2024·2781 words·14 mins

AI Generated Computer Vision 3D Vision 🏢 Shanghai Artificial Intelligence Laboratory

Director3D generates realistic 3D scenes and camera trajectories from text descriptions using a three-stage pipeline: Cinematographer, Decorator, and Detailer.

Directional Smoothness and Gradient Methods: Convergence and Adaptivity

26 September 2024·1502 words·8 mins

AI Generated AI Theory Optimization 🏢 Stanford University

New sub-optimality bounds for gradient descent leverage directional smoothness, a localized gradient variation measure, achieving tighter convergence guarantees and adapting to optimization paths.

Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer

26 September 2024·2139 words·11 mins

Computer Vision 3D Vision 🏢 University of Oxford

Direct3D is a scalable, native 3D diffusion model for image-to-3D generation that achieves state-of-the-art quality.

Direct Unlearning Optimization for Robust and Safe Text-to-Image Models

26 September 2024·4016 words·19 mins

AI Generated Computer Vision Image Generation 🏢 NAVER AI Lab

Direct Unlearning Optimization (DUO) robustly removes unsafe content from text-to-image models by using paired image data and output-preserving regularization, effectively defending against adversaria…

Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling Bandits

26 September 2024·5743 words·27 mins

AI Generated AI Theory Optimization 🏢 School of Computer Science and Engineering, University of Electronic Science and Technology of China

D-PBEMO: A novel framework for preference-based multi-objective optimization using clustering-based stochastic dueling bandits to directly leverage human feedback, improving efficiency and managing co…

Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion models

26 September 2024·3011 words·15 mins

Computer Vision Image Generation 🏢 KAIST

Direct Consistency Optimization (DCO) fine-tunes text-to-image models for personalized image generation, ensuring subject consistency and prompt fidelity, even when merging separately customized…

DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection

26 September 2024·2876 words·14 mins

AI Generated Computer Vision Object Detection 🏢 University of Queensland

DiPEx: a novel self-supervised prompt expansion method dramatically boosts class-agnostic object detection by progressively learning non-overlapping hyperspherical prompts, surpassing existing methods…

DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization

26 September 2024·2302 words·11 mins

Computer Vision Image Generation 🏢 Advanced Micro Devices, Inc.

DiP-GO: A novel pruning method accelerates diffusion models via few-step gradient optimization, achieving a 4.4x speedup on Stable Diffusion 1.5 without accuracy loss.

DINTR: Tracking via Diffusion-based Interpolation

26 September 2024·2223 words·11 mins

Computer Vision Object Detection 🏢 University of Arkansas

DINTR: A novel diffusion-based object tracker surpasses existing methods by using efficient interpolation, achieving superior performance across diverse benchmarks.

DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation

26 September 2024·3655 words·18 mins

Computer Vision Image Generation 🏢 VinAI Research

DiMSUM: A novel diffusion model boosts image generation by unifying spatial and frequency information, achieving superior results and faster training.

Dimension-free Private Mean Estimation for Anisotropic Distributions

26 September 2024·233 words·2 mins

AI Generated AI Theory Privacy 🏢 UC Berkeley

Dimension-free private mean estimation is achieved for anisotropic data, breaking the curse of dimensionality in privacy-preserving high-dimensional analysis.

DiGRAF: Diffeomorphic Graph-Adaptive Activation Function

26 September 2024·2555 words·12 mins

Machine Learning Deep Learning 🏢 Purdue University

DiGRAF, a novel graph-adaptive activation function, significantly boosts Graph Neural Network performance by dynamically adapting to graph structure, offering consistently superior results across divers…

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

26 September 2024·3604 words·17 mins

Multimodal Learning Vision-Language Models 🏢 UC Berkeley

DigiRL trains robust in-the-wild device-control agents with autonomous offline-to-online RL, surpassing prior methods.

DiffusionPDE: Generative PDE-Solving under Partial Observation

26 September 2024·3911 words·19 mins

Machine Learning Deep Learning 🏢 University of Michigan

DiffusionPDE uses generative diffusion models to solve PDEs accurately, even with highly incomplete observations, outperforming state-of-the-art methods.

DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion

26 September 2024·2705 words·13 mins

AI Generated Computer Vision Face Recognition 🏢 Tencent AI Lab

DiffusionFake enhances deepfake detection by cleverly reversing the image generation process, enabling detectors to learn more robust features and significantly improve cross-domain generalization.

DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction

26 September 2024·2570 words·13 mins

Computer Vision 3D Vision 🏢 University of Michigan

DiffusionBlend++ learns a 3D image prior via position-aware diffusion score blending, achieving state-of-the-art 3D CT reconstruction with superior efficiency.

Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models

26 September 2024·1559 words·8 mins

Computer Vision Image Generation 🏢 University of Toronto

Diffusion4D: Fast, consistent 4D content generation via a novel 4D-aware video diffusion model, surpassing existing methods in efficiency and 4D geometry consistency.

Diffusion-Reward Adversarial Imitation Learning

26 September 2024·2028 words·10 mins

Machine Learning Reinforcement Learning 🏢 NVIDIA

Diffusion-Reward Adversarial Imitation Learning (DRAIL) enhances Generative Adversarial Imitation Learning by integrating diffusion models, resulting in more stable and smoother reward functions for s…