Posters
2024
DisCEdit: Model Editing by Identifying Discriminative Components
·2619 words·13 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 Indian Institute of Science
DISCEDIT efficiently identifies and edits discriminative neural network components for structured pruning and class unlearning, achieving high sparsity and forgetting rates without needing training da…
DisC-GS: Discontinuity-aware Gaussian Splatting
·2095 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Lancaster University
DisC-GS enhances Gaussian Splatting for real-time novel view synthesis by accurately rendering image discontinuities and boundaries, improving visual quality.
Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
·2781 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 Shanghai Artificial Intelligence Laboratory
Director3D generates realistic 3D scenes and camera trajectories from text descriptions using a three-stage pipeline: Cinematographer, Decorator, and Detailer.
Directional Smoothness and Gradient Methods: Convergence and Adaptivity
·1502 words·8 mins·
loading
·
loading
AI Generated
AI Theory
Optimization
🏢 Stanford University
New sub-optimality bounds for gradient descent leverage directional smoothness, a localized gradient variation measure, achieving tighter convergence guarantees and adapting to optimization paths.
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer
·2139 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Oxford
Direct3D: Revolutionizing image-to-3D generation with a scalable, native 3D diffusion model achieving state-of-the-art quality.
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
·4016 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
Image Generation
🏢 NAVER AI Lab
Direct Unlearning Optimization (DUO) robustly removes unsafe content from text-to-image models by using paired image data and output-preserving regularization, effectively defending against adversaria…
Direct Preference-Based Evolutionary Multi-Objective Optimization with Dueling Bandits
·5743 words·27 mins·
loading
·
loading
AI Generated
AI Theory
Optimization
🏢 School of Computer Science and Engineering, University of Electronic Science and Technology of China
D-PBEMO: A novel framework for preference-based multi-objective optimization using clustering-based stochastic dueling bandits to directly leverage human feedback, improving efficiency and managing co…
Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion models
·3011 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 KAIST
Boosting personalized image generation! Direct Consistency Optimization (DCO) fine-tunes text-to-image models, ensuring subject consistency and prompt fidelity, even when merging separately customized…
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection
·2876 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 University of Queensland
DiPEx: a novel self-supervised prompt expansion method dramatically boosts class-agnostic object detection by progressively learning non-overlapping hyperspherical prompts, surpassing existing methods…
DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization
·2302 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 Advanced Micro Devices, Inc.
DiP-GO: A novel pruning method accelerates diffusion models via few-step gradient optimization, achieving a 4.4x speedup on Stable Diffusion 1.5 without accuracy loss.
DINTR: Tracking via Diffusion-based Interpolation
·2223 words·11 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 University of Arkansas
DINTR: A novel diffusion-based object tracker surpasses existing methods by using efficient interpolation, achieving superior performance across diverse benchmarks.
DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation
·3655 words·18 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 VinAI Research
DiMSUM: A novel diffusion model boosts image generation by unifying spatial and frequency information, achieving superior results and faster training.
Dimension-free Private Mean Estimation for Anisotropic Distributions
·233 words·2 mins·
loading
·
loading
AI Generated
AI Theory
Privacy
🏢 UC Berkeley
Dimension-free private mean estimation is achieved for anisotropic data, breaking the curse of dimensionality in privacy-preserving high-dimensional analysis.
DiGRAF: Diffeomorphic Graph-Adaptive Activation Function
·2555 words·12 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 Purdue University
DIGRAF, a novel graph-adaptive activation function, significantly boosts Graph Neural Network performance by dynamically adapting to graph structure, offering consistent superior results across divers…
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning
·3604 words·17 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 UC Berkeley
DigiRL: Autonomous RL trains robust in-the-wild device-control agents by offline-to-online RL, surpassing prior methods.
DiffusionPDE: Generative PDE-Solving under Partial Observation
·3911 words·19 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 University of Michigan
DiffusionPDE uses generative diffusion models to solve PDEs accurately, even with highly incomplete observations, outperforming state-of-the-art methods.
DiffusionFake: Enhancing Generalization in Deepfake Detection via Guided Stable Diffusion
·2705 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Face Recognition
🏢 Tencent AI Lab
DiffusionFake enhances deepfake detection by cleverly reversing the image generation process, enabling detectors to learn more robust features and significantly improve cross-domain generalization.
DiffusionBlend: Learning 3D Image Prior through Position-aware Diffusion Score Blending for 3D Computed Tomography Reconstruction
·2570 words·13 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Michigan
DiffusionBlend++ learns a 3D image prior via position-aware diffusion score blending, achieving state-of-the-art 3D CT reconstruction with superior efficiency.
Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models
·1559 words·8 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 University of Toronto
Diffusion4D: Fast, consistent 4D content generation via a novel 4D-aware video diffusion model, surpassing existing methods in efficiency and 4D geometry consistency.
Diffusion-Reward Adversarial Imitation Learning
·2028 words·10 mins·
loading
·
loading
Machine Learning
Reinforcement Learning
🏢 NVIDIA
Diffusion-Reward Adversarial Imitation Learning (DRAIL) enhances Generative Adversarial Imitation Learning by integrating diffusion models, resulting in more stable and smoother reward functions for s…