3D Vision

ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling

26 September 2024·1848 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 Imperial College London

ID-to-3D: Generate expressive, identity-consistent 3D human heads from just a few in-the-wild images using score distillation sampling and 2D diffusion models.

HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors

26 September 2024·2066 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 ByteDance

HumanSplat: single image-based 3D human reconstruction using Gaussian Splatting with structural priors, achieving state-of-the-art quality and speed.

Human-3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models

26 September 2024·4637 words·22 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 University of Tübingen

Human-3Diffusion generates realistic 3D avatars from single RGB images using coupled 2D multi-view and 3D consistent diffusion models, achieving high-fidelity geometry and texture.

How to Use Diffusion Priors under Sparse Views?

26 September 2024·2930 words·14 mins· loading · loading

Computer Vision 3D Vision 🏢 Beihang University

Inline Prior Guided Score Matching (IPSM) improves sparse-view 3D reconstruction by leveraging visual inline priors from pose relationships to rectify rendered image distribution and effectively guide…

HOPE: Shape Matching Via Aligning Different K-hop Neighbourhoods

26 September 2024·1940 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Hong Kong University of Science and Technology

HOPE: a novel shape matching method achieving both accuracy and smoothness by aligning different k-hop neighborhoods and refining maps via local map distortion.

HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting

26 September 2024·2356 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Peking University

HiCoM, a novel framework, achieves high-fidelity streamable dynamic scene reconstruction by using a hierarchical coherent motion mechanism and parallel processing to significantly reduce training time…

HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting

26 September 2024·1800 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 Johns Hopkins University

HDR-GS: 1000x faster HDR novel view synthesis via Gaussian splatting!

Harmonizing Stochasticity and Determinism: Scene-responsive Diverse Human Motion Prediction

26 September 2024·2828 words·14 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Zhejiang University

DiMoP3D: Predicting diverse, physically realistic human motions in 3D scenes by harmonizing stochasticity and determinism.

Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba

26 September 2024·3671 words·18 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Carnegie Mellon University

Hamba: a novel graph-guided framework for single-view 3D hand reconstruction, significantly outperforms existing methods by efficiently modeling spatial relationships between joints using a fraction o…

Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation

26 September 2024·2871 words·14 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Chinese Academy of Sciences

Hallo3D: a tuning-free method resolving 3D generation hallucinations via multi-modal inconsistency detection and mitigation for consistent 3D content.

GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes

26 September 2024·2497 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Hong Kong University of Science and Technology

GVKF: A novel method achieves highly efficient and accurate 3D surface reconstruction in open scenes by integrating fast 3D Gaussian splatting with continuous scene representation using kernel regres…

GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats

26 September 2024·2282 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 SungKyunKwan University

GSGAN introduces a hierarchical 3D Gaussian representation for faster, high-quality 3D model generation in GANs, achieving 100x speed improvement over existing methods.

GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction

26 September 2024·2215 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 Shanghai Artificial Intelligence Laboratory

GSDF: A novel dual-branch neural scene representation elegantly resolves the rendering-reconstruction trade-off by synergistically combining 3D Gaussian Splatting and Signed Distance Fields via mutual…

GS-Hider: Hiding Messages into 3D Gaussian Splatting

26 September 2024·2889 words·14 mins· loading · loading

Computer Vision 3D Vision 🏢 Peking University

GS-Hider: A novel framework secures 3D Gaussian Splatting by embedding messages in a coupled, secured feature attribute, enabling invisible data hiding and accurate extraction.

Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting

26 September 2024·2420 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Nankai University

Grid4D: A novel 4D decomposed hash encoding boosts high-fidelity dynamic Gaussian splatting, surpassing state-of-the-art models in visual quality and rendering speed.

GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration

26 September 2024·2622 words·13 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Carnegie Mellon University

GL-NeRF accelerates NeRF rendering by using Gauss-Laguerre quadrature, drastically reducing MLP calls without needing additional networks or data structures.

GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation

26 September 2024·2226 words·11 mins· loading · loading

3D Vision 🏢 Hong Kong University of Science and Technology

GIC: Novel hybrid framework leverages 3D Gaussian representation for accurate physical property estimation from visual observations, achieving state-of-the-art performance.

GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields

26 September 2024·1769 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 Tongji University

GeoNLF: Geometry-guided Pose-free Neural LiDAR Fields revolutionizes LiDAR point cloud processing by cleverly combining neural and geometric optimization for superior novel view synthesis and multi-vi…

Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images

26 September 2024·4369 words·21 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Hong Kong Baptist University

Geometry Cloak embeds invisible perturbations in images to thwart AI-based 3D reconstruction, forcing the AI to generate identifiable patterns that act as watermarks to assert copyright.

GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation

26 September 2024·2032 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

GeoLRM: Generate stunning 3D models from just 21 images using a novel geometry-aware transformer, surpassing existing methods in efficiency and quality!