3D Vision
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling
·1848 words·9 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Imperial College London
ID-to-3D: Generate expressive, identity-consistent 3D human heads from just a few in-the-wild images using score distillation sampling and 2D diffusion models.
HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors
·2066 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
π’ ByteDance
HumanSplat: single image-based 3D human reconstruction using Gaussian Splatting with structural priors, achieving state-of-the-art quality and speed.
Human-3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models
·4637 words·22 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ University of TΓΌbingen
Human-3Diffusion generates realistic 3D avatars from single RGB images using coupled 2D multi-view and 3D consistent diffusion models, achieving high-fidelity geometry and texture.
How to Use Diffusion Priors under Sparse Views?
·2930 words·14 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Beihang University
Inline Prior Guided Score Matching (IPSM) improves sparse-view 3D reconstruction by leveraging visual inline priors from pose relationships to rectify rendered image distribution and effectively guide…
HOPE: Shape Matching Via Aligning Different K-hop Neighbourhoods
·1940 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Hong Kong University of Science and Technology
HOPE: a novel shape matching method achieving both accuracy and smoothness by aligning different k-hop neighborhoods and refining maps via local map distortion.
HiCoM: Hierarchical Coherent Motion for Dynamic Streamable Scenes with 3D Gaussian Splatting
·2356 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Peking University
HiCoM, a novel framework, achieves high-fidelity streamable dynamic scene reconstruction by using a hierarchical coherent motion mechanism and parallel processing to significantly reduce training time…
HDR-GS: Efficient High Dynamic Range Novel View Synthesis at 1000x Speed via Gaussian Splatting
·1800 words·9 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Johns Hopkins University
HDR-GS: 1000x faster HDR novel view synthesis via Gaussian splatting!
Harmonizing Stochasticity and Determinism: Scene-responsive Diverse Human Motion Prediction
·2828 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ Zhejiang University
DiMoP3D: Predicting diverse, physically realistic human motions in 3D scenes by harmonizing stochasticity and determinism.
Hamba: Single-view 3D Hand Reconstruction with Graph-guided Bi-Scanning Mamba
·3671 words·18 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ Carnegie Mellon University
Hamba: a novel graph-guided framework for single-view 3D hand reconstruction, significantly outperforms existing methods by efficiently modeling spatial relationships between joints using a fraction o…
Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation
·2871 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ Chinese Academy of Sciences
Hallo3D: a tuning-free method resolving 3D generation hallucinations via multi-modal inconsistency detection and mitigation for consistent 3D content.
GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes
·2497 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Hong Kong University of Science and Technology
GVKF: A novel method achieves highly efficient and accurate 3D surface reconstruction in open scenes by integrating fast 3D Gaussian splatting with continuous scene representation using kernel regres…
GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats
·2282 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
π’ SungKyunKwan University
GSGAN introduces a hierarchical 3D Gaussian representation for faster, high-quality 3D model generation in GANs, achieving 100x speed improvement over existing methods.
GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction
·2215 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Shanghai Artificial Intelligence Laboratory
GSDF: A novel dual-branch neural scene representation elegantly resolves the rendering-reconstruction trade-off by synergistically combining 3D Gaussian Splatting and Signed Distance Fields via mutual…
GS-Hider: Hiding Messages into 3D Gaussian Splatting
·2889 words·14 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Peking University
GS-Hider: A novel framework secures 3D Gaussian Splatting by embedding messages in a coupled, secured feature attribute, enabling invisible data hiding and accurate extraction.
Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting
·2420 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Nankai University
Grid4D: A novel 4D decomposed hash encoding boosts high-fidelity dynamic Gaussian splatting, surpassing state-of-the-art models in visual quality and rendering speed.
GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration
·2622 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ Carnegie Mellon University
GL-NeRF accelerates NeRF rendering by using Gauss-Laguerre quadrature, drastically reducing MLP calls without needing additional networks or data structures.
GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation
·2226 words·11 mins·
loading
·
loading
3D Vision
π’ Hong Kong University of Science and Technology
GIC: Novel hybrid framework leverages 3D Gaussian representation for accurate physical property estimation from visual observations, achieving state-of-the-art performance.
GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields
·1769 words·9 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Tongji University
GeoNLF: Geometry-guided Pose-free Neural LiDAR Fields revolutionizes LiDAR point cloud processing by cleverly combining neural and geometric optimization for superior novel view synthesis and multi-vi…
Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images
·4369 words·21 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ Hong Kong Baptist University
Geometry Cloak embeds invisible perturbations in images to thwart AI-based 3D reconstruction, forcing the AI to generate identifiable patterns that act as watermarks to assert copyright.
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation
·2032 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Tsinghua University
GeoLRM: Generate stunning 3D models from just 21 images using a novel geometry-aware transformer, surpassing existing methods in efficiency and quality!