3D Vision

GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian Splatting

26 September 2024·2093 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 NVIDIA Research

GaussianMarker: A novel uncertainty-aware watermarking method ensures robust copyright protection for 3D Gaussian Splatting assets, invisibly embedding messages into model parameters and extractable …

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

26 September 2024·2946 words·14 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

GaussianCube revolutionizes 3D generative modeling with a structured, explicit radiance representation, achieving state-of-the-art results using significantly fewer parameters.

Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images

26 September 2024·2277 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

Gaussian Graph Network (GGN) revolutionizes novel view synthesis by efficiently generating generalizable Gaussian representations from multi-view images, achieving superior rendering quality with fewe…

Fully Explicit Dynamic Gaussian Splatting

26 September 2024·3268 words·16 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 School of Electrical Engineering and Computer Science

Ex4DGS achieves real-time high-quality dynamic scene rendering using explicit 4D Gaussian representations and keyframe interpolation.

From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $lpha$-NeuS

26 September 2024·1946 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Key Laboratory of System Software (CAS) and State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences

α-NeuS: A novel method for neural implicit surface reconstruction that accurately reconstructs both transparent and opaque objects simultaneously by leveraging the unique properties of distance fields…

From Chaos to Clarity: 3DGS in the Dark

26 September 2024·2516 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Nanyang Technology University

Researchers developed a self-supervised learning framework to create high-dynamic-range 3D Gaussian Splatting (3DGS) models from noisy raw images, significantly improving reconstruction quality and sp…

From an Image to a Scene: Learning to Imagine the World from a Million 360° Videos

26 September 2024·2541 words·12 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 University of Washington

ODIN, trained on a million 360° videos (360-1M), generates realistic novel views and reconstructs 3D scenes from single images.

FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis of Indoor Scenes

26 September 2024·2183 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 National University of Singapore

FreeSplat achieves state-of-the-art novel view synthesis by accurately localizing 3D Gaussians from long image sequences, overcoming limitations of prior methods confined to narrow-range interpolation…

Flatten Anything: Unsupervised Neural Surface Parameterization

26 September 2024·2390 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Department of Computer Science, City University of Hong Kong

Flatten Anything Model (FAM) revolutionizes neural surface parameterization with unsupervised learning, handling complex topologies and unstructured data fully automatically.

Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models

26 September 2024·4174 words·20 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 City University of Hong Kong

OLIVINE uses visual foundation models for fine-grained image-to-LiDAR contrastive distillation, mitigating self-conflict issues and improving 3D representation learning.

FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors

26 September 2024·2339 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 School of Computer Science and Engineering, Sun Yat-Sen University

FFAM uses feature factorization and gradient weighting to produce high-quality visual explanations for 3D object detectors, improving model interpretability and trust.

FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training

26 September 2024·2255 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Amsterdam

FewViewGS: A novel method for high-quality novel view synthesis from sparse images using a multi-stage training scheme and a new locality-preserving regularization for 3D Gaussians.

Fast Encoder-Based 3D from Casual Videos via Point Track Processing

26 September 2024·2766 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 NVIDIA Research

TRACKSTO4D: Fast & accurate 3D reconstruction from casual videos using 2D point tracks, drastically reducing runtime by up to 95% while matching state-of-the-art accuracy.

Expressive Gaussian Human Avatars from Monocular RGB Video

26 September 2024·1431 words·7 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Texas at Austin

EVA: a novel method generates expressive 3D Gaussian human avatars from monocular RGB videos, excelling in detailed hand and facial expressions via context-aware density control and improved SMPL-X al…

Event-3DGS: Event-based 3D Reconstruction Using 3D Gaussian Splatting

26 September 2024·2242 words·11 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Tsinghua University

Event-3DGS: First event-based 3D reconstruction using 3D Gaussian splatting, enabling high-quality, efficient, and robust 3D scene reconstruction in challenging real-world conditions.

Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data

26 September 2024·3089 words·15 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Purdue University

DSPoser: A novel two-stage approach accurately estimates full-body pose from doubly sparse egocentric video data using masked autoencoders for temporal completion and conditional diffusion models for …

Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention

26 September 2024·2478 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Hong Kong University of Science and Technology

Era3D: High-resolution multiview diffusion using efficient row-wise attention, generates high-quality multiview images from single views, overcoming prior limitations.

Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis

26 September 2024·2012 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Zhejiang University

eFreeSplat: a novel, epipolar-free 3D Gaussian splatting model for generalizable novel view synthesis, surpassing state-of-the-art methods by achieving superior geometry reconstruction and novel view …

EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views

26 September 2024·2364 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Science and Technology of China

EgoChoir: a novel framework harmonizes visual appearance, head motion, and 3D objects to accurately estimate 3D human contact and object affordance from egocentric videos, surpassing existing methods.

EfficientCAPER: An End-to-End Framework for Fast and Robust Category-Level Articulated Object Pose Estimation

26 September 2024·2239 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 Zhejiang University of Technology

EfficientCAPER: A novel end-to-end framework achieves fast & robust category-level articulated object pose estimation by using a joint-centric approach, eliminating post-processing optimization and en…