3D Vision
GaussianMarker: Uncertainty-Aware Copyright Protection of 3D Gaussian Splatting
·2093 words·10 mins·
Computer Vision
3D Vision
🏢 NVIDIA Research
GaussianMarker: A novel uncertainty-aware watermarking method ensures robust copyright protection for 3D Gaussian Splatting assets, invisibly embedding messages into model parameters and extractable …
GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling
·2946 words·14 mins·
Computer Vision
3D Vision
🏢 Tsinghua University
GaussianCube revolutionizes 3D generative modeling with a structured, explicit radiance representation, achieving state-of-the-art results using significantly fewer parameters.
Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images
·2277 words·11 mins·
Computer Vision
3D Vision
🏢 Tsinghua University
Gaussian Graph Network (GGN) revolutionizes novel view synthesis by efficiently generating generalizable Gaussian representations from multi-view images, achieving superior rendering quality with fewe…
Fully Explicit Dynamic Gaussian Splatting
·3268 words·16 mins·
AI Generated
Computer Vision
3D Vision
🏢 School of Electrical Engineering and Computer Science
Ex4DGS achieves real-time high-quality dynamic scene rendering using explicit 4D Gaussian representations and keyframe interpolation.
From Transparent to Opaque: Rethinking Neural Implicit Surfaces with $\alpha$-NeuS
·1946 words·10 mins·
Computer Vision
3D Vision
🏢 Key Laboratory of System Software (CAS) and State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences
α-NeuS: A novel method for neural implicit surface reconstruction that accurately reconstructs both transparent and opaque objects simultaneously by leveraging the unique properties of distance fields…
From Chaos to Clarity: 3DGS in the Dark
·2516 words·12 mins·
Computer Vision
3D Vision
🏢 Nanyang Technological University
Researchers developed a self-supervised learning framework to create high-dynamic-range 3D Gaussian Splatting (3DGS) models from noisy raw images, significantly improving reconstruction quality and sp…
From an Image to a Scene: Learning to Imagine the World from a Million 360° Videos
·2541 words·12 mins·
AI Generated
Computer Vision
3D Vision
🏢 University of Washington
ODIN, trained on a million 360° videos (360-1M), generates realistic novel views and reconstructs 3D scenes from single images.
FreeSplat: Generalizable 3D Gaussian Splatting Towards Free View Synthesis of Indoor Scenes
·2183 words·11 mins·
Computer Vision
3D Vision
🏢 National University of Singapore
FreeSplat achieves state-of-the-art novel view synthesis by accurately localizing 3D Gaussians from long image sequences, overcoming limitations of prior methods confined to narrow-range interpolation…
Flatten Anything: Unsupervised Neural Surface Parameterization
·2390 words·12 mins·
Computer Vision
3D Vision
🏢 Department of Computer Science, City University of Hong Kong
Flatten Anything Model (FAM) revolutionizes neural surface parameterization with unsupervised learning, handling complex topologies and unstructured data fully automatically.
Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models
·4174 words·20 mins·
AI Generated
Computer Vision
3D Vision
🏢 City University of Hong Kong
OLIVINE uses visual foundation models for fine-grained image-to-LiDAR contrastive distillation, mitigating self-conflict issues and improving 3D representation learning.
FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors
·2339 words·11 mins·
Computer Vision
3D Vision
🏢 School of Computer Science and Engineering, Sun Yat-Sen University
FFAM uses feature factorization and gradient weighting to produce high-quality visual explanations for 3D object detectors, improving model interpretability and trust.
FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training
·2255 words·11 mins·
Computer Vision
3D Vision
🏢 University of Amsterdam
FewViewGS: A novel method for high-quality novel view synthesis from sparse images using a multi-stage training scheme and a new locality-preserving regularization for 3D Gaussians.
Fast Encoder-Based 3D from Casual Videos via Point Track Processing
·2766 words·13 mins·
Computer Vision
3D Vision
🏢 NVIDIA Research
TRACKSTO4D: Fast & accurate 3D reconstruction from casual videos using 2D point tracks, drastically reducing runtime by up to 95% while matching state-of-the-art accuracy.
Expressive Gaussian Human Avatars from Monocular RGB Video
·1431 words·7 mins·
Computer Vision
3D Vision
🏢 University of Texas at Austin
EVA: a novel method that generates expressive 3D Gaussian human avatars from monocular RGB videos, excelling in detailed hand and facial expressions via context-aware density control and improved SMPL-X al…
Event-3DGS: Event-based 3D Reconstruction Using 3D Gaussian Splatting
·2242 words·11 mins·
AI Generated
Computer Vision
3D Vision
🏢 Tsinghua University
Event-3DGS: the first event-based 3D reconstruction method using 3D Gaussian splatting, enabling high-quality, efficient, and robust 3D scene reconstruction in challenging real-world conditions.
Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data
·3089 words·15 mins·
AI Generated
Computer Vision
3D Vision
🏢 Purdue University
DSPoser: A novel two-stage approach accurately estimates full-body pose from doubly sparse egocentric video data using masked autoencoders for temporal completion and conditional diffusion models for …
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention
·2478 words·12 mins·
Computer Vision
3D Vision
🏢 Hong Kong University of Science and Technology
Era3D: a high-resolution multiview diffusion method that uses efficient row-wise attention to generate high-quality multiview images from single views, overcoming prior limitations.
Epipolar-Free 3D Gaussian Splatting for Generalizable Novel View Synthesis
·2012 words·10 mins·
Computer Vision
3D Vision
🏢 Zhejiang University
eFreeSplat: a novel, epipolar-free 3D Gaussian splatting model for generalizable novel view synthesis, surpassing state-of-the-art methods by achieving superior geometry reconstruction and novel view …
EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views
·2364 words·12 mins·
Computer Vision
3D Vision
🏢 University of Science and Technology of China
EgoChoir: a novel framework that harmonizes visual appearance, head motion, and 3D objects to accurately estimate 3D human contact and object affordance from egocentric videos, surpassing existing methods.
EfficientCAPER: An End-to-End Framework for Fast and Robust Category-Level Articulated Object Pose Estimation
·2239 words·11 mins·
Computer Vision
3D Vision
🏢 Zhejiang University of Technology
EfficientCAPER: A novel end-to-end framework achieves fast & robust category-level articulated object pose estimation by using a joint-centric approach, eliminating post-processing optimization and en…