3D Vision

Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection
·2705 words·13 mins
Computer Vision 3D Vision 🏒 Hong Kong University of Science and Technology
Object-centric occupancy completion boosts 3D object detection accuracy by using temporal information from long sequences to precisely reconstruct object shapes, particularly for incomplete or distant…
Toward Dynamic Non-Line-of-Sight Imaging with Mamba Enforced Temporal Consistency
·2152 words·11 mins
Computer Vision 3D Vision 🏒 University of Science and Technology of China
Dynamic NLOS imaging gets a speed boost! New ST-Mamba method leverages temporal consistency across frames for high-resolution video reconstruction, overcoming speed limitations of traditional methods.
Toward Approaches to Scalability in 3D Human Pose Estimation
·2344 words·12 mins
Computer Vision 3D Vision 🏒 Korea University
Boosting 3D human pose estimation: Biomechanical Pose Generator and Binary Depth Coordinates enhance accuracy and scalability.
TFS-NeRF: Template-Free NeRF for Semantic 3D Reconstruction of Dynamic Scene
·2695 words·13 mins
AI Generated Computer Vision 3D Vision 🏒 Faculty of IT, Monash University
TFS-NeRF: A template-free neural radiance field efficiently reconstructs semantically separable 3D geometries of dynamic scenes featuring multiple interacting entities from sparse RGB videos.
Tetrahedron Splatting for 3D Generation
·2346 words·12 mins
3D Vision 🏒 Fudan University
TeT-Splatting: a novel 3D representation enabling fast convergence, real-time rendering, and precise mesh extraction for high-fidelity 3D generation.
Tensor-Based Synchronization and the Low-Rankness of the Block Trifocal Tensor
·1554 words·8 mins
Computer Vision 3D Vision 🏒 University of Minnesota
Low-rank block trifocal tensor unlocks accurate, efficient camera pose synchronization.
Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis
·2005 words·10 mins
Computer Vision 3D Vision 🏒 Peking University
This research introduces a template-free articulated Gaussian splatting method for real-time dynamic view synthesis, automatically discovering object skeletons from videos to enable reposing.
Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world Corruptions
·3740 words·18 mins
AI Generated Computer Vision 3D Vision 🏒 Beijing Institute of Technology
APCT: a novel architecture enhances 3D point cloud recognition by using an adversarial feature erasing mechanism to improve global structure capture and robustness against real-world corruptions.
Subsurface Scattering for Gaussian Splatting
·2275 words·11 mins
Computer Vision 3D Vision 🏒 University of Tübingen
Real-time rendering of objects with subsurface scattering effects is now possible with SSS-GS, a novel method combining explicit surface geometry and implicit subsurface scattering for high-quality no…
STONE: A Submodular Optimization Framework for Active 3D Object Detection
·2151 words·11 mins
AI Generated Computer Vision 3D Vision 🏒 University of Texas at Dallas
STONE: A novel submodular optimization framework drastically cuts 3D object detection training costs by cleverly selecting the most informative LiDAR point cloud data for labeling, achieving state-of-…
SplitNeRF: Split Sum Approximation Neural Field for Joint Geometry, Illumination, and Material Estimation
·5201 words·25 mins
AI Generated Computer Vision 3D Vision 🏒 King Abdullah University of Science and Technology
SplitNeRF: One-hour training on a single GPU yields state-of-the-art scene geometry, lighting, and material property estimation!
Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation
·2477 words·12 mins
Computer Vision 3D Vision 🏒 Shanghai Jiao Tong University
SFCNet, a novel spherical frustum sparse convolution network, tackles LiDAR point cloud semantic segmentation by eliminating quantized information loss, leading to superior performance, especially for…
SpelsNet: Surface Primitive Elements Segmentation by B-Rep Graph Structure Supervision
·1917 words·9 mins
Computer Vision 3D Vision 🏒 University of Luxembourg
SpelsNet, a novel neural architecture, achieves accurate 3D point cloud segmentation into surface primitives by incorporating B-Rep graph structure supervision, leading to topologically consistent res…
Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting
·3727 words·18 mins
AI Generated Computer Vision 3D Vision 🏒 Zhejiang University
Spec-Gaussian enhances 3D Gaussian splatting by using anisotropic spherical Gaussians for view-dependent appearance modeling, achieving superior real-time rendering of scenes with specular and anisotr…
Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis
·2245 words·11 mins
Computer Vision 3D Vision 🏒 Carnegie Mellon University
SparseAGS: High-fidelity 3D reconstruction & camera pose estimation from sparse views via generative synthesis.
SfPUEL: Shape from Polarization under Unknown Environment Light
·2725 words·13 mins
Computer Vision 3D Vision 🏒 Peking University
SfPUEL: A novel end-to-end shape-from-polarization (SfP) method achieves robust single-shot surface normal estimation under diverse lighting, integrating photometric stereo (PS) priors and material segmentation.
Semi-Open 3D Object Retrieval via Hierarchical Equilibrium on Hypergraph
·2346 words·12 mins
AI Generated Computer Vision 3D Vision 🏒 Tsinghua University
HERT: a novel framework for semi-open 3D object retrieval using hierarchical hypergraph equilibrium, achieving state-of-the-art performance on four new benchmark datasets.
Self-Distilled Depth Refinement with Noisy Poisson Fusion
·2691 words·13 mins
Computer Vision 3D Vision 🏒 Huazhong University of Science and Technology
Self-Distilled Depth Refinement (SDDR) tackles noisy depth maps via a novel noisy Poisson fusion approach, achieving significant improvements in depth accuracy and edge quality.
SE(3)-bi-equivariant Transformers for Point Cloud Assembly
·3085 words·15 mins
AI Generated Computer Vision 3D Vision 🏒 University of Gothenburg
SE(3)-bi-equivariant Transformers (BITR) revolutionize point cloud assembly by guaranteeing robust alignment even with non-overlapping clouds, thanks to their unique equivariance properties.
SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
·3116 words·15 mins
Computer Vision 3D Vision 🏒 University of Toronto
SCube: Instant large-scale 3D scene reconstruction from sparse images using VoxSplats, a novel 3D Gaussian splat representation.