3D Vision

LoCo: Learning 3D Location-Consistent Image Features with a Memory-Efficient Ranking Loss

26 September 2024·1960 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Oxford

LoCo: Memory-efficient location-consistent image features learned via a novel ranking loss, enabling three orders of magnitude memory improvement and outperforming state-of-the-art.

LinNet: Linear Network for Efficient Point Cloud Representation Learning

26 September 2024·2362 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Northwest University

LinNet: A linear-time point cloud network achieving 10x speedup over PointNeXt, with state-of-the-art accuracy on various benchmarks.

Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis

26 September 2024·3953 words·19 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Nankai University

LE3D: Real-time HDR view synthesis from noisy RAW images is achieved using 3DGS, significantly reducing training time and improving rendering speed.

LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS

26 September 2024·2198 words·11 mins· loading · loading

3D Vision 🏢 University of Texas at Austin

LightGaussian achieves 15x compression of 3D Gaussian scene representations, boosting rendering speed to 200+ FPS while maintaining visual quality, solving storage and efficiency issues in real-time n…

Learning to be Smooth: An End-to-End Differentiable Particle Smoother

26 September 2024·2507 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 UC Irvine

Learned Mixture Density Particle Smoother (MDPS) surpasses state-of-the-art for accurate, differentiable city-scale vehicle localization.

Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars

26 September 2024·2288 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 Shenzhen Campus of Sun Yat-Sen University

Create animatable interacting hand avatars from a single image using a novel two-stage interaction-aware 3D Gaussian splatting framework!

Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization

26 September 2024·1608 words·8 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Cooperative Medianet Innovation Center, Shanghai Jiao Tong University

DisPA: a novel disentangled representation learning framework for perceptual point cloud quality assessment achieves superior performance by minimizing mutual information between content and distortio…

Learning 3D Garment Animation from Trajectories of A Piece of Cloth

26 September 2024·2097 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Nanyang Technological University

Animates diverse garments realistically from a single cloth’s trajectory using a disentangled learning approach and Energy Unit Network (EUNet).

Learning 3D Equivariant Implicit Function with Patch-Level Pose-Invariant Representation

26 September 2024·2788 words·14 mins· loading · loading

Computer Vision 3D Vision 🏢 Xi'an Jiaotong University

3D surface reconstruction revolutionized: PEIF leverages patch-level pose-invariant representations and 3D patch-level equivariance for state-of-the-art accuracy, even with varied poses and datasets!

LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling

26 September 2024·2913 words·14 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Tsinghua University

LCM: a novel, locally constrained, compact point cloud model surpasses Transformer-based methods by significantly improving performance and efficiency in various downstream tasks.

Large Spatial Model: End-to-end Unposed Images to Semantic 3D

26 September 2024·1766 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 NVIDIA Research

Large Spatial Model (LSM) achieves real-time semantic 3D reconstruction from just two unposed images, unifying multiple 3D vision tasks in a single feed-forward pass.

LAM3D: Large Image-Point Clouds Alignment Model for 3D Reconstruction from Single Image

26 September 2024·2617 words·13 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Australian National University

LAM3D: A novel framework uses point cloud data to boost single-image 3D mesh reconstruction accuracy, achieving state-of-the-art results in just 6 seconds.

L4GM: Large 4D Gaussian Reconstruction Model

26 September 2024·2618 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Toronto

L4GM: The first 4D model generating high-quality animated 3D objects from single-view videos in a single feed-forward pass.

Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features

26 September 2024·2426 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Peking University

Key-Grid: An unsupervised 3D keypoint detector achieving state-of-the-art semantic consistency and accuracy for both rigid and deformable objects using novel grid heatmap features.

Inferring Neural Signed Distance Functions by Overfitting on Single Noisy Point Clouds through Finetuning Data-Driven based Priors

26 September 2024·3586 words·17 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

This research presents LocalN2NM, a novel method for inferring neural signed distance functions (SDF) from single, noisy point clouds by finetuning data-driven priors, achieving faster inference and b…

Incorporating Test-Time Optimization into Training with Dual Networks for Human Mesh Recovery

26 September 2024·2718 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 South China University of Technology

Meta-learning enhances human mesh recovery by unifying training and test-time objectives, significantly improving accuracy and generalization.

In-N-Out: Lifting 2D Diffusion Prior for 3D Object Removal via Tuning-Free Latents Alignment

26 September 2024·2437 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 University of Melbourne

In-N-Out: Lifting 2D Diffusion Priors for 3D Object Removal via Tuning-Free Latents Alignment enhances 3D scene reconstruction by aligning 2D diffusion model latents for consistent multi-view inpainti…

Improving Robustness of 3D Point Cloud Recognition from a Fourier Perspective

26 September 2024·2312 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 Chinese Academy of Sciences

Boosting 3D point cloud recognition robustness, Frequency Adversarial Training (FAT) leverages frequency-domain adversarial examples to improve model resilience against corruptions, achieving state-of…

ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images

26 September 2024·2938 words·14 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

ImOV3D: Revolutionizing open-vocabulary 3D object detection by learning from 2D images alone!

IllumiNeRF: 3D Relighting Without Inverse Rendering

26 September 2024·2411 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Google Research

IllumiNeRF: Relightable 3D reconstruction without inverse rendering using image diffusion and NeRF.