3D Vision
LoCo: Learning 3D Location-Consistent Image Features with a Memory-Efficient Ranking Loss
·1960 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Oxford
LoCo: Memory-efficient location-consistent image features learned via a novel ranking loss, enabling three orders of magnitude memory improvement and outperforming state-of-the-art.
LinNet: Linear Network for Efficient Point Cloud Representation Learning
·2362 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Northwest University
LinNet: A linear-time point cloud network achieving 10x speedup over PointNeXt, with state-of-the-art accuracy on various benchmarks.
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis
·3953 words·19 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 Nankai University
LE3D: Real-time HDR view synthesis from noisy RAW images is achieved using 3DGS, significantly reducing training time and improving rendering speed.
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS
·2198 words·11 mins·
loading
·
loading
3D Vision
🏢 University of Texas at Austin
LightGaussian achieves 15x compression of 3D Gaussian scene representations, boosting rendering speed to 200+ FPS while maintaining visual quality, solving storage and efficiency issues in real-time n…
Learning to be Smooth: An End-to-End Differentiable Particle Smoother
·2507 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 UC Irvine
Learned Mixture Density Particle Smoother (MDPS) surpasses state-of-the-art for accurate, differentiable city-scale vehicle localization.
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
·2288 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Shenzhen Campus of Sun Yat-Sen University
Create animatable interacting hand avatars from a single image using a novel two-stage interaction-aware 3D Gaussian splatting framework!
Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization
·1608 words·8 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 Cooperative Medianet Innovation Center, Shanghai Jiao Tong University
DisPA: a novel disentangled representation learning framework for perceptual point cloud quality assessment achieves superior performance by minimizing mutual information between content and distortio…
Learning 3D Garment Animation from Trajectories of A Piece of Cloth
·2097 words·10 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Nanyang Technological University
Animates diverse garments realistically from a single cloth’s trajectory using a disentangled learning approach and Energy Unit Network (EUNet).
Learning 3D Equivariant Implicit Function with Patch-Level Pose-Invariant Representation
·2788 words·14 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Xi'an Jiaotong University
3D surface reconstruction revolutionized: PEIF leverages patch-level pose-invariant representations and 3D patch-level equivariance for state-of-the-art accuracy, even with varied poses and datasets!
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling
·2913 words·14 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 Tsinghua University
LCM: a novel, locally constrained, compact point cloud model surpasses Transformer-based methods by significantly improving performance and efficiency in various downstream tasks.
Large Spatial Model: End-to-end Unposed Images to Semantic 3D
·1766 words·9 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 NVIDIA Research
Large Spatial Model (LSM) achieves real-time semantic 3D reconstruction from just two unposed images, unifying multiple 3D vision tasks in a single feed-forward pass.
LAM3D: Large Image-Point Clouds Alignment Model for 3D Reconstruction from Single Image
·2617 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
🏢 Australian National University
LAM3D: A novel framework uses point cloud data to boost single-image 3D mesh reconstruction accuracy, achieving state-of-the-art results in just 6 seconds.
L4GM: Large 4D Gaussian Reconstruction Model
·2618 words·13 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Toronto
L4GM: The first 4D model generating high-quality animated 3D objects from single-view videos in a single feed-forward pass.
Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features
·2426 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Peking University
Key-Grid: An unsupervised 3D keypoint detector achieving state-of-the-art semantic consistency and accuracy for both rigid and deformable objects using novel grid heatmap features.
Inferring Neural Signed Distance Functions by Overfitting on Single Noisy Point Clouds through Finetuning Data-Driven based Priors
·3586 words·17 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Tsinghua University
This research presents LocalN2NM, a novel method for inferring neural signed distance functions (SDF) from single, noisy point clouds by finetuning data-driven priors, achieving faster inference and b…
Incorporating Test-Time Optimization into Training with Dual Networks for Human Mesh Recovery
·2718 words·13 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 South China University of Technology
Meta-learning enhances human mesh recovery by unifying training and test-time objectives, significantly improving accuracy and generalization.
In-N-Out: Lifting 2D Diffusion Prior for 3D Object Removal via Tuning-Free Latents Alignment
·2437 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Melbourne
In-N-Out: Lifting 2D Diffusion Priors for 3D Object Removal via Tuning-Free Latents Alignment enhances 3D scene reconstruction by aligning 2D diffusion model latents for consistent multi-view inpainti…
Improving Robustness of 3D Point Cloud Recognition from a Fourier Perspective
·2312 words·11 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Chinese Academy of Sciences
Boosting 3D point cloud recognition robustness, Frequency Adversarial Training (FAT) leverages frequency-domain adversarial examples to improve model resilience against corruptions, achieving state-of…
ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images
·2938 words·14 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Tsinghua University
ImOV3D: Revolutionizing open-vocabulary 3D object detection by learning from 2D images alone!
IllumiNeRF: 3D Relighting Without Inverse Rendering
·2411 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 Google Research
IllumiNeRF: Relightable 3D reconstruction without inverse rendering using image diffusion and NeRF.