Computer Vision

GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats
·2282 words·11 mins
Computer Vision 3D Vision 🏢 SungKyunKwan University
GSGAN introduces a hierarchical 3D Gaussian representation for faster, high-quality 3D model generation in GANs, achieving a 100x speed improvement over existing methods.
GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction
·2215 words·11 mins
Computer Vision 3D Vision 🏢 Shanghai Artificial Intelligence Laboratory
GSDF: A novel dual-branch neural scene representation elegantly resolves the rendering-reconstruction trade-off by synergistically combining 3D Gaussian Splatting and Signed Distance Fields via mutual…
GS-Hider: Hiding Messages into 3D Gaussian Splatting
·2889 words·14 mins
Computer Vision 3D Vision 🏢 Peking University
GS-Hider: A novel framework secures 3D Gaussian Splatting by embedding messages in a coupled, secured feature attribute, enabling invisible data hiding and accurate extraction.
Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting
·2420 words·12 mins
Computer Vision 3D Vision 🏢 Nankai University
Grid4D: A novel 4D decomposed hash encoding boosts high-fidelity dynamic Gaussian splatting, surpassing state-of-the-art models in visual quality and rendering speed.
GraphMorph: Tubular Structure Extraction by Morphing Predicted Graphs
·2370 words·12 mins
Computer Vision Image Segmentation 🏢 Peking University
GraphMorph: extracting tubular structures by morphing predicted graphs for superior topological accuracy.
Gradient-free Decoder Inversion in Latent Diffusion Models
·2408 words·12 mins
Computer Vision Image Generation 🏢 Seoul National University
This paper introduces a novel gradient-free decoder inversion method for latent diffusion models, improving efficiency and memory usage compared to existing gradient-based methods. The method is theo…
GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching
·3808 words·18 mins
AI Generated Computer Vision Video Understanding 🏢 School of Computer Science, National Engineering Research Center for Multimedia Software, and Institute of Artificial Intelligence, Wuhan University
GoMatching, a novel video text spotting baseline, enhances tracking efficiency while maintaining strong recognition by integrating long- and short-term matching via a Transformer-based module and a re…
Goal Conditioned Reinforcement Learning for Photo Finishing Tuning
·3405 words·16 mins
Computer Vision Image Generation 🏢 Shanghai AI Laboratory
This paper introduces a goal-conditioned reinforcement learning approach that efficiently tunes photo finishing pipelines, achieving high-quality results in fewer iterations than optimization-based me…
GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration
·2622 words·13 mins
AI Generated Computer Vision 3D Vision 🏢 Carnegie Mellon University
GL-NeRF accelerates NeRF rendering by using Gauss-Laguerre quadrature, drastically reducing MLP calls without needing additional networks or data structures.
GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields
·1769 words·9 mins
Computer Vision 3D Vision 🏢 Tongji University
GeoNLF: Geometry-guided Pose-free Neural LiDAR Fields revolutionizes LiDAR point cloud processing by cleverly combining neural and geometric optimization for superior novel view synthesis and multi-vi…
Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images
·4369 words·21 mins
AI Generated Computer Vision 3D Vision 🏢 Hong Kong Baptist University
Geometry Cloak embeds invisible perturbations in images to thwart AI-based 3D reconstruction, forcing the AI to generate identifiable patterns that act as watermarks to assert copyright.
Geometric Exploitation for Indoor Panoramic Semantic Segmentation
·3017 words·15 mins
AI Generated Computer Vision Image Segmentation 🏢 MAXST
A new approach boosts indoor panoramic semantic segmentation by leveraging geometric properties to optimize over- and under-sampled image segments, improving accuracy and robustness.
Geometric Analysis of Nonlinear Manifold Clustering
·1790 words·9 mins
Computer Vision Image Classification 🏢 Lehigh University
Guaranteed Manifold Clustering: A novel method provides geometric conditions that ensure accurate grouping of data from nonlinear manifolds, and shows competitive performance on CIFAR datasets.
GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation
·2032 words·10 mins
Computer Vision 3D Vision 🏢 Tsinghua University
GeoLRM: Generate stunning 3D models from just 21 images using a novel geometry-aware transformer, surpassing existing methods in efficiency and quality!
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping
·2403 words·12 mins
Computer Vision Image Generation 🏢 Sony AI
GenWarp generates high-quality novel image views from a single input image by using a semantic-preserving generative warping framework, outperforming existing methods.
GenRec: Unifying Video Generation and Recognition with Diffusion Models
·2342 words·11 mins
Computer Vision Video Understanding 🏢 Fudan University
GenRec: One diffusion model to rule both video generation and recognition!
Generating compositional scenes via Text-to-image RGBA Instance Generation
·4227 words·20 mins
AI Generated Computer Vision Image Generation 🏢 University of Edinburgh
This paper introduces a novel multi-stage generation framework for creating compositional scenes with fine-grained control by leveraging a trained diffusion model to produce individual scene component…
Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts
·3159 words·15 mins
AI Generated Computer Vision Image Segmentation 🏢 ShanghaiTech University
This research presents a novel method for robust semantic segmentation, achieving state-of-the-art results by generating coherent images with both semantic and covariate shifts and recalibrating uncer…
Generalizable Person Re-identification via Balancing Alignment and Uniformity
·3010 words·15 mins
AI Generated Computer Vision Face Recognition 🏢 KAIST
The Balancing Alignment and Uniformity (BAU) framework improves generalizable person re-identification by mitigating the polarized effects of data augmentation, achieving state-of-the-art performance.
Generalizable Implicit Motion Modeling for Video Frame Interpolation
·2114 words·10 mins
Computer Vision Video Understanding 🏢 Nanyang Technological University
Generalizable Implicit Motion Modeling (GIMM) revolutionizes video frame interpolation by accurately predicting optical flows at any timestep, surpassing existing methods and achieving state-of-the-ar…