Computer Vision

GSGAN: Adversarial Learning for Hierarchical Generation of 3D Gaussian Splats

26 September 2024·2282 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 SungKyunKwan University

GSGAN introduces a hierarchical 3D Gaussian representation for faster, high-quality 3D model generation in GANs, achieving 100x speed improvement over existing methods.

GSDF: 3DGS Meets SDF for Improved Neural Rendering and Reconstruction

26 September 2024·2215 words·11 mins· loading · loading

Computer Vision 3D Vision 🏢 Shanghai Artificial Intelligence Laboratory

GSDF: A novel dual-branch neural scene representation elegantly resolves the rendering-reconstruction trade-off by synergistically combining 3D Gaussian Splatting and Signed Distance Fields via mutual…

GS-Hider: Hiding Messages into 3D Gaussian Splatting

26 September 2024·2889 words·14 mins· loading · loading

Computer Vision 3D Vision 🏢 Peking University

GS-Hider: A novel framework secures 3D Gaussian Splatting by embedding messages in a coupled, secured feature attribute, enabling invisible data hiding and accurate extraction.

Grid4D: 4D Decomposed Hash Encoding for High-fidelity Dynamic Gaussian Splatting

26 September 2024·2420 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Nankai University

Grid4D: A novel 4D decomposed hash encoding boosts high-fidelity dynamic Gaussian splatting, surpassing state-of-the-art models in visual quality and rendering speed.

GraphMorph: Tubular Structure Extraction by Morphing Predicted Graphs

26 September 2024·2370 words·12 mins· loading · loading

Computer Vision Image Segmentation 🏢 Peking University

GraphMorph: revolutionizing tubular structure extraction by morphing predicted graphs for superior topological accuracy.

Gradient-free Decoder Inversion in Latent Diffusion Models

26 September 2024·2408 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Seoul National University

This paper introduces a novel gradient-free decoder inversion method for latent diffusion models, improving efficiency and memory usage compared to existing gradient-based methods. The method is theo…

GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching

26 September 2024·3808 words·18 mins· loading · loading

AI Generated Computer Vision Video Understanding 🏢 School of Computer Science, National Engineering Research Center for Multimedia Software, and Institute of Artificial Intelligence, Wuhan University

GoMatching, a novel video text spotting baseline, enhances tracking efficiency while maintaining strong recognition by integrating long- and short-term matching via a Transformer-based module and a re…

Goal Conditioned Reinforcement Learning for Photo Finishing Tuning

26 September 2024·3405 words·16 mins· loading · loading

Computer Vision Image Generation 🏢 Shanghai AI Laboratory

This paper introduces a goal-conditioned reinforcement learning approach that efficiently tunes photo finishing pipelines, achieving high-quality results in fewer iterations than optimization-based me…

GL-NeRF: Gauss-Laguerre Quadrature Enables Training-Free NeRF Acceleration

26 September 2024·2622 words·13 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Carnegie Mellon University

GL-NeRF accelerates NeRF rendering by using Gauss-Laguerre quadrature, drastically reducing MLP calls without needing additional networks or data structures.

GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields

26 September 2024·1769 words·9 mins· loading · loading

Computer Vision 3D Vision 🏢 Tongji University

GeoNLF: Geometry-guided Pose-free Neural LiDAR Fields revolutionizes LiDAR point cloud processing by cleverly combining neural and geometric optimization for superior novel view synthesis and multi-vi…

Geometry Cloak: Preventing TGS-based 3D Reconstruction from Copyrighted Images

26 September 2024·4369 words·21 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Hong Kong Baptist University

Geometry Cloak embeds invisible perturbations in images to thwart AI-based 3D reconstruction, forcing the AI to generate identifiable patterns that act as watermarks to assert copyright.

Geometric Exploitation for Indoor Panoramic Semantic Segmentation

26 September 2024·3017 words·15 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 MAXST

Boosting indoor panoramic semantic segmentation, a new approach leverages geometric properties to optimize over- and under-sampled image segments for improved accuracy and robustness.

Geometric Analysis of Nonlinear Manifold Clustering

26 September 2024·1790 words·9 mins· loading · loading

Computer Vision Image Classification 🏢 Lehigh University

Guaranteed Manifold Clustering: Novel method provides geometric conditions ensuring accurate data grouping from nonlinear manifolds, showing competitive performance on CIFAR datasets.

GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation

26 September 2024·2032 words·10 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

GeoLRM: Generate stunning 3D models from just 21 images using a novel geometry-aware transformer, surpassing existing methods in efficiency and quality!

GenWarp: Single Image to Novel Views with Semantic-Preserving Generative Warping

26 September 2024·2403 words·12 mins· loading · loading

Computer Vision Image Generation 🏢 Sony AI

GenWarp generates high-quality novel image views from a single input image by using a semantic-preserving generative warping framework, outperforming existing methods.

GenRec: Unifying Video Generation and Recognition with Diffusion Models

26 September 2024·2342 words·11 mins· loading · loading

Computer Vision Video Understanding 🏢 Fudan University

GenRec: One diffusion model to rule both video generation and recognition!

Generating compositional scenes via Text-to-image RGBA Instance Generation

26 September 2024·4227 words·20 mins· loading · loading

AI Generated Computer Vision Image Generation 🏢 University of Edinburgh

This paper introduces a novel multi-stage generation framework for creating compositional scenes with fine-grained control by leveraging a trained diffusion model to produce individual scene component…

Generalize or Detect? Towards Robust Semantic Segmentation Under Multiple Distribution Shifts

26 September 2024·3159 words·15 mins· loading · loading

AI Generated Computer Vision Image Segmentation 🏢 ShanghaiTech University

This research presents a novel method for robust semantic segmentation, achieving state-of-the-art results by generating coherent images with both semantic and covariate shifts and recalibrating uncer…

Generalizable Person Re-identification via Balancing Alignment and Uniformity

26 September 2024·3010 words·15 mins· loading · loading

AI Generated Computer Vision Face Recognition 🏢 KAIST

Balancing Alignment and Uniformity (BAU) framework improves generalizable person re-identification by mitigating the polarized effects of data augmentation, achieving state-of-the-art performance.

Generalizable Implicit Motion Modeling for Video Frame Interpolation

26 September 2024·2114 words·10 mins· loading · loading

Computer Vision Video Understanding 🏢 Nanyang Technological University

Generalizable Implicit Motion Modeling (GIMM) revolutionizes video frame interpolation by accurately predicting optical flows at any timestep, surpassing existing methods and achieving state-of-the-ar…