Skip to main content

Computer Vision

Pseudo-Siamese Blind-spot Transformers for Self-Supervised Real-World Denoising
·2358 words·12 mins· loading · loading
AI Generated Computer Vision Image Denoising 🏢 South China University of Technology
SelfFormer: A novel self-supervised transformer-based method outperforms existing techniques by leveraging directional self-attention for efficient and accurate real-world image denoising.
Prune and Repaint: Content-Aware Image Retargeting for any Ratio
·2137 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 Southeast University
Prune and Repaint: A new content-aware method for superior image retargeting across any aspect ratio, preserving key features and avoiding artifacts.
ProxyFusion: Face Feature Aggregation Through Sparse Experts
·2043 words·10 mins· loading · loading
AI Generated Computer Vision Face Recognition 🏢 University at Buffalo
ProxyFusion, a novel face feature fusion method, achieves real-time performance by using sparse experts to weight features without relying on intermediate representations or metadata, substantially im…
ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Field
·1770 words·9 mins· loading · loading
Computer Vision 3D Vision 🏢 Stanford University
ProvNeRF enhances NeRF reconstruction by modeling per-point provenance as a stochastic field, improving novel view synthesis and uncertainty estimation, particularly in sparse, unconstrained view sett…
Prototypical Hash Encoding for On-the-Fly Fine-Grained Category Discovery
·3085 words·15 mins· loading · loading
Computer Vision Image Classification 🏢 University of Trento
Prototypical Hash Encoding (PHE) significantly boosts on-the-fly fine-grained category discovery by using multiple prototypes per category to generate highly discriminative hash codes, thus resolving …
PromptFix: You Prompt and We Fix the Photo
·5744 words·27 mins· loading · loading
AI Generated Computer Vision Image Generation 🏢 University of Rochester
PromptFix: a novel framework enables diffusion models to precisely follow instructions for diverse image processing tasks, using a new high-frequency guidance sampling method and an auxiliary prompt a…
Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models
·3455 words·17 mins· loading · loading
Computer Vision Image Generation 🏢 Xi'an Jiaotong University
Prompt-Agnostic Adversarial Perturbation (PAP) defends customized diffusion models against image tampering, achieving superior generalization over prompt-specific methods.
Progressive Exploration-Conformal Learning for Sparsely Annotated Object Detection in Aerial Images
·2177 words·11 mins· loading · loading
Computer Vision Object Detection 🏢 Nanjing University of Science and Technology
Progressive Exploration-Conformal Learning (PECL) revolutionizes sparsely annotated object detection in aerial images by adaptively selecting high-quality pseudo-labels, overcoming limitations of exis…
ProEdit: Simple Progression is All You Need for High-Quality 3D Scene Editing
·1818 words·9 mins· loading · loading
Computer Vision 3D Vision 🏢 University of Illinois Urbana-Champaign
ProEdit: High-quality 3D scene editing via progressive subtask decomposition.
Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors
·2800 words·14 mins· loading · loading
Computer Vision Image Generation 🏢 Department of Computing and Mathematical Sciences, Caltech
Principled Probabilistic Imaging uses diffusion models as plug-and-play priors for accurate posterior sampling in inverse problems, surpassing existing methods.
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
·4348 words·21 mins· loading · loading
Computer Vision Image Generation 🏢 City University of Hong Kong
PrefPaint: Aligning image inpainting diffusion models with human preferences using reinforcement learning, resulting in significantly improved visual appeal.
PPLNs: Parametric Piecewise Linear Networks for Event-Based Temporal Modeling and Beyond
·1984 words·10 mins· loading · loading
Computer Vision 3D Vision 🏢 University of Texas at Austin
Parametric Piecewise Linear Networks (PPLNs) achieve state-of-the-art results in event-based and frame-based computer vision tasks by mimicking biological neural principles.
Polyhedral Complex Derivation from Piecewise Trilinear Networks
·2972 words·14 mins· loading · loading
Computer Vision 3D Vision 🏢 NAVER AI Lab
This paper presents a novel method for analytically extracting meshes from neural implicit surface networks using trilinear interpolation, offering theoretical insights and practical efficiency.
PointMamba: A Simple State Space Model for Point Cloud Analysis
·2563 words·13 mins· loading · loading
Computer Vision 3D Vision 🏢 Huazhong University of Science & Technology
PointMamba: A linear-complexity state space model achieving superior performance in point cloud analysis, reducing computational cost significantly.
PointAD: Comprehending 3D Anomalies from Points and Pixels for Zero-shot 3D Anomaly Detection
·5033 words·24 mins· loading · loading
AI Generated Computer Vision 3D Vision 🏢 Zhejiang University
PointAD: a novel zero-shot 3D anomaly detection method using CLIP’s strong generalization abilities to identify anomalies in unseen objects by transferring knowledge from both points and pixels.
Point-PRC: A Prompt Learning Based Regulation Framework for Generalizable Point Cloud Analysis
·2874 words·14 mins· loading · loading
Computer Vision 3D Vision 🏢 Department of Computer Science, Renmin University of China
Point-PRC improves generalizable 3D point cloud analysis by regulating prompt learning to harmonize task-specific and general knowledge within large 3D models.
PLIP: Language-Image Pre-training for Person Representation Learning
·3449 words·17 mins· loading · loading
Computer Vision Representation Learning 🏢 National Key Laboratory of Multispectral Information Intelligent Processing Technology, School of Artificial Intelligence and Automation, Huazhong University of Science and Technology
PLIP: Novel language-image pre-training framework excels at person representation learning, surpassing existing methods on various downstream tasks thanks to its three pretext tasks and large-scale SY…
Physics-Constrained Comprehensive Optical Neural Networks
·1493 words·8 mins· loading · loading
Computer Vision Image Classification 🏢 Beijing University of Posts and Telecommunications
Physics-constrained learning significantly boosts optical neural network accuracy by addressing systematic physical errors, achieving state-of-the-art results on image classification tasks.
PhyRecon: Physically Plausible Neural Scene Reconstruction
·2451 words·12 mins· loading · loading
Computer Vision 3D Vision 🏢 Tsinghua University
PHYRECON: A novel neural scene reconstruction method uses differentiable rendering and physics simulation for physically plausible 3D models.
Phased Consistency Models
·5013 words·24 mins· loading · loading
Computer Vision Image Generation 🏢 Hong Kong University of Science and Technology
Phased Consistency Models (PCMs) revolutionize diffusion model generation by overcoming LCM limitations, achieving superior speed and quality in image and video generation.