Object Detection
Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection
·2363 words·12 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Institute of Automation, Chinese Academy of Sciences (CAS)
ZiRa achieves zero-shot generalizable incremental learning for vision-language object detection by using a memory-efficient dual-branch architecture and zero-interference loss, significantly boosting …
You Only Look Around: Learning Illumination-Invariant Feature for Low-light Object Detection
·2686 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 Megvii Technology
YOLA: A novel framework for object detection in low-light conditions, achieving significant improvements by learning illumination-invariant features through a novel module.
YOLOv10: Real-Time End-to-End Object Detection
·1949 words·10 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Tsinghua University
YOLOv10: Real-time object detection achieves state-of-the-art speed and accuracy by eliminating NMS post-processing and holistically optimizing model architecture for efficiency and accuracy.
Unsupervised Object Detection with Theoretical Guarantees
·2140 words·11 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 University of Oxford
First unsupervised object detection method with theoretical guarantees to recover true object positions, up to quantifiable small shifts!
UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes
·2470 words·12 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 Delft University of Technology
UNION: Unsupervised 3D object detection method doubles average precision, leveraging LiDAR, camera, and temporal data for efficient training without manual labels.
UMB: Understanding Model Behavior for Open-World Object Detection
·3512 words·17 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 South China University of Technology
UMB: A novel model enhances open-world object detection by understanding model behavior, surpassing state-of-the-art with a 5.3 mAP gain for unknown classes.
Towards Unsupervised Model Selection for Domain Adaptive Object Detection
·1885 words·9 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 University of Electronic Science and Technology of China
Unsupervised model selection for domain adaptive object detection is achieved via a new Detection Adaptation Score (DAS), effectively selecting optimal models without target labels by leveraging the f…
Spiking Neural Network as Adaptive Event Stream Slicer
·2956 words·14 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Hong Kong University of Science and Technology
SpikeSlicer: An adaptive event stream slicer using a spiking neural network (SNN) to efficiently split events for improved downstream processing in object tracking and recognition.
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection
·2771 words·14 mins·
loading
·
loading
Object Detection
🏢 Nankai University
SARDet-100K: A new benchmark dataset and open-source toolkit revolutionizes large-scale SAR object detection.
Revisiting motion information for RGB-Event tracking with MOT philosophy
·2713 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 Tsinghua University
RGB-Event tracker CSAM leverages MOT philosophy for enhanced robustness by integrating appearance and motion information from RGB and event streams, achieving state-of-the-art performance.
Revisiting Adversarial Patches for Designing Camera-Agnostic Attacks against Person Detection
·1724 words·9 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Peking University
Researchers developed Camera-Agnostic Patch (CAP) attacks, improving adversarial patch reliability by simulating camera image processing in attacks against person detectors.
RETR: Multi-View Radar Detection Transformer for Indoor Perception
·4299 words·21 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
🏢 Mitsubishi Electric Research Laboratories
RETR: Multi-view radar detection transformer significantly improves indoor object detection and segmentation.
Real-time Stereo-based 3D Object Detection for Streaming Perception
·2407 words·12 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Sun Yat-Sen University
StreamDSGN: a real-time stereo 3D object detection framework significantly boosts streaming perception accuracy by leveraging historical information, a feature-flow fusion method, and a motion consist…
Progressive Exploration-Conformal Learning for Sparsely Annotated Object Detection in Aerial Images
·2177 words·11 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Nanjing University of Science and Technology
Progressive Exploration-Conformal Learning (PECL) revolutionizes sparsely annotated object detection in aerial images by adaptively selecting high-quality pseudo-labels, overcoming limitations of exis…
Parameter-Inverted Image Pyramid Networks
·2381 words·12 mins·
loading
·
loading
Object Detection
🏢 Tsinghua University
Parameter-Inverted Image Pyramid Networks (PIIP) boost image pyramid efficiency by using smaller models for higher-resolution images and larger models for lower-resolution ones, achieving superior per…
Open-Vocabulary Object Detection via Language Hierarchy
·2960 words·14 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Nanyang Technological University
Language Hierarchical Self-training (LHST) enhances weakly-supervised object detection by integrating language hierarchy, mitigating label mismatch, and improving generalization across diverse dataset…
ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models
·4424 words·21 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Tsinghua University
ODGEN: Boosting object detection accuracy by generating high-quality synthetic images using diffusion models conditioned on bounding boxes and text descriptions.
MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders
·1750 words·9 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 UCAS-Terminus AI Lab, University of Chinese Academy of Sciences, China
MonoMAE enhances monocular 3D object detection by using depth-aware masked autoencoders to effectively handle object occlusions, achieving superior performance on both occluded and non-occluded object…
Long-Tailed Out-of-Distribution Detection via Normalized Outlier Distribution Adaptation
·2335 words·11 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Beihang University
AdaptOD: a novel approach for robust OOD detection in long-tailed recognition, dynamically adapting outlier distributions to true OOD distributions using a dual-normalized energy loss for improved acc…
Long-tailed Object Detection Pretraining: Dynamic Rebalancing Contrastive Learning with Dual Reconstruction
·2225 words·11 mins·
loading
·
loading
Computer Vision
Object Detection
🏢 Nanjing University of Science and Technology
Dynamic Rebalancing Contrastive Learning with Dual Reconstruction (2DRCL) pre-training significantly boosts object detection accuracy, especially for underrepresented classes.