Posters
2024
ΟP^2: Effective Sharpness Aware Minimization Requires Layerwise Perturbation Scaling
·9260 words·44 mins·
loading
·
loading
AI Generated
AI Theory
Optimization
π’ University of TΓΌbingen
Β΅PΒ²: Layerwise perturbation scaling in SAM enables hyperparameter transfer and improved generalization in large models.
ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving
·2348 words·12 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Multimedia Laboratory, the Chinese University of Hong Kong
ZOPP: A groundbreaking framework for zero-shot offboard panoptic perception in autonomous driving, enabling high-quality 3D scene understanding without human labeling.
Zipfian Whitening
·2044 words·10 mins·
loading
·
loading
Natural Language Processing
Representation Learning
π’ Tohoku University
Zipfian Whitening: Weighting PCA whitening by word frequency dramatically improves NLP task performance, surpassing established baselines and providing a theoretical framework for existing methods.
ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
·2672 words·13 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
π’ Zhejiang University
ZipCache: Efficient KV cache quantization for LLMs using salient token identification, achieving 4.98x compression with minimal accuracy loss!
Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion
·2790 words·14 mins·
loading
·
loading
AI Theory
Sampling
π’ Georgia Institute of Technology
Zeroth-Order Diffusion Monte Carlo (ZOD-MC) efficiently samples from non-log-concave distributions using only zeroth-order queries, overcoming metastability issues and outperforming state-of-the-art s…
ZeroMark: Towards Dataset Ownership Verification without Disclosing Watermark
·3327 words·16 mins·
loading
·
loading
AI Generated
Computer Vision
Face Recognition
π’ University of Maryland College Park
ZeroMark revolutionizes dataset ownership verification by enabling copyright protection without exposing watermarks, leveraging the intrinsic properties of DNNs trained on watermarked data.
Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering
·3094 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
π’ Technion
Zero-to-Hero enhances zero-shot novel view synthesis by cleverly filtering attention maps during inference, achieving significantly higher fidelity and realism without retraining.
Zero-Shot Transfer of Neural ODEs
·1748 words·9 mins·
loading
·
loading
AI Applications
Robotics
π’ University of Texas at Austin
Zero-shot Neural ODEs enable autonomous systems to rapidly adapt to unseen scenarios by learning a space of dynamical systems spanned by neural ODE basis functions, achieving efficient online adaptati…
Zero-Shot Tokenizer Transfer
·2795 words·14 mins·
loading
·
loading
Natural Language Processing
Large Language Models
π’ University of Cambridge
Zero-Shot Tokenizer Transfer (ZeTT) detaches language models from their tokenizers via a hypernetwork, enabling efficient on-the-fly tokenizer swapping without retraining, significantly improving LLM …
Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly
·2753 words·13 mins·
loading
·
loading
Computer Vision
3D Vision
π’ Tsinghua University
Zero-shot 3D scene reconstruction from single images is achieved by assembling diverse deep priors from large models, eliminating the need for 3D/2D training data and achieving superior performance.
Zero-Shot Reinforcement Learning from Low Quality Data
·4722 words·23 mins·
loading
·
loading
AI Generated
Machine Learning
Reinforcement Learning
π’ University of Cambridge
Zero-shot RL struggles with low-quality data; this paper introduces conservative algorithms that significantly boost performance on such data without sacrificing performance on high-quality data.
Zero-shot Image Editing with Reference Imitation
·2284 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
π’ University of Hong Kong
MimicBrush: a novel image editing approach using reference imitation for intuitive zero-shot edits.
Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection
·2363 words·12 mins·
loading
·
loading
Computer Vision
Object Detection
π’ Institute of Automation, Chinese Academy of Sciences (CAS)
ZiRa achieves zero-shot generalizable incremental learning for vision-language object detection by using a memory-efficient dual-branch architecture and zero-interference loss, significantly boosting …
Zero-Shot Event-Intensity Asymmetric Stereo via Visual Prompting from Image Domain
·4096 words·20 mins·
loading
·
loading
AI Generated
Computer Vision
3D Vision
π’ Peking University
Zero-shot Event-Intensity Asymmetric Stereo (ZEST) uses visual prompting and monocular cues to achieve robust 3D perception without event-specific training, outperforming existing methods.
Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training
·2424 words·12 mins·
loading
·
loading
Machine Learning
Self-Supervised Learning
π’ UC Riverside
Diffusion models benefit from contrastive training, improving sample quality and speed by addressing poor denoiser estimation in out-of-distribution regions.
Your contrastive learning problem is secretly a distribution alignment problem
·381 words·2 mins·
loading
·
loading
Machine Learning
Self-Supervised Learning
π’ University of Toronto
Contrastive learning is reframed as a distribution alignment problem, leading to a flexible framework (GCA) that improves representation learning with unbalanced optimal transport.
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
·2987 words·15 mins·
loading
·
loading
Natural Language Processing
Text Generation
π’ University of Texas at Austin
YOUDREAM generates anatomically consistent, high-quality 3D animal models from text and 2D pose priors, pushing creative boundaries in text-to-3D generation.
You Only Look Around: Learning Illumination-Invariant Feature for Low-light Object Detection
·2686 words·13 mins·
loading
·
loading
AI Generated
Computer Vision
Object Detection
π’ Megvii Technology
YOLA: A novel framework for object detection in low-light conditions, achieving significant improvements by learning illumination-invariant features through a novel module.
You Donβt Need Domain-Specific Data Augmentations When Scaling Self-Supervised Learning
·2133 words·11 mins·
loading
·
loading
AI Generated
Machine Learning
Self-Supervised Learning
π’ FAIR at Meta
Self-supervised learning’s reliance on complex data augmentations is challenged; a large-scale study shows comparable performance using only cropping, suggesting dataset size is more important than au…
YOLOv10: Real-Time End-to-End Object Detection
·1949 words·10 mins·
loading
·
loading
Computer Vision
Object Detection
π’ Tsinghua University
YOLOv10: Real-time object detection achieves state-of-the-art speed and accuracy by eliminating NMS post-processing and holistically optimizing model architecture for efficiency and accuracy.