Posters

πP^2: Effective Sharpness Aware Minimization Requires Layerwise Perturbation Scaling

26 September 2024·9260 words·44 mins· loading · loading

AI Generated AI Theory Optimization 🏢 University of Tübingen

µP²: Layerwise perturbation scaling in SAM enables hyperparameter transfer and improved generalization in large models.

ZOPP: A Framework of Zero-shot Offboard Panoptic Perception for Autonomous Driving

26 September 2024·2348 words·12 mins· loading · loading

Computer Vision 3D Vision 🏢 Multimedia Laboratory, the Chinese University of Hong Kong

ZOPP: A groundbreaking framework for zero-shot offboard panoptic perception in autonomous driving, enabling high-quality 3D scene understanding without human labeling.

Zipfian Whitening

26 September 2024·2044 words·10 mins· loading · loading

Natural Language Processing Representation Learning 🏢 Tohoku University

Zipfian Whitening: Weighting PCA whitening by word frequency dramatically improves NLP task performance, surpassing established baselines and providing a theoretical framework for existing methods.

ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification

26 September 2024·2672 words·13 mins· loading · loading

AI Generated Natural Language Processing Large Language Models 🏢 Zhejiang University

ZipCache: Efficient KV cache quantization for LLMs using salient token identification, achieving 4.98x compression with minimal accuracy loss!

Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion

26 September 2024·2790 words·14 mins· loading · loading

AI Theory Sampling 🏢 Georgia Institute of Technology

Zeroth-Order Diffusion Monte Carlo (ZOD-MC) efficiently samples from non-log-concave distributions using only zeroth-order queries, overcoming metastability issues and outperforming state-of-the-art s…

ZeroMark: Towards Dataset Ownership Verification without Disclosing Watermark

26 September 2024·3327 words·16 mins· loading · loading

AI Generated Computer Vision Face Recognition 🏢 University of Maryland College Park

ZeroMark revolutionizes dataset ownership verification by enabling copyright protection without exposing watermarks, leveraging the intrinsic properties of DNNs trained on watermarked data.

Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering

26 September 2024·3094 words·15 mins· loading · loading

Computer Vision Image Generation 🏢 Technion

Zero-to-Hero enhances zero-shot novel view synthesis by cleverly filtering attention maps during inference, achieving significantly higher fidelity and realism without retraining.

Zero-Shot Transfer of Neural ODEs

26 September 2024·1748 words·9 mins· loading · loading

AI Applications Robotics 🏢 University of Texas at Austin

Zero-shot Neural ODEs enable autonomous systems to rapidly adapt to unseen scenarios by learning a space of dynamical systems spanned by neural ODE basis functions, achieving efficient online adaptati…

Zero-Shot Tokenizer Transfer

26 September 2024·2795 words·14 mins· loading · loading

Natural Language Processing Large Language Models 🏢 University of Cambridge

Zero-Shot Tokenizer Transfer (ZeTT) detaches language models from their tokenizers via a hypernetwork, enabling efficient on-the-fly tokenizer swapping without retraining, significantly improving LLM …

Zero-Shot Scene Reconstruction from Single Images with Deep Prior Assembly

26 September 2024·2753 words·13 mins· loading · loading

Computer Vision 3D Vision 🏢 Tsinghua University

Zero-shot 3D scene reconstruction from single images is achieved by assembling diverse deep priors from large models, eliminating the need for 3D/2D training data and achieving superior performance.

Zero-Shot Reinforcement Learning from Low Quality Data

26 September 2024·4722 words·23 mins· loading · loading

AI Generated Machine Learning Reinforcement Learning 🏢 University of Cambridge

Zero-shot RL struggles with low-quality data; this paper introduces conservative algorithms that significantly boost performance on such data without sacrificing performance on high-quality data.

Zero-shot Image Editing with Reference Imitation

26 September 2024·2284 words·11 mins· loading · loading

Computer Vision Image Generation 🏢 University of Hong Kong

MimicBrush: a novel image editing approach using reference imitation for intuitive zero-shot edits.

Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection

26 September 2024·2363 words·12 mins· loading · loading

Computer Vision Object Detection 🏢 Institute of Automation, Chinese Academy of Sciences (CAS)

ZiRa achieves zero-shot generalizable incremental learning for vision-language object detection by using a memory-efficient dual-branch architecture and zero-interference loss, significantly boosting …

Zero-Shot Event-Intensity Asymmetric Stereo via Visual Prompting from Image Domain

26 September 2024·4096 words·20 mins· loading · loading

AI Generated Computer Vision 3D Vision 🏢 Peking University

Zero-shot Event-Intensity Asymmetric Stereo (ZEST) uses visual prompting and monocular cues to achieve robust 3D perception without event-specific training, outperforming existing methods.

Your Diffusion Model is Secretly a Noise Classifier and Benefits from Contrastive Training

26 September 2024·2424 words·12 mins· loading · loading

Machine Learning Self-Supervised Learning 🏢 UC Riverside

Diffusion models benefit from contrastive training, improving sample quality and speed by addressing poor denoiser estimation in out-of-distribution regions.

Your contrastive learning problem is secretly a distribution alignment problem

26 September 2024·381 words·2 mins· loading · loading

Machine Learning Self-Supervised Learning 🏢 University of Toronto

Contrastive learning is reframed as a distribution alignment problem, leading to a flexible framework (GCA) that improves representation learning with unbalanced optimal transport.

YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals

26 September 2024·2987 words·15 mins· loading · loading

Natural Language Processing Text Generation 🏢 University of Texas at Austin

YOUDREAM generates anatomically consistent, high-quality 3D animal models from text and 2D pose priors, pushing creative boundaries in text-to-3D generation.

You Only Look Around: Learning Illumination-Invariant Feature for Low-light Object Detection

26 September 2024·2686 words·13 mins· loading · loading

AI Generated Computer Vision Object Detection 🏢 Megvii Technology

YOLA: A novel framework for object detection in low-light conditions, achieving significant improvements by learning illumination-invariant features through a novel module.

You Don’t Need Domain-Specific Data Augmentations When Scaling Self-Supervised Learning

26 September 2024·2133 words·11 mins· loading · loading

AI Generated Machine Learning Self-Supervised Learning 🏢 FAIR at Meta

Self-supervised learning’s reliance on complex data augmentations is challenged; a large-scale study shows comparable performance using only cropping, suggesting dataset size is more important than au…

YOLOv10: Real-Time End-to-End Object Detection

26 September 2024·1949 words·10 mins· loading · loading

Computer Vision Object Detection 🏢 Tsinghua University

YOLOv10: Real-time object detection achieves state-of-the-art speed and accuracy by eliminating NMS post-processing and holistically optimizing model architecture for efficiency and accuracy.