Spotlight Others
2024
Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference
·1463 words·7 mins·
loading
·
loading
π’ Hong Kong University of Science and Technology
Reverse Transition Kernel (RTK) framework accelerates diffusion inference by enabling balanced subproblem decomposition, achieving superior convergence rates with RTK-MALA and RTK-ULD algorithms.
Rethinking 3D Convolution in $ll_p$-norm Space
·1754 words·9 mins·
loading
·
loading
3D Vision
π’ University of Chinese Academy of Sciences
L1-norm based 3D convolution achieves competitive performance with lower energy consumption and latency compared to traditional methods, as proven through universal approximation theorem and experimen…
ResAD: A Simple Framework for Class Generalizable Anomaly Detection
·2059 words·10 mins·
loading
·
loading
Anomaly Detection
π’ Shanghai Jiao Tong University
ResAD, a novel framework, tackles class-generalizable anomaly detection by learning residual feature distributions, achieving remarkable results on diverse datasets without retraining.
Reproducibility of predictive networks for mouse visual cortex
·2493 words·12 mins·
loading
·
loading
π’ Max Planck Institute for Dynamics and Self-Organization
Deep learning models for neural activity lack reproducibility; this paper introduces adaptive regularization and iterative feature pruning to improve embedding consistency and predictive performance.
Reparameterization invariance in approximate Bayesian inference
·2291 words·11 mins·
loading
·
loading
π’ Technical University of Denmark
Bayesian neural networks often underfit due to their lack of reparameterization invariance; this paper introduces a Riemannian diffusion process to improve posterior sampling and enhance predictive pe…
Recursive PAC-Bayes: A Frequentist Approach to Sequential Prior Updates with No Information Loss
·1805 words·9 mins·
loading
·
loading
π’ University of South Denmark
Recursive PAC-Bayes: A frequentist method enabling sequential prior updates without information loss, resulting in significantly tighter generalization bounds.
Recurrent neural network dynamical systems for biological vision
·2292 words·11 mins·
loading
·
loading
Image Classification
π’ University of Cambridge
CordsNet: a hybrid CNN-RNN architecture enabling biologically realistic, robust image recognition through continuous-time recurrent dynamics.
Reconstruct and Match: Out-of-Distribution Robustness via Topological Homogeneity
·1935 words·10 mins·
loading
·
loading
π’ Shenzhen University
Reconstruct & Match (REMA) enhances deep learning’s out-of-distribution robustness by leveraging object’s topological homogeneity, outperforming state-of-the-art methods.
Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding Network
·2135 words·11 mins·
loading
·
loading
Image Generation
π’ Tsinghua University
CORUN-Colabator: a novel cooperative unfolding network and coherence-based label generator achieves state-of-the-art real-world image dehazing by effectively integrating physical knowledge and generat…
QKFormer: Hierarchical Spiking Transformer using Q-K Attention
·2062 words·10 mins·
loading
·
loading
Image Classification
π’ Pengcheng Laboratory
QKFormer: A groundbreaking spiking transformer achieving 85.65% ImageNet accuracy using a linear-complexity, energy-efficient Q-K attention mechanism.
Provable Benefit of Cutout and CutMix for Feature Learning
·1796 words·9 mins·
loading
·
loading
Image Classification
π’ KAIST
CutMix and Cutout data augmentation methods provably improve feature learning by enabling the network to learn rarer features and noise vectors more effectively.
Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation
·2063 words·10 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
π’ University of Strasbourg
PeskaVLP: Hierarchical knowledge augmentation boosts surgical video-language pretraining!
Probablistic Emulation of a Global Climate Model with Spherical DYffusion
·2553 words·12 mins·
loading
·
loading
π’ University of California, San Diego
Spherical DYffusion: a novel AI model generates accurate, physically consistent global climate ensemble simulations, surpassing existing methods in efficiency and skill.
Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks
·6833 words·33 mins·
loading
·
loading
π’ LinkΓΆping University
Graph-EFM: a novel probabilistic weather forecasting model using hierarchical graph neural networks that efficiently generates large ensembles for improved accuracy and uncertainty quantification.
Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control
·2633 words·13 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
π’ University of Oxford
Pre-trained text-to-image diffusion models create highly effective, versatile representations for embodied AI control, surpassing previous methods.
Poisson Variational Autoencoder
·2239 words·11 mins·
loading
·
loading
π’ University of California, Berkeley
Poisson Variational Autoencoder (P-VAE) improves deep learning by encoding inputs as discrete spike counts, enhancing biological realism and interpretability while avoiding posterior collapse and achi…
Physically Compatible 3D Object Modeling from a Single Image
·1864 words·9 mins·
loading
·
loading
3D Vision
π’ Massachusetts Institute of Technology
Single image to physically compatible 3D objects: A new framework ensures 3D models maintain stability and mirror real-world equilibrium states, advancing realism in dynamic simulations and 3D printi…
Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis
·3968 words·19 mins·
loading
·
loading
π’ Xidian University
Peri-midFormer uses a novel periodic pyramid transformer to effectively model complex periodic variations in time series, achieving state-of-the-art results in forecasting, imputation, classification,…
PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders
·2194 words·11 mins·
loading
·
loading
3D Vision
π’ Shanghai Jiao Tong University
PCP-MAE enhances point cloud self-supervised learning by cleverly predicting masked patch centers, leading to superior 3D object classification and scene segmentation.
Particle Semi-Implicit Variational Inference
·1634 words·8 mins·
loading
·
loading
π’ University of Warwick
Particle Variational Inference (PVI) revolutionizes semi-implicit variational inference by directly optimizing the ELBO using a novel particle approximation, improving efficiency and expressiveness ov…