Spotlight Others

Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference

26 September 2024·1463 words·7 mins· loading · loading

🏢 Hong Kong University of Science and Technology

Reverse Transition Kernel (RTK) framework accelerates diffusion inference by enabling balanced subproblem decomposition, achieving superior convergence rates with RTK-MALA and RTK-ULD algorithms.

Rethinking 3D Convolution in $ll_p$-norm Space

26 September 2024·1754 words·9 mins· loading · loading

3D Vision 🏢 University of Chinese Academy of Sciences

L1-norm based 3D convolution achieves competitive performance with lower energy consumption and latency compared to traditional methods, as proven through universal approximation theorem and experimen…

ResAD: A Simple Framework for Class Generalizable Anomaly Detection

26 September 2024·2059 words·10 mins· loading · loading

Anomaly Detection 🏢 Shanghai Jiao Tong University

ResAD, a novel framework, tackles class-generalizable anomaly detection by learning residual feature distributions, achieving remarkable results on diverse datasets without retraining.

Reproducibility of predictive networks for mouse visual cortex

26 September 2024·2493 words·12 mins· loading · loading

🏢 Max Planck Institute for Dynamics and Self-Organization

Deep learning models for neural activity lack reproducibility; this paper introduces adaptive regularization and iterative feature pruning to improve embedding consistency and predictive performance.

Reparameterization invariance in approximate Bayesian inference

26 September 2024·2291 words·11 mins· loading · loading

🏢 Technical University of Denmark

Bayesian neural networks often underfit due to their lack of reparameterization invariance; this paper introduces a Riemannian diffusion process to improve posterior sampling and enhance predictive pe…

Recursive PAC-Bayes: A Frequentist Approach to Sequential Prior Updates with No Information Loss

26 September 2024·1805 words·9 mins· loading · loading

🏢 University of South Denmark

Recursive PAC-Bayes: A frequentist method enabling sequential prior updates without information loss, resulting in significantly tighter generalization bounds.

Recurrent neural network dynamical systems for biological vision

26 September 2024·2292 words·11 mins· loading · loading

Image Classification 🏢 University of Cambridge

CordsNet: a hybrid CNN-RNN architecture enabling biologically realistic, robust image recognition through continuous-time recurrent dynamics.

Reconstruct and Match: Out-of-Distribution Robustness via Topological Homogeneity

26 September 2024·1935 words·10 mins· loading · loading

🏢 Shenzhen University

Reconstruct & Match (REMA) enhances deep learning’s out-of-distribution robustness by leveraging object’s topological homogeneity, outperforming state-of-the-art methods.

Real-world Image Dehazing with Coherence-based Pseudo Labeling and Cooperative Unfolding Network

26 September 2024·2135 words·11 mins· loading · loading

Image Generation 🏢 Tsinghua University

CORUN-Colabator: a novel cooperative unfolding network and coherence-based label generator achieves state-of-the-art real-world image dehazing by effectively integrating physical knowledge and generat…

QKFormer: Hierarchical Spiking Transformer using Q-K Attention

26 September 2024·2062 words·10 mins· loading · loading

Image Classification 🏢 Pengcheng Laboratory

QKFormer: A groundbreaking spiking transformer achieving 85.65% ImageNet accuracy using a linear-complexity, energy-efficient Q-K attention mechanism.

Provable Benefit of Cutout and CutMix for Feature Learning

26 September 2024·1796 words·9 mins· loading · loading

Image Classification 🏢 KAIST

CutMix and Cutout data augmentation methods provably improve feature learning by enabling the network to learn rarer features and noise vectors more effectively.

Procedure-Aware Surgical Video-language Pretraining with Hierarchical Knowledge Augmentation

26 September 2024·2063 words·10 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 University of Strasbourg

PeskaVLP: Hierarchical knowledge augmentation boosts surgical video-language pretraining!

Probablistic Emulation of a Global Climate Model with Spherical DYffusion

26 September 2024·2553 words·12 mins· loading · loading

🏢 University of California, San Diego

Spherical DYffusion: a novel AI model generates accurate, physically consistent global climate ensemble simulations, surpassing existing methods in efficiency and skill.

Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks

26 September 2024·6833 words·33 mins· loading · loading

🏢 Linköping University

Graph-EFM: a novel probabilistic weather forecasting model using hierarchical graph neural networks that efficiently generates large ensembles for improved accuracy and uncertainty quantification.

Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control

26 September 2024·2633 words·13 mins· loading · loading

Multimodal Learning Vision-Language Models 🏢 University of Oxford

Pre-trained text-to-image diffusion models create highly effective, versatile representations for embodied AI control, surpassing previous methods.

Poisson Variational Autoencoder

26 September 2024·2239 words·11 mins· loading · loading

🏢 University of California, Berkeley

Poisson Variational Autoencoder (P-VAE) improves deep learning by encoding inputs as discrete spike counts, enhancing biological realism and interpretability while avoiding posterior collapse and achi…

Physically Compatible 3D Object Modeling from a Single Image

26 September 2024·1864 words·9 mins· loading · loading

3D Vision 🏢 Massachusetts Institute of Technology

Single image to physically compatible 3D objects: A new framework ensures 3D models maintain stability and mirror real-world equilibrium states, advancing realism in dynamic simulations and 3D printi…

Peri-midFormer: Periodic Pyramid Transformer for Time Series Analysis

26 September 2024·3968 words·19 mins· loading · loading

🏢 Xidian University

Peri-midFormer uses a novel periodic pyramid transformer to effectively model complex periodic variations in time series, achieving state-of-the-art results in forecasting, imputation, classification,…

PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders

26 September 2024·2194 words·11 mins· loading · loading

3D Vision 🏢 Shanghai Jiao Tong University

PCP-MAE enhances point cloud self-supervised learning by cleverly predicting masked patch centers, leading to superior 3D object classification and scene segmentation.

Particle Semi-Implicit Variational Inference

26 September 2024·1634 words·8 mins· loading · loading

🏢 University of Warwick

Particle Variational Inference (PVI) revolutionizes semi-implicit variational inference by directly optimizing the ELBO using a novel particle approximation, improving efficiency and expressiveness ov…