Spotlight Others

Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature

26 September 2024·2320 words·11 mins

🏢 Purdue University

A new membership inference attack based on input loss curvature probes deep learning privacy and outperforms existing methods, especially on large datasets.

CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics

26 September 2024·1838 words·9 mins

AI Applications Robotics 🏢 Tsinghua University

CooHOI: A two-phase learning framework enables physically simulated characters to perform cooperative object transportation tasks naturally and efficiently, overcoming the limitations of existing meth…

Context and Geometry Aware Voxel Transformer for Semantic Scene Completion

26 September 2024·2245 words·11 mins

3D Vision 🏢 Zhejiang University

CGFormer: a novel voxel transformer boosting semantic scene completion accuracy by using context-aware queries and 3D deformable attention, outperforming existing methods on SemanticKITTI and SSCBench…

Connecting Joint-Embedding Predictive Architecture with Contrastive Self-supervised Learning

26 September 2024·2598 words·13 mins

Self-Supervised Learning 🏢 Carnegie Mellon University

C-JEPA boosts self-supervised visual learning by integrating contrastive learning with a joint-embedding predictive architecture, enhancing stability and representation quality.

Conditioning non-linear and infinite-dimensional diffusion processes

26 September 2024·1703 words·8 mins

🏢 University of Copenhagen

Conditioning infinite-dimensional nonlinear diffusion processes is made possible, enabling analysis of complex data like organism shapes in evolutionary biology.

Compositional Generalization Across Distributional Shifts with Sparse Tree Operations

26 September 2024·2516 words·12 mins

Machine Translation 🏢 Johns Hopkins University

Sparse Differentiable Tree Machine (sDTM) improves compositional generalization in neural networks by efficiently representing tree structures in vector space, enabling simultaneous symbolic and neura…

Cluster-wise Graph Transformer with Dual-granularity Kernelized Attention

26 September 2024·1558 words·8 mins

🏢 Shanghai Jiao Tong University

Cluster-wise Graph Transformer (Cluster-GT) improves graph learning by using a novel Node-to-Cluster Attention mechanism that leverages multiple kernel learning to capture node and cluster-level infor…

CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning

26 September 2024·2694 words·13 mins

Multimodal Learning Vision-Language Models 🏢 University of Washington

Boosting multimodal contrastive learning, this research introduces negCLIPLoss and NormSim, novel data selection methods surpassing existing techniques by improving data quality and task relevance. Th…

Cell ontology guided transcriptome foundation model

26 September 2024·4051 words·20 mins

Self-Supervised Learning 🏢 University of Toronto

scCello: A Cell Ontology-Guided Transcriptome Foundation Model improves single-cell RNA sequencing analysis by incorporating cell lineage information, significantly boosting accuracy and generalizabil…

Bridge the Points: Graph-based Few-shot Segment Anything Semantically

26 September 2024·3214 words·16 mins

Image Segmentation 🏢 Beijing Institute of Technology

GF-SAM: A novel graph-based few-shot semantic segmentation method leverages SAM’s power efficiently via positive-negative prompt alignment and mask clustering for superior accuracy and speed.

BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO

26 September 2024·2436 words·12 mins

AI Applications Robotics 🏢 Universitat Pompeu Fabra

BricksRL: A low-cost, open-source platform democratizes robotics and reinforcement learning research using LEGO, enabling accessible real-world experiments.

Breaking Long-Tailed Learning Bottlenecks: A Controllable Paradigm with Hypernetwork-Generated Diverse Experts

26 September 2024·2226 words·11 mins

Few-Shot Learning 🏢 University of Science and Technology of China

Hypernetwork-generated diverse experts make long-tailed learning controllable, adapting to user preferences and distribution shifts.

Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking

26 September 2024·2427 words·12 mins

Self-Supervised Learning 🏢 National University of Singapore

Brain-JEPA: a novel brain dynamics foundation model leverages fMRI data via innovative gradient positioning and spatiotemporal masking to achieve state-of-the-art performance in diverse brain activity…

BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning

26 September 2024·1651 words·8 mins

AI Applications Finance 🏢 University of California, Berkeley

BPQP: A new differentiable convex optimization framework that accelerates end-to-end learning by an order of magnitude over existing methods.

Boosting Vision-Language Models with Transduction

26 September 2024·2950 words·14 mins

Multimodal Learning Vision-Language Models 🏢 UCLouvain

TransCLIP significantly boosts vision-language model accuracy by efficiently integrating transduction, a powerful learning paradigm that leverages the structure of unlabeled data.

BMRS: Bayesian Model Reduction for Structured Pruning

26 September 2024·2098 words·10 mins

🏢 University of Copenhagen

BMRS (Bayesian Model Reduction for Structured Pruning) offers a principled, threshold-free approach to neural network compression, achieving high accuracy and competitive efficiency.

BackTime: Backdoor Attacks on Multivariate Time Series Forecasting

26 September 2024·2166 words·11 mins

AI Applications Security 🏢 University of Illinois

BACKTIME unveils effective backdoor attacks on multivariate time series forecasting, highlighting vulnerabilities and offering novel defense strategies.

Autoregressive Image Generation without Vector Quantization

26 September 2024·1807 words·9 mins

Image Generation 🏢 Massachusetts Institute of Technology

Autoregressive image generation is revolutionized by eliminating vector quantization, achieving strong results with increased speed using a novel diffusion procedure.

Automatically Learning Hybrid Digital Twins of Dynamical Systems

26 September 2024·2680 words·13 mins

AI Applications Healthcare 🏢 University of Cambridge

AI autonomously designs highly effective hybrid digital twins by combining neural networks and mechanistic models, significantly advancing digital twin technology.

Association of Objects May Engender Stereotypes: Mitigating Association-Engendered Stereotypes in Text-to-Image Generation

26 September 2024·3516 words·17 mins

Image Generation 🏢 Southern University of Science and Technology

MAS, a new framework, mitigates stereotypes in text-to-image generation by aligning the probability distribution of generated images with stereotype-free distributions.