Skip to main content

Spotlight Others

2024

Curvature Clues: Decoding Deep Learning Privacy with Input Loss Curvature
·2320 words·11 mins· loading · loading
🏢 Purdue University
Deep learning privacy is enhanced by a new membership inference attack using input loss curvature, exceeding existing methods, especially on large datasets.
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
·1838 words·9 mins· loading · loading
AI Applications Robotics 🏢 Tsinghua University
CooHOI: A two-phase learning framework enables physically simulated characters to perform cooperative object transportation tasks naturally and efficiently, overcoming the limitations of existing meth…
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion
·2245 words·11 mins· loading · loading
3D Vision 🏢 Zhejiang University
CGFormer: a novel voxel transformer boosting semantic scene completion accuracy by using context-aware queries and 3D deformable attention, outperforming existing methods on SemanticKITTI and SSCBench…
Connecting Joint-Embedding Predictive Architecture with Contrastive Self-supervised Learning
·2598 words·13 mins· loading · loading
Self-Supervised Learning 🏢 Carnegie Mellon University
C-JEPA boosts self-supervised visual learning by integrating contrastive learning with a joint-embedding predictive architecture, enhancing stability and representation quality.
Conditioning non-linear and infinite-dimensional diffusion processes
·1703 words·8 mins· loading · loading
🏢 University of Copenhagen
Conditioning infinite-dimensional nonlinear diffusion processes is made possible, enabling analysis of complex data like organism shapes in evolutionary biology.
Compositional Generalization Across Distributional Shifts with Sparse Tree Operations
·2516 words·12 mins· loading · loading
Machine Translation 🏢 Johns Hopkins University
Sparse Differentiable Tree Machine (sDTM) improves compositional generalization in neural networks by efficiently representing tree structures in vector space, enabling simultaneous symbolic and neura…
Cluster-wise Graph Transformer with Dual-granularity Kernelized Attention
·1558 words·8 mins· loading · loading
🏢 Shanghai Jiao Tong University
Cluster-wise Graph Transformer (Cluster-GT) improves graph learning by using a novel Node-to-Cluster Attention mechanism that leverages multiple kernel learning to capture node and cluster-level infor…
CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning
·2694 words·13 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 University of Washington
Boosting multimodal contrastive learning, this research introduces negCLIPLoss and NormSim, novel data selection methods surpassing existing techniques by improving data quality and task relevance. Th…
Cell ontology guided transcriptome foundation model
·4051 words·20 mins· loading · loading
Self-Supervised Learning 🏢 University of Toronto
scCello: A Cell Ontology-Guided Transcriptome Foundation Model improves single-cell RNA sequencing analysis by incorporating cell lineage information, significantly boosting accuracy and generalizabil…
Bridge the Points: Graph-based Few-shot Segment Anything Semantically
·3214 words·16 mins· loading · loading
Image Segmentation 🏢 Beijing Institute of Technology
GF-SAM: A novel graph-based few-shot semantic segmentation method leverages SAM’s power efficiently via positive-negative prompt alignment and mask clustering for superior accuracy and speed.
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
·2436 words·12 mins· loading · loading
AI Applications Robotics 🏢 Universitat Pompeu Fabra
BricksRL: A low-cost, open-source platform democratizes robotics and reinforcement learning research using LEGO, enabling accessible real-world experiments.
Breaking Long-Tailed Learning Bottlenecks: A Controllable Paradigm with Hypernetwork-Generated Diverse Experts
·2226 words·11 mins· loading · loading
Few-Shot Learning 🏢 University of Science and Technology of China
Controllable long-tailed learning achieved via hypernetwork-generated diverse experts, adapting to user preferences and distribution shifts.
Brain-JEPA: Brain Dynamics Foundation Model with Gradient Positioning and Spatiotemporal Masking
·2427 words·12 mins· loading · loading
Self-Supervised Learning 🏢 National University of Singapore
Brain-JEPA: a novel brain dynamics foundation model leverages fMRI data via innovative gradient positioning and spatiotemporal masking to achieve state-of-the-art performance in diverse brain activity…
BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning
·1651 words·8 mins· loading · loading
AI Applications Finance 🏢 University of California, Berkeley
BPQP: A new differentiable convex optimization framework accelerates end-to-end learning by an order of magnitude, achieving significant efficiency gains over existing methods.
Boosting Vision-Language Models with Transduction
·2950 words·14 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 UCLouvain
TransCLIP significantly boosts vision-language model accuracy by efficiently integrating transduction, a powerful learning paradigm that leverages the structure of unlabeled data.
BMRS: Bayesian Model Reduction for Structured Pruning
·2098 words·10 mins· loading · loading
🏢 University of Copenhagen
BMRS: Bayesian Model Reduction for Structured Pruning offers a principled, threshold-free approach to neural network compression, achieving high accuracy and competitive efficiency.
BackTime: Backdoor Attacks on Multivariate Time Series Forecasting
·2166 words·11 mins· loading · loading
AI Applications Security 🏢 University of Illinois
BACKTIME unveils effective backdoor attacks on multivariate time series forecasting, highlighting vulnerabilities and offering novel defense strategies.
Autoregressive Image Generation without Vector Quantization
·1807 words·9 mins· loading · loading
Image Generation 🏢 Massachusetts Institute of Technology
Autoregressive image generation is revolutionized by eliminating vector quantization, achieving strong results with increased speed using a novel diffusion procedure.
Automatically Learning Hybrid Digital Twins of Dynamical Systems
·2680 words·13 mins· loading · loading
AI Applications Healthcare 🏢 University of Cambridge
AI autonomously designs highly effective hybrid digital twins by combining neural networks and mechanistic models, significantly advancing digital twin technology.
Association of Objects May Engender Stereotypes: Mitigating Association-Engendered Stereotypes in Text-to-Image Generation
·3516 words·17 mins· loading · loading
Image Generation 🏢 Southern University of Science and Technology
New framework, MAS, effectively mitigates stereotypes in text-to-image generation by aligning the probability distribution of generated images to stereotype-free distributions.