Posters

Customizing Language Models with Instance-wise LoRA for Sequential Recommendation

26 September 2024·1854 words·9 mins

AI Generated Natural Language Processing Large Language Models 🏢 University of Science and Technology of China

Instance-wise LoRA (iLoRA) boosts LLM sequential recommendation accuracy by customizing model parameters for each user, mitigating negative transfer and improving performance.

Customized Subgraph Selection and Encoding for Drug-drug Interaction Prediction

26 September 2024·2210 words·11 mins

AI Applications Healthcare 🏢 Northwestern Polytechnical University

AI-powered drug interaction prediction gets a boost! CSSE-DDI uses neural architecture search to customize subgraph selection and encoding, resulting in superior accuracy and efficiency compared to e…

Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning

26 September 2024·1867 words·9 mins

Multimodal Learning Vision-Language Models 🏢 University of Washington

Multi-Sub leverages multi-modal learning to achieve customized multiple clustering, aligning user-defined textual preferences with visual representations via a subspace proxy learning framework.

Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise

26 September 2024·1703 words·8 mins

Computer Vision Image Classification 🏢 Gwangju Institute of Science and Technology

CUFIT: a novel curriculum fine-tuning paradigm significantly improves medical image classification accuracy despite noisy labels by leveraging pre-trained Vision Foundation Models.

CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts

26 September 2024·2211 words·11 mins

Multimodal Learning Vision-Language Models 🏢 SHI Labs @ Georgia Tech & UIUC

CuMo boosts multimodal LLMs by efficiently integrating co-upcycled Mixture-of-Experts, achieving state-of-the-art performance with minimal extra parameters during inference.

CulturePark: Boosting Cross-cultural Understanding in Large Language Models

26 September 2024·2738 words·13 mins

Natural Language Processing Large Language Models 🏢 Microsoft Research

CulturePark, a novel multi-agent communication framework, generates high-quality cross-cultural data to fine-tune LLMs, significantly reducing cultural bias and boosting cross-cultural understanding.

CultureLLM: Incorporating Cultural Differences into Large Language Models

26 September 2024·2507 words·12 mins

Natural Language Processing Large Language Models 🏢 Microsoft Research

CultureLLM, a new approach, effectively incorporates cultural nuances into LLMs using semantic data augmentation, significantly outperforming existing models.

Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance

26 September 2024·2880 words·14 mins

Computer Vision Image Generation 🏢 UC Los Angeles

Ctrl-X: Zero-shot text-to-image generation with training-free structure & appearance control!

CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search

26 September 2024·2426 words·12 mins

AI Theory Optimization 🏢 Fudan University

CSPG: a novel framework boosting Approximate Nearest Neighbor Search speed by 1.5-2x, using sparse proximity graphs and an efficient two-stage search.

Cryptographic Hardness of Score Estimation

26 September 2024·386 words·2 mins

AI Generated AI Theory Optimization 🏢 University of Washington

Score estimation, crucial for diffusion models, is computationally hard even with polynomial sample complexity unless strong distributional assumptions are made.

CryoSPIN: Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference

26 September 2024·1731 words·9 mins

Computer Vision 3D Vision 🏢 University of Toronto

CryoSPIN revolutionizes ab-initio cryo-EM reconstruction with semi-amortized pose inference, achieving faster and more accurate 3D structure determination.

CryoGEM: Physics-Informed Generative Cryo-Electron Microscopy

26 September 2024·2131 words·11 mins

Computer Vision Image Generation 🏢 ShanghaiTech University

CryoGEM: Physics-informed generative model creates realistic synthetic cryo-EM datasets, boosting particle picking and pose estimation accuracy for higher-resolution protein structure determination.

CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection

26 September 2024·2900 words·14 mins

AI Generated AI Applications Autonomous Vehicles 🏢 Hanyang University

CRT-Fusion: Boosting 3D object detection by fusing camera, radar, and motion information for more accurate, robust results!

Cross-video Identity Correlating for Person Re-identification Pre-training

26 September 2024·1957 words·10 mins

Computer Vision Person Re-Identification 🏢 String

Cross-video Identity-cOrrelating pre-training (CION) revolutionizes person re-identification by leveraging identity correlation across videos for superior model pre-training, achieving state-of-the-ar…

Cross-Scale Self-Supervised Blind Image Deblurring via Implicit Neural Representation

26 September 2024·3186 words·15 mins

Computer Vision Image Generation 🏢 National University of Singapore

Self-supervised blind image deblurring (BID) breakthrough! A novel cross-scale consistency loss and progressive training scheme using implicit neural representations achieves superior performance wit…

Cross-model Control: Improving Multiple Large Language Models in One-time Training

26 September 2024·1811 words·9 mins

Natural Language Processing Large Language Models 🏢 East China Normal University

One-time training improves multiple LLMs using a tiny portable model, drastically reducing costs and resource needs for model enhancement.

Cross-Modality Perturbation Synergy Attack for Person Re-identification

26 September 2024·1933 words·10 mins

Computer Vision Face Recognition 🏢 Xiamen University

Cross-Modality Perturbation Synergy (CMPS) attack: A novel universal perturbation method for cross-modality person re-identification, effectively misleading ReID models by leveraging gradients from di…

Cross-modal Representation Flattening for Multi-modal Domain Generalization

26 September 2024·3259 words·16 mins

AI Generated Multimodal Learning Vision-Language Models 🏢 Hong Kong Polytechnic University

Cross-Modal Representation Flattening (CMRF) improves multi-modal domain generalization by creating consistent flat loss regions and enhancing knowledge transfer between modalities, outperforming exis…

Cross-Device Collaborative Test-Time Adaptation

26 September 2024·2757 words·13 mins

Machine Learning Deep Learning 🏢 South China University of Technology

CoLA: Collaborative Lifelong Adaptation boosts test-time adaptation efficiency by sharing domain knowledge across multiple devices, achieving significant accuracy gains with minimal computational over…

CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks

26 September 2024·2029 words·10 mins

Machine Learning Deep Learning 🏢 Stanford University

CRONOS: Scaling convex neural network training to ImageNet!