Posters
2024
Customizing Language Models with Instance-wise LoRA for Sequential Recommendation
·1854 words·9 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 University of Science and Technology of China
Instance-wise LoRA (iLoRA) boosts LLM sequential recommendation accuracy by customizing model parameters for each user, mitigating negative transfer and improving performance.
Customized Subgraph Selection and Encoding for Drug-drug Interaction Prediction
·2210 words·11 mins·
loading
·
loading
AI Applications
Healthcare
🏢 Northwestern Polytechnical University
AI-powered drug interaction prediction gets a boost! CSSE-DDI uses neural architecture search to customize subgraph selection and encoding, resulting in superior accuracy and efficiency compared to e…
Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning
·1867 words·9 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 University of Washington
Multi-Sub leverages multi-modal learning to achieve customized multiple clustering, aligning user-defined textual preferences with visual representations via a subspace proxy learning framework.
Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise
·1703 words·8 mins·
loading
·
loading
Computer Vision
Image Classification
🏢 Gwangju Institute of Science and Technology
CUFIT: a novel curriculum fine-tuning paradigm significantly improves medical image classification accuracy despite noisy labels by leveraging pre-trained Vision Foundation Models.
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
·2211 words·11 mins·
loading
·
loading
Multimodal Learning
Vision-Language Models
🏢 SHI Labs @ Georgia Tech & UIUC
CuMo boosts multimodal LLMs by efficiently integrating co-upcycled Mixture-of-Experts, achieving state-of-the-art performance with minimal extra parameters during inference.
CulturePark: Boosting Cross-cultural Understanding in Large Language Models
·2738 words·13 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Microsoft Research
CulturePark, a novel multi-agent communication framework, generates high-quality cross-cultural data to fine-tune LLMs, significantly reducing cultural bias and boosting cross-cultural understanding.
CultureLLM: Incorporating Cultural Differences into Large Language Models
·2507 words·12 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 Microsoft Research
CultureLLM, a new approach, effectively incorporates cultural nuances into LLMs using semantic data augmentation, significantly outperforming existing models.
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
·2880 words·14 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 UC Los Angeles
Ctrl-X: Zero-shot text-to-image generation with training-free structure & appearance control!
CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search
·2426 words·12 mins·
loading
·
loading
AI Theory
Optimization
🏢 Fudan University
CSPG: a novel framework boosting Approximate Nearest Neighbor Search speed by 1.5-2x, using sparse proximity graphs and efficient two-staged search.
Cryptographic Hardness of Score Estimation
·386 words·2 mins·
loading
·
loading
AI Generated
AI Theory
Optimization
🏢 University of Washington
Score estimation, crucial for diffusion models, is computationally hard even with polynomial sample complexity unless strong distributional assumptions are made.
CryoSPIN: Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference
·1731 words·9 mins·
loading
·
loading
Computer Vision
3D Vision
🏢 University of Toronto
CryoSPIN revolutionizes ab-initio cryo-EM reconstruction with semi-amortized pose inference, achieving faster and more accurate 3D structure determination.
CryoGEM: Physics-Informed Generative Cryo-Electron Microscopy
·2131 words·11 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 ShanghaiTech University
CryoGEM: Physics-informed generative model creates realistic synthetic cryo-EM datasets, boosting particle picking and pose estimation accuracy for higher-resolution protein structure determination.
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection
·2900 words·14 mins·
loading
·
loading
AI Generated
AI Applications
Autonomous Vehicles
🏢 Hanyang University
CRT-Fusion: Boosting 3D object detection by fusing camera, radar, and motion information for more accurate, robust results!
Cross-video Identity Correlating for Person Re-identification Pre-training
·1957 words·10 mins·
loading
·
loading
Computer Vision
Person Re-Identification
🏢 String
Cross-video Identity-cOrrelating pre-training (CION) revolutionizes person re-identification by leveraging identity correlation across videos for superior model pre-training, achieving state-of-the-ar…
Cross-Scale Self-Supervised Blind Image Deblurring via Implicit Neural Representation
·3186 words·15 mins·
loading
·
loading
Computer Vision
Image Generation
🏢 National University of Singapore
Self-supervised blind image deblurring (BID) breakthrough! A novel cross-scale consistency loss and progressive training scheme using implicit neural representations achieves superior performance wit…
Cross-model Control: Improving Multiple Large Language Models in One-time Training
·1811 words·9 mins·
loading
·
loading
Natural Language Processing
Large Language Models
🏢 East China Normal University
One-time training improves multiple LLMs using a tiny portable model, drastically reducing costs and resource needs for model enhancement.
Cross-Modality Perturbation Synergy Attack for Person Re-identification
·1933 words·10 mins·
loading
·
loading
Computer Vision
Face Recognition
🏢 Xiamen University
Cross-Modality Perturbation Synergy (CMPS) attack: A novel universal perturbation method for cross-modality person re-identification, effectively misleading ReID models by leveraging gradients from di…
Cross-modal Representation Flattening for Multi-modal Domain Generalization
·3259 words·16 mins·
loading
·
loading
AI Generated
Multimodal Learning
Vision-Language Models
🏢 Hong Kong Polytechnic University
Cross-Modal Representation Flattening (CMRF) improves multi-modal domain generalization by creating consistent flat loss regions and enhancing knowledge transfer between modalities, outperforming exis…
Cross-Device Collaborative Test-Time Adaptation
·2757 words·13 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 South China University of Technology
CoLA: Collaborative Lifelong Adaptation boosts test-time adaptation efficiency by sharing domain knowledge across multiple devices, achieving significant accuracy gains with minimal computational over…
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks
·2029 words·10 mins·
loading
·
loading
Machine Learning
Deep Learning
🏢 Stanford University
CRONOS: Scaling convex neural network training to ImageNet!