Skip to main content

Posters

2024

Customizing Language Models with Instance-wise LoRA for Sequential Recommendation
·1854 words·9 mins· loading · loading
AI Generated Natural Language Processing Large Language Models 🏢 University of Science and Technology of China
Instance-wise LoRA (iLoRA) boosts LLM sequential recommendation accuracy by customizing model parameters for each user, mitigating negative transfer and improving performance.
Customized Subgraph Selection and Encoding for Drug-drug Interaction Prediction
·2210 words·11 mins· loading · loading
AI Applications Healthcare 🏢 Northwestern Polytechnical University
AI-powered drug interaction prediction gets a boost! CSSE-DDI uses neural architecture search to customize subgraph selection and encoding, resulting in superior accuracy and efficiency compared to e…
Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning
·1867 words·9 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 University of Washington
Multi-Sub leverages multi-modal learning to achieve customized multiple clustering, aligning user-defined textual preferences with visual representations via a subspace proxy learning framework.
Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise
·1703 words·8 mins· loading · loading
Computer Vision Image Classification 🏢 Gwangju Institute of Science and Technology
CUFIT: a novel curriculum fine-tuning paradigm significantly improves medical image classification accuracy despite noisy labels by leveraging pre-trained Vision Foundation Models.
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts
·2211 words·11 mins· loading · loading
Multimodal Learning Vision-Language Models 🏢 SHI Labs @ Georgia Tech & UIUC
CuMo boosts multimodal LLMs by efficiently integrating co-upcycled Mixture-of-Experts, achieving state-of-the-art performance with minimal extra parameters during inference.
CulturePark: Boosting Cross-cultural Understanding in Large Language Models
·2738 words·13 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Microsoft Research
CulturePark, a novel multi-agent communication framework, generates high-quality cross-cultural data to fine-tune LLMs, significantly reducing cultural bias and boosting cross-cultural understanding.
CultureLLM: Incorporating Cultural Differences into Large Language Models
·2507 words·12 mins· loading · loading
Natural Language Processing Large Language Models 🏢 Microsoft Research
CultureLLM, a new approach, effectively incorporates cultural nuances into LLMs using semantic data augmentation, significantly outperforming existing models.
Ctrl-X: Controlling Structure and Appearance for Text-To-Image Generation Without Guidance
·2880 words·14 mins· loading · loading
Computer Vision Image Generation 🏢 UC Los Angeles
Ctrl-X: Zero-shot text-to-image generation with training-free structure & appearance control!
CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search
·2426 words·12 mins· loading · loading
AI Theory Optimization 🏢 Fudan University
CSPG: a novel framework boosting Approximate Nearest Neighbor Search speed by 1.5-2x, using sparse proximity graphs and efficient two-staged search.
Cryptographic Hardness of Score Estimation
·386 words·2 mins· loading · loading
AI Generated AI Theory Optimization 🏢 University of Washington
Score estimation, crucial for diffusion models, is computationally hard even with polynomial sample complexity unless strong distributional assumptions are made.
CryoSPIN: Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference
·1731 words·9 mins· loading · loading
Computer Vision 3D Vision 🏢 University of Toronto
CryoSPIN revolutionizes ab-initio cryo-EM reconstruction with semi-amortized pose inference, achieving faster and more accurate 3D structure determination.
CryoGEM: Physics-Informed Generative Cryo-Electron Microscopy
·2131 words·11 mins· loading · loading
Computer Vision Image Generation 🏢 ShanghaiTech University
CryoGEM: Physics-informed generative model creates realistic synthetic cryo-EM datasets, boosting particle picking and pose estimation accuracy for higher-resolution protein structure determination.
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection
·2900 words·14 mins· loading · loading
AI Generated AI Applications Autonomous Vehicles 🏢 Hanyang University
CRT-Fusion: Boosting 3D object detection by fusing camera, radar, and motion information for more accurate, robust results!
Cross-video Identity Correlating for Person Re-identification Pre-training
·1957 words·10 mins· loading · loading
Computer Vision Person Re-Identification 🏢 String
Cross-video Identity-cOrrelating pre-training (CION) revolutionizes person re-identification by leveraging identity correlation across videos for superior model pre-training, achieving state-of-the-ar…
Cross-Scale Self-Supervised Blind Image Deblurring via Implicit Neural Representation
·3186 words·15 mins· loading · loading
Computer Vision Image Generation 🏢 National University of Singapore
Self-supervised blind image deblurring (BID) breakthrough! A novel cross-scale consistency loss and progressive training scheme using implicit neural representations achieves superior performance wit…
Cross-model Control: Improving Multiple Large Language Models in One-time Training
·1811 words·9 mins· loading · loading
Natural Language Processing Large Language Models 🏢 East China Normal University
One-time training improves multiple LLMs using a tiny portable model, drastically reducing costs and resource needs for model enhancement.
Cross-Modality Perturbation Synergy Attack for Person Re-identification
·1933 words·10 mins· loading · loading
Computer Vision Face Recognition 🏢 Xiamen University
Cross-Modality Perturbation Synergy (CMPS) attack: A novel universal perturbation method for cross-modality person re-identification, effectively misleading ReID models by leveraging gradients from di…
Cross-modal Representation Flattening for Multi-modal Domain Generalization
·3259 words·16 mins· loading · loading
AI Generated Multimodal Learning Vision-Language Models 🏢 Hong Kong Polytechnic University
Cross-Modal Representation Flattening (CMRF) improves multi-modal domain generalization by creating consistent flat loss regions and enhancing knowledge transfer between modalities, outperforming exis…
Cross-Device Collaborative Test-Time Adaptation
·2757 words·13 mins· loading · loading
Machine Learning Deep Learning 🏢 South China University of Technology
CoLA: Collaborative Lifelong Adaptation boosts test-time adaptation efficiency by sharing domain knowledge across multiple devices, achieving significant accuracy gains with minimal computational over…
CRONOS: Enhancing Deep Learning with Scalable GPU Accelerated Convex Neural Networks
·2029 words·10 mins· loading · loading
Machine Learning Deep Learning 🏢 Stanford University
CRONOS: Scaling convex neural network training to ImageNet!