🏢 SI-TECH Information Technology
Adversarial Moment-Matching Distillation of Large Language Models
·2972 words·14 mins·
loading
·
loading
AI Generated
Natural Language Processing
Large Language Models
🏢 SI-TECH Information Technology
Boosting LLM efficiency, this study introduces adversarial moment-matching distillation, outperforming existing methods by matching action-value moments for superior knowledge transfer and achieving s…