🏢 Huawei Technologies Co., Ltd.

D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models

26 September 2024·2704 words·13 mins

AI Generated · Natural Language Processing · Large Language Models

D-LLM dynamically allocates computing resources during LLM token processing, reducing computational costs and memory usage by up to 50% without sacrificing accuracy.