🏢 Huawei Technologies Co., Ltd.
D-LLM: A Token Adaptive Computing Resource Allocation Strategy for Large Language Models
AI Generated
Natural Language Processing
Large Language Models
D-LLM dynamically allocates computing resources to each token during LLM inference, reducing computational cost and memory usage by up to 50% without sacrificing accuracy.