首页|大语言模型算力度量模型

大语言模型算力度量模型

扫码查看
面对大语言模型对算力需求的快速增长,传统的摩尔定律已经难以满足需求,而大语言模型的扩展法则表明更多参数、更多数据和更多算力能够得到更好的模型智能.针对大语言模型的算力度量问题开展研究,旨在评估大语言模型的算力需求.提出大语言模型训练的算力度量模型和大语言模型推理的算力度量模型,并通过理论分析提出了相应的计算方法.
Computational Measurement Model for Large Language Models
In the face of the rapidly increasing demand for computing power in large language models,traditional Moore's Law is no longer sufficient to meet the demand,while the expansion rules of large language models indicate that more parameters,more data,and more computing power can lead to better model intelligence.Research is conducted on the measurement of computing power for large language models in order to evaluate the computing power requirements of large language models.It proposes a computational power measurement model for training large language models and a computational power measurement model for inference of large language models,and the corresponding calculation methods is put forward through theoretical analysis.

Large language modelComputational measurementAI

刘永生、张岩、周广、曹畅

展开 >

中国联通研究院,北京 100048

大语言模型 算力度量 人工智能

2024

邮电设计技术
中讯邮电咨询设计院有限公司

邮电设计技术

影响因子:0.647
ISSN:1007-3043
年,卷(期):2024.(9)
  • 9