首页|汉字熵值计算及其科学意义

汉字熵值计算及其科学意义

扫码查看
汉字的熵值计算是确定计算机汉字编码形式的前提和基础.汉字熵值计算是交叉学科性质的研究.字符的熵值计算需要借助信息论理论、概率理论和香农推导出的计算英文字母熵的公式.汉字编码的依据是"信道编码定理".世界首次计算出汉字的熵值是在20 世纪70 年代中期,由中国学者冯志伟通过手工操作完成的.该研究具有重要的科学意义.为后来的多八位双字节汉字编码提供了语言学理据,为中国的计算机中文信息处理技术的飞速发展作出了重要贡献.
Calculation of Chinese Characters Entropy and Its Scientific Significance
The calculation of entropy of Chinese characters is the prerequisite and foundation for determining the encoding form of Chinese characters.The calculation of entropy of Chinese characters is a cross disciplinary research.The calculation of entropy of characters needs to rely on the information theory,probability theory and Shannon's formula for calculation of the entropy of English letter.The basis of Chinese character encoding is the"channel coding theorem".The first calculation of the entropy of Chinese characters was carried out in the mid-1970s by Chinese scholar Feng Zhiwei through manual operation.This research has important scientific significance.It provides linguistic evidence for the later multi-octal double byte encoding of Chinese character.And it has made important contributions to the rapid development of Chinese information processing technology in China.

information theorychannel encoding theorementropy of Chinese charactersencoding of Chinese characters

冯志伟

展开 >

教育部语言文字应用研究所 (北京 100010)

新疆大学

黑龙江大学

信息论 信道编码定理 汉字的熵值 汉字编码

2024

北华大学学报(社会科学版)
北华大学

北华大学学报(社会科学版)

CHSSCD
影响因子:0.311
ISSN:1009-5101
年,卷(期):2024.25(1)
  • 8