Journal of Beihua University (Social Sciences) [北华大学学报(社会科学版)], 2024, Vol. 25, Issue 1: 17-25. DOI: 10.19669/j.issn.1009-5101.2024.01.002

Calculation of Chinese Characters Entropy and Its Scientific Significance

Feng Zhiwei (冯志伟)

Author information

  • Institute of Applied Linguistics, Ministry of Education (Beijing 100010); Xinjiang University; Heilongjiang University

Abstract

Calculating the entropy of Chinese characters is the prerequisite and foundation for determining how Chinese characters are encoded in computers. It is an interdisciplinary problem: computing the entropy of characters draws on information theory, probability theory, and the formula Shannon derived for the entropy of English letters. Chinese character encoding rests on the "channel coding theorem". The entropy of Chinese characters was calculated for the first time anywhere in the world in the mid-1970s by the Chinese scholar Feng Zhiwei, who carried out the work by hand. The result is of considerable scientific significance: it provided a linguistic rationale for the later multi-octet, double-byte encoding of Chinese characters and contributed substantially to the rapid development of Chinese information processing technology in China.
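To make the quantity in the abstract concrete, here is a minimal sketch of a first-order (single-character) Shannon entropy estimate, H = -Σ p(c) log2 p(c), computed from character frequencies. It is an illustration only, not the procedure used in the paper: the function names `char_entropy` and `min_bytes_per_char` are hypothetical, the sample string is a toy input, and a meaningful estimate would need a large corpus.

```python
import math
from collections import Counter


def char_entropy(text: str) -> float:
    """First-order Shannon entropy of the characters in `text`, in bits per character."""
    counts = Counter(text)
    total = sum(counts.values())
    if total == 0:
        return 0.0  # convention for empty input
    # H = -sum over distinct characters of p * log2(p), with p = relative frequency
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def min_bytes_per_char(entropy_bits: float) -> int:
    """Hypothetical helper: fewest whole 8-bit bytes whose capacity is at least `entropy_bits`."""
    return max(1, math.ceil(entropy_bits / 8))


if __name__ == "__main__":
    # Toy sample taken from the abstract; far too short for a real estimate.
    sample = "汉字的熵值计算是确定计算机汉字编码形式的前提和基础"
    h = char_entropy(sample)
    print(f"{h:.3f} bits per character, at least {min_bytes_per_char(h)} byte(s)")
```

Under these assumptions, the link to encoding is that the average length of any lossless code cannot fall below the entropy, so an estimate above 8 bits per character would mean one 8-bit byte per character cannot suffice on average. That is the kind of argument the abstract points to for double-byte encoding.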

Keywords

information theory / channel coding theorem / entropy of Chinese characters / Chinese character encoding

Publication year

2024
Journal of Beihua University (Social Sciences)
Beihua University

CHSSCD
Impact factor: 0.311
ISSN: 1009-5101
Number of references: 8