Research of Mining the Category Knowledge Based on English - Chinese Humanities and Social Sciences Parallel Corpus in Phrase Level
The experiment of mining the category knowledge from English - Chinese humanities and social sciences parallel corpus in phrase level is performed based on the established clustering algorithm. The clustering and morphological conver- sion algorithms are determined by experimental data and specific research needs. The performance of English - Chinese bilingual word features is better than monolingual word by comparing the performance of the Chinese, English and English - Chinese word level knowledge clustering. The category knowledge is directly applied to knowledge base and machine translation system, and the English and Chinese word' s expression is explored in mining the category knowledge.
CSSCI English- Chinese parallel corpus in phrase level Bisecting Kmeans clustering algorithm Category knowledge