Affiliation:
1. School of Foreign Languages, NanYang Institute of Technology, Nanyang 473000, Henan, P. R. China
Abstract
The current corpus has the problem of imperfect span retrieval function, which leads to a large classification noise. This paper designs a Python-based corpus of Chinese medicine English vocabulary translation teaching system. Here, we select the script material of web crawler, extract topic tags in the form of tag window, calculate the amount of information carried by words, use Python to extract the characteristics of Chinese medicine English vocabulary, and according to the observation value of exploration strategy, use instant time difference learning algorithm to construct the translation mode of teaching system, limit the scope of key words, and design the cross-range retrieval function of corpus. Experimental results: the average classification noise of the designed corpus and the other two corpora is 25.007[Formula: see text]dB, 33.877[Formula: see text]dB and 32.166[Formula: see text]dB, which proves that the integrated Python corpus has higher comprehensive value.
Publisher
World Scientific Pub Co Pte Ltd
Subject
Library and Information Sciences,Computer Networks and Communications,Computer Science Applications
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献