Natural Language Processing System for Text Classification Corpus Based on Machine Learning

Author:

Su Yawen1ORCID

Affiliation:

1. Normal College, Jimei University, Xiamen, Fujian, China

Abstract

A classification system for hazardous materials in air traffic control was investigated using the Human Factors Analysis and Classification System (HFACS) framework and natural language processing to prevent hazardous situations in air traffic control. Based on the development of the HFACS standard, an air traffic control hazard classification system will be created. The dangerous data of the aviation safety management system is selected by dead bodies, classified and marked in five levels. Time Frame Return Frequency TextRank text classification method based on key content extraction and text classification model based on Convolutional Neural Network and Bidirectional Encoder Representations from Transforms models were used in the experiment to solve the problem of small samples, many labels and random samples in hazardous environment of air pollution control. The results show that the total cost of model training time and classification accuracy is the highest when the keywords are around 8. As the number of points increases, the time spent in dimensioning decreases and affects accuracy. When the number of points reaches about 93, the time spent in determining the size increases, but the accuracy of the allocation remains close to 0.7, but the increase in the value of time leads to a decrease in the total cost. It has been proven that extracting key content can solve text classification problems for small companies and contribute to further research in the development of security systems.

Publisher

Association for Computing Machinery (ACM)

Reference19 articles.

1. Natural language processing for imaging protocol assignment: Machine learning for multiclass classification of abdominal CT protocols using indication text data;Xavier B. A.;J. Dig. Imag.,2022

2. A novel automatic classification system based on hybrid unsupervised and supervised machine learning for electrospun nanofibers;Ieracitano Cosimo A.;IEEE/CAA J. Automatica Sinica,2021

3. Automated segmentation of heel fissures based on thermal image processing and classification based on machine learning algorithms;Guhan B.;Biomed. Eng.: Appl. Basis Commun.,2021

4. A General Algorithm of Association Rule-Based Machine Learning Dedicated for Text Classification

5. Automatic medical protocol classification using machine learning approaches;López-Úbeda Pilar;Comput. Methods Programs Biomed.,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3