A classification and extraction method of attribute hybrid big data based on Naive Bayes algorithm-Reference-Cited by-同舟云学术

A classification and extraction method of attribute hybrid big data based on Naive Bayes algorithm

Published:2023-08-18 Issue:4 Volume:23 Page:1955-1970
ISSN:1472-7978
Container-title:Journal of Computational Methods in Sciences and Engineering
language:
Short-container-title:JCM

Author:

Li Liantian,Yang Ling

Abstract

In the identification of network text information, the existing technology is difficult to accurately extract and classify text information with high propagation speed and high update speed. In order to solve this problem, the research combines the Naive Bayes algorithm with the feature two-dimensional information gain weighting method, uses the feature weighting method to optimize the Naive Bayes algorithm, and calculates the dimension of different documents and data categories through a new feature operation method. The data gain between them can improve its classification performance, and the classification models are compared and analyzed in the actual Chinese and English databases. The research results show that the classification accuracy rates of the IGDC-DWNB model in the Sogou database, 20-newsgroup database, Fudan database and Ruster21578 database are 0.89, 0.89, 0.93, and 0.88, respectively, which are higher than other classification models in the same environment. It can be seen that the model designed in the research has higher classification accuracy, stronger overall performance, and stronger reliability and robustness in practical applications, which can provide a new development idea for big data classification technology.

Publisher

IOS Press

Subject

Computational Mathematics,Computer Science Applications,General Engineering

Reference27 articles.

1. Big data classification and Internet of Things in healthcare;Rghioui;Int J E-Health Med C.,2020

2. Comprehensive survey of classification, streaming techniques in big data analytics;Uma;Des Eng (Toronto).,2021

3. Komparasi algoritma Naive Bayes dan k-nearest neighbor untuk membangun pengetahuan diagnosa penyakit diabetes;Nurmalasari;Jurnal Komtika (Komputasi dan Informatika).,2021

4. Komparasi algoritma Naive Bayes, decision tree dan support vector machine untuk prediksi penyakit kanker payudara;Prahartiwi;Jurnal Teknik Komputer.,2021

5. Aplikasi asesmen calon debitur menggunakan Naive Bayes di koperasi mitra sejahtera SMK negeri 1 kota sukabumi;Isa;Jurnal Sisfokom (Sistem Informasi dan Komputer).,2021

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Understanding customer behavior by mapping complaints to personality based on social media textual data;Data Technologies and Applications;2024-09-09