A Text Classification Method Fusing Text Graph Convolution and Ensemble Learning (postprint)

Author: 周玄郎 ¹ 邱卫根 ¹ 张立臣 ¹
Institute:

1. School of Computers, Guangdong University of Technology (广东工业大学计算机学院)
Submit Time:2022-05-10 11:22:57

Abstract: To improve the accuracy of text classification and to address the insufficient use of node features by text graph convolutional networks, this paper proposes a new text classification model that combines the strengths of text graph convolution and the Stacking ensemble learning method. The model first uses a text graph convolutional network to learn global representations of documents and words together with the grammatical structure information of the documents. It then applies ensemble learning as a second learning stage on the features extracted by the text graph convolution, which compensates for the under-used node features and improves both the accuracy of single-label text classification and the generalization ability of the overall model. To reduce the time cost of ensemble learning, the fusion algorithm removes the k-fold cross-validation mechanism from the ensemble learning stage. In this way the fusion algorithm couples text graph convolution with Stacking ensemble learning. On datasets including R8, R52, MR, Ohsumed, and 20NG, classification performance improves by more than 1.5%, 2.5%, 11%, 12%, and 7%, respectively, over traditional classification models. The method also compares favorably with other classification algorithms in the same field.
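The two-stage idea in the abstract can be illustrated with a minimal sketch, offered under several assumptions rather than as the authors' implementation. The document embeddings that a text graph convolutional network would produce are replaced here by synthetic Gaussian vectors (`make_embeddings` is hypothetical). The base learners are simple nearest-centroid classifiers with L1 and L2 distances, and the k-fold cross-validation step of Stacking is replaced by a single hold-out split. The abstract states only that k-fold cross-validation is removed, so the hold-out split and the choice of learners are assumptions made for illustration.

```python
import random

def make_embeddings(n, dim=4, seed=0):
    """Hypothetical stand-in for graph-convolution document embeddings:
    two classes drawn from Gaussians centred at 0 and 3."""
    rng = random.Random(seed)
    data = [([rng.gauss(3.0 * (i % 2), 1.0) for _ in range(dim)], i % 2)
            for i in range(n)]
    rng.shuffle(data)
    return [row for row, _ in data], [label for _, label in data]

def fit_centroids(X, y):
    """Nearest-centroid 'training': one mean vector per class label."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for i, value in enumerate(row):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {c: [v / counts[c] for v in acc] for c, acc in sums.items()}

def scores(centroids, row, p):
    """Negative L_p distance from row to each centroid, in sorted label order."""
    return [-(sum(abs(a - b) ** p for a, b in zip(row, centroids[c])) ** (1.0 / p))
            for c in sorted(centroids)]

def predict(centroids, row, p=2):
    """Return the label whose centroid is closest under the L_p distance."""
    s = scores(centroids, row, p)
    return sorted(centroids)[s.index(max(s))]

def meta_features(base, row, ps):
    """Concatenate the score vectors of all base learners (one per p)."""
    feats = []
    for p in ps:
        feats.extend(scores(base, row, p))
    return feats

def stack_fit(X, y, holdout_frac=0.5, ps=(1, 2)):
    """Stacking without k-fold: fit base learners on one part of the data,
    then fit the meta-learner on their outputs for the held-out part."""
    split = int(len(X) * (1 - holdout_frac))
    base = fit_centroids(X[:split], y[:split])
    meta_X = [meta_features(base, row, ps) for row in X[split:]]
    meta = fit_centroids(meta_X, y[split:])
    return base, meta, ps

def stack_predict(model, row):
    """Classify a row by feeding base-learner scores to the meta-learner."""
    base, meta, ps = model
    return predict(meta, meta_features(base, row, ps))

X_train, y_train = make_embeddings(200, seed=0)
X_test, y_test = make_embeddings(100, seed=1)
model = stack_fit(X_train, y_train)
accuracy = sum(stack_predict(model, row) == label
               for row, label in zip(X_test, y_test)) / len(y_test)
print(f"stacked accuracy on synthetic embeddings: {accuracy:.2f}")
```

Each base learner is trained once instead of k times, which is where the time saving comes from. The trade-off is that the meta-learner sees fewer training examples than it would with full out-of-fold predictions.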

Keywords: text representation; text classification; text graph convolution; ensemble learning; fusion model

Subject: Computer Science >> Integration Theory of Computer Science

Journal: 计算机应用研究 (Application Research of Computers)

Contribution: Published
Cite as: ChinaXiv:202205.00079 (or this version ChinaXiv:202205.00079V1)
DOI:10.12074/202205.00079V1
CSTR:32003.36.ChinaXiv.202205.00079.V1
TXID: 24429c6d-89fc-4e5a-a462-ba02439478aa
Recommended reference: 周玄郎, 邱卫根, 张立臣. A Text Classification Method Fusing Text Graph Convolution and Ensemble Learning (融合文本图卷积和集成学习的文本分类方法). 计算机应用研究. https://chinaxiv.org/abs/202205.00079 [ChinaXiv:202205.00079V1]

Version History: [V1] 2022-05-10 11:22:57, ChinaXiv:202205.00079V1
