您当前的位置: > Detailed Browse

Optimization of a prediction model of life satisfaction based on text data augmentation

请选择邀稿期刊:
Abstract: Objective With the development of network big data and machine learning, more and more studies starting to combine text analysis and machine learning algorithms to predict individual satisfaction. In the studies focused on building life satisfaction prediction models, it is often difficult to obtain large amounts of valid and labeled data. This study aims at solving this problem using data augmentation and optimizing the prediction model of life satisfaction. Method Using 357 life status descriptions annotated by self-rating life satisfaction scale scores as original text data. After preprocessing using DLUT-Emotionontology, EAD and back-translation method was applied and the prediction model was built using traditional machine learning algorithms. Results Results showed that (1) the prediction accuracy was largely enhanced after using the adapted version of DLUT-Emotionontology; (2) only linear regression model was enhanced after data augmentation; (3) rigid regression model showed the greatest prediction accuracy when trained by original data (r = 0.4131). Conclusion The improvement of feature extraction accuracy can optimize the current life satisfaction prediction model, but the text data augmentation methods, such as back translation and EDA may not be applicable for the life satisfaction prediction model based on word frequency.

Version History

[V2] 2024-02-29 13:31:45 ChinaXiv:202201.00007V2 Download
[V1] 2022-01-04 11:07:49 ChinaXiv:202201.00007v1 View This Version Download
Download
Preview
Peer Review Status
Awaiting Review
License Information
metrics index
  •  Hits6556
  •  Downloads640
Comment
Share
Apply for expert review