Abstract:
Background: The p value is the most widely used statistical index for inference in science. A p value greater than 0.05, i.e., a nonsignificant result, however, cannot distinguish between two situations: absence of evidence and evidence of absence. Unfortunately, researchers in psychological science may not interpret p values correctly, leading to possible mistakes in statistical inference based on nonsignificant results. Indeed, Aczel et al. (2019) surveyed empirical studies published in three journals (Psychonomic Bulletin & Review, Journal of Experimental Psychology: General, and Psychological Science) and found that about 72% of nonsignificant results were misinterpreted as evidence in favor of the null hypothesis. The misinterpretation of nonsignificant results may have severe consequences. One such consequence is the dismissal of nonsignificant results as null effects, thereby overlooking small but meaningful effects (e.g., Jia et al., 2018). More importantly, misinterpreting nonsignificant results when comparing certain traits (e.g., age, gender) in matched-group clinical trials may create falsely "matched" groups, rendering the estimated effect of the intervention meaningless. As psychological science keeps growing in China, it is important to estimate how nonsignificant results are interpreted in empirical studies published in Chinese journals. However, no such meta-research has been done. To fill this gap, we surveyed 500 empirical papers published in five important Chinese psychological journals to explore the following questions: (1) how often are nonsignificant results reported, i.e., how severe is the publication bias; (2) how do researchers interpret nonsignificant results in their own studies; and (3) when researchers interpret a nonsignificant result as "evidence of absence," do the empirical data provide enough support for the null effect?
Method: Based on our preregistration (https://osf.io/czx6f), we randomly selected empirical research papers published in 2017 and 2018 in five prominent Chinese journals (Acta Psychologica Sinica, Psychological Science, Chinese Journal of Clinical Psychology, Psychological Development and Education, and Psychological and Behavioral Studies). First, according to the publication volume of each journal, we randomly selected 500 empirical research papers. Second, we screened the abstracts of the selected articles and judged whether they contained negative statements. Third, we classified each negative statement into one of four categories (correct frequentist; incorrect frequentist: whole population; incorrect frequentist: current sample; difficult to judge). Finally, we calculated Bayes factors from the t values and sample sizes associated with the nonsignificant results to investigate whether the empirical data provided enough evidence in favor of the null hypothesis.
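The last step can be sketched as follows. The abstract does not state which Bayes factor the authors computed; a common default for t-tests is the JZS Bayes factor (Rouder et al., 2009), shown here for a one-sample design with the conventional Cauchy prior scale r = √2/2. The function name `jzs_bf10` and the choice of prior scale are illustrative assumptions, not the authors' exact procedure.

```python
import numpy as np
from scipy import integrate

def jzs_bf10(t, n, r=np.sqrt(2) / 2):
    """JZS Bayes factor BF10 for a one-sample t-test (Rouder et al., 2009).

    t : observed t statistic
    n : sample size
    r : scale of the Cauchy prior on effect size (default sqrt(2)/2)

    Returns BF10 (evidence for H1 over H0); BF01 is simply 1 / BF10,
    so BF01 > 10 means strong evidence in favor of the null.
    """
    nu = n - 1
    # Marginal likelihood under H0 (up to a constant shared with H1).
    h0 = (1 + t**2 / nu) ** (-(nu + 1) / 2)

    # Marginal likelihood under H1: integrate over the g prior
    # (inverse-gamma(1/2, 1/2) mixing distribution of the JZS prior).
    def integrand(g):
        a = 1 + n * g * r**2
        return (a ** -0.5
                * (1 + t**2 / (a * nu)) ** (-(nu + 1) / 2)
                * (2 * np.pi) ** -0.5 * g ** -1.5 * np.exp(-1 / (2 * g)))

    h1, _ = integrate.quad(integrand, 0, np.inf)
    return h1 / h0
```

For a nonsignificant result with a small t value, BF01 = 1 / BF10 quantifies how strongly the data actually favor the null, rather than merely failing to reject it; for two-sample designs, n would be replaced by the effective sample size n1*n2/(n1+n2) and nu by n1+n2-2.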
Results: Our survey revealed that: (1) of the 500 empirical papers, 36% (n = 180) mentioned nonsignificant results in their abstracts; (2) these articles contained 236 negative statements referring to the nonsignificant results in the abstracts, and 41% of these statements misinterpreted the nonsignificant results, i.e., the authors inferred that the results provided evidence for the absence of an effect; (3) only 5.1% (n = 2) of the nonsignificant results provided strong evidence in favor of the null hypothesis (BF01 > 10). Compared with Aczel et al. (2019), empirical papers published in Chinese journals reported more nonsignificant results (36% vs. 32%), and researchers made fewer misinterpretations based on nonsignificant results (41% vs. 72%). It is worth noting that one category of statements about nonsignificant results is ambiguous in the Chinese context: "there is no significant difference between condition A and condition B." This statement has two readings: it can be interpreted as another way of saying "statistically nonsignificant," or as "there is no difference between condition A and condition B." The percentage of misinterpreted nonsignificant results rose to 61% under the second reading, compared with 41% under the first.
Conclusion: These results suggest that Chinese researchers need to improve their understanding of nonsignificant results and to use more appropriate statistical methods to extract information from them. More precise wording should also be used in the Chinese context.
From: 王珺
Subject: Psychology >> Statistics in Psychology
Cite as: ChinaXiv:202003.00056 (or this version: ChinaXiv:202003.00056V1)
DOI: 10.12074/202003.00056V1
CSTR: 32003.36.ChinaXiv.202003.00056.V1
TXID: 0875eb08-5238-44e4-b866-e43488d4abab
Recommended reference: 王珺, 宋琼雅, 许岳培, 贾彬彬, 陆春雷, 陈曦, 戴紫旭, 黄之玥, 李振江, 林景希, 罗婉莹, 施赛男, 张莹莹, 臧玉峰, 左西年, 胡传鹏. Interpreting nonsignificant results: a quantitative analysis of 500 empirical studies [解读不显著结果：基于500个实证研究的量化分析]. ChinaXiv (Chinese Academy of Sciences preprint platform). [ChinaXiv:202003.00056V1]