How many of you use p=0.05 as an absolute cut off? p ≥ 0.05 means not significant. No evidence. Nada. And then p < 0.05 great it’s significant. This is a crude way of using p-values, and hopefully I will convince you of this.

你们中有多少人使用p = 0.05作为绝对截止值？ p≥0.05表示不显着。没有证据。娜达然后p <0.05很好，很有意义。这是使用p值的粗略方法，希望我能说服您。

什么是p值？ (What is a p-value?)

A lot of us use p-values following this arbitrary cut off but don’t actually know the theoretical background of a p-value. A p-value is the probability, under the null hypothesis, of observing data at least as extreme as the observed data. It is not, for example, the probability that some population parameter x = 0. x either equals 0 or it does not (in a frequentist setting).

我们中的许多人都在此任意取舍之后使用p值，但实际上并不了解p值的理论背景。 p值是在零假设下观察数据至少与观察数据一样极端的概率。例如，这不是某个总体参数x = 0的概率。x等于0或不等于0(在常客设置中)。

So, the smaller the p-value, the more unlikely it is that this data would have been observed under the null hypothesis. In essence, the smaller the p-value, the stronger the evidence against the null hypothesis.

因此，p值越小，在原假设下观察到该数据的可能性就越小。本质上，p值越小，针对原假设的证据越强。

什么会影响p值？ (What affects p-values?)

Two things mainly. The first is the strength of effect. The greater the difference from the null hypothesis. The smaller the p-value will be.

主要有两件事。首先是效果的强度。与原假设的差异越大。 p值越小。

The second is the sample size. The larger the sample, the smaller the p-value will be (if in fact the null hypothesis is false).

第二个是样本量。样本越大，p值就越小(如果实际上零假设是假的)。

So, this means that if p ≥ 0.05, it could be because the effect isn’t that strong (or doesn’t exist) or that our sample is too small, resulting in our test being underpowered to detect a difference.

因此，这意味着如果p≥0.05，则可能是因为效果不那么强烈(或不存在)或我们的样本太小，导致我们的测试能力不足以检测差异。

一些例子 (Some examples)

致命药 (A deadly drug)

Suppose we were looking at adverse events of a new drug. Now suppose p=0.051 for evidence that the drug increases the rate of deaths. Now, if we used p=0.05 as a cut-off then it’s great. No evidence that the drug increases the rate of deaths — let’s put it into production. Now imagine that p=0.049 of an increase in the rate of deaths. Oh no! There’s evidence that the drug is harmful. Let’s not put it into production.

假设我们正在研究一种新药的不良React。现在假设p = 0.051作为该药物增加死亡率的证据。现在，如果我们使用p = 0.05作为临界值，那就太好了。没有证据表明这种药物会增加死亡率，我们将其投入生产。现在，假设死亡率增加了p = 0.049。不好了！有证据表明这种药物有害。我们不要将其投入生产。

Mathematically, there’s not really a difference between the two. They are essentially the same. But by using this arbitrary cut off we reach very different conclusions.

从数学上来说，两者之间并没有真正的区别。它们本质上是相同的。但是，通过使用这种任意截断，我们得出了截然不同的结论。

这种药物有效吗 (Does this drug work)

Now imagine another drug. We’ve got a very large sample (n=10,000) and we want to know whether this drug cures cancer. So we get p=0.049 that it cures cancer. Great! Significant evidence this drug cures cancer. Let’s give it to everyone.

现在想象另一种药物。我们有一个非常大的样本(n = 10,000)，我们想知道这种药物是否可以治愈癌症。因此我们得到p = 0.049可以治愈癌症。大！重要证据表明该药可治愈癌症。让我们给大家。

Though, it’s a large sample. Wouldn’t we expect p to be smaller? It’s not that strong evidence against the null hypothesis. There’s approximately a one in twenty chance that our results are down to chance. Now suppose this drug is really expensive. Do we really want to start giving it out to everyone based on some fairly weak evidence? Probably not.

虽然，这是一个很大的样本。我们难道不希望p变小吗？并非没有证据支持原假设。我们的结果接近偶然的可能性大约为十分之一。现在假设这种药真的很贵。我们是否真的要根据一些相当薄弱的证据开始向所有人分发？可能不是。

Now of course if p=0.001 this would be a one in a hundred chance that our results our down to chance. This would be much stronger evidence that the drug works.

当然，现在如果p = 0.001，这将是我们得出结果的机会的百分之一。这将是该药有效的更有力证据。

那么我们应该如何解释p值呢？ (So how should we interpret p-values?)

As a continuous scale. The smaller the p-value is, the stronger the evidence is. But, you should take the sample size and effect size into account. You should also consider whether you are looking at something positive or negative. If looking at something like our deadly drug example, we should be concerned even if the evidence is very weak. However, with something like wanting to know whether a drug works, we can afford to be much more sceptical about our result.

作为连续的规模。 p值越小，证据越强。但是，您应该考虑样本大小和效果大小。您还应该考虑看的是正面还是负面。如果以类似我们致命毒品的例子来看，即使证据不足，我们也应予以关注。但是，由于想知道某种药物是否有效，我们可以对我们的结果持怀疑态度。

So, hopefully in the future, you’ll stop using p=0.05 as some threshold picked out of threshold and consider it as what it truly is — the weight of evidence against the null hypothesis. And, of course, if you don’t have the evidence you need that isn’t necessarily because it doesn’t exist it could be that you lack statistical power to detect an effect.

因此，希望在将来，您将停止使用p = 0.05作为从阈值中选出的某个阈值，并将其视为真正的阈值-反对原假设的证据权重。而且，当然，如果您没有所需的证据，不一定是因为该证据不存在，可能是您缺乏统计能力来检测效果。

翻译自: https://towardsdatascience.com/stop-using-p-0-05-4a059e622c75

查看全文

http://www.taodudu.cc/news/show-994802.html

成像数据更好的展示_为什么更多的数据并不总是更好
vue domo网站_DOMO与Tableau-逐轮
每个人都应该使用的Python 3中被忽略的3个功能
数据探查_数据科学家，开始使用探查器
从ncbi下载数据_如何从NCBI下载所有细菌组件
线性插值插值_揭秘插值搜索
如果您不将Docker用于数据科学项目，那么您将生活在1985年
docker部署flask_使用Docker，GCP Cloud Run和Flask部署Scikit-Learn NLP模型
问卷假设检验 t检验_真实问题的假设检验
大数据技术学习之旅_为什么聚焦是您数据科学之旅的关键
无监督学习 k-means_无监督学习-第4部分
深度学习算法原理_用于对象检测的深度学习算法的基本原理
软件本地化 pdf_软件本地化与标准翻译
数据库不停机导数据方案_如何计算数据停机成本
python初学者_面向初学者的20种重要的Python技巧
贝叶斯网络建模
数据科学家数据分析师_使您的分析师和数据科学家在数据处理方面保持一致
python db2查询_如何将DB2查询转换为python脚本
爱因斯坦提出的逻辑性问题_提出正确问题的重要性
餐厅数据分析报告_如何使用数据科学选择理想的餐厅设计场所
熊猫直播使用什么sdk_没什么可花的-但是16项基本操作才能让您开始使用熊猫
关系型数据库的核心单元是_核中的数据关系
小程序国际化_在国际化您的应用程序时忘记的一件事
robo 3t连接_使用robo 3t studio 3t连接到地图集
软件需求规格说明书通用模版_通用需求挑战和机遇
一类动词二类动词三类动词_基于http动词的完全无效授权技术
一年了
将DataSet中的操作更新到Access数据库
我喜欢的一首歌--《幸福的瞬间》
XForum 里用 Filter 编程实现安全访问控制

停止使用p = 0.05相关推荐

python尝试不同的随机数进行数据划分、使用卡方检验依次计算不同随机数划分下训练接和测试集所有分类特征的卡方检验的p值，如果所有p值都大于0.05则训练集和测试集都具有统计显著性、数据划分合理
python尝试不同的随机数进行数据划分.使用卡方检验依次计算不同随机数划分下训练接和测试集所有分类特征(categorical)的卡方检验的p值,如果所有p值都大于0.05则退出循环.则训练集和测试 ...
Java黑皮书课后题第5章：*5.30（金融应用：复利值）假设你每月在储蓄账户上多存100美元，年利率为5%，那么每月利率是0.05 / 12 = 0.00417。编写程序提示用户输入数据显示定月钱数
5.30(金融应用:复利值)假设你每月在储蓄账户上多存100美元,年利率为5%,那么每月利率是0.05 / 12 = 0.00417.编写程序提示用户输入数据显示定月钱数题目题目概述破题代码 ...
httpsurlconnection 写不进去authorization值_23. 假设检验的时候为什么常写p lt; 0.05,而不写具体的p值？...
在进行假设检验的时候,如果p值小于设定的临界值,比如0.05或0.01.0.001等,人们常常会写p<0.05.p<0.01.p<0.001, 而没有写具体的p值.这种传统是曾经的技 ...
Adonis结果P值小于0.05，一定代表两组样品物种构成差异显著吗？
前情回顾方差分析基本概念:方差分析中的"元"和"因素"是什么? PERMANOVA原理解释:这个统计检验可用于判断PCA/PCoA等的分群效果是否显著! 实战 ...
aic值检验 p值_23. 假设检验的时候为什么常写p lt; 0.05,而不写具体的p值？
在进行假设检验的时候,如果p值小于设定的临界值,比如0.05或0.01.0.001等,人们常常会写p<0.05.p<0.01.p<0.001, 而没有写具体的p值.这种传统是曾经的技 ...
【R语言】他说每个生存曲线一定要看到p值，不能0.05，0.01，0.001
前言起初听到这个我是不理解的,这不是统计学常识吗?划分三个程度:* ,** ,***. 头儿咋还要精确到小数位,不是画蛇添足吗?不了解归不了解,该干还是要干. 目录前言一.P值二.生存分析三 ...
p值＞0.05,统计意义上不显著？
其实,很多非统计学专业的朋友或者没有阅读学习过统计或计量相关书籍的朋友会对各统计分析或数据分析软件上的数值迷惑住.那么在进行参数估计时,p值和显著性水平(α)以及统计显著性之间有什么样的关系?如何使结 ...
240次方在线计算机,0.05的240次方是多少
跟你分享下吧:(我现在的AC率是55.0*%) 收集物品不用说了吧,到处都爆的,刷不详野兽就可以了.满支线40次,刷满(建议不详)主线20次刷满. 各种任务去做完,循环任务做一次就好拉,做多了 ...
P小于0.05的P应该是大写的P还是小写的P
P小于0.05的P应该是大写的P还是小写的P 结论是,没有统一的规范, 可以是大写,也可是小写,但是在一篇paper中,要统一. https://www.sohu.com/a/270616304_74 ...

停止使用p = 0.05