特异性敏感性的意义

How do we evaluate how well a machine learning classifier or test model performs? How do we know if a medical test is reliable enough to use in a clinical setting?

^ h流量做我们评估如何以及机器学习分类或测试模型执行? 我们如何知道医学测试是否足够可靠以在临床环境中使用?

While a highly accurate coronavirus test may be useful where there is a higher incidence, why is it less informative in populations with lower rates of disease? This sounds counterintuitive and confusing, but does have applications for determining the utility of your own binary classifiers!

尽管在发生率较高的地方使用高精度的冠状病毒检测可能很有用,但为什么在疾病发生率较低的人群中它的信息量较少? 这听起来违反直觉和令人困惑,但是确实有一些应用程序可以确定您自己的二进制分类器的实用程序!

We define the validity of a test by measuring its specificity and sensitivity. Quite simply, we want to know how often the test identifies true positives and true negatives.

我们通过测量其特异性和敏感性来定义测试的有效性。 简而言之,我们想知道测试多久会发现真实的阳性和阴性的。

Our sensitivity describes how well our test catches all of our positive cases. Sensitivity is calculated by dividing the number of true-positive results by the total number of positives (which include false positives).

我们的敏感性描述了我们的测试对所有阳性病例的捕捉程度。 灵敏度是通过将真实阳性结果的数目除以阳性总数(包括假阳性)而得出的。

Our specificity describes how well our test classifies negative cases as negatives. Specificity is calculated by dividing the number of true-negative results by the total number of negatives (which include false negatives).

我们的特异性描述了我们的测试将阴性案例分类为阴性的效果。 特异性是通过将阴性结果为真阴性结果的总数除以阴性结果总数(包括假阴性结果)而得出的。

FeanDoe / CC BY-SA (FeanDoe / CC BY-SA( https://creativecommons.org/licenses/by-sa/4.0)https://creativecommons.org/licenses/by-sa/4.0 )

The important question is whether a model is meaningful? Simply going by sensitivity and specificity rates won’t cut it! To determine how meaningful or clinically useful a test might be for a population, we need underlying information about the expected incidence or prevalence of a disease. We use Bayes’ Theorem to understand this:

重要的问题是模型是否有意义? 仅凭敏感性和特异性率就无法解决问题! 为了确定测试对人群的意义或临床意义,我们需要有关疾病的预期发病率或患病率的基础信息。 我们使用贝叶斯定理来理解这一点:

We take a population of 1 000 000 people, where 10% of them have a certain disease. We use a very reliable test with 98% specificity and sensitivity. Here Event A describes the unconditional probability of this disease in the population. P(A) = 0.10.

我们的人口为100万人,其中10%患有某种疾病。 我们使用具有98%特异性和敏感性的非常可靠的测试。 在这里, 事件A描述了该疾病在人群中的无条件概率。 P(A)= 0.10

Event B is the unconditional probability of our test coming up positive. We can calculate P(B) by looking at how many total positives we would get. In this population, we expect 98 000 true positives, calculated by multiplying the disease rate by the total population and sensitivity. For false positives, we take the probability of not having this disease (0.90) and multiple it by the population and (1-Specificity). So we get 18 000 false positives in this scenario. Our P(B) or total positives is then 11.6%.

事件B是我们的测试呈阳性的无条件概率。 我们可以通过查看获得的总阳性数来计算P(B) 。 在该人群中,我们期望通过将疾病发生率乘以总人群和敏感性得出的98 000个真实阳性。 对于误报,我们将不患这种疾病的概率设为(0.90) ,然后乘以总体(1-Specificity)并将其乘以。 因此,在这种情况下,我们得到18000个误报。 那么我们的P(B)或总正值就是11.6%。

Now things get a little bit more complicated. All of these values describe how accurate this test is for this population. But it doesn’t tell us the chances that one person who tests positive has the disease. We need to apply Bayes’ Theorem, using these unconditional values as our prior assumption.

现在事情变得更加复杂了。 所有这些值都说明此测试对该人群的准确性。 但这并没有告诉我们测试阳性的人患上这种疾病的机会。 我们需要应用贝叶斯定理,将这些无条件值用作我们的先前假设。

Created by your friendly neighbourhood author
由您的友好邻里作者创建

Then if someone takes a test that comes out positive, what is the probability that the disease is present?

然后,如果有人进行了阳性检测,那么该疾病存在的可能性是多少?

Here we can start to define our variables.

在这里,我们可以开始定义变量。

  • P(A) = 0.10

    P(A)= 0.10

  • P(B) = 0.116

    P(B)= 0.116

  • P(B|A) describes the probability of getting a positive result regardless of whether it’s a true-positive or not, while P(A) is the presence of the disease. Thus P(B|A) is our sensitivity. P(B|A) = 0.98.

    P(B | A)描述获得阳性结果的可能性,而不管其是否为真阳性,而P(A)是疾病的存在。 因此,P(B | A)是我们的灵敏度。 P(B | A)= 0.98。

  • P(A|B) = 0.98 * 0.1 / 0.116 = 84.5%

    P(A | B) = 0.98 * 0.1 / 0.116 = 84.5%

So here we see that even with high sensitivity and specificity, the test may not be as accurate in some populations. Using Bayes’ Theorem, we can calculate this quite easily.

因此,在这里我们看到,即使具有高灵敏度和特异性,该检测在某些人群中也可能不那么准确。 使用贝叶斯定理,我们可以很容易地计算出这一点。

What would happen though if the disease was less common in our population? Recall that sensitivity and specificity remain at 98%.

但是,如果这种疾病在我们的人群中不那么普遍,会发生什么呢? 回想一下,敏感性和特异性仍保持在98%。

  • P(A) = 0.01

    P(A) = 0.01

  • P(B) = (True Positives + True Negatives)/Total Population = (0.01*0.98 + 0.02*0.99)/10000000 = (9800 + 19800)/1000000 = 0.0296

    P(B) =(真阳性+真阴性)/总人口=(0.01 * 0.98 + 0.02 * 0.99)/ 10000000 =(9800 + 19800)/ 1000000 = 0.0296

  • P(A|B) = 0.98 * 0.01 / 0.296 = 33.1%

    P(A | B) = 0.98 * 0.01 / 0.296 = 33.1%

Since the disease is now rarer, the posterior probability of receiving a positive test result when you have the disease is lower. As the prevalence of disease decreases within the population, so does our positive predictive value!

由于这种疾病现在比较罕见,因此患这种疾病时获得阳性检测结果的后继可能性较低。 随着人口中疾病流行率的降低,我们的积极预测价值也将下降

When generating a biomedical test or another model with binary classification, keep in mind when it might be useful. Looking at how well our test works for our specific population because with a low enough prevalence of disease — it might not be very useful!

在生成生物医学测试或具有二进制分类的其他模型时,请记住可能有用。 看看我们的测试对特定人群的效果如何,因为疾病的患病率很低-可能不是很有用!

It follows that if you’re generating a classifier for something that might be rare in the general population, you need very high sensitivity and specificity for a high positive predictive value!

因此,如果要针对一般人群中罕见的事物生成分类器,则需要非常高的敏感性和特异性才能获得较高的阳性预测值!

翻译自: https://towardsdatascience.com/sensitivity-specificity-and-meaningful-classifiers-8326738ec5c2

特异性敏感性的意义


http://www.taodudu.cc/news/show-3021152.html

相关文章:

  • 降低CSS特异性的策略
  • CSS 特异性
  • CSS 特异性 specificity
  • 机器学习中的敏感性和特异性的概念
  • CSS 特异性看这篇就行了
  • 单片机的流水灯程序
  • 80c51流水灯程序汇编语言,单片机闪烁灯流水灯汇编代码大全
  • 单片机 c语言 p1控制流水灯,单片机控制的流水灯程序
  • stm32流水灯c语言程序,stm32流水灯程序源码
  • 单片机c语言程序编写步骤,用c语言编写单片机流水灯程序详解
  • Android All flavors must now belong to a named flavor dimension
  • 解决Error:All flavors must now belong to a named flavor dimension.
  • ERROR: All flavors must now belong to a named flavor dimension.
  • Android 报错处理:All flavors must now belong to a named flavor dimension,Learn more at https://d.android
  • 如何解决All flavors must now belong to a named flavor dimension.?
  • 【错误记录】Gradle 配置 productFlavors 报错 ( All flavors must now belong to a named flavor dimension. )
  • 解决Error:All flavors must now belong to a named flavor dimension
  • 解决AS升级3.4 ERROR: All flavors must now belong to a named flavor dimension.Learn more at https://d.and
  • gradle ERROR: All flavors must now belong to a named flavor dimension.
  • flutter 多渠道打包 不同环境不同包名 product flavors flavorDimensions
  • 解决Error:All flavors must now belong to a named flavor dimension. Learn more at...
  • build type和product flavors
  • Error:All flavors must now belong to a named flavor dimension. Learn more at
  • 解决gradle编译报Error:All flavors must now belong to a named flavor dimension.问题
  • android studio int转,Android Studio Flavors的妙用(转)
  • All flavors must now belong to a named flavor dimension.
  • flutter需要定义product flavors
  • All flavors must now belong to a named flavor dimension
  • 解决Error:All flavors must now belong to a named flavor dimension. Learn more at https://d.android.com
  • Android Studio Flavors详解

特异性敏感性的意义_敏感性特异性和有意义的分类相关推荐

  1. 生成对抗网络的背景与意义_生成对抗网络的意义

    生成对抗网络的背景与意义 为什么我们需要生成对抗网络:-(Why do we need Generative Adversarial Network: -) If we show a lot and ...

  2. correl函数相关系数大小意义_矩阵的转置的意义

    这个问题下面的回答 矩阵的转置的意义是什么?​www.zhihu.com 因为老有人点看,所以就补充下放在这里. 同一个对象的两个等价描述,也可以说是同一个对象分别在两个空间中的描述.这两个空间的相互 ...

  3. python研究背景和意义_一研究背景和意义

    一研究背景和意义 参考腾发量或参考作物腾发量是水文学的一个基础性概念. 它是 世界粮农组织于 1977 年首次完整定义的 「 1 」 系指高度均匀一致 (8-15cm) ,生长旺盛.无病虫害,完全覆盖 ...

  4. pstate0 vid数值意义_光行差成因和物理意义新解及其验证方法

    作者:彭晓韬 日期:2019.06.03 [文章摘要]:光行差常数被广泛用来证明光速恒定的同时,又被用来证明光速遵循经典物理学的速度叠加原理.这种相互矛盾的现象到底是怎么回事呢?本文通过详细分析得出: ...

  5. 参与社团活动的意义_开展社团活动的意义

    各位老师. 同学们, 早上好! 今天我讲话的主题是 "谈谈学生社团活动的意义" 本学期我校 19 个社团活动,在课余生活中呈现出热火朝天.一派繁忙的景 象.社团活动日益蓬勃,功能日 ...

  6. 参与社团活动的意义_参加社团活动的意义为主题写一篇英语作文

    共回答了12个问题采纳率:91.7% Welcome to join consortium Senior Year is the very important to enter college.For ...

  7. p值 统计学意义_统计学意义不重要为什么p值不应过高

    p值 统计学意义 Have you ever heard somebody say that a study revealed „significant results"? What doe ...

  8. 分数阶导数的意义_导数的意义

    分数阶导数的意义 钙衍生物(CALCULUS DERIVATIVES) After derivative theory posts, we will start to see some of the ...

  9. linux文件目录含义,Linux中文件权限目录权限的意义及权限对文件目录的意义

    linux中目录与文件权限的意义 一.文件权限的意义 r:可以读这个文件的具体内容: w:可以编辑这个文件的内容,包括增加删除文件的具体内容: x:文件就具有了可执行的权限-------注意:这里和w ...

最新文章

  1. 食出100分:‘粥’的做法4---鱼片瘦肉粥
  2. php空格是什么,php删除空格函数是什么
  3. 水晶报表主子报表分页问题
  4. 微软笔试题 2013暑期实习笔试题目
  5. 应用中心最佳实践之——使用应用组完成多集群一键部署
  6. ServletContext_功能_域对象
  7. Spring注解编程基石(三)
  8. WebAPIs移动端特效——不看你就亏大了
  9. javaScript面向对象是什么?(一)
  10. laravel 验证用户是否登陆显示不同内容
  11. 如何不用MDI方式在Form1上显示Form2
  12. php 数据分流,php实现请求分流 - osc_6fvwlc7h的个人空间 - OSCHINA - 中文开源技术交流社区...
  13. MySQL|Aborted connection 日志分析
  14. 超图(idesktop iserver10) 处理osgb倾斜摄影和tif并 发布 加载 ---2---连续更新
  15. 代码审计之JAVA代码审计洞态IAST系统以及SecExample靶场
  16. 键盘快捷键 —— 有用
  17. LOG_PATH_IS_UNDEFINED 解决
  18. 5G场强测试关键指标
  19. Spring Cloud Eureka服务治理
  20. springboot多数据源使用

热门文章

  1. velocity学习(2)--VTL 语法
  2. Java实现 LeetCode 522 最长特殊序列 II(查找最长的非子序列的长度)
  3. 工具类APP的生存之道
  4. 【GlobalMapper精品教程】020:Lidar点云数据分类(自动分类、手动分类)案例详解
  5. springboot整合 neo4j (OGM+JPA方式操作图数据库)
  6. 1443:【例题4】Addition Chains
  7. Gnostice PDFToolkit自动图文集的页码
  8. python 02 --python从安装到学习精品网站(廖雪峰官方网站)
  9. C++核心准则R.33: 表达函数会重置widget时,使用unique_ptr(widget)​作参数
  10. PyQT5 (四十六) 在 QTableWidget 表格中设置合并单元格 的案例