标记偏见

“Beware of the HiPPO in the room” — The risks and dangers of top-down, intuition-based decision making are well known in the business world. Experimentation and data-based decision making become widely acknowledged as the right way to steer a business.

“当心机房中的HiPPO” —自上而下,基于直觉的决策制定的风险和危险在商业界众所周知。 实验和基于数据的决策被公认为是指导业务的正确方法。

For a good reason: Leading experimenters such as Netflix, Google and Booking show that making decisions based on facts and evidence rather than intuition can lead to exceptional business success.

有一个很好的理由:Netflix,Google和Booking等领先的实验者表明,根据事实和证据而不是凭直觉做出决策可以带来非凡的业务成功。

But what if in the course of this development the HiPPO (Highest Paid Person’s Opinion) is not the one to be afraid of anymore? What if the person that should help to fight top-down decision making took his place?

但是,如果在这种发展过程中,HiPPO(最高付费人士的意见)不再是一个令人恐惧的东西呢? 如果应该帮助反对自上而下的决策的人接任该怎么办?

What if the analyst is the new biasing factor in decision making?

如果分析师是决策中的新偏见因素怎么办?

Let me be clear. Having personal opinions about new ideas, suggestions and approaches couldn’t be more natural. We all have our cognitive biases. But we can not ignore this fact just because of the allegedly safe framework of evidence-based decision making.

让我清楚一点。 对新想法,建议和方法有个人见解不会再自然。 我们都有我们的认知偏见。 但是,我们不能仅仅因为所谓的基于证据的决策安全框架而忽视这一事实。

UnsplashUnsplash

偏差分析结果 (Biasing Analytics Results)

The analyst’s responsibility is to paint a clear picture of the business’ situation and inform decision-making. And there are plenty of ways how an analyst can, willingly or entirely unaware, bias the final decision that is made. I like to split those into conscious and unconscious biases.

分析师的责任是清楚地描述业务情况并为决策提供依据。 分析师可以通过多种方式自愿或完全不了解最终决策。 我喜欢将这些分为有意识的和无意识的偏见。

自觉偏见 (Conscious Biases)

Let’s have a look at the first kind, where analysts make deliberate decisions that will impact their results. Conscious biases are closely connected to the analyst’s personal opinion. This can be about a new marketing campaign or a new product feature. Whether the analyst believes in a specific idea can significantly impact how the following research is being conducted. And if it is conducted at all. Let’s have a look at a few potential sources for conscious biases:

让我们看一下第一种类型,在这种类型中,分析师做出会影响其结果的深思熟虑的决策。 有意识的偏见与分析师的个人看法紧密相关。 这可能与新的营销活动或新的产品功能有关。 分析人员是否相信特定想法会严重影响以下研究的进行方式。 如果是进行的话。 让我们看一下一些有意识的偏见的潜在来源:

“Can you just give us a rough estimate for this particular metric?”

“您能给我们这个特定指标的大概估算吗?”

a) Making Guesses. Questions for estimates and opinions are more or less an invitation for introducing biases. Obviously you can’t know all the numbers by heart and the best thing to do would be to go back to your desk, check the metrics and report them back. But checking every single metric costs too much time. Often enough, we simply trust our intuition, which strongly correlates with our personal opinion about a specific idea. So you make an educated guess about what the number might be. At this point, analysts can already substantially impact whether an initiative is pursued and what everybody’s expectations are. The first number one comes up with serves as an anchor figure and sets expectations stakeholders might reference in the future to assess an idea’s potential.

a)猜测。 有关估计和意见的问题或多或少地引起了人们引入偏见。 显然,您不能一味地知道所有数字,而最好的办法是回到办公桌前,查看指标​​并将其报告回来。 但是检查每个指标会花费太多时间。 通常,我们只是相信我们的直觉,这与我们对特定想法的个人看法紧密相关。 因此,您可以对数字可能进行合理的猜测。 在这一点上,分析人员已经可以对是否采取主动行动以及每个人的期望产生实质性影响。 第一个数字一个想出了作为锚人物和套的预期利益相关者在未来可能会引用到评估一个想法的潜力。

“Traffic is so low on this page, it’s not worth looking further into this.”

“此页面上的流量如此之低,因此不值得进一步研究。”

b) Giving Personal Opinions. Sometimes we might be tempted to provide no number at all and instead give a personal opinion. While this opinion is (hopefully) based on facts and the analyst’s experience, it can still strongly correlate with one’s subjective opinion about the idea discussed.

b)发表个人意见。 有时我们可能会不愿提供任何电话号码,而是发表个人意见。 尽管这种观点(希望)基于事实和分析师的经验,但仍可以与人们对所讨论想法的主观观点密切相关。

c) Depth of Research. After kicking off the research, the question is:

c)研究深度。 在开始研究之后,问题是:

When does an analyst have enough information to give a good recommendation or overview for a particular problem?

分析师何时有足够的信息为特定问题给出好的建议或概述?

Of course, you can always drill deeper into a specific topic to get more evidence to support a decision-making process. Analysts might be inclined to dig deeper into an area to prove or disprove a particular idea they have a strong opinion about. Simultaneously, we might put less effort into a research question where the outcome is expected to be less exciting or the decision that has to be made seems to be pretty trivial anyway.

当然,您总是可以更深入地研究特定主题,以获取更多证据来支持决策过程。 分析师可能倾向于更深入地研究某个领域,以证明或反对他们有强烈看法的特定想法。 同时,我们可能会在研究问题上投入更少的精力,因为预期结果不会那么令人兴奋,或者必须做出的决定似乎微不足道。

d) Setting targets. The analytics and experimentation landscape itself invites analysts and anybody who operates in it to introduce biases at some points. Be it setting the right significance or power level for an AB-test or selecting an appropriate metric to measure a new feature or a campaign’s success? Those are, to a certain degree, subjective decisions the analyst has to make to produce any results. But at the same time, those can have a significant effect on the actual outcome of the research.

d)设定目标。 分析和实验环境本身会邀请分析师和其中的任何人在某些时候引入偏见。 是为AB测试设置正确的重要性或功率级别,还是选择适当的度量标准来衡量新功能或活动的成功? 在某种程度上,这些是分析人员必须做出的主观决定才能产生任何结果。 但是同时,这些可能会对研究的实际结果产生重大影响。

UnsplashUnsplash

无意识的偏见 (Unconscious Biases)

Unconscious biases are not introduced by the analyst’s active decision making. This sort of bias is less connected to the personal opinion about a specific idea or research question but can have the same magnitude of impact on the results. Biases that fall into this category are for example:

分析人员的主动决策不会引入无意识的偏见。 这种偏见与对特定想法或研究问题的个人看法联系较少,但对结果的影响程度相同。 属于此类的偏差例如:

a) Programming Errors. Be it an error in a SQL-query, a wrong logic statement when filtering a pandas dataframe or an incorrect regex expression. All these programming errors can occur when we’re trying to get insights from the data in front of us. Other than syntax errors, this kind of programming error can remain wholly unnoticed when running our code and thus can have a substantial impact on the results of our analysis.

a)编程错误。 是SQL查询中的错误,过滤熊猫数据框时的错误逻辑语句还是不正确的正则表达式。 当我们试图从眼前的数据中获取洞察力时,所有这些编程错误都可能发生。 除了语法错误外,在运行我们的代码时,这种编程错误可能仍然完全未被注意到,从而可能对我们的分析结果产生重大影响。

b) Wrong handling of data. Usually, the data we want to examine to answer a particular research question does not come in a usable format. Before we can use a statistical model to derive insights from our data, we might have to clean it, select and engineer appropriate features, and eventually perform data transformations. All those actions can bias our dataset and thus our decisions in one direction or another.

b)错误处理数据。 通常,我们要检查以回答特定研究问题的数据不是可用的格式。 在使用统计模型从数据中获取见解之前,我们可能必须清理数据,选择和设计适当的功能,并最终执行数据转换。 所有这些动作都会使我们的数据集产生偏差,从而使我们的决策朝着一个方向或另一个方向倾斜。

c) Wrong interpretation of data. We might have done everything done when handling and modelling our dataset. But in the end, we can still derive the wrong conclusions from the results in front of us. Classic misinterpretations are confusing correlation with causation or drawing the wrong conclusions about the relationship of two parameters.

c)数据解释错误。 在处理和建模数据集时,我们可能已经完成了所有工作。 但是最后,我们仍然可以从我们面前的结果中得出错误的结论。 经典的误解使因果关系变得混乱,或者对两个参数之间的关系得出了错误的结论。

那么分析师是不值得信任的吗? (So is the analyst to be mistrusted?)

Today’s most valuable companies such as Netflix, Amazon and Google show that experimentation and data have to replace intuition as a basis for making decisions.

如今,诸如Netflix,Amazon和Google之类的最有价值的公司表明,实验和数据必须取代直觉作为决策的基础。

Hence having trust in the data and therefore the analyst’s output is essential. It’s the analyst’s responsibility to build and maintain that trust. Analysts have to do their best to provide unbiased, informative insights to support decision-making and drive businesses in the right direction.

因此,对数据以及对分析人员的输出的信任至关重要。 建立和维护这种信任是分析师的责任。 分析师必须尽力提供​​公正,有用的见解,以支持决策制定并推动业务朝着正确的方向发展。

Therefore, it is imperative to be aware of your own biases and to overcome them where possible.

因此,必须意识到自己的偏见并在可能的情况下克服它们。

When being asked for a rough estimate or when thinking about how deep you want to drill into a specific topic, take a step back. Reflect on your decision and thought process and try to get a neutral perspective on your current issue.

当被要求进行粗略估算或考虑要深入到特定主题的深度时,请后退一步。 反思您的决策和思考过程,并尝试对当前问题持中立观点。

To avoid unconscious biases, it helps to adopt some best practices from the world of software engineering: Use unit tests in your queries and notebooks, start pair programming and ask colleagues to review your code and approach.

为避免无意识的偏见,它有助于采用一些软件工程领域的最佳实践:在查询和笔记本中使用单元测试,开始结对编程,并要求同事审查您的代码和方法。

Liked this article? Then you might also like one of my other posts:

喜欢这篇文章吗? 然后,您可能还会喜欢我的其他帖子之一:

翻译自: https://towardsdatascience.com/the-analysts-bias-5c84825c0f48

标记偏见


http://www.taodudu.cc/news/show-995390.html

相关文章:

  • 用户体验数据分析 书单_如何使用数据改善用户体验设计
  • 熊猫数据集_熊猫迈向数据科学的第一步
  • 为什么选择做班级管理系统_为什么即使在平衡的班级下准确性也很麻烦
  • netflix的准实验面临的主要挑战
  • 微观计量经济学_微观经济学与数据科学
  • 熊猫数据集_熊猫迈向数据科学的第二部分
  • 为什么我要重新开始数据科学
  • 数据科学的发展_数据科学的发展与发展
  • 并发插入数据库会导致失败吗_会导致业务失败的数据分析方法
  • 机器学习模型部署_9月版部署机器学习模型
  • 女朋友天天气我怎么办_关于我的天气很奇怪
  • 第一次马拉松_成为数据科学家是一场马拉松而不是短跑
  • 云原生数据库_数据标签竞赛云原生地理空间冲刺
  • 抽象类细分举行_什么是用聚类技术聚类的客户细分
  • java 在底图上绘制线条_使用底图和geonamescache绘制k表示聚类
  • 电子表格转换成数据库_创建数据库,将电子表格转换为关系数据库,第1部分...
  • 求一个张量的梯度_张量流中离散策略梯度的最小工作示例2 0
  • 离群值如何处理_有理处理离群值的局限性
  • ppt图表图表类型起始_梅科图表
  • 现实世界 机器学习_公司沟通分析简介现实世界的机器学习方法
  • 数据中心细节_当细节很重要时数据不平衡
  • 余弦相似度和欧氏距离_欧氏距离和余弦相似度
  • 机器学习 客户流失_通过机器学习预测流失
  • 预测股票价格 模型_建立有马模型来预测股票价格
  • 柠檬工会_工会经营者
  • 大数据ab 测试_在真实数据上进行AB测试应用程序
  • 如何更好的掌握一个知识点_如何成为一个更好的讲故事的人3个关键点
  • 什么事数据科学_如果您想进入数据科学,则必须知道的7件事
  • 季节性时间序列数据分析_如何指导时间序列数据的探索性数据分析
  • 美团骑手检测出虚假定位_在虚假信息活动中检测协调

标记偏见_分析师的偏见相关推荐

  1. 人工智能也存在偏见?探究人工智能偏见的识别和管理

    摘译 | 李朦朦/赛博研究院实习研究员 来源 | NIST 2022年3月16日,美国国家标准与技术研究院(NIST)发布了<迈向识别和管理人工智能偏见的标准>(以下简称<标准> ...

  2. 标记偏见_协作和透明的机器学习可消除偏见

    标记偏见 No one wants bias in their organization. Underrepresentation has plagued the business world for ...

  3. 标记偏见_如何(巧妙地)扭曲视觉效果以支持您的偏见叙事

    标记偏见 Data is important - it is the logical justification for world-changing decisions. Unfortunately ...

  4. 如何识别媒体偏见_描述性语言理解,以识别文本中的潜在偏见

    如何识别媒体偏见 TGumGum can do to bring change by utilizing our Natural Language Processing technology to s ...

  5. 怎样避免无意识偏见_精神病学意识到大数据和人工智能的价值和偏见

    怎样避免无意识偏见 Cure her of that! Canst thou not minister to a mind diseased, pluck from the memory a root ...

  6. 如何识别媒体偏见_超越偏见:为什么我们不能仅仅“修正”面部识别

    如何识别媒体偏见 In recent months, the troubling issue of facial recognition has hit headlines. 近几个月来,令人困扰的面 ...

  7. 什么是认知偏见_偏见

    什么是认知偏见 The term Artificial Intelligence has been in use since 1955. AI pioneer, John McCarthy descr ...

  8. 图片 标记 软件_如何设计软件功能标记

    图片 标记 软件 A previous company had a problem: our deploys were thousands of lines in size, took nearly ...

  9. 标记图标_标记您的图标

    标记图标 Not labeling your icons is the same as assuming that we are all fluent in ancient hieroglyphics ...

最新文章

  1. vue-router路由基础
  2. k8s master 节点加入到可以调配node节点中的命令
  3. 带你学python基础:变量和基本数据类型
  4. 北京师范大学网络教育期末考试计算机,北京师范大学网络教育———《计算机应用基础》第二章同步练习题(4)...
  5. Redirecting to binsystemctl start crond.service
  6. 为什么awt_为AWT的机器人创建DSL
  7. 如何用区块链保障数据安全和承载数据确权
  8. linux+mysql登录日志_Linux查看登录日志
  9. mybatis3 添加ehcache支持
  10. 时间序列之平稳时间序列预测、趋势型序列预测、复合型序列预测
  11. 高版本node,无法编译程序,删除所有文件
  12. 中考考试的指令广播_中考考试指令系统使用的说明.doc
  13. 一篇文章告诉你大数据的重要性
  14. websocket连接不上
  15. 系统架构设计的一点思考
  16. 生活-啤酒鸡 鸡啤酒 酒啤鸡 啤鸡酒- 做法
  17. 百度优化客服电话搜索
  18. 地理信息系统-坐标系统
  19. Tiny 4412 lcd 驱动分析
  20. C++语法特性cheat paper

热门文章

  1. 计算机网络【六】网络层协议
  2. CodeForces 1138B暴力+剪枝
  3. C++:String的写时拷贝
  4. 【数据结构】快速排序非递归算法及其改进
  5. 宏定义对调试代码的作用
  6. 资深Android开发带你入门Framework,再不刷题就晚了!
  7. asp.net core Serilog的使用
  8. hdu_2048 错排问题
  9. Android RecyclerView (一) 使用完全解析
  10. java基础之匿名内部类