Machine Learning System Design

5 试题

1.

You are working on a spam classification system using regularized logistic regression. "Spam" is a positive class (y = 1) and "not spam" is the negative class (y = 0). You have trained your classifier and there are m = 1000 examples in the cross-validation set. The chart of predicted class vs. actual class is:

  Actual Class: 1 Actual Class: 0
Predicted Class: 1 85 890
Predicted Class: 0 15 10

For reference:

  • Accuracy = (true positives + true negatives) / (total examples)
  • Precision = (true positives) / (true positives + false positives)
  • Recall = (true positives) / (true positives + false negatives)
  • F1 score = (2 * precision * recall) / (precision + recall)

What is the classifier's recall (as a value from 0 to 1)?

Enter your answer in the box below. If necessary, provide at least two values after the decimal point.

2.

Suppose a massive dataset is available for training a learning algorithm. Training on a lot of data is likely to give good performance when two of the following conditions hold true.

Which are the two?

The classes are not too skewed.

A human expert on the application domain

can confidently predict y when given only the features x

(or more generally, if we have some way to be confident

that x contains sufficient information to predict y

accurately).

Our learning algorithm is able to

represent fairly complex functions (for example, if we

train a neural network or other model with a large

number of parameters).

When we are willing to include high

order polynomial features of x (such as x21, x22,

x1x2, etc.).

3.

Suppose you have trained a logistic regression classifier which is outputing hθ(x).

Currently, you predict 1 if hθ(x)≥threshold, and predict 0 if hθ(x)ltthreshold, where currently the threshold is set to 0.5.

Suppose you decrease the threshold to 0.1. Which of the following are true? Check all that apply.

The classifier is likely to now have higher recall.

The classifier is likely to have unchanged precision and recall, but

higher accuracy.

The classifier is likely to now have higher precision.

The classifier is likely to have unchanged precision and recall, but

lower accuracy.

4.

Suppose you are working on a spam classifier, where spam

emails are positive examples (y=1) and non-spam emails are

negative examples (y=0). You have a training set of emails

in which 99% of the emails are non-spam and the other 1% is

spam. Which of the following statements are true? Check all

that apply.

If you always predict non-spam (output

y=0), your classifier will have 99% accuracy on the

training set, and it will likely perform similarly on

the cross validation set.

If you always predict non-spam (output

y=0), your classifier will have an accuracy of

99%.

A good classifier should have both a

high precision and high recall on the cross validation

set.

If you always predict non-spam (output

y=0), your classifier will have 99% accuracy on the

training set, but it will do much worse on the cross

validation set because it has overfit the training

data.

5.

Which of the following statements are true? Check all that apply.

It is a good idea to spend a lot of time

collecting a large amount of data before building

your first version of a learning algorithm.

If your model is underfitting the

training set, then obtaining more data is likely to

help.

On skewed datasets (e.g., when there are

more positive examples than negative examples), accuracy

is not a good measure of performance and you should

instead use F1 score based on the

precision and recall.

After training a logistic regression

classifier, you must use 0.5 as your threshold

for predicting whether an example is positive or

negative.

Using a very large training set

makes it unlikely for model to overfit the training

data.

Machine Learning week 6 quiz: Machine Learning System Design相关推荐

  1. Machine Learning week 7 quiz: Unsupervised Learning

    Unsupervised Learning 5 试题 1. For which of the following tasks might K-means clustering be a suitabl ...

  2. 【架构师必知必会系列】系统架构设计需要知道的5大精要(5 System Design fundamentals)...

    无论是在大厂还是初创公司,技术产品经理 (TPM)都需要具备系统设计的基础知识.从历史上看,系统设计基础知识通常是软件工程师在面试时的要求,而 TPM 不受此期望的约束.然而,现在趋势正在发生变化.作 ...

  3. Coursera公开课笔记: 斯坦福大学机器学习第十一课“机器学习系统设计(Machine learning system design)”

    Coursera公开课笔记: 斯坦福大学机器学习第十一课"机器学习系统设计(Machine learning system design)" 斯坦福大学机器学习斯坦福大学机器学习第 ...

  4. 阅读笔记|Ensemble Learning for Multi-Source Neural Machine Translation

    原文链接: Ensemble Learning for Multi-Source Neural Machine Translation Abstract 本文描述和评价了神经机器翻译(NMT)中进行集 ...

  5. 转【重磅干货整理】机器学习(Machine Learning)与深度学习(Deep Learning)资料汇总

    原文出处:http://blog.csdn.net/zhongwen7710/article/details/45331915 介绍:这是一篇介绍机器学习历史的文章,介绍很全面,从感知机.神经网络.决 ...

  6. 机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现

    网址:https://github.com/NLP-LOVE/ML-NLP 此项目是机器学习(Machine Learning).深度学习(Deep Learning).NLP面试中常考到的知识点和代 ...

  7. 机器学习(Machine Learning)amp;深度学习(Deep Learning)资料

    机器学习(Machine Learning)&深度学习(Deep Learning)资料 機器學習.深度學習方面不錯的資料,轉載. 原作:https://github.com/ty4z2008 ...

  8. (转)Paper list of Meta Learning/ Learning to Learn/ One Shot Learning/ Lifelong Learning

    Meta Learning/ Learning to Learn/ One Shot Learning/ Lifelong Learning 2018-08-03 19:16:56 本文转自:http ...

  9. machine ID问题 ( u-boot设置machine id , set mach_type 0x16a )

    machine ID问题 machine ID问题跟CPU具体到型号相关,启动时打印信息如下: Uncompressing Linux................................. ...

最新文章

  1. 特斯拉全新自动驾驶芯片最强?英伟达回怼,投资者用脚投票
  2. kubernetes入门指南(一)
  3. idea webapp目录404问题,war包方式运行
  4. 灯泡里的钨丝是怎么放进去的,这个视频解开我20多年的疑惑!
  5. getHibernateTemplate()的find用法大全
  6. lenovo电脑_办公笔记本电脑有哪些值得推荐?
  7. svn中文语言安装包使用
  8. win8修复计算机u盘,Win8系统U盘里的文件不见了怎么办?
  9. 对我启发最大的数学学习方法(转自知乎)
  10. vbs表白小程序,女生同意才能关掉。
  11. DiskGenius屏蔽硬盘坏道方法
  12. hp服务器 iLO远程管理 python api模块 hpilo-python 接口列表
  13. Zookeeper 深入进阶 —— ZAB 协议
  14. 佛祖保佑永无Bug—— 最好用的注释插件
  15. manjaro下6.828实验环境配置
  16. vue2升级到 Vue3的异同(入门须知)
  17. python中的阶乘求和公式_Python阶乘求和的方法
  18. oauth2-authorization-server;oauth2-resource-server;oauth2-client
  19. AI遮天传 ML-回归分析入门
  20. PW4056H充电芯片,1A可调充电

热门文章

  1. 将pdf转换html_pdf文件怎么转换成html网页格式?用什么方法来转换?
  2. Spring-AOP @AspectJ切点函数之target()和this()
  3. 学习笔记Hadoop(二)—— Hadoop介绍(2)——Hadoop 核心组件
  4. Lock锁的基本使用
  5. 解决:The application could not be installed: INSTALL_FAILED_SHARED_USER_INCOMPATIBLE
  6. java绘图板_Java中的画图板简单功能实现
  7. php 正则获取数字,php结合正则获取字符串中数字的几种方法
  8. 战神笔记本电脑自带access吗_双·12超值价 十代酷睿+GTX1650游戏本只卖4219元_神舟 战神Z7M-CU5NB_笔记本新闻...
  9. 艺术签名python_个性签名设计五十行Python轻松实现
  10. pc双网卡实现路由转发_route命令如何使用?如何利用route命令实现内外网切换?...