航行金税盘

by Jordan Dworkin

通过乔丹德沃金

通过陌生事物的情感进行统计好奇心航行 (A Statistical Curiosity Voyage Through the Emotion of Stranger Things)

Like a few million other people, I spent the weekend of Oct. 27, 2017, watching Stranger Things 2, and the following week dealing with minor withdrawal.

像其他几百万人一样,我度过了2017年10月27日的那个周末,看了《陌生人事物》 2,接下来的一周是处理小额提款的事情。

To cope, and to transition back into my research, I decided to apply the statistical paddles of sentiment analysis and network analysis to the scripts from Seasons 1 and 2, and to consider what the results might say about the emotional structure of each episode of the show.

为了应对并过渡到我的研究,我决定将情感分析和网络分析的统计技巧应用于第1季和第2季的剧本,并考虑结果可能对每个情节的情感结构有何影响表演。

To begin understanding the emotion of Stranger Things, I downloaded the scripts and assigned a numeric value to each word based on its positive or negative valence. The simplest aspect of emotion to look at is the average valence of each episode. This metric is obtained by taking the mean sentiment value of all words in an episode, regardless of the order in which they are spoken.

为了开始理解“陌生人事物”的情感,我下载了脚本并根据其正负价为每个单词分配了一个数值。 情绪最简单的方面是每个情节的平均化合价。 通过获取情节中所有单词的平均情感值来获得此度量标准,而不管其说出的顺序如何。

These values are generally what you might expect from a show about missing children and inter-dimensional monsters, with most episodes showing more negative than positive emotion.

这些值通常是您可能会从有关失踪儿童和多维怪物的节目中获得的,大多数情节表现出的负面情绪多于正面情绪。

The most interesting finding seems to be the degree to which the less-than-well-received Chicago episode — number 15 — stands out as being more than twice as negative as any other. This may be due to a combination of its dark plot and the lack of any comedic relief from the Hawkins gang.

最有趣的发现似乎是在不那么受欢迎的芝加哥情节(第15号)中,其负面影响程度是其他任何原因的两倍以上。 这可能是由于其阴暗的情节以及霍金斯团伙缺乏任何喜剧性的救济所致。

However, there is a lot of temporal information missing from those averages, so let’s take a look at how the emotional tone changes over time within episodes.

但是,这些平均值缺少很多时间信息,因此让我们看一下情节中情绪随时间的变化。

One way to do this is to use the sliding window technique. For this version of a sliding window, we’re going to take the average of the 40 words surrounding a central word, then continually shift one word over and take a new average. This yields a smooth trajectory of the emotional valence over the course of each episode.

一种方法是使用滑动窗口技术。 对于此版本的滑动窗口,我们将取围绕一个中心单词的40个单词的平均值,然后连续将一个单词移至另一个平均值上。 这在每个情节的过程中产生了一个平稳的情绪价轨迹。

While it’s difficult to learn much from simply looking at the trajectories, a few things do pop out.

虽然仅通过观察轨迹很难学到很多东西,但确实有一些事情发生了。

First, with only one exception (you guessed it, the Chicago episode), even the darkest episodes typically have a few scenes that are heavy on positive sentiment.

首先,只有一个例外(您猜对了,芝加哥情节),即使是最黑暗的情节,也通常会出现一些正面情绪沉重的场景。

Second, of the 17 episodes, only three end on a high note: the Season 1 finale, the Season 2 premiere, and the Season 2 finale.

其次,在这17集中,只有三集以高调结束:第1季大结局,第2季首演和第2季结局。

Third, there is a lot of variation in how these episodes are structured, and they don’t seem to follow a clear emotional pattern. Let’s roll with that last point and see if we can characterize some of that variation.

第三,这些情节的结构有很多差异,而且它们似乎并没有遵循明确的情感模式。 让我们继续讨论最后一点,看看我们是否可以表征某些变化。

As a baseline for comparison, we can check whether the season in which an episode appears contains enough information to explain similarities and differences across emotional trajectories.

作为比较的基准,我们可以检查某个情节出现的季节是否包含足够的信息来说明情绪轨迹之间的相似性和差异。

Unsurprisingly, it does not (p = .34). In general, both Season 1 and Season 2 have a great deal of variability in their episodes’ structures. The average trajectories both tend to hover around neutral.

毫不奇怪,事实并非如此( p = .34) 。 总的来说,第1季和第2季在情节结构上都有很大的可变性。 平均轨迹都倾向于在中性附近徘徊。

To find a better classification, let’s first conceptualize the relationships between episodes as a network by calculating the temporal correlation for every pair of episodes. In this context, the nodes are episodes and the edges represent the degree to which pairs show similar patterns of emotion.

为了找到更好的分类,我们首先通过计算每对情节的时间相关性,将情节之间的关系概念化为网络。 在这种情况下,节点是情节,边缘表示配对显示相似情绪模式的程度。

Once this network is constructed, we can apply methods borrowed from graph theory to find communities in our data. In this case, three distinct episode groups are revealed, and the within-group similarity is greater than would be expected to occur by chance (p < .001).

一旦建立了这个网络,我们就可以应用从图论那里借来的方法来在我们的数据中寻找社区。 在这种情况下,揭示了三个不同的情节组,并且组内相似性大于偶然发生的预期值( p < .0 .01)。

Now that we’ve found our communities of interest, let’s map them back onto the emotion trajectories to see if they capture any more of the variability.

现在我们已经找到了我们感兴趣的社区,让我们将它们映射回情感轨迹,以查看它们是否捕获了更多的可变性。

Unlike the division by season, these average group trajectories appear to describe three distinct patterns. They also seem to track fairly well onto their underlying episode trajectories.

与按季节划分不同,这些平均群体轨迹似乎描述了三种不同的模式。 他们似乎还可以很好地追踪其潜在的情节轨迹。

Looking at the average patterns, we can see that group 1 contains episodes that begin and end with neutral emotion and have slow fluctuations in the middle, group 2 contains episodes that begin with negative emotion and gradually climb towards a positive ending, and group 3 contains episodes that begin on a positive note and oscillate downwards towards a darker ending.

查看平均模式,我们可以看到第1组包含以中性情绪开始和结束并且中间缓慢波动的情节,第2组包含以消极情绪开始并逐渐向积极结局发展的情节,第3组包含情节以积极的音调开始,然后朝着一个黑暗的结局向下波动。

In addition to plotting the communities of emotion patterns, let’s take a look at the full network structure.

除了绘制情感模式的社区之外,我们来看一下完整的网络结构。

The first thing that jumps out is that each group contains an approximately equal number of episodes from Season 1 and Season 2. This supports the earlier finding that season is not a good predictor of episode similarity. We can also see that episode 15 again stands out from the rest. This time because it is more loosely connected to the rest of the graph than any of the other episodes.

跳出来的第一件事是,每个组包含第1季和第2季中大致相等的情节。这支持了较早的发现,即该季不是情节相似性的良好预测指标。 我们还可以看到第15集再次脱颖而出。 这次是因为与其他任何情节相比,它与图的其余部分的联系更为松散。

Perhaps most interestingly, the network reveals that episodes tend to be less like those preceding and proceeding them than you would expect to occur by chance(p = .03). Additionally, the transitions from episodes 1→2, 2→3, and 3→4 have three of the five largest shifts in emotional structure of the 16 transitions that occurred in the show.

也许最有趣的是,该网络揭示了情节倾向于不像您之前和进行的情节那样多,而与您偶然偶然发生的情节相比( p = .03) 。 此外,从剧集1→2、2→3和3→4进行的过渡,在演出中发生的16个过渡中,情感结构的五个最大变化中有三个。

Together, these results suggest that varying the emotional trajectory from episode to episode may be a strategy for getting viewers hooked.

总之,这些结果表明,在情节之间改变情感轨迹可能是吸引观众的一种策略。

Will future episodes continue to show these changes in emotional structure? Will they follow the same three dominant emotional trajectories? Will future attempts at creating boldly different episodes land better than Chicago? Chalk those up as more unanswered questions for Season 3.

未来的情节会继续表现出情绪结构的这些变化吗? 他们会遵循相同的三个主要情感轨迹吗? 将来尝试制作大胆不同的情节是否会比芝加哥更好? 将其作为第3季中更多未解决的问题 。

Miscellaneous observations [some spoilers ahead]

杂项观察[前面有一些破坏者]

  • Series emotional high point: Christmas at the Byers household in the Season 1 finale.系列情感高峰:第1季大结局中的Byers家庭过圣诞节。
What are you - What are you doing? [Jonathan] Documenting.Oh, why? - Because - [Joyce chuckles] - It looks great. - [Joyce] Oh, this is just so overcooked. - And look, the potatoes are runny.- [Jonathan] Mom. - [Joyce] They're so runny.- [Jonathan chuckles] Mom, it's gonna be great.
  • Series emotional low point #1: Chase scene to open Season 2.系列情感低点#1:追逐场景,开启第二季。
[Horns honking] Shit! Shit! Shit! Shit! Shit! [Cackles] [Exhales] Okay. Okay.[Police sirens wailing] - Son of a bitch! We got more! - [Mick] Oh, shit! They're headed down 7th.
  • Series emotional low point #2: Steve vs. Billy in the Season 2 finale.系列情感低点2:第2季大结局的史蒂夫(Steve)与比利(Billy)。
[Steve] Get out.[Dustin] Yes! Kick his ass, Steve! - [Mike] Get him! - [Dustin] Murder the son of a bitch! - [Dustin] Now! Now! - [Mike] Get that shithead! - [Dustin] Kill the son of a bitch! - [Lucas] Steve! - [Max] Billy! - [Mike] Holy shit! Shit!
  • Honorable mention for the best scene that could not be counted because of the lack of dialogue: Billy in the mirror.荣誉奖:由于缺乏对话而无法计数的最佳场景:镜中的比利。
[Billy preens]

For those interested, code for this project is publicly available here.

对于那些感兴趣的人,此项目的代码可在此处公开获得。

翻译自: https://www.freecodecamp.org/news/a-statistical-curiosity-voyage-through-the-emotion-of-stranger-things-e7bc8b2a6395/

航行金税盘

航行金税盘_通过陌生事物的情感进行统计好奇心航行相关推荐

  1. 航行金税盘_使用Codeship在平行海中航行

    航行金税盘 This article was sponsored by Codeship. Thank you for supporting the sponsors who make SitePoi ...

  2. 从金蝶k3到金税盘_金蝶K3操作流程图详解

    目录 一.流程图符号说明: ______________________________________________ 3 二. K/3 系统基础操作流程图: ___________________ ...

  3. 从金蝶k3到金税盘_经典全套金蝶K3操作流程大全

    -结帐-期末结帐 注意点: 不能结帐的原因: (1)有未过帐的凭证(2)无权限(3)其他子系统未结帐(4)与其他用户冲突 八.套打 1.套打格式 凭证:*上海TR101记帐凭证 上海TR102收款凭证 ...

  4. 从金蝶k3到金税盘_请问金碟K3财务软件具体操作流程?

    一.新建帐套 操作流程:服务器电脑--开始--程序--K3中间层--帐套管理-新建按钮(白纸按钮) 注意点:帐套号.帐套名不能重复但可修改,但帐套类型不可更改 二.设置参数和启用帐套 操作流程:服务器 ...

  5. csv格式用什么打开可以编辑_如何用EXCEL/WPS整理航信版(金税盘/白盘)客户(商品)编码表...

    企业开票人员在使用金税盘开票软件,客户编码保存或者自动保存的越来越多就很混乱,那么如何快速的把客户编码整理好呢? 本文就以WPS为例教大家整理客户编码. 1. 导出客户编码 依次点击:系统设置 - 客 ...

  6. 金税盘怎么安装在电脑上_金税盘怎么安装,电脑重装后怎样安装金税盘税控开票软件?...

    金税盘是税务部门为了报税方便而进化出来的一个系统.通过金税盘可以开票.抄税.清卡.领购发票.金税盘使用增值税防伪税控系统,可开具增值税专用发票和增值税普通发票,一般的企业或个体商铺纳税人需要用到.第一 ...

  7. 金税盘怎么安装在电脑上_金税盘处于锁死期不能开票怎么办?

    金税盘俗称白盘是从国税局购买,配合开票软件,进行开具发票的一种税控设备.企业使用金税盘开具发票的时候,如果出现某些操作或使用上的问题,就会无法开具发票,比如,金税盘处于锁死期,那遇到这种情况怎么办?下 ...

  8. 原创-VBA金税盘开票XML生成

    '2017-11-24 20:05 乔治 '原创代码,转载请注明出处 Option Explicit Public Type SE     S As Integer     E As Integer ...

  9. 金税盘 USB无法识别的解决方法、金税盘插上电脑没反应、航天信息

    新买了一台华硕顽石笔记本,将金税盘连接电脑,金税盘亮绿灯,但是电脑一点反应也没有,设备管理器也找不到这个设备(拔插金税盘时,没有叮咚的提示音,设备管理器也没有刷新). 但是,将金税盘接到其他电脑,可以 ...

最新文章

  1. Spring Cloud 微服务实战笔记
  2. kvm_guest主机克隆
  3. 下拉插件dropload js时间计算(几天前)
  4. 15分钟从零开始搭建支持10w+用户的生产环境(二)
  5. javafx 值对_我今天对JavaFX的了解
  6. linux内核的裁剪和移植实验,实验5 linux内核的裁剪移植.doc
  7. 《感谢自己的不完美》----经典语录
  8. android 中如何监听按键的长按事件
  9. 分析器错误 分析器错误信息: 类型“Websystem.Global”不明确: 它可能来自程序集...的解决...
  10. Win7 64位 IIS未能加载文件或程序集“System.Data.SQLite”或它的某一个依赖项
  11. 创基为您介绍目前市面上的主流蓝牙方案
  12. 如何提高你的工作效率
  13. MATLAB做晶体结构图(固体物理)
  14. 【OpenCV】 实战 银行卡卡号读取
  15. 片段音乐歌词生成工具
  16. excel批量将数字转为文本格式
  17. 混血网站诞生 公司相互嫁接成就新商业模式
  18. Windows Server - 如何查看机器的开机/关机时间
  19. element ui table表格合计移到顶部
  20. 网络协议之一(ICMP)

热门文章

  1. 设置让终端保持utf8 cmd的设置
  2. 如何使用idea生成javaDoc文档
  3. 演练 网站的头部导航栏的制作 1014 HTML
  4. java演练 类与对象 给我一个数字还你一个字符串
  5. 网页浏览器 市面上存在的网页浏览器
  6. css3 2D动画效果 200303
  7. 数据结构与算法-原始版-a+b+c=1000并且a方+b方=c方
  8. xtrabackup备份还原
  9. 前端集成weex,你需要学习的objective-c基础
  10. centos 下端口开放设置