全文:

  • 2020美赛F奖论文(一):摘要、绪论和模型准备
  • 2020美赛F奖论文(二):传球网络模型(PNM)的建立和影响因子分析
  • 2020美赛F奖论文(三):足球团队指标和基于机器学习的球队表现预测
  • 2020美赛F奖论文(四):模拟退火算法驱动的结构策略设计
  • 2020美赛F奖论文(五):结合团队动力学的模型拓展、模型评价

Soccer Teamwork Evaluation Models

足球团队合作评价模型

  • 2020MCM-ICM ProblemD
  • Finalist 方案

2020年美国大学生数学建模竞赛ICM-D题 特等奖提名

GitHub仓库

Summary

  • This paper proposes a method, with graph theory, probability theory and calculus, to build machine learning models based on data analysis, which aims at providing strategies for soccer coach’s lineup arrangement and players’ training.

本文利用图论,概率论和微积分的方法,利用数据分析和建立机器学习模型,为足球教练的阵容安排和球员训练提供策略。

  • Firstly, the Pass Network Model can be established according to the graph theory, whose edge-weights are evaluation of coordination degree of each dyadic configurations. Pass Evaluate Index is designed for evaluate a single pass, and the summation of each pass can be defined as the edge-weights of PNM. For analysis, the adjacency matrix of N participating players within a period. Several outstanding M configurations can be found by the sort of M-element combination with the key of the sum of the sub-complete graph edge weights. What’s more, investigation of the influence of time on pass density depends on the constructed and approximate function of time and pass.

Firstly,根据图论,在球员之间建立传球网络,并建立单次传球的价值评价模型,用于评价两两球员间传球的配合程度,即传球网络的边权。建立在一定时间范围内所有参与比赛的N个球员的邻接矩阵,通过以M个点的子完全图边权之和为排序关键字找出若干组优秀的M元组合。同时建立基于时间尺度的价值模型,用于评价时间对传球效率的影响。

  • Secondly, performance indicators that reflect successful teamwork can be divided into dynamic indicators and static indicators. Static indicators include player position arrangement and line-up with which player season heatmap models and player position models can be established while the dynamic indicators include opponents’ strength, side, coach, passes, defense, attack and fail. etc. After visualized analysis of the correlation between the dynamic indicators extracted after data cleaning, and with the setting label by the goal difference, the random forest classifier, a machine learning model, is used as a evaluation model of dynamic indicators. After the Grid Search used for tuning parameters, and cross-validation, the accuracy of the model achieving 80% approximately.

Secondly,我们将反映成功团队合作的绩效指标划分为静态指标和动态指标。静态指标包括球员位置安排和球队阵型(line-up),我们建立球员赛季热点模型和球员分布模型。动态指标包括opponents,side,coach,passes,defence,attack and fail等。对经过数据清洗动态指标之间通过可视化进行相关性分析后,以净胜球分类作为比赛样本标签,以随机森林分类器作为机器学习的模型,用网格搜索调优参数,建立动态指标评价模型,进行交叉验证,达到了80%的准确率。

  • Thirdly, the study focuses on the role of static indicators in the performance of the team and establishes different players’ value evaluation models in different positions which comprehensively consider the player’s positions and technical statistical data evaluation. To optimize the value of 11-person permutation, we choose simulated annealing (SA) algorithm which searches the global optimal solution in cousin points in the same minimized search tree after the local optimal solution has attained. The model finally gave the best starting lineup formation. In addition, we also consider the following three secondary factors: tacit understanding between players, home and away influence, and coaching arrangements. All analysis above can be concluded as comprehensive suggestion to the coach.

Thirdly,通过上述中建立的模型进行观察分析,我们着重研究静态指标对球队的胜利起到的关键作用,综合考虑球员位置和技术数据评价模型,建立不同球员在不同位置价值评价模型。通过模拟退火算法,优化11人排列组合的考虑,在局部最优解的父级搜索树进行搜索全局最优解,最终给出价值最优的首发阵容阵型图。此外我们还考虑以下三个次要影响因素:球员间默契度,主客场影响和教练安排。给教练提出的综合建议。

  • Finally, we use the case of the Huskies to explain group dynamics. And use the conclusions obtained by the Huskies to build a model to explain how to design a more effective team and supplement the team performance indicators.

Finally,我们用哈士奇球队的案例来解释群体动力学。并用哈士奇球队建立模型得到的结论来说明如何设计更有效的团队,并对团队绩效指标进行补充。

Key words: Network; Graph theory; Calculus; Machine learning; Random forest classifier; Simulated annealing; Heat map; Group dynamics

0 Content

1 Introduction 3

  • 1.1 Background 3
  • 1.2 Problem Restatement 3

2 Preparation of the Models 3

  • 2.1 Processing Tools 3
  • 2.2 Data Cleaning 4

3 Establishment of PNM and Analysis of Influence Factors 4

  • 3.1 Pass Evaluation Index (PEI) 4
  • 3.2 Pass Network Model (PNM) and Recognition of Network Pattern 6
  • 3.3 Fluctuation of Passing State at The Time 6

4 Soccer Team Indexes and Performance Prediction Based on ML 7

  • 4.1 Static Index (SI) 8
  • 4.2 Dynamic Index (DI) 9
    • 4.2.1 Data Cleaning and Feature Engineering 9
    • 4.2.2 Visualization Analysis 9
  • 4.2.3 RFC Establishment, Optimization, and Training 12

5 Design of Structural Strategies Driven by SA 13

  • 5.1 Position Evaluation Engineering (PEE) 13
  • 5.2 Optimization of Permutation and Combination Based on SA Algorithm 14
  • 5.3 Other Structural Strategy Factors 15
  • 5.4 Structural Strategy Conclusion 16

6 Model Extension Combined with Group Dynamics 16

  • 6.1 Group and Soccer Team 17

    • 6.1.1 Group Cohesiveness 17
    • 6.1.2 Group Standard and Group Pressure 17
    • 6.1.3 Individual Motivation and Group Goals 17
    • 6.1.4 Leadership and Group Performance 18
    • 6.1.5 Group Structure 18
  • 6.2 Other influence factor of successful teamwork 18

7 Evaluation 18

  • 7.1 Strength 18
  • 7.2 Weakness 19

8 Reference 19

0 目录

1 绪论 3

  • 1.1 背景 3
  • 1.2 问题重述 3

2 模型准备 3

  • 2.1 预处理工具 3
  • 2.2 数据清洗 4

3 传球网络模型(PNM)的建立和影响因子分析 4

  • 3.1 传球评价指标 (PEI) 4
  • 3.2 传球网络模型(PNM)构建及识别网络模式 6
  • 3.3 时间尺度上传球状态波动 6

4 足球团队指标和基于机器学习的球队表现预测 7

  • 4.1 静态指标 (SI) 8
  • 4.2 动态指标 (DI) 9
    • 4.2.1 数据清洗和特征工程 9
    • 4.2.2 可视化分析 9
  • 4.2.3 随机森立分类器模型的建立、参数调优和训练 12

5 模拟退火算法驱动的结构策略设计 13

  • 5.1 位置评价工程(PEE) 13
  • 5.2 基于SA算法优化排列组合 14
  • 5.3 其他结构策略因素 15
  • 5.4 结构性策略总结 16

6 结合团队动力学的模型拓展 16

  • 6.1 团体动力学和足球队 17

    • 6.1.1 群体内聚力 17
    • 6.1.2 群体标准和群体压力 17
    • 6.1.3 个人动机和群体目标 17
    • 6.1.4 领导与群体性能 18
    • 6.1.5 群体的结构性 18
  • 6.2 成功团队合作其他影响因素 18

7 评价 18

  • 7.1 优势 18
  • 7.2 缺陷 19

8 参考文献 19

1 绪论 Introduction

1.1 背景 Background

Football has a long history. It has been loved all over the world since it was popularized. Football can be considered as the most popular sports in the world. Football, a seemingly simple sport, contains the secrets of individual ability and team cooperation. With the development of the times and the progress of science and technology, football players and coaches continue to improve in skills, showing the audience wonderful matches. As we all know, a wonderful football match is inseparable from the contributions of players and teams. By studying the actions of everyone in the team, coordinating the team relationship, reasonably arranging the minutes and line-up, we can score best.

1.2 问题重述 Problem Restatement

Football is a sport suitable for all ages. Since its inclusion in international tournaments, people have created a variety of methods to evaluate the team dynamics throughout the match and over the entire season to help determine specific strategies that can improve teamwork next season. We need to use the data provided by the ICM team to build a model to solve the following four problems.

足球赛是一项老少皆宜的运动,自从其纳入国际赛事以来,人们就创造出各种各样的方法来评价整个比赛和整个赛季的团队动态,来帮助确定下个赛季可以改善团队合作的具体策略。我们需要使用ICM团队提供的数据建立模型来解决以下四个问题。

  1. Consider each player as a node and create a passing network to identify dyadic, triadic and multiple configurations. We need to establish a value evaluation model of a single pass and a general evaluation model of the passing of the time structure index under the passing network.
  2. To Identify performance indicators that reflect successful teamwork, we need to consider static and dynamic indicators. Establish a model of the impact of each performance indicator on successful teamwork, and use one model to encompass these four sub-models.
  3. By observing and analyzing the model established in Questions 1 and 2, tell the coach that which form of structural strategy is applicable to the Huskies. Using the results of the model analysis to make suggestions for the coach to improve the team’s success rate next season.
  4. Use the case of the Huskies to explain the theory of group dynamics, and use the conclusion of the model established by the Huskies to explain how to design a more effective team, and supplement the team performance indicators.
  1. 将每一个球员当做一个节点,创建传球网络来识别二元配置,三元配置和 多元配置。我们需要建立在传球网络下,单次传球的价值评价模型,以及时间结构指标的传球总数评价模型。
  2. 确定反映成功团队合作的绩效指标,我们需要考虑静态指标和动态指标。建立每个绩效指标对成功团队合作影响的模型,并用一个模型来囊括这四个子模型。
  3. 通过对问题1,2中建立的模型的观察分析,告诉教练什么样的结构策略适用于哈士奇球队。用模型分析的结果为教练提高球队的下个赛季的成功率给出建议。
  4. 用哈士奇球队的案例来解释群体动力学理论,用哈士奇球队建立模型得到的结论来说明如何设计更有效的团队,并对团队绩效指标进行补充。

2 模型准备 Preparation of the Models

2.1 预处理工具 Processing Tools

Tool Uses
Visual Studio Code 1.42 Coding, Visualization
IPython 3.6.8 Run Code
Visio Design Flowchart
Excel Arrange Dataset
GitHub Synchronization, Storing
MindMaster Plot Mind Map

2.2 数据清洗 Data Cleaning

若空白则为上一个相同

Data Name Processing Type Feature Name
Side Map + Dummy Side_1, Side_0
Coach Dummy Coach_1, Coach_2, Coach_3
Opponent Strength Analysis Oppo
Shots Count Attack
Dribbles
Touch
Corner
Offside
Tackle Count Defence
Dispossess
Aerial Won
Interception
Clearance
Blocks
Saves
Passes Count Pass
Possession Search + Integrate
Pass Success Calculate
Foul Count Fail
Loss of Possession Search + Count

后接:2020美赛F奖论文(二):传球网络模型(PNM)的建立和影响因子分析
全文:

  • 2020美赛F奖论文(一):摘要、绪论和模型准备
  • 2020美赛F奖论文(二):传球网络模型(PNM)的建立和影响因子分析
  • 2020美赛F奖论文(三):足球团队指标和基于机器学习的球队表现预测
  • 2020美赛F奖论文(四):模拟退火算法驱动的结构策略设计
  • 2020美赛F奖论文(五):结合团队动力学的模型拓展、模型评价

2020美赛F奖论文(一):摘要、绪论和模型准备相关推荐

  1. 2020美赛F奖论文(四):模拟退火算法驱动的结构策略设计

    上接:2020美赛F奖论文(三):足球团队指标和基于机器学习的球队表现预测 全文: 2020美赛F奖论文(一):摘要.绪论和模型准备 2020美赛F奖论文(二):传球网络模型(PNM)的建立和影响因子 ...

  2. 数学建模美赛O奖论文研读启示录——从模仿开始

    美赛O奖论文研读启示录

  3. [美赛F奖][数学建模][经验贴]2021美赛F奖的那些事

    写在前面 2021美赛都过去半年了,一直也在忙各种各样的事情,刚好上学期有一门项目管理的课程,课程论文写的就是美赛经验,偷个懒直接改下排版复制上来,以作留念 贴个奖状~~ 一.引言 项目是一个组织为实 ...

  4. 2018年美赛O奖论文

    蓝奏云:https://www.lanzous.com/i2wpahg 原网址:https://download.csdn.net/download/csdngauss/10616370

  5. 【2020数模F奖】 美赛C题参赛感受及做题思路记录【编程手的角度,含大量代码及参考链接】

    目录 写在前面的话 题目分析 [数据清洗] [NLTK] [第一题] [第2题e问] [词云]---wordcloud包 [TF-IDF算法] [第2题a.b.c问]需要先对评论数值化 [Textbl ...

  6. 2023年美赛论文写作方法——图表篇:美赛O奖中那些好看的图表是如何制作的?

    思路:永久更新,全网最新最全,持续更新中,查看最下方QQ群获取. 2023年美赛论文写作方法--图表篇:美赛O奖中那些好看的图表是如何制作的? 相信很多关注七七的小伙伴们都知道数模论文最重要的是:简洁 ...

  7. 2021美赛F题解题思路

    新队伍,大家都差不多是小白,借鉴的博客:(19条消息) 2021年美赛F题总结_wzu_cza123的博客-CSDN博客_美赛2021f题 一.数据的查找和处理 二.TOPSIS 1.TOPSIS熵权 ...

  8. 2021年美赛F题总结

    2021年美赛F题总结 肝到了早上六点20分才算是把F题的论文交上去了呜呜,最后把论文发给官方的时候3个人紧张死了,检查了7,8遍就怕出一点错,官方不接收我们的文章,那个点已经神志不清了,又在官网不停 ...

  9. 2020美赛C题:python实现npl自然语言处理记录

    2020美赛C题:python实现npl自然语言处理记录 前言 文本预处理 LDA主题分析加可视化 多进程程序需写进main函数 可视化 NLTK情感分析 制作语料包 情感积极性量化 一些收获 pyt ...

  10. 2023美赛F题全部代码+数据+结果 数学建模

    2023年美赛F题全部思路 数据代码都已完成 全部内容见链接:https://www.jdmm.cc/file/2708700/ 1.根据文献选的GGDP的指标,发现GGDP与水资源等有关,由此可以筛 ...

最新文章

  1. 日常运维管理技巧十六(iftop网卡流量监控工具)(转载)
  2. SpringBoot | 第九章:Mybatis-plus的集成和使用
  3. 自己写一个轻量的JqueryGrid组件
  4. POJ3133(插头dp)
  5. 获取生产订单的系统状态
  6. jq设置保留两位小数_如何实现python中format函数保留指定位数的小数?
  7. labview自动生成html,使用LabVIEW实现网页数据提取及交互.pptx
  8. html标记ruby,html5 ruby标签的定义及使用方法详解(内有实例介绍)
  9. guid主分区表损坏_固态硬盘用mbr还是guid
  10. 华为云生态2020年政策FAQ(一)
  11. 伺服电机转矩常数的标定方法
  12. 编辑器拓展 CustomEditor
  13. U盘制作ubuntu18.04.6系统安装盘
  14. b区计算机复试国家线,今年调剂太恐怖 B区考研分数线竟比A区高?
  15. 阿里云建站之模板建站的核心优势有哪些?
  16. 【azkaban】开启进程秒退
  17. 《让子弹飞》系列——张麻子的斗争策略
  18. rar文件解压后可以删除吗?rar文件删除后怎么恢复?
  19. 如何查看Outlook搜索出的邮件所在的文件夹
  20. 用 Python 简单做个 动态模拟太阳系运转 吧

热门文章

  1. ddrelease64 黑苹果_GitHub - wangtufly/Precision5510-High-Sierra: DELL Precision5510 10.13.X 黑苹果教程...
  2. python缠论代码_缠论dll(czsc - 缠中说禅技术分析工具)
  3. c语言常用函数大全超详细
  4. font-family:微软雅黑;与font-family:Microsoft YaHei;的区别?
  5. 查看tomcat目前用的jdk版本
  6. CAD导入arcgisMap进行shp导出异常现象
  7. 使用软件测试路由器性能报告,小米路由器网络性能初步测试报告
  8. 《鸟哥的Linux私房菜》简评
  9. 基于SSH的宠物管理系统
  10. 你相信这是XP经典桌面拍摄地现在的模样吗?