【中国图形图像学报】面向COVID-19疫情预测的图卷积神经网络时空数据学习

摘 要
当前的疾病传播研究主要集中于时序数据和传染病模型,缺乏运用空间信息提升预测精度的探索和解释。在处理时空数据时需要分别提取时间特征和空间特征,再进行特征融合得到较为可靠的预测结果。本文提出一种基于图卷积神经网络(graph convolutional neural network,GCN)的时空数据学习方法,能够运用空间模型端对端地学习时空数据,代替此前由多模块单元相集成的模式。方法 依据数据可视化阶段呈现出的地理空间、高铁线路、飞机航线与感染人数之间的正相关关系,将中国各城市之间的空间分布关系和交通连接关系映射成网络图并编码成地理邻接矩阵、高铁线路直达矩阵、飞机航线直达矩阵以及飞机航线或高铁线路直达矩阵。按滑动时间窗口对疫情数据进行切片后形成张量,依次分批输入到图深度学习模型中参与卷积运算,通过信息传递、反向传播和梯度下降更新可训练参数。结果 在新型冠状病毒肺炎疫情数据集上的实验结果显示,采用GCN学习这一时空数据的分布特征相较于循环神经网络模型,在训练过程中表现出了更强的拟合能力,在训练时间层面节约75%以上的运算成本,在两类损失函数下的平均测试集损失能够下降80%左右。结论 本文所采用的时空数据学习方法具有较低的运算成本和较高的预测精度,尤其在空间特征强于时间特征的时空数据中有着更好的性能,并且为流行病传播范围和感染人数的预测提供了新的方法和思路,有助于相关部门在公共卫生事件中制定应对措施和疾病防控决策。
关键词
深度学习 图卷积神经网络(GCN) 时空数据处理 新冠肺炎 疫情预测

Spatiotemporal data learning of graph convolutional neural network for epidemic prediction of COVID-19

Yang Chengyi,Liu Feng,Qi Jiayin,Duan Yan,Lyu Runqian,Xiao Zilong(Institute of Artificial Intelligence and Change Management, Shanghai University of International Business and Economics, Shanghai 200336, China;School of Statistics and Information, Shanghai University of International Business and Economics, Shanghai 201620, China;Institute of Artificial Intelligence and Change Management, Shanghai University of International Business and Economics, Shanghai 200336, China;School of Computer Science and Technology, East China Normal University, Shanghai 200062, China;School of Data and Computer Science, Sun Yat-sen University, Guangzhou, 510006, China)

Abstract
COVID-19 has caused a severe impact on the medical system and economic growth of countries all over the world. Therefore, the epidemic information of each city has important reference value for governments and enterprises to formulate public health prevention and control measures and decisions in opening the economy. According to relevant research, the infectious disease model and time series model have played an important role in finding potential hosts, confirming human to human transmission, and estimating the basic reproductive number. Related research methods on disease transmission and confirmed case prediction have experienced a series of evolution, including demographic method, dynamic model, social network analysis, flight passenger volume estimation, data mining, and machine learning method. However, the prediction accuracy of these methods still needs to be improved by using spatial information in the study of epidemic transmission. In recent years, the boom of graph deep learning has provided new technologies and methods for the estimation of epidemic spread. From the iteration of trainable parameters in the way of information interaction in early time to the optimization of graph type, propagation mechanism, and output steps, this development process laid the foundation for the generation of graph convolutional neural network(GCN). The development of graph convolution network optimizes the performance of graph neural network in spectral domain and spatial domain by changing convolution kernel and information aggregation mode. The progress of representation learning improves the convenience of graph data processing. The rise of integrated framework realizes more accurate prediction in spatiotemporal data processing represented by traffic flow. Method Compared with traffic flow prediction, epidemic data prediction has stronger spatial attribute and weaker temporal attribute, which is the reason why GCN is used alone instead of integration approaches. First, according to the data visualization stage of the epidemic information and geographical location and traffic network between the positive correlation, the spatial distribution relationship and traffic connection relationship between affected cities in China are mapped into a graph network and encoded into geographic adjacency matrix, high-speed railway direct matrix, aircraft route direct matrix, and aircraft route or high-speed railway direct matrix. Four cities networks with different connection modes are formed by these four adjacent matrices, and the corresponding GCN model is constructed based on these cities networks, including geographical proximity graph convolutional neural network(GPGCN), airline graph convolutional neural network(ALGCN), high speed railway graph convolutional neural network(HSRGCN), airline and high speed railway graph convolutional neural network(ALHSRGCN). After dividing the training, validation, and test sets at a ratio of 6:2:2, the epidemic data were sliced according to the sliding time window to form a three-dimensional tensor with a size of 30×327×7 as the test set, which was input into the graph deep learning model in batches to participate in convolution operation. The training parameters were updated by information transmission, back propagation, and gradient descent. Result The experimental results on the COVID-19 dataset demonstrated that the learning distribution features of spatiotemporal data by the GCN model showed stronger fitting ability than the recurrent neural network model in the training process. It could save more than 75% of the computation cost at the training time level, and the average test set loss on mean absolute error(MAE) and mean square error(MSE) decreased by about 80%. The value of loss would converge to a lower position and achieve a more stable training process so as to obtain more accurate prediction when MSE is chosen as the loss function. Conclusion The spatiotemporal data learning method in this study has lower operation cost and higher prediction accuracy, which shows better performance especially in the case of spatiotemporal data with stronger spatial characteristics than temporal characteristics. It provides new approaches and ideas for the prediction of epidemic spread range and number of infected people, which is conducive for relevant departments to formulate countermeasures for disease prevention and control decisions in public health events.
Keywords
deep learning graph convolutional neural network(GCN) spatiotemporal data processing Coronavirus Disease 2019(COVID-19) epidemic prediction

全文点我跳转链接

【中国图形图像学报】面向COVID-19疫情预测的图卷积神经网络时空数据学习——CCF B、CSCD相关推荐

  1. 【论文阅读】Learning Traffic as Images: A Deep Convolutional ... [将交通作为图像学习: 用于大规模交通网络速度预测的深度卷积神经网络](1)

    [论文阅读]Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation ...

  2. 面向睡眠阶段分类的自适应时空图卷积神经网络

    ©PaperWeekly 原创 · 作者|宁晓军 学校|北京交通大学硕士生 研究方向|时间序列分析与挖掘:图神经网络 本文介绍一篇北京交通大学网络科学与智能系统研究所贾子钰博士等人,于 IJCAI 2 ...

  3. 【论文阅读】Learning Traffic as Images: A Deep Convolutional ... [将交通作为图像学习: 用于大规模交通网络速度预测的深度卷积神经网络](2)

    [论文阅读]Learning Traffic as Images: A Deep Convolutional Neural Network for Large-Scale Transportation ...

  4. 基于图卷积神经网络的微博疫情情感分析

    一.前言 参考论文:Graph Convolutional Networks for Text Classification 官方Github源码:text_gcn 关于微博疫情情感分析,博主之前有过 ...

  5. 2018年中国视频监控行业现状及行业发展趋势分析预测【图】

    一.中国视频监控行业现状 中国 2013-2018 年模拟标清视频监控摄像机和模拟高清视频监控摄像机的复合增长率分别为-15.2%. 29.6%. 模拟标清视频监控摄像机需求量不断下降, 预计 201 ...

  6. 视频教程-图形图像-无中生有Photoshop CS6背景素材技法ps教程 [超多案例]-其他

    图形图像-无中生有Photoshop CS6背景素材技法ps教程 [超多案例] 全栈工程师,曾担任多家互联网公司技术总监.运营经理,在技术与运营方面积累了大量实战经验,同时也是清华大学出版社特约作家, ...

  7. 【论文翻译】HCGN:面向集体分类的异构图卷积网络深度学习模型

    HCGN:面向集体分类的异构图卷积网络深度学习模型 摘要 集合分类是研究网络数据的一项重要技术,旨在利用一组具有复杂依赖关系的互联实体的标签自相关性.随着各种异构信息网络的出现,集合分类目前正面临着来 ...

  8. 京东城市时空数据引擎JUST亮相中国数据库技术大会

    受疫情影响,第十一届中国数据库技术大会(DTCC 2020)从原定的5月份,推迟到了8月份,再推迟到了12月份.尽管如此,依然没有减退国人对数据库技术的热情.2020年12月21日-12月23日,北京 ...

  9. 京东城市时空数据引擎JUST亮相中国数据库技术大会(附PPT链接)

    受疫情影响,第十一届中国数据库技术大会(DTCC2020)从原定的5月份,推迟到了8月份,再推迟到了12月份.尽管如此,依然没有减退国人对数据库技术的热情.2020年12月21日-12月23日,北京国 ...

最新文章

  1. java实现最长连续子序列_最长公共子序列 ||
  2. 如何用TensorFlow在安卓设备上实现深度学习推断
  3. DevExpress的TreeList怎样设置数据源使其显示成单列树形结构
  4. 多表连接的三种方式详解 HASH JOIN MERGE JOIN NESTED LOOP
  5. 为特使建立控制平面的指南-部署权衡
  6. 深度学习基础实战使用MNIST数据集对图片分类
  7. POSIX互斥锁api函数
  8. amd核芯显卡控制面板自定义分辨率_AMD翻盘,NVIDIA翻车,你的下一张光追显卡选哪个...
  9. Linux内核第六节 20135332武西垚
  10. 拓端tecdat|虎扑社区论坛数据爬虫分析报告
  11. 产品研发项目管理软件哪个好?
  12. 前端安全XSS,CSRF
  13. 终端conda install安装包 import 错误_Tecplot 360 2017软件安装包+安装教程
  14. 【嵌入式开发】SIP信令交互总结(1)
  15. 【知识】SpringBoot项目结构目录
  16. python创建窗体_python生成窗口
  17. 如何检测摄像头是否被占用
  18. 单片机入门教程:第七章 1602LCD液晶显示模块
  19. Linkage Mapper工具包:ArcGIS中打造连接分析利器的详细指南
  20. Uos统信系统本地apt及基础网络,主机名时区配置

热门文章

  1. mac系统数据恢复软件|易我数据恢复Mac版下载
  2. HTML——2022年超好看的元旦倒计时
  3. 2012国信蓝点杯全国软件设计大赛编程大题
  4. 国信蓝点2011c本科概率问题
  5. 利用java的网络编程实现一个小游戏(类似于石头剪刀布)
  6. 一个通过nbtscan收集局域网MAC地址及IP地址的脚本
  7. nbtscan 扫描WINDOWS网络NetBIOS信息软件
  8. 解决wps工作簿从文本中复制粘贴时单元格中首位0无法显示
  9. 辽宁IBM服务器金牌代理商
  10. 20210126 nlp中文分词库