[MLA Day-One Talk Summaries] Zhou Zhihua, Ma Yi, and Other Professors Share the Latest Advances in Machine Learning

Source: Zhuanzhi (专知)

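Summary: The 15th China Workshop on Machine Learning and Applications (MLA) took place today, November 4, at Beijing Jiaotong University, where more than ten experts in machine learning and related fields from China and abroad gathered for academic exchange.

The program included invited talks, top-conference paper exchanges, and a Top Conference Review session.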

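1. A First Look at Deep Forest

Professor Zhou Zhihua of Nanjing University, author of the machine learning "watermelon book", presented his latest research in ensemble learning: deep forest, an alternative to deep neural networks.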

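Figure: Illustration of the cascade forest structure. Each level of the cascade consists of two random forests (shown in blue) and two completely-random tree forests (shown in black). Suppose there are three classes to predict; each forest then outputs a three-dimensional class vector, and these vectors are concatenated to re-represent the original input.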
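The re-representation step described in the caption can be sketched in a few lines. This is only an illustration: the function name `cascade_level_output`, the class-vector values, and the four-dimensional input are hypothetical, and the forests themselves are not trained here. The sketch assumes the class vectors are concatenated with the original feature vector before being passed to the next level.

```python
def cascade_level_output(original_features, class_vectors):
    """Concatenate the per-forest class vectors with the original features.

    original_features: list of floats (the raw input vector).
    class_vectors: one class-probability vector per forest in the level.
    """
    augmented = []
    for vec in class_vectors:
        augmented.extend(vec)  # append each forest's class vector in turn
    # The augmented vector is what the next cascade level would receive.
    return augmented + list(original_features)


# Hypothetical outputs of four forests for a three-class problem.
class_vectors = [
    [0.7, 0.2, 0.1],    # random forest 1
    [0.6, 0.3, 0.1],    # random forest 2
    [0.5, 0.25, 0.25],  # completely-random tree forest 1
    [0.4, 0.4, 0.2],    # completely-random tree forest 2
]
original_features = [5.1, 3.5, 1.4, 0.2]  # hypothetical raw input

new_representation = cascade_level_output(original_features, class_vectors)
print(len(new_representation))  # 4 forests * 3 classes + 4 raw features
```

With four forests and three classes, each level adds 12 dimensions to the input it passes on.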

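Figure: Overall architecture of gcForest.

gcForest has achieved very good classification results on specific datasets for image classification, face recognition, music classification, sentiment classification, and other tasks, and was described as the best-performing method that is not a deep neural network. gcForest is only a starting point for deep forests; many possibilities and application scenarios remain to be explored.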

2. Latent tree analysis

Talk by Professor Zhang Lianwen of the Hong Kong University of Science and Technology.

Latent tree analysis seeks to model the correlations among a set of random variables using a tree of latent variables. It was proposed as an improvement to latent class analysis — a method widely used in social sciences and medicine to identify homogeneous subgroups in a population. It provides new and fruitful perspectives on a number of machine learning areas, including cluster analysis, topic detection, and deep probabilistic modeling. In this talk, I will give an overview of the research on latent tree analysis and various ways it is used in practice.

3. Graph Refinement

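Talk by Professor Zhang Zhenyue of Zhejiang University.

The effectiveness of data clustering methods depends heavily on the class structure embedded in the dissimilarity or similarity graph matrix. Owing to many factors, the class structure of a graph matrix, or of the high-dimensional data itself, is usually rather blurred, even when the graph matrix is built from local neighborhood points. In clustering multi-source, multi-scale data, the ambiguity or inconsistency of that class structure is even more pronounced. In this talk, we consider how to refine a given graph matrix and strengthen its class structure from three angles: (1) recovering the intrinsic consistent graph matrix by modeling the view distortion of multi-source data and the form of the graph matrices; (2) constructing a consistent sparse graph matrix from sparse neighborhood representations of multi-source data; (3) refining a single-source graph matrix through sparse low-rank approximation. We justify the soundness and effectiveness of these graph refinement methods in terms of theoretical foundations, model construction, algorithm design, and numerical experiments.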

4. Low-dimensional Structures and Deep Models for High-dimensional (Visual) Data

Talk by Professor Ma Yi of the University of California, Berkeley.

We discuss a class of models and techniques that can effectively model and extract rich low-dimensional structures in high-dimensional data such as images and videos, despite nonlinear transformation, gross corruption, or severely compressed measurements. This work leverages recent advancements in convex optimization from Compressive Sensing for recovering low-rank or sparse signals that provide both strong theoretical guarantees and efficient and scalable algorithms for solving such high-dimensional combinatorial problems. We illustrate how these new mathematical models and tools could bring disruptive changes to solutions to many challenging tasks in computer vision, image processing, and pattern recognition. We will also illustrate some emerging applications of these tools to other data types such as 3D range data, web documents, image tags, bioinformatics data, audio/music analysis, etc. Throughout the talk, we will discuss strong connections of algorithms from Compressive Sensing with other popular data-driven models such as Deep Neural Networks, providing some new perspectives to understand Deep Learning.
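As an illustration of the kind of convex program the abstract refers to, one widely known example is principal component pursuit, which separates an observed matrix into a low-rank part and a sparse part. It is given here as a representative formulation, not necessarily the one presented in the talk; the symbols M (observed data), L (low-rank component), S (sparse corruption), and the weight λ are introduced only for this sketch.

```latex
\min_{L,\,S} \; \|L\|_{*} + \lambda \,\|S\|_{1}
\quad \text{subject to} \quad M = L + S
```

Here \|L\|_{*} is the nuclear norm (the sum of singular values), a convex surrogate for rank, and \|S\|_{1} is the entrywise l1 norm, a convex surrogate for the number of nonzero entries.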

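5. Learning in Recurrent Neural Networks

Talk by Professor Zhang Lei of Sichuan University.

With the arrival of the big-data era and the rise of deep neural networks, neural networks have achieved remarkable success in image understanding, speech recognition, natural language processing, and other fields. Recurrent neural networks (RNNs), a type of neural network used mainly for sequential data, are widely applied to sequence tasks such as machine translation, image understanding, sentiment analysis, and speech translation. This lecture systematically reviews RNNs, introduces their two learning algorithms, Back Propagation Through Time (BPTT) and Real Time Recurrent Learning (RTRL), and on that basis offers "further thoughts" on the problems that arise in RNN training. Specifically, it covers: (1) biological versus artificial neural networks; (2) the RNN learning algorithms BPTT and RTRL; (3) the "vanishing gradient" problem in RNN training and its remedies, followed by a brief introduction to newer RNN models such as Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Recurrent Highway Network (RHN).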
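The vanishing gradient problem mentioned in item (3) can be seen in a toy setting. The sketch below assumes a scalar RNN with the update h_t = tanh(w * h_{t-1} + x_t); the function name `bptt_gradient_magnitudes`, the weight 0.5, and the constant inputs are all hypothetical choices made for illustration, not material from the talk.

```python
import math


def bptt_gradient_magnitudes(w, inputs, h0=0.0):
    """Return |d h_T / d h_t| for t = T, T-1, ..., 0 in a scalar tanh RNN."""
    # Forward pass: h_t = tanh(w * h_{t-1} + x_t)
    hs = [h0]
    for x in inputs:
        hs.append(math.tanh(w * hs[-1] + x))

    # Backward pass (BPTT): d h_t / d h_{t-1} = (1 - h_t^2) * w
    grad = 1.0  # d h_T / d h_T
    mags = [abs(grad)]
    for t in range(len(inputs), 0, -1):
        grad *= (1.0 - hs[t] ** 2) * w
        mags.append(abs(grad))
    return mags


mags = bptt_gradient_magnitudes(w=0.5, inputs=[0.1] * 20)
print(mags[0], mags[-1])  # gradient at the last step vs. 20 steps back
```

Because the tanh derivative is at most 1, each backward step multiplies the gradient by a factor no larger than |w|; with |w| < 1 the gradient with respect to early hidden states shrinks at least geometrically. Gated architectures such as LSTM and GRU are designed to ease this decay.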

6. Towards Understanding Deep Learning: Two Theories of Stochastic Gradient Langevin Dynamics

Talk by Professor Wang Liwei of Peking University.

Deep learning has achieved great success in many applications. However, deep learning is a mystery from a learning theory point of view. In all typical deep learning tasks, the number of free parameters of the networks is at least an order of magnitude larger than the number of training data. This rules out the possibility of using any model complexity-based learning theory (VC dimension, Rademacher complexity etc.) to explain the good generalization ability of deep learning. Indeed, the best paper of ICLR 2017 “Understanding Deep Learning Requires Rethinking Generalization” conducted a series of carefully designed experiments and concluded that all previously well-known learning theories fail to explain the phenomenon of deep learning.

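7. Structured Learning Strategies for Large-Scale Classification Tasks

Talk by Professor Hu Qinghua of Tianjin University.

As data grows in scale, the tasks facing classification learning algorithms become increasingly complex: the number of classes grows from a handful to hundreds or even tens of thousands. At that scale, the class labels may form complex structural relationships. Making full use of this structural information can significantly improve classification performance and the reliability of decisions. This talk discusses the characteristics of structured learning tasks, evaluation metrics, feature evaluation, and algorithms for constructing classification models.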

8. Active Learning: Query Less for More

Talk by Associate Professor Huang Shengjun of Nanjing University of Aeronautics and Astronautics.

In supervised learning, a large training set of labeled examples is usually required to train an effective model. However, in many real applications, unlabeled data are plentiful but labeled data are limited, and acquiring labels is costly. Active learning reduces the labeling cost by iteratively selecting the most valuable data and querying the annotator for their labels. This talk will summarize some important issues in active learning, including the design of selection criteria and query types, querying imperfect annotators, and fast selection from large-scale unlabeled data. Our recent efforts towards solving these issues will be reported.
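One common selection criterion is uncertainty sampling, which queries the examples the current model is least sure about. The sketch below is an assumption-laden illustration, not the speaker's method: the helper names `entropy` and `select_queries`, the `budget` parameter, and the pool of predicted probabilities are all hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy of a predicted class distribution (natural log)."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_queries(pool_probs, budget):
    """Return indices of the `budget` pool items with the highest entropy."""
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: entropy(pool_probs[i]),
        reverse=True,  # most uncertain first
    )
    return ranked[:budget]


# Hypothetical model predictions for four unlabeled examples.
pool_probs = [
    [0.98, 0.02],
    [0.55, 0.45],
    [0.80, 0.20],
    [0.50, 0.50],
]
queries = select_queries(pool_probs, budget=2)
print(queries)  # [3, 1]
```

In a full active learning loop, the selected examples would be labeled by the annotator, added to the training set, and the model retrained before the next round of selection.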
