关于 #今日arXiv精选 

这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者。

Learning from Multiple Noisy Augmented Data Sets for Better Cross-Lingual Spoken Language Understanding

Comment: Long paper at EMNLP 2021

Link: http://arxiv.org/abs/2109.01583

Abstract

Lack of training data presents a grand challenge to scaling out spokenlanguage understanding (SLU) to low-resource languages. Although various dataaugmentation approaches have been proposed to synthesize training data inlow-resource target languages, the augmented data sets are often noisy, andthus impede the performance of SLU models. In this paper we focus on mitigatingnoise in augmented data. We develop a denoising training approach. Multiplemodels are trained with data produced by various augmented methods. Thosemodels provide supervision signals to each other. The experimental results showthat our method outperforms the existing state of the art by 3.05 and 4.24percentage points on two benchmark datasets, respectively. The code will bemade open sourced on github.

Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation

Comment: Findings of EMNLP 2021

Link: http://arxiv.org/abs/2109.01484

Abstract

Exemplar-Guided Paraphrase Generation (EGPG) aims to generate a targetsentence which conforms to the style of the given exemplar while encapsulatingthe content information of the source sentence. In this paper, we propose a newmethod with the goal of learning a better representation of the style andthecontent. This method is mainly motivated by the recent success of contrastivelearning which has demonstrated its power in unsupervised feature extractiontasks. The idea is to design two contrastive losses with respect to the contentand the style by considering two problem characteristics during training. Onecharacteristic is that the target sentence shares the same content with thesource sentence, and the second characteristic is that the target sentenceshares the same style with the exemplar. These two contrastive losses areincorporated into the general encoder-decoder paradigm. Experiments on twodatasets, namely QQP-Pos and ParaNMT, demonstrate the effectiveness of ourproposed constrastive losses.

Language Modeling, Lexical Translation, Reordering: The Training Process of NMT through the Lens of Classical SMT

Comment: EMNLP 2021

Link: http://arxiv.org/abs/2109.01396

Abstract

Differently from the traditional statistical MT that decomposes thetranslation task into distinct separately learned components, neural machinetranslation uses a single neural network to model the entire translationprocess. Despite neural machine translation being de-facto standard, it isstill not clear how NMT models acquire different competences over the course oftraining, and how this mirrors the different models in traditional SMT. In thiswork, we look at the competences related to three core SMT components and findthat during training, NMT first focuses on learning target-side languagemodeling, then improves translation quality approaching word-by-wordtranslation, and finally learns more complicated reordering patterns. We showthat this behavior holds for several models and language pairs. Additionally,we explain how such an understanding of the training process can be useful inpractice and, as an example, show how it can be used to improve vanillanon-autoregressive neural machine translation by guiding teacher modelselection.

Detecting Speaker Personas from Conversational Texts

Comment: Accepted by EMNLP 2021

Link: http://arxiv.org/abs/2109.01330

Abstract

Personas are useful for dialogue response prediction. However, the personasused in current studies are pre-defined and hard to obtain before aconversation. To tackle this issue, we study a new task, named Speaker PersonaDetection (SPD), which aims to detect speaker personas based on the plainconversational text. In this task, a best-matched persona is searched out fromcandidates given the conversational text. This is a many-to-many semanticmatching task because both contexts and personas in SPD are composed ofmultiple sentences. The long-term dependency and the dynamic redundancy amongthese sentences increase the difficulty of this task. We build a dataset forSPD, dubbed as Persona Match on Persona-Chat (PMPC). Furthermore, we evaluateseveral baseline models and propose utterance-to-profile (U2P) matchingnetworks for this task. The U2P models operate at a fine granularity whichtreat both contexts and personas as sets of multiple sequences. Then, eachsequence pair is scored and an interpretable overall score is obtained for acontext-persona pair through aggregation. Evaluation results show that the U2Pmodels outperform their baseline counterparts significantly.

·

今日arXiv精选 | 4篇EMNLP 2021最新论文相关推荐

  1. 今日arXiv精选 | 11篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者. Does Vision-and-Language Pretraining I ...

  2. 今日arXiv精选 | 13篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者. Classification-based Quality Estimatio ...

  3. 今日arXiv精选 | 21篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者. Efficient Domain Adaptation of Languag ...

  4. 今日arXiv精选 | 28篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者. Broaden the Vision: Geo-Diverse Visual ...

  5. 今日arXiv精选 | 46篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者. Neural Machine Translation Quality and ...

  6. 今日arXiv精选 | 31篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从 arXiv 中精选高质量论文,推送给读者. Analysis of Language Change in Colla ...

  7. 今日arXiv精选 | 15篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从 arXiv 中精选高质量论文,推送给读者. Beyond Preserved Accuracy: Evaluatin ...

  8. 今日arXiv精选 | 12篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者. You should evaluate your language mode ...

  9. 今日arXiv精选 | 14篇EMNLP 2021最新论文

     关于 #今日arXiv精选  这是「AI 学术前沿」旗下的一档栏目,编辑将每日从arXiv中精选高质量论文,推送给读者. Effective Sequence-to-Sequence Dialogu ...

最新文章

  1. linux磁盘分配方案,张明贵-Linux磁盘分区方案
  2. Silverlight MMORPG团队项目截图
  3. sparksql dataframe变成csv保存_Spark大数据分析(三):DataFrame和SQL
  4. 使用maven聚合安装多个maven工程到本地仓库报错的解决方法:child module pom.xml does not exist
  5. anaconda创建新环境_【创建社会主义新农村】怀城街道:转变整治理念 农村人居环境换新颜...
  6. AD19一键同时修改PCB各元器件黄色标志字符的尺寸(大了太占空间改小一点
  7. Hive怎样加入第三方JAR
  8. 从零开始学Docker(超详细讲解+案例分析)
  9. 民企信息化建设个人经历(二)
  10. linux之调试触摸屏驱动
  11. Google 回归中国,你准备好成为 Googler 了吗?
  12. c++课程设计图书馆管理系统总结
  13. 印象最深的一次服务器维护中,与服务器最深的羁绊,你有着怎样的闯荡大荒的故事?...
  14. 多测师肖sir_高级金牌讲师_面试题
  15. FISCO BCOS 2022年度贡献者榜单
  16. 自然语言处理(NLP)常用开源工具总结----不定期更新
  17. 华为鸿蒙Matepad 11,你值得拥有的学习、娱乐、办公利器
  18. Huggingface的from pretrained的下载代理服务器方法设置
  19. SAP 从零起步之 1.2 编辑科目表清单
  20. 实践是检验真理的唯一标准2 - 脱壳篇03

热门文章

  1. android添加动画文件,Android开发之图形图像与动画(五)LayoutAnimationController详解
  2. ubuntu共享usb接口给虚拟机_如何在虚拟机的Ubuntu12.04中使用外部USB设备
  3. 爬取虎扑NBA首页主干道推荐贴的一只小爬虫,日常爬不冷笑话解闷
  4. qsort()编译器自带快速排序的用法
  5. SaltStack安装Redis-第十篇
  6. Mac下Homebrew的图形化界面工具Cakebrew
  7. (原创)VS2017 C# 运行 Javasrcipt RSA 加密用户名登录 Java开发的服务器
  8. Java之关于面向对象
  9. tableview 分组显示返回footerviewt和headerView的高度不能为0的问题
  10. Android 博客园客户端 (八) 下拉刷新、分页,AsyncTask