INTERSPEECH2020大会收录了哪些论文？

*转载文章请留言联系作者

众所周知，INTERSPEECH论文的入选门槛较高，竞争异常激烈，那今年有哪些论文被大会收录了呢？

我们根据语音的几个方向对顶会收录的153篇论文进行了整理汇总，希望可以帮助大家快速获取想要的论文~

整理的论文主要分为以下几个语音领域方向：

语音合成
语音识别
场景&说话人识别
语音增强
多模&翻译

1. 语音合成

1.A Cyclical Post-filtering Approach to Mismatch Refinement of Neural Vocoder for Text-to-speech Systems.pdf

2.Attentron Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding.pdf

3.Bunched LPCNet Vocoder for Low-cost Neural Text-To-Speech Systems.pdf

4.Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS.pdf

5.Controllable Neural Prosody Synthesis.pdf

6.Converting Anyone’s Emotion Towards Speaker-Independent Emotional Voice Conversion.pdf

7.Deep MOS Predictor for Synthetic Speech Using Cluster-Based Modeling.pdf

8.DurIAN-SC Duration Informed Attention Network based Singing Voice Conversion System.pdf

9.Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion.pdf

10.Exploring TTS without T Using BiologicallyPsychologically Motivated Neural Network Modules (ZeroSpeech 2020).pdf

11.From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint.pdf

12.Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning.pdf

13.Quasi-Periodic Parallel WaveGAN Vocoder A Non-autoregressive Pitchdependent Dilated Convolution Model for Parametric Speech Generation.pdf

14.Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes.pdf

15.Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder.pdf

16.One Model, Many Languages Meta-learning for Multilingual Text-to-Speech.pdf

17.Peking Opera Synthesis via Duration Informed Attention Network.pdf

18.Phonological Features for 0-shot Multilingual Speech Synthesis.pdf

19.Proc. Interspeech 2020-Improving Opus Low Bit Rate Quality with Neural Speech Synthesis.pdf

20.Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit.pdf

21.Quantification of Transducer Misalignment in Ultrasound Tongue Imaging.pdf

22.Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning.pdf

23.Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation.pdf

24.Speaker Conditional WaveRNN Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions.pdf

25.Speaking Speed Control of End-to-End Speech Synthesis using Sentence-Level Conditioning.pdf

26.Speech-to-Singing Conversion based on Boundary Equilibrium GAN.pdf

27.SpeedySpeech Efficient Neural Speech Synthesis.pdf

28.Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis-Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis.pdf

29.Understanding Self-Attention of Self-Supervised Audio Transformers.pdf

30.Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages.pdf

31.Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones.pdf

32.VocGAN A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network.pdf

33.Vocoder-Based Speech Synthesis from Silent Videos.pdf

34.WG-WaveNet Real-Time High-Fidelity Speech Synthesis without GPU.pdf

2. 语音识别

Interspeech 2020收录了大约61篇语音识别的论文，以下是部分论文题目。（论文集中是完整的61篇论文）

1.Augmenting Generative Adversarial Networks for Speech Emotion Recognition.pdf

2.Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder.pdf

3.Autosegmental Neural Nets Should Phones and Tones be Synchronous or Asynchronous.pdf

4.CAT A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency.pdf

5.CLASS LM AND WORD MAPPING FOR CONTEXTUAL BIASING IN END-TO-END ASR.pdf

6.Conference paper at Interspeech 2020-Evaluating the reliability of acoustic speech embeddings.pdf

7.Conv-Transformer Transducer Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition.pdf

8.Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings.pdf

9.Cotatron Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion without Parallel Data.pdf

10.DARTS-ASR Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation.pdf

3. 场景&说话人识别

Interspeech 2020收录了大约34篇场景&说话人识别的论文，以下是部分论文题目。（论文集中是完整的34篇论文）

1.End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors.pdf

2.Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification.pdf

3.Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification.pdf

4.Extrapolating False Alarm Rates in Automatic Speaker Verification.pdf

5.Identify Speakers in Cocktail Parties with End-to-End Attention.pdf

6.Improving Multi-Scale Aggregation Using Feature Pyramid Module for Robust Speaker Verification of Variable-Duration Utterances.pdf

7.Improving on-device speaker verification using federated learning with privacy.pdf

8.Interspeech 2020-Sum-Product Networks for Robust Automatic Speaker Identification.pdf

9.JukeBox A Multilingual Singer Recognition Dataset.pdf

10.Length- and Noise-aware Training Techniques for Short-utterance Speaker Recognition.pdf

4.语音增强

Interspeech 2020收录了大约24篇场景&说话人识别的论文，以下是部分论文题目。（论文集中是完整的24篇论文）

1.Deep Noise Suppression Challenge Datasets, Subjective Testing Framework, and Challenge Results.pdf

2.Do face masks introduce bias in speech technologies The case of automated scoring of speaking proficiency.pdf

3.Dual-Path Transformer Network Direct Context-Aware Modeling for End-to-End Monaural Speech Separation.pdf

4.Efficient Low-Latency Speech Enhancement with Mobile Audio Streaming Networks.pdf

5.Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement.pdf

6.g2pM A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset.pdf

4.翻译&多模

翻译的论文有1篇

Efficient Wait-k Models for Simultaneous Machine Translation.pdf

多模方向的论文有3篇

1.Automatic Quality Assessment for Audio-Visual Verification.pdf

Systems. The LOVe submission to NIST SRE Challenge 2019

2.End-to-End Lip Synchronisation.pdf

3.Unsupervised vs. transfer learning for multimodal one-shot matching of speech and images.pdf

我们已经帮小伙伴们准备好了153篇Interspeech 2020论文集，快来解读顶会的前沿研究与技术吧！

论文获取方式

添加叶子：shenlanyez，备注“语音论文”即可领取~

* 感谢深蓝学院多门语音课程的优秀学员——龙岩同学，在百忙之中协助我们细分了论文的各个领域方向！

INTERSPEECH2020大会收录了哪些论文？相关推荐

计算机区块链的杂志,CCF区块链技术大会收录Wanchain共识论文并推荐SCI期刊检索...
1. The following terms and conditions ("Terms") shall govern the relationship between Wanc ...
[转帖]美国《工程索引》收录中国科技论文的最新规定
来源:http://www.dytrol.com/dispbbs.asp?boardID=24&ID=4901&page=1 朱诚 (中国高等学校自然科学学报研究会对外联络委员会) ...
ICML 2019收录774篇论文：谷歌153篇，清华北大26篇
晓查发自凹非寺量子位报道 | 公众号 QbitAI 机器学习领域顶级学术会议ICML 2019将于6月10日-15日在美国加州长滩举办.近日ICML主办方公布了今年被收录的文章名单. 据统计 ...
在淘宝知网查重论文会被收录到学术论文联合对比库吗？
有同学问之前在淘宝买了个知网查重,太贵了和同学一起买的,论文名字写的同学的.最近在网上看到有人说只要查过重一年后就会被收录,那万一毕业被抽查查到岂不是杯具了? 其实是不会被收录的,除非被商家恶意上传, ...
深圳神目信息COVID-19抗疫科研成果入选ICMLA2020 oral论文
全球机器学习与应用国际顶级会议ICMLA2020(International Conference on Machine Learning)将于2020年12月在美国佛罗里达举行,会议由IEEE主办. ...
INTERSPEECH2023｜达摩院语音实验室入选论文全况速览
近日,语音技术领域旗舰会议INTERSPEECH 2023公布了本届论文审稿结果,阿里巴巴达摩院语音实验室有17篇论文被大会收录. 01 论文题目:FunASR: A Fundamental End- ...
人脸识别技术在COVID-19抗疫中的应用：发热病人的筛查及密切接触者追踪
全球机器学习与应用国际顶级会议ICMLA2020(International Conference on Machine Learning)将于2020年12月在美国佛罗里达举行,会议由IEEE主办. ...
九亿条数据汇聚之下的技术脉动
撰文:康翔编辑:阿由设计:紫菜麻辣鲜香的美食,往往会压制味蕾对其他味道的感受.同样,火锅飘香的成都,也让人经常会有意无意地忽视了它背后的数字韵味. 实际上,在三年前的第十九届高交会"2 ...
史上最大规模ACL大会放榜，百度10篇NLP论文被录用！
近日,自然语言处理(NLP)领域的国际顶级学术会议"国际计算语言学协会年会"(ACL 2019)公布了今年大会论文录用结果.根据 ACL 2019 官方数据,今年大会的有效投稿数量 ...

INTERSPEECH2020大会收录了哪些论文？

INTERSPEECH2020大会收录了哪些论文？相关推荐

最新文章

热门文章