Links to all the papers are collected on the author's GitHub. Feel free to follow!

1. Grapheme-to-phoneme conversion

1.1 Rule-based

(1998) Issues in building general letter to sound rules

1.2 Joint-sequence models

(2002) Pronunciation of proper names with a joint n-gram model for bi-directional grapheme-to-phoneme conversion
(2008) Joint-sequence models for grapheme-to-phoneme conversion
(ICASSP 2017) Bayesian joint-sequence models for grapheme-to-phoneme conversion

1.3 Encoder-decoder architectures

(Interspeech 2015) Sequence-to-sequence neural net models for grapheme-to-phoneme conversion
(Interspeech 2017) Multitask sequence-to-sequence models for grapheme-to-phoneme conversion
(ICASSP 2018) Convolutional sequence to sequence model with non-sequential greedy decoding for grapheme to phoneme conversion
(2019) Grapheme-to-phoneme conversion with convolutional neural networks
(ICASSP 2021) Joint Alignment Learning-Attention Based Model for Grapheme-to-Phoneme Conversion
(ICASSP 2022) Neural Grapheme-To-Phoneme Conversion with Pre-Trained Grapheme Models

1.4 Long short-term memory recurrent neural networks

(ICASSP 2015) Grapheme-to-phoneme conversion using long short-term memory recurrent neural networks

1.5 Multilingual encoding

(2016) Grapheme-to-phoneme models for any language
(2017) Massively multilingual neural grapheme-to-phoneme conversion
(2018) Epitran: Precision g2p for many languages

1.6 Low-resource recurrent neural network models

(ICASSP 2017) Low-resource grapheme-to-phoneme conversion using recurrent neural networks

2. Chinese-to-pinyin conversion

2.1 Rule-based

(1998) Chinese sentence tokenization using Viterbi decoder
(2004) Grapheme-to-phoneme conversion for Chinese text-to-speech
(2005) Grapheme-to-phoneme conversion in Chinese TTS system
(2006) 汉语字音转换中的多层面多音字读音消歧
(2008) 中文TTS系统中多音字的一种解决方案
(2008) Disambiguating effectively Chinese polyphonic ambiguity based on unify approach
(2010) Disambiguation for polyphones of Chinese based on two-pass unified approach

2.2 Machine learning

2.2.1 Decision-tree classification

(1996) Comparison of Two Tree-Structured Approaches for Grapheme-to-Phoneme Conversion
(1996) The broad study of homograph disambiguity for mandarin speech synthesis
(2002) An efficient way to learn rules for grapheme-to-phoneme conversion in Chinese
(2010) Polyphonic word disambiguation with machine learning approaches

2.2.2 Maximum entropy

(2007) 基于最大熵模型的多音字消歧
(2007) Inequality maximum entropy classifier with character features for polyphone disambiguation in mandarin tts systems
(2011) Polyphone disambiguation based on maximum entropy model in mandarin grapheme-to-phoneme conversion

2.2.3 TBL (transformation-based learning) algorithm

(2005) Grapheme-to-phoneme conversion based on TBL algorithm in Mandarin TTS systems
(ICASSP 2008) Tree-guided transformation-based homograph disambiguation in Mandarin TTS system

2.3 Deep learning

2.3.1 LSTM

(ICASSP 2016) A Bi-directional LSTM Approach for Polyphone Disambiguation in Mandarin Chinese
(2020) 基于深度学习的中文至拼音首字母自动转化方法
(2021) A Method of Polyphone Disambiguation Based on Semantic Extension

2.3.2 Neural networks

(2019) Polyphone Disambiguation for Mandarin Chinese Using Conditional Neural Network with Multi-level Embedding Features
(Interspeech 2020) A Mask-based Model for Mandarin Chinese Polyphone Disambiguation

2.3.3 Encoder-decoder architectures

(Interspeech 2019) Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT
(2020) Multilingual grapheme-to-phoneme conversion with byte representation
(2020) Neural Machine Translation for Multilingual Grapheme-to-Phoneme Conversion

2.3.4 Pre-trained models

(Interspeech 2019) Pre-Trained Text Representations for Improving Front-End Text Processing in Mandarin Text-to-Speech Synthesis
(ASRU 2019) Knowledge Distillation from Bert in Pre-Training and Fine-Tuning for Polyphone Disambiguation
(ICASSP 2021) A Universal Bert-Based Front-End Model for Mandarin Text-To-Speech Synthesis
(ICASSP 2022) Data Augmentation for Long-Tailed and Imbalanced Polyphone Disambiguation in Mandarin

2.4 New toolkits

(2020) g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Dataset
