Unlimited Vocabulary Grapheme to Phoneme Conversion for Korean TTS

AI-generated keywords: Grapheme-to-Phoneme Conversion Korean TTS Morpheme Normalization Phrase-Break Detection Phoneme Connectivity

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Paper presents a grapheme-to-phoneme conversion method for Korean TTS systems
  • Method utilizes phoneme connectivity and CCV conversion rules for accurate and efficient conversion
  • Method consists of four main modules: morpheme normalization, phrase-break detection, morpheme-to-phoneme conversion, and phoneme connectivity check
  • Morpheme normalization module replaces non-Korean symbols with standard Korean graphemes for consistency
  • Phrase-break detector assigns appropriate phrase breaks based on part-of-speech information
  • Morpheme-to-phoneme conversion module converts each morpheme into phonetic patterns using a morpheme phonetic pattern dictionary
  • Graphemes within each morpheme are grouped into CCV patterns and converted into corresponding phonemes using CCV conversion rules
  • Phoneme connectivity table ensures grammaticality by checking compatibility between adjacent phonetic morphemes
  • Proposed method achieved 99.9% accuracy in grapheme-to-phoneme conversion and 97.5% accuracy in sentence conversion in evaluation with a corpus of 4,973 sentences
  • Authors are implementing a full Korean TTS system based on this method
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Byeongchang Kim (POSTECH, Korea), WonIl Lee (POSTECH, Korea), Geunbae Lee (POSTECH, Korea), Jong-Hyeok Lee (POSTECH, Korea)

5 pages, uses colacl.sty and acl.bst, uses epsfig. To appear in the Proceedings of the Joint 17th International Conference on Computational Linguistics 36th Annual Meeting of the Association for Computational Linguistics (COLING-ACL'98)

Abstract: This paper describes a grapheme-to-phoneme conversion method using phoneme connectivity and CCV conversion rules. The method consists of mainly four modules including morpheme normalization, phrase-break detection, morpheme to phoneme conversion and phoneme connectivity check. The morpheme normalization is to replace non-Korean symbols into standard Korean graphemes. The phrase-break detector assigns phrase breaks using part-of-speech (POS) information. In the morpheme-to-phoneme conversion module, each morpheme in the phrase is converted into phonetic patterns by looking up the morpheme phonetic pattern dictionary which contains candidate phonological changes in boundaries of the morphemes. Graphemes within a morpheme are grouped into CCV patterns and converted into phonemes by the CCV conversion rules. The phoneme connectivity table supports grammaticality checking of the adjacent two phonetic morphemes. In the experiments with a corpus of 4,973 sentences, we achieved 99.9% of the grapheme-to-phoneme conversion performance and 97.5% of the sentence conversion performance. The full Korean TTS system is now being implemented using this conversion method.

Submitted to arXiv on 10 Jun. 1998

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: cmp-lg/9806008v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

This paper titled "Unlimited Vocabulary Grapheme to Phoneme Conversion for Korean TTS" presents a detailed description of a grapheme-to-phoneme conversion method for Korean text-to-speech (TTS) systems. The proposed method utilizes phoneme connectivity and CCV conversion rules to achieve accurate and efficient conversion. The method is composed of four main modules: morpheme normalization, phrase-break detection, morpheme-to-phoneme conversion, and phoneme connectivity check. In the morpheme normalization module, non-Korean symbols are replaced with standard Korean graphemes to ensure consistency in the input data. The phrase-break detector assigns appropriate phrase breaks based on part-of-speech (POS) information. In the morpheme-to-phoneme conversion module, each morpheme in the input phrase is converted into phonetic patterns by consulting a morpheme phonetic pattern dictionary. This dictionary contains candidate phonological changes that occur at the boundaries of morphemes. Additionally, graphemes within each morpheme are grouped into CCV patterns and converted into corresponding phonemes using CCV conversion rules. To ensure grammaticality, a phoneme connectivity table is employed to check the compatibility between adjacent phonetic morphemes. This table helps maintain coherence and naturalness in the synthesized speech output. The proposed method was evaluated using a corpus of 4,973 sentences which demonstrated excellent performance with 99.9% accuracy in grapheme-to-phoneme conversion and 97.5% accuracy in sentence conversion. Based on these promising results, the authors are currently implementing a full Korean TTS system utilizing this conversion method. Overall, this paper provides valuable insights into an effective approach for grapheme-to-phoneme conversion in Korean TTS systems which combines phonemic connectivity and CCV conversion rules to ensure accurate pronunciation generation while maintaining grammatical integrity in the synthesized speech output.
Created on 13 Jul. 2023

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.