add load_custom_phonemes() #33

veelion · 2023-01-05T01:52:10Z

For many technical terms, e.g. "AI" and "GitHub", the phonemes predicted by the seq2seq model do not match the correct pronunciation, so a custom dict is necessary.

This PR adds a method, load_custom_phonemes(self, file_path), to G2p. It reads a cmudict-like file into self.cmu. Usage:

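from g2p_en import G2p

texts = [
    "AI is popular on GitHub.",
]
g2p = G2p()
# Before loading the custom dict: default pronunciations
for text in texts:
    out = g2p(text)
    print(out)

# After loading the custom dict: entries from ./z-custom are used
g2p.load_custom_phonemes('./z-custom')
for text in texts:
    out = g2p(text)
    print(out)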

Output, before and after loading the custom dict:

['AY1', ' ', 'IH1', 'Z', ' ', 'P', 'AA1', 'P', 'Y', 'AH0', 'L', 'ER0', ' ', 'AA1', 'N', ' ', 'G', 'IH1', 'TH', 'AH0', 'B', ' ', '.']
['EY1', 'AY1', ' ', 'IH1', 'Z', ' ', 'P', 'AA1', 'P', 'Y', 'AH0', 'L', 'ER0', ' ', 'AA1', 'N', ' ', 'G', 'IH0', 'T', 'HH', 'AH1', 'B', ' ', '.']
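
For readers unfamiliar with the cmudict layout, here is a minimal sketch of how such a file could be parsed. It is not the code from this PR: the function name parse_custom_phonemes, the sample entries, the ";;;" comment prefix, and the lowercase-key, list-of-pronunciations shape of the result are all assumptions made for illustration. The phonemes in the sample are taken from the second output above.

```python
import io


def parse_custom_phonemes(lines):
    """Parse cmudict-style lines ("WORD PH1 PH2 ...") into a dict.

    Hypothetical helper, not the PR's implementation. Keys are lowercased
    and each value is a list holding one pronunciation (a list of phonemes),
    which is an assumption about how the lookup table is shaped.
    """
    entries = {}
    for raw in lines:
        line = raw.strip()
        # Skip blank lines and cmudict-style comment lines (assumed prefix).
        if not line or line.startswith(";;;"):
            continue
        word, *phonemes = line.split()
        # Ignore malformed lines that carry a word but no phonemes.
        if not phonemes:
            continue
        entries[word.lower()] = [phonemes]
    return entries


# Hypothetical file content; the real ./z-custom file is not shown in the PR.
sample = io.StringIO(
    ";;; custom entries\n"
    "AI  EY1 AY1\n"
    "GITHUB  G IH0 T HH AH1 B\n"
)
custom = parse_custom_phonemes(sample)
print(custom["ai"])
print(custom["github"])
```

Entries parsed this way could then be merged into an existing lookup table with dict.update(), so that custom pronunciations take precedence over the defaults.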

add load_custom_phonemes()

09a1643
