IndicF5: High-Quality Text-to-Speech for Indian Languages

datasets

language

pipeline_tag

ai4bharat/indicvoices_r

ai4bharat/Rasa

as

bn

gu

mr

hi

kn

ml

or

pa

ta

te

text-to-speech

IndicF5: High-Quality Text-to-Speech for Indian Languages

We release IndicF5, a near-human polyglot Text-to-Speech (TTS) model trained on 1417 hours of high-quality speech from Rasa, IndicTTS, LIMMITS, and IndicVoices-R.

IndicF5 supports 11 Indian languages: Mainly Assamese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi, Odia, Punjabi, Tamil, Telugu.

🚀 Installation

conda create -n indicf5 python=3.10 -y
conda activate indicf5
pip install git+https://github.com/ai4bharat/IndicF5.git

🎙 Usage

To generate speech, you need to provide three inputs:

Text to synthesize – The content you want the model to speak.
A reference prompt audio – An example speech clip that guides the model’s prosody and speaker characteristics.
Text spoken in the reference prompt audio – The transcript of the reference prompt audio.

from transformers import AutoModel
import numpy as np
import soundfile as sf

# Load IndicF5 from Hugging Face
repo_id = "ai4bharat/IndicF5"
model = AutoModel.from_pretrained(repo_id, trust_remote_code=True)

# Generate speech
audio = model(
    "नमस्ते! संगीत की तरह जीवन भी खूबसूरत होता है, बस इसे सही ताल में जीना आना चाहिए.",
    ref_audio_path="prompts/PAN_F_HAPPY_00001.wav",
    ref_text="ਭਹੰਪੀ ਵਿੱਚ ਸਮਾਰਕਾਂ ਦੇ ਭਵਨ ਨਿਰਮਾਣ ਕਲਾ ਦੇ ਵੇਰਵੇ ਗੁੰਝਲਦਾਰ ਅਤੇ ਹੈਰਾਨ ਕਰਨ ਵਾਲੇ ਹਨ, ਜੋ ਮੈਨੂੰ ਖੁਸ਼ ਕਰਦੇ  ਹਨ।"
)

# Normalize and save output
if audio.dtype == np.int16:
    audio = audio.astype(np.float32) / 32768.0
sf.write("namaste.wav", np.array(audio, dtype=np.float32), samplerate=24000)
print("Audio saved succesfully.")

Remodeled for expresivity

Terms of Use

By using this model, you agree to only clone voices for which you have explicit permission. Unauthorized voice cloning is strictly prohibited. Any misuse of this model is the responsibility of the user.

References

We would like to extend our gratitude to the authors of F5-TTS for their invaluable contributions and inspiration to this work. Their efforts have played a crucial role in advancing the field of text-to-speech synthesis.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
IndicF5		IndicF5
Parler		Parler
1.wav		1.wav
2.wav		2.wav
README.md		README.md
TAM_F_HAPPY_TRIMMED.wav		TAM_F_HAPPY_TRIMMED.wav
tts_tamil.py		tts_tamil.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

IndicF5: High-Quality Text-to-Speech for Indian Languages

🚀 Installation

🎙 Usage

Terms of Use

References

About

Uh oh!

Releases

Packages

Languages

samnaveenkumaroff/Indic-F5

Folders and files

Latest commit

History

Repository files navigation

IndicF5: High-Quality Text-to-Speech for Indian Languages

🚀 Installation

🎙 Usage

Terms of Use

References

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages