Skip to content

Explore VibeVoice T2S model export & quantization #713

@IgorSwat

Description

@IgorSwat

Problem description

VibeVoice is an open-source Text to Speech model provided by Microsoft.

What should be done

  1. Explore the model properties
  2. Try to export the model to ExecuTorch format
  3. Try to quantize the model

Benefits to React Native ExecuTorch

Compared to already implemented Kokoro model, it could bring some meaningful gains such as:

  • Remove the need for using additional phonemization library
  • Simplify the model's inference logic

Metadata

Metadata

Assignees

Labels

modelIssues related to exporting, improving, fixing ML models

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions