Download the local inference package from Hugging Face: StyleTTS2‑Lite (Vietnamese).
Annotate any non‑Vietnamese words with the appropriate language tag, e.g., [en-us]{ } for English. For more information, see eSpeakNG docs