Special tokens to control pausing?

by mstachow - opened Sep 11, 2023

Sep 11, 2023

The model appears to randomly ignore periods or commas and the speech sounds a bit odd. Are there special pause characters or something to get it to stop and take a breath?

airsheysr

Sep 20, 2023

@mstachow , in my case, i use hyphen - for pauses

sanchit-gandhi

Sep 22, 2023

The model is trained on neither commas and periods, so they are filtered from the input text via a normalisation step in the pre-processing: https://github.com/huggingface/transformers/blob/910faa3e1f1c566b23a0318f78f5caf5bda8d3b2/src/transformers/models/vits/tokenization_vits.py#L127

Using hyphens is indeed the best option here

windytrip

Nov 8, 2023

Will this model be re-trained on commas? even use hyphen, it still sounds very odd, just a bit better than commas. Besides, I found some pronounce is wrong, you can test "library".

Kazeo57

Sep 10, 2025

@mstachow ,I have same problem there, Have you find a solution? Two years later , It is discouraging ,I'm working on an other checkpoint, there is not an other best option ...

mstachow

Sep 11, 2025

I've long since given up on this model. Microsoft's new TTS model is impeccable and produces more natural speech anyway.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment