I compared ten-vad and silero-vad on fast speech to see which one splits more reliably at sentence pauses. I configured ten-vad to cut whenever a pause exceeded 20 ms and silero-vad to cut at 40 ms. For the same speaker talking continuously for thirty seconds, ten-vad made no cuts at all, whereas silero-vad did better and split the audio into several sentences.
My test audio is in Chinese. Is ten-vad's support for Chinese weaker?
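
For reference, the cutting rule I mean is roughly the following. This is only a library-agnostic sketch: it assumes each VAD has already produced one speech/non-speech flag per frame, and `split_on_pauses`, `frame_ms`, and `min_silence_ms` are hypothetical names of my own, not part of the ten-vad or silero-vad APIs.

```python
from typing import List, Tuple


def split_on_pauses(
    flags: List[bool], frame_ms: float, min_silence_ms: float
) -> List[Tuple[int, int]]:
    """Group per-frame speech flags into (start, end_exclusive) frame ranges.

    A segment is closed once the silence following it has lasted at least
    min_silence_ms. All names here are illustrative assumptions.
    """
    segments: List[Tuple[int, int]] = []
    start = None          # first frame of the segment currently open
    last_speech = -1      # index of the most recent speech frame
    silence_frames = 0    # consecutive non-speech frames since last speech

    for i, is_speech in enumerate(flags):
        if is_speech:
            if start is None:
                start = i
            last_speech = i
            silence_frames = 0
        elif start is not None:
            silence_frames += 1
            # Cut only when the pause is long enough.
            if silence_frames * frame_ms >= min_silence_ms:
                segments.append((start, last_speech + 1))
                start = None

    # Flush a segment that is still open at the end of the audio.
    if start is not None:
        segments.append((start, last_speech + 1))
    return segments


# Five speech frames, a two-frame pause, then three speech frames (10 ms frames).
flags = [True] * 5 + [False] * 2 + [True] * 3
short_pause_cut = split_on_pauses(flags, frame_ms=10, min_silence_ms=20)
long_pause_cut = split_on_pauses(flags, frame_ms=10, min_silence_ms=40)
```

With this rule, the pause threshold can only be resolved in whole frames, so a threshold close to the frame length depends heavily on whether the model marks even a single frame inside the pause as non-speech.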