🎤 Comparativa: Clonación de voz
Voz clonada desde el audio de Lazkao Txiki usando 3 modelos diferentes · Mismo texto en cada modelo
Texto
"Welcome to this test. This is a voice cloned from Lazkao Txiki using F5 TTS. The quick brown fox jumps over the lazy dog."
Texto
"Numbers are handled naturally here. Five point two million dollars and forty two percent. This is how this voice sounds when reading financial data in English."
Texto
"Welcome to this test. This is a voice cloned from Lazkao Txiki using MOSS TTS Nano. The quick brown fox jumps over the lazy dog."
Texto
"Numbers are handled naturally here. Five point two million dollars and forty two percent. This is how this voice sounds when reading financial data in English."
Texto
"Welcome to this test. This is a voice cloned from Lazkao Txiki using MOSS TTS Realtime model. The quick brown fox jumps over the lazy dog."
Texto
"Numbers are handled naturally here. Five point two million dollars and forty two percent. This is how this voice sounds when reading financial data in English."
📊 Comparativa de modelos
| Modelo | Params | Sample rate | Canales | RTF (CPU) | Clonación | Idiomas |
|---|---|---|---|---|---|---|
| F5-TTS | ~335M | 24 kHz | Mono | ~24x | ✅ Zero-shot | Cualquiera |
| MOSS-TTS-Nano | 100M | 48 kHz | Estéreo | ~8x | ✅ Zero-shot | 20 |
| MOSS-TTS-Realtime | 2.33B | 48 kHz | Estéreo | ~15x | ✅ Zero-shot | 20 |