TTS applied to the generation of datasets for automatic speech recognition (2024)
- Authors:
- USP affiliated authors: ALUISIO, SANDRA MARIA - ICMC ; PONTI, MOACIR ANTONELLI - ICMC ; CASANOVA, EDRESSON - ICMC
- Unit: ICMC
- Subjects: SPEECH RECOGNITION; DEEP LEARNING; BRAZILIAN PORTUGUESE
- Language: English
- Imprint:
- Publisher: ACL
- Publisher place: Stroudsburg
- Date published: 2024
- Source:
- Journal title: Proceedings
- Conference titles: International Conference on Computational Processing of Portuguese - PROPOR
ABNT
CASANOVA, Edresson and ALUÍSIO, Sandra Maria and PONTI, Moacir Antonelli. TTS applied to the generation of datasets for automatic speech recognition. 2024, Proceedings. Stroudsburg: ACL, 2024. Available at: https://aclanthology.org/2024.propor-1.73. Accessed: 30 Apr. 2024.
APA
Casanova, E., Aluísio, S. M., & Ponti, M. A. (2024). TTS applied to the generation of datasets for automatic speech recognition. In Proceedings. Stroudsburg: ACL. Retrieved from https://aclanthology.org/2024.propor-1.73
NLM
Casanova E, Aluísio SM, Ponti MA. TTS applied to the generation of datasets for automatic speech recognition [Internet]. Proceedings. 2024; [cited 2024 Apr 30]. Available from: https://aclanthology.org/2024.propor-1.73
Vancouver
Casanova E, Aluísio SM, Ponti MA. TTS applied to the generation of datasets for automatic speech recognition [Internet]. Proceedings. 2024; [cited 2024 Apr 30]. Available from: https://aclanthology.org/2024.propor-1.73
- SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model
- ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion
- Speech2Phone: a novel and efficient method for training speaker recognition models
- TTS-portuguese corpus: a corpus for speech synthesis in brazilian portuguese
- YourTTS: towards zero-shot multi-speaker TTS and zero-shot voice conversion for everyone
- Deep learning approaches for speech synthesis and speaker verification
- Evaluating sentence segmentation in different datasets of neuropsychological language tests in brazilian portuguese
- Transfer learning and data augmentation techniques to the COVID-19 identification tasks in ComParE 2021
- Evaluating semantic similarity methods to build semantic predictability norms of reading data
- Brazilian portuguese speech recognition using Wav2vec 2.0
How to cite
The citation is generated automatically and may not fully comply with the citation standards.