Pooja Sree K C, Murugan R
Student, Jain University
Abstract
Voice cloning is the task of learning to synthesize the voice of an unseen speaker from a few samples. While current voice cloning methods achieve promising results in Text-to-Speech (TTS) synthesis for a new voice, these approaches lack the ability to control the expressiveness of the synthesized audio. In this work, we propose a controllable voice cloning method that allows fine-grained control over various style aspects of the synthesized speech for an unseen speaker. We achieve this by explicitly conditioning the speech synthesis model on a speaker encoding, a pitch contour and latent style tokens during training. Through both quantitative and qualitative evaluations, we show that our framework can be used for various expressive voice cloning tasks using only a few transcribed or untranscribed speech samples from a new speaker. These cloning tasks include style transfer from a reference speech, synthesizing speech directly from text, and fine-grained style control by manipulating the style conditioning variables during inference.
Keywords: WaveNet, TTS, Vocoder, Spectrogram.
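The conditioning described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`softmax`, `style_embedding`, `build_conditioning`), the vector sizes and the sample values are all hypothetical, and a real system would learn the speaker encoding and style tokens with neural networks. The sketch only shows how a per-utterance speaker encoding, a per-frame pitch contour and a weighted mix of style tokens might be combined into one conditioning vector per frame, and how editing those variables at inference changes the conditioning.

```python
import math

def softmax(scores):
    """Turn arbitrary scores into weights that sum to 1."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def style_embedding(style_tokens, scores):
    """Weighted sum of style tokens (hypothetical stand-in for a learned style layer)."""
    weights = softmax(scores)
    dim = len(style_tokens[0])
    return [sum(w * tok[d] for w, tok in zip(weights, style_tokens))
            for d in range(dim)]

def build_conditioning(speaker_emb, pitch_contour, style_emb):
    """One conditioning vector per frame: [speaker encoding | f0 | style embedding]."""
    return [list(speaker_emb) + [f0] + list(style_emb) for f0 in pitch_contour]

# Assumed toy inputs: a 3-dim speaker encoding, 4 frames of pitch in Hz
# (0.0 marks an unvoiced frame), and two 2-dim style tokens.
speaker = [0.2, -0.1, 0.7]
pitch = [110.0, 115.0, 0.0, 120.0]
tokens = [[1.0, 0.0], [0.0, 1.0]]

style = style_embedding(tokens, [0.0, 0.0])   # equal mix of both tokens
frames = build_conditioning(speaker, pitch, style)

# Style control at inference: scale the pitch contour, keep everything else fixed.
raised = build_conditioning(speaker, [f0 * 1.2 for f0 in pitch], style)
```

In this sketch only the pitch entry of each frame differs between `frames` and `raised`, which is the sense in which the conditioning variables can be manipulated independently.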
Journal Name :
EPRA International Journal of Research & Development (IJRD)

Published on : 2022-02-21

Vol : 7
Issue : 2
Month : February
Year : 2022