Abstract: This paper presents a zero-shot voice cloning system leveraging the DIS-Vector framework, which disentangles and encodes key speech features: content, pitch, timbre, and rhythm. Using the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results