HuBERT: Self-Supervised Speech Representation Learning
IEEE/ACM TASLP - 2021
HuBERT provides self-supervised speech representations used by RVC-style voice conversion pipelines.
Users making voice-color experiments, cover demos, or character voice trials
Converted vocals, original vocal references, and downloadable audio for demos and listening tests
Input
Clean vocals, cover material, or separated vocal stems through an RVC v2 conversion chain
Audio formats
Output
Converted vocals, original vocal references, and downloadable audio for demos and listening tests
Best for
Users making voice-color experiments, cover demos, or character voice trials
The AI cover pipeline combines separation, RVC voice conversion, and mixdown. The core inputs are a vocal file and target voice model.
RVC v2 converts the input vocal timbre into the selected character voice model.
RMVPE is the robust vocal pitch estimator used by the RVC path and the hybrid F0 option.
The AI cover pipeline separates vocals/accompaniment before conversion and mixdown.
IEEE/ACM TASLP - 2021
HuBERT provides self-supervised speech representations used by RVC-style voice conversion pipelines.
Interspeech - 2023
RMVPE is an Interspeech 2023 robust vocal pitch estimator designed for polyphonic music and used for quality-first F0 extraction.
Official repository - current
The RVC project is the official technical source for the VITS/HuBERT retrieval-based voice conversion workflow.