Comments (2)
Hi, --hubert-label-dir should point to the transcriptions for ASR, not to the HuBERT labels extracted by the k-means model. You can just follow issue_15 and libri-label to prepare them.
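For illustration only, here is a minimal sketch of writing a .wrd label file with one transcription per line, in the same order as the manifest rows. It assumes a fairseq-style .tsv manifest (first line is the audio root, each following line is a relative path, a tab, and the sample count). The function name write_wrd_labels and the transcripts dictionary keyed by file stem are hypothetical and not part of the repository; the libri-label script mentioned above remains the reference.

```python
from pathlib import Path


def write_wrd_labels(manifest_path, transcripts, out_path):
    """Write one transcription per line, aligned with the manifest rows.

    Assumptions (not taken from the repository):
    - manifest_path is a fairseq-style .tsv: line 1 is the audio root,
      every later line is "<relative path>\t<num samples>".
    - transcripts maps an utterance id (the audio file stem) to its text.
    """
    lines = Path(manifest_path).read_text(encoding="utf-8").splitlines()
    rows = [line for line in lines[1:] if line.strip()]  # skip the root line

    labels = []
    for row in rows:
        rel_path = row.split("\t")[0]
        utt_id = Path(rel_path).stem
        if utt_id not in transcripts:
            raise KeyError(f"no transcription for utterance {utt_id!r}")
        labels.append(transcripts[utt_id].strip())

    # Line i of the label file must match row i of the manifest.
    Path(out_path).write_text("".join(label + "\n" for label in labels), encoding="utf-8")
    return labels
```

The point of the sketch is the alignment: the label directory holds text transcriptions, row for row with the manifest, rather than k-means unit ids.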
from speecht5.
Hi @Ajyy, thank you so much! I confirm that everything works fine now. I ran fine-tuning with the data.wrd file, the training loss looked good, and at inference I get results very close to those from the released model. I am currently running fine-tuning on the other data as well.
Related Issues (20)
- Baseline implementation HOT 1
- Text feature extraction using SpeechLM
- British English TTS model HOT 1
- "SpeechT5" on Android OS
- Link to train_960.tsv is broken
- What is the time taken to converge for the hidden unit tokenizer?
- Does the pre-trained model for hidden unit tokenizer use speaker embeddings?
- extract transorformer layer feature HOT 2
- WavLLM checkpoint HOT 5
- Single Task Training HOT 1
- Error in loading WavLLM model HOT 9
- Confusion/Question about SpeechT5SpeechDecoderPostnet output
- What's the model_path and data_name on inference code? HOT 1
- SpeechUT does not have a link for download HOT 2
- What languages are supported? How to specify a language?
- Unable to Download wavLLM Due to Error HOT 1
- soundfile.LibsndfileError: <exception str() failed>
- How to fine-tune SpeechT5 HifiGAN vocoder?
- Please fix the broken download link!!! So many models cann't be used without checkpoint.