Comments (5)
Thank you so much!
from rocm_lab.
Hi, I am busy at the moment, but if you are in a hurry, you can try building it yourself, the same way torchvision is built:
If you already have ROCm 5.5 installed, skipping scripts/prepare.sh should be totally fine.
Then you can modify scripts/build_torchvision.sh, replace the torchvision parts with torchaudio, run it and see if it compiles. The built wheels should be located in the dist dir.
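As a rough sketch of the step above, the rename could be scripted instead of done by hand. This assumes the build script only needs its torchvision references swapped, which may well not be enough for a real build. The helper name make_torchaudio_script and the output path scripts/build_torchaudio.sh are made up for illustration and are not part of rocm_lab.

```shell
#!/bin/sh
# Sketch: derive a torchaudio build script from the torchvision one.
# Assumption: renaming "torchvision" to "torchaudio" is the only change
# needed; a real build may also need different dependencies or flags.

# make_torchaudio_script SRC DST  (hypothetical helper)
make_torchaudio_script() {
    src=$1
    dst=$2
    # Replace every occurrence of "torchvision" and make the copy executable.
    sed 's/torchvision/torchaudio/g' "$src" > "$dst" && chmod +x "$dst"
}

# Possible usage from the root of a rocm_lab checkout:
#   make_torchaudio_script scripts/build_torchvision.sh scripts/build_torchaudio.sh
#   ./scripts/build_torchaudio.sh
#   ls dist/
```

Writing to a new file keeps the original torchvision script untouched, so the two can be diffed if the build fails.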
I will experiment with it later and eventually have a GitHub workflow for it.
UPDATE: It's not as easy as I thought, will be back later.
from rocm_lab.
I'm not in a hurry. I will try to compile and test it if I have time. Thank you for the reply.
from rocm_lab.
torchaudio==2.0.2 is now available in this workflow, and will be moved to GitHub Releases once verified.
from rocm_lab.
I managed to generate voices using neonbjb/tortoise-tts, here is how:
git clone https://github.com/neonbjb/tortoise-tts
cd tortoise-tts
python3 -m venv venv
source venv/bin/activate
curl -L -O https://github.com/evshiron/rocm_lab/releases/download/v1.14.514/torch-2.0.1+gite19229c-cp310-cp310-linux_x86_64.whl
pip install torch-2.0.1+gite19229c-cp310-cp310-linux_x86_64.whl
curl -L -O https://github.com/evshiron/rocm_lab/releases/download/v1.14.514/torchaudio-2.0.2+31de77d-cp310-cp310-linux_x86_64.whl
pip install torchaudio-2.0.2+31de77d-cp310-cp310-linux_x86_64.whl
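Both wheels above carry the cp310 tag, so they only install into a Python 3.10 environment. A small check like the following can catch a mismatch before pip reports it. It relies on the standard wheel naming scheme, where the name ends in -{python tag}-{abi tag}-{platform tag}.whl; the helper name wheel_python_tag is made up for illustration.

```shell
#!/bin/sh
# wheel_python_tag FILE  (hypothetical helper)
# Print the Python tag of a wheel filename. The tag is the third
# hyphen-separated field counted from the end of the name.
wheel_python_tag() {
    name=$(basename "$1" .whl)
    rest=${name%-*}    # drop the platform tag
    rest=${rest%-*}    # drop the ABI tag
    printf '%s\n' "${rest##*-}"
}

# Possible usage inside the activated venv; both commands should print
# the same tag if the wheel matches the interpreter:
#   wheel_python_tag torch-2.0.1+gite19229c-cp310-cp310-linux_x86_64.whl
#   python3 -c 'import sys; print("cp%d%d" % sys.version_info[:2])'
```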
I am using Python 3.10.6, and I had to edit requirements.txt like this to avoid a lot of compiling from source (don't do this if you know how to do it correctly):
diff --git a/requirements.txt b/requirements.txt
index c1846c9..0babe68 100644
--- a/requirements.txt
+++ b/requirements.txt
@@ -6,13 +6,11 @@ inflect
 progressbar
 einops==0.4.1
 unidecode
-scipy==0.10.1
+scipy
 librosa==0.9.1
-numba==0.48.0
+numba
 ffmpeg
-numpy==1.20.0
-numba==0.48.0
-torchaudio
+numpy
 threadpoolctl
 llvmlite
-appdirs
\ No newline at end of file
+appdirs
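For anyone who would rather not edit the file by hand, the same changes as in the diff above can be sketched as a filter. This is only an approximation of the manual edit: the helper name relax_requirements and the output file name are made up for illustration, and torchaudio is dropped on the assumption that the ROCm wheel installed earlier should not be replaced by pip.

```shell
#!/bin/sh
# relax_requirements FILE  (hypothetical helper, not part of tortoise-tts)
# Print FILE with the same changes as the diff above:
#   1. drop the version pins on scipy, numba and numpy,
#   2. remove the torchaudio line,
#   3. keep only the first copy of duplicated lines (numba was listed twice).
# awk also terminates the last line with a newline if it was missing.
relax_requirements() {
    sed -E -e 's/^(scipy|numba|numpy)==.*$/\1/' -e '/^torchaudio$/d' "$1" |
        awk '!seen[$0]++'
}

# Possible usage inside the tortoise-tts checkout:
#   relax_requirements requirements.txt > requirements.relaxed.txt
#   pip install -r requirements.relaxed.txt
```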
Install neonbjb/tortoise-tts into the venv:
pip install -r requirements.txt
python3 setup.py install
Generate voices:
python3 tortoise/do_tts.py --text "I'm going to speak this" --voice random --preset fast
The results will be located in the results dir.
Samples: results.zip
from rocm_lab.