Giter Club home page Giter Club logo

Comments (5)

CubeSkyy avatar CubeSkyy commented on September 20, 2024 1

Thank you so much!

from rocm_lab.

evshiron avatar evshiron commented on September 20, 2024

Hi, I am busy at the moment and if you are in a hurry, you can try building it like torchvision by yourself:

If you already have ROCm 5.5 installed, skipping scripts/prepare.sh should be totally fine.

Then you can modify scripts/build_torchvision.sh, replace torchvision stuff with torchaudio, run it and see if it compiles. The built wheels should be located in the dist dir.

I will experiment with it later and eventually have a GitHub workflow for it.

UPDATE: It's not as easy as I thought, will be back later.

from rocm_lab.

CubeSkyy avatar CubeSkyy commented on September 20, 2024

I'm not in a hurry, I will try to compile it and test If I have time. Thank you for the reply,

from rocm_lab.

evshiron avatar evshiron commented on September 20, 2024

torchaudio==2.0.2 is now available in this workflow, and will be moved to GitHub Releases once verified.

from rocm_lab.

evshiron avatar evshiron commented on September 20, 2024

I managed to generate voices using neonbjb/tortoise-tts, here is how:

git clone https://github.com/neonbjb/tortoise-tts
cd tortoise-tts

python3 -m venv venv
source venv/bin/activate

curl -L -O https://github.com/evshiron/rocm_lab/releases/download/v1.14.514/torch-2.0.1+gite19229c-cp310-cp310-linux_x86_64.whl
pip install torch-2.0.1+gite19229c-cp310-cp310-linux_x86_64.whl

curl -L -O https://github.com/evshiron/rocm_lab/releases/download/v1.14.514/torchaudio-2.0.2+31de77d-cp310-cp310-linux_x86_64.whl
pip install torchaudio-2.0.2+31de77d-cp310-cp310-linux_x86_64.whl

I am using Python 3.10.6 and I have to edit requirements.txt like this to avoid a lot of compiling from source (don't do this if you know how to do it correctly):

diff --git a/requirements.txt b/requirements.txt
index c1846c9..0babe68 100644
--- a/requirements.txt
+++ b/requirements.txt
@@ -6,13 +6,11 @@ inflect
 progressbar
 einops==0.4.1
 unidecode
-scipy==0.10.1
+scipy
 librosa==0.9.1
-numba==0.48.0
+numba
 ffmpeg
-numpy==1.20.0
-numba==0.48.0
-torchaudio
+numpy
 threadpoolctl
 llvmlite
-appdirs
\ No newline at end of file
+appdirs

Install neonbjb/tortoise-tts into venv:

pip install -r requirements.txt
python3 setup.py install

Generate voices:

python3 tortoise/do_tts.py --text "I'm going to speak this" --voice random --preset fast

The results will be located in the results dir.

Samples: results.zip

from rocm_lab.

Related Issues (15)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.