Giter Club home page Giter Club logo

Comments (8)

FENRlR avatar FENRlR commented on August 18, 2024

A huge thank you for sharing the results. The main reason of using iSTFT here was its fast synthesis speed that it showed from its original VITS variant. As so, I would say the result is far beyond my expectations. Magnificent.

from mb-istft-vits2.

shigabeev avatar shigabeev commented on August 18, 2024

@FENRlR do you know by chance the optimal configs for different sampling rates? I need 16kHz, 24kHz and 48kHz.

from mb-istft-vits2.

FENRlR avatar FENRlR commented on August 18, 2024

Currently, no. It seems there were some issues with 16kHz sampling rate in the original iSTFT repo. I've never seen the other two, however.

from mb-istft-vits2.

p0p4k avatar p0p4k commented on August 18, 2024

@FENRlR hi, can you add me on discord and ping me? (id -> p0p4k)'
thanks.

from mb-istft-vits2.

DavidNTompkins avatar DavidNTompkins commented on August 18, 2024

Super neat! Was this on an A100? Looks like it took ~3 days?

from mb-istft-vits2.

Insensiblee avatar Insensiblee commented on August 18, 2024

I downloaded the model from the web disk you provided, and reported this error when reasoning, do you know how to solve it?
RuntimeError: Error(s) in loading state_dict for SynthesizerTrn:
size mismatch for enc_p.emb.weight: copying a param with shape torch.Size([155, 192]) from checkpoint, the shape in current model is torch.Size([205, 192]).

from mb-istft-vits2.

shigabeev avatar shigabeev commented on August 18, 2024

I downloaded the model from the web disk you provided, and reported this error when reasoning, do you know how to solve it? RuntimeError: Error(s) in loading state_dict for SynthesizerTrn: size mismatch for enc_p.emb.weight: copying a param with shape torch.Size([155, 192]) from checkpoint, the shape in current model is torch.Size([205, 192]).

Hey, it's possible that the repository have changed and some weight sizes don't match defaults anymore. The easiest way to run it is to go back to the commit that dates back to the time of the post, clone it, plug in the weights and launch it from there.

from mb-istft-vits2.

FENRlR avatar FENRlR commented on August 18, 2024

@Insensiblee Before reverting back to that commit, have you tried changing symbols?
The length of symbols he used for Russian is exactly 155, while 205 is the length of the default symbol. So I'm 90% sure that
you've forgot to modify it.

from mb-istft-vits2.

Related Issues (16)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.