thjashin / multires-conv Goto Github PK

View Code? Open in Web Editor NEW

118.0 118.0 2.0 414 KB

Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)

License: MIT License

Python 97.60% Shell 2.40%

multires-conv's People

Contributors

Stargazers

Watchers

Forkers

neurohelp policecar

multires-conv's Issues

Wavenet

It was mentioned in the paper "Our model resembles WaveNet (Oord et al., 2016a) in the use of tree-structured dilated convolutions. However, our principle-guided design has distinct skip-connection structures and filter sharing patterns, resulting in significantly better parameter efficiency and performance...Additionally, the link we establish between wavelets and tree-structured dilated causal convolutions offers the first principled justification for the effectiveness of WaveNet in modeling raw audio waveforms, an exemplary case of lengthy sequences with multiscale structure."

Do you have any ablations on the difference in performance in any specific tasks or tests? Also any specific audio samples? Overall very interesting paper!

A couple of fixes and generative applications

First of all, congratulations on what looks like very exciting work!

I wanted to check your generative modelling application by running autoregressive.py, however, I had to change a couple of things to get it running:

I got an error that MultiresLayer doesn't take the mixing argument. I figured, since it was being given False as an input, I could probably just comment it out, which does run.
I also got an error when the forward() function calls out = self.output_mapping(x), since that function doesn't seem to exist. I guessed this should probably be out = self.decoder(x), which seems to be running.... and converging. :)
However, if I'm wrong about these "fixes" please let me know!

More generally, I'm very curious about the application of this model in generative contexts. I'm specifically in the music and audio field, where powerful sequence models are (obviously) essential.

So, a few things I'm wondering about:

Is there a relatively painless way to get images from the autoregressive_eval.py script, so I can see the output? (I can obviously dig in and figure this out, but if there's a quick mod you can suggest that would be great).
Do you see opportunities for conditional generation?
Since autoregressive generation can be somewhat limited, do you see opportunities using different training methods—for example, "infilling" generation via masking?

Again, thanks for your work.

drop in replacement for conv1D?

Thanks for sharing this interesting work!
Can the MultiresConv be utilized as a drop-in replacement for conv1d operating on audio signals?

Also, is it possible to build an inverse or transpose MultiresConv and use it as a drop-in replacement for transposed conv1d?

question about shared filters

Shared filters across timescales don’t necessarily sound like an advantage, any intuition on why that is better? do you have an ablation on this? Thanks for any insights!

thjashin / multires-conv Goto Github PK

multires-conv's People

Contributors

Stargazers

Watchers

Forkers

multires-conv's Issues

Wavenet

A couple of fixes and generative applications

drop in replacement for conv1D?

question about shared filters

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent