facebookresearch / octconv Goto Github PK
View Code? Open in Web Editor NEWCode for paper
License: MIT License
Code for paper
License: MIT License
If the network is downsampled n times, the input size must be 2^a (a >= n), other sizes are not supported, such as 600x1000.
I read your code and found that only not depthwise
do the # High -> Low
and # Low -> High
.
So how to get the first data_l in this setting.
Just for reference, there is a PyTorch implementation of OctConv
https://github.com/d-li14/octconv.pytorch, including detailed training logs and pre-trained models of ResNet-50 on the ImageNet benchmark.
First of all, feel the author's open source spirit, great idea!
I know that low and high frequencies are divided according to a given proportion, but how are they divided along the channel dimension? Are they divided randomly according to proportion? Or are there other pre-processing in it?
Thank you very much.
Hi,
Thanks for sharing the code. I am just wondering if there is any implementation for the Octave Transposed Convolution (octave de-convolution)? I have not found it in your code. Is there a plan to implement it?
Thanks.
Incompatible attr in node L4_B01_conv1_conv-h at 1-th input: expected [1,96,7,7], got [1,96,6,6]
Hello @cypw. First of all, thanks to your amazing research. I had a question regarding Table 2 in your paper. This is not an issue but rather a follow up question. You reported CPU inference times of Resnet-50 for various values of alpha. I'm wondering if you had observed a similar trend of decreasing run times on GPU ?
I've replicated your implementation in Tensorflow and here are my inference times on 2080Ti for Resnet-50 imagenet model. Image size 224x224x3
alpha | GPU runtime (ms) |
---|---|
0 | 13.7 |
0.125 | 22.8 |
0.25 | 23.1 |
0.5 | 23.8 |
0.75 | 22.6 |
From the above table, octave convolution performance that I get is worse on GPU.
Did you see a similar performance on GPU from your side? Does this suggest that Octave convolution improvement is more suitable for CPU rather than GPU?
Thank you !!
I have demonstrated the effectiveness of octave Conv on detection tasks.
Hi,
Thanks for your work.
Could you also release the code or the pre-trained model of Oct-I3D for action recognition?
Why OctConv processes low-frequency information with corresponding (low-frequency) convolutions can effectively enlarges the receptive field in the original pixel space.
Thank you very much for your work. Will the pre-trained model of the OctResNet-26 or 18 be released? Thanks.
or we need to train from scratch every time?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.