Comments (8)
Hi, we usually use 4 GPUs to train TSN. If you put a minibatch of 128 samples on one GPU, it will take about 24 GB of memory. So in your case you can try lowering the batch size, e.g. -b 64.
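A rough back-of-the-envelope check of those numbers, as a sketch only: it assumes memory grows linearly with batch size, which ignores the fixed cost of model weights and framework overhead, so treat the result as an upper bound. The function name `max_batch_size` and the 12 GB card are made up for illustration.

```python
def max_batch_size(ref_batch, ref_mem_gb, gpu_mem_gb):
    """Estimate the largest batch that fits on one GPU.

    Assumption: memory use scales linearly with the number of samples.
    """
    mem_per_sample_gb = ref_mem_gb / ref_batch
    return int(gpu_mem_gb // mem_per_sample_gb)


# Figures from the comment: 128 samples take about 24 GB on a single GPU.
print(max_batch_size(128, 24, 12))  # hypothetical 12 GB card
print(max_batch_size(128, 24, 16))  # 16 GB card
```

Under this linear assumption a 12 GB card tops out at about 64 samples, which is consistent with the suggested -b 64.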
from tsn-pytorch.
It was written before the 0.2.0 release but has been tested under version 0.2.0.
This is possible. Please try setting the flag --test_crops 1.
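Why this flag helps, as a hedged sketch: at test time each video is expanded into several segments, and each segment into several crops, all forwarded together. The values 25 segments and 10 crops used below are assumptions about the script's defaults, not something stated in this thread; check the argument parser in test_models.py for the actual values. `frames_per_video` is a hypothetical helper.

```python
def frames_per_video(test_segments, test_crops):
    """Number of images pushed through the network for one video."""
    return test_segments * test_crops


# Assumed defaults (verify against test_models.py): 25 segments, 10 crops.
default = frames_per_video(25, 10)
reduced = frames_per_video(25, 1)   # with --test_crops 1
print(default, reduced, default // reduced)
```

If those defaults hold, a single crop cuts the per-video input, and roughly the activation memory, by a factor of ten.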
Okay, thanks. What PyTorch version are you using?
Hi @yjxiong
I am ready to test the model, but it also runs out of memory. I am using a GPU with 16 GB VRAM. Do you have any ideas?
My command is:
sudo python test_models.py hmdb51 RGB hmdb51_rgb_val_split_1.txt hmdb51_bninception_rgb_checkpoint.pth.tar --arch BNInception --save_scores rgbscores --workers 1
The result is:
model epoch 80 best prec@1: 52.3529411864
Freezing BatchNorm2D except the first one.
THCudaCheck FAIL file=/pytorch/torch/lib/THC/generic/THCStorage.cu line=66 error=2 : out of memory
Traceback (most recent call last):
  File "test_models.py", line 128, in <module>
    rst = eval_video((i, data, label))
  File "test_models.py", line 116, in eval_video
    rst = net(input_var).data.cpu().numpy().copy()
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 224, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/parallel/data_parallel.py", line 58, in forward
    return self.module(*inputs[0], **kwargs[0])
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 224, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/dl/tsn/tsn-pytorch/models.py", line 197, in forward
    base_out = self.base_model(input.view((-1, sample_len) + input.size()[-2:]))
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 224, in __call__
    result = self.forward(*input, **kwargs)
  File "/home/dl/tsn/tsn-pytorch/tf_model_zoo/bninception/pytorch_load.py", line 49, in forward
    data_dict[op[2]] = getattr(self, op[0])(data_dict[op[-1]])
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/module.py", line 224, in __call__
    result = self.forward(*input, **kwargs)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/modules/pooling.py", line 505, in forward
    self.padding, self.ceil_mode, self.count_include_pad)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/functional.py", line 264, in avg_pool2d
    ceil_mode, count_include_pad)
  File "/usr/local/lib/python2.7/dist-packages/torch/nn/_functions/thnn/pooling.py", line 360, in forward
    ctx.ceil_mode, ctx.count_include_pad)
RuntimeError: cuda runtime error (2) : out of memory at /pytorch/torch/lib/THC/generic/THCStorage.cu:66
Thank you very much. It is OK now. Thanks for your reply, @yjxiong.
@utsavgarg Hi, I trained UCF101 on Google Colab and also limited the batch size to 64. However, the final prec@1 accuracy is 84%. I was wondering what accuracy you finally got. Thanks in advance.
@dandingol03
Could you please tell me which Google Colab tier you used (Free, Pro, or Pro+)?