Hello. I am so grateful that you wrote this implementation code.
I want to use this code to learn the model with the coco dataset, but I have a problem, so I leave a question.
While using the "run_train.py" code, an error such as a photo occurred.
At this time, attns should look like [layer_num, head_num, batch_size, max_len, code_size^2], but in my result, it appears that the dimension corresponding to [layer_num, batch_size, max_len, code_size^2] has disappeared.
To solve this problem, efforts were made to look at "models/IC_encoder_decoder/transformer.py", which seems to be a problem that occurs when attns is created through a module called layer. I'm leaving a question because I was wondering if there was a way to solve this problem.
I'll be waiting for your reply.
- In addition, due to lack of English skills, the tone of the question may be unpleasant. I ask for your generous understanding of this.