Giter Club home page Giter Club logo

v-cnn's Introduction

V-CNN

Viewport-based CNN for visual quality assessment on 360° video.

Note that this is an updated version of the approach in our CVPR2019 paper, and thus the results are further improved. There are several differences between the CVPR2019 paper and this code.

Dataloader and the corresponding files for our VQA-ODV dataset are also provided.

At least 1 GPU is required by FlowNet2.

Dependencies

Binaries

The binaries including pre-trained model, as well as the list files for VQA-ODV in inference can be obtained HERE.

Please put all these files under the log directory.

Usage

python test.py --log_dir /path/to/log/directory --flownet_ckpt /path/to/flownet2/pre-trained/model [--batch_size 1] [--num_workers 4] [--test_start_frame 21] [--test_interval 45]

Note that this released version only supports batch_size of 1 in inference. The num_workers should be set according to the condition of the computer.

It may spend a lot of time to test on all frames for each sequence. Therefore, frame drop can be set via test_start_frame and test_interval. The default settings are to test every 45 frames for each sequence, beginning with the 22 frame.

Reference

If you find this code useful in your work, please acknowledge it appropriately and cite the paper:

@inproceedings{Li_2019_CVPR,
author = {Li, Chen and Xu, Mai and Jiang, Lai and Zhang, Shanyi and Tao, Xiaoming},
title = {Viewport Proposal CNN for 360deg Video Quality Assessment},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
pages = {10177--10186},
month = {June},
year = {2019}
}

v-cnn's People

Contributors

archer-tatsu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

v-cnn's Issues

about accuracy

Hi, there is a question bothers me when I run this project. I found that the scroce SROCC sometimes is a large and negetive number during training, like -89.9%. Does it turns out a good performance?

a question about resampling

Hi, I notice that before the input send to the scnn, you guys make a 4k or 8k image resample into shape with 256 x 256. and my question is why can do this, if it makes the quality of the image worst and lose some information?

can't use flownet2

Traceback (most recent call last):
File "/dev/360/flownet2/models.py", line 9, in
from networks.resample2d_package.resample2d import Resample2d
ImportError: No module named 'networks'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "test.py", line 14, in
from flownet2.models import FlowNet2
File "/dev/360/flownet2/models.py", line 19, in
from .networks.resample2d_package.resample2d import Resample2d
File "/dev/360/flownet2/networks/resample2d_package/resample2d.py", line 3, in
import resample2d_cuda
ImportError: libcudart.so.10.0: cannot open shared object file: No such file or directory

how to solve this?

No such file or directory: '/path/to/VQA_ODV/Group1/G1BikingToWork_ERP_3840x2160_fps23.976_qp27_12306k.yuv'

when i enumerate the test_loader, it raises the following error:
File "C:\Users\LIGHT\Desktop\code\360VQA\dataset\dataset_VQA_ODV.py", line 75, in call
ori = yuv.yuv_import(file_path, resolution, 1, frame_index)
File "C:\Users\LIGHT\Desktop\code\360VQA\utils\yuv.py", line 16, in yuv_import
fp = open(file_path, 'rb')
FileNotFoundError: [Errno 2] No such file or directory: '/path/to/VQA_ODV/Group1/G1BikingToWork_ERP_3840x2160_fps23.976_qp27_12306k.yuv'

I download the links about this dataset, but there's no yuv files. How can I solve this problem?

关于 gt anchor weight 和 gt hm

我在尝试根据论文计算真实的anchor weight时发现按照论文给定的公式
image
计算得到的真实权重几乎全部近似于1/256的平均值,这应该是不正常的,请问真实权重应该如何计算呢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.