Light

archer-tatsu / v-cnn Goto Github PK

View Code? Open in Web Editor NEW

33.0 3.0 7.0 13 KB

Viewport-based CNN for visual quality assessment on 360° video

License: MIT License

Python 100.00%

v-cnn's Introduction

V-CNN

Viewport-based CNN for visual quality assessment on 360° video.

Note that this is an updated version of the approach in our CVPR2019 paper, and thus the results are further improved. There are several differences between the CVPR2019 paper and this code.

Dataloader and the corresponding files for our VQA-ODV dataset are also provided.

At least 1 GPU is required by FlowNet2.

Dependencies

python3
PyTorch == 1.0.1 (CUDA 9.0 is required for compilation of FlowNet2)
s2cnn: https://github.com/jonas-koehler/s2cnn
FlowNet2: https://github.com/NVIDIA/flownet2-pytorch
numpy
scipy
scikit-image
tqdm

Binaries

The binaries including pre-trained model, as well as the list files for VQA-ODV in inference can be obtained HERE.

Please put all these files under the log directory.

Usage

python test.py --log_dir /path/to/log/directory --flownet_ckpt /path/to/flownet2/pre-trained/model [--batch_size 1] [--num_workers 4] [--test_start_frame 21] [--test_interval 45]

Note that this released version only supports batch_size of 1 in inference. The num_workers should be set according to the condition of the computer.

It may spend a lot of time to test on all frames for each sequence. Therefore, frame drop can be set via test_start_frame and test_interval. The default settings are to test every 45 frames for each sequence, beginning with the 22 frame.

Reference

If you find this code useful in your work, please acknowledge it appropriately and cite the paper:

@inproceedings{Li_2019_CVPR,
author = {Li, Chen and Xu, Mai and Jiang, Lai and Zhang, Shanyi and Tao, Xiaoming},
title = {Viewport Proposal CNN for 360deg Video Quality Assessment},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
pages = {10177--10186},
month = {June},
year = {2019}
}

v-cnn's People

Contributors

Stargazers

Watchers

Forkers

yonghoonkwon ma3252788 bobbyhanbin haitian2du dothuhahb98 jizhongpeng danmuking

v-cnn's Issues

about accuracy

Hi, there is a question bothers me when I run this project. I found that the scroce SROCC sometimes is a large and negetive number during training, like -89.9%. Does it turns out a good performance?

a question about resampling

Hi, I notice that before the input send to the scnn, you guys make a 4k or 8k image resample into shape with 256 x 256. and my question is why can do this, if it makes the quality of the image worst and lose some information?

can't use flownet2

Traceback (most recent call last):
File "/dev/360/flownet2/models.py", line 9, in
from networks.resample2d_package.resample2d import Resample2d
ImportError: No module named 'networks'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "test.py", line 14, in
from flownet2.models import FlowNet2
File "/dev/360/flownet2/models.py", line 19, in
from .networks.resample2d_package.resample2d import Resample2d
File "/dev/360/flownet2/networks/resample2d_package/resample2d.py", line 3, in
import resample2d_cuda
ImportError: libcudart.so.10.0: cannot open shared object file: No such file or directory

how to solve this?

No such file or directory: '/path/to/VQA_ODV/Group1/G1BikingToWork_ERP_3840x2160_fps23.976_qp27_12306k.yuv'

when i enumerate the test_loader, it raises the following error:
File "C:\Users\LIGHT\Desktop\code\360VQA\dataset\dataset_VQA_ODV.py", line 75, in call
ori = yuv.yuv_import(file_path, resolution, 1, frame_index)
File "C:\Users\LIGHT\Desktop\code\360VQA\utils\yuv.py", line 16, in yuv_import
fp = open(file_path, 'rb')
FileNotFoundError: [Errno 2] No such file or directory: '/path/to/VQA_ODV/Group1/G1BikingToWork_ERP_3840x2160_fps23.976_qp27_12306k.yuv'

I download the links about this dataset, but there's no yuv files. How can I solve this problem?

关于 gt anchor weight 和 gt hm

我在尝试根据论文计算真实的anchor weight时发现按照论文给定的公式

计算得到的真实权重几乎全部近似于1/256的平均值，这应该是不正常的，请问真实权重应该如何计算呢

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.