Giter Club home page Giter Club logo

pupil-shape-prior's Introduction

Noise-Robust Pupil Center Detection through CNN-Based Segmentation with Shape-Prior Loss

Sang Yoon Han, Hyuk Jin Kwon, Yoonsik Kim, and Nam Ik Cho

[Paper]

Environments

Abstract

Detecting the pupil center plays a key role in human-computer interaction, especially for gaze tracking. The conventional deep learning-based method for this problem is to train a convolutional neural network (CNN), which takes the eye image as the input and gives the pupil center as a regression result. In this paper, we propose an indirect use of the CNN for the task, which first segments the pupil region by a CNN as a classification problem, and then finds the center of the segmented region. This is based on the observation that CNN works more robustly for the pupil segmentation than for the pupil center-point regression when the inputs are noisy IR images. Specifically, we use the UNet model for the segmentation of pupil regions in IR images and then find the pupil center as the center of mass of the segment. In designing the loss function for the segmentation, we propose a new loss term that encodes the convex shape-prior for enhancing the robustness to noise. Precisely, we penalize not only the deviation of each predicted pixel from the ground truth label but also the non-convex shape of pupils caused by the noise and reflection. For the training, we make a new dataset of 111,581 images with hand-labeled pupil regions from 29 IR eye video sequences. We also label commonly used datasets (ExCuSe and ElSe dataset) that are considered real-world noisy ones to validate our method. Experiments show that the proposed method performs better than the conventional methods that directly find the pupil center as a regression result.

Brief Description of Our Proposed Method

Overall Workflow of The Proposed Method

Example object O that satisfies convex shape condition and does not satisfy convex shape condition.

Example object O that satisfies convex shape condition and does not satisfy convex shape condition.

We add a convex-shape regularization term that any line connecting two arbitrary points of the segmentation region is inside the region. The entire loss term incorporating the convex shape prior is

where

Experimental Results

Performance comparison on each data sequence from ExCuSe & ElSe, in terms of Detection rate (%) allowing 5-pixel error tolerance. In the case of PupilNet, we report the performance shown in the original publication.*

Visualized Results

Qualitative comparison of segmentation results with/without regularization term: (a) input image with ground truth (b) result with only unary term (c) result with additional regularization term that encodes the convex shape prior. Numbers on (b),(c) represent error of estimated pupil center position.

Guidelines for Codes

Requisites should be installed beforehand.

Clone this repo.

git clone http://github.com/jaegal88/pupil-shape-prior.git
cd pupil-shape-prior/

Download & Unzip dataset on pupil-shape-prior folder and Make tfRecords

python make_TFrecord_Segmentation_ExCuSe_Augmented.py

Learns model for validate each dataset (Cross Validation)

python make_TFrecord_Segmentation_ExCuSe_Augmented.py --cross_val_num 0

Test

python UNet_test_detect_rate.py --cross_val_num 0

Dataset

ExCuSe Dataset - Binary Label
Our Own Binary Dataset - Binary_Label(training Seq.)

Citation

When you use data or codes in this site, please cite the following paper:

@article{han2020gaze,
  title={Noise-Robust Pupil Center Detection through CNN-Based Segmentation with Shape-Prior Loss},
  author={Han, Sang Yoon and Kwon, Hyuk Jin and Kim, Yoonsik and Cho, Nam Ik},
  journal={IEEE Access},
  volume={8},
  pages={64739--64749},
  year={2020},
  publisher={IEEE}
}

pupil-shape-prior's People

Contributors

jaegal88 avatar ashrafsa avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.