Giter Club home page Giter Club logo

hiragana-identifier's Introduction

Hiragana

Hiragana-Identifier

Machine Learning project to identify Japanese characters (hiragana) from a data set.

Introduction

Image recognition is crucial in the evolution of artificial intelligence. Specifically, finding an efficient model for reading handwritten characters is an ongoing research for very different alphabets, whose symbols vary drastically. In this poster the problem of reading Japanese characters (hiragana) will be solved using Convolutional Neural Networks (CNN). The data set used was taken from a Github repository, which contained 50 different characters and 20 different samples for each one. While sample size is small, a considerable level of certainty can be obtained utilizing a good learning model.

Objective

The objective of this project is to demonstrate that despite utilizing a small sample as training data, its possible to create a learning model utilizing convolutional neural networks to classify 50 different Japanese hiragana characters. The following algorithm is proposed in order to solve the problem:

Flowchart

Methodology

Our convolutional neural network consists of:

  • A convolutional layer with 50 kernels of size 5x5 with activation function ReLU
  • A maxpooling layer in order to reduce computational workload.
  • A fully connected layer consisting of 128 nodes with activation function ReLU.
  • A fully connected output layer with 50 nodes and activation function softmax.

CNN

Results

An accuracy of around 88% was obtained by training the model through the CNN with five iterations (epochs) without further improvement. There were 200 samples in the test data. results

Conclusions

From the results obtained by this model (88% accuracy) it has been partially proved that it is not necessary to have a data set of great magnitude to get exceptional classification results through convolutional neural networks. Furthermore, the advantage of having a very small sample is that applications based on this model will have a short running time. On the other side, there is still a lot of room for improvement, given that the model has not been tested on other data sets or by using re-sampling methods such as bootstrap or cross-validation.

Data set available at: Hiragana73 and HiraganaGit

Project by @RakuTheSenpai, @TheChouzenOne and @KillerFarmer

hiragana-identifier's People

Contributors

rakuthesenpai avatar thechouzanone avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

vokt

hiragana-identifier's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.