Giter Club home page Giter Club logo

tibetanmnist's Introduction

TibetanMNIST:藏文手写数字数据集

TibetanMNIST:国产MNIST数据集

这个Repo主要包括了:

1. 数据集描述

详细请见:数据集文档

  • 数据文化

    藏区按方言划分为卫藏、康巴、安多三大藏区,东接汉地九州。藏区有典:“法域卫藏、人域康巴、马域安多”即“卫藏的宗教、康巴的人、安多的马”。而藏文主要有楷体和形体两种文字,我们本次的TibetanMNIST正是形体藏文中的数字,也就是图片中连笔书写更加简洁的那种

    pic1

  • 文件列表

    i.TibetanMNIST.tfrecords(每张图像存储为28x28x3的三通道图像矩阵)

    ii.TibetanMNIST.npz(每张图像存储为28x28的单通道图像矩阵)

    iii.TibetanMNIST(原始图像文件,图像文件名的第一个数字为数字类别标签,第二个数字为数字所在纸张标签,第三个数字为纸张标签内的数字序列)

  • 数据特征及属性

    pic2

  • 数据分布

    pic3

  • 藏文数字与阿拉伯数字对照表

    pic4

  • 数据示例

    pic5

  • 数据使用

    • TFReords文件使用
     import tensorflow as tf
    
     def _parse_function(example_proto):
     	features = {
     		'label':tf.FixedLenFeature([], tf.int64),
     		'img_raw':tf.FixedLenFeature([], tf.string)
     	}
    
     	parsed_features = tf.parse_single_example(example_proto, features)
     	img = tf.decode_raw(parsed_features['img_raw'], tf.uint8)
     	img = tf.reshape(img, [28, 28, 3])
     	# 在流中抛出img张量和label张量
     	img = tf.cast(img, tf.float32) / 255
     	label = tf.cast(parsed_features['label'], tf.int32)
     	return img, label
    
     filenames = ["要读取的文件序列"]
     dataset = tf.data.TFRecordDataset(filenames)
     dataset = dataset.map(_parse_function)
     # 创建单次迭代器
     iterator = dataset.make_one_shot_iterator()
     # 读取图像数据、标签值
     image, label = iterator.get_next()
     
    • NPZ文件读取
     import numpy as np
    
     data = np.load('文件')
     x_train = data['image'].reshape(17768, 784)
     y_train = utils.to_categorical(data['label'], 10)
  • 数据来源

    **民族大学创业团队巨神人工智能科技

  • 使用注意

    本数据集版权归**民族大学所有,使用该数据请务必注明数据出处,否则我们将追究相应的法律责任!

2. 不同方法与框架实现各种脑洞

除了自己写的,剩余皆来自此网站,若下面链接打不开可自行下载根目录中文件本地打开。

tibetanmnist's People

Contributors

bat67 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

tibetanmnist's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.