rdshi / voiceprint

A simple model implemented with tensorflow for voiceprint

Home Page: https://blog.csdn.net/SrdLaplace/article/details/81942222

Shell 4.05% Python 95.95%

voiceprint's Introduction

wav --> fbank --> lstm --> embedding --> train and test / softmax (pretrain) --> train
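The wav --> fbank step starts by cutting each clip into short overlapping frames. The sketch below is not taken from the repository; it only illustrates the arithmetic implied by the 2000 ms clip length, 25 ms frame length, and 10 ms hop listed in the hyperparameters. The 16 kHz sample rate and the `frame_signal` helper are assumptions made for illustration, and no padding is applied to the last partial frame.

```python
import numpy as np


def frame_signal(signal, sample_rate, frame_ms=25, hop_ms=10):
    """Split a 1-D waveform into overlapping frames (hypothetical helper, no padding)."""
    frame_len = sample_rate * frame_ms // 1000  # samples per frame
    hop_len = sample_rate * hop_ms // 1000      # samples between frame starts
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")
    num_frames = 1 + (len(signal) - frame_len) // hop_len
    # Row i of idx holds the sample indices that belong to frame i.
    idx = np.arange(frame_len)[None, :] + hop_len * np.arange(num_frames)[:, None]
    return signal[idx]


sample_rate = 16000  # assumption: the README does not state the sample rate
clip = np.zeros(sample_rate * 2000 // 1000, dtype=np.float32)  # one 2000 ms clip
frames = frame_signal(clip, sample_rate)
print(frames.shape)  # (198, 400)
```

With 40 fbank coefficients computed per frame, one clip would then become a 198 x 40 feature matrix under these assumptions. Feature libraries that pad the final partial frame may report one frame more.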

dataset: https://datashare.is.ed.ac.uk/download/DS_10283_2651.zip

Hyperparameters used in the model

duration of wavs used for training: 2000 ms
length of each spectrogram frame: 25 ms
hop between two consecutive frames: 10 ms
number of fbank coefficients: 40
number of enrollment utterances per speaker: 5
number of units in each LSTM layer: 128
dimension of the LSTM projection layer: 64
number of stacked LSTM layers: 3
dimension of the linear layer on top of the LSTM: 64
learning rate: 0.0001
dropout prob: 0.5
batch size: 80
Each batch contains N = 8 speakers and M = 10 utterances per speaker.
~~Each batch contains N = 64 speakers and M = 10 utterances per speaker.~~
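
The batch size of 80 follows from N = 8 speakers times M = 10 utterances. As a rough illustration of how such a batch could be assembled, here is a minimal sketch. It is not the repository's data loader: `sample_batch`, the `utts_by_speaker` mapping, and the toy speaker and utterance ids are all hypothetical.

```python
import random


def sample_batch(utts_by_speaker, n_speakers=8, n_utts=10, rng=None):
    """Pick n_speakers speakers, then n_utts distinct utterances from each one."""
    rng = rng or random.Random()
    # Only speakers with enough utterances can fill their share of the batch.
    eligible = sorted(s for s, utts in utts_by_speaker.items() if len(utts) >= n_utts)
    if len(eligible) < n_speakers:
        raise ValueError("not enough speakers with the required number of utterances")
    batch = []
    for speaker in rng.sample(eligible, n_speakers):
        for utt in rng.sample(utts_by_speaker[speaker], n_utts):
            batch.append((speaker, utt))
    return batch


# Toy data (hypothetical): 12 speakers with 15 utterance ids each.
toy = {
    f"spk{s:02d}": [f"spk{s:02d}_utt{u:02d}" for u in range(15)]
    for s in range(12)
}
batch = sample_batch(toy, rng=random.Random(0))
print(len(batch))  # 80
```

Keeping the utterances of one speaker adjacent in the batch makes it easy to reshape the resulting embeddings to (N, M, dim) later, if the training loss needs per-speaker groups.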

voiceprint's Issues

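A line in the code I don't understand

Hello,
First of all, thank you very much for your work. While working through the code, there is one place I still cannot figure out:

Around line 187 of train_model.py:
embsp = sess.run(embeddings, feed_dict={x_b: test_feats[args.batch_size-len(enroll_feats):], keep_prob: 1})
embeddings requires the input x_b to have batch_size as its first dimension, right?
But the size of test_feats[args.batch_size-len(enroll_feats):] fed here is surely not batch_size?
The source code uses batch_size=128. After I changed it, this line kept raising errors. My own dataset is fairly small, so I reduced some of the parameters used to select test_file, but even with the original parameters the numbers do not add up by my calculation.
I am not sure where I have misunderstood.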
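
The size question in this issue can be checked with plain slice arithmetic. The sketch below does not reproduce train_model.py and does not claim to show what the author intended. It only counts how many rows a slice of the form `arr[batch_size - num_enroll:]` returns, and shows one possible workaround (zero-padding up to a fixed batch size) for the case where the placeholder really does require exactly batch_size rows. The helper names, `num_enroll = 40`, and the array shapes are assumptions made for illustration.

```python
import numpy as np


def tail_slice_rows(total_rows, batch_size, num_enroll):
    """Number of rows in arr[batch_size - num_enroll:] when arr has total_rows rows."""
    start = batch_size - num_enroll
    return len(range(total_rows)[start:])


def pad_to_batch(feats, batch_size):
    """Zero-pad feats along axis 0 to batch_size rows; also return the real row count."""
    n = feats.shape[0]
    if n > batch_size:
        raise ValueError("more rows than batch_size")
    pad = np.zeros((batch_size - n,) + feats.shape[1:], dtype=feats.dtype)
    return np.concatenate([feats, pad], axis=0), n


batch_size, num_enroll = 128, 40  # num_enroll is a hypothetical value
test_feats = np.ones((128, 5, 4), dtype=np.float32)  # toy shape, not the real features

tail = test_feats[batch_size - num_enroll:]
print(tail.shape[0])  # 40

padded, n_real = pad_to_batch(tail, batch_size)
print(padded.shape, n_real)  # (128, 5, 4) 40
```

In this toy case the slice has 40 rows, so it matches batch_size only if the input holds exactly 2 * batch_size - num_enroll rows. If padding is used, the embeddings computed for the padded rows should be discarded by keeping only the first `n_real` outputs.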

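voiceprint's Issues

A line in the code I don't understand

issue

Is there a corresponding paper?

None

Hello, why does the program fail with "Could not create cudnn handle: CUDNN_STATUS_NOT_INITIALIZED"?

Unable to use the GPU for computation

Test py

Dataset

pretrained_model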
