mmiglio / speechrecognition Goto Github PK
View Code? Open in Web Editor NEWSmall-footprint Keyword Spotting
Small-footprint Keyword Spotting
Implement and test residual neural network architecture.
Hello Matteo,
I am looking for a very low processor load "wake-word/phrase" mechanism for my Raspberry Pi3B based home robot (similar to PicoVoice Porcupine.)
In my search I came upon your "Small-footprint Keyword Spotting" that appears to use the full TensorFlow package. I have experimented with tflite_runtime.interpreter on my robot with an off-robot trained camera object recognition model.
Is this how I would be able to utilize your keyword spotter?
PicoVoice Porcupine uses about 10% of one core of my robot's Pi3B and is phenomenally good at far-field recognition and false rejection, (but does not allow custom wake-words for personal projects).
I have experimented with Vosk-api (the successor to Kaldi successor to PocketSphinx) but the processor load averages four times that of Porcupine (40% 15min ave. 30-100% 1min average of one core), so I continue to look for a "Small-footprint keyword spotter"
Your thoughts and suggestions?
Regards,
Alan McDonley
LSTM Networks are useful in processing two-dimensional processions of words
Useful documentation:
Test different training method (e.g. optimisers, lr schedulers, ... ) for the CNN model.
Useful resources:
Use a CNN / RNN to extract features from audio files followed by a One Class classifier trained to to recognise the keyword (marvin
in our case).
As classifier we can use either a OC-SVM or Isolation Forest.
Useful references:
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.