Giter Club home page Giter Club logo

triggerwordassistant's Introduction

Trigger Word Assistant

A voice based assistant application which executes the assigned command on detecting the trigger word from the user voice. A trigger word is a word to which the model gets triggered once it detects it. Like for example "Hey Cortana" or "Ok Google". Here the trigger word used is "activate".

The user needs to speak through the microphone , then the audio data is processed and fed to the model. The model after detecting the trigger word does the action specified in the code, like I coded it to open Chrome everytime it detects that the user has spoken activate.

The Model used for detecting trigger word uses a deep Recurrent Neural Network with Gated Recurrent Units (GRU). The model is build using Keras.

After the audio is processed by the model a notification alert is inserted at the timesteps where the model detected the trigger word in the audio file, you can listen to it inside the output folder.

Inside the audio folder the recorded voice of user is stored.

There are 2 versions for this application:

1. Python file incase you want to directly see what is happening.

Just run the script and wait for the message "Started Listening" , then speak through the microphone of the Computer and see it work.
Inside the output folder the spectogram of the processed audio is stored. Here is a gif showing this in action

Alt Text Note: The script continues to run till the desired application opened by the command after trigger detection is not closed.

2. Jupyter notebook explaining everything.

Detailed walkthrough of the code including option to listen to the actual output processed by the model. Alternate image text

Credits:

This project is based on the assignment from Sequence Models Specialization by Deeplearning.ai on Coursera. https://www.coursera.org/learn/nlp-sequence-models/home/welcome

triggerwordassistant's People

Contributors

susantabiswas avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

xdlia

triggerwordassistant's Issues

Could not train the model properly

I have tried to train the model on my custom dataset. I have done different tries.
1- I make 500 audios of 10 sec with background noise on which positive audios and negative are inserted and trained for 2000 epochs.
2- Then I make 2000 audios and train it for 800 epochs.
I also tried some other things like this but could not get any results. Not even 1%. Either it detects every thing the trigger word or does not detect even the trigger word.
Can you help me with this?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.