Anime AI Waifu is an AI powered voice assistant with VTuber's model, that combines the charm of anime characters with cutting-edge technologies. This project is meant to create an engaging experience where you can interact with desired character in real-time without powerful hardware.
-
๐ค Voice Interaction: Speak to your AI waifu and get instant (almost) responses.
- Whisper - openai's paid speech recognition.
- Google sr - free speech recognition alternative.
- Console - if you don't want use microphone just type prompts with your keyboard.
-
๐ค AI Chatbot Integration: Conversations are powered by an AI chatbot, ensuring engaging and dynamic interactions.
- Openai's 'gpt-3.5-turbo' or any other available model.
- File with personality and behaviour description.
- Remembers previous messages.
-
๐ข Text-to-Speech: Hear your AI waifu's responses as she speaks back to you, creating an immersive experience.
- Google tts - free and simple solution.
- ElevenLabs - amazing results, tons of voices.
- Console - get text responses in your console (but VTube model will be just idle).
-
๐ Integration with VTube Studio: Seamlessly connect your AI waifu to VTube Studio for an even more lifelike and visually engaging interaction.
- Lipsync while talking.
live demo here : https://www.youtube.com/shorts/SL3E6bAc6Qc
*Demonstration in real time without cutouts or speed up. This is real delay in answers.
-
Install Python 3.10.5 if you don't already have it installed.
-
Clone the repository by running
git clone https://github.com/JarikDem-Bot/ai-waifu.git
-
Install the required Python packages by running
pip install -r requirements.txt
in the project directory. -
Create
.env
file and enter your API keys.env template
VTS_TOKEN='' OPENAI_API_KEY='YOUR_OPEN_AI_KEY' ELEVENLABS_API_KEY='YOUR_ELEVENLABS_KEY'
-
Install VB-Cable
-
Install and set VTube Studio
-
Select your required settings in
main.py
inwaifu.initialise
-
Run the project by executing
python main.py
in the project directory.
Depending on the selected input mode, program may send all recorded sounds to the 3-rd parties.