Giter Club home page Giter Club logo

mikael's Introduction

Mikael

An advanced multimodal model hosted as a Discord bot that synergizes Mistral as the language model and LLaVA (Large Language and Vision Assistant) as the vision model. Together, they comprise a potent combination, incorporating a vision encoder and Vicuna for comprehensive visual and language understanding. This configuration empowers Mikael with remarkable chat capabilities, mirroring the versatility of the multimodal GPT-4.

Showcase of Mikael Discord Bot

Invite Mikael

Mikael does not require any permissions, it is a chat-only Discord bot:

https://discord.com/api/oauth2/authorize?client_id=1202687794213036112&permissions=0&scope=bot

Purpose

To combine Mistral's LLM and LLaVA multimodal model into a Discord bot for users to chat with similar to ChatGPT.

Benchmarks

Currently, Mistral's 7.3B parameter LLM can:

  • Outperform Llama 2 13B on all benchmarks
  • Outperform Llama 1 34B on many benchmarks
  • Approach CodeLlama 7B performance on code, while remaining good at English tasks
  • Use Grouped-query attention (GQA) for faster inference
  • Use Sliding Window Attention (SWA) to handle longer sequences at smaller cost

(Mistral AI, 2024)

Running locally

Mikael can be self-hosted by following these steps:

Install Ollama

For Linux:

$ curl https://ollama.ai/install.sh | sh

For MacOS:

https://ollama.ai/download/Ollama-darwin.zip

Run the following terminal commands:

  1. $ ollama pull mistral
    
  2. $ ollama pull llava
    

Mistral needs 4.7 GB, while llava requires 4.1 GB of space.

Install Requirements

$ pip install discord.py ollama

Set Mikael's Token

$ nvim /path/to/bash-or-zsh
$ export MIKAEL_TOKEN="TOKEN HERE"
$ source /path/to/bash-or-zsh

Mikael should run correctly; open an issue if it does not.

Future Plans

Privacy

  • Mikael temporarily downloads all images it's sent to be fed into LLaVA's multimodal model; after successfully finishing its task, your images are immediately deleted from the server. (L39-L59)
  • Mikael only stores your chats in random access memory (RAM) (L63-L66).

Acknowledgements

Support Server

https://discord.gg/JX4XgrQSeV

mikael's People

Contributors

ibnaleem avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

Forkers

mitch907

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.