Deploying an LLM on AWS Lambda

This repository contains code and instructions for deploying a small open-source Large Language Model (LLM) on AWS Lambda using Python and Docker. The model used for demonstration is Microsoft's Phi-2. The project aims to demonstrate how to use serverless infrastructure for LLM inference, particularly for applications that process sensitive data or perform specialized tasks.

Project Overview

The project deploys the Microsoft Phi-2 model, a 2.7 billion parameter LLM, on AWS Lambda as a Docker container image. It shows how to expose an HTTP REST endpoint through a Lambda function URL that returns the model's output together with execution details.
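As a rough illustration of what such an endpoint can look like, here is a minimal sketch of a Lambda handler for a function URL request. It is not the repository's actual handler: the `prompt` request field, the `output` and `execution_seconds` response fields, and the `generate_stub` function are assumptions made for this example, and the stub only echoes its input so the sketch runs without a model.

```python
import base64
import json
import time


def generate_stub(prompt: str) -> str:
    """Hypothetical stand-in for the model call; replace with real inference."""
    return f"(echo) {prompt}"


def handler(event, context, generate=generate_stub):
    """Handle a Lambda function URL event whose JSON body has a 'prompt' field.

    The field names used here are assumptions for illustration only.
    """
    raw = event.get("body") or "{}"
    # Function URL events may deliver the body base64-encoded.
    if event.get("isBase64Encoded"):
        raw = base64.b64decode(raw).decode("utf-8")

    try:
        payload = json.loads(raw)
    except json.JSONDecodeError:
        return {"statusCode": 400, "body": json.dumps({"error": "body must be JSON"})}

    prompt = payload.get("prompt") if isinstance(payload, dict) else None
    if not isinstance(prompt, str) or not prompt:
        return {"statusCode": 400, "body": json.dumps({"error": "missing 'prompt'"})}

    # Time the generation so the response can report execution details.
    started = time.perf_counter()
    output = generate(prompt)
    elapsed = time.perf_counter() - started

    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"output": output, "execution_seconds": round(elapsed, 3)}),
    }
```

Passing the generator in as a parameter keeps the HTTP handling separate from the model, so the handler can be exercised locally before the model is wired in.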

Detailed Guide

For a step-by-step tutorial, refer to the article "How to deploy an LLM on AWS Lambda?".

Key Features

  • Utilizes the Phi-2 model from Microsoft.
  • Implements Docker-based AWS Lambda functions.
  • Demonstrates the use of the llama-cpp-python package for LLM inference.
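To give a feel for inference with llama-cpp-python, the sketch below loads a local model file and runs one completion. The `Instruct:`/`Output:` prompt layout follows the format commonly used with Phi-2; the helper names, the `n_ctx` value, and the stop sequence are assumptions for this example, and the model path is whatever quantized file you have downloaded.

```python
def build_prompt(question: str) -> str:
    """Wrap a question in the instruct-style layout commonly used with Phi-2."""
    return f"Instruct: {question.strip()}\nOutput:"


def load_model(model_path: str):
    """Load a local model file with llama-cpp-python.

    The import is done lazily so the other helpers work without the package.
    n_ctx=2048 is an assumed context size, not a value from the repository.
    """
    from llama_cpp import Llama

    return Llama(model_path=model_path, n_ctx=2048, verbose=False)


def complete(llm, question: str, max_tokens: int = 128) -> str:
    """Run one completion and return only the generated text."""
    result = llm(
        build_prompt(question),
        max_tokens=max_tokens,
        stop=["Instruct:"],  # assumed stop sequence to end at the next turn
    )
    return result["choices"][0]["text"].strip()
```

Usage would look like `complete(load_model("/path/to/model.gguf"), "What is AWS Lambda?")`, where the path is a placeholder for your own model file.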

Prerequisites for the tutorial

  • Basic knowledge of programming, Docker, AWS, and Python.
  • AWS account with AWS CLI installed and configured.
  • Docker installed on your machine.
  • A preferred IDE, such as Visual Studio Code.

Getting Started

Clone this repository to get started with deploying your own LLM on AWS Lambda. Follow the instructions provided in the tutorial to set up your environment, run a containerized LLM locally, and deploy it to AWS Lambda.
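When testing the container locally, one way to call it is through the Runtime Interface Emulator that ships with AWS Lambda base images. The sketch below assumes the image is built on such a base image and was started with container port 8080 mapped to local port 9000; the `prompt` field in the event body is also an assumption, so adjust it to whatever the handler expects.

```python
import json
import urllib.request

# Invocation path exposed by the Lambda Runtime Interface Emulator.
# Host port 9000 is an assumption about how the container was started.
RIE_URL = "http://localhost:9000/2015-03-31/functions/function/invocations"


def build_local_request(prompt: str, url: str = RIE_URL) -> urllib.request.Request:
    """Build a POST request that mimics a function URL event with a JSON body."""
    event = {"body": json.dumps({"prompt": prompt})}  # 'prompt' is a hypothetical field
    return urllib.request.Request(
        url,
        data=json.dumps(event).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def invoke_local(prompt: str, timeout: float = 300.0) -> dict:
    """Send the request to the locally running container and decode the reply."""
    with urllib.request.urlopen(build_local_request(prompt), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

The generous timeout reflects that the first invocation has to load the model into memory before generating anything.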

Social Media and Contact

Feel free to contribute to this repository, raise issues, or suggest improvements. Your feedback and contributions are highly appreciated!
