boundaryml / baml
BAML is a templating language to write typed LLM functions. Check out the promptfiddle.com playground.
Home Page: https://docs.boundaryml.com
License: Apache License 2.0
From SyncLinear.com | CLI-20
Update w/ latest cleaner intro
It would be great to be able to chain functions together in baml files, with some sort of if/then/else logic, so that a model could, for example, run a validation function on a call and then take action based on the result.
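One possible shape for this, purely as a hypothetical sketch (the chain, let, if, and return keywords below do not exist in BAML today, and WriteSummary, ValidateSummary, and FixSummary are made-up helper functions):

```baml
// Hypothetical syntax: chain a draft function into a validator,
// then branch on the validator's result.
function WriteAndCheck {
  input (topic: string)
  output string
  chain {
    let draft = WriteSummary(topic)
    let verdict = ValidateSummary(draft)
    if verdict.ok {
      return draft
    } else {
      return FixSummary(draft, verdict.feedback)
    }
  }
}
```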
Important metric is time to first token
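For streamed responses, time to first token (TTFT) can be measured by timing how long the first chunk takes to arrive. A minimal sketch, using a fake generator to stand in for a streaming LLM response:

```python
import time
from typing import Iterable, Tuple

def time_to_first_token(stream: Iterable[str]) -> Tuple[float, str]:
    """Measure seconds until the first token arrives, then drain the stream."""
    start = time.monotonic()
    it = iter(stream)
    first = next(it)                 # blocks until the first token arrives
    ttft = time.monotonic() - start
    rest = "".join(it)               # drain the remaining tokens
    return ttft, first + rest

def fake_stream():
    """Stand-in for a streaming LLM response."""
    for tok in ["Hello", ", ", "world"]:
        time.sleep(0.01)             # simulate per-token network latency
        yield tok

ttft, text = time_to_first_token(fake_stream())
print(f"first token after {ttft:.3f}s; full text: {text!r}")
```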
Only happens in pytest, not through the actual baml test.
VSCode only listens to opened files or files in the baml_src directory. Our current approach of lazily refreshing the list of files in baml_src isn't very robust.
We should just watch for file changes preemptively via a file-watcher mechanism, so we always have the latest state.
Things to keep in mind:
This reduces the number of requests to our backend.
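The core of the watcher is comparing successive snapshots of baml_src. The extension itself would presumably use the editor's native file-watcher API; here is a language-agnostic polling sketch of the snapshot/diff logic, under the assumption that only .baml files matter:

```python
import tempfile
from pathlib import Path

def snapshot(root: str) -> dict:
    """Map every .baml file under root to its last-modified time."""
    return {str(p): p.stat().st_mtime for p in Path(root).rglob("*.baml")}

def diff(old: dict, new: dict) -> dict:
    """Classify changes between two snapshots of baml_src."""
    return {
        "added": sorted(new.keys() - old.keys()),
        "removed": sorted(old.keys() - new.keys()),
        "modified": sorted(p for p in old.keys() & new.keys() if old[p] != new[p]),
    }

# Demo: detect a newly created .baml file between two polls.
root = tempfile.mkdtemp()
Path(root, "main.baml").write_text("// existing file")
before = snapshot(root)
Path(root, "clients.baml").write_text("// new file")
after = snapshot(root)
changes = diff(before, after)
print(changes["added"])
```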
I would like to have some separation between azure/anthropic/openai/etc. clients and fallback clients. I see fallback clients as a strategy which defines a ruleset for orchestrating various LLM clients. I would prefer if this strategy gave me the ability to set the order of clients (ideally including which ones I can send in parallel), how to deal with various errors (which BAML already provides), and the ability to override default options (such as request timeouts).
Here is a specific example: I have two clients, given below, which are both responsible for calling the GPT-3.5-Turbo deployment on Azure. The only difference is the request timeout:
client<llm> AzureGPT35Turbo {
  provider baml-azure-chat
  retry_policy ZenfetchDefaultPolicy
  options {
    api_key env.AZURE_OPENAI_API_KEY
    api_base env.AZURE_OPENAI_BASE
    engine env.AZURE_GPT_35_TURBO_DEPLOYMENT_NAME
    api_version "2023-07-01-preview"
    api_type azure
    request_timeout 30
  }
}

client<llm> AzureGPT35TurboShortTimeout {
  provider baml-azure-chat
  retry_policy ZenfetchDefaultPolicy
  options {
    api_key env.AZURE_OPENAI_API_KEY
    api_base env.AZURE_OPENAI_BASE
    engine env.AZURE_GPT_35_TURBO_DEPLOYMENT_NAME
    api_version "2023-07-01-preview"
    api_type azure
    request_timeout 5
  }
}

// My version of "strategy"
client<llm> GPTFamilyShortTimeout {
  provider baml-fallback
  options {
    strategy [
      AzureGPT35TurboShortTimeout,
      AzureGPT4TurboShortTimeout
    ]
  }
}
Notice that all of the options are the same except for the request_timeout field. I did this because certain AI functions need their operations to complete quickly, so it is not reasonable to wait for the default 30-second timeout. This leads to a lot of redundancy in my clients (really, the "strategies").
With the proposed functionality, I could instead do something like:
client<llm> AzureGPT35Turbo {
  provider baml-azure-chat
  retry_policy ZenfetchDefaultPolicy
  options {
    api_key env.AZURE_OPENAI_API_KEY
    api_base env.AZURE_OPENAI_BASE
    engine env.AZURE_GPT_35_TURBO_DEPLOYMENT_NAME
    api_version "2023-07-01-preview"
    api_type azure
    request_timeout 30
  }
}

// My version of "strategy"
client<llm> GPTFamilyShortTimeout {
  provider baml-fallback
  options {
    strategy [
      AzureGPT35Turbo.options(request_timeout=5),
    ]
  }
}
client {
  options {
    strategy [
      // Comments
      GPT4
    ]
  }
}
Add a section on exceptions (which ones are passthrough vs. our client-generated ones) and how users should be thinking about them.
If you add an optional field like
function A {
  input (title: string?)
  output string
}
The generated Python code always asks you to include the title field, when it should be happy with nothing being provided. The parameter to that function call should have a default value of None.
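To illustrate the expected behavior (the function names here are made up, not the actual generated code):

```python
from typing import Optional

# What the generated wrapper effectively looks like today:
# the caller is forced to pass title, even though it is optional.
def a_current(title: Optional[str]) -> str:
    return f"title={title}"

# What it should look like: an optional field gets a default of None,
# so the caller can omit the argument entirely.
def a_fixed(title: Optional[str] = None) -> str:
    return f"title={title}"

print(a_fixed())            # title defaults to None
print(a_fixed(title="hi"))  # still works when passed explicitly
```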
From SyncLinear.com | CLI-20
It's a highly requested ask and we should explain how to do this.
Customers want to be able to just iterate on a production request / add it to their test suite.
BAML init should be usable by anyone to add BAML to their project.
The P0 is just an empty project with a README that says to go to the docs or the example repo.
The playground prompt updates on unsaved changes, but the tests do not. So if you run a test, you may be running with an old prompt until you save your changes.
Find a way to save unsaved .baml changes before running a test, so the test reflects what is shown in the playground.
This causes confusion: when writing
foo 'some value'
the parsed value becomes foo == "'some value'" (with the quotes included), which is bad.
We should have no issues serializing the inputs.
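One way the fix could look, as a minimal sketch (not the actual parser): strip exactly one layer of matching surrounding quotes when reading the value.

```python
def unquote(value: str) -> str:
    """Strip one layer of matching single or double surrounding quotes."""
    if len(value) >= 2 and value[0] == value[-1] and value[0] in ("'", '"'):
        return value[1:-1]
    return value

print(unquote("'some value'"))  # the quotes are part of the syntax, not the value
print(unquote("no quotes"))     # unchanged
```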
Add this to CI using GitHub Actions.
We should support an environment variable that can set the stage value of the data.
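A minimal sketch of what reading such a variable could look like; the name BAML_STAGE and the "prod" default are assumptions for illustration, not an existing convention:

```python
import os

def current_stage() -> str:
    """Read the data-stage label from the environment.

    BAML_STAGE is a hypothetical variable name; "prod" is an assumed default."""
    return os.environ.get("BAML_STAGE", "prod")

os.environ["BAML_STAGE"] = "dev"
stage = current_stage()
print(stage)  # dev
```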
Users update the compiler CLI but can forget to regenerate the client code.
In certain situations, I would like to issue multiple LLM calls at once and use whichever client returns first successfully. I think I would like to have this as part of a strategy option in the client interface.
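The race-to-first-success behavior described above can be sketched with asyncio; the dummy "clients" below stand in for real LLM calls, and this is an illustration of the semantics, not BAML's implementation:

```python
import asyncio

async def race(coros):
    """Run several calls concurrently and return the first successful result.

    A failure is ignored as long as some other call eventually succeeds."""
    tasks = [asyncio.ensure_future(c) for c in coros]
    try:
        for fut in asyncio.as_completed(tasks):
            try:
                return await fut  # first task to finish without raising wins
            except Exception:
                continue          # this client failed; wait for the next
        raise RuntimeError("all clients failed")
    finally:
        for t in tasks:
            t.cancel()            # cancel the losers

# Dummy clients: one fails fast, one succeeds slowly.
async def fast_fail():
    await asyncio.sleep(0.01)
    raise ValueError("rate limited")

async def slow_ok():
    await asyncio.sleep(0.05)
    return "slow-but-successful"

winner = asyncio.run(race([fast_fail(), slow_ok()]))
print(winner)  # slow-but-successful
```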
If the LLM outputs a percentage, we should still be able to parse it into an int.
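A minimal sketch of such a tolerant parser (an illustration of the idea, not BAML's actual deserializer):

```python
def parse_int(raw: str) -> int:
    """Parse an LLM-produced integer, tolerating a trailing percent sign,
    surrounding whitespace, thousands separators, and a decimal point."""
    cleaned = raw.strip().rstrip("%").strip().replace(",", "")
    return int(round(float(cleaned)))

print(parse_int("85%"))       # 85
print(parse_int(" 12.0 % "))  # 12
print(parse_int("1,024"))     # 1024
```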
Also change the website to have a "copy input" button.
This is a stopgap until we have a baml import-test of sorts.
We should find a way to dedupe this information in the analytics dashboard UI.
null
Some users have said they get confused because they have the same name.