simonw / llm
Access large language models from the command-line
Home Page: https://llm.datasette.io
License: Apache License 2.0
This will allow users to store templates for complex prompts (both as system prompts and regular prompts that have strings interpolated into them) so they can use them in the future.
llm templates edit summary
# An editor opens to edit that prompt
llm --template summary "$(curl -s https://www.example.com/)"
With a shortcut so llm -t summary works too.
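The interpolation for these templates could be sketched with Python's string.Template (an assumption about the implementation; the real mechanism may differ):

```python
from string import Template

# A stored template with a variable to interpolate, like the summary example
template = Template("Summarize this: $input")

# At invocation time the command-line argument fills in $input
prompt = template.substitute(input="Text fetched from example.com")
print(prompt)  # Summarize this: Text fetched from example.com
```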
Being able to paste in ChatGPT JSON transcripts and save them to the database would be really useful - similar to the method described in https://simonwillison.net/2023/Mar/27/ai-enhanced-development/
This is the feature I built llm for. I want to log my interactions to a SQLite database.
After setting up the OPENAI_API_KEY and running llm init-db, when I now run llm logs I get an API response to the prompt "logs" (an explanation of logs). What I expected was to see the logs.
I need to mock the OpenAI API calls so I can write tests against the llm "prompt" command.
Refs:
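A sketch of the mocking pattern (the names here are illustrative, not llm's real internals): if the code under test takes a client object, a test can hand it a mock instead of letting it hit the real OpenAI API.

```python
from unittest.mock import MagicMock

def run_prompt(client, prompt):
    # Stand-in for what the "prompt" command does: send the prompt, return text
    response = client.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
    )
    return response["choices"][0]["message"]["content"]

# In a test, the client is a mock returning a canned response:
mock_client = MagicMock()
mock_client.create.return_value = {
    "choices": [{"message": {"content": "mocked reply"}}]
}
assert run_prompt(mock_client, "hello") == "mocked reply"
mock_client.create.assert_called_once()
```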
I just noticed Click already has functionality for app directories, which I can use instead of the extra platformdirs dependency: https://click.palletsprojects.com/en/8.1.x/utils/#finding-application-folders
cfg = os.path.join(click.get_app_dir(APP_NAME), 'config.ini')
It looks like I can pass this io.datasette.llm:
>>> import click
>>> click.get_app_dir("hello")
'/Users/simon/Library/Application Support/hello'
>>> click.get_app_dir("io.datasette.llm")
'/Users/simon/Library/Application Support/io.datasette.llm'
Some thoughts on that migration:
Should I have the tool perform a one-off migration when you upgrade it, to move ~/.llm/log.db to the new location? I think not - instead, I'll have llm init-db take an optional argument for starting the database by copying an existing one, then mention that upgrade path in the release notes.
Originally posted by @simonw in #7 (comment)
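The opt-in copy step described above might behave something like this (the function shape and argument name are assumptions, not the real implementation):

```python
import shutil
from pathlib import Path
from typing import Optional

def init_db(app_dir: Path, copy_from: Optional[Path] = None) -> Path:
    """Create log.db in the app directory, optionally seeded from an old database."""
    app_dir.mkdir(parents=True, exist_ok=True)
    db_path = app_dir / "log.db"
    if copy_from is not None and copy_from.exists():
        shutil.copy(copy_from, db_path)  # one-off, user-initiated upgrade
    elif not db_path.exists():
        db_path.touch()  # start with a fresh, empty database
    return db_path
```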
I want to make a few schema changes in time for 0.4:
- Add an id integer column so I don't have to remember to select rowid
- chat_id should be an integer, and a foreign key to id
- Drop provider and just use the model column
One thing I'm torn on right now: should I keep the system prompt as a separate column, even though most models other than OpenAI's don't have that as a concept?
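Putting those changes together, the 0.4 table could look roughly like this (column names beyond the ones mentioned in the issue are assumptions):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE log (
    id INTEGER PRIMARY KEY,             -- explicit id instead of bare rowid
    model TEXT,                         -- replaces the separate provider column
    prompt TEXT,
    system TEXT,                        -- undecided: an OpenAI-specific concept
    response TEXT,
    chat_id INTEGER REFERENCES log(id)  -- integer foreign key to id
);
""")
```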
Count tokens with tiktoken and switch to the 16k or 32k models if necessary.
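The switching logic might look something like this (the thresholds and fallback model names here are assumptions; the real count would come from tiktoken's encoding_for_model()):

```python
def pick_model(token_count: int) -> str:
    # Fall back to larger-context models as the prompt grows; in practice
    # token_count = len(tiktoken.encoding_for_model(model).encode(text))
    if token_count <= 4_000:
        return "gpt-3.5-turbo"
    if token_count <= 16_000:
        return "gpt-3.5-turbo-16k"
    return "gpt-4-32k"
```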
It will still be pip install llm and llm prompt ... but I want to call it LLM when I write about it.
Refs:
In order to find the chat ID that I should use to continue a conversation, llm logs needs to include the rowid in the output.
Since prompt responses can be really long, it would be useful to provide an optional -t/--truncate option for truncating them.
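The truncation itself is simple; a sketch of the helper (the default width is an assumption):

```python
def truncate(text: str, length: int = 100) -> str:
    # Trim long logged responses for display, marking the cut with "..."
    if len(text) <= length:
        return text
    return text[: length - 3] + "..."
```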
Options for an interactive chat mode:
- It's a new command - llm chat -m gpt-4 for example. This feels a bit odd since the current default command is actually llm chatgpt ... and llm chat feels confusing.
- It's part of the default command: llm --chat -4 starts one running.
Maybe the llm chatgpt command is mis-named, especially since it can be used to work with GPT-4. I named it llm chatgpt because I thought I'd have a separate command for bard and for llama and so on, and because I thought the other OpenAI completion APIs (the non-chat ones, like GPT-3) might end up with a separate command.
Originally posted by @simonw in #6 (comment)
Use cog for this.
Lists available models.
The XDG base directory spec has a directory that would be perfect for this tool's database: ~/.local/share/llm.
There are two big advantages to using XDG_DATA_HOME instead of ~/.llm:
XDG_DATA_HOME
Here's how I implement it in my similar tool, for reference - use XDG_DATA_HOME if present, otherwise default to ~/.local/share/<app_name>
A command that launches a web server to allow you to both browse your logs and interact with APIs through your browser.
Do this just before the 0.4 release which will make the /stable/ pages start working.
If any templates have newlines the output gets very confusing:
% llm templates list
bad : this is bad
joke : Tell a really funny and short joke, surprise me
long : This is a really long prompt. It's long long long. This is a really long prompt. It's long long long. This is a really long prompt. It's long long long. This is a really long prompt....
recipe : Suggest a recipe using ingredients: $ingredients
It should be based on cuisine from this country: $country
roast :
steampunk : Summarize the following text.
Insert frequent satirical steampunk-themed illustrative anecdotes. Really go wild with that.
Text to summarize: $input
summarize :
summary : Summarize this: $input
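One way to fix that listing (a sketch, not necessarily what llm ended up doing): collapse all whitespace, including newlines, to single spaces before truncating each template for display.

```python
def display_template(text: str, width: int = 60) -> str:
    # str.split() with no argument splits on any whitespace run, so newlines
    # inside a template collapse into single spaces for the one-line listing
    one_line = " ".join(text.split())
    if len(one_line) <= width:
        return one_line
    return one_line[: width - 3] + "..."
```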
To ensure plugins are installed in the correct virtual environment, similar to datasette install and sqlite-utils install.
Here's where I did it for sqlite-utils:
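A sketch of that pattern (following sqlite-utils, not llm's actual code): run pip via sys.executable so plugins land in the same environment as llm itself. The package name below is hypothetical.

```python
import subprocess
import sys

def pip_args(packages):
    # Build the command: the running interpreter's own pip, never a global one
    return [sys.executable, "-m", "pip", "install"] + list(packages)

def install(packages):
    subprocess.run(pip_args(packages), check=True)

# e.g. install(["llm-hello-world"])  # hypothetical plugin package
```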
Needed by plugins:
Right now I'm just recording the model that was requested, e.g. gpt-3.5-turbo in the model column.
But... it turns out the response from OpenAI includes this - "model": "gpt-3.5-turbo-0301" - and there are meaningful differences between those model versions, e.g. the latest is gpt-3.5-turbo-0613 but you have to opt into it.
I'd like to record the model that was actually used. Not sure how best to put this in the schema though, since it may only make sense for OpenAI models.
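Extracting the value is trivial; the open question is the schema. A sketch of the extraction (the function name is illustrative):

```python
def resolved_model(response: dict, requested: str) -> str:
    # Prefer the model OpenAI reports actually handling the request,
    # e.g. "gpt-3.5-turbo-0301"; fall back to what was asked for, since
    # non-OpenAI models may not report this at all.
    return response.get("model", requested)
```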
This can be handled using the new templates feature instead:
Having to write templates in YAML is a bit nasty.
How about this as an option?
llm 'Summarize this: $input' --system "You are GlaDOS" -m gpt4 --save glados
cat myfile.py | llm --system 'Explain this code'
It would be interesting to add a built-in Markdown output preview.
For now I'm using it with glow (https://github.com/charmbracelet/glow) to do the same.
The ChatGPT endpoints only work for chatting if you manually send back your previous questions and responses:
https://til.simonwillison.net/gpt3/chatgpt-api
This tool could help with that, maybe through an llm chat command?
UPDATE: Or a -c/--continue option for continuing the most recent conversation.
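Since the API is stateless, continuing a conversation means replaying the prior turns from the log. A sketch of assembling that messages list (names are illustrative, not llm's internals):

```python
def build_messages(history, new_prompt, system=None):
    # history is a list of (prompt, response) pairs pulled from the SQLite log;
    # the ChatGPT API needs every prior turn resent on each request
    messages = []
    if system:
        messages.append({"role": "system", "content": system})
    for prompt, response in history:
        messages.append({"role": "user", "content": prompt})
        messages.append({"role": "assistant", "content": response})
    messages.append({"role": "user", "content": new_prompt})
    return messages
```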
I'm not sure this is actually an issue, as I've developed a workaround, but I thought it was worth bringing up for discussion.
I prefer to keep keys like this in my password manager. Among other things, it allows secure access and consistent sync across machines. I already had a function to access the key, but I don't want to call it on every new shell session as it pops up a prompt in my password manager. I'd prefer to only do that when using the tool, and only the first time in the shell session.
So, I wrote a wrapper function to do that:
llm() {
    if [ -z "$OPENAI_API_KEY" ]; then
        export OPENAI_API_KEY="$(open_ai_key)"
    fi
    command llm "$@"
}
I use zsh on macOS.
I'm not aware of other patterns that llm could use to look for a key, and you already provide two reasonable ones... but if there was a third way that would obviate the need for my little wrapper, that would be cool! Otherwise, maybe someone else finds this helpful.
Would love to see gpt-3.5-turbo-16k support, along with the 0613 models.
would be nice
Enables this kind of pattern:
curl -s 'https://simonwillison.net/2023/May/15/per-interpreter-gils/' | \
llm --system 'A hilarious joke about this post' --stream
Output just now:
Why did the Python developer compile Python themselves just to test the Per-Interpreter GIL feature?
Because they wanted to thread the needle!
The best way to do this will be with Datasette or sqlite-utils, but it would be neat to have a basic history command built into llm itself.
Following:
% llm templates list
bad : this is bad
joke : Tell a really funny and short joke, surprise me
long : This is a really long prompt. It's long long long. This is a really long prompt. It's long long long. This is a really long prompt....
recipe : Suggest a recipe using ingredients: $ingredients It should be based on cuisine from this country: $country
roast :
steampunk : Summarize the following text. Insert frequent satirical steampunk-themed illustrative anecdotes. Really go wild with that. Text to ...
summarize :
summary : Summarize this: $input
I want to make streaming mode the default - I'm fed up of forgetting to add -s to everything. I don't see any harm in it as a default; people can turn it off with --no-stream if they really want to.
Originally posted by @simonw in #17 (comment)
I'm not going to implement the same one-off plugin mechanism as Datasette, so I'll have to instead teach people how to develop package plugins locally with pip install -e and show them how to create wheel files they can install elsewhere in case they don't want to push packages to PyPI.
Since this is a CLI tool it's nice to be able to > file.py to save generated Python code.
Problem: it usually comes wrapped in triple backticks.
Solution: a --code option which sets a system prompt to avoid that happening.
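A system prompt alone may not be reliable, so a belt-and-braces helper could strip a wrapping fence from the output anyway (a sketch, not the actual implementation):

```python
import re

def strip_fences(text: str) -> str:
    # Remove one wrapping ```lang ... ``` fence if the model added it anyway
    match = re.match(r"^```[\w+-]*\n(.*?)\n?```\s*$", text, re.DOTALL)
    return match.group(1) if match else text
```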
Initially this tool will let you run things against ChatGPT and GPT-4 from the command-line.
Over time I want to introduce Pluggy plugins to allow you to hook it up to all sorts of other language models, including ones that run locally.
But for starters it will do this:
llm "Prompt goes here"
And a streaming variant:
llm "Ten ideas for cheesecakes" -s
Plus use -4 to run against GPT-4, or --model X to specify another model.
For consistency with the llm logs command.
Idea came from here:
Or... maybe it takes an llm argument which is similar to datasette in that it's an object offering a documented API for various useful things, like looking up configuration settings and loading templates and suchlike.

@hookspec
def register_models(llm):
    """Return a list of Models"""
Originally posted by @simonw in #53 (comment)
Made me realize that this tool could work like sqlite-utils in that most features could be available both as Python API methods and as CLI commands.
As seen on https://sqlite-utils.datasette.io/en/stable/cli.html#running-sql-queries - use sphinx-copybutton.