gemini-openai-proxy's Introduction

Gemini-OpenAI-Proxy

Gemini-OpenAI-Proxy is a proxy designed to convert the OpenAI API protocol to the Google Gemini Pro protocol. This enables seamless integration of OpenAI-powered functionalities into applications using the Gemini Pro protocol.



Build

To build the Gemini-OpenAI-Proxy, follow these steps:

go build -o gemini main.go
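
Once built, you can run the resulting binary directly; a minimal sketch, assuming the server listens on port 8080 by default, as the Docker example below suggests:

./gemini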

Deploy

We recommend deploying Gemini-OpenAI-Proxy using Docker for a straightforward setup. Follow these steps to deploy with Docker:

docker run --restart=always -it -d -p 8080:8080 --name gemini zhu327/gemini-openai-proxy:latest

Adjust the port mapping (e.g., -p 8080:8080) as needed, and ensure that the Docker image version (zhu327/gemini-openai-proxy:latest) aligns with your requirements.
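
For example, to expose the proxy on host port 8081 while keeping the container's internal port (assumed here to be 8080, as in the default command above):

docker run --restart=always -it -d -p 8081:8080 --name gemini zhu327/gemini-openai-proxy:latest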


Usage

Gemini-OpenAI-Proxy offers a straightforward way to integrate OpenAI functionalities into any application that supports custom OpenAI API endpoints. Follow these steps to leverage the capabilities of this proxy:

  1. Set Up OpenAI Endpoint: Ensure your application is configured to use a custom OpenAI API endpoint. Gemini-OpenAI-Proxy seamlessly works with any OpenAI-compatible endpoint.

  2. Get Google AI Studio API Key: Before using the proxy, you'll need to obtain an API key from ai.google.dev. Treat this API key as your OpenAI API key when interacting with Gemini-OpenAI-Proxy.

  3. Integrate the Proxy into Your Application: Modify your application's API requests to target the Gemini-OpenAI-Proxy, providing the acquired Google AI Studio API key as if it were your OpenAI API key.

    Example API Request (Assuming the proxy is hosted at http://localhost:8080):

    curl http://localhost:8080/v1/chat/completions \
     -H "Content-Type: application/json" \
     -H "Authorization: Bearer $YOUR_GOOGLE_AI_STUDIO_API_KEY" \
     -d '{
         "model": "gpt-3.5-turbo",
         "messages": [{"role": "user", "content": "Say this is a test!"}],
         "temperature": 0.7
     }'
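
    Streaming output is also mentioned in the Known Issues section below; a hedged example, assuming the proxy honors the standard OpenAI stream flag:

    curl http://localhost:8080/v1/chat/completions \
     -H "Content-Type: application/json" \
     -H "Authorization: Bearer $YOUR_GOOGLE_AI_STUDIO_API_KEY" \
     -d '{
         "model": "gpt-3.5-turbo",
         "messages": [{"role": "user", "content": "Say this is a test!"}],
         "stream": true,
         "temperature": 0.7
     }'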

    Alternatively, use Gemini Pro Vision:

    curl http://localhost:8080/v1/chat/completions \
     -H "Content-Type: application/json" \
     -H "Authorization: Bearer $YOUR_GOOGLE_AI_STUDIO_API_KEY" \
     -d '{
         "model": "gpt-4-vision-preview",
         "messages": [{"role": "user", "content": [
            {"type": "text", "text": "What’s in this image?"},
            {
              "type": "image_url",
              "image_url": {
                "url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg"
              }
            }
         ]}],
         "temperature": 0.7
     }'

    If you already have access to the Gemini 1.5 Pro API, you can use:

    curl http://localhost:8080/v1/chat/completions \
     -H "Content-Type: application/json" \
     -H "Authorization: Bearer $YOUR_GOOGLE_AI_STUDIO_API_KEY" \
     -d '{
         "model": "gpt-4-turbo-preview",
         "messages": [{"role": "user", "content": "Say this is a test!"}],
         "temperature": 0.7
     }'

    Model Mapping:

    GPT Model               Gemini Model
    gpt-3.5-turbo           gemini-1.0-pro-latest
    gpt-4                   gemini-1.5-flash-latest
    gpt-4-turbo-preview     gemini-1.5-pro-latest
    gpt-4-vision-preview    gemini-1.0-pro-vision-latest

    If you wish to map gpt-4-vision-preview to gemini-1.5-pro-latest, you can set the environment variable GPT_4_VISION_PREVIEW=gemini-1.5-pro-latest, since gemini-1.5-pro-latest now also supports multi-modal data. An example Docker invocation is shown below.
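
    A minimal sketch of setting this mapping when deploying with Docker (assuming the variable is read from the environment at startup, as described above):

    docker run --restart=always -it -d -p 8080:8080 -e GPT_4_VISION_PREVIEW=gemini-1.5-pro-latest --name gemini zhu327/gemini-openai-proxy:latest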

  4. Handle Responses: Process the responses from the Gemini-OpenAI-Proxy in the same way you would handle responses from OpenAI.
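
    Illustrative response shape (the fields follow the standard OpenAI chat-completion schema; exact values, IDs, and token counts will differ, and some fields may vary):

    {
        "id": "chatcmpl-xxx",
        "object": "chat.completion",
        "created": 1700000000,
        "model": "gpt-3.5-turbo",
        "choices": [
            {
                "index": 0,
                "message": {"role": "assistant", "content": "This is a test!"},
                "finish_reason": "stop"
            }
        ],
        "usage": {"prompt_tokens": 5, "completion_tokens": 5, "total_tokens": 10}
    }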

Now, your application is equipped to leverage OpenAI functionality through the Gemini-OpenAI-Proxy, bridging the gap between OpenAI and applications using the Google Gemini Pro protocol.

Compatibility

See the "[Note] Client Compatibility List and Known Issues" entry in the Issues section below for the current client compatibility list and known issues.

License

Gemini-OpenAI-Proxy is licensed under the MIT License - see the LICENSE file for details.

gemini-openai-proxy's People

Contributors

zhu327


gemini-openai-proxy's Issues

to become the best proxy for gemini

Could you add an embeddings endpoint compatible with OpenAI? There is a Go proxy for that: https://github.com/cheahjs/gemini-to-openai-proxy. An option to send chat history would also be useful.

Example JS when using LangChain or Redis chat memory:

const {
  GoogleGenerativeAI,
  HarmCategory,
  HarmBlockThreshold,
} = require("@google/generative-ai");

const apiKey = process.env.GEMINI_API_KEY;
const genAI = new GoogleGenerativeAI(apiKey);

const model = genAI.getGenerativeModel({
  model: "gemini-1.5-flash",
  systemInstruction: "",
});

const generationConfig = {
  temperature: 1,
  topP: 0.95,
  topK: 64,
  maxOutputTokens: 8192,
  responseMimeType: "text/plain",
};

async function run() {
  const chatSession = model.startChat({
    generationConfig,
    // safetySettings: Adjust safety settings
    // See https://ai.google.dev/gemini-api/docs/safety-settings
    history: [
      {
        role: "user",
        parts: [
          {text: "user history"},
        ],
      },
      {
        role: "model",
        parts: [
          {text: "model history "},
        ],
      },
    ],
  });

  const result = await chatSession.sendMessage("INSERT_INPUT_HERE");
  console.log(result.response.text());
}

run();

pro vision cannot use text

I just found out that the update supports vision, but after using it I found that vision with plain text alone is not supported; the request must be accompanied by an image.

Function support?

Not sure if Gemini itself even has function-calling support, but it would be nice, as I'd like to use this for my Home Assistant pipeline.

Pythagora open ai API: 'choices' issue

Pythagora

Implementing task #1: Set up a Node.js project with package.json, install all necessary dependencies, and set up an express server.

I have been using this successfully with GPT Pilot. During code generation, there was a problem with the request to the OpenAI API:
'choices'

Kindly help in this regard.

The latest Docker image has a bug

The first turn of a conversation works fine, but a follow-up request that includes the conversation context returns: "code":400,"message":"message.stringContent: json.Unmarshal: json: cannot unmarshal array into Go value of type string","type":""
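
This error presumably occurs when the client sends the OpenAI content field as an array of content parts rather than a plain string, which the proxy appears to expect here; an illustrative, hypothetical request body of that shape:

{
  "model": "gpt-3.5-turbo",
  "messages": [
    {"role": "user", "content": "first question"},
    {"role": "assistant", "content": "first answer"},
    {"role": "user", "content": [{"type": "text", "text": "follow-up question"}]}
  ]
}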

openai api : InternalServerError: Error code: 502

It works with curl:

OPENAI_API_KEY="xxx" # Google AI Studio API key
curl http://localhost:8080/v1/chat/completions   -H "Content-Type: application/json"   -H "Authorization: Bearer $OPENAI_API_KEY"   -d '{
    "model": "gpt-3.5-turbo",
    "messages": [
      {
        "role": "system",
        "content": "You are a poetic assistant, skilled in explaining complex programming concepts with creative flair."
      },
      {
        "role": "user",
        "content": "Compose a poem that explains the concept of recursion in programming."
      }
    ]
  }'

but fails from Python:

from openai import OpenAI
client = OpenAI(api_key='xxx'
                , base_url='http://localhost:8080/v1/')

completion = client.chat.completions.create(
  model="gpt-3.5-turbo",
  messages=[
    {"role": "system", "content": "You are a poetic assistant, skilled in explaining complex programming concepts with creative flair."},
    {"role": "user", "content": "Compose a poem that explains the concept of recursion in programming."}
  ]
)

print(completion.choices[0].message)

The logs:

File ~/anaconda3/envs/agents/lib/python3.9/site-packages/openai/_base_client.py:930, in SyncAPIClient._request(self, cast_to, options, remaining_retries, stream, stream_cls)
    927     if not err.response.is_closed:
    928         err.response.read()
--> 930     raise self._make_status_error_from_response(err.response) from None
    932 return self._process_response(
    933     cast_to=cast_to,
    934     options=options,
   (...)
    937     stream_cls=stream_cls,
    938 )

InternalServerError: Error code: 502

Error 429

Hello, I found that without making any modifications to the program, every call returns error 400, with a 429 error inside the message.
But when I call the Gemini Pro API directly with the Gemini Pro data format, it works fine.
I originally thought the key was incorrect, but I intentionally changed the key to a wrong one and got a different Error 400 message.

The following is the request data and the response I received.

{
  "stream":false,
  "model":"gpt-3.5-turbo",
  "messages":[
  {
    "role":"user",
    "content":"HELLO"
  }]
}
{
  "code": 400,
  "message": "genai send message error: googleapi: Error 429:",
  "type": ""
}

Please modify to streaming reply mode: model.GenerateContentStream

package main

import (
    "context"
    "fmt"
    "log"
    "os"

    "github.com/google/generative-ai-go/genai"
    "google.golang.org/api/iterator"
    "google.golang.org/api/option"
)

func main() {
    ctx := context.Background()
    client, err := genai.NewClient(ctx, option.WithAPIKey(os.Getenv("API_KEY")))
    if err != nil {
        log.Fatal(err)
    }
    defer client.Close()

    model := client.GenerativeModel("gemini-pro")

    // Stream the response instead of waiting for the full completion.
    iter := model.GenerateContentStream(ctx, genai.Text("Write a story about a magic backpack."))
    for {
        resp, err := iter.Next()
        if err == iterator.Done {
            break
        }
        if err != nil {
            log.Fatal(err)
        }
        fmt.Println(resp) // print each streamed chunk
    }
}

400 error code

curl http://localhost:8080/v1/chat/completions \
 -H "Content-Type: application/json" \
 -H "Authorization: Bearer xxxxx" \
 -d '{
     "model": "gpt-4-vision-preview",
     "messages": [{"role": "user", "content": [
        {"type": "text", "text": "You are a zoological expert who knows what animal it is and what it is thinking"},
        {
          "type": "image_url",
          "image_url": {
            "url": "https://www.wikiwand.com/zh-hans/File:Cat_poster_2.jpg"
          }
        }
     ]}],
     "temperature": 0.7
 }'

{"code":400,"message":"genai send message error: googleapi: Error 400: Request contains an invalid argument.","type":""}%

gpt-engineer

Trying to use gemini-openai-proxy with gpt-engineer.
I invoke gpt-engineer as follows: OPENAI_API_KEY=1234 OPENAI_API_BASE=http://localhost:8080/v1 gpt-engineer Tic-tac-toe gpt-3.5-turbo, and get the following error on the proxy side:

2023/12/31 07:50:03 genai get stream message error googleapi: Error 400:
[GIN] 2023/12/31 - 07:50:03 | 200 |  740.287576ms |             ::1 | POST     "/v1/chat/completions"

Not sure if the problem is on the proxy side or the gpt-engineer side, though.

[Note] Client Compatibility List and Known Issues

Perfectly Compatible:

Known Issues:

  • There is a bug in the Google Gemini Pro SDK, which may lead to unexpected interruptions during streaming output.
  • If your prompt contains keywords like "openai," "chatgpt," "gemini," etc., Gemini Pro might refuse to respond.

Feel free to contribute to the compatibility list, allowing us to continuously enhance the user experience of our project. Thank you for your support and feedback!

Replies sometimes contain \n

The Gemini Pro responses will sometimes contain a literal \n instead of an actual line break. I suspect it's similar to the previous issue of quotation marks getting printed.

proxy error: curl: (52) Empty reply from server

$ curl http://127.0.0.1:8081/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer $API_KEY" -d '{
"model": "gpt-4",
"messages": [{"role": "user", "content": "Say this is a test!"}],
"temperature": 0.7
}'
curl: (52) Empty reply from server

Here's the docker:
$ docker ps -a
CONTAINER ID   IMAGE                               COMMAND         CREATED         STATUS         PORTS                    NAMES
9a49c5c0b3e4   zhu327/gemini-openai-proxy:latest   "/app/gemini"   3 minutes ago   Up 3 minutes   0.0.0.0:8081->8081/tcp   gemini

The return value is not as expected

  1. When making a call using curl, I receive an empty object as the return value.

  2. Using the "openai" npm package results in the same issue as with the curl call.

  3. However, using other clients (such as OpenCat / ChatX) allows for successful proxying.

Chinese Location

Hello,

I am reaching out on behalf of several users from the China region who have deployed the gemini-openai-proxy Docker image but are facing consistent access timeout issues. The error being encountered frequently is a connection reset during API interactions.

Given that there might be unique considerations for accessing services like these from China, could you provide guidance or recommendations on how to ensure reliable access? Are there specific configurations, additional proxy setups, or any other steps that we should consider to address this timeout problem?

Any help or suggestions you could offer would be greatly appreciated.

Thank you for your assistance.

Best regards,
Jack

autogen getting 400 from proxy

Chat completions work fine from Postman, so the proxy itself is working.

AutoGen, however, is getting a 400 error.

from autogen import AssistantAgent, UserProxyAgent

llm_config_gemini = {
    "config_list": [
        {
            "api_key": "***"
            "base_url": "http://localhost:8080/v1",
        }
    ]
}

assistant = AssistantAgent("assistant", llm_config_gemini)
user_proxy = UserProxyAgent("user_proxy", human_input_mode="TERMINATE", code_execution_config={"work_dir": "coding", "use_docker": False})

user_proxy.initiate_chat(assistant, message="Plot a chart of top performingi  blue chip stock price change YTD use dark mode")

Error:

Traceback (most recent call last):
  File "c:\Dev\ai\autogen\gemini\main..py", line 21, in <module>
    user_proxy.initiate_chat(assistant, message="Plot a chart of top performingi  blue chip stock price change YTD use dark mode")
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\autogen\agentchat\conversable_agent.py", line 550, in initiate_chat
    self.send(self.generate_init_message(**context), recipient, silent=silent)
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\autogen\agentchat\conversable_agent.py", line 348, in send
    recipient.receive(message, self, request_reply, silent)
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\autogen\agentchat\conversable_agent.py", line 481, in receive
    reply = self.generate_reply(messages=self.chat_messages[sender], sender=sender)
            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\autogen\agentchat\conversable_agent.py", line 906, in generate_reply
    final, reply = reply_func(self, messages=messages, sender=sender, config=reply_func_tuple["config"])
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\autogen\agentchat\conversable_agent.py", line 625, in generate_oai_reply
    response = client.create(
               ^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\autogen\oai\client.py", line 247, in create
    response = self._completions_create(client, params)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\autogen\oai\client.py", line 327, in _completions_create
    response = completions.create(**params)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\openai\_utils\_utils.py", line 272, in wrapper
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\openai\resources\chat\completions.py", line 645, in create
    return self._post(
           ^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\openai\_base_client.py", line 1088, in post
    return cast(ResponseT, self.request(cast_to, opts, stream=stream, stream_cls=stream_cls))
                           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\openai\_base_client.py", line 853, in request
    return self._request(
           ^^^^^^^^^^^^^^
  File "C:\Users\david\miniconda3\envs\pytorch\Lib\site-packages\openai\_base_client.py", line 930, in _request
    raise self._make_status_error_from_response(err.response) from None
openai.BadRequestError: Error code: 400 - {'error': {'message': "'$.messages[0].content' is invalid. Please check the API reference: https://platform.openai.com/docs/api-reference.", 'type': 'invalid_request_error', 'param': None, 'code': None}}

Vercel

Please create a deployment script for Vercel.

[Possible bug] Error when requesting gpt-4-vision-preview

{
  "error": {
    "headers": {
      "alt-svc": "h3=\":443\"; ma=86400",
      "cf-cache-status": "DYNAMIC",
      "cf-ray": "8759f2923a5d369e-YYZ",
      "connection": "keep-alive",
      "content-length": "131",
      "content-type": "application/json; charset=utf-8",
      "date": "Wed, 17 Apr 2024 05:17:14 GMT",
      "nel": "{\"success_fraction\":0,\"report_to\":\"cf-nel\",\"max_age\":604800}",
      "report-to": "{\"endpoints\":[{\"url\":\"https:\\/\\/a.nel.cloudflare.com\\/report\\/v4?s=fV7mNNt1AvY1ZIWrL3ayQ0vCV7tjzAs0nA4pJMJCpR0wDIFcL%2BmQ7N2M4CLsOzXZy%2BRa3duFEtAgXdQYsH0uRE04%2BOM883x86W6Q2A325UVwvHc6mH150tmI7fFyl%2BNvDw8W2U0%3D\"}],\"group\":\"cf-nel\",\"max_age\":604800}",
      "server": "cloudflare"
    },
    "stack": "Error: 400 status code (no body)\n    at eP.generate (/app/.next/server/edge-chunks/316.js:4:1718)\n    at s_.makeStatusError (/app/.next/server/edge-chunks/316.js:4:14205)\n    at s_.makeRequest (/app/.next/server/edge-chunks/316.js:4:15128)\n    at process.processTicksAndRejections (node:internal/process/task_queues:95:5)\n    at async Object.chat (/app/.next/server/edge-chunks/708.js:1:2328)\n    at async /app/.next/server/app/api/chat/[provider]/route.js:1:1500\n    at async /app/.next/server/edge-chunks/369.js:6:64203\n    at async O.execute (/app/.next/server/edge-chunks/369.js:6:61096)\n    at async O.handle (/app/.next/server/edge-chunks/369.js:6:65470)\n    at async ey.handler (/app/.next/server/edge-chunks/369.js:7:31644)",
    "status": 400
  },
  "endpoint": "https://gemini.nvoid.***.ua/v1",
  "provider": "openai"
}

The requesting application is LobeChat.
The service was deployed and tested without problems.
I have added an environment variable so that vision points to gemini-1.5-pro.
Normal requests to 1.5-pro work fine.
