Comments (5)
It seems you didn't do the "Run server:" step, i.e.:
Run server:
```bash
pip install gradio==4.17.0
python -m llava.serve.gradio_web_server --controller http://localhost:$server_port --model-list-mode once
```
That is, h2oGPT talks to a custom llava web server with an exposed API, which talks to the controller, which in turn talks to the workers.
Will close for now, feel free to ask new questions.
from h2ogpt.
Oh, ok, I will try that! Thank you for such a fast response!
This absolutely worked! If anybody runs into this issue in the future, in a new terminal, I ran:
pip3 install gradio==4.17.0
python -m llava.serve.gradio_web_server --controller http://localhost:8080 --model-list-mode once
The gradio_web_server ran on local URL http://0.0.0.0:7860, so the final command became:
python generate.py --score_model=None --llava_model=http://0.0.0.0:7860 --base_model=liuhaotian/llava-v1.6-vicuna-13b --inference_server=http://0.0.0.0:7860 --prompt_type=plain
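For anyone scripting this, here is a minimal sketch (not part of h2oGPT) that assembles the same command from the URL the llava gradio server reports. The helper name `build_generate_command` is hypothetical; the flags are only the ones used in the command above. It strips stray whitespace from the URL, since a space after `--llava_model=` would split the argument in the shell.

```python
import shlex
from typing import List


def build_generate_command(
    llava_url: str,
    base_model: str = "liuhaotian/llava-v1.6-vicuna-13b",
) -> List[str]:
    """Build the generate.py argument list used in this thread.

    Hypothetical helper: it only mirrors the flags from the command above.
    """
    url = llava_url.strip()  # avoid "--llava_model= http://..." with a stray space
    return [
        "python",
        "generate.py",
        "--score_model=None",
        f"--llava_model={url}",
        f"--base_model={base_model}",
        f"--inference_server={url}",
        "--prompt_type=plain",
    ]


if __name__ == "__main__":
    # Print a copy-pasteable shell command; the URL is the one from this thread.
    print(shlex.join(build_generate_command("http://0.0.0.0:7860")))
```

The list form can also be passed directly to `subprocess.run` without going through a shell.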
Take note of other things in that FAQ section, e.g.:
When launching LLaVa, if you want the server and worker to work with a remote gradio, then replace `localhost` with the IP of the server.
Also yes, by default the llava gradio server runs on port 7860. If that's an issue because h2oGPT also runs on 7860, then just change one of them. I'll add a note to the FAQ.
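To see whether the two servers would collide before launching either, a quick check like the following may help. This is only a sketch using the Python standard library: `port_is_free` is a hypothetical helper, and it tests the local machine by trying to bind the port, so it does not tell you anything about a remote host.

```python
import socket


def port_is_free(port: int, host: str = "127.0.0.1") -> bool:
    """Return True if this process can bind host:port right now."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        try:
            sock.bind((host, port))
        except OSError:
            # Typically "address already in use": another server holds the port.
            return False
        return True


if __name__ == "__main__":
    # 7860 is the default gradio port mentioned above; 8000 is the alternative
    # used later in this thread.
    for candidate in (7860, 8000):
        state = "free" if port_is_free(candidate) else "in use"
        print(f"port {candidate}: {state}")
```

If 7860 shows as in use, start the second server on a different port.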
I changed the h2oGPT port to 8000 with:
export GRADIO_SERVER_PORT=8000
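If you launch h2oGPT from a Python script rather than a shell, the same variable can be set on a copy of the environment instead of exporting it globally. A rough sketch, assuming you start `generate.py` yourself via `subprocess`; `env_with_gradio_port` is a hypothetical helper name:

```python
import os
from typing import Dict, Optional


def env_with_gradio_port(port: int, base_env: Optional[Dict[str, str]] = None) -> Dict[str, str]:
    """Return a copy of the environment with GRADIO_SERVER_PORT set.

    The caller's environment (or base_env) is left unmodified.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["GRADIO_SERVER_PORT"] = str(port)
    return env


# Example use (not executed here):
#   import subprocess, sys
#   subprocess.run([sys.executable, "generate.py", ...], env=env_with_gradio_port(8000))
```

This keeps the llava gradio server on its default 7860 while only the h2oGPT process sees port 8000.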