Comments (6)

robertgshaw2-neuralmagic commented on September 23, 2024

Embedding models will be supported in v0.4.3.

Until then, install from source to use them.

Fanb1ing commented on September 23, 2024

Thank you very much. I will try installing the new version.

Loreley99 commented on September 23, 2024

Hello, I would like to ask why this error occurs during use:

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
in <cell line: 1>()
----> 1 outputs = model.encode(prompts)

10 frames
/content/vllm/vllm/model_executor/sampling_metadata.py in _prepare_seq_groups(seq_group_metadata_list, seq_lens, query_lens, device)
    206
    207         if seq_group_metadata.is_prompt:
--> 208             if sampling_params.seed is not None:
    209                 seq_group_metadata.state.generator = torch.Generator(
    210                     device=device).manual_seed(sampling_params.seed)
AttributeError: 'NoneType' object has no attribute 'seed'

The model I'm using is llama3-8b, thanks!

robertgshaw2-neuralmagic commented on September 23, 2024

If you're running the generic Llama-3-8b model, you are running with a generation model rather than an embedding model. I should update the encode API to fail more gracefully if called in this configuration.
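To illustrate the failure mode, here is a minimal sketch that does not use vLLM at all. Judging from the traceback, the code at line 208 reads `sampling_params.seed` while `sampling_params` is `None`, which is what produces the `AttributeError`. The class and function names below (`SamplingParams` as a stand-in, `resolve_seed_unchecked`, `resolve_seed_checked`) are hypothetical and only show what "failing more gracefully" could look like; they are not the actual vLLM code or the eventual fix.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SamplingParams:
    """Hypothetical stand-in for the sampling settings of a generation request."""
    seed: Optional[int] = None


def resolve_seed_unchecked(sampling_params: Optional[SamplingParams]) -> Optional[int]:
    # Same pattern as the line in the traceback: the attribute is read
    # without first checking whether sampling_params is None.
    return sampling_params.seed


def resolve_seed_checked(sampling_params: Optional[SamplingParams]) -> Optional[int]:
    # Assumed "graceful" variant: explain the likely cause instead of
    # letting an AttributeError surface from deep inside the call stack.
    if sampling_params is None:
        raise ValueError(
            "No sampling parameters were provided. If encode() was called on "
            "a generation model, load an embedding model instead."
        )
    return sampling_params.seed


if __name__ == "__main__":
    print(resolve_seed_checked(SamplingParams(seed=7)))
    for resolve in (resolve_seed_unchecked, resolve_seed_checked):
        try:
            resolve(None)
        except (AttributeError, ValueError) as exc:
            print(f"{resolve.__name__}: {type(exc).__name__}: {exc}")
```

The unchecked variant reproduces the same kind of `AttributeError` as in the report, while the checked variant raises an error that points at the model type.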

If you want to use an embedding model, try:

Loreley99 commented on September 23, 2024

Thank you very much for your quick reply! May I ask whether vLLM has an API for extracting the hidden states of a generation model?

Fanb1ing commented on September 23, 2024

Getting the hidden states of a generation model is exactly what I am struggling with. Have you found any good solutions? All the tools I found target embedding models and do not support Llama 3.
