co-instruct's People

Contributors

teowu, zzc-1998

co-instruct's Issues

Training code release

Congratulations on your model; it seems like a great improvement over current models for visual comparison :)

Do you plan to release the training and/or fine-tuning code?

Thanks

A question about loading the model

Thank you so much for your contribution to the community!
I ran into a problem loading the model. When I tried to load it with the provided script:

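    import torch
    from transformers import AutoModelForCausalLM

    # Load the checkpoint with its custom modeling code onto the first GPU.
    model = AutoModelForCausalLM.from_pretrained("q-future/co-instruct",
                                                 trust_remote_code=True,
                                                 torch_dtype=torch.float16,
                                                 attn_implementation="eager",
                                                 device_map={"":"cuda:0"})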

I got:

    Traceback (most recent call last):
      File "./test_co-ins.py", line 17, in <module>
        device_map={"":"cuda:0"})
      File "$MY_PATH$/python3.7/site-packages/transformers/models/auto/auto_factory.py", line 476, in from_pretrained
        class_ref, pretrained_model_name_or_path, **hub_kwargs, **kwargs
      File "$MY_PATH$/python3.7/site-packages/transformers/dynamic_module_utils.py", line 443, in get_class_from_dynamic_module
        return get_class_in_module(class_name, final_module.replace(".py", ""))
      File "$MY_PATH$/python3.7/site-packages/transformers/dynamic_module_utils.py", line 164, in get_class_in_module
        module = importlib.import_module(module_path)
      File "$MY_PATH$/python3.7/importlib/__init__.py", line 127, in import_module
        return _bootstrap._gcd_import(name[level:], package, level)
      File "<frozen importlib._bootstrap>", line 1006, in _gcd_import
      File "<frozen importlib._bootstrap>", line 983, in _find_and_load
      File "<frozen importlib._bootstrap>", line 967, in _find_and_load_unlocked
      File "<frozen importlib._bootstrap>", line 677, in _load_unlocked
      File "<frozen importlib._bootstrap_external>", line 728, in exec_module
      File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed
      File "$MY_PATH$/.cache/huggingface/modules/transformers_modules/q-future/co-instruct/bc0349eee484e340fb1c1786f4d800bb183d3d43/modeling_mplug_owl2.py", line 35, in <module>
        from .modeling_llama2 import replace_llama_modality_adaptive
      File "$MY_PATH$/.cache/huggingface/modules/transformers_modules/q-future/co-instruct/bc0349eee484e340fb1c1786f4d800bb183d3d43/modeling_llama2.py", line 212, in <module>
        class LlamaFlashAttention2(LlamaAttention):
      File "$MY_PATH$/.cache/huggingface/modules/transformers_modules/q-future/co-instruct/bc0349eee484e340fb1c1786f4d800bb183d3d43/modeling_llama2.py", line 235, in LlamaFlashAttention2
        use_cache: bool = False,
    NameError: name 'Cache' is not defined

Does this mean that my package versions are incompatible? If so, could you please provide the model's requirements?
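
One way to narrow this down is to check the installed transformers version before loading the model. The sketch below is not from the repository, and it rests on an assumption: that the `Cache` class named in the NameError first shipped in transformers 4.36, so older installs (including anything installable on the Python 3.7 shown in the traceback paths) would not provide it. The helper names `parse_version`, `cache_likely_available`, and `installed_transformers_version` are hypothetical, and the repository may pin a different version.

```python
from importlib import metadata  # standard library, Python 3.8+

# Assumption: transformers gained the Cache class in release 4.36.
ASSUMED_MIN_TRANSFORMERS = (4, 36)


def parse_version(text):
    """Turn '4.36.2' or '4.37.0.dev0' into a tuple of leading integers."""
    parts = []
    for piece in text.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component, e.g. 'dev0'
        parts.append(int(digits))
    return tuple(parts)


def cache_likely_available(installed_version, minimum=ASSUMED_MIN_TRANSFORMERS):
    """Return True if the version is at or above the assumed minimum."""
    return parse_version(installed_version) >= minimum


def installed_transformers_version():
    """Return the installed transformers version string, or None if absent."""
    try:
        return metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_transformers_version()
    if version is None:
        print("transformers is not installed in this environment")
    elif cache_likely_available(version):
        print("transformers", version, "should provide Cache (under the stated assumption)")
    else:
        print("transformers", version, "is probably too old to provide Cache")
```

If the check reports an older version, upgrading transformers in an environment with a newer Python is a reasonable first thing to try, though the exact requirements remain for the maintainers to confirm.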

About the model's inconsistent answers

First of all, I appreciate your amazing work, which gave me an idea for my future research.
[image]
As in the figure above, I asked about the over-exposed and under-exposed areas of the image one after the other, and the model answered as I intended.
But when I asked as shown below, it gave me an inconsistent answer ('No over or under exposed part').
[image]
