Giter Club home page Giter Club logo

ranni's People

Contributors

alibaba-oss avatar thss15fyt avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

ranni's Issues

pip conflict when install the requirements during conda env creation

....
The conflict is caused by:
The user requested safetensors==0.2.7
transformers 4.30.2 depends on safetensors>=0.3.1

To fix this you could try to:

  1. loosen the range of package versions you've specified
  2. remove package versions to allow pip attempt to solve the dependency conflict

Continuous editing

Hi, thanks for sharing this amazing work.
I'm currently experimenting with continuous editing. In the readme, it is specified that we should be able to modify the boxes by modifying the prompt. Unfortunately, this not looks immediate to me, can you provide more explanation?

Also, do you have an ETA for the release of the more powerful model? Thanks in advance!

Inference GPU memory

Hi, I want to know, What is the minimum GPU memory required? I had inferenced the model in GPU RTX3090 24G, but it turned out of CUDA out of memory, what can I do to reduce the GPU memory ? Thanks.

Quantized Model Usage?

Is there a way to use quantized models? The current version is really out of reach for most people with consumer-grade GPUs.

Also, will you train / release models for SD1.5 and SDXL?

Thanks!

Image generation results are easily affected by random seed

I got good results using the recommended prompt and seed, but when I used the recommended prompt and random seed I got worse results.
Good Result:
input prompt: a black dog and a white cat
input seed: 15
a black dog and a white cat_1

Bad Result:
input prompt: 10 apples on the table
input seed: 15
10 apples on the table_1
Any solutions?

llama 13b version?

The paper mentioned you use llama-13B as the LLM model. But the provided version is 7B. Have you observed any performance drop because of the different LLM size. Will you plan to release the 13B version?

Example results

Hi, I tried to reproduce some of your examples and this is what I got:

Screenshot 2024-04-10 at 13 10 07

While I understand that errors may occur, it looks to me that something is broken in my installation, am I correct? I have loaded LLaMA in bfloat16, could it be caused by that in your opinion?

bug on initializing open-clip model

In the config, you disable the clip by setting the version to None, So the initializing the clip encounter the following error. How to fix this bug?

cond_stage_config:
target: ranni.ranni.HackedFrozenOpenCLIPEmbedder
params:
freeze: True
layer: "penultimate"
version: null # disable loading CLIP here

Screenshot 2024-04-08 at 10 50 00 PM

demo error

size mismatch for base_model.model.model.layers.31.self_attn.q_proj.lora_B.default.weight: copying a param with shape torch.Size([4096, 64]) from checkpoint, the shape in current model is torch.Size([5120, 64]).

I met error like this, how do I solve it ?

Why the box prompt not same as in paper.

In the paper, you give the system prompt with detailed illustration of coordinate system and the example. But in this codebase, you remove those contents in the box prompt. Is there any difference in these settings?

Screenshot 2024-04-09 at 10 02 03 PM

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.