Light

ali-vilab / ranni Goto Github PK

View Code? Open in Web Editor NEW

164.0 164.0 11.0 10.55 MB

License: Apache License 2.0

Python 100.00%

ranni's People

Contributors

Stargazers

Watchers

Forkers

fasvid bizai3000 sdbds jags111 andupotorac dichen-cd paperwave jinwook-shim kustomzone camenduru

ranni's Issues

Will there be 1.5 and XL versions?

this method is powerful, i hope you can release more code and checkpoint

this method is powerful, i hope you can release more code and checkpoint

pip conflict when install the requirements during conda env creation

....
The conflict is caused by:
The user requested safetensors==0.2.7
transformers 4.30.2 depends on safetensors>=0.3.1

To fix this you could try to:

loosen the range of package versions you've specified

remove package versions to allow pip attempt to solve the dependency conflict

Continuous editing

Hi, thanks for sharing this amazing work.
I'm currently experimenting with continuous editing. In the readme, it is specified that we should be able to modify the boxes by modifying the prompt. Unfortunately, this not looks immediate to me, can you provide more explanation?

Also, do you have an ETA for the release of the more powerful model? Thanks in advance!

Inference GPU memory

Hi, I want to know, What is the minimum GPU memory required? I had inferenced the model in GPU RTX3090 24G, but it turned out of CUDA out of memory, what can I do to reduce the GPU memory ? Thanks.

Quantized Model Usage?

Is there a way to use quantized models? The current version is really out of reach for most people with consumer-grade GPUs.

Also, will you train / release models for SD1.5 and SDXL?

Thanks!

Image generation results are easily affected by random seed

I got good results using the recommended prompt and seed, but when I used the recommended prompt and random seed I got worse results.
Good Result:
input prompt: a black dog and a white cat
input seed: 15

Bad Result:
input prompt: 10 apples on the table
input seed: 15

Any solutions?

llama 13b version?

The paper mentioned you use llama-13B as the LLM model. But the provided version is 7B. Have you observed any performance drop because of the different LLM size. Will you plan to release the 13B version?

Example results

Hi, I tried to reproduce some of your examples and this is what I got:

Screenshot 2024-04-10 at 13 10 07

While I understand that errors may occur, it looks to me that something is broken in my installation, am I correct? I have loaded LLaMA in bfloat16, could it be caused by that in your opinion?

bug on initializing open-clip model

In the config, you disable the clip by setting the version to None, So the initializing the clip encounter the following error. How to fix this bug?

cond_stage_config:
target: ranni.ranni.HackedFrozenOpenCLIPEmbedder
params:
freeze: True
layer: "penultimate"
version: null # disable loading CLIP here

Screenshot 2024-04-08 at 10 50 00 PM

Why are the results I tried so different from the results in the paper?

demo error

size mismatch for base_model.model.model.layers.31.self_attn.q_proj.lora_B.default.weight: copying a param with shape torch.Size([4096, 64]) from checkpoint, the shape in current model is torch.Size([5120, 64]).

I met error like this, how do I solve it ?

Segmentation fault error when doing panel-to-image

Segmentation fault error when doing panel-to-image

Error also happens on sample images / generation

Why the box prompt not same as in paper.

In the paper, you give the system prompt with detailed illustration of coordinate system and the example. But in this codebase, you remove those contents in the box prompt. Is there any difference in these settings?

Screenshot 2024-04-09 at 10 02 03 PM

all black when running the text to panel

after run the demo and select an example, this is what I got:

ModuleNotFoundError: No module named 'ldm.models'

Thank you for work.
The error encountered during runtime is as follows.

It seems that the models is missed in ldm package

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.