Giter Club home page Giter Club logo

pndm's Introduction

Hi there πŸ‘‹

Hi, I am Luping Liu (刘路平). I am a second-year Master's student in Computer Science from Zhejiang University, supervised by Pro. Zhou Zhao. I also collaborate with Yi Ren (Bytedance AI Lab), Chongxuan Li (Renmin University of China) closely. Prior to that, I just obtained my Bachelor’s degree in Mathematics from Chu Kochen Honors College, Zhejiang University, supervised by Pro. Feng Wang. For now, I focus on the acceleration and application of diffusion models, security and generalization of AI. My ultimate goal is to build general artificial intelligence to free people from repetitive and tedious work.

I am looking for a 2024 Fall PhD position and academic collaboration. If you are interested in my work, please feel free to contact me via email at [email protected].

πŸ“Ž Homepages

πŸ”₯ News

  • 2023.07: I begin an intership at Tencent, working on the Hunyuan Text-to-Image Project.
  • 2023.04: πŸŽ‰πŸŽ‰ I get my first 100 citations on Google Scholar.
  • 2023.04: One paper is accepted by ICML 2023.
  • 2022.10: πŸŽ‰πŸŽ‰ I win the 2022 national scholarship.
  • 2022.05: My PNDM is used as the default acceleration sampling method for CompVis/stable-diffusion .
  • 2022.02: I release the official implementation for PNDM (ICLR 2022).

πŸ“ Publications

pndm's People

Contributors

luping-liu avatar patrickvonplaten avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

pndm's Issues

Is the implementation for FON really 4-th order?

Thanks for your inspiring work.

I found the implementation of FON is confusing, which may not what claimed in the paper.

Paper claims
image

However,

def gen_fon(img, t, t_next, model, alphas_cump, ets):
    t_list = [t, (t + t_next) / 2.0, t_next]
    if len(ets) > 2:
        noise = model(img, t)
        img_next = transfer(img, t, t-1, noise, alphas_cump)
        delta = img_next - img
        ets.append(delta)
    else:
        noise = model(img, t_list[0])
        img_ = transfer(img, t, t - 1, noise, alphas_cump)
        delta_1 = img_ - img
        ets.append(delta_1)

        img_2 = img + delta_1 * (t - t_next).view(-1, 1, 1, 1) / 2.0
        noise = model(img_2, t_list[1])
        img_ = transfer(img, t, t - 1, noise, alphas_cump)
        delta_2 = img_ - img

        img_3 = img + delta_2 * (t - t_next).view(-1, 1, 1, 1) / 2.0
        noise = model(img_3, t_list[1])
        img_ = transfer(img, t, t - 1, noise, alphas_cump)
        delta_3 = img_ - img

        img_4 = img + delta_3 * (t - t_next).view(-1, 1, 1, 1)
        noise = model(img_4, t_list[2])
        img_ = transfer(img, t, t - 1, noise, alphas_cump)
        delta_4 = img_ - img
        delta = (1 / 6.0) * (delta_1 + 2*delta_2 + 2*delta_3 + delta_4)

    img_next = img + delta * (t - t_next).view(-1, 1, 1, 1)
    return img_next

After len(ets) > 2, delta seems only use first order info?

The aliyundrive is empty.

I am writing a paper and need to use your code, but the checkpoint and the fid is empty in the aliyundirve.

Predicted x_0 during denoising

Hi, thanks for the great work!
I'm wondering is it possible to get a predicted x_0 during the denoising process like DDIM? If yes, could you maybe expalin a bit?
Thanks a lot!

How to train a model for new Data

Hi @luping-liu! Thank you for your job. ItΒ΄s really interesting!

Could you help me to know how could I train a model with new data? I have Image in 256x256 size and I have divided it into validation and train because it is medical data and must not be mixed because the train and the validation data will be very correlated.

Regards.

Conditional PNDM

Hi, I wanted to use PNDM for conditional image generation and would like to know how to integrate that. Thanks

Training time

Hi, thanks for your interesting work.

What is your training time on each dataset?

Segmentation fault (11)

Hey I am back, so i moved the program over to a google colab. Ran this command:

!python main.py --runner sample --method F-PNDM --sample_speed 50 --device cuda --config ddim_cifar10.yml --image_path results --model_path models/ddim_cifar10.ckpt
Everything was going fine and then ->

image

any advice?

A question of β€œPseudo Numerical Methodβ€œ

Hi, thank you for your interesting work.

I have a question. Is the "pseudo numerical method" one of the methods of classical numerical? Or is it just named specifically for your research? As I searched for the name in the numerical methods domain, but I didn't find it. So I guess it is a specific term you named specifically for your work?

Looking forward to your reply. Thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.