Giter Club home page Giter Club logo

sd-diffusion-and-dragons's People

Contributors

hunter-meloche avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

sd-diffusion-and-dragons's Issues

Create player view

We need a separate tab or view that the players can see. Ideally, it's a fullscreen gallery that is updated live as images are generated. It also needs to not be too intrusive for tabs to not be easily switched to Owlbear Rodeo or Roll20.

img2txt

Take advantage of BLIP to interrogate generated images for detailed text descriptions. This may be more accurate than going from description to image.

Create long term storage feature to create context for the ChatGPT API

Every time you prompt ChatGPT through the API, it has no memory of previous prompts. This sucks. We can get around this by creating "memories" with structured directories and text files. For example, if we want a feature that allows you to have dialogue with a specific character in your campaign you can have a main summary text file that lists notable information and events in chronological order. Next to each event, there will be a reference to a "memory" text file that expands on that individual event. Sort of like a foreign key in SQL. It might make sense to use sqlite in the future, but I'm keeping it basic for the PoC.

User interacts with dialogue interface -> engineered prompt + user input figures out who the player is trying to talk to and queries memories for references to the character -> ideally it finds it and scans the summary for the character -> any additional granular information that might be found in separate memory files is ascertained -> another (or several) prompts are automatically run in the background to summarize the relevant information the final prompt needs with the player dialogue -> an in-character reply is sent back

This will require a lot of prompts for each "cycle" of information retrieval. We need to also ensure every prompt stays below 3000 words to ensure we don't go past the API's max token amount of 4096.

Have image view inside of DnD tab of SD-web-ui

Ideally we're not having users switch in and out of our tab constantly. We want them to be able to generate the prompt, as well as generate and view the image all in the same place.

Add generative natural language output feature

The idea is there's a separate output box and button for an a natural language description of whatever the input is. For example, you could say "an opened treasure chest". The output would explain the the way the chest looked, the gold, jewels, and other treasures inside of it as if you were reading it in a book.

This will require an entirely different prompt to ChatGPT than our SD prompt generator. It may be best to have it pretend to be a famous fantasy author.

We would then have another button that would feed that natural language description into our original stable diffusion prompt generator for an even better (hopefully) image prompt output.

Add positive and negative prompt boxes

If you want your images to have a constant style, you may want to have "photorealistic" or "anime" as positive tags that are added to the beginning of the generated prompt. The same case with negatives to move away from certain styles or concepts.

Create quality tags checkbox

Add checkbox that automatically prepends generated image prompts with tags like 'masterpiece' and 'best quality'.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.