Light

pozitronik / sinner Goto Github PK

View Code? Open in Web Editor NEW

70.0 70.0 9.0 166.29 MB

Sinner: Sinner Is NoN Exactly Roop

License: GNU Affero General Public License v3.0

Python 100.00%

deepfake deepfakes face-enhancement image-processing video-processing

sinner's Introduction

Hello there 👋

Feel free to contribute to my repos! There are some of my favorites:

cloud.mail.ru support for Total Commander. Pure Delphi, no bullshit.
Video/image deepfake tool. It is not a sin if you had fun!
Yii2 grids configuration module. Let users choose, how it looks.
Yii2 access control module. More flexible than any RBAC-based solution.
and more!

sinner's People

Contributors

Stargazers

Watchers

Forkers

neurogen-dev cj0596 5l1v3r1 degengames bd100b cloudteste5150 navezjt gargantua-voided pneumix

sinner's Issues

Fixme: get output path from processing modules

BatchProcessingCore.suggest_output_path() doesn't applicable when path to a directory passed in --output-path.
If no --output-path is provided, the output path is generated without source path reference.

Feature: multiple targets

Check if it is possible to use multiple sources for swapping. Every source keeps a face with a certain position, and if the target face position is near to the face position,this face source should be used.

Feature: inpainting

Implement inpainting to do better handling of items over the faces.

Option to use ffmpeg/cv2 handlers

I definitely need an option to select which video handler backend should be used.

More processing information

Show, at least, all applied parameters and all characteristics of target and source.

Feature: realtime player

In GUI mode do a realtime player: get frames and the sound from the target, do in-memory frames swap (may be with some buffering) or any other possible operation, and play the result. Also add quality presets: to improve processing speed, frames can be shrunken before processing, and restored in size after.
Also it may be possible to enable frame drop: render as many frames as possible, and just drop any other.

Known issues:

The PygameFramePlayer may not refresh when showing frame while playing (e.g. shows two frames at same time)

Feature: job list from GUI

Add a button, that will add current preview parameters to a certain batch list.

If some frames are lost, try to to reprocess them

Currently, if there is some gap in processed frames, sinner will not handle it: final state check will report about some inconsistency, but lost frames are ignored on restart anyway.

Lookahead for existing processed frames in the processing chain

It is about the case, when a task was stopped, and already processed frames deleted by user (as example, extracted frames when they are already swapped, and now enhancing in progress). Sinner will reprocess those frames again on the next task run, although they are not required anymore.
There is required some lookahead procedure to determine, what processor is running now. It is simple: find last state folder with files, look, what frames are still unprocessed, and prepare only them (if they're not exists).

Feature: intervals support

Add something like --intervals=(start1, stop2) (start1, stop2) , where start and stop are integer markers of frame intervals to process.

Allow to control each processor sub-threads count (`ThreadPoolExecutor` workers).

As example, run FaceSwapper on the 16 CPU threads, and FaceEnhancer on the 2 CUDA threads.

Feature: multiple sources

Allow to pass multiple sources for processing, which should result to process target for the every source repeatedly.
In the GUI mode multiple sources may display as a list of images, and preview result should render for everyone of them, resulting to a list of previews.

Infinite benchmarking

If all benchmark runs has a small delta, the benchmarking process may newer stop. It will be better to measure delta between the fastest and the slowest runs.

Rounded FPS

It seems that when source FPS is fractionary (e.g. like 29.97004) and target FPS is the same, any h264/h265 encoded videos have the same problem with broken rewind in the end part of the target video.
The solution is to round up FPS values by default.

Refactor parameters handler

Every module should be able to handle it own set of command-line parameters. Those parameters should be gathered before program start. Every module also should be able to validate parameters by itself.

Multithreading processing chain

In the current processing logic all frame processors runs in turns, one after one. The advantages of this logic are obvious: it is simple, there is a possibility to stop and resume processing at any time, e.t.c.
But there are disadvantages too, and they seems to be critical at this moment. There is no possibility to implement any realtime preview with this logic, and there is no place for speed enhancements, because we can't parallelize processing task.
So, there is a big (really big) task to implement multithreading there.

There will be named buffers, each buffer named to a linked processor. Each processor processing will start in the separate thread, it will read a frame from the input buffer, process it, and write to the output buffer (which is input buffer for the next processor).

There are subtasks for this task:

Implement the buffered processing chain.
Make *FrameBuffer classes for the different buffers types (memory, file, e.t.c).
Implement threads manager. Some processors may work faster than others, so their buffers will fill quickly. So it make a sense to pause those processors threads to redistribute computing powers to the other threads.
Implement states in this approach. Saving and loading buffers as a files can be useful: this logic partially coincides with the current state logic.
#68
Implement a memory control code. It is important, because several loaded models may require a lot of memory, and frame buffers are too.
#70
Show the general progress bar and the every processor progress bar and statistics (memory usage, buffers occupancy, e.t.c).
Properly handle in-threads exceptions.

Feature: gradio

tkinter sucks so everybody is migrating to gradio (original roop & roop unleashed)

your fork is the most OOP so migration should be better

BUG: DirectoryHandler broken input with extract-frames

When processing directory with --extract-frames parameter, there is no processing, because DirectoryHandler tries to process self.state.in_dir which is not applicable for this type of input.

Feature: remember used directories list

I want to store all previously selected target/source directories in the config file to use them in the file dialogs.

More control flags and ini options for ffmpeg

I definitely want to add more control flags and ini options for ffmpeg, because now ffmpeg resulting can produce unoptimal and, sometimes, broken, videos.

Feature: FaceSwapper face selection

Let user choose what faces should be processed.

BUG: broken previews, if target is album oriented

subj

BUG: temp folders names collisions

There is a collision in the case, when two different videos have the same source file. It produces the same temporary directory name for the FaceEnhancer processor. The solution: the target (not the source) file name should be used in that case somehow.

Originally posted by @pozitronik in #28 (comment)

Different modules settings via config file

Implement support to read module setting from the current config file, like

[FaceSwapper]
execution-provider=cpu
threads=16

[FaceEnhancer]
execution-provider=cuda
threads=2

as addition to current global configurations (config file and command line)

Feature: dynamic preview

Add a control to the preview picture, that allow to compare original and processed frame.

Check all requirements on latest Ubuntu before release

Just to make sure, that all is ok.

Move frame extraction routines to a processor

Instead using frame handler, when frames extraction is required, it is possible to create FrameExtractor processor and use it in the processing chain.
It allow to make code more clean.

Core: 'NoneType' object has no attribute 'normed_embedding'

When trying to use 'cara.jpg' as a source.
The thing is in the image itself, not in the format (it can be any).

Feature: different preview processors

It is possible to use different set of processors to make the quick preview (as example: halved image and no enhancing). The full processing can be done by request.

BUG: Wrong frames count determination (again)?

It seems, that sometimes the frame count of some videos still incorrect, leads to unpredictable errors.
In one case, frame count of the video is 24750 (according both to ffprobe or cv2), but only 24745 were extracted for swapping. FaceEnhancer, somehow, sees 27844 frames, which is wrong in any way. Here can be a bug in the state module.
Also, for this video resulting frame count determined as 360.0, which also isn't right.

Feature: config with defaults

Add support to config, that will store default parameters for every run, if those parameters are not overlapped in command line.

Fix GUI source loading + update processor target without reload

Due to changed logic, there is no source_path in Core anymore.
Also, the source (and maybe the target) can be replaced in the current processors without full reloading.

Refactoring: allow stateless processing

There are different usage scenarios, and most of them are stateless. In fact, states are required only within batch processing chain, but state object is heavily integrated into BaseFrameProcessor class. The same is applicable for data entries (source/input/output files/directories/etc).
So, i have to:

Move out states from BaseFrameProcessor to some kind of top-level handler (I see it in rewritten Core).
Make more general FrameProcessors to work only with frames, not files.

GUI: preview resizing

Preview control should be properly sized, and support windows resize as well

GPU + CPU usage

It is possible to run some processing instances on CUDAExecutionProvider and some on CPUExecutionProvider. In theory, it can utilize GPU and CPU and increase overall speed.

Feature: input from live stream (like camera)

Add a frame processor, which will receive frames from camera (via cv2.VideoCapture).

Feature: better replace for rotated faces

When face in the target is rotated, FaceSwapper result on it is bad. So, it is possible to counter rotate the frame before swapping, and rotate it back after, to achieve better swapping.

DirectoryHandler must respect initial filenames

Currently, all processed frames are renamed by their numerical positions. It is okay for video frames, but in other cases processed files should be saved with their original names.

Check frames valildity after processing

Before create resulting video and deleting processed frames, it is need to check frames validity: if are there all frames in the sequence and all frames are not zero-sized.

ffprobe frame count detection can be really slow on some files

It seems that current implementation is really counting all frames, one by one (maybe, only when no appropriate value is in video header). It may lasts for minutes.
I need to find a way to get frames count in faster manner (may be use a binary search approach, as in C2VideoHandler).
Or i just can implement a workaround in VideoHandler: use detect_fc() from C2VideoHandler and result() from FFmpegVideoHandler.

State class needs tests
FaceAnalyser class needs tests

Add help strings to the every loadable attribute.
Add -h help to show help output. It requires to collect all available help data somehow. As the first approach, it requires to find all descendants of the AttributeLoader class.
Fix spacing in errors output.
Do not show attributes without help, and modules without attributes in help output.
Fix doubling help strings in the help output

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.