Giter Club home page Giter Club logo

cloe's Introduction

Project logo

Cloe

Snipping utility for the MangaOCR model

Contents

About

Inspired by Capture2Text, Cloe is a snipping tool for the Manga OCR library. The project works similarly to Capture2Text but uses the MangaOCR model instead. See demo below to see how it works.

sample_usage.mp4

User Guide

Launch the application and wait for the model to load. Show the snipping window using shortcut Alt+Q and drag and hold the mouse cursor to start performing OCR.

Installation

Download the latest zip file here. Decompress the file in the desired directory. Make sure that the app folder is in the same folder as the shortcut Cloe.

System Requirements

Recommended:

  • Hard drive: at least 700 MB HD space
  • RAM: at least 2 GB (recommended)

Development Setup

  • Install poetry on a supported Python version (3.8, 3.9).
  • Clone this repo and install dependencies by running: poetry install --with dev.
  • In the app directory, use python main.py to run the app.
  • If you want to build the app locally, run pyinstaller main.spec in the build directory.

Acknowledgements

This project will not be possible without the MangaOCR model by Maciej Budyล›.

The software is licensed under GPLv3 (see LICENSE) and uses third party libraries that are distributed under their own terms (see LICENSE-3RD-PARTY).

The icons used in this project are from Icons8.

cloe's People

Contributors

blueaxis avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

cloe's Issues

Refactor settings tab component

  • Split BaseSettings into two components. One contains methods on initializations, saving, loading, and, resetting settings. The other component should handle connecting the methods to the UI.
  • Refactor child BaseSettingsTab classes to remove redundant methods.

Build is missing files and does not run

Running "pyinstaller main.spec" creates a build that is not functional. It fails to run. When compared to the downloadable build, many dll's and some folders are missing.

What are the specific steps to be able to make a build?

Load Modal Error

image
I have use Cloe for a while and 1 day when i open it, it keeps giving me this error even though i have try "Run as administrator" or delete Cloe and redowload it...... (I pretty sure that my internet have no problem)
Do you have any idea how to fix this ?

Unmapping hotkeys doesn't work

e.g. from default settings:

  • changing Start Capture from Q to <Unmapped> and saving: the hotkey reverts back to Q
  • changing Open Settings from <Unmapped> to A, saving, then changing A to <Unmapped> and saving: Cloe quietly crashes. Opening Cloe again shows Open Settings still set to A; attempting to change to <Unmapped> and saving again doesn't crash, but reverts the hotkey back to A.

Restore Defaults does work as expected though, so this isn't a huge deal ๐Ÿ™‚

Nothing Happens After Selecting Area to OCR

Currently on Windows 11 and am able to run Cloe, access the settings, and it loads the MangaOCR model. When I press Alt + Q the screen is highlighted and I can select an area, but then nothing else happens.

Took a look at the video in the readme and can see that the selection / highlight is meant to stay for a short period and that the OCR'd text is meant to show up etc, but I don't get any of that.

Have tried ruling out OS language (EN-GB/JA-JP), system locale (UK/JP/Unicode), and multiple displays (2 x External/Built-in Only) in case they have any quirky effects, but no changes.

[Linux bug report + possible fix] Small capture window on Linux

I've encountered a problem with the capture window on Manjaro Linux. After using the shortcut there was only a small capture box in the upper left corner of the screen, around 300x300 in size.
I was able to fix this by editing BaseOCRView class in base.py inside the ocr folder and adding self.setMinimumSize(QSize(1920,1080)) to it, to directly match my screen resolution. Works as a personal crude fix, but there should be a better way to handle this.
Thanks for the project, aside from some small problems it works perfectly.

No executable; unsure how to launch on Windows

I downloaded the zip file and made sure all of it's components are where they should be re:the FAQ. But there's no exe, no msi. Instructions say to "launch" the application but there's nothing to launch.

[Feature Request] Translation

Currently I have to keep pasting the OCR text into DeepL translation manually.
It would be awesome if the app has an option to show a translation of the OCR text below the Japanese text.

I made a mock up screenshot of what I mean:
image

Implement regular python packages

The app directory must have the following file structure

  • components/: visual components (i.e. widgets)
  • components/services/: non-visual components (i.e. threads/workers)
  • utils/: utilities
  • utils/scripts/: helper functions
  • utils/constants: constants
  • assets/: images and styles

Loading MangaOCR manually

Due to my network problem, I couldn't load MangaOCR successfully. So I want to know if I could download that model from any source and load it manually.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.