FreeOCR4ALL is a free online OPTICAL CHARACTER RECOGNITION (OCR) web tool that is used to extract text from images and pdf documents
You can view this at https://freeocr4all.onrender.com
But it has a problem with pytesseract and the path
FreeOCR4ALL.Demo.mp4
https://github.com/KoushikReddy24/FreeOCR4ALL/files/13623680/ocr.zip
It uses Tesseract open source software and pytesseract python modules to extract text from given files like jpeg,jpg,png,pdf etc. It consists of a Flask file, which serves as a backend to the website and processing part is carried in flask file. It consists of html files which have great user interface and are used to take input from the user, display the extracted text for the user to copy.
To deploy this project run
npm run deploy
Install my-project with npm
npm install my-project
cd my-project
Clone the project
git clone https://link-to-project
Go to the project directory
cd my-project
Install dependencies
npm install
Start the server
npm run start
If you want to run this project on your system follow these steps
- Clone this repository or download all the files.
- Download and install Tesseract OCR software from GitHub.
- Update your environment variables with the Tesseract installation path, which typically looks like: "C:\Program Files\Tesseract-OCR\tesseract.exe".
- Update the Tesseract path wherever required in the Flask file.
- Ensure you have installed necessary libraries like pytesseract and PIL using a package manager.
- Create a folder named "uploads" in the project directory.
- Update the path of the "uploads" folder in the required location in the Flask file.
- Make sure you have Python installed on your system.
- It's recommended to set up a virtual environment before installing Python libraries.
- Check for any additional configuration or dependencies mentioned in the project documentation.
I hope these instructions help you set up and run the project on your system. If you encounter any issues, refer to the project documentation or seek assistance from the project maintainers.