Giter Club home page Giter Club logo

Comments (3)

Sblop avatar Sblop commented on June 2, 2024

check if you have a document in your consumer folder, that paperless cant consume.

from paperless.

shtrom avatar shtrom commented on June 2, 2024

Yup, that seems to have been the case. Looking at the logs, it kept trying to process the same file, and fail:
PARSE FAILURE for /consume/doc.jpg: Language detection failed. Set PAPERLESS_FORGIVING_OCR in config file to continue anyway.

What I'm uncertain about is that, seeing the suggestion in the log message, I set PAPERLESS_FORGIVING_OCR=true in the env_file (docker-compose.yml). It does shows up when I check the env in the consumer container

$ docker-compose run consumer /usr/bin/env                                                                         7s 126 ↵ master
Starting paperless_webserver_1 ... done
HOSTNAME=c1133e84f7ad
PAPERLESS_CONSUMPTION_DIR=/consume
PWD=/usr/src/paperless/src
TZ=Australia/Hobart
HOME=/root
PAPERLESS_FORGIVING_OCR=true
PAPERLESS_EXPORT_DIR=/export
TERM=xterm
SHLVL=0
PAPERLESS_OCR_LANGUAGES=eng fre
PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
PAPERLESS_DISABLE_LOGIN=true

but the consumer continues to choke on it, and suggesting the same fix.

Note that I also have PAPERLESS_DISABLE_LOGIN=true, which also seems to be ignored by the webserver, so maybe I'm not passing the env properly through the docker-compose.env:

$ grep ^PAPERLESS docker-compose.env                                                                                    0s  master
PAPERLESS_DISABLE_LOGIN=true
PAPERLESS_FORGIVING_OCR=true
PAPERLESS_OCR_LANGUAGES=eng fre

Am I setting those properly?

from paperless.

shtrom avatar shtrom commented on June 2, 2024

Yeah, there were a few pathological documents. When I got them out of the way, Paperless finished processing everything else, and the consumer went back to 0%.

Still uncertain about how to pass configuration options through the env, but that's a different issue.

Thanks!

from paperless.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.