Comments (5)
If anybody needs to modify the timeout via REST, just add a header with "X-Tika-OCRTimeout: 200" for 200 seconds of timeout.
Example:
curl -T file_to_ocr.jpg localhost:9998/tika --header "X-Tika-OCRTimeout: 200"
from tika-server.deb.
The build script of the Debian Package now extracts the OCR config org/apache/tika/parser/ocr/TesseractOCRConfig.properties from the Tika Server JAR, changes timeout setting and adds/overwrites with changed config to/in the Tika Server JAR of the package.
from tika-server.deb.
Thanks for your tip, will add that in ETL plugin for the case someone uses a Tika on another server/installation which is not our preconfigured Tika deb package.
from tika-server.deb.
Timeout settings now by Open Semantic ETL using header X-Tika-OCRTimeout for Tika-Server.
from tika-server.deb.
I am having this pop up now; Its for the fake tika server
java[1950828]: ERROR [Thread-22] 22:22:34,199 org.apache.tika.server.core.ServerStatusWatcher Timeout task PARSE, millis elapsed 300091, timeoutMillis 300000, file id b'World History.pdf'consider increasing the allowable time with the <taskTimeoutMillis/> parameter or the X-Tika-Timeout-Millis header
Jan 27 22:22:34 mgp java[1950828]: WARN [Thread-22] 22:22:34,199 org.apache.tika.server.core.ServerStatusWatcher forked process observed TIMEOUT and is shutting down.
Jan 27 22:22:34 mgp java[1950828]: INFO [Thread-22] 22:22:34,214 org.apache.tika.server.core.ServerStatusWatcher Shutting down forked process with status: TIMEOUT
Jan 27 22:22:34 mgp etl_tasks[2349205]: [2022-01-27 22:22:34,677: WARNING/ForkPoolWorker-3] Connection to Tika server (will retry in 120 seconds) failed. Exception: ('Connection aborted.', RemoteDisconnected('Remote end closed connection without response'))
Jan 27 22:22:34 mgp etl_tasks[2349205]: [2022-01-27 22:22:34,677: WARNING/ForkPoolWorker-3] Retrying to connect to Tika server in 120 second(s).
Jan 27 22:22:34 mgp java[1929662]: INFO [pool-2-thread-1] 22:22:34,678 org.apache.tika.server.core.TikaServerWatchDog forked process exited with exit value 3
Jan 27 22:22:36 mgp java[1961770]: INFO [main] 22:22:36,867 org.apache.tika.server.core.TikaServerProcess Starting Apache Tika 2.2.1 server
Jan 27 22:22:37 mgp java[1961770]: INFO [main] 22:22:37,014 org.apache.tika.server.core.TikaServerProcess Using custom config: /etc/tika/tika-config-fakecache.xml
Jan 27 22:22:37 mgp java[1961770]: INFO [main] 22:22:37,897 org.apache.cxf.endpoint.ServerImpl Setting the server's publish address to be http://localhost:9999/
from tika-server.deb.
Related Issues (17)
- Upgrade to Tika 1.13 HOT 1
- Stopping Tika service / daemon HOT 3
- Upgrade to Tika 1.15
- Upgrade to Tika 1.16
- Ubuntu HOT 5
- Upgrade to Tika 1.18 HOT 2
- Error 500 if multiple OCR dictionaries HOT 4
- Upgrade to Tika 1.20 HOT 1
- Upgrade to Tika 1.21 HOT 1
- Upgrade to Tika 1.22 HOT 1
- Docker: Share OCR models with ETL container
- Upgrade to Tika 1.23 HOT 1
- Upgrade to Apache Tika 1.24 HOT 1
- Upgrade to Tika 1.25
- Move Tika version and URL to seperate config file HOT 2
- Upgrade to Tika 2.3.0 HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from tika-server.deb.