Giter Club home page Giter Club logo

Comments (7)

bainadeashish avatar bainadeashish commented on July 3, 2024 2

I am getting text contents from Azure web server with below code
from tika import parser
text = parser.from_file(file)--file is doc file

I am getting below error
HTTPConnectionPool(host='localhost', port=9998): Max retries exceeded with url: /rmeta/text (Caused by NewConnectionError(': Failed to establish a new connection: [Errno 10013] An attempt was made to access a socket in a way forbidden by its access permissions',))

from tika-python.

chrismattmann avatar chrismattmann commented on July 3, 2024

Hi Frank @fysteven were you doing lots of parallel connections?

from tika-python.

fysteven avatar fysteven commented on July 3, 2024

Hi Chris @chrismattmann , the following snippet is how I iterated the file list.
It failed around the 8181st detection of file.
Looks like the Tika Server would establish a new connection for each detection, right?

        for idx, val in enumerate(file_list):
            file_name = os.path.basename(val)
            file_type = detector.from_file(''.join([base_directory, val]))

from tika-python.

chrismattmann avatar chrismattmann commented on July 3, 2024

hi @fysteven any update here? Are you still seeing this?

from tika-python.

chrismattmann avatar chrismattmann commented on July 3, 2024

closing since I never heard back. Please re-open if you still see this.

from tika-python.

mahtani89 avatar mahtani89 commented on July 3, 2024

Has anyone found a fix to the issue?

from tika-python.

chrismattmann avatar chrismattmann commented on July 3, 2024

The issue usually relates to:

  1. Your firewall doesn't allow incoming connections on localhost to 9998
  2. You already had your own Tika server running on 9998 (not started by Python) and something gets messed up - you need to kill that Tika server, and then let Python start it for you.

from tika-python.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.