Python library to interact with the PDFTables.com API.
PIP:
pip install git+https://github.com/pdftables/python-pdftables-api.git
Locally:
python setup.py install
import pdftables_api
c = pdftables_api.Client('my-api-key')
c.xlsx('input.pdf', 'output.xlsx')
python -m unittest test.test_pdftables_api
If you are converting a large document (hundreds or thousands of pages), you may want to increase the timeout.
Here is an example of the sort of error that might be encountered:
ReadTimeout: HTTPSConnectionPool(host='pdftables.com', port=443): Read timed out. (read timeout=300)
The below example allows 60 seconds to connect to our server, and 1 hour to convert the document:
import pdftables_api
c = pdftables_api.Client('my-api-key', timeout=(60, 3600))
c.xlsx('input.pdf', 'output.xlsx')