Giter Club home page Giter Club logo

Comments (7)

jannisborn avatar jannisborn commented on June 10, 2024

Hi @arifin-chemist89, thanks for reporting the issue. I believe the reason is an empty .jsonl file that is stored under paperscraper/server_dumps.
The file might be empty because the chemrxiv api is down (see #13).
The quick fix should be to delete the empty file.

from paperscraper.

arifin-chemist89 avatar arifin-chemist89 commented on June 10, 2024

Thanks for your quick response.
actually I could not find paperscraper/server_dumps directory. Furthermore, as I mentioned in edit1, I also got the same error even I am importing get_and_dump_arxiv_papers. Is the following line need to be done while dumping 'biorxiv', 'chemrxiv', 'medrxiv' db?

    path = sorted(dump_paths, reverse=True)[0]
    querier = XRXivQuery(path)
    QUERY_FN_DICT.update({db: querier.search_keywords})

from paperscraper.

jannisborn avatar jannisborn commented on June 10, 2024

Did you follow this instruction to download the dumps for chemrxiv, biorxiv and medrxiv before the error occurred for the first time? https://github.com/PhosphorylatedRabbits/paperscraper#download-x-rxiv-dumps

I can see from your error trace that your paperscraper is located in /.local/lib/python3.7/site-packages/paperscraper/.
Navigate there and go to paperscraper/server_dumps and I believe you will find an empty file from chemrxiv. Delete this file and restart your python shell this should do the quick fix

from paperscraper.

jannisborn avatar jannisborn commented on June 10, 2024

While I cant do much against the chemrxiv api being down, what definitely has to be improved is the error handling in the package. I will soon release a proper fix for this.

from paperscraper.

jannisborn avatar jannisborn commented on June 10, 2024

Issue should be fixed @arifin-chemist89
If you reinstall paperscraper, you should have version 0.1.1 which should run without this problem. Unfortunately, the chemrXiv API was shut down. See #16 and the Release notes

from paperscraper.

arifin-chemist89 avatar arifin-chemist89 commented on June 10, 2024

I am sorry for the unclear question. I think I can understand now. Anyway, thank you very much for your notice and quick bug fix. I will try to reinstall it and looking forward for chemrXiv API to go back in action again.

from paperscraper.

jannisborn avatar jannisborn commented on June 10, 2024

no worries, but there is simply no chemrxiv API anymore. Chemrxiv changed its host and the new host does not provide API access. See the README for details

from paperscraper.

Related Issues (15)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.