Giter Club home page Giter Club logo

wikidated's People

Contributors

lschmelzeisen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

wikidated's Issues

Error in build_wikidated_v1_0.py

Dear author. I tried running the build_wikidated_v1_0.py script, but I encountered the following error. Could you help me check what's going wrong?

$ python build_wikidated_v1_0.py 
2023-09-04 11:08:23,462 E Exception occurred.
Traceback (most recent call last):
  File "build_wikidated_v1_0.py", line 44, in <module>
    _main()
  File "build_wikidated_v1_0.py", line 36, in _main
    wikidata_dump = wikidated_manager.wikidata_dump(date(year=2023, month=9, day=1))
  File "/data/wikidated/src/wikidated/wikidated_manager.py", line 42, in wikidata_dump
    return WikidataDump(self.dump_dir, version=version, mirror=mirror)
  File "/data/wikidated/src/wikidated/wikidata/wikidata_dump.py", line 61, in __init__
    self._dump_dir, self.version, self.mirror
  File "/data/wikidated/src/wikidated/wikidata/wikidata_dump.py", line 160, in load
    dump_status = _WikidataDumpStatus.parse_file(path)
  File "pydantic/main.py", line 569, in pydantic.main.BaseModel.parse_file
  File "pydantic/main.py", line 526, in pydantic.main.BaseModel.parse_obj
  File "pydantic/main.py", line 341, in pydantic.main.BaseModel.__init__
pydantic.error_wrappers.ValidationError: 988 validation errors for _WikidataDumpStatus
jobs -> xmlpagelogsdumprecombine -> updated
  time data '' does not match format '%Y-%m-%d %H:%M:%S' (type=value_error)
jobs -> xmlpagelogsdumprecombine -> files -> wikidatawiki-20230901-pages-logging.xml.gz -> size
  field required (type=value_error.missing)
jobs -> xmlpagelogsdumprecombine -> files -> wikidatawiki-20230901-pages-logging.xml.gz -> url
  field required (type=value_error.missing)
jobs -> xmlpagelogsdumprecombine -> files -> wikidatawiki-20230901-pages-logging.xml.gz -> md5
  field required (type=value_error.missing)
jobs -> xmlpagelogsdumprecombine -> files -> wikidatawiki-20230901-pages-logging.xml.gz -> sha1
  field required (type=value_error.missing)
...

By the way, I made the following two modifications:

  1. Since Apache no longer provides Maven 3.8.4, I updated the Maven version to 3.8.8.
  2. Since Wikipedia no longer provides dumps for older versions, I set the date to 2023-09-01.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.