wa-kat's Introduction

Introduction

WA-KAT simplifies the work of curators of the Webarchive of the National Library of the Czech Republic by cataloging electronic resources with semi-automatic analysis.

Documentation

Module documentation and descriptions are available on Read the Docs.

wa-kat's People

Contributors

bystroushaak, dependabot[bot], kvasnicaj, visgean, westfood

wa-kat's Issues

Tests

Add full unittest coverage.

Implement parsing of the Author

Suggestion from meta (author), DC, National Authorities database || suggestion from meta (author, copyright), National Authorities database - will be downloaded from the admin // check against the authority database
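A minimal sketch of the first step, collecting author candidates from meta tags, might look like the following. It uses only the Python 3 standard library. The names `AUTHOR_META_NAMES`, `AuthorMetaParser` and `suggest_authors` are hypothetical, and treating `DC.creator` as the Dublin Core author field is an assumption; the lookup in the National Authorities database is not covered.

```python
from html.parser import HTMLParser

# Meta names treated as author hints; this set is an assumption for the sketch.
AUTHOR_META_NAMES = {"author", "copyright", "dc.creator"}


class AuthorMetaParser(HTMLParser):
    """Collect the content of <meta> tags whose name suggests an author."""

    def __init__(self):
        super().__init__()
        self.candidates = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attrs = dict(attrs)
        # Compare names case-insensitively and ignore empty values.
        name = (attrs.get("name") or "").lower()
        content = (attrs.get("content") or "").strip()
        if name in AUTHOR_META_NAMES and content:
            self.candidates.append((name, content))


def suggest_authors(html):
    """Return (meta name, value) pairs found in the given HTML string."""
    parser = AuthorMetaParser()
    parser.feed(html)
    return parser.candidates
```

Each candidate would still have to be checked against the authority database before it is offered to the curator.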

Error

database conflict error
Traceback (most recent call last):
  File "bin/../src/wa_kat/rest_api/__init__.py", line 40, in get_result
    ri.paralel_processing()
  File "bin/../src/wa_kat/zeo/request_info.py", line 109, in paralel_processing
    self.processing_started_ts = time.time()
  File "/usr/local/lib/python2.7/dist-packages/transaction/_manager.py", line 120, in __exit__
    self.commit()
  File "/usr/local/lib/python2.7/dist-packages/transaction/_manager.py", line 111, in commit
    return self.get().commit()
  File "/usr/local/lib/python2.7/dist-packages/transaction/_transaction.py", line 280, in commit
    reraise(t, v, tb)
  File "/usr/local/lib/python2.7/dist-packages/transaction/_transaction.py", line 271, in commit
    self._commitResources()
  File "/usr/local/lib/python2.7/dist-packages/transaction/_transaction.py", line 417, in _commitResources
    reraise(t, v, tb)
  File "/usr/local/lib/python2.7/dist-packages/transaction/_transaction.py", line 394, in _commitResources
    rm.tpc_vote(self)
  File "/usr/local/lib/python2.7/dist-packages/ZODB/Connection.py", line 786, in tpc_vote
    s = vote(transaction)
  File "/usr/local/lib/python2.7/dist-packages/ZEO/ClientStorage.py", line 1107, in tpc_vote
    return self._check_serials()
  File "/usr/local/lib/python2.7/dist-packages/ZEO/ClientStorage.py", line 937, in _check_serials
    raise s
ConflictError: database conflict error (oid 0x0192, class wa_kat.zeo.request_info.RequestInfo, serial this txn started with 0x03b3a6fe24266baa 2015-12-08 17:02:08.472675, serial currently committed 0x03b3a6fe24d0cd99 2015-12-08 17:02:08.628665)
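The ConflictError above is raised when two transactions modify the same object at nearly the same time. One common way to cope with it is to retry the failed operation. The following is only a sketch of that idea, not the fix used in WA-KAT: `retry_on_conflict` is a hypothetical helper, and the local `ConflictError` class is a stand-in for `ZODB.POSException.ConflictError` so that the example runs without ZODB installed.

```python
import time


class ConflictError(Exception):
    """Stand-in for ZODB.POSException.ConflictError so the sketch runs alone."""


def retry_on_conflict(operation, attempts=3, delay=0.0):
    """Call operation() and retry when it raises ConflictError.

    Re-raises the last ConflictError once all attempts are used up.
    """
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except ConflictError:
            if attempt == attempts:
                raise
            # Give the competing transaction time to finish.
            time.sleep(delay)
```

In real code, `operation` would presumably have to open a fresh transaction on every call, so that each retry sees the currently committed state of the object.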

Sphinx docstrings

Add docstrings for all of the following modules:

.
├── analyzers
│   ├── annotation_detector.py
│   ├── author_detector.py
│   ├── creation_date_detector.py
│   ├── __init__.py
│   ├── keyword_detector.py
│   ├── language_detector.py
│   ├── place_detector.py
│   ├── shared.py
│   ├── source_string.py
│   └── title_detector.py
├── connectors
│   ├── aleph.py
│   ├── __init__.py
│   └── seeder.py
├── convertors
│   ├── __init__.py
│   ├── iso_codes.py
│   ├── mrc.py
│   └── to_dc.py
├── rest_api
│   ├── aleph_api.py
│   ├── __init__.py
│   ├── keywords.py
│   ├── shared.py
│   ├── to_output.py
│   └── virtual_fs.py
├── templates
│   └── static
│       ├── css
│       ├── fonts
│       └── js
│           └── Lib
│               └── site-packages
│                   ├── components
│                   │   ├── author_picker.py
│                   │   ├── author_switcher.py
│                   │   ├── conspect_handler.py
│                   │   ├── dropdown_handler.py
│                   │   ├── errorbox.py
│                   │   ├── __init__.py
│                   │   ├── input_controller.py
│                   │   ├── keyword_handler.py
│                   │   ├── log_view2.py
│                   │   ├── output_picker.py
│                   │   ├── overlay_controller.py
│                   │   ├── placeholder_handler.py
│                   │   ├── progressbar.py
│                   │   └── shared.py
│                   ├── descriptors.py
│                   ├── rules_view.py
│                   ├── view.py
│                   └── wa_kat_main.py
├── zeo
│   ├── __init__.py
│   ├── request_database.py
│   ├── request_info.py
│   └── worker.py
├── bottle_index.py
├── data_model.py
├── __init__.py
└── settings.py
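As an illustration of what such a docstring could look like, here is a sketch that uses the reStructuredText field lists Sphinx understands out of the box. The function `normalize_title` is hypothetical and is not taken from the modules listed above; the project may prefer a different docstring style.

```python
def normalize_title(title):
    """
    Strip surrounding whitespace from a page title.

    :param str title: Raw title as found in the page.
    :returns: Title without leading and trailing whitespace.
    :rtype: str
    """
    return title.strip()
```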

Language tag normalization?

Implement language tag normalization?

en -> english
eng -> english

And so on. Which format does the output data require?
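If full language names turn out to be the required output, the normalization could be sketched as a simple lookup. Only the `en` and `eng` entries come from the examples above; the Czech entries, the table name `LANGUAGE_NAMES` and the function `normalize_language` are assumptions made for illustration.

```python
# Hypothetical lookup table; only "en" and "eng" come from the issue text.
LANGUAGE_NAMES = {
    "en": "english",
    "eng": "english",
    "cs": "czech",
    "cze": "czech",
    "ces": "czech",
}


def normalize_language(tag):
    """Map a language tag such as "en", "ENG" or "en-US" to a language name.

    Unknown tags are returned lowercased, without a region subtag.
    """
    key = tag.strip().lower()
    # Drop a region subtag such as "en-US" or "en_GB".
    key = key.replace("_", "-").split("-")[0]
    return LANGUAGE_NAMES.get(key, key)
```

Returning unknown tags unchanged keeps the function safe to call on values that are already normalized.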
