Giter Club home page Giter Club logo

file-catalog-indexer's People

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

Forkers

lgtm-migrator

file-catalog-indexer's Issues

By Default Don't Patch

Replace --no-patch with --patch. Since not patching is the most common usage, it shouldn't require a command-line option.

Publish to PyPI with `wipac-cicd.yml`

We can now use:

We will only need the flake8 and mypy jobs in wipac-cicd.yml, at a minimum. Packaging the repo could have benefits (which means publishing to PyPI, etc), but this is not necessary, though tempting so let's do it! (required for @blinkdog's new disk pipeline).

See WIPACrepo/wipac-dev-tools#20

Edit: Indexer will now be published as a package (see #43)

Add "Good Run(s)" Field

def _get_events_data(self) -> types.EventsData:

Currently, "content_status" is solely based on whether the .i3 file can be read.

There are also the "good runs" list files. Do we want to consider these? This could be a new field in the FC record.

Optionally, we wait until we have an event-based store since this matches the "good run" granularity.

L2 Indexing Race Condition

There's a potential race condition when indexing L2 files, if the client script is using index_file() directly and sharing a single MetadataManager instance between threads. This isn't an issue for using index().

# get directory's metadata
file_dir_path = os.path.dirname(os.path.abspath(file.path))
if (not self.real_l2_dir_metadata) or (file_dir_path != self.dir_path):
self.dir_path = file_dir_path

Solutions include

  • add a threading.Lock() context manager around the above code
  • creating an instance-attribute dict (self.L2_dir_data) keyed on dir_path (instead of a single self.dir_path & self.real_l2_dir_metadata)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.