Giter Club home page Giter Club logo

piicatcher_spacy's Introduction

piicatcher_spacy

PIICatcher plugin that uses spacy to detect PII

piicatcher_spacy's People

Contributors

vrajat avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

piicatcher_spacy's Issues

No compatible package found for 'en_US_core_news_lg'

I'm testing out this library and when I try running this command:
piicatcher detect --source-name pii_test --scan-type data

I get the following error:
No compatible package found for 'en_US_core_news_lg' (spaCy v3.2.1)

I'm using these versions:
python = "3.8.5"
piicatcher = "0.19.1"
piicatcher-spacy = "^0.1.0-alpha.3"
spacy = "3.2.1"

Data are identified incorrectly

Hi
Could you help with spacy dectctor? We see the different result from piicatcher_spacy and spacy.
Fot test we use the table with one row. We check the same data in https://huggingface.co/spacy/ru_core_news_md
Examples (DatumSpacyDetector):
Column; Value; piicatcher_spacy; spacy
City; Самара; Person; LOC
Address; Самара, ул. Зеленая 20-2; Person; LOC
Last_name; Васильков; Address; No token was detected
Resource; CAREER_SECTION; Person; No token was detected

We use the same spacy mode:

    def __init__(self, model: str = "ru_core_news_md"):
        super(SpacyDetector, self).__init__()

Unable to install when using Python version 3.9.12

I'm trying to do a deep scan using piicatcher but cannot install piicatcher_spacy.

`
(venv) user$ pip3 install piicatcher_spacy

ERROR: Ignored the following versions that require a different python version: 0.1.0.dev3 Requires-Python >=3.6.2,<3.9; 0.1.1 Requires-Python >=3.6.2,<3.9

ERROR: Could not find a version that satisfies the requirement piicatcher_spacy (from versions: none)

ERROR: No matching distribution found for piicatcher_spacy

(venv) user$ python3 -V

Python 3.9.12
`

Is there an alternative approach for this?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.