Comments (14)
Now I remember why I had has_pii
instead of pii
column for databases. Piicatcher uses a library called tableprint to print the ascii table and it has its limitations. You've hit both of them.
- It is hard to format long columns
- It expects a terminal to print a table. It cant be redirected to a file.
I am not sure there is an easy fix for either of them. I suggest you use JSON to store to a file. In #51 I'll improve JSON format to also include the PII types.
from piicatcher.
Thanks. So can we save this data directly in db?
from piicatcher.
from piicatcher.
Thanks ! So we will have these options in the similar way as config file ?
from piicatcher.
Yes: https://tokern.io/docs/piicatcher/2-usage You can use --orm-host and other options. FYI: I am going to change those option names to "catalog-". For example, catalog-host etc. Right now, MySQL is supported
…
On Tue, Jan 21, 2020 at 12:10 PM jayeshagwan1 @.**> wrote: Thanks. So can we save this data directly in db? — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#52?email_source=notifications&email_token=AAMP7GSCJI47OUAOBQMFVVDQ62KGXA5CNFSM4KJOA4P2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOEJOU3SI#issuecomment-576540105>, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAMP7GSCOAQRGJ2INZ7JSZLQ62KGXANCNFSM4KJOA4PQ .
Can you provide the exact command for this ? So I have 2 config files 1 for connecting db and read the db and other for inserting into db.
from piicatcher.
Single config file
catalog_host=".."
catalog_port="..."
catalog_user="..."
catalog_password="..."
[db]
...
Also can you give me feedback on the docs ?
https://tokern.io/docs/piicatcher/2-usage#configuration-file
https://tokern.io/docs/piicatcher/3-catalog#database
from piicatcher.
Docs look good, just one observation. In docs its hyphen eg catalog-host and in above response you mentioned catalog_host.
Also if we can add sample of command for each type, it would help.
from piicatcher.
from piicatcher.
Can we add one more column for Pii type ?
In the above screenshot, ACQUISITION_DATE if of which type of PII.
from piicatcher.
from piicatcher.
Aaah. Ok
from piicatcher.
@vrajat Can we keep output consistent for all ? In db, json file shows
If we try file type then response is:
So in file type we are not able to identify which column has PII data and what type of PII data. If file output would also show same as db one, it would be better to understand
from piicatcher.
from piicatcher.
I'll be removing support for files.
from piicatcher.
Related Issues (20)
- Error parsing info on dropped column during deep (data) detect command
- sqlalchemy.orm.exc.NoResultFound: No row was found for one() HOT 6
- Datahub ingestion function HOT 2
- No row was found for one() when trying Local File
- Support Google Cloud BigQuery HOT 1
- Support Google Cloud Spanner HOT 1
- Unable to Connect to Postgres HOT 5
- Scan can take DAYS on large database clusters HOT 2
- Support OpenMetadata integration HOT 1
- Redshift doesnt support bernoulli tablesample HOT 1
- Unclear example of export to datahub HOT 4
- Unique Constraint Failed HOT 1
- Update ReadMe to accommodate new commands and remove outdated data
- Switches for views, external schemas, remote DBs, etc. HOT 2
- piicatcher installation stuck HOT 6
- Columns names and data are identified incorrectly pii HOT 1
- Connection refused when scanning Postgres with Docker HOT 3
- error: subprocess-exited-with-error HOT 5
- PII installation error HOT 3
- Unable to scan Redshift catalog
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from piicatcher.