Comments (2)
Notes: This is primarily a diagnostic tool with the goal being able to compare the total dataset count for an organization within the registry to the total dataset count from CKAN.
from catalog-harvest-registry.
Reviving and possibly modifying an old issue in relation to ioos/ckanext-ioos-theme#148 and Solr index/db sync issues in CKAN.
Could we add extra columns to the About page on the Registry as follows:
Harvest Type: CS-W or WAF (unrelated to dataset count, but useful anyway)
Records with Errors: (or 'Bad Records' as labeled in each Harvest Source page) a count of the records in the Registry that failed validation in some way at the Registry validation level.
Also, alongside the Total Records count at the top of the page, can we sum the number of 'Records with Errors' from each harvest source to get a Total Records in Error count, and at the same time store in the Registry db somehow the total CKAN record count (available via CKAN API here: https://data.ioos.us/api/3/action/package_search).
This will allow a rough way to compare the total counts in Registry vs CKAN and see to what degree the inconsistency can be explained by Bad Records count.
To go further, we could subtract (Registry Total Records - Bad Records) - CKAN Total Records = Missing Records to show how many records are 'lost' in the CKAN harvest(s) due to any issues/additional validation/ or errors in the CKAN harvest.
I think this would help.
from catalog-harvest-registry.
Related Issues (20)
- Unverified account gets 500 Server Error when trying to login HOT 1
- Improving the default harvest schedule HOT 1
- About Page Dataset Count Improvements (+ CKAN Record count)
- Add Harvest type to the Harvests list HOT 1
- Default to user email for WAF contact
- CS-W Harvester Broken HOT 5
- Count inconsistency in Records in harvests HOT 1
- Add a 'x' button to cancel search filters HOT 1
- Qualify ERDDAP WAF harvest type HOT 1
- Automatic Admin Email List in Registry HOT 3
- Error message text not showing in Registry UI HOT 3
- Add qualifier text to the new user registration page HOT 6
- Add fields to the new user registration page HOT 1
- Update Dependencies HOT 6
- Verify harvest functionality on dev instance HOT 3
- Add extra fields to user registration verification email HOT 1
- Harvest Registry to CKAN harvest job connection broken HOT 2
- Cleanup data.ioos.us source WAFs for Registry HOT 2
- GLOS Harvesting Issues (WAF & CS-W) HOT 2
- Clarify whether to use ERDDAP-WAF or WAF harvest type for all ERDDAP harvest sources HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from catalog-harvest-registry.