Comments (8)
Longer term, our intent is to deal with the issue of changing datasets by more clearly defined version management, in which any change in any of those factors would be represented as a distinct different version of the dataset.
from conp-portal.
I understand. So this implies that expanding dataset with frequent (daily/weekly) releases, the DATS document will need to get updated and versioned accordingly?
from conp-portal.
That would be the expectation with the current model, yes.
from conp-portal.
I think it would be nice to have a way to support projects with rolling releases as well. Such projects sometimes want to describe their cohort and datasets content in a standardized way, without entering into the specifics of how many files, what size they are, etc.
from conp-portal.
Exactly how much data are you envisioning storing on CONP, and of what sort? Our processing involves building fixed links to every distinct file, so that needs redoing for anything that changes from release to release.
from conp-portal.
Right now we have two cohorts of >5000 participants, with thousands of whole genomes, whole exomes, etc. But data is under controlled access, which means files wouldn't be indexed by CONP. It's the dataset provenance that we're aiming to describe, rather than its content.
from conp-portal.
Hi David, I was discussing briefly with Emmet this morning. Are the datasets you have in mind those from the BQC19 which we have discussed in the context of distribution through NeuroHub, or different datasets? Thanks!
from conp-portal.
Hi Bryan, this one and others. We have a few cohorts supported in Bento currently, often in a rolling release kind of way. We prepare a DATS file to annotate the datasets, but we're not always able to provide precise details about that dataset content.
from conp-portal.
Related Issues (20)
- Look into SHACL schema validation for CONP data in Nexus
- Addition of a number of views and number of downloads sort by functionality
- Have a way to add metadata that we generate to better describe tools and datasets HOT 2
- Marking "coming soon" data in the portal. HOT 1
- DATS Editor: Date pickers should allow shortcut to change year. Dates should also follow ISO-8601 standard. HOT 1
- extending DATS metadata to enable multiple new features HOT 6
- Add search/filter by age buckets - pedriatric, geriatric, lifespan
- License: Hyperlink / hoverhelp for context
- allow search for eeg OR electroencephalography
- versioned Terms of Use HOT 2
- DATS.json : hover/hyperlink to these in the list of datasets
- Task Executions page: Error message trigger can be stood down/ disabled
- Tool Executions page: add-on wishlist
- Tool License
- Proposal: DATS.json redesign for Interlex-based cross-references with CBRAIN HOT 3
- Revised proposal for Interlex-based cross-references with CBRAIN HOT 3
- Datasets: Option to provide Platform logo for third-party / offsite download platforms (vs. Study/org logo)
- List New/recent tools
- [ Tools ] 2 little important Bugs: Licenses are wrong + popup message glitch
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from conp-portal.