Comments (4)
I agree. To answer these two questions, I would suggest that for Section 3:
where possible, all case studies should be reproducible using publicly available data and code.
i.e. we should be testing for reproducibility, and we should be using the original data. All the Section 3 chapters use either MIMIC (II or III), or simulated data. Therefore, the data used in each chapter are publicly available. I can't comment on whether or not this is the case for other Sections.
from critical-data-book.
What would we be testing for exactly?
I ask because it might be hard to test without the original data. Some chapters will have generic functions, but others will not.
from critical-data-book.
What would we be testing for exactly?
I was thinking mainly of unit tests. e.g. are the functions returning the results that we think they are returning? When we make changes to the code or when dependencies change, this will help to make sure things don't break. Pete also notes that his sections could do with testing in his pull request.
I ask because it might be hard to test without the original data.
Yep, this does add an extra level of complication for some tests. If data is a requirement for a test, we could run the tests on our local Jenkins server or we could make a demo dataset available. I agree that getting complete coverage will be tricky, and the range of languages doesn't help either. I'm only familiar with Python testing frameworks, so will need to investigate R and Matlab.
from critical-data-book.
Thanks Pete, I agree that it would be great to add some tests to check that the complete studies return expected results. We can try to do the same thing for section 2 too. It'll take us a while, but the nice thing about having the code here is that we can continue to tidy it up over time.
from critical-data-book.
Related Issues (16)
- Correct license HOT 7
- Chapter 28 missing Python script HOT 2
- R code for MCMC_solver.m ? HOT 11
- Error part_iii / chapter_23 HOT 6
- Code in chapter 23 did not match with what is described in the read me HOT 3
- Chapter 18 (trend analysis), how to retrieve year? HOT 16
- Chapter23: missing data for heart rate.
- Data needed for Chapter 18
- Critical Data Book
- section 3: chapter04_code.R mangled HOT 4
- Code in Section 2, Chapter 3c contains several files with a .m~ extension HOT 3
- Consistent approach needed for providing data to support code. HOT 2
- correct spelling of impute across all chapter 3c code
- data issues HOT 5
- Chapter #s HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from critical-data-book.