
Comments (4)

peterhcharlton commented on June 24, 2024

I agree. To answer these two questions, I would suggest that for Section 3:

> Where possible, all case studies should be reproducible using publicly available data and code.

That is, we should be testing for reproducibility, and we should be using the original data. All the Section 3 chapters use either MIMIC (II or III) or simulated data, so the data used in each chapter are publicly available. I can't comment on whether this is the case for other Sections.

from critical-data-book.

jraffa commented on June 24, 2024

What would we be testing for exactly?

I ask because it might be hard to test without the original data. Some chapters will have generic functions, but others will not.


tompollard commented on June 24, 2024

> What would we be testing for exactly?

I was thinking mainly of unit tests, e.g. are the functions returning the results that we think they are returning? When we make changes to the code, or when dependencies change, this will help to make sure things don't break. Pete also notes in his pull request that his sections could do with testing.

> I ask because it might be hard to test without the original data.

Yep, this does add an extra level of complication for some tests. If a test requires data, we could run it on our local Jenkins server, or we could make a demo dataset available. I agree that getting complete coverage will be tricky, and the range of languages doesn't help either. I'm only familiar with Python testing frameworks, so I will need to investigate R and Matlab.
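To make the idea concrete, here is a rough sketch of what such a unit test could look like in Python when run against a small demo dataset. The helper `mean_heart_rate` and the records in `DEMO_RECORDS` are hypothetical, invented purely for illustration; they are not taken from the book's code or from MIMIC.

```python
def mean_heart_rate(records):
    """Hypothetical analysis helper: mean of the non-missing 'heart_rate' values."""
    values = [r["heart_rate"] for r in records if r.get("heart_rate") is not None]
    if not values:
        raise ValueError("no heart rate values available")
    return sum(values) / len(values)


# Tiny made-up demo dataset, standing in for data that cannot ship with the tests.
DEMO_RECORDS = [
    {"subject_id": 1, "heart_rate": 60},
    {"subject_id": 2, "heart_rate": 80},
    {"subject_id": 3, "heart_rate": None},  # missing value should be ignored
]


def test_ignores_missing_values():
    # Only the two non-missing values (60 and 80) should contribute to the mean.
    assert mean_heart_rate(DEMO_RECORDS) == 70.0


def test_raises_on_empty_input():
    # With no usable values the helper should fail loudly, not return a number.
    try:
        mean_heart_rate([])
    except ValueError:
        return
    raise AssertionError("expected ValueError for empty input")
```

Functions named `test_*` like these can be collected by a runner such as pytest, or simply called directly. Because the demo records are defined inline, the tests do not need access to the original data.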


tompollard commented on June 24, 2024

Thanks Pete, I agree that it would be great to add some tests to check that the complete studies return the expected results. We can try to do the same thing for Section 2 too. It'll take us a while, but the nice thing about having the code here is that we can continue to tidy it up over time.
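One possible shape for a check on a complete study, sketched in Python: compare the headline results of a run against stored expected values, within a tolerance. Everything here is an assumption for illustration. `run_study` is a placeholder for a real case-study pipeline, and the numbers in `EXPECTED` are made up rather than taken from any chapter.

```python
import math


def check_results(actual, expected, rel_tol=1e-6):
    """Return the names of results that are missing or differ beyond rel_tol."""
    mismatches = []
    for name, expected_value in expected.items():
        if name not in actual:
            mismatches.append(name)
        elif not math.isclose(actual[name], expected_value, rel_tol=rel_tol):
            mismatches.append(name)
    return mismatches


# Made-up reference values; in practice these would be recorded from a trusted run.
EXPECTED = {"n_patients": 100, "mortality_rate": 0.12}


def run_study():
    """Placeholder for a full case-study pipeline (hypothetical)."""
    return {"n_patients": 100, "mortality_rate": 0.12}


def test_study_reproduces_expected_results():
    # An empty list means every expected result was reproduced within tolerance.
    assert check_results(run_study(), EXPECTED) == []
```

Comparing with a relative tolerance rather than exact equality is a deliberate choice here, since floating-point results can shift slightly when dependencies change.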

