Giter Club home page Giter Club logo

tsvalid's Introduction

tsvalid's People

Contributors

matentzn avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar

tsvalid's Issues

integration with other libs like csvkit

This is a really useful lib!

@hrshdhgd -- we could really have used this for our NER project with Mayo where we kept getting these broken TSVs that messed things up.

I am starting to use csvkit more and more - I am wondering if it makes sense to donate some functionality there, or if this is more specific to our use cases:

https://github.com/wireservice/csvkit

There are various others like mlr etc, but I am tending to coalesce around csvkit

[no response expected for some time and ok to close as won't-do]

The first version of TSValid is ready for review

@jamesaoverton

You can start from the README.md, and then take a bit of a look around if you have time. I think it would be good if you could look at the project organisation a little bit as well.

  • I have done a huge deal of work on the QC stuff, black, flake8, code cleanliness etc.
  • The encoding error check does not do what we thought it would (which is pointing out bad characters) - turns out you can stick anything into a utf-8 encoded file. We probably need to iterate over this a bit, but do have some tests in place.
  • Let's use this for a while now, and gather issues. Adding checks should be really easy at this point.

You can close this issue once you are happy with the general organisation of the code, and then we should start using the tool in our day to day work and start filing bug reports.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.