Giter Club home page Giter Club logo

Comments (4)

rBatt avatar rBatt commented on August 25, 2024

@JWMorley @bselden I made a video to show how I was making some of the corrections --- they still aren't perfect by any means. But I do show how I filter down to some of the entries that might merit a second glance.

https://www.youtube.com/watch?v=RZlUds2Ph_0

Feel free to go about this however you want, if you can find time. Any help is much appreciated.

from trawldata.

rBatt avatar rBatt commented on August 25, 2024

@JWMorley @bselden Note that I edited the link to the video; so in your email you will still only see the old link. The new link is here: https://www.youtube.com/watch?v=RZlUds2Ph_0

And that is the same link that will appear on the GitHub issues site.

from trawldata.

rBatt avatar rBatt commented on August 25, 2024

@JWMorley @bselden @mpinsky

So, I'm updating the data sets (the US ones for now), and I found 1131 new raw taxonomic ID's that aren't in spp.key already ... whoa. I'll put my auto-match code to work, but everything added in this way will be given the flag of "added_automatically".

Working on properly adding these to the spp.key. It's basically done, just need to integrate it well with make() and add checks.

from trawldata.

rBatt avatar rBatt commented on August 25, 2024

I've recently gone through most lines of the spp.key manually.

I've manually checked 2654 rows in the recent effort; another 548 are "ok", 53 "manual", 577 "fine", 586 "bad", 316 "becca_batch2", and a lot of other random flags that indicate it's been checked in some way. In theory, the "bad" rows might need to be fixed, but they generally aren't ID'd to species, and are tossed out in the trim row due to that flag; so they aren't a big worry.

There are 1009 rows that were "added_automatically", and 349 have an NA flag. None of these rows pertain to species that are in the current trawlDiversity analysis (due to subsampling years, day of year, and strata).

So this is very near completion, and is much less of a worry for my current analysis, but could still use some work. I also wouldn't be surprised is some of my "check" rows had errors/ typos (I found 1 or 2 already). So it ain't perfect.

from trawldata.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.