Giter Club home page Giter Club logo

leiap's Introduction

Jacob Deppen

Twitter Follow

LinkedIn

I was trained as an archaeologist but found my passion in data science. Here are some things I like and the public repos to prove it. You can read more details on everything at deppen8.github.io.

Building open source tools to enable data science and research

pandas-vet Flake8 linter plug-in badge GitHub stars PyPI PyPI - Downloads Conda Version Conda Downloads

prospect Python package badge PyPI PyPI - Downloads Conda Version Conda Downloads

leiap Python package badge docs_badge

HTCondor Submission Creator badge

Data visualization and communication

BBHOFviz badge

Husky Altair theme badge Binder

TremorViz ghw_badge badge

Community, teaching, and learning

Documenting your code: from docstrings to automated builds @ PyCon US 2022 tutorial YouTube

Sprints - Let's Do More of Those! @ PyGotham 2019 YouTube

SciPy Conference 2021 - Sprints Co-chair

Puget Sound Python (PuPPy) Meetup - Scientific Computing SIG (co-organizer) Meetup

pandas in black and white badge site_badge

Certified Software Carpentry Instructor

leiap's People

Contributors

deppen8 avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar

leiap's Issues

Create/Update geo_fields

All survey points have a FieldNumber. Ideally, the Easting and Northing should lie geographically within that field. However, that is not always the case: sometimes field boundaries are too difficult to see, sometimes the GPS is not quite accurate enough near field edges, and sometimes there are simply errors in recording.

We should create/update a new column called geo_field that finds the actual geographic FieldNumber of the point. This will require loading the fields shapefile and using geopandas or something similar to find the intersection.

Not sure if this should be part of the checks.py module or something else.

Initial Update

The bot created this issue to inform you that pyup.io has been set up on this repo.
Once you have closed it, the bot will open pull requests for updates as soon as they are available.

Create working dashboard

There are some key decisions to be made still:
1. Use IPy widgets instead of bokeh widgets? IPy widgets CAN control bokeh plots (see bokeh docs), so that seems promising.

2. Given question 1, should we stick with bokeh or try things with altair instead? My inclination is to stick with bokeh because of altair's problems with really big datasets (though this might be a problem with bokeh too).

time_span_chart() outstanding elements

There are some things left to do on report.time_span_chart() to make it look like the one produced with matplotlib for previous annual reports.

  • move time values to top axis (or duplicate them there)
  • add count values to each production entry
  • move time period labels outside the chart area
  • add a count of artifacts without time period outside the chart area
  • decide whether to use size or color to represent proportion; probably bad practice to use both

fillna(0) workaround for artifacts without coordinates

When importing artifacts from the database, 10 of them did not have corresponding survey points. This raises a couple of questions:

  1. How did these artifacts get entered if there are no corresponding points? Were the points deleted accidentally?
  2. What should we do about it?

For now, I have added a .fillna(0) to the Eastings and Northings in the leiap.spatial.find_geo_field() function. This allows the spatial join (gpd.sjoin()) to proceed without throwing an error, but it will cause headaches later if you try to map these artifacts, so it is not ideal behavior.

For the record, these are the artifacts:

  • 160060-44-202-001
  • 160060-44-202-002
  • 160060-44-202-003
  • 160060-44-203-001
  • 160060-44-203-002
  • 160060-44-204-001
  • 160060-44-204-002
  • 18038b-65-003-001
  • 18038b-65-003-002
  • 18038b-65-003-003

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.