ingest_cmorph's People

Contributors

monocongo

Forkers

zhpfu keruicode

ingest_cmorph's Issues

Add multiprocessing

Create a worker process that downloads all files for a month, sums the data, and writes the monthly values at the appropriate month index of the precipitation variable. Build a range of time indices and run the worker for each one in a process pool sized by the number of CPUs specified (command line option?), or all available CPUs by default. Follow the pattern used in indices_python/process_grid.py, etc.
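A minimal sketch of the pool pattern described above, not the code in indices_python/process_grid.py. The names `process_month`, `ingest_months`, and `GRID_SHAPE` are hypothetical, and the worker fabricates its data instead of downloading anything. Unlike the issue text, the workers here return their sums and the parent fills the output array, an assumption made so that only one process ever writes.

```python
import multiprocessing

import numpy as np

GRID_SHAPE = (2, 3)  # hypothetical (lat, lon) size, kept tiny so the sketch runs fast


def process_month(month_index):
    """Hypothetical worker standing in for: download the month's files, sum them."""
    # Fabricated daily grids; the real ingest would read them from downloaded files.
    daily = np.full((4,) + GRID_SHAPE, float(month_index))
    return month_index, daily.sum(axis=0)


def ingest_months(month_indices, num_processes=None):
    """Compute monthly sums, in parallel by default, and collect them by time index."""
    month_indices = list(month_indices)
    if num_processes == 1:
        # Serial path, handy for debugging without a pool.
        results = [process_month(i) for i in month_indices]
    else:
        # processes=None makes the pool use all available CPUs.
        with multiprocessing.Pool(processes=num_processes) as pool:
            results = pool.map(process_month, month_indices)

    # Assumes the indices are 0..N-1, so each one is also a position on the time axis.
    precipitation = np.full((len(month_indices),) + GRID_SHAPE, np.nan)
    for month_index, monthly_sum in results:
        precipitation[month_index] = monthly_sum
    return precipitation
```

A call such as `ingest_months(range(12), num_processes=4)` would then come from the command line handler. On platforms that start workers by spawning a fresh interpreter, that call belongs under an `if __name__ == "__main__":` guard.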

Consolidate duplicate code

Consolidate the duplicate code for NetCDF creation into a single function. Take care of any other duplicate code found in the same manner.

Missing/fill value handling

  • after reading binary data, convert all missing values (-999.9 etc.) to numpy.nan in the data arrays before writing to NetCDF
  • modify the fill_value used when creating the NetCDF variable to reflect the above change
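The conversion step might look like the sketch below. The function name `missing_to_nan` and the default sentinel tuple are assumptions; the issue only says "-999.9 etc.", so the real list of missing values has to come from the dataset documentation. The comparison uses `numpy.isclose` because a sentinel read as 32-bit float will not compare exactly equal to the Python literal.

```python
import numpy as np


def missing_to_nan(values, missing_values=(-999.9,)):
    """Return a float32 copy of values with each sentinel replaced by NaN."""
    data = np.array(values, dtype=np.float32)  # np.array copies, so the input is untouched
    for missing in missing_values:
        # Tolerant comparison: float32 cannot represent -999.9 exactly.
        data[np.isclose(data, missing)] = np.nan
    return data
```

If the file is written with the netCDF4 package, the matching change for the second bullet would be to pass `fill_value=numpy.nan` to `createVariable`.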

Handle final year/month more gracefully

Add code to stop input processing at the actual final month of the final year. The code currently crashes at the end of the run. The file closes and writes OK, but I think the crash is leading to other issues when the file is used later in the pipeline, which are discussed on the NCO Help Forum.
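One way to make the stopping point explicit is to drive the ingest from a bounded sequence of (year, month) pairs. This is a sketch under the assumption that the final year and month are known before the loop starts; `iter_year_months` is a hypothetical helper, not a function from the existing code.

```python
def iter_year_months(start_year, start_month, end_year, end_month):
    """Yield (year, month) pairs from the start month through the end month, inclusive."""
    year, month = start_year, start_month
    # Tuple comparison orders by year first, then month.
    while (year, month) <= (end_year, end_month):
        yield year, month
        month += 1
        if month > 12:
            year, month = year + 1, 1
```

Looping with `for year, month in iter_year_months(...)` ends cleanly after the last available month instead of running through December of the final year.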

Incorrect handling of full grid lat/lon indices

We used lat_start/lat_end as the range limits for the full grid, but these variables were mistakenly reused as range indices when chopping out CONUS. They were then used as such even when CONUS-only was not selected, so they were not set to proper values when ingesting the full grid.

Provide the ability to ingest gauge adjusted CMORPH datasets

Add the capability to ingest the gauge adjusted data files, for use in a comparative analysis of SPI results. These are located in (more or less) the same FTP location as the raw files, so we'll create an appropriate download URL based on a "raw or adjusted" conditional.

Add code to perform download and unzip from FTP address

Files are currently expected to be resident on the machine where the ingest is performed. Add code that instead goes to the FTP location for the file(s), downloads and unzips, reads, then (optionally?) deletes.

  • Verify with Olivier that the FTP site is public
  • Use urllib if possible (avoid third-party libs)
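The download, unzip, and optional delete steps above can be sketched with the standard library alone. This assumes the remote files are gzip-compressed, which the issue does not state, and `download_and_gunzip` is a hypothetical name. `urllib.request.urlopen` accepts `ftp://` URLs, so no third-party FTP client is needed.

```python
import gzip
import os
import posixpath
import shutil
import urllib.parse
import urllib.request


def download_and_gunzip(url, destination_dir, delete_compressed=True):
    """Fetch url into destination_dir, decompress it, and return the data file path."""
    os.makedirs(destination_dir, exist_ok=True)
    file_name = posixpath.basename(urllib.parse.urlparse(url).path)
    compressed_path = os.path.join(destination_dir, file_name)

    # Stream the remote file to disk rather than holding it all in memory.
    with urllib.request.urlopen(url) as response, open(compressed_path, "wb") as out:
        shutil.copyfileobj(response, out)

    # Drop a trailing ".gz" if present; otherwise add a suffix to avoid overwriting.
    if compressed_path.endswith(".gz"):
        data_path = compressed_path[:-3]
    else:
        data_path = compressed_path + ".unzipped"

    with gzip.open(compressed_path, "rb") as source, open(data_path, "wb") as target:
        shutil.copyfileobj(source, target)

    if delete_compressed:
        os.remove(compressed_path)
    return data_path
```

The caller would read the returned file and remove it afterwards if the "(optionally?) deletes" behavior is wanted for the decompressed data as well.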
