GeoCUTE

Geo Conversion Using Telephone book Entries

Installation

git clone git@github.com:MichaelKreil/geocute.git
cd geocute
npm install
npm link

Usage

geocute geo1 key1 geo2 key2 [pointlist] output

  • geo1: filename of source GeoJSON
  • key1: property name of the key in source GeoJSON
  • geo2: filename of target GeoJSON
  • key2: property name of the key in target GeoJSON
  • pointlist: (optional) name of the list of points to throw at the data. Can be:
    • ../data/deutschland.bin.br: (default) based on telephone book entries (?)
    • ../data/deutschland-only.bin.br: based on 2011 census (Zensus) data and LDBV address coordinates (2011)
    • ../data/berlin_blk.bin.br: Berlin only, based on "statistische Blöcke" (2019)
    • ../data/berlin_adr_ew.bin.br: Berlin only, based on "Sonderauswertung RBS-Adressen" (2016)
    • ../data/deutschland_berlin_blk.bin.br: Combined berlin_blk.bin for Berlin and deutschland.bin for all other states.
  • output: filename of resulting TSV file

Example

If you want to calculate a matrix for converting from "gemeinden" to "wahlkreise", use:

node geocute gemeinden.geojson AGS wahlkreise.geojson wkr_nr matrix.tsv

How it works

Many datasets have a spatial reference, e.g. unemployment statistics at municipality level, election results by constituency, or insolvencies by postcode. If two datasets use different spatial references, it is very difficult to combine them.

Take two datasets as an example: unemployment figures by municipality and insolvencies by postcode area. A postcode area can contain several municipalities, and a municipality can consist of several postcode areas. This makes converting between them, and thus comparing the statistics, very difficult. Furthermore, it is not sufficient to calculate the overlapping area between postcode areas and municipalities: a large wooded area with no inhabitants is certainly not as relevant to unemployment rates or insolvencies as a small settlement with a high population density.

This is exactly where GeoCUTE comes in: GeoCUTE contains a database with the addresses of telephone subscribers as geo-coordinates (in Germany, as of sometime between 2010 and 2015). The first step is to determine, for each of these geo-coordinates, which postcode area and which municipality it lies in. If there are 1000 of these coordinates in postcode area X, and 300 of these 1000 are also in municipality Y, then GeoCUTE assumes that 30% of the insolvencies in postcode area X fall in municipality Y.
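The arithmetic in this example can be sketched in a few lines of JavaScript. This is only an illustration: the `points` array, its field names, and the `countShares` helper are hypothetical and do not reflect GeoCUTE's internal data format. It assumes every address point has already been assigned to a postcode area and a municipality.

```javascript
// Hypothetical input: each address point already carries the postcode area
// and the municipality it lies in (illustrative data, not GeoCUTE's format).
const points = [];
for (let i = 0; i < 1000; i++) {
  // 300 of the 1000 points in postcode area "X" lie in municipality "Y"
  points.push({ postcode: 'X', municipality: i < 300 ? 'Y' : 'Z' });
}

// Count points per postcode area and per (postcode, municipality) pair,
// then divide to get the share of each postcode area in each municipality.
function countShares(points) {
  const totals = new Map();
  const pairs = new Map();
  for (const p of points) {
    totals.set(p.postcode, (totals.get(p.postcode) || 0) + 1);
    const key = p.postcode + '\t' + p.municipality;
    pairs.set(key, (pairs.get(key) || 0) + 1);
  }
  const shares = [];
  for (const [key, count] of pairs) {
    const [postcode, municipality] = key.split('\t');
    shares.push({ postcode, municipality, count, share: count / totals.get(postcode) });
  }
  return shares;
}

const shares = countShares(points);
const xy = shares.find(s => s.postcode === 'X' && s.municipality === 'Y');
console.log(xy.share); // 0.3

// Attribute 200 (made-up) insolvencies in X to Y according to that share.
const insolvenciesInX = 200;
console.log((insolvenciesInX * xy.count) / 1000); // 60
```

The remaining 70% of postcode area X would be attributed to the other municipalities it overlaps, so the shares for each source region sum to 1.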

GeoCUTE therefore calculates for each known address point in which region of spatial reference A and in which region of spatial reference B the point is located, and can use this to calculate a conversion matrix to convert from spatial reference A to spatial reference B.
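As a rough sketch of this idea (not GeoCUTE's actual implementation), the following JavaScript locates each point in both spatial references with a simple ray-casting test and accumulates a conversion matrix. The function names, the `{ key, ring }` region shape, and the toy data are assumptions made for illustration; real GeoJSON geometries also include holes and multipolygons, which this simplified version ignores.

```javascript
// Ray-casting test: is the point [x, y] inside a simple polygon ring?
function pointInRing([x, y], ring) {
  let inside = false;
  for (let i = 0, j = ring.length - 1; i < ring.length; j = i++) {
    const [xi, yi] = ring[i];
    const [xj, yj] = ring[j];
    const crosses = (yi > y) !== (yj > y) &&
      x < ((xj - xi) * (y - yi)) / (yj - yi) + xi;
    if (crosses) inside = !inside;
  }
  return inside;
}

// Return the key of the first region containing the point, or null.
function locate(point, regions) {
  const hit = regions.find(r => pointInRing(point, r.ring));
  return hit ? hit.key : null;
}

// Build a nested object: matrix[keyA][keyB] = share of A's points lying in B.
function buildMatrix(points, regionsA, regionsB) {
  const counts = {};
  for (const point of points) {
    const a = locate(point, regionsA);
    const b = locate(point, regionsB);
    if (a === null || b === null) continue; // point outside one of the references
    counts[a] = counts[a] || {};
    counts[a][b] = (counts[a][b] || 0) + 1;
  }
  const matrix = {};
  for (const [a, row] of Object.entries(counts)) {
    const total = Object.values(row).reduce((sum, n) => sum + n, 0);
    matrix[a] = {};
    for (const [b, n] of Object.entries(row)) matrix[a][b] = n / total;
  }
  return matrix;
}

// Toy data: reference A is one 2x2 square; reference B splits it into halves.
const regionsA = [{ key: 'A1', ring: [[0, 0], [2, 0], [2, 2], [0, 2]] }];
const regionsB = [
  { key: 'B1', ring: [[0, 0], [1, 0], [1, 2], [0, 2]] },
  { key: 'B2', ring: [[1, 0], [2, 0], [2, 2], [1, 2]] },
];
const addressPoints = [[0.5, 0.5], [0.5, 1.5], [0.2, 1.0], [1.5, 1.0]];

const matrix = buildMatrix(addressPoints, regionsA, regionsB);
console.log(matrix); // { A1: { B1: 0.75, B2: 0.25 } }
```

Although B1 and B2 cover equal areas, three of the four address points fall in B1, so 75% of A1 is attributed to B1. This is the difference between counting address points and comparing overlapping areas.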

In order to increase the quality even further, we added a weight to every address point: We estimated the number of inhabitants per address on the basis of the "population statistic in a 100m grid".
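The effect of such weights can be illustrated with a small JavaScript sketch. It assumes each point carries a hypothetical `weight` field holding the estimated number of inhabitants; the field names and the `weightedShares` helper are illustrative and not taken from GeoCUTE's code.

```javascript
// Hypothetical weighted points: `weight` is the estimated number of
// inhabitants at that address (made-up values for illustration).
const weightedPoints = [
  { from: 'X', to: 'Y', weight: 12 }, // e.g. an apartment block
  { from: 'X', to: 'Z', weight: 2 },
  { from: 'X', to: 'Z', weight: 2 },
];

// Same idea as counting points, but summing weights instead of adding 1.
function weightedShares(points) {
  const totals = {};
  const sums = {};
  for (const { from, to, weight } of points) {
    totals[from] = (totals[from] || 0) + weight;
    sums[from] = sums[from] || {};
    sums[from][to] = (sums[from][to] || 0) + weight;
  }
  const result = {};
  for (const from of Object.keys(sums)) {
    result[from] = {};
    for (const to of Object.keys(sums[from])) {
      result[from][to] = sums[from][to] / totals[from];
    }
  }
  return result;
}

const weighted = weightedShares(weightedPoints);
console.log(weighted.X.Y); // 0.75
console.log(weighted.X.Z); // 0.25
```

By point count alone, only one of the three addresses lies in Y (a share of one third); with the weights, the densely populated address dominates and Y receives 75%.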

