Giter Club home page Giter Club logo

partyfactsdata's People

Contributors

fischerbit avatar hdigital avatar mrwunderbar666 avatar pbederke avatar polvis avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

partyfactsdata's Issues

run all script

Create R script to run all import R scripts. This would assure that all files can be run to create the data for the import. Its a lightweight alternative to unit testing.

add duplicate IDs

Datasets with cleaned-up party lists on Google drive ignore duplicates in Party Facts import.

Add these ignored party IDs in a new column and provide examples for dataset transformation.

CLEA 2014 parties not in 2016 data

Some of the parties from CLEA 2014 are not in CLEA 2016.

These parties were not yet removed from Party Facts when the CLEA 2016 data was imported on 3 August 2016.

Double check with CLEA team before finally removing these parties from Party Facts.

Here is a country summary and see list of parties (clea-2014-remove.xlsx). We need some informaton about the high number of parties from India.

clea14 <- read.csv("clea-2014/clea.csv", fileEncoding = "utf-8", as.is=TRUE)
clea14 %>% filter( ! ctr_pty %in% party$ctr_pty) %>% .[['country']] %>% table

AUS AUT CUW DEU IDN IND LUX MDA NLD SLE TUR USA 
  1   1   7   1   1  26   1   1   2   2   1   4 

partyfacts-clea.csv appears to be incomplete/outdated

The file at https://github.com/hdigital/partyfactsdata/blob/master/import/clea/partyfacts-clea.csv appears to be missing some parties.

For example, Demokratikus Koalíció with partyfacts_id = 469 (see: https://partyfacts.herokuapp.com/data/partycodes/469/) and dataset_party_id = 348000004 (see: https://partyfacts.herokuapp.com/data/partyall/51923/) does not appear in the file.

Interestingly, Demokratikus Koalíció is listed in clea.csv at https://github.com/hdigital/partyfactsdata/blob/master/import/clea/clea.csv with party_id = 348000004.

Where should I look if I want a more complete/up-to-date version of partyfacts-clea.csv?

Alternatively, if such a file does not already exist or cannot be provided, do you know how I can generate a complete version of partyfacts-clea.csv on my own using the data available online at https://partyfacts.herokuapp.com/ ?

keep Manifesto Project country names

Marpor country names are adjusted to Party Facts country names for different names used.

Keep Marpor country names and rename Party Facts country names used for merging.

different data formats on dataverse

On Harvard Dataverse, where the stable versions of the Party Facts datasets are available, most data files have the .tab extension. There are three exceptions:

  1. https://dataverse.harvard.edu/file.xhtml?fileId=4274155&version=1.1
  2. https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/UUMC31/JEM0W5&version=1.0
  3. https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/FTQAYT/L8EGUM&version=1.0

These have the .csv extension.

If this is not on purpose, perhaps all files could be of the same format for consistency reasons. I realised this difference when I was trying to write a function that would get these files systematically, where the extension makes a difference.

Perhaps this is not possible and/or desirable. In any case, let me say thank you for this great resource.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.