Giter Club home page Giter Club logo

Comments (17)

Smahoney37 avatar Smahoney37 commented on September 21, 2024

Attached:
a file containing the availability of granular data by state - not necessarily district.
Arizona data - not cleaned

Arizona.zip

Availability of Data.txt

from election-transparency.

chrisdick14 avatar chrisdick14 commented on September 21, 2024

Awesome work! We will have to start assigning people to each of the states we can pull.

from election-transparency.

gvpeek avatar gvpeek commented on September 21, 2024

I'm interested to jump in and help. I was thinking I could start with Texas, since that's where I am. But I'm happy to take on some other states too. That being said, I had some questions...

  1. Is there a list of states people are assigned to?
  2. What is the ideal state of the data, cleaned to a certain spec or just raw for now?
  3. Is there a place for these files to be committed or are they just living in comments for now?

from election-transparency.

KirkHadley avatar KirkHadley commented on September 21, 2024

Hi,

So I'm not really sure where would be the best place to put these but I have for varying recent years (farthest back ~2008) voter files for CO, CT, DC, DE, FL, GA, MI, NC, OK, RI, UT, and WA. Would that be helpful?

from election-transparency.

kflanagan avatar kflanagan commented on September 21, 2024

You can find the current NC registered voter info here
https://data.world/kflanagan/nc-statewide-voter-info
Along with it is the SQL statement to create columns

from election-transparency.

chrisdick14 avatar chrisdick14 commented on September 21, 2024

@KirkHadley and @kflanagan we can definitely use this information. However, this is slightly different data than we have been using in the past so let me think about where we want to store it, and how it will fit into our current structure.

from election-transparency.

KirkHadley avatar KirkHadley commented on September 21, 2024

@chrisdick14 I actually have that file for every NC election since 2005. Should I upload it to data.world?
@kflanagan Has any thought been put into standardizing election results at the state level? If so, I have all the states state level election results at the district level and am more than happy to share.

from election-transparency.

kflanagan avatar kflanagan commented on September 21, 2024

@KirkHadley and @chrisdick14 The source for the data I posted is the state, here's their link. I don't know if there are efforts to standardize but given the sate of things at the federal level I doubt it.
https://s3.amazonaws.com/dl.ncsbe.gov/data/ncvoter_Statewide.zip

from election-transparency.

chrisdick14 avatar chrisdick14 commented on September 21, 2024

@KirkHadley and @kflanagan there are two things we can do for these data. (1) You can post them yourself on data.world and tag them with 'd4d' and 'election transparency' (as well as any other tags you want to use), or (2) we can have you send us the data and we can upload directly to the d4d election transparency data.world page. I am totally fine either way. I agree about the standardization. The Open Elections Project has been doing some of this work: https://github.com/openelections/openelections-results-nc

I think one thing we could do is if we can get results from several states we can all agree on a format moving forward and put something out there, if that is something you all are interested in.

from election-transparency.

kflanagan avatar kflanagan commented on September 21, 2024

Given that I had already put the NC data up on data.world I just went and tagged them with d4d and election transparency. That'll get us started. I don't know what's best, the states keep their own formats, is it a good use of time to re-format every time they update the data? I think that NC updates weekly. Would use of data.world to present the data via SQL like queries be something that we could do to present it in a way that would allow folks to query across states?

from election-transparency.

chrisdick14 avatar chrisdick14 commented on September 21, 2024

@kflanagan I think that is a great idea. Especially with data that are coming out that regularly. I think if there were some 'clean' datasets we needed for projects we could pull the requisite data from your larger file and post it in the cleaned format that we end up using for analysis.

This is really fantastic. We are having a hackathon this weekend and who knows, someone may end up using these data in their analyses!

from election-transparency.

kflanagan avatar kflanagan commented on September 21, 2024

I found a flaw in my logic. Big data sets don't work so well it seems on data.world, file too large to extract from the archive. Maybe I'll try to upload the raw data, but of course the uncompressed file may be too big to upload raw. Perhaps we need to point at the county by county info for NC. I'll take a look at it this evening.

from election-transparency.

chrisdick14 avatar chrisdick14 commented on September 21, 2024

Let me know how big the data set would be. We can chat with the data.world folks and see if there is a work around. If not we may have some other options that I am exploring now to upload the data and make it public.

from election-transparency.

KirkHadley avatar KirkHadley commented on September 21, 2024

So I have voter files on a good number of states (I'm a squirrel with these things). Details on sizes and such:

State-Total Size, Number of Voter Files, Range of Years

  • CO-15gb, 14, 2013-17
  • CT- 8.4gb, 7, 2013,14,16,17
  • DC- 131mb, 6, 2014
  • DE- 1.1G, 7, 2013-15
  • FL-103G, 47, 2012-17
  • mbI- 31G, 8, 2014-16
  • NC- 134G, 51, 2012-17
  • OK- 3.6G, 6, 2014-16
  • RI- 250mb, 7, 2012-15
  • UT- 569mb, 6, 2014
  • WA- 1.8gb, 12, 2006-17

from election-transparency.

kflanagan avatar kflanagan commented on September 21, 2024

@KirkHadley is that the voter file that's found https://s3.amazonaws.com/dl.ncsbe.gov/data/ncvoter_Statewide.zip but with multiple years?

from election-transparency.

chrisdick14 avatar chrisdick14 commented on September 21, 2024

Ok, those are going to be too big for data.world I think. We are going to have to come up with another solution to host these. Let me do some asking around and see what we can find.

from election-transparency.

alistaire47 avatar alistaire47 commented on September 21, 2024

Hi, I'm Edward. I'm new and happy to help. To get rolling I scraped the relevant PDFs off of the DC BoE site in the link above to see how hard the PDFs are to parse. The answer is (predictably) not terribly easy, but possible.

Given that, what data do we want?

  • Since DC is all one district, just the whole city, or wards or precincts?
  • What time frames? They publish monthly, so everything, yearly (start? end?), before elections (which?)?

I also saw on their website that you can get the whole voter file on CD-ROM (yeah) for $2 (yeah). It's not clear if how it handles formerly registered voters, but it's as granular as you can get—but since it's individuals, it's at least dubious to republish it unaggregated, even though it's all public data. I'm not sure we want it, but it's entirely possible to assemble a national voterfile; e.g. you can grab the Ohio CD CSVs at will.

from election-transparency.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.