No libraries beyond the Anaconda distribution of Python should be needed to run the code here. The code should run without issues on any Python 3.* version.
For this project, I was interested in using Stack Overflow data from 2019 to better understand:
- How often do people contribute to Open Source?
- What are the characteristics of people who contribute to Open Source?
- How likely is a person to contribute according to different variables?
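
As a rough illustration of the last question, the share of contributors within each group of a variable could be computed along these lines. This is only a minimal sketch and not necessarily how the notebook does it. It assumes the survey's `OpenSourcer` column holds the contribution frequency, with `Never` marking non-contributors, and that `Hobbyist` is one of the grouping variables. The function name `contribution_rate_by_group` is hypothetical, and the rows below are made-up toy data, not survey results.

```python
from collections import defaultdict


def contribution_rate_by_group(rows, group_col,
                               contrib_col="OpenSourcer",
                               never_value="Never"):
    """Return {group: share of respondents who contribute at all}.

    `rows` is any iterable of dicts, e.g. from csv.DictReader.
    Column names and the "Never" label are assumptions about the data.
    """
    totals = defaultdict(int)
    contributors = defaultdict(int)
    for row in rows:
        group = row.get(group_col)
        answer = row.get(contrib_col)
        # Skip respondents with a missing group or missing answer.
        if not group or not answer:
            continue
        totals[group] += 1
        if answer != never_value:
            contributors[group] += 1
    return {g: contributors[g] / totals[g] for g in totals}


# Toy rows for illustration only (not real survey responses).
toy_rows = [
    {"Hobbyist": "Yes", "OpenSourcer": "Never"},
    {"Hobbyist": "Yes", "OpenSourcer": "Once a month or more often"},
    {"Hobbyist": "Yes", "OpenSourcer": "Less than once per year"},
    {"Hobbyist": "No", "OpenSourcer": "Never"},
    {"Hobbyist": "No", "OpenSourcer": "Never"},
    {"Hobbyist": "No", "OpenSourcer": ""},  # missing answer, skipped
]

rates = contribution_rate_by_group(toy_rows, "Hobbyist")
for group, rate in sorted(rates.items()):
    print(f"{group}: {rate:.2f}")
```

To run this on the real data, the rows could be read with `csv.DictReader` from the downloaded survey file, and `group_col` swapped for any other variable of interest.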
There is 1 notebook available here to showcase the work and analysis related to the above questions.
The main findings of the code can be found in the post available here.
Credit goes to Stack Overflow for the data. You can find the licensing for the data and other descriptive information here. Otherwise, feel free to use the code here as you would like!