Giter Club home page Giter Club logo

esem2019's Introduction

What is this?

This repo contains a replication package for the paper entitled ‘‘Why is Developing Machine Learning Applications Challenging? A Study on Stack Overflow Posts’’ published as part of the 2019 ESEM conference.

@inproceedings{DBLP:conf/esem/AlshangitiSMLY19,
  author    = {Moayad Alshangiti and
               Hitesh Sapkota and
               Pradeep K. Murukannaiah and
               Xumin Liu and
               Qi Yu},
  title     = {Why is Developing Machine Learning Applications Challenging? {A} Study
               on Stack Overflow Posts},
  booktitle = {2019 {ACM/IEEE} International Symposium on Empirical Software Engineering
               and Measurement, {ESEM} 2019, Porto de Galinhas, Recife, Brazil, September
               19-20, 2019},
  pages     = {1--11},
  publisher = {{IEEE}},
  year      = {2019},
  url       = {https://doi.org/10.1109/ESEM.2019.8870187},
  doi       = {10.1109/ESEM.2019.8870187},
  timestamp = {Wed, 23 Oct 2019 17:15:06 +0200},
  biburl    = {https://dblp.org/rec/bib/conf/esem/AlshangitiSMLY19},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

What is included?

The package consists of the following:

  1. code.R: This file contains all the code needed to replicate all the figures found in the paper. We have provided detailed commentary with the code to help explain the content.

  2. quantitative_sample: This folder contains the StackOverflow quantitative study sample discussed in the paper consisting of 86983 ML related questions posts. Moreover, the answers (when an accepted answer is available) are also provided for the sample. Finally, we provided the web development sample that was used as part of RQ1 to compare the response time between web development questions and machine learning questions.

  3. qualitative_sample: This folder contains the StackOverflow qualitative study sample discussed in the paper consisting of 684 ML related questions generated by 50 unique users alongside their labels. Moreover, the user expertise labels are also provided.

  4. custom: This folder encapsulates all other data used within the paper. Specifically, the LDA and topic-term matrices for the discovered 30 topics. The tags and their statistics, and the ExpertiseRank score generated to compare the number of experts in machine learning against web development.

esem2019's People

Contributors

mshangiti avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.