Giter Club home page Giter Club logo

ai_research_dynamics's Introduction

Research Community Dynamics behind Popular AI Benchmarks

The widespread use of experimental benchmarks in AI research has created new competition and collaboration dynamics that are still poorly understood. In this work we provide an innovative methodology to explore this dynamics and analyse the way different entrants in these competitions, from academia to tech giants, behave and react depending on their own or others’ achievements.

  • We perform an analysis of twenty five popular benchmarks in AI from Papers With Code, with around two thousand result entries overall, connected with their underlying research papers.

  • We identify links between researchers and institutions (i.e., communities) beyond the standard co-authorship relations, and we explore a series of hypotheses about their behaviour as well as some aggregated results in terms of activity, performance jumps and efficiency.

  • We detect and characterise the dynamics of research communities at different levels of abstraction, including organisation, affiliation, trajectories, results and activity.

Alt text

The following R/Python code (as well the data and results) is based on the methodology explained in:

Research Community Dynamics behind Popular AI Benchmarks by Fernando Martinez-Plumed, Pablo Barredo, Sean ́O h ́Eigeartaigh and Jose Hernandez-Orallo (Nature Machine Intelligence, 2021)

LICENCE: GPL

Code

  • benchmark_analyzer.ipynb: Code for extracting benchmark data from Papers With Code and affiliation data from Scinapse

  • SOTAfront_plots.R: Code for ploting the results in the paper.

  • hypotheses_testing.R: Code for testing the hypotheses in the paper.

Data

(Data/ folder) Papers, authors, results, community memberships, SOTA jumps and dates.

  • Image Classification

    • ImageNet
    • CIFAR-100
  • Semantic Segmentation

    • Cityscapes
    • Pascal VOC 2012 test
  • Object Detection

    • COCO test-dev
    • COCO Minival
  • Image Generation

    • CIFAR-10
  • Pose Estimation

    • MPPII Human Pose
  • Action Recognition

    • Videos on UCF101
    • Videos on HMDB-51
  • Image Super-Resolution

    • Set5 - 4x upscaling
  • Machine Translation

    • WMT2014 Eng-Ger
    • WMT2014 Eng-Fre
  • Question Answering

    • SQuAD1.1
    • WikiQA
  • Language Modelling

    • Penn Treebank
    • enwik8
  • Sentiment Analysis

    • SST-2 Binary classification
    • IMDb
  • Named Entity Recognition

    • CoNLL 2003 (English)
    • Ontonotes v5 (English)
  • Speech Recognition

    • LibriSpeech test-clean
  • Link Prediction

    • WN18RR
  • Atari

    • Atari 2600 Montezuma's Revenge
    • Atari 2600 Space Invaders

Results

  • (/Baselines folder) High quality plots comparing community, affiliation and author grouping dynamics (progress in accuracy over time) for all the benchmarks analysed

  • (/Figures folder) High quality plots showing community grouping dynamics (progress in accuracy over time) for all the benchmarks analysed.

ai_research_dynamics's People

Contributors

nandomp avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.