academic-keyword-occurrence's Introduction

Historic word occurrence in academic papers

Summary

This script extracts the historic word occurrence of a search term in academic papers (from Google Scholar). It allows for spotting trends in research and analyzing the relevance of a topic over time.

Usage

python extract_occurrences.py '<keyword>' <start date> <end date>

This command lists, for every year in the given range, the number of publications that use the keyword. Both dates are given as years (e.g. 2000 and 2015). The script searches articles only; patents and citations are excluded.

The script requires a few packages (e.g. Beautiful Soup 4), which you can install with pip.
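
The command-line interface described above can be sketched with argparse. This is a minimal illustration, not the code in extract_occurrences.py: the names parse_args and years_to_query are hypothetical, and the sketch assumes that the end year is inclusive and that an optional fourth argument names the output file (defaulting to out.csv, as in the example below).

```python
import argparse


def parse_args(argv=None):
    """Parse the positional arguments described in the Usage section."""
    parser = argparse.ArgumentParser(
        description="Count yearly occurrences of a keyword in academic papers."
    )
    parser.add_argument("keyword", help="search term, e.g. 'bitcoin'")
    parser.add_argument("start_year", type=int, help="first year of the span")
    parser.add_argument("end_year", type=int, help="last year of the span (inclusive)")
    # Assumption: optional fourth argument, defaulting to 'out.csv'.
    parser.add_argument("output", nargs="?", default="out.csv", help="output file")
    args = parser.parse_args(argv)
    if args.start_year > args.end_year:
        parser.error("start year must not be later than end year")
    return args


def years_to_query(args):
    """Return every year in the requested span (one search per year)."""
    return list(range(args.start_year, args.end_year + 1))
```

Calling parse_args(["bitcoin", "2000", "2015"]) yields the default output file, and years_to_query then covers 2000 through 2015.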

Example

  • Search term: 'bitcoin'
  • Desired time span: 2000 to 2015
  • (Optional) Output file: 'results/out.csv'
  • Command: python extract_occurrences.py 'bitcoin' 2000 2015 'results/out.csv'
  • Output: results/out.csv (or out.csv by default), with the following contents:
year results
... ...
2011 141
2012 292
2013 889
2014 2370
2015 2580

[Figure: bitcoin chart]
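
To spot a trend in such output, the yearly counts can be turned into year-over-year growth factors. The sketch below is an illustration only: parse_results and yearly_growth are hypothetical helpers, the sample rows are the ones listed above, and the parser assumes that each row holds a year and a count separated by whitespace or a comma (the exact delimiter of the output file is not specified here).

```python
import re

# Sample rows copied from the example output above.
SAMPLE = """year results
2011 141
2012 292
2013 889
2014 2370
2015 2580
"""


def parse_results(text):
    """Map year -> result count, skipping the header and non-numeric rows."""
    counts = {}
    for line in text.splitlines():
        # Assumption: fields are separated by whitespace and/or a comma.
        fields = [f for f in re.split(r"[,\s]+", line.strip()) if f]
        if len(fields) == 2 and fields[0].isdigit() and fields[1].isdigit():
            counts[int(fields[0])] = int(fields[1])
    return counts


def yearly_growth(counts):
    """Return year -> count divided by the previous year's count."""
    growth = {}
    for year in sorted(counts):
        prev = counts.get(year - 1)
        if prev:  # skip years with a missing or zero baseline
            growth[year] = counts[year] / prev
    return growth


if __name__ == "__main__":
    for year, factor in yearly_growth(parse_results(SAMPLE)).items():
        print(f"{year}: x{factor:.2f}")
```

To analyze a real run, read the output file's contents and pass the text to parse_results instead of SAMPLE.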

Troubleshooting

OS X only: Python 3.6 does not include any SSL certificates, so every HTTPS request fails because the server's certificate cannot be verified.

This will lead to the following error:

urllib.error.URLError: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed

Fix: Execute /Applications/Python\ 3.6/Install\ Certificates.command to install the certifi package. (More details: https://stackoverflow.com/a/42334357)
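
If running the installer command is not an option, a script can also point urllib at certifi's CA bundle explicitly. The following is a sketch of that alternative, not part of extract_occurrences.py: make_ssl_context and fetch are hypothetical names, and certifi is assumed to be installed via pip (the sketch falls back to the system's default certificate store when it is not).

```python
import ssl
import urllib.request

try:
    import certifi  # third-party package providing a CA bundle
except ImportError:
    certifi = None


def make_ssl_context():
    """Build a verifying SSL context, preferring certifi's CA bundle."""
    cafile = certifi.where() if certifi is not None else None
    # With cafile=None, Python loads the system's default CA certificates.
    return ssl.create_default_context(cafile=cafile)


def fetch(url, timeout=10):
    """Fetch a URL, verifying the server certificate with the context above."""
    context = make_ssl_context()
    with urllib.request.urlopen(url, timeout=timeout, context=context) as resp:
        return resp.read()
```

Certificate verification stays enabled in this sketch; only the source of the trusted root certificates changes.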

Credits

Created by Volker Strobel

If you use this code in academic papers, please cite this repository via Zenodo (http://doi.org/10.5281/zenodo.1218409):

Volker Strobel. (2018, April 14). Pold87/academic-keyword-occurrence: First release (Version v1.0.0). Zenodo. http://doi.org/10.5281/zenodo.1218409
