oauni's Introduction

OA shares of German research institutions

This study is a collaboration of BMBF-funded projects OASE and OAUNI on the open access status of the publication output of German research institutions (universities, non-university research institutes, and federal and state institutions).

Research Questions

The goal is to answer three research questions:

  1. Has the OA fraction of the publication output of German universities and research institutions increased constantly over time?
  2. Can we observe differences between the research sectors of the German science system? Are there obvious explanations for them (such as different missions or subject profiles)?
  3. Which OA type is the most prevalent, and can we identify different patterns of OA adoption?

The analyses we conducted are documented within this repository.

Structure of the repo

The final analysis of the dataset is reported in analysis.Rmd.

The document data_gathering.Rmd describes how the data were selected, pulled from the different sources, combined, preprocessed and classified according to our own schema. The relevant code for this step (SQL and R scripts) is documented in the folder data_gathering_preprocessing.

The folder data contains datasets that we downloaded from external sources (like the ISSN to ISSN-L conversion list), as well as intermediate results of the data gathering process (some of them generated and stored only locally, not here on GitHub).

Some preliminary descriptive analysis that we undertook to better understand the data is contained in exploration.Rmd together with some plot-testing.

oauni's People

Contributors

ahobert, astridorth, maxheld83, naustica, njahn82


oauni's Issues

make data available

The objects in data are currently not available when the package is installed via devtools::install(), and they are also missing during testing.
It appears that the data are only available as exported objects after devtools::load_all().

So that appears to be a reproducibility bug.

Because a Docker image, and by extension Azure, requires a properly installed and working package, this currently makes it impossible to deploy the app.
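
One possible cause, offered as an assumption rather than a diagnosis: objects under data/ are only lazy-loaded from an installed package when DESCRIPTION sets `LazyData: true`, whereas devtools::load_all() typically makes them available regardless. A minimal base-R sketch for checking that field follows; the helper `has_lazy_data()` and the package name `examplepkg` are hypothetical and not part of this repo.

```r
# Hypothetical helper: TRUE if the DESCRIPTION file at `path`
# enables lazy-loading of datasets stored under data/.
has_lazy_data <- function(path) {
  desc <- read.dcf(path)  # character matrix, one column per field
  "LazyData" %in% colnames(desc) &&
    tolower(desc[1, "LazyData"]) %in% c("true", "yes")
}

# Illustration with a throwaway DESCRIPTION file (made-up package).
tmp <- tempfile()
writeLines(c("Package: examplepkg", "Version: 0.0.1", "LazyData: true"), tmp)
lazy_enabled <- has_lazy_data(tmp)

# A DESCRIPTION without the field, for comparison.
tmp_missing <- tempfile()
writeLines(c("Package: examplepkg", "Version: 0.0.1"), tmp_missing)
lazy_missing <- has_lazy_data(tmp_missing)
```

If the check fails for the real DESCRIPTION, adding the field (or calling data() explicitly where the objects are used) would be the first thing to try.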

migrate to muggle setup and publish

If you want, @Ahobert and @njahn82, I'd be happy to migrate this over to the muggle setup, so we can have this online as a proper vignette as well.

That would be a quick fix for the problem of making this public (#9): we can then have the reproducible code online (via echo = TRUE), but not the actual full git source.

I can't promise this would work quickly though, since that requires the project to actually be reproducible.

Also, relatedly, is there by any chance already a package for which this might be a vignette?

scatterplot: select axes

Add a select option for the axes, e.g. the possibility to plot total OA share vs. fully OA journal share.

reproduce overlap plot

In the preprint, this is written in Python.
@Ahobert can you just paste the code from your colleague right here in the comment?

merge repo with oa-german-inst

I think it might be nice to actually have these in one repo, and one R package, with the paper as a vignette, and the dashboard as another vignette.

reproduce issn to issn-l

This can be reproduced from open sources, but they may not always be available, so we should store it in Git LFS as well.

move pubs_cat.csv to git lfs

@Ahobert will share the file with me.
This file is the source for the different (publishable) aggregations.

pubs_cat.csv is created via the sql calls from proprietary datasets, which requires (personalised!) access credentials only available for some team members.
The pubs_cat.csv itself still includes proprietary data and must not be published.

purge secrets

TLDR for all the poor souls I had to tag now: @Ahobert asked me to purge some data from this repo, before it could go public.
This is done, but y'all will have to delete and re-clone it (see below comment).

  • minutes.md

app cannot be deployed from gh actions (there is no CI!)

Error in value[[3L]](cond) : 
  No root directory found in /srv/connect/apps/oadash-refs-heads-muggleme or its parent directories. Root criterion: contains a file `DESCRIPTION`
Calls: local ... tryCatch -> tryCatchList -> tryCatchOne -> <Anonymous>
Execution halted

Might be related to golem ThinkR-open/golem#387 (fix may not be on CRAN)

export and document `exclude_from` as vectors

While doing that we'll also remove the pk_kb_inst column, because it is somewhat proprietary (?) and in any event unnecessary: the downstream anti-joins only use the second column.
So this also covers #28, which can be closed.

So it's just a string.

deprecate group_by_

Listening on http://127.0.0.1:4465
Warning: `group_by_()` is deprecated as of dplyr 0.7.0.
Please use `group_by()` instead.
See vignette('programming') for more help
This warning is displayed once every 8 hours.
Call `lifecycle::last_warnings()` to see where this warning was generated.
Warning: Ignoring unknown aesthetics: text
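
A minimal sketch of the replacement, assuming dplyr 1.0.0 or later and assuming the old `group_by_()` call received column names as strings. The data frame, its column names and the variable `group_col` are made up for illustration and are not taken from the app.

```r
library(dplyr)

# Made-up example data: one row per publication.
pubs <- data.frame(
  sector = c("university", "university", "non-university", "non-university"),
  is_oa = c(TRUE, FALSE, TRUE, TRUE)
)

# Column name held as a string, as group_by_() used to accept.
group_col <- "sector"

# across(all_of(...)) selects the grouping column by its string name.
oa_share <- pubs %>%
  group_by(across(all_of(group_col))) %>%
  summarise(oa_share = mean(is_oa), .groups = "drop")
```

Where the grouping column is known in advance, a plain `group_by(sector)` is enough; the `across(all_of())` form only matters when the name arrives as a string, e.g. from a UI input.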

migrate to flexdashboard

From what I can see now, this could probably all be done with crosstalk, obviating the need for a shiny runtime.
It might also be worth standardising on one dashboard solution.
I think flexdashboard is, at this point, the frontrunner, because it is more actively maintained and more lightweight.
