Giter Club home page Giter Club logo

neuralnlp's Introduction

neuralNLP

Code for integrating a generative topic model (embedded topic model, ETM) and ECoG data in mschrimpf's neural-nlp toolbox

Code for running an *word intruder* MTurk study

Description of available scripts

Data pre-processing for ETM

Code for preprocessing Pereira dataset for usage within the ETM (based on Dieng et al. (2019))

Code for filtering out stopwords as obtained from NLTK

Code for analyzing the 20NewsGroup dataset with respect to proportion of English words

Training of ETM (various parameters)

Code for training ETM models and outputting word-embedding matrix, rho, as well as topic vectors (distribution over vocabulary), betas.

Changing parameters: 1) Number of topics found by the model 2) Minimum document frequency of vocabulary words, i.e., vocabulary size 3) Normalization techniques for turning vocabulary distribution for topics into topic distribution for words

Evaluation of ETM

Brain-score: Model implementations

Implementation of ETM in the brain-score framework and debugging. Possible to test ETM using different model parameters and outputs.

Brain-score: ECoG integration

Packaging of ECoG data for integration into the brain-score framework. Possible to run various language models on the ECoG data.

MTurk: Javascript experimental code and analysis

Code based on "minimal architecture for the generation of portable interactive experiments" script (_magpie); thanks to Polina Tsvilodub for providing code

Linear regression analysis

Analysis: Similarity between obtained embedding spaces

Matching of the GloVe and 20ng vocabulary spaces.

Pearson/Procrustes analyses between GloVe word embeddings and concurrently trained word embeddings from ETM (rho, embeddings in the word/topic space).

Analysis: Generating plots

:)

TODO: Code/analyses to be implemented

Solve dimensionality error to be able to map extracted vectors from rho-matrix onto neural data

Implement ceiling

Evaluate brain-score in different spatial regions of the brain

neuralnlp's People

Contributors

carina-kauf avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.