master_thesis_ubc's People

Contributors

santina, waffle-iron

Forkers

waffle-iron

master_thesis_ubc's Issues

(ongoing) Things that still need to be added to the repo

I will keep this issue open until the end of the thesis, since more things will need to be added to this repo.
As of now, I need to add:

  • Documentation on the dataset
    • How to download it
    • How big it is
    • What it looks like once downloaded
  • Parsing code for the data

Modify geniatagger for better usage

Geniatagger prints everything to the terminal. It would be nice to modify it so that it emits warnings and other status messages in a form that the Python code calling it can capture.
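Short of modifying the tagger itself, one option is to capture its output from the Python side. The sketch below is only an illustration: `run_tagger` is a hypothetical helper, the command to launch geniatagger is left to the caller, and it assumes that tagged text arrives on standard output while status messages arrive on standard error, which should be checked against the actual binary.

```python
import subprocess


def run_tagger(cmd, text):
    """Run a command-line tagger and return (tagged_lines, status_lines).

    `cmd` is the argument list that launches the tagger. The exact
    geniatagger invocation is not assumed here; the caller supplies it.
    """
    proc = subprocess.run(
        cmd,
        input=text,           # sentences to tag, one per line
        capture_output=True,  # keep stdout and stderr off the terminal
        text=True,
        check=False,
    )
    tagged = proc.stdout.splitlines()
    status = proc.stderr.splitlines()
    if proc.returncode != 0:
        # Surface the captured status lines instead of losing them.
        raise RuntimeError(f"tagger exited with {proc.returncode}: {status}")
    return tagged, status
```

With this shape, the calling code can log or inspect `status` rather than having it interleaved with the terminal output.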

Survey services for human annotation

Once Medline_13M is done, the results will need to be validated beyond just using PubMed as ground truth. That is, how do our SVD predictions compare to PubMed's closely related papers, which are calculated from manually assigned MeSH headings?
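One simple way to put a number on that comparison is precision at k: the fraction of the top-k SVD predictions that also appear in PubMed's related list. This is a sketch of one possible measure, not the evaluation actually used in the thesis; `precision_at_k` and the article IDs are made up for illustration.

```python
def precision_at_k(predicted, reference, k=10):
    """Fraction of the top-k predicted IDs that appear in the reference list."""
    top_k = predicted[:k]
    if not top_k:
        return 0.0
    reference_set = set(reference)
    hits = sum(1 for article_id in top_k if article_id in reference_set)
    return hits / len(top_k)


# Hypothetical article IDs, for illustration only.
svd_ranked = ["a1", "a2", "a3", "a4"]
pubmed_related = ["a2", "a4", "a9"]
score = precision_at_k(svd_ranked, pubmed_related, k=4)
```

The same function could later score predictions against human annotations by passing the annotated list as `reference`.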

Plan A: Look into whether existing survey tools would allow me to set up a survey programmatically.

  • SurveyMonkey
  • Google Forms + spreadsheet, using existing plugins or writing my own

Plan B: If Plan A fails, I would need to set up my own website for human annotations.

  • Idea: Set up an SQL database (on Azure? GSC?) and a website (a chance to try out AngularJS with material design?)
  • Need to figure out how to record the data.
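As a starting point for the "how to record the data" question, here is a minimal sketch of one possible table layout. It uses SQLite from the Python standard library as a stand-in for whatever hosted SQL database is eventually chosen, and every table, column, and function name is an assumption made for illustration.

```python
import sqlite3


def create_annotation_store(path=":memory:"):
    """Open a database and create the (hypothetical) annotation table."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS annotation (
            id INTEGER PRIMARY KEY,
            annotator TEXT NOT NULL,
            query_article TEXT NOT NULL,
            candidate_article TEXT NOT NULL,
            is_related INTEGER NOT NULL CHECK (is_related IN (0, 1)),
            created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP,
            UNIQUE (annotator, query_article, candidate_article)
        )
        """
    )
    return conn


def record_judgment(conn, annotator, query_article, candidate_article, is_related):
    """Store one judgment; a repeated judgment by the same annotator replaces the old one."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT OR REPLACE INTO annotation "
            "(annotator, query_article, candidate_article, is_related) "
            "VALUES (?, ?, ?, ?)",
            (annotator, query_article, candidate_article, int(is_related)),
        )
```

One row per (annotator, query article, candidate article) triple keeps disagreement between annotators visible, which matters if inter-annotator agreement is to be reported.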

Known bug: find_term.py sometimes pauses or generates wrong results

Using find_term to generate the termID-termFreq file is known to often result in:

  • the process stopping when other processes are running on the same machine
  • the process generating wrong results: running the same program on the same file produces slightly different outputs
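The second symptom can be turned into a repeatable check by running the same command several times and comparing output hashes. The sketch below assumes the command writes its result to standard output; if find_term.py writes to a file instead, the same idea applies by hashing that file after each run. Both helper names are hypothetical.

```python
import hashlib
import subprocess


def output_digest(cmd):
    """Run `cmd` and return the SHA-256 hex digest of its standard output."""
    proc = subprocess.run(cmd, capture_output=True, check=True)
    return hashlib.sha256(proc.stdout).hexdigest()


def is_deterministic(cmd, runs=3):
    """Return True if `runs` executions of `cmd` all produce identical output."""
    digests = {output_digest(cmd) for _ in range(runs)}
    return len(digests) == 1
```

A check like this narrows down when the bug appears, for example by running it once on an idle machine and once while other jobs are running.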
