Giter Club home page Giter Club logo

matrix-explorer's Introduction

NeuroData's Matrix Explorer

Web-based vector analytics and some visualization

To run:

  1. go to the shiny dir
  2. start R
  3. type library('shiny')
  4. type runApp()

Deploy Updates

from terminal:

  1. ssh brainapps1.neurodata.io and enter your pword
  2. cd /srv/shinyapps/Vector-Explorer
  3. git pull
  4. (maybe restart shiny)

If docker container goes down

(below are instructions for graph explorer, not vector explorer)

  1. ./graphexplorerstart.sh, which lives in /srv/shinyapps/Graph-Explorer

If we want to add more dependencies to installation

  • Update dockerfile
  • Run sudo docker build -t graph_explorer . in the same directory as the Dockerfile

To get a terminal on the docker container

  • docker exec -it <<container name>> /bin/bash
  • Find the container name by running docker ps

matrix-explorer's People

Contributors

ikuznet1 avatar jovo avatar

Watchers

Jordan Matelsky avatar James Cloos avatar William Gray Roncal avatar Randal Burns avatar Daniel Jacob Tward avatar Bijan Varjavand avatar  avatar Matt Lee avatar Eric avatar  avatar

Forkers

mayblue9

matrix-explorer's Issues

Add option to export graphics

Table is already exportable. This will probably be put on the back-burner as you can still save the graphics by left clicking and save image. Unfortunately Shiny does not have a very good built in download system, so likely would have to build our own. I suggest maybe doing it via an HTML5 canvas.

when data are big (eg, nnz > 100,000)

probably makes sense to do all the computations, and then send them an email that has a link to the results, which are still interactive, but maybe a bit less (ie, changing which points are outliers will be hard to do interactively).
this means we'll have to make some default choices, but i think it will be awesome.
i used it yesterday with 1000 points & 10 features and it was ok, but getting slower.

of note, the people i demo'ed it to LOVED it.

shorten tab names?

what do you think about shortening tab names?
in particular, they don't all fit on a row for me.
i propose:

  • upload
  • heatmap
  • sample summary
  • feature summary
  • outliers
  • correlation
  • embedding & clustering

Known issues as of 1/10

  • Option to export plots
  • Check R score for scatterplot, etc.
  • on "Data" tab, show: # features, # samples, nnz, # NaN
  • on "Marginal" tab, make "combined" the default.
  • throughout, rename tabs to be more informative
  • "Distance Metric" = "Euclidean Distance"?
  • fix size of correlation matrix
  • mean vector plotting doesn't really have scatter, probably we want 'jittered scatter'
  • add iris.csv file for demo purposes, you can leave the other one, maybe name it something more informative.
  • Implement robust distance metrics
  • tell people the appropriate format (if nan's/infs break code, rows should be data points, columns should be features (or vice versa), appropriate size for things to work)
  • Something like scree for tsne
  • Better caching for tsne
  • Find std of R-scores
  • Allow for boxplot scaling
  • Consider using scalefillbrewer(type = 'div') for heatmaps
  • Scree plots look wrong
  • Add kmeans++ clustering
  • Check if outliers removed from embed
  • Color scheme setting each page?
  • cluster on embedded points.
  • choose # of dimensions via Zhu & Godsi: https://uwaterloo.ca/data-science/sites/ca.data-science/files/uploads/files/automatic-dimensionality-selection-from-scree-plot-use-of-profile-likelihood.pdf (getElbow.R lives in here)
  • we can later enable user to choose the dimension of embedding.
  • matrix completion
  • Add option to replace Nas Infs?
  • Have a link back to issues in vector explorer

some more stuff

heatmap

  • for the heatmap, we should enable setting the number of bins to actually discretize the data that way, and use it throughout.
  • when i click on stuff, the row always says 1
  • we only need to show 2, maybe 3 sig digits, and we should always show the same number.

Summary Sample

  • when conditioning on class, we should show class specific means & medians

Outliers

  • "Color based on class?" actually changes shape based on class. color based on class might be more clear?

Other

  • removing outliers does not seem to work.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.