ncbi-codeathons / automated-sc-rna-seq-analysis-in-the-cloud Goto Github PK
View Code? Open in Web Editor NEWLicense: MIT License
License: MIT License
The docker containers are only about information processing, so we shouldn't need the IPython dependency.
In the current version we run OnClass without doing any batch correcting between the new data and the data used to build the pre-trained model.
While this might work in a couple of situations, the current approach is not general.
What needs to be done:
This might be the same as #19, or it might be different, if the original data is only needed temporarily, but the derived data needs to be kept longer.
Currently we are using the a fixed pre-trained model. We want to allow the user to select different pre-trained models because at the moment the model has been trained using Tabula Muris Senis, so it's better suited for mouse datasets.
What needs to be done:
For instance, you could mock subprocess and see if it starts what you think it should start.
Hi there,
I discovered your repository by chance.
Do you know of the https://github.com/nf-core/scrnaseq pipeline?
(part of nf-core)
Your goal description (QC and cloud computing for scrna
) seems rather similar.
Just thought I should mention that here to avoid unnecessary work.
Having a webapplication spawn a subprocess, much less a docker container, is a bad idea. Even apart from security concerns, there's a mismatch between the needs of the webserver (minimal! except for storage... see #19 and #20.) and the needs of the processing (huge, above 10G RAM, at least.)
This might be appropriate for AWS Lambda, if the jobs can run quickly enough. Google or Azure might have their own offerings in this space.
Different compute platforms would support different storage possibilities.
From their readme, it sounds monolithic: There is no obvious way to pull out the data processing, and incorporate the visualizations within some other site. Simply spawning multiple instances and running on multiple ports sounds like a really bad idea... but getting into their code to pull out reusable parts doesn't seem easy, either.
Run the pages from the site through a validator... Down the road, it would also be nice to have automated tests that validate the HTML.
@aopisco : Will we we working with the data at https://figshare.com/articles/Processed_files_to_use_with_scanpy_/8273102 ? Would a simple script that downloads the data to a local gitignored directory help us get started? Or is that premature?
Assign back to me to implement, if that's the right direction.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.