biodavidjm / artms Goto Github PK
View Code? Open in Web Editor NEWAnalytical R Tools for Mass Spectrometry
License: GNU General Public License v3.0
Analytical R Tools for Mass Spectrometry
License: GNU General Public License v3.0
2do:
artms_enrichForComplexes
should break if anything happens to artms_data_corum_mito_database
artms_changeColumnName
Currently, there are two independent functions to generate the MIST format based on spectral counts or intensity. Unified both
It might require too much time, so not sure if worthy. However, in the long term...
Why do you need so many global variables?
Should it be removed? if no solution is found for the large number of files that are necessary, removed.
A reasonable solution could be: download the data from kroganlab website.
aggregate_fun
optionMSstats_main
Right now is reading from the system. But this is wrong.
It should use artms_annotation function
When selecting the output name by the artms_proteinToSiteConversion
tool, prevent that the output from having the same name as the original file (i.e., overwrite)
data
option should include all the data related options (silac
, filters
, fractions
)files$data
to files$evidence
qc
sectionWhen the server is down, it throws the following error:
Error in function (type, msg, asError = TRUE) : Empty reply from server
which breaks artms_analysisQuantifications
Currently is pointing to the wrong local file. Make it available in the package
When running artms_quantifications, we need a second options in qc to run the extended functions
artms_generatePhSiteExtended
WARNING: Use TRUE/FALSE instead of T/F
Found in R/ directory functions:
Found in files:
It is a key information that should also be provided
Something is wrong when there is only one comparison for the heatmap> the heatmap gets way off.
Enrichment.plotHeatmaps
We need more extensive quality control check from the evidence and the summary file
Some functions might need to increase the number of controls to prevent errors
Break down MaxQ_utilities in individual files.
When naming the individual scripts, try to organize it based on major functions.
It is annoying to deal with the RawFile column name everywhere. Write a function to check and return the dataset (evidence or keys) with the RawFile column name
The ggalt
library cannot be installed on linux. Therefore, it has to go away from the QC extended functions. This is the error:
*** Install PROJ.4 and if necessary set PKG_CPPFLAGS/PKG_LIBS accordingly.
ERROR: configuration failed for package ‘proj4’
* removing ‘/home/travis/R/Library/proj4’
Installation failed: Command failed (1)
missing: ggalt, proj4
The command "Rscript -e 'deps <- devtools::dev_package_deps(dependencies = NA);devtools::install_deps(dependencies = TRUE);if (!all(deps$package %in% installed.packages())) { message("missing: ", paste(setdiff(deps$package, installed.packages()), collapse=", ")); q(status = 1, save = "no")}'" failed and exited with 1 during .
Removing ggalt
only affects QC-IntCorrelation.pdf
#40
I'll take care of this @alexproteomics
Currently it does a correlation plots taking all the features. Add an option to select the ptm type
Post-processing of msstats results, including:
org.db
as the resource for annotations and mappingsIssue to keep track of all the sample data objects created for the package as a way to provide compelling use cases for the package’s functions.
The first one is the PH dataset provided by Danielle
The input file requires by SAINTq is very different from SAINT express. Create a function to generate the input file from the evidence + keys (an additional column will be required: SAINT)
Yes, move annotation packages to Suggests. Otherwise, the user is required to install all of those packages that they may not need.
Your artms_mapUniprot2entrezGeneName
function should be using a data.frame map that has two columns: common name and the annotation abbreviation (e.g., "Anopheles", "Ag"). Then you can use this along with if (!requireNamespace()) then stop("Install to map IDs") to get the user to have the package already installed before using your function. It would also reduce the repetitive code.
You could have used a data.frame map for the 'mapUniprot2entrezGeneName'
function. Is the function necessary?
pathogen
might be needed to be removed. Otherwise is too complicated. Not sure how many people are mixing human and pathogens.
If used, where is the info about pathogens coming from? The user should provide the annotation file, but of course, it would have to be specified how it should be formatted (or how to get it)
Jittered plot would have to be removed as well
why is only human
or mouse
the only two species supported when using org.db
there are way more available. Revise this.
It is a more intuitive name
It could confused the users to document functions that are meant to be used only internally
Generate input files for PHOTON from -imputedL2fcExtended.txt
Many of the functions should be use only internally:
Prefix internal functions with a '.'. Do not @export and in general skip royxgen docs for these functions, with the exception of @importFrom lines.
Give the user the option to provide a file name output, otherwise, use a build-in default one
I meant that your if statement should be more like
any(is.missing("datafilearg"), is.missing("textfilearg")) instead of testing with all(is.null(...), is.null(...)).
The logic in these tests is slightly different and perhaps you don't want any of the file
arguments to be missing rather than the instance where all of them are missing.
It includes:
codecov
Packrat
: isolate the packges installed for this project. Possible problems with bioconductorShould be part of the same function. File plotHeatmap
General tasks related to the old functions that need to be transformed and get package
format
All the functions' arguments must incorporate the type in the documentation.
Make whatever is necessary to make the 80% of the functions runnable
Do it for this functions:
All these functions cannot have runnable examples due to either size restrictions or run for too long:
Long Running time
artms_quantification
: takes too longanalysisQuantifications.R
: takes too longRequire extra (large) files
artms_SILACtoLong
: requires a SILAC evidence fileartms_evidenceToSaintExpressFormat
: requires an APMS datasetartms_evidenceToMISTformat
: requires an APMS datasetartms_msstats_summary
: requires a summary fileartms_dataPlots
: requires uploading an extra fileartms_generatePhSiteExtended
: requires large extra filesFound the following significant warnings:
Warning: replacing previous import ‘biomaRt::select’ by ‘plotly::select’ when loading ‘artMS’
Warning: replacing previous import ‘ggplot2::last_plot’ by ‘plotly::last_plot’ when loading ‘artMS’
Warning: replacing previous import ‘plotly::mutate’ by ‘plyr::mutate’ when loading ‘artMS’
Warning: replacing previous import ‘plotly::arrange’ by ‘plyr::arrange’ when loading ‘artMS’
Warning: replacing previous import ‘plotly::rename’ by ‘plyr::rename’ when loading ‘artMS’
Warning: replacing previous import ‘plotly::summarise’ by ‘plyr::summarise’ when loading ‘artMS’
Warning: replacing previous import ‘data.table::melt’ by ‘reshape2::melt’ when loading ‘artMS’
Warning: replacing previous import ‘data.table::dcast’ by ‘reshape2::dcast’ when loading ‘artMS’
Warning: replacing previous import ‘biomaRt::getSequence’ by ‘seqinr::getSequence’ when loading ‘artMS’
Warning: replacing previous import ‘limma::zscore’ by ‘seqinr::zscore’ when loading ‘artMS’
Warning: replacing previous import ‘plyr::count’ by ‘seqinr::count’ when loading ‘artMS’
Warning: replacing previous import ‘seqinr::a’ by ‘shiny::a’ when loading ‘artMS’
Taking as input -imputedL2fcExtended.txt
As an R user, I have always liked to better know the functions available in every package.
An elegant solution would be to just providing a prefix with the name of the package
artms_
NAME_OF_THE_FUNCTION
RMSQv3 works accepting flags/options. Remove everything related to library(getopt)
Problem with select()
returning many:many mapping between keys and columns. Fix it. Find out how to get the primary gene symbol
just print out the config.yaml file to be filled up by the user
Currently the log2fc value that can be obtained after imputation might be too high if the intensity in the condition where the protein was consistently found is too high. Normalized the imputed values using the maximum of the log2fc calculated by MSstats.
Suggestion: provide two new arguments with the highest and lowest log2fc values to adjust the imputation method
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.