The artms from biodavidjm

Provide documentation (vignettes)

README
Vignettes
Source code comments

Unit Tests

2do:

artms_enrichForComplexes should break if anything happens to artms_data_corum_mito_database
All the functions used across the repository, as for example, artms_changeColumnName

Unified both functions in evidenceToMISTformat

Currently, there are two independent functions to generate the MIST format based on spectral counts or intensity. Unified both

Revise colnames

It might require too much time, so not sure if worthy. However, in the long term...

Why do you need so many global variables?

decide on artms_evidenceToMISTformat

Should it be removed? if no solution is found for the large number of files that are necessary, removed.

A reasonable solution could be: download the data from kroganlab website.

Remove aggregate_fun option from config

Remove aggregate_fun option

Create individual files from the MSstats_main.R

Break each of the large functions in multiple files
Add comments
Fix warnings
Remove MSstats_main

.extras_annotate: fix it

Right now is reading from the system. But this is wrong.
It should use artms_annotation function

Add warning to avoid overwrite evidence.txt

When selecting the output name by the artms_proteinToSiteConversion tool, prevent that the output from having the same name as the original file (i.e., overwrite)

Improve the configuration file

The data option should include all the data related options (silac, filters, fractions)
Rename files$data to files$evidence
Add qc section
Eliminate unused options

Handle errors from gProfileR

When the server is down, it throws the following error:

 Error in function (type, msg, asError = TRUE)  : Empty reply from server

which breaks artms_analysisQuantifications

Update CORUM file location

Currently is pointing to the wrong local file. Make it available in the package

Add new extended QC functions to config

When running artms_quantifications, we need a second options in qc to run the extended functions

analysisQuantification: Check that ptmph / ptmsites are right options

Fix bug with respect to the output name of the function artms_generatePhSiteExtended
Check if ptmsites is the right option based on the PTM_ph or PTM_STKY notation

Fix Warning T/F

WARNING: Use TRUE/FALSE instead of T/F
Found in R/ directory functions:

Found in files:

inst/extdata/createData.R
man/artms_annotationUniprot.Rd
man/artms_mergeMaxQDataWithKeys.Rd
man/artms_volcanoPlot.Rd

Add ENTREZID to annotations

It is a key information that should also be provided

Fix single comparison heatmaps of enrichments

Something is wrong when there is only one comparison for the heatmap> the heatmap gets way off.

Enrichment.plotHeatmaps

Add new functions for quality controls

We need more extensive quality control check from the evidence and the summary file

Add extra controls to prevent errors

Some functions might need to increase the number of controls to prevent errors

Create individual functions from MaxQ_utilities

Break down MaxQ_utilities in individual files.
When naming the individual scripts, try to organize it based on major functions.

Raw.file check function

It is annoying to deal with the RawFile column name everywhere. Write a function to check and return the dataset (evidence or keys) with the RawFile column name

Removing ggalt functions from qc functions

The ggalt library cannot be installed on linux. Therefore, it has to go away from the QC extended functions. This is the error:

*** Install PROJ.4 and if necessary set PKG_CPPFLAGS/PKG_LIBS accordingly.
ERROR: configuration failed for package ‘proj4’
* removing ‘/home/travis/R/Library/proj4’
Installation failed: Command failed (1)
missing: ggalt, proj4
The command "Rscript -e 'deps <- devtools::dev_package_deps(dependencies = NA);devtools::install_deps(dependencies = TRUE);if (!all(deps$package %in% installed.packages())) { message("missing: ", paste(setdiff(deps$package, installed.packages()), collapse=", ")); q(status = 1, save = "no")}'" failed and exited with 1 during .

Removing ggalt only affects QC-IntCorrelation.pdf
#40

I'll take care of this @alexproteomics

ReplicatePlots: select type of experiment

Currently it does a correlation plots taking all the features. Add an option to select the ptm type

analysisQuantifications: Add post-msstats processing functions

Post-processing of msstats results, including:

External data

Issue to keep track of all the sample data objects created for the package as a way to provide compelling use cases for the package’s functions.

The first one is the PH dataset provided by Danielle

Generate SAINTq input file from evidence + keys

The input file requires by SAINTq is very different from SAINT express. Create a function to generate the input file from the evidence + keys (an additional column will be required: SAINT)

Review: Reduce annotation packages

Yes, move annotation packages to Suggests. Otherwise, the user is required to install all of those packages that they may not need.

Your artms_mapUniprot2entrezGeneName function should be using a data.frame map that has two columns: common name and the annotation abbreviation (e.g., "Anopheles", "Ag"). Then you can use this along with if (!requireNamespace()) then stop("Install to map IDs") to get the user to have the package already installed before using your function. It would also reduce the repetitive code.

You could have used a data.frame map for the 'mapUniprot2entrezGeneName'
function. Is the function necessary?

Revise Protein Complex Enrichment analysis

Plot based on fold enrichment instead of pvalue
Add false discovery rate

Decide on argument pathogen

pathogen might be needed to be removed. Otherwise is too complicated. Not sure how many people are mixing human and pathogens.
If used, where is the info about pathogens coming from? The user should provide the annotation file, but of course, it would have to be specified how it should be formatted (or how to get it)

Jittered plot would have to be removed as well

Decide on 'specie'

why is only human or mouse the only two species supported when using org.db there are way more available. Revise this.

Rename artms_evidenceQC to artms_qualityControlEvidenceBasic

It is a more intuitive name

Remove internal functions from documentation

It could confused the users to document functions that are meant to be used only internally

Add PHOTON input generator

Generate input files for PHOTON from -imputedL2fcExtended.txt

Create internal functions

Many of the functions should be use only internally:

Prefix internal functions with a '.'. Do not @export and in general skip royxgen docs for these functions, with the exception of @importFrom lines.

artms_evidenceQC: add output file name argument

Give the user the option to provide a file name output, otherwise, use a build-in default one

Incorporate Quality Control for the evidence file

Add functions for QC analysis of the MaxQuant evidence file
Add options to the configuration file to enable QC

Update MSstats_functions

Add comments
Remove deprecated functions
Rename

Review: check missing arguments

I meant that your if statement should be more like
any(is.missing("datafilearg"), is.missing("textfilearg")) instead of testing with all(is.null(...), is.null(...)).
The logic in these tests is slightly different and perhaps you don't want any of the file
arguments to be missing rather than the instance where all of them are missing.

Build the overall structure of the package

It includes:

Help pages
Vignettes
Unit tests
data/
NAMESPACE
CI: code coverage: codecov
CI: travis
CI: ~~AppVeyor~~ (dimissed, it does not support bioconductor packages)
Do not include Packrat: isolate the packges installed for this project. Possible problems with bioconductor

Unified artms_plotHeatmap & plotHeat

Should be part of the same function. File plotHeatmap

Improve old functions

General tasks related to the old functions that need to be transformed and get package format

Clean up the functions
Add unit tests
Remove unnecessary functions (recommended at the end)

Provide Argument types

All the functions' arguments must incorporate the type in the documentation.

Create runnable examples

Make whatever is necessary to make the 80% of the functions runnable

Do it for this functions:

All these functions cannot have runnable examples due to either size restrictions or run for too long:

Long Running time

artms_quantification: takes too long
analysisQuantifications.R: takes too long

Require extra (large) files

artms_SILACtoLong: requires a SILAC evidence file
artms_evidenceToSaintExpressFormat: requires an APMS dataset
artms_evidenceToMISTformat: requires an APMS dataset
artms_msstats_summary: requires a summary file
artms_dataPlots: requires uploading an extra file
artms_generatePhSiteExtended: requires large extra files

Fix warnings and errors

Found the following significant warnings:
  Warning: replacing previous import ‘biomaRt::select’ by ‘plotly::select’ when loading ‘artMS’
  Warning: replacing previous import ‘ggplot2::last_plot’ by ‘plotly::last_plot’ when loading ‘artMS’
  Warning: replacing previous import ‘plotly::mutate’ by ‘plyr::mutate’ when loading ‘artMS’
  Warning: replacing previous import ‘plotly::arrange’ by ‘plyr::arrange’ when loading ‘artMS’
  Warning: replacing previous import ‘plotly::rename’ by ‘plyr::rename’ when loading ‘artMS’
  Warning: replacing previous import ‘plotly::summarise’ by ‘plyr::summarise’ when loading ‘artMS’
  Warning: replacing previous import ‘data.table::melt’ by ‘reshape2::melt’ when loading ‘artMS’
  Warning: replacing previous import ‘data.table::dcast’ by ‘reshape2::dcast’ when loading ‘artMS’
  Warning: replacing previous import ‘biomaRt::getSequence’ by ‘seqinr::getSequence’ when loading ‘artMS’
  Warning: replacing previous import ‘limma::zscore’ by ‘seqinr::zscore’ when loading ‘artMS’
  Warning: replacing previous import ‘plyr::count’ by ‘seqinr::count’ when loading ‘artMS’
  Warning: replacing previous import ‘seqinr::a’ by ‘shiny::a’ when loading ‘artMS’

Add Phosfate input generator

Taking as input -imputedL2fcExtended.txt

Rename all functions adding `artms_` prefix

As an R user, I have always liked to better know the functions available in every package.

An elegant solution would be to just providing a prefix with the name of the package

artms_NAME_OF_THE_FUNCTION

Remove `getopt` traces

RMSQv3 works accepting flags/options. Remove everything related to library(getopt)

many:many mappings

Problem with select() returning many:many mapping between keys and columns. Fix it. Find out how to get the primary gene symbol

Create function to write config file

just print out the config.yaml file to be filled up by the user

Refine the imputation method of `artms_imputeMissingValues`

Currently the log2fc value that can be obtained after imputation might be too high if the intensity in the condition where the protein was consistently found is too high. Normalized the imputed values using the maximum of the log2fc calculated by MSstats.

Suggestion: provide two new arguments with the highest and lowest log2fc values to adjust the imputation method

biodavidjm / artms Goto Github PK

artms's People

Contributors

Stargazers

Watchers

Forkers

artms's Issues

Recommend Projects

Recommend Topics

Recommend Org