Giter Club home page Giter Club logo

Comments (10)

Hego-CCTB avatar Hego-CCTB commented on September 27, 2024 1

Yeah, you are right. I'm gonna need to replace any reference to tissue with curate_group.

from amalgkit.

Hego-CCTB avatar Hego-CCTB commented on September 27, 2024

Changelog Amalgkit ver. 0.6.0.0

amalgkit csca

  • new sub function for cross species correlation analysis
  • USAGE:
amalgkit csca \
--out_dir \
PATH_TO_WORKING_DIRECTORY \
--file_species_tree \
PATH_TO_NWK_FILE \
--file_singlecopy \
PATH_TO_ORTHOFINDER_FILE \
--file_orthogroup \
PATH_TO_ORTHOFINDER_FILE \
--dir_uncorrected_curate_group_mean \ 
PATH_TO_CURATE_TABLES\
--dir_curate_group_mean\
PATH_TO_CURATE_TABLES\
--dir_sra \
PATH_TO_CURATE_TABLES\
--dir_tc \
PATH_TO_CURATE_TABLES\

--curate_group \
'root,flower,leaf' \
  • Note: This was tested on a 9 species plant dataset retrieved, quantified and curated by amalgkit. That said, further testing is needed. Especially gene name format can cause issues.
  • Note: dir_uncorrected_curate_group_mean, dir_curate_group_mean, dir_sra, dir_tc all point to the same directory, if the input is unchanged curate output. As such, these arguments are inferred by default. If there is a curate/tables folder in the --out_dir path, amalgkit will find those files on its own.

amalgkit curate

  • Now throws a warning when transforming with TPM
  • Now throws an error when cstmm output files are detected (parsed from path) in combination with TPM transformation
  • Now includes option --one_outlier_per_iter yes|no, which allows only 1 sample per same bioproject or same tissue to be removed per iteration of the outlier removal
  • check_within_tissue_correlation() now removes samples below a pearson r of 0.2 (currently hard coded, but can be made an optional input in the future)
  • --cleanup 0|1 is now plot_intermediate yes|no. "yes" calculates and prints SVA correction after every single iteration of outlier removal. This can drastically increase runtimes.

amalgkit getfastq

  • truncated updated_metadata output files to only essential columns for curate. This comes with two benefits: lower filesize (which very slightly increases curate performance) and more importantly, same column number across all individual files
  • obsoleted --ascp and all related options

amalgkit

  • added amalgkit csca subparsers

This should go up later today. I'm still debugging and I have to merge with the other updates today.

from amalgkit.

kfuku52 avatar kfuku52 commented on September 27, 2024

Is there any option like --curate_group all to include all curate_group in the metadata table?

from amalgkit.

Hego-CCTB avatar Hego-CCTB commented on September 27, 2024

If --curate_group is left none , it should parse out all unique values from the curate_group column and use that as input.

from amalgkit.

kfuku52 avatar kfuku52 commented on September 27, 2024

Sounds good!

from amalgkit.

Hego-CCTB avatar Hego-CCTB commented on September 27, 2024

Update is now live.
cbd6852

from amalgkit.

kfuku52 avatar kfuku52 commented on September 27, 2024

The curate_group column is missing in the metadata table. Could you update amalgkit metadata?

from amalgkit.

Hego-CCTB avatar Hego-CCTB commented on September 27, 2024

Ah, it seems the column doesn't survive the last metadata step. There are 3 metadata sheets as output. curate_group is in the second output, but not in the third.

I'll investigate that.

from amalgkit.

kfuku52 avatar kfuku52 commented on September 27, 2024

It seems that curate_group isn't used at all in transcriptome_curation.r. Am I missing something?

from amalgkit.

Hego-CCTB avatar Hego-CCTB commented on September 27, 2024

Yeah, you are right. I'm gonna need to replace any reference to tissue with curate_group.

Amalgkit ver. 0.6.2.3

  • Replaced every 'tissue' or 'tissues' with 'curate_group' or 'curate_groups', including variables
    2174567

from amalgkit.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.