nloyfer / meth_atlas Goto Github PK
View Code? Open in Web Editor NEWLicense: Other
License: Other
Hi,
I am running iteratively across groups, but deconvolve.py
with --plot
plots to screen and so the iteration stops. Possible to allow --plot
to just dump to png, not also to screen? Am I missing something?
Thanks,
Bruce.
Hello Everyone!
I am working with samples of heart rejection patients. I used ONT to sequence two samples one with a slight rejection and the other with no rejection.
After that i used Guppy to generate .bam files and made .bed files from the bam using Modbam2bed. I map each CpG using the start oder end index to the Pos on the reference Atlas to map each CpG to a specific Illumina ID. And after that i use the Deconvole script.
I currently am just not seeing any left.atrium cells or Adipocytes in the samples. I just see Neutrophiles, Erythrocyte_progenitors and Monocytes_EPIC. The only difference between the samples is between the amount of Neutrophiles. So now i am wondering is there a Error in my approach of mapping or my basic Pipeline ? Am i forgetting any steps ? Cause the deconvolution results do not match what i would accept to see.
Maybe someone has experience with using ONT Data for this Algorithm and can help me out.
Thanks in advance!
kind regards,
Azlan
Hi, I was wondering if you or someone has used your algorithm and reference atlas to deconvolute whole genome bisulfite methylation sequence data (WGBS)? I tried using it for some samples but, it predicts the incorrect cell or tissue types for these for e.g. for lung tissue (from a healthy control) it predicts that a large portion of the sample contains cortical neurons, breast etc. (no indication of lung). In addition, I tried it with a sample from adipose tissue and it predicts the same composition as the lung. There is sufficient overlap of CpGs with your reference atlas (at various coverages) thus, I don't think this would be an issue. Is there something inherent with the deconvolution algorithm used, tailored towards array data? Just curious, any help will be appreciated. Thank you.
How to prepare data for WGBS data , Thank you!
Hi,
I came across your deconvolution protocol here and was very much interested in applying it to my tumor samples. However, there is one reference missing in the reference atlas that I would need: alpha cells of pancreas. I wish to incorporate the reference into the atlas. Do you provide a protocol (similar to the sample preparation script) that one can use to optimize the reference file with a new file?
I noticed in the corresponding paper that one can do feature selection and technically, remake the reference. However, the pairwise-specific CpG calculation was not so clear for me. Could you elaborate a bit on exactly how you did it? that way if you do not have a script readily available, I can actually prepare it myself?
Hi,
running deconvolve.pl
with prostate samples (benign and tumour, and cfDNA) shows no prostate in any. I also tested a tumour/normal sample from TCGA_PRAD (far right of image, prad_1, 2), which shows same profile. What is your take on this? NB I tried a few lung, colon samples and they seem to work, possibly this is prostate specific issue?
Thanks,
Bruce.
Hi Netanel
Great work on this!! Is there a possibility to share the output csv from the data in the publication. Basically, I'm asking for the signature matrix that is not filtered by differential methylation etc.
Best,
Altuna
I downloaded SRR641640 run of SRX175350 as a fastq file from NCBI website.
I then used the below set of commands to convert the fastq file to pat format:
bwa index -a bwtsw hg19.fa
bwa mem reference_genome.fasta input.fastq > output.sam
samtools sort -o sorted_output.bam output.sam
samtools index sorted_output.bam
wgbstools bam2pat sorted_output.bam
Attaching the given pat file and output (files starting with name Given_) in the tutorial for SRX175350 and the ones I got (files starting with MyExp_) with SRR641640 run of SRX175350
Given_Lung_STL002.pat.gz
Given_Lung_STL002.pdf
MyExp_Lung_STLOO2.pdf
MyExp_Lung_STLOO2.pat.gz
Hi,
Thank you for the awesome tool and reference. I'm enjoying it greatly.
By the way, are you interested in WGBS mode? deconvolve.py refuses the first column other than cgxxxx. How about adding --wgbs option which accepts first columns which recode chromosome and position (e.g. chr14:68790171 instead of cg08169020 or use first two columns for chrom and pos)?
If you like it, I would PR.
Best, Yoshiaki
Hello, thank you for making such a good tool. First, I looked it up, but... I'm sorry to ask you this question because I can't find a way.
I used idat files in pre_process using the script and reference sample you uploaded in the pre_process, but as the EPIC array has been upgraded, I think some information has been added to the probe ID. So, my idat files cannot work pre_process. Is there any way to use the idat file made by EPIC array version 2 for pre_processing?
Thank you.
Some rows are duplicated in the reference_atlas.csv file. For example, the probe cg19442545 shows up 5 times with the same beta values across all the samples. Is it expected?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.