Comments (6)
I will download mm39 and take a look, thanks!
from methphaser.
This looks like a bug. Could you please show me the vcf file? I have never used MethPhaser on mouse samples, so a small example would help.
from methphaser.
It's not public data - I'll have to ask if it could be shared. I may get back to you on this at a later date. Sorry I can't be more helpful now.
One thing I did run into with the current implementation: since I filtered the .vcf file down to chromosomes 1-19 (the mouse autosomes), phase_block_df (defined here) infers the dtype of the chr column as int instead of str, and the subsequent comparison fails because int(chromosome) == str(chromosome) returns False. As a result, all chromosomes were being skipped. Adding dtype={'chr': 'str'} to the pd.read_csv() call that defines phase_block_df fixed the issue. A similar issue also exists in methphasing.
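For anyone hitting the same thing, here is a minimal sketch of the dtype inference described above. It does not use MethPhaser's actual code: the in-memory table, its column names other than chr, and the variable names are made up for illustration. It only shows how pd.read_csv() infers an integer column when every chromosome name is numeric, and how passing dtype keeps the names as strings.

```python
import io

import pandas as pd

# Hypothetical phase-block table whose chromosome names are all numeric,
# as happens after filtering a mouse VCF down to autosomes 1-19.
tsv = "chr\tstart\tend\n1\t100\t200\n2\t300\t400\n"

# Default behaviour: pandas infers an integer dtype for the chr column.
inferred = pd.read_csv(io.StringIO(tsv), sep="\t")

# Forcing the column to str keeps chromosome names as text.
forced = pd.read_csv(io.StringIO(tsv), sep="\t", dtype={"chr": "str"})

# Comparing the integer column against a string name matches nothing,
# so every chromosome would be skipped.
inferred_hits = int((inferred["chr"] == "1").sum())
forced_hits = int((forced["chr"] == "1").sum())

print(pd.api.types.is_integer_dtype(inferred["chr"]))
print(inferred_hits, forced_hits)
```

With the default inference the string comparison finds no rows, while the forced-str version finds the row for chromosome 1.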
from methphaser.
I think that is because most human autosomes can be parsed as int, but I can fix that. Thanks for pointing it out! By the way, what do your reference genome headers look like? They may differ from the human autosome names, which could be the issue, but I'm not sure.
from methphaser.
We use mm39 as reference.
Now that I think about it: this line here reads in info from the reference (and the next line gets the chromosome column). If there are non-autosome entries in the reference, their names contain non-numeric characters, so the chromosome column dtype will be str. In this case, if the .vcf file is filtered for autosomes and its chromosome column dtype defaults to int, the int to str comparison fails.
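A defensive way to avoid the mismatch, sketched under assumptions: rows_for_chromosome, reference_names and blocks are hypothetical names, not MethPhaser functions or variables, and the chromosome labels are simplified placeholders. The idea is to cast both sides to str at the point of comparison, so it no longer matters which dtype each table was read with.

```python
import pandas as pd


def rows_for_chromosome(df, chromosome, column="chr"):
    """Return rows whose chromosome label matches, comparing both sides as strings."""
    return df[df[column].astype(str) == str(chromosome)]


# Names taken from a reference that includes non-autosomes, so they are strings.
reference_names = ["1", "2", "X", "MT"]

# Phase blocks from an autosome-only VCF, where the column was inferred as int.
blocks = pd.DataFrame({"chr": [1, 1, 2], "start": [10, 50, 5]})

# Count matching blocks per reference chromosome.
matched = {name: len(rows_for_chromosome(blocks, name)) for name in reference_names}
print(matched)
```

This only normalizes dtypes; it would not reconcile different naming schemes between the reference and the VCF.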
from methphaser.
Yeah, mm39 starts with something like NC_000067.7 and HG38 starts with chr22. I will modify the dataframe reader, thanks for pointing that out.
from methphaser.