Comments (2)
Existing practice and the description of the field as an insert size make it clear that the field value is numeric.
HTSJDK and Picard have always enforced that the field value be an integer. See samtools/htsjdk@2799b1f and the output from picard ValidateSamFile I=test/sam/passed/hdr.RG9.sam
in the latest release:
ERROR::INVALID_PREDICTED_MEDIAN_INSERT_SIZE:Error parsing SAM header. PI is not numeric: 123.456.
Hence my recommendation would be that we clarify that this is an integer. Proposed text is in PR #721.
from hts-specs.
Good catch given I wrote most of the test data and hadn't spotted htsjdk rejects the floating point nature. Really, since when have continuous things like means been acceptable only as integer? I don't know why they expected integer only, given ParseFloat would work just fine on integer values.
I thought however I'd tested picard on most of the test data and looked over the causes of the failures. I guess I somehow missed that one (or it's changed since, but that seems unlikely).
from hts-specs.
Related Issues (20)
- VCF: "Genotype fields" vs "FORMAT" and per-sample HOT 1
- primary, secondary, and supplementary alignments with optional MM tags HOT 5
- Modified base single letter codes update HOT 7
- test/sam: Duplicate aux field tags
- test/vcf: Duplicate contig header record ID
- FAIRsharing Record Query - BED format
- CRAM: Need to improve feature positions description HOT 1
- is `*` better than `\*`? HOT 1
- cram: interpretation of "unmapped" flag in a pseudocode seems incorrect HOT 1
- SVCLAIM: VCF4.4 and backward compatibility with VCF4.3 HOT 1
- How to retrieve the primary alignment for secondary and supplementary reads HOT 8
- Is there a semantic difference between GT=./. and GT=0/0 + GQ=0 ? HOT 15
- cram: Inconsistent descriptions of auxiliary tag types HOT 1
- SA tag CIGAR format
- vcf: Handling structured header records with missing IDs in VCF 4.1/4.2 HOT 1
- bcf: First phasing indicators not set in genotype (GT) value examples
- CSI file is BGZF compressed but this is not mentioned in the CSV1 spec HOT 2
- Questions about third-party use of test data HOT 6
- VCF Draft 4.5 and Modified Bases HOT 27
- VCF4.4 SVLEN requirement across different variant representations HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hts-specs.