Comments (7)
GFA is an assembly format. It is closer to FASTA than to SAM. I think alignment to GFA needs a new format.
from gfa-spec.
I am quite interested in aligning long reads to a GFA assembly graph. The three tools that I know of so far are Bandage querypaths (uses BLAST, plain text output), spades-gmapper (graph must be a dBG, unsure what the output format is), and vg.
@ekg Does using either PAF or GFA2 fragment F records work for your purpose?
My opinion is that GFA2 should (despite the A in the name) be considered more than an assembly format, since its extensions relative to GFA1 make it possible to represent different kinds of sequence graphs.
I agree with @richarddurbin that adding more complexity to the specification could be a bad idea. Specifically, I think that defining new line types would not be advisable. However, I would not see it as a problem to define and reserve new tags for specific uses in the mentioned application, for the purposes of standardization. Using the available facilities, such as F lines and CIGAR strings, together with ad hoc tags may solve the problem. A more concrete example would help (at least me) to understand the requirements better.
The F-line is designed to keep the locations of reads in the assembly. Most of the time, a read is placed on a single segment. In general graph alignment, and in vg alignment in particular, a query often walks through multiple segments, some of which are just 1 bp long. Using F-lines or PAF for this wastes space, is inconvenient to access, and is inefficient to process. They are not the general solution. I have little experience with graph alignment, so I don't know what the best solution is.
Mixing graph representation and graph alignment is like merging FASTA and SAM into a single format. This can be done, but I don't think we should go for that.
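To make the space argument concrete, here is a minimal sketch of what recording one read with F-lines might look like when the read walks through several short segments. It assumes the GFA2 fragment layout `F sid external sbeg send fbeg fend alignment`, an ungapped forward-strand alignment, and `*` for the alignment field. The function name `fragment_lines` and the segment and read names are hypothetical and used only for illustration.

```python
def fragment_lines(read_name, read_len, walk, start_offset=0):
    """Return one GFA2-style F record per segment touched by the read.

    walk: list of (segment_id, segment_length) in traversal order.
    start_offset: 0-based position in the first segment where the read starts
                  (assumed to be smaller than that segment's length).
    Assumes an ungapped, forward-strand alignment (an illustration only).
    """
    lines = []
    read_pos = 0           # current position in the read
    seg_pos = start_offset # current position in the current segment
    for seg_id, seg_len in walk:
        if read_pos >= read_len:
            break
        # Number of bases of the read that fall on this segment.
        span = min(seg_len - seg_pos, read_len - read_pos)
        seg_end = seg_pos + span
        read_end = read_pos + span
        # GFA2 appends '$' to a position that coincides with the end of a sequence.
        send = f"{seg_end}$" if seg_end == seg_len else str(seg_end)
        fend = f"{read_end}$" if read_end == read_len else str(read_end)
        lines.append("\t".join(
            ["F", seg_id, f"{read_name}+", str(seg_pos), send,
             str(read_pos), fend, "*"]
        ))
        read_pos = read_end
        seg_pos = 0  # every later segment is entered at its start
    return lines


# Hypothetical walk: two 10 bp segments separated by two 1 bp segments.
demo_walk = [("s1", 10), ("s2", 1), ("s3", 1), ("s4", 10)]
demo = fragment_lines("read1", 12, demo_walk, start_offset=5)
for line in demo:
    print(line)
```

Under these assumptions a single 12 bp read already needs four records, one per segment, and each record repeats the read name. The number of lines grows with the number of segments on the walk rather than with the number of reads.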
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.