
Comments (14)

iminkin commented on September 23, 2024

I will take a look at the PR and let you know if I would stick to GFA1 or migrate to GFA2.

from gfa-spec.

iminkin commented on September 23, 2024

Sort of: https://github.com/medvedevgroup/TwoPaCo

The code is not "released" yet, though.


sjackman commented on September 23, 2024

Okay. Good to know. Perhaps the C record should be left as is then. If no one had implemented it, I was thinking we could possibly get rid of it. Would you like to comment on #33?


iminkin commented on September 23, 2024

I will take a look.


ggonnella commented on September 23, 2024

I notice that commit c881263, recently merged into the specification, adds a description of C lines that I find confusing on one point:
"This line can be used to describe mapping between segments in the graph and input sequences."

Until now I thought that the "from" and "to" fields of C lines (as well as L lines) had to be segment names from S lines in the graph. The note about the "pan-genome" motivation for C makes it sound as if one can have a C line between an S line and a chromosome sequence that is external, i.e. not contained in the graph.

I think it would be better to clarify this in the specification text.


iminkin commented on September 23, 2024

@ggonnella you are right. The point is to keep the mapping between the pan-genome graph and the input sequences, see #7.


sjackman commented on September 23, 2024

I agree with Giorgio @ggonnella that the containment C record is a relationship between two segment S records both in the GFA file. GFA doesn't currently have any concept of external IDs. I suggest changing the text like so:

 # `C` Containment line

-This line can be used to describe mapping between segments in the graph and input sequences. It is useful for graphs built from assembled genomes, e.g. pan-genome analysis graphs.
+A containment line represents an overlap between two segments where one is contained in the other.

I've created PR #41 with this change.
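To illustrate the constraint discussed above, here is a minimal sketch (not part of the spec or of any existing tool) that checks whether every C line in a GFA1 file refers only to segments declared by S lines in the same file. It assumes the GFA1 column order `C`, container, container orientation, contained, contained orientation, position, overlap; the function name `check_containments` and the sample data are hypothetical.

```python
def check_containments(gfa_text):
    """Return (line_number, name) pairs for C-line segment names with no S line."""
    segments = set()
    containments = []
    for lineno, line in enumerate(gfa_text.splitlines(), start=1):
        fields = line.split("\t")
        if fields[0] == "S" and len(fields) >= 2:
            # S <name> <sequence> ...
            segments.add(fields[1])
        elif fields[0] == "C" and len(fields) >= 7:
            # C <container> <orient> <contained> <orient> <pos> <overlap> ...
            containments.append((lineno, fields[1], fields[3]))

    # Check after the full pass so that S lines may appear after C lines.
    problems = []
    for lineno, container, contained in containments:
        for name in (container, contained):
            if name not in segments:
                problems.append((lineno, name))
    return problems


# Hypothetical sample: the second C line points at "chr1", which has no S line.
EXAMPLE = "\n".join([
    "H\tVN:Z:1.0",
    "S\t1\tACGTACGTAC",
    "S\t2\tGTAC",
    "C\t1\t+\t2\t+\t2\t4M",
    "C\t1\t+\tchr1\t+\t0\t4M",
])

print(check_containments(EXAMPLE))
```

Under the reading proposed here, the last C line would be flagged because `chr1` is an external name rather than a segment in the graph.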


sjackman commented on September 23, 2024

@ggonnella @IlyaMinkin Please vote/comment on #41 so that I can merge it. Three 👍 are required to merge.


sjackman commented on September 23, 2024

@IlyaMinkin If TwoPaCo is the only implementation of GFA1 containment C records, would you consider migrating to GFA 2 so that we can deprecate the containment record of GFA 1?


iminkin commented on September 23, 2024

@sjackman does a GFA2 specification exist somewhere? And what is the reason for deprecating containment?


sjackman commented on September 23, 2024

GFA 2.0 is very close to consensus. The PR is #48. Your comments are welcome.

If you are actively using GFA 1.0 containment C records in released code, then that's fine, we won't deprecate them. If however they were never implemented in released code, then I'd prefer to deprecate them.


ggonnella commented on September 23, 2024

I do not understand why it would be better to deprecate the C lines. They will no longer exist in GFA2, but neither will the L lines: should we deprecate L lines too? That wouldn't make sense. C lines express things that cannot be expressed in any other way in GFA1.


sjackman commented on September 23, 2024

If C records had never been implemented by anyone, it would be better to encourage people to migrate to GFA 2.


stale commented on September 23, 2024

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs.
