Comments (6)
The file format(s) used to implement FeatureData[Sequences] can be anything, as long as the file format(s) can encode this "mapping" of feature IDs to sequence IDs, and the sequences themselves. Your idea of storing the sequences in FASTA format should work -- I think DNAFASTAFormat is what you want. You could design a (second) file format to store the mapping of feature IDs to sequence IDs, and then create your "OTU Map" directory format that is composed of those two file formats (see PairedDNASequencesDirectoryFormat for an example). After that you can register a transformer that converts the "OTU Map" directory format into an appropriate data structure for your plugin.
Get in touch if you have any issues with this -- the docs are currently very sparse on this topic and we'll be reworking the API to make it easier to create directory formats in the future.
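To make the two-file idea concrete, here is a minimal sketch in plain Python of what such a transformer might end up producing. It does not use the QIIME 2 plugin API; the tab-separated map layout (feature ID followed by its sequence IDs) and the function names `parse_fasta`, `parse_otu_map`, and `load_otu_map` are assumptions made up for illustration.

```python
import io


def parse_fasta(handle):
    """Return {sequence_id: sequence} read from a FASTA file handle."""
    chunks = {}
    current = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            fields = line[1:].split()
            if not fields:
                raise ValueError("FASTA header without a sequence ID")
            current = fields[0]  # ID is the first token of the header
            if current in chunks:
                raise ValueError(f"duplicate sequence ID: {current}")
            chunks[current] = []
        elif current is None:
            raise ValueError("sequence data found before the first header")
        else:
            chunks[current].append(line)
    return {seq_id: "".join(parts) for seq_id, parts in chunks.items()}


def parse_otu_map(handle):
    """Return {feature_id: [sequence_id, ...]} from a tab-separated file.

    Assumed (hypothetical) layout: feature_id<TAB>seq_id<TAB>seq_id...
    """
    mapping = {}
    for line in handle:
        line = line.rstrip("\n")
        if not line:
            continue
        feature_id, *seq_ids = line.split("\t")
        mapping[feature_id] = seq_ids
    return mapping


def load_otu_map(fasta_handle, map_handle):
    """Combine both files into {feature_id: {sequence_id: sequence}}."""
    seqs = parse_fasta(fasta_handle)
    mapping = parse_otu_map(map_handle)
    # Every sequence ID named in the map must exist in the FASTA file.
    missing = sorted(
        {sid for ids in mapping.values() for sid in ids if sid not in seqs}
    )
    if missing:
        raise ValueError(f"sequence IDs missing from FASTA: {missing}")
    return {
        fid: {sid: seqs[sid] for sid in ids} for fid, ids in mapping.items()
    }


if __name__ == "__main__":
    fasta = io.StringIO(">s1 first read\nACGT\nAC\n>s2\nGGCC\n")
    otu_map = io.StringIO("otu1\ts1\ts2\n")
    print(load_otu_map(fasta, otu_map))
```

The cross-check between the two files is the kind of validation a directory format composed of both file formats would presumably want to enforce.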
from q2-types.
The QIIME 2 semantic type page says "FeatureData[Sequence]: A single unaligned sequence associated with a feature identifier (e.g. a representative sequence)." So shouldn't FeatureData[Sequences] be a generic .fasta file (unaligned, multiple seqs)? Because I need that one in addition to an OTU map :) Let me know if that exists, and if not I'll register both on q2-ghost-tree for now. Thanks!
This recently came up on the forum.
Something I think might work better than FeatureData[Sequences] would be a FeatureData[Features] which would let you describe some hierarchical relationship of features. In practice this would probably just look like a metadata file which would work with feature-table group. Your reads would be the feature-ids and the OTUs would be the column. This would compose easily with other stuff and you can kind of make it work right now.
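As a rough illustration of that metadata-driven grouping, the sketch below collapses per-read counts into per-OTU counts by summing. It is plain Python rather than the actual plugin code, and the table layout, the `group_features` name, and the example IDs are all assumptions.

```python
from collections import defaultdict


def group_features(table, feature_to_otu):
    """Sum counts of features (reads) that share the same OTU.

    table:          {sample_id: {feature_id: count}}
    feature_to_otu: {feature_id: otu_id}, i.e. one metadata column
                    indexed by feature ID (hypothetical layout)
    """
    grouped = {}
    for sample_id, counts in table.items():
        summed = defaultdict(int)
        for feature_id, count in counts.items():
            # A feature absent from the metadata column raises KeyError,
            # which surfaces incomplete mappings early.
            summed[feature_to_otu[feature_id]] += count
        grouped[sample_id] = dict(summed)
    return grouped


if __name__ == "__main__":
    table = {
        "sampleA": {"read1": 3, "read2": 2, "read3": 5},
        "sampleB": {"read1": 0, "read3": 1},
    }
    otu_column = {"read1": "otu1", "read2": "otu1", "read3": "otu2"}
    print(group_features(table, otu_column))
```

Because the mapping is just a column keyed by feature ID, the same structure could describe any level of a feature hierarchy, not only reads to OTUs.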
This is the intention of the new FeatureMap type that's been moved from q2-types-genomics. Closing out this issue as developers should figure out how this should be added to individual plugins.