Comments (2)
@Wen1953 If I understand correctly, you have created your own dataset, including, for example, files like genemap.gff
and reference.fasta
.
It seems that Nextclade tries to access the array gap_open_close
here on line 23:
at index 1709
, but it's length is less than that.
The gap_open_close
is created on line 9, such that its length is equal to the length of reference sequence + 2, i.e. the length of sequence in reference.fasta
, plus 2:
The array access on line 23 is happening on gene boundaries, as described by the genemap.gff
.
My current hypothesis is that genes in your gene map go beyond the reference sequence length. This causes out-of-bounds array access, which causes the crash. Please make sure that your gene map correctly describes the reference sequence, and all genes fit inside its range.
It can also be an off-by-one bug in Nextclade. But I am currently not convinced that it's the case.
If in doubt, please upload your full dataset (e.g. to a GitHub repo) so that we can investigate.
So far Nextclade datasets have been mostly created by the Nextclade team, and we try to ensure that the different files within a dataset are compatible with each other. However, software should never panic and crash and also because there is an increase of interest by the community in creating new datasets, we should definitely add some error handling in this part of the code and to produce a more meaningful error message. If you have time and forces, feel free to contribute an improvement. This would help other people creating Nextclade datasets.
from nextclade.
Thanks @ivan-aksamentov . Your hypothesis was right, it was a problem with the files and the reference sequence length and not a nexclade bug. We are trying to work with some avian flu sequence even though there is not a database in nextclade. We are convinced that contribute with more databases to other pathogens will be very useful to science comunity.
from nextclade.
Related Issues (20)
- 3.0.0 version not retrieving RSV datasets HOT 1
- Can support be extended for SC2 Datasets V2 for another month? HOT 4
- Feature Request: Dataset download all datasets within specified path HOT 4
- Beginners Help with Nextclade CLI HOT 6
- Empty input file causes uncaught error in v3 (it didn't in v2)
- Erroneous Clade Assignment or More Refined Tool? HOT 4
- Add a BA.1 reference for the web nextclade version HOT 4
- error when using `nextclade dataset get --verbosity` flag HOT 3
- 21L Tree Updates? HOT 2
- `--input-pcr-primers` listed in CLI help options despite being removed in v3 HOT 2
- When using `?input-fasta=` url query param without specifying dataset, web auto-starts analysis (prematurely) HOT 5
- Scrollbar shown for dataset names in dataset picker HOT 9
- how to generate the result table by the cli version auspice HOT 4
- output TSV column(s) for missing bases at beginning and end of sequence? HOT 1
- --input-dataset parameter HOT 5
- Update Fred Hutch logo
- How to get the latest Lineage- with CLI HOT 4
- Community build cache validity bug HOT 2
- Developer guide uses deprecated CLI option
- docs: document nextalign-like use-case HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nextclade.