Comments (8)
@hkunerth can you check the contents of 230029460_WB.filtered.report to ensure that it isn't empty? Also, check that the reference FASTA files during DOWNLOAD_ASSEMBLY are being made.
I don't believe it's an issue with Top Hits but the way in which the fastq files are being registered in the overall pipeline.
from taxtriage.
The 230029460_WB.filtered.report exists and looks normal. DOWNLOAD_ASSEMBLY successfully downloaded for the other sample, 230029461, but never initiated for 230029460.
I can send you the nextflow report if it would be helpful at all. Happy to keep digging to try to solve this.
from taxtriage.
Sure, can you pass the report privately if possible? I'm curious if there is some underlying issue with how the system might be parsing some of the taxa downstream
from taxtriage.
Hi @Merritt-Brian, I've come back to this and I think the issue is simpler than what I'd initially thought. I believe it's centered on the Samplesheet format. I had been using an older format, with the following header row:
sample,single_end,from,platform,barcode,fastq_1,fastq_2,sequencing_summary,trim
I tested some Illumina samples using the example format:
sample,platform,fastq_1,fastq_2,sequencing_summary,trim
and no longer ran into the issue. I think this is due to the older fields being put into the META variable during the samplesheet_check process and passed along to later processes which have issues with too many or too few arguments.
That said, I still hit a snag when running ONT data using the updated samplesheet format. I kept the format the same, except for changing platform to OXFORD and not having a fastq_2. This generated the old error:
Nov-29 09:38:20.942 [Actor Thread 9] DEBUG nextflow.Session - Session aborted -- Cause: No signature of method: Script_b7c7d140$_runScript_closure1$_closure3$_closure24.call() is applicable for argument types: (ArrayList) values: [[[id:Specimen-5-NP-RNA, single_end:true, platform:OXFORD, fastq_1:/home/mdh/shared/taxtriage/231129_test/samples/Specimen-5-NP-RNA.dehosted.fastq.gz, ...], ...]]
Possible solutions: any(), any(), any(groovy.lang.Closure), each(groovy.lang.Closure), tap(groovy.lang.Closure), any(groovy.lang.Closure)
Nov-29 09:38:20.970 [Actor Thread 9] DEBUG nextflow.Session - The following nodes are still active:
It looks to me like there should be more arguments in that array. The successful Illumina run populates it with
[id:230028305_CSF, single_end:false, platform:ILLUMINA, fastq_1:/home/mdh/shared/taxtriage/231129_test/samples_illu/230028305.Illumina.kraken.dehosted.1.fastq.gz, fastq_2:/home/mdh/shared/taxtriage/231129_test/samples_illu/230028305.Illumina.kraken.dehosted.2.fastq.gz, trim:true, directory:false, sequencing_summary:null]
but for some reason it breaks for ONT after fastq_1.
My sample sheet just leaves the fastq_2 field blank, as it is in the ONT example here: https://github.com/jhuapl-bio/taxtriage/blob/main/examples/Samplesheet.csv but I'm wondering if this might be causing issues with the META field.
Thoughts?
from taxtriage.
Nevermind, I ran some newly generated Illumina data and ran into the same issue. It looks to be the same as this issue #45 raised by @erinyoung
Disregard my above message, but any help with this would be much appreciated. Thanks!
from taxtriage.
@hkunerth can you provide your .nextflow.log file as well as the execution report (html) here regarding the issue put in #45
from taxtriage.
Here's a log from a failed run with this issue.
.nextflow.log
I'm wondering if it is possible that the database that this run is using might be the cause of it. I changed a number of things in more recent runs but one was making sure it is pointed at the standard database (k2_standard_20230605) and I haven't run into this issue since then.
Thanks for the help.
from taxtriage.
Ah found the syntax error in your command
--top_per_taxa = '"10239:20:S' '2:20:S"'
should be: --top_per_taxa "10239:20:S 2:20:S"
i.e. no single quote and no equals sign. You can also see that on startup the value for top_per_taxa states it is "="
not "10239:20:S 2:20:S"
from taxtriage.
Related Issues (20)
- Add preliminary step: Host Removal via Alignment HOT 1
- Add Nanoplot/Fastq as outputs to Tower YAML file
- Retry NCBI Reference Pulls
- Aggregate Multi Contig/Accession/Chromosome Stats per Sample & Taxa
- Adding Submodule - Gene prediction and annotation for mapped reads HOT 2
- There are log of warning and channel.view() lines that get printed to the screen while running. HOT 1
- Workflow didn't finish HOT 11
- Add Submodule - Samtools Coverage Histogram File HOT 1
- Add Minmapq filter to samtools HOT 1
- Breadth (X) of Coverage Column no longer works with aggregate assembly alignment HOT 1
- Remove the split VCF by contig
- bin/merge_assemblies_conf.py assumes file content, no QA check HOT 2
- Add a Primary report file that is a single, simple table. Make the full multiqc file supplemental HOT 1
- NanoPlot fails for the test config files - Singularity Only HOT 1
- Conda Integrated
- Pipeline fails at Porechop step HOT 10
- Porechop causes workflow to fail if nothing trims from a sample
- Use local Refseq mirror for post-kraken2 accession querying
- conflicting licenses in the repo HOT 1
- Pathogens not reported as pathogen in report
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from taxtriage.