
Comments (4)

DrYak commented on May 29, 2024

Hi!
We have recently made extensive updates to the master branch.
Could you please give it another try?

Note:
With this new branch, V-pipe should be able to set up a base configuration for HIV automatically if you give it a vpipe.config that looks like this:

[general]
virus_base_config=hiv

[output]
snv=true
local=true
global=false
visualization=true
QA=true

Note: this also works with YAML/JSON files (named config.yaml, or specified with the command-line option --configfile):

general:
  virus_base_config: hiv

output:
  snv: true
  local: true
  global: false
  visualization: true
  QA: true

from v-pipe.

jiaqili-fl commented on May 29, 2024

If the files are in *fastq.gz format, the error message looks a bit different:
Building DAG of jobs...
MissingInputException in line 10 of /Users/sviat/V-pipe/rules/quality_assurance.smk:
Missing input files for rule gunzip:
samples/ADA1038B/20210521/extracted_data/R1.fastq.gz

This is caused by the name of your FASTQ file: after unzipping the .gz file, its name should be *R1.fastq.


aitorgarzia commented on May 29, 2024

Hi,
I am sorry to come back to this type of problem. I am trying to run the first SARS-CoV-2 test (datasets SRR10903401 and SRR10903402), but I always get the same error and I do not know what I am doing wrong.

MissingInputException in rule gunzip in file /V-pipe/workflow/rules/quality_assurance.smk, line 11:
Missing input files for rule gunzip:
output: results/SRR10903401/20200102/extracted_data/R1.fastq
wildcards: file=results/SRR10903401/20200102/extracted_data/R1, ext=fastq
affected files:
results/SRR10903401/20200102/extracted_data/R1.fastq.gz

Even when I run ./vpipe --dryrun, I get the same error.

VPIPE_BASEDIR = /V-pipe/workflow
No virus base configuration, using defaults
WARNING: protocols YAML look-up file </V-pipe/workflow/../resources/sars-cov-2/primers.yaml> specified, but no sample ever uses it: fourth column absent from samples TSV file.
Building DAG of jobs...
MissingInputException in rule gunzip in file /V-pipe/workflow/rules/quality_assurance.smk, line 11:
Missing input files for rule gunzip:
output: results/SRR10903401/20200102/extracted_data/R1.fastq
wildcards: file=results/SRR10903401/20200102/extracted_data/R1, ext=fastq
affected files:
results/SRR10903401/20200102/extracted_data/R1.fastq.gz

Could someone please help me?
Thanks,


DrYak commented on May 29, 2024

@aitorgarzia : This message happens when Snakemake has trouble finding the input files.

To create this file:

…
Missing input files for rule gunzip:
output: results/SRR10903401/20200102/extracted_data/R1.fastq
…

...it should in theory search for a file called:

  • either samples/SRR10903401/20200102/raw_data/SRR10903401_R1.fastq
  • or samples/SRR10903401/20200102/raw_data/SRR10903401_R1.fastq.gz

(see Preparing a small dataset section)

Please check that it corresponds.
Common troubles:

  • typos: the names of the directories must match the columns of the samples.tsv file

    samples/SRR10903401/20200102/raw_data/ vs

    SRR10903401	20200102	250

    Check that no mistake crept in while copy-pasting names, either when naming the directories or when writing the TSV.

  • subdirectory: the files must be inside a raw_data/ subdirectory:

    samples/SRR10903401/20200102/raw_data/SRR10903401_R1.fastq

  • filename: in paired-end mode, the files must follow the format {sample}_R{n}.fastq, i.e. the base name must have _R1 or _R2 appended for each member of the read pair:

    samples/SRR10903401/20200102/raw_data/SRR10903401_R1.fastq
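
The three checks above can be run in one go with a small script. This is only a sketch and not part of V-pipe: find_missing_inputs is a hypothetical helper, and it assumes a paired-end layout, a samples/ directory as in the examples above, and a samples.tsv whose first two tab-separated columns are the sample name and the date.

```python
import csv
from pathlib import Path


def find_missing_inputs(samples_tsv, samples_dir="samples"):
    """Return (sample, date, read) triples for which no raw file was found.

    Hypothetical helper, not part of V-pipe. For each TSV row it looks for
    samples/{sample}/{date}/raw_data/{sample}_R{n}.fastq or the same name
    with a .gz suffix, as described in the list above (paired-end assumed).
    """
    missing = []
    with open(samples_tsv, newline="") as handle:
        for row in csv.reader(handle, delimiter="\t"):
            # Skip blank lines, rows with fewer than two columns, and comments.
            if len(row) < 2 or row[0].startswith("#"):
                continue
            sample, date = row[0].strip(), row[1].strip()
            raw_data = Path(samples_dir) / sample / date / "raw_data"
            for read in ("R1", "R2"):
                candidates = [
                    raw_data / f"{sample}_{read}.fastq",
                    raw_data / f"{sample}_{read}.fastq.gz",
                ]
                if not any(path.is_file() for path in candidates):
                    missing.append((sample, date, read))
    return missing
```

Calling find_missing_inputs("samples.tsv") from the directory where you run ./vpipe returns an empty list when every row has both read files in place. Any triple it returns points to a typo, a missing raw_data/ subdirectory, or a file name without the _R1/_R2 suffix.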

