Giter Club home page Giter Club logo

Comments (4)

nakib103 avatar nakib103 commented on September 17, 2024

Hi @migrau,

Thanks for your query!

I tried to re-produce the issue with input that you provided but failed. The issue seems to be with cache - can you try deleting the cache dir and re-downloading it again.

Best regards,
Nakib

from ensembl-vep.

migrau avatar migrau commented on September 17, 2024

Hi nakib103,
I realised it was a problem with the folders hierarchy, maybe something changed in the last versions. In v109, specifying --dir /workspace/datasets/vep/ works (the v109 cache is in /workspace/datasets/vep/homo_sapiens/109_GRCh38). But when downloading the v110, the folder created /workspace/datasets/vep/homo_sapiens/110.0_GRCh38 contains two sublevels more: /workspace/datasets/vep/homo_sapiens/110.0_GRCh38/homo_sapiens/110_GRCh38/. If I specify /workspace/datasets/vep/homo_sapiens/110.0_GRCh38/, vep works correctly.

from ensembl-vep.

nakib103 avatar nakib103 commented on September 17, 2024

Hi @migrau,

The cache directory structure is same between 109 and 110 -

$ tar -tf homo_sapiens_vep_109_GRCh38.tar.gz | head -n 4
homo_sapiens/109_GRCh38/
homo_sapiens/109_GRCh38/1/
homo_sapiens/109_GRCh38/1/1-1000000.gz
homo_sapiens/109_GRCh38/1/1-1000000_reg.gz

$ tar -tf homo_sapiens_vep_110_GRCh38.tar.gz | head -n 4
homo_sapiens/110_GRCh38/
homo_sapiens/110_GRCh38/1/
homo_sapiens/110_GRCh38/1/1-1000000.gz
homo_sapiens/110_GRCh38/1/1-1000000_reg.gz

And as you can see it has structure such as - <species>/<version>_<assembly>/<chromosome>/<cache_files>. So when you gave the -c 110.0_GRCh38/ in your download command it extracted the cache files with the above folder structure under /workspace/datasets/vep/homo_sapiens/110.0_GRCh38.

If you had given -c /workspace/datasets/vep than you would expect the cache files under /workspace/datasets/vep/ same as 109.

Hope that answers your question.

Best regards,
Nakib

from ensembl-vep.

migrau avatar migrau commented on September 17, 2024

Thanks, nakin103. I didn't download the tar.gz directly but using the singularity exec ensembl-vep_110.0.sif INSTALL.pl -c 110.0_GRCh38/ -a cf -s homo_sapiens --ASSEMBLY GRCh38 command, and for some reason, the directory structure inside the downloaded folder is different, as I said.
109_GRCh38 contains all the files and folders (e.g. LRG_, Homo_sapiens.GRCh38.dna.toplevel.fa.gz) but 110.0_GRCh38 contains two subfolders more homo_sapiens/110_GRCh38, before all the files and folders (e.g. LRG_, Homo_sapiens.GRCh38.dna.toplevel.fa.gz)
Anyway, it is solved now, thanks!

from ensembl-vep.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.