Comments (4)
Hi @migrau,
Thanks for your query!
I tried to re-produce the issue with input that you provided but failed. The issue seems to be with cache - can you try deleting the cache dir and re-downloading it again.
Best regards,
Nakib
from ensembl-vep.
Hi nakib103,
I realised it was a problem with the folders hierarchy, maybe something changed in the last versions. In v109, specifying --dir /workspace/datasets/vep/
works (the v109 cache is in /workspace/datasets/vep/homo_sapiens/109_GRCh38
). But when downloading the v110, the folder created /workspace/datasets/vep/homo_sapiens/110.0_GRCh38
contains two sublevels more: /workspace/datasets/vep/homo_sapiens/110.0_GRCh38/homo_sapiens/110_GRCh38/
. If I specify /workspace/datasets/vep/homo_sapiens/110.0_GRCh38/
, vep works correctly.
from ensembl-vep.
Hi @migrau,
The cache directory structure is same between 109 and 110 -
$ tar -tf homo_sapiens_vep_109_GRCh38.tar.gz | head -n 4
homo_sapiens/109_GRCh38/
homo_sapiens/109_GRCh38/1/
homo_sapiens/109_GRCh38/1/1-1000000.gz
homo_sapiens/109_GRCh38/1/1-1000000_reg.gz
$ tar -tf homo_sapiens_vep_110_GRCh38.tar.gz | head -n 4
homo_sapiens/110_GRCh38/
homo_sapiens/110_GRCh38/1/
homo_sapiens/110_GRCh38/1/1-1000000.gz
homo_sapiens/110_GRCh38/1/1-1000000_reg.gz
And as you can see it has structure such as - <species>/<version>_<assembly>/<chromosome>/<cache_files>
. So when you gave the -c 110.0_GRCh38/
in your download command it extracted the cache files with the above folder structure under /workspace/datasets/vep/homo_sapiens/110.0_GRCh38
.
If you had given -c /workspace/datasets/vep
than you would expect the cache files under /workspace/datasets/vep/
same as 109.
Hope that answers your question.
Best regards,
Nakib
from ensembl-vep.
Thanks, nakin103. I didn't download the tar.gz directly but using the singularity exec ensembl-vep_110.0.sif INSTALL.pl -c 110.0_GRCh38/ -a cf -s homo_sapiens --ASSEMBLY GRCh38
command, and for some reason, the directory structure inside the downloaded folder is different, as I said.
109_GRCh38
contains all the files and folders (e.g. LRG_, Homo_sapiens.GRCh38.dna.toplevel.fa.gz) but 110.0_GRCh38
contains two subfolders more homo_sapiens/110_GRCh38
, before all the files and folders (e.g. LRG_, Homo_sapiens.GRCh38.dna.toplevel.fa.gz)
Anyway, it is solved now, thanks!
from ensembl-vep.
Related Issues (20)
- Error with dbNSFP plugin HOT 6
- Inconsistent consequence annotation HOT 2
- Fail to install ensembl-release-111 on MacOS 14.2.1 (23C71) `fatal error: 'lzma.h' file not found` HOT 5
- WARNING: Ignoring non-supported 'five_prime_utr' feature_type from Homo_sapiens.GRCh38.111.gtf.bgzip.gz HOT 3
- Empty fields HGVSc and HGVSp of INFO - CSQ after VCF annotation via GTF and Fasta HOT 2
- Absence MANE and canonical annotation tags in output VCF. HOT 3
- VEP custom anotation not working with gnomad 4.0 file HOT 1
- ERROR: DBI module not found. VEP requires the DBI perl module to function HOT 2
- Annotating with GNOMAD custom vcf makes frequencies become STRING and unable to filter HOT 7
- VEP 111 HGVS C dot annotating dups as Insertions HOT 4
- filter_vep not correctly filtering CADD_PHRED scores HOT 10
- [Question] What is the definition of "coding_sequence_variant"? Why are frameshifts no coding sequence variant? HOT 4
- can not call method "seq" HOT 6
- Normalisation-based allele matching algorithm and custom file HOT 2
- False warning messages with vep 111 when using the range input format HOT 4
- All variants are intergenic with NCBI GFF HOT 6
- filter_vep output file larger than input file HOT 2
- WARNING: Chromosome 22 not found in annotation sources or synonyms on line 1 HOT 8
- VEP in Google Batch fails when more than 5 custom databases are passed HOT 6
- trouble finding cache file "MSG: ERROR: Cache directory /..." HOT 7
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ensembl-vep.