Giter Club home page Giter Club logo

Comments (10)

willmclaren avatar willmclaren commented on August 14, 2024

My guess is that the chromosome names are not the same between your GFF, VCF and possibly FASTA.

The code should be able to cope if the chromosome names differ by the prefix "chr" (e.g. chr8 or 8). If, however, your GFF perhaps uses accessions for chromosome names (e.g. NC_000008.11 for human chromosome 8) and the VCF uses e.g. chr8, then you would need to supply a synonyms file mapping these with --synonyms [file].

Synonym files for each species are available in the relevant cache file; if you have the human 87 GRCh38 cache file, for example, the synonym file can be found as ~/.vep/homo_sapiens/87_GRCh38/chr_synonyms.txt

from ensembl-vep.

willmclaren avatar willmclaren commented on August 14, 2024

No further comment so closing...

from ensembl-vep.

zillurbmb51 avatar zillurbmb51 commented on August 14, 2024

Hello; I am having the same problem here. I did not use fasta or gff, rather than, I ran the vep as follows:
./vep -i /Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf -o /Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf.vep.txt -offline --assembly GRCh38
And getting the following error.

WARNING: Chromosome chr1 not found in annotation sources or synonyms on line 1
WARNING: Chromosome chr10 not found in annotation sources or synonyms on line 63
WARNING: Chromosome chr11 not found in annotation sources or synonyms on line 81
WARNING: Chromosome chr12 not found in annotation sources or synonyms on line 113
WARNING: Chromosome chr13 not found in annotation sources or synonyms on line 124
WARNING: Chromosome chr14 not found in annotation sources or synonyms on line 138
WARNING: Chromosome chr15 not found in annotation sources or synonyms on line 155
WARNING: Chromosome chr16 not found in annotation sources or synonyms on line 175
WARNING: Chromosome chr17 not found in annotation sources or synonyms on line 195
WARNING: Chromosome chr18 not found in annotation sources or synonyms on line 216
WARNING: Chromosome chr19 not found in annotation sources or synonyms on line 240
WARNING: Chromosome chr2 not found in annotation sources or synonyms on line 266
WARNING: Chromosome chr20 not found in annotation sources or synonyms on line 322
WARNING: Chromosome chr21 not found in annotation sources or synonyms on line 330
WARNING: Chromosome chr22 not found in annotation sources or synonyms on line 338
WARNING: Chromosome chr3 not found in annotation sources or synonyms on line 359
WARNING: Chromosome chr4 not found in annotation sources or synonyms on line 381
WARNING: Chromosome chr5 not found in annotation sources or synonyms on line 420
WARNING: Chromosome chr6 not found in annotation sources or synonyms on line 460
WARNING: Chromosome chr7 not found in annotation sources or synonyms on line 478
WARNING: Chromosome chr8 not found in annotation sources or synonyms on line 507
WARNING: Chromosome chr9 not found in annotation sources or synonyms on line 523
WARNING: Chromosome chrX not found in annotation sources or synonyms on line 545

And finally got no result. Any help? Best, Zillur

from ensembl-vep.

at7 avatar at7 commented on August 14, 2024

Hi @zillurbmb51,
could you please share one of your input lines? Which cache files did you install?
Thanks,
Anja

from ensembl-vep.

zillurbmb51 avatar zillurbmb51 commented on August 14, 2024

Thank you Anja. I do not remember which cache files I installed. Is there any way to find it out? Also what did you mean by input lines? Best, Zillur

from ensembl-vep.

at7 avatar at7 commented on August 14, 2024

I just noticed that you have to specify that you want to use the cache files with --cache. The cache files are installed by default under $ENV{HOME}.'/.vep. By input lines I mean some of your input variants which you want to annotate with VEP from /Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf.

from ensembl-vep.

zillurbmb51 avatar zillurbmb51 commented on August 14, 2024

Thank you Anja, Here is the ./vep directory in my home. Among these which one is the cache file?

(base) rcmi-29:snpcall mzillur$ ls /Users/mzillur/.vep/
Plugins homo_sapiens homo_sapiens_refseq
(base) rcmi-29:snpcall mzillur$ ls /Users/mzillur/.vep/homo_sapiens/
97_GRCh37 97_GRCh38
(base) rcmi-29:snpcall mzillur$ ls /Users/mzillur/.vep/homo_sapiens_refseq/
97_GRCh37
(base) rcmi-29:snpcall mzillur$ ls /Users/mzillur/.vep/Plugins/
AncestralAllele.pm CSN.pm Conservation.pm ExAC.pm G2P.pm Gwava.pm LoF.pm MPC.pm NearestGene.pm REVEL.pm TSSDistance.pm miRNA.pm
Blosum62.pm Carol.pm Downstream.pm ExACpLI.pm GO.pm LD.pm LoFtool.pm MTR.pm Phenotypes.pm SameCodon.pm dbNSFP.pm plugin_config.txt
CADD.pm Condel.pm Draw.pm FATHMM_MKL.pm GeneSplicer.pm LOVD.pm LocalID.pm MaxEntScan.pm ProteinSeqs.pm SpliceRegion.pm dbscSNV.pm
(base) rcmi-29:snpcall mzillur$ ls /Users/mzillur/.vep/homo_sapiens/97_GRCh38/
Homo_sapiens.GRCh38.dna.toplevel.fa.gz
Attached is my vcf file I want to annotate. Best, Zillur
combined.n2.vcf.Q30d15.vcf.txt

from ensembl-vep.

at7 avatar at7 commented on August 14, 2024

Hi @zillurbmb51,
thank you for sending me your input file. The cache files which you need for annotating your input file are stored under /Users/mzillur/.vep/homo_sapiens/97_GRCh38. I didn't have any problems annotating your input file with the 97_GRCh38 cache files. Could you please check if you have any files in the /Users/mzillur/.vep/homo_sapiens/97_GRCh38/ directory? Could you also please check your output file's header? There should be a line near the top starting with: ## Using cache in. Can you please let me know what it says in your output file (/Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf.vep.txt)?

Thank you,
Anja

from ensembl-vep.

zillurbmb51 avatar zillurbmb51 commented on August 14, 2024

Thank you very much. In /Users/mzillur/.vep/homo_sapiens/97_GRCh38/ I have only one file and it is: Homo_sapiens.GRCh38.dna.toplevel.fa.gz
I have tried:
./vep -i /Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf -o /Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf.vep.2.txt -offline --assembly GRCh38 --cache /Users/mzillur/.vep/homo_sapiens/97_GRCh38 and
./vep -i /Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf -o /Volumes/Macintosh\ HD2/data/UPR_CANCER_2/snpcall/combined.n2.vcf.Q40d20.vcf.vep.2.txt -offline --assembly GRCh38 --cache /Users/mzillur/.vep/homo_sapiens/97_GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa.gz
Attached are the output files. Best, Zillur
combined.n2.vcf.Q40d20.vcf.vep.3.txt
combined.n2.vcf.Q40d20.vcf.vep.2.txt
combined.n2.vcf.Q40d20.vcf.vep.2.txt_warnings.txt
combined.n2.vcf.Q40d20.vcf.vep.3.txt_warnings.txt

from ensembl-vep.

helensch avatar helensch commented on August 14, 2024

Hi

Thank you for looking for files in /Users/mzillur/.vep/homo_sapiens/97_GRCh38/.

As you only have a FASTA file, it looks like you do not have cache files installed. In the directory 97_GRCh38 there should be files (info.txt and chr_synonyms.txt) and directories (e.g. directories called "1", "2" etc) that contain files.

The caches for VEP can be installed using the installer INSTALL.pl or manually downloaded. For information on downloading the cache see:
https://www.ensembl.org/info/docs/tools/vep/script/vep_cache.html#cache

Helen

from ensembl-vep.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.