dpryan79 / chromosomemappings
This repository contains chromosome/contig name mappings between UCSC <-> Ensembl <-> Gencode for a variety of genomes.
License: MIT License
Hi, thanks for sharing these tables. I'm working with Ensembl accessions and Genome Reference Consortium issue mapping, and the GenBank (nuccore) accession is the only way I can find to map the issue title to the chromosome/contig name.
This isn't as straightforward as it first looks for some patches; e.g., issue HG-1703 maps to contig name HG237_PATCH (via GenBank ID KE332506.1, matched through your table).
I'd really like to add these mappings to the tables of issues I've made from processing the GRC XML but I can't find the source you've used.
Could you please suggest where/how I could reproduce the mappings in the Ensembl <-> Gencode tables?
:)
The GRCh38_UCSC2ensembl.txt file is missing contig mappings on the hg38 UCSC side. When I use this file to remap UCSC contigs to Ensembl, the remapping fails because of the missing contigs.
For example, chr10_KN196480v1_fix, chr10_KQ090021v1_fix, chr11_KN196481v1_fix, etc. are all present in the file being remapped, but these contigs are not in GRCh38_UCSC2ensembl.txt.
I am unaware of other files that may be missing updated contigs, but there may be a few.
Could you update the GRCh38_UCSC2ensembl.txt file, and potentially other files that are missing updated contigs?
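For anyone hitting the same failure, here is a minimal sketch of how the uncovered contigs could be listed before remapping. It assumes the mapping file is a two-column, tab-separated table (source name, then target name) and treats rows with an empty target as unmapped. The function names and the example data are hypothetical, made up for illustration.

```python
import csv
import io


def load_mapping(handle):
    """Read a two-column, tab-separated mapping into a dict.

    Assumption: column 1 is the source (e.g. UCSC) name and column 2 the
    target (e.g. Ensembl) name; rows with an empty target count as unmapped.
    """
    mapping = {}
    for row in csv.reader(handle, delimiter="\t"):
        if not row or not row[0].strip():
            continue  # skip blank lines
        target = row[1].strip() if len(row) > 1 else ""
        if target:
            mapping[row[0].strip()] = target
    return mapping


def missing_contigs(contigs, mapping):
    """Return the sorted, unique contig names that have no entry in the mapping."""
    return sorted(set(contigs) - set(mapping))


# Hypothetical data for illustration; in practice, pass an open file handle
# for the mapping table and the contig names taken from the file to be remapped.
example_table = io.StringIO("chr1\t1\nchrM\tMT\nchrUn_example\t\n")
mapping = load_mapping(example_table)
contigs_in_my_file = ["chr1", "chr10_KN196480v1_fix", "chrM", "chr10_KN196480v1_fix"]
print(missing_contigs(contigs_in_my_file, mapping))
```

Running a check like this first gives a list of names to report, rather than having the remapping step fail partway through.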
Mouse got an update today and human a couple of days ago, so the "recent" patches/fixes need to be included. I should also check for updates to zebrafish and fruit fly. If someone else would like to do one of these, feel free :)
I should probably start doing this every six months or so.
Hi Ryan,
Thanks for this handy tool!
I've tried to use it, and the following error message pops up.
Is it something to do with the settings of my computer?
Unrecognized VM option 'AggressiveOpts'
Error: Could not create the Java Virtual Machine.
Error: A fatal exception has occurred. Program will exit.
Thanks!
Chiefcat
Thanks for compiling these tables. Could you comment on their sources?
Namely, the mappings between patches in Ensembl and NCBI are wrong, since Ensembl N-pads contigs while NCBI doesn't. This should be checked and clarified in the repo.
What is the license for this directory? Would it be possible to add the license to it? There's a tool in development in which we would like to use these files.
Thanks in advance!
I've created UCSC <-> Ensembl mappings for the galGal4 chicken genome, but I can't upload them because I don't have the permissions. My GitHub username is iljungr.