Scripts used in the analysis of the 241-mammal Zoonomia dataset for the upcoming publication: "Evolutionary constraint and innovation across hundreds of placental mammals" UNDER REVIEW
These scripts and data files are provided with the main purpose of reproducibility of the analyses presented in the paper. Any questions regarding the content and use of this repository should be directed to matthew dot christmas aT imbim dot uu dot se
This script provides an example of how we calculated a lower-bound estimate of the proportion of the genome under purifying selection based on phyloP values, for human, chimpanzee, house mouse, dog and bat
This script was used to analyse constraint at 4-fold degenerate sites and their overlap with transcription factor binding sites
This script was used to analyse supplementary data file 2, for measuring constraint across the human genome in 100 kb bins
This script was used for analysing constraint in gene deserts and relating this to the location of gene deserts near developmental transcription factors
This script was used for analysing features of UNICORNs, in particular for showing they contain less variation with lower allele frequencies than other unannotated intergenic regions
https://github.com/MattChristmas/Zoonomia/blob/main/getUnicornOverlapsWithBrainDatasets.sh
This script was used for evaluating the overlap of UNICORNs with open chromatin regions from different brain regions, different motor cortex cell types, and different brain developmental stages
This script was used for analysing the distribution of constraint across human repeat families
https://github.com/MattChristmas/Zoonomia/blob/main/extract_OR_hits.pl
This script was used to extract regions of the genome containing putative olfactory receptor sequences identified using tblastx
cds_coverage.py
was used after bedtools coverage
in the short read alignment based gene loss step. It iterates over coverage files and makes a human-readable tabulated file.
gaps_exonerate.py
summarizes where gaps are located in the GFF output from exonerate.
https://github.com/MattChristmas/Zoonomia/blob/main/hibernationCodeAll.R
This script was used for the hibernation RERconverge analyses.