Comments (5)
Hi Quinn,
Really having to rack my memory here, but I think unfortunately the ":" limitation comes through with the use of PLINK by the package downstream (it is the one that can't handle colons in the name). Could you post a couple of lines of the #CHROM POS ID columns here? That might help us plan a solution!
Cheers,
Alana
from gbs_snp_filter.
Hi Alana,
Thanks for your help! Attached is the first few lines of my vcf file (only the first 3 columns). I've thinned my vcf file with plink previously and haven't had problems. I'm primarily interested in the HWE filtering and using this script to filter out loci that are out of HWE in each of the 4 populations (per Pearman et al. (2022) - Commonly used Hardy–Weinberg equilibrium filtering schemes impact population structure inferences using RADseq data).
Thanks!
chrm_loci_populations.snps.txt
from gbs_snp_filter.
Hi Quin,
Cool - to me it looks like your ID column is going to be unique even if everything following the first ':' is stripped away (based on this wee sample size). If you are worried that that might not be the case (i.e. you might have multiple sites that start with the same number), an option would be to run some kind of regex find and replace, and substitute something like an underscore for the colon. Can you test out and see whether the scripts work OK for you? (hopefully no other bugs pop up!). If it does, I will tweak the README to say you can have colons, but anything before the colon needs to be unique.
from gbs_snp_filter.
Great, thanks! I filtered to have a single snp per locus, so they should all be unique (that may not be the case for those who kept multiple snps per locus). The script worked, although I added :.*
to the 8th line because I kept getting an error saying my GBS_SNP_filter.txt file didn't have 8 lines. Thank you for your help!
from gbs_snp_filter.
Thanks Quinn - sorry about this slow response - I'll close the issue and add that workaround to the readme.
from gbs_snp_filter.
Related Issues (11)
- Error in -removepops HOT 14
- unknown issue HOT 6
- Error in generating .HWE.vcf file HOT 4
- Failing after PLINK for stickleback test data HOT 1
- Add some guidance on what vcf to use
- Error in generating the .ld.vcf file (the final step) HOT 4
- Check the formatting of the vcf
- Address some deprecating in code HOT 1
- ld performance in the package HOT 2
- Error using HWE.R HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gbs_snp_filter.