Giter Club home page Giter Club logo

Comments (5)

laninsky avatar laninsky commented on July 27, 2024

Hi Quinn,

Really having to rack my memory here, but I think unfortunately the ":" limitation comes through with the use of PLINK by the package downstream (it is the one that can't handle colons in the name). Could you post a couple of lines of the #CHROM POS ID columns here? That might help us plan a solution!

Cheers,

Alana

from gbs_snp_filter.

quinn-ca avatar quinn-ca commented on July 27, 2024

Hi Alana,

Thanks for your help! Attached is the first few lines of my vcf file (only the first 3 columns). I've thinned my vcf file with plink previously and haven't had problems. I'm primarily interested in the HWE filtering and using this script to filter out loci that are out of HWE in each of the 4 populations (per Pearman et al. (2022) - Commonly used Hardy–Weinberg equilibrium filtering schemes impact population structure inferences using RADseq data).

Thanks!

chrm_loci_populations.snps.txt

from gbs_snp_filter.

laninsky avatar laninsky commented on July 27, 2024

Hi Quin,

Cool - to me it looks like your ID column is going to be unique even if everything following the first ':' is stripped away (based on this wee sample size). If you are worried that that might not be the case (i.e. you might have multiple sites that start with the same number), an option would be to run some kind of regex find and replace, and substitute something like an underscore for the colon. Can you test out and see whether the scripts work OK for you? (hopefully no other bugs pop up!). If it does, I will tweak the README to say you can have colons, but anything before the colon needs to be unique.

from gbs_snp_filter.

quinn-ca avatar quinn-ca commented on July 27, 2024

Great, thanks! I filtered to have a single snp per locus, so they should all be unique (that may not be the case for those who kept multiple snps per locus). The script worked, although I added :.* to the 8th line because I kept getting an error saying my GBS_SNP_filter.txt file didn't have 8 lines. Thank you for your help!

from gbs_snp_filter.

laninsky avatar laninsky commented on July 27, 2024

Thanks Quinn - sorry about this slow response - I'll close the issue and add that workaround to the readme.

from gbs_snp_filter.

Related Issues (11)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.