Comments (3)
From @jbloom22 on October 14, 2015 1:26
core statistical test, though i reworked the formulas:
https://en.wikipedia.org/wiki/Simple_linear_regression#Normality_assumption
from hail.
From @jbloom22 on October 28, 2015 5:53
@cseed ready for your review
from hail.
From @jbloom22 on October 28, 2015 21:2
Hail runtime for linreg with 10 PCs on profile.vds: 56s, 59s, 58s.
Hail runtime for variantqc on profile.vds: 35s, 34s, 35s.
Plink runtime for linreg with 10 PCs on profile.vds: 13s, 13s, 13s.
Hail runtime (8 cores) for linreg with 10 PCs on profile.vds: 23s, 25s, 23s.
LINREG:
/Users/jbloom/k3/build/install/k3/bin/k3 read -i ~/data/profile.vds linreg -f ~/data/profile.fam -c ~/data/profile.cov -o ~/data/profile.linreg
read: 1407.415486
linreg: 58336.701622
VARIANTQC:
/Users/jbloom/k3/build/install/k3/bin/k3 read -i ~/data/profile.vds variantqc -o ~/data/profile.variantqc
read: 1417.763771
variantqc: 35466.355219
PLINK:
create bed/bim/fam:
./plink --vcf ~/data/profile.vcf.bgz
run regression:
time ./plink --bfile plink --double-id --pheno ~/data/profile.pheno
--allow-no-sex --covar ~/data/profile.covar --linear --out
~/data/plinkTest
PLINK v1.90b3w 64-bit (3 Sep 2015) https://www.cog-genomics.org/plink2
(C) 2005-2015 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to /Users/Jon/data/plinkTest.log.
Options in effect:
--allow-no-sex
--bfile plink
--covar /Users/Jon/data/profile.covar
--double-id
--linear
--out /Users/Jon/data/plinkTest
--pheno /Users/Jon/data/profile.pheno
16384 MB RAM detected; reserving 8192 MB for main workspace.
24885 variants loaded from .bim file.
2535 people (0 males, 0 females, 2535 ambiguous) loaded from .fam.
Ambiguous sex IDs written to /Users/Jon/data/plinkTest.nosex .
2535 phenotype values present after --pheno.
Using 1 thread.
Warning: This run includes BLAS/LAPACK linear algebra operations which
currently disregard the --threads limit. If this is problematic, you
may want to recompile against single-threaded BLAS/LAPACK.
--covar: 10 covariates loaded.
Before main variant filters, 2535 founders and 0 nonfounders present.
Calculating allele frequencies... done.
Total genotyping rate is 0.907692.
24885 variants and 2535 people pass filters and QC.
Phenotype data is quantitative.
Writing linear model association results to
/Users/Jon/data/plinkTest.assoc.linear ... done.
real 0m13.167s
user 0m13.071s
sys 0m0.080s
from hail.
Related Issues (20)
- ClassCastException when comparing hl.Locus to row key
- https:// is considered invalid by hailctl config set batch/remote_tmpdir
- Hail is missing deploys 0.2.114 and 0.2.115 in Azure HOT 16
- hailtop.fs `ReadableStream` does not support seeking on streams from blob storage
- 500 inside notebook of hailctl dataproc start cluster
- In Query-on-Batch in Azure, a 1500 partition VCF fails to import
- QoB's job URL should use ?last_job_id to send us to the page containing the current driver.
- QoB should include some cluster statistics in the terminal output HOT 8
- QoB should start a new Batch after a keyboard interrupt. HOT 2
- [QoB] new transient error: java.net.SocketTimeoutException: connect timed out
- Batch ServiceBackend: calling run() on a batch with 0 jobs leads to a cryptic ValueError HOT 1
- Hail does not properly support Haploid genotype calls HOT 1
- Batch cannot make directories in /io if the user is not root HOT 2
- [batch] it is not simple and straightforward to write a Python script that uses Python jobs which need: Hail, a set of local Python files, and third party dependencies.
- [docker] hailgenetics/hail should have tags for each supported Python version HOT 1
- [query] Hail should support more recent version of VEP
- `hl.vds.to_dense_mt` incorrectly densifies the last reference block of a chromosome into the beginning of the next HOT 1
- ResourceGroup issues modelling writing a bgzipped file and its TBI index via `write_output` HOT 10
- hailctl config is too user unfriendly; it should: error or warn on invalid names and document the list of valid names
- [batch] Visually distinguish closed billing projects from open ones on the billing limits page
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hail.