Comments (4)
Hi Clemens,
we think that this task is better left to PLINK. For subsetting, take a look at --keep or --remove for samples and --extract or --exclude for variants. For merging, there is --merge-list.
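A minimal sketch of how these flags might be combined, assuming PLINK 1.9 is installed as plink and the data are binary filesets. The prefixes cohort_a, cohort_b and cohort_c, the sample and variant IDs, and the list file names are all hypothetical. --bfile, --make-bed and --out are standard PLINK flags that are not mentioned above.

```shell
#!/bin/sh
# Hypothetical fileset prefixes and IDs; replace them with your own.

# Samples to keep: one "FID IID" pair per line.
printf 'FAM1 IND1\nFAM2 IND2\n' > keep_samples.txt

# Variants to keep: one variant ID per line.
printf 'rs0001\nrs0002\n' > keep_variants.txt

# Filesets to merge into the reference fileset: one prefix per line.
printf 'cohort_b\ncohort_c\n' > merge_list.txt

if command -v plink >/dev/null 2>&1; then
    # Subset the samples and variants of cohort_a in one call.
    plink --bfile cohort_a \
          --keep keep_samples.txt \
          --extract keep_variants.txt \
          --make-bed --out cohort_a_subset

    # Merge cohort_b and cohort_c into cohort_a.
    plink --bfile cohort_a \
          --merge-list merge_list.txt \
          --make-bed --out merged
else
    echo "plink not found on PATH; only the list files were written" >&2
fi
```

Swapping --keep for --remove, or --extract for --exclude, drops the listed samples or variants instead of keeping them.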
Cheers,
Alex
from bgdata.
Alright - thanks for the fast reply!
I would like to go for a pure R workflow one day without any system dependencies, but in that case good old system() will have to do the job a little longer.
Best,
Clemens
You might be interested in the write_bed function of the BGLR package then. Your matrix needs to fit into memory and it only writes a .bed file, i.e., you need to generate the .bim and .fam files manually to get a full PLINK fileset.
The snp_writeBed function of the bigsnpr package also looks promising, but I haven't tried it yet.
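As a rough sketch of what generating the two companion files manually could look like: both are plain text with six columns per line. The prefix mydata and all sample and variant values below are made up for illustration. The rows must appear in the same order as the samples and variants stored in the .bed file.

```shell
#!/bin/sh
# Hypothetical data for a fileset with two samples and two variants.

# .fam columns: family ID, individual ID, father ID, mother ID,
# sex (1 = male, 2 = female, 0 = unknown), phenotype (-9 = missing).
printf '%s\n' \
    'FAM1 IND1 0 0 1 -9' \
    'FAM2 IND2 0 0 2 -9' > mydata.fam

# .bim columns (tab-separated): chromosome, variant ID,
# genetic distance, base-pair position, allele 1, allele 2.
# printf reuses the format string for each group of six arguments.
printf '%s\t%s\t%s\t%s\t%s\t%s\n' \
    1 rs0001 0 10583 A G \
    1 rs0002 0 10611 C T > mydata.bim
```

With mydata.bed written separately, the three files share one prefix and can be passed to PLINK with --bfile mydata.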
Cheers,
Alex
Ah - thanks for these hints. To my understanding, both of these packages rely on loading the data fully into memory, and I wonder whether this can be avoided for merging and subsetting. Your approach looks very promising in that regard.
Independent of this specific application, bigsnpr is pretty interesting. Thanks for making me aware!