Giter Club home page Giter Club logo

affymm's Introduction

Hello! ๐Ÿ‘‹

I'm a bioinformatics analyst with Hunter Moseley at University of Kentucky's Markey Cancer Center. I spend most of my time both analyzing various -omics datasets (transcriptomics, metabolomics, etc) and developing new tools and methods for analyzing these kinds of data. This means I'm heavily dependent on others sharing their data, and proudly consider myself a research parasite.

I really, really enjoy programming in R, although I also dabble occasionally in Python. I occasionally write about my research, personal projects, and my thoughts on science on my blog.

  • ๐Ÿ’ฌ Ask me about quality control and quality assurance of high-feature -omics datasets, writing R packages, and using {targets} for data analyses.
  • ๐Ÿ› ๏ธ I maintain a few packages related to -omics data analysis.

affymm's People

Contributors

rmflight avatar

Stargazers

 avatar

Watchers

 avatar  avatar

Forkers

samarthbhaskar

affymm's Issues

references

Need to figure out how to work with the references in the document. Might try the Pandoc way, but have also seen a proposal for an R package.

get exon intervals in function

Need to return both the actual TMmm intervals, as well as the merged Exon intervals, and the original exon intervals returned

create md version of paper

I want to create a markdown version of the submitted paper, and host this on the gh-pages branch of the repo.

This will require a few things:

Rewriting the output so that we save the alignments for the mm and pm probes, and their overlapping exons from merged exons and base exons
Basically I want to be able to do the full paper generation from the output text files, or the RData file with all of the summary of results.

Saving the actual intensity difference matrix so we can refer back to it

Refactoring all of the big code chunks in the lapply into sub-functions so that we don't have to worry about deleting objects after we are done with them.

pdf of paper

Figure out how generate a pdf using pandoc and pdflatex. Currently we get an error from pdflatex when trying to generate the pdf. Not sure why.

journal club presentation

Using the paper as a guideline, create a presentation for bioinformatics journal club next week. I really want to use markdown slides, so figure out how to do it.

new analysis for non-unique PMs

So in practicing the presentation the night before I give the talk at Biomedcom, I realize that perhaps we did not do the analysis of non-unique PM alignment correctly. The current analysis looks at it by simply the numbers of PMs from each class that had more than one alignment to the genome, and the percentages are then calculated based on the total of probes that aligned multiply.

However, what I realized we should do is take it as a percentage of the total number of probes on the array in that class. For example:

AT should be:

of at probes that multiply align / # of at probes on the chip

figure out hg19 mm exon overlaps

with the current data, how do we count the set of mm probes that align to all 5 human builds that overlap with exons only?

The problem with the current data is that the probeAlign that is saved, is only based on those probes that align once in the genome, not all alignments. But maybe that will be good enough ...

svg figures

Where appropriate, use the svg versions of the figures

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.