Giter Club home page Giter Club logo

class_2020's Introduction

class_2020's People

Contributors

arda3929 avatar arsweari avatar boulderrinnlab avatar giuliacorbet avatar graycenw avatar msmallegan avatar nebenb avatar sprasava avatar thaoh51 avatar tomwieser avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

class_2020's Issues

Binding versus expression

Plot logTPM versus number of TFs bound (similar to presentation).
Plot lncRNA and mRNA seperate and Also include "reservoirs"

08_ sweep

Here is the last little bits of 08 to finalize
08 GOOD SHAPE (need neighbor window analysis)
~55 there is something off with lncRNA starting out grey and then turning same pink :)
~93 label which plot is res non res -- add value etc
~223 label which is res or non res
~248 Anova or multi-group test for each promoter type
~284 Let's do the leave out res window and calculate neighbor window

lncRNA vs mRNA promoter enrichment

Calculate a chi-squared of observed -vs - expected overlaps for mRNA and lncRNA promoters.
Calculate phi (effect size)
Plot Phi -vs - pvalue

01 last bits

I fixed most of the other counts in previous one and now just left with:

~60: quartile of 250 peaks filter.
~120: ks.test doesn't run
~255 The figure of peaks vs overlap for all promoters has intercept of 1,200 peaks?

****** 354 I did a length and summary and summary seems that min is 46 overlaps so no dbps don't bind any promters, but length is 161 ? ******* please double check

~364 need help with ggplot -- seems like there is a simply way to plot this with indexing lncRNA and mRNa overlaps in aes()

Density Plot of Promoter binding events

make a density plot of number of binding events at a given promoter.
what DNA binding proteins did not overlap at any promoter?
what promoters never bind

Clean up to final .RMD and figure quality pdf

09_sweep

09 : GOOD SHAPE
~56 the plot is a bit unwieldy just organize into columns of high, med and off
~84 Chi squared warning !
"Warning messages:
1: In chisq.test(df1) : Chi-squared approximation may be incorrect
2: In stats::chisq.test(x, y, ...) :
Chi-squared approximation may be incorrect
3: In chisq.test(df1) : Chi-squared approximation may be incorrect
4: In stats::chisq.test(x, y, ...) :
Chi-squared approximation may be incorrect

CHROMOSOME NAME :)

We have start values for promoters but no chromosome value :) probably need to add in 01 and repopulate across all directories :)

01 sweeping

Here are the minor remaining issues for 01 after careful curration :)

~# means that is about what line it's on -- they should be pretty accurate

~60: quartile of 250 peaks filter.
~120: ks.test doesn't run
~130 we could add peak widths from here?
~196 browser example of longest width window for RFX1
~255 The figure of peaks vs overlap for all promoters has intercept of 1,200 peaks?
~308 Print out min, max, median, mean, range:) of the #DBPs per promoter
LAST chunk doesn't run:
error:
333: Error: Failed to create output due to bad names.

  • Choose another strategy with names_repair

Permutation analysis of repeat classes and families

Perform a genome wide permuation null distribution of binding events at each TE family and class.
Fisher Exact test of observed versus null.
Calculate P-value for each DNA binding factor
Calculate Z-score for each DNA binding factor
Make a clustered (family / class) heat map of Zscores

Reservoirs with Pol II versus non Pol II

Similar chi-squared test now test

All ghosts versus
those with pol II and those with out pol II

Pol II Reservoir - vs- non Pol II reservoir
Bound -vs- unbound

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.