Sivakumar Krishnamoorthy's Projects
This repository will help you learn Databricks concepts through examples. It covers the important topics a data engineer needs in real-world work. We will be using PySpark and Spark SQL for development. At the end of the course we also cover a few case studies.
Course on NGS data processing: genomics and transcriptomics
Applied Python Programming for Life Scientists
A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin
Azure Databricks - Advent of 2020 Blogposts
Nextflow tutorial for the BADAS series at NYU Center for Genomics and Systems Biology
Create a BAI file from the respective BAM file
Course Files for Complete Python 3 Bootcamp Course on Udemy
Tutorials accompanying the "data science roadmap" picture.
This Word document contains a collection of useful databases
Collection of Sample Databricks Spark Notebooks (mostly for Azure Databricks)
Collection of Databricks and Jupyter Notebooks
Data Science Repo and blog for Johns Hopkins Coursera Courses. Please let me know if you have any questions.
Open Source Data Science Resources.
Examples of using deep learning in Bioinformatics
Filter DE genes based on log2 fold change, FDR value, or both
List of Data Science Cheatsheets to rule the world
Source code repository for "Quantitative analysis of C. elegans transcripts by Nanopore direct-cDNA sequencing reveals terminal hairpins in non trans-spliced mRNAs. Bernard, et al. 2022"
Exon BED Generator: A script to download, process, and generate a BED file with exon coordinates for Homo sapiens from a GTF file
ExUTR is a practical and powerful tool that enables rapid genome-wide 3'-UTR prediction from massive RNA-Seq data
🗂 The perfect Front-End Checklist for modern websites and meticulous developers
Gene ID Retrieval from TAIR: R code to retrieve gene IDs from TAIR (The Arabidopsis Information Resource) and export them as CSV files
Gene Info Extractor: A Python script to extract gene information and annotations from CSV data
Compressor for genomic files (FASTQ, SAM/BAM, VCF, FASTA, GVF, 23andMe...), up to 5x better than gzip and faster too
A Python script to filter and extract information from GTF files based on chromosome names, designed to be easily accessible for biologists without extensive programming experience.
This file gives an overview of how to choose an appropriate statistical test