Name: Srivatsan Ramanujam
Type: User
Company: Meta
Bio: Engineering Manager, ML@Meta.
Previously, ML at Salesforce Einstein, Pivotal, Sony, IBM Almaden Research, and UT Austin CS
Location: San Francisco
Blog: vatsan.github.io
Srivatsan Ramanujam's Projects
CMU ARK Twitter Part-of-Speech Tagger
OpenChorus allows customers, partners, developers, and data scientists to collaboratively realize the potential of Big Data.
Buildpack for Conda.
Boilerplate code for flask apps on PCF that interact with a backend environment (ex: Pivotal BDS or ElephantSQL).
Miscellaneous code related to extracting & processing Twitter streams from GNIP
A PL/Java Wrapper on Ark-Tweet-NLP (http://www.ark.cs.cmu.edu/TweetNLP/) - Twitter Parts-of-speech tagger in Postgres/Greenplum
A place for all things related to using the Greenplum Database with R
Temporary home for data processing/machine learning SQL snippets on Greenplum/HAWQ
Collection of Jupyter notebook templates to work with Greenplum/HAWQ/PostgreSQL
In-database parallel grid-search for XGBoost on Greenplum
Pivotal Greenplum Database
Mirror of Apache MADlib (Incubating)
Ipython Notebook samples
Implementing a ChatGPT-like LLM from scratch, step by step
Open-source library for scalable in-database analytics.
Python Meta Programming
nimfa - A Python Library for Nonnegative Matrix Factorization Techniques
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Invoke Pandas plotting by piping in SQL output via PSQL (Can be used with Postgres or Greenplum or any SQL engine).
PDL Tools is a library of reusable tools used and developed by the Pivotal Data Science and Data Engineering teams.
PL/Java is a free add-on module that brings Java™ Stored Procedures, Triggers, and Functions to the PostgreSQL™ backend.
Scalable in-database machine learning with PL/Python: Postgres Open SV 2017 talk
An introduction to Bayesian methods + probabilistic programming in data analysis with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)
Pure Python API for Maxmind's binary GeoIP databases