Name: Jimmy Lin
Type: User
Company: University of Waterloo
Bio: I profess to know very little at the University of Waterloo. I used to write code for Twitter and slides for Cloudera.
Twitter: lintool
Location: Nearby data lake
Blog: https://cs.uwaterloo.ca/~jimmylin/
Jimmy Lin's Projects
Maven repo for some Anserini dependencies.
The Art and Science of Empirical Computer Science (Fall 2022)
The Art and Science of Empirical Computer Science (Fall 2023)
The Archives Unleashed Toolkit is an open-source platform for analyzing web archives.
Reference implementations of data-intensive algorithms in MapReduce and Spark
Datasets for Bespin
Document retrieval using brute force scans
Scrapes citation statistics from Google Scholar
CS 489/698 Big Data Infrastructure (Winter 2016) at the University of Waterloo
CS 489/698 Big Data Infrastructure (Winter 2017) at the University of Waterloo
CS 451/651 Data-Intensive Distribute Computing (Fall 2018) at the University of Waterloo
CS 451/651 431/631 Data-Intensive Distribute Computing (Winter 2018) at the University of Waterloo
Question answering over knowledge graphs
Implementations of brute force scans for document retrieval in C
Brute force scan in C
Cassovary is a simple big graph processing library for the JVM
Performance comparison between Cassovary and GraphJet
Internet Archive "Save a Page" Plug-In for Chrome
Packaged CRX distribution for Internet Archive "Save a Page" Plug-In
Google Scholar Search Extension for Chrome
Chrome CRX packages for the Google Scholar Search Extension
Cloud9 is a Hadoop toolkit for working with big data
Hadoop tools for manipulating ClueWeb collections
learning-to-rank dataset extracted from ClueWeb09 using TREC judgments
Webgraph for ClueWeb09 Category B
Metadata for 108th United States Congress
List of people with great achievements in Computer Science
Visualizations of top Canadian universities for AI research by CSRankings
Converting the Enron email collection to mbox format
Giraph Tutorial