ProgrammerUnknown's Projects
This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs
The AI project of 22 Spring
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
AQuA: A Benchmarking Tool for Label Quality Assessment
This is the project repository of CIS5528 Project in 2023 Spring
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
A complete computer science study plan to become a software engineer.
List of Computer Science courses with video lectures.
A curated, but incomplete, list of data-centric AI resources.
Finetune Llama-3-8b on the MathInstruct dataset
This is a demo when I was grade one.
记录Learning from data一书中的习题解答
LlamaIndex is a data framework for your LLM applications
A Paper List of Low-resource Information Extraction
Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.
The homework of my course communication system simulator bsed on MATLAB simulink.
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Minimalist BERT implementation assignment for CS11-711
https://huyenchip.com/ml-interviews-book/
Automatically detect errors in annotated corpora.