Oumaima Hourrane's Projects
Dataset for AfriSenti-SemEval
Instruct-tune LLaMA on consumer hardware
Topic modeling using a combination of BERT and LDA.
Experimental closed-domain QA pipeline for building an application that lets users ask open questions about UN project documents.
Creating the Moroccan Darija Language Model
Must-read papers on graph neural networks (GNN)
PyTorch implementation of Graph Transformer for Cross-Lingual Plagiarism Detection (in progress)
Notebooks for the Practicals at the Deep Learning Indaba 2022.
Machine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School
Moroccan NLP Datasets and Corpora
Some known ML architecture implementations with PyTorch, TensorFlow, and JAX for reproduction and application reuse
Working repo for building a set of models that automate the classification of project log-frames into a comprehensive taxonomy. Data cannot be pushed yet.
My personal repository
Mr. Green is a multilingual theme generated with Jekyll and fully compatible with GitHub Pages.
A set of different solutions for classifying small sets of long documents (SDLD) into a large number of co-dependent labels.
A PyTorch implementation of TopicTransformer for language modeling
Get started with: Decision Tree, Random Forest, and XGBoost
Mediterranean Machine Learning school tutorials
Welcome to WordCraft, a comprehensive journey into the world of Natural Language Processing (NLP)