hayj Goto Github PK
Name: hayj
Type: User
Name: hayj
Type: User
This tool recognize 404 error according to the html content
This tool allow users to labelize data one by one with a tkinter UI. You just need to give a set of label type and a set of data to be labelized. All labels will be stored either in a pickle file or a mongo collection.
This repository gather functions and classes allowing to apply a highly scalable and efficient author filtering process on any corpus.
Some useful bash functions
Library of Java tools including basics
Student projet at CentraleSupélec
The class MongoCollection allow an easy config of a MongoDB collection by providing an interface which handle authentication, indexes management, data conversion and pretty print of collections. It can work like a Python dict if you give at least one index.
Some usefull data structures
This repository provide some useful Python data structures, especially SerializableDict
This project gathers useful modules on url parsing, csv reading, html parsing etc.
Python utils for data visualization (bokeh, pandas...)
DeepStyle provides pretrained models aiming to project text in a stylometric space. The base project consists in a new method of representation learning and a definition of writing style based on distributional properties. This repository contains datasets, pretrained models and other ressources that were used to train and test models.
This tool detect duplicates over web pages of a domain to control crawling process. It prevent the crawl of captcha pages or "refuse" page for example.
This tool can recognize honeypot urls using selenium to prevent bot detection
Distributed Asynchronous Hyperparameter Optimization in Python
Hyper-parameter optimization for sklearn
Un outil permettant de convertir les relevés de compte PDF de La Banque Postale en fichier CSV lisibles dans un tableur.
This is a minimal acyclic finite-state automata algorithm in Java based on the paper, "Incremental Construction of Minimal Acyclic Finite-State Automata".
Provide some useful tools for machine learning
Convert a mardown file to a html file with a given css style (or a default one)
A moodle module for learning language
This tool is useful to detect news URLs. It also aggregates several libraries which scrap news web pages (title, content...).
Provide useful NLP tools to get word embeddings, preprocess text data...
Py4J enables Python programs to dynamically access arbitrary Java objects
This repository allows to simulate a Renewal competition
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.