dstoekl Goto Github PK
Name: Daniel Stoekl
Type: User
Company: EPHE, PSL
Location: Paris
Name: Daniel Stoekl
Type: User
Company: EPHE, PSL
Location: Paris
Simple Python library for doing (multiple) sequence alignment
Pre-trained BERT Models for Ancient and Medieval Greek, and associated code for LaTeCH 2021 paper titled - "A Pilot Study for BERT Language Modelling and Morphological Analysis for Ancient and Medieval Greek"
Restoring ancient text using deep learning: a case study on Greek epigraphy.
TensorFlow code and pre-trained models for BERT
Web application to perform clustering of text data on LXX and SBL Greek New Testament
The repository contains scripts for parsing and analyzing Hebrew texts.
Tools for initial data processing to populate database.
Submission to the ICDAR2017 Competition on the Classification of Medieval Handwritings in Latin Script
A selectional auto-encoder approach for document image binarization
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
handwritten text recognition on IAM handwriting dataset
The ISRI Analytic Tools for OCR Evaluation, now with UTF-8 support!
Restoring and attributing ancient texts using deep neural networks
Interactively edit individual DCT blocks in any JPEG image in the browser.
This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they go with. Some pages have interspersed English words; still others have tables with a lot of numeric data. In addition, there are old pages containing either a lot of broken characters or many words with two or more characters merged into a single connected component.
OCR engine for all the languages
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
A Python Library for Document Layout Understanding
This course covers how you can use NLP to do stuff.
Open Scriptures Hebrew Bible
Notebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities, June-July 2020
Publication-ready NN-architecture schematics.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.