yngcan / jupyter-portfolio

This project forked from tyler-seymour/jupyter-portfolio


A collection of Python projects relating to natural language processing, including computational linguistics, network graph analysis, and patent analytics. Live examples are hosted on my JupyterHub and demonstrate some of my favorite libraries, including spaCy, Pandas, NetworkX, Gensim, and TextBlob. Also includes a sprinkle of blockchain and my published papers.

Home Page: https://tylerseymour.pw/user-redirect/notebooks/README.ipynb

Jupyter Notebook 88.86% HTML 11.14% TeX 0.01%

jupyter-portfolio's Introduction

Tyler Seymour's Portfolio

This server hosts live demos of my Python code in Jupyter Notebooks. Click on a project below to get started!

Python & NLP

Improving Part-of-Speech and Dependency Tagging of Pre-Modern Literary Chinese Texts. Modern NLP models are improved using linguistic annotations hand-labeled by a subject matter expert.

  • spaCy as the part-of-speech tagging and named entity recognition engine;
  • zh_core_web_sm for modern Chinese language spaCy models and POS tags;
  • Pandas for building the data structure of linguistic embeddings;
  • displaCy for visualizing part-of-speech and dependency tags.
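The idea of correcting model output with expert labels can be sketched in a few lines. This is only a minimal illustration using the standard library, not the notebook's code: the tokens, the "model" tags, and the corrections are all made up for the example, and the helper names `apply_corrections` and `agreement` are hypothetical. In the actual project the tags would come from spaCy and the rows would live in a Pandas DataFrame.

```python
# Illustrative (token, tag) pairs standing in for model output.
# These are NOT real zh_core_web_sm predictions.
model_tags = [
    ("學", "VERB"),
    ("而", "CCONJ"),
    ("時", "NOUN"),
    ("習", "VERB"),
    ("之", "PART"),
]

# Hypothetical expert corrections, keyed by token position.
expert_corrections = {2: "ADV", 4: "PRON"}


def apply_corrections(tagged, corrections):
    """Return a new tag sequence where expert labels override model labels."""
    return [(tok, corrections.get(i, tag)) for i, (tok, tag) in enumerate(tagged)]


def agreement(tagged, corrected):
    """Fraction of tokens whose model tag matches the corrected tag."""
    matches = sum(a[1] == b[1] for a, b in zip(tagged, corrected))
    return matches / len(tagged)


gold = apply_corrections(model_tags, expert_corrections)
score = agreement(model_tags, gold)
```

Here `score` measures how often the unmodified model agrees with the expert on this toy sentence, which is one simple way to track whether retraining on the corrected labels helps.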

OSINT workflow for investigating eCommerce fraud networks. Maps buyers to sellers using publicly available feedback and reviews, allowing investigators to track fake review networks and money laundering paths between usernames.

  • NetworkX for constructing the network graph and calculating the shortest path between nodes;
  • Pandas for scraping and cleaning transaction reviews;
  • Matplotlib for static node and edge visualization;
  • pyVis for interactive network graph visualization.
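As a rough sketch of the buyer-to-seller mapping described above, the snippet below builds an undirected graph from feedback pairs and finds the shortest chain of usernames between two accounts. It deliberately uses a plain breadth-first search from the standard library so it runs without dependencies; with NetworkX the equivalent lookup would be `nx.shortest_path(G, source, target)`. The usernames and review pairs are invented for illustration.

```python
from collections import deque

# Hypothetical feedback records: (buyer, seller) pairs.
reviews = [
    ("buyer_a", "seller_x"),
    ("buyer_b", "seller_x"),
    ("buyer_b", "seller_y"),
    ("buyer_c", "seller_y"),
    ("buyer_d", "seller_z"),
]


def build_graph(pairs):
    """Build an undirected adjacency map linking each buyer to each seller reviewed."""
    graph = {}
    for buyer, seller in pairs:
        graph.setdefault(buyer, set()).add(seller)
        graph.setdefault(seller, set()).add(buyer)
    return graph


def shortest_path(graph, source, target):
    """Breadth-first search; returns the list of nodes on a shortest path, or None."""
    if source not in graph or target not in graph:
        return None
    previous = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            path = []
            while node is not None:
                path.append(node)
                node = previous[node]
            return path[::-1]
        for neighbor in sorted(graph[node]):
            if neighbor not in previous:
                previous[neighbor] = node
                queue.append(neighbor)
    return None


graph = build_graph(reviews)
path = shortest_path(graph, "buyer_a", "buyer_c")
```

Two buyers who never traded directly can still be linked through shared sellers, which is the kind of indirect connection an investigator would look for. Accounts in separate components, such as `buyer_a` and `buyer_d` here, yield `None`.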

⚡️ eBay Patent Maintenance Report | Patent Classification Models ⚡️

Natural language processing and topic modeling techniques are used to cluster 1000 patent abstracts using Latent Dirichlet Allocation ("LDA").

  • Gensim for TF-IDF and Bag-of-Words/LDA models;
  • TextBlob for part-of-speech tagging and noun extraction;
  • pyLDAvis for interactive intertopic distance map.
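To make the Bag-of-Words and TF-IDF steps concrete, here is a small standard-library sketch. It uses the textbook formula (term frequency times the natural log of inverse document frequency) on three made-up abstracts; it is not the project's Gensim pipeline, and Gensim's own weighting and normalization defaults may differ. The LDA step itself is not shown.

```python
import math
from collections import Counter

# Made-up abstracts for illustration only.
abstracts = [
    "method for ranking search results using user feedback",
    "system for processing online payment transactions",
    "method for detecting fraud in payment transactions",
]


def bag_of_words(text):
    """Count how often each whitespace-separated token occurs."""
    return Counter(text.lower().split())


def tf_idf(docs):
    """Return one {term: weight} dict per document using tf * ln(N / df)."""
    bows = [bag_of_words(d) for d in docs]
    n_docs = len(bows)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for bow in bows for term in bow)
    weights = []
    for bow in bows:
        total = sum(bow.values())
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in bow.items()
        })
    return weights


weights = tf_idf(abstracts)
```

A word such as "for", which appears in every abstract, gets a weight of zero, while terms unique to one abstract are weighted highest. That down-weighting of common words is what makes the downstream topic clusters more distinctive.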

Blockchain Projects

⚡️StackBlitz | NFT Token Tracker | IP Asset Schema ⚡️

Decentralized application for minting intellectual property licenses on the Ethereum blockchain. Issues open source (e.g., Creative Commons, MIT) or proprietary IP licenses as ERC-721 non-fungible tokens that are transferable peer-to-peer. Deployed to Ropsten test network.

  • 0xCert asset management framework and smart contracts;
  • Flask for serving metadata API;
  • NES.css for frontend design and interface.
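For a sense of what a metadata API for such tokens might return, the sketch below builds a JSON document with the `name`, `description`, and `image` fields from the ERC-721 metadata JSON schema. Everything else is an assumption made for illustration: the `license` object, the function name `build_license_metadata`, and the example URL are hypothetical and are not taken from the project's IP Asset Schema or from 0xCert. A Flask route would simply return this payload.

```python
import json


def build_license_metadata(token_id, work_title, license_name, license_url):
    """Assemble token metadata; the 'license' field is a hypothetical extension."""
    return {
        # Fields defined by the ERC-721 metadata JSON schema.
        "name": f"{work_title} ({license_name})",
        "description": f"IP license token #{token_id} issued under {license_name}.",
        "image": "",
        # Hypothetical extra field, not part of the ERC-721 schema.
        "license": {"name": license_name, "url": license_url},
    }


payload = json.dumps(
    build_license_metadata(1, "Example Work", "MIT", "https://example.com/licenses/mit"),
    indent=2,
)
```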

⚡️IP Gold Token | Patent Futures Exchange | Whitepaper | Enterprise | Deck ⚡️

Patent invalidation and prior art platform. Users earn IP Gold by submitting prior art that suggests an invention is not novel, or by wagering that a patent is valid or invalid. The token gives the public an incentive to take part in patent examination, and the prediction markets provide a crowdsourced forecast of patent validity while rewarding the submission of invalidating prior art that raises patent quality.
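One simple way to read a forecast out of such wagers is to take the share of the total stake placed on "valid". The sketch below does exactly that and nothing more. It is an assumption about how a forecast could be derived, not the mechanism described in the whitepaper, and the function name `implied_validity` and the stake amounts are hypothetical.

```python
def implied_validity(wagers):
    """Return the stake-weighted share of wagers on 'valid', or None if the pool is empty.

    wagers: iterable of (side, stake) with side in {'valid', 'invalid'} and stake > 0.
    """
    totals = {"valid": 0.0, "invalid": 0.0}
    for side, stake in wagers:
        if side not in totals:
            raise ValueError(f"unknown side: {side!r}")
        if stake <= 0:
            raise ValueError("stake must be positive")
        totals[side] += stake
    pool = totals["valid"] + totals["invalid"]
    if pool == 0:
        return None
    return totals["valid"] / pool


# Hypothetical wagers in IP Gold on a single patent.
wagers = [("valid", 30.0), ("invalid", 50.0), ("invalid", 20.0)]
forecast = implied_validity(wagers)
```

With 30 of 100 tokens staked on validity, the pool implies a 30% chance that the patent is valid under this reading.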

Publications & Other Work Product

Copyright Infringement
Public Records
