dzianis-pirshtuk Goto Github PK
Name: Dzianis Pirshtuk
Type: User
Location: Minsk, Belarus
Name: Dzianis Pirshtuk
Type: User
Location: Minsk, Belarus
My bachelor's degree thesis (with code and experiments) on sentiment classification of Russian texts using Bi-RNN with attention mechanism.
Researches confirms that social media provides good insights on what people think, feel, concern, etc. It is expected that those insight mined from Twitter data has potential to support a better decision-making, especially in public sectors. Public sector wants to know local’s insight level; therefore they need to make sure they use the conversation from residents. However, the ground truth shows that tweets are mixed from the residents and tourist. This study investigates the best automatic fashion model to classify tweets posted by resident and tourist, in NTB. Indonesia. To do so, several consecutive phases were conducted. Those are pre-processing, data training, classification system, data testing, accuracy comparison, and result visualization. First of all, a Twitter dataset, which has 700,000 tweets posted by approximately 26,000 users in Nusa Tenggara Barat, Indonesia was prepared. The dataset divided into two sets, tweets from 4,000 users for data training and 22,000 users for data testing. Then, three popular classification algorithms were applied to the datasets. There are Multinomial Naïve Bayes, Support Vector Machines and Decision Tree. After that, 7 features are created. There are Bag of Words, Normalizer location, Total Tweet, Total Day, Tweet per Day, Total Location and Location per Day. Experiment shows that Multinomial Naïve Bayes with Bag of Words feature has 86% accuracy, while the rest of features give less than 65% accuracy. This is different with Support Vector Machines and Decision Tree results. These two algorithms produce better accuracy results excluding Bag of Words feature. It implies that Support Vector Machine and Decision Tree are more powerful when processing numerical value. However, among all classification system, Multinomial Naïve Bayes still being the most accurate algorithm for the model.
Automatically activate a conda environment when entering folders/project.
A WebRTC signaling server with support of MQTT and WebSocket as transport protocols, token based authentication (JSON Web Token) and external policy based authorization.
Example of solving a problem in predictive analytics for marketing (banking case)
The open source core of the GraphLab ML library
Deep Face Generation and Editing: A Survey
:panda_face: One Millisecond Deformable Shape Tracking Library (DEST)
Criteo/Kaggle Competition of CTR prediction
caffe 68 points face-landmark
EECS 498 (Intro. to Information Retrieval) Final Project: Text classification applied to social media
Library for guessing a person's gender by their first name.
geonamescache - a Python library for quick access to a subset of GeoNames data.
GitLab is version control for your server
The Social Harvest server that exposes an API and harvests data from the web to be analyzed.
Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge
Winning solution for the Galaxy Challenge on Kaggle (http://www.kaggle.com/c/galaxy-zoo-the-galaxy-challenge)
Software for the kaggle criteo challenge
VK-KittenPHP/DB/Engine suite
Loan Default Prediction at Kaggle
Code for building a co-occurence data set from instagram
The code I used to get in the top #150 in the Netflix Prize
Open standard for machine learning interoperability
Multilingual text (NLP) processing toolkit
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.