Giter Club home page Giter Club logo

ml-projects's Introduction

Notable A.I. and Machine Learning Projects

Table of Contents

Papers (Unpublished Manuscripts)


  1. Towards Neural Ranking for Mixed-Initiative Conversational Search

    • Abstract: Recent research has shown that integrating clarifying questions and answers into ranking models offers the potential to better understand users’ information needs and improve document ranking. However, previous approaches only used naive ranking models (i.e. QL and BM25) so far and neural rankers remain unexplored. At the same time, neural ranking models dominate leaderboards for single-shot query tasks and bring interesting features that should also be advanta- geous in a conversational setup. In this work we explore how neural rankers can be extended to effectively represent clarifying question and answer in addition to the initial user query. To this end, we first try to extend conventional neural ranking models ConvKNRM and PACRR by naively aggregating FastText word embeddings. We then investigate whether contextualized word embeddings given by BERT are able to incorporate clarifying questions and answers more effectively and outperform these baselines. Lastly, we ana- lyze how our models perform on different answer polarities (affirmation, negation, I don’t know and other).

    • Code
    • Paper

  2. Transparency in Deep Learning Using Hierarchical Prototypes

    • Abstract: Neural Networks are highly effective and widely used algorithms for classification. However, it is often difficult to interpret why these models make certain predictions. Earlier work uses the notion of prototype layers to allow for easier, visual interpretation of the network’s predictions. We extend this prototype model with a hierarchical prototype model, introducing sub- and superprototype layers. These layers enable the model to visualize a number of superprototypes equal to the number of superclasses while simultaneously allowing the model to infer and visualize the latent subclasses present in the data. This extension does not sacrifice accuracy, achieving 99% accuracy on an MNIST classification task. We have found that this model is indeed able to find and visualize general superprototypes and more specific subprototypes. Ultimately, we argue that this model can also be used in the pipeline of debiasing data and subsequent predictions.

    • Code
    • Paper

Professional Projects


  1. Booking forecasting and dynamic pricing for hotels and tourist attractions

    • Description: This project involved forecasting the number of bookings for hotels and tourist attractions in a highly multivaried and dynamic environment. Additionally, we utilized the insights to implement an intelligent dynamic pricing strategy. The project consisted of multiple automated machine learning pipelines for data preperation, feature engineering, training, scoring and reporting. Services that were used included customized machine learning models, AutoML, and distributed processing and optimization.

    • Key topics: Time series forecasting, Machine Learning cloud infrastructure, Recurrent neural networks, Darts TimeSeries, PyTorch, gradient boosting, Feature engineering, SQL, Spark, Distributed computing, Bayesian Optimization

    • The code is not publicly available due to legal reasons.
  2. Hand animation projection

    • Description: The goal of this project was to display video on the hands of a person using a projector. The video was displayed on the hands by using a custom kalman filter to smooth the hand detections from a pre-trained hand detector.

    • Key topics: Kalman Filter, Noisy Signal Processing, Computer Vision, Geometry.

    • Code
    • Video

  3. Fillable field detector for forms

    • Description: The goal of this project was to detect fillable fields PDF on forms. The fillable fields were detected by using a customized YOLOX object detection network. Additionally, the detected fields were classified depending on input type.

    • Key topics: Multi-Class Object Detection, YOLO network, ONNX, PyTorch, Human in loop automated labeling

    • drawing


    • The code is not publicly available due to legal reasons.

  4. Icelandic smart assistant voice commands for smart devices

    • Description: The goal of this project was to integrate voice control for smart devices into the Icelandic smart assistant Embla. The functionality was added into the Icelandic natural language processing engine Greynir. This functionality was then used to extend the capabilities of Embla.

    • Code: Greynir Server
    • Code: Embla App
    • Video

Academic Projects


  1. Thesis: Semantic Segmentation Under Realistic Constraints When Ground Truth Is Inconsistent, Unbalanced and Sparse

    • Description: Semantic segmentation conventionally requires large scale annotated datasets for training that consist of accurate, balanced, and abundant ground truth annotations. Mixing-based semi-supervised learning methods were previously shown to acquire competitive performance when access to labeled data was limited. However, these methods had commonly been applied on datasets that consisted of reliable and balanced ground truth annotations. Along with this, for previously used datasets, acquiring reasonable synthesized images could be achieved using relatively naive mixing strategies. My thesis explored how mixing-based semi-supervised learning could be used for improved image based poultry quality control using a dataset that contained inconsistent, unbalanced and sparse ground truth annotations. Previous work had applied these methods on complex large scale architectures that were not deployable on commonly used edge devices. To account for realistic hardware constraints and to broaden the range where these methods could be utilized, the FC-HarDNet architecture was incorporated. The weighted cross-entropy loss along with new methods of weighting the consistency regularization loss term which better accounted for high class imbalance was also introduced. Finally, selective methods of mixing which enforced more consistent and realistic synthesized images was also introduced to acquire improved performance. The results showed that the proposed methods were able to surpass the performance of any previous methods that had been used by the company. The thesis was done in collaboration with the machine learning team at the multi-national food processing company Marel.

    • The thesis and the code are not publicly available due to legal reasons.

  2. 2D to 3D facial reconstruction

    • Description: Implementation of a 2D to 3D facial reconstruction system from monocular images using a combination of statistical PCA model and energy minimization. This was done as a part of a university project.

    • Code

  3. Structure-from-motion

    • Description: Implementation of a structure-from-motion algorithm to recover three-dimensional structures from 2D images. This was done as a part of a university project.

    • Code

  4. N-step bootstrapping in actor critic methods

    • Description: Blog post with the implementation and analysis of the n-step bootstrapping in actor critic methods. We will analyzed what impact the hyperparameters from GAE could have on the optimality and the convergence-speed of the algorithm and compared it to the Monte-Carlo method (REINFORCE). We will also investigated what impact the aforementioned measurements using Generalized Advantage Estimation had over vanilla Advantage Estimation.

    • Code
    • Blog post

  5. Subreddit sentiment analysis relationship visualization

    • Description: Visual analytics system that allowed users to navigate through and gain insight from a data set of mobilisations between different internet communities. The system consisted of a network graph and sunburst for navigations, and a recommender system and community clustering to see how the communities relate to each other. This was done as a part of a university project.

    • Code
    • Final Report
    • Video

Side projects


  1. MuZero Chess

    • Description: Implemented a custom chess environment for the existing MuZero repository

    • Code

  2. Graph Convolutional Networks

    • Description: Implementation of graph convolutional network with both standard graph convolution and gated attention. This was primarely implemented to help me learn about node and graph classification.

    • Code

  3. Transformer Network

    • Description: Implementation of a transformer network. This was primarely implemented to help me learn about pytorch lightning and transformers since they were relatively new while I was attending university.

    • Code

  4. Snake bot

    • Description: Implementation of a snake bot using reinforcement learning along with a custom environment. This was primarely implemented for fun and to also to get better on-hands experience with Deep Q-learning, REINFORCE and PPO.

    • drawing


    • Code

ml-projects's People

Contributors

hinriksnaer avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.