

A report generation tool with code-mixed translation, built on a neural machine learning model.

DengL – Breaking down language barriers

Challenge

The challenge was set by the company Knowron during the TUM.ai Makeathon 2023.

Make integration into the job market easier for immigrant workers by allowing them to document their work without speaking the native language of the country.

The idea is to create more opportunities and break down the language barrier in the job market.

Create an efficient and effective process for report creation that incorporates notes and voice memos using NLP to enable talents worldwide to collaborate.

The generated reports must not contain hallucinations.

Usage

The service developed during the Makeathon can be used for free on a privately hosted website.

Installation

In addition to cloning this repository, you need to install a recent release of wkhtmltopdf, which is used for automatic PDF generation.
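To check that wkhtmltopdf is installed correctly, a small script along the following lines may help. This is only a sketch: the function names (find_wkhtmltopdf, build_pdf_command, html_to_pdf) are hypothetical and are not taken from this repository, and the script assumes the binary is reachable on PATH under the name wkhtmltopdf.

```python
import shutil
import subprocess
from typing import List, Optional


def find_wkhtmltopdf() -> Optional[str]:
    """Return the full path of the wkhtmltopdf binary, or None if it is not on PATH."""
    return shutil.which("wkhtmltopdf")


def build_pdf_command(html_path: str, pdf_path: str, binary: str = "wkhtmltopdf") -> List[str]:
    """Build the basic command line: wkhtmltopdf <input.html> <output.pdf>."""
    return [binary, html_path, pdf_path]


def html_to_pdf(html_path: str, pdf_path: str) -> None:
    """Convert an HTML file to PDF, failing early if wkhtmltopdf is missing."""
    binary = find_wkhtmltopdf()
    if binary is None:
        raise RuntimeError("wkhtmltopdf was not found on PATH; install it first")
    # check=True raises CalledProcessError if the conversion fails.
    subprocess.run(build_pdf_command(html_path, pdf_path, binary), check=True)


if __name__ == "__main__":
    path = find_wkhtmltopdf()
    print("wkhtmltopdf found at:", path if path else "not installed")
```

Running the script directly only reports whether the binary was found; html_to_pdf can then be called with a path to an existing HTML file (hypothetical file names such as report.html and report.pdf).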

Todo

Train a custom translator with a code-mixed, business domain corpus dataset:

  • When a sentence mixes several languages, the language detector assigns it the language that makes up the largest share of the sentence. If the sentence is then translated into that detected language, the translator returns the original sentence with the parts in the other languages left untranslated, because it treats the sentence as already being in the target language. A new model or a different language detection algorithm is needed to avoid this problem.

  • Mixing languages with fewer speakers leads to more frequent errors, especially when languages with different word order are mixed.
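The detection problem described above can be illustrated with a small sketch. It is not the project's implementation: the word list LEXICON and all function names are hypothetical, and a real system would use a trained language identifier instead of a lookup table. The sketch tags each token with a language, picks the majority language the way a sentence-level detector would, and lists the tokens that would stay untranslated if the majority language were also the translation target.

```python
from collections import Counter

# Hypothetical toy lexicon for illustration only.
LEXICON = {
    "the": "en", "pump": "en", "is": "en", "broken": "en", "today": "en",
    "die": "de", "pumpe": "de", "ist": "de", "kaputt": "de", "heute": "de",
}


def token_languages(sentence):
    """Tag each whitespace-separated token with a language code or 'unknown'."""
    return [(tok, LEXICON.get(tok.lower(), "unknown")) for tok in sentence.split()]


def language_shares(sentence):
    """Return the share of known tokens per language, e.g. {'de': 0.6, 'en': 0.4}."""
    counts = Counter(lang for _, lang in token_languages(sentence) if lang != "unknown")
    total = sum(counts.values())
    return {lang: n / total for lang, n in counts.items()} if total else {}


def majority_language(sentence):
    """Mimic a sentence-level detector: pick the language with the highest share."""
    shares = language_shares(sentence)
    return max(shares, key=shares.get) if shares else "unknown"


def foreign_tokens(sentence, target):
    """Tokens in a known language other than the target; these still need translating."""
    return [tok for tok, lang in token_languages(sentence) if lang not in (target, "unknown")]


if __name__ == "__main__":
    sentence = "Die Pumpe ist broken today"
    detected = majority_language(sentence)
    print(detected)                            # de
    print(foreign_tokens(sentence, detected))  # ['broken', 'today']
```

Working at the token level rather than the sentence level makes the mixed parts visible, so they could be passed to the translator separately instead of being skipped.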

Authors

Lukas Mahr

You Sun Song

Felix Bastian

Felix Waiblinger

For support, feel free to reach out to [email protected]
