movie_management_project's Introduction

Hi, I'm Jaeyoon Jung

As an undergraduate student majoring in Artificial Intelligence, I am interested in diverse research areas, including computer vision, natural language processing, and audio signal processing.
My current research interests include large language models that handle multimodal inputs and outputs (video, image, text, audio, action); adapting the instruction-following capabilities of English-based large language models to Korean in particular, as well as other languages; and reducing hallucination in large language models.

๐Ÿข Career

  • Misys Lab intern (2021-06-18 ~ 2022-08)
  • 💼 Maum AI Inc (Mindslab) AI Scientist (2023-04-26 ~ )
    • Improved RAG quality by enhancing the retriever. Training it on a high-quality dataset and selecting the best models improved Recall@K by up to 68% over its previous performance.
    • Optimized the STF (Speech-To-Face) model with ONNX and TensorRT, reducing the time to generate 1 second of video from 1.6 seconds to 0.7 seconds. This enables real-time streaming in real-world applications.
    • Worked on transferring instruction-following capabilities from English to Korean in open-source large language models. This work aims to facilitate the development of high-quality Korean language models at a low cost.
    • Experienced in training large language models (up to 70B) using DeepSpeed for multi-node training. Training used four DGX H100 systems interconnected with NVLink, each node comprising 8 NVIDIA H100 80GB GPUs.
    • https://maum-ai.github.io

๐Ÿ“ Publications

๐Ÿ™Œ EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning

Accepted to ICASSP 2024
paper | code

๐Ÿ† AI Challenge

๐Ÿฅ‡ [1st Place] in 2022 Samsung AI Challenge (3D Metrology)

Task: make an AI that produce depth map from SEM image
repo link | soongsil univ news

🥇 [1st Place] in 2023 LG DISPLAY Product Quality Classification

Task: classify product quality using tabular data from the LG Display factory
repo link | Hankyung news

🥈 [2nd Place] in 2022 LG INNOTEK Radar Performance Prediction

Task: predict radar performance using tabular data from the LG Innotek factory
repo link | YouTube link (interview)

๐ŸŽ–๏ธ [4th Place] in Monthly Dacon Computer Vision Anomaly Detection

Task: detect the anomaly samples and classify it
code link

๐ŸŽ–๏ธ [6th Place] in Monthly Dacon 3D MNIST Classification

Task: 3D MNIST Classification
repo link

๐ŸŽ–๏ธ [7th Place] in 2022 SWUNIV AI Challenge

Task: develop OCR algorithm to recognize hangul text from the image
repo link | soongsil univ news

๐ŸŽ–๏ธ [7th Place] in 2022 Dankook Univ AI Challenge

Task: predict bike sharing demand using tabular data
repo link

[Finalist] in 2023 4th Sungkyunkwan Univ Bookathon

Task: write an essay with AI (GPT-3)
repo link | Open In Colab

[Finalist] in 2022 Military AI Competition

repo link

[Finalist] in 2022 Naver AI RUSH

repo link

📒 AI Project

🥈 [2nd Prize] 2022 Soongsil Univ AI Contest

'TryYours': high-resolution virtual try-on using HR-VITON
repo link | Open In Colab

🎖️ [Participation Prize] 2021 Soongsil Univ AI Contest

'Jaeho': an AI speaker with its own name and facial expressions
repo link

💻 Algorithm Problem Solving

  • BOJ (Baekjoon Online Judge)

Solved.ac profile
