Giter Club home page Giter Club logo

final-project-level3-cv-13's Introduction

웹캠을 이용한 가상 마우스

프로젝트 소개

개요

  • 웹캠을 통해 사용자의 손동작을 인식하여 마우스 커서 조작
  • 목적
    • 마우스 또는 터치스크린 기기의 조작 방식 대체
      • 코로나 등의 질병 전염을 줄이기 위해 키오스크 등의 터치 스크린 기기의 위생 개선
      • 실시간으로 컨텐츠 제작과 컴퓨터 조작을 동시에 수행해야하는 1인 컨텐츠 제작자들의 편의성 증진

흐름도

사용

설치

# 레포지터리 클론
git clone https://github.com/boostcampaitech3/final-project-level3-cv-13.git
cd final-project-level3-cv-13

# 가상환경 생성
virtualenv virtual_mouse --python=3.8
source virtual_mouse/bin/activate

# 의존성 설치 및 플러그인 충돌 우회
pip install -r requirements.txt
mv virtual_mouse/lib/python3.8/site-packages/cv2/qt/plugins/ virtual_mouse/lib/python3.8/site-packages/cv2/qt/plugins_/

# 실행
python3 main.py

GUI 구성

손동작 종류

이동 클릭 우클릭 None
  • 드래그: 클릭 상태 1초간 유지

모델

키포인트 디텍션 모델

  • Top-down heatmap based hand keypoint estimation with MobileNetV3-Large backbone
  • FreiHAND dataset으로 학습 후 CV7AHAND dataset으로 재학습
  • CV7AHAND: 사용환경에 적합하게 직접 제작한 데이터셋

제스쳐 분류 모델

  • Three-layer MLP - Sequential(Linear, BN, ReLU, Linear, BN, ReLU, Linear, Sigmoid)
  • Input shape: (BATCH_SIZE, 42), Output shape: (BATCH_SIZE, 4)
  • 손동작을 취하면서 직접 제작한 데이터셋으로 학습

팀 CV7Λ

T3049 T3114 T3165 T3209 T3216
김재훈 송진우 이종민 조정빈 천영호

참고

final-project-level3-cv-13's People

Contributors

canlion avatar jongminzzang97 avatar star-bits avatar youngho97 avatar

Stargazers

 avatar

Forkers

canlion youngho97

final-project-level3-cv-13's Issues

mediapipe를 이용한 프로토타입 작성

내용

  • mediapipe를 이용한 간단한 프로토타입 추가
  • 설정된 영역 내의 검지의 이동에 따라 마우스 조작
  • 실행
    • pip3 install opencv-python pyautogui mediapipe
    • python3 prototype/test.py

문제점

  • 손을 정지한 상태에서도 인퍼런스 결과가 진동하여 마우스가 심하게 떨림
    • 떨림을 보정할 수 있는 알고리즘이 필요할 것으로 생각됨.
  • 간혹 인퍼런스 결과가 좋지 않음. 더 좋은 성능의 모델 필요

Welcome update to OpenMMLab 2.0

Welcome update to OpenMMLab 2.0

I am Vansin, the technical operator of OpenMMLab. In September of last year, we announced the release of OpenMMLab 2.0 at the World Artificial Intelligence Conference in Shanghai. We invite you to upgrade your algorithm library to OpenMMLab 2.0 using MMEngine, which can be used for both research and commercial purposes. If you have any questions, please feel free to join us on the OpenMMLab Discord at https://discord.gg/amFNsyUBvm or add me on WeChat (van-sin) and I will invite you to the OpenMMLab WeChat group.

Here are the OpenMMLab 2.0 repos branches:

OpenMMLab 1.0 branch OpenMMLab 2.0 branch
MMEngine 0.x
MMCV 1.x 2.x
MMDetection 0.x 、1.x、2.x 3.x
MMAction2 0.x 1.x
MMClassification 0.x 1.x
MMSegmentation 0.x 1.x
MMDetection3D 0.x 1.x
MMEditing 0.x 1.x
MMPose 0.x 1.x
MMDeploy 0.x 1.x
MMTracking 0.x 1.x
MMOCR 0.x 1.x
MMRazor 0.x 1.x
MMSelfSup 0.x 1.x
MMRotate 1.x 1.x
MMYOLO 0.x

Attention: please create a new virtual environment for OpenMMLab 2.0.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.