Giter Club home page Giter Club logo

whisper-website's Introduction

Website which convert speech to text by Whisper model (Official Repo)

Hosting website on localhost:

  1. Clone the repo - git clone [email protected]:Kabanosk/whisper-website.git
  2. Go to repo directory - cd whisper-website
  3. Create virtual environment - python3 -m venv venv
  4. Activate the environment - source venv/bin/activate/. venv/bin/activate
  5. Install requirements - pip install -r requirements.txt
  6. Go to src directory - cd src
  7. Run the run.py file - python3 run.py
  8. Go to your browser and type http://127.0.0.1:8000/ if the browser doesn't open

Run website on localhost with Docker

First time

  1. Install Docker
  2. Clone the repo - git clone [email protected]:Kabanosk/whisper-website.git
  3. Go to repo directory - cd whisper-website
  4. Create Docker image - docker build -t app .
  5. Run Docker container - docker run --name app_container -p 80:80 app
  6. Go to your browser and type http://127.0.0.1:80/

Next time

  1. Start your Docker container - docker start app_container
  2. Go to your browser and type http://127.0.0.1:80/

whisper-website's People

Contributors

kabanosk avatar noahlessard avatar ssummer-ui avatar zackrylangford avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

whisper-website's Issues

ImportError: cannot import name 'whisper' from 'whisper'

Thanks for doing this project. I'm looking for use website in a network server, but when I try to run main.py it says ti cannot import whisper module.
I tried this in my PC and found that mispell error causing this in main.py:

import whisper from whisper

I you chance second whisper to Capital letter ti works:

import whisper from Whisper

Whisper large-V3

Thanks for this, it is really useful. Can this support large-v3?

GUI Application?

Greetings, could you please tell me if there are plans to make a full-fledged GUI that you can run as a local program?

Internal Server Issue when trying to transcribe a file

Describe the bug
A clear and concise description of what the bug is.
For some reason, I am getting an internal server error in the content of the output file when I try to transcribe a m4a file.
OS (please complete the following information):

  • [e.g. iOS]
  • windows

Additional context
Add any other context about the problem here.
This is the error msg:
image

Enhancing GPU support, adding VTT/SRT export, and updating to Whisper 2.0

Hello,

I recently tried using your "website_for_whisper" repository and found it very helpful. I have a few suggestions and questions that I hope could improve the project:

GPU support: I noticed that the current implementation seems to be running the Whisper model on the CPU, which can be quite slow for some users. It would be great if the repository could be updated to support running the model on a GPU when available. This should improve the performance and make the transcription process faster.

VTT and SRT export: Another useful feature to add would be the ability to export the transcriptions in VTT and SRT formats. These formats are widely used for subtitles and captions, and having the option to export in these formats would be beneficial for many users.

Whisper 2.0: I saw that OpenAI has released Whisper 2.0, an updated version of the original Whisper model. I was wondering if the repository is using the latest version of the model. If not, could you please update the repository to use Whisper 2.0?

Thank you for your work on this project. I look forward to any updates and improvements you might make in the future.

Internal server issue when converting audio to text

Describe the bug
A clear and concise description of what the bug is the audio file is working ok on the browser , when the srt file download , file have a internal server error written in it

OS (please complete the following information):

  • [e.g. iOS]
    Windows

Additional context
Add any other context about the problem here.
2023-07-27 14 43 17

Run Pyhton on windows using windows_run.bat

Describe the bug
The command in windows to run python is python not python3
When using Installing python from the python.org

OS (please complete the following information):
Windows

Additional context
At file: windows_run.bat
Line 3

ffmpeg [WinError2]

Describe the bug

Screenshot 2023-01-04 155615

While taking input in a windows environment you get an [winerror2] file not found error : you can solve this error by making the
subprocess.Popen( ...,shell=True )
shell variable to true in _run.py file of ffpeg lib

After solving this error you get ffmpeg not found error :
Screenshot 2023-01-04 155243

you can solve this error by following this blog

and at last take the ffmpeg.exe file and copy it to the src directory to finally run the website.

OS:

  • [Windows]

Additional context
Provided a method to work around the ffmpeg bug

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.