kabanosk / whisper-website

A simple web application that converts audio to subtitles using OpenAI's Whisper model

License: MIT License

Python 47.51% CSS 3.63% HTML 28.00% Shell 1.27% Batchfile 1.58% JavaScript 14.95% Dockerfile 3.06%

fastapi openai speech-to-text whisper python3 uvicorn website audio-to-text subtitles subtitles-generator

whisper-website's Introduction

A website that converts speech to text using the Whisper model (Official Repo)

Hosting website on localhost:

Clone the repo - git clone git@github.com:Kabanosk/whisper-website.git
Go to repo directory - cd whisper-website
Create virtual environment - python3 -m venv venv
Activate the environment - source venv/bin/activate or . venv/bin/activate
Install requirements - pip install -r requirements.txt
Go to src directory - cd src
Run the run.py file - python3 run.py
If the browser doesn't open automatically, go to http://127.0.0.1:8000/ in your browser
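
If the page does not load, it can help to confirm that the server is actually listening before debugging the browser. The following is a minimal sketch using only the Python standard library; the function name is_server_up is hypothetical and not part of this repository, and the URL is the one given in the steps above.

```python
import urllib.error
import urllib.request


def is_server_up(url: str, timeout: float = 2.0) -> bool:
    """Return True if something answers HTTP requests at the given URL."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just with an error status code.
        return True
    except OSError:
        # Connection refused, timeout, DNS failure, and similar.
        return False


if __name__ == "__main__":
    # URL taken from the setup steps; adjust the port for the Docker setup.
    print(is_server_up("http://127.0.0.1:8000/"))
```

For the Docker setup the same check would be pointed at http://127.0.0.1:80/ instead.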

Run website on localhost with Docker

First time

Install Docker
Clone the repo - git clone git@github.com:Kabanosk/whisper-website.git
Go to repo directory - cd whisper-website
Create Docker image - docker build -t app .
Run Docker container - docker run --name app_container -p 80:80 app
Go to your browser and type http://127.0.0.1:80/

Next time

Start your Docker container - docker start app_container
Go to your browser and type http://127.0.0.1:80/

whisper-website's Issues

ImportError: cannot import name 'whisper' from 'whisper'

Thanks for doing this project. I'm looking to use the website on a network server, but when I try to run main.py it says it cannot import the whisper module.
I tried this on my PC and found that a misspelling in main.py causes it:

from whisper import whisper

If you change the module name to start with a capital letter, it works:

from Whisper import whisper
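
When an import like this fails, a quick way to narrow it down is to check which file Python would actually load for each spelling of the module name. This is a small diagnostic sketch using the standard library only; module_origin is a hypothetical helper name, and the sketch does not assume anything about how main.py is laid out.

```python
import importlib.util


def module_origin(name: str):
    """Return the file a top-level module would be loaded from, or None."""
    spec = importlib.util.find_spec(name)
    if spec is None:
        return None
    # origin can also be None for namespace packages.
    return spec.origin


if __name__ == "__main__":
    # Compare both spellings mentioned in the report.
    for candidate in ("whisper", "Whisper"):
        print(candidate, "->", module_origin(candidate))
```

If the two names resolve to different locations, or one resolves to a local folder rather than an installed package, that is a likely place to look for the cause.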

How to configure other languages?

Hi, how can I configure other languages?

Whisper large-V3

Thanks for this, it is really useful. Can this support large-v3?

Internal Server Error

GUI Application?

Greetings, could you please tell me if there are plans to make a full-fledged GUI that you can run as a local program?

Internal Server Issue when trying to transcribe a file

Describe the bug
For some reason, I am getting an internal server error in the content of the output file when I try to transcribe an m4a file.
OS (please complete the following information):

Windows

Additional context
This is the error msg:

Enhancing GPU support, adding VTT/SRT export, and updating to Whisper 2.0

Hello,

I recently tried using your "website_for_whisper" repository and found it very helpful. I have a few suggestions and questions that I hope could improve the project:

GPU support: I noticed that the current implementation seems to be running the Whisper model on the CPU, which can be quite slow for some users. It would be great if the repository could be updated to support running the model on a GPU when available. This should improve the performance and make the transcription process faster.

VTT and SRT export: Another useful feature to add would be the ability to export the transcriptions in VTT and SRT formats. These formats are widely used for subtitles and captions, and having the option to export in these formats would be beneficial for many users.

Whisper 2.0: I saw that OpenAI has released Whisper 2.0, an updated version of the original Whisper model. I was wondering if the repository is using the latest version of the model. If not, could you please update the repository to use Whisper 2.0?
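
For the GPU suggestion above, a minimal sketch of explicit device selection could look like the following. It assumes PyTorch and the openai-whisper package are installed; pick_device and load_whisper_model are hypothetical helper names, not functions from this repository, and the "base" model name is only an example.

```python
def pick_device() -> str:
    """Return "cuda" when a CUDA GPU is usable, otherwise "cpu"."""
    try:
        import torch  # assumption: PyTorch is installed alongside Whisper
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def load_whisper_model(model_name: str = "base"):
    """Load a Whisper model on the selected device (hypothetical helper)."""
    import whisper  # assumption: the openai-whisper package

    return whisper.load_model(model_name, device=pick_device())


if __name__ == "__main__":
    print("Selected device:", pick_device())
```

Printing the selected device at startup is a cheap way to confirm whether transcription will run on the GPU.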

Thank you for your work on this project. I look forward to any updates and improvements you might make in the future.
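
The SRT and VTT export requested above mostly comes down to timestamp formatting. The sketch below assumes the segments are dictionaries with "start", "end" (in seconds) and "text" keys, which is the shape of the "segments" list in a Whisper transcription result; the function names are hypothetical and not taken from this repository.

```python
def format_timestamp(seconds: float, decimal_marker: str = ",") -> str:
    """Format seconds as HH:MM:SS,mmm (SRT) or HH:MM:SS.mmm (VTT)."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}{decimal_marker}{ms:03d}"


def segments_to_srt(segments) -> str:
    """Build SRT text: numbered cues with comma-separated milliseconds."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        start = format_timestamp(seg["start"], ",")
        end = format_timestamp(seg["end"], ",")
        blocks.append(f"{index}\n{start} --> {end}\n{seg['text'].strip()}\n")
    return "\n".join(blocks)


def segments_to_vtt(segments) -> str:
    """Build WebVTT text: a WEBVTT header, then cues with dot-separated ms."""
    lines = ["WEBVTT", ""]
    for seg in segments:
        start = format_timestamp(seg["start"], ".")
        end = format_timestamp(seg["end"], ".")
        lines.extend([f"{start} --> {end}", seg["text"].strip(), ""])
    return "\n".join(lines)


if __name__ == "__main__":
    demo = [{"start": 0.0, "end": 1.25, "text": " Hello "}]
    print(segments_to_srt(demo))
    print(segments_to_vtt(demo))
```

The two formats differ only in the header, the cue numbering, and the decimal marker, so one timestamp helper can serve both.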

Internal server issue when converting audio to text

Describe the bug
The audio file plays fine in the browser, but when the SRT file is downloaded, the file has an internal server error written in it.

OS (please complete the following information):

Windows

Run Python on Windows using windows_run.bat

Describe the bug
On Windows, the command to run Python is python, not python3, when Python is installed from python.org.

OS (please complete the following information):
Windows

Additional context
At file: windows_run.bat
Line 3
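
Since the interpreter name differs between platforms, a launcher can probe for whichever command exists instead of hard-coding one. This is a rough sketch using the standard library; find_python_command is a hypothetical helper, and the candidate list is an assumption rather than something taken from windows_run.bat.

```python
import shutil
import sys


def find_python_command(candidates=("python3", "python", "py")):
    """Return the first interpreter command found on PATH, or None."""
    for name in candidates:
        if shutil.which(name):
            return name
    return None


if __name__ == "__main__":
    print("Command found on PATH:", find_python_command())
    # From inside a running script, sys.executable is the interpreter
    # that is already in use, so no name lookup is needed.
    print("Current interpreter:", sys.executable)
```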

ffmpeg [WinError2]

Describe the bug

When taking input in a Windows environment, you get a [WinError 2] file not found error. You can solve this error by setting the shell argument to True in the subprocess.Popen call in the _run.py file of the ffmpeg lib:
subprocess.Popen( ..., shell=True )

After solving this error, you get an "ffmpeg not found" error.

You can solve this error by following this blog and, finally, copying the ffmpeg.exe file to the src directory to run the website.

OS:

Windows

Additional context
Provided a method to work around the ffmpeg bug
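
Both errors above come down to the ffmpeg executable not being found, so checking for it up front gives a clearer message than a failing subprocess call. The following is a minimal sketch with the standard library; find_ffmpeg is a hypothetical helper, and checking the src directory mirrors the workaround described above rather than anything the project itself does.

```python
import shutil
from pathlib import Path


def find_ffmpeg(extra_dirs=()):
    """Return a path to ffmpeg from PATH or the given directories, or None."""
    found = shutil.which("ffmpeg")
    if found:
        return found
    # Fall back to explicit directories, e.g. a copy placed next to the app.
    for directory in extra_dirs:
        for filename in ("ffmpeg.exe", "ffmpeg"):
            candidate = Path(directory) / filename
            if candidate.is_file():
                return str(candidate)
    return None


if __name__ == "__main__":
    location = find_ffmpeg(extra_dirs=["src"])
    print(location or "ffmpeg not found on PATH or in src")
```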