vosk_stt_app's Introduction

vosk_stt_app

基于vosk的本地轻量化语音识别程序

功能特点

本地运行：所有处理都在本地完成，无需网络连接，保护用户隐私。
轻量化：程序体积小，占用资源少，适合各种计算环境。
简单易用：通过图形界面操作，一键录音并转换为文本。
高识别率：使用vosk模型，提供准确的语音识别效果。

环境要求

Python 3.x
pyaudio
vosk
numpy

安装步骤

确保系统已安装Python 3.x。
安装所需的Python库：
```
pip install pyaudio vosk numpy
```
下载vosk模型文件并放置在程序指定的路径下。

使用方法

运行程序，界面会提示“点击右侧按钮开始录音”。
点击“点击录音”按钮开始录音。
录音过程中保持安静2秒后，程序会自动结束录音并开始转换。
转换完成后，文本会自动显示在下方的文本框中。

注意事项

录音时，请确保环境安静，以提高识别准确率。
如果录音转文字失败，请检查模型文件路径是否正确，并尝试重新录音。

代码结构

speech_to_text：核心函数，负责录音和语音识别。
notice：用于在状态框中显示信息。
run：启动录音和识别的线程。
root.mainloop：Tkinter的事件循环，等待用户操作。

贡献与反馈

欢迎对本项目提出改进建议或贡献代码。如有问题，请通过Issues进行反馈。

开源协议

本项目遵循MIT License，详情见项目中的LICENSE文件。

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.

Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

TensorFlow

An Open Source Machine Learning Framework for Everyone

Django

The Web framework for perfectionists with deadlines.

Laravel

A PHP framework for web artisans

D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

web

Some thing interesting about web. New door for the world.

server

A server is a program made to process requests and deliver data to clients.

Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

Visualization

Some thing interesting about visualization, use data art

Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.

Microsoft

Open source projects and samples from Microsoft.

Google

Google ❤️ Open Source for everyone.

Alibaba

Alibaba Open Source for everyone

D3

Data-Driven Documents codes.

Tencent

China tencent open source team.

swordswind / vosk_stt_app Goto Github PK