🎀 transcribee ✍️

[going to be] an open-source audio and video transcription software

Note:

transcribee is still very much a work in progress and not yet ready for production use. Please come back in a few weeks or months.

transcribee 🐝 aims to make the workflow for media transcription easier, faster and more accessible.

  • It can automatically generate a draft transcript of your audio
  • It allows you to quickly improve the automatic draft and fix any errors
  • It's collaborative – split the work with your friends or colleagues
  • It's open-source

Develop!

To get started with developing or to try the current state of transcribee, follow the instructions in the development setup document.

How does it work?

Note:

transcribee is under heavy development. Not all steps described here are implemented yet.

Creating a transcript with transcribee 🐝 is done with the following steps:

  1. Import your media file

    During import, your audio file is automatically converted to text using state-of-the-art models [1]. transcribee 🐝 also automatically detects different speakers in your file.

  2. Manually improve the transcript

    After the automatic transcript is created, you can edit it to correct any mistakes the automatic transcription made [2]. You can also name the speakers.

    Since transcribee 🐝 is collaborative software, you can do this step (and all other manual steps) together with others. All changes are instantly synced with everyone working on the transcript.

  3. Automatic re-alignment

    To keep the timestamps accurate after your edits, transcribee 🐝 matches the corrected text back up with the audio.

  4. Manual re-alignment

    Now you can check the automatically generated timestamps and correct them.

  5. Export

    Once you are happy with the transcript, you can export it.
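To illustrate why step 3 is needed, here is a minimal sketch, not transcribee's actual code. It assumes a transcript is stored as a list of timed words with a speaker label; the names `Word` and `carry_over_timestamps` are hypothetical. It uses a plain text diff from Python's standard library to keep timings for words that survived manual editing and to mark edited words as untimed. This is only a stand-in: the real re-alignment matches the text against the audio itself.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional


@dataclass
class Word:
    """One transcribed word (hypothetical data model, not transcribee's)."""
    text: str
    start: Optional[float]  # seconds; None means "needs re-alignment"
    end: Optional[float]
    speaker: str


def carry_over_timestamps(draft: List[Word], corrected: List[str]) -> List[Word]:
    """Keep timings for unchanged words; mark edited words as untimed."""
    matcher = SequenceMatcher(
        a=[w.text for w in draft], b=corrected, autojunk=False
    )
    result: List[Word] = []
    for tag, a0, a1, b0, b1 in matcher.get_opcodes():
        if tag == "equal":
            # Word was not touched by the editor: its timing is still valid.
            result.extend(draft[a0:a1])
            continue
        # Assumed policy: new or changed words inherit the speaker of the
        # nearest draft word and lose their timing until re-aligned.
        speaker = draft[min(a0, len(draft) - 1)].speaker if draft else "unknown"
        for text in corrected[b0:b1]:
            result.append(Word(text, None, None, speaker))
    return result


if __name__ == "__main__":
    draft = [Word("helo", 0.0, 0.4, "S1"), Word("world", 0.5, 0.9, "S1")]
    for word in carry_over_timestamps(draft, ["hello", "world"]):
        status = "needs re-alignment" if word.start is None else "timed"
        print(word.speaker, word.text, status)
```

Words left without timing are the ones an audio-based aligner would have to place again, and the ones worth checking by hand in step 4.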

Acknowledgements

  • Funded from March 2023 until September 2023 by the Bundesministerium für Bildung und Forschung, the Prototype Fund and the Open Knowledge Foundation Deutschland

Footnotes

  1. At the moment we use whisper.cpp for transcription, Wav2Vec2 for realignment and speechbrain for speaker identification.

  2. The editor is based on slate with collaboration using the automerge CRDT.

Contributors

pajowu, phlmn, anuejn, rroohhh, voronov007, moeffju, dnkbln
