Audio Assistant Extension

Overview

Audio Assistant Extension is a Chrome extension that records audio from a browser tab, transcribes it with OpenAI's Whisper API, and sends the transcript to the GPT-4 API together with an instruction you define. It is particularly useful for professionals and students who frequently attend online meetings.

Local Installation and Setup

To use this extension, you need to clone the repository, install dependencies, and build the extension. Here's a step-by-step guide:

  1. Clone the Repository:
       git clone https://github.com/felri/chrome-extension-transcript
       cd chrome-extension-transcript
  2. Install Dependencies:
       npm install
  3. Build the Extension:
       npm run build
  4. Load the Extension in Chrome:
       • Open Chrome and navigate to chrome://extensions/
       • Enable Developer mode by toggling the switch in the top right corner.
       • Click on Load unpacked and select the build folder from your cloned repository.
  5. Start Using the Extension:
       • After loading the extension, you will see the OpenAI Audio Assistant icon in your browser toolbar.
       • Click on it to start using the extension as described under How to Use below.

Make sure to have Node.js and npm installed on your system to run these commands.

Features

  • Audio Recording: Capable of recording audio from your current browser tab, whether it's a video or live audio from platforms like Zoom or Google Meet.
  • Transcription via Whisper API: Converts the recorded audio into text using OpenAI's Whisper API.
  • Interaction with GPT-4 API: Sends the transcribed text to the GPT-4 API along with a predefined user instruction, such as analyzing, summarizing, or generating a response based on the transcript.
  • Custom System Message: Users can set a specific instruction or query at the start to guide the AI's response to the transcript.
  • Local Storage of API Keys: API keys required for the Whisper and GPT-4 APIs are stored in the browser's local storage on your device, so you do not have to re-enter them.
  • Open Source Code: The codebase is open source, allowing users to review, modify, and understand how the extension operates.
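
The GPT-4 interaction described in the list above can be sketched as a small request builder. This is a minimal illustration, not the extension's actual code: `buildChatRequest` and its parameter names are hypothetical, and it assumes the custom system message is sent as the `system` message and the transcript as the `user` message. The endpoint and body shape follow OpenAI's Chat Completions API.

```typescript
// Hypothetical helper: names and structure are illustrative, not taken from the repository.
type ChatMessage = { role: "system" | "user"; content: string };

interface ChatRequest {
  url: string;
  init: {
    method: string;
    headers: { [name: string]: string };
    body: string;
  };
}

function buildChatRequest(
  apiKey: string,
  systemMessage: string,
  transcript: string,
  model: string = "gpt-4"
): ChatRequest {
  // Assumption: the system message carries the instruction, the transcript is the user turn.
  const messages: ChatMessage[] = [
    { role: "system", content: systemMessage },
    { role: "user", content: transcript },
  ];
  return {
    url: "https://api.openai.com/v1/chat/completions",
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: "Bearer " + apiKey,
      },
      body: JSON.stringify({ model: model, messages: messages }),
    },
  };
}
```

The result can be passed to `fetch(request.url, request.init)`; in the JSON response, the reply text is found at `choices[0].message.content`.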

How to Use

  1. Install the Extension: Add the OpenAI Audio Assistant to your Chrome browser.
  2. Set API Keys: Enter your Whisper and GPT-4 API keys in the provided settings area. These keys will be stored locally for future use.
  3. Define System Message: Specify the type of operation or response you need from GPT-4 regarding the transcribed text.
  4. Record Audio: Activate the extension on a tab where audio is playing. This can be a video, an online meeting, or any other audio source.
  5. Transcription and Interaction: The extension will transcribe the audio and interact with GPT-4 based on your predefined instructions.
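
The transcription call in step 5 could look roughly like the sketch below. `buildTranscriptionRequest` is a hypothetical helper, not code from the repository; the endpoint, the `whisper-1` model name, and the `model`, `response_format`, and `language` form fields follow OpenAI's audio transcription API. The recorded audio is left to the caller, because how the extension captures and encodes it is not described here.

```typescript
// Hypothetical helper: describes the multipart request without touching browser APIs.
interface TranscriptionRequest {
  url: string;
  headers: { [name: string]: string };
  // Text fields of the multipart form; the caller adds the audio as the "file" field.
  fields: { [name: string]: string };
}

function buildTranscriptionRequest(apiKey: string, language?: string): TranscriptionRequest {
  const fields: { [name: string]: string } = {
    model: "whisper-1",
    response_format: "json",
  };
  // Optional ISO-639-1 language hint such as "en"; omitted when not provided.
  if (language) {
    fields["language"] = language;
  }
  return {
    url: "https://api.openai.com/v1/audio/transcriptions",
    // No Content-Type header here: the browser sets the multipart boundary itself.
    headers: { Authorization: "Bearer " + apiKey },
    fields: fields,
  };
}
```

In a browser, these fields would be appended to a `FormData` object together with the recorded audio `Blob` under the name `file` (the file name, for example `audio.webm`, is an assumption), and the form sent with a `POST` request. With `response_format` set to `json`, the transcript comes back in the `text` property of the response.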

Privacy and Security

The extension does not store any personal data or recordings. Recorded audio is sent to the Whisper API and the resulting transcript to the GPT-4 API for processing in real time, and API keys are stored locally on your device.
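
Keeping the keys on the device might look like the following sketch. It assumes a store with the `getItem`/`setItem` shape of the Web Storage API, so `window.localStorage` can be passed in directly; whether the extension actually uses `localStorage` or `chrome.storage.local` is not stated, and the storage key names and helper functions are hypothetical.

```typescript
// Minimal subset of the Web Storage API (window.localStorage satisfies it).
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

// Hypothetical storage key names; the extension's real names are not documented.
const WHISPER_KEY = "whisperApiKey";
const GPT_KEY = "gptApiKey";

interface ApiKeys {
  whisper: string;
  gpt: string;
}

function saveApiKeys(store: KeyValueStore, keys: ApiKeys): void {
  // Trim to drop stray whitespace from copy and paste.
  store.setItem(WHISPER_KEY, keys.whisper.trim());
  store.setItem(GPT_KEY, keys.gpt.trim());
}

// Returns null when either key is missing, so the UI can prompt for it.
function loadApiKeys(store: KeyValueStore): ApiKeys | null {
  const whisper = store.getItem(WHISPER_KEY);
  const gpt = store.getItem(GPT_KEY);
  if (!whisper || !gpt) {
    return null;
  }
  return { whisper: whisper, gpt: gpt };
}

// In-memory stand-in for localStorage, useful outside the browser.
function createMemoryStore(): KeyValueStore {
  const data: { [key: string]: string } = {};
  return {
    getItem: function (key: string): string | null {
      return Object.prototype.hasOwnProperty.call(data, key) ? data[key] : null;
    },
    setItem: function (key: string, value: string): void {
      data[key] = value;
    },
  };
}
```

In the browser this would be called as `saveApiKeys(window.localStorage, { whisper: "...", gpt: "..." })`. Browser local storage is not encrypted, so the keys are only as protected as the browser profile that holds them.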
