Comments (4)

ZachNagengast commented on July 2, 2024

We do have different log levels. It sounds like you're interested in logLevel: .info rather than debug? For the CLI this is hardcoded at the moment, so we can add it as a new CLI argument. Is there anything specific you'd especially like to see in the info logs?

atiorh commented on July 2, 2024

@quist00 Adding to Zach's point, if you are interested in a streaming application (as opposed to offline processing of a file) and want to test or emulate streaming performance on a file, you can use --stream-simulated in the CLI.

quist00 commented on July 2, 2024

It would be great if that could be added as a flag to the CLI. Streaming applications are not something we are really looking at currently. I work at a library, and we want to use whisper internally to drastically reduce the time and expense of transcribing and translating items for oral history projects. I and many of my colleagues have Apple Silicon, so I really appreciate you all working on options that run more efficiently for us. I want to share it with other researchers around campus who may also have dozens or hundreds of hours of audio to contend with. The command line will be the best option for most of them, rather than a programmatic API approach, since in most cases they are not programmers and have none on staff.

As for the output, I think timestamps along with chunks of text as it goes would be best. That way, novice users can get a rough estimate along the lines of: if I use this model with whisperkit, I will get x minutes of output per minute of processing. They can then grade the output and determine the right tradeoff of model versus processing time.

Thanks for your consideration.
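
To make the "x minutes of output per minute of processing" estimate concrete, here is a minimal sketch of the arithmetic. It is not part of WhisperKit or its CLI; the function names and the sample numbers are hypothetical, and it assumes you have already noted the last timestamp printed and how long the run has taken so far.

```python
def speed_factor(audio_seconds: float, processing_seconds: float) -> float:
    """Minutes of audio transcribed per minute of processing.

    audio_seconds: last timestamp seen in the output (hypothetical input).
    processing_seconds: wall-clock time spent so far (hypothetical input).
    """
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


def estimated_processing_hours(total_audio_hours: float, factor: float) -> float:
    """Rough time needed to process a whole collection at a given speed factor."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return total_audio_hours / factor


# Illustrative numbers only: 10 minutes of audio transcribed in 1 minute.
factor = speed_factor(audio_seconds=600, processing_seconds=60)
print(f"speed factor: {factor:.1f}x")
print(f"100 hours of audio: about {estimated_processing_hours(100, factor):.1f} hours")
```

Running the same comparison for each model gives the speed side of the tradeoff; the quality side still has to be graded by reading the output.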

ZachNagengast commented on July 2, 2024

@quist00 Could you perhaps give an example of the input/output pairs you're looking for? That way we can build toward a CLI flag that would result in an acceptable output for you.
