Giter Club home page Giter Club logo

bengali.ai-speech-recognition-solution's Introduction

Bengali.AI-Speech-Recognition-Solution

dataset API command -- kaggle competitions download -c bengaliai-speech

data_augmentation.py

!python preprocess.py
--model_name_or_path "openai/whisper-small"
--language "Bengali"
--output_dir "75k-samples"
--preprocessing_num_workers 90
--preprocessing_only
--text_column_name "sentence"
--data_dir "data"
--min_duration_in_seconds 2
--max_duration_in_seconds 30
--max_train_samples 75000
--max_eval_samples 5000
--apply_spec_augment

#train.py data_dir = "/kaggle/input/bengali-ai-asr-10k"

!torchrun --nproc_per_node 2 train.py
--model_name_or_path "bangla-speech-processing/BanglaASR"
--train_data_dir $data_dir
--validation_data_dir $data_dir
--language "Bengali"
--output_dir "whisper-base-bn"
--do_train
--do_eval
--fp16
--group_by_length
--predict_with_generate
--dataloader_num_workers 1
--overwrite_output_dir
--per_device_train_batch_size 4
--length_column_name "input_length"
--report_to "none"
--metric_for_best_model "wer"
--greater_is_better False
--evaluation_strategy "epoch"
--save_strategy "epoch"
--save_total_limit 1
--logging_steps 10
--gradient_checkpointing
--warmup_steps 50
--apply_spec_augment True
--num_train_epochs 3
--learning_rate "1e-5"

bengali.ai-speech-recognition-solution's People

Contributors

hunter-wrynn avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.