nuaazs Goto Github PK

followers: 34.0 following: 230.0 repos: 68.0 gists: 0.0

Type: User

Bio: Speaker Recognition/Diarization, Anti-spoofing, Speech Recognition

👋 Constantly exploring the fascinating fields of Speaker Recognition/Diarization, Anti-spoofing, Speech Recognition etc.
💬 Contact me by email.

nuaazs's Projects

qwen-audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

retnet

An implementation of "Retentive Network: A Successor to Transformer for Large Language Models"

saru-flask

Flask backend for SARU ( MR -> CT transform network)

ScanNetAI is an advanced self-supervised deep learning model tailored for CT image analysis. It excels in processing large-scale CT data, offering superior performance in tasks like image segmentation, medical image conversion, and dosage prediction.

speaker-diarization

speech_microservice

speechalgorithms

Speech Algorithms

speechbrain

A PyTorch-based Speech Toolkit

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

ssspy

A Python toolkit for sound source separation.

stable-diffusion-webui

Stable Diffusion web UI

stray_light_suppression

svc

SoftVC VITS Singing Voice Conversion

tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

topas4bnct

The python-based TOPAS script generation tool automatically calculates the optimal beam direction by providing ct and tumor mask files(nii/nrrd). Other information such as field size, forwardness, etc. can be set through specific templates.