
clarity's Introduction

Machine learning challenges for hearing aid processing

Clarity Challenge | Cadenza Challenge



We are organising a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise (Clarity) and speech-in-music (Cadenza). For further details of the Clarity Project visit the Clarity project website, and for details of our latest Clarity challenges visit our challenge documentation site. You can contact the Clarity Team by email at [email protected]. For further details of the Cadenza Project visit the Cadenza project website, and to find out about the latest Cadenza challenges join the Cadenza Challenge Group.

In this repository, you will find code to support all Clarity and Cadenza Challenges, including baselines, toolkits, and systems from participants. We encourage you to make your system/model open source and contribute to this repository.

Current Events

  • The 3rd Clarity Enhancement Challenge is now open. 🔥🔥
  • The ICASSP 2024 Cadenza Challenge (CAD_ICASSP_2024) will be presented at ICASSP 2024.
  • The first Cadenza Challenge (CAD1) is closed.
    • Subjective Evaluation is underway. 🆕
  • The 2nd Clarity Prediction Challenge (CPC2) is now closed.
  • The 4th Clarity Workshop will be held as a satellite event of Interspeech 2023. For details visit the workshop website.

Installation

PyPI

Clarity is available on the Python Package Index (PyPI). To install, create and/or activate a virtual environment and then use pip to install.

conda create --name clarity python=3.8
conda activate clarity

pip install pyclarity

GitHub Cloning

# First clone the repo
git clone https://github.com/claritychallenge/clarity.git
cd clarity

# Second create & activate environment with conda, see https://docs.conda.io/projects/conda/en/latest/user-guide/install/index.html
conda create --name clarity python=3.8
conda activate clarity

# Last install with pip
pip install -e .

GitHub pip install

Alternatively, pip can install packages directly from GitHub sources. The following will install the current main branch.

pip install -e git+https://github.com/claritychallenge/clarity.git@main

Challenges

Current challenge

Previous challenges

Available tools

We also provide a number of tools in this repository:

In addition, differentiable approximations of some tools are provided:

Open-source systems

clarity's People

Contributors

github-merge-queue[bot], groadabike, jonbarker68, jwillbailey, ns-rse, pre-commit-ci[bot], slackline, snnbotchway, tuzehai, wb314cam


clarity's Issues

[FEATURE] Apply pylint to existing codebase.

Is your feature request related to a problem? Please describe.

Before #42 can be implemented we need to run pylint against the existing code base and resolve all issues.

Describe the solution you'd like

Changes should be made on a separate branch and committed with --ignore-revs to avoid mis-attributing the "blame" (see example in Introducing Black to your project - Black 22.6.0 documentation).

Further, a custom .pylintrc file will need introducing that configures pylint and the rules that are to be included/excluded (e.g. we may wish to ignore many invalid-name errors).
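
A minimal sketch of the kind of .pylintrc section this might start from; which checks end up disabled is an assumption to be settled during the clean-up:

[MESSAGES CONTROL]
# Temporarily ignore naming errors until the codebase has been brought into line
disable = invalid-name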

Inconsistent logic with parameter shift in evaluator/haspi/eb.CenterFreq

The EarModel in evaluator/haspi/eb computes the center frequency using shift = 0.02

# Parameters for the control filter bank
HLmax = [100, 100, 100, 100, 100, 100]
shift = 0.02 # Basal shift of 0.02 of the basilar membrane length
cfreq1 = CenterFreq(nchan, shift) # Center frequencies for the control
_, BW1, _, _, _ = LossParameters(HLmax, cfreq1)
# Maximum BW for the control

However, the function CenterFreq in evaluator/haspi/eb forces the parameter to be None to keep consistency with the MATLAB code:

def CenterFreq(nchan, shift=None):
    """
    Function to compute the ERB frequency spacing for the gammatone
    filter bank. The equation comes from Malcolm Slaney (1993).

    Calling variables
        nchan   number of filters in the filter bank
        shift   optional frequency shift of the filter bank specified as a
                fractional shift in distance along the BM. A positive shift
                is an increase in frequency (basal shift), and negative is
                a decrease in frequency (apical shift). The total length of
                the BM is normalized to 1. The frequency-to-distance map is
                from D.D. Greenwood (1990), JASA 87, 2592-2605, Eq (1).

    James M. Kates, 25 January 2007.
    Frequency shift added 22 August 2008.
    Lower and upper frequencies fixed at 80 and 8000 Hz, 19 June 2012.
    Translated from MATLAB to Python by Zuzanna Podwinska, March 2022.
    """
    lowFreq = 80
    highFreq = 8000

    # Moore and Glasberg ERB values
    EarQ = 9.26449
    minBW = 24.7

    # In the Matlab code, the loop below never evaluates
    # (but the current code was trained with this bug)
    shift = None  # This is to keep consistency with MATLAB code
    if shift is not None:
        k = 1
        A = 165.4
        a = 2.1  # shift specified as a fraction of the total length
        # Locations of the low and high frequencies on the BM between 0 and 1
        xLow = (1 / a) * np.log10(k + (lowFreq / A))
        xHigh = (1 / a) * np.log10(k + (highFreq / A))
        # Shift the locations
        xLow = xLow * (1 + shift)
        xHigh = xHigh * (1 + shift)
        # Compute the new frequency range
        lowFreq = A * (10 ** (a * xLow) - k)
        highFreq = A * (10 ** (a * xHigh) - k)

    # All of the following expressions are derived in Apple TR #35,
    # "An Efficient Implementation of the Patterson-Holdsworth Cochlear
    # Filter Bank" by Malcolm Slaney.
    cf = -(EarQ * minBW) + np.exp(
        np.arange(1, nchan)
        * (-np.log(highFreq + EarQ * minBW) + np.log(lowFreq + EarQ * minBW))
        / (nchan - 1)
    ) * (highFreq + EarQ * minBW)
    cf = np.insert(cf, 0, highFreq)  # Last center frequency is set to highFreq
    cf = np.flip(cf)
    return cf

What is the point of setting shift in CenterFreq if it is never used?
Setting the value in the EarModel gives the impression that a shift is applied; won't that confuse people reading the code?

What would be the best way to clean up this part of the code?
1- Changing shift to None in EarModel?
2- Removing the shift parameter from CenterFreq?
3- Removing the shift = None statement in CenterFreq?
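
For reference, a quick check of the current behaviour, assuming the function above is importable as clarity.evaluator.haspi.eb.CenterFreq (a sketch, not part of the repository):

import numpy as np
from clarity.evaluator.haspi import eb

cf_default = eb.CenterFreq(32)
cf_shifted = eb.CenterFreq(32, shift=0.02)
print(np.allclose(cf_default, cf_shifted))  # True: the shift argument currently has no effect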

[FEATURE] Release to PyPI

Is your feature request related to a problem? Please describe.

Making pyclarity available on PyPI would make installation easier, particularly as development progresses.

Describe the solution you'd like

Automated publishing of commits tagged with versions to PyPI should occur. This is possible with the pypi-publish GitHub Action.

Describe alternatives you've considered

Manual releases are also possible, but should not be needed if the process can be automated.

Additional context

N/A.

Question about _target and _target_anechoic files in CEC1 datasets

When I train the e009_sheffield model, both the train loss and val loss (SNRLoss) converge at 0.0043, which means all outputs for any input are 0. When checking the training dataset, I found that the model uses '_target_anechoic.wav' as the ground truth, but '_target_anechoic.wav' has about 20 ms of latency compared with both '_mix_CH1.wav' and '_target_CH1.wav'. In this way, the model is expected to predict future information, which is impossible. If I change the ground truth from '_target_anechoic.wav' to '_target_CH1.wav', the problem is solved and the model converges normally. Could you give me some advice on why the approximate 20 ms latency exists between '_target_CH1.wav' and '_target_anechoic.wav' and how I can solve it? Thanks!
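For anyone wanting to measure the offset themselves, a rough cross-correlation check is sketched below; the file names are placeholders and soundfile/scipy are assumed to be installed:

import numpy as np
import soundfile as sf
from scipy.signal import correlate

# Placeholder file names for one scene; substitute the real CEC1 paths
anechoic, fs = sf.read("S00001_target_anechoic.wav")
reverberant, _ = sf.read("S00001_target_CH1.wav")

# Compare the first channel of each over a common length
a = anechoic[:, 0] if anechoic.ndim > 1 else anechoic
b = reverberant[:, 0] if reverberant.ndim > 1 else reverberant
n = min(len(a), len(b))
a, b = a[:n], b[:n]

# The lag at the cross-correlation peak estimates the delay of b relative to a
lags = np.arange(-n + 1, n)
delay = lags[np.argmax(correlate(b, a))]
print(f"Estimated delay: {delay} samples ({1000 * delay / fs:.1f} ms)")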

Different length for audio files and head rotation signal

Hi,

Thanks for your fantastic work!

I'm trying to use the head rotation signal, but I find that the lengths of the audio files (e.g. mixed_CH1.wav, target_CH1.wav) and the head rotation file (hr.wav) are different. The head rotation signal is 255 samples shorter than the audio files. For example, for S06001 in the dev dataset, the length of S06001_hr.wav is 287735 and the length of S06001_mix_CH0.wav is 287990.

My question is how I should deal with the misalignment: should I drop 255 samples at the beginning or at the end of the audio files?

Best regards,

Chengwei Ouyang
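
One pragmatic workaround (not an official recommendation) is simply to truncate everything to the common length; a minimal sketch, assuming soundfile is installed and using the file names from the example above:

import soundfile as sf

mix, fs = sf.read("S06001_mix_CH0.wav")
head_rotation, _ = sf.read("S06001_hr.wav")

# Trim both signals to the shorter length so their samples line up index-for-index
common_length = min(len(mix), len(head_rotation))
mix = mix[:common_length]
head_rotation = head_rotation[:common_length]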

Improve unit-tests across the code base.

PRs #50 / #57 / #60 / #61 / #62 addressed the need for regression tests (see #48).

Now that these are in place it is possible to start introducing finer grained unit-tests across the codebase. This will provide immediate benefits...

  • Ability to identify errors with higher resolution.
  • Improve the range and breadth of conditions under which tests are made.
  • Facilitate refactoring of code going forward.

This issue serves as an "Epic" for recording the progress of tests written across modules/sub-modules. Individual issues and pull requests can be added against each module/sub-module for reference.

  • #81
    • data/demo_data.py
    • data/HOA_tools_cec2.py
    • data/scene_builder_cec1.py
    • data/scene_render_cec1.py
    • data/scene_render_cec2.py
    • data/utils.py #88
  • #82
    • dataset/cec1_dataset.py
  • #83
    • engine/losses.py
    • engine/system.py
  • #270
    • recipes/cad1/task1 (partially complete)
    • recipes/cad1/task2 (partially complete)
    • recipes/cec1/baseline
    • recipes/cec1/data_preparation/prepare_cec1_data.py
    • recipes/cec1/e009_sheffield
    • recipes/cec2/baseline
    • recipes/cec2/data_preparation
    • recipes/cpc1/baseline
    • recipes/cpc1/e029_sheffield
    • recipes/cpc1/e032_sheffield
    • recipes/cpc1/test_listener_responses
    • recipes/cpc2 (partially complete)
    • recipes/icassp_2023
  • #84
    • enhancer/compressor.py
    • enhancer/nalr.py
    • enhancer/dnn
      • enhancer/dnn/mc_conv_tasnet.py
    • enhancer/dsp
      • enhancer/dsp/filter.py
    • enhancer/gha
      • enhancer/gha/audiogram.py
      • enhancer/gha/gainrule_camfit.py
      • enhancer/gha/gha_interface.py
      • enhancer/gha/gha_utils.py
  • #85
    • evaluator/haspi
      • evaluator/haspi/eb.py
      • evaluator/haspi/ebm.py
      • evaluator/haspi/haspi.py
      • evaluator/haspi/ip.py
    • evaluator/hasqi
      • evaluator/hasqi/hasqi.py
    • evaluator/haaqi
      • evaluator/haaqi/haaqi.py
    • evaluator/mbstoi
      • evaluator/mbstoi/mbstoi.py
      • evaluator/mbstoi/mbstoi_utils.py
    • evaluator/msbg
      • evaluator/msbg/audiogram.py
      • evaluator/msbg/cochlea.py
      • evaluator/msbg/masbg.py
      • evaluator/msbg/masbg_utils.py
      • evaluator/msbg/smearing.py
  • #86
    • predictor/torch_msbg.py
    • predictor/torch_stoi.py
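
As a concrete illustration of the granularity intended here, a minimal sketch of a unit test; the module path and the exact assertions are assumptions, not existing project tests:

# tests/evaluator/haspi/test_eb.py (sketch)
import numpy as np

from clarity.evaluator.haspi import eb


def test_center_freq_returns_requested_number_of_increasing_bands():
    cf = eb.CenterFreq(32)
    assert len(cf) == 32
    # Centre frequencies should be strictly increasing across the fixed 80-8000 Hz range
    assert np.all(np.diff(cf) > 0)
    assert np.isclose(cf[0], 80) and np.isclose(cf[-1], 8000)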

Improve nomenclature of functions, classes and variables to be PEP8 compliant

Currently the nomenclature of classes, functions and variables is a bit of a mixed bag.

It would improve legibility if they were all PEP8 compliant and had meaningful names (although there are some cases where exceptions may be considered reasonable, such as implementations of algorithms).

To that end, this issue should address changing as many class, function and variable names as possible to be PEP8 compliant, and disabling checks for those that are not considered worth bringing into line.

Currently the existing .pylintrc excludes these checks from being made because C0103: invalid-name is disabled. Re-enabling this check temporarily and running pylint against the clarity directory shows the invalid-name errors detailed in this gist (including it in the body of this issue exceeded the character limit!).

EDIT: Many changes can be made simply and consistently across the code base by using an IDE that supports the Language Server Protocol, which provides a "rename" directive for project-wide renaming of a symbol.

Personally I use Emacs and lsp-mode, so I can run M-x lsp-rename when the cursor is on a variable/function/class to rename it.

How to run inference via jupyter on a single file using e029_sheffield / Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction

Hi, thanks so much for making this. I'd like to know how to run inference on a single wav file, but I'm running into trouble with DictConfig on Colab, taking inspiration from the infer.py file. This is the function I wrote:

import copy
import json
import logging
import os

import hydra
import speechbrain as sb
import torch
from hyperpyyaml import load_hyperpyyaml
from omegaconf import DictConfig
from speechbrain.utils.distributed import run_on_main
from tqdm import tqdm
from transformer_cpc1_ensemble_decoder import S2STransformerBeamSearch

logger = logging.getLogger(__name__)
def init_asr(asr_config):
    hparams_file, run_opts, overrides = sb.parse_arguments([asr_config])
    with open(hparams_file) as fin:
        hparams = load_hyperpyyaml(fin, overrides)

    tokenizer = hparams["tokenizer"]
    bos_index = hparams["bos_index"]

    # We download the pretrained LM from HuggingFace (or elsewhere depending on
    # the path given in the YAML file). The tokenizer is loaded at the same time.
    run_on_main(hparams["pretrainer"].collect_files)
    hparams["pretrainer"].load_collected(device=run_opts["device"])

    asr_brain = ASR(
        modules=hparams["modules"],
        opt_class=hparams["Adam"],
        hparams=hparams,
        run_opts=run_opts,
        checkpointer=hparams["checkpointer"],
    )
    asr_brain.init_evaluation()

    return asr_brain, tokenizer, bos_index

def compute_uncertainty(left_proc_path, asr_model, bos_index, tokenizer):
    wav_len = torch.tensor([1], dtype=torch.float32)
    tokens_bos = torch.LongTensor([bos_index]).view(1, -1)

    right_proc_path = left_proc_path.replace("left", "right")
    left_proc_wav = sb.dataio.dataio.read_audio(left_proc_path).view(1, -1)
    right_proc_wav = sb.dataio.dataio.read_audio(right_proc_path).view(1, -1)

    left_uncertainty = asr_model.compute_uncertainty(left_proc_wav, wav_len, tokens_bos)
    right_uncertainty = asr_model.compute_uncertainty(
        right_proc_wav, wav_len, tokens_bos
    )
    conf = max(
        left_uncertainty[0]["confidence"].detach().cpu().numpy(),
        right_uncertainty[0]["confidence"].detach().cpu().numpy(),
    )
    neg_ent = -min(
        left_uncertainty[0]["entropy"].detach().cpu().numpy(),
        right_uncertainty[0]["entropy"].detach().cpu().numpy(),
    )
    return conf, neg_ent






# --- AND NOW MY NOT-WORKING CODE ---- 


@hydra.main(config_path=".", config_name="config")
def infer_single(wav_path):

    cfg = DictConfig
    if cfg.cpc1_track == "open":
        track = "_indep"
    elif cfg.cpc1_track == "closed":
        track = ""
    else:
        logger.error("cpc1_track has to be closed or open")
    print(dir(cfg))
    asr_model, tokenizer, bos_index = init_asr(cfg.asr_config)
    wav_len = torch.tensor([1], dtype=torch.float32)
    tokens_bos = torch.LongTensor([bos_index]).view(1, -1)
    wav = sb.dataio.dataio.read_audio(wav_path).view(1, -1)
    uncertainty = asr_model.compute_uncertainty(wav, wav_len, tokens_bos)
    return uncertainty

But running in colab:

wav_path = '/content/clarity/recipes/cpc1/e029_sheffield/test.wav'
uncertainty = infer_single(wav_path)

It's giving me this error:

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
[<ipython-input-29-43c026e4011b>](https://localhost:8080/#) in <module>
      1 wav_path = '/content/clarity/recipes/cpc1/e029_sheffield/test.wav'
----> 2 uncertainty = infer_single(wav_path)

1 frames
[/usr/local/lib/python3.7/dist-packages/hydra/main.py](https://localhost:8080/#) in decorated_main(cfg_passthrough)
     77         def decorated_main(cfg_passthrough: Optional[DictConfig] = None) -> Any:
     78             if cfg_passthrough is not None:
---> 79                 return task_function(cfg_passthrough)
     80             else:
     81                 args_parser = get_args_parser()

[<ipython-input-28-f4867e54a086>](https://localhost:8080/#) in infer_single(wav_path)
    183 
    184     cfg = DictConfig
--> 185     if cfg.cpc1_track == "open":
    186         track = "_indep"
    187     elif cfg.cpc1_track == "closed":

AttributeError: type object 'DictConfig' has no attribute 'cpc1_track'

I've tried researching the docs for DictConfig from OmegaConf, but I'm very confused about what it actually does and am having a really hard time writing this function. Would love any help whatsoever!
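
For reference, when driving Hydra-configured code from a notebook rather than the command line, the DictConfig is usually built with Hydra's compose API instead of relying on the @hydra.main decorator (which injects cfg as the function argument). A minimal sketch, where the config path and name are assumptions based on the recipe layout:

from hydra import compose, initialize

# Build the config that @hydra.main would normally inject
with initialize(config_path="."):
    cfg = compose(config_name="config")

print(cfg.cpc1_track)  # cfg is now a populated DictConfig instance, not the class itself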

HASPI Usage Standards For Evaluation Reporting

Hello, I am a researcher on speech enhancement and I am new to the use of objective intelligibility measures in the area of hearing impairment. I was able to figure out how to use MBSTOI, but HASPI has multiple parameters that I am not sure how to specify.

What are the standard specifications used for HASPI? Specifically clarity.evaluator.haspi.haspi_v2.
https://github.com/claritychallenge/clarity/blob/main/clarity/evaluator/haspi/haspi.py

Here is my attempt at making an example:

! pip3 install https://github.com/claritychallenge/clarity/archive/refs/heads/main.zip --quiet
from clarity.evaluator import haspi, mbstoi

import os

root_url  = "https://github.com/microsoft/DNS-Challenge/raw/interspeech2020/master/datasets/test_set/synthetic/no_reverb"
clean_url = root_url + "/clean/clean_fileid_0.wav"
noisy_url = root_url + "/noisy/clnsp126_3Wjw0nadnM4_snr15_tl-22_fileid_0.wav"

if "clean.wav" not in os.listdir() or "noisy.wav" not in os.listdir():
    os.system("wget -O clean.wav " + clean_url)
    os.system("wget -O noisy.wav " + noisy_url)

import librosa

clean = librosa.load("clean.wav", sr=16000)[0]
noisy = librosa.load("noisy.wav", sr=16000)[0]

Y1 = mbstoi.mbstoi(clean, clean, noisy, noisy, fs_signal=16000)
Y2, _ = haspi.haspi_v2(clean, 16000, noisy, 16000, [250, 500, 1000, 2000, 4000, 6000])

print("MBSTOI ", Y1, "\n", "HASPI  ", Y2, sep="")

However, the observed HASPI value is really small. Is this correct usage? What are the standard ways to specify the evaluation?
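
One point worth noting: the fifth argument to haspi_v2 is an audiogram, i.e. hearing-loss values in dB HL at the six audiometric frequencies [250, 500, 1000, 2000, 4000, 6000] Hz, rather than the frequencies themselves. A minimal sketch continuing from the snippet above, using a flat audiogram purely as an illustrative assumption:

# Hearing loss in dB HL at [250, 500, 1000, 2000, 4000, 6000] Hz (0 = normal hearing)
hearing_loss = [0, 0, 0, 0, 0, 0]

score, _ = haspi.haspi_v2(clean, 16000, noisy, 16000, hearing_loss)
print("HASPI", score)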

In enhancer.nalr.build, make parameter cfs None by default

The MATLAB version of NALR has 3 parameters: HL, nfir and fsamp

function [nalr,delay]=eb_NALR(HL,nfir,fsamp)

In Clarity, NALR is implemented as a Class in enhancer.nalr.

class NALR:
    def __init__(self, nfir, fs):
        [...]
    def hl_interp(self, hl, cfs):
        [...]
    def build(self, HL, cfs):
        HL = self.hl_interp(np.array(HL), np.array(cfs))
        [...]

For the cases when cfs is not available, and to match MATLAB's use in HAAQI and HASQI, cfs needs to default to None and an if conditional added:

def build(self, HL, cfs=None):
    if cfs is not None:
        HL = self.hl_interp(np.array(HL), np.array(cfs))
    [...]

Randomness in HASPI

Hi everyone,

Let me thank the organizers for this fantastic and interesting challenge!

When I check the scripts for calculating HASPI, I find some randomness in eb and ebm. As a result, the score for the same signals fluctuates slightly across runs. Although the fluctuation is very small, it would be good to fix the seed.

Perhaps I have misunderstood something; please check whether the fluctuation occurs in your environment. Thank you!
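
As a stop-gap, a minimal sketch of a wrapper that seeds NumPy's global random state before each evaluation; this assumes the fluctuation comes from NumPy's RNG, which is an assumption rather than a confirmed diagnosis:

import numpy as np

from clarity.evaluator.haspi import haspi


def haspi_v2_reproducible(reference, processed, sample_rate, audiogram, seed=0):
    """Evaluate HASPI with a fixed seed so repeated runs give identical scores."""
    np.random.seed(seed)  # assumption: the internal noise uses numpy's global RNG
    score, _ = haspi.haspi_v2(reference, sample_rate, processed, sample_rate, audiogram)
    return score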

Unit tests for predictor module

Develop unit tests for the predictor module (see #80 for an overview).

Specific modules within predictor are...

  • predictor/torch_msbg.py
  • predictor/torch_stoi.py

Questions on latency, use of external data and ch0

Hi, thanks so much for organizing the challenge and providing detailed documentation.
I am very interested in it and have a few questions below:

1) There is a 5 ms latency limitation for the enhancement model. Will the frame length be counted in the latency? I read some papers from CEC1; one team (paper) used a 32 ms window. I am wondering if this long window size (beyond 5 ms) is allowed in CEC2?

2) Regarding the use of external data: I plan to use external noisy data to train a more robust denoising module. If I do this, it seems I have to submit two systems, one using and one not using external data. If I plan to submit entries for both HASPI and the listening tests, does this mean I need to submit four entries (2 tests x 2 data uses) in total?

3) Just one thing to confirm: was the ch0 (eardrum) signal also included in the release of the CEC1 dataset?
I found that ch0 was included in the CEC1 data (as stated in the documentation), but it seems many submissions only used the 6 channels (the ch1~3 pairs) and no team used ch0, so I am just curious about it.

Again, thank you so much for your work and time!

Move from Creative Commons Attribution to MIT License

The license applied on GitHub (Creative Commons Attribution Share Alike 4.0 International) is out of sync with that specified in the Python configuration file setup.py, which lists MIT.

The latter is a more open and permissive license, which we will apply to the project.

Update LICENSE to MIT.

Code of conduct

The repository requires a code of conduct for contributors etc.

Automate versioning and releasing to PyPI via GH Actions

As we move to tagged versions and releases to PyPI we need to automate the flow of versioning the package to avoid the work of updating setup.cfg and clarity/__init__.py.

The versioneer package appears to offer a solution to this as it automates aligning the version with GitHub tags/releases.

After the initial 0.1.0 release has been made manually the process should shift to using versioneer to handle versions.
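
For illustration, a sketch of the setup.cfg section versioneer typically needs; the file paths and tag prefix here are assumptions about how this repository would be configured:

[versioneer]
VCS = git
style = pep440
versionfile_source = clarity/_version.py
versionfile_build = clarity/_version.py
tag_prefix = v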

Include example Notebooks in repository

The last Clarity Challenge included links to two Jupyter notebooks hosted on Google Colab (see Tutorials).

It would be desirable to include these in the code base here so that they, and other examples/tutorials, are version controlled too.

Unit tests for data module

Develop unit tests for the data module (see #80 for an overview).

Specific modules within data are...

  • data/demo_data.py
  • data/HOA_tools_cec2.py
  • data/scene_builder_cec1.py
  • data/scene_render_cec1.py
  • data/scene_render_cec2.py
  • data/utils.py #88

Unit tests for engine module

Develop unit tests for the engine module (see #80 for an overview).

Specific modules within engine are...

  • engine/losses.py
  • engine/system.py

Sample delay between left and right channels does not seem to match the metadata

Hi everyone,

first, let me thank you for organizing this great challenge! I am excited to participate.

I am looking at the data right now and, in particular, I am checking the delay (in terms of samples) between the left and right channels at each mic. This is not an easy task to do in Audacity, and I may be wrong.
But it seems to me that the delay for e.g. S06001 in the dev set is way off from what it should be according to the metadata.
I count around 19-20 samples of delay when looking at S06001_target_anechoic_CH1.wav.

According to the metadata the head rotation should be:

"listener": {
      "rotation": [
        {
          "sample": 90952.7142,
          "angle": -125.8235
        },
        {
          "sample": 100246.7142,
          "angle": -112.0054
        }

And so I would expect the delay to be much smaller. I am probably doing something wrong, but just in case, it might be useful if you could double check.

Also, one thing I expected is that for the anechoic response the two channels should be equal except for a small delay and a gain factor (including inverted polarity), but this is not the case. Is it because the scenes are generated via higher-order ambisonics RIRs and the actual array is then approximated from such HOA RIRs?
EDIT: I think I've figured it out; it should be the head-related impulse response.
This makes it difficult to check the delay manually in Audacity, so it is very possible I am over-estimating it.

Question about the CEC1 baseline system

When I run enhance.py and evaluate.py in the CEC1 baseline folder, I get different results compared with the sii.csv in the repo. Do you have any advice on this problem? Thanks for your help!

Code generating NumbaPendingDeprecationWarning

Numba code is generating a warning about use of a deprecated type

Encountered the use of a type that is scheduled for deprecation: type 'reflected list'

This needs fixing to prevent errors arising when newer versions of numba are released.

Information on how to fix this can be found here.
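
For reference, the usual fix for this warning is to pass numba's typed list into jit-compiled functions instead of a plain Python list. A minimal, self-contained sketch (not the actual clarity code):

from numba import njit
from numba.typed import List


@njit
def total(values):
    result = 0.0
    for value in values:
        result += value
    return result


values = List()  # typed list avoids the "reflected list" deprecation warning
for value in [1.0, 2.0, 3.0]:
    values.append(value)

print(total(values))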

Pytest-cov missing many source files [BUG]

Describe the bug
The pytest-cov coverage report only considers source files that are imported in existing tests.

To Reproduce
run pytest --cov=clarity

Expected behavior
The report should list many source files with 0% coverage, given that we have a lot of untested code.

Fixable with a simple change to the coverage config in setup.cfg.
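
A sketch of the kind of setup.cfg change meant here, telling coverage.py which package to measure rather than only the files imported by tests (the exact options are an assumption):

[coverage:run]
source = clarity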

[BUG] test_torch_msbg_stoi fails due to difference of 0.001

The regression test fails when running tests/regression/test_predictors.py::test_torch_msbg_stoi,
with a very small difference that may result from randomness:

=================================== FAILURES ===================================
_____________________________ test_torch_msbg_stoi _____________________________

regression test output differences for tests/regression/test_predictors.py::test_torch_msbg_stoi:

>   --- current
>   +++ tobe
>   @@ -1,2 +1,2 @@
>   -Torch MSBG STOILoss -0.46198, ESTOILoss -0.32999
>   +Torch MSBG STOILoss -0.46198, ESTOILoss -0.33000

This results in PRs failing some tests.
The problem can be solved by rerunning the test manually once or twice.

Question about amplification model in e009_sheffield

I notice that the amplification model (AudiometricFIR) doesn't take the audiogram as an input, so it seems that I have to train an individual amplification model for each listener, which seems odd to me. Could you give me some suggestions on how to generalize the amplification model, or do I have to train 100 individual models for the 100 listeners in the CEC1 challenge? Thank you!

Tag a first release and release to PyPI

Having Clarity available on PyPI will be useful and ease installation for participants.

This requires the following actions...

  • Establishing a generic email address for registering on PyPI; this can be done at the University of Sheffield.
  • Registering on PyPI and the test PyPI.
  • Building the package for release using Twine (see Packaging projects).
  • Tagging a release on GitHub and testing on the PyPI test server before releasing to PyPI proper.
  • Establishing a GitHub Action for automating releases when tags are applied (see the example here, although these are snippets that fit into a larger configuration file).

Increased regression test code coverage

The current regression tests only cover the CEC2 baseline, i.e. signal mixing, NALR and HASPI. We need a more thorough set of tests covering the CEC1, CPC1 and CEC2 code so that later refactoring can be performed safely.

Unpin matplotlib in dependencies

Describe the bug

The pinned version of matplotlib in the dependencies messes up the Colab environment.

There is no good reason to be pinning the version.

Question about target_anechoic and target_CH0

Thanks for your fantastic work, and I have a question about the dataset. In my understanding, the purpose of this challenge is to recover the target_anechoic audio from mixed_CH1, mixed_CH2 and mixed_CH3, which are the signals received by the front, mid and rear microphones. My question is: what is the purpose of mixed_CH0 and target_CH0, which are close to the eardrum and differ substantially from the other channels?

Include Cadenza in front-page README.md

As the Cadenza challenge is upcoming, it would be appropriate to include details on the front page as well as the logo...

A first draft might be...

Machine learning challenges for hearing aid processing.

Clarity Challenge Cadenza Challenge

We are organising a series of machine learning challenges to enhance hearing-aid signal processing and to better predict how people perceive speech-in-noise (Clarity) and speech-in-music (Cadenza). For further details of the Clarity Project visit the Clarity project website, and for details of our latest Clarity challenges visit our challenge documentation site. You can contact the Clarity Team by email at [email protected]. For further details of the Cadenza Project visit the Cadenza project website, and to find out about the latest Cadenza challenges join the Cadenza Challenge Group.

In this repository, you will find code to support all Clarity and Cadenza Challenges, including baselines, toolkits, and systems from participants. We encourage you to make your system/model open source and contribute to this repository.

The 2nd Clarity Enhancement Challenge (CEC2) has launched! Take part🔥🔥🔥

Installation

PyPI

Clarity is available on the Python Package Index (PyPI). To install, create and/or
activate a virtual environment and then use pip to install.

conda create --name clarity python=3.8
conda activate clarity

pip install pyclarity

GitHub Cloning

# First clone the repo
git clone https://github.com/claritychallenge/clarity.git
cd clarity

# Second create & activate environment with conda, see https://docs.conda.io/projects/conda/en/latest/user-guide/install/index.html
conda create --name clarity python=3.8
conda activate clarity

# Last install with pip
pip install -e .

GitHub pip install

Alternatively, pip can install packages directly from GitHub sources. The following will install the current
main branch.

pip install -e git+https://github.com/claritychallenge/clarity.git@main

Challenges

Current challenge(s)

Upcoming challenges

Previous challenges

Available tools

We also provide a number of tools in this repository:

In addition, differentiable approximations of some tools are provided:

Open-source systems

Build documentation for clarity

Currently the Clarity documentation focuses on the challenge itself.

It is desirable to have the code better documented, including the following...

  • Installation instructions - including how to install from PyPI (when #13 is completed) and latest GitHub branch.
  • API documentation - code is well documented with docstrings and these can be used to auto-generate API documentation using Sphinx.
  • Include links to Tutorials - #15 will pull the Jupyter Notebooks into the repository, but they are also hosted/deployed on Google Colab and a section should be included linking out to these.
  • Existing Software Resources - A section detailing software and research tools for hearing aid research.

The documentation should use the GitHub Action - Sphinx Docs to GitHub Pages to build and deploy the documentation automatically.

Please add any other areas of documentation to this issue.
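
As a starting point, a minimal sketch of the Sphinx configuration that would auto-generate API documentation from the existing docstrings; the project metadata and theme here are assumptions:

# docs/conf.py (sketch)
project = "pyclarity"

extensions = [
    "sphinx.ext.autodoc",   # pull API documentation from docstrings
    "sphinx.ext.napoleon",  # understand Google/NumPy-style docstrings
    "sphinx.ext.viewcode",  # link documentation to highlighted source
]

html_theme = "alabaster"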

Round out Typehints and lint with MyPy

Typehints are useful in Python, despite it being a dynamically typed language.

Much of the code has typehints in place for parameters and some for the return values but it is currently incomplete. This should be remedied and the code linted using MyPy which is a static type checker.

Currently on the most complete branch/commit ( #7c0a505 ) there are 48 errors across 22 files (out of 42 source files), although most pertain to the imported modules and so can be handled as described in Running mypy and managing imports...

 ❱ mypy clarity                                                                                                  
clarity/data/demo_data.py:4: error: Skipping analyzing "gdown": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/msbg/smearing.py:156: error: Incompatible types in assignment (expression has type "None", variable has type "ndarray[Any, Any]")
clarity/evaluator/msbg/smearing.py:157: error: Incompatible types in assignment (expression has type "None", variable has type "ndarray[Any, Any]")
clarity/evaluator/msbg/smearing.py:158: error: Incompatible types in assignment (expression has type "None", variable has type "ndarray[Any, Any]")
clarity/evaluator/msbg/smearing.py:159: error: Incompatible types in assignment (expression has type "None", variable has type "ndarray[Any, Any]")
clarity/evaluator/msbg/msbg_utils.py:7: error: Skipping analyzing "scipy": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/msbg/msbg_utils.py:8: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/msbg/msbg_utils.py:9: error: Skipping analyzing "soundfile": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/mbstoi/mbstoi_utils.py:4: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/haspi/ebm.py:4: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/haspi/eb.py:2: error: Skipping analyzing "numba": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/haspi/eb.py:3: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/enhancer/nalr.py:4: error: Skipping analyzing "scipy": module is installed, but missing library stubs or py.typed marker
clarity/enhancer/nalr.py:5: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/enhancer/gha/gainrule_camfit.py:5: error: Skipping analyzing "scipy.interpolate": module is installed, but missing library stubs or py.typed marker
clarity/data/utils.py:4: error: Skipping analyzing "scipy": module is installed, but missing library stubs or py.typed marker
clarity/data/utils.py:5: error: Skipping analyzing "scipy.io": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_builder_cec2.py:11: error: Skipping analyzing "tqdm": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_builder_cec2.py:18: error: Module has no attribute "c_make_encoder"
clarity/data/scene_builder_cec2.py:27: error: Module has no attribute "float"
clarity/data/HOA_tools_cec2.py:5: error: Skipping analyzing "numba": module is installed, but missing library stubs or py.typed marker
clarity/data/HOA_tools_cec2.py:5: note: See https://mypy.readthedocs.io/en/stable/running_mypy.html#missing-imports
clarity/data/HOA_tools_cec2.py:6: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/data/HOA_tools_cec2.py:7: error: Skipping analyzing "scipy.spatial.transform": module is installed, but missing library stubs or py.typed marker
clarity/data/HOA_tools_cec2.py:8: error: Skipping analyzing "scipy.special": module is installed, but missing library stubs or py.typed marker
clarity/predictor/torch_stoi.py:9: error: Skipping analyzing "torchaudio": module is installed, but missing library stubs or py.typed marker
clarity/predictor/torch_stoi.py:10: error: Skipping analyzing "pystoi.stoi": module is installed, but missing library stubs or py.typed marker
clarity/predictor/torch_stoi.py:11: error: Skipping analyzing "pystoi.utils": module is installed, but missing library stubs or py.typed marker
clarity/predictor/torch_stoi.py:179: error: Item "None" of "Optional[Tensor]" has no attribute "mean"
clarity/predictor/torch_msbg.py:11: error: Skipping analyzing "torchaudio": module is installed, but missing library stubs or py.typed marker
clarity/predictor/torch_msbg.py:12: error: Skipping analyzing "scipy.fftpack": module is installed, but missing library stubs or py.typed marker
clarity/predictor/torch_msbg.py:13: error: Skipping analyzing "scipy.interpolate": module is installed, but missing library stubs or py.typed marker
clarity/predictor/torch_msbg.py:14: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/msbg/cochlea.py:4: error: Skipping analyzing "scipy": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/mbstoi/mbstoi.py:5: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/enhancer/gha/gha_utils.py:6: error: Skipping analyzing "scipy.interpolate": module is installed, but missing library stubs or py.typed marker
clarity/engine/system.py:5: error: Cannot find implementation or library stub for module named "pytorch_lightning"
clarity/dataset/cec1_dataset.py:4: error: Skipping analyzing "librosa": module is installed, but missing library stubs or py.typed marker
clarity/dataset/cec1_dataset.py:7: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/dataset/cec1_dataset.py:8: error: Skipping analyzing "soundfile": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:7: error: Skipping analyzing "librosa": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:12: error: Skipping analyzing "scipy.io": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:13: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:14: error: Skipping analyzing "tqdm": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec1.py:7: error: Skipping analyzing "soundfile": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec1.py:8: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/msbg/msbg.py:5: error: Skipping analyzing "scipy": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/msbg/msbg.py:6: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/enhancer/gha/gha_interface.py:8: error: Skipping analyzing "soundfile": module is installed, but missing library stubs or py.typed marker
Found 48 errors in 22 files (checked 42 source files)

We may wish to apply the same standards to tests/*...

 ❱ mypy tests  
clarity/evaluator/haspi/ebm.py:4: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/haspi/eb.py:2: error: Skipping analyzing "numba": module is installed, but missing library stubs or py.typed marker
clarity/evaluator/haspi/eb.py:3: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/data/utils.py:4: error: Skipping analyzing "scipy": module is installed, but missing library stubs or py.typed marker
clarity/data/utils.py:5: error: Skipping analyzing "scipy.io": module is installed, but missing library stubs or py.typed marker
clarity/data/HOA_tools_cec2.py:5: error: Skipping analyzing "numba": module is installed, but missing library stubs or py.typed marker
clarity/data/HOA_tools_cec2.py:6: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/data/HOA_tools_cec2.py:7: error: Skipping analyzing "scipy.spatial.transform": module is installed, but missing library stubs or py.typed marker
clarity/data/HOA_tools_cec2.py:8: error: Skipping analyzing "scipy.special": module is installed, but missing library stubs or py.typed marker
clarity/enhancer/nalr.py:4: error: Skipping analyzing "scipy": module is installed, but missing library stubs or py.typed marker
clarity/enhancer/nalr.py:5: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:7: error: Skipping analyzing "librosa": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:12: error: Skipping analyzing "scipy.io": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:13: error: Skipping analyzing "scipy.signal": module is installed, but missing library stubs or py.typed marker
clarity/data/scene_renderer_cec2.py:14: error: Skipping analyzing "tqdm": module is installed, but missing library stubs or py.typed marker
tests/test_full_pipeline.py:9: error: Skipping analyzing "scipy.io": module is installed, but missing library stubs or py.typed marker
tests/test_full_pipeline.py:9: note: See https://mypy.readthedocs.io/en/stable/running_mypy.html#missing-imports
Found 16 errors in 7 files (checked 4 source files)

Once these have been addressed we can enable the mypy automated static type checking/linting available under pre-commit.
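
For illustration, a minimal sketch of the typical fix for the "Incompatible types in assignment" errors above: declare the variable as Optional so that None is an allowed value (the names here are illustrative, not the actual smearing.py code):

from typing import Optional

import numpy as np

smear_filter: Optional[np.ndarray] = None  # explicitly allow None before the array is assigned

if smear_filter is None:
    smear_filter = np.zeros(512)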

Unit tests for evaluator module

Develop unit tests for the evaluator module (see #80 for an overview).

Specific modules within evaluator are...

  • evaluator/haspi
  • evaluator/haspi/eb.py
  • evaluator/haspi/ebm.py
  • evaluator/haspi/ip.py
  • evaluator/haspi/haspi.py
  • evaluator/hasqi/hasqi.py
  • evaluator/haaqi/haaqi.py
  • evaluator/mbstoi
  • evaluator/mbstoi/mbstoi.py
  • evaluator/mbstoi/mbstoi_utils.py
  • evaluator/msbg
  • evaluator/msbg/audiogram.py (no change required)
  • evaluator/msbg/cochlea.py
  • evaluator/msbg/msbg.py
  • evaluator/msbg/msbg_utils.py
  • evaluator/msbg/smearing.py

Hydra version 1.2 has breaking changes

The code was written for Hydra 1.1. The newer Hydra 1.2 has introduced some breaking changes in the way that working directories operate, which will require changes to the config code to fix. We suggest for now that we pin the version to 1.1 and upgrade to Hydra 1.2 in the next release.

MSBG code requires refactoring

MSBG code (i.e. clarity/evaluator/msbg) is being worked on:

  • improving the pylint score, adding missing docstrings, etc.
  • adding typehints for more robust type checking
  • new regression tests

[FEATURE] Integrate pylint into development pipeline

Is your feature request related to a problem? Please describe.

No problem, but as part of improving the code base we can incorporate pylint into the Continuous Integration/Development (CI/CD) pipeline as part of lint.yml via pre-commit.

Describe the solution you'd like

Code should be linted with pylint prior to commits (using pre-commit).

However, first all code needs linting (see separate issue #).

Unit tests for enhancer module

Develop unit tests for the enhancer module (see #80 for an overview).

Specific modules within enhancer are...

  • enhancer/compressor.py
  • enhancer/nalr.py
  • enhancer/dnn
  • enhancer/dnn/mc_conv_tasnet.py
  • enhancer/dsp
  • enhancer/dsp/filter.py
  • enhancer/gha
  • enhancer/gha/audiogram.py
  • enhancer/gha/gainrule_camfit.py
  • enhancer/gha/gha_interface.py
  • enhancer/gha/gha_utils.py

Create issue templates

Having issue templates for participants to use when reporting bugs or feature requests will help obtain the information needed up front to resolve the problem or implement the feature.

Possible templates include...

  • Bug reporting - when the clarity software breaks we need to understand why and under what circumstances so we can investigate and resolve it.
  • Feature request - allowing users to request new features will be invaluable to growing the software and increasing its functionality.
  • Challenge problems - somewhere for people to report problems they are having with the challenge itself.

There may be more; please add them if you think there are other scenarios that would be useful.

A repository of templates for issue reporting can be found at stevemao/github-issue-templates: A collection of GitHub issue and pull request templates.

Pre-commit hook is using a very outdated version of Black

Pre-commit is using Black 19.10b0 from 2019 (actually a beta version). The latest version is 22.6.0 and a lot of issues have been fixed since 19.10b0, particularly with handling parentheses and docstrings.

Moving to the latest version will lead to the reformatting of 25 files.

Move from setup.py to setup.cfg

The latest iteration of Python's setuptools deprecates the use of setup.py except in some specific circumstances (see note).

Instead, it uses a combination of pyproject.toml and setup.cfg to configure the package, its dependencies and usage.

The settings in setup.py should be migrated to pyproject.toml and setup.cfg.
