Giter Club home page Giter Club logo

plc-challenge's Introduction

INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge

This repository will contain data and example code for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment Challenge.

You can find more information about the challenge and how to enter at https://aka.ms/plc_challenge

If you have any questions, please contact us via e-mail at [email protected]

Dataset

The training and validation dataset has now been released and is available as a tar.gz archive:

http://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/test_train.tar.gz

The blind set is now also available:

http://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/blind.tar.gz

Update (24. March 2022): The reference data for the blind set is now available:

http://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/blind_set_reference.tar.gz

Please make sure to submit your results by the deadline, March 8th 2022 23:59 AoE.

Additional information about the data included can be found in our challenge paper, and information about how to register for the challenge can be found at https://aka.ms/plc_challenge .

A multipart zip file download of the training set is available for people who cannot download it as one big file:

https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.001 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.002 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.003 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.004 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.005 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.006 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.007 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.008 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.009 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.010 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.011 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.012 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.013 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.014 https://plcchallenge2022pub.blob.core.windows.net/plcchallengearchive/split/test_train.zip.015

PLC-MOS

To help with model development, we will provide access to a prototype PLC-MOS neural model API which will provide MOS score estimates for audio files with packet loss concealment applied. For further details on how to get access to this API, refer to https://aka.ms/plc_challenge . You can find an API usage example in PLC-MOS-API-Example.ipynb .

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

plc-challenge's People

Contributors

alt-ellyse avatar dependabot[bot] avatar halcy avatar microsoft-github-operations[bot] avatar microsoftopensource avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

plc-challenge's Issues

PLC MOS API access problem v1

Hi, Is there any certificate (.pem) file as we still getting same error when processing on our cloud platform ?

Hi,

the token was not active yet. It is now active (or will be in a few minutes). Please be aware that the API is, unfortunately, currently somewhat slow. We're working on getting the onnx model released for PLCMOS so you could run it yourself, but this may still take a bit.

Best,
Lorenz

Originally posted by @halcy in #2 (comment)

Issue on blind dataset

I was having issue on evaluating the following challenge dataset with PESQ

CleanShot 2023-02-22 at 17 48 43

I found out that "427818.wav" in both blind set and reference is completely silence.

CleanShot 2023-02-22 at 17 54 49@2x

Is this on purpose?
Or should I just omit this file?

Thank you.

Very limited download speed for training and testing datasets

Hi,
I am trying to download the open-source training and testing challenge data. However the download speed is surprisingly low (less than 1Mbps) although my personal downalod speed is much higher than that (around 800Mbps). Is there a specific reason for that?
Given the size of the tar archive (10GB) it means one must wait somewhat 4 hours for the download to complete, and it seems the server closes the connection with a timeout inferior to that, causing the download to fail.

Any help is welcome, thanks

Error in running Baseline-Inference.ipynb

Hi,
I am trying to run the baseline PLC model using the notebook Baseline-Inference.ipynb and the last cell gives the following error:

---------------------------------------------------------------------------
AxisError                                 Traceback (most recent call last)
Input In [43], in <module>
      9 loss_mask = np.loadtxt(lost_file)
     11 feats, feats_recon = build_features_logspec_plcmask(data, loss_mask)
---> 12 feats = np.array(feats).swapaxes(0, 1)
     13 feats = feats.reshape(feats.shape[0], -1)
     15 feats_recon = np.array(feats_recon).swapaxes(0, 1)

AxisError: axis2: axis 1 is out of bounds for array of dimension 1

Any idea what causes this problem?
Many thanks in advance!

PLC MOS API access problem

Hi,
We got the token but unable to access the PLC-MOS-API-Example.ipynb script. It throws JSONDecodeError: Expecting value: line 1 column 1 (char 0) when running last line script resp.json() in the example. Please help

With regards,
Nitesh

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.