Giter Club home page Giter Club logo

bdsr's Introduction

Audio Bit-Depth Super Resolution

Our project focuses on the goal of adapting WaveNet, an audio prediction CNN architecture, to superresolve 8-bit audio clips into 16-bit audio clips, trying to restore lossed dynamic range and as a result cleaning compression artifacts.

We evaluate several modifications to WaveNet, including:

  • Discriminative rather than autoregressive prediction
  • Non-causal dilations - both input samples from past and future are available during prediction
  • Delta prediction - assuming 8-bit audio mostly preserves the 16-bit audio data, we aim to only predict the delta between the two waveforms
  • Real-valued prediction - since the amplitude space is inherently continuous (discretized during compression), a real-valued number space is a more natural model than a categorical output passed through softmax.

Improvements are subtle but include audible muffling of the background noise, though we terminated training early due to resource constraints and observed that loss was still decreasing approximately linearly at time of evaluation. We believe there is further improvements to be had with our architecture given sufficient training.

Final Write up

Audio Bit Depth Super Resolution Paper

Presentation

Project Presentation

Generated Samples

Generated Samples

Source code

Source Code

Maintained by Taylor Lundy, Thomas Liu, and William Qi.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.