Giter Club home page Giter Club logo

practical-data-privacy's Introduction

Practical Data Privacy

Notebooks to accompany the book Practical Data Privacy: Enhancing Privacy and Security in Data, O'Reilly, Spring 2023.

You can read the book on Safari now. Pre-order is also available.

These notebooks can also be used separately from the book, as a workhop or self-study to learn about practical data privacy methods. The audience is assumed to be a data scientist or data folks with an understanding of probability, math and data.

Motivation

The goal of the notebooks and the book is to help data scientists and other technologists learn about practical data privacy. I hope you can use these notebooks and the book to not only learn about data privacy, but also to guide implementation of data privacy in your work!

These notebooks are not meant to replace exploring software or building sustainable, production-ready code; but instead are meant to help guide your learning and thinking around the topics. Please always try to use and support open-source libraries based on the learnings you get from these notebooks / the book.

Installation

Please utilize the included requirements.txt to install your requirements using pip (you can also do so in conda. The notebooks have only been tested with Python 3. ๐Ÿ™Œ๐Ÿป

Unfortunately, some of these libraries have conflicting requirements, so you may need to adapt your libraries and install to use later notebooks after you install the earlier tools. You will also need to install several Rust libraries with Python bindings, which you will need to follow the direct installation information from those software packages.

I recommend using virtual environments or conda environments.

To run parts of these notebooks you will also need a running version of Apache Spark. Check the latest documentation to set up for your operating system.

Notebooks

The notebooks follow the order that the ideas are introduced in the book. There are some additional notebooks added for those interested. Please file a pull request if you have an update to a notebook. I will also watch issues to ensure that the notebooks are usable and understandable. Feedback is very welcome!

Recommended Reading and Challenges

Several notebooks have a recommended reading and additional challenges section. I may update this README with additional reading of interest on this topic. I also recommend that you try out at least one or two challenges, to expand the knowledge you learned and practice using this for new problems.

Reader Contributions

I'm hoping this book and repository has inspired you to try out new libraries and tools related to data privacy. To encourage yourself and others to share their work, I have a folder here reader-contributions. If you try something new out, please consider contributing your notebook! To make it easier for others, please ensure you:

  • Write a brief introduction to the concept or library shown in the notebook, including any links for folks to learn more. What will they learn? What does it show?
  • Installation requirements
  • Your name (if you'd like recognition) and any details should people want to reach out (optional!)
  • Guide other readers through the notebook with occasional titles, markdown cells to take someone through the notebook when you cannot be there.
  • Recommended Reading or Challenges

Feel free to send over Pull Requests once you've checked the above!

Thank you for your work and contribution, and helping others learn more about privacy!

Questions?

Questions about getting set up or the content covered in the notebooks or book? Feel free to reach out via email at: katharine (at) kjamistan (dot) com

Acknowledgements and Contributions

These notebooks wouldn't have been possible without the following contributors and reviewers. Thank you!

Update Log

20.02.2023: Main notebooks working and added reader-contributions folder.

practical-data-privacy's People

Contributors

kjam avatar mitchelllisle avatar safetydave avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.