

FormatBench

FormatBench is a Python benchmark suite that evaluates data formats for storing tabular and image data.

See also: ASV FormatBench

Usage

python3 main.py (--tabular|--compression|--image) [--webface <path>] [--report]

--tabular - run tabular benchmark suite
--compression - run compression benchmark suite
--image - run image benchmark suite
--webface <path> - run benchmarks with the Webface10M dataset located at <path>
--report - generate a report from the benchmark results
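
The option list above maps naturally onto Python's standard argparse module. The following is only a minimal sketch of how such a command line could be parsed, not the project's actual main.py: the function name build_parser and the help strings are assumptions made for illustration. It treats the three suite flags as a required, mutually exclusive group, which is one reading of the (--tabular|--compression|--image) notation.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented options (not the real main.py)."""
    parser = argparse.ArgumentParser(
        prog="main.py",
        description="Run a FormatBench benchmark suite.",
    )

    # Exactly one suite flag must be given: (--tabular|--compression|--image).
    suite = parser.add_mutually_exclusive_group(required=True)
    suite.add_argument("--tabular", action="store_true",
                       help="run tabular benchmark suite")
    suite.add_argument("--compression", action="store_true",
                       help="run compression benchmark suite")
    suite.add_argument("--image", action="store_true",
                       help="run image benchmark suite")

    # Optional: path to the Webface10M dataset file.
    parser.add_argument("--webface", metavar="<path>", default=None,
                        help="run benchmarks with the Webface10M dataset at <path>")

    # Optional: generate a report from the benchmark results.
    parser.add_argument("--report", action="store_true",
                        help="generate a report from the benchmark results")
    return parser


# Parse an explicit argument list instead of sys.argv, for demonstration only.
args = build_parser().parse_args(["--image", "--report"])
print(args.image, args.report, args.webface)
```

With this layout, passing two suite flags at once (for example --tabular --image) makes argparse exit with a usage error, and omitting all three does the same.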

Examples of usage

Run tabular benchmarks: python3 main.py --tabular

Run image benchmarks and create a report: python3 main.py --image --report

Run compression benchmarks with the Webface10M dataset: python3 main.py --compression --webface ~/synthetic_webface10M.h5

Run tabular benchmarks with the Webface10M dataset and create a report: python3 main.py --tabular --webface ~/synthetic_webface10M.h5 --report

Related publication

TARAGEĽ, Marián. Column-oriented and Image Data Format Benchmarks. Brno, 2024. Bachelor’s thesis. Brno University of Technology, Faculty of Information Technology. Supervisor Ing. Jakub Špaňhel

Acknowledgements

I would like to convey my gratitude to Ing. Jakub Špaňhel for his supervision. I also express my thanks to my consultant Ing. Petr Chmelař. Both of them provided me with support and advice during the work on this thesis. Last but not least, I would like to thank the external submitter, the Innovatrics company, for their professional help.
