Giter Club home page Giter Club logo

microframe's Introduction

MicroFrame

MicroFrame is a lightweight educational data manipulation library designed to provide a pandas-like interface for students learning to work with real-world data. It is optimized for toy datasets and aims to introduce users to data analysis concepts without the overhead of pandas.

Features

  • Efficient CSV Reading: Quickly and easily read CSV files to create MicroFrame objects, optimized for educational purposes and smaller datasets.
  • Data Type Handling: Advanced data type inference and explicit type setting offer both convenience and control over the structure of your data.
  • Flexible MicroFrame Objects: Utilize MicroFrame objects that mimic pandas DataFrame for intuitive data manipulation and analysis.
  • Clear Tabular Display: Use MicroFrame’s printing capabilities to generate well-formatted tabular representations of your data, making it easier to interpret and present.
  • Robust Data Manipulation: Perform a variety of data manipulation tasks with methods similar to pandas, such as filtering, column dtype modification and summarizing data.
  • Advanced Indexing: Access data efficiently with advanced indexing options, using the iloc method similar to pandas iloc method.
  • Data Conversion Tools: Seamlessly convert your MicroFrame objects to other formats, including NumPy arrays, with the to_numpy method for further numerical computation.
  • User-Friendly API: Experience a user-friendly API that mirrors pandas to facilitate the transition from educational projects to real-world data analysis.

Installation

Install MicroFrame using pip:

pip install microframe

Quick Start

Import MicroFrame and load a dataset:

import microframe as mf

# Read a CSV file into a MicroFrame object
mframe = mf.read_csv("path_to_your_csv_file.csv")

# Alternatively, create a MicroFrame object manually
data = [[1, "a"], [2, "b"], [3, "c"]]
dtypes = ["int32", "U1"]
columns = ["num", "char"]

mframe = mf.MicroFrame(data, dtypes, columns)
mframe.head() # Display first 5 rows
# Extract data as numpy array
mframe_slice = mframe.iloc[:, 0] # returns all rows, but just col 0
numpy_array = mframe_slice.to_numpy() # returns mframe_slice as a numpy array

Examples

Basic Usage

MicroFrame simplifies the process of data analysis. Here are some basic operations:

Reading from a CSV

import microframe as mf

mframe = mf.read_csv("path_to_your_csv_file.csv")

Creating a MicroFrame Object

import microframe as mf
data = [[1, "a"], [2, "b"], [3, "c"]]
dtypes = ["int32", "U1"]
columns = ["num", "char"]

mframe = mf.MicroFrame(data, dtypes, columns)

Data Manipulation

MicroFrame provides several methods to manipulate your data:

Renaming Columns

mframe.rename({"num": "number", "char": "character"})

Changing Data Types

mframe.change_dtypes({"number": "float64", "character": "U10"})

Accessing Column Data with Boolean Indexing

data = [[1, "a"], [2, "b"], [3, "c"]]
dtypes = ["int32", "U1"]
columns = ["num", "char"]

mframe = mf.MicroFrame(data, dtypes, columns)
first_col = mframe["num"] # Access just num column

Accessing Row Data with iloc

first_row = mframe.iloc[0]

Advanced Indexing with iloc

The iloc indexer allows for integer-location based indexing:

Accessing a Single Row

first_row = mframe.iloc[0]

Accessing Multiple Rows

first_two_rows = mframe.iloc[:2]

Accessing a Single Cell

cell_value = mframe.iloc[2, 1]

Setting a Value in a Cell

mframe.iloc[2, 1] = "Test"

Slicing Rows and Columns

subset = mframe.iloc[:2, :2]

Displaying Data

Similar to pandas, you can display parts of your dataset:

Print the First Few Rows

mframe.head(2)

Print the Last Few Rows

mframe.tail(2)

Converting to NumPy Array

For times when you need to work with a NumPy array, MicroFrame provides the to_numpy method:

# Convert the MicroFrame to a 2D NumPy array
numpy_array = mframe.to_numpy()

This method will convert the structured data within the MicroFrame to a regular 2D NumPy array.

Chaining iloc with to_numpy

For scenarios where you need to perform NumPy operations on a subset of your data, you can chain the iloc indexer with the to_numpy method:

# Select the first two rows using iloc and convert them to a NumPy array
numpy_subset = mframe.iloc[:, 1:5].to_numpy()

Contributing

Installing dev dependencies via pip

pip install "microframe[dev]"

Running tests

python -m pytest tests

Documentation

For full documentation, visit our MicroFrame Documentation. Here, you will find detailed information on all the functionalities that MicroFrame offers.

License

MicroFrame is released under the MIT License. Feel free to use it in your projects, and we'd love to hear about what you build!

microframe's People

Contributors

cmsolson75 avatar

Stargazers

1deolson avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.