Comments (4)
Just want to jump in and say I have been thinking about ways to marry Explorer and Axon, specifically being able to pass in a dataframe without having to convert it to another representation, and then getting a dataframe back. But perhaps there's another standard we can unify on within the ecosystem, so that ML libraries like Axon and the scikit-learn equivalent can work with the same representations. That way we can pretty much plug-and-play with different models across libraries without having to worry about how data should be represented, and libraries like Explorer can have some idea of how data might be used later on.
Relevant issues in the ecosystem:
from explorer.
This is a pretty interesting one. I know that a scikit-learn equivalent for the Nx ecosystem is being worked on. I don't think Explorer is the right place for much of that functionality. However, there is a cost to copying data from its Rust representation to Elixir to then carry out these computations. I'll have to give it some thought.
@josevalim do you have an opinion on this? In R and Python you can typically pass a dataframe as the data for regression, and in an ideal situation (e.g. tidymodels) your output is a dataframe too. Forecasting/time series is another area where this kind of thing shines and you'd want to pass in a dataframe directly.
We are exploring something like scikit-learn on top of Nx, so it may be less general purpose than scikit-learn. It may also be worth having some of those ideas on top of Explorer too, maybe in this library or as a separate one. I think we don't need to make a decision for now and we can wait until things develop a bit. And of course, others are free to explore this too!
As I believe https://github.com/elixir-nx/scholar will fill this niche (once we've implemented the Nx.Container protocol), I'm closing this as it won't live here.