Comments (2)
@rustrust I agree, we need to do a better job explaining this in the docs.
Tangram determines the task (either regression, binary classification, or multiclass classification) based on the type of the target column. If the target column is a Number
column, the task will be regression. If the target column is an Enum
column with two variants, the task will be binary classification. If the target column is an Enum
column with more than two variants, the task will be multiclass classification.
Then, tangram trains a grid of linear and gradient boosted decision tree models for the selected task. The definition of the default grid for each task is in this file: https://github.com/tangramxyz/tangram/blob/main/crates/core/grid.rs.
Alternatively, you can specify your own grid in a JSON configuration file passed to tangram train
with the --config
flag. You can see an example in the docs at https://www.tangram.xyz/docs/guides/train_with_custom_configuration. The best documentation for the schema of the configuration file is the serde
type definitions in the source: https://github.com/tangramxyz/tangram/blob/main/crates/core/config.rs.
Let me know if this answers your question or if you want more information.
I will leave this issue open until we update the docs with better information on this.
from modelfox.
@rustrust I've added an internals section to the docs that explains the hyperparameter grid, the model and task types, and the feature engineering. Let us know if there is anything else that we can clarify in the docs!
from modelfox.
Related Issues (20)
- Add CLI Command to auto-generate config file HOT 2
- Playground Chart min value is deceptive, use 0 instead
- Long column names overflow training stats table
- Repo overview View to compare all models contained in the repo
- Ctrl-c to cancel training from python
- Early Stopping Options default to present
- Coerce boolean values appropriately to enum values for prediction in language libraries/ cli
- Improve error message for CLI incorrect path to train/test file
- Forgetting the threshold in logPrediction causes bad request
- Allow data frame as input to predict function in python
- Explain what a baseline classifier is on the metrics page.
- Training error when column to predict has more than 100 variants HOT 2
- Thread 'main' panicked at 'called `Option::unwrap()` on a `None` value' HOT 5
- Failed to get `modelfox` as a dependency of package HOT 1
- URL in repo description is broken HOT 2
- Debian package is not installable HOT 1
- Running `modelfox app` gives "error: No such file or directory (os error 2)" HOT 1
- Bag of words - what is the delimiter? HOT 3
- [Ruby] Does not work for M1 Mac OSX
- datasets are not downloadable anymore
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from modelfox.