Comments (2)
I'm also getting better results with baselines (missforest and sklearn's MICE) than GAIN. I'm using the default configuration for baselines and using the code/parameters provided for GAIN in this implementation on both Letter and Spam datasets.
from gain.
For paper writing, we explicitly divide the data into train/test and train all the models (including GAIN, MICE, and MissForest) on the train data only. Then, we use the trained data to test on the testing data.
However, in this repository, I think that people usually do imputation before dividing the data for further model developing. Therefore, I do not divide the train/test in this repository.
from gain.
Related Issues (20)
- How to decide Missingness Mechanism HOT 1
- Differences with the paper HOT 1
- Using GAIN in inductive mode HOT 1
- Changing only missing values? and scoring? HOT 1
- Why not both L_G and L_D relevant to V(D,G)? HOT 1
- Could you please provide Requirements.txt file HOT 1
- My dataset is 203454KB, I can't get the dataset after filling, because my dataset is too big? It gives some mistakes. HOT 1
- mixed (categorical and numerical) data HOT 3
- Model for the MNIST dataset HOT 1
- alpha HOT 1
- original data HOT 1
- Hyperparameters training HOT 3
- hyperparameters HOT 3
- RMSE is not stable HOT 1
- RMSE HOT 1
- Hint matrix HOT 1
- Why isn't the loss calculated only with b_i=0 values of the Hints. HOT 2
- No split training and testing sets? HOT 2
- Training Query HOT 1
- about minibatch HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gain.