Comments (2)
I didnt qet what you would like to achieve. Why is cov so important?
First of all you may with GAN params:
gen_x, gen_y = GANGenerator(bot_filter_quantile=0.001, top_filter_quantile=0.999, is_post_process=True,
adversarial_model_params={
"metrics": "AUC", "max_depth": 2, "max_bin": 100,
"learning_rate": 0.02, "random_state": 42, "n_estimators": 500,
}, pregeneration_frac=2, only_generated_data=False,
gan_params = {"batch_size": 500, "patience": 50, "epochs" : 500,},
gen_x_times=1.2).generate_data_pipe(train_df, target, test_df)
Secondly:
Your generated data with np.random.multivariate_normal doest have same cov as you passed:
what you wanted
[[1. 0.66666667 0.33333333]
[0.66666667 1. 0.66666667]
[0.33333333 0.66666667 1. ]]
you actually generated
[[0.91722526 0.62890772 0.3325362 ]
[0.62890772 1.04560479 0.74625155]
[0.3325362 0.74625155 1.09697827]]
generated from gan - looks prery similar to me
[[ 1.22316084 0.06813614 -0.02172127]
[ 0.06813614 1.29201178 0.22556574]
[-0.02172127 0.22556574 1.25738251]]
from gan-for-tabular-data.
Sorry for the late answer, if you didnt get required answers, please reopen question
from gan-for-tabular-data.
Related Issues (20)
- is it ok for regression type task? HOT 1
- all sample codes not working till epoch end HOT 1
- second args in generate_data_pipe cannot be left None HOT 2
- TypeError: unsupported operand type(s) for +: 'NoneType' and 'NoneType' HOT 1
- training CTGAN stops in the middle (around 24%) HOT 2
- Difference between OriginalGenerator and GANGenerator HOT 1
- Getting this error when trying to install load HOT 2
- check HOT 1
- ContextualVersionConflict: (scikit-learn 1.0.2 (/usr/local/lib/python3.7/dist-packages), Requirement.parse('scikit-learn==0.23.2'), {'tabgan'}) HOT 3
- Dear Author, May I know the ctgan version for the installation? I am getting error. from ctgan import _CTGANSynthesizer ImportError: cannot import name '_CTGANSynthesizer' HOT 4
- pip install scikit-learn version issue HOT 3
- Mistake in Readme HOT 2
- Some issues araised when running Tab-GAN: 1) Manage Categorical Variables. 2) Batch size problem HOT 8
- Reproducibility issue HOT 1
- ValueError: Input X contains NaN although NaN filtered HOT 7
- IntCastingNaNError Despite No NaN values HOT 3
- LGBMClassifier.fit() got an unexpected keyword argument 'early_stopping_rounds' HOT 2
- Dependency issue with ForestDiffusion Generator HOT 3
- TypeError w/ Boolean Data HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gan-for-tabular-data.