Giter Club home page Giter Club logo

excelformer's People

Contributors

jyansir avatar whatashot avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

excelformer's Issues

Question about the ExcelFormer code to default training

Hi, I was studying the provided code in run_default_config_excel.py, and I realized that the categorical features are encoded using CatBoostEncoder. Then, if you apply this encoding process, you set that X_cat = 0, because they are transformed to numerical features. So I have these questions:

  1. If I dont use this encoding process, ExcelFormer architecture will deal with these categorical features?
  2. 'cardinalities' refers to the number of categorical features that I have in my dataset?
  3. Do I need to set WHICH are the categorical features that I have, if I dont apply CatBoostEncoder?

I appreciate your attention and you've done a great work!

Data to run the code

I am so glad to find your work, and finally see a nn that beats tree models.

Would it possible for you to upload the "/data2/yanjianhuan/research/data/tabular/data" to cloud? So it would be easier for me to run the code. I would like to run your code on your data first to do a sanity check first before trying on my data.

The paper results are not reproducible!

Hi<
Congrats on your great research paper,
I just ran the code for CA and in the best scenario, I got ~-0.445 which is far from -0.433 reported in the draft! Have you considered any specific constraints in your model during training? I used the default params recommended in the repo.
BWs,

code?

Intrigued by reading the paper. Is there any code available?

Support scikit-learn compatible

Hi, great work!

Is there any chance for ExcelFormer to support scikit-learn fit API, so we can use something like ExcelFormerClassifier.fit().

In your another repo, TabCaps, model can be fitted by TabCapsClassifier.fit(), which is good.

It can help compare performance of multiple tabular models if they are all in one format, thanks!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.