pytorch-ganbert's Introduction

Pytorch-GANBERT

A PyTorch implementation of the GAN-BERT paper.

"bert.py" and "qc-fine_bert.py" are essentially the same script applied to different datasets; I was simply too lazy to merge them. "labeled_and_unlabeled.tsv" combines the labeled and unlabeled data from the original TensorFlow implementation of the paper into a single file.

Important Note

Running this model once with a single seed may not be enough. In my experiments, only 2 out of 10 runs yielded reasonable results. I believe this is caused by differences in optimizer scheduling between this implementation and the original TensorFlow implementation.
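
Since results vary this much between runs, it can help to repeat training over several seeds and look at the whole set of scores rather than a single one. The following is a minimal sketch of that bookkeeping, not code from this repository: `run_with_seeds`, `summarize`, and the `train_and_evaluate` callback are hypothetical names, and the callback is assumed to train the model for a given seed and return one score.

```python
import random
import statistics


def run_with_seeds(train_and_evaluate, seeds):
    """Call train_and_evaluate(seed) once per seed and collect the scores.

    train_and_evaluate is a hypothetical callback supplied by the caller;
    it is assumed to return a single numeric score such as dev accuracy.
    """
    scores = {}
    for seed in seeds:
        # Seed Python's RNG here. A real training function would also need
        # to seed NumPy and PyTorch (for example torch.manual_seed(seed)).
        random.seed(seed)
        scores[seed] = train_and_evaluate(seed)
    return scores


def summarize(scores, threshold):
    """Summarize per-seed scores; threshold marks a 'reasonable' run."""
    values = list(scores.values())
    return {
        "mean": statistics.mean(values),
        "stdev": statistics.stdev(values) if len(values) > 1 else 0.0,
        "best_seed": max(scores, key=scores.get),
        "runs_above_threshold": sum(v >= threshold for v in values),
    }
```

Reporting the number of runs above a chosen threshold alongside the mean makes it visible when only a few seeds succeed, as in the 2 out of 10 case described above.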

pytorch-ganbert's Issues

Question

Hiii Osman,

Thank you for your clear implementation, it's really helpful!
My question: is there any way to get the generator's output as text?
Is it possible to see what exactly it is generating?

Discriminator predicts only one class for all samples

Although the loss decreases during training, at test time the model predicts only one class for the entire test set.

The only exception occurred when I was training on my own data (a balanced binary classification dataset). After oscillating between the two classes across epochs (always scoring 50% on the development set), the model finally predicted different classes for different samples in epoch 20 and reached a score of around 60% on the development set. I could not reproduce this behavior again.
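
A quick way to confirm this kind of collapse is to look at the distribution of predicted labels instead of accuracy alone, since on balanced binary data a single-class predictor scores exactly 50%. The sketch below is illustrative only; `prediction_distribution` and `is_collapsed` are hypothetical helper names and do not exist in this repository.

```python
from collections import Counter


def prediction_distribution(predictions):
    """Return the fraction of predictions assigned to each label."""
    if not predictions:
        raise ValueError("predictions must not be empty")
    counts = Counter(predictions)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def is_collapsed(predictions, tolerance=0.0):
    """Return True if (almost) every prediction falls into one class.

    tolerance is the fraction of predictions allowed outside the
    dominant class before the output no longer counts as collapsed.
    """
    dist = prediction_distribution(predictions)
    return max(dist.values()) >= 1.0 - tolerance
```

Running such a check on the development set after every epoch shows whether the model is switching between single-class outputs or actually separating the samples.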

some questions for code

Hi, thanks for the excellent PyTorch implementation of GAN-BERT! I have three questions about the code:

1. The generator in GAN-BERT is an MLP. In model.py it transforms the noise dimension as 100 -> 512 -> 512. Why not just 100 -> 512?
2. Why do we need to import the parameters of the TensorFlow version via convert_to_tf_param_name and get_weights_from_tf in model.py?
3. In a typical GAN, the discriminator is often trained for several steps for each generator step. In ganbert.py, however, it seems the discriminator is trained once and then the generator once. Could this cause the problem mentioned in issue #1?

Thanks!
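
To make the third question concrete, here is a minimal sketch of the two update schedules being compared. It is not taken from ganbert.py: `alternate_updates`, `d_step`, and `g_step` are hypothetical names, and the two callbacks are assumed to perform one optimizer step for the discriminator and the generator respectively.

```python
def alternate_updates(batches, d_step, g_step, d_steps_per_g_step=1):
    """Update the discriminator on every batch and the generator once
    every d_steps_per_g_step batches.

    d_step and g_step are hypothetical callbacks that each run one
    optimizer step on the given batch.
    """
    if d_steps_per_g_step < 1:
        raise ValueError("d_steps_per_g_step must be at least 1")
    for i, batch in enumerate(batches, start=1):
        d_step(batch)
        # The generator is updated only after the discriminator has
        # taken d_steps_per_g_step steps since the last generator update.
        if i % d_steps_per_g_step == 0:
            g_step(batch)
```

With `d_steps_per_g_step=1` this reduces to the one-to-one alternation described in the question; larger values give the discriminator several steps per generator step.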

osmanmutlu / pytorch-ganbert
