Comments (10)
The reference to jiant above was to v1.3: https://github.com/nyu-mll/jiant-v1-legacy
The new v2.0 is mostly a wrapper around Transformers, so it drops LSTM support. Start with v1.3.
from glue-baselines.
You'll have a much easier time with jiant than with this repo, but if you need an exact reproduction for some reason, ping w4ngatang again.
from glue-baselines.
Hi!
The results in the paper are test set results (as it says in the caption), and several datasets have non-trivial differences between the dev and test data, so it's possible that you've already reproduced our results exactly.
In any case, though, I'd urge you to use the newer jiant codebase. It's much better documented, and gets strictly better results than the baselines here. We don't have public dev set numbers from that codebase yet, but if you post an issue there, we should be able to assemble some.
https://github.com/jsalt18-sentence-repl/jiant
If you do need to use this codebase, reply here and @W4ngatang should be able to share the exact hyperparameters we used.
from glue-baselines.
Thanks for your enthusiastic reply!
I've submitted it to the GLUE platform, but there are still some gaps in CoLA, QNLI and WNLI.
Model | Average | CoLA | SST-2 | MRPC | STS-B | QQP | MNLI | QNLI | RTE | WNLI
BiLSTM baseline | 63.5 | 24 | 85.8 | 82.1/71.9 | 68.8/67.0 | 59.1/80.2 | 65.8/66 | 71.1 | 46.8 | 63.7
My results | 60.4 | 13.9 | 84.6 | 81.6/73.0 | 68.8/66.7 | 57.2/79.7 | 61.3/61.8 | 63 | 54.2 | 52.7
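To make the per-task gaps easier to see, here is a minimal Python sketch that subtracts the baseline scores from the reproduced ones. The numbers are copied from the table above; the function name `task_gaps` and the choice to average the two metrics of a task before comparing are assumptions made for illustration, not something taken from the GLUE codebase.

```python
# Scores copied from the table above. Tasks that report two metrics keep both.
baseline = {
    "CoLA": [24.0], "SST-2": [85.8], "MRPC": [82.1, 71.9],
    "STS-B": [68.8, 67.0], "QQP": [59.1, 80.2], "MNLI": [65.8, 66.0],
    "QNLI": [71.1], "RTE": [46.8], "WNLI": [63.7],
}
mine = {
    "CoLA": [13.9], "SST-2": [84.6], "MRPC": [81.6, 73.0],
    "STS-B": [68.8, 66.7], "QQP": [57.2, 79.7], "MNLI": [61.3, 61.8],
    "QNLI": [63.0], "RTE": [54.2], "WNLI": [52.7],
}


def task_gaps(reference, reproduced):
    """Return (reproduced - reference) per task.

    Assumption: when a task has two metrics, they are averaged first.
    """
    gaps = {}
    for task, ref_scores in reference.items():
        rep_scores = reproduced[task]
        ref_mean = sum(ref_scores) / len(ref_scores)
        rep_mean = sum(rep_scores) / len(rep_scores)
        gaps[task] = round(rep_mean - ref_mean, 2)
    return gaps


gaps = task_gaps(baseline, mine)

# Print tasks from the largest drop to the largest gain.
for task, gap in sorted(gaps.items(), key=lambda kv: kv[1]):
    print(f"{task:6s} {gap:+.2f}")
```

Sorted this way, WNLI, CoLA and QNLI show the largest drops, which matches the gaps mentioned above, while RTE comes out higher than the baseline.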
So it would be very kind of you to share the hyperparameters that produce the baselines on this codebase.
Besides, I'm willing to switch to jiant, but I'm not sure whether I can reproduce the GLUE baselines with it. Can I obtain the leaderboard results using the final_glue_runs.sh script without modification?
Thanks again!
from glue-baselines.
@W4ngatang - Could you take this one?
If you need to exactly match our baselines, jiant won't do that. This paper publishes numbers from the final_glue_runs script, though: https://openreview.net/pdf?id=Bkl87h09FX
Sam
from glue-baselines.
Hey @xxxxxyt, I've added the exact scripts that I'm running here. Could you try running those?
from glue-baselines.
Hey, after fixing lots of issues, I tried running the code. However, I still get the following error:
Traceback (most recent call last):
  File "src/main.py", line 280, in <module>
    sys.exit(main(sys.argv[1:]))
  File "src/main.py", line 186, in main
    trainer = MultiTaskTrainer.from_params(model, args.run_dir + '/%s/' % task.name,
NameError: name 'MultiTaskTrainer' is not defined
It looks like 'MultiTaskTrainer' is not defined anywhere in the repository. Could you please share the script that defines 'MultiTaskTrainer'? Many thanks! @thxyutong @sleepinyourhat
from glue-baselines.
@Bogerchen hey bro, have you fixed the problem?
from glue-baselines.
I'm running into the same MultiTaskTrainer issue. Did anyone find a fix? Also, @sleepinyourhat, regarding jiant: I tried using it but found no options for running non-Transformer architectures (I want to rerun the LSTM as described in the GloVe paper). Maybe I missed something? I'd appreciate it if you could point me in the right direction :)
from glue-baselines.
Can someone share the MultiTaskTrainer script? I really need it to exactly reproduce the original GLUE benchmark. Thanks.
from glue-baselines.