yuzhimanhua / match Goto Github PK
View Code? Open in Web Editor NEWMATCH: Metadata-Aware Text Classification in A Large Hierarchy (WWW'21)
Home Page: https://arxiv.org/abs/2102.07349
License: Apache License 2.0
MATCH: Metadata-Aware Text Classification in A Large Hierarchy (WWW'21)
Home Page: https://arxiv.org/abs/2102.07349
License: Apache License 2.0
I am seeing the following error
RuntimeError: Expected object of scalar type Long but got scalar type Int for sequence element 1 in sequence argument at position #1 'tensors'
Can you please tell me which version you were using for
cudatoolkit and cudnn
Thanks
abhishek
Hello,
I am reaching out to you regarding the results of your paper, where you evaluated your model's performance on both the MAG and MeSH datasets.
I have tried to reproduce your results on both datasets using your code and the same hyperparameters as mentioned in the paper, and I was able to obtain results similar to the ones reported in the paper for the MAG dataset. However, when I ran the same experiments on the MeSH dataset, I found that my results were worse than those reported in the paper. My result in MeSH is
P@1,3,5: 0.9117345916709328 , 0.7319429296414183 , 0.5970285129209607
NDCG@1,3,5: 0.9117345916709328 , 0.791101895548494 , 0.7191552905597784
I was wondering if there are any additional parameters or configurations that I need to consider while running the experiments on the MeSH dataset. Is it possible that the configuration for the MeSH dataset is different from the one used for the MAG dataset?
Thank you for your time and I look forward to your response.
In the README it states:
NOTE: If you would like to run our code on your own datasets, there is no need to represent each paper/author/word as a number. Just make sure that (1) each paper/venue/author/word name does not have whitespace inside
I noticed that in vocabulary.txt
the words are all lowercase, and many "words" are actually multiple words separated by underscores.
If I'm starting with titles and abstracts that include capitalization and punctuation do I need to transform that in some way before putting it into the "text"
field in the .json
file?
Thank you very much for your work and making the codebase public; it is very inspiring :)
I am planning to implement something similar and had the following questions:
Are there any modifications we should make to run MATCH on a small label hierarchy?
I want to use MATCH to do multi-label text classification on scientific papers using a hierarchical biomimicry label taxonomy I have. Is there a way to use the MeSH labels and MAG fields of study as metadata to improve predictions?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.