Comments (5)
Dear Qianlong,
Thank you very much for your interest in our work!
We agree with your comment that a graph format of MAPLE may increase its usability. We will try to work on that and release it in several weeks. Thanks for the suggestion!
Regarding your question about paper references, for each paper in MAPLE, we include all of its references (represented by IDs) in our dataset. A considerable proportion of these references may not appear as papers in MAPLE (e.g., they are not published in top journals / conferences); some others, as you said, may appear in MAPLE but in a different field. In our paper, the reference ID is used as an input feature to the paper classifier, so we no longer need to know other information about the reference (e.g., text and metadata). However, if you would like to construct a graph, you may need to remove those references not appearing in MAPLE.
Please let us know if you have further questions.
Best,
Yu
from maple.
Thanks for your further illustration, I really appreciate it.
Yes, I believe removing the references not appearing in MAPLE is certainly an option, but the constructed graph could also be overly sparse since a large portion of references will be removed (some fields might have 80%~90% unmapped references according to my statistics study). Since MAPLE is constructed from MAG, is there any possibility that we can directly utilize the graph structure in MAG and split it into different sub-graphs (fields) as MAPLE?
Anyway, thanks again for your help, I look forward to you releasing the graph format of MAPLE!
from maple.
Hi Qianlong,
We have created a graph format of MAPLE. The data is available at https://zenodo.org/record/7797563.
You can refer to https://github.com/yuzhimanhua/MAPLE/blob/master/README_Graph.md for more details.
We removed the references not appearing in MAPLE to construct the graph. As you mentioned, in some fields (e.g., Art, History), the graph was sparse. We also tried to add all those missing references to the graph (by retrieving their text and metadata from MAG). In this case, the graph certainly became larger, but it did not become denser because the newly added papers brought even more unmapped neighbors.
We agree that directly splitting MAG may solve the problem. Thank you for the suggestion! We will explore it later.
from maple.
Thanks for your work and contribution, I really appreciate it!
from maple.
Dear MAPLE authors,
I recently did some preliminary experiments on some sub-fields (e.g., CSRankings and Art) of the MAPLE graph dataset and found a interesting phenomenon. In my experiments, I found that MLPs easily outperformed GNNs with the same number of parameters, which was unexpected. Typically, the absence of graph structures results in a 10-40% performance downgrade, but in this dataset, the opposite was observed. This phenomenon suggests that the graph structures used in this task may be detrimental to node classification performance.
Could you please help me resolve my question?
Best,
Qianlong
from maple.
Related Issues (2)
- Parabel training issue HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from maple.