afshinrahimi / geographconv Goto Github PK

View Code? Open in Web Editor NEW

67.0 67.0 18.0 60 KB

Semi-supervised User Geolocation via Graph Convolutional Networks

Python 94.72% Shell 5.28%

geographconv's People

Contributors

Stargazers

Watchers

Forkers

songfgh afcarl phenomena9 zhengshuhao zhuhm1996 m397026474 michaelwayneliu littlebadrobot amirhossein-dezhboro youngflyasd chaitanyabaweja zengwesley31729 flash-barry wenzheli rajasriramoju konghr msaidzengin wsj-7416

geographconv's Issues

Questions regarding the dataset and best hyper-paramters

Hi,

Thank you for sharing the code of this impressive work.
I have two questions regarding how to reproduce the results in the paper.

Based on the paper, it seems that the TWITTER-WORLD dataset is larger than the TWITTER-US; however, when I downloaded the data from this link, I found that the files in the na folder is larger than the ones in the world folder, which confuses me. I wonder if there is a naming typo here.
I tried the following command to get the GCN results with the default hyper-paramters on GEOTEXT:
python gcnmain.py -save -i cmu -d data/cmu -enc latin1
Unfortunately, I only get:
PM dev results:
PM Mean: 565 Median: 103 Acc@161: 54
PM test results:
PM Mean: 578 Median: 99 Acc@161: 53
This is a lot worse than the Mean: 546 Median: 45 Acc@161: 60 reported in the paper.
Could you please share the commands you used to produce the amazing GCN results on all three datasets in Table 1 of the paper?

How do you get these user id's?

Your data uses these identifiers for users: USER_ee551c6c.

These don't correspond to the type of id's you get from twitter. How do you convert the twitter id's in this format. I am asking this because I need to augment my data to your dataset and would use a similar conversion for my data as well.

How to get predictions out in a csv file?

I wish to get label information out. Which label corresponds to what city?

Question regarding Twitter-World dataset

Hi,
First of all, very impressive work that set the foundation of user geolocation task using graph-based method.

However, there is one question regarding twitter-world dataset. As the mention graph is critical in the methodology, current twitter-world dataset provided by your link seems does not contain the mention@ or reply information towards your encoded id.

For example, the user id in the dataset seems to be encoded as '421527640' , '86941247' but the mentioned user in the stacked tweet seems to be '171966527 36.09986 -80.24422 ||| @wakefan2321 what's your PSN name?', which is the plaintext. Therefore, this makes it unavailable to construct the mention graph based on the encoded user id.

Can you kindly clarify this, maybe I miss some details in the dataset.

Best,

The version of Lasagne(0.1) cannot associate with the version of Theano(1.0.3)

I run the code and it came out some import errors inside the Lasagne package code. I have to edit the code of Lasagne to run the code.

==========================
I solved it by updating the Lasagne.

Questions about the project

Hello,

I am interested in your work about the Twitter graph dataset and i'd like to do the same thing on my own side.

I need to know beforehand, how long did it took to run the whole process and on what kind of hardware ? I am not looking for performance, I just want an overall estimation about the needed time to compute in order to organize my upcoming work (and also it would help me detect a problem if it runs for too long).

Also the Twitter graph dataset is quite big so I'm being cautious, how much RAM will I need to make this run ?

Thank you for putting this into github, I mean it.

Best regards

afshinrahimi / geographconv Goto Github PK

geographconv's People

Contributors

Stargazers

Watchers

Forkers

geographconv's Issues

Questions regarding the dataset and best hyper-paramters

How do you get these user id's?

How to get predictions out in a csv file?

Question regarding Twitter-World dataset

The version of Lasagne(0.1) cannot associate with the version of Theano(1.0.3)

Questions about the project

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent