Giter Club home page Giter Club logo

credbank-data's People

Contributors

compsocial avatar tanumitra avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

credbank-data's Issues

Where is the text of the tweets?

I recently access to the CREDBANK-data, merging all the different databases. So far I have found the main topics, score, and so on. I would love to use this corpus of tweets in a paper, but unfortunately, I can't find the original text of the tweets, where is it? Is it available in another resource? Did I miss anything?

A way of getting the original text of the tweets could be using the id of the tweet and the REST API of Twitter. But given the number of tweets and the time since they were posted, I am afraid it will not be possible or will take a lot of time. So I was wondering if it could be possible to get the text?

BTW thanks for sharing and congrats for the great job done!

Pulling dataset in Jupyter Notebook

I am trying to execute the. s3cmd get --requester-pays s3://credbank/stream_tweets_byTimestamp.data command in a jupyter notebook and am getting a syntax error. Is there another way to write this?

Permission Denied when I try to download the data

Hi!

I'd want to use this dataset for my thesis work, but I get the error

"[Errno 13] Permission denied: 'stream_tweets_byTimestamp.data"

when I launch the command

aws s3api get-object --request-payer requester --bucket credbank --key stream_tweets_byTimestamp.data stream_tweets_byTimestamp.data.

Could anyone try to help me to solve the problem?

Thank you very much!

[Ask] Question regarding the dataset

Dear Authors,

Thank you for hosting your data set in Github, it is very helpful and insightful for my study,
I am a graduate student and currently studying twitter information mining and trying to use your data set, I have questions regarding the data set :

  1. I take the first data (second row) in the cred_event_SearchTweets.data file, the topic_key valued as everything_royals_rain-20141015_161647-20141015_172214 yet the topic_terms valued as state,emergency,#ferguson.
  2. In the JSON tuple from the file the cred_event_SearchTweets.data, I randomly checked 3 tweet IDs (e.g. 522771074048884736) is referencing to the "host,patrick,neil" topic

Please kindly help to explain the findings above, is it suppose to be like that? because as far as I understand the topic_keys, the topic_terms and the JSON tuple should represent the same topic, please advice :)

Thank you

dataset downloadKO

Hello,
I cannot download the dataset with this command

I have this message : Read Time Out On endpoint URL : "None"

can you please tell me what to do

Thanks

Unable to download the files

Hello,

Thank you for sharing these data sets.
I was unsuccessfully trying to download these data sets using my AWS credentials. I always receive "An error occurred (403) when calling the HeadObject operation: Forbidden".
I am not sure if something wrong with my credentials or the bucket has some access restrictions.

Can you please help me please with that.

Thank you very much,

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.