Giter Club home page Giter Club logo

musae's People

Contributors

benedekrozemberczki avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

musae's Issues

reduce node feature dimension

Hi, thank you for your great work!
I have further questions.

  1. FacebookPagePage dataset

    • I want to reduce its dimension from 128 to 64.
    • So can I get the raw text which you used?
    • I saw your recommendation on issue3. Can I do dimensionality reduction on this dataset, too?
  2. Twitch datasets

    • I want to reduce these too.
    • The paper mentions, "Node features are games liked, location and streaming habits."
    • So I think simple dimensionality reduction on this dataset might be harmful.
    • How can I handle these?

Thanks,

A question about node labels

Hi Benedek,

I have one question about the file "DE_target.csv". There are several files like this one in the repository.

There are several columns in this file, including "id", "days", "mature", "view", "partner", and "new_id". I am curious about which column indicates the label of a node, that is, whether a streamer uses explicit language.

Could you give me a hint about this? Many thanks!

Best regards,
Simon

What's the meaning of features?

I download the datasets (github) from SNAP, but I'm now confused about the features in .json format.
Have they been preprocessed already so that they can be put into use without further processing?
Or do I need to understand what each dimension in the features mean?

Node features for Facebook graph

Hi, thanks for your contributions!

About the Facebook dataset - what do the node features represent, and how were they generated? The paper mentions that the features are extracted from site descriptions. Does this mean they're text features, and if so which text representation or embedding did you use?

A question on meaning of the node feature.

Thank you for your excellent work! And I would be very grateful if you could answer my question. That is, what's the meaning of the numbers in the node feature json file. For example, in the MUSAE/input/features/git.json. I guess that one vector in the json corresponds to a node, and you mentioned in the manuscript that ` Node features are location, starred repositories, employer and e-mail address'. How can I turn these infomation into the numbers in the json file?

Thank you!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.