Giter Club home page Giter Club logo

Comments (4)

johnb30 avatar johnb30 commented on June 10, 2024 1

It depends on what you mean by event coreference resolution. If you mean something like cross-doc or cross-sent linkage of events, then no. If you mean will PETRARCH2 return multiples of the same coded event per sentence, then also no.

from petrarch2.

philip-schrodt avatar philip-schrodt commented on June 10, 2024 1

By event co-resolution, do you mean determine if multiple texts that code to the same event tuple refer to the same thing? If so, no: in most of the work up until the past five years, event data sets were generally coded from a single source (typically Reuters or Agence France Press for the machine-coded data, New York Times in the human-coded systems prior to that), and this wasn't a big issue because it was fairly easy to detect multiple stories reporting on the same actions. With the advent of sets generated from large numbers of sources (ICEWS, Phoenix) it is a very big issue, and the "one-a-day" filter method that most systems use (including the Phoenix pipeline; ICEWS apparently does no deduplication) has some decided drawbacks: this paper (http://eventdata.parusanalytics.com/papers.dir/Schrodt.TAD-NYU.EventData.pdf) discusses the issue in detail. There's an emerging consensus that we need to do document-level resolution first, either by de-duplication (large NLP literature on this) or clustering (some method similar to Google News or European Media Monitor), but we haven't worked out any open source solutions for this yet.

from petrarch2.

Madity avatar Madity commented on June 10, 2024

Thanks for the clarification!
@philip-schrodt I was thinking on the grounds of sets generated from a large number of sources. Can anything be done on this grounds because with the advent of big data, datasets coded from a single source may not be sufficient, since considering the sets from multiple sources would provide more insight into the events.

from petrarch2.

johnb30 avatar johnb30 commented on June 10, 2024

PETRARCH does that within a sentence. For cross-sentence things we apply a daily one-a-day filter to the final output generated. See the phoenix_pipeline for more details on that. Specifically, this script. In other words, PETRARCh aims to do one thing: code event data. Pre- or post-processing is designed to occur elsewhere.

from petrarch2.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.