Comments (4)
It depends on what you mean by event coreference resolution. If you mean something like cross-doc or cross-sent linkage of events, then no. If you mean will PETRARCH2 return multiples of the same coded event per sentence, then also no.
from petrarch2.
By event co-resolution, do you mean determine if multiple texts that code to the same event tuple refer to the same thing? If so, no: in most of the work up until the past five years, event data sets were generally coded from a single source (typically Reuters or Agence France Press for the machine-coded data, New York Times in the human-coded systems prior to that), and this wasn't a big issue because it was fairly easy to detect multiple stories reporting on the same actions. With the advent of sets generated from large numbers of sources (ICEWS, Phoenix) it is a very big issue, and the "one-a-day" filter method that most systems use (including the Phoenix pipeline; ICEWS apparently does no deduplication) has some decided drawbacks: this paper (http://eventdata.parusanalytics.com/papers.dir/Schrodt.TAD-NYU.EventData.pdf) discusses the issue in detail. There's an emerging consensus that we need to do document-level resolution first, either by de-duplication (large NLP literature on this) or clustering (some method similar to Google News or European Media Monitor), but we haven't worked out any open source solutions for this yet.
from petrarch2.
Thanks for the clarification!
@philip-schrodt I was thinking on the grounds of sets generated from a large number of sources. Can anything be done on this grounds because with the advent of big data, datasets coded from a single source may not be sufficient, since considering the sets from multiple sources would provide more insight into the events.
from petrarch2.
PETRARCH does that within a sentence. For cross-sentence things we apply a daily one-a-day filter to the final output generated. See the phoenix_pipeline for more details on that. Specifically, this script. In other words, PETRARCh aims to do one thing: code event data. Pre- or post-processing is designed to occur elsewhere.
from petrarch2.
Related Issues (20)
- Adding information to 'meta' when expanding cooperating compounds HOT 1
- Add documentation and unit tests to output from #6 HOT 4
- Strange output format for phrase extraction. HOT 2
- Strict documentation/freezing of parse tree input is needed HOT 2
- how do i include custom dictionary in petrarch(2)? HOT 1
- Finish writing error messages to log rather than using print() HOT 1
- Pull dictionaries out of repo HOT 2
- Config file and parsing for NullVerbs and NullActors HOT 1
- Incorrect Command line Parsing Function: parse_cli_args HOT 2
- make_plural_noun(noun) function when reading verb dictionary HOT 1
- Install instructions reference incorrect petrarch version HOT 3
- Add a Contribute section to README HOT 2
- GigaWord.sample.PETR.xml file without parse blocks HOT 8
- Make petrarch2 output more JSON friendly HOT 4
- When to add a pipe ‘|’
- ImportError: No module named 'PETRglobals' HOT 1
- Date comparison bug HOT 1
- Bug in generating text
- Adapting new Treebank format
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from petrarch2.