Comments (2)
Hi,
regarding the output interpretation:
entity_clf is a (B x S x E) tensor where B is the batch size (in sentences), S the count of all token spans up to a specified length (10 per default) and E the number of entity types (+1 for "None"). It contains the model's (softmax) confidences that a given span belongs to a certain type of E. In case the span is assigned to the "None" type (=no entity), it is disregarded in the relation extraction step.
rel_clf is a (B x P x R) tensor where B is again the batch size, P the count of all entity pairs (= spans not assigned to the None class) and R the number of relation types. For each entity pair, it contains the (sigmoid) scores for each relation type.
rels is a (B x P x 2) tensor that contains the corresponding entity indices (in entity_clf, entity_masks, entity_sizes etc.) for each entity pair. With this, you can for example access the corresponding entity scores (by indexing entity_clf with rels).
Regarding your second question: It is (span start, span end, entity type, score). Here 'span end' is exclusive. Also, it corresponds to BPE tokens (byte-pair encoded, as in BERT), not to raw tokens.
from spert.
Thank you Markus for your help
from spert.
Related Issues (20)
- How to easily use this model for inference HOT 3
- Can't make predictions following the example HOT 8
- Help! Help! HOT 1
- Help, HOT 6
- How to call only the relation classifier on a pair of entities? HOT 2
- What is the meaning of the dataset tensors? HOT 1
- Simple example issue HOT 1
- Parts of entities are recognised separately HOT 3
- How does span filtering work? HOT 3
- Runtime Error HOT 1
- RuntimeError: copy_if failed to synchronize: cudaErrorAssert: device-side assert triggered HOT 4
- Does SpERT work with GPT models? HOT 1
- How to prepare dataset for training the model? HOT 9
- Can't download datasets
- TypeError: 'NoneType' object is not callable HOT 1
- Can't make train following the example
- Trained model : Relation classification is bad
- HELP HOT 1
- Extract entities and relation from Spacy tokens?
- [WARNI] NaN or Inf found in input tensor. HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from spert.