Hi Allan, Can I ask what is returned in the CRF forward function?(lo

The output for CRF loss about pytorch_neural_crf HOT 5 CLOSED

allanj commented on June 27, 2024

The output for CRF loss

from pytorch_neural_crf.

Comments (5)

allanj commented on June 27, 2024 2

Because initially, the labeled score (in log space) could be something like -180
unlabeled score could be: 557

(I debugged inside and check the value).

In that case, when you take the exp operation, it would be really close to zero, probably exceed the floating-point boundary in python/torch.
It's like torch.exp(-1000) = 0, and torch.exp(-500) still = 0.

Thus, log(0) gives you nan. That's why we prefer to work on log space directly

from pytorch_neural_crf.

mxc19912008 commented on June 27, 2024 1

I see, thanks again! :)

from pytorch_neural_crf.

mxc19912008 commented on June 27, 2024 1

Thanks Allan! I wanted to add something to the softmax, before log. I'll do more experiments :)

from pytorch_neural_crf.

allanj commented on June 27, 2024

Yes.
For CRF, it is a negative log-likelihood.
Mathematically, the probability p(y|x) is:

If we take the negative loglikelihood, it becomes:

So the left-hand side is unlabeled_score, the right-hand side is labeled_score. And we make a subtraction.

You can also take an average (mean) to make the loss smaller but I guess it doesn't change a lot the performance.

from pytorch_neural_crf.

mxc19912008 commented on June 27, 2024

Hi Allan,

I have one more question if you don't mind.
If I change the loss like this:

p = torch.exp(labeled_score - unlabed_score)
return -torch.log(p)

it should work because unlabed_score - labeled_score is the nll loss, then probability should be expressed as above, and then I do -torch.log, it should generate the same nll loss, but instead it generates "nan" at each epoch.

Thanks, Allan!

from pytorch_neural_crf.

Recommend Projects

The output for CRF loss about pytorch_neural_crf HOT 5 CLOSED

Comments (5)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent