Comments (6)
您好,我检查了一下,论文里公式6和7的tanh确实都多了个linear,谢谢指正,之后会纠正过来,一切以代码为准。
from pfn.
非常感谢您的回复。我还有一些小小的疑问,请问您试验的时候是不是也尝试过添加线性层,发现效果较差才没有采用这个方法?还是说最初的想法就是没打算在这两个部分使用线性层,如果是这样的话能方便说一说是什么原因导致您不打算使用线性层呢?
from pfn.
tanh后面的linear我没有加过,只是写论文的时候习惯性得加上了linear。
模型方面是有进行过其他尝试,比如说像lstm那样加上三个门控,或者是在公式6处增加layernorm等,但是都收效甚微。
线性层加不加在我看来不会有什么收益,甚至还可能打乱神经元本来的层次顺序。
from pfn.
谢谢!我也尝试过直接在LSTM上做一个类似interaction的改进,也确实没有带来很好的效果,但您的思路更加深入也更加有新意!
from pfn.
好的,谢谢您对我们论文的认可
from pfn.
@insbread CC: @Coopercoppers
我检查了一下好像论文跟代码里面的是一致的。看下面的图片,红色的是公式6,蓝色的是公式7,请教一下,你们说的多了线性层/或者是没有实现公式6是指的哪个部分?
from pfn.
Related Issues (20)
- Small inconsistency, or not? HOT 1
- question about eval_metric HOT 1
- When training model, is it necessary to set args.do_eval=True?
- FileNotFoundError: [Errno 2] No such file or directory: 'data/data/NYT/ner2idx.json' HOT 1
- OOM for my own bigger datasets HOT 2
- Question about entity extraction of out-of-triples HOT 1
- chinese custom dataset HOT 2
- evaluate the model with customized input HOT 2
- 请问消融实验的编码模式encoding scheme应该怎么设置呀 HOT 2
- Nest下的处理 HOT 2
- SEMEVAL dataset HOT 1
- 为什么PFN-nested model在关系训练时候可以利用实体tail信息呢? HOT 2
- 你好,能否提供适用于nested-pfn的inference.py呢?感谢! HOT 1
- 关于头尾实体特征和关系特征的问题 HOT 5
- 关于实验结果复现的问题 HOT 2
- 关于实体抽取和关系抽取F1分数问题 HOT 2
- 关于OOT(Out-of-triple)数据集问题 HOT 2
- why use albert-xxlarge instead of bert-base when training on some datasets? HOT 2
- Question about the tail re unit HOT 6
- 有一个报错问题,不知道是环境不对还是, HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pfn.