Comments (6)
Hello, questions are welcome.
First of all, the model does use two types of inputs, one is the input formatted for the BERT-BASE model, and the other is in the format of the BERT-SPC model. The former is used to extract local context information only, while the latter is used to extract global context information and ATE tasks. And the input sequence of BERT-BASE is obtained by truncation from the input of BERT-SPC by the functionget_ids_for_local_context_extractor(self, text_indices)
As for the attention mask, I studied several repositories for name entity recognition (NER) task, such as BERT-NER (because NER is very similar to the ATE task), and I think there is no need to mask any aspect during training, while model evaluation does.
This is my description of your question. If your question is not solved, please feel free to contact me.
from lcf-atepc.
Thanks for your response.
But in config file https://github.com/yangheng95/LCF-ATEPC/blob/master/exp-batch.json#L5 the parameter which is controls truncation is False. It means, that there is no truncation.
The format for BERT-SPC model is: [CLS] token_0 token_1 ... token_i [SEP] aspect tokens [SEP]
And this model is used to extract global context information and ATE tasks.
ATE - aspect terms extraction.
You give model, that should extract terms, information about this aim terms in input data, according to the format.
from lcf-atepc.
We designed "bert-base" parameter just to implement the bert-base model. When this parameter is True and takes effect, the input to bert-spc is truncated, and with local_context_focus="None", the model is reduced to the bert-base model. At other times, it represents the input of bert-spc, and local context features extractor always takes the input of BERT-BASE. Those inputs are aims at global context and local context, respectively.
from lcf-atepc.
duplicate:
The format for BERT-SPC model is: [CLS] token_0 token_1 ... token_i [SEP] aspect tokens [SEP]
And this model is used to extract global context information and ATE tasks.
ATE - aspect terms extraction.
You give model, that should extract terms, information about this aim terms in input data, according to the format.
from lcf-atepc.
Hello,
Seems the code of ATE part need a repair. Have you tried to implement the BERT-BASE model for the ATE task?
When I tried to conduct the ATE task based on the codes BERT-NER, but the F1 score on the Laptop test set only reached about 71-72.
from lcf-atepc.
The model temporarily defaults block the input format of BERT-SPC to keep the rationality of ATE performance. However, the BERT-SPC input still could be used to improve the APC subtask. I will redesign the code and update the paper later.
from lcf-atepc.
Related Issues (20)
- 关于数据转换。 HOT 12
- cross validation HOT 1
- Evaluation HOT 1
- The problem of the training on new dataset HOT 8
- pip install -U pyabsa问题 HOT 2
- 掩码长度问题 HOT 20
- 只有cpu可以训练吗? HOT 6
- 训练模型数据转换错误 HOT 3
- 你好,我试验了你们的这个多任务学习模型,有一些问题想请教。 HOT 3
- evaluation HOT 2
- Regard Dataset HOT 1
- Regrad loss function
- 预测的问题 HOT 1
- Hello, how can I solve this problem? thank you very much HOT 7
- 数据集标签问题 HOT 2
- 用我的数据集做预测时,遇到了一个问题 HOT 17
- LCF
- predicited lable
- pool function
- ATE_test_F1值低 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lcf-atepc.