Comments (6)

yangheng95 commented on June 12, 2024

Hello, questions are welcome.

First of all, the model does use two types of input: one formatted for the BERT-BASE model and one formatted for the BERT-SPC model. The former is used only to extract local context information, while the latter is used to extract global context information and for the ATE task. The BERT-BASE input sequence is obtained by truncating the BERT-SPC input with the function get_ids_for_local_context_extractor(self, text_indices).
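To make the truncation concrete, here is a minimal token-level sketch. It is not the repository's implementation of get_ids_for_local_context_extractor, which operates on index tensors; the helper name truncate_spc_to_base and the example sentence are assumptions made for illustration only.

```python
from typing import List

CLS, SEP = "[CLS]", "[SEP]"


def truncate_spc_to_base(spc_tokens: List[str]) -> List[str]:
    """Hypothetical helper: keep everything up to and including the first [SEP].

    A BERT-SPC sequence looks like: [CLS] context [SEP] aspect [SEP].
    Cutting after the first [SEP] leaves the BERT-BASE form: [CLS] context [SEP].
    """
    first_sep = spc_tokens.index(SEP)  # position of the first separator
    return spc_tokens[: first_sep + 1]


# Made-up example sentence with the aspect "battery life".
spc = [CLS, "the", "battery", "life", "is", "great", SEP, "battery", "life", SEP]
base = truncate_spc_to_base(spc)
print(base)
```

In this sketch the aspect segment and the trailing [SEP] are dropped, so the local context extractor would see only the sentence itself.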

As for the attention mask, I studied several repositories for the named entity recognition (NER) task, such as BERT-NER (because NER is very similar to the ATE task), and I think there is no need to mask any aspect during training, although masking is needed during model evaluation.

This is my answer to your question. If it does not resolve the issue, please feel free to contact me.

from lcf-atepc.

Ulitochka commented on June 12, 2024

Thanks for your response.

But in the config file https://github.com/yangheng95/LCF-ATEPC/blob/master/exp-batch.json#L5 the parameter that controls truncation is False. That means there is no truncation.

The format for the BERT-SPC model is: [CLS] token_0 token_1 ... token_i [SEP] aspect tokens [SEP]
This model is used to extract global context information and for the ATE task.
ATE stands for aspect term extraction.
So, according to this format, the model that is supposed to extract the aspect terms is given those same target terms in its input data.
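The concern can be illustrated with a small sketch that builds an input in the BERT-SPC format and checks which tokens sit in the second segment. The helper name build_spc_input and the example sentence are hypothetical and are not taken from the LCF-ATEPC code.

```python
from typing import List

CLS, SEP = "[CLS]", "[SEP]"


def build_spc_input(context: List[str], aspect: List[str]) -> List[str]:
    """Hypothetical helper: assemble [CLS] context [SEP] aspect [SEP]."""
    return [CLS] + context + [SEP] + aspect + [SEP]


context = ["the", "battery", "life", "is", "great"]
aspect = ["battery", "life"]  # the terms an ATE model is expected to find

spc = build_spc_input(context, aspect)

# Everything between the first [SEP] and the final [SEP] is the aspect segment.
aspect_segment = spc[spc.index(SEP) + 1 : -1]
print(spc)
print(aspect_segment)
```

Here the aspect segment is exactly the list of terms to be extracted, which is why feeding this format to the ATE task hands the model its own targets.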

yangheng95 commented on June 12, 2024

We designed the "bert-base" parameter just to implement the bert-base model. When this parameter is True and takes effect, the input to bert-spc is truncated, and with local_context_focus="None" the model reduces to the bert-base model. Otherwise, the input stays in the bert-spc format, while the local context feature extractor always takes the BERT-BASE input. These two inputs are aimed at the global context and the local context, respectively.

Ulitochka commented on June 12, 2024

Repeating my question:
The format for the BERT-SPC model is: [CLS] token_0 token_1 ... token_i [SEP] aspect tokens [SEP]
This model is used to extract global context information and for the ATE task.
ATE stands for aspect term extraction.
So, according to this format, the model that is supposed to extract the aspect terms is given those same target terms in its input data.

yangheng95 commented on June 12, 2024

Hello,
It seems the ATE part of the code needs a repair. Have you tried implementing the BERT-BASE model for the ATE task?

When I tried to run the ATE task based on the BERT-NER code, the F1 score on the Laptop test set only reached about 71-72.

yangheng95 commented on June 12, 2024

For now, the model blocks the BERT-SPC input format by default to keep the ATE performance reasonable. However, the BERT-SPC input can still be used to improve the APC subtask. I will redesign the code and update the paper later.
