For sentence classification task using BERT, is the PAD token used in IG/Deeplift? or

For sentence classification using BERT, PAD token is used in IG/Deeplift? about captum HOT 1 OPEN

lkqnaruto commented on September 28, 2024

For sentence classification using BERT, PAD token is used in IG/Deeplift?

from captum.

Comments (1)

EldadTalShir commented on September 28, 2024

The default reference in IG is a zero scalar corresponding to each input tensor (effectively PAD for BERT). It can be customized by setting the 'baselines' parameter when calling the attribute function. For example (setting UNK as reference, assuming seq_len are the number of tokens in your input):

# Custom token for IG
from transformers import AutoTokenizer
from captum.attr import TokenReferenceBase

tokenizer = AutoTokenizer.from_pretrained('all-MiniLM-L6-v2') # Load your model's tokenizer
ref_token_id = tokenizer.unk_token_id  # Choose the id of your desired token, you can call tokenizer.all_special_tokens for a list of all special tokens supported by your model
token_reference = TokenReferenceBase(reference_token_idx=ref_token_id) # Use Captum to generate a reference based on the number of tokens in your input
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
ref = token_reference.generate_reference(seq_len,device=device).unsqueeze(0)

Then when you call attribute set baselines=ref. You can follow this guide as well: https://captum.ai/tutorials/IMDB_TorchText_Interpret

from captum.

For sentence classification using BERT, PAD token is used in IG/Deeplift? about captum HOT 1 OPEN

Comments (1)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent