Thanks for this great work. I am using transformer v4 I know this is not trans

Solution : <div class="snippet-clipboard-content notranslate position-relative ove

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

the implementation above is nearly identical to <a href="https://github.com/monologg/G

TypeError: Can't instantiate abstract class MultiLabelPipeline with abstract methods _forward, _sanitize_parameters, postprocess, preprocess about goemotions-pytorch HOT 5 OPEN

monologg commented on July 1, 2024

TypeError: Can't instantiate abstract class MultiLabelPipeline with abstract methods _forward, _sanitize_parameters, postprocess, preprocess

from goemotions-pytorch.

Comments (5)

1shershah commented on July 1, 2024 6

Solution :

from transformers import BertTokenizer
##files from git
from model import BertForMultiLabelClassification
from multilabel_pipeline import MultiLabelPipeline
from pprint import pprint

tokenizer = BertTokenizer.from_pretrained("monologg/bert-base-cased-goemotions-ekman")
model = BertForMultiLabelClassification.from_pretrained("monologg/bert-base-cased-goemotions-ekman")

texts = [
    "Hey that's a thought! Maybe we need [NAME] to be the celebrity vaccine endorsement!",
    "it’s happened before?! love my hometown of beautiful new ken 😂😂",
    "I love you, brother.",
    "Troll, bro. They know they're saying stupid shit. The motherfucker does nothing but stink up libertarian subs talking shit",
]
import torch
import numpy as np
results = []
for txt in texts:
    inputs = tokenizer(txt,return_tensors="pt")
    outputs = model(**inputs)
    scores =  1 / (1 + torch.exp(-outputs[0]))  # Sigmoid
    threshold = .3
    for item in scores:
        labels = []
        scores = []
        for idx, s in enumerate(item):
            if s > threshold:
                labels.append(model.config.id2label[idx])
                scores.append(s)
        results.append({"labels": labels, "scores": scores})

from goemotions-pytorch.

hannahburkhardt commented on July 1, 2024 3

@bubbazz if you mean that you aren't getting outputs for all labels, but only the main labels, try this.

from transformers import BertTokenizer, AutoModelForSequenceClassification, pipeline

model_name = 'original' #'ekman'

tokenizer = BertTokenizer.from_pretrained(f"monologg/bert-base-cased-goemotions-{model_name}")
model = AutoModelForSequenceClassification.from_pretrained(f"monologg/bert-base-cased-goemotions-{model_name}", num_labels=28)

goemotions=pipeline(
        model=model, 
        tokenizer=tokenizer, 
        task="text-classification",
        return_all_scores=True,
        function_to_apply='sigmoid',
    )

goemotions(texts)

from goemotions-pytorch.

santimarro commented on July 1, 2024

Hey, I had the same issue but I managed to make it work with this:

`from transformers import AutoTokenizer, AutoModelForSequenceClassification
from pprint import pprint
from multilabel_pipeline import MultiLabelPipeline

tokenizer = AutoTokenizer.from_pretrained(
"monologg/bert-base-cased-goemotions-original"
)
model = AutoModelForSequenceClassification.from_pretrained(
"monologg/bert-base-cased-goemotions-original")

def tokenize_text(text):
# Replace "text" with whatever column name has your text inputs
return tokenizer(text, truncation=True)

texts = [
"Hey that's a thought! Maybe we need [NAME] to be the celebrity vaccine endorsement!",
"it’s happened before?! love my hometown of beautiful new ken 😂😂",
"I love you, brother.",
"Troll, bro. They know they're saying stupid shit. The motherfucker does nothing but stink up libertarian subs talking shit",
]

goemotions = MultiLabelPipeline(
model=model,
tokenizer=tokenizer,
threshold=0.3
)
pprint(goemotions(texts))`

Just make sure to use the multilabel_pipeline provided in the python with the same name in this repo!

from goemotions-pytorch.

1shershah commented on July 1, 2024

Hey, I had the same issue but I managed to make it work with this:

`from transformers import AutoTokenizer, AutoModelForSequenceClassification from pprint import pprint from multilabel_pipeline import MultiLabelPipeline

tokenizer = AutoTokenizer.from_pretrained( "monologg/bert-base-cased-goemotions-original" ) model = AutoModelForSequenceClassification.from_pretrained( "monologg/bert-base-cased-goemotions-original")

def tokenize_text(text): # Replace "text" with whatever column name has your text inputs return tokenizer(text, truncation=True)

texts = [ "Hey that's a thought! Maybe we need [NAME] to be the celebrity vaccine endorsement!", "it’s happened before?! love my hometown of beautiful new ken 😂😂", "I love you, brother.", "Troll, bro. They know they're saying stupid shit. The motherfucker does nothing but stink up libertarian subs talking shit", ]

goemotions = MultiLabelPipeline( model=model, tokenizer=tokenizer, threshold=0.3 ) pprint(goemotions(texts))`

Just make sure to use the multilabel_pipeline provided in the python with the same name in this repo!

This will still not work, if you create a custom Pipeline with abstract Pipeline, you have to override the
abstract methods _forward, _sanitize_parameters, postprocess, preprocess

from goemotions-pytorch.

bubbazz commented on July 1, 2024

the implementation above is nearly identical to the pipeline.py but i get different result. can somebody explain what the reason for this is?

results:

[{'labels': ['joy', 'neutral'],
  'scores': [tensor(0.3892, grad_fn=<UnbindBackward0>),
   tensor(0.5499, grad_fn=<UnbindBackward0>)]},
 {'labels': ['joy', 'surprise'],
  'scores': [tensor(0.9277, grad_fn=<UnbindBackward0>),
   tensor(0.4548, grad_fn=<UnbindBackward0>)]},
 {'labels': ['joy'], 'scores': [tensor(0.9889, grad_fn=<UnbindBackward0>)]},
 {'labels': ['anger'], 'scores': [tensor(0.7580, grad_fn=<UnbindBackward0>)]}]

from goemotions-pytorch.

TypeError: Can't instantiate abstract class MultiLabelPipeline with abstract methods _forward, _sanitize_parameters, postprocess, preprocess about goemotions-pytorch HOT 5 OPEN

Comments (5)

Related Issues (11)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent