Describe the bug When I test my LLM on the commonsenseqa dataset,

[Bug] comsenseqa infer/eval bug about opencompass HOT 3 CLOSED

open-compass commented on August 15, 2024

[Bug] comsenseqa infer/eval bug

from opencompass.

Comments (3)

hanjr92 commented on August 15, 2024

but, i use python tools/prompt_viewer.py configs/eval_test_comsenseqa.py -n -a

from mmengine.config import read_base
from opencompass.partitioners import SizePartitioner, NaivePartitioner
from opencompass.runners import LocalRunner
from opencompass.tasks import OpenICLInferTask, OpenICLEvalTask
with read_base():
    from .datasets.commonsenseqa.commonsenseqa_ppl_5545e2 import commonsenseqa_datasets

len of self.index_ds in commonsenseqa : 9741
len of self.test_ds in commonsenseqa : 1221
but the len of ice_idx_list is 1221. It is equal to len of self.test_ds.

from opencompass.

Leymore commented on August 15, 2024

There is a partitioner machenism during launching tasks, that would explain the differences between 611 and 1221. But I'm not sure why there is a out of bound error...

from opencompass.

hanjr92 commented on August 15, 2024

It‘s my fault. I saved the intermediate results of the index as binary files, but did not consider partitioner machenism. Thanks.

from opencompass.

[Bug] comsenseqa infer/eval bug about opencompass HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent