Comments (9)
@LSinev Hi,
It looks like the vllm engine, which is supported in 0.4.0, runs faster than the hf engine.
from mera.
"I checked the link you provided here, and it seems they are indeed missing."
This link goes to a fork of lm-evaluation-harness. The fork contains code needed for the RuTiE task, which has been submitted as a PR to lm-evaluation-harness but has not yet been approved and merged.
There are no plans yet to submit MERA tasks directly to lm-evaluation-harness.
new_harness_codebase uses 0.4.x code, but the tasks are not fully in YAML format yet (they will be, but not yet, just like, for example, the SQUADv2 task in lm-evaluation-harness). MERA tasks are stored in https://github.com/ai-forever/MERA/tree/update/new_harness_codebase/benchmark_tasks, since the new code allows tasks to be loaded from an external directory.
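For anyone who wants to try the external task directory, here is a minimal sketch of how such a run might be assembled. It assumes the upstream lm-evaluation-harness 0.4.x command line (`lm_eval` with `--model`, `--model_args`, `--tasks`, `--include_path`, `--output_path`); the patched fork used by this branch may differ. The model name `my-org/my-model` is hypothetical, and `rummlu` is assumed to be a valid task name in `benchmark_tasks`.

```python
import shlex


def build_lm_eval_command(backend, pretrained, tasks, include_path, output_path="results"):
    """Assemble an lm-evaluation-harness 0.4.x style command as an argument list.

    Flag names follow the upstream 0.4.x CLI; treat them as an assumption
    for the patched fork used in the new_harness_codebase branch.
    """
    if backend not in ("hf", "vllm"):
        raise ValueError(f"unsupported backend: {backend!r}")
    if not tasks:
        raise ValueError("at least one task name is required")
    return [
        "lm_eval",
        "--model", backend,                          # "hf" or "vllm"
        "--model_args", f"pretrained={pretrained}",  # model to load
        "--tasks", ",".join(tasks),                  # comma-separated task names
        "--include_path", include_path,              # external directory with task definitions
        "--output_path", output_path,                # where results are written
    ]


# Hypothetical model name; "rummlu" is assumed to exist in benchmark_tasks.
cmd = build_lm_eval_command("vllm", "my-org/my-model", ["rummlu"], "benchmark_tasks")
print(shlex.join(cmd))
```

The list can be passed to `subprocess.run(cmd, check=True)` from the repository root once the harness is installed. Switching `"vllm"` to `"hf"` gives the equivalent run on the hf engine, which makes it easy to compare the two.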
Yes, there are! :) stay tuned!
Do you have any particular expectations for improvements with the upgrade to the 0.4.0+ backend?
Hello guys! May I ask whether you are working on this, and whether you have any estimated dates?
@LSinev
I will give more information next week, or maybe even share a branch for playing with and testing the work in progress.
new_harness_codebase is the "work in progress" branch. It includes a patched lm-evaluation-harness as a submodule (the patch is waiting for its PR to be merged).
All scores will change. The leaderboard will not publish these yet, but you can use them for private scoring. You will need to score the baseline models yourself. Changes to the model-running code (the lm-evaluation-harness side) should be made in their repository in order to be supported here.
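A minimal sketch of how the branch and its submodule might be checked out, written as command lists so they can be inspected before running. The repository URL and the branch name `update/new_harness_codebase` are inferred from the link earlier in this thread; the target directory `MERA` is an arbitrary choice.

```python
def build_checkout_commands(repo_url, branch, target_dir):
    """Return git commands that clone one branch together with its submodules."""
    return [
        # Clone only the requested branch and fetch submodules in the same step.
        ["git", "clone", "--branch", branch, "--recurse-submodules", repo_url, target_dir],
        # Harmless if the submodules are already present; useful after a later pull.
        ["git", "-C", target_dir, "submodule", "update", "--init", "--recursive"],
    ]


# URL and branch are inferred from the link in this thread (an assumption).
commands = build_checkout_commands(
    "https://github.com/ai-forever/MERA",
    "update/new_harness_codebase",
    "MERA",
)
for command in commands:
    print(" ".join(command))
```

Each list can be executed with `subprocess.run(command, check=True)`. The second command matters here because the patched lm-evaluation-harness lives in a submodule, so a plain clone without submodules would leave that directory empty.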
great, thank you!
Hi @LSinev,
I noticed that the tasks from the branch do not include the MERA tasks in the 0.4.x format. I checked the link you provided here, and it seems they are indeed missing.
Could you please confirm if the MERA tasks will be added to this branch, or if there is another location where they might be available?
Thanks!
Related Issues (14)
- empty value rummlu HOT 3
- How to add prompt formatting? HOT 2
- How to score a model without a loglikelihood method? HOT 1
- Error when submitting to mera.a-ai.ru HOT 2
- tokenizer does not have a padding token HOT 1
- [Feature Request] Support for OpenAI ChatCompletion models HOT 2
- How to benchmark a closed model that has no loglikelihood method? HOT 1
- Benchmark log values HOT 1
- Scoring GGUF models HOT 1
- Cannot log in to the mera.a-ai.ru site
- Large models that do not fit on a single GPU are not parallelized across several HOT 2
- Effect of the prompt on benchmark results
- no targets in rummlu and others benchmarks HOT 12