Comments (3)
@gongel thanks!
from paddlenlp.
This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动,被标记为stale。
from paddlenlp.
This issue was closed because it has been inactive for 14 days since being marked as stale. 当前issue 被标记为stale已有14天,即将关闭。
from paddlenlp.
Related Issues (20)
- [Improvement Request] 简化数据集加载逻辑并改进文档支持 HOT 1
- 在 `paddlenlp 3.0` 的微调案例中数据结构有限性无法满足当前大模型微调所需要的数据结构 HOT 1
- [Bug]: 使用neural_search/recall/in_batch_negative 训练时候报错 TypeError: __init__() got an unexpected keyword argument 'enable_recompute' HOT 4
- [Question]: roberta目录下没有run_glue.py
- [Question]: 参考roberta文档进行fine-tune,发现没有对应脚本 HOT 1
- [Question]: UTC 微調後,使用 Taskflow 與 PromptModelForSequenceClassification 結果不一樣
- [Question]: utc模型是否有关于输入长度限制的说明?如何处理长度超过2048的字符串输入 HOT 1
- [Bug]: Taskflow预测二分类问题,多线程预测程序崩溃 HOT 2
- [Question]: 如何使用PaddleNlp的albert模型进行训练 HOT 3
- [Bug]: 使用albert进行cola任务的fine-tune,精度为0
- [Question]: 关于uie微调 HOT 2
- [Question]: 使用taskflow api 加载英文模型 rocketqav2-en-marco-cross-encoder 执行文本相似度计算,报错“ classifier.weight receives a shape [768, 2], but the expected shape is [768, 1].”
- vision llm HOT 1
- [Question]: 层次分类方案区别(文本分类hierarchical、通用文本分类UTC) HOT 1
- gradient accumulation + linear backward fusion支持 HOT 1
- [Question]: 将一些发票的ocr识别结果用llm提取关键信息,需要微调大模型?OCR识别结果没有语义,如果区微调有意义?
- [Bug]: AssertionError: All tokenizer files should be in the same directory
- [Bug]: UIE模型什么信息都抽不出来
- [Question]: 飞腾+昆仑R200怎么安装XPU自定义算子
- [Bug]: PaddleNLP大模型训练任务在老CPU机器上跑不起来
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from paddlenlp.