Comments (3)
请参考:https://aistudio.baidu.com/projectdetail/4049663?channelType=0&channel=0
from paddlenlp.
This issue is stale because it has been open for 60 days with no activity. 当前issue 60天内无活动,被标记为stale。
from paddlenlp.
This issue was closed because it has been inactive for 14 days since being marked as stale. 当前issue 被标记为stale已有14天,即将关闭。
from paddlenlp.
Related Issues (20)
- [Question]: 数据集加载失败,老是报错。 HOT 1
- paddlenlp wordtag转onnx推理有提升吗
- [Question]: 关于开启block_attn后模型的停止条件
- [Question]: lexical_analysis不支持排序模式了么
- [Docs]:
- [Question]: 关于llm的pretrain部分代码实现与sft部分代码实现
- 增强 paddlenlp 以支持多轮对话、agent对话和工具对话
- [Improvement Request] 简化数据集加载逻辑并改进文档支持 HOT 1
- 在 `paddlenlp 3.0` 的微调案例中数据结构有限性无法满足当前大模型微调所需要的数据结构 HOT 1
- [Bug]: 使用neural_search/recall/in_batch_negative 训练时候报错 TypeError: __init__() got an unexpected keyword argument 'enable_recompute' HOT 4
- [Question]: roberta目录下没有run_glue.py
- [Question]: 参考roberta文档进行fine-tune,发现没有对应脚本 HOT 1
- [Question]: UTC 微調後,使用 Taskflow 與 PromptModelForSequenceClassification 結果不一樣
- [Question]: utc模型是否有关于输入长度限制的说明?如何处理长度超过2048的字符串输入 HOT 1
- [Bug]: Taskflow预测二分类问题,多线程预测程序崩溃 HOT 2
- [Question]: 如何使用PaddleNlp的albert模型进行训练 HOT 3
- [Bug]: 使用albert进行cola任务的fine-tune,精度为0
- [Question]: 关于uie微调 HOT 2
- [Question]: 使用taskflow api 加载英文模型 rocketqav2-en-marco-cross-encoder 执行文本相似度计算,报错“ classifier.weight receives a shape [768, 2], but the expected shape is [768, 1].”
- vision llm HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from paddlenlp.