Comments (9)
您好,可以修改一下代码,在输出结果的时候不使用ascii码
在最后json.dump中加入ensure_ascii=False
from deepke.
谢谢!
再请问一下,我的结果标签结果中为什么没有提取到关系?
结果如下:
[{"sentence": "如何演好自己的角色,请读《演员自我修养》《喜剧之王》周星驰崛起于穷困潦倒之中的独门秘笈", "head": "周星驰", "tail": "喜剧之王", "relation": "None"}, {"sentence": "《稻香》是周杰伦演唱的一首歌曲,由周杰伦作词、作曲,黄雨勋编曲,收录在周杰伦2008年10月15日发行的专辑《魔杰座》中", "head": "魔杰座", "tail": "稻香", "relation": "None"}]
triple_file.csv如下:
head_type,tail_type,relation
人物,影视作品,导演
音乐专辑,歌曲,所属专辑
人物,影视作品,主演
source_data.json如下:
[
{
"sentence": "如何演好自己的角色,请读《演员自我修养》《喜剧之王》周星驰崛起于穷困潦倒之中的独门秘笈",
"head": "周星驰",
"tail": "喜剧之王",
"head_offset": "26",
"tail_offset": "21"
},
//……
]
from deepke.
没有提取到关系是因为您的triple_file.csv里面没有包含您的source_data中的实体,您可以使用我们提供的triple_file: https://drive.google.com/file/d/1YpaMpivodG39p53MM9sMpB41q4EoiQhH/view?usp=sharing
from deepke.
请问您的问题解决了吗?
from deepke.
请问您的问题解决了吗?
没有解决,依旧没有提取到关系
from deepke.
谢谢!已经解决了,使用了您提供的triple_file!
没有提取成功是因为triple_file不包含实体,并且我的source-data中头实体和尾实体反了。
请问一下,有没有办法构造自己的三元组标签数据集?需要将涉及到的每个实体、关系加入自制的triple_file吗? 满足训练条件的数据量是多大?
from deepke.
您好,我们这里不涉及训练,是一个简易版的远程监督标注,您可以在我们构造的triple_file里添加自己需要的三元组进行标注
from deepke.
请问您的问题是否已解决?
from deepke.
已经解决,谢谢!
from deepke.
Related Issues (20)
- RE standard 是否有论文参考 HOT 2
- example中NER任务 HOT 10
- 二次训练数据集 HOT 2
- OneKE在未来是否考虑推出llama3版本 HOT 2
- 您好,请问关系抽取(全监督)任务采用的是管道模型还是联合模型 HOT 3
- 请问standard场景下的re任务中的数据集是贵组自己构建的吗 HOT 5
- 您好,请问关系抽取任务中的train_loss,valid_loss和test_loss HOT 3
- The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable. HOT 4
- 请问为什么模型生成的json格式包含\和多余的" HOT 3
- macos Sonoma 14.4.1 (M3 chip) 安装不了 HOT 2
- pip安装cnschema出错 HOT 10
- KG2Instruction框架图 HOT 1
- baichuan2-13b-iepile-lora模型预测报错zjunlp/baichuan2-13b-iepile-lora does not appear to have a file named config.json. HOT 13
- fastchat can use? HOT 1
- 在特定领域上微调运行脚本报错:finetune.py: error: the following arguments are required: --output_dir HOT 6
- 请问有kg2instrction使用的data目录的数据吗? HOT 5
- infer llama 报错 HOT 2
- infer llama3-8B-instruct 报错return _sentencepiece.SentencePieceProcessor_LoadFromFile(self, arg) TypeError: not a string HOT 1
- 显存不够 HOT 10
- LoRA微调大模型时,恢复训练 如何设置? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deepke.