deepie's People

Contributors

loujie0822

deepie's Issues

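CHIP2020 named entity recognition

Hi, a question: when you use cascaded pointer tagging for the CHIP2020 named entity recognition task, the 9 entity classes presumably lead to a label sparsity problem. How do you deal with it? I put a Linear layer on top of an LSTM and treated the task as multi-class classification, but the results are very poor and no entities are recognized.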

Cannot extract SPO information from the Baidu 2019 data

I looked closely at the data-loading code, and it does not seem to match the 2019 data format, unless I misread it.
run/relation_extraction/etl_span_transformers/data_loader_v2.py, line 212
In the 2019 data, spo['object'] is already a string, so it has no keys() method:
    for spo_object in spo['object'].keys():
        if spo['predicate'] in self.spo_conf:
            label = spo['predicate']
        else:
            label = spo['predicate'] + '_' + spo_object
        spo_dict[self.spo_conf[label]] = spo['object'][spo_object]
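
A minimal sketch of how the loop quoted above could tolerate both layouts, assuming the 2019 data stores spo['object'] as a plain string and the 2020 data stores it as a dict of slot names to text. The function name collect_objects, the slot name '@value' used for wrapping, and the guard that skips unknown labels are assumptions for illustration, not code from the repository.

```python
def collect_objects(spo, spo_conf):
    """Return {label_id: object_text} for one SPO record.

    Accepts a string object (assumed 2019 layout) or a dict object
    (assumed 2020 layout) and applies the same labelling rule to both.
    """
    obj = spo['object']
    if isinstance(obj, str):
        # Wrap the string so both layouts share one loop.
        # '@value' is an assumed slot name.
        obj = {'@value': obj}

    spo_dict = {}
    for slot, text in obj.items():
        if spo['predicate'] in spo_conf:
            label = spo['predicate']
        else:
            label = spo['predicate'] + '_' + slot
        # Assumption: labels missing from spo_conf are skipped
        # instead of raising KeyError.
        if label in spo_conf:
            spo_dict[spo_conf[label]] = text
    return spo_dict
```

With a string object the predicate is looked up directly; with a dict object each slot is looked up as predicate + '_' + slot whenever the bare predicate is not in spo_conf.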

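News content extraction

Hi, both the BERT implementation of "Joint Extraction of Entities and Relations Based on a Novel Decomposition Strategy" and Su Jianlin's bert4keras information extraction reach an F1 of about 0.82 on the Baidu 2019 data. But when I actually extract from news articles, splitting the text into sentences, the results are far from satisfactory. Are there any tricks you would recommend?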

SPO extraction

Can transformers_multi_label_span currently be run to extract SPO information from the Baidu data?

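About span-based joint extraction

Hello, I have been following your project closely and have read all of your articles on Zhihu. I noticed that you recently committed a transformers-based SPO method. Is it the method from the SpERT paper?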

AttributeError: 'str' object has no attribute 'keys'

  File "/home/powerop/work/DeepIE-master/run/relation_extraction/etl_span_transformers/data_loader_v2.py", line 212, in _read
    for spo_object in spo['object'].keys():
AttributeError: 'str' object has no attribute 'keys'

CHIP2020

Please publish the project code soon, I would like to learn from it.

FLAT

Hello, is there code for the FLAT method? I checked, and the original authors only provide an empty link.

Could you provide some model data?

Hello,
I tried running etl_span_transformers and got some errors:
2021-01-26 14:49:24,295 - transformers.tokenization_utils - INFO - Model name 'transformer_model_path' not found in model shortcut name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese, bert-base-german-cased, bert-large-uncased-whole-word-masking, bert-large-cased-whole-word-masking, bert-large-uncased-whole-word-masking-finetuned-squad, bert-large-cased-whole-word-masking-finetuned-squad, bert-base-cased-finetuned-mrpc, bert-base-german-dbmdz-cased, bert-base-german-dbmdz-uncased). Assuming 'transformer_model_path' is a path or url to a directory containing tokenizer files.
2021-01-26 14:49:24,295 - transformers.tokenization_utils - INFO - Didn't find file transformer_model_path. We won't load it.
2021-01-26 14:49:24,296 - transformers.tokenization_utils - INFO - Didn't find file transformer_model_path\added_tokens.json. We won't load it.
2021-01-26 14:49:24,296 - transformers.tokenization_utils - INFO - Didn't find file transformer_model_path\special_tokens_map.json. We won't load it.
2021-01-26 14:49:24,296 - transformers.tokenization_utils - INFO - Didn't find file transformer_model_path\tokenizer_config.json. We won't load it.
Traceback (most recent call last):
  File "run/relation_extraction/etl_span_transformers/main.py", line 148, in <module>
    main()
  File "run/relation_extraction/etl_span_transformers/main.py", line 129, in main
    tokenizer = BertTokenizer.from_pretrained(args.bert_model, do_lower_case=True)
  File "D:\Anaconda3\envs\deepie\lib\site-packages\transformers\tokenization_utils.py", line 283, in from_pretrained
    return cls._from_pretrained(*inputs, **kwargs)
  File "D:\Anaconda3\envs\deepie\lib\site-packages\transformers\tokenization_utils.py", line 347, in _from_pretrained
    list(cls.vocab_files_names.values())))
OSError: Model name 'transformer_model_path' was not found in tokenizers model name list (bert-base-uncased, bert-large-uncased, bert-base-cased, bert-large-cased, bert-base-multilingual-uncased, bert-base-multilingual-cased, bert-base-chinese, bert-base-german-cased, bert-large-uncased-whole-word-masking, bert-large-cased-whole-word-masking, bert-large-uncased-whole-word-masking-finetuned-squad, bert-large-cased-whole-word-masking-finetuned-squad, bert-base-cased-finetuned-mrpc, bert-base-german-dbmdz-cased, bert-base-german-dbmdz-uncased). We assumed 'transformer_model_path' was a path or url to a directory containing vocabulary files named ['vocab.txt'] but couldn't find such vocabulary files at this path or url.
Could you provide some model data? Many thanks.
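
The log above suggests that the model argument was still the literal placeholder 'transformer_model_path' and that the tokenizer looked for a vocab.txt under it. A small pre-flight check along these lines might make that easier to spot before BertTokenizer.from_pretrained is called. This is only a sketch: the helper name missing_tokenizer_files is hypothetical, and the only required file it assumes is the vocab.txt named in the error message.

```python
import os


def missing_tokenizer_files(model_dir, required=('vocab.txt',)):
    """Return the required file names that are absent from model_dir.

    If model_dir is not an existing directory, every required file
    is reported as missing.
    """
    if not os.path.isdir(model_dir):
        return list(required)
    return [
        name for name in required
        if not os.path.isfile(os.path.join(model_dir, name))
    ]


# Hypothetical usage: fail early with a readable message.
missing = missing_tokenizer_files('transformer_model_path')
if missing:
    print('Model directory is missing:', ', '.join(missing))
```

An empty list means the directory at least contains the vocabulary file; it does not verify the model weights or the config.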

A problem in the etl_span train.py code?

/run/relation_extraction/etl_span/train.py
lines 145-147
    ans_dict = self.convert_spo_contour(qids, subject_pred, po_pred, eval_file,
                                        answer_dict, use_bert=self.args.use_bert)
    return ans_dict
convert_spo_contour is defined on lines 285-315.
The body of that function contains no return statement. Is this relying on some advanced syntax or a torch feature? I could not work it out.
Thanks, everyone.
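
For reference, this is plain Python behaviour rather than anything specific to torch: a function whose body has no return statement returns None, but any changes it makes to a mutable argument remain visible to the caller. The sketch below uses a hypothetical stand-in, fill_answers, and assumes (based only on the report above) that convert_spo_contour fills answer_dict in place.

```python
def fill_answers(qids, answer_dict):
    """Hypothetical stand-in: record a dummy prediction per question id.

    The dict is modified in place and nothing is returned.
    """
    for qid in qids:
        answer_dict.setdefault(qid, []).append('pred')


answers = {}
result = fill_answers([1, 2], answers)

# The call evaluates to None because there is no return statement,
# while the dict passed in now holds the predictions.
print(result)
print(answers)
```

Under that assumption, ans_dict in train.py would be None, and the collected results would have to be read from answer_dict instead.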

How to upload the dataset

The md file says to upload the data to data/BaiduIE_2020/.
For Baidu's DuIE data, is it enough to put the three JSON files there directly?
