guoday / ccf-bdci-sentiment-analysis-baseline
The code for CCF-BDCI-Sentiment-Analysis-Baseline
License: Apache License 2.0
/home/ming/anaconda3/lib/python3.7/site-packages/sklearn/metrics/classification.py:1439: UndefinedMetricWarning: F-score is ill-defined and being set to 0.0 in labels with no true samples.
'recall', 'true', average, warn_for)
test 0.06457949662369551
F1 looks normal during training, but the test score is low and this warning appears. I've been searching for a long time without solving it. What could the cause be?
Hello!
I want to run roberta-english, using RobertaForSequenceClassification, RobertaConfig, and RobertaTokenizer from pytorch_transformers.
But it raises: RuntimeError: CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling cublasCreate(handle)
What could be causing this?
Just curious; thanks for any explanation.
How is the fusion of the truncated BERT output segments handled, and where in the code is it implemented? Thanks for replying!
Could you explain how an article is fed into training after being cut into k segments? I couldn't find where the segments are "fed into the language model separately". If that is how it works, then in theory an article of any length could be processed by cutting it into many segments and feeding them in separately, without truncating the article at all, right?
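For reference, the splitting step being asked about can be sketched like this (split_tokens is a hypothetical helper written for illustration, not the repo's actual code): cut the token sequence into k contiguous chunks of roughly equal length, then run each chunk through the encoder as its own sequence.

```python
def split_tokens(tokens, split_num):
    """Cut a token list into split_num contiguous chunks of (near-)equal length.

    Hypothetical helper for illustration; not the repo's actual code.
    """
    chunk = (len(tokens) + split_num - 1) // split_num  # ceiling division
    return [tokens[i * chunk:(i + 1) * chunk] for i in range(split_num)]

tokens = list(range(10))          # stand-in for a tokenized article
chunks = split_tokens(tokens, 3)
print(chunks)                     # [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
```

Each chunk still has to fit the encoder's maximum sequence length, so arbitrarily long articles become feasible only at the cost of more forward passes per example, plus some way to fuse the per-chunk representations afterwards.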
In BertForSequenceClassification's forward:

    def forward(self, input_ids, token_type_ids=None, attention_mask=None, labels=None,
                position_ids=None, head_mask=None):
        flat_input_ids = input_ids.view(-1, input_ids.size(-1))
        flat_position_ids = position_ids.view(-1, position_ids.size(-1)) if position_ids is not None else None
        flat_token_type_ids = token_type_ids.view(-1, token_type_ids.size(-1)) if token_type_ids is not None else None
        flat_attention_mask = attention_mask.view(-1, attention_mask.size(-1)) if attention_mask is not None else None

For example with k=2, input_ids contains the article's two segments, but view flattens them again, so isn't the input length unchanged, the same as not splitting at all?
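Not the author, but the shape arithmetic may clear this up: with k segments the batch has shape [batch, k, seq_len], and view(-1, seq_len) folds the segment axis into the batch axis. The encoder therefore sees batch*k independent sequences of seq_len tokens each; the segments are not concatenated back into one long sequence. A sketch, assuming PyTorch (sizes are illustrative):

```python
import torch

batch, k, seq_len = 2, 2, 8           # e.g. split_num = 2, 8 tokens per segment
input_ids = torch.zeros(batch, k, seq_len, dtype=torch.long)

# The same reshape as in BertForSequenceClassification's forward:
# [batch, k, seq_len] -> [batch * k, seq_len]
flat_input_ids = input_ids.view(-1, input_ids.size(-1))

print(flat_input_ids.shape)           # torch.Size([4, 8])
```

So each forward pass is over a sequence no longer than seq_len; splitting trades one long sequence for several short ones in a bigger effective batch.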
I know it will run; the issue is just how long it takes. I really don't have that much compute, so I've given up on many models without even trying them. Kaggle has GPU resources and I'd like to try there, so I wanted to ask: how long does this model take you to train end to end?
If so, is the LSTM's input just the first output vector of BERT, or all of the output vectors?
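Not the author, but the two options being asked about can be sketched as follows (shapes only, assuming PyTorch; the hidden sizes are illustrative, not the repo's actual configuration):

```python
import torch
import torch.nn as nn

batch, seq_len, hidden = 2, 16, 768        # illustrative sizes
sequence_output = torch.randn(batch, seq_len, hidden)  # BERT's per-token outputs

lstm = nn.LSTM(hidden, 256, batch_first=True)

# Option A: feed ALL token vectors into the LSTM, one step per token
out_all, _ = lstm(sequence_output)
print(out_all.shape)                       # torch.Size([2, 16, 256])

# Option B: keep only the first token's vector (the [CLS] position),
# treated as a length-1 "sequence"
cls_vec = sequence_output[:, 0, :].unsqueeze(1)  # [batch, 1, hidden]
out_cls, _ = lstm(cls_vec)
print(out_cls.shape)                       # torch.Size([2, 1, 256])
```

With split-into-k-segments setups, a common third variant is an LSTM over the k per-segment [CLS] vectors (shape [batch, k, hidden]), which is what the fusion question above is about.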
While training, I noticed the eval_loss curve looked very strange. I then found that the original pytorch_transformers puts the computation of eval_loss inside with torch.no_grad, so I modified the code to:

    with torch.no_grad():
        tmp_eval_loss = model(input_ids=input_ids, token_type_ids=segment_ids, attention_mask=input_mask, labels=label_ids)
        logits = model(input_ids=input_ids, token_type_ids=segment_ids, attention_mask=input_mask)
    eval_loss += tmp_eval_loss.mean().item()

After that the curve looked normal. Could this have been the cause?
Could you test and compare the performance on XLNet_zh_Large?
(The current XLNet_zh_Large is an early-access release; if problems come up, we will help resolve them.)
Is the only difference between Roberta-large and Roberta-mid the parameters max_seq_length=512 and split_num=1? Can I get it just by editing the script directly?
Hi,
why does robert.sh use run_bert.py?
Is it because a RoBERTa model can be fine-tuned with the BERT code?
Also, I tried running an ALBERT pretrained model with run_bert.py, and it reports a torch size mismatch: one is [21128, 2048] and the other is (21128, 128).