Light

xiongma / chinese-law-bert-similarity Goto Github PK

View Code? Open in Web Editor NEW

137.0 6.0 29.0 50 KB

bert chinese similarity

License: MIT License

Python 100.00%

bert tensorflow nlp sentence-similarity deep-learning

chinese-law-bert-similarity's Introduction

How to use

Prediction

This project, I improve model which was trained, so you can download it, and use it to prediction!

this project just support every sentences with 45 char length
download model file, pwd: vv1k

just use like this

first

bs = BertSim(gpu_no=0, log_dir='log/', bert_sim_dir='bert_sim_model\\', verbose=True)

second

similarity sentences

text_a = '技术侦查措施只能在立案后采取'
text_b = '未立案不可以进行技术侦查'
bs.predict([[text_a, text_b]])

you will get result like this: [[0.00942544 0.99057454]]

not similarity sentence

text_a = '华为还准备起诉美国政府'
text_b = '飞机出现后货舱火警信息'
bs.predict([[text_a, text_b]])

you will get result like this: [[0.98687243 0.01312758]]

Parameter

name	type	detail
gpu_no	int	which gpu will be use to init bert ner graph
log_dir	str	log dir
verbose	bool	whether show tensorflow log
bert_sim_model	str	bert sim model path

Train

Code

In this project, I just use bert pre model to fine tuning, so I just use their original code. I try to create new one, but the new one just same as the original code, so I given up.

Dataset

Because of my domain work, my work is based on judicial examination education, so I didn't use common dataset, my dataset were labeled by manual work, it include 80000+, 50000+ are similar, 30000+ are dissimilar, because of the privacy, I can't open source of this dataset

Suggest:

In original code, they just got the model pool output, I think there may be other ways to increase the accuracy, I tried some ways to increase the accuracy, but I found one, just concat the [CLS] embedding of the fourth from bottom to tailender in encoder output list, if you want to use my way, just do like this。

Delete the following code

output_layer = model.get_pooled_output()

Use the following code, it can increase the accuracy 1%.

output_layer = tf.concat([tf.squeeze(model.all_encoder_layers[i][:, 0:1, :], axis=1) for i in range(-4, 0, 1)], axis=-1)

chinese-law-bert-similarity's People

Contributors

Stargazers

Watchers

chinese-law-bert-similarity's Issues

函数predict返回值说明

Hello, if I want to train on my own datasets, what is the format of datasets should I prepare? sentence1, sentence2, label every line? Thank you!

用的什么数据训练的模型呢？

如题

您好，请问能提供一下训练代码吗

关于predict_result =self.sess.run（）耗时偏长的问题

您好，非常感谢您的分享。但我在运行这个程序的时候，predict_result =self.sess.run（）这一步耗时比较长，平均每次大概在0.1s左右。感觉耗时偏长，不知道您对提高这个程序的运行效率有什么见解吗？

return: label, similarity

请问这里label表示的什么意思

请问一下训练数据个格式是什么？

是否分为3列：text_a,text_b,label,中间'\t'隔开？

data loss

hi, i meet this problem:
this line : saver = self.tf.train.Saver()
Error: DataLossError (see above for traceback): Checksum does not match: stored 2361353507 vs. calculated on the restored bytes 3689058997

maybe because the model file that i download is incomplete. I try to download it again, but still failed.

请问能发一下数据集的形式吗？就一到两个样本就行

关于output_layer 的形式

我看你说output_layer 是最后4层的连接，可以提高1%，但是代码里缺没有用，不知道是为什么呢？
output_layer = tf.concat([tf.squeeze(model.all_encoder_layers[i][:, 0:1, :], axis=1) for i in range(-4, 0, 1)], axis=-1)

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.