
duplicatechecking's Introduction

DuplicateChecking

A Simhash-based paper plagiarism-checking system

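Background

This project is an in-house plagiarism checker, meaning you need to prepare your own library of papers

It is an ancient undergraduate graduation project that still gets the occasional star to this day

This README is written entirely from memory, in the hope that it helps as much as possible

My advisor's requirements at the time were:

Some students plagiarize theses written by earlier graduates (especially those from the same major)

For some special reason, our university's theses are not added to the CNKI plagiarism-check database in a timely manner (but the advisor has copies; reason unknown)

Hence this project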

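Deployment

REM enter the project directory
cd D:\duplicateChecking\Flask
REM activate the virtual environment
venv\Scripts\activate
REM start the project
python start.py
  • The packages the project needs are already installed in the virtual environment, so in theory it can be run directly
  • To deploy directly (not recommended), install the required packages yourself as prompted
  • The project depends on MongoDB; make sure the service is running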
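
Since the project will not work without a running MongoDB service, a quick reachability check before starting can save some confusion. The following is a minimal sketch using only the standard library. It assumes MongoDB listens on its default port 27017 on the local machine; the helper name is_port_open is made up for illustration and is not part of this project. A successful TCP connection only shows that something is listening on that port, not that it is a healthy MongoDB instance.

```python
import socket


def is_port_open(host="127.0.0.1", port=27017, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within timeout.

    27017 is MongoDB's default port; adjust it if your instance uses another.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    if is_port_open():
        print("Port 27017 is reachable; MongoDB is probably running")
    else:
        print("Nothing is listening on 127.0.0.1:27017; start MongoDB first")
```

Run it once before python start.py; if it reports that nothing is listening, start the MongoDB service and try again.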

Startup

  1. Set the following in start.py
app.run(host='127.0.0.1', port=5000, debug=True)  # set the address and enable debug mode
  2. Open 127.0.0.1:5000 (the address set above) in a browser


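Usage

Just follow the steps in order

  • File upload has no visual feedback, which may be confusing: once the files to upload are selected, just click upload
  • The paper library is stored in /docs/lib after upload
  • Papers to be checked are stored in /docs/check after upload
  • After any change to the paper library, you must click "start initialization" again
  • "View results" only shows the result for the last paper (with a bit of UI)
  • Multiple papers can be checked at once; results are saved as txt files in /result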

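Other notes

The core of the project is three modules: dupl_ckg, flk_mdb, and web_mod

  • dupl_ckg is the duplicate-checking module, i.e. the Simhash algorithm
  • flk_mdb is a lightweight MongoDB ORM library
  • web_mod is the web operation module, e.g. uploads
  • /app/routes.py contains the routing rules
  • /app/templates contains the page templates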
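
This project has obvious flaws, plenty of bugs, and huge room for improvement

At the time I was mostly just trying to get through graduation: the kind of project where it only had to run once so I could take a screenshot. I rushed out a few bug fixes under my advisor's pressure, but that was a drop in the bucket

Still, the project has some practical value (serving the advisor and exposing misconduct by some students)

My sincere apologies to everyone who was troubled by the missing README (although writing one does not seem to help much either)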

duplicatechecking's People

Contributors

cuiyongen

duplicatechecking's Issues

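Does this actually run?

I can get to the home page, but clicking upload file fails with an error at web_mod.upload_file()
AttributeError: 'module' object has no attribute 'upload_file'

How to use

Could you please add a README explaining how to use this?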
