Giter Club home page Giter Club logo

aiospider's Introduction

👋 Welcome

I'm EINDEX, an Application Security Specialist at @Thoughtworks.

With a strong background in the field of application security, I specialize in various areas such as Threat modeling, Application Security Review, Python, and more.

Here are a couple of my notable projects:

You can connect with me through various channels:

aiospider's People

Contributors

eindex avatar pyup-bot avatar

Watchers

 avatar  avatar

aiospider's Issues

单点 Redis 内存溢出

单点 Redis 存放过量队列而且队列消耗速度过低的情况下,内存占用过多,导致在使用 bgsave 时,内存爆炸。

修复方案

  • 更换爬虫队列中间件
  • 优化 boolm 过滤器策略
  • 更换爬虫去重中间件

测试了使用 mongodb 做去重的方案,会使用非常大量的内存,导致其他服务可用内存过小,放弃此方案。
认为需要精准去重还是应该使用Redis Set 集合,只保留需要的尽量少的需要部分即可大量减少内存消耗。
如果粗略去重或,可接受一定的数据损失,可使用 bloom filter 。

回传队列不必回传所有数据,当 Worker 单元发生阻塞时,会导致至大量的内存浪费,致使 Redis 内存被爆无法使用备份。
认为需要拆分 Worker 单元的处理以及解析。

可能会使用 MySQL 的 innoDB 来处理这个问题。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.