mfsysmon's Introduction

mfsysmon

What is MetWork Framework?

MetWork Framework is an open source system for building and managing production-grade applications or micro-services (from development to operations).

What is MFSYSMON?

MFSYSMON is the MetWork Framework "SYStem MONitoring" module. It is a small agent that collects system metrics (RAM, CPU, disk, network...) and sends them to an MFADMIN module, which stores them.

Cheatsheet

A cheatsheet for this module is available here

Reference documentation

For very specific use cases, you might be interested in the reference documentation for the integration branch.

And if you are looking for an old released version, you can search here.

Installation guide

See this document.

Configuration guide

See this document.

Contributing guide

See the CONTRIBUTING.md file.

Code of Conduct

See the CODE_OF_CONDUCT.md file.

Sponsors

(If you are officially paid to work on MetWork Framework, please contact us to add your company logo here!)

mfsysmon's People

Contributors

dearith, mergify[bot], metworkbot, thebaptiste, thefab

mfsysmon's Issues

Error while starting mfsysmon - circus watcher for telegraf

With MetWork v2.2 on Rocky Linux 8.7, starting mfsysmon fails: an ERROR status is displayed at the "Waiting for start of telegraf" step:

- Waiting for start of telegraf...                         [ ERROR ]

It appears that the command below returns the status error:

_circusctl --endpoint ipc:///home/mfsysmon/var/circus.socket status "telegraf"
error

In version 2.1, the same command returns the expected value, active.

See the wait_watcher_started method in file circus.py https://github.com/metwork-framework/mfext/blob/master/adm/circus.py

The call stack is:
mfsysmon.start ->
_circus_wait_watcher_started.sh telegraf ->
layer_wrapper --layers=python3_circus@mfext -- _circus_wait_watcher_started.py telegraf ->
client.wait_watcher_started(name='telegraf', timeout='20') (MetWorkCircusClient from circus.py) ->
_circusctl --endpoint ipc:///home/mfsysmon/var/circus.socket status "telegraf" ->
layer_wrapper --layers=python3_circus@mfext -- python -m circus.circusctl --endpoint ipc:///home/mfsysmon/var/circus.socket status "telegraf" ...
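
The call stack above ends in a loop that repeatedly asks circus for the watcher status until it reads active or a timeout expires. The sketch below illustrates that polling pattern only; it is not the actual MetWorkCircusClient code from circus.py. The helper names (circusctl_status, wait_watcher_started as a free function), the 0.5 second polling interval, and the subprocess-based status lookup are assumptions made for illustration. The command line itself is the one quoted in this issue.

```python
import subprocess
import time
from typing import Callable


def circusctl_status(endpoint: str, watcher: str) -> str:
    """Hypothetical helper: run the _circusctl command quoted in this issue
    and return its output (for example "active" or "error")."""
    result = subprocess.run(
        ["_circusctl", "--endpoint", endpoint, "status", watcher],
        capture_output=True,
        text=True,
    )
    return result.stdout.strip()


def wait_watcher_started(get_status: Callable[[], str],
                         timeout: float = 20.0,
                         interval: float = 0.5) -> bool:
    """Poll get_status() until it returns "active" or the timeout expires.

    Returns True if the watcher became active in time, False otherwise.
    The status source is injected so the loop can be exercised without circus.
    """
    deadline = time.monotonic() + timeout
    while True:
        if get_status().strip() == "active":
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

With this shape, a status that stays at error (as reported for v2.2) makes the function return False once the timeout is reached, which matches the [ ERROR ] shown at startup. A live call might look like wait_watcher_started(lambda: circusctl_status("ipc:///home/mfsysmon/var/circus.socket", "telegraf")).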

Add a custom netio telegraf collector like custom_diskio

The Python function to call:

https://psutil.readthedocs.io/en/latest/#psutil.net_io_counters

We are interested in:

bytes_sent_per_second
bytes_recv_per_second
packets_sent_per_second
packets_recv_per_second
errin_per_second
errout_per_second
dropin_per_second
dropout_per_second
dropin_percent = 100.0 * dropin_per_second / packets_recv_per_second (clamped between 0 and 100)
dropout_percent = 100.0 * dropout_per_second / packets_sent_per_second (clamped between 0 and 100)
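
The metrics listed above can be derived from two successive psutil.net_io_counters() snapshots. The following is a minimal sketch, not the collector that ships with mfsysmon. The function names, the NetSample tuple, and the edge-case handling are assumptions: a counter that goes backwards is treated as a zero delta, and a percentage with a zero denominator is reported as 0.0. The metric keys follow psutil's field names (packets_sent, packets_recv).

```python
import time
from collections import namedtuple

# Same field names as the tuple returned by psutil.net_io_counters().
FIELDS = ("bytes_sent", "bytes_recv", "packets_sent", "packets_recv",
          "errin", "errout", "dropin", "dropout")
NetSample = namedtuple("NetSample", FIELDS)


def _clamped_percent(part: float, whole: float) -> float:
    """Return 100 * part / whole, clamped to [0, 100].

    Assumption: a zero denominator yields 0.0 rather than an error.
    """
    if whole <= 0:
        return 0.0
    return max(0.0, min(100.0, 100.0 * part / whole))


def compute_netio_rates(prev, curr, elapsed: float) -> dict:
    """Compute per-second rates and drop percentages between two samples."""
    if elapsed <= 0:
        raise ValueError("elapsed must be positive")
    rates = {}
    for field in FIELDS:
        # Assumption: a negative delta (counter reset) is treated as zero.
        delta = max(getattr(curr, field) - getattr(prev, field), 0)
        rates[field + "_per_second"] = delta / elapsed
    rates["dropin_percent"] = _clamped_percent(
        rates["dropin_per_second"], rates["packets_recv_per_second"])
    rates["dropout_percent"] = _clamped_percent(
        rates["dropout_per_second"], rates["packets_sent_per_second"])
    return rates


def sample_rates(interval: float = 1.0) -> dict:
    """Take two live snapshots with psutil, interval seconds apart."""
    import psutil  # imported here so the pure function above has no dependency
    prev = psutil.net_io_counters()
    start = time.monotonic()
    time.sleep(interval)
    curr = psutil.net_io_counters()
    return compute_netio_rates(prev, curr, time.monotonic() - start)
```

Keeping the arithmetic in compute_netio_rates, separate from the psutil call, makes it possible to check the clamping and rate logic with fixed samples.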

cpu load issues

Every 7 hours, we see CPU load spikes (above 6) on both of our servers, as shown in our Grafana (mfadmin) screenshot:
[Screenshot: Grafana (mfadmin) CPU load graph]

The threshold is 0.5, so we received alerts from our Shinken probe:

Problem started at 07:21:35 on 2023.03.23
Problem name: Load average is too high (per CPU load over 1.5 for 5m)
Host: ***************
Severity: High
Operational data: Load averages(1m 5m 15m): (6.57 4.09 1.9), # of CPUs: 2
Original problem ID: 309747841

Related articles:

Analysis is in progress. The spikes seem related to the way telegraf collects system metrics in the mfsysmon module.

This issue is open for follow-up; please add your comments below.
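
To make the numbers in the alert easier to read, here is a small sketch of a per-CPU load check. It assumes that "per CPU load" means the load average divided by the number of CPUs; how the monitoring probe actually computes it is not stated in this issue. The function names are hypothetical. os.getloadavg() and os.cpu_count() are standard library calls (the former is available on Unix only).

```python
import os


def per_cpu_load(load_averages, cpu_count: int):
    """Divide each load average (1m, 5m, 15m) by the number of CPUs."""
    if cpu_count <= 0:
        raise ValueError("cpu_count must be positive")
    return tuple(load / cpu_count for load in load_averages)


def load_is_too_high(load_averages, cpu_count: int,
                     threshold: float, window: int = 1) -> bool:
    """Check one window (0 = 1m, 1 = 5m, 2 = 15m) against a per-CPU threshold."""
    return per_cpu_load(load_averages, cpu_count)[window] > threshold


def current_per_cpu_load():
    """Read the live values on a Unix host."""
    return per_cpu_load(os.getloadavg(), os.cpu_count() or 1)
```

With the values from the alert, load averages (6.57, 4.09, 1.9) on 2 CPUs, the 5-minute per-CPU load is about 2.05, which exceeds the 1.5 limit quoted in the problem name.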

mfsysmon does not work

The mfsysmon.start command is not found.

This is a PATH problem: no "mfsysmon" layers are loaded after a fresh RPM install.
