Giter Club home page Giter Club logo

prometheus_server_role's People

Contributors

blalop avatar caldito avatar cfsworks avatar dortegau avatar emepege avatar jdvr avatar jmonterrubio avatar jperera avatar lidiaseldas avatar marcelogalmor avatar vicsufer avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

prometheus_server_role's Issues

Should we still support Prometheus v1?

Prerequisites

Description

Prometheus 2 was released on November 2017 and could be interesting to discontinue our support to previous versions (in order to have the role cleaner and make the Travis tests faster). Plus, maybe we should update the role default version to the most updated one (2.6.1 at this moment) as well

Use files folder instead of templates for rules

Prerequisites

Description

Right now the rules are being copied to the host using the 'copy' module instead of prev used 'template'.
So it's not neccessary to still use the templates folder

Expected behavior: Use files folder (at least change to prometheus_rules_files_path) in the copy/rsync rules task

Actual behavior: Copy/rsync task is using prometheus_rules_templates_path

Reproduces how often: Always

Versions

1.3.1

[FEATURE] Remote write tuning support

Description

It would be great to add support for making tuning when you configure a remote write endpoint to storage data in a third-party long-term storage.

Why is this needed?

Basically, for memory usage tuning.

Rise against the reload timeoout

Prerequisites

Description

Sorry for the catchy title. Sometimes the reload handler fails, but it actually succeeds reloading Prometheus. Problem is that, in big instances, sometimes it takes more than 30 seconds to respond. Rising the timeout will fix the problem.

[FEATURE] Automatically remove old scrape files

Description

[Description of the issue]

Automatic removal of old scrape files that are not in the playbook anymore.

Why is this needed?

[A clear and concise description of why do you think this is needed]

If you want to remove an old scrape file from your prometheus server, you have to manually remove it in the server.
Removing it from your playbook and running it is not enough.
With this implementation, that process would be made by ansible by running your playbook.

Additional Information

[Any additional information, configuration or data.]

[BUG] Change prometheus user shell path in /etc/passwd

Description

To solve the idempotency of abv-monitor-playbook it is necessary to match the path of the prometheus user shell as the rest of the roles on which it depends:

  • Change /sbin/nologin to /usr/sbin/nologin and set it as a variable.

Why is this needed?

So the role is updated along all the roles

Additional Information

N/A

Add test-requirements.txt file

Prerequisites

Description

Add test-requirements.txt in order to make easier the tests launch. In additon, you could launch the tests with pipenv.

Steps to Reproduce

Expected behavior: [What you expect to happen]

Actual behavior: [What actually happens]

Reproduces how often:

Versions

1.4.0

Additional Information

Any additional information, configuration or data that might be necessary to reproduce the issue.

[FEATURE] Use promtool to check syntax errors before the copy

Description

Currently, this role does not provide any mechanism to check if the Prometheus config files are correct. They can be checked in Molecule's verify with promtool, but this option does not provide enough information as the Prometheus service won't boot, failing the playbook in the converge stage.

It would be interesting to check those files with promtool before the copy.

Why is this needed?

In order to provide more descriptive information about bad configured alerts and etc

Additional Information

It would be interesting to try promruval

Stuck in reload handler after rule changes

Description

In some servers it gets stuck when reloading after a change in the rules. It can be solved by stop using the reload handler and only use the restart handler, which works well.

Steps to Reproduce

  1. Playbook with changes in rules
  2. When it tries to reload does not end

Expected behavior:
To work after changing the rules

Actual behavior:
Gets stuck reloading after change in rules

Reproduces how often:
Only on some servers

Environment

  • The release version/s you are using: 1.10.1
  • OS: debian 9/10

Additional Information

N/A

Use docker provider in molecule tests

Prerequisites

Description

The current version of this role uses vagrant for testing. We must move to docker in order to standarize this role. And, by the way, we'll upgrade molecule version and rename role in order be Galaxy compliant.

Versions

All.

Templates with custom jinja delimiters instead of copy for rules

Description

Templates with custom jinja delimiters instead of copy for rules

Why is this needed?

To use conditionals using jinja templates. E.g. different contents in rules file depending if the environment is production or staging

Additional Information

N/A

Prometheus overwrites log file when restarted

Prerequisites

Description

Due to the ">" in https://github.com/idealista/prometheus_server-role/blob/master/templates/prometheus.service.j2#L11, every time Prometheus is restarted it overwrites prometheus.log. It should be replaced with ">>" so it appends the new output to the previous log file.

Steps to Reproduce

  1. Watch prometheus.log file.
  2. Restart Prometheus with systemctl restart prometheus.service
  3. Watch prometheus.log file.

Expected behavior: See both restarts.

Actual behavior: Only last restart appears.

Reproduces how often: Always

Versions

All.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.