Giter Club home page Giter Club logo

Comments (10)

JayCesar avatar JayCesar commented on July 30, 2024

What is a Metric?

In a nutshell, it is a number in and a timestamp!

Metric are numerical values that can track anything about your envirorment over time, from latency to error rates to user signups.

image

In practice we need to collect a lot of them! in order to make sense!

image

This peak can indicate an anomaly.

Reasons to collect metrics

  • To get an idea of ​​a normal scenario;
  • To prevent problems before they snowball;
  • Have a proactive attitude (we don’t need to wait until a problem appears)
    image

By using metrics I can spend less money, wake up less for on call, have less fire drills, happier costumers etc.

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

Metric Walk Through

image

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

What is Monitoring?

  • Just because we can alert someone does not mean that we shold alert someone

- Meaning: Monitoring is the act of paying attention to the patterns that your metrics are telling you. It's about analyzing your data and acting on it.

image

What do we Monitor?

- Performance: by watching performance we can match how our architecture and applications are using the resources that are available.

- Security: Is something going wrong in our environment? Creating monitors around security metrics can stop incidentes in their tracks.

- Usage: How application code is actually functioning

image

Whom do we alert?

image

threshold = a point at which something starts

image

- It's important to only alert team members when something actionable needs to be done.

  • Be careful with the alerting strategy
  • If everything is an aemergency, nothing is an emergency

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

Monitoring Walk Through

I can set up the metric from the dashboard:

image

Tip: think the alert as a software

And I can set up an e-mail to it:

image

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

What is a Log?

A log is usually a bukly piece or length of a cut or fallen tree.

image

Just kidding...

A log is a computer generated file that contains information regarding the usage of a system, This gives you insight into the behaviour of the resource.

image

πŸ“It is a file filled with the history of that that computer / application / resource has been doing.

Why do we collect logs?

  • Complicance: just because we have to. Standards that the business is held to might dictate which logs you will need to store and for how long you need to store them. (Following the rules)
  • Insight: Logs can give you insight into application and system performance that metrics by themselves might not be able to provide. It means they are beyond metrics.
  • Security: This is priority for businesses. Logs are needed to demonstrate that only authorized activities are going on inside of a system.

Practical uses for logs

  • Troubleshooting
  • Auditing
  • Monitoring (monitoring the logs) -> Datalog helps with it;
  • By using logs I can add Alerting based on their content
  • Personal History
  • Not getting in trouble (I can prove something by showing the logs)

_Obs: Computer troubleshooting is the process of diagnosing and solving computer errors or technical problems.

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

Storing our Logs

What kind of services generate Logs?

  • Where else did we get Logsfrom?

image

Everything done in cloud is tracked!

How long do we store Logs?

Thre are three guidances:

  • Compliance: Standards taht business is held to might dictate how long your longs need to be stored.
  • Usefulness: Some logs are more helpful than other. It's up to you decide which logs need to be stored for whatever length of time is useful to you.
  • Cost: Store costas money. Dpending on the services you're using you'll want to keep in mind your budget when deciding on storage length. (Measured service?)
    • A good pratice is to ask myself: "is this Log useful? If it is, how long it will be useful for?

How do we consolidate our Logs?

image

Curiosity: (From ChatGPT)
Datadog, a monitoring and analytics platform for cloud applications, is named to evoke the idea of a vigilant, loyal, and reliable guard dog. The name "Datadog" suggests the platform's role in keeping a watchful eye on data, ensuring that systems are running smoothly, and alerting users to any issues. Just as a guard dog is trusted to protect and notify its owner of potential problems, Datadog is designed to provide comprehensive monitoring and alerting for IT infrastructure and applications.

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

Logging Walk Through

image

image

It is important to know that Logs can not always tell me how to fix a possible problem, but they can alert me the problem I need to investigate

By using DataDog I can extract the context from a specific log

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

What is a Trace?

A trace is used to track the time spent by an application processing a request along with the execution path taken.

  • e.g A trace would be both a path to a database along with how long it takes to get there, when a application calls a database.

What is a Span

A Span is the individual unit of work that the code is doing.

  • I can think of span as a portion of the veral trip.

image

Why do we collect Traces?

- Microservices: As businesses migrate away from Monolithic architecture, tracing is needed to figure out what all of the microservices ar up to.

- Optimization: Tracing allows you to optimize the performance of your applications by identifying bottlenecks in the calls being made.

- Troubleshooting: When something goes wrong, we need insight into the actual application code. This can assist us in tracking down errors with the code.

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

Tracing Walk Through

image

Datadog can tie traces, metrics and logs togheter!

from cloud.

JayCesar avatar JayCesar commented on July 30, 2024

Summary:

image

image

from cloud.

Related Issues (4)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.