Give BigPanda
a try
GET STARTED

anomaly detection - 5 Matches

A Practical Guide to Anomaly Detection for DevOps

A Practical Guide to Anomaly Detection for DevOps

Anomaly detection for monitoring has been a trending topic in recent years. And while the math behind it is fascinating, too much of the discussion has revolved around histograms, moving averages and standard deviations. More discussion needs to happen around its practical applications, and for that reason, this practical guide to anomaly detection will attempt to provide an actionable overview of current off-the-shelf anomaly detection tools.

Read more »
Nagios email alerts are bad for you

Nagios email alerts are bad for you

One of the first things we do right after installing Nagios, is set up email notifications. Without that, how would you know when something went wrong?

Read more »
Stop Managing Ops Incidents with Jira or Zendesk

Stop Managing Ops Incidents with Jira or Zendesk

In many ways, incident management for devops is similar to typical issue tracking processes: it facilitates coordination and collaboration of daily tasks. For this reason, tools such as Jira, Zendesk, and even email are often used as solutions for incident management. But incident management faces one unique challenge that makes it different from other issue tracking processes. In addition to human-operated workflows, incident management also relies heavily on machine-driven workflows. Unfortunately, traditional issue trackers and ticketing systems cannot accommodate for this with their current product mechanics.

Read more »
4 Ways to Combat Non-Actionable Alerts

4 Ways to Combat Non-Actionable Alerts

Many alerts place an unnecessary burden on Ops teams instead of helping them to solve issues. The main problem is that most alerts are not actionable enough:

  • They point to issues that don’t require a response
  • They lack critical information, forcing you to spend time searching for more insights in order to gauge their urgency
Read more »
Building a Fast Ops Incident Dashboard

Building a Fast Ops Incident Dashboard

Few things damage productivity as much as waiting. Waiting forces us to context switch, disrupts our creative momentum and eliminates our ability to experiment. Whether we are deploying a new service or troubleshooting a problem, waiting puts a heavy tax on efficient work. 

Read more »