Algorithmic Alert Correlation

BP’s core competency; any posts relating to alert correlation or BP platform’s handling of alerts.

Until DevOps becomes NoOps, there’s Service Health Analytics

By |2018-04-17T18:23:51+00:00October 5th, 2015|Blog|

We’re adjusting to the new reality that DevOps is a compelling layover on the journey between legacy ops and self-healing infrastructure. Eliminating the cultural gap between developers and operations, the now-cliched state of IT nirvana called “DevOps”, is by no means the end goal. The goal is reliable system performance and availability without human intervention - the panacea called “NoOps”.

Hey Silicon Valley, you’re wrong about “Data Science” and “Machine Learning”

By |2018-04-17T18:59:14+00:00August 31st, 2015|Blog|

Tsunami detection. Crop dusting. Biohazard monitoring. What may sound like innuendos in the next EL James novel are also fields being revolutionized by quant jocks and smart algorithms. And yet, despite all the innovation, we technorati continue to bastardize the terms “data science”, “machine learning,” and “big data”. They’ve become lazy speak for “we’re not sure what we’re doing so we’ll hand wave cliches until we have real technology and a business model."

Why DDoS attacks aren’t just a security problem… and monitoring traffic isn’t the solution – Part One

By |2018-04-17T18:23:44+00:00October 16th, 2015|Blog|

Every company’s a target, every customer’s at risk. But the now-cliched threat of data breaches from Distributed Denial of Service (DDoS) attacks obscures a bigger threat: outages that impact not just data integrity but also profitability, brand equity, and customer retention. 

The volume of attacks is growing and so is the impact of down time. According to Akamai’s most recent State of the Internet report, DDoS attacks are a bigger threat than ever before. “The number of DDoS attacks continued to increase substantially in Q2 2015, more than doubling the number observed in Q2 2014.”

How to Use the 80/20 Rule to Turn Noisy Alerts into Actionable Intelligence

By |2018-04-17T18:23:40+00:00October 26th, 2015|Blog|

If you work in tech, you’ve probably heard of the Pareto principle, or, as it’s more commonly called, the 80/20 rule. According to the 80/20 rule, for many events, 80 percent of the results are generated by 20 percent of the inputs.

A little background: back in the late 1800s the Italian economist Vilfredo Pareto noticed that approximately 80 percent of the land in Italy was owned by 20 percent of the population. Not long after, Pareto also observed that 20 percent of the peapods in his garden generated 80 of the crop’s yield – and thus the 80/20 principle was born. 

15 hours of down time… avoided: part two of a two-part series

By |2018-04-17T18:22:53+00:00October 31st, 2015|Blog|

This is part two of a two-part post about using event correlation to thwart DDoS attacks. Channeling Mark Twain: it would have been shorter if I had more time. In the last post I described why DDoS attacks for SaaS providers are no different than performance and availability issues experienced in other domains like healthcare, finance, or retail. In this post I’ll share a customer story about a security breach that never happened… thanks to a savvy DevOps team and data science.

Key takeaways from DevOpsDays Silicon Valley

By |2018-04-17T18:22:57+00:00November 12th, 2015|Blog|

In between sessions at last weekend’s DevOpsDays Silicon Valley, scores of attendees filled the halls, amplifying the Computer History Museum with chatter and turning it into something more akin to a high school cafeteria than a conference venue. As crowds formed to share their stories and insights with one another, a common theme quickly emerged: It just isn’t as easy as we thought it would be.

Part 1 of 2: The reason why Nagios is so noisy – and what you can do about it

By |2018-04-17T18:23:04+00:00December 1st, 2015|Blog|

If you’re struggling with a flood of Nagios alerts, this two-part blog series is for you. We’ll take a close look at the complicated relationship that IT and Ops professionals have with the monitoring tool, explain why Nagios is so noisy, and discuss the simple way that you take charge of your alerts and maximize the way Nagios works for you.

Key takeaways from AppSphere15

By |2018-04-17T18:22:20+00:00December 10th, 2015|Blog|

Software everywhere or software nowhere?

It was just last week, at AppSphere15 in Vegas, that I sat amongst a packed house at David Kirkpatrick’s keynote, staring up at these words in his deck.

“Software is increasingly everywhere”, Kirkpatrick explained, “but it’s so seamless that you don’t even see it. You just enjoy new efficiencies and ways of getting things done”.

Integrate and correlate your Catchpoint alerts with BigPanda

By |2018-04-17T18:21:07+00:00January 13th, 2016|Blog|

We’re happy to announce that BigPanda now integrates with Catchpoint! Catchpoint is a popular cloud-based monitoring tool used by ops teams to measure availability and performance for synthetic transactions and real user web sessions. By integrating with BigPanda, Catchpoint customers can now aggregate all of their monitoring alerts in one place, intelligently clustering them to reduce alert noise and spot critical issues faster.