We’re excited to announce that the definitive pulse of IT monitoring is back.The State of Monitoring 2017 Report is now available! In 2016, we launched the report to shine a light on the ever-changing and diverse landscape of IT monitoring. We wanted to explore how IT practitioners are tackling the challenges and opportunities posed by the increasing agility and complexity of IT applications and infrastructure.
Is your team ready for 2017? Featuring early release findings from BigPanda’s forthcoming State of Monitoring Report, our latest e-book takes a look at how key industry trends will affect IT operations in the upcoming year.
In the latest episode of our MonitoringScape Live podcast series, we sit down with Jason Dixon, VP Product Strategy at Raintank, founder of Monitorama, and author of "Monitoring with Graphite: Tracking Dynamic Host and Application Metrics at Scale". Jason is joined by Adrian Cockcroft, Technology Fellow at Battery Ventures, former lead architect for Netflix, and author of the best-selling book, "Sun Performance and Tuning".
Those of us lucky enough to have attended the August MonitoringScape meetup at BigPanda with Adrian Cockcroft enjoyed the master of performance tuning as he waxed philosophical about why monitoring isn’t a solved problem after twenty plus years honing our craft. Spoiler-alert: it’s difficult to solve a problem that keeps changing.
Decompressing from an exhausting, inspirational few days at Knowledge16, the annual ServiceNow event...
From humble beginnings (my first Knowledge was a few hundred attendees in a tent in San Diego), Knowledge has become a global tour de force. This year, Mandalay Bay could barely contain more than 11,000 customers and partners (and the expo hall could barely contain more than 100 decibels of the tech equivalent of Queensryche). Getting into the keynote felt like rush hour on the subway in midtown Manhattan.
To maintain operational visibility in modern IT environments, companies are abandoning monolithic monitoring solutions from legacy vendors in favor of a modern set of “best of breed” monitoring tools. Today’s average IT monitoring stack consists of about 6-8 tools, including at least one from each of the following categories: systems monitoring, end user monitoring, application performance monitoring (APM), error detection, log analytics, chat, and ticketing. When service disruptions occur, operations engineers face a flood of alerts across different layers of the IT stack, with no fast way to figure out what’s really going on. Customers are left stranded, while IT professionals struggle to detect, triage and remediate urgent issues. Downtime abounds which negatively impacts revenue, performance, and brand loyalty.
SREcon16 is a wrap, and our team had a blast at this year’s event! Both days were non-stop action: demos, discussions, and - of course - handing out our fair share of panda swag. Between the buzz on the floor and in the sessions, what topics were top of mind at this year’s show? Here are our three key takeaways:
When we released the first version of MonitoringScape six months ago, we set out to help IT professionals navigate the ever-growing and evolving world of modern monitoring. Since then, this landscape has only continued to proliferate and expand. Every month, promising new tools are popping up across a broad range of monitoring specialties – and we here at BigPanda are determined to keep MonitoringScape up-to-date to provide you with a go-to source for discovering and researching new solutions.
In that vein, we’re happy to announce the newest version of MonitoringScape, which includes more than 35 updates!
As a TechOps community, we’re awash in buzz words. Most are initially used to establish geek credibility yet quickly become cliches. Take, for example, the term “DevOps”. From its inception as a Twitter hashtag used to promote a meetup in bucolic Ghent, Belgium in 2009, it began to be co-opted months later by ops teams around the world aspiring to manage infrastructure with code.
Software everywhere or software nowhere?
“Software is increasingly everywhere”, Kirkpatrick explained, “but it’s so seamless that you don’t even see it. You just enjoy new efficiencies and ways of getting things done”.
It was a whirlwind couple of days, but FutureStack15 is in the books! I’m sure I speak for most of the BigPanda team when I say that a weekend of rest was welcomed after the long (but exciting!) days at the show. Between demos, conversations with clients and prospects, and even a surprise visit from Weird Al – there was hardly a moment of downtime. But in our world, who likes downtime? (Excuse the terrible pun).
In case you missed it, here’s a recap of some of the key themes discussed at the show:
If you work in tech, you’ve probably heard of the Pareto principle, or, as it’s more commonly called, the 80/20 rule. According to the 80/20 rule, for many events, 80 percent of the results are generated by 20 percent of the inputs.
A little background: back in the late 1800s the Italian economist Vilfredo Pareto noticed that approximately 80 percent of the land in Italy was owned by 20 percent of the population. Not long after, Pareto also observed that 20 percent of the peapods in his garden generated 80 of the crop’s yield – and thus the 80/20 principle was born.
Rishi is too humble to be the CIO of a Fortune 100 bank, too busy to be the father of four, too accomplished to blog about ice cream, and too educated to love John Gray. Mostly, he's too unpredictable to fit stereotypes and too passionate about everything he does to do anything at less than full throttle.
I met Rishi this week at the Pacific Crest Global Technology Leadership Forum in Vail where he was presenting and I was lucky to be in the audience. We spent an hour together before his talk that inspired me to rescue Nepalese orphans... and eat more ice cream.
Rishi's been an IT leader since before we called it that. He has helped organizations grow and shrink and grow again. He's more scared about the state of IT today than he has ever been.
Here are excerpts from the discussion...
Data center growth over the last 15 years has created significant growing pains in terms of data center management. Tasks that once could be done manually by IT teams have hit the limits of scalability, cost, and efficiency. The key to enabling IT to meet these challenges involves one key theme: automation.
Anomaly detection for monitoring has been a trending topic in recent years. And while the math behind it is fascinating, too much of the discussion has revolved around histograms, moving averages and standard deviations. More discussion needs to happen around its practical applications, and for that reason, this practical guide to anomaly detection will attempt to provide an actionable overview of current off-the-shelf anomaly detection tools.
We engineers love measuring stuff. Whether it helps us solve an immediate problem, gets us ready for a bad day or just because most of us are information junkies, we love keeping track of metrics. The spectrum of what can be measured is very wide. It can include data from every part of our system: from technical metrics such as disk space or RPM, through UI metrics like page load times, to business KPIs such as revenue, conversion rates and so on. When choosing which metrics to collect, we usually start with the obvious ones: those that reflect the current state of the system (e.g., CPU, memory and load). There are quite a few articles and blog posts about these metrics, so I’m not going to discuss that here. Rather, I would like to focus on metrics that reflect the user experience.
Here are the four metrics that we at BigPanda see as the most important in this category: