BigPanda is an incident management platform for IT, NOC, and DevOps teams. Organize, prioritize and triage your incidents faster and more intelligently than ever before. Vastly improve your team's collaboration around Ops alerts and events. The following guide is the first in our series on getting started with BigPanda's incident feed. This BigPanda product introduction will help you to get up and running quickly so you can get back to fixing the world's broken stuff.
Join us next Wednesday for a live introduction to BigPanda. Automatically consolidate alerts and take control of your alert madness. See critical incidents faster. Correlate alerts and deployments. Streamline NOC collaboration and resolve issues faster.
For those of you who are not familiar with Jenkins, it's a dead simple open sourced Continuous Integration solution, which takes absolutely no time to set up. Jenkins has a vibrant ecosystem and community, and until recently, Jenkins only had 999 plugins available...
Data center growth over the last 15 years has created significant growing pains in terms of data center management. Tasks that once could be done manually by IT teams have hit the limits of scalability, cost, and efficiency. The key to enabling IT to meet these challenges involves one key theme: automation.
BigPanda is an incident management platform for modern IT, Ops, and DevOps teams. With BigPanda, you will prioritize and route your incidents better and faster, while vastly improving your team's collaboration and processes. This is part 2 in a series on Getting Started with BigPanda. This guide will help you get up and running quickly and maximize the value you get out of the platform.
BigPanda is an incident management platform for modern IT, NOC and DevOps teams. With BigPanda, you will prioritize and route your incidents better and faster, while vastly improving your team’s collaboration and processes. This is part 3 in a series on Getting Started with BigPanda. This product introduction will help you to get up and running quickly so you can get back to hunting fail-whales and 404 errors.
BigPanda is an incident management platform for modern Ops environments. With BigPanda, you will prioritize and assign your incidents better and faster, while vastly improving your team’s collaboration and processes. This is part 4 in a series on Getting Started with BigPanda. This guide will help you get up and running quickly and maximize the value you get out of the platform.
The last ten years have brought enormous changes to production environments, driven by a best-of-breed approach to production infrastructure enabled by open source and cloud. This has been a boon for developers in terms of flexibility and productivity, but it’s also placed a new set of challenges and expectations on Ops.
CONNECT ALL THE THINGS! Here at BigPanda we are constantly working on adding new monitoring systems to our arsenal of out-of-the-box integrations. We already provide integration with all of the most popular monitoring systems & services. Nagios, Zabbix, Zenoss, New Relic, AppDyamics, CloudWatch, Pingdom are all there. And there's many more – this list gets longer with every week that passes. These out-of-the-box integrations from BigPanda have many advantages:
Ansible is a great automation tool. We use it for server provisioning, application deployments and running maintenance scripts. One problem it does have however, is how (in)convenient it is to run playbooks as opposed to regular shell scripts. Write and run enough Ansible playbooks, and eventually you’ll get tired of the repetitive typing your fingers have to do.
It's been a busy summer! At BigPanda, we're reinventing Incident Management for Ops and that takes a kickass team. Thanks to your enthusiasm for what we're doing, we've doubled since April (including a strategic hire directly from Atlassian's JIRA). To help accommodate all this growth, we just moved into our brand new office in Mountain View, CA. Want to join the BigPanda team? We'd love to have you! Email us at email@example.com.
Last week, I changed the color of the GET A FREE ACCOUNT button on the BigPanda website and it resulted in a dramatic improvement in signups. But it wasn’t my idea. It was the data’s idea. Data makes great decisions. We don’t. Any of us. Leadership is a science, not an art. The last decision you should make is to never make another decision. Data makes the only great decisions. Get out of its way. If you've ever met me you'll know that I have a strong passion for data and the science of leadership. 72% of the people that I tell this to want to hear more of the story. So here it is...
Modeling your production environment correctly is very important for development. Developers need to be able to run and test their code locally for the development process to be efficient, and many times this requires setting up infrastructure that exists in production on their local machines. The basic solution is a simple Vagrant box containing all your infrastructure and application code, like the one we mentioned in our Devbox post.
Monitoring applications in production has never been easier. With only a few code lines, you'll have New Relic installed and monitoring your application from nearly every angle. When something goes wrong, New Relic will start sending alerts. But then what? (hint – New Relic and BigPanda together is the answer).
Anomaly detection for monitoring has been a trending topic in recent years. And while the math behind it is fascinating, too much of the discussion has revolved around histograms, moving averages and standard deviations. More discussion needs to happen around its practical applications, and for that reason, this practical guide to anomaly detection will attempt to provide an actionable overview of current off-the-shelf anomaly detection tools.
We engineers love measuring stuff. Whether it helps us solve an immediate problem, gets us ready for a bad day or just because most of us are information junkies, we love keeping track of metrics. The spectrum of what can be measured is very wide. It can include data from every part of our system: from technical metrics such as disk space or RPM, through UI metrics like page load times, to business KPIs such as revenue, conversion rates and so on. When choosing which metrics to collect, we usually start with the obvious ones: those that reflect the current state of the system (e.g., CPU, memory and load). There are quite a few articles and blog posts about these metrics, so I’m not going to discuss that here. Rather, I would like to focus on metrics that reflect the user experience.
Here are the four metrics that we at BigPanda see as the most important in this category:
As the world of software development is moving to BigPanda, ahmm... sorry, I meant Big Data, we need to pay attention to synchronization, and it's becoming one of the most difficult challenges we face. Here at BigPanda, we receive a lot of alerts from many monitoring systems, and naturally we want to keep the data synchronized and consistent.
One of the first things we do right after installing Nagios, is set up email notifications. Without that, how would you know when something went wrong?
As a part of our DevOps mentality here at BigPanda, we believe in the ancient saying Automate All the Things™. We also believe that each developer should have a standalone offline available version of our applications to work on at any time (regardless of, say, commute connectivity problems). Instead of having each developer carry around a small data center in their backpack, we're using something we call a DevBox. The DevBox is a Vagrant box packed with goodies that are our basic infrastructure needs, such as redis, mongodb, rabbitmq and other stuff. Vagrant is a small layer of automation that wraps your favorite hypervisor/container, available as apt-get/brew or even a windows installer.