Learn about the value of long term cloud analytics.
- [Narrator] Okay, now that we understand short-term reporting, or analytics, let's understand long-term. So, what's important here is that we need to monitor systems over a longer duration of time to understand exactly what they are doing. Systems within the Cloud may live years, so we need to understand every bit of information and every metric that goes on within the system into the requirements of the monitoring that we've created around the systems that we're running in the Cloud. So, what that means is we can't monitor everything, that's just going to slow things down, but we're picking certain metrics, such as CPU utilization, such as storage utilization, such as IO utilization, that's important to us.
And we're monitoring those not only in the short-term, as we saw in the last video, but in the long-term, over months perhaps years. In terms of understanding where things are going, information that we can access at any time to correct the systems based on how we're monitoring the systems. So we need to understand systemic problems, and proactively correct them. So long-term monitoring and analytics finds systemic issues. In other words we may not have configured our virtual servers correctly.
We may not have configured our storage systems correctly. And we need to be able to proactively move into the systems and alter them or correct them before they become a problem. So back to our Foundations of Monitoring and Operations. So understanding reporting, again long-term reporting not short-term reporting, is just that. It's the ability to look at something over a long period of time, gathering particular metrics that we're going to leverage, to figure out what the system is doing and how it's behaving over a long period of time.
Short-term metrics are valuable. We certainly want to know what the system has been doing in the last few days, the last few weeks; however, the ability to look at the system trends over a year are going to be much more valuable to you in understanding the behavior of a system over a long period of time. So, long-term reporting we deal with what's running, we deal with how well they're running, and we deal with corrective actions. Very similar to short-term. So, not a lot has changed other than the fact you have to be very particular about what you're monitoring over a long period of time.
Remember, this is going to cost you money in terms of CPU utilization, in terms of storage utilization; and then if you turn everything on it's going to kill your performance, it's going to cause problems with your systems. So you need to be very particular about what metrics you're monitoring, why you're monitoring it, and understand what's running, how well it's running, and the ability to take corrective actions if something falls out of range. So back to our familiar technology needed. Again, we need data. We're looking at something long-term over time we need to process that information into a database.
We need to deal with trends, in other words what trends are occurring and how they're occurring. Limitations, in other words CP utilization, storage utilization, other things that we can set that are typically going to be numeric values. If they fall out of place, again someone's notified. Or if they're trending and going to fall out of place, someone is notified. Actions that need to be taken; for example, allocating additional storage, allocating additional CPUs. And then reporting, the ability to look at things long-term in terms of what the system is doing and how it's behaving.
This is valuable more so than short-term reporting, because we're able to get into the core information that we need to figure out what the trends are. And it's very difficult to spot a trend in a few days or a few weeks; however, over months we can pretty much figure out what the system is doing, when things are going down, when performance is being challenged. And those things being understood by you allow you to be a better Cloud system administrator and correct things as they occur, or before they occur.
- Cloud health, performance, security, and governance monitoring
- Cloud monitoring analytics
- Cloud monitoring costs
- AWS CloudWatch
- Librato CloudWatch
- Cloud Cruiser
- Microsoft cloud monitoring
- Rackspace cloud monitoring
- Creating a cloud monitoring and operations plan
- Defining cloud monitoring operations patterns