I l@ve RuBoard |
Collecting Disk Performance DataThis section describes some common tools for measuring and monitoring disk performance. The following are some key terms that you need to know for this section:
Performance tools, such as BMC PATROL and HP MeasureWare Agent, do not always provide the same set of metrics on all platforms. For simplicity, this section focuses only on the Sun Solaris and HP-UX platforms. Also, these products are continually being enhanced, so the actual metrics available for use in your environment may not precisely match the information presented in this section. MeasureWareHP MeasureWare Agent is a Hewlett-Packard product that collects and logs resource and performance metrics. MeasureWare agents are installed on the individual server systems to be monitored. MeasureWare agents exist for many platforms and operating systems. MeasureWare agents collect data at the global, application, and process levels. Many of the system metrics are described in Chapter 4, "Monitoring the System." This section lists the additional global metrics that are used to monitor disk devices. The following is a list of system-wide disk-related metrics available on HP-UX and Sun Solaris:
These additional system-wide disk-related metrics are available on HP-UX:
MeasureWare can also provide information on swap space utilization and the "fullest filesystem," which is the filesystem with the highest percentage of disk space in use. GlancePlusGlancePlus is a real-time, graphical performance monitoring tool. It is used to monitor the performance and system resource utilization of a single system. Both Motif-based and character-based interfaces are available. The product can be used on HP-UX, Sun Solaris, and many other operating systems. GlancePlus can be used to view and graph a system's current CPU, memory, swap, and disk activity. GlancePlus has screens dedicated to each of these main resources. GlancePlus can display a variety of data useful for disk monitoring:
The specific list of available metrics can be found when running GlancePlus, through its online help facility. GlancePlus is also capable of setting and receiving performance-related alarms. Customizable rules determine when a system performance problem should be sent as an alarm. The rules are managed by the GlancePlus Adviser. An Adviser menu option allows you to Edit Adviser Syntax. When you select this option, all of the alarm conditions are shown and can be modified, as demonstrated in Figure 5-8. Figure 5-8. Using GlancePlus to configure alarms for monitoring swap space utilization.Notice in Figure 5-8 how the swap-related alarms are integrated into the same definition file along with network-related alarms. When alarms occur, they can be reflected directly in the GlancePlus interface. GlancePlus can be launched from the command line, or you can start it from the Performance Monitors functional area in SAM (on HP-UX). PerfViewPerfView is a graphical performance monitoring tool that is used to monitor the performance and system resource utilization for multiple systems in your environment. A variety of performance graphs can be displayed. The graphs are based on data collected over a period of time, unlike the real-time graphs of GlancePlus. PerfView can show graphs from multiple systems simultaneously, so that comparisons can be made. PerfView is integrated with other monitoring tools. For example, you can launch GlancePlus from within PerfView by accessing the Tools menu. And, PerfView can be launched from the IT/O Applications Bank. When troubleshooting an event in the IT/O Message Browser window, you can launch PerfView to see a related performance graph. PerfView relies on MeasureWare data, so it can display performance information only for systems that support the MeasureWare Agent. Refer to the previous section on MeasureWare to see a list of the disk metrics available. PerfView has three main components:
PerfView's ability to show history and trend information can be helpful in diagnosing disk problems. Graphing performance information can help you to understand whether a persistent problem exists or is an anomaly (simply a momentary spike of activity). Figure 5-9 shows a PerfView graph illustrating an application's I/O performance over time. Additional system performance metrics are also included in the graph. Figure 5-9. PerfView can show the history of an application's I/O access rate.To diagnose a problem further, PerfView Monitor allows the user to change time intervals, to try to find the specific time that a problem occurred. The graph is redrawn showing the new time period. BMC PATROLBMC provides monitoring capabilities through its PATROL software suite. PATROL provides the basic framework for defining thresholds, sending and translating events, and so forth. Optional products called Knowledge Modules (KMs) contain the ability to monitor specific components. For example, BMC PATROL includes KMs for UNIX, SAP R/3, Oracle, Informix, and other applications. In fact, more than 40 KMs are available from BMC for use with PATROL. BMC provides a tool with its UNIX KM to provide information about disks and disk usage. The following disk and filesystem metrics are available on HP-UX and Sun Solaris:
BMC can also monitor the percentage of swap space in use, and the amount of filesystem space in use, per filesystem. The disk monitoring capabilities of BMC PATROL are similar to those of MeasureWare. Some minimal configuration information is provided, but its primary value is in tracking resource and performance information. Indirectly, BMC PATROL can provide some fault information as well. For example, if disk utilization suddenly drops to zero on a particular disk drive, it may be an indication that the disk has failed. Of course, it could also be an indication that an application has terminated and is no longer using the disk. |
I l@ve RuBoard |
No comments:
Post a Comment