iostat

A key part of any performance assessment is disk performance. The iostat command reports the performance metrics of the storage devices and their partitions.

# iostat
Linux 2.6.9-55.0.9.ELlargesmp (prolin3)     12/27/2008
 
avg-cpu:  %user   %nice    %sys %iowait   %idle
          15.71    0.00    1.07    3.30   79.91
 
Device:            tps   Blk_read/s   Blk_wrtn/s   Blk_read   Blk_wrtn
cciss/c0d0        4.85        34.82       130.69  307949274 1155708619
cciss/c0d0p1      0.08         0.21         0.00    1897036       3659
cciss/c0d0p2     18.11        34.61       130.69  306051650 1155700792
cciss/c0d1        0.96        13.32        19.75  117780303  174676304
cciss/c0d1p1      2.67        13.32        19.75  117780007  174676288
sda               0.00         0.00         0.00        184          0
sdb               1.03         5.94        18.84   52490104  166623534
sdc               0.00         0.00         0.00        184          0
sdd               1.74        38.19        11.49  337697496  101649200
sde               0.00         0.00         0.00        184          0
sdf               1.51        34.90         6.80  308638992   60159368
sdg               0.00         0.00         0.00        184          0
… and so on …

The beginning portion of the output shows metrics such as CPU idle time and I/O waits, as you have seen from the mpstat command.
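
If all you want is that CPU summary, iostat also accepts a -c option, which prints only the CPU statistics and skips the device section (the counterpart of the -d option described later):

# iostat -c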

The next part of the output shows very important metrics for each of the disk devices on the system. Let’s see what these columns mean:

Device
 The name of the device
 
tps  
 Number of transfers per second, i.e. the number of I/O operations per second. Note that this is just the count of I/O operations; each operation could be large or small.
 
Blk_read/s  
 Number of blocks read from this device per second. Blocks are usually 512 bytes in size. This is a better measure of the disk's utilization than tps, since it accounts for the amount of data transferred.
 
Blk_wrtn/s  
 Number of blocks written to this device per second
 
Blk_read  
 Cumulative number of blocks read from this device so far. Be careful: this is not the current rate; these blocks have already been read, and it's possible that nothing is being read right now. Watch it for some time to see whether it changes.
 
Blk_wrtn
 Cumulative number of blocks written to the device so far
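
Since a block is usually 512 bytes, you can convert these block figures to kilobytes by multiplying by 512 and dividing by 1,024 (i.e. dividing by 2). For example, the 34.82 blocks read per second from cciss/c0d0 above works out to roughly 17.41 KB/s; a quick check with bc, purely for illustration:

# echo "scale=2; 34.82 * 512 / 1024" | bc
17.41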
 

In a system with many devices, the output might scroll through several screens, making it a little difficult to examine, especially if you are looking for a specific device. You can limit the metrics to a specific device by passing the device name as a parameter.

# iostat sdaj  
Linux 2.6.9-55.0.9.ELlargesmp (prolin3)     12/27/2008
 
avg-cpu:  %user   %nice    %sys %iowait   %idle
          15.71    0.00    1.07    3.30   79.91
 
Device:            tps   Blk_read/s   Blk_wrtn/s   Blk_read   Blk_wrtn
sdaj              1.58        31.93        10.65  282355456   94172401

The CPU metrics shown at the beginning may not be very useful. To suppress them and show only the device statistics, use the -d option.
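
For example, to see only the device line for sdaj without the avg-cpu block (output omitted here, since it is the same device line shown above):

# iostat -d sdaj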
 
You can place optional parameters at the end to make iostat display the device stats at regular intervals. To get the stats for this device every 5 seconds for 10 times, issue the following (note that the first report shows averages since boot; each subsequent report covers only the interval since the previous one):

# iostat -d sdaj 5 10

You can display the stats in kilobytes instead of blocks by using the -k option:

# iostat -k -d sdaj   
Linux 2.6.9-55.0.9.ELlargesmp (prolin3)     12/27/2008
 
Device:            tps    kB_read/s    kB_wrtn/s    kB_read    kB_wrtn
sdaj              1.58        15.96         5.32  141176880   47085232

While the above output can be helpful, there is a lot of information it does not readily show. For instance, one of the key indicators of disk trouble is the disk service time, i.e. how quickly the disk delivers data to the process asking for it. To get that level of metrics, we have to get the “extended” stats on the disk, using the -x option.

# iostat -x sdaj
Linux 2.6.9-55.0.9.ELlargesmp (prolin3)     12/27/2008
 
avg-cpu:  %user   %nice    %sys %iowait   %idle
          15.71    0.00    1.07    3.30   79.91
 
Device:    rrqm/s wrqm/s   r/s   w/s  rsec/s  wsec/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await  svctm  %util
sdaj         0.00   0.00  1.07  0.51   31.93   10.65    15.96     5.32    27.01     0.01    6.26   6.00   0.95

Let’s see what the columns mean:

Device  
 The name of the device
 
rrqm/s
 The number of read requests merged per second. Disk requests are queued; whenever possible, the kernel merges several requests into one. This metric counts the merged requests for read transfers.
 
wrqm/s  
 Similar to reads, this is the number of write requests merged per second.
 
r/s  
 The number of read requests per second issued to this device
 
w/s 
 Likewise, the number of write requests per second
 
rsec/s 
 The number of sectors read from this device per second
 
wsec/s   
 The number of sectors written to the device per second
 
rkB/s   
 Amount of data read from this device, in kilobytes per second
 
wkB/s
 Amount of data written to this device, in kilobytes per second
 
avgrq-sz
 Average size (in sectors) of the requests issued to the device, reads and writes combined
 
avgqu-sz  
 Average length of the request queue for this device
 
await 
 Average time (in milliseconds) that I/O requests issued to the device take to complete, i.e. the time spent waiting in the queue plus the service time.
 
svctm 
 Average service time (in milliseconds) of the device
 
%util
 Bandwidth utilization of the device, i.e. the percentage of time the device was busy servicing requests. If this is close to 100 percent, the device is saturated.
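
These columns are related. The difference await - svctm is roughly the average time a request spends waiting in the queue (for sdaj above, 6.26 - 6.00 = 0.26 ms), and (r/s + w/s) × svctm is the number of milliseconds per second the device was busy, which is essentially %util. As a rough sanity check, assuming r/s, w/s and svctm sit in columns 4, 5 and 13 as in the output above (dividing by 10 turns milliseconds per second into a percentage):

# iostat -x sdaj | awk '$1 == "sdaj" { printf "%.2f%%\n", ($4 + $5) * $13 / 10 }'

For the sample line above, this works out to (1.07 + 0.51) × 6.00 / 10, or about 0.95, matching the %util column.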
 

Well, that’s a lot of information, and it may present a challenge as to how to use it effectively. The next section shows how to put the output to use.

How to Use It
You can use a combination of these commands to get meaningful information from the output. Remember, a disk can be slow in serving the requests it receives from processes. The time the disk takes to actually serve a request, once the request leaves the queue, is called the service time. If you want to find the disks with the highest service times, issue:

# iostat -x | sort -nrk13
sdat         0.00   0.00  0.00  0.00    0.00    0.00     0.00     0.00    18.80     0.00   64.06  64.05   0.00
sdv          0.00   0.00  0.00  0.00    0.00    0.00     0.00     0.00    17.16     0.00   18.03  17.64   0.00
sdak         0.00   0.00  0.00  0.14    0.00    1.11     0.00     0.55     8.02     0.00   17.00  17.00   0.24
sdm          0.00   0.00  0.00  0.19    0.01    1.52     0.01     0.76     8.06     0.00   16.78  16.78   0.32
… and so on …

This shows that the disk sdat has the highest service time (64.05 ms). Why is it so high? There could be many possibilities but three are most likely:

1. The disk gets a lot of requests, so the average service time is high.
2. The disk is being utilized to its maximum possible bandwidth.
3. The disk is inherently slow.

Looking at the output, we see that reads/sec and writes/sec are 0.00 (almost nothing is happening), so we can rule out #1. The utilization is also 0.00% (the last column), so we can rule out #2. That leaves #3. However, before we conclude that the disk is inherently slow, we should observe it a little more closely. We can examine that disk alone, every 5 seconds for 10 times.

# iostat -x sdat 5 10

If the output shows the same average service time, read rate and utilization, we can conclude that #3 is the most likely factor. If they change, then we can get further clues to understand why the service time is high for this device.

Similarly, you can sort on the read rate column (column 6, rsec/s) to display the disks with the highest read rates:

# iostat -x | sort -nrk6
sdj          0.00   0.00  1.86  0.61   56.78   12.80    28.39     6.40    28.22     0.03   10.69   9.99   2.46
sdah         0.00   0.00  1.66  0.52   50.54   10.94    25.27     5.47    28.17     0.02   10.69  10.00   2.18
sdd          0.00   0.00  1.26  0.48   38.18   11.49    19.09     5.75    28.48     0.01    3.57   3.52   0.61
… and so on …
   
The information helps you to locate a disk that is “hot”—that is, subject to a lot of reads or writes. If the disk is indeed hot, you should identify the reason for that; perhaps a filesystem defined on the disk is subject to a lot of reading. If that is the case, you should consider striping the filesystem across many disks to distribute the load, minimizing the possibility that one specific disk will be hot.
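
To relate a hot device back to a filesystem, you can check what is mounted on it. For example, assuming one of the busy devices (say sdj from the listing above) carries a directly mounted filesystem rather than sitting under a volume manager, something along these lines will show the mount point:

# df -h | grep sdj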

(Extracted from Oracle Technology Network notes by Arup Nanda)