Does your Mainframe need an oil change?

Jul 25, 2017

Mainframe, Metrics

Hemanth Rama

Hemanth Rama, a millennial mainframer, is a senior software engineer at BMC Software. Hemanth has 13+ years of IT experience and holds 3 patents. He is a recipient of the IBM z Champion award 2018. Hemanth writes regularly for many popular IT websites and also on his personal blog. He speaks passionately about mainframe technologies at various conferences, symposiums and user groups.

Performance management is a key consideration on any platform. A system has several resources (CPU, I/O, storage, network, etc.) that work together to process a workload. To assess the overall health of the system, you can review mainframe system reports for key performance metrics on these resources. These metrics are measured and compared against service level agreements (SLAs) or performance rule-of-thumb standards.

An SLA is a contract between the user and the system that describes the goals to meet for business-critical workloads. If the results are not as expected, mission-critical workloads running on the system will usually suffer. Possible remedies include tuning performance to bring the metrics back to baseline standards, buying more resources, offloading eligible workloads to specialty engines, or stealing resources from less critical work by adjusting priorities.

Types of workloads

First, let’s take a look at the typical workloads that run on mainframe systems. They can be classified into two flavors: batch processing and online transactional processing.

Batch workloads: Batch workloads process high volumes of data and produce outputs or reports. They are typically scheduled programs that run without user interaction, often during off-peak hours. An example is a mainframe job that generates a large number of customer billing statements or processes customer orders.

Transactional workloads: These workloads typically involve end-user interactions. Transactions are usually short and are often considered mission-critical for the business. Examples include bank ATM transactions, merchant credit card processing at checkout stations, and online order purchases.

Performance metrics

Mainframe systems regularly capture key system metrics to gauge performance. Various performance monitoring tools use this data and display it to end users. Here are some of these metrics:

Average throughput
This indicates the average number of service completions per unit of time, for example, the number of transactions per second or per minute. Transactional workloads are typically measured using this performance metric.
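As a rough illustration of the arithmetic, here is a minimal Python sketch. The function name and the sample figures are made up for this example and do not come from any particular monitoring tool.

```python
def average_throughput(completions: int, interval_seconds: float) -> float:
    """Return service completions per second over an observation interval."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    return completions / interval_seconds


# Hypothetical sample: 18,000 transactions completed in a 15-minute interval.
tps = average_throughput(18_000, 15 * 60)
print(f"{tps:.1f} transactions/second")  # 20.0 transactions/second
print(f"{tps * 60:.0f} transactions/minute")  # 1200 transactions/minute
```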

Average response time
This measures the average amount of time it takes to complete a single service. Transactional workloads are usually measured using this performance metric – it can also be specified as an SLA goal for workloads.
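A small Python sketch of the calculation, assuming you already have the elapsed time of each completed transaction. The sample values and the 0.2-second goal are hypothetical and only serve to show the comparison against an SLA goal.

```python
from statistics import mean
from typing import Sequence


def average_response_time(elapsed_seconds: Sequence[float]) -> float:
    """Return the mean elapsed time of the completed services."""
    if not elapsed_seconds:
        raise ValueError("at least one sample is required")
    return mean(elapsed_seconds)


# Hypothetical elapsed times (in seconds) for four transactions.
samples = [0.12, 0.08, 0.25, 0.15]
goal_seconds = 0.2  # hypothetical SLA goal

avg = average_response_time(samples)
print(f"average response time: {avg:.2f}s")
print("goal met" if avg <= goal_seconds else "goal missed")
```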

Resource utilization
Typically, this metric shows how much of a resource the workloads (batch or transactional) consumed over a period of time. Examples: CPU utilization, processor storage utilization, I/O rates, paging rates, etc.
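For a time-based resource such as CPU, utilization is commonly expressed as busy time divided by the length of the measurement interval. The sketch below assumes that simple definition. The function name and the numbers are illustrative only.

```python
def utilization_percent(busy_seconds: float, interval_seconds: float) -> float:
    """Return the share of the interval the resource was busy, as a percentage."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")
    if not 0 <= busy_seconds <= interval_seconds:
        raise ValueError("busy_seconds must lie within the interval")
    return 100 * busy_seconds / interval_seconds


# Hypothetical sample: a CPU was busy for 540 seconds of a 900-second interval.
print(f"CPU utilization: {utilization_percent(540, 900):.0f}%")  # CPU utilization: 60%
```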

Resource velocity
Velocity is a measure of resource contention. When multiple workloads require a resource (for example, CPU) at the same time, there is contention for that resource. While one workload is using the resource, the other workloads wait in a queue. Resource velocity is the ratio of the time spent using the resource (A) to the total time spent using the resource (A) and waiting in the queue (B), that is, A / (A + B). The value is expressed as a percentage in the range 0 to 100. A value close to 0 means heavy contention for the resource, and a value of 100 means no contention. This metric can be specified as an SLA goal for workloads.
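The A / (A + B) formula above can be sketched in a few lines of Python. The function name and the sample times are hypothetical. Real monitors usually derive A and B from sampled using and delay states rather than from exact timings.

```python
def resource_velocity(using: float, waiting: float) -> float:
    """Return velocity as a percentage: 100 * A / (A + B)."""
    if using < 0 or waiting < 0:
        raise ValueError("times must be non-negative")
    total = using + waiting
    if total == 0:
        raise ValueError("velocity is undefined when A + B is zero")
    return 100 * using / total


# Hypothetical workload: 30 seconds using the CPU, 70 seconds waiting for it.
print(resource_velocity(30, 70))  # 30.0 (heavy contention)

# No waiting at all means no contention.
print(resource_velocity(45, 0))  # 100.0
```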

Performance Index
As part of the SLA, workloads are classified into service classes, and each service class has goals. Goals can be expressed as response time, velocity, etc. Because the SLA defines several types of goals for various workloads, a simple metric called the performance index (PI) is used to determine how workloads are performing with respect to their defined goals. PI is simply a ratio of the defined goal and the achieved result, arranged so that a smaller value is better: for a response time goal it is the achieved response time divided by the goal, and for a velocity goal it is the goal velocity divided by the achieved velocity. A PI value of 1 means the workload is meeting its goal exactly. A value < 1 means the workload is exceeding its goal, and a value > 1 means the workload is missing its goal.
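Here is a minimal Python sketch of the PI calculation. It assumes the ratio is oriented so that values below 1 always mean the goal is being beaten: achieved over goal for response time (lower is better), and goal over achieved for velocity (higher is better). The function names and sample goals are hypothetical.

```python
def pi_response_time(goal_seconds: float, achieved_seconds: float) -> float:
    """PI for a response time goal: achieved / goal."""
    if goal_seconds <= 0:
        raise ValueError("goal_seconds must be positive")
    return achieved_seconds / goal_seconds


def pi_velocity(goal_velocity: float, achieved_velocity: float) -> float:
    """PI for a velocity goal: goal / achieved."""
    if achieved_velocity <= 0:
        raise ValueError("achieved_velocity must be positive")
    return goal_velocity / achieved_velocity


def describe(pi: float) -> str:
    """Translate a PI value into the three cases described in the text."""
    if pi < 1:
        return "exceeding goal"
    if pi > 1:
        return "missing goal"
    return "meeting goal"


# Hypothetical service classes.
online = pi_response_time(goal_seconds=0.5, achieved_seconds=0.4)
batch = pi_velocity(goal_velocity=50, achieved_velocity=40)

print(f"ONLINE PI={online:.2f} ({describe(online)})")  # ONLINE PI=0.80 (exceeding goal)
print(f"BATCH  PI={batch:.2f} ({describe(batch)})")    # BATCH  PI=1.25 (missing goal)
```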
