Analyze process responsiveness

Responsiveness measurements allow you to quickly assess process health by tracking how long it takes for a process to respond to requests from other processes or clients (regardless of resource consumption). Dynatrace measures this on the TCP level, so this metric is available for any process that accepts TCP requests (for example, listening on an open port) and responds to TCP requests. Typical examples are web server processes, proxy server processes, and SQL server processes. Code-level visibility isn't required to measure process responsiveness—Dynatrace only requires insight into inter-process communication to gather the information it needs. Plus, this metric is always measurable—no process restarts or special instrumentation required.

Use case: using responsiveness to assess process performance

Following is a use-case example that shows that it's not enough to only monitor the resource consumption of a process because a process that consumes excessive resources won't suffer from performance degradation—it's the other processes in the system that will suffer.

Let's say, it's 8:15 AM and you receive a Dynatrace problem alert telling you that host Prod has used 99% of available CPU.

Responsiveness screen 1

This may indicate a serious problem, but maybe not—you need to investigate to see what's going on. Click the Consuming processes button.

Responsiveness screen 2

You notice that the Elasticsearch process is consuming nearly all available CPU. The running process on the host that is most critical to the success of your business, NGINX, isn't consuming too much CPU and RAM.

At first glance, this may be reassuring but you still can't be sure that NGINX is performing at its baseline level of performance. Click the NGINX process link to analyze its performance over time.

The Responsiveness confirms bad news—your NGINX process is in fact responding to TCP requests much slower than it was earlier in the day.

Responsiveness screen 3

The root cause of the NGINX process slowdown is clearly a resource consumption issue with the Elasticsearch process.

NGINX is just one example of a technology for which you can analyze responsiveness. This use case is also applicable to:

Other HTTP servers like Lighttpd and LiteSpeed
Database servers like MySQL, CouchDB/Couchbase, and MongoDB
Proxy or web acceleration servers like Squid, Varnish, HAproxy, and SS5
Cache or key value stores like Memcached, Redis, and Riak
Other server types like SMTPD, Dovecot, and LDAP

Since the responsiveness metric in Dynatrace is based on TCP communication, it's virtually technology-agnostic. You can use this metric to gain platform-independent visibility into just about any application and technology.

Platform

Overview
Pricing
Supported technologies
Application Observability
Infrastructure Observability
Automations
Security Protection
Security Analytics
Digital Experience
AIOps
Full stack
Davis, AI-engine
Open ecosystem
AppEngine
AutomationEngine

Solutions

Overview
Cloud operations
Microservices
Container monitoring
DevOps monitoring
Observability
Cloud monitoring
Application monitoring
Site Reliability Engineering
Log Management and Analytics

Resources

Overview
Blog
Demos
Webinars
Customer stories
Podcasts
Free trial
Request demo
Glossary
2023 Gartner APM Magic Quadrant
Is It Observable
Knowledge Base
Engineering blog
Cloud done right

Services & Support

Overview
Success and Support
Software Intelligence Hub
ACE Services
University
Support Center
Product news
Documentation
Community

About

Overview
Leadership
Investor Relations
News
Media kit
Events
Careers
Partners
Locations
ESG
Contact us
Privacy Notice
Your Privacy Choices
Legal disclosure

Trust Center
Dynatrace status
Terms of use
Policies
Sitemap

© 2024 Dynatrace LLC. All rights reserved.