DZone
Thanks for visiting DZone today,
Edit Profile
  • Manage Email Subscriptions
  • How to Post to DZone
  • Article Submission Guidelines
Sign Out View Profile
  • Post an Article
  • Manage My Drafts
Over 2 million developers have joined DZone.
Log In / Join
Please enter at least three characters to search
Refcards Trend Reports
Events Video Library
Refcards
Trend Reports

Events

View Events Video Library

Zones

Culture and Methodologies Agile Career Development Methodologies Team Management
Data Engineering AI/ML Big Data Data Databases IoT
Software Design and Architecture Cloud Architecture Containers Integration Microservices Performance Security
Coding Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks
Culture and Methodologies
Agile Career Development Methodologies Team Management
Data Engineering
AI/ML Big Data Data Databases IoT
Software Design and Architecture
Cloud Architecture Containers Integration Microservices Performance Security
Coding
Frameworks Java JavaScript Languages Tools
Testing, Deployment, and Maintenance
Deployment DevOps and CI/CD Maintenance Monitoring and Observability Testing, Tools, and Frameworks

Because the DevOps movement has redefined engineering responsibilities, SREs now have to become stewards of observability strategy.

Apache Cassandra combines the benefits of major NoSQL databases to support data management needs not covered by traditional RDBMS vendors.

The software you build is only as secure as the code that powers it. Learn how malicious code creeps into your software supply chain.

Generative AI has transformed nearly every industry. How can you leverage GenAI to improve your productivity and efficiency?

Related

  • 5 DevOps Tools To Add to Your Stack in 2022
  • Anticipating Your Business Problem With Infrastructure as Code in DevOps
  • 4 Essential Strategies for Enhancing Your Application Security Posture
  • OpenTelemetry: Unifying Application and Infrastructure Observability

Trending

  • GitHub Copilot's New AI Coding Agent Saves Developers Time – And Requires Their Oversight
  • Scaling Microservices With Docker and Kubernetes on Production
  • Rust, WASM, and Edge: Next-Level Performance
  • Chat With Your Knowledge Base: A Hands-On Java and LangChain4j Guide
  1. DZone
  2. Testing, Deployment, and Maintenance
  3. Maintenance
  4. AI/ML Use Cases in Application Management

AI/ML Use Cases in Application Management

In this article, take a look at some AI and ML use cases in application management and see solutions.

By 
Nageswara Rao user avatar
Nageswara Rao
·
Updated Mar. 27, 20 · Opinion
Likes (6)
Comment
Save
Tweet
Share
9.9K Views

Join the DZone community and get the full member experience.

Join For Free

Overview

Artificial Intelligence-based Operations (AIOps) is the convergence of AI and traditional AM/IM operations. Like in all other domains, AI is going to have a significant impact on operations management. When the power of AI is applied to operations, it will redefine the way applications and the supporting application/infrastructure is managed.

Multiple applications are running simultaneously generates a lot of data. The data is generated right from the network layer to the latency of an API call to an end-user. The user expects experience of applications without the slightest performance disruption.

It is possible to acquire data from disparate layers of the stack becomes a rich source to infer insights. The complexity of operations has led to the creation of algorithmic IT operations (AIOps) platforms. The platform solution uses AI and ML to gain insights from the monitoring data and drive automated solutions by augmenting human decisions.

Important Use Cases and Solutions

Application Monitoring

Application metrics track things like response time, requests/minute, error rates overtime on a time scale and identify trends in their behavior. Besides, infrastructure metrics such as CPU utilization, memory utilization, and load averages are captured to understand how the infrastructure layer supports to cater to different load conditions of an application. As the complexity of the application grows, it is difficult to detect exceptions from an expected pattern. If unnoticed, these anomalies can cause potential outages.

Solution

Patterns of changes can be analyzed and discovered at different scopes including application level, service level, transaction-level, and external dependencies. First, determine what constitutes normal system behavior and then discern departures from that normal system behavior. AIOps can accurately highlight these outliers by pinpointing the sources, which can help better RCA in real-time. Also, it can prevent potential outages and infrastructure disruptions.

Transaction Tracing

Business transactions range from simple synchronous message exchanges between point-to-point application connections to more complex asynchronous communications. To track the transaction, flow a sophisticated tracking and monitoring solution is required. Long-running, multi-step asynchronous transactions transit IT infrastructure, spanning multiple technologies, tiers, etc.

Solution

Complex transactions often morph and split, thus defying standard tracking and analysis via tagging or statistical sampling techniques. Transactions are stitched together by examining method calls and individual message payload contents, correlating them and presenting intuitive visualizations of any pending or existing breaches in expected behavior and performance

Localizing Defects

Managing software quality is a big concern in the software development lifecycle. Almost any software contains at least some minor defects after being released. It is important to identify and fix defects before getting into the production environment. Any defect found in production incurs significant costs. Locating bugs is considered to be the most time consuming and challenging activity in this context where the resources available are limited. Therefore, there is a need for fully/semi-automated techniques in software engineering for augmenting the manual debugging process. If a developer obtains some hints where bugs might be localized, debugging becomes more efficient

Solution

Various graph mining algorithms/techniques come in handy to localize software defects. These techniques rely on detecting discriminative sub-graphs between failing and passing traces. These approaches may not be applicable when the fault does not appear in a rare code pattern. On the other hand, many approaches focus on selecting potentially faulty program components (statements or predicates) and then ranking these components according to their degree of suspiciousness and the context of execution traces based on control flow graphs

Arriving Baselines

Performance baseline determines how the components of Application and Infrastructure performs under different load conditions. The load conditions are namely Normal, Operational, Quasi, Stressed, Spikes, Breakpoint, etc., and the baselines are a set of rules or thresholds of individual metrics expected to vary between upper and lower limits. Traditionally, these associations are modeled by running Machine-Learning algorithms after collecting performance data collected for a defined time interval and deployed in real-time to notify in case of any performance deviations. This method well suited for the components evolves slowly however nullifies the point of “relevance” with the modern development methodologies.

Solution:

The influence of hyper-converged Infrastructure Management, Domain-driven Application development, a surge of Distributed computing, Polyglot Programming and Persistence changed the way software components are developed and deployed. Frequent changes of software components need to be deployed continuously on top of dynamic scaled-up/scaled-down underlying infrastructure. This paradigm shift enforces the model-building exercise to use near-real-time data to stay relevant to the latest changes in Application as well as Infrastructure components. These models need to use real-time feeds to learn new rules and evolve continuously.

Predicting Issues

Intelligent alerting in APM is to detect the abnormal dynamically. For alerts to be intelligent, the tool needs to be configurable to understand the nature of your application and its behavior so that it can detect anomalies. It was quite common to define static thresholds; such as raise an alert if this service call takes more than three seconds to return. However, it is very tedious to identify important metrics to monitor and their thresholds for different application usage patterns, hence the need for an intelligent way to baseline the normalcy of an application and notify in case of abnormal behavior.

Solution

As the algorithmic techniques evolved, alerting became smart enough to perform rudimentary statistical analysis and allowed for alerts based on prediction using measures like things standard deviations, percentiles, forecasting, etc. Today, tools are smart enough to understand the behavior of your application and establish a baseline, allow you to define the strategy to use when analyzing requests against your baseline, and intelligently alert when there is a real problem that you need to review.

application Software development Infrastructure

Opinions expressed by DZone contributors are their own.

Related

  • 5 DevOps Tools To Add to Your Stack in 2022
  • Anticipating Your Business Problem With Infrastructure as Code in DevOps
  • 4 Essential Strategies for Enhancing Your Application Security Posture
  • OpenTelemetry: Unifying Application and Infrastructure Observability

Partner Resources

×

Comments
Oops! Something Went Wrong

The likes didn't load as expected. Please refresh the page and try again.

ABOUT US

  • About DZone
  • Support and feedback
  • Community research
  • Sitemap

ADVERTISE

  • Advertise with DZone

CONTRIBUTE ON DZONE

  • Article Submission Guidelines
  • Become a Contributor
  • Core Program
  • Visit the Writers' Zone

LEGAL

  • Terms of Service
  • Privacy Policy

CONTACT US

  • 3343 Perimeter Hill Drive
  • Suite 100
  • Nashville, TN 37211
  • support@dzone.com

Let's be friends:

Likes
There are no likes...yet! 👀
Be the first to like this post!
It looks like you're not logged in.
Sign in to see who liked this post!