Significant Advances in Application Resiliency: The Data Engineering Perspective on Network Performance Metrics
Jayanna Hallur
Journal of Technology and Systems, 2024, vol. 6, issue 7, 60 - 71
Abstract:
Purpose: The purpose of the article is to explore how network metrics like latency, packet loss, and throughput, combined with application logs, can help organizations improve the reliability and performance of their applications. It focuses on how these insights support Site Reliability Engineering (SRE) teams in proactively addressing issues to achieve better application resiliency, enhancing user experience and market trust. Methodology: The article uses a combination of case studies and analysis to demonstrate how monitoring specific network metrics and application logs helps identify and resolve performance issues. It examines real-world scenarios where proactive adjustments based on these metrics improved application reliability and aligned with organizational objectives. Findings: The findings show that by analyzing network metrics and application logs, organizations can pinpoint causes of transaction failures, such as high latency, packet loss, or misconfigured firewalls. Proactive resolutions based on these insights result in smoother application performance, reduced downtime, and increased user satisfaction. Unique Contribution to theory, practice and policy: This article makes valuable contributions to theory, practice, and policy. For theory, it expands the understanding of how network metrics and application logs can work together to improve application resiliency, offering a framework for integrating Site Reliability Engineering (SRE) principles with network observability. For practice, it provides clear, actionable steps for SRE teams to identify and resolve performance issues, helping organizations enhance reliability and user satisfaction. For policy, it highlights the importance of proactive network monitoring and metric-driven decision-making, encouraging organizations to adopt policies that prioritize resiliency, ensure consistent performance, and meet service-level agreements (SLAs).
Keywords: Data Engineering; Network Observability; Telemetry; Reliability; Resiliency; Scalability; FCAPS (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
Downloads: (external link)
https://carijournals.org/journals/index.php/JTS/article/view/2408/2834 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bhx:ojtjts:v:6:y:2024:i:7:p:60-71:id:2408
Access Statistics for this article
More articles in Journal of Technology and Systems from CARI Journals Limited
Bibliographic data for series maintained by Chief Editor ().