CLASSIFICATION OF METRICS FOR PROACTIVE ANOMALY DETECTION IN DISTRIBUTED SYSTEM NODE LOADS

Authors

Keywords

software engineering, distributed systems, actor model, load metrics, proactive migration, monitoring, fault tolerance

Abstract

This paper addresses node-state monitoring in high-load distributed systems built on the actor model. It analyzes the drawbacks of reactive load-balancing methods and proposes a multi-level classification of metrics (system, platform, and application). Monitoring these metrics makes it possible to migrate computational entities proactively, before a critical failure or performance degradation occurs.
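
The three-level classification and the idea of acting before a critical state can be illustrated with a minimal sketch. It is not the paper's implementation: the metric names, the threshold values, and the helper `levels_needing_migration` are hypothetical, chosen only to show one sample per level and a warning band that sits below the critical threshold.

```python
from dataclasses import dataclass
from enum import Enum


class MetricLevel(Enum):
    SYSTEM = "system"            # host resources (assumed example: CPU)
    PLATFORM = "platform"        # actor runtime (assumed example: mailbox length)
    APPLICATION = "application"  # service behavior (assumed example: latency)


@dataclass(frozen=True)
class MetricSample:
    name: str
    level: MetricLevel
    value: float
    warning: float   # hypothetical early-warning threshold
    critical: float  # hypothetical threshold at which the node is already degraded


def levels_needing_migration(samples):
    """Return the levels that have a sample in the warning band [warning, critical).

    Values at or above `critical` are excluded on purpose: at that point
    the reaction is no longer proactive.
    """
    return {s.level for s in samples if s.warning <= s.value < s.critical}


# Illustrative values only, not measurements from the paper.
samples = [
    MetricSample("cpu_utilization", MetricLevel.SYSTEM, 0.62, 0.80, 0.95),
    MetricSample("mailbox_queue_length", MetricLevel.PLATFORM, 850, 500, 2000),
    MetricSample("p99_latency_ms", MetricLevel.APPLICATION, 120, 200, 500),
]

flagged = levels_needing_migration(samples)
print(sorted(level.value for level in flagged))  # ['platform']
```

In this sketch only the platform-level metric has entered its warning band, so a proactive policy would consider moving actors off the node while the system-level and application-level metrics still look healthy.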

References

N. Hayashibara, X. Defago, R. Yared, and T. Katayama, "The ϕ accrual failure detector," in 23rd IEEE International Symposium on Reliable Distributed Systems, 2004, pp. 66–78.

V. Vernon, Reactive Messaging Patterns with the Actor Model: Applications and Integration in Scala and Akka. Addison-Wesley Professional, 2015.

A. Newell, G. Kliot, I. Menache, A. Gopalan, S. Akiyama, and M. Silberstein, "Optimizing Distributed Actor Systems for Dynamic Interactive Services," in Proceedings of the Eleventh European Conference on Computer Systems (EuroSys '16), ACM, 2016.

A. S. Tanenbaum and M. Van Steen, Distributed Systems: Principles and Paradigms, 3rd ed. CreateSpace Independent Publishing Platform, 2017.

B. Burns, Designing Distributed Systems: Patterns and Paradigms for Scalable, Reliable Services. O'Reilly Media, 2018.

Published

2026-05-08

Issue

Section

Plenary Section