CLASSIFICATION OF METRICS FOR PROACTIVE ANOMALY DETECTION IN DISTRIBUTED SYSTEM NODE LOADS
Keywords:
software engineering, distributed systems, actor model, load metrics, proactive migration, monitoring, fault toleranceAbstract
The paper considers the problem of node state monitoring in high-load distributed systems based on the Actor Model. The disadvantages of reactive load balancing methods are analyzed. A multi-level classification of metrics (system, platform, application) is proposed, the monitoring of which allows implementing proactive migration of computational entities before a critical failure or performance degradation occurs.
References
N. Hayashibara, X. Defago, R. Yared, and T. Katayama, "The ϕ accrual failure detector," in 23rd IEEE International Symposium on Reliable Distributed Systems, 2004, pp. 66–78.
V. Vernon, Reactive Messaging Patterns with the Actor Model: Applications and Integration in Scala and Akka. Addison-Wesley Professional, 2015.
A. Newell, G. Kliot, I. Menache, A. Gopalan, S. Akiyama, and M. Silberstein, "Optimizing Distributed Actor Systems for Dynamic Interactive Services," in Proceedings of the Eleventh European Conference on Computer Systems (EuroSys '16), ACM, 2016.
A. S. Tanenbaum and M. Van Steen, Distributed Systems: Principles and Paradigms, 3rd ed. CreateSpace Independent Publishing Platform, 2017.
B. Burns, Designing Distributed Systems: Patterns and Paradigms for Scalable, Reliable Services. O'Reilly Media, 2018.