Open Research Newcastle
Browse

Anomaly detection via mining numerical workflow relations from logs

Download (3.29 MB)
conference contribution
posted on 2025-05-11, 18:37 authored by Bo Zhang, Hongyu ZhangHongyu Zhang, Pablo MoscatoPablo Moscato
Complex software-intensive systems, especially distributed systems, generate logs for troubleshooting. The logs are text messages recording system events, which can help engineers determine the system's runtime status. This paper proposes a novel approach named ADR (stands for Anomaly Detection by workflow Relations), which employs matrix nullspace to mine numerical relations from log data. The mined relations can be used for both offline and online anomaly detection and facilitate fault diagnosis. We have evaluated ADR on log data collected from two distributed systems. ADR successfully mined 87 and 669 numerical relations from the logs and used them to detect anomalies with high precision and recall. For online anomaly detection, ADR employs PSO (Particle Swarm Optimization) to find the optimal sliding windows' size and achieves fast anomaly detection. The experimental results confirm that ADR is effective for both offline and online anomaly detection.

Funding

ARC

DP200102940

History

Related Materials

  1. 1.
  2. 2.
    ISBN - Is version of urn:isbn:9781728176277
  3. 3.

Source title

Proceedings of 2020 International Symposium on Reliable Distributed Systems (SRDS)

Name of conference

2020 International Symposium on Reliable Distributed Systems (SRDS)

Location

online

Start date

2021-09-21

End date

2021-09-24

Pagination

195-204

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Place published

Washington, US

Language

  • en, English

College/Research Centre

Faculty of Engineering and Built Environment

School

School of Electrical Engineering and Computer Science

Rights statement

© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Usage metrics

    Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC