Issue |
E3S Web Conf.
Volume 140, 2019
International Scientific Conference on Energy, Environmental and Construction Engineering (EECE-2019)
|
|
---|---|---|
Article Number | 03006 | |
Number of page(s) | 5 | |
Section | Economics in Energy Sector | |
DOI | https://doi.org/10.1051/e3sconf/201914003006 | |
Published online | 18 December 2019 |
Log mining and knowledge-based models in data storage systems diagnostics
Peter the Great St. Petersburg Polytechnic University, St. Petersburg, Russian Federation
* Corresponding author: mikhail.uspenskiy@spbpu.com
Modern data storage systems have a sophisticated hardware and software architecture, including multiple storage processors, storage fabrics, network equipment and storage media and contain information, which can be damaged or lost because of hardware or software fault. Approach to storage software diagnostics, presented in current paper, combines a log mining algorithms for fault detection based on natural language processing text classification methods, and usage of the diagnostic model for a task of fault source detection. Currently existing approaches to computational systems diagnostics are either ignoring system or event log data, using only numeric monitoring parameters, or target only certain log types or use logs to create chains of the structured events. The main advantage of using natural language processing method for log text classification is that no information of log message structure or log message source, or log purpose is required if there is enough data for classificator model training. Developed diagnostic procedure has accuracy score comparable with existing methods and can target all presented in training set faults without prior log structure research.
© The Authors, published by EDP Sciences, 2019
This is an Open Access article distributed under the terms of the Creative Commons Attribution License 4.0, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.