gms | German Medical Science

64. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS)

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie

08. - 11.09.2019, Dortmund

PLIRF – simple and secure log-file monitoring for early error detection and correction

Meeting Abstract

Search Medline for

  • Benjamin Winter - Universitätsmedizin Greifswald, Institut für Community Medicine, Greifswald, Germany
  • Stephan Struckmann - Universitätsmedizin Greifswald, Institut für Community Medicine, Greifswald, Germany
  • Carsten Oliver Schmidt - Universitätsmedizin Greifswald, Institut für Community Medicine, Greifswald, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 64. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e.V. (GMDS). Dortmund, 08.-11.09.2019. Düsseldorf: German Medical Science GMS Publishing House; 2019. DocAbstr. 225

doi: 10.3205/19gmds116, urn:nbn:de:0183-19gmds1166

Published: September 6, 2019

© 2019 Winter et al.
This is an Open Access article distributed under the terms of the Creative Commons Attribution 4.0 License. See license information at http://creativecommons.org/licenses/by/4.0/.


Outline

Text

Introduction: Complex epidemiologic studies such as the Study of Health in Pomerania (SHIP) generate and integrate data from different origins such as laboratory measurements, imaging, clinical examinations, interviews, and many more [1]. Within EUthyroid, a Horizon 2020 project, web applications were utilized to collect metadata of several European and non-European iodine monitoring studies [2]. A centralized management of the data requires efficient workflows to keep track of any issues occurring during ETL (extract, transform, load) and subsequent data management procedures.

Background: Automatization of procedures is a standard to manage large data quantities. In SHIP for example, medical documentation specialists daily run a large number of scripts to integrate, clean and check data. These scripts, as well as proprietary devices write log-files that may contain error messages and information about technical problems. However, log-files are long, repetitive texts. This makes the extraction of relevant information difficult for human readers, especially when extensive batch jobs are executed daily. Important issues may go unnoticed with potentially serious consequences.

Concept: Conventions for writing log-files are not very strict. Usually, they list one event per row in chronological order amended by time stamps and severity classifications in plain-text format. Therefore, a flexible and easily configurable log-file parser is part of our solution. Relevant information is extracted using configured rules. Since log-files frequently contain confidential information, encryption and access control are necessary as well. The processed log information must trigger some reaction and should therefore not be lost.

Implementation: We developed the software PLIRF ("Parse Logfiles Into Rss Feeds") to automatically filter relevant error information from log-files. PLIRF generates RSS feed files, which can be distributed using methods like encrypted and password protected https-connections. Many reader tools exist for RSS feeds and eligible users can easily subscribe to feeds interesting for them. RSS feeds facilitate a timely response to detected issues which might have gone unnoticed otherwise. PLIRF is fully configurable, reads any plain-text log-file and runs on many operating systems. It filters log-files using search patterns, counts events and recognizes timestamps. It is therefore easily used with any kind of log-files. Even log-files with statistical output, such as data quality indicators, could be handled. PLIRF has been in routine use for several years at SHIP and EUthyroid.

Conclusion: The use of our log-file monitoring software PLIRF has improved our efficiency in dealing with technical problems, configuration problems of proprietary devices as well as with user errors. Especially recurring issues are easier detected and further errors can be prevented. PLIRF is easily integrated with other platforms, and can be shared on request, rendering it a versatile tool for study and data management purposes.

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Völzke H, Alte D, Schmidt CO, Radke D, Lorbeer R, Friedrich N, Aumann N, Lau K, Piontek M, Born G, Havemann C. Cohort profile: the study of health in Pomerania. International journal of epidemiology. 2010 Feb 18;40(2):294-307.
2.
Völzke H, Erlund I, Hubalewska-Dydejczyk A, Ittermann T, Peeters RP, Rayman M, Buchberger M, Siebert U, Thuesen BH, Zimmermann MB, Grünert S. How do we improve the impact of iodine deficiency disorders prevention in Europe and beyond? European thyroid journal. 2018;7(4):193-200.