While looking at different approaches to event parsing, such as Logstash and Fluentd, we also ran our own experiments in that field using Python.
This post describes how we built an infrastructure for logging API calls and preparing the logs for analysis.
We use Fabric for some of our DevOps tasks. Because it is not a full-blown infrastructure management stack like Chef or SaltStack, it lacks some convenient features, such as target host selection. Since we also use Zabbix to monitor our services, a logical solution was to reuse the Zabbix host and host group configuration for host selection in Fabric.
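To make the idea concrete, here is a minimal sketch of the selection step, not the code from the post itself. It assumes the host list has already been fetched from Zabbix and looks roughly like the result of a `host.get` API call that includes each host's groups. The function name `hosts_in_group`, the `SAMPLE_HOSTS` data, and all host and group names are hypothetical.

```python
# Sample data shaped like a Zabbix API `host.get` result that includes each
# host's groups. All host and group names here are made up for illustration.
SAMPLE_HOSTS = [
    {"host": "web01.example.com", "groups": [{"name": "Webservers"}]},
    {"host": "web02.example.com", "groups": [{"name": "Webservers"}, {"name": "Staging"}]},
    {"host": "db01.example.com", "groups": [{"name": "Databases"}]},
]


def hosts_in_group(zabbix_hosts, group_name):
    """Return the sorted technical names of all hosts in the given group.

    Hypothetical helper: it only filters data that is already in memory
    and does not talk to Zabbix itself.
    """
    return sorted(
        entry["host"]
        for entry in zabbix_hosts
        if any(group["name"] == group_name for group in entry.get("groups", []))
    )


if __name__ == "__main__":
    print(hosts_in_group(SAMPLE_HOSTS, "Webservers"))
```

In a Fabric 1.x fabfile, the returned list could then be assigned to `env.hosts` so that subsequent tasks run against the selected group.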
Monitoring tables in SAP Sybase ASE are a valuable tool for detecting bottlenecks (e.g. locks, physical I/O). The “wait event” metrics show the causes of those bottlenecks. This post describes how we got the most out of these metrics.
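One common way to work with such metrics is to compare two samples and rank the events by how much wait time they accumulated in between. The sketch below illustrates that idea only; it assumes the wait time values are cumulative counters and that each sample has already been read into a dictionary keyed by wait event ID. The function name `wait_deltas` and all IDs and values are hypothetical.

```python
def wait_deltas(previous, current):
    """Return (event_id, delta) pairs between two samples, largest delta first.

    Each sample maps a wait event ID to its cumulative wait time
    (an assumption of this sketch). Events whose counter did not grow,
    or went backwards (e.g. after a counter reset), are skipped.
    """
    deltas = {}
    for event_id, total in current.items():
        delta = total - previous.get(event_id, 0)
        if delta > 0:
            deltas[event_id] = delta
    return sorted(deltas.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Made-up sample values, for illustration only.
    earlier = {29: 100, 150: 40, 250: 500}
    later = {29: 180, 150: 40, 250: 520, 51: 5}
    for event_id, delta in wait_deltas(earlier, later):
        print(event_id, delta)
```

Ranking by delta rather than by the raw totals keeps long-running but currently idle events from hiding what is causing waits right now.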