path: root/puppet/modules/site_check_mk
Age, Commit message, Author
2014-12-17Ignore Soledad "Timing out client" warning (Bug #6566)Micah Anderson
Change-Id: I6d3fa5028ba6eaca7b21a7e850136ef980f6e782
2014-12-17Check tapicero heartbeat (Bug #6556)varac
To ensure tapicero is still working, we need to monitor /var/log/syslog for the last tapicero log message, which should not be older than the last check_mk_agent run (currently every 2 minutes).
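The heartbeat idea described above can be sketched roughly as follows. This is only an illustration, not the check shipped in this module: the function names, the classic syslog timestamp prefix, and the 2-minute threshold constant are assumptions made for the example.

```python
from datetime import datetime, timedelta

# Assumed interval between check_mk_agent runs (the commit message says 2 minutes).
MAX_AGE = timedelta(minutes=2)


def last_tapicero_timestamp(lines, year):
    """Return the timestamp of the last syslog line mentioning tapicero, or None."""
    latest = None
    for line in lines:
        if "tapicero" not in line:
            continue
        # Assumed classic syslog prefix: "Dec 17 10:15:02 host tapicero[921]: ..."
        try:
            stamp = datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")
        except ValueError:
            continue  # skip lines without a parsable timestamp
        latest = stamp
    return latest


def heartbeat_ok(lines, now):
    """True if tapicero logged something within MAX_AGE before `now`."""
    latest = last_tapicero_timestamp(lines, now.year)
    return latest is not None and now - latest <= MAX_AGE
```

A caller would read /var/log/syslog, pass its lines together with the current time, and report a critical state when `heartbeat_ok` returns False.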
2014-12-17Merge branch 'micah/platform-feature/6544' into developvarac
Conflicts: puppet/modules/site_check_mk/files/agent/logwatch/bigcouch.cfg Change-Id: I1646e49ffa5437a861b402b755bc15943c42ec4f
2014-12-16Ignore "Generic server terminating" bigcouch message (Feature #6544)Micah Anderson
Change-Id: I73defd7964501e4eabe7dd05c02887e7aeb2f063
2014-12-16Merge branch 'bug/6545' into 'develop'varac
Bug/6545 See merge request !16
2014-12-16Ignore postfix "too many errors after DATA" logwatch msg (Bug #6545)Micah Anderson
Change-Id: I0abeb88f7b6548e5742bd3d99b2f4e5d9c6cf421
2014-12-16ignore additional bigcouch error messages (#6512)Micah Anderson
Change-Id: Ie51fb485bcae9a9467c465bdd1b4a5785023db04
2014-12-16Move kernel ipv6 log message up before the 'C error' line so it is caught (#6540)Micah Anderson
Change-Id: I1fe8d4cf60532dfe01cfb3a014c4cbeb4acdc479
2014-12-11Ignore additional tapicero message (#6542):Micah Anderson
tapicero[921]: Checking security of user-1b3b1fb78db851190fa72dac01207b8d failed (trying again soon): RestClient::ResourceNotFound: 404 Resource Not Found: {"error":"not_found","reason":"Database does not exist."}") tapicero recovers from this error Change-Id: Ic105823ddc282512000e6d7445539428581eb997
2014-12-11Increase max_check_attempts for hosts checks (Bug #6535)varac
Change-Id: I10ec569821f329e3bd10ac87242db102e9c82246
2014-12-11Merge branch '6539_increase_time_between_check_mk_agent_runs' into 'develop'Micah
6539 increase time between check mk agent runs https://leap.se/code/issues/6539 See merge request !11
2014-12-11Increase time between two check_mk_agent runs (Bug #6539)varac
Right now, check_mk_agent is run every minute on each host. The soledad sync test depends on tapicero, and between finishing the soledad test and removing the testuser db on the one hand, and the start of another test on the other, there are only 13s. Change-Id: I5b22ba02470cce799a12043d21091c0c9b8e0b5f
2014-12-11logwatch: ignore ipv6 icmp errors (Bug #6540)Micah Anderson
Change-Id: I198c5245c7e73d6dd7a7d9725fac1eb9a8f425a5
2014-12-10update ffa53ef321bbfd771afff1ccb230d1b5e4f9ab00 to fix ordering requirement in logwatch, remove extended regexp character class and also ignore "Writing security" linesMicah Anderson
Change-Id: I7d33725db06a40361a3b04f9591adeb6a025bf77
2014-12-10Merge branch 'bug/6512' into 'develop'varac
Bug/6512 See merge request !5
2014-12-10ignore transient Tapicero errors when creating a db (Bug #6511)varac
Change-Id: I0939070482fad4f99f03e41094a3df42ff5063e4
2014-12-09Ignore rexi_EXIT bigcouch error (Bug #6512)Micah Anderson
Change-Id: I03842b65329aabb012cc2c7514007e174cbd8fc0
2014-12-09 logwatch: ignore postfix errors on lost connection (Bug #6476)varac
Change-Id: I0b1eec11a3b3da39d65572b6bee8b3ce892e08ac
2014-12-04remove webapp python tests, because they are integrated into the platform now (Bug #6489)varac
Change-Id: Iaec748a173b6e11bb3ab3c11ca152809817644f9
2014-12-04Merge "Change nagios mail To: Header to contain the actual platform environment's contact email (Bug #6466)" into developVarac
2014-12-02Change nagios mail To: Header to contain the actual platform environment's contact email (Bug #6466)Micah Anderson
Change-Id: Ib86ae771e0ac3b6f329a517a8a31c9ec54d33a05
2014-12-02Ignore bigcouch conflict errors, mainly coming from tapicero creating new users (Feature #6481)varac
There are potentially many tapicero daemons running, and they all try to do the same thing at the same time. It is basically designed to create race conditions. All tapicero daemons try to create the user db at the same time. Only one of them wins the race and actually creates it. We need to fix this later (see https://leap.se/code/issues/6480) but for now, we ignore them because conflict errors should be handled by the application anyway. Change-Id: I91095b1901d238e3d199954ba3716023d3fd49c1
2014-12-01Increase the nagios alert thresholds for bigcouch open file descriptors (#6473)Micah Anderson
Change-Id: I2549d781427fffc865c2bdcd1e950d60dad509fd
2014-12-01Increase nagios max_check_attempts from 1 to 4 so notifications won't be sent out on first failed check_mk check (Bug #6461)varac
Change-Id: I1bd47b3c3d17508488a4db90d74118006d85a03a
2014-12-01ignore ntp check because it's flapping too often (Bug #6407)varac
Change-Id: I52e19bbdfcf6576bd9c247d99aace47eb86c8116
2014-11-20Fix Check_mk notifications (Bug #6403)varac
Let check_mk put all hosts into the same "admin" contactgroup, which is defined as default contactgroup by nagios. Change-Id: I13b434925711ef2037de0cf6e919ce39a8255a94
2014-10-08every environment is defined as nagios hostsgroup (#5216)varac
Change-Id: I6508ce0d06b37a1c5601a0e981a59f7fda47f76a
2014-04-29nagios: make the check_procs tests for leap_mx and soledad be much more specific, to avoid catching unrelated processes (#5327)Micah Anderson
Change-Id: I63ffcd644a85137708712daac671b92898c70b7e
2014-04-10fix check_mk resource dependency deploy errors (Bug #5272)varac
2014-03-25 ignore openvpn TLS initialization errors (Feature #5374)varac
2014-03-13catch errors when tapicero fails to create a userdb (Feature #5306)varac
2014-03-12check if soledad is working (Feature #5239)varac
2014-03-05use the right package dependencies for site_check_mk::agent class and subclassesvarac
2014-03-04remove trailing whitespaces from logwatch config filesvarac
2014-03-04use curly brackets for variables in check_leap_mx.sh output, see ↵varac
https://review.leap.se/r/160/#comment156
2014-03-04don't use storedconfigs for check_mk, requires current check_mk module (#5253)varac
2014-03-04fix check_mk resource dependencies (Bug #5145)varac
2014-03-04ignore bigcouch 'Shutting down group server' error (#5246)varac
2014-03-04ignore failing creation of user-dbs by tapicero, see #5168varac
2014-03-04ignore stunnel 'Connection refused' errors that happen too often until we fix #5218varac
2014-03-04ignore 'epmd: got partial packet only on file descriptor' (#5244)varac
2014-03-03ignore "Uncaught error in HTTP request: {exit, normal}" error (#5226)varac
2014-03-03move generic syslog patterns to the end of syslog.cfg so we can ignore patterns firstvarac
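Why pattern order matters can be illustrated with a small sketch. It assumes that the first matching pattern decides how a log line is classified, which is what moving the generic patterns to the end implies. The rule list, the `classify` helper, and the sample log line are hypothetical and not taken from syslog.cfg.

```python
import re

# Hypothetical rules in file order: (level, regex).
# "I" = ignore, "C" = critical; specific ignore rules come before generic ones.
RULES = [
    ("I", r"stunnel: .*Peer suddenly disconnected"),
    ("C", r"[Ee]rror"),
]


def classify(line, rules=RULES):
    """Return the level of the first rule whose regex matches, or None."""
    for level, pattern in rules:
        if re.search(pattern, line):
            return level
    return None
```

With the generic `[Ee]rror` rule listed first, the same stunnel line would be reported as critical, so the ignore rule would never take effect.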
2014-03-03ignore stunnel pattern "Peer suddenly disconnected" (#5218)varac
2014-03-03check syslog for bigcouch error "epmd: got partial packet only on file descriptor"varac
2014-03-03check soledad.log also for Upper case pattern 'Error'varac
2014-03-03check syslog for bigcouch restartsvarac
2014-03-03check leap_mx (Feature #5175)varac
2014-03-03check soledad.log for 'Timing out client:'varac
2014-03-03check leap_mx queuevarac