Page MenuHomePhabricator

[terraform] Add cloudwatch alert for search index lambda failures
ClosedPublic

Authored by will on Feb 21 2024, 10:28 PM.
Tags
None
Referenced Files
Unknown Object (File)
Mon, Jul 1, 10:35 PM
Unknown Object (File)
Mon, Jul 1, 9:51 PM
Unknown Object (File)
Sat, Jun 29, 11:39 PM
Unknown Object (File)
Sat, Jun 29, 11:16 PM
Unknown Object (File)
Fri, Jun 28, 3:48 AM
Unknown Object (File)
Fri, Jun 7, 3:11 PM
Unknown Object (File)
May 7 2024, 5:49 PM
Unknown Object (File)
May 7 2024, 5:48 PM
Subscribers

Details

Summary

This adds a cloudwatch alert for when there are search index lambda errors. An email notification is sent to error-report@comm.app

Test Plan

Tested on staging with my own email will@comm.app and received alarm email

Diff Detail

Repository
rCOMM Comm
Lint
Lint Not Applicable
Unit
Tests Not Applicable

Event Timeline

Harbormaster returned this revision to the author for changes because remote builds failed.Feb 21 2024, 10:36 PM
Harbormaster failed remote builds in B27053: Diff 37423!
will requested review of this revision.Feb 21 2024, 11:08 PM

We don't have a error-report@comm.app mailing list. Did you mean error-reports@comm.app? Or do you want us to create a new one? If so, please pick a different name... this one is too close

We don't have a error-report@comm.app mailing list. Did you mean error-reports@comm.app? Or do you want us to create a new one? If so, please pick a different name... this one is too close

I haven't seen what we use error-reports@comm.app for. If it's mainly for user reported errors, it might be good to create a urgent error mailing list so that devs can pick out errors from the noise

Sounds good. Please make sure that mailing list is created before landing (I can help with that), and please rename the mailing list to avoid confusion with the existing one

Sounds good. Please make sure that mailing list is created before landing (I can help with that), and please rename the mailing list to avoid confusion with the existing one

talked to @will about this. i think we should just use the existing mailing list to avoid having to look through multiple lists to triage errors

This revision is now accepted and ready to land.Feb 22 2024, 8:41 PM

i think we should just use the existing mailing list to avoid having to look through multiple lists to triage errors

Note that we already have like 3 lists... keeping it to 3 instead of 4 doesn't "avoid having to look through multiple lists to triage errors"

i think we should just use the existing mailing list to avoid having to look through multiple lists to triage errors

Note that we already have like 3 lists... keeping it to 3 instead of 4 doesn't "avoid having to look through multiple lists to triage errors"

ah, ok. in that case, i'm indifferent to creating a new list