Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Automation App Docker Monitoring #105

Open
bshien opened this issue Nov 11, 2024 · 1 comment
Open

[FEATURE] Automation App Docker Monitoring #105

bshien opened this issue Nov 11, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@bshien
Copy link
Collaborator

bshien commented Nov 11, 2024

Is your feature request related to a problem?

Coming from #76, the automation app docker collects GitHub events and stores them into S3.

We require a way to know if this app goes down so we can act immediately to bring it back up to minimize the events missed.

What solution would you like?

Have a script that creates and deletes a label every 15 minutes, triggering an event in the OpenSearch Project org, and have the automation app check when it listens on an event to see if the event is this particular canary event. If it is, then send a metric to CloudWatch. Then, we can have an alarm that triggers when no metrics are being sent, so our existing monitoring will be able to send out emails and slack notifications.

The script can authenticate as the opensearch-trigger-bot to create this label.

Do you have any additional context?

Part of #76

@bshien bshien added enhancement New feature or request untriaged Issues that have not yet been triaged labels Nov 11, 2024
@bshien bshien self-assigned this Nov 11, 2024
@bshien bshien moved this from 🆕 New to 🏗 In progress in Engineering Effectiveness Board Nov 11, 2024
@bshien bshien moved this from 🏗 In progress to 🆕 New in Engineering Effectiveness Board Nov 11, 2024
@zelinh zelinh moved this from 🆕 New to 🏗 In progress in Engineering Effectiveness Board Nov 11, 2024
@dblock dblock removed the untriaged Issues that have not yet been triaged label Dec 2, 2024
@dblock
Copy link
Member

dblock commented Dec 2, 2024

[Catch All Triage - 1, 2, 3]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: 🏗 In progress
Development

No branches or pull requests

2 participants