Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: Flaky alerting dashboards cypress tests during Jenkins executions #4791

Closed
AWSHurneyt opened this issue Jun 18, 2024 · 3 comments
Closed
Labels
bug Something isn't working

Comments

@AWSHurneyt
Copy link
Contributor

Describe the bug

When the jenkins job runs the cypress tests in the Functional test repo, we found that 2 alerting dashboards cypress tests are flaky when executed using a security-enabled cluster. As described in this comment, we can see that each of the tests passes in previous test runs.
opensearch-project/alerting-dashboards-plugin#975 (comment)

To reproduce

We have difficulty reproducing the flakiness locally. The 2 tests that are called out as failing in the issue above pass reliably when executed locally using a security-enabled cluster.

  1. Use docker to create a security-enabled domain with the frontend running
  2. Use the steps in the functional test repo developer guide to execute the tests against the docker cluster - https://github.com/opensearch-project/opensearch-dashboards-functional-test/blob/main/DEVELOPER_GUIDE.md

Expected behavior

No response

Screenshots

This test fail in this instance because the "select your tenant" window wasn't closed.
Screenshot 2024-06-18 at 1 17 41 PM

This test failed because it timed out waiting for the UI to load.
https://ci.opensearch.org/ci/dbc/integ-test-opensearch-dashboards/2.15.0/7742/linux/x64/deb/test-results/5966/integ-test/alertingDashboards/with-security/cypress-screenshots/plugins/alerting-dashboards-plugin/document_level_monitor_spec.js/DocumentLevelMonitor+--+can+be+created+--+by+extraction+query+editor+--+before+each+hook+%28failed%29.png

This test failed because it timed out waiting for the UI to load.
Screenshot 2024-06-18 at 4 29 53 PM

Host / Environment

No response

Additional context

No response

Relevant log output

No response

@AWSHurneyt AWSHurneyt added bug Something isn't working untriaged Issues that have not yet been triaged labels Jun 18, 2024
@getsaurabh02
Copy link
Member

thanks @AWSHurneyt for creating the issue. When we say these tests are passing reliably when executed locally using a security-enabled cluster, are we running these individual tests or the full suite? I am wondering since the issue is related to the timeout, where test failed because it timed out waiting for the UI to load, it is a cleanup issue when all tests run together?

Can you also point out any resource contention issue, or limitations that these test might be running into based on the OS and OSD logs from the point of failures?

@gaiksaya gaiksaya removed the untriaged Issues that have not yet been triaged label Jun 20, 2024
@gaiksaya
Copy link
Member

[Triage] Couple of questions:

  1. Are these tests passing on GitHub Actions workflow?
  2. Do they pass in single run or need to be run multiple times on GHA/local as well?
  3. Can you mention if there are any additional configs (JVM, memory settings, etc) being added this test cluster while running the tests?

Please work with us before next release to get this fixed. Thanks!

@getsaurabh02 getsaurabh02 moved this from 🆕 New to Later (6 months plus) in Engineering Effectiveness Board Jul 18, 2024
@bshien
Copy link
Contributor

bshien commented Nov 25, 2024

Closing due to no activity for a few months and we have updated our test workflows with a few patches, please report back if there are more flaky tests in the new release, thanks

@bshien bshien closed this as completed Nov 25, 2024
@github-project-automation github-project-automation bot moved this from Backlog to ✅ Done in Engineering Effectiveness Board Nov 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: ✅ Done
Development

No branches or pull requests

4 participants