[CI] CrossClusterAsyncQueryIT testSuccessfulPathways failing #123721

Closed
elasticsearchmachine opened this issue Feb 28, 2025 · 5 comments
Labels
  • :Analytics/ES|QL - AKA ESQL
  • low-risk - An open issue or test failure that is a low risk to future releases
  • Team:Analytics - Meta label for analytical engine team (ESQL/Aggs/Geo)
  • >test-failure - Triaged test failures from CI

Comments

@elasticsearchmachine
Collaborator

Build Scans:

Reproduction Line:

./gradlew ":x-pack:plugin:esql:internalClusterTest" --tests "org.elasticsearch.xpack.esql.action.CrossClusterAsyncQueryIT.testAsyncQueriesWithLimit0" -Dtests.seed=400DB4231EF1C473 -Dtests.jvm.argline="-Des.concurrent_search=true" -Dtests.locale=su-Latn-ID -Dtests.timezone=EAT -Druntime.java=23

Applicable branches:
9.0

Reproduces locally?:
N/A

Failure History:
See dashboard

Failure Message:

java.lang.AssertionError: timeout waiting for query response

Issue Reasons:

  • [9.0] 2 failures in test testSuccessfulPathways (1.2% fail rate in 164 executions)

Note:
This issue was created using new test triage automation. Please report issues or feedback to es-delivery.

elasticsearchmachine added the :Analytics/ES|QL, >test-failure, Team:Analytics, and needs:risk labels Feb 28, 2025
@elasticsearchmachine
Collaborator Author

Pinging @elastic/es-analytical-engine (Team:Analytics)

dnhatn self-assigned this Mar 2, 2025
dnhatn added the low-risk label and removed the needs:risk label Mar 2, 2025
dnhatn assigned smalyshev and unassigned dnhatn Mar 4, 2025
@dnhatn
Member

dnhatn commented Mar 4, 2025

@smalyshev I took a quick look at the test failure, and it seems the first test to fail is testStopQueryLocal. Would you mind taking a look? Let me know if you need any help with it. Thanks!

2> REPRODUCE WITH: ./gradlew ":x-pack:plugin:esql:internalClusterTest" --tests "org.elasticsearch.xpack.esql.action.CrossClusterAsyncQueryIT.testStopQueryLocal" -Dtests.seed=14C1D323891169E4 -Dtests.locale=en-GY -Dtests.timezone=Africa/Conakry -Druntime.java=23
2> java.lang.AssertionError: 
  Expected: not <running>
       but: was <running>
      at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:18)
      at org.hamcrest.MatcherAssert.assertThat(MatcherAssert.java:6)
      at org.elasticsearch.test.ESTestCase.assertThat(ESTestCase.java:2625)
      at org.elasticsearch.xpack.esql.action.EsqlAsyncTestUtils.lambda$waitForCluster$0(EsqlAsyncTestUtils.java:99)
      at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:1485)
      at org.elasticsearch.test.ESTestCase.assertBusy(ESTestCase.java:1458)
      at org.elasticsearch.xpack.esql.action.EsqlAsyncTestUtils.waitForCluster(EsqlAsyncTestUtils.java:89)
      at org.elasticsearch.xpack.esql.action.CrossClusterAsyncQueryIT.testStopQueryLocal(CrossClusterAsyncQueryIT.java:367)
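
For readers unfamiliar with this failure shape: the trace shows a retry-until-timeout assertion giving up while the cluster status was still "running". The following is only a minimal sketch of that pattern, not the actual ESTestCase.assertBusy or EsqlAsyncTestUtils.waitForCluster code. The class name BusyWaitSketch, the helper assertNotRunning, the string statuses, and the backoff values are all assumptions made for illustration.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

// Hypothetical sketch of a busy-wait assertion; names and timings are illustrative only.
class BusyWaitSketch {

    // Re-runs the check with a growing sleep between attempts until it passes
    // or the time budget is spent; the last failure is rethrown on timeout.
    public static void assertBusy(Runnable check, long maxWait, TimeUnit unit) throws InterruptedException {
        long deadline = System.nanoTime() + unit.toNanos(maxWait);
        long sleepMillis = 1;
        AssertionError last = null;
        while (System.nanoTime() < deadline) {
            try {
                check.run();
                return; // the condition finally held
            } catch (AssertionError e) {
                last = e; // remember the failure and retry
            }
            Thread.sleep(sleepMillis);
            sleepMillis = Math.min(sleepMillis * 2, 100); // assumed backoff cap
        }
        // One final attempt after the deadline; its failure is what the test reports.
        try {
            check.run();
        } catch (AssertionError e) {
            if (last != null) {
                e.addSuppressed(last);
            }
            throw e;
        }
    }

    // Stand-in for the "not <running>" matcher seen in the trace.
    public static void assertNotRunning(Supplier<String> status) {
        String s = status.get();
        if ("running".equals(s)) {
            throw new AssertionError("Expected: not <running> but: was <" + s + ">");
        }
    }

    public static void main(String[] args) throws InterruptedException {
        // A status that leaves "running" on the third poll passes.
        AtomicInteger polls = new AtomicInteger();
        Supplier<String> finishing = () -> polls.incrementAndGet() < 3 ? "running" : "successful";
        assertBusy(() -> assertNotRunning(finishing), 1, TimeUnit.SECONDS);
        System.out.println("passed after " + polls.get() + " polls");

        // A status that never leaves "running" fails once the budget is spent.
        try {
            assertBusy(() -> assertNotRunning(() -> "running"), 50, TimeUnit.MILLISECONDS);
        } catch (AssertionError e) {
            System.out.println("timed out: " + e.getMessage());
        }
    }
}
```

In this sketch a query that never leaves the running state surfaces exactly as in the trace: the check itself is correct, but the condition never becomes true within the wait budget.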

@smalyshev
Contributor

@dnhatn Yes, testStopQueryLocal has a stability issue now. See also #121672. Not sure yet what's going on there; somehow it doesn't run smoothly even though other tests doing the same things are fine. I'll ping you later if I can't find what's up there.

@smalyshev
Contributor

Ah, this is also 9.0, so it's likely related to #123334. I've backported the lock fix, but if it keeps causing issues we may also want to backport moving the stop tests to their own class, as they are in main.

@smalyshev
Contributor

Looks like a one-time issue caused by testStopQueryLocal that is no longer repeating, so closing this for now.
