[AUTOCUT] Gradle Check Flaky Test Report for SimpleSearchIT #16851

Open
opensearch-ci-bot opened this issue Dec 16, 2024 · 1 comment
Labels
autocut
flaky-test: Random test failure that succeeds on second run
>test-failure: Test failure from CI, local build, etc.

Comments

@opensearch-ci-bot
Collaborator

Flaky Test Report for SimpleSearchIT

SimpleSearchIT has flaky tests that failed during post-merge actions.

Details

| Git Reference | Merged Pull Request | Build Details | Test Name |
|---|---|---|---|
| d37cc9b | 16831 | 51449 | org.opensearch.search.simple.SimpleSearchIT.classMethod |

The other pull requests, besides those involved in post-merge actions, that contain failing tests with the SimpleSearchIT class are:

For more details on the failed tests, refer to the OpenSearch Gradle Check Metrics dashboard.

opensearch-ci-bot added the >test-failure, autocut, flaky-test, and untriaged labels on Dec 16, 2024.
@sandeshkr419
Contributor

Looking into build logs:

[2024-12-16T08:20:25,037][ERROR][o.o.i.s.IndexShard       ] [node_t0] [test][0] Error Fetching SegmentInfos and latest checkpoint
org.apache.lucene.store.AlreadyClosedException: engine is closed
	at org.opensearch.index.shard.IndexShard.getEngine(IndexShard.java:3728) ~[main/:?]
	at org.opensearch.index.shard.IndexShard.getSegmentInfosSnapshot(IndexShard.java:5367) ~[main/:?]
	at org.opensearch.index.shard.IndexShard.getLatestSegmentInfosAndCheckpoint(IndexShard.java:1728) [main/:?]
	at org.opensearch.index.shard.IndexShard.updateReplicationCheckpoint(IndexShard.java:4868) [main/:?]
	at org.opensearch.index.shard.IndexShard$ReplicationCheckpointUpdater.afterRefresh(IndexShard.java:4862) [main/:?]
	at org.apache.lucene.search.ReferenceManager.notifyRefreshListenersRefreshed(ReferenceManager.java:275) [lucene-core-9.12.0.jar:9.12.0 e913796758de3d9b9440669384b29bec07e6a5cd - 2024-09-25 16:37:02]
	at org.apache.lucene.search.ReferenceManager.doMaybeRefresh(ReferenceManager.java:182) [lucene-core-9.12.0.jar:9.12.0 e913796758de3d9b9440669384b29bec07e6a5cd - 2024-09-25 16:37:02]
	at org.apache.lucene.search.ReferenceManager.maybeRefreshBlocking(ReferenceManager.java:240) [lucene-core-9.12.0.jar:9.12.0 e913796758de3d9b9440669384b29bec07e6a5cd - 2024-09-25 16:37:02]
	at org.opensearch.index.engine.InternalEngine.refresh(InternalEngine.java:1771) [main/:?]
	at org.opensearch.index.engine.InternalEngine.flush(InternalEngine.java:1886) [main/:?]
	at org.opensearch.index.engine.Engine.flush(Engine.java:1214) [main/:?]
	at org.opensearch.index.engine.Engine.flushAndClose(Engine.java:2002) [main/:?]
	at org.opensearch.index.shard.IndexShard.close(IndexShard.java:2056) [main/:?]
	at org.opensearch.index.IndexService.closeShard(IndexService.java:805) [main/:?]
	at org.opensearch.index.IndexService.removeShard(IndexService.java:781) [main/:?]
	at org.opensearch.index.IndexService.close(IndexService.java:495) [main/:?]
	at org.opensearch.indices.IndicesService.removeIndex(IndicesService.java:1160) [main/:?]
	at org.opensearch.indices.cluster.IndicesClusterStateService.removeIndices(IndicesClusterStateService.java:443) [main/:?]
	at org.opensearch.indices.cluster.IndicesClusterStateService.applyClusterState(IndicesClusterStateService.java:284) [main/:?]
	at org.opensearch.cluster.service.ClusterApplierService.callClusterStateAppliers(ClusterApplierService.java:637) [main/:?]
	at org.opensearch.cluster.service.ClusterApplierService.callClusterStateAppliers(ClusterApplierService.java:623) [main/:?]
	at org.opensearch.cluster.service.ClusterApplierService.applyChanges(ClusterApplierService.java:589) [main/:?]
	at org.opensearch.cluster.service.ClusterApplierService.runTask(ClusterApplierService.java:510) [main/:?]
	at org.opensearch.cluster.service.ClusterApplierService$UpdateTask.run(ClusterApplierService.java:205) [main/:?]
	at org.opensearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:932) [main/:?]
	at org.opensearch.common.util.concurrent.PrioritizedOpenSearchThreadPoolExecutor$TieBreakingPrioritizedRunnable.runAndClean(PrioritizedOpenSearchThreadPoolExecutor.java:283) [main/:?]
	at org.opensearch.common.util.concurrent.PrioritizedOpenSearchThreadPoolExecutor$TieBreakingPrioritizedRunnable.run(PrioritizedOpenSearchThreadPoolExecutor.java:246) [main/:?]
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1144) [?:?]
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:642) [?:?]
	at java.base/java.lang.Thread.run(Thread.java:1575) [?:?]
[2024-12-16T08:20:25,057][WARN ][o.o.c.r.a.AllocationService] [node_t0] Falling back to single shard assignment since batch mode 

At first glance, this looks more like an OpenSearch process failure or an index-corruption failure.
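Reading the trace bottom-up, the exception is raised inside a refresh listener (`ReplicationCheckpointUpdater.afterRefresh`) that runs as part of `Engine.flushAndClose` while `IndexShard.close` is removing the index, and `IndexShard` logs it at ERROR level. The pattern can be sketched in isolation as below. This is only an illustration under assumptions: `CloseRefreshSketch` and `Shard` are hypothetical stand-ins and not OpenSearch classes, `IllegalStateException` stands in for Lucene's `AlreadyClosedException`, and the ordering in `close()` (engine made unavailable before the final refresh) is an assumption inferred from the trace, not confirmed against the source.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch; none of these classes are OpenSearch code.
class CloseRefreshSketch {

    static class Shard {
        private boolean engineAvailable = true;
        final List<String> log = new ArrayList<>();

        // Plays the role of getEngine() in the trace: fails once the engine is gone.
        private String getEngine() {
            if (!engineAvailable) {
                throw new IllegalStateException("engine is closed");
            }
            return "engine";
        }

        // Plays the role of the refresh listener: it needs the engine to
        // compute a checkpoint, and logs the failure instead of propagating it.
        void afterRefresh() {
            try {
                getEngine();
                log.add("INFO checkpoint updated");
            } catch (IllegalStateException e) {
                log.add("ERROR Error Fetching SegmentInfos and latest checkpoint: " + e.getMessage());
            }
        }

        // Assumed ordering: the shard stops exposing the engine first, then
        // the final flush refreshes and notifies the listener.
        void close() {
            engineAvailable = false;
            afterRefresh();
        }
    }

    public static void main(String[] args) {
        Shard shard = new Shard();
        shard.afterRefresh(); // normal refresh while the shard is open
        shard.close();        // refresh triggered during close
        shard.log.forEach(System.out::println);
    }
}
```

If this ordering is what happened in the build, the ERROR line could be a side effect of index removal during test cleanup and not necessarily the cause of the test failure. Confirming that would need the rest of the build log, which is truncated here.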
