Fix Flaky Tests #636

rawfalafel · 2018-10-09T13:35:06Z

This fixes #377, as well as flaky tests in sync and rpc that were reproduced on buildkite but haven't been reported on GitHub.

I didn't find any race conditions for blockchain, casper, types, so I re-enabled race there as well.

rawfalafel · 2018-10-09T13:36:35Z

beacon-chain/node/node_test.go

-		t.Fatalf("Failed to create BeaconNode: %v", err)
-	}
-
-	go node.Start()


The problem here is that this goroutine is still running during execution of the next test. This isn't actually testing anything, so I just deleted it.

rawfalafel · 2018-10-09T13:37:19Z

beacon-chain/rpc/service_test.go

@@ -250,10 +249,25 @@ func TestLatestAttestation(t *testing.T) {
 	}(t)

 	rpcService.incomingAttestation <- attestation
+	rpcService.cancel()
+	exitRoutine <- true


The issue here is that the test isn't waiting for the goroutine to finish.

codecov · 2018-10-09T13:37:55Z

Codecov Report

Merging #636 into master will decrease coverage by 0.17%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master     #636      +/-   ##
==========================================
- Coverage   73.14%   72.96%   -0.18%     
==========================================
  Files          52       52              
  Lines        3459     3459              
==========================================
- Hits         2530     2524       -6     
- Misses        706      713       +7     
+ Partials      223      222       -1

Impacted Files	Coverage Δ
beacon-chain/node/node.go	`48.46% <0%> (-3.07%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update cd2073e...3cabce9. Read the comment docs.

rawfalafel · 2018-10-09T13:44:49Z

beacon-chain/rpc/service_test.go

@@ -262,9 +276,10 @@ func TestLatestAttestation(t *testing.T) {
 		<-exitRoutine
 	}(t)
 	rpcService.incomingAttestation <- attestation
-	testutil.AssertLogsContain(t, hook, "Sending attestation to RPC clients")


Same here. The assertion needs to come after exitRoutine <- true.

Another issue is that this test originally combined two tests that should have been separated.

func TestBadExample(t *testing.T) { go func() { testSomething(testChan) <-exitRoutine } testChan <- struct{}{} assertSomething() go func() { testSomething(testChan) <-exitRoutine } testChan <- struct{}{} exitRoutine <- true assertSomethingElse() }

The above is a race condition on assertSomething(). The fix is to separate into two tests:

func TestSomething(t *testing.T) { go func() { testSomething(testChan) <-exitRoutine } testChan <- struct{}{} exitRountine <- true assertSomething() } fun TestSomethingElse(t *testing.T){ go func() { testSomething(testChan) <-exitRoutine } testChan <- struct{}{} exitRoutine <- true assertSomethingElse() }

In general, each test should assert one case anyways. Long tests that sequentially assert multiple cases are hard to read/debug.

rawfalafel · 2018-10-09T13:47:13Z

beacon-chain/sync/service_test.go

@@ -265,7 +265,22 @@ func TestBlockRequestErrors(t *testing.T) {
 	}

 	ss.blockRequestBySlot <- invalidmsg
+	ss.cancel()
+	exitRoutine <- true


This test had the same issue. They need to be separated, and the exitRoutine check was missing.

rauljordan

Amazing - thank you!

terencechain

👍

Yutaro Mori added 9 commits October 9, 2018 17:26

Turn off race condition flags

02f47c5

disable cache

11826ec

increase attempts

959ffc9

typo

55ddb8d

more typos

fe57be9

race condition fixes

7890761

more test flakiness

b7d36fe

remove flaky test

009b937

revert buildkite config

3cabce9

rawfalafel commented Oct 9, 2018

View reviewed changes

rauljordan changed the title ~~Fix flaky tests~~ Fix Flaky Tests Oct 9, 2018

rauljordan approved these changes Oct 9, 2018

View reviewed changes

terencechain approved these changes Oct 9, 2018

View reviewed changes

rauljordan merged commit 3e8a450 into master Oct 9, 2018

rawfalafel deleted the flakiness branch October 9, 2018 15:59

This was referenced Oct 9, 2018

Blockchain Tests Fail With Race Detection #412

Closed

Types Tests Fail With Race Detection #604

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Flaky Tests #636

Fix Flaky Tests #636

rawfalafel commented Oct 9, 2018

rawfalafel Oct 9, 2018

rawfalafel Oct 9, 2018

codecov bot commented Oct 9, 2018 •

edited

Loading

rawfalafel Oct 9, 2018

rawfalafel Oct 9, 2018

rauljordan left a comment

terencechain left a comment

Fix Flaky Tests #636

Fix Flaky Tests #636

Conversation

rawfalafel commented Oct 9, 2018

rawfalafel Oct 9, 2018

Choose a reason for hiding this comment

rawfalafel Oct 9, 2018

Choose a reason for hiding this comment

codecov bot commented Oct 9, 2018 • edited Loading

Codecov Report

rawfalafel Oct 9, 2018

Choose a reason for hiding this comment

rawfalafel Oct 9, 2018

Choose a reason for hiding this comment

rauljordan left a comment

Choose a reason for hiding this comment

terencechain left a comment

Choose a reason for hiding this comment

codecov bot commented Oct 9, 2018 •

edited

Loading