Commit

[Hotfix] correct occupancy calculation (#1451)
KKyang authored Dec 16, 2024
1 parent 0ed9795 commit 41767d9
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion tensilelite/Tensile/KernelWriterAssembly.py
@@ -97,7 +97,7 @@ def getOccupancy(self, numThreads, vgprs, sgprs, ldsSize, accvgprs=0, doubleVgpr
       vgprLimitedOccupancy = self.getVgprOccupancy(numThreads, vgprs, doubleVgpr)
       accvgprLimitedOccupancy = self.getVgprOccupancy(numThreads, accvgprs, doubleVgpr)
     else:
-      vgprLimitedOccupancy = self.getVgprOccupancy(numThreads, ceil(vgprs//8)*8+accvgprs, doubleVgpr)
+      vgprLimitedOccupancy = self.getVgprOccupancy(numThreads, ceil(vgprs/8)*8+accvgprs, doubleVgpr)
       accvgprLimitedOccupancy = vgprLimitedOccupancy
     sgprLimitedOccupancy = self.getSgprOccupancy(sgprs)
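
Note on the one-character change: `vgprs//8` is floor division, so it already yields an integer and the surrounding `ceil()` has nothing left to round. The old expression therefore rounded `vgprs` down to a multiple of 8, which can understate register use and overstate occupancy. With true division, `ceil(vgprs/8)*8` rounds up instead. The sketch below isolates just that arithmetic; the helper names `round_floor_div` and `round_true_div` are hypothetical and not part of Tensile, and the multiple of 8 is simply taken from the diff.

```python
from math import ceil

# Hypothetical helpers that isolate the arithmetic from the diff;
# they are not functions in KernelWriterAssembly.py.

def round_floor_div(vgprs, granule=8):
    # Pre-fix expression: // floors first, so ceil() is a no-op
    # and the result is vgprs rounded DOWN to a multiple of granule.
    return ceil(vgprs // granule) * granule

def round_true_div(vgprs, granule=8):
    # Post-fix expression: / keeps the fraction, so ceil() rounds
    # vgprs UP to a multiple of granule.
    return ceil(vgprs / granule) * granule

if __name__ == "__main__":
    for vgprs in (64, 65, 71, 72):
        print(vgprs, round_floor_div(vgprs), round_true_div(vgprs))
```

The two expressions agree only when `vgprs` is already a multiple of 8; for every other value the pre-fix form reports 8 fewer registers than the rounded-up count.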
