Skip to content

simple batched dot kernel is ~1.7x slower with Const on Titan RTX #9344

simple batched dot kernel is ~1.7x slower with Const on Titan RTX

simple batched dot kernel is ~1.7x slower with Const on Titan RTX #9344