Skip to content
This repository has been archived by the owner on Oct 1, 2020. It is now read-only.

Per channel quant #51

Open
wants to merge 7 commits into
base: master
Choose a base branch
from
Open

Conversation

mortzur
Copy link

@mortzur mortzur commented Feb 25, 2019

This is a preliminary version in order to get some feedback.
The goal is adding support in GEMM with different kernel quantization parameters per output channel.

Changes:

  • Modified version of 4x8 gemm ukernel was added, to support kernel scale and zero-point per output channel.
  • Helper funcitons were added : weights-packing, computing requantization parameters
  • gemm-micro-kernel test function was added with corresponding unit-tests

@mortzur mortzur requested a review from hlu1 February 25, 2019 21:34
Copy link
Contributor

@hlu1 hlu1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great work! Please add the benchmark part. I'll do a more detailed review later.

@@ -362,3 +362,359 @@ void q8gemm_ukernel_4x8__neon(
}
}
}

void q8gemm_per_channel_ukernel_4x8__neon(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you separate this part into a new file?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants