Accelerator_Templete

It can be used to build a spatial accelerator or an accelerator for a specific mapping.

  1. PE_new.v

Each PE has register files for activations, weights, and partial sums.

PE_array_controller must send the correct addresses and enable/selection signals to PE_array for the MAC operations to proceed.
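As a rough illustration of the interaction described above, here is a minimal behavioral sketch in Python. It is not the RTL in PE_new.v: the class name `PE`, the register file depths, the `clear` flag, and the `run_dot_product` helper are all assumptions made for this example. It only shows the idea that the controller supplies addresses and an enable signal, and the PE performs one multiply-accumulate per step.

```python
class PE:
    """Behavioral model of one PE (hypothetical; not the PE_new.v RTL)."""

    def __init__(self, act_depth=4, wgt_depth=4, psum_depth=4):
        # Three separate register files, as described in the README.
        # The depths are assumptions for this sketch.
        self.act_rf = [0] * act_depth
        self.wgt_rf = [0] * wgt_depth
        self.psum_rf = [0] * psum_depth

    def write_act(self, addr, value):
        self.act_rf[addr] = value

    def write_wgt(self, addr, value):
        self.wgt_rf[addr] = value

    def mac(self, act_addr, wgt_addr, psum_addr, en=True, clear=False):
        """One MAC step. The controller drives the addresses, `en`, and `clear`."""
        if not en:
            return  # PE holds its state when not enabled
        base = 0 if clear else self.psum_rf[psum_addr]
        product = self.act_rf[act_addr] * self.wgt_rf[wgt_addr]
        self.psum_rf[psum_addr] = base + product


def run_dot_product(pe, acts, wgts):
    """Hypothetical controller loop: load operands, then issue one MAC per step."""
    for i, (a, w) in enumerate(zip(acts, wgts)):
        pe.write_act(i, a)
        pe.write_wgt(i, w)
    for i in range(len(acts)):
        # Clear the partial sum on the first step, accumulate afterwards.
        pe.mac(act_addr=i, wgt_addr=i, psum_addr=0, clear=(i == 0))
    return pe.psum_rf[0]


if __name__ == "__main__":
    print(run_dot_product(PE(), [1, 2, 3], [4, 5, 6]))
```

If the controller sends a wrong address or leaves the enable low, the partial sum simply does not reflect the intended operands, which is why the address and enable sequencing matters.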

image

  2. psum_su_irrel_new.v

Since the global buffers use high bandwidth (512 bits/cycle), partial sums from the PE array should be accumulated on a clock with a shorter period than the top_module clock.

However, you should consider how the latency of the overall temporal mapping on the register files relates to the latency of psum_accumulator.

In this case, psum_su_irrel_new.v tries to finish the partial sum calculation in a single cycle, which comes at a high hardware cost.
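The trade-off above can be sketched with a small Python model. This is an illustration under assumptions, not a description of psum_su_irrel_new.v: it assumes the single-cycle accumulation behaves like a combinational adder tree, and the function names (`adder_tree`, `adder_count`, `accumulator_keeps_up`) are hypothetical.

```python
def adder_tree(values):
    """Reduce values pairwise; return (sum, number of adder levels)."""
    level = list(values)
    depth = 0
    while len(level) > 1:
        if len(level) % 2:
            level.append(0)  # pad odd-sized levels with a zero input
        level = [level[i] + level[i + 1] for i in range(0, len(level), 2)]
        depth += 1
    return (level[0] if level else 0), depth


def adder_count(n):
    """Two-input adders needed to sum n values in one pass."""
    return max(n - 1, 0)


def accumulator_keeps_up(temporal_cycles, accum_cycles):
    """True if accumulation finishes before the PE array delivers the next
    batch of partial sums (both latencies in the same clock units)."""
    return accum_cycles <= temporal_cycles


if __name__ == "__main__":
    total, levels = adder_tree(range(1, 9))
    print(total, levels, adder_count(8))
    print(accumulator_keeps_up(temporal_cycles=4, accum_cycles=1))
```

Summing n partial sums in one pass needs n - 1 adders arranged in about log2(n) levels, which is where the hardware cost comes from. When the temporal mapping on the register files takes many cycles, a slower multi-cycle accumulator with fewer adders may still keep up.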

image

  3. Top_module.v

The detailed structure varies with the target accelerator.

The following structure is for the programmable/flexible accelerator.

image
