FONAS: FPGA Optimized Neural Architecture Search for Hardware Efficiency through Evolutionary Search

Summary

This project focuses on searching for efficient deep neural architectures (FPGANets) tailored for image classification tasks while adhering to constraints such as arithmetic intensity and latency for FPGA deployment. Our proposed FPGANets outperform existing networks in terms of both latency and accuracy on the ImageNet-1k dataset. The project also received a research grant from LogicTronix (AMD-Xilinx Partner).

Project Work and Methodologies

Compressing EfficientNet-V2: Implementation of channel pruning techniques to reduce the model size and enhance inference speed for FPGA deployment.
Leveraging NAS techniques: Automation of task-specific neural network creation to target latency requirements on FPGA platforms.
Hardware Optimization: Co-designing models and hardware for improved efficiency and performance.

Hardware NAS Focus

Optimization Goals: Minimizing latency, maximizing accuracy, and efficient resource utilization on FPGA platforms.
Architecture Sampling: Generating diverse architectures meeting hardware constraints like latency and resource usage.
Evaluation Metrics: Assessing performance based on accuracy, inference speed, and resource utilization for optimal architecture selection.

Key Results and Findings

Compressed EfficientNet-V2: Achieved an 88% reduction in channel count resulting in a model that is 14 times smaller, 2.5 times faster in inference speed, and has significantly fewer parameters and MAC operations.
A comprehensive latency table was developed to aid latency estimation, and an accuracy predictor was used to filter out the best performing networks.
Over 40 million architectures within the MobileNet-V3 search space were explored, targeting the Ultra96-V2 MPSoC board.
An evolutionary search process was employed to discover seven architectures varying in size and latency.
FPGANet_L25 emerged as a standout, achieving a top-1 accuracy of 79.12% on ILSVRC2012 within 25ms of inference time.
The proposed approach outperformed established networks like EfficientNet-B0, ResNet-50, and MobileNet-V3 in both inference latency (measured on FPGA) and accuracy.
A comprehensive latency table was developed to aid latency estimation, and an accuracy predictor was used to filter out the best performing networks.

Name		Name	Last commit message	Last commit date
Latest commit History 64 Commits
Evolutionary_Search		Evolutionary_Search
FPGA_EfficientNet_V2_implementation		FPGA_EfficientNet_V2_implementation
FPGAnet_quantization		FPGAnet_quantization
Latency_Profiled_Mobilenetv3_blocks(FPGA)		Latency_Profiled_Mobilenetv3_blocks(FPGA)
Mobilenet_V3_Search_space_blocks		Mobilenet_V3_Search_space_blocks
Retraining on Imagenet		Retraining on Imagenet
Search_Notebooks_and_Results		Search_Notebooks_and_Results
.gitignore		.gitignore
FPGA_based_Accelerator_Co_design_using_Neural_Arch.pdf		FPGA_based_Accelerator_Co_design_using_Neural_Arch.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FONAS: FPGA Optimized Neural Architecture Search for Hardware Efficiency through Evolutionary Search

Summary

Project Work and Methodologies

Hardware NAS Focus

Key Results and Findings

About

Releases

Packages

Contributors 4

Languages

FPGA-Vision/FPGA-Optimized-Neural-Architecture-Search

Folders and files

Latest commit

History

Repository files navigation

FONAS: FPGA Optimized Neural Architecture Search for Hardware Efficiency through Evolutionary Search

Summary

Project Work and Methodologies

Hardware NAS Focus

Key Results and Findings

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages