Skip to content
This repository has been archived by the owner on Oct 16, 2023. It is now read-only.

Optimize PReLU #144

Merged
merged 1 commit into from
Sep 25, 2023
Merged

Optimize PReLU #144

merged 1 commit into from
Sep 25, 2023

Conversation

mejai1206
Copy link
Contributor

@mejai1206 mejai1206 commented Sep 22, 2023

๐Ÿ™ Describe the pull request

PReLU๋ฅผ ์ตœ์ ํ™”ํ•˜์˜€์Šต๋‹ˆ๋‹ค.

๐Ÿ’ฌ Additional context

After
image

Before
image

Add any other context or screenshots about the pull request here.

โœ… Checklist

  • Code follows the project's coding conventions and style.
  • Tests have been added or updated to cover the changes.
  • Documentation has been updated, if necessary.

@mejai1206 mejai1206 self-assigned this Sep 22, 2023
@daemyung daemyung added the enhancement New feature or request label Sep 22, 2023
@mejai1206 mejai1206 marked this pull request as ready for review September 25, 2023 00:31
@mejai1206 mejai1206 requested a review from daemyung as a code owner September 25, 2023 00:31
output_y_size: tl.int32,
output_x_size: tl.int32,
size_along_dim: tl.constexpr,

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

์ด ํƒ€์ž…์ด ๋ฐ”๋€” ์ด์œ ๊ฐ€ ์—†์–ด ๋ณด์ด๋Š”๋ฐ์š”?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"require_boundary_check": lambda args: args["size_along_dim"] % args["block_size"],

์—ฌ๊ธฐ ๋“ค์–ด๊ฐ€๋Š” ์—ฐ์‚ฐ๋“ค์€ tl.constexpr์ด์–ด์•ผ ์ปดํŒŒ์ผ๋•Œ ๊ฒฐ์ •๋˜๋Š” ๊ฑธ๋กœ ์ƒ๊ฐํ–ˆ๋Š”๋ฐ, tl.int32์ด์–ด๋„ ๋˜‘๊ฐ™์€ ํšจ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ๋Š”๊ฑธ๊นŒ์š”?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

๋„ค ํœด๋ ˆ์Šคํ‹ฑ์€ CPU์—์„œ ์‹คํ–‰๋˜์š”. tl.constexpr์€ GPU ์ปค๋„ ์ปดํŒŒ์ผ์— ์˜ํ–ฅ์„ ๋ฏธ์น˜๋Š”๊ฑฐ์—์š”.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

์•„์•„ ๋„ต๋„ต ์ˆ˜์ • ํ•„์š”์—†๋Š”๊ฒŒ ๋งž๊ฒ ๊ตฐ์š”. ์ด ๋ถ€๋ถ„์€ ๋‹ค์‹œ ์›๋ณตํ•˜์˜€์Šต๋‹ˆ๋‹ค.

Copy link
Contributor

@kakao-steve-ai kakao-steve-ai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Copy link

@daemyung daemyung left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@daemyung daemyung merged commit cef626c into main Sep 25, 2023
@daemyung daemyung deleted the opt_prelu branch September 25, 2023 02:08
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants