
Feat (llm/learned_round): fast block update #1110

Merged · 12 commits into Xilinx:dev on Dec 5, 2024

Conversation

Giuseppe5 (Collaborator)

Reason for this PR

The inter-block update in learned round can be very slow for large models.

Changes Made in this PR

We assume that blocks are sequential, so the output of each block is the input to the next.
Furthermore, we assume that all kwargs stay unchanged across blocks (typical in LLMs).

Under these assumptions, we can run two block-level forwards per block instead of going through the entire model twice.
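A minimal sketch of the idea, assuming each block is a module that maps a tensor to a tensor; the names `propagate_block_outputs`, `fp_args`, `quant_args`, and `block_kwargs` are hypothetical, not the actual Brevitas API:

```python
import torch

@torch.no_grad()
def propagate_block_outputs(blocks, fp_args, quant_args, block_kwargs):
    # Assumes blocks are strictly sequential, so each block's output is
    # the next block's only input, and that kwargs (e.g. attention mask,
    # position ids) are shared by every block, as is typical in LLMs.
    for block in blocks:
        # ... optimize the learned rounding parameters of `block` here ...

        # Two block-level forwards replace two full-model passes: one to
        # advance the cached floating-point inputs, one to advance the
        # (partially) quantized inputs.
        fp_args = [block(x, **block_kwargs) for x in fp_args]
        quant_args = [block(x, **block_kwargs) for x in quant_args]
    return fp_args, quant_args
```

By contrast, the slow path re-runs the entire model from the start twice per block (once in floating point, once quantized), which is where the saving comes from for deep LLMs.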

Testing Summary

NA

Risk Highlight

The limitations are described above. The flag should be left set to False unless the user knows what they are doing; this could potentially be improved in the future.

  • This PR includes code from another work (please detail).
  • This PR contains API-breaking changes.
  • This PR depends on work in another PR (please provide links/details).
  • This PR introduces new dependencies (please detail).
  • There are coverage gaps not covered by tests.
  • Documentation updates required in subsequent PR.

Checklist

  • Code comments added to any hard-to-understand areas, if applicable.
  • Changes generate no new warnings.
  • Updated any relevant tests, if applicable.
  • No conflicts with destination dev branch.
  • I reviewed my own code changes.
  • Initial CI/CD passing.
  • 1+ reviews given, and any review issues addressed and approved.
  • Post-review full CI/CD passing.

@pablomlago (Collaborator) left a comment:

LGTM. I'd open an issue to refactor and rely on save_inputs_output as much as possible, to prevent duplicating the block forward code.

@Giuseppe5 requested a review from @pablomlago on December 5, 2024, at 10:27.
@@ -602,26 +603,28 @@ def apply_learned_round(

# Initialize cache to store partial inputs and outputs for each block
cache.initialize_cache()

floating_point_datasets = []
Inline review comment:
floating_point_datasets is no longer used after the changes, right?

@Giuseppe5 merged commit 72b7f66 into Xilinx:dev on Dec 5, 2024.
23 checks passed