forked from vllm-project/vllm
    
        
        - 
                Notifications
    You must be signed in to change notification settings 
- Fork 6
prefill metadata splitting #112
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
          
     Draft
      
      
            LucasWilkinson
  wants to merge
  325
  commits into
  sage/dbo-full-cudagraphs
  
    
      
        
          
  
    
      Choose a base branch
      
     
    
      
        
      
      
        
          
          
        
        
          
            
              
              
              
  
           
        
        
          
            
              
              
           
        
       
     
  
        
          
            
          
            
          
        
       
    
      
from
lwilkinson/dbo-prefill
  
      
      
   
  
    
  
  
  
 
  
      
    base: sage/dbo-full-cudagraphs
Could not load branches
            
              
  
    Branch not found: {{ refName }}
  
            
                
      Loading
              
            Could not load tags
            
            
              Nothing to show
            
              
  
            
                
      Loading
              
            Are you sure you want to change the base?
            Some commits from the old base branch may be removed from the timeline,
            and old review comments may become outdated.
          
          
                
     Draft
            
            
          Conversation
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
    …#25065) Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Daniel Afrimi <[email protected]> Co-authored-by: root <[email protected]>
…vllm-project#25046) Signed-off-by: jiang1.li <[email protected]>
Signed-off-by: Aidyn-A <[email protected]>
Signed-off-by: Dylan Maloy <[email protected]> Co-authored-by: Jee Jee Li <[email protected]>
…mentation. (vllm-project#24957) Signed-off-by: Tao He <[email protected]>
…d warning. (vllm-project#25010) Signed-off-by: samzong <[email protected]>
Signed-off-by: Matthew Bonanni <[email protected]> Signed-off-by: Matthew Bonanni <[email protected]>
…project#24970) Signed-off-by: samzong <[email protected]>
Signed-off-by: mgoin <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]> Co-authored-by: Jee Jee Li <[email protected]>
) Signed-off-by: Woosuk Kwon <[email protected]>
Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: Woosuk Kwon <[email protected]>
) Signed-off-by: Woosuk Kwon <[email protected]>
…nizer refactor (vllm-project#25086) Signed-off-by: mgoin <[email protected]>
Signed-off-by: Andrew Feldman <[email protected]> Signed-off-by: afeldman-nm <[email protected]> Co-authored-by: Joseph Marinier <[email protected]>
…#25041) Signed-off-by: ApostaC <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: Mohammad Miadh Angkad <[email protected]>
Signed-off-by: ahao-anyscale <[email protected]>
Signed-off-by: czhu-cohere <[email protected]>
Signed-off-by: Andrew Xia <[email protected]>
Signed-off-by: Alexander Matveev <[email protected]> Co-authored-by: Michael Goin <[email protected]>
Signed-off-by: Doug Lehr <[email protected]> Co-authored-by: Doug Lehr <[email protected]>
Signed-off-by: mgoin <[email protected]>
…llm-project#24600) Signed-off-by: elvischenv <[email protected]> Co-authored-by: Michael Goin <[email protected]>
Signed-off-by: Karan Goel <[email protected]> Signed-off-by: Karan Goel <[email protected]> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
…ough headers (vllm-project#24628) Signed-off-by: Alec Solder <[email protected]> Signed-off-by: Alec S <[email protected]> Co-authored-by: Alec Solder <[email protected]> Co-authored-by: Ye (Charlotte) Qi <[email protected]>
Signed-off-by: Luka Govedič <[email protected]>
Signed-off-by: Russell Bryant <[email protected]>
Signed-off-by: NickLucche <[email protected]>
Signed-off-by: Matthew Bonanni <[email protected]> Co-authored-by: Robert Shaw <[email protected]> Co-authored-by: Chris Bamford <[email protected]>
…tput handling (vllm-project#25184) Signed-off-by: Alexander Matveev <[email protected]>
…24611) Signed-off-by: yewentao256 <[email protected]>
…t#25410) Signed-off-by: Isotr0py <[email protected]>
…#25409) Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: yewentao256 <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
Signed-off-by: liuye.hj <[email protected]> Co-authored-by: liuye.hj <[email protected]>
Signed-off-by: Kunshang Ji <[email protected]>
Signed-off-by: Lu Fang <[email protected]> Signed-off-by: Lucia Fang <[email protected]> Co-authored-by: Lucia (Lu) Fang <[email protected]>
…project#25414) Signed-off-by: Michael Goin <[email protected]>
Signed-off-by: windsonsea <[email protected]>
…project#24588) Signed-off-by: Varun Sundar Rabindranath <[email protected]> Co-authored-by: Varun Sundar Rabindranath <[email protected]>
…er nixl_backend (vllm-project#25121) Signed-off-by: Chendi Xue <[email protected]>
Signed-off-by: DarkLight1337 <[email protected]>
Signed-off-by: Ming Yang <[email protected]>
…ect#25028) Signed-off-by: Zhikaiiii <[email protected]>
…5459) Signed-off-by: DarkLight1337 <[email protected]>
…oject#25203) Signed-off-by: Andreas Hartel <[email protected]>
Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Jee Jee Li <[email protected]>
Signed-off-by: vllmellm <[email protected]>
…ject#25471) Signed-off-by: Isotr0py <[email protected]>
Signed-off-by: Tyler Michael Smith <[email protected]>
Signed-off-by: Lucas Wilkinson <[email protected]>
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
      
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
Purpose
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.