Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
lvhan028 committed Sep 28, 2023
1 parent 7faed16 commit 9f17d22
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/turbomind/triton_backend/llama/LlamaTritonModel.h
Original file line number Diff line number Diff line change
Expand Up @@ -109,7 +109,7 @@ struct LlamaTritonModel: public AbstractTransformerModel {
std::shared_ptr<typename ft::LlamaV2<T>::SharedState> shared_state_;

std::vector<std::shared_ptr<LlamaTritonSharedModelInstance<T>>> shared_instances_;
std::deque<std::mutex> shared_mutexes_; // is locking really needed?
std::deque<std::mutex> shared_mutexes_; // is locking really needed?

bool is_fp16_;
int enable_custom_all_reduce_ = 0;
Expand Down

0 comments on commit 9f17d22

Please sign in to comment.