Skip to content

v0.0.2

Latest
Compare
Choose a tag to compare
@github-actions github-actions released this 08 Dec 08:41
  • Clear previous chat messages when LLMInference::load_model is called
  • Allow rendering non-ASCII characters on the chat interface generated by the LLMs/SLMs
  • Show token generation speed (in tokens/second) for the latest message in the chat interface (#1)