Native llama.cpp bindings for the Bare runtime, enabling efficient large language model inference from JavaScript.
This is nowhere near ready to use.
If you want to use bare-llama, be prepared to join the development team 😏
This is the beginning of an addon for the Bare JavaScript runtime, which means it can be used with Pear, a runtime for building peer-to-peer applications.
Bare is the core runtime of Pear.
Node.js support is possible; Deno and Bun probably not. But maybe! Who knows?! If you know, let me know.
import { LlamaModel } from 'bare-llama'
// Create and initialize a text generation model
const model = await LlamaModel.create({
  modelFilepath: './models/model.gguf'
})
// Generate text
const result = await model.generate('The quick brown fox')
console.log(result)
// Clean up
await model.destroy()
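Model instances hold native llama.cpp resources, so it can be worth making sure destroy() runs even if generation throws. A minimal sketch of that pattern, using only the API shown above:

const model = await LlamaModel.create({
  modelFilepath: './models/model.gguf'
})

try {
  console.log(await model.generate('The quick brown fox'))
} finally {
  // Release the native resources even if generate() throws
  await model.destroy()
}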
Generate text:
// Create a new model instance
const model = await LlamaModel.create({
  modelFilepath: './path/to/model.gguf',
  embedding: false // false by default; set to true for embedding models
})
// Generate text
const generated = await model.generate('Once upon a time', {
  temperature: 0.8,
  maxTokens: 100
})
await model.destroy()
Create embeddings:
const model = await LlamaModel.create({
  modelFilepath: './path/to/embeddings-model.gguf',
  embedding: true
})
// Get embeddings (requires embedding: true)
const embeddings = await model.encode('Hello world')
await model.destroy()
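A common use for embeddings is measuring semantic similarity. Here's a minimal sketch; it assumes encode() resolves to a flat numeric vector (e.g. a Float32Array), which is an assumption, so check the actual return shape before relying on it:

// Cosine similarity: closer to 1 means more semantically similar
function cosineSimilarity (a, b) {
  let dot = 0
  let normA = 0
  let normB = 0
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i]
    normA += a[i] * a[i]
    normB += b[i] * b[i]
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB))
}

const model = await LlamaModel.create({
  modelFilepath: './path/to/embeddings-model.gguf',
  embedding: true
})

const a = await model.encode('Hello world')
const b = await model.encode('Hi there')
console.log(cosineSimilarity(a, b))

await model.destroy()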
Additional methods:
const model = await LlamaModel.create({
  modelFilepath: './path/to/embeddings-model.gguf',
  embedding: true
})
// Get model metadata
const metadata = await model.getMetadata()
// Tokenize text & detokenize tokens
const tokens = await model.tokenize('Hello world')
const text = await model.detokenize(tokens)
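// One practical use: count a prompt's tokens before generating, e.g. to stay
// within the model's context window (assumes tokenize() resolves to an
// array-like of token ids)
console.log(`'Hello world' is ${tokens.length} tokens`)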
await model.destroy()
You'll have to download models yourself!
The tests are currently set up to use a SmolLM GGUF model: https://huggingface.co/mradermacher/SmolLM-135M-Instruct-GGUF
Apache-2.0