Releases · SciSharp/LLamaSharp

19 Oct 22:13

martindevans

v0.18.0

40ea046

v0.18.0 Latest

Latest

Major Changes

Split platform-specific binaries for NuGet backends by @m0nsky in #957

Other Changes

Updates to ContributingGuide for latest llama.cpp repo by @scritch1sm in #953
Fix README chat session example by @easis in #956

New Contributors

@easis made their first contribution in #956

Full Changelog: v0.17.0...v0.18.0

Contributors

m0nsky, easis, and scritch1sm

Assets 2

13 Oct 17:02

martindevans

v0.17.0

cd9a044

v0.17.0

Important: The CUDA packages for 0.17.0 exceeded the maximum size for a nuget package. This means some of the 0.17.0 packages are not available until a new way is to deploy those packages. If you need one of the missing packages, use 0.16.0 instead.

Major Changes

October binary update by @martindevans in #940

Bug Fixes

Fix: Increase Vulkan detection timeout to 12 seconds by @LSXPrime in #931
Specify file open permissions needed as read only. by @scritch1sm in #943
Fix llama_get_logits_ith Null Handling by @martindevans in #920
Fix CUDA compile error by @martindevans in #924

Other Changes

Add LLama2 Chat Session example with a custom templator by @asmirnov82 in #938

New Contributors

@LSXPrime made their first contribution in #931
@scritch1sm made their first contribution in #943

Full Changelog: v0.16.0...v0.17.0

Contributors

martindevans, asmirnov82, and 2 other contributors

Assets 2

01 Sep 19:47

martindevans

v0.16.0

26a7114

v0.16.0

Major Changes

August 2024 Binary Update by @martindevans in #905
LLamaEmbedder 2.0 by @martindevans in #902

Bug Fixes

Fixed constrained output (Grammar) support in BatchedExecutorGuidance by @webitube in #904
Fixed Issue where Model with Grammar constrains only generates one output in Example by @L-Wehmschulte in #908

Other Changes

Remove OpenCL & add Vulkan to readme by @m0nsky in #907
Bump Swashbuckle.AspNetCore from 6.6.2 to 6.7.3 by @dependabot in #912
Bump BenchmarkDotNet from 0.13.12 to 0.14.0 by @dependabot in #913
Bump Microsoft.NET.Test.Sdk from 17.10.0 to 17.11.0 by @dependabot in #914
Bump Microsoft.AspNetCore.Mvc.Razor.RuntimeCompilation from 8.0.7 to 8.0.8 by @dependabot in #916
Bump Microsoft.KernelMemory.Abstractions and Microsoft.SemanticKernel.Abstractions by @dependabot in #915

New Contributors

@webitube made their first contribution in #904
@L-Wehmschulte made their first contribution in #908

Full Changelog: v0.15.0...v0.16.0

Contributors

martindevans, webitube, and 3 other contributors

Assets 2

03 Aug 15:47

martindevans

v0.15.0

d9e0d26

v0.15.0 - Llama 3.1

Major Changes

Llama 3.1 update binaries by @martindevans in #874
Microsoft.KernelMemory version 0.68+ compatibility fix by @SpaceAntelope in #862

Bug Fixes

Fixed missing [DllImport] for llama_encode by @martindevans in #864
Safer Vulkan System Info by @martindevans in #885
Fixed LLamaTimings by @martindevans in #881

Other Changes

Fixed Incorrect Exception Docs Comments by @martindevans in #883
Deprecate InferenceParams sampler configuration properties by @martindevans in #871
Updated Semantic Kernel and Kernel Memory by @martindevans in #880
Update README.md by @sangyuxiaowu in #872

New Contributors

@SpaceAntelope made their first contribution in #862

Full Changelog: v0.14.0...v0.15.0

Contributors

martindevans, SpaceAntelope, and sangyuxiaowu

Assets 2

16 Jul 12:38

martindevans

v0.14.0

d685588

0.14.0 - Gemma2

Major Changes

July 2024 binaries by @martindevans, @SignalRT & @m0nsky in #833, #843
Vulkan support by @m0nsky in #794, #797, #799
Generic Prompt Formatter by @patrick-hovsepian in #787
Async implementation of LLamaExecutors by @asmirnov82 in #834
Include debug symbols with NuGet packages by @dpmm99 in #851
Allow continuation in Instruct and Interact executors by @dpmm99 in #852

Bug Fixes

Fixes Typo in IContextParamsExtensions by @SerialKicked in #806
Fix possible file locks due to race conditions during build & test by @m0nsky in #836
Fixed CI Out Of Disk Space by @martindevans in #846
Fix CUDA detection on WSL by @m0nsky in #847

Other Changes

Remove Beam Search by @martindevans in #783
Batched Beam Search by @martindevans in #796
Remove OpenCL Backend. by @martindevans in #782
BoolQ Benchmark by @martindevans in #802

New Contributors

@patrick-hovsepian made their first contribution in #787
@SerialKicked made their first contribution in #806
@dpmm99 made their first contribution in #851

Full Changelog: v0.13.0...v0.14.0

Contributors

martindevans, SerialKicked, and 5 other contributors

Assets 2

0 Join discussion

04 Jun 12:46

martindevans

v0.13.0

fdcc035

0.13.0

Major Changes

June 2024 Binary Update by @martindevans in #751
Add Rosetta2 Binaries by @abhiaagarwal in #755
Optimize the native library loading. by @AsakusaRinne in #688
Add experimental auto-download support. by @AsakusaRinne in #692
Upload native libraries to huggingface when release new version. by @AsakusaRinne in #749
Batch Queueing by @martindevans in #761
BatchedExecutor LLava by @martindevans in #770
StreamingTextDecoder Special Token Support by @martindevans in #777

Documentation

Updated READMEs. by @Lamothe in #742
Add documentation for the change of native library loading. by @AsakusaRinne in #747
Add OAI demo link by @sangyuxiaowu in #778

Bug Fixes

KernelMemory bug fix by @zsogitbe in #726
Fixes for website when running on case-sensitive filesystems. by @Lamothe in #753

Other Changes

Exposed basic timing information from llama.cpp by @martindevans in #736
Less Sampler Allocations by @martindevans in #735
Several updates to web project by @Lamothe in #718
Conversation Generate All Logits by @martindevans in #743
Removed .NET 7 now that it's reached EOL. by @Lamothe in #741
BatchedExecutor Double Buffering by @martindevans in #748
Minor BatchedExecutor Example Improvements by @martindevans in #750
LLava API Improvements by @martindevans in #757
SemanticKernel HomeAutomation Example by @zsogitbe in #769

New Contributors

@Lamothe made their first contribution in #718
@sangyuxiaowu made their first contribution in #778
@abhiaagarwal made their first contribution in #755

Full Changelog: v0.12.0...v0.13.0

Contributors

martindevans, Lamothe, and 4 other contributors

Assets 2

12 May 17:04

martindevans

v0.12.0

9a6e8b5

0.12.0 - LLama3

Major Changes

May 2024 Binary Update (Take 2) by @martindevans in #712
Logging interceptor by @martindevans in #649
fix: add cuda llava native libraries. by @AsakusaRinne in #647
Added Whisper.net x LLamaSharp examples for Speech Chat by @Lyrcaxis in #656
Extension LLava with in memory images by @zsogitbe in #653
Llava Initial approach to clear images by @SignalRT in #664
BatchedExecutor Save/Load by @martindevans in #681
Interruptible Async Model Loading With Progress Monitoring by @martindevans in #702
LLava Async Loading by @martindevans in #703
Optional IHistoryTransform added to ChatSession.InitializeSessionFromHistoryAsync by @Norne9 in #711
Implement context shifting in executor base by @ksanman in #714
Llama Text Templater by @martindevans in #715
Make LLamaKvCacheView Safe by @martindevans in #721

Documentation

Removed ContextSize from most examples by @martindevans in #663
Add issue templates. by @AsakusaRinne in #700
Add LLaMA3 chat session example. by @AsakusaRinne in #708
Improved Example Docs by @martindevans in #728

Bug Fixes

fix: llava backend ignores avx and cuda. by @AsakusaRinne in #642
Embeddings correction by @zsogitbe in #674
Fix context params defaults by @dlyz in #729
Made special tokens included in prompts tokenize as intended by @Lyrcaxis in #677
Fixed Minor Issues With Model Loading by @martindevans in #697
Empty batch check by @martindevans in #691
Optimize Semantic Kernel LLamaSharpChatCompletion when running with StatefulExecutorBase models by @kidkych in #671
SemanticKernel: Correcting non-standard way of working with PromptExecutionSettings by @zsogitbe in #689
Fix cublas build action by @martindevans in #725

Other Changes

build(deps): bump System.Text.Json from 8.0.2 to 8.0.3 by @dependabot in #636
build(deps): bump Microsoft.AspNetCore.OpenApi from 8.0.2 to 8.0.3 by @dependabot in #638
build(deps): bump Microsoft.Extensions.Logging.Abstractions from 8.0.0 to 8.0.1 by @dependabot in #637
April 2024 Binary Update by @martindevans in #662
Slightly Safer Quantize Params by @martindevans in #698
Removed Unnecessary Constructor From Safe Handles by @martindevans in #696
ci: add workflow to check the spellings. by @AsakusaRinne in #710
ci: add a workflow to check code format. by @AsakusaRinne in #709
Remove Conversation.Prompt(String) by @martindevans in #719
ci: add benchmark test. by @AsakusaRinne in #720

New Contributors

@Lyrcaxis made their first contribution in #656
@Norne9 made their first contribution in #711
@ksanman made their first contribution in #714
@dlyz made their first contribution in #729
@eltociear made their first contribution in #730

Full Changelog: 0.11.1...v0.12.0

Contributors

martindevans, Norne9, and 9 other contributors

Assets 2

01 Apr 00:30

martindevans

0.11.1

3bc952c

0.11.1 - LLaVA support

🎏 Major Changes

LLaVA Support by @SignalRT in #556, #563, #609
Chat session state management by @eublefar in #560
Classifier Free Guidance by @martindevans in #536
March Binary Update by @martindevans in #565
SetDllImportResolver based loading by @martindevans in #603

📖 Documentation

The documentation has been improved and no longer outdated, see LLamaSharp Documentation.

🔧 Bug Fixes

Added conditional compilation code to progress_callback (in LlamaModelParams struct) by @clovisribeiro in #593
Memory Disposal Tests by @martindevans in #551
Fixed Publish File paths by @martindevans in #561
llama_decode lock by @martindevans in #595
BatchedExecutor Fixed Forking by @martindevans in #621
Fixed off by one error in LLamaBatch sampling position by @martindevans in #626
[LLama.KernelMemory] Fixed System.ArgumentException: EmbeddingMode must be true & #617 by @ChengYen-Tang in #615
fix: the missing of llava_shared library. by @AsakusaRinne in #633

📌 Other Changes

Removed llama_eval() by @martindevans in #553
ChatSession: improve exception message by @swharden in #523
Improve "embeddings" example by @swharden in #525
Add path to find llama.dll for MAUI by @evolcano in #631
LLama.Examples: improve model path prompt by @swharden in #526
NativeLibraryConfig.WithLogs() overload to set log level by @swharden in #529
LLamaSharp.Examples: Document Q&A with local storage by @swharden in #532
Used AnsiConsole in a few more places by @martindevans in #534
ReadOnlySpan<float> in ISamplingPipeline by @martindevans in #538
KernelMemory update with adding the use of already loaded model by @zsogitbe in #630
Add Link To Blazor Demo by @alexhiggins732 in #539
Removed Obsolete SamplingApi by @martindevans in #552
update readme.md backends by @warquys in #587
docs: update the example in readme. by @AsakusaRinne in #604
Update Semantic Kernel & Kernel Memory Package by @xbotter in #612
BatchedExecutor.Create() method by @martindevans in #613
LLamaBatch Logit Tracking by @martindevans in #624

🙌 New Contributors

@swharden made their first contribution in #523
@alexhiggins732 made their first contribution in #539
@clovisribeiro made their first contribution in #593
@warquys made their first contribution in #587
@eublefar made their first contribution in #560
@ChengYen-Tang made their first contribution in #615
@evolcano made their first contribution in #631

Full Changelog: v0.10.0...0.11.0

Contributors

martindevans, clovisribeiro, and 10 other contributors

Assets 2

15 Feb 14:53

martindevans

v0.10.0

9bc129e

0.10.0 - Phi2

Major Changes

Update binaries feb 2024 by @martindevans in #479
Add CLBLAST native library to native libraries build by @jasoncouture in #468
Introduced a new BatchedExecutor by @martindevans in #503
Swapped StatelessExecutor to use llama_decode! by @martindevans in #445
LLamaToken Struct by @martindevans in #404

Bug Fixes

KernelMemory EmbeddingMode bug correction by @zsogitbe in #485
Normalize Embeddings by @martindevans in #507
StreamingTextDecoder Fix & Tests by @martindevans in #428
Tokenizer Fixes For Issue 430 by @martindevans in #433

Other Changes

Use llama instead of libllama in [DllImport] by @jasoncouture in #465
Updated Examples by @vikramvee in #502
Added new file types to quantisation by @martindevans in #495
Smaller Unit Test Model by @martindevans in #496
Using AddRange in LLamaEmbedder by @martindevans in #499
Small KV Cache Handling Improvements by @martindevans in #500
Added increment and decrement operators to LLamaPos by @martindevans in #501
Swapped GetEmbeddings to llama_decode by @martindevans in #474
kv_cache_instance_methods by @martindevans in #454
Removed IModelParams and IContextParams setters. by @martindevans in #472
Managed LLamaBatch by @martindevans in #442
Check Model Path Exists by @martindevans in #437
Model Metadata Loading Cleanup by @martindevans in #438
Added a check for EOS token in LLamaStatelessExecutor by @martindevans in #434
Update README.md by @Oceania2018 in #427
Gpu layer count change by @Kaotic3 in #424
Improved exceptions in IModelParams for unknown KV override types. by @martindevans in #416

New Contributors

@Kaotic3 made their first contribution in #424
@Oceania2018 made their first contribution in #427
@jasoncouture made their first contribution in #465
@zsogitbe made their first contribution in #485
@vikramvee made their first contribution in #502

Full Changelog: 0.9.1...v0.10.0

Contributors

jasoncouture, martindevans, and 4 other contributors

Assets 9

0 Join discussion

06 Jan 15:23

martindevans

0.9.1

d74e6c7

0.9.1 - Mixtral!

Major Changes

Rebuilt ChatSession class by @philippjbauer in #344
Custom Sampling Pipelines by @martindevans in #348
Updated Binaries December 2023 by @martindevans in #361
Added LLamaWeights.Metadata property by @martindevans in #380

Bug Fixes

Fix documentation to reflect changes in ChatSession API by @asmirnov82 in #366
Added missing field to LLamaModelQuantizeParams by @martindevans in #367
Fix broken references in docs by @asmirnov82 in #378
Updated & Fixed WebAPI by @scotmcc in #377
Fixed loading of very large metadata values by @martindevans in #384
Update compile.yml to fix not building for windows by @edgett in #386
Metadata Fixes by @martindevans in #385
Fix typos in SemanticKernel README file by @asmirnov82 in #408

Other Changes

Context Set Seed by @martindevans in #368
Update README.md by @martindevans in #335
ci: fix error in auto-release. by @AsakusaRinne in #334
Update README.md by @markvantilburg in #339
🔧 Refactor Semantic Kernel chat completion implementation by @xbotter in #341
build(deps): bump xunit.runner.visualstudio from 2.5.4 to 2.5.5 by @dependabot in #353
build(deps): bump xunit from 2.6.2 to 2.6.3 by @dependabot in #352
Added AVX and AVX2 to MacOS x86_64 builds by @martindevans in #360
Upgrade unittest target framework to .NET 8.0 by @xbotter in #358
Clone Grammar by @martindevans in #370
Renamed llama_sample_temperature to llama_sample_temp by @martindevans in #369
Reset Custom Sampling Pipeline by @martindevans in #372
Improved support for AVX512 by @martindevans in #373
bump sk to 1.0.1 & km to 0.18 by @xbotter in #356
build(deps): bump xunit from 2.6.3 to 2.6.4 by @dependabot in #389
build(deps): bump xunit.runner.visualstudio from 2.5.5 to 2.5.6 by @dependabot in #391
build(deps): bump Swashbuckle.AspNetCore from 6.4.0 to 6.5.0 by @dependabot in #388
build(deps): bump Microsoft.KernelMemory.Abstractions from 0.18.231209.1-preview to 0.24.231228.5 by @dependabot in #397
build(deps): bump Microsoft.KernelMemory.Core and Microsoft.KernelMemory.Abstractions by @dependabot in #396
Code cleanup driven by R# suggestions by @martindevans in #400
Removed some unnecessary uses of unsafe by @martindevans in #401
Safer Model Handle Creation by @martindevans in #402
Extra ModelParams Checking by @martindevans in #403

New Contributors

@markvantilburg made their first contribution in #339
@asmirnov82 made their first contribution in #366
@scotmcc made their first contribution in #377
@edgett made their first contribution in #386

Thank you so much for all the contributions! 😻

Full Changelog: v0.8.1...0.9.1

Contributors

martindevans, scotmcc, and 7 other contributors

Assets 8

Releases: SciSharp/LLamaSharp

v0.18.0

Major Changes

Other Changes

New Contributors

Contributors

v0.17.0

Major Changes

Bug Fixes

Other Changes

New Contributors

Contributors

v0.16.0

Major Changes

Bug Fixes

Other Changes

New Contributors

Contributors

v0.15.0 - Llama 3.1

Major Changes

Bug Fixes

Other Changes

New Contributors

Contributors

0.14.0 - Gemma2

Major Changes

Bug Fixes

Other Changes

New Contributors

Contributors

0.13.0

Major Changes

Documentation

Bug Fixes

Other Changes

New Contributors

Contributors

0.12.0 - LLama3

Major Changes

Documentation

Bug Fixes

Other Changes

New Contributors

Contributors

0.11.1 - LLaVA support

🎏 Major Changes

📖 Documentation

🔧 Bug Fixes

📌 Other Changes

🙌 New Contributors

Contributors

0.10.0 - Phi2

Major Changes

Bug Fixes

Other Changes

New Contributors

Contributors

0.9.1 - Mixtral!

Major Changes

Bug Fixes

Other Changes

New Contributors

Contributors