Support vision models and function calling #8

DePasqualeOrg · 2024-12-11T22:33:40Z

I've added functionality from the huggingface.js implementation. This is a work in progress.

johnmai-dev · 2024-12-12T03:28:47Z

Thank you for your PR, it's great! Could you please provide some test cases?

I may not have time until next month. I've been a bit busy lately.

Hi @pcuenca ! Do you have time to help review this PR?

DePasqualeOrg · 2024-12-12T08:34:50Z

I'll add some tests for images and function calling and try to polish this up a bit.

I should have also formatted the code before editing it, to make the changes more legible. After this gets merged, maybe we can add some auto-formatting.

pcuenca · 2024-12-12T19:24:58Z

Hi @DePasqualeOrg, thanks a lot for the effort! It's a long diff, I can try to take a look in a couple of days. Do we need everything at once, including namespaces, built-in functions and tool calling, or could this potentially be approached in a few phases?

DePasqualeOrg · 2024-12-29T15:01:15Z

I've rebased this after formatting the repo with swift-format, to make it easier for @pcuenca to review it.

DePasqualeOrg · 2024-12-29T23:13:50Z

The existing tests as well as some additional ones I added pass.

DePasqualeOrg · 2025-01-01T15:18:26Z

Four of the six tool use tests from the TypeScript implementation are now passing. It's quite difficult to get these tests to pass, because dictionaries in Swift don't preserve the order of the keys. I've used OrderedDictionary in the tests, which presents its own challenges, since sometimes these values need to be converted to JSON. I've disabled the two problematic tests and will focus on testing with more recent models. It already appears to work well with Llama 3.2.

I'll be testing this on vision models here: ml-explore/mlx-swift-examples#173

Tool use examples in LLMEval: ml-explore/mlx-swift-examples#174

DePasqualeOrg · 2025-01-13T17:18:05Z

Honestly, no, I don't think that's a good idea. I spent several whole days trying to get as close as possible to the Python Jinja2. I think that should be our reference point, not the TypeScript version. That is what is used for chat templates in Python.

If you have tests from your implementation of the TypeScript version, you can share them and we can see if they pass on my branch. You can take my tests and see if they pass on your branch.

It was an enormous amount of work to get this working, and I would be very disappointed if it gets thrown away.

Also, to prevent duplication of work in the future, I would suggest opening a draft pull request here when you're working on something, so that your work isn't duplicated by someone else, which would be a waste of their time.

johnmai-dev · 2025-01-13T17:37:01Z

I'm very sorry, but it won't be thrown away. I think the ported Jinja2 still needs further inspection and testing, so I want to split some of Jinja2's code into smaller modules for iterative updates.

Secondly, the original intention of Swift Jinja is to replicate TypeScript Jinja; I hope to keep it synchronized with the TypeScript Jinja version and then port Jinja2 on this basis.

Replicating first is the fastest method, and there are significant design differences between Swift Jinja and Jinja2. If we truly want to replicate Jinja2, a refactor of Swift Jinja might be necessary.

DePasqualeOrg · 2025-01-13T17:46:23Z

I really don't know what you have in mind. What exactly do you want to separate out from my contribution? Is there any functionality in your branch that isn't reproduced in mine?

Please also point out specific differences between the TypeScript version and the Python Jinja2 that you think are important.

My pull request has been open for more than a month, and only now am I learning about your parallel effort, which you didn't share here. Since no progress has been made for several months, I decided to take the initiative and make chat templates for vision models and function calling work. Now they're working.

If you're worried about correctness, I can remove the filters for which I disabled tests because fully implementing them would be too complex.

However, considering the huge effort required to make this all work, I don't think we're going to be able to merge your version and mine. Since I was the first to share this here, I think we should use my branch as the basis for further work. I'm happy to remove anything that you think is not ready for production (please provide tests that demonstrate it's not correct), and to add any functionality from your branch that is missing in mine.

The reason I feel confident about my work is that I've already covered a large part of the tests from the Python implementation, and they're passing.

DePasqualeOrg · 2025-01-13T22:30:52Z

I've reviewed my PR, and there are no major architectural changes here. Therefore, I don't understand what you mean about refactoring. Furthermore, I'd like to emphasize that I first ported functionality from the TypeScript implementation and then (after almost entirely covering the TypeScript implementation) added missing functionality (mainly the filters and tests) from Jinja2 in Python. This PR is still comparable to the TypeScript implementation.

I'm hoping we can move forward with this efficiently, because I want to start building actual features in apps using function calling and vision models instead of getting bogged down with this library.

johnmai-dev · 2025-01-14T01:45:53Z

Ok.

Before merging this PR, you still need to resolve the previous review feedback. @DePasqualeOrg

Additionally, I hope @pcuenca can also participate in the review since swift-transformers is mainly being used at present.

Tests/Templates/ToolSpecs.swift

Python/test-chat-template.ipynb

Sources/Parser.swift

Sources/Ast.swift

johnmai-dev

Thank you very much @DePasqualeOrg!
I have reviewed it and think it can be merged at any time.

But we still need @pcuenca to help review it again.

DePasqualeOrg · 2025-01-19T19:16:12Z

@pcuenca, could you let us know if you want to review this? If you don't have time, I think we should merge it so that I can move ahead with huggingface/swift-transformers#151, ml-explore/mlx-swift-examples#174, and ml-explore/mlx-swift-examples#173.

pcuenca

I haven't had the chance to test it in depth, but I'm supportive of merging to unblock downstream uses, as long as the repo owner is happy with the changes. In my opinion, please follow the lead of @johnmai-dev here.

There appears to be a failing test case, but it seems related to dictionary ordering. I would suggest to write test cases in such a way that ordering does not impact results, but we can do that in a different PR.

pcuenca · 2025-01-21T22:41:10Z

Package.resolved

Is it necessary to commit this file?

I'm not sure. It was created after I added swift-collections. Are lock files usually included for dependencies in Swift packages?

I don't think so, Xcode should resolve the dependencies when you compile. I'd suggest to remove.

It's also a bit unfortunate that another dependency had to be added, is it just to guarantee reproducibility for tests? If so, we should be able to move it to the test target (but we can do it later).

swift-collections is necessary for the reproducibility of some tests, and is also needed in the library itself. I'm not sure how else you would test stringification of dictionaries if you don't have a deterministic key order.

I see that mlx-swift has a Package.resolved file, and Claude says it's usually included in Swift packages, for what it's worth.

Apple recommend committing the Package.resolved.

https://developer.apple.com/documentation/xcode/adding-package-dependencies-to-your-app#Coordinate-package-versions-across-your-team

https://developer.apple.com/documentation/xcode/making-dependencies-available-to-xcode-cloud#Use-Swift-package-dependencies-and-Git-submodules

Sources/Parser.swift

johnmai-dev · 2025-01-22T02:05:01Z

Are there any other commits that need to be pushed? If not, I think we can merge. @DePasqualeOrg

awni · 2025-01-22T05:52:05Z

🚀 this should unblock the Deep Seek reasoning chat templates so looking forward to getting it landed!

DePasqualeOrg · 2025-01-22T07:42:06Z

I have nothing more to add, so you can merge this, @johnmai-dev.

johnmai-dev · 2025-01-22T07:51:48Z

Thank you for your efforts! @DePasqualeOrg
Jinja Swift 1.1.0 Released!

alelordelo · 2025-01-22T17:35:27Z

@DePasqualeOrg , thanks for this awesome contribution!

I am using FullMoon , a super clean swiftUI client, build with MLX and Jinja.
https://github.com/mainframecomputer/

Can you give some instructions on how can we parse a function call with the new Jinja 1.1?

DePasqualeOrg · 2025-01-22T22:38:55Z

@alelordelo, when the model responds with the function call, it's up to the app to handle it. Jinja is not involved at all at that point.

DePasqualeOrg marked this pull request as draft December 12, 2024 08:35

johnmai-dev linked an issue Dec 12, 2024 that may be closed by this pull request

Parse Llama tool calls? #6

Closed

DePasqualeOrg marked this pull request as ready for review December 12, 2024 21:46

DePasqualeOrg mentioned this pull request Dec 26, 2024

Fix formatting #9

Closed

DePasqualeOrg force-pushed the add-functionality branch 3 times, most recently from 9a43ea0 to 1c85539 Compare December 29, 2024 14:34

Disable AlwaysUseLowerCamelCase in .swift-format

9d79438

DePasqualeOrg force-pushed the add-functionality branch 2 times, most recently from 8365aec to 0fdf32f Compare December 29, 2024 14:57

DePasqualeOrg force-pushed the add-functionality branch from 0fdf32f to 16375f3 Compare December 29, 2024 18:15

DePasqualeOrg marked this pull request as draft December 29, 2024 19:09

DePasqualeOrg force-pushed the add-functionality branch 2 times, most recently from 5b4472f to a60b6b1 Compare December 29, 2024 23:12

DePasqualeOrg marked this pull request as ready for review December 29, 2024 23:13

DePasqualeOrg marked this pull request as draft December 30, 2024 08:06

DePasqualeOrg force-pushed the add-functionality branch 6 times, most recently from a982715 to 0e65018 Compare January 1, 2025 13:00

DePasqualeOrg force-pushed the add-functionality branch 2 times, most recently from 7c0b020 to 9eb074b Compare January 1, 2025 16:18

johnmai-dev requested a review from pcuenca January 14, 2025 01:46

johnmai-dev reviewed Jan 14, 2025

View reviewed changes

Tests/Templates/ToolSpecs.swift Show resolved Hide resolved

Python/test-chat-template.ipynb Outdated Show resolved Hide resolved

Sources/Parser.swift Show resolved Hide resolved

Sources/Ast.swift Outdated Show resolved Hide resolved

DePasqualeOrg added 2 commits January 14, 2025 09:17

Rename ifCondition in For to test

b2b5aa3

Remove Python

06804d1

DePasqualeOrg force-pushed the add-functionality branch from c6c893b to 06804d1 Compare January 14, 2025 08:25

johnmai-dev self-requested a review January 14, 2025 08:48

johnmai-dev approved these changes Jan 14, 2025

View reviewed changes

johnmai-dev linked an issue Jan 14, 2025 that may be closed by this pull request

Handle vision language model chat templates #7

Closed

johnmai-dev added the enhancement New feature or request label Jan 14, 2025

DePasqualeOrg mentioned this pull request Jan 21, 2025

Chat template not working for DeepSeek Qwen Distill model ml-explore/mlx-swift-examples#181

Closed

Update .pre-commit-config.yaml

e9822c2

pcuenca approved these changes Jan 21, 2025

View reviewed changes

Handle DeepSeek R1 Qwen chat template

30edf9f

DePasqualeOrg force-pushed the add-functionality branch from 83e6773 to 30edf9f Compare January 21, 2025 22:46

johnmai-dev reviewed Jan 22, 2025

View reviewed changes

Sources/Parser.swift Show resolved Hide resolved

johnmai-dev linked an issue Jan 22, 2025 that may be closed by this pull request

Parse error on DeepSeek R1 chat template #12

Closed

johnmai-dev merged commit 9c0bbbc into johnmai-dev:main Jan 22, 2025
2 checks passed

alelordelo mentioned this pull request Jan 22, 2025

Function call mainframecomputer/fullmoon-ios#15

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support vision models and function calling #8

Support vision models and function calling #8

DePasqualeOrg commented Dec 11, 2024 •

edited

Loading

johnmai-dev commented Dec 12, 2024

DePasqualeOrg commented Dec 12, 2024 •

edited

Loading

pcuenca commented Dec 12, 2024

DePasqualeOrg commented Dec 29, 2024 •

edited

Loading

DePasqualeOrg commented Dec 29, 2024 •

edited

Loading

DePasqualeOrg commented Jan 1, 2025 •

edited

Loading

DePasqualeOrg commented Jan 13, 2025 •

edited

Loading

johnmai-dev commented Jan 13, 2025

DePasqualeOrg commented Jan 13, 2025 •

edited

Loading

DePasqualeOrg commented Jan 13, 2025

johnmai-dev commented Jan 14, 2025

johnmai-dev left a comment

DePasqualeOrg commented Jan 19, 2025

pcuenca left a comment

pcuenca Jan 21, 2025

DePasqualeOrg Jan 21, 2025

pcuenca Jan 21, 2025

DePasqualeOrg Jan 21, 2025

johnmai-dev Jan 22, 2025

johnmai-dev commented Jan 22, 2025

awni commented Jan 22, 2025

DePasqualeOrg commented Jan 22, 2025

johnmai-dev commented Jan 22, 2025

alelordelo commented Jan 22, 2025

DePasqualeOrg commented Jan 22, 2025

Support vision models and function calling #8

Support vision models and function calling #8

Conversation

DePasqualeOrg commented Dec 11, 2024 • edited Loading

johnmai-dev commented Dec 12, 2024

DePasqualeOrg commented Dec 12, 2024 • edited Loading

pcuenca commented Dec 12, 2024

DePasqualeOrg commented Dec 29, 2024 • edited Loading

DePasqualeOrg commented Dec 29, 2024 • edited Loading

DePasqualeOrg commented Jan 1, 2025 • edited Loading

DePasqualeOrg commented Jan 13, 2025 • edited Loading

johnmai-dev commented Jan 13, 2025

DePasqualeOrg commented Jan 13, 2025 • edited Loading

DePasqualeOrg commented Jan 13, 2025

johnmai-dev commented Jan 14, 2025

johnmai-dev left a comment

Choose a reason for hiding this comment

DePasqualeOrg commented Jan 19, 2025

pcuenca left a comment

Choose a reason for hiding this comment

pcuenca Jan 21, 2025

Choose a reason for hiding this comment

DePasqualeOrg Jan 21, 2025

Choose a reason for hiding this comment

pcuenca Jan 21, 2025

Choose a reason for hiding this comment

DePasqualeOrg Jan 21, 2025

Choose a reason for hiding this comment

johnmai-dev Jan 22, 2025

Choose a reason for hiding this comment

johnmai-dev commented Jan 22, 2025

awni commented Jan 22, 2025

DePasqualeOrg commented Jan 22, 2025

johnmai-dev commented Jan 22, 2025

alelordelo commented Jan 22, 2025

DePasqualeOrg commented Jan 22, 2025

DePasqualeOrg commented Dec 11, 2024 •

edited

Loading

DePasqualeOrg commented Dec 12, 2024 •

edited

Loading

DePasqualeOrg commented Dec 29, 2024 •

edited

Loading

DePasqualeOrg commented Dec 29, 2024 •

edited

Loading

DePasqualeOrg commented Jan 1, 2025 •

edited

Loading

DePasqualeOrg commented Jan 13, 2025 •

edited

Loading

DePasqualeOrg commented Jan 13, 2025 •

edited

Loading