FIM strategy context improvements #2863

beatlevic · 2025-10-08T22:35:16Z

Branched from Test llm strategies #2859

Adds the following to the FIM strategy context:

Recent operations (both from the current file and global, cross-file)
ContextRanking (Jaccard similarity) for smarter context selection

…ecent operations.

… for FIM strategy

changeset-bot · 2025-10-08T22:35:19Z

⚠️ No Changeset found

Latest commit: 2569fef

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

markijbema · 2025-10-09T08:01:24Z

@beatlevic I changed the base so it is easier to review

markijbema · 2025-10-09T08:02:52Z

src/services/ghost/GhostDocumentStore.ts

 			}

+			// Analyze and track global operations if we have enough history
+			if (item.history.length >= 2) {


markijbema · 2025-10-09T08:07:48Z

src/services/ghost/GhostDocumentStore.ts

+	 * @param operation The operation to add
+	 * @param filepath The file where the operation occurred
+	 */
+	private addGlobalOperation(operation: UserAction, filepath: string): void {


this needs to respect kilocodeignore/gitignore, see #2852
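
A minimal sketch of what that guard could look like, assuming an ignore check is available as a plain predicate. `createGlobalOperationLog`, `TrackedOperation`, `IgnoreCheck`, and `maxEntries` are hypothetical names for illustration, not the PR's actual API; the real check would come from whatever #2852 introduces.

```typescript
// Hypothetical shape for illustration; the real UserAction type lives in the PR.
interface TrackedOperation {
	description: string
	filepath: string
}

// Assumption: the ignore rules are exposed as a simple predicate on the file path.
type IgnoreCheck = (filepath: string) => boolean

function createGlobalOperationLog(isIgnored: IgnoreCheck, maxEntries: number = 20) {
	const operations: TrackedOperation[] = []
	return {
		// Returns false when the operation was dropped because the file is ignored.
		add(description: string, filepath: string): boolean {
			if (isIgnored(filepath)) return false
			operations.push({ description, filepath })
			// Keep only the most recent entries so the log stays bounded.
			if (operations.length > maxEntries) operations.shift()
			return true
		},
		// Returns a copy so callers cannot mutate the internal log.
		list(): TrackedOperation[] {
			return operations.slice()
		},
	}
}
```

Checking at insertion time means ignored content never enters the cross-file history, so it cannot leak into a prompt later.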

markijbema · 2025-10-09T08:12:33Z

src/services/ghost/context/ContextRanking.ts

+ * Formula: |A ∩ B| / |A ∪ B|
+ * Where A and B are sets of symbols from each string
+ */
+export function jaccardSimilarity(a: string, b: string): number {


this feels like it could be slow for large files

markijbema · 2025-10-09T08:13:51Z

src/services/ghost/context/ContextRanking.ts

+	let intersection = 0
+	for (const symbol of aSet) {
+		if (bSet.has(symbol)) {
+			intersection++


Can't we use these?
https://viljams.medium.com/new-javascript-set-methods-are-here-b21e9a37bc4b

markijbema · 2025-10-09T08:25:27Z

src/services/ghost/context/ContextRanking.ts

+export function jaccardSimilarity(a: string, b: string): number {
+	const aSet = getSymbolsForSnippet(a)
+	const bSet = getSymbolsForSnippet(b)
+	const union = new Set([...aSet, ...bSet]).size


if we know the size of the intersection, we know the size of the union right? So this isn't necessary (and probably slow, especially since you're converting to an array in between)
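
For reference, the union size follows from inclusion-exclusion: |A ∪ B| = |A| + |B| - |A ∩ B|. A rough sketch of the function without the intermediate array and extra set is below. `symbolsOf` is a hypothetical stand-in for `getSymbolsForSnippet` (its real tokenization is not shown in this thread), and returning 0 for two empty inputs is an assumption.

```typescript
// Hypothetical stand-in for getSymbolsForSnippet: splits on non-identifier characters.
function symbolsOf(snippet: string): Set<string> {
	return new Set(snippet.split(/[^A-Za-z0-9_]+/).filter((s) => s.length > 0))
}

function jaccard(a: string, b: string): number {
	const aSet = symbolsOf(a)
	const bSet = symbolsOf(b)

	// Iterate the smaller set and probe the larger one.
	const small = aSet.size <= bSet.size ? aSet : bSet
	const large = aSet.size <= bSet.size ? bSet : aSet

	let intersection = 0
	small.forEach((symbol) => {
		if (large.has(symbol)) intersection++
	})

	// Inclusion-exclusion: no need to materialize the union.
	const union = aSet.size + bSet.size - intersection

	// Assumption: two empty inputs are treated as having no similarity.
	return union === 0 ? 0 : intersection / union
}
```

This keeps the work after tokenization proportional to the smaller set and allocates nothing beyond the two symbol sets.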

markijbema · 2025-10-09T08:30:16Z

src/services/ghost/strategies/FimCodestralStrategy.ts

+
+		// Get window around cursor for similarity comparison
+		const position = context.range.start
+		const windowSize = 500 // characters before and after cursor


in the context of an extension/site windowSize seems like a visual thing, maybe characterLookAroundSize or something?
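
A small sketch of the look-around extraction under the suggested name, assuming the cursor is already available as a character offset into the document text. The PR uses `context.range.start`, which is a position rather than an offset, so `textAroundOffset` and its parameters are hypothetical.

```typescript
// Returns up to `characterLookAroundSize` characters on each side of `offset`.
function textAroundOffset(text: string, offset: number, characterLookAroundSize: number): string {
	// Clamp the offset so out-of-range cursors do not produce negative indices.
	const clamped = Math.min(Math.max(offset, 0), text.length)
	const start = Math.max(0, clamped - characterLookAroundSize)
	const end = Math.min(text.length, clamped + characterLookAroundSize)
	return text.slice(start, end)
}
```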

markijbema · 2025-10-09T08:33:16Z

src/services/ghost/context/ContextRanking.ts

+/**
+ * Deduplicate snippets from the same file by merging overlapping content
+ */
+export function deduplicateSnippets(snippets: RankedSnippet[]): RankedSnippet[] {


Why are this and the following methods unused? I think constraining the amount of syntax is especially important, as it is easy to overwhelm small models. We might even need to compress or slice the current file a bit if it is too large.

markijbema · 2025-10-09T08:35:40Z

src/services/ghost/context/ContextRanking.ts

+ */
+export function fillPromptWithSnippets(
+	snippets: RankedSnippet[],
+	maxTokens: number,


we probably need to use the current file as input for maxTokens
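
One way this could work, sketched under assumptions: derive the snippet budget from what is left of the context window after the current file and a reserve for the completion, then fill greedily by score. `Snippet`, `estimateTokens` (a crude four-characters-per-token heuristic), `snippetBudget`, and `fillWithinBudget` are hypothetical and are not the PR's `RankedSnippet` or `fillPromptWithSnippets`.

```typescript
// Hypothetical snippet shape for illustration.
interface Snippet {
	content: string
	score: number
}

// Assumption: roughly 4 characters per token. A real tokenizer would be more accurate.
function estimateTokens(text: string): number {
	return Math.ceil(text.length / 4)
}

// Tokens left for snippets once the current file and the completion reserve are accounted for.
function snippetBudget(contextWindowTokens: number, currentFileText: string, reservedForCompletion: number): number {
	return Math.max(0, contextWindowTokens - estimateTokens(currentFileText) - reservedForCompletion)
}

// Greedily take the highest-scoring snippets that still fit; skip ones that do not.
function fillWithinBudget(snippets: Snippet[], maxTokens: number): Snippet[] {
	const ranked = snippets.slice().sort((x, y) => y.score - x.score)
	const selected: Snippet[] = []
	let used = 0
	for (const snippet of ranked) {
		const cost = estimateTokens(snippet.content)
		if (used + cost > maxTokens) continue
		selected.push(snippet)
		used += cost
	}
	return selected
}
```

If the current file alone exceeds the window, the budget comes out as 0 and no snippets are added, which is the point at which slicing the current file itself would have to happen.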

markijbema

I like this direction, but we have to be more careful about the context window size; otherwise this will be a regression (because we'll always go over).

beatlevic added 2 commits October 9, 2025 00:22

Added recent operations context to FIM codestral. Also added global r…

a12682a

…ecent operations.

Added context ranking implemenation of recent operations and snippets…

e87dfc0

… for FIM strategy

beatlevic requested a review from markijbema October 8, 2025 22:35

markijbema changed the base branch from main to beatlevic/test-llm-strategies October 9, 2025 07:59

markijbema reviewed Oct 9, 2025

markijbema requested changes Oct 9, 2025

Base automatically changed from beatlevic/test-llm-strategies to main October 9, 2025 12:24

Merge branch 'main' into beatlevic/fim-context-improvements

2569fef

markijbema marked this pull request as draft October 10, 2025 09:58

markijbema mentioned this pull request Oct 10, 2025

Add more complex context collection #2899

Open
