ggml : fix handling of zero blocks in IQ quants #7955

ggerganov · 2024-06-16T07:43:34Z

Allow for blocks full of 0s

Review Complexity : Low
I have read the contributing guidelines

ggml-ci

CISC · 2024-06-16T10:36:07Z

ggml-quants.c

@@ -13139,7 +13139,7 @@ static int iq1_find_best_neighbour(const uint16_t * restrict neighbours, const u
        const float * restrict xval, const float * restrict weight, float * scale, int8_t * restrict L, int ngrid) {
    int num_neighbors = neighbours[0];
    GGML_ASSERT(num_neighbors > 0);
-    float best_score = 0;
+    float best_score = -FLT_MAX;


This function isn't used any more, but why negative FLT_MAX?

Any negative value works, but I used -FLT_MAX for consistency with the usage of FLT_MAX elsewhere in the source.
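To illustrate why the initial value matters for a block full of 0s, here is a minimal sketch. It is not the ggml code: it assumes a simplified score (a weighted dot product between the block and each candidate), and `BLOCK`, `candidate_score`, and `find_best` are hypothetical names used only for this example. With an all-zero block every candidate scores 0, so a strict `>` comparison against an initial `best_score` of 0 never selects anything, while any negative initial value lets the first candidate win.

```c
#include <assert.h>
#include <float.h>

#define BLOCK 4  /* hypothetical block size, chosen for illustration only */

/* Hypothetical score: weighted dot product of a candidate q with the block x. */
static float candidate_score(const float *x, const float *w, const float *q) {
    float s = 0.0f;
    for (int i = 0; i < BLOCK; ++i) {
        s += w[i] * q[i] * x[i];
    }
    return s;
}

/* Returns the index of the highest-scoring candidate, or -1 if no candidate
 * scores strictly above init_score. */
static int find_best(const float *x, const float *w,
                     const float (*cands)[BLOCK], int n, float init_score) {
    assert(n > 0);
    float best_score = init_score;
    int best_index = -1;
    for (int j = 0; j < n; ++j) {
        const float score = candidate_score(x, w, cands[j]);
        if (score > best_score) {  /* strict comparison, as in the sketch's premise */
            best_score = score;
            best_index = j;
        }
    }
    return best_index;
}
```

Called with an all-zero `x`, `find_best(..., 0.0f)` returns -1 (no candidate chosen), whereas `find_best(..., -FLT_MAX)` returns a valid index. For non-zero blocks the two initial values can differ only when no candidate scores above 0.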

ggml : fix handling of zero blocks in IQ quants

28f7a4d

ggml-ci

github-actions bot added the "ggml" label (changes relating to the ggml tensor library for machine learning) Jun 16, 2024

mofosyne added the "Review Complexity : Low" label (trivial changes to code that most beginner devs, or those who want a break, can tackle, e.g. a UI fix) Jun 16, 2024

CISC mentioned this pull request Jun 16, 2024

Avoid division-by-zero on 0-weights #7825

Open

CISC reviewed Jun 16, 2024

CISC mentioned this pull request Jun 16, 2024

Bug: QWEN2 quantization GGML_ASSERT #7805

Closed

ggerganov merged commit cddaf02 into master Jun 16, 2024
74 checks passed
