Get logits as numpy array #4

KeremP · 2023-04-28T04:12:59Z

I believe this could address nomic-ai/pygpt4all#3.

logits are passed to a callback function at each iteration as a numpy array.

for (int i = embd.size(); i < embd_inp.size() + params.n_predict; i++) {
        // predict
        if (embd.size() > 0) {
            const int64_t t_start_us = ggml_time_us();

            if (!gptj_eval(model, params.n_threads, n_past, embd, logits, mem_per_token)) {
                printf("Failed to predict\n");
                return 1;
            }
            // collect logits for each token
            py::array_t<float> _logits = py::array_t<float>{model.hparams.n_vocab, logits.data(), py::none()};
            logits_callback(_logits);
            t_predict_us += ggml_time_us() - t_start_us;
        }
...
}

def _call_logits_callback(self, logits: np.ndarray):
        """
        Internal logits_callback that saves the logit representation at each token.
        :return: None
        """
        self.logits.append(logits.tolist())
        
        if Model._logits_callback is not None:
            Model._logits_callback(logits)

Vectors are appended to self.logits and can be dumped to .npy by calling model.braindump("output_path")

absadiki · 2023-04-28T20:22:43Z

That's great @KeremP.
Thank you very much.

KeremP added 3 commits April 27, 2023 23:54

get logits at each token as numpy array

087d2f0

Merge remote-tracking branch 'upstream/main'

d30b873

minor changes.

9ec6bfd

absadiki merged commit e04bf59 into absadiki:main Apr 28, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get logits as numpy array #4

Get logits as numpy array #4

KeremP commented Apr 28, 2023

absadiki commented Apr 28, 2023

Get logits as numpy array #4

Get logits as numpy array #4

Conversation

KeremP commented Apr 28, 2023

absadiki commented Apr 28, 2023