A GGML Derived Tensor Library GGML Architecture Context A "container" which holds computational graphs, tensors, and in some cases data. Graph A computation graph. eg. a scaled dot product attention would look something like this: Backend Underlying accelerator device - CUDA, TPUs, CPUs, TT, MPS, AMD GPUs etc.. Quick Guide