From 1e319b689b13a5e880402ad719e5468d98d7ed1c Mon Sep 17 00:00:00 2001
From: "Documenter.jl" <documenter@juliadocs.github.io>
Date: Sat, 25 Nov 2023 03:16:17 -0500
Subject: [PATCH] build based on 10791a2

---
 dev/api/array/index.html        |  2 +-
 dev/api/compiler/index.html     |  4 ++--
 dev/api/essentials/index.html   |  2 +-
 dev/api/kernel/index.html       | 22 +++++++++++-----------
 dev/api/mps/index.html          |  8 ++++----
 dev/faq/contributing/index.html |  2 +-
 dev/faq/faq/index.html          |  2 +-
 dev/index.html                  |  2 +-
 dev/profiling/index.html        |  2 +-
 dev/search/index.html           |  2 +-
 dev/usage/array/index.html      |  2 +-
 dev/usage/kernel/index.html     |  2 +-
 dev/usage/overview/index.html   |  2 +-
 13 files changed, 27 insertions(+), 27 deletions(-)
diff --git a/dev/api/array/index.html b/dev/api/array/index.html
index 313d089bf..c675554be 100644
--- a/dev/api/array/index.html
+++ b/dev/api/array/index.html
@@ -20,4 +20,4 @@
 3-element MtlVector{Int64, Metal.MTL.MTLResourceStorageModePrivate}:
  1
  2
- 3</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/array.jl#L339-L380">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../kernel/">« Kernel programming</a><a class="docs-footer-nextpage" href="../mps/">Metal Performance Shaders »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+ 3</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/array.jl#L339-L380">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../kernel/">« Kernel programming</a><a class="docs-footer-nextpage" href="../mps/">Metal Performance Shaders »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/api/compiler/index.html b/dev/api/compiler/index.html
index 97c2c3c19..a533d3706 100644
--- a/dev/api/compiler/index.html
+++ b/dev/api/compiler/index.html
@@ -1,8 +1,8 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Compiler · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/compiler/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../essentials/">Essentials</a></li><li class="is-active"><a class="tocitem" href>Compiler</a><ul class="internal"><li><a class="tocitem" href="#Execution"><span>Execution</span></a></li><li><a class="tocitem" href="#Reflection"><span>Reflection</span></a></li></ul></li><li><a class="tocitem" href="../kernel/">Kernel programming</a></li><li><a class="tocitem" href="../array/">Array programming</a></li><li><a class="tocitem" href="../mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a href>Compiler</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Compiler</a></li></ul></nav><div 
class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/compiler.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Compiler"><a class="docs-heading-anchor" href="#Compiler">Compiler</a><a id="Compiler-1"></a><a class="docs-heading-anchor-permalink" href="#Compiler" title="Permalink"></a></h1><h2 id="Execution"><a class="docs-heading-anchor" href="#Execution">Execution</a><a id="Execution-1"></a><a class="docs-heading-anchor-permalink" href="#Execution" title="Permalink"></a></h2><p>The main entry point to the compiler is the <code>@metal</code> macro:</p><article class="docstring"><header><a class="docstring-binding" id="Metal.@metal" href="#Metal.@metal"><code>Metal.@metal</code></a> — <span class="docstring-category">Macro</span></header><section><div><pre><code class="language-julia hljs">@metal threads=... groups=... [kwargs...] func(args...)</code></pre><p>High-level interface for executing code on a GPU.</p><p>The <code>@metal</code> macro should prefix a call, with <code>func</code> a callable function or object that should return nothing. It will be compiled to a Metal function upon first use, and to a certain extent arguments will be converted and managed automatically using <code>mtlconvert</code>. Finally, a call to <code>mtlcall</code> is performed, creating a command buffer in the current global command queue and then committing it.</p><p>Several keyword arguments influence the behavior of <code>@metal</code>:</p><ul><li><code>launch</code>: whether to launch this kernel, defaults to <code>true</code>. 
If <code>false</code>, the returned kernel object should be launched by calling it and passing arguments again.</li><li><code>name</code>: the name of the kernel in the generated code. Defaults to an automatically generated name.</li><li><code>queue</code>: the command queue to use for this kernel. Defaults to the global command queue.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/compiler/execution.jl#L10-L28">source</a></section></article><p>If needed, you can use a lower-level API that lets you inspect the compiled kernel:</p><article class="docstring"><header><a class="docstring-binding" id="Metal.mtlconvert" href="#Metal.mtlconvert"><code>Metal.mtlconvert</code></a> — <span class="docstring-category">Function</span></header><section><div><p>mtlconvert(x, [cce])</p><p>This function is called for every argument to be passed to a kernel, allowing it to be converted to a GPU-friendly format. By default, the function does nothing and returns the input object <code>x</code> as-is.</p><p>Do not add methods to this function, but instead extend the underlying Adapt.jl package and register methods for the <code>Metal.Adaptor</code> type.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/compiler/execution.jl#L123-L132">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.mtlfunction" href="#Metal.mtlfunction"><code>Metal.mtlfunction</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">mtlfunction(f, tt=Tuple{}; kwargs...)</code></pre><p>Low-level interface to compile a function invocation for the currently-active GPU, returning a callable kernel object. 
For a higher-level interface, use <a href="#Metal.@metal"><code>@metal</code></a>.</p><p>The output of this function is automatically cached, i.e. you can simply call <code>mtlfunction</code> in a hot path without degrading performance. New code will be generated automatically when the function changes, or when different types or keyword arguments are provided.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/compiler/execution.jl#L145-L154">source</a></section></article><h2 id="Reflection"><a class="docs-heading-anchor" href="#Reflection">Reflection</a><a id="Reflection-1"></a><a class="docs-heading-anchor-permalink" href="#Reflection" title="Permalink"></a></h2><p>If you want to inspect generated code, you can use macros that resemble functionality from the InteractiveUtils standard library:</p><pre><code class="nohighlight hljs">@device_code_lowered
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Compiler · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/compiler/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../essentials/">Essentials</a></li><li class="is-active"><a class="tocitem" href>Compiler</a><ul class="internal"><li><a class="tocitem" href="#Execution"><span>Execution</span></a></li><li><a class="tocitem" href="#Reflection"><span>Reflection</span></a></li></ul></li><li><a class="tocitem" href="../kernel/">Kernel programming</a></li><li><a class="tocitem" href="../array/">Array programming</a></li><li><a class="tocitem" href="../mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a href>Compiler</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Compiler</a></li></ul></nav><div 
class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/compiler.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Compiler"><a class="docs-heading-anchor" href="#Compiler">Compiler</a><a id="Compiler-1"></a><a class="docs-heading-anchor-permalink" href="#Compiler" title="Permalink"></a></h1><h2 id="Execution"><a class="docs-heading-anchor" href="#Execution">Execution</a><a id="Execution-1"></a><a class="docs-heading-anchor-permalink" href="#Execution" title="Permalink"></a></h2><p>The main entry point to the compiler is the <code>@metal</code> macro:</p><article class="docstring"><header><a class="docstring-binding" id="Metal.@metal" href="#Metal.@metal"><code>Metal.@metal</code></a> — <span class="docstring-category">Macro</span></header><section><div><pre><code class="language-julia hljs">@metal threads=... groups=... [kwargs...] func(args...)</code></pre><p>High-level interface for executing code on a GPU.</p><p>The <code>@metal</code> macro should prefix a call, with <code>func</code> a callable function or object that should return nothing. It will be compiled to a Metal function upon first use, and to a certain extent arguments will be converted and managed automatically using <code>mtlconvert</code>. Finally, a call to <code>mtlcall</code> is performed, creating a command buffer in the current global command queue and then committing it.</p><p>Several keyword arguments influence the behavior of <code>@metal</code>:</p><ul><li><code>launch</code>: whether to launch this kernel, defaults to <code>true</code>. 
If <code>false</code>, the returned kernel object should be launched by calling it and passing arguments again.</li><li><code>name</code>: the name of the kernel in the generated code. Defaults to an automatically generated name.</li><li><code>queue</code>: the command queue to use for this kernel. Defaults to the global command queue.</li></ul></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/compiler/execution.jl#L10-L28">source</a></section></article><p>If needed, you can use a lower-level API that lets you inspect the compiled kernel:</p><article class="docstring"><header><a class="docstring-binding" id="Metal.mtlconvert" href="#Metal.mtlconvert"><code>Metal.mtlconvert</code></a> — <span class="docstring-category">Function</span></header><section><div><p>mtlconvert(x, [cce])</p><p>This function is called for every argument to be passed to a kernel, allowing it to be converted to a GPU-friendly format. By default, the function does nothing and returns the input object <code>x</code> as-is.</p><p>Do not add methods to this function, but instead extend the underlying Adapt.jl package and register methods for the <code>Metal.Adaptor</code> type.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/compiler/execution.jl#L123-L132">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.mtlfunction" href="#Metal.mtlfunction"><code>Metal.mtlfunction</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">mtlfunction(f, tt=Tuple{}; kwargs...)</code></pre><p>Low-level interface to compile a function invocation for the currently-active GPU, returning a callable kernel object. 
For a higher-level interface, use <a href="#Metal.@metal"><code>@metal</code></a>.</p><p>The output of this function is automatically cached, i.e. you can simply call <code>mtlfunction</code> in a hot path without degrading performance. New code will be generated automatically when the function changes, or when different types or keyword arguments are provided.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/compiler/execution.jl#L145-L154">source</a></section></article><h2 id="Reflection"><a class="docs-heading-anchor" href="#Reflection">Reflection</a><a id="Reflection-1"></a><a class="docs-heading-anchor-permalink" href="#Reflection" title="Permalink"></a></h2><p>If you want to inspect generated code, you can use macros that resemble functionality from the InteractiveUtils standard library:</p><pre><code class="nohighlight hljs">@device_code_lowered
 @device_code_typed
 @device_code_warntype
 @device_code_llvm
 @device_code_air
 @device_code_agx
-@device_code</code></pre><p>For more information, please consult the GPUCompiler.jl documentation. <code>code_air</code> is actually <code>code_native</code>:</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../essentials/">« Essentials</a><a class="docs-footer-nextpage" href="../kernel/">Kernel programming »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+@device_code</code></pre><p>For more information, please consult the GPUCompiler.jl documentation. <code>code_air</code> is actually <code>code_native</code>:</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../essentials/">« Essentials</a><a class="docs-footer-nextpage" href="../kernel/">Kernel programming »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/api/essentials/index.html b/dev/api/essentials/index.html
index f2ebecb16..5481a9016 100644
--- a/dev/api/essentials/index.html
+++ b/dev/api/essentials/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Essentials · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/essentials/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li class="is-active"><a class="tocitem" href>Essentials</a><ul class="internal"><li><a class="tocitem" href="#Global-State"><span>Global State</span></a></li></ul></li><li><a class="tocitem" href="../compiler/">Compiler</a></li><li><a class="tocitem" href="../kernel/">Kernel programming</a></li><li><a class="tocitem" href="../array/">Array programming</a></li><li><a class="tocitem" href="../mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a href>Essentials</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Essentials</a></li></ul></nav><div class="docs-right"><a class="docs-edit-link" 
href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/essentials.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Essentials"><a class="docs-heading-anchor" href="#Essentials">Essentials</a><a id="Essentials-1"></a><a class="docs-heading-anchor-permalink" href="#Essentials" title="Permalink"></a></h1><h2 id="Global-State"><a class="docs-heading-anchor" href="#Global-State">Global State</a><a id="Global-State-1"></a><a class="docs-heading-anchor-permalink" href="#Global-State" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.device!" href="#Metal.device!"><code>Metal.device!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">device!(dev::MTLDevice)</code></pre><p>Sets the Metal GPU device associated with the current Julia task.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/state.jl#L20-L24">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MTL.devices" href="#Metal.MTL.devices"><code>Metal.MTL.devices</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">devices()</code></pre><p>Get an iterator for the compute devices.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mtl/device.jl#L72-L76">source</a></section></article><article class="docstring"><header><a 
class="docstring-binding" id="Metal.current_device" href="#Metal.current_device"><code>Metal.current_device</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">current_device()::MTLDevice</code></pre><p>Return the Metal GPU device associated with the current Julia task.</p><p>Since all M-series systems currently only externally show a single GPU, this function effectively returns the only system GPU.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/state.jl#L3-L10">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.global_queue" href="#Metal.global_queue"><code>Metal.global_queue</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">global_queue(dev::MTLDevice)::MTLCommandQueue</code></pre><p>Return the Metal command queue associated with the current Julia thread.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/state.jl#L29-L33">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.synchronize" href="#Metal.synchronize"><code>Metal.synchronize</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">synchronize(queue)</code></pre><p>Wait for currently committed GPU work on this queue to finish.</p><p>Create a new MTLCommandBuffer from the global command queue, commit it to the queue, and simply wait for it to be completed. 
Since command buffers <em>should</em> execute in a First-In-First-Out manner, this synchronizes the GPU.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/state.jl#L44-L52">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.device_synchronize" href="#Metal.device_synchronize"><code>Metal.device_synchronize</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">device_synchronize()</code></pre><p>Synchronize all committed GPU work across all global queues</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/state.jl#L59-L63">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../../profiling/">« Profiling</a><a class="docs-footer-nextpage" href="../compiler/">Compiler »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. 
Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Essentials · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/essentials/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li class="is-active"><a class="tocitem" href>Essentials</a><ul class="internal"><li><a class="tocitem" href="#Global-State"><span>Global State</span></a></li></ul></li><li><a class="tocitem" href="../compiler/">Compiler</a></li><li><a class="tocitem" href="../kernel/">Kernel programming</a></li><li><a class="tocitem" href="../array/">Array programming</a></li><li><a class="tocitem" href="../mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a href>Essentials</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Essentials</a></li></ul></nav><div class="docs-right"><a class="docs-edit-link" 
href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/essentials.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Essentials"><a class="docs-heading-anchor" href="#Essentials">Essentials</a><a id="Essentials-1"></a><a class="docs-heading-anchor-permalink" href="#Essentials" title="Permalink"></a></h1><h2 id="Global-State"><a class="docs-heading-anchor" href="#Global-State">Global State</a><a id="Global-State-1"></a><a class="docs-heading-anchor-permalink" href="#Global-State" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.device!" href="#Metal.device!"><code>Metal.device!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">device!(dev::MTLDevice)</code></pre><p>Sets the Metal GPU device associated with the current Julia task.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/state.jl#L20-L24">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MTL.devices" href="#Metal.MTL.devices"><code>Metal.MTL.devices</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">devices()</code></pre><p>Get an iterator for the compute devices.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mtl/device.jl#L72-L76">source</a></section></article><article class="docstring"><header><a 
class="docstring-binding" id="Metal.current_device" href="#Metal.current_device"><code>Metal.current_device</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">current_device()::MTLDevice</code></pre><p>Return the Metal GPU device associated with the current Julia task.</p><p>Since all M-series systems currently only externally show a single GPU, this function effectively returns the only system GPU.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/state.jl#L3-L10">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.global_queue" href="#Metal.global_queue"><code>Metal.global_queue</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">global_queue(dev::MTLDevice)::MTLCommandQueue</code></pre><p>Return the Metal command queue associated with the current Julia thread.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/state.jl#L29-L33">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.synchronize" href="#Metal.synchronize"><code>Metal.synchronize</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">synchronize(queue)</code></pre><p>Wait for currently committed GPU work on this queue to finish.</p><p>Create a new MTLCommandBuffer from the global command queue, commit it to the queue, and simply wait for it to be completed. 
Since command buffers <em>should</em> execute in a First-In-First-Out manner, this synchronizes the GPU.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/state.jl#L44-L52">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.device_synchronize" href="#Metal.device_synchronize"><code>Metal.device_synchronize</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">device_synchronize()</code></pre><p>Synchronize all committed GPU work across all global queues</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/state.jl#L59-L63">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../../profiling/">« Profiling</a><a class="docs-footer-nextpage" href="../compiler/">Compiler »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. 
Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/api/kernel/index.html b/dev/api/kernel/index.html
index 584b7c850..a70751c6b 100644
--- a/dev/api/kernel/index.html
+++ b/dev/api/kernel/index.html
@@ -1,24 +1,24 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Kernel programming · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/kernel/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../essentials/">Essentials</a></li><li><a class="tocitem" href="../compiler/">Compiler</a></li><li class="is-active"><a class="tocitem" href>Kernel programming</a><ul class="internal"><li><a class="tocitem" href="#Indexing-and-dimensions"><span>Indexing and dimensions</span></a></li><li><a class="tocitem" href="#Device-arrays"><span>Device arrays</span></a></li><li><a class="tocitem" href="#Synchronization"><span>Synchronization</span></a></li></ul></li><li><a class="tocitem" href="../array/">Array programming</a></li><li><a class="tocitem" href="../mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a 
href>Kernel programming</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Kernel programming</a></li></ul></nav><div class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/kernel.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Kernel-programming"><a class="docs-heading-anchor" href="#Kernel-programming">Kernel programming</a><a id="Kernel-programming-1"></a><a class="docs-heading-anchor-permalink" href="#Kernel-programming" title="Permalink"></a></h1><p>This section lists the package&#39;s public functionality that corresponds to special Metal functions for use in device code. For more information about these functions, please consult the <a href="https://developer.apple.com/metal/Metal-Shading-Language-Specification.pdf">Metal Shading Language specification</a>.</p><p>This is made possible by interfacing with the Metal libraries through a small C library that wraps the ObjectiveC APIs. These low-level wrappers, along with some slightly higher-level Julia wrappers, are available in the MTL submodule exported by Metal.jl. 
All wrapped C functions and types start with the mt prefix, whereas the Julia wrappers are prefixed with Mtl:</p><h2 id="Indexing-and-dimensions"><a class="docs-heading-anchor" href="#Indexing-and-dimensions">Indexing and dimensions</a><a id="Indexing-and-dimensions-1"></a><a class="docs-heading-anchor-permalink" href="#Indexing-and-dimensions" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_execution_width" href="#Metal.thread_execution_width"><code>Metal.thread_execution_width</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_execution_width()::UInt32</code></pre><p>Return the execution width of the compute unit.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L127-L131">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_index_in_quadgroup" href="#Metal.thread_index_in_quadgroup"><code>Metal.thread_index_in_quadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_index_in_quadgroup()::UInt32</code></pre><p>Return the index of the current thread in its quadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L109-L113">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_index_in_simdgroup" href="#Metal.thread_index_in_simdgroup"><code>Metal.thread_index_in_simdgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_index_in_simdgroup()::UInt32</code></pre><p>Return the index of the current thread 
in its simdgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L115-L119">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_index_in_threadgroup" href="#Metal.thread_index_in_threadgroup"><code>Metal.thread_index_in_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_index_in_threadgroup()::UInt32</code></pre><p>Return the index of the current thread in its threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L121-L125">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_position_in_grid_1d" href="#Metal.thread_position_in_grid_1d"><code>Metal.thread_position_in_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_position_in_grid_1d()::UInt32
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Kernel programming · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/kernel/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../essentials/">Essentials</a></li><li><a class="tocitem" href="../compiler/">Compiler</a></li><li class="is-active"><a class="tocitem" href>Kernel programming</a><ul class="internal"><li><a class="tocitem" href="#Indexing-and-dimensions"><span>Indexing and dimensions</span></a></li><li><a class="tocitem" href="#Device-arrays"><span>Device arrays</span></a></li><li><a class="tocitem" href="#Synchronization"><span>Synchronization</span></a></li></ul></li><li><a class="tocitem" href="../array/">Array programming</a></li><li><a class="tocitem" href="../mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a 
href>Kernel programming</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Kernel programming</a></li></ul></nav><div class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/kernel.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Kernel-programming"><a class="docs-heading-anchor" href="#Kernel-programming">Kernel programming</a><a id="Kernel-programming-1"></a><a class="docs-heading-anchor-permalink" href="#Kernel-programming" title="Permalink"></a></h1><p>This section lists the package&#39;s public functionality that corresponds to special Metal functions for use in device code. For more information about these functions, please consult the <a href="https://developer.apple.com/metal/Metal-Shading-Language-Specification.pdf">Metal Shading Language specification</a>.</p><p>This is made possible by interfacing with the Metal libraries through a small C library that wraps the ObjectiveC APIs. These low-level wrappers, along with some slightly higher-level Julia wrappers, are available in the MTL submodule exported by Metal.jl. 
All wrapped C functions and types start with the mt prefix, whereas the Julia wrappers are prefixed with Mtl:</p><h2 id="Indexing-and-dimensions"><a class="docs-heading-anchor" href="#Indexing-and-dimensions">Indexing and dimensions</a><a id="Indexing-and-dimensions-1"></a><a class="docs-heading-anchor-permalink" href="#Indexing-and-dimensions" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_execution_width" href="#Metal.thread_execution_width"><code>Metal.thread_execution_width</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_execution_width()::UInt32</code></pre><p>Return the execution width of the compute unit.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L127-L131">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_index_in_quadgroup" href="#Metal.thread_index_in_quadgroup"><code>Metal.thread_index_in_quadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_index_in_quadgroup()::UInt32</code></pre><p>Return the index of the current thread in its quadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L109-L113">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_index_in_simdgroup" href="#Metal.thread_index_in_simdgroup"><code>Metal.thread_index_in_simdgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_index_in_simdgroup()::UInt32</code></pre><p>Return the index of the current thread 
in its simdgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L115-L119">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_index_in_threadgroup" href="#Metal.thread_index_in_threadgroup"><code>Metal.thread_index_in_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_index_in_threadgroup()::UInt32</code></pre><p>Return the index of the current thread in its threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L121-L125">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_position_in_grid_1d" href="#Metal.thread_position_in_grid_1d"><code>Metal.thread_position_in_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_position_in_grid_1d()::UInt32
 thread_position_in_grid_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-thread_position_in_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the current thread&#39;s position in an N-dimensional grid of threads.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_position_in_threadgroup_1d" href="#Metal.thread_position_in_threadgroup_1d"><code>Metal.thread_position_in_threadgroup_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_position_in_threadgroup_1d()::UInt32
+thread_position_in_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the current thread&#39;s position in an N-dimensional grid of threads.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.thread_position_in_threadgroup_1d" href="#Metal.thread_position_in_threadgroup_1d"><code>Metal.thread_position_in_threadgroup_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">thread_position_in_threadgroup_1d()::UInt32
 thread_position_in_threadgroup_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-thread_position_in_threadgroup_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the current thread&#39;s unique position within a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threadgroup_position_in_grid_1d" href="#Metal.threadgroup_position_in_grid_1d"><code>Metal.threadgroup_position_in_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threadgroup_position_in_grid_1d()::UInt32
+thread_position_in_threadgroup_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the current thread&#39;s unique position within a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threadgroup_position_in_grid_1d" href="#Metal.threadgroup_position_in_grid_1d"><code>Metal.threadgroup_position_in_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threadgroup_position_in_grid_1d()::UInt32
 threadgroup_position_in_grid_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-threadgroup_position_in_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the current threadgroup&#39;s unique position within the grid.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threadgroups_per_grid_1d" href="#Metal.threadgroups_per_grid_1d"><code>Metal.threadgroups_per_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threadgroups_per_grid_1d()::UInt32
+threadgroup_position_in_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the current threadgroup&#39;s unique position within the grid.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threadgroups_per_grid_1d" href="#Metal.threadgroups_per_grid_1d"><code>Metal.threadgroups_per_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threadgroups_per_grid_1d()::UInt32
 threadgroups_per_grid_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-threadgroups_per_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the number of threadgroups per grid.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threads_per_grid_1d" href="#Metal.threads_per_grid_1d"><code>Metal.threads_per_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threads_per_grid_1d()::UInt32
+threadgroups_per_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the number of threadgroups per grid.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threads_per_grid_1d" href="#Metal.threads_per_grid_1d"><code>Metal.threads_per_grid_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threads_per_grid_1d()::UInt32
 threads_per_grid_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-threads_per_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the grid size.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threads_per_simdgroup" href="#Metal.threads_per_simdgroup"><code>Metal.threads_per_simdgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threads_per_simdgroup()::UInt32</code></pre><p>Return the thread execution width of a simdgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L133-L137">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threads_per_threadgroup_1d" href="#Metal.threads_per_threadgroup_1d"><code>Metal.threads_per_threadgroup_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threads_per_threadgroup_1d()::UInt32
+threads_per_grid_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the grid size.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threads_per_simdgroup" href="#Metal.threads_per_simdgroup"><code>Metal.threads_per_simdgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threads_per_simdgroup()::UInt32</code></pre><p>Return the thread execution width of a simdgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L133-L137">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threads_per_threadgroup_1d" href="#Metal.threads_per_threadgroup_1d"><code>Metal.threads_per_threadgroup_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threads_per_threadgroup_1d()::UInt32
 threads_per_threadgroup_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-threads_per_threadgroup_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the thread execution width of a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.simdgroups_per_threadgroup" href="#Metal.simdgroups_per_threadgroup"><code>Metal.simdgroups_per_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">simdgroups_per_threadgroup()::UInt32</code></pre><p>Return the simdgroup execution width of a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L103-L107">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.simdgroup_index_in_threadgroup" href="#Metal.simdgroup_index_in_threadgroup"><code>Metal.simdgroup_index_in_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">simdgroup_index_in_threadgroup()::UInt32</code></pre><p>Return the index of a simdgroup within a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L97-L101">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.quadgroup_index_in_threadgroup" href="#Metal.quadgroup_index_in_threadgroup"><code>Metal.quadgroup_index_in_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia 
hljs">quadgroup_index_in_threadgroup()::UInt32</code></pre><p>Return the index of a quadgroup within a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L85-L89">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.quadgroups_per_threadgroup" href="#Metal.quadgroups_per_threadgroup"><code>Metal.quadgroups_per_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">quadgroups_per_threadgroup()::UInt32</code></pre><p>Return the quadgroup execution width of a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L91-L95">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.grid_size_1d" href="#Metal.grid_size_1d"><code>Metal.grid_size_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">grid_size_1d()::UInt32
+threads_per_threadgroup_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the thread execution width of a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.simdgroups_per_threadgroup" href="#Metal.simdgroups_per_threadgroup"><code>Metal.simdgroups_per_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">simdgroups_per_threadgroup()::UInt32</code></pre><p>Return the simdgroup execution width of a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L103-L107">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.simdgroup_index_in_threadgroup" href="#Metal.simdgroup_index_in_threadgroup"><code>Metal.simdgroup_index_in_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">simdgroup_index_in_threadgroup()::UInt32</code></pre><p>Return the index of a simdgroup within a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L97-L101">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.quadgroup_index_in_threadgroup" href="#Metal.quadgroup_index_in_threadgroup"><code>Metal.quadgroup_index_in_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia 
hljs">quadgroup_index_in_threadgroup()::UInt32</code></pre><p>Return the index of a quadgroup within a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L85-L89">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.quadgroups_per_threadgroup" href="#Metal.quadgroups_per_threadgroup"><code>Metal.quadgroups_per_threadgroup</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">quadgroups_per_threadgroup()::UInt32</code></pre><p>Return the quadgroup execution width of a threadgroup.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L91-L95">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.grid_size_1d" href="#Metal.grid_size_1d"><code>Metal.grid_size_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">grid_size_1d()::UInt32
 grid_size_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-grid_size_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return maximum size of the grid for threads that read per-thread stage-in data.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.grid_origin_1d" href="#Metal.grid_origin_1d"><code>Metal.grid_origin_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">grid_origin_1d()::UInt32
+grid_size_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return maximum size of the grid for threads that read per-thread stage-in data.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.grid_origin_1d" href="#Metal.grid_origin_1d"><code>Metal.grid_origin_1d</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">grid_origin_1d()::UInt32
 grid_origin_2d()::NamedTuple{(:x, :y), Tuple{UInt32, UInt32}}
-grid_origin_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the origin offset of the grid for threads that read per-thread stage-in data.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><h2 id="Device-arrays"><a class="docs-heading-anchor" href="#Device-arrays">Device arrays</a><a id="Device-arrays-1"></a><a class="docs-heading-anchor-permalink" href="#Device-arrays" title="Permalink"></a></h2><p>Metal.jl provides a primitive, lightweight array type to manage GPU data organized in an plain, dense fashion. This is the device-counterpart to the <code>MtlArray</code>, and implements (part of) the array interface as well as other functionality for use <em>on</em> the GPU:</p><article class="docstring"><header><a class="docstring-binding" id="Metal.MtlDeviceArray" href="#Metal.MtlDeviceArray"><code>Metal.MtlDeviceArray</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MtlDeviceArray(dims, ptr)
+grid_origin_3d()::NamedTuple{(:x, :y, :z), Tuple{UInt32, UInt32, UInt32}}</code></pre><p>Return the origin offset of the grid for threads that read per-thread stage-in data.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/arguments.jl#L149-L155">source</a></section></article><h2 id="Device-arrays"><a class="docs-heading-anchor" href="#Device-arrays">Device arrays</a><a id="Device-arrays-1"></a><a class="docs-heading-anchor-permalink" href="#Device-arrays" title="Permalink"></a></h2><p>Metal.jl provides a primitive, lightweight array type to manage GPU data organized in an plain, dense fashion. This is the device-counterpart to the <code>MtlArray</code>, and implements (part of) the array interface as well as other functionality for use <em>on</em> the GPU:</p><article class="docstring"><header><a class="docstring-binding" id="Metal.MtlDeviceArray" href="#Metal.MtlDeviceArray"><code>Metal.MtlDeviceArray</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MtlDeviceArray(dims, ptr)
 MtlDeviceArray{T}(dims, ptr)
 MtlDeviceArray{T,A}(dims, ptr)
-MtlDeviceArray{T,A,N}(dims, ptr)</code></pre><p>Construct an <code>N</code>-dimensional dense Metal device array with element type <code>T</code> wrapping a pointer, where <code>N</code> is determined from the length of <code>dims</code> and <code>T</code> is determined from the type of <code>ptr</code>.</p><p><code>dims</code> may be a single scalar, or a tuple of integers corresponding to the lengths in each dimension). If the rank <code>N</code> is supplied explicitly as in <code>Array{T,N}(dims)</code>, then it must match the length of <code>dims</code>. The same applies to the element type <code>T</code>, which should match the type of the pointer <code>ptr</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/array.jl#L8-L22">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.Const" href="#Metal.Const"><code>Metal.Const</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">Const(A::MtlDeviceArray)</code></pre><p>Mark a MtlDeviceArray as constant/read-only and to use the constant address space.</p><div class="admonition is-warning"><header class="admonition-header">Warning</header><div class="admonition-body"><p>Experimental API. 
Subject to change without deprecation.</p></div></div></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/array.jl#L124-L130">source</a></section></article><h3 id="Shared-memory"><a class="docs-heading-anchor" href="#Shared-memory">Shared memory</a><a id="Shared-memory-1"></a><a class="docs-heading-anchor-permalink" href="#Shared-memory" title="Permalink"></a></h3><article class="docstring"><header><a class="docstring-binding" id="Metal.MtlThreadGroupArray" href="#Metal.MtlThreadGroupArray"><code>Metal.MtlThreadGroupArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">MtlThreadGroupArray(::Type{T}, dims)</code></pre><p>Create an array local to each threadgroup launched during kernel execution.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/memory.jl#L3-L7">source</a></section></article><h2 id="Synchronization"><a class="docs-heading-anchor" href="#Synchronization">Synchronization</a><a id="Synchronization-1"></a><a class="docs-heading-anchor-permalink" href="#Synchronization" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.MemoryFlags" href="#Metal.MemoryFlags"><code>Metal.MemoryFlags</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MemoryFlags</code></pre><p>Flags to set the memory synchronization behavior of threadgroup_barrier and simdgroup_barrier.</p><p>Possible values:</p><pre><code class="nohighlight hljs">None: Set barriers to only act as an execution barrier and not apply a memory fence.
+MtlDeviceArray{T,A,N}(dims, ptr)</code></pre><p>Construct an <code>N</code>-dimensional dense Metal device array with element type <code>T</code> wrapping a pointer, where <code>N</code> is determined from the length of <code>dims</code> and <code>T</code> is determined from the type of <code>ptr</code>.</p><p><code>dims</code> may be a single scalar, or a tuple of integers corresponding to the lengths in each dimension). If the rank <code>N</code> is supplied explicitly as in <code>Array{T,N}(dims)</code>, then it must match the length of <code>dims</code>. The same applies to the element type <code>T</code>, which should match the type of the pointer <code>ptr</code>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/array.jl#L8-L22">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.Const" href="#Metal.Const"><code>Metal.Const</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">Const(A::MtlDeviceArray)</code></pre><p>Mark a MtlDeviceArray as constant/read-only and to use the constant address space.</p><div class="admonition is-warning"><header class="admonition-header">Warning</header><div class="admonition-body"><p>Experimental API. 
Subject to change without deprecation.</p></div></div></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/array.jl#L124-L130">source</a></section></article><h3 id="Shared-memory"><a class="docs-heading-anchor" href="#Shared-memory">Shared memory</a><a id="Shared-memory-1"></a><a class="docs-heading-anchor-permalink" href="#Shared-memory" title="Permalink"></a></h3><article class="docstring"><header><a class="docstring-binding" id="Metal.MtlThreadGroupArray" href="#Metal.MtlThreadGroupArray"><code>Metal.MtlThreadGroupArray</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">MtlThreadGroupArray(::Type{T}, dims)</code></pre><p>Create an array local to each threadgroup launched during kernel execution.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/memory.jl#L3-L7">source</a></section></article><h2 id="Synchronization"><a class="docs-heading-anchor" href="#Synchronization">Synchronization</a><a id="Synchronization-1"></a><a class="docs-heading-anchor-permalink" href="#Synchronization" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.MemoryFlags" href="#Metal.MemoryFlags"><code>Metal.MemoryFlags</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MemoryFlags</code></pre><p>Flags to set the memory synchronization behavior of threadgroup_barrier and simdgroup_barrier.</p><p>Possible values:</p><pre><code class="nohighlight hljs">None: Set barriers to only act as an execution barrier and not apply a memory fence.
 
 Device: Ensure the GPU correctly orders the memory operations to device memory
         for threads in the threadgroup or simdgroup.
@@ -30,4 +30,4 @@
         threads in a threadgroup or simdgroup for a texture with the read_write access qualifier.
 
 ThreadGroup_ImgBlock: Ensure the GPU correctly orders the memory operations to threadgroup imageblock memory
-        for threads in a threadgroup or simdgroup.</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/synchronization.jl#L6-L26">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threadgroup_barrier" href="#Metal.threadgroup_barrier"><code>Metal.threadgroup_barrier</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threadgroup_barrier(flag::MemoryFlags=MemoryFlagNone)</code></pre><p>Synchronize all threads in a threadgroup.</p><p>Possible flags that affect the memory synchronization behavior are found in <a href="#Metal.MemoryFlags"><code>MemoryFlags</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/synchronization.jl#L36-L42">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.simdgroup_barrier" href="#Metal.simdgroup_barrier"><code>Metal.simdgroup_barrier</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">simdgroup_barrier(flag::MemoryFlags=MemoryFlagNone)</code></pre><p>Synchronize all threads in a SIMD-group.</p><p>Possible flags that affect the memory synchronization behavior are found in <a href="#Metal.MemoryFlags"><code>MemoryFlags</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/src/device/intrinsics/synchronization.jl#L46-L52">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../compiler/">« Compiler</a><a class="docs-footer-nextpage" href="../array/">Array programming »</a><div class="flexbox-break"></div><p 
class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+        for threads in a threadgroup or simdgroup.</code></pre></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/synchronization.jl#L6-L26">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.threadgroup_barrier" href="#Metal.threadgroup_barrier"><code>Metal.threadgroup_barrier</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">threadgroup_barrier(flag::MemoryFlags=MemoryFlagNone)</code></pre><p>Synchronize all threads in a threadgroup.</p><p>Possible flags that affect the memory synchronization behavior are found in <a href="#Metal.MemoryFlags"><code>MemoryFlags</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/synchronization.jl#L36-L42">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.simdgroup_barrier" href="#Metal.simdgroup_barrier"><code>Metal.simdgroup_barrier</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">simdgroup_barrier(flag::MemoryFlags=MemoryFlagNone)</code></pre><p>Synchronize all threads in a SIMD-group.</p><p>Possible flags that affect the memory synchronization behavior are found in <a href="#Metal.MemoryFlags"><code>MemoryFlags</code></a></p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/src/device/intrinsics/synchronization.jl#L46-L52">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../compiler/">« Compiler</a><a class="docs-footer-nextpage" href="../array/">Array programming »</a><div class="flexbox-break"></div><p 
class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/api/mps/index.html b/dev/api/mps/index.html
index 7611bae49..dc258bc11 100644
--- a/dev/api/mps/index.html
+++ b/dev/api/mps/index.html
@@ -1,5 +1,5 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Metal Performance Shaders · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/mps/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../essentials/">Essentials</a></li><li><a class="tocitem" href="../compiler/">Compiler</a></li><li><a class="tocitem" href="../kernel/">Kernel programming</a></li><li><a class="tocitem" href="../array/">Array programming</a></li><li class="is-active"><a class="tocitem" href>Metal Performance Shaders</a><ul class="internal"><li><a class="tocitem" href="#Matrices-and-Vectors"><span>Matrices and Vectors</span></a></li></ul></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a href>Metal Performance Shaders</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Metal Performance Shaders</a></li></ul></nav><div 
class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/mps.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Metal-Performance-Shaders"><a class="docs-heading-anchor" href="#Metal-Performance-Shaders">Metal Performance Shaders</a><a id="Metal-Performance-Shaders-1"></a><a class="docs-heading-anchor-permalink" href="#Metal-Performance-Shaders" title="Permalink"></a></h1><p>This section lists the package&#39;s public functionality that corresponds to the Metal Performance Shaders functions. For more information about these functions, or to see which functions have yet to be implemented in this package, please consult the <a href="https://developer.apple.com/documentation/metalperformanceshaders?language=objc">Metal Performance Shaders Documentation</a>.</p><h2 id="Matrices-and-Vectors"><a class="docs-heading-anchor" href="#Matrices-and-Vectors">Matrices and Vectors</a><a id="Matrices-and-Vectors-1"></a><a class="docs-heading-anchor-permalink" href="#Matrices-and-Vectors" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.MPSMatrix" href="#Metal.MPS.MPSMatrix"><code>Metal.MPS.MPSMatrix</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MPSMatrix(arr::MtlMatrix)</code></pre><p>Metal matrix representation used in Performance Shaders.</p><p>Note that this results in a transposed view of the input, as Metal stores matrices row-major instead of column-major.</p></div><a class="docs-sourcelink" target="_blank" 
href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mps/matrix.jl#L95-L102">source</a></section><section><div><pre><code class="nohighlight hljs">MPSMatrix(arr::MtlArray{T,3})</code></pre><p>Metal batched matrix representation used in Performance Shaders.</p><p>Note that this results in a transposed view of the input, as Metal stores matrices row-major instead of column-major.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mps/matrix.jl#L117-L124">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.MPSVector" href="#Metal.MPS.MPSVector"><code>Metal.MPS.MPSVector</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MPSVector(arr::MtlVector)</code></pre><p>Metal vector representation used in Performance Shaders.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mps/vector.jl#L46-L50">source</a></section></article><h3 id="Matrix-Arithmetic-Operators"><a class="docs-heading-anchor" href="#Matrix-Arithmetic-Operators">Matrix Arithmetic Operators</a><a id="Matrix-Arithmetic-Operators-1"></a><a class="docs-heading-anchor-permalink" href="#Matrix-Arithmetic-Operators" title="Permalink"></a></h3><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.matmul!" href="#Metal.MPS.matmul!"><code>Metal.MPS.matmul!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">matMulMPS(a::MtlMatrix, b::MtlMatrix, c::MtlMatrix, alpha=1, beta=1,
-          transpose_left=false, transpose_right=false)</code></pre><p>A <code>MPSMatrixMultiplication</code> kernel thay computes: <code>c = alpha * op(a) * beta * op(b) + beta * C</code></p><p>This function should not typically be used. Rather, use the normal <code>LinearAlgebra</code> interface with any <code>MtlArray</code> and it should be accelerated using Metal Performance Shaders.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mps/matrix.jl#L176-L184">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.matvecmul!" href="#Metal.MPS.matvecmul!"><code>Metal.MPS.matvecmul!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">matVecMulMPS(c::MtlVector, a::MtlMatrix, b::MtlVector, alpha=1, beta=1,
-             transpose=false)</code></pre><p>A <code>MPSMatrixVectorMultiplication</code> kernel thay computes: <code>c = alpha * op(a) * b + beta * c</code></p><p>This function should not typically be used. Rather, use the normal <code>LinearAlgebra</code> interface with any <code>MtlArray</code> and it should be accelerated using Metal Performance Shaders.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mps/vector.jl#L91-L99">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.topk" href="#Metal.MPS.topk"><code>Metal.MPS.topk</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">topk(A::MtlMatrix{T}, k) where {T&lt;:MtlFloat}</code></pre><p>Compute the top <code>k</code> values and their corresponding indices column-wise in a matrix <code>A</code>. Return the indices in <code>I</code> and the values in <code>V</code>.</p><p><code>k</code> cannot be greater than 16.</p><p>Uses <code>MPSMatrixFindTopK</code>.</p><p>See also: <a href="#Metal.MPS.topk!"><code>topk!</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mps/matrix.jl#L284-L295">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.topk!" href="#Metal.MPS.topk!"><code>Metal.MPS.topk!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">topk!(A::MtlMatrix{T}, I::MtlMatrix{Int32}, V::MtlMatrix{T}, k)
-                                                 where {T&lt;:MtlFloat}</code></pre><p>Compute the top <code>k</code> values and their corresponding indices column-wise in a matrix <code>A</code>. Return the indices in <code>I</code> and the values in <code>V</code>.</p><p><code>k</code> cannot be greater than 16.</p><p>Uses <code>MPSMatrixFindTopK</code>.</p><p>See also: <a href="#Metal.MPS.topk"><code>topk</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/e8f16bfc332889aef19b5b486b5a4fdc560a5e45/lib/mps/matrix.jl#L242-L254">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../array/">« Array programming</a><a class="docs-footer-nextpage" href="../../faq/faq/">Frequently Asked Questions »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Metal Performance Shaders · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/api/mps/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../essentials/">Essentials</a></li><li><a class="tocitem" href="../compiler/">Compiler</a></li><li><a class="tocitem" href="../kernel/">Kernel programming</a></li><li><a class="tocitem" href="../array/">Array programming</a></li><li class="is-active"><a class="tocitem" href>Metal Performance Shaders</a><ul class="internal"><li><a class="tocitem" href="#Matrices-and-Vectors"><span>Matrices and Vectors</span></a></li></ul></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">API reference</a></li><li class="is-active"><a href>Metal Performance Shaders</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Metal Performance Shaders</a></li></ul></nav><div 
class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/api/mps.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Metal-Performance-Shaders"><a class="docs-heading-anchor" href="#Metal-Performance-Shaders">Metal Performance Shaders</a><a id="Metal-Performance-Shaders-1"></a><a class="docs-heading-anchor-permalink" href="#Metal-Performance-Shaders" title="Permalink"></a></h1><p>This section lists the package&#39;s public functionality that corresponds to the Metal Performance Shaders functions. For more information about these functions, or to see which functions have yet to be implemented in this package, please consult the <a href="https://developer.apple.com/documentation/metalperformanceshaders?language=objc">Metal Performance Shaders Documentation</a>.</p><h2 id="Matrices-and-Vectors"><a class="docs-heading-anchor" href="#Matrices-and-Vectors">Matrices and Vectors</a><a id="Matrices-and-Vectors-1"></a><a class="docs-heading-anchor-permalink" href="#Matrices-and-Vectors" title="Permalink"></a></h2><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.MPSMatrix" href="#Metal.MPS.MPSMatrix"><code>Metal.MPS.MPSMatrix</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MPSMatrix(arr::MtlMatrix)</code></pre><p>Metal matrix representation used in Performance Shaders.</p><p>Note that this results in a transposed view of the input, as Metal stores matrices row-major instead of column-major.</p></div><a class="docs-sourcelink" target="_blank" 
href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mps/matrix.jl#L95-L102">source</a></section><section><div><pre><code class="nohighlight hljs">MPSMatrix(arr::MtlArray{T,3})</code></pre><p>Metal batched matrix representation used in Performance Shaders.</p><p>Note that this results in a transposed view of the input, as Metal stores matrices row-major instead of column-major.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mps/matrix.jl#L117-L124">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.MPSVector" href="#Metal.MPS.MPSVector"><code>Metal.MPS.MPSVector</code></a> — <span class="docstring-category">Type</span></header><section><div><pre><code class="language-julia hljs">MPSVector(arr::MtlVector)</code></pre><p>Metal vector representation used in Performance Shaders.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mps/vector.jl#L46-L50">source</a></section></article><h3 id="Matrix-Arithmetic-Operators"><a class="docs-heading-anchor" href="#Matrix-Arithmetic-Operators">Matrix Arithmetic Operators</a><a id="Matrix-Arithmetic-Operators-1"></a><a class="docs-heading-anchor-permalink" href="#Matrix-Arithmetic-Operators" title="Permalink"></a></h3><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.matmul!" href="#Metal.MPS.matmul!"><code>Metal.MPS.matmul!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">matMulMPS(a::MtlMatrix, b::MtlMatrix, c::MtlMatrix, alpha=1, beta=1,
+          transpose_left=false, transpose_right=false)</code></pre><p>An <code>MPSMatrixMultiplication</code> kernel that computes: <code>c = alpha * op(a) * op(b) + beta * c</code></p><p>This function should not typically be used. Rather, use the normal <code>LinearAlgebra</code> interface with any <code>MtlArray</code> and it should be accelerated using Metal Performance Shaders.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mps/matrix.jl#L176-L184">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.matvecmul!" href="#Metal.MPS.matvecmul!"><code>Metal.MPS.matvecmul!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">matVecMulMPS(c::MtlVector, a::MtlMatrix, b::MtlVector, alpha=1, beta=1,
+             transpose=false)</code></pre><p>An <code>MPSMatrixVectorMultiplication</code> kernel that computes: <code>c = alpha * op(a) * b + beta * c</code></p><p>This function should not typically be used. Rather, use the normal <code>LinearAlgebra</code> interface with any <code>MtlArray</code> and it should be accelerated using Metal Performance Shaders.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mps/vector.jl#L91-L99">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.topk" href="#Metal.MPS.topk"><code>Metal.MPS.topk</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">topk(A::MtlMatrix{T}, k) where {T&lt;:MtlFloat}</code></pre><p>Compute the top <code>k</code> values and their corresponding indices column-wise in a matrix <code>A</code>. Return the indices in <code>I</code> and the values in <code>V</code>.</p><p><code>k</code> cannot be greater than 16.</p><p>Uses <code>MPSMatrixFindTopK</code>.</p><p>See also: <a href="#Metal.MPS.topk!"><code>topk!</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mps/matrix.jl#L284-L295">source</a></section></article><article class="docstring"><header><a class="docstring-binding" id="Metal.MPS.topk!" href="#Metal.MPS.topk!"><code>Metal.MPS.topk!</code></a> — <span class="docstring-category">Function</span></header><section><div><pre><code class="language-julia hljs">topk!(A::MtlMatrix{T}, I::MtlMatrix{Int32}, V::MtlMatrix{T}, k)
+                                                 where {T&lt;:MtlFloat}</code></pre><p>Compute the top <code>k</code> values and their corresponding indices column-wise in a matrix <code>A</code>. Return the indices in <code>I</code> and the values in <code>V</code>.</p><p><code>k</code> cannot be greater than 16.</p><p>Uses <code>MPSMatrixFindTopK</code>.</p><p>See also: <a href="#Metal.MPS.topk"><code>topk</code></a>.</p></div><a class="docs-sourcelink" target="_blank" href="https://github.com/JuliaGPU/Metal.jl/blob/10791a29a1f3d485fe5002b00fe8f38f3d17753f/lib/mps/matrix.jl#L242-L254">source</a></section></article></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../array/">« Array programming</a><a class="docs-footer-nextpage" href="../../faq/faq/">Frequently Asked Questions »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/faq/contributing/index.html b/dev/faq/contributing/index.html
index 6ee7b3d7c..6bf4bcf77 100644
--- a/dev/faq/contributing/index.html
+++ b/dev/faq/contributing/index.html
@@ -7,4 +7,4 @@
                         uint i [[thread_position_in_grid]])
 {
     atomic_store_explicit(&amp;out[i], 0.0f, memory_order_relaxed);
-}</code></pre><p>To compile with Metal&#39;s tools and emit human-readable IR, run something roughly along the lines of: <code>xcrun metal -S -emit-llvm dummy_kernel.metal</code></p><p>This will create a <code>.ll</code> file that you can then parse for whatever information you need. Be sure to double-check the metadata at the bottom for any significant changes your functionality introduces.</p><p>Test with different types and configurations to see what changes are caused. Also ensure that when writing very simple kernels, whatever you&#39;re interested in doesn&#39;t get optimized away. Double-check that the kernel&#39;s IR makes sense for what you wrote.</p><h2 id="Metal-Performance-Shaders"><a class="docs-heading-anchor" href="#Metal-Performance-Shaders">Metal Performance Shaders</a><a id="Metal-Performance-Shaders-1"></a><a class="docs-heading-anchor-permalink" href="#Metal-Performance-Shaders" title="Permalink"></a></h2><p>Metal exposes a special interface to its library of optimized kernels. Rather than accepting the normal set of input GPU data structures, it requires special <code>MPS</code> datatypes that assume row-major memory layout. As this is not the Julia default, adapt accordingly. Adding MPS functionality should be mostly straightforward, so this can be an easy entry point to helping. To get started, you can have a look at the <a href="https://developer.apple.com/documentation/metalperformanceshaders?language=objc">Metal Performance Shaders Documentation</a> from Apple.</p><h2 id="Exposing-your-Interface"><a class="docs-heading-anchor" href="#Exposing-your-Interface">Exposing your Interface</a><a id="Exposing-your-Interface-1"></a><a class="docs-heading-anchor-permalink" href="#Exposing-your-Interface" title="Permalink"></a></h2><p>There are varying degrees of user-facing interfaces from Metal.jl. At the lowest level is <code>Metal.MTL.xxx</code>. 
This is for low-level functionality close to or at bare Objective-C, or things that a normal user wouldn&#39;t directly be using. <code>Metal.MPS.xxx</code> is for Metal Performance Shader specifics (like <code>MPSMatrix</code>). Next, is <code>Metal.xxx</code>. This is for higher-level, usually pure-Julian functionality (like <code>current_device()</code>). The only thing beyond this is exporting into the global namespace. That would be useful for uniquely-named functions/structures/macros with clear and common use-cases (<code>MtlArray</code> or <code>@metal</code>).</p><p>Additionally, you can override non-Metal.jl functions like <code>LinearAlgebra.mul!</code> seen <a href="https://github.com/JuliaGPU/Metal.jl/blob/main/lib/mps/linalg.jl#L34">here</a>. This is essentially (ab)using multiple dispatch to specialize for certain cases (usually for more performant execution).</p><p>If your function is only available from within GPU kernels (like thread indexing intrinsics). Be sure to properly annotate with <code>@device_function</code> to ensure that calling from the host doesn&#39;t kill your Julia process.</p><p>Generally, think about how frequently you expect your addition to be used, how complex its use-case is, and whether or not it clashes/reimplements/optimizes existing functionality from outside Metal.jl. Put it behind the corresponding interface.</p><h2 id="Creating-Tests"><a class="docs-heading-anchor" href="#Creating-Tests">Creating Tests</a><a id="Creating-Tests-1"></a><a class="docs-heading-anchor-permalink" href="#Creating-Tests" title="Permalink"></a></h2><p>As it&#39;s good practice, and JuliaGPU has great CI/CD workflows, your addition should have associated tests to ensure correctness and edge cases. Look to existing examples under the <code>test</code> folder for initial guidance, and be sure to create tests for all valid types. Any new Julia file in this folder will be ran as its own testset. 
If you feel your tests don&#39;t fit in any existing place, you&#39;ll probably want to create a new file with an appropriate name.</p><h2 id="Running-a-Subset-of-the-Existing-Tests"><a class="docs-heading-anchor" href="#Running-a-Subset-of-the-Existing-Tests">Running a Subset of the Existing Tests</a><a id="Running-a-Subset-of-the-Existing-Tests-1"></a><a class="docs-heading-anchor-permalink" href="#Running-a-Subset-of-the-Existing-Tests" title="Permalink"></a></h2><p>Sometimes you won&#39;t want to run the entire testsuite. You may just want to run the tests for your new functionality. To do that, you can either pass the name of the testset to the <code>test/runtests.jl</code> script: <code>julia --project=test test/runtests.jl metal</code> or you can isolate test files by running them alone after running the <code>test/setup.jl</code> script: <code>julia --project=test -L test/setup.jl test/metal.jl</code></p><h2 id="Thank-You-and-Good-Luck"><a class="docs-heading-anchor" href="#Thank-You-and-Good-Luck">Thank You and Good Luck</a><a id="Thank-You-and-Good-Luck-1"></a><a class="docs-heading-anchor-permalink" href="#Thank-You-and-Good-Luck" title="Permalink"></a></h2><p>Open-source projects like this only happen because people like you are willing to spend their free time helping out. Most anything you&#39;re able to do is helpful, but if you get stuck, seek guidance from Slack or Discourse. Don&#39;t feel like your contribution has to be perfect. If you put in effort and make progress, there will likely be some senior developer willing to polish your code before merging. 
Open-source software is a team effort...welcome to the team!</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../faq/">« Frequently Asked Questions</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+}</code></pre><p>To compile with Metal&#39;s tools and emit human-readable IR, run something roughly along the lines of: <code>xcrun metal -S -emit-llvm dummy_kernel.metal</code></p><p>This will create a <code>.ll</code> file that you can then parse for whatever information you need. Be sure to double-check the metadata at the bottom for any significant changes your functionality introduces.</p><p>Test with different types and configurations to see what changes are caused. Also ensure that when writing very simple kernels, whatever you&#39;re interested in doesn&#39;t get optimized away. Double-check that the kernel&#39;s IR makes sense for what you wrote.</p><h2 id="Metal-Performance-Shaders"><a class="docs-heading-anchor" href="#Metal-Performance-Shaders">Metal Performance Shaders</a><a id="Metal-Performance-Shaders-1"></a><a class="docs-heading-anchor-permalink" href="#Metal-Performance-Shaders" title="Permalink"></a></h2><p>Metal exposes a special interface to its library of optimized kernels. Rather than accepting the normal set of input GPU data structures, it requires special <code>MPS</code> datatypes that assume row-major memory layout. As this is not the Julia default, adapt accordingly. Adding MPS functionality should be mostly straightforward, so this can be an easy entry point to helping. To get started, you can have a look at the <a href="https://developer.apple.com/documentation/metalperformanceshaders?language=objc">Metal Performance Shaders Documentation</a> from Apple.</p><h2 id="Exposing-your-Interface"><a class="docs-heading-anchor" href="#Exposing-your-Interface">Exposing your Interface</a><a id="Exposing-your-Interface-1"></a><a class="docs-heading-anchor-permalink" href="#Exposing-your-Interface" title="Permalink"></a></h2><p>There are varying degrees of user-facing interfaces from Metal.jl. At the lowest level is <code>Metal.MTL.xxx</code>. 
This is for low-level functionality close to or at bare Objective-C, or things that a normal user wouldn&#39;t directly be using. <code>Metal.MPS.xxx</code> is for Metal Performance Shader specifics (like <code>MPSMatrix</code>). Next is <code>Metal.xxx</code>. This is for higher-level, usually pure-Julian functionality (like <code>current_device()</code>). The only thing beyond this is exporting into the global namespace. That would be useful for uniquely-named functions/structures/macros with clear and common use-cases (<code>MtlArray</code> or <code>@metal</code>).</p><p>Additionally, you can override non-Metal.jl functions like <code>LinearAlgebra.mul!</code> seen <a href="https://github.com/JuliaGPU/Metal.jl/blob/main/lib/mps/linalg.jl#L34">here</a>. This is essentially (ab)using multiple dispatch to specialize for certain cases (usually for more performant execution).</p><p>If your function is only available from within GPU kernels (like thread indexing intrinsics), be sure to properly annotate it with <code>@device_function</code> to ensure that calling from the host doesn&#39;t kill your Julia process.</p><p>Generally, think about how frequently you expect your addition to be used, how complex its use-case is, and whether or not it clashes/reimplements/optimizes existing functionality from outside Metal.jl. Put it behind the corresponding interface.</p><h2 id="Creating-Tests"><a class="docs-heading-anchor" href="#Creating-Tests">Creating Tests</a><a id="Creating-Tests-1"></a><a class="docs-heading-anchor-permalink" href="#Creating-Tests" title="Permalink"></a></h2><p>As it&#39;s good practice, and JuliaGPU has great CI/CD workflows, your addition should have associated tests to verify correctness and cover edge cases. Look to existing examples under the <code>test</code> folder for initial guidance, and be sure to create tests for all valid types. Any new Julia file in this folder will be run as its own testset. 
If you feel your tests don&#39;t fit in any existing place, you&#39;ll probably want to create a new file with an appropriate name.</p><h2 id="Running-a-Subset-of-the-Existing-Tests"><a class="docs-heading-anchor" href="#Running-a-Subset-of-the-Existing-Tests">Running a Subset of the Existing Tests</a><a id="Running-a-Subset-of-the-Existing-Tests-1"></a><a class="docs-heading-anchor-permalink" href="#Running-a-Subset-of-the-Existing-Tests" title="Permalink"></a></h2><p>Sometimes you won&#39;t want to run the entire testsuite. You may just want to run the tests for your new functionality. To do that, you can either pass the name of the testset to the <code>test/runtests.jl</code> script: <code>julia --project=test test/runtests.jl metal</code> or you can isolate test files by running them alone after running the <code>test/setup.jl</code> script: <code>julia --project=test -L test/setup.jl test/metal.jl</code></p><h2 id="Thank-You-and-Good-Luck"><a class="docs-heading-anchor" href="#Thank-You-and-Good-Luck">Thank You and Good Luck</a><a id="Thank-You-and-Good-Luck-1"></a><a class="docs-heading-anchor-permalink" href="#Thank-You-and-Good-Luck" title="Permalink"></a></h2><p>Open-source projects like this only happen because people like you are willing to spend their free time helping out. Most anything you&#39;re able to do is helpful, but if you get stuck, seek guidance from Slack or Discourse. Don&#39;t feel like your contribution has to be perfect. If you put in effort and make progress, there will likely be some senior developer willing to polish your code before merging. 
Open-source software is a team effort...welcome to the team!</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../faq/">« Frequently Asked Questions</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/faq/faq/index.html b/dev/faq/faq/index.html
index 2b9a2c82a..71bbfa05e 100644
--- a/dev/faq/faq/index.html
+++ b/dev/faq/faq/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Frequently Asked Questions · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/faq/faq/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../../api/essentials/">Essentials</a></li><li><a class="tocitem" href="../../api/compiler/">Compiler</a></li><li><a class="tocitem" href="../../api/kernel/">Kernel programming</a></li><li><a class="tocitem" href="../../api/array/">Array programming</a></li><li><a class="tocitem" href="../../api/mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li class="is-active"><a class="tocitem" href>Frequently Asked Questions</a><ul class="internal"><li><a class="tocitem" href="#Can-you-wrap-this-Metal-API?"><span>Can you wrap this Metal API?</span></a></li></ul></li><li><a class="tocitem" href="../contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">FAQ</a></li><li class="is-active"><a href>Frequently Asked Questions</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Frequently Asked 
Questions</a></li></ul></nav><div class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/faq/faq.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Frequently-Asked-Questions"><a class="docs-heading-anchor" href="#Frequently-Asked-Questions">Frequently Asked Questions</a><a id="Frequently-Asked-Questions-1"></a><a class="docs-heading-anchor-permalink" href="#Frequently-Asked-Questions" title="Permalink"></a></h1><h2 id="Can-you-wrap-this-Metal-API?"><a class="docs-heading-anchor" href="#Can-you-wrap-this-Metal-API?">Can you wrap this Metal API?</a><a id="Can-you-wrap-this-Metal-API?-1"></a><a class="docs-heading-anchor-permalink" href="#Can-you-wrap-this-Metal-API?" title="Permalink"></a></h2><p>Most likely. 
Any help on designing or implementing high-level wrappers for MSL&#39;s low-level functionality is greatly appreciated, so please consider <a href="../contributing/">contributing</a> your uses of these APIs on the respective repositories.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../../api/mps/">« Metal Performance Shaders</a><a class="docs-footer-nextpage" href="../contributing/">Contributing »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Frequently Asked Questions · Metal.jl</title><script data-outdated-warner src="../../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/faq/faq/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL="../.."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../../assets/documenter.js"></script><script src="../../siteinfo.js"></script><script src="../../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../../assets/themeswap.js"></script><link href="../../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../../"><img src="../../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a 
href="../../">Metal.jl</a></span></div><form class="docs-search" action="../../search/"><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../../usage/overview/">Overview</a></li><li><a class="tocitem" href="../../usage/array/">Array programming</a></li><li><a class="tocitem" href="../../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../../api/essentials/">Essentials</a></li><li><a class="tocitem" href="../../api/compiler/">Compiler</a></li><li><a class="tocitem" href="../../api/kernel/">Kernel programming</a></li><li><a class="tocitem" href="../../api/array/">Array programming</a></li><li><a class="tocitem" href="../../api/mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li class="is-active"><a class="tocitem" href>Frequently Asked Questions</a><ul class="internal"><li><a class="tocitem" href="#Can-you-wrap-this-Metal-API?"><span>Can you wrap this Metal API?</span></a></li></ul></li><li><a class="tocitem" href="../contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li><a class="is-disabled">FAQ</a></li><li class="is-active"><a href>Frequently Asked Questions</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Frequently Asked 
Questions</a></li></ul></nav><div class="docs-right"><a class="docs-edit-link" href="https://github.com/JuliaGPU/Metal.jl/blob/main/docs/src/faq/faq.md#" title="Edit on GitHub"><span class="docs-icon fab"></span><span class="docs-label is-hidden-touch">Edit on GitHub</span></a><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article class="content" id="documenter-page"><h1 id="Frequently-Asked-Questions"><a class="docs-heading-anchor" href="#Frequently-Asked-Questions">Frequently Asked Questions</a><a id="Frequently-Asked-Questions-1"></a><a class="docs-heading-anchor-permalink" href="#Frequently-Asked-Questions" title="Permalink"></a></h1><h2 id="Can-you-wrap-this-Metal-API?"><a class="docs-heading-anchor" href="#Can-you-wrap-this-Metal-API?">Can you wrap this Metal API?</a><a id="Can-you-wrap-this-Metal-API?-1"></a><a class="docs-heading-anchor-permalink" href="#Can-you-wrap-this-Metal-API?" title="Permalink"></a></h2><p>Most likely. 
Any help on designing or implementing high-level wrappers for MSL&#39;s low-level functionality is greatly appreciated, so please consider <a href="../contributing/">contributing</a> your uses of these APIs on the respective repositories.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../../api/mps/">« Metal Performance Shaders</a><a class="docs-footer-nextpage" href="../contributing/">Contributing »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/index.html b/dev/index.html
index fe5eaf24c..ea5651ca3 100644
--- a/dev/index.html
+++ b/dev/index.html
@@ -6,4 +6,4 @@
 # smoke test
 using Metal
 Metal.versioninfo()</code></pre><p>If you want to ensure everything works as expected, you can execute the test suite.</p><pre><code class="language-julia hljs">using Pkg
-Pkg.test(&quot;Metal&quot;)</code></pre><p>The following resources may also be of interest (although they are mainly focused on the CUDA GPU backend):</p><ul><li>Effectively using GPUs with Julia: <a href="https://docs.google.com/presentation/d/1l-BuAtyKgoVYakJSijaSqaTL3friESDyTOnU2OLqGoA/">slides</a></li><li>How Julia is compiled to GPUs: <a href="https://www.youtube.com/watch?v=Fz-ogmASMAE">video</a></li></ul><h2 id="Contributing"><a class="docs-heading-anchor" href="#Contributing">Contributing</a><a id="Contributing-1"></a><a class="docs-heading-anchor-permalink" href="#Contributing" title="Permalink"></a></h2><p>If you want to help improve this package, look at <a href="faq/contributing/">the contributing page</a> for more details.</p><h2 id="Acknowledgements"><a class="docs-heading-anchor" href="#Acknowledgements">Acknowledgements</a><a id="Acknowledgements-1"></a><a class="docs-heading-anchor-permalink" href="#Acknowledgements" title="Permalink"></a></h2><p>The Julia Metal stack has been a collaborative effort by many individuals. Significant contributions have been made by the following individuals:</p><ul><li>Tim Besard (@maleadt) (lead developer)</li><li>Filippo Vicentini (@PhilipVinc)</li><li>Max Hawkins (@max-Hawkins)</li></ul><h2 id="Supporting-and-Citing"><a class="docs-heading-anchor" href="#Supporting-and-Citing">Supporting and Citing</a><a id="Supporting-and-Citing-1"></a><a class="docs-heading-anchor-permalink" href="#Supporting-and-Citing" title="Permalink"></a></h2><p>Some of the software in this ecosystem was developed as part of academic research. If you would like to help support it, please star the repository as such metrics may help us secure funding in the future. If you use our software as part of your research, teaching, or other activities, we would be grateful if you could cite our work. 
The <a href="https://github.com/JuliaGPU/Metal.jl/blob/main/CITATION.cff">CITATION.cff</a> file in the root of this repository lists the relevant papers.</p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="usage/overview/">Overview »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+Pkg.test(&quot;Metal&quot;)</code></pre><p>The following resources may also be of interest (although they are mainly focused on the CUDA GPU backend):</p><ul><li>Effectively using GPUs with Julia: <a href="https://docs.google.com/presentation/d/1l-BuAtyKgoVYakJSijaSqaTL3friESDyTOnU2OLqGoA/">slides</a></li><li>How Julia is compiled to GPUs: <a href="https://www.youtube.com/watch?v=Fz-ogmASMAE">video</a></li></ul><h2 id="Contributing"><a class="docs-heading-anchor" href="#Contributing">Contributing</a><a id="Contributing-1"></a><a class="docs-heading-anchor-permalink" href="#Contributing" title="Permalink"></a></h2><p>If you want to help improve this package, look at <a href="faq/contributing/">the contributing page</a> for more details.</p><h2 id="Acknowledgements"><a class="docs-heading-anchor" href="#Acknowledgements">Acknowledgements</a><a id="Acknowledgements-1"></a><a class="docs-heading-anchor-permalink" href="#Acknowledgements" title="Permalink"></a></h2><p>The Julia Metal stack has been a collaborative effort by many individuals. Significant contributions have been made by the following individuals:</p><ul><li>Tim Besard (@maleadt) (lead developer)</li><li>Filippo Vicentini (@PhilipVinc)</li><li>Max Hawkins (@max-Hawkins)</li></ul><h2 id="Supporting-and-Citing"><a class="docs-heading-anchor" href="#Supporting-and-Citing">Supporting and Citing</a><a id="Supporting-and-Citing-1"></a><a class="docs-heading-anchor-permalink" href="#Supporting-and-Citing" title="Permalink"></a></h2><p>Some of the software in this ecosystem was developed as part of academic research. If you would like to help support it, please star the repository as such metrics may help us secure funding in the future. If you use our software as part of your research, teaching, or other activities, we would be grateful if you could cite our work. 
The <a href="https://github.com/JuliaGPU/Metal.jl/blob/main/CITATION.cff">CITATION.cff</a> file in the root of this repository lists the relevant papers.</p></article><nav class="docs-footer"><a class="docs-footer-nextpage" href="usage/overview/">Overview »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/profiling/index.html b/dev/profiling/index.html
index 48c3f3c31..7a16b5e3d 100644
--- a/dev/profiling/index.html
+++ b/dev/profiling/index.html
@@ -15,4 +15,4 @@
 ... Metal GPU Frame Capture Enabled
 
 julia&gt; Metal.@profile @metal threads=length(c) vadd(a, b, c);
-[ Info: GPU frame capture saved to /var/folders/x3/75r5z4sd2_bdwqs68_nfnxw40000gn/T/jl_WzKxYVMlon/jl_metal.gputrace/</code></pre><p>To view these GPU traces, though, Xcode needs to be installed, and its install size is quite significant.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../usage/kernel/">« Kernel programming</a><a class="docs-footer-nextpage" href="../api/essentials/">Essentials »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+[ Info: GPU frame capture saved to /var/folders/x3/75r5z4sd2_bdwqs68_nfnxw40000gn/T/jl_WzKxYVMlon/jl_metal.gputrace/</code></pre><p>To view these GPU traces, though, Xcode needs to be installed, and its install size is quite significant.</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../usage/kernel/">« Kernel programming</a><a class="docs-footer-nextpage" href="../api/essentials/">Essentials »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/search/index.html b/dev/search/index.html
index 0e147282b..a7c8fa379 100644
--- a/dev/search/index.html
+++ b/dev/search/index.html
@@ -1,2 +1,2 @@
 <!DOCTYPE html>
-<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Search · Metal.jl</title><script data-outdated-warner src="../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/search/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script><link href="../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">Metal.jl</a></span></div><form class="docs-search" action><input 
class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../usage/overview/">Overview</a></li><li><a class="tocitem" href="../usage/array/">Array programming</a></li><li><a class="tocitem" href="../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../api/essentials/">Essentials</a></li><li><a class="tocitem" href="../api/compiler/">Compiler</a></li><li><a class="tocitem" href="../api/kernel/">Kernel programming</a></li><li><a class="tocitem" href="../api/array/">Array programming</a></li><li><a class="tocitem" href="../api/mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Search</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Search</a></li></ul></nav><div class="docs-right"><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article><p id="documenter-search-info">Loading search...</p><ul 
id="documenter-search-results"></ul></article><nav class="docs-footer"><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body><script src="../search_index.js"></script><script src="../assets/search.js"></script></html>
+<html lang="en"><head><meta charset="UTF-8"/><meta name="viewport" content="width=device-width, initial-scale=1.0"/><title>Search · Metal.jl</title><script data-outdated-warner src="../assets/warner.js"></script><link rel="canonical" href="https://metal.juliagpu.org/stable/search/"/><link href="https://cdnjs.cloudflare.com/ajax/libs/lato-font/3.0.0/css/lato-font.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/juliamono/0.045/juliamono.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/fontawesome.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/solid.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/brands.min.css" rel="stylesheet" type="text/css"/><link href="https://cdnjs.cloudflare.com/ajax/libs/KaTeX/0.13.24/katex.min.css" rel="stylesheet" type="text/css"/><script>documenterBaseURL=".."</script><script src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.6/require.min.js" data-main="../assets/documenter.js"></script><script src="../siteinfo.js"></script><script src="../../versions.js"></script><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-dark.css" data-theme-name="documenter-dark" data-theme-primary-dark/><link class="docs-theme-link" rel="stylesheet" type="text/css" href="../assets/themes/documenter-light.css" data-theme-name="documenter-light" data-theme-primary/><script src="../assets/themeswap.js"></script><link href="../assets/favicon.ico" rel="icon" type="image/x-icon"/></head><body><div id="documenter"><nav class="docs-sidebar"><a class="docs-logo" href="../"><img src="../assets/logo.png" alt="Metal.jl logo"/></a><div class="docs-package-name"><span class="docs-autofit"><a href="../">Metal.jl</a></span></div><form class="docs-search" action><input class="docs-search-query" id="documenter-search-query" name="q" type="text" placeholder="Search docs"/></form><ul class="docs-menu"><li><a class="tocitem" href="../">Home</a></li><li><span class="tocitem">Usage</span><ul><li><a class="tocitem" href="../usage/overview/">Overview</a></li><li><a class="tocitem" href="../usage/array/">Array programming</a></li><li><a class="tocitem" href="../usage/kernel/">Kernel programming</a></li></ul></li><li><a class="tocitem" href="../profiling/">Profiling</a></li><li><span class="tocitem">API reference</span><ul><li><a class="tocitem" href="../api/essentials/">Essentials</a></li><li><a class="tocitem" href="../api/compiler/">Compiler</a></li><li><a class="tocitem" href="../api/kernel/">Kernel programming</a></li><li><a class="tocitem" href="../api/array/">Array programming</a></li><li><a class="tocitem" href="../api/mps/">Metal Performance Shaders</a></li></ul></li><li><span class="tocitem">FAQ</span><ul><li><a class="tocitem" href="../faq/faq/">Frequently Asked Questions</a></li><li><a class="tocitem" href="../faq/contributing/">Contributing</a></li></ul></li></ul><div class="docs-version-selector field has-addons"><div class="control"><span class="docs-label button is-static is-size-7">Version</span></div><div class="docs-selector control is-expanded"><div class="select is-fullwidth is-size-7"><select id="documenter-version-selector"></select></div></div></div></nav><div class="docs-main"><header class="docs-navbar"><nav class="breadcrumb"><ul class="is-hidden-mobile"><li class="is-active"><a href>Search</a></li></ul><ul class="is-hidden-tablet"><li class="is-active"><a href>Search</a></li></ul></nav><div class="docs-right"><a class="docs-settings-button fas fa-cog" id="documenter-settings-button" href="#" title="Settings"></a><a class="docs-sidebar-button fa fa-bars is-hidden-desktop" id="documenter-sidebar-button" href="#"></a></div></header><article><p id="documenter-search-info">Loading search...</p><ul id="documenter-search-results"></ul></article><nav class="docs-footer"><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body><script src="../search_index.js"></script><script src="../assets/search.js"></script></html>
diff --git a/dev/usage/array/index.html b/dev/usage/array/index.html
index 3d63687e7..e579efc6c 100644
--- a/dev/usage/array/index.html
+++ b/dev/usage/array/index.html
@@ -50,4 +50,4 @@
 
 julia&gt; Base.mapreducedim!(identity, +, b, a)
 1×1 MtlMatrix{Float32, Metal.MTL.MTLResourceStorageModePrivate}:
- 6.0</code></pre></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../overview/">« Overview</a><a class="docs-footer-nextpage" href="../kernel/">Kernel programming »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+ 6.0</code></pre></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../overview/">« Overview</a><a class="docs-footer-nextpage" href="../kernel/">Kernel programming »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/usage/kernel/index.html b/dev/usage/kernel/index.html
index 5885f7502..228c3e6fc 100644
--- a/dev/usage/kernel/index.html
+++ b/dev/usage/kernel/index.html
@@ -4,4 +4,4 @@
     c[i] = a[i] + b[i]
     return
 end</code></pre><p>This kernel takes in three vectors (a,b,c) all of the same length and stores the element-wise sum of <code>a</code> and <code>b</code> into <code>c</code>. Each thread in this kernel gets its unique position in the grid (arrangement of all threadgroups) and stores this value into the variable <code>i</code> which is then used as the index into the vectors. Thus, each thread is computing one sum and storing the result in the output vector.</p><p>To ensure this kernel functions properly, we have to launch it with exactly as many threads as the length of the vectors. If we under or over-launch threads, the result could be incorrect.</p><p>An example of a good launch:</p><pre><code class="language-julia hljs">len = prod(size(d_a))
-@metal threads=len vadd(d_a, d_b, d_c)</code></pre><p>Additional notes:</p><ul><li>Kernels must always return nothing</li><li>Kernels are asynchronous. To synchronize, use the <code>Metal.@sync</code> macro.</li></ul><h2 id="Other-Helpful-Links"><a class="docs-heading-anchor" href="#Other-Helpful-Links">Other Helpful Links</a><a id="Other-Helpful-Links-1"></a><a class="docs-heading-anchor-permalink" href="#Other-Helpful-Links" title="Permalink"></a></h2><p><a href="https://developer.apple.com/metal/Metal-Shading-Language-Specification.pdf">Metal Shading Language Specification</a> <a href="https://wiki.illinois.edu/wiki/display/ECE408/Materials+from+prior+semesters">An Introduction to GPU Programming course from University of Illinois</a> (primarily in CUDA, but the concepts are transferable)</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../array/">« Array programming</a><a class="docs-footer-nextpage" href="../../profiling/">Profiling »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+@metal threads=len vadd(d_a, d_b, d_c)</code></pre><p>Additional notes:</p><ul><li>Kernels must always return nothing</li><li>Kernels are asynchronous. To synchronize, use the <code>Metal.@sync</code> macro.</li></ul><h2 id="Other-Helpful-Links"><a class="docs-heading-anchor" href="#Other-Helpful-Links">Other Helpful Links</a><a id="Other-Helpful-Links-1"></a><a class="docs-heading-anchor-permalink" href="#Other-Helpful-Links" title="Permalink"></a></h2><p><a href="https://developer.apple.com/metal/Metal-Shading-Language-Specification.pdf">Metal Shading Language Specification</a> <a href="https://wiki.illinois.edu/wiki/display/ECE408/Materials+from+prior+semesters">An Introduction to GPU Programming course from University of Illinois</a> (primarily in CUDA, but the concepts are transferable)</p></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../array/">« Array programming</a><a class="docs-footer-nextpage" href="../../profiling/">Profiling »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
diff --git a/dev/usage/overview/index.html b/dev/usage/overview/index.html
index 035ea05c6..65ff9fefc 100644
--- a/dev/usage/overview/index.html
+++ b/dev/usage/overview/index.html
@@ -9,4 +9,4 @@
 # automatic memory management
 a = nothing</code></pre><p>Beyond memory management, there are a whole range of array operations to process your data. This includes several higher-order operations that take other code as arguments, such as <code>map</code>, <code>reduce</code> or <code>broadcast</code>. With these, it is possible to perform kernel-like operations without actually writing your own GPU kernels:</p><pre><code class="language-julia hljs">a = Metal.zeros(1024)
 b = Metal.ones(1024)
-a.^2 .+ sin.(b)</code></pre></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../../">« Home</a><a class="docs-footer-nextpage" href="../array/">Array programming »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Monday 30 October 2023 04:03">Monday 30 October 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>
+a.^2 .+ sin.(b)</code></pre></article><nav class="docs-footer"><a class="docs-footer-prevpage" href="../../">« Home</a><a class="docs-footer-nextpage" href="../array/">Array programming »</a><div class="flexbox-break"></div><p class="footer-message">Powered by <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <a href="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><div class="modal" id="documenter-settings"><div class="modal-background"></div><div class="modal-card"><header class="modal-card-head"><p class="modal-card-title">Settings</p><button class="delete"></button></header><section class="modal-card-body"><p><label class="label">Theme</label><div class="select"><select id="documenter-themepicker"><option value="documenter-light">documenter-light</option><option value="documenter-dark">documenter-dark</option></select></div></p><hr/><p>This document was generated with <a href="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 0.27.23 on <span class="colophon-date" title="Saturday 25 November 2023 03:16">Saturday 25 November 2023</span>. Using Julia version 1.8.5.</p></section><footer class="modal-card-foot"></footer></div></div></div></body></html>