From 0a6e82dd48d858b41601fccf13d259a691f83b50 Mon Sep 17 00:00:00 2001
From: mdingemanse
Date: Wed, 26 Jun 2024 19:34:29 +0000
Subject: [PATCH] Apply automatic changes

---
 docs/df.csv      | 2 +-
 docs/figure.html | 4 ++--
 docs/index.html  | 5 +++--
 3 files changed, 6 insertions(+), 5 deletions(-)

diff --git a/docs/df.csv b/docs/df.csv
index fcf09d3..69e1944 100644
--- a/docs/df.csv
+++ b/docs/df.csv
@@ -21,7 +21,7 @@ https://huggingface.co/BramVanroy/GEITje-7B-ultra,Dutch instruction-tuned model
https://huggingface.co/microsoft/Phi-3-mini-128k-instruct,,Phi3,Unspecified,MIT License,Microsoft,https://huggingface.co/microsoft/Phi-3-mini-128k-instruct,,closed,,"No source code found for pretraining, posttraining, or evaluation",closed,,No datasets made available and no information on datasets disclosed except very generic claims about filtering for high quality.,closed,,No base model of the instruction-tuned Phi-3 was released,closed,,No post-training datasets made available and no information on datasets disclosed except very generic claims about filtering for high quality.,open,https://huggingface.co/microsoft/Phi-3-mini-128k-instruct/tree/main,Instruction-tuned model weights shared through HuggingFace,open,https://huggingface.co/microsoft/Phi-3-mini-128k-instruct/blob/main/LICENSE,MIT License,closed,,"No source code, so no documentation of source code found",open,https://arxiv.org/abs/2404.14219,Architecture described in model card and preprint,partial,https://arxiv.org/abs/2404.14219,"Preprint describes model architecture but not training data, focusing mostly on benchmarks and evaluation",closed,,No paper found,open,https://huggingface.co/microsoft/Phi-3-mini-128k-instruct/blob/main/LICENSE,Model card provides reasonable level of detail,closed,,No datasheet made available,partial,,Available through development version of transformers,open,,Available through HuggingFace API,/projects/phi-3-instruct.yaml,6.0
https://huggingface.co/WizardLM/WizardLM-13B-V1.2,Empowering Large Pre-Trained Language Models to Follow Complex Instructions,LLaMA2-13B,Evol-Instruct (synthetic),CC-BY-NC-4.0,Microsoft & Peking University,https://github.com/nlpxucan,,partial,https://github.com/nlpxucan/WizardLM/tree/main/WizardLM,Fast-evolving repository contains WizardLM code,closed,https://github.com/opening-up-chatgpt/opening-up-chatgpt.github.io/blob/main/projects/llama-2-chat.yaml,"Based on LLaMA2, which is claimed to be public but nowhere exactly documented.",partial,https://ai.meta.com/resources/models-and-libraries/llama-downloads/,"Based on LLaMA2 weights, which are made conditionally available by Meta.",open,https://huggingface.co/datasets/WizardLM/WizardLM_evol_instruct_V2_196k,The Evol-Instruct V2 dataset contains 196k instruction-following sequences generated from Evol-Instruct,open,https://huggingface.co/WizardLM/WizardLM-13B-V1.2,Model weights offered in HuggingFace repository,partial,https://github.com/nlpxucan/WizardLM/blob/main/WizardLM/MODEL_DIFF_LICENSE,"Restricted for academic research purposes only. 
Code and Model diff release under CC-BY-NC-4.0, software code under Apache 2.0",partial,https://github.com/nlpxucan/WizardLM/tree/main/WizardLM,"Code is only partially documented, not clearly versioned, and appears to be in flux.",open,https://arxiv.org/abs/2304.12244,Architecture described in preprint and partly accessible in code repository,open,https://arxiv.org/abs/2304.12244,Preprint describes method for creating large amounts of LLM-based synthetic RLHF data and fine-tuning WizardLM based on it,closed,,No peer-reviewed paper or data audit found,closed,https://huggingface.co/WizardLM/WizardLM-13B-V1.2,Model card is only a placeholder and generates an error (missing yaml metadata),closed,https://huggingface.co/datasets/WizardLM/WizardLM_evol_instruct_V2_196k,Dataset card for Evol-Instruct generates an error,closed,,No package available,closed,,No API available,/projects/wizardlm-13B.yaml,6.0
https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1,,Llama2,Airoboros (synthetic),Purposely left ambiguous,Jon Durbin,https://github.com/jondurbin,Only active on GitHub since May 2023,partial,https://gist.github.com/jondurbin/87fc040b92a3073125ed516b04bc6e19,Repo exists for RL data but only a gist exists for model training and architecture,closed,,Llama2 training data is nowhere documented or disclosed,partial,,"Llama2, made conditionally available by Meta",open,https://github.com/jondurbin/airoboros,"Airoboros, an implementation of the Self-Instruct paper",open,https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1/tree/main,Made available through HuggingFace,partial,https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1#licence-and-usage-restrictions,Licensing left ambiguous because of murky status of OpenAI-derived Self-Instruct data,partial,,What little code is available is not very systematically documented,partial,https://huggingface.co/jondurbin/airoboros-l2-70b-gpt4-1.4.1/discussions/2#64c29e4c617b36543dedac9a,Some info can be gleaned at link but most remains undocumented,closed,,No preprint found,closed,,No peer-reviewed paper found,partial,https://huggingface.co/jondurbin/airoboros-65b-gpt4-1.4,Instructs reader to look up model card for prior 65B Llama1 version,partial,https://huggingface.co/datasets/jondurbin/airoboros-gpt4-1.4.1,Datasheet for RL data only,closed,,No package found,closed,,No API found,/projects/airoboros.yaml,5.5
-https://github.com/THUDM/ChatGLM-6B/blob/main/README_en.md,"From the readme, ""ChatGLM-6B uses technology similar to ChatGPT, optimized for Chinese QA and dialogue. The model is trained for about 1T tokens of Chinese and English corpus, supplemented by supervised fine-tuning, feedback bootstrap, and reinforcement learning with human feedback. 
With only about 6.2 billion parameters, the model is able to generate answers that are in line with human preference.""",GLM (own),Unspecified,Apache 2.0,THUDM,https://github.com/THUDM,Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University,partial,https://github.com/THUDM/ChatGLM-6B/blob/main/README_en.md#deployment,Some code made available on Github,partial,http://doi.org/10.18653/v1/2022.acl-long.26,"Training data not centrally made available, but described in 2022 ACL paper, appears to be mostly public datasets",open,https://huggingface.co/THUDM/chatglm-6b/tree/main,Model made available through HuggingFace,closed,,"docs mention ""supervised fine-tuning, feedback bootstrap, and reinforcement learning with human feedback"", but none of the datasets used are clearly specified.",closed,,No weights or checkpoints corresponding to the delta of the LLM vs RLHF provided,open,https://github.com/THUDM/ChatGLM-6B/blob/main/LICENSE,Apache 2.0,partial,https://github.com/THUDM/ChatGLM-6B/blob/main/ptuning/README_en.md,"Some documentation available, but a lot of code is not commented or explained.",partial,,Full details architecture not specified in a single place,closed,,,partial,https://aclanthology.org/2022.acl-long.26/,"ACL 2022 paper describes the training of the GLM base model, but the RLHF portion is more recent (there is also a related ICLR paper for a newer generation https://openreview.net/forum?id=-Aw0rrrPUF)",closed,https://huggingface.co/THUDM/chatglm-6b,No modelcard; the HuggingFace modelcard spot is used just as the homepage for the model.,closed,,No datasheet,closed,,No package,open,https://github.com/THUDM/ChatGLM-6B/blob/main/README_en.md#api-deployment,API provided through fastapi uvicorn,/projects/ChatGLM-6B.yaml,5.5
+https://github.com/THUDM/ChatGLM-6B/blob/main/README_en.md,"From the readme, ""ChatGLM-6B uses technology similar to ChatGPT, optimized for Chinese QA and dialogue. The model is trained for about 1T tokens of Chinese and English corpus, supplemented by supervised fine-tuning, feedback bootstrap, and reinforcement learning with human feedback. 
With only about 6.2 billion parameters, the model is able to generate answers that are in line with human preference.""",GLM (own),Unspecified,Apache 2.0,THUDM,https://github.com/THUDM,Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University,partial,https://github.com/THUDM/ChatGLM-6B/blob/main/README_en.md#deployment,Some code made available on Github,partial,http://doi.org/10.18653/v1/2022.acl-long.26,"Training data not centrally made available, but described in 2022 ACL paper, appears to be mostly public datasets",open,https://huggingface.co/THUDM/chatglm-6b/tree/main,Model made available through HuggingFace,closed,,"docs mention ""supervised fine-tuning, feedback bootstrap, and reinforcement learning with human feedback"", but none of the datasets used are clearly specified.",closed,,No weights or checkpoints corresponding to the delta of the LLM vs RLHF provided,open,https://github.com/THUDM/ChatGLM-6B/blob/main/LICENSE,Apache 2.0,partial,https://github.com/THUDM/ChatGLM-6B/blob/main/ptuning/README_en.md,"Some documentation available, but a lot of code is not commented or explained.",partial,,Full details of architecture not specified in a single place,closed,,,partial,https://aclanthology.org/2022.acl-long.26/,"ACL 2022 paper describes the training of the GLM base model, but the RLHF portion is more recent (there is also a related ICLR paper for a newer generation https://openreview.net/forum?id=-Aw0rrrPUF)",closed,https://huggingface.co/THUDM/chatglm-6b,No modelcard; the HuggingFace modelcard spot is used just as the homepage for the model.,closed,,No datasheet,closed,,No package,open,https://github.com/THUDM/ChatGLM-6B/blob/main/README_en.md#api-deployment,API provided through fastapi uvicorn,/projects/ChatGLM-6B.yaml,5.5
https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1,,unclear,unspecified,Apache 2.0,Mistral AI,https://mistral.ai/,,partial,https://github.com/mistralai/mistral-src,repository provides 'minimal code to run our 7B model',closed,,No information provided on pretraining data,open,https://github.com/mistralai/mistral-src#download-the-model,Base LLM model made available for download,closed,https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1,No information provided except that instruction tuning is done using an unspecified 'variety of publicly available conversation datasets',partial,https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1/tree/main,Instruct version of the model made available but no information on fine-tuning procedure provided,open,https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1/blob/main/README.md,Apache 2.0,closed,https://github.com/mistralai/mistral-src,the little code that is available is uncommented and undocumented,partial,https://github.com/mistralai/mistral-src,Some information on architecture provided in github repo,partial,http://arxiv.org/abs/2310.06825,"Preprint rehashes marketing blurbs also given in blog and provides no details about pretraining datasets, instruction tuning datasets, or fine-tuning process, hence partial.",closed,,No peer reviewed paper available,closed,,"No model card available, HuggingFace modelcard just points to a corporate blog post",closed,,No datasheet available,partial,https://docs.mistral.ai/quickstart/,Docker image shared on github,open,https://docs.mistral.ai/api,API specification provided by vLLM,/projects/mistral-7B.yaml,5.5
https://github.com/nlpxucan/WizardLM,Empowering Large Pre-Trained Language Models to Follow Complex Instructions,LLaMA-7B,Evol-Instruct 
(synthetic),CC-BY-NC-4.0,Microsoft & Peking University,https://github.com/nlpxucan,,partial,https://github.com/nlpxucan/WizardLM/tree/main/WizardLM,Fast-evolving repository contains WizardLM code,partial,,"Based on LLaMA, which is claimed to be public but nowhere exactly documented.",closed,,"Based on LLaMA weights, which are not openly available though a leaked version is in wide circulation.",open,https://github.com/nlpxucan/WizardLM/tree/main/WizardLM#training-data,The Evol-Instruct dataset contains 70k instruction-following sequences generated from Evol-Instruct,partial,https://huggingface.co/WizardLM/WizardLM-7B-V1.0/tree/main,Model weights offered as a delta to LLaMA,partial,https://github.com/nlpxucan/WizardLM/blob/main/WizardLM/MODEL_DIFF_LICENSE,"Restricted for academic research purposes only. Code and Model diff release under CC-BY-NC-4.0, software code under Apache 2.0",partial,https://github.com/nlpxucan/WizardLM/tree/main/WizardLM,"Code is only partially documented, not clearly versioned, and appears to be in flux.",open,https://arxiv.org/abs/2304.12244,Architecture described in preprint and partly accessible in code repository,open,https://arxiv.org/abs/2304.12244,Preprint describes method for creating large amounts of LLM-based synthetic RLHF data and fine-tuning WizardLM based on it,closed,,No peer-reviewed paper or data audit found,closed,https://huggingface.co/WizardLM/WizardLM-7B-V1.0,Model card is only a placeholder and generates an error (missing yaml metadata),closed,https://huggingface.co/datasets/WizardLM/WizardLM_evol_instruct_V2_196k,Dataset card for Evol-Instruct generates an error,closed,,No package available,closed,,No API available,/projects/wizardlm-7B-V1.yaml,5.5
https://qwenlm.github.io/blog/qwen1.5/,"This is based on the 72B version, the largest of 8 available model sizes.",QwenLM,Unspecified,Qianwen License,Alibaba Cloud,,Qwen (abbr. for Tongyi Qianwen 通义千问) refers to the large language model family built by Alibaba Cloud,partial,https://github.com/QwenLM/Qwen1.5/,Repository provides sparse source code and some examples for SFT,closed,,Pretraining data not specified or documented.,open,https://huggingface.co/Qwen/Qwen1.5-72B/tree/main,Also available in smaller model sizes,closed,https://qwen.readthedocs.io/en/latest/training/SFT/llama_factory.html,Data not specified or documented. Some example code in repo provides directions but no details.,open,https://huggingface.co/Qwen/Qwen1.5-72B-Chat/tree/main,Also available in smaller model sizes,closed,,Qianwen License,partial,,Repository is fairly well-documented.,partial,,No clear description of architecture found.,closed,,No preprint found.,closed,,No peer-reviewed paper found.,closed,,"Model card on HF only serves as a pointer to the model, no actual info provided.",closed,,No datasheet.,partial,,No specific package provided but integrates well with many widely used packages,open,,Available through various APIs,/projects/qwen-1.5-chat.yaml,5.0
diff --git a/docs/figure.html b/docs/figure.html
index 29b16db..9c26df2 100644
--- a/docs/figure.html
+++ b/docs/figure.html
@@ -40,7 +40,7 @@

Open GenAI: LLMs (simplified table)

Phi 3 Instruct✔︎✔︎✔︎~✔︎~✔︎
WizardLM 13B v1.2~~✔︎✔︎~~✔︎✔︎
Airoboros L2 70B GPT4~~✔︎✔︎~~~~~
-ChatGLM-6B~~✔︎✔︎~~~✔︎
+ChatGLM-6B~~✔︎✔︎~~~✔︎
Mistral 7B-Instruct~✔︎~✔︎~~~✔︎
WizardLM-7B~~✔︎~~~✔︎✔︎
Qwen 1.5~✔︎✔︎~~~✔︎
@@ -68,7 +68,7 @@

Open GenAI: LLMs (simplified table)

How to use this table. Every cell records a three-level openness judgement (✔︎ open, ~ partial, or closed) with a direct link to the available evidence; on hover, the cell will display the notes we have on file for that judgement. The name of each project is a direct link to source data. The table is sorted by cumulative openness, where ✔︎ counts as 1, ~ as 0.5, and closed as 0 points. Note that RL may refer to RLHF or other forms of fine-tuning aimed at fostering instruction-following behaviour.
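For reference, the cumulative openness score described above can be recomputed directly from docs/df.csv. The sketch below is illustrative only and not part of the repository; it assumes the column layout visible in the rows of this diff (eight metadata fields, then fourteen judgement/link/notes triplets, then the project yaml path and the published score), which may not hold for every revision of the file.

    # Minimal sketch: recompute cumulative openness (open = 1, partial = 0.5, closed = 0).
    # Assumed layout, inferred from this diff: columns 0-7 are metadata, judgement/link/notes
    # triplets start at column 8, and the last two columns are the yaml path and the score.
    import csv

    WEIGHTS = {"open": 1.0, "partial": 0.5, "closed": 0.0}

    def cumulative_openness(row):
        judgements = row[8:-2:3]  # every third column from 8 up to the yaml path
        return sum(WEIGHTS.get(j.strip(), 0.0) for j in judgements)

    with open("docs/df.csv", newline="", encoding="utf-8") as f:
        for row in csv.reader(f):
            if not row or not row[0].startswith("http"):
                continue  # skip a possible header row or blank lines
            print(row[0], cumulative_openness(row))

Under these assumptions the Phi 3 Instruct row in the df.csv hunk above (5 open, 2 partial, 7 closed) works out to 6.0, matching the score recorded in its final column.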

diff --git a/docs/index.html b/docs/index.html
index 1aa089b..0f93c1f 100644
--- a/docs/index.html
+++ b/docs/index.html
@@ -64,7 +64,7 @@

Microsoft & Peking UniversityLLM base: LLaMA2-13BRL base: Evol-Instruct (synthetic)6.0
Airoboros L2 70B GPT4~~✔︎✔︎~~~~~
Jon DurbinLLM base: Llama2RL base: Airoboros (synthetic)5.5
-ChatGLM-6B~~✔︎✔︎~~~✔︎
+ChatGLM-6B~~✔︎✔︎~~~✔︎
THUDMLLM base: GLM (own)RL base: Unspecified5.5
Mistral 7B-Instruct~✔︎~✔︎~~~✔︎
Mistral AILLM base: unclearRL base: unspecified5.5
@@ -134,13 +134,14 @@

TL;DR

We conclude as follows:

Openness is not the full solution to the scientific and ethical challenges of conversational text generators. Open data will not mitigate the harmful consequences of thoughtless deployment of large language models, nor the questionable copyright implications of scraping all publicly available data from the internet. However, openness does make original research possible, including efforts to build reproducible workflows and understand the fundamentals of instruction-tuned LLM architectures. Openness also enables checks and balances, fostering a culture of accountability for data and its curation, and for models and their deployment. We hope that our work provides a small step in this direction.
+

Papers

Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Tracking Openness, Transparency, and Accountability in Instruction-Tuned Text Generators.” In CUI '23: Proceedings of the 5th International Conference on Conversational User Interfaces. July 19-21, Eindhoven. doi: 10.1145/3571884.3604316 (PDF).

Andreas Liesenfeld and Mark Dingemanse. 2024. Rethinking open source generative AI: open washing and the EU AI Act. In The 2024 ACM Conference on Fairness, Accountability, and Transparency (FAccT '24). Association for Computing Machinery, New York, NY, USA, 1774–1787. doi: 10.1145/3630106.3659005