This script generates documentation for the wandb
SDK library.
For help with this function, run
python generate.py --help
It uses a more generic documentation generation tool,
docugen
,
based on the
TensorFlow documentation generator.
- Run
pip install --upgrade git+git://github.com/wandb/wandb.git@<commit_id>
to install the version ofwandb
you wish to document. - Run
python generate.py --commit_id <commit_id>
to create the documentation. Make sure to use a full hash or tag -- it's used to create URLs. - Move the generated documentation into a local copy
of the Docs repository (under
docs/ref
).
- A folder named
ref
. The files in theref
folder are the generated markdown. Use the--output_dir
option to change where this folder is saved; by default it is in the working directory. If theoutput_dir
already contains a folder namedref
, any contents inside thecli
orpython
directory will be over-written. - A
SUMMARY.md
file for creating a GitBook sidebar that indexes the automatically-generated docs. This is based on a provided--template_file
(by default,_SUMMARY.md
from this repo).
python generate.py \
--template_file _SUMMARY.md \
--commit_id v0.15.6 \
--output_dir path/to/gitbook
Run pip install -r docugen/requirements.txt
from root to install requirements.
Behavior is configured by the config.ini
file,
which is loaded using ConfigParser
.
To extend the footprint of the library that is documented, this file must be edited.
There's a rough dummy/documentary example in config.ini
, the EXAMPLE_SUBCONFIG
.
If the newly-documented components are in the top level of a module that is already being documented,
e.g. if a method is added to the top-level wandb
or if a new data_type
is added to wandb.data_types
,
then add the elements
to document to either elements
or add-elements
in the proper SUBCONFIG
.
For example, if we added launch
as a method available at wandb.launch
and wanted to document it
at the top level (alongside e.g. wandb.watch
), we'd add it to the elements
of the WANDB_CORE
subconfig.
As another example, if we added PanopticSegmentation
to the wandb.data_types
submodule,
but didn't make it available at the top level, we'd add it to the add-elements
section of the WANDB_DATATYPES
subconfig.
Editing on top of the state at commit 7ab1d97
, the subconfig sections of config.ini
would look like:
[WANDB_CORE]
# main python SDK library
dirname=python
title=Python Library
slug=wandb.
elements=Artifact,agent,config,controller,finish,init,launch,log,save,summary,sweep,watch,__version__
add-from=wandb_sdk.wandb_run
add-elements=Run
module-doc-from=self
[WANDB_DATATYPES]
# data types submodule, including media and tables
dirname=data-types
title=Data Types
slug=wandb.data\_types.
elements=Graph,Image,Plotly,Video,Audio,Table,Html,Object3D,Molecule,Histogram
add-from=data_types
add-elements=ImageMask,BoundingBoxes2D,PanopticSegmentation
module-doc-from=data_types
If the newly-documented components are not in the top level of a module that is already being documented, then you'll need to add a new section to the reference docs (though it'd be easier just export them at the top level and add them to an existing section!).
You'll need to
- Add a new subconfig, a la
WANDB_DATATYPES
- Populate that with all the required tags (see
EXAMPLE_SUBCONFIG
) - Add that subconfig to the
SUB_CONFIGS
- Add handling for the new section to
generate.py
, underget_prefix
- Add handling for the new section to
library.py
, underbuild
. (This last step could probably be easily automated away with some slight changes to the logic there).
In any wandb repo's Actions workflow, you can trigger document generation like so...
- uses: wandb/[email protected]
with:
gitbook-branch: en
access-token: ${{ secrets.DOCUGEN_ACCESS_TOKEN }}
where vX.Y.Z
is a tag on this (docugen) repo and DOCUGEN_ACCESS_TOKEN
is a Personal Access Token with org write permissions, stored in the consuming repo. This will generate docs based on the latest releases of all contributing projects and push them to the target branch in the gitbook repo (en
is the branch used for the main English docs).
After merging your changes to docugen@main
...
git tag -a vX.Y.Z -m "docugen version X.Y.Z"
git push origin vX.Y.Z
Then you'll need to update the uses
line in the consuming repo. If we eventually have a lot of repos doing this it might be better to refer to @main
, to save us the trouble of manually updating all of them (note that if you don't update a repo, it will trigger the old document generation logic, potentially overwriting new docs).
You can test changes to the action on a branch with uses: wandb/docugen@branch-name
from any other repo.