OSCAL XProc 3.0 Tutorial

This is work in progress towards an XProc 3.0 (and 3.1) tutorial or set of tutorials.

Coverage here is not a substitute for project documentation - the tutorial relies on projects in the repo for its treatments - but an adjunct to it for beginners and new users who wish for guidance and information on XProc that they are not likely to find for themselves.

In its current form, only the first introductory materials are offered. The framework is easily extensible to cover more topics, and an XProc-based tutorial production system is part of the demonstration. But the approach needs to be tested more before it is extended.

Tutorial exercises can be centered on OSCAL-oriented applications but the learning focus will be XProc 3.0/3.1.

We need to be modest and realistic in our goals, in proportion to the great potentials.

Start

Follow the tutorial by reading the files published in the repository, or by copying or cloning the repository for hands-on experience.

Tutorial sequence links (Links to Markdown files)

Approach

First and foremost this is a "practicum" or hands-on introduction that encourages readers not only to follow along, but to try things out, practice and learn by interactive observation.

Otherwise, the tutorial is designed to support multiple different approaches suitable for different learners and needs - both learning styles and goals as described below. Develop an approach that works for your case by moving at your own speed and skipping, skimming or delving more deeply into topics and problems of interest.

Each topic ("Lesson") in a sequence offers a set of Lesson Units around a common problem area or theme, leveraging projects in the repository to provide problems and solutions with working pipelines to run and analyze.

The first two topics cover setup and overview ("walkthrough"), while later topics delve into specific application domains.

The important problem of data conversion between XML and JSON is the first topic explored in depth. Later topics (not yet developed) may cover XML-based publication; validation against schemas and rule sets (both standard and locally-defined); data acquisition from PDF and JSON; OSCAL profile resolution (rendition of control sets defined by profiles in catalog form); processing XProc with XProc; unit testing under XProc (XSpec); and others.

Since the topics all build on previous topics, considering them in the given order will be least confusing.

Make the most of this

If you can run Java applications this tutorial will school you directly in XProc development, using an open-source processor. Command-line processing is made to be easy and sweet.

If you can't or won't run the distribution offered, but you can nonetheless run XProc 3.0 pipelines in a conformant processor (for example in an XML coding environment with XProc 3.0/3.1 support), be glad, this tutorial will still work - as the pipelines should run the same in any conformant XProc engine. Just make adjustments as you go.

If you cannot run XProc 3.0 at all, let us know how we can help (in an XProc tutorial).

Target audiences

Data owners, proprietors and process managers who need to oversee XProc-based processes
XML-stack developers starting or working with XProc 3.0/3.1
Students at all levels
OSCAL engineers new to XML/XDM who wish to broaden OSCAL horizons - this tutorial being a gateway to OSCAL applications in this repository

To enable readers to cater to their own needs, the tutorial offers these tracks:

Observer track - operates pipelines (hits 'run' button or equivalent) and inspects outputs (in a file system), but does not code internals
Maker track - able and willing to intervene in the internals
Learner track - everyone

Since the different tracks are arranged along the same topics, the treatments are also suitable for groups who wish to work any or all tracks collaboratively.

If you want a no-code experience, read the Learner track, skip the Maker track and skim the Observer track. Keep in mind that you might have to run pipelines, if only to see their outputs.

If for any reason you can't run XProc or Java, post us an Issue and the dev team will investigate options including a container-based distribution of some nature. The simplicity of 'bare bones' however recommends it to us and you.

Observer Track

This track is for everyone but most especially for researchers including subject experts, managers and developers who want to see how XProc works and understand its capabilities, but who do not plan to build XProc pipelines or write code. Maybe you need to understand it so you can help develop and test against requirements.

In this track you will inspect pipelines, run pipelines, and inspect their inputs and outputs.

Since this track does not ask you to modify or extend anything, you can have confidence that any bugs or errors you encounter expose honest problems to be inquired after, not errors on your part.

The idea is that with a foundation in running pipelines and witnessing their capabilities at first hand, readers can then judge for themselves how deeply to delve into the various topics raised, whether moving along, skipping, stopping, or pausing to look at the other tracks. Everyone gets a grounding by looking at the problems together.

Maker Track

The coverage here parallels the Observer track, but provides exposition appropriate to newcomers to XProc who wish to accelerate learning by means of trial and experiment.

XProc is intimately tied to XML (syntax), XDM (data model), XPath (for queries and pattern-matching) and a host of related 'X' technologies developed over decades at W3C and elsewhere. Those who have prior experience with XML and XSLT will find much that is familiar along with what is new, but even developers who are not XML specialists should be able to use the Maker track to test and develop skills. If XML is the rocket, your data is the fuel and XProc is the launchpad. It should be possible to start from zero, and with a model rocket.

The expectation is that unlike Observers, Makers will be interested in tracebacks and error messages, and will wish sometimes to produce them deliberately if only to see what happens. As a Maker, you are confident that you can put things back well enough or that any damage is well contained.

Makers can also expect the gratification of making things work in new ways.

If you are a tactile learner with no patience for reading, you can skim through the Observer and Maker tracks together quickly and focus on the examples, to try things out. Within topics, all tracks sometimes offer useful context, while the Maker track is specifically streamlined. (But maybe you have found that out for yourself and will never read these words to say so!)

Learner Track

In parallel with the other two tracks, the Learner track offers all readers more explanation and commentary, in greater depth and with more links.

Note that the Learner track itself represents the views of one still learning, so it is subject to change and refinement - most especially if you find things in it that are in need of clarification or correction.

Easter eggs

There are more curiosities to be discovered. For example, the tutorial has indexes, built using XProc. There are worksheets. Close inspection of hidden corners sometimes pays off. Links are not always given here or anywhere, and prizes go to the vigilant.

Interactivity

Interactivity of a sort is built in, inasmuch as every lesson is centered on specific processing pipelines already in place and ready to run.

How far you go with this is up to you, but one obvious possibility for some will be to bring your own data set and consider adapting pipelines to do things with it.

OSCAL data will especially be of interest, especially OSCAL catalogs and profiles.

Editing the lessons

The lessons are published in Markdown, which invites you to create and edit copies.

When you do so, however, please take care:

To see to it that your edits or annotations are easily distinguishable
To save your edited copy out, lest it be deleted inadvertently as a temporary artifact (see next section)

All work product here is in the Public Domain, so no worries there. Of course this means you can't claim copyright either: caveat scriptor.

Production

Source files are maintained in a reduced HTML format.

This reduction is enforced by means of a Schematron rule set, which forbids anything outside of a small core of supported elements in HTML, whose mappings and semantics can be well defined.

Simultaneously, a transformation is maintained that rewrites these into Markdown syntax. The Markdown files can be produced statically or under CI/CD, committed into the repository and read directly in its previews.

This model is designed to requirements including:

Easy to use in available tools including oXygen XML Editor (structured authoring)
Quick and lightweight to stand up
XProc-based
Works bare-bones but easy to extend ad-hoc
Quick and easy to publish
Github/markdown-based publishing is versatile for users
Ease of extensibility and reusability keeping options open for something better, later.

At a future point we can also opt to produce the Tutorial in other formats, if this is considered useful.

Runtimes

See the top-level pipelines for current capabilities. At time of writing:

PRODUCE-TUTORIAL-MARKDOWN.xpl produces a set of Markdown files, writing them to the sequence directory.

PRODUCE-TUTORIAL-TOC.xpl produces the Tutorial Table of Contents

PRODUCE-TUTORIAL-PREVIEW.xpl produces a single preview tutorial on one HTML page

PRODUCE-PROJECTS-ELEMENTLIST.xpl produces an index to XProc elements appearing in pipelines under discussion - read about it in the lessons

Leave your tracks

Follow up

If you make a fork of this repository, let us know - also be aware we may use your name (unless you request we not do so).

If you copy or use anything from the repository also feel free to let us know.

Discussion board

The Github repository Discussion Board is a worthwhile place to browse or to pose questions, whether directly about the tutorial or exercises, or more broadly.

Github PRs

Consider making a pull request with an enhancement to the repository. If there are no corrections or improvements to suggest, sample files are always welcome. Future users can have the benefit of your experience.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

readme.md

OSCAL XProc 3.0 Tutorial

Start

Approach

Make the most of this

Target audiences

Observer Track

Maker Track

Learner Track

Easter eggs

Interactivity

Editing the lessons

Production

Runtimes

Leave your tracks

Follow up

Discussion board

Github PRs

Files

readme.md

Latest commit

History

readme.md

File metadata and controls

OSCAL XProc 3.0 Tutorial

Start

Approach

Make the most of this

Target audiences

Observer Track

Maker Track

Learner Track

Easter eggs

Interactivity

Editing the lessons

Production

Runtimes

Leave your tracks

Follow up

Discussion board

Github PRs