Skip to content

Latest commit

 

History

History
172 lines (102 loc) · 10.7 KB

README.md

File metadata and controls

172 lines (102 loc) · 10.7 KB

Metaschema XSLT

License

See the project license in this repository.

This project is placed into the worldwide public domain.

Software description

Metaschema-XSLT is an XSLT-based implementation of the NIST (ITL/CSD) Metaschema Information Modeling Framework.

NIST Metaschema defines how to describe sets of data objects -- serialized information sets, considered abstractly -- across and between implementations of processing systems that are coded to manipulate and process such objects. These descriptions are provided by means of a formal language developed for this purpose.

In support of metaschemas defined using this language, this software can be used to generate utilities and software artifacts including XML Schema (XSD) and JSON Schema, data conversion scripts, and model documentation.

Such utilities can in turn provide the basis for ecosystems of tools supporting data models and data sets expressed in the supported formats. Make-your-own Schema Kit except your schema comes with many extras, including not only documentation, but also the ability to cast your data between formats and maintain formal format-agnosticism.

Statement of purpose

NIST ITL/CSD Metaschema is a data modeling technology, which works by providing formal and validable descriptions of data models such that instances (data sets) conforming to these models can be cast and rewritten in various alternative syntaxes, i.e. optimized at the level of format, while permitting lossless conversion of defined-as-equal information sets across format boundaries.

In addition to neutralizing the "XML vs JSON" or more broadly "Markup language vs object notation" issue, this technology also offers other capabilities such as external specification and validation of application-oriented semantics ("business rules") defined over models appropriate to specialized knowledge domains.

The current effort seeks to implement, demonstrate and enable the capabilities of a NIST Metaschema system, with respect to functionalities including schema generation, data conversion or pipelining, stylesheet generation or a host of other potential uses of Metaschema and metaschemas considered broadly. In addition, we aspire to do so sustainably, not in the sense that the software will meet all requirements for all time, but that it will be reliable, validable, traceable, extensible -- and replaceable.

Considering data traceability and process transparency as primary goals will also serve secondary goals, such as the creation and demonstration of use cases (for this and compatible software) and the codification and maturation of relevant standards, including standards supported by Metaschema such as OSCAL.

Stress is therefore on:

  • Transparency, testability and maintainability of processing on an open-source stack
  • Conformance in support of Metaschema-based data interchange / interop
  • Generality of support of Metaschema features

Additionally, we care about, but do not prioritize:

  • Performance
  • Portability beyond the XML/XSLT dependency stack (instead relying on other Metaschema initiatives)
  • Specific use cases

Origins

Formerly housed in the Metaschema repository, this code base traces the history of development of the Metaschema concept in the context of the OSCAL project. It was originally conceived as a demonstration and proof of concept, providing a bridge enabling JSON- and XML-based development in parallel over common problem sets and common data. Success in this effort led to a determination that multiple implementations of a platform-independent specification were needed, at which point this implementation was carved out into its own repository.

Project sunset

The NIST team has no long-term plan to maintain this project outside the context of the initiatives it supports, OSCAL and Metaschema, while we reserve the right to change and adapt to community needs in alignment with NIST's mission. A project goal is to enable the stabilization and socialization of the Metaschema technology, as evidenced by the use and support of other Metaschema implementations.

The best way to ensure long-term access to the code base is to clone or fork the git repository and archive a copy for study or use.

Repository contents

src includes XSLT source code, with supporting infrastructure including ad-hoc testing

support includes dependent submodules with other static resources for configuration

Installation and operation

To operate in trial, test or 'bare-bones' mode, scripts are offered to perform operations with no installation except Maven (with JDK as required) and bash as a command line environment.

The utilities are however designed for integration in a range of environments, and core functionalities are implemented in XSLT 3, which is supported across platforms including Java, node JS and C.

The software is designed to be used in a range of ways:

  • Directly, in development of metaschemas and Metaschema-based software and tools
  • Within Metaschema-based builds, including under CI/CD, to generate artifacts or productions from metaschema source under controlled conditions

To run

The following generalized services are provided by the tools in this repository, separately or in combination

Scripts and stylesheets are documented in place using readmes and in line. Most scripts depend on Apache Maven supporting a Java runtime. Since XSLTs can call, import, include or read XSLTs from elsewhere in the repo, and sometimes do, keep the modules together: each folder on its own is not self-contained.

Accordingly, a good place to start for further research is the src directory with its readme.md.

For testing, all XSpec scenarios (*.xspec) can be run in place to generate local test reports.

Users are also expected to call resources in this repository from their own scripts. Do this either by cloning, copying and modifying scripts here; by writing your own; or by adapting code into the XML/XSLT processing framework or stack of your choice.

In general, at least two invocations will be offered for each process, an XProc-based invocation and a pure-XSLT-based invocation. Either may be useful in different scenarios.

A convention is used indicating that an XProc (*.xpl file) or XSLT (*.xsl) intended to be invoked directly (that is, not only to be used as a module or component) is given a name entirely or partly in ALL-CAPITALS. For example, src/schema-gen/METASCHEMA-ALL-SCHEMAS.xpl is such an XProc pipeline (a step definition intended to be used directly). The XSLTs that observe this convention are, additionally, higher-order transformations by virtue of using the transform() function; for all other resources the convention lower-case-hyphenated is followed.

Dependencies

As a freely-available XSLT 3.0 engine, the Saxon XSLT processor can be regarded as a de facto dependency - while this XSLT-conformant code should in principle run in any processor implementing the language. Saxon-HE can be bundled using Maven or another Java packaging technology.

The POM file for Java/Maven configuration indicates the current tested version of Saxon. At time of writing, Saxon versions 10 and 11 are known to work with this codebase. When reporting bugs please include the version of your processor.

Developers interested in demonstrating the viability of these processes in different processors and environments are eagerly invited to participate in development of this tool or related tools.

Git Client Setup

See more on git setup on the page on the Contributing page.

Clone the project:

git clone --recurse-submodules https://github.com/usnistgov/metaschema-xslt.git

Or to clone with submodule initialization:

git clone --recurse-submodules https://github.com/usnistgov/metaschema-xslt.git

Initializing Git submodules

This GitHub repository makes use of Git submodules to mount other repositories as subdirectories. When cloning this repo for the first time, you need to initialize these.

git submodule update --init

Or do this in one step using clone --recurse-submodules as noted.

Updating submodules

Submodule contents will be periodically updated. To ensure you have the latest commits for a configured submodule:

git submodule update --init --recursive

Contact information

Principal Investigator: Wendell Piez, NIST (US National Institute of Standards and Technology) ITL/CSD (Information Technology Laboratory, Computer Security Division).

Email w e n d e l l (dot) p i e z (at) n i s t (dot) g o v.

As this initiative spins off from and supports the ITL/CSD Metaschema Project, originally and currently being developed in support of OSCAL, the Open Security Controls Assessment Language, feedback on this project will be gratefully received in those forums.

For OSCAL-related discussions see the OSCAL Site.

Cite this work

Piez, Wendell (2023), Metaschema-XSLT. US National Institute of Standards and Technology (NIST). https://github.com/usnistgov/metaschema-xslt.

Related projects

See the Metaschema Repository and its Pages Site for a description and specification of the technology this software is designed to support. Those pages also contain links to other projects.

See the Project Wiki for documentation maintained on this site.

Required outline

This page includes all the following, as described by guidelines at https://raw.githubusercontent.com/usnistgov/opensource-repo/main/README.md

  1. Software or Data description
    • Statements of purpose and maturity
    • Description of the repository contents
    • Technical installation instructions, including operating system or software dependencies
  2. Contact information
    • PI name, NIST OU, Division, and Group names
    • Contact email address at NIST
    • Details of mailing lists, chatrooms, and discussion forums, where applicable
  3. Related Material
    • URL for associated project on the NIST website or other Department of Commerce page, if available
    • References to user guides if stored outside of GitHub
  4. Directions on appropriate citation with example text
  5. References to any included non-public domain software modules, and additional license language if needed, e.g. BSD, GPL, or MIT