Recently, repository structure, schema formats and the amount of schema blocks represented in the repositories have seen some maturation and expansion. The current setup is closing in on moving from an “experimental state” towards production readiness, i.e. a stable format for general document structure, canonical links and also frozen representation of first core schemas.
Early on, {S}[B] participants had agreed on the representation of data schemas using the JSON Schema conventions. While the first demonstrator schemas had been implemented in a similar, OpenAPI derived format, recent efforts have moved towards a clean JSON Schema representation, including a metadata header format specifically geared towards {S}[B]. An important element here is the sb_status attribute, which will serve as primary indicator of a given schemas acceptance level and stability.
BlockMeta
schema [➚]sb_status
documentation and issueAs one of the most advanced and complex data schemas in the GA4GH ecosystem, and due to its dynamic alignment with pre-existing and standards emerging from the different work streams, the Phenopackets schema constitutes a rich source for reference schema blocks which should find their way into other GA4GH related developments.
Over the last weeks, members of the {S}[B] team (especially Isuru, Jules, Ben and Michael) have worked on converting parts of the Phenopackets specification to JSON Schema blocks, with documentation rendered into the {S}[B] schema collection. This process isn’t finished yet, but the existing schemas already should serve as the basis for discussions about representation, code maintenance and versioning and adoption of the working procedures for other schemas.
tools
Repository [➚]Current {S}[B] source code development follows an “integrated document” approach, where the main documentation and data examples for each schema are included inline, as values of JSON Schema attributes (examples
, description
). From the main schema document (written in YAML), a dedicated parser then extracts
This deparsing of the source code can be performed using the sbSchemaParser.pl
script, in a local setting containing the current versions of all relevant {S}[B] repositories.