Specifications⚓︎
Main specification⚓︎
The Data Product Specification (DPDS) is the main specification defined by the Open Data Mesh Initiative to describe all the components of a data product.
DPDS defines what are the important components to describe and how they are organized within the descriptor document.
Whenever possible the description of a component or one of its sub-parts is delegated to an independent specification defined within the Open Data Mesh Initiative (i.e. sub-specification) or by third parties (i.e. external specification).
Sub-specifications⚓︎
Data Store API Specification⚓︎⚓︎
The DatastoreAPI Specification (DSAPI) is a specification for describing, producing, consuming and visualizing connection based service that expose a tabular datastore.
Tip
DPDS RECOMMEND to use this specification to define the API of ports that expose tabular data stored in database accessible with a connection based protocol.
Check out DatastoreAPI repository on GitHub for further information.
Schema Annotation Specification⚓︎⚓︎
The Schema Annotation Specification (SAS) is a specification that defines how to annotate structural elements within a schema describing a data model with metadata. SAS is independent of the Schema Definition Language (SDL) used, meaning it can be applied alongside any SDL (e.g., JSON Schema, Avro, Protobuf, XSD, etc.).
Tip
DPDS RECOMMEND to use this specification to annotate schema within port's API definition to provide all required metadata.
Check out Schema Annotation Specification repository on GitHub for further information.
External specifications⚓︎
API⚓︎
Name | Description |
---|---|
OpenAPI | The OpenAPI Specification, previously known as the Swagger Specification, is a specification for machine-readable interface files for describing, producing, consuming and visualizing RESTful web services. |
AsyncAPI | The AsyncAPI Specification is a specification for describing, producing, consuming and visualizing asynchronous services. |
Tip
DPDS RECOMMEND to use OpenAPI Specification to define the API of ports that expose RESTful services and AsyncAPI Specification to define the API of ports that expose streaming services
Schemas⚓︎
Name | Description |
---|---|
JSON Schema | A set of rules in JSON format defining the expected structure and data types for JSON data. |
Avro | A data contract written in a compact binary format describing the expected structure of data for efficient storage and processing. |
Protobuf | A language-agnostic definition written in a special syntax, defining the structure, data types, and serialization of messages for efficient data exchange. |
Observability⚓︎
Name | Description |
---|---|
OpenTelemetry | an open standard telemetry data collection |
OpenLineage | an open standard for lineage data collection |
Open SLO | an open standard to define Service Level Agreement |
Computational Governance⚓︎
Standard | Description |
---|---|
Open Policy Agent | Policy-based control for cloud native environments |
Great Expectations | A platform to test data quality |
Soda | A platform to test data quality |
Overlapping specifications⚓︎
Name | Description |
---|---|
Open Data Product Specification | A data product specification |
Data Product Specification | Another data product specification (by AgileLab) |
Open Data Contract Standard | A data contract specification (by Bitol) |
Data Contract Specification | Another data contract specification (by INNOQ) |