One of the core benefits of Dagster is fostering collaboration between the engineers who build data pipelines and the end users who consume the data pipelines produce.
Dagster offers several ways to provide useful information and documentation alongside your data pipelines, including metadata and tagging.
Using metadata and tags helps you:
There are two main types of metadata in Dagster: definition and runtime. In code, you can attach metadata to definitions like assets, jobs, and runs. You'll then be able to view the metadata in the Dagster UI.
Definition metadata is information that's fixed or doesn't frequently change. For example, definition metadata could be:
Runtime metadata is information that changes after an action occurs. This could be how many records were processed, how long an asset took to materialize, etc.
How metadata is defined depends on whether you're using assets or ops and jobs:
In addition to definition metadata, you can also use tags to label and organize definitions in your Dagster project. Definitions like assets, jobs, and runs can have multiple tags, which are applied in code. Refer to the Tags documentation to get started.