Explore DAGs¶
A Matchbox collection can store several runs of a DAG. Each run is a server-side snapshot of the sources, models, and resolvers that define one entity view for that collection.
This guide shows how to list collections, download a stored run, and inspect the pipeline it contains.
Listing collections¶
You can also do lightweight exploration from the CLI.
Run mbx collections --help to inspect the available collection commands and options.
Downloading a DAG¶
Load the default or pending run for a collection.
load_default() reconstructs the published run for that collection. This is the run other users and services query by default.
load_pending() reconstructs the latest non-default run, which is usually the draft currently being worked on before publication.
The downloaded DAG includes the serialisable definitions of every source, model, and resolver in that run.
Inspecting the pipeline¶
Use draw() to inspect the dependency graph.
The default resolver is the single final resolver in a complete published DAG. It’s used in functions like DAG.get_matches() if no resolver is supplied. To make a DAG run the default, a final_resolver must be present.
Inspecting individual steps¶
You can retrieve sources, models, and resolvers by name.
Once you have the DAG locally, you can attach warehouse clients, query resolved entities, or start a new run from the same structure.