System Reference Tables

Ganymede metadata tables let users query and reference information about flow runs, tables, and files stored in Ganymede. Possible uses include debugging prior runs, finding specific files or table records associated with prior runs, and systematically displaying the tags used to contextualize captured data and ensure its traceability.

The tables described below are found in the Ganymede database, and can be referenced from Data Explorer, within Flow code, or in Dashboards.

Flow Runs

The flow_run_metadata table is written to once per flow execution.

| Field Name | Description |
| --- | --- |
| flow_run_id | Unique identifier for the flow run, stored as epoch time in milliseconds |
| flow_id | Flow name |
| inputs_dict | Run context for the flow; corresponds to the run context described in the Ganymede Class overview |
| initiator | Flow run initiator; agent name + MAC address for agent-initiated flows, user email for user-initiated flows, event name for event-triggered flows, or flow name if triggered from another flow |
| initiator_type | Type of initiator for the flow run (AGENT, USER, EVENT, FLOW) |
| flow_version | Commit hash associated with flow execution |
| state | Final disposition of flow run (Success, Failed) |
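
Because flow_run_id is an epoch time in milliseconds, it can be turned into a readable timestamp when reviewing prior runs. The following is a minimal sketch that works on rows already fetched from flow_run_metadata as Python dictionaries; the sample rows, flow names, and helper function names are hypothetical, and the exact stored type and casing of the state field are assumptions.

```python
from datetime import datetime, timezone


def flow_run_id_to_datetime(flow_run_id):
    """Interpret a flow_run_id (epoch time in milliseconds) as a UTC datetime."""
    return datetime.fromtimestamp(int(flow_run_id) / 1000, tz=timezone.utc)


def failed_runs(records):
    """Return the records whose state is 'Failed', most recent run first."""
    failed = [r for r in records if r["state"] == "Failed"]
    return sorted(failed, key=lambda r: int(r["flow_run_id"]), reverse=True)


# Hypothetical rows shaped like flow_run_metadata; all values are invented.
sample_runs = [
    {"flow_run_id": 1700000000000, "flow_id": "flow_a", "initiator_type": "USER", "state": "Success"},
    {"flow_run_id": 1700000060000, "flow_id": "flow_a", "initiator_type": "AGENT", "state": "Failed"},
    {"flow_run_id": 1700000120000, "flow_id": "flow_b", "initiator_type": "EVENT", "state": "Failed"},
]

for run in failed_runs(sample_runs):
    print(run["flow_id"], flow_run_id_to_datetime(run["flow_run_id"]).isoformat())
```

Sorting on flow_run_id works as a chronological sort only because the identifier is itself a timestamp.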

Table Writes

The public_flow_run_output table contains 1 record per output table written in Ganymede.

| Field Name | Description |
| --- | --- |
| id | ID for table save |
| node | Name of node that wrote to table |
| name | Name of table written to Ganymede table storage |
| timestamp | Time of table save; epoch time in fractional seconds |
| datastream_metadata | JSON containing UUID associated with table write transaction and flow run ID |
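
To trace a table write back to its flow run, the datastream_metadata JSON has to be parsed. The sketch below assumes the row has already been fetched as a Python dictionary; the JSON key names ("uuid", "flow_run_id") are hypothetical placeholders, since the exact keys are not listed here, so inspect a real record before relying on them.

```python
import json
from datetime import datetime, timezone


def summarize_table_write(record, run_id_key="flow_run_id"):
    """Summarize one public_flow_run_output row.

    run_id_key is a hypothetical key name inside datastream_metadata;
    pass the real key once it has been confirmed against actual data.
    """
    meta = record["datastream_metadata"]
    if isinstance(meta, str):  # the JSON may arrive as text or as a parsed dict
        meta = json.loads(meta)
    return {
        "table": record["name"],
        "node": record["node"],
        # timestamp is epoch time in fractional seconds
        "saved_at": datetime.fromtimestamp(float(record["timestamp"]), tz=timezone.utc),
        "flow_run_id": meta.get(run_id_key),
    }


# Hypothetical row; every value, and the metadata key names, are invented.
sample_write = {
    "id": 1,
    "node": "Transform_py",
    "name": "plate_results",
    "timestamp": 1700000000.25,
    "datastream_metadata": '{"uuid": "00000000-0000-0000-0000-000000000000", "flow_run_id": 1700000000000}',
}

summary = summarize_table_write(sample_write)
print(summary["table"], summary["saved_at"].isoformat(), summary["flow_run_id"])
```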

File Writes

The file_metadata table contains 1 record per file written in Ganymede.

| Field Name | Description |
| --- | --- |
| bucketType | Input or Output; indicates whether the file was input into Ganymede or an output from processing performed in Ganymede |
| uri | Full URI to the file |
| path | Path within storage bucket to file |
| size | Size of file in bytes |
| createdAt | Time of file save; epoch time in milliseconds |
| creator | Name of the file creator (e.g., email address for users, agent name + MAC address for agents) |
| fileTags | List of JSON objects containing tags associated with the file |
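
Since fileTags holds a list of JSON objects, finding the files that carry a particular tag means checking each object in that list. The following sketch assumes rows fetched from file_metadata as Python dictionaries; the tag object keys ("name", "value"), paths, and helper names are hypothetical, as the tag schema is not described here.

```python
import json


def file_has_tag(record, **criteria):
    """True if any tag object on the file matches every key/value in criteria."""
    tags = record.get("fileTags") or []
    if isinstance(tags, str):  # tags may arrive as JSON text or as a parsed list
        tags = json.loads(tags)
    return any(all(tag.get(k) == v for k, v in criteria.items()) for tag in tags)


def total_bytes_by_bucket(records):
    """Sum file sizes (bytes) per bucketType value."""
    totals = {}
    for r in records:
        totals[r["bucketType"]] = totals.get(r["bucketType"], 0) + int(r["size"])
    return totals


# Hypothetical rows; paths, sizes, and the tag object shape are invented.
sample_files = [
    {"bucketType": "Input", "path": "raw/run1.csv", "size": 2048,
     "fileTags": [{"name": "assay", "value": "ELISA"}]},
    {"bucketType": "Output", "path": "out/run1_summary.csv", "size": 512,
     "fileTags": '[{"name": "assay", "value": "qPCR"}]'},
    {"bucketType": "Input", "path": "raw/run2.csv", "size": 1024, "fileTags": []},
]

elisa_paths = [f["path"] for f in sample_files if file_has_tag(f, name="assay", value="ELISA")]
print(elisa_paths)
print(total_bytes_by_bucket(sample_files))
```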

Python Packages

The ganymede_env_python_package_list table lists all Python packages installed in the workflow environment.

| Field Name | Description |
| --- | --- |
| package_name | Name of the Python package |
| version_number | Version of the Python package |

Pandas <-> Ganymede DB Field Reference

SQL column names are subject to character restrictions that do not apply to Pandas column names. The __TABLE_METADATA table contains the mapping between the table field names referenced in NodeReturn and the corresponding SQL column names.

The retrieve_tables method of the Ganymede class provides a convenient way to retrieve tables in notebooks with their original Pandas DataFrame column names.

The __TABLE_METADATA table contains 1 record per table write transaction.

| Field Name | Description |
| --- | --- |
| flow_id | Flow name |
| flow_run_id | Unique identifier for the flow run, stored as epoch time in milliseconds |
| table_id | Table name |
| timestamp | Time of table save; epoch time in fractional seconds |
| type | Whether write transaction is to create, replace, or append to database table |
| table_columns | List of column names in the table |
| table_column_types | List of column types in the table |
| table_row_count | Number of records in Pandas DataFrame being written to DB |
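
To illustrate why a name mapping is needed at all, the sketch below shows how Pandas column names can change when made SQL-safe, and how a two-way mapping lets the original names be restored. The sanitizing rule here is a hypothetical stand-in and is not Ganymede's actual rule; the function names and sample columns are likewise invented for illustration.

```python
import re


def to_sql_column(name):
    """Hypothetical sanitizer (NOT Ganymede's actual rule): replace every
    character outside [A-Za-z0-9_] with '_' and avoid a leading digit."""
    cleaned = re.sub(r"\W", "_", name.strip(), flags=re.ASCII)
    if not cleaned or cleaned[0].isdigit():
        cleaned = "_" + cleaned
    return cleaned


def build_column_maps(pandas_columns):
    """Return (pandas -> sql, sql -> pandas) dictionaries with unique SQL names."""
    to_sql, used = {}, set()
    for col in pandas_columns:
        base = candidate = to_sql_column(col)
        n = 1
        while candidate in used:  # disambiguate names that collide after cleaning
            n += 1
            candidate = f"{base}_{n}"
        used.add(candidate)
        to_sql[col] = candidate
    to_pandas = {sql: original for original, sql in to_sql.items()}
    return to_sql, to_pandas


# Invented column names of the kind a DataFrame might carry.
columns = ["Conc. (ng/uL)", "96 well", "a b", "a-b"]
to_sql, to_pandas = build_column_maps(columns)
for original, sql in to_sql.items():
    print(repr(original), "->", sql)
```

The reverse dictionary plays the role that a stored mapping plays in practice: a sanitized name alone does not reveal which characters were replaced, so the original names have to be recorded at write time.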