
Ganymede Metadata Tables

Ganymede metadata tables let users query and reference details about flow runs, tables, and files stored within the Ganymede platform. They are useful for debugging previous runs, locating files or table records associated with past runs, and listing the tags used to contextualize captured data and keep it traceable.

The tables described below are found in the Ganymede database, and can be referenced from Data Explorer, within Flow code, or in Dashboards.

Flow Runs

The flow_run_metadata table captures a record for each flow execution.

| Field Name | Description |
| --- | --- |
| flow_run_id | Unique identifier for the Flow run, stored as epoch time in milliseconds |
| flow_id | Flow name |
| inputs_dict | Run context for the Flow; corresponds to the run context described in the Ganymede Class overview |
| initiator | Flow run initiator; agent name + MAC address for agent-initiated Flows, user email for user-initiated Flows, event name for event-triggered Flows, or Flow name if triggered from another Flow |
| initiator_type | Type of initiator that started the Flow run (AGENT, USER, EVENT, FLOW) |
| flow_version | Commit hash associated with the Flow execution |
| state | Final disposition of the Flow run (Success, Failed) |
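
Because flow_run_id is stored as epoch time in milliseconds, it can be turned into a readable timestamp when debugging a run. The snippet below is a minimal sketch of that conversion; the helper name flow_run_id_to_datetime and the example value are hypothetical and not part of the Ganymede API.

```python
from datetime import datetime, timezone


def flow_run_id_to_datetime(flow_run_id: int) -> datetime:
    """Interpret a flow_run_id (epoch milliseconds) as a UTC datetime."""
    return datetime.fromtimestamp(flow_run_id / 1000, tz=timezone.utc)


# Hypothetical flow_run_id, used only for illustration
example_run_id = 1700000000000
run_time = flow_run_id_to_datetime(example_run_id)
print(run_time.isoformat())
```

Keeping the result timezone-aware (UTC) avoids ambiguity when comparing run times across machines in different time zones.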

Table Writes

The public_flow_run_output table contains 1 record per output table written in Ganymede.

| Field Name | Description |
| --- | --- |
| id | ID for table save |
| node | Name of node that wrote to table |
| name | Name of table written to Ganymede table storage |
| timestamp | Time of the table save; stored as epoch time in fractional seconds |
| datastream_metadata | JSON containing the UUID associated with the table write transaction and Flow run ID |
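
Note that timestamp here is in fractional seconds, unlike the millisecond values used elsewhere, and that datastream_metadata must be parsed as JSON before its contents can be used. The following sketch shows both steps on a made-up record. The JSON key names ("uuid", "flow_run_id") and all values are assumptions for illustration; check an actual record for the real keys.

```python
import json
from datetime import datetime, timezone

# Hypothetical public_flow_run_output record. The field names come from the
# table above; the values and the JSON key names are assumed, not documented.
row = {
    "id": 1,
    "node": "Example_Node",
    "name": "example_table",
    "timestamp": 1700000000.25,
    "datastream_metadata": (
        '{"uuid": "00000000-0000-0000-0000-000000000000", '
        '"flow_run_id": 1700000000000}'
    ),
}

# timestamp is already in seconds, so no division by 1000 is needed
saved_at = datetime.fromtimestamp(row["timestamp"], tz=timezone.utc)

# Parse the JSON string, then read the (assumed) flow run key
metadata = json.loads(row["datastream_metadata"])
flow_run_id = metadata.get("flow_run_id")

print(saved_at.isoformat(), flow_run_id)
```

The extracted Flow run ID can then be matched against flow_run_id in flow_run_metadata to find the run that produced the table.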

File Writes

The file_metadata table contains 1 record per file written in Ganymede.

| Field Name | Description |
| --- | --- |
| bucketType | Either "Input" or "Output"; indicates whether the file was input into Ganymede or an output from processing performed in Ganymede |
| uri | Full URI to the file |
| path | Path within storage bucket to file |
| size | Size of the file in bytes |
| createdAt | Time of the file save, stored as epoch time in milliseconds |
| creator | Name of the file creator (e.g., email address for users, agent name + MAC address for Agents) |
| fileTags | List of JSON objects containing Tags associated with the file |
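
As a rough illustration of how these fields combine when locating files, the sketch below filters in-memory records by bucketType and tag, then finds the most recently created file. The records, paths, and the shape of each tag object ({"name": ...}) are hypothetical; the source only states that fileTags is a list of JSON objects.

```python
from datetime import datetime, timezone

# Hypothetical file_metadata records. Field names follow the table above;
# all values and the tag object shape ({"name": ...}) are assumptions.
records = [
    {"bucketType": "Input", "path": "raw/plate_01.csv", "size": 2048,
     "createdAt": 1700000000000, "fileTags": [{"name": "plate_reader"}]},
    {"bucketType": "Output", "path": "processed/plate_01.csv", "size": 1024,
     "createdAt": 1700000060000,
     "fileTags": [{"name": "plate_reader"}, {"name": "qc_passed"}]},
    {"bucketType": "Output", "path": "processed/plate_02.csv", "size": 512,
     "createdAt": 1700000120000, "fileTags": []},
]


def has_tag(record: dict, tag_name: str) -> bool:
    """Return True if any tag object on the record carries the given name."""
    return any(tag.get("name") == tag_name for tag in record["fileTags"])


# Output files carrying a specific tag
tagged_outputs = [
    r for r in records
    if r["bucketType"] == "Output" and has_tag(r, "qc_passed")
]
total_bytes = sum(r["size"] for r in tagged_outputs)

# Most recently created file; createdAt is epoch milliseconds
newest = max(records, key=lambda r: r["createdAt"])
newest_created = datetime.fromtimestamp(newest["createdAt"] / 1000, tz=timezone.utc)

print([r["path"] for r in tagged_outputs], total_bytes, newest["path"])
```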

Python Packages

The ganymede_env_python_package_list table lists all Python packages installed in the workflow environment.

| Field Name | Description |
| --- | --- |
| package_name | Name of the Python package |
| version_number | Version of the Python package |

Pandas <-> Ganymede DB Field Reference

SQL column names are subject to character restrictions that do not apply to Pandas column names. The __TABLE_SCHEMA table contains the mapping between table field names referenced in NodeReturn and the corresponding SQL column names.

The retrieve_tables method of the Ganymede class provides a convenient way to retrieve tables in notebooks with their original Pandas DataFrame column names.

The __TABLE_SCHEMA table contains 1 record per table write transaction.

| Field Name | Description |
| --- | --- |
| pd_field_name | Pandas field name |
| pd_field_type | Pandas field column type |
| bq_table_name | Table name in Ganymede data lake |
| bq_field_name | Column name in Ganymede data lake |
| flow_run_id | Flow run that the table write is associated with |
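
To illustrate how this mapping might be used when the SQL column names differ from the original Pandas names, the sketch below builds a lookup from bq_field_name to pd_field_name for one table write and applies it to a query result row. The schema rows, the sanitized column names, and the helper sql_to_pandas_names are all hypothetical; the actual renaming rules Ganymede applies are not described here.

```python
# Hypothetical __TABLE_SCHEMA rows for a single table write. Field names
# follow the table above; the values, including the sanitized SQL column
# names, are invented for illustration.
schema_rows = [
    {"pd_field_name": "Sample ID", "pd_field_type": "object",
     "bq_table_name": "example_table", "bq_field_name": "Sample_ID",
     "flow_run_id": 1700000000000},
    {"pd_field_name": "Conc. (ng/uL)", "pd_field_type": "float64",
     "bq_table_name": "example_table", "bq_field_name": "Conc___ng_uL_",
     "flow_run_id": 1700000000000},
    {"pd_field_name": "Well", "pd_field_type": "object",
     "bq_table_name": "other_table", "bq_field_name": "Well",
     "flow_run_id": 1700000000000},
]


def sql_to_pandas_names(rows: list, table_name: str, flow_run_id: int) -> dict:
    """Map SQL column names back to Pandas names for one table write."""
    return {
        row["bq_field_name"]: row["pd_field_name"]
        for row in rows
        if row["bq_table_name"] == table_name
        and row["flow_run_id"] == flow_run_id
    }


mapping = sql_to_pandas_names(schema_rows, "example_table", 1700000000000)

# A row as it might come back from a SQL query against example_table
sql_row = {"Sample_ID": "S-001", "Conc___ng_uL_": 12.5}

# Restore the original names; unmapped columns keep their SQL name
restored = {mapping.get(col, col): value for col, value in sql_row.items()}
print(restored)
```

With a Pandas DataFrame, the same dictionary can be passed to DataFrame.rename(columns=mapping) to restore the original column names in one step.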