Excel_Read
Node
Node Description
Reads Excel file and uploads processed result to data lake
Ingests one Excel file, which is passed to the user-defined function as a bytes object for processing. The user-defined function returns one or more tables, which are uploaded to the Ganymede data lake.
Node Attributes
- input_file_excel
- File extension for valid Excel files submitted. For example, filling in this attribute with ".xls" will allow any file containing extension .xls, .xlsx, .xlsm, and .xlsb to be uploaded.
- output_table_results
- Table displayed on Table Head in Ganymede UI.
Notes
In the execute function, returning NodeReturn(tables_to_upload={'results': df}) would render the DataFrame df in the Flow Editor if Table Head visualization is enabled.
Example
The Node attribute configuration below captures all files ending in xls, xlsx, xlsm.
- excel: *.{xls,xlsx,xlsm}
- results: instrument_output_results
User-Defined Python
Reads Excel file and stores processed table(s) in data lake.
Parameters
- excel_file :
bytes | dict[str, bytes]
- Excel file as a bytes object or as dict indexed by filename
- ganymede_context :
GanymedeContext
- Ganymede context variable, which stores flow run metadata
Returns
NodeReturn
Object containing data to store in data lake and/or file storage
Notes
Excel_file is represented in bytes so user can handle cases where Excel spreadsheet is a binary object within this function