Reading Blobs
Defining functions that read blob columns is straight forward. For scalar UDFs, blob columns are expected to be of typeBlobFile
Writing Blobs
Defining UDFs that write outBlobs to a new column is straightforward. Here we add the standard metadata annotation to the UDF so that Geneva knows to write out Blobs.
For scalar udfs, your udf will return bytes, explicitly set the data_type to pa.large_binary(), and add the field_metadata that specifies blob encoding.
pa.RecordBatch batched UDFs you the effort is similar: