Skip to main content

iceberg-bioimage

iceberg-bioimage logo
GitHub stars for WayScience/iceberg-bioimage

iceberg-bioimage scans any image store into a versioned Apache Iceberg catalog that directly exports Cytomining-compatible Parquet warehouses.

Problem: Raw bioimaging archives have no standard catalog — finding, versioning, and joining images to downstream data requires bespoke scripts per lab.

Key capabilities:

  • Scan image stores into canonical ScanResult objects
  • Publish image metadata with PyIceberg for versioned, queryable catalogs
  • Export Cytomining-compatible Parquet warehouses for profiling workflows
  • Validate profile tables against microscopy join contracts
  • Supports Zarr, OME-TIFF, and Parquet source formats

View documentation → · View on GitHub →