Harpy#
Single-cell spatial omics analysis that makes you happy.
π« If you find Harpy useful, please give us a β on GitHub! It helps others discover the project and supports continued development.
Why Harpy?
Harpy is a spatial omics analysis library for spatial transcriptomics and proteomics. Within the scverse stack, it bridges SpatialData and downstream analysis tools such as AnnData, Scanpy, and Squidpy. It provides scalable, image- and geometry-aware computation to transform raw spatial data into analysis-ready representations, with a strong emphasis on interoperability and large-scale workflows.
In practice, Harpy offers fast, out-of-core image preprocessing, tiled segmentation, and efficient aggregation workflows to generate AnnData tables and compute per-cell features from images, segmentation masks, and transcript coordinates. It also supports deep feature extraction, pixel- and cell-level clustering, and the construction of single-cell representations from highly multiplexed images.
Multi-platform support for spatial transcriptomics and proteomics data.
Interoperable outputs built on SpatialData.
Scales to (very) large images: tiled workflows with Dask; optional GPU acceleration with CuPy and PyTorch.
Scalable computational building blocks for segmentation, feature extraction, clustering, and spatial analysis.
For loading and browsing SpatialData stores in napari, alongside feature extraction and interactive object classification workflows, see the napari-harpy package.
For interactive visualization of Harpy outputs using Vitessce, see the harpy_vitessce package.
Note for users upgrading to Harpy
0.4.0: parameters that refer toSpatialDataelements now use the*_nameconvention instead of the older*_layernaming to stay aligned with scverse naming conventions. For example,img_layerbecomesimage_name,labels_layerbecomeslabels_name, andtable_layerbecomestable_name.
Explore how to use Harpy for segmentation, shallow and deep feature extraction, clustering, and spatial analysis of gigapixel-scale multiplexed data with these step-by-step notebooks:
π Basic Usage of Harpy
Learn how to read in data, perform tiled segmentation using Cellpose and Dask-CUDA, extract features, perform QC and analyze results downstream with
ScanpyandSquidpy.π Tutorial image based transcriptomics, Human Ovarian Cancer, Xenium 10x Genomics
π§ Technology-specific advice
Learn which technologies Harpy supports. π Notebook
π§© Pixel and Cell Clustering
Learn how to perform unsupervised pixel- and cell-level clustering using
Harpytogether with FlowSOM. π Tutorial
βοΈ Cell Segmentation
Explore segmentation workflows in
Harpyusing different tools:π‘ Want us to add support for another segmentation method? π Open an issue and let us know!
π§ͺ Single-cell representations from highly multiplexed images and downstream use with PyTorch
Learn how single-cell representations can be generated from highly multiplexed images. These representations can then be used downstream to train classifiers in PyTorch. π Tutorial
π§ Deep Feature Extraction
Discover how
Harpyenables fast, scalable extraction of deep, cell-level features from highly multiplex imaging data with the KRONOS foundation model for proteomics. π Tutorialπ‘ Want us to add support for another deep feature extraction method? π Open an issue and let us know!
π¬ Shallow Feature Extraction
Learn to extract shallow featuresβsuch as mean, median, and standard deviation of intensitiesβfrom multiplex imaging data with
Harpy. π Tutorial
𧬠Spatial Transcriptomics
Learn how to analyze spatial transcriptomics data with
Harpy. We also refer to the SPArrOW documentation.
π Multiple samples and coordinate systems
Learn how to work with multiple samples, intrinsic and micron coordinates. π Tutorial
π Unifying Raster and Vector Annotations
Learn how to convert a segmentation mask (array) into its vectorized form, and segmentation boundaries (polygons) into their rasterized equivalents. This conversion is useful, for example, when integrating annotations (e.g., from QuPath) into downstream spatial omics analysis.π Tutorial
π For a complete list of tutorials, visit the tutorials section.
Learn how to install Harpy.
Run a short, end-to-end example.
Tutorials to help you get up to speed with Harpy.
Learn which technologies Harpy supports.
Find a detailed documentation of Harpy.
Explore Harpyβs benchmark performance.
Learn how to run Harpy in a High-Performance Computing (HPC) environment.
Learn how to contribute to Harpy.
If you use Harpy for spatial proteomics analysis, please cite:
Benjamin Rombaut, Arne Defauw, Frank Vernaillen, Julien Mortier, Evelien Van Hamme, Sofie Van Gassen, Ruth Seurinck, Yvan Saeys. Scalable analysis of whole slide spatial proteomics with Harpy. Bioinformatics (2026), btag122. https://doi.org/10.1093/bioinformatics/btag122
If you use Harpy for spatial transcriptomics analysis, please cite:
Lotte Pollaris, Bavo Vanneste, Benjamin Rombaut, Arne Defauw, Frank Vernaillen, Julien Mortier, Wout Vanhenden, Liesbet Martens, Tinne Thone, Jean-Francois Hastir, Anna Bujko, Wouter Saelens, Jean-Christophe Marine, Hilde Nelissen, Evelien Van Hamme, Ruth Seurinck, Charlotte L. Scott, Martin Guilliams, Yvan Saeys. SPArrOW: a flexible, interactive and scalable pipeline for spatial transcriptomics analysis. https://doi.org/10.1101/2024.07.04.601829