Chapter 2 Introduction

Spatial transcriptomics has emerged as one of the most powerful tools for gaining biological insights, enabling researchers to uncover intricate relationships between gene expression patterns and tissue architecture. Recent advances in the field have resulted in a variety of new platforms, including Visium, Visium HD, and Xenium from 10x Genomics, as well as GeoMx and CosMx from Bruker/NanoString Technologies, which has now been acquired by the Bruker Corporation. However, the existence of diverse spatial transcriptomics platforms and various data formats poses challenges in standardizing data analysis. Thus, there remains a critical gap in the availability of a comprehensive pipeline capable of conducting end-to-end analysis that is necessary to extract biological insights from multiple spatial transcriptomics platforms.

Here, we present SpaceSequest, a tailored pipeline that utilizes cutting-edge computational methodologies to conduct a thorough analysis, enabling the extraction of crucial biological insights from five major spatial transcriptomics technologies. SpaceSequest performs

  • standardized quality control and general data processing.

  • key analyses customized for each spatial platform

  • automated cell type annotation and deconvolution

  • high-quality figure and analysis result generation. In addition, SpaceSequest allows for smooth integration with cellxgene VIP and Quickomics for user-friendly data access and interactive visualization.

SpaceSequest is a unified pipeline for spatial transcriptomics data analysis. It currently handles a wide range of data types generated from five spatial transcriptomics platforms, namely, 10x Genomics Visium, Visium HD, Xenium, Bruker/NanoString GeoMx and CosMx.

This pipeline contains the following programs:

  • Visium

    Our pipeline analyzes 10x Genomics Visium data. It takes SpaceRanger output directory as input, together with a config.yml file. The output will be saved in multiple folders corresponding to the name of the software used, such as, BayesSpace, c2l (cell2location), SpaGCN, SpaTalk, and tangram.

  • VisiumHD

    Our pipeline performs analyses related to Visium HD spatial transcriptomics data. It starts the analysis from SpaceRanger bin-level outputs, and runs a series of steps for QC, cell type annotation, and clustering analysis.

  • Xenium

    Our pipeline directly takes Xenium data as input, runs through gene/cell level quality control and various downstream analyses.

  • CosMX

    Our pipeline performs CosMx data analysis. This includes quality control, cell type annotation (label transfer from a reference dataset), and clustering analysis.

  • GeoMX

    Our pipeline takes GeoMx .dcc files as input, performs sample- and probe-level quality control, and runs a series of steps to call differentially expressed genes. The output will be summarized into Quickomics format for further interactive exploration.

In summary, this pipeline has the ability to analysis the following spatial transcriptomics data platforms: - 10x Genomics Visium: https://www.10xgenomics.com/platforms/visium - 10x Genomics Xenium: https://www.10xgenomics.com/platforms/xenium - 10x Genomics Visium HD: https://www.10xgenomics.com/products/visium-hd-spatial-gene-expression - NanoString GeoMx: https://nanostring.com/products/geomx-digital-spatial-profiler/geomx-dsp-overview/ - NanoString CosMx: https://nanostring.com/products/cosmx-spatial-molecular-imager/