This vignette demonstrates how to read and write Seurat
objects using the anndataR
package, leveraging the interoperability between Seurat and
the AnnData format.
Seurat is a
widely used toolkit for single-cell analysis in R. anndataR
enables conversion between Seurat objects and
AnnData objects, allowing you to leverage the strengths of
both the scverse and Seurat ecosystems.
Seurat ObjectUsing an example .h5ad file included in the package, we
will demonstrate how to read an .h5ad file and convert it
to a Seurat object.
library(anndataR)
library(Seurat)
#>
#> Attaching package: 'Seurat'
#> The following object is masked from 'package:SummarizedExperiment':
#>
#> Assays
h5ad_file <- system.file("extdata", "example.h5ad", package = "anndataR")Read the .h5ad file as a Seurat object:
seurat_obj <- read_h5ad(h5ad_file, as = "Seurat")
seurat_obj
#> An object of class Seurat
#> 100 features across 50 samples within 1 assay
#> Active assay: RNA (100 features, 0 variable features)
#> 5 layers present: counts, csc_counts, dense_X, dense_counts, X
#> 2 dimensional reductions calculated: X_pca, X_umapThis is equivalent to reading in the .h5ad file as an
AnnData and explicitly converting:
adata <- read_h5ad(h5ad_file)
seurat_obj <- adata$as_Seurat()
seurat_obj
#> An object of class Seurat
#> 100 features across 50 samples within 1 assay
#> Active assay: RNA (100 features, 0 variable features)
#> 5 layers present: counts, csc_counts, dense_X, dense_counts, X
#> 2 dimensional reductions calculated: X_pca, X_umapAnnData and SeuratFigure @ref(fig:mapping) shows the structures of the
AnnData and Seurat objects and how anndataR
maps between them. It is important to note that matrices in the two
objects are transposed relative to each other.
Mapping between AnnData and Seurat objects
By default, all items in most slots are converted using the same
names. An exception is the varp slot which doesn’t have a
corresponding slot in Seurat. Items in the
varm slot are only converted when they are specified in a
mapping argument. The Neighbors and Images
slots are not mapped when converting from Seurat. See
?as_Seurat for more details on the default mapping.
You can customize the conversion process by providing specific
mappings for each slot in the Seurat object.
Each of the mapping arguments can be provided with one of the following:
TRUE: all items in the slot will be copied using the
default mappingFALSE: the slot will not be copiedSeurat object, the values are the names of the slot in
the AnnData object.See ?as_Seurat for more details on how to customize the
conversion process. For instance:
seurat_obj <- adata$as_Seurat(
layers_mapping = c("counts", "dense_counts"),
object_metadata_mapping = c(metadata1 = "Int", metadata2 = "Float"),
assay_metadata_mapping = FALSE,
reduction_mapping = list(
pca = c(key = "PC_", embeddings = "X_pca", loadings = "PCs"),
umap = c(key = "UMAP_", embeddings = "X_umap")
),
graph_mapping = TRUE,
misc_mapping = c(misc1 = "Bool", misc2 = "IntScalar")
)
seurat_obj
#> An object of class Seurat
#> 100 features across 50 samples within 1 assay
#> Active assay: RNA (100 features, 0 variable features)
#> 3 layers present: counts, dense_counts, X
#> 2 dimensional reductions calculated: pca, umapThe mapping arguments can also be passed directly to
read_h5ad().
Seurat object to a H5AD fileThe reverse conversion is also possible, allowing you to convert the
Seurat object back to an AnnData object, or to
just write out the Seurat object as an .h5ad
file.
This is equivalent to converting the Seurat object to an
AnnData object and then writing it out:
You can again customize the conversion process by providing specific
mappings for each slot in the AnnData object. For more
details, see ?as_AnnData.
Here’s an example:
adata <- as_AnnData(
seurat_obj,
assay_name = "RNA",
x_mapping = "counts",
layers_mapping = c("dense_counts"),
obs_mapping = c(RNA_count = "nCount_RNA", metadata1 = "metadata1"),
var_mapping = FALSE,
obsm_mapping = list(X_pca = "pca", X_umap = "umap"),
obsp_mapping = TRUE,
uns_mapping = c("misc1", "misc2")
)
adata
#> InMemoryAnnData object with n_obs × n_vars = 50 × 100
#> obs: 'RNA_count', 'metadata1'
#> uns: 'misc1', 'misc2'
#> obsm: 'X_pca', 'X_umap'
#> layers: 'dense_counts'
#> obsp: 'connectivities', 'distances'The mapping arguments can also be passed directly to
write_h5ad().
sessionInfo()
#> R version 4.5.2 (2025-10-31)
#> Platform: x86_64-pc-linux-gnu
#> Running under: Ubuntu 24.04.3 LTS
#>
#> Matrix products: default
#> BLAS: /usr/lib/x86_64-linux-gnu/openblas-pthread/libblas.so.3
#> LAPACK: /usr/lib/x86_64-linux-gnu/openblas-pthread/libopenblasp-r0.3.26.so; LAPACK version 3.12.0
#>
#> locale:
#> [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
#> [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
#> [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
#> [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
#> [9] LC_ADDRESS=C LC_TELEPHONE=C
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
#>
#> time zone: Etc/UTC
#> tzcode source: system (glibc)
#>
#> attached base packages:
#> [1] stats4 stats graphics grDevices utils datasets methods
#> [8] base
#>
#> other attached packages:
#> [1] Seurat_5.3.1 reticulate_1.44.0
#> [3] anndataR_1.0.0 SingleCellExperiment_1.33.0
#> [5] SummarizedExperiment_1.41.0 Biobase_2.71.0
#> [7] GenomicRanges_1.63.0 Seqinfo_1.1.0
#> [9] IRanges_2.45.0 S4Vectors_0.49.0
#> [11] BiocGenerics_0.57.0 generics_0.1.4
#> [13] MatrixGenerics_1.23.0 matrixStats_1.5.0
#> [15] SeuratObject_5.2.0 sp_2.2-0
#> [17] BiocStyle_2.39.0
#>
#> loaded via a namespace (and not attached):
#> [1] RColorBrewer_1.1-3 sys_3.4.3 jsonlite_2.0.0
#> [4] magrittr_2.0.4 spatstat.utils_3.2-0 farver_2.1.2
#> [7] rmarkdown_2.30 vctrs_0.6.5 ROCR_1.0-11
#> [10] memoise_2.0.1 spatstat.explore_3.5-3 htmltools_0.5.8.1
#> [13] S4Arrays_1.11.0 curl_7.0.0 Rhdf5lib_1.33.0
#> [16] SparseArray_1.11.1 rhdf5_2.55.7 sass_0.4.10
#> [19] sctransform_0.4.2 parallelly_1.45.1 KernSmooth_2.23-26
#> [22] bslib_0.9.0 htmlwidgets_1.6.4 ica_1.0-3
#> [25] httr2_1.2.1 plyr_1.8.9 plotly_4.11.0
#> [28] zoo_1.8-14 cachem_1.1.0 buildtools_1.0.0
#> [31] igraph_2.2.1 mime_0.13 lifecycle_1.0.4
#> [34] pkgconfig_2.0.3 Matrix_1.7-4 R6_2.6.1
#> [37] fastmap_1.2.0 fitdistrplus_1.2-4 future_1.67.0
#> [40] shiny_1.11.1 digest_0.6.37 patchwork_1.3.2
#> [43] rprojroot_2.1.1 tensor_1.5.1 RSpectra_0.16-2
#> [46] irlba_2.3.5.1 RSQLite_2.4.4 filelock_1.0.3
#> [49] progressr_0.18.0 spatstat.sparse_3.1-0 httr_1.4.7
#> [52] polyclip_1.10-7 abind_1.4-8 compiler_4.5.2
#> [55] here_1.0.2 bit64_4.6.0-1 withr_3.0.2
#> [58] S7_0.2.0 DBI_1.2.3 fastDummies_1.7.5
#> [61] MASS_7.3-65 rappdirs_0.3.3 DelayedArray_0.37.0
#> [64] tools_4.5.2 lmtest_0.9-40 otel_0.2.0
#> [67] httpuv_1.6.16 future.apply_1.20.0 goftest_1.2-3
#> [70] glue_1.8.0 nlme_3.1-168 rhdf5filters_1.23.0
#> [73] promises_1.5.0 grid_4.5.2 Rtsne_0.17
#> [76] cluster_2.1.8.1 reshape2_1.4.4 gtable_0.3.6
#> [79] spatstat.data_3.1-9 tidyr_1.3.1 data.table_1.17.8
#> [82] XVector_0.51.0 spatstat.geom_3.6-0 RcppAnnoy_0.0.22
#> [85] ggrepel_0.9.6 RANN_2.6.2 pillar_1.11.1
#> [88] stringr_1.6.0 spam_2.11-1 RcppHNSW_0.6.0
#> [91] later_1.4.4 splines_4.5.2 dplyr_1.1.4
#> [94] BiocFileCache_3.1.0 lattice_0.22-7 bit_4.6.0
#> [97] deldir_2.0-4 survival_3.8-3 tidyselect_1.2.1
#> [100] maketools_1.3.2 miniUI_0.1.2 pbapply_1.7-4
#> [103] knitr_1.50 gridExtra_2.3 scattermore_1.2
#> [106] xfun_0.54 stringi_1.8.7 lazyeval_0.2.2
#> [109] yaml_2.3.10 evaluate_1.0.5 codetools_0.2-20
#> [112] tibble_3.3.0 BiocManager_1.30.26 cli_3.6.5
#> [115] uwot_0.2.4 xtable_1.8-4 jquerylib_0.1.4
#> [118] Rcpp_1.1.0 spatstat.random_3.4-2 globals_0.18.0
#> [121] dbplyr_2.5.1 png_0.1-8 spatstat.univar_3.1-4
#> [124] parallel_4.5.2 blob_1.2.4 ggplot2_4.0.0
#> [127] dotCall64_1.2 listenv_0.10.0 viridisLite_0.4.2
#> [130] scales_1.4.0 ggridges_0.5.7 purrr_1.2.0
#> [133] rlang_1.1.6 cowplot_1.2.0