MapMyCells

MapMyCells allows users to assign cell type names from Allen Institute-hosted taxonomies to their own single cell and spatial transcriptomics data. It transforms cell types from a concept in publications to a tool for public research.

Such data integration allows users to leverage additional community knowledge and data visualizations in their own research.

What is MapMyCells

MapMyCells transforms cell types from a concept in publications to a tool for public research. Scientists worldwide can discover what cell types their transcriptomics and spatial data corresponds with by comparing their data to massive, high-quality reference datasets.

One key advantage to MapMyCells is scale: using our cloud-based Brain Knowledge Platform and reference datasets with millions of cells, researchers can provide up to 327 million cell-gene pairs from their own data, which is a huge leap forward for working with whole-brain datasets.

MapMyCells will enable the neuroscience community to

  • Compare their own data to massive, high-quality, and high-resolution cell type taxonomies.
  • Speed up the creation of brain reference atlases by facilitating the integration of datasets from the scientific community with a shared reference.

Data Usage and Privacy

Allen Institute does not use, retain, or aggregate any data uploaded to MapMyCells for its own internal purposes, nor will we publish your data publicly. Allen Institute database administrators can access any uploaded dataset for debugging and other error remediation purposes. All files will be deleted one week after upload. Please do not submit any sensitive data, personally identifiable data, or protected health data that could put an individuals' privacy at risk into MapMyCells. See the Allen Institute Privacy Policy for more information on our privacy practices.

Available taxonomies

10x Whole mouse brain taxonomy (CCN20230722)

A high-resolution transcriptomic and spatial atlas of cell types in the whole mouse brain.

It contains 5,322 clusters that are organized in a hierarchical manner with nested groupings of 34 classes, 338 subclasses, 1,201 supertypes and 5,322 types/clusters.

10x Human MTG SEA-AD taxonomy (CCN20230505)

A high-resolution transcriptomic atlas of cell types from middle temporal gyrus from the SEA-AD aged human cohort that spans the spectrum of Alzheimer’s disease.

It is hierarchically organized into nested levels of classification: 3 classes, 24 subclasses, and 139 supertypes.

10x Whole human brain taxonomy (CCN20240330)

Transcriptomic diversity of cell types in adult human brain.

It is clustered into hierarchical groups of 31 superclusters, 461 clusters, and 3313 subclusters.

Consensus Basal 
Ganglia taxonomy (CCN20250428)

A multi-species, multi-omic cell type atlas of the primate basal ganglia that integrates over two million single nuclei from eight major basal ganglia structures profiled using single-nucleus RNA-seq, ATAC-seq, spatial transcriptomics, morphology, and electrophysiology.

LEARN MORE ABOUT AVAILABLE TAXONOMIES
View details about available taxonomies, their citations, and access to their underlying data.

Available algorithms

Correlation Mapping
Given reference clusters, select the cluster of the minimal distance to the query data using pre-calculated marker genes with correlation as distance metric.




Hierarchical Correlation Mapping
Given the hierarchy tree of reference clusters, traverse the tree down to the terminal cluster selecting the branch with the minimal distance to the query data using pre-calculated marker genes with correlation as distance metric.

This is the default algorithm for all taxonomies except the 0x Human MTG SEA-AD taxonomy.
Deep Generative mapping
Deep Generative Mapping is a deep generative model algorithm for mapping snRNA-seq data sets and putting those data into the same latent space as data from Allen Institute-hosted reference taxonomies.

This is only available for and is the default for mapping to the 10x Human MTG SEA-AD taxonomy.
LEARN MORE ABOUT AVAILABLE ALGORITHMS
View details about available algorithms, benchmarking information, and usage recommendations.

File inputs and outputs

Inputs

MapMyCells

  • can map any matrix in which rows are "cells" and columns are genes
  • expects genes in user data to match the species of the reference taxonomy
  • has a file limit of 2 GB
  • can accept h5ad files or csv files as input

Outputs

MapMyCells produces the following output files. A “standard” CSV output file and an “extended” JSON output file. These files are archived into a single .zip file for download.

  • validation_log.txt: Log of messages produced by job. Even returned for failed jobs. Useful for debugging. If the mapping failed, this is probably the file you want.
  • my_job.csv: Returned by all algorithms. CSV table of mapping results. If the mapping worked, this is probably the file you want.
  • my_job.json: Only returned by Hierarchical and Flat mapping. More detailed results and metadata stored in a JSON file.
  • my_job_summary_metadata.json: JSON file recording number of cells mapped to cell types and number of genes mapped to Ensembl IDs.
LEARN MORE ABOUT  INPUT AND OUTPUT FILE FORMATS
View details about how to generate a compatible input files, convert genes between species, and interpret output files.

Using MapMyCells

We recommend first time users start with the step-by-step guide, which describes file formats, use cases, and additional options.

MapMyCells user interface

MapMyCells includes a drag-and-drop interface for users to assign cell type names from the above taxonomies to their own data. Files up to 2GB or about 150,000 cells can be processed through the browser.

Run the code on your own

This code provides a python package for mapping single sell RNA sequencing data onto a cell type taxonomy such as that provided by the Allen Institute for Brain Science.

Support resources

A step-by-step guide to MapMyCells
Learn how to use MapMyCells in three simple steps: 1) preparing your data for mapping, 2) mapping your data to a reference, and 3) using mapping results.
Get help in the Community Forums
Ask your question in the dedicated support section for MapMyCells. Get help from Allen Institute scientists and other community members.

MapMyCells webinar

MapMyCells video tutorial