What it does
STARSolo is a turnkey solution for analyzing droplet single cell RNA sequencing data (e.g. 10X Genomics Chromium System) built directly into STAR code. STARsolo takes raw FASTQ reads files as input, and performs the following operations:
- Error correction and demultiplexing of cell barcodes using user-input whitelist
- Mapping the reads to the reference genome using the standard STAR spliced read alignment algorithm
- Error correction and collapsing (deduplication) of Unique Molecular Identifiers (UMIs)
- Quantification of per-cell gene expression by counting the number of reads per gene
STARsolo output is designed to be a drop-in replacement for 10X CellRanger gene quantification output. It follows CellRanger logic for cell barcode whitelisting and UMI deduplication, and produces nearly identical gene counts in the same format. At the same time STARsolo is 10 times faster than CellRanger.