For instance, only keep cells with at least min_counts counts or min_genes genes expressed. This is to filter measurement outliers, i.e., "unreliable" observations.
Only provide one of the optional parameters min_counts, min_genes, max_counts, max_genes per call.
More details on the scanpy documentation
Keep genes that have at least min_counts counts or are expressed in at least min_cells cells or have at most max_counts counts or are expressed in at most max_cells cells.
Only provide one of the optional parameters min_counts, min_cells, max_counts, max_cells per call.
More details on the scanpy documentation
More details on the scanpy documentation
It expects logarithmized data.
Depending on flavor, this reproduces the R-implementations of Seurat or Cell Ranger. The normalized dispersion is obtained by scaling with the mean and standard deviation of the dispersions for genes falling into a given bin for mean expression of genes. This means that for each bin of mean expression, highly variable genes are selected.
More details on the scanpy documentation
Downsample counts so that each cell has no more than target_counts. Cells with fewer counts than target_counts are unaffected by this. This has been implemented by M. D. Luecken.
More details on the scanpy documentation