What is the best way to normalize RNA-seq count data before differential expression analysis?

Question

I'm doing differential expression analysis with DESeq2 in R. I have raw count data from featureCounts. Should I normalize the counts before passing them to DESeq2, or does DESeq2 handle this internally?

Also, what's the difference between TMM, TPM, RPKM, and DESeq2's own normalization?

Admin · Accepted Answer

**Do NOT pre-normalize your counts before DESeq2.** DESeq2 expects raw integer counts and does its own normalization internally using the median-of-ratios method.

```r
library(DESeq2)

# Load raw counts (NOT TPM or RPKM)
counts