Skip to main content
メニュー
Revvity logo
Contact us
JP
Revvity Sites Globally

Select your location.

*e-commerce not available for this region.

australia.webp Australia
austria.webp Austria
belgium.webp Belgium
brazil.webp Brazil *
canada.webp Canada
china.webp China *
denmark.webp Denmark
finland.webp Finland
france.webp France
germany.webp Germany
hong-kong.webp Hong Kong (China) *
india.webp India *
ireland.webp Ireland
italy.webp Italy
japan.webp Japan *
luxembourg.webp Luxembourg
mexico.webp Mexico *
netherlands.webp Netherlands
norway.webp Norway
philippines.webp Philippines *
republic of korea.webp Republic of Korea *
singapore.webp Singapore *
spain.webp Spain
sweden.webp Sweden
switzerland.webp Switzerland
thailand.webp Thailand *
uk.webp United Kingdom
usa.webp United States
Breadcrumb
...
  • ホーム
  • Blog
  • NGS
  • Optimizing small RNA-Seq with spike-in controls.
Optimizing Small RNA-Seq with Spike-In Controls

Blog

NGS NGS OEM & Custom Solutions

Jul 22nd 2025

5 min read

Optimizing small RNA-Seq with spike-in controls.

Help us improve your Revvity blog experience!

Feedback

Small RNA-seq provides detailed profiles of microRNAs, small interfering RNAs, and various other short noncoding RNAs that play critical roles in gene regulation, development, and disease. These molecules, often present at low copy numbers, can serve as biomarkers or functional regulators, making accurate quantification essential. Technical variation during extraction, ligation, reverse transcription, and PCR can skew observed small RNA abundances relative to biology. For example, adapter ligation efficiency varies depending on sequence composition and secondary structure, masking true biological differences. Without suitable internal references, distinguishing technical noise from genuine changes in small RNA levels can be challenging.

Synthetic spike-in controls (exogenous RNA oligomers or fragmented RNAs of known sequence and concentration) can be introduced during sample processing to address these challenges. Spike-ins can monitor ligation and amplification biases and enable absolute quantification of endogenous small RNAs. In biofluids with extremely low RNA content, such as plasma or cerebrospinal fluid, spike-ins provide indispensable benchmarks. In large multi-site projects, they help reconcile data generated across different laboratories or platforms by flagging outlier libraries and adjusting for batch effects.

Beyond bias correction, spike-ins facilitate construction of calibration curves: plotting observed read counts against known input amounts allows estimation of absolute copy numbers for endogenous small RNAs, rather than relying solely on relative metrics like reads per million. This capability is relevant when total small RNA levels vary substantially between conditions (for instance, during developmental transitions or in disease states that remodel the small RNA landscape).

This blog reviews recent developments on the design, implementation, and interpretation of spike-in controls in small RNA seq experiments. By highlighting both the utility and limitations of spike-ins, we aim to guide researchers in integrating these controls to achieve accurate, reproducible insights into small RNA biology.

Traditional normalization approaches

In the absence of spike-ins, most small RNA-seq data is normalized using relative methods such as total read counts (e.g., reads per million, RPM) or trimmed mean of M values (TMM). These strategies assume that the overall small RNA population remains constant between samples, which often does not hold true. For example, plasma samples from cancer patients can have a global shift in miRNA composition with respect to healthy controls, meaning that scaling by total mapped reads can obscure genuine biological changes. While endogenous reference RNAs such as U6 snRNA or certain housekeeping miRNAs are sometimes used, their expression can also vary with sample type, making them less reliable.

Because of these limitations, synthetic spike-in mixes are becoming increasingly important. Unlike endogenous references, they provide an external, invariant baseline across experiments, enabling conversion of sequencing reads into absolute molecular counts. Adding spike-in controls after extraction, before library prep, isolates biases from adapter ligation, reverse transcription, and PCR. Selecting optimal concentrations

Spike-in concentrations must bracket the expected abundance range of endogenous small RNAs. Overloading libraries with high-abundance spike-ins can consume sequencing capacity, compromising detection of low-expression species. Conversely, spike-ins below detection thresholds provide little normalization value. A typical approach employs a dilution series spanning 102–108 molecules per reaction; after pilot runs, researchers select concentrations that yield midrange read counts corresponding to typical miRNA copy numbers for the sample type. Commercial mixes like miND® Spike-in Controls are pre-optimized for this range, with validated performance across diverse sample types including biofluids, cells, and FFPE tissue.

Synthetic spike-ins differ from native small RNAs in sequence composition and lack natural modifications (e.g., 2′-O-methylation), so residual biases may persist after normalization. To mitigate this, a diverse panel of spike-ins with varied GC content, lengths, and predicted secondary structures is recommended. When total small RNA content is expected to vary dramatically, combining spike-in normalization with endogenous reference RNAs (e.g., U6 snRNA) yields more reliable quantification.

Use cases

Biofluids are to this date the most common use case for spike-ins. Circulating miRNAs in plasma, serum, and urine serve as minimally invasive biomarkers for diseases ranging from cancer to cardiovascular disorders. However, biofluids typically yield low RNA quantities (<10 ng/mL) and can contain inhibitors like heparin that impede downstream enzymatic steps.

Formalin-fixed paraffin-embedded (FFPE) tissues are invaluable for retrospective analyses but yield fragmented, chemically modified RNA. Incorporating spike-ins after extraction quantifies library preparation biases. It has been shown that spike-in normalization improved representation of miRNA families in FFPE-derived libraries, facilitating differential expression analyses in archived tumor specimens.

miRNA expression patterns shift rapidly during embryogenesis. Synthetic spike-ins at known concentrations have been introduced at five zebrafish developmental stages up to 5 hours post-fertilization, revealing stage-specific miRNA waves obscured by RPM-based normalization. In human organoid differentiation models similar strategies have been applied to quantify absolute expression changes, uncovering subtle miRNA dynamics linked to lineage specification.

More recently, the adoption of miND® spike-ins has enabled multi-site extracellular vesicle studies to harmonize datasets across different laboratories.

Limitations
  • Incomplete mimicry of endogenous RNAs: Synthetic spike-ins lack natural modifications (e.g., 2′-O-methylation) and may not fully capture the behavior of endogenous small RNAs during ligation or reverse transcription.
  • Sequencing resource allocation: High-concentration spike-ins can dominate libraries, limiting coverage of low-abundance endogenous species.
  • Experimental complexity: Designing, titrating, and incorporating custom-made spike-ins adds cost and labor, which may be challenging for laboratories with limited resources.
Conclusion

Spike-in controls are an essential companion for rigorous small RNA-seq workflows. They provide objective measures of library preparation bias, facilitate absolute quantification, and enable reliable comparisons across samples and studies. Their importance is magnified in challenging contexts, where technical variation can overwhelm biological signals. By moving beyond relative normalization methods and integrating solutions like the miND® spike-in controls into both laboratory protocols and data analysis pipelines, researchers can achieve more precise and reproducible insights into small RNA biology, which is particularly relevant for translational research and biomarker discovery.

Learn more
References:
  1. Axtell, M. J. (2013). Classification and comparison of small RNAs from plants. Annu. Rev. Plant Biol., 64, 137–159. doi:10.1146/annurev-arplant-050312-120043
  2. Tanić M, et al (2014). miRNA expression profiling of formalin-fixed paraffin-embedded (FFPE) hereditary breast tumors. Genom Data. 3:75-9. doi:10.1016/j.gdata.2014.11.008.
  3. Etheridge, A., et al (2011). Extracellular microRNA: a new source of biomarkers. Mutat. Res. Rev. Mutat. Res., 717(1–2), 85–90. doi:10.1016/j.mrfmmm.2011.03.004.
  4. Lenart, M., et al. (2024). Identification of miRNAs Present in Cell- and Plasma-Derived Extracellular Vesicles—Possible Biomarkers of Colorectal Cancer. Cancers (Basel). 16(13):2464. doi: 10.3390/cancers16132464.
  5. Pultar, M., Oesterreicher, J., Hartmann, J., Weigl, M., Diendorfer, A., Schimek, K., … Holnthoner, W. (2024). Analysis of extracellular vesicle microRNA profiles reveals distinct blood and lymphatic endothelial cell origins. Journal of Extracellular Biology, 3(1), e134. https://doi.org/10.1002/jex2.134
  6. Hafner, M. et al. (2011). RNA-ligase-dependent biases in miRNA representation in deep-sequenced small RNA cDNA libraries. RNA, 17, 1697–1712. doi: 10.1261/rna.2799511.
  7. McDonald, J. S., et al (2011). Analysis of circulating microRNA: preanalytical and analytical challenges. Clin. Chem., 57(6), 833–840. doi:10.1373/clinchem.2010.157198
  8. Lutzmayer, S., Enugutti, B. & Nodine, M.D. (2017). Novel small RNA spike-in oligonucleotides enable absolute normalization of small RNA-Seq data. Sci Rep., 7, 5913 doi:10.1038/s41598-017-06174-3
  9. Mestdagh, P., et al. (2009). A novel and universal method for microRNA RT-qPCR data normalization. Genome Biol., 10(6), R64. doi:10.1186/gb-2009-10-6-R64.
  10. Morin, R. D. et al. (2008). Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res., 18(4), 610–621. doi:10.1101/gr.7179508.
  11. Tam, S., et al (2015), Optimization of miRNA-seq data preprocessing, Briefings in Bioinformatics, 16(6), 950–963, doi:10.1093/bib/bbv019.
  12. Wang, K. et al. (2012). Comparing the microRNA spectrum between serum and plasma. PLoS ONE, 7(7), e41561. doi: 10.1371/journal.pone.0041561.
  13. Wei, C., Salichos, L., Wittgrove, C. M., Rokas, A., & Patton, J. G. (2012). Transcriptome-wide analysis of small RNA expression in early zebrafish development. RNA, 18(5), 915–929. https://doi.org/10.1261/rna.029090.111.
  14. Locati, M.D., et al (2015), Improving small RNA-seq by using a synthetic spike-in set for size-range quality control together with a set for data normalization, Nucleic Acids Research, 43(14) 18 e89, doi:10.1093/nar/gkv303.

Help us improve your Revvity blog experience!

Feedback

Share this post:

  • Email
  • Facebook
  • Linkedin
  • Twitter

続き NGS posts

Programmable depletion with CRISPR–Cas9: what DASH is, why it works, and where to use it.
Read
Variant calling through bulk RNA-seq is growing up: implications for oncology introduction.
Read
Next GeneSiS: cfDNA sequencing outperforms blood culture in sepsis.
Read
line

Questions?
We’re here to help.

Contact us
Revvity Logo

©2025 Revvity - All rights reserved

Revvity is a trademark of Revvity, Inc. All other trademarks are the property of their respective owners.