Abstract
Nucleoside analogues like 4-thiouridine (4sU) are used to metabolically label newly synthesized RNA. Chemical conversion of 4sU before sequencing induces T-to-C mismatches in reads sequenced from labelled RNA, allowing to obtain total and labelled RNA expression profiles from a single sequencing library. Cytotoxicity due to extended periods of labelling or high 4sU concentrations has been described, but the effects of extensive 4sU labelling on expression estimates from nucleotide conversion RNA-seq have not been studied. Here, we performed nucleotide conversion RNA-seq with escalating doses of 4sU with short-term labelling (1h) and over a progressive time course (up to 2h) in different cell lines. With high concentrations or at later time points, expression estimates were biased in an RNA half-life dependent manner. We show that bias arose by a combination of reduced mappability of reads carrying multiple conversions, and a global, unspecific underrepresentation of labelled RNA emerging during library preparation and potentially global reduction of RNA synthesis. We developed a computational tool to rescue unmappable reads, which performed favourably compared to previous read mappers, and a statistical method, which could fully remove remaining bias. All methods developed here are freely available as part of our GRAND-SLAM pipeline and grandR package.
Original language | English |
---|---|
Article number | gkae120 |
Pages (from-to) | 1-16 |
Number of pages | 16 |
Journal | Nucleic Acids Research |
Volume | 52 |
Issue number | 7 |
DOIs | |
Publication status | Published - 24 Apr 2024 |
Externally published | Yes |
Field of Science*
- 1.4 Chemical sciences
- 3.1 Basic medicine
- 1.6 Biological sciences
Publication Type*
- 1.1. Scientific article indexed in Web of Science and/or Scopus database