Mutations are double edged sword. On one hand, most new mutations can be harmful. On the other hand, they’re the building blocks of adaptation and evolution, helping organisms survive and thrive over time. Major source of mutations are DNA replication errors despite multiple error avoidance and repair control mechanisms. That’s why it’s crucial to figure out how and where these errors appear in the genomes.
To tackle this, we harnessed powerful evolutionary conserved natural error correction machinery, mismatch repair system. We modulated this machinery in the way that it can detect but do not repair errors. By tagging mismatch repair protein MutL, with fluorescent proteins, we were able to visualize DNA replication errors as they emerge in living cells. By using MutL-ChIP-seq analysis we were able to identify replication error prone genomic region, along with associated genomic features and proteins.
Figure: Visualization DNA replication errors and identification of replication error hotspots.
Left panel. Individual DNA replication errors are detected and quantified using fluorescently labeled proteins: MutL (cyan), which marks replication errors, and DnaN (red), which marks replication forks. This allows precise mapping of replication errors and replication forks within E. coli cells that are not able to repair replication errors.
Middle panel. As replication errors are not corrected, we performed MutL-ChIP-Seq analysis to identify the hotspots of replication errors, highlighting regions that are prone to replication instability.
Right panel. Detailed analysis of MutL binding regions revealed genomic features that are enriched or depleted within these hotspots, providing a comprehensive genome-wide map of replication error hotspots and insights into the molecular determinants of replication fidelity.
Our findings reveal that replication error hotspots are non-randomly distributed along the chromosome and are enriched in sequences with distinct features: lower thermal stability facilitating DNA strand separation, mononucleotide repeats prone to DNA polymerase slippage, and sequences prone to forming secondary structures like cruciform structures, which increase likelihood of DNA polymerase stalling. These hotspots showed enrichment for binding sites of nucleoid-associated proteins and have high transcriptional activity.
In conclusion, our study provides a comprehensive genome-wide map of replication error hotspots, offering a holistic perspective on the intricate interplay between various mechanisms that can compromise the faithful transmission of genetic information.
Reference
Hasenauer, F.C., Barreto, H.C., Lotton, C., and Matic, I. Genome-wide mapping of spontaneous DNA replication error-hotspots using mismatch repair proteins in rapidly proliferating Escherichia coli. Nucleic Acids Res, 2024, gkae1196, https://doi.org/10.1093/nar/gkae1196