Journal of the American Chemical Society, Vol.136, No.52, 18111-18119, 2014
Mining Soil Metagenomes to Better Understand the Evolution of Natural Product Structural Diversity: Pentangular Polyphenols as a Case Study
Sequence-guided mining of metagenomic libraries provides a means of recovering specific natural product gene clusters of interest from the environment. In this study, we use ketosynthase gene (KS) PCR amplicon sequences (sequence tags) to explore the structural and biosynthetic diversities of pentangular polyphenols (PP). In phylogenetic analyses, eDNA-derived sequence tags often fall between closely related clades that are associated with gene clusters known to encode distinct chemotypes. We show that these common intermediate sequence tags are useful for guiding the discovery of not only novel bioactive metabolites but also collections of closely related gene clusters that can provide new insights into the evolution of natural product structural diversity. Gene clusters corresponding to two eDNA-derived KS beta sequence tags that reside between well-defined KS beta clades associated with the biosynthesis of (C24)-pradimicin and (C26)-xantholipin type metabolites were recovered from archived soil eDNA libraries. Heterologous expression of these gene clusters in Streptomyces albus led to the isolation of three new PPs (compounds 13). Calixanthomycin A (1) shows potent antiproliferative activity against HCT-116 cells, whereas arenimycins C (2) and D (3) display potent antibacterial activity. By comparing genotypes and chemotypes across all known PP gene clusters, we define four PP subfamilies, and also observe that the horizontal transfer of PP tailoring genes has likely been restricted to gene clusters that encode closely related chemical structures, suggesting that only a fraction of the natural product-like chemical space that can theoretically be encoded by these secondary metabolite tailoring genes has likely been sampled naturally.