Nature, Vol.588, No.7836, 141-+, 2020
Widespread endogenization of giant viruses shapes genomes of green algae
The authors show that large endogenous viral elements derived from giant viruses are prominent components of green algal genomes. Endogenous viral elements (EVEs)-viruses that have integrated their genomes into those of their hosts-are prevalent in eukaryotes and have an important role in genome evolution(1,2). The vast majority of EVEs that have been identified to date are small genomic regions comprising a few genes(2), but recent evidence suggests that some large double-stranded DNA viruses may also endogenize into the genome of the host(1). Nucleocytoplasmic large DNA viruses (NCLDVs) have recently become of great interest owing to their large genomes and complex evolutionary origins(3-6), but it is not yet known whether they are a prominent component of eukaryotic EVEs. Here we report the widespread endogenization of NCLDVs in diverse green algae; these giant EVEs reached sizes greater than 1 million base pairs and contained as many as around 10% of the total open reading frames in some genomes, substantially increasing the scale of known viral genes in eukaryotic genomes. These endogenized elements often shared genes with host genomic loci and contained numerous spliceosomal introns and large duplications, suggesting tight assimilation into host genomes. NCLDVs contain large and mosaic genomes with genes derived from multiple sources, and their endogenization represents an underappreciated conduit of new genetic material into eukaryotic lineages that can substantially impact genome composition.