Protists as mediators of complex microbial and viral associations
“Microbial eukaryotes, including protists, are known for their important roles in nutrient cycling across different ecosystems. However, the composition and function of protist-associated microbiomes remains largely elusive. Here, we employ cultivation-independent single-cell isolation and genome-resolved metagenomics to provide detailed insights into underexplored microbiomes and viromes of over 100 currently uncultivable ciliates and amoebae isolated from diverse environments. Our findings reveal unique microbiome compositions and hint at an intricate network of complex interactions and associations with bacterial symbionts and viruses. We observed stark differences between ciliates and amoebae in terms of microbiome and virome compositions, highlighting the specificity of protist-microbe interactions. Over 115 of the recovered microbial genomes were affiliated with known endosymbionts of eukaryotes, including diverse members of the Holosporales, Rickettsiales, Legionellales, Chlamydiae, Dependentiae, and more than 250 were affiliated with possible host-associated bacteria of the phylum Patescibacteria. We also identified more than 80 giant viruses belonging to diverse viral lineages, of which some were actively expressing genes in single cell transcriptomes, suggesting a possible association with the sampled protists. We also revealed a wide range of other viruses that were predicted to infect eukaryotes or host-associated bacteria. Our results provide further evidence that protists serve as mediators of complex microbial and viral associations, playing a critical role in ecological networks. The frequent co-occurrence of giant viruses and diverse microbial symbionts in our samples suggests multipartite associations, particularly among amoebae. Our study provides a preliminary assessment of the microbial diversity associated with lesser-known protist lineages and paves the way for a deeper understanding of protist ecology and their roles in environmental and human health.
1 Introduction
Protists, microeukaryotes that are not fungi, plants or animals, play instrumental roles in global ecosystems where they contribute to nutrient cycling and shape structure and function of microbial communities. Heterotrophic protists are well known for their role in grazing, a process in which other microbes are being taken up through phagocytosis and digested. This turnover of microbial biomass ultimately makes nutrients available to higher trophic levels. Grazing isn’t the sole process that protists are involved in. Both heterotrophs and autotrophs (i.e. unicellular algae) play central roles in biomineralization, while autotrophs and mixotrophs contribute to organic carbon fixation.
Recent studies suggest that protists may harbor complex microbiomes. For example, some amoebae have evolved strategies to maintain a regime of bacteria as part of their microbiome as food to ensure a constant nutrient supply. Some of the protist-associated bacteria are resistant to digestion and may even be able to replicate inside their eukaryotic host cells. This ability might lead to host dependency or even protection against pathogens. Further, amoebae represent natural reservoirs for a wider range of human pathogens. This complexity could have far-reaching implications, not only for the surrounding microbial communities and nutrient cycling but also for ecosystem, animal, and plant health.
Despite their importance, our insights into the roles of protists in ecosystems and particularly their interactions with associated microbes and viruses remain limited to a few well-studied groups such as Acanthamoeba and Paramecium. These have received attention due to their medical relevance and their ability to be cultivated under axenic or monoxenic conditions. This leaves broad gaps in our understanding of diversity, function and associations of lesser studied protist lineages even though they make up most branches in the eukaryotic tree of life.
To address these limitations, we collected over 100 individual cells of diverse microbial eukaryotes directly from the environment, including ciliates and testate (i.e. shell-building) amoebae. Most of these organisms are understudied and have not been successfully maintained in culture. Using cultivation-independent single cell isolation, whole genome amplification and genome resolved metagenomics, as well as single cell transcriptomics, we provide insights into the unique composition of the protist microbiome and virome. We uncover associations with a wide range of putative pathogens, including both microbial and eukaryotic symbionts and viruses that infect protists and their associated microbes. Our findings underscore the role of protists as mediators of complex microbial and viral infections in the environment and shed light on the intricate roles that these organisms play within ecological networks.
2 Methods
Sample collection Individual ciliates were isolated from environmental samples obtained from a range of different sampling sites. Amoebae were collected in low-pH bogs and fens and washed off the moss that they inhabit using prefiltered (2 µm filter) bog water after size-selecting over a 300 µm filter to discard large plant material. Arcellinida testate amoebae were then picked under an inverted microscope from the water samples using hand-held glass pipettes. They were transferred to a microscope slide with a drop of freshly filtered bog water in an attempt to wash off obvious contamination sticking to the outside of the shell. Each individual was photo-documented and then transferred to a 0.2 ml tube for transcriptome/genome amplification.
We sampled ciliates either from a small low-pH (pH ~4.5) pond within a local fen or from the intertidal zone of a sandy beach. Samples were filtered over an 80 µm mesh (for sandy samples) or directly poured into small Petri dishes (for pond samples). Ciliates were then observed and hand-picked with glass pipettes under an inverted microscope. Cells were washed by passing through slides of in situ water 2–3 times on depression slides to remove obvious surrounding contaminants (e.g., other non-target micro-eukaryotes, sediment particles). Each individual was diluted with nuclease-free water or prefiltered (0.2 µm filter) in situ water preceding single-cell transcriptome/genome amplification.
Whole genome and transcriptome amplification and sequencing Whole genome amplifications of individual cells were performed using the Repli-g Single-Cell Kit (Qiagen, cat. 150345) following the manufacturer’s instructions. Most samples were incubated for the recommended 8 hours, whereas for a few small ciliates with low expected DNA content (e.g., Cryptopharynx spp. and Wilbertomorpha spp.), we extended the incubation time to 10 to 12 hours.
Single-cell whole transcriptome amplifications were carried out using the SMART-Seq v4 Ultra Low Input RNA Kit for Sequencing (Clontech, cat. 634895, 634896) according to the manufacturer’s instructions. Products of both genome and transcriptome amplifications were purified using the Ampure XP for PCR Purification system (Beckman Coulter). A Qubit 3.0 fluorometer (Invitrogen) was used to measure the DNA concentration.
Sequencing libraries for the transcriptome samples were prepared using the Nextera XT DNA Library Preparation Kit (Illumina). Library preparation for the genomic samples as well as the high-throughput sequencing of all libraries (both for genomes and transcriptomes) was carried out at the genome sequencing center at the University of California, San Diego, or at the Institute for Genome Sciences at the University of Maryland, Baltimore. A HiSeq 4000 (Illumina) sequencing platform was used for the majority of samples. The only exceptions are seven genomic samples—six from Hyalosphenia elegans and one from Loxodes sp.—that were sequenced on a NovaSeq 6000 sequencing system. Samples were sequenced at 10 to 80 million reads per sample.
3 Results
Protist microbiomes harbor microbes across known endosymbiont clades A large proportion of recovered taxa belong to groups of known host-associated bacteria with a facultative or even obligate intracellular lifestyle. Specifically, 115 prokaryotic MAGs grouped with known bacterial endosymbionts and 258 with putative symbionts (Patescibacteria). Alphaproteobacterial endosymbionts were exclusively found in association with ciliates, particularly Megaira and Caedimonadales in Spirostomum, and Paracaedibacterales with Loxodes, Chilodonella, and Halteria. Additionally, several novel and currently uncharacterized lineages within the order Rickettsiales were detected within Loxodes and, to a lesser extent, Chilodonella. Three of the sampled Loxodes cells contained bacteria that grouped in the gammaproteobacterial family UBA6186 together with Azoamicus ciliatocola, a bacterial endosymbiont of ciliates with cosmopolitan distribution that has been shown to generate energy for its host by denitrification. Members of Holosporales and Megaira have previously been associated with different ciliates, but none of the other lineages have been identified as ciliate endosymbionts thus far. In both species of Hyalosphenia sampled in this study, likely host-associated gammaproteobacteria of the family Francisellaceae were found. Further, Diplorickettsia were present in Hyalosphenia elegans, along with Coxiellales-related bacteria in the ciliate Cryptopharynx, and members of Legionellales were present in the ciliates Didinium and Loxodes as well as the testate amoeba Hyalosphenia papilio. For Didinium and Cryptopharynx, gammaproteobacteria were the only associated putative symbionts. Another group of protist symbionts, the phylum Dependentiae, had members from four different families associated with amoebae and ciliates sampled in this study; Chromulinavoraceae were exclusively found with Hyalosphenia papilio (an amoeba that harbors green algal symbionts), while other families were mixed between Hyalosphenia species and Loxodes.
This is the first time Dependentiae have been identified as potential ciliate symbionts. One of the best studied symbiont clades is the phylum Chlamydiota, known to infect a wide diversity of eukaryotic hosts. Here, we identified 56 chlamydial MAGs associated with diverse ciliates and amoebae. Four family-level lineages that consist solely of metagenome-assembled genomes were associated with either Hyalosphenia (f__FEN-1388), Loxodes (f__JAAKFR01), Hyalosphenia and Loxodes (f__JAJFMA01), or Hyalosphenia and Nebela (f__SM23-39). Further, the only bacterial symbiont which was found associated with Trachelocerca was a highly divergent member of the Chlamydiota, potentially representing a novel family or even order-level lineage without any closely related relatives. The amoeba Hyalosphenia was found to be associated with Parachlamydiacaea, a group previously shown to infect different amoebae, particularly Acanthamoeba castellanii. In Acanthamoeba it confers protection against giant virus infection but it is also associated with disease in humans and other animals. Most chlamydial MAGs recovered in this study were associated with four different ciliates (Loxodes) and amoebae (Hyalosphenia and Nebela). These were affiliated with Rhabdochlamydiacae, a group previously shown to be predominantly associated with insects and other metazoans.
In addition to members of well-known intracellular bacteria, we recovered 258 MAGs (25 high quality, 79 medium quality, and 154 low quality) representing members from all major groups of the phylum Patescibacteria, mainly associated with Loxodes, Nebela, and Hyalosphenia papilio, and to a lesser extent, with Hyalosphenia elegans, Halteria, and Chilodonella. Given the reduced genomes of Patescibacteria and other features that may underlie host interaction, it’s plausible that some or all of these might be closely associated with amoebae and ciliates. This aligns with a previous study that provided experimental evidence of an uncharacterized Parcubacterium as an intracellular bacterium in the ciliate Paramecium sp. However, reports also exist of association with other bacteria and of a potential free-living lifestyle for Patescibacteria. Our results indicate some patterns of co-occurrence, such as clades of Paceibacteria composed of MAGs derived from Loxodes. However, the high overall diversity of Patescibacteria MAGs and absence of clear host-specificity pattern hampered any predictions in regards to endosymbiosis.
Giant viruses and virophages are frequently found in protist microbiomes and genes transcribed in situ The microbiomes of the ciliates and amoebae sampled in our study did not only contain sequences of various host-associated bacteria but 82 giant virus metagenome-assembled genomes (GVMAGs). Taxonomic identification with gvclass and phylogenomic analysis revealed that these GVMAGs belonged to diverse lineages within the viral phylum Nucleocytoviricota. Specifically, those associated with Hyalosphenia belonged to several orders, including Asfuvirales, Pandoravirales, Algavirales, and Imitervirales. Viruses associated with Loxodes were highly diverse; however, those linked to Hyalosphenia papilio and Chilodonella were confined to a few clades within Imitervirales. In previous studies, giant viruses have not been found to directly infect ciliates. However, the frequent presence of diverse giant viruses in the ciliates Loxodes and Chilodonella sampled here suggests members of Ciliophora as underappreciated potential hosts for these viruses. For samples where sequences from more than one eukaryote were found, inferring sequence-based putative associations is challenging. For example, members of Algavirales might more likely infect the green algae that are symbiotic to Hyalosphenia and were also detected in the same samples, rather than the Hyalosphenia itself. Further, it has been shown that giant viruses are frequently ingested as food. Such uptake may not lead to an infection in amoebae and ciliates, and viruses may accumulate in the cytoplasm, or in some cases multiple highly similar viruses are taken up at the same time.
To better understand if some of the detected giant virus lineages are actively infecting the protists, we analyzed single-cell transcriptomics on various amoebae and ciliates directly isolated from our sampling sites. Using these data, we were able to confirm gene expression for viruses of the Imitervirales family IM_01 (Mesomimiviridae) in several Hyalosphenia elegans, Hyalosphenia papilio, and Loxodes cells. Further, we found genes of members of the Pimascovirales family PM_01 expressed in Hyalosphenia elegans. In-depth experimental assessment of the protist lineages sampled here—which however are challenging to maintain in the lab—will be required to fully establish a direct connection between giant viruses and a particular protist host.
In addition to giant viruses, we were also able to recover sequences of 33 virophages from the sorted protists, of which 25 had a sufficient number of virophage hallmark genes to be placed into the Lavidaviridae taxonomic framework. Twenty-seven out of 33 virophage sequences were found in protist microbiomes that also contained giant virus genomes. Virophages are known to integrate into their host genomes and become activated when the host encounters giant viruses, which they parasitize, potentially offering protection against giant virus infection. All virophages recovered here were on contigs with a length of 5–26 kb, which is the typical genome size range of known virophages, with none found on longer contigs or surrounded by protist genes. Virophage genes were not found actively expressed in metatranscriptome data from independently sorted similar protists. Nevertheless, our findings suggest that the virophages are probably not integrated into the protist genome, but rather actively engaging in virus–virus interactions.
Protists are hot spots of DNA virus diversity All sampled protists in our study were associated with a large number of other viruses. Most of these belonged to the subfamily Gokushovirinae from the family Microviridae in the order Malgrandaviricetes, which are ssDNA viruses that typically have small genomes and are known to infect host-associated bacteria. Host prediction based on iPHoP indicated a broad range of potential hosts, including Legionellales, Coxiellales, Burkholderiales, Acidaminococcales, and others. To a lesser extent, we found members of the order Caudoviricetes, which are diverse tailed dsDNA viruses associated with free-living bacteria. We also identified other viruses that we could not taxonomically classify but that were predicted to infect intracellular bacteria in the order Chlamydiales. Additionally, we found numerous ssDNA viruses from the Shotokuvirae, most of which belonged to the Cressdnaviricota orders Arfiviricetes and Repensiviricetes, all known to infect a wide range of eukaryotic hosts. Host prediction for eukaryotic viruses is less advanced than for bacterial viruses, so it’s not entirely clear which of the detected viruses may infect the protists or associated eukaryotes, or whether these viruses adhere to the protist surface or reside in the cytoplasm or food vacuoles prior to being degraded. Given the diversity and abundance of detected viruses, it’s conceivable that some indeed infect protist hosts.
Complex multipartite associations in protist microbiomes Our single-cell study suggests that multipartite associations among protists, bacterial symbionts, giant viruses, and other viruses are prevalent. This is particularly true for amoebae SAGs, for which sequences from giant viruses, Chlamydia, Dependentiae, and Gammaproteobacteria affiliated with known intracellular bacteria frequently co-occurred. In ciliate SAGs, the co-occurrence pattern differed, and multipartite associations were mainly predicted in Loxodes sp., consisting of giant viruses and chlamydial and alphaproteobacterial symbionts, and to a lesser extent, Dependentiae or Gammaproteobacteria. There was only a single case (Chilodonella sp.) of a predicted multipartite association that involved three or more interaction partners. For other ciliate lineages, we did not identify multiple interaction partners.
4 Discussion
The overall lower complexity of sequences from microbial symbionts and giant viruses in different ciliate samples can potentially be attributed to two factors: first, different grazing preferences compared to amoebae, and second, the absence of other associated microeukaryotes in the same samples. In contrast, most amoebae were associated with sequences from smaller, often flagellated protists, such as kinetoplastids and chrysophytes. In the case of testate amoeba, washing of their shells is more difficult to achieve compared to ciliates or naked amoebae and may have hindered the complete removal of attached smaller eukaryotes.
Additionally, green algae were frequently detected, especially in Hyalosphenia papilio, which is known to be associated with endosymbiotic Chlorella. Notably, while Chloroviruses are known to associate with Chlorella, we did not recover any giant viruses from Hyalosphenia samples containing Chlorella. In contrast, we sampled three Hyalosphenia elegans cells that were not associated with any other eukaryotes but were each associated with multiple giant viruses.
Next, we tested how different factors shape the uniqueness and richness of distinct microbial and viral groups within protist microbiomes. Specifically, the sampling site was a key driver of uniqueness for free-living bacteria and giant viruses (Nucleocytoviricota), while host taxonomy most strongly influenced uniqueness in putative endosymbionts and Patescibacteria. In contrast, microbiome richness was mainly linked to host taxonomy and sequencing depth. Morphology, cell size, and lifestyle had moderate but variable effects. These findings reinforce that both host traits (e.g., taxonomy, morphology) and environmental conditions (e.g., sampling site) collectively shape the complexity of protist-associated microbial and viral communities.
Taken together, our analysis supports the notion that protists serve as powerful drivers of multipartite interactions and are tightly linked to diversity, specificity, and ecological roles of their bacterial and viral partners.
5 Conclusion
In previous studies, protists have been identified as hosts for diverse lineages of new endosymbionts, but such findings often relied on morphological descriptions following isolation. Here, we used cultivation-independent sequencing approaches to sample a diversity of uncultivable protists. In our more than 100 datasets, the diversity of microbes affiliated with known microbial symbionts is unparalleled, as is the diversity of identified giant viruses and other smaller viruses, such as Arfiviricetes and Repensiviricetes that infect eukaryotes and Gokushovirinae that may infect associated symbionts. The frequent co-occurrence of sequences from diverse giant viruses, with genes found to be expressed in situ, and microbial symbionts likely representing multipartite associations, is particularly intriguing.
Previously, Acanthamoeba and ciliate species have been highlighted as evolutionary melting pots and potential training grounds for pathogens of multicellular eukaryotes. Our findings provide strong support for this hypothesis and call for further experimental work to study the microeukaryotic microbiome and virome. Understanding their roles in shaping protist populations and surrounding microbial communities—not only from an evolutionary perspective, but also in the ecological context of contributing to ecosystem dynamics through nutritional symbioses and pathogenicity—is crucial.