ABSTRACT
Proteins containing repetitive amino acid domains are widespread in all life forms. In parasitic organisms, proteins containing repeats play important roles such as cell adhesion and invasion and immune evasion. Therefore, extracellular and intracellular parasites are expected to be under different selective pressures regarding the repetitive content in their genomes. Here, we investigated whether there is a bias in the repetitive content found in the predicted proteomes of 6 exclusively extracellular and 17 obligate intracellular protozoan parasites, as well as 4 free-living protists. We also attempted to correlate the results with the distinct ecological niches they occupy and with distinct protein functions. We found that intracellular parasites have higher repetitive content in their proteomes than do extracellular parasites and free-living protists. In intracellular parasites, these repetitive proteins are located mainly at the parasite surface or are secreted and are enriched in amino acids known to be part of N- and O-glycosylation sites. Furthermore, in intracellular parasites, the developmental stages that are able to invade host cells express a higher proportion of proteins with perfect repeats relative to other life cycle stages, and these proteins have molecular functions associated with cell invasion. In contrast, in extracellular parasites, degenerate repetitive motifs are enriched in proteins that are likely to play roles in evading host immune response. Altogether, our results support the hypothesis that both the ability to invade host cells and to escape the host immune response may have shaped the expansion and maintenance of perfect and degenerate repeats in the genomes of intra- and extracellular parasites.