Interpreting missense mutations in Human TRIM5alpha by computational methods
BMC Research Notes volume 1, Article number: 116 (2008)
The human restriction factor TRIM5α may play an important role in regulation of the human immunodeficiency virus (HIV). It is unclear whether non-synonymous single nucleotide polymorphisms (nsSNP) in TRIM5α affect the clinical course of HIV infection.
We surveyed the literature for TRIM5α nsSNPs and used comparative sequence analysis to predict the effect of each polymorphism on protein function. Twenty-eight nsSNPs were identified with available functional data, clinical data, or both. The four comparative method programs assessed included SIFT, PolyPhen, A-GVGD, and average BLOSUM62 pairwise score. Two common polymorphisms, H43Y and R136Q, were predicted to be benign based on comparative sequence analysis. The nsSNPs P323R, K324N, I328M, G330Q, R332P, I348V, and T369S were all predicted to affect protein function.
Comparative sequence analysis offers a functional tool to analyze unknown nsSNPs in TRIM5α.
Human immunodeficiency virus type 1 (HIV-1) infection depends on both viral and human genetic factors . Single nucleotide polymorphisms (SNP) in different immune-modulation genes have been shown to affect susceptibility and progression of disease. Of these, the restriction factors APOBEC3F , APOBEC3G , and TRIM5α  are innate immune proteins that affect postentry steps in HIV-1 replication and confer resistance to retroviruses in other species.
The tri partite m otif restriction factor TRIM5α is a cytoplasmic and nucleolic protein that restricts viral infection by interfering with the capsid protein, promoting premature disassembly . TRIM5α has been studied in primates where it has been shown to be extremely effective at inhibiting HIV-1 and other lentiviruses [6, 7]. The restriction factor is composed of several regions: the RING domain, B-boxes, a coiled-coil domain, and a carboxy-terminal SPRY (B30.2) domain . The SPRY domain defines antiretroviral activity of TRIM5α and amino acids in this region show a high degree of positive selection based on sequence comparison in primates [9, 10]. The variation in the SPRY domain is responsible for the specificity of TRIM5α in primates, but not humans, to effectively restrict HIV-1. The RING domain contributes to the antiviral activity, but the exact function remains unknown [8, 10, 11]. Different studies have analyzed TRIM5α nsSNPs in both HIV-1 infected and non-infected populations [12–16]. The affect of TRIM5α polymorphisms on protein function and clinical course of HIV-1 remains controversial. Studies have revealed conflicting results secondary to variations in functional assays, lack of power in clinical cohorts, and possible linkage disequilibrium between alleles.
Comparative sequence analysis is a powerful technique that can predict whether an nsSNP is likely to affect protein function. These methods rely on the fact that critical residues for function are conserved across different genomes and should not vary [17–19]. These amino acids may directly participate in enzymatic reaction or have an important role in secondary or tertiary structure. Likewise, residues that are not vital would be subject to increased variation with little to no affect on protein function. Recently, we studied the accuracy of four methods using comparative sequence analysis to predict the affect of nsSNPs on protein function : (1) SIFT (Sorting Intolerant from Tolerant, http://blocks.fhcrc.org/sift/SIFT.html) ; (2) PolyPhen (Polymorphism Phenotyping, http://genetics.bwh.harvard.edu/pph) ; (3) A-GVGD (Grantham Variance-Grantham Difference, http://agvgd.iarc.fr) ; (4) Average BLOSUM62 pairwise score .
The accuracy of any one method for predicting a non-synonymous SNP as either deleterious (affecting protein function) or tolerant (benign) is approximately 80%. When all four methods agree, the predictive value is greater than 90% . The goals of this study are to: (1) Predict the affect of TRIM5α nsSNPs using comparative sequence analysis and compare our results to known in-vitro and clinical data; (2) Identify mutations that are likely to affect TRIM5α protein function and may warrant further investigation in clinical cohorts and functional assays.
Creation of multiple sequence alignments
Amino acid sequence alignments were constructed using the standard program ClustalW as previously described . Homologs of genes of interest were retrieved from GenBank after BLAST searches using the human sequence as the query. Alignments are available as supplemental material.
The TRIM5α sequence alignment consisted of 40 sequences: Homo sapiens (gi 48994821), Pan paniscus (gi 122145800), Pan troglodytes (gi 60593103), Gorilla gorilla (gi 56480705), Pongo pygmaeus (gi 122143969), Pongo abelii (gi 75060761), Symphalangus syndactylus (gi 122143726), Nomascus leucogenys (gi 156079722), Hylobates lar (gi 122143029), Colobus guereza (gi 75060798), Macaca mulatto (gi 62548080), Bunopithecus hoolock (gi 122144995), Pygathrix nemaeus (gi 75060797), Cercocebus torquatus (gi 118772044), Macaca fascicularis (gi 75060455), Papio anubis (gi 162951988), Macaca assamensis (gi 122144997), Macaca nemestrina (gi 122146076), Erythrocebus patas (gi 75060791), Chlorocebus aethiops (gi 48994825), Cercopithecus tantalus (gi 47559193), Chlorocebus pygerythrus (gi 75060767), Erythrocebus patas (gi 58379053), Callithrix jacchus (gi 167427342), Callithrix pygmaea (gi 75060793), Pithecia pithecia (gi 75060790), Saguinus oedipus (gi 122145799), Saguinus labiatus (gi 75060764), Callicebus donacophilus (gi 75060786), Saimiri boliviensis (gi 58379043), Saimiri sciureus (gi 75060788), Ateles geoffroyi (gi 75060789), Lagothrix lagotricha (gi 75060785), Aotus trivirgatus (gi 51317461), Alouatta sara (gi 75060794), Equus caballus (gi 149719383), Bos taurus (gi 77736574), Mus musculus (gi 31982207), Rattus norvegicus (gi 109459178), and Gallus gallus (gi 150247142).
TRIM5α nsSNPs were evaluated by four publicly available methods as previously described : 1) Average BLOSUM62 pairwise; 2) SIFT; 3) PolyPhen; 4) A-GVGD. These computational methods were applied to known clinical and functional data. Literature containing TRIM5α nsSNPs was identified by searching PubMed [12, 13, 15, 16, 23]. The agreement of the four methods was assessed for overall consistency using Fleiss' kappa [24, 25].
Results and discussion
Comparative sequence analysis is a powerful tool for the analysis of nsSNPs in the human genome. A large variety of organisms selected for sequence analysis means fewer sequences will be needed to make inferences secondary to long divergence times and increased number of mutations [20, 26]. Too little variation can cause residues to be overly conserved with 'false positive' results, i.e. a residue may seem to be critical for protein function when it is not. In our study population, organisms were not highly diversified, but there was sufficient variation for comparative analysis [18, 26]. With proper alignment, comparative sequence alignment programs have been shown to be accurate over 90% of the time . The TRIM5α sequence alignment was based on available BLAST data of 40 species with 2550 variants. This met the previous threshold for statistical significance [18, 26]. All sequences were eukaryotes and included primates, mouse, rat, and cow .
Twenty-eight amino acid mutations in TRIM5α were identified in the literature, twenty-one of which are known to be nsSNPs in the human population (Table 1). Eight other nsSNPs had in-vitro functional data and are located in the critical SPRY domain of TRIM5α, but are not found in humans. The most common nsSNPs found in both HIV-1 infected and uninfected people were H43Y (frequency 6 to 43%) and R136Q (11 to 38%). Other common nsSNPs included V112F (1 to 11%), G249D (6 to 27%), and H419Y (1 to 8%).
The nsSNP H43Y of the RING region may be important in protein function, specifically E3 ligase activity [13, 16, 23]. Up to 43% of certain populations carry this polymorphism . Functional data have shown that H43Y retains restriction activity [12, 14, 15], whereas other results show decreased activity [13, 23]. Individuals homozygous for the H43Y mutation may develop X4-trophic virus more rapidly than those who are not and progress to AIDS at a faster rate . To further investigate the affect of H43Y and other polymorphisms on protein function, the twenty-eight TRIM5α mutations were analyzed using SIFT, PolyPhen, A-GVGD, and average BLOSUM62 pairwise score (Table 2). Three of the four computational methods (SIFT, PolyPhen, A-GVGD) suggested that H43Y is a tolerated mutation and does not affect protein function. Although PolyPhen and SIFT do not require aligned sequences, we have previously shown that using a sequence alignment of curated data is superior to a single query sequence alone . In this case, regardless of the sequence(s) entered, SIFT classifies the mutation as tolerant. The BLOSUM62 pairwise program predicted H43Y as deleterious, but does not distinguish specific mutations at a given codon and instead makes general predictions based on overall conservation at a given position. This may be a less specific algorithm for detecting individual mutations, but is still as accurate as the other methods. For H43Y, the agreement between programs suggests a greater than 70% accuracy of a 'tolerant' prediction . This supports the evidence that H43Y does not affect TRIM5α function and is likely a benign mutation.
Similar to H43Y, data regarding R136Q has shown conflicting results (Table 1). This amino-acid resides in the coiled coil domain and may participate in TRIM5α oligomerization [8, 13]. Clinical studies have shown that this mutation is increased in HIV-infected patients versus non-infected (OR = 5.49, 95% CI 1.83–16.45, p = 0.002) , but in-vitro data shows R136Q retains functional activity [15, 23]. Furthermore, other clinical studies have shown that R136Q may have a protective affect against HIV-1 . One reason for the conflicting results may be that this mutation is in linkage disequilibrium with other alleles that do play a role in HIV progression or susceptibility [15, 16]. A questionable protective effect of R136Q has also been observed in people with X4-trophic virus . Comparative sequence analysis shows that all four methods agree R136Q would be tolerant with an accuracy of greater than 90% .
Two other nsSNPs, G249D and H419Y, have also shown ambiguous data with either no effect on clinical outcomes [13, 15] or a slower progression of disease . Functional data show that both of these nsSNPs have no affect on TRIM5α function [12, 13, 15, 23]. Three of four methods using comparative sequence analysis suggest G249D is a benign polymorphism, and all four agree that H419 is benign (Table 2).
Several other TRIM5α nsSNPs of interest were also identified. The polymorphisms C58Y, R119W, Q143R, R238W, and V438G are all observed in different human populations and are all predicted deleterious by the four comparative sequence methods (Table 2). Only R119W has been evaluated in clinical studies and has no effect on HIV outcomes . Both R119W and R238W are functional based on in-vitro studies [15, 23]. Given the conflicting data, these nsSNPs along with C58Y, Q143R, and V438G should be further studied to assess affect on protein function and association with clinical HIV disease.
The SPRY region of TRIM5α is a critical region involved in species-specific restriction of HIV-1 [8, 10, 28] and contains codons under high degrees of positive selection [9, 28]. A number of TRIM5α mutations have been studied in the SPRY region of the protein (Table 1). Of interest, amino acid residues 325 to 344 are in a segment of this domain which differs from primates . This 'hypervariable' region has been shown to be responsible, at least in part, for the ability to specifically target HIV-1 . Although no nsSNPs have yet been observed in this region in the human population, mutations at this site may confer a protective benefit against HIV-1 infection. In-vitro studies have demonstrated that single amino acid changes in this region, specifically R332P and to a lesser extent K324N, may be able to effectively restrict HIV-1 [10, 29]. All methods for these mutations with the exception of I348V were predicted tolerant mutations in agreement with the functional data. For I348V, three methods predicted the mutation as tolerant while only the average BLOSUM62 pairwise predicted it as deleterious.
Overall, the four computational methods agreed the majority of the time (κ = 0.53, moderate agreement ). Clinical studies on TRIM5α nsSNPs have shown conflicting results [12, 13, 15, 16], but in-vitro assays clearly demonstrate activity at inhibiting HIV-1. More studies are needed to define the interaction of TRIM5α with immune regulatory genes and DNA sequences that may be in linkage disequilibrium. Focus should be taken to explore the possibility that TRIM5α may affect certain populations differently, specifically people with X4-dominant HIV infection or other ethnic groups.
Two limitations to this study are the paucity of TRIM5α gene sequences available and the lack of structural data available on TRIM5α. Although the number of sequences is sufficient as discussed above, a greater variety of species would allow better alignments based on sensitivity and specificity plots . Furthermore, programs such as PolyPhen rely on structural databases of which there is none presently for TRIM5α.
Comparative sequence analysis suggests that neither H43Y nor R136Q affect TRIM5α protein function. We identified other nsSNPs that may affect TRIM5α activity and should be analyzed in further clinical and laboratory studies.
Lama J, Planelles V: Host factors influencing susceptibility to HIV infection and AIDS progression. Retrovirology. 2007, 4: 52-10.1186/1742-4690-4-52.
Zheng YH, Irwin D, Kurosu T, Tokunaga K, Sata T, Peterlin BM: Human APOBEC3F is another host factor that blocks human immunodeficiency virus type 1 replication. J Virol. 2004, 78 (11): 6073-6076. 10.1128/JVI.78.11.6073-6076.2004.
Sheehy AM, Gaddis NC, Choi JD, Malim MH: Isolation of a human gene that inhibits HIV-1 infection and is suppressed by the viral Vif protein. Nature. 2002, 418 (6898): 646-650. 10.1038/nature00939.
Nisole S, Stoye JP, Saib A: TRIM family proteins: retroviral restriction and antiviral defence. Nat Rev Microbiol. 2005, 3 (10): 799-808. 10.1038/nrmicro1248.
Stremlau M, Perron M, Lee M, Li Y, Song B, Javanbakht H, Diaz-Griffero F, Anderson DJ, Sundquist WI, Sodroski J: Specific recognition and accelerated uncoating of retroviral capsids by the TRIM5alpha restriction factor. Proc Natl Acad Sci USA. 2006, 103 (14): 5514-5519. 10.1073/pnas.0509996103.
Yap MW, Nisole S, Lynch C, Stoye JP: Trim5alpha protein restricts both HIV-1 and murine leukemia virus. Proc Natl Acad Sci USA. 2004, 101 (29): 10786-10791. 10.1073/pnas.0402876101.
Hatziioannou T, Perez-Caballero D, Yang A, Cowan S, Bieniasz PD: Retrovirus resistance factors Ref1 and Lv1 are species-specific variants of TRIM5alpha. Proc Natl Acad Sci USA. 2004, 101 (29): 10774-10779. 10.1073/pnas.0402361101.
Perez-Caballero D, Hatziioannou T, Yang A, Cowan S, Bieniasz PD: Human tripartite motif 5alpha domains responsible for retrovirus restriction activity and specificity. J Virol. 2005, 79 (14): 8969-8978. 10.1128/JVI.79.14.8969-8978.2005.
Ortiz M, Bleiber G, Martinez R, Kaessmann H, Telenti A: Patterns of evolution of host proteins involved in retroviral pathogenesis. Retrovirology. 2006, 3: 11-10.1186/1742-4690-3-11.
Stremlau M, Perron M, Welikala S, Sodroski J: Species-specific variation in the B30.2(SPRY) domain of TRIM5alpha determines the potency of human immunodeficiency virus restriction. J Virol. 2005, 79 (5): 3139-3145. 10.1128/JVI.79.5.3139-3145.2005.
Javanbakht H, Diaz-Griffero F, Stremlau M, Si Z, Sodroski J: The contribution of RING and B-box 2 domains to retroviral restriction mediated by monkey TRIM5alpha. J Biol Chem. 2005, 280 (29): 26933-26940. 10.1074/jbc.M502145200.
Goldschmidt V, Bleiber G, May M, Martinez R, Ortiz M, Telenti A: Role of common human TRIM5alpha variants in HIV-1 disease progression. Retrovirology. 2006, 3: 54-10.1186/1742-4690-3-54.
Javanbakht H, An P, Gold B, Petersen DC, O'Huigin C, Nelson GW, O'Brien SJ, Kirk GD, Detels R, Buchbinder S, Donfield S, Shulenin S, Song B, Perron MJ, Stremlau M, Sodroski J, Dean M, Winkler C: Effects of human TRIM5alpha polymorphisms on antiretroviral function and susceptibility to human immunodeficiency virus infection. Virology. 2006, 354 (1): 15-27. 10.1016/j.virol.2006.06.031.
Nakayama EE, Carpentier W, Costagliola D, Shioda T, Iwamoto A, Debre P, Yoshimura K, Autran B, Matsushita S, Theodorou I: Wild type and H43Y variant of human TRIM5alpha show similar anti-human immunodeficiency virus type 1 activity both in vivo and in vitro. Immunogenetics. 2007, 59 (6): 511-515. 10.1007/s00251-007-0217-7.
Speelmon EC, Livingston-Rosanoff D, Li SS, Vu Q, Bui J, Geraghty DE, Zhao LP, McElrath MJ: Genetic association of the antiviral restriction factor TRIM5alpha with human immunodeficiency virus type 1 infection. J Virol. 2006, 80 (5): 2463-2471. 10.1128/JVI.80.5.2463-2471.2006.
van Manen D, Rits MA, Beugeling C, van Dort K, Schuitemaker H, Kootstra NA: The Effect of Trim5 Polymorphisms on the Clinical Course of HIV-1 Infection. PLoS Pathog. 2008, 4 (2): e18-10.1371/journal.ppat.0040018.
Bao L, Cui Y: Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information. Bioinformatics. 2005, 21 (10): 2185-2190. 10.1093/bioinformatics/bti365.
Greenblatt MS, Beaudet JG, Gump JR, Godin KS, Trombley L, Koh J, Bond JP: Detailed computational study of p53 and p16: using evolutionary sequence analysis and disease-associated mutations to predict the functional consequences of allelic variants. Oncogene. 2003, 22 (8): 1150-1163. 10.1038/sj.onc.1206101.
Tavtigian SV, Deffenbaugh AM, Yin L, Judkins T, Scholl T, Samollow PB, de Silva D, Zharkikh A, Thomas A: Comprehensive statistical study of 452 BRCA1 missense substitutions with classification of eight recurrent substitutions as neutral. J Med Genet. 2006, 43 (4): 295-305. 10.1136/jmg.2005.033878.
Chan PA, Duraisamy S, Miller PJ, Newell JA, McBride C, Bond JP, Raevaara T, Ollila S, Nystrom M, Grimm AJ, Christodoulou J, Oetting WS, Greenblatt MS: Interpreting missense variants: comparing computational methods in human disease genes CDKN2A, MLH1, MSH2, MECP2, and tyrosinase (TYR). Hum Mutat. 2007, 28 (7): 683-693. 10.1002/humu.20492.
Ng PC, Henikoff S: SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003, 31 (13): 3812-3814. 10.1093/nar/gkg509.
Ramensky V, Bork P, Sunyaev S: Human non-synonymous SNPs: server and survey. Nucleic Acids Res. 2002, 30 (17): 3894-3900. 10.1093/nar/gkf493.
Sawyer SL, Wu LI, Akey JM, Emerman M, Malik HS: High-frequency persistence of an impaired allele of the retroviral defense gene TRIM5alpha in humans. Curr Biol. 2006, 16 (1): 95-100. 10.1016/j.cub.2005.11.045.
Fleiss JL: Measuring nominal scale agreement among many raters. Psychological Bulletin. 1971, 76 (5): 378-382. 10.1037/h0031619.
Landis JR, Koch GG: The measurement of observer agreement for categorical data. Biometrics. 1977, 33 (1): 159-174. 10.2307/2529310.
Cooper GM, Brudno M, Green ED, Batzoglou S, Sidow A: Quantitative estimates of sequence divergence for comparative analyses of mammalian genomes. Genome Res. 2003, 13 (5): 813-820. 10.1101/gr.1064503.
Si Z, Vandegraaff N, O'Huigin C, Song B, Yuan W, Xu C, Perron M, Li X, Marasco WA, Engelman A, Dean M, Sodroski J: Evolution of a cytoplasmic tripartite motif (TRIM) protein in cows that restricts retroviral infection. Proc Natl Acad Sci USA. 2006, 103 (19): 7454-7459. 10.1073/pnas.0600771103.
Sawyer SL, Wu LI, Emerman M, Malik HS: Positive selection of primate TRIM5alpha identifies a critical species-specific retroviral restriction domain. Proc Natl Acad Sci USA. 2005, 102 (8): 2832-2837. 10.1073/pnas.0409853102.
Yap MW, Nisole S, Stoye JP: A single amino acid change in the SPRY domain of human Trim5alpha leads to HIV-1 restriction. Curr Biol. 2005, 15 (1): 73-78. 10.1016/j.cub.2004.12.042.
I would like to thank Rami Kantor and Marc Greenblatt for critical reading of the manuscript.
The author declares that they have no competing interests.
PC was the sole contributor to the contents of this article.
About this article
Cite this article
Chan, P.A. Interpreting missense mutations in Human TRIM5alpha by computational methods. BMC Res Notes 1, 116 (2008). https://doi.org/10.1186/1756-0500-1-116
- Human Immunodeficiency Virus
- Human Immunodeficiency Virus Infection
- Single Nucleotide Polymorphism
- Comparative Sequence Analysis
- Affect Protein Function