Abstract
Background: Programmed death-1 (PD-1) is an immune checkpoint receptor that regulates T-cell function by modulating and terminating immune responses.
Aim: This study investigates the functional and structural impact of missense single nucleotide polymorphisms in the human Programmed Cell Death 1 (PDCD1) gene.
Setting: The data related to PDCD1 gene single nucleotide polymorphisms [SNPs] were collected from dbSNP.
Methods: PredictSNP1.0, integrating eight tools (sorting intolerant from tolerant [SIFT], PolyPhen-1/2, multivariate analysis of protein polymorphism [MAPP], predictor of human deleterious [PhD] SNP, screening for non-acceptable polymorphisms [SNAP], PANTHER, nsSNPAnalyzer), was used for variant predictions. Conservation was assessed with ConSurf, stability with MUPro and I-Mutant 2.0 and pathogenicity with MutPred2. Molecular dynamics (MD) simulations analysed native and mutant PD-1 variants over 100 nanosecond (ns), assessing root-mean-square deviation (RMSD), root-mean-square fluctuation (RMSF), radius of gyration (Rg), solvent-accessible surface area (SASA) and hydrogen bonding.
Results: D117V and W286G were identified as the most deleterious variants. However, W286G was located in an unfavourable structural region, rendering its model unreliable and excluding it from further analysis. Molecular dynamic simulations on the native and D117V models showed no significant differences in RMSD, RMSF, Rg, SASA or hydrogen bonding, suggesting D117V (rs772130993) has minimal impact on PD-1 stability or flexibility.
Conclusion: Bioinformatics tools predicted the D117V variant as deleterious, but molecular dynamics simulations suggest it may have limited functional impact.
Contribution: These findings underscore the importance of integrating computational predictions with experimental validation to guide therapeutic exploration of genetic variants.
Keywords: programmed death-1 (PD-1); non-synonymous single nucleotide polymorphisms (nsSNPs); in-silico analysis; molecular dynamics (MD) simulations; bioinformatics.
Introduction
The Programmed Cell Death 1 (PDCD1) gene, located on chromosome 2q37.3, encodes the Programmed Death 1 (PD-1) protein, a critical immune checkpoint receptor. Programmed death-1 is expressed on the surface of T-cells and plays a pivotal role in down-regulating immune responses and promoting self-tolerance by modulating T-cell activity.1,2 This occurs through its interaction with its ligands, PD-L1 and PD-L2, which are frequently overexpressed in various cancer cells, enabling tumours to evade immune surveillance.3,4 Therefore, inhibition of the PD-1/PD-L1 pathway has become a crucial strategy in cancer immunotherapy, with several PD-1 and PD-L1 inhibitors already approved for clinical use.4,5 This pathway is also essential in preventing autoimmunity, but its dysregulation is associated with several autoimmune diseases, such as systemic lupus erythematosus and rheumatoid arthritis. Moreover, T-cell exhaustion by the PD-1/PD-L1 pathway has a role to play in maintaining peripheral tolerance, but also in regulating antimicrobial defence. Therefore, the PD-1/PD-L1 pathway could limit the damage to the body caused by the exacerbated activation of the immune system during viral infections.6 Conversely, the suppression of antimicrobial defences facilitates the establishment of persistent infections. Notably, several viruses responsible for chronic infections, such as human immunodeficiency virus (HIV)-1, appear to exploit the PD-1/PD-L1 pathway to evade immune detection and establish long-term persistence.2
Given the crucial role of PD-1 protein in immune regulation, understanding the impact of genetic variations in the PDCD1 gene is of significant interest. Therefore, investigating the most prevalent form of genetic variation in the human genome – non-synonymous single nucleotide polymorphisms (nsSNPs) – is of critical scientific importance, as these variants induce amino acid substitutions that may profoundly affect protein structure, stability and hence function. These variations can disrupt normal cellular processes and are often implicated in the pathogenesis of various diseases.7,8 Consequently, nsSNPs in PDCD1 gene may alter the PD-1 protein’s structure, stability or interaction with its ligands, potentially leading to altered immune responses. These effects can be efficiently assessed using in silico approaches, which offer a cost-effective and powerful means of predicting the functional consequences of nsSNPs. By combining diverse data sources, including evolutionary conservation, amino acid physicochemical properties and structural information, computational tools offer valuable insights into the effects of genetic variations.9 Identifying and characterising deleterious nsSNPs not only enhances our understanding of the molecular mechanisms underlying various diseases but also informs the development of targeted therapeutic strategies. In the context of immunotherapy, understanding the functional consequences of PD-1 variants is particularly crucial for advancing precision medicine. Programmed death-1 inhibitors have revolutionised cancer treatment, particularly in melanoma, lung cancer and other malignancies, by enhancing anti-tumour immune responses.10,11 However, genetic variations in PDCD1 gene may influence individual responses to these therapies, affecting efficacy and the risk of immune-related adverse events.12 Furthermore, in infectious diseases, such as HIV, where PD-1 overexpression contributes to T-cell exhaustion, genetic predispositions could inform personalised therapeutic strategies.13 Identifying functionally significant PD-1 variants could help stratify patients for immunotherapy, optimise treatment outcomes and reduce the burden of immune-related complications. This study contributes to a growing body of research supporting the integration of genetic screening in immunotherapeutic decision-making.11 Therefore, in this study, we conducted an in silico analysis to identify and characterise deleterious nsSNPs in the human PDCD1 gene. By leveraging multiple computational prediction tools, we evaluated the potential functional, structural and dynamic impacts of these variants, as well as their pathogenicity on the PD-1 protein. Our findings aim to deepen the understanding of PDCD1 genetic variants and implications thereof for human health, offering valuable insights that could contribute to the development of personalised therapeutic strategies.
Research methods and design
Prediction of the deleterious effect of non-synonymous single nucleotide polymorphisms of the Programmed Cell Death 1 gene
Data collection
The amino acid sequence of the PD-1 protein in Fasta format was retrieved from the Uniprot database (https://www.uniprot.org) (UniprotKB ID Q15116). The data related to PDCD1 gene SNPs were collected from dbSNP (http://www.ncbi.nlm.nih.gov/snp/). A table of variants was realised where we filtered common variants and then classified them in order of priority to identify those which can potentially affect the protein function.
Selection of deleterious non-synonymous single nucleotide polymorphisms
To predict the deleterious effect of nsSNPs of the human PDCD1 gene on protein function, different bioinformatics tools were used. We employed the PredictSNP1.0 (http://loschmidt.chemi.muni.cz/predictsnp1/) for prediction consensus classifier of disease-related mutations. PredictSNP1.0 was chosen for this study because of its comprehensive ability to predict the pathogenicity of single nucleotide polymorphisms (SNPs) by combining eight most powerful predictive algorithms: sorting intolerant from tolerant (SIFT), PolyPhen-1, PolyPhen-2, multivariate analysis of protein polymorphism (MAPP), predictor of human deleterious (PhD)-SNP, screening for non-acceptable polymorphisms (SNAP), PANTHER and nsSNPAnalyzer.14 This platform offers a reliable consensus prediction and integrates results from various tools, allowing for more accurate and consistent identification of potentially harmful nsSNPs. Compared to other computational tools, PredictSNP1.0 is particularly effective in assessing the functional impact of genetic variants, which makes it a suitable choice for this study. With this approach, we generated scores from each computational tool, accompanied by consensus predictions expressed as percentages, derived from observed accuracy values to facilitate straightforward comparisons.14
Evolutionary conservation analysis of programmed death-1
Conservation prediction of the PD-1 protein sequence was analysed with ConSurf (http://consurf.tau.ac.il/), a web server used for calculating the evolutionary conservation of nucleic acid or amino acid positions through phylogenetic relationships between homologous sequences. The algorithm of this tool predicts the important functional regions of a protein by estimating the conservation degree of amino acids on a scale of 1 to 9 based on multiple sequence alignment (MSA). Therefore, 9 represents the most highly conserved residue and 1 represents the least conserved region. This tool also analyses conservation at the nucleotide level. Polymorphisms located in conserved regions are likely to be more deleterious compared to those located in variable regions, as they tend to destabilise protein structure and function.15
Effect on the stability of programmed death-1 protein
The stability of the PD-1 protein was evaluated using the bioinformatics tools MUPro (http://mupro.proteomics.ics.uci.edu/) and I.Mutant 2.0 (http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi), which have the advantage of not requiring tertiary structures for their calculations. MUPro predicts the effect of single amino acid changes on protein stability, providing a confidence score ranging from −1 to +1, where a score < 0 indicates decreased stability and a score > 0 indicates increased stability.16 Similarly, I.Mutant 2.0 assesses the impact of mutations on protein stability by calculating the Gibbs free energy change (DDG) resulting from amino acid substitutions, with DDG values also ranging from −1 to +1. A positive DDG (> 0) signifies increased stability, while a negative DDG (< 0) indicates decreased stability.17 Together, these tools offer quantitative insights into the effects of specific mutations on PD-1 protein stability, providing valuable data for understanding the functional consequences of genetic variations.18
Pathogenicity prediction of mutations
Pathogenicity predictions were conducted using MutPred2 (Mutation Prediction2) (http://mutpred.mutdb.or/help.html), a machine learning-based tool that integrates genetic and molecular data to estimate the pathogenic potential of amino acid substitutions. MutPred2 predicts the gain or loss of structural and functional properties associated with a mutation. This tool requires only the protein sequence and the specific amino acid substitution as input. It generates scores ranging from 0 (benign) to 1 (pathogenic), along with a p-value that indicates the statistical confidence of the prediction. Outputs with a p-value < 0.05 are considered significant, while those with a p-value < 0.01 are deemed highly significant, reflecting greater confidence in the pathogenicity prediction. MutPred2 provides insights into the functional and structural consequences of mutations, offering valuable information for understanding their potential impact on protein behaviour.19
Structural analysis of the programmed death-1 protein
Proteins are inherently dynamic entities, characterised by the flexibility of their constituent amino acids and their diffusion within the surrounding environment. Likewise, ligands and solvents exhibit their own dynamic behaviours. Molecular dynamics (MD) simulations provide a powerful approach to model and analyse the time-dependent behaviour of these molecular systems.20
Preparation of the 3D structure of programmed death-1
Since the complete 3D structure of the PD-1 protein was not available in the Protein Data Bank (PDB), we resorted to homology modelling, based on the assumption that molecules with homologous sequences share similar structures. In this study, we used Iterative Threading ASSEmbly Refinement (I-TASSER, https://zhanggroup.org/I-TASSER/) to generate the complete 3D structure of PD-1. I-TASSER selects the model with the highest significance of thread alignments and the quality of the predicted structure is assessed using a confidence score (C-score), which ranges from –5 to +2, with higher values indicating greater confidence in the prediction. The C-score is correlated with the model’s template modelling score (TM-score) and the root-mean-square deviation (RMSD). The TM-score, a scale recently introduced to address local error issues with RMSD, ranges from 0 to 1, with a value of 1 indicating a higher structural similarity to improve the accuracy of the generated model and bring it closer to experimental levels. For further computational studies, we refined the PD-1 structure using the ModRefiner algorithm (https://zhanglab.ccmb.med.umich.edu/ModRefiner),21 which refines the backbone topology from the C-alpha traces and adds side chains based on a physics- and knowledge-based force field.22 For constructing the mutated models, mutagenesis was performed using PyMOL (http://www.pymol.org)23 and the 3D structures of the native and mutated proteins were evaluated using the Structural Analysis and Verification Server (http://nihserver.mbi.ucla.edu/SAVES/), employing Verify3D and PROCHECK programmes.24
Molecular dynamics analysis
In this study, GROMACS 5.1.3 was selected for MDs simulations because of its efficiency, reliability and widespread use in molecular modelling.25,26 A 100-nanosecond (ns) MD simulation was conducted on the selected native and mutated 3D structures. The CHARMM (Chemistry at Harvard Macromolecular Mechanics) force field was employed for its versatility and proven accuracy in modelling macromolecular systems using empirical energy functions. Widely recognised as a leading tool for studying the MDs of biological molecules, particularly proteins, CHARMM provided a robust framework for this analysis.27
The homology-generated model served as the starting point for constructing the protein topology. The protein was placed in an aqueous environment within a cubic water box, with a 1 nm buffer surrounding the structure, to simulate physiological conditions.28 The system was neutralised by adding sufficient Na+ and Cl- ions. Energy minimisation was conducted using the steepest descent algorithm for up to 50 000 steps to relax the solvated system and achieve a stable conformation.
After energy minimisation, the system was equilibrated in two stages. Firstly, an NVT (constant number of particles, volume and temperature) ensemble was run using the Berendsen thermostat to heat the system to 300 K and achieve a density of 1000 kg/m3. Secondly, an NPT (constant number of particles, pressure and temperature) ensemble was carried out at 1 bar and 300 K to ensure system stabilisation. Each equilibration phase lasted 500 ps.29
Following equilibration, 100 ns MD simulations were performed for both native and mutant models (wild-type [WT] and D117V). The resulting trajectory files were analysed using various GROMACS utilities, including g_rms, g_rmsf, g_hbond, g_mdmat, g_gyrate and g_sasa, to compute parameters RMSD, root-mean-square fluctuation (RMSF), hydrogen bonds (H-bonds), interaction residues, radius of gyration (Rg) and solvent-accessible surface area (SASA), respectively.
The data generated from these analyses were visualised using X Window System Multi-platform GRaphing tool (XMGRACE) for plotting30 and Yet Another Scientific Artificial Reality Application (YASARA) View for molecular interaction representations.31 These analyses provided insights into the structural and dynamic behaviour of both native and D117V mutant PD-1 protein model under simulated physiological conditions.
Results
Distribution of Programmed Cell Death 1 gene single nucleotide polymorphisms dataset
The complete dataset, retrieved from the dbSNP database, comprised 4478 SNPs, including 2287 intronic, 815 downstream, 452 located in the 3′UTR region, 214 nsSNPs, 22 in the 5′UTR region and 688 of other types. These nsSNPs were selected to investigate their potential deleterious effects on the function of the PD-1 protein. By focusing on the nsSNPs, we aimed to assess their possible impact on PD-1 functionality.
Prediction of deleterious non-synonymous single nucleotide polymorphisms in Programmed Cell Death 1 gene
Following the submission of 214 nsSNPs to PredictSNP1.0, a comprehensive in silico analysis was conducted using all available integrated prediction tools to evaluate their potential deleterious effects on pathogenicity.
The results from MAPP predicted 135 out of 214 nsSNPs (63.1%) to be deleterious, followed by PolyPhen-2 with 110 deleterious nsSNPs (51.4%), SNAP with 91 nsSNPs (42.5%), PolyPhen-1 with 85 nsSNPs (39.7%), PredictSNP with 84 nsSNPs (39.2%), PANTHER with 55 nsSNPs (25.7%), SIFT with 38 nsSNPs (17.8%), PhD-SNP with 12 nsSNPs (5.6%) and nsSNPAnalyser with 0 deleterious predictions (0%) as shown in Figure 1.
 |
FIGURE 1: A graphical representation showing the distribution of 214 non-synonymous single nucleotide polymorphisms predicted as deleterious (in orange) and neutral (in blue) by PredictSNP1.0 server. |
|
Among these, two nsSNPs, D117V (rs77213099) and W286G (rs77510030), were consistently predicted to be deleterious by six out of the seven prediction tools used. These variants are highlighted in Table 1 as the most likely to impact the function of the PD-1 protein.
TABLE 1: Prediction scores of deleterious single nucleotide polymorphisms in the Programmed Cell Death 1 gene by PredictSNP1.0. |
Phylogenetic conservation
Using the ConSurf web server, we calculated the evolutionary conservation score of amino acid residues of the PD-1 protein to explore the possible effects of the two most deleterious nsSNPs. Our results showed that the two selected deleterious nsSNPs (D117V and W286G) were in highly conserved regions, exposed, functional and with a conservation degree of 9 (Figure 2).
 |
FIGURE 2: Results of programmed death-1 protein conservation analysis via ConSurf. |
|
Impact of predicted deleterious mutations on programmed death-1 stability
The protein stability analysis of selected SNPs in PD-1 (D117V and W286G) using I-Mutant 2.0 and MUPro reveals a decrease in stability for both the mutation types. For D117V, both tools predict a decrease in stability with I-Mutant showing a negative DDG of −0.40 and MUPro indicating a similar effect across all models (support vector machine [SVM] and artificial neural network [NN]). Similarly, W286G shows a marked decrease in stability, with I-Mutant and MUPro (DDG −2.03) consistently suggesting a destabilising effect. The results are supported by reliability indices (RI) and predictive models (SVM and NN) (Table 2).
TABLE 2: Protein stability analysis results of selected single nucleotide polymorphisms of programmed death-1 protein using I-Mutant 2.0 and MUPro. |
Prediction of mutation-induced pathogenicity
MutPred2 prediction results showed that the pre-selected D117V and W286G nsSNPs scored 0.805 and 0.572, respectively, suggesting that they could be pathogenic rather than benign variants. Among these, the D117V polymorphism was the only one predicted to cause structural and functional alterations in the PD-1 protein. These alterations include a gain of adenosine diphosphate (ADP)-ribosylation at R112, an altered disordered interface, a modified transmembrane protein and a gain of N-glycosylation at N116, potentially affecting the protein’s stability and function (Table 3).
TABLE 3: Effects of nsSNP D117V on structural and functional properties of programmed death-1 protein resulting from MutPred2. |
Preparation and validation of programmed death-1 models
Although different polymorphisms in the PD-1 protein are reported, the native 3D structure with the complete sequence of 288 amino acids is not yet available on the PDB database. Besides, not all mutant protein structures are analysed and deposited in databases. Therefore, we were interested in our study in modelling the 3D structure of the native PD-1 protein and the mutant proteins in order to be able to compare them structurally and functionally. For this, the I-TASSER web server is deployed using as a homology model the primary sequence of PD-1 obtained from the National Center for Biotechnology Information (NCBI) database.
I-TASSER produced five models, from which we selected only the model with the highest minimum C-score, which suggests higher confidence of the predicted model. The TM-score of our chosen predicted secondary structure highlights the correct topology of the model. The C-score reflects the high quality of our predicted secondary structure and directly correlates with the TM-score and RMSD (Table 4).
TABLE 4: I-TASSER results representing the C-score, template modelling-score and root-mean-square deviation of the selected secondary structure. |
The generated 3D structure was refined using the ModRefiner server with an RMSD value of 1.071 and a TM-score of 0.9729 to the initial model. After refining our native structure, via PyMol, we constructed two mutated structures of the PD-1 protein with the two selected nsSNPs (D117V and W286G). All these 3D structures (native and mutated) were validated using the Verify3D and PROCHECK tools. Verify3D showed that 82.64% of the amino acids have a score ≥ 0.2 by comparing their 3D profile to the 1D, whether for the native or mutated structure (Figure 3). This means that according to Verify3D our structure is validated.
 |
FIGURE 3: Programmed death-1 native 3D structure evaluation graph performed by Verify3D. |
|
For our native structure, Ramachandran plot analysis by PROCHECK showed 159 (68.5%) residues in the favoured region, 62 (25.9%) residues in the additional allowed region, 7 (3%) residues in the generously authorised region and 6 (2.6%) residues in the unauthorised region. Plotting our predicted structure provided a view of the set of excluded regions that show the position of amino acids that are not allowed. This is the case for the 6 residues (Proline130, Tryptophan286, Lysine210, Aspartate85, Serine159 and Arginine203) which were found at very long distances from the predicted model pocket, suggesting that these amino acids are structurally and functionally irrelevant (Figure 4).
 |
FIGURE 4: Ramachandran plot of PROCHECK validation of the native 3D structure of programmed death-1. |
|
Concerning the validation by PROCHECK of the mutated structures (D117V and W286G), the estimation of the stereochemical quality showed that the D117V structure is reliable and valid for further MDs calculations (Figure 4).
Alternatively, the W286G structure is proposed to be irrelevant because of the fact that residue 286 before and after mutagenesis is located in an unauthorised or unfavourable region (Figure 4). These results allowed us to exclude the possibility of exploiting the mutated W286G model in the continuation of our study.
Molecular dynamics simulations
Root-means-square deviation
The conformational stability of the native and mutated PD-1 protein models (WT vs. D117V) during the MD simulation was evaluated using RMSD analysis. The native model exhibited an average RMSD value of 0.670 nm ± 0.103 nm, which was closely comparable to the D117V mutated model, with an RMSD of 0.635 nm ± 0.096 nm. These values indicate stable trajectories for both the models, with minimal positional deviations over time. This consistency confirms the conformational stability of the D117V model, suggesting a well-balanced system (Figure 5a).
 |
FIGURE 5: Molecular dynamics (MD) simulation analysis with: (a) root-mean-square deviation, (b) root-mean-square fluctuation, (c) Rg, (d) solvent-accessible surface area and (e) H-bond values across Cα backbone (in Å) of native (WT) and mutant (D117V) models calculated after 100 ns of MD trajectories. |
|
Over a 100 ns MDs simulation, the native model exhibited an average RMSD of 0.670 nm ± 0.103 nm, whereas the D117V mutated model showed a comparable RMSD of 0.635 nm ± 0.096 nm. Considering a 95% confidence interval, these values suggest that the structural fluctuations of both the models remain within a similar range, indicating minimal impact of the D117V mutation on overall conformational stability.
Root-mean-square fluctuation
The residual flexibility of the PD-1 protein was analysed using RMSF, based on the positions of amino acid residues. The average RMSF values for the native and mutated PD-1 models were 0.309 nm ± 0.137 nm and 0.293 nm ± 0.129 nm, respectively, indicating similar overall flexibility between the two. However, higher fluctuations were observed in arginine residues at positions 30 and 161, as well as in the N-terminal region of the mutated protein compared to the native model (Figure 5b). Conversely, tryptophan at position 23 and arginine at position 231 showed greater stability in the mutated model. These variations suggest that the D117V polymorphism induces slight local fluctuations in residues distant from the mutation site without compromising overall structural stability.
Radius of gyration
The protein compactness, shape and structural stability over time were evaluated by analysing the radius of gyration (Rg) for both the native and D117V mutated models. After 40 ns of MD simulation, the mutated model exhibited a sudden increase in Rg fluctuations, in contrast to the native model. However, the average Rg values remained comparable between the two models, with the mutant showing an average of 2.296 nm ± 0.056 nm and the native model 2.194 nm ± 0.055 nm. These results indicate that despite localised fluctuations, the D117V mutation does not compromise the overall compactness and stability of the PD-1 protein structure throughout the MD simulation (Figure 5c).
Solvent-accessible surface area
The surface area of a biomolecule interacting with solvent molecules was evaluated using the SASA parameter to determine its conformational stability in an aqueous medium. Our results show that the D117V polymorphism appears to abruptly fluctuate significantly between 30 ns and 50 ns. After 82 ns of simulation, both models (WT and D117V) stabilise at the same SASA value of 175 nm2. To summarise, native and mutated PD-1 protein structures maintained an almost similar average SASA value throughout the MD simulation with 180.997 (± 5.831) nm2 and 182.509 (± 7.001) nm2; respectively (Figure 5d). Given that the residues of the PD-1 protein are present on the surface of the protein, the presence of variants at this level induces a conservation of the accessibility of the surface of the solvent to the protein during the MD simulation.
Hydrogen bonds
Hydrogen bonds (H-bonds) induce the formation of secondary and tertiary protein structures. Increases in the number of hydrogens induce stronger protein interactions. The MD simulation of the mutated model highlighted a notable increase in the hydrogen interaction with the native PD-1 protein, causing a stability of the bond between the two models over time, despite the conformational change. However, the average number of hydrogen bonds present during the MD simulation almost did not change and were 125.267 Å ± 8.672 Å for the native model and 127.685 Å ± 9.403 Å for the D117V mutated model in a 100 ns MD simulation (Figure 5e).
Local structural changes
A 100 ns MDs simulation, a comparison of the local structural changes induced by our D117V polymorphism was carried out with YASARA. Our results showed that there is a variation in the number of the different types of interactions between the native and mutated models. We found that in its native state, aspartic acid has two hydrogen bonds with the amino acids ARG94 and ARG114 and one hydrogen bond with ARG69. When valine replaced aspartic acid, one of the hydrogen bonds in ARG114 and the hydrogen bonds with ARG94 and ARG69 disappeared. We also observed a gain of hydrogen bonding with TYR121 (Figure 6b). Furthermore, in the native model, the ARG69 residue had an ionic bond with ASP117, whereas upon substitution of this aspartic acid with a valine this bond disappeared (Figure 6b). We also noticed that SER118 changed its conformation following this variation (Figure 6a). Moreover, the number of hydrophobic bonds has not really changed.
 |
FIGURE 6: Structural model of the native (a) and mutated D117V (b) models after a 100 ns molecular dynamics simulation. Substituted residues are marked in magenta; residues involved in hydrophobic interactions are indicated in cyan. Hydrogen bonds are represented by yellow dotted lines, hydrophobic by green lines and ionic by blue lines. |
|
Discussion
Bioinformatics tools are invaluable in analysing genetic information, providing insights into genome organisation, gene expression, sequence alignment, evolutionary relationships, MDs and the structure–function relationships of macromolecules.32,33 Several studies have leveraged these tools to advance scientific and medical research. For instance, in 2018, Kumar et al. used computational approaches to investigate the impact of missense mutations associated with ᴅ-2-hydroxyglutaric aciduria 2, elucidating molecular structural changes that can inform drug target development for this disease.34 Similarly, bioinformatics tools have been employed to study the association between Gaucher’s and Parkinson’s diseases, facilitating drug discovery for these conditions.35 Moreover, MD simulations have proven effective in evaluating the structural and functional properties of proteins and their assemblies.36
In this study, we conducted an in silico analysis of the human PDCD1 gene to identify potentially deleterious nsSNPs and evaluate their impact on protein structure and stability. Our results showed that the two nsSNPs, D117V (rs77213099) and W286G (rs77510030), were identified by six out of seven predictive tools as highly deleterious, suggesting their potential to significantly affect PD-1 function.
Evolutionary conservation plays a critical role in determining the impact of mutations on human health.37 Using ConSurf, we analysed structural and functional residues by integrating evolutionary conservation data with solvent accessibility predictions. Our analysis revealed that nsSNPs in highly conserved regions are more likely to impair protein function compared to those in non-conserved regions38,39 underscoring their importance in maintaining PD-1’s structural and functional integrity.
Protein stability is essential for its proper function. Our findings suggest that the analysed variants, particularly D117V, may destabilise PD-1 at a functional level. The calculated p-values (Table 3) reinforced the reliability of these predictions, indicating that the D117V mutation could induce structural and functional alterations. However, it remains unclear whether these site-specific changes extend their impact beyond the protein level.
To explore these further, MD simulations were performed to compare the behaviour of the native and mutated forms of PD-1. However, PROCHECK validation of W286G model indicated that the mutated structure was positioned in an unauthorised or unfavourable region of the protein. This means that residue 286, both before and after mutagenesis, was in a stereochemically inappropriate location. This made the model structurally invalid. In contrast, the D117V mutant structure passed the POCHECK validation check, ensuring its stability and suitability for MD simulations. Therefore, the W286G model was deemed unsuitable and excluded from further analyses.
Analysis of RMSD, RMSF, Rg, SASA and H-bond, following 100 ns MD simulation, showed that the D117V nsSNP does not influence the stability, flexibility and overall dimensions of the PD-1 protein. This allows us to conclude that this rs772130993 (D117V) polymorphism does not seem to influence the protein function.
The structure of the PD-1 protein was analysed to identify the different types of bonds between the different residues of the protein. These bonds play a major role in protein stability and folding. Each disruption of these interactions affects the structure and could interrupt the function of the PD-1 protein. The structure of this protein could also be affected by the differences in the physicochemical properties of amino acid variants caused by deleterious nsSNPs.35,38,40 Structural analysis of the D117V variant revealed changes, including the formation and/or loss of bonds, likely because of the distinct physicochemical properties of the substituted amino acids. Aspartic acid, an acidic residue, was replaced by valine, an aliphatic residue with different charges and hydrophobicity, potentially driving the observed structural and functional variations. This may be because of the fact that our polymorphism is located in a discrete region, in the middle of the component sequence of the extracellular region of our protein and therefore does not come into direct contact with the interactions of this protein. Also, the protection of this site is probably because of the fact that this amino acid is located in a helical region and therefore is stacked with adjacent residues, which gives it some resistance and protection. These results do not rule out the fact that this polymorphism, combined with other polymorphisms of the same or other regions, could disrupt protein function. Besides, predicting D117V as a deleterious SNP, but showing no significant functional change via MD is consistent with the literature. Some variants may appear harmful in predictions but demonstrate no substantial biological effects in dynamic environments.41 This reported lack of association between the rs772130993 (D117V) polymorphism and protein function would facilitate research into biomarker development and promise ease in the use of PD-1 inhibitors as treatment for HIV-infected subjects.
While multiple predictive tools identified D117V as a deleterious variant, MD simulations suggest that its structural and functional impact is minimal. This discrepancy highlights the limitations of static computational predictions, which primarily assess evolutionary conservation, physicochemical changes and generalised protein stability. In contrast, MD simulations provide a dynamic perspective, revealing that despite initial predictions, D117V does not significantly alter PD-1’s conformational stability or key functional interactions over time. Similar findings have been reported in other immune checkpoint proteins, where predicted deleterious mutations exhibited minimal phenotypic effects in dynamic cellular environments.42
While our findings suggest that the D117V variation has no significant functional effect, future research should test these predictions using such experimental approaches to clarify its biological impact. Furthermore, investigating the population-specific prevalence of these variations could provide insights into their role in disease susceptibility and immune modulation. Clinically, longitudinal studies assessing the correlation between PDCD1 polymorphisms and patient responses to PD-1 inhibitors in cancer or chronic infections could provide a foundation for personalised immunotherapy.11 Integrating these findings with proteomic and transcriptomic data may reveal broader biological implications of PD-1 polymorphisms, paving the way for optimised immunotherapeutic strategies.
To strengthen the clinical relevance of our findings, future studies should incorporate experimental validation of the predicted effects of PD-1 variants. Functional assays, such as Surface Plasmon Resonance or Isothermal Titration Calorimetry, could assess changes in ligand binding affinity,43 whereas structural analyses using X-ray Crystallography or cryo-Electron Microscopy would provide direct evidence of conformational alterations.44 Additionally, cellular assays evaluating immune activation and cytokine production in PD-1-expressing T cells could determine the physiological impact of these variants.45
Conclusion
This in silico investigation aimed to study the deleterious nsSNPs of the PD-1 protein and examine their impact on the protein function. From 288 nsSNPs, two amino acid substitutions (D117V and W286G) were found to be deleterious, conserved and associated with decreasing stability and altering protein structure. More generally, the results of this prediction work will depend on the results, subsequently obtained on the impact of these polymorphisms on the protein function. Therefore, we realised a study of structural and functional modifications to directly determine the effect of these mutant proteins. Homology modelling and validation of our mutated structures only allowed us to retain the D117V variant. Our results demonstrated that this polymorphism does not have a strong impact on the structure of PD-1 and, therefore, on its function. This deduction could be useful for possible in vitro experimental studies and clinical trials in order to rule out the involvement of this polymorphism in the interactions of the PD-1 protein with its receptors. These findings suggest that anti-PD-1 therapy may offer a promising new approach for treating various mutation-associated diseases. While PD-1/PD-L1 inhibitors have shown considerable success in cancer treatment, their potential benefits for infectious diseases, such as HIV-1, hepatitis B or C virus patients, particularly, in improving T-cell function, are still being explored.2,13 However, deeper insights into PD-1 are needed to address potential treatment resistance.
Acknowledgements
The authors are thankful to the Pasteur Institute of Morocco for providing encouragement, facilities and resources.
Competing interests
The authors declare that they have no financial or personal relationships that may have inappropriately influenced them in writing this article. The author, B.R.K., serves as an editorial board member of this journal. The author has no other competing interests to declare.
Authors’ contributions
H.B. wrote the article. H.B., M.B., L.W. and A.K. conceived and designed the experiments. H.B., M.B., A.A. and A.S. performed experiments, data analysis and interpretation. B.R.K., M.A.S. and S.A. reviewed and edited the article. A.K. and L.W. supervised the project. All authors discussed the results, contributed to the final article and consented to publication.
Ethical considerations
This article followed all ethical standards for research without direct contact with human or animal subjects.
Funding information
This research received no specific grant from any funding agency in the public, commercial or not-for-profit sectors.
Data availability
The datasets generated and/or analysed during the current study are available from the corresponding author, H.B., upon reasonable request.
Disclaimer
The views and opinions expressed in this article are those of the authors and are the product of professional research. They do not necessarily reflect the official policy or position of any affiliated institution, funder, agency or that of the publisher. The authors are responsible for this article’s results, findings and content.
References
- Sharpe AH, Freeman GJ. The B7-CD28 superfamily. Nat Rev Immunol. 2002;2:116–126. https://doi.org/10.1038/nri727
- Keir ME, Butte MJ, Freeman GJ, Sharpe AH. PD-1 and its ligands in tolerance and immunity. Annu Rev Immunol. 2008;26:677–704. https://doi.org/10.1146/annurev.immunol.26.021607.090331
- Sharpe AH, Pauken KE. The diverse functions of the PD1 inhibitory pathway. Nat Rev Immunol. 2018;18:153–167. https://doi.org/10.1038/nri.2017.108
- Topalian SL, Hodi FS, Brahmer JR, et al. Safety, activity, and immune correlates of anti-PD-1 antibody in cancer. N Engl J Med. 2012;366:2443–2454. https://doi.org/10.1056/NEJMoa1200690
- The nobel prize in physiology or medicine 2018 [homepage on the Internet]. NobelPrize.org. [cited 2024 Aug 05]. Available from: https://www.nobelprize.org/prizes/medicine/2018/advanced-information/
- Iwai Y, Terawaki S, Ikegawa M, Okazaki T, Honjo T. PD-1 inhibits antiviral immunity at the effector phase in the liver. J Exp Med. 2003;198:39–50. https://doi.org/10.1084/jem.20022235
- Ng PC, Henikoff S. Predicting the effects of amino acid substitutions on protein function. Annu Rev Genomics Hum Genet. 2006;7:61–80. https://doi.org/10.1146/annurev.genom.7.080505.115630
- Yates CM, Filippis I, Kelley LA, Sternberg MJE. SuSPect: Enhanced prediction of single amino acid variant (SAV) phenotype using network features. J Mol Biol. 2014;426:2692–2701.
- Adzhubei IA, Schmidt S, Peshkin L, et al. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7:248–249. https://doi.org/10.1038/nmeth0410-248
- Acúrcio RC, Pozzi S, Carreira B, et al. Therapeutic targeting of PD-1/PD-L1 blockade by novel small-molecule inhibitors recruits cytotoxic T cells into solid tumor microenvironment. J Immunother Cancer. 2022;10:e004695. https://doi.org/10.1136/jitc-2022-004695
- Ribas A, Wolchok JD. Cancer immunotherapy using checkpoint blockade. Science. 2018;359:1350–1355. https://doi.org/10.1126/science.aar4060
- Zhang J, Zhao T, Xu C, Huang J, Yu H. The association between polymorphisms in the PDCD1 gene and the risk of cancer: A PRISMA-compliant meta-analysis. Medicine (Baltimore). 2016;95:e4423.
- McLane LM, Abdel-Hakeem MS, Wherry EJ. CD8 T cell exhaustion during chronic viral infection and cancer. Annu Rev Immunol. 2019;37:457–495.
- Bendl J, Stourac J, Salanda O, et al. PredictSNP: Robust and accurate consensus classifier for prediction of disease-related mutations. PLoS Comput Biol. 2014;10:e1003440. https://doi.org/10.1371/journal.pcbi.1003440
- Ashkenazy H, Abadi S, Martz E, et al. ConSurf 2016: An improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Res. 2016;44:W344–W350. https://doi.org/10.1093/nar/gkw408
- Cheng J, Randall A, Baldi P. Prediction of protein stability changes for single-site mutations using support vector machines. Proteins. 2006;62:1125–1132. https://doi.org/10.1002/prot.20810
- Capriotti E, Fariselli P, Casadio R. I-Mutant2.0: Predicting stability changes upon mutation from the protein sequence or structure. Nucleic Acids Res. 2005;33:W306–W310.
- Abuzaid O, Idris AB, Yılmaz S, Idris EB, Idris LB, Hassan MA. Prediction of the most deleterious non-synonymous SNPs in the human IL1B gene: Evidence from bioinformatics analyses. BMC Genomic Data. 2024;25:56. https://doi.org/10.1186/s12863-024-01233-x
- Pejaver V, Urresti J, Lugo-Martinez J, et al. Inferring the molecular and phenotypic impact of amino acid variants with MutPred2. Nat Commun. 2020;11:5918.
- Tang H, Thomas PD. PANTHER-PSEP: Predicting disease-causing genetic variants using position-specific evolutionary preservation. Bioinformatics. 2016;32:2230–2232. https://doi.org/10.1093/bioinformatics/btw222
- Yang Y, Faraggi E, Zhao H, Zhou Y. Improving protein fold recognition and template-based modeling by employing probabilistic-based matching between predicted one-dimensional structural properties of query and corresponding native properties of templates. Bioinformatics. 2011;27:2076–2082. https://doi.org/10.1093/bioinformatics/btr350
- Xu D, Zhang Y. Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization. Biophys J. 2011;101:2525–2534.
- Yuan S, Chan HCS, Filipek S, Vogel H. PyMOL and inkscape bridge the data and the data visualization. Structure. 2016;24:2041–2042. https://doi.org/10.1016/j.str.2016.11.012
- Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: A program to check the stereochemical quality of protein structures. J Appl Cryst. 1993;26: 283–291.
- Abraham MJ, Murtola T, Schulz R, et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX. 2015;1–2:19–25. https://doi.org/10.1016/j.softx.2015.06.001
- Berendsen HJC, Van der Spoel D, Van Drunen R. GROMACS: A message-passing parallel molecular dynamics implementation. Comput Phys Commun. 1995;91:43–56.
- Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. CHARMM: A program for macromolecular energy, minimization, and dynamics calculations. J Comput Chem. 1983;4:187–217. https://doi.org/10.1002/jcc.540040211
- Mark P, Nilsson L. Structure and dynamics of the TIP3P, SPC, and SPC/E Water Models at 298 K. J Phys Chem A. 2001;105:9954–9960.
- Berendsen HJC, Postma JPM, Van Gunsteren WF, DiNola A, Haak JR. Molecular dynamics with coupling to an external bath. J Chem Phys. 1984;81:3684–3690. https://doi.org/10.1063/1.448118
- Humphrey W, Dalke A, Schulten K. VMD: Visual molecular dynamics. J Mol Graph. 1996;14:33–38.
- Krieger E, Vriend G. YASARA view – Molecular graphics for all devices – From smartphones to workstations. Bioinformatics. 2014;30:2981–2982. https://doi.org/10.1093/bioinformatics/btu426
- Mah JTL, Low ESH, Lee E. In silico SNP analysis and bioinformatics tools: A review of the state of the art to aid drug discovery. Drug Discov Today. 2011;16:800–809. https://doi.org/10.1016/j.drudis.2011.07.005
- Hospital A, Goñi JR, Orozco M, Gelpí JL. Molecular dynamics simulations: Advances and applications. Adv Appl Bioinform Chem. 2015;8:37–47.
- Thirumal Kumar D, Jerushah Emerald L, George Priya Doss C, et al. Computational approach to unravel the impact of missense mutations of proteins (D2HGDH and IDH2) causing D-2-hydroxyglutaric aciduria 2. Metab Brain Dis. 2018;33:1699–1710.
- Thirumal Kumar D, Eldous HG, Mahgoub ZA, George Priya Doss C, Zayed H. Computational modelling approaches as a potential platform to understand the molecular genetics association between Parkinson’s and Gaucher diseases. Metab Brain Dis. 2018;33:1835–1847. https://doi.org/10.1007/s11011-018-0286-3
- Sinha S, Tam B, Wang SM. Applications of molecular dynamics simulation in protein study. Membranes. 2022;12:844. https://doi.org/10.3390/membranes12090844
- Ramensky V, Bork P, Sunyaev S. Human non-synonymous SNPs: Server and survey. Nucleic Acids Res. 2002;30:3894–3900.
- Doniger SW, Kim HS, Swain D, et al. A catalog of neutral and deleterious polymorphism in yeast. PLoS Genet. 2008;4:e1000183.
- Miller MP, Kumar S. Understanding human disease mutations through the use of interspecific genetic variation. Hum Mol Genet. 2001;10:2319–2328. https://doi.org/10.1093/hmg/10.21.2319
- Berezin C, Glaser F, Rosenberg J, et al. ConSeq: The identification of functionally and structurally important residues in protein sequences. Bioinformatics. 2004;20:1322–1324. https://doi.org/10.1093/bioinformatics/bth070
- Qi T, Fu J, Zhang W, et al. Mutation of PD-1 immune receptor tyrosine-based switch motif (ITSM) enhances the antitumor activity of cytotoxic T cells. Transl Cancer Res. 2020;9(11):6811–6819. https://doi.org/10.21037/tcr-20-2118
- Ahmed Al-Madhagi H. Computational identification of most deleterious missense mutations in human PD-1 gene. Sci. World J. 2023;2023:4360203. https://doi.org/10.1155/2023/4360203
- Abdiche Y, Malashock D, Pinkerton A, Pons J. Determining kinetics and affinities of protein interactions using a parallel real-time label-free biosensor, the Octet. Anal Biochem. 2008;377:209–217. https://doi.org/10.1016/j.ab.2008.03.035
- Cheng Y, Walz T. The advent of near-atomic resolution in single-particle electron microscopy. Annu Rev Biochem. 2009;78:723–742. https://doi.org/10.1146/annurev.biochem.78.070507.140543
- Freeman GJ, Long AJ, Iwai Y, et al. Engagement of the PD-1 immunoinhibitory receptor by a novel B7 family member leads to negative regulation of lymphocyte activation. J Exp Med. 2000;192:1027–1034. https://doi.org/10.1084/jem.192.7.1027
|