- Title
- Mechanism of action of non-synonymous single nucleotide variations associated with α-carbonic anhydrases II, IV and VIII
- Creator
- Sanyanga, T. Allan
- Subject
- Carbonic anhydrase
- Subject
- Carbonic anhydrase -- Therapeutic use
- Subject
- Nucleotides
- Date Issued
- 2020
- Date
- 2020
- Type
- text
- Type
- Thesis
- Type
- Doctoral
- Type
- PhD
- Identifier
- http://hdl.handle.net/10962/167346
- Identifier
- vital:41470
- Description
- The carbonic anhydrase (CA) group of enzymes are Zinc (Zn2+) metalloproteins responsible for the reversible hydration of CO2 to bicarbonate (BCT or HCO− 3 ) and protons (H+) for the facilitation of acid-base balance and homeostasis within the body. Across all organisms, a minimum of six CA families exist, including, α (alpha), β (beta), γ (gamma), δ (delta), η (eta) and ζ (zeta). Some organisms can have more than one family, with exception to humans that contain the α family solely. The α-CA family comprises of 16 isoforms (CA-I to CA-XV) including the CA-VIII, CA-X and CA-XI acatalytic isoforms. Of the catalytic isoforms, CA-II and CA-IV possess one of the fastest rates of reaction, and any disturbances to the function of these enzymes results in CA deficiencies and undesirable phenotypes. CA-II deficiencies result in osteopetrosis with renal tubular acidosis and cerebral calcification, whereas CA-IV deficiencies result in retinitis pigmentosa 17 (RP17). Phenotypic effects generally manifest as a result of poor protein folding and function due to the presence of non-synonymous single nucleotide variations (nsSNVs). Even within the acatalytic isoforms such as CA-VIII that llosterically regulates the affinity of inositol triphosphate (IP3) for the IP3 receptor type 1 (ITPR1) and regulates calcium (Ca2+) signalling, the presence of SNVs also causes phenotypes cerebellar ataxia, mental retardation, and dysequilibrium syndrome 3 (CAMRQ3). Currently the majority of research into the CAs is focused on the inhibition of these proteins to achieve therapeutic effects in patients via the control of HCO− production or reabsorption as observed in glaucoma and diuretic medications. Little research has therefore been devoted into the identification of stabilising or activating compound that could rescue protein function in the case of deficiencies. The main aim of this research was to identify and characterise the effects of nsSNVs on the structure and function of CA-II, CA-IV and CA-VIII to set a foundation for rare disease studies into the CA group of proteins. Combined bioinformatics approaches divided into four main objectives were implemented. These included variant identification, sequence analysis and protein characterisation, force field (FF) parameter generation, molecular dynamics (MD) simulation and dynamic residue network analysis (DRN). Six variants for each of the CA-II, CA-IV and CA-VIII proteins with pathogenic annotations were identified from the HUMA and Ensembl databases. These included the pathogenic variants K18E, K18Q, H107Y, P236H, P236R and N252D for CA-II. CA-IV included the pathogenic R69H, R219C and R219S, and benign N86K, N177K and V234I variants. CA-VIII included pathogenic S100A, S100P, G162R and R237Q, and benign S100L and E109D variants. CA-II has been more extensively studied than CA-IV and CA-VIII, therefore residues essential to its function and stability are known. To discover important residues and regions within the CA-IV and CA-VIII proteins sequence and motif analysis was performed across the α-CA family, using CA-II as a reference. Sequence analysis identified multiple conserved residues between the two acatalytic CA-II and CA-IV, and the acatalytic CA-VIII isoforms that were proposed to be essential for protein stability. With exception to the benign N86K CA-IV variant, none of the other pathogenic or benign CA-II, CA-IV and CA-VIII SNVs were located at functionally or structurally important residues. Motif analysis identified 11 conserved and important motifs within the α-CA family. Several of the identified variants were located on these motifs including K18E, K18Q, H107Y and N252D (CA-II); N86K, R219C, R219S and V234I (CA-IV); and E109D, G162R and R237Q (CA-VIII). As there were no x-ray crystal structures of the variant proteins, homology modelling was performed to calculate the protein structures for characterisation. In CA-VIII, the substitution of Ser for Pro at position 100 (variant S100P) resulted in destruction of the β-sheet that the SNV was located on. Little is known about the mechanism of interaction between CA-VIII and ITPR1, and residues involved. SiteMap and CPORT were used to identify binding site amino for CA-VIII and results identified 38 potential residues. Traditional FFs are incapable of performing MD simulations of metalloproteins. The AMBER ff14SB FF was extended and Zn2+ FF parameters calculated to add support for metalloprotein MD simulations. In the protein, Zn2+ was noted to have a charge less than +1. Variant effects on protein structure were then investigated using MD simulations. Root mean square deviation (RMSD) and radius of gyration (Rg) results indicated subtle SNV effects to the variant global structure in CA-II and CA-IV. However, with regards to CA-VIII RMSD analysis highlighted that variant presence was associated with increases to the structural rigidity of the protein. Principal component analysis (PCA) in conjunction with free energy analysis was performed to observe variant effects on protein conformational sampling in 3D space. The binding of BCT to CA-II induced greater protein conformational sampling and was associated with higher free energy. In CA-IV and CA-VIII PCA analysis revealed key differences in the mechanism of action of pathogenic and benign SNVs. In CA-IV, wild-type (WT) and benign variant protein structures clustered into single low energy well hinting at the presence of more stable structures. Pathogenic variants were associated with higher free energy and proteins sampled more conformations without settling into a low energy well. PCA analysis of CA-VIII indicated the opposite to CA-IV. Pathogenic variants were clustered into low energy wells, while the WT and benign variants showed greater conformational sampling. Dynamic cross correlation (DCC) analysis was performed using the MD-TASK suite to determine variant effects on residue movement. CA-II WT protein revealed that BCT and CO2 were associated with anti-correlated and correlated residue movement, highlighting at opposite mechanisms. In CA-IV and CA-VIII variant presence resulted in a change to residue correlation compared to the WT proteins. DRN analysis was performed to investigate SNV effects of residue accessibility and communication. Results demonstrated that SNVs are associated with allosteric effects on the CA protein structures, and effects are located on the stability assisting residues of the aromatic clusters and the active site of the proteins. CA-II studies discovered that Glu117 is the most important residue for communication, and variant presence results in a decrease to the usage of the residue. This effect was greatest in the CA-II H107Y SNV, and suggests that variants could have an effect on Zn2+ dissociation from the active site. Decreases to the usage of Zn2+ coordinating residues were also noted. Where this occurred, compensatory increases to the usage of other primary and secondary coordination residues were observed, that could possibly assist with the maintenance of Zn2+ within the active site. The CA-IV variants R69H and R219C highlighted potentially similar pathogenic mechanisms, whereas N86K and N177K hinted at potentially similar benign mechanisms. Within CA-VIII, variant presence was associated with changes to the accessibility of the N-terminal binding site residues. The benign CA-VIII variants highlighted possible compensatory mechanisms, whereby as one group of N-terminal residues loses accessibility, there was an increase to the accessibility of other binding site residues to possibly balance the effect. Catalytically, the proton shuttle residue His64 in CA-II was found to occupy a novel conformation named the “faux in” that brought the imidazole group even closer to the Zn2+ compared to the “in” conformation. Overall, compared to traditional MD simulations the incorporation of DRN allowed more detailed investigations into the variant mechanisms of action. This highlights the importance of network analysis in the study of the effects of missense mutations on the structure and function of proteins. Investigations of diseases at the molecular level is essential in the identification of disease pathogenesis and assists with the development of specifically tailored and better treatment options especially in the cases of genetically associated rare diseases.
- Format
- 246 pages
- Format
- Publisher
- Rhodes University
- Publisher
- Faculty of Science, Biochemistry and Microbiology
- Language
- English
- Rights
- Sanyanga, T. Allan
- Hits: 2689
- Visitors: 2505
- Downloads: 256
Thumbnail | File | Description | Size | Format | |||
---|---|---|---|---|---|---|---|
View Details Download | SOURCE1 | SANYANGA-PHD-TR20-436.pdf | 15 MB | Adobe Acrobat PDF | View Details Download |