Enumeration, conformation sampling and population of libraries of peptide macrocycles for the search of chemotherapeutic cardioprotection agents
- Authors: Sigauke, Lester Takunda
- Date: 2019
- Subjects: Peptides -- Synthesis , Macrocyclic compounds , Drug development , Drug discovery , Cardiovascular system -- Diseases -- Prevention , Proteins -- Synthesis
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116056 , vital:34293
- Description: Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is non‐trivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an in‐silico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardio‐protective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilin‐D perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardio‐protective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilin‐D showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilin‐D target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used.
- Full Text:
- Date Issued: 2019
- Authors: Sigauke, Lester Takunda
- Date: 2019
- Subjects: Peptides -- Synthesis , Macrocyclic compounds , Drug development , Drug discovery , Cardiovascular system -- Diseases -- Prevention , Proteins -- Synthesis
- Language: English
- Type: text , Thesis , Doctoral , PhD
- Identifier: http://hdl.handle.net/10962/116056 , vital:34293
- Description: Peptides are uniquely endowed with features that allow them to perturb previously difficult to drug biomolecular targets. Peptide macrocycles in particular have seen a flurry of recent interest due to their enhanced bioavailability, tunability and specificity. Although these properties make them attractive hit-candidates in early stage drug discovery, knowing which peptides to pursue is non‐trivial due to the magnitude of the peptide sequence space. Computational screening approaches show promise in their ability to address the size of this search space but suffer from their inability to accurately interrogate the conformational landscape of peptide macrocycles. We developed an in‐silico compound enumerator that was tasked with populating a conformationally laden peptide virtual library. This library was then used in the search for cardio‐protective agents (that may be administered, reducing tissue damage during reperfusion after ischemia (heart attacks)). Our enumerator successfully generated a library of 15.2 billion compounds, requiring the use of compression algorithms, conformational sampling protocols and management of aggregated compute resources in the context of a local cluster. In the absence of experimental biophysical data, we performed biased sampling during alchemical molecular dynamics simulations in order to observe cyclophilin‐D perturbation by cyclosporine A and its mitochondrial targeted analogue. Reliable intermediate state averaging through a WHAM analysis of the biased dynamic pulling simulations confirmed that the cardio‐protective activity of cyclosporine A was due to its mitochondrial targeting. Paralleltempered solution molecular dynamics in combination with efficient clustering isolated the essential dynamics of a cyclic peptide scaffold. The rapid enumeration of skeletons from these essential dynamics gave rise to a conformation laden virtual library of all the 15.2 Billion unique cyclic peptides (given the limits on peptide sequence imposed). Analysis of this library showed the exact extent of physicochemical properties covered, relative to the bare scaffold precursor. Molecular docking of a subset of the virtual library against cyclophilin‐D showed significant improvements in affinity to the target (relative to cyclosporine A). The conformation laden virtual library, accessed by our methodology, provided derivatives that were able to make many interactions per peptide with the cyclophilin‐D target. Machine learning methods showed promise in the training of Support Vector Machines for synthetic feasibility prediction for this library. The synergy between enumeration and conformational sampling greatly improves the performance of this library during virtual screening, even when only a subset is used.
- Full Text:
- Date Issued: 2019
An in-silico investigation of Morita-Baylis-Hillman accessible heterocyclic analogues for applications as novel HIV-1 C protease inhibitors
- Authors: Sigauke, Lester Takunda
- Date: 2015
- Subjects: Protease inhibitors , Heterocyclic compounds , HIV (Viruses) , HIV infections , Drug resistance , Cheminformatics
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4152 , http://hdl.handle.net/10962/d1017913
- Description: Cheminformatic approaches have been employed to optimize the bis-coumarin scaffold identified by Onywera et al. (2012) as a potential hit against the protease HIV-1 protein. The Open Babel library of commands was used to access functions that were incorporated into a markov chain recursive program that generated 17750 analogues of the bis-coumarin scaffold. The Morita-Baylis-Hillman accessible heterocycles were used to introduce structural diversity within the virtual library. In silico high through-put virtual screening using AutoDock Vina was used to rapidly screen the virtual library ligand set against 61 protease models built by Onywera et al. (2012). CheS-Mapper computed a principle component analysis of the compounds based on 13 selected chemical descriptors. The compounds were plotted against the principle component analysis within a 3 dimensional chemical space in order to inspect the diversity of the virtual library. The physicochemical properties and binding affinities were used to identify the top 3 performing ligands. ACPYPE was used to inspect the constitutional properties and eliminated virtual compounds that possessed open valences. Chromene based ligand 805 and ligand 6610 were selected as the lead candidates from the high-throughput virtual screening procedure we employed. Molecular dynamic simulations of the lead candidates performed for 5 ns allowed the stability of the ligand protein complexes with protease model 305152. The free energy of binding of the leads with protease model 305152 was computed over the first 50 ps of simulation using the molecular mechanics Poisson-Boltzmann method. Analysis structural features and energy profiles from molecular dynamic simulations of the protein–ligand complexes indicated that although ligand 805 had a weaker binding affinity in terms of docking, it outperformed ligand 6610 in terms of complex stability and free energy of binding. Medicinal chemistry approaches will be used to optimize the lead candidates before their analogues will be synthesized and assayed for in vivo protease activity.
- Full Text:
- Date Issued: 2015
- Authors: Sigauke, Lester Takunda
- Date: 2015
- Subjects: Protease inhibitors , Heterocyclic compounds , HIV (Viruses) , HIV infections , Drug resistance , Cheminformatics
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4152 , http://hdl.handle.net/10962/d1017913
- Description: Cheminformatic approaches have been employed to optimize the bis-coumarin scaffold identified by Onywera et al. (2012) as a potential hit against the protease HIV-1 protein. The Open Babel library of commands was used to access functions that were incorporated into a markov chain recursive program that generated 17750 analogues of the bis-coumarin scaffold. The Morita-Baylis-Hillman accessible heterocycles were used to introduce structural diversity within the virtual library. In silico high through-put virtual screening using AutoDock Vina was used to rapidly screen the virtual library ligand set against 61 protease models built by Onywera et al. (2012). CheS-Mapper computed a principle component analysis of the compounds based on 13 selected chemical descriptors. The compounds were plotted against the principle component analysis within a 3 dimensional chemical space in order to inspect the diversity of the virtual library. The physicochemical properties and binding affinities were used to identify the top 3 performing ligands. ACPYPE was used to inspect the constitutional properties and eliminated virtual compounds that possessed open valences. Chromene based ligand 805 and ligand 6610 were selected as the lead candidates from the high-throughput virtual screening procedure we employed. Molecular dynamic simulations of the lead candidates performed for 5 ns allowed the stability of the ligand protein complexes with protease model 305152. The free energy of binding of the leads with protease model 305152 was computed over the first 50 ps of simulation using the molecular mechanics Poisson-Boltzmann method. Analysis structural features and energy profiles from molecular dynamic simulations of the protein–ligand complexes indicated that although ligand 805 had a weaker binding affinity in terms of docking, it outperformed ligand 6610 in terms of complex stability and free energy of binding. Medicinal chemistry approaches will be used to optimize the lead candidates before their analogues will be synthesized and assayed for in vivo protease activity.
- Full Text:
- Date Issued: 2015
Structural studies on yeast eIF5A using biomolecular NMR and molecular dynamics
- Authors: Sigauke, Lester Takunda
- Date: 2015
- Subjects: Molecular dynamics , Reverse transcriptase , HIV (Viruses) , HIV infections , Eukaryotic cells , Yeast
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4547 , http://hdl.handle.net/10962/d1017927
- Description: Eukaryotic initiation factor 5A, eIF5A, is a ubiquitous eukaryotic protein that has been shown to influence the translation initiation of a specific subset of mRNAs. It is the only protein known to undergo hypusination in a two-step post translational modification process involving deoxyhypusine synthase (DHS) and deoxyhypusine hydroxylase (DOHH) enzymes. Hypusination has been shown to influence translation of HIV-1 and HTLV-1 nuclear export signals, while the involvement of active hypusinated eIF5A in induction of IRES mediated processes that initiate pro-apoptotic process have inspired studies into the manipulation of eIF5A in anti-cancer and anti-diabetic therapies. eIF5A oligomerisation in eukaryotic systems has been shown to be influenced by hypusination and the mechanism of dimerisation is RNA dependent. Nuclear magnetic resonance spectroscopy approaches were proposed to solve the structure of the hypusinated eIF5A in solution in order to understand the influence of hypusination on the monomeric arrangement which enhances dimerisation and activates the protein. Cleavage of the 18 kDa protein monomer by introduction of thrombin cleavage site within the flexible domain was thought to give rise to 10 kDa fragments accessible to a 600 MHz NMR spectrometer. Heteronuclear single quantum correlation experiments of the mutated isotopically labelled protein expressed in E. coli showed that the eIF5A protein with a thrombin cleavage insert, eIF5AThr (eIF5A subscript Thr), was unfolded. In silico investigations of the behaviour of eIF5A and eIF5AThr (eIF5A subscript Thr) models in solution using molecular dynamics showed that the mutated model had different solution dynamics to the native model. Chemical shift predictors were used to extract atomic resolution data of solution dynamics and the introduction of rigidity in the flexible loop region of eIF5A affected solution behaviour consistent with lack of in vivo function of eIF5AThr (eIF5A subscript Thr) in yeast. Residual dipolar coupling and T₁ relaxation times were calculated in anticipation of the extraction of experimental data from RDC and relaxation dispersion experiments based on HSQC measurable restraints.
- Full Text:
- Date Issued: 2015
- Authors: Sigauke, Lester Takunda
- Date: 2015
- Subjects: Molecular dynamics , Reverse transcriptase , HIV (Viruses) , HIV infections , Eukaryotic cells , Yeast
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4547 , http://hdl.handle.net/10962/d1017927
- Description: Eukaryotic initiation factor 5A, eIF5A, is a ubiquitous eukaryotic protein that has been shown to influence the translation initiation of a specific subset of mRNAs. It is the only protein known to undergo hypusination in a two-step post translational modification process involving deoxyhypusine synthase (DHS) and deoxyhypusine hydroxylase (DOHH) enzymes. Hypusination has been shown to influence translation of HIV-1 and HTLV-1 nuclear export signals, while the involvement of active hypusinated eIF5A in induction of IRES mediated processes that initiate pro-apoptotic process have inspired studies into the manipulation of eIF5A in anti-cancer and anti-diabetic therapies. eIF5A oligomerisation in eukaryotic systems has been shown to be influenced by hypusination and the mechanism of dimerisation is RNA dependent. Nuclear magnetic resonance spectroscopy approaches were proposed to solve the structure of the hypusinated eIF5A in solution in order to understand the influence of hypusination on the monomeric arrangement which enhances dimerisation and activates the protein. Cleavage of the 18 kDa protein monomer by introduction of thrombin cleavage site within the flexible domain was thought to give rise to 10 kDa fragments accessible to a 600 MHz NMR spectrometer. Heteronuclear single quantum correlation experiments of the mutated isotopically labelled protein expressed in E. coli showed that the eIF5A protein with a thrombin cleavage insert, eIF5AThr (eIF5A subscript Thr), was unfolded. In silico investigations of the behaviour of eIF5A and eIF5AThr (eIF5A subscript Thr) models in solution using molecular dynamics showed that the mutated model had different solution dynamics to the native model. Chemical shift predictors were used to extract atomic resolution data of solution dynamics and the introduction of rigidity in the flexible loop region of eIF5A affected solution behaviour consistent with lack of in vivo function of eIF5AThr (eIF5A subscript Thr) in yeast. Residual dipolar coupling and T₁ relaxation times were calculated in anticipation of the extraction of experimental data from RDC and relaxation dispersion experiments based on HSQC measurable restraints.
- Full Text:
- Date Issued: 2015
- «
- ‹
- 1
- ›
- »