Analysis of predictive power of binding affinity of PBM-derived sequences
- Authors: Matereke, Lavious Tapiwa
- Date: 2015
- Subjects: Transcription factors , Protein binding , DNA-binding proteins , Chromatin , Protein microarrays
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4161 , http://hdl.handle.net/10962/d1018666
- Description: A transcription factor (TF) is a protein that binds to specific DNA sequences as part of the initiation stage of transcription. Various methods of finding these transcription factor binding sites (TFBS) have been developed. In vivo technologies analyze DNA binding regions known to have bound to a TF in a living cell. The most widely used in vivo methods at present are chromatin immunoprecipitation followed by deep sequencing (ChIP-seq) and DNase I hypersensitive sites sequencing. In vitro methods derive TFBS from experiments with TFs and DNA, usually in artificial settings, or computationally. An example is the Protein Binding Microarray (PBM), which uses artificially constructed DNA sequences to determine the short sequences that are most likely to bind to a TF. The major drawback of this approach is that binding of TFs in vivo also depends on other factors such as chromatin accessibility and the presence of cofactors. Therefore TFBS derived from the PBM technique might not resemble the true DNA binding sequences. In this work, we use PBM data from the UniPROBE motif database, ChIP-seq data and DNase I hypersensitive sites data. Using Spearman’s rank correlation and the area under the receiver operating characteristic curve, we compare the enrichment scores which the PBM approach assigns to its identified sequences with the frequency of these sequences in likely binding regions and in the human genome as a whole. We also use central motif enrichment analysis (CentriMo) to compare the enrichment of UniPROBE motifs with that of in vivo derived motifs (from the JASPAR CORE database) in their respective TF ChIP-seq peak regions. CentriMo is applied to 14 TF ChIP-seq peak regions from different cell lines. We aim to establish whether there is a relationship between the occurrences of UniPROBE 8-mer patterns in likely binding regions and their enrichment scores, and how well the in vitro derived motifs match in vivo binding specificity.
We did not find a consistent trend showing failure of the PBM approach to predict in vivo binding specificity. Our results show that the PBM technique failed to predict in vivo binding for Ets1, Hnf4a and Tcf3, both in terms of our Spearman’s rank correlation for ChIP-seq data and in central motif enrichment analysis. However, the PBM technique matched the in vivo binding specificities of FoxA2, Pou2f2 and Mafk. Failure of the PBM approach was found to be a result of variability in the TF’s binding specificity, the presence of cofactors, narrow binding specificity and the presence of ubiquitous binding patterns.
- Date Issued: 2015
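
The rank-correlation comparison described in the abstract above (PBM enrichment scores against how often the same 8-mers occur in likely binding regions) can be sketched as follows. This is a minimal illustration and not the thesis's actual pipeline: the function names, the toy sequences and the toy enrichment scores are all hypothetical, and only the forward strand is counted, which is a simplifying assumption.

```python
from collections import Counter
from math import sqrt


def count_kmers(sequences, k=8):
    """Count every overlapping k-mer on the forward strand of each sequence."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for j in range(start, end + 1):
            ranks[order[j]] = mean_rank
        start = end + 1
    return ranks


def spearman(x, y):
    """Spearman's rho, computed as the Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one variable is constant")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data, not taken from the thesis.
escores = {"ACGTACGT": 0.49, "TTTTAAAA": 0.31, "GGGGCCCC": 0.12}
regions = ["ACGTACGTACGT", "TTACGTACGTTT", "TTTTAAAACC"]

counts = count_kmers(regions)
kmers = sorted(escores)
rho = spearman([escores[k] for k in kmers], [counts[k] for k in kmers])
print(f"Spearman rho = {rho:.3f}")
```

A rho near 1 would mean that 8-mers with higher PBM enrichment scores also occur more often in the chosen regions; a rho near 0 would correspond to the kind of prediction failure the abstract reports for some TFs.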
Comparison of protein binding microarray derived and ChIP-seq derived transcription factor binding DNA motifs
- Authors: Hlatshwayo, Nkosikhona Rejoyce
- Date: 2015
- Subjects: Protein binding , DNA , DNA microarrays , Transcription factors , DNA-protein interactions , Gene regulatory networks
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: vital:4146 , http://hdl.handle.net/10962/d1017907
- Description: Transcription factors (TFs) are biologically important proteins that interact with transcription machinery and bind DNA regulatory sequences to regulate gene expression by modulating the synthesis of messenger RNA. The regulatory sequences comprise short conserved regions of a specific length called motifs. TFs have very diverse roles in different cells and play a very significant role in development. TFs have been associated with carcinogenesis in various tissue types, as well as with developmental and hormone response disorders. They may be responsible for the regulation of oncogenes and can be oncogenic. Consequently, understanding TF binding and knowing the motifs to which they bind is worthy of attention and research focus. Various projects have made the study of TF binding their main focus; nevertheless, much about TF binding remains confounding. Chromatin immunoprecipitation in conjunction with deep sequencing (ChIP-seq) is a popular method used to investigate DNA-TF interactions in vivo. This procedure is followed by motif discovery and motif enrichment analysis using relevant tools. Protein Binding Microarrays (PBMs) are an in vitro method for investigating DNA-TF interactions. We use motif enrichment analysis tools (CentriMo and AME) and an empirical quality assessment tool (area under the ROC curve) to investigate which method yields motifs that are a true representation of in vivo binding. Motif enrichment analysis: On average, ChIP-seq derived motifs from the JASPAR Core database outperformed PBM derived ones from the UniPROBE mouse database. However, the performance of motifs derived using these two methods does not differ much when using CentriMo and AME. The E-values from motif enrichment analysis were not very different from each other or from 0.
CentriMo showed that in 35 cases JASPAR Core ChIP-seq derived motifs outperformed UniPROBE mouse PBM derived motifs, while PBM derived motifs outperformed ChIP-seq derived motifs in only 11 cases. AME showed that in 18 cases JASPAR Core ChIP-seq derived motifs did better, while UniPROBE motifs outperformed ChIP-seq derived motifs in only 3 cases. We could not distinguish the performance in 25 cases. Empirical quality assessment: Computation of area under the ROC curve values followed by a two-sided t-test showed that there is no significant difference in the average performance of the motifs from the two databases (95% confidence, mean of differences = 0.0088125, p-value = 0.4874, DF = 47).
- Date Issued: 2015
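
The empirical quality assessment described in the abstract above (an area under the ROC curve per motif, then a t-test on the differences between the two databases) can be sketched as follows. This is an illustration under stated assumptions, not the thesis's code: the reported "mean of differences" and DF = 47 suggest a paired test on 48 motif pairs, but that reading is an inference; the function names and all AUC values below are hypothetical. Only the t statistic and degrees of freedom are computed here; turning them into a p-value needs a Student's t distribution from a statistics library.

```python
from math import sqrt
from statistics import mean, stdev


def auc(pos_scores, neg_scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair. This pairwise definition is
    equivalent to the area under the ROC curve.
    """
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def paired_t(a, b):
    """Return (mean difference, t statistic, degrees of freedom) for paired samples."""
    diffs = [x - y for x, y in zip(a, b)]
    n = len(diffs)
    mean_diff = mean(diffs)
    t_stat = mean_diff / (stdev(diffs) / sqrt(n))
    return mean_diff, t_stat, n - 1


# Hypothetical per-TF AUC values for the same four TFs, not from the thesis.
jaspar_auc = [0.80, 0.70, 0.90, 0.60]
uniprobe_auc = [0.78, 0.72, 0.85, 0.61]

mean_diff, t_stat, df = paired_t(jaspar_auc, uniprobe_auc)
print(f"mean of differences = {mean_diff:.4f}, t = {t_stat:.3f}, DF = {df}")
```

Pairing by TF means each motif from one database is compared only with the motif for the same TF from the other, so differences between TFs in how easy they are to predict do not inflate the variance.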
Establishment of human OCT4 as a putative HSP90 client protein: a case for HSP90 chaperoning pluripotency
- Authors: Sterrenberg, Jason Neville
- Date: 2015
- Subjects: Induced pluripotent stem cells , Heat shock proteins , Stem cells , Transcription factors , Molecular chaperones
- Language: English
- Type: Doctoral theses , text
- Identifier: http://hdl.handle.net/10962/194010 , vital:45415 , 10.21504/10962/194010
- Description: The therapeutic potential of stem cells is already being harnessed in clinical trials. Of even greater therapeutic potential has been the discovery of mechanisms to reprogram differentiated cells into a pluripotent stem cell-like state known as induced pluripotent stem cells (iPSCs). Stem cell nature is governed and maintained by a hierarchy of transcription factors, the apex of which is OCT4. Although much research has elucidated the transcriptional regulation of OCT4, OCT4 regulated gene expression profiles and OCT4 transcriptional activation mechanisms in both stem cell biology and cellular reprogramming to iPSCs, the fundamental biochemistry surrounding the OCT4 transcription factor remains largely unknown. In order to analyze the biochemical relationship between HSP90 and human OCT4, we developed an exogenous active human OCT4 expression model with human OCT4 under transcriptional control of a constitutive promoter. We identified a direct interaction between HSP90 and human OCT4 despite the fact that the proteins predominantly display differential subcellular localizations. We show that HSP90 inhibition resulted in degradation of human OCT4 via the ubiquitin proteasome degradation pathway. As human OCT4 and HSP90 did not interact in the nucleus, we suggest that HSP90 functions in the cytoplasmic stabilization of human OCT4. Our analysis suggests that HSP90 inhibition inhibits the transcriptional activity of human OCT4 dimers without affecting monomeric OCT4 activity. Additionally, our data suggest that the HSP90 and human OCT4 complex is modulated by phosphorylation events either promoting or abrogating the interaction between HSP90 and human OCT4. Our data suggest that human OCT4 displays the characteristics describing HSP90 client proteins; we therefore identify human OCT4 as a putative HSP90 client protein. The regulation of the transcription factor OCT4 by HSP90 provides fundamental insights into the complex biochemistry of stem cell biology.
This may also suggest that HSP90 regulates stem cell biology not only by maintaining routine cellular homeostasis but also through the direct regulation of pluripotency factors. , Thesis (PhD) -- Faculty of Science, Biochemistry and Microbiology, 2015
- Date Issued: 2015