Rawat, Vimal (2015). Genome re-annotation and DNA motif identification in Brassicaceae species. PhD thesis, Universität zu Köln.
|
PDF
FINAL_THESIS_ONLINE.pdf Download (8MB) |
Abstract
The DNA sequence analysis field has experienced a paradigm shift caused by the drastic reduction in the sequencing cost and time. With the availability of several reference genome assemblies, understanding of structural and functional aspects of genomes has started growing. Annotating a reference genome is the first and very crucial step that ensures its efficient usability to serve as a community resource. Unlike coding regions, non–coding regions do not translate into proteins but still play a central role in development and physiology of an organism by regulating gene expression. Identification and annotation of these regions are only initial steps, equally interesting and even more rewarding is to decipher the interplay between these two components of a genome. Identification of cis-regulatory elements (CREs), the functional components of the non-coding genome, is paramount to our understanding regarding the gene expression regulation. The role of CREs in regulating rhythmic (diurnal) expression of thousands of genes has been reported in several plants species (including Arabidopsis thaliana) but still only a few CREs have been reported so far. In the first project, using extensive RNA-sequencing data, I substantially improved the annotation and usability of a Brassicaceae species, Arabidopsis lyrata. Gene model coordinates for over 90% genes are corrected, with improved UTRs (untranslated regions) annotation. Over 2,000 genes are now annotated as transposable element (TE)-related genes and around 8% annotated with alternate transcripts. With hundreds of cases of gene-merge and gene-split, improved annotation also corrects coding space of the genome. Experimentally validated data for several such cases strongly supported updated annotation, highlighting the importance of employing species-specific RNA-sequencing data for genome annotation. In the second project, I compared time-series transcriptomics data for two Brassicaceae species, Arabidopsis thaliana and Arabis alpina. Around 30% genes were found under the control of diurnal regulation in both species. An interesting finding regarding phase-shift of the circadian clock genes and their direct targets was also observed. Gene Ontology term enrichment analysis suggested that diurnal genes associated to carbohydrate metabolism are the most affected by this phase shift while light-signaling associated genes are the least affected. I also demonstrated the usefulness of Phylogenetic shadowing to identify enriched CREs in the diurnal genes. Using several recently assembled Brassicaceae genomes, I analyzed the conservation patterns in promoters of orthologous diurnal genes. In total, I identified 54 and 45 DNA motifs for Arabidopsis thaliana and Arabis alpina respectively. Over 65% motifs were found common for both species including previously reported six motifs. Based on recently published open chromatin data, around 30% of the DNA motifs revealed protected sites from an endonuclease (DNase I), indicating their potential role as protein-binding sites. Several phase-specific co-occurring DNA motifs pairs were found conserved in both species, including previously known Evening Element (EE) and ABA Response Element Like (ABREL) pair, underlining the broad conservation of cis-regulation of diurnal expression.
Item Type: | Thesis (PhD thesis) | ||||||||
Translated abstract: |
|
||||||||
Creators: |
|
||||||||
URN: | urn:nbn:de:hbz:38-71468 | ||||||||
Date: | 14 November 2015 | ||||||||
Language: | English | ||||||||
Faculty: | Faculty of Mathematics and Natural Sciences | ||||||||
Divisions: | Faculty of Mathematics and Natural Sciences > Department of Biology > Institute for Genetics | ||||||||
Subjects: | Natural sciences and mathematics Life sciences |
||||||||
Uncontrolled Keywords: |
|
||||||||
Date of oral exam: | 19 January 2016 | ||||||||
Referee: |
|
||||||||
Refereed: | Yes | ||||||||
URI: | http://kups.ub.uni-koeln.de/id/eprint/7146 |
Downloads
Downloads per month over past year
Export
Actions (login required)
View Item |