Over-represented sequences located on UTRs are potentially involved in regulatory functions
Kihoon Yoon,
Daijin Ko,
Carolina Livi,
Nathan Trinklein,
Mark Doderer,
Stephen Kwek and
Luiz Penalva
Additional contact information
Kihoon Yoon: The University of Texas Health
Daijin Ko: The University of Texas at San Antonio
Carolina Livi: The University of Texas at San Antonio
Nathan Trinklein: SwitchGear Genomics
Mark Doderer: The University of Texas at San Antonio
Stephen Kwek: The University of Texas at San Antonio
Luiz Penalva: The University of Texas Health
No 53, Working Papers from College of Business, University of Texas at San Antonio
Abstract:
Eukaryotic gene expression must be coordinated for the proper functioning of biological processes. This coordination can be achieved both at the transcriptional and post-transcriptional levels. In both cases, regulatory sequences placed at either promoter regions or on UTRs function as markers recognized by regulators that can then activate or repress different groups of genes according to necessity. While regulatory sequences involved in transcription are quite well documented, there is a lack of information on sequence elements involved in post-transcriptional regulation. We used a statistical over-representation method to identify novel regulatory elements located on UTRs. An exhaustive search approach was used to calculate the frequency of all possible n-mers (short nucleotide sequences) in 16,160 human genes of NCBI RefSeq sequences and to identify any peculiar usage of n-mers on UTRs. After a stringent filtering process, we identified circa 4,000 highly over-represented n-mers on UTRs. We provide evidence that these n-mers are potentially involved in regulatory functions. Identified n-mers overlap with previously identified binding sites for HuR and Tia1 and, AU-rich and GU-rich sequences. We determined also that over-represented n-mers are particularly enriched in a group of 159 genes directly involved in tumor formation. Finally, a method to cluster n-mer groups allowed the identification of putative gene networks.
Keywords: Over-represented sequences; UTRs; regulatory functions (search for similar items in EconPapers)
JEL-codes: C10 (search for similar items in EconPapers)
Pages: 39 pages
Date: 2008-05-30
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)
Downloads: (external link)
http://interim.business.utsa.edu/wps/MSS/0053MSS-301-2008.pdf Full text (application/pdf)
Our link check indicates that this URL is bad, the error code is: 404 Not Found
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:tsa:wpaper:0098mss
Access Statistics for this paper
More papers in Working Papers from College of Business, University of Texas at San Antonio Contact information at EDIRC.
Bibliographic data for series maintained by Wendy Frost ().