Information Theory and Multivariate Techniques for Analyzing DNA Sequence Data: An Example from Tomato Genes
DOI:
https://doi.org/10.3126/njb.v1i1.3867Keywords:
Diversity analysis, DNA sequences, principal component analysis, tomato genesAbstract
DNA and amino acid sequences are alphabetic symbols having no underlying metric. Use of information theory is one of the solutions for sequence metric problems. The reflection of DNA sequence complexity in phenotype stability might be useful for crop improvement. Shannon-Weaver index (Shannon Entropy, H') and mutual information (MI) index were estimated from DNA sequences of 22 genes, consisted of two gene families of tomato, namely disease resistance and fruit quality. Main objective was use of information theory and multivariate techniques to understand diversity among genes and relate the sequence complexity with phenotypes. The normalized H' value ranged from 0.429 to 0.461. The highest diversity was observed in the gene Crtr-B (beta carotene hydroxylase). Two principal components which accounted for 36.65% variation placed these genes into four groups. Groupings of these genes by both principal component and cluster analyses showed clearly the similarity at phenotypes levels within cluster. Sequences similarity among genes was observed within a family. Diversity assessment of genes applying information theory should link to understand the sequences complexity with respect to gene stability for example stability of resistance gene.
Key words: Diversity analysis; DNA sequences; principal component analysis; tomato genes
Nepal Journal of Biotechnology, 2011, Vol. 1, No. 1 pp.1-9
Downloads
Downloads
How to Cite
Issue
Section
License
Copyright Notice:
The manuscript submitted to NJB must be an original contribution, not previously published and should not be under consideration for publication elsewhere. When the manuscript is accepted for publication, the authors agree to automatically transfer the copyright of the article to the publisher. It should grant permission to any third party, in advance and in perpetuity, the right to use, reproduce or disseminate your article, according to the NJB copyright and license agreement.
Authors transfer copyright to the publisher as part of a journal publishing agreement but have the rights to: Share their article for Personal Use, Internal Institutional Use and Scholarly Sharing purposes, with the NJB applies the Creative Commons Attribution-NonCommercial CC BY-NC license to all the works we publish after Jun 2020 (Before it was CC BY-NC-ND). Under this license, authors agree to make articles legally available for reuse, without permission or fees, for virtually any non-commercial purpose. Anyone may remix, adapt, and build upon your work non-commercially, and although their new works must also acknowledge you and be non-commercial, they don’t have to license their derivative works on the same terms. More details on CC BY-NC refer to its Licence Deed and Legal Code.