Similarity searching in DNA sequences by spectral distortion measures
Pham, Tuan D. (2006) Similarity searching in DNA sequences by spectral distortion measures. Proceedings of the 6th Industrial Conference on Data Mining. ICDM 2006 6th Industrial Conference on Data Mining , 14-15 July 2006, Leipzig, Germany , pp. 24-37.
|PDF (Published Version) - Repository staff only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader|
View at Publisher Website: http://dx.doi.org/10.1007/11790853_3
Searching for similarity among biological sequences is an important research area of bioinformatics because it can provide insight into the evolutionary and genetic relationships between species that open doors to new scientific discoveries such as drug design and treament. In this paper, we introduce a novel measure of similarity between two biological sequences without the need of alignment. The method is based on the concept of spectral distortion measures developed for signal processing. The proposed method was tested using a set of six DNA sequences taken from Escherichia coli K-12 and Shigella flexneri, and one random sequence. It was further tested with a complex dataset of 40 DNA sequences taken from the GenBank sequence database. The results obtained from the proposed method are found superior to some existing methods for similarity measure of DNA sequences.
|Item Type:||Conference Item (Refereed Research Paper - E1)|
|Keywords:||DNA; similarity measure|
|SEO Codes:||92 HEALTH > 9202 Health and Support Services > 920203 Diagnostic Methods @ 34%|
92 HEALTH > 9204 Public Health (excl. Specific Population Health) > 920412 Preventive Medicine @ 33%
92 HEALTH > 9201 Clinical Health (Organs, Diseases and Abnormal Conditions) > 920102 Cancer and Related Disorders @ 33%
|Deposited On:||19 Nov 2009 15:36|
|Last Modified:||10 May 2013 00:39|
Last 12 Months: 0
|Citation Counts with External Providers:||Web of Science: 1|
Repository Staff Only: item control page