Pattern recognition of genomic features with microarrays: site typing of Mycobacterium tuberculosis strains

Citation:

Raychaudhuri S, Stuart JM, Liu X, Small PM, Altman RB. Pattern recognition of genomic features with microarrays: site typing of Mycobacterium tuberculosis strains [Internet]. Proc Int Conf Intell Syst Mol Biol 2000;8:286-95.

Date Published:

2000

Abstract:

Mycobacterium tuberculosis (M. tb.) strains differ in the number and locations of a transposon-like insertion sequence known as IS6110. Accurate detection of this sequence can be used as a fingerprint for individual strains, but can be difficult because of noisy data. In this paper, we propose a non-parametric discriminant analysis method for predicting the locations of the IS6110 sequence from microarray data. Polymerase chain reaction extension products generated from primers specific for the insertion sequence are hybridized to a microarray containing targets corresponding to each open reading frame in M. tb. To test for insertion sites, we use microarray intensity values extracted from small windows of contiguous open reading frames. Rank-transformation of spot intensities and first-order differences in local windows provide enough information to reliably determine the presence of an insertion sequence. The nonparametric approach outperforms all other methods tested in this study.

Publisher's Version

Last updated on 04/29/2020