TY - GEN
T1 - Efficient probe selection in microarray design
AU - Ga̧sieniec, Leszek
AU - Li, Cindy Y.
AU - Sant, Paul
AU - Wong, Prudence W.H.
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2006
Y1 - 2006
N2 - The DNA microarray technology, originally developed to measure the level of gene expression, had become one of the most widely used tools in genomic study. Microarrays have been proved to benefit areas including gene discovery, disease diagnosis, and multi-virus discovery. The crux of microarray design lies in how to select a unique probe that distinguishes a given genomic sequence from other sequences. However, in cases that the existence of a unique probe is unlikely, e.g., in the context of a large family of closely homologous genes, the use of a limited number of non-unique probes is still desirable.qyy Due to its significance, probe selection attracts a lot of attention. Various probe selection algorithms have been developed in recent years. Good probe selection algorithms should produce as small number of candidate probes as possible. Efficiency is also crucial because the data involved is usually huge. Most existing algorithms usually select probes by filtering, which is usually not selective enough and quite a large number of probes are returned. We propose a new direction to tackle the problem and give an efficient algorithm to select (randomly) a small set of probes and demonstrate that such a small set of probes is sufficient to distinguish each sequence from all the other sequences. Based on the algorithm, we have developed a probe selection software RANDPS, which runs efficiently and effectively in practice. A number of experiments have been carried out and the results will be discussed.
AB - The DNA microarray technology, originally developed to measure the level of gene expression, had become one of the most widely used tools in genomic study. Microarrays have been proved to benefit areas including gene discovery, disease diagnosis, and multi-virus discovery. The crux of microarray design lies in how to select a unique probe that distinguishes a given genomic sequence from other sequences. However, in cases that the existence of a unique probe is unlikely, e.g., in the context of a large family of closely homologous genes, the use of a limited number of non-unique probes is still desirable.qyy Due to its significance, probe selection attracts a lot of attention. Various probe selection algorithms have been developed in recent years. Good probe selection algorithms should produce as small number of candidate probes as possible. Efficiency is also crucial because the data involved is usually huge. Most existing algorithms usually select probes by filtering, which is usually not selective enough and quite a large number of probes are returned. We propose a new direction to tackle the problem and give an efficient algorithm to select (randomly) a small set of probes and demonstrate that such a small set of probes is sufficient to distinguish each sequence from all the other sequences. Based on the algorithm, we have developed a probe selection software RANDPS, which runs efficiently and effectively in practice. A number of experiments have been carried out and the results will be discussed.
UR - http://www.scopus.com/inward/record.url?scp=50249164160&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50249164160&partnerID=8YFLogxK
U2 - 10.1109/CIBCB.2006.331018
DO - 10.1109/CIBCB.2006.331018
M3 - Conference contribution
AN - SCOPUS:50249164160
SN - 1424406234
SN - 9781424406234
T3 - Proceedings of the 2006 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB'06
SP - 247
EP - 254
BT - Proceedings of the 2006 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB'06
T2 - 3rd Computational Intelligence in Bioinformatics and Computational Biology Symposium, CIBCB
Y2 - 28 September 2006 through 29 September 2006
ER -