TY - GEN
T1 - Transcription factor discovery using support vector machines and heterogeneous data
AU - Barbe, José F.
AU - Tewfik, Ahmed H.
AU - Khodursky, Arkady B.
PY - 2007
Y1 - 2007
N2 - In this work we analyze the suitability of expression and sequence data for discovery of co-regulatory relationships using Support Vector Machines. In addition, we try to assess the possibility of improving such results by heterogeneous data fusion and by estimating a probability of a correct classification. As shown in other studies, we have found that transcription co-expression is a good estimator for genetic co-regulation. We also have found some evidence that operator site sequence motifs can be used to estimate coregulation, but the kernels used for feature extraction did not achieve classification rates comparable to expression data. Finally, the additional information provided by combining sequence and expression data can be exploited to estimate the probability of correct classification.
AB - In this work we analyze the suitability of expression and sequence data for discovery of co-regulatory relationships using Support Vector Machines. In addition, we try to assess the possibility of improving such results by heterogeneous data fusion and by estimating a probability of a correct classification. As shown in other studies, we have found that transcription co-expression is a good estimator for genetic co-regulation. We also have found some evidence that operator site sequence motifs can be used to estimate coregulation, but the kernels used for feature extraction did not achieve classification rates comparable to expression data. Finally, the additional information provided by combining sequence and expression data can be exploited to estimate the probability of correct classification.
UR - http://www.scopus.com/inward/record.url?scp=47049110864&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=47049110864&partnerID=8YFLogxK
U2 - 10.1109/GENSIPS.2007.4365812
DO - 10.1109/GENSIPS.2007.4365812
M3 - Conference contribution
AN - SCOPUS:47049110864
SN - 1424409993
SN - 9781424409990
T3 - GENSIPS'07 - 5th IEEE International Workshop on Genomic Signal Processing and Statistics
BT - 5th IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS'07
T2 - 5th IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS'07
Y2 - 10 June 2007 through 12 June 2007
ER -