DATE: Thursday, Feb. 17, 2005
TIME: 1:30 pm
PLACE: Council Room (SITE 5-084)
TITLE: Data Mining in Genomics and the BioMine project
PRESENTER: A. Fazel Famili
NRC
ABSTRACT:

Knowledge discovery is the process of developing strategies to discover useful and ideally all previously unknown knowledge from historical or real-time data. Applied to microarray related applications, knowledge discovery processes will help in various research and development activities, such as (i) studying data quality for possible anomalous or highly expressions of certain genes or experiments, (ii) identifying relationships between genes and their functions based on time-series or other high throughput genomics profiles, and (iii) discovering models for clinical diagnosis/classifications based on expression profiles among two or more classes.

This presentation consists of three parts. In part one, we provide an overview of data mining in genomics and the BioMine project. In part two of this talk we describe some of our case studies using the BioMiner data mining software that we have built in this project. These are all cases in which real genomics data sets (public or private) have been used for tasks such as gene function identification and gene response analysis. We will describe a few examples explaining complexities and challenges in dealing with real data from genomics field. In the last part of this talk, we share our experiences gained over the last 5 years and describe our future plans.