Data Mining in Functional Genomics and Proteomics involves a close collaboration between researchers from a number of diverse areas, such as biology, medicine, genomics and proteomics to computer science, mathematics and statistics. This collaboration of disciplines has evolved because of the: (i) advances that have occurred in data production and acquisition facilities, such as the introduction of mircroarrays and high throughput genomics and proteomics, (ii) enormous amounts of data that is generated every day that cannot be analyzed using ordinary data mining tools and techniques, and (iii) strong interest from many groups (research institutes, hospitals, academia, pharmaceuticals, etc.) who want to benefit from
this wealth of data. Many efforts to deal with these issues are being undertaken by researchers working in this field.
The aim of this workshop is to bring together researchers working on different topics related to data mining in functional genomics and proteomics. In particular we are interested to focus on current trends and emphasize on what should be the future directions for generic and applied research in this field. The main topics to be addressed during the workshop are integration methodologies for functional genomics and proteomics and also issues related to structuring and disseminating all useful knowledge that increasingly becomes available in this field.
The intended audience are reseachers and practitioners who are working in the above fields and in one or more of the following:
Data pre-processing, data understanding
Data management methods
Data mining architectures, data bases
Machine learning, NN, GA, SVM
Statistics
Soft computing techniques
Gene expression analysis
Gene networks and pathways
Comparative genomics
Post-processing, knowledge/model integration
Integration of various forms of genomics and proteomics data
Genomics knowledge structure and dissemination
Association rules, and knowledge discovery from time series data
Data Visualization, Association Graphs
Biological Modeling and Artificial Life
Dr. A. Fazel Famili (NRC – Canada), Chair
Prof. Xiaohui Liu (Brunel University – UK), Co-Chair
Prof. J.M. Peña (UPM – Spain), Co-Chair
Dr. Joaquin Dopazo (CIPF– Spain)
Prof. Samuel Kaski (Helsinki University of Technology–Finland)
Prof. Ana Teresa Freitas (INESC-ID/IST - Portugal)
Prof. Alexander Schliep, (Max Planck Institute – Germany)
Prof. Henrik Bostrom (Royal Inst. Of Technology - Sweden)
Prof. Evgenii Vityaev (Russian Academy of Science - Russia)
Ing. Sergio Storari (Università di Ferrara - Italy)
Prof. Guillaume Beslon (INSA - Lyon, - France)
| Date | Event |
| June 30, 2007 | Deadline for paper submission |
| July 21, 2007 | Notification of accepted papers |
| July 28, 2007 | Camera-ready workshop notes and related information to ECML/PKDD |
| September 17, 2007 | Workshop is held |
This workshop will be held in Warsaw, Poland on September 17rd, immediately prior to the start of the ECML/PKDD conference. All workshop participants are expected to register for the main conference. Attendance to the workshop will be limited to 35-40 people. The participants will be selected by the organizing committee after reviewing submitted papers or statement of their current research. Those wishing to attend the workshop without submitting a paper should send a statement of their current research to the workshop chair or one of the workshop co-chairs.
The workshop will accept a maximum of 10 accepted papers. Each session will end with a 10-15 minutes summary and discussion. Workshop proceedings will be printed prior to the workshop and will be distributed among the participants.
| Time | Presentation (Authors) |
| 11:00-11:15 | Workshop Introduction and Welcome (F. Famili, J.M. Peña) |
| 11:15-12:30 | Session 1 |
| 12:30-14:00 | Lunch |
| 14:00-15:30 | Session 2 |
| 15:30-16:00 | Coffee Break |
| 16:00-17:15 | Session 3 |
| Paper | Classification of Microarrays with kNN: Comparison of Dimensionality Reduction Methods |
| Authors | S. Deegalla, H. Boström |
| Paper | Discovering informative genes from gene expression data: A multi-strategy approach |
| Authors | F. Famili, S. Phan, Z. Liu, Y. Pan, A. Djebbari, A. Lenferink, M. O’Connor |
| Paper | Breast Cancer Biomarker Selection Using Multiple Offspring Sampling |
| Authors | A. LaTorre, J.M. Peña, S. González, O. Cubo, F. Famili |
| Paper | Combining APRIORI and Bootstrap Techniques for Marker Analysis |
| Authors | G. Gamberoni, E. Lamma, F. Riguzzi, C. Scapoli, S. Storari |
| Paper | Knowledge Discovery in Neuroblastoma-related Biological Data |
| Authors | E. van de Koppel, I. Slavkov, K. Astrahantseff, A. Schramm, J. Schulte, J. Vandesompele, E. de Jong, S. Dzeroski, A. Knobbe |
| Paper | Partially-supervised context-specific independence mixture modeling |
| Authors | B. Georgi, A. Schliep |
| Paper | Using Symmetric Causal Independence Models to Predict Gene Expression from Sequence Data |
| Authors | R. Jurgelenaite, T. Heskes, T. Dijkstra |
| Paper | Identification of cooperative mechanisms in transcription regulatory networks using non-supervised |
| Authors | A. T. Freitas, A. P. Ramalho, C. A. Oliveira, C. S. Nogueira, M. C. Teixeira, I. Sá-Correia, A. L. Oliveira |
| Paper | Generating Data from the Evolution of Artificial Regulatory Networks |
| Authors | Y. Sánchez-Dehesa, J.M. Peña, G. Beslon |
| Paper | Transcription Factor Binding Site Discovery by the Probabilistic Rules |
| Authors | I. Khomicheva, A. Demin, E. Vityaev |
Interested participants are requested to submit their papers in PDF or Postscript. Authors should submit their papers using Springer LNCS format. For templates please refer to: http://www.springer.de/comp/lncs/authors.html. Papers should not exceed 10-12 pages. Submissions should be made to the following:
Dr. A. Fazel Famili fazel.famili@nrc.gc.ca