Research
Big Data Analysis and Biomedical Research meet in our lab: We develop novel Data Mining algorithms for detecting patterns and statistical dependencies in large datasets from Biology and Medicine.
The ultimate goal in our work is to contribute to two big goals of Science in the 21st century: To enable the automatic generation of new knowledge from Big Data through Machine Learning, and to help to gain an understanding of the relationship between diseases and molecular properties of patients, thereby enabling Personalized Medicine.
Below you can find further information for some of our projects:
Machine Learning: High-Dimensional Correlations
We develop methods for measuring statistical dependence between high dimensional variables, two-sample tests to tell whether two samples were drawn from the same distribution, outlier detection algorithms to tell find "unusual" observations in a given dataset, and approaches that detect non-linear dependence between variables.
Machine Learning: Significant Pattern Mining
We develop methods that discover significant patterns in high dimensional datasets while being runtime efficient and statistically sound. Our algorithms can be applied to graphs or collections of sequences and allow to account for dependencies between objects, to control the Family-Wise Error Rate and to correct for categorical covariates.
Computational Biology: Genome-Wide Association Studies
We develop efficient multivariate approaches for the genome-wide discovery of genetic loci that are associated with a phenotype, thereby trying to elucidate the multicausal basis of complex traits.
- chevron_right easyGWAS - an Online Tool for Performing Genome-Wide Association Studies
- chevron_right Tools for SNP x SNP Interaction Discovery
- chevron_right Lasso Model with Population Structure Correction (LMM-Lasso)
- chevron_right Network GWAS (SConES)
- chevron_right Multi-Trait Network GWAS (Multi-SConES)
- chevron_right Finding Genomic Intervals of Genetic Heterogeneity (FAIS)
- chevron_right In silico Phenotyping via Co-training
Computational Biology: Molecular Graph Classification via Graph Kernels
We developed new, fast and scalable similarity measures on graphs, so-called graph kernels. Their prime purpose is to compare molecular graphs or protein structures and to classify them into functional categories.
Personalized Medicine
We coordintate several national and internation networks on personalized medicine: