Lars Juhl Jensen started his research career in Søren Brunak’s group at the Technical University of Denmark (DTU), from where he in 2002 received the Ph.D. degree in bioinformatics for his work on non-homology based protein function prediction. During this time, he also developed methods for visualization of microbial genomes, pattern recognition in promoter regions, and microarray analysis. From 2003 to 2008, he was at the European Molecular Biology Laboratory (EMBL) where he worked on literature mining, integration of large-scale experimental datasets, and analysis of biological interaction networks. Since 2009, he has continued this line of research as a professor at the Novo Nordisk Foundation Center for Protein Research at the Panum Institute in Copenhagen and as a founder, owner and scientific advisor of Intomics A/S. He is a co-author of more than 170 scientific publications that have in total received more than 20,000 citations. He was awarded the Lundbeck Foundation Talent Prize in 2003, his work on cell-cycle research was named “Break-through of the Year” in 2006 by the magazine Ingeniøren, his work on text mining won the first prize in the “Elsevier Grand Challenge: Knowledge Enhancement in the Life Sciences” in 2009, and he was awarded the Lundbeck Foundation Prize for Young Scientists in 2010.
Methodological advances have in recent years given us unprecedented information on the molecular details of living cells. However, it remains a challenge to collect all the available data on individual genes and to integrate the highly heterogeneous evidence available with what is described in the scientific literature. The STRING database aims to address this by consolidating known and predicted protein–protein association data for a large number of organisms.
In my presentation, I will give an overview of the STRING database and describe the general approach we use to unify heterogeneous data, provide comparable quality scores for all evidence types, and automatically mine associations from the biomedical literature. I will also show how STRING and associated resources can be used through Cytoscape to visualize your own omics data on a network or to construct and compare, for example, disease-related protein networks.