Applied Bioinformatics Group

A   A   A
Home > Services >

Skip to content. | Skip to navigation

Assigning subcellular localization to a protein is an important step towards elucidating its interaction partners, function, and potential role(s) in the cellular machinery. Computational tools offer an attractive complement to time-consuming and laborious experimental methods.

We have designed several systems for predicting the subcellular localization of eukaryotic proteins from the amino acid sequence:


YLoc (2010)

YLoc is a highly interpretable subcellular localization predictor that performes comparable to current state-of-the-art predictors. For every prediction, YLoc gives a reasoning why this prediction was made and which biological properties of the protein lead to this prediction. A confidence scores helps users to rate their trust in the prediction. In addition, it predicts the localization sites of multiple-targeted proteins.


SherLoc2 (2009)

SherLoc2 combines the amino acid information, knowledge on domains, and phylogenetic profiles with text-based information. It predicts 11 locations of eukaryotic cells with very high prediction accuracy. In addition, it offers users to include background knowledge by describing their protein.


MultiLoc2 (2009)

MultiLoc2 is a high-accuracy subcellular localization predictor. Similar to its predecessor, it combines information on amino acid composition and N-terminal targeting signals. In addition, knowledge on domains in form of GO terms and phylogenetic profiles are used to gain a higher prediction performance. It is available in two versions: A low-resultion version is specialized on predicting the location of globular proteins. A high-resolution version predicts all 11 main locations of eukaryotic cells.


SherLoc (2006)

SherLoc combines the information obtained from MultiLoc such as amino acid composition and N-terminal sorting signals with text-based information from PubMed abstracts. It supports 11 eukaryotic localizations.


MultiLoc (2006)

MultiLoc integrates several sources of sequence-based information in order to assign subcellular localization and supports 11 eukaryotic localizations. MultiLoc integrates several sources of relevant sequence-based information i.e. N-terminal targeting sequences, amino acid composition, and sequence motifs, in order to provide reliable predictions on a proteome-wide scale. MultiLoc is based on support vector machines (SVMs). TargetLoc, the low resolution version of MultiLoc, was constructed to distinguish globular proteins and support 4 and 3 localizations for plant and non-plant, respectively.