Puneet Varma (Editor)

Protein subcellular localization prediction

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit

Protein subcellular localization prediction (or just protein localization prediction) involves the prediction of where a protein resides in a cell, its subcellular localization.

Contents

In general, prediction tools take as input information about a protein, such as a protein sequence of amino acids, and produce a predicted location within the cell as output, such as the nucleus, Endoplasmic reticulum, Golgi apparatus, extracellular space, or other organelles. The aim is to build tools that can accurately predict the outcome of protein targeting in cells.

Prediction of protein subcellular localization is an important component of bioinformatics based prediction of protein function and genome annotation, and it can aid the identification of drug targets.

Background

Experimentally determining the subcellular localization of a protein can be a laborious and time consuming task. Immunolabeling or tagging (such as with a green fluorescent protein) to view localization using fluorescence microscope are often used. A high throughput alternative is to use prediction.

Through the development of new approaches in computer science, coupled with an increased dataset of proteins of known localization, computational tools can now provide fast and accurate localization predictions for many organisms. This has resulted in subcellular localization prediction becoming one of the challenges being successfully aided by bioinformatics, and machine learning.

Many prediction methods now exceed the accuracy of some high-throughput laboratory methods for the identification of protein subcellular localization. Particularly, some predictors have been developed that can be used to deal with proteins that may simultaneously exist, or move between, two or more different subcellular locations. Experimental validation is typically required to confirm the predicted localizations.

Tools

In 1999 PSORT was the first published program to predict subcellular localization. Subsequent tools and websites have been released using techniques such as neural networks, support vector machine and protein motifs. Predictors can be specialized for proteins in different organisms. Some are specialized for eukaryotic proteins, some for human proteins, and some for plant proteins. Methods for the prediction of bacterial localization predictors, and their accuracy, have been reviewed.

The development of protein subcellular location prediction has been summarized in two comprehensive review articles. Recent tools and an experience report can be found in a recent paper by Meinken and Min (2012).

Application

Knowledge of the subcellular localization of a protein can significantly improve target identification during the drug discovery process. For example, secreted proteins and plasma membrane proteins are easily accessible by drug molecules due to their localization in the extracellular space or on the cell surface.

Bacterial cell surface and secreted proteins are also of interest for their potential as vaccine candidates or as diagnostic targets. Aberrant subcellular localization of proteins has been observed in the cells of several diseases, such as cancer and Alzheimer's disease. Secreted proteins from some archaea that can survive in unusual environments have industrially important applications.

By using prediction a high number of proteins can be assessed in order to find candidates that are trafficked to the desired location.

Predicted Protein Subcellular Location Database

The following protein subcellular location databases are available at: (1) FunSecKB: the fungal secretome knowledgebase at http://bioinformatics.ysu.edu/secretomes/fungi.php (2) FunSecKB2: the fungal secretome and subcellular knowledgebase at http://bioinformatics.ysu.edu/secretomes/fungi2/index.php (3) PlantSecKB: the plant secretome and subcellular knowledgebase at http://bioinformatics.ysu.edu/secretomes/plant/index.php (4) MetazSecKB: the animal and human protein subcellular location database - the whole subcellular proteomes such as secreteome can be searched or downloaded at http://bioinformatics.ysu.edu/secretomes/animal/index.php (5) ProtSecKB: the protist protein subcellular location database - the whole subcellular proteomies such as secretome can be searched or downloaded at http://bioinformatics.ysu.edu/secretomes/protist/index.php

References

Protein subcellular localization prediction Wikipedia