Prediction Method:
Paul Horton, Keun-Joon Park, Takeshi Obayashi & Kenta Nakai,
"Protein Subcellular Localization Prediction with WoLF PSORT",
Proceedings of the 4th Annual Asia Pacific Bioinformatics Conference APBC06, Taipei, Taiwan. pp. 39-48, 2006.
[Abstract]
[Paper]
Developers
WoLF PSORT is being developed by
- Paul HORTON of CBRC
- Keun-Joon PARK at CBRC, now at Korea Center for Disease Control & Prevention
- Takeshi OBAYASHI at Tokyo Institute of Technology, now at Tokyo University Human Genome Center
- C.J. Adams-Collier of Collier Technologies
- Kenta NAKAI of Tokyo University Human Genome Center
Dataset
The dataset is based mainly on annotation from Uniprot and Gene
Ontology. The table below gives a correspondence between our
localization site definitions and Gene Ontology. However, many of our
entries are based solely on Uniprot "Subcellular Localization" field
keywords and in some of these cases the site assignment may not be
completely consistent with the GO cellular component annotation.
Localization Sites and corresponding GO cellular components.
| Abbrev | Localization Site | GO Cellular Component |
|---|---|---|
| chlo | chloroplast | 0009507, 0009543 |
| cyto | cytosol | 0005829 |
| cysk | cytoskeleton | 0005856(2) |
| E.R. | endoplasmic reticulum | 0005783 |
| extr | extracellular | 0005576, 0005618 |
| golg | Golgi apparatus | 0005794(1) |
| lyso | lysosome | 0005764 |
| mito | mitochondria | 0005739 |
| nucl | nuclear | 0005634 |
| pero | peroxisome | 0005777(2) |
| plas | plasma membrane | 0005886 |
| vacu | vacuolar membrane | 0005774(2) |
Abbreviation, Localization Site, and corresponding GO Cellular Component(s) are given
for each localization site. Numbers in parentheses, such as "0005856(2)", indicate that descendant
"part_of" cellular components were also included, up to the specified depth (2 in this case).
For example, all of the children and grandchildren of "GO:0005856" were included as "cysk".
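The depth notation can be illustrated with a minimal sketch. This is not the code used to build the dataset: the function names, the toy "part_of" graph, and the "TOY:" identifiers are hypothetical placeholders, and entries without parentheses are assumed to mean depth 0 (the term itself only).

```python
import re
from collections import deque

def parse_go_spec(spec):
    """Split a table entry such as '0005856(2)' into (GO id, depth).

    Assumption: an entry without parentheses means depth 0.
    """
    match = re.fullmatch(r"(\d{7})(?:\((\d+)\))?", spec.strip())
    if match is None:
        raise ValueError(f"unrecognized GO entry: {spec!r}")
    depth = int(match.group(2)) if match.group(2) else 0
    return "GO:" + match.group(1), depth

def descendants_within_depth(children, root, max_depth):
    """Return root plus every term reachable in at most max_depth steps.

    `children` maps a term to the terms that are "part_of" it.
    Breadth-first search visits each term at its shortest distance
    from the root, so a term is kept if any path is short enough.
    """
    seen = {root}
    queue = deque([(root, 0)])
    while queue:
        term, depth = queue.popleft()
        if depth == max_depth:
            continue  # do not expand beyond the requested depth
        for child in children.get(term, ()):
            if child not in seen:
                seen.add(child)
                queue.append((child, depth + 1))
    return seen

# Toy graph with hypothetical placeholder terms (not real GO data).
toy_children = {
    "GO:0005856": ["TOY:child_a", "TOY:child_b"],
    "TOY:child_a": ["TOY:grandchild_a1"],
    "TOY:grandchild_a1": ["TOY:great_grandchild"],
}

root, depth = parse_go_spec("0005856(2)")
cysk_terms = descendants_within_depth(toy_children, root, depth)
```

With this toy graph, `cysk_terms` holds the root, both children, and the one grandchild, while the great-grandchild at depth 3 is left out.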
Standalone Package
WoLF PSORT package version 0.2
was released in September 2006. It is free for academic use and is also relatively easy
for industrial users to use. Please see the package documentation for details.
Prediction Accuracy by Localization Site
The accuracy varies greatly between
localization sites. The general trend is that sites with few
UniProt-annotated proteins are seldom predicted correctly.
On a separate
localization accuracy by utility page,
we have compiled some statistics to help assess
this accuracy quantitatively.
What's in a name
"WoLF" does not necessarily stand for anything. A rather dramatic
mnemonic would be "Where Life Functions". Originally it was going to
be "Learned Weight Features" but I wanted the acronym to be a
pronounceable English word. Women only Love Fools.
Acknowledgements
- WoLF PSORT relies heavily on features inherited from PSORT (Nakai & Kanehisa).
- WoLF PSORT also uses some sequence features from iPSORT (Bannai
et al.).
- Dr. Ohta provided valuable advice on the best way to extract localization
data from GO.
- The original server design was done by C.J. Collier. (But he is not to blame for subsequent hacking...)
Copyright (C) National Institute of Advanced Science and Technology (AIST), Computational Biology Research Center (CBRC). All Rights Reserved.