runWolfPsortHtmlTables - Run WoLF PSORT subcellular localization prediction on input sequences and output relatively detailed html files as output.
runWolfPsortHtmlTables [OPTIONS] organismType outputDir [queryName]
runWolfPsortHtmlTables (--usage|--help|--man)
Pipe sequences in from standard in.
Run WoLF PSORT subcellular localization prediction on input sequences and write results as html to files under the outputDir directory.
Input in fasta form is expected from standard in.
The summary output is written to the file outputDir/htdocs/results/queryName.html. The output looks roughly like
seq1 details extr_plas: 11.5, plas: 11, extr: 10, E.R.: 4, lyso: 4, pero: 1.5, cyto_pero: 1.5, vacu: 1 seq2 details extr: 25, lyso: 3, plas: 2, nucl: 1, E.R.: 1 seq3 details extr: 31, lyso: 1
Tables showing the values of each localization feature for the query and nearest neighbor sequences can be found by following the ``details'' link.
Each line contains a several localization classes with their scores. The localization classes are:
abbrev. site GO cellular component number
extr extracellular 0005576, 0005618
cysk cytoskeleton 0005856
cyto cytosol(sans cytoskeleton) 0005829
E.R. endoplasmic reticulum 0005783
golg Golgi apparatus 0005794
mito mitochondria 0005739
nucl nucleus 0005634
plas plasma membrane 0005886
pero peroxisome 0005777
vacu vacuolar membrane 0005774
chlo chloroplast 0009507, 0009543
lyso lysozyme 0005764
The GO cellular component number is given for here, but most entries in our current dataset are actually based on the Uniprot and depend on that annotation. Localization classes including underscores indicate the possibility of localizing to two sites, for example ``cyto_nucl'' indicates proteins which can localize to both the cytosol and/or the nucleus. No distinction is made between conditional and constitutive dual localization.
runWolfPsortHtmlTables animal outdir < hamster.fasta
In this section, scriptDir denotes the directory in which this script resides, seqNo represents the number (e.g. 1, 2, etc.) of the input sequence when the query fasta stream contains multiple sequences
Paul Horton horton-p AT aist.go.jp
This Script: Copyright (C) 2004-2006, Paul B. Horton & C.J. Collier, All Rights Reserved.
PSORT: Copyright (C) 1997, 2004-2006, Kenta Nakai & Paul B. Horton, All Rights Reserved.
Paul Horton, Keun-Joon Park, Takeshi Obayashi & Kenta Nakai, ``Protein Subcellular Localization Prediction with WoLF PSORT'', Proceedings of the 4th Annual Asia Pacific Bioinformatics Conference APBC06, Taipei, Taiwan. pp. 39-48, 2006.
runWolfPsortHtmlSummary