diff commons/pyRepetUnit/align/hmmOutputParsing/tests/datas/OutputHmmpfamTest @ 18:94ab73e8a190
Uploaded
author | m-zytnicki |
---|---|
date | Mon, 29 Apr 2013 03:20:15 -0400 |
parents | |
children |
--- /dev/null Thu Jan 01 00:00:00 1970 +0000
+++ b/commons/pyRepetUnit/align/hmmOutputParsing/tests/datas/OutputHmmpfamTest Mon Apr 29 03:20:15 2013 -0400
@@ -0,0 +1,406 @@
+hmmpfam - search one or more sequences against HMM database
+HMMER 2.3.2 (Oct 2003)
+Copyright (C) 1992-2003 HHMI/Washington University School of Medicine
+Freely distributed under the GNU General Public License (GPL)
+- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
+HMM file: /home/choede/hmmer3/Pfam_fs
+Sequence file: ConsensusTestFile_aaWithoutStop.fsa
+- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
+
+Query sequence: blumeria_Grouper_590_20:NoCat_1
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+DUF234 Archaea bacterial proteins of unknown 3.2 1.5 1
+DUF1414 Protein of unknown function (DUF1414) 2.9 6.3 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+DUF234 1/1 91 108 .. 5 22 .. 3.2 1.5
+DUF1414 1/1 111 119 .. 1 9 [. 2.9 6.3
+
+Alignments of top-scoring domains:
+DUF234: domain 1 of 1, from 91 to 108: score 3.2, E = 1.5
+ *->VyPNrseIEsGnikevle<-*
+ VyPN++ IEs ++k++++
+ blumeria_G 91 VYPNIEIIESSTPKPLIN 108
+
+DUF1414: domain 1 of 1, from 111 to 119: score 2.9, E = 6.3
+ *->HkAPvDLSL<-*
+ H ++DLSL
+ blumeria_G 111 HXSSPDLSL 119
+
+//
+
+Query sequence: blumeria_Grouper_590_20:NoCat_2
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+ [no hits above thresholds]
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+ [no hits above thresholds]
+
+Alignments of top-scoring domains:
+ [no hits above thresholds]
+//
+
+Query sequence: blumeria_Grouper_590_20:NoCat_3
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+CPW_WPC Plasmodium falciparum domain of unknown func 1.5 7.7 1
+HECT HECT-domain (ubiquitin-transferase) 0.0 9.2 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+CPW_WPC 1/1 30 37 .. 1 9 [. 1.5 7.7
+HECT 1/1 55 69 .. 341 355 .] 0.0 9.2
+
+Alignments of top-scoring domains:
+CPW_WPC: domain 1 of 1, from 30 to 37: score 1.5, E = 7.7
+ *->CerdYsisk<-*
+ C++dYs sk
+ blumeria_G 30 CQFDYS-SK 37
+
+HECT: domain 1 of 1, from 55 to 69: score 0.0, E = 9.2
+ *->LllAIneeteGFgle<-*
+ Ll+A+n+ + G+ ++
+ blumeria_G 55 LLTAVNNANTGYTQS 69
+
+//
+
+Query sequence: blumeria_Grouper_590_20:NoCat_4
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+DUF46 Putative integral membrane protein DUF46 6.4 0.11 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+DUF46 1/1 82 91 .. 173 182 .] 6.4 0.11
+
+Alignments of top-scoring domains:
+DUF46: domain 1 of 1, from 82 to 91: score 6.4, E = 0.11
+ *->YkLGlKdVpW<-*
+ Y+LGlK V++
+ blumeria_G 82 YLLGLKGVWY 91
+
+//
+
+Query sequence: blumeria_Grouper_590_20:NoCat_5
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+POC4 POC4 chaperone -1.7 6.3 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+POC4 1/1 121 125 .. 276 280 .] -1.7 6.3
+
+Alignments of top-scoring domains:
+POC4: domain 1 of 1, from 121 to 125: score -1.7, E = 6.3
+ *->SGLYI<-*
+ SGLYI
+ blumeria_G 121 SGLYI 125
+
+//
+
+Query sequence: blumeria_Grouper_590_20:NoCat_6
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+ [no hits above thresholds]
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+ [no hits above thresholds]
+
+Alignments of top-scoring domains:
+ [no hits above thresholds]
+//
+
+Query sequence: blumeria_Grouper_4152_12:NoCat_1
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+Peptidase_S29 Hepatitis C virus NS3 protease 4.7 1.1 1
+DGOK 2-keto-3-deoxy-galactonokinase 0.5 1.3 1
+TNV_CP Satellite tobacco necrosis virus coat p 0.1 7.9 1
+TrbL TrbL/VirB6 plasmid conjugal transfer pr 0.8 8 1
+Amino_oxidase Flavin containing amine oxidoreductase -0.6 9.1 1
+DUF1301 Protein of unknown function (DUF1301) 0.4 9.9 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+Amino_oxidase 1/1 27 38 .. 471 482 .] -0.6 9.1
+TrbL 1/1 33 67 .. 231 285 .] 0.8 8
+TNV_CP 1/1 58 80 .. 167 189 .. 0.1 7.9
+DGOK 1/1 94 109 .. 283 298 .] 0.5 1.3
+Peptidase_S29 1/1 113 127 .. 1 15 [. 4.7 1.1
+DUF1301 1/1 148 157 .. 1 10 [. 0.4 9.9
+
+Alignments of top-scoring domains:
+Amino_oxidase: domain 1 of 1, from 27 to 38: score -0.6, E = 9.1
+ CS HHHHHHHHHHHH
+ *->veSGlrAAaril<-*
+ veSG rAA +l
+ blumeria_G 27 VESGXRAASTLL 38
+
+TrbL: domain 1 of 1, from 33 to 67: score 0.8, E = 8
+ *->ksstiesfnlfvllaivllslvlilllkqipgiAqglvgavvtlgga
+ ++st+ + +++ ++ +++++l+l++ ++iA
+ blumeria_G 33 AASTLLI-----AVEHIVNCAAFCLALRSSASIA------------- 61
+
+ vaaalaGG<-*
+ a+ +GG
+ blumeria_G 62 --ASNSGG 67
+
+TNV_CP: domain 1 of 1, from 58 to 80: score 0.1, E = 7.9
+ CS SSSGGGB-TTEEEEEEEES-SS-
+ *->AdvAaSNgpgAvFvLeigdkvag<-*
+ A++AaSN g vF+L +++ g
+ blumeria_G 58 ASIAASNSGGCVFFLPAASSAGG 80
+
+DGOK: domain 1 of 1, from 94 to 109: score 0.5, E = 1.3
+ *->VdGDeAvrAGLseiAr<-*
+ VdG A AGL ++ r
+ blumeria_G 94 VDGVSALFAGLTSAGR 109
+
+Peptidase_S29: domain 1 of 1, from 113 to 127: score 4.7, E = 1.1
+ *->GevqvLgTaTqsflG<-*
+ G+v++ + aT+ flG
+ blumeria_G 113 GTVMRFSGATTVFLG 127
+
+DUF1301: domain 1 of 1, from 148 to 157: score 0.4, E = 9.9
+ *->gVKvFSlSTS<-*
+ ++vFS+STS
+ blumeria_G 148 SARVFSFSTS 157
+
+//
+
+Query sequence: blumeria_Grouper_4152_12:NoCat_2
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+DUF1602 Protein of unknown function (DUF1602) 5.0 0.39 1
+Toxin_18 Conotoxin O-superfamily 4.2 4.4 1
+ABC_transp_aux ABC-type uncharacterized transport sys 1.1 5.2 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+Toxin_18 1/1 21 26 .. 50 55 .] 4.2 4.4
+ABC_transp_aux 1/1 126 137 .. 276 287 .] 1.1 5.2
+DUF1602 1/1 143 159 .. 23 39 .] 5.0 0.39
+
+Alignments of top-scoring domains:
+Toxin_18: domain 1 of 1, from 21 to 26: score 4.2, E = 4.4
+ *->gfCLpr<-*
+ +fCL+r
+ blumeria_G 21 NFCLHR 26
+
+ABC_transp_aux: domain 1 of 1, from 126 to 137: score 1.1, E = 5.2
+ *->wGvrldpdlVlD<-*
+ wG+r+dp+l+l
+ blumeria_G 126 WGIRWDPRLALR 137
+
+DUF1602: domain 1 of 1, from 143 to 159: score 5.0, E = 0.39
+ *->kPPlSSsGalfanplrP<-*
+ +P l + G+l + +lrP
+ blumeria_G 143 MPKLNRRGFLASRLLRP 159
+
+//
+
+Query sequence: blumeria_Grouper_4152_12:NoCat_3
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+zf-P11 P-11 zinc finger 3.6 1.8 1
+V-ATPase_G Vacuolar (H+)-ATPase G subunit 2.4 5.3 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+zf-P11 1/1 102 122 .. 1 20 [. 3.6 1.8
+V-ATPase_G 1/1 126 135 .. 1 10 [. 2.4 5.3
+
+Alignments of top-scoring domains:
+zf-P11: domain 1 of 1, from 102 to 122: score 3.6, E = 1.8
+ *->GplnCKs..CWfkdknLveCsd<-*
+ Gp +C+s+ CW + + L C d
+ blumeria_G 102 GPHFCRSrxCW-NRDALLGCDD 122
+
+V-ATPase_G: domain 1 of 1, from 126 to 135: score 2.4, E = 5.3
+ *->sqsqGIQqLL<-*
+ ++s+GIQ LL
+ blumeria_G 126 GGSGGIQDLL 135
+
+//
+
+Query sequence: blumeria_Grouper_4152_12:NoCat_4
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+XhoI Restriction endonuclease XhoI 9.9 0.014 1
+Endomucin Endomucin 0.1 6 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+XhoI 1/1 92 122 .. 172 202 .] 9.9 0.014
+Endomucin 1/1 113 119 .. 261 267 .] 0.1 6
+
+Alignments of top-scoring domains:
+XhoI: domain 1 of 1, from 92 to 122: score 9.9, E = 0.014
+ *->klvlerfytrpasllrdaqavlqGiikeprk<-*
+ ++++ ++yt ++ +d++++ qG k+ ++
+ blumeria_G 92 RSSRQKKYTPSRIACSDRSGGAQGKTKSRAI 122
+
+Endomucin: domain 1 of 1, from 113 to 119: score 0.1, E = 6
+ *->AQGKtKN<-*
+ AQGKtK
+ blumeria_G 113 AQGKTKS 119
+
+//
+
+Query sequence: blumeria_Grouper_4152_12:NoCat_5
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+SLT Transglycosylase SLT domain 4.5 1.2 1
+DUF881 Bacterial protein of unknown function 4.0 1.6 1
+DUF2346 Uncharacterized conserved protein (DUF 3.2 2.8 1
+DUF1798 Bacterial domain of unknown function ( 3.0 3.3 1
+LBP_BPI_CETP LBP / BPI / CETP family, N-terminal do 1.8 3.8 1
+Jun Jun-like transcription factor -0.7 6.5 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+DUF1798 1/1 2 19 .. 35 52 .. 3.0 3.3
+DUF881 1/1 46 69 .. 214 237 .] 4.0 1.6
+Jun 1/1 77 93 .. 264 284 .. -0.7 6.5
+SLT 1/1 89 108 .. 1 20 [. 4.5 1.2
+DUF2346 1/1 93 115 .. 63 85 .] 3.2 2.8
+LBP_BPI_CETP 1/1 124 139 .. 191 209 .] 1.8 3.8
+
+Alignments of top-scoring domains:
+DUF1798: domain 1 of 1, from 2 to 19: score 3.0, E = 3.3
+ *->KPfvdevdqllaeWkelA<-*
+ +P+vd+vd++++++k+lA
+ blumeria_G 2 VPEVDMVDAEVEKLKTLA 19
+
+DUF881: domain 1 of 1, from 46 to 69: score 4.0, E = 1.6
+ *->VeksdditiPAydgplklrYAkPv<-*
+ V + +it+PA++ p +r Ak++
+ blumeria_G 46 VAPEKRITVPALPRPAEVRPAKRA 69
+
+Jun: domain 1 of 1, from 77 to 93: score -0.7, E = 6.5
+ *->qHheNPpgfqhsavgpPRlaa<-*
+ +++eNPp+ a++pP+l a
+ blumeria_G 77 AGAENPPL----APQPPALEA 93
+
+SLT: domain 1 of 1, from 89 to 108: score 4.5, E = 1.2
+ CS HHHHHHHHHHTS-HHHHHHH
+ *->dliikaaekygidpsllaAi<-*
+ ++ ++a+ k+ ++p llaAi
+ blumeria_G 89 PALEAAGRKNTHPPELLAAI 108
+
+DUF2346: domain 1 of 1, from 93 to 115: score 3.2, E = 2.8
+ *->AtKRrEkhdneLlealeeeEaKk<-*
+ A R+ h eLl+a+e eE+
+ blumeria_G 93 AAGRKNTHPPELLAAIEAEERRA 115
+
+LBP_BPI_CETP: domain 1 of 1, from 124 to 139: score 1.8, E = 3.8
+ CS HHHHHHHCHHHH...HTTS
+ *->lCPviessVnslNvhLstl<-*
+ +C +++ssV++ L++l
+ blumeria_G 124 ICSTAISSVEAA---LQPL 139
+
+//
+
+Query sequence: blumeria_Grouper_4152_12:NoCat_6
+Accession: [none]
+Description: [none]
+
+Scores for sequence family classification (score includes all domains):
+Model Description Score E-value N
+-------- ----------- ----- ------- ---
+DUF258 Protein of unknown function, DUF258 1.5 3.8 1
+TRAP_alpha Translocon-associated protein (TRAP), alph 0.1 4.1 1
+DUF1289 Protein of unknown function (DUF1289) 2.8 5.3 1
+SOCS_box SOCS box 2.4 9.7 1
+
+Parsed for domains:
+Model Domain seq-f seq-t hmm-f hmm-t score E-value
+-------- ------- ----- ----- ----- ----- ----- -------
+DUF258 1/1 50 62 .. 293 305 .] 1.5 3.8
+SOCS_box 1/1 85 90 .. 1 6 [. 2.4 9.7
+DUF1289 1/1 95 115 .. 36 56 .] 2.8 5.3
+TRAP_alpha 1/1 106 116 .. 317 327 .] 0.1 4.1
+
+Alignments of top-scoring domains:
+DUF258: domain 1 of 1, from 50 to 62: score 1.5, E = 3.8
+ CS -HHHHHHHHHHHH
+ *->seeRYesYlklle<-*
+ s++R+++Y+ l++
+ blumeria_G 50 SASRFQHYRDLQK 62
+
+SOCS_box: domain 1 of 1, from 85 to 90: score 2.4, E = 9.7
+ *->prSLqh<-*
+ prSLqh
+ blumeria_G 85 PRSLQH 90
+
+DUF1289: domain 1 of 1, from 95 to 115: score 2.8, E = 5.3
+ *->dERravlqllpqRlaalglkp<-*
+ +E+ + lq+++qR +++++ +
+ blumeria_G 95 AEKIHTLQNCLQRSKRRSAGQ 115
+
+TRAP_alpha: domain 1 of 1, from 106 to 116: score 0.1, E = 4.1
+ *->kRkvKRsvGdD<-*
+ +R +Rs+G+D
+ blumeria_G 106 QRSKRRSAGQD 116
+
+//