Mercurial > repos > thanhlv > fargene
diff test-data/mbl_superfamily.fasta @ 0:6f78bc71eb12 draft default tip
planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/fargene commit 2e1b1f737f3a3d7cfc6350c6936fbb0bd84a5dad-dirty
author | thanhlv |
---|---|
date | Fri, 19 Jul 2019 08:42:37 -0400 |
parents | |
children |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/mbl_superfamily.fasta Fri Jul 19 08:42:37 2019 -0400 @@ -0,0 +1,293 @@ +>tr_Q9WZL4_ Q9WZL4_THEMA Beta_lactamase OS=Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099) GN=TM_0755 PE=1 SV=1 +MPKIWTERIFDDPEIYVLRIDDDRIRYFEAVWEIPEGISYNAYLVKLNGANVLIDGWKGN +YAKEFIDALSKIVDPKEITHIIVNHTEPDHSGSLPATLKTIGHDVEIIASNFGKRLLEGF +YGIKDVTVVKDGEEREIGGKKFKFVMTPWLHWPDTMVTYLDGILFSCDVGGGYLLPEILD +DSNESVVERYLPHVTKYIVTVIGHYKNYILEGAEKLSSLKIKALLPGHGLIWKKDPQRLL +NHYVSVAKGDPKKGKVTVIYDSMYGFVENVMKKAIDSLKEKGFTPVVYKFSDEERPAISE +ILKDIPDSEALIFGVSTYEAEIHPLMRFTLLEIIDKANYEKPVLVFGVHGWAPSAERTAG +ELLKETKFRILSFTEIKGSNMDERKIEEAISLLKKELE +>tr_A5IJ30_ A5IJ30_THEP1 Flavodoxin/nitric oxide synthase OS=Thermotoga petrophila (strain RKU_1 / ATCC BAA_488 / DSM 13995) GN=Tpet_0174 PE=4 SV=1 +MPKIWTERIFDDPEIYVLRIDDDRIRYFEAVWEIPEGISYNAYLVKLNGANVLIDGWKGN +YAKEFIDALSKIVDPKEITHIIVNHTEPDHSGSLPATLKTIGHDVEIIASNFGKRLLEGF +YGIKDVTVVKDGEEREIGGKKFKFVMTPWLHWPDTMVTYLDGILFGCDVGGGYLLPEILD +DSNESVVERYLPHVTKYIVTVIGHYKNYILEGAEKLSSLEIKALLPGHGLIWKKDPQRLL +NHYVSVAKGDPKKGKVTVIYDSMYGFVENVMKKAIDSLKEKGFTPVVYKFSDEERPAISE +ILKDIPDSEALIFGVSTYEAEIHPLMRFTFLEIIDKANYEKPVLVFGVHGWAPSVERTAG +ELLKETKFRILSFTEIKGSNMDEKKIEEAISLLKKELG +>sp_Q9FDN7_FPRA_MOOTA Nitric oxide reductase OS=Moorella thermoacetica (strain ATCC 39073) GN=fprA PE=1 SV=1 +MSQPVAITDGIYWVGAVDWNIRYFHGPAFSTHRGTTYNAYLIVDDKTALVDTVYEPFKEE +LIAKLKQIKDPVKLDYLVVNHTESDHAGAFPAIMELCPDAHVLCTQRAFDSLKAHYSHID +FNYTIVKTGTSVSLGKRSLTFIEAPMLHWPDSMFTYVPEEALLLPNDAFGQHIATSVRFD +DQVDAGLIMDEAAKYYANILMPFSNLITKKLDEIQKINLAIKTIAPSHGIIWRKDPGRII +EAYARWAEGQGKAKAVIAYDTMWLSTEKMAHALMDGLVAGGCEVKLFKLSVSDRNDVIKE +ILDARAVLVGSPTINNDILPVVSPLLDDLVGLRPKNKVGLAFGAYGWGGGAQKILEERLK +AAKIELIAEPGPTVQWVPRGEDLQRCYELGRKIAARIAD +>sp_Q50497_FPRA_METTM Type A flavoprotein FprA OS=Methanothermobacter marburgensis (strain DSM 2133 / 14651 / NBRC 100331 / OCM 82 / Marburg) GN=fprA PE=1 SV=1 +MKAAAKRISDGVYWTGVLDWDLRNYHGYTLQGTTYNAYLVCGDEGVALIDNSYPGTFDEL +MARVEDALQQVGMERVDYIIQNHVEKDHSGVLVELHRRFPEAPIYCTEVAVKGLLKHYPS +LREAEFMTVKTGDVLDLGGKTLTFLETPLLHWPDSMFTLLDEDGILFSNDAFGQHLCCPQ +RLDREIPEYILMDAARKFYANLITPLSKLVLKKFDEVKELGLLERIQMIAPSHGQIWTDP +MKIIEAYTGWATGMVDERVTVIYDTMHGSTRKMAHAIAEGAMSEGVDVRVYCLHEDDRSE +IVKDILESGAIALGAPTIYDEPYPSVGDLLMYLRGLKFNRTLTRKALVFGSMGGNGGATG +TMKELLAEAGFDVACEEEVYYVPTGDELDACFEAGRKLAAEIRR +>tr_E3GY45_ E3GY45_METFV Flavodoxin/nitric oxide synthase OS=Methanothermus fervidus (strain ATCC 43054 / DSM 2088 / JCM 10308 / V24 S) GN=Mfer_0426 PE=4 SV=1 +MKAKAVKIKDGVYWVGVLDWDIRIYHGYTLKGTTYNAYLVFGEDKTCLIDNTYPGTETQL +WARIKDALEKEKREKIDVIVQNHVERDHSGALPQIHKKFPEAPIYCTEIAVDGLKKHYPQ +LKNADFIEVKTGDKLDLGNKTLAFVEAFLLHWPDSMFTLLVEDGILFPNDAFGQHLCYPQ +RYDYEIPEYVLMDAAQKFYANLITPLSKRVLKKFKEIEDLGLLNKIKMIAPSHGQIWTDP +MKIIGAYKDWAEGKCKNKITIIYDTMHYSTQKMAHAIAEGIISEGVDVRMYYLHEDERSE +IVKDILDSKAVAFGTPTIYDKPYPTLGDIIYYLKGLRFDRTGFKKLAITFGSMGGEGGAP +EIIANELKECGFEVIDEYEIFYIPDEKELEKCYEIGRKLAKKVKEM +>sp_Q58158_FPRA_METJA Type A flavoprotein FprA OS=Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL_1 / JCM 10045 / NBRC 100440) GN=fprA PE=3 SV=1 +MKKYESRRSKIADGVYWVGVLDWDIRMYHGYTLKGTTYNAYLVFGDEKVALIDNTYPGTS +AQMWGRIKDAFEKEGREFKIDVIVQNHVEKDHSGALPEIHKKFPDAPIYCTEVAVEGLKK +HYPSLKDAQFKVVHTGDTVDLGGKTLTFLEAPLLHWPDSMFTFYNEGGILFSNDAFGQHL +CFPAHKRFDKDIPEYVLMDANQKFYANLITPLSKLVLKKFEEVIQLGLLEKIKMIAPSHG +QIWTDPMKVIKAYQDFATGKAAKDKAVIVYDTMHYSTQKMAHAFAEGLMSEGIDVVMYFL +HYDERSEIVKDILDAKAVLFGIPTIYDEPYPSIGDIIYYLRGLKFNRTGFKRLAVTFGSM +GGEGGAVAKIAEDLAKCGFEVINQYELYYVPTEDELTNCYNMGKELAKRIKEMKIE +>tr_A6UPE2_ A6UPE2_METVS Flavodoxin/nitric oxide synthase OS=Methanococcus vannielii (strain SB / ATCC 35089 / DSM 1224) GN=Mevan_0457 PE=4 SV=1 +MKADAVKISDGVYWVGTYDWDIRSYHGYTLKGTTYNAYLVFGTEKVALIDNVYPGTSAQM +WGRIKDAFEKEGRKYNIDVIVQNHVEKDHSGALVEITKKFPESNIYCTEVAVEGLKKHYT +GLKDAPFKVVKSLESVDLGGKTLTFLEAPLLHWPDSMFTLYGEEGILFSNDAFGQHLCYT +KRFDNEIPENVLMDANQKFYANLITPLSKLVLKKFEQVISLGLLENIKMIAPSHGQIWTD +PMKVISAYQDFATGKCKNKATIVYDTMHYSTQKMAHAFAEGLLSEGIDVVIYNLHNDERS +EIVKDILDSKAVLFGIPTINDQPYPSIGDLMYYLRGLRFDRTGLKKLAITFGSMGGKGGA +AKLIGKDLKECGFEVLDDSYEVIYVPKEEELEKCYNAGKRLGIKLN +>gi_687617_gb_AAB88013.1_ flavoprotein g3 [Methanothermobacter thermautotrophicus] +MKAAAKRISDGVYWTGVLDWDLRNYHGYTLQGTTYNAYLVCGDEGVALIDNSYPGTFDELMARVEDALQQ +VGMERVDYIIQNHVEKDHSGVLVELHRRFPEAPIYCTEVAVKGLLKHYPSLREAEFMTVKTGDVLDLGGK +TLTFLETPLLHWPDSMFTLLDEDGILFSNDAFGQHLCCPQRLDREIPEYILMDAARKFYANLITPLSKLV +LKKFDEVKELGLLERIQMIAPSHGQIWTDPMKIIEAYTGWATGMVDERVTVIYDTMHGSTRKMAHAIAEG +AMSEGVDVRVYCLHEDDRSEIVKDILESGAIALGAPTIYDEPYPSVGDLLMYLRGLKFNRTLTRKALVFG +SMGGNGGATGTMKELLAEAGFDVACEEEVYYVPTGDELDACFEAGRKLAAEIRR +>sp_Q8ZRM2_GLO2_SALTY Hydroxyacylglutathione hydrolase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=gloB PE=1 SV=1 +MNLNSIPAFQDNYIWVLTNDEGRCVIVDPGEAAPVLKAIAEHKWMPEAIFLTHHHHDHVG +GVKELLQHFPQMTVYGPAETQDKGATHLVGDGDTIRVLGEKFTLFATPGHTLGHVCYFSR +PYLFCGDTLFSGGCGRLFEGTPSQMYQSLMKINSLPDDTLICCAHEYTLANIKFALSILP +HDSFINEYYRKVKELRVKKQMTLPVILKNERKINLFLRTEDIDLINEINKETILQQPEAR +FAWLRSKKDTF +>sp_O24495_GLO2M_ARATH Hydroxyacylglutathione hydrolase 1, mitochondrial OS=Arabidopsis thaliana GN=GLX2_1 PE=2 SV=2 +MPVISKASSTTTNSSIPSCSRIGGQLCVWPGLRQLCLRKSLLYGVMWLLSMPLKTLRGAR +KTLKITHFCSISNMPSSLKIELVPCSKDNYAYLLHDEDTGTVGVVDPSEAAPVIEALSRK +NWNLTYILNTHHHDDHIGGNAELKERYGAKVIGSAVDKDRIPGIDILLKDSDKWMFAGHE +VRILDTPGHTQGHISFYFPGSATIFTGDLIYSLSCGTLSEGTPEQMLSSLQKIVSLPDDT +NIYCGRENTAGNLKFALSVEPKNETLQSYATRVAHLRSQGLPSIPTTVKVEKACNPFLRI +SSKDIRKSLSIPDSATEAEALRRIQRARDRF +>tr_B9SU05_ B9SU05_RICCO Hydroxyacylglutathione hydrolase, putative OS=Ricinus communis GN=RCOM_0454360 PE=3 SV=1 +MQMISKASCAMASIPCSRVRSGLCIRPGARQLCFRKGLLYGFMHLLSMPFKTLRGASRTL +KVAQFCSVSNMSSSLQIELVPCLRDNYAYLLHDMDTGTVGVVDPSEAVPIIDALTKKNRN +LTYILNTHHHHDHTGGNEELKARYGAKVIGPGTDRDRIPGIDIVLNDGDKWMFAGHEVLV +METPGHTRGHISFYFPGSGSIFTGDTLFSLSCGKLFEGTPEQMHSSLGKIMSLPDDTNIY +CGHEYTLSNSKFALSIEPNNEALRSYAAHVTHLRSKSLPTGGAAQEQNPCV +>sp_Q9SID3_GLO2N_ARATH Hydroxyacylglutathione hydrolase 2, mitochondrial OS=Arabidopsis thaliana GN=At2g31350 PE=1 SV=1 +MQTISKASSATSFFRCSRKLSSQPCVRQLNIRKSLVCRVMKLVSSPLRTLRGAGKSIRVS +KFCSVSNVSSLQIELVPCLKDNYAYILHDEDTGTVGVVDPSEAEPIIDSLKRSGRNLTYI +LNTHHHYDHTGGNLELKDRYGAKVIGSAMDKDRIPGIDMALKDGDKWMFAGHEVHVMDTP +GHTKGHISLYFPGSRAIFTGDTMFSLSCGKLFEGTPKQMLASLQKITSLPDDTSIYCGHE +YTLSNSKFALSLEPNNEVLQSYAAHVAELRSKKLPTIPTTVKMEKACNPFLRSSNTDIRR +ALRIPEAADEAEALGIIRKAKDDF +>tr_D7LD21_ D7LD21_ARALL Glyoxalase 2_5 OS=Arabidopsis lyrata subsp. lyrata GN=GLX2_5 PE=3 SV=1 +MQTISKASSAISFFRCSRKLSSQPCVRQLNLRKGLVCRVMKLVSSPLRTLRGAGKSIRVS +KFCSVSNVSSLQIELVPCLKDNYAYILHDEDTGTVGVVDPSEAEPVIDSLKRSGRNLTYI +LNTHHHYDHTGGNLELKDRYGAKVIGSAMDKDRIPGIDIALKDGDKWMFAGHEVHVMDTP +GHTKGHISLYFPGSRAIFTGDTLFSLSCGKLFEGTPKQMLASLQKIISLPDDTSIYCGHE +YTLSNSKFALSLEPNNEILQSYAAHVAELRSKKLPTIPTTLKMEKACNPFLRSSNTDIRR +ALRIPETADEAEALGIIRKAKDDF +>sp_A1A7Q3_GLO2_ECOK1 Hydroxyacylglutathione hydrolase OS=Escherichia coli O1:K1 / APEC GN=gloB PE=3 SV=1 +MNLNSIPAFDDNYIWVLNDEAGRCLIVDPGDAEPVLNAISANNWQPEAIFLTHHHHDHVG +GVKELVEKFPQIVVYGPQETQDKGTTQVVKDGETAFVLGHEFSVIATPGHTLGHICYFSK +PYLFCGDTLFSGGCGRLFEGTPSQMYQSIKKLSALPDDTLVCCAHEYTLSNMKFALSILP +HDLSINDYYRKVKELRAKNQITLPVILKNERQINVFLRTEDIDLINVINEETLLQQPEER +FAWLRSKKDRF +>sp_Q325T4_GLO2_SHIBS Hydroxyacylglutathione hydrolase OS=Shigella boydii serotype 4 (strain Sb227) GN=gloB PE=3 SV=1 +MNLNSIPAFDDNYIWVLNDEAGRCLIVDPGDAEPVLNAITANNWQPEAIFLTHHHHDHVG +GVKELVEKFPQIVVYGPQETQDKGTTQVVKDGETAFVLGHEFSVITTPGHTLGHICYFSK +PYLFCGDTLFSGGCGRLFEGTALQMYQSLKKLSALPDDTLVCCAHEYTLSNMKFALSIFP +HDLSINDYYRKVKELRAKNQITLPVILKNERQINVFLRTEDIDLINGINEETLLQQPEER +FAWLRSKKDRF +>sp_Q9DB32_HAGHL_MOUSE Hydroxyacylglutathione hydrolase_like protein OS=Mus musculus GN=Haghl PE=2 SV=1 +MKVKVIPVLEDNYMYLIIEEHTREAVAIDVAVAERLLEIAGREGVSLTMVLSTHHHWDHT +RGNAELAHILPGLAVLGADERICALTRRLEHGEGLQFGAIHVRCLLTPGHTSGHMSYFLW +EDDCPDSPALFSGDALSVAGCGWHLEDTAQQMYQSLAKTLGTLPPETKVFCGHEHTLSNL +EFAQKVEPCNEHVQAKLSWAQERDDEDIPTVPSTLGEELMYNPFLRVTEDAVRAFTGQVA +PAQVLEALCRERARFQPAVEPPQPQVRALLALQWGLLSTHQKK +>sp_Q3B7M2_GLO2_BOVIN Hydroxyacylglutathione hydrolase, mitochondrial OS=Bos taurus GN=HAGH PE=2 SV=3 +MVLGRGLLGRWSVAELGAVCARLGLGPALLGSLHHLGLRKSLTVDQGTMKVELLPALTDN +YMYLLIDEDTKEAAIVDPVQPQKVVETARKHGVKLTTVLTTHHHWDHAGGNEKLVKLEPG +LKVYGGDDRIGALTHKVTHLSTLQVGSLHVKCLSTPCHTSGHICYFVTKPNSPEPPAVFT +GDTLFVAGCGKFYEGTADEMYKALLEVLGRLPADTRVYCGHEYTINNLKFARHVEPDNTA +VREKLAWAKEKYSIGEPTVPSTIAEEFTYNPFMRVREKTVQQHAGETEPVATMRAIRKEK +DQFKMPRD +>tr_Q35952_ Q35952_SCOUM Cytochrome b (Fragment) OS=Scopus umbretta PE=3 SV=1 +FGSLLGICLGTQILTGLLLAMHYTADTALAFSSVAHTCRNVQYGWLIRNLHANGASFFFI +CIYLHIGRGFYYGSYLNKETWNTGVILLLTLMATAFVGYVLPWGQMSFWGATVITNLFSA +IPYIGQTLVEWAWGGFSVDNPTLTRFFALHFLLPFIIAGLALIHLTFLHESGSNNPLGIV +SNCDKIPFHPYFSAEDVLGLMLMLLPLMTLAMFSPNLLGDPENFTPANPLVTPPHIKPEW +YFLFAYAILRSIPNKLGGVLALAASVLVLFLVPLLHKSKQRTMAFRPLSQLLFWALTANL +FILTWVGSQPVEHPFIIIGQLASLTYFTILLILFPI +>sp_Q16775_GLO2_HUMAN Hydroxyacylglutathione hydrolase, mitochondrial OS=Homo sapiens GN=HAGH PE=1 SV=2 +MVVGRGLLGRRSLAALGAACARRGLGPALLGVFCHTDLRKNLTVDEGTMKVEVLPALTDN +YMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESG +LKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFT +GDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAA +IREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREK +DQFKMPRD +>sp_Q4R6C1_GLO2_MACFA Hydroxyacylglutathione hydrolase, mitochondrial OS=Macaca fascicularis GN=HAGH PE=2 SV=2 +MVLGRGLLGRRSLAALGAACARRGLGPALLGVLHHTDLRKNLTVDEGTMKVEVLPALTDN +YMYLVIDDETKEAAIVDPVQPQKVLDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLQSG +LKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVSRPGGSEPPAVFT +GDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAA +IQEKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHARETDPVTTMRAVRKEK +DEFKMPRD +>sp_P0AC84_GLO2_ECOLI g2 Hydroxyacylglutathione hydrolase OS=Escherichia coli (strain K12) GN=gloB PE=1 SV=1 +MNLNSIPAFDDNYIWVLNDEAGRCLIVDPGDAEPVLNAIAANNWQPEAIFLTHHHHDHVG +GVKELVEKFPQIVVYGPQETQDKGTTQVVKDGETAFVLGHEFSVIATPGHTLGHICYFSK +PYLFCGDTLFSGGCGRLFEGTASQMYQSLKKLSALPDDTLVCCAHEYTLSNMKFALSILP +HDLSINDYYRKVKELRAKNQITLPVILKNERQINVFLRTEDIDLINVINEETLLQQPEER +FAWLRSKKDRF +>sp_O24496_GLO2C_ARATH g2 Hydroxyacylglutathione hydrolase cytoplasmic OS=Arabidopsis thaliana GN=GLX2-2 PE=1 SV=2 +MKIFHVPCLQDNYSYLIIDESTGDAAVVDPVDPEKVIASAEKHQAKIKFVLTTHHHWDHA +GGNEKIKQLVPDIKVYGGSLDKVKGCTDAVDNGDKLTLGQDINILALHTPCHTKGHISYY +VNGKEGENPAVFTGDTLFVAGCGKFFEGTAEQMYQSLCVTLAALPKPTQVYCGHEYTVKN +LEFALTVEPNNGKIQQKLAWARQQRQADLPTIPSTLEEELETNPFMRVDKPEIQEKLGCK +SPIDTMREVRNKKDQWRG +>sp_Q05584_GLO2_YEAST g2 Hydroxyacylglutathione hydrolase, cytoplasmic isozyme OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=GLO2 PE=1 SV=1 +MQVKSIKMRWESGGVNYCYLLSDSKNKKSWLIDPAEPPEVLPELTEDEKISVEAIVNTHH +HYDHADGNADILKYLKEKNPTSKVEVIGGSKDCPKVTIIPENLKKLHLGDLEITCIRTPC +HTRDSICYYVKDPTTDERCIFTGDTLFTAGCGRFFEGTGEEMDIALNNSILETVGRQNWS +KTRVYPGHEYTSDNVKFVRKIYPQVGENKALDELEQFCSKHEVTAGRFTLKDEVEFNPFM +RLEDPKVQKAAGDTNNSWDRAQIMDKLRAMKNRM +>1QH5_A_PDBID_CHAIN_SEQUENCE 1QH5 g2 +MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDD +RIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVL +GRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD +PVTTMRAVRREKDQFKMPRD +>sp_P28607_ARS_PSEVC g4 Arylsulfatase OS=Pseudoalteromonas carrageenovora GN=atsA PE=3 SV=1 +MQKISIIFNLFLSLGCLAFTFNGSASETKNEWITLGTMAGPIPNAKHSQPANAMLVNGNT +YVVDAGDGTAGQLAKVGLDIKNVDAVFLSHLHFDHTGGLPAILSLRWQTSARNELVVYGP +PGTQQTVDGIFEYMTYGTLGHYGVPGQVPAPANTNIKVVEVEDGTQLKLPDFTVDVIRNS +HYSWPKGSEEWKKFQALSFKFSLQDYTVVYTGDTGPSSAVEKLSSGVDLLVSEMMDIDHT +VNMIKETNPQMPKGKFIGIHKHLSKHHLSPKQVGELAKAANVGSLVITHMAPGLDTQAEI +DFYTKQVASEYKGPISVAQDLNRYELKR +>sp_Q10568_CPSF2_BOVIN g6 Cleavage and polyadenylation specificity factor subunit 2 OS=Bos taurus GN=CPSF2 PE=1 SV=1 +MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL +SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD +AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR +EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA +GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN +NPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTT +PGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSS +DESDAEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEII +KPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDY +EGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKDIKVYMPKLHETVDAT +SETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDS +EMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPR +LSDFKQVLLREGIQAEFVGGVLVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYA +IV +>sp_P30620_PSO2_YEAST g7 DNA cross-link repair protein PSO2/SNM1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PSO2 PE=1 SV=1 +MSRKSIVQIRRSEVKRKRSSTASSTSEGKTLHKNTHTSSKRQRTLTEFNIPTSSNLPVRS +SSYSFSRFSCSTSNKNTEPVIINDDDHNSICLEDTAKVEITIDTDEEELVSLHDNEVSAI +ENRTEDRIVTELEEQVNVKVSTEVIQCPICLENLSHLELYERETHCDTCIGSDPSNMGTP +KKNIRSFISNPSSPAKTKRDIATSKKPTRVKLVLPSFKIIKFNNGHEIVVDGFNYKASET +ISQYFLSHFHSDHYIGLKKSWNNPDENPIKKTLYCSKITAILVNLKFKIPMDEIQILPMN +KRFWITDTISVVTLDANHCPGAIIMLFQEFLANSYDKPIRQILHTGDFRSNAKMIETIQK +WLAETANETIDQVYLDTTYMTMGYNFPSQHSVCETVADFTLRLIKHGKNKTFGDSQRNLF +HFQRKKTLTTHRYRVLFLVGTYTIGKEKLAIKICEFLKTKLFVMPNSVKFSMMLTVLQNN +ENQNDMWDESLLTSNLHESSVHLVPIRVLKSQETIEAYLKSLKELETDYVKDIEDVVGFI +PTGWSHNFGLKYQKKNDDDENEMSGNTEYCLELMKNDRDNDDENGFEISSILRQYKKYNK +FQVFNVPYSEHSSFNDLVKFGCKLKCSEVIPTVNLNNLWKVRYMTNWFQCWENVRKTRAA +K +>sp_P51973_COMA_NEIGO g8 Competence protein ComA OS=Neisseria gonorrhoeae GN=comA PE=4 SV=1 +MLCVLAGAAYGVFRTEAALSSQWRAEAVSGVPLTVEVTDMPRSDGRRVQFAAKAVDSGGR +TFDLLLSDYKRREWAVGSRWRITARVHPVVGELNLRGLNREAWALSNGVGGVGTVGADRV +LLHGGSGWGIAVWRSRISRNWRQADADGGLSDGIGLMRALSVGEQSALRPGLWQAFRPLG +LTHLVSISGLHVTMVAVLFAWLAKRLLACSPRLPARPRAWVLAAGCAGALFYALLAGFSV +PTQRSVLMLAAFAWAWRRGRLSAWATWWQALAAVLLFDPLAVLGVGTWLSFGLVAALIWA +CAGRLYEGKRQTAVRGQWAASVLSLVLLGYLFASLPLVSPLVNAVSIPWFSWVLTPLALL +GSVVPFAPLQQAGAFLAEYTLRFLVWLADVSPEFAVAAAPLPLLVLAVCAALLLLLPRGL +GLRPWAVLLLAGFVSYRPEAVPENEAAVTVWDAGQGLSVLVRTANRHLLFDTGTVAAAQT +GIVPSLNAAGVRRLDKLVLSHHDSDHDGGFQAVGKIPNGGIYAGQPEFYEGARHCAEQRW +QWDGVDFEFLRPSERKNIDDNGKSCVLRVVAGGAALLVTGDLDTKGEESLVGKYGGNLYS +QVLVLGHHGSNTSSSGVFLNAVSPEYAVASSGYANAYKHPTEAVQNRVRAHGIKLLRTDL +SGALQFGLGRGGVKAQRLRVYKFYWQKKPFE +>sp_P39695_COMEC_BACSU g8 ComE operon protein 3 OS=Bacillus subtilis (strain 168) GN=comEC PE=4 SV=2 +MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFV +LYAVTDSQNVSSYRQGTYQFKAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKE +QLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHIHWNYSVTSIQNCSEPENFKY +KVLSLRKHIISFTNSLLPPDSTGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV +GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKW +RVRSATAICLSYIVLLLFNPYHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSL +IAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGAVAGVLLLSLSASFGRLFFSW +FDLLISWINRLITNIADVDVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG +ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWRE +KQHPFSLGEKVLIPFLTAKGIKQLDALILTHADQDHIGEAEILLKHHKVKRLVIPKGFVS +EPKDEKVLQAAREEGVAIEEVKRGDVLQIKDLQFHVLSPEAPDPASKNNSSLVLWMETGG +MSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN +NRYHHPHQKVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN +>sp_P16692_PHNP_ECOLI g10 Phosphoribosyl 1,2-cyclic phosphodiesterase OS=Escherichia coli (strain K12) GN=phnP PE=1 SV=1 +MSLTLTLTGTGGAQGVPAWGCECAACARARRSPQYRRQPCSGVVKFNDAITLIDAGLHDL +ADRWSPGSFQQFLLTHYHMDHVQGLFPLRWGVGDPIPVYGPPDEQGCDDLFKHPGLLDFS +HTVEPFVVFDLQGLQVTPLPLNHSKLTFGYLLETAHSRVAWLSDTAGLPEKTLKFLRNNQ +PQVMVMDCSHPPRADAPRNHCDLNTVLALNQVIRSPRVILTHISHQFDAWLMENALPSGF +EVGFDGMEIGVA +>sp_Q56686_CPDP_ALIFS g16 3',5'-cyclic-nucleotide phosphodiesterase OS=Aliivibrio fischeri GN=cpdP PE=3 SV=1 +MFKNKLAVLFTCLSVFSFSAQSGSFDTVTLGSKGGIQDGNLTAFLIKSEADSNFVMLDAG +SVVNGLIVSEQKGAFKDITVPDSSPYTKVGYLLKDRIKGYFISHAHLDHVAGLIISSPDD +SKKPIYGLAATNKDLMKNYFNWSAWPNFGNKGEGFKLNKYNYVDLQPGVWSPVAETTMSV +VSLPLSHSGGQSTVFILKDSEGDVFAYFGDTGPDEVEKSSAMRTAWSVLAPFVKQGKLKG +IIIEVSFTNETPDKSLFGHLTPNWLVKELSVLEDMNGKGSLKDLNVAISHIKYSLKNSED +PKVIIKKQLVEVNDLGVNFIFPEQGDSLQF +>sp_P22434_PDE1_YEAST g16 3',5'-cyclic-nucleotide phosphodiesterase 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PDE1 PE=1 SV=2 +MVVFEITILGANGGPTEYGTQCFILKPARTEDPELIAVDGGAGMYQLREMLVQGRNENEG +DDELVPSFYEHDREPIEFFIDSKLNIQKGLSKSLLQSLKRQGEHFESANTMKKTYEVFQG +ITDYYITHPHLDHISGLVVNSPSIYEQENSKKKTIWGLPHTIDVLQKHVFNDLIWPDLTA +ERSRKLKLKCLNPKEVQKCTIFPWDVIPFKVHHGIGVKTGAPVYSTFYIFRDRKSKDCII +VCGDVEQDRRESEESLLEEFWSYVAENIPLVHLKGILVECSCPLSSKPEQLYGHLSPIYL +INELSNLNTLYNSSKGLSGLNVIVTHVKSTPAKRDPRLTILEELRFLAEERNLGDLRISI +ALEGHTLFL +>gi_9802011_gb_AAF99588.1_AF215894_1 dros g4 juvenile hormone-inducible protein 1 [Drosophila melanogaster] +MISANSPLFQFIRSPRLQTLTIRMYLVKSAGSPIYRTLRTLTTSNLMAATIASAKDPLTGPRYEREPNVL +RKKLASVVPGTVNLQVLGSGANGAPAAVYLFTDQARYLFNCGEGTQRLAHEHKTRLSRLEQIFLTQNTWA +SCGGLPGLTLTIQDAGVRDIGLHGPPHLGSMLQSMRRFVVLKNLQVRPNDCSEGACFEDSILKVDSLPLI +NSEDPTKSVINYTCQLKPRAGALNLVKCVEQGVPPGPLLGQLKNGNDITLPDGKVVRSVDVTEASETALS +FAFLDVPSENYLPALLTHGKRLKKLGEEKLTEVALVVHFTSYHISSRQEYKDFVLENFSPEAQHIYLSSP +LNQFSGYAAAHRIQHQLHQLAPQVFPLLGEQLSCQSQTLSLNLKKTKLDEADSEDKANAKANETEEQGVV +AMTNNHLRPRKGLDRTLESKLTPEEYVKETHAVPGFLELLAKFKEEYSFPDNSADSYPKIIFLGTGSCIP +NKTRNVSSILIRTAIDAYVLLDCGEGTYGQIVRLYGHEKGQLILRQLQAIYVSHLHADHHIGLIGLLRER +RQLKPRADPLILLAPRQIKPWLEFYNRQIETVEDAYTLVGNGELLASPLSGEQVERLGITSISTCLVRHC +PNSFGISLTLAAKHNSEPVKITYSGDTMPCQDLIDLGRDSTVLIHEATMEDDLEEEARLKTHSTVSQAIQ +QGRNMNARHTILTHFSQRYAKCPRLPSDEDMQRVAIAFDNMEVTIEDLQHYHKLYPALFAMYAEYTEELE +QRAVKRELKQERKRKLAET +>gi_4416227_gb_AAD20271.1_ cyclase g5 [Streptomyces arenae] +MTDERRQDRTAPDDAWLEEVADGVFAYVQPDGGWCLNNAGLVVSDGRAALVDTAATETRARRLREAVRGV +TAAPPGVLVNTHFHGDHTFGNFVFPEALVVGHERTRSEALAAGLHLTGLWPDVRWGALELAPPALTYRDG +VTLHVGDVRVEVLHPGPAHTTDDSVVWLPEQRVLFTGDIVMPGVTPFCAMGSVSGSLAVLDRLRALGART +VVPGHGPVAGPEVFDATESYLRWVRATARRGLADRLTPMQVARACDLGEFAGLRDSERLLPNLRRAYAEE +QGAAPGAPLDIGELFAEMIEFHGRLPTCRA +>gi_75427924|sp|Q02057|Q02057_STRCO Dehydrase +MTVEVREVAEGVYAYEQAPGGWCVSNAGIVVGGDGALVVDTLSTIPRARRLAEWVDKLAAGPGRTVVNTH +FHGDHAFGNQVFAPGTRIIAHEDMRSAMVTTGLALTGLWPRVDWGEIELRPPNVTFRDRLTLHVGERQVE +LICVGPAHTDHDVVVWLPEERVLFAGDVVMSGVTPFALFGSVAGTLAALDRLAELEPEVVVGGHGPVAGP +EVIDANRDYLRWVQRLAADAVDRRLTPLQAARRADLGAFAGLLDAERLVANLHRAHEELLGGHVRDAMEI +FAELVAYNGGQLPTCLA +>tr_Q9WZW8_Q9WZW8_THEMA Beta_lactamase OS=Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099) GN=TM_0864 PE=1 SV=1 +MNIIGFSKALFSTWIYYSPERILFDAGEGVSTTLGSKVYAFKYVFLTHGHVDHIAGLWGV +VNIRNNGMGDREKPLDVFYPEGNRAVEEYTEFIKRANPDLRFSFNVHPLKEGERVFLRNA +GGFKRYVQPFRTKHVSSEVSFGYHIFEVRRKLKKEFQGLDSKEISRLVKEKGRDFVTEEY +HKKVLTISGDSLALDPEEIRGTELLIHECTFLDARDRRYKNHAAIDEVMESVKAAGVKKV +ILYHISTRYIRQLKSVIKKYREEMPDVEILYMDPRKVFEM +>sp_P54548_RNZ_BACSU Ribonuclease Z OS=Bacillus subtilis (strain 168) GN=rnz PE=1 SV=1 +MELLFLGTGAGIPAKARNVTSVALKLLEERRSVWLFDCGEATQHQILHTTIKPRKIEKIF +ITHMHGDHVYGLPGLLGSRSFQGGEDELTVYGPKGIKAFIETSLAVTKTHLTYPLAIQEI +EEGIVFEDDQFIVTAVSVIHGVEAFGYRVQEKDVPGSLKADVLKEMNIPPGPVYQKIKKG +ETVTLEDGRIINGNDFLEPPKKGRSVVFSGDTRVSDKLKELARDCDVLVHEATFAKEDRK +LAYDYYHSTTEQAAVTAKEARAKQLILTHISARYQGDASLELQKEAVDVFPNSVAAYDFL +EVNVPRG +>sp_P0A8V0_RBN_ECOLI Ribonuclease BN OS=Escherichia coli (strain K12) GN=rbn PE=1 SV=1 +MELIFLGTSAGVPTRTRNVTAILLNLQHPTQSGLWLFDCGEGTQHQLLHTAFNPGKLDKI +FISHLHGDHLFGLPGLLCSRSMSGIIQPLTIYGPQGIREFVETALRISGSWTDYPLEIVE +IGAGEILDDGLRKVTAYPLEHPLECYGYRIEEHDKPGALNAQALKAAGVPPGPLFQELKA +GKTITLEDGRQINGADYLAAPVPGKALAIFGDTGPCDAALDLAKGVDVMVHEATLDITME +AKANSRGHSSTRQAATLAREAGVGKLIITHVSSRYDDKGCQHLLRECRSIFPATELANDF +TVFNV