diff test-data/mbl_superfamily.fasta @ 0:6f743c615c41 draft

"planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/fargene commit 867e4a6fad4c2622ad69517e2d4d9ba185109b72"
author iuc
date Thu, 28 Nov 2019 14:39:41 -0500
parents
children
line wrap: on
line diff
--- /dev/null	Thu Jan 01 00:00:00 1970 +0000
+++ b/test-data/mbl_superfamily.fasta	Thu Nov 28 14:39:41 2019 -0500
@@ -0,0 +1,293 @@
+>tr_Q9WZL4_ Q9WZL4_THEMA Beta_lactamase OS=Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099) GN=TM_0755 PE=1 SV=1
+MPKIWTERIFDDPEIYVLRIDDDRIRYFEAVWEIPEGISYNAYLVKLNGANVLIDGWKGN
+YAKEFIDALSKIVDPKEITHIIVNHTEPDHSGSLPATLKTIGHDVEIIASNFGKRLLEGF
+YGIKDVTVVKDGEEREIGGKKFKFVMTPWLHWPDTMVTYLDGILFSCDVGGGYLLPEILD
+DSNESVVERYLPHVTKYIVTVIGHYKNYILEGAEKLSSLKIKALLPGHGLIWKKDPQRLL
+NHYVSVAKGDPKKGKVTVIYDSMYGFVENVMKKAIDSLKEKGFTPVVYKFSDEERPAISE
+ILKDIPDSEALIFGVSTYEAEIHPLMRFTLLEIIDKANYEKPVLVFGVHGWAPSAERTAG
+ELLKETKFRILSFTEIKGSNMDERKIEEAISLLKKELE
+>tr_A5IJ30_ A5IJ30_THEP1 Flavodoxin/nitric oxide synthase OS=Thermotoga petrophila (strain RKU_1 / ATCC BAA_488 / DSM 13995) GN=Tpet_0174 PE=4 SV=1
+MPKIWTERIFDDPEIYVLRIDDDRIRYFEAVWEIPEGISYNAYLVKLNGANVLIDGWKGN
+YAKEFIDALSKIVDPKEITHIIVNHTEPDHSGSLPATLKTIGHDVEIIASNFGKRLLEGF
+YGIKDVTVVKDGEEREIGGKKFKFVMTPWLHWPDTMVTYLDGILFGCDVGGGYLLPEILD
+DSNESVVERYLPHVTKYIVTVIGHYKNYILEGAEKLSSLEIKALLPGHGLIWKKDPQRLL
+NHYVSVAKGDPKKGKVTVIYDSMYGFVENVMKKAIDSLKEKGFTPVVYKFSDEERPAISE
+ILKDIPDSEALIFGVSTYEAEIHPLMRFTFLEIIDKANYEKPVLVFGVHGWAPSVERTAG
+ELLKETKFRILSFTEIKGSNMDEKKIEEAISLLKKELG
+>sp_Q9FDN7_FPRA_MOOTA Nitric oxide reductase OS=Moorella thermoacetica (strain ATCC 39073) GN=fprA PE=1 SV=1
+MSQPVAITDGIYWVGAVDWNIRYFHGPAFSTHRGTTYNAYLIVDDKTALVDTVYEPFKEE
+LIAKLKQIKDPVKLDYLVVNHTESDHAGAFPAIMELCPDAHVLCTQRAFDSLKAHYSHID
+FNYTIVKTGTSVSLGKRSLTFIEAPMLHWPDSMFTYVPEEALLLPNDAFGQHIATSVRFD
+DQVDAGLIMDEAAKYYANILMPFSNLITKKLDEIQKINLAIKTIAPSHGIIWRKDPGRII
+EAYARWAEGQGKAKAVIAYDTMWLSTEKMAHALMDGLVAGGCEVKLFKLSVSDRNDVIKE
+ILDARAVLVGSPTINNDILPVVSPLLDDLVGLRPKNKVGLAFGAYGWGGGAQKILEERLK
+AAKIELIAEPGPTVQWVPRGEDLQRCYELGRKIAARIAD
+>sp_Q50497_FPRA_METTM Type A flavoprotein FprA OS=Methanothermobacter marburgensis (strain DSM 2133 / 14651 / NBRC 100331 / OCM 82 / Marburg) GN=fprA PE=1 SV=1
+MKAAAKRISDGVYWTGVLDWDLRNYHGYTLQGTTYNAYLVCGDEGVALIDNSYPGTFDEL
+MARVEDALQQVGMERVDYIIQNHVEKDHSGVLVELHRRFPEAPIYCTEVAVKGLLKHYPS
+LREAEFMTVKTGDVLDLGGKTLTFLETPLLHWPDSMFTLLDEDGILFSNDAFGQHLCCPQ
+RLDREIPEYILMDAARKFYANLITPLSKLVLKKFDEVKELGLLERIQMIAPSHGQIWTDP
+MKIIEAYTGWATGMVDERVTVIYDTMHGSTRKMAHAIAEGAMSEGVDVRVYCLHEDDRSE
+IVKDILESGAIALGAPTIYDEPYPSVGDLLMYLRGLKFNRTLTRKALVFGSMGGNGGATG
+TMKELLAEAGFDVACEEEVYYVPTGDELDACFEAGRKLAAEIRR
+>tr_E3GY45_ E3GY45_METFV Flavodoxin/nitric oxide synthase OS=Methanothermus fervidus (strain ATCC 43054 / DSM 2088 / JCM 10308 / V24 S) GN=Mfer_0426 PE=4 SV=1
+MKAKAVKIKDGVYWVGVLDWDIRIYHGYTLKGTTYNAYLVFGEDKTCLIDNTYPGTETQL
+WARIKDALEKEKREKIDVIVQNHVERDHSGALPQIHKKFPEAPIYCTEIAVDGLKKHYPQ
+LKNADFIEVKTGDKLDLGNKTLAFVEAFLLHWPDSMFTLLVEDGILFPNDAFGQHLCYPQ
+RYDYEIPEYVLMDAAQKFYANLITPLSKRVLKKFKEIEDLGLLNKIKMIAPSHGQIWTDP
+MKIIGAYKDWAEGKCKNKITIIYDTMHYSTQKMAHAIAEGIISEGVDVRMYYLHEDERSE
+IVKDILDSKAVAFGTPTIYDKPYPTLGDIIYYLKGLRFDRTGFKKLAITFGSMGGEGGAP
+EIIANELKECGFEVIDEYEIFYIPDEKELEKCYEIGRKLAKKVKEM
+>sp_Q58158_FPRA_METJA Type A flavoprotein FprA OS=Methanocaldococcus jannaschii (strain ATCC 43067 / DSM 2661 / JAL_1 / JCM 10045 / NBRC 100440) GN=fprA PE=3 SV=1
+MKKYESRRSKIADGVYWVGVLDWDIRMYHGYTLKGTTYNAYLVFGDEKVALIDNTYPGTS
+AQMWGRIKDAFEKEGREFKIDVIVQNHVEKDHSGALPEIHKKFPDAPIYCTEVAVEGLKK
+HYPSLKDAQFKVVHTGDTVDLGGKTLTFLEAPLLHWPDSMFTFYNEGGILFSNDAFGQHL
+CFPAHKRFDKDIPEYVLMDANQKFYANLITPLSKLVLKKFEEVIQLGLLEKIKMIAPSHG
+QIWTDPMKVIKAYQDFATGKAAKDKAVIVYDTMHYSTQKMAHAFAEGLMSEGIDVVMYFL
+HYDERSEIVKDILDAKAVLFGIPTIYDEPYPSIGDIIYYLRGLKFNRTGFKRLAVTFGSM
+GGEGGAVAKIAEDLAKCGFEVINQYELYYVPTEDELTNCYNMGKELAKRIKEMKIE
+>tr_A6UPE2_ A6UPE2_METVS Flavodoxin/nitric oxide synthase OS=Methanococcus vannielii (strain SB / ATCC 35089 / DSM 1224) GN=Mevan_0457 PE=4 SV=1
+MKADAVKISDGVYWVGTYDWDIRSYHGYTLKGTTYNAYLVFGTEKVALIDNVYPGTSAQM
+WGRIKDAFEKEGRKYNIDVIVQNHVEKDHSGALVEITKKFPESNIYCTEVAVEGLKKHYT
+GLKDAPFKVVKSLESVDLGGKTLTFLEAPLLHWPDSMFTLYGEEGILFSNDAFGQHLCYT
+KRFDNEIPENVLMDANQKFYANLITPLSKLVLKKFEQVISLGLLENIKMIAPSHGQIWTD
+PMKVISAYQDFATGKCKNKATIVYDTMHYSTQKMAHAFAEGLLSEGIDVVIYNLHNDERS
+EIVKDILDSKAVLFGIPTINDQPYPSIGDLMYYLRGLRFDRTGLKKLAITFGSMGGKGGA
+AKLIGKDLKECGFEVLDDSYEVIYVPKEEELEKCYNAGKRLGIKLN
+>gi_687617_gb_AAB88013.1_ flavoprotein g3 [Methanothermobacter thermautotrophicus]
+MKAAAKRISDGVYWTGVLDWDLRNYHGYTLQGTTYNAYLVCGDEGVALIDNSYPGTFDELMARVEDALQQ
+VGMERVDYIIQNHVEKDHSGVLVELHRRFPEAPIYCTEVAVKGLLKHYPSLREAEFMTVKTGDVLDLGGK
+TLTFLETPLLHWPDSMFTLLDEDGILFSNDAFGQHLCCPQRLDREIPEYILMDAARKFYANLITPLSKLV
+LKKFDEVKELGLLERIQMIAPSHGQIWTDPMKIIEAYTGWATGMVDERVTVIYDTMHGSTRKMAHAIAEG
+AMSEGVDVRVYCLHEDDRSEIVKDILESGAIALGAPTIYDEPYPSVGDLLMYLRGLKFNRTLTRKALVFG
+SMGGNGGATGTMKELLAEAGFDVACEEEVYYVPTGDELDACFEAGRKLAAEIRR
+>sp_Q8ZRM2_GLO2_SALTY Hydroxyacylglutathione hydrolase OS=Salmonella typhimurium (strain LT2 / SGSC1412 / ATCC 700720) GN=gloB PE=1 SV=1
+MNLNSIPAFQDNYIWVLTNDEGRCVIVDPGEAAPVLKAIAEHKWMPEAIFLTHHHHDHVG
+GVKELLQHFPQMTVYGPAETQDKGATHLVGDGDTIRVLGEKFTLFATPGHTLGHVCYFSR
+PYLFCGDTLFSGGCGRLFEGTPSQMYQSLMKINSLPDDTLICCAHEYTLANIKFALSILP
+HDSFINEYYRKVKELRVKKQMTLPVILKNERKINLFLRTEDIDLINEINKETILQQPEAR
+FAWLRSKKDTF
+>sp_O24495_GLO2M_ARATH Hydroxyacylglutathione hydrolase 1, mitochondrial OS=Arabidopsis thaliana GN=GLX2_1 PE=2 SV=2
+MPVISKASSTTTNSSIPSCSRIGGQLCVWPGLRQLCLRKSLLYGVMWLLSMPLKTLRGAR
+KTLKITHFCSISNMPSSLKIELVPCSKDNYAYLLHDEDTGTVGVVDPSEAAPVIEALSRK
+NWNLTYILNTHHHDDHIGGNAELKERYGAKVIGSAVDKDRIPGIDILLKDSDKWMFAGHE
+VRILDTPGHTQGHISFYFPGSATIFTGDLIYSLSCGTLSEGTPEQMLSSLQKIVSLPDDT
+NIYCGRENTAGNLKFALSVEPKNETLQSYATRVAHLRSQGLPSIPTTVKVEKACNPFLRI
+SSKDIRKSLSIPDSATEAEALRRIQRARDRF
+>tr_B9SU05_ B9SU05_RICCO Hydroxyacylglutathione hydrolase, putative OS=Ricinus communis GN=RCOM_0454360 PE=3 SV=1
+MQMISKASCAMASIPCSRVRSGLCIRPGARQLCFRKGLLYGFMHLLSMPFKTLRGASRTL
+KVAQFCSVSNMSSSLQIELVPCLRDNYAYLLHDMDTGTVGVVDPSEAVPIIDALTKKNRN
+LTYILNTHHHHDHTGGNEELKARYGAKVIGPGTDRDRIPGIDIVLNDGDKWMFAGHEVLV
+METPGHTRGHISFYFPGSGSIFTGDTLFSLSCGKLFEGTPEQMHSSLGKIMSLPDDTNIY
+CGHEYTLSNSKFALSIEPNNEALRSYAAHVTHLRSKSLPTGGAAQEQNPCV
+>sp_Q9SID3_GLO2N_ARATH Hydroxyacylglutathione hydrolase 2, mitochondrial OS=Arabidopsis thaliana GN=At2g31350 PE=1 SV=1
+MQTISKASSATSFFRCSRKLSSQPCVRQLNIRKSLVCRVMKLVSSPLRTLRGAGKSIRVS
+KFCSVSNVSSLQIELVPCLKDNYAYILHDEDTGTVGVVDPSEAEPIIDSLKRSGRNLTYI
+LNTHHHYDHTGGNLELKDRYGAKVIGSAMDKDRIPGIDMALKDGDKWMFAGHEVHVMDTP
+GHTKGHISLYFPGSRAIFTGDTMFSLSCGKLFEGTPKQMLASLQKITSLPDDTSIYCGHE
+YTLSNSKFALSLEPNNEVLQSYAAHVAELRSKKLPTIPTTVKMEKACNPFLRSSNTDIRR
+ALRIPEAADEAEALGIIRKAKDDF
+>tr_D7LD21_ D7LD21_ARALL Glyoxalase 2_5 OS=Arabidopsis lyrata subsp. lyrata GN=GLX2_5 PE=3 SV=1
+MQTISKASSAISFFRCSRKLSSQPCVRQLNLRKGLVCRVMKLVSSPLRTLRGAGKSIRVS
+KFCSVSNVSSLQIELVPCLKDNYAYILHDEDTGTVGVVDPSEAEPVIDSLKRSGRNLTYI
+LNTHHHYDHTGGNLELKDRYGAKVIGSAMDKDRIPGIDIALKDGDKWMFAGHEVHVMDTP
+GHTKGHISLYFPGSRAIFTGDTLFSLSCGKLFEGTPKQMLASLQKIISLPDDTSIYCGHE
+YTLSNSKFALSLEPNNEILQSYAAHVAELRSKKLPTIPTTLKMEKACNPFLRSSNTDIRR
+ALRIPETADEAEALGIIRKAKDDF
+>sp_A1A7Q3_GLO2_ECOK1 Hydroxyacylglutathione hydrolase OS=Escherichia coli O1:K1 / APEC GN=gloB PE=3 SV=1
+MNLNSIPAFDDNYIWVLNDEAGRCLIVDPGDAEPVLNAISANNWQPEAIFLTHHHHDHVG
+GVKELVEKFPQIVVYGPQETQDKGTTQVVKDGETAFVLGHEFSVIATPGHTLGHICYFSK
+PYLFCGDTLFSGGCGRLFEGTPSQMYQSIKKLSALPDDTLVCCAHEYTLSNMKFALSILP
+HDLSINDYYRKVKELRAKNQITLPVILKNERQINVFLRTEDIDLINVINEETLLQQPEER
+FAWLRSKKDRF
+>sp_Q325T4_GLO2_SHIBS Hydroxyacylglutathione hydrolase OS=Shigella boydii serotype 4 (strain Sb227) GN=gloB PE=3 SV=1
+MNLNSIPAFDDNYIWVLNDEAGRCLIVDPGDAEPVLNAITANNWQPEAIFLTHHHHDHVG
+GVKELVEKFPQIVVYGPQETQDKGTTQVVKDGETAFVLGHEFSVITTPGHTLGHICYFSK
+PYLFCGDTLFSGGCGRLFEGTALQMYQSLKKLSALPDDTLVCCAHEYTLSNMKFALSIFP
+HDLSINDYYRKVKELRAKNQITLPVILKNERQINVFLRTEDIDLINGINEETLLQQPEER
+FAWLRSKKDRF
+>sp_Q9DB32_HAGHL_MOUSE Hydroxyacylglutathione hydrolase_like protein OS=Mus musculus GN=Haghl PE=2 SV=1
+MKVKVIPVLEDNYMYLIIEEHTREAVAIDVAVAERLLEIAGREGVSLTMVLSTHHHWDHT
+RGNAELAHILPGLAVLGADERICALTRRLEHGEGLQFGAIHVRCLLTPGHTSGHMSYFLW
+EDDCPDSPALFSGDALSVAGCGWHLEDTAQQMYQSLAKTLGTLPPETKVFCGHEHTLSNL
+EFAQKVEPCNEHVQAKLSWAQERDDEDIPTVPSTLGEELMYNPFLRVTEDAVRAFTGQVA
+PAQVLEALCRERARFQPAVEPPQPQVRALLALQWGLLSTHQKK
+>sp_Q3B7M2_GLO2_BOVIN Hydroxyacylglutathione hydrolase, mitochondrial OS=Bos taurus GN=HAGH PE=2 SV=3
+MVLGRGLLGRWSVAELGAVCARLGLGPALLGSLHHLGLRKSLTVDQGTMKVELLPALTDN
+YMYLLIDEDTKEAAIVDPVQPQKVVETARKHGVKLTTVLTTHHHWDHAGGNEKLVKLEPG
+LKVYGGDDRIGALTHKVTHLSTLQVGSLHVKCLSTPCHTSGHICYFVTKPNSPEPPAVFT
+GDTLFVAGCGKFYEGTADEMYKALLEVLGRLPADTRVYCGHEYTINNLKFARHVEPDNTA
+VREKLAWAKEKYSIGEPTVPSTIAEEFTYNPFMRVREKTVQQHAGETEPVATMRAIRKEK
+DQFKMPRD
+>tr_Q35952_ Q35952_SCOUM Cytochrome b (Fragment) OS=Scopus umbretta PE=3 SV=1
+FGSLLGICLGTQILTGLLLAMHYTADTALAFSSVAHTCRNVQYGWLIRNLHANGASFFFI
+CIYLHIGRGFYYGSYLNKETWNTGVILLLTLMATAFVGYVLPWGQMSFWGATVITNLFSA
+IPYIGQTLVEWAWGGFSVDNPTLTRFFALHFLLPFIIAGLALIHLTFLHESGSNNPLGIV
+SNCDKIPFHPYFSAEDVLGLMLMLLPLMTLAMFSPNLLGDPENFTPANPLVTPPHIKPEW
+YFLFAYAILRSIPNKLGGVLALAASVLVLFLVPLLHKSKQRTMAFRPLSQLLFWALTANL
+FILTWVGSQPVEHPFIIIGQLASLTYFTILLILFPI
+>sp_Q16775_GLO2_HUMAN Hydroxyacylglutathione hydrolase, mitochondrial OS=Homo sapiens GN=HAGH PE=1 SV=2
+MVVGRGLLGRRSLAALGAACARRGLGPALLGVFCHTDLRKNLTVDEGTMKVEVLPALTDN
+YMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESG
+LKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFT
+GDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAA
+IREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETDPVTTMRAVRREK
+DQFKMPRD
+>sp_Q4R6C1_GLO2_MACFA Hydroxyacylglutathione hydrolase, mitochondrial OS=Macaca fascicularis GN=HAGH PE=2 SV=2
+MVLGRGLLGRRSLAALGAACARRGLGPALLGVLHHTDLRKNLTVDEGTMKVEVLPALTDN
+YMYLVIDDETKEAAIVDPVQPQKVLDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLQSG
+LKVYGGDDRIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVSRPGGSEPPAVFT
+GDTLFVAGCGKFYEGTADEMCKALLEVLGRLPPDTRVYCGHEYTINNLKFARHVEPGNAA
+IQEKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHARETDPVTTMRAVRKEK
+DEFKMPRD
+>sp_P0AC84_GLO2_ECOLI g2 Hydroxyacylglutathione hydrolase OS=Escherichia coli (strain K12) GN=gloB PE=1 SV=1
+MNLNSIPAFDDNYIWVLNDEAGRCLIVDPGDAEPVLNAIAANNWQPEAIFLTHHHHDHVG
+GVKELVEKFPQIVVYGPQETQDKGTTQVVKDGETAFVLGHEFSVIATPGHTLGHICYFSK
+PYLFCGDTLFSGGCGRLFEGTASQMYQSLKKLSALPDDTLVCCAHEYTLSNMKFALSILP
+HDLSINDYYRKVKELRAKNQITLPVILKNERQINVFLRTEDIDLINVINEETLLQQPEER
+FAWLRSKKDRF
+>sp_O24496_GLO2C_ARATH g2 Hydroxyacylglutathione hydrolase cytoplasmic OS=Arabidopsis thaliana GN=GLX2-2 PE=1 SV=2
+MKIFHVPCLQDNYSYLIIDESTGDAAVVDPVDPEKVIASAEKHQAKIKFVLTTHHHWDHA
+GGNEKIKQLVPDIKVYGGSLDKVKGCTDAVDNGDKLTLGQDINILALHTPCHTKGHISYY
+VNGKEGENPAVFTGDTLFVAGCGKFFEGTAEQMYQSLCVTLAALPKPTQVYCGHEYTVKN
+LEFALTVEPNNGKIQQKLAWARQQRQADLPTIPSTLEEELETNPFMRVDKPEIQEKLGCK
+SPIDTMREVRNKKDQWRG
+>sp_Q05584_GLO2_YEAST g2 Hydroxyacylglutathione hydrolase, cytoplasmic isozyme OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=GLO2 PE=1 SV=1
+MQVKSIKMRWESGGVNYCYLLSDSKNKKSWLIDPAEPPEVLPELTEDEKISVEAIVNTHH
+HYDHADGNADILKYLKEKNPTSKVEVIGGSKDCPKVTIIPENLKKLHLGDLEITCIRTPC
+HTRDSICYYVKDPTTDERCIFTGDTLFTAGCGRFFEGTGEEMDIALNNSILETVGRQNWS
+KTRVYPGHEYTSDNVKFVRKIYPQVGENKALDELEQFCSKHEVTAGRFTLKDEVEFNPFM
+RLEDPKVQKAAGDTNNSWDRAQIMDKLRAMKNRM
+>1QH5_A_PDBID_CHAIN_SEQUENCE 1QH5 g2
+MKVEVLPALTDNYMYLVIDDETKEAAIVDPVQPQKVVDAARKHGVKLTTVLTTHHHWDHAGGNEKLVKLESGLKVYGGDD
+RIGALTHKITHLSTLQVGSLNVKCLATPCHTSGHICYFVSKPGGSEPPAVFTGDTLFVAGCGKFYEGTADEMCKALLEVL
+GRLPPDTRVYCGHEYTINNLKFARHVEPGNAAIREKLAWAKEKYSIGEPTVPSTLAEEFTYNPFMRVREKTVQQHAGETD
+PVTTMRAVRREKDQFKMPRD
+>sp_P28607_ARS_PSEVC g4 Arylsulfatase OS=Pseudoalteromonas carrageenovora GN=atsA PE=3 SV=1
+MQKISIIFNLFLSLGCLAFTFNGSASETKNEWITLGTMAGPIPNAKHSQPANAMLVNGNT
+YVVDAGDGTAGQLAKVGLDIKNVDAVFLSHLHFDHTGGLPAILSLRWQTSARNELVVYGP
+PGTQQTVDGIFEYMTYGTLGHYGVPGQVPAPANTNIKVVEVEDGTQLKLPDFTVDVIRNS
+HYSWPKGSEEWKKFQALSFKFSLQDYTVVYTGDTGPSSAVEKLSSGVDLLVSEMMDIDHT
+VNMIKETNPQMPKGKFIGIHKHLSKHHLSPKQVGELAKAANVGSLVITHMAPGLDTQAEI
+DFYTKQVASEYKGPISVAQDLNRYELKR
+>sp_Q10568_CPSF2_BOVIN g6 Cleavage and polyadenylation specificity factor subunit 2 OS=Bos taurus GN=CPSF2 PE=1 SV=1
+MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL
+SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD
+AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR
+EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA
+GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN
+NPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTT
+PGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQSKEADIDSS
+DESDAEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEII
+KPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDY
+EGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKDIKVYMPKLHETVDAT
+SETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDS
+EMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPR
+LSDFKQVLLREGIQAEFVGGVLVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYA
+IV
+>sp_P30620_PSO2_YEAST g7 DNA cross-link repair protein PSO2/SNM1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PSO2 PE=1 SV=1
+MSRKSIVQIRRSEVKRKRSSTASSTSEGKTLHKNTHTSSKRQRTLTEFNIPTSSNLPVRS
+SSYSFSRFSCSTSNKNTEPVIINDDDHNSICLEDTAKVEITIDTDEEELVSLHDNEVSAI
+ENRTEDRIVTELEEQVNVKVSTEVIQCPICLENLSHLELYERETHCDTCIGSDPSNMGTP
+KKNIRSFISNPSSPAKTKRDIATSKKPTRVKLVLPSFKIIKFNNGHEIVVDGFNYKASET
+ISQYFLSHFHSDHYIGLKKSWNNPDENPIKKTLYCSKITAILVNLKFKIPMDEIQILPMN
+KRFWITDTISVVTLDANHCPGAIIMLFQEFLANSYDKPIRQILHTGDFRSNAKMIETIQK
+WLAETANETIDQVYLDTTYMTMGYNFPSQHSVCETVADFTLRLIKHGKNKTFGDSQRNLF
+HFQRKKTLTTHRYRVLFLVGTYTIGKEKLAIKICEFLKTKLFVMPNSVKFSMMLTVLQNN
+ENQNDMWDESLLTSNLHESSVHLVPIRVLKSQETIEAYLKSLKELETDYVKDIEDVVGFI
+PTGWSHNFGLKYQKKNDDDENEMSGNTEYCLELMKNDRDNDDENGFEISSILRQYKKYNK
+FQVFNVPYSEHSSFNDLVKFGCKLKCSEVIPTVNLNNLWKVRYMTNWFQCWENVRKTRAA
+K
+>sp_P51973_COMA_NEIGO g8 Competence protein ComA OS=Neisseria gonorrhoeae GN=comA PE=4 SV=1
+MLCVLAGAAYGVFRTEAALSSQWRAEAVSGVPLTVEVTDMPRSDGRRVQFAAKAVDSGGR
+TFDLLLSDYKRREWAVGSRWRITARVHPVVGELNLRGLNREAWALSNGVGGVGTVGADRV
+LLHGGSGWGIAVWRSRISRNWRQADADGGLSDGIGLMRALSVGEQSALRPGLWQAFRPLG
+LTHLVSISGLHVTMVAVLFAWLAKRLLACSPRLPARPRAWVLAAGCAGALFYALLAGFSV
+PTQRSVLMLAAFAWAWRRGRLSAWATWWQALAAVLLFDPLAVLGVGTWLSFGLVAALIWA
+CAGRLYEGKRQTAVRGQWAASVLSLVLLGYLFASLPLVSPLVNAVSIPWFSWVLTPLALL
+GSVVPFAPLQQAGAFLAEYTLRFLVWLADVSPEFAVAAAPLPLLVLAVCAALLLLLPRGL
+GLRPWAVLLLAGFVSYRPEAVPENEAAVTVWDAGQGLSVLVRTANRHLLFDTGTVAAAQT
+GIVPSLNAAGVRRLDKLVLSHHDSDHDGGFQAVGKIPNGGIYAGQPEFYEGARHCAEQRW
+QWDGVDFEFLRPSERKNIDDNGKSCVLRVVAGGAALLVTGDLDTKGEESLVGKYGGNLYS
+QVLVLGHHGSNTSSSGVFLNAVSPEYAVASSGYANAYKHPTEAVQNRVRAHGIKLLRTDL
+SGALQFGLGRGGVKAQRLRVYKFYWQKKPFE
+>sp_P39695_COMEC_BACSU g8 ComE operon protein 3 OS=Bacillus subtilis (strain 168) GN=comEC PE=4 SV=2
+MRNSRLLLPMAAASATAGITAAAYFPAIFLFILFLLIILIKTRHAFLIIVCFFSFILFFV
+LYAVTDSQNVSSYRQGTYQFKAVIDTIPKIDGDRMSMMVETPDKEKWAAAYRIQSAGEKE
+QLLYIEPGMSCELTGTLEEPNHATVPGAFDYNEYLYRQHIHWNYSVTSIQNCSEPENFKY
+KVLSLRKHIISFTNSLLPPDSTGIVQALTVGDRFYVEDEVLTAYQKLGVVHLLAISGLHV
+GILTAGLFYIMIRLGITREKASILLLLFLPLYVMLTGAAPSVLRAALMSGVYLAGSLVKW
+RVRSATAICLSYIVLLLFNPYHLFEAGFQLSFAVSFSLILSSSIFQQVKTSLGQLTIVSL
+IAQLGSLPILLYHFHQFSIISVPMNMLMVPFYTFCILPGAVAGVLLLSLSASFGRLFFSW
+FDLLISWINRLITNIADVDVFTIMIAHPAPVLLFLFTVTIILLLMAIEKRSLSQLMVTGG
+ICCTVMFLLFIYPCLSSEGEVDMIDIGQGDSMFVGAPHQRGRVLIDTGGTLSYSSEPWRE
+KQHPFSLGEKVLIPFLTAKGIKQLDALILTHADQDHIGEAEILLKHHKVKRLVIPKGFVS
+EPKDEKVLQAAREEGVAIEEVKRGDVLQIKDLQFHVLSPEAPDPASKNNSSLVLWMETGG
+MSWILTGDLEKEGEQEVMNVFPNIKADVLKVGHHGSKGSTGEEFIQQLQPKTAIISAGKN
+NRYHHPHQKVLQLLQRHSIRVLRTDQNGTIQYRYKNRVGTFSVYPPYDTSDITETN
+>sp_P16692_PHNP_ECOLI g10 Phosphoribosyl 1,2-cyclic phosphodiesterase OS=Escherichia coli (strain K12) GN=phnP PE=1 SV=1
+MSLTLTLTGTGGAQGVPAWGCECAACARARRSPQYRRQPCSGVVKFNDAITLIDAGLHDL
+ADRWSPGSFQQFLLTHYHMDHVQGLFPLRWGVGDPIPVYGPPDEQGCDDLFKHPGLLDFS
+HTVEPFVVFDLQGLQVTPLPLNHSKLTFGYLLETAHSRVAWLSDTAGLPEKTLKFLRNNQ
+PQVMVMDCSHPPRADAPRNHCDLNTVLALNQVIRSPRVILTHISHQFDAWLMENALPSGF
+EVGFDGMEIGVA
+>sp_Q56686_CPDP_ALIFS g16 3',5'-cyclic-nucleotide phosphodiesterase OS=Aliivibrio fischeri GN=cpdP PE=3 SV=1
+MFKNKLAVLFTCLSVFSFSAQSGSFDTVTLGSKGGIQDGNLTAFLIKSEADSNFVMLDAG
+SVVNGLIVSEQKGAFKDITVPDSSPYTKVGYLLKDRIKGYFISHAHLDHVAGLIISSPDD
+SKKPIYGLAATNKDLMKNYFNWSAWPNFGNKGEGFKLNKYNYVDLQPGVWSPVAETTMSV
+VSLPLSHSGGQSTVFILKDSEGDVFAYFGDTGPDEVEKSSAMRTAWSVLAPFVKQGKLKG
+IIIEVSFTNETPDKSLFGHLTPNWLVKELSVLEDMNGKGSLKDLNVAISHIKYSLKNSED
+PKVIIKKQLVEVNDLGVNFIFPEQGDSLQF
+>sp_P22434_PDE1_YEAST g16 3',5'-cyclic-nucleotide phosphodiesterase 1 OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=PDE1 PE=1 SV=2
+MVVFEITILGANGGPTEYGTQCFILKPARTEDPELIAVDGGAGMYQLREMLVQGRNENEG
+DDELVPSFYEHDREPIEFFIDSKLNIQKGLSKSLLQSLKRQGEHFESANTMKKTYEVFQG
+ITDYYITHPHLDHISGLVVNSPSIYEQENSKKKTIWGLPHTIDVLQKHVFNDLIWPDLTA
+ERSRKLKLKCLNPKEVQKCTIFPWDVIPFKVHHGIGVKTGAPVYSTFYIFRDRKSKDCII
+VCGDVEQDRRESEESLLEEFWSYVAENIPLVHLKGILVECSCPLSSKPEQLYGHLSPIYL
+INELSNLNTLYNSSKGLSGLNVIVTHVKSTPAKRDPRLTILEELRFLAEERNLGDLRISI
+ALEGHTLFL
+>gi_9802011_gb_AAF99588.1_AF215894_1 dros g4 juvenile hormone-inducible protein 1 [Drosophila melanogaster]
+MISANSPLFQFIRSPRLQTLTIRMYLVKSAGSPIYRTLRTLTTSNLMAATIASAKDPLTGPRYEREPNVL
+RKKLASVVPGTVNLQVLGSGANGAPAAVYLFTDQARYLFNCGEGTQRLAHEHKTRLSRLEQIFLTQNTWA
+SCGGLPGLTLTIQDAGVRDIGLHGPPHLGSMLQSMRRFVVLKNLQVRPNDCSEGACFEDSILKVDSLPLI
+NSEDPTKSVINYTCQLKPRAGALNLVKCVEQGVPPGPLLGQLKNGNDITLPDGKVVRSVDVTEASETALS
+FAFLDVPSENYLPALLTHGKRLKKLGEEKLTEVALVVHFTSYHISSRQEYKDFVLENFSPEAQHIYLSSP
+LNQFSGYAAAHRIQHQLHQLAPQVFPLLGEQLSCQSQTLSLNLKKTKLDEADSEDKANAKANETEEQGVV
+AMTNNHLRPRKGLDRTLESKLTPEEYVKETHAVPGFLELLAKFKEEYSFPDNSADSYPKIIFLGTGSCIP
+NKTRNVSSILIRTAIDAYVLLDCGEGTYGQIVRLYGHEKGQLILRQLQAIYVSHLHADHHIGLIGLLRER
+RQLKPRADPLILLAPRQIKPWLEFYNRQIETVEDAYTLVGNGELLASPLSGEQVERLGITSISTCLVRHC
+PNSFGISLTLAAKHNSEPVKITYSGDTMPCQDLIDLGRDSTVLIHEATMEDDLEEEARLKTHSTVSQAIQ
+QGRNMNARHTILTHFSQRYAKCPRLPSDEDMQRVAIAFDNMEVTIEDLQHYHKLYPALFAMYAEYTEELE
+QRAVKRELKQERKRKLAET
+>gi_4416227_gb_AAD20271.1_ cyclase g5 [Streptomyces arenae]
+MTDERRQDRTAPDDAWLEEVADGVFAYVQPDGGWCLNNAGLVVSDGRAALVDTAATETRARRLREAVRGV
+TAAPPGVLVNTHFHGDHTFGNFVFPEALVVGHERTRSEALAAGLHLTGLWPDVRWGALELAPPALTYRDG
+VTLHVGDVRVEVLHPGPAHTTDDSVVWLPEQRVLFTGDIVMPGVTPFCAMGSVSGSLAVLDRLRALGART
+VVPGHGPVAGPEVFDATESYLRWVRATARRGLADRLTPMQVARACDLGEFAGLRDSERLLPNLRRAYAEE
+QGAAPGAPLDIGELFAEMIEFHGRLPTCRA
+>gi_75427924|sp|Q02057|Q02057_STRCO Dehydrase
+MTVEVREVAEGVYAYEQAPGGWCVSNAGIVVGGDGALVVDTLSTIPRARRLAEWVDKLAAGPGRTVVNTH
+FHGDHAFGNQVFAPGTRIIAHEDMRSAMVTTGLALTGLWPRVDWGEIELRPPNVTFRDRLTLHVGERQVE
+LICVGPAHTDHDVVVWLPEERVLFAGDVVMSGVTPFALFGSVAGTLAALDRLAELEPEVVVGGHGPVAGP
+EVIDANRDYLRWVQRLAADAVDRRLTPLQAARRADLGAFAGLLDAERLVANLHRAHEELLGGHVRDAMEI
+FAELVAYNGGQLPTCLA
+>tr_Q9WZW8_Q9WZW8_THEMA Beta_lactamase OS=Thermotoga maritima (strain ATCC 43589 / MSB8 / DSM 3109 / JCM 10099) GN=TM_0864 PE=1 SV=1
+MNIIGFSKALFSTWIYYSPERILFDAGEGVSTTLGSKVYAFKYVFLTHGHVDHIAGLWGV
+VNIRNNGMGDREKPLDVFYPEGNRAVEEYTEFIKRANPDLRFSFNVHPLKEGERVFLRNA
+GGFKRYVQPFRTKHVSSEVSFGYHIFEVRRKLKKEFQGLDSKEISRLVKEKGRDFVTEEY
+HKKVLTISGDSLALDPEEIRGTELLIHECTFLDARDRRYKNHAAIDEVMESVKAAGVKKV
+ILYHISTRYIRQLKSVIKKYREEMPDVEILYMDPRKVFEM
+>sp_P54548_RNZ_BACSU Ribonuclease Z OS=Bacillus subtilis (strain 168) GN=rnz PE=1 SV=1
+MELLFLGTGAGIPAKARNVTSVALKLLEERRSVWLFDCGEATQHQILHTTIKPRKIEKIF
+ITHMHGDHVYGLPGLLGSRSFQGGEDELTVYGPKGIKAFIETSLAVTKTHLTYPLAIQEI
+EEGIVFEDDQFIVTAVSVIHGVEAFGYRVQEKDVPGSLKADVLKEMNIPPGPVYQKIKKG
+ETVTLEDGRIINGNDFLEPPKKGRSVVFSGDTRVSDKLKELARDCDVLVHEATFAKEDRK
+LAYDYYHSTTEQAAVTAKEARAKQLILTHISARYQGDASLELQKEAVDVFPNSVAAYDFL
+EVNVPRG
+>sp_P0A8V0_RBN_ECOLI Ribonuclease BN OS=Escherichia coli (strain K12) GN=rbn PE=1 SV=1
+MELIFLGTSAGVPTRTRNVTAILLNLQHPTQSGLWLFDCGEGTQHQLLHTAFNPGKLDKI
+FISHLHGDHLFGLPGLLCSRSMSGIIQPLTIYGPQGIREFVETALRISGSWTDYPLEIVE
+IGAGEILDDGLRKVTAYPLEHPLECYGYRIEEHDKPGALNAQALKAAGVPPGPLFQELKA
+GKTITLEDGRQINGADYLAAPVPGKALAIFGDTGPCDAALDLAKGVDVMVHEATLDITME
+AKANSRGHSSTRQAATLAREAGVGKLIITHVSSRYDDKGCQHLLRECRSIFPATELANDF
+TVFNV