view test-data/multimer_output/msas/B/bfd_uniclust_hits.a3m @ 9:3bd420ec162d draft
planemo upload for repository https://github.com/usegalaxy-au/tools-au commit 7726c3cba165bdc8fc6366ec0ce6596e55657468
author | galaxy-australia |
---|---|
date | Tue, 13 Sep 2022 22:04:12 +0000 |
parents | |
children | |
>chain_B MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH >ERR1719244_1811598 MVQWSDDETKAIQMIWNSVDVNELGPAALRRCLLVYPWTQRYFGKFGDIATPTAIMQNPGVAQHGITVMNGLKLAGGPGGGPGNQPGGQQELWQRGKQQGQQQLWQQGQHGGKQRGqqQRQGQq-PSPRQSX------------------ >tr|W5MMD7|W5MMD7_LEPOC Uncharacterized protein OS=Lepisosteus oculatus OX=7918 PE=3 SV=1 MVTLTAEDKNNIRHVWGMVYKDPEGngAVVVIRLFTDHPETKQYFKRFKNLDTLEQMQTNPRIKLHGKRVMNTLNQVIDNLDDWAavkEILTALAERHRDVHKIHIHNFKLLFDVIIKVYGEALGPAFTDAACESWSKVFQLLYSFLQSVYT >tr|G3WE01|G3WE01_SARHA Hemoglobin subunit mu OS=Sarcophilus harrisii OX=9305 GN=HBM PE=3 SV=1 --MFSAEEQSHIVQIWNYLsgHEAIFGTELLQRLFTVYPSTKSYFPPL-IPG-----LELTQMQNHGEQILMAVGVAVDNMYDLRTALSGLADLHAYGLRVEPTNFHFLIHCFQVMLASHLQSEYTAEMHAAWDKFLTNVAVVLTEKYH >tr|W5PMJ4|W5PMJ4_SHEEP Uncharacterized protein OS=Ovis aries OX=9940 PE=3 SV=1 --SLTRAERTIVVSMWSKIstQADVIGTETLERRVTCVSRGPA-P----GSP------QS-------rgRREAGRKGRNDLEtggqgegAGRTGQRLL-RSRLRACTLSF---PPQFLSHCLLVTLASHFPADFTADAHAAWDKFLSLVSGVLTEKYR >tr|A0A1K0GGD5|A0A1K0GGD5_RAT Globin d1 OS=Rattus norvegicus GN=Glnd1 PE=3 SV=1 ----------------------MYGLEKEp-R------------ETEGCLS---RKLPSNLQRSSAPWRLHGFQNLLERSQGA--------QRAKPG------------HGAHSHSSVKMAL--SQTDH------------------rlvL >ERR1719474_978995 --------------------------------LLQSSWKQ--FRT----------------------------------------FASLSGIRQEELGAGCQHQDLP----------QIQHHLWISEPSTFQQLLtftrsiktftnhylnirclflqmflslrgCVNKDSASRKKH >ERR1719336_830457 ----------------------------------------------------------------------------------SINPQSTVDLGAQYISATPLNYKNHQDIYNSLLSNG------VLVPANVSLIEGMRQDRIDEGEE >tr|F6XB67|F6XB67_XENTR Uncharacterized protein OS=Xenopus tropicalis PE=3 SV=1 -MILSEAEKAAILSLWAKAsgNVNALGAEALERILYIWQNLFSYLESP-VI---L-----KILQTGKGASVYKIR-GLDHLSTKHSILPLL-TVKKCLCLRDAGFKILLSHAIEVTLAVHFPDDFDATAQAAWDKFLAAISTALTSQYR >tr|A0A1L8EXG7|A0A1L8EXG7_XENLA Uncharacterized protein OS=Xenopus laevis GN=XELAEV_18045093mg PE=3 SV=1 
-MSLSQAEKTLILAFWNKASglINTIGPQIVNRLLLAYPQLKTHFGNF-NVTPGS-----SDLNTLGIKIITAVGGATQHMDDLPVHLAILTDLHSLTLRIDPGNYKLMIDCIVISMAASLPQDFTAEVQNAMTNFLIIIGDILASKFC >SRR5260364_139532 ------------T----VLapDPnPTPHSASPRRMFLSFPTTKTYFPHF-DLSHGS-----AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKVSGGPGAIWVEGRDGAFLAGQRITRvAGGVAQAAAAGLGPRPH >tr|A0A096M318|A0A096M318_POEFO Uncharacterized protein OS=Poecilia formosa OX=48698 PE=3 SV=1 ------HDELIITGVFFTSVSECVPP-----VRNIYRQTTNSIENIGNFKNGETFLTNPPVALYVVNMVEFTSKPLMS-LPLNGFYGILDFLK--AKRKNPNGGKLLADCLTIVIASKMGSGFTPEIQATFQKFLAVVVSALGKQYH >tr|A0A146TSR5|A0A146TSR5_FUNHE Hemoglobin cathodic subunit beta (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 IFHFIYFYLSTIHYIFSKIYSFFFFPSSLSIFLIFYPFTHIYFFIFFNLYNSSSITSNPNFSSHFNFFLSFLYKSFNNIYYINTTYKYLIFLHSYKLQFYPYNFNLLSYFLTIFLSFHIFSSFTP---------------------- >tr|A0A146Z291|A0A146Z291_FUNHE Hemoglobin subunit epsilon (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 IFYFSYHYLIIITSIFSNLYYNYFFPNSLIIFLIFYPFTHIYFSNFFNLYNSYSINTNPNIQSHFTNFLHFLYLSFNNIYNINFTYSYFIFLHSYNLHFYPYNFNLLSYFFTIFISSNIFSVIKE---------------------- >tr|H3B4U9|H3B4U9_LATCH Cytoglobin OS=Latimeria chalumnae OX=7897 GN=CYGB PE=3 SV=1 --QLSDTEVESIRQIWSNVytNCENVGVLVLIRFFVNFPSAKQYFSQFRHLEDPLDMERSVQLRKHARRVMGAINTVVENVEDQDKiasVLAPVGKAHALKHKVEPVYFKILSGVILEILAEEYAQHFTPEVQKAWTKLMSIICCHVTATY- >tr|L8HVQ9|L8HVQ9_9CETA Cytoglobin OS=Bos mutus OX=72004 GN=M91_06698 PE=3 SV=1 --ELSEAERKAVQATWARLyaNCEDVGVAILVRNRFWRkKRASSTLEEFQegaqgrdsslGSSQAQKQPGCPQLRKHACRVMGALNTVVENLHDPEKvssVLSLVGKAHALKHKVEPVYFKILSGVILEVIAEEFANDFPPETQRAWAKLRGLIYSHVTAAY- >ERR1711977_7585 -MSLSAKDKTLVKKLWEKAEgkSADIGAEALGRMLVAYPQTKTYFSQWGSDLNPQ----HPQVKKHGAVIMGGVGKAVKNIDDLVRGMGALSELHAFKLRVDPANFKILAHNIIWSWPCTSLQTSPPRPTCPLTSSCRTWLWLCPRDT- >tr|A0A1C4HCU8|A0A1C4HCU8_PROAN Myoglobin (Fragment) OS=Protopterus annectens OX=7888 GN=Mb3 PE=2 SV=1 --MASAAQWDTTLKFWEAhVagDLKKHGHEALVRLFLKNKDSQKHFPKFKDLASEAEMRGSDGLKNHGETVFTALGKALQQRDGIANELRPLAVTHSQNHKIPLEEFENICEVIDVYLAEICPD-YAGETRTSVKAVLDVFSQSMTTLY- 
>tr|A0A146P967|A0A146P967_FUNHE Hemoglobin subunit alpha OS=Fundulus heteroclitus PE=3 SV=1 ---LSKKEKKLIKDIWERLTpvAEDIGSEALLRMFTSYPGTKTYFSHL-DISPGS-----AHLNSHGKKIVLAIAGGAKDISQLTVTLAPLQTLHAYQLRIDPTNFKSCFHTVCLSRWpvTWAKSSL----RLHTQQWTSTCQPLQPCSL- >tr|A0A146QLZ2|A0A146QLZ2_FUNHE Hemoglobin subunit alpha-2 (Fragment) OS=Fundulus heteroclitus OX=8078 PE=4 SV=1 NIILTSNYNYTFNTFFSKFssNSYSIFSYSLSIILFFYPHTNTYFSHFNYLIPFS-----SPFNNHLstfiflfsxxxXXVMGGVEDDVEKIENMKEGIIRISEMNELNMRVEKEKLKIMEKKIIVV--------------------------------- >tr|A0A024R1G3|A0A024R1G3_HUMAN Myoglobin OS=Homo sapiens GN=MB PE=3 SV=1 AMGLSDGEWQLVLNVWGKVeaDIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNY- >tr|M3YM80|M3YM80_MUSPF Myoglobin OS=Mustela putorius furo GN=MB PE=3 SV=1 -MGLSDGEWQLVLNVWGKVeaDLAGHGQAVLISLCQGLESRKEEKKRDPAHACVSSRRslfVSQDLLFHSDAFLVSLGHRSflaPVSGENGQSQKTQPAHHAQHHRQPWNTEKFISDAIIQVLQSKHAGDFGAEAQAAMKKALELFRNDIAAKY- >tr|A0A1Z5LBJ2|A0A1Z5LBJ2_ORNMO Uncharacterized protein (Fragment) OS=Ornithodoros moubata OX=6938 PE=3 SV=1 --ALSAAERALLRALWKKLgcNVGVYATEALERTLEAFPRTKIYFSHM-DLSP-----GSAQVRAHGQSPRPQGGRRADPRRRPPGRPArrpVRSERpARAHAARGPPPLRAAGPLSAGDPRPALPWRLRPRH-------------------- >tr|S4RW14|S4RW14_PETMA Uncharacterized protein OS=Petromyzon marinus PE=3 SV=1 --ALSGAEKAAIADSWKAVysNYEEAGKAILIKFFTSNPGVQDFFPKFKGLDSADQLSKSAAVRWHAERIINAVNDAVVALDDpekLSLKLKALSKKHAQEFNVDPQYFKVLAVNIVEGVSSA-NGGLGAEAQAAWEKFLSQVSILLKSQY- >tr|Q9Y0D5|Q9Y0D5_MYXGL Hemoglobin OS=Myxine glutinosa GN=Hb PE=2 SV=1 --RTTEGERAAVRASWAVLmkDYEHAGVQILDKFFKANPAAKPFFTKMKDLHTLEDLASSADARWHVERIIQAVNFAVINIEDrekLSNKFVKLSQDHIEEFHVtDPQYFMILSQTILDEVEKR-NGGLSGEGKSGWHKVMTIICKMLKSKY- >tr|A0A1W0WKD0|A0A1W0WKD0_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_10224 PE=3 SV=1 --GLTSNHIKAVRANWKLIekRLPEYGLELFVAYLNKHPDWIGLLPFLKPADMPR-LQQTPRLKAHGTIVLKKLGELLTMLDSppkLIGELLKQGSTHR-ARGLAPENFQAIQHDLNELFVKICGPE---FDIEGWDAVLTLIMTGIEEGL- >tr|K4FYM0|K4FYM0_CALMI Hemoglobin 
subunit alpha OS=Callorhinchus milii OX=7868 PE=2 SV=1 ---LSKTDKALLSSSVGKIQAQATGSDVLARMFASFPQTKVYFVGFSDYTA-----KGPRVQKHGLTVMTKIIEGIQYLDSLRSFLDALSAKHAHELMVDPVNFGFLGECVLSSLAYQLPD-FSPEMHCAWDKYLCEFAYLLAEKYR >tr|H9GUN8|H9GUN8_ANOCA Uncharacterized protein OS=Anolis carolinensis GN=LOC103282340 PE=3 SV=1 --KMTDLDRRHIREIWTAAfeNPEENGRLVIIRFFSDYPASKQYFK---TVPTDGDLKAHPQVAFHGRRIMVAFSQVIENMENWNQACVlleRLVNNHKNIHQVPSGMFQLLFQAMLCTFDDLLGRTFTPEKRVSWEKFFQVIQEEVEAAYD >tr|H2YFM6|H2YFM6_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1 --SLTTEEVITLRTTWAEiskLGNATVGLAVLHRLFNDCPEVRPFFGSMlppSELSDMDSLKSNPKVVDHASRVALSINNIIQLLEntdELVSYLSFLGKVHG-ERSIPAKHFSDMGPVLLAVISAVLREDLEGVVMQTWAKAYGAIEAGI----- >UPI000197D711 status=active ---LTPKDIYEAKQCWNKAAslgVNKVGVLLFKNIFTIAPEAAKAF-SFGNDP---NFMNNKEMEEHGVKVVMAFDHAVRSLDNIHalqETADGLRDTHSFF-NLSPEHHVIVKEALLQTLKQGLGDEFTDAQRELWNGIYTAIRNMW----- >KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold119418_1 # 1 # 498 # 1 # ID=119418_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510 ---ISPLKLRLVQSSWRQASaDEQAGITAFKFFFEMEPVAIGMF-GLQDIR---DLYNSYELKRIAAKIVKAMTHIVNSFDNFEglrPLIKKLGMMHGEK-GVSPSQYNNFGKAFMQTVEEILGDQFTPETRRAWETFFRILTGAL----- >tr|A0A146PHJ5|A0A146PHJ5_FUNHE Hemoglobin cathodic subunit beta OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 -----------------------------ASWFCGFHWTQRYFPHIWRPLPPPAIAAKFPKGAAWKTVMGGLEIAVKNIGQHKAAYAKLSVMHSEKLHVDPTTSGFLLNASQWVWLPSLPPRLHPWFPGGWQKFR------------ >tr|A0A1E7FQE1|A0A1E7FQE1_9STRA Neuroglobin OS=Fragilariopsis cylindrus CCMP1102 OX=635003 GN=Ngb1 PE=3 SV=1 --------MALVVESWAKIKEIENyeevaGELLFRRIFEIKPDAAAYFKFTDGFETTDeALYKQEVFIKHVKMVILTVTSAVDLLEkeNMdelFRMLKLLGAKH-LSagLKLEKEHYNLVGMALLDTLGKALGDTFTEAVKSAWIGVYAIIASKM----- >tr|A0A150AR53|A0A150AR53_9BACT Uncharacterized protein OS=Flammeovirga sp. 
SJP92 OX=1775430 GN=AVL50_01545 PE=4 SV=1 ---VSNKQIELVQNSFTLITphRGQVSELFFSKLFKIDSSLESSLMV--DPK------------DQERRLIPMLSAVVNGLVDfelIIPILQDFGRTHV-EYNIQEKHYEAVQKALFYALQTVLQEKWTSEVDDAWSNIFSVLTNIMKE--- >tr|A0A1Q9P386|A0A1Q9P386_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=hmp PE=4 SV=1 ---FSNNDIRVIDELWDLILpiKETITDSFYATLFSLDRTIKPMFKT--DLG------------VQGLRLTDTLTFIIKHMGNiedTIQIVKELGVKHL-EYGTKPYHYDLVLEALLETFDKHLEEKFNSEMRLCWIKLYKFLSELMML--- >tr|A0A1G1B2A9|A0A1G1B2A9_9PROT Uncharacterized protein OS=Methylotenera sp. RIFCSPLOWO2_02_FULL_45_14 OX=1801615 GN=A3I83_03315 PE=3 SV=1 ---MTPMQIDVVQSTWQKVMpfREDIACLFYKRLFEIEPELSMVFKG--DMH------------DCVKKIMFMIDLAILNLGQleeVMPMLQEIGNKYV-QCGMKVDS-NAVRNTLVSTLEQRLGETFTVNVRSDWIQAYDLLVGVMKD--- >sp|Q7SID0|GLBF1_EPTBU Globin-F1 OS=Eptatretus burgeri OX=7764 PE=1 SV=1 --TLTDGDKKAINKIWPKIykEYEQYSLNILLRFLKCFPQAQASFPKFSTKK--SNLEQDPEVKHQAVVIFNKVNEIINSMDNqeeIIKSLKDLSQKHKTVFKVDSIWFKELSSIFVSTIDGG----------AEFEKLFSIICILLRSAY- >tr|K1QF07|K1QF07_CRAGI Neuroglobin OS=Crassostrea gigas GN=CGI_10026082 PE=3 SV=1 --TISEDEKRLVKDSWNLFVsrgdFSDTGSHMYKVLLQDNPHLKTLFSFMKVNGa----PFDSPMFKSHVRNVFTVIGDAVNHIDDLDSLspiLKDLGVKHQ-GYGAKKEYLEPVGNALLCTIEKHLEDDFTQEVHSAWRTFFAVMSYSFA---- >tr|Q3MQ26|Q3MQ26_SPISO Nerve hemoglobin OS=Spisula solidissima OX=6584 GN=nHb PE=2 SV=1 --KLTKAEKDAVANSWAALKQdwKTIGADFFVKLFETYPNIKAYFKSFDNMDMSE-IKQSPKLRAHSINFCHGLNsfiQSLDEPDVLVILVQKLTVNHFRR-KIAVDRFQEAFALYVSYAQD---HAKfDDFTAAAWTKTLKVVADVI----- >SRR3989338_1269240 --DFNDEEIDIIKDTWDAVLYPey---PEEGfnPVLNFSTKFYRRVFehencknlfeE--V------------DMTSQGEKLVKILSVLLVAVQTkslnqdHIHVLRKMGERHRG-YGVSDDMYEIIGGCLLRTLSEVCADVWDDDAKVVWAKLFGVVSEQM----- >tr|A0A2G8K001|A0A2G8K001_STIJA Globin D, coelomic OS=Stichopus japonicus GN=BSL78_21829 PE=4 SV=1 TAQLSEVEKNLIRSSWEQAlkNKKVFGVNVFIKLFIQNPSSQDLFEQLRGIPLE-DLKTHRKMKAHALRVMASLNTLVEQIDEVEiltEMFNNVARTHV-IHKVEKAHYDLLGQVLMEVFSEELGAKFDSATKGAWLKAYVIMENIILDKY- >ERR1712150_314552 
MTALTEERKLHIKSSWSSVndDvdLAGNGVEFLVKLFTDFPEYMTFFPAFDGKTPE-EIRSSPKAKMHGKVLMTTLDKIVANLDDLEtviASLHRVVGSHF-PRGVTASHFKATLECFGSFLAVQLGDAFNNDVKNAWGVAVQILASVMEAEY- >tr|A0A132AHZ9|A0A132AHZ9_SARSC Cytoglobin-1-like protein OS=Sarcoptes scabiei GN=QR98_0086180 PE=3 SV=1 -MSLTNRDKEIIVSTWSLIrkDSDQAGIHLFKRFFEANPDYVKYFP-FGDLdDLE-KILVDPRLKWHASRVMAALSTIVDNLDDPVcfeDSLQKVLSSHL-NRKIQLYHFENLKKALVCLFMDKLGpDIMNDETIEAWSKAYDVILDTYRSRL- >sp|Q8T7J9|GLB_YOLEI Globin OS=Yoldia eightsii PE=1 SV=1 -MSFSAAQVDTVRSNWCSMtaDIDAAGYRIFELLFQRNPDYQSKFKAFKGLAVS-ALKGNPNAEKHIRIVLGGLGRILGALNTPEldVIYKEMASNHK-PRGVMKQQFKDMGQAIVTALSEIQSKSGGSFDRATWEALFESVANGIGQYQ- >sp|P0C227|GLB_NERAL Globin OS=Nerita albicilla PE=1 SV=1 LKSLSADQKAAIKSSWAAFaaDITGNGSNVLVQFFKDYPGDQSYFKKFDGKKPD-ELKGDAQLATHASQVFGSLNNMIDSMDDPDkmvGLLCKNASDHI-PRGVRQQQYKELFSTLMNYMQSLPGANVAGDTKAAWDKALNAMANIIDAEQ- >tr|A0A1B6EVA8|A0A1B6EVA8_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.22480 PE=3 SV=1 LEVITERDKYLAREVWMQVETNyvLISKSLFTNWITEFPEHLNFFKGLLD-SSYDDFLTSPKFEQHMANsVLPNVGIMISNLDRptdFRRHILKLAWIHIRKNiALKIDHFNILKGLILRTLKESLGRGIGRDHEVAMFKVITAGFNLFS---- >ERR1719240_1900674 -----------------AVArvlVHGL-ANLHRRALERLDLLLELVDAHRVVVL-RLLHRLdgrldrlHVLRRHLVLVLE------EG---------LLGAVHR-RVGLILH----------LHLRLAIGVRRGE---------------------- >tr|A0A224XVH8|A0A224XVH8_9HEMI Putative hemoglobin-like flavoprotein (Fragment) OS=Panstrongylus lignarius PE=3 SV=1 DIGVCNEDVAGIKETWQTVYNDkEnSGIFLFQVMFEMYPDYEKYFVRFRT-EGQKSLFDNPKFINHVKnRVMDALNDVIVNLENDErlvNILETVGENHK-KRNLRKQEFDNIGKVVIETLRRALGTSFTPKLEEAWTKVINCAMETIGK--- >tr|A0A1B6KZX4|A0A1B6KZX4_9HEMI Uncharacterized protein (Fragment) OS=Graphocephala atropunctata GN=g.7772 PE=3 SV=1 YFHLSLEDKRLAREAWYnNVEGNyViVAKAVFKELFRRAPQAYNFFKHLVD-VNERDMFESPRFKRHMVqRLMVALETIFYNVYWNDvfeNHMYDQGRKHK-KRGVQPAHVKLLLCVIV----------------------------------- >tr|R7TS60|R7TS60_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_200756 PE=3 SV=1 
-TFLTDEEVEILKASWNDLNddsdLSSIGKRVFLQAFEMRPEMKKIFP-FDNCWGD-KLLQHPKFQAHAQSFMVIIENSVEQVDNESSDFsdslTLLGQSHSDRIGFTRENVQVFLKAILAVWHDLLKS-SDDRTEKIWSKFLAHVVQIMRNGY- >tr|A0A0X3PJM2|A0A0X3PJM2_SCHSO Globin OS=Schistocephalus solidus OX=70667 GN=GLB PE=3 SV=1 --QLTEVQKTQLCVEWKQICKNKedkyaLGTEVFRLLFTKYPHYIRLFKRFRDLPNLDSIMQSAAFKAHAMRFIGAIDAIMENLDDescLVELLKRLAEEHRPR-GITENDFYKTLDVAYDALSPALKsDDARVALRQLFDTALSVIRQSL----- >sp|P02214|GLB_BUSCA Globin OS=Busycotypus canaliculatus OX=57622 PE=1 SV=1 --GLDGAQKTALKESWKVLGADGptmmkNGSLLFGLLFKTYPDTKKHFKHFDDA-TFAAMDTTGVGKAHGVAVFSGLGSMICSIDDddcVBGLAKKLSRNHLAR-GVSAADFKLLEAVFKZFLDEATQRKATDAQKDADGALLTMLIKAH----- >ERR1719239_1832466 --GLSEKDLVLIRGSWGMLgdlkTRKAHGVELFIQLFRAYPYMCeEYFPWFNDMSDEE-LRTSRKMKAHAHNVMNNIGSYVEVCDDPESlvaLIGKMAETHIP-RNVKALQFKELGDMFLPYLVSMMGAAATTDVQEAWRRLLAALVAVVSQ--- >tr|A0A1I8JIG1|A0A1I8JIG1_9PLAT Uncharacterized protein OS=Macrostomum lignano GN=BOX15_Mlig002954g1 PE=3 SV=1 --MLNEVEKKIILSGWQQAikDKKALGMDVFMTLFEMFPQHQELFRDFKGKSRAE-LEKMPKMRAHALRVVNTLDGAIQSLDDMEVcasSLELIGASHKS-HHLSAKHFEDLNAALAVVFERRLGKA-FVDNKAVWVKLLQGIIPVIQR--- >tr|A7RZB2|A7RZB2_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g204383 PE=3 SV=1 -IPLDAKETQLVRKTWAILGDRqvEVGKSLFLRFFEEHPTSKDLFPEFRNISNEK-IAESPALYGHARRVMKSVDNAVASIENVQVysaYLYELGTRHQ-TRQLSEEQLKFMGGAFLFAMRLHLRKEWSRATSKAWEKIFSFMADAMMR--- >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1887876_1 # 1 # 366 # -1 # ID=1887876_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459 -LPVSDENKDILRESWKRLEEEktTLCKNVFIRLLQLNPNLQDTFPSFKGVALDE-LMNSRSLFLHSKRLMEALEIAISSLDDGQDfteYLTHLGERHT-AISITENHFKIMEKALIFALKDMLGESCTEDVANAWREFFQSMAGTMLA--- >ERR1719401_2606804 ----------------------------------------------------------QKYQAQGSRSQ---GG---ELS-RRrcvPPAQSRRA----RAGLAGDghqahclWHPPGERSEIRGSLRCCGEGSDPKLEMAWTKVFVVVSTTM----- >ERR550519_2895140 ---LSKAERKEAENAWRIFevNLVDNGVDAFLNLVRDHPNRKDAFPWVKPELSEEALRNDPEMKKLAKLVFSAVKPAFKSLGDlqsLTNYYLNIGNELS-LMNIPPVMVSYLSDAFKKTCQKLLGSDYTHSLEASIEYVYDFITSRMFE--- 
>ERR1719402_597456 ---------------------------ALIA-------LISS----------------------AAGSGCLCDARARPFSM-------LS--AI-KLIRVVSAFRATAKALLPAFEEELGTKYTDDFRYALTTLINFMADNMEK--- >ERR1719423_342041 ----TGRQRVAVQASWRLVapDAKRHGIAIFIRLFKKHPETQLVFKSFKGQQ-PESLADNKRLAAHATTVMASVATLVDNLDDidtLLELLHKVAENHKRR-GLPIQYSTIWWRRWG----QHWTAAASRGGATSSepstrssplstsgskDNSFRNVCKMCEGISR >tr|Q53I62|Q53I62_9ANNE Intracellular haemoglobin (Fragment) OS=Alvinella pompejana GN=hb-i PE=2 SV=1 ------------ADNIAAVrgDVSTHAMNIFVEYFKKFPQHQNAFADYKGKD-PESLKSLPKFKTHTTKVVSKLLDIVEKASDsgaLQSNCTTLAKMPQHK-GLNQQQFADLGAVLVPYLQKALGGACDSA---AWeqayn---------------- >SRR6516164_9760095 -IVTTPQQVQLVKQSFAKTTpiAEQAAGLFYGRLFETAPQLRPLFK--GDI------------KTQGRKLMSTIALAVGSLQKlpeLVPIVQDLGRRYV-GYGVKDDQLRYRRRRAAVDARQGaRGRLHTRCEGRVDLGLYDPrrYDEERRSAA- >SRR5690348_1420512 -----------------------------RHRAESAPAVSGRS------------------HSAKKEADGDDLHDDRRTERfqkAGPGSQEPRRAPC-RLWCDCGGLSIVGEALLWTLEQGLAAEFKPEVRSAWIKLYDMIATTMQAGA- >SRR5258706_3013648 -XMLSEKEITLGRNTWDLIapvT-QEMGIQFYEHLFETSPELKPLFKT--NP------------KDQAMKLMFMLSYFVHRLDKendLRAEIKKLAQRQS-GYGAKPEHYKLIRDTLLCSMQNDLRKPWNKETESSCQ--------------- >SRR3712207_8213275 -RLMREYRLAVIFFFFSSR--RRHTRYWRDWSSDVCSSDLSLFK--GDI------------TEQGRKLMQMIGVAVRSLDRleqVMPAVQALGARHV-GYGRSEERRVGKEGRSRWGPDHX----------------------------- >SRR4029077_8512364 --CVTPQQIDLVQASWKQVVpvSETAAQMFYGRLFFLDPSLRRLVL--RGK------------RGGGERGGAVVLG-RQGEEGeegEGSALIHRDRAQA-AGGP-PPRGPAPGAAA----------------------------RHVRRS-- >SRR5437868_6476409 -----MDEILLLKTSLQKMGpqLEHAAGTFAVRLFQLNPSL-------GEI------------ATRGRELLQMMGAAVQNLGRldqLAPSARQFGRHYA-NCHIREQDYDAVGEAFLWSLGRGLGRDFTEEMEAAWGKVYWLMTEIIRAG-- >SRR5689334_13356078 ------------QVSFTQVApiAETATQLFYARLFELDPDLELLFK--GNL------------SEQGASLCKCSHLRSTVLTGwsnFCQSCNRLAHDTS-AMGFETKTTTQWDRRFCGRYGKGWV------------RPSHLRLSX------ >SRR5437870_6238790 -FDVTPIQVDLIRASWAKVEpiQELAASLFYDRLDRKSTRLNSSHVA-ISY------------AV---------FCLKKKKKKkek---------------YTHEHINNNKV---------------------------------------- 
>tr|A0A136P213|A0A136P213_9CHLR Globin OS=Chloroflexi bacterium OLB13 GN=UZ13_01312 PE=3 SV=1 -ESLTEHDKKLVQRSFTHIApqNEDIAAVFYARLFELDPDIEHLFS--TGL------------DVQRAKLMRMMADLVNALDApeaLSQSMRELGKQHV-SYGVHDKHYATVGEALIWALRKVCPAVMTPTVTQAWEKTYALFAELAIS--- >tr|A0A0C3QP41|A0A0C3QP41_9GAMM Uncharacterized protein OS=Shewanella sp. cp20 GN=DB48_17865 PE=3 SV=1 -MPLTDEQKRLIQKSYAEIDrqNSNFAAIFYDCLFAMAPLIRPMFKS--ER------------PVFEYHFNELISTAATKVFEfeeIKPRLVVLGQKHR-GYGVTPAQFDVVRSALMLSIQDCLRDTCNPAIEQAWSCYYDEIAKVMIAA-- >SRR5262245_10239308 -GPENARPGNL-RHHYadrgrcsGSLLpeAvqaRSVAGRHVSRRHERAAEE--AAAD-ADG------------RRQGARSA----RSGRGGRRgsrPAPRAIRRDRQAL-RHGRHGS---P------LGARGGTRARFTPSVKKAWATVYGLLATTMKNA-- >SRR3981081_1073077 -VVATPSPSRRRISDFG-------------RLKML-NSGKPEFGAgeGSSC------------CSGRSHLLVAILRHVAGIA------------------------------------------------------------------- >SaaInlV_135m_DNA_2_1039731.scaffolds.fasta_scaffold157242_1 # 1 # 360 # 1 # ID=157242_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.458 --LLSPATRELVRSSFPMVEriAPRAGTMFYGRLFATAPEVLPQFR--RDLS------------QPNFQPaaehrfMQLVLFVrstaeHAGLPGsagHDETVGKLAQRHV-GYTTRAPHYAPLGRALLWTLDECLGADFTPAMRAAWSDTYDVLVASMVAPL- >tr|A0A0P1GRZ8|A0A0P1GRZ8_9RHOB Soluble cytochrome O OS=Thalassobius mediterraneus GN=vhb PE=3 SV=1 MNLLSKDEVALIQGAYRALGpsKGFLTNSFYRRLFAIAPQARPLFP--QDM------------DEQLKKLEHMLDLLVDNLHQpmfFMGKLKRLAKRHV-GYGAQPEHYALVGEALIFALNDITPGGLPDKERALWVEIYTAISNTMIET-- >APLak6261659701_1056019.scaffolds.fasta_scaffold514158_1 # 3 # 230 # 1 # ID=514158_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561 -IELNAKNKALVKEGWKLLIEtqFPnevggneralarFFDEFYRKFFEVNPSGKRLFEE-GGM------------AVQSKALVKMMSMVVTSLENpsnLDLTIERLGGRHE-LYGVSRSDYLAFTNAMCETLETVLGDKCNQEMKESWSLVLNNLSEKMLT--- >SRR3954466_1768845 -SCHDSGTGDARS---ADIRpgradRRQGGGDFLRSVVRGRPHGQAVVP--GRH------------SRAAPQTHRHAGGRGPRLSDLpsiLPAASALAKRHV-DYGARPEHYPVVGAALLWTLERGLGPQWTSEAASAWTAAYATLSSFMIA--- >SRR6185295_9741709 
----------------------------LTTWVKHLRRSIMVCG--DDM------------MDRRKRFTQVVSATVRGLARvdmLLPAVREFGMRHP-LPGEIEQHHANVASALLWMLEKALRKDFTPEVKAAWIKAYGMLSQTIRQS-- >tr|D7G782|D7G782_ECTSI Globin OS=Ectocarpus siliculosus OX=2880 GN=Esi_0008_0247 PE=3 SV=1 --VDVEGYKAEIRRTFALVEpiSVQAAGIFYPTLWEVDTSTKPLFKD-TDM------------DKQGEKLMKTLGVAVAMLNKmdtLKPILENLGRKHV-DYGVTPEMYPSVGKALLITFEKGLGEECTPLTTKAWTWVFGIISSICIAAA- >SRR5215207_7597532 -QTMTRDQIRLVQASFRNVLpiRELAAALFYDRLFEIDPGTRGLFVD-TDL------------RSQGGKLMAAIGMVVHALDApesMVEKLKELARRHV-NYRQLQESSPPDFHRLhrfgsgrgsqRHVVSKGPGVAPVGQ----HVVPTHFASRvsrRLRAC-- >SRR5262249_41212017 -NVMTPEQKRLVRDTWKQVApiADAAADMFYRRLFEIDPTTRELFHA-TDM------------VAQRKKLLQMLAFAISGLDNlgaLVSKVEDLGRRTP-AVALPTRTTIPWAPRCCGPWNRVSVTRGHP----RWRRHGPRstnccpascatlprapsscktcgplrrgrplerqgICCVFRKR-- >ERR1700730_6579985 --RQRLADDGVILRVLQRGLgiELEMEALAREEIGELDPDAarfRPHHA--VGG------------GEVGGRHIELLRRHVDQRPpcHaaaNGSARISLPRGHV-SYGAKPRHYPVVGAALLWTLEKGLGDGWTPEVADAWLTAYSTLSGYMIS--- >tr|A0A0N0UYC0|A0A0N0UYC0_9BACT Uncharacterized protein OS=bacterium 336/3 OX=1664068 GN=AD998_10010 PE=3 SV=1 ------EQKEIIKSSFPRVLihTLKNSTIVYEKLFMDIPEAKDLFKN-TS------------IDKQGQMLVAAIGKIVKGLDNpdiFEKDLVELATRHV-GYGLKPEYFTHFGNALINMFEVSLVDSWDKDLHDAWVAVYQEVAEIMKSVI- >SRR5918994_1539718 -------QQELIRESWQRFEpkIKRASPQFYERLFALDPAVRRLFSG-VNM------------AEQERKLMAMLKEIVPELDRptdLVAAVGRRSPFTP-HpepSGWLDPRYAWMRSRTPLP---CSGEX------------------------- >tagenome__1003787_1003787.scaffolds.fasta_scaffold20949172_5 # 2657 # 2851 # 1 # ID=20949172_5;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.626 -------DETALLKGFDLAAdvLDEVIDNFYTELLESYPDLQPLFAH-TNT------------QQQRQKLQDVIYLLIENIHNqdvLESALLSLGERHI-RYGALPEHYPVVAEILESNLKKRLGRSWTKAVSTAWIQLLSAAADVMCRPY- >ERR1700753_815890 --XMKSSTMELLSSSFARVcaDKNNAAGIFYARLFTTAPELRAAFQS--DF------------DSVQWKLMSSLVQIVEFYRVgvdPTSYLADLGRSRQ-GYAAQRAQFDAVGDAILFTLAQVLGQGFGADIRAAWVSAYAA---------- >tr|A0A1H2YYM1|A0A1H2YYM1_9RHOB Hemoglobin-like flavoprotein OS=Albimonas donghaensis 
OX=356660 GN=SAMN05444336_103306 PE=3 SV=1 AMPLDSTNLARMREMLHILRrdAPDASTDFYQALFERAPELRTLFRD-SDL------------AGQGRKFMAMLGLLVDACEDygrLGNEIRELGRGHA-AYGVEARFFPPMEEALIDTMRSNLGERFTPELEADWRKLYAIVANEMMSP-- >tr|A0A1T2B631|A0A1T2B631_9RHOB Uncharacterized protein OS=Thioclava sp. DLFJ4-1 OX=1915313 GN=BMI85_03370 PE=4 SV=1 EPLLPAERAARVKASAARLDfeDPSLFRDAFARLFAVHPELDQVLPN--SE------------GGQQLKYAAMMEVILSTLDPpeeQELELPGLGQMHV-LFGAEPDYYVWLSEAVIAGLAAKLGDHWTSELAADWAELFSKVSAQMIAG-- >tr|A0A2E1AIS1|A0A2E1AIS1_9CHLR Uncharacterized protein OS=Anaerolineaceae bacterium OX=2024896 GN=CL607_22355 PE=3 SV=1 MSPVTSRQKLLL--HYTLLHldADQMGKLFYDHILAAMPEVAPMFTD---L------------ESQRKHFMKMMIRIVHTIDEpdhLNIVLRELGHIHK-RLHLKPRHFSKMGVAFSNSLAEVMGDRYTPEIGEAWRILYNRVAEAMQSP-- >SRR5262245_62462516 --------IFIFLLFFFFCLcf-CFMFFFFFSSRRRHTRCLSDWSS--DVC------------SSDLQKLLAALALVVRSLHTpekILGPVKKLAVKHV-DYGVRPEHYTYVGNALLRTLKKGFGREFTPELSDAWVEAFRMLAKVMKEA-- >tr|A0A2D6AZC8|A0A2D6AZC8_9BACT Uncharacterized protein OS=Flammeovirgaceae bacterium GN=CMB80_28915 PE=4 SV=1 SNTMTSESINMISKSWDLLSRdPQLVTRFYNRLFDIAPETRRYFK--DDI------------SKQSEKLAHTLNFLVMNLDRldeIKESIEDLGRHHN-KMKIKAEYYVYVKEALLTTIQETLDEQCESGMVEAWDHALSHVASTMINA-- >SRR5262245_55554356 --CVTPEHRLLAQQAFATIQplADELGLLFYSRLFELDGALRGLFKH--DL------------ANQAHSLMAMLQLTIEGLDApeqFTRARTTWGYATWTmGFSRTSTRLLRRPCSGRSSMRX------------------------------ >SRR6516165_4200192 ------AQ--------------------------------------SDL------------VDRGRA------YRLLGLADLvdrrnQAaagGLSLFHRRAV----------------------SAGGVAWADRVLDALSlylcgyelrwpQLDHALGRgavhpdacaSLLRE-- >ERR1700733_1486793 --------------SQAHGGdiVDLyRDVRLVYRLFRRLPPAEQDAIP-GDH------------RRGRLSRaAGRVAL---------APVRRAARRQ---------DRRREG-DVLELRRDGRGDDRRHVFHRDQElswlSDDV--PR-VVRD-- >SRR5215831_4136876 --KHDPPTDLARAEQLQVRCA----DRVKGRRSLLRPSLRDRSRGP-AA--------------LPRKIIRAEGKVdgdANEDRQqssSAQchFASCTPTRRaaQ-GLRCLDGSLWGSGCCLLWTLEQGLGSAFTPEVKAAWSEAYRTLAGAMQEG-- >tr|W5NBV0|W5NBV0_LEPOC Uncharacterized protein OS=Lepisosteus oculatus PE=3 SV=1 
-VPLTESQKDLIRESWKVVhqDIARLGIIMFIRLFETHPECKDVFFIFREIDDLQELKMSKELQAHGLRVMSFIEKSVARLAQedkLEQIALELGKCHC-RYNAPPKYYEYVGVQFISAVKPILKDSWSPQVEQAWESLFAYLAAVMKRGYH >ERR1711911_21978 ATGLTARQKRIIAKNWDLVRpnLKEAGVGLFIAYLTKHPEMQARFKSFATVP-LNELAANRKLQAHAANIMYSMTMLVDSLNDvecLVQHLATIGRNHR-RRHLKRHHFQDLAVVIVDFLEAALAAHWSAEARQSWTLALNVIVDQICNVL- >SRR5215218_21909 -CAMNPEQIGLLAESWKGVAgrRDEIARAFYGVLFDRHPELRSMFAH-TDM------------RAQYEKFALMIDEIVQLRTEprqFVRSAVLLGQRHA-AYGVTRDHYGPAGAALIEALAEALGSAFTPAAREAWTEGYLLMSSIMCR--- >SRR5688500_19518083 -LLITPAP--------------------PSAIHTRYLHDALPIAH-VDM------------GAQYEKFAAMVDEIVGLRTEphrFVRSAVLLGQRHA-RYGVTRDHYAPAGAALIEVLDRKSTRLNSSHLVVSYA----VSCSIQ----- >SRR5258706_7695680 --RHDPPPdpadPPVLRPA----RvqGRETRHLDVQAPVPARPRPTPAVQ------------------------------------------------------------------------------------------------------- >SRR4026207_1847514 -PLMTSNQRQLVRQSFDAVRdqAGPFSLLFYGKLFELDPSARRMFHV--DL------------ALQGRKIVDTLATVTESLDRfesIRPRLASLGRQHA-GYGVRPEQYDTITAALLWAIGQALGADFDAPTREAWKLALNAVSTATIEGA- >SRR5260221_10622870 --IVNAAQQELVMTKAEGVvlMPGVTGVLLCALLISANPSFRPLFKS--DM------------RIQGVKLMTMLAMVVYNLPEpgqVLPAIRDRSEEHT-SELQSHSDFVCR--LLLLHX-------------------------------- >SRR6516225_5669596 -NVMTPEQKRLAScfrrggppGSWRRPSppLGIETAQVFRIPCVLPN--AAVHTA-GVS------------DHNNSDTYRAALRPAH---R-AASQTASVRNHE-RIQSETAM--REGL--rrvTYARVLRTGS-hRTPYrnVTP------------------ >SRR5215203_7560530 -RPMTPDQVSLVRDARRAIesRHAEFSAAFHDALHELDVDTCALFRD-TVT------------GGRACNVGAMLDLLQQASDDpraLIEVAAELGRAHA-HAGVRDVHHHVAGVALHRALHRVLGVEFTPAMYEAWAEAFTLLIAVMERAA- >SRR5215470_20101711 -KSMTPQQIALVQCSFKSVApiASKAADLFYDPALRDrsrgaaALPH--------RFV------------G----AEGQADGDASNGHQ--------------QSPSARCHFANRAATLRPA-Q------------------------------- >SRR5919197_1191720 --VLTRDQADIVQLTWRAVLpvGDTFAELFYGRLFALDPQLRRLFR--ENL------------VEQGRNLTAMLSVAAANLARpekISVALRQLGRRPT-RSSRARCSRSLLRDLLRLPLDARRA--VADGVARVVVafaRAVVAIP-RVIHG-- >SRR5690606_39578087 
--------------------------------------ADHLSP--LPlP------------TRRSSDLLRMLAFIVKSLDWadrqwredvnpdedLMLVVLALGRRHTELYKIPDESYGAVAEALLWTLDYGLGRSEEHTSELQ--S-------REN---- >SRR3954469_10060132 -QRMTPEHIHTVQSSWNKVLpaGNGKARLLFERLLQTETSLCGLFQ--LDG------------ATWSANLVQMIDVLVTGLSLgdrSAVLTRRVGGRNT-ACPGIEHHYDLIGTALLRTLAKRLRAEFTPRVEAAWAIVYEELVESMRKA-- >SRR6266508_6374850 NFAMTKEQIALVKNSWKLFrkvDACLIGDVFYSKLFFDNPQLRQLFP--ASM------------EERYRKMIDMLSVIISRLDRlneMTKDIKVMALRHE-SHGVKPRHCKLLGNALRWTMERGLGNDWNDDVKEAGLACYTKLIETMIQ--- >SRR5215475_4417451 --PMTPLQRRLLHQSFSRIEpfSQRLGDVFYARFFSTSPAMRALFSR--DI------------KVQQSKFMKVISEIIKLPLlsfsvtdsqdSesLVPGAYWSGMLHG-ALSVKQQDFASMKAALLWALSNCP---------------------------- >tr|V4A5G6|V4A5G6_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233247 PE=3 SV=1 -ADLTEKDKELVKSSWAKFNegdVIADGAHIYYKLFEKAPEAKEKFGFAKD---GEVSLENKQFKAHVRKVLDVFESVVREIDQlegLLPVLNDLGARHK-SYGVPLKYYEILGSCIMYAWDRKLKM--DADTKKAWGKLYGVVQTEMKKG-- >SRR5262249_25899110 --MMNTQHIARIRLSFAWIApsADVFGELFVANLRALDPSLSGLLA--AEA------------GPQGWQLISILRSIIGGRDRpdrLFWRLQSFGRRLA-GDGLCAEDYDTIGDALMLTLEQCLGERLTPDVAAAWDATYAALAEVVQL--- >ERR1719223_727152 ---PSSAQVDAVTASWDKVAalgAETVGVLLFKRIFEIAPALESELS-EKPTA---IIIGDLTLAREMT----EEEKETIDLEEkeePeeveekeEPEEVDEQETTE-GRIISTESF------------------------------------------- >ERR1719336_2939639 --PLDERDIDLVQQTLGRVAilgLDNVGWVLFMNTFKIAPAAQGLFE-AGFLQlkplnkpfnDMPELAKSSNMKETGGRVVETLAAAVGLLRDlgtLVPILQDLGKKGV-SCGVIPAHYDIFGEALITSLQLALGANFTDPVKNAYLKVYTIVKNTMIG--- >tr|A0A1D8RRN7|A0A1D8RRN7_9GAMM Uncharacterized protein OS=Colwellia sp. 
PAMC 20917 GN=A3Q34_02175 PE=4 SV=1 ---MTAKQINLVQQSWQKVLilSPDVGDLFYQQLFVLRPELATLLKN--DK------------QdKirANKDFICLLSQEINLLQPielTEEKV---NTSVT-TNDV-KNYQADVENALLLALTMILDKELKIALKRAWISTIKRLVGSIVIEL- >ERR1700730_15638689 --AMTPKQVALVQDSFAKVAltSEAAAVLFYNRLFDIAPQMKAMFP--DDM------------VEQRRKLMSMLAGVVKGLANLeqvFAGRQRTGKAAC-QLRCEGG--ALSGGRRRVAVDAGEGsGGWLDAGSGGcVGHRlWHAVRLHDFPS-- >ERR1712166_353516 -VVAQFAALNAVDDKW-----VTQGVLLFKHMFRINPGMKQMFS-FRDIP-DDELYDSMKLKKHGVSVYTYIEKAVDGWGTpeIADALQKLGARHL-PREVKMEHFDVVGESILTSLSDVFGDQFDDKSREIWTRVYGVIV-------- >tr|A0A1S2XZ06|A0A1S2XZ06_CICAR leghemoglobin-like OS=Cicer arietinum GN=LOC101502441 PE=3 SV=1 MDALTEKQEALVNSSWEAFkkNIPHLSIVFYSSILEKAPESKDMFSFLKNF--DGIPHQNSTLEAHAEKIFDMTRDAAIQLRAkgkIdlaNDvTLEYLASVHV-QKGVTEQHFVVLKEAMLKTIKKAMDDKWSEELSCAWSIPYDQLAATIKKAM- >OlaalgELextract3_1021956.scaffolds.fasta_scaffold1056695_1 # 380 # 499 # -1 # ID=1056695_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.392 -MALTATDVEVIQTTFKeVAEnvgAEKAGIILFKNVFDAAPGAAKLFS-FGRVEgfdPAADHSTNPAVVKHATGVITTVAKAVASLTDlsaVLPMLTALGKRHS-KYGVKKEHFGIVGAAFLKTLSTALGDKYTKEVEAAYTKLWGVVSKTFREAG- >SRR5271157_4306781 -----VSDVEFLKETWGQItDKSSFAERFYSLLLAVFPVAKPLFSK-TDW------------QSQYSLLMASIDYMVMGIKygrNIQPTLHLLGARHD-YYGVAPVFYIPFNACLLITLQK------------------------------ >SRR6266566_5437046 --DLTPENCDFMTEHHDL--------RILGRLVATE---------------------------------------------------------Q-EQPVKDPDHDQIeeatrhrprscPTLFIWPNRRSQPLhrvlmRYMPvpgpRSPPSWCGPPSRSRSHGPRttT-- >SRR5579859_7196529 -GARDD--T-----------gsGQaCSAEFLQGR--------------T-HR------------RSGGDpVLRSPVRNCAAGQSDVsrrHDRTAEKADRHA-CGRCeRSgrLALDPAGreracq--TprrLWRQGcalpgrrrrlvvdAGK-GIGRgvdarrrrrmdhrlrhavrfHDFRSLWQCPG------------ >SRR6185312_354929 ---MVR--A-----------rgSAkC--WKCRWR--------------D-RA--------------SVSnSLPAPATSSAGSACSNfs-------MNGTA---SSkQPefDRVPRGGrgrgrrrKMTpeqVSLVQqsfakvapiseqaavlFYD-RL-FevapavkamfpadmteqrkKLM----------GTLAV-V--- >APLak6261666328_1056055.scaffolds.fasta_scaffold241778_1 # 2 # 196 # 1 # 
ID=241778_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.415 -GAKTAGGL---NLLFL--AivSS----EPENGFVTISPAAKDLFP-A-DL------------TEQRKKLIATLAIVVNRLSNLqsiLPAARTLTKRHV-NYGAKPEHYPVVGSAVLH-AGgrPRLGLDARSRLrsdGCVWHAVRLDDgrnleHEFANL--- >SRR3954463_16408791 ------QQITLVQESFARLAhdKARFGASFFKRLFKVDPTLEQSFAG-VD------------MQAHALKLVDAISFVVGGLRQpetLVGPVQKLGAARC-CRRCPTSSRTSGPRSSVPPGT------------------------------- >SRR3569832_1984102 ----------------------------------LEPKARSMFNF--RAD------------EDleaNPQFMVHARAMVDMIdmavgflgPDldpLIEDLSHLGKRHI-SYGVKPEYFSIMERAVMFAMEELLDDKLTKEDRTSWQLVFHFMITH------ >tr|B3SDK5|B3SDK5_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_62364 PE=3 SV=1 -SYLNYQERQAIIDSWNAIstEKQKYGTILFLKLFELEPRVKSLFTIF-DFN--EpleDIIQSPHFRSHAMRFMQSLETGVLMGFDkesCDFLFKSLGSRHH-FYDLKSEFLDVIPECILHTIKKGCGNNWSNETADAWKIATKVLCELFREG-- >tr|C1C1M6|C1C1M6_CALCM Non-symbiotic hemoglobin 1 OS=Caligus clemensi OX=344056 GN=HBL1 PE=2 SV=1 MSILTSNELSLISESWKLVvpDLEHHGLSFFLKLFEEYPTYQEKFFPELH-------QDERKIQRHGAIVLKSVGK-LVAFLEankviaLVDAIKRLATNHS-RRGVLREQFYPACRILLEYLAQALGTHLSTEGALAWKRFLGTFVELMQ---- >SRR5450759_1049036 --ALTAEaPYSELKnlCVWSKT------NAGMGSLYRSQHELVFVF-K-NGMRPHINNvelgrfgrnrtniwnyAGASSFGstrdselamHPTVKPLSLVADAIlDCSKRggivldafagsgtTLIAAEKTGRR---GYGTELDPFYADT----------------------ivrrFEDAYGL-KAVHVE--- >DeetaT_11_FD_k123_441726_1 # 2 # 373 # 1 # ID=403715_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.481 --GLTDLQIEMIRSSWEKVTpnKKHHGQLLFHKLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG--- >AP82_1055514.scaffolds.fasta_scaffold664619_1 # 53 # 358 # 1 # ID=664619_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.458 ---MSGFALRLVLTQRQKATrkrpiaqyvienhSINFAFHYIDRLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG--- >SRR5210317_1560035 
------------------XmtSL----KSSMIGFFRNHQNCAKMFGE--DMR------------DQAQKLAAILQVAFDNLDHvdsLVPILEDVGAKHA-TYAVTPEHYGLVAAALIGTISTELGDAFDERAAESFEAVLGTVANVMISG-- >tr|A0A037ZKD6|A0A037ZKD6_9RHOB Uncharacterized protein OS=Actibacterium mucosum KCTC 23349 GN=ACMU_09600 PE=3 SV=1 --MAHKGRVQTVRDSFQVVrtDADAFARGFYDRLFAKRPEMRGLFAD--DMS------------AQQAKLVTTLVTAVNMFDTpsqLIKPLKQLGASHA-QMGLSQADYQLVVDTIIETLETTLGSAWDVAHDRAWRGLLDFVSNVMQEG-- >SRR5688500_932283 --MLSDAEKQAIRESWQLVLpvVETAADLFYRRLAEQNPALRARGQ--DQL------------VAQRKEFVTTFSFVVRGLAWeasewrsdapdeddLFLGMLALGQRGSRLARLIEQHYSATGDTLLWTLTYALGKRFDAKARAAWMRLYTLLAIALR---- >SRR5688572_29427622 ---------------WALCAprADLLAAAYYQRLFERLPALRIRFP--ADL------------APARQRLVGLLRFVARALYWpaddwrrplpieedLLAILLALSRRHRGLGEVDDAVRAVSREALVAAIGEILAGEANPSIIDTWGKLHDLAADAFVL--- >APIni6443716594_1056825.scaffolds.fasta_scaffold11231735_1 # 3 # 137 # 1 # ID=11231735_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.400 --LLTADERAVLKLDWSRLTrvdQQDMGMRIFLRIFELEPSTKLSFPELYHL-TGDQLISNTLFRCHGARFMRAVAAAVDNVDALdlvvIPNLIQLGRLHQSVDGLRWRHLEVFEQAMTEVWAVELNLSgswSGSTSAVVWSKVFRLITSKVYEGFQ >tr|A7RWR6|A7RWR6_NEMVE Predicted protein OS=Nematostella vectensis OX=45351 GN=v1g203304 PE=3 SV=1 -CDMTYEQKYLIRETWKFLEvsKKEIGVSVYKRFLNMHPGLQTYFSEFKHIKID-NI---NGSHGHPRRLLMAIDNAVTALGDsdsFSAYLVELGRRHH-GMnfRPGPTHFNDLRKCFLSVIEEILATAslWDFQVEEAWNRLFDSITAMILRG-- >SRR6516164_7981020 -SPLTEAQKRLVRESFESMQeyETSVVVLFYGRLFEIAPETRTLFKI--DI------------REQSRSSWIPSGL------------------------------LSIRLTISWNCRQLLR---------NWDESTSltAFSPITMGN-- >SRR6185503_3589201 ---MKAEQLELVIDSLTVIQpiADQIAKSFYKHLFEIAPQTKKLFT--GDM------------DRQGIMLITSLSLAVNGLSDmenTLPSVQALGERHY-SYGVKPEYYQPAVESFLWSLEYHLGDQFTPELKESWRTAFQALADTMLSVY- >tr|A0A0P6AJ75|A0A0P6AJ75_9CRUS Globin OS=Daphnia magna PE=3 SV=1 MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGALLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDNLSKVLIDFLAAKLGGEFTPEARQAWTATMQGINTVVEA--- >tr|A0A0P5NXY2|A0A0P5NXY2_9CRUS Globin (Fragment) OS=Daphnia magna 
PE=3 SV=1 MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGELLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDVSKS-FSNFEC-----PENEVSRKDWTKNLSILQ-------- >tr|Q93101|Q93101_9ANNE Nerve myoglobin OS=Aphrodita aculeata PE=2 SV=1 MAGLSGADIAVIRSTWAKVQgsgSAtDIGRSIFIKFFELDPAAQNEFPCKGESL-AA-LKTNVLLGQHGAKFMEYITTAvNGLDDYagkAHGPLTELGSRHK-TRGTTPANFGKAGEALLAILASVVGGDFTPAAKDAWTKVYNTISSTMQA--- >tr|A0A210Q3Q0|A0A210Q3Q0_MIZYE Neuroglobin OS=Mizuhopecten yessoensis GN=KP79_PYT10061 PE=3 SV=1 -TYLTPRQIHLVQDTWDIIkdDLSKLGVIVFLRLFETEPDLKHLFPKIVQMNEQNKLeWDIDrdMLTKHAVSVMEGLGAAVESLDEsefLNSVLISIGQTHV-KRHVKPQMLKRLWPSLNYGLKQVLQSKYNKEVNEAWKKVYFYIVAHMKRG-- >ERR1719460_671936 --MVDAVVKGDVQRTWELVIPpdsgddhvFAIGKLFFDRIFEVTPGAEALFS-FKGE----DRAESAKFRAHAIKVIKTVGVAVAKLDDletLVPILEDLGKKHV-AYGVVASTTT----SSVWRCCGRSRRGWATNSRPTW---------------- >ERR1712223_635401 IPKLTAEEKSVLQASWANVNkkIEIAGAQTFIRMFESNPETQNQFRKFQGMDL-VQLEQSAEMAQHGKRVLSIVGMTVDNLDNyqiVWDNLIKVGREHF-TFGALPMYFDLMGPHFVIAVRSCLGNDWYEALEYHWLALFNMIVYAMKFGWN >ERR1712062_404977 --ILTNQEISVLKSSWELIAkkIEIAGAHTFLPTFDRDPKCPDN------------------IERHCQRVMSVVGGSIELINDyksLWKHLISLGREHF-GKIREWIFASIAGGSTersgcspssINFLSSKINGNITSKK--CFLQ-YKIVIITQX---- >SRR6266567_6698575 --------------------LIVFTSTCLWSI----RKPNHSLPKR-IC------------VVKLAHCWLHLTTVVAGVlreDNLVPVLQQLGQRHK-SYGVKAEYYPFFRAVLLETFQHYLGPRFTPKMQQAWEEAFEMISTQMLKGA- >SRR5215217_5048650 --RVTARGRAR---HVLLRApvRDRRGRGTTVRRHRHGSAA-----------------------PQ---VRRDARQDRARSGRaatLVPDVAALARRHV-GYGVEDRHYTSVGEALLFALGDTLGDRFTSDVHAAWVEAYALLAALMQR--- >APDOM4702015191_1054821.scaffolds.fasta_scaffold152199_1 # 3 # 686 # -1 # ID=152199_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.531 ------------------------------------------MS--GDF------------SPEQKRYLEGFTS------GLq------IARTGR-GLG-KPAASVPSGPD-----AEHLIAQDQ----------------------- >SRR5262249_5171126 
----EPDSALLVQSTIG-VLvqhQRRFTSELYRRLFGLAPGAQALFRS--DM------------ESQGKMLAHMLEFLVYATSRpetMTLGWRELGRGHD-GCGVGAEYYPAFRQAFLESARVVLDEKHTPQVEKAWADTLDMMIVSMLGP-- >APCry1669189000_1035189.scaffolds.fasta_scaffold267513_1 # 3 # 467 # -1 # ID=267513_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658 -VVLSDQHKKVIVRNWTILStdLSGRGTRIFLLIFGRNPLIKSIFS-FGHLE-GDELVCDPRFKGHALRFMQAVGAVVDNIDDynnaVKPILNDLGRRHTQFKGFKPIYFNEFQDSILQVSENGTCKQngeiriLNPSaagvnfCTPPLGKFSASEMTCIVSsGA- >tr|W6FSH9|W6FSH9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_a PE=2 SV=1 -LDFSDDQKADIKSTWETLYsgnKFQLGVELMANLFKAHPDYQDLFPSLKGIPD---VAGSNELRGHAIRVITGINNFVDALDEeeevMREMLHNMARSHK-PRKLTKTHFNEFAPILLETFEKKVD--MSSKARDAWIALYYSIVDNLFAE-- >tr|W6FIG9|W6FIG9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_b PE=2 SV=1 -MVVSAEQKALIQGAWTPIYagnRFQLGVDIFAHFFKAHPNYANLFPSLVGVPN---PSTSVELRGHAIRVLTGINYFVAALDEkkpvIMEMIHNMARSHK-PRKLTREHFAQFAPVLFDT----IG--VSGPARDAFLPYYNFIADNLFAE-- >tr|A0A023RLQ7|A0A023RLQ7_AERME Globin OS=Aeromonas media WS OX=1208104 GN=B224_3582 PE=3 SV=1 ---MTPEQIELVQRAWGRVTalNNTYVQEVYAELFRLSPDLINLFPDPAG--------------MPVTKVSETLNTVITSLEQLdalGFIIRDLGRRHR-QFNVQSHQFGLLKQALTLVLARRLGEHFTPALSEAWSQMYDEIAALMLEGL- >SRR5437899_2276119 -------------------YpaVQKSGAAVYRPALVAELRDRPY-E--FDI------------QVQLCVYLARMA--------leIVAALN-----AA-GWICVPKDPSPEM------LKAAWAYALDEDAAGVWKSMIAA---------- >ERR1700757_2961956 ------------------------------------------------------------------RFNRLAGRERRAPARtr----ARQSR-------QRPGPSRHDPTrLALSD----------VSEAERTDIVVS------------ >SRR5215213_1430710 ---------------------------------YLYPFLRPMFK--ENI------------QLQARKFSAHVSLVIGNIKDrntLQPMFEEMRNLHL-NHNVKTHHYNYVQEALFYALKNHLVKEWDEHTESAWIKFYNIMASQMAA--- >SRR4051794_22176940 -NRMTEASLQRIASNYELLAgqMQVLTGAFYKRLFAAMPEAQPLFR--IDI------------DLQSQHLAAALALIVRNIRFfdaLEQPLKELGVHHA-HVGVRPEQYPVVCRTMLETFREGSGQSWSPELEADWKAVLELVSRIMMDG-- >SRR5262245_41201456 
--XMTPHQILLVKTSFQAALtqRERIAGFFFAELFAREPAMWQLLR--GKT------------GMRWPALVDGLAAIVGSIHRihsIEPVLQWLSWQGA-VRGVGEGQYEAVGQALVAALEAGLGEAFGSEHRRAWMVAVGKVADIMARA-- >tr|A0A0N9QWL5|A0A0N9QWL5_9ANNE Intracellular single-domain globin (Fragment) OS=Eulagiscinae sp. JPG-2015 OX=1732542 PE=2 SV=1 ---VSDAQKALIKSSWAGVDLNAAGVAFLNQMEQKAHDVYAVFKV-G-----GGATSNPKAAALGLKVMTFVDEAVKGIDDMgavGGKLDELAQRHT-KYGAKKAHFPVAGPCFLDALAEVCGGRFSADARAAWSDFYDVIAQHLSA--- >tr|C7FFW0|C7FFW0_BRASE Extracellular tetra-domain globin (Fragment) OS=Branchipolynoe seepensis OX=326992 PE=3 SV=1 ---VSDAQKAAIKASWAGADLQAAGTGFYVHLAAEAPAVYANFNL-G-----ADPH-GAKSQEQGLRVMKFVNQCVNSIDNMaivQAKIDALAHRHM-SYNVKKSDFVPAKPCFLGALADALGGKFNADARAAWAGFYDIIAAGLST--- >ERR1719261_40108 -------TIAVVQGTWQEIKdalgdgvAETAGVILFKHIFRIAPQALALFS-FKDCAGgnvCDELFENKTLRKHAAKVVGTVDTAVGMLKktrQADSRPGQSGQEAR-GLwggagalrcgrgGVVGDAVGRVGRRVYDRGPRGLGGGLRHHQNHN-----DRQELRLHGR-- >ERR1719238_2294225 -----------------------------LKVA----SALREFN-TLRAEGivsEQEFLEM------KAKLLAVGKDELG-RSpsgDTLETLVEAThemdssRRRT-RWtrrarraSRSPTTVGVISCQIK--------KSSTRRTTRRW---------------- >ERR550532_3331206 ------------------------------PLF----PAAH--R-LCRPDGhdgCS---------------------VFGPDRppgE------------------APSTKDIVVTVIL--------X-------------------------- >SRR2546430_16462751 ---------------------------------------------------------------------------------flLSVVIA-----CS-CWCRHVSSlqhdrad-------HPVGLCPGIVADWSPALSQNVGEGFQQDCSD-dG---- >tr|A0A0P6RCU1|A0A0P6RCU1_9RHOB Flavohemoprotein OS=Phaeobacter sp. 11ANDIMAR09 OX=1225647 GN=AN476_12305 PE=3 SV=1 ----ASTCKALVLRSFESErmDLEAFIPLFYSNFFEAYPEARAIFPT--DT------------ERLEAKLLASLTHIAEALESserLDGILSELGQKHR-RMQISDSHFDGFIQSFIRSLATTLGPEWSDQSDEAWSQFLRYVAKRMSFLE- >tr|B7QTL6|B7QTL6_9RHOB Globin, putative OS=Ruegeria sp. 
R11 OX=439497 GN=RR11_330 PE=3 SV=1 ----APADRDLILASVESQkmELDQFVSLFYAKFFERCPDTRPMFPH--DM------------SLQEEKLLMSLTHIIEALEHpakLRLILLDQGERHK-ALQINDDHFAGFIDSFTGALKDTLQEDWSEETRQAWLRFLQYVAYQMGFLK- >SRR6218665_311178 -TPIYAGHRDVIRRTWPIIAdqMNANGCQIFLCIFELSPGIKRVFA-FGPAMSGAQIVNHPRLVQHASRFMEAMQVAVQHLDELdtvvSPIFINLGKRHIYFEGINADYFNVFSGAILYTWRQVLGERFSAEVRSAWSRLFDFVIQHLRFGY- >GraSoiStandDraft_9_1057307.scaffolds.fasta_scaffold3427870_1 # 1 # 249 # 1 # ID=3427870_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.747 --------ADVIFDSWDAVKripdyDVVVGEMMFRKLFENSPSTLKNFS-FGPRFagKEESLYKSRTFEIHTKAMIKMLEDVLSMIMpDlvpMKKTLKALGARHV-TYGVRPNHYELATEALLSTLESLLGYRWTPQVEEGWKTAIGFITNTMVAG-- >tr|A0A2C9KJS1|A0A2C9KJS1_BIOGL Uncharacterized protein OS=Biomphalaria glabrata PE=3 SV=1 --YVTPKEKELLRSSWNIVsqDISGVGMNIFKKLFDIETDLMKLFKRMLTKGeTGQVVVDSIRLEGHATGVLRQIGLVVENMDNnsaLTTTLIALGEVHA-NYRVRPEMLPLLWPAIRDALKIACEDEFTHQMELAWKHLYDFVTCHLSEG-- >tr|A0A1Y5RHX9|A0A1Y5RHX9_9RHOB Flavohemoprotein OS=Palleronia marisminoris GN=hmp PE=3 SV=1 ---MPNDDMRLIQPSIARIFvvRRSIGQAFYERLFERQPTFRTMFPT--DL------------RTQARTFDDMIALIVKKTGDpeaVTPVLLAIGRRYL-TYGLRPQDLRVIGEVLMEVLCAQTPGGLSPDEAAAWERSFSRAAEVVKL--- >ERR1719321_586101 --ELSYSTVSTVIDSWESVKrqenyAENLGRMIFIKFFDREPEAKTIFGFDGKKMKTdDEFYESRAFLAHGKHFVLILNKAFDMLGPdlemLTDILLDLGGTHRTKYGVKPEYFPVLGDALLECIEEMSDPeRFNDETKACWLEAYNALTEIMTT--- >tr|A0A2D6RHV2|A0A2D6RHV2_9GAMM Methyl-accepting chemotaxis protein (Fragment) OS=Colwelliaceae bacterium OX=2026726 GN=CL811_09640 PE=4 SV=1 ---MTPKQNIAVIESWKKVQpiASQVSQVFYDDLCEKHPSLKALLG--EELS------------SARDQLVAYLNSLVETLVATdevv-I--EDL-AKH-LRIGLAPEQFSDVGPALLTSLEIGLEKDFTATVKRAWTALNKLIVAAMAQ--- >tr|B7J6S4|B7J6S4_ACIF2 Globin domain protein OS=Acidithiobacillus ferrooxidans (strain ATCC 23270 / DSM 14882 / CIP 104768 / NCIMB 8455) OX=243159 GN= ----MAINIQLIQSSGAAVkdLGVQVAEHFYNYMFTHFPEVRKMFPG--------------DMSEQRVRLFNSVILIATNIDTmevLVPYLKELGIGHI-KYDTRPEHYPIVGKSLLNTLKHFLGAAWTQEMAESWIEAYNLASTVCIEA-- >tr|A0A1Q9NIM3|A0A1Q9NIM3_9ARCH Bacterial 
hemoglobin OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=vhb_2 PE=4 SV=1 --SLNTKDIQLIKNSWEKLteNKKEVRNTFYTGMFEDDPKLKSLFRE--------------SFLSWD-NLPDSFEFMFKHLENlegEILEMKRLGLKHK-TFSVKPKHFPIGRKSLVKTIKQYMGDKYTEELGAAWTKLFDYMSHYMILG-- >ERR1719419_74415 --PFTPEQRTLINETWGNISTKEtgsmgmLAKQVYERLFRSAPGIKRLFKD-SDM------------LAISRAFGGMLGVLVSAVNQplqFQHIVKGLGVRHQ-VYGVKPDHFRIMYTSLVRTFAQILGDKFTSEHKKAWSCLYNWVIDAMQRSMR >ERR1740128_1504408 ---------------LGVSYlarhIVPVDVRFLKEHVKTLFVLSqR---MPGNFV-NETLETRATLLYETLLVMSNLNYWVENLDELdlvVASIQKMATNHA-GRGIMAAQFETIGAVVVEYLKAGLKEALTEEMAGSREKLISTMVSIIKETN- >ERR1719354_333269 -MGLEQSDVEAIQRSWEIVKetakLRVHGVNFFEMRFEMIPDWReKYFSHMGP-------KTSAKFRSHATMIMMTLDSWIENLDDLdlvVDAVLRVGQTHA-DRDILSPQFVEINKVIIVYLETGLGDKFTEEMKESWIKLLDTVVTIIKDGN- >SRR5215207_9441599 -----PEQLALVRGTASIIDavGDSFAERFDDHLFARYPAARRLFP--DDT------------TTHRGQLTDEIVFLVAAAADlhaLLERARALGAPPP-LRRtrrrlparrrgTRRRGRGRRGRSVVGRNG---G-SLA----------------------- >SRR5690349_3556304 -TYLTGQQVLLLKKSFRQMNPAQIAAQFYGTLFQQHPEVKSMFPA--DTV------------ELGSKLMSVFELVVFSFDEKehgrfglqdvlIKPLRALGRKHD-DKGVKPEYYEIANSLLLKIMKE--SEYFTTEMYQSWQLALEHLTYAMQDK-- >tr|A0A2A4JK54|A0A2A4JK54_HELVI Uncharacterized protein OS=Heliothis virescens OX=7102 GN=B5V51_782 PE=3 SV=1 -SGMTLKDVYNVQHSWKTINanPLDNGYLMFFRLFEVNPESKTFFKILDNARTETEMRDNVRFRAHVLNIMAALNNSIENLNKpeiVVVWMEKLGTAHR-RSHVQERHFLIFKDVLVNILKNDLK--LSEAVVKSWGRYVTFIYSYILP--- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9902871_2 # 1417 # 1767 # -1 # ID=9902871_2;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.538 -----ALDTKLIKDSFELAKpiSDKLVKRFYENLYSDYPQSKSLYLD--G-----------QLPESQLAILKAINFIVDNLHNkekLGTFLKTLNERYE-LRLNDSVINQSVCSSFLKTLSEAFGSDWTSELAEQWELTYQMVTSFFQDSK- >OM-RGC.v1.013389558 TARA_082_DCM_0.22-3_C19717715_1_gene515718 COG0552 K03110 ---WHGESVTTVQRSWARIQqlgLENCGTLFYNTLFERWPEAKQLFSLSvrlkhrapgESEREGPDPTNSPALRKLWGKLLSVVGSLVSGACNpaeVVPTFHAVGVRHA-GYKLKVAHFDAFGGVMASVLKHLLGEEFTTEVQHAWTLAINFLTANIRAGFV >tr|A7C4X7|A7C4X7_9GAMM Bacterial 
hemoglobin OS=Beggiatoa sp. PS GN=BGP_4395 PE=3 SV=1 ---KQHDTIFEIQSTYEKILphLDEFSRLFYQQLFEIKPAFKILFRQT-DL------------RIQKQMVIRMIEVVVQGINNlenFMSIIQRIHQRHY-ELHLKPEDYRLAGQALVLSLEKYFGDEFTPTLKKIWLDFYESIVATMMN--- >UPI0004291969 status=active ---KQSDTVFLVQSTLEKVFpqLDEFTNQFFKKFYELDPSVKEIFYEI-DA------------KNKKQMVVNMIGFLTQGINRfdvIIPSIKEINERHF-GREVKPKYYLIASKALVNVLEDYLGEDFTPEVKQTWIEFYEQIVNFMEA--- >ETNmetMinimDraft_35_1059890.scaffolds.fasta_scaffold55614_2 # 1284 # 1421 # 1 # ID=55614_2;partial=01;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.623 ---KQSDTIFLVQSTLEKVFpqLDKFTDQFFEKFYQLDPSVKKLFNGV-DS------------KNKRQMVVNMIGFLTQGINRfdvIMPSIKEMNERHF-GRDVKPDHYLVAGKTLVNVLEDYLGKDFTPDVKQTWIEFYEQIVHFVED--- >ERR1719506_1011120 -GPITAREGQIVQDSWKAVKkvGGESGHAvikdIFYQHLLKDPNVKQLFRN-------------SDMKLQATKLWQTLHVAVDGLSTsgpWFLCCRIWARLTS-STGSKRS------TSMPWVRRsSTrspraWGPRsrrssrWRGRKCTAWLLRRX----------- >Cyp1metagenome_2_1107374.scaffolds.fasta_scaffold42158_11 # 5761 # 5952 # -1 # ID=42158_11;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.578 -RFLTVAQQNEIIATWAIIKeshaSEAIGMDVFKGLFISAPETFDMFDSFKKDP---DWQNNVHFKHHCKVVINVIGSFVLLLNQpekLISHLEFLGVKHN-FMTITPLQFELLGAELLKAFNKALGARYNSLTKKSWTIFYNKIAEVMQTN-- >SRR5688572_5289639 --TVTPDRQQLIRDSWRALEpnGPRLVELAFLHLLQIAPAARPLMTG-HSL------------PCVCRNVASILDQLIAALDEpkqFVPLAIGLGRSNP-GHGINAALYPAMGEALLWALHLQLGEGLTPELQTAWLEYHHLVSAIMRRA-- >SRR5690349_12423264 --XMTPERQQLVQSSWRKVEpnAARLVELAVLHLVSIAPSVRSHLDG-ATL------------PLLCQRIAAILGRLVETLDEpkqFVPLAISLGRENP-DRGLTAKLYPAMGEALIFALHLQLGDAFTLELQAAWLEFERLATAIMQ---- >SRR5215467_4845699 --------------------------------ALTWPLRR-------------------------RCWGKLLWpswiiwkmCPGCSRPSrswAPSTLGM---------VLLPRCTTGSADALVATLAKPNGEQWTPAHTDAWGEAYRAIVAMMLAGYP >SRR5262245_32871681 -------DPQILRETLELTLaaDDSFPKRFYDRLFTRHPEVIPMFHR--NSP-----------GAQRKMFAQKLIMIVDHVEDpawLARELRTVAQSHV-RYGVRPEMYAWIGEALIETLRDACDSDWSESAERAWRNAYTKIVESIFEV-- >tr|A0A1C4TW82|A0A1C4TW82_9ACTN NAD(P)H-flavin reductase OS=Micromonospora 
haikouensis OX=686309 GN=GA0070558_10167 PE=4 SV=1 -----RAVSADLGPSWAATAaaVDRAAANFLDTVSDRLPGLLP--------------------ERDHTVVFAALGRLAGGVDDtagRAAALAVLARAHR-GVGLLPQHADLLGDALLAAVARENRAHWTAALATGWERGLRRAVTAVRRA-- >tr|R4LFD5|R4LFD5_9ACTN Globin OS=Actinoplanes sp. N902-109 OX=649831 GN=fhbA PE=4 SV=1 -----GMDPaddaalnEvrrLLGNSLSMAGgpME-VAGRLRAALAQAQPTLFATLPG--GP------------VAQVEQLAEGLTWLIHHVDQppaLVAGFGRLGMALA-ECGVAPQQLQLAGAALAEAMRAGmAAHGWRQDFDQAWRSTWQHAYEWIAHG-- >tr|A0A1H7FRI4|A0A1H7FRI4_9ACTN NAD(P)H-flavin reductase OS=Nonomuraea pusilla OX=46177 GN=SAMN05660976_00171 PE=3 SV=1 -----MLGFQRVRDNFELVAkyGDGVPLYLFSDLFLRVPQLREMFPV--NM------------RSQRERLMGALAFAVEHAGDlaaITPYLHHLARSHR-KFGARPEHYAQWSVSVVNAMRRFSGSAWDDELEREWRDFLTAVSQVMIDA-- >tr|A0A210PV81|A0A210PV81_MIZYE Globin OS=Mizuhopecten yessoensis GN=KP79_PYT16126 PE=3 SV=1 PLGLTERELKMIKVSWDVLAedKKSNGVKFFMTLFTIFPTSKDLFKHFKDVPLDQLKydgettKSNKKMVAHAMSVMYALESYVDSLDDaycLEELVKKVAISHK-PRGIGPDKFKLLTPVLHAVIEDLVKDDDSvdlETIKSGWTKLIDTVCDIVEK--- >tr|A0A1L4CYV2|A0A1L4CYV2_9PROT Uncharacterized protein OS=Silvanigrella aquatica GN=AXG55_04100 PE=3 SV=1 -----NIDIQIIRDSFELTKpiGDQIINRFYENLFLEHPELKEFLSR-GDI------------QKQKEILLNTLVTTIDNLDKpesLSSFLIHLGEKHL-NYNMIEMYNDFIGRNFIKTLSQFLGRYWSDELNRQWNEVYKFISLNLKKG-- >SRR3954469_16801024 -------NYALLRNSFEKLKpvAGKVAERFFDILWNDYPETRDFFKN-TQM------------GPQKFAFFQALVFIVENLDQpesLESYLRGLGASHS-AHGVKKEYYGWGCAALHKTFAQTFADEWNDTLSFEWTKVFAMITSLML---- >SRR6266851_5623532 ------------ACTSPSVRstT-------------------TCAG-----S------------TRNSGYPAGPnSPTHStriSHDTRTDrigpkLIRVHRRRRA-RDGVRPRHYRSAGDALLGALAAHLGSDWTPAAESAWRRAYNLVAEIMIA--- >tr|C3Y526|C3Y526_BRAFL Uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_98913 PE=3 SV=1 -TGLTPTQSRLVKESWKMFlsKKRENGFVIFRVLFTDYPVTRKLFKGVEQldLDAPGQLESSITLRAHVTRFMHSFDTYMESLDDpedLKQLLYDTGKSHL-IHDIKPEYFDVLETVLMKSLRIVFGSKLTPQLEEAWQTAYSHLKVTIKQG-- >SRR5271166_2850757 
--RWMRPKRNSCARPSPKSRrsPIKAGAMLYEKMFALDPDLRRLFA--IDI------------ETQGAKLMAVFATAIANLHRldeILPTVRELGRRHV-AFGVKDRDYDTGGVALVQTLEAGLGDAFTPAVRDAWMACYEAITGEMKA--- >SRR6478735_6705068 SPSLTREQKRHIRETFAIIEpaSDLVARLFYMKSVDLDPSLGVLFKS--PN------------RVQRRKFMAAMKVTVLSLDRlqsLQPILKLLGARQR-EEGVTPGHYETFQDAWVWTLEQALQARFPREAKDAWSSLLGEMTAPQRPR-- >tr|F2Q9X8|F2Q9X8_BRAFL Globin OS=Branchiostoma floridae OX=7739 GN=lGb13 PE=2 SV=1 --PLDAWQRFYLQKSWKTVArkSDQAARTVFLRMLQDNPGLRQKWPRISLL-TEEEIPTSPYIKFLGERIFDCLDYIIDNLGDLDhviSELTKLGRQHSDMNVMTPEDVWAIEAAFLAGVQECLEDRFTIKYEEIYSRFIVFVIETMVIGFD >tr|A0A226E0J1|A0A226E0J1_FOLCA Hemocyanin OS=Folsomia candida GN=Fcan01_14017 PE=3 SV=1 KVQLTPDEMIAIKRNWEVIHqdLTGNGMDMYLHWFAAFPHMQKVFKKFAQVP-RDQLKTNDAFKAQATVTLHWIDDMIEAIDSpsdMAAVMKRLGRMHQ-TRHTNIYDFREMVKRIQEVIGTKVGEGYTPAAESGWTKLFAKLVENIGD--- >ERR1700732_4531564 -----ASPNGRRNSARASmlISsqPIRRSPRFSATTW-----------------------------WHRPRC-SCSLWVRSEVNRmeeLGGGLCALGERHV-DYGVKRADYNKLASVLIQTLKEFLVDEFTVELQHAWGTVD------------ >SRR5258708_12476517 ---------VLWEWLVDVGGarWRWFGGRLLEIFLETSPELRSLFHK--DI------------AQETGMLEWMLGSLVKGLNRlleIEGGLRALGRRHR-DYKIDQADHEKVLRALLLTLAEFVGDDFTPQVSRAWKTVYGKIPDTMTDR-- >SRR5882672_7954690 -----------------------------------------------------------------------------------------------HYGNANRYQGVRPSRCIpGESSR-----HRPHGASQPSVG-Q----------- >SRR5215469_12962076 -------------------------------------------------------------------SLSARAGRQAGFGl---SG-----------LGSAAT--taiPTPSTSLTGSTARTTG--cSAPYSR-----TGT----------- >SRR6266704_5570200 --GIN-----KTPGMFEKISssMPLGRVA---TVDDIIPFISFLAS--DD-----------------S---KMITGAEAGGNs--fVLVLTNLRNIH------------------------------------------------------ >SRR5205807_5077868 ---------------------RVGHGRVYPRLYIIARHAAGIYAL-TRP------------VAKPgRPRPVCLVPIHKDIA--vmrVTTDQLLARTPL-GrFGEAAevgqlVHYLVSDAA------RFVS-GATVTIDGAWTAYGGWALR------- >ERR1712137_931585 
-------------------------------------MGTSLLG-VDCE-GEEFVKT-DSFVPQAKKFIGLCDSFIDMLGPdaelMAKILEAEGRKH-EKLGIKLEHYSTMGEALISGVKTL--DeKFNDETELCWKLVYCGVTNNLGKAN- >SRR5437868_6667390 --------------------------------------------------------------------------------------REIAASD---------ESEGVGDAEI-------DERRSNRLGDVHRSALGprpvtvrdnhgtrtaVKEGSIRRGV- >ERR1740124_2148144 ----------RTRGAAALLLqgrAQPCGVAQAQEACYVCDEHCRCCSQ-GSgGP---QQacarATGPPAHMPYA----THRCRVCCRIGiraRAPPTQALGKRHV-PYGVLPAHYDVVGQALLATLEGGLGAEWNDQVKASWTAVYGIIAKTMIG--- >SRR3954451_929548 --SMTPEQMQLVRLTLAQAtaDPLALGRDFYRRLFVLAPDLRARFH--GDID------------AESLKLKETLTLAFGALTDmrlLVATLDGLAKRDV-ARGLSEQHCRAIAQSLIWAIERRVGSDFTHQVCNAWIAFMAVAMTCLHG--- >SRR4051794_5741567 --SMRPEQMQLDGLTLADAttDRLARGRDFYRRLSVPAPYLRGRCD--GDVD------------AESAKLKETRTLALRMLGNmrfMVATLDAMAKRDV-ARGLSEQHCRAIAQSLIWALERRLGAGFSRQVCTAWTEFLAVVMTCLHG--- >SRR6516165_10653891 --EPSPNQLHQNRPD---R-RPGGGTLLWPPLRDGSR-NPGAVL--QRR------------GRTGSEANGRSCNRCEQSRRFrgdRPHRTRS----C-KAPRRPEHYALVGSALLWTLEQGLGDEFTPALRAAWAAAYCALSEVMIA--- >tr|A0A1X7UGV4|A0A1X7UGV4_AMPQE Uncharacterized protein OS=Amphimedon queenslandica PE=3 SV=1 -MSLTSAQVALIESTWKVVKkdLQGAGNIMFLKLFQIDVSVRDKFP-FRDVP-YEELEDSESFLKHSLQVMETIDLAITLLlGGemekLVEALVDLGMAHA-MQGLKPEDFDHVGEALVHALGVALGKEFNDEAKKAWTLLYSVVTAKMKEGL- >SRR6266699_274039 -------QGELLETSFQAIVlhGEAFVTAFYERLFTRFPETRAFFAA-TDM------------LEQRKKLQQTLALIVQHIQHpevLGDMLQELGQRHV-TYGIRPEHYPSSERCCWRLSPTFSGSTGRRRTTMPGSRGMRQSAAX------ >SRR5438045_5489985 --------LITRPTSYYLLSlhdalpISLLADVFYSKLFVKNTGLRKMFP--ADL------------QLQRQKLMNMLHFIISNLDQpelFNKEIEGLGLRQD-RKSTRLNSSHLGISYAVFCLKK------------------------------ >tr|A0A1E3GPU1|A0A1E3GPU1_9GAMM Bacterial hemoglobin OS=Methylophaga muralis GN=vhb PE=3 SV=1 -AKLQEQDIALVEQNFAVLMefSDALAERFYQRLFTEYPEIMPLFKS--V-----------TIEGQHKKLLASMVLLIQHLRDtemIEDYLQGLGARHQ-QYGVETSHFEMFIENWLSVVAEFADQKWDSKLQQAWRNVLEYVAELMQSPT- >SRR3954464_793235 
--------VDPFRSRFAFGVerEPEVTHRFYDVLFAKYPQVQPLFGR--RSR-----------ADQERMLRDMLVAIVDHVEDppwPQHHPPPPPPNPP-RPAPTP---------------------------------------------- >tr|B7QBW9|B7QBW9_IXOSC Beta chain of the tetrameric hemoglobin, putative OS=Ixodes scapularis OX=6945 GN=8038954 PE=3 SV=1 -TEMTSQEKHVVRDTWAIFKkeVQTSGVAIFVVLFFKHPAYQKLFVAFAADP-IAELPQNPRAIAHALTVAYAITSIIDTLDEpetSAELVRKVATNHVRHPTISGAQFEHMGQAVVEVLAEKLGSAMNHQAVGSWQKFFAFVVRVSQGVF- >tr|A0A1B6H4C1|A0A1B6H4C1_9HEMI Uncharacterized protein OS=Cuerna arida GN=g.19114 PE=3 SV=1 MRRLTEREKENVRLVWKKVedDYPSYGRSVFVKLFDEYPYFKKFFKATIG--NFEDPFMSPRFQKHMLQvLMPTFGGIMDNLDFpeaVNEAVKRLAVSHR-KKELGiaKEHINILGQVIVSVVKRDTL-GCTEEQEEALEKVISIVMAMFC---- >SRR5215813_3453690 ------------------------------------------------------------------------IASDSEIQVspwtrt--GTLAISARRCS-SSRISSGigsdtTFSLYGNCV------------SSSATIAWNTHGD----IQLDS-- >SRR5579859_1863727 -------NISSLQLTILNLLtvEDEFVPRFYNNLFNMYPLARSLFVHTe--I------------SLQYNKLRLMLMMIIRTIHDadgLKIQLQQLGQRHK-YYRVEPEHFAILYIVFVQTVVEYLGPKWTAELEAAWAEAYGTIVRMMDME-- >Dee2metaT_7_FD_contig_123_47857_length_200_multi_10_in_2_out_1_1 # 3 # 200 # -1 # ID=100007_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.434 ----------VLRDREG---lgDPELVVLQRRHLAEHGAILQPLalLARQr--H------------REDLELVRELLLLECDHRVEhprahpaGVGVEGELGVGHH-TERIKRSlspsalLGRWIDLVVVGAVRR---------------HHQGGVVDLRLVE-- >SRR5436853_3450426 --------PVLLKDSFNLVRseEHTSELQSLRHLVCRLLLEKKKKnkTTTV-----------NYIE---KEKLGKLEA-SCPVEqti-------GIGDKQR-DYQ--QMHHPERTEAQ-----KX----------------------------- >tr|A0A1W2WRJ7|A0A1W2WRJ7_CIOIN cytoglobin-1-like OS=Ciona intestinalis GN=LOC100183004 PE=3 SV=1 -MPFTDEELKLLRNSWDEVKklgMKEVGLHIFTGLLNAAPSLRTLFYTI-DLPDEeeltiDVMRENKKVVAHATRIANAISKFIKFLDQpeeLEKLLTSLGESHA-RRQVDPESFEYVAPVILSVIGGHLKLPSNSPTLQAWVKAYGVLRNGIVS--- >tr|A0A1W0WQD3|A0A1W0WQD3_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_08524 PE=3 SV=1 
-TGLKKRERLVVQQTFEAIsKklgRAVLGRDIFYLFFQLHPAYLQLFKALRDIP-PEQLKTHPRLKAHGLNAIQALAAVIENLEDTettVLLLEKTGRDHV-RRKLQSKHFEDFHSTTVALLKRELGPSFTPFVEQSWNKAFTVVNTVIL---- >SRR5438034_562795 -------AVETLRNSFERVIerSPNLTRRFYEILFEKYPQTRRMFGL--QS-----------GKGKGNGKGAGARQRLRRChcrlhfgkekaTVvpfPLPVPVPLPAFRD-SYX------------------------------------------------- >SRR3954466_4238475 --------IRRLTRSYDQILsaGDCLPELMFAQLFDRAPELRTLFPD--DM------------GRVKHQFARMLHWLIAHLHEpqkLRIALVDLGRRHQ-EYGVKPDVYPHLCEALVDAMATICADDWNEELCRDWRQTFDLMVHHMLRAY- >ERR1719359_2370951 -------------RLIVTPEhldGCRAGLLALRVVLLHLGEGLGLLG-SDSSGVSdcgVALgeL------------PLQRLDLLGVLLGpr----L---GL--L-NAGVRGLELSLLGRLlrvglselfVAEGLLLGL---------------------------- >tr|A0A212ELK8|A0A212ELK8_DANPL Globin 1 (Fragment) OS=Danaus plexippus plexippus GN=KGM_200313A PE=4 SV=1 -SGLSRRDVFAVQKSWAIVYanPLANGSELLKSPYISRIL----ILLVDKVS-EI----------------GSIVKAATDVE------------------------------------------------------------------- >ERR1719343_803772 -----------------------RAVDCSFDFSRKSPVPRPSLA-SAKKDfngDANSVYDSRKFLDIGKNFIEIVDQAVDMLGPdlqvVAEVLIDLGKKYHNEYDMRPEYYSVLARALIDELEEILGTDkFNTRTKSCWVQVYGAIAADIAA--- >EndMetStandDraft_7_1072992.scaffolds.fasta_scaffold3604113_1 # 1 # 288 # 1 # ID=3604113_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.538 -NNLTDDQKNVIKKTWITIEenRTKIGKQTFIRVFELNPQIKKMMPEFMTADPIEELNSSRKLFGHSKTLMTCLENAVKSLDDnerFVAYLVELGRRHQ-VRPLKAPYFEVIHEALMFSLKDVFQSDWTTETSESWSALFRYMSEAMIIGL- >tr|A0A136A626|A0A136A626_9ALTE Uncharacterized protein OS=Paraglaciecola sp. 
S66 GN=AX660_04410 PE=3 SV=1 -MILTVEEKSAIKESFAVLLRenANVAECFYNNLFELAPLIKPLFKS--GR------------ENIENHFHELIGTAVNKIDHfndLRADLIALGKRHK-IYGAQQAHFAVVKAAFILSIQYKLKGQCSPFLENSWAKYIDNISSVMIEGL- >ERR1719461_1916292 ------------------------NV-SLFSLFAADPGVQtKYFGHMK---------TDADLEKHGVRVMNSIGAMVRAILDqdddrLITKVHEITRNHQ-PRGINRPLLEFFLSVVLDYLAKALDSHLSKEGGA------------------ >ERR1712179_865199 ---------------------------------------QrKHFPHMM---------NssigksltKSKLKIHGGRVIREISVMVDCVQAgndeaLMAKIKEITVNHG-VmRDImSIEAYRLVLDGLVAFLGSALGDSLNETGHHAWKKLVNNIITGID---- >SRR6266699_3297184 ----ALARGSLATPCFRSHRAqhFQARMpykPVGSLEAARQHAREGLFRS--DME------------RQYFKLMDMIAAIVGTLDKremFQSIISHSGRQHA-QFGAKPLHFAAFGDALIWGLEQQFGAAFTPEMKEAWIKLYDDVQREMMR--- >ERR1719271_149007 --AVSARERRLIERTWEKAKedgCDALGANLLQTLLVAEPQVMQLFP-FKDE---ENVYESLRFKAHASKLAVIIDAAVSLLANpvkLESLLISVATSYEYsFKQMLPEHFPLLGEALIRTLTSIVGgTKFTWQAESAWRKVWTIISTVMIGA-- >ERR1719203_2782565 ---------ITSKFGWTSNmq--------------KIIQSQTHSKT-QDMQ---RDYYLNQK-KTLEI---------------nvRHPLMKELLRRVE-----DNPEDKVAKdMATMMFNTATLRSGFSLKDTVNFAESIELMMRQTLG--- >SRR4029078_13512293 ---------------------vKRVAAELfYVKLFELDSTLKLLLA--D-Q------------QVREQKFMQIVDATVNGLEHsegMMSAVRELGIRHP-LFGDSDEHHGPVATSLFWSLKKCLRKDFSGEECPRAVGGHALC--------- >tr|A0A147B4Z8|A0A147B4Z8_FUNHE Neuroglobin (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 MGELSVKDKELIRGSWESLgkNKVPHGVIMFSRLFELDPALLSLFHYSTKCDSKQDCLSSPEFLDHVTKVMLVIDAAVSHLDDlhsLEEFLLNLGRKHQ-AVGVSTQSFTEVGESLLYMLQCSLGQAYTAPLRQAWLNMYSIVVAVMSRGW- >SRR5262245_48005872 ----VSMHTSPLRASVELVEqrRSEAVRYFYAHLFAGHPELRTVFPI--SA------------VEEHDRLFTALLYVVKNVHAlpmLAAELQQVGRDHR-KFALSAEHYQVVGASFLATGAAILAEAWTSEIGSGWQSAYRMAASVMSD--- >tr|R7WMM5|R7WMM5_9NOCA Flavohemoprotein OS=Rhodococcus rhodnii LMG 5362 GN=Rrhod_2088 PE=3 SV=1 --IFDDRTLRRVRATYKDMAArpdwdSHLAQSFYANLFAENPQLRLLFPA--NL------------EAQTHRMLTAIRYVLDNVEQpdrMLTFLGQLGRDHR-KYGVAREHYEAGGRALLQSLRGSLVtLLWTPTVDAAWSEVVGTIVGTMAD--- >SRR5258708_3005780 
--EPTPTDITIVSDSLAPLTkeqVDNVLAAFYHQLFTRQPSLRQLFKSFRSGDQ----PDQQAMKLQRNKLAEIIALGLKLWEKphqLIPALEKLGRQHH-QYGVRDEYYEDVWIALSEVLSEAFGLDRWEDICESWQRFIFLCARHMLNG-- >ERR1719347_1330150 YFCLSESNIKALKSCHPHLkdRKEEFGHLFYSNLFSNHPDLKSLFDQ-TEE----------GRQLQAQRLADTVVAFLEKCDDlpsLLPTFKKIGKRHT-TKGVKPEMYQIIIDNLVDTLEEMLGKeVFSAEVKQEVLESISFLSNAFIK--- >ERR1719284_1036555 ----------DVSASLDLVKrlpnYeQVVGVRLYQKVLAAGPQYVKMFP-SVASsltssNDPEEFLKDPVLLKHLTSYIRMICMAVDLLGPdtelFEEQVRELGAKHS-EYGVSQRYYVVMGKALIQTLEELLGDRFTPSTKQAWEKMYDLMSSTMIKG-- >SRR3974390_2763688 --XMSPETKELLETTWAKVIpiSDVAAGLFYERLFTLDPSLHRLFEN-------------ADMKEQRRKLVQALHAVIYSVDDlpsLIPTLEILGRNHV-RWGGIGGTPRDLGGQSHPEAVGRI-----PNIR---IVAVAvGRPDIMLV--- >APLak6261669570_1056073.scaffolds.fasta_scaffold275140_1 # 52 # 198 # 1 # ID=275140_1;partial=01;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.524 ---WSTRRVKVVQRSWETFKstqaeSTTVGLAVFKRFLRRSPAFLQLFP-FRDQP-LETLFLNAKVRLHCKLFADTVSRTVGLLGDsvaVKASLRELGARHSDLYKVRSGHYAAMGSALLEVLEHNLGESWDEETKTAWEETWAYITEQMQKG-- >ERR1035437_6084348 -SSLDQEMIAIVQVSWENVTPDsrLAASMLAMNLCADDRNIASLFEE--DR------------IKMSRDVMQAVSCIVADLDQpetLVPYFGSLGQLLR-RHGLHESGQQTFATALFLTLGQLLGPRYGPVEHNAWAIAYSFVVRIMIAE-- >ERR1035437_3078414 -SSLDQEMIAIVQVSWENITPNsrLAASMLAMNLCADDLNIASLFEE--DR------------IKMSREVMQTISSIVAGLDQpetLVPYLGSLGKLIR-RHVLHESGQQTFATAFFLPLGQLLGPLYAPVEHNAGAIPX------------ >ERR550534_521252 -TSFKPNEIMEMRVMWNGWvggDMASRGFEMFCKMFEMHPETKDVFA-FMKGSSVAQMQSSSKVLFHVTRVMKYIDEVMRHADRLdevVPILRQVGGRHGTqGYNIQSGYFPFLGNALRQLLKDHFKTRYTAVLDGHFQKMWGFIVKQMQAG-- >ERR1712105_94955 -TEFKPNEIMDMRVMWNGWvsgDLASKGFEMFCKMFEMHPETKNVFA-FMKGSSVAQMQSSAKVLFHVTRVMKYIDEVVKHADKLdevVPIMRQVGGRHGThGYNIQSGYFPHLGEAQRLLLKDFFKDRYTANMDAIFKKLWVFIVKQMQAG-- >ERR1719483_559503 EGPLLAKDVKAIEESFAMVAalgsAKELGIGFFRLLFTTYPEWLEkYFvPNFGDKP-LEEFLMIPRFEVHAPGVIVELSKWVGSLHDldsLVAAIQENARNHY-RRGLNVDHYKKIAGVLLSYISAGLGDSLTTQMETAWTKFLDTMVNVVEEEM- >tr|A0A195EH31|A0A195EH31_9HYME Cytoglobin-2 OS=Trachymyrmex cornetzi GN=ALC57_03526 PE=3 SV=1 
-LGLTEKQKKLVQNTWAIVRkdEVSVGVALVIAFFKQYPESQKEFKSFKDVP-LDELPKNKRFQAHCINIVATLGKVIEQMHDpelMEASLINFTEKHK-ARGQTPEQFENLKQVILAAFPSLFGKQYTSEVQEAWKKTLDLIFSRICQ--- >tr|A0A158NI97|A0A158NI97_ATTCE Uncharacterized protein OS=Atta cephalotes GN=105620364 PE=4 SV=1 -----------------------------------------------------------------MNIT--NGTIHDILSGgkNTQKV--FL--FR-HRGRTKEVVEKEEKIRVAGLDtngshradCPKGTDEGREIGDPVTDSLLQMLQKKEK--- >SRR5690606_21296714 ----lmEWERVKLVQESWSSITpL-gaKFTQVFYRKLFDEHPAVVGLFPE--SM------------AEQEQLLSRMINPAISCLPAesvFENMMHKLGNRHS-EYGINEKHYRMFTQSLLETIRESLAERWTDELESAWAEVLSGMSRRMN---- >GraSoiStandDraft_11_1057310.scaffolds.fasta_scaffold26797_1 # 22 # 990 # 1 # ID=26797_1;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.733 --VIvTDSDISGCFSCWQTVVdGkapayiEdsdpnkpsglvWFSNVFYGRLFDVNPEAKKLFRD--NN------------ETKARALGNIISTGLRQIWDranFSKILHGIAVSHC-KLGVKAIQYGLVGDVLLWSFAYTMKNMWDQDLRTSWIAV------------- >SRR5690606_23735845 -TSFVSLNANVLQRSFEFLApqSDRLAKRVFEKLLKDYPQYRPLFAKV-EI------------VDLRQRLIQSLALVVKSAQRpetMVRYLSELGIRHA-EYGITDNDYRPFTSVLLGVLAEFSGARWTPEVKTAWEEVX------------ >SRR5215469_11104805 --TGVAEQHLLDLGGVDVLP--APDDHVFDPA--GDPQVaaviedAQVAGV--QP------------AVWIDGFRGAFGHVEVAEHGLvaarADFPG-LAGRHG-FPSDRV----------------------ADGDLYL----------------- >tr|A0A2T7P4Q7|A0A2T7P4Q7_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_10992 PE=3 SV=1 -PSLTADIRRVVQQSWYRLvehrSLDQLGIPVFLEIFHLTPAAKKLFH-Y-SeKTTIEELEGDRRLREHATRFMNAVGAVVDNLDKknsddLDVMLREMGADHTNISTFNQVYCVIFREALLSVWERNLGKaRFRGELKNAWRALITYMMEVMREGYD >SRR5438128_5040868 --------------------------------------------------------------------------------------------EY-RWAEGSSelaaEFVRLNVDVIV-----TGRLPAVAAKQADIRHSDCVRDSCGP--- >WetSurSiteA1Bulk_404760.scaffolds.fasta_scaffold823987_1 # 3 # 239 # -1 # ID=823987_1;partial=10;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.409 
----------------------------------------------------------------------------------MPN--------------------DSDSCHSVDNSAILHAVLDSAVDGIISIDESGTMESVNA--- >ERR1711918_283694 -----------------------------------------------------------GSECSWMCRC---GIARFEQT----RTTSHKSRRA-TYRvqPDRGILAHPGESCDDHFGGAPWGGLHPEVENAWNVVYGFPSSIMISGPR >SRR5262245_16285966 ---------XMVEGTLDAVSLPALSADFYRRAFDTDPELARMFTA--DR------------RVQEARFATELAAIVRSIRchdEFVPAGRALGPVPR-L-RRDGRPLPRDGRRPAGIagrcprsdvearGGRGMAPRLQPDRRDDAERRPRAGQLGVTSG-- >ERR1712061_521749 ---PVGHMKTAVEQSWERVQalgPVVIGAQEHRDVAVVSRTTST---TSTRI-EESDATAAGSLANPF---------------------------------------------------------------------------------- >tr|X6EW29|X6EW29_9RHIZ Adenylate cyclase OS=Mesorhizobium sp. LNHC209A00 GN=X738_26865 PE=3 SV=1 --------FALAQRSVGLLLddPSAFAAQFYANMFAIQPELEGLFVN-G-T------------GAQGAMLSHMLRTVVSGLERRkhvPAGLQTMGRKHI-GYGVELDHYDSFRGAMLKTIDDIMGAGLTREIEESWSETLDVILGLMKKG-- >SRR5215471_14715706 --------PAGGPALARLLRr-------HLRRV--VSSRLAPLFLR-LAF------------NDAISYDPATGSGGANGSIRLpeeLARKEVAGLARA-V------------------------ERLRPVKE------------------- >SRR5205085_9494957 --------PASGPALSRLLRrhLRCVVTsraapLFLRLAFNDAISFNPATRA-GGC------------NGSirlaeelEREEIQVLSQGIEQLRPLkerFP-HVS----------------------------------------------------------- >SRR5947207_2391870 --IISNRQARRTNDRLQIELaaAQARIGLLYFAQHDRTRAAA---------------------------------ALLEGPDAFdqqRPALRAMGLRHV-AYGVVPAHYDTLATAFLWPLGHRLSPEFSPX--------------------- >tr|N1VSG6|N1VSG6_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 GN=LEP1 -----PDPILEIQKSFDHVLeyNPHWIDSYIDKLKNFSMenvTENQREGDN-ES------------PISSEEFLNSIESIIEKLGNpisVKKEVSKLANIYE-SLGITKKEFPKLLPILLSSLRENLPSEWNPSLESIWTQAITDLTIETIES-- >tr|R8ZTT5|R8ZTT5_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira yanagawae serovar Saopaulo str. 
Sao Paulo = ATCC 700523 GN=L -----KDQILELQRSLELALqlNPNLARDFYIHFLETKPEFQKFFQNT-DM------------ETQAKKLLAMFGKTIERLGNlnqIQIELQNLGKMHE-EMGIPVTDFGAIAPSLLYALEKSLGDQWNAEWKSIWETALGSLVRLMGMK-- >SRR6478609_9341681 -------DAELLETSLALVDTpdASLDSRFCALLHERHPAVHPGGGD--TA------------ARQAKLLRSAVISVVDHLDDpvwLTETLGDGTARPS-GWQVAPEMCGAVSECMVAAMVEIGGARWTSQMTDAWVEALDAVSGPMLLGS- >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold1207366_1 # 2 # 214 # -1 # ID=1207366_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.286 -----YASHQSQAASLAKAAprPRVAVLGLrlpsgeSPQLARLGRAFAELLG--AEL------------AAGERLLVLPAeRVehMKLELGLdeaEAYPLPTLGRIHR-NLGPDLVVVGTlapqeprgtlsvtveVKDCLTGAVTATAKVTGPAAELFTLASQvggelrrrlgssalsgneraelraqrpaSPEVAQLYADG-- >tr|V4A611|V4A611_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233216 PE=3 SV=1 -IGFTETQIDTIRSTWPLLSrnMVRVGTDVFVRIFTEVPTVKELFSSF-NIVDVNDLHKMPTFRAHAEMFMQVLHLVVDNLETpyseLNHELMVLGARHATFSGFKPEYFKFYVKCLIQVWELELGEEFILEVRDCWKIVFDFLVDNMTEGYE >SRR6266542_3322184 MTVMTPEQIEAVEATTAVLapALDDLAADVYARLDRLAPETAELFTG--GPA------------AEVRGRARDDRARHPAPRRLpGacl--------------PARPPARALRGQA------GALRARRC----------------------- >SRR5918994_1217714 ------RDiEAYVRT------gRAA------VPVFESDVLLEDCVTS--AA------------NNDWcgVSTRPRNEVWPGFKVGlerAVPVLEQLGRDHR-RFGAVTAHYDAVGASLLATLRHFFGPAWTPELHQTWSEAYGPVAKVMVTA-- >SRR5207302_4688282 --VVTLEQFRLIQHSWKLVKdGqfaaftaqtliadplGFWGLQLYDTLFALNPSLKPMFKN--TF-------------TQSQMLTEMVGAALGllpgildqalgeektAIDPqLIPILVDLAERHV-SYNVKAAHYGTVGLGLVTTLERTLGSHFDEQKQATCFELWSMMX-------- >SRR5437867_13093015 ---------------------nqnpsPLWRA---------------------RL-------------PR-------VSIAFGlrwfNCnTSkSYSRKCSTNLLNV-GYNVKAEHYGTVGLGLVTTSERTLGSHFDAQTKAAWVELWSLICTVMIP--- >SRR5882757_3847967 ----------------------TSI--------------WPIIIN--TaV------------GirnipQDYRNVARVLRLnqFEF-FTKimvpaAAPYIFTGL---------------RIGIGLSWLAI--------------VAA-------------- >ERR1700737_3002051 
----------------------RDF--------------HHLDLA--DhH------------Q---------HRVagTQW-AN-gsMSNAVWTGV---------------RLKDVLDRAGV--------------KSGAI------------ >SRR3954451_23003713 ----------------------LKS------------TTGEVFLE--G--------------klv-DE-------PGpdRAI-VFQnhsLLPWLTVYG---------------NVAIATDKVFGGSGARSKSKAERHDWVMHNLELVQM---A-- >SRR5206468_1650083 ----------------------TNA------------TMGCVLLE--N--------------rev-NS-------PGaaRRR-QGVcerQDPQRAQRMGDAqpqpradgacqgqA-PG-GDFRRYEAARRHCPRAGHATKSAAARRAVRRAGRADPRAPAGL------ >SRR5258705_633045 ----------------------TSE------------DAGPVALG--N--------------qev-KQ-------PRtqPPV-VFLdpaLPPRPPALD---------------HWLLRAARDAGGP------QPQ-------------------- >SRR5690606_21133184 ----------------------INP------------LHGAVRLN--D--------------aap-RV-------GDpeVGY-LLArdaLLPWRTALR---------------NVTLPLEV---RGI----ERREREQSARKVLRDVGL---E-- >ERR1700682_1967427 ----------------------DRA------------SAGRVVVD--G--------------sev-RG-------PSldRGV-VFQspaLLPWLSALK---------------NVAFAVRSRWPRW-----SDEQVVSHAQKYLDMVHL---T-- >SRR5699024_2544359 ----------------------LSPSSGKIIVAFSSPTSGKIMMD--V--------------ndwtSYKDSEMTALRLkeIGF-IFQeshLLPYLKIRE---------------QLEFVGREAGMDK-------KHARKRAKEILDLFGL---D-- >SRR3954447_21976298 ----------------------RAA------------TGGVVRWS--V--------------dplvAAG-----GRARhpLSM-VFQkdtVLPWRTVAQ---------------NVGLFYALN---RD----RRAGAEGVVDDLIRLAGL---E-- >SRR6266567_262474 --SMTPEQIDLVRKSFDALWpfRRKLADQFYGRFFELAPDTRRLFPN--DME------------RQQLKLMDTIAAIVGTLDQreiFQSIISLTGRKHA-DFGVQTSHFACCFYPKSLEAPAHAGGFLCSSpLNVSWNGARARPYPLMHL--- >OM-RGC.v1.004444255 TARA_034_DCM_0.22-1.6_scaffold509117_1_gene597562 NOG05352 "" --PfLQPTKFELVVNLKTA----------------------KALGL--EVP------------PTLLARADEVAGVGGSAKRishWPPR------------------------------------------QSRWAGLPRRPERH------ >ERR1719401_1263416 ----------NVLTSWNTLKskpnyCDETAALIFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCTVSMLGpdlfELSGVLHEMGRRHQ-RNGVDRSYLPYMSEALFHALAKMLGPQFTEDDKEAWKGVMDYMISEMVIG-- 
>ERR1719401_232394 ----------NVLTSWNTLKskpnyCEETATLVFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCIVSMLGpdlfELSGVLHEMGRRHQ-SNGVDPSYLPYMSEAFVCALSKMLGPQFTEDDKEAWEVVMDYMISEMLIG-- >ERR1711862_565156 ---------------------------------------KIMFH-FPVNMNIETVLKSKIFLQHAKFFVKTLDITIGLLGpdtdIIQDVLLEHSKTYQ-NHGVNSAMYLHMGESILYALEKDLGDvNFTSKDREAWAYFYGTIVGVIVGG-- >GraSoiStandDraft_1057264.scaffolds.fasta_scaffold343999_2 # 425 # 754 # -1 # ID=343999_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.636 ---RRRMDAELLETSLALVDtPdDGLTKRFYALLFERYPAVRPVFPEEmhRDI------------ARQAKMLRSAIISVVDHLDDpvwLTETLGELGARHA-GWGVLAEMYDAVTECMVAAMAEIGGDDWTPYMTDAWTEALDAVSGLMLLGYP >ERR1044072_5206314 ---MAPPQIAVARSTGPKVSPmqQRLAQVFYERLFELDPTTRAFFGG-------------VDLRHHGLKLTETLSAGIEVLGRdgpAPRGS-----------GSGMAALRDGGGCVVHGAGVLPGPRVHDRSPGGLVGGVLG---------- >ERR1719389_1465843 ----RCNRKLGGSAKEEKLRrndgtrfvCKI---FKISRFLKQQPDASAVFG-F-DNN-DEDVHKTPKFIDFANHFVEVIDQAVQMLGPdfelLTDFFVDLGDKHSKEYGIKPKFYPILGRVFM----------------------------------- >tr|Q17153|Q17153_9BIVA Hemoglobin (2 domain) OS=Barbatia lima GN=hemoglobin PE=2 SV=1 ----QPANKGLIRETWNIVAGdRKNGVELMALLFEMAPDSKKEFRRLGDVSPA-NIPNNRKLNGHGITLWYALANFVDQLDNktdLEDVCRKFAVNHV-LRGVLDVKFAWIKEPLAELLKRKCGQRCTEKHVKAWWKLIDVVCAVLEEH-- >tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1 ----KPANKGLIRETWNMIAGdRKNGVELMALLFEMAPDSKKDFRRLGDVSPS-NIPNNRKLNGHGITLWYALMNFVDQLDNkidLEDVCRKFAVNHV-NRGVLDVKFAWIKEPLAELLRRKCGQTCTDQHIQAWWKLIDVVCAVLEEK-- >SRR5262245_28144535 --CVTEEQIARVRACFDELTPrtPEVVDRFLARFFAQNAPLRALFP--RDLS------------ALKQDFAAGFRHVVRHLHRldtIAPMLMDLGSRQA-RAGLTPGHFGMAREVLLTTLRDVAGPRWNEQLRQDWTEALNTVVSLMVVGA- >ERR1039457_5537378 ---AGPLNPALIRKSLALITagPPRGAGGFSRALFSFDPGVGGLVPA--G------------DERAER----APVRR-------------AGPDRR-AAX------------------------------------------------- >ERR1719498_600299 
--------INCVQHAWNVlIIEDRsreflraqesatfvyssciswFYSVFYSRLFNVHPLFRPRLNS--KG------------SKSGKSLVMMIATTINGLRDkdmFQRVVTEMAKNLC-SSGVKPVEYGILG--------------------------------------- >tr|A0A2H8TS68|A0A2H8TS68_9HEMI Neuroglobin (Fragment) OS=Melanaphis sacchari OX=742174 GN=ngb_3 PE=3 SV=1 --YLNKSQTALVKQSWPMITSNNFWTTFYINLFKRNPLYQLQFDRFANVP-FEELESNVHFLAHSFRTGFAFNTAIEHLEKpdeLHRILMDLGEKHR-KFRLTAEHFEAVKDILLCMIEDRIVLTdvpaRNILLVEAWKPCITLVIGVIM---- >SRR5215469_6657410 ---------RLCPVSQSQMSSvvGatTSaaHRITMSPIWVSpCYSFTWLAI--NRY------------TWDRFGLMTMIQTAVENMHQldqILPAVRDLGRRHA-GYGVKAADYNTVAGALLGTLEQALGSEFTSAVRNAWIAYYQTLAGEMKA--- >UPI00001F6528 status=active ---AIIDGLRDLSESFDTLaadeaatApaATELKaavegqfsgvfGAEYAKQTGKQPDTASYTLE---------------------HSAAALAQYHYIVRNphpLGQknKLDKV-AGEA-RYHALHARYHTMLNAYLERFGyydvflidldgdvvysvfkemdyatNLKTGPWRDSgLGRVFRSALESNDtkSTFFDDFA >ERR1712100_346632 ---------------------LFFFFFFFFFFFFFFFFFFFFFS-FKNV---EDLYESPMLKAHGKAVVGAVDAAVHLLDDvskLIPILEELEQFHN-RKKIVAAHYDVVGQAVVNVIGSALNG-LSEEQTNAWVKVYLTIKSVMLA--- >ERR550532_3561775 ---------------------GDSSVSPSGELCSPKTKTPRICSTVLE-----LTMHSADFQAHSGRVFGGLDTVISCLDDeatLVAELAHLKGQHDER-NIPDAYYRHFYQALEKVMNAMLGPCFNY---EAWDACGDIVFHGITGH-- >tr|A0A1I3HEN0|A0A1I3HEN0_9RHOB Nitric oxide dioxygenase OS=Jannaschia pohangensis GN=SAMN04488095_0565 PE=3 SV=1 --LVTNTQARLLSRSLRRISenGAPLARSFYAELFSAHPEVRPMFHS--DLS------------TQYAKFEDMLVVLVADVLNpgvILRPLQDLAKRHV-EYGVTREMYPIVGDIMMRTLRTLDAAPLTGDELEAWDVLLGRVNAFLMDE-- >tr|A0A1Q3FVI8|A0A1Q3FVI8_CULTA Putative globin 1 OS=Culex tarsalis OX=7177 PE=3 SV=1 -TGLTNHQKVALIGAWSLVkkDIISHGRNIFVRFFEENPKYLNYFD-FSQDRTASEIGENKSLHAHALNVMHFIGTLIDyGLYNpamFKCSLSKLMKNHL-KRGVKKEDVTIVCGVIMKYCLEVLDQHQSTTLQVAFASLMKGIADAFD---- >tr|A0A2M4DSC8|A0A2M4DSC8_ANODA Uncharacterized protein OS=Anopheles darlingi OX=43151 PE=3 SV=1 --------------MWCKPthQNpegSSDYISICVRLFQKYPHYTDYFD-FTDDTKADSLVDNKSLFAQSIHIVKAFGSLIEyGLKDprlFHETLKRIARWHE-QRNVYGCDVLLIGEVMLTYLTQTLGRQTPAMLGEAFQKLFQTISYRFP---- >tr|A0A0N8DLE0|A0A0N8DLE0_9CRUS Hemoglobin subunit 
theta-1 (Fragment) OS=Daphnia magna PE=3 SV=1 -LPLNARQKYSMLASWKGISraLEPTGVYMFIKLFEEHKELLSLFTKFHQLTTRDEQANSEELAEHASSVMSTLDESIRSLDNVDtflLYLHQVGQSHYKVEGFQKEYFWKIRNPFLEAVKMTLGDRYTENIENIYKVSINLVIETLVEGYE >ERR1719383_1265545 -------------HSWKEVGqapADEVAREIFRNIFAIEPGALELFP-FKNES-EDDLwREGGALTVHALKVVSTIDKAVSRLGNmdaVVPMLRKLGIMHV-GPRPQHLGNG-----APMSLP--------RRPTASWRRG------------- >ERR1719383_514948 ----------------------------------------------RGRL-VEGRwRFDSARVKSCVddrqGCVETWQHGRRR-----SNAPQVGNHAR-GLRCAQAHYDVVGQALVTTLASY--CTFTDPVKNAWIKLCGVIKATMVH--- >ERR1712000_66502 --------FPKVQKSWARVLeieakdeSKSFGPIFYNTLFTDFPFLKEqdFKSA--TM------------AEQKMNLPKFITTALSLLGDmpkAVDALQRLGMRHV-LYGTKDAYYPVVGANIIKTLKQILPANEFDQEtQEEWLTLYGVMQKTMIDA-- >SRR5258708_4037766 --------PGAVGPAPGLQPprNRPGARRGQPALMQSPSAGGPPPGP-HrpRR------------THRTPPRRAALVLLRRSLRDldeVVPGLRAMGARHV-RYGARPEHYPVVGAVLIDSMAEVAWDAWRPAYGRAWAAAFDVVSGAMLAG-- >tr|A0A1Y3AX51|A0A1Y3AX51_EURMA Globin-like protein (Fragment) OS=Euroglyphus maynei GN=BLA29_013533 PE=3 SV=1 ----------------------------------------QKFKSFKDIPINfqqnHLIRIDKKLIAHGTYVMYTIGMLVDNLERpdmMRQMLKRLSRNHY-RRRISLKAFERLRDTLLEHLSDILGKEiFHRKTMIAWHKAFGYLLKEIESN-- >SRR5688572_8260099 -----DQEINIVRQTWNRLAaehGNSVAEEFYKRLFECCPHLKDVFKN--DF------------EVHGKEFIENMDHIIIQLDNpcMIREMQILGIKYA-SYGIRYEDYECMKKALFDALKTKLAEHWTPTVMVSWIWFYSTVSHIMKH--- >tr|F2Q9X2|F2Q9X2_BRAFL Globin OS=Branchiostoma floridae GN=lGb7 PE=2 SV=1 -MSLSAADKKLVQESWDKVSkpsFADAGERVFLKLFRRNESTKAHFKKFKDIPS-DQLAGQAVVRDHGEKVCKVLDDFIKGLDGsGDEAVKKVGRMHK-GLGMSNEQIDQMKGAIIEVLADAgFGD---ANYKGAWGKLWDRFMAVHRA--- >tr|A0A1B0G6S0|A0A1B0G6S0_GLOMM Hemoglobin-like flavoprotein OS=Glossina morsitans morsitans PE=3 SV=1 YSTMNSDEVYEIKRTWEIPatTPTESGVAILIRFFTKYPSNLQKFSTFKDMTL-DELKNNPRFKAHANRIMKVFDDSIKTLDDncshLEEIWTKIAQSHF-NRQIEKQSFNELKEVILEVLVAACN--LNDQQTEIWLKLLDFVYEIIFKT-- >tr|V5YM54|V5YM54_9DIPT Globin OS=Polypedilum nubifer GN=PnHb18 PE=2 SV=1 
IVALTEADVEIIKRTWKIPsaNPHDSAALIFSTFLEKYPHNQQKFPAFKDKPL-SDIKNTVEFRAHASRIFNVFSSVIDGLDRdtemmkgIKKIIAEVGKFHA-KKKVTKKAHNEVRSVLVDILIEVCK--LSDEEKAAWTKLLDIFFHVMFEC-- >tr|O96457|O96457_9MUSC Hemoglobin OS=Gasterophilus intestinalis GN=glob1 PE=1 SV=1 ---MNSEEVNDIKRTWEVVaaKMTEAGVEMLKRYFKKYPHNLNHFPWFKEIPF-DDLPENARFKTHGTRILRQVDEGVKALSVdfgdkkFDDVWKKLAQTHH-EKKVERRSYNELKDIIIEVVCSCVK--LNEKQVHAYHKFFDRAYDIAFAE-- >SRR4051794_9566520 ---------------KALVEdvAERghrrPMEVFYGARsdhdlydidtmlrmAQSHPWLS-VRPV--VA------------TGpaggPMNSLSGQLPDAVRQYGPwreYDAYLSGPPGMIR--NGVD----ALVGVGV---PSDRIRHDSVEELVAAGDX-------------- >SRR5215470_9890699 -----DFDRGPIRELLKHLAvePDAAMEYLFARLFAAHPDLRGLFPY--GM------------TQTRAAVFGELAAIIGGLDDqerTEQTLARLALGHR-KFGVKDKHYEPFFDAMFVTAQHAAGAAWTGEMAASWRSALDWFGSVMAA--- >SRR5262249_54331370 --IRLRK-------EIDNEWllIASgVLSVIFGLILVAQPGTGALA---------------------LLYVIGIYAILYGILGPrpcCV----------N-RFGAQTALDRG-----------------TSTYRELWNIS----VARLIG--- >SRR4029079_9820506 -VRVDGILVEGLQASLATMQpaAAQIAHGFYTLLFARRPDFRAMFP--EDM------------AAQERKLIATLAFVCEHWRKpaaVSVRLADLGALHQ-GLHVKPEHYPIVCDALVTAVMKHRHEALGPHRAR------------------ >ERR1719310_1734953 ----SASSVKAVQASWAKAEnigLRVVGELFFKELFEASPAAKELFTA--Q-KFGEDAAGQRRFKAHTLNVMQTLSAAVYGLSDlsaLARTLPAPTYAIL-SLSFTLISFTSL--------------SLTPLI-------------------- >ERR1712087_347811 --------------------------------------HEELFTA--QKKFGEDAAGKAHFKAHTLNVMQTLAAAVYGLSDlsaLARTLPARIYAIL-SLSFTLITFTSLSLTPLIYHTLTLKGARARNSGRaaPWIRRPT----------- >tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii GN=F775_23753 PE=3 SV=1 -MAFSEAQEELVLRSWKAMkpDSESIALKFFLRIFEIAPAAKPMFPFLRDAGEDAPLESHPKLKAHAVTVFVMACESATQLRktgDvkvREATLRRLGATHV-RAGVADAHFEVVKTALLDTIEGAVPEMWTPEMKAAWEEAYDQLAAAIKEEM- >SRR5262245_14739337 --PCARARLRPR-------RpaL------Y-AQALPPRRLVPRPVRE--L------------AEAQSRKFMAGLKLGIIALNyedGLTPVIRLVGVRNR-RAGIKVRHHRVMAKALLPTLEQSLETRFTRDTKHAWSSFLTQVTRILSG--- >SRR6266699_2273235 
--FFLPFKE-LTEQHFSILGlrkARRAGLVLAQELFEHAPNVGARHSN--AF------------GGRGYCRRMR---------PRtap------VCDSAR-CWAPSCRRQ---APLALR-------------------------SCRPVR--- >tr|A0A084QEN9|A0A084QEN9_STAC4 Uncharacterized protein OS=Stachybotrys chlorohalonata (strain IBT 40285) GN=S40285_06080 PE=4 SV=1 ------------------------------------------------------------MEKYPRIDIRSPAGVSIIYKDvssLDPAQEEIRVLHL-HGG---PEDSPIECTLHKiALKSNPPPVYE-ALSYTWGDAsvtreIVL-NGHVVS--- >ERR1712224_896978 -GCLSHRQSTLIRGSLPMLraQGETITSSFYASLLSAHPELHNIFNS-AN----------QATGRQPRALLNIILAFAAAPNHtaeLIPRLERVCQKHC-SLGIRLTSTTSSASTSS---GPLARSS------------------------- >tr|L8LYK6|L8LYK6_9CYAN Hemoglobin-like flavoprotein OS=Xenococcus sp. PCC 7305 GN=Xen7305DRAFT_00009490 PE=4 SV=1 ----MSLQIGLLEQSFNCIRPyGkLFVSSFHENLFQTNPEIKSLFMGV-E------------SQIQKNRIWDTLVLIMENIrhpNLLNNTLQGLGARLF-THGLLPKHYPLVKKAFLATFKQFLGNEWNSELEQAWKNAYTYFHDLMQEG-- >ERR1022692_2453048 --------XMSLPASFTSICngilGREE--------NSGCPAAKGQFLP--DR------------DAWrRssaLLLFGPLHQASRSTGYvshLHegaArppgrRispDRRPGRQAG-RSGRLRAGPRAGPPQVRGHRRALRRGRRQPAGDTGAFRGRHLDARVMIE--- >tr|S0BCU7|S0BCU7_LAMSA Extracellular globin OS=Lamellibrachia satsuma OX=104711 GN=v2hb-B2 PE=1 SV=1 ---CTTEDRREMQLMWANVWsaqftgrRLAIAQAVFKDLFAHVPDAVGLFDRV-HGT----EIDSSEFKAHCIRVVNGLDSAIGLLSDpstLNEQLSHLATQHQERAGVTKGGFSAIAQSFLRVMPQV-ASCFNP---DAWSRCFNRITNGMTEG-- >tr|A0A1Y1ILY9|A0A1Y1ILY9_KLENI Cytochrome b5 isoform OS=Klebsormidium nitens GN=KFL_008610010 PE=3 SV=1 -PHLTTSDVKLVQESWAKVVeahGVGAVTLFYVNLFTLAPHLESLFKKTKN--------------IQEAMFTDMMMTLVGKLHDwewVVSALEASAIRHL-RYGVSVSMFPAVGQALLQTLDMGLGVHWTPEVKAAWIKLWTAIVSVMSVHL- >SRR5579875_3194573 -------------------------------------------------------------SRCCSRATPSYGRCSRSRCrgpgrrsAtgsPSSSATCRRPGAR-RSCSRRWPGITAGSASvtgtTGRSSRRSGPAWTAELDAAWLAATDWFVSVLAA--- >tr|A0A0L8P0I1|A0A0L8P0I1_KITAU Flavohemoprotein OS=Kitasatospora aureofaciens GN=ADK78_37645 PE=4 SV=1 
----GAADQRVITEYLELVTpfGE-LITHLYETMFRRWPYLRSLFPE--SM------------EFQRAHLARAFWYLIENLHRpddIAEVFGRLGRDHR-KLGVRPVHFQAFEAALCEALRRTAGPRWADAVEQAWVRMLRFAVAAMVS--- >SRR5688572_1436081 --RPAPEVIAAVSASCQAVAdrPVRLAEAFYEHLFEIAPQARTMFP--ADMT------------AQMQRMSDTLVGAIAQLEKfdtaqLEAALRRLGADHRTRHGVEAEQYRYVGHALTRAVRDVAGLAYSGALSSAWIAVYQYIEAHMSAG-- >SRR5947208_57978 --EMTPEQIALVQHSIEVLGprVDTVVERFYQHLFEIDPSVVELFST--DP----------A--VQRRKFeveLRQIIKAISGFDEFAGRAHDLGIRHS-HYGVRARHYRSVGDSLWWAWQSVMGSAVDSEHSKVGEAAQDV---------- >SRR3954454_13764990 ----VLDPAMLVQSTFALVArqRQRFSERFYANLFAIAPETEVQFAG-TPP------------ELRDRMFVEILFLVARSMSrvdEIAPALTELGARHV-AYGTLGSQLPLAKRALLAALRELLGDAMTAEVEAAWSETYDAMAEPMARGM- >SRR5579864_8015183 ----KPDPIFLVHTSFVHLRprMAEFVSNFFRRLLKDSPELAPIFED-ADS------------VRLKTMVAKIFGTTIAGPEqtdQVEADLAELSRRHK-SYGAIPDFLPLVGRAFIATIRESLPDDTTPQTIEAWELLYANTAALMSKGL- >ERR1719483_919245 MAVLSKSESDLIYKSWALAAdeKEKHGGAFMVRLFTEHPEVQaKYFPKM-DMN------DFMLLSKHGSKIMAAVDTLVNYVNDgndekLVKTINHVASSHF-RRGVVTrEAFEIVTEVLMNYLITTLGDHLSPEAQLAWKKLLSVLVEVIA---- >ERR1711860_359782 ----LFSKSNYVFAS---------LSRNTFKLFKDERSLYeKHFSSF-DVN------DILRIRAHGLKVMKAVNSMVEAVSDendesLIDQIHFVAHGHH-LRGITPrNEFEVRRKILNLDYHLLFHyllkkGCLSQSX-------------------- >SRR6266545_1588040 -------------CDLEQAVdtCPA----------A---LVIGLRP--ATMG------------TL---------CYMGGLAsa------AVCCWRHV-RVVTCSQFF-------------------------------TTASPQSRQ--- >DeetaT_16_FD_contig_41_1516467_length_281_multi_3_in_0_out_0_1 # 3 # 167 # 1 # ID=1772959_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.418 ------RRMNLVKQTWRSVEfglGHKATQAFYDRLFANHLDTRRLFAG-VGM------------EGQSRKLYDLLRLAVRSLDDldaIIPTVQEMGRRHARSYGVVRDHYGAVTQAFIEILHQYICSqlghmahsRYLVDVADAWAWCLNLIGNIMAD--- >ERR1719433_537024 --ALRISIVGREKRA-NCTVtlgRVEQGELQVGATVLLVPPGAECGVQSvevdgREVRSAqagefVCMRLLgcQP---SVGHALSSVD---GPLRSatkLKVRSAQAGEFV------------------------------------------------------ >ERR1719161_1849694 
--ALRVMVLGMTADKVG-AAlegHVEQGTLRAGTRCLAAlsEGQAECNVQIvllngVEVSHAgpgehVRLKVTgaAAKGFTAGQVLSCIS---NPVRAigkFKAKLRLMSLPEM-LS----------CSLLVL---------------------------------- >ERR1719277_2163216 --EATDAMKGAVQRSWDQIQalgTTVVGEHVYRYFFELVPEAVNCFPVHvrlkyREwiADEPdenGDLRNSAALRNLFAKVLNAIGCTVAGLQDaskLVPLLSSLGARHI-GYGVSEEFWPALGKAINRTLQDLLAEAFTPEVENAWNTVYGFMSQIMVESLR >tr|A0A2G8RXV1|A0A2G8RXV1_9APHY Uncharacterized protein OS=Ganoderma sinense ZZ0214-1 OX=1077348 GN=GSI_12102 PE=3 SV=1 PKPLTAEQRKLITAIVPVLEqhGKTITTLMYNQMLEENPALKNVFSKS-----------KQERGQQPEVLARSLYAYASHIEDlgpIMPFVERIAHKHA-SVHVEPAHYDVVAKYLTNAIIQVVGaDVLAGALYDAWIAAYWNLAYVFIDR-- >ERR1712080_154454 -----DLQKIIVKHQWARSYnegmsREYFGQAIWRAFFKLDPGARRFFTRVRGD-----DISHPKFQAHSLRILGGIDMCLSLIDDvptFEAQMKHLQGQHI-EREVPSYYFDRLGTVLQEVMRAATGYCYDE---VAWGACYKYISDRIKANY- >tr|A0A0S2MLM1|A0A0S2MLM1_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1 -----PLDRILVKAEWAMASdgghkDSELGSSIFRALVNIDPALRGTFSAVGGE-----DMGSAQFRAFAFRVVAGIERLIAVLDVdavLSADLAVLHSQHV-ARDVSAANYESMLSAIMSVVPSAvGNSCFSS---PSWSRCLNVIAAAM----- >tr|A0A066YRR6|A0A066YRR6_9ACTN Putative oxidoreductase OS=Kitasatospora cheerisanensis KCTC 2395 GN=KCH_40190 PE=4 SV=1 --PPDAADLALAGAVLAALRpvADRAMAHFFALMFLRHPELRAVFPA--A------------MDGPREQLLRVLRECVRHGDDpaaLRDRLGPLARRCR-KYGVLSGHYASAADCLVEALARYG-SGWDERAEAAWRRLLAPVARLLVEA-- >ERR1719329_2046659 -----------IKTVWAKIMkevgTLNAGTMLFKNVFMLAPETKQLFPKFRHLK-DDLLLSNESFKNQAKLSISALSNAIMSFDDppkLKRMLMDLGRIYE-SKGVSLATLPIVGNALMATIEAALGNDSCIETFNFFALFYNEGSNMLAEGYK >ERR1719265_1860150 -------------------------------------QALNYFPRFKMnnlLF-SDALFEDEIFKIHAYKLINAITNAIDLLDEpvkLTETLKHLGRIHE-NKGIPAESFVVIINAFNVTVANLISRDSSIETINFFALFMNEGTNLMTDGX- >SRR3569832_2958212 ----PALVRSAPDSAAALRrcRCGGTAEKIAERARADD----------------------------------------PESEKsrgAGADDERIGRTAQ-AIRCSAGRLSSGACCAVGGHGGIGGX-------------------------- >HubBroStandDraft_4_1064222.scaffolds.fasta_scaffold919957_1 # 1 # 597 # -1 # 
ID=919957_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.524 -HPMDPSRVMRLRISHGWFAPcgEALVARCFQILGEQTPGTRSLFP--ADTA------------SLHPRILRTLRQVLSNAHEfrtLEPPLARLGEKLQ-RRAGGvehlLPHAAAFRDAFICVLAEAGGRSFTHQMEQDWRMLLDGVLGAMIAG-- >tr|Q1GDP0|Q1GDP0_RUEST Globin OS=Ruegeria sp. (strain TM1040) GN=TM1040_2494 PE=4 SV=1 -AILRQIEVQLIKVSFNRVFaqKAALAEKFYHHLFLELPDAEVMFT--RDFS------------HQTEMFARVLTTGMQSLGRdreMMVLVDDLLQRHK-HLGLTLDQMYTAQRALHLAFCEVMQAELTAAEVSAWDNAIGRLCRALAAGI- >ERR1043166_6829872 -LNLTADEIDRVRTSFDQVWaiSSRMADLFYDRLFAGNPFARSLFPA--QQ------------DERKQNFMLNLAVIVAGLDEradMDRSEERLVQAHA-EAGIRVDQSEVMRDALFWSLEQGLGPAWTPGVAAAWRKAYRLLSEHMAS--- >tr|A0A257MW93|A0A257MW93_9GAMM Uncharacterized protein OS=Methylococcaceae bacterium NSP1-2 GN=CG439_2278 PE=4 SV=1 ---VKVKNRLLVKLCIDEISpkIDIVSQLFYQELFHLNIHLKTIFSG--NVT------------FLNRKFINMMAtfKNVKHLEAIENSVEKMGERHVLHYRVQLKHFPTLKKALLLALKKHLGERFNAELEAAWHEVFDDVAEIMQRA-- >SRR5690554_3276444 ----xmSDADRLQVQASVERIRgqMDGFAGCFFDKLFALQPALRELLAT--E-E------------GRRSKLRSMVStlANSRDFDKIAPAIRRLGDRHR-DYGVGVQDYVPVQQALLHAVAQVDPQGQSEQVQQAWSGQFQRISALMEPQ-- >UPI00042C7A07 status=active ---MNDTQRLLVKADIDSLGndINALSQIFYRELFHIDINLKSVFPG--NVV------------FLNRKFANMLAtfKNLGHLEKIGASLEKMGERHLANYGVQLENFAPVRAALLIALRSYFKENFDAEREAAWQAVFDKVADIMKAA-- >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold510383_1 # 42 # 362 # 1 # ID=510383_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.393 ----mTSKDRALLKECVEYIEsesINELCDIFYKKLFDLDPKIKLILSD--NDV------------VLRRKFFNMFStfKSVKYIDKVSEIILQMGARHK-SYGINEKHLELMKEPLFESLHEVLGDEKFNYYKAGWEIGYQEVENLFKEG-- >ERR1700737_3653126 MTALTADQIARVKATAPVLAehGVTITKHFYKRMFTNHPEWKNVFNQ-AHQQS----------ASQPQALARAVYAYAAHIDNlraLGSAVSHIANKHA-SLNIRPEKYPTCGKICWRQYPKCWAIPSMNPRSTPGAPLMRNSRRFLSGR-- >SRR5919197_656730 --LLDDDTIGLLDESLRLIDdrSDVVVNHFYAAQFATPPPRGLLGSR--AR------------GC--------LGRGVR-----RDGPGDVGRRSR-GGGGRAGLV--EGRD------------------------------------- >SRR5919106_2778213 
----------------------A-VDRFYAA-VLGDPELAGYFTDvdidrvkrhqvlllsdvlggpesyDGPD------------LGQAHRGlgitdghyDKVVGYLVAVFTDLgadGDTIAAAAEVL----ASVK---PQ----I---VEDQAGSRDSHEX-------------------- >tr|F4F3R7|F4F3R7_VERMA Oxidoreductase FAD/NAD(P)-binding domain-containing protein OS=Verrucosispora maris (strain AB-18-032) GN=VAB18032_21340 PE=4 S -------MRDHPAAEVGGIAeavFGRAAARFWDTVQEGCPGLLP--------------------EGDAPLILAGLLRLVGGGDDRpgrLALLTVLGRVYR-EHRLRPDHAALVGA----ALT--VAVPSMPPEAATWRRA----WRlVERA--- >tr|A0A2T3A5F4|A0A2T3A5F4_9PEZI Flavohemoglobin OS=Coniella lustricola OX=2025994 GN=BD289DRAFT_370338 PE=3 SV=1 --ALTFKEAQLVKSTIPFLReqGEELSNLVYGNLVKRNPELNNKLNVI-HLQDG-------RLARALTVVILRFACNINDMSELIPKFERVCNKHC-TVGVQPMHYELLGALVIEAFESLMGDALTPEIRAAWTKAYSILSHMLIGR-- >SRR5439155_13306073 -VLLD-------GGTLRAVRmsGDTRSEPWLKDLWERGVAVGELRRHLllpletppGLP------------VPRGRILCNCFDVAESEIDAfla-------------------------T-SNSIAELqarlkCGTNCGSCLPELRRKSLCDIG----------- >ERR1043166_8897093 ---GTRDQADIVQLTWHSVLpvGGTFAELFYGRLFALDPEVRRLFKD--DI------------VEQGRNLTAMLSVATANLVKperVGRPPGGLHFRRK-D--VDQRVLEREEERVLHQRemlrPHAVSGVALAELMERHADAP---GGVHRHA-- >Wag4MinimDraft_6_1082665.scaffolds.fasta_scaffold479856_1 # 2 # 223 # 1 # ID=479856_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.387 -IALE-------DGRLRAVRlaGDTRAASALLELWERQAPVDAEDLPEtPAH------------ASRGRIICNCYDVSETEIAAy----------------------------RSLADLqaalrCGTSCGSCLPELRAKFGVIPR----------- >tr|A0A2B4SBA2|A0A2B4SBA2_STYPI Serine palmitoyltransferase 2 OS=Stylophora pistillata OX=50429 GN=Sptlc2 PE=3 SV=1 --QISQKQISLVQETWGLVsgDLEKVGVDFYMRLFKANPDVLQLFS-FRDIDKSsdDIMRADDRLKRQGLVTMQHVDLAVNSLNDlgsIVPALRDLGGRHA-MYKVEEHHYVLVGSVLLDTLNNGLGDNFTVEL--FWAALLNTLDKGLGE--- >tr|A0A0C1L0Z1|A0A0C1L0Z1_9BACT Uncharacterized protein OS=Flavihumibacter solisilvae OX=1349421 GN=OI18_18680 PE=4 SV=1 -MEMTPRQMQCVRNSWRNFrdlDPAFFSEPFYAKLFADHPAAKKVFGD--NL------------AEHFSFLHEMLSQLVSRIDRPdqlLITCSRIARNNA-ALGMNEKFYEWYGHALIWTLRQGAGADWNMETEQSWISYYKYLVD------- 
>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3668839_2 # 105 # 377 # 1 # ID=3668839_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.656 --------SGPLAASLAIFEprLEAVTARLVDVLAASSPHLLALFPP-SSE------------PS-----AALLGRFLTRIVEtesLGqPLGDGLGLDAY-PIP-TRDQWEHLVESFIWSLSAVAGKAFSPPMARAWRATGERLFSTMFES-- >LULI01.1.fsa_nt_gb|LULI01000097.1|_29 # 27187 # 28320 # 1 # ID=97_29;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.310 ----------------DEIKgrH---HSMFVDEFERQQPQYKD---------------------------------FWARL---NrGEYQAGEYRRY-GKG-GKEVWIQA---------------------------------------- >SRR5947209_9205436 --------VLSVLRSpssplF---PyttlfRSRltver--DSERDVLMvaggtGIATMRAL--LD--DLA-------------QWgENPRVHLFYGGRTDDDlyaLDd--LHQLDRKST-RLNSSHANISY---Avfclk------------------------------------- >SRR5690606_15697619 --------VRVVAGGwvsralvrqtvpgdrW---RvgapMGElwrdr--DVQRDLVLiaggtGVAPLHAV--VE--DLA-------------GRatQPSSVTLFFGGPTADAlyfLPe--LRELAADLP-WLKLVP--------Vte----------dgsvddgergklPEVVTALGGAWSGHDVLVAGSPGMI-- >SRR5919202_1970091 --------VQMVPGGqvsstmvrslkvgetV---RlgapLGQaltlyag--ERHRDLIMvavgtGLAPLRAH--LE--RIDQ-----------EwqSTgRAPRVRLFHGARLPWGlyeNRl--LQNLAG-RP-WFTYTP--------Vvsddp----------typgrkgwvGDAAAVS-GPLHGLLALVCGSPEMV-- >tr|A0A1D8N423|A0A1D8N423_YARLL Uncharacterized protein OS=Yarrowia lipolytica GN=YALI1_A07937g PE=3 SV=1 -FNMTREDINLTKELWAKLMndPEtlessaaygtptaLFCEQFYTNLMASHAELTSIFP---SI------------KKQSVAVAGVFGLAIKSLDHiekLDEFLWSVGKRHNRMIGVEPIHYRWLGEAMIKTFADRFGDSFTLEMETAWIKIYSYLANKLL---- >SRR6266851_2503075 -----------------------------------------------XM------------RNGSASLPLwPARYGAWTTRRpspNISAPSRSTI-----------ANSVCGRAITNWSARRCSPPSVSSAASGWEAAFNRIATIMIQ--- >SRR6059036_2276597 --ALFPGTSHWVV---AAGMarP-ESKDHPMLTVAQKTLVQ-------DTFA------------IITPIADDAAALLYKKLFEldpSLERM------------------------------------------------------------- >SoimicMinimDraft_1059729.scaffolds.fasta_scaffold91729_1 # 2 # 175 # -1 # 
ID=91729_1;partial=10;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.661 ----SMEDRLEMIHEWETVWsaeftgrRVLIAQELFSRLFEKDGTTQALFKNVG-G----DDVNSALFKAHCVRITDSIDTIVHMASYtdvEHQLLDHLGDQHAHYDGVLGSHFKLFRECFLEVLPQAIP-CFNS---GAWGRCLKVFQDEIALH-- >ERR1700754_2066947 -------DPGdrQLARELLAGAagGDDLDALvehDRGAVLEIAREAVPVaLAQ-ADR------------DdQLGHLGA--------------DRlLRGPAERPL-GRGAPLQDVALVvhrddavergqqqRAVALAAGAELVGEIWERQERGSLtARRYGSNRSI------ >SRR5208337_544005 --TMTPQQTRLLAQSYAKLEnrLYELGSAIFERLFEIDPHSRPLFK--GNMD------------EQKLKLARLFGEFIRIRarsqhflpvtgkagQVVIPGIGSLGARHEMVYGVRPEQYAHMRDAVLYAIRSLLGNDYNDEIGQAWSEIFDMLAHAMQE--- >tr|A0A2A6CNA4|A0A2A6CNA4_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_32112 PE=3 SV=1 --QCNPRYTALLKSTWSDDfEvLFALGAKMYITAFEgpHGVACKSLFPWVAKYEeAGENYADKSEFRLQALRLVQTIVKALDKVDDlqkLEAYLYAVGHRHV-FYlpvWLDPVYWDVFKasratsylgqstmlksaserDAVQVGVNDHLHKlsKLSTddlaRATLIWTDIIEYIFEYVKEGF- >SRR5437763_1847173 ------------------------------------------------------------MVRQKRHMVALLSQVLGGPKQy---QGRDLAEAHR-SLGISGLHYERVGNYLLASLLiaqapydvinavtdvlagqrdKIVAAAWAAELAADWTDAYSLVARVMVE--- >ERR1719244_1430206 -TGLSRKQRFLLKGSWKGVSrdLESTGVSWFLELFETCPNARGSLRQFSHISLDDDLTENQPFREMTEKVLERLDNALFSIEDadsMRSILLETGDYLRSVVGLNNDIILQSEGPLLSAIQRTLDERYTPQMEVIYTVIVKFMINTMVEX-- >ERR1712228_920792 -----------------------------------------------HISLNEDLTEVQQFREMTEKVLERLDNALFSIEDadsMRSILLEAGDYLRSVVGLNNDIIMRSEGPLLSAIKDFRREIhttngsdlhsdskihdkYNGRMRPL----------------- >SRR2546430_6350501 ----GRResRVRGGQGGWV---sRAIVAEPQRGDVGRSGPAMGRMKVD--RG-------------AGRDVVMVAGGT------GlapMRAIIDDL----A-QWGENPRvhlfyggrgrggPYH------PPSLVSTAAAqPGVPVVavagaeaglshkeagspagggvrHGALAGRG------------ >SRR6195952_1380156 ----VALAGEAVRAIWFRLAdqEADVAHWFGALLFSLAPHLRAQFPA--QA------------DRAARRLLRASIAAMSAVDRpqeFPAAIGTLARETR-ALGLDASADEPVGVALVGAVREFAGELWAPGADAAWVLAYSLAAEPARR--- >ERR1700709_350262 
---------------------------------------GDLDAD--AT-------------AERELLVVAGGRRGGVGpaprGepaGpsgAGGGRPPRPARLA-AGVDVRRttvivgartaedLHT------LDRFAVIGEDaPWLAVVgacesdplelglapgpvvegitrAGPWLEHDVVVA-------- >ERR1700709_656719 ----------------------------------------------------------------ADVVAVAGGP------GasgALALGDDLAAQAA-AGVDVRPttvivggrtpedLHT------LDRFAVIGEDaPWLAVGgacesdpldlelapgtvveaitrAGPWLEHDVVVA-------- >SRR5262245_28534727 -------efHVKTVPGGWV---sASMVNDTQVGDEWKIGPPIGLLGLV--TH-------------SQRDLLLIGGGV------GvapIMSIVPEL----L-RRRSSNRvslfhgvrypheLYL------NGTLDDLAARdPNLEVVkvvsrdrnyagitgslpdvvaqHRDWSAYDVVVS-------- >SRR3569833_3303276 ---------------------------------------------------------------------------------pNNTNHDKH----T-HRKRNPPehqniggkrpedLYV------LDDLRRLTAVsKWLTVTgvteegaipggdrgtlahavaqRGVWEYYDILVS-------- >tr|A0A161TXB5|A0A161TXB5_9DIPT Globin 11 OS=Chironomus riparius OX=315576 PE=2 SV=1 -ATLNADEAKLVKGSWDKVKGQE--DGILYAIFKENPDIQAKFPAFVGKN-LEEIKSNDDFTKHADRIVAAVSKYIELVGNeantpaIKTLLNELGQTHR-SRGATKEQFEKFKSSVAKYLKEHSG-AWSDATGAAWNKAFDEMYAIVFSSL- >tr|V5YNC2|V5YNC2_9DIPT Globin OS=Polypedilum nubifer OX=54969 GN=PnHb4 PE=2 SV=1 -ATLTESEANSVKTSWNLVKDKE--DEILYAIFKENPDIQARFPLFVSKN-LEEIKTSADFKTHADKIVKAISTYINLLGNeantpaIKTTLNELGQRHK-DRGATTEQFEKFKVSVLKYVKEHAT-GLTADAENAWNKAFEEMYKIVFANL- >tr|Q23764|Q23764_CHITU Hemoglobin IA (Fragment) OS=Chironomus thummi OX=7154 PE=4 SV=1 ---------------------------------------------------------------------------------tILAKAKDFGKSHK-SRTS-PAQLDNFRKSLVVYLKGAT--KWDSAVESSWAPVLDFVFSTLKNEL- >ERR1712170_324299 -------------------------------------------------------rVCREKLNVHALCVVAMIDKGISVLDKpcdFVELLLIHGRRHK-NHGVARKTFQTLGNFFIQSFKEVLEDDWTDEIEAAWKIFFRFLNIGLEAGY- >SRR5688572_12388254 --SMNEEQIKLVETGFQSITgrGERFISRFYENFFAASPKAEKLFAQT-EW------------PNQSRKMLLTIMMVVDNLRDaahIKKMLHEANLVHQ-KFTLQADDFDALTDAMLRTLREFLTDDWSKEAEDAWRAAFAKINAIMLEA-- >ERR1044072_9602616 
-------LEQSGYTVVGRAAdaRELmLKVRSYVPDVA--------VVD--VR------------MPP------DL--------TddgLRAAAEI-RRSHptV-SVlVLSQHREPAYMLELVGDDASGVGYLL-KDRVRDVTQFVDAVQRVAAGG-- >SRR4051794_28399871 -------EHEAGTDLLELTD------ALVRAGVPCADAAQEAVAG--VE------------LPHGAQLPAER--------LadrLERRRVD---------lD------------------------------RLLRFGEDAG-HLVLGA-- >SRR6266545_7915566 -------ELDTLETTFDLLAprGEELMDIFYARLFAAAPGGRAAVRR--HR------------PSPPEGSPPRR---------ARAPAQV---------aA------------------------------QPRCDRPDAA--------- >SRR4029453_17830486 -------DLQALETSFDLVAsrGDVLMDVFYARLfaaapa------VKPLFAG-TDP------------RRQKAMLLGALVRLRGSLRGppaFVPPLPRPGAGPggE-APlrrhrSPAPEGHAARGPraaAWLPARPAGVRSGaatPRGQARRLWRPAGALPGGRRgpdrLHG-- >SRR3546814_7943381 --------------------------------------vfirlslsliiilvyRFLFFFF-SSR----------RR-HTRCVLVTGVQTCALPIS----TDELIA-------AWAAAYGQ--------------------------------LADLLIA--- >ERR1700737_1149585 ---------------------------------------------------------------------------------kqPDGSAEKHFEQAC-ESGRPTGAVSHCRGTPAGCDQGSVGRRRNRRDHFHRGKGYGNLADILMG--- >tr|A0A255XUI9|A0A255XUI9_9PROT Uncharacterized protein OS=Elstera cyanobacteriorum GN=CHR90_04515 PE=4 SV=1 -PMLSSQSIATVKATAPALRphGLNLVVRTYELLLRDPNI-RMLFDP-A--------------rqvnGDQQHIFAETVIAYVNAMDRldtLKATVKHLTIQQA-LLDAQPQHYDAIAIALIQAIHELFGKDAVREITSAWTEALDVLHQESPG--- >ERR1043165_5678211 ------------------------------------TAglktrkpkgltdsdmdilvpvtA--------------------------ALFLAGMTAYIGILA----LRELSATRLA-SATAAVEHAF--------------------------------LREQISE--- >SRR6476660_7153442 QYMLPQRTIDIVKSTAPILEehGETLTAHFYRRMFAYNPEVAPLFNP-A----------HQRAGSQQKALAAAICAYAANIDNlevLGGAVELIAQKHA-SLRILPEHVRITPESEIISSFYLQpADGGGLPLFKP-GQYITVRVPDARG--- >tr|A0A2D6MWT2|A0A2D6MWT2_9DELT Uncharacterized protein OS=Deltaproteobacteria bacterium OX=2026735 GN=CL908_18525 PE=3 SV=1 -----SEVAERLRSSLEIIAEceATFIRRVYEDLFEQHPKTAELFGG--HS------------RAvRGEMVREVLMYAIEHNEGaswVEENLASLGDQHE-VNGVTLEMYGWFVDSLLRIFAEVSGPDWCAELEGSWRTALELVSDLMSSPE- >SRR3954454_17009507 
--PFDPATVAVVRASVTKLpsEPIELTREFYRQLFEIAPQARVLFAE--DMT------------DQTERLLSAILAGVRAMDRpelVEDHLRRWGVVHRRMHGVTNDLYVYVGHALIRALHRIFGH-LETSVSSAWIAVYEWMAAVMIDG-- >ERR1719446_1443192 -----------------------------------------------------------------------------LAQDlsaLCPE---CGFK------VG--TMGVC---QTK------ANDAAIE-----------AKDPPVAT-- >SRR6187402_970848 --GITTADTLLVQTSWNTVSefSTKIIAGFYKHLFASEPEVRPLFKS--NQS------------VQEKRMALMINTIVNSadsLDEFRGSIAQLAKSHV-HMGVKNEYFPIVVKAIISSVEEQYGKGFTSAHKKAWYKILNQISAIMMEE-- >SRR5215510_10546783 -----------------CLDrcRLFVVFYLIACiivlffFFQAEDGIRDGHVtgvqT--CA------------LPIWARLLGAIVTAVQTIEDperFDGYLRALGRDHR-KFHVEPAHFGVVGAALLDALREFSGTQWSHAFEQAWRDAYGMMARKMLA--- >ERR1719150_2276450 -MGLTKAQVAAIQNNWATVSqnMQDVGDALFMRYLTANPGDLSFFPKFQGAGVGPQLHSNEDFQHQTLTVMQFLGQIVAHLGDIPaaeGMLRERVKTHH-PRGISMAQFERLLDLVPRLVQEICGA--SGPTADAWRVAVATLMPSMRDEF- >tr|A0A1K0GS94|A0A1K0GS94_9ACTN Globin OS=Couchioplanes caeruleus subsp. caeruleus OX=56427 GN=BG844_22340 PE=4 SV=1 --GMNPaddaelhAVQRLLISSLEQAGgQVEVATRLRAALAQAGPALFARIP--GGP------------LAQVEQLAEGLAWLAQHTDqPpaLVAGFGRLGAVLA-ECGIAPQQLQLAGAALAEAMRAgMAANGWRQDYDQAwrstWQHAYQWIAHGMVAA-- >ERR1719193_2756600 ----------------------------FM--EKKVPSVIV------FLN-SLSLDDDGALETHALSVMNSVNKVVSRLDQpdrLVQLLHDLGRKHI-SYKANMAFLEPIAKHFILTIKPSVA-EWSPEIEDAWQQAFKVIGHIMQE--- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold5203666_1 # 3 # 269 # -1 # ID=5203666_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.315 -----------------------------------------------------------DFESQGRALTRMLAWIIQNMSNvsqLVPVLAQMGGRHE-IYGVKDADFGTFATTVANSFRSVLGPEIiDDDAHQAWESCISGIGGLMQL--- >SRR5215203_6923026 --PGDSGADRAGRAD---AerDQAGLRRGRG-RLLPPAVRRRPLRggavhhrA--GH----------PTgEADRGAGCGDALDQAPRRVPAPgrh-ARPAAPGLRG-----------------PPAALRHRAG--------------------------- >SRR5215208_6178010 ----GRGRPRPDTAIIRRGVagQPTIRHLFYDRLFEHDPETRLLFR--SDLD------------RQRLRLLTMITAMVGPASDdls---------ATNA-GhAGVPPWRWLSLA-----NARDVADP-------------------------- 
>tr|A0A074ZZ62|A0A074ZZ62_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_01589 PE=3 SV=1 -----------MFDELPPATdhLSKK--ITSGRA---LGMICSNAN-VHTLS-NEEIAADTRSKQHILAFMDVLSKAIGALDGgredFCEKLMVLGARHAAIPGMKLEYFKVFKQAILMTWEALMYEEFTEDVRRAWAHLMDYIIGILSEG-- >tr|A0A2A2WQA6|A0A2A2WQA6_9ACTN Oxidoreductase OS=Dietzia natronolimnaea OX=161920 GN=CEY15_08520 PE=4 SV=1 -----STATPPLLALRDLVTDPRFTDLFARALREADPDFRELFPR--DA------------SGVLGEFVRAMSWALETVEnargdeaevaQVVEFARHLGADHR-KLELSTRHHQRFGEALTSTLRHLAGPGWDDRLSTTLGTVYRVLTTALRE--- >tr|A0A2W5I8T1|A0A2W5I8T1_9ACTN Uncharacterized protein OS=Lawsonella clevelandensis OX=1528099 GN=DI579_06450 PE=4 SV=1 -----PTYYTVLGPAITLLRehPEDFMRHFLAAALTYDFHFHTFFPS--VN------------DHHASRYTHALRYILEALDqstndpdcldDVIDFLSQLGCDQR-KYQLTAEQYQSLAAALRDTFALLLPYQWSTELNDALLTSFEHAINVMQS--- >tr|A0A2N6TBK5|A0A2N6TBK5_9CORY NAD(P)H-flavin reductase OS=Corynebacterium kroppenstedtii OX=161879 GN=CJ202_05310 PE=4 SV=1 -----GVHEASLVPVVTVLQtdGSRFVDAVFTHLFARRPSFIRRLPA--DL------------SQLKPSFRRALVHVYAKQAtgngldrRTRRFLRHLAEDHR-SFGVEAPDYVAMGDAIIDAGREIIAPQVTSEEFELFAMATGQIIGLMEE--- >tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. 
HMSC11E11 OX=1581089 GN=HMPREF3121_11375 PE=4 SV=1 ------------MRAAAAFGrqAPTIGPEAFRRLLDAEPRFRHMFGG--SK------------TALRDQFMSALSTALVTRAdvgrfpaATIRRLEQLARENR-KFGVAPRDYATLAEHLLDVFGERLPAgpdsgAQVDALREILDEAMSLI-AAAAV--- >tr|A0A1Z5KPX1|A0A1Z5KPX1_FISSO Uncharacterized protein OS=Fistulifera solaris GN=FisN_16Lh317 PE=3 SV=1 --VASPACVMKVINRWETARqrngfDEQLDIDTLLALFKMDPQVKPIYG-FAVEKEVkAQGMQRMGVLIYGLQVVKMFDVILSALGPdeelFYDVVTEMGEQHC-KHGLTPDHFTLLCGAVMGVLETIMDTEWTKDVRAAWSQVIECVNAEIVK--- >tr|V5YLS5|V5YLS5_9DIPT Globin OS=Polypedilum nubifer GN=PnHb25 PE=2 SV=1 -PTFTDAQVATIKGDWNNIK--GQGVEILYHFLNKFPGNYPMFKQFGGKD-LNAAKGTPEFSAQATAIINLLNGVMDKLGSdnagAQAILANLGKTHK-AKGITKEQFQQFREATTELLGNLG---L-GGNLGAWNALFDFVLNVVFTA-- >AP82_1055514.scaffolds.fasta_scaffold183032_1 # 1 # 312 # -1 # ID=183032_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.529 -HPITSEEAETLRTLWSQVK--HREADILYVIFKENPDIQAHFPAFVGKD-LEALRKSLAFAIHSTRIVSFFSKIATLAGDpsnlpaSKTLMNELGSSHK-SRGIQKEFFNKFRASLDGFMQRQS--SWNDNAAVVWNKASDNFYFVLFAS-- >SRR4051812_13904716 -AGMSPEEVALLRHSLDEMRadGPQAAEAFYAELFRLDPSARELFHL--PV------------EQQSVVFFHELDallSAVSDLPAFVERSRRLGRMHA-GRGVRPEHFEAAAAALDAMLLAVYADGASPELRRAWRHAYRMAAQLMQEA-- >ERR1711860_53158 ------IYFSDIKSTWDIVKdeIDQIGMLAFLHLFEAHPEAKTKFKMFEDIPT-DDLKTNEIFQNHAHRVVSVIRKVVGKLDEPsvyLNYLKILGGKHI-MFDADVKYIKQMGYMFLSAIQPTLEKevGITLKYV--FKKTFX----------- >SRR6266536_6175029 --LMTPEQITLVQSSFERLGpqLPAMATRFYQELFTRDPALRPLFTT--PLP------------QQEVRFAEALTEIVRAMprlDELLTHTRAPRRPArrlR-GTGCRLPDPRRRPprrargrpgRQVRRPHTRGMGPRLQPcrrdharrrsrgPAHQQLTTTAAPTASQADGG-- >UPI00012780C8 status=active -MSLTNETKEIIKATVPIIEknEAELTKKIYPLLFTRNPSMKIFFNR-DH----------LRKGTQPRAFIGSIIEYAKNIDNldaIKPLINDIAEKHA-ALNIKPVQYSIVNICLLEVFGKALGTRGTHVVKRAWKDAIEDLANIIIK--- >ERR1017187_3590871 -----QVDCAILKQSFAHIEsvAEKAVGYFYARLFVANPELRSMFPL--------------AMDATRKHFLAALAHIVWSMDDpqeLADYLPGAHRHSA-H---VQRRYVDLPGAVrLGGgdrSHRHSHDPGGagRRGRASLVAGX------------ >ERR1035438_6477963 
-----------------------------------------------------------------------GARGSPRPAEpaaLSK--------------------QMIDRPLRAAgaaPSMHNTPPWRfgVRPDRLTIELRADIATVMTQA-- >SRR3546814_3749254 ------------------------CLFFFFCFFFSSIRRHTRCA----LVT-------GVQTCALPILFNAIAAYASNIENlpaLLPAVEKIAQKHT-SFQIKPEQYNIVGTHLLATLDEMFSP--GQGVLDAWGKAYRSEERRV-G--- >SRR6266704_3508957 ---------TITRAEFCAGRsnrgsKQAFACECYATLIRLHPEVKPLFTH-TSM------------EKQAKKFMASLTLVLHVLGKpdvLTTTLQRLGRRHQ-TMGVRVEHYPMVAEALLATLKSGYAVVLLT----LFVQSYMFL---VRKGA- >SRR5215207_7267255 ------QAV-----------agEPEVRGSILRKAVRIGPDRANLVQ--GGP------------RGSEDEAaQHACDDRWSRLSTrdLRLGCRGFGTTSR-TVRCDAGSVFGGRRSL---nleLGRGARTRADPVQARSVERFLQGGSALHVEG-- >SRR5215470_13616785 ----------------------------------------CMVTL--CH------------CSFTqtcscGTRRRGICSRFRWLPSatgWCMRWAGSCPTSR-TSTPSAGTcRTWGASTASSAPSPSTTPTWTPELAADWKAAYDLVAQVMIG--- >SRR4249920_1577195 -----------------------------------------VWPC--TA------------TRCRCSSTRTC-----scgtrrRETCsr-SRWPYSATGSCT-RWP-GSCPTSTTWTTSASTCRTWaaSIASSAPAPAADWKAAYELVAQVMVG--- >SRR5258708_22654124 ------TLARLLKESWSLVEdrADHLANHFYARLFLIDPNLRDMFPV--QM------------AVQRSRLLGALVEPVQTVPNpsqVVPCFLSLALAQP-TIRLLPGQFEAGRSAPIDP--------------------------------- >SRR6266511_448526 --------RRRRRRAATSSGraSHRLRDsRLEARARDRSRRVLDDASS--WV------------EVVRLGDAGEPVVLVSAVAAiahRDVRRVELAREGE-RVRL-------QVLNVDAEEDDLAGEHWSVEYDQAWRDAYDRIARVMIM--- >SRR5579862_1310240 --LMDPLRIRMVQDSLVKLTprEGSIVDLFAAELSGSPHDESETGG--DNIA------------YQrERSVLGIMAAAAPFLHAPeciLDEVVAEI---G-AGRIHPADYDHAANAFLRALKKNLGAEFTADLWEAWLEALWTLCNLLSRT-- >tr|Q5DGY4|Q5DGY4_SCHJA SJCHGC09035 protein OS=Schistosoma japonicum OX=6182 PE=2 SV=1 -LSINDEQLLLLQSSWSIVkqHIEKIGVITFLGIFEQHSDFRDAFTEFRKRK-FVDVKHDPAMQVHGLRVLSIVDKMITRLPKtddIELKLMTIGSKHC-RYVPTIGLISSVSDQLWGAIEPVLkeEGSWSDELAVTWKTVLDYLTKTVR---- >GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6550916_1 # 2 # 442 # 1 # ID=6550916_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510 
--------LELIQQTWEKVKphGKEWGPKFYNNMWTKYPEVRTKFFP--E-SKP---------EIQGPRLYASLNFMIKNASDietLKQYCFNMGDRHK-KYHCGAEHFQVVGDAFIMTLTEFLGEDFTPELKQQFQLLYDTVAEMTI---- >ERR1719360_423992 -EPLTQAQKEIIFTSWDAItHKENLGVTIMYRIFTGHQEIKHLWKFADDLKTEEEIRGSKTTQFHAKKVINGVNSAIKAVEAgkeVESlGLDKLGARHF-KYGAKPADFRHFVESLFWAIKTIVPE-VSAEMAAAWTNFVMQIIKQMTN--- >tr|A0A194RIW1|A0A194RIW1_PAPMA Neuroglobin OS=Papilio machaon GN=RR48_08766 PE=3 SV=1 -SPLSAKQQYCMLASWKGIFrqIEKTGIILFVKLFQENEELLHLFEDFRHLQTVEAQVSSTELAEHATKVMHTLDEGIKGLGDMDsffAYVQHVGSTHTQVPGFVADNFMKIEKPFLDAAKTTLGDRYTPNIENIYKITIRFILENLVKGFE >ERR1719153_450463 -MPLSEGTISILKACHPIPvaNREDIGSSFYTLLFQQHPETQNLFPL-SHVSASKGGKPGPQMRS----HPTMPYLIF-HTkqlF------------------------TIIYNTKIQSX-------------------------------- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold8273257_2 # 299 # 427 # 1 # ID=8273257_2;partial=01;start_type=ATG;rbs_motif=TAAA;rbs_spacer=15bp;gc_cont=0.364 ---------NELQTNIEDVYsaGDV-C-----ALFDSSaNRYRPtrtwlscafqgEVAAL-NM------------LGQDKVynegvFFNASHayrSMYAVLGNFNPAQAD-GFEFF-VCNQDKENYE----RMVLKDNKIAGAMFVGSMKNVWSVKQLIEGQVDV---- >ERR1719244_2234371 -VVLEDAEVEGVQTLWAEVSgdLGNFGARVFGRLVHDHPTIRKYFPWGRNDKTEEQLVAAPDTQAHAEEVFGALGKIIGaagHLNDYRSFLVYKGMQHI-PRGVKPEHFDYLKDALVDTLKEELGDKVTPAGEEGLNKVYSFVEKAMSKGL- >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3481696_1 # 1 # 387 # -1 # ID=3481696_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.584 --VLTSNDIALIRESWAYAkDIPAIQTETLLEHFRIQPRTQALFPKFADVP-LNKLPTNDAFIKQARSCVSFGLNFIVANLDNPSLLkDMLGRVdTyG-KWYVDF--MtkeRQMQTTVdifIQVLSKELGGRLSAAAKAAWTRAMTLVFVEMMS--- >ERR1712198_397898 -QGLTEEEITEIQSTWKSIIsdkTSEHGVNILIRFFKNYPEYKaQYFQNLNTLS-EDELRESPKLRSHGAGFVLAITQIISDLDNmliVEEVAKKIARNHY-NKGIREPlNYKLMTNTIIDYIKDIGN--LADGTMQNFRKMFDIFIISVRKKY- >SRR3954447_20457037 ------------------------------------------------------------------HKVKVEDIIVRGGGNLMVEL--MNTDAA-GS-----PLDTPVRAVTDG------TESTAAAREPI--------RLNPG--- >SRR4030088_1427564 
--------------------------------------RRGRDGG-QP-------------R-RRELRRDGQepdepDASRRGdrgRPCAGPASR-----------------R--RGSAAGCRSSPPSPAWPALSYEQWRETCDTLHGhTQVLG-- >ERR1700752_5389668 ----------------------------------VVPQVPAARSR-VPL------------R-AASFRRGGLehdpdPKGRVSakqEPV-FGK-------------------D--HGQTIRLSARGQSS---PrRNDAARETTCKEARMtPEQVK-- >SRR6478735_7013605 --IMTPEAIRAIKTSYAAVatQPRQLASRFYSELFTAAPNLRPIFP--ADLT------------LLQGHFEAAIAMVVRNLDEmtaLREPLRDLGAQHV-HWGARPEDYVTAREALIGAVRGTT-RHDRRSAGRCVSRPTRSARpIGSRR--- >SRR5262249_59625092 --SRHRDAAVLVRTFTCAPpaPPGRRASRLYEGPFPADPDLRPRFP--ADLT------------LLQNHFEAALALVIRNLDDmnaLREPLRDLGAQHV-HWGARPEDYVTAREALVKAIGALS-ASWTATLEQYWRSAVTSIIvT-MLX--- >tr|A0A0P5LQ45|A0A0P5LQ45_9CRUS Di-domain hemoglobin OS=Daphnia magna OX=35525 PE=3 SV=1 --LLTANDRRIIRKTRDQAKkDGDVTPPILFRFIKAPPEYQKIFKPFADVP-QAELLGNENFLAQAYTLLAGLHVVIQTLFSqelMANQLNALGGAHQ-PRGATPVMFEQFGGILEEVLSEELGSGFTAEARQAWKNGIAALVAGIA---- >tr|A0A0P5UVQ8|A0A0P5UVQ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna OX=35525 PE=4 SV=1 --LLTANDRRIIRKTWEPRpRrTEDVPPQDPLPFHQGPPRVPEdVQVLRLCSP-SRACEQRKLLGPRPNTILAGLNVVIQSLSThgaYCQPNQRSRSANK-PRGVPPIMFEQFGNVAEEVLAEALGSSFNAEARQAWKNGMRALVTGIT---- >SRR3954451_10251525 --------TSARRqqWTFPRCGptspRPQRPGTRARCTSTPTCSCAIPRPA--RC------------SRSRWRT-SGTGSSPPSATWlpgsttstrSCPSCSSSGGTTG-SSGPSrRTTRPSVPacWPRSSTSTTS-GARNSPRAGRrptTASRAPDVLATVMIE--- >ERR671928_16913 -----------------------------------------------------------------ALYFDGIDTGR-----lrVHQTKLLVQVTGG-PVEYDGRELAVAHGGLDITLEHFD-PGWTPELARDWTQAYQLVAKVMID--- >SRR3712207_8140349 -------------------------------XMIRRPPRSTLFPYTtlFRS------------AHQRDRLFQALGDVVNYVDDldrLVPILQALGRDHR-KFGTVAEQDRKStrLNSSHANI------SYAVfCLKKKKKDSHPSSTTX------ >ETNmetMinimDraft_30_1059905.scaffolds.fasta_scaffold1335019_1 # 137 # 232 # 1 # ID=1335019_1;partial=01;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.573 
--PITPEEKDGAMRVWKMILnnrsehflalkrenKekdvqdaencmDYFMHNFYIRLFDIHPNSKQLFHR--SI------------HKQGSFFLRFLSMCVAEVSEpekLDKTMENLANIHN-KLGVKAVEYGIAGEALFHTIHKCVGPEFNHEAAVGWTKVYSVFLKYLI---- >sp|P15447|GLB4_GLYDI Globin, monomeric component M-IV OS=Glycera dibranchiata PE=1 SV=2 -MGLSAAQRQVVASTWKDIAgsdnGAGVGKECFTKFLSAHHDIAAVFG-FSGA-------SDPGVADLGAKVLAQIGVAVSHLGDegkMVAEMKAVGVRHK-GYGykhIKAEYFEPLGASLLSAMEHRIGGKMTAAAKDAWAAAYADISGALISGL- >SRR5256885_11466498 --------------------------------------------------------------------------------------------XM-LLFF---------FSSRRRHTRLQGDWsSDVCSSDLWGAAYQQLADILIG--- >tr|M3IRU3|M3IRU3_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) GN=G210_0056 PE=3 SV=1 -QELTPDQLRLITECIPIMEdlNLTLGSKFYRRTTRRHPHLQSYFNE-TH----------HKLLRQPRAFIFTLIMFAKNIHDltpLRDVIRRIVSKHV-GLQVKPDHYPLLGDVLIETLCDMFPYHmVDDKFKTTWSIVYANLASLLIG--- >tr|Q86G74|Q86G74_PHAPT Hemoglobin II OS=Phacoides pectinatus OX=244486 PE=2 SV=1 MTTLTNPQKAAIRSSWSKFmdNGVSNGQGFYMDLFKAHPETLTPFKSlFGGLT-LAQLQDNPKMKAQSLVFCNGMSSFVDHLDDndmLVVLIQKMAKLHN-NRGIRASDLRTAYDILIHYMEDHNH--MVGGAKDAWEVFVGFICKTLGD--- >sp|P41260|GLB1_PHAPT Hemoglobin-1 OS=Phacoides pectinatus OX=244486 PE=1 SV=4 -MSLSAAQKDNVKSSWAKAsaAWGTAGPEFFMALFDAHDDVFAKFSGlFKGAA-KGTVKNTPEMAAQAQSFKGLVSNWVDNLDNagaLEGQCKTFAANHK-ARGISAGQLEAAFKVLAGFMKS------YGGDEGAWTAVAGALMGMIRP--- >tr|R1EGH0|R1EGH0_EMIHU Putative nitric oxide dioxygenase OS=Emiliania huxleyi OX=2903 GN=EMIHUDRAFT_435200 PE=3 SV=1 -SGMSAETIATVDATAGAVApfALDITKDFYGDMIASLPSvVLTVFNP----AHNVPI-----STHQPEALAASVCAYATNIKDlspLlvpGGAVDAINHRHC-ALNIQPAHYLPVHDHLMGSIAhvlgPKLGDALTPEVAGAWSEAVRFLAKVCIDK-- >ERR1711974_215400 ----------------AKVseNIDINGGILFQKLLTDNPELKELFW-RANKGQQgDQWRNDKNCQKHGKSVILEIGRCLSAVDDaeeFSSLLYKNGVAHK-SRKTTEEHFPLVGEAVIYMLAEALGEELNDECKAAWLGAYGVITEHMLRGL- >AP12_2_1047962.scaffolds.fasta_scaffold738771_1 # 1 # 321 # 1 # ID=738771_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.648 
----------------------------------------------------------------------------------------------------MSFITVPGVAArsSFVwlrestaalrgpalvaliyflgaeaafyigtlsdrifalfwpPNVvLFCALLIVPQRRWWLYIAAAFP-------- >SRR5260370_506041 -----------------VRD-YSSTCSF--------FFFLQAEDG--IR------------DSS--VTGVQ---TCALPIYqerTEQVLSRLAVDHR-KFGVRDKHYEPFFDAVFATAEHAAGPAWTREMATAWRSALDWFGSVMA---- >SRR5580658_2929351 -----APLRAIV-EEVLRSGgg----------------------------------------------------------------------NVAA-GTGVRRNASLFHGAREPPGFYD--MpGLRELSSSYPWFQV---VP-VIS---- >SRR5258708_13478776 -----APLKAII-QGILRA--------------------------------------------------------------------------G-GPLLRRETRPLVGAPRGQKALL--PpHPPGSGSVASRPKG---IS-L------ >SRR6266704_2687724 ------IARPPDR-RPRCGD-GVLLR-P--------AVHRQSRPA-------------------RAVSLRDDANPRGGLPDadrAGQEP--GRRACD-RAGPRPDRQGPpqirrepeALPAVLR-RAVRDGRAFRRPGPDRRDGRGLA---------- >SRR6266536_777504 -----DGYREALDASFARVAssGEKAVAYFYGRLFAATPRLRGLFPA--AM------------DYQRDRLLCALLQITQRLSNraaLSEYLVQLGRDHR-PPGVPPAV--PGGAACEHPNPTLA-pGVAPllsgvraagqrvarVPHPRRPRRLGQHVPGAVH---- >ERR1719498_564827 -RRWTERKRLVIQSSWAALLsahgndRMATGSKIFRKLFTGDTAVLRLFP-FRHQ--ARTLFVSAPFKLHAKLFVDTMTELIANLHDLEkveRDVRELGKRHL-TYGVQPAHFDAMGEALIAVLDESCHhpSdevTLDKEERDAWLGFWGFIAKETQR--- >SRR3569832_1708069 --------------------EEVAGVVLFQRLFEKCPQTKVLFG-FPiDIDpSSKELVTSKRFLMHASYLIQMLDTALNMLGPdqelLTDIMLELGTIQS-AFCVASVCVIC------KELETHLC--f-------------LRLLCQAX---- >SRR6478736_5796684 ------------------------------FMMGV---IASGMVV-TG----------AERRGRPKAVQPGNREWITVIQAinaEGQA-----------------------------IP-PFIIGAGQYHLANWYRDSNLPGNWAIA--- >tr|T0QF73|T0QF73_9STRA Uncharacterized protein OS=Saprolegnia diclina VS20 GN=SDRG_06019 PE=3 SV=1 ---ISKDVQALVLANWAAISsgstPAllKIKpaspvvyfyDYFYGMIFEKAPAVKPLFRS--SI------------IVQGKALINIIQSITSavNAPNVIEKVCDLAYRHN-KYGVKIEYFNLLGKCLLLAMHDCTGDTFTDELREAWRAAYAYMVMVMTP--- >ERR1719210_139600 
--------------------------------FTLL-----DPPGQKrnvaqawsAVVqADVAILVVSANPGEFEAGLAK-------------------------GGQTREHAVLAKSAGVENLVVAVNKMDSVDGEGKWSNLryee------I------ >SRR5256886_2416282 -------DREADADREADADrdGDAEPEPLTAPALSSPPAV-PLAPP--RD------------EAARQHdEPEPAPPPDQVPGAadpretagppeppeeppp-------DGKGEP-AAG-----PDPAIAAGQEALRAFARE--afTSAAEEAWTQVYLAGSSLMIK--- >SRR5581483_8202477 -----------PDDPVFDGMqgnvGRvaarylphrEGEAYVAGPVGMVRETIRALTRA--GL------------PRERIHYDDALLAEDKQASAqgvagatahtsrtpessrpgRTGEAGNAGPDGH-IrrvaesdqAGPAGGTAEPGQSGLRDAAADIAPQ--------ADTAHQDGGPHDDQagA--- >ERR671911_2215695 ----------------ELEPacapDKQLVEHVQRlRVEAGAQVVGR------EEERRSRAgqCPRPTSRVDVRGTHDD--------APlecVAEVLVDCGAHAR-VACKVDergraaleLLDRVVPDDLVVDLHAVDEVDGGGQTgHVGPGTSSRRVstarakpQAGTLPQ-- >SRR3954453_16132976 -------NLQALEESFDAVAphGDELMDEFYGRLFEAAPAVKPLFAH-TDL------------KRQKAMLLAALVLVRKWRPAraLSGHRR--GAHRL-HGCRRGARVDGRVRGRL------GRGAWRGRRRDDRGR-------------- >SRR4051794_7197155 ------------------------------PHAAAAPVLPARLAG-RPRPAGAGPISPPARRVGRRVRPLDRVPPPARRDVaraARERLRGRGAARA-AGAGGSDLAPPVRHARVGAAVAVRGDLGGAAGIAAESAPSVLPWTTTRSK-- >SRR6188474_1917881 -----------------------------------------------------------------------------------------------------------LNFVFEkiktKKLIPMTQKQIELVKSTWSTV-----AAMDH--- >ERR1711894_485352 ----------------ILLYnYrfLTYVIYYYYRFLAEDPTVASVFSRV-NVD----DQQSGEWHAHMLRIMGGVDILINMMDDvnvLTEEVKHLRAQHVVREGVTHERMKAFLIIMMDELPKVMT-HFNH---DAWKSCLSKKLKRIGG--- >tr|A0A0S2MLM2|A0A0S2MLM2_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1 ----SEGDADIVIKQWASVMnAavsgenrVVIGRQIFNSLFLKQPAAPALFPY---GS----DLDGAEFGAQMSRVLSGLSNAINSLTDddlNVSIMDHLNKQHVVRDGVTAAAMKDMQVSIEDTLKQLVT-DYND---DAWHDCLGVAIERISV--- >ERR1712217_222699 --------------------IDNIGEVFSQKLFALSPRRHARA----GM--------------EWGPVVKGIGHAVDNLTNLDavaVKYKRLGVLHR-CIGVKEHEMREMGEAFILSLRDVLGKSFGHQAEAGWRAVYCFVAHAMMA--- >DEB0MinimDraft_6_1074348.scaffolds.fasta_scaffold06817_4 # 3572 # 3886 # -1 # 
ID=6817_4;partial=00;start_type=ATG;rbs_motif=TAA;rbs_spacer=12bp;gc_cont=0.311 ------LQRVRITRQWRKAYgtgshRLDFGLKVFKHLFEAHPTARALFADHHSD----N-VYSPEFEAFSERILNEFDIVIALLDDpaaLSAQINHLKAKIT-KRHVTTEQLTVFGKNTLEVIPEYVGNHFD---HSAWTDCLKRLRSALTV--- >ERR550532_3441629 -----YRQVFQLKNSWKTVSrnLDDTAKENLLKFFRDHPEHKALHKKLTKYEDEASLRESQAFEDAALAVFNTFDEAMDMIekDKVdyaITTLHMAGKSHSAIEGFQPAYFKDMEESFLYAVKLTLGDRFTEATEQNFRRLFEFTTQQMIEGM- >sp|P02210|GLB_APLLI Globin OS=Aplysia limacina PE=1 SV=4 -MSLSAAEADLAGKSWAPVfaNKDANGDAFLVALFEKFPDSANFFADFKGKS-VADIKASPKLRDVSSRIFTRLNEFVNNAADagkMSAMLSQFAKEHV-GFGVGSAQFENVRSMFPGFVASVAAP--PAGADAAWTKLFGLIIDALKA--- >sp|P09965|GLB_DOLAU Globin OS=Dolabella auricularia PE=1 SV=1 --ALSAAEAEVVAKSWGPVfaNKDANGDNFLIALFEAYPDSPNFFADFKGKS-IADIRASPKLRNVSSRIVSRLNEFVSSAADagkMAAMLDQFSKEHA-GFGVGSQQFQNVSAMFPGFVASIAAP--PAGADAAWGKLFGLIIDAMKK--- >sp|P21660|GLBP3_GLYDI Globin, polymeric component P3 OS=Glycera dibranchiata PE=1 SV=1 -MHLTADQVAALKASWPEVSagdgGAQLGLEMFTRYFDENPQMMFVFGY-SG--RTSALKHNSKLQNHGKIIVHQIGQAVSELDDgskFEATLHKLGQEHKGFGDIKGEYFPALGDALLEAMNSKVHG----LDRTLWAAGYRVISDALIAG-- >SRR5690625_2040278 --------------------RDGFGARFTEELLSRYTEIREALPD--EPA------------WVARAVTAVTDALIDVADDpgaLVTVLERLGVDNR-TVGVHSAHYAPIGHALILAARAVGGTAWTPDIERAWVDGFDVAAEVMVT--- >ERR1711963_100213 -TSLSEGTVEVLKACHPLLKdvRRVIGKAFYNRLFKEYPQVKPLFSQ--SD---------AARTHQTLALADALIAFTGRQLLegF-EAKQRGQ-ERS-LRLRSLQAGSWQGLWRLPSRDRGERD---QNEGSQIKPQILTIQ---QD--- >tr|A0A0G4EPR9|A0A0G4EPR9_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_12573 PE=3 SV=1 ---MSDKERgVLIDKTWGLLkeryTLQEIGEELYDNVFKNAPDLRHLFKR-PKELMA---------LKFGEMISTIC-GLFQtDRESLLETMRDLGIRHV-DYGSRPEYFPLFKACLLDTLENLLEDGeFTAATEASWNDMWDEASEMLIS--- >tr|A0A0Q5LAI2|A0A0Q5LAI2_9MICO Uncharacterized protein OS=Frigoribacterium sp. 
Leaf164 OX=1736282 GN=ASF82_14980 PE=4 SV=1 --VITSSHLTALRSTLPLVeaRAAAIADDFYARLFADRPDLLrDQFNR-GD----------QAQGRQQRELALTIVTVARDVVgtqvgsgpagsatgpavpvapwsspapspwavrvAARETLSRLAQRHA-AIGVTRDEHDVFERHLRDAFAAALGDDWSGVVVDAWLALWRQTRDELVA--- >tr|A0A1Y1I4E0|A0A1Y1I4E0_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_002310190 PE=3 SV=1 -VQLSPFEQQLVQKTWKLLQprLADLGQAVFTHLFQKAPKTRPLYTCPLRLADGDrRTPDGHAIPTHAVEIVSTIGLAACRIGSssrILAVLERLGQRHV-AYGAAPDMFSVFKEAFLVALKKTLGGeHFTAQVHKAWSKALDSVVAHLKKG-- >ERR1719296_130621 ----SVQTNSDVQKSWEKIQeigILRAGEILYKNIFELAPSARETIPPevlekyrissFLvslNEDeLDDAFIENAIWSDRAANIFNVVGHVVRGQHDfgrLVPMLQELGSRHV-GDGMPEAILKVVVPAFKFALHELLGSMLTEDLEHVWMVGLELVNSHMIQGMR >ERR1740115_393061 -NLLTPETVRVVKETSPRIAsmAPALSSSFFKRFLS-HPDLAAYKASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAWDEAIMVLADICVD--- >ERR1740130_2673129 ------------------------------------------KASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAENHRLTINLFL-LE--- >tr|A0A0K2UHU6|A0A0K2UHU6_LEPSM Uncharacterized protein OS=Lepeophtheirus salmonis PE=3 SV=1 --YLSKKQKDLLKRAWVALhnNLSSVGMTTFIKMFETHPEALKFMiPKLTqeeekktqpnySLDSRLDPWHSEKLREHAHRIMKTVSDVISLLNKdeekIEEMLVALGGKHH-GFGVHIEILELMGPHFISAIYPTLKETWTEELQEAWQCLFNYIIALLHIGF- >tr|A0A0B6ZHC3|A0A0B6ZHC3_9EUPU Uncharacterized protein (Fragment) OS=Arion vulgaris OX=1028688 GN=ORF61548 PE=3 SV=1 -TGLSARDRKLIKDTADIIfgqlKLQNKGVVFLIAFFKAYPHHQRYFKMFRGIP-PDELKSIPHTENHGRRVMSNVALLVQHIEEpnvIKEQLVDLLIKHN-PRSVKPRQMKDMLNMFVDFTSQQLGAKFTSQHETAWRKLTTHILSVLEE--- >tr|A0A2H2IJL2|A0A2H2IJL2_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1 -------------------------------------------------------MNAVELRRHASVYLKGLGKIIESMRNeeeLGKSMSRIAQAHI-KWNVQRNHVIVSMGKTEIRQRATNSYALKS---------------------- >ERR1719270_1027131 -MSLSTETCNILKICKPLLenNRENIGLTFYKKLFDENPGLKNVFN----MGHQR--GVdd-DKPGRQQFALGQALVAYCLHCESldkLASFVERVANKHV-SFDVQPEQYPVVGGILLATLEEVLGKEtFNEDVKKAVADAYFFLADVFIS--- 
>ERR1719318_1430785 ----------------------------------------------------M--N-----NAQGNSLANAVVAYCANCDQleaLGPTVAKYTVPTC-KYIFHIS-------S-------TRPLKmFLPI---SX---------------- >ERR1712088_143820 -------------------------------------------------------N-----NAQGNSLANAVVAYCANCDQlelLGPTVAKISSRHV-SLEVTPEQYNVVGGAARQRSlqrssQRCRGRGlLFPG---RHLQGERGKNDRRSQ--- >tr|F6WSS9|F6WSS9_CIOIN uncharacterized protein LOC100181975 OS=Ciona intestinalis OX=7719 GN=LOC100181975 PE=3 SV=2 -MPLTEIEIEGVQESWEKVSsggPKTTGLILMEKLFNTYPASIAVFSHLGIPSKPdgaitvSDLASIGGVSNHAVSLASRIGKLVGLLNNeteLKESSTEVGRIHV-KYGVTSEHVDLLGSVLLSVISENQGLSNTSELIGWWSKTWNIIGNYVK---- >SRR6185503_2239525 ---MDSGHKALIRASFGRALtVADLAVELFsGRLYLLDPALWTLLDLGS--------------RRRQQELVQVLAWAIEHLDRfelLASTLEALARRCV-GNGVREAHFERIAGVLLWTLHQVLGDTYTAGTAAAWRSTSGLIVERMKQ--- >ERR1740129_283753 --PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--TPPEQGCLPKLYEWMSRTMTS--- >ERR1740123_30535 --PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--GGLLARSGAeSVFPPGARA-GD--- >ERR1719193_1971274 --VLTADDIKAIKAIWFPImkNPADLGVALFEKFFLLYPQQKDKFKFMKYD-----DLREKGMRAHGEKVVKKLDEAVLLTlYrsRIKHCFQRIGFSHL-QMGIKEEDMQQLGEAIIATVEDAFVDKLTPEEIGSFKKFIKLFTAEF----- >ERR1719193_859649 ------------------------------------------WRMLKKR-----H------NRDGGKLLH-PLKTILQTcYksRIKNCFQRIGYIHF-RMGVQEEDMEQLGEAIIKTVEAAWGDEFTPEEYAAFRKFMKKFTAAF----- >tr|I2G907|I2G907_9HEMI Hemoglobin A OS=Anisops deanei GN=HbA PE=2 SV=1 -FSLTDREVEVINQSWNQIKAqeLVVGLQMFKTLFQRYPQYERLFTHLH--QSGKSLYEGDRFQRHVVgNIMSSINKVIETLNssdNAVKTLQDMGVKHK-KLDVHRKHFESFVPFVVDAMVSVRMSMSQDEVASAWTKMMEGVASNLSKG-- >ERR1712157_679996 MKPLSFTTMDCVLSSWEQVRripnyRETVGLAILQKLIHRMPEGREVLHMQRNLIknSPPGIESDKLLLAHARAIVNGLDTVVEllgpLIDDISEILREIGKSQYHDYGDSMALWNpLMRECVLEVIQETLKDDYTHELKVAWTDFLGEVAKDIHSG-- >SRR5438477_4839339 
------------------------------------HGIEP-IPH--RY------------AAIRRVVSGRE-----------AQARRVGQRHH-AAREDQRR-------LRGL----ERRRG-RPPARHVRL---------AA--- >SRR5262245_20667862 -----------------GRAdpLTLLCEREIARFRG----------------------------------------------------------------I---ELDGIGRA----TALF------DGPARAVRFARAMIARGRAL--- >UPI0003969FE8 status=active ------RPFEAA---------------DRELLFGRAQDIRAVVEQ--LR------------TDPLVLVTGDSGVGKSSLCRagvLPQIREGALNDVR-RWSVAV---LSPGRWLLDTLGDA----LA----------------------- >OM-RGC.v1.018126893 TARA_122_DCM_0.45-0.8_C18859060_1_gene481717 COG0677 K02474 ------SELW-------RGRprKTSLPAgssiRTRTAvlvplgrgketapssssanfvlnLTDVPPEAQELRiTA--EV------------DDQRIHFQRRVPADVD----kvVMELPEGSLARKV-R--VEVAAFD---------------------------RR-CS-IAAFRA--- >SRR3954454_16888348 -VISRSAVIRHVLPTP----aepaaVDHIGQQVADRTSQQDRGERVLLNRT--------------aHGLR--ALADGAARLRIAAQSvadvtRTPLVGVLRQLRS-ALGDVSHRLCGLSDHAEAllgAIKDVLGDAATDEILAAWGEAYWLLADVliar------ >SRR3954471_17335278 -VISRSAVIRHVLPTP----aepaaVDQIGQQVADRASDKDGGERVLLNRT--------------aHGLR--ALADGAARLRIAIQSiadvmRTPRVGVLGQLGG-ALGDVPHCLSGLSDDALGccaTCGCYLCR--------SRGGASWSFFCHaalr------ >SRR5215204_1408335 -ATGGPTRWATMRGRWPLMS-------MLESIAQSG-SGRPVWYVH-GAR---------DrrahaMGDHARALAADEHAGK---------HRAVRQRT-------------------------------AG--------------------- >tr|A0A167F9Q7|A0A167F9Q7_9ASCO Uncharacterized protein OS=Sugiyamaella lignohabitans OX=796027 GN=AWJ20_2623 PE=3 SV=1 -VVFTPGEISLLRNIWKEISEnnLDhgrglkssqastfFCQQFYENLLGDHPSLQTLFPSL---------------QSQSAAMAWVLGQIIAQLEDVsqaQSVLIKLAKWHSRLMNLEPVHYEYVGSSLLRTLGDRRGDKFTAQEENAWIKLYTFIANVMLK--- >SRR5262249_41403170 ---------QVLKESWARVEgqQEALAAHFYARLFLARPDLRELFPI--------------QMRPQGRRLLVGRARATEPGGAPDgASSRERGRPRR-RYEVSAEHHAVFRECLVAAVRACSGRDWDAEREQAWREGYDVLARRMVA--- >tr|A0A1Z5JNP0|A0A1Z5JNP0_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_8Lh328 PE=3 SV=1 
---LSSTSLLKVIACWEQSKsrggfDETIGIELMLTLFEMNPQARSQFG-FRTDQ---VIDKNnglqrMGILIHGQRFIRTLDCLFSLLgpddDNLEEVLRDFNKESC-QDGMPLPQFLLLLGILVKVMAHTLGGDWTDEVQFCWMEVITHLEVIVT---- >tr|A0A150GQ95|A0A150GQ95_GONPE Uncharacterized protein OS=Gonium pectorale GN=GPECTOR_12g483 PE=3 SV=1 --GMSLEEMEQLQGSWAFLSkgafpgevkeqLESFSVDFFMALFEQSPGLINLFP-FKDVNG---KPIIEQLKVHGLKVFQTIGAVIDMCNNysvLLRVTTDLVARHI-KYGVLAAHYDVLFQVLVGILTNVLGSQFSGTLAAGWVKLAGFILRVVKDVY- >SRR5215203_5896321 ----LVRERRLVREAVAMVdDQDRLIRDFYMIVFAMGGAeVIGMFPT--DMR------------RQRHEFGRALVQWVsaDDPDSIAAHLDQLGGDHR-KFDVQPAHYAVTGEALVAAVRGRCGGRFTAAHEEALRGSYGRLATIMIDG-- >SRR5580698_8666230 ----PDLEKMAARSPWLTVtA-------------------------------------------------------------------SLSAEPV-SLGHGPRTEHgtvADVLARLGTWREHD--------------AYVCGSSAMVAA-- >SRR5919204_299658 --------------------------------------------------------------------------SDlrSGPTSRCTHVRC-----R-QQRSPPRHHRClRPRSPAPSWSARlsagfrssscrpstnRPARRRGRGRSTILASYTRLASVMLDG-- >SRR5688500_16794215 ------YDARVLRGSFAQLRprIAQYSPVFYEHFWRDYPETRPLFG--RNMSKPE-------LDTRINHFM---LWVTENADRphfTIDYIQSVARRHV-GYRIRRRHFAYVDNTNIKTLRELLGDSFTPEVERHWRASFRFLTLLM----- >SRR5947199_2475351 ---------------------DELARAVR---lQ--gSRRIMEEHAC-GAE------------GRQLARLFDERGRLARAPRAVDEPGLELGARvsdgrcglakigdvverivqaedvdavRR-AGGDELADEVIVS-------------rtRADDEtseqrepayrigprtqCSDAFRRGLERPAGAPVQT-- >SRR5919197_1330773 ---------------------RATAGGLYGVLprlR--rgrrRVSVRCNHAG-TDL------------KKQKTMLLGTLVLLRKPLrdlDAIVPKLRELGARHV-ADGDEGGDELLEEQEGKGYGED-EGEgdeafdapLIDEX--------------------- >SRR6266516_4891354 -------------------------------------------------------------------------------GLGDGGRAEGGNRDS-GRGEQLEHLGCVHDVLLSFSESTVSTlphqaarpapaaegagpAITRRetadrapprrhrvggfLRSAGAARARSSIDRMTET-- >SRR6266508_4596506 -------------SAFVRL-tdARRVARCLPSAH---pGDETPSTFPS--ET------------GDPVNLN-----------LEALETSFDLVAPRG-DG-SEATEDDVVGHPGPPA--QVA-PRPRGDRPQAA---------------- >SRR6185295_10958302 
--------CILLLVA-----CFLTFKLFFYSMFQDYPEYKNLWPKFRHLN-DEALINTGELSNFCSVYMDGWEKVIGELDDnaaLARELKIIAKTHL-RKGVERshimvakkealcqiriheyCYLQNMMPKMLSLLKEKNGT-LDAEVEEAWKTVFIINADIIE---- >AntAceMinimDraft_18_1070375.scaffolds.fasta_scaffold521461_1 # 3 # 443 # -1 # ID=521461_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.569 --------DD-------------DDDDDDdDRMFHDHPEARALFSRVHGDN-----TYSPDFEAHAQRVLGGLDSCISLMDDpdtLASELGHLKAQHA-DHTdVTAEHFDVSICFSsTDVTSTYTsthckimdrpnYTVFQT--RGQrnltksaSRRAHspvRDHPRGS----- >SRR5476649_891947 -------------------------------------------------------------ATSTRCCS--ATSRKCCRCSikpTRPTASSsarwptpcWLTQEI-SIawNnWARWHRPSStSMCRCKSsgNTIPWSApRCSRRYVKCWAPRWRPmpsstpgpprtvsWRTCWPV--- >tr|A0A2D8PEV6|A0A2D8PEV6_9RHOB Uncharacterized protein OS=Maritimibacter sp. OX=2003363 GN=CMH11_20945 PE=3 SV=1 ---MTSQNAGLIRASLTELFprREEFAERFYERFFEQAPQVRRMFVH--DSE------------KQKLMLYAAIAMTMRGLEServLHSELMAFGSRHA-RLGVREEHFPIFGSAFLETLIHFLPQWDHPDLARAWWGAFTDMSTPIIA--- >SRR5690242_2028058 -------ELALLLQSYGRIGilIPKISENFYRRLFQLRPNLAALFAN--R--------------DADLKVEEMLRRIVAHASDAaaaKAEVQSSGRSHA-QWPLLPEDYRVAGECLIQAIIEAEGAATGSVVASIWRQAYVEVANLMIC--- >OM-RGC.v1.029911412 TARA_036_DCM_0.22-1.6_scaffold294997_1_gene285712 COG0526 K03671 ----------------DRLRarGEPPSGNPYRGAAPYGPGDEALFF--GRR------------AE--------LEVLIDRVQkTpfvLVAGDAGVGKTS------------LCSAGLLPLVREgalGGPRHWACESIACGEEPLAALAAVLARH-- >ERR1719414_683447 MEDLRFETIRCVVQNWERLKynplFEEFAIAFYQRVLRVCPQAKSFFGSSFCLD------DQA---TMTQEFVRLIDRVLDLLGPesqlMVEVLRDLGSRHE-AYGVTVEMYDIMRDAFLLTLEQFEGEKmFTTKVRQAWMTVCSAVADVMMEA-- >ERR1700744_5993147 ---VGLDDRDALGVLRDAFSqdesgsGNELVRRFYNHWVELDVSVRDLFPP--GME------------DQRAAFAQALNWLYservaQRAEEPVAFLAQLGRDHR-KYGVLPSHYETLQRALYATLRSYLSdpsrSAWSDAVDEAAGQSLNLFTGVMSG--- >tr|A0A1E3QTC6|A0A1E3QTC6_9ASCO Uncharacterized protein OS=Babjeviella inositovora NRRL Y-12698 OX=984486 GN=BABINDRAFT_161163 PE=3 SV=1 
--NFTPAEIATLKATWSMEAKDTnsgdiadpkntlFGTTsfwehVYSLVGEEHPEVVHLLPP---------------ITHQTQAFSGMVYLCISNLDNlsrLDEYLASLGRRHSRVFNALRLHFEAMGSGVLKSLYNHYGEAFTADISDVWARFYCFLANSLLQ--- >tr|A0A0A9XWX4|A0A0A9XWX4_LYGHE Globin OS=Lygus hesperus OX=30085 GN=GLB_0 PE=3 SV=1 ---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNLHSnnrrkNKELFEKLATIHA-KRKVSAQQTPYIKHTLMDILH--L--EPHSAMEKAWINVIDTLF-------- >SRR5687767_4837246 -----EKQVLLVKHSWSYQAgqLENLGTLFTKKLVALNPGLKAPMKR--SL------------AETGSySLMVAMNQIVAALPDLhkaQNHIQVIVTEYA-ALGITRSDYENALIAFLLALEKRLGKSWSDEIREAWIFIFSSLYH------- >tr|A0A0S8AZS8|A0A0S8AZS8_9PROT Uncharacterized protein OS=Betaproteobacteria bacterium SG8_39 GN=AMJ64_12515 PE=3 SV=1 --------TGLITESWNALGagQRAFVEAFYQRFFERYPDYRPLFPL--ELN-----------PRHLEKMVQTIALMADQSQDrgrIAPHMHTLGQAHK-AYDLSARDFDNFKRTFVEVLGERLGRQWSAEAEKAWNDAFDAVLVP------ >tr|Q9NG75|Q9NG75_9CRUS Hemoglobin P polymer OS=Parartemia zietziana PE=2 SV=1 -TGITDAEKQLVQESWELLKPDlmGLGQKVFGRIFTKNPEYQTLFTRvgFGDTP-LTQLMANPAYGAHLIKVMRSFDFVIQNLGKpktLLAYLKNVGADHI-ARNVERRHLQAFSESLIPVMQNELKAKLKPEAVAAWRKGLDRIIGVIDQ--- >SRR5579875_723516 ------------RESFARIAprKEEFVASFYQTLLEKYPHLQRMGAGV-------------DVKRQRKSLLATLQVMLNETDRgeeLRTQFRKPGQRHN-ALQIRAEHYPAFGQTLFETLALY-DPQWTGELRVAWAAALEQCVRFMMEDLN >SRR5579871_3449338 -VPLSALHRYLVRRTFTHLaiHADEVTALFSQRLVELNPALMIIIV---DEA-----------GTQRYRPLEILARVIALMDRpaaLSIQLKLLQAQQQ-R-SVTPDHLRQMGEALLWVIENRLGDSFTPDISAAWLHFYRFLGE------- >SRR5215472_5690244 -----HFDVQVIGAALTRLAdpAVDAAEYFCSHLYSISPDAAALFPS--EL------------AAQRELFADAVIRVQHSLESgsgLAEQLATIGRQSR-KFGVTERHYAAFMLAMEKTARHFDTGG------------------------- >tr|F2UQX2|F2UQX2_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_10302 PE=3 SV=1 ----DDSAMKITQESWAMVEREipNWTDIFYDKMF-SDPNIAKLFP-FS----AGDFKTNEKFQTHTQKVRDTMHTAMTSIrefEKLGPVLKKMGERHA-DYGVIPEHSVNFKEAFLHTLKTGYGDKWNEDLDDAWNQCVDALLE------- >SRR5699024_1886671 
-KTLDPQTIETVKKTAPIIKdnVEEIGKTFYNILFSRHPELYNIFNQ-SNQ----------------ERGlqqealaygVYLAGINIVNFEPIQSLVTRVAKNNR-ALKVRPNNTLLLERR------------------------------------- >SRR5271157_2714777 MPSRIVDRLTALRAFFAEMEpqLPVIVARSYERLFDVEPAIALLFK--GNA------------REHQLRFLAKLQSIVKLTRSsqlwpasaatgqiLIPEVLDFGRSHA-KIGVLPVHFSLLNDMIAWTCKEIAPLRFTPLVEEGLAFVFDVLGASLTAK-- >tr|R7TLW3|R7TLW3_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_227018 PE=3 SV=1 ----------CAEITWAILseNRDGLGTEVFVRMFESYPDLKSAFGPLRHMNKKDAGY-EDVLRAHGIRVLSIVEQVLSKRHnmeEVLSILHDLGRKHL-TFSAKVEYIDIVSQMFLFAIESALKEKWNNSTEKSWGEIIRFVTYVMKET-- >SRR3990170_2029843 ----------------------------SPCTTTRSPCWTRPCAS--W------------AT-----------APTGSWAtstpPsssRLPSCAR--CSRR-RWTCSATG----CSRRSPAPRHYAEDVWVPELEDAWLRAYAAMSTTMIEG-- >tr|O97381|O97381_ARTSA Hemoglobin C1 polymer OS=Artemia salina OX=85549 PE=2 SV=1 -TGLSGLEKNAILNTWGKVrgNLQEVGKATFGKLFAAHPEYQQMFRFFQGVQL-AELVDSPKFAAHTQRVVSALDQTLLALNRpsdFVYMIKELGLDHI-NRGTDRSHFENYQVVFVEYLKETLGDSVDEFTVKSFNHVFEVIINFLNEGL- >ERR1719468_1094774 -PPLTSNDRKLIVRSWTIVDqqISQVGLSSFLELFRRAPETLSVFPFLKQLG-PEDMEFYHQLKNHSIRITGVISMLVKQLESeerpadeaIRDLLLDLGRRHF-SYGAKTSHMELLGRVFAESLQPIFEGdPEAKAIQEAWLVFFSVIVFWLQKGFR >SRR5262245_31323877 ----STDGAGLVMASLARVSdrSDQMIASVYEHLFAHRPELRLLFPS--DL------------KHQRAKLAGALRFVIENLRNpehVVTALEELGQRHI-AYGAKVSDLSSLGEALMSALEAHDPNPWDDLTRKAWHSAYDSIARAMSRGM- >ERR1041384_2362020 --------------------ANVLGERKvVAVLYSDLRGFGTL-----SE------------TGHAVDVLERLNDYFD----rMVAAITSHGG-------------------------------------------------------- >tr|B6BNK3|B6BNK3_SULGG Putative globin OS=Sulfurimonas gotlandica (strain DSM 19862 / JCM 16533 / GD1) GN=SMGD1_2554 PE=4 SV=1 MQELSQKHIDIIKESAELItaNDLKITNKMYEILFYKYPHLEMLFEN--------------APDNQFMKLAEALSLYAVNIDKiekLIPALELIAIKHV-EVNIRPGHYSMVGMALIEAIEEVLGKMAPIGFIDAWREVYKYVSDILIE--- >SRR6185437_15632065 -----ADDVAIVRDSYGRIGprGAALTIAFFGLLSDRVPRVRKFFPP--DD------------KDKRAVAKDLFDLVVGHLESqlnVRWVLERMGRRGL-LDTITPSDVSAVGGCLLDALAELDE-AWSPATERAWSRVYDWAASAVV---- 
>tr|A0A0K8S6V4|A0A0K8S6V4_LYGHE Uncharacterized protein OS=Lygus hesperus PE=3 SV=1 ---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNYTQttaekTKSCLRNWQRFTP-NGKFPPSKHLTSS-IHLWTFFTWNHIQPWRKHG------------------- >tr|A0A0S8CN91|A0A0S8CN91_9BACT Uncharacterized protein OS=Nitrospira bacterium SG8_3 GN=AMK69_14025 PE=3 SV=1 --GLPPSDISRIQRSFRMVAsqGEKMASRFYDLLLERSPELQKFFHP-GNLS------------QQHAKFFNGLHSLILHLEHpqaLRAALVQLGEQHQ-GDGIEIQHYPPVVDTLLQVLTEFSGEGMDGETYDAWAHFLHLVRAIMLENH- >tr|A0A0Q9HRJ4|A0A0Q9HRJ4_9BRAD Uncharacterized protein OS=Bosea sp. Root381 GN=ASE63_23130 PE=4 SV=1 ----GDRAISLALASLETMGSeaEQADIMFNIRLLETYPDVYRVFC--MDFA------------PEERSFLRALAFILAHAGPfgaIGPTVRALAPSDK-VCRLISSRYHELEETLMWTLRRRLGVAFTAEVENAWRSVLREAPGVS----- >SRR4051812_34838903 -------------------KPirNRAIKLFFSRLIESHPSLLTVIG--DDYE------------AKARSLRPAVEMIIGCLGNmeaLRPILRSMARSNA-ELGMQEHHYLTAVNTILWTMERCLGSAYSAEVDAAWEDVCWQVCEAM----- >tr|F2UFM9|F2UFM9_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_06664 PE=4 SV=1 -MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETeRLLYSS-DKS--------KSWNERHMARVGKSVGDVIKSLSNyddVIEHLTTGEPHEQ-ACCL--------TDG--YVIGTGLGNT----PRSLWLACGS-------T--- >tr|K0T9D6|K0T9D6_THAOC Uncharacterized protein OS=Thalassiosira oceanica GN=THAOC_11871 PE=4 SV=1 ----------------MEREdssGSL--PSFVSETEIEPSDVQPaaasgenNVDKGRR------------KTSSSSKRTPSITKRIESFSSfksLSSSFS------------------SKLDDERNAGEAGQAERVEsttapESVASGETQGNAGGQHTLN---- >tr|A0A165S3D1|A0A165S3D1_9GAMM Chemotaxis protein OS=Halioglobus sp. 
HI00S01 GN=A3709_07715 PE=4 SV=1 -----MTAIMMIDRDFTVTYanEAT-----LQLLRDNQATLSSIYPGF---N----------PDKLI--------------------------------GSCIDGFHKNPEHQRNILADPANLPWRTDIEVADLKFS-LNVTAIVDAQ- >tr|A0A1I2IR29|A0A1I2IR29_9GAMM Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor (Fragment) OS=Fontimonas thermophila GN=SAMN04488120_104136 -----KGVIQYINRDFIEVS------------------------GF---S----------ESELI----GSPQNIVRHPDmPveaFADFWAT----------------------------LKDGKPWTGLVKNRCKNGDHywvLANATPLRAN- >CZCB01.1.fsa_nt_gi|955242656|emb|CZCB01016507.1|_3 # 1728 # 2327 # 1 # ID=16507_3;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.493 -----GVSSFEMNQQFSAQSsdSIEKNIAAISELWQKYMATnitdeekvladkfvatrgafvkealLPAVDAL---R----------ANdYEKAKLFSTKARDLYNVAHpalVELIQYQAGHAKL-EYDTSVESYKLTRNWTIASLFLAVGFLACFAYFImrSIANPLSvifRVLDNIKSN-- >SRR5918993_5799879 --AMTPEQINLVQRSLPAILaIRDRATARAgERLAVLDRAPGRLFAG-ADI------------GRQGAVLINAVTAAMQALRsgDYGSVLAALSQYHL-SYGIGPQHFRSAGAALARALEQELGSSFTADLGHAWAAACEWVGRII----- >SRR3954452_18192940 --XMEPQQIKALKQSLATVLsAQEALAVRFhQHMRRFEQCPRPLFTG-APL------------ARQGVLLTNAIAICA-SLPskNlsQAVAAGALSQYHA-SYGIASHHFHSAADALALALKDELGHIVSDVAIDAWAEACRMLGQAL----- >SRR6516162_8663010 ---MKAETISTIKATAPVL--KEHGQAITQRMyeiaFDARPDARQLFATT-WM------VSSEEGRKQAGRLAGAVYAYAEHIDDlekLAGGSGAYRaaaRRHE-GPaGNLSGHWSVShgryqgcaKRCCHAGNPRRLARGIX----------------------- >SRR5690348_5860809 --QLPDGSVRLVKKSFAALEpvSADVMQYFYAWLFVQHPELRAMFPL--AM------------TTHRQRVFDALARVVRSTGSpaeFADQISHLARDHR-KFGVRAAHFKPFFAALLAAIREHSTGTWTSATQQAWEEALDCISAGLQT--- >SRR5258705_5637504 ----------LFSQLYQCSKntGRRSRGFSIDTCSKKHPELASMFNA-RDQSD----------GSQARRLAAGVLAYASNIDRlhmLESAITSIGRKHV-SINVRPEQYPIVGKHPLGAIKTVLGDARHPKFWMHGQRPTPNWQRSX----- >SRR3984885_15745818 ---------------------SRAtgGGWLPTRSPTGRSARTSR------T------------GCRRGRCDGNTRPTV--ggPAALGGGQCEDSARDG-KLGLSADHADSAGAGRVdlAAVRHPGGAGV------------------------ >tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1 
-----PANKNLIRSTWNMMVGdRGNGVELMGLLFQRAPDSKIDFKRLGDVS-AENIPYNRKLNGHGITLWYALMNFVDQLDSkkdLEDVCRKFAVNHV-IRGVLDVKFGWIKEPMAELLRRKCGNDCDDA-IQAWWKLIDVICAVLKES-- >HubBroStandDraft_6_1064221.scaffolds.fasta_scaffold2618798_1 # 2 # 181 # -1 # ID=2618798_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.622 ---CSAEDRSIIQEQWKILFkdvdsskiKIAVGRKLVLNLIQRQPDAKVLFDKF-NVD----EPNSPQFSAYALRLFNRIDLIINLLKDpeaLDAALEFNAERYGNIPNIKKAYFQTAAQILAYALPKVLD-DFNA---LSWQSCTRYILTTVASKVS >SRR4051794_1382573 --ALDPALLNLVERSRPRVEhkITELADQLYTALLAQVPGLRTLFPL--DP------------NGRRAPLTDPLIWLLQRLDDrdeLVRRLADLGRDHR-KHRITAAHYETAGHALLDALAHIHGPTWTPPLAAAWTRAYTAATHDML---- >SRR3954470_25015505 --EISEEQARMVKNGWQAAvdAPGDFGSDFYRDLFTVAPGVIGLFS--GDMT------------EQQGRLTHTLAETVELVDQpttLLLLLRASGVRHH-HYEVKHAYFSVMRDTLLNTMERRAGAVFDAAHRQAWEAMFDNMATIMQDG-- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514 --LISSKNLGLIRDTWAMARrDSDIAPKIFLRMFAQHPETQLMFPRFANVP-QSQLMTNKDFLQQAYTCLAGLNFMVKNMDDEDlviKLLSRMASPAFYvDFPTPGQQLDETTRLFLDVMQEELGNSFTADARNAWTTVMNQIHNVLVQQ-- >GraSoiStandDraft_30_1057271.scaffolds.fasta_scaffold222668_2 # 490 # 1347 # 1 # ID=222668_2;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.654 --LLSIKDKALVRESWTLAKsNNEIAPAVLLKMFAENPDAINLFPKISKAK-IGDLKGNKDLYNYAYSSFAGLNMIIKSIDEVKtiaTLFKNSDNPSIFlDSRSASLD-------------------------------------------- >tr|W4FW63|W4FW63_9STRA Uncharacterized protein OS=Aphanomyces astaci OX=112090 GN=H257_12922 PE=4 SV=1 --VLTPRHVELIKANWSAVCagtsafdVEQHgspdkffHRTFYATLFKADPSLRGIFRS--SL------------TLQGKSLASIIKVMTGvvSASNLVERMQALASGHL-KFGVKRQDYATLGVTLIQTLEIISGSSWSRHVKEAYLTAYCLLFYLV----- >tr|A0A024UCA0|A0A024UCA0_9STRA Uncharacterized protein OS=Aphanomyces invadans OX=157072 GN=H310_04772 PE=4 SV=1 
--VLTPRHVALIKQNWSAICrgtnafdSTKHgspdkffHRTFYSLLFAVMPSLRCIFRS--SL------------TLQGKSLASIIKVMTGvmSTSNIVERMQTLAEGHL-KFGVRKDDYTTMGVTLIRTLEVISGSIWTKEVKEAYLTAYCFLYYLL----- >tr|R0JHX0|R0JHX0_ANAPL Hemoglobin subunit alpha-A OS=Anas platyrhynchos GN=Anapl_10052 PE=3 SV=1 -------------------------------MFIAYPQTKTYFPHF-DLS-----HGSAQIKAHGKKVAAALVEAVNHIDDIAGALSKLSRRRKKERfQtkPAPKNLPLAAHrCHQLNIASKGTEHygTNPQLAWLSTGHLVSGRELISSKSS >SRR5690625_6805322 --------------RSPSHsqtltLSPYTTLFRSRNLLRNHPELKNYFNT-ANQV----------NGFQPRALASIILQFAKNINHi-yeiVPKLERVCQKHC-SLGVQPRSEEHTSELQ------SRGHTVCRLL-------------------- >tr|F2UFM8|F2UFM8_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) OX=946362 GN=PTSG_06664 PE=3 SV=1 -MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETERLLYS-SDKSK-------SWNERHMARVGKSVGDVIKSLsnyDDVIEHLTALGTRHA-RYGLHVDQLDLFINAFLWTLGAGLGDSWDHSVKKAWMHVLPFILSPLKS--- >SRR6267143_1520378 ---VTLEQIQMVQASFAKIAPivGPATDRKLRRCSALVAGFrkeTRLST--GVS------------KNPGRSEVRGTLCGASCCGSlss------------------------------NWVANIRRGI----------SP-LALAIASI----- >tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii OX=37682 GN=F775_23753 PE=3 SV=1 -STFSEEQEALVLSAWDAMkgDSAAIALKFFLRGRNN-------FVQLAHVE--SPKRRIPVVEERKTDL-----------------IFEIRTKTW-KIGQKSTAYRSW--LLLR--QKSLPa----HAPKGHLSElvpldTIDHTHQET----- >ERR1700722_6370008 ----------------RGIRPhcPavrqhLPCVLPPH--VRAGSVASHAIPQ--LS------------APLTATLTAALEALVGALGDLQPVLVrapALGLRLA-SYGLQPTDISIAASAFLATLDDELDEVSTNAARAAWGCVFWTVA-------- >tr|A0A0M1J4K8|A0A0M1J4K8_9GAMM Uncharacterized protein OS=Achromatium sp. 
WMS3 OX=1604836 GN=TI05_18490 PE=4 SV=1 SKDIKPTNIYLYQASLNRAiNTSKFCDRLYFNFMNGNIEIANIFKG-RSK------------ERIQHKLQTTLDLVADNANQvpgNNIYLEMLGRIHT-KRHITPEHFKRWKFAVINTIAECDP-NFDTEICAAWEEVLTALIDKLI---- >SRR5260221_159328 ------QALGLVREGFAAVIarPDVFVSELYQDFFTSNPRYRKYFGS-ADIGySGsADIngTGSPEighaaadITRRNAKTVEAATRIVADLDRpgvLLPYLRKLALEYR-KYGVREAHYRAFAGSVMTALERTIGQAWTYEAAEAWVDELTMVASAMLG--- >ERR1719266_796048 -VSGLGTLSIISQASWKAISGeiHSSGVAVFVEIFKAQKEVQQIFQKLNPNPNSSGIkytkdqALKESLHEHGVKVLSGVDEVLSNLDQpslCLSLIRKTGAFHRKLQGFKPKYFKCFEEPFLAMVQSSMGQRFTPQMEIVYQSVASFFVQTLIEGYN >ERR1719402_1083666 -TDLSTNQKNMIRDAYAVFekNGEKNGADAFIYLITQHPDLKKVFP-WGDVS-NEELRENQVFKDHVYVVYKGLKVAIDRIDNLKAtasYYVHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQTSFNNLLQFLVGNMKV--- >ERR1719295_364028 --DLTPEEKRCIQRTIPVIlqEAEMIGTKTYLKTFHNYPLSMIYFEPLRDKLVTEVKQTDDYLKKHGVLFVKFIGELVAEMDDpdsVDLKLKSLGRFHD-DLGVLKQYLEAIGPLFVQAIRPVLMtqasipsatncgvgvsspnSLWTRDTKPSWIRFFRVIALQMKRAY- >ERR1711860_326342 --ELNSDEKTLIVTCSKQLleIQKVLGPQMMQQKFQKV-----------------------WSKEAGEL-KQLYDMR------------------------------------------------------------------------ >SRR5215213_6828293 --------RR-----LG------------------------gRIRC-APdR-----------PQRPPVRPRDATDC---------------VQAHV-PRGA--GRAVHRGRPLpAGGGGPGPGEAVTPEVAAAWEEVYWLFAVQLIG--- >SRR6476659_6585810 ---------------------------------------------------------------------------------------HVAN--A-RFTPC-PTYVDDGAavvtNPGKHRGADAGRAFSENLSVDWNAG-VRTAPPLVA--- >tr|A0A2B4SAV5|A0A2B4SAV5_STYPI Uncharacterized protein OS=Stylophora pistillata GN=AWC38_SpisGene8312 PE=3 SV=1 -------------DTFGPKEsRCREESVCKVRLLELNPNLQDAFPSFRGVS-LDELMNSRSLFLHSKRLMAVVEEAVSSLDDakeLIEDLTNLGERHL-AMSITEKHLKNLQRAGPATNQDAKHRLLANKGTAQIDRHIARMEDTRLP--- >tr|A0A1E4GLJ3|A0A1E4GLJ3_9CAUL Uncharacterized protein OS=Phenylobacterium sp. 
SCN 70-31 OX=1660129 GN=ABS78_22870 PE=4 SV=1 --ATAFARAADIEASLELLAerDIDPTARVYQRMFELHPQMEPYFW--RDTD--------GKIR--GEMLSLAFAAILDFVGErryADHMIGTEMINHE-GYDVPRDVFATFFAIVRDALRDLLGADWTPVFESAWEEMLAEIESYARQ--- >SRR5699024_10012150 --------XLVCLLSLPCPhpHLNSFPT-RRSSDLSKAPELYNIFNQ-TN----------QERGIQQEALAYSVYAAGENIdqlDNLKELISRVTEKHA-ALGVKAEQYPIVGETLLEAVEDILGSdVATAEVIGAWEKAYNYIADAFIE--- >ERR687884_344007 ------------------------------------------FPR--TT------------TAHNGRAQQSSTANRRaDYPRrapMNNLSRLLKESWT-LVEEQQDKYQVVGDALLEALRTFAGDQWTLEYDQAWRDGYALIAQRMIDG-- >tr|A0A0J1H5I9|A0A0J1H5I9_9GAMM Uncharacterized protein OS=Photobacterium aquae OX=1195763 GN=ABT56_07590 PE=4 SV=1 ------DFHQIFNDSYQRCqRHPQFFQIFYRNFWQQEERFQKMFEN-VDM------------TRQIKMLKLSILMIMLASTSeeAKDNIRRYARRHGPdGIGAQPEDFDIWIDSLLKAVKECD-THYNSDIDKAWRTCFKTGMEIMKQET- >tr|A0A2E7C7Y6|A0A2E7C7Y6_9GAMM Uncharacterized protein OS=Haliea sp. OX=1932666 GN=CME43_15375 PE=4 SV=1 ------TSKELFLHSVTRClTHETFIHAFYLRLFDASEEIRAKFRF-TDL------------EKQNAMLRRSLLLYAEATAgRteALREVNERATTHDRhHLDIQPHLYAVWIDTIVTTARDFD-LQWNDDIEVAWRTILGHVVQQMIRRY- >tr|A0A0F6YJJ2|A0A0F6YJJ2_9DELT Uncharacterized protein OS=Sandaracinus amylolyticus OX=927083 GN=DB32_003309 PE=4 SV=1 --------MDTTLDSFRRLRERGFAHRFYEQLFVADRRVPRLFAG-TDL------------ARQRDLLEHGISMLLAYQRgSalGEIAMRRLALLHGPrGLDIDHDLYAIWLRVFLDVAGELD-PEWTPELAAAWHAQLGASIAEMHRRG- >tr|A0A244CWV0|A0A244CWV0_9GAMM Diguanylate cyclase OS=Pseudoalteromonas ulvae OX=107327 GN=B1199_05805 PE=4 SV=1 ---------------------------------------------M---ET----------VNSKAKVLNKLLIA------tsVVLISFIVSLQLA-GVEMGQSSIIAILVFGIASIG---AMAF-------LYKAVEQIADKLNVIEE >tr|A0A0L0EW98|A0A0L0EW98_9GAMM Chemotaxis protein OS=Pseudoalteromonas rubra OX=43658 GN=AC626_03140 PE=4 SV=1 ---------------------------------------------M---NS----------QSIQSSLNNKIIIA------gvILVISIVVGIQLG-ASGAENMQLVAVALPLFGVVV---ALGY-------LKMALSAVSAQLGCVYR >tr|A4BJG5|A4BJG5_9GAMM Probable methyl-accepting chemotaxis protein OS=Reinekea blandensis MED297 OX=314283 GN=MED297_02020 PE=4 
SV=1 ---------------------------------------------M---NQ----------LNN--ALSARILIV------gtgPALLLVILNLALA-GSGSA--TVLNL---------------------------------------- >SRR4026208_2063884 -R-SVRTSKGHRQGHPPAIQkhGGAITTAMDARLFE-NEEVKAMFDQAAQES-----------GEQPRRLANAILAYarnIDKLDMLTAAVERMAQRHV-ETGVKAQHYPYVANALPPTIRDGAGG-------------------------- >ERR1712080_92393 TMSLSAGEITAVTASFEAVKadLGTNIGKVLQKLVAEHPDLKPHFPW-HAVP-TADLLGNDGFKTHAAQVGRGFAEAAGNLSNLsacEGYYVSLGDRHK-TRGFAAAQVPMVADAFVAALQ------LTGDDASGWTKLITFVGSSIVSG-- >ERR1719334_3108017 -TGLTPKQAQAIISSWENLNSEC-SSLLFKQLFTIFPELKEYFG-FSKRELVDKILNSEEMIAHMDATWNGLDKLVLSTQTgtrFAAIGKGLGYNHF-KFEIDRQDVHKFMDFFKQVLKDDLKSQFHGDLEEAWNIWCKAVEDVFIMGY- >SRR5207245_2384740 --NPQPST-HAVTEQVVTLDv-----LPWTSGKLGLGPGKarlsEPLAP--GDT------------LE---SL----------LERQrarIPGfeewVYDArerriheHCTLL-VNGQAEYRRHTAEVEI------------------------------------ >SRR5689334_4915957 -----------------------------TASQRVTP----SLR--GKR------------VPSGQmgdRKVPD-VPIVDAHVHLwdpTAFrmpwLDGNKRLNR-PYGLADYREQTAGLPI------------------------------------ >GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold2022664_2 # 351 # 797 # -1 # ID=2022664_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.631 --------------------------------------------------------------------MPDFPI-VDSHVHLwdpNHFritwLDGNPRLNQ-RFAIPEYREHTAGIEV------------------------------------ >MudIll2142460700_1097286.scaffolds.fasta_scaffold02451_1 # 3 # 1031 # -1 # ID=2451_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.574 ---------------------------------------miGSRAL--AAL------------FPHPKTFMDTKRPVADTHIHLwdpGYLtypwLETVpaiagph----G-PAELQVQEPETDRFRL------------------------------------ >SaaInlV_200m_DNA_2_1039689.scaffolds.fasta_scaffold02144_7 # 4497 # 5432 # 1 # ID=2144_7;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.499 
----------------------------------------------------------------LQCGVATVRSVIDSHVHFwqpQRLrylwLDEVpair----H-PFTPHELNQATQAIDL------------------------------------ >tr|A0A0K2U629|A0A0K2U629_LEPSM Cytoglobin1like [Saccoglossus kowalevskii] OS=Lepeophtheirus salmonis OX=72036 PE=3 SV=1 MTLLTKKETFLIRESWKLVTPEmtKHAVGYYIGMFVSYPKWQDrFFRRIKGIP-LRDLRNNPILAAHSSQVFSAVSNLLNNLENtevIVEGVKKIARTHW-PLNIRGKELEAGLVLLLDYLEASFPGQISKECGDAWNKMFNAMSGVIVD--- >ERR1719474_2118124 --SLNPTQKCVIVATWHSIFlkhMNFMGKQLFVDLFKVEPNILKYFDAFRDVG-LANLLQSRSFQNHGVRIMNLVKFAVENLDNpekLQDHMHALGRLHV-KKGIDSKYLNIMGPTFCQAIRPMVMaeGQWSIDIEGAWIQLFKILAQMMRVAYE >ERR1719328_19047 -NGMTPEQKQLIDDSFAVLKkdVKGNTIVFYETFFKMNPELVAHFPGVSE-ADLVNLGKNEFIIQRGAKFFNMIETTTHLMESKegcLELVRMLKESVP-EGKVTYDRYKVAKEPFIKMMETALGGNFSAETKAAWRKFFDSLAETTK---- >SRR5581483_4578849 -------QIALLEESFELIAgqSVELADRTLSRLIELDPQFRLLAAR-TEM------------AALRSVLFSVLyvlRRSLHNLNTLAPALETLGALRK-DQELSSEHFGTIGIALLDAMAEVGG--------------------------- >SRR5690349_7596073 -------------------------------------------------------------XMQMTRFTDL-GLRTLMLLasaestgrrvtTRTIAVGANASEHH-VAK----------------------------AVSRLAELGMVMADTLIE--- >SRR2546430_1826610 --SMNTLERQLVRATWIDLaaAPELLAAHVYDRLFTLDPSLRLLFLG-AEL------------SSPGATLTHAIDVAVANLERLEQTVARLGPDGT-IPSVQTET-GILGDALLWAVGSMLGPiACNPAVRGAWAKCCALLV-------- >SRR5262249_54424048 --TMNAYDRELVRSTWVELsaDLEVLAENFFDCLFTLDSSLRLLYLN-TDR------------VASGRALMHVVGLGVANLERLEQIAARAA-DED-VHAIGWKTGGIAGDALLRAVERTLGPaVCSPAVRDAWSRCCATLV-------- >tr|A0A2H1V3P2|A0A2H1V3P2_SPOFR SFRICE_008656 (Fragment) OS=Spodoptera frugiperda GN=SFRICE_008656 PE=4 SV=1 ---LFGSqEFKACCsgMGMGKIGKGGIGPPVtsL--tqrnttqalfhvgflPYLRAAIQwctvqvDNSFDYLGIWT-EpVAFSVDPLLIAWlaykpTVKSEASLPAAVKSLSQtqqIP---------FR-RRSTP----------------------------------------------- >ERR1719309_231760 -TTLTEEEIQTVKTMWAGLleNSADSGLFIFQNFFELYPEQVHRFSFIRDSQGNpiPNYLKSQAMLQHSAMVMDALDGVITGVFehDplLGQMMYNAGYSHH-SKNIAKDDIEKLSNSILEVIKLVASCegSGKATKVEAWRKLLNIVNERFEQGF- >ERR1712168_640531 
-----------------------SGLVIFDHFLKMYPQQVKKFQ-FIQDKNgaiQYHYIVEPRMRVHSEMVMNAMDAAVVGIlrgHNVKQELEDLGRQHQ-SLRLK---qeeAAKEQEEREKEEEEEEEKeEE-AET-------------------- >tr|A0A1X2H2S4|A0A1X2H2S4_SYNRA Uncharacterized protein OS=Syncephalastrum racemosum GN=BCR43DRAFT_446018 PE=4 SV=1 --PPTAAQLKVIRRSWELVSdtrwpnepqtmspCQAFSIAFYDALFALDRTIESALSNI--ILQGKalsgilsHLVRTRVVLDEAK------------sidETHFARKLQAIGATYI-EFNVQPYFFDLVGPALISALQRRLKEEYTATIEDAWLTAQHYASYHL----- >sp|Q7M416|GLB1_LIOJA Globin-1 OS=Liolophura japonica OX=13599 PE=1 SV=1 ---ISADQAKALKDDIAVVaqNPNGCGKALFIKMFEMNPGWVEKFPAWKGKS-LDEIKASDKITNHGGKVINELANWINNINSASGILKSQGTAHK-GRSIGIEYFENVLPVIDATFAQQMGGAYTAAMKDALKAAWtGVIVPGMKAGY- >tr|A0A0P5UDG4|A0A0P5UDG4_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 -NILSENDITTMNNSWSILRkRSDFAPKVFVRYFKAKPEAQKLFPEFASIPL-TDLPNNHDFLNAAYSCVASLDYILPHLKIphPerCPVLMELKNKysnvdlkkfgpixxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxrcpvlMELKNKYSNVDLKKFGPIWMTAMQEEMGNALTNEVRDVWKKAFVAFTD------- >ERR1712000_676789 -MSLTPQQSAQIRSSLPVLKseGETITSLLYASLLHNHPDLHNLFNSV-NQANG-------RQPRALLSSASVKGTARWESHQLS-------------------------------MISSRGTCWRPSR-RSWGPSGRLSX-------- >SRR4051795_8230555 ------PAVT---------------------SPRVpA------------------------------------------------FgSPCPVIRQQ-RWTGAI-----IGTRQEGSVP----------SAHSTTSGD------------ >SRR4051812_47002672 ------RLSA---------------------TPARtG---P---------E----------TRE------E-----------eTPSMaERTLTAMYD-DR---R-----AA--------------------------------------- >SRR5215203_3322109 --ELSERTIALVKATVPALEahGLAITRRMYERMFH-NEAIRDLFNQ-SHHG---------ETGSQPKALAAAILAYARNIEIlaaWGEAYWYLAEVLI-ARERLIyqglaaapGGWTGWRDFTV--AEKRCESEVITSFVLRPTDGGPVLRHR------ >SRR3954470_353290 ------ARRS------------------------------------------------------------------------SPLaEGDPRYHVH-QWDRGRQPRRSTRCRVTPPVT----------NIRRYLVGP------------ >SRR3954464_15980397 ------RRVW--LA---------LL----DV-LRRsGP-AT---------V----------VRS------C-----------sEMPLfrPGNAPRSAM-GSVPIK-----SVNLNSLPCTDVLGEDATPEILGAWGEAYWFLADLLIA--- 
>SRR6478735_1414904 ------SGSR---------------------PARLaS---R---------P----------SW-------------------nHRPIgEATLVNRYG-RS---A-----AGSDVE--------------RIERDLSGT------------ >SRR3954468_7455402 ------APPD--RA---------LT----GGGETVpG---V---------R----------ASR------P-----------rTIDRsGRTLVSQSE-RS---A-----EGSGVE--------------EIERDLSGT------------ >SRR3954470_12739883 --------------------------------------------------TS------ACSRTRTSATCStsrtmarqapsprrspPPWSPMRAISTtsaRSPRVERIAQKHV-GLNILPEHYPAVAESLLGAIKDVLGVTHYSRGLTDDPDWYPYLKKHEWL--- >SRR5215831_13609655 ---------KPCNRSKPFFRinAFCSAvslalrlQRLCELPESAHPQRC----A-SCLK----------TANPAKNVVPKRFGTFISIHLrdtYIFAVSKIGQKHC-GLNILPEHYHYVAESLLGAIKDVLGEAATEEVLSAWGEAYWFLADVLMA--- >ERR1719273_448027 --------------------------------------------------------------------------------------------MD-AWTDVYN-------ALTKVLQ----------SLEDNIKGA------------ >tr|A0A0P5DF02|A0A0P5DF02_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 ---KPANDRRIIRKTWDQAk--------------------------------------------------------KDGDVPpqiLFRFI-------K-AHPEYQKMFKSFADVpqae------LLGNGNFLAQA-YTILAGLNvviqslssqelianQINALGA---- >tr|A0A0N8DDV1|A0A0N8DDV1_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1 --------RRIIRKTWDQAkkdgdvppqilfrfikahpeyqkmfksfadvpqaell----------------------------------------gngNFLAQAYTILAGLNVaiq---ALSSrslLPTKSTRSEVPIS-PVeLPPSCSSNSATSLrksllk------SSAAPSTprpdkpGRTVCALWSLASPRTSRTPK---- >tr|A0A0P5CUZ8|A0A0P5CUZ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 ---------------------------------------------------------------------------------MFNPAGKT----S-GVPATPSFP-PSSSIssrrlpa------prSTSSNSLANLTKCSWVR---------G---- >tr|A0A0N5DPZ7|A0A0N5DPZ7_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 -MNLSAKELQLIEQSWLDIeNKDELGKEVFKRVLLSNEKIRTIFDL--HTCPDDELDQNETFKRHLKSLSLFIGICATSVavgsERLVSIARRIGEKHVNFRWVtfDAEYWLLIKGIMVDVIASKQRPKEVEKVRSAWNTLLSFVISEIKH--- >ERR1711868_89060 
--GLDKKQLALLQKTWKDISteMEAQGVRLFVEIFQSNNEVIHVFPSLNPNLKGNraNEVIHEAFKNMEAKLLPESMRFFT---------------------------------------------------------------------- >tr|A0A090L154|A0A090L154_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti GN=SRAE_0000030700 --NLSHEQQALIRKSWRRVPKQNIGKIIYQKIYQKCPELKNFLSS--DN---------NCVERHFRYFGDMLQCTVDSLNELdkalYPWLTVIGSGHA-GFAITTAHWDAFGEALISSIKQWILSgKEHKETVRAWMKLSCYLIDTLAAA-- >SRR5256885_864722 --VLTDRQRAIVQSTVPLLEtgGEALITHFYQTMLGEYPEVRALFSMAHQQ------------sGAQPRALAYSVLMYAKHIDRLEalgDLPAQIDRKST-RLNSSHLVISYAVFCLKKKKRTGSDS--------FTRSE-----RLVV---- >SRR5256885_6575144 -----------------------------------------------------------------------XMVMSMRGPALEaagTTGCRSCSAAV-CCSFF--------FQAEDGIRDYkvtgvqTCAlP---------------ISDILIGA-- >tr|A0A016SWG0|A0A016SWG0_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0168.g192 PE=3 SV=1 --QLTSEEMDLLRSSVRIIseNATEVGCNTYEMIFEQSPYVKEFFH-FTKSD--DDAYRQKQTVQLAQKYMQVLIAFVEGIEDpsiLEPVSAKLIEIHRKvddVQ--MAAHWGVFTECTLYNIRKALEKDehFNDmdrldAAVMLWRMVIRGIVRRLKA--- >SRR5262249_10507301 ---------------------------------------------------------------------------------NvkySSHHQQHGPQAR-GVRSTNLAFCCVWRRTEMG----------P-ATAVWSGVHCRDAAGMDG--- >tr|L7MTK4|L7MTK4_SYMRO Neuroglobin OS=Symsagittifera roscoffensis OX=84072 PE=1 SV=1 -MQVSEEQQSLIMEDVQVLlpNYDDFVEDVLQQFMEENPETFQIFPW-ADASkTAKEMRSHPRFKSHAKSIGKVISDCLVDLNGvkkHEPKLSSLGAMHT-KKKVPTELFGKLGGCILTQVVKRVSeAKWSEEKKEAWLKAYGIITV------- >ERR1712227_290716 --KLSTKTIDLLKGSAAEIKenGTAIATELFKILFERYEVFKDLFPA--DVI------KNG---KMISVLPhalSAFAEFADNMLELDDTINRIVSRHV-SNGVQQWHYPLLEECFIDALDKTLKLDKRPELLQAWKDGFKFLANKVM---- >ERR1711868_248053 --RLTPDTIEALKYTALEIKgrGNDIAKSLFDLLFTRYPVFKDIFPD--ENI------QEG---KMFTVLPialHAFAANCDNIAAIDETLARIVTRHV-DRNVQDWHYPMMEECLIGALRMHLEDDEGMDAMEAWKDGFKYLANKIM---- >SRR5262245_20097952 
--EVTPQQIELLEQTLSELRrqSVFAAQLFYCRLFSLRPRLRRLLSG--RP------------DFHGTRLLSVMSAAVAGLSDPghfAGLLSLAARPAVREALLQGDCVRVIGDAVHWMLERHFGGQITVEVREAWRAAHIRITQVIE---- >tr|A0A044TBZ8|A0A044TBZ8_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1 --NFDDAEIQLLRRSWKTIKpeKQT---------VLQCPEVRRFFPFM-NSDLKSCEKKNKRFVFQALRFIQvdmtIFNEIIISSF-----S-------------ndIAILMLVFLECSIHQIRITLLNSkldlWNRKdvdnVIILWWHLNSGICGKIK---- >ERR1719186_618842 -----SVQTREIRGTWVVILaqLQKVGVQCIVDLFELHPFVREHFKEIlvqyGKLDPDNDNALQNVLENHAKLVMNIVHELVVNIDNLdglSERLQKLGLFHV-RNAVPKKYSSTIVAFSHTEMHN--CRdlAFNFPETHELHG-------------- >SRR5688500_15455526 --AITPYDALLLQDSFRAIQqqSGPAAERFFRELFSYDSSLKQLFAS--DRW------------RREEVLMKALGRLVDHLNSpdgVGPHLVELAREHP-AYGLSNYHHLYFGAALFSMLELVLGARFK-LVYGAWFKLFQLAVSEVK---- >SRR5690242_19663030 --VITADDVRMIQESFRRVEsvRASAAERFFRELFCYDEMLRGFFPP--DRW------------SREEQLMSDVRGLSEGLTQpdkLKLAIDALALRLD-GSLRRTPLHLYIGAAWFSTLEMVLGSQFDRRLHAAWYKLFEQVVA------- >tr|A0A1I5XDG1|A0A1I5XDG1_9PSED Globin OS=Pseudomonas borbori OX=289003 GN=SAMN05216190_1566 PE=3 SV=1 -----ADDAALLEETLEMVSsrSEDLTPDVYARFFSRCPAASGLFTvI-DpatPP----------M--GCGQ----MLFEIISLLRDsaagkPYVAsyMQQIATEHaA-FDVRDPALYREFMHSLADVQATLLGPDWSPAHAQAWDRQIAALLRHLP---- >tr|A0A2D8QSR0|A0A2D8QSR0_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP89_08285 PE=4 SV=1 -----SSKDDVIAESLSLVAerAGDVTSVIYEKYFMRCPSAEEVMSH-LDA----------Q--VLGK----MMEEVYRLLMVndyesENDYLNWEVSNHeT-AYNVEPHMYEGFFSAVIDSVREVMGSQWTPALERVWESKCEELRSEIA---- >SRR5207247_8066543 ------LDVQRLQESFARMAmhGDAVPLFFYSDLFLRHPETRDLFPV--SM------------AAQRDRLVDALGRIVSDvehVDADSGDPSGARPEDA-HIQAVRILsnAQQMADNYVADAQEY-----SSQLSTX----------------- >ERR1719419_503384 -TDLSPKEILDIQMSWAEIHQEgLVnpDVLMFKLFFEESESGRLKYSHLLkNVNLDnlnwmRDWTKVQKLKDSIDKTGEALGDVIKSLNyhdRVVDKLYSHGVVHA-KFGVTRKEIHTFCECLLMTLKMELGTNLSQEAQASWERLLKMIVEVFC---- >SRR6266536_694904 
---------------------------------------GTRFA--DSHR------------PPRTMERTGplrDRLALRALRlgvgdvvwEDVPSLKRSMCG-----------AAAAGAAPVVAAVASAAPGDPQKHLKRADQVYAKSILLRMS--- >ERR1719230_2183946 -SWFTDDRERLLKRSWQQLQldsCEEAGALLCRNYCSQSPEDAASC----G--------------MDWSAVIKVIGFPIDRMDNLafvKKRLRCLGANHA-KWETKEHQFQSMKYAFLSAPRDVFANEFTSDLELAWDLLYDFVSTEMIAGL- >tr|A0A090KT29|A0A090KT29_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X0 -TKLTENHRKVIKSSFEIFKknGVPNAHNIFLRMFKEYPDYKNVWSQFKNMS-DEELSQTPLLWKHATTFVFGLERVIRTMDDqemMILMIHSTANQHK-SWGLKKEHFFAMVHLITDILMEEKGEpDEKYAIMEAWESFYDVLGT------- >tr|Q6BBK1|Q6BBK1_9BIVA Hemoglobin chain I OS=Calyptogena kaikoi GN=Hb-I PE=2 SV=1 ---VSASDIKNVQDTWTKLYdqwEAVHASKFYNKLFKDNEDISEAFVKAGTGS-------GIAMKRQALVFGAILQEFVENLSDptaLSLKIKGLCATHK-TRGItNMELFAFALADLVAYMGTTI--SFTAAQKTSWTAVNDVILHQMSSY-- >tr|A0A0N4TEQ4|A0A0N4TEQ4_BRUPA Uncharacterized protein OS=Brugia pahangi PE=3 SV=1 -IPLTRKQKFVLIKNWKGIErdVTTAGIEMFLKMLTEHPEYYEFFN-FRNIANTakEKQASDERLSAHGAAVMKFIGKAISQIENadaFFMLLENNGRQHAHRGAFRPEMFWASYSFTCYSFSNGFIRNFFSNI--------NLLLTKVEMSY- >SRR5690625_5362168 VLRSPPpphpaasslSLRDALPLCAGVVaeHAEEITTVFYRDMFEAHPDLLNVFNV-A----------NQAVGEQPKALAASvVAFADRKSTrlnsSHVA----MSSAVS-CLKRRSPERR-RG--------------------------------------- >tr|A0A177B679|A0A177B679_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_02502 PE=3 SV=1 --GLTKTDINMVLGSWESINNDEASSIFYRELFNTYPDTKSLFVKFYSVD-NDKLIDNPAALKQLRVTWTAITTLIDYLKKgrideANKAIDYLIEKHRKIKTFQGPMFNMALEPLLYLVKEKL---TSQAYIDAYKKVFGAIFLTIISKY- >tr|A0A177AVU9|A0A177AVU9_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_06067 PE=3 SV=1 --HINIKDIERVSTTWDLLDDKKSAIRFYKHLFTIYPQTNKIFVKFHNAK-VDSLGTNAQALKIAKAMWGSASHIIISVSEgnlkeIYKSIDYLIKIHVNVPKFSPTMFELAVKPMVATIQEKI---TDPEILQAYVNIFTVIIEKLKTSY- >ERR1719397_1495121 
---FGAAQTRMIRSSWSIILaqMQTVGVQCIVDLFNLIPYMREHFKKViadsGRMDPDDDSAMQAMLENHAKLVMNIVHQVIINIDDLdliSPKLFRIGVFHK-NTGILPRYLDIMGPVFCNAVRPILLKhkMWSAETEDSWMEVFKVITSIMKRGY- >tr|A7BZS6|A7BZS6_9GAMM Globin OS=Beggiatoa sp. PS GN=BGP_3767 PE=3 SV=1 ---------ELIGQSWDKLAGkhEEMVATFYDRFFDKFPHYRKFFP--ESM------------EHQLKRMAETIALLARVTHEtevTHPHLVKVGSRHT-GYCLAREDLDNFKTIFVQVVGEYCGDDWNQEYQESWTEAfEQHIIPYM----- >ERR1712048_439078 ----------NVTTIWDSIKavpgyEEKFGRMLYEKFYEMEPESFKLFKK-TRQPAAEDVFSDPVFVQHSLEFVRLLDFFIQVLGPdielVEESLVDFGETHQ-DYGVTLDTYSSFGEAMTETVEELLGGngKMDETSRRCWVTAYRYMSMHMTRG-- >ERR1712048_1339107 ----------NVTRWWDEIKripgyEQKLGATLYQKFYDLEPDSFETYTS-NLT-PTEDIYSDSTFLENSATFVHLLDFFVQVLGPdlelVEESLIEFGARNYNDFGItTVDSYSSFGEALL----------------------------------- >SRR6516162_179054 ----RSQTVMDIEESLHHILerEKLVADLFYMVFLEKYPEVRRHFINV-N------------LRRQAVLLTMALQVVVQYYLKgfptAEAYLKILGEEHN-RRGIEPELYPKFCTALLETLSRFHFHDWSEDLAQQWEEALKLAATEMVEASP >tr|K2K1I7|K2K1I7_9RHOB Globin-coupled methyl-accepting chemotaxis protein OS=Celeribacter baekdonensis B30 GN=B30_11265 PE=4 SV=1 ---LAVKQISLVRNDFRRLAPvrPEMFKRFYERLFEIAPHTRDLYS--ESL------------TEEAIRVNGLLEIAFLSLDHpqaMFATLHTLGRDFS-GFGIWETQSDLVVDLLVEVFAEFGGEDWGTELEKAWHSVLSFIAQGMKEG-- >tr|A0A291GF03|A0A291GF03_9RHOB Uncharacterized protein OS=Celeribacter ethanolicus GN=CEW89_16165 PE=4 SV=1 ---PSARQIALVRNNFRALSPkrPDIFIPVYDRQVGEDPKAAAQYD--GSL------------CQRARVLDGLIELALLSADHptaLFATLHKMGQDYA-HYGSWREKHPFLIGQIIKAFAEATDTHWTDELADAWEQFLYFMAEGMLEG-- >SRR4051794_12469468 --------------------------------PPTMHDLRILLAG--DA------------GVRREQVGQALSWLVDNLDQprvVAATCADLGPALQ-QVGASPQRLDALGVLVADALRANFGAAWRQEHYDAWHSSARLVTSWMGQ--- >tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1 ---ASADDIALVASVWVFVkpNLEEVGNEFYDQFFAKHQDLKATiFL-------------GTNFLTQAIRVMEMFDAAIEAMCDpvaLMELLVPLGERHA-LYGIRKEHYDIFWPALCIALKEQLGDKLTDDVVQSLHRVYYKVIQVMLE--- >tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) 
OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1 ---FTPTIVRTIRTTWAAAtkDMDAFGDRLYTAVFALDRTLKeTIFKG-TN------------MSAQAHHIIETLDSCVRIMDQpnhLMSMLRQLGVRHG-AYGVGRHHYPTIGKALISALEGSLEDKFTLEVNKSWTKFFNVIERSMLEG-- >tr|A0A1V9Z083|A0A1V9Z083_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_04708 PE=3 SV=1 ---PTATDEDLMTQSWDDIIgcklrAEierrkapstepspeaptttsaivQFYDTFFSHLYVINPETRSVFRN--SM------------HVQSKALVNIVGAIRHVlhSDDAKNMVAAMAVRHI-QYGVKLEYFDNLGVAMIQTLSKLAGTTWTTAMADAWHTVIAYIICLIVPHY- >tr|A0A1I7UV11|A0A1I7UV11_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis PE=3 SV=1 MDRLTERQKQIFTETFPVVfkDSRRNGLVLFAKYFSEFPHYKNIWPQFRTLQ-DSALLASNELANHCSVYMSGLKEIVEVMDDeekLTYFMARIARSHV-KWNINKYHITNMLEGVDAVLQRSFGDKLTDEIVNAYHTLYDVIGNLLD---- >tr|A0A0P5Q0G6|A0A0P5Q0G6_9CRUS Uncharacterized protein OS=Daphnia magna PE=3 SV=1 --SMKGRGSCFDQGHLESCKkNGNIAPKAFIRYLKLKPEAQKKFAAFAEVDL-ADLPTNSHFLNQAYTCLAGLNAYSDNLGKNPKSCPYLNSP-AF--KdVKPDELKLFGEVMFNVMEKNWTIIFPRQARKAWKDGLTACDVA------ >tr|A0A258C6P4|A0A258C6P4_9PROT Uncharacterized protein OS=Caulobacterales bacterium 32-67-6 GN=B7Z13_12975 PE=4 SV=1 ------MNTQALLDSLDLVAeHGeDPTPRVYERLFARYPETEALFMG--DTR--------GA--ARGQ----MLRQAIETLLDYlgpnafaANFLRAELHNHS-DIGVPTEIFPRFYQAMAEAFADILGGAWTADMQRAWDDLTAKVEQIVRG--- >ERR1719244_673251 ------GQKDLIIASWREIriCLDEVGFDTFKQLFAHHSDIRAYFPAMKKLSS-NDVEMSRKIKEHSTRIMAVLKLFVDNIYDLekiEPSIEDLGRNHS-FRTLLGLFLSE-------RISGQL--AWR--------RCCFNYLNIS----- >tr|A0A1I3XAR1|A0A1I3XAR1_9PROT Methyl-accepting chemotaxis protein OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_101121 PE=4 SV=1 -----QAAIQRA-EACLTLSadGLVLEA---------NDRFAALL-G---LA----------PAAVADRPHA--ALLTLAERDgatYRRFLDQLAQGR-------------------------------DTVARLWHQGAggagvllELSAAVMAAD-- >tr|A0A1I3XA39|A0A1I3XA39_9PROT Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_10 
-----MAAIDMA-QPMMLLGadGVVQDA---------NAPLAALL-G---VS----------ADALAGRPHA--ALLAEAERDsaaFRRFRDAVAAGQ-------------------------------AGHARLRHAGAggntvtlDLMMQPLAAE-- >tr|E3MNQ8|E3MNQ8_CAERE CRE-GLB-30 protein OS=Caenorhabditis remanei GN=Cre-glb-30 PE=3 SV=1 -SHLTPIDREILNKSWAIVskDMQQVAVNIFQMIFEQAPDAKLMFSFM--MKDYKEDKKSNEFIFHAVRFLQVIESTMTHLDDpsqLDAVFLNLGKIHAkheEQLGFSAHYWSVFKECVLFHFRKAMKAHnkFSkhkemsfAEIDSAiilWREVLRFIIDRMKVGYC >ERR1740129_566420 --QLSSASVETVRQTAALVgsRAQEIVEAFYRGLRARYLELFQFFNR-TNQTSN----------RQSRALAVALTafaSKIDELSEIHGLLEMISVKHC-ALAVRPRHYMLVHENLLAAMEEVLEDQLTPSGYDAWSDAILYLVRLLTEQ-- >ERR1719183_2765469 ---------------ADIFmpRLEEIVMRMYNLILEEQHECINIFNT-PSLSPG----------QPLAALAACIRgliEDINVRPRLEHRVEMIAQKHC-AINLQAHNYLGLQGMFMSAAEDVLGADMTPQRFSAWSQALLFICRLVIER-- >tr|A0A0L0FUF5|A0A0L0FUF5_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_07147 PE=4 SV=1 --ICKPEELHtkdlgfivtHTNNPW--GstDEQDFGVDFFRDHADQ----------------------------SGLTSFFSSIVIIACEMYqefepSIPQLQKLGEEAK-HLDIPCHMEDNIVGYVASTLSR-SK-QFDAIEECAIFKLIWRVVLFVLE--- >tr|A0A2E9QYM9|A0A2E9QYM9_9DELT Nitric-oxide synthase OS=Deltaproteobacteria bacterium OX=2026735 GN=CL920_22905 PE=4 SV=1 --ALSS--MKEAKRLWEEGvgLHTAPGSEWVHQLVAERPEWNHFFAS-SDPE------------AFGEALFSTIDSAVHQLDDevsMFSSLREDSELFT-AWDVRACAFSALPDVLVDFVV---E-DHQTVGAQALRTFLRRVCTIVSL--- >tr|A0A0K0EIZ9|A0A0K0EIZ9_STRER Uncharacterized protein OS=Strongyloides stercoralis OX=6248 PE=3 SV=1 -VPLTERQKFLLVKNWKGISrrARDAGTNLFVQLLSEHQELGDYFI-FGNVKakDKYEMLADERIQNHGEAVMRILDSVITSVNDPQemfRILEEQGKQHAIKKNFKPELFREVEDALFYSIKLILDERYTDNMDSIYRIIMKTVLKTLE---- >ERR1719158_1160759 -------NKHLIDETMDRVanaNIAELGVICHKKLFSLSEDVQNYFYK--P---------NTMVAYILEKVLFILSNLSHEPVKIAHEIRALGMRHI-KYNIPPVHFPLFGKSLMYTFSSTLEGFWTDDIEDAWGSVFDFVCRCMTR--- >ERR1719158_1490032 ---------------------------------------------------------GGQLSFICRGHSSRIN------------RNALRVRRsrI-TNRSHSNCFSSYT----------RCSISSITCASAWATCLLR---RL----- >SRR5438270_3151649 
---------------------PQIVDRMYTRLFEVAPRVVKIFEG-KDPT------------KQL-RTVHVLRDSFDDLSALTPELEALGERHA-SWGVQEQDYAIMGPILLEAMAASVDPYWRSEYTTAWAALFQTVEDIMVR--- >tr|I2K200|I2K200_DEKBR Globin, putative OS=Brettanomyces bruxellensis AWRI1499 OX=1124627 GN=AWRI1499_0864 PE=3 SV=1 --QLTREEIDLLRWSWRLVTvdddSTSLGGNTFnAADFSSYLFCIQFYNNFISMD-EKVVEMIPSIRHQASSFADVLNQAIGTLEDLskmQELLTNLGKLHARILGIERSYFKTMGEALIKTFRDWFGNNetfFPLILEEAWIKLYCFLANSIIQ--- >tr|A0A0R3PZJ2|A0A0R3PZJ2_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1 --PFTDEEKSELLRSWKVIeaQKQAVGCDIYEMIFNQL------EP-FLCVSIKAPKELHNKFRIIVICIVGRYEEELSSVNE------------------------------------------------------------------ >tr|A0A183UUV2|A0A183UUV2_TOXCA Uncharacterized protein OS=Toxocara canis PE=3 SV=1 --RLSPRHRNLIIKSWSKTNKSKIARDTFVELFKTSADIRSKFV-FGDV-PIKRLKQEDRFLAHCERFVAALDSVIAHLDEIGaviENAEALGKYDISAepihaamaKDLRNEHWRLFGDILVERIIENDTKqpSGGSEVHAAWKMLGQLLVFHMRLGY- >ERR1719367_1435250 --------KTQLRSTWNVImsDMASIGVVMFLKMFETHPETLSSFIR--NVYSIKEIEmdewYQENLKLHAIRVMAIVEQVIHRLDEVgsvIKILMKRGLSHK-RLGVQRSMLEKMGRSFVLSIQSPLEEanKWDATVEQSWLSMFRFIEFWMGLVY- >ERR1712004_299484 ----------ILRESWKHLqsRIESLGVVTFLSLFNASSETLHTYLTPEDIATLKEQDkdkmLIEKLRVHPLRIMSVLEKTVHRLEDHqrcLKMLRQYGRKHQ-RFGVPPFMFATWPGVFYLYSSPYWKNlsNGMRTFHKLGKACFNSLHLEYRE--- >tr|A0A132A213|A0A132A213_SARSC Globin-like protein 2 OS=Sarcoptes scabiei OX=52283 GN=QR98_0035350 PE=3 SV=1 MTEFEREEIEVLREQWDRIVhyhQECFGMKLFQRLLQLHPEYRPLFG-FEE--TVEEIQNTQRLKAHGINVVYMLNMLFDNFDDmdmIDELIFKLVKLHM-MRGIDQIWLDDIIEPFELVLEEF-NAKIQIERIEVLRKAFIFIKNRMQELY- >SRR4051812_15383594 --PMTSDTIALIRASFRLAaaDPQALSQVFFRRLLLRSPGVQRMFPA--SL------------VRDPQRLVGLIDQVLRLLDRrdmLVEGLQNLGRLQA-PYAALPMHYPLIAGAFREALALRVGTLWSVDMEESWAELQALVIRIMGA--- >SRR4051795_1885912 ----------------------------------------ApRTAR-RRLQ----------PGQPGRRLAAdRAGrvgrGLRQRPAegprtdsrapavadraqarvaghrprpvrRRaRQPVLGHRRRAR-EGGHTGGRRRV----GRGLLADglCPGQPGARPLQRAWRAA-----GDGVAR-- >SRR5690554_337115 
-------YVKLLETSFQKAvenvGIEELSTRFFSRFFETFPETNSLFKG-TNIDY----FR----KFKMRVIFDFLIDIVKHPNYAEAHIAQEVMRHQ-MYGLqDKEYYFTLAACLLEAVKSALGDAWTDEDESAWNDILLVFKG------- >ERR1739838_826584 ----LFGSVWPLPLSWDIIShkVDQDGESRFLHKFESNQETEDPILQQ-FT-------QIDASIFNGKSAMIIVALTLENLENyqaLWRNLIRLGRDHF-GYGAQPMYLDLIGPHFVITIRQTLGYDWYEALEYHWLALFELIVYVMKFGWH >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1 -------NLGLVRECWDSICeqytTNELGEMVYDHLFKMAPNLTMLFTKPR--------------SYMAVKMGDMLSMLVSFADSsesMKQQISWLGLRHV-KYKIRPHHIPLMGPVFLAVVAEAAGVHWSQDTEKAWSVLFNMVCVNMADA-- >SRR5690606_39733342 -TEL--YTLSLHDALPIWVAekIGDPTRLVYERLFAEQPEMETLFI--LDTD--------HSARGH------MLTEALNCIFDLlgQRayapvLIQSELTNQD-RKSTRLNSSHVKISX------------------------------------- >tr|A0A0D2X3G1|A0A0D2X3G1_CAPO3 Uncharacterized protein OS=Capsaspora owczarzaki (strain ATCC 30864) OX=595528 GN=CAOG_004918 PE=3 SV=1 ----RHETRDAIQSSWALAIqkhddHdvtpvATFVNILFAKLFEVCPETRLVFGH--DMV------------RQGKSLSSILTgmlEFVVHPKKLQSQVKRLAHMHV-GLGVTPDMFEAFGFSLLYTIRVRIGSAWNQQIERVWVDTYGGVSNILSQH-- >SRR5215208_3780459 --PLSPEAISVVRATAPVVAahADQITAHFYPRMFAAHPALLRIFNQ-GN----------QATGEQSKALAGSVVAyAVQLIDPeapsFDHVMRRIAY-KH-VSLVSARSSTRSSASTCSPRSVRFSA-------------------------- >SRR5687768_12147577 ------------------------------------------------------------------GLAHARMDsVSLK--PpanphcaiktwvlacgvpartaeWRPMSN-L-SDAP-SPSLLSDQSLSV----VQ-TTATVVAAHADEITAAWSEVYWLVALQLV---- >SRR6476660_4664138 --M-VVVGVDAHKrtHTCVAVDgsGRKLGEKTVPATT----------------------------VGNASALRWARSTf-GpdltwgiedvrnvsRRLE----------QELV-NAGQR---VVRVPTHLMARTRasartrgksdsidaTAVARAvpREPDLPVAqHDSVS--RELQLL---- >ERR1719193_1089955 -------------------------------------------------------LKRHRRNRHEGIRFQCNYCDYD----AgqkGNIKSHMDRKHP-EIPYDHTEFQEVRVEKSkysreakqqELDLAAmqGADAFNMNPLAGIGNMMPFNAHIL----- >ERR1719378_1531842 --RFHPgaDGVHRIGGEESQ--aeVRRQRSLSLPKFLDSLSGEKEKFAFNFDSMgnVLPNFHASHAQKIHSMKIMDAIDAVISEIlrDHpIKQRLMDVGYAHY-ELHATSKDIRKLTTAFYKGVKDLIGIDDdNDRHLVAWKDFLNKIEEGFKE--- 
>ERR1719414_1806212 --DFTLEQIECISTVWANLRqsSADNGLYLLQHFYTLYPEEMQKFDFNLGDRqdFRLNFHRSQLVRDHSMKIMNAFDALISEIvhGRpVKQRMIDIGYEHY-ERDATAQDIRKFTKAIYSGVKDLMDADHdgprraaaghDDRHLAAWKVFLDMLAKGYT---- >ERR1712142_47027 --EFSGEELEYICSVWGNLRmnHPDAGLFLLEKMFLKYPELAKKFDFCRDFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKSHMRQMADGMLEGLKEVIGDAKdSTRKLLAWNKLFDMIVEEFGN--- >ERR550534_2245262 ------------------RDlrHPLGLLLALH---------GGFLSFFHGFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKIHMRQMADGMLEGLKEVIGDAKdSTRKL------------------- >ERR1719192_2788519 -------RREIIGTMWESFRedSVSSGLFILEHFFSTYPDEMDRFTFASGGQtdketPLAFIMKRERMRIHSAQLMNALDRNGHVYGRspgCMDQAPQSHRG-------------NVCRRTGKSSGIA---------VFKWRVA------------- >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514 -TNLTPQDKQIMKEDWLMINEkKTAVNNLLLKFFRSFPQAQAMFPKLAKVP-LSQLPSNVEFIAIVNSIKNGFKFVIDSADDVGLLRQLAGSQDISvftVPGIPVaQQMQETGRVIVEWVQEEMGDRFAERTRVAWIRGLRSISQAFVSGQ- >tr|A0A0V1CPF8|A0A0V1CPF8_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16047 PE=3 SV=1 -SKFTDEEVELLARTWKKDDfdwLYRIGTDIYTCVFQLAPELKVFFPYVTECeKKNQSWESSKGFRTQALRFVQILGMAVEKTESrmkdddshLHHRLYKLGETHRRfaLKGFTPTHWKGFVIAVRVAMRRAVEAmpNLtpaeCETAIEAWDKLSRYVVHRMEEGY- >SRR3954453_266974 --MLTEKSRPVLEATLPVVgeNIGKIAERFYQHMFGEHPELLdGLFNR-GNQAEG------TQQQALAGSVALFASALVSHPNHLPdHLPPRLTTQTP-RPS-------------TWCRGSRT---STPRSAFART---------SIRS-- >SRR6478609_8547471 --VlvdveevlrvvfgFDLPQTDVVRSvVLGNPgq----I--------IAVHKVDV----------------------AAGGRIGPQGGRVVPHPRDVClV-LRRVHPLR------------------------------------------------------ >SRR3989304_146361 ----------DLEASVQRIldRGKNLADLFYCVFLDRYPELRRHFTAV-DL------------SHQAALLTMALQVIAENHLRpspaAAEYLLVLGHRHH-AWGIERDEFRRLRFCSPPPPQPSHGKGGPAARPRQWRAAIDEAVDTMRAGY- >HigsolmetaGSP17D_1036251.scaffolds.fasta_scaffold61070_2 # 263 # 457 # -1 # ID=61070_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.672 
--VLNSIDEDLTTKSWNIVMsgtPtENFkakkldpcfhystslswfYDIFYKKLFELCPDVESMFEN---V----------SLVHQGKLLATVIGSALASLKKpiiLKKRLIALAQSHN-GKGVKAIHYCNMGLALFWSLEEVLGVsVMNEETRTSWVKMYSFMLNIII---- >SRR5215510_2422438 -LQMTKEQIEVVQNTFNKVRPmsGTAAQLFYNRLFDVDPSVRETLL--WTLK------------QGlGADFTPEAEVAWGNAYDFLAAVMQQAAKGA-SMX------------------------------------------------- >Dee2metaT_27_FD_contig_31_2132282_length_204_multi_2_in_0_out_0_1 # 3 # 203 # -1 # ID=1013462_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.592 -----------------------------------------------SAA------------TSNPQF----VAAV----------------------KKAIDYSGL--------LTVAGQGAVQPagiipSVIAGTLPAADALKQDVAG-- >tr|A0A068XSQ8|A0A068XSQ8_HYMMI Neuroglobin OS=Hymenolepis microstoma GN=HmN_000477400 PE=3 SV=1 --YFSEFEKDVLISTWEALLlyTHEHGAFIFRLAAEMCPELKAAYNV--EFNDDDELVISSCALQYSQAYITLIDEAIRSLEDPQEgfydSVLIAGASHATIPQMKPEFFKVLKRATLTTWEGLLGEEFTEDVANSWQTLLDYVVAVMVEGN- >ERR1719193_549257 --IFTDDELAILKDVWAHLKhhTAGAGLTILDHFFKRQHWALERFEALRDMY-GNihpDYMKIDLMRFLAVDLMEGIDIFVTGFFErdpeVTDLIADVGYAYV-KKIIIESEIEIFVDSMLAAMEELLGEDtWK-KNMAPWKKLMPVVAEHFSRGFK >tr|A0A0D6L5L7|A0A0D6L5L7_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_14144 PE=3 SV=1 -MLPASEVKKLVKSSLERVAigkepkEVQGAKDFYKYMFTHHPDLRRYFKG-AESFTAEDVQKSERFDKQGQRILLAVYILADTFDDeptFRAYARETVNRHR-QFKMDPELWSAFFTVYVNFLASRGP--LSDDQRKAWAQLGKVFD-------- >ERR1719254_19301 ---------------------REIVDDFYPRMFANNPETKALFNPA-NQ------FEEPNRQRMALtnAVL-AYASNIDEPEKLADAVAIISHKHA-GLGIQAAHYPVVHKNSGLHRARHGR-rrdaGGRRGLERG----------------- >ERR1719394_777503 ------------------------------------------------------------------------------------AIRLGDFQHI-CT-TPLPFCRESPQVQALHHSILGPEVVTPEIGQGWSDGVLALAEILYK--- >SRR5262245_29633745 ---------------------------------------------------------------------------LGNHSTrCgRSVESSQSNSTA-DFLNSRRIHDAYSpaiRAAKSKSE------------------------------- >ERR1719193_348913 
--KLEQKDIRAIREGWACItaHpgLEKTGVDWLHLSFELQPGTKHHYKNFTNK-TLEEICQTPYMKILAGKYMSEIGILVEHLEHsnfVLMRLENLGHLHA-KMGVPMETLFT----MNIVMQHYFRELYsrqdvPDDCEGAWSKVT------------ >tr|A0A1Y5FEW2|A0A1Y5FEW2_9PROT Uncharacterized protein OS=Halobacteriovorax marinus OX=97084 GN=A9Q84_13980 PE=3 SV=1 -------------------NIDQFVESFYEHFFSLTPEIFELFKN-SEIG------------KQKNEFKISIHTLLINLsqlDKLDSYFKDLGIRHI-CYNVSERHYKLAKESFLYAIKKTYADHWSKVVETKWEEIIDHVTLKMKEG-- >ERR1712238_458974 ---------------------KELIEMTDYPTFDVEGVVLCFL-------------------------------------------EWEHHKHE-NIMTFRD---HAYKALMTG-------TMAPLHHTPWKDALEDTIESYGLA-- >UPI00054DD732 status=active ---------------------------------------------------------------------------------------LTCARDF-FltfVGVERCR-PKLLKQEPQTITSKLGm-A-PMLQSAFWSIRVMRIASS------ >SRR3712207_8863908 --FFFQ---------AEDGirDIGVTGVQTCALPIYARPDLLdGLFNR-GNQAEG------TQQVALAGSVAAFASALVKTPEQLpEQLLNRIRSEER-R--------------------------VGKECRSRWSPYHX----------- >SRR6476659_5675031 --STHRPDQALRGGGRPPHraADNNAKGAATGHRVSGRS---SPAEL-PENSMR------EQQQALAGAVAAFASSLIETPERVpQSLLSRIAHKHA-SLGIRPDQYQVVHDNLMWAIVDVLGDAVTAEVAAAWDEVYWLMGNALINQ-- >tr|M3IW96|M3IW96_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) OX=1245528 GN=G210_5766 PE=3 SV=1 --SLGPVELTQIISSWSKIRnKSQFHQSLYTNLIESNPQIGKIFNN--ND--------KNVISQHALIFGDCFNFVVENIQDnalLDEFLFSFVQENQRFANMATQYLEPMGNSLIRTFRKSLGNNFNSVLELMWIKVYVFIANSILQ--- >ERR1719502_1452556 ---LPPEQSALVRRVWQRLVgTPGAAPILVRQLQSVAPEVAALLS-DA--S-STNGRSNinrGGLhavhtdpHGRAAAVLSEVSELTELLDDsaaLRQRLRQLRAR---MPPVGPEVYPSVGKAFLHFVWEGVGSGYDNATAAAFAALWDQVEETMLE--- >tr|A0A1X6PD63|A0A1X6PD63_PORUM Uncharacterized protein OS=Porphyra umbilicalis OX=2786 GN=BU14_0103s0020 PE=3 SV=1 MGALSDDTVRIVKSTAPVLkvHGGAIVDGFYALLFEQHPAAAAYFNVVPTDGgGGGGGGGRGQSKAQIQRLSMAVllyAESIDQLDTLGPVLERISAKHA-SRGIPAEFYPAVGACLLQSIGRVLGDAATPEIVGAWGEAYGFLADALMA--- >SRR5580704_1734515 
------------APRAELATgvAPDYgSPDDVASRRSQSRACRRTLR--RPTT--------------GAVRGEMLARVIEAILDFIgerryahHLIQCEVVTHE-GYDVPPETFGIFFGVVATTVREQLADAWTDAFDEAWRTLLYDLD-------- >SRR5258708_241677 ------SCGEDPAGSSD-----DHDAD----VVASAGQVEGGVD--LVEH--------------PPALGVPIAAPCQWLVDLEgagacaaNRMAAERVNHE-GVGVPPAALARFFPIVAETCRDLLGEAWTGEIEAAWAGLLTRLA-------- >SRR3954465_11422119 ----PCRSSPTTSGRSPGAs-TRT---------------CStAtRGCW-TGPStgatrpR----------APSRSRWPGPSRsspaHWSRSPSRSpSTCSpgSRTSTTHsasprpppP-PPPPARAERGVVQDNLFWAIVDVLGEAVTPEVAAAWDEVYWLMAYALVN--- >SRR3712207_885952 ------------------------------------------LGR---------------------------GlladGLRAHPPGAgALQR---------PRRAAGDGVAGVggRRGENRERGRREPPPAAGAGTPGVDRAAPPGRCRPGT--- >SRR5215467_2668635 -----------YLHSFPT-rrSSDLPPSALYRHLFTTRPELLDgTSNR-GNQAD----------GNEQQALAGAVGafatALVNTPDRLpENl-LARIAQKHA-SLRITSRSNRLSGQGPIAPL---TEDQ----------HPX------------ >SRR3954465_6877418 --AtaaaTAAASSTDIRATRPASleG-------------HDRPHLDTaEAGR-AQLAD----------GEGDIEVGGVDEvvatqHLLRLHERAvGHlgpPTDARRGAGR-LQGVAAEELGTVRLDLDGELVVRLHDL-----VEDLGRRRRVLALVLVD--- >tr|A0A183INM6|A0A183INM6_9BILA Uncharacterized protein OS=Soboliphyme baturini PE=3 SV=1 -VILSNYQKTLLRDSWLRINktgIRNIGTMIFRRLLTKQRSIKQLFQHITVLEGvfSAGLTPIQAYQHHSLLFVELIDNAIKNIDDLsvlIPTWIEHGAKHARfkAYGFEIEYWDMFGSTMTEAAREWEGWRRHRETIRSWTLLISFIVDRLRQGY- >SRR3954463_14455484 ---AQ--------------------------PRAARPSALRLSRP-GDGAP--------------FLLRAEvACLasGI-----g-----------TF-GPGLRSHPLARLGRS-----RALRGRAVLArCPPKIWSPLD------------ >tr|A0A1I8CQM9|A0A1I8CQM9_9BILA Uncharacterized protein OS=Rhabditophanes sp. 
KR3021 OX=114890 PE=3 SV=1 MNKLTEKRCDIIKETWEIYKqdGINNTIKIFFHLFTEHPEYKYIWPQFRGIPDS-SFILSSALRNHAEVYTAGLSIIINNMHNkakMYAHIKKIAYAHV-KWIIHQSHVQNMVPGLMMVLKDKVPH-FDDSIEDAWKTLYGVIGSLLE---- >SRR5258707_573086 --------------------------XMILKSFKPNAAIGC-K----TIPT----------W-----FVP-LPTFTAGLTLPKLyplSVFGMRRYN--LGGLGEPH--QVEAALLWLVEKQFEGVLTREMRQAWVQFCQWLV-------- >NOAtaT_7_FD_contig_111_1754_length_212_multi_2_in_0_out_0_1 # 1 # 210 # 1 # ID=13324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662 --RLKPKDAEYLQDSWKVFlErsggLEGAGKEFYRLLFEKEPDLKKLFQV----P--E--------MSQAAAFMRAISRYVSLLAQpeqLKTAIEMLAFMHV-NLGISETSIFAFAESLLECVEDQLHDWDpgeVEQVMVLLTDLTTYIGRVIA---- >SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold554780_1 # 1 # 420 # 1 # ID=554780_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.669 --VLTSSIYlttgTVVTDFSVIVlDaegsAIEPGEAPYSLRVYFTPASTGTstatIQL----P--S--------GLISDgMLAVGARRLQEETINprrLAGACEAYGATVTSnvlTVNVrksgTASDPCDSTDAISLLFAGGMATWNslgTSVTSADFtmstnvdsdsvTYRLTFEENVFL---- >SRR4051812_4293204 --EPLAAEQELLGQTWSDDFefLYELGASIYQHIFNTIPETRQLFPKIPTINNG----RwceSKEFRAQTLRFVQPLSFAVNNRHDierVAEHLFIIGVKHAKlvERGFRA----EYLDCALVSYFLKIFKFkyFIv---FIGFRT-------------- >ERR1719295_1797159 ----------NIHVTFDLAltsDPKGFAENFYKGLLKEQPDIGQLFLD-----------KNTTFDTQSARFMAMLMHAIKMLDDtdhFTQSLDSLSEAHV-GYGVEVPMLDAFGKSLIAQVKVmnikyfeeqakggggggdekdeSLdimRvGEWTKKQDDSWKWFWSVVVGVMSAG-- >GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold789473_1 # 1 # 552 # -1 # ID=789473_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.562 --RIPPLKGSSLSAGWRTASSsgLS---------------------------------------------RNPRGTVSR-----ESGNTVFQSETF-AGAASPRGGSLL-C--FT--GENEPMGMINNLKT------------------ >ERR1712012_1094824 --SLTTSDIAAIRQSWILAkDaapFEVHGPAFYKLMFETYPSWRFAFNHMGGHLSIEVQIENTRFVKHTVTVFRFIDKCVNDLDNPtqiLENIKMVAKIHA-LQGIGVKDFIIIKAFICSKSD-KVGAGRSKNSFIFFPRFL------------ >ERR1719232_197721 
--SLTTSDIAAIRQSWTLAkDaapFEVHGPAFYKLMFETYPSWRLAFTHIGGHLPIEVQIGNSRFVKHTVTVFRFIDKCVNDLDNPtqlMDNIKLVAKIHA-FQGIGVKDFVIIKDVVLNYFSTALGPALTDAAALGWSnfmDLM------------ >tr|A0A085MKY1|A0A085MKY1_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_01110 PE=3 SV=1 --------ASIIKEQISKIEvNEENGGKLYEVFFTVKPEFHKFFD-LKHAPEGKDVAHNQRFKTLGKLFLEKLKRIVMACEDehqLKEEIKGLKMDHD-PRHVGLTELKGAKPILMKFIEQQVG--MTEEQKHAWTEMFKKF--------- >tr|A0A183IBE5|A0A183IBE5_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 --------KHVLMEHMKRLNlTNKLGGKFYHQLFQSlPEAKSQFA---EHFDKLEDVENMKYYQQLGHSLLSLLKELPEHCDDdhaLKQEIMKIKKKHD-EKHVDAKMFKKSKPAILKFLTDNTQ--MTNEEKEAWDHLITHS--------- >ERR1712025_717817 --TLSPEHVDPITESAPSGKakGMVIANNLYRKLFSRHEMFRAMFPE---QS------------QQSGKMIQALPSALydfavncDNMGQMQSVVARIANRHV-QQGVQGFDGTFQFIPKKVDLsliPAGQCEAKLKVALNARQPGtgvgdrFQLHPSEVC---- >ERR1719495_824226 ------QDIENVRKTWEKMIakheLQGVGLVVLTAWMNEHKEIRQVFAK--SFPIIDklekdvldlVQLNDPTLNEHATIMASSFGKMIECLDDteFVQMMIDIGKKHT-GFRVSADSFDTsLNSTLITALMALSEEKEDSPNIKSWKTVVEVMKHYLKQ--- >ERR1719210_734039 --HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VFELKANEDLNRHGMYILGVIKKIVGKNDDteyLEKLFDDLSDLHR-RLGVEASGMDIFGKVFCKVMRPILLEkkKWKPEIKDSWMTFFSSIVKVMKK--- >tr|A0A2T7P177|A0A2T7P177_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_12319 PE=3 SV=1 -----------ITRSWKCFYekVCSFGVYEFLNLLTDLPEYEEAMRLI-KLTSSYKFLSAMDFNAHFLSMLTIIEKCMARLevDDlplLEDILHKVGTDHI-GRGVNPENFDLVIPPMVAGMKQMLEDKWTEKEDIAWTNFFTLMIHIMQE--- >SRR6476620_7243483 --MLSDTSLPVIQATLPVVgeHIEEIAKRFYKHMFDARPDLLdGLFNR-GNQADG------RQQQALAGSIAAFAGMLVDKPDEVpDHLLSRVAHKHV-SLGLSPDQYQIVHDHLFWAIVDVLGDAVTPEVAAAWDEVYWLMGNMLINKE- >tr|B3RTB3|B3RTB3_TRIAD Predicted protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54902 PE=4 SV=1 -----------------------------------------------------DLIKDPLVRSHGLRFMKAIETMLEIeFDSngCIFLFSAIGNRHC-SYGIEADYLDYVPQAFRFMLTKALGNNYTDKIASVWDEILSHIIKAMQDKV- >tr|A0A2G9TV92|A0A2G9TV92_TELCI Uncharacterized protein (Fragment) OS=Teladorsagia 
circumcincta GN=TELCIR_17315 PE=4 SV=1 -------------------------------------------------Q-KNSSSNKQAHRKT-----------------tsdTHQDL-RRTRDQP-CEKCPQSPRYHMLEPVLAVVKE-CNDDIDDETIQAWTTLYLIIAD-LIEIY- >tr|A0A2R7X9G6|A0A2R7X9G6_ONCFA Uncharacterized protein (Fragment) OS=Oncopeltus fasciatus OX=7536 GN=OFAS_OFAS019380 PE=3 SV=1 ----PPVDINAVQKSWNGIKsslgdkaPEAVGKLVFENLFSNYPYMLEFFKNYGET--KEDILNNKKFMFHAKeRVFKTFDKTVNNLGNeaeLNNIASWLAEVHV-SRGIKPPDF------------------------------------------- >ERR1712018_1077981 ----------------------LIGCQSFQAFFDRSPEILSHFDKFNAIEI-DGVLVSSALKMHSSRVLAIVEDMVENTGNpekIRTILQDLGRNHY-RQVKPILMhFLX----------------------------------------- >ERR1719199_1665450 --------KPMIRECAAKVvqmDIVELGLRFYVHLFTINPAASAFFTKPKWMI-----------SAIFGGVLRFYVHLFTINPAASAFFTK----------------------------------------------------------- >tr|B3RTB2|B3RTB2_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54901 PE=3 SV=1 -------------------------------LIKLSPATKIYFHGV-DFEkRDSYLAKNTFLRNHAARFMEAINVIIGQdMDIfsVESYFRVVGSKHH-SYNLKLEHVQDISDAFLEMARNALKKKFTKSTEAAWRSFFQMVTDAIKN--- >tr|A0A1B6G4Z3|A0A1B6G4Z3_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.45438 PE=3 SV=1 --RLDDNEMELIREGWKCITeSEDN----FRTAFSSKLaqknLAKVHFKHVENVSITDEGFSHEFLMSHSVDVMNTMHLMFNDIRNPeswMPEILRIATLHK-LFGVTLEDLKRFRCCVIEVLQQCLGEdGYTPQIKDVWDRVLECIEI------- >ERR1719383_1602644 ------------------------------------------FGLH-L---------------------QSTMLVGNDLDpvdERG--PDHCQQALW-TASE-GRTLSHRRREPCRSVLEVLGEdVVTPEIGGAWREAVQALAKILID--- >SRR6185437_4905046 ---------------------------------AENPEMEALFVR--DTA--------AL--VRGQMLAVVMEGFLDFVGDqdYsARLMQIERVNHE-GLGVAGRAPRHCGAAGGRSLTHFPGKP------------------------- >SRR5512135_1032698 --NMDQETLSTVDASLQRCNRdSRFLDLFYEKLLASSPKVREKFAH-TDFV------------RQKRALRSSLWMMLLVAEdeEkgPARYLRGLTAIHGSsGLDIGAELYDFWLDSLLETVAVCDP-EHDAKVNAAWERVMMVGIHYMCTHYH >ERR1719336_1989132 
------------------------------------------------------------QDRKGGgGTPGKLKVTAKYNDGtefVDefntvifaigrdactakmgleGVGVALNPKNG-KVlhneler-TSVDNIYAIGDvldgkpeltPVAIQAGKLLARrlAGTSEVTTDYVNVCTTVF-------- >ERR1719278_462770 --HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VYELKANEDLNRHGMYILGVIKKIVGKIDDteyLEKLFDDLSDLPL-LLLQQDRPHHLAKNLPKNVHSGSLYAeppvkvaEVVEELLQVLCV-VDLPHNLL----- >ERR1719210_1454089 -----------------------------------------------------------------------------------rrclgyacf----ASFHKSQ-TIlklshdrdrferqkknPQQSSSFRRCGTsmgqsesslTAANLTQAPTLRpaEWDPNMYQSL---------------- >ERR1719284_537611 --------------------TEEIHSEFQSLLLQHNLELLSVFNI-PRQS--------DDVIDAEteeiasHHLAGVVLAFAAHVGHVQRmrELDQLAAKHC-SHNVHPFHYVVLHEHLLDAMRKALSTMLTPEVQYSWSQSLLFFAKILID--- >SRR6266536_2537548 -APLSGREREIAMLAAAGLASKDIAERLYLSVRTVNNHLQHAYTKLG-VSGR------AGLAEQEIKFAEKLTEIVramPRLDELLTHTRALGARHV-SYGVRAADYQTLGNALLAALAAVLGGSFDAPTREAWTLAYNLVAETMLDG-- >SRR3954465_13942299 -HPLTGREREIAMLAAKGILSKDIAARLSLAVRTVDNHLQRAYTKLG-ITGR------DQLADVLAHDTTTHPGPX----------------------------------------------------------------------- >SRR5699024_12637729 --TLPKGDHPLV-----LVsaGIGCTPMVAMLHRLVETA--------------------------------RERQVLVLHADHTpEEHAX------------------------------------------------------------ >ERR671932_89059 --S-PTSCGPARACRSCCCtpTPPRRRSR------------YDgVHEG------------------------LMDLSSFPLPDD--ALFYLCgplpfmravREQLL-DLGVSPRDV--qyeVFGPDLWQADAdeGPGDAPEPgahdllgpEERQGPPPA-WSRPG------- >SRR3712207_7345787 --V-LDDVRALPNATVHVWyeSGAASALP------------VDgVHAG------------------------TMDVRSEEHTSELqSRQYLVCrlllekk--KTI------------kyeSTXX------------------------------------- >KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1083625_1 # 3 # 881 # -1 # ID=1083625_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.686 ----------------------------------------MEYEI--------------CLEPSGIRFMADAGQNIVEAAKqhgIpIKHGCASGScgDCK-GTILsgDSEQGPFMPLLLLPTERAA-G-------MAILCKLYP-RSDLRL---- >tr|A0A044RBY2|A0A044RBY2_ONCVO Uncharacterized 
protein OS=Onchocerca volvulus PE=3 SV=2 --ILSEIQQELIRQSWQTISgklevtEQCFGFFVYRRVFERNASLKQVFHV-EEYDSLESVPNEHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAaesFNEETVRLFCSQVVCTVVDLLETDIDPSCMEAWIDMMRYIGCKLLDGF- >tr|A0A0R3RKB4|A0A0R3RKB4_9BILA Uncharacterized protein OS=Elaeophora elaphi PE=3 SV=1 --ILSEIQQELIRQSWQTITtklesnKRNFGFFLYQRVFKRNSMLKRAFHV-EEYDLLESVPEKHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAeeyFNEETVRLFCSQMVCTVADHLGGNVDPACMEAWIDMMRYIGCKLLDGF- >ERR1719384_507171 ------------KKCWNELmkDKVNVGERIFDYILTKEISMSKLFMQ-------------TNIEQQSGIFMVMMDKVVGFLDDkesMNDNLIKLGQLHVEKYGVKTKHFKHFRAAFLKAIKKYLP--WNDRREEVGSSFGLELLIKCRC--- >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1216141_1 # 2 # 73 # -1 # ID=1216141_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.347 ---FPDGVCMATIELTVLPvRpled-----DEKFQIILSEAQGGASFNPNDD--------------G----GKDDGvlTIVIKNTLQDpkgLKVLVESFGFQHL-DFDLTVPRVVVFRDSMVELMEAELQDRFTYKAKDG----------------- >ERR1712214_179591 -------------------------------------------------------------PGHAgRREGRRSARQPGTGKDRqksTKYLLELGKFHR-FSGIPNDYFGVMGTIFVHAVRPYWEEagCASEQTEVVWMMLFAHIARVMTH--- >ERR1719458_2209728 --HLSDEHKTLVIDSWDFVPgfISEAGYKAFTDFVKLCPYYAEAFPFVKKKEEEF-SHLLCEHARKVTGEFGLLAKLISELKTkppeksndqvIHDIMVPLGRRHV-AF-------------------------------------------------- >ERR1711928_171062 ---VSATQESHP-------------------------------LDLDSHE-IQQQRRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFET----LCFRWIQHD-----------CQQYGX--- >ERR1711928_123369 -------------------------------------------------------RRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFES----LCFRWIQHD-----------CQQYGX--- >ERR1740128_75568 ---VTAQEKTLIRATWDQMMfNSEVAPKFMLRLFSEESQHELGgnFaVEHHLVP-GGadegLLLGSNDGFSNTLDVRVG-----------------------SHlLGNDAi-------DVVHDVFQCFLGGSIGRGDlfnglHHNMGRFVQLVDGX------ >ERR1719219_701605 ---VSAAHKSLTRSTWTLMKfNSNVAPKILYKMFTTYPET-QKMyTRLADIP-ASQLMENKQFLALSHSAFAGFNMIVNNMDDPELIKLQLSKVDFPGtFVYPFpgtsLNTSKPPASSWKYSPKN-SAPLSPRKPLPLELPFELRHQGFGK--- 
>SRR6476646_9453568 --PMLRTRLQLAEASYHRCAeSGAFYNTFYTHLLASDPRIPPMFAR-TEF------------ERQHRLLKHALGLLIIYAKHAnPAMLERIAQRHQ-EIGVLEDLYPAFVESLVLAVAEH-DPEYTPELADAWREALAPGIAFFIKRH- >ERR1719347_2568912 --------------------------LPPPTHFLPLPGINRKVRIFQRQFgnQTSEFLTGKALRDHSIRVMDALDSVIVDTlKgkDIHKQMVDIGYSHL-KMGVEPRQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED--- >ERR1712189_147645 ----------------------------------------------KPDF---RIPDWKSTPRSQHQSHGSLDSVIVDMlKgkDIHKQMVDIGYSHL-KMGVEPKQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED--- >ERR1719412_2466027 --NLRPLDVTNIKESWHSVEqqLVEVGIRVFISLLENQPNIKRTFRKYRSKR-HSELRINEDLQKLILYLICGLKRVVKYLNDnkaMGKYLRRIVKKHS-PTEIDFTRINpaELSTVFCSAIKDIVdahqaasaklqsvsetsspectspSTCWTIEVEESWTTLFGSLLNATR---- >ERR1711860_392201 ------------------------GVHVFLVLFESQPQMKRIFRSYRGKK-HSELRLNEDLQQLVMYLISVLKKIVKYLEEsrtIVKYLRRIAKKYS-SPSIDLARFDphILTPIRVRRRHLFSresivfekRLKWPQK--------------------- >ERR1719266_3067024 --QLAPNDIANIQSSWTLIEpiLLKVEMAWLLLFRHIAGFMRNGYNSVV----TGPL--------------------IRHTTNcatS--TSSRMSNX------------------------------------------------------- >ERR1719264_357726 --EVGLCDALNIQQVWPRIEqyLLPVGTRMYISILDGRCDKIIFCNKACCRKNasksssakstrsvysksvsrtcPNQVILNEELQKFVLLLMGLIRRAAKHLDNpshSAKVIRKVTKKrFG-KLNIDVTKIAfePIALNFIASVREIMtnTRHWNTETEASYYTLIRNLIAYVQ---- >tr|A0A2G2R4B7|A0A2G2R4B7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_07540 PE=4 SV=1 ----------------------SASDKFYNVLQNDLPEFTQLFTN--PE-------------KQHMMFYAALRSIDGLKDNktkLAVYLRSIGVKHK-MLGLTHYHMEIGRNAFEQAIFA-GGKDLTHDQRQFYIDSFSQIEKNM----- >APLak6261687352_1056175.scaffolds.fasta_scaffold62437_1 # 2 # 238 # 1 # ID=62437_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.447 -VRFPKDVIEEAQQAWMSFtmasTKEAAGEALYSAIFHAAPSLQSLYKIPR--------------PTMALRFMNSINAAVAIAHRpsaLKAQAEALGFQHF-DIDVTPSRGDIFREAILEVLDMELGSRFTTRARMAIGAILNYLIGANI---- >GraSoiStandDraft_15_1057317.scaffolds.fasta_scaffold2262553_1 # 37 # 405 # -1 # 
ID=2262553_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.610 -LQLSQSELFALGRSFELLlqglgnDRDRVGDAIYGAKTANLVVFKDKFITPR--------------AVLSLALFNGFRVLGHKSADpeeLRLFVETMAFKHL-GLDITLQRVTGVTDSFLELCQQNIKD-MPPGSLLAWRKLMTYTGSCFR---- >Go1ome_3_1110792.scaffolds.fasta_scaffold06098_1 # 3 # 227 # -1 # ID=6098_1;partial=10;start_type=ATG;rbs_motif=AAA;rbs_spacer=15bp;gc_cont=0.524 --VLSAGELAAARAAWDLMKDnVKVAESALVKHFVLHPPVQKLIPALADVP-ISELQGTTCSTPSPTRRC--ASPTTX---------------------------------------------------------------------- >ERR1712142_1087278 INALTETEVKVIIDSWDRIHPDKGAKMLFHQFLTDFPLMKIYFG-YQETESVAEIMESEQIKTRCKVVWDVLTKIVHASGDggkLAELVKEVSVKHL-NFNREKKDI----HCFLHALKVTLTC-FSGHLFRPWNIWCKMV--------- >tr|A0A1I2S201|A0A1I2S201_9CORY Uncharacterized protein OS=Corynebacterium spheniscorum OX=185761 GN=SAMN05660282_00995 PE=4 SV=1 --------------------SGHLEPELQLQLYARHPNAQWLLRAG---------------KAVPAELVELSIHAIAAADAegaldalAEARIRDLGLAQR-RFGFPSELYQDIQEIMVSLLRTTGAD-LPFPVEFAAERTIARVCVLLQE--- >tr|Q8NLZ4|Q8NLZ4_CORGL 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases OS=Corynebacterium glutamicum (strain ATCC 13032 / DSM 20 --------------------AQDFLRAVQAKLLTLAPQARGHFPTA--D------------DATHISIAEMVSALLEGTGEegkvddkTLEFFKEAALDAR-RFGLTPEMHSALGEAVRSELLSLCED-LPFENVLFAERAIAATTAVSVE--- >tr|L1MAU4|L1MAU4_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium durum F0235 OX=1035195 GN=HMPREF9997_02488 PE=4 SV=1 --------------------PDLFRTLAQRYFLDDCPEARFLFPTD--D------------STAHADLAAALIFVFNHSNAdgsltpkLVSILEQLGRDHR-KFQVADNHYERFGNALNRALKIVGAHAptYA---ITAAEKAITATLETMRR--- >tr|W5Y4C7|W5Y4C7_9CORY Putative oxidoreductase OS=Corynebacterium vitaeruminis DSM 20294 OX=1224164 GN=B843_11695 PE=4 SV=1 --------------------REELSAIAFDMFFATQRDARTRIRA-------------------TPAIADALTLLARSCDSegklpldVEKRFLQRATTLC-AHGLRVDDLEPLAESAHRAMLITAGG-QPFELVLPIERALQQLARTVVE--- >tr|A0A1W1UZL1|A0A1W1UZL1_9CORY NAD(P)H-flavin reductase OS=Corynebacterium glucuronolyticum OX=39791 GN=SAMN05660745_01670 PE=4 
SV=1 --------------------SPEFHEHVRANFFDKCPETMLVFPLH--K------------ENVHADLGRVLSFVFDRTPVdghltdeMRTLITQLGKDHR-KYNVSPRYFHPFVECLRDSLLTLCSD-LQFKYLNGADTALGEVSTLLAR--- >tr|U3GX34|U3GX34_9CORY Uncharacterized protein OS=Corynebacterium argentoratense DSM 44202 OX=1348662 GN=CARG_08960 PE=4 SV=1 --------------------LSHFGDLAHSALLRRAPGLIS---FF--G------------PNPHTELTTAVLFILTHSTPgpqdsgtqtplspridaaGAGALRALATEHV-AYMpPDPALYLAAADALCEALRDSCAD-QPFQQVLAAEKALREACSLMAT--- >tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae OX=1717 GN=mphP PE=4 SV=1 --------------------VTAHSIQAVADElraHRAEFIQAANQKP-------------------DSPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD-LPFAEVLGAERAITSIANTLT---- >tr|C0E6D0|C0E6D0_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium matruchotii ATCC 33806 OX=566549 GN=CORMATOL_02563 PE=4 SV=1 --------------------GDGFSREVFTTYFRYVPDAQLIVSP-------------------DYPLGDALVGLFHGSDNegnlypeTIEHLRDVTEILA-AHGF--RRYRPLADAISPVLDRYCLD-ISAYDVFIIKRAVRQAAEVMDE--- >tr|A0A0G3GTQ0|A0A0G3GTQ0_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium epidermidicanis OX=1050174 GN=CEPID_01535 PE=4 SV=1 --------------------SPAFRRDVLRDFFSQHPHMRLKFAAN--E------------DHAHTELVFALTYLLENPTD-PELIRTLARDHI-KVSPGQEVVADFFAILHRQIHRYCAD-LPYEEVRQADLKLQEIA-------- >tr|A0A0F6R111|A0A0F6R111_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium kutscheri OX=35755 GN=UL82_09495 PE=4 SV=1 ---------------------------MVASHfYADVPLARLSFRL-------------------QPSLVDTLIAGLSHP--lNITAW---AHDLA-HRGVDRSFYVPLSAALQHAVCHICSA-LPLVDVLAVEHRIDQIMKQLLA--- >SRR5580704_16882803 -------------------------------------------PG--RH------------GCAAPAFLPGAQPYRRCPRgpegPRQPRALSAGTRAR-APKFGERHYEVFRRALIATLQRFAAPRWNETAKHAWETAFNHAATVMIE--- >SRR5690348_1231357 --------------------------------------------------------------------------------arapevrrPRAPLRG------G-QAGADRHASAVCRAELEP------------DRQARMGDRVQPRRRIMID--- >SRR6476620_5060594 
----------PAQVSFWLLEpvADAAMTYFYAQLFAKATWTDREVY-----------------ISGPDHMIVKTA-RVLRERgapdRLIHYDLD----------------------------------------------------------- >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 OX=582737 GN=TSPGSL018_8354 PE=3 SV=1 ----SSKIITLIEKSWAFVEsrcdLMEVSNKFFERLFQRAPALQNMFTKPK--------------RVQYVMLAKALDLIVRSAGEtkvMNEDIKAIALRHI-KYDIRQEHLNVFGSVLVETLANSVGPeNWDEDISAAWASIYGNIAAVF----- >tr|A0A1Q9C6P6|A0A1Q9C6P6_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene41206 PE=4 SV=1 ---CVCDLAQCRGRSWAAFFvdi--------QAAYYETSRS--LLFEGP---S-----------QDP----------ALVALQLpahVQALISDGALQGL-GI--PQEHIALLQDCvecsfwtftgqtqqvmatsgsrpgdgladvlFGALFAVILtcLEAKCQQCGLVHQSMSDALGVPDR---- >SRR6476646_8240181 -----------------------------------------------------------------------------NINLLF-ALNRHTCPNL-I------------HEPASEFfFGLQRPATH--HEHIRVENIHHL----IK--- >SRR5688572_19725352 -----------------------------KNLFELNPALRPLLPE---STAE-----------QDRLLTRLLNAEAGALAGTRPP----APRSAEGHGNEgTAPCSVAGEALLWTLQEAYGADFTPQARAAWEALYRFVTGTTKSAP- >ERR1719229_1707680 ---------------------QQLGVLLFANLFKKQPLCRNLFAD-SDI------------SKQSLRLLDMFGWLLRSLVKeknqMrLRTLKSLGDRHV-KYGIKIEFFGPMLDSLSDALQDWFGTNYNTQTRVALTTLFQSACNEMMKQ-- >SRR5512139_12076 ------TDLELIEASIEQMlDlETEIIGDTYARLFAHCDGARALFGP--NTYG-------P--RAQ--MVN---ETIIAGLDLLrgepwvHEYMTQHGVRHRHSYEVTDAMYRTYAESLLGAIRERLGDRFTPELEAAWS--------------- >tr|A0A2E3FAX6|A0A2E3FAX6_9RHOB Uncharacterized protein OS=Rhodobacteraceae bacterium OX=1904441 GN=CML69_02715 PE=3 SV=1 ---LPNENLELIRHSFPLIFqhKAEITTKFYEGLFRDAPELRRLFSK--EMNVQ---------KDMLVSVLTTLAKA--SFDEglVESMIARMARVHS-GLGITSGQFRTGEAALLSALDQSVGDLLSETTLDAWKTAVRRVISAMID--- >tr|Q9NAV7|Q9NAV7_9ANNE Dehaloperoxidase B OS=Amphitrite ornata OX=129555 PE=1 SV=1 ---------------------RTYAQDIFLAFLNKYPDEKRNFKNYVGKS-DQELKSMAKFGDHTEKVFNLMMEVADRATDcvpLASDASTLVQMKQHS-GLTTGNFEKLFVALVEYMRA-SGQSFD---SQSWDRFG------------ >tr|A0A0G3G1X4|A0A0G3G1X4_9GAMM Uncharacterized protein 
OS=Thioalkalivibrio versutus OX=106634 GN=TVD_07385 PE=4 SV=1 --------PPNVESSYRRCcADASFLARFRLALRAADGQVSGIFDP-LSA------------RQQEVMLDASIRAALDFSSGdpqGASRVSEMIHVHGRQgrVPVPPALYPVWLESLIQAVRETDP-HWSDALERRWRAQLMPAVDMFVELYL >ERR1719187_3161387 --ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFINdFIPSQMADKATPNTIKAWEKFMTVFIEHVKE--- >tr|A0A2E0SIT0|A0A2E0SIT0_9PLAN Globin OS=Planctomyces sp. GN=CMJ46_04905 PE=4 SV=1 --PVSMTIVDSVRESYARCrQNPDFFDAFYDHFARKSSEIGPLFSN-TDMQ------------KQNELLSDAIDSLISFSEGdvaARRHLDEIALSHDReHLNIKPEWYPLWMEALRDTIHESDP-GATTQLLADWNTVLQPGVNHIVQQH- >ERR1719487_109746 ----------EIEISHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALAMRHI-KYDAPPEFFGDFESAMLKTLAQSLKSTFTEAVKEAWKAALQFIASTIV---- >ERR1719327_803055 ----------EIEITHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALATSTC-ASSGSRLA--PRPSSTATSI---GRSPFRCRX-------------------- >ERR1719356_1095802 -------------------LMRDIPNTIVALFAI-TVAVfeddySSMLDQ----P-------FlliAVLGFVTLTvilLLNLLIAQLNTTYV-RIYQEVFGWALI-TRGNQIVEV----LD-ACPMS-VWKPFLETLGLDERLE---FNEGDIG---- >ERR1719326_1696685 --------------ASSTQikeLFADVDLS---------------IHA----P-------Ifa---------sTLQSTISSLNNPTELLPLLEDLGKKRI-KYGVQEEHVVAASASLIFTLK-SIDDQWSPQVEAAWTEACNVMQNVAS---- >tr|A0A0N8ALQ3|A0A0N8ALQ3_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 ---------------------------TKARLN----NCMLLFSE---------K---LAAFLaQASPSWPVWNVVIHPCfs--qelMANQLNVLGGAHQ-PRGATPVMLEQFXXXXSPPSSSSSSRKP-PASRNSSPN-------------- >tr|A0A0P5ANB1|A0A0P5ANB1_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 ---GGNDGVETVSDQSNLFVVfAI-FGQGIDGNASEFDEVLLGAGSLLEEL-DEDGGNDGVAVTpDVFPaglniadlVGGQFSLGISQIfgflevlgdASdqsAHTVLPGLSGL-G-VEGAAQRFSKDFLSDVTELLEHDGVSSFNAEARQAWKNGMRALV-------- >tr|A0A0P5ESR8|A0A0P5ESR8_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 
---------------------------------------------FLEDA-SELLEHDGGSS----TGFMGTTESVQLVghqllaeqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGNIGELAEHCLVL--GVGLDEA-EEDLGSDISV-L-------- >tr|A0A0P5I7S0|A0A0P5I7S0_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 ---------------------------------------------FLEDA-AELLEHDGGSS----TGLMGTTESVQLVghqllagqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGKISEGLEHLLVL--GVVLDE-TEEDLGRHISVLL-------- >ERR1712168_1063860 -----------------------------------------------------------CEKAPPIPDCTSSNTVMMRLFKrdpeVAKLIYDVGVQHQ-TRNINEDEMTKMSKSIYSAVQDINVGPHSDKELAALHNLLEVVSYHFKRG-- >SRR5690349_6204932 -TILTDEHRHFIRTSWEKINkrheKTTLGILMFEKVFAFLPDLRNVFGL-NDSS-VSETDRNENFRRHTSLVVNLIDLIIRNIFEmeaeMGPVLLMYGRRHFLKHDLVFQENQLVafAQGLCEFFEEEVDHdddnSLASETKAAWNIF------------- >SRR4051812_9455799 -GTLTPLRCQLLQKSWEAIIakygMFKPGMIMFQNIFKIQPELMEIFQI-SPEK-LGNFGDlPDEKFRHGRIFTNVLNLSVKNCVEleteVAPVLHLYGRRHVSKHNVDMAHHFLLvfAQGITSFLINEVK--------------------------- >tr|A0A1Y3EGL3|A0A1Y3EGL3_9BILA Globin OS=Trichinella nativa GN=D917_02219 PE=3 SV=1 --FLTKSQRQNVVRSWEKVpNKRALGEEIYIQIFMHKPMLKSLFP-FRTVP-VDQLRNNALFTRQAAIFADFIDCVVGYLaiNNgnlIMELSERVGVNHALMTSVnfDPEWWVLFANSVLDCIRQYCEPKFiclpisrhiTRKIMIAWRILLKEVVDRMSEAF- >SRR5260370_37911868 --------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVLTRIVERHSRpDLAVPPALYAPFVDSLIATGEQHDP-AFTPEVEHAWRSTAQTVVAYMTSRSX >SRR5229473_1098235 --------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVQTRIAERHSRrDLAVPPALCAPFVDSLIATGEQHDP-AFTRRWNTPGGAPPKRS----SPTX- >tr|A0A1I8EE37|A0A1I8EE37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1 ---LSKSQRITIENSWKRATksnaREQVGIQLFARILTARPEMKHLFG-LQKIP-EGRLKYDPRFRRHAIVFIKSFDYIVKNVAykeKLEQHFQALGERHTIlqGRGFDPGYWDTFNDCMRQTVS-LWGKDKDHRTANTWHTLISFVLQNMKIG-- >ERR1719264_1394560 --------ISVVAANFKTVKSnQVLANTLFEHLFELEPSSKALFES-KDL------------TQLKTKFAGFIGQGLKMLqgKNAKKSSGSLPRCTW-RWE------------------------------------------------- >ERR1712226_1819570 
---------------------------------QYDPSSRQVFEN-SNL------------TEHKQRFIGFIGKGIDTTiEGDREEWKDLVDMHV-DIGVTFKHFLAFEDAFLNTLHDLYADTFSDELLCAWIYVL------------ >ERR1719326_1666808 --------LDIVTKSYETVAAnSTFADILFERFFSYDESAKKLFGN-ADM------------ATHKKKLVGFIGKGLKMAqsSDPDGEMRKMAAFHK-EKKVEISHFIFFEESIIYALRGTLGVAFQDELADAWTLVI------------ >ERR1712071_441310 ---IRRQGEDgrqrpvrhrqrtqrnpqtrlLSLESWTQKDrSPERPSQqvvghpkadccSSNRRFSHPPHGRRRPPW---LP-IQDANRLRAFPHQLHHQGRELP-----cRD--pKLsrX-------------------------------------------------------------- >ERR1719432_409132 ---LRHQEHRrarrfrqqqerCPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPRIADVA-VSDLMNNRKFLSISYSAFAGFNFILNNMDDPEI--IKLQLSKV----DFPGMfvfpfpgtsqqHQ---dtsr-IVLEVFREELGAAFTAEAASGWTSLLNFVSQALIK--- >ERR1712179_658195 ---VSGNSK-nAVRATFDQMRfNSEVAPKiml---KLFTAYPETQKMFHRIADVA-VSDLMNNRKFLHQLLCL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgyFSHR----IGSVQGRAGRRlh----RR--SRLWMDFSAELRQP--- >ERR1712137_151953 ---LRHQEHRrarrfrqqqerRPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPPHrrcprlgfdeqPQIP-VHQLLCLRRIQLHPQ--QHG---RSRDHQTPT--------VQG----RLPRHvrlplpwylsaAP---gyfs-HRIGSFREELGAAFTAEAASGWTSLLNFVSQALIK--- >ERR1711946_32375 ------------------------------------------------------DEQPQIPVHQLLFL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgtYPPS----HSNHTARERTAfqvlFLPQDT--SRIVLEVFRE------- >ERR1719222_1795957 ---VSAKAKSLIRDSWVQMKfNGEIAPKIYLKTFAAHPKTLAMFPQFAKVP-NRVRPHPYEpLLATAGIDYDVKLWIPSPGSEHNInveELMARNArmleetrDTI----TVPATfmirmlas--------MSNFRR-AGNRSTNDE-------------------- >ERR1719222_245222 -------ARSlgrtqesHPLDLDSHEIqqqRRTQNPLQDVHHLSRDPENVHPFGRYTR------------FSAHGEQTVLGFESLCFRwiqhdcqqYGCSRA--DQVAVVQG----RLPRHfrlslpwhfsaTRANPRIILEVFAEELGSTFTKEAAAAWNSLLNFVTKGLEN--- >ERR1711911_103569 ---------------------------------sraDQVAVVQGRLPRHFR---------------------LSLPW----------------------------HfsatranhphhlgsIR--RRTRLHFHQGSRCrleLPfelRHQGFRKQHRRLATHR---SRP--- >tr|A0A0B2VDB7|A0A0B2VDB7_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_13543 PE=3 SV=1 
--SMNDDTKGAICEQWHTILalydgdISRVGVAVYQRIFDAEPQLREVFGIPSFV---TDLSEYEPFQRSGKLFMSVVDLCVRNIYALdaemGPVLVMYGRRHYHQqsRGFHLRYMPIFTQCMKEFVSDCLNEKQkTSDSEDGWSLLFDYIAAKIVDG-- >tr|A0A0N0P721|A0A0N0P721_LEPSE Adenylate cyclase-like protein OS=Leptomonas seymouri GN=ABL78_2595 PE=4 SV=1 ---------FTVQGTWNILEkegmLERFAQQLYDELLTQNARLRVYFYGV-DL------------DEQSKSLVRMIGTAVHFYEKpqvTVEMFTKAGARHR-GYGVNGEVFEEMRDAFFRVFPKFVGADVFSAAEEEWQKFWKLMLDLLQH--- >tr|S9WKS4|S9WKS4_9TRYP Adenylate cyclase-like protein OS=Angomonas deanei GN=AGDE_06844 PE=4 SV=1 ---------NTVLHSWKLLEdggkMDDFGDALYADLLNSNPYIRVFFYGV-QL------------SEQPKALMRMLGTAVYSLNNpnkVDDLFVKTGAKHR-GFGVTTETFQSMETSFFKIFPEFIGEDVYEKTKKEWHDFWKYIIKKLDQ--- >tr|A0A2C9KGE7|A0A2C9KGE7_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 PE=3 SV=1 --LVTDSDIQALRSSWATLTAgpdgrNVFGNNFVLWMLKTIPNMRERFEKFNAHQSDEALKNDNEFVKQVKLIVGGLQSFIDNLENpgqLQATIERLAAIHLKmRPSIGAGYFGPLQNNIHDFIEDTLKVGADDAAPKSWTRLLTAFNDVLNSY-- >tr|A0A2E2XNM9|A0A2E2XNM9_9GAMM Uncharacterized protein OS=Cellvibrionaceae bacterium OX=2026723 GN=CL693_20675 PE=4 SV=1 -------DIDWIESSLELLAphADRLGGLVYPRFFVHFPEAETLFGG-GELG-----------KSTQESMIVPLLMGLKDIADGKtymLTIERWLEDHR-EYGVTLPMYSVMLDSLLLGMREAVGDLWTTEMDGAWQEVLARLLLLVEGVY- >tr|L7L9M1|L7L9M1_9ACTN Uncharacterized protein OS=Gordonia hirsuta DSM 44140 = NBRC 16056 OX=1121927 GN=GOHSU_25_00750 PE=4 SV=1 -------IRQAVLESLARYEesHGDPTRAIYERFYRVHPEAIEELAF-D--------------TVLENRMMAGILALLADVADGSidpGGAVYWVSDHV-AWEVSETMIMGMFGAVRDTVREGLGPEWTARMDADWAGLLAALAPAMRDAV- >ERR1712232_1039451 ---------------------------------------------------------SEEMRTHATKVMTFVGNGVASIGNPEkcerfrAECIALGKKNQ-ERGISSQDYDIATQPFVDAVEHSwlqagwrqtdaSGSIWPPGAQGAYTKFYGHMAATIKDG-- >tr|A0A0D6M6J3|A0A0D6M6J3_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=ANCCEY_05408 PE=4 SV=1 --------------------------------MPSCVRTAVTLP-----------------YLEIFEPFVVIEGAVMSLDNlpaLDPILDNLGRRHG-KLEVNGKfrtyYWSTFLECSICIFRKTLTN-------------------------- >SRR2546427_1691122 
--------VVLLQTTFLRAAemrigKRNITDFIYEDLFLKRPQLKPMFTN--Q-----------V--LQRHKLGKMLGSIFIHLRDqdwIDEHLRDLGAMHW-RAGATPEVYPWIKDSVLAVLEEGMAPsGWNLRCQREGAGALGVSAQGMLMGY- >tr|A0A183IHG0|A0A183IHG0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 --HFSLREKELLSVSMKKLEqlEEDNAVKIFIRLFQENPAYKSLFPKLRFMG-DADIVNSTALVAHTQLILKMIKTFINGFQNestCAVVLKRAETAHR-KFDIKPSQVSTLFPILMEILDIS-----HNETQAAWKKLFETFSI------- >tr|A0A1B6JRB7|A0A1B6JRB7_9HEMI Uncharacterized protein OS=Homalodisca liturata GN=g.2446 PE=3 SV=1 -ASLTDRDLRLGRATWFKNvDaTPDFGMVIFKELFRQYPDVESYFLHLRGN--AGSIFDSRTFRSHMTeRVVPKLKEVFEALDKpehLNEVMTKLGLYHA-KLGVSGHLVENMLSVILDALKSVMHTKMQPDEETAVRTC------------- >SRR6185369_2033738 ---------------------------LRRVFI-QVASDRSDVSK-TNF------------KFQKLMLRQSLLEMLCfdrGMSGTREEIERLGLRHKV-LGVTPEMYAMWLDSLCEAIKQHDP-SYTPELEQLWRVAMLKSIKE------ >tr|A0A0P5RQ13|A0A0P5RQ13_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1 -TKLTPHQIRDVQRTWEHLRanRNAMVSSIFVKLFKETPRVQKHFAKFANVA-VDALPENGEFNKQIAPVAARLDTIISAMDDklqLLGNINYMRYPHQPPRAIPRQTFEDFARLPIESLEAS---GVSGDDMDSWKGVLTIFVNGVSMRY- >SRR3954451_11513015 --AASPCAQQLRQGCRDRPA---ACQLVLSSGVRDRPGCEIAVQ--GRH------------GEAGPQADGGADGLIDAIDRLDTI--------------------------------------VPAVEAAWTEAYTILATTMKD--- >tr|A0A1S3CW24|A0A1S3CW24_DIACI uncharacterized protein LOC103506299 OS=Diaphorina citri OX=121845 GN=LOC103506299 PE=3 SV=1 --GLTPKMVGLLKCLGVAIKPeaHRHGVNIFKKLFLMDKTVQRMFPKFACD-DMCGLDENPDFHKHVDAVMKSILYMMESSGsvpDMKSTLALQVKIHK-DLCIPDRHFITFGYAINEYLKETLGAKYSEDVECAVAYFWKFVASEMTAKP- >ERR1719244_808981 ------------------------------------------------------------------------------------KAPRTRRPPRAALQRENALFQALSRAFLKAIKVYLP--WSDRREAAWQLLWQRIITQMTL--- >tr|A0A2T5C1R0|A0A2T5C1R0_9BACT Hemoglobin-like flavoprotein OS=Mangrovibacterium marinum OX=1639118 GN=C8N47_108138 PE=4 SV=1 ---MTEADITVIEKSYAQIEAalPRMAKYFFNRANELDSDLDPLFEE--DK------------SKHGEAFVALFGKAVEHLNSPealLPEIKKMEAKLK-YYKFNEEVLNTVGVVFVDTLSFGFGNNFTQDIIDPWVKAYKTYSS------- 
>tr|A0A1Z4LAZ9|A0A1Z4LAZ9_NOSLI Nitric oxide synthase oxygenase OS=Nostoc linckia NIES-25 GN=nos PE=4 SV=1 --AVPPELLLKMADSWQVMsqNKQQMGIEFYQMLFEKYPFVLPIFGR-ADMD------------YLSLHLFQALEFLVNCLKTgssdeMLRELRFLGQVHG-SADVPTCAYPAITECMIALMERHVP-DLTPQVRQGWVTLLERVINIVK---- >tr|A0A096P8B0|A0A096P8B0_OSTTA Flavoprotein pyridine nucleotide cytochrome reductase OS=Ostreococcus tauri GN=OT_ostta17g00030 PE=4 SV=1 ---------------------------------------------------------------------------------------masvgsgat----DDD-GVDVPVSRCPFAhGTVTVDPYPGYVH-G---KNPRVCPRGCVPRPPSKP---- >ERR1712071_238239 -----ERSFTYWKDSAMMELa--------KWNARLQTPR----------------VYEVKwRRKKRNIPGRVGWRVLGAELWVRSSCRRRIRNRPYQEYFVSyvsiSQQLEETARLIIDALDEELGVRFTSYTRGVWSR-aFHFANSIMAESF- >tr|A0A2D4BL26|A0A2D4BL26_PYTIN Uncharacterized protein OS=Pythium insidiosum GN=PINS_002968 PE=4 SV=1 ---------------------TTLYDVFYAHLEQHSPELKPVFRS--SV------------HIRGKVLVHISVGMRTLIASenFVDKVLPLTKTHR-RFGVKPEHYEPLGRALLHAMQVVAL------ITRDRGRVEEPTSIILI---- >tr|G8YSE7|G8YSE7_PICSO Piso0_001107 protein OS=Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL Y-12695) GN=Piso0 --EITEQDIYRLSSSWNTIHtnsryhNDSFVSRLYANLLAANPKLLPVFSG--EN----------GLQEHSALFGELLSLTMIYLNDmptLKICIAAYARENPLFTEQCCEIVEPMGSALVLTLRQWLGKgVFDNELQELWIKVYVMLANTLL---- >ERR1719431_737524 ---LDMSQISDLQRCWSTLQlhmgEQAIAAAFYNDIITNFPSIQKYFKNIWTESTFtRTIGNMNDVRKHASLVVSRLTNYMGNLHHLsevNEDLKELGMIHAARYHITEEVVEQFVSSMATTVADLLTKedLFDPVLCGAWKRFFFMILTFLSEG-- >tr|A0A0G4H5Q5|A0A0G4H5Q5_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) OX=1169540 GN=Vbra_6604 PE=3 SV=1 ---------------------SEIGIVFLHNLFSNAPTLQKLFVR---PS-----------ATYGRIFGQILKMLLAHLDDPAEvwqNNKELALRHI-KHGVRPSHVPLFSKLIVETFASIGGEEWTAEHTAAWQALWEVTGSELT---- >ERR1719431_2380502 --ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFiNDFIPSQMADKATPNTIKAWEKFMTVFIEHVKEG-- >tr|A0A1W2GS79|A0A1W2GS79_9BACT Uncharacterized protein 
OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4044 PE=4 SV=1 MKDLNIRERKNIRDTWKVLAPniHEFAFSFYSNLHSLDSSLVPLFEN--EF----------GIIKQGDKALYVLGFVVASLDNLmvaregiKKALEGVFMEHQ---HIKRADEQKVMKAFLQAMKSTLRGVWTNEIAISWYRLLSLISAVSI---- >tr|U1JU51|U1JU51_9GAMM Uncharacterized protein OS=Pseudoalteromonas citrea DSM 8771 GN=PCIT_01118 PE=4 SV=1 -MSISPYQYQLLTQSFTTLKPNFhcFCVSLH-TQLKNYNLELA-------------LPSSSkYLLNIEHNIQLFLSEGIALLPQQsalVDLIKRHKPHFD-ALKLSEQDIAVLCHTMLETLQLHLGRQFTLALRNAWRKALHMFANIIKS--- >tr|A0A290TM25|A0A290TM25_PSEO7 Uncharacterized protein OS=Pseudoalteromonas piscicida GN=PPIS_a0207 PE=4 SV=1 -MSITPYQYQLLTQTLASIRPNFhgFCTSWY-NQIQHYDLRMQ-------------IPTNVgQLIIWEHQIFDFVQNCVMRIPQQsnlLHYLQKQRGTLL-FMGTSEKDISVLLFTFYSNAKKSSWQAFYHSSKKRLEQSTVTHRKY------ >tr|A0A2G1B531|A0A2G1B531_9GAMM Globin OS=Pseudoalteromonas sp. 3D05 GN=CSC79_14765 PE=4 SV=1 -MGISTLEKQLLLNSLHVVKPNFhcFSYTFQ-MHVKREPLDML-------------CLSNSKINEKTYILYCVLERIVMHLDNLrtvTPFIEHYAKNLS-NMGMSHQDTDILCNSFLATLKIHLKGCYPPKLESIWQHAINIFKSIVTG--- >tr|A0Y309|A0Y309_9GAMM Uncharacterized protein OS=Alteromonadales bacterium TW-7 GN=ATW7_05751 PE=4 SV=1 ----MNSHKSVLLKSIGIIKPNFhaFTARFH-KKLVESDISMN-------------TLTAEQFNEKSYILYCTLERIIKNIDNPssvAPFLSHHLQFLK-KLNIQQSDIKPLTDIFYVTLVEHLGRFFNEESHLAWRKVLTYFERYTND--- >tr|A0A0K1PX98|A0A0K1PX98_9DELT Uncharacterized protein OS=Labilithrix luteola OX=1391654 GN=AKJ09_04675 PE=4 SV=1 ---------VVLKESWHLSYrrAPDLAARFYEELSWKYPSARRLLDHVFGAQN--------DI---AVCLSTVAGDLLDNVDDpdaFSAAIVALANAHV-SLDIPPHVVAWMEEVLLDTLEGAAGDDWTPEMRTTWRNAYEDLASRLAR--- >SRR4051794_15895678 ------------------------XmvgitqfyTEFYARLDTLDSSGKfdAIlsahtsgTNK---------------IAAKGEILIRIIKFALSIQGdnpavql----QLYLLGKS-HVQKRIRPWQYSIFVEAMIFTISSRLGTEATHEVMEAWVNIFAFILRSMLPQA- >SRR6478672_7358577 --------------------------------------------------------------------------SRmp--CNSstlkRRPSatscTESPTSTSP-WESAPSST-PSSASTYSPRSLRFWATPSPPRSPPRGGEVYWLFALQLV---- >tr|A0A1Z5JZN5|A0A1Z5JZN5_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 
GN=FisN_19Hh029 PE=3 SV=1 MEDISPDVVSAVQDSWERIKdsspawEDDFGDRFLKSIFTKAPLsYKLLFP-FGTT-SGPAMFESEDFIEAARTASTLMDMSVSLLecemDALFGQLLEIGLEHANFPRIQTSHWSMMRDALLRTLASYssaLSEDCKdlEKVLSAWSLVFDNLSNEMVE--- >ERR1700744_2408068 -----------------------------------HPEAESLFRR--GPS--------MR--CPTGRP----------RSGTPG----GscwtkliASAlSA-RHKSRRLKSSLPLEEIRADVGFLL--DRVVVAIDavgdervvRNDRVLVRLDRVQS---- >tr|A0A2S3QTP4|A0A2S3QTP4_9PROT Uncharacterized protein OS=Halobacteriovorax sp. DA5 OX=2067553 GN=C0Z22_01530 PE=3 SV=1 -------DKDLIIESFARIEpnLKNFTNAFFDNVVILEPGMQKVFAH-AD-------------REQLKaSFIRALSITINNLKNpeyLKYYLQGLGGNQI-KYEVSETYFPIFEEAFIQTLMLFHMNSWTPKLETAWRDCFYYIAEYIS---- >ERR1719216_352717 ----------IIKSSWRIIQnkvIARHGTDFFIEIFDSQF---------KP-P----IGVTPVFQGHGEKMIQVVGKAIETLRDgKspteqesqelWDMLIENGRLYL-GYGALPMYFDVLGTFDCKHSKDNVIVntGNCGKQEM------------------ >tr|A0A2D7G1P9|A0A2D7G1P9_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP96_10880 PE=4 SV=1 -------EQTCIERVLDCAAedQPDFQQRLYDRFYQLAPSAEALMIHIDEE-------------VQGKMLAEVIRLFLsPDVaVTDQQYLLFETKNHAQAYFVEPEMYRALNQALFETLKVGAGRIWSSEVESAVHNRLSKMLHGILEAL- >tr|A0A2E1GZ77|A0A2E1GZ77_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ03_04085 PE=4 SV=1 -------DQAWIETAFDCAAvdNLNFNVDVYQTFYRAEPSVASLMAHIDEL-------------VQNKMLSEVIRLLLnPNIeSEEAGYLNFEVKTHIQGYGVSPLMFLSFNRAVYEVLQSSAARVWEDDLAVAVTRRFAVLSDALTEAL- >tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ23_00915 PE=4 SV=1 -------MQSSIHALLEQVAttDIDFDKKCFERFFQISEEGKTLMAHMDRV-------------HRGKMMAEIYRLMMaRDLDDEADYLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY- >tr|A0A2G2R0S2|A0A2G2R0S2_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_09030 PE=4 SV=1 --IVTPDQAIIIQESFARLStsSDSLIQDILGTIAEGNSDLAVTI-TF----------KSQNLVE---QISTALSHIIDQLhtaDNVAEYVAHFGELLL-AQNVQDENYSSFGEALLSGLENALQNDFTAEVRDAWTSGWAMLSGIMRE--- >SRR5258705_7404034 
----------------------CPTSSSRPVLWAAvrdCAGGQTLVPR--RY------------DGTRLQADGDAGRCGQQSGQSRsrvAGGERSCQASR-RPWREGGYYTPVGAALLWTLEQGFRI-------------------------- >tr|F0W0M6|F0W0M6_9STRA Uncharacterized protein AlNc14C5G666 OS=Albugo laibachii Nc14 OX=890382 GN=AlNc14C5G666 PE=3 SV=1 ---------------------------------LNAPELKPVFKT----------------SKHARnVVLQHIVGGLRTMlahDVHIERVRALTRTHL-QFGVKMEYFDLLGQAVIFSMRHCSGSHWSSEIEEAWRRLYGHCSVILL---- >SRR5271163_4883858 ----------RTDSLYAQLGgkttIASIVDRFYEKVL-ADPDLKPFFAK-ANM------------AGIKQRQAQFLTQALGGPIDA--RNHETRPAHA-SLLSDTRHFERAATHLAVTLSEM----------------------------- >ERR1711911_155006 --DIIRKNCLMLYTNFTATKiaFKWILLCLNCRYFEIKPEAQKLFPAFANVPL-KDLPKNYA-------FLAAVNTCFANVHYLIekagrnprdcPVFSKVV---A-KYD--ARDVKQFGDIMMNSLKSELGSQFTDEIEESWNLALEEIAKMVS---- >tr|A0A286GHZ2|A0A286GHZ2_9BACT Sulfite reductase, alpha subunit (Flavoprotein) OS=Spirosoma fluviale GN=SAMN06269250_4620 PE=4 SV=1 --ALTPDMIRLMRQVGDQLsaDARVIGTDFYHALFQTHPDIIPYFNR-TDID------------SLTEHLMQAVGFLVRSLASgvdITKELRELSQIHT-NFSVPPDAYPKLVEPLLTVMRKH-VPGFSTEQEHAWVILLNRVTNVLRQ--- >ERR550539_353004 --------------------------------------------------------------AMMQHLVKNLHDISRF---dsdIRELLTRLGQQWL-QKRVPLDFAVLLGNEYLEAvlpffHSNV-GATLALKLEVSLAYLYKEAMHFLLL--- >LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3583117_1 # 3 # 191 # -1 # ID=3583117_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561 --ALAPEAVTKMRAGAEAMlaHPQEAGVFFYETLFDARPDLVSLFRT-ANMD------------ALSRHLIDTVVFLSRAADDltgLRDDLRNLARVHQ-VNQIPPSEYAHLAAPLLETLSRF-GHPLDAQMIRGWEVLFDRVSRIVAE--- >ERR1719359_219123 ------------------IdEepmaEVVSGeDALV----AIA-DLlyQKL-------------------------------------SGdeaMAQFLENVDLT--QlanNLRSLlalvfngsdWPEMHLS--gSLiddgYEDFSSILQETL----qaSPg-DDALL--ESLDKL---- >ERR1719487_376807 ------------------EeEgateEVASGeEALV----AIA-DMlyQKL-------------------------------------SGdqaMAEFLENVDLA--QlakNLRTLlaavfegndWPEINLS--aSIidegYEDFSSVLQETL----qtCLg-DNAML--ESLDKL---- >ERR1712100_485805 
---SVGHVVLVV---GRCSfEcrniVVVEGlDGSLDRLLALRkvvgiglGLPilQQL-------------------------------------G-VLRHVGNVA-------------------lKVlrchFLQFSNHVLEVRSRLRldefclvgdivievilrDHgggkHeRD--------------- >ERR1719171_2780585 --NLSEEMITEVQKSWSEVLrrvdsKTEIGRIIYDSLFDRLPHLRKMFKT-NRL-------------TVAMRFANSVHSLVGILNNkeqTEEYVYNMALRHV-QYwsgdgSIAQANMSAFLKAVLIVFDNALDDKWTQRMEEAWGALFSYVGEAMVA--- >ERR1719265_1594411 --------VDTIVKDWAGLDLEKLGDTTFGMMVQNNPEIKTIFGG--DVHPG---VAQQGLKSQAATFVGFMSYAMTWLKKkdfivLEQKMVELGQRHV-HYGVNVSHFVSFQEAMFTALREQLGTRFE-DNKYAWTFT------------- >ERR1740139_1939294 ----DSDTIAVVKQTWKAITalPeqqEYVGMRLLHNlhpcyetsltfllvielyylsYLRVVPSARAFFPPTSD-----SLIDDESFRESASNLMMCIDKAINTLENqrhlrFKALLQTYGKKLS-RLHIPPSCYTMAWFALIETLQDVLEDRFTELMLAYWIDIIDPINT------- >SRR5690606_18427011 ---VSHRN---AHEKHQPCHaKL-------------RPLLRE-----------------PRLLRRLLY--DLSGqLTRR-A--GEVRPERHG-----GAEASAX--------------------------------------------- >SRR5690606_42132731 ---MPMKNTNRVMQSYGRCCaSPGFFDDFYTTFLASSPAVREKSAQ-SDMA------AQKHLLRAGIP--NLVPLARG-M--PDTKLDRKSTRLN----------------------------------------------------- >ERR1719487_109746 -MIMSAEAVQVVQDSFHRVDscvqiRDALEDVFFPHLFASSTQIKELFAD---V----------DLNMQAPMFANILNSTISSLNNpteLRPLLADFGEKCK-KYGVQGEHIATAGESLIFTMKSI-DDQWDAEVEAAWMAACSAMENAA----- >tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1 -----PMEVALVQSTWQRFLesPnlTTEFSAIFQRMFQMVPTAMQAFRYV-NSTDLDSLVANKDLQKVVTMMMSEVNATLQLLDQpqaLISLIRSHGARHA-TYGVTRQWEETMLNAILYAVETKLSPsGFNQSEKNAWRSVLDMLGRNF----- >tr|A0A0C9M7G1|A0A0C9M7G1_9FUNG Type 11 methyltransferase OS=Mucor ambiguus OX=91626 GN=MAM1_0030c02374 PE=3 SV=1 --PPTQAQIDIVRYTWERVSeihldtddPtvsatHAFGLAFYDALFKLDPSLEPLFSNIFQQAralagMVSYIARSPKVTGPNKpksatSLsegcgmstaklekvptireinarkrketnATTFEELVSSAatskpkaeDDeeqLLYKLRELGARHY-FYNVEPKFLALVGPAALSALKTRLGKDFLPEVAEAWTRAHAYAAYHM----- >ERR1719365_124985 
-SEMSGKQKKIVWRTWNSMLgkqesdYNDFGINFVLWLFDNFPKMRNKFDELYGR-SRNSLIVDQHFIAHTENVVKELDRLIKDLPFprlLSKRISKLADSHLNqEP-------------------------------------------------- >tr|C9CRM3|C9CRM3_9RHOB Uncharacterized protein OS=Silicibacter sp. TrichCH4B OX=644076 GN=SCH4B_0097 PE=4 SV=1 ---ISSRDIDLLQSSCATAFlkKGVLASAFYNKLFEIEPAYVNKFS---NIN------------KQKIMFEAMLAYCISGITSgykVEALTARLRSYHM-HLEISDIDIANARSALMYALGSVLGEDFHSDLKQAWDAAFSSVSEALR---- >SRR5688500_3946624 ---VDSRTIALIKESFTPIAgrTLELADRFFNNLFTRQTSVRGFFPA--DVTEQ---------KRQLPGVIQTILENGDKLENLEPQLREVGREYA-KQGALPTHYGAVARTFVDTVREMSGIGWQARYTRAWTSLFDSLTKAIV---- >GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6338290_1 # 1 # 129 # -1 # ID=6338290_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.636 -------------------ReagLEQYAGALLRSGFDDLEtllaiedadmkdLGIPaCHVVRlrkklqelqRQRSGTRGDFDASNP---VVAFL-----ENAGLGQya--KLLLQNGFDdmDV-LLDIEDADLKDLGvprghaIKLKKGLRELQLQQYAQEDPMPLHAAA------------ >SRR4051794_36238122 --------RRTAKASYLRLQgggrERAFFAAFYENLLVSCPDVKPFFVP-ERMA------------HQ----QSMLNRAIQLLLDFdracgCPQLRQLADGHA-GYQLTRWHYDQFVEALIRTIEQS-G-ITNPAELSAWRTTVMPAIEFM----- >ADurb_Met_03_Slu_FD_contig_21_1037173_length_469_multi_2_in_0_out_0_1 # 1 # 468 # 1 # ID=69395_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.588 --------RRTALASYLRFQspdkVQKFSRGLYEHLFDRHEELERLFKP--DLK------------AQ----YEALNRALQALVDFrpedpdsAKAIETIATRHR-GYSISKAHLVTFLDAVAVGLACA-D-ERDPETHDAWHEVLVAAFKPF----- >SRR3569833_2455512 --------MKDVQARFGRCClHPNFLDTFYNAFMATSPEVARLFKN-TDF------------TRQKKMLQMSLNLLIShamGIGIVDGYLHQLAAKHSRhHLNPEPQHTTPPPNSLMKAVNQHDP-KYTPSLDHARRTGHGHGIELI----- >SRR5439155_1005251 --------KATtalAKASYDRCCqAPEFLQVFYRNFLAACPEAVPRFAG-TNF------------DQQTRLLRHAIGLLLIfpnQPNKEPNLLARLARGPGPcRRQGCA--CGQ---DRSDRTARTDGAsrqrrcraPCSRRpdarGSRKWVRAAP----------- >SRR5262245_66279004 ---LEPTDRIRAKQSYLKHcmGKNDFYRKFYERFFQGPEGTmakEMFAD--KDL------------NQQYVKLDQSLHYLLNFGDQdmMEpTVLTTTATIHQ-TKGVAPEQLERFIECLIDTLSKDYQV--SGIEVDAWKNVCGP---------- 
>ERR1719277_2718232 --VLTDETIAIVKSTAPAMKehAYKISETMYQNMFAEKPEIRKLFTP-EDQ----KVQPGQTQKKQPLNLARAIQAYATHIDDldkKKSRIGRRIDrvrkKEC-SIESKNG---FNGK-RSEIVKEELTELERKNVVLrakmdSMEREvkllkKKFLSDIS----- >ERR1719209_1562507 -----------------------------------------------GDHsh-AQSYH-----EVHEHLWRSLAFSVLNQVlsrDkRIKQDLFNLGYTHH-ERGLKEDDMLQLEYAVIDGIHDHLV---TDVHERAWRKVFQLIRIHF----- >ERR1719487_2840864 -----------VRQSWAMIQaiqtS-sagGFGDALFFNISVMSSEIWSLFSV--SKE------------VMAVTFTDAFTLIVSYIADpvgLAEELFGEADGVG-DVGDDQGEGiregdghDLLGHGEQ--TPDLAAHDGDVEEERVAE--------------- >ERR1719171_2815737 ----------------------agaendeelrensgvedsfasgsvptTFNEMFLFNLTVMGAGARK------NKA------------ImWMTEVLTSFDTIVANVANskrLQEECDVLGLRIS-KYPLDFVKLPEFKACMLSSLRSLLPRTWSGTHEVAWSWLWENIERML----- >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 OX=905079 GN=GUITHDRAFT_143733 PE=3 SV=1 --------SARIASSWTELvkksDYAEIGRRIYGS-VKANDTLEPLFR-FTNQ------------TVQGTKFVDMLSSIVENINNPqtiFEKVNELAPMHH-RKGVKAAHMPIMKGIIVSLLKHVLGDEFTNEDEEAWNWIWQYLTQILD---- >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold759411_1 # 1 # 798 # -1 # ID=759411_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.594 ----------IAAQFWEEHiSyksladKLEIGCAIYFGMMVHNKEMKRILKKNlhhHQ-----------SIENSSVKFLDMMGWLLRSLlrSDidLCGSLQQLGAFHR-NMGVNINHFDPMLKSMHETFSYYFPIKYGIQIKYAIDQIFTLAARIMTG--- >ERR1719396_104066 ---------FNIIESWELLRfhpslKEDLGTAIFRELFKEHPELREHFGL--PLVGLDALCKNQTFLSLSNQFVDVFARTMDTLGPdeelMDESIRELGEKCV-SIGIETSHLSLLRKPILSAVEKILLEDFDD---ESWKKFYSILATDLAE--- >tr|A0A0P5AEE1|A0A0P5AEE1_9CRUS Di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1 --KLtp--HQIQDVQRSWENI-rngLNALVSS-IFVKLFKETPRIQKFFAKF------ANVAVD------SLAGn----------------AEYEKQI-ALVD--TPTPNVEFPV-------------------------------------- >tr|A0A0P4WPK3|A0A0P4WPK3_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 
--KLap--HQIRDVQTSWENIRgdRNSIVPPSSSSSSRRLPAPRSTSSN--SLA-LPSMP--------------------CpKManttnklllGDklqLLCNINYMRYTHQPPRAIPRERFEDFARLLLDVLSSK---GVSADDMDSWRGVLTIFVDGVS---- >ERR1719510_2339612 --SLTDNEVILIKSSWTYLKPhiNTILIESFMSLFAENSDVKEKFYSFKNHAIEdlnkkrgVGLASTNGLQRHIPRVSRAITKVVNSIENldrVSRYLEMLGKIHQ-QIGIEVQELMMLGAFFINSSKRHLPSSMQAdrHYSDSWLHLFTVISTMMRKGF- >tr|A0A2V3J537|A0A2V3J537_9FLOR Flavohemoprotein OS=Gracilariopsis chorda OX=448386 GN=BWQ96_00611 PE=4 SV=1 ----DPETEALIKNTLPIFtkHSQQIAVQLYANLFEQHPQLKPMFC-LEFLQTPGQCKKSPgtGMSPQAKILSDSIVNFCANLDNIdmmNNAIERICAKHV-SRHVKSDHYPAVAGAFSRAVRQVLKNELSESDLKAWDTAVSALAGVLV---- >tr|A0A2G5SLB2|A0A2G5SLB2_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-17 PE=4 SV=1 -TEMSDEEVSAIREVWIRAKTDNVGKKILQTLIEKRPKFAEYFG-IQSeSLDIRALNQSKEFHLQAHRIQNFLDTAVGSLGFcpissVYDMAHRIGQIHFY-RGVNfgADNWLVFKKVTVDQVTTGATDsSKekdkdetnsngtangkvdteanpipvgiadinnvysgeNCLARLGWNKLMTVIVREMKRGF- >tr|A0A2P8XQA5|A0A2P8XQA5_BLAGE Uncharacterized protein OS=Blattella germanica OX=6973 GN=C0J52_27026 PE=3 SV=1 ---LAREEKKFITESWHAFmrLPPANSVDAFVKFLQENPKYIKFFKSVDGIP-LEDLRYSFRVPKHVTAVLLYVNSMVHCLDNADAMfflSLQVGLMHS-NMGLTVEDFKLFNGYMVNILEDELG--LNDEGVAVWNKVLEIFM-------- >tr|T1FHE7|T1FHE7_HELRO Uncharacterized protein OS=Helobdella robusta OX=6412 GN=20208246 PE=3 SV=1 -----------------------------GTLLQSNPLVKNTFEKFRQMDPMSDFTDSSVFSTHAMVVMSAFEDIFDNLDDseIVKDILEQGKSHG-KFseDFAPETFWAIEEPFMSSMKDILGRKMSSQLEKIYKKTIKFILSVLIKGLR >SRR5580658_3791175 -------DPALVREAWSFVSdrADQLVMNFYAELFYVFKEAPTMFPS--NMT--------RQRQEFGRAVVQWIIS--DDQEGL----------------------------------------------------------------- >SRR3990167_4175368 -TGLTDGEKGMIQQSWNLLSKVEFTKILYKKIFELAPHVRCLFQN--SIES-----------QHENfsIMMDMmINEHINDELDLFAVVLQLAKRHF-HYKVKTDYYSIFRDGFLWSLEQTLSIEtlnktITnestnqpTTIKSIWLKFVNYLISVMV---- >LauGreDrversion2_5_1035112.scaffolds.fasta_scaffold830278_1 # 2 # 232 # -1 # ID=830278_1;partial=10;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.316 
-------------------------MAFWN----KHPEPAAQFVA---P----------TQdtltdefepeeeqGISKEQLLSALNAAQT----ALMMIDR----D------FNITYLNqKSVDLLKTHEALFQSIWPNFQATeefllGYCIdlfhanpshqrqmlsnpsNLPYTTTITVKDV- >SoimicmetaTmtHMA_FD_contig_51_4416696_length_1368_multi_2_in_0_out_0_1 # 1 # 216 # -1 # ID=2511055_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.685 --------VALHTVEFAVADPsaRATI--------------------------------------------------ATHGLtpdDMAMLLSK---RE------------LIGPAFPALLDEFYGKVVEN---------------------- >tr|Q5D2M7|Q5D2M7_9TREM Myoglobin 1 OS=Paragonimus westermani OX=34504 GN=myo1 PE=2 SV=1 MAPLTQAEVDGVVSELNPfLAsdakKVELGLGAYKALLTAKPEYIQLFSKLHGLT-IDNVFQSEGIKYYARTLVEDLVKMLTAAAKddeLQKVLVHSGHQHT-TRKVTKQQFLSGEPIFIDFFNKTLSK---PENKAAMEKFLKHAFPVIANN-- >tr|A0A1S8X4B3|A0A1S8X4B3_9TREM Globin OS=Opisthorchis viverrini OX=6198 GN=X801_02811 PE=3 SV=1 MAPLTQSQIAGIHKELLPiLSndeaKTSFGVGAYKAFLGAHPEYIQYFSKLNGLT-IDNVFESEGIKYYGRTLVDEIVKMLTAGADdekLKQVLHDSGKAHT-ARNIDNATFMvsklfmflkrvsemrlarglygpfpifaqSGLPVFVDYFNKSLTV---PENQTAMEAFLNHVFPNISKD-- >ERR1719167_330163 -IDLTDKERELIQHTWWRFREEpYCRLRIMTHYFSANSSIKKKFQR-KNEENAAngNlmtAMVSWNIRRFSIRLVEFMDKVVRDLETEnyqdiYDISELQGAKHYRlKRMVEPGDMEALGQSIQTTISEHFGEKFNRSHILAWRRLFIVICSRF----- >tr|A0A0T6BC68|A0A0T6BC68_9SCAR Uncharacterized protein OS=Oryctes borbonicus OX=1629725 GN=AMK59_2266 PE=3 SV=1 -TGLTSQQKSLIQSTFNVIRPhiLNVGIDLFVRVLEVEPEHHRVLP-FSHIP-IADLHESFEFKFHCLAVVYSCSAIIDHLHDdgiLIPLMKKYASDL--KASIPLDIFQMIHDPLLEALDVHDDVKISEEALEAVRTLLRNLTNFLI---- >ERR1719199_1566639 ---------------------------IFQHSGIQRPVFSTSSSSR-R-------------LCRP-CDLSMAFRPSDVLHSstrLKAQVETMGFGHL-HLDVTPARCKLFHGALVDFFVVELGDKLTPLAAEGWKRVLTYVASGLM---- >ERR1719362_342361 --RLSASAVTFLRSSWEHVPKDSFGMEFMKRACSEEPSLSDVFDC-P-V-------------ARPDNLAKVVQMLLDQAEielvprleRLAHGIAALSFKFG---KLRMSHLAPMKRALVRTVVAFAPGNQKAMTNRAWEAFFYAIAAVVA---- >ETNmetMinimDraft_19_1059907.scaffolds.fasta_scaffold284136_1 # 1 # 639 # -1 # ID=284136_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.595 
--RLPKACVSLLRQSWKQVPQASFRKEFFDRLYIEDSSLQQIFQH-PMV-------------EVPENAWNVVQLMLDLLNvenvprleRFVHALAGLAFRHG---RFRLAHLAPIKRALVRTVTSHASKQEKKKLSQAWEAFFYALAAVAA---- >SRR5262245_21272653 ------QNVEVFRASLKRCLaAPYFMSRFYDLFMGSSDEVREHFGD-TDFK------VETRVLADSLYLMAVIAQ-GEAEAPAWTEMSRLAKRHSKaELDICPELYDLWLKCLIEAARLHD-AQFSEAVEQAWRATLAPGIEYLSSRRX >tr|A0A2A4SWC3|A0A2A4SWC3_9GAMM Uncharacterized protein OS=Thiotrichales bacterium GN=COB61_05140 PE=4 SV=1 ------MEFQDIRTSMGRAItHGDLFGRFYDIFLASNPKIKSMFVG-TNLE------TQKALLRQGVNLALMFAE-GKAIGK--SAMNRLRDSHSKsHLGIEPSMYRYWLDSFIKALKEFD-PDFDSALEKQWRQALGAAIEHIAAGYS >tr|A0A1R1LTH4|A0A1R1LTH4_9GAMM Globin OS=Motiliproteus sp. MSK22-1 GN=BGP75_17400 PE=4 SV=1 ------DFEHIFDSSYsrvlAVTYnKQGFFETFYQRFVVADEKVSELFKN-TDMA------RQQKLLESSVYFLRDFYT--TSYAD--DVLQKIAILHSKrVLDIPPALYDLWLEVLLSTVSDFD-PLFDENIELAWRLVLSAGITFMKFKHN >tr|A0A2A2KP63|A0A2A2KP63_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_06989 PE=3 SV=1 -SGLTREEKRIIQVCWFKCNqkqLRKCAEDIFADILHMDDDLLRLFR-L-DHIQSNRLRDAEFFKSHASNFAIVLSLVVTNLQEhVeqaCEALQNLGRQHAA-F--LDKFFQSMyWDTFTDCFERNPPPAFRKgSEREAWSRMILFIIAQMKIGFQ >tr|A0A1I7TYQ0|A0A1I7TYQ0_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=3 SV=1 -SGLTRDDKRIIETCWFKCSqkqLRKSSCDMFWDILHTDEDILRLFR-L-DHVSPNRLKDNEYFKSHASNLALVLNLVVTNLQDnFeqaQDALQALGYQHLH-L--IDRtHFQSMyWDIFTDCFERNPPPSFRKgAEREVWSRMILFIMGQMKTGYQ >SRR5215204_501118 --RVTRRDWQRLLENWERLQpsADRFATVFFDTLFAWEPQARQLFGG-------------ATLETQFLRFAHLLTSLVSAQDHpdeLDRRIDAVIRCFA-GGDPPRKREDAIRVAVAAMLNDVYAAGITPETRASWQSAYIGVITTIRS--- >tr|H3NRG3|H3NRG3_9GAMM Uncharacterized protein OS=gamma proteobacterium HIMB55 GN=OMB55_00005550 PE=4 SV=1 ----SQSDIAIISESLTLCgdCLEDITPHVYRRFFELDASAASLMEYS-DEH------------MRGR----MFASVLELFlsddpFESDGFLAWELDNHVSSYSVTKSMYESLFKAFFEVAEETLGEDWSGDFERAWTNRIARIMAEVS---- >tr|A0A2V1ABH2|A0A2V1ABH2_9ASCO Uncharacterized protein OS=[Candida] duobushaemulonis OX=1231522 GN=CXQ87_003270 PE=4 SV=1 
--QLSTADRNKVRASWGDAMaakdykTEQVIHEMFSSLIEQSEDARDLFEN--KK----------VRAQQETLFAEIMGFTMMYLHNitvLDECMNEFIREnpHIVRCGV--RYLEPMGAVLIQYLRQTLGPQFHAGLETLWVQTYIYIANCIL---- >ERR1719396_219344 -------------NTAAAVAPkaLDITKTFYGGMLQDYPELLAYFNPAHNVP---------ISENQPMALAGSIVAYASNIRDLSPllvpngPLMAICHRHC-ALCITPPQYNVVHENVMKSIAKVLGASSRRRSRPPGARRSSSSRR-PA---- >ERR1719396_178111 --------------------------------------------------------------------AHGPGRLHRRLREQHPglvpaagaqrPADGDLPPAL-RLVYHPPAVQRGARERDEVHRQGPGGVVTPEIAAAWSEAVLFLSKACI---- >SRR3546814_8055804 ---------------------KDITPFFYDRFFALYPEQRANFYHFES--------------TSGTMVNEMITSVLALASNearSEEHT-----------sELQSLMRISYAVFCLKKKNKT----------------------------- >SRR3546814_13566968 ---------------------FTIYTTLSLNVVLPFVTHRSNFDHVES--------------TSESMVIEMITLVLALASKeawLTNSFQNFVAALR-SYgDIPPDAYARLLDVLVVTLAQVAGSRWTDEFETAWRWYVSGM--------- >ERR1719171_2136978 ---------EAIRITVPMLEeigLENVGQVFYGHLFTESPQIQMHFIK------------------PNRMLAYIVRKAIFMVRDlhpkpkeVMAELKPLALRHI-KYDAPPELFADFLVSFTKTLEENLKEGFTTDCAEGWESATNFLANTITR--- >ERR1719171_2291403 ---------PRIcgelwrkqtfklrfnilgkqihspgiPRFFQKMEnvgGLLVSalllaMCFYDPEIvAHEEQIGIHIID------------------RNDAIYYVLEACNACILWllvtnVFGFSvQLSAFKHC-VSQMaeDLAKFGTFAVVFLMAFGCAIhiTMPYDPDFEDMWVTILTLFAI------- >UPI000297C1C9 status=active --ELDEYSIGEVRNGWENLERRCGtPKAAA-EEFLHKVSAAIPKTE--HM------------QKRASTVWSKLNGLLASMHDqsmFTGQLEYLALRHM-NQDISAAEIETFKGLLLEFCASKLGGMMTPEFQYGVSRLVDAVGASYQ---- >ERR1719334_589756 -IMLSPAAIQAIKSSWQHV--KNVGFQFFGHLLfsfwlGNQPRALEIYCLHyhGDKR-KGVVELLPRFRRLGEIYAKRIDTWVSHLDDPftlFLILYEHGFNPP-KKavGINEKDFELMVPSLMDAISSAMGSKMTHRLFEQWKSFWKYVLTQIAEG-- >tr|A0A0E9N6V9|A0A0E9N6V9_9BACT Uncharacterized protein OS=Flavihumibacter petaseus NBRC 106054 OX=1220578 GN=FPE01S_06_00290 PE=4 SV=1 --QMNQQEIQLVCQSWQQAAeePLRLAILFFDRLFEEAPELRQVFRT--PMS------------EKTRQLLVFFGFHINRLASgsIrRPSFEAYVW----EELLTDAQKGFLMETLSDTVAALLKPDWTPALQGAWGSFRK----------- >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 
SV=1 --------NDLVLSSWDIVRqrteVQELGEKFWKYLNCMSPEQTNLFRR--SL------------SMWGhllHHIVNMLLISITDPEEYYDLMFELTIRHI-RYGVRSEYLNPFGNALFATFEEILSDVWEEKTTKAWKLVWKRATCNMSRG-- >ERR1719242_319529 ------EYKNVLQSTWTKLlqKKEEIGKRIYESIvFDTTC-TT----T-GTSLSTSIIFENTNIGQSASRFMDMLDTVICKLDEpdaLVQKLEALSAFHSSNFNVQKRHYIDFEKGFMKAIKWELGAQRTILHDRAWRWFWNFLISKMC---- >KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1947561_2 # 429 # 647 # 1 # ID=1947561_2;partial=01;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.584 -------------------------------LFETNSDIKTMFAKLKDYETVAELRSSKILEDHSMKVICTIDDAIANLDDMeyvNRMLQTIAQAHSTRFpNFDPEFFM------------------------------------------ >SRR4029077_13489679 -----------VQADVHAISvm--LNLMQPFRALRRRVDQFAKLWL--DPL------------WKTGRKAARIPA--TSTSITGRtgfAGRGRTGKAAC----------------------------------------------------- >SRR5579859_1650388 ------------------------------------------------------------------NFLQALHTILLKMQRhdpsVFQFVQQLGARHE-KYGVTREHFRLVGGFFLTVLQRYVGVLWTRPMQRTWEALFGVLTDVMLFGY- >tr|A0A0N4ZKI8|A0A0N4ZKI8_PARTI Uncharacterized protein OS=Parastrongyloides trichosuri PE=3 SV=1 --GLTYYQIQAIQRAWRHMSkagQVSCGRQIITKIYKNNTEIRNIFQTYVTIENLS-INQMepveWGVLKHGEEIVNLLDYVIKNLNNIemvEEKCEEVGRSHRKmkQYGMKEEHWDSLGEALSETIRENYG--------------------------- >ERR1719326_2865515 --NMPPEAIEQVKATWTKLLsmttHIELGSLMYDALFEKLPKIRSMFVS-------------PRL-ATASRGETNIDRIFGSFSKSas--------------YMrdpssMX----------------------------------------------- >GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold4300996_1 # 1 # 264 # 1 # ID=4300996_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.629 ------------------------TQAFYEEYFRLCPDSRDLMKHV-DEH------------VQGRMLASVHELLMLPDPDEQaRFIAFETQTHR-SYGARRYMYDRLFRALRSVVRDVSGDDWNPAWTTPGIAASRPCSRAST---- >ERR1719174_1428107 ---------------------------------------------------------------------------VVDCQDqrsTLGYPPSAST----SVRCCVEQVARRaflwrkswfLTTLTIFIAGQ-AiLKYSHLDNLATERLLVFLFRAFI---- >ERR1719284_2194575 
----------------------------------------------------------------------------SWREStssMRPCPPSLKL----LGIASL-------------------HSLKLDEKLEFGNGdIGLPGGIQI---- >ERR1719277_1813735 ----------------------------------------------------------------------------------CMCAAETRIAHL-IGRASVANMHNLRNAVGSEVCLLSSlAIRFEANHVGWAHVsvadvVAVCSSISL---- >ERR1719310_1375130 --MLPQEQSQQLQQAWALVinmsgNRDALADLIYSAFFYRLGePR-APLRNPA--------------GSRSLPFLHGHQHLRRQLRrPwssaqfrrnveLRSHVLGYHRPSG-EHHSX----------------------------------------------- >ERR1719310_407492 --ILPLEQSEQLQQAWALVinmsgNRDALADLIYSAFFGASASLEYLFVTPR--------------AVAAFRFFTGINTFV-AFCgDpaqLRRNSQLRSHvpGHY-NSSCEHHPX------------------------------------------- >MEHZ01.5.fsa_nt_MEHZ011529165.1_2 # 173 # 307 # -1 # ID=206391_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.393 --YMSIDtgnleaakvmlqdlvtiradrsryyyclddlFKWHPDIVWKLTv--------------DAPELLrtmldGMIWRSRV--------------VvngnrrvnyylkhllvDEHGKFSNAM-SCIVKLQDpEIAIHPILVQ----LGDLVWNDLVYWrflrgklslVCTAGIFMVSQSMl-QYVESAGSFEERVATFICRLVV---- >tr|A0A067CC73|A0A067CC73_SAPPC Uncharacterized protein OS=Saprolegnia parasitica (strain CBS 223.65) GN=SPRG_06598 PE=4 SV=1 --ILNTAYLLDCSKSWKLIVtantdrMRQYgksgivlfYDEFFFRLFQRDFTLEEVFP---DI------------GKRGEVLVKAMTFMLKSSaENpkqIVNKCHYLGHRHRSFGGVRPHHWAQYTSTVIEVIMYWLGEYASPDVGAAWSNIVGFFLMHILESF- >ERR1712194_94606 -----------VQDTWISATctfeyKECLGTQLLYNLMHIEPSFLDAAPFFDNTVLLGDGFDDESLIQCAIYIVQCITELVTMLDKyHEPKFRILINSHLSrlaKYNIYPSSFAKVAQALLMTLSDVMQEEFTKKVESYWMSVLIILF-------- >tr|A0A2M8U0Y4|A0A2M8U0Y4_9PROT Uncharacterized protein OS=Ferrovibrio sp. 
OX=1917215 GN=CTR53_17535 PE=4 SV=1 -SPLSPAHLGLVRATFQILAadRDRLTEMFYARAVALDPHIQRPQ-----LV--------SNMVAQRLQFMLVLTDVVQQLDDLpslAQTAATFARRHG-TYGASDPRFRTARAALAWAVDRILETERNSAIQLAWNAAFDLVEALV----- >tr|A0A1I8F573|A0A1I8F573_9PLAT Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1 -------------------------------------------------------STNQKPPSDGDRLLYWINVQ------ptAQPQLLRGASEGC-VRLFSPRILTRSCISSNLCVRAGRGRNS----SSTeTTSAEGADAVVAA---- >SRR2546429_8650734 ------DAQYLLTESLAVLRpyADELVAEFADRLATGHPALGAIFEP--RL----------------LTVLLELAATYDRPQGLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTLRDFPGAAWTPAHHGARVRAYAFAAATMM---- >SRR2546423_13669166 ------DDQYLLTESLAVLTpcADELAAEFADRLATGHPALRAIFEP--RL----------------LTVLLELAATYDRPQRLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTPRDFAGAPGAPAPHRAGGRADAVAAAPPK---- >SRR5690348_18181078 ------------------SrrRHTRWTGDWSSDVCSSDLETRALFRT--EGS------------ELVKG--SMLAMTVEAIIDFAgersGkfrMIACEVMSHD-AYGTSRELRSEERRVGKEC--RFGWVAYPX---------------------- >ERR1719323_1074371 --LIPFEQRTLITEVWNVLQestIRYVSNTMFLpLIVRSNKSLQKCFAALDQSLHGMELVECygSkfDRTKHGSLFLSKlLIRVVPNMDQmdrVLPYLAELGALHQ-RHGVAKQHIDLLGLAFCAAIRGVVAgggvkGGHLHETTKAWITLIQAVCTGMKMGY- >tr|A0A1I8C1X6|A0A1I8C1X6_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=3 SV=1 --DLSPHQIGLIKRAWKNLlksvNENEIAIKLLLRIFQLDPRNLAYFSL-NEYSPFDeyLIKENNIFINHVKTFESTLINVMTHPGNatkLSKHLQQLGGRHVNYTGVTykCSYWKCFIQSLIDVLTLNKDKNTSEDLHEAILILGEFCVEQMKIGY- >tr|A0A0N5CQY3|A0A0N5CQY3_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1 --QLNAPQLLLVRKTWAHARSqGalEPAMSIFRNSFFKCSEIRSLIMN------GPKNEGHERLKSHAKAFTEIMDQLICGLETkelIMYELRAAGRSHIFLprdatdnkskgCTFRLAHFEHFASAMIErTLEWGEKKDRNETTQTAWTKIVLFVTEQLREGYQ >SRR4051812_28599342 ------------------------------------------------------------------------------WVRprsRGGRSPRSRSSRS-SARRWPSGRPRPPSTS--RPDMRSGPSscgmsrarwqsifpapsrtgcasPIGVLGDP----------------- >SRR6516225_8820395 
---------------YSVHCegKTNFYRLFYKRFFDKPPKWRTFFRK-HKIS----------MARQY----KLLDQAVASLANFHigaepTSLSHVARVHA-NLQLGREQYAMFTDSFLESISEM-GEK-DED--------------------- >SRR3569833_2822653 ----------------------------------APPERHTVLHE--AI------------VTNPVEVAGAIGWVVEHLHRteeVATACGELGPALARLLAGHEQHLDACGRSIIDAIRTGLADRWKPEFDGATSSAWELVAEWLRRG-- >SRR4051812_2284027 ----------------------------------TLPEMRTVLHD--AA------------IADPHALGRAVVWLMDNLTRpfvVTAGCELIGPALGDLLAEHPRDLEAFEPALTDAFRTALGTAWKPDHVTALHQAWDLTVKW------ >tr|L8JU91|L8JU91_9BACT Uncharacterized protein OS=Fulvivirga imtechensis AK7 GN=C900_03083 PE=4 SV=1 --TMEIGKITLVQNSYGRCL---SSGKLLETfyenFLSSSRDVADKFR-------------NTDFEQQRKLLRHGINLMIMYAaGNIagQTGLKRIKESHSRgRMNIEPRFYALWKAALIKAIAEHD-RDFNVEIKAAWNEVLDKGIVLITEGY- >tr|A0A1Z9IBY6|A0A1Z9IBY6_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium TMED162 GN=CBD22_07770 PE=4 SV=1 MVGVTQTQEQLIEQSLTHYAarHGDPYDAAFQKLYAAAPHYEGLFVL--DTD--EGLR-----RNMMRTTLEMIATYIDDAYAAENLVTGARLVHL-TYEITDD-FDLFFQITRDVIAEGCADIWSDAHAAAWNTMLKDF--------- >ERR1719295_1776256 --YLQPQEIVHIQGSWATVErqLFNLGARVFISLMENQPNIKRTFRQYRNKR-HSELRINEDLQKLIMLLLCGMKRVVKYLNDtkaLTKYLKRMAKRHSPTeidfARINPAEVASVFCAALREIAPAEKDQWTQEVEDSWTSLIGGLLAA------ >ERR1712029_417561 -------------------------------------------------H-GSDWKV-VQVDRIILI-FRTIT--------vIIVRVQSVEKDHI-hT--------RKSF---------TQVLKVETVVEDSWTSLIGGLLAA------ >ERR1712071_338654 ---PTAEEIALIRESWPIVKkNKNVFVEFVLEHFRVHPKTQDLLPEFANLAI-ADMPSNKFFVQLTEtYVVMAMQEIIDNLDNagvLTDLLQCLNSNWYVDyVSLDRQN-RETLRIRRVGQEQKSYSRNMESneiQQQRCPQNLRQAVH------- >ERR1712179_849736 ---PSAGV-------------------------------------------------------------------------PVNKLEENEDFQVLAyYSSAVATFivtnLDQEDILTHILVQQTKP--------------EQFVD------- >tr|A0A077ZE79|A0A077ZE79_TRITR Globin OS=Trichuris trichiura OX=36087 GN=TTRE_0000613901 PE=3 SV=1 -------EWYNFKNFWKTVQrnKDNCAKLMFFKYLEQNPDLLQAYAKLRNMEMNeETAFNNSDFEHLANQYLDVFDEAITTIEsnpgDvssVVEELQNVGKRHRRIscieassfavtttvskDWLSVAILQKLQEGFMEMARQVLQDRFTEKCENSFGKFFDFVAKNLQQGF- 
>tr|Q7M422|Q7M422_9DIPT Hemoglobin V OS=Tokunagayusurika akamusi OX=28383 PE=1 SV=1 -VGLSDSEEKLVRDAWAPIHGDlqGTANTVFYNYLKKYPSNQDKFETLKGHP-LDEVKDTANFKLIAGRIFTIFDNCVKNVGNdkgFQKVIADMSGPHV-ARPITHGSYNDLRGVIYDSMH------LDSTHGAAWNKMMDNFF-------- >ERR1719253_2317543 ---ILSPAGRVLRLRGPGFLpprcrfgrlspnhccsrvspdriavarrPPPRPRSRPTSSPSPRTSTRGc-WAATRSC----------CSSSTrpttspsprt--SLR--------PSPAPSrptPPTSPTC-LPS-WSPAGPWRPSVTA----------TSPSPSTRCSTSWCTTTSwrpsprswatssrrrsrpagprPSSSSPRP--- >ERR1719253_507459 ---LSQSAIDVVVSVAGRDArrARPRAGPRR----------TDp-WRRRRRA----------ARGG-gpgrragevqtraaegASTLGHGLVR------RGRalgHGLVRHGRGHC-HDS------------------------------------------------- >tr|A0A016TEH5|A0A016TEH5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=Acey_s0110.g162 PE=3 SV=1 ----------------------DTAGEYHKQLFTLHPEIAKYYDA-EDID-PDSIPKAQKFIMLGQQELQFFFRLPDVVDNerqWRSALSSFKE-TFGDNNVPMSEFNKVTDAFLAAMQKNAGG-VTPEQKKEWEELLAKAYADMK---- >tr|A0A0B2W4R6|A0A0B2W4R6_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_05310 PE=3 SV=1 ----------------------DTAGEFHKQLFKKHPDMAAFYDA-EDLD-PDSIPKSQKFIMHGMSELQFFFKLPQAFSDerkWRSALSSFKD-QYEDVGVPMKEFNKTTDAFLAAMEKNAGG-VTAEQKKDWEELLAKAYADMK---- >ERR1711965_451221 -----------------------------------AGAVR---------P------------RP--------AAVI---GFPFPLFP-LLETADMtsvAVGAHPRLRA-----L-----LRDR-G---AWYLTGPQELASVIGRLERLER >SRR5882757_2588511 --SLSSRQQILARRFFDAVEAsdKPLAAMFHERLSEIDDRLDGLLL--EEE---------GCLLREAMVIVRTLSRNVDRLNRMVPIFRAFGRTCA-AQGIASANYEKIAPVLFWIAQECVGSEFSVEMGRALTALYDQLSREMKD--- >SRR5262245_14724532 --------EDVVKKAYQRHCYrqPEFYRSFYENFFSRVPKARAMFK---DMA-----------RQHE-----MLDFALGQLLNysqqqSEpTTLTQFVERHS-RLGLTADDFKRFGEALIATFDSELRGdCEHHRTMAALEIVI------------ >tr|A0A183IYP9|A0A183IYP9_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 ------------------------------GLFTSSPEIRSLFPTLVDW--GDDIKTCQKFRNQGLKFVHVISLSLTTLHDkehLDTLLKEIGTRHVEfmPGGIKMEYWDIFEKAMVKCILQQIRwtDDFDEAIQskaaIAWRILCAYIVQKI----- >tr|A0A0C2M2P6|A0A0C2M2P6_THEKT 
Uncharacterized protein OS=Thelohanellus kitauei OX=669202 GN=RF11_12769 PE=3 SV=1 --FLTLEERLKLKESWIKIYqkiqdlPdVDITFEIFVRLMERRPEMSKNFE--KDV------YKYSRMKSHSDKMLVILNNMIRNLDDeqkMLKYLSGMVRRHR-NYGIRQGDCKMWEEIFLDIISR------------------------------ >tr|A0A1I7YD88|A0A1I7YD88_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1 --LLTLRQRKILQRSWNKSQrtgLDNIGAHIFLKIYAKDSSVGYLFN-LGNCP-HSELKYRKFFQDHAMTFTRSLDFVMNHLDDLErvsKFCVELGKTHVKfmRRGFKTSFWDIFAEALTECAIDWEGGLRCRDVLNGWRTLVSFVIEEMRKGF- >SRR5262245_33555564 --------------------------TFYEHLFEGAPELRSLFPI--NM------------AAQERKLLLTISVVVKNLDRdeeLKRLALHLRDVHE-GIRIEEGHIEAFLGSLAHAFQQVHGSPFPRH---DWLTLRRAV--------- >SRR3954452_7277257 ------------------------------HLFQANPEIRMLFPI--NM------------AAQARKLLLTISVVVKHLDReteLQRVALHMRDVHS-HIRIDEGHIELFLASLAHAFQQVNGGAFPHQ---DWKNLRRAI--------- >tr|W4XW92|W4XW92_STRPU Uncharacterized protein OS=Strongylocentrotus purpuratus PE=3 SV=1 ---------------------------------STHPEDSLHLHQ--GCCSHLASRESCRFVDQAMQVMQTIGNAIQNFDNKelfNTNMKELGLLHC-PVRDDtlavIHNHEVFKDALYNTLRKSLTESLTPEMTFAWKAF------------- >KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold7330878_1 # 87 # 278 # 1 # ID=7330878_1;partial=01;start_type=ATG;rbs_motif=GGxGG;rbs_spacer=5-10bp;gc_cont=0.391 ------------------------------------MASQTQFvygDE--DTVMACLTKESCRFLEHAMSVFQSVGGLVTSFADPpsdRKFNLDLGLKDQ-PKDVQDRHYKVFMKCLLKSVRFHLADSYDLAMHFAWKAF------------- >SRR3982751_838383 ------GINDQLRESAAMLTsgGteatDAVIRDFYIALFRNAPSLIAIFPG--NPAQGDFG-SDHRGAKQRELLLGALAGLADLYdpgdaermTHLDSVLKRFGRSHAAFtrpdgtvSGATLDEYKAVKDALFSTLVRAAGDRWRAEYTVAWSQAFDYAAASMLL--- >SRR5690606_20444479 ---------DIVKQSFERSkQRKTLATIFYQNLFFLKPKIKNYIKQ-TDF------------AHQEKAIMDEMEFLMAFLDDkdrhARQQILRIAGTHSAkNLNIHPHDYYYWLEALIMTAKEC-DHLWRDDFQYYWRECLSFPLTFIISQYY >tr|M6F3R8|M6F3R8_9LEPT Uncharacterized protein OS=Leptospira kirschneri serovar Bulgarica str. 
Nikolaevo OX=1240687 GN=LEP1GSC008_4081 PE=4 SV=1 KMNISENQIRSLNESFDIVNLDriKFAELFFIYLKENHPKYENIFSRI-QL-------------EDVKHFMNSARNISLSsVQYsqLERAIQNFGVECL-KICNQAEEIPILEKAWLFALEKWLGPWYSHEVEKSWQEVFKMIHTSS----- >tr|V6I1Y8|V6I1Y8_9LEPT Uncharacterized protein OS=Leptospira alexanderi serovar Manhao 3 str. L 60 OX=1049759 GN=LEP1GSC062_2771 PE=4 SV=1 GMNISENQIRNLNESFDIINLDriKFAEIFFVYLKEKNPKFENIFSKI-QL-------------EEAKSFMNSARNIALSgAQNvqLEKAIQDFKMECI-KICNRTEEIPLLEKAWLFALEEWLGPWYSHRVEESWQKIFQMLYSEE----- >ERR1719272_197188 --SLSATQRASILASWRQLCGEDGGATfcasLLGGAFEAVPETRALAGV-PEAAPEPeAvpeaeaavaapapapakgkagatavpeaaaaveeaaeeaveSAESVALRAAAAHAAVAMEIMAQQLSapeALKESLTELGVKAA-SRGLGcGAPFDRLGEALQTTLQASLGDeAFPEALAEAWRQLYAQASQEIQLQY- >SRR5262249_23394332 -------------------------ELFFSRLFAIEPGLRHCFDG--C------------FLGRRRAFEWMIGAAVRGRPDLRSFIQALEFMVAPSDATVHQECERLRDAFISSLSGSLGPRFTVEMMNGWLAVFELLH-------- >SRR5438034_714626 --SMTEASIIAFNESFERCMaSGRFFDVFYDHFLRSSPEIAAKFQG-TYF------------NRQKRMLNQRPATTVGQpr-------------RSAReSRKTPAAQFVStcqampsaFVSELTKSGSTX----------------------------- >SRR5258708_7736634 ------------------------------RFTGTSDAIREKFKN-SDF------------AVQHQAMADSLYLMAVSvqggPEN-LARHDMKRLYPKHqRMEITASMYDVWLDCFVATARIH-DPECTPAIESAWRECLTPGIAAMKSGA- >SRR5690242_5369812 --LVTEDDLALFLDSFDSCVaNKEFVARFYEIFLSTSPEIRALFAK-TDF------------HHQRRALKASLHVVAACaarrRAD-YSALDELADR--HrELRIEPRHYAVWQESLLAAVSEC-AERWDPDVERVWREGLSEAIAHMAS--- >SRR5512134_285705 --ALTPTHATLVRESWARLAPGrAAAVhRFRARLEAVSPRTAARFTCL-DH------------EAQRDGLMIELDQAIAAtgsDDDLVPALARIARRFR-ESGPASSEYPMVRDALLEVLAEADRGIAPPELRRAWGSLFGLLAALV----- >ERR1719232_1195758 -------ETVIIKDTWETIHkqVKAIGMEAFEKLFALNSDMSAYLPQTDDLDQDETRRLSDKVKSHAKLTMETLEQVIAAIPDMTEvynVITKMKKLHP-----QTGLLEVIGPVFCNTTRHFLliQGRWSLDVQRAWLALFGEVSAMIRASY- >ERR1719189_1497217 -------GRQADEQ----VGreEAGPGHRGHRP----AQDDPAHLRgarDCGQRVRGRARRHGDRGV-QGRGQGEQS-QH--------------HRHQG-----S------HGQ----------lHGRHX----------------------- >ERR550519_213 
-------NIVLLRDTWSVIHrqVNTLGMETFQKLFEINSEVSHYVSpscpDLDPd----CIDSTTQAIKAHATHTITILHNTVSNLCNLgd--lagE------------------MNRLGKLHCDLGIDHGil---------------------------- >ETNmetMinimDraft_22_1059887.scaffolds.fasta_scaffold1682169_1 # 3 # 206 # -1 # ID=1682169_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.363 ---------GTVFSQWRRMKIEDFGECMY-RSLVQDASLEKLFRR-------------ERMRTQSLLFAAFIQVALCWLEErdfrkVERDMISLGLRHR-SYGIQPSYVCVFQIALLQTLCQNLNG-LSLQAEISWSVVWSHF--------- >SRR6266567_3650358 ----------------------------------------------------------------------------------RAPSKAWGsgtspmascqstipssersfwkpsatywesaglqrtmmpgrkptkgsarscwkgpthrsqpeqssrqchrydlwererqdkikkgeatldtkqaaqkgfeQQHA-VVIGGSMAGLLAARVLSTHFGQVSVieRDHLPDGA------------------- >SRR5579885_1989414 ------------------------------------------------------------------------------------------xmsnqqssrsgfgGQHA-VVIGASMAGLLASRVLSEHFEQVTVieRDQLPQEV------------------- >SRR5579864_4130097 ------LQIELLETSFQAIApcGEAFVTAFYERLFMRFPQTRAFFAS-AE------------RNIKHVLAKPTIVTTLQPTRSascRTTRIT------F-PSSVGTAGVPISRS------TGYAGs--------------------------- >ERR1719414_1806988 ----TVAQAEKVVAQWDAADQDAFIVAMYQAMMKTHPEWRALFNK-PTGA---PTPAEAEWKKQFDLTKAVLDRGLRsratDVDALKERMHAMAGRHV-NYGVTQTHFQALKPILTDVLAATVTG----ADMDAWSAVTYFMLDSI----- >tr|A0A090RS91|A0A090RS91_9VIBR Uncharacterized protein OS=Vibrio sp. 
C7 OX=1001886 GN=JCM19233_1279 PE=4 SV=1 -----------------------FLTFFLQHFCSTNPRFAERFCGV-DS------------EQQTKMLKASIILVQnaAENPYIRNNVKSLAKRHKEmNLNIKPEELVAWRESLLATVANFD-PLFDDDIDQACAQRWN----------- >tr|A0A139A347|A0A139A347_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 OX=1344416 GN=M427DRAFT_73171 PE=4 SV=1 --MLSAEQARLLKKNWKDIGASsvanpmmFVVAQFYRRLLRK-KGYKRIFEGI-DI------------ETQYFKMQGALTACVEfaeNLDKFADTIRRIGARHA-RYNMTPNMMNDVVDSLVPSLKEFsldHGITWNEEIEEAYDEWLEQVTGYF----- >SRR5262249_57009646 -------------------------------------------------------FRKTDFPRQTRVAADTLFlmaVAAGARDHavAWRGRDRLPGTPPPpGLHSSPRHHPAQLVCPL----------------------------------- >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 GN=TSPGSL018_8354 PE=3 SV=1 -----------------------VGAGFLKLYAQRNPWAVEQFS-FG-LR-----------PQHAEKMGLALELIVNSATRpqvLQHQLRVLALGHV-QMGIKPEMFKSFEEALFAFLGQVLGAhnTFDEETEGAWRWMWGIVNAVFTQ--- >tr|A0A090LKP0|A0A090LKP0_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_2000335800 PE=4 SV= -EELPKADKDIIISTYNILL--QADPELFSKAWimsaSRSTSIRKAFS----LIDP----NSTHIEVDFTKFSAVIERFFTriiceeKLVNesFEKSCINLGKKHVDfvPIGFHSNYWDIFMNCMIDVIAETVIIAFNEdnkqqqQVQKCWNKFVGRIVFLMQSGF- >tr|A0A0M3JT43|A0A0M3JT43_ANISI Uncharacterized protein OS=Anisakis simplex OX=6269 PE=3 SV=1 -RSFTTPQLTSVFNAHFSMI--QLNPDVIKDCWiktsKRSSSIKKAFG----MLEH----EEPETNASFMNLPITIQAFFKelifelDCDSvkIRQRCEQLGARHVDfsERGFHSNFWDIFQVCTIEVIAEC--NLGLNedqhrSYELAWIHLLSSVVKSMRNGY- >tr|A0A0A9Z6R2|A0A0A9Z6R2_LYGHE Neuroglobin OS=Lygus hesperus OX=30085 GN=NGB PE=4 SV=1 --SLEEDEIERIKKSWVLVKEndfrfiDILRQEMLCDI----MMYELYFNPG-R-KADVCVSELTEFKNHPKNVYSTLDFIVGDLENenvIIEKMIEIGKNHG-RLGISRKHISFMTSTIYQAVECTIGPcMFDRLVDQSWEKFLTSFND------- >SRR3990167_8699843 -------------------------RLFYAHLFAKAAHLKPLFG---DSE-----------DTQNFKVIKMFELIIDNVEDLtqvQPICLDMAKRHS-FYGVKNDFYQYIDEAFVWCIQQQLSLSIQDPIIHAWYAATKYISSIMID--- >SRR5690606_19766530 
----VSDQYTDLQQSFGRCLrDKNFIERFYEVFMASNAEVAAMFAR-TDF------------QKQRLALRRGISVAIFHAAGssVvKRSMQQMADVHSRSgrCPVAPHLYPYWIDSLLTVIAETDA-EADEALLARWREAMGVTIGTFIGAYN >tr|A0A023F5X6|A0A023F5X6_TRIIF Putative globin (Fragment) OS=Triatoma infestans OX=30076 PE=2 SV=1 --ALTADEKEILKESWKNRgiNKSTLAMMWFTKLFKANAEEIVEQNR-GQV--VEELFMDEANFDYVDKLADIFNIVVKNIHKstLcTKLIWEIGMYHC-CLDLRDGYFELMKETLLDTLKENMQPPLTSEQIEAWKKFIGVMFDIVHE--- >tr|A0A0N4YMT1|A0A0N4YMT1_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis PE=4 SV=2 ---LSLEVHDLARAHWIQLHkLNRQSnliQNALLYIVENYKHTRPIWQ-FGlGIDEstkdwKTLLFNNFYFRHHSASIQAAITMVMENMDDrdcMKKLLNEIGAHHF-FYDACEPHLELFEQGMIHSLRTTLVGhvKIDESTEQSWTLFLKDLKTFMGEG-- >ERR1719326_703414 --------------------------------------------E-HPM-------------IPITMTEES----VKLVQDsl-SRVDSLVQV-----RDALQDvFFPHLF--------------------------------------- >ERR1719487_2229452 -----------------------AALSL--------P-------T-EQE-------------SPVTMTAEA----VQMVQDsl-RRVDSAVQV-----RDAMEDvFFPHLF--------------------------------------- >ERR1712176_999243 -------------------------------------------------------------------------SY-AHRDTfdqladAPRTI--FYTQK---------QGHPECSEMVEKMKNIVGDE------------------------- >tr|W8BTT7|W8BTT7_CERCA Uncharacterized protein OS=Ceratitis capitata OX=7213 PE=2 SV=1 -LGLTITERRSLQNGWSIIKqkQRRAALTIYVNLFTEHENLYEVFRSDGV-------LNIEFASQHQKEVLTVFQMIIEQVDNarfVKTMLKELALRHE-AASVTNTQWQLYTNEVRKYFLETLADAISPTFVHALDKLMNFVCN------- >tr|A0A1A9YF90|A0A1A9YF90_GLOFF Uncharacterized protein OS=Glossina fuscipes fuscipes OX=201502 PE=4 SV=1 -MGFTPLEIVALQNIWRLFKkrFKYHSMQIFLAFFNQNHKLIERFRLpSGK-------FQLNYLCQHSEKMLLLYENVIDkCLDNmanFHGIMADVTVSHR-HSGVTYEDVSLKSEHVRRYILDYFANQSSPTLVSALAKLSEHFND------- >ERR1719370_117345 ---------------------------------------------------------NATRMFPAKAALQESVEVmVDVLERrgmWGSGIRDAGISHH-KLGIKRRDMEKLATSILAAISDLLGDcDLDRKllQLNAWKKLLNAIADEFSA--- >ERR1719234_1549997 
-----------------------------------------------------SLWhrssiQLEGASNHNKALMNAIDSVmVEVLERrpmSKSGIRDAGISHH-KFGIKRLDMDKLTTAILAAISDVLGDcDLDRKmlQLNAWKKFLNAIGDEFSV--- >ERR1711972_141202 --SISETEKTYCIKEWVKIcsDRSKTGTLLLSHVYQENPQLLTH-PAWKDLS-QDQLKENQHFKNLAEKTMGSVEQILTHIDNVDkvaSMFEQQGKDYK-SAGKSMSH---IMACLETFLPLDHPSlEVTEEYRGITQEILGIIKQSLMKGYR >tr|A0A0N5DD39|A0A0N5DD39_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 --NLTPHQKQLLVQSWPQVQlynRIHGGDAMFARFCEKNSIARETFQKIAVVQSfASNEASESVLKKHEQYLVQLLSEAVENLNNdCEPLLReclDYGAQHVT-LHelLNETVWEQLAEAIIDRIHKVNLVRRHKDLSKAWTMLIILLIDKIREGY- >JI8StandDraft_2_1071088.scaffolds.fasta_scaffold105816_3 # 981 # 1154 # 1 # ID=105816_3;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.718 -----------------------------RNLFKIHPELKHALNI--EIK-KSGIQH-----VPLASIVFSYAANIDNADKFLVIIRHIVDKYS-SLGITVNDCPIIGSLLLDAIKESLGYAATTHLLAAWAEAFGLFTNALVQ--- >ERR1719199_1194134 -------HAGYIEKSRESVlnlDAAQLGADIHVKFLNVYPAAASLFQK--TLR----------M-LITTKIMGTLMAVISDPTGTLEDVRAVGVRHT-KYGISERYLLPFGAMLWEIVGTMLPGMWSDEHSAAWAFYLDFIASTMTRA-- >SRR5882724_2518483 -----EEVRRKARKSYRELQDSAFYCNFYAELFRAAPDVRQLFRNI-NM------------DEQYEKLHAAVGKLLNfrPTDDPNP-MSRHAESHE-RLGLQPKHFEGFRDAFLTALSSRK--TADNYAMDAWRAIFDAGIAYMTTK-- >tr|A0A2P8AX05|A0A2P8AX05_9ACTN Terephthalate 1,2-dioxygenase, reductase component 1 OS=Micromonospora sp. 
MH33 OX=1945509 GN=tphA1I PE=4 SV=1 ----------PDPQRLLAALgaPDQAADHFWSYMEDRSVRV---LP-----------------QQFAPMFFSTLAEMVARRGDpaaRRAELALMGRMYL-RFGLYPYHHTVVAAAMVDTVRRFAGASWEPDLAGYWEvgcrRSLRLAE-------- >tr|E5XPI8|E5XPI8_9ACTN Uncharacterized protein OS=Segniliparus rugosus ATCC BAA-974 OX=679197 GN=HMPREF9336_01410 PE=4 SV=1 ----------TFVRSFHlELFgaAPELAARFPPGLGEHRGGF---VR-----------------M------AEHILETFAEGADpprLIDLLGQLGRDHR-KHRLDERDYRLAQAAFAKALVATARG---SGDGAFAAraaaLVCQVME-------- >tr|A0A246RU09|A0A246RU09_9ACTN Uncharacterized protein OS=Micromonospora wenchangensis OX=1185415 GN=B5D80_01060 PE=4 SV=1 -------------------------MREADELRSALPDR---LA-----------------AHDAELLIATLRRLATD-PEpaaQAVTLTVLGHAFR-RFALLPHAKLISALAGAD-------------------VPVELLR-------- >tr|A0A085M5J8|A0A085M5J8_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_06691 PE=3 SV=1 -TCLTKRQRRCILKSWRKVqNKAQLGEEIYIQIFMQKPVLKSLFP-FRAT-PVNELHDNVLFTRQAVIFIDFIDNVVAYVGinNgrlLQELCTRVGISHALMtrVNFDPEWWYLFANSVLDGMQKFCLPNFSCEpiatyigsqSMLAWRILLKHVVEMMSDAF- >tr|A0A2C9LD65|A0A2C9LD65_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106067556 PE=3 SV=1 --QLSHKDKLFILNSWLNFrNgkrEEDIGMEAALEMYSIYPEIKDIFTIYRDARM-KHLTDKEMIRTHSQQVASVVDKCVMRMDDAHAfamIAVDEGSVHI---KIQERFMRCYVDCYIREIKKYSKLKWSRANQMAWEVFFDTIVVNMKNGW- >ERR1712086_1089461 -------MG---KEHGDGDSsadaNTAAGLDVMQGKKPEQKESKRWFSlgssaakgkqerS-----------KEEKEEKIADKALEMSAEMYKDPTRIQGETMGLGLRHI-MYNVDPAFFDALVTAYVEEMAVRTT--------------------------- >tr|B3LWC8|B3LWC8_DROAN Uncharacterized protein OS=Drosophila ananassae GN=Dana\GF16358 PE=3 SV=2 --GFTCVEKAALRNAWRLIEPfqRRFGKDNFYNFLTTHQDLIHNFRL--DPRSSDSPINLSKLHGHALAMMKLLARLVQTLDiNLqfRLALDENLPAHL-RRGIDPSYMKMLATALKRYILESsvIQNHNSSTLTSALTQLVSII--------- >tr|B5DW13|B5DW13_DROPS Uncharacterized protein OS=Drosophila pseudoobscura pseudoobscura GN=Dpse\GA26483 PE=3 SV=1 --GFTLCEKVALRQAWNLIRPreRRFGQDVFYTFLNEWYWSISKFKK-------GEDINIALLHAHALTFIRFVGALINESDPImfQVMINENNQTHS-RCRVGADYIAMLGQALTDYILKVLDKVRSPSLEQGLQRIVEKF--------- 
>ERR1719162_2542559 --------------------RSDIGMCVWNRVFVEDPKAENFFKQ-SN----------Q---RLIYIVTMAIKYSVEFYGDpekTKMAIEALALKHI-MYQVQPRMFMLFVTCYDEEIKARTDD---KLVQSGMHWSISIIASIMA---- >tr|A0A0V1BAT0|A0A0V1BAT0_TRISP Globin-like host-protective antigen OS=Trichinella spiralis OX=6334 GN=T01_2203 PE=3 SV=1 ---------------------MENGGQLLANVFKANPELRKFYDV-EDID-PDDTKKSRLIQQAGGNLLNSVTFMVNNYDNErsfKQEIKEQICDLR-EKGMKLEDARKLKTGFVNYVKSKLSQPMTAKEEKEWDMFFQRFFDALKQ--- >SRR6476620_89806 --------------------RHATRQQRRPDVF----------HER-QRTAGE------D--lnVLRERDVGQVH--ESLARAgvavIDGVVPRIGCEVV-DLSSEMQNG--------FPQGVIL-SAAVGVGDDDG---------------- >tr|A0A2W4R8Q8|A0A2W4R8Q8_9CHLR Uncharacterized protein OS=Chloroflexi bacterium OX=2026724 GN=DIU68_09390 PE=4 SV=1 --RLSRQQKRIIQRTFSAVAvrHDLVARLTIERLRElsRTPAS-TC---FGNTP------------EDRRRLMHLLALLVQRMDDRGA-LHDACVAQTRQMGCDPFeggSTSLLAEAFIGALQSALAGRFEAKTEAAWREFFQMVERVLR---- >tr|A0A0L0FDI4|A0A0L0FDI4_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12917 PE=4 SV=1 ---KTDSEVELIRSSWRALLaGDGtaaqmpllrFVEQYYKRLFRLFPDSRGVFKT-RDTQ--------------SKSLSLLLSIIINVADEpeLemNAKKKKLEMMYK-EYGMNSLLAVIAGRVLIQSLQAFLEAsnKFQASVKDAWVKCYTSIADQL----- >ERR1719203_545915 ---------LILKDTWAVIveQIHELGLPTFVKLFRLSANLRYYYPKHnRPES--TEV--QENINTHFDQLVAVVDDVVRCLPDLsthIQYLRNLGPVHC-DVEVQPRLLELMGPVFAILSDLYCWskadgvirLKWPGYYYFDILLDScemVTIQLLLDLX-- >ERR1719232_1194111 ---------IMLKDTWSGIieQMHELGLTAVVRLFKINYNLRFYNSPNvRYHP-TTHTNvkvlrgttaapatpaavasgstaaataagpsakdqatgksNLEDLSIVFNLLVSIIDHMISSLPNGsspTSHAGRNGksngtkakftlsaATMK-QLQILRQPTDWVGPVFCNTVRPLLLvqGKWSYQVEIAWRLLFRHLVRKNRTFD- >tr|V6U182|V6U182_GIAIN Flavohemoprotein (Fragment) OS=Giardia intestinalis OX=5741 GN=GSB_151570 PE=3 SV=1 -MPLSEDTIKAVEATADLVAaqGLDFTRAFYERMLTRNEELKDVFNLshQRDLRQPKALLDSL--VAYARS-IRKINELhelqeqglpvpAERLAELqgfFAVAERIAHKHA-SVGIQPAQYQIVGAHLLATIEERVTA--DKAILAAWSKAYDFLAHLFV---- >tr|A0A1R1LGI5|A0A1R1LGI5_9GAMM Uncharacterized protein OS=Motiliproteus sp. 
MSK22-1 OX=1897630 GN=BGP75_23395 PE=4 SV=1 --------LDKIYSTLQLLDdekSEKLINETYSIFFNAHPEAVLLWSK--DDPE-----------SRSKMFNGVILTIIDNLTRpdiFKNNLLSDVKDHD-EYGVDKEMYGGFFLSLTEALKKTLGSEFNQEMELAWKHQLAHIRE------- >ERR1740121_1123239 --------------------------------------------------------------------------------------------------vWIVVGSA----------SVrHR--LrAFGSASGSSSgRRLSGidY--------- >ERR1740121_2035324 ------------------FTplt-----Cqwa-----TPHDGPAQHVL-------------------------CEDGHFahFATDKCesAgHgA-RVQCPSDMPEMcaDttcgggqehccrpaggCTGgERPCPT--------TASASgSA--SgsaSGSASSRRLAgIDYE----------- >ERR1719271_1314470 ----------------------------------------------------------------------------------------------ghRqdeqhglQVPwCHQIPAVRGDC--PGLALQpCR--V---------HrREWC----------- >ERR1719240_2235476 ------------YE---DEE-------------------------------------------------------------------------GAqvdvmkgEDALVATADLLYQKMSEDAN---MQT-lLGNIELAELAsKLQKALa--------- >ERR1740122_169377 -----K------GE--ADKSgnAEAAGGgqGDTPETGAAQDTAAGV-------------------------TDEHS--------KaLGIEISS--FDELkvDqkciaaaIDAwKLFISTAESREAAGEAV---YNA-lFEGAPS--LQALFVTPRAE------ >ERR1719243_286169 ------------------------------------SHPVNV-------------------------LVSDTMwkGY----t-vRgIRRVNYY--VKYMmlTrdgnvsqALGwFKDAADCKIISH-PVNVLVsDT--MwKGIVRKQFLGgRLWFII--------- >ERR1719158_147189 ------------RV--CYLYplvhcNILAVLrelnfdGAAESLCLDAPALLPT-------------------------MLDGLIwrSR----vTeNgQRRVNYY--IKYFivDaeggfskTTEvMTDNGDPTIVCR-PVVSLVtDM--IwGRVAFRTFLYgKAWFLF--------- >ERR1740121_2502219 ---------------------KSFALEVFKRLFAMVPHSESFFKQ-----------SNTRLIFIVSRALDMCMNIYKEPTRLVNEITALGIRHI-MWNIPTTYFDPFVQCMLDEAIVRYGAS--QQAIEGLEWSMRIIASIMV---- >SRR5262245_17232684 ---VEEETRALARYSYLQWlDDDEFFSAFYESFFAGATGAKGKFR---NV------------EQQRLKLRDAMTAVLNFYpGNEPTSLHRLIAVHA-ARDVTGTEIEQFERSFLEVLHQRLVERKIaeqlgpdvvAKIEQGWRELLHPVVQYVMGV-- >ERR1712137_24889 
---LPRESITVIRDTWAMVErNVDIAPKMLLKMFQLYPMTQNLIPLLRGVS-LEDMPTNKRFLQLAYGSQFAMSAIVDKLHRpdmLEEIIG--GGMHAFVDGLSTS-FQMAaTTAlFNKIMTEELGSAYTAEAQEAFIATGDMMTSIMV---- >SRR5262245_32700325 --WLNSNQRDLIRRNWDSssK-RYELCRRIYCRVFARRPEIRRIFSIGYDW----------WRLEI-VTFADFVQSIVDNLDDAkrvRQSAFEFGRDHAKwrRFGFRSDFWVQLAESTTREcvyLDAAVH--PPDESLETWTKFVSIVF-------- >SRR5271165_4656598 ------------------------------XMFYKKPDLKPTFIeIGHhidpendggLT----------WEV-EAQRFTNLLTDLIGNLNNLdrfEELSFDWGRNCVQwrEFGFKPEFWLHFSEAMTTEclyMDQAVH--SVGEVIEAW---------------- >SRR2546423_8132340 ----------------------DVADEMFtARLLELEPQWQRVLS---DEP-----------TEWGRRLLRAIRQAVASFTClggFAEALRELGGVPA--AHVGYRDYERQGAAFVGRLEHSLDKPMAGAMRESWQRVFRLLAE------- >tr|A0A2A3E2S2|A0A2A3E2S2_APICC Globin OS=Apis cerana cerana OX=94128 GN=APICC_08732 PE=3 SV=1 -------------------------------------------------------------EAHCQNTASGCIDALDDVDLMEAILHTIGERHG-RRGQDRQQFIDMKGVIIEVMKDTLKSKFTIEIEAAWDRYP------------ >tr|A0A1W0WMU5|A0A1W0WMU5_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_09357 PE=4 SV=1 --ALTHVQINLVRESWRWLNFnrplQETAVRFFlDFYFKQNPDCLPMFG-MKTVD-----HYNKAFSIHALTVMHAIKYAVEYIGNpeqFQRLFRTVGQTHL-RFGLTDLHVERFLEQWLAFLRANDAKVFDAATVEAWNLAGRIVVSQI----- >ERR1711911_15016 --------VDLVRKILDKAKqNGNVAPKVFFKYFKAKPASMKAFPAISGLA-LSDLPRNGAFLSNVYTCFAGLKAYTLETDV-STRCPVFAKA---SGKYKSEDIDLFTSILKGVVAEELGADYDDVAKEAFEQFLDAVALTVT---- >SRR5690554_6373173 -----------------------LYLSCYDIFMGQSADIGAQLFN-TRMS------------AQHGLLRGGIMWLIMHARGMsDSNIRALGKSHSRdQLYFHPSHYALWLDALMETLYKHVP-EFNLQLELAWRRTLEPSIDKIISMY- >ERR1711879_742838 -----------------------FFEDFYSIFMTKSPDVLNMFAN-TDME------------AQRALLRSGILWLGMHARGMpDTKIRALGESHSKkKDEHQPHVLFHVAGRSDGNAFPPRP-G----LHSRTGANLAPYPTAHVT--- >ERR1719461_1661620 ---------------------IEVGCYTFTQLFSQYPM-MDYLAKFDGLEV-EGVCIGEALRAHADAIGSVVAEIqenAGNPERIRMSLAQAGHRRF-LEGVERAQLDMLGPNMAETViIKDTWevISKQVKSigMESFEKLFSLNSDMSaYLPQ- >ERR550519_213 
---------------------IQVGCDTFTQLFQKYPQVNNYIAEFDDMEV-GGIKVGPALRAHASAVRSVVTEIqenAGNPERIRSSLAAAGHQQL-MAGVERKQLDVLGPVLCHVIRPLVWekGIWSVEVEKSWTHLFDIVACLMKLGY- >tr|A0A173LPQ6|A0A173LPQ6_9ACTN Phenol hydroxylase P5 protein OS=Dietzia timorensis GN=BJL86_2914 PE=4 SV=1 ---------------------PDFRRALEDALNTEAPYLRADLPR--NLD---------GPFA---TFVKLYRFLLTrvedsggdraKVDDVLDLCRELGHDLA-KYNVVEEQYERFGHALNAALARVAGEEWTGELSKVQNQFYVIIARALHK--- >tr|A0A0M3HYR2|A0A0M3HYR2_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=4 SV=1 -PSLTPSQVQTIRKSWKHINtkgLYTVIRRCFQQLECMCPSVSNAFNSA-NNQLSANISTVRTLVEHTKFMLILIDRIVENDQDSIIELRRIGASHVVlkeSFGFGENELEKFGEMLAEAFLKLDGIRQSKETSRAWRLVIASMIDQLRAGF- >tr|A0A1I8CNT8|A0A1I8CNT8_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 PE=4 SV=1 -IGLSNYQQKLILQCWPNIYttgnSSTFATNIYPNLCTRNQKAKALLQK-AD---GVAVFSQSeidCTSMHSKLTLEIIDSVVRNFDSnpisLIGYLNEIGHAHRSlkSIGMPSSMWDDLGDSILEGVRRNDLVRKHKELRRAWLAIIAFLTDNLKQGQ- >tr|A0A0N5AJ93|A0A0N5AJ93_9BILA Uncharacterized protein OS=Syphacia muris OX=451379 PE=4 SV=1 --QLTVAQSVLVRKTWAHARnqgSMEPAMSIFRNSFFKSPDIRALMMA-GS-----KNTGYERLKRHAILFTNVMDKLIAGRvEEidsVIEELKNAGKEHACitreQYACpfRTSLLDQFAAAMIErTLEWGEKKDRTEVTQTAWTKIVLFIMEQMKAGFH >tr|A0A0H5S8S8|A0A0H5S8S8_BRUMA BMA-GLB-3 OS=Brugia malayi OX=6279 GN=Bma-glb-3 PE=4 SV=1 --QLSSYQIHLLQQSWQRLRcSPNFFINVFRTVISKNTIAKELFRKT-SIIDGFTSYKCYDVKEHADSLIELIDFALREIHSsikvVQDRCMLMGAAHCNTCeNSMSSSWDQFGDSLAESIAKAEAIRGKRKCLKAWNALLSFIVDRIKGGY- >tr|A0A0N4XUJ2|A0A0N4XUJ2_NIPBR Globin-like protein 9 (inferred by orthology to a C. 
elegans protein) OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1 -ASLSFSQKQALTTSWRLLRpqAAGFFRKILLELEIVSNTVKQIFYKAQFVDAfNKDEENIATMDAHIKLMVKFFDDILASLDDeteCVERMKRIGSCHAVlvrSCGFSSDIWERLGEISMERICAHEIVQKTREASRAWRVLLACIIDELRCGF- >tr|A0A2A2LCK8|A0A2A2LCK8_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_21707 PE=3 SV=1 -STLSFSQKQALSLSWRALRpqAAALFRKVFLELEIASVKVKQIFYKASLVDAfNRDEENSATMEVHIKLLIKFFDDLIPLLDDekeAVDLIRRIGSTHAIlakSCSFTSDIWERLGEITMERVCTHETLQKTREASRAWRTLLACVIDELRSGF- >tr|A0A261C2G6|A0A261C2G6_9PELO Uncharacterized protein (Fragment) OS=Caenorhabditis latens OX=1503980 GN=FL83_09405 PE=3 SV=1 -ASLTFSQKQALNLSWRLLKpqASACFRKIFLELEIASPKVKQIFYKAALVDAfNKDEDNSATMEVHIKLTTKFFDELLSTLDDeneFVAKIRGIGSAHAIlakGSNFSSDIWERLGEIAMERVCSHEVVTKTREASRAWRTLIAILIDELRGGF- >tr|A0A1Y0I5V1|A0A1Y0I5V1_9GAMM Uncharacterized protein OS=Oleiphilus messinensis GN=OLMES_1782 PE=4 SV=1 -----TQDQRLFWNSFDRCLsspqrDQQFAEDFYQRLYSSDRAIAEIFDR-VSV------------SDQLHAVRQAVYLLQEMTplKQAEITLDKIQAIHH-QheIRLSNAMLDKWLECLLASVELAD-PEFNETVKQAWIDILTPAVHIL----- >tr|A0A1I7TWD1|A0A1I7TWD1_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=4 SV=1 --RLSKIQKRAIRFTWHRLQtrnggkrVENVFEEVFDKLVKNLPNIRDMFST--RMF-LCAMsrGTTSTLRDHSKSCVKMIEAVIKNFDTeKskrtdtgtENDPRVIGRAHSIlkPYGLAGNYWEKFGEVMIDVVLAQEAVRDLPGAGQAWVIFTACLVDQMRAGFD >SRR5439155_18881238 ----------------------PVLQGFQQAVSGFFTEVGRQFPK-NR------------FRQTPRKTQTSFLLVMGNIApgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSTQVEAAWRYTMGAGILFLKA--- >SRR5256885_16048310 -----------------------FFFNDTATTEIYT-LSLHDALP-IY------------FRKQRRMLQTSFYMLVEYIAlgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSRSEERRVGKECRSR----WS--- >tr|A0A1I7ZQR2|A0A1I7ZQR2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1 -IPLTAAQIHLVRTLWRQIFlskgPTVIGSTIFHKFFFKCPKVKEQFRR---CPLPRNFPNHDSFaKAHCKAMSELVDQVIENLENldtMTADLERVGRLHAEVmnGELSTKIWNDIAETFIDCTLEWgDRRCRTETVRKAWALIIAFMIEKIKLG-- >SRR2546427_190033 
--NMTYAELAHFDDSLTRCTrEPRFLERFCALFFASSDEVLQKFSQ-TDV------------QKQRRVLQASLYIQLSASPIvtnGSLIFCNPSVTWSIiQVQRSPAMRTLRthSSCPLVGYPLKA-GQCGVGHVPX----------------- >SRR5213596_3505323 -----------------------FLCVIFGLLRRGPSQVHTD----RLA------------EATEDVTGVVPQILMLEADGkpeGAVHLAPLAALHSQqHLDIPPHLYDLWLDCLIQAVRESD-PQCTPETESVWRRMMANGLAFMKVRYH >SRR3569833_2178475 --------------------HPNNHNTNKKTNKTTTHKKTQKNKN-TK------------NTQQKKKLQMSLNLLISHAMGigiVDGYLHQHAEKHSRhHLNVEPHHYTARLNSHMKAVKQHD-PKYSPALEQAWRTGLGHGIELIKS--- >ERR1719347_979638 -PIVTDEEMASINELWSCLRadAMHSSRFIFARFFEAHPEFLEPMPFVKDYYGniSPKYMDTQEMQDYCLKFMSTLDAVMTRVFArdkeALQVMRDIGYSHH-EFGLTSDMTVKFMNKMHDSVLELWGTEASRRDSKALDNIFKTIATEINVG-- >SRR5437762_8994925 ------PAAS--------------SDHHIPSQLAAGTRAKDRKGG-VEY------------PGHVCRGQRRCARDRPHILAspelCIPRACRTKSA------------AFCAVCENRCCETC-RSPPAKKPETARRSAERTG--------- >SRR5690625_2752079 ------SDYSDVQASYGRCVrNRDFIPGFYQRLLSKDKRIAAIFKR-TNW------------SVQNRALRRGISIALTWAGGskiVDRQLEEMADAHS-RKGrvpVDPVLYVFLREALKIGRASCR-ERVGVTVGDGcvpqdESGAATGG--------- >tr|A0A085LV25|A0A085LV25_9BILA Uncharacterized protein (Fragment) OS=Trichuris suis GN=M513_10305 PE=3 SV=1 --EFTAKEFAIAELTWAKLKvrfNNQVGMEIFRQIFASCPKVKNLFGV-QNRE-DQKALCDQRMARHTAIFQDIIELLIVDLSQrsdsLTQSLITLGAQHWFftQRGFRPEFWVIFGNTLVNLIRSLPLSlSQRYLARRTWIKLIVYLLDCVMFGY- >tr|A0A0N5DS84|A0A0N5DS84_TRIMR Uncharacterized protein OS=Trichuris muris PE=3 SV=1 --EFTPKEFAIAELTWAKLKlrfNNQVGLEIFRQIFASCSQVKGLFGL-QNKE-DHTALGDQRMARHTAIFQDIIELLIVDLSKrsdsLTQSLITLGAQHWFfnQRGFRPEYWVIFGNVLVNLIRSLPLSlSQRYLARRTWVKLIVYLLDCVLFGY- >tr|A0A183BUR6|A0A183BUR6_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 -TGLSAHQIQILQKIWERSPeseISDCARNIMSHLLRSNAQMYQFFDLLGH--SDREIANSPIFARQSANFAVLLDFVLANLLEevqkVCLALQHLGAQHARlRWPIETHHWALFCRCFEDNPPKEV--FLNAEGHDLWKTMINFIIVQMRVGYD >tr|B1KNW6|B1KNW6_SHEWM Uncharacterized protein OS=Shewanella woodyi (strain ATCC 51908 / MS32) OX=392500 GN=Swoo_3305 PE=4 SV=1 
-----------FNDSYDFVLrnEELFFSTFYEIFVSSSPQVKAAFKH-TNM------------AKQNEMVRESFGFIICFFVtKiADEQLVKLAIDHKDKFHVDSELYAVFVNSVLAALEKIYP-KYNNECAVAWRITMAPGIEFMKH--- >tr|A0A176H0Y0|A0A176H0Y0_9GAMM Uncharacterized protein OS=Oleiphilus sp. HI0069 OX=1822245 GN=A3741_11335 PE=4 SV=1 -----------FDDSYDFILsnDSNFFDSFYTHFFNSSNLIKNAFAY-IDM------------DKQKQMLRESIKHLVKFYCtNkESEYLKTIARHHADKVRADEYMYKLFVDSFIQAIEDTYP-NFCEEAALVWRCALKPGIDFMNS--- >tr|A0A090LM85|A0A090LM85_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X000017100 PE=4 SV= --NLTTSQIMSIKKSWKHINtkgLFNVLRRCYQRCQSCCPNVAKVFST-ENIKK-QQNIYSCGVSEHTKYFISLLDRIIDNEPNIEHELRNVGKEHAKlyeEYKLSITDIERLGEIIADVFLKLDGIRQNKETSKSWRILIASIIDEVSVGYE >tr|A0A183CLY2|A0A183CLY2_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 --LLTRTQRVLIENSWKRVKkaavEGGMGAKVFHNVLVAQPDMKLLFGL-EKVP-QGRLKYEGQFRRHAGLLNRTLEYVIKNVQytdKLGQHFRALGKKHCQmngGRAFPTNYWDTFLECILQSVLETDGSisgRYhrCREAALAWRNLVGL---------- >tr|A0A0M4CP70|A0A0M4CP70_SPHS1 Uncharacterized protein OS=Sphingopyxis sp. (strain 113P3) OX=292913 GN=LH20_00550 PE=4 SV=1 ----ERSDAALMEATLAAVAetGIDIRHTLFERFFSAYPERHPAFLNL-DA-------------ASRRMTDETLQILFGLATDegwVWPLVAELVATHR-NYGmLPTDEYDAFIDLAIDELGRAAGRAWTGAHAAAWRRQGEIL--------- >tr|A0A1Y5Q3I5|A0A1Y5Q3I5_9SPHN Uncharacterized protein OS=uncultured Sphingopyxis sp. 
OX=310581 GN=SPPYR_3232 PE=4 SV=1 ----PARDIAAMEASLAAVAdaGVEIRHALFDRFFDAFPDRRASFMIV-DA-------------SSRRMTDETLAMMLGLAKGegwVWPLVAELVFTHR-AYGpLPIAEYDAFIDMTVEELGTAAGAAWSAPAAAAWQRQAEAL--------- >tr|A0A2N3CVZ2|A0A2N3CVZ2_9PROT Uncharacterized protein OS=Alphaproteobacteria bacterium HGW-Alphaproteobacteria-17 OX=2013663 GN=CVT78_05625 PE=4 SV=1 ----SARDAGQMEASLIAVAdaGIDIRHKLFERFFAAYPERRASFISV-DA-------------ASRRMTDETLQMMFGLAKGedwVWPLVAELVFTHR-SYGaLPIAEYDAFIDMTVEELGLAAGAAWSDETAAALQRHAEAL--------- >tr|A0A0D6LRF9|A0A0D6LRF9_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_06233 PE=4 SV=1 --PFFRIDNRLVPDSAVAtDMV-QAQIHSYVYSSLQSTVSREMFQKM---SIVEGFRTNQccDLNMHAKVLCDLFDSIVSDLQQaskiVQARCMDVGGSHV---HMNekccGSLWDQLGECLAEVITKVECVRSKRECTKAWIMLISYVVDGMKCGY- >tr|A0A1I7RN92|A0A1I7RN92_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1 --GLTDDQCEQLATAFSNIPdKYYAFEQMFLNLfMKEDPQLAVVFGF-EGIR-PEELRRMSPFRTHVCKFQRFMTTVLDMLPKknreeeLIQIIRMVGRQHCNvkLLSFTAQKWLSFKNGMLNALAKG---GESHKYYSSWNILISFMISEMKDAY- >tr|A0A183BTK8|A0A183BTK8_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=4 SV=1 --QLDDTECEQLSTVFAAMPdKYHLFEACLRPMpMPeVDPQIALTFGM-ANIA-EIELRRKTPFRYSV--------------QKrgreeeLVQIIRMVGRQHCQvkQLSFTAARWLSFKSALTWTFSRG---EQKDKLHVQWSLLISFLICEIKDAY- >SRR5688572_1577071 ---LARHDWHVLLDRWQRLQpnADRFATAFFDTLFGQQPAFLQIFAS-APL------------DAQFLRFAHLLSEIVSAADDadeLPRCVELVVQRFA-NDDCETDRSRAVRAAINAMLTEVSAAHMTPHMRASWHAAYVAVTAIL----- >SRR5690348_16468503 --------------------ADAAMTYFYAELSSAARATWAdrdIYMS----------------GPDHMIVRT--ARALVErg------------------APSRLIHYDLVDPRVTEGQX------------------------------- >SRR5258708_24656334 --------------------ADAAMTYFYAQLFAMDTEIRAMFPA--AM------------DVQRRRFFEGSAGSPLPsraRpttIASCLTCRNSGPHHM-IAETAP---------------------------------------------- >SRR6185437_6364830 --------------------ADAAMTYFYAQLFAMNTEIR-aVFPP--RP------------GPVKRMSRT--SSGACRrtrRs------------AAR-RPRPRPCHTSAGPAR------------------------------------- >tr|A0A016TZT5|A0A016TZT5_9BILA 
Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0066.g3721 PE=3 SV=1 ----ANKSKKLVIAEWPRLLehEPNLFKIVWSSSAARSTSIKQAFGI-TD---NESPLENESFMKLSPTIQAFFYKLVIsmQLDEdmVRSACEQLGARHVDfiARGFNSNFWDIFLVCMAEAIDATLSSYITDeakraEMILAWQRVFNMIVHHMRTGYN >tr|A0A0R3Q1W4|A0A0R3Q1W4_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=3 SV=1 ----ANRDKKLVIQEWPRLLeqQPHLFQIVWNASSTRSNSIKKAFGI-GD---DESPQENAVFMRLSETIAAFFEKIVItmQLDDdiVRSTCEQLGARHVDfiARGFNSNFWDIFLVCMAETIDETLSSYMTDegkraEMILAWQRVFNMVVHHMRTGYN >ERR550534_360735 ---------ADAKASWANVDTAAFGKAFFKNWMASDPEVKNVFKK-SSFP-----------QGPAQFLVERFDILLGVLDDevaLSQQLMSVAKTHM-DKGVDPEHLVTFQDSFVKTLAGF-DSDWSRERSESWAYVLSHVIT------- >ERR550539_1411929 ---------SLVETSWANVEKEAFGKAFFKNWMAIEPHVDEIFKK-SSFP-----------QGPAQFLVERFDILLDVLEDevaLSNELTVVAKTHM-ERGVEPDDIVTFQDAFLKTLPGF-DSDWTRDRSEAWAYVLSHVIT------- >ERR1719192_2654783 ---------GAQS---APTPPKPVGQTwtkRLSEKLSSEPEVADVFKK-SSFP-----------QGPAQFLVERFDILLDVMDDeasLSKELQVVAKTHM-DKDVSPDDLVTFQDAFLKTLPGF-DSEWTRDRSEAWAYVLSHVIT------- >ERR1719242_19104 ----------------------------------------------------------------------------------------------------------TPLIGMA--AQS-PLSWEQEK-----YVKLgQRWT------- >tr|A0A0C2FEY2|A0A0C2FEY2_9BILA Uncharacterized protein (Fragment) OS=Ancylostoma duodenale GN=ANCDUO_24724 PE=4 SV=1 --SLMPSQVSVIRKSWRHINTKGLITVLSrvfQRFNA----ID-------GQE--YAKVYDMTIYGIIEF-------------------------------------------------------------------------------- >tr|A0A0C2G6K1|A0A0C2G6K1_9BILA Globin OS=Ancylostoma duodenale GN=ANCDUO_17195 PE=4 SV=1 --CLSYKHRKLLRATFQQMNsSGaflKLMEQVFRRLEAKYPDIRSIFLTTAFVNSLSRERSSPPLvrteHDHCKCLVALFEKIMDNLSDdtQLMVIRQYGEKHAQmkESGMSGGMIESFGEIAVAVIASQYSYWIQKPVDDVTrrkgrDEGLVYLNDYEYIIL- >tr|E1NZ07|E1NZ07_CAEEL GLoBin related OS=Caenorhabditis elegans OX=6239 GN=glb-29 PE=4 SV=1 --NLSVKQKKLLRQSFNAMNsGGtflKLMEKIFRRLETKCPDMRSIFLTTAFVNSLSRERQTPPLvkteYDHCKCMVGIFERLIENLENIneqLTMIRHYGEKHAQmaESGFTGAMIEQFGEISVFVIGSQDVVKFNHETVKAWRLLLACVTDEMKVGFD >ERR1719431_1401903 
-----------------QLTtnSIRSGFCGRLCETTRyNPDCtsSNTFSMRfRKR--RKNFHSPMINTEISRRILWRRKRLMTRLFKrdpeATKRIYDVGFHHQ-MMSITEHDMTMLSSSIYSAVQDILGKKASDKDLAAWRHLLGLVSYHFKRG-- >tr|A0A1Q9NTV3|A0A1Q9NTV3_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_3 OX=1841598 GN=hmp PE=4 SV=1 ----TSKEADILTQSLKALEekTDDLPKLFYYHFLEPtsNKEIISLFNK-SDM------------TKQYMMFHQSLAIIVSSIKDshlLNQILKDLVKRHK-NYGVKYAHVQIFSSAFYKTIEEIFPK--DEKVKILWIKLINFVLSKFNE--- >ERR1719238_586270 ----PKEVIAEVRRCWEAFIkasgsKEAASEHLYAALYDAVPSVQHLFVT--PR------------VVQAMRFMTQLQTFITLLDQPkqsKVTMEAIGFAHM-QRDITVELCVLVRDAILDLLQVELGDNLSSSAAAGFKGLLNWM--------- >tr|A0A2A2L6E6|A0A2A2L6E6_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_22934 PE=3 SV=1 --KLTKLQKKALKFTWSRLQtrnggkrVESVFEDVFDRVVRYLPQTREMFNT--RAF-LCAIsrNETSSLRDHARMTVRMIDVAVRNLEVetrkrsdtgSDMDPLLIGIVN-----WRGSRYS---CRIINRI-------------------------------- >tr|A0A2G5VGS5|A0A2G5VGS5_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-26 PE=4 SV=1 ------SERSIKLRKYDYEKddgSK--------KLL---SFYKKVREK-------------FTFKRSGSEMVAVVVSVMQSLDEpdkISKMCQEIGQLHA-KYrrskGMKIDYWDKLGEAITETIREYQGWKIHRESLRAATVLVSYVVDQLRFGY- >tr|A0A1I8EM37|A0A1I8EM37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1 -PSLTSAQIHLIRNIWRQVYitkgPTVIGSTLLHGIYFKSKKIKDQFFR-CPFP--HRFPNrDSFNKAHAKAVGEMLDKIVDNLENlesMSGYLFSIGATHANliRRQVSKEIWNLMAEAFIDCTLDWGdKKGRTEASRKAWAFIISFAIEKIKRG-- >SRR5690606_37396704 ---FSDTDTYILHTGLKWIEeaPETFAAKLYQRLLRDHPECQASLHAI-GL------------ESFNRNFIHFLKMVKEELLErhtIHVAPREFLALHALpvEKVRHSNYVIKMGRTFLDIFAELAEDAWSPALESTWNKAIEEVK-------- >GraSoiStandDraft_42_1057292.scaffolds.fasta_scaffold716659_1 # 2 # 607 # -1 # ID=716659_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.685 ------TEIQILENGLRWIKesQDRFGDKFYHRLLREHPEVNPLLQSI-DP------------WSFNKDFVQSVDAIIGEIRAqgdVISPLKDFWPELSStaMTPLKPSELIKVAETFLDLISELAEDAWSPALEYVWRKAIKTVM-------- >SRR5215207_8455447 
--------------DFDTVVCSSFAERFYSRLFTHEGGehLRALFPDN--I------------QPQHAQFTTMLGDILAYNFRigRSLLGD-TFRKHI-DFNIRESDVDVFRKAFVEEVGSTFLH--LG---------------------- >ERR1711972_144950 ---------SQVLQSWEQVKllgLESVGEMLRANTFELDPQVVALFRIPGVVSTGEGMLQRMALRRLFSKVLRFVGSVVAGRYDyqrLVETLSR-----------------------LGATRAAGGATEVHFKI------------------- >tr|A0A238BIH0|A0A238BIH0_9BILA Globin OS=Onchocerca flexuosa OX=387005 GN=X798_07861 PE=3 SV=1 --------LFTLKNYWKTVRrnERDCAKMMLAKYLKQNPDNKEKYPKLKNIDVntVDVATANSGFETVAANYLKVFDDVITTVEEkpgdvsdACSRLTAVGKMHRTkVNGMDGSEFQLLEEPFLYMISEILQDRYNDKAENLFRKFYQFCLKYILEGFN >SRR5215467_3799544 ----------QVSESYWRCCtNPLFIEELYQTLFSKCGEIKQLFEQ-KNVS----------MKRQYAMLRYALDIFVDYPHDMTATFPDIARKHT---GLDPRFYETFIEALIETVGKCDPK-WVPSLEHAWRERMT----------- >OlaalgELextract3_1021956.scaffolds.fasta_scaffold865191_2 # 285 # 404 # 1 # ID=865191_2;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.492 -----RHEWHVLLERWQKLQpnADRFATVFFDTLFAADPELRQFFGG-ASL------------EAQFLRFAHLMTEIVSAAGDpeeldhrVEVVVQRFARDDS-A----TDQSRAMKLAIAAMLEEVAASDMTRQMRADWKAAYAAVGAM------ >ERR1712159_177610 ---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNITFFNRA-HFTS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLRRISNKHV-GFGIKPQHYPIFFENLFVAFKEVLGTKATPELISSWKELVSLVQ-------- >ERR1712159_799488 ---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNVPFFNRA-HFAS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLEESPTNML-DSESNHNTTRSSS-----------KTCSLPSKKS------------------ >SoimicmetaTmtLAA_FD_contig_31_10253239_length_247_multi_1_in_0_out_0_1 # 3 # 245 # -1 # ID=589621_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.671 --GLSEYERGLVVNSWKALTkpdfspldGTSSLSNFYDAVWTKWLKIDEF---------ANKMFRSRGFKGRVQHLLRIMGVIIKCAEDPlrgLEQLRSIGVQHC-IWGINSQSFASLALSIIHGLDQANGKEINAELKELWLAL------------- >tr|A0A1V9ZGT6|A0A1V9ZGT6_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_12918 PE=3 SV=1 
-PVLTPTNVDICRRTWDLIQtagtdkMRqygkpgiiLFYDEFFYRIFERDTTIREVFPKV---------------QQRAEVLIKAINFILSTRAGtpasvmeTVNACRFLGHKHRAFAKVRPHHFAVYTNTCIEVIMYWLGEFGSHEVGTAWSHTVGFILRHILEAF- >tr|A0A1I7XNU2|A0A1I7XNU2_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1 ---------NTT------DSglqlEGIVVQNCFIYILSKYKHLRPIWQFGKKIEDneenwTLALYEDFYFRHHCASIQAGLTMIMENKDDpesIKKLLNEIGAHHF-FYDACEPHLELLDQ----------------------------VKGHVSDG-- >tr|A0A2A6BP14|A0A2A6BP14_PRIPA Glb-18 (Fragment) OS=Pristionchus pacificus GN=PRIPAC_48995 PE=4 SV=1 ---STPEDKKLMEKTWSEEFdvLLTLGSDIYNYIFKNMSACKRLFPWIIKYEdEGVDWKKTTEFKDQALKFVQVIDTVVWGIIDgdkSEPFLYDVGQRHVQyaSRGFKASYWDVFLDAMQYAQDQRIPKmnnlnaQEKQRAKQIWHDVAAYIIKHMKSGF- >UPI0002C4E217 status=active --------------------------DFGTAFFEYCPDLKGQFPS--NYA------------L----VTKMIQKFINNViegKNLERLARHYGRTHW-RYDLEERHFLGFAEALADTINIRIGNFGTIELMKIWREEATMICKMLEDQY- >SRR5215831_15107384 ----------------------LFFSKFYTNLFGRADDIEDRFKEL-DM------------ERQYRILNLAIHKLLEFRPEqpaTQKQLRDLSLRHA-KLGLTNHAPAWNR-IH-LDLRGIGA--DGRSsGVAAADKALAX---------- >tr|A0A085LU76|A0A085LU76_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_10599 PE=3 SV=1 --NLTTHQKQLLVQSWPKVQtynRIHGGDAIFARFCEKNSIGRIFQETFQKiavvQSFAINEASESVLKKHEQYLLQLLTQAVENLNNdrepLLRECLAYGAQHI-TLQelLNETVWDQLTEAIIERIHMVSFVRRHRNLSKAWTMLITLLVEKIREGY- >tr|A0A2E0SMS8|A0A2E0SMS8_9PLAN Uncharacterized protein OS=Planctomyces sp. 
OX=37635 GN=CMJ46_12130 PE=4 SV=1 MSQISERQYHLIHDSYRRCMlADDFLVMFHRNFMEKSPQIPKFFAD-HTL------------QQQHRILAKSVARLVSFVDGkpqaeqdMRDTMRI---LHDGNLRLTPEHYAFWATALMETICTI-DEACNDEVAVAWEQTISYGTGVLK---- >tr|A0A0B2VQV3|A0A0B2VQV3_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_12261 PE=4 SV=1 --NFNKRERVCLRETFQKLAdPkELIGAIFVDIVNDIAPELKKVFGV--DRAPKAAMLKMPKLGGHVARFTDLIDQLTNMVGyteNVlgaWQLVRKTGRAHT-KQYFletnqsarGTNYFALVANTFILEFTPYLTGekeepnvdekkkvrfasTYTStMISDVWARFFKVITAQLTDAF- >tr|A0A1I7YWT2|A0A1I7YWT2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=4 SV=1 --SFTKKERICLRETYQRLQdPkEIIGRIFLDIVNDVAPEVKKVFGV--ERVPRPNMLKMPKLGGHVARVNDIFDQTTSMLGyteNVlgaWQLIRKTGRAHT-KQQFllenlnqlEKNYFQVVIDYFQEQFLPYLTGekegqerkkvrfaqNYTTiLIEDVWKRFFSILIAQMTDSF- >SRR5512138_1182700 --------HRRVQGSYSTFQatdrADRLYRTFYANLFASVPEARRMFAH-TDWS------------RQYNAINEALKLLLDFDADpqraadAAKQIGSVALKHQ-QYGLGERELRAFEGALLHALRSC-G-ECKPATLEDWRMILAPGFHHMRG--- >SanBayMetagenome_1026888.scaffolds.fasta_scaffold228792_1 # 28 # 387 # -1 # ID=228792_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.353 ----EPNQRALAKASYRTWIepDTRFFEDFYRRFFATTAAKrahsVHKFK---DR------------KEQHDKLRNGMAAVLNFYpGNEPTSLRYVIDVHR-RKKVTEPELKQFSATFLELVSERLNRKLtgtgsaarRKEIMDAWTALFDQVLKHFRE--- >tr|A0A0V1CBX7|A0A0V1CBX7_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16916 PE=3 SV=1 --ELNDNDRQAIRQTWQKIGdHTLWAQRLFAKILVACPAFSKATSF-HSL-AGKHLLNDAKFRSFCQRFADFWQNLVQLLCvsdDpadwqqAVDSIRGLGQRHSLNRKVTfeAPIWLMIKNEIVLSITGY-SDICRSKDCLSWNKLLMFTVAEMKSAF- >SRR5262249_4116633 ---------------------TKFFRSFYEILRE-SPEIHDMFTSP--FS----------VAKQAQKLNNAMEKILNFRTYMnTSSIGREVQRHR-KLNIKPEHYGPFRDDFVKALKKAkIDDGYS---EDAWCAVLDPALDYMRT--- >ERR1719347_1935341 -TGLSQNEVTLIWSHWESLKphKRRLAKRILKVYIKEHPRARELFPNWVDIP-TVELVKLTSFSRKAVDTWEAFSRAWECIDDaplCRKVCYAFGKKHI-ECnarikghgQIDEHHVKNFIRIFLRIILVSAR----EGSEEAWRKATEFFSINFVRG-- >ERR1712142_116161 
-THLSQNEITIIWSHWESLKphKLKLAKKILKVYLKEHPKARELFPpHWKGIS-MADLVKLHSFRRKANDTWEAFTRVWECIDDpklCQRVCFTFGKKHV-EWnarlrqtrgQIDEHHLKNFMHCFSKTVLDNSR----AGSSEAWRKATDYFSLHFLRG-- >ERR1719313_2808357 --SLSDATHELLQKTWQAAKPegpg--LGEAWYEELRsdtSYVDDLGVILNF--PV-------------CRPENVSRVVQALLDLLPRecqetpepglmlpvprFTKLLLAAATLAQ----------------------------------------------------- >tr|A0A0D6M2N5|A0A0D6M2N5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=ANCCEY_04360 PE=4 SV=1 --NITPFEIRYLKYSWEKASsTMDIGCELVARLLNDN---RTRFRALIEshsgdLLgsanfAAEDVKKFRRARSVAHGVVMFFNQVISELDEpnsadfIAVISQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALQVKkttsfacgktisMsDKKAREVWYKVIQFVIQNMKRGF- >tr|A0A1I7W801|A0A1I7W801_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1 --NISSQEIQYLKYSWERASsASDIGCELVARLLNDN---RTRFRALIEshsghLLgssnfTADDVKKFKRARAVASGVVMFFNQVISKLDEpdaadkISLLSQSLGASHF-RMKvwFQAENWLCVKNCLLDAIMTALRKNggssllcgkrhmHnIKRATDVWYKVIQFVIQNMKRGF- >tr|A0A0K0DKR1|A0A0K0DKR1_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1 --LLSTLVANNLQIYFSRANnATDVGCELVAGLLNDN---RTRFRALIEshsndWLgsatfTAEDVKKFKRAHSVANGVVMFFNQVISKLDEedaverIALQSQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALMTKpfmvcgksitMnQKKSREIWYKVIQFVIQNMKKGF- >tr|A0A1I8BDP5|A0A1I8BDP5_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=4 SV=1 ----MRYTNYLSKIVLARTLnQVDIGNEIVIHLLNDK---RSLFKNLLEqsspyEKeikniyDKKSLSkYSPRSLEISNGVTKFFKNLSLLLnqkgmEIeekedkLVEICKNNGKMHY-QMKvwFQAENWICLENSVIETIIKGNNLEkenFeSNQTIIVWSKLMQAIIGWMKQGF- >tr|A0A158P8J3|A0A158P8J3_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1 --NLRKEQVRALRMTWTRLCepprsnckgIVNLVERVWEKLDRKDSSVRNIFYNAAFvetMHDRCERRrskgSIATLRDHTHFFVSLVSQVIQSLDLnpenILNHVDTIgKSNHAylKQYGFRSQHWEKIGEYFVDVVVIQDCVRGFPEACRAWTILVAALVDRLRAAP- >SRR5262245_41417288 ------------RASYPRCMaSGNLHARIYEAFFAACPEAKPLFDN-TDL------KRQYQLLHQAIVLMLAFH---VSPNrEEPTILSRVAARHS-ELGVhiPPAWFDAFSAAIQQSLEAA-DTQFSDKTREAWAAVLADGIGYMQ---- >ERR1711884_327085 
--------------------------------------------------------------------------------------------------SNESFSvIFKHLAFIKYL-HItktglFDELFGQHVCRIRRiLPFKLIIRL-SSNF- >ERR1719471_2433215 -----------------------------------------------------------------KGIMKVVSKVLCHLNDlsrVEDYLRVVGRLHD-SAGVEIAYLSVTGDAFCTSLKRLgtHADIWNDEVKQTWNAFFRVVVDLMSAGY- >SRR6266436_7042579 -----------------------------------------------------------------------------------------------RVFITAqysCRYHSFSATFYVMAGdkerwkVYM-SHQQMSLhARSKDGLYSRRttQGY------ >SRR5437870_11165056 ----------------------------------------------------------------------------------------------------AqysCLNHILSATFYVMAGdkerlkVYM-SHQQMSLhARSKDGLYSRRttQGY------ >SRR4051812_43285676 -------EVEVARDSYKRILddevkEEKFFRSFYQRFFRKCPDAAKEFAA-KEFPRRVAlsGRggnaREGKWPRQYRLIKQAVVLLltFKLLDDteGLTILTDIADKHE-RYP--QEFYDSFRDALIDTVISLDKDsgsgLQRYELRDAWEKSIQPGIDYIMN--- >SRR5262249_5830581 -------DVEVARDSYRRILddverQREFFHTFYGLFLRRCPEAAAVFEA-KGYPALAQlgGPrvedSAGRGPQPPNPLKSAIVMLiaFNILGEkeEPTILDNLVDKHK-GFP--KRYYVAFQDALLETVVQFDDPsrcgMPPDELQHAWKQAIQPGGDYLID--- >tr|A0A2T7PRA6|A0A2T7PRA6_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_02930 PE=4 SV=1 ---FEPHDKTIVAESWKLLRsiFPDLIESAFVEMCRRVPRLKLQFGNV-DVDDD--EERHMNFLKHVWDVSFFFDQLLLYLPfksKLEECSFHIGLVHA-SVEVPAWYVDLFLVEFIRAAQETVQLEWTPAMENAWAVFLRYLCYYMKDA-- >tr|A0A2A6C3W4|A0A2A6C3W4_PRIPA Glb-17 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_39254 PE=3 SV=1 -MELTDEEVAAVRNVWIRAKTEDIGKKILQTLIEKRPKFAEYFGILCQSDklDMNSLKESKEFHLQAHRIQNFLDTAVGSLGYcpvtsIYDMAHRIGQIHF-YRGVNfgADNWLVFKRVTVDQVTKGvtstqasqanlLegtkepevveqhpmadvQNPFSGEnclARLGWNKLMTVIVREMKRGF- >tr|S9VAV3|S9VAV3_9TRYP Uncharacterized protein OS=Angomonas deanei GN=AGDE_12480 PE=4 SV=1 -------------AAWSHLLtspnGGEFCSTLYEKLCQNLTYIPDYIRNLKD---------EE---RVIDHYINVITKTLELYENphvMIDELPKIAARHR-GFGVSSDAFFVMRNIFMELLPEYMDPKVYEQSKKDWLKFWRLVLDLMVSGS- >ERR1719354_143580 
------------------------------------------------------------------AFWDILDHICGHLDRlenLIPQLRDFALQCF-NSGLFSDDYNILGECLVTILSTNFD-PWEETHSDSWAWCLDLVMSTLVT--- >tr|A0A1I3QX19|A0A1I3QX19_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter neptunius OX=588602 GN=SAMN04487991_1987 PE=4 SV=1 ----DEQMIALVKASLKELQphAGAVFATFQSKLAQRAPELAYRYDEV-DP------------ERQGELLFEKLAIAlggVRFLDRLVPALGGVGLDAG-SASLTSCDFARLSEVLIAAFAEVSGNRFDPCIGAAWTTLFEELSWHMFE--- >SRR3954469_11252496 ------------------------------------------------------------------------------------DGGAIRRHHV-RSGIGGPDYGRFGDAIPAVMVDVGGNDLPKPIGGSWGDAFWAVIGRTKQR-- >tr|E0VF27|E0VF27_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236389 PE=3 SV=1 -----------VLNDWPKIRknYKKIFIDSFINYFAENPNYKLLFPSFSNVS-EDDLPFNHCFRLHCFAVYKAINFLMSNWlGEyeedDSKILPVIGKTHF-DRGITLEMMNLYKHSIVYSCNNHLKPNL--KRKLSWQTVFDHIFDY------ >ERR1719461_240742 -----------AVASWNNIDdKTAFGKAFFSNWLESNPRIKDVFAQ-SSFK-----------QGPAQFLVERFDILLGVIEDeeqLAEELYQVAKTHK-KVGVDQSDLYSFQASFMKLFLPS-TLItaqrsqtlgltpFLtssSLLWSRWQLSLPV---------- >ERR1712165_596852 ----------------------------RLFLPSTLTSLQRLETH-----------------GLTPF---------------------------------------------------------------SHVITAP---------- >SRR5580704_4499342 ------------------------LGDFYRRLLQHHPQLAAYFEGV-NI------------DFQVQKLVVVLSTIARDLPDrsvLDRVLFHQGVAHV-ERGIGRGEFNEFIALLANVVSCKTTLVGAAESYAVWYQELSAVATSML---- >tr|A0A0G4HY87|A0A0G4HY87_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_33490 PE=3 SV=1 ------NRIHLLQSSLAACLkmstkEEFVGRLMYDTLMRTLPEPGIIAKR--GR------------TMMSRAFNDtvaALVAFVSEPSHMETYMDWLALRHV-HYKIDTTLFPQFRQAMLVSLEQVMADQWNAEIERAWSEAYEMTSQALQ---- >SRR4051794_14672716 --------------------SPAFAESFYTHLCR-SDAVRDLFVTAHRKRVPAALnrQESpaIPDETQRRKLVDGLKAVLNFRPGcSPSSIDSVAARHV-DLHLTTDHFDVFEKSFLETLEQHVTRSEdreeMEEITHAWEKLFATVRDEMLD--- >ERR1740139_220892 
-------TRAALLKSWEMVQeaGTvPAANLLMKHLRERDAEALRVNTSH-ARP-KTGETEEDAVRKLAVRTVQILGSAATGMSDtvsLVQHLHKVGAGFA-GTGIKEGYFAMVRDASPFALRELLGDRFTADIASACRITGPFLASLIIAGLR >ERR1712194_173361 -------TRAVLLKSWEVLAevGTaTAANVLTKHMRELDAEALRSYTSQ-AQP-KDGETEDDVVQKLAVRTVQMFGTAvtA---NDtasLIQHLHKVGAGFA-GTGIEEGYFSLVDKASPLALRELMGDRYTADIASACSMTGDFLTSFVREGFR >ERR1719446_598571 --------------------KKAYGLNAFNRFFCKAATIGNSFQHI-QC-------------ASVCSgnarSPAVSGYLQGAYTlgeCGHLTWPQTHHVQH-FYRLLX---------------------------------------------- >ERR1719240_1501566 ------------------------------------------------------------------------------------------------VQHFYRILRLLLEACCEELADWVKD---PAAVEGVEWALTQIAAIMI---- >ERR1719235_1367256 ---LPGVTVEFLRSSLARISEDEFGDMFVQKLRETGDmlsegTIEGVLNT--PI-------------VRPTNLRKMIVYAL----------------------------------------------------------------------- >SRR3989338_2963815 ---------TPLYHLYKENVppqkERELGLLFYKLLFDSNPELLDFFANV-DLD------------HLSDHLVQTIRLFLESRnslVSLVPAMKALGIIHQ-RAMIPSWAFPLVIENMAKLFSILLGDRFTVELASALVLSFDLLTSF------ >SRR3990167_6716616 ---------NPIYStlknIWlETVStpeiKSAVGELFYKNLFQYHPELLEYFNNV-DMD------------SLALHLSQALDFVFQSInkiGDYksqwRTVLEHLGEVHR-AALIPTWGYPIIGQQILKIFPYNEKAGFSTKQL--etaLATLYREIVII------ >SRR5436309_231744 -------------------------------------EIGQLFEG-RKVT----------MEDQYRKLDRAMFSILSFNRRlKATTLDPQVASHS-EFGLKREYFQFFREAFLAALRETQAS--DDYSREAWSALLNPALAYMSD--- >ERR1719183_3286062 --------AISLRDSWVHIEvlkeeddSGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFSTLVHAMGDpqkFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDSMVRNFR >ERR1719183_785787 --------AISLRDSWVHIEvlkeeddTGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFAVLVQSMADpakFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDCMVRNFR >tr|A0A0N4UGY4|A0A0N4UGY4_DRAME Uncharacterized protein OS=Dracunculus medinensis OX=318479 PE=4 SV=1 
--RLSDKQKLWIKLGYKKWRsksKMVPGEWVHAYAIKKYPTMKALFKK--HEN---------LARVYTQTITKIIEMAVESVdslDDsLGPLLISYASENgileERgmasiftirndklllfLEGFDRRFWGYVAEALCALSRDFPLKRHKWDTISAWRIIVLFIVKKLEYGF- >tr|A0A2A6D1B3|A0A2A6D1B3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_35146 PE=4 SV=1 --TLNHQQRKLIKNGYDSWRkksCISSGRWVHSFVSSKDDRLKEIMEG--NEE---------TTRIHEETITHLLDMAVESLeslDDsLGPLLISYTGPQgvfeEK-DGFDRLYWSRVSEGMCQLARNFPSKANKYETVCAWRIVVLFICNKIELGF- >tr|A0A2A6B4U3|A0A2A6B4U3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_54703 PE=3 SV=1 --GLTKDKTDLMANLWPSHYgtLYDMGIAAWDKLFAHNPGLKKHFGF-AENDPSSSWKNDERIKKMVLSLQQLLTEAVNTLGfgDtealtsFVNNLRELGGLHRAiADGVNPDAFTLLFAILPEVIVDVTSnrskdgplsSENRSELLAIWRAITRFMANQVMTGW- >SRR5687767_14811217 --------------------SREFMSRFYRRLFAARPELRSQFKNV---------------TTQHDMLAEAIRDLVLFRpGDQEARFLDYVETHR-RMNITVHDIEAFRLAFVAEVIATSMQngnAQARSHGDAWNAALKLGLGVMAK--- >SRR4029453_11133516 -------------------------HLIILKLQRIAMQGAflSVIPAtgFSEH----------FITNSCEFLPK---PQSSSREKalgenEPNILSRIAEMHNKnNYNISPESYKAFVSALTATICGSAPEipePFAPqckisvneknLIKNAWQKALKPGIDYMIMRYS >SRR5262245_37180117 --------INKVHESLKRCRlQPGFFRDFYQQLVKNDAIQ-AIFTKrgLDVL----------KSDKQQWLLREGLDLLISYADEpkspGLHVLSRVAESHSI-YRVGIEMYDGFLEALLVTVRRHDLEfqdP---skddskVIEAAWRRALKPGLDYLKSQRP >SRR5262245_45185474 ---------------------PTFLEAFYKLFTA-DEVVGKRF--vkFDDI----------EWKRQHGLLQQALDACFDFASLlsmqnlrelpEPNAMTKYVVRHGPgrgNLGITSTEYDAFVEALITTVCGNPGNgqaPYDPecadaerkdVIEFAWRRLMKLIVEHFKKVAR >GraSoiStandDraft_39_1057311.scaffolds.fasta_scaffold195098_2 # 276 # 1100 # -1 # ID=195098_2;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.692 ----SFDVFEIAKDSFNRCMgadgGALFFKTFYERLLSKLPVP-yaRQLSQkgVGTS----------SSHRQYDMLRQGIFILLQFGQHklyerEPNILSTVAVLHDQhHHNIPPNLYAAFTGALIDTVAGAPPAiptAFDKqcetdmdIITDAWEKALAPGIRYMTEKYF >tr|M1PA46|M1PA46_9CORY Flavohemoprotein OS=Corynebacterium halotolerans YIM 70093 = DSM 44683 GN=A605_12675 PE=4 SV=1 
--------------------SGEFRDEVHRRFYLDVLEARQVFPL--TLR------------ETHVDLASSLAWVLERtssdgtLPDdVLARIRRLGVDHR-RHGFPAEVYPAFLTALRGGLRTVTAEHggVDDPLVDAAGDVFARVCGAMADA-- >tr|A0A097IIH9|A0A097IIH9_9CORY 2-polyprenylphenol hydroxylase OS=Corynebacterium doosanense CAU 212 = DSM 45436 GN=CDOO_12240 PE=4 SV=1 --------------------SEKFRDLVHEQLFSTELQSRQVFPS--SRA------------RSHLDLAPALAWVLERstidarVPDeVMRTARRLGLSHR-RHGFPSEIYTPFADMLVHALREVNFRAdpqLSAGLIIPAETIIRNVCNAMRAS-- >tr|A0A0G3HGP7|A0A0G3HGP7_9CORY Uncharacterized protein OS=Corynebacterium uterequi GN=CUTER_09860 PE=4 SV=1 --------------------PDEFRSRTLTGFFAAEFQARQLFGL--HAT------------QAHDGLPEVIAWALERcgidghVPSeVLDRLQRLALVNR-RFGFAPSAYSSYAEAITTALKDLAYVHfgeVNIlpSQMFAATLALDTCARYMQRA-- >tr|K0YDT0|K0YDT0_9CORY Uncharacterized protein OS=Turicella otitidis ATCC 51513 GN=HMPREF9719_01398 PE=4 SV=1 --------------------RTAFRDATVDYLLRRLPRLRRVAPL--RQR------------HRAEALAERAVGLVARspqgmLRGeDAADLERAGRANR-RLGVPLRVYPVLAQALKAGLRAAFEAAgepYTA-AARDAEALAEAACASLARG-- >SRR6478735_8357209 -----------------------REIAFLVARGLPsKEIAEQLFLSVR---------------TVQNHLQR----IFTKLG-VTSRGEVAGVLQG-LEGPSSX--------------------------------------------- >ERR1712130_811490 ----------------------------------------------EAAlagmKAVEDLGGKFDRTKHGSLFLSVvLTRVVPHLDQrdrVLPYLVELGALHQ-REELQDITLICWVLHIalPSGVWSRVeecVGGYC--TRQPRLGLVWSLPS------- >SRR5436309_12080688 ------------------------MHRFHAHLEQLNPRLRYHLPP--ALL------------RYVrFELLQAVRQQT--PMEVGSGLRRFGVHLR-AQGFEGPDLDTLGAAWLVALDEVLGDRFDSEAREQWLRFYKVLRSA------ >tr|A0A0N4Y9E2|A0A0N4Y9E2_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1 ----------RIQHSFKTASfhltvnqlrsRPTIGDAILKRAISNRPEMRTFLNRLTE----------QQVEHMGKQFYSLIAVSVENIERpeavryfs-RLPFFAMFETYATlcQLGFRPDYFAPLADAAIAECVKLDGGaHKRCETLLAWSQLISAIFTSVRDGY- >tr|A0A183LHE9|A0A183LHE9_9TREM Uncharacterized protein OS=Schistosoma margrebowiei PE=3 SV=1 
--------------------KIKVGKEIFRQLLIKNPHYMKMYKPLQSVT-LPQALNLDYLTKMAICYVDNIMKIVRNFNEeekLQETVKYLAAIHT-NRGLTVAHFVSILPIFTDTIVSYME--------------------------- >tr|A0A183WH41|A0A183WH41_TRIRE Uncharacterized protein OS=Trichobilharzia regenti PE=4 SV=1 -----------------------------------------MYKPIQSVT-LPQALNSDYLTTMAIRYVDSIVDIVENFNDeenLQQKIKYLAGKHT-NCGLTVAHFVVSLQILCICVHIWQT--------------------------- >ERR1700755_1321676 ------------------------------------------------------LN-SKG-HRQRDELLNALVSILSKYDPdrpdsqpmieLEADAMGWGRRHASfaalggrPA--GPDQYRVVRDVLWQLLIDASDGRWDAGHTEALVDAYHWVQTIMMW--- >tr|A0A0V1KYG9|A0A0V1KYG9_9BILA Uncharacterized protein OS=Trichinella nativa GN=T02_16304 PE=4 SV=1 --SLSAGELKLLRWLWKQMKqvhQGLASAKLFQIIFATCPEIKRFFGL-AKDT-IDMIINSLSYDNE----------------QLAQLMIAFGCQHSFytRRNFDPKYWNVFGDAMLHLVDDLPLKAFKrYRAKSIWFRFVYFVISHMQLGY- >tr|A0A1I7VKJ4|A0A1I7VKJ4_LOALO Uncharacterized protein OS=Loa loa OX=7209 PE=4 SV=1 ---------------------------------------------------------------------NALKKIIESLKNeqiPYEVLQRISVKHA-RHNIQTHHIQKMIKPLVENVRRALGR-QDENAERAWETLFQTIAII------ >SRR4051812_9951159 MTPLPPEVAQTIRSSCRPLLerQEQFHGDFHASLVDLMPEVPMMREP--A------------GEQVSRWLVECVLWAVNADEPvpmIGATLQGVGLDAH-RLGFPRAGYQAVGHALLRTVRGASQNDWSGTLSSSWIGYHSWLCEYWVSG-- >ERR1711890_22380 -MHLSDTEKSAVVSSWSNVN-SSLLDSVLLQLVQENADMRAAMSR-GDLA-EDSIREQETFKADVTKLTCCITKLVTRLGNTGEVSSCPatCLKNC-P-YLQPKHVPLFISSFCD------KLELTEDAKKGWKFIMEKTAERI----- >ERR1712018_299478 ----------------SDVA-ENHLEDVLLQLVRENSELRSSFSW-GNLP-EDCLRDDDKFKEDVKRLNTCISKVVDILSSSGDApLACPvsSFTSC-P-YLKSVDMPLFIKCFNS------GNKFSENAKSGWTAIFEMAGKKM----- >SRR5262249_47865225 ---MNHRQVELVRSSYERIRrvRHLFADLFNRRLTLIAPVLERLLPP--ET------------ARRDAAALELVEFVVAGLDRLDVLLPALAVQARVwrLKGVEAADYDVAGMALAWTVEQVLV--------------------------- >SRR5215470_9720857 ----------EAKRSYRQFArDISFYRELSKRLFRKIPGIEKKFRH-RTM------------EEQYKVLRDSLWLLLSYASapdQqEPTILSRIAHTYA-R--FPKEWFDTFREVILDVVAQRDP-----SSVRAWKHAMAPGLEYL----- >ERR1719487_1476365 
-------YKTILDRCYERMTtqldLVAMVTLFQGIFFGRDIRIQSYFSKP-N-------------ATLRYVVLRIINFLVNVYHkpaAITGELRALGVSHV-KWEIPPDLFVPLGEALFITLEICLGG-------------------------- >ERR1719271_344116 -----------------------IRKDIYSTFFTQAPAGQDYFKQS-N----------TYLHVVADKIMVMTLELYQNPVKMVDDISALGLRHV-GYAIPTELFGPFVSACVEVLMTRTSD---EATIESFRWSLGLTSKML----- >LSQX01.3.fsa_nt_gb|LSQX01333836.1|_8 # 4697 # 5665 # -1 # ID=41498_8;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.475 -----------------------LRQEFFLNFFKLAPSGQDFFKQS-L----------TRLYFIADKIIELCLEIYRQPRAMVEDISGLGLRHV-GYAIPPELFGPFVGSAVEMFSLATTN---ETAIDGFKWAMQLVSKIL----- >SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold703673_1 # 2 # 517 # 1 # ID=703673_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.653 -----------------------SSSIIVSSFMRDssrPCRRVRTIKQS-N----------TRLHFIAESATNMSLKLLQDPWRMVDDVSALGLRHV-GYGIPTEMFGPFTEAAVDALRGHVDE---TLALEAFNWSLSIISQML----- >tr|A0A1I7RTA6|A0A1I7RTA6_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1 -TGMTRHHKMILQKIWMRASeadINECSRNMMSHLLRSNQQLYQMFNLV-GMT-DKEIQQSIPFNRQAANFAMVFDFVITNLTDdlnrVAFALEFLGQHHA-DLGFTIdqPFWALFNRVFEDNPPKLV--FQNPEGHQVWKLMVNFVVRQVKNGY- >tr|E3MDQ4|E3MDQ4_CAERE CRE-GLB-31 protein OS=Caenorhabditis remanei OX=31234 GN=Cre-glb-31 PE=4 SV=1 -------DVERIRAVWMDhINgNDDYFQEVIHRICKRNDGIRCAMLTQnAQHA-ESAAEEDFVLSNIADRISQFFHQLIEddvllNTVELKKCCYDLGRQHS-AYSkkqFKISFWEEFTLTMMDVLEQNYP-QTTKEEQKAWLHFQRFVNENMLDGY- >tr|A0A0B2VIR8|A0A0B2VIR8_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_08540 PE=4 SV=1 ---QTSTRIALLQSSWTSVQtmtSGQFGARIVYSMLRKDPSLFDVFTTVqydgeetplrqtsgliarfynfGSIPdktppnngEetplrqtsgliarkSFDLLTCPQYYEVGDRIMNFMGELIQMMQDgqseqaIIERIRLVGATHY-ERNVmfSSCVWREFKASTLAIVGESTFEseSIRVETLKAWSSFVSLIIREMKNG-- >tr|A0A1Y5SIU2|A0A1Y5SIU2_9RHOB Uncharacterized protein OS=Roseisalinus antarcticus OX=254357 GN=ROA7023_01630 PE=4 SV=1 
-------QAELVADSLSRVGdkVIWLASDYYEALFDASPQLHGVLPH--QM------------SEQTNMLGHALAHALANLRDpdgAAPMAQDAGLADR-SARMPPRMRRTIVRTLVHALSLWHGPTWTKDHARAWNEGLLGVA-------- >tr|A0A0N5CYF2|A0A0N5CYF2_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1 --ALSTVQRQIVKECMDKA-KDDIAERIYRRIFERRSDFRKFILA---LPD-------KQRWALTDSLHNYLKSAVNQIKDgsaVRKISEDFGAFHVQyrSFGFRPDFFVSTADAVTTEFVLLDAaVHQASDTLCAWSTLTGFMFSSVRDGY- >tr|A0A0K6SA08|A0A0K6SA08_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_8920.t1.CR2 PE=3 SV=1 ---------------------AAMAEKFFELVPKRAPNLRMIFEKRQDI-----------YKHHFGEI---TKRLLAYLDSpeeVWKEDPELAIKHI-EFGVMPCDVPVFANVFLQILAELAGPAWTQRHRDTWDKLFSIVSGALA---- >tr|A0A0G4H7J1|A0A0G4H7J1_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_24983 PE=3 SV=1 ---------------------AVFSREFFKRLSTFAPSVHAVFVKSEEK-----------YTRTIKDL---LGRLLAYIDDpsaIWSDDEELAMRHV-IFGVMPTDIPLYNRVMVQTMAGIAGGEWNLQHDAVWTKMMGLATETLS---- >SRR5215468_7630418 ----SPEVMRVIRFSAGLLAelQDMFVRQLHSEVTALIPGLAA------NG------------RIFCERMVRSLLWAATAgqpPHAAAGALRQVGAANR-RDGFPEERYADVARALVLALRNVSGSSWDNSIGSAWISYFRWAEPHLRAG-- >SRR5215469_6664897 ----APAAGRVGCQSAIRLSrnQDAFIRQLYDDFKELDPDSaqtqAP------DL------------LVFCERMVRALLWVALTdqpLRVVADELRQVGAQNW-YES------------------------------------------------- >SRR4051812_31756681 ----APSVMRLLASCTADLGpqQPELAEALYQRLLELLPEVatlAE------RG------------RPLSDRILHAVLYPTEPgrtPLNVATVVQQVGAQNY-LDGLVGEHYSSVTHAVLHAAREMYRGEWSSALSSAWVEYLLWLRGHLLAG-- >ERR1712232_311801 --------------------RREMSMAIWNRMFKKDPEAERVFKQ-SN----------ERLIFIVEKAFENAAKIYQSPSETREYIQGFLVLMK-LLLMAL--LGRFLSSRAPWL-------------------------------- >tr|A0A0G4HD16|A0A0G4HD16_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6316 PE=4 SV=1 ---LTFEQKeEIVRSAWTTLSstyqLQEIGRVLYETICEEAPGLSSRYTKPGE--------------VMALRFGEMLATLIHlfldFPNDLQQKMEELAIRHV-NYNVDLEYLPVFEISILRTVQELYCeGEFDVEVAT------------------ >tr|A0A2W4YK05|A0A2W4YK05_9SPHN Uncharacterized protein OS=Altererythrobacter 
marensis OX=543877 GN=DI636_06370 PE=4 SV=1 --------AALIERGLERAAqqLGDITPLVMREFYRRIPEAEASFRHH-APHDPH--------GLEAEMVGNTLHYIMRWHEAPmeiRIDMDTSVPHHRVALDVPPDWYRGMIEAAIDVILSSVPSSA-SDERTAWKQLRDQLVSL------ >tr|A0A1Y6FH01|A0A1Y6FH01_9SPHN Uncharacterized protein OS=Altererythrobacter xiamenensis OX=1316679 GN=SAMN06297468_2444 PE=4 SV=1 --------STLAERSFERLAeqRGDITQDVLERYYRRYPDGRASFEHH-GLGNRA--------ELEGRMVSTTAFLLMQWAQDPggtRIEQGTTIVHHQDTLEIGPRLYLGLIDAVLEVLFETIPDES-AEERAFWLSLRGEIADF------ >tr|A0A2E8LSZ4|A0A2E8LSZ4_9ACTN Uncharacterized protein OS=Actinobacteria bacterium OX=1883427 GN=CL510_01665 PE=4 SV=1 --------SELAQRSLERLSevGGDVTRPVLDAYYARHPDARASFEHH-GLGHTA--------ELEGRMVAESLYLLLTWIEDPataRIDHGTAIVHHNDSLHIPPRWYLGLVDAALDVLLRTVPEDS-PDERALWVALREEFAAF------ >tr|A0A1E4JTP1|A0A1E4JTP1_9SPHN Uncharacterized protein OS=Sphingopyxis sp. SCN 67-31 OX=1660142 GN=ABS88_06340 PE=4 SV=1 --------LELLDRSLTRAAdaIGDITPVVMARYYARHPDAAASFERH-GMGRTS--------ALEHEMVDNCLYCLMYCLERPteiEILLENSVPHHQFTLQVSFDWYRGLVDATIDVIAESVPADA-ADERQVWDEIRSVLGGV------ >tr|A0A2E0VIY1|A0A2E0VIY1_9GAMM Uncharacterized protein OS=Porticoccaceae bacterium OX=2026782 GN=CMK32_09515 PE=4 SV=1 --------NDLILNSFESAAesLGDITPHVYRRFFLQYPEAESLFNIK-GAQFQD--------ELKVQMVRDAIYAYLEYLETPeevEIVFKYTIPQHV-DLDIPIRYFIALLEAVADVVCDSVDDRTQADTKASWSELLQEFRQM------ >ERR1711865_325941 ---------------------SQFGLNAFNRLFDTEPRSEDHFKT-SN----------A---RLSMLATKSLELSMQMYKEptrVMNEVTSLGLRYI-FPAHD----------------------------------------------- >SRR2546421_6426420 ------------------------------XMIRRPPRstlfPYTTLFR-SDF------------ERQNKLLRHAFGLLLIFPNQartEPSVLTRVAERHSRrDLDIPRSEEHTSElqsRSDLVCRLLLEKKK-KNQV-------------------- >tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae GN=mphP PE=4 SV=1 --------------------VTAHSIQAVADELRAHraeFIQAANQ------------------KPD-SPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD-------------------------- >tr|D9QCQ3|D9QCQ3_CORP2 Oxidoreductase OS=Corynebacterium pseudotuberculosis 
(strain C231) GN=CpC231_1874 PE=4 SV=1 --------------------KDAFHTQVFANF--YHsnPYARATI------------------APS-EQLVPAVISLIGHLENngfisdeVKQKFLEHTKLLD-ARGF--HHYTALASAVRSALQTMCTD-------------------------- >ERR1719474_106261 ----STASLELVLDFWRCTVhrlsvhdRAMMGGDLFRGMSRQDAACRALLESL--N------PTSERMDLWGLRFLDTTGWMLRRANaaDLDASLKAMGAEDR-ARGLTVAYYRVLVERLHSELAARFPTKYSETVQAAMEEVIWSFVRR------ >ERR1719499_858439 ------------------------GRAIIEGMNHE-------------N------TSPNQMDMRTVRLLDTLGWMIRMSciPtmDLKVLYAAWNGMAA-EVGYSAEYHVSWIQYIEAQLTERFPSEYTDSVRSAVRELLRWSIPN------ >ERR1719410_2598304 -------------------------------------------------------------PSHALKILNVFGYVIRNLIHpsnhlkLFKQLQSLGTVHR-AHSLNNEMYEAMLKSFNYAMEEKFANHYKIRIRFCLSQLYRVIVDIMTG--- >ERR1719216_785110 -------------------------------------------------------------PKHTIKIITTFGYIIKNLIYskehtkIFKQLQSLGEMHQ-CHSMInTDIYMELLNAWHFAMEEKFQNKYKNNTRFCFNQLYRLIVDTLMG--- >tr|E0VF51|E0VF51_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236397 PE=3 SV=1 --------VKIVTPTWESIKedFDWYCTKIEETFFQNDTTKKELFTL-PKFEeELTDDVVNKRLFKHSSAVLNFMECIVQFMNGneeTKPVLFVLGRNHY-TIGVNEKLFLEMKDAICSVIKYKIG----TENAKAWDTILQYI--------- >tr|A0A0M3IFG8|A0A0M3IFG8_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=3 SV=1 -TGLSMHQKAILTARWRQLPqgiVFDLGKRVFGTLFQKDPNLLVVINL-EHLQGTDAWRDHVNFHMHAQRFTHALSQCMRHLVEpivAADRLQEFGATYAEmedsenfnRSRIPHSYWDRLISAMTSTAKEFHEnpsqksrrnslsvddalvatnerldLQIDSANISAWSALATFVSNQIRFGYE >ERR1719199_711328 ---FKPSHISLIQNQMSALIsefgsIEGAGEFLITQICALDEYVAKLFSG-AAL------------RVQGFKFLGQIARWVTYLADpetVEADLYNLGIRHL-GY-VTQQDFAKFLPaviqCMQKSLKDVLDEQWSALAAESWKMFLGYAGGH------ >ERR1712070_698694 ---------------------------------------------------------------LCFIIARVIDIAAQlfvEPDVCIAEVLQLGLRHI-MYKVPADFFGPFAGIIADEIEARCD--------------------------- >sp|O76243|GLBB_CERLA Body wall hemoglobin OS=Cerebratulus lacteus OX=6221 PE=1 SV=3 
-----------------------VVDAFYVELFTAHPQYQDRFA-FKGVA-LGDLKGNAAYQTQASKTVDYITAALAGSAD----AAGLASRHV-GRNVGAPEFTHAKACLAKACA------------------------------- >tr|A0A2C9LKZ0|A0A2C9LKZ0_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106051185 PE=4 SV=1 --GISLADIKVITNQWEDVLrcSDLFGKLLVLYVLDNCPKVNALHPGLHAR--LTDARD-SVEKQIGLRVIQSISCVIHNLNRapaVESMVRDTFKKLQ-QHGYTKNTILECSEAFLSFMNQYFSKRWLKQHSDAWFKVLKALL-------- >SRR5690606_9602430 -------------------------RAFYPILYSSVSGAQELFEA--TVG------------TDNRKMLQILAKLFGfisNVNhsSefMkSDAFIERGKYYA-DHGISETMMRGFSSALVLTLRRTLGELFTISHVRAWGIFLDTISHAL----- >SRR4051812_40179264 -------------------------RIFFPILYSTVPSSQELIEE--AVG------------TDSIKMLQLLVKIFRiisDINhdPevMkSEAFLERGKFYA-DHNISENMLRGFNSALTLSLRRSLGERFTISHVRAWGAFLEMISHSL----- >SRR5690242_7041980 -------------------------RAFYPILFSTVSSSQEIFEE--HIG------------SDQTRMTETLRHVLEffiSVNlnPqiLsSDKVIERAKKYA-DLGISENMLKGFSFSFLKALKQVLGGALSAEAMREMVRLLDNISIQI----- >tr|A0A0G4HHE4|A0A0G4HHE4_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6802 PE=3 SV=1 -------------------------DALLGILFEASPTMRSVFVKNGDL--------------YADLIEHLLRRIIAYADDpgaLWTDDQHLALDHI-NFGMSMSDLPLFGASLMNCLAGVLGENWCDEWQRAWEKAWQICCQSL-----