diff test-data/multimer_output/msas/B/bfd_uniclust_hits.a3m @ 9:3bd420ec162d draft
planemo upload for repository https://github.com/usegalaxy-au/tools-au commit 7726c3cba165bdc8fc6366ec0ce6596e55657468
author | galaxy-australia |
---|---|
date | Tue, 13 Sep 2022 22:04:12 +0000 |
parents | |
children | |
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/test-data/multimer_output/msas/B/bfd_uniclust_hits.a3m Tue Sep 13 22:04:12 2022 +0000 @@ -0,0 +1,2542 @@ +>chain_B +MVHLTPEEKSAVTALWGKVNVDEVGGEALGRLLVVYPWTQRFFESFGDLSTPDAVMGNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKLHVDPENFRLLGNVLVCVLAHHFGKEFTPPVQAAYQKVVAGVANALAHKYH +>ERR1719244_1811598 +MVQWSDDETKAIQMIWNSVDVNELGPAALRRCLLVYPWTQRYFGKFGDIATPTAIMQNPGVAQHGITVMNGLKLAGGPGGGPGNQPGGQQELWQRGKQQGQQQLWQQGQHGGKQRGqqQRQGQq-PSPRQSX------------------ +>tr|W5MMD7|W5MMD7_LEPOC Uncharacterized protein OS=Lepisosteus oculatus OX=7918 PE=3 SV=1 +MVTLTAEDKNNIRHVWGMVYKDPEGngAVVVIRLFTDHPETKQYFKRFKNLDTLEQMQTNPRIKLHGKRVMNTLNQVIDNLDDWAavkEILTALAERHRDVHKIHIHNFKLLFDVIIKVYGEALGPAFTDAACESWSKVFQLLYSFLQSVYT +>tr|G3WE01|G3WE01_SARHA Hemoglobin subunit mu OS=Sarcophilus harrisii OX=9305 GN=HBM PE=3 SV=1 +--MFSAEEQSHIVQIWNYLsgHEAIFGTELLQRLFTVYPSTKSYFPPL-IPG-----LELTQMQNHGEQILMAVGVAVDNMYDLRTALSGLADLHAYGLRVEPTNFHFLIHCFQVMLASHLQSEYTAEMHAAWDKFLTNVAVVLTEKYH +>tr|W5PMJ4|W5PMJ4_SHEEP Uncharacterized protein OS=Ovis aries OX=9940 PE=3 SV=1 +--SLTRAERTIVVSMWSKIstQADVIGTETLERRVTCVSRGPA-P----GSP------QS-------rgRREAGRKGRNDLEtggqgegAGRTGQRLL-RSRLRACTLSF---PPQFLSHCLLVTLASHFPADFTADAHAAWDKFLSLVSGVLTEKYR +>tr|A0A1K0GGD5|A0A1K0GGD5_RAT Globin d1 OS=Rattus norvegicus GN=Glnd1 PE=3 SV=1 +----------------------MYGLEKEp-R------------ETEGCLS---RKLPSNLQRSSAPWRLHGFQNLLERSQGA--------QRAKPG------------HGAHSHSSVKMAL--SQTDH------------------rlvL +>ERR1719474_978995 +--------------------------------LLQSSWKQ--FRT----------------------------------------FASLSGIRQEELGAGCQHQDLP----------QIQHHLWISEPSTFQQLLtftrsiktftnhylnirclflqmflslrgCVNKDSASRKKH +>ERR1719336_830457 +----------------------------------------------------------------------------------SINPQSTVDLGAQYISATPLNYKNHQDIYNSLLSNG------VLVPANVSLIEGMRQDRIDEGEE +>tr|F6XB67|F6XB67_XENTR Uncharacterized protein OS=Xenopus tropicalis PE=3 SV=1 
+-MILSEAEKAAILSLWAKAsgNVNALGAEALERILYIWQNLFSYLESP-VI---L-----KILQTGKGASVYKIR-GLDHLSTKHSILPLL-TVKKCLCLRDAGFKILLSHAIEVTLAVHFPDDFDATAQAAWDKFLAAISTALTSQYR +>tr|A0A1L8EXG7|A0A1L8EXG7_XENLA Uncharacterized protein OS=Xenopus laevis GN=XELAEV_18045093mg PE=3 SV=1 +-MSLSQAEKTLILAFWNKASglINTIGPQIVNRLLLAYPQLKTHFGNF-NVTPGS-----SDLNTLGIKIITAVGGATQHMDDLPVHLAILTDLHSLTLRIDPGNYKLMIDCIVISMAASLPQDFTAEVQNAMTNFLIIIGDILASKFC +>SRR5260364_139532 +------------T----VLapDPnPTPHSASPRRMFLSFPTTKTYFPHF-DLSHGS-----AQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKVSGGPGAIWVEGRDGAFLAGQRITRvAGGVAQAAAAGLGPRPH +>tr|A0A096M318|A0A096M318_POEFO Uncharacterized protein OS=Poecilia formosa OX=48698 PE=3 SV=1 +------HDELIITGVFFTSVSECVPP-----VRNIYRQTTNSIENIGNFKNGETFLTNPPVALYVVNMVEFTSKPLMS-LPLNGFYGILDFLK--AKRKNPNGGKLLADCLTIVIASKMGSGFTPEIQATFQKFLAVVVSALGKQYH +>tr|A0A146TSR5|A0A146TSR5_FUNHE Hemoglobin cathodic subunit beta (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 +IFHFIYFYLSTIHYIFSKIYSFFFFPSSLSIFLIFYPFTHIYFFIFFNLYNSSSITSNPNFSSHFNFFLSFLYKSFNNIYYINTTYKYLIFLHSYKLQFYPYNFNLLSYFLTIFLSFHIFSSFTP---------------------- +>tr|A0A146Z291|A0A146Z291_FUNHE Hemoglobin subunit epsilon (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 +IFYFSYHYLIIITSIFSNLYYNYFFPNSLIIFLIFYPFTHIYFSNFFNLYNSYSINTNPNIQSHFTNFLHFLYLSFNNIYNINFTYSYFIFLHSYNLHFYPYNFNLLSYFFTIFISSNIFSVIKE---------------------- +>tr|H3B4U9|H3B4U9_LATCH Cytoglobin OS=Latimeria chalumnae OX=7897 GN=CYGB PE=3 SV=1 +--QLSDTEVESIRQIWSNVytNCENVGVLVLIRFFVNFPSAKQYFSQFRHLEDPLDMERSVQLRKHARRVMGAINTVVENVEDQDKiasVLAPVGKAHALKHKVEPVYFKILSGVILEILAEEYAQHFTPEVQKAWTKLMSIICCHVTATY- +>tr|L8HVQ9|L8HVQ9_9CETA Cytoglobin OS=Bos mutus OX=72004 GN=M91_06698 PE=3 SV=1 +--ELSEAERKAVQATWARLyaNCEDVGVAILVRNRFWRkKRASSTLEEFQegaqgrdsslGSSQAQKQPGCPQLRKHACRVMGALNTVVENLHDPEKvssVLSLVGKAHALKHKVEPVYFKILSGVILEVIAEEFANDFPPETQRAWAKLRGLIYSHVTAAY- +>ERR1711977_7585 
+-MSLSAKDKTLVKKLWEKAEgkSADIGAEALGRMLVAYPQTKTYFSQWGSDLNPQ----HPQVKKHGAVIMGGVGKAVKNIDDLVRGMGALSELHAFKLRVDPANFKILAHNIIWSWPCTSLQTSPPRPTCPLTSSCRTWLWLCPRDT- +>tr|A0A1C4HCU8|A0A1C4HCU8_PROAN Myoglobin (Fragment) OS=Protopterus annectens OX=7888 GN=Mb3 PE=2 SV=1 +--MASAAQWDTTLKFWEAhVagDLKKHGHEALVRLFLKNKDSQKHFPKFKDLASEAEMRGSDGLKNHGETVFTALGKALQQRDGIANELRPLAVTHSQNHKIPLEEFENICEVIDVYLAEICPD-YAGETRTSVKAVLDVFSQSMTTLY- +>tr|A0A146P967|A0A146P967_FUNHE Hemoglobin subunit alpha OS=Fundulus heteroclitus PE=3 SV=1 +---LSKKEKKLIKDIWERLTpvAEDIGSEALLRMFTSYPGTKTYFSHL-DISPGS-----AHLNSHGKKIVLAIAGGAKDISQLTVTLAPLQTLHAYQLRIDPTNFKSCFHTVCLSRWpvTWAKSSL----RLHTQQWTSTCQPLQPCSL- +>tr|A0A146QLZ2|A0A146QLZ2_FUNHE Hemoglobin subunit alpha-2 (Fragment) OS=Fundulus heteroclitus OX=8078 PE=4 SV=1 +NIILTSNYNYTFNTFFSKFssNSYSIFSYSLSIILFFYPHTNTYFSHFNYLIPFS-----SPFNNHLstfiflfsxxxXXVMGGVEDDVEKIENMKEGIIRISEMNELNMRVEKEKLKIMEKKIIVV--------------------------------- +>tr|A0A024R1G3|A0A024R1G3_HUMAN Myoglobin OS=Homo sapiens GN=MB PE=3 SV=1 +AMGLSDGEWQLVLNVWGKVeaDIPGHGQEVLIRLFKGHPETLEKFDKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPVKYLEFISECIIQVLQSKHPGDFGADAQGAMNKALELFRKDMASNY- +>tr|M3YM80|M3YM80_MUSPF Myoglobin OS=Mustela putorius furo GN=MB PE=3 SV=1 +-MGLSDGEWQLVLNVWGKVeaDLAGHGQAVLISLCQGLESRKEEKKRDPAHACVSSRRslfVSQDLLFHSDAFLVSLGHRSflaPVSGENGQSQKTQPAHHAQHHRQPWNTEKFISDAIIQVLQSKHAGDFGAEAQAAMKKALELFRNDIAAKY- +>tr|A0A1Z5LBJ2|A0A1Z5LBJ2_ORNMO Uncharacterized protein (Fragment) OS=Ornithodoros moubata OX=6938 PE=3 SV=1 +--ALSAAERALLRALWKKLgcNVGVYATEALERTLEAFPRTKIYFSHM-DLSP-----GSAQVRAHGQSPRPQGGRRADPRRRPPGRPArrpVRSERpARAHAARGPPPLRAAGPLSAGDPRPALPWRLRPRH-------------------- +>tr|S4RW14|S4RW14_PETMA Uncharacterized protein OS=Petromyzon marinus PE=3 SV=1 +--ALSGAEKAAIADSWKAVysNYEEAGKAILIKFFTSNPGVQDFFPKFKGLDSADQLSKSAAVRWHAERIINAVNDAVVALDDpekLSLKLKALSKKHAQEFNVDPQYFKVLAVNIVEGVSSA-NGGLGAEAQAAWEKFLSQVSILLKSQY- +>tr|Q9Y0D5|Q9Y0D5_MYXGL Hemoglobin OS=Myxine glutinosa GN=Hb PE=2 SV=1 
+--RTTEGERAAVRASWAVLmkDYEHAGVQILDKFFKANPAAKPFFTKMKDLHTLEDLASSADARWHVERIIQAVNFAVINIEDrekLSNKFVKLSQDHIEEFHVtDPQYFMILSQTILDEVEKR-NGGLSGEGKSGWHKVMTIICKMLKSKY- +>tr|A0A1W0WKD0|A0A1W0WKD0_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_10224 PE=3 SV=1 +--GLTSNHIKAVRANWKLIekRLPEYGLELFVAYLNKHPDWIGLLPFLKPADMPR-LQQTPRLKAHGTIVLKKLGELLTMLDSppkLIGELLKQGSTHR-ARGLAPENFQAIQHDLNELFVKICGPE---FDIEGWDAVLTLIMTGIEEGL- +>tr|K4FYM0|K4FYM0_CALMI Hemoglobin subunit alpha OS=Callorhinchus milii OX=7868 PE=2 SV=1 +---LSKTDKALLSSSVGKIQAQATGSDVLARMFASFPQTKVYFVGFSDYTA-----KGPRVQKHGLTVMTKIIEGIQYLDSLRSFLDALSAKHAHELMVDPVNFGFLGECVLSSLAYQLPD-FSPEMHCAWDKYLCEFAYLLAEKYR +>tr|H9GUN8|H9GUN8_ANOCA Uncharacterized protein OS=Anolis carolinensis GN=LOC103282340 PE=3 SV=1 +--KMTDLDRRHIREIWTAAfeNPEENGRLVIIRFFSDYPASKQYFK---TVPTDGDLKAHPQVAFHGRRIMVAFSQVIENMENWNQACVlleRLVNNHKNIHQVPSGMFQLLFQAMLCTFDDLLGRTFTPEKRVSWEKFFQVIQEEVEAAYD +>tr|H2YFM6|H2YFM6_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1 +--SLTTEEVITLRTTWAEiskLGNATVGLAVLHRLFNDCPEVRPFFGSMlppSELSDMDSLKSNPKVVDHASRVALSINNIIQLLEntdELVSYLSFLGKVHG-ERSIPAKHFSDMGPVLLAVISAVLREDLEGVVMQTWAKAYGAIEAGI----- +>UPI000197D711 status=active +---LTPKDIYEAKQCWNKAAslgVNKVGVLLFKNIFTIAPEAAKAF-SFGNDP---NFMNNKEMEEHGVKVVMAFDHAVRSLDNIHalqETADGLRDTHSFF-NLSPEHHVIVKEALLQTLKQGLGDEFTDAQRELWNGIYTAIRNMW----- +>KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold119418_1 # 1 # 498 # 1 # ID=119418_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510 +---ISPLKLRLVQSSWRQASaDEQAGITAFKFFFEMEPVAIGMF-GLQDIR---DLYNSYELKRIAAKIVKAMTHIVNSFDNFEglrPLIKKLGMMHGEK-GVSPSQYNNFGKAFMQTVEEILGDQFTPETRRAWETFFRILTGAL----- +>tr|A0A146PHJ5|A0A146PHJ5_FUNHE Hemoglobin cathodic subunit beta OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 +-----------------------------ASWFCGFHWTQRYFPHIWRPLPPPAIAAKFPKGAAWKTVMGGLEIAVKNIGQHKAAYAKLSVMHSEKLHVDPTTSGFLLNASQWVWLPSLPPRLHPWFPGGWQKFR------------ +>tr|A0A1E7FQE1|A0A1E7FQE1_9STRA Neuroglobin OS=Fragilariopsis cylindrus CCMP1102 
OX=635003 GN=Ngb1 PE=3 SV=1 +--------MALVVESWAKIKEIENyeevaGELLFRRIFEIKPDAAAYFKFTDGFETTDeALYKQEVFIKHVKMVILTVTSAVDLLEkeNMdelFRMLKLLGAKH-LSagLKLEKEHYNLVGMALLDTLGKALGDTFTEAVKSAWIGVYAIIASKM----- +>tr|A0A150AR53|A0A150AR53_9BACT Uncharacterized protein OS=Flammeovirga sp. SJP92 OX=1775430 GN=AVL50_01545 PE=4 SV=1 +---VSNKQIELVQNSFTLITphRGQVSELFFSKLFKIDSSLESSLMV--DPK------------DQERRLIPMLSAVVNGLVDfelIIPILQDFGRTHV-EYNIQEKHYEAVQKALFYALQTVLQEKWTSEVDDAWSNIFSVLTNIMKE--- +>tr|A0A1Q9P386|A0A1Q9P386_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=hmp PE=4 SV=1 +---FSNNDIRVIDELWDLILpiKETITDSFYATLFSLDRTIKPMFKT--DLG------------VQGLRLTDTLTFIIKHMGNiedTIQIVKELGVKHL-EYGTKPYHYDLVLEALLETFDKHLEEKFNSEMRLCWIKLYKFLSELMML--- +>tr|A0A1G1B2A9|A0A1G1B2A9_9PROT Uncharacterized protein OS=Methylotenera sp. RIFCSPLOWO2_02_FULL_45_14 OX=1801615 GN=A3I83_03315 PE=3 SV=1 +---MTPMQIDVVQSTWQKVMpfREDIACLFYKRLFEIEPELSMVFKG--DMH------------DCVKKIMFMIDLAILNLGQleeVMPMLQEIGNKYV-QCGMKVDS-NAVRNTLVSTLEQRLGETFTVNVRSDWIQAYDLLVGVMKD--- +>sp|Q7SID0|GLBF1_EPTBU Globin-F1 OS=Eptatretus burgeri OX=7764 PE=1 SV=1 +--TLTDGDKKAINKIWPKIykEYEQYSLNILLRFLKCFPQAQASFPKFSTKK--SNLEQDPEVKHQAVVIFNKVNEIINSMDNqeeIIKSLKDLSQKHKTVFKVDSIWFKELSSIFVSTIDGG----------AEFEKLFSIICILLRSAY- +>tr|K1QF07|K1QF07_CRAGI Neuroglobin OS=Crassostrea gigas GN=CGI_10026082 PE=3 SV=1 +--TISEDEKRLVKDSWNLFVsrgdFSDTGSHMYKVLLQDNPHLKTLFSFMKVNGa----PFDSPMFKSHVRNVFTVIGDAVNHIDDLDSLspiLKDLGVKHQ-GYGAKKEYLEPVGNALLCTIEKHLEDDFTQEVHSAWRTFFAVMSYSFA---- +>tr|Q3MQ26|Q3MQ26_SPISO Nerve hemoglobin OS=Spisula solidissima OX=6584 GN=nHb PE=2 SV=1 +--KLTKAEKDAVANSWAALKQdwKTIGADFFVKLFETYPNIKAYFKSFDNMDMSE-IKQSPKLRAHSINFCHGLNsfiQSLDEPDVLVILVQKLTVNHFRR-KIAVDRFQEAFALYVSYAQD---HAKfDDFTAAAWTKTLKVVADVI----- +>SRR3989338_1269240 +--DFNDEEIDIIKDTWDAVLYPey---PEEGfnPVLNFSTKFYRRVFehencknlfeE--V------------DMTSQGEKLVKILSVLLVAVQTkslnqdHIHVLRKMGERHRG-YGVSDDMYEIIGGCLLRTLSEVCADVWDDDAKVVWAKLFGVVSEQM----- +>tr|A0A2G8K001|A0A2G8K001_STIJA Globin D, coelomic 
OS=Stichopus japonicus GN=BSL78_21829 PE=4 SV=1 +TAQLSEVEKNLIRSSWEQAlkNKKVFGVNVFIKLFIQNPSSQDLFEQLRGIPLE-DLKTHRKMKAHALRVMASLNTLVEQIDEVEiltEMFNNVARTHV-IHKVEKAHYDLLGQVLMEVFSEELGAKFDSATKGAWLKAYVIMENIILDKY- +>ERR1712150_314552 +MTALTEERKLHIKSSWSSVndDvdLAGNGVEFLVKLFTDFPEYMTFFPAFDGKTPE-EIRSSPKAKMHGKVLMTTLDKIVANLDDLEtviASLHRVVGSHF-PRGVTASHFKATLECFGSFLAVQLGDAFNNDVKNAWGVAVQILASVMEAEY- +>tr|A0A132AHZ9|A0A132AHZ9_SARSC Cytoglobin-1-like protein OS=Sarcoptes scabiei GN=QR98_0086180 PE=3 SV=1 +-MSLTNRDKEIIVSTWSLIrkDSDQAGIHLFKRFFEANPDYVKYFP-FGDLdDLE-KILVDPRLKWHASRVMAALSTIVDNLDDPVcfeDSLQKVLSSHL-NRKIQLYHFENLKKALVCLFMDKLGpDIMNDETIEAWSKAYDVILDTYRSRL- +>sp|Q8T7J9|GLB_YOLEI Globin OS=Yoldia eightsii PE=1 SV=1 +-MSFSAAQVDTVRSNWCSMtaDIDAAGYRIFELLFQRNPDYQSKFKAFKGLAVS-ALKGNPNAEKHIRIVLGGLGRILGALNTPEldVIYKEMASNHK-PRGVMKQQFKDMGQAIVTALSEIQSKSGGSFDRATWEALFESVANGIGQYQ- +>sp|P0C227|GLB_NERAL Globin OS=Nerita albicilla PE=1 SV=1 +LKSLSADQKAAIKSSWAAFaaDITGNGSNVLVQFFKDYPGDQSYFKKFDGKKPD-ELKGDAQLATHASQVFGSLNNMIDSMDDPDkmvGLLCKNASDHI-PRGVRQQQYKELFSTLMNYMQSLPGANVAGDTKAAWDKALNAMANIIDAEQ- +>tr|A0A1B6EVA8|A0A1B6EVA8_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.22480 PE=3 SV=1 +LEVITERDKYLAREVWMQVETNyvLISKSLFTNWITEFPEHLNFFKGLLD-SSYDDFLTSPKFEQHMANsVLPNVGIMISNLDRptdFRRHILKLAWIHIRKNiALKIDHFNILKGLILRTLKESLGRGIGRDHEVAMFKVITAGFNLFS---- +>ERR1719240_1900674 +-----------------AVArvlVHGL-ANLHRRALERLDLLLELVDAHRVVVL-RLLHRLdgrldrlHVLRRHLVLVLE------EG---------LLGAVHR-RVGLILH----------LHLRLAIGVRRGE---------------------- +>tr|A0A224XVH8|A0A224XVH8_9HEMI Putative hemoglobin-like flavoprotein (Fragment) OS=Panstrongylus lignarius PE=3 SV=1 +DIGVCNEDVAGIKETWQTVYNDkEnSGIFLFQVMFEMYPDYEKYFVRFRT-EGQKSLFDNPKFINHVKnRVMDALNDVIVNLENDErlvNILETVGENHK-KRNLRKQEFDNIGKVVIETLRRALGTSFTPKLEEAWTKVINCAMETIGK--- +>tr|A0A1B6KZX4|A0A1B6KZX4_9HEMI Uncharacterized protein (Fragment) OS=Graphocephala atropunctata GN=g.7772 PE=3 SV=1 
+YFHLSLEDKRLAREAWYnNVEGNyViVAKAVFKELFRRAPQAYNFFKHLVD-VNERDMFESPRFKRHMVqRLMVALETIFYNVYWNDvfeNHMYDQGRKHK-KRGVQPAHVKLLLCVIV----------------------------------- +>tr|R7TS60|R7TS60_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_200756 PE=3 SV=1 +-TFLTDEEVEILKASWNDLNddsdLSSIGKRVFLQAFEMRPEMKKIFP-FDNCWGD-KLLQHPKFQAHAQSFMVIIENSVEQVDNESSDFsdslTLLGQSHSDRIGFTRENVQVFLKAILAVWHDLLKS-SDDRTEKIWSKFLAHVVQIMRNGY- +>tr|A0A0X3PJM2|A0A0X3PJM2_SCHSO Globin OS=Schistocephalus solidus OX=70667 GN=GLB PE=3 SV=1 +--QLTEVQKTQLCVEWKQICKNKedkyaLGTEVFRLLFTKYPHYIRLFKRFRDLPNLDSIMQSAAFKAHAMRFIGAIDAIMENLDDescLVELLKRLAEEHRPR-GITENDFYKTLDVAYDALSPALKsDDARVALRQLFDTALSVIRQSL----- +>sp|P02214|GLB_BUSCA Globin OS=Busycotypus canaliculatus OX=57622 PE=1 SV=1 +--GLDGAQKTALKESWKVLGADGptmmkNGSLLFGLLFKTYPDTKKHFKHFDDA-TFAAMDTTGVGKAHGVAVFSGLGSMICSIDDddcVBGLAKKLSRNHLAR-GVSAADFKLLEAVFKZFLDEATQRKATDAQKDADGALLTMLIKAH----- +>ERR1719239_1832466 +--GLSEKDLVLIRGSWGMLgdlkTRKAHGVELFIQLFRAYPYMCeEYFPWFNDMSDEE-LRTSRKMKAHAHNVMNNIGSYVEVCDDPESlvaLIGKMAETHIP-RNVKALQFKELGDMFLPYLVSMMGAAATTDVQEAWRRLLAALVAVVSQ--- +>tr|A0A1I8JIG1|A0A1I8JIG1_9PLAT Uncharacterized protein OS=Macrostomum lignano GN=BOX15_Mlig002954g1 PE=3 SV=1 +--MLNEVEKKIILSGWQQAikDKKALGMDVFMTLFEMFPQHQELFRDFKGKSRAE-LEKMPKMRAHALRVVNTLDGAIQSLDDMEVcasSLELIGASHKS-HHLSAKHFEDLNAALAVVFERRLGKA-FVDNKAVWVKLLQGIIPVIQR--- +>tr|A7RZB2|A7RZB2_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g204383 PE=3 SV=1 +-IPLDAKETQLVRKTWAILGDRqvEVGKSLFLRFFEEHPTSKDLFPEFRNISNEK-IAESPALYGHARRVMKSVDNAVASIENVQVysaYLYELGTRHQ-TRQLSEEQLKFMGGAFLFAMRLHLRKEWSRATSKAWEKIFSFMADAMMR--- +>WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1887876_1 # 1 # 366 # -1 # ID=1887876_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459 +-LPVSDENKDILRESWKRLEEEktTLCKNVFIRLLQLNPNLQDTFPSFKGVALDE-LMNSRSLFLHSKRLMEALEIAISSLDDGQDfteYLTHLGERHT-AISITENHFKIMEKALIFALKDMLGESCTEDVANAWREFFQSMAGTMLA--- +>ERR1719401_2606804 
+----------------------------------------------------------QKYQAQGSRSQ---GG---ELS-RRrcvPPAQSRRA----RAGLAGDghqahclWHPPGERSEIRGSLRCCGEGSDPKLEMAWTKVFVVVSTTM----- +>ERR550519_2895140 +---LSKAERKEAENAWRIFevNLVDNGVDAFLNLVRDHPNRKDAFPWVKPELSEEALRNDPEMKKLAKLVFSAVKPAFKSLGDlqsLTNYYLNIGNELS-LMNIPPVMVSYLSDAFKKTCQKLLGSDYTHSLEASIEYVYDFITSRMFE--- +>ERR1719402_597456 +---------------------------ALIA-------LISS----------------------AAGSGCLCDARARPFSM-------LS--AI-KLIRVVSAFRATAKALLPAFEEELGTKYTDDFRYALTTLINFMADNMEK--- +>ERR1719423_342041 +----TGRQRVAVQASWRLVapDAKRHGIAIFIRLFKKHPETQLVFKSFKGQQ-PESLADNKRLAAHATTVMASVATLVDNLDDidtLLELLHKVAENHKRR-GLPIQYSTIWWRRWG----QHWTAAASRGGATSSepstrssplstsgskDNSFRNVCKMCEGISR +>tr|Q53I62|Q53I62_9ANNE Intracellular haemoglobin (Fragment) OS=Alvinella pompejana GN=hb-i PE=2 SV=1 +------------ADNIAAVrgDVSTHAMNIFVEYFKKFPQHQNAFADYKGKD-PESLKSLPKFKTHTTKVVSKLLDIVEKASDsgaLQSNCTTLAKMPQHK-GLNQQQFADLGAVLVPYLQKALGGACDSA---AWeqayn---------------- +>SRR6516164_9760095 +-IVTTPQQVQLVKQSFAKTTpiAEQAAGLFYGRLFETAPQLRPLFK--GDI------------KTQGRKLMSTIALAVGSLQKlpeLVPIVQDLGRRYV-GYGVKDDQLRYRRRRAAVDARQGaRGRLHTRCEGRVDLGLYDPrrYDEERRSAA- +>SRR5690348_1420512 +-----------------------------RHRAESAPAVSGRS------------------HSAKKEADGDDLHDDRRTERfqkAGPGSQEPRRAPC-RLWCDCGGLSIVGEALLWTLEQGLAAEFKPEVRSAWIKLYDMIATTMQAGA- +>SRR5258706_3013648 +-XMLSEKEITLGRNTWDLIapvT-QEMGIQFYEHLFETSPELKPLFKT--NP------------KDQAMKLMFMLSYFVHRLDKendLRAEIKKLAQRQS-GYGAKPEHYKLIRDTLLCSMQNDLRKPWNKETESSCQ--------------- +>SRR3712207_8213275 +-RLMREYRLAVIFFFFSSR--RRHTRYWRDWSSDVCSSDLSLFK--GDI------------TEQGRKLMQMIGVAVRSLDRleqVMPAVQALGARHV-GYGRSEERRVGKEGRSRWGPDHX----------------------------- +>SRR4029077_8512364 +--CVTPQQIDLVQASWKQVVpvSETAAQMFYGRLFFLDPSLRRLVL--RGK------------RGGGERGGAVVLG-RQGEEGeegEGSALIHRDRAQA-AGGP-PPRGPAPGAAA----------------------------RHVRRS-- +>SRR5437868_6476409 
+-----MDEILLLKTSLQKMGpqLEHAAGTFAVRLFQLNPSL-------GEI------------ATRGRELLQMMGAAVQNLGRldqLAPSARQFGRHYA-NCHIREQDYDAVGEAFLWSLGRGLGRDFTEEMEAAWGKVYWLMTEIIRAG-- +>SRR5689334_13356078 +------------QVSFTQVApiAETATQLFYARLFELDPDLELLFK--GNL------------SEQGASLCKCSHLRSTVLTGwsnFCQSCNRLAHDTS-AMGFETKTTTQWDRRFCGRYGKGWV------------RPSHLRLSX------ +>SRR5437870_6238790 +-FDVTPIQVDLIRASWAKVEpiQELAASLFYDRLDRKSTRLNSSHVA-ISY------------AV---------FCLKKKKKKkek---------------YTHEHINNNKV---------------------------------------- +>tr|A0A136P213|A0A136P213_9CHLR Globin OS=Chloroflexi bacterium OLB13 GN=UZ13_01312 PE=3 SV=1 +-ESLTEHDKKLVQRSFTHIApqNEDIAAVFYARLFELDPDIEHLFS--TGL------------DVQRAKLMRMMADLVNALDApeaLSQSMRELGKQHV-SYGVHDKHYATVGEALIWALRKVCPAVMTPTVTQAWEKTYALFAELAIS--- +>tr|A0A0C3QP41|A0A0C3QP41_9GAMM Uncharacterized protein OS=Shewanella sp. cp20 GN=DB48_17865 PE=3 SV=1 +-MPLTDEQKRLIQKSYAEIDrqNSNFAAIFYDCLFAMAPLIRPMFKS--ER------------PVFEYHFNELISTAATKVFEfeeIKPRLVVLGQKHR-GYGVTPAQFDVVRSALMLSIQDCLRDTCNPAIEQAWSCYYDEIAKVMIAA-- +>SRR5262245_10239308 +-GPENARPGNL-RHHYadrgrcsGSLLpeAvqaRSVAGRHVSRRHERAAEE--AAAD-ADG------------RRQGARSA----RSGRGGRRgsrPAPRAIRRDRQAL-RHGRHGS---P------LGARGGTRARFTPSVKKAWATVYGLLATTMKNA-- +>SRR3981081_1073077 +-VVATPSPSRRRISDFG-------------RLKML-NSGKPEFGAgeGSSC------------CSGRSHLLVAILRHVAGIA------------------------------------------------------------------- +>SaaInlV_135m_DNA_2_1039731.scaffolds.fasta_scaffold157242_1 # 1 # 360 # 1 # ID=157242_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.458 +--LLSPATRELVRSSFPMVEriAPRAGTMFYGRLFATAPEVLPQFR--RDLS------------QPNFQPaaehrfMQLVLFVrstaeHAGLPGsagHDETVGKLAQRHV-GYTTRAPHYAPLGRALLWTLDECLGADFTPAMRAAWSDTYDVLVASMVAPL- +>tr|A0A0P1GRZ8|A0A0P1GRZ8_9RHOB Soluble cytochrome O OS=Thalassobius mediterraneus GN=vhb PE=3 SV=1 +MNLLSKDEVALIQGAYRALGpsKGFLTNSFYRRLFAIAPQARPLFP--QDM------------DEQLKKLEHMLDLLVDNLHQpmfFMGKLKRLAKRHV-GYGAQPEHYALVGEALIFALNDITPGGLPDKERALWVEIYTAISNTMIET-- 
+>APLak6261659701_1056019.scaffolds.fasta_scaffold514158_1 # 3 # 230 # 1 # ID=514158_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561 +-IELNAKNKALVKEGWKLLIEtqFPnevggneralarFFDEFYRKFFEVNPSGKRLFEE-GGM------------AVQSKALVKMMSMVVTSLENpsnLDLTIERLGGRHE-LYGVSRSDYLAFTNAMCETLETVLGDKCNQEMKESWSLVLNNLSEKMLT--- +>SRR3954466_1768845 +-SCHDSGTGDARS---ADIRpgradRRQGGGDFLRSVVRGRPHGQAVVP--GRH------------SRAAPQTHRHAGGRGPRLSDLpsiLPAASALAKRHV-DYGARPEHYPVVGAALLWTLERGLGPQWTSEAASAWTAAYATLSSFMIA--- +>SRR6185295_9741709 +----------------------------LTTWVKHLRRSIMVCG--DDM------------MDRRKRFTQVVSATVRGLARvdmLLPAVREFGMRHP-LPGEIEQHHANVASALLWMLEKALRKDFTPEVKAAWIKAYGMLSQTIRQS-- +>tr|D7G782|D7G782_ECTSI Globin OS=Ectocarpus siliculosus OX=2880 GN=Esi_0008_0247 PE=3 SV=1 +--VDVEGYKAEIRRTFALVEpiSVQAAGIFYPTLWEVDTSTKPLFKD-TDM------------DKQGEKLMKTLGVAVAMLNKmdtLKPILENLGRKHV-DYGVTPEMYPSVGKALLITFEKGLGEECTPLTTKAWTWVFGIISSICIAAA- +>SRR5215207_7597532 +-QTMTRDQIRLVQASFRNVLpiRELAAALFYDRLFEIDPGTRGLFVD-TDL------------RSQGGKLMAAIGMVVHALDApesMVEKLKELARRHV-NYRQLQESSPPDFHRLhrfgsgrgsqRHVVSKGPGVAPVGQ----HVVPTHFASRvsrRLRAC-- +>SRR5262249_41212017 +-NVMTPEQKRLVRDTWKQVApiADAAADMFYRRLFEIDPTTRELFHA-TDM------------VAQRKKLLQMLAFAISGLDNlgaLVSKVEDLGRRTP-AVALPTRTTIPWAPRCCGPWNRVSVTRGHP----RWRRHGPRstnccpascatlprapsscktcgplrrgrplerqgICCVFRKR-- +>ERR1700730_6579985 +--RQRLADDGVILRVLQRGLgiELEMEALAREEIGELDPDAarfRPHHA--VGG------------GEVGGRHIELLRRHVDQRPpcHaaaNGSARISLPRGHV-SYGAKPRHYPVVGAALLWTLEKGLGDGWTPEVADAWLTAYSTLSGYMIS--- +>tr|A0A0N0UYC0|A0A0N0UYC0_9BACT Uncharacterized protein OS=bacterium 336/3 OX=1664068 GN=AD998_10010 PE=3 SV=1 +------EQKEIIKSSFPRVLihTLKNSTIVYEKLFMDIPEAKDLFKN-TS------------IDKQGQMLVAAIGKIVKGLDNpdiFEKDLVELATRHV-GYGLKPEYFTHFGNALINMFEVSLVDSWDKDLHDAWVAVYQEVAEIMKSVI- +>SRR5918994_1539718 +-------QQELIRESWQRFEpkIKRASPQFYERLFALDPAVRRLFSG-VNM------------AEQERKLMAMLKEIVPELDRptdLVAAVGRRSPFTP-HpepSGWLDPRYAWMRSRTPLP---CSGEX------------------------- 
+>tagenome__1003787_1003787.scaffolds.fasta_scaffold20949172_5 # 2657 # 2851 # 1 # ID=20949172_5;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.626 +-------DETALLKGFDLAAdvLDEVIDNFYTELLESYPDLQPLFAH-TNT------------QQQRQKLQDVIYLLIENIHNqdvLESALLSLGERHI-RYGALPEHYPVVAEILESNLKKRLGRSWTKAVSTAWIQLLSAAADVMCRPY- +>ERR1700753_815890 +--XMKSSTMELLSSSFARVcaDKNNAAGIFYARLFTTAPELRAAFQS--DF------------DSVQWKLMSSLVQIVEFYRVgvdPTSYLADLGRSRQ-GYAAQRAQFDAVGDAILFTLAQVLGQGFGADIRAAWVSAYAA---------- +>tr|A0A1H2YYM1|A0A1H2YYM1_9RHOB Hemoglobin-like flavoprotein OS=Albimonas donghaensis OX=356660 GN=SAMN05444336_103306 PE=3 SV=1 +AMPLDSTNLARMREMLHILRrdAPDASTDFYQALFERAPELRTLFRD-SDL------------AGQGRKFMAMLGLLVDACEDygrLGNEIRELGRGHA-AYGVEARFFPPMEEALIDTMRSNLGERFTPELEADWRKLYAIVANEMMSP-- +>tr|A0A1T2B631|A0A1T2B631_9RHOB Uncharacterized protein OS=Thioclava sp. DLFJ4-1 OX=1915313 GN=BMI85_03370 PE=4 SV=1 +EPLLPAERAARVKASAARLDfeDPSLFRDAFARLFAVHPELDQVLPN--SE------------GGQQLKYAAMMEVILSTLDPpeeQELELPGLGQMHV-LFGAEPDYYVWLSEAVIAGLAAKLGDHWTSELAADWAELFSKVSAQMIAG-- +>tr|A0A2E1AIS1|A0A2E1AIS1_9CHLR Uncharacterized protein OS=Anaerolineaceae bacterium OX=2024896 GN=CL607_22355 PE=3 SV=1 +MSPVTSRQKLLL--HYTLLHldADQMGKLFYDHILAAMPEVAPMFTD---L------------ESQRKHFMKMMIRIVHTIDEpdhLNIVLRELGHIHK-RLHLKPRHFSKMGVAFSNSLAEVMGDRYTPEIGEAWRILYNRVAEAMQSP-- +>SRR5262245_62462516 +--------IFIFLLFFFFCLcf-CFMFFFFFSSRRRHTRCLSDWSS--DVC------------SSDLQKLLAALALVVRSLHTpekILGPVKKLAVKHV-DYGVRPEHYTYVGNALLRTLKKGFGREFTPELSDAWVEAFRMLAKVMKEA-- +>tr|A0A2D6AZC8|A0A2D6AZC8_9BACT Uncharacterized protein OS=Flammeovirgaceae bacterium GN=CMB80_28915 PE=4 SV=1 +SNTMTSESINMISKSWDLLSRdPQLVTRFYNRLFDIAPETRRYFK--DDI------------SKQSEKLAHTLNFLVMNLDRldeIKESIEDLGRHHN-KMKIKAEYYVYVKEALLTTIQETLDEQCESGMVEAWDHALSHVASTMINA-- +>SRR5262245_55554356 +--CVTPEHRLLAQQAFATIQplADELGLLFYSRLFELDGALRGLFKH--DL------------ANQAHSLMAMLQLTIEGLDApeqFTRARTTWGYATWTmGFSRTSTRLLRRPCSGRSSMRX------------------------------ +>SRR6516165_4200192 
+------AQ--------------------------------------SDL------------VDRGRA------YRLLGLADLvdrrnQAaagGLSLFHRRAV----------------------SAGGVAWADRVLDALSlylcgyelrwpQLDHALGRgavhpdacaSLLRE-- +>ERR1700733_1486793 +--------------SQAHGGdiVDLyRDVRLVYRLFRRLPPAEQDAIP-GDH------------RRGRLSRaAGRVAL---------APVRRAARRQ---------DRRREG-DVLELRRDGRGDDRRHVFHRDQElswlSDDV--PR-VVRD-- +>SRR5215831_4136876 +--KHDPPTDLARAEQLQVRCA----DRVKGRRSLLRPSLRDRSRGP-AA--------------LPRKIIRAEGKVdgdANEDRQqssSAQchFASCTPTRRaaQ-GLRCLDGSLWGSGCCLLWTLEQGLGSAFTPEVKAAWSEAYRTLAGAMQEG-- +>tr|W5NBV0|W5NBV0_LEPOC Uncharacterized protein OS=Lepisosteus oculatus PE=3 SV=1 +-VPLTESQKDLIRESWKVVhqDIARLGIIMFIRLFETHPECKDVFFIFREIDDLQELKMSKELQAHGLRVMSFIEKSVARLAQedkLEQIALELGKCHC-RYNAPPKYYEYVGVQFISAVKPILKDSWSPQVEQAWESLFAYLAAVMKRGYH +>ERR1711911_21978 +ATGLTARQKRIIAKNWDLVRpnLKEAGVGLFIAYLTKHPEMQARFKSFATVP-LNELAANRKLQAHAANIMYSMTMLVDSLNDvecLVQHLATIGRNHR-RRHLKRHHFQDLAVVIVDFLEAALAAHWSAEARQSWTLALNVIVDQICNVL- +>SRR5215218_21909 +-CAMNPEQIGLLAESWKGVAgrRDEIARAFYGVLFDRHPELRSMFAH-TDM------------RAQYEKFALMIDEIVQLRTEprqFVRSAVLLGQRHA-AYGVTRDHYGPAGAALIEALAEALGSAFTPAAREAWTEGYLLMSSIMCR--- +>SRR5688500_19518083 +-LLITPAP--------------------PSAIHTRYLHDALPIAH-VDM------------GAQYEKFAAMVDEIVGLRTEphrFVRSAVLLGQRHA-RYGVTRDHYAPAGAALIEVLDRKSTRLNSSHLVVSYA----VSCSIQ----- +>SRR5258706_7695680 +--RHDPPPdpadPPVLRPA----RvqGRETRHLDVQAPVPARPRPTPAVQ------------------------------------------------------------------------------------------------------- +>SRR4026207_1847514 +-PLMTSNQRQLVRQSFDAVRdqAGPFSLLFYGKLFELDPSARRMFHV--DL------------ALQGRKIVDTLATVTESLDRfesIRPRLASLGRQHA-GYGVRPEQYDTITAALLWAIGQALGADFDAPTREAWKLALNAVSTATIEGA- +>SRR5260221_10622870 +--IVNAAQQELVMTKAEGVvlMPGVTGVLLCALLISANPSFRPLFKS--DM------------RIQGVKLMTMLAMVVYNLPEpgqVLPAIRDRSEEHT-SELQSHSDFVCR--LLLLHX-------------------------------- +>SRR6516225_5669596 
+-NVMTPEQKRLAScfrrggppGSWRRPSppLGIETAQVFRIPCVLPN--AAVHTA-GVS------------DHNNSDTYRAALRPAH---R-AASQTASVRNHE-RIQSETAM--REGL--rrvTYARVLRTGS-hRTPYrnVTP------------------ +>SRR5215203_7560530 +-RPMTPDQVSLVRDARRAIesRHAEFSAAFHDALHELDVDTCALFRD-TVT------------GGRACNVGAMLDLLQQASDDpraLIEVAAELGRAHA-HAGVRDVHHHVAGVALHRALHRVLGVEFTPAMYEAWAEAFTLLIAVMERAA- +>SRR5215470_20101711 +-KSMTPQQIALVQCSFKSVApiASKAADLFYDPALRDrsrgaaALPH--------RFV------------G----AEGQADGDASNGHQ--------------QSPSARCHFANRAATLRPA-Q------------------------------- +>SRR5919197_1191720 +--VLTRDQADIVQLTWRAVLpvGDTFAELFYGRLFALDPQLRRLFR--ENL------------VEQGRNLTAMLSVAAANLARpekISVALRQLGRRPT-RSSRARCSRSLLRDLLRLPLDARRA--VADGVARVVVafaRAVVAIP-RVIHG-- +>SRR5690606_39578087 +--------------------------------------ADHLSP--LPlP------------TRRSSDLLRMLAFIVKSLDWadrqwredvnpdedLMLVVLALGRRHTELYKIPDESYGAVAEALLWTLDYGLGRSEEHTSELQ--S-------REN---- +>SRR3954469_10060132 +-QRMTPEHIHTVQSSWNKVLpaGNGKARLLFERLLQTETSLCGLFQ--LDG------------ATWSANLVQMIDVLVTGLSLgdrSAVLTRRVGGRNT-ACPGIEHHYDLIGTALLRTLAKRLRAEFTPRVEAAWAIVYEELVESMRKA-- +>SRR6266508_6374850 +NFAMTKEQIALVKNSWKLFrkvDACLIGDVFYSKLFFDNPQLRQLFP--ASM------------EERYRKMIDMLSVIISRLDRlneMTKDIKVMALRHE-SHGVKPRHCKLLGNALRWTMERGLGNDWNDDVKEAGLACYTKLIETMIQ--- +>SRR5215475_4417451 +--PMTPLQRRLLHQSFSRIEpfSQRLGDVFYARFFSTSPAMRALFSR--DI------------KVQQSKFMKVISEIIKLPLlsfsvtdsqdSesLVPGAYWSGMLHG-ALSVKQQDFASMKAALLWALSNCP---------------------------- +>tr|V4A5G6|V4A5G6_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233247 PE=3 SV=1 +-ADLTEKDKELVKSSWAKFNegdVIADGAHIYYKLFEKAPEAKEKFGFAKD---GEVSLENKQFKAHVRKVLDVFESVVREIDQlegLLPVLNDLGARHK-SYGVPLKYYEILGSCIMYAWDRKLKM--DADTKKAWGKLYGVVQTEMKKG-- +>SRR5262249_25899110 +--MMNTQHIARIRLSFAWIApsADVFGELFVANLRALDPSLSGLLA--AEA------------GPQGWQLISILRSIIGGRDRpdrLFWRLQSFGRRLA-GDGLCAEDYDTIGDALMLTLEQCLGERLTPDVAAAWDATYAALAEVVQL--- +>ERR1719223_727152 
+---PSSAQVDAVTASWDKVAalgAETVGVLLFKRIFEIAPALESELS-EKPTA---IIIGDLTLAREMT----EEEKETIDLEEkeePeeveekeEPEEVDEQETTE-GRIISTESF------------------------------------------- +>ERR1719336_2939639 +--PLDERDIDLVQQTLGRVAilgLDNVGWVLFMNTFKIAPAAQGLFE-AGFLQlkplnkpfnDMPELAKSSNMKETGGRVVETLAAAVGLLRDlgtLVPILQDLGKKGV-SCGVIPAHYDIFGEALITSLQLALGANFTDPVKNAYLKVYTIVKNTMIG--- +>tr|A0A1D8RRN7|A0A1D8RRN7_9GAMM Uncharacterized protein OS=Colwellia sp. PAMC 20917 GN=A3Q34_02175 PE=4 SV=1 +---MTAKQINLVQQSWQKVLilSPDVGDLFYQQLFVLRPELATLLKN--DK------------QdKirANKDFICLLSQEINLLQPielTEEKV---NTSVT-TNDV-KNYQADVENALLLALTMILDKELKIALKRAWISTIKRLVGSIVIEL- +>ERR1700730_15638689 +--AMTPKQVALVQDSFAKVAltSEAAAVLFYNRLFDIAPQMKAMFP--DDM------------VEQRRKLMSMLAGVVKGLANLeqvFAGRQRTGKAAC-QLRCEGG--ALSGGRRRVAVDAGEGsGGWLDAGSGGcVGHRlWHAVRLHDFPS-- +>ERR1712166_353516 +-VVAQFAALNAVDDKW-----VTQGVLLFKHMFRINPGMKQMFS-FRDIP-DDELYDSMKLKKHGVSVYTYIEKAVDGWGTpeIADALQKLGARHL-PREVKMEHFDVVGESILTSLSDVFGDQFDDKSREIWTRVYGVIV-------- +>tr|A0A1S2XZ06|A0A1S2XZ06_CICAR leghemoglobin-like OS=Cicer arietinum GN=LOC101502441 PE=3 SV=1 +MDALTEKQEALVNSSWEAFkkNIPHLSIVFYSSILEKAPESKDMFSFLKNF--DGIPHQNSTLEAHAEKIFDMTRDAAIQLRAkgkIdlaNDvTLEYLASVHV-QKGVTEQHFVVLKEAMLKTIKKAMDDKWSEELSCAWSIPYDQLAATIKKAM- +>OlaalgELextract3_1021956.scaffolds.fasta_scaffold1056695_1 # 380 # 499 # -1 # ID=1056695_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.392 +-MALTATDVEVIQTTFKeVAEnvgAEKAGIILFKNVFDAAPGAAKLFS-FGRVEgfdPAADHSTNPAVVKHATGVITTVAKAVASLTDlsaVLPMLTALGKRHS-KYGVKKEHFGIVGAAFLKTLSTALGDKYTKEVEAAYTKLWGVVSKTFREAG- +>SRR5271157_4306781 +-----VSDVEFLKETWGQItDKSSFAERFYSLLLAVFPVAKPLFSK-TDW------------QSQYSLLMASIDYMVMGIKygrNIQPTLHLLGARHD-YYGVAPVFYIPFNACLLITLQK------------------------------ +>SRR6266566_5437046 +--DLTPENCDFMTEHHDL--------RILGRLVATE---------------------------------------------------------Q-EQPVKDPDHDQIeeatrhrprscPTLFIWPNRRSQPLhrvlmRYMPvpgpRSPPSWCGPPSRSRSHGPRttT-- +>SRR5579859_7196529 
+-GARDD--T-----------gsGQaCSAEFLQGR--------------T-HR------------RSGGDpVLRSPVRNCAAGQSDVsrrHDRTAEKADRHA-CGRCeRSgrLALDPAGreracq--TprrLWRQGcalpgrrrrlvvdAGK-GIGRgvdarrrrrmdhrlrhavrfHDFRSLWQCPG------------ +>SRR6185312_354929 +---MVR--A-----------rgSAkC--WKCRWR--------------D-RA--------------SVSnSLPAPATSSAGSACSNfs-------MNGTA---SSkQPefDRVPRGGrgrgrrrKMTpeqVSLVQqsfakvapiseqaavlFYD-RL-FevapavkamfpadmteqrkKLM----------GTLAV-V--- +>APLak6261666328_1056055.scaffolds.fasta_scaffold241778_1 # 2 # 196 # 1 # ID=241778_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.415 +-GAKTAGGL---NLLFL--AivSS----EPENGFVTISPAAKDLFP-A-DL------------TEQRKKLIATLAIVVNRLSNLqsiLPAARTLTKRHV-NYGAKPEHYPVVGSAVLH-AGgrPRLGLDARSRLrsdGCVWHAVRLDDgrnleHEFANL--- +>SRR3954463_16408791 +------QQITLVQESFARLAhdKARFGASFFKRLFKVDPTLEQSFAG-VD------------MQAHALKLVDAISFVVGGLRQpetLVGPVQKLGAARC-CRRCPTSSRTSGPRSSVPPGT------------------------------- +>SRR3569832_1984102 +----------------------------------LEPKARSMFNF--RAD------------EDleaNPQFMVHARAMVDMIdmavgflgPDldpLIEDLSHLGKRHI-SYGVKPEYFSIMERAVMFAMEELLDDKLTKEDRTSWQLVFHFMITH------ +>tr|B3SDK5|B3SDK5_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_62364 PE=3 SV=1 +-SYLNYQERQAIIDSWNAIstEKQKYGTILFLKLFELEPRVKSLFTIF-DFN--EpleDIIQSPHFRSHAMRFMQSLETGVLMGFDkesCDFLFKSLGSRHH-FYDLKSEFLDVIPECILHTIKKGCGNNWSNETADAWKIATKVLCELFREG-- +>tr|C1C1M6|C1C1M6_CALCM Non-symbiotic hemoglobin 1 OS=Caligus clemensi OX=344056 GN=HBL1 PE=2 SV=1 +MSILTSNELSLISESWKLVvpDLEHHGLSFFLKLFEEYPTYQEKFFPELH-------QDERKIQRHGAIVLKSVGK-LVAFLEankviaLVDAIKRLATNHS-RRGVLREQFYPACRILLEYLAQALGTHLSTEGALAWKRFLGTFVELMQ---- +>SRR5450759_1049036 +--ALTAEaPYSELKnlCVWSKT------NAGMGSLYRSQHELVFVF-K-NGMRPHINNvelgrfgrnrtniwnyAGASSFGstrdselamHPTVKPLSLVADAIlDCSKRggivldafagsgtTLIAAEKTGRR---GYGTELDPFYADT----------------------ivrrFEDAYGL-KAVHVE--- +>DeetaT_11_FD_k123_441726_1 # 2 # 373 # 1 # ID=403715_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.481 
+--GLTDLQIEMIRSSWEKVTpnKKHHGQLLFHKLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG--- +>AP82_1055514.scaffolds.fasta_scaffold664619_1 # 53 # 358 # 1 # ID=664619_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.458 +---MSGFALRLVLTQRQKATrkrpiaqyvienhSINFAFHYIDRLFEIAPEMTDLFP-FGDD------FTKPQFTTHALNIMNALDHAIQNLDNpdvLIPKLRELGQMHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG--- +>SRR5210317_1560035 +------------------XmtSL----KSSMIGFFRNHQNCAKMFGE--DMR------------DQAQKLAAILQVAFDNLDHvdsLVPILEDVGAKHA-TYAVTPEHYGLVAAALIGTISTELGDAFDERAAESFEAVLGTVANVMISG-- +>tr|A0A037ZKD6|A0A037ZKD6_9RHOB Uncharacterized protein OS=Actibacterium mucosum KCTC 23349 GN=ACMU_09600 PE=3 SV=1 +--MAHKGRVQTVRDSFQVVrtDADAFARGFYDRLFAKRPEMRGLFAD--DMS------------AQQAKLVTTLVTAVNMFDTpsqLIKPLKQLGASHA-QMGLSQADYQLVVDTIIETLETTLGSAWDVAHDRAWRGLLDFVSNVMQEG-- +>SRR5688500_932283 +--MLSDAEKQAIRESWQLVLpvVETAADLFYRRLAEQNPALRARGQ--DQL------------VAQRKEFVTTFSFVVRGLAWeasewrsdapdeddLFLGMLALGQRGSRLARLIEQHYSATGDTLLWTLTYALGKRFDAKARAAWMRLYTLLAIALR---- +>SRR5688572_29427622 +---------------WALCAprADLLAAAYYQRLFERLPALRIRFP--ADL------------APARQRLVGLLRFVARALYWpaddwrrplpieedLLAILLALSRRHRGLGEVDDAVRAVSREALVAAIGEILAGEANPSIIDTWGKLHDLAADAFVL--- +>APIni6443716594_1056825.scaffolds.fasta_scaffold11231735_1 # 3 # 137 # 1 # ID=11231735_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.400 +--LLTADERAVLKLDWSRLTrvdQQDMGMRIFLRIFELEPSTKLSFPELYHL-TGDQLISNTLFRCHGARFMRAVAAAVDNVDALdlvvIPNLIQLGRLHQSVDGLRWRHLEVFEQAMTEVWAVELNLSgswSGSTSAVVWSKVFRLITSKVYEGFQ +>tr|A7RWR6|A7RWR6_NEMVE Predicted protein OS=Nematostella vectensis OX=45351 GN=v1g203304 PE=3 SV=1 +-CDMTYEQKYLIRETWKFLEvsKKEIGVSVYKRFLNMHPGLQTYFSEFKHIKID-NI---NGSHGHPRRLLMAIDNAVTALGDsdsFSAYLVELGRRHH-GMnfRPGPTHFNDLRKCFLSVIEEILATAslWDFQVEEAWNRLFDSITAMILRG-- +>SRR6516164_7981020 
+-SPLTEAQKRLVRESFESMQeyETSVVVLFYGRLFEIAPETRTLFKI--DI------------REQSRSSWIPSGL------------------------------LSIRLTISWNCRQLLR---------NWDESTSltAFSPITMGN-- +>SRR6185503_3589201 +---MKAEQLELVIDSLTVIQpiADQIAKSFYKHLFEIAPQTKKLFT--GDM------------DRQGIMLITSLSLAVNGLSDmenTLPSVQALGERHY-SYGVKPEYYQPAVESFLWSLEYHLGDQFTPELKESWRTAFQALADTMLSVY- +>tr|A0A0P6AJ75|A0A0P6AJ75_9CRUS Globin OS=Daphnia magna PE=3 SV=1 +MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGALLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDNLSKVLIDFLAAKLGGEFTPEARQAWTATMQGINTVVEA--- +>tr|A0A0P5NXY2|A0A0P5NXY2_9CRUS Globin (Fragment) OS=Daphnia magna PE=3 SV=1 +MDTLKTVNVSAVQNTWAIVNkdLNTHAPHFYVALLTAHPEYQPMFPTIANVP-AGELLNNAALKTLSVNVLTKLSELIGCMGNpdaLNAQLVDLANQHK-GRGTTRAHFDVSKS-FSNFEC-----PENEVSRKDWTKNLSILQ-------- +>tr|Q93101|Q93101_9ANNE Nerve myoglobin OS=Aphrodita aculeata PE=2 SV=1 +MAGLSGADIAVIRSTWAKVQgsgSAtDIGRSIFIKFFELDPAAQNEFPCKGESL-AA-LKTNVLLGQHGAKFMEYITTAvNGLDDYagkAHGPLTELGSRHK-TRGTTPANFGKAGEALLAILASVVGGDFTPAAKDAWTKVYNTISSTMQA--- +>tr|A0A210Q3Q0|A0A210Q3Q0_MIZYE Neuroglobin OS=Mizuhopecten yessoensis GN=KP79_PYT10061 PE=3 SV=1 +-TYLTPRQIHLVQDTWDIIkdDLSKLGVIVFLRLFETEPDLKHLFPKIVQMNEQNKLeWDIDrdMLTKHAVSVMEGLGAAVESLDEsefLNSVLISIGQTHV-KRHVKPQMLKRLWPSLNYGLKQVLQSKYNKEVNEAWKKVYFYIVAHMKRG-- +>ERR1719460_671936 +--MVDAVVKGDVQRTWELVIPpdsgddhvFAIGKLFFDRIFEVTPGAEALFS-FKGE----DRAESAKFRAHAIKVIKTVGVAVAKLDDletLVPILEDLGKKHV-AYGVVASTTT----SSVWRCCGRSRRGWATNSRPTW---------------- +>ERR1712223_635401 +IPKLTAEEKSVLQASWANVNkkIEIAGAQTFIRMFESNPETQNQFRKFQGMDL-VQLEQSAEMAQHGKRVLSIVGMTVDNLDNyqiVWDNLIKVGREHF-TFGALPMYFDLMGPHFVIAVRSCLGNDWYEALEYHWLALFNMIVYAMKFGWN +>ERR1712062_404977 +--ILTNQEISVLKSSWELIAkkIEIAGAHTFLPTFDRDPKCPDN------------------IERHCQRVMSVVGGSIELINDyksLWKHLISLGREHF-GKIREWIFASIAGGSTersgcspssINFLSSKINGNITSKK--CFLQ-YKIVIITQX---- +>SRR6266567_6698575 +--------------------LIVFTSTCLWSI----RKPNHSLPKR-IC------------VVKLAHCWLHLTTVVAGVlreDNLVPVLQQLGQRHK-SYGVKAEYYPFFRAVLLETFQHYLGPRFTPKMQQAWEEAFEMISTQMLKGA- +>SRR5215217_5048650 
+--RVTARGRAR---HVLLRApvRDRRGRGTTVRRHRHGSAA-----------------------PQ---VRRDARQDRARSGRaatLVPDVAALARRHV-GYGVEDRHYTSVGEALLFALGDTLGDRFTSDVHAAWVEAYALLAALMQR--- +>APDOM4702015191_1054821.scaffolds.fasta_scaffold152199_1 # 3 # 686 # -1 # ID=152199_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.531 +------------------------------------------MS--GDF------------SPEQKRYLEGFTS------GLq------IARTGR-GLG-KPAASVPSGPD-----AEHLIAQDQ----------------------- +>SRR5262249_5171126 +----EPDSALLVQSTIG-VLvqhQRRFTSELYRRLFGLAPGAQALFRS--DM------------ESQGKMLAHMLEFLVYATSRpetMTLGWRELGRGHD-GCGVGAEYYPAFRQAFLESARVVLDEKHTPQVEKAWADTLDMMIVSMLGP-- +>APCry1669189000_1035189.scaffolds.fasta_scaffold267513_1 # 3 # 467 # -1 # ID=267513_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658 +-VVLSDQHKKVIVRNWTILStdLSGRGTRIFLLIFGRNPLIKSIFS-FGHLE-GDELVCDPRFKGHALRFMQAVGAVVDNIDDynnaVKPILNDLGRRHTQFKGFKPIYFNEFQDSILQVSENGTCKQngeiriLNPSaagvnfCTPPLGKFSASEMTCIVSsGA- +>tr|W6FSH9|W6FSH9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_a PE=2 SV=1 +-LDFSDDQKADIKSTWETLYsgnKFQLGVELMANLFKAHPDYQDLFPSLKGIPD---VAGSNELRGHAIRVITGINNFVDALDEeeevMREMLHNMARSHK-PRKLTKTHFNEFAPILLETFEKKVD--MSSKARDAWIALYYSIVDNLFAE-- +>tr|W6FIG9|W6FIG9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_b PE=2 SV=1 +-MVVSAEQKALIQGAWTPIYagnRFQLGVDIFAHFFKAHPNYANLFPSLVGVPN---PSTSVELRGHAIRVLTGINYFVAALDEkkpvIMEMIHNMARSHK-PRKLTREHFAQFAPVLFDT----IG--VSGPARDAFLPYYNFIADNLFAE-- +>tr|A0A023RLQ7|A0A023RLQ7_AERME Globin OS=Aeromonas media WS OX=1208104 GN=B224_3582 PE=3 SV=1 +---MTPEQIELVQRAWGRVTalNNTYVQEVYAELFRLSPDLINLFPDPAG--------------MPVTKVSETLNTVITSLEQLdalGFIIRDLGRRHR-QFNVQSHQFGLLKQALTLVLARRLGEHFTPALSEAWSQMYDEIAALMLEGL- +>SRR5437899_2276119 +-------------------YpaVQKSGAAVYRPALVAELRDRPY-E--FDI------------QVQLCVYLARMA--------leIVAALN-----AA-GWICVPKDPSPEM------LKAAWAYALDEDAAGVWKSMIAA---------- +>ERR1700757_2961956 
+------------------------------------------------------------------RFNRLAGRERRAPARtr----ARQSR-------QRPGPSRHDPTrLALSD----------VSEAERTDIVVS------------ +>SRR5215213_1430710 +---------------------------------YLYPFLRPMFK--ENI------------QLQARKFSAHVSLVIGNIKDrntLQPMFEEMRNLHL-NHNVKTHHYNYVQEALFYALKNHLVKEWDEHTESAWIKFYNIMASQMAA--- +>SRR4051794_22176940 +-NRMTEASLQRIASNYELLAgqMQVLTGAFYKRLFAAMPEAQPLFR--IDI------------DLQSQHLAAALALIVRNIRFfdaLEQPLKELGVHHA-HVGVRPEQYPVVCRTMLETFREGSGQSWSPELEADWKAVLELVSRIMMDG-- +>SRR5262245_41201456 +--XMTPHQILLVKTSFQAALtqRERIAGFFFAELFAREPAMWQLLR--GKT------------GMRWPALVDGLAAIVGSIHRihsIEPVLQWLSWQGA-VRGVGEGQYEAVGQALVAALEAGLGEAFGSEHRRAWMVAVGKVADIMARA-- +>tr|A0A0N9QWL5|A0A0N9QWL5_9ANNE Intracellular single-domain globin (Fragment) OS=Eulagiscinae sp. JPG-2015 OX=1732542 PE=2 SV=1 +---VSDAQKALIKSSWAGVDLNAAGVAFLNQMEQKAHDVYAVFKV-G-----GGATSNPKAAALGLKVMTFVDEAVKGIDDMgavGGKLDELAQRHT-KYGAKKAHFPVAGPCFLDALAEVCGGRFSADARAAWSDFYDVIAQHLSA--- +>tr|C7FFW0|C7FFW0_BRASE Extracellular tetra-domain globin (Fragment) OS=Branchipolynoe seepensis OX=326992 PE=3 SV=1 +---VSDAQKAAIKASWAGADLQAAGTGFYVHLAAEAPAVYANFNL-G-----ADPH-GAKSQEQGLRVMKFVNQCVNSIDNMaivQAKIDALAHRHM-SYNVKKSDFVPAKPCFLGALADALGGKFNADARAAWAGFYDIIAAGLST--- +>ERR1719261_40108 +-------TIAVVQGTWQEIKdalgdgvAETAGVILFKHIFRIAPQALALFS-FKDCAGgnvCDELFENKTLRKHAAKVVGTVDTAVGMLKktrQADSRPGQSGQEAR-GLwggagalrcgrgGVVGDAVGRVGRRVYDRGPRGLGGGLRHHQNHN-----DRQELRLHGR-- +>ERR1719238_2294225 +-----------------------------LKVA----SALREFN-TLRAEGivsEQEFLEM------KAKLLAVGKDELG-RSpsgDTLETLVEAThemdssRRRT-RWtrrarraSRSPTTVGVISCQIK--------KSSTRRTTRRW---------------- +>ERR550532_3331206 +------------------------------PLF----PAAH--R-LCRPDGhdgCS---------------------VFGPDRppgE------------------APSTKDIVVTVIL--------X-------------------------- +>SRR2546430_16462751 +---------------------------------------------------------------------------------flLSVVIA-----CS-CWCRHVSSlqhdrad-------HPVGLCPGIVADWSPALSQNVGEGFQQDCSD-dG---- +>tr|A0A0P6RCU1|A0A0P6RCU1_9RHOB 
Flavohemoprotein OS=Phaeobacter sp. 11ANDIMAR09 OX=1225647 GN=AN476_12305 PE=3 SV=1 +----ASTCKALVLRSFESErmDLEAFIPLFYSNFFEAYPEARAIFPT--DT------------ERLEAKLLASLTHIAEALESserLDGILSELGQKHR-RMQISDSHFDGFIQSFIRSLATTLGPEWSDQSDEAWSQFLRYVAKRMSFLE- +>tr|B7QTL6|B7QTL6_9RHOB Globin, putative OS=Ruegeria sp. R11 OX=439497 GN=RR11_330 PE=3 SV=1 +----APADRDLILASVESQkmELDQFVSLFYAKFFERCPDTRPMFPH--DM------------SLQEEKLLMSLTHIIEALEHpakLRLILLDQGERHK-ALQINDDHFAGFIDSFTGALKDTLQEDWSEETRQAWLRFLQYVAYQMGFLK- +>SRR6218665_311178 +-TPIYAGHRDVIRRTWPIIAdqMNANGCQIFLCIFELSPGIKRVFA-FGPAMSGAQIVNHPRLVQHASRFMEAMQVAVQHLDELdtvvSPIFINLGKRHIYFEGINADYFNVFSGAILYTWRQVLGERFSAEVRSAWSRLFDFVIQHLRFGY- +>GraSoiStandDraft_9_1057307.scaffolds.fasta_scaffold3427870_1 # 1 # 249 # 1 # ID=3427870_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.747 +--------ADVIFDSWDAVKripdyDVVVGEMMFRKLFENSPSTLKNFS-FGPRFagKEESLYKSRTFEIHTKAMIKMLEDVLSMIMpDlvpMKKTLKALGARHV-TYGVRPNHYELATEALLSTLESLLGYRWTPQVEEGWKTAIGFITNTMVAG-- +>tr|A0A2C9KJS1|A0A2C9KJS1_BIOGL Uncharacterized protein OS=Biomphalaria glabrata PE=3 SV=1 +--YVTPKEKELLRSSWNIVsqDISGVGMNIFKKLFDIETDLMKLFKRMLTKGeTGQVVVDSIRLEGHATGVLRQIGLVVENMDNnsaLTTTLIALGEVHA-NYRVRPEMLPLLWPAIRDALKIACEDEFTHQMELAWKHLYDFVTCHLSEG-- +>tr|A0A1Y5RHX9|A0A1Y5RHX9_9RHOB Flavohemoprotein OS=Palleronia marisminoris GN=hmp PE=3 SV=1 +---MPNDDMRLIQPSIARIFvvRRSIGQAFYERLFERQPTFRTMFPT--DL------------RTQARTFDDMIALIVKKTGDpeaVTPVLLAIGRRYL-TYGLRPQDLRVIGEVLMEVLCAQTPGGLSPDEAAAWERSFSRAAEVVKL--- +>ERR1719321_586101 +--ELSYSTVSTVIDSWESVKrqenyAENLGRMIFIKFFDREPEAKTIFGFDGKKMKTdDEFYESRAFLAHGKHFVLILNKAFDMLGPdlemLTDILLDLGGTHRTKYGVKPEYFPVLGDALLECIEEMSDPeRFNDETKACWLEAYNALTEIMTT--- +>tr|A0A2D6RHV2|A0A2D6RHV2_9GAMM Methyl-accepting chemotaxis protein (Fragment) OS=Colwelliaceae bacterium OX=2026726 GN=CL811_09640 PE=4 SV=1 +---MTPKQNIAVIESWKKVQpiASQVSQVFYDDLCEKHPSLKALLG--EELS------------SARDQLVAYLNSLVETLVATdevv-I--EDL-AKH-LRIGLAPEQFSDVGPALLTSLEIGLEKDFTATVKRAWTALNKLIVAAMAQ--- +>tr|B7J6S4|B7J6S4_ACIF2 Globin domain 
protein OS=Acidithiobacillus ferrooxidans (strain ATCC 23270 / DSM 14882 / CIP 104768 / NCIMB 8455) OX=243159 GN= +----MAINIQLIQSSGAAVkdLGVQVAEHFYNYMFTHFPEVRKMFPG--------------DMSEQRVRLFNSVILIATNIDTmevLVPYLKELGIGHI-KYDTRPEHYPIVGKSLLNTLKHFLGAAWTQEMAESWIEAYNLASTVCIEA-- +>tr|A0A1Q9NIM3|A0A1Q9NIM3_9ARCH Bacterial hemoglobin OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=vhb_2 PE=4 SV=1 +--SLNTKDIQLIKNSWEKLteNKKEVRNTFYTGMFEDDPKLKSLFRE--------------SFLSWD-NLPDSFEFMFKHLENlegEILEMKRLGLKHK-TFSVKPKHFPIGRKSLVKTIKQYMGDKYTEELGAAWTKLFDYMSHYMILG-- +>ERR1719419_74415 +--PFTPEQRTLINETWGNISTKEtgsmgmLAKQVYERLFRSAPGIKRLFKD-SDM------------LAISRAFGGMLGVLVSAVNQplqFQHIVKGLGVRHQ-VYGVKPDHFRIMYTSLVRTFAQILGDKFTSEHKKAWSCLYNWVIDAMQRSMR +>ERR1740128_1504408 +---------------LGVSYlarhIVPVDVRFLKEHVKTLFVLSqR---MPGNFV-NETLETRATLLYETLLVMSNLNYWVENLDELdlvVASIQKMATNHA-GRGIMAAQFETIGAVVVEYLKAGLKEALTEEMAGSREKLISTMVSIIKETN- +>ERR1719354_333269 +-MGLEQSDVEAIQRSWEIVKetakLRVHGVNFFEMRFEMIPDWReKYFSHMGP-------KTSAKFRSHATMIMMTLDSWIENLDDLdlvVDAVLRVGQTHA-DRDILSPQFVEINKVIIVYLETGLGDKFTEEMKESWIKLLDTVVTIIKDGN- +>SRR5215207_9441599 +-----PEQLALVRGTASIIDavGDSFAERFDDHLFARYPAARRLFP--DDT------------TTHRGQLTDEIVFLVAAAADlhaLLERARALGAPPP-LRRtrrrlparrrgTRRRGRGRRGRSVVGRNG---G-SLA----------------------- +>SRR5690349_3556304 +-TYLTGQQVLLLKKSFRQMNPAQIAAQFYGTLFQQHPEVKSMFPA--DTV------------ELGSKLMSVFELVVFSFDEKehgrfglqdvlIKPLRALGRKHD-DKGVKPEYYEIANSLLLKIMKE--SEYFTTEMYQSWQLALEHLTYAMQDK-- +>tr|A0A2A4JK54|A0A2A4JK54_HELVI Uncharacterized protein OS=Heliothis virescens OX=7102 GN=B5V51_782 PE=3 SV=1 +-SGMTLKDVYNVQHSWKTINanPLDNGYLMFFRLFEVNPESKTFFKILDNARTETEMRDNVRFRAHVLNIMAALNNSIENLNKpeiVVVWMEKLGTAHR-RSHVQERHFLIFKDVLVNILKNDLK--LSEAVVKSWGRYVTFIYSYILP--- +>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9902871_2 # 1417 # 1767 # -1 # ID=9902871_2;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.538 
+-----ALDTKLIKDSFELAKpiSDKLVKRFYENLYSDYPQSKSLYLD--G-----------QLPESQLAILKAINFIVDNLHNkekLGTFLKTLNERYE-LRLNDSVINQSVCSSFLKTLSEAFGSDWTSELAEQWELTYQMVTSFFQDSK- +>OM-RGC.v1.013389558 TARA_082_DCM_0.22-3_C19717715_1_gene515718 COG0552 K03110 +---WHGESVTTVQRSWARIQqlgLENCGTLFYNTLFERWPEAKQLFSLSvrlkhrapgESEREGPDPTNSPALRKLWGKLLSVVGSLVSGACNpaeVVPTFHAVGVRHA-GYKLKVAHFDAFGGVMASVLKHLLGEEFTTEVQHAWTLAINFLTANIRAGFV +>tr|A7C4X7|A7C4X7_9GAMM Bacterial hemoglobin OS=Beggiatoa sp. PS GN=BGP_4395 PE=3 SV=1 +---KQHDTIFEIQSTYEKILphLDEFSRLFYQQLFEIKPAFKILFRQT-DL------------RIQKQMVIRMIEVVVQGINNlenFMSIIQRIHQRHY-ELHLKPEDYRLAGQALVLSLEKYFGDEFTPTLKKIWLDFYESIVATMMN--- +>UPI0004291969 status=active +---KQSDTVFLVQSTLEKVFpqLDEFTNQFFKKFYELDPSVKEIFYEI-DA------------KNKKQMVVNMIGFLTQGINRfdvIIPSIKEINERHF-GREVKPKYYLIASKALVNVLEDYLGEDFTPEVKQTWIEFYEQIVNFMEA--- +>ETNmetMinimDraft_35_1059890.scaffolds.fasta_scaffold55614_2 # 1284 # 1421 # 1 # ID=55614_2;partial=01;start_type=GTG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.623 +---KQSDTIFLVQSTLEKVFpqLDKFTDQFFEKFYQLDPSVKKLFNGV-DS------------KNKRQMVVNMIGFLTQGINRfdvIMPSIKEMNERHF-GRDVKPDHYLVAGKTLVNVLEDYLGKDFTPDVKQTWIEFYEQIVHFVED--- +>ERR1719506_1011120 +-GPITAREGQIVQDSWKAVKkvGGESGHAvikdIFYQHLLKDPNVKQLFRN-------------SDMKLQATKLWQTLHVAVDGLSTsgpWFLCCRIWARLTS-STGSKRS------TSMPWVRRsSTrspraWGPRsrrssrWRGRKCTAWLLRRX----------- +>Cyp1metagenome_2_1107374.scaffolds.fasta_scaffold42158_11 # 5761 # 5952 # -1 # ID=42158_11;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.578 +-RFLTVAQQNEIIATWAIIKeshaSEAIGMDVFKGLFISAPETFDMFDSFKKDP---DWQNNVHFKHHCKVVINVIGSFVLLLNQpekLISHLEFLGVKHN-FMTITPLQFELLGAELLKAFNKALGARYNSLTKKSWTIFYNKIAEVMQTN-- +>SRR5688572_5289639 +--TVTPDRQQLIRDSWRALEpnGPRLVELAFLHLLQIAPAARPLMTG-HSL------------PCVCRNVASILDQLIAALDEpkqFVPLAIGLGRSNP-GHGINAALYPAMGEALLWALHLQLGEGLTPELQTAWLEYHHLVSAIMRRA-- +>SRR5690349_12423264 
+--XMTPERQQLVQSSWRKVEpnAARLVELAVLHLVSIAPSVRSHLDG-ATL------------PLLCQRIAAILGRLVETLDEpkqFVPLAISLGRENP-DRGLTAKLYPAMGEALIFALHLQLGDAFTLELQAAWLEFERLATAIMQ---- +>SRR5215467_4845699 +--------------------------------ALTWPLRR-------------------------RCWGKLLWpswiiwkmCPGCSRPSrswAPSTLGM---------VLLPRCTTGSADALVATLAKPNGEQWTPAHTDAWGEAYRAIVAMMLAGYP +>SRR5262245_32871681 +-------DPQILRETLELTLaaDDSFPKRFYDRLFTRHPEVIPMFHR--NSP-----------GAQRKMFAQKLIMIVDHVEDpawLARELRTVAQSHV-RYGVRPEMYAWIGEALIETLRDACDSDWSESAERAWRNAYTKIVESIFEV-- +>tr|A0A1C4TW82|A0A1C4TW82_9ACTN NAD(P)H-flavin reductase OS=Micromonospora haikouensis OX=686309 GN=GA0070558_10167 PE=4 SV=1 +-----RAVSADLGPSWAATAaaVDRAAANFLDTVSDRLPGLLP--------------------ERDHTVVFAALGRLAGGVDDtagRAAALAVLARAHR-GVGLLPQHADLLGDALLAAVARENRAHWTAALATGWERGLRRAVTAVRRA-- +>tr|R4LFD5|R4LFD5_9ACTN Globin OS=Actinoplanes sp. N902-109 OX=649831 GN=fhbA PE=4 SV=1 +-----GMDPaddaalnEvrrLLGNSLSMAGgpME-VAGRLRAALAQAQPTLFATLPG--GP------------VAQVEQLAEGLTWLIHHVDQppaLVAGFGRLGMALA-ECGVAPQQLQLAGAALAEAMRAGmAAHGWRQDFDQAWRSTWQHAYEWIAHG-- +>tr|A0A1H7FRI4|A0A1H7FRI4_9ACTN NAD(P)H-flavin reductase OS=Nonomuraea pusilla OX=46177 GN=SAMN05660976_00171 PE=3 SV=1 +-----MLGFQRVRDNFELVAkyGDGVPLYLFSDLFLRVPQLREMFPV--NM------------RSQRERLMGALAFAVEHAGDlaaITPYLHHLARSHR-KFGARPEHYAQWSVSVVNAMRRFSGSAWDDELEREWRDFLTAVSQVMIDA-- +>tr|A0A210PV81|A0A210PV81_MIZYE Globin OS=Mizuhopecten yessoensis GN=KP79_PYT16126 PE=3 SV=1 +PLGLTERELKMIKVSWDVLAedKKSNGVKFFMTLFTIFPTSKDLFKHFKDVPLDQLKydgettKSNKKMVAHAMSVMYALESYVDSLDDaycLEELVKKVAISHK-PRGIGPDKFKLLTPVLHAVIEDLVKDDDSvdlETIKSGWTKLIDTVCDIVEK--- +>tr|A0A1L4CYV2|A0A1L4CYV2_9PROT Uncharacterized protein OS=Silvanigrella aquatica GN=AXG55_04100 PE=3 SV=1 +-----NIDIQIIRDSFELTKpiGDQIINRFYENLFLEHPELKEFLSR-GDI------------QKQKEILLNTLVTTIDNLDKpesLSSFLIHLGEKHL-NYNMIEMYNDFIGRNFIKTLSQFLGRYWSDELNRQWNEVYKFISLNLKKG-- +>SRR3954469_16801024 
+-------NYALLRNSFEKLKpvAGKVAERFFDILWNDYPETRDFFKN-TQM------------GPQKFAFFQALVFIVENLDQpesLESYLRGLGASHS-AHGVKKEYYGWGCAALHKTFAQTFADEWNDTLSFEWTKVFAMITSLML---- +>SRR6266851_5623532 +------------ACTSPSVRstT-------------------TCAG-----S------------TRNSGYPAGPnSPTHStriSHDTRTDrigpkLIRVHRRRRA-RDGVRPRHYRSAGDALLGALAAHLGSDWTPAAESAWRRAYNLVAEIMIA--- +>tr|C3Y526|C3Y526_BRAFL Uncharacterized protein OS=Branchiostoma floridae GN=BRAFLDRAFT_98913 PE=3 SV=1 +-TGLTPTQSRLVKESWKMFlsKKRENGFVIFRVLFTDYPVTRKLFKGVEQldLDAPGQLESSITLRAHVTRFMHSFDTYMESLDDpedLKQLLYDTGKSHL-IHDIKPEYFDVLETVLMKSLRIVFGSKLTPQLEEAWQTAYSHLKVTIKQG-- +>SRR5271166_2850757 +--RWMRPKRNSCARPSPKSRrsPIKAGAMLYEKMFALDPDLRRLFA--IDI------------ETQGAKLMAVFATAIANLHRldeILPTVRELGRRHV-AFGVKDRDYDTGGVALVQTLEAGLGDAFTPAVRDAWMACYEAITGEMKA--- +>SRR6478735_6705068 +SPSLTREQKRHIRETFAIIEpaSDLVARLFYMKSVDLDPSLGVLFKS--PN------------RVQRRKFMAAMKVTVLSLDRlqsLQPILKLLGARQR-EEGVTPGHYETFQDAWVWTLEQALQARFPREAKDAWSSLLGEMTAPQRPR-- +>tr|F2Q9X8|F2Q9X8_BRAFL Globin OS=Branchiostoma floridae OX=7739 GN=lGb13 PE=2 SV=1 +--PLDAWQRFYLQKSWKTVArkSDQAARTVFLRMLQDNPGLRQKWPRISLL-TEEEIPTSPYIKFLGERIFDCLDYIIDNLGDLDhviSELTKLGRQHSDMNVMTPEDVWAIEAAFLAGVQECLEDRFTIKYEEIYSRFIVFVIETMVIGFD +>tr|A0A226E0J1|A0A226E0J1_FOLCA Hemocyanin OS=Folsomia candida GN=Fcan01_14017 PE=3 SV=1 +KVQLTPDEMIAIKRNWEVIHqdLTGNGMDMYLHWFAAFPHMQKVFKKFAQVP-RDQLKTNDAFKAQATVTLHWIDDMIEAIDSpsdMAAVMKRLGRMHQ-TRHTNIYDFREMVKRIQEVIGTKVGEGYTPAAESGWTKLFAKLVENIGD--- +>ERR1700732_4531564 +-----ASPNGRRNSARASmlISsqPIRRSPRFSATTW-----------------------------WHRPRC-SCSLWVRSEVNRmeeLGGGLCALGERHV-DYGVKRADYNKLASVLIQTLKEFLVDEFTVELQHAWGTVD------------ +>SRR5258708_12476517 +---------VLWEWLVDVGGarWRWFGGRLLEIFLETSPELRSLFHK--DI------------AQETGMLEWMLGSLVKGLNRlleIEGGLRALGRRHR-DYKIDQADHEKVLRALLLTLAEFVGDDFTPQVSRAWKTVYGKIPDTMTDR-- +>SRR5882672_7954690 +-----------------------------------------------------------------------------------------------HYGNANRYQGVRPSRCIpGESSR-----HRPHGASQPSVG-Q----------- +>SRR5215469_12962076 
+-------------------------------------------------------------------SLSARAGRQAGFGl---SG-----------LGSAAT--taiPTPSTSLTGSTARTTG--cSAPYSR-----TGT----------- +>SRR6266704_5570200 +--GIN-----KTPGMFEKISssMPLGRVA---TVDDIIPFISFLAS--DD-----------------S---KMITGAEAGGNs--fVLVLTNLRNIH------------------------------------------------------ +>SRR5205807_5077868 +---------------------RVGHGRVYPRLYIIARHAAGIYAL-TRP------------VAKPgRPRPVCLVPIHKDIA--vmrVTTDQLLARTPL-GrFGEAAevgqlVHYLVSDAA------RFVS-GATVTIDGAWTAYGGWALR------- +>ERR1712137_931585 +-------------------------------------MGTSLLG-VDCE-GEEFVKT-DSFVPQAKKFIGLCDSFIDMLGPdaelMAKILEAEGRKH-EKLGIKLEHYSTMGEALISGVKTL--DeKFNDETELCWKLVYCGVTNNLGKAN- +>SRR5437868_6667390 +--------------------------------------------------------------------------------------REIAASD---------ESEGVGDAEI-------DERRSNRLGDVHRSALGprpvtvrdnhgtrtaVKEGSIRRGV- +>ERR1740124_2148144 +----------RTRGAAALLLqgrAQPCGVAQAQEACYVCDEHCRCCSQ-GSgGP---QQacarATGPPAHMPYA----THRCRVCCRIGiraRAPPTQALGKRHV-PYGVLPAHYDVVGQALLATLEGGLGAEWNDQVKASWTAVYGIIAKTMIG--- +>SRR3954451_929548 +--SMTPEQMQLVRLTLAQAtaDPLALGRDFYRRLFVLAPDLRARFH--GDID------------AESLKLKETLTLAFGALTDmrlLVATLDGLAKRDV-ARGLSEQHCRAIAQSLIWAIERRVGSDFTHQVCNAWIAFMAVAMTCLHG--- +>SRR4051794_5741567 +--SMRPEQMQLDGLTLADAttDRLARGRDFYRRLSVPAPYLRGRCD--GDVD------------AESAKLKETRTLALRMLGNmrfMVATLDAMAKRDV-ARGLSEQHCRAIAQSLIWALERRLGAGFSRQVCTAWTEFLAVVMTCLHG--- +>SRR6516165_10653891 +--EPSPNQLHQNRPD---R-RPGGGTLLWPPLRDGSR-NPGAVL--QRR------------GRTGSEANGRSCNRCEQSRRFrgdRPHRTRS----C-KAPRRPEHYALVGSALLWTLEQGLGDEFTPALRAAWAAAYCALSEVMIA--- +>tr|A0A1X7UGV4|A0A1X7UGV4_AMPQE Uncharacterized protein OS=Amphimedon queenslandica PE=3 SV=1 +-MSLTSAQVALIESTWKVVKkdLQGAGNIMFLKLFQIDVSVRDKFP-FRDVP-YEELEDSESFLKHSLQVMETIDLAITLLlGGemekLVEALVDLGMAHA-MQGLKPEDFDHVGEALVHALGVALGKEFNDEAKKAWTLLYSVVTAKMKEGL- +>SRR6266699_274039 +-------QGELLETSFQAIVlhGEAFVTAFYERLFTRFPETRAFFAA-TDM------------LEQRKKLQQTLALIVQHIQHpevLGDMLQELGQRHV-TYGIRPEHYPSSERCCWRLSPTFSGSTGRRRTTMPGSRGMRQSAAX------ 
+>SRR5438045_5489985 +--------LITRPTSYYLLSlhdalpISLLADVFYSKLFVKNTGLRKMFP--ADL------------QLQRQKLMNMLHFIISNLDQpelFNKEIEGLGLRQD-RKSTRLNSSHLGISYAVFCLKK------------------------------ +>tr|A0A1E3GPU1|A0A1E3GPU1_9GAMM Bacterial hemoglobin OS=Methylophaga muralis GN=vhb PE=3 SV=1 +-AKLQEQDIALVEQNFAVLMefSDALAERFYQRLFTEYPEIMPLFKS--V-----------TIEGQHKKLLASMVLLIQHLRDtemIEDYLQGLGARHQ-QYGVETSHFEMFIENWLSVVAEFADQKWDSKLQQAWRNVLEYVAELMQSPT- +>SRR3954464_793235 +--------VDPFRSRFAFGVerEPEVTHRFYDVLFAKYPQVQPLFGR--RSR-----------ADQERMLRDMLVAIVDHVEDppwPQHHPPPPPPNPP-RPAPTP---------------------------------------------- +>tr|B7QBW9|B7QBW9_IXOSC Beta chain of the tetrameric hemoglobin, putative OS=Ixodes scapularis OX=6945 GN=8038954 PE=3 SV=1 +-TEMTSQEKHVVRDTWAIFKkeVQTSGVAIFVVLFFKHPAYQKLFVAFAADP-IAELPQNPRAIAHALTVAYAITSIIDTLDEpetSAELVRKVATNHVRHPTISGAQFEHMGQAVVEVLAEKLGSAMNHQAVGSWQKFFAFVVRVSQGVF- +>tr|A0A1B6H4C1|A0A1B6H4C1_9HEMI Uncharacterized protein OS=Cuerna arida GN=g.19114 PE=3 SV=1 +MRRLTEREKENVRLVWKKVedDYPSYGRSVFVKLFDEYPYFKKFFKATIG--NFEDPFMSPRFQKHMLQvLMPTFGGIMDNLDFpeaVNEAVKRLAVSHR-KKELGiaKEHINILGQVIVSVVKRDTL-GCTEEQEEALEKVISIVMAMFC---- +>SRR5215813_3453690 +------------------------------------------------------------------------IASDSEIQVspwtrt--GTLAISARRCS-SSRISSGigsdtTFSLYGNCV------------SSSATIAWNTHGD----IQLDS-- +>SRR5579859_1863727 +-------NISSLQLTILNLLtvEDEFVPRFYNNLFNMYPLARSLFVHTe--I------------SLQYNKLRLMLMMIIRTIHDadgLKIQLQQLGQRHK-YYRVEPEHFAILYIVFVQTVVEYLGPKWTAELEAAWAEAYGTIVRMMDME-- +>Dee2metaT_7_FD_contig_123_47857_length_200_multi_10_in_2_out_1_1 # 3 # 200 # -1 # ID=100007_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.434 +----------VLRDREG---lgDPELVVLQRRHLAEHGAILQPLalLARQr--H------------REDLELVRELLLLECDHRVEhprahpaGVGVEGELGVGHH-TERIKRSlspsalLGRWIDLVVVGAVRR---------------HHQGGVVDLRLVE-- +>SRR5436853_3450426 
+--------PVLLKDSFNLVRseEHTSELQSLRHLVCRLLLEKKKKnkTTTV-----------NYIE---KEKLGKLEA-SCPVEqti-------GIGDKQR-DYQ--QMHHPERTEAQ-----KX----------------------------- +>tr|A0A1W2WRJ7|A0A1W2WRJ7_CIOIN cytoglobin-1-like OS=Ciona intestinalis GN=LOC100183004 PE=3 SV=1 +-MPFTDEELKLLRNSWDEVKklgMKEVGLHIFTGLLNAAPSLRTLFYTI-DLPDEeeltiDVMRENKKVVAHATRIANAISKFIKFLDQpeeLEKLLTSLGESHA-RRQVDPESFEYVAPVILSVIGGHLKLPSNSPTLQAWVKAYGVLRNGIVS--- +>tr|A0A1W0WQD3|A0A1W0WQD3_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_08524 PE=3 SV=1 +-TGLKKRERLVVQQTFEAIsKklgRAVLGRDIFYLFFQLHPAYLQLFKALRDIP-PEQLKTHPRLKAHGLNAIQALAAVIENLEDTettVLLLEKTGRDHV-RRKLQSKHFEDFHSTTVALLKRELGPSFTPFVEQSWNKAFTVVNTVIL---- +>SRR5438034_562795 +-------AVETLRNSFERVIerSPNLTRRFYEILFEKYPQTRRMFGL--QS-----------GKGKGNGKGAGARQRLRRChcrlhfgkekaTVvpfPLPVPVPLPAFRD-SYX------------------------------------------------- +>SRR3954466_4238475 +--------IRRLTRSYDQILsaGDCLPELMFAQLFDRAPELRTLFPD--DM------------GRVKHQFARMLHWLIAHLHEpqkLRIALVDLGRRHQ-EYGVKPDVYPHLCEALVDAMATICADDWNEELCRDWRQTFDLMVHHMLRAY- +>ERR1719359_2370951 +-------------RLIVTPEhldGCRAGLLALRVVLLHLGEGLGLLG-SDSSGVSdcgVALgeL------------PLQRLDLLGVLLGpr----L---GL--L-NAGVRGLELSLLGRLlrvglselfVAEGLLLGL---------------------------- +>tr|A0A212ELK8|A0A212ELK8_DANPL Globin 1 (Fragment) OS=Danaus plexippus plexippus GN=KGM_200313A PE=4 SV=1 +-SGLSRRDVFAVQKSWAIVYanPLANGSELLKSPYISRIL----ILLVDKVS-EI----------------GSIVKAATDVE------------------------------------------------------------------- +>ERR1719343_803772 +-----------------------RAVDCSFDFSRKSPVPRPSLA-SAKKDfngDANSVYDSRKFLDIGKNFIEIVDQAVDMLGPdlqvVAEVLIDLGKKYHNEYDMRPEYYSVLARALIDELEEILGTDkFNTRTKSCWVQVYGAIAADIAA--- +>EndMetStandDraft_7_1072992.scaffolds.fasta_scaffold3604113_1 # 1 # 288 # 1 # ID=3604113_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.538 +-NNLTDDQKNVIKKTWITIEenRTKIGKQTFIRVFELNPQIKKMMPEFMTADPIEELNSSRKLFGHSKTLMTCLENAVKSLDDnerFVAYLVELGRRHQ-VRPLKAPYFEVIHEALMFSLKDVFQSDWTTETSESWSALFRYMSEAMIIGL- 
+>tr|A0A136A626|A0A136A626_9ALTE Uncharacterized protein OS=Paraglaciecola sp. S66 GN=AX660_04410 PE=3 SV=1 +-MILTVEEKSAIKESFAVLLRenANVAECFYNNLFELAPLIKPLFKS--GR------------ENIENHFHELIGTAVNKIDHfndLRADLIALGKRHK-IYGAQQAHFAVVKAAFILSIQYKLKGQCSPFLENSWAKYIDNISSVMIEGL- +>ERR1719461_1916292 +------------------------NV-SLFSLFAADPGVQtKYFGHMK---------TDADLEKHGVRVMNSIGAMVRAILDqdddrLITKVHEITRNHQ-PRGINRPLLEFFLSVVLDYLAKALDSHLSKEGGA------------------ +>ERR1712179_865199 +---------------------------------------QrKHFPHMM---------NssigksltKSKLKIHGGRVIREISVMVDCVQAgndeaLMAKIKEITVNHG-VmRDImSIEAYRLVLDGLVAFLGSALGDSLNETGHHAWKKLVNNIITGID---- +>SRR6266699_3297184 +----ALARGSLATPCFRSHRAqhFQARMpykPVGSLEAARQHAREGLFRS--DME------------RQYFKLMDMIAAIVGTLDKremFQSIISHSGRQHA-QFGAKPLHFAAFGDALIWGLEQQFGAAFTPEMKEAWIKLYDDVQREMMR--- +>ERR1719271_149007 +--AVSARERRLIERTWEKAKedgCDALGANLLQTLLVAEPQVMQLFP-FKDE---ENVYESLRFKAHASKLAVIIDAAVSLLANpvkLESLLISVATSYEYsFKQMLPEHFPLLGEALIRTLTSIVGgTKFTWQAESAWRKVWTIISTVMIGA-- +>ERR1719203_2782565 +---------ITSKFGWTSNmq--------------KIIQSQTHSKT-QDMQ---RDYYLNQK-KTLEI---------------nvRHPLMKELLRRVE-----DNPEDKVAKdMATMMFNTATLRSGFSLKDTVNFAESIELMMRQTLG--- +>SRR4029078_13512293 +---------------------vKRVAAELfYVKLFELDSTLKLLLA--D-Q------------QVREQKFMQIVDATVNGLEHsegMMSAVRELGIRHP-LFGDSDEHHGPVATSLFWSLKKCLRKDFSGEECPRAVGGHALC--------- +>tr|A0A147B4Z8|A0A147B4Z8_FUNHE Neuroglobin (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1 +MGELSVKDKELIRGSWESLgkNKVPHGVIMFSRLFELDPALLSLFHYSTKCDSKQDCLSSPEFLDHVTKVMLVIDAAVSHLDDlhsLEEFLLNLGRKHQ-AVGVSTQSFTEVGESLLYMLQCSLGQAYTAPLRQAWLNMYSIVVAVMSRGW- +>SRR5262245_48005872 +----VSMHTSPLRASVELVEqrRSEAVRYFYAHLFAGHPELRTVFPI--SA------------VEEHDRLFTALLYVVKNVHAlpmLAAELQQVGRDHR-KFALSAEHYQVVGASFLATGAAILAEAWTSEIGSGWQSAYRMAASVMSD--- +>tr|R7WMM5|R7WMM5_9NOCA Flavohemoprotein OS=Rhodococcus rhodnii LMG 5362 GN=Rrhod_2088 PE=3 SV=1 
+--IFDDRTLRRVRATYKDMAArpdwdSHLAQSFYANLFAENPQLRLLFPA--NL------------EAQTHRMLTAIRYVLDNVEQpdrMLTFLGQLGRDHR-KYGVAREHYEAGGRALLQSLRGSLVtLLWTPTVDAAWSEVVGTIVGTMAD--- +>SRR5258708_3005780 +--EPTPTDITIVSDSLAPLTkeqVDNVLAAFYHQLFTRQPSLRQLFKSFRSGDQ----PDQQAMKLQRNKLAEIIALGLKLWEKphqLIPALEKLGRQHH-QYGVRDEYYEDVWIALSEVLSEAFGLDRWEDICESWQRFIFLCARHMLNG-- +>ERR1719347_1330150 +YFCLSESNIKALKSCHPHLkdRKEEFGHLFYSNLFSNHPDLKSLFDQ-TEE----------GRQLQAQRLADTVVAFLEKCDDlpsLLPTFKKIGKRHT-TKGVKPEMYQIIIDNLVDTLEEMLGKeVFSAEVKQEVLESISFLSNAFIK--- +>ERR1719284_1036555 +----------DVSASLDLVKrlpnYeQVVGVRLYQKVLAAGPQYVKMFP-SVASsltssNDPEEFLKDPVLLKHLTSYIRMICMAVDLLGPdtelFEEQVRELGAKHS-EYGVSQRYYVVMGKALIQTLEELLGDRFTPSTKQAWEKMYDLMSSTMIKG-- +>SRR3974390_2763688 +--XMSPETKELLETTWAKVIpiSDVAAGLFYERLFTLDPSLHRLFEN-------------ADMKEQRRKLVQALHAVIYSVDDlpsLIPTLEILGRNHV-RWGGIGGTPRDLGGQSHPEAVGRI-----PNIR---IVAVAvGRPDIMLV--- +>APLak6261669570_1056073.scaffolds.fasta_scaffold275140_1 # 52 # 198 # 1 # ID=275140_1;partial=01;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.524 +---WSTRRVKVVQRSWETFKstqaeSTTVGLAVFKRFLRRSPAFLQLFP-FRDQP-LETLFLNAKVRLHCKLFADTVSRTVGLLGDsvaVKASLRELGARHSDLYKVRSGHYAAMGSALLEVLEHNLGESWDEETKTAWEETWAYITEQMQKG-- +>ERR1035437_6084348 +-SSLDQEMIAIVQVSWENVTPDsrLAASMLAMNLCADDRNIASLFEE--DR------------IKMSRDVMQAVSCIVADLDQpetLVPYFGSLGQLLR-RHGLHESGQQTFATALFLTLGQLLGPRYGPVEHNAWAIAYSFVVRIMIAE-- +>ERR1035437_3078414 +-SSLDQEMIAIVQVSWENITPNsrLAASMLAMNLCADDLNIASLFEE--DR------------IKMSREVMQTISSIVAGLDQpetLVPYLGSLGKLIR-RHVLHESGQQTFATAFFLPLGQLLGPLYAPVEHNAGAIPX------------ +>ERR550534_521252 +-TSFKPNEIMEMRVMWNGWvggDMASRGFEMFCKMFEMHPETKDVFA-FMKGSSVAQMQSSSKVLFHVTRVMKYIDEVMRHADRLdevVPILRQVGGRHGTqGYNIQSGYFPFLGNALRQLLKDHFKTRYTAVLDGHFQKMWGFIVKQMQAG-- +>ERR1712105_94955 +-TEFKPNEIMDMRVMWNGWvsgDLASKGFEMFCKMFEMHPETKNVFA-FMKGSSVAQMQSSAKVLFHVTRVMKYIDEVVKHADKLdevVPIMRQVGGRHGThGYNIQSGYFPHLGEAQRLLLKDFFKDRYTANMDAIFKKLWVFIVKQMQAG-- +>ERR1719483_559503 
+EGPLLAKDVKAIEESFAMVAalgsAKELGIGFFRLLFTTYPEWLEkYFvPNFGDKP-LEEFLMIPRFEVHAPGVIVELSKWVGSLHDldsLVAAIQENARNHY-RRGLNVDHYKKIAGVLLSYISAGLGDSLTTQMETAWTKFLDTMVNVVEEEM- +>tr|A0A195EH31|A0A195EH31_9HYME Cytoglobin-2 OS=Trachymyrmex cornetzi GN=ALC57_03526 PE=3 SV=1 +-LGLTEKQKKLVQNTWAIVRkdEVSVGVALVIAFFKQYPESQKEFKSFKDVP-LDELPKNKRFQAHCINIVATLGKVIEQMHDpelMEASLINFTEKHK-ARGQTPEQFENLKQVILAAFPSLFGKQYTSEVQEAWKKTLDLIFSRICQ--- +>tr|A0A158NI97|A0A158NI97_ATTCE Uncharacterized protein OS=Atta cephalotes GN=105620364 PE=4 SV=1 +-----------------------------------------------------------------MNIT--NGTIHDILSGgkNTQKV--FL--FR-HRGRTKEVVEKEEKIRVAGLDtngshradCPKGTDEGREIGDPVTDSLLQMLQKKEK--- +>SRR5690606_21296714 +----lmEWERVKLVQESWSSITpL-gaKFTQVFYRKLFDEHPAVVGLFPE--SM------------AEQEQLLSRMINPAISCLPAesvFENMMHKLGNRHS-EYGINEKHYRMFTQSLLETIRESLAERWTDELESAWAEVLSGMSRRMN---- +>GraSoiStandDraft_11_1057310.scaffolds.fasta_scaffold26797_1 # 22 # 990 # 1 # ID=26797_1;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.733 +--VIvTDSDISGCFSCWQTVVdGkapayiEdsdpnkpsglvWFSNVFYGRLFDVNPEAKKLFRD--NN------------ETKARALGNIISTGLRQIWDranFSKILHGIAVSHC-KLGVKAIQYGLVGDVLLWSFAYTMKNMWDQDLRTSWIAV------------- +>SRR5690606_23735845 +-TSFVSLNANVLQRSFEFLApqSDRLAKRVFEKLLKDYPQYRPLFAKV-EI------------VDLRQRLIQSLALVVKSAQRpetMVRYLSELGIRHA-EYGITDNDYRPFTSVLLGVLAEFSGARWTPEVKTAWEEVX------------ +>SRR5215469_11104805 +--TGVAEQHLLDLGGVDVLP--APDDHVFDPA--GDPQVaaviedAQVAGV--QP------------AVWIDGFRGAFGHVEVAEHGLvaarADFPG-LAGRHG-FPSDRV----------------------ADGDLYL----------------- +>tr|A0A2T7P4Q7|A0A2T7P4Q7_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_10992 PE=3 SV=1 +-PSLTADIRRVVQQSWYRLvehrSLDQLGIPVFLEIFHLTPAAKKLFH-Y-SeKTTIEELEGDRRLREHATRFMNAVGAVVDNLDKknsddLDVMLREMGADHTNISTFNQVYCVIFREALLSVWERNLGKaRFRGELKNAWRALITYMMEVMREGYD +>SRR5438128_5040868 +--------------------------------------------------------------------------------------------EY-RWAEGSSelaaEFVRLNVDVIV-----TGRLPAVAAKQADIRHSDCVRDSCGP--- 
+>WetSurSiteA1Bulk_404760.scaffolds.fasta_scaffold823987_1 # 3 # 239 # -1 # ID=823987_1;partial=10;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.409 +----------------------------------------------------------------------------------MPN--------------------DSDSCHSVDNSAILHAVLDSAVDGIISIDESGTMESVNA--- +>ERR1711918_283694 +-----------------------------------------------------------GSECSWMCRC---GIARFEQT----RTTSHKSRRA-TYRvqPDRGILAHPGESCDDHFGGAPWGGLHPEVENAWNVVYGFPSSIMISGPR +>SRR5262245_16285966 +---------XMVEGTLDAVSLPALSADFYRRAFDTDPELARMFTA--DR------------RVQEARFATELAAIVRSIRchdEFVPAGRALGPVPR-L-RRDGRPLPRDGRRPAGIagrcprsdvearGGRGMAPRLQPDRRDDAERRPRAGQLGVTSG-- +>ERR1712061_521749 +---PVGHMKTAVEQSWERVQalgPVVIGAQEHRDVAVVSRTTST---TSTRI-EESDATAAGSLANPF---------------------------------------------------------------------------------- +>tr|X6EW29|X6EW29_9RHIZ Adenylate cyclase OS=Mesorhizobium sp. LNHC209A00 GN=X738_26865 PE=3 SV=1 +--------FALAQRSVGLLLddPSAFAAQFYANMFAIQPELEGLFVN-G-T------------GAQGAMLSHMLRTVVSGLERRkhvPAGLQTMGRKHI-GYGVELDHYDSFRGAMLKTIDDIMGAGLTREIEESWSETLDVILGLMKKG-- +>SRR5215471_14715706 +--------PAGGPALARLLRr-------HLRRV--VSSRLAPLFLR-LAF------------NDAISYDPATGSGGANGSIRLpeeLARKEVAGLARA-V------------------------ERLRPVKE------------------- +>SRR5205085_9494957 +--------PASGPALSRLLRrhLRCVVTsraapLFLRLAFNDAISFNPATRA-GGC------------NGSirlaeelEREEIQVLSQGIEQLRPLkerFP-HVS----------------------------------------------------------- +>SRR5947207_2391870 +--IISNRQARRTNDRLQIELaaAQARIGLLYFAQHDRTRAAA---------------------------------ALLEGPDAFdqqRPALRAMGLRHV-AYGVVPAHYDTLATAFLWPLGHRLSPEFSPX--------------------- +>tr|N1VSG6|N1VSG6_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. 
LT 11-33 = ATCC 700639 GN=LEP1 +-----PDPILEIQKSFDHVLeyNPHWIDSYIDKLKNFSMenvTENQREGDN-ES------------PISSEEFLNSIESIIEKLGNpisVKKEVSKLANIYE-SLGITKKEFPKLLPILLSSLRENLPSEWNPSLESIWTQAITDLTIETIES-- +>tr|R8ZTT5|R8ZTT5_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira yanagawae serovar Saopaulo str. Sao Paulo = ATCC 700523 GN=L +-----KDQILELQRSLELALqlNPNLARDFYIHFLETKPEFQKFFQNT-DM------------ETQAKKLLAMFGKTIERLGNlnqIQIELQNLGKMHE-EMGIPVTDFGAIAPSLLYALEKSLGDQWNAEWKSIWETALGSLVRLMGMK-- +>SRR6478609_9341681 +-------DAELLETSLALVDTpdASLDSRFCALLHERHPAVHPGGGD--TA------------ARQAKLLRSAVISVVDHLDDpvwLTETLGDGTARPS-GWQVAPEMCGAVSECMVAAMVEIGGARWTSQMTDAWVEALDAVSGPMLLGS- +>SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold1207366_1 # 2 # 214 # -1 # ID=1207366_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.286 +-----YASHQSQAASLAKAAprPRVAVLGLrlpsgeSPQLARLGRAFAELLG--AEL------------AAGERLLVLPAeRVehMKLELGLdeaEAYPLPTLGRIHR-NLGPDLVVVGTlapqeprgtlsvtveVKDCLTGAVTATAKVTGPAAELFTLASQvggelrrrlgssalsgneraelraqrpaSPEVAQLYADG-- +>tr|V4A611|V4A611_LOTGI Uncharacterized protein OS=Lottia gigantea OX=225164 GN=LOTGIDRAFT_233216 PE=3 SV=1 +-IGFTETQIDTIRSTWPLLSrnMVRVGTDVFVRIFTEVPTVKELFSSF-NIVDVNDLHKMPTFRAHAEMFMQVLHLVVDNLETpyseLNHELMVLGARHATFSGFKPEYFKFYVKCLIQVWELELGEEFILEVRDCWKIVFDFLVDNMTEGYE +>SRR6266542_3322184 +MTVMTPEQIEAVEATTAVLapALDDLAADVYARLDRLAPETAELFTG--GPA------------AEVRGRARDDRARHPAPRRLpGacl--------------PARPPARALRGQA------GALRARRC----------------------- +>SRR5918994_1217714 +------RDiEAYVRT------gRAA------VPVFESDVLLEDCVTS--AA------------NNDWcgVSTRPRNEVWPGFKVGlerAVPVLEQLGRDHR-RFGAVTAHYDAVGASLLATLRHFFGPAWTPELHQTWSEAYGPVAKVMVTA-- +>SRR5207302_4688282 +--VVTLEQFRLIQHSWKLVKdGqfaaftaqtliadplGFWGLQLYDTLFALNPSLKPMFKN--TF-------------TQSQMLTEMVGAALGllpgildqalgeektAIDPqLIPILVDLAERHV-SYNVKAAHYGTVGLGLVTTLERTLGSHFDEQKQATCFELWSMMX-------- +>SRR5437867_13093015 
+---------------------nqnpsPLWRA---------------------RL-------------PR-------VSIAFGlrwfNCnTSkSYSRKCSTNLLNV-GYNVKAEHYGTVGLGLVTTSERTLGSHFDAQTKAAWVELWSLICTVMIP--- +>SRR5882757_3847967 +----------------------TSI--------------WPIIIN--TaV------------GirnipQDYRNVARVLRLnqFEF-FTKimvpaAAPYIFTGL---------------RIGIGLSWLAI--------------VAA-------------- +>ERR1700737_3002051 +----------------------RDF--------------HHLDLA--DhH------------Q---------HRVagTQW-AN-gsMSNAVWTGV---------------RLKDVLDRAGV--------------KSGAI------------ +>SRR3954451_23003713 +----------------------LKS------------TTGEVFLE--G--------------klv-DE-------PGpdRAI-VFQnhsLLPWLTVYG---------------NVAIATDKVFGGSGARSKSKAERHDWVMHNLELVQM---A-- +>SRR5206468_1650083 +----------------------TNA------------TMGCVLLE--N--------------rev-NS-------PGaaRRR-QGVcerQDPQRAQRMGDAqpqpradgacqgqA-PG-GDFRRYEAARRHCPRAGHATKSAAARRAVRRAGRADPRAPAGL------ +>SRR5258705_633045 +----------------------TSE------------DAGPVALG--N--------------qev-KQ-------PRtqPPV-VFLdpaLPPRPPALD---------------HWLLRAARDAGGP------QPQ-------------------- +>SRR5690606_21133184 +----------------------INP------------LHGAVRLN--D--------------aap-RV-------GDpeVGY-LLArdaLLPWRTALR---------------NVTLPLEV---RGI----ERREREQSARKVLRDVGL---E-- +>ERR1700682_1967427 +----------------------DRA------------SAGRVVVD--G--------------sev-RG-------PSldRGV-VFQspaLLPWLSALK---------------NVAFAVRSRWPRW-----SDEQVVSHAQKYLDMVHL---T-- +>SRR5699024_2544359 +----------------------LSPSSGKIIVAFSSPTSGKIMMD--V--------------ndwtSYKDSEMTALRLkeIGF-IFQeshLLPYLKIRE---------------QLEFVGREAGMDK-------KHARKRAKEILDLFGL---D-- +>SRR3954447_21976298 +----------------------RAA------------TGGVVRWS--V--------------dplvAAG-----GRARhpLSM-VFQkdtVLPWRTVAQ---------------NVGLFYALN---RD----RRAGAEGVVDDLIRLAGL---E-- +>SRR6266567_262474 +--SMTPEQIDLVRKSFDALWpfRRKLADQFYGRFFELAPDTRRLFPN--DME------------RQQLKLMDTIAAIVGTLDQreiFQSIISLTGRKHA-DFGVQTSHFACCFYPKSLEAPAHAGGFLCSSpLNVSWNGARARPYPLMHL--- +>OM-RGC.v1.004444255 
TARA_034_DCM_0.22-1.6_scaffold509117_1_gene597562 NOG05352 "" +--PfLQPTKFELVVNLKTA----------------------KALGL--EVP------------PTLLARADEVAGVGGSAKRishWPPR------------------------------------------QSRWAGLPRRPERH------ +>ERR1719401_1263416 +----------NVLTSWNTLKskpnyCDETAALIFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCTVSMLGpdlfELSGVLHEMGRRHQ-RNGVDRSYLPYMSEALFHALAKMLGPQFTEDDKEAWKGVMDYMISEMVIG-- +>ERR1719401_232394 +----------NVLTSWNTLKskpnyCEETATLVFERLYELEPKAMSIYE-LPTNVDFKTLRKDAHFKMYARYAFDTMDCIVSMLGpdlfELSGVLHEMGRRHQ-SNGVDPSYLPYMSEAFVCALSKMLGPQFTEDDKEAWEVVMDYMISEMLIG-- +>ERR1711862_565156 +---------------------------------------KIMFH-FPVNMNIETVLKSKIFLQHAKFFVKTLDITIGLLGpdtdIIQDVLLEHSKTYQ-NHGVNSAMYLHMGESILYALEKDLGDvNFTSKDREAWAYFYGTIVGVIVGG-- +>GraSoiStandDraft_1057264.scaffolds.fasta_scaffold343999_2 # 425 # 754 # -1 # ID=343999_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.636 +---RRRMDAELLETSLALVDtPdDGLTKRFYALLFERYPAVRPVFPEEmhRDI------------ARQAKMLRSAIISVVDHLDDpvwLTETLGELGARHA-GWGVLAEMYDAVTECMVAAMAEIGGDDWTPYMTDAWTEALDAVSGLMLLGYP +>ERR1044072_5206314 +---MAPPQIAVARSTGPKVSPmqQRLAQVFYERLFELDPTTRAFFGG-------------VDLRHHGLKLTETLSAGIEVLGRdgpAPRGS-----------GSGMAALRDGGGCVVHGAGVLPGPRVHDRSPGGLVGGVLG---------- +>ERR1719389_1465843 +----RCNRKLGGSAKEEKLRrndgtrfvCKI---FKISRFLKQQPDASAVFG-F-DNN-DEDVHKTPKFIDFANHFVEVIDQAVQMLGPdfelLTDFFVDLGDKHSKEYGIKPKFYPILGRVFM----------------------------------- +>tr|Q17153|Q17153_9BIVA Hemoglobin (2 domain) OS=Barbatia lima GN=hemoglobin PE=2 SV=1 +----QPANKGLIRETWNIVAGdRKNGVELMALLFEMAPDSKKEFRRLGDVSPA-NIPNNRKLNGHGITLWYALANFVDQLDNktdLEDVCRKFAVNHV-LRGVLDVKFAWIKEPLAELLKRKCGQRCTEKHVKAWWKLIDVVCAVLEEH-- +>tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1 +----KPANKGLIRETWNMIAGdRKNGVELMALLFEMAPDSKKDFRRLGDVSPS-NIPNNRKLNGHGITLWYALMNFVDQLDNkidLEDVCRKFAVNHV-NRGVLDVKFAWIKEPLAELLRRKCGQTCTDQHIQAWWKLIDVVCAVLEEK-- +>SRR5262245_28144535 
+--CVTEEQIARVRACFDELTPrtPEVVDRFLARFFAQNAPLRALFP--RDLS------------ALKQDFAAGFRHVVRHLHRldtIAPMLMDLGSRQA-RAGLTPGHFGMAREVLLTTLRDVAGPRWNEQLRQDWTEALNTVVSLMVVGA- +>ERR1039457_5537378 +---AGPLNPALIRKSLALITagPPRGAGGFSRALFSFDPGVGGLVPA--G------------DERAER----APVRR-------------AGPDRR-AAX------------------------------------------------- +>ERR1719498_600299 +--------INCVQHAWNVlIIEDRsreflraqesatfvyssciswFYSVFYSRLFNVHPLFRPRLNS--KG------------SKSGKSLVMMIATTINGLRDkdmFQRVVTEMAKNLC-SSGVKPVEYGILG--------------------------------------- +>tr|A0A2H8TS68|A0A2H8TS68_9HEMI Neuroglobin (Fragment) OS=Melanaphis sacchari OX=742174 GN=ngb_3 PE=3 SV=1 +--YLNKSQTALVKQSWPMITSNNFWTTFYINLFKRNPLYQLQFDRFANVP-FEELESNVHFLAHSFRTGFAFNTAIEHLEKpdeLHRILMDLGEKHR-KFRLTAEHFEAVKDILLCMIEDRIVLTdvpaRNILLVEAWKPCITLVIGVIM---- +>SRR5215469_6657410 +---------RLCPVSQSQMSSvvGatTSaaHRITMSPIWVSpCYSFTWLAI--NRY------------TWDRFGLMTMIQTAVENMHQldqILPAVRDLGRRHA-GYGVKAADYNTVAGALLGTLEQALGSEFTSAVRNAWIAYYQTLAGEMKA--- +>UPI00001F6528 status=active +---AIIDGLRDLSESFDTLaadeaatApaATELKaavegqfsgvfGAEYAKQTGKQPDTASYTLE---------------------HSAAALAQYHYIVRNphpLGQknKLDKV-AGEA-RYHALHARYHTMLNAYLERFGyydvflidldgdvvysvfkemdyatNLKTGPWRDSgLGRVFRSALESNDtkSTFFDDFA +>ERR1712100_346632 +---------------------LFFFFFFFFFFFFFFFFFFFFFS-FKNV---EDLYESPMLKAHGKAVVGAVDAAVHLLDDvskLIPILEELEQFHN-RKKIVAAHYDVVGQAVVNVIGSALNG-LSEEQTNAWVKVYLTIKSVMLA--- +>ERR550532_3561775 +---------------------GDSSVSPSGELCSPKTKTPRICSTVLE-----LTMHSADFQAHSGRVFGGLDTVISCLDDeatLVAELAHLKGQHDER-NIPDAYYRHFYQALEKVMNAMLGPCFNY---EAWDACGDIVFHGITGH-- +>tr|A0A1I3HEN0|A0A1I3HEN0_9RHOB Nitric oxide dioxygenase OS=Jannaschia pohangensis GN=SAMN04488095_0565 PE=3 SV=1 +--LVTNTQARLLSRSLRRISenGAPLARSFYAELFSAHPEVRPMFHS--DLS------------TQYAKFEDMLVVLVADVLNpgvILRPLQDLAKRHV-EYGVTREMYPIVGDIMMRTLRTLDAAPLTGDELEAWDVLLGRVNAFLMDE-- +>tr|A0A1Q3FVI8|A0A1Q3FVI8_CULTA Putative globin 1 OS=Culex tarsalis OX=7177 PE=3 SV=1 
+-TGLTNHQKVALIGAWSLVkkDIISHGRNIFVRFFEENPKYLNYFD-FSQDRTASEIGENKSLHAHALNVMHFIGTLIDyGLYNpamFKCSLSKLMKNHL-KRGVKKEDVTIVCGVIMKYCLEVLDQHQSTTLQVAFASLMKGIADAFD---- +>tr|A0A2M4DSC8|A0A2M4DSC8_ANODA Uncharacterized protein OS=Anopheles darlingi OX=43151 PE=3 SV=1 +--------------MWCKPthQNpegSSDYISICVRLFQKYPHYTDYFD-FTDDTKADSLVDNKSLFAQSIHIVKAFGSLIEyGLKDprlFHETLKRIARWHE-QRNVYGCDVLLIGEVMLTYLTQTLGRQTPAMLGEAFQKLFQTISYRFP---- +>tr|A0A0N8DLE0|A0A0N8DLE0_9CRUS Hemoglobin subunit theta-1 (Fragment) OS=Daphnia magna PE=3 SV=1 +-LPLNARQKYSMLASWKGISraLEPTGVYMFIKLFEEHKELLSLFTKFHQLTTRDEQANSEELAEHASSVMSTLDESIRSLDNVDtflLYLHQVGQSHYKVEGFQKEYFWKIRNPFLEAVKMTLGDRYTENIENIYKVSINLVIETLVEGYE +>ERR1719383_1265545 +-------------HSWKEVGqapADEVAREIFRNIFAIEPGALELFP-FKNES-EDDLwREGGALTVHALKVVSTIDKAVSRLGNmdaVVPMLRKLGIMHV-GPRPQHLGNG-----APMSLP--------RRPTASWRRG------------- +>ERR1719383_514948 +----------------------------------------------RGRL-VEGRwRFDSARVKSCVddrqGCVETWQHGRRR-----SNAPQVGNHAR-GLRCAQAHYDVVGQALVTTLASY--CTFTDPVKNAWIKLCGVIKATMVH--- +>ERR1712000_66502 +--------FPKVQKSWARVLeieakdeSKSFGPIFYNTLFTDFPFLKEqdFKSA--TM------------AEQKMNLPKFITTALSLLGDmpkAVDALQRLGMRHV-LYGTKDAYYPVVGANIIKTLKQILPANEFDQEtQEEWLTLYGVMQKTMIDA-- +>SRR5258708_4037766 +--------PGAVGPAPGLQPprNRPGARRGQPALMQSPSAGGPPPGP-HrpRR------------THRTPPRRAALVLLRRSLRDldeVVPGLRAMGARHV-RYGARPEHYPVVGAVLIDSMAEVAWDAWRPAYGRAWAAAFDVVSGAMLAG-- +>tr|A0A1Y3AX51|A0A1Y3AX51_EURMA Globin-like protein (Fragment) OS=Euroglyphus maynei GN=BLA29_013533 PE=3 SV=1 +----------------------------------------QKFKSFKDIPINfqqnHLIRIDKKLIAHGTYVMYTIGMLVDNLERpdmMRQMLKRLSRNHY-RRRISLKAFERLRDTLLEHLSDILGKEiFHRKTMIAWHKAFGYLLKEIESN-- +>SRR5688572_8260099 +-----DQEINIVRQTWNRLAaehGNSVAEEFYKRLFECCPHLKDVFKN--DF------------EVHGKEFIENMDHIIIQLDNpcMIREMQILGIKYA-SYGIRYEDYECMKKALFDALKTKLAEHWTPTVMVSWIWFYSTVSHIMKH--- +>tr|F2Q9X2|F2Q9X2_BRAFL Globin OS=Branchiostoma floridae GN=lGb7 PE=2 SV=1 
+-MSLSAADKKLVQESWDKVSkpsFADAGERVFLKLFRRNESTKAHFKKFKDIPS-DQLAGQAVVRDHGEKVCKVLDDFIKGLDGsGDEAVKKVGRMHK-GLGMSNEQIDQMKGAIIEVLADAgFGD---ANYKGAWGKLWDRFMAVHRA--- +>tr|A0A1B0G6S0|A0A1B0G6S0_GLOMM Hemoglobin-like flavoprotein OS=Glossina morsitans morsitans PE=3 SV=1 +YSTMNSDEVYEIKRTWEIPatTPTESGVAILIRFFTKYPSNLQKFSTFKDMTL-DELKNNPRFKAHANRIMKVFDDSIKTLDDncshLEEIWTKIAQSHF-NRQIEKQSFNELKEVILEVLVAACN--LNDQQTEIWLKLLDFVYEIIFKT-- +>tr|V5YM54|V5YM54_9DIPT Globin OS=Polypedilum nubifer GN=PnHb18 PE=2 SV=1 +IVALTEADVEIIKRTWKIPsaNPHDSAALIFSTFLEKYPHNQQKFPAFKDKPL-SDIKNTVEFRAHASRIFNVFSSVIDGLDRdtemmkgIKKIIAEVGKFHA-KKKVTKKAHNEVRSVLVDILIEVCK--LSDEEKAAWTKLLDIFFHVMFEC-- +>tr|O96457|O96457_9MUSC Hemoglobin OS=Gasterophilus intestinalis GN=glob1 PE=1 SV=1 +---MNSEEVNDIKRTWEVVaaKMTEAGVEMLKRYFKKYPHNLNHFPWFKEIPF-DDLPENARFKTHGTRILRQVDEGVKALSVdfgdkkFDDVWKKLAQTHH-EKKVERRSYNELKDIIIEVVCSCVK--LNEKQVHAYHKFFDRAYDIAFAE-- +>SRR4051794_9566520 +---------------KALVEdvAERghrrPMEVFYGARsdhdlydidtmlrmAQSHPWLS-VRPV--VA------------TGpaggPMNSLSGQLPDAVRQYGPwreYDAYLSGPPGMIR--NGVD----ALVGVGV---PSDRIRHDSVEELVAAGDX-------------- +>SRR5215470_9890699 +-----DFDRGPIRELLKHLAvePDAAMEYLFARLFAAHPDLRGLFPY--GM------------TQTRAAVFGELAAIIGGLDDqerTEQTLARLALGHR-KFGVKDKHYEPFFDAMFVTAQHAAGAAWTGEMAASWRSALDWFGSVMAA--- +>SRR5262249_54331370 +--IRLRK-------EIDNEWllIASgVLSVIFGLILVAQPGTGALA---------------------LLYVIGIYAILYGILGPrpcCV----------N-RFGAQTALDRG-----------------TSTYRELWNIS----VARLIG--- +>SRR4029079_9820506 +-VRVDGILVEGLQASLATMQpaAAQIAHGFYTLLFARRPDFRAMFP--EDM------------AAQERKLIATLAFVCEHWRKpaaVSVRLADLGALHQ-GLHVKPEHYPIVCDALVTAVMKHRHEALGPHRAR------------------ +>ERR1719310_1734953 +----SASSVKAVQASWAKAEnigLRVVGELFFKELFEASPAAKELFTA--Q-KFGEDAAGQRRFKAHTLNVMQTLSAAVYGLSDlsaLARTLPAPTYAIL-SLSFTLISFTSL--------------SLTPLI-------------------- +>ERR1712087_347811 +--------------------------------------HEELFTA--QKKFGEDAAGKAHFKAHTLNVMQTLAAAVYGLSDlsaLARTLPARIYAIL-SLSFTLITFTSLSLTPLIYHTLTLKGARARNSGRaaPWIRRPT----------- +>tr|N1QXN3|N1QXN3_AEGTA 
Non-symbiotic hemoglobin OS=Aegilops tauschii GN=F775_23753 PE=3 SV=1 +-MAFSEAQEELVLRSWKAMkpDSESIALKFFLRIFEIAPAAKPMFPFLRDAGEDAPLESHPKLKAHAVTVFVMACESATQLRktgDvkvREATLRRLGATHV-RAGVADAHFEVVKTALLDTIEGAVPEMWTPEMKAAWEEAYDQLAAAIKEEM- +>SRR5262245_14739337 +--PCARARLRPR-------RpaL------Y-AQALPPRRLVPRPVRE--L------------AEAQSRKFMAGLKLGIIALNyedGLTPVIRLVGVRNR-RAGIKVRHHRVMAKALLPTLEQSLETRFTRDTKHAWSSFLTQVTRILSG--- +>SRR6266699_2273235 +--FFLPFKE-LTEQHFSILGlrkARRAGLVLAQELFEHAPNVGARHSN--AF------------GGRGYCRRMR---------PRtap------VCDSAR-CWAPSCRRQ---APLALR-------------------------SCRPVR--- +>tr|A0A084QEN9|A0A084QEN9_STAC4 Uncharacterized protein OS=Stachybotrys chlorohalonata (strain IBT 40285) GN=S40285_06080 PE=4 SV=1 +------------------------------------------------------------MEKYPRIDIRSPAGVSIIYKDvssLDPAQEEIRVLHL-HGG---PEDSPIECTLHKiALKSNPPPVYE-ALSYTWGDAsvtreIVL-NGHVVS--- +>ERR1712224_896978 +-GCLSHRQSTLIRGSLPMLraQGETITSSFYASLLSAHPELHNIFNS-AN----------QATGRQPRALLNIILAFAAAPNHtaeLIPRLERVCQKHC-SLGIRLTSTTSSASTSS---GPLARSS------------------------- +>tr|L8LYK6|L8LYK6_9CYAN Hemoglobin-like flavoprotein OS=Xenococcus sp. 
PCC 7305 GN=Xen7305DRAFT_00009490 PE=4 SV=1 +----MSLQIGLLEQSFNCIRPyGkLFVSSFHENLFQTNPEIKSLFMGV-E------------SQIQKNRIWDTLVLIMENIrhpNLLNNTLQGLGARLF-THGLLPKHYPLVKKAFLATFKQFLGNEWNSELEQAWKNAYTYFHDLMQEG-- +>ERR1022692_2453048 +--------XMSLPASFTSICngilGREE--------NSGCPAAKGQFLP--DR------------DAWrRssaLLLFGPLHQASRSTGYvshLHegaArppgrRispDRRPGRQAG-RSGRLRAGPRAGPPQVRGHRRALRRGRRQPAGDTGAFRGRHLDARVMIE--- +>tr|S0BCU7|S0BCU7_LAMSA Extracellular globin OS=Lamellibrachia satsuma OX=104711 GN=v2hb-B2 PE=1 SV=1 +---CTTEDRREMQLMWANVWsaqftgrRLAIAQAVFKDLFAHVPDAVGLFDRV-HGT----EIDSSEFKAHCIRVVNGLDSAIGLLSDpstLNEQLSHLATQHQERAGVTKGGFSAIAQSFLRVMPQV-ASCFNP---DAWSRCFNRITNGMTEG-- +>tr|A0A1Y1ILY9|A0A1Y1ILY9_KLENI Cytochrome b5 isoform OS=Klebsormidium nitens GN=KFL_008610010 PE=3 SV=1 +-PHLTTSDVKLVQESWAKVVeahGVGAVTLFYVNLFTLAPHLESLFKKTKN--------------IQEAMFTDMMMTLVGKLHDwewVVSALEASAIRHL-RYGVSVSMFPAVGQALLQTLDMGLGVHWTPEVKAAWIKLWTAIVSVMSVHL- +>SRR5579875_3194573 +-------------------------------------------------------------SRCCSRATPSYGRCSRSRCrgpgrrsAtgsPSSSATCRRPGAR-RSCSRRWPGITAGSASvtgtTGRSSRRSGPAWTAELDAAWLAATDWFVSVLAA--- +>tr|A0A0L8P0I1|A0A0L8P0I1_KITAU Flavohemoprotein OS=Kitasatospora aureofaciens GN=ADK78_37645 PE=4 SV=1 +----GAADQRVITEYLELVTpfGE-LITHLYETMFRRWPYLRSLFPE--SM------------EFQRAHLARAFWYLIENLHRpddIAEVFGRLGRDHR-KLGVRPVHFQAFEAALCEALRRTAGPRWADAVEQAWVRMLRFAVAAMVS--- +>SRR5688572_1436081 +--RPAPEVIAAVSASCQAVAdrPVRLAEAFYEHLFEIAPQARTMFP--ADMT------------AQMQRMSDTLVGAIAQLEKfdtaqLEAALRRLGADHRTRHGVEAEQYRYVGHALTRAVRDVAGLAYSGALSSAWIAVYQYIEAHMSAG-- +>SRR5947208_57978 +--EMTPEQIALVQHSIEVLGprVDTVVERFYQHLFEIDPSVVELFST--DP----------A--VQRRKFeveLRQIIKAISGFDEFAGRAHDLGIRHS-HYGVRARHYRSVGDSLWWAWQSVMGSAVDSEHSKVGEAAQDV---------- +>SRR3954454_13764990 +----VLDPAMLVQSTFALVArqRQRFSERFYANLFAIAPETEVQFAG-TPP------------ELRDRMFVEILFLVARSMSrvdEIAPALTELGARHV-AYGTLGSQLPLAKRALLAALRELLGDAMTAEVEAAWSETYDAMAEPMARGM- +>SRR5579864_8015183 
+----KPDPIFLVHTSFVHLRprMAEFVSNFFRRLLKDSPELAPIFED-ADS------------VRLKTMVAKIFGTTIAGPEqtdQVEADLAELSRRHK-SYGAIPDFLPLVGRAFIATIRESLPDDTTPQTIEAWELLYANTAALMSKGL- +>ERR1719483_919245 +MAVLSKSESDLIYKSWALAAdeKEKHGGAFMVRLFTEHPEVQaKYFPKM-DMN------DFMLLSKHGSKIMAAVDTLVNYVNDgndekLVKTINHVASSHF-RRGVVTrEAFEIVTEVLMNYLITTLGDHLSPEAQLAWKKLLSVLVEVIA---- +>ERR1711860_359782 +----LFSKSNYVFAS---------LSRNTFKLFKDERSLYeKHFSSF-DVN------DILRIRAHGLKVMKAVNSMVEAVSDendesLIDQIHFVAHGHH-LRGITPrNEFEVRRKILNLDYHLLFHyllkkGCLSQSX-------------------- +>SRR6266545_1588040 +-------------CDLEQAVdtCPA----------A---LVIGLRP--ATMG------------TL---------CYMGGLAsa------AVCCWRHV-RVVTCSQFF-------------------------------TTASPQSRQ--- +>DeetaT_16_FD_contig_41_1516467_length_281_multi_3_in_0_out_0_1 # 3 # 167 # 1 # ID=1772959_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.418 +------RRMNLVKQTWRSVEfglGHKATQAFYDRLFANHLDTRRLFAG-VGM------------EGQSRKLYDLLRLAVRSLDDldaIIPTVQEMGRRHARSYGVVRDHYGAVTQAFIEILHQYICSqlghmahsRYLVDVADAWAWCLNLIGNIMAD--- +>ERR1719433_537024 +--ALRISIVGREKRA-NCTVtlgRVEQGELQVGATVLLVPPGAECGVQSvevdgREVRSAqagefVCMRLLgcQP---SVGHALSSVD---GPLRSatkLKVRSAQAGEFV------------------------------------------------------ +>ERR1719161_1849694 +--ALRVMVLGMTADKVG-AAlegHVEQGTLRAGTRCLAAlsEGQAECNVQIvllngVEVSHAgpgehVRLKVTgaAAKGFTAGQVLSCIS---NPVRAigkFKAKLRLMSLPEM-LS----------CSLLVL---------------------------------- +>ERR1719277_2163216 +--EATDAMKGAVQRSWDQIQalgTTVVGEHVYRYFFELVPEAVNCFPVHvrlkyREwiADEPdenGDLRNSAALRNLFAKVLNAIGCTVAGLQDaskLVPLLSSLGARHI-GYGVSEEFWPALGKAINRTLQDLLAEAFTPEVENAWNTVYGFMSQIMVESLR +>tr|A0A2G8RXV1|A0A2G8RXV1_9APHY Uncharacterized protein OS=Ganoderma sinense ZZ0214-1 OX=1077348 GN=GSI_12102 PE=3 SV=1 +PKPLTAEQRKLITAIVPVLEqhGKTITTLMYNQMLEENPALKNVFSKS-----------KQERGQQPEVLARSLYAYASHIEDlgpIMPFVERIAHKHA-SVHVEPAHYDVVAKYLTNAIIQVVGaDVLAGALYDAWIAAYWNLAYVFIDR-- +>ERR1712080_154454 
+-----DLQKIIVKHQWARSYnegmsREYFGQAIWRAFFKLDPGARRFFTRVRGD-----DISHPKFQAHSLRILGGIDMCLSLIDDvptFEAQMKHLQGQHI-EREVPSYYFDRLGTVLQEVMRAATGYCYDE---VAWGACYKYISDRIKANY- +>tr|A0A0S2MLM1|A0A0S2MLM1_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1 +-----PLDRILVKAEWAMASdgghkDSELGSSIFRALVNIDPALRGTFSAVGGE-----DMGSAQFRAFAFRVVAGIERLIAVLDVdavLSADLAVLHSQHV-ARDVSAANYESMLSAIMSVVPSAvGNSCFSS---PSWSRCLNVIAAAM----- +>tr|A0A066YRR6|A0A066YRR6_9ACTN Putative oxidoreductase OS=Kitasatospora cheerisanensis KCTC 2395 GN=KCH_40190 PE=4 SV=1 +--PPDAADLALAGAVLAALRpvADRAMAHFFALMFLRHPELRAVFPA--A------------MDGPREQLLRVLRECVRHGDDpaaLRDRLGPLARRCR-KYGVLSGHYASAADCLVEALARYG-SGWDERAEAAWRRLLAPVARLLVEA-- +>ERR1719329_2046659 +-----------IKTVWAKIMkevgTLNAGTMLFKNVFMLAPETKQLFPKFRHLK-DDLLLSNESFKNQAKLSISALSNAIMSFDDppkLKRMLMDLGRIYE-SKGVSLATLPIVGNALMATIEAALGNDSCIETFNFFALFYNEGSNMLAEGYK +>ERR1719265_1860150 +-------------------------------------QALNYFPRFKMnnlLF-SDALFEDEIFKIHAYKLINAITNAIDLLDEpvkLTETLKHLGRIHE-NKGIPAESFVVIINAFNVTVANLISRDSSIETINFFALFMNEGTNLMTDGX- +>SRR3569832_2958212 +----PALVRSAPDSAAALRrcRCGGTAEKIAERARADD----------------------------------------PESEKsrgAGADDERIGRTAQ-AIRCSAGRLSSGACCAVGGHGGIGGX-------------------------- +>HubBroStandDraft_4_1064222.scaffolds.fasta_scaffold919957_1 # 1 # 597 # -1 # ID=919957_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.524 +-HPMDPSRVMRLRISHGWFAPcgEALVARCFQILGEQTPGTRSLFP--ADTA------------SLHPRILRTLRQVLSNAHEfrtLEPPLARLGEKLQ-RRAGGvehlLPHAAAFRDAFICVLAEAGGRSFTHQMEQDWRMLLDGVLGAMIAG-- +>tr|Q1GDP0|Q1GDP0_RUEST Globin OS=Ruegeria sp. 
(strain TM1040) GN=TM1040_2494 PE=4 SV=1 +-AILRQIEVQLIKVSFNRVFaqKAALAEKFYHHLFLELPDAEVMFT--RDFS------------HQTEMFARVLTTGMQSLGRdreMMVLVDDLLQRHK-HLGLTLDQMYTAQRALHLAFCEVMQAELTAAEVSAWDNAIGRLCRALAAGI- +>ERR1043166_6829872 +-LNLTADEIDRVRTSFDQVWaiSSRMADLFYDRLFAGNPFARSLFPA--QQ------------DERKQNFMLNLAVIVAGLDEradMDRSEERLVQAHA-EAGIRVDQSEVMRDALFWSLEQGLGPAWTPGVAAAWRKAYRLLSEHMAS--- +>tr|A0A257MW93|A0A257MW93_9GAMM Uncharacterized protein OS=Methylococcaceae bacterium NSP1-2 GN=CG439_2278 PE=4 SV=1 +---VKVKNRLLVKLCIDEISpkIDIVSQLFYQELFHLNIHLKTIFSG--NVT------------FLNRKFINMMAtfKNVKHLEAIENSVEKMGERHVLHYRVQLKHFPTLKKALLLALKKHLGERFNAELEAAWHEVFDDVAEIMQRA-- +>SRR5690554_3276444 +----xmSDADRLQVQASVERIRgqMDGFAGCFFDKLFALQPALRELLAT--E-E------------GRRSKLRSMVStlANSRDFDKIAPAIRRLGDRHR-DYGVGVQDYVPVQQALLHAVAQVDPQGQSEQVQQAWSGQFQRISALMEPQ-- +>UPI00042C7A07 status=active +---MNDTQRLLVKADIDSLGndINALSQIFYRELFHIDINLKSVFPG--NVV------------FLNRKFANMLAtfKNLGHLEKIGASLEKMGERHLANYGVQLENFAPVRAALLIALRSYFKENFDAEREAAWQAVFDKVADIMKAA-- +>SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold510383_1 # 42 # 362 # 1 # ID=510383_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.393 +----mTSKDRALLKECVEYIEsesINELCDIFYKKLFDLDPKIKLILSD--NDV------------VLRRKFFNMFStfKSVKYIDKVSEIILQMGARHK-SYGINEKHLELMKEPLFESLHEVLGDEKFNYYKAGWEIGYQEVENLFKEG-- +>ERR1700737_3653126 +MTALTADQIARVKATAPVLAehGVTITKHFYKRMFTNHPEWKNVFNQ-AHQQS----------ASQPQALARAVYAYAAHIDNlraLGSAVSHIANKHA-SLNIRPEKYPTCGKICWRQYPKCWAIPSMNPRSTPGAPLMRNSRRFLSGR-- +>SRR5919197_656730 +--LLDDDTIGLLDESLRLIDdrSDVVVNHFYAAQFATPPPRGLLGSR--AR------------GC--------LGRGVR-----RDGPGDVGRRSR-GGGGRAGLV--EGRD------------------------------------- +>SRR5919106_2778213 +----------------------A-VDRFYAA-VLGDPELAGYFTDvdidrvkrhqvlllsdvlggpesyDGPD------------LGQAHRGlgitdghyDKVVGYLVAVFTDLgadGDTIAAAAEVL----ASVK---PQ----I---VEDQAGSRDSHEX-------------------- +>tr|F4F3R7|F4F3R7_VERMA Oxidoreductase FAD/NAD(P)-binding domain-containing protein OS=Verrucosispora maris (strain AB-18-032) 
GN=VAB18032_21340 PE=4 S +-------MRDHPAAEVGGIAeavFGRAAARFWDTVQEGCPGLLP--------------------EGDAPLILAGLLRLVGGGDDRpgrLALLTVLGRVYR-EHRLRPDHAALVGA----ALT--VAVPSMPPEAATWRRA----WRlVERA--- +>tr|A0A2T3A5F4|A0A2T3A5F4_9PEZI Flavohemoglobin OS=Coniella lustricola OX=2025994 GN=BD289DRAFT_370338 PE=3 SV=1 +--ALTFKEAQLVKSTIPFLReqGEELSNLVYGNLVKRNPELNNKLNVI-HLQDG-------RLARALTVVILRFACNINDMSELIPKFERVCNKHC-TVGVQPMHYELLGALVIEAFESLMGDALTPEIRAAWTKAYSILSHMLIGR-- +>SRR5439155_13306073 +-VLLD-------GGTLRAVRmsGDTRSEPWLKDLWERGVAVGELRRHLllpletppGLP------------VPRGRILCNCFDVAESEIDAfla-------------------------T-SNSIAELqarlkCGTNCGSCLPELRRKSLCDIG----------- +>ERR1043166_8897093 +---GTRDQADIVQLTWHSVLpvGGTFAELFYGRLFALDPEVRRLFKD--DI------------VEQGRNLTAMLSVATANLVKperVGRPPGGLHFRRK-D--VDQRVLEREEERVLHQRemlrPHAVSGVALAELMERHADAP---GGVHRHA-- +>Wag4MinimDraft_6_1082665.scaffolds.fasta_scaffold479856_1 # 2 # 223 # 1 # ID=479856_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.387 +-IALE-------DGRLRAVRlaGDTRAASALLELWERQAPVDAEDLPEtPAH------------ASRGRIICNCYDVSETEIAAy----------------------------RSLADLqaalrCGTSCGSCLPELRAKFGVIPR----------- +>tr|A0A2B4SBA2|A0A2B4SBA2_STYPI Serine palmitoyltransferase 2 OS=Stylophora pistillata OX=50429 GN=Sptlc2 PE=3 SV=1 +--QISQKQISLVQETWGLVsgDLEKVGVDFYMRLFKANPDVLQLFS-FRDIDKSsdDIMRADDRLKRQGLVTMQHVDLAVNSLNDlgsIVPALRDLGGRHA-MYKVEEHHYVLVGSVLLDTLNNGLGDNFTVEL--FWAALLNTLDKGLGE--- +>tr|A0A0C1L0Z1|A0A0C1L0Z1_9BACT Uncharacterized protein OS=Flavihumibacter solisilvae OX=1349421 GN=OI18_18680 PE=4 SV=1 +-MEMTPRQMQCVRNSWRNFrdlDPAFFSEPFYAKLFADHPAAKKVFGD--NL------------AEHFSFLHEMLSQLVSRIDRPdqlLITCSRIARNNA-ALGMNEKFYEWYGHALIWTLRQGAGADWNMETEQSWISYYKYLVD------- +>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3668839_2 # 105 # 377 # 1 # ID=3668839_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.656 
+--------SGPLAASLAIFEprLEAVTARLVDVLAASSPHLLALFPP-SSE------------PS-----AALLGRFLTRIVEtesLGqPLGDGLGLDAY-PIP-TRDQWEHLVESFIWSLSAVAGKAFSPPMARAWRATGERLFSTMFES-- +>LULI01.1.fsa_nt_gb|LULI01000097.1|_29 # 27187 # 28320 # 1 # ID=97_29;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.310 +----------------DEIKgrH---HSMFVDEFERQQPQYKD---------------------------------FWARL---NrGEYQAGEYRRY-GKG-GKEVWIQA---------------------------------------- +>SRR5947209_9205436 +--------VLSVLRSpssplF---PyttlfRSRltver--DSERDVLMvaggtGIATMRAL--LD--DLA-------------QWgENPRVHLFYGGRTDDDlyaLDd--LHQLDRKST-RLNSSHANISY---Avfclk------------------------------------- +>SRR5690606_15697619 +--------VRVVAGGwvsralvrqtvpgdrW---RvgapMGElwrdr--DVQRDLVLiaggtGVAPLHAV--VE--DLA-------------GRatQPSSVTLFFGGPTADAlyfLPe--LRELAADLP-WLKLVP--------Vte----------dgsvddgergklPEVVTALGGAWSGHDVLVAGSPGMI-- +>SRR5919202_1970091 +--------VQMVPGGqvsstmvrslkvgetV---RlgapLGQaltlyag--ERHRDLIMvavgtGLAPLRAH--LE--RIDQ-----------EwqSTgRAPRVRLFHGARLPWGlyeNRl--LQNLAG-RP-WFTYTP--------Vvsddp----------typgrkgwvGDAAAVS-GPLHGLLALVCGSPEMV-- +>tr|A0A1D8N423|A0A1D8N423_YARLL Uncharacterized protein OS=Yarrowia lipolytica GN=YALI1_A07937g PE=3 SV=1 +-FNMTREDINLTKELWAKLMndPEtlessaaygtptaLFCEQFYTNLMASHAELTSIFP---SI------------KKQSVAVAGVFGLAIKSLDHiekLDEFLWSVGKRHNRMIGVEPIHYRWLGEAMIKTFADRFGDSFTLEMETAWIKIYSYLANKLL---- +>SRR6266851_2503075 +-----------------------------------------------XM------------RNGSASLPLwPARYGAWTTRRpspNISAPSRSTI-----------ANSVCGRAITNWSARRCSPPSVSSAASGWEAAFNRIATIMIQ--- +>SRR6059036_2276597 +--ALFPGTSHWVV---AAGMarP-ESKDHPMLTVAQKTLVQ-------DTFA------------IITPIADDAAALLYKKLFEldpSLERM------------------------------------------------------------- +>SoimicMinimDraft_1059729.scaffolds.fasta_scaffold91729_1 # 2 # 175 # -1 # ID=91729_1;partial=10;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.661 
+----SMEDRLEMIHEWETVWsaeftgrRVLIAQELFSRLFEKDGTTQALFKNVG-G----DDVNSALFKAHCVRITDSIDTIVHMASYtdvEHQLLDHLGDQHAHYDGVLGSHFKLFRECFLEVLPQAIP-CFNS---GAWGRCLKVFQDEIALH-- +>ERR1700754_2066947 +-------DPGdrQLARELLAGAagGDDLDALvehDRGAVLEIAREAVPVaLAQ-ADR------------DdQLGHLGA--------------DRlLRGPAERPL-GRGAPLQDVALVvhrddavergqqqRAVALAAGAELVGEIWERQERGSLtARRYGSNRSI------ +>SRR5208337_544005 +--TMTPQQTRLLAQSYAKLEnrLYELGSAIFERLFEIDPHSRPLFK--GNMD------------EQKLKLARLFGEFIRIRarsqhflpvtgkagQVVIPGIGSLGARHEMVYGVRPEQYAHMRDAVLYAIRSLLGNDYNDEIGQAWSEIFDMLAHAMQE--- +>tr|A0A2A6CNA4|A0A2A6CNA4_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_32112 PE=3 SV=1 +--QCNPRYTALLKSTWSDDfEvLFALGAKMYITAFEgpHGVACKSLFPWVAKYEeAGENYADKSEFRLQALRLVQTIVKALDKVDDlqkLEAYLYAVGHRHV-FYlpvWLDPVYWDVFKasratsylgqstmlksaserDAVQVGVNDHLHKlsKLSTddlaRATLIWTDIIEYIFEYVKEGF- +>SRR5437763_1847173 +------------------------------------------------------------MVRQKRHMVALLSQVLGGPKQy---QGRDLAEAHR-SLGISGLHYERVGNYLLASLLiaqapydvinavtdvlagqrdKIVAAAWAAELAADWTDAYSLVARVMVE--- +>ERR1719244_1430206 +-TGLSRKQRFLLKGSWKGVSrdLESTGVSWFLELFETCPNARGSLRQFSHISLDDDLTENQPFREMTEKVLERLDNALFSIEDadsMRSILLETGDYLRSVVGLNNDIILQSEGPLLSAIQRTLDERYTPQMEVIYTVIVKFMINTMVEX-- +>ERR1712228_920792 +-----------------------------------------------HISLNEDLTEVQQFREMTEKVLERLDNALFSIEDadsMRSILLEAGDYLRSVVGLNNDIIMRSEGPLLSAIKDFRREIhttngsdlhsdskihdkYNGRMRPL----------------- +>SRR2546430_6350501 +----GRResRVRGGQGGWV---sRAIVAEPQRGDVGRSGPAMGRMKVD--RG-------------AGRDVVMVAGGT------GlapMRAIIDDL----A-QWGENPRvhlfyggrgrggPYH------PPSLVSTAAAqPGVPVVavagaeaglshkeagspagggvrHGALAGRG------------ +>SRR6195952_1380156 +----VALAGEAVRAIWFRLAdqEADVAHWFGALLFSLAPHLRAQFPA--QA------------DRAARRLLRASIAAMSAVDRpqeFPAAIGTLARETR-ALGLDASADEPVGVALVGAVREFAGELWAPGADAAWVLAYSLAAEPARR--- +>ERR1700709_350262 +---------------------------------------GDLDAD--AT-------------AERELLVVAGGRRGGVGpaprGepaGpsgAGGGRPPRPARLA-AGVDVRRttvivgartaedLHT------LDRFAVIGEDaPWLAVVgacesdplelglapgpvvegitrAGPWLEHDVVVA-------- 
+>ERR1700709_656719 +----------------------------------------------------------------ADVVAVAGGP------GasgALALGDDLAAQAA-AGVDVRPttvivggrtpedLHT------LDRFAVIGEDaPWLAVGgacesdpldlelapgtvveaitrAGPWLEHDVVVA-------- +>SRR5262245_28534727 +-------efHVKTVPGGWV---sASMVNDTQVGDEWKIGPPIGLLGLV--TH-------------SQRDLLLIGGGV------GvapIMSIVPEL----L-RRRSSNRvslfhgvrypheLYL------NGTLDDLAARdPNLEVVkvvsrdrnyagitgslpdvvaqHRDWSAYDVVVS-------- +>SRR3569833_3303276 +---------------------------------------------------------------------------------pNNTNHDKH----T-HRKRNPPehqniggkrpedLYV------LDDLRRLTAVsKWLTVTgvteegaipggdrgtlahavaqRGVWEYYDILVS-------- +>tr|A0A161TXB5|A0A161TXB5_9DIPT Globin 11 OS=Chironomus riparius OX=315576 PE=2 SV=1 +-ATLNADEAKLVKGSWDKVKGQE--DGILYAIFKENPDIQAKFPAFVGKN-LEEIKSNDDFTKHADRIVAAVSKYIELVGNeantpaIKTLLNELGQTHR-SRGATKEQFEKFKSSVAKYLKEHSG-AWSDATGAAWNKAFDEMYAIVFSSL- +>tr|V5YNC2|V5YNC2_9DIPT Globin OS=Polypedilum nubifer OX=54969 GN=PnHb4 PE=2 SV=1 +-ATLTESEANSVKTSWNLVKDKE--DEILYAIFKENPDIQARFPLFVSKN-LEEIKTSADFKTHADKIVKAISTYINLLGNeantpaIKTTLNELGQRHK-DRGATTEQFEKFKVSVLKYVKEHAT-GLTADAENAWNKAFEEMYKIVFANL- +>tr|Q23764|Q23764_CHITU Hemoglobin IA (Fragment) OS=Chironomus thummi OX=7154 PE=4 SV=1 +---------------------------------------------------------------------------------tILAKAKDFGKSHK-SRTS-PAQLDNFRKSLVVYLKGAT--KWDSAVESSWAPVLDFVFSTLKNEL- +>ERR1712170_324299 +-------------------------------------------------------rVCREKLNVHALCVVAMIDKGISVLDKpcdFVELLLIHGRRHK-NHGVARKTFQTLGNFFIQSFKEVLEDDWTDEIEAAWKIFFRFLNIGLEAGY- +>SRR5688572_12388254 +--SMNEEQIKLVETGFQSITgrGERFISRFYENFFAASPKAEKLFAQT-EW------------PNQSRKMLLTIMMVVDNLRDaahIKKMLHEANLVHQ-KFTLQADDFDALTDAMLRTLREFLTDDWSKEAEDAWRAAFAKINAIMLEA-- +>ERR1044072_9602616 +-------LEQSGYTVVGRAAdaRELmLKVRSYVPDVA--------VVD--VR------------MPP------DL--------TddgLRAAAEI-RRSHptV-SVlVLSQHREPAYMLELVGDDASGVGYLL-KDRVRDVTQFVDAVQRVAAGG-- +>SRR4051794_28399871 
+-------EHEAGTDLLELTD------ALVRAGVPCADAAQEAVAG--VE------------LPHGAQLPAER--------LadrLERRRVD---------lD------------------------------RLLRFGEDAG-HLVLGA-- +>SRR6266545_7915566 +-------ELDTLETTFDLLAprGEELMDIFYARLFAAAPGGRAAVRR--HR------------PSPPEGSPPRR---------ARAPAQV---------aA------------------------------QPRCDRPDAA--------- +>SRR4029453_17830486 +-------DLQALETSFDLVAsrGDVLMDVFYARLfaaapa------VKPLFAG-TDP------------RRQKAMLLGALVRLRGSLRGppaFVPPLPRPGAGPggE-APlrrhrSPAPEGHAARGPraaAWLPARPAGVRSGaatPRGQARRLWRPAGALPGGRRgpdrLHG-- +>SRR3546814_7943381 +--------------------------------------vfirlslsliiilvyRFLFFFF-SSR----------RR-HTRCVLVTGVQTCALPIS----TDELIA-------AWAAAYGQ--------------------------------LADLLIA--- +>ERR1700737_1149585 +---------------------------------------------------------------------------------kqPDGSAEKHFEQAC-ESGRPTGAVSHCRGTPAGCDQGSVGRRRNRRDHFHRGKGYGNLADILMG--- +>tr|A0A255XUI9|A0A255XUI9_9PROT Uncharacterized protein OS=Elstera cyanobacteriorum GN=CHR90_04515 PE=4 SV=1 +-PMLSSQSIATVKATAPALRphGLNLVVRTYELLLRDPNI-RMLFDP-A--------------rqvnGDQQHIFAETVIAYVNAMDRldtLKATVKHLTIQQA-LLDAQPQHYDAIAIALIQAIHELFGKDAVREITSAWTEALDVLHQESPG--- +>ERR1043165_5678211 +------------------------------------TAglktrkpkgltdsdmdilvpvtA--------------------------ALFLAGMTAYIGILA----LRELSATRLA-SATAAVEHAF--------------------------------LREQISE--- +>SRR6476660_7153442 +QYMLPQRTIDIVKSTAPILEehGETLTAHFYRRMFAYNPEVAPLFNP-A----------HQRAGSQQKALAAAICAYAANIDNlevLGGAVELIAQKHA-SLRILPEHVRITPESEIISSFYLQpADGGGLPLFKP-GQYITVRVPDARG--- +>tr|A0A2D6MWT2|A0A2D6MWT2_9DELT Uncharacterized protein OS=Deltaproteobacteria bacterium OX=2026735 GN=CL908_18525 PE=3 SV=1 +-----SEVAERLRSSLEIIAEceATFIRRVYEDLFEQHPKTAELFGG--HS------------RAvRGEMVREVLMYAIEHNEGaswVEENLASLGDQHE-VNGVTLEMYGWFVDSLLRIFAEVSGPDWCAELEGSWRTALELVSDLMSSPE- +>SRR3954454_17009507 +--PFDPATVAVVRASVTKLpsEPIELTREFYRQLFEIAPQARVLFAE--DMT------------DQTERLLSAILAGVRAMDRpelVEDHLRRWGVVHRRMHGVTNDLYVYVGHALIRALHRIFGH-LETSVSSAWIAVYEWMAAVMIDG-- +>ERR1719446_1443192 
+-----------------------------------------------------------------------------LAQDlsaLCPE---CGFK------VG--TMGVC---QTK------ANDAAIE-----------AKDPPVAT-- +>SRR6187402_970848 +--GITTADTLLVQTSWNTVSefSTKIIAGFYKHLFASEPEVRPLFKS--NQS------------VQEKRMALMINTIVNSadsLDEFRGSIAQLAKSHV-HMGVKNEYFPIVVKAIISSVEEQYGKGFTSAHKKAWYKILNQISAIMMEE-- +>SRR5215510_10546783 +-----------------CLDrcRLFVVFYLIACiivlffFFQAEDGIRDGHVtgvqT--CA------------LPIWARLLGAIVTAVQTIEDperFDGYLRALGRDHR-KFHVEPAHFGVVGAALLDALREFSGTQWSHAFEQAWRDAYGMMARKMLA--- +>ERR1719150_2276450 +-MGLTKAQVAAIQNNWATVSqnMQDVGDALFMRYLTANPGDLSFFPKFQGAGVGPQLHSNEDFQHQTLTVMQFLGQIVAHLGDIPaaeGMLRERVKTHH-PRGISMAQFERLLDLVPRLVQEICGA--SGPTADAWRVAVATLMPSMRDEF- +>tr|A0A1K0GS94|A0A1K0GS94_9ACTN Globin OS=Couchioplanes caeruleus subsp. caeruleus OX=56427 GN=BG844_22340 PE=4 SV=1 +--GMNPaddaelhAVQRLLISSLEQAGgQVEVATRLRAALAQAGPALFARIP--GGP------------LAQVEQLAEGLAWLAQHTDqPpaLVAGFGRLGAVLA-ECGIAPQQLQLAGAALAEAMRAgMAANGWRQDYDQAwrstWQHAYQWIAHGMVAA-- +>ERR1719193_2756600 +----------------------------FM--EKKVPSVIV------FLN-SLSLDDDGALETHALSVMNSVNKVVSRLDQpdrLVQLLHDLGRKHI-SYKANMAFLEPIAKHFILTIKPSVA-EWSPEIEDAWQQAFKVIGHIMQE--- +>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold5203666_1 # 3 # 269 # -1 # ID=5203666_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.315 +-----------------------------------------------------------DFESQGRALTRMLAWIIQNMSNvsqLVPVLAQMGGRHE-IYGVKDADFGTFATTVANSFRSVLGPEIiDDDAHQAWESCISGIGGLMQL--- +>SRR5215203_6923026 +--PGDSGADRAGRAD---AerDQAGLRRGRG-RLLPPAVRRRPLRggavhhrA--GH----------PTgEADRGAGCGDALDQAPRRVPAPgrh-ARPAAPGLRG-----------------PPAALRHRAG--------------------------- +>SRR5215208_6178010 +----GRGRPRPDTAIIRRGVagQPTIRHLFYDRLFEHDPETRLLFR--SDLD------------RQRLRLLTMITAMVGPASDdls---------ATNA-GhAGVPPWRWLSLA-----NARDVADP-------------------------- +>tr|A0A074ZZ62|A0A074ZZ62_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_01589 PE=3 SV=1 
+-----------MFDELPPATdhLSKK--ITSGRA---LGMICSNAN-VHTLS-NEEIAADTRSKQHILAFMDVLSKAIGALDGgredFCEKLMVLGARHAAIPGMKLEYFKVFKQAILMTWEALMYEEFTEDVRRAWAHLMDYIIGILSEG-- +>tr|A0A2A2WQA6|A0A2A2WQA6_9ACTN Oxidoreductase OS=Dietzia natronolimnaea OX=161920 GN=CEY15_08520 PE=4 SV=1 +-----STATPPLLALRDLVTDPRFTDLFARALREADPDFRELFPR--DA------------SGVLGEFVRAMSWALETVEnargdeaevaQVVEFARHLGADHR-KLELSTRHHQRFGEALTSTLRHLAGPGWDDRLSTTLGTVYRVLTTALRE--- +>tr|A0A2W5I8T1|A0A2W5I8T1_9ACTN Uncharacterized protein OS=Lawsonella clevelandensis OX=1528099 GN=DI579_06450 PE=4 SV=1 +-----PTYYTVLGPAITLLRehPEDFMRHFLAAALTYDFHFHTFFPS--VN------------DHHASRYTHALRYILEALDqstndpdcldDVIDFLSQLGCDQR-KYQLTAEQYQSLAAALRDTFALLLPYQWSTELNDALLTSFEHAINVMQS--- +>tr|A0A2N6TBK5|A0A2N6TBK5_9CORY NAD(P)H-flavin reductase OS=Corynebacterium kroppenstedtii OX=161879 GN=CJ202_05310 PE=4 SV=1 +-----GVHEASLVPVVTVLQtdGSRFVDAVFTHLFARRPSFIRRLPA--DL------------SQLKPSFRRALVHVYAKQAtgngldrRTRRFLRHLAEDHR-SFGVEAPDYVAMGDAIIDAGREIIAPQVTSEEFELFAMATGQIIGLMEE--- +>tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. 
HMSC11E11 OX=1581089 GN=HMPREF3121_11375 PE=4 SV=1 +------------MRAAAAFGrqAPTIGPEAFRRLLDAEPRFRHMFGG--SK------------TALRDQFMSALSTALVTRAdvgrfpaATIRRLEQLARENR-KFGVAPRDYATLAEHLLDVFGERLPAgpdsgAQVDALREILDEAMSLI-AAAAV--- +>tr|A0A1Z5KPX1|A0A1Z5KPX1_FISSO Uncharacterized protein OS=Fistulifera solaris GN=FisN_16Lh317 PE=3 SV=1 +--VASPACVMKVINRWETARqrngfDEQLDIDTLLALFKMDPQVKPIYG-FAVEKEVkAQGMQRMGVLIYGLQVVKMFDVILSALGPdeelFYDVVTEMGEQHC-KHGLTPDHFTLLCGAVMGVLETIMDTEWTKDVRAAWSQVIECVNAEIVK--- +>tr|V5YLS5|V5YLS5_9DIPT Globin OS=Polypedilum nubifer GN=PnHb25 PE=2 SV=1 +-PTFTDAQVATIKGDWNNIK--GQGVEILYHFLNKFPGNYPMFKQFGGKD-LNAAKGTPEFSAQATAIINLLNGVMDKLGSdnagAQAILANLGKTHK-AKGITKEQFQQFREATTELLGNLG---L-GGNLGAWNALFDFVLNVVFTA-- +>AP82_1055514.scaffolds.fasta_scaffold183032_1 # 1 # 312 # -1 # ID=183032_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.529 +-HPITSEEAETLRTLWSQVK--HREADILYVIFKENPDIQAHFPAFVGKD-LEALRKSLAFAIHSTRIVSFFSKIATLAGDpsnlpaSKTLMNELGSSHK-SRGIQKEFFNKFRASLDGFMQRQS--SWNDNAAVVWNKASDNFYFVLFAS-- +>SRR4051812_13904716 +-AGMSPEEVALLRHSLDEMRadGPQAAEAFYAELFRLDPSARELFHL--PV------------EQQSVVFFHELDallSAVSDLPAFVERSRRLGRMHA-GRGVRPEHFEAAAAALDAMLLAVYADGASPELRRAWRHAYRMAAQLMQEA-- +>ERR1711860_53158 +------IYFSDIKSTWDIVKdeIDQIGMLAFLHLFEAHPEAKTKFKMFEDIPT-DDLKTNEIFQNHAHRVVSVIRKVVGKLDEPsvyLNYLKILGGKHI-MFDADVKYIKQMGYMFLSAIQPTLEKevGITLKYV--FKKTFX----------- +>SRR6266536_6175029 +--LMTPEQITLVQSSFERLGpqLPAMATRFYQELFTRDPALRPLFTT--PLP------------QQEVRFAEALTEIVRAMprlDELLTHTRAPRRPArrlR-GTGCRLPDPRRRPprrargrpgRQVRRPHTRGMGPRLQPcrrdharrrsrgPAHQQLTTTAAPTASQADGG-- +>UPI00012780C8 status=active +-MSLTNETKEIIKATVPIIEknEAELTKKIYPLLFTRNPSMKIFFNR-DH----------LRKGTQPRAFIGSIIEYAKNIDNldaIKPLINDIAEKHA-ALNIKPVQYSIVNICLLEVFGKALGTRGTHVVKRAWKDAIEDLANIIIK--- +>ERR1017187_3590871 +-----QVDCAILKQSFAHIEsvAEKAVGYFYARLFVANPELRSMFPL--------------AMDATRKHFLAALAHIVWSMDDpqeLADYLPGAHRHSA-H---VQRRYVDLPGAVrLGGgdrSHRHSHDPGGagRRGRASLVAGX------------ +>ERR1035438_6477963 
+-----------------------------------------------------------------------GARGSPRPAEpaaLSK--------------------QMIDRPLRAAgaaPSMHNTPPWRfgVRPDRLTIELRADIATVMTQA-- +>SRR3546814_3749254 +------------------------CLFFFFCFFFSSIRRHTRCA----LVT-------GVQTCALPILFNAIAAYASNIENlpaLLPAVEKIAQKHT-SFQIKPEQYNIVGTHLLATLDEMFSP--GQGVLDAWGKAYRSEERRV-G--- +>SRR6266704_3508957 +---------TITRAEFCAGRsnrgsKQAFACECYATLIRLHPEVKPLFTH-TSM------------EKQAKKFMASLTLVLHVLGKpdvLTTTLQRLGRRHQ-TMGVRVEHYPMVAEALLATLKSGYAVVLLT----LFVQSYMFL---VRKGA- +>SRR5215207_7267255 +------QAV-----------agEPEVRGSILRKAVRIGPDRANLVQ--GGP------------RGSEDEAaQHACDDRWSRLSTrdLRLGCRGFGTTSR-TVRCDAGSVFGGRRSL---nleLGRGARTRADPVQARSVERFLQGGSALHVEG-- +>SRR5215470_13616785 +----------------------------------------CMVTL--CH------------CSFTqtcscGTRRRGICSRFRWLPSatgWCMRWAGSCPTSR-TSTPSAGTcRTWGASTASSAPSPSTTPTWTPELAADWKAAYDLVAQVMIG--- +>SRR4249920_1577195 +-----------------------------------------VWPC--TA------------TRCRCSSTRTC-----scgtrrRETCsr-SRWPYSATGSCT-RWP-GSCPTSTTWTTSASTCRTWaaSIASSAPAPAADWKAAYELVAQVMVG--- +>SRR5258708_22654124 +------TLARLLKESWSLVEdrADHLANHFYARLFLIDPNLRDMFPV--QM------------AVQRSRLLGALVEPVQTVPNpsqVVPCFLSLALAQP-TIRLLPGQFEAGRSAPIDP--------------------------------- +>SRR6266511_448526 +--------RRRRRRAATSSGraSHRLRDsRLEARARDRSRRVLDDASS--WV------------EVVRLGDAGEPVVLVSAVAAiahRDVRRVELAREGE-RVRL-------QVLNVDAEEDDLAGEHWSVEYDQAWRDAYDRIARVMIM--- +>SRR5579862_1310240 +--LMDPLRIRMVQDSLVKLTprEGSIVDLFAAELSGSPHDESETGG--DNIA------------YQrERSVLGIMAAAAPFLHAPeciLDEVVAEI---G-AGRIHPADYDHAANAFLRALKKNLGAEFTADLWEAWLEALWTLCNLLSRT-- +>tr|Q5DGY4|Q5DGY4_SCHJA SJCHGC09035 protein OS=Schistosoma japonicum OX=6182 PE=2 SV=1 +-LSINDEQLLLLQSSWSIVkqHIEKIGVITFLGIFEQHSDFRDAFTEFRKRK-FVDVKHDPAMQVHGLRVLSIVDKMITRLPKtddIELKLMTIGSKHC-RYVPTIGLISSVSDQLWGAIEPVLkeEGSWSDELAVTWKTVLDYLTKTVR---- +>GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6550916_1 # 2 # 442 # 1 # ID=6550916_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510 
+--------LELIQQTWEKVKphGKEWGPKFYNNMWTKYPEVRTKFFP--E-SKP---------EIQGPRLYASLNFMIKNASDietLKQYCFNMGDRHK-KYHCGAEHFQVVGDAFIMTLTEFLGEDFTPELKQQFQLLYDTVAEMTI---- +>ERR1719360_423992 +-EPLTQAQKEIIFTSWDAItHKENLGVTIMYRIFTGHQEIKHLWKFADDLKTEEEIRGSKTTQFHAKKVINGVNSAIKAVEAgkeVESlGLDKLGARHF-KYGAKPADFRHFVESLFWAIKTIVPE-VSAEMAAAWTNFVMQIIKQMTN--- +>tr|A0A194RIW1|A0A194RIW1_PAPMA Neuroglobin OS=Papilio machaon GN=RR48_08766 PE=3 SV=1 +-SPLSAKQQYCMLASWKGIFrqIEKTGIILFVKLFQENEELLHLFEDFRHLQTVEAQVSSTELAEHATKVMHTLDEGIKGLGDMDsffAYVQHVGSTHTQVPGFVADNFMKIEKPFLDAAKTTLGDRYTPNIENIYKITIRFILENLVKGFE +>ERR1719153_450463 +-MPLSEGTISILKACHPIPvaNREDIGSSFYTLLFQQHPETQNLFPL-SHVSASKGGKPGPQMRS----HPTMPYLIF-HTkqlF------------------------TIIYNTKIQSX-------------------------------- +>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold8273257_2 # 299 # 427 # 1 # ID=8273257_2;partial=01;start_type=ATG;rbs_motif=TAAA;rbs_spacer=15bp;gc_cont=0.364 +---------NELQTNIEDVYsaGDV-C-----ALFDSSaNRYRPtrtwlscafqgEVAAL-NM------------LGQDKVynegvFFNASHayrSMYAVLGNFNPAQAD-GFEFF-VCNQDKENYE----RMVLKDNKIAGAMFVGSMKNVWSVKQLIEGQVDV---- +>ERR1719244_2234371 +-VVLEDAEVEGVQTLWAEVSgdLGNFGARVFGRLVHDHPTIRKYFPWGRNDKTEEQLVAAPDTQAHAEEVFGALGKIIGaagHLNDYRSFLVYKGMQHI-PRGVKPEHFDYLKDALVDTLKEELGDKVTPAGEEGLNKVYSFVEKAMSKGL- +>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3481696_1 # 1 # 387 # -1 # ID=3481696_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.584 +--VLTSNDIALIRESWAYAkDIPAIQTETLLEHFRIQPRTQALFPKFADVP-LNKLPTNDAFIKQARSCVSFGLNFIVANLDNPSLLkDMLGRVdTyG-KWYVDF--MtkeRQMQTTVdifIQVLSKELGGRLSAAAKAAWTRAMTLVFVEMMS--- +>ERR1712198_397898 +-QGLTEEEITEIQSTWKSIIsdkTSEHGVNILIRFFKNYPEYKaQYFQNLNTLS-EDELRESPKLRSHGAGFVLAITQIISDLDNmliVEEVAKKIARNHY-NKGIREPlNYKLMTNTIIDYIKDIGN--LADGTMQNFRKMFDIFIISVRKKY- +>SRR3954447_20457037 +------------------------------------------------------------------HKVKVEDIIVRGGGNLMVEL--MNTDAA-GS-----PLDTPVRAVTDG------TESTAAAREPI--------RLNPG--- +>SRR4030088_1427564 
+--------------------------------------RRGRDGG-QP-------------R-RRELRRDGQepdepDASRRGdrgRPCAGPASR-----------------R--RGSAAGCRSSPPSPAWPALSYEQWRETCDTLHGhTQVLG-- +>ERR1700752_5389668 +----------------------------------VVPQVPAARSR-VPL------------R-AASFRRGGLehdpdPKGRVSakqEPV-FGK-------------------D--HGQTIRLSARGQSS---PrRNDAARETTCKEARMtPEQVK-- +>SRR6478735_7013605 +--IMTPEAIRAIKTSYAAVatQPRQLASRFYSELFTAAPNLRPIFP--ADLT------------LLQGHFEAAIAMVVRNLDEmtaLREPLRDLGAQHV-HWGARPEDYVTAREALIGAVRGTT-RHDRRSAGRCVSRPTRSARpIGSRR--- +>SRR5262249_59625092 +--SRHRDAAVLVRTFTCAPpaPPGRRASRLYEGPFPADPDLRPRFP--ADLT------------LLQNHFEAALALVIRNLDDmnaLREPLRDLGAQHV-HWGARPEDYVTAREALVKAIGALS-ASWTATLEQYWRSAVTSIIvT-MLX--- +>tr|A0A0P5LQ45|A0A0P5LQ45_9CRUS Di-domain hemoglobin OS=Daphnia magna OX=35525 PE=3 SV=1 +--LLTANDRRIIRKTRDQAKkDGDVTPPILFRFIKAPPEYQKIFKPFADVP-QAELLGNENFLAQAYTLLAGLHVVIQTLFSqelMANQLNALGGAHQ-PRGATPVMFEQFGGILEEVLSEELGSGFTAEARQAWKNGIAALVAGIA---- +>tr|A0A0P5UVQ8|A0A0P5UVQ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna OX=35525 PE=4 SV=1 +--LLTANDRRIIRKTWEPRpRrTEDVPPQDPLPFHQGPPRVPEdVQVLRLCSP-SRACEQRKLLGPRPNTILAGLNVVIQSLSThgaYCQPNQRSRSANK-PRGVPPIMFEQFGNVAEEVLAEALGSSFNAEARQAWKNGMRALVTGIT---- +>SRR3954451_10251525 +--------TSARRqqWTFPRCGptspRPQRPGTRARCTSTPTCSCAIPRPA--RC------------SRSRWRT-SGTGSSPPSATWlpgsttstrSCPSCSSSGGTTG-SSGPSrRTTRPSVPacWPRSSTSTTS-GARNSPRAGRrptTASRAPDVLATVMIE--- +>ERR671928_16913 +-----------------------------------------------------------------ALYFDGIDTGR-----lrVHQTKLLVQVTGG-PVEYDGRELAVAHGGLDITLEHFD-PGWTPELARDWTQAYQLVAKVMID--- +>SRR3712207_8140349 +-------------------------------XMIRRPPRSTLFPYTtlFRS------------AHQRDRLFQALGDVVNYVDDldrLVPILQALGRDHR-KFGTVAEQDRKStrLNSSHANI------SYAVfCLKKKKKDSHPSSTTX------ +>ETNmetMinimDraft_30_1059905.scaffolds.fasta_scaffold1335019_1 # 137 # 232 # 1 # ID=1335019_1;partial=01;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.573 
+--PITPEEKDGAMRVWKMILnnrsehflalkrenKekdvqdaencmDYFMHNFYIRLFDIHPNSKQLFHR--SI------------HKQGSFFLRFLSMCVAEVSEpekLDKTMENLANIHN-KLGVKAVEYGIAGEALFHTIHKCVGPEFNHEAAVGWTKVYSVFLKYLI---- +>sp|P15447|GLB4_GLYDI Globin, monomeric component M-IV OS=Glycera dibranchiata PE=1 SV=2 +-MGLSAAQRQVVASTWKDIAgsdnGAGVGKECFTKFLSAHHDIAAVFG-FSGA-------SDPGVADLGAKVLAQIGVAVSHLGDegkMVAEMKAVGVRHK-GYGykhIKAEYFEPLGASLLSAMEHRIGGKMTAAAKDAWAAAYADISGALISGL- +>SRR5256885_11466498 +--------------------------------------------------------------------------------------------XM-LLFF---------FSSRRRHTRLQGDWsSDVCSSDLWGAAYQQLADILIG--- +>tr|M3IRU3|M3IRU3_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) GN=G210_0056 PE=3 SV=1 +-QELTPDQLRLITECIPIMEdlNLTLGSKFYRRTTRRHPHLQSYFNE-TH----------HKLLRQPRAFIFTLIMFAKNIHDltpLRDVIRRIVSKHV-GLQVKPDHYPLLGDVLIETLCDMFPYHmVDDKFKTTWSIVYANLASLLIG--- +>tr|Q86G74|Q86G74_PHAPT Hemoglobin II OS=Phacoides pectinatus OX=244486 PE=2 SV=1 +MTTLTNPQKAAIRSSWSKFmdNGVSNGQGFYMDLFKAHPETLTPFKSlFGGLT-LAQLQDNPKMKAQSLVFCNGMSSFVDHLDDndmLVVLIQKMAKLHN-NRGIRASDLRTAYDILIHYMEDHNH--MVGGAKDAWEVFVGFICKTLGD--- +>sp|P41260|GLB1_PHAPT Hemoglobin-1 OS=Phacoides pectinatus OX=244486 PE=1 SV=4 +-MSLSAAQKDNVKSSWAKAsaAWGTAGPEFFMALFDAHDDVFAKFSGlFKGAA-KGTVKNTPEMAAQAQSFKGLVSNWVDNLDNagaLEGQCKTFAANHK-ARGISAGQLEAAFKVLAGFMKS------YGGDEGAWTAVAGALMGMIRP--- +>tr|R1EGH0|R1EGH0_EMIHU Putative nitric oxide dioxygenase OS=Emiliania huxleyi OX=2903 GN=EMIHUDRAFT_435200 PE=3 SV=1 +-SGMSAETIATVDATAGAVApfALDITKDFYGDMIASLPSvVLTVFNP----AHNVPI-----STHQPEALAASVCAYATNIKDlspLlvpGGAVDAINHRHC-ALNIQPAHYLPVHDHLMGSIAhvlgPKLGDALTPEVAGAWSEAVRFLAKVCIDK-- +>ERR1711974_215400 +----------------AKVseNIDINGGILFQKLLTDNPELKELFW-RANKGQQgDQWRNDKNCQKHGKSVILEIGRCLSAVDDaeeFSSLLYKNGVAHK-SRKTTEEHFPLVGEAVIYMLAEALGEELNDECKAAWLGAYGVITEHMLRGL- +>AP12_2_1047962.scaffolds.fasta_scaffold738771_1 # 1 # 321 # 1 # ID=738771_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.648 
+----------------------------------------------------------------------------------------------------MSFITVPGVAArsSFVwlrestaalrgpalvaliyflgaeaafyigtlsdrifalfwpPNVvLFCALLIVPQRRWWLYIAAAFP-------- +>SRR5260370_506041 +-----------------VRD-YSSTCSF--------FFFLQAEDG--IR------------DSS--VTGVQ---TCALPIYqerTEQVLSRLAVDHR-KFGVRDKHYEPFFDAVFATAEHAAGPAWTREMATAWRSALDWFGSVMA---- +>SRR5580658_2929351 +-----APLRAIV-EEVLRSGgg----------------------------------------------------------------------NVAA-GTGVRRNASLFHGAREPPGFYD--MpGLRELSSSYPWFQV---VP-VIS---- +>SRR5258708_13478776 +-----APLKAII-QGILRA--------------------------------------------------------------------------G-GPLLRRETRPLVGAPRGQKALL--PpHPPGSGSVASRPKG---IS-L------ +>SRR6266704_2687724 +------IARPPDR-RPRCGD-GVLLR-P--------AVHRQSRPA-------------------RAVSLRDDANPRGGLPDadrAGQEP--GRRACD-RAGPRPDRQGPpqirrepeALPAVLR-RAVRDGRAFRRPGPDRRDGRGLA---------- +>SRR6266536_777504 +-----DGYREALDASFARVAssGEKAVAYFYGRLFAATPRLRGLFPA--AM------------DYQRDRLLCALLQITQRLSNraaLSEYLVQLGRDHR-PPGVPPAV--PGGAACEHPNPTLA-pGVAPllsgvraagqrvarVPHPRRPRRLGQHVPGAVH---- +>ERR1719498_564827 +-RRWTERKRLVIQSSWAALLsahgndRMATGSKIFRKLFTGDTAVLRLFP-FRHQ--ARTLFVSAPFKLHAKLFVDTMTELIANLHDLEkveRDVRELGKRHL-TYGVQPAHFDAMGEALIAVLDESCHhpSdevTLDKEERDAWLGFWGFIAKETQR--- +>SRR3569832_1708069 +--------------------EEVAGVVLFQRLFEKCPQTKVLFG-FPiDIDpSSKELVTSKRFLMHASYLIQMLDTALNMLGPdqelLTDIMLELGTIQS-AFCVASVCVIC------KELETHLC--f-------------LRLLCQAX---- +>SRR6478736_5796684 +------------------------------FMMGV---IASGMVV-TG----------AERRGRPKAVQPGNREWITVIQAinaEGQA-----------------------------IP-PFIIGAGQYHLANWYRDSNLPGNWAIA--- +>tr|T0QF73|T0QF73_9STRA Uncharacterized protein OS=Saprolegnia diclina VS20 GN=SDRG_06019 PE=3 SV=1 +---ISKDVQALVLANWAAISsgstPAllKIKpaspvvyfyDYFYGMIFEKAPAVKPLFRS--SI------------IVQGKALINIIQSITSavNAPNVIEKVCDLAYRHN-KYGVKIEYFNLLGKCLLLAMHDCTGDTFTDELREAWRAAYAYMVMVMTP--- +>ERR1719210_139600 
+--------------------------------FTLL-----DPPGQKrnvaqawsAVVqADVAILVVSANPGEFEAGLAK-------------------------GGQTREHAVLAKSAGVENLVVAVNKMDSVDGEGKWSNLryee------I------ +>SRR5256886_2416282 +-------DREADADREADADrdGDAEPEPLTAPALSSPPAV-PLAPP--RD------------EAARQHdEPEPAPPPDQVPGAadpretagppeppeeppp-------DGKGEP-AAG-----PDPAIAAGQEALRAFARE--afTSAAEEAWTQVYLAGSSLMIK--- +>SRR5581483_8202477 +-----------PDDPVFDGMqgnvGRvaarylphrEGEAYVAGPVGMVRETIRALTRA--GL------------PRERIHYDDALLAEDKQASAqgvagatahtsrtpessrpgRTGEAGNAGPDGH-IrrvaesdqAGPAGGTAEPGQSGLRDAAADIAPQ--------ADTAHQDGGPHDDQagA--- +>ERR671911_2215695 +----------------ELEPacapDKQLVEHVQRlRVEAGAQVVGR------EEERRSRAgqCPRPTSRVDVRGTHDD--------APlecVAEVLVDCGAHAR-VACKVDergraaleLLDRVVPDDLVVDLHAVDEVDGGGQTgHVGPGTSSRRVstarakpQAGTLPQ-- +>SRR3954453_16132976 +-------NLQALEESFDAVAphGDELMDEFYGRLFEAAPAVKPLFAH-TDL------------KRQKAMLLAALVLVRKWRPAraLSGHRR--GAHRL-HGCRRGARVDGRVRGRL------GRGAWRGRRRDDRGR-------------- +>SRR4051794_7197155 +------------------------------PHAAAAPVLPARLAG-RPRPAGAGPISPPARRVGRRVRPLDRVPPPARRDVaraARERLRGRGAARA-AGAGGSDLAPPVRHARVGAAVAVRGDLGGAAGIAAESAPSVLPWTTTRSK-- +>SRR6188474_1917881 +-----------------------------------------------------------------------------------------------------------LNFVFEkiktKKLIPMTQKQIELVKSTWSTV-----AAMDH--- +>ERR1711894_485352 +----------------ILLYnYrfLTYVIYYYYRFLAEDPTVASVFSRV-NVD----DQQSGEWHAHMLRIMGGVDILINMMDDvnvLTEEVKHLRAQHVVREGVTHERMKAFLIIMMDELPKVMT-HFNH---DAWKSCLSKKLKRIGG--- +>tr|A0A0S2MLM2|A0A0S2MLM2_9ANNE Extracellular globin OS=Galathealinum brachiosum PE=2 SV=1 +----SEGDADIVIKQWASVMnAavsgenrVVIGRQIFNSLFLKQPAAPALFPY---GS----DLDGAEFGAQMSRVLSGLSNAINSLTDddlNVSIMDHLNKQHVVRDGVTAAAMKDMQVSIEDTLKQLVT-DYND---DAWHDCLGVAIERISV--- +>ERR1712217_222699 +--------------------IDNIGEVFSQKLFALSPRRHARA----GM--------------EWGPVVKGIGHAVDNLTNLDavaVKYKRLGVLHR-CIGVKEHEMREMGEAFILSLRDVLGKSFGHQAEAGWRAVYCFVAHAMMA--- +>DEB0MinimDraft_6_1074348.scaffolds.fasta_scaffold06817_4 # 3572 # 3886 # -1 # 
ID=6817_4;partial=00;start_type=ATG;rbs_motif=TAA;rbs_spacer=12bp;gc_cont=0.311 +------LQRVRITRQWRKAYgtgshRLDFGLKVFKHLFEAHPTARALFADHHSD----N-VYSPEFEAFSERILNEFDIVIALLDDpaaLSAQINHLKAKIT-KRHVTTEQLTVFGKNTLEVIPEYVGNHFD---HSAWTDCLKRLRSALTV--- +>ERR550532_3441629 +-----YRQVFQLKNSWKTVSrnLDDTAKENLLKFFRDHPEHKALHKKLTKYEDEASLRESQAFEDAALAVFNTFDEAMDMIekDKVdyaITTLHMAGKSHSAIEGFQPAYFKDMEESFLYAVKLTLGDRFTEATEQNFRRLFEFTTQQMIEGM- +>sp|P02210|GLB_APLLI Globin OS=Aplysia limacina PE=1 SV=4 +-MSLSAAEADLAGKSWAPVfaNKDANGDAFLVALFEKFPDSANFFADFKGKS-VADIKASPKLRDVSSRIFTRLNEFVNNAADagkMSAMLSQFAKEHV-GFGVGSAQFENVRSMFPGFVASVAAP--PAGADAAWTKLFGLIIDALKA--- +>sp|P09965|GLB_DOLAU Globin OS=Dolabella auricularia PE=1 SV=1 +--ALSAAEAEVVAKSWGPVfaNKDANGDNFLIALFEAYPDSPNFFADFKGKS-IADIRASPKLRNVSSRIVSRLNEFVSSAADagkMAAMLDQFSKEHA-GFGVGSQQFQNVSAMFPGFVASIAAP--PAGADAAWGKLFGLIIDAMKK--- +>sp|P21660|GLBP3_GLYDI Globin, polymeric component P3 OS=Glycera dibranchiata PE=1 SV=1 +-MHLTADQVAALKASWPEVSagdgGAQLGLEMFTRYFDENPQMMFVFGY-SG--RTSALKHNSKLQNHGKIIVHQIGQAVSELDDgskFEATLHKLGQEHKGFGDIKGEYFPALGDALLEAMNSKVHG----LDRTLWAAGYRVISDALIAG-- +>SRR5690625_2040278 +--------------------RDGFGARFTEELLSRYTEIREALPD--EPA------------WVARAVTAVTDALIDVADDpgaLVTVLERLGVDNR-TVGVHSAHYAPIGHALILAARAVGGTAWTPDIERAWVDGFDVAAEVMVT--- +>ERR1711963_100213 +-TSLSEGTVEVLKACHPLLKdvRRVIGKAFYNRLFKEYPQVKPLFSQ--SD---------AARTHQTLALADALIAFTGRQLLegF-EAKQRGQ-ERS-LRLRSLQAGSWQGLWRLPSRDRGERD---QNEGSQIKPQILTIQ---QD--- +>tr|A0A0G4EPR9|A0A0G4EPR9_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_12573 PE=3 SV=1 +---MSDKERgVLIDKTWGLLkeryTLQEIGEELYDNVFKNAPDLRHLFKR-PKELMA---------LKFGEMISTIC-GLFQtDRESLLETMRDLGIRHV-DYGSRPEYFPLFKACLLDTLENLLEDGeFTAATEASWNDMWDEASEMLIS--- +>tr|A0A0Q5LAI2|A0A0Q5LAI2_9MICO Uncharacterized protein OS=Frigoribacterium sp. 
Leaf164 OX=1736282 GN=ASF82_14980 PE=4 SV=1 +--VITSSHLTALRSTLPLVeaRAAAIADDFYARLFADRPDLLrDQFNR-GD----------QAQGRQQRELALTIVTVARDVVgtqvgsgpagsatgpavpvapwsspapspwavrvAARETLSRLAQRHA-AIGVTRDEHDVFERHLRDAFAAALGDDWSGVVVDAWLALWRQTRDELVA--- +>tr|A0A1Y1I4E0|A0A1Y1I4E0_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_002310190 PE=3 SV=1 +-VQLSPFEQQLVQKTWKLLQprLADLGQAVFTHLFQKAPKTRPLYTCPLRLADGDrRTPDGHAIPTHAVEIVSTIGLAACRIGSssrILAVLERLGQRHV-AYGAAPDMFSVFKEAFLVALKKTLGGeHFTAQVHKAWSKALDSVVAHLKKG-- +>ERR1719296_130621 +----SVQTNSDVQKSWEKIQeigILRAGEILYKNIFELAPSARETIPPevlekyrissFLvslNEDeLDDAFIENAIWSDRAANIFNVVGHVVRGQHDfgrLVPMLQELGSRHV-GDGMPEAILKVVVPAFKFALHELLGSMLTEDLEHVWMVGLELVNSHMIQGMR +>ERR1740115_393061 +-NLLTPETVRVVKETSPRIAsmAPALSSSFFKRFLS-HPDLAAYKASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAWDEAIMVLADICVD--- +>ERR1740130_2673129 +------------------------------------------KASR-H-----------NGEAKAAAVAAAVTGIGDSIDNlrsLSGAITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAENHRLTINLFL-LE--- +>tr|A0A0K2UHU6|A0A0K2UHU6_LEPSM Uncharacterized protein OS=Lepeophtheirus salmonis PE=3 SV=1 +--YLSKKQKDLLKRAWVALhnNLSSVGMTTFIKMFETHPEALKFMiPKLTqeeekktqpnySLDSRLDPWHSEKLREHAHRIMKTVSDVISLLNKdeekIEEMLVALGGKHH-GFGVHIEILELMGPHFISAIYPTLKETWTEELQEAWQCLFNYIIALLHIGF- +>tr|A0A0B6ZHC3|A0A0B6ZHC3_9EUPU Uncharacterized protein (Fragment) OS=Arion vulgaris OX=1028688 GN=ORF61548 PE=3 SV=1 +-TGLSARDRKLIKDTADIIfgqlKLQNKGVVFLIAFFKAYPHHQRYFKMFRGIP-PDELKSIPHTENHGRRVMSNVALLVQHIEEpnvIKEQLVDLLIKHN-PRSVKPRQMKDMLNMFVDFTSQQLGAKFTSQHETAWRKLTTHILSVLEE--- +>tr|A0A2H2IJL2|A0A2H2IJL2_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1 +-------------------------------------------------------MNAVELRRHASVYLKGLGKIIESMRNeeeLGKSMSRIAQAHI-KWNVQRNHVIVSMGKTEIRQRATNSYALKS---------------------- +>ERR1719270_1027131 +-MSLSTETCNILKICKPLLenNRENIGLTFYKKLFDENPGLKNVFN----MGHQR--GVdd-DKPGRQQFALGQALVAYCLHCESldkLASFVERVANKHV-SFDVQPEQYPVVGGILLATLEEVLGKEtFNEDVKKAVADAYFFLADVFIS--- 
+>ERR1719318_1430785 +----------------------------------------------------M--N-----NAQGNSLANAVVAYCANCDQleaLGPTVAKYTVPTC-KYIFHIS-------S-------TRPLKmFLPI---SX---------------- +>ERR1712088_143820 +-------------------------------------------------------N-----NAQGNSLANAVVAYCANCDQlelLGPTVAKISSRHV-SLEVTPEQYNVVGGAARQRSlqrssQRCRGRGlLFPG---RHLQGERGKNDRRSQ--- +>tr|F6WSS9|F6WSS9_CIOIN uncharacterized protein LOC100181975 OS=Ciona intestinalis OX=7719 GN=LOC100181975 PE=3 SV=2 +-MPLTEIEIEGVQESWEKVSsggPKTTGLILMEKLFNTYPASIAVFSHLGIPSKPdgaitvSDLASIGGVSNHAVSLASRIGKLVGLLNNeteLKESSTEVGRIHV-KYGVTSEHVDLLGSVLLSVISENQGLSNTSELIGWWSKTWNIIGNYVK---- +>SRR6185503_2239525 +---MDSGHKALIRASFGRALtVADLAVELFsGRLYLLDPALWTLLDLGS--------------RRRQQELVQVLAWAIEHLDRfelLASTLEALARRCV-GNGVREAHFERIAGVLLWTLHQVLGDTYTAGTAAAWRSTSGLIVERMKQ--- +>ERR1740129_283753 +--PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--TPPEQGCLPKLYEWMSRTMTS--- +>ERR1740123_30535 +--PLTRREIRTLGLSWSKFHgcRQEFGVELLVQFFQLVPEASDLFR-FQRE---KTISENPGLKNHADRVVRVLSRVIHNIlslEEVVPDLKALGMKHYMDYGVSPTHYCLFGKALLGTVQTF-GG--GGLLARSGAeSVFPPGARA-GD--- +>ERR1719193_1971274 +--VLTADDIKAIKAIWFPImkNPADLGVALFEKFFLLYPQQKDKFKFMKYD-----DLREKGMRAHGEKVVKKLDEAVLLTlYrsRIKHCFQRIGFSHL-QMGIKEEDMQQLGEAIIATVEDAFVDKLTPEEIGSFKKFIKLFTAEF----- +>ERR1719193_859649 +------------------------------------------WRMLKKR-----H------NRDGGKLLH-PLKTILQTcYksRIKNCFQRIGYIHF-RMGVQEEDMEQLGEAIIKTVEAAWGDEFTPEEYAAFRKFMKKFTAAF----- +>tr|I2G907|I2G907_9HEMI Hemoglobin A OS=Anisops deanei GN=HbA PE=2 SV=1 +-FSLTDREVEVINQSWNQIKAqeLVVGLQMFKTLFQRYPQYERLFTHLH--QSGKSLYEGDRFQRHVVgNIMSSINKVIETLNssdNAVKTLQDMGVKHK-KLDVHRKHFESFVPFVVDAMVSVRMSMSQDEVASAWTKMMEGVASNLSKG-- +>ERR1712157_679996 +MKPLSFTTMDCVLSSWEQVRripnyRETVGLAILQKLIHRMPEGREVLHMQRNLIknSPPGIESDKLLLAHARAIVNGLDTVVEllgpLIDDISEILREIGKSQYHDYGDSMALWNpLMRECVLEVIQETLKDDYTHELKVAWTDFLGEVAKDIHSG-- +>SRR5438477_4839339 
+------------------------------------HGIEP-IPH--RY------------AAIRRVVSGRE-----------AQARRVGQRHH-AAREDQRR-------LRGL----ERRRG-RPPARHVRL---------AA--- +>SRR5262245_20667862 +-----------------GRAdpLTLLCEREIARFRG----------------------------------------------------------------I---ELDGIGRA----TALF------DGPARAVRFARAMIARGRAL--- +>UPI0003969FE8 status=active +------RPFEAA---------------DRELLFGRAQDIRAVVEQ--LR------------TDPLVLVTGDSGVGKSSLCRagvLPQIREGALNDVR-RWSVAV---LSPGRWLLDTLGDA----LA----------------------- +>OM-RGC.v1.018126893 TARA_122_DCM_0.45-0.8_C18859060_1_gene481717 COG0677 K02474 +------SELW-------RGRprKTSLPAgssiRTRTAvlvplgrgketapssssanfvlnLTDVPPEAQELRiTA--EV------------DDQRIHFQRRVPADVD----kvVMELPEGSLARKV-R--VEVAAFD---------------------------RR-CS-IAAFRA--- +>SRR3954454_16888348 +-VISRSAVIRHVLPTP----aepaaVDHIGQQVADRTSQQDRGERVLLNRT--------------aHGLR--ALADGAARLRIAAQSvadvtRTPLVGVLRQLRS-ALGDVSHRLCGLSDHAEAllgAIKDVLGDAATDEILAAWGEAYWLLADVliar------ +>SRR3954471_17335278 +-VISRSAVIRHVLPTP----aepaaVDQIGQQVADRASDKDGGERVLLNRT--------------aHGLR--ALADGAARLRIAIQSiadvmRTPRVGVLGQLGG-ALGDVPHCLSGLSDDALGccaTCGCYLCR--------SRGGASWSFFCHaalr------ +>SRR5215204_1408335 +-ATGGPTRWATMRGRWPLMS-------MLESIAQSG-SGRPVWYVH-GAR---------DrrahaMGDHARALAADEHAGK---------HRAVRQRT-------------------------------AG--------------------- +>tr|A0A167F9Q7|A0A167F9Q7_9ASCO Uncharacterized protein OS=Sugiyamaella lignohabitans OX=796027 GN=AWJ20_2623 PE=3 SV=1 +-VVFTPGEISLLRNIWKEISEnnLDhgrglkssqastfFCQQFYENLLGDHPSLQTLFPSL---------------QSQSAAMAWVLGQIIAQLEDVsqaQSVLIKLAKWHSRLMNLEPVHYEYVGSSLLRTLGDRRGDKFTAQEENAWIKLYTFIANVMLK--- +>SRR5262249_41403170 +---------QVLKESWARVEgqQEALAAHFYARLFLARPDLRELFPI--------------QMRPQGRRLLVGRARATEPGGAPDgASSRERGRPRR-RYEVSAEHHAVFRECLVAAVRACSGRDWDAEREQAWREGYDVLARRMVA--- +>tr|A0A1Z5JNP0|A0A1Z5JNP0_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_8Lh328 PE=3 SV=1 
+---LSSTSLLKVIACWEQSKsrggfDETIGIELMLTLFEMNPQARSQFG-FRTDQ---VIDKNnglqrMGILIHGQRFIRTLDCLFSLLgpddDNLEEVLRDFNKESC-QDGMPLPQFLLLLGILVKVMAHTLGGDWTDEVQFCWMEVITHLEVIVT---- +>tr|A0A150GQ95|A0A150GQ95_GONPE Uncharacterized protein OS=Gonium pectorale GN=GPECTOR_12g483 PE=3 SV=1 +--GMSLEEMEQLQGSWAFLSkgafpgevkeqLESFSVDFFMALFEQSPGLINLFP-FKDVNG---KPIIEQLKVHGLKVFQTIGAVIDMCNNysvLLRVTTDLVARHI-KYGVLAAHYDVLFQVLVGILTNVLGSQFSGTLAAGWVKLAGFILRVVKDVY- +>SRR5215203_5896321 +----LVRERRLVREAVAMVdDQDRLIRDFYMIVFAMGGAeVIGMFPT--DMR------------RQRHEFGRALVQWVsaDDPDSIAAHLDQLGGDHR-KFDVQPAHYAVTGEALVAAVRGRCGGRFTAAHEEALRGSYGRLATIMIDG-- +>SRR5580698_8666230 +----PDLEKMAARSPWLTVtA-------------------------------------------------------------------SLSAEPV-SLGHGPRTEHgtvADVLARLGTWREHD--------------AYVCGSSAMVAA-- +>SRR5919204_299658 +--------------------------------------------------------------------------SDlrSGPTSRCTHVRC-----R-QQRSPPRHHRClRPRSPAPSWSARlsagfrssscrpstnRPARRRGRGRSTILASYTRLASVMLDG-- +>SRR5688500_16794215 +------YDARVLRGSFAQLRprIAQYSPVFYEHFWRDYPETRPLFG--RNMSKPE-------LDTRINHFM---LWVTENADRphfTIDYIQSVARRHV-GYRIRRRHFAYVDNTNIKTLRELLGDSFTPEVERHWRASFRFLTLLM----- +>SRR5947199_2475351 +---------------------DELARAVR---lQ--gSRRIMEEHAC-GAE------------GRQLARLFDERGRLARAPRAVDEPGLELGARvsdgrcglakigdvverivqaedvdavRR-AGGDELADEVIVS-------------rtRADDEtseqrepayrigprtqCSDAFRRGLERPAGAPVQT-- +>SRR5919197_1330773 +---------------------RATAGGLYGVLprlR--rgrrRVSVRCNHAG-TDL------------KKQKTMLLGTLVLLRKPLrdlDAIVPKLRELGARHV-ADGDEGGDELLEEQEGKGYGED-EGEgdeafdapLIDEX--------------------- +>SRR6266516_4891354 +-------------------------------------------------------------------------------GLGDGGRAEGGNRDS-GRGEQLEHLGCVHDVLLSFSESTVSTlphqaarpapaaegagpAITRRetadrapprrhrvggfLRSAGAARARSSIDRMTET-- +>SRR6266508_4596506 +-------------SAFVRL-tdARRVARCLPSAH---pGDETPSTFPS--ET------------GDPVNLN-----------LEALETSFDLVAPRG-DG-SEATEDDVVGHPGPPA--QVA-PRPRGDRPQAA---------------- +>SRR6185295_10958302 
+--------CILLLVA-----CFLTFKLFFYSMFQDYPEYKNLWPKFRHLN-DEALINTGELSNFCSVYMDGWEKVIGELDDnaaLARELKIIAKTHL-RKGVERshimvakkealcqiriheyCYLQNMMPKMLSLLKEKNGT-LDAEVEEAWKTVFIINADIIE---- +>AntAceMinimDraft_18_1070375.scaffolds.fasta_scaffold521461_1 # 3 # 443 # -1 # ID=521461_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.569 +--------DD-------------DDDDDDdDRMFHDHPEARALFSRVHGDN-----TYSPDFEAHAQRVLGGLDSCISLMDDpdtLASELGHLKAQHA-DHTdVTAEHFDVSICFSsTDVTSTYTsthckimdrpnYTVFQT--RGQrnltksaSRRAHspvRDHPRGS----- +>SRR5476649_891947 +-------------------------------------------------------------ATSTRCCS--ATSRKCCRCSikpTRPTASSsarwptpcWLTQEI-SIawNnWARWHRPSStSMCRCKSsgNTIPWSApRCSRRYVKCWAPRWRPmpsstpgpprtvsWRTCWPV--- +>tr|A0A2D8PEV6|A0A2D8PEV6_9RHOB Uncharacterized protein OS=Maritimibacter sp. OX=2003363 GN=CMH11_20945 PE=3 SV=1 +---MTSQNAGLIRASLTELFprREEFAERFYERFFEQAPQVRRMFVH--DSE------------KQKLMLYAAIAMTMRGLEServLHSELMAFGSRHA-RLGVREEHFPIFGSAFLETLIHFLPQWDHPDLARAWWGAFTDMSTPIIA--- +>SRR5690242_2028058 +-------ELALLLQSYGRIGilIPKISENFYRRLFQLRPNLAALFAN--R--------------DADLKVEEMLRRIVAHASDAaaaKAEVQSSGRSHA-QWPLLPEDYRVAGECLIQAIIEAEGAATGSVVASIWRQAYVEVANLMIC--- +>OM-RGC.v1.029911412 TARA_036_DCM_0.22-1.6_scaffold294997_1_gene285712 COG0526 K03671 +----------------DRLRarGEPPSGNPYRGAAPYGPGDEALFF--GRR------------AE--------LEVLIDRVQkTpfvLVAGDAGVGKTS------------LCSAGLLPLVREgalGGPRHWACESIACGEEPLAALAAVLARH-- +>ERR1719414_683447 +MEDLRFETIRCVVQNWERLKynplFEEFAIAFYQRVLRVCPQAKSFFGSSFCLD------DQA---TMTQEFVRLIDRVLDLLGPesqlMVEVLRDLGSRHE-AYGVTVEMYDIMRDAFLLTLEQFEGEKmFTTKVRQAWMTVCSAVADVMMEA-- +>ERR1700744_5993147 +---VGLDDRDALGVLRDAFSqdesgsGNELVRRFYNHWVELDVSVRDLFPP--GME------------DQRAAFAQALNWLYservaQRAEEPVAFLAQLGRDHR-KYGVLPSHYETLQRALYATLRSYLSdpsrSAWSDAVDEAAGQSLNLFTGVMSG--- +>tr|A0A1E3QTC6|A0A1E3QTC6_9ASCO Uncharacterized protein OS=Babjeviella inositovora NRRL Y-12698 OX=984486 GN=BABINDRAFT_161163 PE=3 SV=1 
+--NFTPAEIATLKATWSMEAKDTnsgdiadpkntlFGTTsfwehVYSLVGEEHPEVVHLLPP---------------ITHQTQAFSGMVYLCISNLDNlsrLDEYLASLGRRHSRVFNALRLHFEAMGSGVLKSLYNHYGEAFTADISDVWARFYCFLANSLLQ--- +>tr|A0A0A9XWX4|A0A0A9XWX4_LYGHE Globin OS=Lygus hesperus OX=30085 GN=GLB_0 PE=3 SV=1 +---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNLHSnnrrkNKELFEKLATIHA-KRKVSAQQTPYIKHTLMDILH--L--EPHSAMEKAWINVIDTLF-------- +>SRR5687767_4837246 +-----EKQVLLVKHSWSYQAgqLENLGTLFTKKLVALNPGLKAPMKR--SL------------AETGSySLMVAMNQIVAALPDLhkaQNHIQVIVTEYA-ALGITRSDYENALIAFLLALEKRLGKSWSDEIREAWIFIFSSLYH------- +>tr|A0A0S8AZS8|A0A0S8AZS8_9PROT Uncharacterized protein OS=Betaproteobacteria bacterium SG8_39 GN=AMJ64_12515 PE=3 SV=1 +--------TGLITESWNALGagQRAFVEAFYQRFFERYPDYRPLFPL--ELN-----------PRHLEKMVQTIALMADQSQDrgrIAPHMHTLGQAHK-AYDLSARDFDNFKRTFVEVLGERLGRQWSAEAEKAWNDAFDAVLVP------ +>tr|Q9NG75|Q9NG75_9CRUS Hemoglobin P polymer OS=Parartemia zietziana PE=2 SV=1 +-TGITDAEKQLVQESWELLKPDlmGLGQKVFGRIFTKNPEYQTLFTRvgFGDTP-LTQLMANPAYGAHLIKVMRSFDFVIQNLGKpktLLAYLKNVGADHI-ARNVERRHLQAFSESLIPVMQNELKAKLKPEAVAAWRKGLDRIIGVIDQ--- +>SRR5579875_723516 +------------RESFARIAprKEEFVASFYQTLLEKYPHLQRMGAGV-------------DVKRQRKSLLATLQVMLNETDRgeeLRTQFRKPGQRHN-ALQIRAEHYPAFGQTLFETLALY-DPQWTGELRVAWAAALEQCVRFMMEDLN +>SRR5579871_3449338 +-VPLSALHRYLVRRTFTHLaiHADEVTALFSQRLVELNPALMIIIV---DEA-----------GTQRYRPLEILARVIALMDRpaaLSIQLKLLQAQQQ-R-SVTPDHLRQMGEALLWVIENRLGDSFTPDISAAWLHFYRFLGE------- +>SRR5215472_5690244 +-----HFDVQVIGAALTRLAdpAVDAAEYFCSHLYSISPDAAALFPS--EL------------AAQRELFADAVIRVQHSLESgsgLAEQLATIGRQSR-KFGVTERHYAAFMLAMEKTARHFDTGG------------------------- +>tr|F2UQX2|F2UQX2_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_10302 PE=3 SV=1 +----DDSAMKITQESWAMVEREipNWTDIFYDKMF-SDPNIAKLFP-FS----AGDFKTNEKFQTHTQKVRDTMHTAMTSIrefEKLGPVLKKMGERHA-DYGVIPEHSVNFKEAFLHTLKTGYGDKWNEDLDDAWNQCVDALLE------- +>SRR5699024_1886671 
+-KTLDPQTIETVKKTAPIIKdnVEEIGKTFYNILFSRHPELYNIFNQ-SNQ----------------ERGlqqealaygVYLAGINIVNFEPIQSLVTRVAKNNR-ALKVRPNNTLLLERR------------------------------------- +>SRR5271157_2714777 +MPSRIVDRLTALRAFFAEMEpqLPVIVARSYERLFDVEPAIALLFK--GNA------------REHQLRFLAKLQSIVKLTRSsqlwpasaatgqiLIPEVLDFGRSHA-KIGVLPVHFSLLNDMIAWTCKEIAPLRFTPLVEEGLAFVFDVLGASLTAK-- +>tr|R7TLW3|R7TLW3_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_227018 PE=3 SV=1 +----------CAEITWAILseNRDGLGTEVFVRMFESYPDLKSAFGPLRHMNKKDAGY-EDVLRAHGIRVLSIVEQVLSKRHnmeEVLSILHDLGRKHL-TFSAKVEYIDIVSQMFLFAIESALKEKWNNSTEKSWGEIIRFVTYVMKET-- +>SRR3990170_2029843 +----------------------------SPCTTTRSPCWTRPCAS--W------------AT-----------APTGSWAtstpPsssRLPSCAR--CSRR-RWTCSATG----CSRRSPAPRHYAEDVWVPELEDAWLRAYAAMSTTMIEG-- +>tr|O97381|O97381_ARTSA Hemoglobin C1 polymer OS=Artemia salina OX=85549 PE=2 SV=1 +-TGLSGLEKNAILNTWGKVrgNLQEVGKATFGKLFAAHPEYQQMFRFFQGVQL-AELVDSPKFAAHTQRVVSALDQTLLALNRpsdFVYMIKELGLDHI-NRGTDRSHFENYQVVFVEYLKETLGDSVDEFTVKSFNHVFEVIINFLNEGL- +>ERR1719468_1094774 +-PPLTSNDRKLIVRSWTIVDqqISQVGLSSFLELFRRAPETLSVFPFLKQLG-PEDMEFYHQLKNHSIRITGVISMLVKQLESeerpadeaIRDLLLDLGRRHF-SYGAKTSHMELLGRVFAESLQPIFEGdPEAKAIQEAWLVFFSVIVFWLQKGFR +>SRR5262245_31323877 +----STDGAGLVMASLARVSdrSDQMIASVYEHLFAHRPELRLLFPS--DL------------KHQRAKLAGALRFVIENLRNpehVVTALEELGQRHI-AYGAKVSDLSSLGEALMSALEAHDPNPWDDLTRKAWHSAYDSIARAMSRGM- +>ERR1041384_2362020 +--------------------ANVLGERKvVAVLYSDLRGFGTL-----SE------------TGHAVDVLERLNDYFD----rMVAAITSHGG-------------------------------------------------------- +>tr|B6BNK3|B6BNK3_SULGG Putative globin OS=Sulfurimonas gotlandica (strain DSM 19862 / JCM 16533 / GD1) GN=SMGD1_2554 PE=4 SV=1 +MQELSQKHIDIIKESAELItaNDLKITNKMYEILFYKYPHLEMLFEN--------------APDNQFMKLAEALSLYAVNIDKiekLIPALELIAIKHV-EVNIRPGHYSMVGMALIEAIEEVLGKMAPIGFIDAWREVYKYVSDILIE--- +>SRR6185437_15632065 
+-----ADDVAIVRDSYGRIGprGAALTIAFFGLLSDRVPRVRKFFPP--DD------------KDKRAVAKDLFDLVVGHLESqlnVRWVLERMGRRGL-LDTITPSDVSAVGGCLLDALAELDE-AWSPATERAWSRVYDWAASAVV---- +>tr|A0A0K8S6V4|A0A0K8S6V4_LYGHE Uncharacterized protein OS=Lygus hesperus PE=3 SV=1 +---ATPEQVAMVKKAFDPLsvDAPGVGKVFFERLFELYPGSQKYFQHLG--STDEELFANPVFQHHCTKVILSVGTMIDNYTQttaekTKSCLRNWQRFTP-NGKFPPSKHLTSS-IHLWTFFTWNHIQPWRKHG------------------- +>tr|A0A0S8CN91|A0A0S8CN91_9BACT Uncharacterized protein OS=Nitrospira bacterium SG8_3 GN=AMK69_14025 PE=3 SV=1 +--GLPPSDISRIQRSFRMVAsqGEKMASRFYDLLLERSPELQKFFHP-GNLS------------QQHAKFFNGLHSLILHLEHpqaLRAALVQLGEQHQ-GDGIEIQHYPPVVDTLLQVLTEFSGEGMDGETYDAWAHFLHLVRAIMLENH- +>tr|A0A0Q9HRJ4|A0A0Q9HRJ4_9BRAD Uncharacterized protein OS=Bosea sp. Root381 GN=ASE63_23130 PE=4 SV=1 +----GDRAISLALASLETMGSeaEQADIMFNIRLLETYPDVYRVFC--MDFA------------PEERSFLRALAFILAHAGPfgaIGPTVRALAPSDK-VCRLISSRYHELEETLMWTLRRRLGVAFTAEVENAWRSVLREAPGVS----- +>SRR4051812_34838903 +-------------------KPirNRAIKLFFSRLIESHPSLLTVIG--DDYE------------AKARSLRPAVEMIIGCLGNmeaLRPILRSMARSNA-ELGMQEHHYLTAVNTILWTMERCLGSAYSAEVDAAWEDVCWQVCEAM----- +>tr|F2UFM9|F2UFM9_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_06664 PE=4 SV=1 +-MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETeRLLYSS-DKS--------KSWNERHMARVGKSVGDVIKSLSNyddVIEHLTTGEPHEQ-ACCL--------TDG--YVIGTGLGNT----PRSLWLACGS-------T--- +>tr|K0T9D6|K0T9D6_THAOC Uncharacterized protein OS=Thalassiosira oceanica GN=THAOC_11871 PE=4 SV=1 +----------------MEREdssGSL--PSFVSETEIEPSDVQPaaasgenNVDKGRR------------KTSSSSKRTPSITKRIESFSSfksLSSSFS------------------SKLDDERNAGEAGQAERVEsttapESVASGETQGNAGGQHTLN---- +>tr|A0A165S3D1|A0A165S3D1_9GAMM Chemotaxis protein OS=Halioglobus sp. 
HI00S01 GN=A3709_07715 PE=4 SV=1 +-----MTAIMMIDRDFTVTYanEAT-----LQLLRDNQATLSSIYPGF---N----------PDKLI--------------------------------GSCIDGFHKNPEHQRNILADPANLPWRTDIEVADLKFS-LNVTAIVDAQ- +>tr|A0A1I2IR29|A0A1I2IR29_9GAMM Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor (Fragment) OS=Fontimonas thermophila GN=SAMN04488120_104136 +-----KGVIQYINRDFIEVS------------------------GF---S----------ESELI----GSPQNIVRHPDmPveaFADFWAT----------------------------LKDGKPWTGLVKNRCKNGDHywvLANATPLRAN- +>CZCB01.1.fsa_nt_gi|955242656|emb|CZCB01016507.1|_3 # 1728 # 2327 # 1 # ID=16507_3;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.493 +-----GVSSFEMNQQFSAQSsdSIEKNIAAISELWQKYMATnitdeekvladkfvatrgafvkealLPAVDAL---R----------ANdYEKAKLFSTKARDLYNVAHpalVELIQYQAGHAKL-EYDTSVESYKLTRNWTIASLFLAVGFLACFAYFImrSIANPLSvifRVLDNIKSN-- +>SRR5918993_5799879 +--AMTPEQINLVQRSLPAILaIRDRATARAgERLAVLDRAPGRLFAG-ADI------------GRQGAVLINAVTAAMQALRsgDYGSVLAALSQYHL-SYGIGPQHFRSAGAALARALEQELGSSFTADLGHAWAAACEWVGRII----- +>SRR3954452_18192940 +--XMEPQQIKALKQSLATVLsAQEALAVRFhQHMRRFEQCPRPLFTG-APL------------ARQGVLLTNAIAICA-SLPskNlsQAVAAGALSQYHA-SYGIASHHFHSAADALALALKDELGHIVSDVAIDAWAEACRMLGQAL----- +>SRR6516162_8663010 +---MKAETISTIKATAPVL--KEHGQAITQRMyeiaFDARPDARQLFATT-WM------VSSEEGRKQAGRLAGAVYAYAEHIDDlekLAGGSGAYRaaaRRHE-GPaGNLSGHWSVShgryqgcaKRCCHAGNPRRLARGIX----------------------- +>SRR5690348_5860809 +--QLPDGSVRLVKKSFAALEpvSADVMQYFYAWLFVQHPELRAMFPL--AM------------TTHRQRVFDALARVVRSTGSpaeFADQISHLARDHR-KFGVRAAHFKPFFAALLAAIREHSTGTWTSATQQAWEEALDCISAGLQT--- +>SRR5258705_5637504 +----------LFSQLYQCSKntGRRSRGFSIDTCSKKHPELASMFNA-RDQSD----------GSQARRLAAGVLAYASNIDRlhmLESAITSIGRKHV-SINVRPEQYPIVGKHPLGAIKTVLGDARHPKFWMHGQRPTPNWQRSX----- +>SRR3984885_15745818 +---------------------SRAtgGGWLPTRSPTGRSARTSR------T------------GCRRGRCDGNTRPTV--ggPAALGGGQCEDSARDG-KLGLSADHADSAGAGRVdlAAVRHPGGAGV------------------------ +>tr|Q7M455|Q7M455_BARRE Hemoglobin 35K chain OS=Barbatia reeveana PE=3 SV=1 
+-----PANKNLIRSTWNMMVGdRGNGVELMGLLFQRAPDSKIDFKRLGDVS-AENIPYNRKLNGHGITLWYALMNFVDQLDSkkdLEDVCRKFAVNHV-IRGVLDVKFGWIKEPMAELLRRKCGNDCDDA-IQAWWKLIDVICAVLKES-- +>HubBroStandDraft_6_1064221.scaffolds.fasta_scaffold2618798_1 # 2 # 181 # -1 # ID=2618798_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.622 +---CSAEDRSIIQEQWKILFkdvdsskiKIAVGRKLVLNLIQRQPDAKVLFDKF-NVD----EPNSPQFSAYALRLFNRIDLIINLLKDpeaLDAALEFNAERYGNIPNIKKAYFQTAAQILAYALPKVLD-DFNA---LSWQSCTRYILTTVASKVS +>SRR4051794_1382573 +--ALDPALLNLVERSRPRVEhkITELADQLYTALLAQVPGLRTLFPL--DP------------NGRRAPLTDPLIWLLQRLDDrdeLVRRLADLGRDHR-KHRITAAHYETAGHALLDALAHIHGPTWTPPLAAAWTRAYTAATHDML---- +>SRR3954470_25015505 +--EISEEQARMVKNGWQAAvdAPGDFGSDFYRDLFTVAPGVIGLFS--GDMT------------EQQGRLTHTLAETVELVDQpttLLLLLRASGVRHH-HYEVKHAYFSVMRDTLLNTMERRAGAVFDAAHRQAWEAMFDNMATIMQDG-- +>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514 +--LISSKNLGLIRDTWAMARrDSDIAPKIFLRMFAQHPETQLMFPRFANVP-QSQLMTNKDFLQQAYTCLAGLNFMVKNMDDEDlviKLLSRMASPAFYvDFPTPGQQLDETTRLFLDVMQEELGNSFTADARNAWTTVMNQIHNVLVQQ-- +>GraSoiStandDraft_30_1057271.scaffolds.fasta_scaffold222668_2 # 490 # 1347 # 1 # ID=222668_2;partial=00;start_type=GTG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.654 +--LLSIKDKALVRESWTLAKsNNEIAPAVLLKMFAENPDAINLFPKISKAK-IGDLKGNKDLYNYAYSSFAGLNMIIKSIDEVKtiaTLFKNSDNPSIFlDSRSASLD-------------------------------------------- +>tr|W4FW63|W4FW63_9STRA Uncharacterized protein OS=Aphanomyces astaci OX=112090 GN=H257_12922 PE=4 SV=1 +--VLTPRHVELIKANWSAVCagtsafdVEQHgspdkffHRTFYATLFKADPSLRGIFRS--SL------------TLQGKSLASIIKVMTGvvSASNLVERMQALASGHL-KFGVKRQDYATLGVTLIQTLEIISGSSWSRHVKEAYLTAYCLLFYLV----- +>tr|A0A024UCA0|A0A024UCA0_9STRA Uncharacterized protein OS=Aphanomyces invadans OX=157072 GN=H310_04772 PE=4 SV=1 
+--VLTPRHVALIKQNWSAICrgtnafdSTKHgspdkffHRTFYSLLFAVMPSLRCIFRS--SL------------TLQGKSLASIIKVMTGvmSTSNIVERMQTLAEGHL-KFGVRKDDYTTMGVTLIRTLEVISGSIWTKEVKEAYLTAYCFLYYLL----- +>tr|R0JHX0|R0JHX0_ANAPL Hemoglobin subunit alpha-A OS=Anas platyrhynchos GN=Anapl_10052 PE=3 SV=1 +-------------------------------MFIAYPQTKTYFPHF-DLS-----HGSAQIKAHGKKVAAALVEAVNHIDDIAGALSKLSRRRKKERfQtkPAPKNLPLAAHrCHQLNIASKGTEHygTNPQLAWLSTGHLVSGRELISSKSS +>SRR5690625_6805322 +--------------RSPSHsqtltLSPYTTLFRSRNLLRNHPELKNYFNT-ANQV----------NGFQPRALASIILQFAKNINHi-yeiVPKLERVCQKHC-SLGVQPRSEEHTSELQ------SRGHTVCRLL-------------------- +>tr|F2UFM8|F2UFM8_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) OX=946362 GN=PTSG_06664 PE=3 SV=1 +-MRLDMEQLKIALGSWTAVVelVPTWHEVFFAELFQAHPETERLLYS-SDKSK-------SWNERHMARVGKSVGDVIKSLsnyDDVIEHLTALGTRHA-RYGLHVDQLDLFINAFLWTLGAGLGDSWDHSVKKAWMHVLPFILSPLKS--- +>SRR6267143_1520378 +---VTLEQIQMVQASFAKIAPivGPATDRKLRRCSALVAGFrkeTRLST--GVS------------KNPGRSEVRGTLCGASCCGSlss------------------------------NWVANIRRGI----------SP-LALAIASI----- +>tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii OX=37682 GN=F775_23753 PE=3 SV=1 +-STFSEEQEALVLSAWDAMkgDSAAIALKFFLRGRNN-------FVQLAHVE--SPKRRIPVVEERKTDL-----------------IFEIRTKTW-KIGQKSTAYRSW--LLLR--QKSLPa----HAPKGHLSElvpldTIDHTHQET----- +>ERR1700722_6370008 +----------------RGIRPhcPavrqhLPCVLPPH--VRAGSVASHAIPQ--LS------------APLTATLTAALEALVGALGDLQPVLVrapALGLRLA-SYGLQPTDISIAASAFLATLDDELDEVSTNAARAAWGCVFWTVA-------- +>tr|A0A0M1J4K8|A0A0M1J4K8_9GAMM Uncharacterized protein OS=Achromatium sp. 
WMS3 OX=1604836 GN=TI05_18490 PE=4 SV=1 +SKDIKPTNIYLYQASLNRAiNTSKFCDRLYFNFMNGNIEIANIFKG-RSK------------ERIQHKLQTTLDLVADNANQvpgNNIYLEMLGRIHT-KRHITPEHFKRWKFAVINTIAECDP-NFDTEICAAWEEVLTALIDKLI---- +>SRR5260221_159328 +------QALGLVREGFAAVIarPDVFVSELYQDFFTSNPRYRKYFGS-ADIGySGsADIngTGSPEighaaadITRRNAKTVEAATRIVADLDRpgvLLPYLRKLALEYR-KYGVREAHYRAFAGSVMTALERTIGQAWTYEAAEAWVDELTMVASAMLG--- +>ERR1719266_796048 +-VSGLGTLSIISQASWKAISGeiHSSGVAVFVEIFKAQKEVQQIFQKLNPNPNSSGIkytkdqALKESLHEHGVKVLSGVDEVLSNLDQpslCLSLIRKTGAFHRKLQGFKPKYFKCFEEPFLAMVQSSMGQRFTPQMEIVYQSVASFFVQTLIEGYN +>ERR1719402_1083666 +-TDLSTNQKNMIRDAYAVFekNGEKNGADAFIYLITQHPDLKKVFP-WGDVS-NEELRENQVFKDHVYVVYKGLKVAIDRIDNLKAtasYYVHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQTSFNNLLQFLVGNMKV--- +>ERR1719295_364028 +--DLTPEEKRCIQRTIPVIlqEAEMIGTKTYLKTFHNYPLSMIYFEPLRDKLVTEVKQTDDYLKKHGVLFVKFIGELVAEMDDpdsVDLKLKSLGRFHD-DLGVLKQYLEAIGPLFVQAIRPVLMtqasipsatncgvgvsspnSLWTRDTKPSWIRFFRVIALQMKRAY- +>ERR1711860_326342 +--ELNSDEKTLIVTCSKQLleIQKVLGPQMMQQKFQKV-----------------------WSKEAGEL-KQLYDMR------------------------------------------------------------------------ +>SRR5215213_6828293 +--------RR-----LG------------------------gRIRC-APdR-----------PQRPPVRPRDATDC---------------VQAHV-PRGA--GRAVHRGRPLpAGGGGPGPGEAVTPEVAAAWEEVYWLFAVQLIG--- +>SRR6476659_6585810 +---------------------------------------------------------------------------------------HVAN--A-RFTPC-PTYVDDGAavvtNPGKHRGADAGRAFSENLSVDWNAG-VRTAPPLVA--- +>tr|A0A2B4SAV5|A0A2B4SAV5_STYPI Uncharacterized protein OS=Stylophora pistillata GN=AWC38_SpisGene8312 PE=3 SV=1 +-------------DTFGPKEsRCREESVCKVRLLELNPNLQDAFPSFRGVS-LDELMNSRSLFLHSKRLMAVVEEAVSSLDDakeLIEDLTNLGERHL-AMSITEKHLKNLQRAGPATNQDAKHRLLANKGTAQIDRHIARMEDTRLP--- +>tr|A0A1E4GLJ3|A0A1E4GLJ3_9CAUL Uncharacterized protein OS=Phenylobacterium sp. 
SCN 70-31 OX=1660129 GN=ABS78_22870 PE=4 SV=1 +--ATAFARAADIEASLELLAerDIDPTARVYQRMFELHPQMEPYFW--RDTD--------GKIR--GEMLSLAFAAILDFVGErryADHMIGTEMINHE-GYDVPRDVFATFFAIVRDALRDLLGADWTPVFESAWEEMLAEIESYARQ--- +>SRR5699024_10012150 +--------XLVCLLSLPCPhpHLNSFPT-RRSSDLSKAPELYNIFNQ-TN----------QERGIQQEALAYSVYAAGENIdqlDNLKELISRVTEKHA-ALGVKAEQYPIVGETLLEAVEDILGSdVATAEVIGAWEKAYNYIADAFIE--- +>ERR687884_344007 +------------------------------------------FPR--TT------------TAHNGRAQQSSTANRRaDYPRrapMNNLSRLLKESWT-LVEEQQDKYQVVGDALLEALRTFAGDQWTLEYDQAWRDGYALIAQRMIDG-- +>tr|A0A0J1H5I9|A0A0J1H5I9_9GAMM Uncharacterized protein OS=Photobacterium aquae OX=1195763 GN=ABT56_07590 PE=4 SV=1 +------DFHQIFNDSYQRCqRHPQFFQIFYRNFWQQEERFQKMFEN-VDM------------TRQIKMLKLSILMIMLASTSeeAKDNIRRYARRHGPdGIGAQPEDFDIWIDSLLKAVKECD-THYNSDIDKAWRTCFKTGMEIMKQET- +>tr|A0A2E7C7Y6|A0A2E7C7Y6_9GAMM Uncharacterized protein OS=Haliea sp. OX=1932666 GN=CME43_15375 PE=4 SV=1 +------TSKELFLHSVTRClTHETFIHAFYLRLFDASEEIRAKFRF-TDL------------EKQNAMLRRSLLLYAEATAgRteALREVNERATTHDRhHLDIQPHLYAVWIDTIVTTARDFD-LQWNDDIEVAWRTILGHVVQQMIRRY- +>tr|A0A0F6YJJ2|A0A0F6YJJ2_9DELT Uncharacterized protein OS=Sandaracinus amylolyticus OX=927083 GN=DB32_003309 PE=4 SV=1 +--------MDTTLDSFRRLRERGFAHRFYEQLFVADRRVPRLFAG-TDL------------ARQRDLLEHGISMLLAYQRgSalGEIAMRRLALLHGPrGLDIDHDLYAIWLRVFLDVAGELD-PEWTPELAAAWHAQLGASIAEMHRRG- +>tr|A0A244CWV0|A0A244CWV0_9GAMM Diguanylate cyclase OS=Pseudoalteromonas ulvae OX=107327 GN=B1199_05805 PE=4 SV=1 +---------------------------------------------M---ET----------VNSKAKVLNKLLIA------tsVVLISFIVSLQLA-GVEMGQSSIIAILVFGIASIG---AMAF-------LYKAVEQIADKLNVIEE +>tr|A0A0L0EW98|A0A0L0EW98_9GAMM Chemotaxis protein OS=Pseudoalteromonas rubra OX=43658 GN=AC626_03140 PE=4 SV=1 +---------------------------------------------M---NS----------QSIQSSLNNKIIIA------gvILVISIVVGIQLG-ASGAENMQLVAVALPLFGVVV---ALGY-------LKMALSAVSAQLGCVYR +>tr|A4BJG5|A4BJG5_9GAMM Probable methyl-accepting chemotaxis protein OS=Reinekea blandensis MED297 OX=314283 
GN=MED297_02020 PE=4 SV=1 +---------------------------------------------M---NQ----------LNN--ALSARILIV------gtgPALLLVILNLALA-GSGSA--TVLNL---------------------------------------- +>SRR4026208_2063884 +-R-SVRTSKGHRQGHPPAIQkhGGAITTAMDARLFE-NEEVKAMFDQAAQES-----------GEQPRRLANAILAYarnIDKLDMLTAAVERMAQRHV-ETGVKAQHYPYVANALPPTIRDGAGG-------------------------- +>ERR1712080_92393 +TMSLSAGEITAVTASFEAVKadLGTNIGKVLQKLVAEHPDLKPHFPW-HAVP-TADLLGNDGFKTHAAQVGRGFAEAAGNLSNLsacEGYYVSLGDRHK-TRGFAAAQVPMVADAFVAALQ------LTGDDASGWTKLITFVGSSIVSG-- +>ERR1719334_3108017 +-TGLTPKQAQAIISSWENLNSEC-SSLLFKQLFTIFPELKEYFG-FSKRELVDKILNSEEMIAHMDATWNGLDKLVLSTQTgtrFAAIGKGLGYNHF-KFEIDRQDVHKFMDFFKQVLKDDLKSQFHGDLEEAWNIWCKAVEDVFIMGY- +>SRR5207245_2384740 +--NPQPST-HAVTEQVVTLDv-----LPWTSGKLGLGPGKarlsEPLAP--GDT------------LE---SL----------LERQrarIPGfeewVYDArerriheHCTLL-VNGQAEYRRHTAEVEI------------------------------------ +>SRR5689334_4915957 +-----------------------------TASQRVTP----SLR--GKR------------VPSGQmgdRKVPD-VPIVDAHVHLwdpTAFrmpwLDGNKRLNR-PYGLADYREQTAGLPI------------------------------------ +>GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold2022664_2 # 351 # 797 # -1 # ID=2022664_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.631 +--------------------------------------------------------------------MPDFPI-VDSHVHLwdpNHFritwLDGNPRLNQ-RFAIPEYREHTAGIEV------------------------------------ +>MudIll2142460700_1097286.scaffolds.fasta_scaffold02451_1 # 3 # 1031 # -1 # ID=2451_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.574 +---------------------------------------miGSRAL--AAL------------FPHPKTFMDTKRPVADTHIHLwdpGYLtypwLETVpaiagph----G-PAELQVQEPETDRFRL------------------------------------ +>SaaInlV_200m_DNA_2_1039689.scaffolds.fasta_scaffold02144_7 # 4497 # 5432 # 1 # ID=2144_7;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.499 
+----------------------------------------------------------------LQCGVATVRSVIDSHVHFwqpQRLrylwLDEVpair----H-PFTPHELNQATQAIDL------------------------------------ +>tr|A0A0K2U629|A0A0K2U629_LEPSM Cytoglobin1like [Saccoglossus kowalevskii] OS=Lepeophtheirus salmonis OX=72036 PE=3 SV=1 +MTLLTKKETFLIRESWKLVTPEmtKHAVGYYIGMFVSYPKWQDrFFRRIKGIP-LRDLRNNPILAAHSSQVFSAVSNLLNNLENtevIVEGVKKIARTHW-PLNIRGKELEAGLVLLLDYLEASFPGQISKECGDAWNKMFNAMSGVIVD--- +>ERR1719474_2118124 +--SLNPTQKCVIVATWHSIFlkhMNFMGKQLFVDLFKVEPNILKYFDAFRDVG-LANLLQSRSFQNHGVRIMNLVKFAVENLDNpekLQDHMHALGRLHV-KKGIDSKYLNIMGPTFCQAIRPMVMaeGQWSIDIEGAWIQLFKILAQMMRVAYE +>ERR1719328_19047 +-NGMTPEQKQLIDDSFAVLKkdVKGNTIVFYETFFKMNPELVAHFPGVSE-ADLVNLGKNEFIIQRGAKFFNMIETTTHLMESKegcLELVRMLKESVP-EGKVTYDRYKVAKEPFIKMMETALGGNFSAETKAAWRKFFDSLAETTK---- +>SRR5581483_4578849 +-------QIALLEESFELIAgqSVELADRTLSRLIELDPQFRLLAAR-TEM------------AALRSVLFSVLyvlRRSLHNLNTLAPALETLGALRK-DQELSSEHFGTIGIALLDAMAEVGG--------------------------- +>SRR5690349_7596073 +-------------------------------------------------------------XMQMTRFTDL-GLRTLMLLasaestgrrvtTRTIAVGANASEHH-VAK----------------------------AVSRLAELGMVMADTLIE--- +>SRR2546430_1826610 +--SMNTLERQLVRATWIDLaaAPELLAAHVYDRLFTLDPSLRLLFLG-AEL------------SSPGATLTHAIDVAVANLERLEQTVARLGPDGT-IPSVQTET-GILGDALLWAVGSMLGPiACNPAVRGAWAKCCALLV-------- +>SRR5262249_54424048 +--TMNAYDRELVRSTWVELsaDLEVLAENFFDCLFTLDSSLRLLYLN-TDR------------VASGRALMHVVGLGVANLERLEQIAARAA-DED-VHAIGWKTGGIAGDALLRAVERTLGPaVCSPAVRDAWSRCCATLV-------- +>tr|A0A2H1V3P2|A0A2H1V3P2_SPOFR SFRICE_008656 (Fragment) OS=Spodoptera frugiperda GN=SFRICE_008656 PE=4 SV=1 +---LFGSqEFKACCsgMGMGKIGKGGIGPPVtsL--tqrnttqalfhvgflPYLRAAIQwctvqvDNSFDYLGIWT-EpVAFSVDPLLIAWlaykpTVKSEASLPAAVKSLSQtqqIP---------FR-RRSTP----------------------------------------------- +>ERR1719309_231760 +-TTLTEEEIQTVKTMWAGLleNSADSGLFIFQNFFELYPEQVHRFSFIRDSQGNpiPNYLKSQAMLQHSAMVMDALDGVITGVFehDplLGQMMYNAGYSHH-SKNIAKDDIEKLSNSILEVIKLVASCegSGKATKVEAWRKLLNIVNERFEQGF- +>ERR1712168_640531 
+-----------------------SGLVIFDHFLKMYPQQVKKFQ-FIQDKNgaiQYHYIVEPRMRVHSEMVMNAMDAAVVGIlrgHNVKQELEDLGRQHQ-SLRLK---qeeAAKEQEEREKEEEEEEEKeEE-AET-------------------- +>tr|A0A1X2H2S4|A0A1X2H2S4_SYNRA Uncharacterized protein OS=Syncephalastrum racemosum GN=BCR43DRAFT_446018 PE=4 SV=1 +--PPTAAQLKVIRRSWELVSdtrwpnepqtmspCQAFSIAFYDALFALDRTIESALSNI--ILQGKalsgilsHLVRTRVVLDEAK------------sidETHFARKLQAIGATYI-EFNVQPYFFDLVGPALISALQRRLKEEYTATIEDAWLTAQHYASYHL----- +>sp|Q7M416|GLB1_LIOJA Globin-1 OS=Liolophura japonica OX=13599 PE=1 SV=1 +---ISADQAKALKDDIAVVaqNPNGCGKALFIKMFEMNPGWVEKFPAWKGKS-LDEIKASDKITNHGGKVINELANWINNINSASGILKSQGTAHK-GRSIGIEYFENVLPVIDATFAQQMGGAYTAAMKDALKAAWtGVIVPGMKAGY- +>tr|A0A0P5UDG4|A0A0P5UDG4_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 +-NILSENDITTMNNSWSILRkRSDFAPKVFVRYFKAKPEAQKLFPEFASIPL-TDLPNNHDFLNAAYSCVASLDYILPHLKIphPerCPVLMELKNKysnvdlkkfgpixxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxrcpvlMELKNKYSNVDLKKFGPIWMTAMQEEMGNALTNEVRDVWKKAFVAFTD------- +>ERR1712000_676789 +-MSLTPQQSAQIRSSLPVLKseGETITSLLYASLLHNHPDLHNLFNSV-NQANG-------RQPRALLSSASVKGTARWESHQLS-------------------------------MISSRGTCWRPSR-RSWGPSGRLSX-------- +>SRR4051795_8230555 +------PAVT---------------------SPRVpA------------------------------------------------FgSPCPVIRQQ-RWTGAI-----IGTRQEGSVP----------SAHSTTSGD------------ +>SRR4051812_47002672 +------RLSA---------------------TPARtG---P---------E----------TRE------E-----------eTPSMaERTLTAMYD-DR---R-----AA--------------------------------------- +>SRR5215203_3322109 +--ELSERTIALVKATVPALEahGLAITRRMYERMFH-NEAIRDLFNQ-SHHG---------ETGSQPKALAAAILAYARNIEIlaaWGEAYWYLAEVLI-ARERLIyqglaaapGGWTGWRDFTV--AEKRCESEVITSFVLRPTDGGPVLRHR------ +>SRR3954470_353290 +------ARRS------------------------------------------------------------------------SPLaEGDPRYHVH-QWDRGRQPRRSTRCRVTPPVT----------NIRRYLVGP------------ +>SRR3954464_15980397 
+------RRVW--LA---------LL----DV-LRRsGP-AT---------V----------VRS------C-----------sEMPLfrPGNAPRSAM-GSVPIK-----SVNLNSLPCTDVLGEDATPEILGAWGEAYWFLADLLIA--- +>SRR6478735_1414904 +------SGSR---------------------PARLaS---R---------P----------SW-------------------nHRPIgEATLVNRYG-RS---A-----AGSDVE--------------RIERDLSGT------------ +>SRR3954468_7455402 +------APPD--RA---------LT----GGGETVpG---V---------R----------ASR------P-----------rTIDRsGRTLVSQSE-RS---A-----EGSGVE--------------EIERDLSGT------------ +>SRR3954470_12739883 +--------------------------------------------------TS------ACSRTRTSATCStsrtmarqapsprrspPPWSPMRAISTtsaRSPRVERIAQKHV-GLNILPEHYPAVAESLLGAIKDVLGVTHYSRGLTDDPDWYPYLKKHEWL--- +>SRR5215831_13609655 +---------KPCNRSKPFFRinAFCSAvslalrlQRLCELPESAHPQRC----A-SCLK----------TANPAKNVVPKRFGTFISIHLrdtYIFAVSKIGQKHC-GLNILPEHYHYVAESLLGAIKDVLGEAATEEVLSAWGEAYWFLADVLMA--- +>ERR1719273_448027 +--------------------------------------------------------------------------------------------MD-AWTDVYN-------ALTKVLQ----------SLEDNIKGA------------ +>tr|A0A0P5DF02|A0A0P5DF02_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 +---KPANDRRIIRKTWDQAk--------------------------------------------------------KDGDVPpqiLFRFI-------K-AHPEYQKMFKSFADVpqae------LLGNGNFLAQA-YTILAGLNvviqslssqelianQINALGA---- +>tr|A0A0N8DDV1|A0A0N8DDV1_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1 +--------RRIIRKTWDQAkkdgdvppqilfrfikahpeyqkmfksfadvpqaell----------------------------------------gngNFLAQAYTILAGLNVaiq---ALSSrslLPTKSTRSEVPIS-PVeLPPSCSSNSATSLrksllk------SSAAPSTprpdkpGRTVCALWSLASPRTSRTPK---- +>tr|A0A0P5CUZ8|A0A0P5CUZ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 +---------------------------------------------------------------------------------MFNPAGKT----S-GVPATPSFP-PSSSIssrrlpa------prSTSSNSLANLTKCSWVR---------G---- +>tr|A0A0N5DPZ7|A0A0N5DPZ7_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 
+-MNLSAKELQLIEQSWLDIeNKDELGKEVFKRVLLSNEKIRTIFDL--HTCPDDELDQNETFKRHLKSLSLFIGICATSVavgsERLVSIARRIGEKHVNFRWVtfDAEYWLLIKGIMVDVIASKQRPKEVEKVRSAWNTLLSFVISEIKH--- +>ERR1711868_89060 +--GLDKKQLALLQKTWKDISteMEAQGVRLFVEIFQSNNEVIHVFPSLNPNLKGNraNEVIHEAFKNMEAKLLPESMRFFT---------------------------------------------------------------------- +>tr|A0A090L154|A0A090L154_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti GN=SRAE_0000030700 +--NLSHEQQALIRKSWRRVPKQNIGKIIYQKIYQKCPELKNFLSS--DN---------NCVERHFRYFGDMLQCTVDSLNELdkalYPWLTVIGSGHA-GFAITTAHWDAFGEALISSIKQWILSgKEHKETVRAWMKLSCYLIDTLAAA-- +>SRR5256885_864722 +--VLTDRQRAIVQSTVPLLEtgGEALITHFYQTMLGEYPEVRALFSMAHQQ------------sGAQPRALAYSVLMYAKHIDRLEalgDLPAQIDRKST-RLNSSHLVISYAVFCLKKKKRTGSDS--------FTRSE-----RLVV---- +>SRR5256885_6575144 +-----------------------------------------------------------------------XMVMSMRGPALEaagTTGCRSCSAAV-CCSFF--------FQAEDGIRDYkvtgvqTCAlP---------------ISDILIGA-- +>tr|A0A016SWG0|A0A016SWG0_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0168.g192 PE=3 SV=1 +--QLTSEEMDLLRSSVRIIseNATEVGCNTYEMIFEQSPYVKEFFH-FTKSD--DDAYRQKQTVQLAQKYMQVLIAFVEGIEDpsiLEPVSAKLIEIHRKvddVQ--MAAHWGVFTECTLYNIRKALEKDehFNDmdrldAAVMLWRMVIRGIVRRLKA--- +>SRR5262249_10507301 +---------------------------------------------------------------------------------NvkySSHHQQHGPQAR-GVRSTNLAFCCVWRRTEMG----------P-ATAVWSGVHCRDAAGMDG--- +>tr|L7MTK4|L7MTK4_SYMRO Neuroglobin OS=Symsagittifera roscoffensis OX=84072 PE=1 SV=1 +-MQVSEEQQSLIMEDVQVLlpNYDDFVEDVLQQFMEENPETFQIFPW-ADASkTAKEMRSHPRFKSHAKSIGKVISDCLVDLNGvkkHEPKLSSLGAMHT-KKKVPTELFGKLGGCILTQVVKRVSeAKWSEEKKEAWLKAYGIITV------- +>ERR1712227_290716 +--KLSTKTIDLLKGSAAEIKenGTAIATELFKILFERYEVFKDLFPA--DVI------KNG---KMISVLPhalSAFAEFADNMLELDDTINRIVSRHV-SNGVQQWHYPLLEECFIDALDKTLKLDKRPELLQAWKDGFKFLANKVM---- +>ERR1711868_248053 
+--RLTPDTIEALKYTALEIKgrGNDIAKSLFDLLFTRYPVFKDIFPD--ENI------QEG---KMFTVLPialHAFAANCDNIAAIDETLARIVTRHV-DRNVQDWHYPMMEECLIGALRMHLEDDEGMDAMEAWKDGFKYLANKIM---- +>SRR5262245_20097952 +--EVTPQQIELLEQTLSELRrqSVFAAQLFYCRLFSLRPRLRRLLSG--RP------------DFHGTRLLSVMSAAVAGLSDPghfAGLLSLAARPAVREALLQGDCVRVIGDAVHWMLERHFGGQITVEVREAWRAAHIRITQVIE---- +>tr|A0A044TBZ8|A0A044TBZ8_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1 +--NFDDAEIQLLRRSWKTIKpeKQT---------VLQCPEVRRFFPFM-NSDLKSCEKKNKRFVFQALRFIQvdmtIFNEIIISSF-----S-------------ndIAILMLVFLECSIHQIRITLLNSkldlWNRKdvdnVIILWWHLNSGICGKIK---- +>ERR1719186_618842 +-----SVQTREIRGTWVVILaqLQKVGVQCIVDLFELHPFVREHFKEIlvqyGKLDPDNDNALQNVLENHAKLVMNIVHELVVNIDNLdglSERLQKLGLFHV-RNAVPKKYSSTIVAFSHTEMHN--CRdlAFNFPETHELHG-------------- +>SRR5688500_15455526 +--AITPYDALLLQDSFRAIQqqSGPAAERFFRELFSYDSSLKQLFAS--DRW------------RREEVLMKALGRLVDHLNSpdgVGPHLVELAREHP-AYGLSNYHHLYFGAALFSMLELVLGARFK-LVYGAWFKLFQLAVSEVK---- +>SRR5690242_19663030 +--VITADDVRMIQESFRRVEsvRASAAERFFRELFCYDEMLRGFFPP--DRW------------SREEQLMSDVRGLSEGLTQpdkLKLAIDALALRLD-GSLRRTPLHLYIGAAWFSTLEMVLGSQFDRRLHAAWYKLFEQVVA------- +>tr|A0A1I5XDG1|A0A1I5XDG1_9PSED Globin OS=Pseudomonas borbori OX=289003 GN=SAMN05216190_1566 PE=3 SV=1 +-----ADDAALLEETLEMVSsrSEDLTPDVYARFFSRCPAASGLFTvI-DpatPP----------M--GCGQ----MLFEIISLLRDsaagkPYVAsyMQQIATEHaA-FDVRDPALYREFMHSLADVQATLLGPDWSPAHAQAWDRQIAALLRHLP---- +>tr|A0A2D8QSR0|A0A2D8QSR0_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP89_08285 PE=4 SV=1 +-----SSKDDVIAESLSLVAerAGDVTSVIYEKYFMRCPSAEEVMSH-LDA----------Q--VLGK----MMEEVYRLLMVndyesENDYLNWEVSNHeT-AYNVEPHMYEGFFSAVIDSVREVMGSQWTPALERVWESKCEELRSEIA---- +>SRR5207247_8066543 +------LDVQRLQESFARMAmhGDAVPLFFYSDLFLRHPETRDLFPV--SM------------AAQRDRLVDALGRIVSDvehVDADSGDPSGARPEDA-HIQAVRILsnAQQMADNYVADAQEY-----SSQLSTX----------------- +>ERR1719419_503384 
+-TDLSPKEILDIQMSWAEIHQEgLVnpDVLMFKLFFEESESGRLKYSHLLkNVNLDnlnwmRDWTKVQKLKDSIDKTGEALGDVIKSLNyhdRVVDKLYSHGVVHA-KFGVTRKEIHTFCECLLMTLKMELGTNLSQEAQASWERLLKMIVEVFC---- +>SRR6266536_694904 +---------------------------------------GTRFA--DSHR------------PPRTMERTGplrDRLALRALRlgvgdvvwEDVPSLKRSMCG-----------AAAAGAAPVVAAVASAAPGDPQKHLKRADQVYAKSILLRMS--- +>ERR1719230_2183946 +-SWFTDDRERLLKRSWQQLQldsCEEAGALLCRNYCSQSPEDAASC----G--------------MDWSAVIKVIGFPIDRMDNLafvKKRLRCLGANHA-KWETKEHQFQSMKYAFLSAPRDVFANEFTSDLELAWDLLYDFVSTEMIAGL- +>tr|A0A090KT29|A0A090KT29_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X0 +-TKLTENHRKVIKSSFEIFKknGVPNAHNIFLRMFKEYPDYKNVWSQFKNMS-DEELSQTPLLWKHATTFVFGLERVIRTMDDqemMILMIHSTANQHK-SWGLKKEHFFAMVHLITDILMEEKGEpDEKYAIMEAWESFYDVLGT------- +>tr|Q6BBK1|Q6BBK1_9BIVA Hemoglobin chain I OS=Calyptogena kaikoi GN=Hb-I PE=2 SV=1 +---VSASDIKNVQDTWTKLYdqwEAVHASKFYNKLFKDNEDISEAFVKAGTGS-------GIAMKRQALVFGAILQEFVENLSDptaLSLKIKGLCATHK-TRGItNMELFAFALADLVAYMGTTI--SFTAAQKTSWTAVNDVILHQMSSY-- +>tr|A0A0N4TEQ4|A0A0N4TEQ4_BRUPA Uncharacterized protein OS=Brugia pahangi PE=3 SV=1 +-IPLTRKQKFVLIKNWKGIErdVTTAGIEMFLKMLTEHPEYYEFFN-FRNIANTakEKQASDERLSAHGAAVMKFIGKAISQIENadaFFMLLENNGRQHAHRGAFRPEMFWASYSFTCYSFSNGFIRNFFSNI--------NLLLTKVEMSY- +>SRR5690625_5362168 +VLRSPPpphpaasslSLRDALPLCAGVVaeHAEEITTVFYRDMFEAHPDLLNVFNV-A----------NQAVGEQPKALAASvVAFADRKSTrlnsSHVA----MSSAVS-CLKRRSPERR-RG--------------------------------------- +>tr|A0A177B679|A0A177B679_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_02502 PE=3 SV=1 +--GLTKTDINMVLGSWESINNDEASSIFYRELFNTYPDTKSLFVKFYSVD-NDKLIDNPAALKQLRVTWTAITTLIDYLKKgrideANKAIDYLIEKHRKIKTFQGPMFNMALEPLLYLVKEKL---TSQAYIDAYKKVFGAIFLTIISKY- +>tr|A0A177AVU9|A0A177AVU9_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_06067 PE=3 SV=1 
+--HINIKDIERVSTTWDLLDDKKSAIRFYKHLFTIYPQTNKIFVKFHNAK-VDSLGTNAQALKIAKAMWGSASHIIISVSEgnlkeIYKSIDYLIKIHVNVPKFSPTMFELAVKPMVATIQEKI---TDPEILQAYVNIFTVIIEKLKTSY- +>ERR1719397_1495121 +---FGAAQTRMIRSSWSIILaqMQTVGVQCIVDLFNLIPYMREHFKKViadsGRMDPDDDSAMQAMLENHAKLVMNIVHQVIINIDDLdliSPKLFRIGVFHK-NTGILPRYLDIMGPVFCNAVRPILLKhkMWSAETEDSWMEVFKVITSIMKRGY- +>tr|A7BZS6|A7BZS6_9GAMM Globin OS=Beggiatoa sp. PS GN=BGP_3767 PE=3 SV=1 +---------ELIGQSWDKLAGkhEEMVATFYDRFFDKFPHYRKFFP--ESM------------EHQLKRMAETIALLARVTHEtevTHPHLVKVGSRHT-GYCLAREDLDNFKTIFVQVVGEYCGDDWNQEYQESWTEAfEQHIIPYM----- +>ERR1712048_439078 +----------NVTTIWDSIKavpgyEEKFGRMLYEKFYEMEPESFKLFKK-TRQPAAEDVFSDPVFVQHSLEFVRLLDFFIQVLGPdielVEESLVDFGETHQ-DYGVTLDTYSSFGEAMTETVEELLGGngKMDETSRRCWVTAYRYMSMHMTRG-- +>ERR1712048_1339107 +----------NVTRWWDEIKripgyEQKLGATLYQKFYDLEPDSFETYTS-NLT-PTEDIYSDSTFLENSATFVHLLDFFVQVLGPdlelVEESLIEFGARNYNDFGItTVDSYSSFGEALL----------------------------------- +>SRR6516162_179054 +----RSQTVMDIEESLHHILerEKLVADLFYMVFLEKYPEVRRHFINV-N------------LRRQAVLLTMALQVVVQYYLKgfptAEAYLKILGEEHN-RRGIEPELYPKFCTALLETLSRFHFHDWSEDLAQQWEEALKLAATEMVEASP +>tr|K2K1I7|K2K1I7_9RHOB Globin-coupled methyl-accepting chemotaxis protein OS=Celeribacter baekdonensis B30 GN=B30_11265 PE=4 SV=1 +---LAVKQISLVRNDFRRLAPvrPEMFKRFYERLFEIAPHTRDLYS--ESL------------TEEAIRVNGLLEIAFLSLDHpqaMFATLHTLGRDFS-GFGIWETQSDLVVDLLVEVFAEFGGEDWGTELEKAWHSVLSFIAQGMKEG-- +>tr|A0A291GF03|A0A291GF03_9RHOB Uncharacterized protein OS=Celeribacter ethanolicus GN=CEW89_16165 PE=4 SV=1 +---PSARQIALVRNNFRALSPkrPDIFIPVYDRQVGEDPKAAAQYD--GSL------------CQRARVLDGLIELALLSADHptaLFATLHKMGQDYA-HYGSWREKHPFLIGQIIKAFAEATDTHWTDELADAWEQFLYFMAEGMLEG-- +>SRR4051794_12469468 +--------------------------------PPTMHDLRILLAG--DA------------GVRREQVGQALSWLVDNLDQprvVAATCADLGPALQ-QVGASPQRLDALGVLVADALRANFGAAWRQEHYDAWHSSARLVTSWMGQ--- +>tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1 
+---ASADDIALVASVWVFVkpNLEEVGNEFYDQFFAKHQDLKATiFL-------------GTNFLTQAIRVMEMFDAAIEAMCDpvaLMELLVPLGERHA-LYGIRKEHYDIFWPALCIALKEQLGDKLTDDVVQSLHRVYYKVIQVMLE--- +>tr|A0A0S4IT96|A0A0S4IT96_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72670 PE=3 SV=1 +---FTPTIVRTIRTTWAAAtkDMDAFGDRLYTAVFALDRTLKeTIFKG-TN------------MSAQAHHIIETLDSCVRIMDQpnhLMSMLRQLGVRHG-AYGVGRHHYPTIGKALISALEGSLEDKFTLEVNKSWTKFFNVIERSMLEG-- +>tr|A0A1V9Z083|A0A1V9Z083_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_04708 PE=3 SV=1 +---PTATDEDLMTQSWDDIIgcklrAEierrkapstepspeaptttsaivQFYDTFFSHLYVINPETRSVFRN--SM------------HVQSKALVNIVGAIRHVlhSDDAKNMVAAMAVRHI-QYGVKLEYFDNLGVAMIQTLSKLAGTTWTTAMADAWHTVIAYIICLIVPHY- +>tr|A0A1I7UV11|A0A1I7UV11_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis PE=3 SV=1 +MDRLTERQKQIFTETFPVVfkDSRRNGLVLFAKYFSEFPHYKNIWPQFRTLQ-DSALLASNELANHCSVYMSGLKEIVEVMDDeekLTYFMARIARSHV-KWNINKYHITNMLEGVDAVLQRSFGDKLTDEIVNAYHTLYDVIGNLLD---- +>tr|A0A0P5Q0G6|A0A0P5Q0G6_9CRUS Uncharacterized protein OS=Daphnia magna PE=3 SV=1 +--SMKGRGSCFDQGHLESCKkNGNIAPKAFIRYLKLKPEAQKKFAAFAEVDL-ADLPTNSHFLNQAYTCLAGLNAYSDNLGKNPKSCPYLNSP-AF--KdVKPDELKLFGEVMFNVMEKNWTIIFPRQARKAWKDGLTACDVA------ +>tr|A0A258C6P4|A0A258C6P4_9PROT Uncharacterized protein OS=Caulobacterales bacterium 32-67-6 GN=B7Z13_12975 PE=4 SV=1 +------MNTQALLDSLDLVAeHGeDPTPRVYERLFARYPETEALFMG--DTR--------GA--ARGQ----MLRQAIETLLDYlgpnafaANFLRAELHNHS-DIGVPTEIFPRFYQAMAEAFADILGGAWTADMQRAWDDLTAKVEQIVRG--- +>ERR1719244_673251 +------GQKDLIIASWREIriCLDEVGFDTFKQLFAHHSDIRAYFPAMKKLSS-NDVEMSRKIKEHSTRIMAVLKLFVDNIYDLekiEPSIEDLGRNHS-FRTLLGLFLSE-------RISGQL--AWR--------RCCFNYLNIS----- +>tr|A0A1I3XAR1|A0A1I3XAR1_9PROT Methyl-accepting chemotaxis protein OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_101121 PE=4 SV=1 +-----QAAIQRA-EACLTLSadGLVLEA---------NDRFAALL-G---LA----------PAAVADRPHA--ALLTLAERDgatYRRFLDQLAQGR-------------------------------DTVARLWHQGAggagvllELSAAVMAAD-- 
+>tr|A0A1I3XA39|A0A1I3XA39_9PROT Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_10 +-----MAAIDMA-QPMMLLGadGVVQDA---------NAPLAALL-G---VS----------ADALAGRPHA--ALLAEAERDsaaFRRFRDAVAAGQ-------------------------------AGHARLRHAGAggntvtlDLMMQPLAAE-- +>tr|E3MNQ8|E3MNQ8_CAERE CRE-GLB-30 protein OS=Caenorhabditis remanei GN=Cre-glb-30 PE=3 SV=1 +-SHLTPIDREILNKSWAIVskDMQQVAVNIFQMIFEQAPDAKLMFSFM--MKDYKEDKKSNEFIFHAVRFLQVIESTMTHLDDpsqLDAVFLNLGKIHAkheEQLGFSAHYWSVFKECVLFHFRKAMKAHnkFSkhkemsfAEIDSAiilWREVLRFIIDRMKVGYC +>ERR1740129_566420 +--QLSSASVETVRQTAALVgsRAQEIVEAFYRGLRARYLELFQFFNR-TNQTSN----------RQSRALAVALTafaSKIDELSEIHGLLEMISVKHC-ALAVRPRHYMLVHENLLAAMEEVLEDQLTPSGYDAWSDAILYLVRLLTEQ-- +>ERR1719183_2765469 +---------------ADIFmpRLEEIVMRMYNLILEEQHECINIFNT-PSLSPG----------QPLAALAACIRgliEDINVRPRLEHRVEMIAQKHC-AINLQAHNYLGLQGMFMSAAEDVLGADMTPQRFSAWSQALLFICRLVIER-- +>tr|A0A0L0FUF5|A0A0L0FUF5_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_07147 PE=4 SV=1 +--ICKPEELHtkdlgfivtHTNNPW--GstDEQDFGVDFFRDHADQ----------------------------SGLTSFFSSIVIIACEMYqefepSIPQLQKLGEEAK-HLDIPCHMEDNIVGYVASTLSR-SK-QFDAIEECAIFKLIWRVVLFVLE--- +>tr|A0A2E9QYM9|A0A2E9QYM9_9DELT Nitric-oxide synthase OS=Deltaproteobacteria bacterium OX=2026735 GN=CL920_22905 PE=4 SV=1 +--ALSS--MKEAKRLWEEGvgLHTAPGSEWVHQLVAERPEWNHFFAS-SDPE------------AFGEALFSTIDSAVHQLDDevsMFSSLREDSELFT-AWDVRACAFSALPDVLVDFVV---E-DHQTVGAQALRTFLRRVCTIVSL--- +>tr|A0A0K0EIZ9|A0A0K0EIZ9_STRER Uncharacterized protein OS=Strongyloides stercoralis OX=6248 PE=3 SV=1 +-VPLTERQKFLLVKNWKGISrrARDAGTNLFVQLLSEHQELGDYFI-FGNVKakDKYEMLADERIQNHGEAVMRILDSVITSVNDPQemfRILEEQGKQHAIKKNFKPELFREVEDALFYSIKLILDERYTDNMDSIYRIIMKTVLKTLE---- +>ERR1719158_1160759 +-------NKHLIDETMDRVanaNIAELGVICHKKLFSLSEDVQNYFYK--P---------NTMVAYILEKVLFILSNLSHEPVKIAHEIRALGMRHI-KYNIPPVHFPLFGKSLMYTFSSTLEGFWTDDIEDAWGSVFDFVCRCMTR--- +>ERR1719158_1490032 
+---------------------------------------------------------GGQLSFICRGHSSRIN------------RNALRVRRsrI-TNRSHSNCFSSYT----------RCSISSITCASAWATCLLR---RL----- +>SRR5438270_3151649 +---------------------PQIVDRMYTRLFEVAPRVVKIFEG-KDPT------------KQL-RTVHVLRDSFDDLSALTPELEALGERHA-SWGVQEQDYAIMGPILLEAMAASVDPYWRSEYTTAWAALFQTVEDIMVR--- +>tr|I2K200|I2K200_DEKBR Globin, putative OS=Brettanomyces bruxellensis AWRI1499 OX=1124627 GN=AWRI1499_0864 PE=3 SV=1 +--QLTREEIDLLRWSWRLVTvdddSTSLGGNTFnAADFSSYLFCIQFYNNFISMD-EKVVEMIPSIRHQASSFADVLNQAIGTLEDLskmQELLTNLGKLHARILGIERSYFKTMGEALIKTFRDWFGNNetfFPLILEEAWIKLYCFLANSIIQ--- +>tr|A0A0R3PZJ2|A0A0R3PZJ2_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1 +--PFTDEEKSELLRSWKVIeaQKQAVGCDIYEMIFNQL------EP-FLCVSIKAPKELHNKFRIIVICIVGRYEEELSSVNE------------------------------------------------------------------ +>tr|A0A183UUV2|A0A183UUV2_TOXCA Uncharacterized protein OS=Toxocara canis PE=3 SV=1 +--RLSPRHRNLIIKSWSKTNKSKIARDTFVELFKTSADIRSKFV-FGDV-PIKRLKQEDRFLAHCERFVAALDSVIAHLDEIGaviENAEALGKYDISAepihaamaKDLRNEHWRLFGDILVERIIENDTKqpSGGSEVHAAWKMLGQLLVFHMRLGY- +>ERR1719367_1435250 +--------KTQLRSTWNVImsDMASIGVVMFLKMFETHPETLSSFIR--NVYSIKEIEmdewYQENLKLHAIRVMAIVEQVIHRLDEVgsvIKILMKRGLSHK-RLGVQRSMLEKMGRSFVLSIQSPLEEanKWDATVEQSWLSMFRFIEFWMGLVY- +>ERR1712004_299484 +----------ILRESWKHLqsRIESLGVVTFLSLFNASSETLHTYLTPEDIATLKEQDkdkmLIEKLRVHPLRIMSVLEKTVHRLEDHqrcLKMLRQYGRKHQ-RFGVPPFMFATWPGVFYLYSSPYWKNlsNGMRTFHKLGKACFNSLHLEYRE--- +>tr|A0A132A213|A0A132A213_SARSC Globin-like protein 2 OS=Sarcoptes scabiei OX=52283 GN=QR98_0035350 PE=3 SV=1 +MTEFEREEIEVLREQWDRIVhyhQECFGMKLFQRLLQLHPEYRPLFG-FEE--TVEEIQNTQRLKAHGINVVYMLNMLFDNFDDmdmIDELIFKLVKLHM-MRGIDQIWLDDIIEPFELVLEEF-NAKIQIERIEVLRKAFIFIKNRMQELY- +>SRR4051812_15383594 +--PMTSDTIALIRASFRLAaaDPQALSQVFFRRLLLRSPGVQRMFPA--SL------------VRDPQRLVGLIDQVLRLLDRrdmLVEGLQNLGRLQA-PYAALPMHYPLIAGAFREALALRVGTLWSVDMEESWAELQALVIRIMGA--- +>SRR4051795_1885912 
+----------------------------------------ApRTAR-RRLQ----------PGQPGRRLAAdRAGrvgrGLRQRPAegprtdsrapavadraqarvaghrprpvrRRaRQPVLGHRRRAR-EGGHTGGRRRV----GRGLLADglCPGQPGARPLQRAWRAA-----GDGVAR-- +>SRR5690554_337115 +-------YVKLLETSFQKAvenvGIEELSTRFFSRFFETFPETNSLFKG-TNIDY----FR----KFKMRVIFDFLIDIVKHPNYAEAHIAQEVMRHQ-MYGLqDKEYYFTLAACLLEAVKSALGDAWTDEDESAWNDILLVFKG------- +>ERR1739838_826584 +----LFGSVWPLPLSWDIIShkVDQDGESRFLHKFESNQETEDPILQQ-FT-------QIDASIFNGKSAMIIVALTLENLENyqaLWRNLIRLGRDHF-GYGAQPMYLDLIGPHFVITIRQTLGYDWYEALEYHWLALFELIVYVMKFGWH +>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1 +-------NLGLVRECWDSICeqytTNELGEMVYDHLFKMAPNLTMLFTKPR--------------SYMAVKMGDMLSMLVSFADSsesMKQQISWLGLRHV-KYKIRPHHIPLMGPVFLAVVAEAAGVHWSQDTEKAWSVLFNMVCVNMADA-- +>SRR5690606_39733342 +-TEL--YTLSLHDALPIWVAekIGDPTRLVYERLFAEQPEMETLFI--LDTD--------HSARGH------MLTEALNCIFDLlgQRayapvLIQSELTNQD-RKSTRLNSSHVKISX------------------------------------- +>tr|A0A0D2X3G1|A0A0D2X3G1_CAPO3 Uncharacterized protein OS=Capsaspora owczarzaki (strain ATCC 30864) OX=595528 GN=CAOG_004918 PE=3 SV=1 +----RHETRDAIQSSWALAIqkhddHdvtpvATFVNILFAKLFEVCPETRLVFGH--DMV------------RQGKSLSSILTgmlEFVVHPKKLQSQVKRLAHMHV-GLGVTPDMFEAFGFSLLYTIRVRIGSAWNQQIERVWVDTYGGVSNILSQH-- +>SRR5215208_3780459 +--PLSPEAISVVRATAPVVAahADQITAHFYPRMFAAHPALLRIFNQ-GN----------QATGEQSKALAGSVVAyAVQLIDPeapsFDHVMRRIAY-KH-VSLVSARSSTRSSASTCSPRSVRFSA-------------------------- +>SRR5687768_12147577 +------------------------------------------------------------------GLAHARMDsVSLK--PpanphcaiktwvlacgvpartaeWRPMSN-L-SDAP-SPSLLSDQSLSV----VQ-TTATVVAAHADEITAAWSEVYWLVALQLV---- +>SRR6476660_4664138 +--M-VVVGVDAHKrtHTCVAVDgsGRKLGEKTVPATT----------------------------VGNASALRWARSTf-GpdltwgiedvrnvsRRLE----------QELV-NAGQR---VVRVPTHLMARTRasartrgksdsidaTAVARAvpREPDLPVAqHDSVS--RELQLL---- +>ERR1719193_1089955 
+-------------------------------------------------------LKRHRRNRHEGIRFQCNYCDYD----AgqkGNIKSHMDRKHP-EIPYDHTEFQEVRVEKSkysreakqqELDLAAmqGADAFNMNPLAGIGNMMPFNAHIL----- +>ERR1719378_1531842 +--RFHPgaDGVHRIGGEESQ--aeVRRQRSLSLPKFLDSLSGEKEKFAFNFDSMgnVLPNFHASHAQKIHSMKIMDAIDAVISEIlrDHpIKQRLMDVGYAHY-ELHATSKDIRKLTTAFYKGVKDLIGIDDdNDRHLVAWKDFLNKIEEGFKE--- +>ERR1719414_1806212 +--DFTLEQIECISTVWANLRqsSADNGLYLLQHFYTLYPEEMQKFDFNLGDRqdFRLNFHRSQLVRDHSMKIMNAFDALISEIvhGRpVKQRMIDIGYEHY-ERDATAQDIRKFTKAIYSGVKDLMDADHdgprraaaghDDRHLAAWKVFLDMLAKGYT---- +>ERR1712142_47027 +--EFSGEELEYICSVWGNLRmnHPDAGLFLLEKMFLKYPELAKKFDFCRDFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKSHMRQMADGMLEGLKEVIGDAKdSTRKLLAWNKLFDMIVEEFGN--- +>ERR550534_2245262 +------------------RDlrHPLGLLLALH---------GGFLSFFHGFFgsYKADAMQTEFMKNHSIKIMNALDTVIAGItaQQpMREAVREIGRDHY-HKKIDKIHMRQMADGMLEGLKEVIGDAKdSTRKL------------------- +>ERR1719192_2788519 +-------RREIIGTMWESFRedSVSSGLFILEHFFSTYPDEMDRFTFASGGQtdketPLAFIMKRERMRIHSAQLMNALDRNGHVYGRspgCMDQAPQSHRG-------------NVCRRTGKSSGIA---------VFKWRVA------------- +>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514 +-TNLTPQDKQIMKEDWLMINEkKTAVNNLLLKFFRSFPQAQAMFPKLAKVP-LSQLPSNVEFIAIVNSIKNGFKFVIDSADDVGLLRQLAGSQDISvftVPGIPVaQQMQETGRVIVEWVQEEMGDRFAERTRVAWIRGLRSISQAFVSGQ- +>tr|A0A0V1CPF8|A0A0V1CPF8_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16047 PE=3 SV=1 +-SKFTDEEVELLARTWKKDDfdwLYRIGTDIYTCVFQLAPELKVFFPYVTECeKKNQSWESSKGFRTQALRFVQILGMAVEKTESrmkdddshLHHRLYKLGETHRRfaLKGFTPTHWKGFVIAVRVAMRRAVEAmpNLtpaeCETAIEAWDKLSRYVVHRMEEGY- +>SRR3954453_266974 +--MLTEKSRPVLEATLPVVgeNIGKIAERFYQHMFGEHPELLdGLFNR-GNQAEG------TQQQALAGSVALFASALVSHPNHLPdHLPPRLTTQTP-RPS-------------TWCRGSRT---STPRSAFART---------SIRS-- +>SRR6478609_8547471 
+--VlvdveevlrvvfgFDLPQTDVVRSvVLGNPgq----I--------IAVHKVDV----------------------AAGGRIGPQGGRVVPHPRDVClV-LRRVHPLR------------------------------------------------------ +>SRR3989304_146361 +----------DLEASVQRIldRGKNLADLFYCVFLDRYPELRRHFTAV-DL------------SHQAALLTMALQVIAENHLRpspaAAEYLLVLGHRHH-AWGIERDEFRRLRFCSPPPPQPSHGKGGPAARPRQWRAAIDEAVDTMRAGY- +>HigsolmetaGSP17D_1036251.scaffolds.fasta_scaffold61070_2 # 263 # 457 # -1 # ID=61070_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.672 +--VLNSIDEDLTTKSWNIVMsgtPtENFkakkldpcfhystslswfYDIFYKKLFELCPDVESMFEN---V----------SLVHQGKLLATVIGSALASLKKpiiLKKRLIALAQSHN-GKGVKAIHYCNMGLALFWSLEEVLGVsVMNEETRTSWVKMYSFMLNIII---- +>SRR5215510_2422438 +-LQMTKEQIEVVQNTFNKVRPmsGTAAQLFYNRLFDVDPSVRETLL--WTLK------------QGlGADFTPEAEVAWGNAYDFLAAVMQQAAKGA-SMX------------------------------------------------- +>Dee2metaT_27_FD_contig_31_2132282_length_204_multi_2_in_0_out_0_1 # 3 # 203 # -1 # ID=1013462_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.592 +-----------------------------------------------SAA------------TSNPQF----VAAV----------------------KKAIDYSGL--------LTVAGQGAVQPagiipSVIAGTLPAADALKQDVAG-- +>tr|A0A068XSQ8|A0A068XSQ8_HYMMI Neuroglobin OS=Hymenolepis microstoma GN=HmN_000477400 PE=3 SV=1 +--YFSEFEKDVLISTWEALLlyTHEHGAFIFRLAAEMCPELKAAYNV--EFNDDDELVISSCALQYSQAYITLIDEAIRSLEDPQEgfydSVLIAGASHATIPQMKPEFFKVLKRATLTTWEGLLGEEFTEDVANSWQTLLDYVVAVMVEGN- +>ERR1719193_549257 +--IFTDDELAILKDVWAHLKhhTAGAGLTILDHFFKRQHWALERFEALRDMY-GNihpDYMKIDLMRFLAVDLMEGIDIFVTGFFErdpeVTDLIADVGYAYV-KKIIIESEIEIFVDSMLAAMEELLGEDtWK-KNMAPWKKLMPVVAEHFSRGFK +>tr|A0A0D6L5L7|A0A0D6L5L7_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_14144 PE=3 SV=1 +-MLPASEVKKLVKSSLERVAigkepkEVQGAKDFYKYMFTHHPDLRRYFKG-AESFTAEDVQKSERFDKQGQRILLAVYILADTFDDeptFRAYARETVNRHR-QFKMDPELWSAFFTVYVNFLASRGP--LSDDQRKAWAQLGKVFD-------- +>ERR1719254_19301 
+---------------------REIVDDFYPRMFANNPETKALFNPA-NQ------FEEPNRQRMALtnAVL-AYASNIDEPEKLADAVAIISHKHA-GLGIQAAHYPVVHKNSGLHRARHGR-rrdaGGRRGLERG----------------- +>ERR1719394_777503 +------------------------------------------------------------------------------------AIRLGDFQHI-CT-TPLPFCRESPQVQALHHSILGPEVVTPEIGQGWSDGVLALAEILYK--- +>SRR5262245_29633745 +---------------------------------------------------------------------------LGNHSTrCgRSVESSQSNSTA-DFLNSRRIHDAYSpaiRAAKSKSE------------------------------- +>ERR1719193_348913 +--KLEQKDIRAIREGWACItaHpgLEKTGVDWLHLSFELQPGTKHHYKNFTNK-TLEEICQTPYMKILAGKYMSEIGILVEHLEHsnfVLMRLENLGHLHA-KMGVPMETLFT----MNIVMQHYFRELYsrqdvPDDCEGAWSKVT------------ +>tr|A0A1Y5FEW2|A0A1Y5FEW2_9PROT Uncharacterized protein OS=Halobacteriovorax marinus OX=97084 GN=A9Q84_13980 PE=3 SV=1 +-------------------NIDQFVESFYEHFFSLTPEIFELFKN-SEIG------------KQKNEFKISIHTLLINLsqlDKLDSYFKDLGIRHI-CYNVSERHYKLAKESFLYAIKKTYADHWSKVVETKWEEIIDHVTLKMKEG-- +>ERR1712238_458974 +---------------------KELIEMTDYPTFDVEGVVLCFL-------------------------------------------EWEHHKHE-NIMTFRD---HAYKALMTG-------TMAPLHHTPWKDALEDTIESYGLA-- +>UPI00054DD732 status=active +---------------------------------------------------------------------------------------LTCARDF-FltfVGVERCR-PKLLKQEPQTITSKLGm-A-PMLQSAFWSIRVMRIASS------ +>SRR3712207_8863908 +--FFFQ---------AEDGirDIGVTGVQTCALPIYARPDLLdGLFNR-GNQAEG------TQQVALAGSVAAFASALVKTPEQLpEQLLNRIRSEER-R--------------------------VGKECRSRWSPYHX----------- +>SRR6476659_5675031 +--STHRPDQALRGGGRPPHraADNNAKGAATGHRVSGRS---SPAEL-PENSMR------EQQQALAGAVAAFASSLIETPERVpQSLLSRIAHKHA-SLGIRPDQYQVVHDNLMWAIVDVLGDAVTAEVAAAWDEVYWLMGNALINQ-- +>tr|M3IW96|M3IW96_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) OX=1245528 GN=G210_5766 PE=3 SV=1 +--SLGPVELTQIISSWSKIRnKSQFHQSLYTNLIESNPQIGKIFNN--ND--------KNVISQHALIFGDCFNFVVENIQDnalLDEFLFSFVQENQRFANMATQYLEPMGNSLIRTFRKSLGNNFNSVLELMWIKVYVFIANSILQ--- +>ERR1719502_1452556 
+---LPPEQSALVRRVWQRLVgTPGAAPILVRQLQSVAPEVAALLS-DA--S-STNGRSNinrGGLhavhtdpHGRAAAVLSEVSELTELLDDsaaLRQRLRQLRAR---MPPVGPEVYPSVGKAFLHFVWEGVGSGYDNATAAAFAALWDQVEETMLE--- +>tr|A0A1X6PD63|A0A1X6PD63_PORUM Uncharacterized protein OS=Porphyra umbilicalis OX=2786 GN=BU14_0103s0020 PE=3 SV=1 +MGALSDDTVRIVKSTAPVLkvHGGAIVDGFYALLFEQHPAAAAYFNVVPTDGgGGGGGGGRGQSKAQIQRLSMAVllyAESIDQLDTLGPVLERISAKHA-SRGIPAEFYPAVGACLLQSIGRVLGDAATPEIVGAWGEAYGFLADALMA--- +>SRR5580704_1734515 +------------APRAELATgvAPDYgSPDDVASRRSQSRACRRTLR--RPTT--------------GAVRGEMLARVIEAILDFIgerryahHLIQCEVVTHE-GYDVPPETFGIFFGVVATTVREQLADAWTDAFDEAWRTLLYDLD-------- +>SRR5258708_241677 +------SCGEDPAGSSD-----DHDAD----VVASAGQVEGGVD--LVEH--------------PPALGVPIAAPCQWLVDLEgagacaaNRMAAERVNHE-GVGVPPAALARFFPIVAETCRDLLGEAWTGEIEAAWAGLLTRLA-------- +>SRR3954465_11422119 +----PCRSSPTTSGRSPGAs-TRT---------------CStAtRGCW-TGPStgatrpR----------APSRSRWPGPSRsspaHWSRSPSRSpSTCSpgSRTSTTHsasprpppP-PPPPARAERGVVQDNLFWAIVDVLGEAVTPEVAAAWDEVYWLMAYALVN--- +>SRR3712207_885952 +------------------------------------------LGR---------------------------GlladGLRAHPPGAgALQR---------PRRAAGDGVAGVggRRGENRERGRREPPPAAGAGTPGVDRAAPPGRCRPGT--- +>SRR5215467_2668635 +-----------YLHSFPT-rrSSDLPPSALYRHLFTTRPELLDgTSNR-GNQAD----------GNEQQALAGAVGafatALVNTPDRLpENl-LARIAQKHA-SLRITSRSNRLSGQGPIAPL---TEDQ----------HPX------------ +>SRR3954465_6877418 +--AtaaaTAAASSTDIRATRPASleG-------------HDRPHLDTaEAGR-AQLAD----------GEGDIEVGGVDEvvatqHLLRLHERAvGHlgpPTDARRGAGR-LQGVAAEELGTVRLDLDGELVVRLHDL-----VEDLGRRRRVLALVLVD--- +>tr|A0A183INM6|A0A183INM6_9BILA Uncharacterized protein OS=Soboliphyme baturini PE=3 SV=1 +-VILSNYQKTLLRDSWLRINktgIRNIGTMIFRRLLTKQRSIKQLFQHITVLEGvfSAGLTPIQAYQHHSLLFVELIDNAIKNIDDLsvlIPTWIEHGAKHARfkAYGFEIEYWDMFGSTMTEAAREWEGWRRHRETIRSWTLLISFIVDRLRQGY- +>SRR3954463_14455484 +---AQ--------------------------PRAARPSALRLSRP-GDGAP--------------FLLRAEvACLasGI-----g-----------TF-GPGLRSHPLARLGRS-----RALRGRAVLArCPPKIWSPLD------------ +>tr|A0A1I8CQM9|A0A1I8CQM9_9BILA 
Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=3 SV=1 +MNKLTEKRCDIIKETWEIYKqdGINNTIKIFFHLFTEHPEYKYIWPQFRGIPDS-SFILSSALRNHAEVYTAGLSIIINNMHNkakMYAHIKKIAYAHV-KWIIHQSHVQNMVPGLMMVLKDKVPH-FDDSIEDAWKTLYGVIGSLLE---- +>SRR5258707_573086 +--------------------------XMILKSFKPNAAIGC-K----TIPT----------W-----FVP-LPTFTAGLTLPKLyplSVFGMRRYN--LGGLGEPH--QVEAALLWLVEKQFEGVLTREMRQAWVQFCQWLV-------- +>NOAtaT_7_FD_contig_111_1754_length_212_multi_2_in_0_out_0_1 # 1 # 210 # 1 # ID=13324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662 +--RLKPKDAEYLQDSWKVFlErsggLEGAGKEFYRLLFEKEPDLKKLFQV----P--E--------MSQAAAFMRAISRYVSLLAQpeqLKTAIEMLAFMHV-NLGISETSIFAFAESLLECVEDQLHDWDpgeVEQVMVLLTDLTTYIGRVIA---- +>SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold554780_1 # 1 # 420 # 1 # ID=554780_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.669 +--VLTSSIYlttgTVVTDFSVIVlDaegsAIEPGEAPYSLRVYFTPASTGTstatIQL----P--S--------GLISDgMLAVGARRLQEETINprrLAGACEAYGATVTSnvlTVNVrksgTASDPCDSTDAISLLFAGGMATWNslgTSVTSADFtmstnvdsdsvTYRLTFEENVFL---- +>SRR4051812_4293204 +--EPLAAEQELLGQTWSDDFefLYELGASIYQHIFNTIPETRQLFPKIPTINNG----RwceSKEFRAQTLRFVQPLSFAVNNRHDierVAEHLFIIGVKHAKlvERGFRA----EYLDCALVSYFLKIFKFkyFIv---FIGFRT-------------- +>ERR1719295_1797159 +----------NIHVTFDLAltsDPKGFAENFYKGLLKEQPDIGQLFLD-----------KNTTFDTQSARFMAMLMHAIKMLDDtdhFTQSLDSLSEAHV-GYGVEVPMLDAFGKSLIAQVKVmnikyfeeqakggggggdekdeSLdimRvGEWTKKQDDSWKWFWSVVVGVMSAG-- +>GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold789473_1 # 1 # 552 # -1 # ID=789473_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.562 +--RIPPLKGSSLSAGWRTASSsgLS---------------------------------------------RNPRGTVSR-----ESGNTVFQSETF-AGAASPRGGSLL-C--FT--GENEPMGMINNLKT------------------ +>ERR1712012_1094824 +--SLTTSDIAAIRQSWILAkDaapFEVHGPAFYKLMFETYPSWRFAFNHMGGHLSIEVQIENTRFVKHTVTVFRFIDKCVNDLDNPtqiLENIKMVAKIHA-LQGIGVKDFIIIKAFICSKSD-KVGAGRSKNSFIFFPRFL------------ +>ERR1719232_197721 
+--SLTTSDIAAIRQSWTLAkDaapFEVHGPAFYKLMFETYPSWRLAFTHIGGHLPIEVQIGNSRFVKHTVTVFRFIDKCVNDLDNPtqlMDNIKLVAKIHA-FQGIGVKDFVIIKDVVLNYFSTALGPALTDAAALGWSnfmDLM------------ +>tr|A0A085MKY1|A0A085MKY1_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_01110 PE=3 SV=1 +--------ASIIKEQISKIEvNEENGGKLYEVFFTVKPEFHKFFD-LKHAPEGKDVAHNQRFKTLGKLFLEKLKRIVMACEDehqLKEEIKGLKMDHD-PRHVGLTELKGAKPILMKFIEQQVG--MTEEQKHAWTEMFKKF--------- +>tr|A0A183IBE5|A0A183IBE5_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 +--------KHVLMEHMKRLNlTNKLGGKFYHQLFQSlPEAKSQFA---EHFDKLEDVENMKYYQQLGHSLLSLLKELPEHCDDdhaLKQEIMKIKKKHD-EKHVDAKMFKKSKPAILKFLTDNTQ--MTNEEKEAWDHLITHS--------- +>ERR1712025_717817 +--TLSPEHVDPITESAPSGKakGMVIANNLYRKLFSRHEMFRAMFPE---QS------------QQSGKMIQALPSALydfavncDNMGQMQSVVARIANRHV-QQGVQGFDGTFQFIPKKVDLsliPAGQCEAKLKVALNARQPGtgvgdrFQLHPSEVC---- +>ERR1719495_824226 +------QDIENVRKTWEKMIakheLQGVGLVVLTAWMNEHKEIRQVFAK--SFPIIDklekdvldlVQLNDPTLNEHATIMASSFGKMIECLDDteFVQMMIDIGKKHT-GFRVSADSFDTsLNSTLITALMALSEEKEDSPNIKSWKTVVEVMKHYLKQ--- +>ERR1719210_734039 +--HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VFELKANEDLNRHGMYILGVIKKIVGKNDDteyLEKLFDDLSDLHR-RLGVEASGMDIFGKVFCKVMRPILLEkkKWKPEIKDSWMTFFSSIVKVMKK--- +>tr|A0A2T7P177|A0A2T7P177_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_12319 PE=3 SV=1 +-----------ITRSWKCFYekVCSFGVYEFLNLLTDLPEYEEAMRLI-KLTSSYKFLSAMDFNAHFLSMLTIIEKCMARLevDDlplLEDILHKVGTDHI-GRGVNPENFDLVIPPMVAGMKQMLEDKWTEKEDIAWTNFFTLMIHIMQE--- +>SRR6476620_7243483 +--MLSDTSLPVIQATLPVVgeHIEEIAKRFYKHMFDARPDLLdGLFNR-GNQADG------RQQQALAGSIAAFAGMLVDKPDEVpDHLLSRVAHKHV-SLGLSPDQYQIVHDHLFWAIVDVLGDAVTPEVAAAWDEVYWLMGNMLINKE- +>tr|B3RTB3|B3RTB3_TRIAD Predicted protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54902 PE=4 SV=1 +-----------------------------------------------------DLIKDPLVRSHGLRFMKAIETMLEIeFDSngCIFLFSAIGNRHC-SYGIEADYLDYVPQAFRFMLTKALGNNYTDKIASVWDEILSHIIKAMQDKV- +>tr|A0A2G9TV92|A0A2G9TV92_TELCI Uncharacterized protein (Fragment) 
OS=Teladorsagia circumcincta GN=TELCIR_17315 PE=4 SV=1 +-------------------------------------------------Q-KNSSSNKQAHRKT-----------------tsdTHQDL-RRTRDQP-CEKCPQSPRYHMLEPVLAVVKE-CNDDIDDETIQAWTTLYLIIAD-LIEIY- +>tr|A0A2R7X9G6|A0A2R7X9G6_ONCFA Uncharacterized protein (Fragment) OS=Oncopeltus fasciatus OX=7536 GN=OFAS_OFAS019380 PE=3 SV=1 +----PPVDINAVQKSWNGIKsslgdkaPEAVGKLVFENLFSNYPYMLEFFKNYGET--KEDILNNKKFMFHAKeRVFKTFDKTVNNLGNeaeLNNIASWLAEVHV-SRGIKPPDF------------------------------------------- +>ERR1712018_1077981 +----------------------LIGCQSFQAFFDRSPEILSHFDKFNAIEI-DGVLVSSALKMHSSRVLAIVEDMVENTGNpekIRTILQDLGRNHY-RQVKPILMhFLX----------------------------------------- +>ERR1719199_1665450 +--------KPMIRECAAKVvqmDIVELGLRFYVHLFTINPAASAFFTKPKWMI-----------SAIFGGVLRFYVHLFTINPAASAFFTK----------------------------------------------------------- +>tr|B3RTB2|B3RTB2_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54901 PE=3 SV=1 +-------------------------------LIKLSPATKIYFHGV-DFEkRDSYLAKNTFLRNHAARFMEAINVIIGQdMDIfsVESYFRVVGSKHH-SYNLKLEHVQDISDAFLEMARNALKKKFTKSTEAAWRSFFQMVTDAIKN--- +>tr|A0A1B6G4Z3|A0A1B6G4Z3_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.45438 PE=3 SV=1 +--RLDDNEMELIREGWKCITeSEDN----FRTAFSSKLaqknLAKVHFKHVENVSITDEGFSHEFLMSHSVDVMNTMHLMFNDIRNPeswMPEILRIATLHK-LFGVTLEDLKRFRCCVIEVLQQCLGEdGYTPQIKDVWDRVLECIEI------- +>ERR1719383_1602644 +------------------------------------------FGLH-L---------------------QSTMLVGNDLDpvdERG--PDHCQQALW-TASE-GRTLSHRRREPCRSVLEVLGEdVVTPEIGGAWREAVQALAKILID--- +>SRR6185437_4905046 +---------------------------------AENPEMEALFVR--DTA--------AL--VRGQMLAVVMEGFLDFVGDqdYsARLMQIERVNHE-GLGVAGRAPRHCGAAGGRSLTHFPGKP------------------------- +>SRR5512135_1032698 +--NMDQETLSTVDASLQRCNRdSRFLDLFYEKLLASSPKVREKFAH-TDFV------------RQKRALRSSLWMMLLVAEdeEkgPARYLRGLTAIHGSsGLDIGAELYDFWLDSLLETVAVCDP-EHDAKVNAAWERVMMVGIHYMCTHYH +>ERR1719336_1989132 
+------------------------------------------------------------QDRKGGgGTPGKLKVTAKYNDGtefVDefntvifaigrdactakmgleGVGVALNPKNG-KVlhneler-TSVDNIYAIGDvldgkpeltPVAIQAGKLLARrlAGTSEVTTDYVNVCTTVF-------- +>ERR1719278_462770 +--HLSTADVAILKGSWSVLEehVTRVGVDFFIDMMTNHEEIKAVFRQMPNIP-VYELKANEDLNRHGMYILGVIKKIVGKIDDteyLEKLFDDLSDLPL-LLLQQDRPHHLAKNLPKNVHSGSLYAeppvkvaEVVEELLQVLCV-VDLPHNLL----- +>ERR1719210_1454089 +-----------------------------------------------------------------------------------rrclgyacf----ASFHKSQ-TIlklshdrdrferqkknPQQSSSFRRCGTsmgqsesslTAANLTQAPTLRpaEWDPNMYQSL---------------- +>ERR1719284_537611 +--------------------TEEIHSEFQSLLLQHNLELLSVFNI-PRQS--------DDVIDAEteeiasHHLAGVVLAFAAHVGHVQRmrELDQLAAKHC-SHNVHPFHYVVLHEHLLDAMRKALSTMLTPEVQYSWSQSLLFFAKILID--- +>SRR6266536_2537548 +-APLSGREREIAMLAAAGLASKDIAERLYLSVRTVNNHLQHAYTKLG-VSGR------AGLAEQEIKFAEKLTEIVramPRLDELLTHTRALGARHV-SYGVRAADYQTLGNALLAALAAVLGGSFDAPTREAWTLAYNLVAETMLDG-- +>SRR3954465_13942299 +-HPLTGREREIAMLAAKGILSKDIAARLSLAVRTVDNHLQRAYTKLG-ITGR------DQLADVLAHDTTTHPGPX----------------------------------------------------------------------- +>SRR5699024_12637729 +--TLPKGDHPLV-----LVsaGIGCTPMVAMLHRLVETA--------------------------------RERQVLVLHADHTpEEHAX------------------------------------------------------------ +>ERR671932_89059 +--S-PTSCGPARACRSCCCtpTPPRRRSR------------YDgVHEG------------------------LMDLSSFPLPDD--ALFYLCgplpfmravREQLL-DLGVSPRDV--qyeVFGPDLWQADAdeGPGDAPEPgahdllgpEERQGPPPA-WSRPG------- +>SRR3712207_7345787 +--V-LDDVRALPNATVHVWyeSGAASALP------------VDgVHAG------------------------TMDVRSEEHTSELqSRQYLVCrlllekk--KTI------------kyeSTXX------------------------------------- +>KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1083625_1 # 3 # 881 # -1 # ID=1083625_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.686 +----------------------------------------MEYEI--------------CLEPSGIRFMADAGQNIVEAAKqhgIpIKHGCASGScgDCK-GTILsgDSEQGPFMPLLLLPTERAA-G-------MAILCKLYP-RSDLRL---- 
+>tr|A0A044RBY2|A0A044RBY2_ONCVO Uncharacterized protein OS=Onchocerca volvulus PE=3 SV=2 +--ILSEIQQELIRQSWQTISgklevtEQCFGFFVYRRVFERNASLKQVFHV-EEYDSLESVPNEHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAaesFNEETVRLFCSQVVCTVVDLLETDIDPSCMEAWIDMMRYIGCKLLDGF- +>tr|A0A0R3RKB4|A0A0R3RKB4_9BILA Uncharacterized protein OS=Elaeophora elaphi PE=3 SV=1 +--ILSEIQQELIRQSWQTITtklesnKRNFGFFLYQRVFKRNSMLKRAFHV-EEYDLLESVPEKHSIFRQMRLFTNLISLAVRHVDELeteiAPAVFRYGQRHY-KFAeeyFNEETVRLFCSQMVCTVADHLGGNVDPACMEAWIDMMRYIGCKLLDGF- +>ERR1719384_507171 +------------KKCWNELmkDKVNVGERIFDYILTKEISMSKLFMQ-------------TNIEQQSGIFMVMMDKVVGFLDDkesMNDNLIKLGQLHVEKYGVKTKHFKHFRAAFLKAIKKYLP--WNDRREEVGSSFGLELLIKCRC--- +>WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1216141_1 # 2 # 73 # -1 # ID=1216141_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.347 +---FPDGVCMATIELTVLPvRpled-----DEKFQIILSEAQGGASFNPNDD--------------G----GKDDGvlTIVIKNTLQDpkgLKVLVESFGFQHL-DFDLTVPRVVVFRDSMVELMEAELQDRFTYKAKDG----------------- +>ERR1712214_179591 +-------------------------------------------------------------PGHAgRREGRRSARQPGTGKDRqksTKYLLELGKFHR-FSGIPNDYFGVMGTIFVHAVRPYWEEagCASEQTEVVWMMLFAHIARVMTH--- +>ERR1719458_2209728 +--HLSDEHKTLVIDSWDFVPgfISEAGYKAFTDFVKLCPYYAEAFPFVKKKEEEF-SHLLCEHARKVTGEFGLLAKLISELKTkppeksndqvIHDIMVPLGRRHV-AF-------------------------------------------------- +>ERR1711928_171062 +---VSATQESHP-------------------------------LDLDSHE-IQQQRRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFET----LCFRWIQHD-----------CQQYGX--- +>ERR1711928_123369 +-------------------------------------------------------RRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFES----LCFRWIQHD-----------CQQYGX--- +>ERR1740128_75568 +---VTAQEKTLIRATWDQMMfNSEVAPKFMLRLFSEESQHELGgnFaVEHHLVP-GGadegLLLGSNDGFSNTLDVRVG-----------------------SHlLGNDAi-------DVVHDVFQCFLGGSIGRGDlfnglHHNMGRFVQLVDGX------ +>ERR1719219_701605 
+---VSAAHKSLTRSTWTLMKfNSNVAPKILYKMFTTYPET-QKMyTRLADIP-ASQLMENKQFLALSHSAFAGFNMIVNNMDDPELIKLQLSKVDFPGtFVYPFpgtsLNTSKPPASSWKYSPKN-SAPLSPRKPLPLELPFELRHQGFGK--- +>SRR6476646_9453568 +--PMLRTRLQLAEASYHRCAeSGAFYNTFYTHLLASDPRIPPMFAR-TEF------------ERQHRLLKHALGLLIIYAKHAnPAMLERIAQRHQ-EIGVLEDLYPAFVESLVLAVAEH-DPEYTPELADAWREALAPGIAFFIKRH- +>ERR1719347_2568912 +--------------------------LPPPTHFLPLPGINRKVRIFQRQFgnQTSEFLTGKALRDHSIRVMDALDSVIVDTlKgkDIHKQMVDIGYSHL-KMGVEPRQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED--- +>ERR1712189_147645 +----------------------------------------------KPDF---RIPDWKSTPRSQHQSHGSLDSVIVDMlKgkDIHKQMVDIGYSHL-KMGVEPKQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED--- +>ERR1719412_2466027 +--NLRPLDVTNIKESWHSVEqqLVEVGIRVFISLLENQPNIKRTFRKYRSKR-HSELRINEDLQKLILYLICGLKRVVKYLNDnkaMGKYLRRIVKKHS-PTEIDFTRINpaELSTVFCSAIKDIVdahqaasaklqsvsetsspectspSTCWTIEVEESWTTLFGSLLNATR---- +>ERR1711860_392201 +------------------------GVHVFLVLFESQPQMKRIFRSYRGKK-HSELRLNEDLQQLVMYLISVLKKIVKYLEEsrtIVKYLRRIAKKYS-SPSIDLARFDphILTPIRVRRRHLFSresivfekRLKWPQK--------------------- +>ERR1719266_3067024 +--QLAPNDIANIQSSWTLIEpiLLKVEMAWLLLFRHIAGFMRNGYNSVV----TGPL--------------------IRHTTNcatS--TSSRMSNX------------------------------------------------------- +>ERR1719264_357726 +--EVGLCDALNIQQVWPRIEqyLLPVGTRMYISILDGRCDKIIFCNKACCRKNasksssakstrsvysksvsrtcPNQVILNEELQKFVLLLMGLIRRAAKHLDNpshSAKVIRKVTKKrFG-KLNIDVTKIAfePIALNFIASVREIMtnTRHWNTETEASYYTLIRNLIAYVQ---- +>tr|A0A2G2R4B7|A0A2G2R4B7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_07540 PE=4 SV=1 +----------------------SASDKFYNVLQNDLPEFTQLFTN--PE-------------KQHMMFYAALRSIDGLKDNktkLAVYLRSIGVKHK-MLGLTHYHMEIGRNAFEQAIFA-GGKDLTHDQRQFYIDSFSQIEKNM----- +>APLak6261687352_1056175.scaffolds.fasta_scaffold62437_1 # 2 # 238 # 1 # ID=62437_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.447 
+-VRFPKDVIEEAQQAWMSFtmasTKEAAGEALYSAIFHAAPSLQSLYKIPR--------------PTMALRFMNSINAAVAIAHRpsaLKAQAEALGFQHF-DIDVTPSRGDIFREAILEVLDMELGSRFTTRARMAIGAILNYLIGANI---- +>GraSoiStandDraft_15_1057317.scaffolds.fasta_scaffold2262553_1 # 37 # 405 # -1 # ID=2262553_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.610 +-LQLSQSELFALGRSFELLlqglgnDRDRVGDAIYGAKTANLVVFKDKFITPR--------------AVLSLALFNGFRVLGHKSADpeeLRLFVETMAFKHL-GLDITLQRVTGVTDSFLELCQQNIKD-MPPGSLLAWRKLMTYTGSCFR---- +>Go1ome_3_1110792.scaffolds.fasta_scaffold06098_1 # 3 # 227 # -1 # ID=6098_1;partial=10;start_type=ATG;rbs_motif=AAA;rbs_spacer=15bp;gc_cont=0.524 +--VLSAGELAAARAAWDLMKDnVKVAESALVKHFVLHPPVQKLIPALADVP-ISELQGTTCSTPSPTRRC--ASPTTX---------------------------------------------------------------------- +>ERR1712142_1087278 +INALTETEVKVIIDSWDRIHPDKGAKMLFHQFLTDFPLMKIYFG-YQETESVAEIMESEQIKTRCKVVWDVLTKIVHASGDggkLAELVKEVSVKHL-NFNREKKDI----HCFLHALKVTLTC-FSGHLFRPWNIWCKMV--------- +>tr|A0A1I2S201|A0A1I2S201_9CORY Uncharacterized protein OS=Corynebacterium spheniscorum OX=185761 GN=SAMN05660282_00995 PE=4 SV=1 +--------------------SGHLEPELQLQLYARHPNAQWLLRAG---------------KAVPAELVELSIHAIAAADAegaldalAEARIRDLGLAQR-RFGFPSELYQDIQEIMVSLLRTTGAD-LPFPVEFAAERTIARVCVLLQE--- +>tr|Q8NLZ4|Q8NLZ4_CORGL 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases OS=Corynebacterium glutamicum (strain ATCC 13032 / DSM 20 +--------------------AQDFLRAVQAKLLTLAPQARGHFPTA--D------------DATHISIAEMVSALLEGTGEegkvddkTLEFFKEAALDAR-RFGLTPEMHSALGEAVRSELLSLCED-LPFENVLFAERAIAATTAVSVE--- +>tr|L1MAU4|L1MAU4_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium durum F0235 OX=1035195 GN=HMPREF9997_02488 PE=4 SV=1 +--------------------PDLFRTLAQRYFLDDCPEARFLFPTD--D------------STAHADLAAALIFVFNHSNAdgsltpkLVSILEQLGRDHR-KFQVADNHYERFGNALNRALKIVGAHAptYA---ITAAEKAITATLETMRR--- +>tr|W5Y4C7|W5Y4C7_9CORY Putative oxidoreductase OS=Corynebacterium vitaeruminis DSM 20294 OX=1224164 GN=B843_11695 PE=4 SV=1 
+--------------------REELSAIAFDMFFATQRDARTRIRA-------------------TPAIADALTLLARSCDSegklpldVEKRFLQRATTLC-AHGLRVDDLEPLAESAHRAMLITAGG-QPFELVLPIERALQQLARTVVE--- +>tr|A0A1W1UZL1|A0A1W1UZL1_9CORY NAD(P)H-flavin reductase OS=Corynebacterium glucuronolyticum OX=39791 GN=SAMN05660745_01670 PE=4 SV=1 +--------------------SPEFHEHVRANFFDKCPETMLVFPLH--K------------ENVHADLGRVLSFVFDRTPVdghltdeMRTLITQLGKDHR-KYNVSPRYFHPFVECLRDSLLTLCSD-LQFKYLNGADTALGEVSTLLAR--- +>tr|U3GX34|U3GX34_9CORY Uncharacterized protein OS=Corynebacterium argentoratense DSM 44202 OX=1348662 GN=CARG_08960 PE=4 SV=1 +--------------------LSHFGDLAHSALLRRAPGLIS---FF--G------------PNPHTELTTAVLFILTHSTPgpqdsgtqtplspridaaGAGALRALATEHV-AYMpPDPALYLAAADALCEALRDSCAD-QPFQQVLAAEKALREACSLMAT--- +>tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae OX=1717 GN=mphP PE=4 SV=1 +--------------------VTAHSIQAVADElraHRAEFIQAANQKP-------------------DSPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD-LPFAEVLGAERAITSIANTLT---- +>tr|C0E6D0|C0E6D0_9CORY Oxidoreductase, FAD-binding protein OS=Corynebacterium matruchotii ATCC 33806 OX=566549 GN=CORMATOL_02563 PE=4 SV=1 +--------------------GDGFSREVFTTYFRYVPDAQLIVSP-------------------DYPLGDALVGLFHGSDNegnlypeTIEHLRDVTEILA-AHGF--RRYRPLADAISPVLDRYCLD-ISAYDVFIIKRAVRQAAEVMDE--- +>tr|A0A0G3GTQ0|A0A0G3GTQ0_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium epidermidicanis OX=1050174 GN=CEPID_01535 PE=4 SV=1 +--------------------SPAFRRDVLRDFFSQHPHMRLKFAAN--E------------DHAHTELVFALTYLLENPTD-PELIRTLARDHI-KVSPGQEVVADFFAILHRQIHRYCAD-LPYEEVRQADLKLQEIA-------- +>tr|A0A0F6R111|A0A0F6R111_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium kutscheri OX=35755 GN=UL82_09495 PE=4 SV=1 +---------------------------MVASHfYADVPLARLSFRL-------------------QPSLVDTLIAGLSHP--lNITAW---AHDLA-HRGVDRSFYVPLSAALQHAVCHICSA-LPLVDVLAVEHRIDQIMKQLLA--- +>SRR5580704_16882803 
+-------------------------------------------PG--RH------------GCAAPAFLPGAQPYRRCPRgpegPRQPRALSAGTRAR-APKFGERHYEVFRRALIATLQRFAAPRWNETAKHAWETAFNHAATVMIE--- +>SRR5690348_1231357 +--------------------------------------------------------------------------------arapevrrPRAPLRG------G-QAGADRHASAVCRAELEP------------DRQARMGDRVQPRRRIMID--- +>SRR6476620_5060594 +----------PAQVSFWLLEpvADAAMTYFYAQLFAKATWTDREVY-----------------ISGPDHMIVKTA-RVLRERgapdRLIHYDLD----------------------------------------------------------- +>tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 OX=582737 GN=TSPGSL018_8354 PE=3 SV=1 +----SSKIITLIEKSWAFVEsrcdLMEVSNKFFERLFQRAPALQNMFTKPK--------------RVQYVMLAKALDLIVRSAGEtkvMNEDIKAIALRHI-KYDIRQEHLNVFGSVLVETLANSVGPeNWDEDISAAWASIYGNIAAVF----- +>tr|A0A1Q9C6P6|A0A1Q9C6P6_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene41206 PE=4 SV=1 +---CVCDLAQCRGRSWAAFFvdi--------QAAYYETSRS--LLFEGP---S-----------QDP----------ALVALQLpahVQALISDGALQGL-GI--PQEHIALLQDCvecsfwtftgqtqqvmatsgsrpgdgladvlFGALFAVILtcLEAKCQQCGLVHQSMSDALGVPDR---- +>SRR6476646_8240181 +-----------------------------------------------------------------------------NINLLF-ALNRHTCPNL-I------------HEPASEFfFGLQRPATH--HEHIRVENIHHL----IK--- +>SRR5688572_19725352 +-----------------------------KNLFELNPALRPLLPE---STAE-----------QDRLLTRLLNAEAGALAGTRPP----APRSAEGHGNEgTAPCSVAGEALLWTLQEAYGADFTPQARAAWEALYRFVTGTTKSAP- +>ERR1719229_1707680 +---------------------QQLGVLLFANLFKKQPLCRNLFAD-SDI------------SKQSLRLLDMFGWLLRSLVKeknqMrLRTLKSLGDRHV-KYGIKIEFFGPMLDSLSDALQDWFGTNYNTQTRVALTTLFQSACNEMMKQ-- +>SRR5512139_12076 +------TDLELIEASIEQMlDlETEIIGDTYARLFAHCDGARALFGP--NTYG-------P--RAQ--MVN---ETIIAGLDLLrgepwvHEYMTQHGVRHRHSYEVTDAMYRTYAESLLGAIRERLGDRFTPELEAAWS--------------- +>tr|A0A2E3FAX6|A0A2E3FAX6_9RHOB Uncharacterized protein OS=Rhodobacteraceae bacterium OX=1904441 GN=CML69_02715 PE=3 SV=1 
+---LPNENLELIRHSFPLIFqhKAEITTKFYEGLFRDAPELRRLFSK--EMNVQ---------KDMLVSVLTTLAKA--SFDEglVESMIARMARVHS-GLGITSGQFRTGEAALLSALDQSVGDLLSETTLDAWKTAVRRVISAMID--- +>tr|Q9NAV7|Q9NAV7_9ANNE Dehaloperoxidase B OS=Amphitrite ornata OX=129555 PE=1 SV=1 +---------------------RTYAQDIFLAFLNKYPDEKRNFKNYVGKS-DQELKSMAKFGDHTEKVFNLMMEVADRATDcvpLASDASTLVQMKQHS-GLTTGNFEKLFVALVEYMRA-SGQSFD---SQSWDRFG------------ +>tr|A0A0G3G1X4|A0A0G3G1X4_9GAMM Uncharacterized protein OS=Thioalkalivibrio versutus OX=106634 GN=TVD_07385 PE=4 SV=1 +--------PPNVESSYRRCcADASFLARFRLALRAADGQVSGIFDP-LSA------------RQQEVMLDASIRAALDFSSGdpqGASRVSEMIHVHGRQgrVPVPPALYPVWLESLIQAVRETDP-HWSDALERRWRAQLMPAVDMFVELYL +>ERR1719187_3161387 +--ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFINdFIPSQMADKATPNTIKAWEKFMTVFIEHVKE--- +>tr|A0A2E0SIT0|A0A2E0SIT0_9PLAN Globin OS=Planctomyces sp. GN=CMJ46_04905 PE=4 SV=1 +--PVSMTIVDSVRESYARCrQNPDFFDAFYDHFARKSSEIGPLFSN-TDMQ------------KQNELLSDAIDSLISFSEGdvaARRHLDEIALSHDReHLNIKPEWYPLWMEALRDTIHESDP-GATTQLLADWNTVLQPGVNHIVQQH- +>ERR1719487_109746 +----------EIEISHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALAMRHI-KYDAPPEFFGDFESAMLKTLAQSLKSTFTEAVKEAWKAALQFIASTIV---- +>ERR1719327_803055 +----------EIEITHPELlkiGLDNVGTTFYTNLFQDSPQIQMHFIK----P-------NRMLSYIVQKTIEMIGDLHPKPREVMKGLKALATSTC-ASSGSRLA--PRPSSTATSI---GRSPFRCRX-------------------- +>ERR1719356_1095802 +-------------------LMRDIPNTIVALFAI-TVAVfeddySSMLDQ----P-------FlliAVLGFVTLTvilLLNLLIAQLNTTYV-RIYQEVFGWALI-TRGNQIVEV----LD-ACPMS-VWKPFLETLGLDERLE---FNEGDIG---- +>ERR1719326_1696685 +--------------ASSTQikeLFADVDLS---------------IHA----P-------Ifa---------sTLQSTISSLNNPTELLPLLEDLGKKRI-KYGVQEEHVVAASASLIFTLK-SIDDQWSPQVEAAWTEACNVMQNVAS---- +>tr|A0A0N8ALQ3|A0A0N8ALQ3_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 
+---------------------------TKARLN----NCMLLFSE---------K---LAAFLaQASPSWPVWNVVIHPCfs--qelMANQLNVLGGAHQ-PRGATPVMLEQFXXXXSPPSSSSSSRKP-PASRNSSPN-------------- +>tr|A0A0P5ANB1|A0A0P5ANB1_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 +---GGNDGVETVSDQSNLFVVfAI-FGQGIDGNASEFDEVLLGAGSLLEEL-DEDGGNDGVAVTpDVFPaglniadlVGGQFSLGISQIfgflevlgdASdqsAHTVLPGLSGL-G-VEGAAQRFSKDFLSDVTELLEHDGVSSFNAEARQAWKNGMRALV-------- +>tr|A0A0P5ESR8|A0A0P5ESR8_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 +---------------------------------------------FLEDA-SELLEHDGGSS----TGFMGTTESVQLVghqllaeqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGNIGELAEHCLVL--GVGLDEA-EEDLGSDISV-L-------- +>tr|A0A0P5I7S0|A0A0P5I7S0_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1 +---------------------------------------------FLEDA-AELLEHDGGSS----TGLMGTTESVQLVghqllagqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGKISEGLEHLLVL--GVVLDE-TEEDLGRHISVLL-------- +>ERR1712168_1063860 +-----------------------------------------------------------CEKAPPIPDCTSSNTVMMRLFKrdpeVAKLIYDVGVQHQ-TRNINEDEMTKMSKSIYSAVQDINVGPHSDKELAALHNLLEVVSYHFKRG-- +>SRR5690349_6204932 +-TILTDEHRHFIRTSWEKINkrheKTTLGILMFEKVFAFLPDLRNVFGL-NDSS-VSETDRNENFRRHTSLVVNLIDLIIRNIFEmeaeMGPVLLMYGRRHFLKHDLVFQENQLVafAQGLCEFFEEEVDHdddnSLASETKAAWNIF------------- +>SRR4051812_9455799 +-GTLTPLRCQLLQKSWEAIIakygMFKPGMIMFQNIFKIQPELMEIFQI-SPEK-LGNFGDlPDEKFRHGRIFTNVLNLSVKNCVEleteVAPVLHLYGRRHVSKHNVDMAHHFLLvfAQGITSFLINEVK--------------------------- +>tr|A0A1Y3EGL3|A0A1Y3EGL3_9BILA Globin OS=Trichinella nativa GN=D917_02219 PE=3 SV=1 +--FLTKSQRQNVVRSWEKVpNKRALGEEIYIQIFMHKPMLKSLFP-FRTVP-VDQLRNNALFTRQAAIFADFIDCVVGYLaiNNgnlIMELSERVGVNHALMTSVnfDPEWWVLFANSVLDCIRQYCEPKFiclpisrhiTRKIMIAWRILLKEVVDRMSEAF- +>SRR5260370_37911868 +--------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVLTRIVERHSRpDLAVPPALYAPFVDSLIATGEQHDP-AFTPEVEHAWRSTAQTVVAYMTSRSX +>SRR5229473_1098235 
+--------GSRRTPAISSVVrGRDFSLRSIRNFFEACPAAVPRFAG-TDFE------------RQHKLLRHAVGVLLIFPKEPegePTVQTRIAERHSRrDLAVPPALCAPFVDSLIATGEQHDP-AFTRRWNTPGGAPPKRS----SPTX- +>tr|A0A1I8EE37|A0A1I8EE37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1 +---LSKSQRITIENSWKRATksnaREQVGIQLFARILTARPEMKHLFG-LQKIP-EGRLKYDPRFRRHAIVFIKSFDYIVKNVAykeKLEQHFQALGERHTIlqGRGFDPGYWDTFNDCMRQTVS-LWGKDKDHRTANTWHTLISFVLQNMKIG-- +>ERR1719264_1394560 +--------ISVVAANFKTVKSnQVLANTLFEHLFELEPSSKALFES-KDL------------TQLKTKFAGFIGQGLKMLqgKNAKKSSGSLPRCTW-RWE------------------------------------------------- +>ERR1712226_1819570 +---------------------------------QYDPSSRQVFEN-SNL------------TEHKQRFIGFIGKGIDTTiEGDREEWKDLVDMHV-DIGVTFKHFLAFEDAFLNTLHDLYADTFSDELLCAWIYVL------------ +>ERR1719326_1666808 +--------LDIVTKSYETVAAnSTFADILFERFFSYDESAKKLFGN-ADM------------ATHKKKLVGFIGKGLKMAqsSDPDGEMRKMAAFHK-EKKVEISHFIFFEESIIYALRGTLGVAFQDELADAWTLVI------------ +>ERR1712071_441310 +---IRRQGEDgrqrpvrhrqrtqrnpqtrlLSLESWTQKDrSPERPSQqvvghpkadccSSNRRFSHPPHGRRRPPW---LP-IQDANRLRAFPHQLHHQGRELP-----cRD--pKLsrX-------------------------------------------------------------- +>ERR1719432_409132 +---LRHQEHRrarrfrqqqerCPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPRIADVA-VSDLMNNRKFLSISYSAFAGFNFILNNMDDPEI--IKLQLSKV----DFPGMfvfpfpgtsqqHQ---dtsr-IVLEVFREELGAAFTAEAASGWTSLLNFVSQALIK--- +>ERR1712179_658195 +---VSGNSK-nAVRATFDQMRfNSEVAPKiml---KLFTAYPETQKMFHRIADVA-VSDLMNNRKFLHQLLCL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgyFSHR----IGSVQGRAGRRlh----RR--SRLWMDFSAELRQP--- +>ERR1712137_151953 +---LRHQEHRrarrfrqqqerRPRHFRSNEIQQRSCSQNHAQIVHCLPRDPENVPPHrrcprlgfdeqPQIP-VHQLLCLRRIQLHPQ--QHG---RSRDHQTPT--------VQG----RLPRHvrlplpwylsaAP---gyfs-HRIGSFREELGAAFTAEAASGWTSLLNFVSQALIK--- +>ERR1711946_32375 +------------------------------------------------------DEQPQIPVHQLLFL-RRIQLHPQQhgrsrDHQTpTVqgrLP-----RHV----RLPLPwylsaapgtYPPS----HSNHTARERTAfqvlFLPQDT--SRIVLEVFRE------- +>ERR1719222_1795957 
+---VSAKAKSLIRDSWVQMKfNGEIAPKIYLKTFAAHPKTLAMFPQFAKVP-NRVRPHPYEpLLATAGIDYDVKLWIPSPGSEHNInveELMARNArmleetrDTI----TVPATfmirmlas--------MSNFRR-AGNRSTNDE-------------------- +>ERR1719222_245222 +-------ARSlgrtqesHPLDLDSHEIqqqRRTQNPLQDVHHLSRDPENVHPFGRYTR------------FSAHGEQTVLGFESLCFRwiqhdcqqYGCSRA--DQVAVVQG----RLPRHfrlslpwhfsaTRANPRIILEVFAEELGSTFTKEAAAAWNSLLNFVTKGLEN--- +>ERR1711911_103569 +---------------------------------sraDQVAVVQGRLPRHFR---------------------LSLPW----------------------------HfsatranhphhlgsIR--RRTRLHFHQGSRCrleLPfelRHQGFRKQHRRLATHR---SRP--- +>tr|A0A0B2VDB7|A0A0B2VDB7_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_13543 PE=3 SV=1 +--SMNDDTKGAICEQWHTILalydgdISRVGVAVYQRIFDAEPQLREVFGIPSFV---TDLSEYEPFQRSGKLFMSVVDLCVRNIYALdaemGPVLVMYGRRHYHQqsRGFHLRYMPIFTQCMKEFVSDCLNEKQkTSDSEDGWSLLFDYIAAKIVDG-- +>tr|A0A0N0P721|A0A0N0P721_LEPSE Adenylate cyclase-like protein OS=Leptomonas seymouri GN=ABL78_2595 PE=4 SV=1 +---------FTVQGTWNILEkegmLERFAQQLYDELLTQNARLRVYFYGV-DL------------DEQSKSLVRMIGTAVHFYEKpqvTVEMFTKAGARHR-GYGVNGEVFEEMRDAFFRVFPKFVGADVFSAAEEEWQKFWKLMLDLLQH--- +>tr|S9WKS4|S9WKS4_9TRYP Adenylate cyclase-like protein OS=Angomonas deanei GN=AGDE_06844 PE=4 SV=1 +---------NTVLHSWKLLEdggkMDDFGDALYADLLNSNPYIRVFFYGV-QL------------SEQPKALMRMLGTAVYSLNNpnkVDDLFVKTGAKHR-GFGVTTETFQSMETSFFKIFPEFIGEDVYEKTKKEWHDFWKYIIKKLDQ--- +>tr|A0A2C9KGE7|A0A2C9KGE7_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 PE=3 SV=1 +--LVTDSDIQALRSSWATLTAgpdgrNVFGNNFVLWMLKTIPNMRERFEKFNAHQSDEALKNDNEFVKQVKLIVGGLQSFIDNLENpgqLQATIERLAAIHLKmRPSIGAGYFGPLQNNIHDFIEDTLKVGADDAAPKSWTRLLTAFNDVLNSY-- +>tr|A0A2E2XNM9|A0A2E2XNM9_9GAMM Uncharacterized protein OS=Cellvibrionaceae bacterium OX=2026723 GN=CL693_20675 PE=4 SV=1 +-------DIDWIESSLELLAphADRLGGLVYPRFFVHFPEAETLFGG-GELG-----------KSTQESMIVPLLMGLKDIADGKtymLTIERWLEDHR-EYGVTLPMYSVMLDSLLLGMREAVGDLWTTEMDGAWQEVLARLLLLVEGVY- +>tr|L7L9M1|L7L9M1_9ACTN Uncharacterized protein OS=Gordonia hirsuta DSM 44140 = NBRC 16056 OX=1121927 GN=GOHSU_25_00750 
PE=4 SV=1 +-------IRQAVLESLARYEesHGDPTRAIYERFYRVHPEAIEELAF-D--------------TVLENRMMAGILALLADVADGSidpGGAVYWVSDHV-AWEVSETMIMGMFGAVRDTVREGLGPEWTARMDADWAGLLAALAPAMRDAV- +>ERR1712232_1039451 +---------------------------------------------------------SEEMRTHATKVMTFVGNGVASIGNPEkcerfrAECIALGKKNQ-ERGISSQDYDIATQPFVDAVEHSwlqagwrqtdaSGSIWPPGAQGAYTKFYGHMAATIKDG-- +>tr|A0A0D6M6J3|A0A0D6M6J3_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=ANCCEY_05408 PE=4 SV=1 +--------------------------------MPSCVRTAVTLP-----------------YLEIFEPFVVIEGAVMSLDNlpaLDPILDNLGRRHG-KLEVNGKfrtyYWSTFLECSICIFRKTLTN-------------------------- +>SRR2546427_1691122 +--------VVLLQTTFLRAAemrigKRNITDFIYEDLFLKRPQLKPMFTN--Q-----------V--LQRHKLGKMLGSIFIHLRDqdwIDEHLRDLGAMHW-RAGATPEVYPWIKDSVLAVLEEGMAPsGWNLRCQREGAGALGVSAQGMLMGY- +>tr|A0A183IHG0|A0A183IHG0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 +--HFSLREKELLSVSMKKLEqlEEDNAVKIFIRLFQENPAYKSLFPKLRFMG-DADIVNSTALVAHTQLILKMIKTFINGFQNestCAVVLKRAETAHR-KFDIKPSQVSTLFPILMEILDIS-----HNETQAAWKKLFETFSI------- +>tr|A0A1B6JRB7|A0A1B6JRB7_9HEMI Uncharacterized protein OS=Homalodisca liturata GN=g.2446 PE=3 SV=1 +-ASLTDRDLRLGRATWFKNvDaTPDFGMVIFKELFRQYPDVESYFLHLRGN--AGSIFDSRTFRSHMTeRVVPKLKEVFEALDKpehLNEVMTKLGLYHA-KLGVSGHLVENMLSVILDALKSVMHTKMQPDEETAVRTC------------- +>SRR6185369_2033738 +---------------------------LRRVFI-QVASDRSDVSK-TNF------------KFQKLMLRQSLLEMLCfdrGMSGTREEIERLGLRHKV-LGVTPEMYAMWLDSLCEAIKQHDP-SYTPELEQLWRVAMLKSIKE------ +>tr|A0A0P5RQ13|A0A0P5RQ13_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1 +-TKLTPHQIRDVQRTWEHLRanRNAMVSSIFVKLFKETPRVQKHFAKFANVA-VDALPENGEFNKQIAPVAARLDTIISAMDDklqLLGNINYMRYPHQPPRAIPRQTFEDFARLPIESLEAS---GVSGDDMDSWKGVLTIFVNGVSMRY- +>SRR3954451_11513015 +--AASPCAQQLRQGCRDRPA---ACQLVLSSGVRDRPGCEIAVQ--GRH------------GEAGPQADGGADGLIDAIDRLDTI--------------------------------------VPAVEAAWTEAYTILATTMKD--- +>tr|A0A1S3CW24|A0A1S3CW24_DIACI uncharacterized protein LOC103506299 OS=Diaphorina 
citri OX=121845 GN=LOC103506299 PE=3 SV=1 +--GLTPKMVGLLKCLGVAIKPeaHRHGVNIFKKLFLMDKTVQRMFPKFACD-DMCGLDENPDFHKHVDAVMKSILYMMESSGsvpDMKSTLALQVKIHK-DLCIPDRHFITFGYAINEYLKETLGAKYSEDVECAVAYFWKFVASEMTAKP- +>ERR1719244_808981 +------------------------------------------------------------------------------------KAPRTRRPPRAALQRENALFQALSRAFLKAIKVYLP--WSDRREAAWQLLWQRIITQMTL--- +>tr|A0A2T5C1R0|A0A2T5C1R0_9BACT Hemoglobin-like flavoprotein OS=Mangrovibacterium marinum OX=1639118 GN=C8N47_108138 PE=4 SV=1 +---MTEADITVIEKSYAQIEAalPRMAKYFFNRANELDSDLDPLFEE--DK------------SKHGEAFVALFGKAVEHLNSPealLPEIKKMEAKLK-YYKFNEEVLNTVGVVFVDTLSFGFGNNFTQDIIDPWVKAYKTYSS------- +>tr|A0A1Z4LAZ9|A0A1Z4LAZ9_NOSLI Nitric oxide synthase oxygenase OS=Nostoc linckia NIES-25 GN=nos PE=4 SV=1 +--AVPPELLLKMADSWQVMsqNKQQMGIEFYQMLFEKYPFVLPIFGR-ADMD------------YLSLHLFQALEFLVNCLKTgssdeMLRELRFLGQVHG-SADVPTCAYPAITECMIALMERHVP-DLTPQVRQGWVTLLERVINIVK---- +>tr|A0A096P8B0|A0A096P8B0_OSTTA Flavoprotein pyridine nucleotide cytochrome reductase OS=Ostreococcus tauri GN=OT_ostta17g00030 PE=4 SV=1 +---------------------------------------------------------------------------------------masvgsgat----DDD-GVDVPVSRCPFAhGTVTVDPYPGYVH-G---KNPRVCPRGCVPRPPSKP---- +>ERR1712071_238239 +-----ERSFTYWKDSAMMELa--------KWNARLQTPR----------------VYEVKwRRKKRNIPGRVGWRVLGAELWVRSSCRRRIRNRPYQEYFVSyvsiSQQLEETARLIIDALDEELGVRFTSYTRGVWSR-aFHFANSIMAESF- +>tr|A0A2D4BL26|A0A2D4BL26_PYTIN Uncharacterized protein OS=Pythium insidiosum GN=PINS_002968 PE=4 SV=1 +---------------------TTLYDVFYAHLEQHSPELKPVFRS--SV------------HIRGKVLVHISVGMRTLIASenFVDKVLPLTKTHR-RFGVKPEHYEPLGRALLHAMQVVAL------ITRDRGRVEEPTSIILI---- +>tr|G8YSE7|G8YSE7_PICSO Piso0_001107 protein OS=Pichia sorbitophila (strain ATCC MYA-4447 / BCRC 22081 / CBS 7064 / NBRC 10061 / NRRL Y-12695) GN=Piso0 +--EITEQDIYRLSSSWNTIHtnsryhNDSFVSRLYANLLAANPKLLPVFSG--EN----------GLQEHSALFGELLSLTMIYLNDmptLKICIAAYARENPLFTEQCCEIVEPMGSALVLTLRQWLGKgVFDNELQELWIKVYVMLANTLL---- +>ERR1719431_737524 
+---LDMSQISDLQRCWSTLQlhmgEQAIAAAFYNDIITNFPSIQKYFKNIWTESTFtRTIGNMNDVRKHASLVVSRLTNYMGNLHHLsevNEDLKELGMIHAARYHITEEVVEQFVSSMATTVADLLTKedLFDPVLCGAWKRFFFMILTFLSEG-- +>tr|A0A0G4H5Q5|A0A0G4H5Q5_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) OX=1169540 GN=Vbra_6604 PE=3 SV=1 +---------------------SEIGIVFLHNLFSNAPTLQKLFVR---PS-----------ATYGRIFGQILKMLLAHLDDPAEvwqNNKELALRHI-KHGVRPSHVPLFSKLIVETFASIGGEEWTAEHTAAWQALWEVTGSELT---- +>ERR1719431_2380502 +--ELTDDEINEVQQSWDLLTRsegglREAGLTLNQQLLTAQPHHIRSFEKFRKYKDFDDILKSPEFKTHSYSTVREISLVITNLKHpgvFTQLTQSIGFAHR-RANTPPNQMVDFKSVFiNDFIPSQMADKATPNTIKAWEKFMTVFIEHVKEG-- +>tr|A0A1W2GS79|A0A1W2GS79_9BACT Uncharacterized protein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4044 PE=4 SV=1 +MKDLNIRERKNIRDTWKVLAPniHEFAFSFYSNLHSLDSSLVPLFEN--EF----------GIIKQGDKALYVLGFVVASLDNLmvaregiKKALEGVFMEHQ---HIKRADEQKVMKAFLQAMKSTLRGVWTNEIAISWYRLLSLISAVSI---- +>tr|U1JU51|U1JU51_9GAMM Uncharacterized protein OS=Pseudoalteromonas citrea DSM 8771 GN=PCIT_01118 PE=4 SV=1 +-MSISPYQYQLLTQSFTTLKPNFhcFCVSLH-TQLKNYNLELA-------------LPSSSkYLLNIEHNIQLFLSEGIALLPQQsalVDLIKRHKPHFD-ALKLSEQDIAVLCHTMLETLQLHLGRQFTLALRNAWRKALHMFANIIKS--- +>tr|A0A290TM25|A0A290TM25_PSEO7 Uncharacterized protein OS=Pseudoalteromonas piscicida GN=PPIS_a0207 PE=4 SV=1 +-MSITPYQYQLLTQTLASIRPNFhgFCTSWY-NQIQHYDLRMQ-------------IPTNVgQLIIWEHQIFDFVQNCVMRIPQQsnlLHYLQKQRGTLL-FMGTSEKDISVLLFTFYSNAKKSSWQAFYHSSKKRLEQSTVTHRKY------ +>tr|A0A2G1B531|A0A2G1B531_9GAMM Globin OS=Pseudoalteromonas sp. 
3D05 GN=CSC79_14765 PE=4 SV=1 +-MGISTLEKQLLLNSLHVVKPNFhcFSYTFQ-MHVKREPLDML-------------CLSNSKINEKTYILYCVLERIVMHLDNLrtvTPFIEHYAKNLS-NMGMSHQDTDILCNSFLATLKIHLKGCYPPKLESIWQHAINIFKSIVTG--- +>tr|A0Y309|A0Y309_9GAMM Uncharacterized protein OS=Alteromonadales bacterium TW-7 GN=ATW7_05751 PE=4 SV=1 +----MNSHKSVLLKSIGIIKPNFhaFTARFH-KKLVESDISMN-------------TLTAEQFNEKSYILYCTLERIIKNIDNPssvAPFLSHHLQFLK-KLNIQQSDIKPLTDIFYVTLVEHLGRFFNEESHLAWRKVLTYFERYTND--- +>tr|A0A0K1PX98|A0A0K1PX98_9DELT Uncharacterized protein OS=Labilithrix luteola OX=1391654 GN=AKJ09_04675 PE=4 SV=1 +---------VVLKESWHLSYrrAPDLAARFYEELSWKYPSARRLLDHVFGAQN--------DI---AVCLSTVAGDLLDNVDDpdaFSAAIVALANAHV-SLDIPPHVVAWMEEVLLDTLEGAAGDDWTPEMRTTWRNAYEDLASRLAR--- +>SRR4051794_15895678 +------------------------XmvgitqfyTEFYARLDTLDSSGKfdAIlsahtsgTNK---------------IAAKGEILIRIIKFALSIQGdnpavql----QLYLLGKS-HVQKRIRPWQYSIFVEAMIFTISSRLGTEATHEVMEAWVNIFAFILRSMLPQA- +>SRR6478672_7358577 +--------------------------------------------------------------------------SRmp--CNSstlkRRPSatscTESPTSTSP-WESAPSST-PSSASTYSPRSLRFWATPSPPRSPPRGGEVYWLFALQLV---- +>tr|A0A1Z5JZN5|A0A1Z5JZN5_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_19Hh029 PE=3 SV=1 +MEDISPDVVSAVQDSWERIKdsspawEDDFGDRFLKSIFTKAPLsYKLLFP-FGTT-SGPAMFESEDFIEAARTASTLMDMSVSLLecemDALFGQLLEIGLEHANFPRIQTSHWSMMRDALLRTLASYssaLSEDCKdlEKVLSAWSLVFDNLSNEMVE--- +>ERR1700744_2408068 +-----------------------------------HPEAESLFRR--GPS--------MR--CPTGRP----------RSGTPG----GscwtkliASAlSA-RHKSRRLKSSLPLEEIRADVGFLL--DRVVVAIDavgdervvRNDRVLVRLDRVQS---- +>tr|A0A2S3QTP4|A0A2S3QTP4_9PROT Uncharacterized protein OS=Halobacteriovorax sp. 
DA5 OX=2067553 GN=C0Z22_01530 PE=3 SV=1 +-------DKDLIIESFARIEpnLKNFTNAFFDNVVILEPGMQKVFAH-AD-------------REQLKaSFIRALSITINNLKNpeyLKYYLQGLGGNQI-KYEVSETYFPIFEEAFIQTLMLFHMNSWTPKLETAWRDCFYYIAEYIS---- +>ERR1719216_352717 +----------IIKSSWRIIQnkvIARHGTDFFIEIFDSQF---------KP-P----IGVTPVFQGHGEKMIQVVGKAIETLRDgKspteqesqelWDMLIENGRLYL-GYGALPMYFDVLGTFDCKHSKDNVIVntGNCGKQEM------------------ +>tr|A0A2D7G1P9|A0A2D7G1P9_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP96_10880 PE=4 SV=1 +-------EQTCIERVLDCAAedQPDFQQRLYDRFYQLAPSAEALMIHIDEE-------------VQGKMLAEVIRLFLsPDVaVTDQQYLLFETKNHAQAYFVEPEMYRALNQALFETLKVGAGRIWSSEVESAVHNRLSKMLHGILEAL- +>tr|A0A2E1GZ77|A0A2E1GZ77_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ03_04085 PE=4 SV=1 +-------DQAWIETAFDCAAvdNLNFNVDVYQTFYRAEPSVASLMAHIDEL-------------VQNKMLSEVIRLLLnPNIeSEEAGYLNFEVKTHIQGYGVSPLMFLSFNRAVYEVLQSSAARVWEDDLAVAVTRRFAVLSDALTEAL- +>tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ23_00915 PE=4 SV=1 +-------MQSSIHALLEQVAttDIDFDKKCFERFFQISEEGKTLMAHMDRV-------------HRGKMMAEIYRLMMaRDLDDEADYLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY- +>tr|A0A2G2R0S2|A0A2G2R0S2_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_09030 PE=4 SV=1 +--IVTPDQAIIIQESFARLStsSDSLIQDILGTIAEGNSDLAVTI-TF----------KSQNLVE---QISTALSHIIDQLhtaDNVAEYVAHFGELLL-AQNVQDENYSSFGEALLSGLENALQNDFTAEVRDAWTSGWAMLSGIMRE--- +>SRR5258705_7404034 +----------------------CPTSSSRPVLWAAvrdCAGGQTLVPR--RY------------DGTRLQADGDAGRCGQQSGQSRsrvAGGERSCQASR-RPWREGGYYTPVGAALLWTLEQGFRI-------------------------- +>tr|F0W0M6|F0W0M6_9STRA Uncharacterized protein AlNc14C5G666 OS=Albugo laibachii Nc14 OX=890382 GN=AlNc14C5G666 PE=3 SV=1 +---------------------------------LNAPELKPVFKT----------------SKHARnVVLQHIVGGLRTMlahDVHIERVRALTRTHL-QFGVKMEYFDLLGQAVIFSMRHCSGSHWSSEIEEAWRRLYGHCSVILL---- +>SRR5271163_4883858 
+----------RTDSLYAQLGgkttIASIVDRFYEKVL-ADPDLKPFFAK-ANM------------AGIKQRQAQFLTQALGGPIDA--RNHETRPAHA-SLLSDTRHFERAATHLAVTLSEM----------------------------- +>ERR1711911_155006 +--DIIRKNCLMLYTNFTATKiaFKWILLCLNCRYFEIKPEAQKLFPAFANVPL-KDLPKNYA-------FLAAVNTCFANVHYLIekagrnprdcPVFSKVV---A-KYD--ARDVKQFGDIMMNSLKSELGSQFTDEIEESWNLALEEIAKMVS---- +>tr|A0A286GHZ2|A0A286GHZ2_9BACT Sulfite reductase, alpha subunit (Flavoprotein) OS=Spirosoma fluviale GN=SAMN06269250_4620 PE=4 SV=1 +--ALTPDMIRLMRQVGDQLsaDARVIGTDFYHALFQTHPDIIPYFNR-TDID------------SLTEHLMQAVGFLVRSLASgvdITKELRELSQIHT-NFSVPPDAYPKLVEPLLTVMRKH-VPGFSTEQEHAWVILLNRVTNVLRQ--- +>ERR550539_353004 +--------------------------------------------------------------AMMQHLVKNLHDISRF---dsdIRELLTRLGQQWL-QKRVPLDFAVLLGNEYLEAvlpffHSNV-GATLALKLEVSLAYLYKEAMHFLLL--- +>LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3583117_1 # 3 # 191 # -1 # ID=3583117_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561 +--ALAPEAVTKMRAGAEAMlaHPQEAGVFFYETLFDARPDLVSLFRT-ANMD------------ALSRHLIDTVVFLSRAADDltgLRDDLRNLARVHQ-VNQIPPSEYAHLAAPLLETLSRF-GHPLDAQMIRGWEVLFDRVSRIVAE--- +>ERR1719359_219123 +------------------IdEepmaEVVSGeDALV----AIA-DLlyQKL-------------------------------------SGdeaMAQFLENVDLT--QlanNLRSLlalvfngsdWPEMHLS--gSLiddgYEDFSSILQETL----qaSPg-DDALL--ESLDKL---- +>ERR1719487_376807 +------------------EeEgateEVASGeEALV----AIA-DMlyQKL-------------------------------------SGdqaMAEFLENVDLA--QlakNLRTLlaavfegndWPEINLS--aSIidegYEDFSSVLQETL----qtCLg-DNAML--ESLDKL---- +>ERR1712100_485805 +---SVGHVVLVV---GRCSfEcrniVVVEGlDGSLDRLLALRkvvgiglGLPilQQL-------------------------------------G-VLRHVGNVA-------------------lKVlrchFLQFSNHVLEVRSRLRldefclvgdivievilrDHgggkHeRD--------------- +>ERR1719171_2780585 +--NLSEEMITEVQKSWSEVLrrvdsKTEIGRIIYDSLFDRLPHLRKMFKT-NRL-------------TVAMRFANSVHSLVGILNNkeqTEEYVYNMALRHV-QYwsgdgSIAQANMSAFLKAVLIVFDNALDDKWTQRMEEAWGALFSYVGEAMVA--- +>ERR1719265_1594411 
+--------VDTIVKDWAGLDLEKLGDTTFGMMVQNNPEIKTIFGG--DVHPG---VAQQGLKSQAATFVGFMSYAMTWLKKkdfivLEQKMVELGQRHV-HYGVNVSHFVSFQEAMFTALREQLGTRFE-DNKYAWTFT------------- +>ERR1740139_1939294 +----DSDTIAVVKQTWKAITalPeqqEYVGMRLLHNlhpcyetsltfllvielyylsYLRVVPSARAFFPPTSD-----SLIDDESFRESASNLMMCIDKAINTLENqrhlrFKALLQTYGKKLS-RLHIPPSCYTMAWFALIETLQDVLEDRFTELMLAYWIDIIDPINT------- +>SRR5690606_18427011 +---VSHRN---AHEKHQPCHaKL-------------RPLLRE-----------------PRLLRRLLY--DLSGqLTRR-A--GEVRPERHG-----GAEASAX--------------------------------------------- +>SRR5690606_42132731 +---MPMKNTNRVMQSYGRCCaSPGFFDDFYTTFLASSPAVREKSAQ-SDMA------AQKHLLRAGIP--NLVPLARG-M--PDTKLDRKSTRLN----------------------------------------------------- +>ERR1719487_109746 +-MIMSAEAVQVVQDSFHRVDscvqiRDALEDVFFPHLFASSTQIKELFAD---V----------DLNMQAPMFANILNSTISSLNNpteLRPLLADFGEKCK-KYGVQGEHIATAGESLIFTMKSI-DDQWDAEVEAAWMAACSAMENAA----- +>tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1 +-----PMEVALVQSTWQRFLesPnlTTEFSAIFQRMFQMVPTAMQAFRYV-NSTDLDSLVANKDLQKVVTMMMSEVNATLQLLDQpqaLISLIRSHGARHA-TYGVTRQWEETMLNAILYAVETKLSPsGFNQSEKNAWRSVLDMLGRNF----- +>tr|A0A0C9M7G1|A0A0C9M7G1_9FUNG Type 11 methyltransferase OS=Mucor ambiguus OX=91626 GN=MAM1_0030c02374 PE=3 SV=1 +--PPTQAQIDIVRYTWERVSeihldtddPtvsatHAFGLAFYDALFKLDPSLEPLFSNIFQQAralagMVSYIARSPKVTGPNKpksatSLsegcgmstaklekvptireinarkrketnATTFEELVSSAatskpkaeDDeeqLLYKLRELGARHY-FYNVEPKFLALVGPAALSALKTRLGKDFLPEVAEAWTRAHAYAAYHM----- +>ERR1719365_124985 +-SEMSGKQKKIVWRTWNSMLgkqesdYNDFGINFVLWLFDNFPKMRNKFDELYGR-SRNSLIVDQHFIAHTENVVKELDRLIKDLPFprlLSKRISKLADSHLNqEP-------------------------------------------------- +>tr|C9CRM3|C9CRM3_9RHOB Uncharacterized protein OS=Silicibacter sp. 
TrichCH4B OX=644076 GN=SCH4B_0097 PE=4 SV=1 +---ISSRDIDLLQSSCATAFlkKGVLASAFYNKLFEIEPAYVNKFS---NIN------------KQKIMFEAMLAYCISGITSgykVEALTARLRSYHM-HLEISDIDIANARSALMYALGSVLGEDFHSDLKQAWDAAFSSVSEALR---- +>SRR5688500_3946624 +---VDSRTIALIKESFTPIAgrTLELADRFFNNLFTRQTSVRGFFPA--DVTEQ---------KRQLPGVIQTILENGDKLENLEPQLREVGREYA-KQGALPTHYGAVARTFVDTVREMSGIGWQARYTRAWTSLFDSLTKAIV---- +>GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6338290_1 # 1 # 129 # -1 # ID=6338290_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.636 +-------------------ReagLEQYAGALLRSGFDDLEtllaiedadmkdLGIPaCHVVRlrkklqelqRQRSGTRGDFDASNP---VVAFL-----ENAGLGQya--KLLLQNGFDdmDV-LLDIEDADLKDLGvprghaIKLKKGLRELQLQQYAQEDPMPLHAAA------------ +>SRR4051794_36238122 +--------RRTAKASYLRLQgggrERAFFAAFYENLLVSCPDVKPFFVP-ERMA------------HQ----QSMLNRAIQLLLDFdracgCPQLRQLADGHA-GYQLTRWHYDQFVEALIRTIEQS-G-ITNPAELSAWRTTVMPAIEFM----- +>ADurb_Met_03_Slu_FD_contig_21_1037173_length_469_multi_2_in_0_out_0_1 # 1 # 468 # 1 # ID=69395_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.588 +--------RRTALASYLRFQspdkVQKFSRGLYEHLFDRHEELERLFKP--DLK------------AQ----YEALNRALQALVDFrpedpdsAKAIETIATRHR-GYSISKAHLVTFLDAVAVGLACA-D-ERDPETHDAWHEVLVAAFKPF----- +>SRR3569833_2455512 +--------MKDVQARFGRCClHPNFLDTFYNAFMATSPEVARLFKN-TDF------------TRQKKMLQMSLNLLIShamGIGIVDGYLHQLAAKHSRhHLNPEPQHTTPPPNSLMKAVNQHDP-KYTPSLDHARRTGHGHGIELI----- +>SRR5439155_1005251 +--------KATtalAKASYDRCCqAPEFLQVFYRNFLAACPEAVPRFAG-TNF------------DQQTRLLRHAIGLLLIfpnQPNKEPNLLARLARGPGPcRRQGCA--CGQ---DRSDRTARTDGAsrqrrcraPCSRRpdarGSRKWVRAAP----------- +>SRR5262245_66279004 +---LEPTDRIRAKQSYLKHcmGKNDFYRKFYERFFQGPEGTmakEMFAD--KDL------------NQQYVKLDQSLHYLLNFGDQdmMEpTVLTTTATIHQ-TKGVAPEQLERFIECLIDTLSKDYQV--SGIEVDAWKNVCGP---------- +>ERR1719277_2718232 +--VLTDETIAIVKSTAPAMKehAYKISETMYQNMFAEKPEIRKLFTP-EDQ----KVQPGQTQKKQPLNLARAIQAYATHIDDldkKKSRIGRRIDrvrkKEC-SIESKNG---FNGK-RSEIVKEELTELERKNVVLrakmdSMEREvkllkKKFLSDIS----- +>ERR1719209_1562507 
+-----------------------------------------------GDHsh-AQSYH-----EVHEHLWRSLAFSVLNQVlsrDkRIKQDLFNLGYTHH-ERGLKEDDMLQLEYAVIDGIHDHLV---TDVHERAWRKVFQLIRIHF----- +>ERR1719487_2840864 +-----------VRQSWAMIQaiqtS-sagGFGDALFFNISVMSSEIWSLFSV--SKE------------VMAVTFTDAFTLIVSYIADpvgLAEELFGEADGVG-DVGDDQGEGiregdghDLLGHGEQ--TPDLAAHDGDVEEERVAE--------------- +>ERR1719171_2815737 +----------------------agaendeelrensgvedsfasgsvptTFNEMFLFNLTVMGAGARK------NKA------------ImWMTEVLTSFDTIVANVANskrLQEECDVLGLRIS-KYPLDFVKLPEFKACMLSSLRSLLPRTWSGTHEVAWSWLWENIERML----- +>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 OX=905079 GN=GUITHDRAFT_143733 PE=3 SV=1 +--------SARIASSWTELvkksDYAEIGRRIYGS-VKANDTLEPLFR-FTNQ------------TVQGTKFVDMLSSIVENINNPqtiFEKVNELAPMHH-RKGVKAAHMPIMKGIIVSLLKHVLGDEFTNEDEEAWNWIWQYLTQILD---- +>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold759411_1 # 1 # 798 # -1 # ID=759411_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.594 +----------IAAQFWEEHiSyksladKLEIGCAIYFGMMVHNKEMKRILKKNlhhHQ-----------SIENSSVKFLDMMGWLLRSLlrSDidLCGSLQQLGAFHR-NMGVNINHFDPMLKSMHETFSYYFPIKYGIQIKYAIDQIFTLAARIMTG--- +>ERR1719396_104066 +---------FNIIESWELLRfhpslKEDLGTAIFRELFKEHPELREHFGL--PLVGLDALCKNQTFLSLSNQFVDVFARTMDTLGPdeelMDESIRELGEKCV-SIGIETSHLSLLRKPILSAVEKILLEDFDD---ESWKKFYSILATDLAE--- +>tr|A0A0P5AEE1|A0A0P5AEE1_9CRUS Di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1 +--KLtp--HQIQDVQRSWENI-rngLNALVSS-IFVKLFKETPRIQKFFAKF------ANVAVD------SLAGn----------------AEYEKQI-ALVD--TPTPNVEFPV-------------------------------------- +>tr|A0A0P4WPK3|A0A0P4WPK3_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1 +--KLap--HQIRDVQTSWENIRgdRNSIVPPSSSSSSRRLPAPRSTSSN--SLA-LPSMP--------------------CpKManttnklllGDklqLLCNINYMRYTHQPPRAIPRERFEDFARLLLDVLSSK---GVSADDMDSWRGVLTIFVDGVS---- +>ERR1719510_2339612 +--SLTDNEVILIKSSWTYLKPhiNTILIESFMSLFAENSDVKEKFYSFKNHAIEdlnkkrgVGLASTNGLQRHIPRVSRAITKVVNSIENldrVSRYLEMLGKIHQ-QIGIEVQELMMLGAFFINSSKRHLPSSMQAdrHYSDSWLHLFTVISTMMRKGF- 
+>tr|A0A2V3J537|A0A2V3J537_9FLOR Flavohemoprotein OS=Gracilariopsis chorda OX=448386 GN=BWQ96_00611 PE=4 SV=1 +----DPETEALIKNTLPIFtkHSQQIAVQLYANLFEQHPQLKPMFC-LEFLQTPGQCKKSPgtGMSPQAKILSDSIVNFCANLDNIdmmNNAIERICAKHV-SRHVKSDHYPAVAGAFSRAVRQVLKNELSESDLKAWDTAVSALAGVLV---- +>tr|A0A2G5SLB2|A0A2G5SLB2_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-17 PE=4 SV=1 +-TEMSDEEVSAIREVWIRAKTDNVGKKILQTLIEKRPKFAEYFG-IQSeSLDIRALNQSKEFHLQAHRIQNFLDTAVGSLGFcpissVYDMAHRIGQIHFY-RGVNfgADNWLVFKKVTVDQVTTGATDsSKekdkdetnsngtangkvdteanpipvgiadinnvysgeNCLARLGWNKLMTVIVREMKRGF- +>tr|A0A2P8XQA5|A0A2P8XQA5_BLAGE Uncharacterized protein OS=Blattella germanica OX=6973 GN=C0J52_27026 PE=3 SV=1 +---LAREEKKFITESWHAFmrLPPANSVDAFVKFLQENPKYIKFFKSVDGIP-LEDLRYSFRVPKHVTAVLLYVNSMVHCLDNADAMfflSLQVGLMHS-NMGLTVEDFKLFNGYMVNILEDELG--LNDEGVAVWNKVLEIFM-------- +>tr|T1FHE7|T1FHE7_HELRO Uncharacterized protein OS=Helobdella robusta OX=6412 GN=20208246 PE=3 SV=1 +-----------------------------GTLLQSNPLVKNTFEKFRQMDPMSDFTDSSVFSTHAMVVMSAFEDIFDNLDDseIVKDILEQGKSHG-KFseDFAPETFWAIEEPFMSSMKDILGRKMSSQLEKIYKKTIKFILSVLIKGLR +>SRR5580658_3791175 +-------DPALVREAWSFVSdrADQLVMNFYAELFYVFKEAPTMFPS--NMT--------RQRQEFGRAVVQWIIS--DDQEGL----------------------------------------------------------------- +>SRR3990167_4175368 +-TGLTDGEKGMIQQSWNLLSKVEFTKILYKKIFELAPHVRCLFQN--SIES-----------QHENfsIMMDMmINEHINDELDLFAVVLQLAKRHF-HYKVKTDYYSIFRDGFLWSLEQTLSIEtlnktITnestnqpTTIKSIWLKFVNYLISVMV---- +>LauGreDrversion2_5_1035112.scaffolds.fasta_scaffold830278_1 # 2 # 232 # -1 # ID=830278_1;partial=10;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.316 +-------------------------MAFWN----KHPEPAAQFVA---P----------TQdtltdefepeeeqGISKEQLLSALNAAQT----ALMMIDR----D------FNITYLNqKSVDLLKTHEALFQSIWPNFQATeefllGYCIdlfhanpshqrqmlsnpsNLPYTTTITVKDV- +>SoimicmetaTmtHMA_FD_contig_51_4416696_length_1368_multi_2_in_0_out_0_1 # 1 # 216 # -1 # ID=2511055_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.685 
+--------VALHTVEFAVADPsaRATI--------------------------------------------------ATHGLtpdDMAMLLSK---RE------------LIGPAFPALLDEFYGKVVEN---------------------- +>tr|Q5D2M7|Q5D2M7_9TREM Myoglobin 1 OS=Paragonimus westermani OX=34504 GN=myo1 PE=2 SV=1 +MAPLTQAEVDGVVSELNPfLAsdakKVELGLGAYKALLTAKPEYIQLFSKLHGLT-IDNVFQSEGIKYYARTLVEDLVKMLTAAAKddeLQKVLVHSGHQHT-TRKVTKQQFLSGEPIFIDFFNKTLSK---PENKAAMEKFLKHAFPVIANN-- +>tr|A0A1S8X4B3|A0A1S8X4B3_9TREM Globin OS=Opisthorchis viverrini OX=6198 GN=X801_02811 PE=3 SV=1 +MAPLTQSQIAGIHKELLPiLSndeaKTSFGVGAYKAFLGAHPEYIQYFSKLNGLT-IDNVFESEGIKYYGRTLVDEIVKMLTAGADdekLKQVLHDSGKAHT-ARNIDNATFMvsklfmflkrvsemrlarglygpfpifaqSGLPVFVDYFNKSLTV---PENQTAMEAFLNHVFPNISKD-- +>ERR1719167_330163 +-IDLTDKERELIQHTWWRFREEpYCRLRIMTHYFSANSSIKKKFQR-KNEENAAngNlmtAMVSWNIRRFSIRLVEFMDKVVRDLETEnyqdiYDISELQGAKHYRlKRMVEPGDMEALGQSIQTTISEHFGEKFNRSHILAWRRLFIVICSRF----- +>tr|A0A0T6BC68|A0A0T6BC68_9SCAR Uncharacterized protein OS=Oryctes borbonicus OX=1629725 GN=AMK59_2266 PE=3 SV=1 +-TGLTSQQKSLIQSTFNVIRPhiLNVGIDLFVRVLEVEPEHHRVLP-FSHIP-IADLHESFEFKFHCLAVVYSCSAIIDHLHDdgiLIPLMKKYASDL--KASIPLDIFQMIHDPLLEALDVHDDVKISEEALEAVRTLLRNLTNFLI---- +>ERR1719199_1566639 +---------------------------IFQHSGIQRPVFSTSSSSR-R-------------LCRP-CDLSMAFRPSDVLHSstrLKAQVETMGFGHL-HLDVTPARCKLFHGALVDFFVVELGDKLTPLAAEGWKRVLTYVASGLM---- +>ERR1719362_342361 +--RLSASAVTFLRSSWEHVPKDSFGMEFMKRACSEEPSLSDVFDC-P-V-------------ARPDNLAKVVQMLLDQAEielvprleRLAHGIAALSFKFG---KLRMSHLAPMKRALVRTVVAFAPGNQKAMTNRAWEAFFYAIAAVVA---- +>ETNmetMinimDraft_19_1059907.scaffolds.fasta_scaffold284136_1 # 1 # 639 # -1 # ID=284136_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.595 +--RLPKACVSLLRQSWKQVPQASFRKEFFDRLYIEDSSLQQIFQH-PMV-------------EVPENAWNVVQLMLDLLNvenvprleRFVHALAGLAFRHG---RFRLAHLAPIKRALVRTVTSHASKQEKKKLSQAWEAFFYALAAVAA---- +>SRR5262245_21272653 +------QNVEVFRASLKRCLaAPYFMSRFYDLFMGSSDEVREHFGD-TDFK------VETRVLADSLYLMAVIAQ-GEAEAPAWTEMSRLAKRHSKaELDICPELYDLWLKCLIEAARLHD-AQFSEAVEQAWRATLAPGIEYLSSRRX 
+>tr|A0A2A4SWC3|A0A2A4SWC3_9GAMM Uncharacterized protein OS=Thiotrichales bacterium GN=COB61_05140 PE=4 SV=1 +------MEFQDIRTSMGRAItHGDLFGRFYDIFLASNPKIKSMFVG-TNLE------TQKALLRQGVNLALMFAE-GKAIGK--SAMNRLRDSHSKsHLGIEPSMYRYWLDSFIKALKEFD-PDFDSALEKQWRQALGAAIEHIAAGYS +>tr|A0A1R1LTH4|A0A1R1LTH4_9GAMM Globin OS=Motiliproteus sp. MSK22-1 GN=BGP75_17400 PE=4 SV=1 +------DFEHIFDSSYsrvlAVTYnKQGFFETFYQRFVVADEKVSELFKN-TDMA------RQQKLLESSVYFLRDFYT--TSYAD--DVLQKIAILHSKrVLDIPPALYDLWLEVLLSTVSDFD-PLFDENIELAWRLVLSAGITFMKFKHN +>tr|A0A2A2KP63|A0A2A2KP63_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_06989 PE=3 SV=1 +-SGLTREEKRIIQVCWFKCNqkqLRKCAEDIFADILHMDDDLLRLFR-L-DHIQSNRLRDAEFFKSHASNFAIVLSLVVTNLQEhVeqaCEALQNLGRQHAA-F--LDKFFQSMyWDTFTDCFERNPPPAFRKgSEREAWSRMILFIIAQMKIGFQ +>tr|A0A1I7TYQ0|A0A1I7TYQ0_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=3 SV=1 +-SGLTRDDKRIIETCWFKCSqkqLRKSSCDMFWDILHTDEDILRLFR-L-DHVSPNRLKDNEYFKSHASNLALVLNLVVTNLQDnFeqaQDALQALGYQHLH-L--IDRtHFQSMyWDIFTDCFERNPPPSFRKgAEREVWSRMILFIMGQMKTGYQ +>SRR5215204_501118 +--RVTRRDWQRLLENWERLQpsADRFATVFFDTLFAWEPQARQLFGG-------------ATLETQFLRFAHLLTSLVSAQDHpdeLDRRIDAVIRCFA-GGDPPRKREDAIRVAVAAMLNDVYAAGITPETRASWQSAYIGVITTIRS--- +>tr|H3NRG3|H3NRG3_9GAMM Uncharacterized protein OS=gamma proteobacterium HIMB55 GN=OMB55_00005550 PE=4 SV=1 +----SQSDIAIISESLTLCgdCLEDITPHVYRRFFELDASAASLMEYS-DEH------------MRGR----MFASVLELFlsddpFESDGFLAWELDNHVSSYSVTKSMYESLFKAFFEVAEETLGEDWSGDFERAWTNRIARIMAEVS---- +>tr|A0A2V1ABH2|A0A2V1ABH2_9ASCO Uncharacterized protein OS=[Candida] duobushaemulonis OX=1231522 GN=CXQ87_003270 PE=4 SV=1 +--QLSTADRNKVRASWGDAMaakdykTEQVIHEMFSSLIEQSEDARDLFEN--KK----------VRAQQETLFAEIMGFTMMYLHNitvLDECMNEFIREnpHIVRCGV--RYLEPMGAVLIQYLRQTLGPQFHAGLETLWVQTYIYIANCIL---- +>ERR1719396_219344 +-------------NTAAAVAPkaLDITKTFYGGMLQDYPELLAYFNPAHNVP---------ISENQPMALAGSIVAYASNIRDLSPllvpngPLMAICHRHC-ALCITPPQYNVVHENVMKSIAKVLGASSRRRSRPPGARRSSSSRR-PA---- +>ERR1719396_178111 
+--------------------------------------------------------------------AHGPGRLHRRLREQHPglvpaagaqrPADGDLPPAL-RLVYHPPAVQRGARERDEVHRQGPGGVVTPEIAAAWSEAVLFLSKACI---- +>SRR3546814_8055804 +---------------------KDITPFFYDRFFALYPEQRANFYHFES--------------TSGTMVNEMITSVLALASNearSEEHT-----------sELQSLMRISYAVFCLKKKNKT----------------------------- +>SRR3546814_13566968 +---------------------FTIYTTLSLNVVLPFVTHRSNFDHVES--------------TSESMVIEMITLVLALASKeawLTNSFQNFVAALR-SYgDIPPDAYARLLDVLVVTLAQVAGSRWTDEFETAWRWYVSGM--------- +>ERR1719171_2136978 +---------EAIRITVPMLEeigLENVGQVFYGHLFTESPQIQMHFIK------------------PNRMLAYIVRKAIFMVRDlhpkpkeVMAELKPLALRHI-KYDAPPELFADFLVSFTKTLEENLKEGFTTDCAEGWESATNFLANTITR--- +>ERR1719171_2291403 +---------PRIcgelwrkqtfklrfnilgkqihspgiPRFFQKMEnvgGLLVSalllaMCFYDPEIvAHEEQIGIHIID------------------RNDAIYYVLEACNACILWllvtnVFGFSvQLSAFKHC-VSQMaeDLAKFGTFAVVFLMAFGCAIhiTMPYDPDFEDMWVTILTLFAI------- +>UPI000297C1C9 status=active +--ELDEYSIGEVRNGWENLERRCGtPKAAA-EEFLHKVSAAIPKTE--HM------------QKRASTVWSKLNGLLASMHDqsmFTGQLEYLALRHM-NQDISAAEIETFKGLLLEFCASKLGGMMTPEFQYGVSRLVDAVGASYQ---- +>ERR1719334_589756 +-IMLSPAAIQAIKSSWQHV--KNVGFQFFGHLLfsfwlGNQPRALEIYCLHyhGDKR-KGVVELLPRFRRLGEIYAKRIDTWVSHLDDPftlFLILYEHGFNPP-KKavGINEKDFELMVPSLMDAISSAMGSKMTHRLFEQWKSFWKYVLTQIAEG-- +>tr|A0A0E9N6V9|A0A0E9N6V9_9BACT Uncharacterized protein OS=Flavihumibacter petaseus NBRC 106054 OX=1220578 GN=FPE01S_06_00290 PE=4 SV=1 +--QMNQQEIQLVCQSWQQAAeePLRLAILFFDRLFEEAPELRQVFRT--PMS------------EKTRQLLVFFGFHINRLASgsIrRPSFEAYVW----EELLTDAQKGFLMETLSDTVAALLKPDWTPALQGAWGSFRK----------- +>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1 +--------NDLVLSSWDIVRqrteVQELGEKFWKYLNCMSPEQTNLFRR--SL------------SMWGhllHHIVNMLLISITDPEEYYDLMFELTIRHI-RYGVRSEYLNPFGNALFATFEEILSDVWEEKTTKAWKLVWKRATCNMSRG-- +>ERR1719242_319529 +------EYKNVLQSTWTKLlqKKEEIGKRIYESIvFDTTC-TT----T-GTSLSTSIIFENTNIGQSASRFMDMLDTVICKLDEpdaLVQKLEALSAFHSSNFNVQKRHYIDFEKGFMKAIKWELGAQRTILHDRAWRWFWNFLISKMC---- 
+>KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1947561_2 # 429 # 647 # 1 # ID=1947561_2;partial=01;start_type=ATG;rbs_motif=AGGAG/GGAGG;rbs_spacer=11-12bp;gc_cont=0.584 +-------------------------------LFETNSDIKTMFAKLKDYETVAELRSSKILEDHSMKVICTIDDAIANLDDMeyvNRMLQTIAQAHSTRFpNFDPEFFM------------------------------------------ +>SRR4029077_13489679 +-----------VQADVHAISvm--LNLMQPFRALRRRVDQFAKLWL--DPL------------WKTGRKAARIPA--TSTSITGRtgfAGRGRTGKAAC----------------------------------------------------- +>SRR5579859_1650388 +------------------------------------------------------------------NFLQALHTILLKMQRhdpsVFQFVQQLGARHE-KYGVTREHFRLVGGFFLTVLQRYVGVLWTRPMQRTWEALFGVLTDVMLFGY- +>tr|A0A0N4ZKI8|A0A0N4ZKI8_PARTI Uncharacterized protein OS=Parastrongyloides trichosuri PE=3 SV=1 +--GLTYYQIQAIQRAWRHMSkagQVSCGRQIITKIYKNNTEIRNIFQTYVTIENLS-INQMepveWGVLKHGEEIVNLLDYVIKNLNNIemvEEKCEEVGRSHRKmkQYGMKEEHWDSLGEALSETIRENYG--------------------------- +>ERR1719326_2865515 +--NMPPEAIEQVKATWTKLLsmttHIELGSLMYDALFEKLPKIRSMFVS-------------PRL-ATASRGETNIDRIFGSFSKSas--------------YMrdpssMX----------------------------------------------- +>GraSoiStandDraft_16_1057320.scaffolds.fasta_scaffold4300996_1 # 1 # 264 # 1 # ID=4300996_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.629 +------------------------TQAFYEEYFRLCPDSRDLMKHV-DEH------------VQGRMLASVHELLMLPDPDEQaRFIAFETQTHR-SYGARRYMYDRLFRALRSVVRDVSGDDWNPAWTTPGIAASRPCSRAST---- +>ERR1719174_1428107 +---------------------------------------------------------------------------VVDCQDqrsTLGYPPSAST----SVRCCVEQVARRaflwrkswfLTTLTIFIAGQ-AiLKYSHLDNLATERLLVFLFRAFI---- +>ERR1719284_2194575 +----------------------------------------------------------------------------SWREStssMRPCPPSLKL----LGIASL-------------------HSLKLDEKLEFGNGdIGLPGGIQI---- +>ERR1719277_1813735 +----------------------------------------------------------------------------------CMCAAETRIAHL-IGRASVANMHNLRNAVGSEVCLLSSlAIRFEANHVGWAHVsvadvVAVCSSISL---- +>ERR1719310_1375130 
+--MLPQEQSQQLQQAWALVinmsgNRDALADLIYSAFFYRLGePR-APLRNPA--------------GSRSLPFLHGHQHLRRQLRrPwssaqfrrnveLRSHVLGYHRPSG-EHHSX----------------------------------------------- +>ERR1719310_407492 +--ILPLEQSEQLQQAWALVinmsgNRDALADLIYSAFFGASASLEYLFVTPR--------------AVAAFRFFTGINTFV-AFCgDpaqLRRNSQLRSHvpGHY-NSSCEHHPX------------------------------------------- +>MEHZ01.5.fsa_nt_MEHZ011529165.1_2 # 173 # 307 # -1 # ID=206391_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.393 +--YMSIDtgnleaakvmlqdlvtiradrsryyyclddlFKWHPDIVWKLTv--------------DAPELLrtmldGMIWRSRV--------------VvngnrrvnyylkhllvDEHGKFSNAM-SCIVKLQDpEIAIHPILVQ----LGDLVWNDLVYWrflrgklslVCTAGIFMVSQSMl-QYVESAGSFEERVATFICRLVV---- +>tr|A0A067CC73|A0A067CC73_SAPPC Uncharacterized protein OS=Saprolegnia parasitica (strain CBS 223.65) GN=SPRG_06598 PE=4 SV=1 +--ILNTAYLLDCSKSWKLIVtantdrMRQYgksgivlfYDEFFFRLFQRDFTLEEVFP---DI------------GKRGEVLVKAMTFMLKSSaENpkqIVNKCHYLGHRHRSFGGVRPHHWAQYTSTVIEVIMYWLGEYASPDVGAAWSNIVGFFLMHILESF- +>ERR1712194_94606 +-----------VQDTWISATctfeyKECLGTQLLYNLMHIEPSFLDAAPFFDNTVLLGDGFDDESLIQCAIYIVQCITELVTMLDKyHEPKFRILINSHLSrlaKYNIYPSSFAKVAQALLMTLSDVMQEEFTKKVESYWMSVLIILF-------- +>tr|A0A2M8U0Y4|A0A2M8U0Y4_9PROT Uncharacterized protein OS=Ferrovibrio sp. 
OX=1917215 GN=CTR53_17535 PE=4 SV=1 +-SPLSPAHLGLVRATFQILAadRDRLTEMFYARAVALDPHIQRPQ-----LV--------SNMVAQRLQFMLVLTDVVQQLDDLpslAQTAATFARRHG-TYGASDPRFRTARAALAWAVDRILETERNSAIQLAWNAAFDLVEALV----- +>tr|A0A1I8F573|A0A1I8F573_9PLAT Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1 +-------------------------------------------------------STNQKPPSDGDRLLYWINVQ------ptAQPQLLRGASEGC-VRLFSPRILTRSCISSNLCVRAGRGRNS----SSTeTTSAEGADAVVAA---- +>SRR2546429_8650734 +------DAQYLLTESLAVLRpyADELVAEFADRLATGHPALGAIFEP--RL----------------LTVLLELAATYDRPQGLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTLRDFPGAAWTPAHHGARVRAYAFAAATMM---- +>SRR2546423_13669166 +------DDQYLLTESLAVLTpcADELAAEFADRLATGHPALRAIFEP--RL----------------LTVLLELAATYDRPQRLLPALATMGRRYR-RYGAGVEDYAAGGGVLLGTPRDFAGAPGAPAPHRAGGRADAVAAAPPK---- +>SRR5690348_18181078 +------------------SrrRHTRWTGDWSSDVCSSDLETRALFRT--EGS------------ELVKG--SMLAMTVEAIIDFAgersGkfrMIACEVMSHD-AYGTSRELRSEERRVGKEC--RFGWVAYPX---------------------- +>ERR1719323_1074371 +--LIPFEQRTLITEVWNVLQestIRYVSNTMFLpLIVRSNKSLQKCFAALDQSLHGMELVECygSkfDRTKHGSLFLSKlLIRVVPNMDQmdrVLPYLAELGALHQ-RHGVAKQHIDLLGLAFCAAIRGVVAgggvkGGHLHETTKAWITLIQAVCTGMKMGY- +>tr|A0A1I8C1X6|A0A1I8C1X6_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=3 SV=1 +--DLSPHQIGLIKRAWKNLlksvNENEIAIKLLLRIFQLDPRNLAYFSL-NEYSPFDeyLIKENNIFINHVKTFESTLINVMTHPGNatkLSKHLQQLGGRHVNYTGVTykCSYWKCFIQSLIDVLTLNKDKNTSEDLHEAILILGEFCVEQMKIGY- +>tr|A0A0N5CQY3|A0A0N5CQY3_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1 +--QLNAPQLLLVRKTWAHARSqGalEPAMSIFRNSFFKCSEIRSLIMN------GPKNEGHERLKSHAKAFTEIMDQLICGLETkelIMYELRAAGRSHIFLprdatdnkskgCTFRLAHFEHFASAMIErTLEWGEKKDRNETTQTAWTKIVLFVTEQLREGYQ +>SRR4051812_28599342 +------------------------------------------------------------------------------WVRprsRGGRSPRSRSSRS-SARRWPSGRPRPPSTS--RPDMRSGPSscgmsrarwqsifpapsrtgcasPIGVLGDP----------------- +>SRR6516225_8820395 
+---------------YSVHCegKTNFYRLFYKRFFDKPPKWRTFFRK-HKIS----------MARQY----KLLDQAVASLANFHigaepTSLSHVARVHA-NLQLGREQYAMFTDSFLESISEM-GEK-DED--------------------- +>SRR3569833_2822653 +----------------------------------APPERHTVLHE--AI------------VTNPVEVAGAIGWVVEHLHRteeVATACGELGPALARLLAGHEQHLDACGRSIIDAIRTGLADRWKPEFDGATSSAWELVAEWLRRG-- +>SRR4051812_2284027 +----------------------------------TLPEMRTVLHD--AA------------IADPHALGRAVVWLMDNLTRpfvVTAGCELIGPALGDLLAEHPRDLEAFEPALTDAFRTALGTAWKPDHVTALHQAWDLTVKW------ +>tr|L8JU91|L8JU91_9BACT Uncharacterized protein OS=Fulvivirga imtechensis AK7 GN=C900_03083 PE=4 SV=1 +--TMEIGKITLVQNSYGRCL---SSGKLLETfyenFLSSSRDVADKFR-------------NTDFEQQRKLLRHGINLMIMYAaGNIagQTGLKRIKESHSRgRMNIEPRFYALWKAALIKAIAEHD-RDFNVEIKAAWNEVLDKGIVLITEGY- +>tr|A0A1Z9IBY6|A0A1Z9IBY6_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium TMED162 GN=CBD22_07770 PE=4 SV=1 +MVGVTQTQEQLIEQSLTHYAarHGDPYDAAFQKLYAAAPHYEGLFVL--DTD--EGLR-----RNMMRTTLEMIATYIDDAYAAENLVTGARLVHL-TYEITDD-FDLFFQITRDVIAEGCADIWSDAHAAAWNTMLKDF--------- +>ERR1719295_1776256 +--YLQPQEIVHIQGSWATVErqLFNLGARVFISLMENQPNIKRTFRQYRNKR-HSELRINEDLQKLIMLLLCGMKRVVKYLNDtkaLTKYLKRMAKRHSPTeidfARINPAEVASVFCAALREIAPAEKDQWTQEVEDSWTSLIGGLLAA------ +>ERR1712029_417561 +-------------------------------------------------H-GSDWKV-VQVDRIILI-FRTIT--------vIIVRVQSVEKDHI-hT--------RKSF---------TQVLKVETVVEDSWTSLIGGLLAA------ +>ERR1712071_338654 +---PTAEEIALIRESWPIVKkNKNVFVEFVLEHFRVHPKTQDLLPEFANLAI-ADMPSNKFFVQLTEtYVVMAMQEIIDNLDNagvLTDLLQCLNSNWYVDyVSLDRQN-RETLRIRRVGQEQKSYSRNMESneiQQQRCPQNLRQAVH------- +>ERR1712179_849736 +---PSAGV-------------------------------------------------------------------------PVNKLEENEDFQVLAyYSSAVATFivtnLDQEDILTHILVQQTKP--------------EQFVD------- +>tr|A0A077ZE79|A0A077ZE79_TRITR Globin OS=Trichuris trichiura OX=36087 GN=TTRE_0000613901 PE=3 SV=1 
+-------EWYNFKNFWKTVQrnKDNCAKLMFFKYLEQNPDLLQAYAKLRNMEMNeETAFNNSDFEHLANQYLDVFDEAITTIEsnpgDvssVVEELQNVGKRHRRIscieassfavtttvskDWLSVAILQKLQEGFMEMARQVLQDRFTEKCENSFGKFFDFVAKNLQQGF- +>tr|Q7M422|Q7M422_9DIPT Hemoglobin V OS=Tokunagayusurika akamusi OX=28383 PE=1 SV=1 +-VGLSDSEEKLVRDAWAPIHGDlqGTANTVFYNYLKKYPSNQDKFETLKGHP-LDEVKDTANFKLIAGRIFTIFDNCVKNVGNdkgFQKVIADMSGPHV-ARPITHGSYNDLRGVIYDSMH------LDSTHGAAWNKMMDNFF-------- +>ERR1719253_2317543 +---ILSPAGRVLRLRGPGFLpprcrfgrlspnhccsrvspdriavarrPPPRPRSRPTSSPSPRTSTRGc-WAATRSC----------CSSSTrpttspsprt--SLR--------PSPAPSrptPPTSPTC-LPS-WSPAGPWRPSVTA----------TSPSPSTRCSTSWCTTTSwrpsprswatssrrrsrpagprPSSSSPRP--- +>ERR1719253_507459 +---LSQSAIDVVVSVAGRDArrARPRAGPRR----------TDp-WRRRRRA----------ARGG-gpgrragevqtraaegASTLGHGLVR------RGRalgHGLVRHGRGHC-HDS------------------------------------------------- +>tr|A0A016TEH5|A0A016TEH5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=Acey_s0110.g162 PE=3 SV=1 +----------------------DTAGEYHKQLFTLHPEIAKYYDA-EDID-PDSIPKAQKFIMLGQQELQFFFRLPDVVDNerqWRSALSSFKE-TFGDNNVPMSEFNKVTDAFLAAMQKNAGG-VTPEQKKEWEELLAKAYADMK---- +>tr|A0A0B2W4R6|A0A0B2W4R6_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_05310 PE=3 SV=1 +----------------------DTAGEFHKQLFKKHPDMAAFYDA-EDLD-PDSIPKSQKFIMHGMSELQFFFKLPQAFSDerkWRSALSSFKD-QYEDVGVPMKEFNKTTDAFLAAMEKNAGG-VTAEQKKDWEELLAKAYADMK---- +>ERR1711965_451221 +-----------------------------------AGAVR---------P------------RP--------AAVI---GFPFPLFP-LLETADMtsvAVGAHPRLRA-----L-----LRDR-G---AWYLTGPQELASVIGRLERLER +>SRR5882757_2588511 +--SLSSRQQILARRFFDAVEAsdKPLAAMFHERLSEIDDRLDGLLL--EEE---------GCLLREAMVIVRTLSRNVDRLNRMVPIFRAFGRTCA-AQGIASANYEKIAPVLFWIAQECVGSEFSVEMGRALTALYDQLSREMKD--- +>SRR5262245_14724532 +--------EDVVKKAYQRHCYrqPEFYRSFYENFFSRVPKARAMFK---DMA-----------RQHE-----MLDFALGQLLNysqqqSEpTTLTQFVERHS-RLGLTADDFKRFGEALIATFDSELRGdCEHHRTMAALEIVI------------ +>tr|A0A183IYP9|A0A183IYP9_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 
+------------------------------GLFTSSPEIRSLFPTLVDW--GDDIKTCQKFRNQGLKFVHVISLSLTTLHDkehLDTLLKEIGTRHVEfmPGGIKMEYWDIFEKAMVKCILQQIRwtDDFDEAIQskaaIAWRILCAYIVQKI----- +>tr|A0A0C2M2P6|A0A0C2M2P6_THEKT Uncharacterized protein OS=Thelohanellus kitauei OX=669202 GN=RF11_12769 PE=3 SV=1 +--FLTLEERLKLKESWIKIYqkiqdlPdVDITFEIFVRLMERRPEMSKNFE--KDV------YKYSRMKSHSDKMLVILNNMIRNLDDeqkMLKYLSGMVRRHR-NYGIRQGDCKMWEEIFLDIISR------------------------------ +>tr|A0A1I7YD88|A0A1I7YD88_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1 +--LLTLRQRKILQRSWNKSQrtgLDNIGAHIFLKIYAKDSSVGYLFN-LGNCP-HSELKYRKFFQDHAMTFTRSLDFVMNHLDDLErvsKFCVELGKTHVKfmRRGFKTSFWDIFAEALTECAIDWEGGLRCRDVLNGWRTLVSFVIEEMRKGF- +>SRR5262245_33555564 +--------------------------TFYEHLFEGAPELRSLFPI--NM------------AAQERKLLLTISVVVKNLDRdeeLKRLALHLRDVHE-GIRIEEGHIEAFLGSLAHAFQQVHGSPFPRH---DWLTLRRAV--------- +>SRR3954452_7277257 +------------------------------HLFQANPEIRMLFPI--NM------------AAQARKLLLTISVVVKHLDReteLQRVALHMRDVHS-HIRIDEGHIELFLASLAHAFQQVNGGAFPHQ---DWKNLRRAI--------- +>tr|W4XW92|W4XW92_STRPU Uncharacterized protein OS=Strongylocentrotus purpuratus PE=3 SV=1 +---------------------------------STHPEDSLHLHQ--GCCSHLASRESCRFVDQAMQVMQTIGNAIQNFDNKelfNTNMKELGLLHC-PVRDDtlavIHNHEVFKDALYNTLRKSLTESLTPEMTFAWKAF------------- +>KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold7330878_1 # 87 # 278 # 1 # ID=7330878_1;partial=01;start_type=ATG;rbs_motif=GGxGG;rbs_spacer=5-10bp;gc_cont=0.391 +------------------------------------MASQTQFvygDE--DTVMACLTKESCRFLEHAMSVFQSVGGLVTSFADPpsdRKFNLDLGLKDQ-PKDVQDRHYKVFMKCLLKSVRFHLADSYDLAMHFAWKAF------------- +>SRR3982751_838383 +------GINDQLRESAAMLTsgGteatDAVIRDFYIALFRNAPSLIAIFPG--NPAQGDFG-SDHRGAKQRELLLGALAGLADLYdpgdaermTHLDSVLKRFGRSHAAFtrpdgtvSGATLDEYKAVKDALFSTLVRAAGDRWRAEYTVAWSQAFDYAAASMLL--- +>SRR5690606_20444479 +---------DIVKQSFERSkQRKTLATIFYQNLFFLKPKIKNYIKQ-TDF------------AHQEKAIMDEMEFLMAFLDDkdrhARQQILRIAGTHSAkNLNIHPHDYYYWLEALIMTAKEC-DHLWRDDFQYYWRECLSFPLTFIISQYY +>tr|M6F3R8|M6F3R8_9LEPT 
Uncharacterized protein OS=Leptospira kirschneri serovar Bulgarica str. Nikolaevo OX=1240687 GN=LEP1GSC008_4081 PE=4 SV=1 +KMNISENQIRSLNESFDIVNLDriKFAELFFIYLKENHPKYENIFSRI-QL-------------EDVKHFMNSARNISLSsVQYsqLERAIQNFGVECL-KICNQAEEIPILEKAWLFALEKWLGPWYSHEVEKSWQEVFKMIHTSS----- +>tr|V6I1Y8|V6I1Y8_9LEPT Uncharacterized protein OS=Leptospira alexanderi serovar Manhao 3 str. L 60 OX=1049759 GN=LEP1GSC062_2771 PE=4 SV=1 +GMNISENQIRNLNESFDIINLDriKFAEIFFVYLKEKNPKFENIFSKI-QL-------------EEAKSFMNSARNIALSgAQNvqLEKAIQDFKMECI-KICNRTEEIPLLEKAWLFALEEWLGPWYSHRVEESWQKIFQMLYSEE----- +>ERR1719272_197188 +--SLSATQRASILASWRQLCGEDGGATfcasLLGGAFEAVPETRALAGV-PEAAPEPeAvpeaeaavaapapapakgkagatavpeaaaaveeaaeeaveSAESVALRAAAAHAAVAMEIMAQQLSapeALKESLTELGVKAA-SRGLGcGAPFDRLGEALQTTLQASLGDeAFPEALAEAWRQLYAQASQEIQLQY- +>SRR5262249_23394332 +-------------------------ELFFSRLFAIEPGLRHCFDG--C------------FLGRRRAFEWMIGAAVRGRPDLRSFIQALEFMVAPSDATVHQECERLRDAFISSLSGSLGPRFTVEMMNGWLAVFELLH-------- +>SRR5438034_714626 +--SMTEASIIAFNESFERCMaSGRFFDVFYDHFLRSSPEIAAKFQG-TYF------------NRQKRMLNQRPATTVGQpr-------------RSAReSRKTPAAQFVStcqampsaFVSELTKSGSTX----------------------------- +>SRR5258708_7736634 +------------------------------RFTGTSDAIREKFKN-SDF------------AVQHQAMADSLYLMAVSvqggPEN-LARHDMKRLYPKHqRMEITASMYDVWLDCFVATARIH-DPECTPAIESAWRECLTPGIAAMKSGA- +>SRR5690242_5369812 +--LVTEDDLALFLDSFDSCVaNKEFVARFYEIFLSTSPEIRALFAK-TDF------------HHQRRALKASLHVVAACaarrRAD-YSALDELADR--HrELRIEPRHYAVWQESLLAAVSEC-AERWDPDVERVWREGLSEAIAHMAS--- +>SRR5512134_285705 +--ALTPTHATLVRESWARLAPGrAAAVhRFRARLEAVSPRTAARFTCL-DH------------EAQRDGLMIELDQAIAAtgsDDDLVPALARIARRFR-ESGPASSEYPMVRDALLEVLAEADRGIAPPELRRAWGSLFGLLAALV----- +>ERR1719232_1195758 +-------ETVIIKDTWETIHkqVKAIGMEAFEKLFALNSDMSAYLPQTDDLDQDETRRLSDKVKSHAKLTMETLEQVIAAIPDMTEvynVITKMKKLHP-----QTGLLEVIGPVFCNTTRHFLliQGRWSLDVQRAWLALFGEVSAMIRASY- +>ERR1719189_1497217 
+-------GRQADEQ----VGreEAGPGHRGHRP----AQDDPAHLRgarDCGQRVRGRARRHGDRGV-QGRGQGEQS-QH--------------HRHQG-----S------HGQ----------lHGRHX----------------------- +>ERR550519_213 +-------NIVLLRDTWSVIHrqVNTLGMETFQKLFEINSEVSHYVSpscpDLDPd----CIDSTTQAIKAHATHTITILHNTVSNLCNLgd--lagE------------------MNRLGKLHCDLGIDHGil---------------------------- +>ETNmetMinimDraft_22_1059887.scaffolds.fasta_scaffold1682169_1 # 3 # 206 # -1 # ID=1682169_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.363 +---------GTVFSQWRRMKIEDFGECMY-RSLVQDASLEKLFRR-------------ERMRTQSLLFAAFIQVALCWLEErdfrkVERDMISLGLRHR-SYGIQPSYVCVFQIALLQTLCQNLNG-LSLQAEISWSVVWSHF--------- +>SRR6266567_3650358 +----------------------------------------------------------------------------------RAPSKAWGsgtspmascqstipssersfwkpsatywesaglqrtmmpgrkptkgsarscwkgpthrsqpeqssrqchrydlwererqdkikkgeatldtkqaaqkgfeQQHA-VVIGGSMAGLLAARVLSTHFGQVSVieRDHLPDGA------------------- +>SRR5579885_1989414 +------------------------------------------------------------------------------------------xmsnqqssrsgfgGQHA-VVIGASMAGLLASRVLSEHFEQVTVieRDQLPQEV------------------- +>SRR5579864_4130097 +------LQIELLETSFQAIApcGEAFVTAFYERLFMRFPQTRAFFAS-AE------------RNIKHVLAKPTIVTTLQPTRSascRTTRIT------F-PSSVGTAGVPISRS------TGYAGs--------------------------- +>ERR1719414_1806988 +----TVAQAEKVVAQWDAADQDAFIVAMYQAMMKTHPEWRALFNK-PTGA---PTPAEAEWKKQFDLTKAVLDRGLRsratDVDALKERMHAMAGRHV-NYGVTQTHFQALKPILTDVLAATVTG----ADMDAWSAVTYFMLDSI----- +>tr|A0A090RS91|A0A090RS91_9VIBR Uncharacterized protein OS=Vibrio sp. 
C7 OX=1001886 GN=JCM19233_1279 PE=4 SV=1 +-----------------------FLTFFLQHFCSTNPRFAERFCGV-DS------------EQQTKMLKASIILVQnaAENPYIRNNVKSLAKRHKEmNLNIKPEELVAWRESLLATVANFD-PLFDDDIDQACAQRWN----------- +>tr|A0A139A347|A0A139A347_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 OX=1344416 GN=M427DRAFT_73171 PE=4 SV=1 +--MLSAEQARLLKKNWKDIGASsvanpmmFVVAQFYRRLLRK-KGYKRIFEGI-DI------------ETQYFKMQGALTACVEfaeNLDKFADTIRRIGARHA-RYNMTPNMMNDVVDSLVPSLKEFsldHGITWNEEIEEAYDEWLEQVTGYF----- +>SRR5262249_57009646 +-------------------------------------------------------FRKTDFPRQTRVAADTLFlmaVAAGARDHavAWRGRDRLPGTPPPpGLHSSPRHHPAQLVCPL----------------------------------- +>tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 GN=TSPGSL018_8354 PE=3 SV=1 +-----------------------VGAGFLKLYAQRNPWAVEQFS-FG-LR-----------PQHAEKMGLALELIVNSATRpqvLQHQLRVLALGHV-QMGIKPEMFKSFEEALFAFLGQVLGAhnTFDEETEGAWRWMWGIVNAVFTQ--- +>tr|A0A090LKP0|A0A090LKP0_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_2000335800 PE=4 SV= +-EELPKADKDIIISTYNILL--QADPELFSKAWimsaSRSTSIRKAFS----LIDP----NSTHIEVDFTKFSAVIERFFTriiceeKLVNesFEKSCINLGKKHVDfvPIGFHSNYWDIFMNCMIDVIAETVIIAFNEdnkqqqQVQKCWNKFVGRIVFLMQSGF- +>tr|A0A0M3JT43|A0A0M3JT43_ANISI Uncharacterized protein OS=Anisakis simplex OX=6269 PE=3 SV=1 +-RSFTTPQLTSVFNAHFSMI--QLNPDVIKDCWiktsKRSSSIKKAFG----MLEH----EEPETNASFMNLPITIQAFFKelifelDCDSvkIRQRCEQLGARHVDfsERGFHSNFWDIFQVCTIEVIAEC--NLGLNedqhrSYELAWIHLLSSVVKSMRNGY- +>tr|A0A0A9Z6R2|A0A0A9Z6R2_LYGHE Neuroglobin OS=Lygus hesperus OX=30085 GN=NGB PE=4 SV=1 +--SLEEDEIERIKKSWVLVKEndfrfiDILRQEMLCDI----MMYELYFNPG-R-KADVCVSELTEFKNHPKNVYSTLDFIVGDLENenvIIEKMIEIGKNHG-RLGISRKHISFMTSTIYQAVECTIGPcMFDRLVDQSWEKFLTSFND------- +>SRR3990167_8699843 +-------------------------RLFYAHLFAKAAHLKPLFG---DSE-----------DTQNFKVIKMFELIIDNVEDLtqvQPICLDMAKRHS-FYGVKNDFYQYIDEAFVWCIQQQLSLSIQDPIIHAWYAATKYISSIMID--- +>SRR5690606_19766530 
+----VSDQYTDLQQSFGRCLrDKNFIERFYEVFMASNAEVAAMFAR-TDF------------QKQRLALRRGISVAIFHAAGssVvKRSMQQMADVHSRSgrCPVAPHLYPYWIDSLLTVIAETDA-EADEALLARWREAMGVTIGTFIGAYN +>tr|A0A023F5X6|A0A023F5X6_TRIIF Putative globin (Fragment) OS=Triatoma infestans OX=30076 PE=2 SV=1 +--ALTADEKEILKESWKNRgiNKSTLAMMWFTKLFKANAEEIVEQNR-GQV--VEELFMDEANFDYVDKLADIFNIVVKNIHKstLcTKLIWEIGMYHC-CLDLRDGYFELMKETLLDTLKENMQPPLTSEQIEAWKKFIGVMFDIVHE--- +>tr|A0A0N4YMT1|A0A0N4YMT1_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis PE=4 SV=2 +---LSLEVHDLARAHWIQLHkLNRQSnliQNALLYIVENYKHTRPIWQ-FGlGIDEstkdwKTLLFNNFYFRHHSASIQAAITMVMENMDDrdcMKKLLNEIGAHHF-FYDACEPHLELFEQGMIHSLRTTLVGhvKIDESTEQSWTLFLKDLKTFMGEG-- +>ERR1719326_703414 +--------------------------------------------E-HPM-------------IPITMTEES----VKLVQDsl-SRVDSLVQV-----RDALQDvFFPHLF--------------------------------------- +>ERR1719487_2229452 +-----------------------AALSL--------P-------T-EQE-------------SPVTMTAEA----VQMVQDsl-RRVDSAVQV-----RDAMEDvFFPHLF--------------------------------------- +>ERR1712176_999243 +-------------------------------------------------------------------------SY-AHRDTfdqladAPRTI--FYTQK---------QGHPECSEMVEKMKNIVGDE------------------------- +>tr|W8BTT7|W8BTT7_CERCA Uncharacterized protein OS=Ceratitis capitata OX=7213 PE=2 SV=1 +-LGLTITERRSLQNGWSIIKqkQRRAALTIYVNLFTEHENLYEVFRSDGV-------LNIEFASQHQKEVLTVFQMIIEQVDNarfVKTMLKELALRHE-AASVTNTQWQLYTNEVRKYFLETLADAISPTFVHALDKLMNFVCN------- +>tr|A0A1A9YF90|A0A1A9YF90_GLOFF Uncharacterized protein OS=Glossina fuscipes fuscipes OX=201502 PE=4 SV=1 +-MGFTPLEIVALQNIWRLFKkrFKYHSMQIFLAFFNQNHKLIERFRLpSGK-------FQLNYLCQHSEKMLLLYENVIDkCLDNmanFHGIMADVTVSHR-HSGVTYEDVSLKSEHVRRYILDYFANQSSPTLVSALAKLSEHFND------- +>ERR1719370_117345 +---------------------------------------------------------NATRMFPAKAALQESVEVmVDVLERrgmWGSGIRDAGISHH-KLGIKRRDMEKLATSILAAISDLLGDcDLDRKllQLNAWKKLLNAIADEFSA--- +>ERR1719234_1549997 
+-----------------------------------------------------SLWhrssiQLEGASNHNKALMNAIDSVmVEVLERrpmSKSGIRDAGISHH-KFGIKRLDMDKLTTAILAAISDVLGDcDLDRKmlQLNAWKKFLNAIGDEFSV--- +>ERR1711972_141202 +--SISETEKTYCIKEWVKIcsDRSKTGTLLLSHVYQENPQLLTH-PAWKDLS-QDQLKENQHFKNLAEKTMGSVEQILTHIDNVDkvaSMFEQQGKDYK-SAGKSMSH---IMACLETFLPLDHPSlEVTEEYRGITQEILGIIKQSLMKGYR +>tr|A0A0N5DD39|A0A0N5DD39_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 +--NLTPHQKQLLVQSWPQVQlynRIHGGDAMFARFCEKNSIARETFQKIAVVQSfASNEASESVLKKHEQYLVQLLSEAVENLNNdCEPLLReclDYGAQHVT-LHelLNETVWEQLAEAIIDRIHKVNLVRRHKDLSKAWTMLIILLIDKIREGY- +>JI8StandDraft_2_1071088.scaffolds.fasta_scaffold105816_3 # 981 # 1154 # 1 # ID=105816_3;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.718 +-----------------------------RNLFKIHPELKHALNI--EIK-KSGIQH-----VPLASIVFSYAANIDNADKFLVIIRHIVDKYS-SLGITVNDCPIIGSLLLDAIKESLGYAATTHLLAAWAEAFGLFTNALVQ--- +>ERR1719199_1194134 +-------HAGYIEKSRESVlnlDAAQLGADIHVKFLNVYPAAASLFQK--TLR----------M-LITTKIMGTLMAVISDPTGTLEDVRAVGVRHT-KYGISERYLLPFGAMLWEIVGTMLPGMWSDEHSAAWAFYLDFIASTMTRA-- +>SRR5882724_2518483 +-----EEVRRKARKSYRELQDSAFYCNFYAELFRAAPDVRQLFRNI-NM------------DEQYEKLHAAVGKLLNfrPTDDPNP-MSRHAESHE-RLGLQPKHFEGFRDAFLTALSSRK--TADNYAMDAWRAIFDAGIAYMTTK-- +>tr|A0A2P8AX05|A0A2P8AX05_9ACTN Terephthalate 1,2-dioxygenase, reductase component 1 OS=Micromonospora sp. 
MH33 OX=1945509 GN=tphA1I PE=4 SV=1 +----------PDPQRLLAALgaPDQAADHFWSYMEDRSVRV---LP-----------------QQFAPMFFSTLAEMVARRGDpaaRRAELALMGRMYL-RFGLYPYHHTVVAAAMVDTVRRFAGASWEPDLAGYWEvgcrRSLRLAE-------- +>tr|E5XPI8|E5XPI8_9ACTN Uncharacterized protein OS=Segniliparus rugosus ATCC BAA-974 OX=679197 GN=HMPREF9336_01410 PE=4 SV=1 +----------TFVRSFHlELFgaAPELAARFPPGLGEHRGGF---VR-----------------M------AEHILETFAEGADpprLIDLLGQLGRDHR-KHRLDERDYRLAQAAFAKALVATARG---SGDGAFAAraaaLVCQVME-------- +>tr|A0A246RU09|A0A246RU09_9ACTN Uncharacterized protein OS=Micromonospora wenchangensis OX=1185415 GN=B5D80_01060 PE=4 SV=1 +-------------------------MREADELRSALPDR---LA-----------------AHDAELLIATLRRLATD-PEpaaQAVTLTVLGHAFR-RFALLPHAKLISALAGAD-------------------VPVELLR-------- +>tr|A0A085M5J8|A0A085M5J8_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_06691 PE=3 SV=1 +-TCLTKRQRRCILKSWRKVqNKAQLGEEIYIQIFMQKPVLKSLFP-FRAT-PVNELHDNVLFTRQAVIFIDFIDNVVAYVGinNgrlLQELCTRVGISHALMtrVNFDPEWWYLFANSVLDGMQKFCLPNFSCEpiatyigsqSMLAWRILLKHVVEMMSDAF- +>tr|A0A2C9LD65|A0A2C9LD65_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106067556 PE=3 SV=1 +--QLSHKDKLFILNSWLNFrNgkrEEDIGMEAALEMYSIYPEIKDIFTIYRDARM-KHLTDKEMIRTHSQQVASVVDKCVMRMDDAHAfamIAVDEGSVHI---KIQERFMRCYVDCYIREIKKYSKLKWSRANQMAWEVFFDTIVVNMKNGW- +>ERR1712086_1089461 +-------MG---KEHGDGDSsadaNTAAGLDVMQGKKPEQKESKRWFSlgssaakgkqerS-----------KEEKEEKIADKALEMSAEMYKDPTRIQGETMGLGLRHI-MYNVDPAFFDALVTAYVEEMAVRTT--------------------------- +>tr|B3LWC8|B3LWC8_DROAN Uncharacterized protein OS=Drosophila ananassae GN=Dana\GF16358 PE=3 SV=2 +--GFTCVEKAALRNAWRLIEPfqRRFGKDNFYNFLTTHQDLIHNFRL--DPRSSDSPINLSKLHGHALAMMKLLARLVQTLDiNLqfRLALDENLPAHL-RRGIDPSYMKMLATALKRYILESsvIQNHNSSTLTSALTQLVSII--------- +>tr|B5DW13|B5DW13_DROPS Uncharacterized protein OS=Drosophila pseudoobscura pseudoobscura GN=Dpse\GA26483 PE=3 SV=1 
+--GFTLCEKVALRQAWNLIRPreRRFGQDVFYTFLNEWYWSISKFKK-------GEDINIALLHAHALTFIRFVGALINESDPImfQVMINENNQTHS-RCRVGADYIAMLGQALTDYILKVLDKVRSPSLEQGLQRIVEKF--------- +>ERR1719162_2542559 +--------------------RSDIGMCVWNRVFVEDPKAENFFKQ-SN----------Q---RLIYIVTMAIKYSVEFYGDpekTKMAIEALALKHI-MYQVQPRMFMLFVTCYDEEIKARTDD---KLVQSGMHWSISIIASIMA---- +>tr|A0A0V1BAT0|A0A0V1BAT0_TRISP Globin-like host-protective antigen OS=Trichinella spiralis OX=6334 GN=T01_2203 PE=3 SV=1 +---------------------MENGGQLLANVFKANPELRKFYDV-EDID-PDDTKKSRLIQQAGGNLLNSVTFMVNNYDNErsfKQEIKEQICDLR-EKGMKLEDARKLKTGFVNYVKSKLSQPMTAKEEKEWDMFFQRFFDALKQ--- +>SRR6476620_89806 +--------------------RHATRQQRRPDVF----------HER-QRTAGE------D--lnVLRERDVGQVH--ESLARAgvavIDGVVPRIGCEVV-DLSSEMQNG--------FPQGVIL-SAAVGVGDDDG---------------- +>tr|A0A2W4R8Q8|A0A2W4R8Q8_9CHLR Uncharacterized protein OS=Chloroflexi bacterium OX=2026724 GN=DIU68_09390 PE=4 SV=1 +--RLSRQQKRIIQRTFSAVAvrHDLVARLTIERLRElsRTPAS-TC---FGNTP------------EDRRRLMHLLALLVQRMDDRGA-LHDACVAQTRQMGCDPFeggSTSLLAEAFIGALQSALAGRFEAKTEAAWREFFQMVERVLR---- +>tr|A0A0L0FDI4|A0A0L0FDI4_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12917 PE=4 SV=1 +---KTDSEVELIRSSWRALLaGDGtaaqmpllrFVEQYYKRLFRLFPDSRGVFKT-RDTQ--------------SKSLSLLLSIIINVADEpeLemNAKKKKLEMMYK-EYGMNSLLAVIAGRVLIQSLQAFLEAsnKFQASVKDAWVKCYTSIADQL----- +>ERR1719203_545915 +---------LILKDTWAVIveQIHELGLPTFVKLFRLSANLRYYYPKHnRPES--TEV--QENINTHFDQLVAVVDDVVRCLPDLsthIQYLRNLGPVHC-DVEVQPRLLELMGPVFAILSDLYCWskadgvirLKWPGYYYFDILLDScemVTIQLLLDLX-- +>ERR1719232_1194111 +---------IMLKDTWSGIieQMHELGLTAVVRLFKINYNLRFYNSPNvRYHP-TTHTNvkvlrgttaapatpaavasgstaaataagpsakdqatgksNLEDLSIVFNLLVSIIDHMISSLPNGsspTSHAGRNGksngtkakftlsaATMK-QLQILRQPTDWVGPVFCNTVRPLLLvqGKWSYQVEIAWRLLFRHLVRKNRTFD- +>tr|V6U182|V6U182_GIAIN Flavohemoprotein (Fragment) OS=Giardia intestinalis OX=5741 GN=GSB_151570 PE=3 SV=1 
+-MPLSEDTIKAVEATADLVAaqGLDFTRAFYERMLTRNEELKDVFNLshQRDLRQPKALLDSL--VAYARS-IRKINELhelqeqglpvpAERLAELqgfFAVAERIAHKHA-SVGIQPAQYQIVGAHLLATIEERVTA--DKAILAAWSKAYDFLAHLFV---- +>tr|A0A1R1LGI5|A0A1R1LGI5_9GAMM Uncharacterized protein OS=Motiliproteus sp. MSK22-1 OX=1897630 GN=BGP75_23395 PE=4 SV=1 +--------LDKIYSTLQLLDdekSEKLINETYSIFFNAHPEAVLLWSK--DDPE-----------SRSKMFNGVILTIIDNLTRpdiFKNNLLSDVKDHD-EYGVDKEMYGGFFLSLTEALKKTLGSEFNQEMELAWKHQLAHIRE------- +>ERR1740121_1123239 +--------------------------------------------------------------------------------------------------vWIVVGSA----------SVrHR--LrAFGSASGSSSgRRLSGidY--------- +>ERR1740121_2035324 +------------------FTplt-----Cqwa-----TPHDGPAQHVL-------------------------CEDGHFahFATDKCesAgHgA-RVQCPSDMPEMcaDttcgggqehccrpaggCTGgERPCPT--------TASASgSA--SgsaSGSASSRRLAgIDYE----------- +>ERR1719271_1314470 +----------------------------------------------------------------------------------------------ghRqdeqhglQVPwCHQIPAVRGDC--PGLALQpCR--V---------HrREWC----------- +>ERR1719240_2235476 +------------YE---DEE-------------------------------------------------------------------------GAqvdvmkgEDALVATADLLYQKMSEDAN---MQT-lLGNIELAELAsKLQKALa--------- +>ERR1740122_169377 +-----K------GE--ADKSgnAEAAGGgqGDTPETGAAQDTAAGV-------------------------TDEHS--------KaLGIEISS--FDELkvDqkciaaaIDAwKLFISTAESREAAGEAV---YNA-lFEGAPS--LQALFVTPRAE------ +>ERR1719243_286169 +------------------------------------SHPVNV-------------------------LVSDTMwkGY----t-vRgIRRVNYY--VKYMmlTrdgnvsqALGwFKDAADCKIISH-PVNVLVsDT--MwKGIVRKQFLGgRLWFII--------- +>ERR1719158_147189 +------------RV--CYLYplvhcNILAVLrelnfdGAAESLCLDAPALLPT-------------------------MLDGLIwrSR----vTeNgQRRVNYY--IKYFivDaeggfskTTEvMTDNGDPTIVCR-PVVSLVtDM--IwGRVAFRTFLYgKAWFLF--------- +>ERR1740121_2502219 +---------------------KSFALEVFKRLFAMVPHSESFFKQ-----------SNTRLIFIVSRALDMCMNIYKEPTRLVNEITALGIRHI-MWNIPTTYFDPFVQCMLDEAIVRYGAS--QQAIEGLEWSMRIIASIMV---- +>SRR5262245_17232684 
+---VEEETRALARYSYLQWlDDDEFFSAFYESFFAGATGAKGKFR---NV------------EQQRLKLRDAMTAVLNFYpGNEPTSLHRLIAVHA-ARDVTGTEIEQFERSFLEVLHQRLVERKIaeqlgpdvvAKIEQGWRELLHPVVQYVMGV-- +>ERR1712137_24889 +---LPRESITVIRDTWAMVErNVDIAPKMLLKMFQLYPMTQNLIPLLRGVS-LEDMPTNKRFLQLAYGSQFAMSAIVDKLHRpdmLEEIIG--GGMHAFVDGLSTS-FQMAaTTAlFNKIMTEELGSAYTAEAQEAFIATGDMMTSIMV---- +>SRR5262245_32700325 +--WLNSNQRDLIRRNWDSssK-RYELCRRIYCRVFARRPEIRRIFSIGYDW----------WRLEI-VTFADFVQSIVDNLDDAkrvRQSAFEFGRDHAKwrRFGFRSDFWVQLAESTTREcvyLDAAVH--PPDESLETWTKFVSIVF-------- +>SRR5271165_4656598 +------------------------------XMFYKKPDLKPTFIeIGHhidpendggLT----------WEV-EAQRFTNLLTDLIGNLNNLdrfEELSFDWGRNCVQwrEFGFKPEFWLHFSEAMTTEclyMDQAVH--SVGEVIEAW---------------- +>SRR2546423_8132340 +----------------------DVADEMFtARLLELEPQWQRVLS---DEP-----------TEWGRRLLRAIRQAVASFTClggFAEALRELGGVPA--AHVGYRDYERQGAAFVGRLEHSLDKPMAGAMRESWQRVFRLLAE------- +>tr|A0A2A3E2S2|A0A2A3E2S2_APICC Globin OS=Apis cerana cerana OX=94128 GN=APICC_08732 PE=3 SV=1 +-------------------------------------------------------------EAHCQNTASGCIDALDDVDLMEAILHTIGERHG-RRGQDRQQFIDMKGVIIEVMKDTLKSKFTIEIEAAWDRYP------------ +>tr|A0A1W0WMU5|A0A1W0WMU5_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_09357 PE=4 SV=1 +--ALTHVQINLVRESWRWLNFnrplQETAVRFFlDFYFKQNPDCLPMFG-MKTVD-----HYNKAFSIHALTVMHAIKYAVEYIGNpeqFQRLFRTVGQTHL-RFGLTDLHVERFLEQWLAFLRANDAKVFDAATVEAWNLAGRIVVSQI----- +>ERR1711911_15016 +--------VDLVRKILDKAKqNGNVAPKVFFKYFKAKPASMKAFPAISGLA-LSDLPRNGAFLSNVYTCFAGLKAYTLETDV-STRCPVFAKA---SGKYKSEDIDLFTSILKGVVAEELGADYDDVAKEAFEQFLDAVALTVT---- +>SRR5690554_6373173 +-----------------------LYLSCYDIFMGQSADIGAQLFN-TRMS------------AQHGLLRGGIMWLIMHARGMsDSNIRALGKSHSRdQLYFHPSHYALWLDALMETLYKHVP-EFNLQLELAWRRTLEPSIDKIISMY- +>ERR1711879_742838 +-----------------------FFEDFYSIFMTKSPDVLNMFAN-TDME------------AQRALLRSGILWLGMHARGMpDTKIRALGESHSKkKDEHQPHVLFHVAGRSDGNAFPPRP-G----LHSRTGANLAPYPTAHVT--- +>ERR1719461_1661620 
+---------------------IEVGCYTFTQLFSQYPM-MDYLAKFDGLEV-EGVCIGEALRAHADAIGSVVAEIqenAGNPERIRMSLAQAGHRRF-LEGVERAQLDMLGPNMAETViIKDTWevISKQVKSigMESFEKLFSLNSDMSaYLPQ- +>ERR550519_213 +---------------------IQVGCDTFTQLFQKYPQVNNYIAEFDDMEV-GGIKVGPALRAHASAVRSVVTEIqenAGNPERIRSSLAAAGHQQL-MAGVERKQLDVLGPVLCHVIRPLVWekGIWSVEVEKSWTHLFDIVACLMKLGY- +>tr|A0A173LPQ6|A0A173LPQ6_9ACTN Phenol hydroxylase P5 protein OS=Dietzia timorensis GN=BJL86_2914 PE=4 SV=1 +---------------------PDFRRALEDALNTEAPYLRADLPR--NLD---------GPFA---TFVKLYRFLLTrvedsggdraKVDDVLDLCRELGHDLA-KYNVVEEQYERFGHALNAALARVAGEEWTGELSKVQNQFYVIIARALHK--- +>tr|A0A0M3HYR2|A0A0M3HYR2_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=4 SV=1 +-PSLTPSQVQTIRKSWKHINtkgLYTVIRRCFQQLECMCPSVSNAFNSA-NNQLSANISTVRTLVEHTKFMLILIDRIVENDQDSIIELRRIGASHVVlkeSFGFGENELEKFGEMLAEAFLKLDGIRQSKETSRAWRLVIASMIDQLRAGF- +>tr|A0A1I8CNT8|A0A1I8CNT8_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 PE=4 SV=1 +-IGLSNYQQKLILQCWPNIYttgnSSTFATNIYPNLCTRNQKAKALLQK-AD---GVAVFSQSeidCTSMHSKLTLEIIDSVVRNFDSnpisLIGYLNEIGHAHRSlkSIGMPSSMWDDLGDSILEGVRRNDLVRKHKELRRAWLAIIAFLTDNLKQGQ- +>tr|A0A0N5AJ93|A0A0N5AJ93_9BILA Uncharacterized protein OS=Syphacia muris OX=451379 PE=4 SV=1 +--QLTVAQSVLVRKTWAHARnqgSMEPAMSIFRNSFFKSPDIRALMMA-GS-----KNTGYERLKRHAILFTNVMDKLIAGRvEEidsVIEELKNAGKEHACitreQYACpfRTSLLDQFAAAMIErTLEWGEKKDRTEVTQTAWTKIVLFIMEQMKAGFH +>tr|A0A0H5S8S8|A0A0H5S8S8_BRUMA BMA-GLB-3 OS=Brugia malayi OX=6279 GN=Bma-glb-3 PE=4 SV=1 +--QLSSYQIHLLQQSWQRLRcSPNFFINVFRTVISKNTIAKELFRKT-SIIDGFTSYKCYDVKEHADSLIELIDFALREIHSsikvVQDRCMLMGAAHCNTCeNSMSSSWDQFGDSLAESIAKAEAIRGKRKCLKAWNALLSFIVDRIKGGY- +>tr|A0A0N4XUJ2|A0A0N4XUJ2_NIPBR Globin-like protein 9 (inferred by orthology to a C. 
elegans protein) OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1 +-ASLSFSQKQALTTSWRLLRpqAAGFFRKILLELEIVSNTVKQIFYKAQFVDAfNKDEENIATMDAHIKLMVKFFDDILASLDDeteCVERMKRIGSCHAVlvrSCGFSSDIWERLGEISMERICAHEIVQKTREASRAWRVLLACIIDELRCGF- +>tr|A0A2A2LCK8|A0A2A2LCK8_9BILA Uncharacterized protein OS=Diploscapter pachys OX=2018661 GN=WR25_21707 PE=3 SV=1 +-STLSFSQKQALSLSWRALRpqAAALFRKVFLELEIASVKVKQIFYKASLVDAfNRDEENSATMEVHIKLLIKFFDDLIPLLDDekeAVDLIRRIGSTHAIlakSCSFTSDIWERLGEITMERVCTHETLQKTREASRAWRTLLACVIDELRSGF- +>tr|A0A261C2G6|A0A261C2G6_9PELO Uncharacterized protein (Fragment) OS=Caenorhabditis latens OX=1503980 GN=FL83_09405 PE=3 SV=1 +-ASLTFSQKQALNLSWRLLKpqASACFRKIFLELEIASPKVKQIFYKAALVDAfNKDEDNSATMEVHIKLTTKFFDELLSTLDDeneFVAKIRGIGSAHAIlakGSNFSSDIWERLGEIAMERVCSHEVVTKTREASRAWRTLIAILIDELRGGF- +>tr|A0A1Y0I5V1|A0A1Y0I5V1_9GAMM Uncharacterized protein OS=Oleiphilus messinensis GN=OLMES_1782 PE=4 SV=1 +-----TQDQRLFWNSFDRCLsspqrDQQFAEDFYQRLYSSDRAIAEIFDR-VSV------------SDQLHAVRQAVYLLQEMTplKQAEITLDKIQAIHH-QheIRLSNAMLDKWLECLLASVELAD-PEFNETVKQAWIDILTPAVHIL----- +>tr|A0A1I7TWD1|A0A1I7TWD1_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=4 SV=1 +--RLSKIQKRAIRFTWHRLQtrnggkrVENVFEEVFDKLVKNLPNIRDMFST--RMF-LCAMsrGTTSTLRDHSKSCVKMIEAVIKNFDTeKskrtdtgtENDPRVIGRAHSIlkPYGLAGNYWEKFGEVMIDVVLAQEAVRDLPGAGQAWVIFTACLVDQMRAGFD +>SRR5439155_18881238 +----------------------PVLQGFQQAVSGFFTEVGRQFPK-NR------------FRQTPRKTQTSFLLVMGNIApgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSTQVEAAWRYTMGAGILFLKA--- +>SRR5256885_16048310 +-----------------------FFFNDTATTEIYT-LSLHDALP-IY------------FRKQRRMLQTSFYMLVEYIAlgwpECEAYLERIAAAHG-KHGrdIPPHLYDLWLECLLRAVKEC-DDRCSRSEERRVGKECRSR----WS--- +>tr|A0A1I7ZQR2|A0A1I7ZQR2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=3 SV=1 +-IPLTAAQIHLVRTLWRQIFlskgPTVIGSTIFHKFFFKCPKVKEQFRR---CPLPRNFPNHDSFaKAHCKAMSELVDQVIENLENldtMTADLERVGRLHAEVmnGELSTKIWNDIAETFIDCTLEWgDRRCRTETVRKAWALIIAFMIEKIKLG-- +>SRR2546427_190033 
+--NMTYAELAHFDDSLTRCTrEPRFLERFCALFFASSDEVLQKFSQ-TDV------------QKQRRVLQASLYIQLSASPIvtnGSLIFCNPSVTWSIiQVQRSPAMRTLRthSSCPLVGYPLKA-GQCGVGHVPX----------------- +>SRR5213596_3505323 +-----------------------FLCVIFGLLRRGPSQVHTD----RLA------------EATEDVTGVVPQILMLEADGkpeGAVHLAPLAALHSQqHLDIPPHLYDLWLDCLIQAVRESD-PQCTPETESVWRRMMANGLAFMKVRYH +>SRR3569833_2178475 +--------------------HPNNHNTNKKTNKTTTHKKTQKNKN-TK------------NTQQKKKLQMSLNLLISHAMGigiVDGYLHQHAEKHSRhHLNVEPHHYTARLNSHMKAVKQHD-PKYSPALEQAWRTGLGHGIELIKS--- +>ERR1719347_979638 +-PIVTDEEMASINELWSCLRadAMHSSRFIFARFFEAHPEFLEPMPFVKDYYGniSPKYMDTQEMQDYCLKFMSTLDAVMTRVFArdkeALQVMRDIGYSHH-EFGLTSDMTVKFMNKMHDSVLELWGTEASRRDSKALDNIFKTIATEINVG-- +>SRR5437762_8994925 +------PAAS--------------SDHHIPSQLAAGTRAKDRKGG-VEY------------PGHVCRGQRRCARDRPHILAspelCIPRACRTKSA------------AFCAVCENRCCETC-RSPPAKKPETARRSAERTG--------- +>SRR5690625_2752079 +------SDYSDVQASYGRCVrNRDFIPGFYQRLLSKDKRIAAIFKR-TNW------------SVQNRALRRGISIALTWAGGskiVDRQLEEMADAHS-RKGrvpVDPVLYVFLREALKIGRASCR-ERVGVTVGDGcvpqdESGAATGG--------- +>tr|A0A085LV25|A0A085LV25_9BILA Uncharacterized protein (Fragment) OS=Trichuris suis GN=M513_10305 PE=3 SV=1 +--EFTAKEFAIAELTWAKLKvrfNNQVGMEIFRQIFASCPKVKNLFGV-QNRE-DQKALCDQRMARHTAIFQDIIELLIVDLSQrsdsLTQSLITLGAQHWFftQRGFRPEFWVIFGNTLVNLIRSLPLSlSQRYLARRTWIKLIVYLLDCVMFGY- +>tr|A0A0N5DS84|A0A0N5DS84_TRIMR Uncharacterized protein OS=Trichuris muris PE=3 SV=1 +--EFTPKEFAIAELTWAKLKlrfNNQVGLEIFRQIFASCSQVKGLFGL-QNKE-DHTALGDQRMARHTAIFQDIIELLIVDLSKrsdsLTQSLITLGAQHWFfnQRGFRPEYWVIFGNVLVNLIRSLPLSlSQRYLARRTWVKLIVYLLDCVLFGY- +>tr|A0A183BUR6|A0A183BUR6_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 +-TGLSAHQIQILQKIWERSPeseISDCARNIMSHLLRSNAQMYQFFDLLGH--SDREIANSPIFARQSANFAVLLDFVLANLLEevqkVCLALQHLGAQHARlRWPIETHHWALFCRCFEDNPPKEV--FLNAEGHDLWKTMINFIIVQMRVGYD +>tr|B1KNW6|B1KNW6_SHEWM Uncharacterized protein OS=Shewanella woodyi (strain ATCC 51908 / MS32) OX=392500 GN=Swoo_3305 PE=4 SV=1 
+-----------FNDSYDFVLrnEELFFSTFYEIFVSSSPQVKAAFKH-TNM------------AKQNEMVRESFGFIICFFVtKiADEQLVKLAIDHKDKFHVDSELYAVFVNSVLAALEKIYP-KYNNECAVAWRITMAPGIEFMKH--- +>tr|A0A176H0Y0|A0A176H0Y0_9GAMM Uncharacterized protein OS=Oleiphilus sp. HI0069 OX=1822245 GN=A3741_11335 PE=4 SV=1 +-----------FDDSYDFILsnDSNFFDSFYTHFFNSSNLIKNAFAY-IDM------------DKQKQMLRESIKHLVKFYCtNkESEYLKTIARHHADKVRADEYMYKLFVDSFIQAIEDTYP-NFCEEAALVWRCALKPGIDFMNS--- +>tr|A0A090LM85|A0A090LM85_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X000017100 PE=4 SV= +--NLTTSQIMSIKKSWKHINtkgLFNVLRRCYQRCQSCCPNVAKVFST-ENIKK-QQNIYSCGVSEHTKYFISLLDRIIDNEPNIEHELRNVGKEHAKlyeEYKLSITDIERLGEIIADVFLKLDGIRQNKETSKSWRILIASIIDEVSVGYE +>tr|A0A183CLY2|A0A183CLY2_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 +--LLTRTQRVLIENSWKRVKkaavEGGMGAKVFHNVLVAQPDMKLLFGL-EKVP-QGRLKYEGQFRRHAGLLNRTLEYVIKNVQytdKLGQHFRALGKKHCQmngGRAFPTNYWDTFLECILQSVLETDGSisgRYhrCREAALAWRNLVGL---------- +>tr|A0A0M4CP70|A0A0M4CP70_SPHS1 Uncharacterized protein OS=Sphingopyxis sp. (strain 113P3) OX=292913 GN=LH20_00550 PE=4 SV=1 +----ERSDAALMEATLAAVAetGIDIRHTLFERFFSAYPERHPAFLNL-DA-------------ASRRMTDETLQILFGLATDegwVWPLVAELVATHR-NYGmLPTDEYDAFIDLAIDELGRAAGRAWTGAHAAAWRRQGEIL--------- +>tr|A0A1Y5Q3I5|A0A1Y5Q3I5_9SPHN Uncharacterized protein OS=uncultured Sphingopyxis sp. 
OX=310581 GN=SPPYR_3232 PE=4 SV=1 +----PARDIAAMEASLAAVAdaGVEIRHALFDRFFDAFPDRRASFMIV-DA-------------SSRRMTDETLAMMLGLAKGegwVWPLVAELVFTHR-AYGpLPIAEYDAFIDMTVEELGTAAGAAWSAPAAAAWQRQAEAL--------- +>tr|A0A2N3CVZ2|A0A2N3CVZ2_9PROT Uncharacterized protein OS=Alphaproteobacteria bacterium HGW-Alphaproteobacteria-17 OX=2013663 GN=CVT78_05625 PE=4 SV=1 +----SARDAGQMEASLIAVAdaGIDIRHKLFERFFAAYPERRASFISV-DA-------------ASRRMTDETLQMMFGLAKGedwVWPLVAELVFTHR-SYGaLPIAEYDAFIDMTVEELGLAAGAAWSDETAAALQRHAEAL--------- +>tr|A0A0D6LRF9|A0A0D6LRF9_9BILA Globin OS=Ancylostoma ceylanicum GN=ANCCEY_06233 PE=4 SV=1 +--PFFRIDNRLVPDSAVAtDMV-QAQIHSYVYSSLQSTVSREMFQKM---SIVEGFRTNQccDLNMHAKVLCDLFDSIVSDLQQaskiVQARCMDVGGSHV---HMNekccGSLWDQLGECLAEVITKVECVRSKRECTKAWIMLISYVVDGMKCGY- +>tr|A0A1I7RN92|A0A1I7RN92_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1 +--GLTDDQCEQLATAFSNIPdKYYAFEQMFLNLfMKEDPQLAVVFGF-EGIR-PEELRRMSPFRTHVCKFQRFMTTVLDMLPKknreeeLIQIIRMVGRQHCNvkLLSFTAQKWLSFKNGMLNALAKG---GESHKYYSSWNILISFMISEMKDAY- +>tr|A0A183BTK8|A0A183BTK8_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=4 SV=1 +--QLDDTECEQLSTVFAAMPdKYHLFEACLRPMpMPeVDPQIALTFGM-ANIA-EIELRRKTPFRYSV--------------QKrgreeeLVQIIRMVGRQHCQvkQLSFTAARWLSFKSALTWTFSRG---EQKDKLHVQWSLLISFLICEIKDAY- +>SRR5688572_1577071 +---LARHDWHVLLDRWQRLQpnADRFATAFFDTLFGQQPAFLQIFAS-APL------------DAQFLRFAHLLSEIVSAADDadeLPRCVELVVQRFA-NDDCETDRSRAVRAAINAMLTEVSAAHMTPHMRASWHAAYVAVTAIL----- +>SRR5690348_16468503 +--------------------ADAAMTYFYAELSSAARATWAdrdIYMS----------------GPDHMIVRT--ARALVErg------------------APSRLIHYDLVDPRVTEGQX------------------------------- +>SRR5258708_24656334 +--------------------ADAAMTYFYAQLFAMDTEIRAMFPA--AM------------DVQRRRFFEGSAGSPLPsraRpttIASCLTCRNSGPHHM-IAETAP---------------------------------------------- +>SRR6185437_6364830 +--------------------ADAAMTYFYAQLFAMNTEIR-aVFPP--RP------------GPVKRMSRT--SSGACRrtrRs------------AAR-RPRPRPCHTSAGPAR------------------------------------- 
+>tr|A0A016TZT5|A0A016TZT5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0066.g3721 PE=3 SV=1 +----ANKSKKLVIAEWPRLLehEPNLFKIVWSSSAARSTSIKQAFGI-TD---NESPLENESFMKLSPTIQAFFYKLVIsmQLDEdmVRSACEQLGARHVDfiARGFNSNFWDIFLVCMAEAIDATLSSYITDeakraEMILAWQRVFNMIVHHMRTGYN +>tr|A0A0R3Q1W4|A0A0R3Q1W4_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=3 SV=1 +----ANRDKKLVIQEWPRLLeqQPHLFQIVWNASSTRSNSIKKAFGI-GD---DESPQENAVFMRLSETIAAFFEKIVItmQLDDdiVRSTCEQLGARHVDfiARGFNSNFWDIFLVCMAETIDETLSSYMTDegkraEMILAWQRVFNMVVHHMRTGYN +>ERR550534_360735 +---------ADAKASWANVDTAAFGKAFFKNWMASDPEVKNVFKK-SSFP-----------QGPAQFLVERFDILLGVLDDevaLSQQLMSVAKTHM-DKGVDPEHLVTFQDSFVKTLAGF-DSDWSRERSESWAYVLSHVIT------- +>ERR550539_1411929 +---------SLVETSWANVEKEAFGKAFFKNWMAIEPHVDEIFKK-SSFP-----------QGPAQFLVERFDILLDVLEDevaLSNELTVVAKTHM-ERGVEPDDIVTFQDAFLKTLPGF-DSDWTRDRSEAWAYVLSHVIT------- +>ERR1719192_2654783 +---------GAQS---APTPPKPVGQTwtkRLSEKLSSEPEVADVFKK-SSFP-----------QGPAQFLVERFDILLDVMDDeasLSKELQVVAKTHM-DKDVSPDDLVTFQDAFLKTLPGF-DSEWTRDRSEAWAYVLSHVIT------- +>ERR1719242_19104 +----------------------------------------------------------------------------------------------------------TPLIGMA--AQS-PLSWEQEK-----YVKLgQRWT------- +>tr|A0A0C2FEY2|A0A0C2FEY2_9BILA Uncharacterized protein (Fragment) OS=Ancylostoma duodenale GN=ANCDUO_24724 PE=4 SV=1 +--SLMPSQVSVIRKSWRHINTKGLITVLSrvfQRFNA----ID-------GQE--YAKVYDMTIYGIIEF-------------------------------------------------------------------------------- +>tr|A0A0C2G6K1|A0A0C2G6K1_9BILA Globin OS=Ancylostoma duodenale GN=ANCDUO_17195 PE=4 SV=1 +--CLSYKHRKLLRATFQQMNsSGaflKLMEQVFRRLEAKYPDIRSIFLTTAFVNSLSRERSSPPLvrteHDHCKCLVALFEKIMDNLSDdtQLMVIRQYGEKHAQmkESGMSGGMIESFGEIAVAVIASQYSYWIQKPVDDVTrrkgrDEGLVYLNDYEYIIL- +>tr|E1NZ07|E1NZ07_CAEEL GLoBin related OS=Caenorhabditis elegans OX=6239 GN=glb-29 PE=4 SV=1 
+--NLSVKQKKLLRQSFNAMNsGGtflKLMEKIFRRLETKCPDMRSIFLTTAFVNSLSRERQTPPLvkteYDHCKCMVGIFERLIENLENIneqLTMIRHYGEKHAQmaESGFTGAMIEQFGEISVFVIGSQDVVKFNHETVKAWRLLLACVTDEMKVGFD +>ERR1719431_1401903 +-----------------QLTtnSIRSGFCGRLCETTRyNPDCtsSNTFSMRfRKR--RKNFHSPMINTEISRRILWRRKRLMTRLFKrdpeATKRIYDVGFHHQ-MMSITEHDMTMLSSSIYSAVQDILGKKASDKDLAAWRHLLGLVSYHFKRG-- +>tr|A0A1Q9NTV3|A0A1Q9NTV3_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_3 OX=1841598 GN=hmp PE=4 SV=1 +----TSKEADILTQSLKALEekTDDLPKLFYYHFLEPtsNKEIISLFNK-SDM------------TKQYMMFHQSLAIIVSSIKDshlLNQILKDLVKRHK-NYGVKYAHVQIFSSAFYKTIEEIFPK--DEKVKILWIKLINFVLSKFNE--- +>ERR1719238_586270 +----PKEVIAEVRRCWEAFIkasgsKEAASEHLYAALYDAVPSVQHLFVT--PR------------VVQAMRFMTQLQTFITLLDQPkqsKVTMEAIGFAHM-QRDITVELCVLVRDAILDLLQVELGDNLSSSAAAGFKGLLNWM--------- +>tr|A0A2A2L6E6|A0A2A2L6E6_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_22934 PE=3 SV=1 +--KLTKLQKKALKFTWSRLQtrnggkrVESVFEDVFDRVVRYLPQTREMFNT--RAF-LCAIsrNETSSLRDHARMTVRMIDVAVRNLEVetrkrsdtgSDMDPLLIGIVN-----WRGSRYS---CRIINRI-------------------------------- +>tr|A0A2G5VGS5|A0A2G5VGS5_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-26 PE=4 SV=1 +------SERSIKLRKYDYEKddgSK--------KLL---SFYKKVREK-------------FTFKRSGSEMVAVVVSVMQSLDEpdkISKMCQEIGQLHA-KYrrskGMKIDYWDKLGEAITETIREYQGWKIHRESLRAATVLVSYVVDQLRFGY- +>tr|A0A1I8EM37|A0A1I8EM37_WUCBA Uncharacterized protein OS=Wuchereria bancrofti OX=6293 PE=3 SV=1 +-PSLTSAQIHLIRNIWRQVYitkgPTVIGSTLLHGIYFKSKKIKDQFFR-CPFP--HRFPNrDSFNKAHAKAVGEMLDKIVDNLENlesMSGYLFSIGATHANliRRQVSKEIWNLMAEAFIDCTLDWGdKKGRTEASRKAWAFIISFAIEKIKRG-- +>SRR5690606_37396704 +---FSDTDTYILHTGLKWIEeaPETFAAKLYQRLLRDHPECQASLHAI-GL------------ESFNRNFIHFLKMVKEELLErhtIHVAPREFLALHALpvEKVRHSNYVIKMGRTFLDIFAELAEDAWSPALESTWNKAIEEVK-------- +>GraSoiStandDraft_42_1057292.scaffolds.fasta_scaffold716659_1 # 2 # 607 # -1 # ID=716659_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.685 
+------TEIQILENGLRWIKesQDRFGDKFYHRLLREHPEVNPLLQSI-DP------------WSFNKDFVQSVDAIIGEIRAqgdVISPLKDFWPELSStaMTPLKPSELIKVAETFLDLISELAEDAWSPALEYVWRKAIKTVM-------- +>SRR5215207_8455447 +--------------DFDTVVCSSFAERFYSRLFTHEGGehLRALFPDN--I------------QPQHAQFTTMLGDILAYNFRigRSLLGD-TFRKHI-DFNIRESDVDVFRKAFVEEVGSTFLH--LG---------------------- +>ERR1711972_144950 +---------SQVLQSWEQVKllgLESVGEMLRANTFELDPQVVALFRIPGVVSTGEGMLQRMALRRLFSKVLRFVGSVVAGRYDyqrLVETLSR-----------------------LGATRAAGGATEVHFKI------------------- +>tr|A0A238BIH0|A0A238BIH0_9BILA Globin OS=Onchocerca flexuosa OX=387005 GN=X798_07861 PE=3 SV=1 +--------LFTLKNYWKTVRrnERDCAKMMLAKYLKQNPDNKEKYPKLKNIDVntVDVATANSGFETVAANYLKVFDDVITTVEEkpgdvsdACSRLTAVGKMHRTkVNGMDGSEFQLLEEPFLYMISEILQDRYNDKAENLFRKFYQFCLKYILEGFN +>SRR5215467_3799544 +----------QVSESYWRCCtNPLFIEELYQTLFSKCGEIKQLFEQ-KNVS----------MKRQYAMLRYALDIFVDYPHDMTATFPDIARKHT---GLDPRFYETFIEALIETVGKCDPK-WVPSLEHAWRERMT----------- +>OlaalgELextract3_1021956.scaffolds.fasta_scaffold865191_2 # 285 # 404 # 1 # ID=865191_2;partial=01;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.492 +-----RHEWHVLLERWQKLQpnADRFATVFFDTLFAADPELRQFFGG-ASL------------EAQFLRFAHLMTEIVSAAGDpeeldhrVEVVVQRFARDDS-A----TDQSRAMKLAIAAMLEEVAASDMTRQMRADWKAAYAAVGAM------ +>ERR1712159_177610 +---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNITFFNRA-HFTS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLRRISNKHV-GFGIKPQHYPIFFENLFVAFKEVLGTKATPELISSWKELVSLVQ-------- +>ERR1712159_799488 +---LSTSSLNAVKNSIPLIQqhGNAIAENFYVQ--QIQPTNVPFFNRA-HFAS----------GQQAQTLSQFLVLLAQRSDNlelMNTHLEESPTNML-DSESNHNTTRSSS-----------KTCSLPSKKS------------------ +>SoimicmetaTmtLAA_FD_contig_31_10253239_length_247_multi_1_in_0_out_0_1 # 3 # 245 # -1 # ID=589621_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.671 +--GLSEYERGLVVNSWKALTkpdfspldGTSSLSNFYDAVWTKWLKIDEF---------ANKMFRSRGFKGRVQHLLRIMGVIIKCAEDPlrgLEQLRSIGVQHC-IWGINSQSFASLALSIIHGLDQANGKEINAELKELWLAL------------- +>tr|A0A1V9ZGT6|A0A1V9ZGT6_9STRA Uncharacterized 
protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_12918 PE=3 SV=1 +-PVLTPTNVDICRRTWDLIQtagtdkMRqygkpgiiLFYDEFFYRIFERDTTIREVFPKV---------------QQRAEVLIKAINFILSTRAGtpasvmeTVNACRFLGHKHRAFAKVRPHHFAVYTNTCIEVIMYWLGEFGSHEVGTAWSHTVGFILRHILEAF- +>tr|A0A1I7XNU2|A0A1I7XNU2_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1 +---------NTT------DSglqlEGIVVQNCFIYILSKYKHLRPIWQFGKKIEDneenwTLALYEDFYFRHHCASIQAGLTMIMENKDDpesIKKLLNEIGAHHF-FYDACEPHLELLDQ----------------------------VKGHVSDG-- +>tr|A0A2A6BP14|A0A2A6BP14_PRIPA Glb-18 (Fragment) OS=Pristionchus pacificus GN=PRIPAC_48995 PE=4 SV=1 +---STPEDKKLMEKTWSEEFdvLLTLGSDIYNYIFKNMSACKRLFPWIIKYEdEGVDWKKTTEFKDQALKFVQVIDTVVWGIIDgdkSEPFLYDVGQRHVQyaSRGFKASYWDVFLDAMQYAQDQRIPKmnnlnaQEKQRAKQIWHDVAAYIIKHMKSGF- +>UPI0002C4E217 status=active +--------------------------DFGTAFFEYCPDLKGQFPS--NYA------------L----VTKMIQKFINNViegKNLERLARHYGRTHW-RYDLEERHFLGFAEALADTINIRIGNFGTIELMKIWREEATMICKMLEDQY- +>SRR5215831_15107384 +----------------------LFFSKFYTNLFGRADDIEDRFKEL-DM------------ERQYRILNLAIHKLLEFRPEqpaTQKQLRDLSLRHA-KLGLTNHAPAWNR-IH-LDLRGIGA--DGRSsGVAAADKALAX---------- +>tr|A0A085LU76|A0A085LU76_9BILA Uncharacterized protein OS=Trichuris suis GN=M513_10599 PE=3 SV=1 +--NLTTHQKQLLVQSWPKVQtynRIHGGDAIFARFCEKNSIGRIFQETFQKiavvQSFAINEASESVLKKHEQYLLQLLTQAVENLNNdrepLLRECLAYGAQHI-TLQelLNETVWDQLTEAIIERIHMVSFVRRHRNLSKAWTMLITLLVEKIREGY- +>tr|A0A2E0SMS8|A0A2E0SMS8_9PLAN Uncharacterized protein OS=Planctomyces sp. 
OX=37635 GN=CMJ46_12130 PE=4 SV=1 +MSQISERQYHLIHDSYRRCMlADDFLVMFHRNFMEKSPQIPKFFAD-HTL------------QQQHRILAKSVARLVSFVDGkpqaeqdMRDTMRI---LHDGNLRLTPEHYAFWATALMETICTI-DEACNDEVAVAWEQTISYGTGVLK---- +>tr|A0A0B2VQV3|A0A0B2VQV3_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_12261 PE=4 SV=1 +--NFNKRERVCLRETFQKLAdPkELIGAIFVDIVNDIAPELKKVFGV--DRAPKAAMLKMPKLGGHVARFTDLIDQLTNMVGyteNVlgaWQLVRKTGRAHT-KQYFletnqsarGTNYFALVANTFILEFTPYLTGekeepnvdekkkvrfasTYTStMISDVWARFFKVITAQLTDAF- +>tr|A0A1I7YWT2|A0A1I7YWT2_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=4 SV=1 +--SFTKKERICLRETYQRLQdPkEIIGRIFLDIVNDVAPEVKKVFGV--ERVPRPNMLKMPKLGGHVARVNDIFDQTTSMLGyteNVlgaWQLIRKTGRAHT-KQQFllenlnqlEKNYFQVVIDYFQEQFLPYLTGekegqerkkvrfaqNYTTiLIEDVWKRFFSILIAQMTDSF- +>SRR5512138_1182700 +--------HRRVQGSYSTFQatdrADRLYRTFYANLFASVPEARRMFAH-TDWS------------RQYNAINEALKLLLDFDADpqraadAAKQIGSVALKHQ-QYGLGERELRAFEGALLHALRSC-G-ECKPATLEDWRMILAPGFHHMRG--- +>SanBayMetagenome_1026888.scaffolds.fasta_scaffold228792_1 # 28 # 387 # -1 # ID=228792_1;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.353 +----EPNQRALAKASYRTWIepDTRFFEDFYRRFFATTAAKrahsVHKFK---DR------------KEQHDKLRNGMAAVLNFYpGNEPTSLRYVIDVHR-RKKVTEPELKQFSATFLELVSERLNRKLtgtgsaarRKEIMDAWTALFDQVLKHFRE--- +>tr|A0A0V1CBX7|A0A0V1CBX7_TRIBR Uncharacterized protein OS=Trichinella britovi GN=T03_16916 PE=3 SV=1 +--ELNDNDRQAIRQTWQKIGdHTLWAQRLFAKILVACPAFSKATSF-HSL-AGKHLLNDAKFRSFCQRFADFWQNLVQLLCvsdDpadwqqAVDSIRGLGQRHSLNRKVTfeAPIWLMIKNEIVLSITGY-SDICRSKDCLSWNKLLMFTVAEMKSAF- +>SRR5262249_4116633 +---------------------TKFFRSFYEILRE-SPEIHDMFTSP--FS----------VAKQAQKLNNAMEKILNFRTYMnTSSIGREVQRHR-KLNIKPEHYGPFRDDFVKALKKAkIDDGYS---EDAWCAVLDPALDYMRT--- +>ERR1719347_1935341 +-TGLSQNEVTLIWSHWESLKphKRRLAKRILKVYIKEHPRARELFPNWVDIP-TVELVKLTSFSRKAVDTWEAFSRAWECIDDaplCRKVCYAFGKKHI-ECnarikghgQIDEHHVKNFIRIFLRIILVSAR----EGSEEAWRKATEFFSINFVRG-- +>ERR1712142_116161 
+-THLSQNEITIIWSHWESLKphKLKLAKKILKVYLKEHPKARELFPpHWKGIS-MADLVKLHSFRRKANDTWEAFTRVWECIDDpklCQRVCFTFGKKHV-EWnarlrqtrgQIDEHHLKNFMHCFSKTVLDNSR----AGSSEAWRKATDYFSLHFLRG-- +>ERR1719313_2808357 +--SLSDATHELLQKTWQAAKPegpg--LGEAWYEELRsdtSYVDDLGVILNF--PV-------------CRPENVSRVVQALLDLLPRecqetpepglmlpvprFTKLLLAAATLAQ----------------------------------------------------- +>tr|A0A0D6M2N5|A0A0D6M2N5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum OX=53326 GN=ANCCEY_04360 PE=4 SV=1 +--NITPFEIRYLKYSWEKASsTMDIGCELVARLLNDN---RTRFRALIEshsgdLLgsanfAAEDVKKFRRARSVAHGVVMFFNQVISELDEpnsadfIAVISQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALQVKkttsfacgktisMsDKKAREVWYKVIQFVIQNMKRGF- +>tr|A0A1I7W801|A0A1I7W801_HETBA Uncharacterized protein OS=Heterorhabditis bacteriophora OX=37862 PE=4 SV=1 +--NISSQEIQYLKYSWERASsASDIGCELVARLLNDN---RTRFRALIEshsghLLgssnfTADDVKKFKRARAVASGVVMFFNQVISKLDEpdaadkISLLSQSLGASHF-RMKvwFQAENWLCVKNCLLDAIMTALRKNggssllcgkrhmHnIKRATDVWYKVIQFVIQNMKRGF- +>tr|A0A0K0DKR1|A0A0K0DKR1_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1 +--LLSTLVANNLQIYFSRANnATDVGCELVAGLLNDN---RTRFRALIEshsndWLgsatfTAEDVKKFKRAHSVANGVVMFFNQVISKLDEedaverIALQSQRLGASHF-RMKvwFQAENWLCVKNCLLDTIMAALMTKpfmvcgksitMnQKKSREIWYKVIQFVIQNMKKGF- +>tr|A0A1I8BDP5|A0A1I8BDP5_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=4 SV=1 +----MRYTNYLSKIVLARTLnQVDIGNEIVIHLLNDK---RSLFKNLLEqsspyEKeikniyDKKSLSkYSPRSLEISNGVTKFFKNLSLLLnqkgmEIeekedkLVEICKNNGKMHY-QMKvwFQAENWICLENSVIETIIKGNNLEkenFeSNQTIIVWSKLMQAIIGWMKQGF- +>tr|A0A158P8J3|A0A158P8J3_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1 +--NLRKEQVRALRMTWTRLCepprsnckgIVNLVERVWEKLDRKDSSVRNIFYNAAFvetMHDRCERRrskgSIATLRDHTHFFVSLVSQVIQSLDLnpenILNHVDTIgKSNHAylKQYGFRSQHWEKIGEYFVDVVVIQDCVRGFPEACRAWTILVAALVDRLRAAP- +>SRR5262245_41417288 +------------RASYPRCMaSGNLHARIYEAFFAACPEAKPLFDN-TDL------KRQYQLLHQAIVLMLAFH---VSPNrEEPTILSRVAARHS-ELGVhiPPAWFDAFSAAIQQSLEAA-DTQFSDKTREAWAAVLADGIGYMQ---- +>ERR1711884_327085 
+--------------------------------------------------------------------------------------------------SNESFSvIFKHLAFIKYL-HItktglFDELFGQHVCRIRRiLPFKLIIRL-SSNF- +>ERR1719471_2433215 +-----------------------------------------------------------------KGIMKVVSKVLCHLNDlsrVEDYLRVVGRLHD-SAGVEIAYLSVTGDAFCTSLKRLgtHADIWNDEVKQTWNAFFRVVVDLMSAGY- +>SRR6266436_7042579 +-----------------------------------------------------------------------------------------------RVFITAqysCRYHSFSATFYVMAGdkerwkVYM-SHQQMSLhARSKDGLYSRRttQGY------ +>SRR5437870_11165056 +----------------------------------------------------------------------------------------------------AqysCLNHILSATFYVMAGdkerlkVYM-SHQQMSLhARSKDGLYSRRttQGY------ +>SRR4051812_43285676 +-------EVEVARDSYKRILddevkEEKFFRSFYQRFFRKCPDAAKEFAA-KEFPRRVAlsGRggnaREGKWPRQYRLIKQAVVLLltFKLLDDteGLTILTDIADKHE-RYP--QEFYDSFRDALIDTVISLDKDsgsgLQRYELRDAWEKSIQPGIDYIMN--- +>SRR5262249_5830581 +-------DVEVARDSYRRILddverQREFFHTFYGLFLRRCPEAAAVFEA-KGYPALAQlgGPrvedSAGRGPQPPNPLKSAIVMLiaFNILGEkeEPTILDNLVDKHK-GFP--KRYYVAFQDALLETVVQFDDPsrcgMPPDELQHAWKQAIQPGGDYLID--- +>tr|A0A2T7PRA6|A0A2T7PRA6_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_02930 PE=4 SV=1 +---FEPHDKTIVAESWKLLRsiFPDLIESAFVEMCRRVPRLKLQFGNV-DVDDD--EERHMNFLKHVWDVSFFFDQLLLYLPfksKLEECSFHIGLVHA-SVEVPAWYVDLFLVEFIRAAQETVQLEWTPAMENAWAVFLRYLCYYMKDA-- +>tr|A0A2A6C3W4|A0A2A6C3W4_PRIPA Glb-17 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_39254 PE=3 SV=1 +-MELTDEEVAAVRNVWIRAKTEDIGKKILQTLIEKRPKFAEYFGILCQSDklDMNSLKESKEFHLQAHRIQNFLDTAVGSLGYcpvtsIYDMAHRIGQIHF-YRGVNfgADNWLVFKRVTVDQVTKGvtstqasqanlLegtkepevveqhpmadvQNPFSGEnclARLGWNKLMTVIVREMKRGF- +>tr|S9VAV3|S9VAV3_9TRYP Uncharacterized protein OS=Angomonas deanei GN=AGDE_12480 PE=4 SV=1 +-------------AAWSHLLtspnGGEFCSTLYEKLCQNLTYIPDYIRNLKD---------EE---RVIDHYINVITKTLELYENphvMIDELPKIAARHR-GFGVSSDAFFVMRNIFMELLPEYMDPKVYEQSKKDWLKFWRLVLDLMVSGS- +>ERR1719354_143580 
+------------------------------------------------------------------AFWDILDHICGHLDRlenLIPQLRDFALQCF-NSGLFSDDYNILGECLVTILSTNFD-PWEETHSDSWAWCLDLVMSTLVT--- +>tr|A0A1I3QX19|A0A1I3QX19_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter neptunius OX=588602 GN=SAMN04487991_1987 PE=4 SV=1 +----DEQMIALVKASLKELQphAGAVFATFQSKLAQRAPELAYRYDEV-DP------------ERQGELLFEKLAIAlggVRFLDRLVPALGGVGLDAG-SASLTSCDFARLSEVLIAAFAEVSGNRFDPCIGAAWTTLFEELSWHMFE--- +>SRR3954469_11252496 +------------------------------------------------------------------------------------DGGAIRRHHV-RSGIGGPDYGRFGDAIPAVMVDVGGNDLPKPIGGSWGDAFWAVIGRTKQR-- +>tr|E0VF27|E0VF27_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236389 PE=3 SV=1 +-----------VLNDWPKIRknYKKIFIDSFINYFAENPNYKLLFPSFSNVS-EDDLPFNHCFRLHCFAVYKAINFLMSNWlGEyeedDSKILPVIGKTHF-DRGITLEMMNLYKHSIVYSCNNHLKPNL--KRKLSWQTVFDHIFDY------ +>ERR1719461_240742 +-----------AVASWNNIDdKTAFGKAFFSNWLESNPRIKDVFAQ-SSFK-----------QGPAQFLVERFDILLGVIEDeeqLAEELYQVAKTHK-KVGVDQSDLYSFQASFMKLFLPS-TLItaqrsqtlgltpFLtssSLLWSRWQLSLPV---------- +>ERR1712165_596852 +----------------------------RLFLPSTLTSLQRLETH-----------------GLTPF---------------------------------------------------------------SHVITAP---------- +>SRR5580704_4499342 +------------------------LGDFYRRLLQHHPQLAAYFEGV-NI------------DFQVQKLVVVLSTIARDLPDrsvLDRVLFHQGVAHV-ERGIGRGEFNEFIALLANVVSCKTTLVGAAESYAVWYQELSAVATSML---- +>tr|A0A0G4HY87|A0A0G4HY87_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_33490 PE=3 SV=1 +------NRIHLLQSSLAACLkmstkEEFVGRLMYDTLMRTLPEPGIIAKR--GR------------TMMSRAFNDtvaALVAFVSEPSHMETYMDWLALRHV-HYKIDTTLFPQFRQAMLVSLEQVMADQWNAEIERAWSEAYEMTSQALQ---- +>SRR4051794_14672716 +--------------------SPAFAESFYTHLCR-SDAVRDLFVTAHRKRVPAALnrQESpaIPDETQRRKLVDGLKAVLNFRPGcSPSSIDSVAARHV-DLHLTTDHFDVFEKSFLETLEQHVTRSEdreeMEEITHAWEKLFATVRDEMLD--- +>ERR1740139_220892 
+-------TRAALLKSWEMVQeaGTvPAANLLMKHLRERDAEALRVNTSH-ARP-KTGETEEDAVRKLAVRTVQILGSAATGMSDtvsLVQHLHKVGAGFA-GTGIKEGYFAMVRDASPFALRELLGDRFTADIASACRITGPFLASLIIAGLR +>ERR1712194_173361 +-------TRAVLLKSWEVLAevGTaTAANVLTKHMRELDAEALRSYTSQ-AQP-KDGETEDDVVQKLAVRTVQMFGTAvtA---NDtasLIQHLHKVGAGFA-GTGIEEGYFSLVDKASPLALRELMGDRYTADIASACSMTGDFLTSFVREGFR +>ERR1719446_598571 +--------------------KKAYGLNAFNRFFCKAATIGNSFQHI-QC-------------ASVCSgnarSPAVSGYLQGAYTlgeCGHLTWPQTHHVQH-FYRLLX---------------------------------------------- +>ERR1719240_1501566 +------------------------------------------------------------------------------------------------VQHFYRILRLLLEACCEELADWVKD---PAAVEGVEWALTQIAAIMI---- +>ERR1719235_1367256 +---LPGVTVEFLRSSLARISEDEFGDMFVQKLRETGDmlsegTIEGVLNT--PI-------------VRPTNLRKMIVYAL----------------------------------------------------------------------- +>SRR3989338_2963815 +---------TPLYHLYKENVppqkERELGLLFYKLLFDSNPELLDFFANV-DLD------------HLSDHLVQTIRLFLESRnslVSLVPAMKALGIIHQ-RAMIPSWAFPLVIENMAKLFSILLGDRFTVELASALVLSFDLLTSF------ +>SRR3990167_6716616 +---------NPIYStlknIWlETVStpeiKSAVGELFYKNLFQYHPELLEYFNNV-DMD------------SLALHLSQALDFVFQSInkiGDYksqwRTVLEHLGEVHR-AALIPTWGYPIIGQQILKIFPYNEKAGFSTKQL--etaLATLYREIVII------ +>SRR5436309_231744 +-------------------------------------EIGQLFEG-RKVT----------MEDQYRKLDRAMFSILSFNRRlKATTLDPQVASHS-EFGLKREYFQFFREAFLAALRETQAS--DDYSREAWSALLNPALAYMSD--- +>ERR1719183_3286062 +--------AISLRDSWVHIEvlkeeddSGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFSTLVHAMGDpqkFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDSMVRNFR +>ERR1719183_785787 +--------AISLRDSWVHIEvlkeeddTGGFGDALIFQLS---VVAQEIFGLV-VTE----------RNALGKIFNRMFAVLVQSMADpakFTEEFFVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDCMVRNFR +>tr|A0A0N4UGY4|A0A0N4UGY4_DRAME Uncharacterized protein OS=Dracunculus medinensis OX=318479 PE=4 SV=1 
+--RLSDKQKLWIKLGYKKWRsksKMVPGEWVHAYAIKKYPTMKALFKK--HEN---------LARVYTQTITKIIEMAVESVdslDDsLGPLLISYASENgileERgmasiftirndklllfLEGFDRRFWGYVAEALCALSRDFPLKRHKWDTISAWRIIVLFIVKKLEYGF- +>tr|A0A2A6D1B3|A0A2A6D1B3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_35146 PE=4 SV=1 +--TLNHQQRKLIKNGYDSWRkksCISSGRWVHSFVSSKDDRLKEIMEG--NEE---------TTRIHEETITHLLDMAVESLeslDDsLGPLLISYTGPQgvfeEK-DGFDRLYWSRVSEGMCQLARNFPSKANKYETVCAWRIVVLFICNKIELGF- +>tr|A0A2A6B4U3|A0A2A6B4U3_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_54703 PE=3 SV=1 +--GLTKDKTDLMANLWPSHYgtLYDMGIAAWDKLFAHNPGLKKHFGF-AENDPSSSWKNDERIKKMVLSLQQLLTEAVNTLGfgDtealtsFVNNLRELGGLHRAiADGVNPDAFTLLFAILPEVIVDVTSnrskdgplsSENRSELLAIWRAITRFMANQVMTGW- +>SRR5687767_14811217 +--------------------SREFMSRFYRRLFAARPELRSQFKNV---------------TTQHDMLAEAIRDLVLFRpGDQEARFLDYVETHR-RMNITVHDIEAFRLAFVAEVIATSMQngnAQARSHGDAWNAALKLGLGVMAK--- +>SRR4029453_11133516 +-------------------------HLIILKLQRIAMQGAflSVIPAtgFSEH----------FITNSCEFLPK---PQSSSREKalgenEPNILSRIAEMHNKnNYNISPESYKAFVSALTATICGSAPEipePFAPqckisvneknLIKNAWQKALKPGIDYMIMRYS +>SRR5262245_37180117 +--------INKVHESLKRCRlQPGFFRDFYQQLVKNDAIQ-AIFTKrgLDVL----------KSDKQQWLLREGLDLLISYADEpkspGLHVLSRVAESHSI-YRVGIEMYDGFLEALLVTVRRHDLEfqdP---skddskVIEAAWRRALKPGLDYLKSQRP +>SRR5262245_45185474 +---------------------PTFLEAFYKLFTA-DEVVGKRF--vkFDDI----------EWKRQHGLLQQALDACFDFASLlsmqnlrelpEPNAMTKYVVRHGPgrgNLGITSTEYDAFVEALITTVCGNPGNgqaPYDPecadaerkdVIEFAWRRLMKLIVEHFKKVAR +>GraSoiStandDraft_39_1057311.scaffolds.fasta_scaffold195098_2 # 276 # 1100 # -1 # ID=195098_2;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.692 +----SFDVFEIAKDSFNRCMgadgGALFFKTFYERLLSKLPVP-yaRQLSQkgVGTS----------SSHRQYDMLRQGIFILLQFGQHklyerEPNILSTVAVLHDQhHHNIPPNLYAAFTGALIDTVAGAPPAiptAFDKqcetdmdIITDAWEKALAPGIRYMTEKYF +>tr|M1PA46|M1PA46_9CORY Flavohemoprotein OS=Corynebacterium halotolerans YIM 70093 = DSM 44683 GN=A605_12675 PE=4 SV=1 
+--------------------SGEFRDEVHRRFYLDVLEARQVFPL--TLR------------ETHVDLASSLAWVLERtssdgtLPDdVLARIRRLGVDHR-RHGFPAEVYPAFLTALRGGLRTVTAEHggVDDPLVDAAGDVFARVCGAMADA-- +>tr|A0A097IIH9|A0A097IIH9_9CORY 2-polyprenylphenol hydroxylase OS=Corynebacterium doosanense CAU 212 = DSM 45436 GN=CDOO_12240 PE=4 SV=1 +--------------------SEKFRDLVHEQLFSTELQSRQVFPS--SRA------------RSHLDLAPALAWVLERstidarVPDeVMRTARRLGLSHR-RHGFPSEIYTPFADMLVHALREVNFRAdpqLSAGLIIPAETIIRNVCNAMRAS-- +>tr|A0A0G3HGP7|A0A0G3HGP7_9CORY Uncharacterized protein OS=Corynebacterium uterequi GN=CUTER_09860 PE=4 SV=1 +--------------------PDEFRSRTLTGFFAAEFQARQLFGL--HAT------------QAHDGLPEVIAWALERcgidghVPSeVLDRLQRLALVNR-RFGFAPSAYSSYAEAITTALKDLAYVHfgeVNIlpSQMFAATLALDTCARYMQRA-- +>tr|K0YDT0|K0YDT0_9CORY Uncharacterized protein OS=Turicella otitidis ATCC 51513 GN=HMPREF9719_01398 PE=4 SV=1 +--------------------RTAFRDATVDYLLRRLPRLRRVAPL--RQR------------HRAEALAERAVGLVARspqgmLRGeDAADLERAGRANR-RLGVPLRVYPVLAQALKAGLRAAFEAAgepYTA-AARDAEALAEAACASLARG-- +>SRR6478735_8357209 +-----------------------REIAFLVARGLPsKEIAEQLFLSVR---------------TVQNHLQR----IFTKLG-VTSRGEVAGVLQG-LEGPSSX--------------------------------------------- +>ERR1712130_811490 +----------------------------------------------EAAlagmKAVEDLGGKFDRTKHGSLFLSVvLTRVVPHLDQrdrVLPYLVELGALHQ-REELQDITLICWVLHIalPSGVWSRVeecVGGYC--TRQPRLGLVWSLPS------- +>SRR5436309_12080688 +------------------------MHRFHAHLEQLNPRLRYHLPP--ALL------------RYVrFELLQAVRQQT--PMEVGSGLRRFGVHLR-AQGFEGPDLDTLGAAWLVALDEVLGDRFDSEAREQWLRFYKVLRSA------ +>tr|A0A0N4Y9E2|A0A0N4Y9E2_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis OX=27835 PE=4 SV=1 +----------RIQHSFKTASfhltvnqlrsRPTIGDAILKRAISNRPEMRTFLNRLTE----------QQVEHMGKQFYSLIAVSVENIERpeavryfs-RLPFFAMFETYATlcQLGFRPDYFAPLADAAIAECVKLDGGaHKRCETLLAWSQLISAIFTSVRDGY- +>tr|A0A183LHE9|A0A183LHE9_9TREM Uncharacterized protein OS=Schistosoma margrebowiei PE=3 SV=1 
+--------------------KIKVGKEIFRQLLIKNPHYMKMYKPLQSVT-LPQALNLDYLTKMAICYVDNIMKIVRNFNEeekLQETVKYLAAIHT-NRGLTVAHFVSILPIFTDTIVSYME--------------------------- +>tr|A0A183WH41|A0A183WH41_TRIRE Uncharacterized protein OS=Trichobilharzia regenti PE=4 SV=1 +-----------------------------------------MYKPIQSVT-LPQALNSDYLTTMAIRYVDSIVDIVENFNDeenLQQKIKYLAGKHT-NCGLTVAHFVVSLQILCICVHIWQT--------------------------- +>ERR1700755_1321676 +------------------------------------------------------LN-SKG-HRQRDELLNALVSILSKYDPdrpdsqpmieLEADAMGWGRRHASfaalggrPA--GPDQYRVVRDVLWQLLIDASDGRWDAGHTEALVDAYHWVQTIMMW--- +>tr|A0A0V1KYG9|A0A0V1KYG9_9BILA Uncharacterized protein OS=Trichinella nativa GN=T02_16304 PE=4 SV=1 +--SLSAGELKLLRWLWKQMKqvhQGLASAKLFQIIFATCPEIKRFFGL-AKDT-IDMIINSLSYDNE----------------QLAQLMIAFGCQHSFytRRNFDPKYWNVFGDAMLHLVDDLPLKAFKrYRAKSIWFRFVYFVISHMQLGY- +>tr|A0A1I7VKJ4|A0A1I7VKJ4_LOALO Uncharacterized protein OS=Loa loa OX=7209 PE=4 SV=1 +---------------------------------------------------------------------NALKKIIESLKNeqiPYEVLQRISVKHA-RHNIQTHHIQKMIKPLVENVRRALGR-QDENAERAWETLFQTIAII------ +>SRR4051812_9951159 +MTPLPPEVAQTIRSSCRPLLerQEQFHGDFHASLVDLMPEVPMMREP--A------------GEQVSRWLVECVLWAVNADEPvpmIGATLQGVGLDAH-RLGFPRAGYQAVGHALLRTVRGASQNDWSGTLSSSWIGYHSWLCEYWVSG-- +>ERR1711890_22380 +-MHLSDTEKSAVVSSWSNVN-SSLLDSVLLQLVQENADMRAAMSR-GDLA-EDSIREQETFKADVTKLTCCITKLVTRLGNTGEVSSCPatCLKNC-P-YLQPKHVPLFISSFCD------KLELTEDAKKGWKFIMEKTAERI----- +>ERR1712018_299478 +----------------SDVA-ENHLEDVLLQLVRENSELRSSFSW-GNLP-EDCLRDDDKFKEDVKRLNTCISKVVDILSSSGDApLACPvsSFTSC-P-YLKSVDMPLFIKCFNS------GNKFSENAKSGWTAIFEMAGKKM----- +>SRR5262249_47865225 +---MNHRQVELVRSSYERIRrvRHLFADLFNRRLTLIAPVLERLLPP--ET------------ARRDAAALELVEFVVAGLDRLDVLLPALAVQARVwrLKGVEAADYDVAGMALAWTVEQVLV--------------------------- +>SRR5215470_9720857 +----------EAKRSYRQFArDISFYRELSKRLFRKIPGIEKKFRH-RTM------------EEQYKVLRDSLWLLLSYASapdQqEPTILSRIAHTYA-R--FPKEWFDTFREVILDVVAQRDP-----SSVRAWKHAMAPGLEYL----- +>ERR1719487_1476365 
+-------YKTILDRCYERMTtqldLVAMVTLFQGIFFGRDIRIQSYFSKP-N-------------ATLRYVVLRIINFLVNVYHkpaAITGELRALGVSHV-KWEIPPDLFVPLGEALFITLEICLGG-------------------------- +>ERR1719271_344116 +-----------------------IRKDIYSTFFTQAPAGQDYFKQS-N----------TYLHVVADKIMVMTLELYQNPVKMVDDISALGLRHV-GYAIPTELFGPFVSACVEVLMTRTSD---EATIESFRWSLGLTSKML----- +>LSQX01.3.fsa_nt_gb|LSQX01333836.1|_8 # 4697 # 5665 # -1 # ID=41498_8;partial=00;start_type=ATG;rbs_motif=AGGAGG;rbs_spacer=5-10bp;gc_cont=0.475 +-----------------------LRQEFFLNFFKLAPSGQDFFKQS-L----------TRLYFIADKIIELCLEIYRQPRAMVEDISGLGLRHV-GYAIPPELFGPFVGSAVEMFSLATTN---ETAIDGFKWAMQLVSKIL----- +>SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold703673_1 # 2 # 517 # 1 # ID=703673_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.653 +-----------------------SSSIIVSSFMRDssrPCRRVRTIKQS-N----------TRLHFIAESATNMSLKLLQDPWRMVDDVSALGLRHV-GYGIPTEMFGPFTEAAVDALRGHVDE---TLALEAFNWSLSIISQML----- +>tr|A0A1I7RTA6|A0A1I7RTA6_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1 +-TGMTRHHKMILQKIWMRASeadINECSRNMMSHLLRSNQQLYQMFNLV-GMT-DKEIQQSIPFNRQAANFAMVFDFVITNLTDdlnrVAFALEFLGQHHA-DLGFTIdqPFWALFNRVFEDNPPKLV--FQNPEGHQVWKLMVNFVVRQVKNGY- +>tr|E3MDQ4|E3MDQ4_CAERE CRE-GLB-31 protein OS=Caenorhabditis remanei OX=31234 GN=Cre-glb-31 PE=4 SV=1 +-------DVERIRAVWMDhINgNDDYFQEVIHRICKRNDGIRCAMLTQnAQHA-ESAAEEDFVLSNIADRISQFFHQLIEddvllNTVELKKCCYDLGRQHS-AYSkkqFKISFWEEFTLTMMDVLEQNYP-QTTKEEQKAWLHFQRFVNENMLDGY- +>tr|A0A0B2VIR8|A0A0B2VIR8_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_08540 PE=4 SV=1 +---QTSTRIALLQSSWTSVQtmtSGQFGARIVYSMLRKDPSLFDVFTTVqydgeetplrqtsgliarfynfGSIPdktppnngEetplrqtsgliarkSFDLLTCPQYYEVGDRIMNFMGELIQMMQDgqseqaIIERIRLVGATHY-ERNVmfSSCVWREFKASTLAIVGESTFEseSIRVETLKAWSSFVSLIIREMKNG-- +>tr|A0A1Y5SIU2|A0A1Y5SIU2_9RHOB Uncharacterized protein OS=Roseisalinus antarcticus OX=254357 GN=ROA7023_01630 PE=4 SV=1 
+-------QAELVADSLSRVGdkVIWLASDYYEALFDASPQLHGVLPH--QM------------SEQTNMLGHALAHALANLRDpdgAAPMAQDAGLADR-SARMPPRMRRTIVRTLVHALSLWHGPTWTKDHARAWNEGLLGVA-------- +>tr|A0A0N5CYF2|A0A0N5CYF2_THECL Uncharacterized protein OS=Thelazia callipaeda OX=103827 PE=4 SV=1 +--ALSTVQRQIVKECMDKA-KDDIAERIYRRIFERRSDFRKFILA---LPD-------KQRWALTDSLHNYLKSAVNQIKDgsaVRKISEDFGAFHVQyrSFGFRPDFFVSTADAVTTEFVLLDAaVHQASDTLCAWSTLTGFMFSSVRDGY- +>tr|A0A0K6SA08|A0A0K6SA08_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_8920.t1.CR2 PE=3 SV=1 +---------------------AAMAEKFFELVPKRAPNLRMIFEKRQDI-----------YKHHFGEI---TKRLLAYLDSpeeVWKEDPELAIKHI-EFGVMPCDVPVFANVFLQILAELAGPAWTQRHRDTWDKLFSIVSGALA---- +>tr|A0A0G4H7J1|A0A0G4H7J1_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_24983 PE=3 SV=1 +---------------------AVFSREFFKRLSTFAPSVHAVFVKSEEK-----------YTRTIKDL---LGRLLAYIDDpsaIWSDDEELAMRHV-IFGVMPTDIPLYNRVMVQTMAGIAGGEWNLQHDAVWTKMMGLATETLS---- +>SRR5215468_7630418 +----SPEVMRVIRFSAGLLAelQDMFVRQLHSEVTALIPGLAA------NG------------RIFCERMVRSLLWAATAgqpPHAAAGALRQVGAANR-RDGFPEERYADVARALVLALRNVSGSSWDNSIGSAWISYFRWAEPHLRAG-- +>SRR5215469_6664897 +----APAAGRVGCQSAIRLSrnQDAFIRQLYDDFKELDPDSaqtqAP------DL------------LVFCERMVRALLWVALTdqpLRVVADELRQVGAQNW-YES------------------------------------------------- +>SRR4051812_31756681 +----APSVMRLLASCTADLGpqQPELAEALYQRLLELLPEVatlAE------RG------------RPLSDRILHAVLYPTEPgrtPLNVATVVQQVGAQNY-LDGLVGEHYSSVTHAVLHAAREMYRGEWSSALSSAWVEYLLWLRGHLLAG-- +>ERR1712232_311801 +--------------------RREMSMAIWNRMFKKDPEAERVFKQ-SN----------ERLIFIVEKAFENAAKIYQSPSETREYIQGFLVLMK-LLLMAL--LGRFLSSRAPWL-------------------------------- +>tr|A0A0G4HD16|A0A0G4HD16_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6316 PE=4 SV=1 +---LTFEQKeEIVRSAWTTLSstyqLQEIGRVLYETICEEAPGLSSRYTKPGE--------------VMALRFGEMLATLIHlfldFPNDLQQKMEELAIRHV-NYNVDLEYLPVFEISILRTVQELYCeGEFDVEVAT------------------ +>tr|A0A2W4YK05|A0A2W4YK05_9SPHN Uncharacterized protein 
OS=Altererythrobacter marensis OX=543877 GN=DI636_06370 PE=4 SV=1 +--------AALIERGLERAAqqLGDITPLVMREFYRRIPEAEASFRHH-APHDPH--------GLEAEMVGNTLHYIMRWHEAPmeiRIDMDTSVPHHRVALDVPPDWYRGMIEAAIDVILSSVPSSA-SDERTAWKQLRDQLVSL------ +>tr|A0A1Y6FH01|A0A1Y6FH01_9SPHN Uncharacterized protein OS=Altererythrobacter xiamenensis OX=1316679 GN=SAMN06297468_2444 PE=4 SV=1 +--------STLAERSFERLAeqRGDITQDVLERYYRRYPDGRASFEHH-GLGNRA--------ELEGRMVSTTAFLLMQWAQDPggtRIEQGTTIVHHQDTLEIGPRLYLGLIDAVLEVLFETIPDES-AEERAFWLSLRGEIADF------ +>tr|A0A2E8LSZ4|A0A2E8LSZ4_9ACTN Uncharacterized protein OS=Actinobacteria bacterium OX=1883427 GN=CL510_01665 PE=4 SV=1 +--------SELAQRSLERLSevGGDVTRPVLDAYYARHPDARASFEHH-GLGHTA--------ELEGRMVAESLYLLLTWIEDPataRIDHGTAIVHHNDSLHIPPRWYLGLVDAALDVLLRTVPEDS-PDERALWVALREEFAAF------ +>tr|A0A1E4JTP1|A0A1E4JTP1_9SPHN Uncharacterized protein OS=Sphingopyxis sp. SCN 67-31 OX=1660142 GN=ABS88_06340 PE=4 SV=1 +--------LELLDRSLTRAAdaIGDITPVVMARYYARHPDAAASFERH-GMGRTS--------ALEHEMVDNCLYCLMYCLERPteiEILLENSVPHHQFTLQVSFDWYRGLVDATIDVIAESVPADA-ADERQVWDEIRSVLGGV------ +>tr|A0A2E0VIY1|A0A2E0VIY1_9GAMM Uncharacterized protein OS=Porticoccaceae bacterium OX=2026782 GN=CMK32_09515 PE=4 SV=1 +--------NDLILNSFESAAesLGDITPHVYRRFFLQYPEAESLFNIK-GAQFQD--------ELKVQMVRDAIYAYLEYLETPeevEIVFKYTIPQHV-DLDIPIRYFIALLEAVADVVCDSVDDRTQADTKASWSELLQEFRQM------ +>ERR1711865_325941 +---------------------SQFGLNAFNRLFDTEPRSEDHFKT-SN----------A---RLSMLATKSLELSMQMYKEptrVMNEVTSLGLRYI-FPAHD----------------------------------------------- +>SRR2546421_6426420 +------------------------------XMIRRPPRstlfPYTTLFR-SDF------------ERQNKLLRHAFGLLLIFPNQartEPSVLTRVAERHSRrDLDIPRSEEHTSElqsRSDLVCRLLLEKKK-KNQV-------------------- +>tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae GN=mphP PE=4 SV=1 +--------------------VTAHSIQAVADELRAHraeFIQAANQ------------------KPD-SPLADAIVQLVDHTDLdghvpesIATSWLQHAAAAE-SLGVSRDYYLTLADASRSALRHICAD-------------------------- +>tr|D9QCQ3|D9QCQ3_CORP2 Oxidoreductase 
OS=Corynebacterium pseudotuberculosis (strain C231) GN=CpC231_1874 PE=4 SV=1 +--------------------KDAFHTQVFANF--YHsnPYARATI------------------APS-EQLVPAVISLIGHLENngfisdeVKQKFLEHTKLLD-ARGF--HHYTALASAVRSALQTMCTD-------------------------- +>ERR1719474_106261 +----STASLELVLDFWRCTVhrlsvhdRAMMGGDLFRGMSRQDAACRALLESL--N------PTSERMDLWGLRFLDTTGWMLRRANaaDLDASLKAMGAEDR-ARGLTVAYYRVLVERLHSELAARFPTKYSETVQAAMEEVIWSFVRR------ +>ERR1719499_858439 +------------------------GRAIIEGMNHE-------------N------TSPNQMDMRTVRLLDTLGWMIRMSciPtmDLKVLYAAWNGMAA-EVGYSAEYHVSWIQYIEAQLTERFPSEYTDSVRSAVRELLRWSIPN------ +>ERR1719410_2598304 +-------------------------------------------------------------PSHALKILNVFGYVIRNLIHpsnhlkLFKQLQSLGTVHR-AHSLNNEMYEAMLKSFNYAMEEKFANHYKIRIRFCLSQLYRVIVDIMTG--- +>ERR1719216_785110 +-------------------------------------------------------------PKHTIKIITTFGYIIKNLIYskehtkIFKQLQSLGEMHQ-CHSMInTDIYMELLNAWHFAMEEKFQNKYKNNTRFCFNQLYRLIVDTLMG--- +>tr|E0VF51|E0VF51_PEDHC uncharacterized protein OS=Pediculus humanus subsp. 
corporis OX=121224 GN=8236397 PE=3 SV=1 +--------VKIVTPTWESIKedFDWYCTKIEETFFQNDTTKKELFTL-PKFEeELTDDVVNKRLFKHSSAVLNFMECIVQFMNGneeTKPVLFVLGRNHY-TIGVNEKLFLEMKDAICSVIKYKIG----TENAKAWDTILQYI--------- +>tr|A0A0M3IFG8|A0A0M3IFG8_ASCLU Uncharacterized protein OS=Ascaris lumbricoides OX=6252 PE=3 SV=1 +-TGLSMHQKAILTARWRQLPqgiVFDLGKRVFGTLFQKDPNLLVVINL-EHLQGTDAWRDHVNFHMHAQRFTHALSQCMRHLVEpivAADRLQEFGATYAEmedsenfnRSRIPHSYWDRLISAMTSTAKEFHEnpsqksrrnslsvddalvatnerldLQIDSANISAWSALATFVSNQIRFGYE +>ERR1719199_711328 +---FKPSHISLIQNQMSALIsefgsIEGAGEFLITQICALDEYVAKLFSG-AAL------------RVQGFKFLGQIARWVTYLADpetVEADLYNLGIRHL-GY-VTQQDFAKFLPaviqCMQKSLKDVLDEQWSALAAESWKMFLGYAGGH------ +>ERR1712070_698694 +---------------------------------------------------------------LCFIIARVIDIAAQlfvEPDVCIAEVLQLGLRHI-MYKVPADFFGPFAGIIADEIEARCD--------------------------- +>sp|O76243|GLBB_CERLA Body wall hemoglobin OS=Cerebratulus lacteus OX=6221 PE=1 SV=3 +-----------------------VVDAFYVELFTAHPQYQDRFA-FKGVA-LGDLKGNAAYQTQASKTVDYITAALAGSAD----AAGLASRHV-GRNVGAPEFTHAKACLAKACA------------------------------- +>tr|A0A2C9LKZ0|A0A2C9LKZ0_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106051185 PE=4 SV=1 +--GISLADIKVITNQWEDVLrcSDLFGKLLVLYVLDNCPKVNALHPGLHAR--LTDARD-SVEKQIGLRVIQSISCVIHNLNRapaVESMVRDTFKKLQ-QHGYTKNTILECSEAFLSFMNQYFSKRWLKQHSDAWFKVLKALL-------- +>SRR5690606_9602430 +-------------------------RAFYPILYSSVSGAQELFEA--TVG------------TDNRKMLQILAKLFGfisNVNhsSefMkSDAFIERGKYYA-DHGISETMMRGFSSALVLTLRRTLGELFTISHVRAWGIFLDTISHAL----- +>SRR4051812_40179264 +-------------------------RIFFPILYSTVPSSQELIEE--AVG------------TDSIKMLQLLVKIFRiisDINhdPevMkSEAFLERGKFYA-DHNISENMLRGFNSALTLSLRRSLGERFTISHVRAWGAFLEMISHSL----- +>SRR5690242_7041980 +-------------------------RAFYPILFSTVSSSQEIFEE--HIG------------SDQTRMTETLRHVLEffiSVNlnPqiLsSDKVIERAKKYA-DLGISENMLKGFSFSFLKALKQVLGGALSAEAMREMVRLLDNISIQI----- +>tr|A0A0G4HHE4|A0A0G4HHE4_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6802 PE=3 SV=1 
+-------------------------DALLGILFEASPTMRSVFVKNGDL--------------YADLIEHLLRRIIAYADDpgaLWTDDQHLALDHI-NFGMSMSDLPLFGASLMNCLAGVLGENWCDEWQRAWEKAWQICCQSL-----