view test-data/multimer_output/msas/A/bfd_uniclust_hits.a3m @ 9:3bd420ec162d draft

planemo upload for repository https://github.com/usegalaxy-au/tools-au commit 7726c3cba165bdc8fc6366ec0ce6596e55657468
author galaxy-australia
date Tue, 13 Sep 2022 22:04:12 +0000

>chain_A
MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR
>tr|A0A1K0GGD5|A0A1K0GGD5_RAT Globin d1 OS=Rattus norvegicus GN=Glnd1 PE=3 SV=1
-----------------------MYGLEKEp-R------------ETEGClsrKLPSNLQRSSAPWRLHGFQNLLERSQGA--------QRAKPG------------HGAHSHSSVKMAL--SQTDH------------------rlvL
>tr|F6QUQ8|F6QUQ8_XENTR Uncharacterized protein OS=Xenopus tropicalis OX=8364 PE=3 SV=1
-HWTAEEKAAITSVWQKV--NLEQDGHEALTSISLTFISPLdvvwAYFKG----------AAHNK---------IKFCFNIELKQISLSFHARWKNQNPEQKLERLGEVLVIVLASKLGTAFTPQIQGAWEKFVAVLVDALSQGYN
>ERR1712144_198951
HESLWKRQVRG---evfLGESRPE-VrRDRRRSSG-qDAGGLPPDQTYFSHWaDLSPDSSQVKKHGGVIMGAVGEAVGKIDDIVGAVSNLSSCMPSSSEWTLPTS-------------------------------------------
>tr|A0A096M318|A0A096M318_POEFO Uncharacterized protein OS=Poecilia formosa PE=3 SV=1
VNH-KHDELII---tgvFFTS-------VSECVP-pVRNIYRQTTNSIENIgNFKngetfLTNPPVALYVVNMVEFTSKPLMSL-PLNGFYGILDFLKA--KRKNPNGGKLLADCLTIVIASKMGS-gFTPEIQATFQKFLAVVVSALGKQYH
>ERR1719244_1811598
--WSDDETKAIQMIWNSVD--VNELGPAALRRCLLVYPWTQRYFGKFgDIATPTAimqnpGVAQHGITVMNGLKLAGGPGGGPGNQPGGQQELWQRGKQQGQQQLWQQGQHGGKQRGqqQRQGQq-PSPRQSX------------------
>ERR1719167_1707907
VEWTDFERATIQDIFAKMP--YEEVGPAALARGLIVYPWTQRYFGNFgnLYSAStilvNPLIAKHGTTILHGLDRAMKNMDNIKETYAELSVLHSEKLHVDPDNFRLVSDCLTIVVAGKMGKDFTGEVQAAFQKFLAVVVSALGRHHH
>tr|A0A146QLZ2|A0A146QLZ2_FUNHE Hemoglobin subunit alpha-2 (Fragment) OS=Fundulus heteroclitus OX=8078 PE=4 SV=1
IILTSNYNYTFNTFFSKFSSNSYSIFSYSLSIILFFYPHTNTYFSHFnYLIPFSSPFNNHLstfiflfsxxxXXVMGGVEDDVEKIENMKEGIIRISEMNELNMRVEKEKLKIMEKKIIVV---------------------------------
>tr|A0A147ASE9|A0A147ASE9_FUNHE Cytoglobin (Fragment) OS=Fundulus heteroclitus PE=3 SV=1
EPLSDSEREIIQDTWGHVYKNCEDVGVSVLIRFFVNFPSAKQYFSQFQdMedpeeMEQSSQLRQHACRVMNAINTVVENLNDPEKVSSvlaLVGKAHAMKHKVEPIYFKILSGVILEVLSEDFPDFFTADVQLVWTKLMGALYWHVTGAY-
>tr|L8HUF7|L8HUF7_9CETA Hemoglobin subunit beta (Fragment) OS=Bos mutus OX=72004 GN=M91_21159 PE=3 SV=1
-YLTLEKKATVIDLWSKM--RVAEVGPDTVgrqvFKLLVVYPSTQRFFDYFgDCPLLIygqCFTffvsrhrfllfilvflCFKEDKMMYCFLKQFKKIKK------MIAKRNISK---------YKLRLIWVASHQYFGKEFTPEFQAACQKVVAGVVNALTYKYH
>tr|A0A2Y9DG99|A0A2Y9DG99_TRIMA myoglobin OS=Trichechus manatus latirostris OX=127582 GN=LOC101351845 PE=4 SV=1
MALSDGEWQLVLNVWGKVEADIAGHGLEVLISLFKGHPETLEKFDKFkHLKseeemKACEDLKKHGVTVLTALGGILKKKGHHQAEIQPLAQSHATKHKIPVKYLEFISEAIIHVLQSKHPGDFGADAQGAMSKALELFRNAMAANYK
>tr|M3YM80|M3YM80_MUSPF Myoglobin OS=Mustela putorius furo OX=9669 GN=MB PE=3 SV=1
MGLSDGEWQLVLNVWGKVEADLAGHGQAVLISLCQGLESRKEEKKRDpAHAcvssrrslFVSQDLLFHSDAFLVSLGHRSFLapvSGENGQSQKTQPAHHAQHHRQPWNTEKFISDAIIQVLQSKHAGDFGAEAQAAMKKALELFRNDIAAKYK
>tr|A0A1C4HDU6|A0A1C4HDU6_PROAN Myoglobin (Fragment) OS=Protopterus annectens OX=7888 GN=Mb6b PE=2 SV=1
-------MACPAKFWEEnVVPDAAEHGKNILIRLYKEDPAAQGFFSKYkDTPvselGNNADVKEQGAVVVKALGELLKLKGQHESQLHAMAESHKNTYKIPVEYFPKIFKITDAYLHEKVGAVYA-AIQAAMNVAFDQIADGLKTQYQ
>tr|Q9Y0D5|Q9Y0D5_MYXGL Hemoglobin OS=Myxine glutinosa GN=Hb PE=2 SV=1
-RTTEGERAAVRASWAVLMKDYEHAGVQILDKFFKANPAAKPFFTKMkDLHtledlASSADARWHVERIIQAVNFAVINIEDREklsNKFVKLSQDHIEEFHVtDPQYFMILSQTILDEVEKR-NGGLSGEGKSGWHKVMTIICKMLKSKY-
>ERR1711977_634702
--WTDAERAAISSVWGKID--VGEIGPQALGRLLIVYPWTQRHFSSFgNLSTpaailGNPKVAAHGKTVMAGLERAVKNMDDIKSAYSDLSRCTPRSCMWIPTTSGSWLNAspcvwlpsldvrPSTLMSRRpGRSSWLwssppwadsTTEGLKTHHNQIICSSFL-----
>tr|Q9U6L6|Q9U6L6_MYXGL Hemoglobin OS=Myxine glutinosa OX=7769 GN=Hb PE=2 SV=1
-TLSEGDKKAIRESWPQIYKNFEQNSLAVLLEFLKKFPKAQDSFPKFsakkSHLEQDPAVKLQAEVIINAVNHTIGLMDKEaamKKYLKDLSTKHSTEFQVNPDMFKELSAVFVSTMGGK----------AAYEKLFSIIATLLRSTYD
>ERR1719474_978995
---------------------------------LLQSSWKQ--FRT----------------------------------FASLSGIRQEELGAGCQHQDLP----------QIQHHLWISEPSTFQQL-------------
>ERR1719336_830457
-----------------------------------------------------------------------------SINPQSTVDLGAQYISATPLNYKNHQDIYNSLLSNG------VLVPANVSLI-------------
>tr|B7QI99|B7QI99_IXOSC Globin, putative OS=Ixodes scapularis OX=6945 GN=8041668 PE=3 SV=1
-GLTTSDKCAIKDTWTMFRRETRTNALSLFVALFSRYPEYQKMFPNFADvalkdMMQCPSLTAHALTVIYALASIIESIDDENtmvELIKKNIRNHV-RRSVTPEHFVNINNLLIEVMQVKLRSRMTASVIVSWKKFFAMHDAVTRQTY-
>tr|A0A1W0WKD0|A0A1W0WKD0_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_10224 PE=3 SV=1
-GLTSNHIKAVRANWKLIEKRLPEYGLELFVAYLNKHPDWIGLLPFLKPadmprLQQTPRLKAHGTIVLKKLGELLTMLDSPPkliGELLKQGSTHR-ARGLAPENFQAIQHDLNELFVKICGPE---FDIEGWDAVLTLIMTGIEEGL-
>tr|T1KR38|T1KR38_TETUR Uncharacterized protein OS=Tetranychus urticae OX=32264 GN=107366531 PE=3 SV=1
-LLSDDEVKVIQSIWSSVMKDANTHGMNFFLKFFRENPTFQERFASLRNlkteeEMkASKRLKAHAASVFHAITALVDNLDDLEcvsDMLEKIAANHL-RRKVNWPFFDRIALCIVAFLSETLGTqIMDSKATTAWTKVLNVITETVKRVE-
>tr|A0A2N8ZEM6|A0A2N8ZEM6_9VIBR Globin OS=Vibrio tapetis subsp. tapetis OX=1671868 GN=VTAP4600_A2359 PE=3 SV=1
--LSEQQIYLVQECYRQVEESPHEFAKHYYGKLFELEPRLQALFRN-DLD-------IQGRKLIAMLEVAVNGVKDMGMLVPMltqLTQLahrHN-DYNVKKSHFSLLNTALHHAFEQHLQQAYTDEHRQAWQTLLDFMVDTMK----
>tr|A0A1I0MYA2|A0A1I0MYA2_9RHOB Hemoglobin-like flavoprotein OS=Cognatiyoonia koreensis OX=364200 GN=SAMN04488515_0317 PE=3 SV=1
--LSQTQVDLIRTSAEVLAEANVAATNVFYANLFRVAPGVRNLFSE-DMF-------EQSEKLWNTIVKVVESARDLTEIEADLHALgarHV-HYGAEPGHYVVVTDVLIQTISSMMEDKWTDETQAAWKTALEAVCATML----
>tr|A0A146Z291|A0A146Z291_FUNHE Hemoglobin subunit epsilon (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
---SYHYLIIITSIFSNLY--YNYFFPNSLIIFLIFYPFTHIYFSNFFNLYNSYsintnpNIQSHFTNFLHFLYLSFNNIYNINFTYSYFIFLHSYNLHFYPYNFNLLSYFFTIFISSNIFSVIKE----------------------
>tr|H9GUN8|H9GUN8_ANOCA Uncharacterized protein OS=Anolis carolinensis GN=LOC103282340 PE=3 SV=1
-KMTDLDRRHIREIWTAAFENPEENGRLVIIRFFSDYPASKQYFKTVPTDGdlkAHPQVAFHGRRIMVAFSQVIENMENWNQACVLLErlvNNHKNIHQVPSGMFQLLFQAMLCTFDDLLGRTFTPEKRVSWEKFFQVIQEEVEAAY-
>tr|C3YSB7|C3YSB7_BRAFL Uncharacterized protein OS=Branchiostoma floridae OX=7739 GN=BRAFLDRAFT_96956 PE=3 SV=1
TGLTANQIQLIRDTWQIVYKNKRENCFAIFRILFTDHPSTKSLFRLMDAVdldvpgefEKNVAARAHMVRFMHSFATFMDTLDEPAELRQLLYDLgknH-AKHQVGPELFDALGPILMKALPIVLDGKFTPEVKTAWLTAYTFMSTHLK----
>UPI000197D711 status=active
AGLTPKDIYEAKQCWNKAASlGVNKVGVLLFKNIFTIAPEAAKAFSFGNDPnfMNNKEMEEHGVKVVMAFDHAVRSLDNIHAlqeTADGLRDTHS-FFNLSPEHHVIVKEALLQTLKQGLGDEFTDAQRELWNGIYTAIRNMWVG---
>KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold119418_1 # 1 # 498 # 1 # ID=119418_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510
EQISPLKLRLVQSSWRQAS-ADEQAGITAFKFFFEMEPVAIGMFGLQDIRdlYNSYELKRIAAKIVKAMTHIVNSFDNFEGlrpLIKKLGMMHG-EKGVSPSQYNNFGKAFMQTVEEILGDQFTPETRRAWETFFRILTGALQR---
>SaaInl8_100m_RNA_FD_contig_91_216993_length_256_multi_18_in_0_out_0_1 # 1 # 255 # 1 # ID=160783_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459
NLLPKNTILQVQTSLQKVLQTTKTISPIFYAQLFEIDPSTRPLFSTEND----QQLKQQETKFTLMLSAIVNSLTNLDSlipVLQDLGKKHL-NYKVQKSHYETFGIALLSTFALILADDFTQETKKAWEDTYGLIASIITE---
>tr|A0A091DYW0|A0A091DYW0_FUKDA Cytoglobin OS=Fukomys damarensis GN=H920_02872 PE=3 SV=1
-PPHEGGSCATPLPWGNRDLGPWACVRPDLCRFFVNFPSAKQYFSQFRHmedpleMERSPQLRKHACRVMGALNTVVENLHDPDKvssVLALVGKAHALKHKVEPVYFKTISGVILELIAEECANDFPPEAQRAWAKLRGLIYSHVTAA--
>WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1887876_1 # 1 # 366 # -1 # ID=1887876_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459
-PVSDENKDILRESWKRLEEEKTTLCKNVFIRLLQLNPNLQDTFPSFkgvalDELMNSRSLFLHSKRLMEALEIAISSLDDGQDFTEYLTHLGErHtAISITENHFKIMEKALIFALKDMLGESCTEDVANAWREFFQSMAGT------
>tr|A0A2E1AIS1|A0A2E1AIS1_9CHLR Uncharacterized protein OS=Anaerolineaceae bacterium OX=2024896 GN=CL607_22355 PE=3 SV=1
SPVTSRQKLLL--HYTLLHLDADQMGKLFYDHILAAMPEVAPMFTD---------LESQRKHFMKMMIRIVHTIDEPDHLNIVLRELghiHK-RLHLKPRHFSKMGVAFSNSLAEVMGDRYTPEIGEAWRILYNRVAEAMQS---
>APLak6261659701_1056019.scaffolds.fasta_scaffold514158_1 # 3 # 230 # 1 # ID=514158_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
IELNAKNKALVKEGWKLLIETQFPnevggneralarFFDEFYRKFFEVNPSGKRLFEEGGM-------AVQSKALVKMMSMVVTSLENPSNLDLTIERLggrHE-LYGVSRSDYLAFTNAMCETLETVLGDKCNQEMKESWSLVLNNLSEK------
>SRR6516164_1622129 
-SDDPRTEATRAghletggsrrrRGSRHVLPSAVR-----------NRPHHAQAIPRDR--------------------------------------YgraTQ-K----AAA--DVGL--rhRWPGX-------------------------------
>SRR6516225_5669596 
-VMTPEQKRLAScfrrggppGSWRRPSPPLGIETAQVFRIPCVLPN--AAVHTAGVSD-------HNNSDTYRAALRPAH---R----AASQTASvrnHE-RIQSETAM--REGL--rrvTYARVLRTGS-hRTPYrnVTP------------------
>SRR5215813_13307430 
-KSTPPRAsyfratdmaaqrkkllqtleqglgqawtPAVAs-AWSEVYRLLSGIMrnaAERVERLQNVWPAPFDAVIX------------------------------------------------------------------------------------------------
>SRR5262249_1440316 
-AMTPEQKRLVEd-TLKQMAASADAAAALFYCRLFEIDPTTRKLLPQTARA-------ATRLGCGIPQLLTDIFAVR----YAAHADFgtfSE-GTHGHSDL--EAGY--hrrlVX----------------------------------
>SRR5260370_32836152 
-SDDPRT-EATRaGHLETSGSRRRRGGRHVLPSVVRNRPTTRTLFRATDMV-------AQRKKLLQTLAFAIGGLDNLDALGSKVEDLgrrHA-GYGVTDAQYDSVGAALLWTLEQGLHH-pPWPRRGPKTTDC-------------
>SRR3989338_1269240 
MDFNDEEIDIIKDTWDAVLYPey---PEEGFNPVLNFSTKFYRRVFEHENckNLFEEVDMTSQGEKLVKILSVLLVAVQTkslnqdHIHVLRKMGERHRG-YGVSDDMYEIIGGCLLRTLSEVCADVWDDDAKVVWAKLFGVVSE-------
>SRR6516164_9760095 
IVTTPQQVQLVKQSFAKTTPIAEQAAGLFYGRLFETAPQLRPLFKG--------DIKTQGRKLMSTIALAVGSLQKLPELVPIVQDLgrrYV-GYGVKDDQLRYRRRRAAVDARQGaRGRLHTRCEGRVDLGLYDPrrYDEERRSAA-
>SRR5690348_1420512 
------------------------------RHRAESAPAVSGRS------------HSAKKEADGDDLHDDRRTERFQKAGPGSQEPrraPC-RLWCDCGGLSIVGEALLWTLEQGLAAEFKPEVRSAWIKLYDMIATTMQAGA-
>SRR5437870_6238790 
FDVTPIQVDLIRASWAKVEPIQELAASLFYDRLDRKSTRLNSSHVAIS-------YAV---------FCLKKKKKKKEK------------YTHEHINNNKV----------------------------------------
>APAra7269096870_1048528.scaffolds.fasta_scaffold62442_1 # 1 # 438 # 1 # ID=62442_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.454
-IIQPSAVSIIQSSFEQIKPNAGRFTRVFYDRLFERDPSLKKLFIR--------DIREQRKKFFRMLGSIVKNLSNPDELEPKLQDLgsrHD-YYSVKREDYRTFFEAFIYTLAAALGNDFDENTRHAWRDFCDYVGAHMCKE--
>SRR5215472_6010456 
-------------------------------------------------------------------------------rMISGPLPDvitAT-ACGKS----TMPASAPLYCGH--LSKGS---------VSISRPMWAMIAV--
>SRR5262245_10239308 
-PENARPGNL-RHHYadrgrcsGSLLPEAvqaRSVAGRHVSRRHERAAEE--AAAdA--------DGRRQGARSA----RSGRGGRRGSRPAPRAIRRdrqAL-RHGRHGS---P------LGARGGTRARFTPSVKKAWATVYGLLATTMKNA--
>SRR5688572_4752169 
-RMNSQQIALVRQTCTEVAPIADSTAEIFYQKLFQLSPSMRSVFAP-G-------LRERGRHLMETVEAATQIMDHRGTMTSAFAELgsrQM-ALAAGNNRYEAVGAALILAFRQGLGPSFTPEARQAWIALFDYIDETMKAD--
>SRR5689334_18520770 
TSMTPDDIALVQESWRKIEPVKEIAAELFYTRLFELDPPLRIVCGD--------DMKDRRKRFTQVVGATVRGLARVDMLLPAVREFgmrHP-LPGEIEQHHANVAGALLWMLEKALRKEFTPEVKAAWIKAYGMLSQTIRQT--
>SRR5215207_7597532 
QTMTRDQIRLVQASFRNVLPIRELAAALFYDRLFEIDPGTRGLFVDT-------DLRSQGGKLMAAIGMVVHALDAPESMVEKLKELarrHV-NYRQLQESSPPDFHRLhrfgsgrgsqRHVVSKGPGVAPVGQ----HVVPTHFASRvsrRLRAC--
>ERR1700730_6579985 
-RQRLADDGVILRVLQRGLGIELEMEALAREEIGELDPDAarfRPHHAV--------GGGEVGGRHIELLRRHVDQRPpcHAAANGSARISLprgHV-SYGAKPRHYPVVGAALLWTLEKGLGDGWTPEVADAWLTAYSTLSGYMISE--
>SRR5262249_2898310 
-ILTADEIERVRNSFDQVWAISARTAELFYGRLSAGNLFAHAPSEA--------ERDDKRQKFMLTLAVVVASLDERADMDSLSERLaqaHT-EAGVRPEPASELREALFWSLEQALGPVWTPAVDAAWRKAYRRLSERMVSI--
>SRR6516165_4200192 
-----AQ--------------------------------------S--------DLVDRGRA------YRLLGLADLVDRrnQAaagGLSLFhrrAV----------------------SAGGVAWADRVLDALSlylcgyelrwpQLDHALGRgavhpdacaSLLRE--
>ERR1700733_1486793 
-------------SQAHGGDIVDLyRDVRLVYRLFRRLPPAEQDAIpG--------DHRRGRLSRaAGRVAL------------APVRRAarrQ---------DRRREG-DVLELRRDGRGDDRRHVFHRDQElswlSDDV--PR-VVRD--
>SRR5580658_8437352 
---------TGAGKFESVQEYADSVVLLFYGRLFELAPPTRGMFKI--------GIPEQARKLMGTLTSLVDALDRFEELRQWLTDLgrrHV-EYKARALPGAGDGAHVGFRAGAGYRV------RPGDEDCVGAVAERGVCG--
>SRR5215831_4136876 
-KHDPPTDLARAEQLQVRCA------DRVKGRRSLLRPSLRDRSRGPA-------A--LPRKIIRAEGKVdgdANEDRQQSSSAQchFASCTptrRaaQ-GLRCLDGSLWGSGCCLLWTLEQGLGSAFTPEVKAAWSEAYRTLAGAMQEG--
>SRR5215469_10861266 
------------------------------------------------------------------LTGAPLTVHPVRDRSPQFSRIgspsgrHA-TARARGQWIRNNSAFRAMTLQQALGSEFTPNVRDAWVAYYQTPAAEMKA---
>tr|A0A1E4AHQ5|A0A1E4AHQ5_9BACT Uncharacterized protein OS=Cytophagaceae bacterium SCN 52-12 GN=ABS46_00305 PE=4 SV=1
-ACTQDQIRIVKKTWSFFRNmSPEFVGDVFYTKLFMDYPDLEKRYPR--------EAQKRYEDLIKMLNMVISRLDRPDELTWALteiANQPH-RIWVTPAHYQKVVSTLIWTLRKGLGNDWTAVVEDAWMSCIKMVESLNAAI--
>SRR5262245_55554356 
-CVTPEHRLLAQQAFATIQPLADELGLLFYSRLFELDGALRGLFKH--------DLANQAHSLMAMLQLTIEGLDAPEQFTRARTTWgyaTWTmGFSRTSTRLLRRPCSGRSSMRX------------------------------
>SRR5260221_10622870 
-IVNAAQQELVMTKAEGVVLMPGVTGVLLCALLISANPSFRPLFKS--------DMRIQGVKLMTMLAMVVYNLPEPGQVLPAIRDRseeHT-SELQSHSDFVCR--LLLLHX--------------------------------
>SRR5918994_240771 
-------------SWKGVAGRRDEIARAFYAVLFDRHPELRSLFAHTD-------MRAQYEKFALMVDEIVQLRTEPRQFVRSAVLLgqrHT-MYGVTRRLVIAPAIRL-DRFAATDSIGFATPSTSALQlllcpRETVRRSGVMS----
>ERR1700730_15638689 
-AMTPKQVALVQDSFAKVALTSEAAAVLFYNRLFDIAPQMKAMFPD--------DMVEQRRKLMSMLAGVVKGLANLEQVFAGRQRTgkaAC-QLRCEGG--ALSGGRRRVAVDAGEGsGGWLDAGSGGcVGHRlWHAVRLHDFPS--
>SRR5258706_7695680 
-RHDPPPdpadPPVLRPA----RVQGRETRHLDVQAPVPARPRPTPAVQ-------------------------------------------------------------------------------------------------
>tr|A0A1W2GRB7|A0A1W2GRB7_9BACT Hemoglobin-like flavoprotein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4043 PE=3 SV=1
-----RELMLVKSCWQTVAPNAIPLAMKFYDDLFEAKPEYRRLFSGD-------M-NKQAEKLMMTLGFLMANVDRVDKIKDAIHKLgalHV-KFKVLPEYYPPVQKALVGAIAQFMDNQWSYEHEDAWNKLISAVGDMMIEGT-
>tr|A0A0N0UYC0|A0A0N0UYC0_9BACT Uncharacterized protein OS=bacterium 336/3 OX=1664068 GN=AD998_10010 PE=3 SV=1
-----EQKEIIKSSFPRVLIHTLKNSTIVYEKLFMDIPEAKDLFKNT-------SIDKQGQMLVAAIGKIVKGLDNPDIFEKDLVELatrHV-GYGLKPEYFTHFGNALINMFEVSLVDSWDKDLHDAWVAVYQEVAEIMKSVI-
>SRR6185312_354929 
--MVR--A-----------RGSAkC--WKCRWR--------------D-------RA--SVSnSLPAPATSSAGSACSNFS--------mngTA---SSkQPefDRVPRGGrgrgrrrKMTpeqVSLVQqsfakvapiseqaavlFYD-RL-FevapavkamfpadmteqrkKLM----------GTLAV-V---
>SRR6201981_618659 
-ERHD--T-----------GGGQpRDAELFQDR--------------A-------DCGQGGGdLLRPPVRDRAAGQIVVSIRHGGAPGQadgDA-DRRGrRSyqSSLDPARgerarq--TpcqLRRQGgalsgrrcrvavdAGE-GTWRgldarcrgcmegglrnpVRLHDL----RGLRQ--------
>APLak6261666328_1056055.scaffolds.fasta_scaffold241778_1 # 2 # 196 # 1 # ID=241778_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.415
-AKTAGGL---NLLFL--AIVSS----EPENGFVTISPAAKDLFP-A-------DLTEQRKKLIATLAIVVNRLSNLQSILPAARTLtkrHV-NYGAKPEHYPVVGSAVLH-AGgrPRLGLDARSRLrsdGCVWHAVRLDDgrnleHEFANL---
>SRR5919197_1191720 
-VLTRDQADIVQLTWRAVLPVGDTFAELFYGRLFALDPQLRRLFRE--------NLVEQGRNLTAMLSVAAANLARPEKISVALRQLgrrPT-RSSRARCSRSLLRDLLRLPLDARRA--VADGVARVVVafaRAVVAIP-RVIHG--
>tr|A0A1I1PNT6|A0A1I1PNT6_9RHOB Nitric oxide dioxygenase OS=Tropicimonas isoalkanivorans GN=SAMN04488094_11525 PE=3 SV=1
MPPSQQELARVKQSFEDLRPHHEPTSYDFYEELFARAPELRQLFRD--------DLKGQGMRFMNTLGLVLDDMTNPNGTtvdYAELGHLHT-TLGVRQAHFEPMEDALMASLGKKLGNEFTADLEEAWRNAFRAFSKKLIEA--
>SRR5262249_25899110 
-MMNTQHIARIRLSFAWIAPSADVFGELFVANLRALDPSLSGLLAA--------EAGPQGWQLISILRSIIGGRDRPDRLFWRLQSFgrrLA-GDGLCAEDYDTIGDALMLTLEQCLGERLTPDVAAAWDATYAALAEVVQL---
>SRR3954451_4172984 
-XMTPEHIHTVQSSWSKVLPVGNGQARLLFERLLQSEASLWGLFQL--------DAATWSANLVQMIDVLVTGLSLGDRRAVltrRIGGRNT-ACPAIEHHYDLIGTALLRTLAKPLRAEFPPSVEAECPPFY------------
>SRR5215470_15672373 
-LMTPEQIALVQSSFERVGPELPALATRFYQELFGRDPALRPLFTT--------DMTLQKVRFAEKLTEIVRAFSRIPARSAPGTSAtgyGS-LTTR--PsAKHSSPR-SLPFSATASTArparrGAS--PTTWWPRPCSRVRQRLGV---
>SRR6266566_5437046 
-DLTPENCDFMTEHHDL----------RILGRLVATE---------------------------------------------------Q-EQPVKDPDHDQIeeatrhrprscPTLFIWPNRRSQPLhrvlmRYMPvpgpRSPPSWCGPPSRSRSHGPR---
>SRR5215203_7560530 
RPMTPDQVSLVRDARRAIESRHAEFSAAFHDALHELDVDTCALFRDTV-------TGGRACNVGAMLDLLQQASDDPRALIEVAAELgraHA-HAGVRDVHHHVAGVALHRALHRVLGVEFTPAMYEAWAEAFTLLIAVMERAA-
>SRR5580658_533798 
-XMHSIMIGHLRDSVSLLPMEDLRPVHEFYRRLFELAPEAQPLFTR--------EAGQQAKKFSDMLAWVIAHLEHADELRKEMRELgarHR-GYGVTADQYASVGSALIWMFQHALGDRFTPEMEEAWLEVFAFISLEAERGA-
>tr|A0A1D8RRN7|A0A1D8RRN7_9GAMM Uncharacterized protein OS=Colwellia sp. PAMC 20917 GN=A3Q34_02175 PE=4 SV=1
--MTAKQINLVQQSWQKVLILSPDVGDLFYQQLFVLRPELATLLKN--------DKQdKirANKDFICLLSQEINLLQPIELTEEKV-nTSVT-TNDV-KNYQADVENALLLALTMILDKELKIALKRAWISTIKRLVGSIVIEL-
>SRR5262249_21459549 
IGQKREPPTVERRHREQVEEAQEDGKIGDD------------------A-------QRLARALLDLFAELVGDLDGPRH---------V-GFLX------------------------------------------------
>APDOM4702015191_1054821.scaffolds.fasta_scaffold152199_1 # 3 # 686 # -1 # ID=152199_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.531
-------------------------------------------MSG--------DFSPEQKRYLEGFTS------GLQ--------IartGR-GLG-KPAASVPSGPD-----AEHLIAQDQT----------------------
>MesohylFT_1024984.scaffolds.fasta_scaffold1796824_1 # 3 # 146 # -1 # ID=1796824_1;partial=10;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.340
EELNFQEIAIVKDTFALVEPHGSKFAQDFYDKFFTMSPEVTSLFAN--------VDRDHSSKMiWNALMLIVYNLENKQQLQNTLFGLgrrHM-NYGVSSHHYLSMGEAIMATLQSYLEanQSWNEEVAAAWERAYNLVSRRMQKG--
>SRR3569623_2148552 
--ISYGTVMQVTLSWDKFKQVQnfqERAGELIFERLFELEPQLRAQYKFSeD--ediKSNPAFASHARTMVDMIDMAVSFLGpDLDPLAEDLEDLgkrHI-AYGVNAVHLPVMEKAVVYAFEELLGDNFIKDDRNAWQVMFHFIITNMGKGM-
>SRR5450759_1049036 
-ALTAEaPYSELKnlCVWSKT--------NAGMGSLYRSQHELVFVF-kN--------GMrphinnvelgrfgrnrtniwnyagassfGstrdselamHPTVKPLSLVADAIlDCSKRGgivldafagsGTTLIAAEKTgrr---GYGTELDPFYADT----------------------ivrrFEDAYGL-KAVHVE---
>SRR5210317_1560035 
-----------------XMTSL----KSSMIGFFRNHQNCAKMFGE--------DMRDQAQKLAAILQVAFDNLDHVDSLVPILEDVgakHA-TYAVTPEHYGLVAAALIGTISTELGDAFDERAAESFEAVLGTVANVMISG--
>ERR1719240_1900674
----------------AVARvLVHGL-ANLHRRALERLDLLLELVDAhRVVVlrllHRLdgrldrlHVLRRHLVLVLE------EG------------LLgavHR-RVGLILH----------LHLRLAIGVRRGE----------------------
>ERR1044072_2403146 
-VLTEEHKKALRHSWRLLEPLGETVSDLFYRRLFEIRPDLRILFPP--------DMAAQKRKLLVMLMFIVKAMDWPIedwaaeidpenDLLlvvLALVRRHSHLYQVTSEHYAPVGEALVWTLEQALGQGFEGAPQKTTGPVCVLGCSPWG----
>SRR5437899_2276119 
------------------YPAVQKSGAAVYRPALVAELRDRPY-EF--------DIQVQLCVYLARMA---------LEIVAALN--AA-GWICVPKDPSPEM------LKAAWAYALDEDAAGVWKSMIAA----------
>ERR1700757_2961956 
-------------------------------------------------------------RFNRLAGRERRAPARTR-----ARQSr-----QRPGPSRHDPTrLALSD----------VSEAERTDIVVS------------
>SRR4029078_1694892 
------------------RNFnPVVIGDSFYSKLFSLKHSLRRMFPG--------VMHEHYLQLVKLLNLIIAALDQPGQLEEefeILARKHR-HYGLTSSHYELFEEAMLWTIERALGKDCNKPIVSRWKTCYLALVRRTIAA--
>tr|A0A1H4HXI9|A0A1H4HXI9_9BURK Adenylate cyclase, class 3 OS=Variovorax sp. YR216 GN=SAMN05444680_12751 PE=3 SV=1
---APDSVLLVQSTIGVLLQHQKRFTQDLYRRLFALAPAAEGLFR-GDM-------DSQGQMLSHMMQFLVHAMSRPEIMALGLRDLgrrHD-GYGVAAEYYPAFRQAFLESARGILDERYTAQVEKAWAETIDMIIESMRGP--
>SRR5687768_10564074 
-RMTPQQTQLVKRSFWIAEGRRTQLAGCFLAELFARDPALWRLFSS--------DPALRRDKLHHAVAGFVASIDRLHPIVPVLEWLafhGA-RHGIGERQHVAIADAFLAAMETVLGESFTPAHRQAWWLACRSVIDVMVHA--
>UPI0004291969 status=active
--KQSDTVFLVQSTLEKVFPQLDEFTNQFFKKFYELDPSVKEIFYEIDA-------KNKKQMVVNMIGFLTQGINRFDVIIPSIKEInerHF-GREVKPKYYLIASKALVNVLEDYLGEDFTPEVKQTWIEFYEQIVNFME----
>tr|A0A2D6RHV2|A0A2D6RHV2_9GAMM Methyl-accepting chemotaxis protein (Fragment) OS=Colwelliaceae bacterium OX=2026726 GN=CL811_09640 PE=4 SV=1
--MTPKQNIAVIESWKKVQPIASQVSQVFYDDLCEKHPSLKALLGE--------ELSSARDQLVAYLNSLVETLVATDEVViEDL-AKH-LRIGLAPEQFSDVGPALLTSLEIGLEKDFTATVKRAWTALNKLIVAAMAQ---
>SRR5215469_12962076 
--------------------------------------------------------------SLSARAGRQAGFGl------SG--------LGSAAT--taiPTPSTSLTGSTARTTG--cSAPYSR-----TGT-----------
>SRR5205807_5077868 
----------------------RVGHGRVYPRLYIIARHAAGIYAlT--------RPVAKPgRPRPVCLVPIHKDIA--VMRVTTDQLLartPL-GrFGEAAevgqlVHYLVSDAA------RFVS-GATVTIDGAWTAYGGWALR-------
>ERR1719223_615602
-MTDKSSSQRVLDSWNAIKSIPnykEVAGVLLFRRIFALAPEAHGLFRFTNGFepnseelFESERLIEHGKGVIATLEAAIDMLGpasDLNPLICFLQELganHQ-RYGVLHDHYPIVGEALIETLSAAMGDKFTDDIKLAWEEIYGIIESNMIDG--
>SRR3954469_4757651 
-SMTEVSVQRLAENYQLLAGRMAALTATFYERLFEAMPSVRPLFKI--------DIALQSQHLAAARALIVRNVRHLDALEEPLTELgvhHA-KVGVRPEQSPPLCRVMIETLRDGSGDRWSPQLESDWTPVLEMVSRIMMAG--
>SRR6516165_10653891 
-EPSPNQLHQNRPD------RRPGGGTLLWPPLRDGSR-NPGAVLQ--------RRGRTGSEANGRSCNRCEQSRRFRGDRPHRTRS-C-KAPRRPEHYALVGSALLWTLEQGLGDEFTPALRAAWAAAYCALSEVMIA---
>tr|A0A210QIU4|A0A210QIU4_MIZYE Neuroglobin OS=Mizuhopecten yessoensis OX=6573 GN=KP79_PYT10777 PE=3 SV=1
-YLTSEQVRLVKQSWLILGEDMAATGLLVFKKLFESNEGMKKLFYKLmRCDSseqlefDQEKLTRHATIVMQGLGAAVESLEDSVfltNVLIAMGERHA-MYNVKTEMVPHLWPAIRDAFKELMGEDFLPAVESAWLHVFEYIGSKFKMG--
>SRR3954465_7515966 
--------------------RGRAVGPSCYAPVSPLHPATSRLCSA--------DLLAAGVRLVDELVSLAVAAGDLATFTDRARAVgmrCC-ACGVVAADYPAFGDALVAAVAEVVGPDWTTAAADAWRRLYTLMSETVLEG--
>SRR5215207_9441599 
----PEQLALVRGTASIIDAVGDSFAERFDDHLFARYPAARRLFPD--------DTTTHRGQLTDEIVFLVAAAADLHALLERARALgapPP-LRRtrrrlparrrgTRRRGRGRRGRSVVGRNG---G-SLA-----------------------
>SRR2546430_16462751 
-----------------------------------------------------------------------------FLLSVVIA--CS-CWCRHVSSlqhdrad-------HPVGLCPGIVADWSPALSQNVGEGFQQDCSD-dG----
>SRR5271166_2850757 
-RWMRPKRNSCARPSPKSRRSPIKAGAMLYEKMFALDPDLRRLFAI--------DIETQGAKLMAVFATAIANLHRLDEILPTVRELgrrHV-AFGVKDRDYDTGGVALVQTLEAGLGDAFTPAVRDAWMACYEAITGEMKA---
>ERR1711915_528574
TAFTEEQEALVKKSWNAMKPNASELGFRFFLRVFEIAPSAKRLFSFLhDSdvpIEKNAKLKAHAITVFKMTCESAVQLREKGTPtfsesnVKDLGKSHF-KYGVVDEHFDVVKFCLLETIKDAVPDIWSLEMKTAWDEAYTQLAEAIKSEM-
>ERR1719460_671936
-MVDAVVKGDVQRTWELVIPPDSgddhvfAIGKLFFDRIFEVTPGAEALFSFKGEdRAESAKFRAHAIKVIKTVGVAVAKLDDLETLVPILEDLgkkHV-AYGVVASTTT----SSVWRCCGRSRRGWATNSRPTW----------------
>SRR6266567_6698575 
---------------------LIVFTSTCLWSI----RKPNHSLPKRI-------CVVKLAHCWLHLTTVVAGVLREDNLVPVLQQLgqrHK-SYGVKAEYYPFFRAVLLETFQHYLGPRFTPKMQQAWEEAFEMISTQMLKGA-
>SRR5688572_5289639 
-TVTPDRQQLIRDSWRALEPNGPRLVELAFLHLLQIAPAARPLMTGH-------SLPCVCRNVASILDQLIAALDEPKQFVPLAIGLgrsNP-GHGINAALYPAMGEALLWALHLQLGEGLTPELQTAWLEYHHLVSAIMRRA--
>SRR5262245_22087501 
-LMTPERQRLVHDSWRTLEPNGTRLVELAVLHLVSIAPSVRSRLDGA-------TLPLVCQHIAGMLGRLVETLDEPKQFVPLAISLgreNP-DRGLTAKLYPAMGEALIFALHLQLGDAFTHELQTAWLEFDRLVCAIMQRG--
>ERR1711916_36627
----LELFKILGILWFLLLMSLRNCSIIDYLR----------------------NI--LKLRLCSLKTCNFKKLNSX-----------------------------------------------------------------
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9902871_2 # 1417 # 1767 # -1 # ID=9902871_2;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.538
----ALDTKLIKDSFELAKPISDKLVKRFYENLYSDYPQSKSLYLDG-------QLPESQLAILKAINFIVDNLHNKEKLGTFLKTLnerYE-LRLNDSVINQSVCSSFLKTLSEAFGSDWTSELAEQWELTYQMVTSFFQDSK-
>1185.fasta_scaffold1192548_1 # 3 # 452 # -1 # ID=1192548_1;partial=10;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.684
-TVTPEQIDLVERSVTELTPIMDEVVADFYTGLFAADPAIETLFAGAGAGaggahGQGDGFAVQRAKFAAQLADILTAVRDHERFLATAAdlgARHR-GYGVHAAHYTLVGRALLDALARHLGDRWTPATADAWRLAYNLTAEAMMA---
>tr|A0A1Y5RHX9|A0A1Y5RHX9_9RHOB Flavohemoprotein OS=Palleronia marisminoris GN=hmp PE=3 SV=1
--MPNDDMRLIQPSIARIFVVRRSIGQAFYERLFERQPTFRTMFPT--------DLRTQARTFDDMIALIVKKTGDPEAVTPVllaIGRRYL-TYGLRPQDLRVIGEVLMEVLCAQTPGGLSPDEAAAWERSFSRAAEVVKL---
>DeetaT_11_FD_k123_441726_1 # 2 # 373 # 1 # ID=403715_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.481
-GLTDLQIEMIRSSWEKVTPNKKHHGQLLFHKLFEIAPEMTDLFPFG-DDFTKPQFTTHALNIMNALDHAIQNLDNPDVLIPKLRELgqmHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
>AP82_1055514.scaffolds.fasta_scaffold664619_1 # 53 # 358 # 1 # ID=664619_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.458
--MSGFALRLVLTQRQKATrkrpiaqyvieNHSINFAFHYIDRLFEIAPEMTDLFPFG-DDFTKPQFTTHALNIMNALDHAIQNLDNPDVLIPKLRELgqmHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
>SRR6187402_963757 
-------KLHIQNSWLKLG-YSADMITDFYNQLFLLYPRLRPLFKE--------DIRLQARKFTAHITYLINHINDWNRLQRDLDELgkrHV-HYEIKVEYFEYVKEALFPTMRKHMG---------------------------
>tr|A0A1C4TW82|A0A1C4TW82_9ACTN NAD(P)H-flavin reductase OS=Micromonospora haikouensis OX=686309 GN=GA0070558_10167 PE=4 SV=1
-----AVSADLGPSWAATAAAVDRAAANFLDTVSDRLPGLLP--------------ERDHTVVFAALGRLAGGVDDTAGRAAALAVLaraHR-GVGLLPQHADLLGDALLAAVARENRAHWTAALATGWERGLRRAVTAVRRA--
>tr|D6Z7Y9|D6Z7Y9_SEGRD Oxidoreductase FAD-binding domain protein OS=Segniliparus rotundus (strain ATCC BAA-972 / CDC 1076 / CIP 108378 / DSM 44985 / J
-----TDQ-gAAARLLEAVAADPVVFVRSFHVELFRCAPELAERFPS--------GLGGHHAAFVTMTKHILQGFAdgsDPPALIDLLGQLgrdHR-KYQLGEEHYRAAKTALAKALADAARSTRDNE---FCAQAAALVCAVMEQE--
>tr|A0A1Q9NIM3|A0A1Q9NIM3_9ARCH Bacterial hemoglobin OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=vhb_2 PE=4 SV=1
-SLNTKDIQLIKNSWEKLTENKKEVRNTFYTGMFEDDPKLKSLFRE--------SFLSWD-NLPDSFEFMFKHLENLEGEILEMKRLglkHK-TFSVKPKHFPIGRKSLVKTIKQYMGDKYTEELGAAWTKLFDYMSHYMILG--
>ERR1711911_167941
TGLTVRQKRIIAKNWDLVRPNLKEAGAFQVVRDGAT--ERVGRQP-QaAGPrrq---HHVQHDDAGRL-----------AQRCGVSGAAPGHH-RPqspssALETAPFSGEPQFILRR---------------------------------
>ERR1719223_727152
--PSSAQVDAVTASWDKVAALgAETVGVLLFKRIFEIAPALESELSEKPTaiIIGDLTLAREMT----EEEKETIDLEEKEEPeeveekeEPEEVDEqetTE-GRIISTESF-------------------------------------------
>ERR1711871_830988
--------FFFFFFW---------RPPFFFFFLLLRVSSFLPLFVASLPPperlfKVGSPLVAYGATVVRALNVAIGLLTDLPTLVPVLKTAlpsL--FPGAQKEHYGIVGQAALNSLAIALGRYWKEPVKNAWLKIWNTVVAVVFS---
>ERR1712232_1508017
-PLDGRDIALVQTTLGMVAKLGlNTVGKVIFLKVLKLNPNAAQLFTWGKMDaalmwKDGSPAVAHSIKVVQTTATAIGLLTDLDTLVPILQTLgvqHNGspmlpdaygGKGVIPKELDVFAGAVLEALAVALGANFTEPVKNAWIKVYTTADGVMKA---
>SRR5882757_3847967 
-----------------------TSI--------------WPIIIN--------TaVGirnipQDYRNVARVLRLnqFEF-FTKimVPAAAPYIFTGl-------------RIGIGLSWLAI--------------VAA--------------
>ERR1700737_3002051 
-----------------------RDF--------------HHLDLA--------DhHQ---------HRVagTQW-AN--GSMSNAVWTGv-------------RLKDVLDRAGV--------------KSGAI------------
>SRR3954451_23003713 
-----------------------LKS------------TTGEVFLE--------G--klv-DE-------PGpdRAI-VFQn-HSLLPWLTVYg-------------NVAIATDKVFGGSGARSKSKAERHDWVMHNLELVQM---A--
>SRR5206468_1650083 
-----------------------TNA------------TMGCVLLE--------N--rev-NS-------PGaaRRR-QGVc-ERQDPQRAQRmgdAqpqpradgacqgqA-PG-GDFRRYEAARRHCPRAGHATKSAAARRAVRRAGRADPRAPAGL------
>SRR5258705_633045 
-----------------------TSE------------DAGPVALG--------N--qev-KQ-------PRtqPPV-VFLd-PALPPRPPALd-------------HWLLRAARDAGGP------QPQ--------------------
>SRR5690606_21133184 
-----------------------INP------------LHGAVRLN--------D--aap-RV-------GDpeVGY-LLAr-DALLPWRTALr-------------NVTLPLEV---RGI----ERREREQSARKVLRDVGL---E--
>SRR5688500_4892119 
-----------------------QEP------------SEGEVQTF--------G--sra-QC-------PNphTVT-VQQa-YTCFPWLTALg-------------NVEFGLRV--QGK------RDNAREVATEYLHKVGL---G--
>SRR5699024_2544359 
-----------------------LSPSSGKIIVAFSSPTSGKIMMD--------V--ndwtSYKDSEMTALRLkeIGF-IFQe-SHLLPYLKIRe-------------QLEFVGREAGMDK-------KHARKRAKEILDLFGL---D--
>SRR3954447_21976298 
-----------------------RAA------------TGGVVRWS--------V--dplvAAG-----GRARhpLSM-VFQk-DTVLPWRTVAq-------------NVGLFYALN---RD----RRAGAEGVVDDLIRLAGL---E--
>ERR1719419_74415
-PFTPEQRTLINETWGNISTKEtgsmGMLAKQVYERLFRSAPGIKRLFKDSD-------MLAISRAFGGMLGVLVSAVNQPLQFQHIVKGLgvrHQ-VYGVKPDHFRIMYTSLVRTFAQILGDKFTSEHKKAWSCLYNWVIDAMQRSMR
>sp|Q8T7J9|GLB_YOLEI Globin OS=Yoldia eightsii PE=1 SV=1
MSFSAAQVDTVRSNWCSMTADIDAAGYRIFELLFQRNPDYQSKFKAFkGLAvsalKGNPNAEKHIRIVLGGLGRILGALNTPE-LDVIYKemaSNHK-PRGVMKQQFKDMGQAIVTALSEIQSKSGGSFDRATWEALFESVANGIGQYQ-
>sp|P0C227|GLB_NERAL Globin OS=Nerita albicilla PE=1 SV=1
KSLSADQKAAIKSSWAAFAADITGNGSNVLVQFFKDYPGDQSYFKKFdGKKpdelKGDAQLATHASQVFGSLNNMIDSMDDPDKMVGLLCknaSDHI-PRGVRQQQYKELFSTLMNYMQSLPGANVAGDTKAAWDKALNAMANIIDAEQ-
>ERR1719238_612722
------------------------------------------LDGE-------TKPKEDQ-----NLSNPWAATAVTAILIPNLRDLglrHC-RYGCRLEDYELGGKAFMMTIEHFMGDAVTPEVRAAWLWVYGVVQSVMVSM--
>tr|A0A0P6RCU1|A0A0P6RCU1_9RHOB Flavohemoprotein OS=Phaeobacter sp. 11ANDIMAR09 OX=1225647 GN=AN476_12305 PE=3 SV=1
---ASTCKALVLRSFESERMDLEAFIPLFYSNFFEAYPEARAIFPT--------DTERLEAKLLASLTHIAEALESSERLdgiLSELGQKHR-RMQISDSHFDGFIQSFIRSLATTLGPEWSDQSDEAWSQFLRYVAKRMSFLE-
>tr|B3SDK5|B3SDK5_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_62364 PE=3 SV=1
SYLNYQERQAIIDSWNAISTEKQKYGTILFLKLFELEPRVKSLFTIFDFNeplediIQSPHFRSHAMRFMQSLETGVLMGFD-kescDFLFKSLGSRHH-FYDLKSEFLDVIPECILHTIKKGCGNNWSNETADAWKIATKVLCELFREG--
>SRR6266700_8223772 
-FFLPFKE-LTEQHFSILGlRKARRAGLVLAQELFEHAPHVGARHSN--------AFGGRHPNAILAVEPFLRRAKNRDQP------DSG-AWSATSFHFGWNGGFX------------------------------------
>ERR1044072_5206314 
--MAPPQIAVARSTGPKVSPMQQRLAQVFYERLFELDPTTRAFFGGVD-------LRHHGLKLTETLSAGIEVLGRDGPAPRGS--------GSGMAALRDGGGCVVHGAGVLPGPRVHDRSPGGLVGGVLG----------
>SRR6516162_1975606 
-TGVSEQHLLDLGGVDILA----ATDDHVFDPA--GDLQIsavvqdAQVAGT--------YPAVRVDGFGGAFGHVEVAEHGLVAAcADlpg-LAGRHG-LSGDRI----------------------ANGHLDL-----------------
>SRR5947209_12860360 
--------------------LFSRQPRSAGQRLFTRFPQTRTLFAATDM-------LEQRKKLQQSLALIVEHMQHPEVLGDMLKGWtrgTS-PMVFDHSIIP-----------------WSEQ---------------------
>tr|V3ZYY7|V3ZYY7_LOTGI Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_167450 PE=3 SV=1
---------------------------------------------MDDNqesLKENYRFRCHVGLFCETIRIAVEEMREIEEVLLFLKDLgrkHR-MYGATPTYIKTAGEGIVYAIDRKLGNEFTRSMKTSWKKFFTILQDSILEG--
>SRR5438045_5489985 
-------LITRPTSYYLLSlhdaLPISLLADVFYSKLFVKNTGLRKMFP-A-------DLQLQRQKLMNMLHFIISNLDQPELFnkeIEGLGLRQD-RKSTRLNSSHLGISYAVFCLKK------------------------------
>tr|H2ZPV1|H2ZPV1_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1
MHFTDEELDLIRTSWGQVMKlGTKEVGIQIFTRLLNDAPKLRSHFYSIdiaDDEelslevmREKKKVVSHATRIAVAISKFVDFLDKPEELDSlltKLGESHA-RLQVDPGSFEYVAPVILAVIGGHLNLPSNSSTLQAWVKAYGVMRNGIVA---
>SRR3954451_6295623 
-------------XMSTLIKGSPHFSspysptgetDQVPEHLFRLDPSLRALFTRTD-------FVRQRRMLLNMIGVTVRGLDRLDGVVPTLRDLgrrHV-GYGVRPEHLSLSR------LNHWLPrGQADPEVMQGTADfhh--------------
>tr|V9ZVV7|V9ZVV7_AERHY Globin OS=Aeromonas hydrophila 4AK4 GN=AH4AK4_1427 PE=3 SV=1
--MTSEQIELVQRAWGKVTALNNTYVQEVYAELFRLSPELINLFPDPAG--------MPVAKVSDTLNTVITSLEQLDAlsfIIRDLGRRHQ-KFKVQSHQFDLLKQALTLVLARRLGEHFTPALSDAWSQMYDEIAALMLEG--
>SRR5580704_19412242 
----PDIAAFVRFASRFASES-SH-------SQMTIHATIVSQQRQ--------QIEMRTGFX-------------------------------------------------------------------------------
>ERR1700732_4531564 
----ASPNGRRNSARASmlISSQPIRRSPRFSATTW-----------------------WHRPRC-SCSLWVRSEVNRMEELgggLCALGERHV-DYGVKRADYNKLASVLIQTLKEFLVDEFTVELQHAWGTVD------------
>SRR5258708_12476517 
--------VLWEWLVDVGGARWRWFGGRLLEIFLETSPELRSLFHK--------DIAQETGMLEWMLGSLVKGLNRLLEIeggLRALGRRHR-DYKIDQADHEKVLRALLLTLAEFVGDDFTPQVSRAWKTVYGKIPDTMTDR--
>SRR6266699_2567678 
-AItkrrfqAAQAVVQIDDSFnPPDWYpDEHPPMPEIVARFFELAPDAQGLFRG--------DMERQYLKLMNMIAAIVGTLDKREMFksiIGRSGRQHA-QFGAKPLHFAAFGDALIWGLEQQFGAAFTPEMKEAWIKLYDDVQREMMC---
>SRR5690349_3556304 
-YLTGQQVLLLKKSFRQMN--PAQIAAQFYGTLFQQHPEVKSMFPA--------DTVELGSKLMSVFELVVFSFDEKEHgrfglqdvLikpLRALGRKHD-DKGVKPEYYEIANSLLLKIMKE--SEYFTTEMYQSWQLALEHLTYAMQDK--
>tr|A0A0S4IWR4|A0A0S4IWR4_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72665 PE=3 SV=1
-LVTVSSNELVQTSWSWVAHDMVGLGDMFYDQLFMIDSEIEHTlfAGT--------DMKRQAVRVMEMIDAAVQGLNTPETIAEVMFTSglrHA-AYGVQRDHYTVVGKALIAALKAFLARRFTPEVAQAWSVFYNGVQRRMLEG--
>SRR4051794_5741567 
-SMRPEQMQLDGLTLADATTDRLARGRDFYRRLSVPAPYLRGRCDG--------DVDAESAKLKETRTLALRMLGNMRFMVATLDAMakrDV-ARGLSEQHCRAIAQSLIWALERRLGAGFSRQVCTAWTEFLAVVMTCLHG---
>SRR5436853_3450426 
--------VLLKDSFNLVRSEEHTSELQSLRHLVCRLLLEKKKKnkTTTV-----NYIE---KEKLGKLEA-SCPVEQTI----GIGDKQR-DYQ--QMHHPERTEAQ-----KX-----------------------------
>tr|C7FFW0|C7FFW0_BRASE Extracellular tetra-domain globin (Fragment) OS=Branchipolynoe seepensis OX=326992 PE=3 SV=1
--VSDAQKAAIKASWAGAD--LQAAGTGFYVHLAAEAPAVYANFNLGADPH-GAKSQEQGLRVMKFVNQCVNSIDNMAIVQAKIDALahrHM-SYNVKKSDFVPAKPCFLGALADALgG-KFNADARAAWAGFYDIIAAGLST---
>ERR1719506_1011120
-PITAREGQIVQDSWKAVKKVGGESGHavikdIFYQ-HLLKDPNVKQLFRNS-------DMKLQATKLWQTLHVAVDGLSTSGPWFLCCRIWarlTS-STGSKRS------TSMPWVRRsSTrspraWGPRsrrssrWRGRKCTAWLLRRX-----------
>SRR5579862_1310240 
-LMDPLRIRMVQDSLVKLTPREGSIVDLFAAELSGSPHDESETGGD--------NIAYQrERSVLGIMAAAAPFLHAPECILDEVVAEIG-AGRIHPADYDHAANAFLRALKKNLGAEFTADLWEAWLEALWTLCNLLSRT--
>tr|A0A1E3GPU1|A0A1E3GPU1_9GAMM Bacterial hemoglobin OS=Methylophaga muralis GN=vhb PE=3 SV=1
-KLQEQDIALVEQNFAVLMEFSDALAERFYQRLFTEYPEIMPLFKSV-------TIEGQHKKLLASMVLLIQHLRDTEMIEDYLqglGARHQ-QYGVETSHFEMFIENWLSVVAEFADQKWDSKLQQAWRNVLEYVAELMQSPT-
>SRR5438034_562795 
------AVETLRNSFERVIERSPNLTRRFYEILFEKYPQTRRMFGL-Q------SGKGKGNGKGAGARQRLRRChcrlhfgkekaTVVPFPlpvPVPLPAFRD-SYX-------------------------------------------------
>SRR5688572_434377 
-PMDKERAHLVRDTWMVLTPRADEIAAAFYAHLFSLDPDAREMFAHVE-------MTAQGRKFLGMIGTLIRLLDDPADIVIetiPAARRHA-TYGVTGDHLDTGREALMRALERHVARRLHTCRSAGVGRAVRP----------
>SRR5205085_1772709 
-LMENRQAHRTSDRLQIELAAAQARIGLLYFAQHDRAPAARAMFST--------DIGVQSRKFSDMLEVLVEGLDDFDQKRPALRAMglrHV-AYGVVPAHYDTLATAFLWALGHMLYPEFSPEVKGAX----------------
>tr|A0A0N9QWL5|A0A0N9QWL5_9ANNE Intracellular single-domain globin (Fragment) OS=Eulagiscinae sp. JPG-2015 PE=2 SV=1
--VSDAQKALIKSSWAGVD--LNAAGVAFLNQMEQKAHDVYAVFKVGGGATSNPKAAALGLKVMTFVDEAVKGIDDMGAVGGKLDelaQRHT-KYGAKKAHFPVAGPCFLDALAEVCGGRFSADARAAWSDFYDVIAQHLSA---
>tr|A0A0P6AJ75|A0A0P6AJ75_9CRUS Globin OS=Daphnia magna PE=3 SV=1
--LKTVNVSAVQNTWAIVNKDLNTHAPHFYVALLTAHPEYQPMFPTIANVpagalLNNAALKTLSVNVLTKLSELIGCMGNPDALNAQLVDLanqHK-GRGTTRAHFDNLSKVLIDFLAAKLGGEFTPEARQAWTATMQGINTVVEA---
>ERR1719347_1330150
FCLSESNIKALKSCHPHLKDRKEEFGHLFYSNLFSNHPDLKSLFDQTEEG-----RQLQAQRLADTVVAFLEKCDDLPSLLPTFKKIgkrHT-TKGVKPEMYQIIIDNLVDTLEEMLGKeVFSAEVKQEVLESISFLSNAFIK---
>ERR1035437_6084348 
-SLDQEMIAIVQVSWENVTPDSRLAASMLAMNLCADDRNIASLFEE--------DRIKMSRDVMQAVSCIVADLDQPETLVPYFGSLgqlLR-RHGLHESGQQTFATALFLTLGQLLGPRYGPVEHNAWAIAYSFVVRIMIAE--
>SRR6185369_9977853 
------CGVPDPDHV--------RGGG-------TAQERSRRAFLPTA-------VRDRSR-----VPRAVQGHRHAGagRDADDHADLgrrHI-GYGVQLHHYDAVEQALLEMIRRMIGDAFTLDVRLAWSHIYNELVRIMLAG--
>SRR5215471_14715706 
------VPAGGPALARLLRR--------HLRRV--VSSRLAPLFLRLA-------FNDAISYDPATGSGGANGSIRLPEELARKEVAglaRA-V------------------------ERLRPVKE-------------------
>SRR5215813_3453690 
-------------------------------------------------------------------IASDSEIQVSPWtrt--GTLaisARRCS-SSRISSGigsdtTFSLYGNCV------------SSSATIAWNTHGD----IQLDS--
>SRR5579859_1863727 
------NISSLQLTILNLLTVEDEFVPRFYNNLFNMYPLARSLFVHTe--------ISLQYNKLRLMLMMIIRTIHDADGLKIQLqqlGQRHK-YYRVEPEHFAILYIVFVQTVVEYLGPKWTAELEAAWAEAYGTIVRMMDME--
>Dee2metaT_7_FD_contig_123_47857_length_200_multi_10_in_2_out_1_1 # 3 # 200 # -1 # ID=100007_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.434
---------VLRDREG---LGDPELVVLQRRHLAEHGAILQPLalLARQr--------HREDLELVRELLLLECDHRVEHPRahpaGVGVEgelGVGHH-TERIKRSlspsalLGRWIDLVVVGAVRR---------------HHQGGVVDLRLVE--
>SRR5262245_19300173 
-LLTPAQKRLIRESFVTLEPAIDLVGQLFFLKLYRLDPSFRARFGG--------NPETQGRKFMAAVKLAIIALKHDDCLAPMLKLLgvrQR-ILGMKVRDYRMIGKAWTWTLERSLEKRFTRPIKDDWTALLALATRVLSG---
>tr|A0A1S3M8L1|A0A1S3M8L1_SALSA cytoglobin-2-like isoform X1 OS=Salmo salar GN=LOC106571144 PE=3 SV=1
-HLTDEHREIIKETWKVIQENIAKVGIIMFVGLFETHPECKDVFFLFrDVedlerLWNNKELQTHGLRIMHFIEKSVARLNQMErldQLILDLGKSHY-RYNSPPKYYMYVGAEFIRAVQPILKDNWTPEVEEAWKTLFLYITSIMKQGYV
>SRR5258708_4037766 
-------PGAVGPAPGLQPPRNRPGARRGQPALMQSPSAGGPPPGPHrpR-------RTHRTPPRRAALVLLRRSLRDLDEVVPGLRAMgarHV-RYGARPEHYPVVGAVLIDSMAEVAWDAWRPAYGRAWAAAFDVVSGAMLAG--
>OM-RGC.v1.004444255 TARA_034_DCM_0.22-1.6_scaffold509117_1_gene597562	NOG05352	""
-PfLQPTKFELVVNLKTA------------------------KALGL--------EVPPTLLARADEVAGVGGSAKRISHWppr------------------------------------------QSRWAGLPRRPERH------
>SRR5262245_16285966 
--------XMVEGTLDAV--SLPALSADFYRRAFDTDPELARMFTA-D-------RRVQEARFATELAAIVRSIRCHDEFVPagrALGPVPR-L-RRDGRPLPRDGRRPAGIagrcprsdvearGGRGMAPRLQPDRRDDAERRPRAGQLGVTSG--
>SRR6266568_4225566 
----------------------------------------------------------------------------------FFFFQaedGI-RDG-TVTGVQTCALPIFDTVRHFGAGTWTADMQAAWETAVASIGSIMRA---
>SRR5260370_35001365 
-----------------------------------------PTFPP--------AVGAGRKGVSRAVPGAVWSSDQPERLARGVGELardPG-KFGVPEQPYRLFCDALLATVQAFCAGSWSDQVQAAWERALAAITAAMMaggsgapgE---
>SRR5215475_1743066 
----ISYWPLVKQSFARATSDGVAAAEHFYARLFAVNPGIRALFPT--------SMTVQRERMFADLSRVIWSLDTEPECTALLRQIgreHR-RYGVLAKHCEAFLAARGRLLCrHDAGRLIRCERRARVVDCLDRQSRTAVagggl----
>APLak6261665767_1056052.scaffolds.fasta_scaffold282062_1 # 1 # 210 # 1 # ID=282062_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.505
--------------------GRSNLSLVFMKICLKLIPKLNVYLVKLI-------WRSRVKKLLNSLILLVEGLRTPEALIPVLKDLgarHK-GYGIVTEYYPLVGEILLNTFADYLQEDWTPEVAQAWLEIYTTTSNLMLEGAG
>SRR4029079_9820506 
-RVDGILVEGLQASLATMQPAAAQIAHGFYTLLFARRPDFRAMFPE--------DMAAQERKLIATLAFVCEHWRKPAAVSvrlADLGALHQ-GLHVKPEHYPIVCDALVTAVMKHRHEALGPHRAR------------------
>tr|A7RHV8|A7RHV8_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g197347 PE=3 SV=1
IPLSVAQKYLVRETWETIEQHSKAVGKKTFLRmfymssidfiysvvmeskgskdirvlglelafddvknsyrtwrFFEMNPDYQKLFPEFaTLDqvelEQANALHGHAKRVMKAVENAVSAMDDAESFAAyleNLGARHK-ARALKPAYLDAMQVAYTDTIQDLLKTQWTDGTAEAWNKLFRFIADTMKHGL-
>SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold1207366_1 # 2 # 214 # -1 # ID=1207366_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.286
----YASHQSQAASLAKAAPRPRVAVLGLrlpsgeSPQLARLGRAFAELLGA--------ELAAGERLLVLPAeRVehMKLELGLDEAEAYPLPTLgriHR-NLGPDLVVVGTlapqeprgtlsvtveVKDCLTGAVTATAKVTGPAAELFTLASQvggelrrrlgssalsgneraelraqrpaSPEVAQLYADG--
>ERR1712062_404977
-ILTNQEISVLKSSWELIAKKIEIAGAHTFLPTFDRDPKCPDN------------IERHCQRVMSVVGGSIELINDYKSLWKhliSLGREHF-GKIREWIFASIAGGSTersgcspssINFLSSKINGNITSKK--CFLQ-YKIVIITQX----
>SRR5271166_154013 
-VMTRLEIALVHEGFHRMESRLESICMAFCRTLFGLDLSLRPLFPN--------DLQPLAAHLAAGLETAVRSLDDLQPVlvcAPALGLRLA-SHGVVPDDLHTVCAALLATLQSELGDAFTEGVRAAWRRLFWIVAAATIGA--
>ERR1719261_40108
------TIAVVQGTWQEIKDalgdgVAETAGVILFKHIFRIAPQALALFSFKDCAggnvcdelFENKTLRKHAAKVVGTVDTAVGMLKKTRQADSRPGQSgqeAR-GLwggagalrcgrgGVVGDAVGRVGRRVYDRGPRGLGGGLRHHQNHN-----DRQELRLHGR--
>ERR1719238_2294225
------------------------------LKVA----SALREFNTLRAEgivseqefLEM------KAKLLAVGKDELG-RSPSGDTLETLVEAthemdssrrRT-RWtrrarraSRSPTTVGVISCQIK--------KSSTRRTTRRW----------------
>SRR5690349_20281755 
-IMRPEQAALIRTTWAQVTPLGIAAAALFYERLFALDPELAAKFAHTDM-------ERQGKKLLQALTVVVATADRLHTLGPSLEELglgHL-RYGVMDRHYDTVGVRYWPPSKPPLAQRSRhrsrrhgPWPTPAWPPMC-GPARGGR----
>tr|E9IBK1|E9IBK1_SOLIN Uncharacterized protein (Fragment) OS=Solenopsis invicta OX=13686 GN=SINV_03861 PE=3 SV=1
-GLTEKQKRLVQNTWAIVRKDEVSIGVALVLaiarfvyecntksffySYFKQYPEAQKEFKAFkDVPidelSKNKRFQAHCANIVATIGKVIEQMHDPElmeASVINFTEKHK-NRGQTQKQFENLKQMMLDVFPSVFGKQYTPEVQEAWKKMLDLIYSKIYQTL-
>tr|A0A0L7R0Z8|A0A0L7R0Z8_9HYME Globin OS=Habropoda laboriosa OX=597456 GN=WH47_01055 PE=3 SV=1
-GLTGREKRLVRESWSVLRVQSVNTGVAIMTSYFQQYPQYQKVFPAFkDVPldelAASKKFQAHCQNIVSTLSNAIDALNDVDlmeAILHTAGERHG-RRGQGRQEFIDLKGVIIEVMKGALKSRFSTEVEAAWNKTIDVLYLKIFEGI-
>tr|W6FSH9|W6FSH9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_a PE=2 SV=1
LDFSDDQKADIKSTWETLYSgNKFQLGVELMANLFKAHPDYQDLFPSLkGIPdvAGSNELRGHAIRVITGINNFVDALDEEeevmREMLHNMARSHK-PRKLTKTHFNEFAPILLETFEKKVD--MSSKARDAWIALYYSIVDNLFAE--
>tr|N1VSG6|N1VSG6_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 GN=LEP1
----PDPILEIQKSFDHVLEYNPHWIDSYIDKLKNFSMenvTENQREGDNES-------PISSEEFLNSIESIIEKLGNPISVKKEVSKLaniYE-SLGITKKEFPKLLPILLSSLRENLPSEWNPSLESIWTQAITDLTIETIES--
>UPI00001F6528 status=active
----IDGLRDLSESFDTLaadeaatAPAATELKaavegqfsgvfGAEYAKQTGKQPDTASYTLE---------------HSAAALAQYHYIVRNPHPLGQknKLDKVagEA-RYHALHARYHTMLNAYLERFGyydvflidldgdvvysvfkemdyatNLKTGPWRDSgLGRVFRSALESNDtkSTFFDDFA
>SRR3569832_1336210 
---PALVRAAPDSAAALRRCRCGGTAEKIAERARADD----------------------------------PESENSRGAGAemkGLGARHK-QYGVQPEDYPAMRAALLEVMAALAGKAWTPAVAMAWEDALYILTDVMQKAYR
>SRR3569832_1187104 
---PALVRAAPDSAAALRRCRCGGTAEKIAERARADD----------------------------------PESEKSRGAGAddeRIGRTAQ-AIRCSAGRLSSDACCAVGEQNGNGGX--------------------------
>ERR1719259_112507
-GVTGRQRVAVQASWRLVAPDAKRHGVAIFIRLFKKHPETQLVFKSFkGQQpeslADNKRLAAHATTVMASVATLVDNLDDIDTLLELLHKVaenHK-RRGLPIQYFEMVSNTIFDYLVETLGAALDRSGVEGWSNVFRAINSVIAAEYK
>ERR1712107_384356
------------------------------NRIFTEQPNVQQKYFSHmD--iNELGTLGKHGVGFMKKIDLMVTyvKADEDDNLVALIHEItvsHS-KKGIRNAwEFEIVCEILISYFKEAMESEFTSDAEDAWkkffef------LV--------
>tr|Q53I62|Q53I62_9ANNE Intracellular haemoglobin (Fragment) OS=Alvinella pompejana GN=hb-i PE=2 SV=1
-----------ADNIAAVRGDVSTHAMNIFVEYFKKFPQHQNAFADYkGKDpeslKSLPKFKTHTTKVVSKLLDIVEKASDSGALQSNCTTLakmPQ-HKGLNQQQFADLGAVLVPYLQKALGGACDSA---AWeqayn----------------
>tr|A0A132BSZ5|A0A132BSZ5_9RHOB Flavohemoprotein OS=Rhodobacteraceae bacterium O3.65 GN=hmp_2 PE=4 SV=1
-VLHQIDARLVEGSFGTVFARKAELTDVFYKHLFEEMPAARDMFTH-DF-------SRQKEMFARVLATGVRSHRGDATLAPLIENLllqHR-HLGLTSEHMYMAQRALLMAFRVVLTGHLTAAELSAWNAALRRLCQSMAAGL-
>tr|F7RKN3|F7RKN3_9GAMM Globin OS=Shewanella sp. HN-41 GN=SOHN41_01091 PE=3 SV=1
MGLTEIEKEAITSSFSLINHQEQHFATIFYDCLFDMAPLIKPMFKR--------DRKLIEEHFYMIFCAAVDNIHHLDTirtILLELGARHR-NYGVKVLHFPIVKSALILAIQHELKGQSNASIENAWSHYYDVLAAIILEG--
>SRR5579875_3194573 
--------------------------------------------------------SRCCSRATPSYGRCSRSRCrgpgrrsATGSPSSSATCRrpgAR-RSCSRRWPGITAGSASvtgtTGRSSRRSGPAWTAELDAAWLAATDWFVSVLAAA--
>tr|A0A0L8P0I1|A0A0L8P0I1_KITAU Flavohemoprotein OS=Kitasatospora aureofaciens GN=ADK78_37645 PE=4 SV=1
----AADQRVITEYLELVTPFGE-LITHLYETMFRRWPYLRSLFPE--------SMEFQRAHLARAFWYLIENLHRPDDIAEVFGRLgrdHR-KLGVRPVHFQAFEAALCEALRRTAGPRWADAVEQAWVRMLRFAVAAMVSG--
>tr|A0A0G4II14|A0A0G4II14_PLABS Uncharacterized protein OS=Plasmodiophora brassicae OX=37360 GN=PBRA_003666 PE=3 SV=1
-NLTEERIDIVRKTWLTLKSGqgkgerdrlgsnpsvqdaMDLLAVMFFEILFKNAPEVEALFQC--------DLVMQGRRLTTALNNLVDLLGKdaaaISEILTRLAEVHH-PHGIQPEHYDPFGQALLAMVKAGLAEDFTSDVCEAWEHLYSTICSFMIP---
>SRR5262245_46558688 
-EMNRIQVNRLRSSFKWFRPCGPAMIAMVFRSLGDRHPGVRALFPE--------DTSTLNKRLFETLRQVVKALARFHSLEERLMELgarAA-RAGANPAHYRIVRDELLATMAALAREDWSEELARDWTLMLDAVSGAMLRGA-
>SRR4051794_9566520 
--------------KALVEDVAERghrrPMEVFYGARsdhdlydidtmlrmAQSHPWLS-VRPV--------VATGpaggPMNSLSGQLPDAVRQYGPWREYDAYLSGPpgmIR--NGVD----ALVGVGV---PSDRIRHDSVEELVAAGDX--------------
>SRR5258708_3005780 
-EPTPTDITIVSDSLAPLTkEQVDNVLAAFYHQLFTRQPSLRQLFKSFRsgDQPDQQAMKLQRNKLAEIIALGLKLWEKPHQLIPALEKLgrqHH-QYGVRDEYYEDVWIALSEVLSEAFGLDRWEDICESWQRFIFLCARHMLNG--
>ERR1719198_2284224
---------------------------------SDMPSDALDWFTNP-TPe---KRGTPDGGKVVSADVVAVAGQM-----------------------------RELISLPEADVAQGLSQLDP---Q-----DLMVLQ---
>ERR1719223_1791071
---------------------------------------------------------ANSKAT-D-DEAS-KS-D-----------------------------ATKVAVPAGVAAPEPKEEE----P-----VAVMEP---
>SRR6266542_3322184 
-VMTPEQIEAVEATTAVLAPALDDLAADVYARLDRLAPETAELFTG--------GPAAEVRGRARDDRARHPAPRRLpGACl------------PARPPARALRGQA------GALRARRC-----------------------
>tr|A0A194VHM2|A0A194VHM2_9PEZI Flavohemoprotein OS=Valsa mali var. pyri GN=VP1G_10414 PE=3 SV=1
MALTHHEAQLVKSTIPFLKEHGESISDTVYRTLIEKHPELNNTLNLIHL-----KDGRLARALTVVILRFASSINHISELIPKLERIcnkHC-SLGIQPEHYEILGGLIIETFDDAMGPLMTPEMKAAWTKAYRILSNMMIG---
>tr|G9MK89|G9MK89_HYPVG Uncharacterized protein OS=Hypocrea virens (strain Gv29-8 / FGSC 10586) GN=TRIVIDRAFT_143449 PE=4 SV=1
-------------------------------------------------------MNPPEKVDIRSTDGASVIYRDVISLNSPQEEIrvlHL-ESG---SGSSLLKCTLHRvSLQSVQAPSYE-ALSYTWGNEndrraVVV-NGYLVD---
>ERR1022692_2453048 
-------XMSLPASFTSICNgiLGREE--------NSGCPAAKGQFLP--------DRDAWrRssaLLLFGPLHQASRSTGYVSHLHegaArppgrRispDRRPgrqAG-RSGRLRAGPRAGPPQVRGHRRALRRGRRQPAGDTGAFRGRHLDARVMIEA--
>SRR6266581_3027569 
------DTHRLKDSFAKIAMHGDEVPLFFYSDLFIKHPEVRELFPT--------SMKAQRDHLIVALGQIISQVDRVDELSAFLRGLgrdHR-KFGAVAENYEYVRDSLLETIAHFSGAGWTSRLDSQWRSSRRPGRGCGA----
>tr|A0A1D1W7H5|A0A1D1W7H5_RAMVA Uncharacterized protein OS=Ramazzottius varieornatus GN=RvY_17919 PE=3 SV=1
-GLAVKERMLVQRTWKELMqLGRSNVGIELFHQYFTKYPQYVQHFKAFREvPseklKAHPRLKAHATTVVNAMDVIIDSLDDTETAVAVLDKTgrdHD-RRGLSTSAFADLQTTLMMLLGMFLKDSWTPAVEQAWDKALTVVMNTVM----
>ERR1719487_198517
-NLTNNDIDLVHTSWNMILNDtapeyvklkesgddkhancVAWFYTVFYHRLFDVHPACRHLFTR--------EMMTQGSFLVRMISLTLQEMHDMEnfrDMMRSLAEKHC-AYGVKGIEYGIAGDVLLYSLQTVLGSdVFTSAVHFAWRKVYSAMLNHITP---
>SRR5439155_13306073 
-LLD-------GGTLRAVRMSGDTRSEPWLKDLWERGVAVGELRRHLLLPleTPPGLPVPRGRILCNCFDVAESEIDAFLA----------------------T-SNSIAELqarlkCGTNCGSCLPELRRKSLCDIG-----------
>JI10StandDraft_1071094.scaffolds.fasta_scaffold6072973_1 # 3 # 245 # -1 # ID=6072973_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.634
-LVESFGAEGSDKKVTGIRLVGETIASDWLKEVMTSGEFTADIRRWALAPlsAPPSGHAGRGKVVCS-----------------------------------------------------------------------------
>ERR1719326_289429
--------------------------------------------------------AGQRMNLTKFITTAFSLLGTLPDALEALSQLgmrHI-LYQTKDAYWPVVGANVIKTLKIILPAEDFDKEtEEEWATLYGIMQKTILDA--
>GraSoiStandDraft_1057264.scaffolds.fasta_scaffold343999_2 # 425 # 754 # -1 # ID=343999_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
--RRRMDAELLETSLALVDTPDDGLTKRFYALLFERYPAVRPVFPEEM----HRDIARQAKMLRSAIISVVDHLDDPVWLtetLGELGARHA-GWGVLAEMYDAVTECMVAAMAEIGGDDWTPYMTDAWTEALDAVSGLMLLGYP
>tr|A0A1I3HEN0|A0A1I3HEN0_9RHOB Nitric oxide dioxygenase OS=Jannaschia pohangensis GN=SAMN04488095_0565 PE=3 SV=1
--VTNTQARLLSRSLRRISENGAPLARSFYAELFSAHPEVRPMFHS-D-------LSTQYAKFEDMLVVLVADVLNPGVILRPLQDLakrHV-EYGVTREMYPIVGDIMMRTLRTLDAAPLTGDELEAWDVLLGRVNAFLMDE--
>SRR5215203_6923026 
-PGDSGADRAGRAD---AERDQAGLRRGRG-RLLPPAVRRRPLRggavhhrAG-H-------PtgeADRGAGCGDALDQAPRRVPAPGRH-ArpaAPGLRG-----------------PPAALRHRAG---------------------------
>SRR5579864_8015183 
---KPDPIFLVHTSFVHLRPRMAEFVSNFFRRLLKDSPELAPIFEDAD-------SVRLKTMVAKIFGTTIAGPEQTDQVeadLAELSRRHK-SYGAIPDFLPLVGRAFIATIRESLPDDTTPQTIEAWELLYANTAALMSKGL-
>SRR5262249_54331370 
-IRLRK-------EIDNEWLLIASgVLSVIFGLILVAQPGTGALA---------------LLYVIGIYAILYGILGPRPCcv---------N-RFGAQTALDRG-----------------TSTYRELWNIS----VARLIG---
>SRR6266536_6175029 
-LMTPEQITLVQSSFERLGPQLPAMATRFYQELFTRDPALRPLFTT--------PLPQQEVRFAEALTEIVRAMPRLDELLThtrAPRRPArrlR-GTGCRLPDPRRRPprrargrpgRQVRRPHTRGMGPRLQPcrrdharrrsrgPAHQQLTTTAAPTASQADGG--
>ERR1700754_2066947 
------DPGdrQLARELLAGAAGGDDLDALvehDRGAVLEIAREAVPVaLAQAD-------RDdQLGHLGA-----------------DRlLRGPaerPL-GRGAPLQDVALVvhrddavergqqqRAVALAAGAELVGEIWERQERGSLtARRYGSNRSI------
>SRR5918995_1637126 
------DVQALEKSFDLVAPRGDDLMEVFYTRLFTAAPAVKPLFAATD-------RRRLKRPNQRSPSVsVSEKQWSMKCSQDQladgqstgasrpetprndcsdsppgelaaKRDQgakTL-SRGGCSGGAIMVPDCRTPTPRGRP----------------------------
>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3668839_2 # 105 # 377 # 1 # ID=3668839_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.656
-------SGPLAASLAIFEPRLEAVTARLVDVLAASSPHLLALFPPSSEP-------S-----AALLGRFLTRIVETESLGqPLGDGLgldAY-PIP-TRDQWEHLVESFIWSLSAVAGKAFSPPMARAWRATGERLFSTMFES--
>LULI01.1.fsa_nt_gb|LULI01000097.1|_29 # 27187 # 28320 # 1 # ID=97_29;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.310
---------------DEIKGRH---HSMFVDEFERQQPQYKD---------------------------FWARL------NrGEYQAGeyrRY-GKG-GKEVWIQA----------------------------------------
>SRR6266851_2503075 
------------------------------------------------------XMRNGSASLPLwPARYGAWTTRRPSPNISAPSRSti----------ANSVCGRAITNWSARRCSPPSVSSAASGWEAAFNRIATIMIQA--
>SRR5215204_2071689 
-VVMSNDYQLLKESLALIEPVYDKVTGYFYARLFVENPHLRLMFPL--------TMDLQRDRLFRALVHVVQAVDQPEQVVPMLQQLardHR-KFQVEPAHYDAVGRALIGAIRQYSYGEWSDEIEAAWWRTYSVAARTMIDA--
>SRR5262245_14739337 
-PCARARLRPR-------RPAL------Y-AQALPPRRLVPRPVRE--------LAEAQSRKFMAGLKLGIIALNYEDGLTPVIRLVgvrNR-RAGIKVRHHRVMAKALLPTLEQSLETRFTRDTKHAWSSFLTQVTRILSG---
>ERR1719401_2136855
----------------------rGCMGVTSAPQTLRQVRQCRRLHGGRLArhdrdwsaeegsdeedVWESPALRKLFGKFVNAVGCTVAGLHDMTEIGLP--RRgatKR-MYGSHqR----------------------------------------------
>ERR1700736_6084178 
---------------ARVA--------QALDRVRKAARQRKK------------------EQFTSLLRH-----LNVDTL--------------RTAHYALKRKAAA-----------------------------------
>tr|L8LYK6|L8LYK6_9CYAN Hemoglobin-like flavoprotein OS=Xenococcus sp. PCC 7305 GN=Xen7305DRAFT_00009490 PE=4 SV=1
---MSLQIGLLEQSFNCIRPYGKLFVSSFHENLFQTNPEIKSLFMGVE-------SQIQKNRIWDTLVLIMENIRHPNLLnntLQGLGARLF-THGLLPKHYPLVKKAFLATFKQFLGNEWNSELEQAWKNAYTYFHDLMQEG--
>SRR5919106_2778213 
-----------------------A-VDRFYAA-VLGDPELAGYFTdvdidrvkrhqvlllsdvlggpesydG--------PDLGQAHRGlgitdghyDKVVGYLVAVFTDLGADGDTIAAAaevL----ASVK---PQ----I---VEDQAGSRDSHEX--------------------
>SRR5690348_11784222 
------------------RaePGRAGgvprarga--RRLGEPGGgrarpSRRPLADR-AAD--------GPH-ARaPRQRARPAAGGRRHRLRADAGgargPGAAPGaaaHP-GLDVVP--------Vveqdgg--------PGadpcgpleegtlADVVTRY-GAWADRDVLVCGSPAMI--
>SRR5947209_9205436 
-------VLSVLRSpssplF---PyttLFRSRltver--DSERDVLMvaggtGIATMRAL--LD--------DLA-QWgENPRVHLFYGGRTDDDLYALDd--LHQLdrkST-RLNSSHANISY---Avfclk-------------------------------------
>SRR5438270_814702 
------------------------------------------------------------------------XMTANAVVSPLPSQPprrQP-T----------T-----------GATAMVRLVRESWARI-------EARQ--
>SRR5919202_1970091 
-------VQMVPGGqvsstmvrslkvgetV---RlgAPLGQaltlyag--ERHRDLIMvavgtGLAPLRAH--LE--------RIDqEwqSTgRAPRVRLFHGARLPWGLYENRl--LQNLagRP-WFTYTP--------Vvsddp----------typgrkgwvGDAAAVS-GPLHGLLALVCGSPEMV--
>SRR2546430_6350501 
--GGRResRVRGGQGGWV----SRAIVAEPQRGDVGRSGPAMGRMKVD--------RG-AGRDVVMVAGGT------GLAPMRAIIDDL-A-QWGENPRvhlfyggrgrggPYH------PPSLVSTAAAqPGVPVVavagaeaglshkeagspagggvrHGALAGRG------------
>SRR6195952_1380156 
--DVALAGEAVRAIWFRLADQEADVAHWFGALLFSLAPHLRAQFPA--------QADRAARRLLRASIAAMSAVDRPQEFPAAIGTLareTR-ALGLDASADEPVGVALVGAVREFAGELWAPGADAAWVLAYSLAAEPARR---
>ERR1700709_350262 
----------------------------------------GDLDAD--------AT-AERELLVVAGGRRGGVGpaprGepaGPSGAGGGRPPRparLA-AGVDVRRttvivgartaedLHT------LDRFAVIGEDaPWLAVVgacesdplelglapgpvvegitrAGPWLEHDVVVA--------
>tr|A0A098BFR8|A0A098BFR8_9NOCA Flavohemoprotein OS=Rhodococcus ruber GN=CS378_10080 PE=3 SV=1
--MEAFAVARVQLSFAsivATPGGAERFATAFYTALWSDTVGIRELFPA--------GMETMRQRFATAVGWAVNRLGDPDAVTAFLTQLgrdHR-KYGVRPEHFRSAGRALHTAVRECTPPiLWTDALDRTWARVIDLLVGTMAD---
>SRR3569833_3303276 
------------------------------------------------------------------------------PNNTNHDKH-T-HRKRNPPehqniggkrpedLYV------LDDLRRLTAVsKWLTVTgvteegaipggdrgtlahavaqRGVWEYYDILVS--------
>SRR5215208_6178010 
--NGRGRPRPDTAIIRRGVAGQPTIRHLFYDRLFEHDPETRLLFRS--------DLDRQRLRLLTMITAMVGPASDDLS------ATNA-GhAGVPPWRWLSLA-----NARDVADP--------------------------
>tr|A0A0J9XAH5|A0A0J9XAH5_GEOCN Uncharacterized protein OS=Geotrichum candidum OX=1173061 GN=BN980_GECA07s01957g PE=3 SV=1
-SFSSWEIAEIRQSWASMRDDQLevsqekanvgtasaFFCQQFYENLLGEYPELSVLFPS---------IKSQASSMAGILALVISQLDNLPRVrevLISLGKRHSRIIGVEVTHYELVGNALLRTLSDRIQDEFTPELENAWIKFFTYITNLMLQ---
>ERR1044072_9602616 
------LEQSGYTVVGRAADARELmLKVRSYVPDVA--------VVD-V-------RMPP------DL--------TDDGLRAAAEI-rrsHptV-SVlVLSQHREPAYMLELVGDDASGVGYLL-KDRVRDVTQFVDAVQRVAAGG--
>SRR4051794_28399871 
------EHEAGTDLLELTD--------ALVRAGVPCADAAQEAVAG-V-------ELPHGAQLPAER--------LADRLERRRVD------lD------------------------------RLLRFGEDAG-HLVLGA--
>SRR4029453_17830486 
------DLQALETSFDLVASRGDVLMDVFYARLfaaapa------VKPLFAGTD-------PRRQKAMLLGALVRLRGSLRGPPAFVPPLPRPgagPggE-APlrrhrSPAPEGHAARGPraaAWLPARPAGVRSGaatPRGQARRLWRPAGALPGGRRgpdrLHG--
>SRR5688572_12388254 
-SMNEEQIKLVETGFQSITGRGERFISRFYENFFAASPKAEKLFAQTEWP-------NQSRKMLLTIMMVVDNLRDAAHIKKMLHEAnlvHQ-KFTLQADDFDALTDAMLRTLREFLTDDWSKEAEDAWRAAFAKINAIMLEA--
>tr|A0A0N7Z8G1|A0A0N7Z8G1_9HEMI Putative hemoglobin-like flavoprotein (Fragment) OS=Rhodnius neglectus PE=2 SV=1
-GVSKEGIAAVRKTWEPVYKDKENSGVFLFQVLFELHPDFEKYFARFkSEGakslFDNPMFLFHVkHKVMDSLNEVIDNLENDERLLKILKSVasnHK-KRNIKKEEFVTLGKVVLETLRRALGTAMNPEVEDAWTKVIDCAMSAIG----
>SRR5712691_10715499 
-ALTLEQFRLIQHSWQMVKDGQfnafkaqqliadplGFWGLQLYDTLFELNPALKPMFQNT-F--------TQSQMLTEMVGAALGLlpgiLDQAlgeektavlwylPEYKiviisITYANMSL-SQNIDR----------------------------------------------
>SRR4029450_4347554 
---------------------------------------------SG-V--------TGSSLPKTLVREgvQSLTtpchRKLPlgtektaidpqlLPILVDLAARHV-SYNVKAEHYGTVGLALVTTLERTRGSRVAAPTKAAWVELWSLICTVRIP---
>tr|R7TL54|R7TL54_CAPTE Uncharacterized protein (Fragment) OS=Capitella teleta GN=CAPTEDRAFT_144794 PE=3 SV=1
-KLSAEHKTTIRDTWPLISHSLQDNGIVVFEKIFEVSPSIRTVFAASfGFpaspipDayelSRASNLRDHVTRFMQAVGWSVQHMDDLDTV-ttvfVNLGKRHIHLKSLEPDFFRVFSGALMYVWRSTIGPDlFTAEVRGAWCKLFEFMLQHLAHGY-
>tr|A0A1B6EVA8|A0A1B6EVA8_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.22480 PE=3 SV=1
-VITERDKYLAREVWMQVETNYVLISKSLFTNWITEFPEHLNFFKGLlDSSyddfLTSPKFEQHMaNSVLPNVGIMISNLDRPTDFRRHILKLawiHI-RKniALKIDHFNILKGLILRTLKESLGRGIGRDHEVAMFKVITAGFNLFS----
>tr|N1VY19|N1VY19_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 OX=1257
----KDTILELQRSLELALHLNPNLARDFYVHFLETKPEFQKFFQNTD-------METQAKKLLAMFGRTIERFGNLNQIHNELknlGKMHE-EMGIKVTDLAEIAPSLLYALEKSLGERFQTEWKPIWEEALGSLVRLMS----
>UPI0007D2C88E status=active
-GLDHKQIEIICASWAEVKKFGtEAAGCLLFKKFFIVAPETFSMFDEFkDIPnwEDSTQFKHHCKIVMNIIGGAVGLLRDPESLDSTLEYLglkHE-GFAITQHHFDLMQVELINTFRDALGAKVTPDVERAWNIFYAYIVRIIVCG--
>SRR5437870_4959208 
--MARVNPRSMAHA--------ATAIAAATTRASEFMPTPQFVRTP--------AMPTQRERLLGAIIALVTHFDRPENLLPALTAMgrrHE-TYGVSLGHYAAVGSALLATLRDFAGLAWSPAYEGAWARAYTFAAG-------
>SRR3954447_20457037 
-------------------------------------------------------------HKVKVEDIIVRGGGNL---MVEL--MntdAA-GS-----PLDTPVRAVTDG------TESTAAAREPI--------RLNPG---
>SRR6266545_1588040 
-------G----CDLEQAVDTCPA----------A---LVIGLRPA--------TMGTL---------CYMGGLASA-------AVCcwrHV-RVVTCSQFF-------------------------------TTASPQSRQ---
>SRR6059036_2276597 
-ALFPGTSHWVV---AAGMARP-ESKDHPMLTVAQKTLVQ-----D-T-------FAIITPIADDAAALLYKKLFELDPSLERM----------------------------------------------------------
>SRR5581483_12392512 
-PMTPEQIQLVRLTLAQATAGEPSIGRDFYRRLFVLAPDLRARFQG--------DVEAECPKLKDTLKLAFASLSDLPFLIATLEALARrgVARGLSDQHCRAISKSLLWAIEQRVGSAFTPQVCNAWIAFLAVVVSILR----
>SRR4051812_13904716 
-GMSPEEVALLRHSLDEMRADGPQAAEAFYAELFRLDPSARELFHL--------PVEQQSVVFFHELDALLSAVSDLPAFverSRRLGRMHA-GRGVRPEHFEAAAAALDAMLLAVYADGASPELRRAWRHAYRMAAQLMQEA--
>tr|A0A0N0S3I7|A0A0N0S3I7_9BACI Uncharacterized protein OS=Lysinibacillus contaminans GN=AEA09_04415 PE=4 SV=1
-MLSLETINEIKKIASAISVNGEIIKKIFIEKLQKNVPELLHIFYQIL-QK----SGRSKISLIDAVYSAAMQIEHIDRFVPAVMQVahkHR-SLGIQPEHYPIVGQHLVDSIQEALGNQATEAGIAALQLAFNRIADVFIQV--
>ERR1719171_419597
MGLSAKTIEIVKATAPVMAEHGYAITSAMYGSMLTADPYIASLFNPSHQKVLPgDTHANQPRSLANAVYAYAANIDNLGALTSAVTRIaekHV-SLQIEASQYDVVGEHLMAAVKKVLGDAATEDVCAAWTEAYGFLASLFIST--
>SRR6187431_1436969 
---------GAAQRRRTVWALARKA--------VRIGPDRANLVQG--------GPRGFEDEAaQHACDDRVGAADRPEifdSVVEDLGRRHA-LFGVTPAQYSAVGEALIWSLGEALGPALTRSRREAWSDFYKVVQLSM-----
>SRR5215207_7267255 
-----QAV-----------AGEPEVRGSILRKAVRIGPDRANLVQG--------GPRGSEDEAaQHACDDRWSRLSTR-dlrLGCRGFGTTSR-TVRCDAGSVFGGRRSL---nleLGRGARTRADPVQARSVERFLQGGSALHVEG--
>ERR1719491_1400349
-------------------------------------------------------RQRRFTHMGAASGRPRAAVALPGARA----SLhdrPR-PHEAE-ASVASRCEATIKTLRDLLGDDCTPEVENAWAVVYGFMSSIMVESLR
>SRR5919197_656730 
-LLDDDTIGLLDESLRLIDDRSDVVVNHFYAAQFATPPPRGLLGSR--------ARGC--------LGRGVR--------RDGPGDVgrrSR-GGGGRAGLV--EGRD-------------------------------------
>SRR5688572_8260099 
----DQEINIVRQTWNRLAAeHGNSVAEEFYKRLFECCPHLKDVFKN--------DFEVHGKEFIENMDHIIIQLDNPCMirEMQILGIKYA-SYGIRYEDYECMKKALFDALKTKLAEHWTPTVMVSWIWFYSTVSHIMKH---
>SRR4029077_8414069 
-DMTPAQLQLIKKTLPEINASDDLFAAEFYRQCFDLWPETRSMMPG--------DLTERGRALVAEFIALASCVSgDMDRVVARaheLGVRHR-GHGALRAHHEVVEQAPAAPLASVLEDGWDEPTAQAWH---------------
>SRR6478736_6664572 
--LNAVEIARVRLGFARVVPNCGAFADDFHARLFELAPTTSALFPD--------GVSNRRAKFRQTLVMLMTSLSTPTELKPALAALgnRCRACGVEEADFAAISQALIGTLAAHLGTKLTIADFDAWTALRGRIAGLLTA---
>SRR3546814_7943381 
---------------------------------------vfirlslsliiilvyRFLFFFFSSR-----RR-HTRCVLVTGVQTCALPIS-------TDELIa-----AWAAAYGQ--------------------------------LADLLIA---
>ERR1700737_1149585 
-----------------------------------------------------------------------------KQPDGSAEKHfeqAC-ESGRPTGAVSHCRGTPAGCDQGSVGRRRNRRDHFHRGKGYGNLADILMG---
>tr|A0A254VKN7|A0A254VKN7_9BURK Nitric oxide dioxygenase OS=Xenophilus sp. AP218F GN=CEK28_14595 PE=3 SV=1
-MLDDATRAQIRHSAALLHTVGDQLVEHFYQRLLRHHPELGIFFNATHL-----HKRELQAAMSRAAAFYAEHNDQPENLQPMLQHIackHA-SLGVRPEHYPLIGEHMLKSLEEVLGPLASETVLHTWRMAFSELSGKLIA---
>SRR5215470_13616785 
-----------------------------------------CMVTL--------CHCSFTqtcscGTRRRGICSRFRWLPSATGWCMRWAGScptSR-TSTPSAGTcRTWGASTASSAPSPSTTPTWTPELAADWKAAYDLVAQVMIG---
>SRR4249920_1577195 
------------------------------------------VWPC--------TATRCRCSSTRTC-----scgtrrRETCSR--SRWPYSAtgsCT-RWP-GSCPTSTTWTTSASTCRTWaaSIASSAPAPAADWKAAYELVAQVMVG---
>SRR5688572_1436081 
-RPAPEVIAAVSASCQAVADRPVRLAEAFYEHLFEIAPQARTMFPA--------DMTAQMQRMSDTLVGAIAQLEKFdtAQLeaaLRRLGADHRTRHGVEAEQYRYVGHALTRAVRDVAGLAYSGALSSAWIAVYQYIEAHMSAG--
>ERR1740124_2148144
---------RTRGAAALLLQgRAQPCGVAQAQEACYVCDEHCRCCSQGSgGPqqacarATGPPAHMPYA----THRCRVCCRIGIRARAPPTQALgkrHV-PYGVLPAHYDVVGQALLATLEGGLGAEWNDQVKASWTAVYGIIAKTMIG---
>ERR1711911_258465
-------------------------------------------------ritHGWEHVVQMHAMNVMNSITSIVDTLDNPESLVDDLKQIglnHR-KRPIEAIHFHVSIYAATEGVQHVLSEMIQSNIDDSAKYLRPVDGSQCDS---
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold8273257_2 # 299 # 427 # 1 # ID=8273257_2;partial=01;start_type=ATG;rbs_motif=TAAA;rbs_spacer=15bp;gc_cont=0.364
--------NELQTNIEDVYSAGDV-C-----ALFDSSaNRYRPtrtwlscafqgEVAALNM-------LGQDKVynegvFFNASHAYRSMYAVLGNFNPAQAD-gfeFF-VCNQDKENYE----RMVLKDNKIAGAMFVGSMKNVWSVKQLIEGQVDVS---
>ERR1711934_740551
---SEETIRIVKSTAPAMKQHGYRICTTMFETLFAEHPSLASMFRKEDH-----TVQ-pgesyerQPLLVAqavrhsprflflapdshpllilipfsssSRCTRTPSTSIISPRWSPPsrgERERA------------------------------------------------------
>SRR4051812_844822 
--TEPDTAFIAQSQLARIEAMGEELVQRFYAHLLA-APEMKQLFLHTE-------MARQHRRFLDQLTSAVRELRSPRNATAHLAALgarHR-GYGVKPEHFSLASSALLHALAVVIGKEFDARAASAWKEIIASLVILMNL---
>SRR5680860_1220841 
TQLTAEQKHLIRLSFLRIEPALDLVAQLFFLKLFRLDPSLRKKFSG--------PIDVQARKFAAGAKLAMISLGHEDGLaptLKLLGARHR-QIGIRTRHYRTMSRALVWTLERSLDKAFDRDTKDAWNTLTAQFTKVMAG---
>ERR1719167_531039
MGLEQADIDNIQESWGIAKSKakLREHGVNFFLLLFTTLPEWRsKDFSHLgDGtleeLKTNPKFRAHCVLVMSNLNYWVENLDELDMGGASIQKTavnHA-GRGIMAEQFETVLGVVLKYLQGALAENLTEAMVESWTTLADTIVNIIKELN-
>SRR4030088_1427564 
---------------------------------------RRGRDGGQP--------R-RRELRRDGQepdepDASRRGDRGRPCAGPASR--------------R--RGSAAGCRSSPPSPAWPALSYEQWRETCDTLHGhTQVLG--
>ERR1700752_5389668 
-----------------------------------VVPQVPAARSRVP-------LR-AASFRRGGLehdpdPKGRVSAKQEPV-FGK----------------D--HGQTIRLSARGQSS---PrRNDAARETTCKEARMtPEQVK--
>SRR6218665_550821 
-FLSEEELTAAKSTWVRLQAtrNMQAMGVKIFLRIFELEPATKQAFESFrNLKseelVTNVLFRSHATRFMKAVEVTMNNLDALDVIivpnLKHLGRLHTDFKGFHVEYLKAFEVAMDEVWAEELGTAFSGDCRLAWTKIFSLITTKVMEGYN
>SRR5690606_39778542 
---------------------------------------------------------------------HATSVTSSHPCTPPVPcqcarrpALprlLRSsptrrssdlsL-MIKPEHYPIVgENLLASIRE--VLGe-gATDAVINAWA-EAYGFLA---D---
>tr|A0A257MW93|A0A257MW93_9GAMM Uncharacterized protein OS=Methylococcaceae bacterium NSP1-2 GN=CG439_2278 PE=4 SV=1
--VKVKNRLLVKLCIDEISPKIDIVSQLFYQELFHLNIHLKTIFSG--------NVTFLNRKFINMMATfkNVKHLEAIENSVEKMGERHVLHYRVQLKHFPTLKKALLLALKKHLGERFNAELEAAWHEVFDDVAEIMQRA--
>SRR5690554_3276444 
---xmSDADRLQVQASVERIRGQMDGFAGCFFDKLFALQPALRELLAT--------E-EGRRSKLRSMVSTlaNSRDFDKIAPAIRRLGDRHR-DYGVGVQDYVPVQQALLHAVAQVDPQGQSEQVQQAWSGQFQRISALMEPQ--
>SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold510383_1 # 42 # 362 # 1 # ID=510383_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
---mTSKDRALLKECVEYIEsESINELCDIFYKKLFDLDPKIKLILSD--------NDVVLRRKFFNMFSTfkSVKYIDKVSEIILQMGARHK-SYGINEKHLELMKEPLFESLHEVLGDEKFNYYKAGWEIGYQEVENLFKEG--
>SRR5436190_9873117 
-GITHSDILLVQTTWNAVSEFSMKIVAGFYKHLFAAAPEVKPMFTT-ET-------SEQQKRMGSMINTIVNSADSLDEFRgsiSQLAKKHV-HMGVKKEYFPIVVKAIISSVEDQYGSGFTTAHKKAWYKILNEISNIMIEE--
>tr|A0A1X1R5G7|A0A1X1R5G7_9MYCO Uncharacterized protein OS=Mycobacterium bohemicum OX=56425 GN=AWB93_09655 PE=4 SV=1
------TTSPVVVSLELYAEHVGDPIPIIYQRFYTAHPDAEAEFAG-DH-------HLEQRMMGGVLQMLIDLT-EGSfapSGCTYWLWDHI-GWGVTEQMVCDMFEAVVATIREGLGERWTPDMTSSWRDLISRLQPVLHAGF-
>SRR5699024_11940786 
FRRVLFRSEIVKSTAPVLKENSDKIGKRFYEKLFSKAPELYNIFNQTNQER----G-IQQEALAYSVYAAGENIDQLDNLKELISRVtekHA-ALGVKADRKSTRLNSSHVSISYAVFc----------LKKKX------------
>ERR1719310_1734953
---SASSVKAVQASWAKAENIGlRVVGELFFKELFEASPAAKELFTAqkFgEDAAGQRRFKAHTLNVMQTLSAAVYGLSDLSALARTLPAPtyaIL-SLSFTLISFTSL--------------SLTPLI--------------------
>ERR1712087_347811
---------------------------------------HEELFTAqkkFgEDAAGKAHFKAHTLNVMQTLAAAVYGLSDLSALARTLPARiyaIL-SLSFTLITFTSLSLTPLIYHTLTLKGARARNSGRaaPWIRRPT-----------
>SRR5438874_997478 
-----------------------XM------CTMHRHALRFPPAPN--------WAATRTTTPL-TTVTHRTAEVHPGRFAGSLRWLgraHG-KFHAPPAQYDVVRAALMDSLRAFAGEQWLPEYDQAWRDAYDVIARRMIQ---
>SRR6266511_448526 
-------RRRRRRAATSSGRASHRLRDsRLEARARDRSRRVLDDASS--------WVEVVRLGDAGEPVVLVSAVAAIAHRDVRRVELareGE-RVRL-------QVLNVDAEEDDLAGEHWSVEYDQAWRDAYDRIARVMIM---
>SRR3954451_10251525 
-------TSARRqqWTFPRCGPTspRPQRPGTRARCTSTPTCSCAIPRPA--------RCSRSRWRT-SGTGSSPPSATWLPgsttstRSCPSCSSSggtTG-SSGPSrRTTRPSVPacWPRSSTSTTS-GARNSPRAGRrptTASRAPDVLATVMIE---
>ERR671928_16913 
------------------------------------------------------------ALYFDGIDTGR------LRVHQTKLLVqvtGG-PVEYDGRELAVAHGGLDITLEHFD-PGWTPELARDWTQAYQLVAKVMID---
>tr|B7G0J4|B7G0J4_PHATC Predicted protein OS=Phaeodactylum tricornutum (strain CCAP 1055/1) GN=PHATRDRAFT_46237 PE=3 SV=1
-----HRKKMIQQTWRAVEFgLDVDCTRIFYTELFRKYPSVQPMFQHS-------NMEVQAQKLYEVIRVAVRFLDNVQELIPVLKDLgmrHAKHYGVLREHYDAVTEVFISVLNNYILteldcgnaGIWAMEVADAWHWVLTFIGNTMAD---
>GraSoiStandDraft_52_1057288.scaffolds.fasta_scaffold278261_1 # 2 # 652 # 1 # ID=278261_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.575
-----------------------------------------------------------------------------------------------AFLPAQRRAKLM-TSRLSSEPPWKGPAAEPSWHVLG----TMVG---
>SRR4026207_1965376 
-LLRRALCRRA-QSAAAVSRRPDPASGSFRSRHRA----GR---PE--------SGRNGRGRRDPALALLSKTLDEMAPLREPLRDLgaqHV-HWGARPEDYITAREALVAALGA-LSPNWDETLEGDWRRAITAIIVPMIE---
>ERR1719359_2370951
------------RLIVTPEHlDGCRAGLLALRVVLLHLGEGLGLLGSDSSGvsDCGVALgel-------PLQRLDLLGVLLGPR-----L---gl-L-NAGVRGLELSLLGRLlrvglselfVAEGLLLGL----------------------------
>ERR671911_2215695 
---------------ELEPAcaPDKQLVEHVQRlRVEAGAQVVGR-----E-------EerrsragqcprptsRVDVRGTHDD--------APLECVAEVLVDCgahAR-VACKVDergraaleLLDRVVPDDLVVDLHAVDEVDGGGQTgHVGPGTSSRRVstarakpQAGTLPQ--
>SRR4051812_41451604 
-------------------------------QLAAAGPVLGARFAGGD-------RppraaavrprprRVGRRGGPLDRVPPPPRRDAARAAGARLRGRgaaRA-AGAGGRDQPLRVRDARVGAPVAVRGDLGGAAGIAAHYPVVGAVLIASMAD--
>SRR4051812_21433834 
-------------------------------QLAAADPVLGAGHAGGG-------TparaaavrapprRVGRRGRLLDRVPPPPRRDAARAALARLRGRgaaRA-AGAGGCHQPLRVRDARVGAPVAVRGDLGGAAGIAAAGAPSGSPWTLTRSK--
>SRR3974377_1684031 
-IMAPEHKRLLAESFSKLENRLDDLGSLLFQKMFEISPESRSLFKG--------DIEEQKLKVARFFAEVIRRRTRShhflpvtgkggEVIIPgvgPLGARHEINYGVRAKHYGYMREALLYAISTMLGSEYNEEIGRAWGETFDMLAGAMQK---
>APCry1669189000_1035189.scaffolds.fasta_scaffold267513_1 # 3 # 467 # -1 # ID=267513_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658
-VLSDQHKKVIVRNWTILSTDLSGRGTRIFLLIFGRNPLIKSIFSFGHLegdeLVCDPRFKGHALRFMQAVGAVVDNIDDYNNaVkpiLNDLGRRHTQFKGFKPIYFNEFQDSILQVSENGTCKQngeiriLNPSaagvnfCTPPLGKFSASEMTCIVSsGA-
>tr|A0A2W1CGM6|A0A2W1CGM6_HELAM Uncharacterized protein OS=Helicoverpa armigera OX=29058 GN=HaOG211460 PE=4 SV=1
-GMSLRDVYNVQQSWKTIHANPLDNGYLMFFRLFEADPETKTFFKILDNarSeadmKAYVKFKAHILNIMGALNNSVVNLDKPEVvvvWMEKLGTAHQ-KFNIRERHFWVFRDVLVNILQNDLK--LSEPIVKSWGRYVTFIYSHI-----
>tr|F2Q9X8|F2Q9X8_BRAFL Globin OS=Branchiostoma floridae OX=7739 GN=lGb13 PE=2 SV=1
-PLDAWQRFYLQKSWKTVARKSDQAARTVFLRMLQDNPGLRQKWPRISlL-teeeiPTSPYIKFLGERIFDCLDYIIDNLGDLDHVISELtklGRQHSDMNVMTPEDVWAIEAAFLAGVQECLEDRFTIKYEEIYSRFIVFVIETMVIGFD
>SRR4029079_30121 
---------------------------------------------------------------------------------------------------MHGMH--FWflnnHKNNKMTQKQTELVRSTWSMV-----AAMDH---
>SRR3546814_3749254 
-------------------------CLFFFFCFFFSSIRRHTRCA---LVTG--VQTCALPILFNAIAAYASNIENLPALLPAVEKIaqkHT-SFQIKPEQYNIVGTHLLATLDEMFSP--GQGVLDAWGKAYRSEERRV-GK--
>tr|A0A1K0GS94|A0A1K0GS94_9ACTN Globin OS=Couchioplanes caeruleus subsp. caeruleus OX=56427 GN=BG844_22340 PE=4 SV=1
-GMNPaddaelhAVQRLLISSLEQAGGQVEVATR-LRAALAQAGPALFARIPG--------GPLAQVEQLAEGLAWLAQHTDqP-PALVAGFGRLgavLA-ECGIAPQQLQLAGAALAEAMRAgMAANGWRQDYDQAwrstWQHAYQWIAHGMVAA--
>tr|A0A077WN08|A0A077WN08_9FUNG Uncharacterized protein OS=Lichtheimia ramosa OX=688394 GN=LRAMOSA02110 PE=3 SV=1
-PPSQAQLNVIRDSWERVLSTpinnnntdqsstssnstlsttpsaSSAFHHAFFEALFTLDPNLTTWFPN---------VKRQARALTGIVSYVVRapailpvkyktykSLREMhqiqqtldeeeeqwmREQLKALGARHA-VHhQIQIDMLDHVGPALISALYQRLDSEFSPAMRDAWLHALHYVVYYMKQ---
>SRR6267143_1520378 
--VTLEQIQMVQASFAKIAPIVGPATDRKLRRCSALVAGFrkeTRLSTG--------VSKNPGRSEVRGTLCGASCCGSLSS---------------------------NWVANIRRGI----------SP-LALAIASI-----
>tr|M3IRU3|M3IRU3_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) GN=G210_0056 PE=3 SV=1
QELTPDQLRLITECIPIMEDLNLTLGSKFYRRTTRRHPHLQSYFNETHH-----KLLRQPRAFIFTLIMFAKNIHDLTPLRDVIRRIvskHV-GLQVKPDHYPLLGDVLIETLCDMFPYHmVDDKFKTTWSIVYANLASLLIG---
>ERR1712228_269173
---SETMKGDVVRSWDMIQELgTNAVGERIYRVFFELAPEAVEKFPAHvRHkyrewtadeSddeadlR--nsAALRKLFAKVLNAVGCVVAGLLGDAFTPEVEN--awNV-VYGf---------ASSIMISGLKQAKEAAQVRALQDS-DCAV-----------
>ERR1711918_283694
------------------------------------------------------GSECSWMCRC---GIARFEQT-------RTTSHksrRA-TYRvqPDRGILAHPGESCDDHFGGAPWGGLHPEVENAWNVVYGFPSSIMISGPR
>SRR6516162_1580517 
--------RVRRARCSAatesTATNTASVPGCSFAYFFACAYSASA-----------------------------C-ASSCNlnPVMV--SWGAL-GSSLKRSHFDAFGDALIWCLEHQFGAAFPPELREAWITALRRGPNG------
>SRR5262245_22234373 
--SADFDREPIREVLTRLAADPEVTMGYLYAWLFTAYPELRSLFPH--------AMTQTRAAVFGKLVSVLAGLDDRLQTEQALARLaidHR-KFGVKEKHYQPFFDALYVTAQHAAGSAWTREMAAALRSALDWFGSIMQA---
>ERR1719495_1281412
-MFKANEVTELRLSWNAwVAGDLANKGFELFCKMFEKNPDTKNVFDFMKGSsvtqmQGSSKVLFHVTRVMKNIDDVVKHADRLDEIVPILRQVggrHGtQGYNVPSGYFPFLGNALRELLRTKYS-GYNTNLDENWKKLWNFIVKEMHAG--
>ERR1712105_94955
-EFKPNEIMDMRVMWNGwVSGDLASKGFEMFCKMFEMHPETKNVFAFMKGSsvaqmQSSAKVLFHVTRVMKYIDEVVKHADKLDEVVPIMRQVggrHGtHGYNIQSGYFPHLGEAQRLLLKDFFKDRYTANMDAIFKKLWVFIVKQMQAG--
>SRR5260370_506041 
----------------VRD---YSSTCSF--------FFFLQAEDG--------IRDSS--VTGVQ---TCALPIYQERTEQVLSRLavdHR-KFGVRDKHYEPFFDAVFATAEHAAGPAWTREMATAWRSALDWFGSVMA----
>SRR5580658_2929351 
----APLRAIV-EEVLRSGGG------------------------------------------------------------------nvAA-GTGVRRNASLFHGAREPPGFYD--MpGLRELSSSYPWFQV---VP-VIS----
>SRR5258708_13478776 
----APLKAII-QGILRA----------------------------------------------------------------------G-GPLLRRETRPLVGAPRGQKALL--PpHPPGSGSVASRPKG---IS-L------
>SRR6266704_2687724 
-----IARPPDR-RPRCGD---GVLLR-P--------AVHRQSRPA-------------RAVSLRDDANPRGGLPDADRAGQEP--GrraCD-RAGPRPDRQGPpqirrepeALPAVLR-RAVRDGRAFRRPGPDRRDGRGLA----------
>SRR6266536_777504 
----DGYREALDASFARVASSGEKAVAYFYGRLFAATPRLRGLFPA--------AMDYQRDRLLCALLQITQRLSN-rAALSEYLVQLgrdHR-PPGVPPAV--PGGAACEHPNPTLA-pGVAPllsgvraagqrvarVPHPRRPRRLGQHVPGAVH----
>SRR6202030_4225180 
----YRAN--A-EAGTFP----------------------------------------------------------------------D-STQEPPETGPYRVAPSDARLLRKSLaLLEPQSE--------------------
>SRR5256886_2416282 
------DREADADREADADRDGDAEPEPLTAPALSSPPAV-PLAPP--------RDEAARQHdEPEPAPPPDQVPGAAdpretagppeppeeppP--------DgkgEP-AAG-----PDPAIAAGQEALRAFARE--afTSAAEEAWTQVYLAGSSLMIK---
>SRR5581483_8202477 
----------PDDPVFDGMqgNVGRvaarylphrEGEAYVAGPVGMVRETIRALTRA--------GLPRERIHYDDALLAEDKQASAQgvagatahtsrtpessrPGRTGEAGNAgpdGH-IrrvaesdqAGPAGGTAEPGQSGLRDAAADIAPQ--------ADTAHQDGGPHDDQagA---
>tr|A0A2G8KCQ8|A0A2G8KCQ8_STIJA Globin (Fragment) OS=Stichopus japonicus GN=BSL78_17342 PE=4 SV=1
-GLSTVEKDHIRKSWTALMKNKNENATLLIVNLFKMSEGAQDVFPKFKGknpdeLKKSIGVRSHGLRVLAALNSVVENLDDIECLVDMLQHIaHShHPRGTSRKHFEDLGGVVIATFEEALGKKFTDDAKNAWAKAYGVILGVIKSEY-
>ERR1719203_2782565
--------ITSKFGWTSNMQ--------------KIIQSQTHSKTQDMqrDYYLNQK-KTLEI----------------NVRHPLMKELlrrVE-----DNPEDKVAKdMATMMFNTATLRSGFSLKDTVNFAESIELMMRQTLG---
>ERR1719343_1244138
--------LVGV-SWFfSSEKFsGRMQNFWILKALFGTSFPLLfvwvialVIVSIHTGSFIAPLIVX------------------------------------------------------------------------------------
>tr|A0A0P4VK04|A0A0P4VK04_9ANNE Extracellular globin OS=Glossoscolex paulistus GN=HgBp PE=3 SV=1
---SAEDRRELKFIWNYIWASGftdrkAAIAGAVFKDLFQHYPSAHDLFTRVKVdEPDSGEYRSHLIRVANGLDLLIGLLDDTQVLDHQLNHLadqHILRKGVTQQFFKGIGESFARVFPQVS-SCFNV---DAWNRCFHRLANRISKD--
>tr|A0A0S2MLN3|A0A0S2MLN3_SEEJO Extracellular globin OS=Seepiophila jonesi PE=2 SV=1
---NSLERIKVKMQWAKAFGYGasrAKFGDALWTNVFNYAPTVRPIFYSVNSkDMKSPKFQAHVARVLGGLDRVISMLDSEPTLNADLAHLksqHDPR-ELDPTAFVVFRQALIATVAGTFGVCFDV---PAWQQCFNVIAMGITGS--
>tr|A0A2W5I8T1|A0A2W5I8T1_9ACTN Uncharacterized protein OS=Lawsonella clevelandensis OX=1528099 GN=DI579_06450 PE=4 SV=1
-----TYYTVLGPAITLLREHPEDFMRHFLAAALTYDFHFHTFFPS--------VNDHHASRYTHALRYILEALDQstndpdcLDDVIDFLSQLgcdQR-KYQLTAEQYQSLAAALRDTFALLLPYQWSTELNDALLTSFEHAINVMQS---
>tr|A0A177JSP9|A0A177JSP9_9ACTN Oxidoreductase OS=Dietzia cinnamea OX=321318 GN=AYJ66_05610 PE=4 SV=1
-----AQAPPLLALRDLLA--DDRFPDLFARALRATDPDFRELFPR--------DATPVLREFVRAMTWAFETTEYahgdrskVEEVVEFARHLgadHR-KLDLAPRHHQRFGEALTHTLRHLAGRGWDDRLETTLATAYRVLSTALQQ---
>tr|A0A173LPQ6|A0A173LPQ6_9ACTN Phenol hydroxylase P5 protein OS=Dietzia timorensis OX=499555 GN=BJL86_2914 PE=4 SV=1
-----DQLPALLALRELTYRessdVAPDFRRALEDALNTEAPYLRADLPR--------NLDGPFATFVKLYRFLLTRVEDsggdrakVDDVLDLCRELghdLA-KYNVVEEQYERFGHALNAALARVAGEEWTGELSKVQNQFYVIIARALHK---
>tr|A0A2N6TBK5|A0A2N6TBK5_9CORY NAD(P)H-flavin reductase OS=Corynebacterium kroppenstedtii OX=161879 GN=CJ202_05310 PE=4 SV=1
-----VHEASLVPVVTVLQTDGSRFVDAVFTHLFARRPSFIRRLPA--------DLSQLKPSFRRALVHVYAKQATgnglDRRTRRFLRHLaedHR-SFGVEAPDYVAMGDAIIDAGREIIAPQVTSEEFELFAMATGQIIGLMEE---
>tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 OX=1581089 GN=HMPREF3121_11375 PE=4 SV=1
-----------MRAAAAFGRQAPTIGPEAFRRLLDAEPRFRHMFGG--------SKTALRDQFMSALSTALVTRADvgrfPAATIRRLEQLareNR-KFGVAPRDYATLAEHLLDVFGERLPAgpdsgAQVDALREILDEAMSLI-AAAAV---
>tr|M3VCE7|M3VCE7_9ACTN Putative oxidoreductase OS=Gordonia malaquae NBRC 108250 OX=1223542 GN=GM1_049_00130 PE=3 SV=1
-------QPVLTVLRDRIAHDPDRFAVGVFNRLFAETPFLRELFPS--------EMSRMRATFTQVVDHVLDAIANdddHAELIEFLAQLgrdHR-KFGVIGDHYWLMYDALMAEFAAMLGPGWSPDAQEATSHAMMLMTGVMRG---
>tr|A0A2D6MQX9|A0A2D6MQX9_9DELT Uncharacterized protein OS=Deltaproteobacteria bacterium OX=2026735 GN=CL908_08110 PE=3 SV=1
----TEDHELLLQSLDRVMHGEVDLSTRLYERLFSRHPELRELFGP--------NSIPvQEEMITETLISAVDDLEGLpwiEDNMQLLSQKHS-DADVTSEMYDWWAECVIETLAELSAPDWNRRLEELWRKQIARLCELMRAET-
>SRR5207245_2384740 
-NPQPST-HAVTEQVVTLDV------LPWTSGKLGLGPGKarlsEPLAPG--------DTLE---SL----------LERQRARIpgfeewvYDArerriheHCTLL-VNGQAEYRRHTAEVEI------------------------------------
>SRR5689334_4915957 
------------------------------TASQRVTP----SLRG--------KRVPSGQmgdRKVPD-VPIVDAHVHLWDPTafrmpwlDGNKRLNR-PYGLADYREQTAGLPI------------------------------------
>MudIll2142460700_1097286.scaffolds.fasta_scaffold02451_1 # 3 # 1031 # -1 # ID=2451_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.574
----------------------------------------miGSRALA--------ALFPHPKTFMDTKRPVADTHIHLWDPGyltypwlETVpaiagph----G-PAELQVQEPETDRFRL------------------------------------
>SaaInlV_200m_DNA_2_1039689.scaffolds.fasta_scaffold02144_7 # 4497 # 5432 # 1 # ID=2144_7;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.499
-----------------------------------------------------------LQCGVATVRSVIDSHVHFWQPQrlrylwlDEVpair----H-PFTPHELNQATQAIDL------------------------------------
>SRR6266704_3508957 
--------TITRAEFCAGRSNRgskQAFACECYATLIRLHPEVKPLFTHTS-------MEKQAKKFMASLTLVLHVLGKPDVLTTTLQRLgrrHQ-TMGVRVEHYPMVAEALLATLKSGYAVVLLT----LFVQSYMFL---VRKGA-
>SRR6478736_5796684 
-------------------------------FMMGV---IASGMVVTGA-----ERRGRPKAVQPGNREWITVIQAINAEGQA--------------------------IP-PFIIGAGQYHLANWYRDSNLPGNWAIA---
>ERR1711935_979896
-------YSEVMNSWQRVRRvkdFDKTLGVLVFSKFFSKHPDATKIFGIEEEgeelVDTSASFVPQATKFVGLCDNFIDMLGPdsdlLKDILAEEGRKH-ARRGVELYHYPAIGEALISGIRAM--DvKFNDDTELCWRKVYCGVTHDLGKAV-
>ERR1712137_931585
--------------------------------------MGTSLLGVDCEgeefVKT-DSFVPQAKKFIGLCDSFIDMLGPdaelMAKILEAEGRKH-EKLGIKLEHYSTMGEALISGVKTL--DeKFNDETELCWKLVYCGVTNNLGKAN-
>tr|A0A210PV81|A0A210PV81_MIZYE Globin OS=Mizuhopecten yessoensis GN=KP79_PYT16126 PE=3 SV=1
-GLTERELKMIKVSWDVLAEDKKSNGVKFFMTLFTIFPTSKDLFKHFkDVPldqlkydgettKSNKKMVAHAMSVMYALESYVDSLDDAYcleELVKKVAISHK-PRGIGPDKFKLLTPVLHAVIEDLVKDDDSvdlETIKSGWTKLIDTVCDIVEK---
>tr|A0A226E0J1|A0A226E0J1_FOLCA Hemocyanin OS=Folsomia candida GN=Fcan01_14017 PE=3 SV=1
VQLTPDEMIAIKRNWEVIHQDLTGNGMDMYLHWFAAFPHMQKVFKKFaQVPrdqlKTNDAFKAQATVTLHWIDDMIEAIDSPSDMAavmKRLGRMHQ-TRHTNIYDFREMVKRIQEVIGTKVGEGYTPAAESGWTKLFAKLVENIGD---
>SRR5947199_2475351 
----------------------DELARAVR---lQ--gSRRIMEEHAcG--------AEGRQLARLFDERGRLARAP---RAVDEPGLELgarvsdgrcglakigdvverivqaedvdavRR-AGGDELADEVIVS-------------rtRADDEtseqrepayrigprtqCSDAFRRGLERPAGAPVQT--
>SRR6266516_4891354 
-----------------------------------------------------------------------------GLGDGGRAEGgnrDS-GRGEQLEHLGCVHDVLLSFSESTVSTlphqaarpapaaegagpAITRRetadrapprrhrvggfLRSAGAARARSSIDRMTET--
>SRR6266508_4596506 
------------SAFVRL-t-DARRVARCLPSAH---pGDETPSTFPs---------ETGDPVNLN--------------LEALETSFDLvapRG-DG-SEATEDDVVGHPGPPA--QVA-PRPRGDRPQAA----------------
>tr|A0A1Q9CVT6|A0A1Q9CVT6_SYMMI Eukaryotic peptide chain release factor GTP-binding subunit OS=Symbiodinium microadriaticum GN=SUP35 PE=3 SV=1
-VPSSGTISTVQQSWMVVKELgVANIGEIMYKHLFKIAPVTKSLFPVSvRKRyrdwscseeevedgfENSPALRNLFAKVVEAVGSAVAGLHNISRLVAELNALgmrHI-NYNMKEEFFEYGGQALVLTLQDGLGTSLTEDVKQAWVAVYEFISACIISGLR
>ERR1719433_537024
-ALRISIVGREKRA-NCTVTLgRVEQGELQVGATVLLVPPGAECGVQSvEVDgrevrsaqagefvcmRLLgcQP---SVGHALSSVD---GPLRSATKLKVRSAQAgefV------------------------------------------------------
>ERR1719161_1849694
-ALRVMVLGMTADKVG-AALEgHVEQGTLRAGTRCLAAlsEGQAECNVQIvLLNgvevshagpgehvrlKVTgaAAKGFTAGQVLSCIS---NPVRAIGKFKAKLRLMslpEM-LS----------CSLLVL----------------------------------
>ERR1719271_149007
--VSARERRLIERTWEKAKEDgCDALGANLLQTLLVAEPQVMQLFPFKDEenVYESLRFKAHASKLAVIIDAAVSLLANPVKLEsllISVATSYEYsFKQMLPEHFPLLGEALIRTLTSIVGgTKFTWQAESAWRKVWTIISTVMIGAI-
>DEB0MinimDraft_4_1074332.scaffolds.fasta_scaffold429043_1 # 3 # 377 # 1 # ID=429043_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.227
-------LELIQQTWEKVKPHGKEWGPKFYNNMWTKYPEVRAQFFP--E----SKPEIQGPRLYASLNFMIKNATDIETLKqycFNMGDRHK-KYHCAAEHFKVVGDAFIMTLTEFLGDEFTPEIKQQFQLLYDTVAEMTI----
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold5203666_1 # 3 # 269 # -1 # ID=5203666_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.315
------------------------------------------------------DFESQGRALTRMLAWIIQNMSNVSQLVPVLAQMggrHE-IYGVKDADFGTFATTVANSFRSVLGPEIiDDDAHQAWESCISGIGGLMQL---
>SRR5438477_4839339 
-------------------------------------HGIEP-IPH--------RYAAIRRVVSGRE--------------AQARRVgqrHH-AAREDQRR-------LRGL----ERRRG-RPPARHVRL---------AA---
>UPI0003969FE8 status=active
-----RPFEAA-----------------DRELLFGRAQDIRAVVEQ--------LRTDPLVLVTGDSGVGKSSLCRAGVLPQIREGAlndVR-RWSVAV---LSPGRWLLDTLGDA----LA-----------------------
>OM-RGC.v1.018126893 TARA_122_DCM_0.45-0.8_C18859060_1_gene481717	COG0677	K02474
-----SELW-------RGRPRKTSLPAgssiRTRTAvlvplgrgketapssssanfvlnLTDVPPEAQELRiTA--------EVDDQRIHFQRRVPADVD-----KVVMELPEGSlarKV-R--VEVAAFD---------------------------RR-CS-IAAFRA---
>ERR1719491_698649
----------------KLRAsedvsiSLIIFFSGSSSRFFKQQPDASSVFG-FDNNneniHKTPKFIDFANHFVEVIDQAVQMLGPdlelLTDFFVDLGDKHSKEYGIKPKFYPILGRVLMEQLEEMLGHNvFTVHTKVCWLQVYEAFARDMTST--
>tr|A0A147B4Z8|A0A147B4Z8_FUNHE Neuroglobin (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
-ELSVKDKELIRGSWESLGKNKVPHGVIMFSRLFELDPALLSLFHYStkcDSKqdcLSSPEFLDHVTKVMLVIDAAVSHLDDLHSleeFLLNLGRKHQ-AVGVSTQSFTEVGESLLYMLQCSLGQAYTAPLRQAWLNMYSIVVAVMSRGW-
>ERR1740115_393061
NLLTPETVRVVKETSPRIASMAPALSSSFFKRFLS-HPDLAAYKASRH------NGEAKAAAVAAAVTGIGDSIDNLRSLsgaITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAWDEAIMVLADICVD---
>ERR1719469_1495088
NLLTPETVRVVKETSPRIASMAPALSSSFFKRFLS-HPDLAAYKASRH------NGEAKVPLTHTPP-------------FLSLPHPHS-SLPLPSSPFL-------SL---------------------------------
>sp|Q5KSB7|GLBB1_OLIMA Extracellular giant hemoglobin major globin subunit B1 OS=Oligobrachia mashikoi OX=55676 GN=ghbB1 PE=1 SV=1
---SRGDAEVVISEWDQVFNAAmagsseSAVGVAIFDAFFASSGVSPSMFP--GGgDSNNPEFLAQVSRVVSGADIAINSLTNRATCDSLLSHLnaqHRAISGVTGAAVTHLSQAISSVVAQVL-PSAHI---DAWEYCMAYIAAGIGAG--
>ERR1719246_379870
---TEKIKDDVQKSWDRILEVGiLYAGEVLYKKLFEIAPVAEEHLPPHIIAkyqqssfdageedqefVRNATLAKMFSKIFNAVGCAITGLHDLGKLVPMLLSLgarMG-GYWDSckydvaGNPWRYVFARCRASLDDGVRLhIVHHDTGFARGQGSCRVSX-------
>SRR5687767_4837246 
----EKQVLLVKHSWSYQAGQLENLGTLFTKKLVALNPGLKAPMKR--------SLAETGSySLMVAMNQIVAALPDLHKAQNHIQVIvteYA-ALGITRSDYENALIAFLLALEKRLGKSWSDEIREAWIFIFSSLYH-------
>SRR5215212_6395769 
---------------------------------ASLSPELKPLLKK--------LDQEKRLpHLFITVNDIVASIPDFKRSEKQALALiadYA-DKSISLSVYESALIAFLMALEKKLGKHWSSEMREAWILVFASLRQ-------
>ERR1711963_100213
-SLSEGTVEVLKACHPLLKDVRRVIGKAFYNRLFKEYPQVKPLFSQSD-----AARTHQTLALADALIAFTGRQLLEG-F-EAKQRGqeRS-LRLRSLQAGSWQGLWRLPSRDRGERD---QNEGSQIKPQILTIQ---QDI--
>ERR550517_4578
-KFDPDELIALRLSWHAwVAGDLSGKGFDLFAKMFEQRKETKEVFAFAKgtDarqMQNSSKVLFHVSRVMKYIDDTVKHADRLQDVVATLRQIggrHGhNGYDVASAYFPYLGNALRTLIKANYKG-WDSKLEDIWTRLWGFITAQMMH---
>tr|A0A0L0FER9|A0A0L0FER9_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12208 PE=3 SV=1 
MSLTPRQCEMIKSSWKEASQGgkptefrALRFVMDFYSHLFDLAPSTKSMFKG--------GMANQGKALVGMLDIVVNHIDSLATikgDVELLGQRHA-KYGVTSNMYVTAGRALVMALAPRIPDDeDKPECASAWMDAYSFLASIMCN---
>tr|A0A1X7UGV4|A0A1X7UGV4_AMPQE Uncharacterized protein OS=Amphimedon queenslandica PE=3 SV=1
MSLTSAQVALIESTWKVVKKDLQGAGNIMFLKLFQIDVSVRDKFPFRDVPyeelEDSESFLKHSLQVMETIDLAITLLlGGEMEkLVEalvDLGMAHA-MQGLKPEDFDHVGEALVHALGVALGKEFNDEAKKAWTLLYSVVTAKMKEGL-
>ERR1712080_154454
----DLQKIIVKHQWARSYNEgmsREYFGQAIWRAFFKLDPGARRFFTRVrGDDISHPKFQAHSLRILGGIDMCLSLIDDVPTFEAQMKHLqgqHI-EREVPSYYFDRLGTVLQEVMRAATGYCYDE---VAWGACYKYISDRIKANY-
>SRR5476649_891947 
--------------------------------------------------------ATSTRCCS--ATSRKCCRCSIKPTRPTASSsarwptpcWltqEI-SIawNnWARWHRPSStSMCRCKSsgNTIPWSAPrCSRRYVKCWAPRWRPmpsstpgpprtvsWRTCWPV---
>SRR6188768_2515855 
-XMDSGQTALLKASFQRLSTVSELGAELFAGRLYLLDPPLWHHLGLG--------GRSAQHALLRMLARVIEDLDRFEELASTLEAVarrCA-SEGMDAAQFDTIAETLFWTLQQVLGDTYQAPIAAAWREAGGLLIGRMKA---
>APLak6261669570_1056073.scaffolds.fasta_scaffold275140_1 # 52 # 198 # 1 # ID=275140_1;partial=01;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.524
--WSTRRVKVVQRSWETFKStqaESTTVGLAVFKRFLRRSPAFLQLFPFRDQPLetlfLNAKVRLHCKLFADTVSRTVGLLGDSVAVKASLRELgarHSDLYKVRSGHYAAMGSALLEVLEHNLGESWDEETKTAWEETWAYITEQMQK---
>tr|A0A0Q5LAI2|A0A0Q5LAI2_9MICO Uncharacterized protein OS=Frigoribacterium sp. Leaf164 OX=1736282 GN=ASF82_14980 PE=4 SV=1
-VITSSHLTALRSTLPLVEARAAAIADDFYARLFADRPDLLrDQFNRGDQ-----AQGRQQRELALTIVTVARDVVgtqvgsgpagsatgpavpvapwsspapspwavrvAARETLSRLAQRHA-AIGVTRDEHDVFERHLRDAFAAALGDDWSGVVVDAWLALWRQTRDELVA---
>ERR1719383_514948
----------------------------------------------RGRLvegrwRFDSARVKSCVddrqGCVETWQHGRRR--------SNAPQVgnhAR-GLRCAQAHYDVVGQALVTTLASY--CTFTDPVKNAWIKLCGVIKATMVH---
>ERR1719284_1849230
-PLDGRDVALIQHSWKEVGQaPADEVAREIFRNIFAIEPGALELFPFKNESedglwREGGERDFSKYFRHRAWCSGAVSFQKX-----------------------------------------------------------------
>SRR5204863_5655766 
-IMTPEAIGLIKSSYAGVTAIPRQLAARFYHELFTVAPNLRPLFPG-D-------LTNLQGHFEAALALVVRNLDEVEVLRPALRDLgaqHV-HWGARPEDYETARDALVAAIGALS-ANWDETLARDWRRAVTAIIVPMIEG--
>tr|F2Q9X2|F2Q9X2_BRAFL Globin OS=Branchiostoma floridae GN=lGb7 PE=2 SV=1
MSLSAADKKLVQESWDKVSKpSFADAGERVFLKLFRRNESTKAHFKKFkDIPsdqlAGQAVVRDHGEKVCKVLDDFIKGLDGSgDEAVKKVGRMHK-GLGMSNEQIDQMKGAIIEVLADAgFGD---ANYKGAWGKLWDRFMAVHR----
>SRR5580698_8666230 
---PDLEKMAARSPWLTVTA-----------------------------------------------------------------SLsaePV-SLGHGPRTEHgtvADVLARLGTWREHD--------------AYVCGSSAMVAA--
>SRR5919204_299658 
---------------------------------------------------------------------SDL-RSGPTSRCTHVRC--R-QQRSPPRHHRClRPRSPAPSWSARlsagfrssscrpstnRPARRRGRGRSTILASYTRLASVMLDG--
>SRR5262245_42249746 
-AMTPEQIDLVQRNLPAVLSLQNRGP-RFHDHFVAVEPTRQFLFAGAD-------MGRQGAVLIDAIAVAIAASRsrEQ-DLSGALCQFHL-SYGVDAQRFQSAGKALVRMLEEEFGDRYFTQLGDAWIAACERVGQTIL----
>SRR3954454_16888348 
VISRSAVIRHVLPTP----aepAAVDHIGQQVADRTSQQDRGERVLLNRT--------aHGLR--ALADGAARLRIAAQS-vadVTRTPLVGVLrqlRS-ALGDVSHRLCGLSDHAEAllgAIKDVLGDAATDEILAAWGEAYWLLADVliar------
>SRR3954471_17335278 
VISRSAVIRHVLPTP----aepAAVDQIGQQVADRASDKDGGERVLLNRT--------aHGLR--ALADGAARLRIAIQS-iadVMRTPRVGVLgqlGG-ALGDVPHCLSGLSDDALGccaTCGCYLCR--------SRGGASWSFFCHaalr------
>SRR5215204_1408335 
ATGGPTRWATMRGRWPLMS---------MLESIAQSG-SGRPVWYVHGA----RDrrahaMGDHARALAADEHAGK------------HRAVrqrT-------------------------------AG---------------------
>ERR1719446_1443192
------------------------------------------------------------------------LAQDLSALCPE---Cgfk------VG--TMGVC---QTK------ANDAAIE-----------AKDPPVAT--
>sp|P02214|GLB_BUSCA Globin OS=Busycotypus canaliculatus PE=1 SV=1
-GLDGAQKTALKESWKVLGADGPtmmKNGSLLFGLLFKTYPDTKKHFKHFDDaTfaamDTTGVGKAHGVAVFSGLGSMICSIDDDDcvbGLAKKLSRNHL-ARGVSAADFKLLEAVFKZFLDEATQRKATDAQKDADGALLTMLIK-------
>SRR5690242_2028058 
-------LALLLQSYGRIGILIPKISENFYRRLFQLRPNLAALFANR----------DADLKVEEMLRRIVAHASDAAAAKAEVQssgRSHA-QWPLLPEDYRVAGECLIQAIIEAEGAATGSVVASIWRQAYVEVANLMIC---
>tr|A0A2T7P4S4|A0A2T7P4S4_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_10993 PE=3 SV=1
-VLTVQQKDMVQRSWATVMRrDLTAVGMLLFKNLFQQEPRIMTLFSLEasDDedLEQNLRLRLHAARFMQAVGAVIDNLQTPndklSALLSDIGERHSHLHSFHHEYFRAFREAFLTTLEHSLGKDrFKGELRAAWDSVIGFMTREMNHGHK
>SRR5581483_4049588 
---------------MRIAPHKEEFAATFYQALLEKYPHLSQFFVGVD-------LKRQQTSLIATLRAMLNESERGEalrMMFRKIGQKHA-DQQIRAEHYPAFGQTLLDTLALYD-PQWTDDLRKGWATALEQSVRIMMESYH
>SRR5690625_2040278 
--------------------DRDGFGARFTEELLSRYTEIREALPD--------EPAWVARAVTAVTDALIDVADDPGALVTVLERLgvdNR-TVGVHSAHYAPIGHALILAARAVGGTAWTPDIERAWVDGFDVAAEVMVT---
>tr|A0A0Q9HRJ4|A0A0Q9HRJ4_9BRAD Uncharacterized protein OS=Bosea sp. Root381 GN=ASE63_23130 PE=4 SV=1
--MGDRAISLALASLETMGSEAEQADIMFNIRLLETYPDVYRVFCM-D-------FAPEERSFLRALAFILAHAGPFGAIGPTVRALapsDK-VCRLISSRYHELEETLMWTLRRRLGVAFTAEVENAWRSVLREAPGVS-----
>SRR4051812_34838903 
------------------KPIRNRAIKLFFSRLIESHPSLLTVIGD-D-------YEAKARSLRPAVEMIIGCLGNMEALRPILRSMarsNA-ELGMQEHHYLTAVNTILWTMERCLGSAYSAEVDAAWEDVCWQVCEAM-----
>ERR1712110_1394717
-ILSKEETTLISASWDLVATDIPGNGSKFFTFLFDIHPDVRdKYFQPLLQSSTdvQRTLEKHGAKVVNAIGSLVTALNTedDGKLVTIIRQIthnHW-NRAItNSAPYQLVLDALLEFLAVALGSQLSPAGGAAWKKLFDAFVVVV-----
>ERR1711953_1620069
-------------TWAIVKLNMDKHGYKFFIRLFLDHPRIQtKHFSSISTSA--QSLTAHGLRFMMGIDSIIRFLELedEEGLRKRIQQIvtvHF-FKGItDPLDFEVLCNCLVDYLSTEVfGDHQL-----------------------
>ERR1719210_139600
---------------------------------FTLL-----DPPGQkRNvaqawSavvqadvaiLVVSANPGEFEAGLAK-------------------------GGQTREHAVLAKSAGVENLVVAVNKMDSVDGEGKWSNLryee------I------
>ERR1719428_2447797
------------------------------------L-----DAPGLgAYvpavwVaatqadiavLVISAKAGEFEAGISK-------------------------GGVTQEHALLAFSAGVTSIVVAVNKMD--DASVTWGEPrfkt------I------
>ERR1740121_1193106
--LSESERDALQQSWVQVQKVgFDCVGEVFSQKLFELAPSTHARAG------------MEWGPVVKGIGHTVDYLSRLEAVAvryRRLGVLHR-CIGVTERELKEMGDAFILTLRDVLGK--------------------------
>tr|A0A158NI97|A0A158NI97_ATTCE Uncharacterized protein OS=Atta cephalotes GN=105620364 PE=4 SV=1
------------------------------------------------------------MNIT--NGTIHDILSGGK-NtqkV--FL--FR-HRGRTKEVVEKEEKIRVAGLDtngshradCPKGTDEGREIGDPVTDSLLQMLQKKE----
>SRR5260221_159328 
------ALGLVREGFAAVIARPDVFVSELYQDFFTSNPRYRKYFGSADIGySgsadingtGSpeighaaADITRRNAKTVEAATRIVADLDRPGVLLPYLRKLaleYR-KYGVREAHYRAFAGSVMTALERTIGQAWTYEAAEAWVDELTMVASAMLG---
>ERR1711862_565156
----------------------------------------KIMFHFPvnmNIetVLKSKIFLQHAKFFVKTLDITIGLLGpdtdIIQDVLLEHSKTYQ-NHGVNSAMYLHMGESILYALEKDLGDvNFTSKDREAWAYFYGTIVGVIVGG--
>OM-RGC.v1.029911412 TARA_036_DCM_0.22-1.6_scaffold294997_1_gene285712	COG0526	K03671
---------------DRLRARGEPPSGNPYRGAAPYGPGDEALFFG--------RRAE--------LEVLIDRVQkTPFVLVAGDAGVgktS------------LCSAGLLPLVREgalGGPRHWACESIACGEEPLAALAAVLAR---
>tr|A0A2S2QIF8|A0A2S2QIF8_9HEMI Globin OS=Sipha flava OX=143950 GN=GLB PE=3 SV=1 
MALSPVQISRIRRSWSALAQDPTELASALVIRMFKENPEYISLFKRLkGLsideLQSNSQFKAHASKVGGALGATIDHLDKPEKLeelLTDIGIKHR-KYGLSPKHFEVIRNVLIAIIAEAIGD-TDPELLDLWKSSLTGVMSII-----
>SRR3546814_18929724 
-----------------------AITNAVYARLFQNKe--IEASFDRAAQ-----TSGEQTKRPSAENLAYAKNIDKLHNLGSAVSHMvarHM-QTVVRPHQYPHGPTALQHSNSAVPGQQMgTNTDPTP-----------------
>tr|T0QF73|T0QF73_9STRA Uncharacterized protein OS=Saprolegnia diclina VS20 GN=SDRG_06019 PE=3 SV=1
--ISKDVQALVLANWAAISSGSTPallkikpaspvvyFYDYFYGMIFEKAPAVKPLFRS--------SIIVQGKALINIIQSITSAVNAPNviEKVCDLAYRHN-KYGVKIEYFNLLGKCLLLAMHDCTGDTFTDELREAWRAAYAYMVMVMT----
>ERR1719402_1510571
-AISSITKSRSMYLWSIllnrkqhLEAFSVDNGWAAFVVFLLGDPHLLEG----------------GEGS---QDGSSNPYGVFPLRwsnDLHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQASFNNLLQFLVGNMKVGL-
>ERR1719187_1205752
-VLEDAEVEGVQTLWAEVSGDLAQFGARVFGRLVRDQPTIRKYFPWGrnDKTeeqlVDAPDTQKHAEEVFGALGKIIGAADHLNDYrsfLVYKGMQHI-PRGVKPEHFVYLKAALVDTLKEELGDKVTPAGEEGLNKVYSFVEKAMSKGL-
>ERR1719369_91055
-SIKRQFHLHSEAGWAKFAEDVAGNGAATFITLVHDHPEIRSVFPWGgkSY-lsVDDPDIRHHAELVFNGLGVAFNRIGHIHSLdgyYESLGLRHI-ARKVEMSFFDYVGDALSQTFQQILGGGYTADFKSGYSKVYAYVTQHMTAGL-
>ERR550519_2895140
--LSKAERKEAENAWRIFEVNLVDNGVDAFLNLVRDHPNRKDAFPWVkpELSeealRNDPEMKKLAKLVFSAVKPAFKSLGDLQSLtnyYLNIGNELS-LMNIPPVMVSYLSDAFKKTCQKLLGSDYTHSLEASIEYVYDFITSRMFEGM-
>ERR1719150_2276450
MGLTKAQVAAIQNNWATVSQNMQDVGDALFMRYLTANPGDLSFFPKFqGAGvgpqlHSNEDFQHQTLTVMQFLGQIVAHLGDIPAAEGMLRERvktHH-PRGISMAQFERLLDLVPRLVQEICGA--SGPTADAWRVAVATLMPSMRDEF-
>tr|D3DIC1|D3DIC1_HYDTT Bacterial hemoglobin OS=Hydrogenobacter thermophilus (strain DSM 6534 / IAM 12695 / TK-6) GN=hmp PE=3 SV=1
--MSPEARLNIIKSIPFLQSYGERLTSRMYEILFEGNPELKSMFESD-----------DSTKLAGALLAFAQNLERLNVLEPAlnkMALSHV-EAGVKPEHYEKVWDALYKAMTEFG---ISNEIIEAWKEAYYFLAELLIKK--
>SRR3990170_2029843 
-----------------------------SPCTTTRSPCWTRPCAS--------WAT-----------APTGSWAtstpPSSSRLPSCAR-csRR-RWTCSATG----CSRRSPAPRHYAEDVWVPELEDAWLRAYAAMSTTMIEG--
>ERR1719356_276690
-SLTEAELELIETVWAKAKAlVAEEFGMRLYRQVFDIAPEALQLFSFRDDSdpYESAEFKRQGQIVIAAFGKAVAVLRDPEALAPALDSLGDalaiSTDKVMLPHDRSVGKALLRTLRLELKDEFTLEAEKAWAKFWRILARTVQ----
>SRR4051812_22538299 
--INADTAVLIESGWNAAIDANGDFAANFYQNLFAAAPVVIELFSG-D-------MTEQKGRLTHTLAETVALLHNPEHLLLLLRASgvrHH-HYQVKQAYFGVMRNILIDTIAVRAGELFTAVHRQAWEGFFDNMATIMQGG--
>ERR1740128_83505
-GLSQREKQDIRHVWSLVSQDLESAGMGFFLAYFKAHPEYQSKFKAFaKvpmdELKDNRSFQMHAMNVMNAITLIVDTLENPEELVSGLKEMgvnHR-KRRIEAIHFHTWRRCCWPSCRVPWVRLSLNRPrrvgAKRWVSSSAPSWRR------
>tr|A0A1E4GLJ3|A0A1E4GLJ3_9CAUL Uncharacterized protein OS=Phenylobacterium sp. SCN 70-31 OX=1660129 GN=ABS78_22870 PE=4 SV=1
MATAFARAADIEASLELLAERDIDPTARVYQRMFELHPQMEPYFWR-DTD---GK--IRGE----MLSLAFAAILDFVGErryADhmIGTEMinHE-GYDVPRDVFATFFAIVRDALRDLLGADWTPVFESAWEEMLAEIESYARQ---
>tr|A0A2A5EUW5|A0A2A5EUW5_9RHIZ Globin OS=Rhodobiaceae bacterium OX=2026785 GN=COA62_02605 PE=4 SV=1
TQACTAASDPIVASLELVVDKCGDPTELVYKRLFAQHPDMKPLFLL-DKD---NS--VKGN----MLSQVLECFMDFTGKqhyAAnlIACERvnHE-MIGVPPEVFTTFFTTVVDTFKDILQDDWTPVYDAAWSDLVNDLTVSVDE---
>ERR1711860_359782
---LFSKSNYVFAS-----------LSRNTFKLFKDERSLYeKHFSSFDVN-DILRIRAHGLKVMKAVNSMVEAVSDENdeSLIDQIHFvahGHH-LRGITpRNEFEVRRKILNLDYHLLFHyllkkGCLSQSX--------------------
>SRR5256885_15743076 
------------------------------------------------------------KARMQPIATSDDALDRPAATVPALHARgtrTG-ANGVVDQHAETVGEALLWTHSKGSGRSPGaqgasPTIQHRDVHAMGVLTPTFRER--
>ERR1719329_2070839
-AMSDETVATVDATAATVAPHALDITKDFYAEMIESFPSVvLALFNPPHWR---RR-cARTPPTsRTCHHCSCLAAPSTPSITDTarsR-SFRHT-TRWCTTTSCGRWQRC---------SDqSWAARCPTPWST--------------
>SRR5256885_864722 
-VLTDRQRAIVQSTVPLLETGGEALITHFYQTMLGEYPEVRALFSMAHQQ------sGAQPRALAYSVLMYAKHIDRLEALGDlpaQIDRKST-RLNSSHLVISYAVFCLKKKKRTGSDS--------FTRSE-----RLVV----
>SRR5256885_6575144 
------------------------------------------------------------------XMVMSMRGPALEAAGTtgcRSCSAAV-CCSFF--------FQAEDGIRDYkvtgvqTCAlP---------------ISDILIGA--
>tr|T2IER8|T2IER8_CROWT Uncharacterized protein OS=Crocosphaera watsonii WH 8502 GN=CWATWH8502_4740 PE=4 SV=1
----------------------------MYEIAFNERPEYRRFFKNTHMK-SPEEGRKQAAKLAASVYAYASHIDELWTLNKKTIvsvNFTL-NI------SPELK---------------------------------------
>SRR5690625_6805322 
-------------RSPSHSQtltLSPYTTLFRSRNLLRNHPELKNYFNTANQ-----VNGFQPRALASIILQFAKNINHIyeiVPKLERVCQKHC-SLGVQPRSEEHTSELQ------SRGHTVCRLL--------------------
>tr|A0A244CWV0|A0A244CWV0_9GAMM Diguanylate cyclase OS=Pseudoalteromonas ulvae OX=107327 GN=B1199_05805 PE=4 SV=1
----------------------------------------------MET-------VNSKAKVLNKLLIA-------TSVVLISFIvslQLA-GVEMGQSSIIAILVFGIASIG---AMAF-------LYKAVEQIADKLNVIEE
>tr|A0A0L0EW98|A0A0L0EW98_9GAMM Chemotaxis protein OS=Pseudoalteromonas rubra OX=43658 GN=AC626_03140 PE=4 SV=1
----------------------------------------------MNS-------QSIQSSLNNKIIIA-------GVILVISIVvgiQLG-ASGAENMQLVAVALPLFGVVV---ALGY-------LKMALSAVSAQLGCVYR
>SRR5688500_16794215 
-----YDARVLRGSFAQLRPRIAQYSPVFYEHFWRDYPETRPLFGR-NMSKP-----ELDTRINHFMLWVTENADRPHFTIDYiqsVARRHV-GYRIRRRHFAYVDNTNIKTLRELLGDSFTPEVERHWRASFRFLTLLM-----
>ERR1719193_2756600
-----------------------------FM--EKKVPSVIV------FlnslsLDDDGALETHALSVMNSVNKVVSRLDQPDRLVQLLHDLgrkHI-SYKANMAFLEPIAKHFILTIKPSVA-EWSPEIEDAWQQAFKVIGHIMQE---
>ERR1712080_794265
---------ASHVIPGESHGKHQSQRWIVFEKLITDGPEFKAIFGF-PGKRDDPAAQALGSKVLTKVAEAVGCIDDQAKFSSILHaegVRHK-GRKTEAAHFSKLGPAIIYMLGEVG---VAADAQAAWGVAFGLISGEMIKGL-
>tr|A0A1E3PUG6|A0A1E3PUG6_LIPST Uncharacterized protein OS=Lipomyces starkeyi NRRL Y-11557 OX=675824 GN=LIPSTDRAFT_199892 PE=3 SV=1 
-HLTPEDAIAVKESWKETIGLSpantvatssgspaSLFCNQFYQKLFAVRPDLEFMFPDI---------GRQSAAISGLFQVAlamLESIDALDDILLRMGRRHAFVMGIEPEHFELLGEVFIQTMRDRLGERFTPQIETTWVKIYSYLASKMIA---
>SRR3989338_7687732 
--------PLVQATWKQAMDLgdgDKGFGRNFYKNLFTKHPGLLeTLFKGV-------SIANQEKNLPKSITAVLGLLTDMPKAVDALQQLgmrHI-LYGTPDAGYPIVGANVIYTLEMILGSDFTPEAKARWGEIYGVIQTTMIDA--
>tr|A0A1C7N598|A0A1C7N598_9FUNG Uncharacterized protein OS=Choanephora cucurbitarum OX=101091 GN=A0J61_09444 PE=3 SV=1
-PPTQSQIDIVRFTWGHITDTrlpsdkpeispSHAFGLTFYDTIFHIDPDFKKLFPNIiQQakalggmiSylvkspeiisSPSSDdstlhtqvstirqINASKRKRstASTFSELVletaaDdTLghlpdSDVDHFACKLQQLgsrHY-RYGTQIDHFSLFGHAILKSIQARLGKDCLPEVLKAWTRVYSFTMFHMQA---
>SRR6185437_15632065 
----ADDVAIVRDSYGRIGPRGAALTIAFFGLLSDRVPRVRKFFPP--------DDKDKRAVAKDLFDLVVGHLESQLNVRWVLERMgrrGL-LDTITPSDVSAVGGCLLDALAELDE-AWSPATERAWSRVYDWAASAVV----
>SRR5678815_1770797 
-------GARVLASYRRIGSRASAAALAFFVAVQRGSPRVRRVFKH--------DDVDQRTLAKEVFDVVVGHLESPRELRSLLERMgrrGL-VDTVSAGDIDAIGATLVGTLRDFDE-GWSSDVEQAWNAVWTVSYTHLT----
>SRR3984885_15745818 
---------------------ASRAtgGGWLPTRSPTGRSARTSR------------TGCRRGRCDGNTRPTV--GG-PAALGGGQCEDsarDG-KLGLSADHADSAGAGRVdlAAVRHPGGAGV------------------------
>SRR5262245_20097952 
-EVTPQQIELLEQTLSELRRQSVFAAQLFYCRLFSLRPRLRRLLSGR--------PDFHGTRLLSVMSAAVAGLSDPGHFAGLLSLAarpavRE-AL-LQGDCVRVIGDAVHWMLERHFGGQITVEVREAWRAAHIRITQVIE----
>ERR1700722_6370008 
---------------RGIRPHCPavrqhLPCVLPPH--VRAGSVASHAIPQ-L-------SAPLTATLTAALEALVGALGDLQPVlvrAPALGLRLA-SYGLQPTDISIAASAFLATLDDELDEVSTNAARAAWGCVFWTVAL-------
>SRR5581483_4578849 
-----LQIALLEESFELIAGQSVELADRTLSRLIELDPQFRLLAARTE-------MAALRSVLFSVLYVLRRSLHNLNTLAPALETLgalRK-DQELSSEHFGTIGIALLDAMAEVGG---------------------------
>tr|Q17156|Q17156_9BIVA Beta chain of the tetrameric hemoglobin (Intracellular) OS=Barbatia lima PE=2 SV=1
---SEKIKEDLRLTWGILSNELEDTGVTLMLTLFKMEPGSKARFGRFgNIDSgmGRDKLRGHSITLMYALQNFMDSLDNTEKLrcvVDKFAVNHR-IRKISASEFGWIMKPIREVLMERMGQFYDPSFVDAWGKLIGVVQASLARE--
>SRR6266536_694904 
----------------------------------------GTRFAD--------SHRPPRTMERTGPLRDRLALRALRlgvgdvvwEDVPSLKRSmcg-----------AAAAGAAPVVAAVASAAPGDPQKHLKRADQVYAKSILLRMS---
>SRR5262249_10507301 
----------------------------------------------------------------------------NVKYSShhqQHGPQAR-GVRSTNLAFCCVWRRTEMG----------P-ATAVWSGVHCRDAAGMDGA--
>tr|W6FIG9|W6FIG9_9ECHI Hemoglobin OS=Ophiactis simplex OX=533354 GN=Hb_b PE=2 SV=1
MVVSAEQKALIQGAWTPIYAgNRFQLGVDIFAHFFKAHPNYANLFPSLvGVpnPSTSVELRGHAIRVLTGINYFVAALDEKKPvimeMIHNMARSHK-PRKLTREHFAQFAPVLFDTIG------VSGPARDAFLPYYNFIADNLFAE--
>ERR1041384_2362020 
----------------PLAPKANVLGERKvVAVLYSDLRGFGTL-----------SETGHAVDVLERLNDYFD------RMVAAITSHgg-------------------------------------------------------
>SRR5574337_1776253 
--VGLDDRDALRVLHAAFVApvdgngAANGLTAAIFDRWFGTDPSVRDLFPP--------DLDAQRAAFGQAMSWVYGELiaqraQEPVSFLAQLGRDHR-KYGVTQQHYETLSQVLHATLRHRLADAWTGAVDAAARDSLKLIFGVMSG---
>SRR5271167_3167484 
--VGLEDRDALRVLRDAFNQedpgASNELVRQLYAHWFALDTSVRDLFPP--------EMDSQRAAFAHALHWVYGELvaqraQEPVTFLAQLGKDHR-KYGVLPSHYDTLRRALHATLRTQLSDAWTDAVEDTACQSLNLITGVMSG---
>SRR5258707_573086 
---------------------------XMILKSFKPNAAIGC-KTI-P-------TW-----FVP-LPTFTAGLTLPKLYPLSVFGMRRyNLGGLGEPH--QVEAALLWLVEKQFEGVLTREMRQAWVQFCQWLVL-------
>tr|K0T9D6|K0T9D6_THAOC Uncharacterized protein OS=Thalassiosira oceanica GN=THAOC_11871 PE=4 SV=1
---------------MEREDSSgSL--PSFVSETEIEPSDVQPaaasgenNVDKGR------RKTSSSSKRTPSITKRIESFSSFKSLSSSFS---------------SKLDDERNAGEAGQAERVEsttapESVASGETQGNAGGQHTLN----
>SoimicMinimDraft_5_1059733.scaffolds.fasta_scaffold33866_1 # 3 # 488 # 1 # ID=33866_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.741
---------------------------------FELAPASAGLFPAQvrhkyrewttEEvhasdndVRNSPSMRRLFAKMLTVIGCAVASSQNLAALVPEVKSLgarHA-AYGVSEAHWERAADAVRAEPSRSYGGLEGERRRGPHMtrvtarTLTvIFGTMLLVAT--
>tr|A0A165S3D1|A0A165S3D1_9GAMM Chemotaxis protein OS=Halioglobus sp. HI00S01 GN=A3709_07715 PE=4 SV=1
----MTAIMMIDRDFTVTYANEAT-----LQLLRDNQATLSSIYPGFN-------PDKLI--------------------------------GSCIDGFHKNPEHQRNILADPANLPWRTDIEVADLKFS-LNVTAIVDAQ-
>tr|A0A1I2IR29|A0A1I2IR29_9GAMM Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor (Fragment) OS=Fontimonas thermophila GN=SAMN04488120_104136 
----KGVIQYINRDFIEVS--------------------------GFS-------ESELI----GSPQNIVRHPDmPVEAFadfWAT----------------------------LKDGKPWTGLVKNRCKNGDHywvLANATPLRAN-
>CZCB01.1.fsa_nt_gi|955242656|emb|CZCB01016507.1|_3 # 1728 # 2327 # 1 # ID=16507_3;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.493
----GVSSFEMNQQFSAQSSDSIEKNIAAISELWQKYMATnitdeekvladkfvatrgafvkealLPAVDALR-------ANdYEKAKLFSTKARDLYNVAHPALVeliQYQAGHAKL-EYDTSVESYKLTRNWTIASLFLAVGFLACFAYFImrSIANPLSvifRVLDNIKSN--
>tr|A0A1I5XDG1|A0A1I5XDG1_9PSED Globin OS=Pseudomonas borbori OX=289003 GN=SAMN05216190_1566 PE=3 SV=1
-----DDAALLEETLEMVSSRSEDLTPDVYARFFSRCPAASGLFTvIDpatPPM-------GCGQ----MLFEIISLLRDSAAgkpyvAsyMQQIATEHaA-FDVRDPALYREFMHSLADVQATLLGPDWSPAHAQAWDRQIAALLRHL-----
>tr|A0A1B0G6S0|A0A1B0G6S0_GLOMM Hemoglobin-like flavoprotein OS=Glossina morsitans morsitans PE=3 SV=1
STMNSDEVYEIKRTWEIPATTPTESGVAILIRFFTKYPSNLQKFSTFkDMTldelKNNPRFKAHANRIMKVFDDSIKTLDDncshLEEIWTKIAQSHF-NRQIEKQSFNELKEVILEVLVAACN--LNDQQTEIWLKLLDFVYEIIFKT--
>tr|A0A1J1IV29|A0A1J1IV29_9DIPT CLUMA_CG015163, isoform A OS=Clunio marinus GN=putative Globin CTT-Z PE=3 SV=1
HVLTPEEIVLVKDSWKIPSANAVDSAELIFYTFLSRYPEHQKRFVRFkDKPlnelKGSPFFRAHASRIYNVFDSVIDGIGKdpenkeVMSFIAESGIFHA-KKKVTKQAHAELRVVLVEILNDVCK--LDEKGNVAWSKLLDIFYHVMFEC--
>tr|Q7M422|Q7M422_9DIPT Hemoglobin V OS=Tokunagayusurika akamusi PE=1 SV=1
VGLSDSEEKLVRDAWAPIHGDLQGTANTVFYNYLKKYPSNQDKFETLkGHPldevKDTANFKLIAGRIFTIFDNCVKNVGNdkgFQKVIADMSGPHV-ARPITHGSYNDLRGVIYDSM----H--LDSTHGAAWNKMMDNFFYVFYEC--
>tr|A0A0G4EPR9|A0A0G4EPR9_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_12573 PE=3 SV=1
---SDKERgVLIDKTWGllKERYTLQEIGEELYDNVFKNAPDLRHLFKRPKEL----MALKFGEMISTIC-GLFQ--TDRESLLEtmrDLGIRHV-DYGSRPEYFPLFKACLLDTLENLLEDGeFTAATEASWNDMWDEASEMLISS--
>sp|P15447|GLB4_GLYDI Globin, monomeric component M-IV OS=Glycera dibranchiata PE=1 SV=2
MGLSAAQRQVVASTWKDIAGsdNGAGVGKECFTKFLSAHHDIAAVFGFSGA--SDPGVADLGAKVLAQIGVAVSHLGDEGKMVAEMkavGVRHK-GYGykhIKAEYFEPLGASLLSAMEHRIGGKMTAAAKDAWAAAYADISGALISGL-
>GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold789473_1 # 1 # 552 # -1 # ID=789473_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.562
-RIPPLKGSSLSAGWRTASSSGLS---------------------------------------RNPRGTVSR--------ESGNTVFqseTF-AGAASPRGGSLL-C--FT--GENEPMGMINNLKT------------------
>tr|A0A1G7K468|A0A1G7K468_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter baekdonensis OX=875171 GN=SAMN04488117_103319 PE=4 SV=1
-MLAVKQISLVRNDFRRLAPARPEMFKWFYDRLFEIAPHTRDLYSE--------SLTEESSRVNGLLEIAFLSLDHPQAMFATLHTLgrdFS-GFGIWETKLHLVVDLLVEVFAEFGGEDWGSELEKAWHSVLIFIAQGMKEG--
>tr|A0A291GF03|A0A291GF03_9RHOB Uncharacterized protein OS=Celeribacter ethanolicus OX=1758178 GN=CEW89_16165 PE=4 SV=1
-MPSARQIALVRNNFRALSPKRPDIFIPVYDRQVGEDPKAAAQYDG--------SLCQRARVLDGLIELALLSADHPTALFATLHKMgqdYA-HYGSWREKHPFLIGQIIKAFAEATDTHWTDELADAWEQFLYFMAEGMLEG--
>tr|Q86G74|Q86G74_PHAPT Hemoglobin II OS=Phacoides pectinatus OX=244486 PE=2 SV=1
TTLTNPQKAAIRSSWSKFMDNGVSNGQGFYMDLFKAHPETLTPFKSLfgGLTlaqlQDNPKMKAQSLVFCNGMSSFVDHLDDNDMLvvlIQKMAKLHN-NRGIRASDLRTAYDILIHYMEDHNH--MVGGAKDAWEVFVGFICKTLG----
>ERR1719468_599295
-ELNEKQIAVIKESWKVLTNEITEIGMLAFLHLFESTPDAQGSFKEFhSMTkdelKHSEIFRNHASRVTGVIKKVVEKIDEPETYLPHLHILgqkHV-MYEIDVNHIDQMGYMFLSGIKTALENknAWNDNARDAWESLLLMVIAEMKK---
>ERR1719329_2046659
----------IKTVWAKIMKEVgtLNAGTMLFKNVFMLAPETKQLFPKFRHlkddlLLSNESFKNQAKLSISALSNAIMSFDDPPKLkrmLMDLGRIYE-SKGVSLATLPIVGNALMATIEAALGNDSCIETFNFFALFYNEGSNMLAEGYK
>ERR1711915_153481
LGLTKRQRFLLKGSWKGISREMQVTGVRVFIQMFQSRPETFQFFPQFqGLDgpeqqKRSEVFQEHSEKVISRIDEALASAENPEVLTGVLLQTgayHRKIDGFNPQLFLCIEEPFLESLSLTLDERYTPQMDSIYKIITKYIIQTVIDGYN
>ERR1719369_313705
TGLTKKQRFLLKSSWKGVSRDLEYTGVKWLVGVFSTQPHTQKYFTNFsSLSldgelQECTEFREMAEKVMERLDNALFHMEEPDTMRSILLETgayHRRIQGFREDMFKDSEAPLLQAIENTLDERYTKQMAEIYTVVVQFFIETIMEGYT
>tr|A0A0S8CN91|A0A0S8CN91_9BACT Uncharacterized protein OS=Nitrospira bacterium SG8_3 GN=AMK69_14025 PE=3 SV=1
-GLPPSDISRIQRSFRMVASQGEKMASRFYDLLLERSPELQKFFHPGN-------LSQQHAKFFNGLHSLILHLEHPQALraaLVQLGEQHQ-GDGIEIQHYPPVVDTLLQVLTEFSGEGMDGETYDAWAHFLHLVRAIMLENH-
>tr|A0A182IYR6|A0A182IYR6_9DIPT Uncharacterized protein OS=Anopheles atroparvus OX=41427 PE=3 SV=1
-GLTKSQKVALIAAWSIVKKDLVTHGRNIFVIFFEEYPQYLDYFDFSASdAtgdlGENRSLHAHALNVMNFIGTLIDyGLNDPDllkCSLARLVRNHR-RRNVTKEDVGAVGGVIMRYCLKALEQHRSKTLEDAFGAFLGTVAAAFE----
>tr|A0A182QXV6|A0A182QXV6_9DIPT Uncharacterized protein OS=Anopheles farauti OX=69004 PE=3 SV=1
-GLTAQEKITLFSAWGLIRKDLDIHGRNMLLLLFHKYPHYVSYFDFTDDaSaqtlVDNKSLYSQSIHVIKTFGSLIEyGLKDPAlfnETLKKITRIHA-ERNVYGKDILTIGDVLLNYLAQVLGRQVSDALPDAFRKLFVTIAGRFP----
>tr|A0A1Y1I4E0|A0A1Y1I4E0_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_002310190 PE=3 SV=1 
-QLSPFEQQLVQKTWKLLQPRLADLGQAVFTHLFQKAPKTRPLYTCPlRLadgdrrTPDGHAIPTHAVEIVSTIGLAACRIGSSSRILAVLErlgQRHV-AYGAAPDMFSVFKEAFLVALKKTLGGeHFTAQVHKAWSKALDSVVAHLKKG--
>SRR5271157_2714777 
-SRIVDRLTALRAFFAEMEPQLPVIVARSYERLFDVEPAIALLFKG--------NAREHQLRFLAKLQSIVKLTRSSqlwpasaatgQILipeVLDFGRSHA-KIGVLPVHFSLLNDMIAWTCKEIAPLRFTPLVEEGLAFVFDVLGASLTAK--
>ERR1719323_206356
-KLSEQEKSVLKSSWAVISKNLEVVGSQMFIEMFQANPDTQHQFSNFrgiDQTelSETPQMIQYRTKVVATIGQVIDNVDNTHMLWDlliKFGRDHF-SYGALPMYFDLMGPHFVIAARNNMGNDWYEALEYHWLALFELIIYIMKFGWH
>ERR1719461_2449329
-----------------------------FLPSFDHDPECPEKISLH------------CQRVMSVVGGSIEHIEDYQCLWKhliSLGRDHF-GKIYEitlgqkSTFYPKIHSLKIpIFTKfTFLKSNFSQNSRFSNIKFL------VISGX-
>SRR5690349_7596073 
--------------------------------------------------------XMQMTRFTDL-GLRTLMLL-asaestgrrvttRTIAVGANASEHH-VAK----------------------------AVSRLAELGMVMADTLIE---
>SRR5215510_2422438 
-QMTKEQIEVVQNTFNKVRPMSGTAAQLFYNRLFDVDPSVRETLLW--------TLKQGlGADFTPEAEVAWGNAYDFLAAVMQQAAKGA-SMX-------------------------------------------------
>tr|A0A158PBC2|A0A158PBC2_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=3 SV=1
-LPNPRERELLRRTWSDEFKFLYELGSSIYIYIFEHNPHCKQLFPSIAKygddYKDSREFRIQALRFVQTISQVVKNIYHMDRLESylyGIGQLHCKyaHRGFKPEYWDDFKDAMEHSLTDHMNSlsDLDAqqrsEAVAIWRKVAHYIISHMRTGY-
>tr|A0A2A6CNA4|A0A2A6CNA4_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_32112 PE=3 SV=1
-QCNPRYTALLKSTWSDDFEVLFALGAKMYITAFEgpHGVACKSLFPWVAKyeeagenYADKSEFRLQALRLVQTIVKALDKVDDLQKLEAylyAVGHRHVFylPVWLDPVYWDVFKasratsylgqstmlksaserDAVQVGVNDHLHKlsKLSTddlaRATLIWTDIIEYIFEYVKEGF-
>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3481696_1 # 1 # 387 # -1 # ID=3481696_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.584
-VLTSNDIALIRESWAYAKDI-PAIQTETLLEHFRIQPRTQALFPKFaDVPlnklPTNDAFIKQARSCVSFGLNFIVANLDNPSLLkDMLGRVdTyG-KWYVDF--MtkeRQMQTTVdifIQVLSKELGGRLSAAAKAAWTRAMTLVFVEMMS---
>ERR1711894_485352
---------------ILLYNY-rfLTYVIYYYYRFLAEDPTVASVFSRVNVdDQQSGEWHAHMLRIMGGVDILINMMDDVNVLTEEVKHLraqHVVREGVTHERMKAFLIIMMDELPKVMT-HFNH---DAWKSCLSKKLKRIG----
>SRR4051812_36412483 
--------RRRTRGSARITWPGYQMRNLLSPRLFDRASAVRVLLPD-DLT-------RLKHQFARTLHWLIGHLHEPQKVriaLVDLGRRHQ-EYGVKAEYYPAICEALVDSLATISADDWNDELARDWRQTFELMVHHMLRAYR
>tr|A0A1Y1IHX6|A0A1Y1IHX6_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_006460015 PE=3 SV=1
-KLSDERILKAQALWDFMEGsafadndrrQFIDRGVKVFENLFELAPQVLTLFPFKDENgrPRRKELEVHVETVMSTTGQVVRQMQDPDSLAPMLTELtalHV-KYGVELIHYDILCSTFLLTFEQLLGPRWNSDYRDVWISIFSFITTFARKAY-
>SRR4051795_8230555 
-----PAVT-----------------------SPRVpA---------------------------------------------FgSPCPvirQQ-RWTGAI-----IGTRQEGSVP----------SAHSTTSGD------------
>SRR5215203_3322109 
-ELSERTIALVKATVPALEAHGLAITRRMYERMFH-NEAIRDLFNQSHH----GETGSQPKALAAAILAYARNIEILAAWGEAYWYLaevLI-ARERLIyqglaaapGGWTGWRDFTV--AEKRCESEVITSFVLRPTDGGPVLRHR------
>SRR3954470_353290 
-----ARRS-----------------------------------------------------------------------SPLaEGDPryhVH-QWDRGRQPRRSTRCRVTPPVT----------NIRRYLVGP------------
>SRR6478735_1414904 
-----SGSR-----------------------PARLaS---R------------P-SW---------------------NHRPIgEATLvnrYG-RS---A-----AGSDVE--------------RIERDLSGT------------
>SRR3954468_7455402 
-----APPD--RA-----------LT----GGGETVpG---V------------R-ASR------P-------------RTIDRsGRTLvsqSE-RS---A-----EGSGVE--------------EIERDLSGT------------
>SRR3954470_12739883 
------------------------------------------------------tsaCSRTRTSATCStsrtmarqapsprrspPPWSPMRAISTTSARSPRVERIaqkHV-GLNILPEHYPAVAESLLGAIKDVLGVTHYSRGLTDDPDWYPYLKKHEWL---
>SRR5215831_13609655 
--------KPCNRSKPFFRINAFCSAvslalrlQRLCELPESAHPQRC----ASCL----K-TANPAKNVVPKRFGTFISIHLRDTYIFAVSKIgqkHC-GLNILPEHYHYVAESLLGAIKDVLGEAATEEVLSAWGEAYWFLADVLMA---
>tr|F2UFM9|F2UFM9_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_06664 PE=4 SV=1
-RLDMEQLKIALGSWTAVVELVPTWHEVFFAELFQAHPETeRLLYSSDKSK---SWNERHMARVGKSVGDVIKSLSNYDDVIEHLTTGephEQ-ACCL--------TDG--YVIGTGLGNT----PRSLWLACGS-----------
>tr|A0A1Z5KPX1|A0A1Z5KPX1_FISSO Uncharacterized protein OS=Fistulifera solaris GN=FisN_16Lh317 PE=3 SV=1
---SPACVMKVINRWETARQRngfDEQLDIDTLLALFKMDPQVKPIYGFAvEKEvkaQgmQRMGVLIYGLQVVKMFDVILSALGPDeElfyDVVTEMGEQHC-KHGLTPDHFTLLCGAVMGVLETIMDTEWTKDVRAAWSQVIECVNAEIVK---
>ERR1712000_676789
MSLTPQQSAQIRSSLPVLKSEGETITSLLYASLLHNHPDLHNLFNSVNQAN-----GRQPRALLSSASVKGTARWESHQLS----------------------------MISSRGTCWRPSR-RSWGPSGRLSX--------
>ERR1719328_19047
-GMTPEQKQLIDDSFAVLKKDVKGNTIVFYETFFKMNPELVAHFPGVseaDLVnlGKNEFIIQRGAKFFNMIETTTHLMESKEGCLELVRMLkesVP-EGKVTYDRYKVAKEPFIKMMETALGGNFSAETKAAWRKFFDSLAETTK----
>SRR4051794_16351730 
-TLTPFEVGVIRTSFRDLQKRSGPAAQRFFRELFSYDAALRELFAP--------SPWTRQENLMSVLSGVIEQIDSSTTLTTHLDEVvrrFP-AFAVNSYYHLYVGAALFAM---------------------------------
>ERR1719187_1205752
-SLSQGENDALKAGFKAAQGKLGDIGANTFANLIANDDSFRQRFPWANsdITveeiKTYAPAIAHGEKVLQGVNVAVKNLDRLNSFVSyfvDEGVKHV-PRRVTVDDFQAFAEAVHPAFQKELGDLYTDDFKNGLTGLLGFISDNMAKG--
>ERR1719187_2594184
-QFTEAEKTILRDTWKGTIQpHMAQNAANLLITYINENPQDRKLFYWGRndKSgmalRVSPGFVTHSQGVFSGVGVGIDRLDNIASLDKfytQLGEDHI-PRGIHEGVFAPMKDAFLQILGHALQEEFTDEAKAAYGKYYDHIAGKMIEG--
>ERR1719309_658292
-HLSGEEKQLLQDTWSRSIApLKHENGANMFIHFITHNPELRREFFWGRnnKTamalRVDVRFASHIRSIFDAIAHGISRLDNMDSLQGyytELGQDHI-PRGVQRVMFAPLADSFMYAVGLALEDQFTPAVKAAYLKYYMHIP--------
>tr|A0A0G4IVL1|A0A0G4IVL1_PLABS Uncharacterized protein (Fragment) OS=Plasmodiophora brassicae GN=PBRA_001183 PE=3 SV=1
MRLSARITNLVKSSWAEAMTLQgrdgMTLQKAFYNHMFTKAPESRAMFKE-DTS-------KQELMFGQMMTDAVNILDNFEELVNKlvyLGEVHR-YLDLAPEHFRVVGESLIGTLEDILGKkRFNAEVKEAWVMVFDLMATIML----
>tr|S6BNG7|S6BNG7_POLVA Globin OS=Polypedilum vanderplanki GN=PvHb32 PE=2 SV=1
-PLSKEQADEVRHAWDKVKSN----EVEILYEIFKAHPDIQNKFPQFagkNLDsiKNNSDFGTHATRIVSFITEIMSLGGKpdllpaIKTRVNEMGQNHR-NRGVTKEQFNEFRSTLTDYVKHHS--SLDGDTEHAWNQAIDNVFFIIFSNL-
>tr|S6B7W8|S6B7W8_POLVA Globin OS=Polypedilum vanderplanki GN=PVHb31 PE=2 SV=1
-TLTADEANLVKSTWSQVKDK----EDEILYDIFKQNPDIQGRFPMFvgkNLDsiKSTEQFKTHADKIVKAIGSYIDLLGNesnsgaIKTILNELGQRHR-DRGASKEQFNEFKTSVLKYVKEHAS-GWNDASGSAWDKAFDDMYKIVFSNL-
>SRR5579871_994368 
-----ADPMNINESIHDILNRDEIVADLFYDVFLDRHPEVRRFFVGVDI-------RQQAIV-LTMMLSIIEDfYHHsypaTARYLRLVGQRHK-ARAIPKEMYLIFCQCLLETLERFHGQNWSAQLSDEWERAFDKASQVLLEGYQ
>SRR5512135_1415087 
-------TELIARTWEALGDRQAQFIEAFYDRFFERFPGYRKLFPHE-LR------TAHLEKMVLTLALLADLSDDRTAIAPRLHKLgaaHK-PFDLELRDFNNFKAVFIEVLGPQLGKQWTAAAAKAWNDAFDAVLIP------
>tr|A0A163MXG7|A0A163MXG7_ABSGL Uncharacterized protein OS=Absidia glauca OX=4829 GN=ABSGL_15412.1 scaffold 16614 PE=3 SV=1
---SQTDIDLVRSSWERVIETqhpsdedgvspAQAFGLVFYAALFHLDPHIRPLFDGTNVMIqakmltfvigclvRAPMVIQRRGPTLKEISTTPTGAEDMEGLAAKIRELgarHH-FYNVEPAHFQLVGPAVDMALRERLKHEYTDAIGQAWLRTHAFVAHHMA----
>SRR5207247_8066543 
------DVQRLQESFARMAMHGDAVPLFFYSDLFLRHPETRDLFPV--------SMAAQRDRLVDALGRIVSDVEHVDADSGDPSGArpeDA-HIQAVRILsnAQQMADNYVADAQEY-----SSQLSTX-----------------
>ERR1719193_187210
-VLTENDIKAIKAIWYPVRQTPADIGAAAFEKFFKLYPHQKEKFWFMkNDDLKEKGMRAHGEKVIKSLDEAVLRTVDrarIRSCLQRLDYIHF-QMGITEEDMEELSDAVVKTIKEVVIdtnKKLTHEELDSFKKFMKMVTAE------
>ERR1719193_859649
-------------------------------------------WRMLkKRH------NRDGGKLLH-PLKTILQTCYksrIKNCFQRIGYIHF-RMGVQEEDMEQLGEAIIKTVEAAWGDEFTPEEYAAFRKFMKKFTAA------
>SRR5580704_1734515 
-----------APRAELATGVAPDYgSPDDVASRRSQSRACRRTLRR-P---------TTGAVRGEMLARVIEAILDFIgeRRYAhHLiqcEVVtHE-GYDVPPETFGIFFGVVATTVREQLADAWTDAFDEAWRTLLYDLDY-------
>SRR5258708_241677 
-----SCGEDPAGSSD-------DHDAD----VVASAGQVEGGVDL-V---------EHPPALGVPIAAPCQWLVDLEgaGACAaNRmaaERVnHE-GVGVPPAALARFFPIVAETCRDLLGEAWTGEIEAAWAGLLTRLAV-------
>ERR1719296_55987
--MDSDMQVAVQKSWEKVQEIGTlAVAELLMKHTLEIDPEAIQLYICKAKPGEDENVLDVARKLfartLFILGSSAAGMADTAHVVKNLTVAGStlANSGVKESYFNTVGTAFQMTLQEVLGDKFTPEVATAWKVAFDFMTAIMVAGMR
>SRR3954451_11513015 
-AASPCAQQLRQGCRDRPA-----ACQLVLSSGVRDRPGCEIAVQG--------RHGEAGPQADGGADGLIDAIDRLDTI--------------------------------------VPAVEAAWTEAYTILATTMKD---
>Dee2metaT_27_FD_contig_31_2132282_length_204_multi_2_in_0_out_0_1 # 3 # 203 # -1 # ID=1013462_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.592
------------------------------------------------------SAATSNPQF-------VAAV-------------------KKAIDYSGL--------LTVAGQGAVQPagiipSVIAGTLPAADALKQDVAG--
>AntAceMinimDraft_18_1070375.scaffolds.fasta_scaffold521461_1 # 3 # 443 # -1 # ID=521461_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.569
-------DD---------------DDDDDDdDRMFHDHPEARALFSRVhGDNTYSPDFEAHAQRVLGGLDSCISLMDDPDTLASELGHLkaqHA-DHTdVTAEHFDVSICFSsTDVTSTYTsthckimdrpnYTVFQT--RGQrnltksaSRRAHspvRDHPRG------
>ERR1719191_324407
--MDDSAMKITQESWAMVEKEVPHWPEIFYDQMFA-DPSVAKLFPFSsGNFKENPKFQEHTQKVKDTMHTAMTSIKEFDKLrpvLYKMGQRHV-AYGTLPEHSTNFKNAFLFTLKAGYGDKWNEDLDDAWNQCVDALL--------
>tr|A0A0P5XAJ2|A0A0P5XAJ2_9CRUS Di-domain hemoglobin OS=Daphnia magna OX=35525 PE=3 SV=1
-LLTANDRRIIRKTWARAKKD-GDVPPQILFRFIKAHPEYQKMFKSFaDVPqaelLGNGNFLAQAYTILAGLNVVIQSLSSQELIANkinALGGAHK-PRGATPIMFEQFVNVAEEVLAEELGSSFNAEARQAWKNGMRALVTGIT----
>ERR1740129_283753
-PLTRREIRTLGLSWSKFHGCRQEFGVELLVQFFQLVPEASDLFRFQRekTISENPGLKNHADRVVRVLSRVIHNILSLEEVVPDLKALgmkHYMDYGVSPTHYCLFGKALLGTVQTF-GG--TPPEQGCLPKLYEWMSRTMTS---
>GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold759510_1 # 2 # 568 # -1 # ID=759510_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.697
----FVTTQCVVENWERLkySPFFDEFVIAFYQRVFRLCPQAKSLFGSSfCLD-DQAAMT---QEFVRLIDRILDLLGPESqlmvEVLRDLGSHHE-AYGVTVEMYDIMRNAFLLTLEQFEGEKmFTSKVRQAWTTVCSAVADVMTEA--
>tr|F2UFM8|F2UFM8_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) OX=946362 GN=PTSG_06664 PE=3 SV=1 
MRLDMEQLKIALGSWTAVVELVPTWHEVFFAELFQAHPETERLLYSSDKSK--SWNERHMARVGKSVGDVIKSLSNYDDViehLTALGTRHA-RYGLHVDQLDLFINAFLWTLGAGLGDSWDHSVKKAWMHVLPFILSPLKS---
>UPI00054DD732 status=active
----------------------------------------------------------------------------------LTCARDF-FltfVGVERCR-PKLLKQEPQTITSKLGm-A-PMLQSAFWSIRVMRIASS---A--
>tr|A0A1E2UUQ1|A0A1E2UUQ1_9GAMM Uncharacterized protein OS=Candidatus Thiodiazotropha endoloripes OX=1818881 GN=A3196_04875 PE=4 SV=1
--ITKTNLKRFQQSLRRIS-LKQGFYDTFYDHFIAQSDEIAAIFHARDM-------DQLKGKLKETLQMVEDALMGKPGvvlYLEMLGRIHT-RLKVDQRHFEMWKYALLSTIERYDD-EYDAEVKMAWEAAIETVVSLMYPES-
>SRR5262245_29633745 
----------------------------------------------------------------------LGNHSTR-cgrSVESSQSNSTA-DFLNSRRIHDAYSpaiRAAKSKSE-------------------------------
>SRR4051795_1885912 
-----------------------------------------ApRTARRRL-----QPGQPGRRLAAdRAGRVGRGlRQRPaegprtdsrapavadraqarvaghrprpvrrraRQPVLGHRRRAR-EGGHTGGRRRV----GRGLLADglCPGQPGARPLQRAWRAA-----GDGVAR--
>ERR1719218_338423
--AEEAGDTVLV---GGAPLgarqRPMATGSKIFRKLFTGDTAVLRLFPFRHQartLFVSAPFKLHAKLFVDTMTELIANLHDLEKVERdvrELGKRHL-TYGVQPAHFDAMGEALIASSTS------------------------------
>SRR6516162_179054 
----SQTVMDIEESLHHILEREKLVADLFYMVFLEKYPEVRRHFINVN-------LRRQAVLLTMALQVVVQYYLKgFptaEAYLKILGEEHN-RRGIEPELYPKFCTALLETLSRFHFHDWSEDLAQQWEEALKLAATEMVEASP
>tr|I2G907|I2G907_9HEMI Hemoglobin A OS=Anisops deanei GN=HbA PE=2 SV=1
-SLTDREVEVINQSWNQIKAQELVVGLQMFKTLFQRYPQYERLFTHLHQSgkslYEGDRFQRHVvGNIMSSINKVIETLNSSDNAVKTLQDMgvkHK-KLDVHRKHFESFVPFVVDAMVSVRMSMSQDEVASAWTKMMEGVASNLSKG--
>tr|A0A0P5UVQ8|A0A0P5UVQ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna OX=35525 PE=4 SV=1
-LLTANDRRIIRKTWEPRpR-RTEDVPPQDPLPFHQGPPRVPEdVQVLRlCSPsracEQRKLLGPRPNTILAGLNVVIQSLSTHGAYCQPNQRSrsaNK-PRGVPPIMFEQFGNVAEEVLAEALGSSFNAEARQAWKNGMRALVTGIT----
>RhiMethySRZTD1v2_1073278.scaffolds.fasta_scaffold3173058_1 # 192 # 530 # 1 # ID=3173058_1;partial=01;start_type=GTG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.740
---------LYSGTNvytgataslLAQADYLSSLIGDTDYPMFDVESVVQLFL----------------------------------------EwehNKHH-DIMGFRN---YPHKSVMTG-------TRAPVHHTPWLQALDDSMECYLNT--
>ERR1719183_2765469
--------------ADIFMPRLEEIVMRMYNLILEEQHECINIFNTPSLS-----PGQPLAALAACIRGLIEDINVRPRLEhrvEMIAQKHC-AINLQAHNYLGLQGMFMSAAEDVLGADMTPQRFSAWSQALLFICRLVIER--
>SRR6478609_8547471 
-VlvdveevlrvvfgFDLPQTDVVRSvVLGNPGQ----I--------IAVHKVDV----------------AAGGRIGPQGGRVVPHPRDVcLV-LRRVHPLR------------------------------------------------------
>ERR1719383_1602644
-------------------------------------------FGLHL----------------QSTMLVGNDLDPVDERG--pdhCQQALW-TASE-GRTLSHRRREPCRSVLEVLGEdVVTPEIGGAWREAVQALAKILIDT--
>ERR1740139_1260005
---TEQMKTDVVSSWGKVLSFGTlTVGRVLCRHTFALSPDMHALFPPHILhkyqeegeTDSNGALSRHFSMILNAVGCVVSSFDQDadLSTITQLGMRHA-SYRVVESHFETIGRALELTLHDILKDDFTPEVRHAWKLVYSFLSLVMIRGI-
>tr|H2ZAE8|H2ZAE8_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1
MEMNAQEIQDVRDSWKRLCADGeKTVGLMLMQKLFNTYPESIKVFSRLGITnkaiitiddlSTNSAASRHAESLTSRIGTLVDLMHNTHefkECSTEVGEIHI-KYGVTAEHVDILGNVLLSVICDSQGLSKSSDLYLCWTKTWEGIAKYVK----
>SRR6185437_12825295 
----------------------------------LIAPRLELILPA-DP-------ARRDAAFLELVDMVVQRLDRLDLLLPMLAAQaHSwGKRDVLDGDYVLAGKALAWTVEQVIK---EPAAIAAWRDTFDFLAGVMRR---
>SRR3954465_11422119 
---PCRSSPTTSGRSPGAS--TRT---------------CStAtRGCWTGPStgatrpRA-----PSRSRWPGPSRSSpahwSRSPSRSpSTCSpgSRTSTTHsasprpppP-PPPPARAERGVVQDNLFWAIVDVLGEAVTPEVAAAWDEVYWLMAYALVNQ--
>SRR3712207_885952 
-------------------------------------------LGR---------------------GLLadglRAHPPGAgALQR---------PRRAAGDGVAGVggRRGENRERGRREPPPAAGAGTPGVDRAAPPGRCRPGTP--
>SRR3954465_6877418 
-AtaaaTAAASSTDIRATRPASLEG-------------HDRPHLDTaEAGRAQLADG-----EGDIEVGGVDEvVAtqhlLRLHERAvGHlgpPTDARRGAGR-LQGVAAEELGTVRLDLDGELVVRLHDL-----VEDLGRRRRVLALVLVDQ--
>SRR3712207_8177874 
-VLSDRARPVVEATLAPVADNIGEiarRRSEER---------------------------------------------------------------------RVGKECRSR-----WSPY-----H-------------------
>tr|A0A0L0FUF5|A0A0L0FUF5_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_07147 PE=4 SV=1
-ICKPEELHtkdlgfivtHTNNPW--GSTDEQDFGVDFFRDHADQ----------------------SGLTSFFSSIVIIACEMYQEfePSIPQLQKLgeeAK-HLDIPCHMEDNIVGYVASTLSR-SKQ-FDAIEECAIFKLIWRVVLFVLE---
>tr|A0A252E791|A0A252E791_9NOSO Nitric-oxide synthase OS=Nostoc sp. 106C OX=1932667 GN=BV375_01385 PE=4 SV=1
-ALPPQMLHQMADCWEVFSQNKQQMGMEFYQILFEKYPFVLPIFGRADMD-------YLSLHLFQAVEFLVRCLRTGSsdNMLQELRFLgqvHS-FADVPSCAYPAVSDTMFVLFEKYLPN-FTPELRQAWQILFDRVVNVIKL---
>tr|A0A2T1LS65|A0A2T1LS65_9CHRO Nitric-oxide synthase OS=Aphanothece hegewaldii CCALA 016 OX=2107694 GN=C7H19_21845 PE=4 SV=1
-ALPPEMLQQMIASWSVFSQNKQEMGMEFYQILFEKYPFVLPIFGRADMD-------YLSLHLFQALEFLMRCLQSGSseEMLQELRFLgqvHS-FADVPTCAYPAIGDTMFTLFEKYVPD-FSPELRQAWQTILERVINVIKL---
>tr|A0A2E9QYM9|A0A2E9QYM9_9DELT Nitric-oxide synthase OS=Deltaproteobacteria bacterium OX=2026735 GN=CL920_22905 PE=4 SV=1
-ALSS--MKEAKRLWEEGVGLHTAPGSEWVHQLVAERPEWNHFFASSDPE-------AFGEALFSTIDSAVHQLDDEVSMFSSLREDselFT-AWDVRACAFSALPDVLVDFVV---ED-HQTVGAQALRTFLRRVCTIVSL---
>HubBroStandDraft_6_1064221.scaffolds.fasta_scaffold2618798_1 # 2 # 181 # -1 # ID=2618798_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.622
---SAEDRSIIQEQWKILFKDVdsskikIAVGRKLVLNLIQRQPDAKVLFDKFNVdEPNSPQFSAYALRLFNRIDLIINLLKDPEALDAALEFnaeRYGNIPNIKKAYFQTAAQILAYALPKVLDD-FNA---LSWQSCTRYILTTVASKVS
>RhiMetdeSRZDD1v2_1073273.scaffolds.fasta_scaffold2404579_2 # 426 # 629 # 1 # ID=2404579_2;partial=01;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.627
---SSEDRRIVQKQWNALFGDVrssrvkIALGSKLLLKLAELRPDAKEALKPIHIdDPTSGEFQAHSFRVLNSLDVFINLLTDAEALDAALDHhskEHSGIAHIKKEHFKVFGEILISSLPKVLDD-FDA---FSWRSCYKYIGQRLTAQLH
>sp|P02210|GLB_APLLI Globin OS=Aplysia limacina PE=1 SV=4
MSLSAAEADLAGKSWAPVFANKDANGDAFLVALFEKFPDSANFFADFKgKSvadiKASPKLRDVSSRIFTRLNEFVNNAADAGKMSAMLSQFakeHV-GFGVGSAQFENVRSMFPGFVASVAAP--PAGADAAWTKLFGLIIDALKA---
>SRR3981081_215795 
-RDDPDQKQLVRAFWKQVVPTAEAAAGLLYRPPFERGPHTPAPARVsrpTAAS-------PARGSLLECWGFQSAAGQAR----------PANGEGGKP----RPPPRRL-----------------------------------
>WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold136029_1 # 443 # 1567 # -1 # ID=136029_1;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.433
-----HHLQFLQQQISAAEPRAGIAMLVFWKNLFELNPSLRPLLGEK--P------GEEDYLLVQFLAAGLAPLFRQTPNTAPTdQDGACAPVNTDeEQQCSVVGEALLWSLEEAFGADFTPKVRSAWETLYRFITVSNKQSY-
>SRR5687768_12147577 
-------------------------------------------------------------GLAHARMDSvSLK--PpanphcaiktwvlacgvparTAEWRPMSNlSDAP-SPSLLSDQSLSV----VQ-TTATVVAAHADEITAAWSEVYWLVALQLVA---
>SRR6476660_4664138 
-M-VVVGVDAHKrtHTCVAVDGSGRKLGEKTVPATT----------------------VGNASALRWARSTFGpdltwgiedvrnvsRRLE----------QELV-NAGQR---VVRVPTHLMARTRasartrgksdsidaTAVARAvpREPDLPVAqHDSVS--RELQLLI---
>tr|R7TLW3|R7TLW3_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_227018 PE=3 SV=1 
-----------EITWAILSENRDGLGTEVFVRMFESYPDLKSAFGPLrHMNKKdagyEDVLRAHGIRVLSIVEQVLSKRHNMEEVLSILHDLgrkHL-TFSAKVEYIDIVSQMFLFAIESALKEKWNNSTEKSWGEIIRFVTYVMKET--
>SRR5918994_1081840 
-----------------------------------------------------------------MLAVAIEALLDRGGegrlagLVGIERMNHV-NIGVPPEVFDGFFALLMEVVRDALGPPPKGGGeragGGGWPPAPRPAGAR------
>ERR1712157_679996
-----TTMDCVLSSWEQVRRIpnyRETVGLAILQKLIHRMPEGREVLHMQrNLIknsppgiESDKLLLAHARAIVNGLDTVVELlgplIDDISEILREIGKSQYHDYGDSMALWNpLMRECVLEVIQETLKDDYTHELKVAWTDFLGEVAKDIHS---
>ERR1719360_423992
-PLTQAQKEIIFTSWDAIT-HKENLGVTIMYRIFTGHQEIKHLWKFADdLKteeeiRGSKTTQFHAKKVINGVNSAIKAVEAgkeVESlGLDKLGARHF-KYGAKPADFRHFVESLFWAIKTIVPE-VSAEMAAAWTNFVMQIIKQMTN---
>SRR6476660_7963253 
------------------A-SHSTFFERFSSNFKAANMSLQPFM-----D-------RQQKLLREDLTKLVMCAENAEFa------TRPGAvALNVSPQLSKFWIDALMLTVREFD-EKFTPELERKWRTILQKGLA-------
>ERR550517_1828149
-------IYYVSikPPKNRLESHIRKqSRVqsdysQDYIKETAIFSFFIQIFHKLNPNPNSsgikytkdqalkESLHEHGVKVLNGVDEVLSNLDQPSLCFSLIRKTgahHRKLQGFKPKYFKCFEEPFLAMVENSLGQRFTPQMETVYRSVATFFVQTLIEGY-
>ERR1719220_3089060
---------------------------latvnIHLRSAFHASSLLIQIFQKLNPNPNSsgikytkdqalkESLHEHGVKVLCGVDEVLSNLDQPSLCLSLIRKTgafHRKLQGFKPKYFKCFEEPFLAMVQSSMGQSFFIFPGllPKWRSFTSPSPASLSK---
>SRR5919199_1911786 
------------ATLPVVSDHIGDIARRFYDHLFGEHPELLdGTFNRGNQAEGTQKV-ALAGSVAVFASALLKRPETVwRDWR--VAEKTD-E-------TADVVSFRMQRIDDRLVKTSLP---GQYVTVQVQMPD----gvrqprqfsltrA--
>SRR6476659_5675031 
-STHRPDQALRGGGRPPHRAADNNAKGAATGHRVSGRS---SPAELPENSMREQQQ-ALAGAVAAFASSLIETPERVpQSLLSRIAHKHA-SLGIRPDQYQVVHDNLMWAIVDVLGDAVTAEVAAAWDEVYWLMGNALINQ--
>tr|A0A1I3XAR1|A0A1I3XAR1_9PROT Methyl-accepting chemotaxis protein OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_101121 PE=4 SV=1
----QAAIQRA-EACLTLSADGLVLEA---------NDRFAALL-GLA-------PAAVADRPHA--ALLTLAERDGATYrrfLDQLAQGR-------------------------------DTVARLWHQGAggagvllELSAAVMAAD--
>tr|A0A1I3XA39|A0A1I3XA39_9PROT Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_10
----MAAIDMA-QPMMLLGADGVVQDA---------NAPLAALL-GVS-------ADALAGRPHA--ALLAEAERDSAAFrrfRDAVAAGQ-------------------------------AGHARLRHAGAggntvtlDLMMQPLAAE--
>tr|M2X1G3|M2X1G3_9NOCA Flavohemoprotein OS=Rhodococcus triatomae BKS 15-14 GN=G419_19149 PE=3 SV=1
-ILSATSRPIIEATLPVVGEHLGEISRIFYRHLFDNLPSLEsDLFNRTNQANGEQ-QKALAGAVAAFATLLVTEEAPPvDEVMSRIAAKHA-SLGIVQVHYDLVHTALFTAIVDVLGDAVTPEVAGAWDEVYWLMANSLMAQ--
>tr|A0A0N5C327|A0A0N5C327_STREA Uncharacterized protein OS=Strongyloides papillosus PE=4 SV=1
-NLSNDQQALIRKSWRRVP--KQSIGKVIYQKMCQKCPELKNFLST-D----NNCVERHFKYFGDMIQCTVDSLNDLDTaLYPWLNVIgsgHG-GFAITTTHWDAFGEALISSIKQWILTgKDHKETVRAWMKLSCSLIDTLAAA--
>ERR1719323_2694698
-RLSDKTVQLLKGSAPELKEKGTQIATHLFLSLFERYPVFRDLFPK-DNVK-S---GKMISVLPHALTVFAENADNMIQLDDIITrivKKHV-DKGVQQWHYPLLEECFLDALSSTLQLQKRPDLLQAWEDGFKFLANKLM----
>ERR1712018_308843
-------CSTPQILCSRVKRKRFTRGHTSFTSLFERYPVFRDLFPK-DN---G---GKMIAVLPHALTVFAEKADNMIELDDIITrivKKHV-SSGVQQWHFPLLEECFLDALSSTLKLDKRPELL-------------------
>ERR1719230_2183946
-WFTDDRERLLKRSWQQLQLdSCEEAGALLCRNYCSQSPEDAASCG-MDW-----------SAVIKVIGFPIDRMDNLAFVKKRLRCLganHA-KWETKEHQFQSMKYAFLSAPRDVFANEFTSDLELAWDLLYDFVSTEMIAGL-
>tr|Q9NG75|Q9NG75_9CRUS Hemoglobin P polymer OS=Parartemia zietziana PE=2 SV=1
-GITDAEKQLVQESWELLKPDLMGLGQKVFGRIFTKNPEYQTLFTRVgfgDTPltqlMANPAYGAHLIKVMRSFDFVIQNLGKPKTLLAYLKNVgadHI-ARNVERRHLQAFSESLIPVMQNELKAKLKPEAVAAWRKGLDRIIGVIDQ---
>tr|A0A0D2WU86|A0A0D2WU86_CAPO3 Uncharacterized protein OS=Capsaspora owczarzaki (strain ATCC 30864) OX=595528 GN=CAOG_006523 PE=3 SV=1
---RHETRDVIKSTWALAIQKQdeadvtpvATFVNVFFGKLFELCPETRLVFGQ-D-------LSLQGKSLSSVLTGMLEFVVHPKKlttQVKSLAVKHV-GLGITPDMFDAFGAALVYTIKTRIGKVWSPQTERVWVDAYGGVNNIITQQ--
>tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii OX=37682 GN=F775_23753 PE=3 SV=1
-TFSEEQEALVLSAWDAMKGDSAAIALKFFLRGRNN-------FVQLaHVEspkRRIPVVEERKTDL-----------------IFEIRTKTW-KIGQKSTAYRSW--LLLR--QKSLPa----HAPKGHLSElvpldTIDHTHQET-----
>tr|A0A2T5C1R0|A0A2T5C1R0_9BACT Hemoglobin-like flavoprotein OS=Mangrovibacterium marinum OX=1639118 GN=C8N47_108138 PE=4 SV=1 
--MTEADITVIEKSYAQIEAALPRMAKYFFNRANELDSDLDPLFEE-DKS-------KHGEAFVALFGKAVEHLNSPEALLPEIKKMEAklKYYKFNEEVLNTVGVVFVDTLSFGFGNNFTQDIIDPWVKAYKTYSSL------
>tr|A0A074ZRQ0|A0A074ZRQ0_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_04650 PE=3 SV=1
-SLTDAQINGVQSSWKLLKIHIEKIGVIVFLGLFEEHSDFRDAFARFRQkqlsiLTRDPAFQAHGLRVLNVVDKIISRLRRIDTIqdfLLSLGSKHC-RYVPNIELVPAVGEQLLEAIRPVLEEqgLWDDDTAVGWEAVLAYLNCAMRY---
>SRR3954463_14455484 
--AQ----------------------------PRAARPSALRLSRPGDGA-----P----FLLRAEVaCLasGI-----g-----------TF-GPGLRSHPLARLGRS-----RALRGRAVLArCPPKIWSPLD------------
>SRR6476620_12491069 
--LSDQSLSVVQATAPVVAAHADEITAHFYPRMFAAHPELLLVFNQGNQA-----TGEQSKALAGSVvAYAVQLIDPkapsFDHVMRRIAFKHV-SLGIRPERTQLSASICSLPSLRLSATPPPPrpprpgarsigCSRSSWSPR-----KHGST---
>ERR1712198_397898
-GLTEEEITEIQSTWKSIISdKTSEHGVNILIRFFKNYPEYKAqYFQNLnTLSedelRESPKLRSHGAGFVLAITQIISDLDNMlivEEVAKKIARNHY-NKGIREPlNYKLMTNTIIDYIKDIGN--LADGTMQNFRKMFDIFIISVRKK--
>SRR5580700_967641 
--------------------------------------------------------------------XMNRNIG----LFFPLIRHs---------CTYF--AQEPVLeFLG-GFKSAAAD-DQSVRVERIDHL----IE---
>ERR1719464_2687596
-NLTEEEKKVLRTSWAIISQKVDQDGESRFLHKFESNQENEDPILQQ-FT-QIDASICVNCCNIGSSFSWFdsnlcRNllSPSWSTFWLIIAQLvrsTF-FSSS------------------------------------------VKFGM-
>ERR1719375_1958814
-----ETALTVIDSWELLRRKknyAVVVGSGLFKKFFQEEPGAIAIFGFTDEEiesdeepfYQSKRFIDLAKNFVGVIDQAVDMLGPEMEmVGEVFVELSK-QYKIEIQHYMLLGNLLLEELEDVLGaKAFTDHIKSCWVQVFQVLCKDVKKKL-
>ERR671932_89059 
-S-PTSCGPARACRSCCCTPTPPRRRSR------------YdGVHEG------------------LMDLSSFPLPDD--ALFYLCgplpfmravREQLL-DLGVSPRDV--qyeVFGPDLWQADAdeGPGDAPEPgahdllgpEERQGPPPA-WSRPG-------
>SRR3712207_7345787 
-V-LDDVRALPNATVHVWYESGAASALP------------VdGVHAG------------------TMDVRSEEHTSELqSRQYLVCrlllekk--KTI------------kyeSTXX-------------------------------------
>ERR1712168_1470941
--------------------------------------------------lmLTCCkiqKPRNMLMGFSKPWAPQLIVFDTLGSLagyYTSIGVKHI-PRHLEHAHFGWMKASINEVMMSELGDAYTADFESGWDKVISFILERQEL---
>ERR1035438_5604951 
-EQTNDLARIFNDSYERVMHgpgrSSGEFFVAFYDLLTATSDEAASKFGNTDM-------AEQVRTLQSSVPVLLNFFvSsRQDEYLGKLAERHSKrGVDIPPELYDVWLDCLVETVRQFDS-KFNDDVATAWRTVFSKGIEVMTSRYE
>SRR5476649_733261 
------------------------------------VTGVQ-TCAL-PIC---GL--VRGQMFQVTMESLLDFLGDRSygANLIQIERVnHQ-GLGVEPEMFDRFYLTVMATFKDILGAGWTQETETVWGRVIAELTG-------
>ERR1719284_537611
--------ELLEQTAPLVAMRTEEIHSEFQSLLLQHNLELLSVFNIPR---QSDDVIdAETeeiasHHLAGVVLAFAAHVGHVQRmrELDQLAAKHC-SHNVHPFHYVVLHEHLLDAMRKALSTMLTPEVQYSWSQSLLFFAKILIDR--
>SRR5580704_16882803 
--------------------------------------------PG--------RHGCAAPAFLPGAQPYRRCPR-gpEGPRQPRALSAgtrAR-APKFGERHYEVFRRALIATLQRFAAPRWNETAKHAWETAFNHAATVMIE---
>tr|A3VC53|A3VC53_9RHOB Flavohemoprotein-like protein OS=Maritimibacter alkaliphilus HTCC2654 GN=RB2654_17741 PE=3 SV=1
---------MIRACLSDLYSVRIEFSRRFYDRFFEQVPEARRLFVH-NQ-------DKQALMLYAAVAMTMRGMESgrdLDGELIEFGKRHA-RLGVKQDMFPIFGSTFLETLIEYLPHHDHPKIAKAWWGGFTDMSTPII----
>ERR1711953_6095
---------------------------------------------QLGPAdTlciadqaD-GSLSQEIQWIQTTIFqVMLHYT-----------ENvpfHIPP---HKMKFQYFSDPFLGLVHNCLGKEYNSEMRKVYQSVADFLIQTLTEGY-
>ERR1712106_122433
-GLTNKQLSLLITSWKSIGSEMQAQGVTLFVEIFKNNKEVIHAFPLLNPNmKgndamtMNEAFREHGIKVMSRVNEVLHNLEQLNLCVSLIKQPvpiTGVFKGLSPISSRTFTSPSSRWPRQALARSTPRKRKQSTRPX-------------
>tr|A0A0B6ZHC3|A0A0B6ZHC3_9EUPU Uncharacterized protein (Fragment) OS=Arion vulgaris OX=1028688 GN=ORF61548 PE=3 SV=1 
-GLSARDRKLIKDTADIIFGQlkLQNKGVVFLIAFFKAYPHHQRYFKMFrGIPPdelkSIPHTENHGRRVMSNVALLVQHIEEPNVIKEQLVDLlikHN-PRSVKPRQMKDMLNMFVDFTSQQLGAKFTSQHETAWRKLTTHILSVLEE---
>ERR1719502_1452556
-VLPPEQSALVRRVWQRLVGT-PGAAPILVRQLQSVAPEVAALLSDAsstNGRSniNRGglhavhtDPHGRAAAVLSEVSELTELLDDSAALRQRLRQLRARMPPVGPEVYPSVGKAFLHFVWEGVGSGYDNATAAAFAALWDQVEETMLE---
>tr|A0A0K8QCZ9|A0A0K8QCZ9_9MICC HTH-type transcriptional repressor NsrR OS=Arthrobacter sp. Hiyo1 GN=AHiyo1_24440 PE=4 SV=1
--------------------------------------------------------------------------------MKINAFADV-SLRAL--------LVLSSAPAGELL--TTQNIADAVGTPYHHVSKAIVR---
>tr|Q6BBK1|Q6BBK1_9BIVA Hemoglobin chain I OS=Calyptogena kaikoi GN=Hb-I PE=2 SV=1
--VSASDIKNVQDTWTKLYDQwEAVHASKFYNKLFKDNEDISEAFVKAGT-GSGIAMKRQALVFGAILQEFVENLSDPTALSLKIKGLcatHK-TRGItNMELFAFALADLVAYMGTTI--SFTAAQKTSWTAVNDVILHQMSSY--
>SRR5258705_7404034 
----------------------SCPTSSSRPVLWAAvrdCAGGQTLVPR--------RYDGTRLQADGDAGRCGQQSGQSRSRVAGGERScqaSR-RPWREGGYYTPVGAALLWTLEQGFRI--------------------------
>tr|U5EPU4|U5EPU4_9DIPT Putative globin 1 (Fragment) OS=Corethrella appendiculata PE=2 SV=1
--LSENEIAIIERSWNVVKPDLTSAGEAVLYRLFEkyphnQQYFAQFKNVPLESLKGSTSFRKHVIRVMTVLKNAVEALRLDsadekiHELFLEVGNNHA-KRNITKESYNELRESIFVTLTAACE--LNSEEQEVWDKFLNCAFDISL----
>SRR6185295_10958302 
-------CILLLVA-------CFLTFKLFFYSMFQDYPEYKNLWPKFRHLndealINTGELSNFCSVYMDGWEKVIGELDDNAALareLKIIAKTHL-RKGVERshimvakkealcqiriheyCYLQNMMPKMLSLLKEKNGT-LDAEVEEAWKTVFIINADIIE----
>SRR6185295_987807 
--MSETHLELAQESLGRLNA-TPKFCGTFYQFFLESSPVIPPMFAATEFE-------VQCKQLRHGLGLLLAYAKHKnPILLERVALRHSRgDVNATPDLYPLFLESLLKAIAAHDP-SYSPELDQAWRAAVTPGVEYMKSMYD
>tr|A0A0K8S6V4|A0A0K8S6V4_LYGHE Uncharacterized protein OS=Lygus hesperus PE=3 SV=1
--ATPEQVAMVKKAFDPLSVDAPGVGKVFFERLFELYPGSQKYFQHLGStdeeLFANPVFQHHCTKVILSVGTMIDNYTQTtaektKSCLRNWQRFTP-NGKFPPSKHLTSS-IHLWTFFTWNHIQPWRKHG-------------------
>tr|A0A0G3G1X4|A0A0G3G1X4_9GAMM Uncharacterized protein OS=Thioalkalivibrio versutus OX=106634 GN=TVD_07385 PE=4 SV=1
------TPPNVESSYRRCCA-DASFLARFRLALRAADGQVSGIFDPLSA-------RQQEVMLDASIRAALDFSSGDPqgaSRVSEMIHVHGRqgRVPVPPALYPVWLESLIQAVRETDP-HWSDALERRWRAQLMPAVDMFVELYL
>ERR550517_2232778
---------------------------------------------------------------------------------------gpDQ-PKAIPHRCLPQkhrhtgsisrhHGARFLQCCPSHLAE--AQDVERRDGGLLDGSFQSDHEHH-
>ERR1719309_231760
-TLTEEEIQTVKTMWAGLLENSADSGLFIFQNFFELYPEQVHRFSFIrDSQgnpipnyLKSQAMLQHSAMVMDALDGVITGVFEHDPLLGqmmyNAGYSHH-SKNIAKDDIEKLSNSILEVIKLVASCegSGKATKVEAWRKLLNIVNERFEQGF-
>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
------NLGLVRECWDSICEQYttNELGEMVYDHLFKMAPNLTMLFTKPR--------SYMAVKMGDMLSMLVSFADSSESMkqqISWLGLRHV-KYKIRPHHIPLMGPVFLAVVAEAAGVHWSQDTEKAWSVLFNMVCVNMADA--
>tr|A0A0W1L270|A0A0W1L270_9GAMM Uncharacterized protein OS=Pseudoalteromonas sp. H105 GN=ATS75_15205 PE=4 SV=1
MGINTFEKQLLLNSLTIIKPNFHCFSYTFQMHVKR-ES--------LDMLcLSSs-KINEKTYILYCVLERIVMHLDDLRTVTPFIKHYanNLSNMGMSYEDTDILCNSFLATLKIHLKGCYSPKLENVWQQAISIFRSIVTG---
>tr|A0A063KVI9|A0A063KVI9_9GAMM Hemoglobin OS=Pseudoalteromonas fuliginea GN=DC53_02740 PE=4 SV=1
---MNTNQSVLLKSLQIVKPNFHAFTARFHRKLAE-SG--------IVMNyPTAn-QFNEKSYTFYCVLERIIKHLDNPSSVTPFLTHYleHLNKRNIQQTDIKILCDIFYATLEAHLGQHFCLQSQTAWQEFLTFFENCTNS---
>tr|A0A1V9ZUY0|A0A1V9ZUY0_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_00581 PE=3 SV=1
--PTPKDEELMTRSWDNIIGAkiraelerrklktidadDefeAssvvQFYDVFFAKLFTINPATQPVFRG--------SMHVQSKALVNIVGAIRHILHSEdaTSNIAALALRHI-QYGVKLEFFDSLGLAMIETLSAMGDtGRWNKDVRDAWHTVIAYIICILVPPY-
>SRR4029077_13489679 
----------VQADVHAISVM--LNLMQPFRALRRRVDQFAKLWLD--------PLWKTGRKAARIPA--TSTSITGRTGFAGRGRT-------------------------------------------------------
>tr|A0A016SWG0|A0A016SWG0_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0168.g192 PE=3 SV=1
-QLTSEEMDLLRSSVRIISENATEVGCNTYEMIFEQSPYVKEFFHFTKSdddAYRQKQTVQLAQKYMQVLIAFVEGIEDPSIlepVSAKLIEIHRKvddVQ--MAAHWGVFTECTLYNIRKALEKDehFNDMdrldaAVMLWRMVIRGIVRRLKA---
>ERR550534_835606
------AKKIVDESMNLLAKcDLDEFGTTFYSTVFSLSVDAQQYFYKP-----NAMMKFIAKKVLTIIAAVLHEPDETAHDIRAMGLRHM-KYGVPPDYFPLFGESLTAALPGVLEGYWDDSVRTSWEGIFEFVKNCMTR---
>ERR1712025_717817
-TLSPEHVDPITESAPSGKAKGMVIANNLYRKLFSRHEMFRAMFPEQS---------QQSGKMIQALPSALydfavncDNMGQMQSVVARIANRHV-QQGVQGFDGTFQFIPKKVDLsliPAGQCEAKLKVALNARQPGtgvgdrFQLHPSEVC----
>SRR4051812_15383594 
-PMTSDTIALIRASFRLAAADPQALSQVFFRRLLLRSPGVQRMFPAS--------LVRDPQRLVGLIDQVLRLLDRRDmlvEGLQNLGRLQA-PYAALPMHYPLIAGAFREALALRVGTLWSVDMEESWAELQALVIRIMGA---
>NOAtaT_7_FD_contig_111_1754_length_212_multi_2_in_0_out_0_1 # 1 # 210 # 1 # ID=13324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662
-RLKPKDAEYLQDSWKVFLERsggLEGAGKEFYRLLFEKEPDLKKLFQV--------PEMSQAAAFMRAISRYVSLLAQPEQLktaIEMLAFMHV-NLGISETSIFAFAESLLECVEDQLHDWDpgeVEQVMVLLTDLTTYIGRVIA----
>SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold554780_1 # 1 # 420 # 1 # ID=554780_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.669
-VLTSSIYlttgTVVTDFSVIVLDAegsAIEPGEAPYSLRVYFTPASTGTstatIQL--------PSGLISDgMLAVGARRLQEETINPRRLagaCEAYGATVTSnvlTVNVrksgTASDPCDSTDAISLLFAGGMATWNslgTSVTSADFtmstnvdsdsvTYRLTFEENVFL----
>SRR5690554_337115 
----DEYVKLLETSFQKAVENvgIEELSTRFFSRFFETFPETNSLFKGTNIDYF---RKFKMRVIFDFLIDIVKHPNYAEAHIAQEVMRHQ-MYGLqDKEYYFTLAACLLEAVKSALGDAWTDEDESAWNDILLVFKG-------
>SRR5690606_3594538 
----EHHLSVVEQTIQQAIGKsgEEALAAELFRRYFERFPETKeRYFHATNIEYF---GVRKFRIIRDFLIDTLKYPNYAEGNMYNEVMRHQ-VYGLkDKEYYFGLIDALMESVQ-------------------------------
>tr|A0A0K1PX98|A0A0K1PX98_9DELT Uncharacterized protein OS=Labilithrix luteola OX=1391654 GN=AKJ09_04675 PE=4 SV=1 
--------VVLKESWHLSYRRAPDLAARFYEELSWKYPSARRLLDHVF--------GAQNdiaVCLSTVAGDLLDNVDDPDAFSAaivALANAHV-SLDIPPHVVAWMEEVLLDTLEGAAGDDWTPEMRTTWRNAYEDLASRLAR---
>ERR1719468_1094774
-PLTSNDRKLIVRSWTIVDQQISQVGLSSFLELFRRAPETLSVFPFLkQLGPEdmefYHQLKNHSIRITGVISMLVKQLESEErpadeairDLLLDLGRRHF-SYGAKTSHMELLGRVFAESLQPIFEGDpEAKAIQEAWLVFFSVIVFWLQKGFR
>ERR1719183_1674583
LALTTDQIEAIRSSFGMVLaaaPSKEAAADTFYQTLYDASKSIQPYFVT--------PRAVAALRFVQEVSAHLSVLDDPKQLKTLVETRsfnHF-AIPVSVAAVAKVRDAIMDLFAAEIGKKFTEEAKLAWKAYFNYVGGAFI----
>ERR1719458_172070
-NLTEEEKKVLRSSWDIISQKVDQDGESRFLHKFESNQETEDPILQQFT--QIDASIFNGKSAMIIVALTLENLE-------KSHQTrtrSL-W---------IWSTT------DVFRLDWST-FRY------------------
>ERR1719278_416587
----------------------------------------------------------kNRRRPVA--TFLLKNLKatsesslYLPGLWSTIR------------------TTIPVPVrrRQPLRLSHP----------RDLLRGCKQRPQ-
>tr|C9CRM3|C9CRM3_9RHOB Uncharacterized protein OS=Silicibacter sp. TrichCH4B OX=644076 GN=SCH4B_0097 PE=4 SV=1 
--ISSRDIDLLQSSCATAFLKKGVLASAFYNKLFEIEPAYVNKFSNI---------NKQKIMFEAMLAYCISGITSgykVEALTARLRSYHM-HLEISDIDIANARSALMYALGSVLGEDFHSDLKQAWDAAFSSVSEALR----
>ERR1719419_503384
-DLSPKEILDIQMSWAEIHQEGlVNPDVLMFKLFFEESESGRLKYSHLlkNVNldnlnwmrdwTKVQKLKDSIDKTGEALGDVIKSLNYHDRVVDKLYSHgvvHA-KFGVTRKEIHTFCECLLMTLKMELGTNLSQEAQASWERLLKMIVEVF-----
>ERR1719295_364028
-DLTPEEKRCIQRTIPVILQEAEMIGTKTYLKTFHNYPLSMIYFEPLrDKLvtevkQTDDYLKKHGVLFVKFIGELVAEMDDPDSvdlKLKSLGRFHD-DLGVLKQYLEAIGPLFVQAIRPVLMtqasipsatncgvgvsspnSLWTRDTKPSWIRFFRVIALQMKRAY-
>ERR1711860_326342
-ELNSDEKTLIVTCSKQLLEIQKVLGPQMMQQKFQKV-----------------WSKEAGEL-KQLYDMR------------------------------------------------------------------------
>tr|A0A2A6B374|A0A2A6B374_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_54161 PE=3 SV=1
--IPDDeekkLtSQILCDSLSLAIvgngEPPVENGQEFYQFLFTIDPRLQSHFVGADEfmgqdPKEPTKFAKQGQRLLMAIHTMAASFDDSEAFDKTVSdliKRHK-DRHVDPALWNKFFGWFVTFLKSKGE--LTSIEEDAWKQLGIRFN--------
>tr|A0A0B1T604|A0A0B1T604_OESDE Globin OS=Oesophagostomum dentatum OX=61180 GN=OESDEN_07088 PE=3 SV=1
--VSAADvRKLTSASMATVPvsspSDKTKHGNDFYQYFFTHHPEVRKYFKGAENyaaddVAKSERFDKLGNDILLAVHVLTETYENDNVFRGVCRdviNRHV-EGgrHLDPALWKQFCSIWVAWLESKGAK-ISADQKAAWDTLSVTFN--------
>tr|A0A0R3RQ08|A0A0R3RQ08_9BILA Uncharacterized protein OS=Elaeophora elaphi OX=1147741 PE=3 SV=1
--MSHSElKAKCIKVMNeVGRvgtdDEAIQHGKNFYKFMFDHHPDLRIYFKGAENysgtdVQNSDrfNYGFSGQRLLLGVRTLIDIYDDIETFKAYARetvNRHI-KFKMDRTLWLAFFTVLVSSLKEHIT--IDEETEKAFLQIGKEFS--------
>tr|A0A1S0U934|A0A1S0U934_LOALO Globin family protein OS=Loa loa OX=7209 GN=LOAG_01385 PE=3 SV=1
--MSHLEmQAKCMKILNeAGRvgtdEEAIQHGKNFYKLFYVWP-----------------SSGFTGQKILLALRIVINTYNDPETFKAYARemvNRHI-RFKMDRTLWLAFFTVLVNSLKEHTR--IDEETEKAFLQIGKEFS--------
>tr|A0A1Y5FEW2|A0A1Y5FEW2_9PROT Uncharacterized protein OS=Halobacteriovorax marinus OX=97084 GN=A9Q84_13980 PE=3 SV=1
-------------------ENIDQFVESFYEHFFSLTPEIFELFKNSEIG-------KQKNEFKISIHTLLINLSQLDkldSYFKDLGIRHI-CYNVSERHYKLAKESFLYAIKKTYADHWSKVVETKWEEIIDHVTLKMKEG--
>tr|T0SGR6|T0SGR6_9PROT Globin OS=Bacteriovorax sp. Seq25_V OX=1201288 GN=M900_0432 PE=3 SV=1
-------------------VNLKKVIDDFYNLFFNEENDLTRIFRNTELT-------LQKHELQKSLELLLSNILDKEevsKYLRDLGVRHI-TYEVKPYHYEQAKQALLLAIKNNLKESDFIKEEKAITEFVTFICINMMNG--
>tr|A0A2E2XNM9|A0A2E2XNM9_9GAMM Uncharacterized protein OS=Cellvibrionaceae bacterium OX=2026723 GN=CL693_20675 PE=4 SV=1
------DIDWIESSLELLAPHADRLGGLVYPRFFVHFPEAETLFGG-GELG-----KSTQESMIVPLLMGLKDIADGKtymlTIERWLED-HR-EYGVTLPMYSVMLDSLLLGMREAVGDLWTTEMDGAWQEVLARLLLLVEGVY-
>tr|L7L9M1|L7L9M1_9ACTN Uncharacterized protein OS=Gordonia hirsuta DSM 44140 = NBRC 16056 OX=1121927 GN=GOHSU_25_00750 PE=4 SV=1
------IRQAVLESLARYEESHGDPTRAIYERFYRVHPEAIEELAF-D--------TVLENRMMAGILALLADVADGSidpgGAVYWVSD-HV-AWEVSETMIMGMFGAVRDTVREGLGPEWTARMDADWAGLLAALAPAMRDAV-
>ERR1719478_64653
-SLPTAQIEAIRNTLNMVISaapSRDAAADTFYQTIYDASRIIQPYFVS--------PRAVQALKFVQGIANDLAVLDDPPQLKilvETRSFGHL-ALPVSVPLVVKVREAIMDLFNVELGSKFTAVAKTGWTAYLNYVGGAYI----
>tr|E3MNQ8|E3MNQ8_CAERE CRE-GLB-30 protein OS=Caenorhabditis remanei GN=Cre-glb-30 PE=3 SV=1
-HLTPIDREILNKSWAIVSKDMQQVAVNIFQMIFEQAPDAKLMFSFMmkDYkeDKKSNEFIFHAVRFLQVIESTMTHLDDPSQldaVFLNLGKIHAkheEQLGFSAHYWSVFKECVLFHFRKAMKAHnkFSkhkemsfAEIDSAiilWREVLRFIIDRMKVGYC
>SRR5690606_31308825 
-FMGYANSDIVLQSYGRCC-RDEPFFEHVYNVFRSQSEDIRDMFTHTDMT-------EQRRLLRAGITWMIMHSRGGgRSKLESLGKSHNrHGYNVPPALYRHWLDALVESVAAYDP-HYDATLEQHWRGVMTPGIEIIASAYX
>SRR5438046_4862914 
-------SNPIERSFELAAERCEDLTPLVYRRLFDAHPEARTMFRTE-GS---EL--VKG----SMLALTIDAVLDFAGertgHFRLIEaevSSHD-AYGTPRELFVAFFGVIAQTLREIVARTGRTTSMRrgGSCSVTSKVSLQGS----
>SRR6266403_3319847 
-----------------------DAARL--SPPVSQTPGSQNDVPKR-RQ---PA--GKG----FNVGADHRRHPGFRRraigELRMIScevQSHD-AYGTPRELFGEFFGAIADTLREILGSDWSPEIE-eAWRELLVELDRVVT----
>SRR6266481_9249308 
--------------------------------------------------------------------------------------------TNWRSLVQFALEEIVTDIDLLL--DRIVVAVDavgdqrvaRDDRILVELDRIQA----
>ERR1700744_2408068 
------------------------------------HPEAESLFRRG-PS---MR--CPT----GRP----------RSgtpg------gscwtkliaSAlSA-RHKSRRLKSSLPLEEIRADVGFLL--DRVVVAIDavgdervvRNDRVLVRLDRVQS----
>ERR1719178_87025
------NKHLIDETMERTADaNISDLGSICHRKLFSLSADVQNYFYKP-----NTMVAYILEKVLYILSNLSHEPVAIAHEIRALGMRHI-KYNIPPIYFPLFGKALVFTFGSTLEGFWTDDIENAWGSVFDFVCRCMTR---
>ERR1719158_1490032
----------------------------------------------------GGQLSFICRGHSSRIN------------RNALRVRRsrI-TNRSHSNCFSSYT----------RCSISSITCASAWATCLLR---RL-----
>tr|A0A044TBZ8|A0A044TBZ8_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1
-NFDDAEIQLLRRSWKTIKPEKQT---------VLQCPEVRRFFPFMNSdlkscEKKNKRFVFQALRFIQvdmtIFNEIIISSF-------s----------NDIAILMLVFLECSIHQIRITLLNSkldlWNRKDvdnvIILWWHLNSGICGKIK----
>SRR5215831_5553854 
-------VTDLHRSLEIAAERGGDIYPAIYDAYFARCAGSRDLMELTDIC-------MRGRMLDSLFELLMA--DDAASQVAYLhfeTKNHS-SWGVQPQMYDNLLTATRDTVRGACGPDWTPAMAAAWDARIGDVIR-------
>tr|X1ZVE5|X1ZVE5_CAPTE Uncharacterized protein OS=Capitella teleta PE=3 SV=1
--LKTEQVALLKSSWQQLCVKrsPYFLGRQIFLRVFELNPEIKKSFQFGEFHgndlINNPMFKIHVKNFVSVIDSSIRSVDSLKTVlAPTLhtlGGTHQSVEGFNKNNLEIFLKAMLLVLRQEFKSALDvddLEVEVAWRKLLEFIVYQIHIGYR
>KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1083625_1 # 3 # 881 # -1 # ID=1083625_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.686
-----------------------------------------MEYEI--------CLEPSGIRFMADAGQNIVEAAKQHGIpIKHGCASgscgdCK-GTILsgDSEQGPFMPLLLLPTERAA-G-------MAILCKLYP-RSDLRL----
>tr|H3NRG3|H3NRG3_9GAMM Uncharacterized protein OS=gamma proteobacterium HIMB55 GN=OMB55_00005550 PE=4 SV=1
---SQSDIAIISESLTLCGDCLEDITPHVYRRFFELDASAASLMEYSDEH-------MRGRMFASVLELFLSdDPFESDGFLAWELDNHVSSYSVTKSMYESLFKAFFEVAEETLGEDWSGDFERAWTNRIARIMAEVS----
>tr|L7MTK4|L7MTK4_SYMRO Neuroglobin OS=Symsagittifera roscoffensis OX=84072 PE=1 SV=1
MQVSEEQQSLIMEDVQVLLPNYDDFVEDVLQQFMEENPETFQIFPWADASKtakemrSHPRFKSHAKSIGKVISDCLVDLNGVKKHepkLSSLGAMHT-KKKVPTELFGKLGGCILTQVVKRVSeAKWSEEKKEAWLKAYGIITV-------
>SRR5215204_501118 
--VTRRDWQRLLENWERLQPSADRFATVFFDTLFAWEPQARQLFGGA-------TLETQFLRFAHLLTSLVSAQDHPDELDRRIDAViRCFAGgDPPRKREDAIRVAVAAMLNDVYAAGITPETRASWQSAYIGVITTIRS---
>tr|A0A1W2GS79|A0A1W2GS79_9BACT Uncharacterized protein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4044 PE=4 SV=1
-DLNIRERKNIRDTWKVLAPNIHEFAFSFYSNLHSLDSSLVPLFENE------FGIIKQGDKALYVLGFVVASLDNLMvareGIKKALEGVFMEHQHIKRADEQKVMKAFLQAMKSTLRGVWTNEIAISWYRLLSLISAVSI----
>SRR5512143_1477374 
----------EPhDSCVRCF-AVPTFVGRFYARLFSEHPDVGRYFVGIDCA-------RQEQLLRASIPLLVLAPGgsaAARAALERLGRHHGpDGIGVEDVHYERWIACFLATVRD-CDHGWSPAVDSAWRHTLAHGVAVMRRAA-
>tr|O97381|O97381_ARTSA Hemoglobin C1 polymer OS=Artemia salina OX=85549 PE=2 SV=1
-GLSGLEKNAILNTWGKVRGNLQEVGKATFGKLFAAHPEYQQMFRFFqGVQlaelVDSPKFAAHTQRVVSALDQTLLALNRPSDFVYMIKELgldHI-NRGTDRSHFENYQVVFVEYLKETLGDSVDEFTVKSFNHVFEVIINFLNEGL-
>SRR5512139_12076 
-----TDLELIEASIEQMLDLETEIIGDTYARLFAHCDGARALFGPNTYG-------PRAQMVN---ETIIAGLDLLRGepwvheYMTQHGVRHRHSYEVTDAMYRTYAESLLGAIRERLGDRFTPELEAAWS---------------
>ERR1719193_549257
-IFTDDELAILKDVWAHLKHHTAGAGLTILDHFFKRQHWALERFEALrDMYgnihpdyMKIDLMRFLAVDLMEGIDIFVTGFFERD---PEVTDLiadvgyaYV-KKIIIESEIEIFVDSMLAAMEELLGEDtWK-KNMAPWKKLMPVVAEHFSRGFK
>SRR3989304_6997408 
--XMTTNLDAVTASYHRCRA-SAGFFDTFYECFPARSEEVAEKFRQTDFT-------RQKLMLRESLISMLLFnlgTGSARAELEQLAKRHSRdRSEEHTSELQSRLHLVC-----------------------------------
>tr|A0A0P5Q0G6|A0A0P5Q0G6_9CRUS Uncharacterized protein OS=Daphnia magna PE=3 SV=1
-SMKGRGSCFDQGHLESCKKN-GNIAPKAFIRYLKLKPEAQKKFAAFaEVdladLPTNSHFLNQAYTCLAGLNAYSDNLGKNPKSCPYLNSPAF---KdVKPDELKLFGEVMFNVMEKNWTIIFPRQARKAWKDGLTACDVA------
>tr|A0A2D4BL26|A0A2D4BL26_PYTIN Uncharacterized protein OS=Pythium insidiosum GN=PINS_002968 PE=4 SV=1
-------------LEKQQNYKVTTLYDVFYAHLEQHSPELKPVFRS--------SVHIRGKVLVHISVGMRTLIASEnfVDKVLPLTKTHR-RFGVKPEHYEPLGRALLHAMQVVAL------ITRDRGRVEEPTSIILIQ---
>tr|A0A024G680|A0A024G680_9STRA Uncharacterized protein OS=Albugo candida GN=BN9_028420 PE=3 SV=1
-------------LdGMQPAERMELLYDTFHKFLELNAPELKPVFKT--------SKHTRNVVLQHIVGGLRTMLAQNvhIERVRALTKTHL-QFGVKMEYFDLLGQAVIFSMRQCSGTHWTNEIEEAWRRLYGHCSVILLR---
>ERR1719474_2118124
-SLNPTQKCVIVATWHSIFlKHMNFMGKQLFVDLFKVEPNILKYFDAFrDVGlanlLQSRSFQNHGVRIMNLVKFAVENLDNPEKLqdhMHALGRLHV-KKGIDSKYLNIMGPTFCQAIRPMVMaeGQWSIDIEGAWIQLFKILAQMMRVAYE
>ERR1719244_357615
-WFVPTEKCIIVATWNTIFfKHMNTMGKHLFMDIFKMEPNVLKYFEAFrDVGlsnvLQSRAFQNHGVRVTNLVKFAVENLDNPEKLkdhMLMLGRLHV-KKGIESRVLDLMGPTFCAAIRPMVMaeGSWSLDIDSAWAKLFRILVQMMIPAYS
>tr|A0A1I2S201|A0A1I2S201_9CORY Uncharacterized protein OS=Corynebacterium spheniscorum OX=185761 GN=SAMN05660282_00995 PE=4 SV=1
----------------LLRQESGHLEPELQLQLYARHPNAQWLLRA--------G-KAVPAELVELSIHAIAAADAEgaldALAEARIRDLglaQR-RFGFPSELYQDIQEIMVSLLRTTGAD-LPFPVEFAAERTIARVCVLLQE---
>tr|A0A2S9Z387|A0A2S9Z387_9CORY Oxidoreductase OS=Corynebacterium sp. 13CS0277 OX=2071994 GN=C1Y63_03975 PE=4 SV=1
----------------ALTRHPELFRRAVTATFTGLCPAAGVLIA----------QPAAHADLPVACAWVLRNSAE-qvsDYAAAVIRQLgceHR-RSSTDPAHYALFARALRAGLDAVAAEDdLEPADVAHAAHLLEHCCTLMRD---
>tr|W5Y4C7|W5Y4C7_9CORY Putative oxidoreductase OS=Corynebacterium vitaeruminis DSM 20294 OX=1224164 GN=B843_11695 PE=4 SV=1
-------------------RNREELSAIAFDMFFATQRDARTRIRA-------------TPAIADALTLLARSCDSEgklpLDVEKRFLQRattLC-AHGLRVDDLEPLAESAHRAMLITAGG-QPFELVLPIERALQQLARTVVE---
>tr|A0A172QXP0|A0A172QXP0_9CORY 2-polyprenylphenol hydroxylase OS=Corynebacterium crudilactis OX=1652495 GN=ccrud_12565 PE=4 SV=1
----------------LVEDNAQDFLRAVKAQLLQLAPQSRGHFPT--------DDDLTHISIAETLSALLDGTGKEgevdEGTLAFFQEAaldAR-RFGITPDMLKALGEAVRTELLELCSD-LPFENVLFAERAIAATSAASIQ---
>tr|A0A1W1UZL1|A0A1W1UZL1_9CORY NAD(P)H-flavin reductase OS=Corynebacterium glucuronolyticum OX=39791 GN=SAMN05660745_01670 PE=4 SV=1
----------------RLRSVSPEFHEHVRANFFDKCPETMLVFPL--------HKENVHADLGRVLSFVFDRTPVDghltDEMRTLITQLgkdHR-KYNVSPRYFHPFVECLRDSLLTLCSD-LQFKYLNGADTALGEVSTLLAR---
>tr|K0YDT0|K0YDT0_9CORY Uncharacterized protein OS=Turicella otitidis ATCC 51513 OX=883169 GN=HMPREF9719_01398 PE=4 SV=1
----------------ILGAQRTAFRDATVDYLLRRLPRLRRVAPL--------RQRHRAEALAERAVGLVARSPQ-gmlrGEDAADLERAgraNR-RLGVPLRVYPVLAQALKAGLRAAFEAAgePYTAAARDAEALAEAACASLAR---
>tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae OX=1717 GN=mphP PE=4 SV=1
----------------LRLVTVTAHSIQAVADElraHRAEFIQAANQKP-------------DSPLADAIVQLVDHTDLDghvpESIATSWLQHaaaAE-SLGVSRDYYLTLADASRSALRHICAD-LPFAEVLGAERAITSIANTLT----
>tr|A0A0G3H0V1|A0A0G3H0V1_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium mustelae OX=571915 GN=CMUST_13735 PE=4 SV=1
----------------LR-ALSEEFSRDVFHSFFRSHPHERLVISP-------------EFPVAAAVSFICHGADANgtlyPETENRLRELaeiIT-AHGF--RSILPFADAITKSIRHYCMR-DDFFGTIAAERAVEQAAEILNH---
>tr|A0A0G3GTQ0|A0A0G3GTQ0_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium epidermidicanis OX=1050174 GN=CEPID_01535 PE=4 SV=1
----------------TLRAKSPAFRRDVLRDFFSQHPHMRLKFAA--------NEDHAHTELVFALTYLLENPTD----PELIRTLardHI-KVSPGQEVVADFFAILHRQIHRYCAD-LPYEEVRQADLKLQEIA--------
>tr|A0A0F6R111|A0A0F6R111_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium kutscheri OX=35755 GN=UL82_09495 PE=4 SV=1
----------------------------MVASHfYADVPLARLSFRL-------------QPSLVDTLIAGLSHP----LNITAW---ahdLA-HRGVDRSFYVPLSAALQHAVCHICSA-LPLVDVLAVEHRIDQIMKQLLA---
>tr|A0A2D7G1P9|A0A2D7G1P9_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP96_10880 PE=4 SV=1
------EQTCIERVLDCAAEDQPDFQQRLYDRFYQLAPSAEALMIHIDEE-------VQGKMLAEVIRLFLSpDVaVTDQQYLLFETKNHAQAYFVEPEMYRALNQALFETLKVGAGRIWSSEVESAVHNRLSKMLHGILEAL-
>tr|A0A2E1GZ77|A0A2E1GZ77_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ03_04085 PE=4 SV=1
------DQAWIETAFDCAAVDNLNFNVDVYQTFYRAEPSVASLMAHIDEL-------VQNKMLSEVIRLLLNpNIeSEEAGYLNFEVKTHIQGYGVSPLMFLSFNRAVYEVLQSSAARVWEDDLAVAVTRRFAVLSDALTEAL-
>tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ23_00915 PE=4 SV=1
------MQSSIHALLEQVATTDIDFDKKCFERFFQISEEGKTLMAHMDRV-------HRGKMMAEIYRLMMArDLDDEADYLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY-
>tr|A0A096P8B0|A0A096P8B0_OSTTA Flavoprotein pyridine nucleotide cytochrome reductase OS=Ostreococcus tauri GN=OT_ostta17g00030 PE=4 SV=1
-------------------------------------------------------------------------------------masvgsgat-DDD-GVDVPVSRCPFAhGTVTVDPYPGYVH-G---KNPRVCPRGCVPRPPSKP----
>SRR6266498_4102119 
----------VATQSYR-MHCQgrPAFYSTFYQRFFQHCPEVKTWFS--NM-------HAQYDKFDQALQFLLNYRHGCMEEPTVLSmtaNKHR-AFKLSACQFDEFERALLETLKESAHE--SDRVLKAWETTIR-----------
>ERR1719474_730311
---------NIHVTFDvALTSDPKGFAEKFYRGLLKEQPDIGQLFLDK-----NTTFDTQSARFMAMLMHAIKMLDDTDHFTQSLDSLseaHV-GYGVEIPMLDAFGKSLISQVKqfnieyyqqqqnhkgddqkeETVdilkVGRWTTKQDDSWKWFWSVVVGVMSAG--
>SRR6266536_2537548 
-PLSGREREIAMLAAAGLA--SKDIAERLYLSVRTVNNHLQHAYTKLGVS-GRAGLAEQEIKFAEKLTEIVRAMPRLDELLthtRALGARHV-SYGVRAADYQTLGNALLAALAAVLGGSFDAPTREAWTLAYNLVAETMLD---
>SRR3954465_13942299 
-PLTGREREIAMLAAKGIL--SKDIAARLSLAVRTVDNHLQRAYTKLGIT-GRDQLADVLAHDTTTHPGPX-----------------------------------------------------------------------
>tr|A0A1Q9C6P6|A0A1Q9C6P6_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene41206 PE=4 SV=1
--CVCDLAQCRGRSWAAFFVDI-------QAAYYETSRS--LLFEG--------PSQDP----------ALVALQLPAHVQAlisDGALQGL-GI--PQEHIALLQDCvecsfwtftgqtqqvmatsgsrpgdgladvlFGALFAVILtcLEAKCQQCGLVHQSMSDALGVPDR----
>tr|A0A0E9N6V9|A0A0E9N6V9_9BACT Uncharacterized protein OS=Flavihumibacter petaseus NBRC 106054 OX=1220578 GN=FPE01S_06_00290 PE=4 SV=1 
-QMNQQEIQLVCQSWQQAAEEPLRLAILFFDRLFEEAPELRQVFRT-P-------MSEKTRQLLVFFGFHINRLASGSIRRPSFEAYVW-EELLTDAQKGFLMETLSDTVAALLKPDWTPALQGAWGSFRK-----------
>tr|A0A2G2R0S2|A0A2G2R0S2_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_09030 PE=4 SV=1
-IVTPDQAIIIQESFARLSTSSDSLIQDILGTIAEGNSDLAVTIT-----FKSQNLVEQIS---TALSHIIDQLhtaDNVAEYVAHFGELLL-AQNVQDENYSSFGEALLSGLENALQNDFTAEVRDAWTSGWAMLSGIMREA--
>SRR3546814_3775940 
-----------ERSLEAVMEAGKDITPFFYDRFFALYTEQRANFYHFES--------TSGTMVNEMITSVLALASNEAWLtnsVQNFVAAHR-SYGdIPTDAYARLQDVLVDNLAQDSKSTSLNTsNYCANsl-LYSVX----------
>SRR3546814_13566968 
----------------------FTIYTTLSLNVVLPFVTHRSNFDHVES--------TSESMVIEMITLVLALASKEAWLtnsFQNFVAALR-SYGdIPPDAYARLLDVLVVTLAQVAGSRWTDEFETAWRWYVSG----------
>ERR1719397_23434
-NLTDCQVRLVLVSWPVILEEFQKVGVQCIVHLFEVVPYMKEHFQQLiNNSgkfdpkDGNvmqTVMENHAKLVMNVVHEVVTNIDALDSVTEkliQVGEKHC-KAGVEQRYLDIVGPIFCNAVRPVLLRsgIWNNRTEEAWMEVFTAIASTMRTGY-
>SRR6478672_7358577 
---------------------------------------------------------------------SRMp--CNSSTlkrrpSatscTESPTSTSP-WESAPSST-PSSASTYSPRSLRFWATPSPPRSPPRGGEVYWLFALQLVA---
>SRR4029450_1817054 
------------------------------------MARLLRVFNQGNQA-----TGEQSKALPGSgVASAV-QLIDPNApslahVMRRIAYKHM-SLGVCAEQYIVVGHYLSRRWARSSVRRSLPRSRQRGRKFigFLPFS--------
>tr|A0A177B679|A0A177B679_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_02502 PE=3 SV=1
-GLTKTDINMVLGSWESIN--NDEASSIFYRELFNTYPDTKSLFVKFySVdndkLIDNPAALKQLRVTWTAITTLIDYLKkgRIDEANKaidYLIEKHRKIKTFQGPMFNMALEPLLYLVKEKL---TSQAYIDAYKKVFGAIFLTIISKY-
>SRR2546427_1691122 
-------VVLLQTTFLRAAEMrigKRNITDFIYEDLFLKRPQLKPMFTNQ---------VLQRHKLGKMLGSIFIHLRDQdwiDEHLRDLGAMHW-RAGATPEVYPWIKDSVLAVLEEGMAPsGWNLRCQREGAGALGVSAQGMLMGY-
>ERR1719244_673251
-----GQKDLIIASWREIRICLDEVGFDTFKQLFAHHSDIRAYFPAMkKLSSndveMSRKIKEHSTRIMAVLKLFVDNIYDLEKIEPSIedlGRNHS-FRTLLGLFLSE-------RISGQL--AWR--------RCCFNYLNIS-----
>ERR1719369_2640530
---SPSQVDMLRSSWVILVRQLDEIGMKVFAKLFTVHSDIAQYFPQAkRPGS-SVFIKDLSHRVMNLLKLIVDNIEKLEMIRDTIrilGEKHY-QIGVRSEHLDLMGPIFCETIRPILVanNVWTHHVGDTWLST-------------
>tr|A0A2G2R4B7|A0A2G2R4B7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_07540 PE=4 SV=1
----------------------QSASDKFYNVLQNDLPEFTQLFTN-P-------E-KQHMMFYAALRSIDGLKDNktkLAVYLRSIGVKHK-MLGLTHYHMEIGRNAFEQAIFA-GGKDLTHDQRQFYIDSFSQIEKNM-----
>tr|A0A2D9F7C7|A0A2D9F7C7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=CMM61_16775 PE=4 SV=1
----------------------EAVAEAFYAALFREAPDVERLFRD-E-------T-NKTVMFVNALESISGLERGdphFADFMAMLGQRHR-DIGITQQHLKAGWTAFNEALDV-GGGNLTLPRRQFYRDAFKKLVAAM-----
>ERR1719378_1531842
--FHPgaDGVHRIGGEESQ--AEVRRQRSLSLPKFLDSLSGEKEKFAFNfDSMgnvlpnfHASHAQKIHSMKIMDAIDAVISEILRDHPIKQRlmdVGYAHY-ELHATSKDIRKLTTAFYKGVKDLIGIDDdNDRHLVAWKDFLNKIEEGFK----
>ERR550534_2245262
-----------------RDLRHPLGLLLALH---------GGFLSFFhGFFgsykadaMQTEFMKNHSIKIMNALDTVIAGITAQQPMREAvreIGRDHY-HKKIDKIHMRQMADGMLEGLKEVIGDAKdSTRKL-------------------
>ERR1719192_2788519
------RREIIGTMWESFREDSVSSGLFILEHFFSTYPDEMDRFTFAsGGQtdketplafiMKRERMRIHSAQLMNALDRNGHVY--GRSpgCMDQapqSHRG-------------NVCRRTGKSSGIA---------VFKWRVA-------------
>ERR1719367_1435250
-------KTQLRSTWNVIMSDMASIGVVMFLKMFETHPETLSSFIR-NVYSikeiemdewYQENLKLHAIRVMAIVEQVIHRLDEVGSVIKILMKRglsHK-RLGVQRSMLEKMGRSFVLSIQSPLEEanKWDATVEQSWLSMFRFIEFWMGLVY-
>ERR1712004_299484
---------ILRESWKHLQSRIESLGVVTFLSLFNASSETLHTYLTPeDIATlkeqdkdkmLIEKLRVHPLRIMSVLEKTVHRLEDHQRCLKMLRQYgrkHQ-RFGVPPFMFATWPGVFYLYSSPYWKNlsNGMRTFHKLGKACFNSLHLEYRE---
>tr|S9TQJ9|S9TQJ9_9TRYP Adenylate cyclase OS=Strigomonas culicis OX=28005 GN=STCU_09709 PE=4 SV=1
--------YTVEATWNILEKegMVDRFGQQLYDQLLTKNPRLRVYFYGVDLD-------EQSKTIVRMLGTAVHSYNNPVRTvefITRAGARHR-GYGVTPSVFREMEVAFFKVFPKFVGLDVFEASEEYWKDFWAVVLDLLSR---
>tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 OX=582737 GN=TSPGSL018_8354 PE=3 SV=1
---SSKIITLIEKSWAFVESRCDlmEVSNKFFERLFQRAPALQNMFTKP--------KRVQYVMLAKALDLIVRSAGETKVmneDIKAIALRHI-KYDIRQEHLNVFGSVLVETLANSVGPeNWDEDISAAWASIYGNIAAVF-----
>LauGreDrversion2_5_1035112.scaffolds.fasta_scaffold830278_1 # 2 # 232 # -1 # ID=830278_1;partial=10;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.316
--------------------------MAFWN----KHPEPAAQFVAP-------TQdtltdefepeeeqGISKEQLLSALNAAQT-------ALMMIDR-D------FNITYLNqKSVDLLKTHEALFQSIWPNFQATeefllGYCIdlfhanpshqrqmlsnpsNLPYTTTITVKDV-
>SoimicmetaTmtHMA_FD_contig_51_4416696_length_1368_multi_2_in_0_out_0_1 # 1 # 216 # -1 # ID=2511055_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.685
------KVALHTVEFAVADPSARATI--------------------------------------------ATHGLTPDDMAMLLSKRE------------LIGPAFPALLDEFYGKVVEN----------------------
>SRR5262245_66279004 
--LEPTDRIRAKQSYLKHCMGKNDFYRKFYERFFQGPEGTmakEMFADK--------DLNQQYVKLDQSLHYLLNFGDQDmmePTVLTTTATIHQ-TKGVAPEQLERFIECLIDTLSKDYQV--SGIEVDAWKNVCGP----------
>JRYH01.1.fsa_nt_gb|JRYH01001677.1|_10 # 8312 # 9718 # 1 # ID=1677_10;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.684
--MPASWVTELQEIWQDFNKrvgSRQAAGEIIYDAVKEAAPRIVIDdFRIP--------RPVWSSRFVDGISSLIAEASDLKMLRKRAEAMgfsHM-SLALSIEKCELLRDVVVSSIEQECgpgKFSAQCIARKALTIVLNYIAGALL----
>sp|Q7M416|GLB1_LIOJA Globin-1 OS=Liolophura japonica OX=13599 PE=1 SV=1 
--ISADQAKALKDDIAVVAQNPNGCGKALFIKMFEMNPGWVEKFPAWKgksldEIKASDKITNHGGKVINELANWINNINSASGILKSQGTAHK-GRSIGIEYFENVLPVIDATFAQQMGGAYTAAMKDALKAAWtGVIVPGMKAGY-
>tr|A0A090KT29|A0A090KT29_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X0
-KLTENHRKVIKSSFEIFKKNGVPNAHNIFLRMFKEYPDYKNVWSQFkNMSdeelSQTPLLWKHATTFVFGLERVIRTMDDQEMMILMIHStanQHK-SWGLKKEHFFAMVHLITDILMEEKGEpDEKYAIMEAWESFYDVLGTL------
>tr|A0A0P5DF02|A0A0P5DF02_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
--KPANDRRIIRKTWDQAk----------------------------------------------------KDGDVPPQILFRFI----K-AHPEYQKMFKSFADVpqae------LLGNGNFLAQA-YTILAGLNvviqslssqelianQINALG-----
>tr|A0A0K0JIN4|A0A0K0JIN4_BRUMA Uncharacterized protein OS=Brugia malayi OX=6279 GN=Bm1_04635 PE=3 SV=2
--LSEIQQELIRQSWQTISAKLEvneqNFGFFVYRRVFEHNPLLKRAFHVEeyDlldSIPREHSIFRQMRLFTNLIALAVRHDNELETeIAPAVFRYGQRHYKFAAEyfnegTVRLFCSQVVCAVADLLEVDIDPACMEAWIDMMRFIGCRLLDGF-
>WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1216141_1 # 2 # 73 # -1 # ID=1216141_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.347
--FPDGVCMATIELTVLPVRpleD-----DEKFQIILSEAQGGASFNPNDD--------G----GKDDGvlTIVIKNTLQDPKGLKVLVESFgfqHL-DFDLTVPRVVVFRDSMVELMEAELQDRFTYKAKDG-----------------
>SRR5690348_18181078 
-----------------SRRRHTRWTGDWSSDVCSSDLETRALFRT------------EGSELVkgSMLAMTVEAIIDFAGersgKFRMIAcEvmSHD-AYGTSRELRSEERRVGKEC--RFGWVAYPX----------------------
>ERR1719191_2635985
--LSTKSLAVVGATLPLVAKAGPSFTQHFYTRIFNAHPALFNTFNISN-----QRTGKQSGALFAAIASCATGLLTsgklPSEMLEGVNHKHC-ALNVAPAHYDVVGEHILGTITDLLNP--GQHVLDAWGELYTALANQCIKR--
>tr|A0A0K2U629|A0A0K2U629_LEPSM Cytoglobin1like [Saccoglossus kowalevskii] OS=Lepeophtheirus salmonis OX=72036 PE=3 SV=1 
--LTKKETFLIRESWKLVTPEMTKHAVGYYIGMFVSYPKWQDRFfRRIkGIplrdLRNNPILAAHSSQVFSAVSNLLNNLENTEVIVegvKKIARTHW-PLNIRGKELEAGLVLLLDYLEASFPGQISKECGDAWNKMFNAMSGVIVD---
>tr|A0A2B4SAV5|A0A2B4SAV5_STYPI Uncharacterized protein OS=Stylophora pistillata GN=AWC38_SpisGene8312 PE=3 SV=1
------------DTFGPK-ESRCREESVCKVRLLELNPNLQDAFPSFrGVsldeLMNSRSLFLHSKRLMAVVEEAVSSLDDAKELIEDLtnlGERHL-AMSITEKHLKNLQRAGPATNQDAKHRLLANKGTAQIDRHIARMEDTRLP---
>GraSoi2013_100cm_1033763.scaffolds.fasta_scaffold146077_1 # 2 # 316 # -1 # ID=146077_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.663
-------------------------------------------------IFESFCLAQ----ML----YETVGMAREPKQERIVS---------------------------------------------------------
>SRR5690606_18427011 
--VSHRN---AHEKHQPCH-AKL-------------RPLLRE-----------------PRLLRRLLYDLSGqLTRrAGEVRPERHG-----GAEASAX---------------------------------------------
>tr|A0A0N4TEQ4|A0A0N4TEQ4_BRUPA Uncharacterized protein OS=Brugia pahangi PE=3 SV=1
-PLTRKQKFVLIKNWKGIERDVTTAGIEMFLKMLTEHPEYYEFFNFRNIANtakekqaSDERLSAHGAAVMKFIGKAISQIENADAFFMLLEnngRQHAHRGAFRPEMFWASYSFTCYSFSNGFIRNFFSNI--------NLLLTKVEMSY-
>tr|A0A2M8U0Y4|A0A2M8U0Y4_9PROT Uncharacterized protein OS=Ferrovibrio sp. OX=1917215 GN=CTR53_17535 PE=4 SV=1 
-PLSPAHLGLVRATFQILAADRDRLTEMFYARAVALDPHIQRPQLV-------SNMVAQRLQFMLVLTDVVQQLDDLPSLaqtAATFARRHG-TYGASDPRFRTARAALAWAVDRILETERNSAIQLAWNAAFDLVEALV-----
>PlaIllAssembly_1097288.scaffolds.fasta_scaffold05791_3 # 3730 # 3864 # -1 # ID=5791_3;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.556
-VPTAQDKQIIRDNINILKAKKSNWGAKTMLKLLKAHPDSIKLFPKFaNVPlhelANNAEFLAYGNVFSAGLNFMIDNIDDPTAVKHILSGKDAskyFVPGVSIrQQLEETFRVAIEAIGEELGPRFTPKTRAAFTRVLRFLNQVQDDGF-
>tr|A0A1I7S4N0|A0A1I7S4N0_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=3 SV=1
----MADRQILLKSLEFMPltRDGEKQGVEIYKYSFANMPAMMPFYHLADGftadsTITSDRFQKLGCKLALATHILANLADQPETLKAYAREHvlrHI-SRKVSPRMFRGFFDILVDWMATKTT--ISEEARREWAKLGDLFSY-------
>ERR550539_353004
---------------------------------------------------------AMMQHLVKNLHDISRF---DSDIrelLTRLGQQWL-QKRVPLDFAVLLGNEYLEAvlpffHSNV-GATLALKLEVSLAYLYKEAMHFLLL---
>LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3583117_1 # 3 # 191 # -1 # ID=3583117_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
-ALAPEAVTKMRAGAEAMLAHPQEAGVFFYETLFDARPDLVSLFRTANMD-------ALSRHLIDTVVFLSRAADDLTGLrddLRNLARVHQ-VNQIPPSEYAHLAAPLLETLSRF-GHPLDAQMIRGWEVLFDRVSRIVAE---
>ERR1719199_1665450
--------PMIRECAAKVVQmDIVELGLRFYVHLFTINPAASAFFTKPKW-----MISAIFGGVLRFYVHLF--TINPaaSAFFTK-----------------------------------------------------------
>SRR5262249_23394332 
-----------------AIPISGVASELFFSRLFAIEPGLRHCFDG--------CFLGRRRAFEWMIGAAVRGRPDLRSFIQALEFMVAPSDATVHQECERLRDAFISSLSGSLGPRFTVEMMNGWLAVFELLH--------
>tr|A0A2V3J537|A0A2V3J537_9FLOR Flavohemoprotein OS=Gracilariopsis chorda OX=448386 GN=BWQ96_00611 PE=4 SV=1 
---DPETEALIKNTLPIFTKHSQQIAVQLYANLFEQHPQLKPMFCLEFLqTPgqckksPGTGMSPQAKILSDSIVNFCANLDNIDMMNNAIERIcakHV-SRHVKSDHYPAVAGAFSRAVRQVLKNELSESDLKAWDTAVSALAGVLVK---
>SRR5688500_3946624 
---DSRTIALIKESFTPIAGRTLELADRFFNNLFTRQTSVRGFFPA--------DVTEQKRQLPGVIQTILENGDKLENLEPQLREVgreYA-KQGALPTHYGAVARTFVDTVREMSGIGWQARYTRAWTSLFDSLTKAI-----
>ERR550532_2368357
-------ISMVAANFKTVKS-NQVLANTLFEHLFELEPSSKALFESK-------DLTQHKTKFVGFIGQGLKMLqgKNAKKELRELARMHM-EMGVTTLHFVFFEEAMLLGLRAAHGDKFDGELATAWTYVV------------
>ERR1719264_1394560
-------ISVVAANFKTVKS-NQVLANTLFEHLFELEPSSKALFESK-------DLTQLKTKFAGFIGQGLKMLqgKNAKKSSGSLPRCTW-RWE-------------------------------------------------
>tr|I2K200|I2K200_DEKBR Globin, putative OS=Brettanomyces bruxellensis AWRI1499 OX=1124627 GN=AWRI1499_0864 PE=3 SV=1 
-QLTREEIDLLRWSWRLVTVDddSTSLGGNTFnAADFSSYLFCIQFYNNFiSMDekvvEMIPSIRHQASSFADVLNQAIGTLEDLSkmqELLTNLGKLHARILGIERSYFKTMGEALIKTFRDWFGNNetFfPLILEEAWIKLYCFLANSIIQ---
>ERR1719396_178111
---------------------------------------------------------------AHGPGRLHRRLREQHPGLvpaagaqrPadGDLPPAL-RLVYHPPAVQRGARERDEVHRQGPGGVVTPEIAAAWSEAVLFLSKACID---
>tr|A0A2E6CQF7|A0A2E6CQF7_9DELT Globin OS=Sandaracinus sp. GN=CMN31_05165 PE=4 SV=1
--LDHSTLHAVRSSFE-RV-REPAFAAAFYERLLARDPEIRRRFAHTDFE-------RQRELFLHGLFALVDYASGGatgKLAIERLHAMHGpEQLDVPAALFDVWRDVLLETLAEHD-PEWRGELAVAWRAVLGPGIDAVRSP--
>tr|T0T344|T0T344_9PROT Uncharacterized protein OS=Bacteriovorax sp. DB6_IX OX=1353530 GN=M901_0762 PE=4 SV=1
--------TEVRKCYFRSI-ENPHFPKYFYRNLFFLSPKIEDYFKNTD-------WEHQEKALMLGLSHLFHYFDEQdtfhHKQIVRLANVHSHdNLNIHPHMYYYWIEALVMTCKKVDP-QWYEDLQYYLRETVFFPISFMISLYH
>ERR1712080_92393
MSLSAGEITAVTASFEAVKADLGTNIGKVLQKLVAEHPDLKPHFPWHavptADLLGNDGFKTHAAQVGRGFAEAAGNLSNLSaceGYYVSLGDRHK-TRGFAAAQVPMVADAFVAALQ------LTGDDASGWTKLITFVGSSIVSG--
>tr|A0A1X6NYK5|A0A1X6NYK5_PORUM Uncharacterized protein (Fragment) OS=Porphyra umbilicalis OX=2786 GN=BU14_0331s0026 PE=3 SV=1
-PPGPKAVRLLCATAPTLRAAGVPLVHRFGHLLVTRYPAVAARFDVSpaGD--WEGAVVAQVARLTAAFLAAAERMGEPACLNPVLDRIaakHA-ARVLPAGLYASVGDCLLEAVGEVLGDDAPQEVLDAWDAAYAWLGGALAA---
>tr|A0A2S3QTP4|A0A2S3QTP4_9PROT Uncharacterized protein OS=Halobacteriovorax sp. DA5 OX=2067553 GN=C0Z22_01530 PE=3 SV=1
------DKDLIIESFARIEPNLKNFTNAFFDNVVILEPGMQKVFAHADRE-------QLKASFIRALSITINNLKNPEYLKYYLQGLggnQI-KYEVSETYFPIFEEAFIQTLMLFHMNSWTPKLETAWRDCFYYIAEYIS----
>tr|A0A0N4YFT6|A0A0N4YFT6_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis OX=27835 PE=3 SV=1 
-RLSEHQRQIIIETFAEMEHHAVKNGLKMLVKLFSEYPNYKQIWPQFRAIPdsslmNAIALRRHASVYMCGLGAIIHSMKHENELALQMtriAKAHI-KWNVHRSHVVHMLDPVLDIVQE-CNPNYNNEMKQAWTTLYHIIADL-IEIY-
>ERR1719487_109746
MIMSAEAVQVVQDSFHRVDScvqIRDALEDVFFPHLFASSTQIKELFADVDL-------NMQAPMFANILNSTISSLNNPTELRPLLADFgeKCKKYGVQGEHIATAGESLIFTMKSI-DDQWDAEVEAAWMAACSAMENAA-----
>tr|A0A132A213|A0A132A213_SARSC Globin-like protein 2 OS=Sarcoptes scabiei OX=52283 GN=QR98_0035350 PE=3 SV=1
---EREEIEVLREQWDRIVHyHQECFGMKLFQRLLQLHPEYRPLFGFEeTVeeIQNTQRLKAHGINVVYMLNMLFDNFDDMDmidELIFKLVKLHM-MRGIDQIWLDDIIEPFELVLEEF-NAKIQIERIEVLRKAFIFIKNRMQELY-
>tr|A0A1Y3BHE1|A0A1Y3BHE1_EURMA Globin-like protein OS=Euroglyphus maynei OX=6958 GN=BLA29_010084 PE=3 SV=1
---CEEELQSLRIQWDKIVHyQQECFGLKLFLRLLDLHPEYLCLFGFTwDEfnYHETNQLRAHGINVMYMLNMLFDNLNDMDmfdELIGKLIRLHL-CRGIQKSWFDDLCAPFLTILEDF-SEKLSIEHPESIYKAFMFIKNRIQQLY-
>tr|A0A1Q9DB21|A0A1Q9DB21_SYMMI Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Symbiodinium microadriaticum GN=AK812_SmicGene25788 PE=4 SV=1
-KLPSHDVQILRSSWHQLMDavghDREQLGDVLYVGLTGSLAVLKDQFIT--------PRAVMSLRLFNGFRVVVEKADDPAALLNFTETLafkHL-SYEVTQVRAGLVADTFLEVLTQNVTEELPQGAGAVWRQILMYVGSAFR----
>tr|K1PS51|K1PS51_CRAGI Uncharacterized protein OS=Crassostrea gigas OX=29159 GN=CGI_10019581 PE=3 SV=1
----YRQIFNIRNGWKSVARVMEDTAKETLIRLLEKHPEYREKYPMIaSLNteeelRESLEFETYAMQIFGLFDEVIQNLENVDAALDEIEHTg----KQLTLQLITDLEECFMNSLHLVLDERFTDTLQENYRLLYGFVKSNIPQ---
>tr|A0A085MKY1|A0A085MKY1_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_01110 PE=3 SV=1
-------ASIIKEQISKIEVN-EENGGKLYEVFFTVKPEFHKFFdlKHAPEgkdVAHNQRFKTLGKLFLEKLKRIVMACEDEHQLKEEIKGLkmdHD-PRHVGLTELKGAKPILMKFIEQQVG--MTEEQKHAWTEMFKKF---------
>tr|A0A183IBE5|A0A183IBE5_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
-------KHVLMEHMKRLNLT-NKLGGKFYHQLFQSlPEAKSQFAEHFDKledVENMKYYQQLGHSLLSLLKELPEHCDDDHALKQEIMKIkkkHD-EKHVDAKMFKKSKPAILKFLTDNTQ--MTNEEKEAWDHLITHS---------
>ERR1719334_3108017
-GLTPKQAQAIISSWENLN---SECSSLLFKQLFTIFPELKEYFGFSKreLvdkILNSEEMIAHMDATWNGLDKLVLSTQTGTRFaaiGKGLGYNHF-KFEIDRQDVHKFMDFFKQVLKDDLKSQFHGDLEEAWNIWCKAVEDVFIMGY-
>ERR1719347_1061473
----------------KIM---KSClKSRLEHSGFRFSHELIMNFGFAKseLvdkILNSEQMIAHMDATWKGLDKLVLFTQTGTRFapvGKGLGYNHF-KFEIERQDVHKFMESFKQVLKDDLKSQFHGDLKEAWNIWCKAVEDVFIMGY-
>tr|A0A0X3NNN3|A0A0X3NNN3_SCHSO Uncharacterized protein OS=Schistocephalus solidus GN=TR151324 PE=3 SV=1
--FSEFEKDVLLSTWAVLNEEANKHSAAVFTLAGQMFPGLRNLFDIPcaNTekeNCESEAAKRHREAYMKMINGAIECLEYPREdFYDDLLVAgaHYaTIPGMKTEYFKVIKRATLVTWNSLLGEEFTEDVKQSWQSLLDYIITVISEGC-
>tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium GN=CMQ23_00915 PE=4 SV=1
--------SSIHALLEQVATTDIDFDKKCFERFFQISEEGKTLMAHMDRV-------HRG-KMMAEIYRLMMArdLD-DEADyLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY-
>ERR1719359_219123
-----------------IDEepmAEVVSGeDALV----AIA-DLlyQKL-------------------------------SGDEAMAQFLENVdlt--QlanNLRSLlalvfngsdWPEMHLS--gSLiddgYEDFSSILQETL----qaSPg-DDALL--ESLDKLT---
>ERR1719487_376807
-----------------EEEgatEEVASGeEALV----AIA-DMlyQKL-------------------------------SGDQAMAEFLENVdla--QlakNLRTLlaavfegndWPEINLS--aSIidegYEDFSSVLQETL----qtCLg-DNAML--ESLDKLT---
>ERR1712100_485805
---VGHVVLVV---GRCSFEcrnIVVVEGlDGSLDRLLALRkvvgiglGLPilQQL-------------------------------G----VLRHVGNVa-----------------lKVlrchFLQFSNHVLEVRSRLRldefclvgdivievilrDHgggkHeRD---------------
>ERR1719487_109746
-------RKEIEISHPELLKiGLDNVGTTFYTNLFQDSPQIQMHFIKPN-----RML---SYIVQKTIEMIGDLHPKPREVMKGLKALamrHI-KYDAPPEFFGDFESAMLKTLAQSLKSTFTEAVKEAWKAALQFIASTIV----
>ERR1719221_1379514
--------------------vLMRDIPRSAVALFGI-TVAIfeddyRDMNHEPALL--CAVL---LFVTFTvilLMNLLIAQLNTTYV-RIYQDTVgwaLI-NRASTIVEV----LA-TVSRT-KWTRFVDGLGLDEKLE---FNEGDVG----
>ERR1719460_1401436
-------REKIDNTMDVLAKhDMDDLCNKFCNKW-INADEVNGYFDKPS-----GIF---KFILLRILYLVSTIYHDPREISKEARALglrHV-KYSPPEALLPL-----------------------------------------
>SRR4051794_36238122 
------ARRTAKASYLRLQGggRERAFFAAFYENLLVSCPDVKPFFVPERMA-------HQQ----SMLNRAIQLLLDFDRAcgCPQLRqlaDGHA-GYQLTRWHYDQFVEALIRTIEQS-G-ITNPAELSAWRTTVMPAIEFM-----
>tr|E9HGU5|E9HGU5_DAPPU Uncharacterized protein OS=Daphnia pulex OX=6669 GN=DAPPUDRAFT_301206 PE=3 SV=1
-SLSDSDINLIVSSWNFLKKRLSSFAPKVFIGYLEARTDSKKMFPDFAHvniaeLATNVEFRSRACNCVASLNYIIPHLKRSFpvLQCPALKNLKT-KYNQHIDILKSLGIIWVKAMQEELDkKIFTDDVRVVWKKLFSVLKE-------
>tr|A7RWR5|A7RWR5_NEMVE Predicted protein OS=Nematostella vectensis OX=45351 GN=v1g203303 PE=3 SV=1 
-DMTYEQKYLIRETVDNRECVNekDflawRYVCELAAIFLNMHPGLQTYFSEFKhIKiDNINGSHGHPRRLLMAIDNAVTALGDSDSFsayLVELGRRHHgMNFRPGPTHFNDLRKCFLSVIKEILATasLWDFQVEEAWNRLFDSITAMMLR---
>SRR4051812_28599342 
-------------------------------------------------------------------------WVRPRSRGGRSPRSrssRS-SARRWPSGRPRPPSTS--RPDMRSGPSscgmsrarwqsifpapsrtgcasPIGVLGDP-----------------
>SRR3569832_2950508 
-------------KNNKKN-HHPNNHNTKKKANKTTTPKKTQKNKNTNFT-------RQKKMLQMSLNLLIShamGIDIVDGYLHQLAERHSRhRLNIEPHHYAAWLNSLMKAVRQHDP-K-------------------------
>SRR5262249_31239692 
--------------WACCA-RGGAS-R-AY-------AKSRERHARDGFA-------WRP----RAASGTLRageGEPEGEAHLRRLAAIHDRdHHDIRPEPYDRSLDCLPQAGRDRDA-EATPEVEEAWRDVLAPGIAVMKAAY-
>ERR1712048_439078
---------NVTTIWDSIKAVpgyEEKFGRMLYEKFYEMEPESFKLFKK-TRQpaaedvFSDPVFVQHSLEFVRLLDFFIQVLGPdIelvEESLVDFGETHQ-DYGVTLDTYSSFGEAMTETVEELLGGngKMDETSRRCWVTAYRYMSMHMTRG--
>tr|L1IAP2|L1IAP2_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_120658 PE=3 SV=1
--------NFIVSSWRKLLRKvsYADLGLSIYESV-RDVDELEPLFRFTNRV-------VQGTKFVDMLSSIVDNIHSPAEIYVKIADLaplHH-RKGVRGSQMPLMQEIVMRVFDSTLGDDMLEEEKKAWLWMWAFLTKALD----
>ERR1719336_1989132
-------------------------------------------------------QDRKGGgGTPGKLKVTAKYNDGTefvDefntvifaigrdactakmgleGVGVALNPKNG-KVlhneler-TSVDNIYAIGDvldgkpeltPVAIQAGKLLArrLAGTSEVTTDYVNVCTTVF--------
>ERR1719278_462770
-HLSTADVAILKGSWSVLEEHVTRVGVDFFIDMMTNHEEIKAVFRQMpNIPvyelKANEDLNRHGMYILGVIKKIVGKIDDTeylEKLFDDLSDLPL-LLLQQDRPHHLAKNLPKNVHSGSLYaePpvkvaEVVEELLQVLCV-VDLPHNLL-----
>ERR1719186_958210
-HLSTGDVTALKSTWAEVDSQISKVGVEFFLDMFHNHDDVKQTFREHpELPvfelKANEDMHRHSIFVLGAIKTIIKHIDDTeylESFLADLSDKQR-AVGVDANNMELFGKVFVKVMRPVLLekRKWKPEVKDSWMTFFTSIVKVMKK---
>ERR1700748_142917 
-------PALVREAWSFVSDRADQLVANFYAELFFVFKEAPMMFPS-DMT---RQRQEFGRAVVQWII-----SDDQDGLAMHLIQLgadHR-KFDVEPRHYEVAGAAMVNAWKKLAGWKWTPAHEAA-----------------
>tr|A0A1B6KXW2|A0A1B6KXW2_9HEMI Uncharacterized protein OS=Graphocephala atropunctata OX=36148 GN=g.8863 PE=3 SV=1
--LNDVEVEMIQEGWKCITESEDFFRTAFSSIDF-----TPVNFRE-DEHtdderFSRDFLKSHSVHVMNTVRTIVEDVKNPNSWMLELlriATLHK-LYGVTLEDLRKFQCSMLETLKQCLGEcNFSPPMQEVWEKVVECVVI-------
>tr|A0A1S3CW24|A0A1S3CW24_DIACI uncharacterized protein LOC103506299 OS=Diaphorina citri OX=121845 GN=LOC103506299 PE=3 SV=1
-GLTPKMVGLLKCLGVAIKPEAHRHGVNIFKKLFLMDKTVQRMFPKFacdDMcgLDENPDFHKHVDAVMKSILYMMESSGSVPDmksTLALQVKIHK-DLCIPDRHFITFGYAINEYLKETLGAKYSEDVECAVAYFWKFVASEMTAKP-
>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 OX=905079 GN=GUITHDRAFT_143733 PE=3 SV=1 
-------SARIASSWTELVKKsdYAEIGRRIYGSV-KANDTLEPLFRFTNQT-------VQGTKFVDMLSSIVENINNPQTIFEKVNELapmHH-RKGVKAAHMPIMKGIIVSLLKHVLGDEFTNEDEEAWNWIWQYLTQILD----
>tr|A0A0R3PZJ2|A0A0R3PZJ2_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1
-PFTDEEKSELLRSWKVIEAQKQAVGCDIYEMIFNQL------EPFLCVSikapkELHNKFRIIVICIVGRYEEELSSVNE------------------------------------------------------------------
>ERR1719192_2137381
------------TSLNFKHLcvQ-QLLKLPCLPRMFETHPEWRNLWQHMGgkLHiddmLTLPRFVRHTMSNLAYLDKIIRDADDQTKTIAsvqFLAKVHA-VQGIGERDFKQL----------------------------------------
>tr|W2T4S9|W2T4S9_NECAM Globin OS=Necator americanus OX=51031 GN=NECAME_11818 PE=3 SV=1
-----RDFFTLKNYWKAIDRKRQDSAQLFFSRYLNQNSENTKLYPKLkNIDgatvDmtcSDSGFEAMAASYLKVFDDVISIIEekpgDVQaacDKLTSVGKMHKtKGVQVQPKSFQAMEEPFMHMVKEMLQDRFNEKAEGLFRKFFDFCLKYILEGF-
>SRR5512134_285705 
-ALTPTHATLVRESWARLAPGRAAAVHRFRARLEAVSPRTAARFTCLDH-------EAQRDGLMIELDQAIAATGSDDDLVPALARIARrfRESGPASSEYPMVRDALLEVLAEADRGIAPPELRRAWGSLFGLLAALV-----
>tr|A0A1E4RL21|A0A1E4RL21_9ASCO Uncharacterized protein OS=Hyphopichia burtonii NRRL Y-1933 GN=HYPBUDRAFT_5624 PE=4 SV=1
-TLSSSDSQVIKRSWTELQNNnkyhKDEFVSRLFGNLLAANPNLKSVLST-DL-----IIRQQSKMFNDMLGFTIMYLDNEPLLEECMNEFvqeNPSIVALGVQYLEPMGLALIQTFRQWLGSaKFHAGLETLWIKIYVFLANCIL----
>ERR1711973_858157
--VSAAHKSLIRSTWTLMKF-NSNVAPKILYKMFTTYPETQKMFAKIAEVStfdlmENKDFLALSYTFYSQFNLIVNNVDNPEIIKSQVARMISPsFFIDpsasIAQQLERANKIILEIFGEELGSSFTDEAAAAWTSLLKIVYEVVE----
>ERR1711928_171062
--VSATQESHP---------------------------------LDLDSHEiqqqrRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFET----LCFRWIQHD-----------CQQYG----
>ERR1711928_123369
---------------------------------------------------rRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFES----LCFRWIQHD-----------CQQYG----
>ERR1740128_75568
--VTAQEKTLIRATWDQMMF-NSEVAPKFMLRLFSEESQHELGgnFaVEHHLVPggadeglllGSNDGFSNTLDVRVG-----------------------ShLLGNdai---------DVVHDVFQCFLGGSIGRGDlfnglHHNMGRFVQLVDGX------
>ERR1719219_701605
--VSAAHKSLTRSTWTLMKF-NSNVAPKILYKMFTTYPET-QKMyTRLADIPasqlmENKQFLALSHSAFAGFNMIVNNMDDPELIKLQLSKVDFPgTFVYpfpgTSLNTSKPPASSWKYSPKN-SAPLSPRKPLPLELPFELRHQGFG----
>tr|A0A1V9Y3S0|A0A1V9Y3S0_9ACAR Globin-like OS=Tropilaelaps mercedesae OX=418985 GN=BIW11_00005 PE=3 SV=1 
-SLSKEDMELLKGSWQTIRKDSKVIGRSIFVQLFREDPNLIKKFRHLDNIpaeqlPYHPKLLANALSVFYVVTSLIDHADDADtcrELVRKVAATHR-PRNITRQHFETFGVAFLHVVSSMMS----ARALNSWQRGF------------
>ERR1719510_1721190
--LAPNDITNVKSSWTTIETILLQVGIHVFIVLFETQPNMKRTFRQYRGKkhselRINEDLQRTIMYLMSNLKRLVRYINDNRATVKFMRRLakkHS-PLELDLGRIDpnEVATLFCTAIRDAKqickdqngKTSWSTEIEASWANFFGAILGAMR----
>ERR1719264_357726
--VGLCDALNIQQVWPRIEQYLLPVGTRMYISILDGRCDKIIFCNKACCRknasksssakstrsvysksvsrtcpnqvILNEELQKFVLLLMGLIRRAAKHLDNPSHSAKVIRKVtkkrFG-KLNIDVTKIAfePIALNFIASVREIMtnTRHWNTETEASYYTLIRNLIAYVQ----
>ERR1719244_2234371
-DLSTNQKNMIRDAYAVFEKNGEKNGADAFIYLITQHPDLKQVFPWGDVSneelRENQVFKDHVYVVFKGLKVAIDRIDNLKATASyyvHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQTSFNNLLQFLVGNMK----
>ERR1719193_348913
-KLEQKDIRAIREGWACITAHpgLEKTGVDWLHLSFELQPGTKHHYKNFTNKtleeiCQTPYMKILAGKYMSEIGILVEHLEHSNFVlmrLENLGHLHA-KMGVPMETLFTM----NIVMQHYFRELYSrqdvpDDCEGAWSKV-------------
>tr|B3RTB2|B3RTB2_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54901 PE=3 SV=1 
--------------------------------LIKLSPATKIYFHGVDFEkrdsylAKNTFLRNHAARFMEAINVIIGQDMDIfsvESYFRVVGSKHH-SYNLKLEHVQDISDAFLEMARNALKKKFTKSTEAAWRSFFQMVTDAIKN---
>ERR1719229_1707680
----------------------QQLGVLLFANLFKKQPLCRNLFADSDI-------SKQSLRLLDMFGWLLRSLVKEKnqmrlRTLKSLGDRHV-KYGIKIEFFGPMLDSLSDALQDWFGTNYNTQTRVALTTLFQSACNEMMKQ--
>SRR5438046_805262 
--------------------SRRSTG-GSSRS----ARPLDPCSPRPTSI-------GSTGC-CVTPSACCYFPAQPdgePTILARVADRHSRrDLAIDPALYPLFIDSLIDTVKQY-DHEFTPAVEGAWRTAVATGVEYMQSKYX
>SRR5438034_714626 
-SMTEASIIAFNESFERCMAS-GRFFDVFYDHFLRSSPEIAAKFQGTYFN-------RQKRMLNQRPATTVGQPR-----------RSAReSRKTPAAQFVStcqampsaFVSELTKSGSTX-----------------------------
>SRR5258708_7736634 
-------------------------------RFTGTSDAIREKFKNSDFA-------VQHQAMADSLYLMAVSVQGGpenLARHDMKRLYPKHqRMEITASMYDVWLDCFVATARIHD-PECTPAIESAWRECLTPGIAAMKSGA-
>tr|A0A0G4HCC2|A0A0G4HCC2_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6317 PE=3 SV=1 
--------PLIHTSFDNVLERttTEELGVRFYEIVFETAPHLQKLFKK--------PRRLQGRVFANVAALLISGIENPRFLTQELQRLslrHV-GYDIRPEHIPVFGNSLMRTIKEAaLrpspkdgqPFDFSHAHDEAWGALWGRVST-------
>ERR1711965_451221
------------------------------------AGAVR--------P-------RP--------AAVI---GFPFPLFP-LLETADMtsvAVGAHPRLRA-----L-----LRDR-G---AWYLTGPQELASVIGRLERLER
>ERR1712012_1094824
-SLTTSDIAAIRQSWILAKDAApfEVHGPAFYKLMFETYPSWRFAFNHMGGhlSievqIENTRFVKHTVTVFRFIDKCVNDLDNPTQILENIkmvAKIHA-LQGIGVKDFIIIKAFICSKSD-KVGAGRSKNSFIFFPRFL------------
>ERR1719431_737524
--LDMSQISDLQRCWSTLQLHMgeQAIAAAFYNDIITNFPSIQKYFKNIwTEStftrtiGNMNDVRKHASLVVSRLTNYMGNLHHLSEVNEDLKELgmiHAARYHITEEVVEQFVSSMATTVADLLTKedLFDPVLCGAWKRFFFMILTFLSEG--
>SRR5882757_2588511 
-SLSSRQQILARRFFDAVEASDKPLAAMFHERLSEIDDRLDGLLLE-EE----GCLLREAMVIVRTLSRNVDRLNRMVPIFRAFGRTCA-AQGIASANYEKIAPVLFWIAQECVGSEFSVEMGRALTALYDQLSREMKD---
>ERR1719199_2454663
---------LLQAVKYVPARefyatfdeaSKYQLRADVYVKFFADCPVGEGYFKQ--------SNTYLHIIAAKLMDVVVAIYIDPVAVVDmisGVGLRHV-GYAIPIELFPPWVTVWID--------------------RWRSIGAT------
>ERR1719199_1562120
--VPADLAEEAKKAWTMLITaagSKDAVGEALYSAFYE-aAPSLHYLFVT--------PRAVQAMRIFVQVNNFVNLPISPADLKNaveALGFWHM-SMDVTVPRCVVFRDCILDLFVAELGRPIEHSRAPKHWSSAVQFPSPIP----
>ERR1712176_999243
--------------------------------------------------------------------SY-AHRDTFDQLadaprtI--FYTQK---------QGHPECSEMVEKMKNIVGDE-------------------------
>LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514
-VLTAEEVQLVKSSWPIISKD-LKVAENALIKHFILHPPIQKLYTKLaNVPiselKDNDEFHAQAATAVKITHFIVNNLDNDELLTAMLSKVTIPaffvDYMDPIHQLDETTRLFLQAVKEELGNQISERTLAAWKKALDHVMLIMSN---
>tr|A0A1I3QX19|A0A1I3QX19_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter neptunius OX=588602 GN=SAMN04487991_1987 PE=4 SV=1 
--MDEQMIALVKASLKELQPHAGAVFATFQSKLAQRAPELAYRYDEVDP-------ERQGELLFEKLAIALGGVRFLDRLVPALGGVglDAGSASLTSCDFARLSEVLIAAFAEVSGNRFDPCIGAAWTTLFEELSWHMFE---
>SRR3954447_25823703 
-------EDLVKASYHRYCADKISFYKDLYKRFFKRNPDGQRFFVKTS-------MKRQC----RMLDEAVSLLTNFRtgpepTSLSRTARGHA-GLGIEEKHYRDFNVAFVESLQMA-GE-DDEDTLNAWRCMFARGTE-------
>tr|A0A024TW08|A0A024TW08_9STRA Uncharacterized protein OS=Aphanomyces invadans GN=H310_08903 PE=4 SV=1
-VLTKARIETCARSWDKVRTAATdkmkSygkpgivlFYDEFFYRLFQRDSTFRVVFAN---------SKERAEVLIKALMFMLNMRADSPEsvanmqnRCRFLGHKHRSYSLVRPHHFAAYTMTCIEVIMYWMGDEASIMVADAWSNVVGFVLRYLLEPY-
>tr|W4G1Q9|W4G1Q9_9STRA Uncharacterized protein OS=Aphanomyces astaci GN=H257_12218 PE=3 SV=1
-LITKPRLQLCLKTWEVVQSASTdkmkQygkpgiilFYDEFFYRLIERDATFSQVFVN---------VKERGEVLIKALSFILSMRADDPAdvtnmqnRCRFLGHKHRTYARVRPHHFAAYTMTCIEVIMYWLGDDASPLVGDAWSNVVGFVLRYLLEPY-
>tr|A0A2H1V3P2|A0A2H1V3P2_SPOFR SFRICE_008656 (Fragment) OS=Spodoptera frugiperda GN=SFRICE_008656 PE=4 SV=1
--LFGSqEFKACCsgMGMGKIGKGG--IGPPVtsL--tqrnttqalfhvgflPYLRAAIQwctvqvDNSFDYLgIWTepvafSVDPLLIAWlaykpTVKSEASLPAAVKSLSQTQQIp-------FR-RRSTP-----------------------------------------------
>UPI000297C1C9 status=active
--LDEYSIGEVRNGWENLERRCGtPKAA--A-EEFLHKVSAAIPKTE--------HMQKRASTVWSKLNGLLASMHDQSMFTGQLEYLalrHM-NQDISAAEIETFKGLLLEFCASKLGGMMTPEFQYGVSRLVDAVGASYQ----
>SRR5262245_14724532 
-------EDVVKKAYQRHCYRQPEFYRSFYENFFSRVPKARAMFK--D-------MARQHEM----LDFALGQLLNYSqqqsepTTLTQFVERHS-RLGLTADDFKRFGEALIATFDSELRGdCEHHRTMAALEIVI------------
>ERR1712071_238239
----ERSFTYWKDSAMMELA---------KWNARLQTPR-----------vYEVKwRRKKRNIPGRVGWRVLGAELWVRSSCRRRIRNRPYQEYFVSyvsiSQQLEETARLIIDALDEELGVRFTSYTRGVWSR-aFHFANSIMAESF-
>ERR1719204_2878153
------------------------------------------------------------------GITMMMAVVRGRPVRPAVQDigrAHY-SLRVDKDDMRQLATAMISAISDSVGTYMSPDALDAFTKLFEHIVEEFGNGY-
>tr|A0A183IHG0|A0A183IHG0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 
--FSLREKELLSVSMKKLEQLEEDNAVKIFIRLFQENPAYKSLFPKLRFmgdadIVNSTALVAHTQLILKMIKTFINGFQNESTCAVVLKRaetAHR-KFDIKPSQVSTLFPILMEILDIS-----HNETQAAWKKLFETFSIR------
>ERR1712232_1039451
---------------------------------------------------ESEEMRTHATKVMTFVGNGVASIGNPEKCerfraeCIALGKKNQ-ERGISSQDYDIATQPFVDAVEHSwlqagwrqtdaSGSIWPPGAQGAYTKFYGHMAATIKDG--
>tr|A0A0N5DPZ7|A0A0N5DPZ7_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
-NLSAKELQLIEQSWLDIE-NKDELGKEVFKRVLLSNEKIRTIFDLHtcpdDELDQNETFKRHLKSLSLFIGICATSVAvgseRLVSIARRIGEKHVNFRwvTFDAEYWLLIKGIMVDVIASKQRPKEVEKVRSAWNTLLSFVISEIKHS--
>tr|A0A183UUV2|A0A183UUV2_TOXCA Uncharacterized protein OS=Toxocara canis PE=3 SV=1
-RLSPRHRNLIIKSWSKTN--KSKIARDTFVELFKTSADIRSKFVFGDVPikrlKQEDRFLAHCERFVAALDSVIAHLDEIGAVIEnaeALGKYDIsaepihaAmAKDLRNEHWRLFGDILVERIIENDTkqPSGGSEVHAAWKMLGQLLVFHMRLGY-
>tr|A0A1Z9IBY6|A0A1Z9IBY6_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium TMED162 GN=CBD22_07770 PE=4 SV=1
-GVTQTQEQLIEQSLTHYAARHGDPYDAAFQKLYAAAPHYEGLFVL-DTD---E---GLRRNMMRTtLEMIATYIDDAYAAENlvtGARLVHL-TYEITDD-FDLFFQITRDVIAEGCADIWSDAHAAAWNTMLKDF---------
>ERR1712150_396892
-DFPSDQKQLVVKTWHYVEDHFNEVGITAFMDLFKVSPESKMIFDFLKLyHtddgKFYDLVTKHSLRILGMVSNLVKELKCKsseaadesiHDIILPLGRRHV-QYKANVIQMELLGLLLVKSLLKPIPKEeVGdkeyGQISEAYLVFFRVIVYW------
>ERR1712062_817879
---------------------------TAFMNLFKVSSDLRTTFSFFGYvNvddeKFYKLVTKHSLRIFAMASTLVKELKSRdsdasdrfiHDTLFPLGRKHV-NYRSNLIHMEMLGILILNSLMKTIPRDqLNehryKRMNYAYFQFFRVIVYW------
>ERR1712135_246677
------------------------------------------TL-------SVILKRTAEITAHKIIIVVTFQLKSKdseeadrfiHDTLFPLGRKHV-NYGSNVVHMEMLGLLIVKSLMKTIPRDeVNehrfERINDAYFQFFKVIVYW------
>ERR1719171_2780585
-NLSEEMITEVQKSWSEVLRRvdsKTEIGRIIYDSLFDRLPHLRKMFKTNRL--------TVAMRFANSVHSLVGILNNKEQTeeyVYNMALRHV-QYwsgdgSIAQANMSAFLKAVLIVFDNALDDKWTQRMEEAWGALFSYVGEAMVA---
>ERR1719203_1566926
----------------P------SHPICLRSPkrFTRRSSAVTGnCCNsstQHTTFPNR---TTSPRPWPVPWPPTPPTSSTSRPSscpavpVEAICHRHV-ALAIHPMQYVVVHENLMAAIAEVLGDIVTPAIGAAWSEAVLFLAKAFID---
>ERR1719253_2317543
--ILSPAGRVLRLRGPGFLPprcrfgrlspnhccsrvspdriavarRPPPRPRSRPTSSPSPRTSTRGc-WAATRSCCSS---STrpttspsprT--SLR--------PSPAPSRPtppTSPTC-LPS-WSPAGPWRPSVTA----------TSPSPSTRCSTSWCTTTSwrpsprswatssrrrsrpagprPSSSSPRP---
>ERR1719253_507459
--LSQSAIDVVVSVAGRDARRARPRAGPRR----------TDp-WRRRRRAARG---G-gpgrragevqtraaeGASTLGHGLVR------RGRALghgLVRHGRGHC-HDS-------------------------------------------------
>ERR1719253_479176
--HHQELLHAGVGQPPGAAA--VLQPGPQR----------PRl-HEPAX----------------------------------------------------------------------------------------------
>tr|A0A183EWZ6|A0A183EWZ6_9BILA Uncharacterized protein OS=Gongylonema pulchrum PE=3 SV=1
--LSKRQRVAIENSWKRATKsDAdKHVGIQIFFRILAARPEIKHIFGLQKIPdgrlKYDLRFRRHAVILTKTFDYIVKNLAYKEklqQHFQALGERHTVlqGRGFFPEYWETFSDCMRQTVLLWNK-EKKREITSTWYQLVSKSnFPVRY----
>LauGreDrversion4_1035100.scaffolds.fasta_scaffold358575_1 # 2 # 736 # 1 # ID=358575_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.683
-KMPKDAVQEAQATWQKWImknTDEETAGLCIFEAVFQSMPALQGLFDTT--------TPAQAGKFMKAFTECLQGALSREELklkIETLGFLHM-NIEVTTANTVLFKNAMITCMDKDLQSAFSVSAREVISKLVLYIGGAF-----
>tr|A0A0D6M6J3|A0A0D6M6J3_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=ANCCEY_05408 PE=4 SV=1
---------------------------------MPSCVRTAVTLP-----------YLEIFEPFVVIEGAVMSLDNLPALDPildNLGRRHG-KLEVNGkfrtYYWSTFLECSICIFRKTLTN--------------------------
>tr|A0A1Q9CXH8|A0A1Q9CXH8_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene31162 PE=4 SV=1
-ILLEAQIVEVNECWQGFLdcyAKPEHAGEAIFAAILDAAPSLQTFFRG--------GTALLAGKFVAGYSQMVHNLRNPDGLMGVVEHLgfqHL-DVDINIPRIAIFREAMCDAVSAELGEKLTDLGAYGLRRLVSYAGGALI----
>ERR1719174_1428107
----------------------------------------------------------------------VVDCQDQRSTLGYPPSAst---SVRCCVEQVARRaflwrkswfLTTLTIFIAGQ-AiLKYSHLDNLATERLLVFLFRAFI----
>ERR1719277_1813735
--------------------------------------------------------------------------------CMCAAETriaHL-IGRASVANMHNLRNAVGSEVCLLSSlAIRFEANHVGWAHVsvadvVAVCSSISL----
>ERR1719310_1375130
-MLPQEQSQQLQQAWALVInmsGNRDALADLIYSAFFYRLGePR-APLRN--------PAGSRSLPFLHGHQHLRRQLRrPwssaqfrrNVELRSHVLGYhrpSG-EHHSX-----------------------------------------------
>ERR1719487_3068354
-ILPQEQAEQWRPSASSLVsthSLQSLAIHLAC-VLLLRPYPSdTCTWTS--------LFPVLTSSVM--PSSICSWLSLAASX--------------------------------------------------------------
>MEHZ01.5.fsa_nt_MEHZ011529165.1_2 # 173 # 307 # -1 # ID=206391_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
-YMSIDtgnleaakvmlqdlvtiradrsryyyclddlFKWHPDIVWKLTV---------------DAPELLrtmldGMIWRS--------RVVvngnrrvnyylkhllvDEHGKFSNAM-SCIVKLQDP--EIAIHPILvq---LGDLVWNDLVYWrflrgklslVCTAGIFMVSQSMl-QYVESAGSFEERVATFICRLVV----
>tr|A0A2T3VCJ1|A0A2T3VCJ1_9ACTN Oxidoreductase OS=Micromonospora sp. RP3T OX=2135446 GN=C8054_25080 PE=4 SV=1
---------DPGELLASALVVLSPAADYFWSFMEDRSVRF---LPQ-----------QLAPMFFSTLGQMVAGRGDPAGRRAALAVMgrmYR-RFDLQPYHDTVIAAAVVDTVRRFAGASWVPEQAGQWEkgcrQALRLS---------
>tr|E5XPI8|E5XPI8_9ACTN Uncharacterized protein OS=Segniliparus rugosus ATCC BAA-974 OX=679197 GN=HMPREF9336_01410 PE=4 SV=1
---------TFVRSFHlELFGAAPELAARFPPGLGEHRGGF---VRM-----------------AEHILETFAEGADPPRLIDLLGQLgrdHR-KHRLDERDYRLAQAAFAKALVATARG---SGDGAFAAraaaLVCQVM---------
>tr|A0A246RU09|A0A246RU09_9ACTN Uncharacterized protein OS=Micromonospora wenchangensis OX=1185415 GN=B5D80_01060 PE=4 SV=1
--------------------------MREADELRSALPDR---LAA-----------HDAELLIATLRRLATD-PEPAAQAVTLTVLghaFR-RFALLPHAKLISALAGAD-------------------VPVELL---------
>tr|A0A0P5RQ13|A0A0P5RQ13_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1
-KLTPHQIRDVQRTWEHLRANRNAMVSSIFVKLFKETPRVQKHFAKFaNvavdALPENGEFNKQIAPVAARLDTIISAMDDKLQLLGNINYMrypHQPPRAIPRQTFEDFARLPIESLEAS---GVSGDDMDSWKGVLTIFVNGVSMRY-
>tr|A0A0L0FDI4|A0A0L0FDI4_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12917 PE=4 SV=1 
---TDSEVELIRSSWRALLAGDGTaaqmpllrFVEQYYKRLFRLFPDSRGVFKTRD---------TQSKSLSLLLSIIINVADEPElemnAKKKKLEMMYK-EYGMNSLLAVIAGRVLIQSLQAFLEAsnKFQASVKDAWVKCYTSIADQLL----
>tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
--------DLVLSSWDIVRQRteVQELGEKFWKYLNCMSPEQTNLFRR--------SLSMWGHLLHHIVNMLLISITDPEEYYDLMFELtirHI-RYGVRSEYLNPFGNALFATFEEILSDVWEEKTTKAWKLVWKRATCNMSRG--
>tr|B3RTB3|B3RTB3_TRIAD Predicted protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54902 PE=4 SV=1 
-----------------------------------------------------PLVRSHGLRFMKAIETMLEIEFDSNgciFLFSAIGNRHC-SYGIEADYLDYVPQAFRFMLTKALGNNYTDKIASVWDEILSHIIKAMQDKV-
>ERR1719347_2568912
---------------------------LPPPTHFLPLPGINRKVRIFqRQFgnqtsefLTGKALRDHSIRVMDALDSVIVDTLKGKDIHKqmvDIGYSHL-KMGVEPRQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED---
>ERR1719474_100483
-----EYKNILRSTWSKLLENKEEIGLKIYKSIVfDTTstPtgnglSTSIIF-------ENSDLGQSSSRFIDMLDTVISQLDEPEALTRRLEELskmHSDKYDVRKRHYMDFERGFMKAIKWELGAQRTAQHDRAWRWFWDFMLSKMC----
>ERR1719464_849876
----------------------VLIGCQTFQAFFDRHPQFLSNFDKFNAieidgVLVSSALKMHTSRVLAVVEDIVEKTGNHPRTLGDVR-------------------SSDMSIRPLvFRSgLWTIELE-------------------
>ERR1719232_2219129
----------------------CRPGCVTFTQLFAQYPMMefLGKFDNME-vegVNIGEALKSHAEAIGSVVAEIQENAGNPERIRMSLAGAghrRY-QEGVARQQLDMLGPILAHVIRPLvWEKcLWSVELEKAWTHLFDIVACLMKLGY-
>SRR3990167_8699843 
------------------ANQLEDLCRLFYAHLFAKAAHLKPLFGDSE--------DTQNFKVIKMFELIIDNVEDLTQVQPiclDMAKRHS-FYGVKNDFYQYIDEAFVWCIQQQLSLSIQDPIIHAWYAATKYISSIMID---
>SRR5690554_7960028 
--------NV--QFVSRGC-GGTRFCSLGFPH----PPSATLFPYTTLF-------RSQRHLlrngVMQIILVAR-GMSD--RKLRDLGESHNRsNYNIKPEWYDLRSEEHTSELQSRPH-LVC-------RLLLEKKKKNLNITY-
>ETNmetMinimDraft_19_1059907.scaffolds.fasta_scaffold284136_1 # 1 # 639 # -1 # ID=284136_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.595
--LPKACVSLLRQSWKQVP--QASFRKEFFDRLYIEDSSLQQIFQHPM--------VEVPENAWNVVQLMLDLLNVenvprLERFVHALAGLAFRHGRFRLAHLAPIKRALVRTVTSHASKQEKKKLSQAWEAFFYALAAVAA----
>SRR5438477_815846 
---------------------------------------------------------------------------RPHLSAHECGRLgpaVGQTLGAARNHLDAFGLALIEALNAATSLD-SVPTATEWSDAWDLTVRWTRP---
>SRR3569833_2822653 
-----------------------------------APPERHTVLHE--------AIVTNPVEVAGAIGWVVEHLHRTEEVATACGELgpaLARLLAGHEQHLDACGRSIIDAIRTGLADRWKPEFDGATSSAWELVAEWLRR---
>SRR3954471_21372458 
---------------------------------------------------------------------------------------gpaIA-ALGIAPDKLEPMSLFLVEALLAALSPMVPadRTAGGGGRGAGGGAAAGAAQ---
>tr|A0A2T7P177|A0A2T7P177_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_12319 PE=3 SV=1 
---------VITRSWKCFYEKVCSFGVYEFLNLLTDLPEYEEAMRLIKLTSsykflSAMDFNAHFLSMLTIIEKCMARLevDDLpllEDILHKVGTDHI-GRGVNPENFDLVIPPMVAGMKQMLEDKWTEKEDIAWTNFFTLMIHIMQE---
>ERR1712198_190235
--ISSEEK-hVLIDNLKMTKG-NKKFGANmll---KMFLAHPKTQSLFPNFaKLPvsslSNNAEFVAFGKMMVSGIEIFVN-cWVTNpSanislPTSLWINNLRKPPAWX------------------------------------------------
>ERR1712179_658195
--VSGNSK-nAVRATFDQMRF-NSEVAPKiml---KLFTAYPETQKMFHRIaDVAvsdlMNNRKFLHQLLCL-RRIQLHPQQhgrsrDHQTpTvqgrlP-----RHVRLPLPWYLsAapg-YFSHR----IGSVQGRAGRRlh----RR--SRLWMDFSAELRQP---
>ERR1719419_2176015
----------------------------------------------------------------------------------------------FvfLgssQKTILARsiftkkIVLLTEHTLKISVAVSPLSAADFTILKD--NLKMIN---
>ERR1711946_32375
--------------------------------------------------dEQPQIPVHQLLFL-RRIQLHPQQhgrsrDHQTpTvqgrlP-----RHVRLPLPWYLsAapg-TYPPS----HSNHTARERTAfqvlFLPQDT--SRIVLEVFRE-------
>ERR1719222_1795957
--VSAKAKSLIRDSWVQMKF-NGEIAPKIYLKTFAAHPKTLAMFPQFaKVPnrvrPHPYEpLLATAGIDYDVKLWIPSPGSEHNinveeLMARNArmleetrDTITVPATFMIrMlas----------MSNFRR-AGNRSTNDE--------------------
>ERR1719222_245222
------ARSlgrtqesHPLDLDSHEIqqQ-RRTQNPLQDVHHLSRDPENVHPFGRYtR-------FSAHGEQTVLGFESLCFRwiqhdcqqYGCSRa-DQVAVVQGRLPRHFRLslPwhfSATRANPRIILEVFAEELGSTFTKEAAAAWNSLLNFVTKGLEN---
>ERR1711911_103569
----------------------------------sraDQVAVVQGRLPRHfR----------------LSLPW----------------------HFSAtranhPhhlGSIR--RRTRLHFHQGSRCrleLPfelRHQGFRKQHRRLATHR---SRP---
>SRR6476620_89806 
---------------------RHATRQQRRPDVF----------HERQRTAGE-D--lnVLRERDVGQ---VHESLARagvavIDGVVPRIGCEVV-DLSSEMQNG--------FPQGVIL-SAAVGVGDDDG----------------
>tr|A0A0K0D079|A0A0K0D079_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1
---------------------TRDTAGEYHKQLFTLHPELAKYYDAEDIDPdsvlkvcnaddmrylayssaiQAQKFIMLGQQELQCFFRLPTVVNDERSWRSALSDFkeTFGENnNMPMKEFNKVYDAFFAAMQKHAGG-VTAEQKKEWMALFDKAYEDMKK---
>tr|A0A0P5EFU8|A0A0P5EFU8_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1
-NRPPPDP-RCPEELGKHRNGRNALVSSIFVKLFKETPRIQKFFAKFaNVavdsLAGNAEYEKQIALVADRLDTMISAMGDKLQLLGNINYMrytHT-ERGIPRAPWEDFSRLLLDVLGSK---GVSTDDLDSWKGVMAVFVNGV-----
>tr|L8DEE0|L8DEE0_9GAMM Uncharacterized protein OS=Pseudoalteromonas luteoviolacea B = ATCC 29581 OX=1268239 GN=PALB_34720 PE=4 SV=1
MSISPYQYRILTQSLAVVRPNFHCFCVSLRTQVS-HFQLNN------ALITKTEYAYQQEDGLFRFIHQCVGLTLDHPALVHFISAQakLLKSIEISERDICVICNCFLSTMQLHLGKQYTLAMRNAWRRLLHIIANILNHE--
>tr|A0A290TM25|A0A290TM25_PSEO7 Uncharacterized protein OS=Pseudoalteromonas piscicida OX=43662 GN=PPIS_a0207 PE=4 SV=1
MSITPYQYQLLTQTLASIRPNFHGFCTSWYNQIQ-HYDLRM------QIPTNVGQLIIWEHQIFDFVQNCVMRIPQQSNLLHYLQKQrgTLLFMGTSEKDISVLLFTFYSNAKKSSWQAFYHSSKKRLEQSTVTHRKY------
>ERR1719262_376372
-DVGEKVINEVIKSWQLLIKRVeskTEIGKIDFDSLFDRLPHLRKLFKTNRL--------TVAMRFANSVHTLVGALTSKEqteEFTYNLALRHV-QYWagdasIAQANMSAFLKAVLIVFDNALDEKWTQTMEEAWGALFSYVGEAMVS---
>ERR1719440_1320932
---------------LPSLSLPsLLLPSLLLPSLLFSSLLLPSMFVSPR-------L-STAMRFAMSLHSLITSLESTEKteeFTYNLSLRHV-KYWqgdasIAQENMSAFLGAILLVLENALDERCTQAAT-------------------
>tr|Q9NAV7|Q9NAV7_9ANNE Dehaloperoxidase B OS=Amphitrite ornata OX=129555 PE=1 SV=1
-----------------LRGDLRTYAQDIFLAFLNKYPDEKRNFKNYvGKSDqelkSMAKFGDHTEKVFNLMMEVADRATDCVPLASdasTLVQMKQHS-GLTTGNFEKLFVALVEYMRA-SGQSFD---SQSWDRFG------------
>tr|A0A0M4CP70|A0A0M4CP70_SPHS1 Uncharacterized protein OS=Sphingopyxis sp. (strain 113P3) OX=292913 GN=LH20_00550 PE=4 SV=1
--KERSDAALMEATLAAVAETGIDIRHTLFERFFSAYPERHPAFLNLDAA--SRRMTDETLQILFGLA---TDEGWVWPLVAELVATHR-NYGmLPTDEYDAFIDLAIDELGRAAGRAWTGAHAAAWRRQGEIL---------
>tr|T1HWR1|T1HWR1_RHOPR Uncharacterized protein OS=Rhodnius prolixus PE=3 SV=1
-SLTQNEKELLKDSWKKRGINKSTLAMMWFTKLFKANPEELlkhnhgqileELFM--DQT--N---LDYMDKLAEIFSIVVQNIDKSTlctKLIWELAMYHR-CLDLTESYFQLLKKTLLDTLIENFHPSLTPEQIEAWKKFIGIMFDIIY----
>ERR1719171_2291403
-------IPRIcgelwrkqtfklrfnilgkqihspgiPRFFQKMENVGgLLVSalllaMCFYDPEIvAHEEQIGIHIIDR------------NDAIYYVLEACNACILWLlvTNVFGfsvQLSAFkHC-VSQMaeDLAKFGTFAVVFLMAFGCAIhiTMPYDPDFEDMWVTILTLFAI-------
>ERR1700760_4852051 
----------------------------------AGSPSSPAR----------------------------RPRPA-IATEHdcrtrAPANR-APiTYGSPVD------------------ALACRRAL-NDWFRVPGVP--------
>SRR5690348_16468503 
-------------SFWLLEPVADAAMTYFYAELSSAARATWAdrdIYMS----------GPDHMIVRT--ARALVerg-------------------APSRLIHYDLVDPRVTEGQX-------------------------------
>tr|A0A0B2UXI9|A0A0B2UXI9_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_18450 PE=3 SV=1
-LLTAHQRILLQKSWNKSQKtGLENIGAHVFLKIYHREPSVKTLFGIEDVPhaelKYNKIFQNHAMTFTRSLDFILANLNKLDIVanfCRQLGRRHTQyiTRGFRPEYWDAFAEALTECAIDWEGGLRCREALNGWRTLVGFLIEEMRIGF-
>tr|A0A2W4R8Q8|A0A2W4R8Q8_9CHLR Uncharacterized protein OS=Chloroflexi bacterium OX=2026724 GN=DIU68_09390 PE=4 SV=1 
-RLSRQQKRIIQRTFSAVAVRHDLVARLTIERLRELSRTpASTCFGNT---------PEDRRRLMHLLALLVQRMDDRGALHDACVAQT-RQMGCDPFeggSTSLLAEAFIGALQSALAGRFEAKTEAAWREFFQMVERVLR----
>ERR1711911_155006
-DIIRKNCLMLYTNFTATKIAFKWILLCLNCRYFEIKPEAQKLFPAFaNVPL--KDLP-KNYAFLAAVNTCFANVHYLIekagrnpRDCPVFSKVVA-KYDA--RDVKQFGDIMMNSLKSELGSQFTDEIEESWNLALEEIAKMVS----
>SRR6478735_8357209 
-----------------------EREIAFLVARGLPsKEIAEQLFLS---------VRTVQNHLQR----IFTKLG----VtsrGEVAGVLQG-LEGPSSX---------------------------------------------
>ERR1719487_2840864
----------VRQSWAMIQAIqtssAGGFGDALFFNISVMSSEIWSLFSVS-K-------EVMAVTFTDAFTLIVSYIADPVGLAEELfgeADGVG-DVGDDQGEGiregdghDLLGHGEQ--TPDLAAHDGDVEEERVAE---------------
>ERR1719171_2815737
---------------------agaendeelrensgvedsfasgsvPTTFNEMFLFNLTVMGAGARK----N-K-------AImWMTEVLTSFDTIVANVANSKRLQEECdvlGLRIS-KYPLDFVKLPEFKACMLSSLRSLLPRTWSGTHEVAWSWLWENIERML-----
>SRR5262245_17232684 
--VEEETRALARYSYLQW-LDDDEFFSAFYESFFAGATGAKGKFRN---------VEQQRLKLRDAMTAVLNFYPGnEPTSLHRLIAVHA-ARDVTGTEIEQFERSFLEVLHQRLVERKIAeqlgpdvvaKIEQGWRELLHPVVQYVMG---
>ERR1711962_392431
-KFTAEELEAVKKVWDSLLQNGQNSGLFFFEHFFKIYPDQRAKFSFIhDQYghiepeyMETIAMRNHTMKFMNILGDLLNQVLSrDKRVKQDLSNLgytHH-ERGLKEDDVLQLEYAVIDGIHDHL---VTDVHERAWRKVFQLIRIH------
>ERR1719510_2339612
-SLTDNEVILIKSSWTYLKPHINTILIESFMSLFAENSDVKEKFYSFkNHAiedlnKkrgvglaSTNGLQRHIPRVSRAITKVVNSIENLDRVsryLEMLGKIHQ-QIGIEVQELMMLGAFFINSSKRHLPSSMQADrhYSDSWLHLFTVISTMMRKGF-
>tr|W4GBS3|W4GBS3_9STRA Uncharacterized protein OS=Aphanomyces astaci OX=112090 GN=H257_08997 PE=4 SV=1
-VLTRRHVRLIEANWTLISRGTSSaydetrhgNPDKffhrtYYSLLFAVMPSCRSIFRS--------SMHLQGKSLFAILRAMTSILhcPDIVDRMQALAGRHL-TYGCEKTDYTTAGVTLLKTLEIVSGDQWNYDVKEAYLTAFCLLMYLM-----
>GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold759411_1 # 1 # 798 # -1 # ID=759411_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.594
-----------AQFWEEHISykslaDKLEIGCAIYFGMMVHNKEMKRILKKNlhhhQ------SIENSSVKFLDMMGWLLRSLLRsdidLCGSLQQLGAFHR-NMGVNINHFDPMLKSMHETFSYYFPIKYGIQIKYAIDQIFTLAARIMTG---
>ERR1712214_179591
--------------------------------------------------------PGHAgRREGRRSARQPGTGKDRQKStkyLLELGKFHR-FSGIPNDYFGVMGTIFVHAVRPYWEEagCASEQTEVVWMMLFAHIARVMTH---
>tr|A0A1Y0I5V1|A0A1Y0I5V1_9GAMM Uncharacterized protein OS=Oleiphilus messinensis GN=OLMES_1782 PE=4 SV=1
------DQRLFWNSFDRCLsspQRDQQFAEDFYQRLYSSDRAIAEIFDRVSVS-------DQLHAVRQAVYLLQEMtpLKQAEITLDKIQAIHHqHEIRLSNAMLDKWLECLLASVELADP-EFNETVKQAWIDILTPA---------
>tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1 
---TTQQMELVKTSWTDIV-------------LFQKEAPIASLFSFVESAksdadnlLLNTAMQTHVKKFKAAMTSVVDLLPNLDAagqMMQSVGSRHA-NYGVKQMYIMTMSNAIIYALDLSLSArgKFDQATREAWTVFLGAMSRKFTEGL-
>tr|A0A2B4SF50|A0A2B4SF50_STYPI Gelation factor OS=Stylophora pistillata OX=50429 GN=abpC PE=3 SV=1 
-QMSREHMTLVQDSWHLLKGNLEGMGVDFYISLYKENTDLLCQFPYMSeQStehvmNMDDRVKRKGLVTVQHVKEAVTALRNPGSCVH-----HQKASGFCPRNLQSVGGALLYSLDKSLGQSFTSKEKDAWCTVYGIDVATIG----
>ERR1719199_1566639
----------------------------IFQHSGIQRPVFSTSSSSR--------RLCRP-CDLSMAFRPSDVLHSSTRLKAQVETMgfgHL-HLDVTPARCKLFHGALVDFFVVELGDKLTPLAAEGWKRVLTYVASGL-----
>tr|A0A0B2VDB7|A0A0B2VDB7_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_13543 PE=3 SV=1
-SMNDDTKGAICEQWHTILALydgdISRVGVAVYQRIFDAEPQLREVFGIPsFVtdLSEYEPFQRSGKLFMSVVDLCVRNIYALDAEmgpvLVMYGRRHyhQQSRGFHLRYMPIFTQCMKEFVSDCLNEKQkTSDSEDGWSLLFDYIAAKIVDG--
>tr|A0A2C9KGE7|A0A2C9KGE7_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 PE=3 SV=1
-LVTDSDIQALRSSWATLTAGPdgrNVFGNNFVLWMLKTIPNMRERFEKFNAHqsdealKNDNEFVKQVKLIVGGLQSFIDNLENPGQLQATIERLaaiHLKmRPSIGAGYFGPLQNNIHDFIEDTLKVGADDAAPKSWTRLLTAFNDVLNSY--
>SRR3982751_838383 
-----GINDQLRESAAMLTSGGteatDAVIRDFYIALFRNAPSLIAIFPG-NPAQGdfgsDHRGAKQRELLLGALAGLADLydpgdaerMTHLDSVLKRFGRSHAAfTrpdgtvSGATLDEYKAVKDALFSTLVRAAGDRWRAEYTVAWSQAFDYAAASMLL---
>GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6338290_1 # 1 # 129 # -1 # ID=6338290_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
------------------REAGlEQYAGALLRSGFDDLEtllaiedadmkdLGIPaCHVVRlRKKlqelqrqrsgtrgdFDASNP---VVAFL-----ENAGLGQYA---KLLLQNgfddmDV-LLDIEDADLKDLGvprghaIKLKKGLRELQLQQYAQEDPMPLHAAA------------
>LauGreDrversion4_2_1035121.scaffolds.fasta_scaffold1378443_1 # 2 # 412 # -1 # ID=1378443_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.550
------------------------AVRELLSEAVRCVSRGKEHFASIDME-------RQCQ----ILNDAIHMLLDFQAergnaPLRDLAARHK-PFGLTRRHYDIFLTGLLEAIAES-G--IDAAHLAAWQKTLTPAVDFI-----
>tr|A0A0P5AEE1|A0A0P5AEE1_9CRUS Di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1
-KLtp--HQIQDVQRSWENI-rNGLNALVSS-IFVKLFKETPRIQKFFAKf--aNVAVD------SLAGn-------------------AEYEkqi-ALVD--TPTPNVEFPV--------------------------------------
>tr|A0A164VL64|A0A164VL64_9CRUS Hemoglobin OS=Daphnia magna GN=APZ42_022506 PE=3 SV=1
--------------FAKF-gS-----------AAVDSLPGNAEYEKQVaLVadrlDTIISAMDDKLQLLGn-------------------INYMryt-HIERGIQRGTWEVR----------------------------------------
>tr|A0A1Y1Q0V7|A0A1Y1Q0V7_9GAMM Uncharacterized protein OS=Thiotrichaceae bacterium IS1 OX=1934244 GN=BWK78_10305 PE=3 SV=1 
--------ELIGQSWDKLAPRQTEFIDAVYELLFQQHPHYKPLFSE--------SIQREMAKMVETVAMVARvsGESEIsHPRLIKLGERHS-PLQLNRGDLENFKTAYLTVLKQFCP-EWTTECELSWEEDQSLIPG-------
>tr|A0A1B6JRB7|A0A1B6JRB7_9HEMI Uncharacterized protein OS=Homalodisca liturata GN=g.2446 PE=3 SV=1
-SLTDRDLRLGRATWFKNVDATPDFGMVIFKELFRQYPDVESYFLHLRGnAgsiFDSRTFRSHMtERVVPKLKEVFEALDKPEHLnevMTKLGLYHA-KLGVSGHLVENMLSVILDALKSVMHTKMQPDEETAVRTCL------------
>ERR1719323_1074371
--IPFEQRTLITEVWNVLQESTiRYVSNtMFLPLIVRSNKSLQKCFAALDQSlhgmelvecYGSkFDRTKHGSLFLSkLLIRVVPNMDQMDRVLPYLAELgalHQ-RHGVAKQHIDLLGLAFCAAIRGVvagGGvkGGHLHETTKAWITLIQAVCTGMKMGYT
>tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1 
----PMEVALVQSTWQRFLesPNLTTEFSAIFQRMFQMVPTAMQAFRYVnstDLDslVANKDLQKVVTMMMSEVNATLQLLDQPQALISLIRshgARHA-TYGVTRQWEETMLNAILYAVETKLSPsGFNQSEKNAWRSVLDMLGRN------
>ERR1719495_824226
-----QDIENVRKTWEKMIAKheLQGVGLVVLTAWMNEHKEIRQVFAKSfpiiDKlekdvldlvQLNDPTLNEHATIMASSFGKMIECLDDTEfvQMMIDIGKKHT-GFRVSADSFDTsLNSTLITALMALSEEKEDSPNIKSWKTVVEVMKHYLK----
>ERR1719272_197188
-SLSATQRASILASWRQLCGEDggATFCASLLGGAFEAVPETRALAGV-PEAApepeavpeaeaavaapapapakgkagatavpeaaaaveeaaeeavesaESVALRAAAAHAAVAMEIMAQQLSAPEALKESLTELGVkaasRGLGC-GAPFDRLGEALQTTLQASLGDeAFPEALAEAWRQLYAQASQEIQLQY-
>tr|A0A0N8ALQ3|A0A0N8ALQ3_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
----------------------------TKARLN----NCMLLFSE-----k--LAAFLaQASPSWPVWNVVIHPCfs--qelMANQLNVLGGAHQ-PRGATPVMLEQFXXXXSPPSSSSSSRKP-PASRNSSPN--------------
>tr|A0A0P5ANB1|A0A0P5ANB1_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
--GGNDGVETVSDQSNLFVVF-AI-FGQGIDGNASEFDEVLLGAGSLlEELDedggNDGVAVTpDVFPaglniadlVGGQFSLGISQIfgflevlgdASdqsAHTVLPGLSGL-G-VEGAAQRFSKDFLSDVTELLEHDGVSSFNAEARQAWKNGMRAL---------
>tr|A0A0P5ESR8|A0A0P5ESR8_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
----------------------------------------------FlEDASelleHDGGSS----TGFMGTTESVQLVghqllaeqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGNIGELAEHCLVL--GVGLDEA-EEDLGSDISV----------
>tr|A0A0P5I7S0|A0A0P5I7S0_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
----------------------------------------------FlEDAAelleHDGGSS----TGLMGTTESVQLVghqllagqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGKISEGLEHLLVL--GVVLDE-TEEDLGRHISVL---------
>SRR2546423_8132340 
--------------------LADVADEMFTARLLELEPQWQRVLSD--------EPTEWGRRLLRAIRQAVASFTCLGGFAEALRELGgVPAAHVGYRDYERQGAAFVGRLEHSLDKPMAGAMRESWQRVFRLLAEM------
>SRR5260221_7941029 
-----------------------IAEAMFTARLLELEPQWQAVLSD--------ERRQPTQRLLHALRQAVAGFTRLSGFEAALKELGaIPVKGCSHGDYESLGAAFIARLERSRLGPRAHQMRERGETGFSPLSX-------
>SRR5262245_33028046 
--------EHADHNYDSNLRNNANFFHSFYSRLFESSDEIAKLFEQRNV-----TMAEQYRKLDHAMVSILAFNPRLRaTTLDPQIESHA-NFGLSAAHFGLFREAFLHALRETQGA--DEYSQEAWRAILNPALTYMRDK--
>SRR5436309_12080688 
------------ASFAKLLAVWEPLMHRFHAHLEQLNPRLRYHLPPA--------LL---RYVRFELLQAVRQQT-PMEVGSGLRRFgvHLRAQGFEGPDLDTLGAAWLVALDEVLGDRFDSEAREQWLRFYKVLRSAF-----
>tr|A0A139A347|A0A139A347_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 OX=1344416 GN=M427DRAFT_73171 PE=4 SV=1 
-MLSAEQARLLKKNWKDIGASSVanpmmFVVAQFYRRLLRK-KGYKRIFEGIDIE-------TQYFKMQGALTACVEfaeNLDKFADTIRRIGARHA-RYNMTPNMMNDVVDSLVPSLKEFsldHGITWNEEIEEAYDEWLEQVTGYF-----
>ERR1740139_1939294
---DSDTIAVVKQTWKAITALPeqqEYVGMRLLHNlhpcyetsltfllvielyylsYLRVVPSARAFFPPTsDSLIDDESFRESASNLMMCIDKAINTLENQRhlrfkALLQTYGKKLS-RLHIPPSCYTMAWFALIETLQDVLEDRFTELMLAYWIDIIDPINT-------
>ERR1712129_538146
-------------------------HGDISSInhpvyytftllnkfthdsYLRVVPSARYFIPVIsDDDI-----TEKGIYLIACIDRVVRLLERQEkrrlqVLLRSYGRILL-RYDINPSNYTTAWLALIDTLQDILKNSFTELMLAYWIDIMEPTNL-------
>tr|A0A1Y6FH01|A0A1Y6FH01_9SPHN Uncharacterized protein OS=Altererythrobacter xiamenensis OX=1316679 GN=SAMN06297468_2444 PE=4 SV=1
-------STLAERSFERLAEQRGDITQDVLERYYRRYPDGRASFEHHGL--GN-RAELEGRMVSTTAFLLMQWAQDPGGTRIEQGTTivhHQDTLEIGPRLYLGLIDAVLEVLFETIPDE-SAEERAFWLSLRGEIADFLE----
>ERR1711879_742838
---------KVFQSYGRSC-NNMVFFEDFYSIFMTKSPDVLNMFANTDME-------AQRALLRSGILWLGMHARGMpDTKIRALGESHSKkKDEHQPHVLFHVAGRSDGNAFPPRP-G----LHSRTGANLAPYPTAHVT---
>ERR1712080_808083
---TAGDVQVILRNWESVWGaqfsgRRVAIGQAVFANFLDRVPDAKDLFKRVKVdQPDSPEFKAHIIRIVNGIDNVLNPLVLILVSnscLVSML----SEMASRLPCSRS----WVPLSTMFFP---------------------------
>tr|M6F3R8|M6F3R8_9LEPT Uncharacterized protein OS=Leptospira kirschneri serovar Bulgarica str. Nikolaevo OX=1240687 GN=LEP1GSC008_4081 PE=4 SV=1
MNISENQIRSLNESFDIVNLDRIKFAELFFIYLKENHPKYENIFSRIQL--------EDVKHFMNSARNISLSSVQYSQLERAIQNFgvECLKICNQAEEIPILEKAWLFALEKWLGPWYSHEVEKSWQEVFKMIHTS------
>SRR6478735_3884488 
---------------------------------------VRRTTLY--------MPRP-DGRGGTMKPVVAAGSL----AIMAFVTVgaqAP-APTPQDRMYAAVRSDDT----AAVSALLQGGA--------------------
>tr|Q25689|Q25689_PSEDC Hemoglobin OS=Pseudoterranova decipiens OX=6271 GN=hemoglobin PE=2 SV=1
--------------------HQKQNGIDLYKHMFEHYPHMRKAFKGReNFtkedVQKDAFFVNKDTRFCWPFVCCDSSYDDEPtfdYFVDALMDRHI-KDDIhlPQEQWHEFWKLFAEYLNEKSHQHLTEAEKHAWSTIGE-----------
>tr|A0A2P8XQA5|A0A2P8XQA5_BLAGE Uncharacterized protein OS=Blattella germanica OX=6973 GN=C0J52_27026 PE=3 SV=1 
--LAREEKKFITESWHAFMRLPPANSVDAFVKFLQENPKYIKFFKSVDGIPledlrYSFRVPKHVTAVLLYVNSMVHCLDNADAMFflsLQVGLMHS-NMGLTVEDFKLFNGYMVNILEDELG--LNDEGVAVWNKVLEIFM--------
>ERR1740121_2035324
-----------------FTPLt-----Cqwa-----TPHDGPAQHVL-------------------CEDGHFahFATDKCesAgHG--ArvQCPSDMPEMcaDttcgggqehccrpaggCTGgERPCPT--------TASASgSA--SgsaSGSASSRRLAgIDYE-----------
>ERR1719240_2235476
-----------YE---DEE---------------------------------------------------------------------GAqvdvmkgEDALVATADLLYQKMSEDAN---MQT-lLGNIELAELAsKLQKALa---------
>ERR1740122_169377
----K------GE--ADKSG-nAEAAGGgqGDTPETGAAQDTAAGV-------------------TDEHS--------KA--LgieISS--FDELkvDqkciaaaIDAwKLFISTAESREAAGEAV---YNA-lFEGAPS--LQALFVTPRAE------
>ERR1719243_286169
-------------------------------------SHPVNV-------------------LVSDTMwkGY----t-vRG--IrrvNYY--VKYMmlTrdgnvsqALGwFKDAADCKIISH-PVNVLVsDT--MwKGIVRKQFLGgRLWFII---S-----
>ERR1719158_147189
-----------RV--CYLYPLvhcNILAVLrelnfdGAAESLCLDAPALLPT-------------------MLDGLIwrSR----vTeNG--QrrvNYY--IKYFivDaeggfskTTEvMTDNGDPTIVCR-PVVSLVtDM--IwGRVAFRTFLYgKAWFLF---T-----
>ERR1712071_338654
--PTAEEIALIRESWPIVKKNKN-VFVEFVLEHFRVHPKTQDLLPEFAnLAiadmPSNKfFVQLTETYVVMAMQEIIDNLDNAGVLTDLLQCLNS-NWYVdyvslDRQN-RETLRIRRVGQEQKSYSRNMESneiQQQRCPQNLRQAVH-------
>ERR1711988_652294
--PSAGEIELIRESWPVIKKNKN-VLAEFVLEHFRVHPKTQELLPELAgIAladlPNNAyFVQLSETYVVLATNEIVDNLDNAGVLVNKLGENED-FQVLayyssAVATFivtnLDQEDILTHILVQQTKP--------------EQFVD-------
>ERR1711911_417752
----------------------------------------AisyPVFPSTSsLKy---------------------------DSLKKYLlDAFIf--NYCT---------LIFFL-------------fIKGNWQLgdgGIgrRIRYS-------
>tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 GN=TSPGSL018_8354 PE=3 SV=1
------------------------VGAGFLKLYAQRNPWAVEQFSF-GLR------PQHAEKMGLALELIVNSATRPQVLQHQLRVLalgHV-QMGIKPEMFKSFEEALFAFLGQVLGAhnTFDEETEGAWRWMWGIVNAVFTQ---
>ERR1719232_1195758
------ETVIIKDTWETIHKQVKAIGMEAFEKLFALNSDMSAYLPQTDDldqdetRRLSDKVKSHAKLTMETLEQVIAAIPDMTEVYNVITKMKK--LHPQTGLLEVIGPVFCNTTRHFllIQGRWSLDVQRAWLALFGEVSAMIRASY-
>ERR1719189_1497217
------GRQADEQ----VGREEAGPGHRGHRP----AQDDPAHLRgarDCGQrvrgraRRHGDRGV-QGRGQGEQS-QH-----------------HR--HQGS------HGQ---------LHGRHX-----------------------
>ERR550519_213
------NIVLLRDTWSVIHRQVNTLGMETFQKLFEINSEVSHYVSpscpDLDPdciDSTTQAIKAHATHTITILHNTVSNLCNLgd--LAGE---------------MNRLGKLHCDLGIDHgiL----------------------------
>ERR1712051_111803
-------------------------------------------------------------------SNF--HASDGHlmdgAFDPnISQIFSF-FYLFQNCEMLVFGPHFVASAMYYLPSPLrEKSTQESWLKLFSVITEIMMS---
>SRR3990167_4175368 
-GLTDGEKGMIQQSWNLLS--KVEFTKILYKKIFELAPHVRCLFQNS--------IESQHENFsimMDMmINEHINDELDLFAVVLQLAKRHF-HYKVKTDYYSIFRDGFLWSLEQTLSIEtlnktITNestnqptTIKSIWLKFVNYLISVMV----
>ERR1712212_288737
-LLTDDELFSVGNLWTNLRESSADSGLYIFQHWFDMFPEVVESFDFAkDQYgnillnlMQTKKMRNHAIGVMNKLDAMMMRLFKRDPevakLIYDVGVHHQ-TRNINEDEMTKMSKSIYSAVQDINVGPHSDKELAALHNLLEVVSYHFKR---
>ERR1719167_330163
-DLTDKERELIQHTWWRFREE-PYCRLRIMTHYFSANSSIKKKFQRKNEENaangnlmtamVSWNIRRFSIRLVEFMDKVVRDLETENyQDIYDISELqgakHYRlKRMVEPGDMEALGQSIQTTISEHFGEKFNRSHILAWRRLFIVICSRF-----
>ERR1719378_576485
-DLTDKERELIQHTWWRFREE-PYCRLRIMTPYFSANSSIKKKFQRKNEENaangnlmtamVSWNIRRFSIRLVEFMDKVVRDLETENyQDIYDISELqgakHYRlKRWWNRETWKLSANRSRQQFR-------------------------------
>tr|A0A1I8F573|A0A1I8F573_9PLAT Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1
--------------------------------------------------STNQKPPSDGDRLLYWINVQ-------PTAQPQllrGASEGC-VRLFSPRILTRSCISSNLCVRAGRGRNS----SSTeTTSAEGADAVVAA----
>ERR1719265_1594411
-------VDTIVKDWAGLD--LEKLGDTTFGMMVQNNPEIKTIFGGDVhPGVAQQGLKSQAATFVGFMSYAMTWLKKkdfivLEQKMVELGQRHV-HYGVNVSHFVSFQEAMFTALREQLGTRFE-DNKYAWTF--------------
>tr|A0A0N5AG16|A0A0N5AG16_9BILA Uncharacterized protein OS=Syphacia muris PE=3 SV=1
--PSRRQCCILHKSWHRAQQCgLD-IGSRIVMQVTKNEPTVWRTVGLTNATGadikYDKNIQYQAALFTKALTTIMSKIDDPEAVseyCRELGRRHVRhvKKGFQTRWWDTFAESLTECVIEWEGttvdltslvfhatkicGQRCKEALNGWRKLVIFIISEMRAGF-
>SRR3989338_2963815 
----PHQMTPLYHLYKENVPpqKERELGLLFYKLLFDSNPELLDFFANVDLD-------HLSDHLVQTIRLFLESrnsLVSLVPAMKALGIIHQ-RAMIPSWAFPLVIENMAKLFSILLGDRFTVELASALVLSFDLLTSFV-----
>SRR3990167_6716616 
-----EYENPIYStlknIWlETVSTpeIKSAVGELFYKNLFQYHPELLEYFNNVDMD-------SLALHLSQALDFVFQSinkIGDYksqwRTVLEHLGEVHR-AALIPTWGYPIIGQQILKIFPYNEKAGFSTKQL--etaLATLYREIVIIM-----
>tr|A0A0Q4Y6B0|A0A0Q4Y6B0_9BURK Uncharacterized protein OS=Pseudorhodoferax sp. Leaf267 GN=ASF43_05025 PE=4 SV=1
------HRVLAKYAYRQwVEPLGMQFSQAFYTRFFQDDKASRAIFERALGPRAAgLilVDDAHHNKLVGSLGKVLNYRRGsPPSSIDDLVPSHR-DKGITIEHLRHFREAFLKTLEAQIDAsdPEKRAVVDAWRQLFEPVLDAMAS---
>tr|A0A1Y5SIU2|A0A1Y5SIU2_9RHOB Uncharacterized protein OS=Roseisalinus antarcticus OX=254357 GN=ROA7023_01630 PE=4 SV=1 
-----PQAELVADSLSRVGDKVIWLASDYYEALFDASPQLHGVLPH--------QMSEQTNMLGHALAHALANLRDPDGAAPMAQDAglADRSARMPPRMRRTIVRTLVHALSLWHGPTWTKDHARAWNEGLLGVAPL------
>SRR5690606_37396704 
--FSDTDTYILHTGLKWIEEAPETFAAKLYQRLLRDHPECQASLHAIGL-------ESFNRNFIHFLKMVKEELLERHTIHVAPREFlalHALpvEKVRHSNYVIKMGRTFLDIFAELAEDAWSPALESTWNKAIEEVKIALW----
>SRR4029453_11903763 
-PMTDAELALFHDSLTRCTSQ-PPFLERFYTLFLAASDEVRHKFRQTD-------FQKQRRLLQASFYMVMLQADGKpEGavHFERIADLHSQrHLDIPPHLYDLWLDCLMQAVREYDP-EWMPGTGGLFWGRVGTCIVFFYMISV
>tr|A0A1R1LGI5|A0A1R1LGI5_9GAMM Uncharacterized protein OS=Motiliproteus sp. MSK22-1 OX=1897630 GN=BGP75_23395 PE=4 SV=1 
------QLDKIYSTLQLLDdEKSEKLINETYSIFFNAHPEAVLLWSKDDPE-------SRSKMFNGVILTIIDNLTRPDIFKnNLLSDVkdHD-EYGVDKEMYGGFFLSLTEALKKTLGSEFNQEMELAWKHQLAHIRE-------
>tr|A0A1H1BYI0|A0A1H1BYI0_9ACTN Group 1 truncated hemoglobin OS=Thermostaphylospora chromogena OX=35622 GN=SAMN04489764_1195 PE=3 SV=1
-------------LYEKIGGgpAVREVVDAFYTDVL-GDTDLKPYFDGIDMA----RLKRHMVVLLC---SVLGGPEGY--RGRELGEAHK-NLGISDEHYAKVGDKLVTALRDH-----------------------------
>tr|A0A1R2BTD0|A0A1R2BTD0_9CILI Uncharacterized protein OS=Stentor coeruleus OX=5963 GN=SteCoe_19762 PE=4 SV=1
-------------IYDRYGGqpFWERILDVFYTKNL-AEPTLQGFFIGKDVE----RAKAMNRSLLA---AALRPEGEH--FPVSIKRTHR-NMDISDAQFGKFAENLISTLGEN-----------------------------
>tr|A0A218QUH5|A0A218QUH5_9CYAN Group 1 truncated hemoglobin OS=Tolypothrix sp. NIES-4075 OX=2005459 GN=NIES4075_64370 PE=3 SV=1
-------------LYDKLGGkpTLDKVVQDFHKRIL-ADNTLQPFFANTDME----KQRQHQVAFFA---QIFEGPNEY--KGRAMEA-tHA-GMNLQQPHFDAIVSHLKESMASV-----------------------------
>tr|A0A1Z4FY87|A0A1Z4FY87_9CYAN Group 1 truncated hemoglobin OS=Calothrix sp. NIES-2098 OX=1954171 GN=NIES2098_33650 PE=3 SV=1
-------------LYEKIGGqaTLDKVVADLHKRIQ-ADSSVNTFFAKTDMA----KQRSHFVAFVA---QLLEGPKQY--AGRPMDK-tHT-GMNIQPQHFDTIAKHLSDAMAAN-----------------------------
>tr|A0A0T6BC68|A0A0T6BC68_9SCAR Uncharacterized protein OS=Oryctes borbonicus OX=1629725 GN=AMK59_2266 PE=3 SV=1 
-GLTSQQKSLIQSTFNVIRPHILNVGIDLFVRVLEVEPEHHRVLPfsHIPIadLHESFEFKFHCLAVVYSCSAIIDHLHDDGILIPLMKKYASdLKASIPLDIFQMIHDPLLEALDVHDDVKISEEALEAVRTLLRNLTNFLID---
>SRR5689334_189301 
-------LDALETSLDLVSPHG----SELMDAFFAERP-----FPAGD-------AGAQRAATLRLMGLLRLCLRDVHSVVALVRDLGA-RHGAQREQ--------------------------------------------
>SoimicmetaTmtLPA_FD_contig_71_176585_length_314_multi_3_in_0_out_0_1 # 2 # 220 # -1 # ID=1957230_1;partial=10;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.685
-----------------------RGGaveevQGPESALLESPPSLDRVATDRS--------AMIPLG-ATGLHGIMTSM--taPSMLqdlVLSLASQHL-DVVLSPPRAIVLRDAILDLFQQELGDGFDSKARSGLSLILNYVCGSFL----
>ERR1712159_177610
---STSSLNAVKNSIPLIQQHGNAIAENFYVQ--QIQPTNITFFNRAHFTS-----GQQAQTLSQFLVLLAQRSDNLELMnthLRRISNKHV-GFGIKPQHYPIFFENLFVAFKEVLGTKATPELISSWKELVSLVQEG------
>ERR1712159_799488
---STSSLNAVKNSIPLIQQHGNAIAENFYVQ--QIQPTNVPFFNRAHFAS-----GQQAQTLSQFLVLLAQRSDNLELMnthLEESPTNML-DSESNHNTTRSSS-----------KTCSLPSKKS------------------
>ERR1719323_2894579
---KVHRQTYDICD----------LILQHIQIITVHCILIQDIDQCCHL-----KTDKQVAAVVNILYQYAMNCDNLNVLENEIAdiiGLAV-NLNMEAWQYPLIAQSLVE----------------------------------
>ERR1711868_248053
---------MIKGTAKTIKEKGSSIITRMHQNLVNKHKEFKTIFPEEIL-----KDAIHMQKAVGLLHGYASNCDNMPVIEADISelvGILI-NVGVENDHYPLVAEALVEAIGTCLGSDTNAETVDAWKQALDFMVVHF-----
>tr|G5ZYB7|G5ZYB7_9PROT Truncated hemoglobin OS=SAR116 cluster alpha proteobacterium HIMB100 OX=909943 GN=HIMB100_00010220 PE=4 SV=1
----------------------SKLVSELYEELS-QNEITAPYFENSNMT----SLMDHQVKFLSQAL---GGPEQY--TGQAMNAAHT-GLKITEAAFTEVAKTIQFILEDN-----------------------------
>SRR5688500_9373349 
-------LPYTTLFRSALGDDAVGMAAELMDRLIADHPHDAHAFMNPEAA--RERMTRETLEAM--LGVA-AREPWGETTIANFVDLHH-NYAsFGADDYAARFAMTMAVMERGAGARGPGGASSAWRRQAA-----------
>ERR1719365_124985
-EMSGKQKKIVWRTWNSMLGkqesDYNDFGINFVLWLFDNFPKMRNKFDELYGRsrnslIVDQHFIAHTENVVKELDRLIKDLPFPRLLSKRISKLadsHLNqEP--------------------------------------------------
>ERR1719199_1194134
-----THAGYIEKSRESVLNlDAAQLGADIHVKFLNVYPAAASLFQKT-L-----RM-LITTKIMGTLMAVIS---DPTGTledVRAVGVRHT-KYGISERYLLPFGAMLWEIVGTMLPGMWSDEHSAAWAFYLDFIASTMTRA--
>ERR1719359_1737517
-----------------------SFGEAFRFNLGMMAPEFMAMFKTLTAE-------QFTDQFTVMVGQIVNYIDDPPKLLEDlyiLSVRHL-HYNTKPGNSLSLGKQ-------------------SWLLCEASFHRIGIG---
>ERR1719487_2229452
-----------------------SAALSL--------P-------T-EQE-------SPVTMTAEA----VQMVQDSL--RRVdsaVQV-----RDAMEDvFFPHLF---------------------------------------
>tr|A0A2E0SMS8|A0A2E0SMS8_9PLAN Uncharacterized protein OS=Planctomyces sp. OX=37635 GN=CMJ46_12130 PE=4 SV=1 
--ISERQYHLIHDSYRRCM-LADDFLVMFHRNFMEKSPQIPKFFAD--H-----TLQQQHRILAKSVARLVSFVDGKPQaeqdMRDTMRILHDGNLRLTPEHYAFWATALMETICTI-DEACNDEVAVAWEQTISYGTGVLK----
>SRR5690349_6204932 
-ILTDEHRHFIRTSWEKINKRHekTTLGILMFEKVFAFLPDLRNVFGLNDSSvsetDRNENFRRHTSLVVNLIDLIIRNIFEMEAemgpVLLMYGRRHFLKHDLVFQE------NQLVAFAQGLCEFfeeevdhdddnsLASETKAAWNIF-------------
>ERR550537_1224553
----------------------NVVGRVVFMNIFKAAPEAKALFPGAREEnmwGPGSKMEQHVIKVVQTLAVAIGGLKDLGPIVPVLEvGLgvgIL-RNRHILSTIHLFRTFWllcIPMIQRIVGHPsscQTQRWSSRCRVVLI-----------
>tr|B5DW13|B5DW13_DROPS Uncharacterized protein OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN=Dpse\GA26483 PE=3 SV=1
-GFTLCEKVALRQAWNLIRPRERRFGQDVFYTFLNEWYWSISKFKKG-EDINIALLHAHALTFIRFVGALINESDPI-MFQVMINENnqtHS-RCRVGADYIAMLGQALTDYILKVLDKVRSPSLEQGLQRIVEKF---------
>tr|A0A1I8CTR5|A0A1I8CTR5_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=3 SV=1
-KMTASQKSVLISSWKFIKPNANFIMRKIFTELESVSPKVKQIFAKAailDCfskesSDaKACTVDEHVRLLSRFIDDVISNIDKEKEVrniLRKVGQSHAGlsnGSLFTSSLWEFLGEIAVAKICQVDYVQKSREAAKAWRLLIAFMTDELRNAF-
>SRR5258705_2725614 
-----SSFPPGPGELRNCCAHRRRRRRALLPAPLRARPVARAHVLR-R-------HAL-RDHFEAALALIIRNLDEMEALAESLLESEW-----------------------------------------------------
>tr|A0A1Z5JZN5|A0A1Z5JZN5_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_19Hh029 PE=3 SV=1
--ISPDVVSAVQDSWERIKDSspawEDDFGDRFLKSIFTKAPLsYKLLFPFGTTSgpamFESEDFIEAARTASTLMDMSVSLLECeMDALFGQlleIGLEHANFPRIQTSHWSMMRDALLRTLASYssaLSEDCKdlEKVLSAWSLVFDNLSNEMVET--
>ERR1719329_2064399
------------SLFVRLGGDvaVDAAVERFYERIL-QDPLLAQIFSRVNL-------AGLKNMQRKFLTMAFGGPDLYDG--LSLRDAHQ-GKGITEAHFAAVAGHLSATLREmAVPDRQHDEVMAIAASTQGNIV--------
>tr|A0A1I8MDY2|A0A1I8MDY2_MUSDO Uncharacterized protein OS=Musca domestica OX=7370 GN=101890360 PE=4 SV=1
NGFTATEIASLRNGWRHFKRRFGYHSKQIFMKFYQEHEQMLEKFRNRMGKFNMQQLHRHPQELLQVYGNLIEqGLDNMtymHVLMTAISQRHR-MFGVTGYEIKLQTDhitlYILALLEKII----SPTFVSGLEKLSRLIN--------
>tr|A0A1Q9NTV3|A0A1Q9NTV3_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_3 OX=1841598 GN=hmp PE=4 SV=1 
--FTSKEADILTQSLKALEEKTDDLPKLFYYHFLEPtsNKEIISLFNKS-------DMTKQYMMFHQSLAIIVSSIKDSHllnQILKDLVKRHK-NYGVKYAHVQIFSSAFYKTIEEIFPKD--EKVKILWIKLINFVLSKFN----
>SRR3990167_8190046 
--MDNAQKLhIVDTILERASELAGDITDSVMAEFYRGDPEAKDLFTHHCPV---DTIRIEAGTVEQALYCFMRWFQSPGEIRILLLGSvphHVETLKVPVNYYHRFLQAMATVIRKTIPAE-SREEIDVWNEICGDLGEIVDA---
>SRR5690625_7611079 
------------------------CALCFYLCFcTDTPPTRTYILSLHDAL---PICQLEGEMVENSLYCLMSWFESPGEIEMLLAGSvphHEETLRVPPHWYEELLEATRSEEHTSELQS-RGHLVC----------CLLLE---
>tr|A0A0K6SA08|A0A0K6SA08_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_8920.t1.CR2 PE=3 SV=1
--------------------VSAAMAEKFFELVPKRAPNLRMIFEKRqDIY------KH---HFGEITKRLLAYLDSPEEVWKedpELAIKHI-EFGVMPCDVPVFANVFLQILAELAGPAWTQRHRDTWDKLFSIVSGALAE---
>SRR5690606_8675308 
-----IDRDLIEASFEHAAETLGDITPFAYQHFFARYPQAEELFLCKG-VQFKNDL--QNQMVRDAIYAFLEYLDTPDEVDIVFKytiPQHL-DLNIPMLYFNGLLEAVAEVVCGATPEAGKAATEASWKVLLESIE--------
>tr|A0A1W0WMU5|A0A1W0WMU5_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_09357 PE=4 SV=1 
-ALTHVQINLVRESWRWLNfnrPLQETAVRFFLDFYFKQNPDCLPMFGMKTVDHYNKAFSIHALTVMHAIKYAVEYIGNPEQfqrLFRTVGQTHL-RFGLTDLHVERFLEQWLAFLRANDAKVFDAATVEAWNLAGRIVVSQI-----
>ERR1719354_143580
-------------------------------------------------------------AFWDILDHICGHLDRLENLIPQLRDFalQCFNSGLFSDDYNILGECLVTILSTNFDP-WEETHSDSWAWCLDLVMSTLVT---
>SRR5215207_8455447 
-------------DFDTVV--CSSFAERFYSRLFTHEGGehLRALFPDN--------IQPQHAQFTTMLGDILAYNFRIGrsLLGD-TFRKHI-DFNIRESDVDVFRKAFVEEVGSTFLH--LG----------------------
>SRR5271170_3229012 
-----------------------------------GRECRRDNrLLLLDAPPATPLgtSqyLDARHRTVSCTSantgvctgpYQPDQLKD---------RKT--VLGGGLR-------------LAQPGSRLSQPLPGRFGESAGX----------
>SRR5216684_1000550 
-----------------------------------LHQGRHRPRVHLGL-------------------------------------------RGGSPAHPPRDPRPRHKRGAIHRA--drhVPPPrPPRQSGAQAdfSDHSRL-----
>SRR5271154_4753691 
-----------------------------------LHQCKHRC-LHWSLPARSA-qrSQDGP----RRRVtlqpPPVRNRGR---------GVSAlsllrsswpniRFYRVETVSCPRDRLCIDLDPISTVKRNLA------GVsDVYLL-RS-------
>SRR4051812_37657562 
-------------------------------------------------------------------------------------------------XMSSGVRFTRWRCESIRARLRapsdhcvTVPVkPSRRSDSAVsaRKAEQ------
>DeeseametaMP0200_FD_k123_38240_1 # 1 # 450 # -1 # ID=33738_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658
-----------------------------------PHACLSTChAANPP----VAI----RARRSSAEGYAR---------------SD--DARGGTA-------------SPPPGRELSSPASAIDPFSRGAISFVSF----
>tr|H3FA75|H3FA75_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=WBGene00108645 PE=4 SV=1
-GLTAYQQKLLIQCWPNIYSTGpgGQFASAIYNRLQNSCPKAKQLLAKANGVavFANSDvdcTAMHSRVTIELLDTAIRNLDAdHAKLTAYlieVGRSHRplRQEGLAIAVWDDLADSLMECVCRYDAVKKHKELRRAWLALIAYIVDNLKNL--
>SRR5437879_6948005 
-----------------------------------------------------------------------PPSTcsWTtslsagrgvRPISSVSASPTA-STaaTLPPHLYDFWLDCLLHAAKECD-QQWSPEVAAAWRYMMGSCSSRLAT---
>tr|A0A0V1B190|A0A0V1B190_TRISP Uncharacterized protein OS=Trichinella spiralis OX=6334 GN=T01_13586 PE=3 SV=1
--LNPKEVILTRNVWAALKEKhQHLVGMEIFRQIFNRRPDLKSLFGVSALdtemALNSTRLHRHTMIFQDVIDILMVNISNVDVniadSLIDLGAQHWvlTKRGFDPAYWLIFGDVLFDLVENVTRKLpSRKRSTNAWRKTIAFMLDCMQIGY-
>SRR5437762_8994925 
------AAS----------------SDHHIPSQLAAGTRAKDRKGGVE-------YPGHVCRGQRRCARDRPHILAsPELCIPRAcrtksA--------------AFCAVCENRCCETCR-SPPAKKPETARRSAERTG---------
>ERR1719204_228700
-QLSPSTVKAVQTSWNNIRSGGpGYFGHLLFSYWLAEHPRALGVYSMYyhdDKkHrvSLLPRFHRLGEVYAKRIDYWVTNLEEPVKLFLMLyehGFNHA-KRGVNLRDFPNMTPSLMDALATALGRQMTLKLYDQWKDFWKFIFMQIAEG--
>tr|A0A2A6CS87|A0A2A6CS87_PRIPA Glb-5 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_35904 PE=4 SV=1
-----DETHLARAHWILLHKMnkQGTVIQSTFEHLMTEFKHTRPIWQFGrniDENvkdwnkelHEDFYFRHHCASVQAAITMIMENKDDIVSLTRVLnevGAHHF-FYDAYEPHLILFEDAMITAMKKVLKGveELDEETERSWRVLLQLTRKHLIEG--
>tr|A0A090LKP0|A0A090LKP0_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_2000335800 PE=4 SV=
-ELPKADKDIIISTYNILLQADPELFSKAWIMSASRSTSIRKAFSLIDPnsTHIEVDFTKFSAVIERFFTRIICEekLVNesFEKSCINLGKKHVDfvPIGFHSNYWDIFMNCMIDVIAETVIIAFNednkqqQQVQKCWNKFVGRIVFLMQSGFK
>tr|A0A2G9URY2|A0A2G9URY2_TELCI Uncharacterized protein OS=Teladorsagia circumcincta OX=45464 GN=TELCIR_05034 PE=4 SV=1
-PIANKTKKLVIQEWPRMLEHQPNLFGIVWISSATRSNSIKKTFGIGANenPEDNEAFMKIWPTVQQFFHKL---------------------------------VCMAETVDQTLCEYYTddlkrAEMILAWQRVFNTIVHHMRTGYI
>tr|A0A0M3JT43|A0A0M3JT43_ANISI Uncharacterized protein OS=Anisakis simplex OX=6269 PE=3 SV=1
-SFTTPQLTSVFNAHFSMIQLNPDVIKDCWIKTSKRSSSIKKAFGMLEHeePETNASFMNLPITIQAFFKELIFEldCDSvkIRQRCEQLGARHVDfsERGFHSNFWDIFQVCTIEVIAEC--NLGLnedqhRSYELAWIHLLSSVVKSMRNGYT
>tr|A0A077Z0R2|A0A077Z0R2_TRITR Globin OS=Trichuris trichiura OX=36087 GN=TTRE_0000042901 PE=3 SV=1
--FTAKEFAIAELTWAKLKVRfNNQVGMEIFRQIFGSCPEVKDLFGLQNKedqkALCDQRMARHTAIFQDIIELLIVDLSQRsDSLtqsLITLGAQHWffTQRGFRPEFWVIFGNTLVNLIRSLPLSlSQRYLARRTWIKLIVYLLDCVMLGY-
>tr|A0A0N5DS84|A0A0N5DS84_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
--FTPKEFAIAELTWAKLKLRfNNQVGLEIFRQIFASCSQVKGLFGLQNKedhtALGDQRMARHTAIFQDIIELLIVDLSKRsDSLtqsLITLGAQHWffNQRGFRPEYWVIFGNVLVNLIRSLPLSlSQRYLARRTWVKLIVYLLDCVLFGY-
>tr|A0A016V5D5|A0A016V5D5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0017.g3216 PE=3 SV=1
--LNRMQRRALRFTWHRLQTRnggkrVENVFEEVFDRLVRALPCVRDMFTTRMFlcamArNETASLRDHAKVTVKMFDVVLKNMDTDPskrtdtgfPLDpKIIGRAHGplRPYGLTGQYWEKLGETIIDVVLGQEAVRDLPGAGQAWVIFTACLVDQMRAGF-
>ERR1719187_3161387
-ELTDDEINEVQQSWDLLTRSeggLREAGLTLNQQLLTAQPHHIRSFEKFRkykdfdDILKSPEFKTHSYSTVREISLVITNLKHPGVFtqlTQSIGFAHR-RANTPPNQMVDFKSVFINdFIPSQMADKATPNTIKAWEKFMTVFIEHVKEGL-
>ERR1719481_246497
-ELTDDEINEEQQSWDMMTRTegg-lREAGMTLNRQLLTAQHHHIRTFEQFKkykdfdDILKSPEFKAHSYSTVREISLVITNLKHAGTFtqlTQSIGFAHR-RAKVPPNQLVDFRSVFINdFIPSQMADKATPNTIKAWDKFMTVFINHVKEGL-
>ERR1719347_979638
--VTDEEMASINELWSCLRADAMHSSRFIFARFFEAHPEFLEPMPFVkDYygniSpkyMDTQEMQDYCLKFMSTLDAVMTRVFARdkEalQVMRDIGYSHH-EFGLTSDMTVKFMNKMHDSVLELWGTEASRRDSKALDNIFKTIATEINVG--
>tr|A0A1I7TYQ0|A0A1I7TYQ0_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=3 SV=1
-GLTRDDKRIIETCWFKCSQKqLRKSSCDMFWDILHTDEDILRLFRLDHVSpnrlKDNEYFKSHASNLALVLNLVVTNLQDNfEQaqdALQALGYQHLhlIDRtHFQSMYWDIFTDCFE----RNPPPSFRkGAEREVWSRMILFIMGQMKTGYQ
>ERR1719396_104066
---------NIIESWELLRFhpsLKEDLGTAIFRELFKEHPELREHFGLPlvGLdaLCKNQTFLSLSNQFVDVFARTMDTLGPDEELmdesIRELGEKCV-SIGIETSHLSLLRKPILSAVEKILLEDFDD---ESWKKFYSILATDL-----
>ERR1719396_219220
---------NIIESWELLRFhpsLKEDLGTAIFRELFKEHPELREHFGLPlvGLdaLCKNQTFLSLSNQFVDVFARTMDTLGPDEELmdesIRELGKKCF-WKTLMMNHGKN----STPYWEQIWQREFQQ---DKRDKLYSYSNNNN-----
>SRR5215467_3799544 
--------QQVSESYWRCCT-NPLFIEELYQTLFSKCGEIKQLFEQKNV-----SMKRQYAMLRYALDIFVDYPHDMTATFPDIARKHT---GLDPRFYETFIEALIETVGKCDPK-WVPSLEHAWRERMT-----------
>tr|A0A1I7VXG1|A0A1I7VXG1_LOALO Uncharacterized protein OS=Loa loa OX=7209 GN=LOAG_10963 PE=4 SV=1
-QLSSYQIHLLQQSWQRIRS-SPNFFINVFRTVIAKNTIAKELFRKTSIIdgftsYKCYDVKEHADSLIELIDFALQEIHSSTKVVQhrcmLMGATHCNTcENSMSSSWDQFGDSLAESIAKAEAIRGKRKCLQAWNTLLSFIVDRIKGGY-
>SRR3954451_1828621 
--MDPADDALLRQTQGLLRESldfaggAVAVADRLRQALRAARPEVVAALPG--------DAATQTAKLAAGLVWLVDHLDQPPLLVGgsaRLGAALA-ACGVPPRGLQFVGAALAEALRAGSPaGEWRQEFELAWRSTWQHVYEWMQVT--
>SRR5262249_5830581 
------DVEVARDSYRRILDDVerqREFFHTFYGLFLRRCPEAAAVFEAKGYPalaqlggPRvedsAGRGPQPPNPLKSAIVMLiaFNILGEKEepTILDNLVDKHK-GFP--KRYYVAFQDALLETVVQFDDPsrcgMPPDELQHAWKQAIQPGGDYLID---
>tr|A0A2A6CAG8|A0A2A6CAG8_PRIPA Glb-32 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_40555 PE=4 SV=1
-GLTPEQKRILETSWVKATPKqIRKATEDVFASIINHDRSLAVMFRLDDVPinriRENQAFKKHAANFALVLDLVIKNIPDNvDSCcqaLQALGGQHVslRDRGFDSIYWDVFTDCFENNPPATFK---TDIDREAWSAMILFILAQMKLGFR
>tr|A0A0N4XT53|A0A0N4XT53_NIPBR Globin-like protein 26 (inferred by orthology to a C. elegans protein) OS=Nippostrongylus brasiliensis PE=3 SV=1
--ALQALKVILRTTWRHMSKSGqGNCGSTIMRRLFIRNDRVKNVFHHNIMigglLepnaQETHNLQQHYSDIVQFLQFAISNLDHPSRITekcHEIGLKHR-KYktmGMKkkidkkylqAEHWDLLGEAITETIREYQGWKRHRESLRAANILVSFLVDRIRT---
>SRR5215831_15107384 
----------------------KLFFSKFYTNLFGRADDIEDRFKELD-------MERQYRILNLAIHKLLEFRPEQPAtqkQLRDLSLRHA-KLGLTNHAPAWNR-IH-LDLRGIGA--DGRSsGVAAADKALAX----------
>ERR1719234_1549997
--------------------------------------------------slwhrssIQLEGASNHNKALMNAIDSVMvEVLERRPMSksgIRDAGISHH-KFGIKRLDMDKLTTAILAAISDVLGDCdLDRKmlQLNAWKKFLNAIGDEFSVG--
>SRR5262245_32700325 
--LNSNQRDLIRRNWDSssK---RYELCRRIYCRVFARRPEIRRIFSIGYDW----WRLEI-VTFADFVQSIVDNLDDAKRVrqsAFEFGRDHAkwRRFGFRSDFWVQLAESTTREcvyLDAAV--HPPDESLETWTKFVSIVF--------
>tr|A0A2A6C3W4|A0A2A6C3W4_PRIPA Glb-17 OS=Pristionchus pacificus GN=PRIPAC_39254 PE=3 SV=1
-ELTDEEVAAVRNVWIRAK--TEDIGKKILQTLIEKRPKFAEYFGILCqsDKldmnslKESKEFHLQAHRIQNFLDTAVGSLGYCpvtsiYDMAHRIGQIHF-YRGVnfGADNWLVFKRVTVDQVTKGVTSTqasqanllegtkepevveqhpmadvqnpFsgeNCLARLGWNKLMTVIVREMKRGF-
>tr|A0A2G5SLB2|A0A2G5SLB2_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-17 PE=4 SV=1
-EMSDEEVSAIREVWIRAK--TDNVGKKILQTLIEKRPKFAEYFGIQSESldiralNQSKEFHLQAHRIQNFLDTAVGSLGFCpissvYDMAHRIGQIHF-YRGVnfGADNWLVFKKVTVDQVTTGATDSskekdkdetnsngtangkvdteanpipvgiadinnvYsgeNCLARLGWNKLMTVIVREMKRGF-
>tr|A0A0N4ZE39|A0A0N4ZE39_PARTI Uncharacterized protein OS=Parastrongyloides trichosuri PE=3 SV=1
-DLTAEEIEAIRDIWLRAK--NESVGRKILLALIEKKPKFAEYFGIGSENvdpkelLGKREFQLQAHRIQGFLDTAVGSLGYCpmssiYDMAHRIGQIHF-YKGVnfGADNWLVFKKVTVDQVSRVNVEGkdrksnvslgkrnnsgdaedstaetprkesahsfndmYevsNCLARLGWNKFMTVIVREMKRGF-
>SRR5512138_1182700 
--------RRVQGSYSTFQAtdRADRLYRTFYANLFASVPEARRMFAHTDWS-------RQYNAINEALKLLLDFDADPQRaadAAKQIGsvaLKHQ-QYGLGERELRAFEGALLHALRSC-G-ECKPATLEDWRMILAPGFHHMRGA--
>SRR5687768_15481058 
-ELSDRTRDLLVQSLPLMEHRKDALIEGLARYLIGSTGD-----ANQ-------DSELVAIVLTELLIGQASHLVRSSALpdLDDIRLEHS-RLGVQGSHYSRFGDALTPVIRDVLGPKLPREVAGAWGDVFWTVINVI-----
>SRR5687767_13070119 
--ISDRTRDLLAQSLPLMEQRKDALIDRLGAYLGG-AGD-----ADE-------DSELVAIMLTELLISQVGNLLRSGDLqdVGDVGHEHR-MLRIQGRHYSRYGDALSPVIRGVLGPQVPGEVAGAWGDAFWAVIRAV-----
>tr|A0A0R3PFZ5|A0A0R3PFZ5_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1
-KFTQYVGNIVVLAFLNcfatitktvsdtsitvhvdqiqihcdihtsfqcsrekgtsfeqgldfdkTF---IKRLLGLFRLLCFKSALSREMFQKMSIVegfrtNQCCDLNMHAK---------------------arcmDIGGSHV---QMneecCGALWDQLGECLAEVITKVDCVRSKRECTKAWIMLISYVVGGMSLGN-
>ERR1719414_1806988
---TVAQAEKVVAQWDAAD--QDAFIVAMYQAMMKTHPEWRALFNKPTGAptPAEAEWKKQFDLTKAVLDRglrsRATDVDALKERMHAMAGRHV-NYGVTQTHFQALKPILTDVLAATVT----GADMDAWSAVTYFMLDS------
>SRR4051794_33648798 
---------------------DHGSTN-ASTRALAARPTMSAKFGRAT--------AARARHLTRAIQDLVEFREDDgASRFRlHHVPAHA-GMGITREDAEAIRREFVAEVIATFERsggNvSPQMHGDAWNAVSRRRVERCVE---
>tr|A5L2R3|A5L2R3_VIBBS Uncharacterized protein OS=Vibrionales bacterium (strain SWAT-3) GN=VSWAT3_02206 PE=4 SV=1
----------------------QAFLESFLADFCQHNPRFSERFEKVG-------LEQQTKMLKASIILIYNSAGLPsvRNSVKRLGKQHK-DLGmdISEQELNEWFKSLLNTVKKYD-PHYNDQVEQAWTETLDVGLKIMKQ---
>APIni6443716594_1056825.scaffolds.fasta_scaffold2871162_1 # 2 # 304 # 1 # ID=2871162_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.617
----------------------QEFLETFLADFCEHNPRFSERFESIG-------LEQQTKMLKASIILIYNSSGLSsvRNSVKRLGKRHK-DLGmdISEQELNEWFNSLLNTVKKYD-PHYNEQVEQAWAEMLDAGLKIMKQ---
>tr|A0A0N5DFM9|A0A0N5DFM9_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=4 SV=1 
-LLSPAQIKLIRNHWNGLYItiGPTAIGNYLFNRIVFKNPQSRKMLLSLlvDHLSPGYFSKRHARAIGVILNFVMKNLEYPENIsliLKMVGHCHAKlvTVGLDSSIWNVFAEALLECSLEWGeKSRRVDEVRKAWAIIIAFITEKLKAGFN
>tr|A0A183IST0|A0A183IST0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 
-QLNDKDITLIAESWRKIED-RSLWAQRLFAKLFVYRPQLASIMSYQDVSgkklLSNPKFQNFCQRFADFWQDVVSGLCDRgtdddwKqvvALIRELGARHSRipKITFEASIWLHMKSEIVQSIT-GFKDIYRDELCYSWNKLLMFVVTEMKDAF-
>UPI0002C4E217 status=active
-------------------------HEDFGTAFFEYCPDLKGQFPSN--------YALVTKMIQKFINNVIEG-KNLERLARHYGRTHW-RYDLEERHFLGFAEALADTINIRIGNFGTIELMKIWREEATMICKMLEDQY-
>SRR5262245_41417288 
----------------------GNLHARIYEAFFAACPEAKPLFDNTD-------LKRQYQLLHQAIVLMLAFHVSPNreepTILSRVAARHS-ELGvhIPPAWFDAFSAAIQQSLEAA-DTQFSDKTREAWAAVLADGIGYMQ----
>tr|A0A0K0EPG4|A0A0K0EPG4_STRER Uncharacterized protein OS=Strongyloides stercoralis PE=4 SV=1
-GLSFYQQKLILQCWPNIYTtgVGSNFASNIYPTLCCKNSKAKALLQQADGVavFSNSgvdCTTMHSKLTLEIMDSIIKNLDSnPQPIISYLQDTgysHKnlKIQGMNMSMWDDLGDSILEGVRKNELVRKHKELRRAWLAIIAFLIDNLKQG--
>ERR1719183_3286062
-------AISLRDSWVHIEVlkeedDSGGFGDALIFQLSVVA---QEIFGLVVT-----ERNALGKIFNRMFSTLVHAMGDPQKFTEeffVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDSMVRNF-
>tr|A0A0V1BAT0|A0A0V1BAT0_TRISP Globin-like host-protective antigen OS=Trichinella spiralis OX=6334 GN=T01_2203 PE=3 SV=1
-----------------------ENGGQLLANVFKANPELRKFYDVEDIDpddtKKSRLIQQAGGNLLNSVTFMVNNYDNERSFKQEIKEQicdLR-EKGMKLEDARKLKTGFVNYVKSKLSQPMTAKEEKEWDMFFQRFFDALK----
>tr|A0A2E3CX61|A0A2E3CX61_9GAMM Uncharacterized protein OS=Pseudomonadales bacterium GN=CMK89_07570 PE=4 SV=1
-------SDLLNLSLEQIASAIGDPTEPVFTLLYQRHPELAAF-SREDTS-------WQHYMIQEILQNLMEMAENPDTALAIIRDMtlhHQ-MIGLEADTFKGMYRTLHDVVVQHLSGPHREDMTALWEDSVQRICRSVD----
>tr|A0A2G6L250|A0A2G6L250_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium GN=CSA49_02275 PE=4 SV=1
-------TELINLSLEQTVETLGDPVEKIYERMYQRFPDLVSY-KEENED-------WENYMFEEIITNFMSFGDDPETALLTIREMvvhHE-LIGVPREAFKGMYDTLYEVITATFHGPQESEMKAVWQEIVAKIYDCIE----
>SoimicmetaTmtLAA_FD_contig_31_10253239_length_247_multi_1_in_0_out_0_1 # 3 # 245 # -1 # ID=589621_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.671
-GLSEYERGLVVNSWKALTKPdfspldGTSSLSNFYDAVWTKWlkidEFANKMFRSR-------GFKGRVQHLLRIMGVIIKCAEDPLRGLeqlRSIGVQHC-IWGINSQSFASLALSIIHGLDQANGKEINAELKELWLAL-------------
>14BtaG_2_1085337.scaffolds.fasta_scaffold158720_1 # 2 # 106 # 1 # ID=158720_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.467
-GLNDAEIESIKASWKTITNTastngGDTMIVKFYDTVWNRWtkldEVANQMFQSR-------GFKGRAQHLMRIIAILIKFLDDPS-TLtqiKNLGVQHC-VWKINTESFSALAV--------------------------------------
>tr|A0A2T7PRA6|A0A2T7PRA6_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_02930 PE=4 SV=1 
---EPHDKTIVAESWKLLRSIFPDLIESAFVEMCRRVPRLKLQFGNVDVDDDEerhMNFLKHVWDVSFFFDQLLLYLPfksKLEECSFHIGLVHA-SVEVPAWYVDLFLVEFIRAAQETVQLEWTPAMENAWAVFLRYLCYYMKDA--
>tr|A0A183IYP9|A0A183IYP9_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1 
-----------------------------TLGLFTSSPEIRSLFPTLvDWgddIKTCQKFRNQGLKFVHVISLSLTTLHDKehlDTLLKEIGTRHVEfmPGGIKMEYWDIFEKAMVKCILQQI--RWTDDfdeaiqskAAIAWRILCAYIVQKIKIGF-
>ERR1719183_316154
---EPEVSAATKRGWRAWVAdmfaRGIPAGEALYQTIMDDAPSLKHLFTKP--------KPVQAMRFRTVLSSLVQTCDDPERLRvqtETLGYQHL-NLEITVDRAELFRDTIYDFIQMDFGNR-------------------------
>SRR5947209_7523480 
------------------------IAKAFVDQLAHVFPPICAMLPMAT--------KTARYQTACAIAAACKHAHDLGAIAPMIAATgadLS-RHGFTAEHLPAARAAFLNALRKCAGEDWTTVVEKDWNEVISEFAGH------
>ERR1051326_6499376 
------------------------IAKAFVDQLAHVFPPVKGMLPMAT--------KTARYQTACAIAAVCKHASNLNDIAPMIAATgadLS-RRGFTSEHLPAARAAFLNALRKCAGEDWTSVVDTDWNAVISEFAGH------
>tr|A0A1A0K7B8|A0A1A0K7B8_9CORY Uncharacterized protein OS=Corynebacterium sp. EPI-003-04-2554_SCH2473622 GN=A5774_01015 PE=4 SV=1
---------DLASLATHLRAHPATFRDAVHRHFFAALPDARQSFPMD--------ASQAHRGLAESFAAAFDAP-DLDEYFADLGRSHR-RHGFPPDTYPIFATATRQALAEID---LADNVLQQAGALVDDIVAFMSTA--
>tr|A0A127NUX4|A0A127NUX4_9CORY Oxidoreductase FAD-binding domain protein OS=Corynebacterium simulans GN=WM42_1693 PE=4 SV=1
----------MKELGEHIRRHADDYRDAVHQHFFATVAESRQIFALS--------MRDTHPALAPAVAWILDAADdagflpeETIERVRELGKEHR-RHGFPTEIYPKFEASLNEGFIALG---LTQHQLVVAKRAVHTVCTTMAQA--
>tr|A0A0F6QY96|A0A0F6QY96_9CORY Oxidoreductase FAD-binding domain OS=Corynebacterium camporealensis GN=UL81_10405 PE=4 SV=1
----------MKELADHLRRHANEYRDAVHQHFFNTVLESRQIFSLQ--------MRHTHVELAPALAWAFDRAQrdgtltpELEEQLTQLGRDHR-RHGFPPEIYTDFANSLIAGFDALG---LTPYQRQVASHAVTEIANVMANA--
>tr|U3GX34|U3GX34_9CORY Uncharacterized protein OS=Corynebacterium argentoratense DSM 44202 GN=CARG_08960 PE=4 SV=1
---------TLADTLRAEPKRLSHFGDLAHSALLRRAP---GLISFF--------GPNPHTELTTAVLFILTHSTpgpqdsgtqtPLspridaagAGALRALATEHV-AYMPPdPALYLAAADALCEALRDSCA-DQPFQQVLAAEKALREACSLMATH--
>tr|T1FHE7|T1FHE7_HELRO Uncharacterized protein OS=Helobdella robusta OX=6412 GN=20208246 PE=3 SV=1 
------------------------------GTLLQSNPLVKNTFEKFRQmDpmsdfTDSSVFSTHAMVVMSAFEDIFDNLDDSEIVKDILEQgkSHG-KFseDFAPETFWAIEEPFMSSMKDILGRKMSSQLEKIYKKTIKFILSVLIKGLR
>tr|A0A0N4WD13|A0A0N4WD13_HAEPC Uncharacterized protein OS=Haemonchus placei PE=3 SV=1
-CLTPAQILLIRRTWTHARNQGaLEPAISIFREFWKNLNFLQ-FQKLKKSRKCSESFQRHAQIFTTIMDELIANLDNPTATSPSLREsgeKHVFqtrdQYGCpfRATLLDQFASAMIErTLEWGEKKDRTEVTQTGWTKIVLFVVEQIKEGFH
>tr|A0A2R8AKY2|A0A2R8AKY2_9RHOB Uncharacterized protein OS=Aliiroseovarius pelagivivens OX=1639690 GN=ALP8811_01706 PE=4 SV=1 
------------HSLDLLVGQEDAFAHAFFPLLFARAPELRVLFGDNiDD------PTQQVRVLYRMMMAFA---GNDVTLIaglRLIGFRLA-MRGLGADQAELMANTLIGTLKRQLGNSWQSDFAFAWRIE-------------
>tr|E1NZ07|E1NZ07_CAEEL GLoBin related OS=Caenorhabditis elegans OX=6239 GN=glb-29 PE=4 SV=1
-NLSVKQKKLLRQSFNAMNSGGtfLKLMEKIFRRLETKCPDMRSIFLTTAFvnslSreRQTPplvkTEYDHCKCMVGIFERLIENLENINEQLTMirhYGEKHAQmaESGFTGAMIEQFGEISVFVIGSQDVVKFNHETVKAWRLLLACVTDEMKVGFD
>tr|A0A0C2G6K1|A0A0C2G6K1_9BILA Globin OS=Ancylostoma duodenale GN=ANCDUO_17195 PE=4 SV=1
--LSYKHRKLLRATFQQMNSSGafLKLMEQVFRRLEAKYPDIRSIFLTTAFvnslSreRSSPPlvrtEHDHCKCLVALFEKIMDNLSDDTQLmvIRQYGEKHAQmkESGMSGGMIESFGEIAVAVIASQYSYWIQKPVDDVTrrkgrDEGLVYLNDYEYIIL-
>tr|A0A0G4HY87|A0A0G4HY87_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_33490 PE=3 SV=1 
----SNRIHLLQSSLAACLKMstkEEFVGRLMYDTLMRTLPEPGIIAKRGR--------TMMSRAFNDTVAALVAFVSEPshmETYMDWLALRHV-HYKIDTTLFPQFRQAMLVSLEQVMADQWNAEIERAWSEAYEMTSQAL-----
>SRR5262245_61346593 
----DCLRRGLESDFKALV--DESFAASFYKRLFQSRPLLEGRFHN---------LQTQERMLAENLRDLVEFH--PEESagrFLDHVNRHK-PRGITAEDILAFRAAFVAEIVQQGskllAQKIPpGARADAWNA--------------
>SRR4051794_17889687 
----DSLRDAIIDSFSLVS--DERFGLRFYESLQS--HHVGGRFKD---------INEQHRKFIKELRSFVDSE--PPAGlaLRIIAGRHR-PYKLS-----------------------------------------------
>tr|A0A0K0FHQ3|A0A0K0FHQ3_9BILA Uncharacterized protein OS=Strongyloides venezuelensis OX=75913 PE=4 SV=1
-NLTASQIMSIKRSWKHINTKGlFNVLRRCYQRCECCSLAVSMIFSAEQMKkqqhAYSCGVSEHSKYFISLLDRIIDNEPNIEQELRNVGKEHVKlyeEYKLGTADIERLGEIIADVFLKLDGIRQNKETSKSWRILIASIIDEVSVGY-
>SRR5699024_10156350 
----------------------PRFPALFARALRAADPDFRGMFPRD--------PAPVLAEFVRAMTFVLETTeaaaAATartDevvELARPLGADHR-ERDLPPSNRVPTGDARAATLPPLAGSGWTEAPETTLSTAYRVVSTALQ----
>tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 GN=HMPREF3121_11375 PE=4 SV=1
----------------------PTIGPEAFRRLLDAEPRFRHMFGGS--------KTALRDQFMSALSTALVTRadvgRFPaa-tiRRLEQLARENR-KFGVAPRDYATLAEHLLDVFGERLPAGPDSGAQVDALREILDEA-MSL----
>tr|A0A0C2M2P6|A0A0C2M2P6_THEKT Uncharacterized protein OS=Thelohanellus kitauei OX=669202 GN=RF11_12769 PE=3 SV=1 
--LTLEERLKLKESWIKIYQKIqdlpdVDITFEIFVRLMERRPEMSKNFEKD-VY-KYSRMKSHSDKMLVILNNMIRNLDDEQKMLKYLSgmvRRHR-NYGIRQGDCKMWEEIFLDIISRY-----------------------------
>tr|Q5D2M7|Q5D2M7_9TREM Myoglobin 1 OS=Paragonimus westermani OX=34504 GN=myo1 PE=2 SV=1
-PLTQAEVDGVVSELNPFLasdAKKVELGLGAYKALLTAKPEYIQLFSKLHgLTidnvFQSEGIKYYARTLVEDLVKMLTAAAKDDELQKVlvhSGHQHT-TRKVTKQQFLSGEPIFIDFFNKTLSK---PENKAAMEKFLKHAFP-------
>tr|A0A1S8X4B3|A0A1S8X4B3_9TREM Globin OS=Opisthorchis viverrini OX=6198 GN=X801_02811 PE=3 SV=1
-PLTQSQIAGIHKELLPILsndEAKTSFGVGAYKAFLGAHPEYIQYFSKLNgLTidnvFESEGIKYYGRTLVDEIVKMLTAGADDEKLKQVlhdSGKAHT-ARNIDNATFMvsklfmflkrvsemrlarglygpfpifaqSGLPVFVDYFNKSLTV---PENQTAMEAFLNHVFP-------
>tr|A0A1I8C1X6|A0A1I8C1X6_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=3 SV=1 
-DLSPHQIGLIKRAWKNLLKSvnENEIAIKLLLRIFQLDPRNLAYFSLNEYspfdeylIKENNIFINHVKTFESTLINVMTHPGNATKLskhLQQLGGRHV-NYtGVTykCSYWKCFIQSLIDVLTLNKDKNTSEDLHEAILILGEFCVEQMKIGYK
>ERR550539_1089662
---------AATASWNNIDD-KPAFGKAFFKNWLSSNPAIEEEFAKSSFK------QGPAQFLVERFDILLGVIEDEDSLAEELYqvaKTHK-KVGVDQSDLYSFQASFMKTLPSFD-ADFSAETGNAWAYVLSHVI--------
>ERR1719210_3079978
---------QPKRVGRTLT--KQLSEKLFFQNWLDSEPDVAEIFKKSSFP------QGPAQFLVERFDILLDVIDDEVALSKELYvvaKTHM-DRGVSPDDLVTFQDSFLKTLPSFD-SEWTRDRSESWAYVLSHVI--------
>tr|A0A1G0FYS6|A0A1G0FYS6_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium RBG_16_51_14 OX=1798265 GN=A2W28_07810 PE=4 SV=1
---------LFNNSFQRAIiPDSNSFYKRFYEIFVGSDPRIAELFEKTF-------MNLQREMLKQSMTYMMSFSatLEPSDEMKELAEMHGRgKLNIPANLYEIWLESMIKTVEEFD-PKFDENIEIAWRVMMAPGVAYMQS---
>SRR3989338_9975634 
-TIDHRSVQLIKQSAGAIKGQAQAINRLVYEQLRRDHPAAYSLLQQAGL-------P----PLASIVANYAAGIDNLEVFLghaPKIALTHQ-RIDLQEVHFESVASSLFLAFRQALDPDaLSDEALLAWRRAYDH----------
>tr|A0A2A6RLC4|A0A2A6RLC4_9CHLR Globin-coupled histidine kinase OS=Chloroflexi bacterium Kir15-3F OX=2024553 GN=CJ255_07345 PE=4 SV=1
MGLRAEDGATLKALAPKAEAYGPTLTKTFYDRLFA-HANTAEYLQGVD-------MQRLHSMVQTWFMGMFAGVYDRDYArqRLHIGEVHV-KVGLPVRYPLAMIDVVMSFGDQIANESSePAVALAAFQKVLSLDIAIFNQAY-
>tr|A0A085M5J8|A0A085M5J8_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_06691 PE=3 SV=1 
-CLTKRQRRCILKSWRKVQ-NKAQLGEEIYIQIFMQKPVLKSLFPFRATPvnelHDNVLFTRQAVIFIDFIDNVVAYVGINngrllQELCTRVGISHAlmTRVNFDPEWWYLFANSVLDGMQKFCLPNFSCepiatyigsQSMLAWRILLKHVVEMMSDAF-
>SRR5215470_9720857 
---------EAKRSYRQFAR-DISFYRELSKRLFRKIPGIEKKFRHR-------TMEEQYKVLRDSLWLLLSYASAPDqqepTILSRIAHTYA---RFPKEWFDTFREVILDVVAQRDP-----SSVRAWKHAMAPGLE-------
>SRR4051812_31756681 
---APSVMRLLASCTADLGPQQPELAEALYQRLLELLPEVatlAE------------RGRPLSDRILHAVLYPTEPGrt--PLNVatvVQQVGAQNY-LDGLVGEHYSSVTHAVLHAAREMYRGEWSSALSSAWVEYLLWLRGHLLAG--
>ETNmetMinimDraft_22_1059887.scaffolds.fasta_scaffold1682169_1 # 3 # 206 # -1 # ID=1682169_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.363
---------TVFSQWRRMK--IEDFGECMYRSL-VQDASLEKLFRRE-------RMRTQSLLFAAFIQVALCWLEERDfrkveRDMISLGLRHR-SYGIQPSYVCVFQIALLQTLCQNLNG-LSLQAEISWSVVWSH----------
>tr|A0A0A9Z6R2|A0A0A9Z6R2_LYGHE Neuroglobin OS=Lygus hesperus OX=30085 GN=NGB PE=4 SV=1 
--LEEDEIERIKKSWVLVKENDFRFIDILRQEMLCDIMMYELYFNPGrkaDVcVSELTEFKNHPKNVYSTLDFIVGDLENENVIIEkmiEIGKNHG-RLGISRKHISFMTSTIYQAVECTIGPCmFDRLVDQSWEKFLTSFN--------
>tr|A0A0N5AZ47|A0A0N5AZ47_9BILA Uncharacterized protein OS=Syphacia muris OX=451379 PE=4 SV=1
-PISYKNRQLVQSCFRNP---HELLGKRILKKTRDKKPDFDLFLSKLDGK----QRDELEESIKVLLKKVVANIDFIDEVqrlGEEFGANHVqfRKEGFKPEFFGIYADAAVTEctfLDSA--VHPPHQTLDAFSSFISWIFSFVRDGYY
>tr|A0A158N7T9|A0A158N7T9_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=3 SV=1
-GLTAQQKAILATMWRQLPRGvIFDLGKRVFEIIFERDPKLLMIINLEHLQntnqwQEHVNFRMHAQRFTHALSQSMRNLTEPIIAADRLqefGASYVNqenitygslNVVIPHSYWDRLSAAITTTAQEFLNKqqlktskqtltvdnvlllenerrnsrnlfSQVSANINAWSILAQFIANQIRFGYE
>tr|A0A1I7RRX1|A0A1I7RRX1_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1
-GLTSTQKKLVQAKWMEMDGVgILDMGRNVFETLFRREPACLKAIGLGHLThgrnlewRYHVNYRQHVKRFCEAFNEVIRSFEHPRTSIDQLqelGALHANtylkaseERKVPSNYWDGLVFAINYAAKDLQVEsssrgsespsnvifdrrfllpsddlgsstppsptqfsslcvtpqrrsgSVCPRVAEAWNLLAIYAVSQMKFGYE
>tr|A0A0B2V954|A0A0B2V954_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_09629 PE=3 SV=1
-GLSMHQKMIVTAKWRQLPQGfVFDLGKRIFETVFERDPYLLSIISLEHLQgsdewRDHANFHLHAQRFSHVLSQCMRHLSEPIVAADRLqefGAAYAEvedsenfvRSRIPHSYWDRLITAITSTAKELHEDqpqqvrknslsvddallakkdrlalETDSTNACAWNALATFVSNQIRFGYE
>tr|A0A1I7RN92|A0A1I7RN92_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1
-GLTDDQCEQLATAFSNIPDKYYAFEQMFLNLFMKEDPQLAVVFGFEGIRpeelRRMSPFRTHVCKFQRFMTTVLDMLPKknrEEELiqiIRMVGRQHCNvkLLSFTAQKWLSFKNGMLNALAKG---GESHKYYSSWNILISFMISEMKDAY-
>tr|A0A183BTK8|A0A183BTK8_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=4 SV=1
-QLDDTECEQLSTVFAAMPDKYHLFEACLRPMPMPeVDPQIALTFGMANIAeielRRKTPFRYSV--------------QKrgrEEELvqiIRMVGRQHCQvkQLSFTAARWLSFKSALTWTFSRG---EQKDKLHVQWSLLISFLICEIKDAY-
>tr|A0A1I7ZF06|A0A1I7ZF06_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=4 SV=1
-QLDEEQIDTIVDAFAKVSDKYGAFERVFVQLFVYEDKEIAEQFGLASVPeeviKRNQVFRTHVGKFQRFMTTVVELLPKvgrEDELieiLRIVGRQHCNvkQMNFTAAKWLSFKNVLLSVLCKN---DHHDKVYMCWNQLLSFLIYEIKDAY-
>tr|A0A2V7AV10|A0A2V7AV10_9BACT Uncharacterized protein OS=Candidatus Rokubacteria bacterium OX=2053607 GN=DMD92_03445 PE=4 SV=1
-GLGEADVAVIRRTAPIVLTCEAAVTDALYAHFL-QFPATAQFFLGEDGEPDAARLARRKHTLGRWLRETAAVATTHEFSyyLLAVGLShsHRAhgPGGAVPPHFVVGamslaQTALARLFGAELGDpQAALEASLAWNKLLHVHLAVLLLGY-
>tr|A0A2E9LM24|A0A2E9LM24_9CHLR Uncharacterized protein OS=Dehalococcoidia bacterium OX=2026734 GN=CL902_07715 PE=4 SV=1
-GLGQNELDIIESTRELVLSKGEEITAEVYDHFL-RFQETRRFFLNEEKAVDDDRLERRKHSLLRWLRGSLDFKIDEDYPvrLLATGIVhsHPPshraHMGSVPGRFMIGsmsylQTLLAEIFHSEIEDrEEAHRASVAWNKMLMVQLDILQAGY-
>tr|W4MD58|W4MD58_9BACT Uncharacterized protein OS=Candidatus Entotheonella gemina OX=1429439 GN=ETSY2_07185 PE=4 SV=1
-GLSDDERQLIKDSGPIVLGHVRKLTEGIYDQLL-AYPESAQFFTTENGQRDEKRIEDNIQTMISWFRAAVTAPTNQGFIryLVGISQMhaNIPvhrsNNTPVAPRYVIGtisyyQTNLDDILHQHMADpDLARRTCVAWNKWLLVILELMLANY-
>tr|A0A0N5DD39|A0A0N5DD39_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1 
-NLTPHQKQLLVQSWPQVQLYnRIHGGDAMFARFCEKNSIARETFQKIAVvqSfasneASESVLKKHEQYLVQLLSEAVENLNNDcEPLLREcldYGAQHV-TLheLLNETVWEQLAEAIIDRIHKVNLVRRHKDLSKAWTMLIILLIDKIREGY-
>tr|W8BTT7|W8BTT7_CERCA Uncharacterized protein OS=Ceratitis capitata PE=2 SV=1
-GLTITERRSLQNGWSIIKQKQRRAALTIYVNLFTEHENLYEVFRSDGVL-NIEFASQHQKEVLTVFQMIIEQVDNARfvkTMLKELALRHE-AASVTNTQWQLYtnevRKYFLETLADAIS----PTFVHALDKLMNFVCN-------
>tr|A0A0A1X397|A0A0A1X397_ZEUCU Globin, monomeric component M-IV OS=Zeugodacus cucurbitae GN=GLB4_1 PE=3 SV=1
-GLTSTERKSLQNGWTIIKQKQRRAALNIYVNFFTGHEDLYEIFRFNGTL-DIGFASQHQKDVLTVFQMIFEQLDNARfvkTMMKELALRHQ-ASAVTNTMWQLYanevKHYFLKTLNDALS----PTFVHALETLINYICD-------
>SRR5438046_775397 
--VSRETTALARASFERCSA-NGEVPQAFYRNFFARCPPAPALFAPGL-------AAGLAArLLSApaaaeqIFLFTLVAGGTPRTrl-LPP----MSrGX---------------------------------------------------
>AACY02.8.fsa_nt_gi|132068355|gb|AACY021643300.1|_1 # 2 # 748 # -1 # ID=15695_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.288
---------------------------------------------------------------LKG--------SFHFHLlgeLENLDFEFK-FLASWFSEVDIFRDALIDLFEMEMNDqSLTPQGRHVMALLINYVG--------
>ERR1719424_2066333
-------------------------------------P-------------------------------KASTWLRPCTVhllVQSTRQQHL-VSAI-------------SCTTSRRV---------------------------
>tr|A0A0G4HHE4|A0A0G4HHE4_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6802 PE=3 SV=1 
--------------------------DALLGILFEASPTMRSVFVKNGD--------LYADLIEHLLRRIIAYADDPGALWTddqHLALDHI-NFGMSMSDLPLFGASLMNCLAGVLGENWCDEWQRAWEKAWQICCQSL-----
>tr|A0A2C9LD65|A0A2C9LD65_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106067556 PE=3 SV=1 
--LSHKDKLFILNSWLNFRNgkREEDIGMEAALEMYSIYPEIKDIFTIYrDARmkhlTDKEMIRTHSQQVASVVDKCVMRMDDAHAFAMiavDEGSVHI---KIQERFMRCYVDCYIREIKKYSKLKWSRANQMAWEVFFDTIVVNMKNGW-
>SRR4030095_5973293 
-----DHFEIAKDSYARCISggdSGNSFFKTFYHELTRISPEAAVKFKGKgiGET----ETNRQYGILREAIFILLMFGENklgenEPNILSRIAEMHNKnHYNISPESYKSFVSALTATICGSAPDipePFDPqckisvneknLIKIAWQKALKPGIDYMIMRYP
>SRR6478736_3613867 
-----DSFEIAKDSYNRCISgedSGDIFFKTFYNRLVKKLPKD-vaAQLKGKgiGRS----KGHRQYAILREAVFILLQFGQNrlgenEPNILSRIAQMHNKaNYNISPQLYTVFVDALIDTISGLPPDipkPFDSqcsisvyereIIRNAWSEALSPGITYMKDKYX
>SRR5262245_45185474 
----------------------PTFLEAFYKLFTA-DEVVGKRF--VkfDDI----EWKRQHGLLQQALDACFDFASLlsmqnlrelpEPNAMTKYVVRHGPgrgNLGITSTEYDAFVEALITTVCGNPGNgqaPYDPecadaerkdVIEFAWRRLMKLIVEHFKKVAR
>ERR1712142_1087278
-ALTETEVKVIIDSWDRIHPDK--GAKMLFHQFLTDFPLMKIYFGYQETesvaeIMESEQIKTRCKVVWDVLTKIVHASGDGGKLaelVKEVSVKHL-NFNREKKDIHCFLHALKVTLTC-----FSGHLFRPWNIWCKMVEDLF-----
>ERR1719263_534529
---------------------KRTYGLNAFNRFFAKQKKAEDHFNTSN--------ARLSVLAMQGLNLCQDIYKEPTRLvnvVTSLGLKHI-MYNISTEYFDAFVEAMCEELSDWHPGN--QAAVEGVEWALTQIAAIMI----
>ERR1719446_598571
---------------------KKAYGLNAFNRFFCKAATIGNSFQHIQ--------CASVCSgnarSPAVSGYLQGAYTLGECGhltWPQTHHVQH-FYRLLX----------------------------------------------
>ERR1719446_1691251
---------------------KKAYGLNAFNRFFAKQKKAEDHFNTSN--------ARLSVLAMQGLQLCQDIYKEPTRLvnvVTSLGLKHI-MFNISTEYFDAFVEAQCEELAEWHPGN--QSAIEGVEWALTQIAAIMI----
>tr|A0A182EAA6|A0A182EAA6_ONCOC Uncharacterized protein OS=Onchocerca ochengi PE=4 SV=1
------------------mgsgssvpnhgqprnvaggggndgggggnagvengdqqkvdprlpypnfrelftlknywktvRRNERDCAKMMLAKNYLKNYGYSLGII-------------------------------------------------------------------------------------------------
>tr|S9VAV3|S9VAV3_9TRYP Uncharacterized protein OS=Angomonas deanei GN=AGDE_12480 PE=4 SV=1
-------------AWSHLLtsPNGGEFCSTLYEKLCQNLTYIPDYIRNLK------DEERVIDHYINVITKTLELYENPHVMIdelPKIAARHR-GFGVSSDAFFVMRNIFMELLPEYMDPKVYEQSKKDWLKFWRLVLDLMVSG--
>tr|A0A2K6VLK5|A0A2K6VLK5_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1
-NLTTTQLLLVRKTWNHAKNQGaLEPALGIFRNSFYKCGEIRSLIMGGPKNVGYERLKKHAKSFTNIMDSLITGLDAKESVIEELRKagrAHATllrdtsnkfgnksntqliGCPFRLAHFDHFASAMIERtLEWGEKKDRNKTTQTGWTKIVLFIVEQLREGYQ
>SRR4051812_9951159 
-PLPPEVAQTIRSSCRPLLERQEQFHGDFHASLVDLMPEVPMMREPA--------GEQVSRWLVECVLWAVNADEPVPMIGATLqgvGLDAH-RLGFPRAGYQAVGHALLRTVRGASQNDWSGTLSSSWIGYHSWLCEYWVS---
>SRR5690242_179091 
-PLPPEVAQVIRSSCRPLLERQEQFHGEFHASMVDLMPEVPMMREPA--------GEQVSRWLVECVLWAVNADEPLPMIGATLqgvGLDAH-RLGFPRSGYQAVGHALLRTVRGAYQSDWSGTLSSSWIGYHTWLCEYWVS---
>ERR1711972_144950
--------SQVLQSWEQVKLLgLESVGEMLRANTFELDPQVVALFRIPGVVSTGEGMLqrmalrRLFSKVLRFVGSVVAG----------------------RYDYQRLVETLsrLGATRAAGGATEVHFKI-------------------
>tr|A0A1I8CIB1|A0A1I8CIB1_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=4 SV=1
---------------------------LLLVRTFELDPKQKHNFNLDKVDiedlRIHPIFVDYVKSFQPLLLNVFKYTNRATIMskyLQQMGGKLMRytKVSYKSSYWKVFEQALIDVVS---GGNAGDETIEALTILANFCSEQMRIGFR
>tr|K7H1D4|K7H1D4_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1
----MDGEYLLFANCPAPGIgDGNDFLYHNGVGLESNCPIVSQCFQSATYSlstnpNQVRTVADHAKYLLQLLDKIIEGDVDAEY-LREIGANHVslkHENGFSNTEWDRFQEIMVEVILKQDGVKQSKETSRAWRLLICSFIELIRDGF-
>tr|A0A0D8XGR1|A0A0D8XGR1_DICVI Uncharacterized protein OS=Dictyocaulus viviparus OX=29172 GN=DICVIV_11062 PE=4 SV=1
---------RIQHCFKAA---RPTIGEAILKRASNNRCEMRILMSRLTD----QQIELMGKQFYMLIAYSVENIERVEMIQQharTLGETyaaLC-RLGFRPDYFTSLADAAIAECVKLDGGtHKstyffnRCETLLAWSQLIGTIFTSVRDGY-
>tr|A0A0N4UGY4|A0A0N4UGY4_DRAME Uncharacterized protein OS=Dracunculus medinensis OX=318479 PE=4 SV=1
MRLSDKQKLWIKLGYKKWRSKsKMVPGEWVHAYAIKKYPTMKALFKKHEN-----LARVYTQTITKIIEMAVESVDSLDDsLGPLLisyasengilEERgmasiftirndkllLF-LEGFDRRFWGYVAEALCALSRDFPLKRHKWDTISAWRIIVLFIVKKLEYGF-
>SRR2546421_6426420 
-------------------------------XMIRRPPRstlfPYTTLFRSD-------FERQNKLLRHAFGLLLIFPNQArtePSVLTRVAERHSRrDLDIPRSEEHTSElqsRSDLVCRLLLEKKK-KNQV--------------------
>LakMenE18May11ns_1017337.scaffolds.fasta_scaffold18991_1 # 3 # 107 # -1 # ID=18991_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.400
--------SLALASYNRCRDCHQEFIREFYDAFIEGLPEPYkEHFQNR---------QRQNTMLDSAIYLLFD-LEAPEnqKLLRSIftGSKTAGkpnpHPAYPIEWYERFLDTLVGQVSHMDRKNWNAEVEASWRNLRENALHLIR----
>ERR1719262_958340
------QKEILDICYAKMTGelDLPAMVTMFQGIFFSRDLRIQSYFSKPNG--------TLRYIVLRIIEFLCNVFHKPAAItkeLRTLGVSHV-KWEIPPDLFVPLGEALF-----------------------------------
>SRR5512142_1307926 
-GLTESDIETIKQSKPIIEKHIPEIVTKFYAHLLR-YPPTRRVFLKKDGSVDQPYVELRMRHLTNFWLRTATGVyDdDYARYIDYVGRAHT-SHGADPHIYIAeryvigqvgfVQHAITDALSRELRhtdEEFEVRAVEAWDKLMMVLLEMLSRAY-
>ETNmetMinimDraft_9_1059917.scaffolds.fasta_scaffold1595668_1 # 1 # 216 # 1 # ID=1595668_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.366
-GFTRADAEIIAQACPIIEEHLPNIVADFYDQLLR-YPPTRKVLLKPDGTIDQEHVEKRMLFQINFWLRSASGVyDdDYASYIDYVGRAHT-SHGADLNIYIAeryvigmvgfMQRAIDQALDSELHdadHTMEDRAEAAWGRLLMVILEMLSRAY-
>ERR1712137_619303
--LPRESITVIRDTWAMVER-NVDIAPKMLLKMFQLYPVTQNLIPLLrGVSledmPTNKRFLQLAYGSQFAMSAIVDKLHRPDMLEEIIGGGmHAFVDGLSTSFQmAATTALFNKIMTEELGSAYTAEAQEAFIATGDMMTSIMVK---
>SRR5580704_4499342 
------------------------TLGDFYRRLLQHHPQLAAYFEGVN-------IDFQVQKLVVVLSTIARDLPDRSVLdrvLFHQGVAHV-ERGIGRGEFNEFIALLANVVSCKTTLVGAAESYAVWYQELSAVATSML----
>ETNmetMinimDraft_24_1059892.scaffolds.fasta_scaffold323471_1 # 1 # 354 # -1 # ID=323471_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.472
---------------------RKKVCTDLYFRLFDVVPASQDYFKQSNT-----RLHFIAELV---INMTLDMYQKPTKMMsqiSALGLRHV-ALNVPTDIFPAFIDVYITVVKEYTN---------------------------
>tr|A0A1Q9F3K1|A0A1Q9F3K1_SYMMI Copper-exporting P-type ATPase A OS=Symbiodinium microadriaticum GN=copA PE=4 SV=1
--LDEFTIKEVQNGWATTEKrlgGPKAAGEHVFGKLKKEVPRTEGMLKRSS--------TV------WHLFTElLQAIDQPKLVqkrLEYIALRHM-NADITTADIEVFRNILFEVCASKLGGLmtpefqYQAQYSFGMGQIIVAVGTS------
>tr|A0A183BUR6|A0A183BUR6_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1 
-GLSAHQIQILQKIWERSPESeISDCARNIMSHLLRSNAQMYQFFDLLGHsdreIANSPIFARQSANFAVLLDFVLANLLEeVQKVclaLQHLGAQHARlRWPIETHHWALFCRCFEDNPPKEV--FLNAEGHDLWKTMINFIIVQMRVGYD
>SRR5690348_61285 
----------------------------------------------------------------RATHWLLDHFDHPGEIVSVLVRYvpalDA-LTGPHSRQLELFGEQITQQVDDEA----------------------------
>JI7StandDraft_1071085.scaffolds.fasta_scaffold2802978_1 # 2 # 235 # -1 # ID=2802978_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.607
--LTPTTIRLLLATSDIVG--SKETADKFYNRLFLHSPELKELFVGGETTtTTSMGIGDQALKFSQMMQWTTRALQQmhlqqkqkqqpsrssggggggdacsngtaPTrrstsAVfrsMTNLGRRHV-RYGVQLKHFHPVKQALLDTIAEL-----------------------------
>ERR1740139_220892
------TRAALLKSWEMVQEAGTVPAAnLLMKHLRERDAEALRVNTSHARPktgeTEEDAVRKLAVRTVQILGSAATGMSDTVSLVQHLHKVgagFA-GTGIKEGYFAMVRDASPFALRELLGDRFTADIASACRITGPFLASLIIAGLR
>ERR1740139_941170
------TRAALLKSWEMVQEAGTISAAnLLMKHMREKDAEALRLNTSQARPktgeTEEDAVRKLAVRTVQILGSAATGMSDTVSLVQHLHKVgagFA-GTGIKEGYFAMVRDASPFVLRELLGDRFTADIESACRITGPFLASLIIAGFR
>tr|E0VF27|E0VF27_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236389 PE=3 SV=1 
---------VVLNDWPKIRKNYKKIFIDSFINYFAENPNYKLLFPSFsNVSeddlPFNHCFRLHCFAVYKAINFLMSNWlGeyeeDDSKILPVIGKTHF-DRGITLEMMNLYKHSIVYSCNNHLKPNL--KRKLSWQTVFDHIFD-------
>SRR4030067_646800 
-AFTQADADAINESRFIIEKDIPEIVSKFYTQLLR-YPPTRKHFMRQDGTLDQEYLQLRMHHLTNFWRRTAYGeFDdNYARYVDYVGRART-SHAGDHRPGCgppagsrglrAGPGNAHLGREPRRGG-CESGGDRRWRKEDRP----------
>ERR1719347_1935341
-GLSQNEVTLIWSHWESLKPHKRRLAKRILKVYIKEHPRARELFPNWvDIPtvelVKLTSFSRKAVDTWEAFSRAWECIDDAPLcrkVCYAFGKKHIEcnarikgHGQIDEHHVKNFIRIFLRIILVSAR----EGSEEAWRKATEFFSINFVRG--
>SRR5690625_6901273 
-------------------TPPETYTPSLHDAL----------PISA--------RASRHVDLTVAIAWALENPAPkVDALVAQLGRDHR-RLGFPPEVYDTFAQDRKSTR-----------LNSShVAISYAVFCLKKKT---
>SRR5580692_4143848 
--SDSGIWPVIRQSAARLSRDEDAFIQELHYEITRLISDPAGAPAP--------DMWVFCERMVRSFLWVAL-TDQPlGVVADtlrKVGVHYW-VEGFPDTLYGEVTHAMVQTVHYLCAHDWSASMGSAWITYFMWIKPHLLAG--
>SRR6266704_2516069 
--SDSGYD---APPAGALARDQGAFIRQLHYDVTSRIPESAVPPAF--------DMWGFCDRMAQTLLWVAL-TDQQpSLVTDtlrQLGAQNW-YEGFPDS---------------------------------------------
>SRR5438132_1665678 
-------RSRVLASYSRVQSgdRARTLYQAFYQQLFRAVPDVEPLFARID-------MVRQYDALNKAIKLLLDYDPQSREstdDIRAVAVIVA-APVIVAVHLNVAApVTVIDKRKGCGS--FGTTVV----AVMGPGVGWGD----
>SRR4051795_10036070 
-------RDQLFISYSHR---DESWLEEFATMLAPVQKSgslnIWSDKEiraGED-------WSAKiQEAMSRARIALLLVSPAFLAsdFIQKTELPKI-LSDHTCRGMHVywvlleqslTEWSPLSQLQAAHP--IKISlseisnvgerrnVIANICRQIANELGQYS----
>ERR1712051_620824
------------EGWATMQDHILNYLSntMMLPFVMRCNKSILKYFVTYESNvsllkfEGSqglAslEKTKHGCWfLTEVLTKVIPNLECLDTCieyLKDLGQKHQ-TQGVRREHLDLLALVYVSAVKEVMA---------------------------
>tr|A0A2C9LKZ0|A0A2C9LKZ0_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106051185 PE=4 SV=1 
--ISLADIKVITNQWEDVLRCSDLFGKLLVLYVLDNCPKVNALHPGLHArlTdARDSVEKQIGLRVIQSISCVIHNLNRAPAVESMVRDTfkkLQ-QHGYTKNTILECSEAFLSFMNQYFSKRWLKQHSDAWFKVLKALL--------
>tr|A8WLI5|A8WLI5_CAEBR Protein CBG24801 OS=Caenorhabditis briggsae GN=CBG24801 PE=4 SV=1
-------------WIFSFQLEG-SKSRTQIERILKKFKNKKKS---------------------------------------------------------------------------------------------------
>tr|A0A1I7RWJ6|A0A1I7RWJ6_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=4 SV=1
-KLSKLQKRALRFTWHRLQTRnggkrVDNVFEDVYDRLMRLVPVMKEMFTTRAFlsamSkHEVATPRDHARFTVKMIDSVIKNLDTDEKkrtdtlseFDPVlIGRAHAvlRPYGFVASIWEKLGETIIDVVLVQDAVRDLPGAGQAWVVLTACLVDQLRAGF-
>tr|A0A2A2L6J3|A0A2A2L6J3_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_22934 PE=4 SV=1
-KLTKLQKKALKFTWSRLQTRnggkrVESVFEDVFDRVVRYLPQTREMFNTR--------------------------------a---FlCAISrneTSslRDHARVIFFLhsfadlcKLHDKCLLL----------IPSA--FTLCFSLCTIYELRGS--
>tr|A0A1I7XY15|A0A1I7XY15_9BILA Uncharacterized protein OS=Steinernema glaseri PE=3 SV=1
----TSSLALLTSTWPDHFGNLFDMGLNALDATFKKHPDLMAYFAFNDRVnwKKEDKVRKVVLALEQTLVHAVSVFGEvhsgdekeeaiqgFEVLLEEIGGLHRAiVPNFVPEHFIKFLAVLPTAIVTTICdkreeimpESDREMLLELWKKISAFMGFHLDAG--
>KNS7NT10metaT_FD_contig_41_844412_length_214_multi_3_in_0_out_0_1 # 3 # 212 # -1 # ID=205324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.619
--IPPKLAVLIREKWQAFLEKfptREQAGEAIYDSFMEEAPSLRPLFKTP--------RSVFGLRFIASLTNLMAVRPAGVTEEagGNHGF-----------------------PAPRLGG--------------------------
>tr|E5SHC1|E5SHC1_TRISP Uncharacterized protein OS=Trichinella spiralis GN=Tsp_03845 PE=3 SV=1
-SLSAGELKLLRWLWKQMKQVHQgLASAKLFQIIFATCPEIKRFFGLAKVS----------------DEKALIDerMRKhmlilqASKLIILFQIISSa-----------------------------------------------------
>SRR5690606_9602430 
-------------------------YRAFYPILYSSVSGAQELFEATVG-TDNRKMLQILAKLFG----FISNVNhSSEFMKsdAFIerGKYYA-DHGISETMMRGFSSALVLTLRRTLGELFTISHVRAWGIFLDTISHAL-----
>SRR5581483_1589235 
---------DIKESFHRILEQKQAVTHLFFTVALGSGHEARLLIWETEG-----------------AGCSVESTDPPQWLC------------PPFTIYAQFTNDLLQALREFHGADWNQELMEQWRMTIERVGQIIFSACR
>SRR5262249_34977875 
------------------LEQKQAVTHLFFTVALSGCHEARLIFWGTEG-----------------AGHSGEFFSSPQMLC------------APLAMYAQFTNDLLRALREFHGADWNPELTEQWRMAIERVGQAIFATYR
>SRR3954454_18132641 
----------------------WRDADRPAWAALNADPEVREFFDR--------PLTrpeADASldrfrsdLAARGWGWWAIELTATGE---------------------LIGMAGLDPTE--DDIP-VAGVEMGWRlarAHWGHGYATEA----
>SRR3954470_12875293 
----------------------WRADDLDAWAAINADPQVRAFLGG--------VLDrgqAAESirrfrtaLAARGWGWWAVELTATGE---------------------LIGIAGLDPVD--EGLP-FDGVEIGWRlarWAWGRGYATEA----
>tr|A0A1Q9EV88|A0A1Q9EV88_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene4882 PE=4 SV=1
----------------------SAFKMEVFETFFATCEQSQEYLKASNA-----KLQFIAGRILDI---MTDMFRTPQSAVkdiSALGLLHA-GYGVREELIQPFVTAFMTAVKNAC----------------------------
>tr|S9TGR2|S9TGR2_9TRYP Uncharacterized protein OS=Strigomonas culicis OX=28005 GN=STCU_11951 PE=4 SV=1 
---------TLEGCWQLLELrpqGLEEIAQAMYFYLLSHNRQLQSYFYGI-------DMEEQGRALVRMLCSTVHTYGRTqtecdpvawsnfEGYLVEMGARHR-SYGVGDNVFHEMRDAFFQQFPHFVDAnSWRI-TCREWHTLWDTIIRLLQQG--
>tr|A0A0A2NAV4|A0A0A2NAV4_ALCFA Uncharacterized protein OS=Alcaligenes faecalis OX=511 GN=JT27_01100 PE=4 SV=1
--VTDAQRDIIKTAAPLLASGDKALTTYFYELILRDSPPMSPLASQ-------------------------------------IANNHL-ALQIQPEHDPMMGTCQLQAVREELIVRMTgNKLIDGWVAAYQQLSNLLIEA--
>SRR3954463_13473713 
-RVTPDDLKHVQRSWAKLCDRRESLLAELT-VTFQSNPALQ--C----------DACCRAEWLLCAGEELVELLPAPSTLASRARVLgDRWPDPLTAPSFEIDGRAWMAAATRCSS-MWSDTIEMAWRQAWLLLSDVLA----
>ERR1711890_22380
MHLSDTEKSAVVSSWSNVNS---SLLDSVLLQLVQENADMRAAMSRGDLAedsiREQETFKADVTKLTCCITKLVTRLGNTGEVSScpATCLKNC-P-YLQPKHVPLFISSFCD------KLELTEDAKKGWKFIMEKTAERI-----
>tr|A0A0B2VKC9|A0A0B2VKC9_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_09473 PE=4 SV=1
--ISPQGRDIIVNCFENS---HADIGNRICMRVFERRSDYQRFILALGKE----KWSWVTNTLRDFIEEVVLRIDDLAKideLSRKYGEDHVelKPFGFKPDFWVSLADAMIVeCVVLDMASHQPTDTVAAWSQLVSLMFSSIRDGY-
>ERR1700761_7028990 
-PLDEEALRIVRHSAGRLTYVTDDFIDWLHREGVALSPEVGHSVAG--------EGWPFCERMAQALLWV-ALTDQPAGvaagVLRRVGADNW-RDGFPDAEYVSVVQALVRVLRGLSGAAQIPAMASAWISCFQWMQPYLLIG--
>tr|A0A2A6D1B3|A0A2A6D1B3_PRIPA Uncharacterized protein OS=Pristionchus pacificus GN=PRIPAC_35146 PE=4 SV=1
-TLNHQQRKLIKNGYDSWRKKsCISSGRWVHSFVSSKDDRLKEIMEGNEE-----TTRIHEETITHLLDMAVESLESLDDsLGPLLISytgpqgvFEE-KDGFDRLYWSRVSEGMCQLARNFPSKANKYETVCAWRIVVLFICNKIELGF-
>tr|A0A0N5AH18|A0A0N5AH18_9BILA Uncharacterized protein OS=Syphacia muris PE=4 SV=1
-SLTEKQKQLIKIGYKKWSEStTVTVGEWVYQYIFHKFPSVKGKFAKDEK-----SLAENQRRITDIIEMAVESVDSLDDsLGSFLVSyssengfLGE-SEGFDRGYWEIVSEALCQLSRHFPVKSHKSDTVLAWRIVILFVINKIEYGF-
>tr|A0A0G4IA00|A0A0G4IA00_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_12404 PE=4 SV=1 
------------------------LAGKVFQKIITKAPSFRKLFVRPDE--------AYTKHFSVFLEQCLDYAQRPRCFWQehnDLAVKHI-IFGVGHNDITMMGRMIVEALQDIGGEGWAEDYAETWQKFWTEISRSL-----
>ERR1719384_273858
-----------LLGTTLTT-KLLSEKLSSRAGWA--QTQTSKMFSLLSFK------QGPAQFLVERFDILLNVIDDEDQLAEQLYqvaKTHK-KVGVDQSDLYSFQASFMKTLPSF-DSDFTAEVGNAWAYTLSH----------