comparison test-data/multimer_output/msas/A/bfd_uniclust_hits.a3m @ 9:3bd420ec162d draft

planemo upload for repository https://github.com/usegalaxy-au/tools-au commit 7726c3cba165bdc8fc6366ec0ce6596e55657468
author galaxy-australia
date Tue, 13 Sep 2022 22:04:12 +0000
parents
children
comparison
equal deleted inserted replaced
8:ca90d17ff51b 9:3bd420ec162d
1 >chain_A
2 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR
3 >tr|A0A1K0GGD5|A0A1K0GGD5_RAT Globin d1 OS=Rattus norvegicus GN=Glnd1 PE=3 SV=1
4 -----------------------MYGLEKEp-R------------ETEGClsrKLPSNLQRSSAPWRLHGFQNLLERSQGA--------QRAKPG------------HGAHSHSSVKMAL--SQTDH------------------rlvL
5 >tr|F6QUQ8|F6QUQ8_XENTR Uncharacterized protein OS=Xenopus tropicalis OX=8364 PE=3 SV=1
6 -HWTAEEKAAITSVWQKV--NLEQDGHEALTSISLTFISPLdvvwAYFKG----------AAHNK---------IKFCFNIELKQISLSFHARWKNQNPEQKLERLGEVLVIVLASKLGTAFTPQIQGAWEKFVAVLVDALSQGYN
7 >ERR1712144_198951
8 HESLWKRQVRG---evfLGESRPE-VrRDRRRSSG-qDAGGLPPDQTYFSHWaDLSPDSSQVKKHGGVIMGAVGEAVGKIDDIVGAVSNLSSCMPSSSEWTLPTS-------------------------------------------
9 >tr|A0A096M318|A0A096M318_POEFO Uncharacterized protein OS=Poecilia formosa PE=3 SV=1
10 VNH-KHDELII---tgvFFTS-------VSECVP-pVRNIYRQTTNSIENIgNFKngetfLTNPPVALYVVNMVEFTSKPLMSL-PLNGFYGILDFLKA--KRKNPNGGKLLADCLTIVIASKMGS-gFTPEIQATFQKFLAVVVSALGKQYH
11 >ERR1719244_1811598
12 --WSDDETKAIQMIWNSVD--VNELGPAALRRCLLVYPWTQRYFGKFgDIATPTAimqnpGVAQHGITVMNGLKLAGGPGGGPGNQPGGQQELWQRGKQQGQQQLWQQGQHGGKQRGqqQRQGQq-PSPRQSX------------------
13 >ERR1719167_1707907
14 VEWTDFERATIQDIFAKMP--YEEVGPAALARGLIVYPWTQRYFGNFgnLYSAStilvNPLIAKHGTTILHGLDRAMKNMDNIKETYAELSVLHSEKLHVDPDNFRLVSDCLTIVVAGKMGKDFTGEVQAAFQKFLAVVVSALGRHHH
15 >tr|A0A146QLZ2|A0A146QLZ2_FUNHE Hemoglobin subunit alpha-2 (Fragment) OS=Fundulus heteroclitus OX=8078 PE=4 SV=1
16 IILTSNYNYTFNTFFSKFSSNSYSIFSYSLSIILFFYPHTNTYFSHFnYLIPFSSPFNNHLstfiflfsxxxXXVMGGVEDDVEKIENMKEGIIRISEMNELNMRVEKEKLKIMEKKIIVV---------------------------------
17 >tr|A0A147ASE9|A0A147ASE9_FUNHE Cytoglobin (Fragment) OS=Fundulus heteroclitus PE=3 SV=1
18 EPLSDSEREIIQDTWGHVYKNCEDVGVSVLIRFFVNFPSAKQYFSQFQdMedpeeMEQSSQLRQHACRVMNAINTVVENLNDPEKVSSvlaLVGKAHAMKHKVEPIYFKILSGVILEVLSEDFPDFFTADVQLVWTKLMGALYWHVTGAY-
19 >tr|L8HUF7|L8HUF7_9CETA Hemoglobin subunit beta (Fragment) OS=Bos mutus OX=72004 GN=M91_21159 PE=3 SV=1
20 -YLTLEKKATVIDLWSKM--RVAEVGPDTVgrqvFKLLVVYPSTQRFFDYFgDCPLLIygqCFTffvsrhrfllfilvflCFKEDKMMYCFLKQFKKIKK------MIAKRNISK---------YKLRLIWVASHQYFGKEFTPEFQAACQKVVAGVVNALTYKYH
21 >tr|A0A2Y9DG99|A0A2Y9DG99_TRIMA myoglobin OS=Trichechus manatus latirostris OX=127582 GN=LOC101351845 PE=4 SV=1
22 MALSDGEWQLVLNVWGKVEADIAGHGLEVLISLFKGHPETLEKFDKFkHLKseeemKACEDLKKHGVTVLTALGGILKKKGHHQAEIQPLAQSHATKHKIPVKYLEFISEAIIHVLQSKHPGDFGADAQGAMSKALELFRNAMAANYK
23 >tr|M3YM80|M3YM80_MUSPF Myoglobin OS=Mustela putorius furo OX=9669 GN=MB PE=3 SV=1
24 MGLSDGEWQLVLNVWGKVEADLAGHGQAVLISLCQGLESRKEEKKRDpAHAcvssrrslFVSQDLLFHSDAFLVSLGHRSFLapvSGENGQSQKTQPAHHAQHHRQPWNTEKFISDAIIQVLQSKHAGDFGAEAQAAMKKALELFRNDIAAKYK
25 >tr|A0A1C4HDU6|A0A1C4HDU6_PROAN Myoglobin (Fragment) OS=Protopterus annectens OX=7888 GN=Mb6b PE=2 SV=1
26 -------MACPAKFWEEnVVPDAAEHGKNILIRLYKEDPAAQGFFSKYkDTPvselGNNADVKEQGAVVVKALGELLKLKGQHESQLHAMAESHKNTYKIPVEYFPKIFKITDAYLHEKVGAVYA-AIQAAMNVAFDQIADGLKTQYQ
27 >tr|Q9Y0D5|Q9Y0D5_MYXGL Hemoglobin OS=Myxine glutinosa GN=Hb PE=2 SV=1
28 -RTTEGERAAVRASWAVLMKDYEHAGVQILDKFFKANPAAKPFFTKMkDLHtledlASSADARWHVERIIQAVNFAVINIEDREklsNKFVKLSQDHIEEFHVtDPQYFMILSQTILDEVEKR-NGGLSGEGKSGWHKVMTIICKMLKSKY-
29 >ERR1711977_634702
30 --WTDAERAAISSVWGKID--VGEIGPQALGRLLIVYPWTQRHFSSFgNLSTpaailGNPKVAAHGKTVMAGLERAVKNMDDIKSAYSDLSRCTPRSCMWIPTTSGSWLNAspcvwlpsldvrPSTLMSRRpGRSSWLwssppwadsTTEGLKTHHNQIICSSFL-----
31 >tr|Q9U6L6|Q9U6L6_MYXGL Hemoglobin OS=Myxine glutinosa OX=7769 GN=Hb PE=2 SV=1
32 -TLSEGDKKAIRESWPQIYKNFEQNSLAVLLEFLKKFPKAQDSFPKFsakkSHLEQDPAVKLQAEVIINAVNHTIGLMDKEaamKKYLKDLSTKHSTEFQVNPDMFKELSAVFVSTMGGK----------AAYEKLFSIIATLLRSTYD
33 >ERR1719474_978995
34 ---------------------------------LLQSSWKQ--FRT----------------------------------FASLSGIRQEELGAGCQHQDLP----------QIQHHLWISEPSTFQQL-------------
35 >ERR1719336_830457
36 -----------------------------------------------------------------------------SINPQSTVDLGAQYISATPLNYKNHQDIYNSLLSNG------VLVPANVSLI-------------
37 >tr|B7QI99|B7QI99_IXOSC Globin, putative OS=Ixodes scapularis OX=6945 GN=8041668 PE=3 SV=1
38 -GLTTSDKCAIKDTWTMFRRETRTNALSLFVALFSRYPEYQKMFPNFADvalkdMMQCPSLTAHALTVIYALASIIESIDDENtmvELIKKNIRNHV-RRSVTPEHFVNINNLLIEVMQVKLRSRMTASVIVSWKKFFAMHDAVTRQTY-
39 >tr|A0A1W0WKD0|A0A1W0WKD0_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_10224 PE=3 SV=1
40 -GLTSNHIKAVRANWKLIEKRLPEYGLELFVAYLNKHPDWIGLLPFLKPadmprLQQTPRLKAHGTIVLKKLGELLTMLDSPPkliGELLKQGSTHR-ARGLAPENFQAIQHDLNELFVKICGPE---FDIEGWDAVLTLIMTGIEEGL-
41 >tr|T1KR38|T1KR38_TETUR Uncharacterized protein OS=Tetranychus urticae OX=32264 GN=107366531 PE=3 SV=1
42 -LLSDDEVKVIQSIWSSVMKDANTHGMNFFLKFFRENPTFQERFASLRNlkteeEMkASKRLKAHAASVFHAITALVDNLDDLEcvsDMLEKIAANHL-RRKVNWPFFDRIALCIVAFLSETLGTqIMDSKATTAWTKVLNVITETVKRVE-
43 >tr|A0A2N8ZEM6|A0A2N8ZEM6_9VIBR Globin OS=Vibrio tapetis subsp. tapetis OX=1671868 GN=VTAP4600_A2359 PE=3 SV=1
44 --LSEQQIYLVQECYRQVEESPHEFAKHYYGKLFELEPRLQALFRN-DLD-------IQGRKLIAMLEVAVNGVKDMGMLVPMltqLTQLahrHN-DYNVKKSHFSLLNTALHHAFEQHLQQAYTDEHRQAWQTLLDFMVDTMK----
45 >tr|A0A1I0MYA2|A0A1I0MYA2_9RHOB Hemoglobin-like flavoprotein OS=Cognatiyoonia koreensis OX=364200 GN=SAMN04488515_0317 PE=3 SV=1
46 --LSQTQVDLIRTSAEVLAEANVAATNVFYANLFRVAPGVRNLFSE-DMF-------EQSEKLWNTIVKVVESARDLTEIEADLHALgarHV-HYGAEPGHYVVVTDVLIQTISSMMEDKWTDETQAAWKTALEAVCATML----
47 >tr|A0A146Z291|A0A146Z291_FUNHE Hemoglobin subunit epsilon (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
48 ---SYHYLIIITSIFSNLY--YNYFFPNSLIIFLIFYPFTHIYFSNFFNLYNSYsintnpNIQSHFTNFLHFLYLSFNNIYNINFTYSYFIFLHSYNLHFYPYNFNLLSYFFTIFISSNIFSVIKE----------------------
49 >tr|H9GUN8|H9GUN8_ANOCA Uncharacterized protein OS=Anolis carolinensis GN=LOC103282340 PE=3 SV=1
50 -KMTDLDRRHIREIWTAAFENPEENGRLVIIRFFSDYPASKQYFKTVPTDGdlkAHPQVAFHGRRIMVAFSQVIENMENWNQACVLLErlvNNHKNIHQVPSGMFQLLFQAMLCTFDDLLGRTFTPEKRVSWEKFFQVIQEEVEAAY-
51 >tr|C3YSB7|C3YSB7_BRAFL Uncharacterized protein OS=Branchiostoma floridae OX=7739 GN=BRAFLDRAFT_96956 PE=3 SV=1
52 TGLTANQIQLIRDTWQIVYKNKRENCFAIFRILFTDHPSTKSLFRLMDAVdldvpgefEKNVAARAHMVRFMHSFATFMDTLDEPAELRQLLYDLgknH-AKHQVGPELFDALGPILMKALPIVLDGKFTPEVKTAWLTAYTFMSTHLK----
53 >UPI000197D711 status=active
54 AGLTPKDIYEAKQCWNKAASlGVNKVGVLLFKNIFTIAPEAAKAFSFGNDPnfMNNKEMEEHGVKVVMAFDHAVRSLDNIHAlqeTADGLRDTHS-FFNLSPEHHVIVKEALLQTLKQGLGDEFTDAQRELWNGIYTAIRNMWVG---
55 >KBSMisStaDraftv2_1062788.scaffolds.fasta_scaffold119418_1 # 1 # 498 # 1 # ID=119418_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.510
56 EQISPLKLRLVQSSWRQAS-ADEQAGITAFKFFFEMEPVAIGMFGLQDIRdlYNSYELKRIAAKIVKAMTHIVNSFDNFEGlrpLIKKLGMMHG-EKGVSPSQYNNFGKAFMQTVEEILGDQFTPETRRAWETFFRILTGALQR---
57 >SaaInl8_100m_RNA_FD_contig_91_216993_length_256_multi_18_in_0_out_0_1 # 1 # 255 # 1 # ID=160783_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459
58 NLLPKNTILQVQTSLQKVLQTTKTISPIFYAQLFEIDPSTRPLFSTEND----QQLKQQETKFTLMLSAIVNSLTNLDSlipVLQDLGKKHL-NYKVQKSHYETFGIALLSTFALILADDFTQETKKAWEDTYGLIASIITE---
59 >tr|A0A091DYW0|A0A091DYW0_FUKDA Cytoglobin OS=Fukomys damarensis GN=H920_02872 PE=3 SV=1
60 -PPHEGGSCATPLPWGNRDLGPWACVRPDLCRFFVNFPSAKQYFSQFRHmedpleMERSPQLRKHACRVMGALNTVVENLHDPDKvssVLALVGKAHALKHKVEPVYFKTISGVILELIAEECANDFPPEAQRAWAKLRGLIYSHVTAA--
61 >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1887876_1 # 1 # 366 # -1 # ID=1887876_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.459
62 -PVSDENKDILRESWKRLEEEKTTLCKNVFIRLLQLNPNLQDTFPSFkgvalDELMNSRSLFLHSKRLMEALEIAISSLDDGQDFTEYLTHLGErHtAISITENHFKIMEKALIFALKDMLGESCTEDVANAWREFFQSMAGT------
63 >tr|A0A2E1AIS1|A0A2E1AIS1_9CHLR Uncharacterized protein OS=Anaerolineaceae bacterium OX=2024896 GN=CL607_22355 PE=3 SV=1
64 SPVTSRQKLLL--HYTLLHLDADQMGKLFYDHILAAMPEVAPMFTD---------LESQRKHFMKMMIRIVHTIDEPDHLNIVLRELghiHK-RLHLKPRHFSKMGVAFSNSLAEVMGDRYTPEIGEAWRILYNRVAEAMQS---
65 >APLak6261659701_1056019.scaffolds.fasta_scaffold514158_1 # 3 # 230 # 1 # ID=514158_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
66 IELNAKNKALVKEGWKLLIETQFPnevggneralarFFDEFYRKFFEVNPSGKRLFEEGGM-------AVQSKALVKMMSMVVTSLENPSNLDLTIERLggrHE-LYGVSRSDYLAFTNAMCETLETVLGDKCNQEMKESWSLVLNNLSEK------
67 >SRR6516164_1622129
68 -SDDPRTEATRAghletggsrrrRGSRHVLPSAVR-----------NRPHHAQAIPRDR--------------------------------------YgraTQ-K----AAA--DVGL--rhRWPGX-------------------------------
69 >SRR6516225_5669596
70 -VMTPEQKRLAScfrrggppGSWRRPSPPLGIETAQVFRIPCVLPN--AAVHTAGVSD-------HNNSDTYRAALRPAH---R----AASQTASvrnHE-RIQSETAM--REGL--rrvTYARVLRTGS-hRTPYrnVTP------------------
71 >SRR5215813_13307430
72 -KSTPPRAsyfratdmaaqrkkllqtleqglgqawtPAVAs-AWSEVYRLLSGIMrnaAERVERLQNVWPAPFDAVIX------------------------------------------------------------------------------------------------
73 >SRR5262249_1440316
74 -AMTPEQKRLVEd-TLKQMAASADAAAALFYCRLFEIDPTTRKLLPQTARA-------ATRLGCGIPQLLTDIFAVR----YAAHADFgtfSE-GTHGHSDL--EAGY--hrrlVX----------------------------------
75 >SRR5260370_32836152
76 -SDDPRT-EATRaGHLETSGSRRRRGGRHVLPSVVRNRPTTRTLFRATDMV-------AQRKKLLQTLAFAIGGLDNLDALGSKVEDLgrrHA-GYGVTDAQYDSVGAALLWTLEQGLHH-pPWPRRGPKTTDC-------------
77 >SRR3989338_1269240
78 MDFNDEEIDIIKDTWDAVLYPey---PEEGFNPVLNFSTKFYRRVFEHENckNLFEEVDMTSQGEKLVKILSVLLVAVQTkslnqdHIHVLRKMGERHRG-YGVSDDMYEIIGGCLLRTLSEVCADVWDDDAKVVWAKLFGVVSE-------
79 >SRR6516164_9760095
80 IVTTPQQVQLVKQSFAKTTPIAEQAAGLFYGRLFETAPQLRPLFKG--------DIKTQGRKLMSTIALAVGSLQKLPELVPIVQDLgrrYV-GYGVKDDQLRYRRRRAAVDARQGaRGRLHTRCEGRVDLGLYDPrrYDEERRSAA-
81 >SRR5690348_1420512
82 ------------------------------RHRAESAPAVSGRS------------HSAKKEADGDDLHDDRRTERFQKAGPGSQEPrraPC-RLWCDCGGLSIVGEALLWTLEQGLAAEFKPEVRSAWIKLYDMIATTMQAGA-
83 >SRR5437870_6238790
84 FDVTPIQVDLIRASWAKVEPIQELAASLFYDRLDRKSTRLNSSHVAIS-------YAV---------FCLKKKKKKKEK------------YTHEHINNNKV----------------------------------------
85 >APAra7269096870_1048528.scaffolds.fasta_scaffold62442_1 # 1 # 438 # 1 # ID=62442_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.454
86 -IIQPSAVSIIQSSFEQIKPNAGRFTRVFYDRLFERDPSLKKLFIR--------DIREQRKKFFRMLGSIVKNLSNPDELEPKLQDLgsrHD-YYSVKREDYRTFFEAFIYTLAAALGNDFDENTRHAWRDFCDYVGAHMCKE--
87 >SRR5215472_6010456
88 -------------------------------------------------------------------------------rMISGPLPDvitAT-ACGKS----TMPASAPLYCGH--LSKGS---------VSISRPMWAMIAV--
89 >SRR5262245_10239308
90 -PENARPGNL-RHHYadrgrcsGSLLPEAvqaRSVAGRHVSRRHERAAEE--AAAdA--------DGRRQGARSA----RSGRGGRRGSRPAPRAIRRdrqAL-RHGRHGS---P------LGARGGTRARFTPSVKKAWATVYGLLATTMKNA--
91 >SRR5688572_4752169
92 -RMNSQQIALVRQTCTEVAPIADSTAEIFYQKLFQLSPSMRSVFAP-G-------LRERGRHLMETVEAATQIMDHRGTMTSAFAELgsrQM-ALAAGNNRYEAVGAALILAFRQGLGPSFTPEARQAWIALFDYIDETMKAD--
93 >SRR5689334_18520770
94 TSMTPDDIALVQESWRKIEPVKEIAAELFYTRLFELDPPLRIVCGD--------DMKDRRKRFTQVVGATVRGLARVDMLLPAVREFgmrHP-LPGEIEQHHANVAGALLWMLEKALRKEFTPEVKAAWIKAYGMLSQTIRQT--
95 >SRR5215207_7597532
96 QTMTRDQIRLVQASFRNVLPIRELAAALFYDRLFEIDPGTRGLFVDT-------DLRSQGGKLMAAIGMVVHALDAPESMVEKLKELarrHV-NYRQLQESSPPDFHRLhrfgsgrgsqRHVVSKGPGVAPVGQ----HVVPTHFASRvsrRLRAC--
97 >ERR1700730_6579985
98 -RQRLADDGVILRVLQRGLGIELEMEALAREEIGELDPDAarfRPHHAV--------GGGEVGGRHIELLRRHVDQRPpcHAAANGSARISLprgHV-SYGAKPRHYPVVGAALLWTLEKGLGDGWTPEVADAWLTAYSTLSGYMISE--
99 >SRR5262249_2898310
100 -ILTADEIERVRNSFDQVWAISARTAELFYGRLSAGNLFAHAPSEA--------ERDDKRQKFMLTLAVVVASLDERADMDSLSERLaqaHT-EAGVRPEPASELREALFWSLEQALGPVWTPAVDAAWRKAYRRLSERMVSI--
101 >SRR6516165_4200192
102 -----AQ--------------------------------------S--------DLVDRGRA------YRLLGLADLVDRrnQAaagGLSLFhrrAV----------------------SAGGVAWADRVLDALSlylcgyelrwpQLDHALGRgavhpdacaSLLRE--
103 >ERR1700733_1486793
104 -------------SQAHGGDIVDLyRDVRLVYRLFRRLPPAEQDAIpG--------DHRRGRLSRaAGRVAL------------APVRRAarrQ---------DRRREG-DVLELRRDGRGDDRRHVFHRDQElswlSDDV--PR-VVRD--
105 >SRR5580658_8437352
106 ---------TGAGKFESVQEYADSVVLLFYGRLFELAPPTRGMFKI--------GIPEQARKLMGTLTSLVDALDRFEELRQWLTDLgrrHV-EYKARALPGAGDGAHVGFRAGAGYRV------RPGDEDCVGAVAERGVCG--
107 >SRR5215831_4136876
108 -KHDPPTDLARAEQLQVRCA------DRVKGRRSLLRPSLRDRSRGPA-------A--LPRKIIRAEGKVdgdANEDRQQSSSAQchFASCTptrRaaQ-GLRCLDGSLWGSGCCLLWTLEQGLGSAFTPEVKAAWSEAYRTLAGAMQEG--
109 >SRR5215469_10861266
110 ------------------------------------------------------------------LTGAPLTVHPVRDRSPQFSRIgspsgrHA-TARARGQWIRNNSAFRAMTLQQALGSEFTPNVRDAWVAYYQTPAAEMKA---
111 >tr|A0A1E4AHQ5|A0A1E4AHQ5_9BACT Uncharacterized protein OS=Cytophagaceae bacterium SCN 52-12 GN=ABS46_00305 PE=4 SV=1
112 -ACTQDQIRIVKKTWSFFRNmSPEFVGDVFYTKLFMDYPDLEKRYPR--------EAQKRYEDLIKMLNMVISRLDRPDELTWALteiANQPH-RIWVTPAHYQKVVSTLIWTLRKGLGNDWTAVVEDAWMSCIKMVESLNAAI--
113 >SRR5262245_55554356
114 -CVTPEHRLLAQQAFATIQPLADELGLLFYSRLFELDGALRGLFKH--------DLANQAHSLMAMLQLTIEGLDAPEQFTRARTTWgyaTWTmGFSRTSTRLLRRPCSGRSSMRX------------------------------
115 >SRR5260221_10622870
116 -IVNAAQQELVMTKAEGVVLMPGVTGVLLCALLISANPSFRPLFKS--------DMRIQGVKLMTMLAMVVYNLPEPGQVLPAIRDRseeHT-SELQSHSDFVCR--LLLLHX--------------------------------
117 >SRR5918994_240771
118 -------------SWKGVAGRRDEIARAFYAVLFDRHPELRSLFAHTD-------MRAQYEKFALMVDEIVQLRTEPRQFVRSAVLLgqrHT-MYGVTRRLVIAPAIRL-DRFAATDSIGFATPSTSALQlllcpRETVRRSGVMS----
119 >ERR1700730_15638689
120 -AMTPKQVALVQDSFAKVALTSEAAAVLFYNRLFDIAPQMKAMFPD--------DMVEQRRKLMSMLAGVVKGLANLEQVFAGRQRTgkaAC-QLRCEGG--ALSGGRRRVAVDAGEGsGGWLDAGSGGcVGHRlWHAVRLHDFPS--
121 >SRR5258706_7695680
122 -RHDPPPdpadPPVLRPA----RVQGRETRHLDVQAPVPARPRPTPAVQ-------------------------------------------------------------------------------------------------
123 >tr|A0A1W2GRB7|A0A1W2GRB7_9BACT Hemoglobin-like flavoprotein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4043 PE=3 SV=1
124 -----RELMLVKSCWQTVAPNAIPLAMKFYDDLFEAKPEYRRLFSGD-------M-NKQAEKLMMTLGFLMANVDRVDKIKDAIHKLgalHV-KFKVLPEYYPPVQKALVGAIAQFMDNQWSYEHEDAWNKLISAVGDMMIEGT-
125 >tr|A0A0N0UYC0|A0A0N0UYC0_9BACT Uncharacterized protein OS=bacterium 336/3 OX=1664068 GN=AD998_10010 PE=3 SV=1
126 -----EQKEIIKSSFPRVLIHTLKNSTIVYEKLFMDIPEAKDLFKNT-------SIDKQGQMLVAAIGKIVKGLDNPDIFEKDLVELatrHV-GYGLKPEYFTHFGNALINMFEVSLVDSWDKDLHDAWVAVYQEVAEIMKSVI-
127 >SRR6185312_354929
128 --MVR--A-----------RGSAkC--WKCRWR--------------D-------RA--SVSnSLPAPATSSAGSACSNFS--------mngTA---SSkQPefDRVPRGGrgrgrrrKMTpeqVSLVQqsfakvapiseqaavlFYD-RL-FevapavkamfpadmteqrkKLM----------GTLAV-V---
129 >SRR6201981_618659
130 -ERHD--T-----------GGGQpRDAELFQDR--------------A-------DCGQGGGdLLRPPVRDRAAGQIVVSIRHGGAPGQadgDA-DRRGrRSyqSSLDPARgerarq--TpcqLRRQGgalsgrrcrvavdAGE-GTWRgldarcrgcmegglrnpVRLHDL----RGLRQ--------
131 >APLak6261666328_1056055.scaffolds.fasta_scaffold241778_1 # 2 # 196 # 1 # ID=241778_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.415
132 -AKTAGGL---NLLFL--AIVSS----EPENGFVTISPAAKDLFP-A-------DLTEQRKKLIATLAIVVNRLSNLQSILPAARTLtkrHV-NYGAKPEHYPVVGSAVLH-AGgrPRLGLDARSRLrsdGCVWHAVRLDDgrnleHEFANL---
133 >SRR5919197_1191720
134 -VLTRDQADIVQLTWRAVLPVGDTFAELFYGRLFALDPQLRRLFRE--------NLVEQGRNLTAMLSVAAANLARPEKISVALRQLgrrPT-RSSRARCSRSLLRDLLRLPLDARRA--VADGVARVVVafaRAVVAIP-RVIHG--
135 >tr|A0A1I1PNT6|A0A1I1PNT6_9RHOB Nitric oxide dioxygenase OS=Tropicimonas isoalkanivorans GN=SAMN04488094_11525 PE=3 SV=1
136 MPPSQQELARVKQSFEDLRPHHEPTSYDFYEELFARAPELRQLFRD--------DLKGQGMRFMNTLGLVLDDMTNPNGTtvdYAELGHLHT-TLGVRQAHFEPMEDALMASLGKKLGNEFTADLEEAWRNAFRAFSKKLIEA--
137 >SRR5262249_25899110
138 -MMNTQHIARIRLSFAWIAPSADVFGELFVANLRALDPSLSGLLAA--------EAGPQGWQLISILRSIIGGRDRPDRLFWRLQSFgrrLA-GDGLCAEDYDTIGDALMLTLEQCLGERLTPDVAAAWDATYAALAEVVQL---
139 >SRR3954451_4172984
140 -XMTPEHIHTVQSSWSKVLPVGNGQARLLFERLLQSEASLWGLFQL--------DAATWSANLVQMIDVLVTGLSLGDRRAVltrRIGGRNT-ACPAIEHHYDLIGTALLRTLAKPLRAEFPPSVEAECPPFY------------
141 >SRR5215470_15672373
142 -LMTPEQIALVQSSFERVGPELPALATRFYQELFGRDPALRPLFTT--------DMTLQKVRFAEKLTEIVRAFSRIPARSAPGTSAtgyGS-LTTR--PsAKHSSPR-SLPFSATASTArparrGAS--PTTWWPRPCSRVRQRLGV---
143 >SRR6266566_5437046
144 -DLTPENCDFMTEHHDL----------RILGRLVATE---------------------------------------------------Q-EQPVKDPDHDQIeeatrhrprscPTLFIWPNRRSQPLhrvlmRYMPvpgpRSPPSWCGPPSRSRSHGPR---
145 >SRR5215203_7560530
146 RPMTPDQVSLVRDARRAIESRHAEFSAAFHDALHELDVDTCALFRDTV-------TGGRACNVGAMLDLLQQASDDPRALIEVAAELgraHA-HAGVRDVHHHVAGVALHRALHRVLGVEFTPAMYEAWAEAFTLLIAVMERAA-
147 >SRR5580658_533798
148 -XMHSIMIGHLRDSVSLLPMEDLRPVHEFYRRLFELAPEAQPLFTR--------EAGQQAKKFSDMLAWVIAHLEHADELRKEMRELgarHR-GYGVTADQYASVGSALIWMFQHALGDRFTPEMEEAWLEVFAFISLEAERGA-
149 >tr|A0A1D8RRN7|A0A1D8RRN7_9GAMM Uncharacterized protein OS=Colwellia sp. PAMC 20917 GN=A3Q34_02175 PE=4 SV=1
150 --MTAKQINLVQQSWQKVLILSPDVGDLFYQQLFVLRPELATLLKN--------DKQdKirANKDFICLLSQEINLLQPIELTEEKV-nTSVT-TNDV-KNYQADVENALLLALTMILDKELKIALKRAWISTIKRLVGSIVIEL-
151 >SRR5262249_21459549
152 IGQKREPPTVERRHREQVEEAQEDGKIGDD------------------A-------QRLARALLDLFAELVGDLDGPRH---------V-GFLX------------------------------------------------
153 >APDOM4702015191_1054821.scaffolds.fasta_scaffold152199_1 # 3 # 686 # -1 # ID=152199_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.531
154 -------------------------------------------MSG--------DFSPEQKRYLEGFTS------GLQ--------IartGR-GLG-KPAASVPSGPD-----AEHLIAQDQT----------------------
155 >MesohylFT_1024984.scaffolds.fasta_scaffold1796824_1 # 3 # 146 # -1 # ID=1796824_1;partial=10;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.340
156 EELNFQEIAIVKDTFALVEPHGSKFAQDFYDKFFTMSPEVTSLFAN--------VDRDHSSKMiWNALMLIVYNLENKQQLQNTLFGLgrrHM-NYGVSSHHYLSMGEAIMATLQSYLEanQSWNEEVAAAWERAYNLVSRRMQKG--
157 >SRR3569623_2148552
158 --ISYGTVMQVTLSWDKFKQVQnfqERAGELIFERLFELEPQLRAQYKFSeD--ediKSNPAFASHARTMVDMIDMAVSFLGpDLDPLAEDLEDLgkrHI-AYGVNAVHLPVMEKAVVYAFEELLGDNFIKDDRNAWQVMFHFIITNMGKGM-
159 >SRR5450759_1049036
160 -ALTAEaPYSELKnlCVWSKT--------NAGMGSLYRSQHELVFVF-kN--------GMrphinnvelgrfgrnrtniwnyagassfGstrdselamHPTVKPLSLVADAIlDCSKRGgivldafagsGTTLIAAEKTgrr---GYGTELDPFYADT----------------------ivrrFEDAYGL-KAVHVE---
161 >SRR5210317_1560035
162 -----------------XMTSL----KSSMIGFFRNHQNCAKMFGE--------DMRDQAQKLAAILQVAFDNLDHVDSLVPILEDVgakHA-TYAVTPEHYGLVAAALIGTISTELGDAFDERAAESFEAVLGTVANVMISG--
163 >ERR1719240_1900674
164 ----------------AVARvLVHGL-ANLHRRALERLDLLLELVDAhRVVVlrllHRLdgrldrlHVLRRHLVLVLE------EG------------LLgavHR-RVGLILH----------LHLRLAIGVRRGE----------------------
165 >ERR1044072_2403146
166 -VLTEEHKKALRHSWRLLEPLGETVSDLFYRRLFEIRPDLRILFPP--------DMAAQKRKLLVMLMFIVKAMDWPIedwaaeidpenDLLlvvLALVRRHSHLYQVTSEHYAPVGEALVWTLEQALGQGFEGAPQKTTGPVCVLGCSPWG----
167 >SRR5437899_2276119
168 ------------------YPAVQKSGAAVYRPALVAELRDRPY-EF--------DIQVQLCVYLARMA---------LEIVAALN--AA-GWICVPKDPSPEM------LKAAWAYALDEDAAGVWKSMIAA----------
169 >ERR1700757_2961956
170 -------------------------------------------------------------RFNRLAGRERRAPARTR-----ARQSr-----QRPGPSRHDPTrLALSD----------VSEAERTDIVVS------------
171 >SRR4029078_1694892
172 ------------------RNFnPVVIGDSFYSKLFSLKHSLRRMFPG--------VMHEHYLQLVKLLNLIIAALDQPGQLEEefeILARKHR-HYGLTSSHYELFEEAMLWTIERALGKDCNKPIVSRWKTCYLALVRRTIAA--
173 >tr|A0A1H4HXI9|A0A1H4HXI9_9BURK Adenylate cyclase, class 3 OS=Variovorax sp. YR216 GN=SAMN05444680_12751 PE=3 SV=1
174 ---APDSVLLVQSTIGVLLQHQKRFTQDLYRRLFALAPAAEGLFR-GDM-------DSQGQMLSHMMQFLVHAMSRPEIMALGLRDLgrrHD-GYGVAAEYYPAFRQAFLESARGILDERYTAQVEKAWAETIDMIIESMRGP--
175 >SRR5687768_10564074
176 -RMTPQQTQLVKRSFWIAEGRRTQLAGCFLAELFARDPALWRLFSS--------DPALRRDKLHHAVAGFVASIDRLHPIVPVLEWLafhGA-RHGIGERQHVAIADAFLAAMETVLGESFTPAHRQAWWLACRSVIDVMVHA--
177 >UPI0004291969 status=active
178 --KQSDTVFLVQSTLEKVFPQLDEFTNQFFKKFYELDPSVKEIFYEIDA-------KNKKQMVVNMIGFLTQGINRFDVIIPSIKEInerHF-GREVKPKYYLIASKALVNVLEDYLGEDFTPEVKQTWIEFYEQIVNFME----
179 >tr|A0A2D6RHV2|A0A2D6RHV2_9GAMM Methyl-accepting chemotaxis protein (Fragment) OS=Colwelliaceae bacterium OX=2026726 GN=CL811_09640 PE=4 SV=1
180 --MTPKQNIAVIESWKKVQPIASQVSQVFYDDLCEKHPSLKALLGE--------ELSSARDQLVAYLNSLVETLVATDEVViEDL-AKH-LRIGLAPEQFSDVGPALLTSLEIGLEKDFTATVKRAWTALNKLIVAAMAQ---
181 >SRR5215469_12962076
182 --------------------------------------------------------------SLSARAGRQAGFGl------SG--------LGSAAT--taiPTPSTSLTGSTARTTG--cSAPYSR-----TGT-----------
183 >SRR5205807_5077868
184 ----------------------RVGHGRVYPRLYIIARHAAGIYAlT--------RPVAKPgRPRPVCLVPIHKDIA--VMRVTTDQLLartPL-GrFGEAAevgqlVHYLVSDAA------RFVS-GATVTIDGAWTAYGGWALR-------
185 >ERR1719223_615602
186 -MTDKSSSQRVLDSWNAIKSIPnykEVAGVLLFRRIFALAPEAHGLFRFTNGFepnseelFESERLIEHGKGVIATLEAAIDMLGpasDLNPLICFLQELganHQ-RYGVLHDHYPIVGEALIETLSAAMGDKFTDDIKLAWEEIYGIIESNMIDG--
187 >SRR3954469_4757651
188 -SMTEVSVQRLAENYQLLAGRMAALTATFYERLFEAMPSVRPLFKI--------DIALQSQHLAAARALIVRNVRHLDALEEPLTELgvhHA-KVGVRPEQSPPLCRVMIETLRDGSGDRWSPQLESDWTPVLEMVSRIMMAG--
189 >SRR6516165_10653891
190 -EPSPNQLHQNRPD------RRPGGGTLLWPPLRDGSR-NPGAVLQ--------RRGRTGSEANGRSCNRCEQSRRFRGDRPHRTRS-C-KAPRRPEHYALVGSALLWTLEQGLGDEFTPALRAAWAAAYCALSEVMIA---
191 >tr|A0A210QIU4|A0A210QIU4_MIZYE Neuroglobin OS=Mizuhopecten yessoensis OX=6573 GN=KP79_PYT10777 PE=3 SV=1
192 -YLTSEQVRLVKQSWLILGEDMAATGLLVFKKLFESNEGMKKLFYKLmRCDSseqlefDQEKLTRHATIVMQGLGAAVESLEDSVfltNVLIAMGERHA-MYNVKTEMVPHLWPAIRDAFKELMGEDFLPAVESAWLHVFEYIGSKFKMG--
193 >SRR3954465_7515966
194 --------------------RGRAVGPSCYAPVSPLHPATSRLCSA--------DLLAAGVRLVDELVSLAVAAGDLATFTDRARAVgmrCC-ACGVVAADYPAFGDALVAAVAEVVGPDWTTAAADAWRRLYTLMSETVLEG--
195 >SRR5215207_9441599
196 ----PEQLALVRGTASIIDAVGDSFAERFDDHLFARYPAARRLFPD--------DTTTHRGQLTDEIVFLVAAAADLHALLERARALgapPP-LRRtrrrlparrrgTRRRGRGRRGRSVVGRNG---G-SLA-----------------------
197 >SRR2546430_16462751
198 -----------------------------------------------------------------------------FLLSVVIA--CS-CWCRHVSSlqhdrad-------HPVGLCPGIVADWSPALSQNVGEGFQQDCSD-dG----
199 >SRR5271166_2850757
200 -RWMRPKRNSCARPSPKSRRSPIKAGAMLYEKMFALDPDLRRLFAI--------DIETQGAKLMAVFATAIANLHRLDEILPTVRELgrrHV-AFGVKDRDYDTGGVALVQTLEAGLGDAFTPAVRDAWMACYEAITGEMKA---
201 >ERR1711915_528574
202 TAFTEEQEALVKKSWNAMKPNASELGFRFFLRVFEIAPSAKRLFSFLhDSdvpIEKNAKLKAHAITVFKMTCESAVQLREKGTPtfsesnVKDLGKSHF-KYGVVDEHFDVVKFCLLETIKDAVPDIWSLEMKTAWDEAYTQLAEAIKSEM-
203 >ERR1719460_671936
204 -MVDAVVKGDVQRTWELVIPPDSgddhvfAIGKLFFDRIFEVTPGAEALFSFKGEdRAESAKFRAHAIKVIKTVGVAVAKLDDLETLVPILEDLgkkHV-AYGVVASTTT----SSVWRCCGRSRRGWATNSRPTW----------------
205 >SRR6266567_6698575
206 ---------------------LIVFTSTCLWSI----RKPNHSLPKRI-------CVVKLAHCWLHLTTVVAGVLREDNLVPVLQQLgqrHK-SYGVKAEYYPFFRAVLLETFQHYLGPRFTPKMQQAWEEAFEMISTQMLKGA-
207 >SRR5688572_5289639
208 -TVTPDRQQLIRDSWRALEPNGPRLVELAFLHLLQIAPAARPLMTGH-------SLPCVCRNVASILDQLIAALDEPKQFVPLAIGLgrsNP-GHGINAALYPAMGEALLWALHLQLGEGLTPELQTAWLEYHHLVSAIMRRA--
209 >SRR5262245_22087501
210 -LMTPERQRLVHDSWRTLEPNGTRLVELAVLHLVSIAPSVRSRLDGA-------TLPLVCQHIAGMLGRLVETLDEPKQFVPLAISLgreNP-DRGLTAKLYPAMGEALIFALHLQLGDAFTHELQTAWLEFDRLVCAIMQRG--
211 >ERR1711916_36627
212 ----LELFKILGILWFLLLMSLRNCSIIDYLR----------------------NI--LKLRLCSLKTCNFKKLNSX-----------------------------------------------------------------
213 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9902871_2 # 1417 # 1767 # -1 # ID=9902871_2;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.538
214 ----ALDTKLIKDSFELAKPISDKLVKRFYENLYSDYPQSKSLYLDG-------QLPESQLAILKAINFIVDNLHNKEKLGTFLKTLnerYE-LRLNDSVINQSVCSSFLKTLSEAFGSDWTSELAEQWELTYQMVTSFFQDSK-
215 >1185.fasta_scaffold1192548_1 # 3 # 452 # -1 # ID=1192548_1;partial=10;start_type=ATG;rbs_motif=AGGAG;rbs_spacer=5-10bp;gc_cont=0.684
216 -TVTPEQIDLVERSVTELTPIMDEVVADFYTGLFAADPAIETLFAGAGAGaggahGQGDGFAVQRAKFAAQLADILTAVRDHERFLATAAdlgARHR-GYGVHAAHYTLVGRALLDALARHLGDRWTPATADAWRLAYNLTAEAMMA---
217 >tr|A0A1Y5RHX9|A0A1Y5RHX9_9RHOB Flavohemoprotein OS=Palleronia marisminoris GN=hmp PE=3 SV=1
218 --MPNDDMRLIQPSIARIFVVRRSIGQAFYERLFERQPTFRTMFPT--------DLRTQARTFDDMIALIVKKTGDPEAVTPVllaIGRRYL-TYGLRPQDLRVIGEVLMEVLCAQTPGGLSPDEAAAWERSFSRAAEVVKL---
219 >DeetaT_11_FD_k123_441726_1 # 2 # 373 # 1 # ID=403715_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.481
220 -GLTDLQIEMIRSSWEKVTPNKKHHGQLLFHKLFEIAPEMTDLFPFG-DDFTKPQFTTHALNIMNALDHAIQNLDNPDVLIPKLRELgqmHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
221 >AP82_1055514.scaffolds.fasta_scaffold664619_1 # 53 # 358 # 1 # ID=664619_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.458
222 --MSGFALRLVLTQRQKATrkrpiaqyvieNHSINFAFHYIDRLFEIAPEMTDLFPFG-DDFTKPQFTTHALNIMNALDHAIQNLDNPDVLIPKLRELgqmHA-GFELTIKEFQVRLFLqrrpsssMLQCVASILHYLYKIsdvLfR-TFYFRTLFISFRTNFG---
223 >SRR6187402_963757
224 -------KLHIQNSWLKLG-YSADMITDFYNQLFLLYPRLRPLFKE--------DIRLQARKFTAHITYLINHINDWNRLQRDLDELgkrHV-HYEIKVEYFEYVKEALFPTMRKHMG---------------------------
225 >tr|A0A1C4TW82|A0A1C4TW82_9ACTN NAD(P)H-flavin reductase OS=Micromonospora haikouensis OX=686309 GN=GA0070558_10167 PE=4 SV=1
226 -----AVSADLGPSWAATAAAVDRAAANFLDTVSDRLPGLLP--------------ERDHTVVFAALGRLAGGVDDTAGRAAALAVLaraHR-GVGLLPQHADLLGDALLAAVARENRAHWTAALATGWERGLRRAVTAVRRA--
227 >tr|D6Z7Y9|D6Z7Y9_SEGRD Oxidoreductase FAD-binding domain protein OS=Segniliparus rotundus (strain ATCC BAA-972 / CDC 1076 / CIP 108378 / DSM 44985 / J
228 -----TDQ-gAAARLLEAVAADPVVFVRSFHVELFRCAPELAERFPS--------GLGGHHAAFVTMTKHILQGFAdgsDPPALIDLLGQLgrdHR-KYQLGEEHYRAAKTALAKALADAARSTRDNE---FCAQAAALVCAVMEQE--
229 >tr|A0A1Q9NIM3|A0A1Q9NIM3_9ARCH Bacterial hemoglobin OS=Candidatus Heimdallarchaeota archaeon LC_2 OX=1841597 GN=vhb_2 PE=4 SV=1
230 -SLNTKDIQLIKNSWEKLTENKKEVRNTFYTGMFEDDPKLKSLFRE--------SFLSWD-NLPDSFEFMFKHLENLEGEILEMKRLglkHK-TFSVKPKHFPIGRKSLVKTIKQYMGDKYTEELGAAWTKLFDYMSHYMILG--
231 >ERR1711911_167941
232 TGLTVRQKRIIAKNWDLVRPNLKEAGAFQVVRDGAT--ERVGRQP-QaAGPrrq---HHVQHDDAGRL-----------AQRCGVSGAAPGHH-RPqspssALETAPFSGEPQFILRR---------------------------------
233 >ERR1719223_727152
234 --PSSAQVDAVTASWDKVAALgAETVGVLLFKRIFEIAPALESELSEKPTaiIIGDLTLAREMT----EEEKETIDLEEKEEPeeveekeEPEEVDEqetTE-GRIISTESF-------------------------------------------
235 >ERR1711871_830988
236 --------FFFFFFW---------RPPFFFFFLLLRVSSFLPLFVASLPPperlfKVGSPLVAYGATVVRALNVAIGLLTDLPTLVPVLKTAlpsL--FPGAQKEHYGIVGQAALNSLAIALGRYWKEPVKNAWLKIWNTVVAVVFS---
237 >ERR1712232_1508017
238 -PLDGRDIALVQTTLGMVAKLGlNTVGKVIFLKVLKLNPNAAQLFTWGKMDaalmwKDGSPAVAHSIKVVQTTATAIGLLTDLDTLVPILQTLgvqHNGspmlpdaygGKGVIPKELDVFAGAVLEALAVALGANFTEPVKNAWIKVYTTADGVMKA---
239 >SRR5882757_3847967
240 -----------------------TSI--------------WPIIIN--------TaVGirnipQDYRNVARVLRLnqFEF-FTKimVPAAAPYIFTGl-------------RIGIGLSWLAI--------------VAA--------------
241 >ERR1700737_3002051
242 -----------------------RDF--------------HHLDLA--------DhHQ---------HRVagTQW-AN--GSMSNAVWTGv-------------RLKDVLDRAGV--------------KSGAI------------
243 >SRR3954451_23003713
244 -----------------------LKS------------TTGEVFLE--------G--klv-DE-------PGpdRAI-VFQn-HSLLPWLTVYg-------------NVAIATDKVFGGSGARSKSKAERHDWVMHNLELVQM---A--
245 >SRR5206468_1650083
246 -----------------------TNA------------TMGCVLLE--------N--rev-NS-------PGaaRRR-QGVc-ERQDPQRAQRmgdAqpqpradgacqgqA-PG-GDFRRYEAARRHCPRAGHATKSAAARRAVRRAGRADPRAPAGL------
247 >SRR5258705_633045
248 -----------------------TSE------------DAGPVALG--------N--qev-KQ-------PRtqPPV-VFLd-PALPPRPPALd-------------HWLLRAARDAGGP------QPQ--------------------
249 >SRR5690606_21133184
250 -----------------------INP------------LHGAVRLN--------D--aap-RV-------GDpeVGY-LLAr-DALLPWRTALr-------------NVTLPLEV---RGI----ERREREQSARKVLRDVGL---E--
251 >SRR5688500_4892119
252 -----------------------QEP------------SEGEVQTF--------G--sra-QC-------PNphTVT-VQQa-YTCFPWLTALg-------------NVEFGLRV--QGK------RDNAREVATEYLHKVGL---G--
253 >SRR5699024_2544359
254 -----------------------LSPSSGKIIVAFSSPTSGKIMMD--------V--ndwtSYKDSEMTALRLkeIGF-IFQe-SHLLPYLKIRe-------------QLEFVGREAGMDK-------KHARKRAKEILDLFGL---D--
255 >SRR3954447_21976298
256 -----------------------RAA------------TGGVVRWS--------V--dplvAAG-----GRARhpLSM-VFQk-DTVLPWRTVAq-------------NVGLFYALN---RD----RRAGAEGVVDDLIRLAGL---E--
257 >ERR1719419_74415
258 -PFTPEQRTLINETWGNISTKEtgsmGMLAKQVYERLFRSAPGIKRLFKDSD-------MLAISRAFGGMLGVLVSAVNQPLQFQHIVKGLgvrHQ-VYGVKPDHFRIMYTSLVRTFAQILGDKFTSEHKKAWSCLYNWVIDAMQRSMR
259 >sp|Q8T7J9|GLB_YOLEI Globin OS=Yoldia eightsii PE=1 SV=1
260 MSFSAAQVDTVRSNWCSMTADIDAAGYRIFELLFQRNPDYQSKFKAFkGLAvsalKGNPNAEKHIRIVLGGLGRILGALNTPE-LDVIYKemaSNHK-PRGVMKQQFKDMGQAIVTALSEIQSKSGGSFDRATWEALFESVANGIGQYQ-
261 >sp|P0C227|GLB_NERAL Globin OS=Nerita albicilla PE=1 SV=1
262 KSLSADQKAAIKSSWAAFAADITGNGSNVLVQFFKDYPGDQSYFKKFdGKKpdelKGDAQLATHASQVFGSLNNMIDSMDDPDKMVGLLCknaSDHI-PRGVRQQQYKELFSTLMNYMQSLPGANVAGDTKAAWDKALNAMANIIDAEQ-
263 >ERR1719238_612722
264 ------------------------------------------LDGE-------TKPKEDQ-----NLSNPWAATAVTAILIPNLRDLglrHC-RYGCRLEDYELGGKAFMMTIEHFMGDAVTPEVRAAWLWVYGVVQSVMVSM--
265 >tr|A0A0P6RCU1|A0A0P6RCU1_9RHOB Flavohemoprotein OS=Phaeobacter sp. 11ANDIMAR09 OX=1225647 GN=AN476_12305 PE=3 SV=1
266 ---ASTCKALVLRSFESERMDLEAFIPLFYSNFFEAYPEARAIFPT--------DTERLEAKLLASLTHIAEALESSERLdgiLSELGQKHR-RMQISDSHFDGFIQSFIRSLATTLGPEWSDQSDEAWSQFLRYVAKRMSFLE-
267 >tr|B3SDK5|B3SDK5_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_62364 PE=3 SV=1
268 SYLNYQERQAIIDSWNAISTEKQKYGTILFLKLFELEPRVKSLFTIFDFNeplediIQSPHFRSHAMRFMQSLETGVLMGFD-kescDFLFKSLGSRHH-FYDLKSEFLDVIPECILHTIKKGCGNNWSNETADAWKIATKVLCELFREG--
269 >SRR6266700_8223772
270 -FFLPFKE-LTEQHFSILGlRKARRAGLVLAQELFEHAPHVGARHSN--------AFGGRHPNAILAVEPFLRRAKNRDQP------DSG-AWSATSFHFGWNGGFX------------------------------------
271 >ERR1044072_5206314
272 --MAPPQIAVARSTGPKVSPMQQRLAQVFYERLFELDPTTRAFFGGVD-------LRHHGLKLTETLSAGIEVLGRDGPAPRGS--------GSGMAALRDGGGCVVHGAGVLPGPRVHDRSPGGLVGGVLG----------
273 >SRR6516162_1975606
274 -TGVSEQHLLDLGGVDILA----ATDDHVFDPA--GDLQIsavvqdAQVAGT--------YPAVRVDGFGGAFGHVEVAEHGLVAAcADlpg-LAGRHG-LSGDRI----------------------ANGHLDL-----------------
275 >SRR5947209_12860360
276 --------------------LFSRQPRSAGQRLFTRFPQTRTLFAATDM-------LEQRKKLQQSLALIVEHMQHPEVLGDMLKGWtrgTS-PMVFDHSIIP-----------------WSEQ---------------------
277 >tr|V3ZYY7|V3ZYY7_LOTGI Uncharacterized protein OS=Lottia gigantea GN=LOTGIDRAFT_167450 PE=3 SV=1
278 ---------------------------------------------MDDNqesLKENYRFRCHVGLFCETIRIAVEEMREIEEVLLFLKDLgrkHR-MYGATPTYIKTAGEGIVYAIDRKLGNEFTRSMKTSWKKFFTILQDSILEG--
279 >SRR5438045_5489985
280 -------LITRPTSYYLLSlhdaLPISLLADVFYSKLFVKNTGLRKMFP-A-------DLQLQRQKLMNMLHFIISNLDQPELFnkeIEGLGLRQD-RKSTRLNSSHLGISYAVFCLKK------------------------------
281 >tr|H2ZPV1|H2ZPV1_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1
282 MHFTDEELDLIRTSWGQVMKlGTKEVGIQIFTRLLNDAPKLRSHFYSIdiaDDEelslevmREKKKVVSHATRIAVAISKFVDFLDKPEELDSlltKLGESHA-RLQVDPGSFEYVAPVILAVIGGHLNLPSNSSTLQAWVKAYGVMRNGIVA---
283 >SRR3954451_6295623
284 -------------XMSTLIKGSPHFSspysptgetDQVPEHLFRLDPSLRALFTRTD-------FVRQRRMLLNMIGVTVRGLDRLDGVVPTLRDLgrrHV-GYGVRPEHLSLSR------LNHWLPrGQADPEVMQGTADfhh--------------
285 >tr|V9ZVV7|V9ZVV7_AERHY Globin OS=Aeromonas hydrophila 4AK4 GN=AH4AK4_1427 PE=3 SV=1
286 --MTSEQIELVQRAWGKVTALNNTYVQEVYAELFRLSPELINLFPDPAG--------MPVAKVSDTLNTVITSLEQLDAlsfIIRDLGRRHQ-KFKVQSHQFDLLKQALTLVLARRLGEHFTPALSDAWSQMYDEIAALMLEG--
287 >SRR5580704_19412242
288 ----PDIAAFVRFASRFASES-SH-------SQMTIHATIVSQQRQ--------QIEMRTGFX-------------------------------------------------------------------------------
289 >ERR1700732_4531564
290 ----ASPNGRRNSARASmlISSQPIRRSPRFSATTW-----------------------WHRPRC-SCSLWVRSEVNRMEELgggLCALGERHV-DYGVKRADYNKLASVLIQTLKEFLVDEFTVELQHAWGTVD------------
291 >SRR5258708_12476517
292 --------VLWEWLVDVGGARWRWFGGRLLEIFLETSPELRSLFHK--------DIAQETGMLEWMLGSLVKGLNRLLEIeggLRALGRRHR-DYKIDQADHEKVLRALLLTLAEFVGDDFTPQVSRAWKTVYGKIPDTMTDR--
293 >SRR6266699_2567678
294 -AItkrrfqAAQAVVQIDDSFnPPDWYpDEHPPMPEIVARFFELAPDAQGLFRG--------DMERQYLKLMNMIAAIVGTLDKREMFksiIGRSGRQHA-QFGAKPLHFAAFGDALIWGLEQQFGAAFTPEMKEAWIKLYDDVQREMMC---
295 >SRR5690349_3556304
296 -YLTGQQVLLLKKSFRQMN--PAQIAAQFYGTLFQQHPEVKSMFPA--------DTVELGSKLMSVFELVVFSFDEKEHgrfglqdvLikpLRALGRKHD-DKGVKPEYYEIANSLLLKIMKE--SEYFTTEMYQSWQLALEHLTYAMQDK--
297 >tr|A0A0S4IWR4|A0A0S4IWR4_BODSA Globin domain-containing protein, putative (Fragment) OS=Bodo saltans GN=BSAL_72665 PE=3 SV=1
298 -LVTVSSNELVQTSWSWVAHDMVGLGDMFYDQLFMIDSEIEHTlfAGT--------DMKRQAVRVMEMIDAAVQGLNTPETIAEVMFTSglrHA-AYGVQRDHYTVVGKALIAALKAFLARRFTPEVAQAWSVFYNGVQRRMLEG--
299 >SRR4051794_5741567
300 -SMRPEQMQLDGLTLADATTDRLARGRDFYRRLSVPAPYLRGRCDG--------DVDAESAKLKETRTLALRMLGNMRFMVATLDAMakrDV-ARGLSEQHCRAIAQSLIWALERRLGAGFSRQVCTAWTEFLAVVMTCLHG---
301 >SRR5436853_3450426
302 --------VLLKDSFNLVRSEEHTSELQSLRHLVCRLLLEKKKKnkTTTV-----NYIE---KEKLGKLEA-SCPVEQTI----GIGDKQR-DYQ--QMHHPERTEAQ-----KX-----------------------------
303 >tr|C7FFW0|C7FFW0_BRASE Extracellular tetra-domain globin (Fragment) OS=Branchipolynoe seepensis OX=326992 PE=3 SV=1
304 --VSDAQKAAIKASWAGAD--LQAAGTGFYVHLAAEAPAVYANFNLGADPH-GAKSQEQGLRVMKFVNQCVNSIDNMAIVQAKIDALahrHM-SYNVKKSDFVPAKPCFLGALADALgG-KFNADARAAWAGFYDIIAAGLST---
305 >ERR1719506_1011120
306 -PITAREGQIVQDSWKAVKKVGGESGHavikdIFYQ-HLLKDPNVKQLFRNS-------DMKLQATKLWQTLHVAVDGLSTSGPWFLCCRIWarlTS-STGSKRS------TSMPWVRRsSTrspraWGPRsrrssrWRGRKCTAWLLRRX-----------
307 >SRR5579862_1310240
308 -LMDPLRIRMVQDSLVKLTPREGSIVDLFAAELSGSPHDESETGGD--------NIAYQrERSVLGIMAAAAPFLHAPECILDEVVAEIG-AGRIHPADYDHAANAFLRALKKNLGAEFTADLWEAWLEALWTLCNLLSRT--
309 >tr|A0A1E3GPU1|A0A1E3GPU1_9GAMM Bacterial hemoglobin OS=Methylophaga muralis GN=vhb PE=3 SV=1
310 -KLQEQDIALVEQNFAVLMEFSDALAERFYQRLFTEYPEIMPLFKSV-------TIEGQHKKLLASMVLLIQHLRDTEMIEDYLqglGARHQ-QYGVETSHFEMFIENWLSVVAEFADQKWDSKLQQAWRNVLEYVAELMQSPT-
311 >SRR5438034_562795
312 ------AVETLRNSFERVIERSPNLTRRFYEILFEKYPQTRRMFGL-Q------SGKGKGNGKGAGARQRLRRChcrlhfgkekaTVVPFPlpvPVPLPAFRD-SYX-------------------------------------------------
313 >SRR5688572_434377
314 -PMDKERAHLVRDTWMVLTPRADEIAAAFYAHLFSLDPDAREMFAHVE-------MTAQGRKFLGMIGTLIRLLDDPADIVIetiPAARRHA-TYGVTGDHLDTGREALMRALERHVARRLHTCRSAGVGRAVRP----------
315 >SRR5205085_1772709
316 -LMENRQAHRTSDRLQIELAAAQARIGLLYFAQHDRAPAARAMFST--------DIGVQSRKFSDMLEVLVEGLDDFDQKRPALRAMglrHV-AYGVVPAHYDTLATAFLWALGHMLYPEFSPEVKGAX----------------
317 >tr|A0A0N9QWL5|A0A0N9QWL5_9ANNE Intracellular single-domain globin (Fragment) OS=Eulagiscinae sp. JPG-2015 PE=2 SV=1
318 --VSDAQKALIKSSWAGVD--LNAAGVAFLNQMEQKAHDVYAVFKVGGGATSNPKAAALGLKVMTFVDEAVKGIDDMGAVGGKLDelaQRHT-KYGAKKAHFPVAGPCFLDALAEVCGGRFSADARAAWSDFYDVIAQHLSA---
319 >tr|A0A0P6AJ75|A0A0P6AJ75_9CRUS Globin OS=Daphnia magna PE=3 SV=1
320 --LKTVNVSAVQNTWAIVNKDLNTHAPHFYVALLTAHPEYQPMFPTIANVpagalLNNAALKTLSVNVLTKLSELIGCMGNPDALNAQLVDLanqHK-GRGTTRAHFDNLSKVLIDFLAAKLGGEFTPEARQAWTATMQGINTVVEA---
321 >ERR1719347_1330150
322 FCLSESNIKALKSCHPHLKDRKEEFGHLFYSNLFSNHPDLKSLFDQTEEG-----RQLQAQRLADTVVAFLEKCDDLPSLLPTFKKIgkrHT-TKGVKPEMYQIIIDNLVDTLEEMLGKeVFSAEVKQEVLESISFLSNAFIK---
323 >ERR1035437_6084348
324 -SLDQEMIAIVQVSWENVTPDSRLAASMLAMNLCADDRNIASLFEE--------DRIKMSRDVMQAVSCIVADLDQPETLVPYFGSLgqlLR-RHGLHESGQQTFATALFLTLGQLLGPRYGPVEHNAWAIAYSFVVRIMIAE--
325 >SRR6185369_9977853
326 ------CGVPDPDHV--------RGGG-------TAQERSRRAFLPTA-------VRDRSR-----VPRAVQGHRHAGagRDADDHADLgrrHI-GYGVQLHHYDAVEQALLEMIRRMIGDAFTLDVRLAWSHIYNELVRIMLAG--
327 >SRR5215471_14715706
328 ------VPAGGPALARLLRR--------HLRRV--VSSRLAPLFLRLA-------FNDAISYDPATGSGGANGSIRLPEELARKEVAglaRA-V------------------------ERLRPVKE-------------------
329 >SRR5215813_3453690
330 -------------------------------------------------------------------IASDSEIQVSPWtrt--GTLaisARRCS-SSRISSGigsdtTFSLYGNCV------------SSSATIAWNTHGD----IQLDS--
331 >SRR5579859_1863727
332 ------NISSLQLTILNLLTVEDEFVPRFYNNLFNMYPLARSLFVHTe--------ISLQYNKLRLMLMMIIRTIHDADGLKIQLqqlGQRHK-YYRVEPEHFAILYIVFVQTVVEYLGPKWTAELEAAWAEAYGTIVRMMDME--
333 >Dee2metaT_7_FD_contig_123_47857_length_200_multi_10_in_2_out_1_1 # 3 # 200 # -1 # ID=100007_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.434
334 ---------VLRDREG---LGDPELVVLQRRHLAEHGAILQPLalLARQr--------HREDLELVRELLLLECDHRVEHPRahpaGVGVEgelGVGHH-TERIKRSlspsalLGRWIDLVVVGAVRR---------------HHQGGVVDLRLVE--
335 >SRR5262245_19300173
336 -LLTPAQKRLIRESFVTLEPAIDLVGQLFFLKLYRLDPSFRARFGG--------NPETQGRKFMAAVKLAIIALKHDDCLAPMLKLLgvrQR-ILGMKVRDYRMIGKAWTWTLERSLEKRFTRPIKDDWTALLALATRVLSG---
337 >tr|A0A1S3M8L1|A0A1S3M8L1_SALSA cytoglobin-2-like isoform X1 OS=Salmo salar GN=LOC106571144 PE=3 SV=1
338 -HLTDEHREIIKETWKVIQENIAKVGIIMFVGLFETHPECKDVFFLFrDVedlerLWNNKELQTHGLRIMHFIEKSVARLNQMErldQLILDLGKSHY-RYNSPPKYYMYVGAEFIRAVQPILKDNWTPEVEEAWKTLFLYITSIMKQGYV
339 >SRR5258708_4037766
340 -------PGAVGPAPGLQPPRNRPGARRGQPALMQSPSAGGPPPGPHrpR-------RTHRTPPRRAALVLLRRSLRDLDEVVPGLRAMgarHV-RYGARPEHYPVVGAVLIDSMAEVAWDAWRPAYGRAWAAAFDVVSGAMLAG--
341 >OM-RGC.v1.004444255 TARA_034_DCM_0.22-1.6_scaffold509117_1_gene597562 NOG05352 ""
342 -PfLQPTKFELVVNLKTA------------------------KALGL--------EVPPTLLARADEVAGVGGSAKRISHWppr------------------------------------------QSRWAGLPRRPERH------
343 >SRR5262245_16285966
344 --------XMVEGTLDAV--SLPALSADFYRRAFDTDPELARMFTA-D-------RRVQEARFATELAAIVRSIRCHDEFVPagrALGPVPR-L-RRDGRPLPRDGRRPAGIagrcprsdvearGGRGMAPRLQPDRRDDAERRPRAGQLGVTSG--
345 >SRR6266568_4225566
346 ----------------------------------------------------------------------------------FFFFQaedGI-RDG-TVTGVQTCALPIFDTVRHFGAGTWTADMQAAWETAVASIGSIMRA---
347 >SRR5260370_35001365
348 -----------------------------------------PTFPP--------AVGAGRKGVSRAVPGAVWSSDQPERLARGVGELardPG-KFGVPEQPYRLFCDALLATVQAFCAGSWSDQVQAAWERALAAITAAMMaggsgapgE---
349 >SRR5215475_1743066
350 ----ISYWPLVKQSFARATSDGVAAAEHFYARLFAVNPGIRALFPT--------SMTVQRERMFADLSRVIWSLDTEPECTALLRQIgreHR-RYGVLAKHCEAFLAARGRLLCrHDAGRLIRCERRARVVDCLDRQSRTAVagggl----
351 >APLak6261665767_1056052.scaffolds.fasta_scaffold282062_1 # 1 # 210 # 1 # ID=282062_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.505
352 --------------------GRSNLSLVFMKICLKLIPKLNVYLVKLI-------WRSRVKKLLNSLILLVEGLRTPEALIPVLKDLgarHK-GYGIVTEYYPLVGEILLNTFADYLQEDWTPEVAQAWLEIYTTTSNLMLEGAG
353 >SRR4029079_9820506
354 -RVDGILVEGLQASLATMQPAAAQIAHGFYTLLFARRPDFRAMFPE--------DMAAQERKLIATLAFVCEHWRKPAAVSvrlADLGALHQ-GLHVKPEHYPIVCDALVTAVMKHRHEALGPHRAR------------------
355 >tr|A7RHV8|A7RHV8_NEMVE Predicted protein OS=Nematostella vectensis GN=v1g197347 PE=3 SV=1
356 IPLSVAQKYLVRETWETIEQHSKAVGKKTFLRmfymssidfiysvvmeskgskdirvlglelafddvknsyrtwrFFEMNPDYQKLFPEFaTLDqvelEQANALHGHAKRVMKAVENAVSAMDDAESFAAyleNLGARHK-ARALKPAYLDAMQVAYTDTIQDLLKTQWTDGTAEAWNKLFRFIADTMKHGL-
357 >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold1207366_1 # 2 # 214 # -1 # ID=1207366_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.286
358 ----YASHQSQAASLAKAAPRPRVAVLGLrlpsgeSPQLARLGRAFAELLGA--------ELAAGERLLVLPAeRVehMKLELGLDEAEAYPLPTLgriHR-NLGPDLVVVGTlapqeprgtlsvtveVKDCLTGAVTATAKVTGPAAELFTLASQvggelrrrlgssalsgneraelraqrpaSPEVAQLYADG--
359 >ERR1712062_404977
360 -ILTNQEISVLKSSWELIAKKIEIAGAHTFLPTFDRDPKCPDN------------IERHCQRVMSVVGGSIELINDYKSLWKhliSLGREHF-GKIREWIFASIAGGSTersgcspssINFLSSKINGNITSKK--CFLQ-YKIVIITQX----
361 >SRR5271166_154013
362 -VMTRLEIALVHEGFHRMESRLESICMAFCRTLFGLDLSLRPLFPN--------DLQPLAAHLAAGLETAVRSLDDLQPVlvcAPALGLRLA-SHGVVPDDLHTVCAALLATLQSELGDAFTEGVRAAWRRLFWIVAAATIGA--
363 >ERR1719261_40108
364 ------TIAVVQGTWQEIKDalgdgVAETAGVILFKHIFRIAPQALALFSFKDCAggnvcdelFENKTLRKHAAKVVGTVDTAVGMLKKTRQADSRPGQSgqeAR-GLwggagalrcgrgGVVGDAVGRVGRRVYDRGPRGLGGGLRHHQNHN-----DRQELRLHGR--
365 >ERR1719238_2294225
366 ------------------------------LKVA----SALREFNTLRAEgivseqefLEM------KAKLLAVGKDELG-RSPSGDTLETLVEAthemdssrrRT-RWtrrarraSRSPTTVGVISCQIK--------KSSTRRTTRRW----------------
367 >SRR5690349_20281755
368 -IMRPEQAALIRTTWAQVTPLGIAAAALFYERLFALDPELAAKFAHTDM-------ERQGKKLLQALTVVVATADRLHTLGPSLEELglgHL-RYGVMDRHYDTVGVRYWPPSKPPLAQRSRhrsrrhgPWPTPAWPPMC-GPARGGR----
369 >tr|E9IBK1|E9IBK1_SOLIN Uncharacterized protein (Fragment) OS=Solenopsis invicta OX=13686 GN=SINV_03861 PE=3 SV=1
370 -GLTEKQKRLVQNTWAIVRKDEVSIGVALVLaiarfvyecntksffySYFKQYPEAQKEFKAFkDVPidelSKNKRFQAHCANIVATIGKVIEQMHDPElmeASVINFTEKHK-NRGQTQKQFENLKQMMLDVFPSVFGKQYTPEVQEAWKKMLDLIYSKIYQTL-
371 >tr|A0A0L7R0Z8|A0A0L7R0Z8_9HYME Globin OS=Habropoda laboriosa OX=597456 GN=WH47_01055 PE=3 SV=1
372 -GLTGREKRLVRESWSVLRVQSVNTGVAIMTSYFQQYPQYQKVFPAFkDVPldelAASKKFQAHCQNIVSTLSNAIDALNDVDlmeAILHTAGERHG-RRGQGRQEFIDLKGVIIEVMKGALKSRFSTEVEAAWNKTIDVLYLKIFEGI-
373 >tr|W6FSH9|W6FSH9_9ECHI Hemoglobin OS=Ophiactis simplex GN=Hb_a PE=2 SV=1
374 LDFSDDQKADIKSTWETLYSgNKFQLGVELMANLFKAHPDYQDLFPSLkGIPdvAGSNELRGHAIRVITGINNFVDALDEEeevmREMLHNMARSHK-PRKLTKTHFNEFAPILLETFEKKVD--MSSKARDAWIALYYSIVDNLFAE--
375 >tr|N1VSG6|N1VSG6_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 GN=LEP1
376 ----PDPILEIQKSFDHVLEYNPHWIDSYIDKLKNFSMenvTENQREGDNES-------PISSEEFLNSIESIIEKLGNPISVKKEVSKLaniYE-SLGITKKEFPKLLPILLSSLRENLPSEWNPSLESIWTQAITDLTIETIES--
377 >UPI00001F6528 status=active
378 ----IDGLRDLSESFDTLaadeaatAPAATELKaavegqfsgvfGAEYAKQTGKQPDTASYTLE---------------HSAAALAQYHYIVRNPHPLGQknKLDKVagEA-RYHALHARYHTMLNAYLERFGyydvflidldgdvvysvfkemdyatNLKTGPWRDSgLGRVFRSALESNDtkSTFFDDFA
379 >SRR3569832_1336210
380 ---PALVRAAPDSAAALRRCRCGGTAEKIAERARADD----------------------------------PESENSRGAGAemkGLGARHK-QYGVQPEDYPAMRAALLEVMAALAGKAWTPAVAMAWEDALYILTDVMQKAYR
381 >SRR3569832_1187104
382 ---PALVRAAPDSAAALRRCRCGGTAEKIAERARADD----------------------------------PESEKSRGAGAddeRIGRTAQ-AIRCSAGRLSSDACCAVGEQNGNGGX--------------------------
383 >ERR1719259_112507
384 -GVTGRQRVAVQASWRLVAPDAKRHGVAIFIRLFKKHPETQLVFKSFkGQQpeslADNKRLAAHATTVMASVATLVDNLDDIDTLLELLHKVaenHK-RRGLPIQYFEMVSNTIFDYLVETLGAALDRSGVEGWSNVFRAINSVIAAEYK
385 >ERR1712107_384356
386 ------------------------------NRIFTEQPNVQQKYFSHmD--iNELGTLGKHGVGFMKKIDLMVTyvKADEDDNLVALIHEItvsHS-KKGIRNAwEFEIVCEILISYFKEAMESEFTSDAEDAWkkffef------LV--------
387 >tr|Q53I62|Q53I62_9ANNE Intracellular haemoglobin (Fragment) OS=Alvinella pompejana GN=hb-i PE=2 SV=1
388 -----------ADNIAAVRGDVSTHAMNIFVEYFKKFPQHQNAFADYkGKDpeslKSLPKFKTHTTKVVSKLLDIVEKASDSGALQSNCTTLakmPQ-HKGLNQQQFADLGAVLVPYLQKALGGACDSA---AWeqayn----------------
389 >tr|A0A132BSZ5|A0A132BSZ5_9RHOB Flavohemoprotein OS=Rhodobacteraceae bacterium O3.65 GN=hmp_2 PE=4 SV=1
390 -VLHQIDARLVEGSFGTVFARKAELTDVFYKHLFEEMPAARDMFTH-DF-------SRQKEMFARVLATGVRSHRGDATLAPLIENLllqHR-HLGLTSEHMYMAQRALLMAFRVVLTGHLTAAELSAWNAALRRLCQSMAAGL-
391 >tr|F7RKN3|F7RKN3_9GAMM Globin OS=Shewanella sp. HN-41 GN=SOHN41_01091 PE=3 SV=1
392 MGLTEIEKEAITSSFSLINHQEQHFATIFYDCLFDMAPLIKPMFKR--------DRKLIEEHFYMIFCAAVDNIHHLDTirtILLELGARHR-NYGVKVLHFPIVKSALILAIQHELKGQSNASIENAWSHYYDVLAAIILEG--
393 >SRR5579875_3194573
394 --------------------------------------------------------SRCCSRATPSYGRCSRSRCrgpgrrsATGSPSSSATCRrpgAR-RSCSRRWPGITAGSASvtgtTGRSSRRSGPAWTAELDAAWLAATDWFVSVLAAA--
395 >tr|A0A0L8P0I1|A0A0L8P0I1_KITAU Flavohemoprotein OS=Kitasatospora aureofaciens GN=ADK78_37645 PE=4 SV=1
396 ----AADQRVITEYLELVTPFGE-LITHLYETMFRRWPYLRSLFPE--------SMEFQRAHLARAFWYLIENLHRPDDIAEVFGRLgrdHR-KLGVRPVHFQAFEAALCEALRRTAGPRWADAVEQAWVRMLRFAVAAMVSG--
397 >tr|A0A0G4II14|A0A0G4II14_PLABS Uncharacterized protein OS=Plasmodiophora brassicae OX=37360 GN=PBRA_003666 PE=3 SV=1
398 -NLTEERIDIVRKTWLTLKSGqgkgerdrlgsnpsvqdaMDLLAVMFFEILFKNAPEVEALFQC--------DLVMQGRRLTTALNNLVDLLGKdaaaISEILTRLAEVHH-PHGIQPEHYDPFGQALLAMVKAGLAEDFTSDVCEAWEHLYSTICSFMIP---
399 >SRR5262245_46558688
400 -EMNRIQVNRLRSSFKWFRPCGPAMIAMVFRSLGDRHPGVRALFPE--------DTSTLNKRLFETLRQVVKALARFHSLEERLMELgarAA-RAGANPAHYRIVRDELLATMAALAREDWSEELARDWTLMLDAVSGAMLRGA-
401 >SRR4051794_9566520
402 --------------KALVEDVAERghrrPMEVFYGARsdhdlydidtmlrmAQSHPWLS-VRPV--------VATGpaggPMNSLSGQLPDAVRQYGPWREYDAYLSGPpgmIR--NGVD----ALVGVGV---PSDRIRHDSVEELVAAGDX--------------
403 >SRR5258708_3005780
404 -EPTPTDITIVSDSLAPLTkEQVDNVLAAFYHQLFTRQPSLRQLFKSFRsgDQPDQQAMKLQRNKLAEIIALGLKLWEKPHQLIPALEKLgrqHH-QYGVRDEYYEDVWIALSEVLSEAFGLDRWEDICESWQRFIFLCARHMLNG--
405 >ERR1719198_2284224
406 ---------------------------------SDMPSDALDWFTNP-TPe---KRGTPDGGKVVSADVVAVAGQM-----------------------------RELISLPEADVAQGLSQLDP---Q-----DLMVLQ---
407 >ERR1719223_1791071
408 ---------------------------------------------------------ANSKAT-D-DEAS-KS-D-----------------------------ATKVAVPAGVAAPEPKEEE----P-----VAVMEP---
409 >SRR6266542_3322184
410 -VMTPEQIEAVEATTAVLAPALDDLAADVYARLDRLAPETAELFTG--------GPAAEVRGRARDDRARHPAPRRLpGACl------------PARPPARALRGQA------GALRARRC-----------------------
411 >tr|A0A194VHM2|A0A194VHM2_9PEZI Flavohemoprotein OS=Valsa mali var. pyri GN=VP1G_10414 PE=3 SV=1
412 MALTHHEAQLVKSTIPFLKEHGESISDTVYRTLIEKHPELNNTLNLIHL-----KDGRLARALTVVILRFASSINHISELIPKLERIcnkHC-SLGIQPEHYEILGGLIIETFDDAMGPLMTPEMKAAWTKAYRILSNMMIG---
413 >tr|G9MK89|G9MK89_HYPVG Uncharacterized protein OS=Hypocrea virens (strain Gv29-8 / FGSC 10586) GN=TRIVIDRAFT_143449 PE=4 SV=1
414 -------------------------------------------------------MNPPEKVDIRSTDGASVIYRDVISLNSPQEEIrvlHL-ESG---SGSSLLKCTLHRvSLQSVQAPSYE-ALSYTWGNEndrraVVV-NGYLVD---
415 >ERR1022692_2453048
416 -------XMSLPASFTSICNgiLGREE--------NSGCPAAKGQFLP--------DRDAWrRssaLLLFGPLHQASRSTGYVSHLHegaArppgrRispDRRPgrqAG-RSGRLRAGPRAGPPQVRGHRRALRRGRRQPAGDTGAFRGRHLDARVMIEA--
417 >SRR6266581_3027569
418 ------DTHRLKDSFAKIAMHGDEVPLFFYSDLFIKHPEVRELFPT--------SMKAQRDHLIVALGQIISQVDRVDELSAFLRGLgrdHR-KFGAVAENYEYVRDSLLETIAHFSGAGWTSRLDSQWRSSRRPGRGCGA----
419 >tr|A0A1D1W7H5|A0A1D1W7H5_RAMVA Uncharacterized protein OS=Ramazzottius varieornatus GN=RvY_17919 PE=3 SV=1
420 -GLAVKERMLVQRTWKELMqLGRSNVGIELFHQYFTKYPQYVQHFKAFREvPseklKAHPRLKAHATTVVNAMDVIIDSLDDTETAVAVLDKTgrdHD-RRGLSTSAFADLQTTLMMLLGMFLKDSWTPAVEQAWDKALTVVMNTVM----
421 >ERR1719487_198517
422 -NLTNNDIDLVHTSWNMILNDtapeyvklkesgddkhancVAWFYTVFYHRLFDVHPACRHLFTR--------EMMTQGSFLVRMISLTLQEMHDMEnfrDMMRSLAEKHC-AYGVKGIEYGIAGDVLLYSLQTVLGSdVFTSAVHFAWRKVYSAMLNHITP---
423 >SRR5439155_13306073
424 -LLD-------GGTLRAVRMSGDTRSEPWLKDLWERGVAVGELRRHLLLPleTPPGLPVPRGRILCNCFDVAESEIDAFLA----------------------T-SNSIAELqarlkCGTNCGSCLPELRRKSLCDIG-----------
425 >JI10StandDraft_1071094.scaffolds.fasta_scaffold6072973_1 # 3 # 245 # -1 # ID=6072973_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.634
426 -LVESFGAEGSDKKVTGIRLVGETIASDWLKEVMTSGEFTADIRRWALAPlsAPPSGHAGRGKVVCS-----------------------------------------------------------------------------
427 >ERR1719326_289429
428 --------------------------------------------------------AGQRMNLTKFITTAFSLLGTLPDALEALSQLgmrHI-LYQTKDAYWPVVGANVIKTLKIILPAEDFDKEtEEEWATLYGIMQKTILDA--
429 >GraSoiStandDraft_1057264.scaffolds.fasta_scaffold343999_2 # 425 # 754 # -1 # ID=343999_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
430 --RRRMDAELLETSLALVDTPDDGLTKRFYALLFERYPAVRPVFPEEM----HRDIARQAKMLRSAIISVVDHLDDPVWLtetLGELGARHA-GWGVLAEMYDAVTECMVAAMAEIGGDDWTPYMTDAWTEALDAVSGLMLLGYP
431 >tr|A0A1I3HEN0|A0A1I3HEN0_9RHOB Nitric oxide dioxygenase OS=Jannaschia pohangensis GN=SAMN04488095_0565 PE=3 SV=1
432 --VTNTQARLLSRSLRRISENGAPLARSFYAELFSAHPEVRPMFHS-D-------LSTQYAKFEDMLVVLVADVLNPGVILRPLQDLakrHV-EYGVTREMYPIVGDIMMRTLRTLDAAPLTGDELEAWDVLLGRVNAFLMDE--
433 >SRR5215203_6923026
434 -PGDSGADRAGRAD---AERDQAGLRRGRG-RLLPPAVRRRPLRggavhhrAG-H-------PtgeADRGAGCGDALDQAPRRVPAPGRH-ArpaAPGLRG-----------------PPAALRHRAG---------------------------
435 >SRR5579864_8015183
436 ---KPDPIFLVHTSFVHLRPRMAEFVSNFFRRLLKDSPELAPIFEDAD-------SVRLKTMVAKIFGTTIAGPEQTDQVeadLAELSRRHK-SYGAIPDFLPLVGRAFIATIRESLPDDTTPQTIEAWELLYANTAALMSKGL-
437 >SRR5262249_54331370
438 -IRLRK-------EIDNEWLLIASgVLSVIFGLILVAQPGTGALA---------------LLYVIGIYAILYGILGPRPCcv---------N-RFGAQTALDRG-----------------TSTYRELWNIS----VARLIG---
439 >SRR6266536_6175029
440 -LMTPEQITLVQSSFERLGPQLPAMATRFYQELFTRDPALRPLFTT--------PLPQQEVRFAEALTEIVRAMPRLDELLThtrAPRRPArrlR-GTGCRLPDPRRRPprrargrpgRQVRRPHTRGMGPRLQPcrrdharrrsrgPAHQQLTTTAAPTASQADGG--
441 >ERR1700754_2066947
442 ------DPGdrQLARELLAGAAGGDDLDALvehDRGAVLEIAREAVPVaLAQAD-------RDdQLGHLGA-----------------DRlLRGPaerPL-GRGAPLQDVALVvhrddavergqqqRAVALAAGAELVGEIWERQERGSLtARRYGSNRSI------
443 >SRR5918995_1637126
444 ------DVQALEKSFDLVAPRGDDLMEVFYTRLFTAAPAVKPLFAATD-------RRRLKRPNQRSPSVsVSEKQWSMKCSQDQladgqstgasrpetprndcsdsppgelaaKRDQgakTL-SRGGCSGGAIMVPDCRTPTPRGRP----------------------------
445 >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3668839_2 # 105 # 377 # 1 # ID=3668839_2;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.656
446 -------SGPLAASLAIFEPRLEAVTARLVDVLAASSPHLLALFPPSSEP-------S-----AALLGRFLTRIVETESLGqPLGDGLgldAY-PIP-TRDQWEHLVESFIWSLSAVAGKAFSPPMARAWRATGERLFSTMFES--
447 >LULI01.1.fsa_nt_gb|LULI01000097.1|_29 # 27187 # 28320 # 1 # ID=97_29;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.310
448 ---------------DEIKGRH---HSMFVDEFERQQPQYKD---------------------------FWARL------NrGEYQAGeyrRY-GKG-GKEVWIQA----------------------------------------
449 >SRR6266851_2503075
450 ------------------------------------------------------XMRNGSASLPLwPARYGAWTTRRPSPNISAPSRSti----------ANSVCGRAITNWSARRCSPPSVSSAASGWEAAFNRIATIMIQA--
451 >SRR5215204_2071689
452 -VVMSNDYQLLKESLALIEPVYDKVTGYFYARLFVENPHLRLMFPL--------TMDLQRDRLFRALVHVVQAVDQPEQVVPMLQQLardHR-KFQVEPAHYDAVGRALIGAIRQYSYGEWSDEIEAAWWRTYSVAARTMIDA--
453 >SRR5262245_14739337
454 -PCARARLRPR-------RPAL------Y-AQALPPRRLVPRPVRE--------LAEAQSRKFMAGLKLGIIALNYEDGLTPVIRLVgvrNR-RAGIKVRHHRVMAKALLPTLEQSLETRFTRDTKHAWSSFLTQVTRILSG---
455 >ERR1719401_2136855
456 ----------------------rGCMGVTSAPQTLRQVRQCRRLHGGRLArhdrdwsaeegsdeedVWESPALRKLFGKFVNAVGCTVAGLHDMTEIGLP--RRgatKR-MYGSHqR----------------------------------------------
457 >ERR1700736_6084178
458 ---------------ARVA--------QALDRVRKAARQRKK------------------EQFTSLLRH-----LNVDTL--------------RTAHYALKRKAAA-----------------------------------
459 >tr|L8LYK6|L8LYK6_9CYAN Hemoglobin-like flavoprotein OS=Xenococcus sp. PCC 7305 GN=Xen7305DRAFT_00009490 PE=4 SV=1
460 ---MSLQIGLLEQSFNCIRPYGKLFVSSFHENLFQTNPEIKSLFMGVE-------SQIQKNRIWDTLVLIMENIRHPNLLnntLQGLGARLF-THGLLPKHYPLVKKAFLATFKQFLGNEWNSELEQAWKNAYTYFHDLMQEG--
461 >SRR5919106_2778213
462 -----------------------A-VDRFYAA-VLGDPELAGYFTdvdidrvkrhqvlllsdvlggpesydG--------PDLGQAHRGlgitdghyDKVVGYLVAVFTDLGADGDTIAAAaevL----ASVK---PQ----I---VEDQAGSRDSHEX--------------------
463 >SRR5690348_11784222
464 ------------------RaePGRAGgvprarga--RRLGEPGGgrarpSRRPLADR-AAD--------GPH-ARaPRQRARPAAGGRRHRLRADAGgargPGAAPGaaaHP-GLDVVP--------Vveqdgg--------PGadpcgpleegtlADVVTRY-GAWADRDVLVCGSPAMI--
465 >SRR5947209_9205436
466 -------VLSVLRSpssplF---PyttLFRSRltver--DSERDVLMvaggtGIATMRAL--LD--------DLA-QWgENPRVHLFYGGRTDDDLYALDd--LHQLdrkST-RLNSSHANISY---Avfclk-------------------------------------
467 >SRR5438270_814702
468 ------------------------------------------------------------------------XMTANAVVSPLPSQPprrQP-T----------T-----------GATAMVRLVRESWARI-------EARQ--
469 >SRR5919202_1970091
470 -------VQMVPGGqvsstmvrslkvgetV---RlgAPLGQaltlyag--ERHRDLIMvavgtGLAPLRAH--LE--------RIDqEwqSTgRAPRVRLFHGARLPWGLYENRl--LQNLagRP-WFTYTP--------Vvsddp----------typgrkgwvGDAAAVS-GPLHGLLALVCGSPEMV--
471 >SRR2546430_6350501
472 --GGRResRVRGGQGGWV----SRAIVAEPQRGDVGRSGPAMGRMKVD--------RG-AGRDVVMVAGGT------GLAPMRAIIDDL-A-QWGENPRvhlfyggrgrggPYH------PPSLVSTAAAqPGVPVVavagaeaglshkeagspagggvrHGALAGRG------------
473 >SRR6195952_1380156
474 --DVALAGEAVRAIWFRLADQEADVAHWFGALLFSLAPHLRAQFPA--------QADRAARRLLRASIAAMSAVDRPQEFPAAIGTLareTR-ALGLDASADEPVGVALVGAVREFAGELWAPGADAAWVLAYSLAAEPARR---
475 >ERR1700709_350262
476 ----------------------------------------GDLDAD--------AT-AERELLVVAGGRRGGVGpaprGepaGPSGAGGGRPPRparLA-AGVDVRRttvivgartaedLHT------LDRFAVIGEDaPWLAVVgacesdplelglapgpvvegitrAGPWLEHDVVVA--------
477 >tr|A0A098BFR8|A0A098BFR8_9NOCA Flavohemoprotein OS=Rhodococcus ruber GN=CS378_10080 PE=3 SV=1
478 --MEAFAVARVQLSFAsivATPGGAERFATAFYTALWSDTVGIRELFPA--------GMETMRQRFATAVGWAVNRLGDPDAVTAFLTQLgrdHR-KYGVRPEHFRSAGRALHTAVRECTPPiLWTDALDRTWARVIDLLVGTMAD---
479 >SRR3569833_3303276
480 ------------------------------------------------------------------------------PNNTNHDKH-T-HRKRNPPehqniggkrpedLYV------LDDLRRLTAVsKWLTVTgvteegaipggdrgtlahavaqRGVWEYYDILVS--------
481 >SRR5215208_6178010
482 --NGRGRPRPDTAIIRRGVAGQPTIRHLFYDRLFEHDPETRLLFRS--------DLDRQRLRLLTMITAMVGPASDDLS------ATNA-GhAGVPPWRWLSLA-----NARDVADP--------------------------
483 >tr|A0A0J9XAH5|A0A0J9XAH5_GEOCN Uncharacterized protein OS=Geotrichum candidum OX=1173061 GN=BN980_GECA07s01957g PE=3 SV=1
484 -SFSSWEIAEIRQSWASMRDDQLevsqekanvgtasaFFCQQFYENLLGEYPELSVLFPS---------IKSQASSMAGILALVISQLDNLPRVrevLISLGKRHSRIIGVEVTHYELVGNALLRTLSDRIQDEFTPELENAWIKFFTYITNLMLQ---
485 >ERR1044072_9602616
486 ------LEQSGYTVVGRAADARELmLKVRSYVPDVA--------VVD-V-------RMPP------DL--------TDDGLRAAAEI-rrsHptV-SVlVLSQHREPAYMLELVGDDASGVGYLL-KDRVRDVTQFVDAVQRVAAGG--
487 >SRR4051794_28399871
488 ------EHEAGTDLLELTD--------ALVRAGVPCADAAQEAVAG-V-------ELPHGAQLPAER--------LADRLERRRVD------lD------------------------------RLLRFGEDAG-HLVLGA--
489 >SRR4029453_17830486
490 ------DLQALETSFDLVASRGDVLMDVFYARLfaaapa------VKPLFAGTD-------PRRQKAMLLGALVRLRGSLRGPPAFVPPLPRPgagPggE-APlrrhrSPAPEGHAARGPraaAWLPARPAGVRSGaatPRGQARRLWRPAGALPGGRRgpdrLHG--
491 >SRR5688572_12388254
492 -SMNEEQIKLVETGFQSITGRGERFISRFYENFFAASPKAEKLFAQTEWP-------NQSRKMLLTIMMVVDNLRDAAHIKKMLHEAnlvHQ-KFTLQADDFDALTDAMLRTLREFLTDDWSKEAEDAWRAAFAKINAIMLEA--
493 >tr|A0A0N7Z8G1|A0A0N7Z8G1_9HEMI Putative hemoglobin-like flavoprotein (Fragment) OS=Rhodnius neglectus PE=2 SV=1
494 -GVSKEGIAAVRKTWEPVYKDKENSGVFLFQVLFELHPDFEKYFARFkSEGakslFDNPMFLFHVkHKVMDSLNEVIDNLENDERLLKILKSVasnHK-KRNIKKEEFVTLGKVVLETLRRALGTAMNPEVEDAWTKVIDCAMSAIG----
495 >SRR5712691_10715499
496 -ALTLEQFRLIQHSWQMVKDGQfnafkaqqliadplGFWGLQLYDTLFELNPALKPMFQNT-F--------TQSQMLTEMVGAALGLlpgiLDQAlgeektavlwylPEYKiviisITYANMSL-SQNIDR----------------------------------------------
497 >SRR4029450_4347554
498 ---------------------------------------------SG-V--------TGSSLPKTLVREgvQSLTtpchRKLPlgtektaidpqlLPILVDLAARHV-SYNVKAEHYGTVGLALVTTLERTRGSRVAAPTKAAWVELWSLICTVRIP---
499 >tr|R7TL54|R7TL54_CAPTE Uncharacterized protein (Fragment) OS=Capitella teleta GN=CAPTEDRAFT_144794 PE=3 SV=1
500 -KLSAEHKTTIRDTWPLISHSLQDNGIVVFEKIFEVSPSIRTVFAASfGFpaspipDayelSRASNLRDHVTRFMQAVGWSVQHMDDLDTV-ttvfVNLGKRHIHLKSLEPDFFRVFSGALMYVWRSTIGPDlFTAEVRGAWCKLFEFMLQHLAHGY-
501 >tr|A0A1B6EVA8|A0A1B6EVA8_9HEMI Uncharacterized protein (Fragment) OS=Cuerna arida OX=1464854 GN=g.22480 PE=3 SV=1
502 -VITERDKYLAREVWMQVETNYVLISKSLFTNWITEFPEHLNFFKGLlDSSyddfLTSPKFEQHMaNSVLPNVGIMISNLDRPTDFRRHILKLawiHI-RKniALKIDHFNILKGLILRTLKESLGRGIGRDHEVAMFKVITAGFNLFS----
503 >tr|N1VY19|N1VY19_9LEPT Adenylate/guanylate cyclase catalytic domain protein OS=Leptospira terpstrae serovar Hualin str. LT 11-33 = ATCC 700639 OX=1257
504 ----KDTILELQRSLELALHLNPNLARDFYVHFLETKPEFQKFFQNTD-------METQAKKLLAMFGRTIERFGNLNQIHNELknlGKMHE-EMGIKVTDLAEIAPSLLYALEKSLGERFQTEWKPIWEEALGSLVRLMS----
505 >UPI0007D2C88E status=active
506 -GLDHKQIEIICASWAEVKKFGtEAAGCLLFKKFFIVAPETFSMFDEFkDIPnwEDSTQFKHHCKIVMNIIGGAVGLLRDPESLDSTLEYLglkHE-GFAITQHHFDLMQVELINTFRDALGAKVTPDVERAWNIFYAYIVRIIVCG--
507 >SRR5437870_4959208
508 --MARVNPRSMAHA--------ATAIAAATTRASEFMPTPQFVRTP--------AMPTQRERLLGAIIALVTHFDRPENLLPALTAMgrrHE-TYGVSLGHYAAVGSALLATLRDFAGLAWSPAYEGAWARAYTFAAG-------
509 >SRR3954447_20457037
510 -------------------------------------------------------------HKVKVEDIIVRGGGNL---MVEL--MntdAA-GS-----PLDTPVRAVTDG------TESTAAAREPI--------RLNPG---
511 >SRR6266545_1588040
512 -------G----CDLEQAVDTCPA----------A---LVIGLRPA--------TMGTL---------CYMGGLASA-------AVCcwrHV-RVVTCSQFF-------------------------------TTASPQSRQ---
513 >SRR6059036_2276597
514 -ALFPGTSHWVV---AAGMARP-ESKDHPMLTVAQKTLVQ-----D-T-------FAIITPIADDAAALLYKKLFELDPSLERM----------------------------------------------------------
515 >SRR5581483_12392512
516 -PMTPEQIQLVRLTLAQATAGEPSIGRDFYRRLFVLAPDLRARFQG--------DVEAECPKLKDTLKLAFASLSDLPFLIATLEALARrgVARGLSDQHCRAISKSLLWAIEQRVGSAFTPQVCNAWIAFLAVVVSILR----
517 >SRR4051812_13904716
518 -GMSPEEVALLRHSLDEMRADGPQAAEAFYAELFRLDPSARELFHL--------PVEQQSVVFFHELDALLSAVSDLPAFverSRRLGRMHA-GRGVRPEHFEAAAAALDAMLLAVYADGASPELRRAWRHAYRMAAQLMQEA--
519 >tr|A0A0N0S3I7|A0A0N0S3I7_9BACI Uncharacterized protein OS=Lysinibacillus contaminans GN=AEA09_04415 PE=4 SV=1
520 -MLSLETINEIKKIASAISVNGEIIKKIFIEKLQKNVPELLHIFYQIL-QK----SGRSKISLIDAVYSAAMQIEHIDRFVPAVMQVahkHR-SLGIQPEHYPIVGQHLVDSIQEALGNQATEAGIAALQLAFNRIADVFIQV--
521 >ERR1719171_419597
522 MGLSAKTIEIVKATAPVMAEHGYAITSAMYGSMLTADPYIASLFNPSHQKVLPgDTHANQPRSLANAVYAYAANIDNLGALTSAVTRIaekHV-SLQIEASQYDVVGEHLMAAVKKVLGDAATEDVCAAWTEAYGFLASLFIST--
523 >SRR6187431_1436969
524 ---------GAAQRRRTVWALARKA--------VRIGPDRANLVQG--------GPRGFEDEAaQHACDDRVGAADRPEifdSVVEDLGRRHA-LFGVTPAQYSAVGEALIWSLGEALGPALTRSRREAWSDFYKVVQLSM-----
525 >SRR5215207_7267255
526 -----QAV-----------AGEPEVRGSILRKAVRIGPDRANLVQG--------GPRGSEDEAaQHACDDRWSRLSTR-dlrLGCRGFGTTSR-TVRCDAGSVFGGRRSL---nleLGRGARTRADPVQARSVERFLQGGSALHVEG--
527 >ERR1719491_1400349
528 -------------------------------------------------------RQRRFTHMGAASGRPRAAVALPGARA----SLhdrPR-PHEAE-ASVASRCEATIKTLRDLLGDDCTPEVENAWAVVYGFMSSIMVESLR
529 >SRR5919197_656730
530 -LLDDDTIGLLDESLRLIDDRSDVVVNHFYAAQFATPPPRGLLGSR--------ARGC--------LGRGVR--------RDGPGDVgrrSR-GGGGRAGLV--EGRD-------------------------------------
531 >SRR5688572_8260099
532 ----DQEINIVRQTWNRLAAeHGNSVAEEFYKRLFECCPHLKDVFKN--------DFEVHGKEFIENMDHIIIQLDNPCMirEMQILGIKYA-SYGIRYEDYECMKKALFDALKTKLAEHWTPTVMVSWIWFYSTVSHIMKH---
533 >SRR4029077_8414069
534 -DMTPAQLQLIKKTLPEINASDDLFAAEFYRQCFDLWPETRSMMPG--------DLTERGRALVAEFIALASCVSgDMDRVVARaheLGVRHR-GHGALRAHHEVVEQAPAAPLASVLEDGWDEPTAQAWH---------------
535 >SRR6478736_6664572
536 --LNAVEIARVRLGFARVVPNCGAFADDFHARLFELAPTTSALFPD--------GVSNRRAKFRQTLVMLMTSLSTPTELKPALAALgnRCRACGVEEADFAAISQALIGTLAAHLGTKLTIADFDAWTALRGRIAGLLTA---
537 >SRR3546814_7943381
538 ---------------------------------------vfirlslsliiilvyRFLFFFFSSR-----RR-HTRCVLVTGVQTCALPIS-------TDELIa-----AWAAAYGQ--------------------------------LADLLIA---
539 >ERR1700737_1149585
540 -----------------------------------------------------------------------------KQPDGSAEKHfeqAC-ESGRPTGAVSHCRGTPAGCDQGSVGRRRNRRDHFHRGKGYGNLADILMG---
541 >tr|A0A254VKN7|A0A254VKN7_9BURK Nitric oxide dioxygenase OS=Xenophilus sp. AP218F GN=CEK28_14595 PE=3 SV=1
542 -MLDDATRAQIRHSAALLHTVGDQLVEHFYQRLLRHHPELGIFFNATHL-----HKRELQAAMSRAAAFYAEHNDQPENLQPMLQHIackHA-SLGVRPEHYPLIGEHMLKSLEEVLGPLASETVLHTWRMAFSELSGKLIA---
543 >SRR5215470_13616785
544 -----------------------------------------CMVTL--------CHCSFTqtcscGTRRRGICSRFRWLPSATGWCMRWAGScptSR-TSTPSAGTcRTWGASTASSAPSPSTTPTWTPELAADWKAAYDLVAQVMIG---
545 >SRR4249920_1577195
546 ------------------------------------------VWPC--------TATRCRCSSTRTC-----scgtrrRETCSR--SRWPYSAtgsCT-RWP-GSCPTSTTWTTSASTCRTWaaSIASSAPAPAADWKAAYELVAQVMVG---
547 >SRR5688572_1436081
548 -RPAPEVIAAVSASCQAVADRPVRLAEAFYEHLFEIAPQARTMFPA--------DMTAQMQRMSDTLVGAIAQLEKFdtAQLeaaLRRLGADHRTRHGVEAEQYRYVGHALTRAVRDVAGLAYSGALSSAWIAVYQYIEAHMSAG--
549 >ERR1740124_2148144
550 ---------RTRGAAALLLQgRAQPCGVAQAQEACYVCDEHCRCCSQGSgGPqqacarATGPPAHMPYA----THRCRVCCRIGIRARAPPTQALgkrHV-PYGVLPAHYDVVGQALLATLEGGLGAEWNDQVKASWTAVYGIIAKTMIG---
551 >ERR1711911_258465
552 -------------------------------------------------ritHGWEHVVQMHAMNVMNSITSIVDTLDNPESLVDDLKQIglnHR-KRPIEAIHFHVSIYAATEGVQHVLSEMIQSNIDDSAKYLRPVDGSQCDS---
553 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold8273257_2 # 299 # 427 # 1 # ID=8273257_2;partial=01;start_type=ATG;rbs_motif=TAAA;rbs_spacer=15bp;gc_cont=0.364
554 --------NELQTNIEDVYSAGDV-C-----ALFDSSaNRYRPtrtwlscafqgEVAALNM-------LGQDKVynegvFFNASHAYRSMYAVLGNFNPAQAD-gfeFF-VCNQDKENYE----RMVLKDNKIAGAMFVGSMKNVWSVKQLIEGQVDVS---
555 >ERR1711934_740551
556 ---SEETIRIVKSTAPAMKQHGYRICTTMFETLFAEHPSLASMFRKEDH-----TVQ-pgesyerQPLLVAqavrhsprflflapdshpllilipfsssSRCTRTPSTSIISPRWSPPsrgERERA------------------------------------------------------
557 >SRR4051812_844822
558 --TEPDTAFIAQSQLARIEAMGEELVQRFYAHLLA-APEMKQLFLHTE-------MARQHRRFLDQLTSAVRELRSPRNATAHLAALgarHR-GYGVKPEHFSLASSALLHALAVVIGKEFDARAASAWKEIIASLVILMNL---
559 >SRR5680860_1220841
560 TQLTAEQKHLIRLSFLRIEPALDLVAQLFFLKLFRLDPSLRKKFSG--------PIDVQARKFAAGAKLAMISLGHEDGLaptLKLLGARHR-QIGIRTRHYRTMSRALVWTLERSLDKAFDRDTKDAWNTLTAQFTKVMAG---
561 >ERR1719167_531039
562 MGLEQADIDNIQESWGIAKSKakLREHGVNFFLLLFTTLPEWRsKDFSHLgDGtleeLKTNPKFRAHCVLVMSNLNYWVENLDELDMGGASIQKTavnHA-GRGIMAEQFETVLGVVLKYLQGALAENLTEAMVESWTTLADTIVNIIKELN-
563 >SRR4030088_1427564
564 ---------------------------------------RRGRDGGQP--------R-RRELRRDGQepdepDASRRGDRGRPCAGPASR--------------R--RGSAAGCRSSPPSPAWPALSYEQWRETCDTLHGhTQVLG--
565 >ERR1700752_5389668
566 -----------------------------------VVPQVPAARSRVP-------LR-AASFRRGGLehdpdPKGRVSAKQEPV-FGK----------------D--HGQTIRLSARGQSS---PrRNDAARETTCKEARMtPEQVK--
567 >SRR6218665_550821
568 -FLSEEELTAAKSTWVRLQAtrNMQAMGVKIFLRIFELEPATKQAFESFrNLKseelVTNVLFRSHATRFMKAVEVTMNNLDALDVIivpnLKHLGRLHTDFKGFHVEYLKAFEVAMDEVWAEELGTAFSGDCRLAWTKIFSLITTKVMEGYN
569 >SRR5690606_39778542
570 ---------------------------------------------------------------------HATSVTSSHPCTPPVPcqcarrpALprlLRSsptrrssdlsL-MIKPEHYPIVgENLLASIRE--VLGe-gATDAVINAWA-EAYGFLA---D---
571 >tr|A0A257MW93|A0A257MW93_9GAMM Uncharacterized protein OS=Methylococcaceae bacterium NSP1-2 GN=CG439_2278 PE=4 SV=1
572 --VKVKNRLLVKLCIDEISPKIDIVSQLFYQELFHLNIHLKTIFSG--------NVTFLNRKFINMMATfkNVKHLEAIENSVEKMGERHVLHYRVQLKHFPTLKKALLLALKKHLGERFNAELEAAWHEVFDDVAEIMQRA--
573 >SRR5690554_3276444
574 ---xmSDADRLQVQASVERIRGQMDGFAGCFFDKLFALQPALRELLAT--------E-EGRRSKLRSMVSTlaNSRDFDKIAPAIRRLGDRHR-DYGVGVQDYVPVQQALLHAVAQVDPQGQSEQVQQAWSGQFQRISALMEPQ--
575 >SaaInlStandDraft_5_1057022.scaffolds.fasta_scaffold510383_1 # 42 # 362 # 1 # ID=510383_1;partial=01;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
576 ---mTSKDRALLKECVEYIEsESINELCDIFYKKLFDLDPKIKLILSD--------NDVVLRRKFFNMFSTfkSVKYIDKVSEIILQMGARHK-SYGINEKHLELMKEPLFESLHEVLGDEKFNYYKAGWEIGYQEVENLFKEG--
577 >SRR5436190_9873117
578 -GITHSDILLVQTTWNAVSEFSMKIVAGFYKHLFAAAPEVKPMFTT-ET-------SEQQKRMGSMINTIVNSADSLDEFRgsiSQLAKKHV-HMGVKKEYFPIVVKAIISSVEDQYGSGFTTAHKKAWYKILNEISNIMIEE--
579 >tr|A0A1X1R5G7|A0A1X1R5G7_9MYCO Uncharacterized protein OS=Mycobacterium bohemicum OX=56425 GN=AWB93_09655 PE=4 SV=1
580 ------TTSPVVVSLELYAEHVGDPIPIIYQRFYTAHPDAEAEFAG-DH-------HLEQRMMGGVLQMLIDLT-EGSfapSGCTYWLWDHI-GWGVTEQMVCDMFEAVVATIREGLGERWTPDMTSSWRDLISRLQPVLHAGF-
581 >SRR5699024_11940786
582 FRRVLFRSEIVKSTAPVLKENSDKIGKRFYEKLFSKAPELYNIFNQTNQER----G-IQQEALAYSVYAAGENIDQLDNLKELISRVtekHA-ALGVKADRKSTRLNSSHVSISYAVFc----------LKKKX------------
583 >ERR1719310_1734953
584 ---SASSVKAVQASWAKAENIGlRVVGELFFKELFEASPAAKELFTAqkFgEDAAGQRRFKAHTLNVMQTLSAAVYGLSDLSALARTLPAPtyaIL-SLSFTLISFTSL--------------SLTPLI--------------------
585 >ERR1712087_347811
586 ---------------------------------------HEELFTAqkkFgEDAAGKAHFKAHTLNVMQTLAAAVYGLSDLSALARTLPARiyaIL-SLSFTLITFTSLSLTPLIYHTLTLKGARARNSGRaaPWIRRPT-----------
587 >SRR5438874_997478
588 -----------------------XM------CTMHRHALRFPPAPN--------WAATRTTTPL-TTVTHRTAEVHPGRFAGSLRWLgraHG-KFHAPPAQYDVVRAALMDSLRAFAGEQWLPEYDQAWRDAYDVIARRMIQ---
589 >SRR6266511_448526
590 -------RRRRRRAATSSGRASHRLRDsRLEARARDRSRRVLDDASS--------WVEVVRLGDAGEPVVLVSAVAAIAHRDVRRVELareGE-RVRL-------QVLNVDAEEDDLAGEHWSVEYDQAWRDAYDRIARVMIM---
591 >SRR3954451_10251525
592 -------TSARRqqWTFPRCGPTspRPQRPGTRARCTSTPTCSCAIPRPA--------RCSRSRWRT-SGTGSSPPSATWLPgsttstRSCPSCSSSggtTG-SSGPSrRTTRPSVPacWPRSSTSTTS-GARNSPRAGRrptTASRAPDVLATVMIE---
593 >ERR671928_16913
594 ------------------------------------------------------------ALYFDGIDTGR------LRVHQTKLLVqvtGG-PVEYDGRELAVAHGGLDITLEHFD-PGWTPELARDWTQAYQLVAKVMID---
595 >tr|B7G0J4|B7G0J4_PHATC Predicted protein OS=Phaeodactylum tricornutum (strain CCAP 1055/1) GN=PHATRDRAFT_46237 PE=3 SV=1
596 -----HRKKMIQQTWRAVEFgLDVDCTRIFYTELFRKYPSVQPMFQHS-------NMEVQAQKLYEVIRVAVRFLDNVQELIPVLKDLgmrHAKHYGVLREHYDAVTEVFISVLNNYILteldcgnaGIWAMEVADAWHWVLTFIGNTMAD---
597 >GraSoiStandDraft_52_1057288.scaffolds.fasta_scaffold278261_1 # 2 # 652 # 1 # ID=278261_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.575
598 -----------------------------------------------------------------------------------------------AFLPAQRRAKLM-TSRLSSEPPWKGPAAEPSWHVLG----TMVG---
599 >SRR4026207_1965376
600 -LLRRALCRRA-QSAAAVSRRPDPASGSFRSRHRA----GR---PE--------SGRNGRGRRDPALALLSKTLDEMAPLREPLRDLgaqHV-HWGARPEDYITAREALVAALGA-LSPNWDETLEGDWRRAITAIIVPMIE---
601 >ERR1719359_2370951
602 ------------RLIVTPEHlDGCRAGLLALRVVLLHLGEGLGLLGSDSSGvsDCGVALgel-------PLQRLDLLGVLLGPR-----L---gl-L-NAGVRGLELSLLGRLlrvglselfVAEGLLLGL----------------------------
603 >ERR671911_2215695
604 ---------------ELEPAcaPDKQLVEHVQRlRVEAGAQVVGR-----E-------EerrsragqcprptsRVDVRGTHDD--------APLECVAEVLVDCgahAR-VACKVDergraaleLLDRVVPDDLVVDLHAVDEVDGGGQTgHVGPGTSSRRVstarakpQAGTLPQ--
605 >SRR4051812_41451604
606 -------------------------------QLAAAGPVLGARFAGGD-------RppraaavrprprRVGRRGGPLDRVPPPPRRDAARAAGARLRGRgaaRA-AGAGGRDQPLRVRDARVGAPVAVRGDLGGAAGIAAHYPVVGAVLIASMAD--
607 >SRR4051812_21433834
608 -------------------------------QLAAADPVLGAGHAGGG-------TparaaavrapprRVGRRGRLLDRVPPPPRRDAARAALARLRGRgaaRA-AGAGGCHQPLRVRDARVGAPVAVRGDLGGAAGIAAAGAPSGSPWTLTRSK--
609 >SRR3974377_1684031
610 -IMAPEHKRLLAESFSKLENRLDDLGSLLFQKMFEISPESRSLFKG--------DIEEQKLKVARFFAEVIRRRTRShhflpvtgkggEVIIPgvgPLGARHEINYGVRAKHYGYMREALLYAISTMLGSEYNEEIGRAWGETFDMLAGAMQK---
611 >APCry1669189000_1035189.scaffolds.fasta_scaffold267513_1 # 3 # 467 # -1 # ID=267513_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658
612 -VLSDQHKKVIVRNWTILSTDLSGRGTRIFLLIFGRNPLIKSIFSFGHLegdeLVCDPRFKGHALRFMQAVGAVVDNIDDYNNaVkpiLNDLGRRHTQFKGFKPIYFNEFQDSILQVSENGTCKQngeiriLNPSaagvnfCTPPLGKFSASEMTCIVSsGA-
613 >tr|A0A2W1CGM6|A0A2W1CGM6_HELAM Uncharacterized protein OS=Helicoverpa armigera OX=29058 GN=HaOG211460 PE=4 SV=1
614 -GMSLRDVYNVQQSWKTIHANPLDNGYLMFFRLFEADPETKTFFKILDNarSeadmKAYVKFKAHILNIMGALNNSVVNLDKPEVvvvWMEKLGTAHQ-KFNIRERHFWVFRDVLVNILQNDLK--LSEPIVKSWGRYVTFIYSHI-----
615 >tr|F2Q9X8|F2Q9X8_BRAFL Globin OS=Branchiostoma floridae OX=7739 GN=lGb13 PE=2 SV=1
616 -PLDAWQRFYLQKSWKTVARKSDQAARTVFLRMLQDNPGLRQKWPRISlL-teeeiPTSPYIKFLGERIFDCLDYIIDNLGDLDHVISELtklGRQHSDMNVMTPEDVWAIEAAFLAGVQECLEDRFTIKYEEIYSRFIVFVIETMVIGFD
617 >SRR4029079_30121
618 ---------------------------------------------------------------------------------------------------MHGMH--FWflnnHKNNKMTQKQTELVRSTWSMV-----AAMDH---
619 >SRR3546814_3749254
620 -------------------------CLFFFFCFFFSSIRRHTRCA---LVTG--VQTCALPILFNAIAAYASNIENLPALLPAVEKIaqkHT-SFQIKPEQYNIVGTHLLATLDEMFSP--GQGVLDAWGKAYRSEERRV-GK--
621 >tr|A0A1K0GS94|A0A1K0GS94_9ACTN Globin OS=Couchioplanes caeruleus subsp. caeruleus OX=56427 GN=BG844_22340 PE=4 SV=1
622 -GMNPaddaelhAVQRLLISSLEQAGGQVEVATR-LRAALAQAGPALFARIPG--------GPLAQVEQLAEGLAWLAQHTDqP-PALVAGFGRLgavLA-ECGIAPQQLQLAGAALAEAMRAgMAANGWRQDYDQAwrstWQHAYQWIAHGMVAA--
623 >tr|A0A077WN08|A0A077WN08_9FUNG Uncharacterized protein OS=Lichtheimia ramosa OX=688394 GN=LRAMOSA02110 PE=3 SV=1
624 -PPSQAQLNVIRDSWERVLSTpinnnntdqsstssnstlsttpsaSSAFHHAFFEALFTLDPNLTTWFPN---------VKRQARALTGIVSYVVRapailpvkyktykSLREMhqiqqtldeeeeqwmREQLKALGARHA-VHhQIQIDMLDHVGPALISALYQRLDSEFSPAMRDAWLHALHYVVYYMKQ---
625 >SRR6267143_1520378
626 --VTLEQIQMVQASFAKIAPIVGPATDRKLRRCSALVAGFrkeTRLSTG--------VSKNPGRSEVRGTLCGASCCGSLSS---------------------------NWVANIRRGI----------SP-LALAIASI-----
627 >tr|M3IRU3|M3IRU3_CANMX Uncharacterized protein OS=Candida maltosa (strain Xu316) GN=G210_0056 PE=3 SV=1
628 QELTPDQLRLITECIPIMEDLNLTLGSKFYRRTTRRHPHLQSYFNETHH-----KLLRQPRAFIFTLIMFAKNIHDLTPLRDVIRRIvskHV-GLQVKPDHYPLLGDVLIETLCDMFPYHmVDDKFKTTWSIVYANLASLLIG---
629 >ERR1712228_269173
630 ---SETMKGDVVRSWDMIQELgTNAVGERIYRVFFELAPEAVEKFPAHvRHkyrewtadeSddeadlR--nsAALRKLFAKVLNAVGCVVAGLLGDAFTPEVEN--awNV-VYGf---------ASSIMISGLKQAKEAAQVRALQDS-DCAV-----------
631 >ERR1711918_283694
632 ------------------------------------------------------GSECSWMCRC---GIARFEQT-------RTTSHksrRA-TYRvqPDRGILAHPGESCDDHFGGAPWGGLHPEVENAWNVVYGFPSSIMISGPR
633 >SRR6516162_1580517
634 --------RVRRARCSAatesTATNTASVPGCSFAYFFACAYSASA-----------------------------C-ASSCNlnPVMV--SWGAL-GSSLKRSHFDAFGDALIWCLEHQFGAAFPPELREAWITALRRGPNG------
635 >SRR5262245_22234373
636 --SADFDREPIREVLTRLAADPEVTMGYLYAWLFTAYPELRSLFPH--------AMTQTRAAVFGKLVSVLAGLDDRLQTEQALARLaidHR-KFGVKEKHYQPFFDALYVTAQHAAGSAWTREMAAALRSALDWFGSIMQA---
637 >ERR1719495_1281412
638 -MFKANEVTELRLSWNAwVAGDLANKGFELFCKMFEKNPDTKNVFDFMKGSsvtqmQGSSKVLFHVTRVMKNIDDVVKHADRLDEIVPILRQVggrHGtQGYNVPSGYFPFLGNALRELLRTKYS-GYNTNLDENWKKLWNFIVKEMHAG--
639 >ERR1712105_94955
640 -EFKPNEIMDMRVMWNGwVSGDLASKGFEMFCKMFEMHPETKNVFAFMKGSsvaqmQSSAKVLFHVTRVMKYIDEVVKHADKLDEVVPIMRQVggrHGtHGYNIQSGYFPHLGEAQRLLLKDFFKDRYTANMDAIFKKLWVFIVKQMQAG--
641 >SRR5260370_506041
642 ----------------VRD---YSSTCSF--------FFFLQAEDG--------IRDSS--VTGVQ---TCALPIYQERTEQVLSRLavdHR-KFGVRDKHYEPFFDAVFATAEHAAGPAWTREMATAWRSALDWFGSVMA----
643 >SRR5580658_2929351
644 ----APLRAIV-EEVLRSGGG------------------------------------------------------------------nvAA-GTGVRRNASLFHGAREPPGFYD--MpGLRELSSSYPWFQV---VP-VIS----
645 >SRR5258708_13478776
646 ----APLKAII-QGILRA----------------------------------------------------------------------G-GPLLRRETRPLVGAPRGQKALL--PpHPPGSGSVASRPKG---IS-L------
647 >SRR6266704_2687724
648 -----IARPPDR-RPRCGD---GVLLR-P--------AVHRQSRPA-------------RAVSLRDDANPRGGLPDADRAGQEP--GrraCD-RAGPRPDRQGPpqirrepeALPAVLR-RAVRDGRAFRRPGPDRRDGRGLA----------
649 >SRR6266536_777504
650 ----DGYREALDASFARVASSGEKAVAYFYGRLFAATPRLRGLFPA--------AMDYQRDRLLCALLQITQRLSN-rAALSEYLVQLgrdHR-PPGVPPAV--PGGAACEHPNPTLA-pGVAPllsgvraagqrvarVPHPRRPRRLGQHVPGAVH----
651 >SRR6202030_4225180
652 ----YRAN--A-EAGTFP----------------------------------------------------------------------D-STQEPPETGPYRVAPSDARLLRKSLaLLEPQSE--------------------
653 >SRR5256886_2416282
654 ------DREADADREADADRDGDAEPEPLTAPALSSPPAV-PLAPP--------RDEAARQHdEPEPAPPPDQVPGAAdpretagppeppeeppP--------DgkgEP-AAG-----PDPAIAAGQEALRAFARE--afTSAAEEAWTQVYLAGSSLMIK---
655 >SRR5581483_8202477
656 ----------PDDPVFDGMqgNVGRvaarylphrEGEAYVAGPVGMVRETIRALTRA--------GLPRERIHYDDALLAEDKQASAQgvagatahtsrtpessrPGRTGEAGNAgpdGH-IrrvaesdqAGPAGGTAEPGQSGLRDAAADIAPQ--------ADTAHQDGGPHDDQagA---
657 >tr|A0A2G8KCQ8|A0A2G8KCQ8_STIJA Globin (Fragment) OS=Stichopus japonicus GN=BSL78_17342 PE=4 SV=1
658 -GLSTVEKDHIRKSWTALMKNKNENATLLIVNLFKMSEGAQDVFPKFKGknpdeLKKSIGVRSHGLRVLAALNSVVENLDDIECLVDMLQHIaHShHPRGTSRKHFEDLGGVVIATFEEALGKKFTDDAKNAWAKAYGVILGVIKSEY-
659 >ERR1719203_2782565
660 --------ITSKFGWTSNMQ--------------KIIQSQTHSKTQDMqrDYYLNQK-KTLEI----------------NVRHPLMKELlrrVE-----DNPEDKVAKdMATMMFNTATLRSGFSLKDTVNFAESIELMMRQTLG---
661 >ERR1719343_1244138
662 --------LVGV-SWFfSSEKFsGRMQNFWILKALFGTSFPLLfvwvialVIVSIHTGSFIAPLIVX------------------------------------------------------------------------------------
663 >tr|A0A0P4VK04|A0A0P4VK04_9ANNE Extracellular globin OS=Glossoscolex paulistus GN=HgBp PE=3 SV=1
664 ---SAEDRRELKFIWNYIWASGftdrkAAIAGAVFKDLFQHYPSAHDLFTRVKVdEPDSGEYRSHLIRVANGLDLLIGLLDDTQVLDHQLNHLadqHILRKGVTQQFFKGIGESFARVFPQVS-SCFNV---DAWNRCFHRLANRISKD--
665 >tr|A0A0S2MLN3|A0A0S2MLN3_SEEJO Extracellular globin OS=Seepiophila jonesi PE=2 SV=1
666 ---NSLERIKVKMQWAKAFGYGasrAKFGDALWTNVFNYAPTVRPIFYSVNSkDMKSPKFQAHVARVLGGLDRVISMLDSEPTLNADLAHLksqHDPR-ELDPTAFVVFRQALIATVAGTFGVCFDV---PAWQQCFNVIAMGITGS--
667 >tr|A0A2W5I8T1|A0A2W5I8T1_9ACTN Uncharacterized protein OS=Lawsonella clevelandensis OX=1528099 GN=DI579_06450 PE=4 SV=1
668 -----TYYTVLGPAITLLREHPEDFMRHFLAAALTYDFHFHTFFPS--------VNDHHASRYTHALRYILEALDQstndpdcLDDVIDFLSQLgcdQR-KYQLTAEQYQSLAAALRDTFALLLPYQWSTELNDALLTSFEHAINVMQS---
669 >tr|A0A177JSP9|A0A177JSP9_9ACTN Oxidoreductase OS=Dietzia cinnamea OX=321318 GN=AYJ66_05610 PE=4 SV=1
670 -----AQAPPLLALRDLLA--DDRFPDLFARALRATDPDFRELFPR--------DATPVLREFVRAMTWAFETTEYahgdrskVEEVVEFARHLgadHR-KLDLAPRHHQRFGEALTHTLRHLAGRGWDDRLETTLATAYRVLSTALQQ---
671 >tr|A0A173LPQ6|A0A173LPQ6_9ACTN Phenol hydroxylase P5 protein OS=Dietzia timorensis OX=499555 GN=BJL86_2914 PE=4 SV=1
672 -----DQLPALLALRELTYRessdVAPDFRRALEDALNTEAPYLRADLPR--------NLDGPFATFVKLYRFLLTRVEDsggdrakVDDVLDLCRELghdLA-KYNVVEEQYERFGHALNAALARVAGEEWTGELSKVQNQFYVIIARALHK---
673 >tr|A0A2N6TBK5|A0A2N6TBK5_9CORY NAD(P)H-flavin reductase OS=Corynebacterium kroppenstedtii OX=161879 GN=CJ202_05310 PE=4 SV=1
674 -----VHEASLVPVVTVLQTDGSRFVDAVFTHLFARRPSFIRRLPA--------DLSQLKPSFRRALVHVYAKQATgnglDRRTRRFLRHLaedHR-SFGVEAPDYVAMGDAIIDAGREIIAPQVTSEEFELFAMATGQIIGLMEE---
675 >tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 OX=1581089 GN=HMPREF3121_11375 PE=4 SV=1
676 -----------MRAAAAFGRQAPTIGPEAFRRLLDAEPRFRHMFGG--------SKTALRDQFMSALSTALVTRADvgrfPAATIRRLEQLareNR-KFGVAPRDYATLAEHLLDVFGERLPAgpdsgAQVDALREILDEAMSLI-AAAAV---
677 >tr|M3VCE7|M3VCE7_9ACTN Putative oxidoreductase OS=Gordonia malaquae NBRC 108250 OX=1223542 GN=GM1_049_00130 PE=3 SV=1
678 -------QPVLTVLRDRIAHDPDRFAVGVFNRLFAETPFLRELFPS--------EMSRMRATFTQVVDHVLDAIANdddHAELIEFLAQLgrdHR-KFGVIGDHYWLMYDALMAEFAAMLGPGWSPDAQEATSHAMMLMTGVMRG---
679 >tr|A0A2D6MQX9|A0A2D6MQX9_9DELT Uncharacterized protein OS=Deltaproteobacteria bacterium OX=2026735 GN=CL908_08110 PE=3 SV=1
680 ----TEDHELLLQSLDRVMHGEVDLSTRLYERLFSRHPELRELFGP--------NSIPvQEEMITETLISAVDDLEGLpwiEDNMQLLSQKHS-DADVTSEMYDWWAECVIETLAELSAPDWNRRLEELWRKQIARLCELMRAET-
681 >SRR5207245_2384740
682 -NPQPST-HAVTEQVVTLDV------LPWTSGKLGLGPGKarlsEPLAPG--------DTLE---SL----------LERQRARIpgfeewvYDArerriheHCTLL-VNGQAEYRRHTAEVEI------------------------------------
683 >SRR5689334_4915957
684 ------------------------------TASQRVTP----SLRG--------KRVPSGQmgdRKVPD-VPIVDAHVHLWDPTafrmpwlDGNKRLNR-PYGLADYREQTAGLPI------------------------------------
685 >MudIll2142460700_1097286.scaffolds.fasta_scaffold02451_1 # 3 # 1031 # -1 # ID=2451_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.574
686 ----------------------------------------miGSRALA--------ALFPHPKTFMDTKRPVADTHIHLWDPGyltypwlETVpaiagph----G-PAELQVQEPETDRFRL------------------------------------
687 >SaaInlV_200m_DNA_2_1039689.scaffolds.fasta_scaffold02144_7 # 4497 # 5432 # 1 # ID=2144_7;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=11-12bp;gc_cont=0.499
688 -----------------------------------------------------------LQCGVATVRSVIDSHVHFWQPQrlrylwlDEVpair----H-PFTPHELNQATQAIDL------------------------------------
689 >SRR6266704_3508957
690 --------TITRAEFCAGRSNRgskQAFACECYATLIRLHPEVKPLFTHTS-------MEKQAKKFMASLTLVLHVLGKPDVLTTTLQRLgrrHQ-TMGVRVEHYPMVAEALLATLKSGYAVVLLT----LFVQSYMFL---VRKGA-
691 >SRR6478736_5796684
692 -------------------------------FMMGV---IASGMVVTGA-----ERRGRPKAVQPGNREWITVIQAINAEGQA--------------------------IP-PFIIGAGQYHLANWYRDSNLPGNWAIA---
693 >ERR1711935_979896
694 -------YSEVMNSWQRVRRvkdFDKTLGVLVFSKFFSKHPDATKIFGIEEEgeelVDTSASFVPQATKFVGLCDNFIDMLGPdsdlLKDILAEEGRKH-ARRGVELYHYPAIGEALISGIRAM--DvKFNDDTELCWRKVYCGVTHDLGKAV-
695 >ERR1712137_931585
696 --------------------------------------MGTSLLGVDCEgeefVKT-DSFVPQAKKFIGLCDSFIDMLGPdaelMAKILEAEGRKH-EKLGIKLEHYSTMGEALISGVKTL--DeKFNDETELCWKLVYCGVTNNLGKAN-
697 >tr|A0A210PV81|A0A210PV81_MIZYE Globin OS=Mizuhopecten yessoensis GN=KP79_PYT16126 PE=3 SV=1
698 -GLTERELKMIKVSWDVLAEDKKSNGVKFFMTLFTIFPTSKDLFKHFkDVPldqlkydgettKSNKKMVAHAMSVMYALESYVDSLDDAYcleELVKKVAISHK-PRGIGPDKFKLLTPVLHAVIEDLVKDDDSvdlETIKSGWTKLIDTVCDIVEK---
699 >tr|A0A226E0J1|A0A226E0J1_FOLCA Hemocyanin OS=Folsomia candida GN=Fcan01_14017 PE=3 SV=1
700 VQLTPDEMIAIKRNWEVIHQDLTGNGMDMYLHWFAAFPHMQKVFKKFaQVPrdqlKTNDAFKAQATVTLHWIDDMIEAIDSPSDMAavmKRLGRMHQ-TRHTNIYDFREMVKRIQEVIGTKVGEGYTPAAESGWTKLFAKLVENIGD---
701 >SRR5947199_2475351
702 ----------------------DELARAVR---lQ--gSRRIMEEHAcG--------AEGRQLARLFDERGRLARAP---RAVDEPGLELgarvsdgrcglakigdvverivqaedvdavRR-AGGDELADEVIVS-------------rtRADDEtseqrepayrigprtqCSDAFRRGLERPAGAPVQT--
703 >SRR6266516_4891354
704 -----------------------------------------------------------------------------GLGDGGRAEGgnrDS-GRGEQLEHLGCVHDVLLSFSESTVSTlphqaarpapaaegagpAITRRetadrapprrhrvggfLRSAGAARARSSIDRMTET--
705 >SRR6266508_4596506
706 ------------SAFVRL-t-DARRVARCLPSAH---pGDETPSTFPs---------ETGDPVNLN--------------LEALETSFDLvapRG-DG-SEATEDDVVGHPGPPA--QVA-PRPRGDRPQAA----------------
707 >tr|A0A1Q9CVT6|A0A1Q9CVT6_SYMMI Eukaryotic peptide chain release factor GTP-binding subunit OS=Symbiodinium microadriaticum GN=SUP35 PE=3 SV=1
708 -VPSSGTISTVQQSWMVVKELgVANIGEIMYKHLFKIAPVTKSLFPVSvRKRyrdwscseeevedgfENSPALRNLFAKVVEAVGSAVAGLHNISRLVAELNALgmrHI-NYNMKEEFFEYGGQALVLTLQDGLGTSLTEDVKQAWVAVYEFISACIISGLR
709 >ERR1719433_537024
710 -ALRISIVGREKRA-NCTVTLgRVEQGELQVGATVLLVPPGAECGVQSvEVDgrevrsaqagefvcmRLLgcQP---SVGHALSSVD---GPLRSATKLKVRSAQAgefV------------------------------------------------------
711 >ERR1719161_1849694
712 -ALRVMVLGMTADKVG-AALEgHVEQGTLRAGTRCLAAlsEGQAECNVQIvLLNgvevshagpgehvrlKVTgaAAKGFTAGQVLSCIS---NPVRAIGKFKAKLRLMslpEM-LS----------CSLLVL----------------------------------
713 >ERR1719271_149007
714 --VSARERRLIERTWEKAKEDgCDALGANLLQTLLVAEPQVMQLFPFKDEenVYESLRFKAHASKLAVIIDAAVSLLANPVKLEsllISVATSYEYsFKQMLPEHFPLLGEALIRTLTSIVGgTKFTWQAESAWRKVWTIISTVMIGAI-
715 >DEB0MinimDraft_4_1074332.scaffolds.fasta_scaffold429043_1 # 3 # 377 # 1 # ID=429043_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.227
716 -------LELIQQTWEKVKPHGKEWGPKFYNNMWTKYPEVRAQFFP--E----SKPEIQGPRLYASLNFMIKNATDIETLKqycFNMGDRHK-KYHCAAEHFKVVGDAFIMTLTEFLGDEFTPEIKQQFQLLYDTVAEMTI----
717 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold5203666_1 # 3 # 269 # -1 # ID=5203666_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.315
718 ------------------------------------------------------DFESQGRALTRMLAWIIQNMSNVSQLVPVLAQMggrHE-IYGVKDADFGTFATTVANSFRSVLGPEIiDDDAHQAWESCISGIGGLMQL---
719 >SRR5438477_4839339
720 -------------------------------------HGIEP-IPH--------RYAAIRRVVSGRE--------------AQARRVgqrHH-AAREDQRR-------LRGL----ERRRG-RPPARHVRL---------AA---
721 >UPI0003969FE8 status=active
722 -----RPFEAA-----------------DRELLFGRAQDIRAVVEQ--------LRTDPLVLVTGDSGVGKSSLCRAGVLPQIREGAlndVR-RWSVAV---LSPGRWLLDTLGDA----LA-----------------------
723 >OM-RGC.v1.018126893 TARA_122_DCM_0.45-0.8_C18859060_1_gene481717 COG0677 K02474
724 -----SELW-------RGRPRKTSLPAgssiRTRTAvlvplgrgketapssssanfvlnLTDVPPEAQELRiTA--------EVDDQRIHFQRRVPADVD-----KVVMELPEGSlarKV-R--VEVAAFD---------------------------RR-CS-IAAFRA---
725 >ERR1719491_698649
726 ----------------KLRAsedvsiSLIIFFSGSSSRFFKQQPDASSVFG-FDNNneniHKTPKFIDFANHFVEVIDQAVQMLGPdlelLTDFFVDLGDKHSKEYGIKPKFYPILGRVLMEQLEEMLGHNvFTVHTKVCWLQVYEAFARDMTST--
727 >tr|A0A147B4Z8|A0A147B4Z8_FUNHE Neuroglobin (Fragment) OS=Fundulus heteroclitus OX=8078 PE=3 SV=1
728 -ELSVKDKELIRGSWESLGKNKVPHGVIMFSRLFELDPALLSLFHYStkcDSKqdcLSSPEFLDHVTKVMLVIDAAVSHLDDLHSleeFLLNLGRKHQ-AVGVSTQSFTEVGESLLYMLQCSLGQAYTAPLRQAWLNMYSIVVAVMSRGW-
729 >ERR1740115_393061
730 NLLTPETVRVVKETSPRIASMAPALSSSFFKRFLS-HPDLAAYKASRH------NGEAKAAAVAAAVTGIGDSIDNLRSLsgaITAISHRHV-ALSVEPDLYPIAHQSMMEALEETLGEEATPELKEAWDEAIMVLADICVD---
731 >ERR1719469_1495088
732 NLLTPETVRVVKETSPRIASMAPALSSSFFKRFLS-HPDLAAYKASRH------NGEAKVPLTHTPP-------------FLSLPHPHS-SLPLPSSPFL-------SL---------------------------------
733 >sp|Q5KSB7|GLBB1_OLIMA Extracellular giant hemoglobin major globin subunit B1 OS=Oligobrachia mashikoi OX=55676 GN=ghbB1 PE=1 SV=1
734 ---SRGDAEVVISEWDQVFNAAmagsseSAVGVAIFDAFFASSGVSPSMFP--GGgDSNNPEFLAQVSRVVSGADIAINSLTNRATCDSLLSHLnaqHRAISGVTGAAVTHLSQAISSVVAQVL-PSAHI---DAWEYCMAYIAAGIGAG--
735 >ERR1719246_379870
736 ---TEKIKDDVQKSWDRILEVGiLYAGEVLYKKLFEIAPVAEEHLPPHIIAkyqqssfdageedqefVRNATLAKMFSKIFNAVGCAITGLHDLGKLVPMLLSLgarMG-GYWDSckydvaGNPWRYVFARCRASLDDGVRLhIVHHDTGFARGQGSCRVSX-------
737 >SRR5687767_4837246
738 ----EKQVLLVKHSWSYQAGQLENLGTLFTKKLVALNPGLKAPMKR--------SLAETGSySLMVAMNQIVAALPDLHKAQNHIQVIvteYA-ALGITRSDYENALIAFLLALEKRLGKSWSDEIREAWIFIFSSLYH-------
739 >SRR5215212_6395769
740 ---------------------------------ASLSPELKPLLKK--------LDQEKRLpHLFITVNDIVASIPDFKRSEKQALALiadYA-DKSISLSVYESALIAFLMALEKKLGKHWSSEMREAWILVFASLRQ-------
741 >ERR1711963_100213
742 -SLSEGTVEVLKACHPLLKDVRRVIGKAFYNRLFKEYPQVKPLFSQSD-----AARTHQTLALADALIAFTGRQLLEG-F-EAKQRGqeRS-LRLRSLQAGSWQGLWRLPSRDRGERD---QNEGSQIKPQILTIQ---QDI--
743 >ERR550517_4578
744 -KFDPDELIALRLSWHAwVAGDLSGKGFDLFAKMFEQRKETKEVFAFAKgtDarqMQNSSKVLFHVSRVMKYIDDTVKHADRLQDVVATLRQIggrHGhNGYDVASAYFPYLGNALRTLIKANYKG-WDSKLEDIWTRLWGFITAQMMH---
745 >tr|A0A0L0FER9|A0A0L0FER9_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12208 PE=3 SV=1
746 MSLTPRQCEMIKSSWKEASQGgkptefrALRFVMDFYSHLFDLAPSTKSMFKG--------GMANQGKALVGMLDIVVNHIDSLATikgDVELLGQRHA-KYGVTSNMYVTAGRALVMALAPRIPDDeDKPECASAWMDAYSFLASIMCN---
747 >tr|A0A1X7UGV4|A0A1X7UGV4_AMPQE Uncharacterized protein OS=Amphimedon queenslandica PE=3 SV=1
748 MSLTSAQVALIESTWKVVKKDLQGAGNIMFLKLFQIDVSVRDKFPFRDVPyeelEDSESFLKHSLQVMETIDLAITLLlGGEMEkLVEalvDLGMAHA-MQGLKPEDFDHVGEALVHALGVALGKEFNDEAKKAWTLLYSVVTAKMKEGL-
749 >ERR1712080_154454
750 ----DLQKIIVKHQWARSYNEgmsREYFGQAIWRAFFKLDPGARRFFTRVrGDDISHPKFQAHSLRILGGIDMCLSLIDDVPTFEAQMKHLqgqHI-EREVPSYYFDRLGTVLQEVMRAATGYCYDE---VAWGACYKYISDRIKANY-
751 >SRR5476649_891947
752 --------------------------------------------------------ATSTRCCS--ATSRKCCRCSIKPTRPTASSsarwptpcWltqEI-SIawNnWARWHRPSStSMCRCKSsgNTIPWSAPrCSRRYVKCWAPRWRPmpsstpgpprtvsWRTCWPV---
753 >SRR6188768_2515855
754 -XMDSGQTALLKASFQRLSTVSELGAELFAGRLYLLDPPLWHHLGLG--------GRSAQHALLRMLARVIEDLDRFEELASTLEAVarrCA-SEGMDAAQFDTIAETLFWTLQQVLGDTYQAPIAAAWREAGGLLIGRMKA---
755 >APLak6261669570_1056073.scaffolds.fasta_scaffold275140_1 # 52 # 198 # 1 # ID=275140_1;partial=01;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.524
756 --WSTRRVKVVQRSWETFKStqaESTTVGLAVFKRFLRRSPAFLQLFPFRDQPLetlfLNAKVRLHCKLFADTVSRTVGLLGDSVAVKASLRELgarHSDLYKVRSGHYAAMGSALLEVLEHNLGESWDEETKTAWEETWAYITEQMQK---
757 >tr|A0A0Q5LAI2|A0A0Q5LAI2_9MICO Uncharacterized protein OS=Frigoribacterium sp. Leaf164 OX=1736282 GN=ASF82_14980 PE=4 SV=1
758 -VITSSHLTALRSTLPLVEARAAAIADDFYARLFADRPDLLrDQFNRGDQ-----AQGRQQRELALTIVTVARDVVgtqvgsgpagsatgpavpvapwsspapspwavrvAARETLSRLAQRHA-AIGVTRDEHDVFERHLRDAFAAALGDDWSGVVVDAWLALWRQTRDELVA---
759 >ERR1719383_514948
760 ----------------------------------------------RGRLvegrwRFDSARVKSCVddrqGCVETWQHGRRR--------SNAPQVgnhAR-GLRCAQAHYDVVGQALVTTLASY--CTFTDPVKNAWIKLCGVIKATMVH---
761 >ERR1719284_1849230
762 -PLDGRDVALIQHSWKEVGQaPADEVAREIFRNIFAIEPGALELFPFKNESedglwREGGERDFSKYFRHRAWCSGAVSFQKX-----------------------------------------------------------------
763 >SRR5204863_5655766
764 -IMTPEAIGLIKSSYAGVTAIPRQLAARFYHELFTVAPNLRPLFPG-D-------LTNLQGHFEAALALVVRNLDEVEVLRPALRDLgaqHV-HWGARPEDYETARDALVAAIGALS-ANWDETLARDWRRAVTAIIVPMIEG--
765 >tr|F2Q9X2|F2Q9X2_BRAFL Globin OS=Branchiostoma floridae GN=lGb7 PE=2 SV=1
766 MSLSAADKKLVQESWDKVSKpSFADAGERVFLKLFRRNESTKAHFKKFkDIPsdqlAGQAVVRDHGEKVCKVLDDFIKGLDGSgDEAVKKVGRMHK-GLGMSNEQIDQMKGAIIEVLADAgFGD---ANYKGAWGKLWDRFMAVHR----
767 >SRR5580698_8666230
768 ---PDLEKMAARSPWLTVTA-----------------------------------------------------------------SLsaePV-SLGHGPRTEHgtvADVLARLGTWREHD--------------AYVCGSSAMVAA--
769 >SRR5919204_299658
770 ---------------------------------------------------------------------SDL-RSGPTSRCTHVRC--R-QQRSPPRHHRClRPRSPAPSWSARlsagfrssscrpstnRPARRRGRGRSTILASYTRLASVMLDG--
771 >SRR5262245_42249746
772 -AMTPEQIDLVQRNLPAVLSLQNRGP-RFHDHFVAVEPTRQFLFAGAD-------MGRQGAVLIDAIAVAIAASRsrEQ-DLSGALCQFHL-SYGVDAQRFQSAGKALVRMLEEEFGDRYFTQLGDAWIAACERVGQTIL----
773 >SRR3954454_16888348
774 VISRSAVIRHVLPTP----aepAAVDHIGQQVADRTSQQDRGERVLLNRT--------aHGLR--ALADGAARLRIAAQS-vadVTRTPLVGVLrqlRS-ALGDVSHRLCGLSDHAEAllgAIKDVLGDAATDEILAAWGEAYWLLADVliar------
775 >SRR3954471_17335278
776 VISRSAVIRHVLPTP----aepAAVDQIGQQVADRASDKDGGERVLLNRT--------aHGLR--ALADGAARLRIAIQS-iadVMRTPRVGVLgqlGG-ALGDVPHCLSGLSDDALGccaTCGCYLCR--------SRGGASWSFFCHaalr------
777 >SRR5215204_1408335
778 ATGGPTRWATMRGRWPLMS---------MLESIAQSG-SGRPVWYVHGA----RDrrahaMGDHARALAADEHAGK------------HRAVrqrT-------------------------------AG---------------------
779 >ERR1719446_1443192
780 ------------------------------------------------------------------------LAQDLSALCPE---Cgfk------VG--TMGVC---QTK------ANDAAIE-----------AKDPPVAT--
781 >sp|P02214|GLB_BUSCA Globin OS=Busycotypus canaliculatus PE=1 SV=1
782 -GLDGAQKTALKESWKVLGADGPtmmKNGSLLFGLLFKTYPDTKKHFKHFDDaTfaamDTTGVGKAHGVAVFSGLGSMICSIDDDDcvbGLAKKLSRNHL-ARGVSAADFKLLEAVFKZFLDEATQRKATDAQKDADGALLTMLIK-------
783 >SRR5690242_2028058
784 -------LALLLQSYGRIGILIPKISENFYRRLFQLRPNLAALFANR----------DADLKVEEMLRRIVAHASDAAAAKAEVQssgRSHA-QWPLLPEDYRVAGECLIQAIIEAEGAATGSVVASIWRQAYVEVANLMIC---
785 >tr|A0A2T7P4S4|A0A2T7P4S4_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_10993 PE=3 SV=1
786 -VLTVQQKDMVQRSWATVMRrDLTAVGMLLFKNLFQQEPRIMTLFSLEasDDedLEQNLRLRLHAARFMQAVGAVIDNLQTPndklSALLSDIGERHSHLHSFHHEYFRAFREAFLTTLEHSLGKDrFKGELRAAWDSVIGFMTREMNHGHK
787 >SRR5581483_4049588
788 ---------------MRIAPHKEEFAATFYQALLEKYPHLSQFFVGVD-------LKRQQTSLIATLRAMLNESERGEalrMMFRKIGQKHA-DQQIRAEHYPAFGQTLLDTLALYD-PQWTDDLRKGWATALEQSVRIMMESYH
789 >SRR5690625_2040278
790 --------------------DRDGFGARFTEELLSRYTEIREALPD--------EPAWVARAVTAVTDALIDVADDPGALVTVLERLgvdNR-TVGVHSAHYAPIGHALILAARAVGGTAWTPDIERAWVDGFDVAAEVMVT---
791 >tr|A0A0Q9HRJ4|A0A0Q9HRJ4_9BRAD Uncharacterized protein OS=Bosea sp. Root381 GN=ASE63_23130 PE=4 SV=1
792 --MGDRAISLALASLETMGSEAEQADIMFNIRLLETYPDVYRVFCM-D-------FAPEERSFLRALAFILAHAGPFGAIGPTVRALapsDK-VCRLISSRYHELEETLMWTLRRRLGVAFTAEVENAWRSVLREAPGVS-----
793 >SRR4051812_34838903
794 ------------------KPIRNRAIKLFFSRLIESHPSLLTVIGD-D-------YEAKARSLRPAVEMIIGCLGNMEALRPILRSMarsNA-ELGMQEHHYLTAVNTILWTMERCLGSAYSAEVDAAWEDVCWQVCEAM-----
795 >ERR1712110_1394717
796 -ILSKEETTLISASWDLVATDIPGNGSKFFTFLFDIHPDVRdKYFQPLLQSSTdvQRTLEKHGAKVVNAIGSLVTALNTedDGKLVTIIRQIthnHW-NRAItNSAPYQLVLDALLEFLAVALGSQLSPAGGAAWKKLFDAFVVVV-----
797 >ERR1711953_1620069
798 -------------TWAIVKLNMDKHGYKFFIRLFLDHPRIQtKHFSSISTSA--QSLTAHGLRFMMGIDSIIRFLELedEEGLRKRIQQIvtvHF-FKGItDPLDFEVLCNCLVDYLSTEVfGDHQL-----------------------
799 >ERR1719210_139600
800 ---------------------------------FTLL-----DPPGQkRNvaqawSavvqadvaiLVVSANPGEFEAGLAK-------------------------GGQTREHAVLAKSAGVENLVVAVNKMDSVDGEGKWSNLryee------I------
801 >ERR1719428_2447797
802 ------------------------------------L-----DAPGLgAYvpavwVaatqadiavLVISAKAGEFEAGISK-------------------------GGVTQEHALLAFSAGVTSIVVAVNKMD--DASVTWGEPrfkt------I------
803 >ERR1740121_1193106
804 --LSESERDALQQSWVQVQKVgFDCVGEVFSQKLFELAPSTHARAG------------MEWGPVVKGIGHTVDYLSRLEAVAvryRRLGVLHR-CIGVTERELKEMGDAFILTLRDVLGK--------------------------
805 >tr|A0A158NI97|A0A158NI97_ATTCE Uncharacterized protein OS=Atta cephalotes GN=105620364 PE=4 SV=1
806 ------------------------------------------------------------MNIT--NGTIHDILSGGK-NtqkV--FL--FR-HRGRTKEVVEKEEKIRVAGLDtngshradCPKGTDEGREIGDPVTDSLLQMLQKKE----
807 >SRR5260221_159328
808 ------ALGLVREGFAAVIARPDVFVSELYQDFFTSNPRYRKYFGSADIGySgsadingtGSpeighaaADITRRNAKTVEAATRIVADLDRPGVLLPYLRKLaleYR-KYGVREAHYRAFAGSVMTALERTIGQAWTYEAAEAWVDELTMVASAMLG---
809 >ERR1711862_565156
810 ----------------------------------------KIMFHFPvnmNIetVLKSKIFLQHAKFFVKTLDITIGLLGpdtdIIQDVLLEHSKTYQ-NHGVNSAMYLHMGESILYALEKDLGDvNFTSKDREAWAYFYGTIVGVIVGG--
811 >OM-RGC.v1.029911412 TARA_036_DCM_0.22-1.6_scaffold294997_1_gene285712 COG0526 K03671
812 ---------------DRLRARGEPPSGNPYRGAAPYGPGDEALFFG--------RRAE--------LEVLIDRVQkTPFVLVAGDAGVgktS------------LCSAGLLPLVREgalGGPRHWACESIACGEEPLAALAAVLAR---
813 >tr|A0A2S2QIF8|A0A2S2QIF8_9HEMI Globin OS=Sipha flava OX=143950 GN=GLB PE=3 SV=1
814 MALSPVQISRIRRSWSALAQDPTELASALVIRMFKENPEYISLFKRLkGLsideLQSNSQFKAHASKVGGALGATIDHLDKPEKLeelLTDIGIKHR-KYGLSPKHFEVIRNVLIAIIAEAIGD-TDPELLDLWKSSLTGVMSII-----
815 >SRR3546814_18929724
816 -----------------------AITNAVYARLFQNKe--IEASFDRAAQ-----TSGEQTKRPSAENLAYAKNIDKLHNLGSAVSHMvarHM-QTVVRPHQYPHGPTALQHSNSAVPGQQMgTNTDPTP-----------------
817 >tr|T0QF73|T0QF73_9STRA Uncharacterized protein OS=Saprolegnia diclina VS20 GN=SDRG_06019 PE=3 SV=1
818 --ISKDVQALVLANWAAISSGSTPallkikpaspvvyFYDYFYGMIFEKAPAVKPLFRS--------SIIVQGKALINIIQSITSAVNAPNviEKVCDLAYRHN-KYGVKIEYFNLLGKCLLLAMHDCTGDTFTDELREAWRAAYAYMVMVMT----
819 >ERR1719402_1510571
820 -AISSITKSRSMYLWSIllnrkqhLEAFSVDNGWAAFVVFLLGDPHLLEG----------------GEGS---QDGSSNPYGVFPLRwsnDLHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQASFNNLLQFLVGNMKVGL-
821 >ERR1719187_1205752
822 -VLEDAEVEGVQTLWAEVSGDLAQFGARVFGRLVRDQPTIRKYFPWGrnDKTeeqlVDAPDTQKHAEEVFGALGKIIGAADHLNDYrsfLVYKGMQHI-PRGVKPEHFVYLKAALVDTLKEELGDKVTPAGEEGLNKVYSFVEKAMSKGL-
823 >ERR1719369_91055
824 -SIKRQFHLHSEAGWAKFAEDVAGNGAATFITLVHDHPEIRSVFPWGgkSY-lsVDDPDIRHHAELVFNGLGVAFNRIGHIHSLdgyYESLGLRHI-ARKVEMSFFDYVGDALSQTFQQILGGGYTADFKSGYSKVYAYVTQHMTAGL-
825 >ERR550519_2895140
826 --LSKAERKEAENAWRIFEVNLVDNGVDAFLNLVRDHPNRKDAFPWVkpELSeealRNDPEMKKLAKLVFSAVKPAFKSLGDLQSLtnyYLNIGNELS-LMNIPPVMVSYLSDAFKKTCQKLLGSDYTHSLEASIEYVYDFITSRMFEGM-
827 >ERR1719150_2276450
828 MGLTKAQVAAIQNNWATVSQNMQDVGDALFMRYLTANPGDLSFFPKFqGAGvgpqlHSNEDFQHQTLTVMQFLGQIVAHLGDIPAAEGMLRERvktHH-PRGISMAQFERLLDLVPRLVQEICGA--SGPTADAWRVAVATLMPSMRDEF-
829 >tr|D3DIC1|D3DIC1_HYDTT Bacterial hemoglobin OS=Hydrogenobacter thermophilus (strain DSM 6534 / IAM 12695 / TK-6) GN=hmp PE=3 SV=1
830 --MSPEARLNIIKSIPFLQSYGERLTSRMYEILFEGNPELKSMFESD-----------DSTKLAGALLAFAQNLERLNVLEPAlnkMALSHV-EAGVKPEHYEKVWDALYKAMTEFG---ISNEIIEAWKEAYYFLAELLIKK--
831 >SRR3990170_2029843
832 -----------------------------SPCTTTRSPCWTRPCAS--------WAT-----------APTGSWAtstpPSSSRLPSCAR-csRR-RWTCSATG----CSRRSPAPRHYAEDVWVPELEDAWLRAYAAMSTTMIEG--
833 >ERR1719356_276690
834 -SLTEAELELIETVWAKAKAlVAEEFGMRLYRQVFDIAPEALQLFSFRDDSdpYESAEFKRQGQIVIAAFGKAVAVLRDPEALAPALDSLGDalaiSTDKVMLPHDRSVGKALLRTLRLELKDEFTLEAEKAWAKFWRILARTVQ----
835 >SRR4051812_22538299
836 --INADTAVLIESGWNAAIDANGDFAANFYQNLFAAAPVVIELFSG-D-------MTEQKGRLTHTLAETVALLHNPEHLLLLLRASgvrHH-HYQVKQAYFGVMRNILIDTIAVRAGELFTAVHRQAWEGFFDNMATIMQGG--
837 >ERR1740128_83505
838 -GLSQREKQDIRHVWSLVSQDLESAGMGFFLAYFKAHPEYQSKFKAFaKvpmdELKDNRSFQMHAMNVMNAITLIVDTLENPEELVSGLKEMgvnHR-KRRIEAIHFHTWRRCCWPSCRVPWVRLSLNRPrrvgAKRWVSSSAPSWRR------
839 >tr|A0A1E4GLJ3|A0A1E4GLJ3_9CAUL Uncharacterized protein OS=Phenylobacterium sp. SCN 70-31 OX=1660129 GN=ABS78_22870 PE=4 SV=1
840 MATAFARAADIEASLELLAERDIDPTARVYQRMFELHPQMEPYFWR-DTD---GK--IRGE----MLSLAFAAILDFVGErryADhmIGTEMinHE-GYDVPRDVFATFFAIVRDALRDLLGADWTPVFESAWEEMLAEIESYARQ---
841 >tr|A0A2A5EUW5|A0A2A5EUW5_9RHIZ Globin OS=Rhodobiaceae bacterium OX=2026785 GN=COA62_02605 PE=4 SV=1
842 TQACTAASDPIVASLELVVDKCGDPTELVYKRLFAQHPDMKPLFLL-DKD---NS--VKGN----MLSQVLECFMDFTGKqhyAAnlIACERvnHE-MIGVPPEVFTTFFTTVVDTFKDILQDDWTPVYDAAWSDLVNDLTVSVDE---
843 >ERR1711860_359782
844 ---LFSKSNYVFAS-----------LSRNTFKLFKDERSLYeKHFSSFDVN-DILRIRAHGLKVMKAVNSMVEAVSDENdeSLIDQIHFvahGHH-LRGITpRNEFEVRRKILNLDYHLLFHyllkkGCLSQSX--------------------
845 >SRR5256885_15743076
846 ------------------------------------------------------------KARMQPIATSDDALDRPAATVPALHARgtrTG-ANGVVDQHAETVGEALLWTHSKGSGRSPGaqgasPTIQHRDVHAMGVLTPTFRER--
847 >ERR1719329_2070839
848 -AMSDETVATVDATAATVAPHALDITKDFYAEMIESFPSVvLALFNPPHWR---RR-cARTPPTsRTCHHCSCLAAPSTPSITDTarsR-SFRHT-TRWCTTTSCGRWQRC---------SDqSWAARCPTPWST--------------
849 >SRR5256885_864722
850 -VLTDRQRAIVQSTVPLLETGGEALITHFYQTMLGEYPEVRALFSMAHQQ------sGAQPRALAYSVLMYAKHIDRLEALGDlpaQIDRKST-RLNSSHLVISYAVFCLKKKKRTGSDS--------FTRSE-----RLVV----
851 >SRR5256885_6575144
852 ------------------------------------------------------------------XMVMSMRGPALEAAGTtgcRSCSAAV-CCSFF--------FQAEDGIRDYkvtgvqTCAlP---------------ISDILIGA--
853 >tr|T2IER8|T2IER8_CROWT Uncharacterized protein OS=Crocosphaera watsonii WH 8502 GN=CWATWH8502_4740 PE=4 SV=1
854 ----------------------------MYEIAFNERPEYRRFFKNTHMK-SPEEGRKQAAKLAASVYAYASHIDELWTLNKKTIvsvNFTL-NI------SPELK---------------------------------------
855 >SRR5690625_6805322
856 -------------RSPSHSQtltLSPYTTLFRSRNLLRNHPELKNYFNTANQ-----VNGFQPRALASIILQFAKNINHIyeiVPKLERVCQKHC-SLGVQPRSEEHTSELQ------SRGHTVCRLL--------------------
857 >tr|A0A244CWV0|A0A244CWV0_9GAMM Diguanylate cyclase OS=Pseudoalteromonas ulvae OX=107327 GN=B1199_05805 PE=4 SV=1
858 ----------------------------------------------MET-------VNSKAKVLNKLLIA-------TSVVLISFIvslQLA-GVEMGQSSIIAILVFGIASIG---AMAF-------LYKAVEQIADKLNVIEE
859 >tr|A0A0L0EW98|A0A0L0EW98_9GAMM Chemotaxis protein OS=Pseudoalteromonas rubra OX=43658 GN=AC626_03140 PE=4 SV=1
860 ----------------------------------------------MNS-------QSIQSSLNNKIIIA-------GVILVISIVvgiQLG-ASGAENMQLVAVALPLFGVVV---ALGY-------LKMALSAVSAQLGCVYR
861 >SRR5688500_16794215
862 -----YDARVLRGSFAQLRPRIAQYSPVFYEHFWRDYPETRPLFGR-NMSKP-----ELDTRINHFMLWVTENADRPHFTIDYiqsVARRHV-GYRIRRRHFAYVDNTNIKTLRELLGDSFTPEVERHWRASFRFLTLLM-----
863 >ERR1719193_2756600
864 -----------------------------FM--EKKVPSVIV------FlnslsLDDDGALETHALSVMNSVNKVVSRLDQPDRLVQLLHDLgrkHI-SYKANMAFLEPIAKHFILTIKPSVA-EWSPEIEDAWQQAFKVIGHIMQE---
865 >ERR1712080_794265
866 ---------ASHVIPGESHGKHQSQRWIVFEKLITDGPEFKAIFGF-PGKRDDPAAQALGSKVLTKVAEAVGCIDDQAKFSSILHaegVRHK-GRKTEAAHFSKLGPAIIYMLGEVG---VAADAQAAWGVAFGLISGEMIKGL-
867 >tr|A0A1E3PUG6|A0A1E3PUG6_LIPST Uncharacterized protein OS=Lipomyces starkeyi NRRL Y-11557 OX=675824 GN=LIPSTDRAFT_199892 PE=3 SV=1
868 -HLTPEDAIAVKESWKETIGLSpantvatssgspaSLFCNQFYQKLFAVRPDLEFMFPDI---------GRQSAAISGLFQVAlamLESIDALDDILLRMGRRHAFVMGIEPEHFELLGEVFIQTMRDRLGERFTPQIETTWVKIYSYLASKMIA---
869 >SRR3989338_7687732
870 --------PLVQATWKQAMDLgdgDKGFGRNFYKNLFTKHPGLLeTLFKGV-------SIANQEKNLPKSITAVLGLLTDMPKAVDALQQLgmrHI-LYGTPDAGYPIVGANVIYTLEMILGSDFTPEAKARWGEIYGVIQTTMIDA--
871 >tr|A0A1C7N598|A0A1C7N598_9FUNG Uncharacterized protein OS=Choanephora cucurbitarum OX=101091 GN=A0J61_09444 PE=3 SV=1
872 -PPTQSQIDIVRFTWGHITDTrlpsdkpeispSHAFGLTFYDTIFHIDPDFKKLFPNIiQQakalggmiSylvkspeiisSPSSDdstlhtqvstirqINASKRKRstASTFSELVletaaDdTLghlpdSDVDHFACKLQQLgsrHY-RYGTQIDHFSLFGHAILKSIQARLGKDCLPEVLKAWTRVYSFTMFHMQA---
873 >SRR6185437_15632065
874 ----ADDVAIVRDSYGRIGPRGAALTIAFFGLLSDRVPRVRKFFPP--------DDKDKRAVAKDLFDLVVGHLESQLNVRWVLERMgrrGL-LDTITPSDVSAVGGCLLDALAELDE-AWSPATERAWSRVYDWAASAVV----
875 >SRR5678815_1770797
876 -------GARVLASYRRIGSRASAAALAFFVAVQRGSPRVRRVFKH--------DDVDQRTLAKEVFDVVVGHLESPRELRSLLERMgrrGL-VDTVSAGDIDAIGATLVGTLRDFDE-GWSSDVEQAWNAVWTVSYTHLT----
877 >SRR3984885_15745818
878 ---------------------ASRAtgGGWLPTRSPTGRSARTSR------------TGCRRGRCDGNTRPTV--GG-PAALGGGQCEDsarDG-KLGLSADHADSAGAGRVdlAAVRHPGGAGV------------------------
879 >SRR5262245_20097952
880 -EVTPQQIELLEQTLSELRRQSVFAAQLFYCRLFSLRPRLRRLLSGR--------PDFHGTRLLSVMSAAVAGLSDPGHFAGLLSLAarpavRE-AL-LQGDCVRVIGDAVHWMLERHFGGQITVEVREAWRAAHIRITQVIE----
881 >ERR1700722_6370008
882 ---------------RGIRPHCPavrqhLPCVLPPH--VRAGSVASHAIPQ-L-------SAPLTATLTAALEALVGALGDLQPVlvrAPALGLRLA-SYGLQPTDISIAASAFLATLDDELDEVSTNAARAAWGCVFWTVAL-------
883 >SRR5581483_4578849
884 -----LQIALLEESFELIAGQSVELADRTLSRLIELDPQFRLLAARTE-------MAALRSVLFSVLYVLRRSLHNLNTLAPALETLgalRK-DQELSSEHFGTIGIALLDAMAEVGG---------------------------
885 >tr|Q17156|Q17156_9BIVA Beta chain of the tetrameric hemoglobin (Intracellular) OS=Barbatia lima PE=2 SV=1
886 ---SEKIKEDLRLTWGILSNELEDTGVTLMLTLFKMEPGSKARFGRFgNIDSgmGRDKLRGHSITLMYALQNFMDSLDNTEKLrcvVDKFAVNHR-IRKISASEFGWIMKPIREVLMERMGQFYDPSFVDAWGKLIGVVQASLARE--
887 >SRR6266536_694904
888 ----------------------------------------GTRFAD--------SHRPPRTMERTGPLRDRLALRALRlgvgdvvwEDVPSLKRSmcg-----------AAAAGAAPVVAAVASAAPGDPQKHLKRADQVYAKSILLRMS---
889 >SRR5262249_10507301
890 ----------------------------------------------------------------------------NVKYSShhqQHGPQAR-GVRSTNLAFCCVWRRTEMG----------P-ATAVWSGVHCRDAAGMDGA--
891 >tr|W6FIG9|W6FIG9_9ECHI Hemoglobin OS=Ophiactis simplex OX=533354 GN=Hb_b PE=2 SV=1
892 MVVSAEQKALIQGAWTPIYAgNRFQLGVDIFAHFFKAHPNYANLFPSLvGVpnPSTSVELRGHAIRVLTGINYFVAALDEKKPvimeMIHNMARSHK-PRKLTREHFAQFAPVLFDTIG------VSGPARDAFLPYYNFIADNLFAE--
893 >ERR1041384_2362020
894 ----------------PLAPKANVLGERKvVAVLYSDLRGFGTL-----------SETGHAVDVLERLNDYFD------RMVAAITSHgg-------------------------------------------------------
895 >SRR5574337_1776253
896 --VGLDDRDALRVLHAAFVApvdgngAANGLTAAIFDRWFGTDPSVRDLFPP--------DLDAQRAAFGQAMSWVYGELiaqraQEPVSFLAQLGRDHR-KYGVTQQHYETLSQVLHATLRHRLADAWTGAVDAAARDSLKLIFGVMSG---
897 >SRR5271167_3167484
898 --VGLEDRDALRVLRDAFNQedpgASNELVRQLYAHWFALDTSVRDLFPP--------EMDSQRAAFAHALHWVYGELvaqraQEPVTFLAQLGKDHR-KYGVLPSHYDTLRRALHATLRTQLSDAWTDAVEDTACQSLNLITGVMSG---
899 >SRR5258707_573086
900 ---------------------------XMILKSFKPNAAIGC-KTI-P-------TW-----FVP-LPTFTAGLTLPKLYPLSVFGMRRyNLGGLGEPH--QVEAALLWLVEKQFEGVLTREMRQAWVQFCQWLVL-------
901 >tr|K0T9D6|K0T9D6_THAOC Uncharacterized protein OS=Thalassiosira oceanica GN=THAOC_11871 PE=4 SV=1
902 ---------------MEREDSSgSL--PSFVSETEIEPSDVQPaaasgenNVDKGR------RKTSSSSKRTPSITKRIESFSSFKSLSSSFS---------------SKLDDERNAGEAGQAERVEsttapESVASGETQGNAGGQHTLN----
903 >SoimicMinimDraft_5_1059733.scaffolds.fasta_scaffold33866_1 # 3 # 488 # 1 # ID=33866_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.741
904 ---------------------------------FELAPASAGLFPAQvrhkyrewttEEvhasdndVRNSPSMRRLFAKMLTVIGCAVASSQNLAALVPEVKSLgarHA-AYGVSEAHWERAADAVRAEPSRSYGGLEGERRRGPHMtrvtarTLTvIFGTMLLVAT--
905 >tr|A0A165S3D1|A0A165S3D1_9GAMM Chemotaxis protein OS=Halioglobus sp. HI00S01 GN=A3709_07715 PE=4 SV=1
906 ----MTAIMMIDRDFTVTYANEAT-----LQLLRDNQATLSSIYPGFN-------PDKLI--------------------------------GSCIDGFHKNPEHQRNILADPANLPWRTDIEVADLKFS-LNVTAIVDAQ-
907 >tr|A0A1I2IR29|A0A1I2IR29_9GAMM Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor (Fragment) OS=Fontimonas thermophila GN=SAMN04488120_104136
908 ----KGVIQYINRDFIEVS--------------------------GFS-------ESELI----GSPQNIVRHPDmPVEAFadfWAT----------------------------LKDGKPWTGLVKNRCKNGDHywvLANATPLRAN-
909 >CZCB01.1.fsa_nt_gi|955242656|emb|CZCB01016507.1|_3 # 1728 # 2327 # 1 # ID=16507_3;partial=01;start_type=ATG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.493
910 ----GVSSFEMNQQFSAQSSDSIEKNIAAISELWQKYMATnitdeekvladkfvatrgafvkealLPAVDALR-------ANdYEKAKLFSTKARDLYNVAHPALVeliQYQAGHAKL-EYDTSVESYKLTRNWTIASLFLAVGFLACFAYFImrSIANPLSvifRVLDNIKSN--
911 >tr|A0A1I5XDG1|A0A1I5XDG1_9PSED Globin OS=Pseudomonas borbori OX=289003 GN=SAMN05216190_1566 PE=3 SV=1
912 -----DDAALLEETLEMVSSRSEDLTPDVYARFFSRCPAASGLFTvIDpatPPM-------GCGQ----MLFEIISLLRDSAAgkpyvAsyMQQIATEHaA-FDVRDPALYREFMHSLADVQATLLGPDWSPAHAQAWDRQIAALLRHL-----
913 >tr|A0A1B0G6S0|A0A1B0G6S0_GLOMM Hemoglobin-like flavoprotein OS=Glossina morsitans morsitans PE=3 SV=1
914 STMNSDEVYEIKRTWEIPATTPTESGVAILIRFFTKYPSNLQKFSTFkDMTldelKNNPRFKAHANRIMKVFDDSIKTLDDncshLEEIWTKIAQSHF-NRQIEKQSFNELKEVILEVLVAACN--LNDQQTEIWLKLLDFVYEIIFKT--
915 >tr|A0A1J1IV29|A0A1J1IV29_9DIPT CLUMA_CG015163, isoform A OS=Clunio marinus GN=putative Globin CTT-Z PE=3 SV=1
916 HVLTPEEIVLVKDSWKIPSANAVDSAELIFYTFLSRYPEHQKRFVRFkDKPlnelKGSPFFRAHASRIYNVFDSVIDGIGKdpenkeVMSFIAESGIFHA-KKKVTKQAHAELRVVLVEILNDVCK--LDEKGNVAWSKLLDIFYHVMFEC--
917 >tr|Q7M422|Q7M422_9DIPT Hemoglobin V OS=Tokunagayusurika akamusi PE=1 SV=1
918 VGLSDSEEKLVRDAWAPIHGDLQGTANTVFYNYLKKYPSNQDKFETLkGHPldevKDTANFKLIAGRIFTIFDNCVKNVGNdkgFQKVIADMSGPHV-ARPITHGSYNDLRGVIYDSM----H--LDSTHGAAWNKMMDNFFYVFYEC--
919 >tr|A0A0G4EPR9|A0A0G4EPR9_VITBC Uncharacterized protein OS=Vitrella brassicaformis (strain CCMP3155) GN=Vbra_12573 PE=3 SV=1
920 ---SDKERgVLIDKTWGllKERYTLQEIGEELYDNVFKNAPDLRHLFKRPKEL----MALKFGEMISTIC-GLFQ--TDRESLLEtmrDLGIRHV-DYGSRPEYFPLFKACLLDTLENLLEDGeFTAATEASWNDMWDEASEMLISS--
921 >sp|P15447|GLB4_GLYDI Globin, monomeric component M-IV OS=Glycera dibranchiata PE=1 SV=2
922 MGLSAAQRQVVASTWKDIAGsdNGAGVGKECFTKFLSAHHDIAAVFGFSGA--SDPGVADLGAKVLAQIGVAVSHLGDEGKMVAEMkavGVRHK-GYGykhIKAEYFEPLGASLLSAMEHRIGGKMTAAAKDAWAAAYADISGALISGL-
923 >GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold789473_1 # 1 # 552 # -1 # ID=789473_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.562
924 -RIPPLKGSSLSAGWRTASSSGLS---------------------------------------RNPRGTVSR--------ESGNTVFqseTF-AGAASPRGGSLL-C--FT--GENEPMGMINNLKT------------------
925 >tr|A0A1G7K468|A0A1G7K468_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter baekdonensis OX=875171 GN=SAMN04488117_103319 PE=4 SV=1
926 -MLAVKQISLVRNDFRRLAPARPEMFKWFYDRLFEIAPHTRDLYSE--------SLTEESSRVNGLLEIAFLSLDHPQAMFATLHTLgrdFS-GFGIWETKLHLVVDLLVEVFAEFGGEDWGSELEKAWHSVLIFIAQGMKEG--
927 >tr|A0A291GF03|A0A291GF03_9RHOB Uncharacterized protein OS=Celeribacter ethanolicus OX=1758178 GN=CEW89_16165 PE=4 SV=1
928 -MPSARQIALVRNNFRALSPKRPDIFIPVYDRQVGEDPKAAAQYDG--------SLCQRARVLDGLIELALLSADHPTALFATLHKMgqdYA-HYGSWREKHPFLIGQIIKAFAEATDTHWTDELADAWEQFLYFMAEGMLEG--
929 >tr|Q86G74|Q86G74_PHAPT Hemoglobin II OS=Phacoides pectinatus OX=244486 PE=2 SV=1
930 TTLTNPQKAAIRSSWSKFMDNGVSNGQGFYMDLFKAHPETLTPFKSLfgGLTlaqlQDNPKMKAQSLVFCNGMSSFVDHLDDNDMLvvlIQKMAKLHN-NRGIRASDLRTAYDILIHYMEDHNH--MVGGAKDAWEVFVGFICKTLG----
931 >ERR1719468_599295
932 -ELNEKQIAVIKESWKVLTNEITEIGMLAFLHLFESTPDAQGSFKEFhSMTkdelKHSEIFRNHASRVTGVIKKVVEKIDEPETYLPHLHILgqkHV-MYEIDVNHIDQMGYMFLSGIKTALENknAWNDNARDAWESLLLMVIAEMKK---
933 >ERR1719329_2046659
934 ----------IKTVWAKIMKEVgtLNAGTMLFKNVFMLAPETKQLFPKFRHlkddlLLSNESFKNQAKLSISALSNAIMSFDDPPKLkrmLMDLGRIYE-SKGVSLATLPIVGNALMATIEAALGNDSCIETFNFFALFYNEGSNMLAEGYK
935 >ERR1711915_153481
936 LGLTKRQRFLLKGSWKGISREMQVTGVRVFIQMFQSRPETFQFFPQFqGLDgpeqqKRSEVFQEHSEKVISRIDEALASAENPEVLTGVLLQTgayHRKIDGFNPQLFLCIEEPFLESLSLTLDERYTPQMDSIYKIITKYIIQTVIDGYN
937 >ERR1719369_313705
938 TGLTKKQRFLLKSSWKGVSRDLEYTGVKWLVGVFSTQPHTQKYFTNFsSLSldgelQECTEFREMAEKVMERLDNALFHMEEPDTMRSILLETgayHRRIQGFREDMFKDSEAPLLQAIENTLDERYTKQMAEIYTVVVQFFIETIMEGYT
939 >tr|A0A0S8CN91|A0A0S8CN91_9BACT Uncharacterized protein OS=Nitrospira bacterium SG8_3 GN=AMK69_14025 PE=3 SV=1
940 -GLPPSDISRIQRSFRMVASQGEKMASRFYDLLLERSPELQKFFHPGN-------LSQQHAKFFNGLHSLILHLEHPQALraaLVQLGEQHQ-GDGIEIQHYPPVVDTLLQVLTEFSGEGMDGETYDAWAHFLHLVRAIMLENH-
941 >tr|A0A182IYR6|A0A182IYR6_9DIPT Uncharacterized protein OS=Anopheles atroparvus OX=41427 PE=3 SV=1
942 -GLTKSQKVALIAAWSIVKKDLVTHGRNIFVIFFEEYPQYLDYFDFSASdAtgdlGENRSLHAHALNVMNFIGTLIDyGLNDPDllkCSLARLVRNHR-RRNVTKEDVGAVGGVIMRYCLKALEQHRSKTLEDAFGAFLGTVAAAFE----
943 >tr|A0A182QXV6|A0A182QXV6_9DIPT Uncharacterized protein OS=Anopheles farauti OX=69004 PE=3 SV=1
944 -GLTAQEKITLFSAWGLIRKDLDIHGRNMLLLLFHKYPHYVSYFDFTDDaSaqtlVDNKSLYSQSIHVIKTFGSLIEyGLKDPAlfnETLKKITRIHA-ERNVYGKDILTIGDVLLNYLAQVLGRQVSDALPDAFRKLFVTIAGRFP----
945 >tr|A0A1Y1I4E0|A0A1Y1I4E0_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_002310190 PE=3 SV=1
946 -QLSPFEQQLVQKTWKLLQPRLADLGQAVFTHLFQKAPKTRPLYTCPlRLadgdrrTPDGHAIPTHAVEIVSTIGLAACRIGSSSRILAVLErlgQRHV-AYGAAPDMFSVFKEAFLVALKKTLGGeHFTAQVHKAWSKALDSVVAHLKKG--
947 >SRR5271157_2714777
948 -SRIVDRLTALRAFFAEMEPQLPVIVARSYERLFDVEPAIALLFKG--------NAREHQLRFLAKLQSIVKLTRSSqlwpasaatgQILipeVLDFGRSHA-KIGVLPVHFSLLNDMIAWTCKEIAPLRFTPLVEEGLAFVFDVLGASLTAK--
949 >ERR1719323_206356
950 -KLSEQEKSVLKSSWAVISKNLEVVGSQMFIEMFQANPDTQHQFSNFrgiDQTelSETPQMIQYRTKVVATIGQVIDNVDNTHMLWDlliKFGRDHF-SYGALPMYFDLMGPHFVIAARNNMGNDWYEALEYHWLALFELIIYIMKFGWH
951 >ERR1719461_2449329
952 -----------------------------FLPSFDHDPECPEKISLH------------CQRVMSVVGGSIEHIEDYQCLWKhliSLGRDHF-GKIYEitlgqkSTFYPKIHSLKIpIFTKfTFLKSNFSQNSRFSNIKFL------VISGX-
953 >SRR5690349_7596073
954 --------------------------------------------------------XMQMTRFTDL-GLRTLMLL-asaestgrrvttRTIAVGANASEHH-VAK----------------------------AVSRLAELGMVMADTLIE---
955 >SRR5215510_2422438
956 -QMTKEQIEVVQNTFNKVRPMSGTAAQLFYNRLFDVDPSVRETLLW--------TLKQGlGADFTPEAEVAWGNAYDFLAAVMQQAAKGA-SMX-------------------------------------------------
957 >tr|A0A158PBC2|A0A158PBC2_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=3 SV=1
958 -LPNPRERELLRRTWSDEFKFLYELGSSIYIYIFEHNPHCKQLFPSIAKygddYKDSREFRIQALRFVQTISQVVKNIYHMDRLESylyGIGQLHCKyaHRGFKPEYWDDFKDAMEHSLTDHMNSlsDLDAqqrsEAVAIWRKVAHYIISHMRTGY-
959 >tr|A0A2A6CNA4|A0A2A6CNA4_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_32112 PE=3 SV=1
960 -QCNPRYTALLKSTWSDDFEVLFALGAKMYITAFEgpHGVACKSLFPWVAKyeeagenYADKSEFRLQALRLVQTIVKALDKVDDLQKLEAylyAVGHRHVFylPVWLDPVYWDVFKasratsylgqstmlksaserDAVQVGVNDHLHKlsKLSTddlaRATLIWTDIIEYIFEYVKEGF-
961 >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold3481696_1 # 1 # 387 # -1 # ID=3481696_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.584
962 -VLTSNDIALIRESWAYAKDI-PAIQTETLLEHFRIQPRTQALFPKFaDVPlnklPTNDAFIKQARSCVSFGLNFIVANLDNPSLLkDMLGRVdTyG-KWYVDF--MtkeRQMQTTVdifIQVLSKELGGRLSAAAKAAWTRAMTLVFVEMMS---
963 >ERR1711894_485352
964 ---------------ILLYNY-rfLTYVIYYYYRFLAEDPTVASVFSRVNVdDQQSGEWHAHMLRIMGGVDILINMMDDVNVLTEEVKHLraqHVVREGVTHERMKAFLIIMMDELPKVMT-HFNH---DAWKSCLSKKLKRIG----
965 >SRR4051812_36412483
966 --------RRRTRGSARITWPGYQMRNLLSPRLFDRASAVRVLLPD-DLT-------RLKHQFARTLHWLIGHLHEPQKVriaLVDLGRRHQ-EYGVKAEYYPAICEALVDSLATISADDWNDELARDWRQTFELMVHHMLRAYR
967 >tr|A0A1Y1IHX6|A0A1Y1IHX6_KLENI Uncharacterized protein OS=Klebsormidium nitens OX=105231 GN=KFL_006460015 PE=3 SV=1
968 -KLSDERILKAQALWDFMEGsafadndrrQFIDRGVKVFENLFELAPQVLTLFPFKDENgrPRRKELEVHVETVMSTTGQVVRQMQDPDSLAPMLTELtalHV-KYGVELIHYDILCSTFLLTFEQLLGPRWNSDYRDVWISIFSFITTFARKAY-
969 >SRR4051795_8230555
970 -----PAVT-----------------------SPRVpA---------------------------------------------FgSPCPvirQQ-RWTGAI-----IGTRQEGSVP----------SAHSTTSGD------------
971 >SRR5215203_3322109
972 -ELSERTIALVKATVPALEAHGLAITRRMYERMFH-NEAIRDLFNQSHH----GETGSQPKALAAAILAYARNIEILAAWGEAYWYLaevLI-ARERLIyqglaaapGGWTGWRDFTV--AEKRCESEVITSFVLRPTDGGPVLRHR------
973 >SRR3954470_353290
974 -----ARRS-----------------------------------------------------------------------SPLaEGDPryhVH-QWDRGRQPRRSTRCRVTPPVT----------NIRRYLVGP------------
975 >SRR6478735_1414904
976 -----SGSR-----------------------PARLaS---R------------P-SW---------------------NHRPIgEATLvnrYG-RS---A-----AGSDVE--------------RIERDLSGT------------
977 >SRR3954468_7455402
978 -----APPD--RA-----------LT----GGGETVpG---V------------R-ASR------P-------------RTIDRsGRTLvsqSE-RS---A-----EGSGVE--------------EIERDLSGT------------
979 >SRR3954470_12739883
980 ------------------------------------------------------tsaCSRTRTSATCStsrtmarqapsprrspPPWSPMRAISTTSARSPRVERIaqkHV-GLNILPEHYPAVAESLLGAIKDVLGVTHYSRGLTDDPDWYPYLKKHEWL---
981 >SRR5215831_13609655
982 --------KPCNRSKPFFRINAFCSAvslalrlQRLCELPESAHPQRC----ASCL----K-TANPAKNVVPKRFGTFISIHLRDTYIFAVSKIgqkHC-GLNILPEHYHYVAESLLGAIKDVLGEAATEEVLSAWGEAYWFLADVLMA---
983 >tr|F2UFM9|F2UFM9_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) GN=PTSG_06664 PE=4 SV=1
984 -RLDMEQLKIALGSWTAVVELVPTWHEVFFAELFQAHPETeRLLYSSDKSK---SWNERHMARVGKSVGDVIKSLSNYDDVIEHLTTGephEQ-ACCL--------TDG--YVIGTGLGNT----PRSLWLACGS-----------
985 >tr|A0A1Z5KPX1|A0A1Z5KPX1_FISSO Uncharacterized protein OS=Fistulifera solaris GN=FisN_16Lh317 PE=3 SV=1
986 ---SPACVMKVINRWETARQRngfDEQLDIDTLLALFKMDPQVKPIYGFAvEKEvkaQgmQRMGVLIYGLQVVKMFDVILSALGPDeElfyDVVTEMGEQHC-KHGLTPDHFTLLCGAVMGVLETIMDTEWTKDVRAAWSQVIECVNAEIVK---
987 >ERR1712000_676789
988 MSLTPQQSAQIRSSLPVLKSEGETITSLLYASLLHNHPDLHNLFNSVNQAN-----GRQPRALLSSASVKGTARWESHQLS----------------------------MISSRGTCWRPSR-RSWGPSGRLSX--------
989 >ERR1719328_19047
990 -GMTPEQKQLIDDSFAVLKKDVKGNTIVFYETFFKMNPELVAHFPGVseaDLVnlGKNEFIIQRGAKFFNMIETTTHLMESKEGCLELVRMLkesVP-EGKVTYDRYKVAKEPFIKMMETALGGNFSAETKAAWRKFFDSLAETTK----
991 >SRR4051794_16351730
992 -TLTPFEVGVIRTSFRDLQKRSGPAAQRFFRELFSYDAALRELFAP--------SPWTRQENLMSVLSGVIEQIDSSTTLTTHLDEVvrrFP-AFAVNSYYHLYVGAALFAM---------------------------------
993 >ERR1719187_1205752
994 -SLSQGENDALKAGFKAAQGKLGDIGANTFANLIANDDSFRQRFPWANsdITveeiKTYAPAIAHGEKVLQGVNVAVKNLDRLNSFVSyfvDEGVKHV-PRRVTVDDFQAFAEAVHPAFQKELGDLYTDDFKNGLTGLLGFISDNMAKG--
995 >ERR1719187_2594184
996 -QFTEAEKTILRDTWKGTIQpHMAQNAANLLITYINENPQDRKLFYWGRndKSgmalRVSPGFVTHSQGVFSGVGVGIDRLDNIASLDKfytQLGEDHI-PRGIHEGVFAPMKDAFLQILGHALQEEFTDEAKAAYGKYYDHIAGKMIEG--
997 >ERR1719309_658292
998 -HLSGEEKQLLQDTWSRSIApLKHENGANMFIHFITHNPELRREFFWGRnnKTamalRVDVRFASHIRSIFDAIAHGISRLDNMDSLQGyytELGQDHI-PRGVQRVMFAPLADSFMYAVGLALEDQFTPAVKAAYLKYYMHIP--------
999 >tr|A0A0G4IVL1|A0A0G4IVL1_PLABS Uncharacterized protein (Fragment) OS=Plasmodiophora brassicae GN=PBRA_001183 PE=3 SV=1
1000 MRLSARITNLVKSSWAEAMTLQgrdgMTLQKAFYNHMFTKAPESRAMFKE-DTS-------KQELMFGQMMTDAVNILDNFEELVNKlvyLGEVHR-YLDLAPEHFRVVGESLIGTLEDILGKkRFNAEVKEAWVMVFDLMATIML----
1001 >tr|S6BNG7|S6BNG7_POLVA Globin OS=Polypedilum vanderplanki GN=PvHb32 PE=2 SV=1
1002 -PLSKEQADEVRHAWDKVKSN----EVEILYEIFKAHPDIQNKFPQFagkNLDsiKNNSDFGTHATRIVSFITEIMSLGGKpdllpaIKTRVNEMGQNHR-NRGVTKEQFNEFRSTLTDYVKHHS--SLDGDTEHAWNQAIDNVFFIIFSNL-
1003 >tr|S6B7W8|S6B7W8_POLVA Globin OS=Polypedilum vanderplanki GN=PVHb31 PE=2 SV=1
1004 -TLTADEANLVKSTWSQVKDK----EDEILYDIFKQNPDIQGRFPMFvgkNLDsiKSTEQFKTHADKIVKAIGSYIDLLGNesnsgaIKTILNELGQRHR-DRGASKEQFNEFKTSVLKYVKEHAS-GWNDASGSAWDKAFDDMYKIVFSNL-
1005 >SRR5579871_994368
1006 -----ADPMNINESIHDILNRDEIVADLFYDVFLDRHPEVRRFFVGVDI-------RQQAIV-LTMMLSIIEDfYHHsypaTARYLRLVGQRHK-ARAIPKEMYLIFCQCLLETLERFHGQNWSAQLSDEWERAFDKASQVLLEGYQ
1007 >SRR5512135_1415087
1008 -------TELIARTWEALGDRQAQFIEAFYDRFFERFPGYRKLFPHE-LR------TAHLEKMVLTLALLADLSDDRTAIAPRLHKLgaaHK-PFDLELRDFNNFKAVFIEVLGPQLGKQWTAAAAKAWNDAFDAVLIP------
1009 >tr|A0A163MXG7|A0A163MXG7_ABSGL Uncharacterized protein OS=Absidia glauca OX=4829 GN=ABSGL_15412.1 scaffold 16614 PE=3 SV=1
1010 ---SQTDIDLVRSSWERVIETqhpsdedgvspAQAFGLVFYAALFHLDPHIRPLFDGTNVMIqakmltfvigclvRAPMVIQRRGPTLKEISTTPTGAEDMEGLAAKIRELgarHH-FYNVEPAHFQLVGPAVDMALRERLKHEYTDAIGQAWLRTHAFVAHHMA----
1011 >SRR5207247_8066543
1012 ------DVQRLQESFARMAMHGDAVPLFFYSDLFLRHPETRDLFPV--------SMAAQRDRLVDALGRIVSDVEHVDADSGDPSGArpeDA-HIQAVRILsnAQQMADNYVADAQEY-----SSQLSTX-----------------
1013 >ERR1719193_187210
1014 -VLTENDIKAIKAIWYPVRQTPADIGAAAFEKFFKLYPHQKEKFWFMkNDDLKEKGMRAHGEKVIKSLDEAVLRTVDrarIRSCLQRLDYIHF-QMGITEEDMEELSDAVVKTIKEVVIdtnKKLTHEELDSFKKFMKMVTAE------
1015 >ERR1719193_859649
1016 -------------------------------------------WRMLkKRH------NRDGGKLLH-PLKTILQTCYksrIKNCFQRIGYIHF-RMGVQEEDMEQLGEAIIKTVEAAWGDEFTPEEYAAFRKFMKKFTAA------
1017 >SRR5580704_1734515
1018 -----------APRAELATGVAPDYgSPDDVASRRSQSRACRRTLRR-P---------TTGAVRGEMLARVIEAILDFIgeRRYAhHLiqcEVVtHE-GYDVPPETFGIFFGVVATTVREQLADAWTDAFDEAWRTLLYDLDY-------
1019 >SRR5258708_241677
1020 -----SCGEDPAGSSD-------DHDAD----VVASAGQVEGGVDL-V---------EHPPALGVPIAAPCQWLVDLEgaGACAaNRmaaERVnHE-GVGVPPAALARFFPIVAETCRDLLGEAWTGEIEAAWAGLLTRLAV-------
1021 >ERR1719296_55987
1022 --MDSDMQVAVQKSWEKVQEIGTlAVAELLMKHTLEIDPEAIQLYICKAKPGEDENVLDVARKLfartLFILGSSAAGMADTAHVVKNLTVAGStlANSGVKESYFNTVGTAFQMTLQEVLGDKFTPEVATAWKVAFDFMTAIMVAGMR
1023 >SRR3954451_11513015
1024 -AASPCAQQLRQGCRDRPA-----ACQLVLSSGVRDRPGCEIAVQG--------RHGEAGPQADGGADGLIDAIDRLDTI--------------------------------------VPAVEAAWTEAYTILATTMKD---
1025 >Dee2metaT_27_FD_contig_31_2132282_length_204_multi_2_in_0_out_0_1 # 3 # 203 # -1 # ID=1013462_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.592
1026 ------------------------------------------------------SAATSNPQF-------VAAV-------------------KKAIDYSGL--------LTVAGQGAVQPagiipSVIAGTLPAADALKQDVAG--
1027 >AntAceMinimDraft_18_1070375.scaffolds.fasta_scaffold521461_1 # 3 # 443 # -1 # ID=521461_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.569
1028 -------DD---------------DDDDDDdDRMFHDHPEARALFSRVhGDNTYSPDFEAHAQRVLGGLDSCISLMDDPDTLASELGHLkaqHA-DHTdVTAEHFDVSICFSsTDVTSTYTsthckimdrpnYTVFQT--RGQrnltksaSRRAHspvRDHPRG------
1029 >ERR1719191_324407
1030 --MDDSAMKITQESWAMVEKEVPHWPEIFYDQMFA-DPSVAKLFPFSsGNFKENPKFQEHTQKVKDTMHTAMTSIKEFDKLrpvLYKMGQRHV-AYGTLPEHSTNFKNAFLFTLKAGYGDKWNEDLDDAWNQCVDALL--------
1031 >tr|A0A0P5XAJ2|A0A0P5XAJ2_9CRUS Di-domain hemoglobin OS=Daphnia magna OX=35525 PE=3 SV=1
1032 -LLTANDRRIIRKTWARAKKD-GDVPPQILFRFIKAHPEYQKMFKSFaDVPqaelLGNGNFLAQAYTILAGLNVVIQSLSSQELIANkinALGGAHK-PRGATPIMFEQFVNVAEEVLAEELGSSFNAEARQAWKNGMRALVTGIT----
1033 >ERR1740129_283753
1034 -PLTRREIRTLGLSWSKFHGCRQEFGVELLVQFFQLVPEASDLFRFQRekTISENPGLKNHADRVVRVLSRVIHNILSLEEVVPDLKALgmkHYMDYGVSPTHYCLFGKALLGTVQTF-GG--TPPEQGCLPKLYEWMSRTMTS---
1035 >GraSoiStandDraft_56_1057294.scaffolds.fasta_scaffold759510_1 # 2 # 568 # -1 # ID=759510_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.697
1036 ----FVTTQCVVENWERLkySPFFDEFVIAFYQRVFRLCPQAKSLFGSSfCLD-DQAAMT---QEFVRLIDRILDLLGPESqlmvEVLRDLGSHHE-AYGVTVEMYDIMRNAFLLTLEQFEGEKmFTSKVRQAWTTVCSAVADVMTEA--
1037 >tr|F2UFM8|F2UFM8_SALR5 Uncharacterized protein OS=Salpingoeca rosetta (strain ATCC 50818 / BSB-021) OX=946362 GN=PTSG_06664 PE=3 SV=1
1038 MRLDMEQLKIALGSWTAVVELVPTWHEVFFAELFQAHPETERLLYSSDKSK--SWNERHMARVGKSVGDVIKSLSNYDDViehLTALGTRHA-RYGLHVDQLDLFINAFLWTLGAGLGDSWDHSVKKAWMHVLPFILSPLKS---
1039 >UPI00054DD732 status=active
1040 ----------------------------------------------------------------------------------LTCARDF-FltfVGVERCR-PKLLKQEPQTITSKLGm-A-PMLQSAFWSIRVMRIASS---A--
1041 >tr|A0A1E2UUQ1|A0A1E2UUQ1_9GAMM Uncharacterized protein OS=Candidatus Thiodiazotropha endoloripes OX=1818881 GN=A3196_04875 PE=4 SV=1
1042 --ITKTNLKRFQQSLRRIS-LKQGFYDTFYDHFIAQSDEIAAIFHARDM-------DQLKGKLKETLQMVEDALMGKPGvvlYLEMLGRIHT-RLKVDQRHFEMWKYALLSTIERYDD-EYDAEVKMAWEAAIETVVSLMYPES-
1043 >SRR5262245_29633745
1044 ----------------------------------------------------------------------LGNHSTR-cgrSVESSQSNSTA-DFLNSRRIHDAYSpaiRAAKSKSE-------------------------------
1045 >SRR4051795_1885912
1046 -----------------------------------------ApRTARRRL-----QPGQPGRRLAAdRAGRVGRGlRQRPaegprtdsrapavadraqarvaghrprpvrrraRQPVLGHRRRAR-EGGHTGGRRRV----GRGLLADglCPGQPGARPLQRAWRAA-----GDGVAR--
1047 >ERR1719218_338423
1048 --AEEAGDTVLV---GGAPLgarqRPMATGSKIFRKLFTGDTAVLRLFPFRHQartLFVSAPFKLHAKLFVDTMTELIANLHDLEKVERdvrELGKRHL-TYGVQPAHFDAMGEALIASSTS------------------------------
1049 >SRR6516162_179054
1050 ----SQTVMDIEESLHHILEREKLVADLFYMVFLEKYPEVRRHFINVN-------LRRQAVLLTMALQVVVQYYLKgFptaEAYLKILGEEHN-RRGIEPELYPKFCTALLETLSRFHFHDWSEDLAQQWEEALKLAATEMVEASP
1051 >tr|I2G907|I2G907_9HEMI Hemoglobin A OS=Anisops deanei GN=HbA PE=2 SV=1
1052 -SLTDREVEVINQSWNQIKAQELVVGLQMFKTLFQRYPQYERLFTHLHQSgkslYEGDRFQRHVvGNIMSSINKVIETLNSSDNAVKTLQDMgvkHK-KLDVHRKHFESFVPFVVDAMVSVRMSMSQDEVASAWTKMMEGVASNLSKG--
1053 >tr|A0A0P5UVQ8|A0A0P5UVQ8_9CRUS Putative di-domain hemoglobin OS=Daphnia magna OX=35525 PE=4 SV=1
1054 -LLTANDRRIIRKTWEPRpR-RTEDVPPQDPLPFHQGPPRVPEdVQVLRlCSPsracEQRKLLGPRPNTILAGLNVVIQSLSTHGAYCQPNQRSrsaNK-PRGVPPIMFEQFGNVAEEVLAEALGSSFNAEARQAWKNGMRALVTGIT----
1055 >RhiMethySRZTD1v2_1073278.scaffolds.fasta_scaffold3173058_1 # 192 # 530 # 1 # ID=3173058_1;partial=01;start_type=GTG;rbs_motif=GGAGG;rbs_spacer=5-10bp;gc_cont=0.740
1056 ---------LYSGTNvytgataslLAQADYLSSLIGDTDYPMFDVESVVQLFL----------------------------------------EwehNKHH-DIMGFRN---YPHKSVMTG-------TRAPVHHTPWLQALDDSMECYLNT--
1057 >ERR1719183_2765469
1058 --------------ADIFMPRLEEIVMRMYNLILEEQHECINIFNTPSLS-----PGQPLAALAACIRGLIEDINVRPRLEhrvEMIAQKHC-AINLQAHNYLGLQGMFMSAAEDVLGADMTPQRFSAWSQALLFICRLVIER--
1059 >SRR6478609_8547471
1060 -VlvdveevlrvvfgFDLPQTDVVRSvVLGNPGQ----I--------IAVHKVDV----------------AAGGRIGPQGGRVVPHPRDVcLV-LRRVHPLR------------------------------------------------------
1061 >ERR1719383_1602644
1062 -------------------------------------------FGLHL----------------QSTMLVGNDLDPVDERG--pdhCQQALW-TASE-GRTLSHRRREPCRSVLEVLGEdVVTPEIGGAWREAVQALAKILIDT--
1063 >ERR1740139_1260005
1064 ---TEQMKTDVVSSWGKVLSFGTlTVGRVLCRHTFALSPDMHALFPPHILhkyqeegeTDSNGALSRHFSMILNAVGCVVSSFDQDadLSTITQLGMRHA-SYRVVESHFETIGRALELTLHDILKDDFTPEVRHAWKLVYSFLSLVMIRGI-
1065 >tr|H2ZAE8|H2ZAE8_CIOSA Uncharacterized protein OS=Ciona savignyi OX=51511 PE=3 SV=1
1066 MEMNAQEIQDVRDSWKRLCADGeKTVGLMLMQKLFNTYPESIKVFSRLGITnkaiitiddlSTNSAASRHAESLTSRIGTLVDLMHNTHefkECSTEVGEIHI-KYGVTAEHVDILGNVLLSVICDSQGLSKSSDLYLCWTKTWEGIAKYVK----
1067 >SRR6185437_12825295
1068 ----------------------------------LIAPRLELILPA-DP-------ARRDAAFLELVDMVVQRLDRLDLLLPMLAAQaHSwGKRDVLDGDYVLAGKALAWTVEQVIK---EPAAIAAWRDTFDFLAGVMRR---
1069 >SRR3954465_11422119
1070 ---PCRSSPTTSGRSPGAS--TRT---------------CStAtRGCWTGPStgatrpRA-----PSRSRWPGPSRSSpahwSRSPSRSpSTCSpgSRTSTTHsasprpppP-PPPPARAERGVVQDNLFWAIVDVLGEAVTPEVAAAWDEVYWLMAYALVNQ--
1071 >SRR3712207_885952
1072 -------------------------------------------LGR---------------------GLLadglRAHPPGAgALQR---------PRRAAGDGVAGVggRRGENRERGRREPPPAAGAGTPGVDRAAPPGRCRPGTP--
1073 >SRR3954465_6877418
1074 -AtaaaTAAASSTDIRATRPASLEG-------------HDRPHLDTaEAGRAQLADG-----EGDIEVGGVDEvVAtqhlLRLHERAvGHlgpPTDARRGAGR-LQGVAAEELGTVRLDLDGELVVRLHDL-----VEDLGRRRRVLALVLVDQ--
1075 >SRR3712207_8177874
1076 -VLSDRARPVVEATLAPVADNIGEiarRRSEER---------------------------------------------------------------------RVGKECRSR-----WSPY-----H-------------------
1077 >tr|A0A0L0FUF5|A0A0L0FUF5_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_07147 PE=4 SV=1
1078 -ICKPEELHtkdlgfivtHTNNPW--GSTDEQDFGVDFFRDHADQ----------------------SGLTSFFSSIVIIACEMYQEfePSIPQLQKLgeeAK-HLDIPCHMEDNIVGYVASTLSR-SKQ-FDAIEECAIFKLIWRVVLFVLE---
1079 >tr|A0A252E791|A0A252E791_9NOSO Nitric-oxide synthase OS=Nostoc sp. 106C OX=1932667 GN=BV375_01385 PE=4 SV=1
1080 -ALPPQMLHQMADCWEVFSQNKQQMGMEFYQILFEKYPFVLPIFGRADMD-------YLSLHLFQAVEFLVRCLRTGSsdNMLQELRFLgqvHS-FADVPSCAYPAVSDTMFVLFEKYLPN-FTPELRQAWQILFDRVVNVIKL---
1081 >tr|A0A2T1LS65|A0A2T1LS65_9CHRO Nitric-oxide synthase OS=Aphanothece hegewaldii CCALA 016 OX=2107694 GN=C7H19_21845 PE=4 SV=1
1082 -ALPPEMLQQMIASWSVFSQNKQEMGMEFYQILFEKYPFVLPIFGRADMD-------YLSLHLFQALEFLMRCLQSGSseEMLQELRFLgqvHS-FADVPTCAYPAIGDTMFTLFEKYVPD-FSPELRQAWQTILERVINVIKL---
1083 >tr|A0A2E9QYM9|A0A2E9QYM9_9DELT Nitric-oxide synthase OS=Deltaproteobacteria bacterium OX=2026735 GN=CL920_22905 PE=4 SV=1
1084 -ALSS--MKEAKRLWEEGVGLHTAPGSEWVHQLVAERPEWNHFFASSDPE-------AFGEALFSTIDSAVHQLDDEVSMFSSLREDselFT-AWDVRACAFSALPDVLVDFVV---ED-HQTVGAQALRTFLRRVCTIVSL---
1085 >HubBroStandDraft_6_1064221.scaffolds.fasta_scaffold2618798_1 # 2 # 181 # -1 # ID=2618798_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.622
1086 ---SAEDRSIIQEQWKILFKDVdsskikIAVGRKLVLNLIQRQPDAKVLFDKFNVdEPNSPQFSAYALRLFNRIDLIINLLKDPEALDAALEFnaeRYGNIPNIKKAYFQTAAQILAYALPKVLDD-FNA---LSWQSCTRYILTTVASKVS
1087 >RhiMetdeSRZDD1v2_1073273.scaffolds.fasta_scaffold2404579_2 # 426 # 629 # 1 # ID=2404579_2;partial=01;start_type=GTG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.627
1088 ---SSEDRRIVQKQWNALFGDVrssrvkIALGSKLLLKLAELRPDAKEALKPIHIdDPTSGEFQAHSFRVLNSLDVFINLLTDAEALDAALDHhskEHSGIAHIKKEHFKVFGEILISSLPKVLDD-FDA---FSWRSCYKYIGQRLTAQLH
1089 >sp|P02210|GLB_APLLI Globin OS=Aplysia limacina PE=1 SV=4
1090 MSLSAAEADLAGKSWAPVFANKDANGDAFLVALFEKFPDSANFFADFKgKSvadiKASPKLRDVSSRIFTRLNEFVNNAADAGKMSAMLSQFakeHV-GFGVGSAQFENVRSMFPGFVASVAAP--PAGADAAWTKLFGLIIDALKA---
1091 >SRR3981081_215795
1092 -RDDPDQKQLVRAFWKQVVPTAEAAAGLLYRPPFERGPHTPAPARVsrpTAAS-------PARGSLLECWGFQSAAGQAR----------PANGEGGKP----RPPPRRL-----------------------------------
1093 >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold136029_1 # 443 # 1567 # -1 # ID=136029_1;partial=00;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.433
1094 -----HHLQFLQQQISAAEPRAGIAMLVFWKNLFELNPSLRPLLGEK--P------GEEDYLLVQFLAAGLAPLFRQTPNTAPTdQDGACAPVNTDeEQQCSVVGEALLWSLEEAFGADFTPKVRSAWETLYRFITVSNKQSY-
1095 >SRR5687768_12147577
1096 -------------------------------------------------------------GLAHARMDSvSLK--PpanphcaiktwvlacgvparTAEWRPMSNlSDAP-SPSLLSDQSLSV----VQ-TTATVVAAHADEITAAWSEVYWLVALQLVA---
1097 >SRR6476660_4664138
1098 -M-VVVGVDAHKrtHTCVAVDGSGRKLGEKTVPATT----------------------VGNASALRWARSTFGpdltwgiedvrnvsRRLE----------QELV-NAGQR---VVRVPTHLMARTRasartrgksdsidaTAVARAvpREPDLPVAqHDSVS--RELQLLI---
1099 >tr|R7TLW3|R7TLW3_CAPTE Uncharacterized protein OS=Capitella teleta OX=283909 GN=CAPTEDRAFT_227018 PE=3 SV=1
1100 -----------EITWAILSENRDGLGTEVFVRMFESYPDLKSAFGPLrHMNKKdagyEDVLRAHGIRVLSIVEQVLSKRHNMEEVLSILHDLgrkHL-TFSAKVEYIDIVSQMFLFAIESALKEKWNNSTEKSWGEIIRFVTYVMKET--
1101 >SRR5918994_1081840
1102 -----------------------------------------------------------------MLAVAIEALLDRGGegrlagLVGIERMNHV-NIGVPPEVFDGFFALLMEVVRDALGPPPKGGGeragGGGWPPAPRPAGAR------
1103 >ERR1712157_679996
1104 -----TTMDCVLSSWEQVRRIpnyRETVGLAILQKLIHRMPEGREVLHMQrNLIknsppgiESDKLLLAHARAIVNGLDTVVELlgplIDDISEILREIGKSQYHDYGDSMALWNpLMRECVLEVIQETLKDDYTHELKVAWTDFLGEVAKDIHS---
1105 >ERR1719360_423992
1106 -PLTQAQKEIIFTSWDAIT-HKENLGVTIMYRIFTGHQEIKHLWKFADdLKteeeiRGSKTTQFHAKKVINGVNSAIKAVEAgkeVESlGLDKLGARHF-KYGAKPADFRHFVESLFWAIKTIVPE-VSAEMAAAWTNFVMQIIKQMTN---
1107 >SRR6476660_7963253
1108 ------------------A-SHSTFFERFSSNFKAANMSLQPFM-----D-------RQQKLLREDLTKLVMCAENAEFa------TRPGAvALNVSPQLSKFWIDALMLTVREFD-EKFTPELERKWRTILQKGLA-------
1109 >ERR550517_1828149
1110 -------IYYVSikPPKNRLESHIRKqSRVqsdysQDYIKETAIFSFFIQIFHKLNPNPNSsgikytkdqalkESLHEHGVKVLNGVDEVLSNLDQPSLCFSLIRKTgahHRKLQGFKPKYFKCFEEPFLAMVENSLGQRFTPQMETVYRSVATFFVQTLIEGY-
1111 >ERR1719220_3089060
1112 ---------------------------latvnIHLRSAFHASSLLIQIFQKLNPNPNSsgikytkdqalkESLHEHGVKVLCGVDEVLSNLDQPSLCLSLIRKTgafHRKLQGFKPKYFKCFEEPFLAMVQSSMGQSFFIFPGllPKWRSFTSPSPASLSK---
1113 >SRR5919199_1911786
1114 ------------ATLPVVSDHIGDIARRFYDHLFGEHPELLdGTFNRGNQAEGTQKV-ALAGSVAVFASALLKRPETVwRDWR--VAEKTD-E-------TADVVSFRMQRIDDRLVKTSLP---GQYVTVQVQMPD----gvrqprqfsltrA--
1115 >SRR6476659_5675031
1116 -STHRPDQALRGGGRPPHRAADNNAKGAATGHRVSGRS---SPAELPENSMREQQQ-ALAGAVAAFASSLIETPERVpQSLLSRIAHKHA-SLGIRPDQYQVVHDNLMWAIVDVLGDAVTAEVAAAWDEVYWLMGNALINQ--
1117 >tr|A0A1I3XAR1|A0A1I3XAR1_9PROT Methyl-accepting chemotaxis protein OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_101121 PE=4 SV=1
1118 ----QAAIQRA-EACLTLSADGLVLEA---------NDRFAALL-GLA-------PAAVADRPHA--ALLTLAERDGATYrrfLDQLAQGR-------------------------------DTVARLWHQGAggagvllELSAAVMAAD--
1119 >tr|A0A1I3XA39|A0A1I3XA39_9PROT Methyl-accepting chemotaxis sensory transducer with Pas/Pac sensor OS=Roseomonas stagni DSM 19981 OX=1123062 GN=SAMN02745775_10
1120 ----MAAIDMA-QPMMLLGADGVVQDA---------NAPLAALL-GVS-------ADALAGRPHA--ALLAEAERDSAAFrrfRDAVAAGQ-------------------------------AGHARLRHAGAggntvtlDLMMQPLAAE--
1121 >tr|M2X1G3|M2X1G3_9NOCA Flavohemoprotein OS=Rhodococcus triatomae BKS 15-14 GN=G419_19149 PE=3 SV=1
1122 -ILSATSRPIIEATLPVVGEHLGEISRIFYRHLFDNLPSLEsDLFNRTNQANGEQ-QKALAGAVAAFATLLVTEEAPPvDEVMSRIAAKHA-SLGIVQVHYDLVHTALFTAIVDVLGDAVTPEVAGAWDEVYWLMANSLMAQ--
1123 >tr|A0A0N5C327|A0A0N5C327_STREA Uncharacterized protein OS=Strongyloides papillosus PE=4 SV=1
1124 -NLSNDQQALIRKSWRRVP--KQSIGKVIYQKMCQKCPELKNFLST-D----NNCVERHFKYFGDMIQCTVDSLNDLDTaLYPWLNVIgsgHG-GFAITTTHWDAFGEALISSIKQWILTgKDHKETVRAWMKLSCSLIDTLAAA--
1125 >ERR1719323_2694698
1126 -RLSDKTVQLLKGSAPELKEKGTQIATHLFLSLFERYPVFRDLFPK-DNVK-S---GKMISVLPHALTVFAENADNMIQLDDIITrivKKHV-DKGVQQWHYPLLEECFLDALSSTLQLQKRPDLLQAWEDGFKFLANKLM----
1127 >ERR1712018_308843
1128 -------CSTPQILCSRVKRKRFTRGHTSFTSLFERYPVFRDLFPK-DN---G---GKMIAVLPHALTVFAEKADNMIELDDIITrivKKHV-SSGVQQWHFPLLEECFLDALSSTLKLDKRPELL-------------------
1129 >ERR1719230_2183946
1130 -WFTDDRERLLKRSWQQLQLdSCEEAGALLCRNYCSQSPEDAASCG-MDW-----------SAVIKVIGFPIDRMDNLAFVKKRLRCLganHA-KWETKEHQFQSMKYAFLSAPRDVFANEFTSDLELAWDLLYDFVSTEMIAGL-
1131 >tr|Q9NG75|Q9NG75_9CRUS Hemoglobin P polymer OS=Parartemia zietziana PE=2 SV=1
1132 -GITDAEKQLVQESWELLKPDLMGLGQKVFGRIFTKNPEYQTLFTRVgfgDTPltqlMANPAYGAHLIKVMRSFDFVIQNLGKPKTLLAYLKNVgadHI-ARNVERRHLQAFSESLIPVMQNELKAKLKPEAVAAWRKGLDRIIGVIDQ---
1133 >tr|A0A0D2WU86|A0A0D2WU86_CAPO3 Uncharacterized protein OS=Capsaspora owczarzaki (strain ATCC 30864) OX=595528 GN=CAOG_006523 PE=3 SV=1
1134 ---RHETRDVIKSTWALAIQKQdeadvtpvATFVNVFFGKLFELCPETRLVFGQ-D-------LSLQGKSLSSVLTGMLEFVVHPKKlttQVKSLAVKHV-GLGITPDMFDAFGAALVYTIKTRIGKVWSPQTERVWVDAYGGVNNIITQQ--
1135 >tr|N1QXN3|N1QXN3_AEGTA Non-symbiotic hemoglobin OS=Aegilops tauschii OX=37682 GN=F775_23753 PE=3 SV=1
1136 -TFSEEQEALVLSAWDAMKGDSAAIALKFFLRGRNN-------FVQLaHVEspkRRIPVVEERKTDL-----------------IFEIRTKTW-KIGQKSTAYRSW--LLLR--QKSLPa----HAPKGHLSElvpldTIDHTHQET-----
1137 >tr|A0A2T5C1R0|A0A2T5C1R0_9BACT Hemoglobin-like flavoprotein OS=Mangrovibacterium marinum OX=1639118 GN=C8N47_108138 PE=4 SV=1
1138 --MTEADITVIEKSYAQIEAALPRMAKYFFNRANELDSDLDPLFEE-DKS-------KHGEAFVALFGKAVEHLNSPEALLPEIKKMEAklKYYKFNEEVLNTVGVVFVDTLSFGFGNNFTQDIIDPWVKAYKTYSSL------
1139 >tr|A0A074ZRQ0|A0A074ZRQ0_9TREM Uncharacterized protein OS=Opisthorchis viverrini GN=T265_04650 PE=3 SV=1
1140 -SLTDAQINGVQSSWKLLKIHIEKIGVIVFLGLFEEHSDFRDAFARFRQkqlsiLTRDPAFQAHGLRVLNVVDKIISRLRRIDTIqdfLLSLGSKHC-RYVPNIELVPAVGEQLLEAIRPVLEEqgLWDDDTAVGWEAVLAYLNCAMRY---
1141 >SRR3954463_14455484
1142 --AQ----------------------------PRAARPSALRLSRPGDGA-----P----FLLRAEVaCLasGI-----g-----------TF-GPGLRSHPLARLGRS-----RALRGRAVLArCPPKIWSPLD------------
1143 >SRR6476620_12491069
1144 --LSDQSLSVVQATAPVVAAHADEITAHFYPRMFAAHPELLLVFNQGNQA-----TGEQSKALAGSVvAYAVQLIDPkapsFDHVMRRIAFKHV-SLGIRPERTQLSASICSLPSLRLSATPPPPrpprpgarsigCSRSSWSPR-----KHGST---
1145 >ERR1712198_397898
1146 -GLTEEEITEIQSTWKSIISdKTSEHGVNILIRFFKNYPEYKAqYFQNLnTLSedelRESPKLRSHGAGFVLAITQIISDLDNMlivEEVAKKIARNHY-NKGIREPlNYKLMTNTIIDYIKDIGN--LADGTMQNFRKMFDIFIISVRKK--
1147 >SRR5580700_967641
1148 --------------------------------------------------------------------XMNRNIG----LFFPLIRHs---------CTYF--AQEPVLeFLG-GFKSAAAD-DQSVRVERIDHL----IE---
1149 >ERR1719464_2687596
1150 -NLTEEEKKVLRTSWAIISQKVDQDGESRFLHKFESNQENEDPILQQ-FT-QIDASICVNCCNIGSSFSWFdsnlcRNllSPSWSTFWLIIAQLvrsTF-FSSS------------------------------------------VKFGM-
1151 >ERR1719375_1958814
1152 -----ETALTVIDSWELLRRKknyAVVVGSGLFKKFFQEEPGAIAIFGFTDEEiesdeepfYQSKRFIDLAKNFVGVIDQAVDMLGPEMEmVGEVFVELSK-QYKIEIQHYMLLGNLLLEELEDVLGaKAFTDHIKSCWVQVFQVLCKDVKKKL-
1153 >ERR671932_89059
1154 -S-PTSCGPARACRSCCCTPTPPRRRSR------------YdGVHEG------------------LMDLSSFPLPDD--ALFYLCgplpfmravREQLL-DLGVSPRDV--qyeVFGPDLWQADAdeGPGDAPEPgahdllgpEERQGPPPA-WSRPG-------
1155 >SRR3712207_7345787
1156 -V-LDDVRALPNATVHVWYESGAASALP------------VdGVHAG------------------TMDVRSEEHTSELqSRQYLVCrlllekk--KTI------------kyeSTXX-------------------------------------
1157 >ERR1712168_1470941
1158 --------------------------------------------------lmLTCCkiqKPRNMLMGFSKPWAPQLIVFDTLGSLagyYTSIGVKHI-PRHLEHAHFGWMKASINEVMMSELGDAYTADFESGWDKVISFILERQEL---
1159 >ERR1035438_5604951
1160 -EQTNDLARIFNDSYERVMHgpgrSSGEFFVAFYDLLTATSDEAASKFGNTDM-------AEQVRTLQSSVPVLLNFFvSsRQDEYLGKLAERHSKrGVDIPPELYDVWLDCLVETVRQFDS-KFNDDVATAWRTVFSKGIEVMTSRYE
1161 >SRR5476649_733261
1162 ------------------------------------VTGVQ-TCAL-PIC---GL--VRGQMFQVTMESLLDFLGDRSygANLIQIERVnHQ-GLGVEPEMFDRFYLTVMATFKDILGAGWTQETETVWGRVIAELTG-------
1163 >ERR1719284_537611
1164 --------ELLEQTAPLVAMRTEEIHSEFQSLLLQHNLELLSVFNIPR---QSDDVIdAETeeiasHHLAGVVLAFAAHVGHVQRmrELDQLAAKHC-SHNVHPFHYVVLHEHLLDAMRKALSTMLTPEVQYSWSQSLLFFAKILIDR--
1165 >SRR5580704_16882803
1166 --------------------------------------------PG--------RHGCAAPAFLPGAQPYRRCPR-gpEGPRQPRALSAgtrAR-APKFGERHYEVFRRALIATLQRFAAPRWNETAKHAWETAFNHAATVMIE---
1167 >tr|A3VC53|A3VC53_9RHOB Flavohemoprotein-like protein OS=Maritimibacter alkaliphilus HTCC2654 GN=RB2654_17741 PE=3 SV=1
1168 ---------MIRACLSDLYSVRIEFSRRFYDRFFEQVPEARRLFVH-NQ-------DKQALMLYAAVAMTMRGMESgrdLDGELIEFGKRHA-RLGVKQDMFPIFGSTFLETLIEYLPHHDHPKIAKAWWGGFTDMSTPII----
1169 >ERR1711953_6095
1170 ---------------------------------------------QLGPAdTlciadqaD-GSLSQEIQWIQTTIFqVMLHYT-----------ENvpfHIPP---HKMKFQYFSDPFLGLVHNCLGKEYNSEMRKVYQSVADFLIQTLTEGY-
1171 >ERR1712106_122433
1172 -GLTNKQLSLLITSWKSIGSEMQAQGVTLFVEIFKNNKEVIHAFPLLNPNmKgndamtMNEAFREHGIKVMSRVNEVLHNLEQLNLCVSLIKQPvpiTGVFKGLSPISSRTFTSPSSRWPRQALARSTPRKRKQSTRPX-------------
1173 >tr|A0A0B6ZHC3|A0A0B6ZHC3_9EUPU Uncharacterized protein (Fragment) OS=Arion vulgaris OX=1028688 GN=ORF61548 PE=3 SV=1
1174 -GLSARDRKLIKDTADIIFGQlkLQNKGVVFLIAFFKAYPHHQRYFKMFrGIPPdelkSIPHTENHGRRVMSNVALLVQHIEEPNVIKEQLVDLlikHN-PRSVKPRQMKDMLNMFVDFTSQQLGAKFTSQHETAWRKLTTHILSVLEE---
1175 >ERR1719502_1452556
1176 -VLPPEQSALVRRVWQRLVGT-PGAAPILVRQLQSVAPEVAALLSDAsstNGRSniNRGglhavhtDPHGRAAAVLSEVSELTELLDDSAALRQRLRQLRARMPPVGPEVYPSVGKAFLHFVWEGVGSGYDNATAAAFAALWDQVEETMLE---
1177 >tr|A0A0K8QCZ9|A0A0K8QCZ9_9MICC HTH-type transcriptional repressor NsrR OS=Arthrobacter sp. Hiyo1 GN=AHiyo1_24440 PE=4 SV=1
1178 --------------------------------------------------------------------------------MKINAFADV-SLRAL--------LVLSSAPAGELL--TTQNIADAVGTPYHHVSKAIVR---
1179 >tr|Q6BBK1|Q6BBK1_9BIVA Hemoglobin chain I OS=Calyptogena kaikoi GN=Hb-I PE=2 SV=1
1180 --VSASDIKNVQDTWTKLYDQwEAVHASKFYNKLFKDNEDISEAFVKAGT-GSGIAMKRQALVFGAILQEFVENLSDPTALSLKIKGLcatHK-TRGItNMELFAFALADLVAYMGTTI--SFTAAQKTSWTAVNDVILHQMSSY--
1181 >SRR5258705_7404034
1182 ----------------------SCPTSSSRPVLWAAvrdCAGGQTLVPR--------RYDGTRLQADGDAGRCGQQSGQSRSRVAGGERScqaSR-RPWREGGYYTPVGAALLWTLEQGFRI--------------------------
1183 >tr|U5EPU4|U5EPU4_9DIPT Putative globin 1 (Fragment) OS=Corethrella appendiculata PE=2 SV=1
1184 --LSENEIAIIERSWNVVKPDLTSAGEAVLYRLFEkyphnQQYFAQFKNVPLESLKGSTSFRKHVIRVMTVLKNAVEALRLDsadekiHELFLEVGNNHA-KRNITKESYNELRESIFVTLTAACE--LNSEEQEVWDKFLNCAFDISL----
1185 >SRR6185295_10958302
1186 -------CILLLVA-------CFLTFKLFFYSMFQDYPEYKNLWPKFRHLndealINTGELSNFCSVYMDGWEKVIGELDDNAALareLKIIAKTHL-RKGVERshimvakkealcqiriheyCYLQNMMPKMLSLLKEKNGT-LDAEVEEAWKTVFIINADIIE----
1187 >SRR6185295_987807
1188 --MSETHLELAQESLGRLNA-TPKFCGTFYQFFLESSPVIPPMFAATEFE-------VQCKQLRHGLGLLLAYAKHKnPILLERVALRHSRgDVNATPDLYPLFLESLLKAIAAHDP-SYSPELDQAWRAAVTPGVEYMKSMYD
1189 >tr|A0A0K8S6V4|A0A0K8S6V4_LYGHE Uncharacterized protein OS=Lygus hesperus PE=3 SV=1
1190 --ATPEQVAMVKKAFDPLSVDAPGVGKVFFERLFELYPGSQKYFQHLGStdeeLFANPVFQHHCTKVILSVGTMIDNYTQTtaektKSCLRNWQRFTP-NGKFPPSKHLTSS-IHLWTFFTWNHIQPWRKHG-------------------
1191 >tr|A0A0G3G1X4|A0A0G3G1X4_9GAMM Uncharacterized protein OS=Thioalkalivibrio versutus OX=106634 GN=TVD_07385 PE=4 SV=1
1192 ------TPPNVESSYRRCCA-DASFLARFRLALRAADGQVSGIFDPLSA-------RQQEVMLDASIRAALDFSSGDPqgaSRVSEMIHVHGRqgRVPVPPALYPVWLESLIQAVRETDP-HWSDALERRWRAQLMPAVDMFVELYL
1193 >ERR550517_2232778
1194 ---------------------------------------------------------------------------------------gpDQ-PKAIPHRCLPQkhrhtgsisrhHGARFLQCCPSHLAE--AQDVERRDGGLLDGSFQSDHEHH-
1195 >ERR1719309_231760
1196 -TLTEEEIQTVKTMWAGLLENSADSGLFIFQNFFELYPEQVHRFSFIrDSQgnpipnyLKSQAMLQHSAMVMDALDGVITGVFEHDPLLGqmmyNAGYSHH-SKNIAKDDIEKLSNSILEVIKLVASCegSGKATKVEAWRKLLNIVNERFEQGF-
1197 >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
1198 ------NLGLVRECWDSICEQYttNELGEMVYDHLFKMAPNLTMLFTKPR--------SYMAVKMGDMLSMLVSFADSSESMkqqISWLGLRHV-KYKIRPHHIPLMGPVFLAVVAEAAGVHWSQDTEKAWSVLFNMVCVNMADA--
1199 >tr|A0A0W1L270|A0A0W1L270_9GAMM Uncharacterized protein OS=Pseudoalteromonas sp. H105 GN=ATS75_15205 PE=4 SV=1
1200 MGINTFEKQLLLNSLTIIKPNFHCFSYTFQMHVKR-ES--------LDMLcLSSs-KINEKTYILYCVLERIVMHLDDLRTVTPFIKHYanNLSNMGMSYEDTDILCNSFLATLKIHLKGCYSPKLENVWQQAISIFRSIVTG---
1201 >tr|A0A063KVI9|A0A063KVI9_9GAMM Hemoglobin OS=Pseudoalteromonas fuliginea GN=DC53_02740 PE=4 SV=1
1202 ---MNTNQSVLLKSLQIVKPNFHAFTARFHRKLAE-SG--------IVMNyPTAn-QFNEKSYTFYCVLERIIKHLDNPSSVTPFLTHYleHLNKRNIQQTDIKILCDIFYATLEAHLGQHFCLQSQTAWQEFLTFFENCTNS---
1203 >tr|A0A1V9ZUY0|A0A1V9ZUY0_9STRA Uncharacterized protein OS=Achlya hypogyna OX=1202772 GN=ACHHYP_00581 PE=3 SV=1
1204 --PTPKDEELMTRSWDNIIGAkiraelerrklktidadDefeAssvvQFYDVFFAKLFTINPATQPVFRG--------SMHVQSKALVNIVGAIRHILHSEdaTSNIAALALRHI-QYGVKLEFFDSLGLAMIETLSAMGDtGRWNKDVRDAWHTVIAYIICILVPPY-
1205 >SRR4029077_13489679
1206 ----------VQADVHAISVM--LNLMQPFRALRRRVDQFAKLWLD--------PLWKTGRKAARIPA--TSTSITGRTGFAGRGRT-------------------------------------------------------
1207 >tr|A0A016SWG0|A0A016SWG0_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0168.g192 PE=3 SV=1
1208 -QLTSEEMDLLRSSVRIISENATEVGCNTYEMIFEQSPYVKEFFHFTKSdddAYRQKQTVQLAQKYMQVLIAFVEGIEDPSIlepVSAKLIEIHRKvddVQ--MAAHWGVFTECTLYNIRKALEKDehFNDMdrldaAVMLWRMVIRGIVRRLKA---
1209 >ERR550534_835606
1210 ------AKKIVDESMNLLAKcDLDEFGTTFYSTVFSLSVDAQQYFYKP-----NAMMKFIAKKVLTIIAAVLHEPDETAHDIRAMGLRHM-KYGVPPDYFPLFGESLTAALPGVLEGYWDDSVRTSWEGIFEFVKNCMTR---
1211 >ERR1712025_717817
1212 -TLSPEHVDPITESAPSGKAKGMVIANNLYRKLFSRHEMFRAMFPEQS---------QQSGKMIQALPSALydfavncDNMGQMQSVVARIANRHV-QQGVQGFDGTFQFIPKKVDLsliPAGQCEAKLKVALNARQPGtgvgdrFQLHPSEVC----
1213 >SRR4051812_15383594
1214 -PMTSDTIALIRASFRLAAADPQALSQVFFRRLLLRSPGVQRMFPAS--------LVRDPQRLVGLIDQVLRLLDRRDmlvEGLQNLGRLQA-PYAALPMHYPLIAGAFREALALRVGTLWSVDMEESWAELQALVIRIMGA---
1215 >NOAtaT_7_FD_contig_111_1754_length_212_multi_2_in_0_out_0_1 # 1 # 210 # 1 # ID=13324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.662
1216 -RLKPKDAEYLQDSWKVFLERsggLEGAGKEFYRLLFEKEPDLKKLFQV--------PEMSQAAAFMRAISRYVSLLAQPEQLktaIEMLAFMHV-NLGISETSIFAFAESLLECVEDQLHDWDpgeVEQVMVLLTDLTTYIGRVIA----
1217 >SoiMetStandDraft_2_1073263.scaffolds.fasta_scaffold554780_1 # 1 # 420 # 1 # ID=554780_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.669
1218 -VLTSSIYlttgTVVTDFSVIVLDAegsAIEPGEAPYSLRVYFTPASTGTstatIQL--------PSGLISDgMLAVGARRLQEETINPRRLagaCEAYGATVTSnvlTVNVrksgTASDPCDSTDAISLLFAGGMATWNslgTSVTSADFtmstnvdsdsvTYRLTFEENVFL----
1219 >SRR5690554_337115
1220 ----DEYVKLLETSFQKAVENvgIEELSTRFFSRFFETFPETNSLFKGTNIDYF---RKFKMRVIFDFLIDIVKHPNYAEAHIAQEVMRHQ-MYGLqDKEYYFTLAACLLEAVKSALGDAWTDEDESAWNDILLVFKG-------
1221 >SRR5690606_3594538
1222 ----EHHLSVVEQTIQQAIGKsgEEALAAELFRRYFERFPETKeRYFHATNIEYF---GVRKFRIIRDFLIDTLKYPNYAEGNMYNEVMRHQ-VYGLkDKEYYFGLIDALMESVQ-------------------------------
1223 >tr|A0A0K1PX98|A0A0K1PX98_9DELT Uncharacterized protein OS=Labilithrix luteola OX=1391654 GN=AKJ09_04675 PE=4 SV=1
1224 --------VVLKESWHLSYRRAPDLAARFYEELSWKYPSARRLLDHVF--------GAQNdiaVCLSTVAGDLLDNVDDPDAFSAaivALANAHV-SLDIPPHVVAWMEEVLLDTLEGAAGDDWTPEMRTTWRNAYEDLASRLAR---
1225 >ERR1719468_1094774
1226 -PLTSNDRKLIVRSWTIVDQQISQVGLSSFLELFRRAPETLSVFPFLkQLGPEdmefYHQLKNHSIRITGVISMLVKQLESEErpadeairDLLLDLGRRHF-SYGAKTSHMELLGRVFAESLQPIFEGDpEAKAIQEAWLVFFSVIVFWLQKGFR
1227 >ERR1719183_1674583
1228 LALTTDQIEAIRSSFGMVLaaaPSKEAAADTFYQTLYDASKSIQPYFVT--------PRAVAALRFVQEVSAHLSVLDDPKQLKTLVETRsfnHF-AIPVSVAAVAKVRDAIMDLFAAEIGKKFTEEAKLAWKAYFNYVGGAFI----
1229 >ERR1719458_172070
1230 -NLTEEEKKVLRSSWDIISQKVDQDGESRFLHKFESNQETEDPILQQFT--QIDASIFNGKSAMIIVALTLENLE-------KSHQTrtrSL-W---------IWSTT------DVFRLDWST-FRY------------------
1231 >ERR1719278_416587
1232 ----------------------------------------------------------kNRRRPVA--TFLLKNLKatsesslYLPGLWSTIR------------------TTIPVPVrrRQPLRLSHP----------RDLLRGCKQRPQ-
1233 >tr|C9CRM3|C9CRM3_9RHOB Uncharacterized protein OS=Silicibacter sp. TrichCH4B OX=644076 GN=SCH4B_0097 PE=4 SV=1
1234 --ISSRDIDLLQSSCATAFLKKGVLASAFYNKLFEIEPAYVNKFSNI---------NKQKIMFEAMLAYCISGITSgykVEALTARLRSYHM-HLEISDIDIANARSALMYALGSVLGEDFHSDLKQAWDAAFSSVSEALR----
1235 >ERR1719419_503384
1236 -DLSPKEILDIQMSWAEIHQEGlVNPDVLMFKLFFEESESGRLKYSHLlkNVNldnlnwmrdwTKVQKLKDSIDKTGEALGDVIKSLNYHDRVVDKLYSHgvvHA-KFGVTRKEIHTFCECLLMTLKMELGTNLSQEAQASWERLLKMIVEVF-----
1237 >ERR1719295_364028
1238 -DLTPEEKRCIQRTIPVILQEAEMIGTKTYLKTFHNYPLSMIYFEPLrDKLvtevkQTDDYLKKHGVLFVKFIGELVAEMDDPDSvdlKLKSLGRFHD-DLGVLKQYLEAIGPLFVQAIRPVLMtqasipsatncgvgvsspnSLWTRDTKPSWIRFFRVIALQMKRAY-
1239 >ERR1711860_326342
1240 -ELNSDEKTLIVTCSKQLLEIQKVLGPQMMQQKFQKV-----------------WSKEAGEL-KQLYDMR------------------------------------------------------------------------
1241 >tr|A0A2A6B374|A0A2A6B374_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=PRIPAC_54161 PE=3 SV=1
1242 --IPDDeekkLtSQILCDSLSLAIvgngEPPVENGQEFYQFLFTIDPRLQSHFVGADEfmgqdPKEPTKFAKQGQRLLMAIHTMAASFDDSEAFDKTVSdliKRHK-DRHVDPALWNKFFGWFVTFLKSKGE--LTSIEEDAWKQLGIRFN--------
1243 >tr|A0A0B1T604|A0A0B1T604_OESDE Globin OS=Oesophagostomum dentatum OX=61180 GN=OESDEN_07088 PE=3 SV=1
1244 --VSAADvRKLTSASMATVPvsspSDKTKHGNDFYQYFFTHHPEVRKYFKGAENyaaddVAKSERFDKLGNDILLAVHVLTETYENDNVFRGVCRdviNRHV-EGgrHLDPALWKQFCSIWVAWLESKGAK-ISADQKAAWDTLSVTFN--------
1245 >tr|A0A0R3RQ08|A0A0R3RQ08_9BILA Uncharacterized protein OS=Elaeophora elaphi OX=1147741 PE=3 SV=1
1246 --MSHSElKAKCIKVMNeVGRvgtdDEAIQHGKNFYKFMFDHHPDLRIYFKGAENysgtdVQNSDrfNYGFSGQRLLLGVRTLIDIYDDIETFKAYARetvNRHI-KFKMDRTLWLAFFTVLVSSLKEHIT--IDEETEKAFLQIGKEFS--------
1247 >tr|A0A1S0U934|A0A1S0U934_LOALO Globin family protein OS=Loa loa OX=7209 GN=LOAG_01385 PE=3 SV=1
1248 --MSHLEmQAKCMKILNeAGRvgtdEEAIQHGKNFYKLFYVWP-----------------SSGFTGQKILLALRIVINTYNDPETFKAYARemvNRHI-RFKMDRTLWLAFFTVLVNSLKEHTR--IDEETEKAFLQIGKEFS--------
1249 >tr|A0A1Y5FEW2|A0A1Y5FEW2_9PROT Uncharacterized protein OS=Halobacteriovorax marinus OX=97084 GN=A9Q84_13980 PE=3 SV=1
1250 -------------------ENIDQFVESFYEHFFSLTPEIFELFKNSEIG-------KQKNEFKISIHTLLINLSQLDkldSYFKDLGIRHI-CYNVSERHYKLAKESFLYAIKKTYADHWSKVVETKWEEIIDHVTLKMKEG--
1251 >tr|T0SGR6|T0SGR6_9PROT Globin OS=Bacteriovorax sp. Seq25_V OX=1201288 GN=M900_0432 PE=3 SV=1
1252 -------------------VNLKKVIDDFYNLFFNEENDLTRIFRNTELT-------LQKHELQKSLELLLSNILDKEevsKYLRDLGVRHI-TYEVKPYHYEQAKQALLLAIKNNLKESDFIKEEKAITEFVTFICINMMNG--
1253 >tr|A0A2E2XNM9|A0A2E2XNM9_9GAMM Uncharacterized protein OS=Cellvibrionaceae bacterium OX=2026723 GN=CL693_20675 PE=4 SV=1
1254 ------DIDWIESSLELLAPHADRLGGLVYPRFFVHFPEAETLFGG-GELG-----KSTQESMIVPLLMGLKDIADGKtymlTIERWLED-HR-EYGVTLPMYSVMLDSLLLGMREAVGDLWTTEMDGAWQEVLARLLLLVEGVY-
1255 >tr|L7L9M1|L7L9M1_9ACTN Uncharacterized protein OS=Gordonia hirsuta DSM 44140 = NBRC 16056 OX=1121927 GN=GOHSU_25_00750 PE=4 SV=1
1256 ------IRQAVLESLARYEESHGDPTRAIYERFYRVHPEAIEELAF-D--------TVLENRMMAGILALLADVADGSidpgGAVYWVSD-HV-AWEVSETMIMGMFGAVRDTVREGLGPEWTARMDADWAGLLAALAPAMRDAV-
1257 >ERR1719478_64653
1258 -SLPTAQIEAIRNTLNMVISaapSRDAAADTFYQTIYDASRIIQPYFVS--------PRAVQALKFVQGIANDLAVLDDPPQLKilvETRSFGHL-ALPVSVPLVVKVREAIMDLFNVELGSKFTAVAKTGWTAYLNYVGGAYI----
1259 >tr|E3MNQ8|E3MNQ8_CAERE CRE-GLB-30 protein OS=Caenorhabditis remanei GN=Cre-glb-30 PE=3 SV=1
1260 -HLTPIDREILNKSWAIVSKDMQQVAVNIFQMIFEQAPDAKLMFSFMmkDYkeDKKSNEFIFHAVRFLQVIESTMTHLDDPSQldaVFLNLGKIHAkheEQLGFSAHYWSVFKECVLFHFRKAMKAHnkFSkhkemsfAEIDSAiilWREVLRFIIDRMKVGYC
1261 >SRR5690606_31308825
1262 -FMGYANSDIVLQSYGRCC-RDEPFFEHVYNVFRSQSEDIRDMFTHTDMT-------EQRRLLRAGITWMIMHSRGGgRSKLESLGKSHNrHGYNVPPALYRHWLDALVESVAAYDP-HYDATLEQHWRGVMTPGIEIIASAYX
1263 >SRR5438046_4862914
1264 -------SNPIERSFELAAERCEDLTPLVYRRLFDAHPEARTMFRTE-GS---EL--VKG----SMLALTIDAVLDFAGertgHFRLIEaevSSHD-AYGTPRELFVAFFGVIAQTLREIVARTGRTTSMRrgGSCSVTSKVSLQGS----
1265 >SRR6266403_3319847
1266 -----------------------DAARL--SPPVSQTPGSQNDVPKR-RQ---PA--GKG----FNVGADHRRHPGFRRraigELRMIScevQSHD-AYGTPRELFGEFFGAIADTLREILGSDWSPEIE-eAWRELLVELDRVVT----
1267 >SRR6266481_9249308
1268 --------------------------------------------------------------------------------------------TNWRSLVQFALEEIVTDIDLLL--DRIVVAVDavgdqrvaRDDRILVELDRIQA----
1269 >ERR1700744_2408068
1270 ------------------------------------HPEAESLFRRG-PS---MR--CPT----GRP----------RSgtpg------gscwtkliaSAlSA-RHKSRRLKSSLPLEEIRADVGFLL--DRVVVAIDavgdervvRNDRVLVRLDRVQS----
1271 >ERR1719178_87025
1272 ------NKHLIDETMERTADaNISDLGSICHRKLFSLSADVQNYFYKP-----NTMVAYILEKVLYILSNLSHEPVAIAHEIRALGMRHI-KYNIPPIYFPLFGKALVFTFGSTLEGFWTDDIENAWGSVFDFVCRCMTR---
1273 >ERR1719158_1490032
1274 ----------------------------------------------------GGQLSFICRGHSSRIN------------RNALRVRRsrI-TNRSHSNCFSSYT----------RCSISSITCASAWATCLLR---RL-----
1275 >tr|A0A044TBZ8|A0A044TBZ8_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1
1276 -NFDDAEIQLLRRSWKTIKPEKQT---------VLQCPEVRRFFPFMNSdlkscEKKNKRFVFQALRFIQvdmtIFNEIIISSF-------s----------NDIAILMLVFLECSIHQIRITLLNSkldlWNRKDvdnvIILWWHLNSGICGKIK----
1277 >SRR5215831_5553854
1278 -------VTDLHRSLEIAAERGGDIYPAIYDAYFARCAGSRDLMELTDIC-------MRGRMLDSLFELLMA--DDAASQVAYLhfeTKNHS-SWGVQPQMYDNLLTATRDTVRGACGPDWTPAMAAAWDARIGDVIR-------
1279 >tr|X1ZVE5|X1ZVE5_CAPTE Uncharacterized protein OS=Capitella teleta PE=3 SV=1
1280 --LKTEQVALLKSSWQQLCVKrsPYFLGRQIFLRVFELNPEIKKSFQFGEFHgndlINNPMFKIHVKNFVSVIDSSIRSVDSLKTVlAPTLhtlGGTHQSVEGFNKNNLEIFLKAMLLVLRQEFKSALDvddLEVEVAWRKLLEFIVYQIHIGYR
1281 >KBSSwiStaDraftv2_1062776.scaffolds.fasta_scaffold1083625_1 # 3 # 881 # -1 # ID=1083625_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.686
1282 -----------------------------------------MEYEI--------CLEPSGIRFMADAGQNIVEAAKQHGIpIKHGCASgscgdCK-GTILsgDSEQGPFMPLLLLPTERAA-G-------MAILCKLYP-RSDLRL----
1283 >tr|H3NRG3|H3NRG3_9GAMM Uncharacterized protein OS=gamma proteobacterium HIMB55 GN=OMB55_00005550 PE=4 SV=1
1284 ---SQSDIAIISESLTLCGDCLEDITPHVYRRFFELDASAASLMEYSDEH-------MRGRMFASVLELFLSdDPFESDGFLAWELDNHVSSYSVTKSMYESLFKAFFEVAEETLGEDWSGDFERAWTNRIARIMAEVS----
1285 >tr|L7MTK4|L7MTK4_SYMRO Neuroglobin OS=Symsagittifera roscoffensis OX=84072 PE=1 SV=1
1286 MQVSEEQQSLIMEDVQVLLPNYDDFVEDVLQQFMEENPETFQIFPWADASKtakemrSHPRFKSHAKSIGKVISDCLVDLNGVKKHepkLSSLGAMHT-KKKVPTELFGKLGGCILTQVVKRVSeAKWSEEKKEAWLKAYGIITV-------
1287 >SRR5215204_501118
1288 --VTRRDWQRLLENWERLQPSADRFATVFFDTLFAWEPQARQLFGGA-------TLETQFLRFAHLLTSLVSAQDHPDELDRRIDAViRCFAGgDPPRKREDAIRVAVAAMLNDVYAAGITPETRASWQSAYIGVITTIRS---
1289 >tr|A0A1W2GS79|A0A1W2GS79_9BACT Uncharacterized protein OS=Reichenbachiella faecimaris OX=692418 GN=SAMN04488029_4044 PE=4 SV=1
1290 -DLNIRERKNIRDTWKVLAPNIHEFAFSFYSNLHSLDSSLVPLFENE------FGIIKQGDKALYVLGFVVASLDNLMvareGIKKALEGVFMEHQHIKRADEQKVMKAFLQAMKSTLRGVWTNEIAISWYRLLSLISAVSI----
1291 >SRR5512143_1477374
1292 ----------EPhDSCVRCF-AVPTFVGRFYARLFSEHPDVGRYFVGIDCA-------RQEQLLRASIPLLVLAPGgsaAARAALERLGRHHGpDGIGVEDVHYERWIACFLATVRD-CDHGWSPAVDSAWRHTLAHGVAVMRRAA-
1293 >tr|O97381|O97381_ARTSA Hemoglobin C1 polymer OS=Artemia salina OX=85549 PE=2 SV=1
1294 -GLSGLEKNAILNTWGKVRGNLQEVGKATFGKLFAAHPEYQQMFRFFqGVQlaelVDSPKFAAHTQRVVSALDQTLLALNRPSDFVYMIKELgldHI-NRGTDRSHFENYQVVFVEYLKETLGDSVDEFTVKSFNHVFEVIINFLNEGL-
1295 >SRR5512139_12076
1296 -----TDLELIEASIEQMLDLETEIIGDTYARLFAHCDGARALFGPNTYG-------PRAQMVN---ETIIAGLDLLRGepwvheYMTQHGVRHRHSYEVTDAMYRTYAESLLGAIRERLGDRFTPELEAAWS---------------
1297 >ERR1719193_549257
1298 -IFTDDELAILKDVWAHLKHHTAGAGLTILDHFFKRQHWALERFEALrDMYgnihpdyMKIDLMRFLAVDLMEGIDIFVTGFFERD---PEVTDLiadvgyaYV-KKIIIESEIEIFVDSMLAAMEELLGEDtWK-KNMAPWKKLMPVVAEHFSRGFK
1299 >SRR3989304_6997408
1300 --XMTTNLDAVTASYHRCRA-SAGFFDTFYECFPARSEEVAEKFRQTDFT-------RQKLMLRESLISMLLFnlgTGSARAELEQLAKRHSRdRSEEHTSELQSRLHLVC-----------------------------------
1301 >tr|A0A0P5Q0G6|A0A0P5Q0G6_9CRUS Uncharacterized protein OS=Daphnia magna PE=3 SV=1
1302 -SMKGRGSCFDQGHLESCKKN-GNIAPKAFIRYLKLKPEAQKKFAAFaEVdladLPTNSHFLNQAYTCLAGLNAYSDNLGKNPKSCPYLNSPAF---KdVKPDELKLFGEVMFNVMEKNWTIIFPRQARKAWKDGLTACDVA------
1303 >tr|A0A2D4BL26|A0A2D4BL26_PYTIN Uncharacterized protein OS=Pythium insidiosum GN=PINS_002968 PE=4 SV=1
1304 -------------LEKQQNYKVTTLYDVFYAHLEQHSPELKPVFRS--------SVHIRGKVLVHISVGMRTLIASEnfVDKVLPLTKTHR-RFGVKPEHYEPLGRALLHAMQVVAL------ITRDRGRVEEPTSIILIQ---
1305 >tr|A0A024G680|A0A024G680_9STRA Uncharacterized protein OS=Albugo candida GN=BN9_028420 PE=3 SV=1
1306 -------------LdGMQPAERMELLYDTFHKFLELNAPELKPVFKT--------SKHTRNVVLQHIVGGLRTMLAQNvhIERVRALTKTHL-QFGVKMEYFDLLGQAVIFSMRQCSGTHWTNEIEEAWRRLYGHCSVILLR---
1307 >ERR1719474_2118124
1308 -SLNPTQKCVIVATWHSIFlKHMNFMGKQLFVDLFKVEPNILKYFDAFrDVGlanlLQSRSFQNHGVRIMNLVKFAVENLDNPEKLqdhMHALGRLHV-KKGIDSKYLNIMGPTFCQAIRPMVMaeGQWSIDIEGAWIQLFKILAQMMRVAYE
1309 >ERR1719244_357615
1310 -WFVPTEKCIIVATWNTIFfKHMNTMGKHLFMDIFKMEPNVLKYFEAFrDVGlsnvLQSRAFQNHGVRVTNLVKFAVENLDNPEKLkdhMLMLGRLHV-KKGIESRVLDLMGPTFCAAIRPMVMaeGSWSLDIDSAWAKLFRILVQMMIPAYS
1311 >tr|A0A1I2S201|A0A1I2S201_9CORY Uncharacterized protein OS=Corynebacterium spheniscorum OX=185761 GN=SAMN05660282_00995 PE=4 SV=1
1312 ----------------LLRQESGHLEPELQLQLYARHPNAQWLLRA--------G-KAVPAELVELSIHAIAAADAEgaldALAEARIRDLglaQR-RFGFPSELYQDIQEIMVSLLRTTGAD-LPFPVEFAAERTIARVCVLLQE---
1313 >tr|A0A2S9Z387|A0A2S9Z387_9CORY Oxidoreductase OS=Corynebacterium sp. 13CS0277 OX=2071994 GN=C1Y63_03975 PE=4 SV=1
1314 ----------------ALTRHPELFRRAVTATFTGLCPAAGVLIA----------QPAAHADLPVACAWVLRNSAE-qvsDYAAAVIRQLgceHR-RSSTDPAHYALFARALRAGLDAVAAEDdLEPADVAHAAHLLEHCCTLMRD---
1315 >tr|W5Y4C7|W5Y4C7_9CORY Putative oxidoreductase OS=Corynebacterium vitaeruminis DSM 20294 OX=1224164 GN=B843_11695 PE=4 SV=1
1316 -------------------RNREELSAIAFDMFFATQRDARTRIRA-------------TPAIADALTLLARSCDSEgklpLDVEKRFLQRattLC-AHGLRVDDLEPLAESAHRAMLITAGG-QPFELVLPIERALQQLARTVVE---
1317 >tr|A0A172QXP0|A0A172QXP0_9CORY 2-polyprenylphenol hydroxylase OS=Corynebacterium crudilactis OX=1652495 GN=ccrud_12565 PE=4 SV=1
1318 ----------------LVEDNAQDFLRAVKAQLLQLAPQSRGHFPT--------DDDLTHISIAETLSALLDGTGKEgevdEGTLAFFQEAaldAR-RFGITPDMLKALGEAVRTELLELCSD-LPFENVLFAERAIAATSAASIQ---
1319 >tr|A0A1W1UZL1|A0A1W1UZL1_9CORY NAD(P)H-flavin reductase OS=Corynebacterium glucuronolyticum OX=39791 GN=SAMN05660745_01670 PE=4 SV=1
1320 ----------------RLRSVSPEFHEHVRANFFDKCPETMLVFPL--------HKENVHADLGRVLSFVFDRTPVDghltDEMRTLITQLgkdHR-KYNVSPRYFHPFVECLRDSLLTLCSD-LQFKYLNGADTALGEVSTLLAR---
1321 >tr|K0YDT0|K0YDT0_9CORY Uncharacterized protein OS=Turicella otitidis ATCC 51513 OX=883169 GN=HMPREF9719_01398 PE=4 SV=1
1322 ----------------ILGAQRTAFRDATVDYLLRRLPRLRRVAPL--------RQRHRAEALAERAVGLVARSPQ-gmlrGEDAADLERAgraNR-RLGVPLRVYPVLAQALKAGLRAAFEAAgePYTAAARDAEALAEAACASLAR---
1323 >tr|A0A2C8D7D3|A0A2C8D7D3_CORDP Phenol hydroxylase P5 protein OS=Corynebacterium diphtheriae OX=1717 GN=mphP PE=4 SV=1
1324 ----------------LRLVTVTAHSIQAVADElraHRAEFIQAANQKP-------------DSPLADAIVQLVDHTDLDghvpESIATSWLQHaaaAE-SLGVSRDYYLTLADASRSALRHICAD-LPFAEVLGAERAITSIANTLT----
1325 >tr|A0A0G3H0V1|A0A0G3H0V1_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium mustelae OX=571915 GN=CMUST_13735 PE=4 SV=1
1326 ----------------LR-ALSEEFSRDVFHSFFRSHPHERLVISP-------------EFPVAAAVSFICHGADANgtlyPETENRLRELaeiIT-AHGF--RSILPFADAITKSIRHYCMR-DDFFGTIAAERAVEQAAEILNH---
1327 >tr|A0A0G3GTQ0|A0A0G3GTQ0_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium epidermidicanis OX=1050174 GN=CEPID_01535 PE=4 SV=1
1328 ----------------TLRAKSPAFRRDVLRDFFSQHPHMRLKFAA--------NEDHAHTELVFALTYLLENPTD----PELIRTLardHI-KVSPGQEVVADFFAILHRQIHRYCAD-LPYEEVRQADLKLQEIA--------
1329 >tr|A0A0F6R111|A0A0F6R111_9CORY 2-polyprenylphenol hydroxylase-like oxidoreductase OS=Corynebacterium kutscheri OX=35755 GN=UL82_09495 PE=4 SV=1
1330 ----------------------------MVASHfYADVPLARLSFRL-------------QPSLVDTLIAGLSHP----LNITAW---ahdLA-HRGVDRSFYVPLSAALQHAVCHICSA-LPLVDVLAVEHRIDQIMKQLLA---
1331 >tr|A0A2D7G1P9|A0A2D7G1P9_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMP96_10880 PE=4 SV=1
1332 ------EQTCIERVLDCAAEDQPDFQQRLYDRFYQLAPSAEALMIHIDEE-------VQGKMLAEVIRLFLSpDVaVTDQQYLLFETKNHAQAYFVEPEMYRALNQALFETLKVGAGRIWSSEVESAVHNRLSKMLHGILEAL-
1333 >tr|A0A2E1GZ77|A0A2E1GZ77_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ03_04085 PE=4 SV=1
1334 ------DQAWIETAFDCAAVDNLNFNVDVYQTFYRAEPSVASLMAHIDEL-------VQNKMLSEVIRLLLNpNIeSEEAGYLNFEVKTHIQGYGVSPLMFLSFNRAVYEVLQSSAARVWEDDLAVAVTRRFAVLSDALTEAL-
1335 >tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium OX=1913989 GN=CMQ23_00915 PE=4 SV=1
1336 ------MQSSIHALLEQVATTDIDFDKKCFERFFQISEEGKTLMAHMDRV-------HRGKMMAEIYRLMMArDLDDEADYLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY-
1337 >tr|A0A096P8B0|A0A096P8B0_OSTTA Flavoprotein pyridine nucleotide cytochrome reductase OS=Ostreococcus tauri GN=OT_ostta17g00030 PE=4 SV=1
1338 -------------------------------------------------------------------------------------masvgsgat-DDD-GVDVPVSRCPFAhGTVTVDPYPGYVH-G---KNPRVCPRGCVPRPPSKP----
1339 >SRR6266498_4102119
1340 ----------VATQSYR-MHCQgrPAFYSTFYQRFFQHCPEVKTWFS--NM-------HAQYDKFDQALQFLLNYRHGCMEEPTVLSmtaNKHR-AFKLSACQFDEFERALLETLKESAHE--SDRVLKAWETTIR-----------
1341 >ERR1719474_730311
1342 ---------NIHVTFDvALTSDPKGFAEKFYRGLLKEQPDIGQLFLDK-----NTTFDTQSARFMAMLMHAIKMLDDTDHFTQSLDSLseaHV-GYGVEIPMLDAFGKSLISQVKqfnieyyqqqqnhkgddqkeETVdilkVGRWTTKQDDSWKWFWSVVVGVMSAG--
1343 >SRR6266536_2537548
1344 -PLSGREREIAMLAAAGLA--SKDIAERLYLSVRTVNNHLQHAYTKLGVS-GRAGLAEQEIKFAEKLTEIVRAMPRLDELLthtRALGARHV-SYGVRAADYQTLGNALLAALAAVLGGSFDAPTREAWTLAYNLVAETMLD---
1345 >SRR3954465_13942299
1346 -PLTGREREIAMLAAKGIL--SKDIAARLSLAVRTVDNHLQRAYTKLGIT-GRDQLADVLAHDTTTHPGPX-----------------------------------------------------------------------
1347 >tr|A0A1Q9C6P6|A0A1Q9C6P6_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene41206 PE=4 SV=1
1348 --CVCDLAQCRGRSWAAFFVDI-------QAAYYETSRS--LLFEG--------PSQDP----------ALVALQLPAHVQAlisDGALQGL-GI--PQEHIALLQDCvecsfwtftgqtqqvmatsgsrpgdgladvlFGALFAVILtcLEAKCQQCGLVHQSMSDALGVPDR----
1349 >tr|A0A0E9N6V9|A0A0E9N6V9_9BACT Uncharacterized protein OS=Flavihumibacter petaseus NBRC 106054 OX=1220578 GN=FPE01S_06_00290 PE=4 SV=1
1350 -QMNQQEIQLVCQSWQQAAEEPLRLAILFFDRLFEEAPELRQVFRT-P-------MSEKTRQLLVFFGFHINRLASGSIRRPSFEAYVW-EELLTDAQKGFLMETLSDTVAALLKPDWTPALQGAWGSFRK-----------
1351 >tr|A0A2G2R0S2|A0A2G2R0S2_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_09030 PE=4 SV=1
1352 -IVTPDQAIIIQESFARLSTSSDSLIQDILGTIAEGNSDLAVTIT-----FKSQNLVEQIS---TALSHIIDQLhtaDNVAEYVAHFGELLL-AQNVQDENYSSFGEALLSGLENALQNDFTAEVRDAWTSGWAMLSGIMREA--
1353 >SRR3546814_3775940
1354 -----------ERSLEAVMEAGKDITPFFYDRFFALYTEQRANFYHFES--------TSGTMVNEMITSVLALASNEAWLtnsVQNFVAAHR-SYGdIPTDAYARLQDVLVDNLAQDSKSTSLNTsNYCANsl-LYSVX----------
1355 >SRR3546814_13566968
1356 ----------------------FTIYTTLSLNVVLPFVTHRSNFDHVES--------TSESMVIEMITLVLALASKEAWLtnsFQNFVAALR-SYGdIPPDAYARLLDVLVVTLAQVAGSRWTDEFETAWRWYVSG----------
1357 >ERR1719397_23434
1358 -NLTDCQVRLVLVSWPVILEEFQKVGVQCIVHLFEVVPYMKEHFQQLiNNSgkfdpkDGNvmqTVMENHAKLVMNVVHEVVTNIDALDSVTEkliQVGEKHC-KAGVEQRYLDIVGPIFCNAVRPVLLRsgIWNNRTEEAWMEVFTAIASTMRTGY-
1359 >SRR6478672_7358577
1360 ---------------------------------------------------------------------SRMp--CNSSTlkrrpSatscTESPTSTSP-WESAPSST-PSSASTYSPRSLRFWATPSPPRSPPRGGEVYWLFALQLVA---
1361 >SRR4029450_1817054
1362 ------------------------------------MARLLRVFNQGNQA-----TGEQSKALPGSgVASAV-QLIDPNApslahVMRRIAYKHM-SLGVCAEQYIVVGHYLSRRWARSSVRRSLPRSRQRGRKFigFLPFS--------
1363 >tr|A0A177B679|A0A177B679_9METZ Uncharacterized protein OS=Intoshia linei OX=1819745 GN=A3Q56_02502 PE=3 SV=1
1364 -GLTKTDINMVLGSWESIN--NDEASSIFYRELFNTYPDTKSLFVKFySVdndkLIDNPAALKQLRVTWTAITTLIDYLKkgRIDEANKaidYLIEKHRKIKTFQGPMFNMALEPLLYLVKEKL---TSQAYIDAYKKVFGAIFLTIISKY-
1365 >SRR2546427_1691122
1366 -------VVLLQTTFLRAAEMrigKRNITDFIYEDLFLKRPQLKPMFTNQ---------VLQRHKLGKMLGSIFIHLRDQdwiDEHLRDLGAMHW-RAGATPEVYPWIKDSVLAVLEEGMAPsGWNLRCQREGAGALGVSAQGMLMGY-
1367 >ERR1719244_673251
1368 -----GQKDLIIASWREIRICLDEVGFDTFKQLFAHHSDIRAYFPAMkKLSSndveMSRKIKEHSTRIMAVLKLFVDNIYDLEKIEPSIedlGRNHS-FRTLLGLFLSE-------RISGQL--AWR--------RCCFNYLNIS-----
1369 >ERR1719369_2640530
1370 ---SPSQVDMLRSSWVILVRQLDEIGMKVFAKLFTVHSDIAQYFPQAkRPGS-SVFIKDLSHRVMNLLKLIVDNIEKLEMIRDTIrilGEKHY-QIGVRSEHLDLMGPIFCETIRPILVanNVWTHHVGDTWLST-------------
1371 >tr|A0A2G2R4B7|A0A2G2R4B7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=COB59_07540 PE=4 SV=1
1372 ----------------------QSASDKFYNVLQNDLPEFTQLFTN-P-------E-KQHMMFYAALRSIDGLKDNktkLAVYLRSIGVKHK-MLGLTHYHMEIGRNAFEQAIFA-GGKDLTHDQRQFYIDSFSQIEKNM-----
1373 >tr|A0A2D9F7C7|A0A2D9F7C7_9PROT Uncharacterized protein OS=Rhodospirillaceae bacterium OX=1898112 GN=CMM61_16775 PE=4 SV=1
1374 ----------------------EAVAEAFYAALFREAPDVERLFRD-E-------T-NKTVMFVNALESISGLERGdphFADFMAMLGQRHR-DIGITQQHLKAGWTAFNEALDV-GGGNLTLPRRQFYRDAFKKLVAAM-----
1375 >ERR1719378_1531842
1376 --FHPgaDGVHRIGGEESQ--AEVRRQRSLSLPKFLDSLSGEKEKFAFNfDSMgnvlpnfHASHAQKIHSMKIMDAIDAVISEILRDHPIKQRlmdVGYAHY-ELHATSKDIRKLTTAFYKGVKDLIGIDDdNDRHLVAWKDFLNKIEEGFK----
1377 >ERR550534_2245262
1378 -----------------RDLRHPLGLLLALH---------GGFLSFFhGFFgsykadaMQTEFMKNHSIKIMNALDTVIAGITAQQPMREAvreIGRDHY-HKKIDKIHMRQMADGMLEGLKEVIGDAKdSTRKL-------------------
1379 >ERR1719192_2788519
1380 ------RREIIGTMWESFREDSVSSGLFILEHFFSTYPDEMDRFTFAsGGQtdketplafiMKRERMRIHSAQLMNALDRNGHVY--GRSpgCMDQapqSHRG-------------NVCRRTGKSSGIA---------VFKWRVA-------------
1381 >ERR1719367_1435250
1382 -------KTQLRSTWNVIMSDMASIGVVMFLKMFETHPETLSSFIR-NVYSikeiemdewYQENLKLHAIRVMAIVEQVIHRLDEVGSVIKILMKRglsHK-RLGVQRSMLEKMGRSFVLSIQSPLEEanKWDATVEQSWLSMFRFIEFWMGLVY-
1383 >ERR1712004_299484
1384 ---------ILRESWKHLQSRIESLGVVTFLSLFNASSETLHTYLTPeDIATlkeqdkdkmLIEKLRVHPLRIMSVLEKTVHRLEDHQRCLKMLRQYgrkHQ-RFGVPPFMFATWPGVFYLYSSPYWKNlsNGMRTFHKLGKACFNSLHLEYRE---
1385 >tr|S9TQJ9|S9TQJ9_9TRYP Adenylate cyclase OS=Strigomonas culicis OX=28005 GN=STCU_09709 PE=4 SV=1
1386 --------YTVEATWNILEKegMVDRFGQQLYDQLLTKNPRLRVYFYGVDLD-------EQSKTIVRMLGTAVHSYNNPVRTvefITRAGARHR-GYGVTPSVFREMEVAFFKVFPKFVGLDVFEASEEYWKDFWAVVLDLLSR---
1387 >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 OX=582737 GN=TSPGSL018_8354 PE=3 SV=1
1388 ---SSKIITLIEKSWAFVESRCDlmEVSNKFFERLFQRAPALQNMFTKP--------KRVQYVMLAKALDLIVRSAGETKVmneDIKAIALRHI-KYDIRQEHLNVFGSVLVETLANSVGPeNWDEDISAAWASIYGNIAAVF-----
1389 >LauGreDrversion2_5_1035112.scaffolds.fasta_scaffold830278_1 # 2 # 232 # -1 # ID=830278_1;partial=10;start_type=ATG;rbs_motif=TAA;rbs_spacer=11bp;gc_cont=0.316
1390 --------------------------MAFWN----KHPEPAAQFVAP-------TQdtltdefepeeeqGISKEQLLSALNAAQT-------ALMMIDR-D------FNITYLNqKSVDLLKTHEALFQSIWPNFQATeefllGYCIdlfhanpshqrqmlsnpsNLPYTTTITVKDV-
1391 >SoimicmetaTmtHMA_FD_contig_51_4416696_length_1368_multi_2_in_0_out_0_1 # 1 # 216 # -1 # ID=2511055_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.685
1392 ------KVALHTVEFAVADPSARATI--------------------------------------------ATHGLTPDDMAMLLSKRE------------LIGPAFPALLDEFYGKVVEN----------------------
1393 >SRR5262245_66279004
1394 --LEPTDRIRAKQSYLKHCMGKNDFYRKFYERFFQGPEGTmakEMFADK--------DLNQQYVKLDQSLHYLLNFGDQDmmePTVLTTTATIHQ-TKGVAPEQLERFIECLIDTLSKDYQV--SGIEVDAWKNVCGP----------
1395 >JRYH01.1.fsa_nt_gb|JRYH01001677.1|_10 # 8312 # 9718 # 1 # ID=1677_10;partial=00;start_type=ATG;rbs_motif=GGA/GAG/AGG;rbs_spacer=5-10bp;gc_cont=0.684
1396 --MPASWVTELQEIWQDFNKrvgSRQAAGEIIYDAVKEAAPRIVIDdFRIP--------RPVWSSRFVDGISSLIAEASDLKMLRKRAEAMgfsHM-SLALSIEKCELLRDVVVSSIEQECgpgKFSAQCIARKALTIVLNYIAGALL----
1397 >sp|Q7M416|GLB1_LIOJA Globin-1 OS=Liolophura japonica OX=13599 PE=1 SV=1
1398 --ISADQAKALKDDIAVVAQNPNGCGKALFIKMFEMNPGWVEKFPAWKgksldEIKASDKITNHGGKVINELANWINNINSASGILKSQGTAHK-GRSIGIEYFENVLPVIDATFAQQMGGAYTAAMKDALKAAWtGVIVPGMKAGY-
1399 >tr|A0A090KT29|A0A090KT29_STRRB Globin family and Globin-like domain and Globin,structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_X0
1400 -KLTENHRKVIKSSFEIFKKNGVPNAHNIFLRMFKEYPDYKNVWSQFkNMSdeelSQTPLLWKHATTFVFGLERVIRTMDDQEMMILMIHStanQHK-SWGLKKEHFFAMVHLITDILMEEKGEpDEKYAIMEAWESFYDVLGTL------
1401 >tr|A0A0P5DF02|A0A0P5DF02_9CRUS Di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
1402 --KPANDRRIIRKTWDQAk----------------------------------------------------KDGDVPPQILFRFI----K-AHPEYQKMFKSFADVpqae------LLGNGNFLAQA-YTILAGLNvviqslssqelianQINALG-----
1403 >tr|A0A0K0JIN4|A0A0K0JIN4_BRUMA Uncharacterized protein OS=Brugia malayi OX=6279 GN=Bm1_04635 PE=3 SV=2
1404 --LSEIQQELIRQSWQTISAKLEvneqNFGFFVYRRVFEHNPLLKRAFHVEeyDlldSIPREHSIFRQMRLFTNLIALAVRHDNELETeIAPAVFRYGQRHYKFAAEyfnegTVRLFCSQVVCAVADLLEVDIDPACMEAWIDMMRFIGCRLLDGF-
1405 >WetSurMetagenome_2_1015567.scaffolds.fasta_scaffold1216141_1 # 2 # 73 # -1 # ID=1216141_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.347
1406 --FPDGVCMATIELTVLPVRpleD-----DEKFQIILSEAQGGASFNPNDD--------G----GKDDGvlTIVIKNTLQDPKGLKVLVESFgfqHL-DFDLTVPRVVVFRDSMVELMEAELQDRFTYKAKDG-----------------
1407 >SRR5690348_18181078
1408 -----------------SRRRHTRWTGDWSSDVCSSDLETRALFRT------------EGSELVkgSMLAMTVEAIIDFAGersgKFRMIAcEvmSHD-AYGTSRELRSEERRVGKEC--RFGWVAYPX----------------------
1409 >ERR1719191_2635985
1410 --LSTKSLAVVGATLPLVAKAGPSFTQHFYTRIFNAHPALFNTFNISN-----QRTGKQSGALFAAIASCATGLLTsgklPSEMLEGVNHKHC-ALNVAPAHYDVVGEHILGTITDLLNP--GQHVLDAWGELYTALANQCIKR--
1411 >tr|A0A0K2U629|A0A0K2U629_LEPSM Cytoglobin1like [Saccoglossus kowalevskii] OS=Lepeophtheirus salmonis OX=72036 PE=3 SV=1
1412 --LTKKETFLIRESWKLVTPEMTKHAVGYYIGMFVSYPKWQDRFfRRIkGIplrdLRNNPILAAHSSQVFSAVSNLLNNLENTEVIVegvKKIARTHW-PLNIRGKELEAGLVLLLDYLEASFPGQISKECGDAWNKMFNAMSGVIVD---
1413 >tr|A0A2B4SAV5|A0A2B4SAV5_STYPI Uncharacterized protein OS=Stylophora pistillata GN=AWC38_SpisGene8312 PE=3 SV=1
1414 ------------DTFGPK-ESRCREESVCKVRLLELNPNLQDAFPSFrGVsldeLMNSRSLFLHSKRLMAVVEEAVSSLDDAKELIEDLtnlGERHL-AMSITEKHLKNLQRAGPATNQDAKHRLLANKGTAQIDRHIARMEDTRLP---
1415 >GraSoi2013_100cm_1033763.scaffolds.fasta_scaffold146077_1 # 2 # 316 # -1 # ID=146077_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.663
1416 -------------------------------------------------IFESFCLAQ----ML----YETVGMAREPKQERIVS---------------------------------------------------------
1417 >SRR5690606_18427011
1418 --VSHRN---AHEKHQPCH-AKL-------------RPLLRE-----------------PRLLRRLLYDLSGqLTRrAGEVRPERHG-----GAEASAX---------------------------------------------
1419 >tr|A0A0N4TEQ4|A0A0N4TEQ4_BRUPA Uncharacterized protein OS=Brugia pahangi PE=3 SV=1
1420 -PLTRKQKFVLIKNWKGIERDVTTAGIEMFLKMLTEHPEYYEFFNFRNIANtakekqaSDERLSAHGAAVMKFIGKAISQIENADAFFMLLEnngRQHAHRGAFRPEMFWASYSFTCYSFSNGFIRNFFSNI--------NLLLTKVEMSY-
1421 >tr|A0A2M8U0Y4|A0A2M8U0Y4_9PROT Uncharacterized protein OS=Ferrovibrio sp. OX=1917215 GN=CTR53_17535 PE=4 SV=1
1422 -PLSPAHLGLVRATFQILAADRDRLTEMFYARAVALDPHIQRPQLV-------SNMVAQRLQFMLVLTDVVQQLDDLPSLaqtAATFARRHG-TYGASDPRFRTARAALAWAVDRILETERNSAIQLAWNAAFDLVEALV-----
1423 >PlaIllAssembly_1097288.scaffolds.fasta_scaffold05791_3 # 3730 # 3864 # -1 # ID=5791_3;partial=00;start_type=ATG;rbs_motif=AGGA;rbs_spacer=5-10bp;gc_cont=0.556
1424 -VPTAQDKQIIRDNINILKAKKSNWGAKTMLKLLKAHPDSIKLFPKFaNVPlhelANNAEFLAYGNVFSAGLNFMIDNIDDPTAVKHILSGKDAskyFVPGVSIrQQLEETFRVAIEAIGEELGPRFTPKTRAAFTRVLRFLNQVQDDGF-
1425 >tr|A0A1I7S4N0|A0A1I7S4N0_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=3 SV=1
1426 ----MADRQILLKSLEFMPltRDGEKQGVEIYKYSFANMPAMMPFYHLADGftadsTITSDRFQKLGCKLALATHILANLADQPETLKAYAREHvlrHI-SRKVSPRMFRGFFDILVDWMATKTT--ISEEARREWAKLGDLFSY-------
1427 >ERR550539_353004
1428 ---------------------------------------------------------AMMQHLVKNLHDISRF---DSDIrelLTRLGQQWL-QKRVPLDFAVLLGNEYLEAvlpffHSNV-GATLALKLEVSLAYLYKEAMHFLLL---
1429 >LakMenE01Jun11ns_1017448.scaffolds.fasta_scaffold3583117_1 # 3 # 191 # -1 # ID=3583117_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.561
1430 -ALAPEAVTKMRAGAEAMLAHPQEAGVFFYETLFDARPDLVSLFRTANMD-------ALSRHLIDTVVFLSRAADDLTGLrddLRNLARVHQ-VNQIPPSEYAHLAAPLLETLSRF-GHPLDAQMIRGWEVLFDRVSRIVAE---
1431 >ERR1719199_1665450
1432 --------PMIRECAAKVVQmDIVELGLRFYVHLFTINPAASAFFTKPKW-----MISAIFGGVLRFYVHLF--TINPaaSAFFTK-----------------------------------------------------------
1433 >SRR5262249_23394332
1434 -----------------AIPISGVASELFFSRLFAIEPGLRHCFDG--------CFLGRRRAFEWMIGAAVRGRPDLRSFIQALEFMVAPSDATVHQECERLRDAFISSLSGSLGPRFTVEMMNGWLAVFELLH--------
1435 >tr|A0A2V3J537|A0A2V3J537_9FLOR Flavohemoprotein OS=Gracilariopsis chorda OX=448386 GN=BWQ96_00611 PE=4 SV=1
1436 ---DPETEALIKNTLPIFTKHSQQIAVQLYANLFEQHPQLKPMFCLEFLqTPgqckksPGTGMSPQAKILSDSIVNFCANLDNIDMMNNAIERIcakHV-SRHVKSDHYPAVAGAFSRAVRQVLKNELSESDLKAWDTAVSALAGVLVK---
1437 >SRR5688500_3946624
1438 ---DSRTIALIKESFTPIAGRTLELADRFFNNLFTRQTSVRGFFPA--------DVTEQKRQLPGVIQTILENGDKLENLEPQLREVgreYA-KQGALPTHYGAVARTFVDTVREMSGIGWQARYTRAWTSLFDSLTKAI-----
1439 >ERR550532_2368357
1440 -------ISMVAANFKTVKS-NQVLANTLFEHLFELEPSSKALFESK-------DLTQHKTKFVGFIGQGLKMLqgKNAKKELRELARMHM-EMGVTTLHFVFFEEAMLLGLRAAHGDKFDGELATAWTYVV------------
1441 >ERR1719264_1394560
1442 -------ISVVAANFKTVKS-NQVLANTLFEHLFELEPSSKALFESK-------DLTQLKTKFAGFIGQGLKMLqgKNAKKSSGSLPRCTW-RWE-------------------------------------------------
1443 >tr|I2K200|I2K200_DEKBR Globin, putative OS=Brettanomyces bruxellensis AWRI1499 OX=1124627 GN=AWRI1499_0864 PE=3 SV=1
1444 -QLTREEIDLLRWSWRLVTVDddSTSLGGNTFnAADFSSYLFCIQFYNNFiSMDekvvEMIPSIRHQASSFADVLNQAIGTLEDLSkmqELLTNLGKLHARILGIERSYFKTMGEALIKTFRDWFGNNetFfPLILEEAWIKLYCFLANSIIQ---
1445 >ERR1719396_178111
1446 ---------------------------------------------------------------AHGPGRLHRRLREQHPGLvpaagaqrPadGDLPPAL-RLVYHPPAVQRGARERDEVHRQGPGGVVTPEIAAAWSEAVLFLSKACID---
1447 >tr|A0A2E6CQF7|A0A2E6CQF7_9DELT Globin OS=Sandaracinus sp. GN=CMN31_05165 PE=4 SV=1
1448 --LDHSTLHAVRSSFE-RV-REPAFAAAFYERLLARDPEIRRRFAHTDFE-------RQRELFLHGLFALVDYASGGatgKLAIERLHAMHGpEQLDVPAALFDVWRDVLLETLAEHD-PEWRGELAVAWRAVLGPGIDAVRSP--
1449 >tr|T0T344|T0T344_9PROT Uncharacterized protein OS=Bacteriovorax sp. DB6_IX OX=1353530 GN=M901_0762 PE=4 SV=1
1450 --------TEVRKCYFRSI-ENPHFPKYFYRNLFFLSPKIEDYFKNTD-------WEHQEKALMLGLSHLFHYFDEQdtfhHKQIVRLANVHSHdNLNIHPHMYYYWIEALVMTCKKVDP-QWYEDLQYYLRETVFFPISFMISLYH
1451 >ERR1712080_92393
1452 MSLSAGEITAVTASFEAVKADLGTNIGKVLQKLVAEHPDLKPHFPWHavptADLLGNDGFKTHAAQVGRGFAEAAGNLSNLSaceGYYVSLGDRHK-TRGFAAAQVPMVADAFVAALQ------LTGDDASGWTKLITFVGSSIVSG--
1453 >tr|A0A1X6NYK5|A0A1X6NYK5_PORUM Uncharacterized protein (Fragment) OS=Porphyra umbilicalis OX=2786 GN=BU14_0331s0026 PE=3 SV=1
1454 -PPGPKAVRLLCATAPTLRAAGVPLVHRFGHLLVTRYPAVAARFDVSpaGD--WEGAVVAQVARLTAAFLAAAERMGEPACLNPVLDRIaakHA-ARVLPAGLYASVGDCLLEAVGEVLGDDAPQEVLDAWDAAYAWLGGALAA---
1455 >tr|A0A2S3QTP4|A0A2S3QTP4_9PROT Uncharacterized protein OS=Halobacteriovorax sp. DA5 OX=2067553 GN=C0Z22_01530 PE=3 SV=1
1456 ------DKDLIIESFARIEPNLKNFTNAFFDNVVILEPGMQKVFAHADRE-------QLKASFIRALSITINNLKNPEYLKYYLQGLggnQI-KYEVSETYFPIFEEAFIQTLMLFHMNSWTPKLETAWRDCFYYIAEYIS----
1457 >tr|A0A0N4YFT6|A0A0N4YFT6_NIPBR Uncharacterized protein OS=Nippostrongylus brasiliensis OX=27835 PE=3 SV=1
1458 -RLSEHQRQIIIETFAEMEHHAVKNGLKMLVKLFSEYPNYKQIWPQFRAIPdsslmNAIALRRHASVYMCGLGAIIHSMKHENELALQMtriAKAHI-KWNVHRSHVVHMLDPVLDIVQE-CNPNYNNEMKQAWTTLYHIIADL-IEIY-
1459 >ERR1719487_109746
1460 MIMSAEAVQVVQDSFHRVDScvqIRDALEDVFFPHLFASSTQIKELFADVDL-------NMQAPMFANILNSTISSLNNPTELRPLLADFgeKCKKYGVQGEHIATAGESLIFTMKSI-DDQWDAEVEAAWMAACSAMENAA-----
1461 >tr|A0A132A213|A0A132A213_SARSC Globin-like protein 2 OS=Sarcoptes scabiei OX=52283 GN=QR98_0035350 PE=3 SV=1
1462 ---EREEIEVLREQWDRIVHyHQECFGMKLFQRLLQLHPEYRPLFGFEeTVeeIQNTQRLKAHGINVVYMLNMLFDNFDDMDmidELIFKLVKLHM-MRGIDQIWLDDIIEPFELVLEEF-NAKIQIERIEVLRKAFIFIKNRMQELY-
1463 >tr|A0A1Y3BHE1|A0A1Y3BHE1_EURMA Globin-like protein OS=Euroglyphus maynei OX=6958 GN=BLA29_010084 PE=3 SV=1
1464 ---CEEELQSLRIQWDKIVHyQQECFGLKLFLRLLDLHPEYLCLFGFTwDEfnYHETNQLRAHGINVMYMLNMLFDNLNDMDmfdELIGKLIRLHL-CRGIQKSWFDDLCAPFLTILEDF-SEKLSIEHPESIYKAFMFIKNRIQQLY-
1465 >tr|A0A1Q9DB21|A0A1Q9DB21_SYMMI Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Symbiodinium microadriaticum GN=AK812_SmicGene25788 PE=4 SV=1
1466 -KLPSHDVQILRSSWHQLMDavghDREQLGDVLYVGLTGSLAVLKDQFIT--------PRAVMSLRLFNGFRVVVEKADDPAALLNFTETLafkHL-SYEVTQVRAGLVADTFLEVLTQNVTEELPQGAGAVWRQILMYVGSAFR----
1467 >tr|K1PS51|K1PS51_CRAGI Uncharacterized protein OS=Crassostrea gigas OX=29159 GN=CGI_10019581 PE=3 SV=1
1468 ----YRQIFNIRNGWKSVARVMEDTAKETLIRLLEKHPEYREKYPMIaSLNteeelRESLEFETYAMQIFGLFDEVIQNLENVDAALDEIEHTg----KQLTLQLITDLEECFMNSLHLVLDERFTDTLQENYRLLYGFVKSNIPQ---
1469 >tr|A0A085MKY1|A0A085MKY1_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_01110 PE=3 SV=1
1470 -------ASIIKEQISKIEVN-EENGGKLYEVFFTVKPEFHKFFdlKHAPEgkdVAHNQRFKTLGKLFLEKLKRIVMACEDEHQLKEEIKGLkmdHD-PRHVGLTELKGAKPILMKFIEQQVG--MTEEQKHAWTEMFKKF---------
1471 >tr|A0A183IBE5|A0A183IBE5_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
1472 -------KHVLMEHMKRLNLT-NKLGGKFYHQLFQSlPEAKSQFAEHFDKledVENMKYYQQLGHSLLSLLKELPEHCDDDHALKQEIMKIkkkHD-EKHVDAKMFKKSKPAILKFLTDNTQ--MTNEEKEAWDHLITHS---------
1473 >ERR1719334_3108017
1474 -GLTPKQAQAIISSWENLN---SECSSLLFKQLFTIFPELKEYFGFSKreLvdkILNSEEMIAHMDATWNGLDKLVLSTQTGTRFaaiGKGLGYNHF-KFEIDRQDVHKFMDFFKQVLKDDLKSQFHGDLEEAWNIWCKAVEDVFIMGY-
1475 >ERR1719347_1061473
1476 ----------------KIM---KSClKSRLEHSGFRFSHELIMNFGFAKseLvdkILNSEQMIAHMDATWKGLDKLVLFTQTGTRFapvGKGLGYNHF-KFEIERQDVHKFMESFKQVLKDDLKSQFHGDLKEAWNIWCKAVEDVFIMGY-
1477 >tr|A0A0X3NNN3|A0A0X3NNN3_SCHSO Uncharacterized protein OS=Schistocephalus solidus GN=TR151324 PE=3 SV=1
1478 --FSEFEKDVLLSTWAVLNEEANKHSAAVFTLAGQMFPGLRNLFDIPcaNTekeNCESEAAKRHREAYMKMINGAIECLEYPREdFYDDLLVAgaHYaTIPGMKTEYFKVIKRATLVTWNSLLGEEFTEDVKQSWQSLLDYIITVISEGC-
1479 >tr|A0A2E8WN13|A0A2E8WN13_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium GN=CMQ23_00915 PE=4 SV=1
1480 --------SSIHALLEQVATTDIDFDKKCFERFFQISEEGKTLMAHMDRV-------HRG-KMMAEIYRLMMArdLD-DEADyLNWEAQNHETAYFVPGRLYPIFMRAFKETVAETLDYGWTKADEDAFARRCDQIVTEIQSRY-
1481 >ERR1719359_219123
1482 -----------------IDEepmAEVVSGeDALV----AIA-DLlyQKL-------------------------------SGDEAMAQFLENVdlt--QlanNLRSLlalvfngsdWPEMHLS--gSLiddgYEDFSSILQETL----qaSPg-DDALL--ESLDKLT---
1483 >ERR1719487_376807
1484 -----------------EEEgatEEVASGeEALV----AIA-DMlyQKL-------------------------------SGDQAMAEFLENVdla--QlakNLRTLlaavfegndWPEINLS--aSIidegYEDFSSVLQETL----qtCLg-DNAML--ESLDKLT---
1485 >ERR1712100_485805
1486 ---VGHVVLVV---GRCSFEcrnIVVVEGlDGSLDRLLALRkvvgiglGLPilQQL-------------------------------G----VLRHVGNVa-----------------lKVlrchFLQFSNHVLEVRSRLRldefclvgdivievilrDHgggkHeRD---------------
1487 >ERR1719487_109746
1488 -------RKEIEISHPELLKiGLDNVGTTFYTNLFQDSPQIQMHFIKPN-----RML---SYIVQKTIEMIGDLHPKPREVMKGLKALamrHI-KYDAPPEFFGDFESAMLKTLAQSLKSTFTEAVKEAWKAALQFIASTIV----
1489 >ERR1719221_1379514
1490 --------------------vLMRDIPRSAVALFGI-TVAIfeddyRDMNHEPALL--CAVL---LFVTFTvilLMNLLIAQLNTTYV-RIYQDTVgwaLI-NRASTIVEV----LA-TVSRT-KWTRFVDGLGLDEKLE---FNEGDVG----
1491 >ERR1719460_1401436
1492 -------REKIDNTMDVLAKhDMDDLCNKFCNKW-INADEVNGYFDKPS-----GIF---KFILLRILYLVSTIYHDPREISKEARALglrHV-KYSPPEALLPL-----------------------------------------
1493 >SRR4051794_36238122
1494 ------ARRTAKASYLRLQGggRERAFFAAFYENLLVSCPDVKPFFVPERMA-------HQQ----SMLNRAIQLLLDFDRAcgCPQLRqlaDGHA-GYQLTRWHYDQFVEALIRTIEQS-G-ITNPAELSAWRTTVMPAIEFM-----
1495 >tr|E9HGU5|E9HGU5_DAPPU Uncharacterized protein OS=Daphnia pulex OX=6669 GN=DAPPUDRAFT_301206 PE=3 SV=1
1496 -SLSDSDINLIVSSWNFLKKRLSSFAPKVFIGYLEARTDSKKMFPDFAHvniaeLATNVEFRSRACNCVASLNYIIPHLKRSFpvLQCPALKNLKT-KYNQHIDILKSLGIIWVKAMQEELDkKIFTDDVRVVWKKLFSVLKE-------
1497 >tr|A7RWR5|A7RWR5_NEMVE Predicted protein OS=Nematostella vectensis OX=45351 GN=v1g203303 PE=3 SV=1
1498 -DMTYEQKYLIRETVDNRECVNekDflawRYVCELAAIFLNMHPGLQTYFSEFKhIKiDNINGSHGHPRRLLMAIDNAVTALGDSDSFsayLVELGRRHHgMNFRPGPTHFNDLRKCFLSVIKEILATasLWDFQVEEAWNRLFDSITAMMLR---
1499 >SRR4051812_28599342
1500 -------------------------------------------------------------------------WVRPRSRGGRSPRSrssRS-SARRWPSGRPRPPSTS--RPDMRSGPSscgmsrarwqsifpapsrtgcasPIGVLGDP-----------------
1501 >SRR3569832_2950508
1502 -------------KNNKKN-HHPNNHNTKKKANKTTTPKKTQKNKNTNFT-------RQKKMLQMSLNLLIShamGIDIVDGYLHQLAERHSRhRLNIEPHHYAAWLNSLMKAVRQHDP-K-------------------------
1503 >SRR5262249_31239692
1504 --------------WACCA-RGGAS-R-AY-------AKSRERHARDGFA-------WRP----RAASGTLRageGEPEGEAHLRRLAAIHDRdHHDIRPEPYDRSLDCLPQAGRDRDA-EATPEVEEAWRDVLAPGIAVMKAAY-
1505 >ERR1712048_439078
1506 ---------NVTTIWDSIKAVpgyEEKFGRMLYEKFYEMEPESFKLFKK-TRQpaaedvFSDPVFVQHSLEFVRLLDFFIQVLGPdIelvEESLVDFGETHQ-DYGVTLDTYSSFGEAMTETVEELLGGngKMDETSRRCWVTAYRYMSMHMTRG--
1507 >tr|L1IAP2|L1IAP2_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_120658 PE=3 SV=1
1508 --------NFIVSSWRKLLRKvsYADLGLSIYESV-RDVDELEPLFRFTNRV-------VQGTKFVDMLSSIVDNIHSPAEIYVKIADLaplHH-RKGVRGSQMPLMQEIVMRVFDSTLGDDMLEEEKKAWLWMWAFLTKALD----
1509 >ERR1719336_1989132
1510 -------------------------------------------------------QDRKGGgGTPGKLKVTAKYNDGTefvDefntvifaigrdactakmgleGVGVALNPKNG-KVlhneler-TSVDNIYAIGDvldgkpeltPVAIQAGKLLArrLAGTSEVTTDYVNVCTTVF--------
1511 >ERR1719278_462770
1512 -HLSTADVAILKGSWSVLEEHVTRVGVDFFIDMMTNHEEIKAVFRQMpNIPvyelKANEDLNRHGMYILGVIKKIVGKIDDTeylEKLFDDLSDLPL-LLLQQDRPHHLAKNLPKNVHSGSLYaePpvkvaEVVEELLQVLCV-VDLPHNLL-----
1513 >ERR1719186_958210
1514 -HLSTGDVTALKSTWAEVDSQISKVGVEFFLDMFHNHDDVKQTFREHpELPvfelKANEDMHRHSIFVLGAIKTIIKHIDDTeylESFLADLSDKQR-AVGVDANNMELFGKVFVKVMRPVLLekRKWKPEVKDSWMTFFTSIVKVMKK---
1515 >ERR1700748_142917
1516 -------PALVREAWSFVSDRADQLVANFYAELFFVFKEAPMMFPS-DMT---RQRQEFGRAVVQWII-----SDDQDGLAMHLIQLgadHR-KFDVEPRHYEVAGAAMVNAWKKLAGWKWTPAHEAA-----------------
1517 >tr|A0A1B6KXW2|A0A1B6KXW2_9HEMI Uncharacterized protein OS=Graphocephala atropunctata OX=36148 GN=g.8863 PE=3 SV=1
1518 --LNDVEVEMIQEGWKCITESEDFFRTAFSSIDF-----TPVNFRE-DEHtdderFSRDFLKSHSVHVMNTVRTIVEDVKNPNSWMLELlriATLHK-LYGVTLEDLRKFQCSMLETLKQCLGEcNFSPPMQEVWEKVVECVVI-------
1519 >tr|A0A1S3CW24|A0A1S3CW24_DIACI uncharacterized protein LOC103506299 OS=Diaphorina citri OX=121845 GN=LOC103506299 PE=3 SV=1
1520 -GLTPKMVGLLKCLGVAIKPEAHRHGVNIFKKLFLMDKTVQRMFPKFacdDMcgLDENPDFHKHVDAVMKSILYMMESSGSVPDmksTLALQVKIHK-DLCIPDRHFITFGYAINEYLKETLGAKYSEDVECAVAYFWKFVASEMTAKP-
1521 >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 OX=905079 GN=GUITHDRAFT_143733 PE=3 SV=1
1522 -------SARIASSWTELVKKsdYAEIGRRIYGSV-KANDTLEPLFRFTNQT-------VQGTKFVDMLSSIVENINNPQTIFEKVNELapmHH-RKGVKAAHMPIMKGIIVSLLKHVLGDEFTNEDEEAWNWIWQYLTQILD----
1523 >tr|A0A0R3PZJ2|A0A0R3PZJ2_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1
1524 -PFTDEEKSELLRSWKVIEAQKQAVGCDIYEMIFNQL------EPFLCVSikapkELHNKFRIIVICIVGRYEEELSSVNE------------------------------------------------------------------
1525 >ERR1719192_2137381
1526 ------------TSLNFKHLcvQ-QLLKLPCLPRMFETHPEWRNLWQHMGgkLHiddmLTLPRFVRHTMSNLAYLDKIIRDADDQTKTIAsvqFLAKVHA-VQGIGERDFKQL----------------------------------------
1527 >tr|W2T4S9|W2T4S9_NECAM Globin OS=Necator americanus OX=51031 GN=NECAME_11818 PE=3 SV=1
1528 -----RDFFTLKNYWKAIDRKRQDSAQLFFSRYLNQNSENTKLYPKLkNIDgatvDmtcSDSGFEAMAASYLKVFDDVISIIEekpgDVQaacDKLTSVGKMHKtKGVQVQPKSFQAMEEPFMHMVKEMLQDRFNEKAEGLFRKFFDFCLKYILEGF-
1529 >SRR5512134_285705
1530 -ALTPTHATLVRESWARLAPGRAAAVHRFRARLEAVSPRTAARFTCLDH-------EAQRDGLMIELDQAIAATGSDDDLVPALARIARrfRESGPASSEYPMVRDALLEVLAEADRGIAPPELRRAWGSLFGLLAALV-----
1531 >tr|A0A1E4RL21|A0A1E4RL21_9ASCO Uncharacterized protein OS=Hyphopichia burtonii NRRL Y-1933 GN=HYPBUDRAFT_5624 PE=4 SV=1
1532 -TLSSSDSQVIKRSWTELQNNnkyhKDEFVSRLFGNLLAANPNLKSVLST-DL-----IIRQQSKMFNDMLGFTIMYLDNEPLLEECMNEFvqeNPSIVALGVQYLEPMGLALIQTFRQWLGSaKFHAGLETLWIKIYVFLANCIL----
1533 >ERR1711973_858157
1534 --VSAAHKSLIRSTWTLMKF-NSNVAPKILYKMFTTYPETQKMFAKIAEVStfdlmENKDFLALSYTFYSQFNLIVNNVDNPEIIKSQVARMISPsFFIDpsasIAQQLERANKIILEIFGEELGSSFTDEAAAAWTSLLKIVYEVVE----
1535 >ERR1711928_171062
1536 --VSATQESHP---------------------------------LDLDSHEiqqqrRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFET----LCFRWIQHD-----------CQQYG----
1537 >ERR1711928_123369
1538 ---------------------------------------------------rRTQNPLQDVHHL----------SRDPENVHPFGRYTRFS------A-HGEQTVLGFES----LCFRWIQHD-----------CQQYG----
1539 >ERR1740128_75568
1540 --VTAQEKTLIRATWDQMMF-NSEVAPKFMLRLFSEESQHELGgnFaVEHHLVPggadeglllGSNDGFSNTLDVRVG-----------------------ShLLGNdai---------DVVHDVFQCFLGGSIGRGDlfnglHHNMGRFVQLVDGX------
1541 >ERR1719219_701605
1542 --VSAAHKSLTRSTWTLMKF-NSNVAPKILYKMFTTYPET-QKMyTRLADIPasqlmENKQFLALSHSAFAGFNMIVNNMDDPELIKLQLSKVDFPgTFVYpfpgTSLNTSKPPASSWKYSPKN-SAPLSPRKPLPLELPFELRHQGFG----
1543 >tr|A0A1V9Y3S0|A0A1V9Y3S0_9ACAR Globin-like OS=Tropilaelaps mercedesae OX=418985 GN=BIW11_00005 PE=3 SV=1
1544 -SLSKEDMELLKGSWQTIRKDSKVIGRSIFVQLFREDPNLIKKFRHLDNIpaeqlPYHPKLLANALSVFYVVTSLIDHADDADtcrELVRKVAATHR-PRNITRQHFETFGVAFLHVVSSMMS----ARALNSWQRGF------------
1545 >ERR1719510_1721190
1546 --LAPNDITNVKSSWTTIETILLQVGIHVFIVLFETQPNMKRTFRQYRGKkhselRINEDLQRTIMYLMSNLKRLVRYINDNRATVKFMRRLakkHS-PLELDLGRIDpnEVATLFCTAIRDAKqickdqngKTSWSTEIEASWANFFGAILGAMR----
1547 >ERR1719264_357726
1548 --VGLCDALNIQQVWPRIEQYLLPVGTRMYISILDGRCDKIIFCNKACCRknasksssakstrsvysksvsrtcpnqvILNEELQKFVLLLMGLIRRAAKHLDNPSHSAKVIRKVtkkrFG-KLNIDVTKIAfePIALNFIASVREIMtnTRHWNTETEASYYTLIRNLIAYVQ----
1549 >ERR1719244_2234371
1550 -DLSTNQKNMIRDAYAVFEKNGEKNGADAFIYLITQHPDLKQVFPWGDVSneelRENQVFKDHVYVVFKGLKVAIDRIDNLKATASyyvHLGQAHV-TRGATDPAFEAVIEAVLHTFKNLLGDKYTEDFQTSFNNLLQFLVGNMK----
1551 >ERR1719193_348913
1552 -KLEQKDIRAIREGWACITAHpgLEKTGVDWLHLSFELQPGTKHHYKNFTNKtleeiCQTPYMKILAGKYMSEIGILVEHLEHSNFVlmrLENLGHLHA-KMGVPMETLFTM----NIVMQHYFRELYSrqdvpDDCEGAWSKV-------------
1553 >tr|B3RTB2|B3RTB2_TRIAD Uncharacterized protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54901 PE=3 SV=1
1554 --------------------------------LIKLSPATKIYFHGVDFEkrdsylAKNTFLRNHAARFMEAINVIIGQDMDIfsvESYFRVVGSKHH-SYNLKLEHVQDISDAFLEMARNALKKKFTKSTEAAWRSFFQMVTDAIKN---
1555 >ERR1719229_1707680
1556 ----------------------QQLGVLLFANLFKKQPLCRNLFADSDI-------SKQSLRLLDMFGWLLRSLVKEKnqmrlRTLKSLGDRHV-KYGIKIEFFGPMLDSLSDALQDWFGTNYNTQTRVALTTLFQSACNEMMKQ--
1557 >SRR5438046_805262
1558 --------------------SRRSTG-GSSRS----ARPLDPCSPRPTSI-------GSTGC-CVTPSACCYFPAQPdgePTILARVADRHSRrDLAIDPALYPLFIDSLIDTVKQY-DHEFTPAVEGAWRTAVATGVEYMQSKYX
1559 >SRR5438034_714626
1560 -SMTEASIIAFNESFERCMAS-GRFFDVFYDHFLRSSPEIAAKFQGTYFN-------RQKRMLNQRPATTVGQPR-----------RSAReSRKTPAAQFVStcqampsaFVSELTKSGSTX-----------------------------
1561 >SRR5258708_7736634
1562 -------------------------------RFTGTSDAIREKFKNSDFA-------VQHQAMADSLYLMAVSVQGGpenLARHDMKRLYPKHqRMEITASMYDVWLDCFVATARIHD-PECTPAIESAWRECLTPGIAAMKSGA-
1563 >tr|A0A0G4HCC2|A0A0G4HCC2_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6317 PE=3 SV=1
1564 --------PLIHTSFDNVLERttTEELGVRFYEIVFETAPHLQKLFKK--------PRRLQGRVFANVAALLISGIENPRFLTQELQRLslrHV-GYDIRPEHIPVFGNSLMRTIKEAaLrpspkdgqPFDFSHAHDEAWGALWGRVST-------
1565 >ERR1711965_451221
1566 ------------------------------------AGAVR--------P-------RP--------AAVI---GFPFPLFP-LLETADMtsvAVGAHPRLRA-----L-----LRDR-G---AWYLTGPQELASVIGRLERLER
1567 >ERR1712012_1094824
1568 -SLTTSDIAAIRQSWILAKDAApfEVHGPAFYKLMFETYPSWRFAFNHMGGhlSievqIENTRFVKHTVTVFRFIDKCVNDLDNPTQILENIkmvAKIHA-LQGIGVKDFIIIKAFICSKSD-KVGAGRSKNSFIFFPRFL------------
1569 >ERR1719431_737524
1570 --LDMSQISDLQRCWSTLQLHMgeQAIAAAFYNDIITNFPSIQKYFKNIwTEStftrtiGNMNDVRKHASLVVSRLTNYMGNLHHLSEVNEDLKELgmiHAARYHITEEVVEQFVSSMATTVADLLTKedLFDPVLCGAWKRFFFMILTFLSEG--
1571 >SRR5882757_2588511
1572 -SLSSRQQILARRFFDAVEASDKPLAAMFHERLSEIDDRLDGLLLE-EE----GCLLREAMVIVRTLSRNVDRLNRMVPIFRAFGRTCA-AQGIASANYEKIAPVLFWIAQECVGSEFSVEMGRALTALYDQLSREMKD---
1573 >ERR1719199_2454663
1574 ---------LLQAVKYVPARefyatfdeaSKYQLRADVYVKFFADCPVGEGYFKQ--------SNTYLHIIAAKLMDVVVAIYIDPVAVVDmisGVGLRHV-GYAIPIELFPPWVTVWID--------------------RWRSIGAT------
1575 >ERR1719199_1562120
1576 --VPADLAEEAKKAWTMLITaagSKDAVGEALYSAFYE-aAPSLHYLFVT--------PRAVQAMRIFVQVNNFVNLPISPADLKNaveALGFWHM-SMDVTVPRCVVFRDCILDLFVAELGRPIEHSRAPKHWSSAVQFPSPIP----
1577 >ERR1712176_999243
1578 --------------------------------------------------------------------SY-AHRDTFDQLadaprtI--FYTQK---------QGHPECSEMVEKMKNIVGDE-------------------------
1579 >LakMenE18May11ns_1017448.scaffolds.fasta_scaffold9549672_1 # 1 # 642 # 1 # ID=9549672_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.514
1580 -VLTAEEVQLVKSSWPIISKD-LKVAENALIKHFILHPPIQKLYTKLaNVPiselKDNDEFHAQAATAVKITHFIVNNLDNDELLTAMLSKVTIPaffvDYMDPIHQLDETTRLFLQAVKEELGNQISERTLAAWKKALDHVMLIMSN---
1581 >tr|A0A1I3QX19|A0A1I3QX19_9RHOB Hemoglobin-like flavoprotein OS=Celeribacter neptunius OX=588602 GN=SAMN04487991_1987 PE=4 SV=1
1582 --MDEQMIALVKASLKELQPHAGAVFATFQSKLAQRAPELAYRYDEVDP-------ERQGELLFEKLAIALGGVRFLDRLVPALGGVglDAGSASLTSCDFARLSEVLIAAFAEVSGNRFDPCIGAAWTTLFEELSWHMFE---
1583 >SRR3954447_25823703
1584 -------EDLVKASYHRYCADKISFYKDLYKRFFKRNPDGQRFFVKTS-------MKRQC----RMLDEAVSLLTNFRtgpepTSLSRTARGHA-GLGIEEKHYRDFNVAFVESLQMA-GE-DDEDTLNAWRCMFARGTE-------
1585 >tr|A0A024TW08|A0A024TW08_9STRA Uncharacterized protein OS=Aphanomyces invadans GN=H310_08903 PE=4 SV=1
1586 -VLTKARIETCARSWDKVRTAATdkmkSygkpgivlFYDEFFYRLFQRDSTFRVVFAN---------SKERAEVLIKALMFMLNMRADSPEsvanmqnRCRFLGHKHRSYSLVRPHHFAAYTMTCIEVIMYWMGDEASIMVADAWSNVVGFVLRYLLEPY-
1587 >tr|W4G1Q9|W4G1Q9_9STRA Uncharacterized protein OS=Aphanomyces astaci GN=H257_12218 PE=3 SV=1
1588 -LITKPRLQLCLKTWEVVQSASTdkmkQygkpgiilFYDEFFYRLIERDATFSQVFVN---------VKERGEVLIKALSFILSMRADDPAdvtnmqnRCRFLGHKHRTYARVRPHHFAAYTMTCIEVIMYWLGDDASPLVGDAWSNVVGFVLRYLLEPY-
1589 >tr|A0A2H1V3P2|A0A2H1V3P2_SPOFR SFRICE_008656 (Fragment) OS=Spodoptera frugiperda GN=SFRICE_008656 PE=4 SV=1
1590 --LFGSqEFKACCsgMGMGKIGKGG--IGPPVtsL--tqrnttqalfhvgflPYLRAAIQwctvqvDNSFDYLgIWTepvafSVDPLLIAWlaykpTVKSEASLPAAVKSLSQTQQIp-------FR-RRSTP-----------------------------------------------
1591 >UPI000297C1C9 status=active
1592 --LDEYSIGEVRNGWENLERRCGtPKAA--A-EEFLHKVSAAIPKTE--------HMQKRASTVWSKLNGLLASMHDQSMFTGQLEYLalrHM-NQDISAAEIETFKGLLLEFCASKLGGMMTPEFQYGVSRLVDAVGASYQ----
1593 >SRR5262245_14724532
1594 -------EDVVKKAYQRHCYRQPEFYRSFYENFFSRVPKARAMFK--D-------MARQHEM----LDFALGQLLNYSqqqsepTTLTQFVERHS-RLGLTADDFKRFGEALIATFDSELRGdCEHHRTMAALEIVI------------
1595 >ERR1712071_238239
1596 ----ERSFTYWKDSAMMELA---------KWNARLQTPR-----------vYEVKwRRKKRNIPGRVGWRVLGAELWVRSSCRRRIRNRPYQEYFVSyvsiSQQLEETARLIIDALDEELGVRFTSYTRGVWSR-aFHFANSIMAESF-
1597 >ERR1719204_2878153
1598 ------------------------------------------------------------------GITMMMAVVRGRPVRPAVQDigrAHY-SLRVDKDDMRQLATAMISAISDSVGTYMSPDALDAFTKLFEHIVEEFGNGY-
1599 >tr|A0A183IHG0|A0A183IHG0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
1600 --FSLREKELLSVSMKKLEQLEEDNAVKIFIRLFQENPAYKSLFPKLRFmgdadIVNSTALVAHTQLILKMIKTFINGFQNESTCAVVLKRaetAHR-KFDIKPSQVSTLFPILMEILDIS-----HNETQAAWKKLFETFSIR------
1601 >ERR1712232_1039451
1602 ---------------------------------------------------ESEEMRTHATKVMTFVGNGVASIGNPEKCerfraeCIALGKKNQ-ERGISSQDYDIATQPFVDAVEHSwlqagwrqtdaSGSIWPPGAQGAYTKFYGHMAATIKDG--
1603 >tr|A0A0N5DPZ7|A0A0N5DPZ7_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
1604 -NLSAKELQLIEQSWLDIE-NKDELGKEVFKRVLLSNEKIRTIFDLHtcpdDELDQNETFKRHLKSLSLFIGICATSVAvgseRLVSIARRIGEKHVNFRwvTFDAEYWLLIKGIMVDVIASKQRPKEVEKVRSAWNTLLSFVISEIKHS--
1605 >tr|A0A183UUV2|A0A183UUV2_TOXCA Uncharacterized protein OS=Toxocara canis PE=3 SV=1
1606 -RLSPRHRNLIIKSWSKTN--KSKIARDTFVELFKTSADIRSKFVFGDVPikrlKQEDRFLAHCERFVAALDSVIAHLDEIGAVIEnaeALGKYDIsaepihaAmAKDLRNEHWRLFGDILVERIIENDTkqPSGGSEVHAAWKMLGQLLVFHMRLGY-
1607 >tr|A0A1Z9IBY6|A0A1Z9IBY6_9RHIZ Uncharacterized protein OS=Rhizobiales bacterium TMED162 GN=CBD22_07770 PE=4 SV=1
1608 -GVTQTQEQLIEQSLTHYAARHGDPYDAAFQKLYAAAPHYEGLFVL-DTD---E---GLRRNMMRTtLEMIATYIDDAYAAENlvtGARLVHL-TYEITDD-FDLFFQITRDVIAEGCADIWSDAHAAAWNTMLKDF---------
1609 >ERR1712150_396892
1610 -DFPSDQKQLVVKTWHYVEDHFNEVGITAFMDLFKVSPESKMIFDFLKLyHtddgKFYDLVTKHSLRILGMVSNLVKELKCKsseaadesiHDIILPLGRRHV-QYKANVIQMELLGLLLVKSLLKPIPKEeVGdkeyGQISEAYLVFFRVIVYW------
1611 >ERR1712062_817879
1612 ---------------------------TAFMNLFKVSSDLRTTFSFFGYvNvddeKFYKLVTKHSLRIFAMASTLVKELKSRdsdasdrfiHDTLFPLGRKHV-NYRSNLIHMEMLGILILNSLMKTIPRDqLNehryKRMNYAYFQFFRVIVYW------
1613 >ERR1712135_246677
1614 ------------------------------------------TL-------SVILKRTAEITAHKIIIVVTFQLKSKdseeadrfiHDTLFPLGRKHV-NYGSNVVHMEMLGLLIVKSLMKTIPRDeVNehrfERINDAYFQFFKVIVYW------
1615 >ERR1719171_2780585
1616 -NLSEEMITEVQKSWSEVLRRvdsKTEIGRIIYDSLFDRLPHLRKMFKTNRL--------TVAMRFANSVHSLVGILNNKEQTeeyVYNMALRHV-QYwsgdgSIAQANMSAFLKAVLIVFDNALDDKWTQRMEEAWGALFSYVGEAMVA---
1617 >ERR1719203_1566926
1618 ----------------P------SHPICLRSPkrFTRRSSAVTGnCCNsstQHTTFPNR---TTSPRPWPVPWPPTPPTSSTSRPSscpavpVEAICHRHV-ALAIHPMQYVVVHENLMAAIAEVLGDIVTPAIGAAWSEAVLFLAKAFID---
1619 >ERR1719253_2317543
1620 --ILSPAGRVLRLRGPGFLPprcrfgrlspnhccsrvspdriavarRPPPRPRSRPTSSPSPRTSTRGc-WAATRSCCSS---STrpttspsprT--SLR--------PSPAPSRPtppTSPTC-LPS-WSPAGPWRPSVTA----------TSPSPSTRCSTSWCTTTSwrpsprswatssrrrsrpagprPSSSSPRP---
1621 >ERR1719253_507459
1622 --LSQSAIDVVVSVAGRDARRARPRAGPRR----------TDp-WRRRRRAARG---G-gpgrragevqtraaeGASTLGHGLVR------RGRALghgLVRHGRGHC-HDS-------------------------------------------------
1623 >ERR1719253_479176
1624 --HHQELLHAGVGQPPGAAA--VLQPGPQR----------PRl-HEPAX----------------------------------------------------------------------------------------------
1625 >tr|A0A183EWZ6|A0A183EWZ6_9BILA Uncharacterized protein OS=Gongylonema pulchrum PE=3 SV=1
1626 --LSKRQRVAIENSWKRATKsDAdKHVGIQIFFRILAARPEIKHIFGLQKIPdgrlKYDLRFRRHAVILTKTFDYIVKNLAYKEklqQHFQALGERHTVlqGRGFFPEYWETFSDCMRQTVLLWNK-EKKREITSTWYQLVSKSnFPVRY----
1627 >LauGreDrversion4_1035100.scaffolds.fasta_scaffold358575_1 # 2 # 736 # 1 # ID=358575_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.683
1628 -KMPKDAVQEAQATWQKWImknTDEETAGLCIFEAVFQSMPALQGLFDTT--------TPAQAGKFMKAFTECLQGALSREELklkIETLGFLHM-NIEVTTANTVLFKNAMITCMDKDLQSAFSVSAREVISKLVLYIGGAF-----
1629 >tr|A0A0D6M6J3|A0A0D6M6J3_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=ANCCEY_05408 PE=4 SV=1
1630 ---------------------------------MPSCVRTAVTLP-----------YLEIFEPFVVIEGAVMSLDNLPALDPildNLGRRHG-KLEVNGkfrtYYWSTFLECSICIFRKTLTN--------------------------
1631 >tr|A0A1Q9CXH8|A0A1Q9CXH8_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene31162 PE=4 SV=1
1632 -ILLEAQIVEVNECWQGFLdcyAKPEHAGEAIFAAILDAAPSLQTFFRG--------GTALLAGKFVAGYSQMVHNLRNPDGLMGVVEHLgfqHL-DVDINIPRIAIFREAMCDAVSAELGEKLTDLGAYGLRRLVSYAGGALI----
1633 >ERR1719174_1428107
1634 ----------------------------------------------------------------------VVDCQDQRSTLGYPPSAst---SVRCCVEQVARRaflwrkswfLTTLTIFIAGQ-AiLKYSHLDNLATERLLVFLFRAFI----
1635 >ERR1719277_1813735
1636 --------------------------------------------------------------------------------CMCAAETriaHL-IGRASVANMHNLRNAVGSEVCLLSSlAIRFEANHVGWAHVsvadvVAVCSSISL----
1637 >ERR1719310_1375130
1638 -MLPQEQSQQLQQAWALVInmsGNRDALADLIYSAFFYRLGePR-APLRN--------PAGSRSLPFLHGHQHLRRQLRrPwssaqfrrNVELRSHVLGYhrpSG-EHHSX-----------------------------------------------
1639 >ERR1719487_3068354
1640 -ILPQEQAEQWRPSASSLVsthSLQSLAIHLAC-VLLLRPYPSdTCTWTS--------LFPVLTSSVM--PSSICSWLSLAASX--------------------------------------------------------------
1641 >MEHZ01.5.fsa_nt_MEHZ011529165.1_2 # 173 # 307 # -1 # ID=206391_2;partial=01;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.393
1642 -YMSIDtgnleaakvmlqdlvtiradrsryyyclddlFKWHPDIVWKLTV---------------DAPELLrtmldGMIWRS--------RVVvngnrrvnyylkhllvDEHGKFSNAM-SCIVKLQDP--EIAIHPILvq---LGDLVWNDLVYWrflrgklslVCTAGIFMVSQSMl-QYVESAGSFEERVATFICRLVV----
1643 >tr|A0A2T3VCJ1|A0A2T3VCJ1_9ACTN Oxidoreductase OS=Micromonospora sp. RP3T OX=2135446 GN=C8054_25080 PE=4 SV=1
1644 ---------DPGELLASALVVLSPAADYFWSFMEDRSVRF---LPQ-----------QLAPMFFSTLGQMVAGRGDPAGRRAALAVMgrmYR-RFDLQPYHDTVIAAAVVDTVRRFAGASWVPEQAGQWEkgcrQALRLS---------
1645 >tr|E5XPI8|E5XPI8_9ACTN Uncharacterized protein OS=Segniliparus rugosus ATCC BAA-974 OX=679197 GN=HMPREF9336_01410 PE=4 SV=1
1646 ---------TFVRSFHlELFGAAPELAARFPPGLGEHRGGF---VRM-----------------AEHILETFAEGADPPRLIDLLGQLgrdHR-KHRLDERDYRLAQAAFAKALVATARG---SGDGAFAAraaaLVCQVM---------
1647 >tr|A0A246RU09|A0A246RU09_9ACTN Uncharacterized protein OS=Micromonospora wenchangensis OX=1185415 GN=B5D80_01060 PE=4 SV=1
1648 --------------------------MREADELRSALPDR---LAA-----------HDAELLIATLRRLATD-PEPAAQAVTLTVLghaFR-RFALLPHAKLISALAGAD-------------------VPVELL---------
1649 >tr|A0A0P5RQ13|A0A0P5RQ13_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1
1650 -KLTPHQIRDVQRTWEHLRANRNAMVSSIFVKLFKETPRVQKHFAKFaNvavdALPENGEFNKQIAPVAARLDTIISAMDDKLQLLGNINYMrypHQPPRAIPRQTFEDFARLPIESLEAS---GVSGDDMDSWKGVLTIFVNGVSMRY-
1651 >tr|A0A0L0FDI4|A0A0L0FDI4_9EUKA Uncharacterized protein OS=Sphaeroforma arctica JP610 OX=667725 GN=SARC_12917 PE=4 SV=1
1652 ---TDSEVELIRSSWRALLAGDGTaaqmpllrFVEQYYKRLFRLFPDSRGVFKTRD---------TQSKSLSLLLSIIINVADEPElemnAKKKKLEMMYK-EYGMNSLLAVIAGRVLIQSLQAFLEAsnKFQASVKDAWVKCYTSIADQLL----
1653 >tr|L1IS81|L1IS81_GUITH Uncharacterized protein OS=Guillardia theta CCMP2712 GN=GUITHDRAFT_143733 PE=3 SV=1
1654 --------DLVLSSWDIVRQRteVQELGEKFWKYLNCMSPEQTNLFRR--------SLSMWGHLLHHIVNMLLISITDPEEYYDLMFELtirHI-RYGVRSEYLNPFGNALFATFEEILSDVWEEKTTKAWKLVWKRATCNMSRG--
1655 >tr|B3RTB3|B3RTB3_TRIAD Predicted protein OS=Trichoplax adhaerens OX=10228 GN=TRIADDRAFT_54902 PE=4 SV=1
1656 -----------------------------------------------------PLVRSHGLRFMKAIETMLEIEFDSNgciFLFSAIGNRHC-SYGIEADYLDYVPQAFRFMLTKALGNNYTDKIASVWDEILSHIIKAMQDKV-
1657 >ERR1719347_2568912
1658 ---------------------------LPPPTHFLPLPGINRKVRIFqRQFgnqtsefLTGKALRDHSIRVMDALDSVIVDTLKGKDIHKqmvDIGYSHL-KMGVEPRQIEKFLMGVYIGIKEKQQKKDSDQVMMAWKKFFNVLAEGFED---
1659 >ERR1719474_100483
1660 -----EYKNILRSTWSKLLENKEEIGLKIYKSIVfDTTstPtgnglSTSIIF-------ENSDLGQSSSRFIDMLDTVISQLDEPEALTRRLEELskmHSDKYDVRKRHYMDFERGFMKAIKWELGAQRTAQHDRAWRWFWDFMLSKMC----
1661 >ERR1719464_849876
1662 ----------------------VLIGCQTFQAFFDRHPQFLSNFDKFNAieidgVLVSSALKMHTSRVLAVVEDIVEKTGNHPRTLGDVR-------------------SSDMSIRPLvFRSgLWTIELE-------------------
1663 >ERR1719232_2219129
1664 ----------------------CRPGCVTFTQLFAQYPMMefLGKFDNME-vegVNIGEALKSHAEAIGSVVAEIQENAGNPERIRMSLAGAghrRY-QEGVARQQLDMLGPILAHVIRPLvWEKcLWSVELEKAWTHLFDIVACLMKLGY-
1665 >SRR3990167_8699843
1666 ------------------ANQLEDLCRLFYAHLFAKAAHLKPLFGDSE--------DTQNFKVIKMFELIIDNVEDLTQVQPiclDMAKRHS-FYGVKNDFYQYIDEAFVWCIQQQLSLSIQDPIIHAWYAATKYISSIMID---
1667 >SRR5690554_7960028
1668 --------NV--QFVSRGC-GGTRFCSLGFPH----PPSATLFPYTTLF-------RSQRHLlrngVMQIILVAR-GMSD--RKLRDLGESHNRsNYNIKPEWYDLRSEEHTSELQSRPH-LVC-------RLLLEKKKKNLNITY-
1669 >ETNmetMinimDraft_19_1059907.scaffolds.fasta_scaffold284136_1 # 1 # 639 # -1 # ID=284136_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.595
1670 --LPKACVSLLRQSWKQVP--QASFRKEFFDRLYIEDSSLQQIFQHPM--------VEVPENAWNVVQLMLDLLNVenvprLERFVHALAGLAFRHGRFRLAHLAPIKRALVRTVTSHASKQEKKKLSQAWEAFFYALAAVAA----
1671 >SRR5438477_815846
1672 ---------------------------------------------------------------------------RPHLSAHECGRLgpaVGQTLGAARNHLDAFGLALIEALNAATSLD-SVPTATEWSDAWDLTVRWTRP---
1673 >SRR3569833_2822653
1674 -----------------------------------APPERHTVLHE--------AIVTNPVEVAGAIGWVVEHLHRTEEVATACGELgpaLARLLAGHEQHLDACGRSIIDAIRTGLADRWKPEFDGATSSAWELVAEWLRR---
1675 >SRR3954471_21372458
1676 ---------------------------------------------------------------------------------------gpaIA-ALGIAPDKLEPMSLFLVEALLAALSPMVPadRTAGGGGRGAGGGAAAGAAQ---
1677 >tr|A0A2T7P177|A0A2T7P177_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_12319 PE=3 SV=1
1678 ---------VITRSWKCFYEKVCSFGVYEFLNLLTDLPEYEEAMRLIKLTSsykflSAMDFNAHFLSMLTIIEKCMARLevDDLpllEDILHKVGTDHI-GRGVNPENFDLVIPPMVAGMKQMLEDKWTEKEDIAWTNFFTLMIHIMQE---
1679 >ERR1712198_190235
1680 --ISSEEK-hVLIDNLKMTKG-NKKFGANmll---KMFLAHPKTQSLFPNFaKLPvsslSNNAEFVAFGKMMVSGIEIFVN-cWVTNpSanislPTSLWINNLRKPPAWX------------------------------------------------
1681 >ERR1712179_658195
1682 --VSGNSK-nAVRATFDQMRF-NSEVAPKiml---KLFTAYPETQKMFHRIaDVAvsdlMNNRKFLHQLLCL-RRIQLHPQQhgrsrDHQTpTvqgrlP-----RHVRLPLPWYLsAapg-YFSHR----IGSVQGRAGRRlh----RR--SRLWMDFSAELRQP---
1683 >ERR1719419_2176015
1684 ----------------------------------------------------------------------------------------------FvfLgssQKTILARsiftkkIVLLTEHTLKISVAVSPLSAADFTILKD--NLKMIN---
1685 >ERR1711946_32375
1686 --------------------------------------------------dEQPQIPVHQLLFL-RRIQLHPQQhgrsrDHQTpTvqgrlP-----RHVRLPLPWYLsAapg-TYPPS----HSNHTARERTAfqvlFLPQDT--SRIVLEVFRE-------
1687 >ERR1719222_1795957
1688 --VSAKAKSLIRDSWVQMKF-NGEIAPKIYLKTFAAHPKTLAMFPQFaKVPnrvrPHPYEpLLATAGIDYDVKLWIPSPGSEHNinveeLMARNArmleetrDTITVPATFMIrMlas----------MSNFRR-AGNRSTNDE--------------------
1689 >ERR1719222_245222
1690 ------ARSlgrtqesHPLDLDSHEIqqQ-RRTQNPLQDVHHLSRDPENVHPFGRYtR-------FSAHGEQTVLGFESLCFRwiqhdcqqYGCSRa-DQVAVVQGRLPRHFRLslPwhfSATRANPRIILEVFAEELGSTFTKEAAAAWNSLLNFVTKGLEN---
1691 >ERR1711911_103569
1692 ----------------------------------sraDQVAVVQGRLPRHfR----------------LSLPW----------------------HFSAtranhPhhlGSIR--RRTRLHFHQGSRCrleLPfelRHQGFRKQHRRLATHR---SRP---
1693 >SRR6476620_89806
1694 ---------------------RHATRQQRRPDVF----------HERQRTAGE-D--lnVLRERDVGQ---VHESLARagvavIDGVVPRIGCEVV-DLSSEMQNG--------FPQGVIL-SAAVGVGDDDG----------------
1695 >tr|A0A0K0D079|A0A0K0D079_ANGCA Uncharacterized protein OS=Angiostrongylus cantonensis OX=6313 PE=4 SV=1
1696 ---------------------TRDTAGEYHKQLFTLHPELAKYYDAEDIDPdsvlkvcnaddmrylayssaiQAQKFIMLGQQELQCFFRLPTVVNDERSWRSALSDFkeTFGENnNMPMKEFNKVYDAFFAAMQKHAGG-VTAEQKKEWMALFDKAYEDMKK---
1697 >tr|A0A0P5EFU8|A0A0P5EFU8_9CRUS Putative di-domain hemoglobin (Fragment) OS=Daphnia magna OX=35525 PE=3 SV=1
1698 -NRPPPDP-RCPEELGKHRNGRNALVSSIFVKLFKETPRIQKFFAKFaNVavdsLAGNAEYEKQIALVADRLDTMISAMGDKLQLLGNINYMrytHT-ERGIPRAPWEDFSRLLLDVLGSK---GVSTDDLDSWKGVMAVFVNGV-----
1699 >tr|L8DEE0|L8DEE0_9GAMM Uncharacterized protein OS=Pseudoalteromonas luteoviolacea B = ATCC 29581 OX=1268239 GN=PALB_34720 PE=4 SV=1
1700 MSISPYQYRILTQSLAVVRPNFHCFCVSLRTQVS-HFQLNN------ALITKTEYAYQQEDGLFRFIHQCVGLTLDHPALVHFISAQakLLKSIEISERDICVICNCFLSTMQLHLGKQYTLAMRNAWRRLLHIIANILNHE--
1701 >tr|A0A290TM25|A0A290TM25_PSEO7 Uncharacterized protein OS=Pseudoalteromonas piscicida OX=43662 GN=PPIS_a0207 PE=4 SV=1
1702 MSITPYQYQLLTQTLASIRPNFHGFCTSWYNQIQ-HYDLRM------QIPTNVGQLIIWEHQIFDFVQNCVMRIPQQSNLLHYLQKQrgTLLFMGTSEKDISVLLFTFYSNAKKSSWQAFYHSSKKRLEQSTVTHRKY------
1703 >ERR1719262_376372
1704 -DVGEKVINEVIKSWQLLIKRVeskTEIGKIDFDSLFDRLPHLRKLFKTNRL--------TVAMRFANSVHTLVGALTSKEqteEFTYNLALRHV-QYWagdasIAQANMSAFLKAVLIVFDNALDEKWTQTMEEAWGALFSYVGEAMVS---
1705 >ERR1719440_1320932
1706 ---------------LPSLSLPsLLLPSLLLPSLLFSSLLLPSMFVSPR-------L-STAMRFAMSLHSLITSLESTEKteeFTYNLSLRHV-KYWqgdasIAQENMSAFLGAILLVLENALDERCTQAAT-------------------
1707 >tr|Q9NAV7|Q9NAV7_9ANNE Dehaloperoxidase B OS=Amphitrite ornata OX=129555 PE=1 SV=1
1708 -----------------LRGDLRTYAQDIFLAFLNKYPDEKRNFKNYvGKSDqelkSMAKFGDHTEKVFNLMMEVADRATDCVPLASdasTLVQMKQHS-GLTTGNFEKLFVALVEYMRA-SGQSFD---SQSWDRFG------------
1709 >tr|A0A0M4CP70|A0A0M4CP70_SPHS1 Uncharacterized protein OS=Sphingopyxis sp. (strain 113P3) OX=292913 GN=LH20_00550 PE=4 SV=1
1710 --KERSDAALMEATLAAVAETGIDIRHTLFERFFSAYPERHPAFLNLDAA--SRRMTDETLQILFGLA---TDEGWVWPLVAELVATHR-NYGmLPTDEYDAFIDLAIDELGRAAGRAWTGAHAAAWRRQGEIL---------
1711 >tr|T1HWR1|T1HWR1_RHOPR Uncharacterized protein OS=Rhodnius prolixus PE=3 SV=1
1712 -SLTQNEKELLKDSWKKRGINKSTLAMMWFTKLFKANPEELlkhnhgqileELFM--DQT--N---LDYMDKLAEIFSIVVQNIDKSTlctKLIWELAMYHR-CLDLTESYFQLLKKTLLDTLIENFHPSLTPEQIEAWKKFIGIMFDIIY----
1713 >ERR1719171_2291403
1714 -------IPRIcgelwrkqtfklrfnilgkqihspgiPRFFQKMENVGgLLVSalllaMCFYDPEIvAHEEQIGIHIIDR------------NDAIYYVLEACNACILWLlvTNVFGfsvQLSAFkHC-VSQMaeDLAKFGTFAVVFLMAFGCAIhiTMPYDPDFEDMWVTILTLFAI-------
1715 >ERR1700760_4852051
1716 ----------------------------------AGSPSSPAR----------------------------RPRPA-IATEHdcrtrAPANR-APiTYGSPVD------------------ALACRRAL-NDWFRVPGVP--------
1717 >SRR5690348_16468503
1718 -------------SFWLLEPVADAAMTYFYAELSSAARATWAdrdIYMS----------GPDHMIVRT--ARALVerg-------------------APSRLIHYDLVDPRVTEGQX-------------------------------
1719 >tr|A0A0B2UXI9|A0A0B2UXI9_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_18450 PE=3 SV=1
1720 -LLTAHQRILLQKSWNKSQKtGLENIGAHVFLKIYHREPSVKTLFGIEDVPhaelKYNKIFQNHAMTFTRSLDFILANLNKLDIVanfCRQLGRRHTQyiTRGFRPEYWDAFAEALTECAIDWEGGLRCREALNGWRTLVGFLIEEMRIGF-
1721 >tr|A0A2W4R8Q8|A0A2W4R8Q8_9CHLR Uncharacterized protein OS=Chloroflexi bacterium OX=2026724 GN=DIU68_09390 PE=4 SV=1
1722 -RLSRQQKRIIQRTFSAVAVRHDLVARLTIERLRELSRTpASTCFGNT---------PEDRRRLMHLLALLVQRMDDRGALHDACVAQT-RQMGCDPFeggSTSLLAEAFIGALQSALAGRFEAKTEAAWREFFQMVERVLR----
1723 >ERR1711911_155006
1724 -DIIRKNCLMLYTNFTATKIAFKWILLCLNCRYFEIKPEAQKLFPAFaNVPL--KDLP-KNYAFLAAVNTCFANVHYLIekagrnpRDCPVFSKVVA-KYDA--RDVKQFGDIMMNSLKSELGSQFTDEIEESWNLALEEIAKMVS----
1725 >SRR6478735_8357209
1726 -----------------------EREIAFLVARGLPsKEIAEQLFLS---------VRTVQNHLQR----IFTKLG----VtsrGEVAGVLQG-LEGPSSX---------------------------------------------
1727 >ERR1719487_2840864
1728 ----------VRQSWAMIQAIqtssAGGFGDALFFNISVMSSEIWSLFSVS-K-------EVMAVTFTDAFTLIVSYIADPVGLAEELfgeADGVG-DVGDDQGEGiregdghDLLGHGEQ--TPDLAAHDGDVEEERVAE---------------
1729 >ERR1719171_2815737
1730 ---------------------agaendeelrensgvedsfasgsvPTTFNEMFLFNLTVMGAGARK----N-K-------AImWMTEVLTSFDTIVANVANSKRLQEECdvlGLRIS-KYPLDFVKLPEFKACMLSSLRSLLPRTWSGTHEVAWSWLWENIERML-----
1731 >SRR5262245_17232684
1732 --VEEETRALARYSYLQW-LDDDEFFSAFYESFFAGATGAKGKFRN---------VEQQRLKLRDAMTAVLNFYPGnEPTSLHRLIAVHA-ARDVTGTEIEQFERSFLEVLHQRLVERKIAeqlgpdvvaKIEQGWRELLHPVVQYVMG---
1733 >ERR1711962_392431
1734 -KFTAEELEAVKKVWDSLLQNGQNSGLFFFEHFFKIYPDQRAKFSFIhDQYghiepeyMETIAMRNHTMKFMNILGDLLNQVLSrDKRVKQDLSNLgytHH-ERGLKEDDVLQLEYAVIDGIHDHL---VTDVHERAWRKVFQLIRIH------
1735 >ERR1719510_2339612
1736 -SLTDNEVILIKSSWTYLKPHINTILIESFMSLFAENSDVKEKFYSFkNHAiedlnKkrgvglaSTNGLQRHIPRVSRAITKVVNSIENLDRVsryLEMLGKIHQ-QIGIEVQELMMLGAFFINSSKRHLPSSMQADrhYSDSWLHLFTVISTMMRKGF-
1737 >tr|W4GBS3|W4GBS3_9STRA Uncharacterized protein OS=Aphanomyces astaci OX=112090 GN=H257_08997 PE=4 SV=1
1738 -VLTRRHVRLIEANWTLISRGTSSaydetrhgNPDKffhrtYYSLLFAVMPSCRSIFRS--------SMHLQGKSLFAILRAMTSILhcPDIVDRMQALAGRHL-TYGCEKTDYTTAGVTLLKTLEIVSGDQWNYDVKEAYLTAFCLLMYLM-----
1739 >GraSoiStandDraft_29_1057270.scaffolds.fasta_scaffold759411_1 # 1 # 798 # -1 # ID=759411_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.594
1740 -----------AQFWEEHISykslaDKLEIGCAIYFGMMVHNKEMKRILKKNlhhhQ------SIENSSVKFLDMMGWLLRSLLRsdidLCGSLQQLGAFHR-NMGVNINHFDPMLKSMHETFSYYFPIKYGIQIKYAIDQIFTLAARIMTG---
1741 >ERR1712214_179591
1742 --------------------------------------------------------PGHAgRREGRRSARQPGTGKDRQKStkyLLELGKFHR-FSGIPNDYFGVMGTIFVHAVRPYWEEagCASEQTEVVWMMLFAHIARVMTH---
1743 >tr|A0A1Y0I5V1|A0A1Y0I5V1_9GAMM Uncharacterized protein OS=Oleiphilus messinensis GN=OLMES_1782 PE=4 SV=1
1744 ------DQRLFWNSFDRCLsspQRDQQFAEDFYQRLYSSDRAIAEIFDRVSVS-------DQLHAVRQAVYLLQEMtpLKQAEITLDKIQAIHHqHEIRLSNAMLDKWLECLLASVELADP-EFNETVKQAWIDILTPA---------
1745 >tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1
1746 ---TTQQMELVKTSWTDIV-------------LFQKEAPIASLFSFVESAksdadnlLLNTAMQTHVKKFKAAMTSVVDLLPNLDAagqMMQSVGSRHA-NYGVKQMYIMTMSNAIIYALDLSLSArgKFDQATREAWTVFLGAMSRKFTEGL-
1747 >tr|A0A2B4SF50|A0A2B4SF50_STYPI Gelation factor OS=Stylophora pistillata OX=50429 GN=abpC PE=3 SV=1
1748 -QMSREHMTLVQDSWHLLKGNLEGMGVDFYISLYKENTDLLCQFPYMSeQStehvmNMDDRVKRKGLVTVQHVKEAVTALRNPGSCVH-----HQKASGFCPRNLQSVGGALLYSLDKSLGQSFTSKEKDAWCTVYGIDVATIG----
1749 >ERR1719199_1566639
1750 ----------------------------IFQHSGIQRPVFSTSSSSR--------RLCRP-CDLSMAFRPSDVLHSSTRLKAQVETMgfgHL-HLDVTPARCKLFHGALVDFFVVELGDKLTPLAAEGWKRVLTYVASGL-----
1751 >tr|A0A0B2VDB7|A0A0B2VDB7_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_13543 PE=3 SV=1
1752 -SMNDDTKGAICEQWHTILALydgdISRVGVAVYQRIFDAEPQLREVFGIPsFVtdLSEYEPFQRSGKLFMSVVDLCVRNIYALDAEmgpvLVMYGRRHyhQQSRGFHLRYMPIFTQCMKEFVSDCLNEKQkTSDSEDGWSLLFDYIAAKIVDG--
1753 >tr|A0A2C9KGE7|A0A2C9KGE7_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 PE=3 SV=1
1754 -LVTDSDIQALRSSWATLTAGPdgrNVFGNNFVLWMLKTIPNMRERFEKFNAHqsdealKNDNEFVKQVKLIVGGLQSFIDNLENPGQLQATIERLaaiHLKmRPSIGAGYFGPLQNNIHDFIEDTLKVGADDAAPKSWTRLLTAFNDVLNSY--
1755 >SRR3982751_838383
1756 -----GINDQLRESAAMLTSGGteatDAVIRDFYIALFRNAPSLIAIFPG-NPAQGdfgsDHRGAKQRELLLGALAGLADLydpgdaerMTHLDSVLKRFGRSHAAfTrpdgtvSGATLDEYKAVKDALFSTLVRAAGDRWRAEYTVAWSQAFDYAAASMLL---
1757 >GraSoiStandDraft_41_1057321.scaffolds.fasta_scaffold6338290_1 # 1 # 129 # -1 # ID=6338290_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.636
1758 ------------------REAGlEQYAGALLRSGFDDLEtllaiedadmkdLGIPaCHVVRlRKKlqelqrqrsgtrgdFDASNP---VVAFL-----ENAGLGQYA---KLLLQNgfddmDV-LLDIEDADLKDLGvprghaIKLKKGLRELQLQQYAQEDPMPLHAAA------------
1759 >LauGreDrversion4_2_1035121.scaffolds.fasta_scaffold1378443_1 # 2 # 412 # -1 # ID=1378443_1;partial=10;start_type=ATG;rbs_motif=GGAG/GAGG;rbs_spacer=5-10bp;gc_cont=0.550
1760 ------------------------AVRELLSEAVRCVSRGKEHFASIDME-------RQCQ----ILNDAIHMLLDFQAergnaPLRDLAARHK-PFGLTRRHYDIFLTGLLEAIAES-G--IDAAHLAAWQKTLTPAVDFI-----
1761 >tr|A0A0P5AEE1|A0A0P5AEE1_9CRUS Di-domain hemoglobin (Fragment) OS=Daphnia magna PE=3 SV=1
1762 -KLtp--HQIQDVQRSWENI-rNGLNALVSS-IFVKLFKETPRIQKFFAKf--aNVAVD------SLAGn-------------------AEYEkqi-ALVD--TPTPNVEFPV--------------------------------------
1763 >tr|A0A164VL64|A0A164VL64_9CRUS Hemoglobin OS=Daphnia magna GN=APZ42_022506 PE=3 SV=1
1764 --------------FAKF-gS-----------AAVDSLPGNAEYEKQVaLVadrlDTIISAMDDKLQLLGn-------------------INYMryt-HIERGIQRGTWEVR----------------------------------------
1765 >tr|A0A1Y1Q0V7|A0A1Y1Q0V7_9GAMM Uncharacterized protein OS=Thiotrichaceae bacterium IS1 OX=1934244 GN=BWK78_10305 PE=3 SV=1
1766 --------ELIGQSWDKLAPRQTEFIDAVYELLFQQHPHYKPLFSE--------SIQREMAKMVETVAMVARvsGESEIsHPRLIKLGERHS-PLQLNRGDLENFKTAYLTVLKQFCP-EWTTECELSWEEDQSLIPG-------
1767 >tr|A0A1B6JRB7|A0A1B6JRB7_9HEMI Uncharacterized protein OS=Homalodisca liturata GN=g.2446 PE=3 SV=1
1768 -SLTDRDLRLGRATWFKNVDATPDFGMVIFKELFRQYPDVESYFLHLRGnAgsiFDSRTFRSHMtERVVPKLKEVFEALDKPEHLnevMTKLGLYHA-KLGVSGHLVENMLSVILDALKSVMHTKMQPDEETAVRTCL------------
1769 >ERR1719323_1074371
1770 --IPFEQRTLITEVWNVLQESTiRYVSNtMFLPLIVRSNKSLQKCFAALDQSlhgmelvecYGSkFDRTKHGSLFLSkLLIRVVPNMDQMDRVLPYLAELgalHQ-RHGVAKQHIDLLGLAFCAAIRGVvagGGvkGGHLHETTKAWITLIQAVCTGMKMGYT
1771 >tr|A0A2T7PY45|A0A2T7PY45_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_00940 PE=3 SV=1
1772 ----PMEVALVQSTWQRFLesPNLTTEFSAIFQRMFQMVPTAMQAFRYVnstDLDslVANKDLQKVVTMMMSEVNATLQLLDQPQALISLIRshgARHA-TYGVTRQWEETMLNAILYAVETKLSPsGFNQSEKNAWRSVLDMLGRN------
1773 >ERR1719495_824226
1774 -----QDIENVRKTWEKMIAKheLQGVGLVVLTAWMNEHKEIRQVFAKSfpiiDKlekdvldlvQLNDPTLNEHATIMASSFGKMIECLDDTEfvQMMIDIGKKHT-GFRVSADSFDTsLNSTLITALMALSEEKEDSPNIKSWKTVVEVMKHYLK----
1775 >ERR1719272_197188
1776 -SLSATQRASILASWRQLCGEDggATFCASLLGGAFEAVPETRALAGV-PEAApepeavpeaeaavaapapapakgkagatavpeaaaaveeaaeeavesaESVALRAAAAHAAVAMEIMAQQLSAPEALKESLTELGVkaasRGLGC-GAPFDRLGEALQTTLQASLGDeAFPEALAEAWRQLYAQASQEIQLQY-
1777 >tr|A0A0N8ALQ3|A0A0N8ALQ3_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
1778 ----------------------------TKARLN----NCMLLFSE-----k--LAAFLaQASPSWPVWNVVIHPCfs--qelMANQLNVLGGAHQ-PRGATPVMLEQFXXXXSPPSSSSSSRKP-PASRNSSPN--------------
1779 >tr|A0A0P5ANB1|A0A0P5ANB1_9CRUS Putative di-domain hemoglobin OS=Daphnia magna PE=3 SV=1
1780 --GGNDGVETVSDQSNLFVVF-AI-FGQGIDGNASEFDEVLLGAGSLlEELDedggNDGVAVTpDVFPaglniadlVGGQFSLGISQIfgflevlgdASdqsAHTVLPGLSGL-G-VEGAAQRFSKDFLSDVTELLEHDGVSSFNAEARQAWKNGMRAL---------
1781 >tr|A0A0P5ESR8|A0A0P5ESR8_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
1782 ----------------------------------------------FlEDASelleHDGGSS----TGFMGTTESVQLVghqllaeqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGNIGELAEHCLVL--GVGLDEA-EEDLGSDISV----------
1783 >tr|A0A0P5I7S0|A0A0P5I7S0_9CRUS Uncharacterized protein OS=Daphnia magna PE=4 SV=1
1784 ----------------------------------------------FlEDAAelleHDGGSS----TGLMGTTESVQLVghqllagqgld--ddVQTGQDGVGLGQE-VSVAQKLGLGKISEGLEHLLVL--GVVLDE-TEEDLGRHISVL---------
1785 >SRR2546423_8132340
1786 --------------------LADVADEMFTARLLELEPQWQRVLSD--------EPTEWGRRLLRAIRQAVASFTCLGGFAEALRELGgVPAAHVGYRDYERQGAAFVGRLEHSLDKPMAGAMRESWQRVFRLLAEM------
1787 >SRR5260221_7941029
1788 -----------------------IAEAMFTARLLELEPQWQAVLSD--------ERRQPTQRLLHALRQAVAGFTRLSGFEAALKELGaIPVKGCSHGDYESLGAAFIARLERSRLGPRAHQMRERGETGFSPLSX-------
1789 >SRR5262245_33028046
1790 --------EHADHNYDSNLRNNANFFHSFYSRLFESSDEIAKLFEQRNV-----TMAEQYRKLDHAMVSILAFNPRLRaTTLDPQIESHA-NFGLSAAHFGLFREAFLHALRETQGA--DEYSQEAWRAILNPALTYMRDK--
1791 >SRR5436309_12080688
1792 ------------ASFAKLLAVWEPLMHRFHAHLEQLNPRLRYHLPPA--------LL---RYVRFELLQAVRQQT-PMEVGSGLRRFgvHLRAQGFEGPDLDTLGAAWLVALDEVLGDRFDSEAREQWLRFYKVLRSAF-----
1793 >tr|A0A139A347|A0A139A347_GONPR Uncharacterized protein OS=Gonapodya prolifera JEL478 OX=1344416 GN=M427DRAFT_73171 PE=4 SV=1
1794 -MLSAEQARLLKKNWKDIGASSVanpmmFVVAQFYRRLLRK-KGYKRIFEGIDIE-------TQYFKMQGALTACVEfaeNLDKFADTIRRIGARHA-RYNMTPNMMNDVVDSLVPSLKEFsldHGITWNEEIEEAYDEWLEQVTGYF-----
1795 >ERR1740139_1939294
1796 ---DSDTIAVVKQTWKAITALPeqqEYVGMRLLHNlhpcyetsltfllvielyylsYLRVVPSARAFFPPTsDSLIDDESFRESASNLMMCIDKAINTLENQRhlrfkALLQTYGKKLS-RLHIPPSCYTMAWFALIETLQDVLEDRFTELMLAYWIDIIDPINT-------
1797 >ERR1712129_538146
1798 -------------------------HGDISSInhpvyytftllnkfthdsYLRVVPSARYFIPVIsDDDI-----TEKGIYLIACIDRVVRLLERQEkrrlqVLLRSYGRILL-RYDINPSNYTTAWLALIDTLQDILKNSFTELMLAYWIDIMEPTNL-------
1799 >tr|A0A1Y6FH01|A0A1Y6FH01_9SPHN Uncharacterized protein OS=Altererythrobacter xiamenensis OX=1316679 GN=SAMN06297468_2444 PE=4 SV=1
1800 -------STLAERSFERLAEQRGDITQDVLERYYRRYPDGRASFEHHGL--GN-RAELEGRMVSTTAFLLMQWAQDPGGTRIEQGTTivhHQDTLEIGPRLYLGLIDAVLEVLFETIPDE-SAEERAFWLSLRGEIADFLE----
1801 >ERR1711879_742838
1802 ---------KVFQSYGRSC-NNMVFFEDFYSIFMTKSPDVLNMFANTDME-------AQRALLRSGILWLGMHARGMpDTKIRALGESHSKkKDEHQPHVLFHVAGRSDGNAFPPRP-G----LHSRTGANLAPYPTAHVT---
1803 >ERR1712080_808083
1804 ---TAGDVQVILRNWESVWGaqfsgRRVAIGQAVFANFLDRVPDAKDLFKRVKVdQPDSPEFKAHIIRIVNGIDNVLNPLVLILVSnscLVSML----SEMASRLPCSRS----WVPLSTMFFP---------------------------
1805 >tr|M6F3R8|M6F3R8_9LEPT Uncharacterized protein OS=Leptospira kirschneri serovar Bulgarica str. Nikolaevo OX=1240687 GN=LEP1GSC008_4081 PE=4 SV=1
1806 MNISENQIRSLNESFDIVNLDRIKFAELFFIYLKENHPKYENIFSRIQL--------EDVKHFMNSARNISLSSVQYSQLERAIQNFgvECLKICNQAEEIPILEKAWLFALEKWLGPWYSHEVEKSWQEVFKMIHTS------
1807 >SRR6478735_3884488
1808 ---------------------------------------VRRTTLY--------MPRP-DGRGGTMKPVVAAGSL----AIMAFVTVgaqAP-APTPQDRMYAAVRSDDT----AAVSALLQGGA--------------------
1809 >tr|Q25689|Q25689_PSEDC Hemoglobin OS=Pseudoterranova decipiens OX=6271 GN=hemoglobin PE=2 SV=1
1810 --------------------HQKQNGIDLYKHMFEHYPHMRKAFKGReNFtkedVQKDAFFVNKDTRFCWPFVCCDSSYDDEPtfdYFVDALMDRHI-KDDIhlPQEQWHEFWKLFAEYLNEKSHQHLTEAEKHAWSTIGE-----------
1811 >tr|A0A2P8XQA5|A0A2P8XQA5_BLAGE Uncharacterized protein OS=Blattella germanica OX=6973 GN=C0J52_27026 PE=3 SV=1
1812 --LAREEKKFITESWHAFMRLPPANSVDAFVKFLQENPKYIKFFKSVDGIPledlrYSFRVPKHVTAVLLYVNSMVHCLDNADAMFflsLQVGLMHS-NMGLTVEDFKLFNGYMVNILEDELG--LNDEGVAVWNKVLEIFM--------
1813 >ERR1740121_2035324
1814 -----------------FTPLt-----Cqwa-----TPHDGPAQHVL-------------------CEDGHFahFATDKCesAgHG--ArvQCPSDMPEMcaDttcgggqehccrpaggCTGgERPCPT--------TASASgSA--SgsaSGSASSRRLAgIDYE-----------
1815 >ERR1719240_2235476
1816 -----------YE---DEE---------------------------------------------------------------------GAqvdvmkgEDALVATADLLYQKMSEDAN---MQT-lLGNIELAELAsKLQKALa---------
1817 >ERR1740122_169377
1818 ----K------GE--ADKSG-nAEAAGGgqGDTPETGAAQDTAAGV-------------------TDEHS--------KA--LgieISS--FDELkvDqkciaaaIDAwKLFISTAESREAAGEAV---YNA-lFEGAPS--LQALFVTPRAE------
1819 >ERR1719243_286169
1820 -------------------------------------SHPVNV-------------------LVSDTMwkGY----t-vRG--IrrvNYY--VKYMmlTrdgnvsqALGwFKDAADCKIISH-PVNVLVsDT--MwKGIVRKQFLGgRLWFII---S-----
1821 >ERR1719158_147189
1822 -----------RV--CYLYPLvhcNILAVLrelnfdGAAESLCLDAPALLPT-------------------MLDGLIwrSR----vTeNG--QrrvNYY--IKYFivDaeggfskTTEvMTDNGDPTIVCR-PVVSLVtDM--IwGRVAFRTFLYgKAWFLF---T-----
1823 >ERR1712071_338654
1824 --PTAEEIALIRESWPIVKKNKN-VFVEFVLEHFRVHPKTQDLLPEFAnLAiadmPSNKfFVQLTETYVVMAMQEIIDNLDNAGVLTDLLQCLNS-NWYVdyvslDRQN-RETLRIRRVGQEQKSYSRNMESneiQQQRCPQNLRQAVH-------
1825 >ERR1711988_652294
1826 --PSAGEIELIRESWPVIKKNKN-VLAEFVLEHFRVHPKTQELLPELAgIAladlPNNAyFVQLSETYVVLATNEIVDNLDNAGVLVNKLGENED-FQVLayyssAVATFivtnLDQEDILTHILVQQTKP--------------EQFVD-------
1827 >ERR1711911_417752
1828 ----------------------------------------AisyPVFPSTSsLKy---------------------------DSLKKYLlDAFIf--NYCT---------LIFFL-------------fIKGNWQLgdgGIgrRIRYS-------
1829 >tr|A0A061RCY3|A0A061RCY3_9CHLO Hemoglobin-like flavoprotein OS=Tetraselmis sp. GSL018 GN=TSPGSL018_8354 PE=3 SV=1
1830 ------------------------VGAGFLKLYAQRNPWAVEQFSF-GLR------PQHAEKMGLALELIVNSATRPQVLQHQLRVLalgHV-QMGIKPEMFKSFEEALFAFLGQVLGAhnTFDEETEGAWRWMWGIVNAVFTQ---
1831 >ERR1719232_1195758
1832 ------ETVIIKDTWETIHKQVKAIGMEAFEKLFALNSDMSAYLPQTDDldqdetRRLSDKVKSHAKLTMETLEQVIAAIPDMTEVYNVITKMKK--LHPQTGLLEVIGPVFCNTTRHFllIQGRWSLDVQRAWLALFGEVSAMIRASY-
1833 >ERR1719189_1497217
1834 ------GRQADEQ----VGREEAGPGHRGHRP----AQDDPAHLRgarDCGQrvrgraRRHGDRGV-QGRGQGEQS-QH-----------------HR--HQGS------HGQ---------LHGRHX-----------------------
1835 >ERR550519_213
1836 ------NIVLLRDTWSVIHRQVNTLGMETFQKLFEINSEVSHYVSpscpDLDPdciDSTTQAIKAHATHTITILHNTVSNLCNLgd--LAGE---------------MNRLGKLHCDLGIDHgiL----------------------------
1837 >ERR1712051_111803
1838 -------------------------------------------------------------------SNF--HASDGHlmdgAFDPnISQIFSF-FYLFQNCEMLVFGPHFVASAMYYLPSPLrEKSTQESWLKLFSVITEIMMS---
1839 >SRR3990167_4175368
1840 -GLTDGEKGMIQQSWNLLS--KVEFTKILYKKIFELAPHVRCLFQNS--------IESQHENFsimMDMmINEHINDELDLFAVVLQLAKRHF-HYKVKTDYYSIFRDGFLWSLEQTLSIEtlnktITNestnqptTIKSIWLKFVNYLISVMV----
1841 >ERR1712212_288737
1842 -LLTDDELFSVGNLWTNLRESSADSGLYIFQHWFDMFPEVVESFDFAkDQYgnillnlMQTKKMRNHAIGVMNKLDAMMMRLFKRDPevakLIYDVGVHHQ-TRNINEDEMTKMSKSIYSAVQDINVGPHSDKELAALHNLLEVVSYHFKR---
1843 >ERR1719167_330163
1844 -DLTDKERELIQHTWWRFREE-PYCRLRIMTHYFSANSSIKKKFQRKNEENaangnlmtamVSWNIRRFSIRLVEFMDKVVRDLETENyQDIYDISELqgakHYRlKRMVEPGDMEALGQSIQTTISEHFGEKFNRSHILAWRRLFIVICSRF-----
1845 >ERR1719378_576485
1846 -DLTDKERELIQHTWWRFREE-PYCRLRIMTPYFSANSSIKKKFQRKNEENaangnlmtamVSWNIRRFSIRLVEFMDKVVRDLETENyQDIYDISELqgakHYRlKRWWNRETWKLSANRSRQQFR-------------------------------
1847 >tr|A0A1I8F573|A0A1I8F573_9PLAT Uncharacterized protein OS=Macrostomum lignano OX=282301 PE=4 SV=1
1848 --------------------------------------------------STNQKPPSDGDRLLYWINVQ-------PTAQPQllrGASEGC-VRLFSPRILTRSCISSNLCVRAGRGRNS----SSTeTTSAEGADAVVAA----
1849 >ERR1719265_1594411
1850 -------VDTIVKDWAGLD--LEKLGDTTFGMMVQNNPEIKTIFGGDVhPGVAQQGLKSQAATFVGFMSYAMTWLKKkdfivLEQKMVELGQRHV-HYGVNVSHFVSFQEAMFTALREQLGTRFE-DNKYAWTF--------------
1851 >tr|A0A0N5AG16|A0A0N5AG16_9BILA Uncharacterized protein OS=Syphacia muris PE=3 SV=1
1852 --PSRRQCCILHKSWHRAQQCgLD-IGSRIVMQVTKNEPTVWRTVGLTNATGadikYDKNIQYQAALFTKALTTIMSKIDDPEAVseyCRELGRRHVRhvKKGFQTRWWDTFAESLTECVIEWEGttvdltslvfhatkicGQRCKEALNGWRKLVIFIISEMRAGF-
1853 >SRR3989338_2963815
1854 ----PHQMTPLYHLYKENVPpqKERELGLLFYKLLFDSNPELLDFFANVDLD-------HLSDHLVQTIRLFLESrnsLVSLVPAMKALGIIHQ-RAMIPSWAFPLVIENMAKLFSILLGDRFTVELASALVLSFDLLTSFV-----
1855 >SRR3990167_6716616
1856 -----EYENPIYStlknIWlETVSTpeIKSAVGELFYKNLFQYHPELLEYFNNVDMD-------SLALHLSQALDFVFQSinkIGDYksqwRTVLEHLGEVHR-AALIPTWGYPIIGQQILKIFPYNEKAGFSTKQL--etaLATLYREIVIIM-----
1857 >tr|A0A0Q4Y6B0|A0A0Q4Y6B0_9BURK Uncharacterized protein OS=Pseudorhodoferax sp. Leaf267 GN=ASF43_05025 PE=4 SV=1
1858 ------HRVLAKYAYRQwVEPLGMQFSQAFYTRFFQDDKASRAIFERALGPRAAgLilVDDAHHNKLVGSLGKVLNYRRGsPPSSIDDLVPSHR-DKGITIEHLRHFREAFLKTLEAQIDAsdPEKRAVVDAWRQLFEPVLDAMAS---
1859 >tr|A0A1Y5SIU2|A0A1Y5SIU2_9RHOB Uncharacterized protein OS=Roseisalinus antarcticus OX=254357 GN=ROA7023_01630 PE=4 SV=1
1860 -----PQAELVADSLSRVGDKVIWLASDYYEALFDASPQLHGVLPH--------QMSEQTNMLGHALAHALANLRDPDGAAPMAQDAglADRSARMPPRMRRTIVRTLVHALSLWHGPTWTKDHARAWNEGLLGVAPL------
1861 >SRR5690606_37396704
1862 --FSDTDTYILHTGLKWIEEAPETFAAKLYQRLLRDHPECQASLHAIGL-------ESFNRNFIHFLKMVKEELLERHTIHVAPREFlalHALpvEKVRHSNYVIKMGRTFLDIFAELAEDAWSPALESTWNKAIEEVKIALW----
1863 >SRR4029453_11903763
1864 -PMTDAELALFHDSLTRCTSQ-PPFLERFYTLFLAASDEVRHKFRQTD-------FQKQRRLLQASFYMVMLQADGKpEGavHFERIADLHSQrHLDIPPHLYDLWLDCLMQAVREYDP-EWMPGTGGLFWGRVGTCIVFFYMISV
1865 >tr|A0A1R1LGI5|A0A1R1LGI5_9GAMM Uncharacterized protein OS=Motiliproteus sp. MSK22-1 OX=1897630 GN=BGP75_23395 PE=4 SV=1
1866 ------QLDKIYSTLQLLDdEKSEKLINETYSIFFNAHPEAVLLWSKDDPE-------SRSKMFNGVILTIIDNLTRPDIFKnNLLSDVkdHD-EYGVDKEMYGGFFLSLTEALKKTLGSEFNQEMELAWKHQLAHIRE-------
1867 >tr|A0A1H1BYI0|A0A1H1BYI0_9ACTN Group 1 truncated hemoglobin OS=Thermostaphylospora chromogena OX=35622 GN=SAMN04489764_1195 PE=3 SV=1
1868 -------------LYEKIGGgpAVREVVDAFYTDVL-GDTDLKPYFDGIDMA----RLKRHMVVLLC---SVLGGPEGY--RGRELGEAHK-NLGISDEHYAKVGDKLVTALRDH-----------------------------
1869 >tr|A0A1R2BTD0|A0A1R2BTD0_9CILI Uncharacterized protein OS=Stentor coeruleus OX=5963 GN=SteCoe_19762 PE=4 SV=1
1870 -------------IYDRYGGqpFWERILDVFYTKNL-AEPTLQGFFIGKDVE----RAKAMNRSLLA---AALRPEGEH--FPVSIKRTHR-NMDISDAQFGKFAENLISTLGEN-----------------------------
1871 >tr|A0A218QUH5|A0A218QUH5_9CYAN Group 1 truncated hemoglobin OS=Tolypothrix sp. NIES-4075 OX=2005459 GN=NIES4075_64370 PE=3 SV=1
1872 -------------LYDKLGGkpTLDKVVQDFHKRIL-ADNTLQPFFANTDME----KQRQHQVAFFA---QIFEGPNEY--KGRAMEA-tHA-GMNLQQPHFDAIVSHLKESMASV-----------------------------
1873 >tr|A0A1Z4FY87|A0A1Z4FY87_9CYAN Group 1 truncated hemoglobin OS=Calothrix sp. NIES-2098 OX=1954171 GN=NIES2098_33650 PE=3 SV=1
1874 -------------LYEKIGGqaTLDKVVADLHKRIQ-ADSSVNTFFAKTDMA----KQRSHFVAFVA---QLLEGPKQY--AGRPMDK-tHT-GMNIQPQHFDTIAKHLSDAMAAN-----------------------------
1875 >tr|A0A0T6BC68|A0A0T6BC68_9SCAR Uncharacterized protein OS=Oryctes borbonicus OX=1629725 GN=AMK59_2266 PE=3 SV=1
1876 -GLTSQQKSLIQSTFNVIRPHILNVGIDLFVRVLEVEPEHHRVLPfsHIPIadLHESFEFKFHCLAVVYSCSAIIDHLHDDGILIPLMKKYASdLKASIPLDIFQMIHDPLLEALDVHDDVKISEEALEAVRTLLRNLTNFLID---
1877 >SRR5689334_189301
1878 -------LDALETSLDLVSPHG----SELMDAFFAERP-----FPAGD-------AGAQRAATLRLMGLLRLCLRDVHSVVALVRDLGA-RHGAQREQ--------------------------------------------
1879 >SoimicmetaTmtLPA_FD_contig_71_176585_length_314_multi_3_in_0_out_0_1 # 2 # 220 # -1 # ID=1957230_1;partial=10;start_type=ATG;rbs_motif=AGGA/GGAG/GAGG;rbs_spacer=11-12bp;gc_cont=0.685
1880 -----------------------RGGaveevQGPESALLESPPSLDRVATDRS--------AMIPLG-ATGLHGIMTSM--taPSMLqdlVLSLASQHL-DVVLSPPRAIVLRDAILDLFQQELGDGFDSKARSGLSLILNYVCGSFL----
1881 >ERR1712159_177610
1882 ---STSSLNAVKNSIPLIQQHGNAIAENFYVQ--QIQPTNITFFNRAHFTS-----GQQAQTLSQFLVLLAQRSDNLELMnthLRRISNKHV-GFGIKPQHYPIFFENLFVAFKEVLGTKATPELISSWKELVSLVQEG------
1883 >ERR1712159_799488
1884 ---STSSLNAVKNSIPLIQQHGNAIAENFYVQ--QIQPTNVPFFNRAHFAS-----GQQAQTLSQFLVLLAQRSDNLELMnthLEESPTNML-DSESNHNTTRSSS-----------KTCSLPSKKS------------------
1885 >ERR1719323_2894579
1886 ---KVHRQTYDICD----------LILQHIQIITVHCILIQDIDQCCHL-----KTDKQVAAVVNILYQYAMNCDNLNVLENEIAdiiGLAV-NLNMEAWQYPLIAQSLVE----------------------------------
1887 >ERR1711868_248053
1888 ---------MIKGTAKTIKEKGSSIITRMHQNLVNKHKEFKTIFPEEIL-----KDAIHMQKAVGLLHGYASNCDNMPVIEADISelvGILI-NVGVENDHYPLVAEALVEAIGTCLGSDTNAETVDAWKQALDFMVVHF-----
1889 >tr|G5ZYB7|G5ZYB7_9PROT Truncated hemoglobin OS=SAR116 cluster alpha proteobacterium HIMB100 OX=909943 GN=HIMB100_00010220 PE=4 SV=1
1890 ----------------------SKLVSELYEELS-QNEITAPYFENSNMT----SLMDHQVKFLSQAL---GGPEQY--TGQAMNAAHT-GLKITEAAFTEVAKTIQFILEDN-----------------------------
1891 >SRR5688500_9373349
1892 -------LPYTTLFRSALGDDAVGMAAELMDRLIADHPHDAHAFMNPEAA--RERMTRETLEAM--LGVA-AREPWGETTIANFVDLHH-NYAsFGADDYAARFAMTMAVMERGAGARGPGGASSAWRRQAA-----------
1893 >ERR1719365_124985
1894 -EMSGKQKKIVWRTWNSMLGkqesDYNDFGINFVLWLFDNFPKMRNKFDELYGRsrnslIVDQHFIAHTENVVKELDRLIKDLPFPRLLSKRISKLadsHLNqEP--------------------------------------------------
1895 >ERR1719199_1194134
1896 -----THAGYIEKSRESVLNlDAAQLGADIHVKFLNVYPAAASLFQKT-L-----RM-LITTKIMGTLMAVIS---DPTGTledVRAVGVRHT-KYGISERYLLPFGAMLWEIVGTMLPGMWSDEHSAAWAFYLDFIASTMTRA--
1897 >ERR1719359_1737517
1898 -----------------------SFGEAFRFNLGMMAPEFMAMFKTLTAE-------QFTDQFTVMVGQIVNYIDDPPKLLEDlyiLSVRHL-HYNTKPGNSLSLGKQ-------------------SWLLCEASFHRIGIG---
1899 >ERR1719487_2229452
1900 -----------------------SAALSL--------P-------T-EQE-------SPVTMTAEA----VQMVQDSL--RRVdsaVQV-----RDAMEDvFFPHLF---------------------------------------
1901 >tr|A0A2E0SMS8|A0A2E0SMS8_9PLAN Uncharacterized protein OS=Planctomyces sp. OX=37635 GN=CMJ46_12130 PE=4 SV=1
1902 --ISERQYHLIHDSYRRCM-LADDFLVMFHRNFMEKSPQIPKFFAD--H-----TLQQQHRILAKSVARLVSFVDGKPQaeqdMRDTMRILHDGNLRLTPEHYAFWATALMETICTI-DEACNDEVAVAWEQTISYGTGVLK----
1903 >SRR5690349_6204932
1904 -ILTDEHRHFIRTSWEKINKRHekTTLGILMFEKVFAFLPDLRNVFGLNDSSvsetDRNENFRRHTSLVVNLIDLIIRNIFEMEAemgpVLLMYGRRHFLKHDLVFQE------NQLVAFAQGLCEFfeeevdhdddnsLASETKAAWNIF-------------
1905 >ERR550537_1224553
1906 ----------------------NVVGRVVFMNIFKAAPEAKALFPGAREEnmwGPGSKMEQHVIKVVQTLAVAIGGLKDLGPIVPVLEvGLgvgIL-RNRHILSTIHLFRTFWllcIPMIQRIVGHPsscQTQRWSSRCRVVLI-----------
1907 >tr|B5DW13|B5DW13_DROPS Uncharacterized protein OS=Drosophila pseudoobscura pseudoobscura OX=46245 GN=Dpse\GA26483 PE=3 SV=1
1908 -GFTLCEKVALRQAWNLIRPRERRFGQDVFYTFLNEWYWSISKFKKG-EDINIALLHAHALTFIRFVGALINESDPI-MFQVMINENnqtHS-RCRVGADYIAMLGQALTDYILKVLDKVRSPSLEQGLQRIVEKF---------
1909 >tr|A0A1I8CTR5|A0A1I8CTR5_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=3 SV=1
1910 -KMTASQKSVLISSWKFIKPNANFIMRKIFTELESVSPKVKQIFAKAailDCfskesSDaKACTVDEHVRLLSRFIDDVISNIDKEKEVrniLRKVGQSHAGlsnGSLFTSSLWEFLGEIAVAKICQVDYVQKSREAAKAWRLLIAFMTDELRNAF-
1911 >SRR5258705_2725614
1912 -----SSFPPGPGELRNCCAHRRRRRRALLPAPLRARPVARAHVLR-R-------HAL-RDHFEAALALIIRNLDEMEALAESLLESEW-----------------------------------------------------
1913 >tr|A0A1Z5JZN5|A0A1Z5JZN5_FISSO Uncharacterized protein OS=Fistulifera solaris OX=1519565 GN=FisN_19Hh029 PE=3 SV=1
1914 --ISPDVVSAVQDSWERIKDSspawEDDFGDRFLKSIFTKAPLsYKLLFPFGTTSgpamFESEDFIEAARTASTLMDMSVSLLECeMDALFGQlleIGLEHANFPRIQTSHWSMMRDALLRTLASYssaLSEDCKdlEKVLSAWSLVFDNLSNEMVET--
1915 >ERR1719329_2064399
1916 ------------SLFVRLGGDvaVDAAVERFYERIL-QDPLLAQIFSRVNL-------AGLKNMQRKFLTMAFGGPDLYDG--LSLRDAHQ-GKGITEAHFAAVAGHLSATLREmAVPDRQHDEVMAIAASTQGNIV--------
1917 >tr|A0A1I8MDY2|A0A1I8MDY2_MUSDO Uncharacterized protein OS=Musca domestica OX=7370 GN=101890360 PE=4 SV=1
1918 NGFTATEIASLRNGWRHFKRRFGYHSKQIFMKFYQEHEQMLEKFRNRMGKFNMQQLHRHPQELLQVYGNLIEqGLDNMtymHVLMTAISQRHR-MFGVTGYEIKLQTDhitlYILALLEKII----SPTFVSGLEKLSRLIN--------
1919 >tr|A0A1Q9NTV3|A0A1Q9NTV3_9ARCH Flavohemoprotein OS=Candidatus Heimdallarchaeota archaeon LC_3 OX=1841598 GN=hmp PE=4 SV=1
1920 --FTSKEADILTQSLKALEEKTDDLPKLFYYHFLEPtsNKEIISLFNKS-------DMTKQYMMFHQSLAIIVSSIKDSHllnQILKDLVKRHK-NYGVKYAHVQIFSSAFYKTIEEIFPKD--EKVKILWIKLINFVLSKFN----
1921 >SRR3990167_8190046
1922 --MDNAQKLhIVDTILERASELAGDITDSVMAEFYRGDPEAKDLFTHHCPV---DTIRIEAGTVEQALYCFMRWFQSPGEIRILLLGSvphHVETLKVPVNYYHRFLQAMATVIRKTIPAE-SREEIDVWNEICGDLGEIVDA---
1923 >SRR5690625_7611079
1924 ------------------------CALCFYLCFcTDTPPTRTYILSLHDAL---PICQLEGEMVENSLYCLMSWFESPGEIEMLLAGSvphHEETLRVPPHWYEELLEATRSEEHTSELQS-RGHLVC----------CLLLE---
1925 >tr|A0A0K6SA08|A0A0K6SA08_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_8920.t1.CR2 PE=3 SV=1
1926 --------------------VSAAMAEKFFELVPKRAPNLRMIFEKRqDIY------KH---HFGEITKRLLAYLDSPEEVWKedpELAIKHI-EFGVMPCDVPVFANVFLQILAELAGPAWTQRHRDTWDKLFSIVSGALAE---
1927 >SRR5690606_8675308
1928 -----IDRDLIEASFEHAAETLGDITPFAYQHFFARYPQAEELFLCKG-VQFKNDL--QNQMVRDAIYAFLEYLDTPDEVDIVFKytiPQHL-DLNIPMLYFNGLLEAVAEVVCGATPEAGKAATEASWKVLLESIE--------
1929 >tr|A0A1W0WMU5|A0A1W0WMU5_HYPDU Uncharacterized protein OS=Hypsibius dujardini OX=232323 GN=BV898_09357 PE=4 SV=1
1930 -ALTHVQINLVRESWRWLNfnrPLQETAVRFFLDFYFKQNPDCLPMFGMKTVDHYNKAFSIHALTVMHAIKYAVEYIGNPEQfqrLFRTVGQTHL-RFGLTDLHVERFLEQWLAFLRANDAKVFDAATVEAWNLAGRIVVSQI-----
1931 >ERR1719354_143580
1932 -------------------------------------------------------------AFWDILDHICGHLDRLENLIPQLRDFalQCFNSGLFSDDYNILGECLVTILSTNFDP-WEETHSDSWAWCLDLVMSTLVT---
1933 >SRR5215207_8455447
1934 -------------DFDTVV--CSSFAERFYSRLFTHEGGehLRALFPDN--------IQPQHAQFTTMLGDILAYNFRIGrsLLGD-TFRKHI-DFNIRESDVDVFRKAFVEEVGSTFLH--LG----------------------
1935 >SRR5271170_3229012
1936 -----------------------------------GRECRRDNrLLLLDAPPATPLgtSqyLDARHRTVSCTSantgvctgpYQPDQLKD---------RKT--VLGGGLR-------------LAQPGSRLSQPLPGRFGESAGX----------
1937 >SRR5216684_1000550
1938 -----------------------------------LHQGRHRPRVHLGL-------------------------------------------RGGSPAHPPRDPRPRHKRGAIHRA--drhVPPPrPPRQSGAQAdfSDHSRL-----
1939 >SRR5271154_4753691
1940 -----------------------------------LHQCKHRC-LHWSLPARSA-qrSQDGP----RRRVtlqpPPVRNRGR---------GVSAlsllrsswpniRFYRVETVSCPRDRLCIDLDPISTVKRNLA------GVsDVYLL-RS-------
1941 >SRR4051812_37657562
1942 -------------------------------------------------------------------------------------------------XMSSGVRFTRWRCESIRARLRapsdhcvTVPVkPSRRSDSAVsaRKAEQ------
1943 >DeeseametaMP0200_FD_k123_38240_1 # 1 # 450 # -1 # ID=33738_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.658
1944 -----------------------------------PHACLSTChAANPP----VAI----RARRSSAEGYAR---------------SD--DARGGTA-------------SPPPGRELSSPASAIDPFSRGAISFVSF----
1945 >tr|H3FA75|H3FA75_PRIPA Uncharacterized protein OS=Pristionchus pacificus OX=54126 GN=WBGene00108645 PE=4 SV=1
1946 -GLTAYQQKLLIQCWPNIYSTGpgGQFASAIYNRLQNSCPKAKQLLAKANGVavFANSDvdcTAMHSRVTIELLDTAIRNLDAdHAKLTAYlieVGRSHRplRQEGLAIAVWDDLADSLMECVCRYDAVKKHKELRRAWLALIAYIVDNLKNL--
1947 >SRR5437879_6948005
1948 -----------------------------------------------------------------------PPSTcsWTtslsagrgvRPISSVSASPTA-STaaTLPPHLYDFWLDCLLHAAKECD-QQWSPEVAAAWRYMMGSCSSRLAT---
1949 >tr|A0A0V1B190|A0A0V1B190_TRISP Uncharacterized protein OS=Trichinella spiralis OX=6334 GN=T01_13586 PE=3 SV=1
1950 --LNPKEVILTRNVWAALKEKhQHLVGMEIFRQIFNRRPDLKSLFGVSALdtemALNSTRLHRHTMIFQDVIDILMVNISNVDVniadSLIDLGAQHWvlTKRGFDPAYWLIFGDVLFDLVENVTRKLpSRKRSTNAWRKTIAFMLDCMQIGY-
1951 >SRR5437762_8994925
1952 ------AAS----------------SDHHIPSQLAAGTRAKDRKGGVE-------YPGHVCRGQRRCARDRPHILAsPELCIPRAcrtksA--------------AFCAVCENRCCETCR-SPPAKKPETARRSAERTG---------
1953 >ERR1719204_228700
1954 -QLSPSTVKAVQTSWNNIRSGGpGYFGHLLFSYWLAEHPRALGVYSMYyhdDKkHrvSLLPRFHRLGEVYAKRIDYWVTNLEEPVKLFLMLyehGFNHA-KRGVNLRDFPNMTPSLMDALATALGRQMTLKLYDQWKDFWKFIFMQIAEG--
1955 >tr|A0A2A6CS87|A0A2A6CS87_PRIPA Glb-5 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_35904 PE=4 SV=1
1956 -----DETHLARAHWILLHKMnkQGTVIQSTFEHLMTEFKHTRPIWQFGrniDENvkdwnkelHEDFYFRHHCASVQAAITMIMENKDDIVSLTRVLnevGAHHF-FYDAYEPHLILFEDAMITAMKKVLKGveELDEETERSWRVLLQLTRKHLIEG--
1957 >tr|A0A090LKP0|A0A090LKP0_STRRB Globin-like domain and Globin, structural domain-containing protein OS=Strongyloides ratti OX=34506 GN=SRAE_2000335800 PE=4 SV=
1958 -ELPKADKDIIISTYNILLQADPELFSKAWIMSASRSTSIRKAFSLIDPnsTHIEVDFTKFSAVIERFFTRIICEekLVNesFEKSCINLGKKHVDfvPIGFHSNYWDIFMNCMIDVIAETVIIAFNednkqqQQVQKCWNKFVGRIVFLMQSGFK
1959 >tr|A0A2G9URY2|A0A2G9URY2_TELCI Uncharacterized protein OS=Teladorsagia circumcincta OX=45464 GN=TELCIR_05034 PE=4 SV=1
1960 -PIANKTKKLVIQEWPRMLEHQPNLFGIVWISSATRSNSIKKTFGIGANenPEDNEAFMKIWPTVQQFFHKL---------------------------------VCMAETVDQTLCEYYTddlkrAEMILAWQRVFNTIVHHMRTGYI
1961 >tr|A0A0M3JT43|A0A0M3JT43_ANISI Uncharacterized protein OS=Anisakis simplex OX=6269 PE=3 SV=1
1962 -SFTTPQLTSVFNAHFSMIQLNPDVIKDCWIKTSKRSSSIKKAFGMLEHeePETNASFMNLPITIQAFFKELIFEldCDSvkIRQRCEQLGARHVDfsERGFHSNFWDIFQVCTIEVIAEC--NLGLnedqhRSYELAWIHLLSSVVKSMRNGYT
1963 >tr|A0A077Z0R2|A0A077Z0R2_TRITR Globin OS=Trichuris trichiura OX=36087 GN=TTRE_0000042901 PE=3 SV=1
1964 --FTAKEFAIAELTWAKLKVRfNNQVGMEIFRQIFGSCPEVKDLFGLQNKedqkALCDQRMARHTAIFQDIIELLIVDLSQRsDSLtqsLITLGAQHWffTQRGFRPEFWVIFGNTLVNLIRSLPLSlSQRYLARRTWIKLIVYLLDCVMLGY-
1965 >tr|A0A0N5DS84|A0A0N5DS84_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
1966 --FTPKEFAIAELTWAKLKLRfNNQVGLEIFRQIFASCSQVKGLFGLQNKedhtALGDQRMARHTAIFQDIIELLIVDLSKRsDSLtqsLITLGAQHWffNQRGFRPEYWVIFGNVLVNLIRSLPLSlSQRYLARRTWVKLIVYLLDCVLFGY-
1967 >tr|A0A016V5D5|A0A016V5D5_9BILA Uncharacterized protein OS=Ancylostoma ceylanicum GN=Acey_s0017.g3216 PE=3 SV=1
1968 --LNRMQRRALRFTWHRLQTRnggkrVENVFEEVFDRLVRALPCVRDMFTTRMFlcamArNETASLRDHAKVTVKMFDVVLKNMDTDPskrtdtgfPLDpKIIGRAHGplRPYGLTGQYWEKLGETIIDVVLGQEAVRDLPGAGQAWVIFTACLVDQMRAGF-
1969 >ERR1719187_3161387
1970 -ELTDDEINEVQQSWDLLTRSeggLREAGLTLNQQLLTAQPHHIRSFEKFRkykdfdDILKSPEFKTHSYSTVREISLVITNLKHPGVFtqlTQSIGFAHR-RANTPPNQMVDFKSVFINdFIPSQMADKATPNTIKAWEKFMTVFIEHVKEGL-
1971 >ERR1719481_246497
1972 -ELTDDEINEEQQSWDMMTRTegg-lREAGMTLNRQLLTAQHHHIRTFEQFKkykdfdDILKSPEFKAHSYSTVREISLVITNLKHAGTFtqlTQSIGFAHR-RAKVPPNQLVDFRSVFINdFIPSQMADKATPNTIKAWDKFMTVFINHVKEGL-
1973 >ERR1719347_979638
1974 --VTDEEMASINELWSCLRADAMHSSRFIFARFFEAHPEFLEPMPFVkDYygniSpkyMDTQEMQDYCLKFMSTLDAVMTRVFARdkEalQVMRDIGYSHH-EFGLTSDMTVKFMNKMHDSVLELWGTEASRRDSKALDNIFKTIATEINVG--
1975 >tr|A0A1I7TYQ0|A0A1I7TYQ0_9PELO Uncharacterized protein OS=Caenorhabditis tropicalis OX=1561998 PE=3 SV=1
1976 -GLTRDDKRIIETCWFKCSQKqLRKSSCDMFWDILHTDEDILRLFRLDHVSpnrlKDNEYFKSHASNLALVLNLVVTNLQDNfEQaqdALQALGYQHLhlIDRtHFQSMYWDIFTDCFE----RNPPPSFRkGAEREVWSRMILFIMGQMKTGYQ
1977 >ERR1719396_104066
1978 ---------NIIESWELLRFhpsLKEDLGTAIFRELFKEHPELREHFGLPlvGLdaLCKNQTFLSLSNQFVDVFARTMDTLGPDEELmdesIRELGEKCV-SIGIETSHLSLLRKPILSAVEKILLEDFDD---ESWKKFYSILATDL-----
1979 >ERR1719396_219220
1980 ---------NIIESWELLRFhpsLKEDLGTAIFRELFKEHPELREHFGLPlvGLdaLCKNQTFLSLSNQFVDVFARTMDTLGPDEELmdesIRELGKKCF-WKTLMMNHGKN----STPYWEQIWQREFQQ---DKRDKLYSYSNNNN-----
1981 >SRR5215467_3799544
1982 --------QQVSESYWRCCT-NPLFIEELYQTLFSKCGEIKQLFEQKNV-----SMKRQYAMLRYALDIFVDYPHDMTATFPDIARKHT---GLDPRFYETFIEALIETVGKCDPK-WVPSLEHAWRERMT-----------
1983 >tr|A0A1I7VXG1|A0A1I7VXG1_LOALO Uncharacterized protein OS=Loa loa OX=7209 GN=LOAG_10963 PE=4 SV=1
1984 -QLSSYQIHLLQQSWQRIRS-SPNFFINVFRTVIAKNTIAKELFRKTSIIdgftsYKCYDVKEHADSLIELIDFALQEIHSSTKVVQhrcmLMGATHCNTcENSMSSSWDQFGDSLAESIAKAEAIRGKRKCLQAWNTLLSFIVDRIKGGY-
1985 >SRR3954451_1828621
1986 --MDPADDALLRQTQGLLRESldfaggAVAVADRLRQALRAARPEVVAALPG--------DAATQTAKLAAGLVWLVDHLDQPPLLVGgsaRLGAALA-ACGVPPRGLQFVGAALAEALRAGSPaGEWRQEFELAWRSTWQHVYEWMQVT--
1987 >SRR5262249_5830581
1988 ------DVEVARDSYRRILDDVerqREFFHTFYGLFLRRCPEAAAVFEAKGYPalaqlggPRvedsAGRGPQPPNPLKSAIVMLiaFNILGEKEepTILDNLVDKHK-GFP--KRYYVAFQDALLETVVQFDDPsrcgMPPDELQHAWKQAIQPGGDYLID---
1989 >tr|A0A2A6CAG8|A0A2A6CAG8_PRIPA Glb-32 OS=Pristionchus pacificus OX=54126 GN=PRIPAC_40555 PE=4 SV=1
1990 -GLTPEQKRILETSWVKATPKqIRKATEDVFASIINHDRSLAVMFRLDDVPinriRENQAFKKHAANFALVLDLVIKNIPDNvDSCcqaLQALGGQHVslRDRGFDSIYWDVFTDCFENNPPATFK---TDIDREAWSAMILFILAQMKLGFR
1991 >tr|A0A0N4XT53|A0A0N4XT53_NIPBR Globin-like protein 26 (inferred by orthology to a C. elegans protein) OS=Nippostrongylus brasiliensis PE=3 SV=1
1992 --ALQALKVILRTTWRHMSKSGqGNCGSTIMRRLFIRNDRVKNVFHHNIMigglLepnaQETHNLQQHYSDIVQFLQFAISNLDHPSRITekcHEIGLKHR-KYktmGMKkkidkkylqAEHWDLLGEAITETIREYQGWKRHRESLRAANILVSFLVDRIRT---
1993 >SRR5215831_15107384
1994 ----------------------KLFFSKFYTNLFGRADDIEDRFKELD-------MERQYRILNLAIHKLLEFRPEQPAtqkQLRDLSLRHA-KLGLTNHAPAWNR-IH-LDLRGIGA--DGRSsGVAAADKALAX----------
1995 >ERR1719234_1549997
1996 --------------------------------------------------slwhrssIQLEGASNHNKALMNAIDSVMvEVLERRPMSksgIRDAGISHH-KFGIKRLDMDKLTTAILAAISDVLGDCdLDRKmlQLNAWKKFLNAIGDEFSVG--
1997 >SRR5262245_32700325
1998 --LNSNQRDLIRRNWDSssK---RYELCRRIYCRVFARRPEIRRIFSIGYDW----WRLEI-VTFADFVQSIVDNLDDAKRVrqsAFEFGRDHAkwRRFGFRSDFWVQLAESTTREcvyLDAAV--HPPDESLETWTKFVSIVF--------
1999 >tr|A0A2A6C3W4|A0A2A6C3W4_PRIPA Glb-17 OS=Pristionchus pacificus GN=PRIPAC_39254 PE=3 SV=1
2000 -ELTDEEVAAVRNVWIRAK--TEDIGKKILQTLIEKRPKFAEYFGILCqsDKldmnslKESKEFHLQAHRIQNFLDTAVGSLGYCpvtsiYDMAHRIGQIHF-YRGVnfGADNWLVFKRVTVDQVTKGVTSTqasqanllegtkepevveqhpmadvqnpFsgeNCLARLGWNKLMTVIVREMKRGF-
2001 >tr|A0A2G5SLB2|A0A2G5SLB2_9PELO Uncharacterized protein OS=Caenorhabditis nigoni GN=Cni-glb-17 PE=4 SV=1
2002 -EMSDEEVSAIREVWIRAK--TDNVGKKILQTLIEKRPKFAEYFGIQSESldiralNQSKEFHLQAHRIQNFLDTAVGSLGFCpissvYDMAHRIGQIHF-YRGVnfGADNWLVFKKVTVDQVTTGATDSskekdkdetnsngtangkvdteanpipvgiadinnvYsgeNCLARLGWNKLMTVIVREMKRGF-
2003 >tr|A0A0N4ZE39|A0A0N4ZE39_PARTI Uncharacterized protein OS=Parastrongyloides trichosuri PE=3 SV=1
2004 -DLTAEEIEAIRDIWLRAK--NESVGRKILLALIEKKPKFAEYFGIGSENvdpkelLGKREFQLQAHRIQGFLDTAVGSLGYCpmssiYDMAHRIGQIHF-YKGVnfGADNWLVFKKVTVDQVSRVNVEGkdrksnvslgkrnnsgdaedstaetprkesahsfndmYevsNCLARLGWNKFMTVIVREMKRGF-
2005 >SRR5512138_1182700
2006 --------RRVQGSYSTFQAtdRADRLYRTFYANLFASVPEARRMFAHTDWS-------RQYNAINEALKLLLDFDADPQRaadAAKQIGsvaLKHQ-QYGLGERELRAFEGALLHALRSC-G-ECKPATLEDWRMILAPGFHHMRGA--
2007 >SRR5687768_15481058
2008 -ELSDRTRDLLVQSLPLMEHRKDALIEGLARYLIGSTGD-----ANQ-------DSELVAIVLTELLIGQASHLVRSSALpdLDDIRLEHS-RLGVQGSHYSRFGDALTPVIRDVLGPKLPREVAGAWGDVFWTVINVI-----
2009 >SRR5687767_13070119
2010 --ISDRTRDLLAQSLPLMEQRKDALIDRLGAYLGG-AGD-----ADE-------DSELVAIMLTELLISQVGNLLRSGDLqdVGDVGHEHR-MLRIQGRHYSRYGDALSPVIRGVLGPQVPGEVAGAWGDAFWAVIRAV-----
2011 >tr|A0A0R3PFZ5|A0A0R3PFZ5_ANGCS Uncharacterized protein OS=Angiostrongylus costaricensis PE=4 SV=1
2012 -KFTQYVGNIVVLAFLNcfatitktvsdtsitvhvdqiqihcdihtsfqcsrekgtsfeqgldfdkTF---IKRLLGLFRLLCFKSALSREMFQKMSIVegfrtNQCCDLNMHAK---------------------arcmDIGGSHV---QMneecCGALWDQLGECLAEVITKVDCVRSKRECTKAWIMLISYVVGGMSLGN-
2013 >ERR1719414_1806988
2014 ---TVAQAEKVVAQWDAAD--QDAFIVAMYQAMMKTHPEWRALFNKPTGAptPAEAEWKKQFDLTKAVLDRglrsRATDVDALKERMHAMAGRHV-NYGVTQTHFQALKPILTDVLAATVT----GADMDAWSAVTYFMLDS------
2015 >SRR4051794_33648798
2016 ---------------------DHGSTN-ASTRALAARPTMSAKFGRAT--------AARARHLTRAIQDLVEFREDDgASRFRlHHVPAHA-GMGITREDAEAIRREFVAEVIATFERsggNvSPQMHGDAWNAVSRRRVERCVE---
2017 >tr|A5L2R3|A5L2R3_VIBBS Uncharacterized protein OS=Vibrionales bacterium (strain SWAT-3) GN=VSWAT3_02206 PE=4 SV=1
2018 ----------------------QAFLESFLADFCQHNPRFSERFEKVG-------LEQQTKMLKASIILIYNSAGLPsvRNSVKRLGKQHK-DLGmdISEQELNEWFKSLLNTVKKYD-PHYNDQVEQAWTETLDVGLKIMKQ---
2019 >APIni6443716594_1056825.scaffolds.fasta_scaffold2871162_1 # 2 # 304 # 1 # ID=2871162_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.617
2020 ----------------------QEFLETFLADFCEHNPRFSERFESIG-------LEQQTKMLKASIILIYNSSGLSsvRNSVKRLGKRHK-DLGmdISEQELNEWFNSLLNTVKKYD-PHYNEQVEQAWAEMLDAGLKIMKQ---
2021 >tr|A0A0N5DFM9|A0A0N5DFM9_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=4 SV=1
2022 -LLSPAQIKLIRNHWNGLYItiGPTAIGNYLFNRIVFKNPQSRKMLLSLlvDHLSPGYFSKRHARAIGVILNFVMKNLEYPENIsliLKMVGHCHAKlvTVGLDSSIWNVFAEALLECSLEWGeKSRRVDEVRKAWAIIIAFITEKLKAGFN
2023 >tr|A0A183IST0|A0A183IST0_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
2024 -QLNDKDITLIAESWRKIED-RSLWAQRLFAKLFVYRPQLASIMSYQDVSgkklLSNPKFQNFCQRFADFWQDVVSGLCDRgtdddwKqvvALIRELGARHSRipKITFEASIWLHMKSEIVQSIT-GFKDIYRDELCYSWNKLLMFVVTEMKDAF-
2025 >UPI0002C4E217 status=active
2026 -------------------------HEDFGTAFFEYCPDLKGQFPSN--------YALVTKMIQKFINNVIEG-KNLERLARHYGRTHW-RYDLEERHFLGFAEALADTINIRIGNFGTIELMKIWREEATMICKMLEDQY-
2027 >SRR5262245_41417288
2028 ----------------------GNLHARIYEAFFAACPEAKPLFDNTD-------LKRQYQLLHQAIVLMLAFHVSPNreepTILSRVAARHS-ELGvhIPPAWFDAFSAAIQQSLEAA-DTQFSDKTREAWAAVLADGIGYMQ----
2029 >tr|A0A0K0EPG4|A0A0K0EPG4_STRER Uncharacterized protein OS=Strongyloides stercoralis PE=4 SV=1
2030 -GLSFYQQKLILQCWPNIYTtgVGSNFASNIYPTLCCKNSKAKALLQQADGVavFSNSgvdCTTMHSKLTLEIMDSIIKNLDSnPQPIISYLQDTgysHKnlKIQGMNMSMWDDLGDSILEGVRKNELVRKHKELRRAWLAIIAFLIDNLKQG--
2031 >ERR1719183_3286062
2032 -------AISLRDSWVHIEVlkeedDSGGFGDALIFQLSVVA---QEIFGLVVT-----ERNALGKIFNRMFSTLVHAMGDPQKFTEeffVLSSRHG-RYGVQEHLFPLFQQSIMVTLRSLIPQVWNDTLEDAWSWFYLFCQDSMVRNF-
2033 >tr|A0A0V1BAT0|A0A0V1BAT0_TRISP Globin-like host-protective antigen OS=Trichinella spiralis OX=6334 GN=T01_2203 PE=3 SV=1
2034 -----------------------ENGGQLLANVFKANPELRKFYDVEDIDpddtKKSRLIQQAGGNLLNSVTFMVNNYDNERSFKQEIKEQicdLR-EKGMKLEDARKLKTGFVNYVKSKLSQPMTAKEEKEWDMFFQRFFDALK----
2035 >tr|A0A2E3CX61|A0A2E3CX61_9GAMM Uncharacterized protein OS=Pseudomonadales bacterium GN=CMK89_07570 PE=4 SV=1
2036 -------SDLLNLSLEQIASAIGDPTEPVFTLLYQRHPELAAF-SREDTS-------WQHYMIQEILQNLMEMAENPDTALAIIRDMtlhHQ-MIGLEADTFKGMYRTLHDVVVQHLSGPHREDMTALWEDSVQRICRSVD----
2037 >tr|A0A2G6L250|A0A2G6L250_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium GN=CSA49_02275 PE=4 SV=1
2038 -------TELINLSLEQTVETLGDPVEKIYERMYQRFPDLVSY-KEENED-------WENYMFEEIITNFMSFGDDPETALLTIREMvvhHE-LIGVPREAFKGMYDTLYEVITATFHGPQESEMKAVWQEIVAKIYDCIE----
2039 >SoimicmetaTmtLAA_FD_contig_31_10253239_length_247_multi_1_in_0_out_0_1 # 3 # 245 # -1 # ID=589621_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.671
2040 -GLSEYERGLVVNSWKALTKPdfspldGTSSLSNFYDAVWTKWlkidEFANKMFRSR-------GFKGRVQHLLRIMGVIIKCAEDPLRGLeqlRSIGVQHC-IWGINSQSFASLALSIIHGLDQANGKEINAELKELWLAL-------------
2041 >14BtaG_2_1085337.scaffolds.fasta_scaffold158720_1 # 2 # 106 # 1 # ID=158720_1;partial=10;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.467
2042 -GLNDAEIESIKASWKTITNTastngGDTMIVKFYDTVWNRWtkldEVANQMFQSR-------GFKGRAQHLMRIIAILIKFLDDPS-TLtqiKNLGVQHC-VWKINTESFSALAV--------------------------------------
2043 >tr|A0A2T7PRA6|A0A2T7PRA6_POMCA Uncharacterized protein OS=Pomacea canaliculata OX=400727 GN=C0Q70_02930 PE=4 SV=1
2044 ---EPHDKTIVAESWKLLRSIFPDLIESAFVEMCRRVPRLKLQFGNVDVDDDEerhMNFLKHVWDVSFFFDQLLLYLPfksKLEECSFHIGLVHA-SVEVPAWYVDLFLVEFIRAAQETVQLEWTPAMENAWAVFLRYLCYYMKDA--
2045 >tr|A0A183IYP9|A0A183IYP9_9BILA Uncharacterized protein OS=Soboliphyme baturini OX=241478 PE=3 SV=1
2046 -----------------------------TLGLFTSSPEIRSLFPTLvDWgddIKTCQKFRNQGLKFVHVISLSLTTLHDKehlDTLLKEIGTRHVEfmPGGIKMEYWDIFEKAMVKCILQQI--RWTDDfdeaiqskAAIAWRILCAYIVQKIKIGF-
2047 >ERR1719183_316154
2048 ---EPEVSAATKRGWRAWVAdmfaRGIPAGEALYQTIMDDAPSLKHLFTKP--------KPVQAMRFRTVLSSLVQTCDDPERLRvqtETLGYQHL-NLEITVDRAELFRDTIYDFIQMDFGNR-------------------------
2049 >SRR5947209_7523480
2050 ------------------------IAKAFVDQLAHVFPPICAMLPMAT--------KTARYQTACAIAAACKHAHDLGAIAPMIAATgadLS-RHGFTAEHLPAARAAFLNALRKCAGEDWTTVVEKDWNEVISEFAGH------
2051 >ERR1051326_6499376
2052 ------------------------IAKAFVDQLAHVFPPVKGMLPMAT--------KTARYQTACAIAAVCKHASNLNDIAPMIAATgadLS-RRGFTSEHLPAARAAFLNALRKCAGEDWTSVVDTDWNAVISEFAGH------
2053 >tr|A0A1A0K7B8|A0A1A0K7B8_9CORY Uncharacterized protein OS=Corynebacterium sp. EPI-003-04-2554_SCH2473622 GN=A5774_01015 PE=4 SV=1
2054 ---------DLASLATHLRAHPATFRDAVHRHFFAALPDARQSFPMD--------ASQAHRGLAESFAAAFDAP-DLDEYFADLGRSHR-RHGFPPDTYPIFATATRQALAEID---LADNVLQQAGALVDDIVAFMSTA--
2055 >tr|A0A127NUX4|A0A127NUX4_9CORY Oxidoreductase FAD-binding domain protein OS=Corynebacterium simulans GN=WM42_1693 PE=4 SV=1
2056 ----------MKELGEHIRRHADDYRDAVHQHFFATVAESRQIFALS--------MRDTHPALAPAVAWILDAADdagflpeETIERVRELGKEHR-RHGFPTEIYPKFEASLNEGFIALG---LTQHQLVVAKRAVHTVCTTMAQA--
2057 >tr|A0A0F6QY96|A0A0F6QY96_9CORY Oxidoreductase FAD-binding domain OS=Corynebacterium camporealensis GN=UL81_10405 PE=4 SV=1
2058 ----------MKELADHLRRHANEYRDAVHQHFFNTVLESRQIFSLQ--------MRHTHVELAPALAWAFDRAQrdgtltpELEEQLTQLGRDHR-RHGFPPEIYTDFANSLIAGFDALG---LTPYQRQVASHAVTEIANVMANA--
2059 >tr|U3GX34|U3GX34_9CORY Uncharacterized protein OS=Corynebacterium argentoratense DSM 44202 GN=CARG_08960 PE=4 SV=1
2060 ---------TLADTLRAEPKRLSHFGDLAHSALLRRAP---GLISFF--------GPNPHTELTTAVLFILTHSTpgpqdsgtqtPLspridaagAGALRALATEHV-AYMPPdPALYLAAADALCEALRDSCA-DQPFQQVLAAEKALREACSLMATH--
2061 >tr|T1FHE7|T1FHE7_HELRO Uncharacterized protein OS=Helobdella robusta OX=6412 GN=20208246 PE=3 SV=1
2062 ------------------------------GTLLQSNPLVKNTFEKFRQmDpmsdfTDSSVFSTHAMVVMSAFEDIFDNLDDSEIVKDILEQgkSHG-KFseDFAPETFWAIEEPFMSSMKDILGRKMSSQLEKIYKKTIKFILSVLIKGLR
2063 >tr|A0A0N4WD13|A0A0N4WD13_HAEPC Uncharacterized protein OS=Haemonchus placei PE=3 SV=1
2064 -CLTPAQILLIRRTWTHARNQGaLEPAISIFREFWKNLNFLQ-FQKLKKSRKCSESFQRHAQIFTTIMDELIANLDNPTATSPSLREsgeKHVFqtrdQYGCpfRATLLDQFASAMIErTLEWGEKKDRTEVTQTGWTKIVLFVVEQIKEGFH
2065 >tr|A0A2R8AKY2|A0A2R8AKY2_9RHOB Uncharacterized protein OS=Aliiroseovarius pelagivivens OX=1639690 GN=ALP8811_01706 PE=4 SV=1
2066 ------------HSLDLLVGQEDAFAHAFFPLLFARAPELRVLFGDNiDD------PTQQVRVLYRMMMAFA---GNDVTLIaglRLIGFRLA-MRGLGADQAELMANTLIGTLKRQLGNSWQSDFAFAWRIE-------------
2067 >tr|E1NZ07|E1NZ07_CAEEL GLoBin related OS=Caenorhabditis elegans OX=6239 GN=glb-29 PE=4 SV=1
2068 -NLSVKQKKLLRQSFNAMNSGGtfLKLMEKIFRRLETKCPDMRSIFLTTAFvnslSreRQTPplvkTEYDHCKCMVGIFERLIENLENINEQLTMirhYGEKHAQmaESGFTGAMIEQFGEISVFVIGSQDVVKFNHETVKAWRLLLACVTDEMKVGFD
2069 >tr|A0A0C2G6K1|A0A0C2G6K1_9BILA Globin OS=Ancylostoma duodenale GN=ANCDUO_17195 PE=4 SV=1
2070 --LSYKHRKLLRATFQQMNSSGafLKLMEQVFRRLEAKYPDIRSIFLTTAFvnslSreRSSPPlvrtEHDHCKCLVALFEKIMDNLSDDTQLmvIRQYGEKHAQmkESGMSGGMIESFGEIAVAVIASQYSYWIQKPVDDVTrrkgrDEGLVYLNDYEYIIL-
2071 >tr|A0A0G4HY87|A0A0G4HY87_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_33490 PE=3 SV=1
2072 ----SNRIHLLQSSLAACLKMstkEEFVGRLMYDTLMRTLPEPGIIAKRGR--------TMMSRAFNDTVAALVAFVSEPshmETYMDWLALRHV-HYKIDTTLFPQFRQAMLVSLEQVMADQWNAEIERAWSEAYEMTSQAL-----
2073 >SRR5262245_61346593
2074 ----DCLRRGLESDFKALV--DESFAASFYKRLFQSRPLLEGRFHN---------LQTQERMLAENLRDLVEFH--PEESagrFLDHVNRHK-PRGITAEDILAFRAAFVAEIVQQGskllAQKIPpGARADAWNA--------------
2075 >SRR4051794_17889687
2076 ----DSLRDAIIDSFSLVS--DERFGLRFYESLQS--HHVGGRFKD---------INEQHRKFIKELRSFVDSE--PPAGlaLRIIAGRHR-PYKLS-----------------------------------------------
2077 >tr|A0A0K0FHQ3|A0A0K0FHQ3_9BILA Uncharacterized protein OS=Strongyloides venezuelensis OX=75913 PE=4 SV=1
2078 -NLTASQIMSIKRSWKHINTKGlFNVLRRCYQRCECCSLAVSMIFSAEQMKkqqhAYSCGVSEHSKYFISLLDRIIDNEPNIEQELRNVGKEHVKlyeEYKLGTADIERLGEIIADVFLKLDGIRQNKETSKSWRILIASIIDEVSVGY-
2079 >SRR5699024_10156350
2080 ----------------------PRFPALFARALRAADPDFRGMFPRD--------PAPVLAEFVRAMTFVLETTeaaaAATartDevvELARPLGADHR-ERDLPPSNRVPTGDARAATLPPLAGSGWTEAPETTLSTAYRVVSTALQ----
2081 >tr|A0A1F2EUM8|A0A1F2EUM8_9CORY Uncharacterized protein OS=Corynebacterium sp. HMSC11E11 GN=HMPREF3121_11375 PE=4 SV=1
2082 ----------------------PTIGPEAFRRLLDAEPRFRHMFGGS--------KTALRDQFMSALSTALVTRadvgRFPaa-tiRRLEQLARENR-KFGVAPRDYATLAEHLLDVFGERLPAGPDSGAQVDALREILDEA-MSL----
2083 >tr|A0A0C2M2P6|A0A0C2M2P6_THEKT Uncharacterized protein OS=Thelohanellus kitauei OX=669202 GN=RF11_12769 PE=3 SV=1
2084 --LTLEERLKLKESWIKIYQKIqdlpdVDITFEIFVRLMERRPEMSKNFEKD-VY-KYSRMKSHSDKMLVILNNMIRNLDDEQKMLKYLSgmvRRHR-NYGIRQGDCKMWEEIFLDIISRY-----------------------------
2085 >tr|Q5D2M7|Q5D2M7_9TREM Myoglobin 1 OS=Paragonimus westermani OX=34504 GN=myo1 PE=2 SV=1
2086 -PLTQAEVDGVVSELNPFLasdAKKVELGLGAYKALLTAKPEYIQLFSKLHgLTidnvFQSEGIKYYARTLVEDLVKMLTAAAKDDELQKVlvhSGHQHT-TRKVTKQQFLSGEPIFIDFFNKTLSK---PENKAAMEKFLKHAFP-------
2087 >tr|A0A1S8X4B3|A0A1S8X4B3_9TREM Globin OS=Opisthorchis viverrini OX=6198 GN=X801_02811 PE=3 SV=1
2088 -PLTQSQIAGIHKELLPILsndEAKTSFGVGAYKAFLGAHPEYIQYFSKLNgLTidnvFESEGIKYYGRTLVDEIVKMLTAGADDEKLKQVlhdSGKAHT-ARNIDNATFMvsklfmflkrvsemrlarglygpfpifaqSGLPVFVDYFNKSLTV---PENQTAMEAFLNHVFP-------
2089 >tr|A0A1I8C1X6|A0A1I8C1X6_MELHA Uncharacterized protein OS=Meloidogyne hapla OX=6305 PE=3 SV=1
2090 -DLSPHQIGLIKRAWKNLLKSvnENEIAIKLLLRIFQLDPRNLAYFSLNEYspfdeylIKENNIFINHVKTFESTLINVMTHPGNATKLskhLQQLGGRHV-NYtGVTykCSYWKCFIQSLIDVLTLNKDKNTSEDLHEAILILGEFCVEQMKIGYK
2091 >ERR550539_1089662
2092 ---------AATASWNNIDD-KPAFGKAFFKNWLSSNPAIEEEFAKSSFK------QGPAQFLVERFDILLGVIEDEDSLAEELYqvaKTHK-KVGVDQSDLYSFQASFMKTLPSFD-ADFSAETGNAWAYVLSHVI--------
2093 >ERR1719210_3079978
2094 ---------QPKRVGRTLT--KQLSEKLFFQNWLDSEPDVAEIFKKSSFP------QGPAQFLVERFDILLDVIDDEVALSKELYvvaKTHM-DRGVSPDDLVTFQDSFLKTLPSFD-SEWTRDRSESWAYVLSHVI--------
2095 >tr|A0A1G0FYS6|A0A1G0FYS6_9GAMM Uncharacterized protein OS=Gammaproteobacteria bacterium RBG_16_51_14 OX=1798265 GN=A2W28_07810 PE=4 SV=1
2096 ---------LFNNSFQRAIiPDSNSFYKRFYEIFVGSDPRIAELFEKTF-------MNLQREMLKQSMTYMMSFSatLEPSDEMKELAEMHGRgKLNIPANLYEIWLESMIKTVEEFD-PKFDENIEIAWRVMMAPGVAYMQS---
2097 >SRR3989338_9975634
2098 -TIDHRSVQLIKQSAGAIKGQAQAINRLVYEQLRRDHPAAYSLLQQAGL-------P----PLASIVANYAAGIDNLEVFLghaPKIALTHQ-RIDLQEVHFESVASSLFLAFRQALDPDaLSDEALLAWRRAYDH----------
2099 >tr|A0A2A6RLC4|A0A2A6RLC4_9CHLR Globin-coupled histidine kinase OS=Chloroflexi bacterium Kir15-3F OX=2024553 GN=CJ255_07345 PE=4 SV=1
2100 MGLRAEDGATLKALAPKAEAYGPTLTKTFYDRLFA-HANTAEYLQGVD-------MQRLHSMVQTWFMGMFAGVYDRDYArqRLHIGEVHV-KVGLPVRYPLAMIDVVMSFGDQIANESSePAVALAAFQKVLSLDIAIFNQAY-
2101 >tr|A0A085M5J8|A0A085M5J8_9BILA Uncharacterized protein OS=Trichuris suis OX=68888 GN=M513_06691 PE=3 SV=1
2102 -CLTKRQRRCILKSWRKVQ-NKAQLGEEIYIQIFMQKPVLKSLFPFRATPvnelHDNVLFTRQAVIFIDFIDNVVAYVGINngrllQELCTRVGISHAlmTRVNFDPEWWYLFANSVLDGMQKFCLPNFSCepiatyigsQSMLAWRILLKHVVEMMSDAF-
2103 >SRR5215470_9720857
2104 ---------EAKRSYRQFAR-DISFYRELSKRLFRKIPGIEKKFRHR-------TMEEQYKVLRDSLWLLLSYASAPDqqepTILSRIAHTYA---RFPKEWFDTFREVILDVVAQRDP-----SSVRAWKHAMAPGLE-------
2105 >SRR4051812_31756681
2106 ---APSVMRLLASCTADLGPQQPELAEALYQRLLELLPEVatlAE------------RGRPLSDRILHAVLYPTEPGrt--PLNVatvVQQVGAQNY-LDGLVGEHYSSVTHAVLHAAREMYRGEWSSALSSAWVEYLLWLRGHLLAG--
2107 >ETNmetMinimDraft_22_1059887.scaffolds.fasta_scaffold1682169_1 # 3 # 206 # -1 # ID=1682169_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.363
2108 ---------TVFSQWRRMK--IEDFGECMYRSL-VQDASLEKLFRRE-------RMRTQSLLFAAFIQVALCWLEERDfrkveRDMISLGLRHR-SYGIQPSYVCVFQIALLQTLCQNLNG-LSLQAEISWSVVWSH----------
2109 >tr|A0A0A9Z6R2|A0A0A9Z6R2_LYGHE Neuroglobin OS=Lygus hesperus OX=30085 GN=NGB PE=4 SV=1
2110 --LEEDEIERIKKSWVLVKENDFRFIDILRQEMLCDIMMYELYFNPGrkaDVcVSELTEFKNHPKNVYSTLDFIVGDLENENVIIEkmiEIGKNHG-RLGISRKHISFMTSTIYQAVECTIGPCmFDRLVDQSWEKFLTSFN--------
2111 >tr|A0A0N5AZ47|A0A0N5AZ47_9BILA Uncharacterized protein OS=Syphacia muris OX=451379 PE=4 SV=1
2112 -PISYKNRQLVQSCFRNP---HELLGKRILKKTRDKKPDFDLFLSKLDGK----QRDELEESIKVLLKKVVANIDFIDEVqrlGEEFGANHVqfRKEGFKPEFFGIYADAAVTEctfLDSA--VHPPHQTLDAFSSFISWIFSFVRDGYY
2113 >tr|A0A158N7T9|A0A158N7T9_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=3 SV=1
2114 -GLTAQQKAILATMWRQLPRGvIFDLGKRVFEIIFERDPKLLMIINLEHLQntnqwQEHVNFRMHAQRFTHALSQSMRNLTEPIIAADRLqefGASYVNqenitygslNVVIPHSYWDRLSAAITTTAQEFLNKqqlktskqtltvdnvlllenerrnsrnlfSQVSANINAWSILAQFIANQIRFGYE
2115 >tr|A0A1I7RRX1|A0A1I7RRX1_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1
2116 -GLTSTQKKLVQAKWMEMDGVgILDMGRNVFETLFRREPACLKAIGLGHLThgrnlewRYHVNYRQHVKRFCEAFNEVIRSFEHPRTSIDQLqelGALHANtylkaseERKVPSNYWDGLVFAINYAAKDLQVEsssrgsespsnvifdrrfllpsddlgsstppsptqfsslcvtpqrrsgSVCPRVAEAWNLLAIYAVSQMKFGYE
2117 >tr|A0A0B2V954|A0A0B2V954_TOXCA Uncharacterized protein OS=Toxocara canis OX=6265 GN=Tcan_09629 PE=3 SV=1
2118 -GLSMHQKMIVTAKWRQLPQGfVFDLGKRIFETVFERDPYLLSIISLEHLQgsdewRDHANFHLHAQRFSHVLSQCMRHLSEPIVAADRLqefGAAYAEvedsenfvRSRIPHSYWDRLITAITSTAKELHEDqpqqvrknslsvddallakkdrlalETDSTNACAWNALATFVSNQIRFGYE
2119 >tr|A0A1I7RN92|A0A1I7RN92_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus OX=6326 PE=3 SV=1
2120 -GLTDDQCEQLATAFSNIPDKYYAFEQMFLNLFMKEDPQLAVVFGFEGIRpeelRRMSPFRTHVCKFQRFMTTVLDMLPKknrEEELiqiIRMVGRQHCNvkLLSFTAQKWLSFKNGMLNALAKG---GESHKYYSSWNILISFMISEMKDAY-
2121 >tr|A0A183BTK8|A0A183BTK8_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=4 SV=1
2122 -QLDDTECEQLSTVFAAMPDKYHLFEACLRPMPMPeVDPQIALTFGMANIAeielRRKTPFRYSV--------------QKrgrEEELvqiIRMVGRQHCQvkQLSFTAARWLSFKSALTWTFSRG---EQKDKLHVQWSLLISFLICEIKDAY-
2123 >tr|A0A1I7ZF06|A0A1I7ZF06_9BILA Uncharacterized protein OS=Steinernema glaseri OX=37863 PE=4 SV=1
2124 -QLDEEQIDTIVDAFAKVSDKYGAFERVFVQLFVYEDKEIAEQFGLASVPeeviKRNQVFRTHVGKFQRFMTTVVELLPKvgrEDELieiLRIVGRQHCNvkQMNFTAAKWLSFKNVLLSVLCKN---DHHDKVYMCWNQLLSFLIYEIKDAY-
2125 >tr|A0A2V7AV10|A0A2V7AV10_9BACT Uncharacterized protein OS=Candidatus Rokubacteria bacterium OX=2053607 GN=DMD92_03445 PE=4 SV=1
2126 -GLGEADVAVIRRTAPIVLTCEAAVTDALYAHFL-QFPATAQFFLGEDGEPDAARLARRKHTLGRWLRETAAVATTHEFSyyLLAVGLShsHRAhgPGGAVPPHFVVGamslaQTALARLFGAELGDpQAALEASLAWNKLLHVHLAVLLLGY-
2127 >tr|A0A2E9LM24|A0A2E9LM24_9CHLR Uncharacterized protein OS=Dehalococcoidia bacterium OX=2026734 GN=CL902_07715 PE=4 SV=1
2128 -GLGQNELDIIESTRELVLSKGEEITAEVYDHFL-RFQETRRFFLNEEKAVDDDRLERRKHSLLRWLRGSLDFKIDEDYPvrLLATGIVhsHPPshraHMGSVPGRFMIGsmsylQTLLAEIFHSEIEDrEEAHRASVAWNKMLMVQLDILQAGY-
2129 >tr|W4MD58|W4MD58_9BACT Uncharacterized protein OS=Candidatus Entotheonella gemina OX=1429439 GN=ETSY2_07185 PE=4 SV=1
2130 -GLSDDERQLIKDSGPIVLGHVRKLTEGIYDQLL-AYPESAQFFTTENGQRDEKRIEDNIQTMISWFRAAVTAPTNQGFIryLVGISQMhaNIPvhrsNNTPVAPRYVIGtisyyQTNLDDILHQHMADpDLARRTCVAWNKWLLVILELMLANY-
2131 >tr|A0A0N5DD39|A0A0N5DD39_TRIMR Uncharacterized protein OS=Trichuris muris OX=70415 PE=3 SV=1
2132 -NLTPHQKQLLVQSWPQVQLYnRIHGGDAMFARFCEKNSIARETFQKIAVvqSfasneASESVLKKHEQYLVQLLSEAVENLNNDcEPLLREcldYGAQHV-TLheLLNETVWEQLAEAIIDRIHKVNLVRRHKDLSKAWTMLIILLIDKIREGY-
2133 >tr|W8BTT7|W8BTT7_CERCA Uncharacterized protein OS=Ceratitis capitata PE=2 SV=1
2134 -GLTITERRSLQNGWSIIKQKQRRAALTIYVNLFTEHENLYEVFRSDGVL-NIEFASQHQKEVLTVFQMIIEQVDNARfvkTMLKELALRHE-AASVTNTQWQLYtnevRKYFLETLADAIS----PTFVHALDKLMNFVCN-------
2135 >tr|A0A0A1X397|A0A0A1X397_ZEUCU Globin, monomeric component M-IV OS=Zeugodacus cucurbitae GN=GLB4_1 PE=3 SV=1
2136 -GLTSTERKSLQNGWTIIKQKQRRAALNIYVNFFTGHEDLYEIFRFNGTL-DIGFASQHQKDVLTVFQMIFEQLDNARfvkTMMKELALRHQ-ASAVTNTMWQLYanevKHYFLKTLNDALS----PTFVHALETLINYICD-------
2137 >SRR5438046_775397
2138 --VSRETTALARASFERCSA-NGEVPQAFYRNFFARCPPAPALFAPGL-------AAGLAArLLSApaaaeqIFLFTLVAGGTPRTrl-LPP----MSrGX---------------------------------------------------
2139 >AACY02.8.fsa_nt_gi|132068355|gb|AACY021643300.1|_1 # 2 # 748 # -1 # ID=15695_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.288
2140 ---------------------------------------------------------------LKG--------SFHFHLlgeLENLDFEFK-FLASWFSEVDIFRDALIDLFEMEMNDqSLTPQGRHVMALLINYVG--------
2141 >ERR1719424_2066333
2142 -------------------------------------P-------------------------------KASTWLRPCTVhllVQSTRQQHL-VSAI-------------SCTTSRRV---------------------------
2143 >tr|A0A0G4HHE4|A0A0G4HHE4_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_6802 PE=3 SV=1
2144 --------------------------DALLGILFEASPTMRSVFVKNGD--------LYADLIEHLLRRIIAYADDPGALWTddqHLALDHI-NFGMSMSDLPLFGASLMNCLAGVLGENWCDEWQRAWEKAWQICCQSL-----
2145 >tr|A0A2C9LD65|A0A2C9LD65_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106067556 PE=3 SV=1
2146 --LSHKDKLFILNSWLNFRNgkREEDIGMEAALEMYSIYPEIKDIFTIYrDARmkhlTDKEMIRTHSQQVASVVDKCVMRMDDAHAFAMiavDEGSVHI---KIQERFMRCYVDCYIREIKKYSKLKWSRANQMAWEVFFDTIVVNMKNGW-
2147 >SRR4030095_5973293
2148 -----DHFEIAKDSYARCISggdSGNSFFKTFYHELTRISPEAAVKFKGKgiGET----ETNRQYGILREAIFILLMFGENklgenEPNILSRIAEMHNKnHYNISPESYKSFVSALTATICGSAPDipePFDPqckisvneknLIKIAWQKALKPGIDYMIMRYP
2149 >SRR6478736_3613867
2150 -----DSFEIAKDSYNRCISgedSGDIFFKTFYNRLVKKLPKD-vaAQLKGKgiGRS----KGHRQYAILREAVFILLQFGQNrlgenEPNILSRIAQMHNKaNYNISPQLYTVFVDALIDTISGLPPDipkPFDSqcsisvyereIIRNAWSEALSPGITYMKDKYX
2151 >SRR5262245_45185474
2152 ----------------------PTFLEAFYKLFTA-DEVVGKRF--VkfDDI----EWKRQHGLLQQALDACFDFASLlsmqnlrelpEPNAMTKYVVRHGPgrgNLGITSTEYDAFVEALITTVCGNPGNgqaPYDPecadaerkdVIEFAWRRLMKLIVEHFKKVAR
2153 >ERR1712142_1087278
2154 -ALTETEVKVIIDSWDRIHPDK--GAKMLFHQFLTDFPLMKIYFGYQETesvaeIMESEQIKTRCKVVWDVLTKIVHASGDGGKLaelVKEVSVKHL-NFNREKKDIHCFLHALKVTLTC-----FSGHLFRPWNIWCKMVEDLF-----
2155 >ERR1719263_534529
2156 ---------------------KRTYGLNAFNRFFAKQKKAEDHFNTSN--------ARLSVLAMQGLNLCQDIYKEPTRLvnvVTSLGLKHI-MYNISTEYFDAFVEAMCEELSDWHPGN--QAAVEGVEWALTQIAAIMI----
2157 >ERR1719446_598571
2158 ---------------------KKAYGLNAFNRFFCKAATIGNSFQHIQ--------CASVCSgnarSPAVSGYLQGAYTLGECGhltWPQTHHVQH-FYRLLX----------------------------------------------
2159 >ERR1719446_1691251
2160 ---------------------KKAYGLNAFNRFFAKQKKAEDHFNTSN--------ARLSVLAMQGLQLCQDIYKEPTRLvnvVTSLGLKHI-MFNISTEYFDAFVEAQCEELAEWHPGN--QSAIEGVEWALTQIAAIMI----
2161 >tr|A0A182EAA6|A0A182EAA6_ONCOC Uncharacterized protein OS=Onchocerca ochengi PE=4 SV=1
2162 ------------------mgsgssvpnhgqprnvaggggndgggggnagvengdqqkvdprlpypnfrelftlknywktvRRNERDCAKMMLAKNYLKNYGYSLGII-------------------------------------------------------------------------------------------------
2163 >tr|S9VAV3|S9VAV3_9TRYP Uncharacterized protein OS=Angomonas deanei GN=AGDE_12480 PE=4 SV=1
2164 -------------AWSHLLtsPNGGEFCSTLYEKLCQNLTYIPDYIRNLK------DEERVIDHYINVITKTLELYENPHVMIdelPKIAARHR-GFGVSSDAFFVMRNIFMELLPEYMDPKVYEQSKKDWLKFWRLVLDLMVSG--
2165 >tr|A0A2K6VLK5|A0A2K6VLK5_ONCVO Uncharacterized protein OS=Onchocerca volvulus OX=6282 PE=4 SV=1
2166 -NLTTTQLLLVRKTWNHAKNQGaLEPALGIFRNSFYKCGEIRSLIMGGPKNVGYERLKKHAKSFTNIMDSLITGLDAKESVIEELRKagrAHATllrdtsnkfgnksntqliGCPFRLAHFDHFASAMIERtLEWGEKKDRNKTTQTGWTKIVLFIVEQLREGYQ
2167 >SRR4051812_9951159
2168 -PLPPEVAQTIRSSCRPLLERQEQFHGDFHASLVDLMPEVPMMREPA--------GEQVSRWLVECVLWAVNADEPVPMIGATLqgvGLDAH-RLGFPRAGYQAVGHALLRTVRGASQNDWSGTLSSSWIGYHSWLCEYWVS---
2169 >SRR5690242_179091
2170 -PLPPEVAQVIRSSCRPLLERQEQFHGEFHASMVDLMPEVPMMREPA--------GEQVSRWLVECVLWAVNADEPLPMIGATLqgvGLDAH-RLGFPRSGYQAVGHALLRTVRGAYQSDWSGTLSSSWIGYHTWLCEYWVS---
2171 >ERR1711972_144950
2172 --------SQVLQSWEQVKLLgLESVGEMLRANTFELDPQVVALFRIPGVVSTGEGMLqrmalrRLFSKVLRFVGSVVAG----------------------RYDYQRLVETLsrLGATRAAGGATEVHFKI-------------------
2173 >tr|A0A1I8CIB1|A0A1I8CIB1_9BILA Uncharacterized protein OS=Rhabditophanes sp. KR3021 OX=114890 PE=4 SV=1
2174 ---------------------------LLLVRTFELDPKQKHNFNLDKVDiedlRIHPIFVDYVKSFQPLLLNVFKYTNRATIMskyLQQMGGKLMRytKVSYKSSYWKVFEQALIDVVS---GGNAGDETIEALTILANFCSEQMRIGFR
2175 >tr|K7H1D4|K7H1D4_CAEJA Uncharacterized protein OS=Caenorhabditis japonica PE=4 SV=1
2176 ----MDGEYLLFANCPAPGIgDGNDFLYHNGVGLESNCPIVSQCFQSATYSlstnpNQVRTVADHAKYLLQLLDKIIEGDVDAEY-LREIGANHVslkHENGFSNTEWDRFQEIMVEVILKQDGVKQSKETSRAWRLLICSFIELIRDGF-
2177 >tr|A0A0D8XGR1|A0A0D8XGR1_DICVI Uncharacterized protein OS=Dictyocaulus viviparus OX=29172 GN=DICVIV_11062 PE=4 SV=1
2178 ---------RIQHCFKAA---RPTIGEAILKRASNNRCEMRILMSRLTD----QQIELMGKQFYMLIAYSVENIERVEMIQQharTLGETyaaLC-RLGFRPDYFTSLADAAIAECVKLDGGtHKstyffnRCETLLAWSQLIGTIFTSVRDGY-
2179 >tr|A0A0N4UGY4|A0A0N4UGY4_DRAME Uncharacterized protein OS=Dracunculus medinensis OX=318479 PE=4 SV=1
2180 MRLSDKQKLWIKLGYKKWRSKsKMVPGEWVHAYAIKKYPTMKALFKKHEN-----LARVYTQTITKIIEMAVESVDSLDDsLGPLLisyasengilEERgmasiftirndkllLF-LEGFDRRFWGYVAEALCALSRDFPLKRHKWDTISAWRIIVLFIVKKLEYGF-
2181 >SRR2546421_6426420
2182 -------------------------------XMIRRPPRstlfPYTTLFRSD-------FERQNKLLRHAFGLLLIFPNQArtePSVLTRVAERHSRrDLDIPRSEEHTSElqsRSDLVCRLLLEKKK-KNQV--------------------
2183 >LakMenE18May11ns_1017337.scaffolds.fasta_scaffold18991_1 # 3 # 107 # -1 # ID=18991_1;partial=10;start_type=ATG;rbs_motif=None;rbs_spacer=None;gc_cont=0.400
2184 --------SLALASYNRCRDCHQEFIREFYDAFIEGLPEPYkEHFQNR---------QRQNTMLDSAIYLLFD-LEAPEnqKLLRSIftGSKTAGkpnpHPAYPIEWYERFLDTLVGQVSHMDRKNWNAEVEASWRNLRENALHLIR----
2185 >ERR1719262_958340
2186 ------QKEILDICYAKMTGelDLPAMVTMFQGIFFSRDLRIQSYFSKPNG--------TLRYIVLRIIEFLCNVFHKPAAItkeLRTLGVSHV-KWEIPPDLFVPLGEALF-----------------------------------
2187 >SRR5512142_1307926
2188 -GLTESDIETIKQSKPIIEKHIPEIVTKFYAHLLR-YPPTRRVFLKKDGSVDQPYVELRMRHLTNFWLRTATGVyDdDYARYIDYVGRAHT-SHGADPHIYIAeryvigqvgfVQHAITDALSRELRhtdEEFEVRAVEAWDKLMMVLLEMLSRAY-
2189 >ETNmetMinimDraft_9_1059917.scaffolds.fasta_scaffold1595668_1 # 1 # 216 # 1 # ID=1595668_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.366
2190 -GFTRADAEIIAQACPIIEEHLPNIVADFYDQLLR-YPPTRKVLLKPDGTIDQEHVEKRMLFQINFWLRSASGVyDdDYASYIDYVGRAHT-SHGADLNIYIAeryvigmvgfMQRAIDQALDSELHdadHTMEDRAEAAWGRLLMVILEMLSRAY-
2191 >ERR1712137_619303
2192 --LPRESITVIRDTWAMVER-NVDIAPKMLLKMFQLYPVTQNLIPLLrGVSledmPTNKRFLQLAYGSQFAMSAIVDKLHRPDMLEEIIGGGmHAFVDGLSTSFQmAATTALFNKIMTEELGSAYTAEAQEAFIATGDMMTSIMVK---
2193 >SRR5580704_4499342
2194 ------------------------TLGDFYRRLLQHHPQLAAYFEGVN-------IDFQVQKLVVVLSTIARDLPDRSVLdrvLFHQGVAHV-ERGIGRGEFNEFIALLANVVSCKTTLVGAAESYAVWYQELSAVATSML----
2195 >ETNmetMinimDraft_24_1059892.scaffolds.fasta_scaffold323471_1 # 1 # 354 # -1 # ID=323471_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.472
2196 ---------------------RKKVCTDLYFRLFDVVPASQDYFKQSNT-----RLHFIAELV---INMTLDMYQKPTKMMsqiSALGLRHV-ALNVPTDIFPAFIDVYITVVKEYTN---------------------------
2197 >tr|A0A1Q9F3K1|A0A1Q9F3K1_SYMMI Copper-exporting P-type ATPase A OS=Symbiodinium microadriaticum GN=copA PE=4 SV=1
2198 --LDEFTIKEVQNGWATTEKrlgGPKAAGEHVFGKLKKEVPRTEGMLKRSS--------TV------WHLFTElLQAIDQPKLVqkrLEYIALRHM-NADITTADIEVFRNILFEVCASKLGGLmtpefqYQAQYSFGMGQIIVAVGTS------
2199 >tr|A0A183BUR6|A0A183BUR6_GLOPA Uncharacterized protein OS=Globodera pallida OX=36090 PE=3 SV=1
2200 -GLSAHQIQILQKIWERSPESeISDCARNIMSHLLRSNAQMYQFFDLLGHsdreIANSPIFARQSANFAVLLDFVLANLLEeVQKVclaLQHLGAQHARlRWPIETHHWALFCRCFEDNPPKEV--FLNAEGHDLWKTMINFIIVQMRVGYD
2201 >SRR5690348_61285
2202 ----------------------------------------------------------------RATHWLLDHFDHPGEIVSVLVRYvpalDA-LTGPHSRQLELFGEQITQQVDDEA----------------------------
2203 >JI7StandDraft_1071085.scaffolds.fasta_scaffold2802978_1 # 2 # 235 # -1 # ID=2802978_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.607
2204 --LTPTTIRLLLATSDIVG--SKETADKFYNRLFLHSPELKELFVGGETTtTTSMGIGDQALKFSQMMQWTTRALQQmhlqqkqkqqpsrssggggggdacsngtaPTrrstsAVfrsMTNLGRRHV-RYGVQLKHFHPVKQALLDTIAEL-----------------------------
2205 >ERR1740139_220892
2206 ------TRAALLKSWEMVQEAGTVPAAnLLMKHLRERDAEALRVNTSHARPktgeTEEDAVRKLAVRTVQILGSAATGMSDTVSLVQHLHKVgagFA-GTGIKEGYFAMVRDASPFALRELLGDRFTADIASACRITGPFLASLIIAGLR
2207 >ERR1740139_941170
2208 ------TRAALLKSWEMVQEAGTISAAnLLMKHMREKDAEALRLNTSQARPktgeTEEDAVRKLAVRTVQILGSAATGMSDTVSLVQHLHKVgagFA-GTGIKEGYFAMVRDASPFVLRELLGDRFTADIESACRITGPFLASLIIAGFR
2209 >tr|E0VF27|E0VF27_PEDHC uncharacterized protein OS=Pediculus humanus subsp. corporis OX=121224 GN=8236389 PE=3 SV=1
2210 ---------VVLNDWPKIRKNYKKIFIDSFINYFAENPNYKLLFPSFsNVSeddlPFNHCFRLHCFAVYKAINFLMSNWlGeyeeDDSKILPVIGKTHF-DRGITLEMMNLYKHSIVYSCNNHLKPNL--KRKLSWQTVFDHIFD-------
2211 >SRR4030067_646800
2212 -AFTQADADAINESRFIIEKDIPEIVSKFYTQLLR-YPPTRKHFMRQDGTLDQEYLQLRMHHLTNFWRRTAYGeFDdNYARYVDYVGRART-SHAGDHRPGCgppagsrglrAGPGNAHLGREPRRGG-CESGGDRRWRKEDRP----------
2213 >ERR1719347_1935341
2214 -GLSQNEVTLIWSHWESLKPHKRRLAKRILKVYIKEHPRARELFPNWvDIPtvelVKLTSFSRKAVDTWEAFSRAWECIDDAPLcrkVCYAFGKKHIEcnarikgHGQIDEHHVKNFIRIFLRIILVSAR----EGSEEAWRKATEFFSINFVRG--
2215 >SRR5690625_6901273
2216 -------------------TPPETYTPSLHDAL----------PISA--------RASRHVDLTVAIAWALENPAPkVDALVAQLGRDHR-RLGFPPEVYDTFAQDRKSTR-----------LNSShVAISYAVFCLKKKT---
2217 >SRR5580692_4143848
2218 --SDSGIWPVIRQSAARLSRDEDAFIQELHYEITRLISDPAGAPAP--------DMWVFCERMVRSFLWVAL-TDQPlGVVADtlrKVGVHYW-VEGFPDTLYGEVTHAMVQTVHYLCAHDWSASMGSAWITYFMWIKPHLLAG--
2219 >SRR6266704_2516069
2220 --SDSGYD---APPAGALARDQGAFIRQLHYDVTSRIPESAVPPAF--------DMWGFCDRMAQTLLWVAL-TDQQpSLVTDtlrQLGAQNW-YEGFPDS---------------------------------------------
2221 >SRR5438132_1665678
2222 -------RSRVLASYSRVQSgdRARTLYQAFYQQLFRAVPDVEPLFARID-------MVRQYDALNKAIKLLLDYDPQSREstdDIRAVAVIVA-APVIVAVHLNVAApVTVIDKRKGCGS--FGTTVV----AVMGPGVGWGD----
2223 >SRR4051795_10036070
2224 -------RDQLFISYSHR---DESWLEEFATMLAPVQKSgslnIWSDKEiraGED-------WSAKiQEAMSRARIALLLVSPAFLAsdFIQKTELPKI-LSDHTCRGMHVywvlleqslTEWSPLSQLQAAHP--IKISlseisnvgerrnVIANICRQIANELGQYS----
2225 >ERR1712051_620824
2226 ------------EGWATMQDHILNYLSntMMLPFVMRCNKSILKYFVTYESNvsllkfEGSqglAslEKTKHGCWfLTEVLTKVIPNLECLDTCieyLKDLGQKHQ-TQGVRREHLDLLALVYVSAVKEVMA---------------------------
2227 >tr|A0A2C9LKZ0|A0A2C9LKZ0_BIOGL Uncharacterized protein OS=Biomphalaria glabrata OX=6526 GN=106051185 PE=4 SV=1
2228 --ISLADIKVITNQWEDVLRCSDLFGKLLVLYVLDNCPKVNALHPGLHArlTdARDSVEKQIGLRVIQSISCVIHNLNRAPAVESMVRDTfkkLQ-QHGYTKNTILECSEAFLSFMNQYFSKRWLKQHSDAWFKVLKALL--------
2229 >tr|A8WLI5|A8WLI5_CAEBR Protein CBG24801 OS=Caenorhabditis briggsae GN=CBG24801 PE=4 SV=1
2230 -------------WIFSFQLEG-SKSRTQIERILKKFKNKKKS---------------------------------------------------------------------------------------------------
2231 >tr|A0A1I7RWJ6|A0A1I7RWJ6_BURXY Uncharacterized protein OS=Bursaphelenchus xylophilus PE=4 SV=1
2232 -KLSKLQKRALRFTWHRLQTRnggkrVDNVFEDVYDRLMRLVPVMKEMFTTRAFlsamSkHEVATPRDHARFTVKMIDSVIKNLDTDEKkrtdtlseFDPVlIGRAHAvlRPYGFVASIWEKLGETIIDVVLVQDAVRDLPGAGQAWVVLTACLVDQLRAGF-
2233 >tr|A0A2A2L6J3|A0A2A2L6J3_9BILA Uncharacterized protein OS=Diploscapter pachys GN=WR25_22934 PE=4 SV=1
2234 -KLTKLQKKALKFTWSRLQTRnggkrVESVFEDVFDRVVRYLPQTREMFNTR--------------------------------a---FlCAISrneTSslRDHARVIFFLhsfadlcKLHDKCLLL----------IPSA--FTLCFSLCTIYELRGS--
2235 >tr|A0A1I7XY15|A0A1I7XY15_9BILA Uncharacterized protein OS=Steinernema glaseri PE=3 SV=1
2236 ----TSSLALLTSTWPDHFGNLFDMGLNALDATFKKHPDLMAYFAFNDRVnwKKEDKVRKVVLALEQTLVHAVSVFGEvhsgdekeeaiqgFEVLLEEIGGLHRAiVPNFVPEHFIKFLAVLPTAIVTTICdkreeimpESDREMLLELWKKISAFMGFHLDAG--
2237 >KNS7NT10metaT_FD_contig_41_844412_length_214_multi_3_in_0_out_0_1 # 3 # 212 # -1 # ID=205324_1;partial=11;start_type=Edge;rbs_motif=None;rbs_spacer=None;gc_cont=0.619
2238 --IPPKLAVLIREKWQAFLEKfptREQAGEAIYDSFMEEAPSLRPLFKTP--------RSVFGLRFIASLTNLMAVRPAGVTEEagGNHGF-----------------------PAPRLGG--------------------------
2239 >tr|E5SHC1|E5SHC1_TRISP Uncharacterized protein OS=Trichinella spiralis GN=Tsp_03845 PE=3 SV=1
2240 -SLSAGELKLLRWLWKQMKQVHQgLASAKLFQIIFATCPEIKRFFGLAKVS----------------DEKALIDerMRKhmlilqASKLIILFQIISSa-----------------------------------------------------
2241 >SRR5690606_9602430
2242 -------------------------YRAFYPILYSSVSGAQELFEATVG-TDNRKMLQILAKLFG----FISNVNhSSEFMKsdAFIerGKYYA-DHGISETMMRGFSSALVLTLRRTLGELFTISHVRAWGIFLDTISHAL-----
2243 >SRR5581483_1589235
2244 ---------DIKESFHRILEQKQAVTHLFFTVALGSGHEARLLIWETEG-----------------AGCSVESTDPPQWLC------------PPFTIYAQFTNDLLQALREFHGADWNQELMEQWRMTIERVGQIIFSACR
2245 >SRR5262249_34977875
2246 ------------------LEQKQAVTHLFFTVALSGCHEARLIFWGTEG-----------------AGHSGEFFSSPQMLC------------APLAMYAQFTNDLLRALREFHGADWNPELTEQWRMAIERVGQAIFATYR
2247 >SRR3954454_18132641
2248 ----------------------WRDADRPAWAALNADPEVREFFDR--------PLTrpeADASldrfrsdLAARGWGWWAIELTATGE---------------------LIGMAGLDPTE--DDIP-VAGVEMGWRlarAHWGHGYATEA----
2249 >SRR3954470_12875293
2250 ----------------------WRADDLDAWAAINADPQVRAFLGG--------VLDrgqAAESirrfrtaLAARGWGWWAVELTATGE---------------------LIGIAGLDPVD--EGLP-FDGVEIGWRlarWAWGRGYATEA----
2251 >tr|A0A1Q9EV88|A0A1Q9EV88_SYMMI Uncharacterized protein OS=Symbiodinium microadriaticum GN=AK812_SmicGene4882 PE=4 SV=1
2252 ----------------------SAFKMEVFETFFATCEQSQEYLKASNA-----KLQFIAGRILDI---MTDMFRTPQSAVkdiSALGLLHA-GYGVREELIQPFVTAFMTAVKNAC----------------------------
2253 >tr|S9TGR2|S9TGR2_9TRYP Uncharacterized protein OS=Strigomonas culicis OX=28005 GN=STCU_11951 PE=4 SV=1
2254 ---------TLEGCWQLLELrpqGLEEIAQAMYFYLLSHNRQLQSYFYGI-------DMEEQGRALVRMLCSTVHTYGRTqtecdpvawsnfEGYLVEMGARHR-SYGVGDNVFHEMRDAFFQQFPHFVDAnSWRI-TCREWHTLWDTIIRLLQQG--
2255 >tr|A0A0A2NAV4|A0A0A2NAV4_ALCFA Uncharacterized protein OS=Alcaligenes faecalis OX=511 GN=JT27_01100 PE=4 SV=1
2256 --VTDAQRDIIKTAAPLLASGDKALTTYFYELILRDSPPMSPLASQ-------------------------------------IANNHL-ALQIQPEHDPMMGTCQLQAVREELIVRMTgNKLIDGWVAAYQQLSNLLIEA--
2257 >SRR3954463_13473713
2258 -RVTPDDLKHVQRSWAKLCDRRESLLAELT-VTFQSNPALQ--C----------DACCRAEWLLCAGEELVELLPAPSTLASRARVLgDRWPDPLTAPSFEIDGRAWMAAATRCSS-MWSDTIEMAWRQAWLLLSDVLA----
2259 >ERR1711890_22380
2260 MHLSDTEKSAVVSSWSNVNS---SLLDSVLLQLVQENADMRAAMSRGDLAedsiREQETFKADVTKLTCCITKLVTRLGNTGEVSScpATCLKNC-P-YLQPKHVPLFISSFCD------KLELTEDAKKGWKFIMEKTAERI-----
2261 >tr|A0A0B2VKC9|A0A0B2VKC9_TOXCA Uncharacterized protein OS=Toxocara canis GN=Tcan_09473 PE=4 SV=1
2262 --ISPQGRDIIVNCFENS---HADIGNRICMRVFERRSDYQRFILALGKE----KWSWVTNTLRDFIEEVVLRIDDLAKideLSRKYGEDHVelKPFGFKPDFWVSLADAMIVeCVVLDMASHQPTDTVAAWSQLVSLMFSSIRDGY-
2263 >ERR1700761_7028990
2264 -PLDEEALRIVRHSAGRLTYVTDDFIDWLHREGVALSPEVGHSVAG--------EGWPFCERMAQALLWV-ALTDQPAGvaagVLRRVGADNW-RDGFPDAEYVSVVQALVRVLRGLSGAAQIPAMASAWISCFQWMQPYLLIG--
2265 >tr|A0A2A6D1B3|A0A2A6D1B3_PRIPA Uncharacterized protein OS=Pristionchus pacificus GN=PRIPAC_35146 PE=4 SV=1
2266 -TLNHQQRKLIKNGYDSWRKKsCISSGRWVHSFVSSKDDRLKEIMEGNEE-----TTRIHEETITHLLDMAVESLESLDDsLGPLLISytgpqgvFEE-KDGFDRLYWSRVSEGMCQLARNFPSKANKYETVCAWRIVVLFICNKIELGF-
2267 >tr|A0A0N5AH18|A0A0N5AH18_9BILA Uncharacterized protein OS=Syphacia muris PE=4 SV=1
2268 -SLTEKQKQLIKIGYKKWSEStTVTVGEWVYQYIFHKFPSVKGKFAKDEK-----SLAENQRRITDIIEMAVESVDSLDDsLGSFLVSyssengfLGE-SEGFDRGYWEIVSEALCQLSRHFPVKSHKSDTVLAWRIVILFVINKIEYGF-
2269 >tr|A0A0G4IA00|A0A0G4IA00_9ALVE Uncharacterized protein OS=Chromera velia CCMP2878 OX=1169474 GN=Cvel_12404 PE=4 SV=1
2270 ------------------------LAGKVFQKIITKAPSFRKLFVRPDE--------AYTKHFSVFLEQCLDYAQRPRCFWQehnDLAVKHI-IFGVGHNDITMMGRMIVEALQDIGGEGWAEDYAETWQKFWTEISRSL-----
2271 >ERR1719384_273858
2272 -----------LLGTTLTT-KLLSEKLSSRAGWA--QTQTSKMFSLLSFK------QGPAQFLVERFDILLNVIDDEDQLAEQLYqvaKTHK-KVGVDQSDLYSFQASFMKTLPSF-DSDFTAEVGNAWAYTLSH----------