view Roary/t/data/sequences_with_unknowns.faa @ 1:997f4f276c1c draft

Uploaded
author dereeper
date Fri, 14 May 2021 20:42:33 +0000
parents c47a5f61bc9f
children
line wrap: on
line source

>1234_00186 GGDEF family protein
MNFENEYDLEKLVNNSLDLLTIVDLSGNVLLVNPAFERTLGWKKEDLVGKDPFHLLHPED
KESTYKEFEKLNQGLLTLSFQNRYICADGQYRYFSWTASPDLVSGLVYVTGRDITDVIES
NRKISQLAVKLKETNDRLFEQASTDPLTKLKNRRMFNEELNNLIHACDKESHPLSLLMID
ADHFKDYNDKFGHIAGDKVLVELASILTKTFRKKDVLARYGGEEFIAALPNTSEPEANQI
AERLVQTVREFSWEKRSVTISVGITTYNFNPTSKSINSEYLLNLIEQADKALYCSKVSGR
XXXXXXXXXXXXXXXXXXXYYI*
>1234_00325 only a few unknowns so keep
MAFDPSVPQQQAQAPAGTLLFPEGSSANTLNVLHSGTVRYLTEVPGGRKLELFKLNGANL
TPGSVALFTSGRYPFHLQAEEACVISTYAMNRDTISKSVGSRVSLGLMVARTLLREITEL
FKKSNQIRKITSEIEKVNDNLSILYYQFNPSVFPDIKPGSPIPEVSADVVDPVMRLCREN
LKLFFDNGGILPDRPSPQFLEEEHESQLTRLYPEEIDFQDGEFNFIRKLVMQDPKILNVL
FTADPSMLAYVCSKLANVLDQISGILKTCLTDLDEAFRIFFIGENSLVEKFYLILDITSS
GYGTAPAEFVIPVLGAFAGKIEKYKNGHQALFGVPVANISPNTQAFQSKAVTLAKKMEET
APKVQAPVTSSATAGVDVDAIRKELDNSASVIIQFSGLGAEQIKEFSALMVKVKSLKNPL
DPEGDNRKVRRTLGRHYWDMYQECFTKYMNSNRNVPKPVELMLKYGYFDETLVDDSQIAF
MYTQKDPANFTSNVPISLGTEWLEKVFKREVPTSLDEMGQNFFEKVKLENRNIVIKKESD
IPPELDNPDTRLKFEFASLYEANVRLTSGSPATHFPILTKFHSQMAIDKSYVSKKILEEV
VHELMAVDYSIFHREVIYNNNELGITKEFIQKCVIPDFILVPSIGTKVMMWQDLSIHRGA
GSKESPGRIVLPIFAQGDLKTMVADALAAFRWELTKSILGAEWNNVGNPSITADYTDYIQ
FFKKNKDLSMEIKEKLASDFKRFRNDRDIFANDYQLWMKYEADGVQRLNKVVRGIFYRHI
PFSKQVRDKVAXXXXXXXXXXXXXLLPLQRSITDLLILEIENILK*
>1234_00377 Uncharacterized protein conserved in bacteria
MTLNEFAKNVLFGSGLEDKLFSPPVHPVDIRSFDFLNVPSLPAREKKIQISEQKSKIPRL
EQLFNEENRIITLHHFANHELMAIELFAWAILKFQDAPSSIRFGLYRTLLEEQTHLKMYL
SEMKKGGMELGDRPLNFIFWKQVPKMQTLEKFXXXXXXXXXXXXXXXXXF*