# HG changeset patch # User rnateam # Date 1474378247 14400 # Node ID 15974dd175155967af61776ea1d8ce8d6473562a # Parent f0606dfd5195275b4f48151ca459a3b70fde1f19 planemo upload for repository https://github.com/bgruening/galaxytools/tree/master/tools/mafft commit fc3727b1b39cd5654a523d03e0df2b9ac87ddcda diff -r f0606dfd5195 -r 15974dd17515 mafft.xml --- a/mafft.xml Tue Feb 23 07:28:36 2016 -0500 +++ b/mafft.xml Tue Sep 20 09:30:47 2016 -0400 @@ -16,7 +16,6 @@ $outputAlignment; - + #if $getTree == "--treeout" mv ${inputSequences}.tree $outputTree; #end if diff -r f0606dfd5195 -r 15974dd17515 test-data/mafft_fftns_result.aln --- a/test-data/mafft_fftns_result.aln Tue Feb 23 07:28:36 2016 -0500 +++ b/test-data/mafft_fftns_result.aln Tue Sep 20 09:30:47 2016 -0400 @@ -1,504 +1,504 @@ > 1== M63632 1 Lampetra japonica rhodopsin <>[BBRC174,1125-1132'91] --------------------MNGTE------------------------GDNF-------- -YVP----F-SNKTGLARSPY----------------EYPQY-------YLAEPWK----- -----YSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFG-F -TVTMYTSMN-GYFV--FGPTMCSIEGFFATLGGEVALWSLVVLAIERYIVICKPMGN-FR -FGNTHAIMGVAFTWIMALAC-AAPPLVG-W-----SRYIPEGMQCSCGPDYYTLNPNFNN -ESYVVYMFVVHFLVPFVIIFFCYGRLLCTV----KE------------------------ ----------------------------------------------------AAAAQQ--- +--------------------------------MNGTE--------------GDNF----- +-------------YVP-----F-SNKTG----------LARSPYEYPQY-YLAEPWK--- +--------------YSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMA +NLFMVLFG-FTVTMYTSMN-GYFV--FGPTMCSIEGFFATLGGEVALWSLVVLAIERYIV +ICKPMGN-FRFGNTHAIMGVAFTWIMALAC-AAPPLVG-W-----SRYIPEGMQCSCGPD +YYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTV----KE-------------- +------------------------------------------------------AAAAQQ ------------------------------------------------------------ ---------------ESASTQK------AEKEVTRMVVLMVIGFLVCWVPYASVAFYIFT- -HQGS--DFGATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTLCC---------GKN -PLGD-DE--SGASTSKTEVSSVS-TSPV-------------------------------- ---------------------------------------------SPA------------- ------- +--------------------ESASTQK------AEKEVTRMVVLMVIGFLVCWVPYASVA +FYIFT-HQGS--DFGATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTLCC------ +--GKNPLGDDE--SGASTSKTEVSSVS-TSPVS--------------------------- +-----------------------------------PA----------------------- +-- > 2== U22180 1 rat opsin [J.Mol.Neurosci.5(3),207-209'94] --------------------MNGTE------------------------GPNF-------- -YVP----F-SNITGVVRSPF----------------EQPQY-------YLAEPWQ----- -----FSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVFGG-F -TTTLYTSLH-GYFV--FGPTGCNLEGFFATLGGEIGLWSLVVLAIERYVVVCKPMSN-FR -FGENHAIMGVAFTWVMALAC-AAPPLVG-W-----SRYIPEGMQCSCGIDYYTLKPEVNN -ESFVIYMFVVHFTIPMIVIFFCYGQLVFTV----KE------------------------ ----------------------------------------------------AAAQQQ--- +--------------------------------MNGTE--------------GPNF----- +-------------YVP-----F-SNITG----------VVRSPFEQPQY-YLAEPWQ--- +--------------FSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVA +DLFMVFGG-FTTTLYTSLH-GYFV--FGPTGCNLEGFFATLGGEIGLWSLVVLAIERYVV +VCKPMSN-FRFGENHAIMGVAFTWVMALAC-AAPPLVG-W-----SRYIPEGMQCSCGID +YYTLKPEVNNESFVIYMFVVHFTIPMIVIFFCYGQLVFTV----KE-------------- +------------------------------------------------------AAAQQQ ------------------------------------------------------------ ---------------ESATTQK------AEKEVTRMVIIMVIFFLICWLPYASVAMYIFT- -HQGS--NFGPIFMTLPAFFAKTASIYNPIIYIMMNKQFRNCMLTSLCC---------GKN -PLGD-DE--ASATASKTE------TSQV-------------------------------- ---------------------------------------------APA------------- ------- +--------------------ESATTQK------AEKEVTRMVIIMVIFFLICWLPYASVA +MYIFT-HQGS--NFGPIFMTLPAFFAKTASIYNPIIYIMMNKQFRNCMLTSLCC------ +--GKNPLGDDE--ASATASKTE------TSQVA--------------------------- +-----------------------------------PA----------------------- +-- > 3== M92038 1 chicken green sensitive cone opsin [PNAS89,5932-5936'9 --------------------MNGTE------------------------GINF-------- -YVP----M-SNKTGVVRSPF----------------EYPQY-------YLAEPWK----- -----YRLVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFG-F -TVTFYTAWN-GYFV--FGPVGCAVEGFFATLGGQVALWSLVVLAIERYIVVCKPMGN-FR -FSATHAMMGIAFTWVMAFSC-AAPPLFG-W-----SRYMPEGMQCSCGPDYYTHNPDYHN -ESYVLYMFVIHFIIPVVVIFFSYGRLICKV----RE------------------------ ----------------------------------------------------AAAQQQ--- +--------------------------------MNGTE--------------GINF----- +-------------YVP-----M-SNKTG----------VVRSPFEYPQY-YLAEPWK--- +--------------YRLVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVA +DLFMACFG-FTVTFYTAWN-GYFV--FGPVGCAVEGFFATLGGQVALWSLVVLAIERYIV +VCKPMGN-FRFSATHAMMGIAFTWVMAFSC-AAPPLFG-W-----SRYMPEGMQCSCGPD +YYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKV----RE-------------- +------------------------------------------------------AAAQQQ ------------------------------------------------------------ ---------------ESATTQK------AEKEVTRMVILMVLGFMLAWTPYAVVAFWIFT- -NKGA--DFTATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTICC---------GKN -PFGD-EDVSSTVSQSKTEVSSVS-SSQV-------------------------------- ---------------------------------------------SPA------------- ------- +--------------------ESATTQK------AEKEVTRMVILMVLGFMLAWTPYAVVA +FWIFT-NKGA--DFTATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTICC------ +--GKNPFGDEDVSSTVSQSKTEVSSVS-SSQVS--------------------------- +-----------------------------------PA----------------------- +-- > 4=p A45229 opsin, green-sensitive (clone GFgr-1) - goldfish --------------------MNGTE------------------------GKNF-------- -YVP----M-SNRTGLVRSPF----------------EYPQY-------YLAEPWQ----- -----FKILALYLFFLMSMGLPINGLTLVVTAQHKKLRQPLNFILVNLAVAGTIMVCFG-F -TVTFYTAIN-GYFV--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIVVCKPMGS-FK -FSSSHAFAGIAFTWVMALAC-AAPPLFG-W-----SRYIPEGMQCSCGPDYYTLNPDYNN -ESYVIYMFVCHFILPVAVIFFTYGRLVCTV----KA------------------------ ----------------------------------------------------AAAQQQ--- +--------------------------------MNGTE--------------GKNF----- +-------------YVP-----M-SNRTG----------LVRSPFEYPQY-YLAEPWQ--- +--------------FKILALYLFFLMSMGLPINGLTLVVTAQHKKLRQPLNFILVNLAVA +GTIMVCFG-FTVTFYTAIN-GYFV--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIV +VCKPMGS-FKFSSSHAFAGIAFTWVMALAC-AAPPLFG-W-----SRYIPEGMQCSCGPD +YYTLNPDYNNESYVIYMFVCHFILPVAVIFFTYGRLVCTV----KA-------------- +------------------------------------------------------AAAQQQ ------------------------------------------------------------ ---------------DSASTQK------AEREVTKMVILMVFGFLIAWTPYATVAAWIFF- -NKGA--DFSAKFMAIPAFFSKSSALYNPVIYVLLNKQFRNCMLTTIFC---------GKN -PLGD-DE-SSTVSTSKTEVSS------V-------------------------------- ---------------------------------------------SPA------------- ------- +--------------------DSASTQK------AEREVTKMVILMVFGFLIAWTPYATVA +AWIFF-NKGA--DFSAKFMAIPAFFSKSSALYNPVIYVLLNKQFRNCMLTTIFC------ +--GKNPLGDDE-SSTVSTSKTEVSS------VS--------------------------- +-----------------------------------PA----------------------- +-- > 5=p B45229 opsin, green-sensitive (clone GFgr-2) - goldfish --------------------MNGTE------------------------GNNF-------- -YVP----L-SNRTGLVRSPF----------------EYPQY-------YLAEPWQ----- -----FKLLAVYMFFLICLGLPINGLTLICTAQHKKLRQPLNFILVNLAVAGAIMVCFG-F -TVTFYTAIN-GYFA--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIVVCKPMGS-FK -FSSTHASAGIAFTWVMAMAC-AAPPLVG-W-----SRYIPEGIQCSCGPDYYTLNPEYNN -ESYVLYMFICHFILPVTIIFFTYGRLVCTV----KA------------------------ ----------------------------------------------------AAAQQQ--- +--------------------------------MNGTE--------------GNNF----- +-------------YVP-----L-SNRTG----------LVRSPFEYPQY-YLAEPWQ--- +--------------FKLLAVYMFFLICLGLPINGLTLICTAQHKKLRQPLNFILVNLAVA +GAIMVCFG-FTVTFYTAIN-GYFA--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIV +VCKPMGS-FKFSSTHASAGIAFTWVMAMAC-AAPPLVG-W-----SRYIPEGIQCSCGPD +YYTLNPEYNNESYVLYMFICHFILPVTIIFFTYGRLVCTV----KA-------------- +------------------------------------------------------AAAQQQ ------------------------------------------------------------ ---------------DSASTQK------AEREVTKMVILMVLGFLVAWTPYATVAAWIFF- -NKGA--AFSAQFMAIPAFFSKTSALYNPVIYVLLNKQFRSCMLTTLFC---------GKN -PLGD-EE-SSTVSTSKTEVSS------V-------------------------------- ---------------------------------------------SPA------------- ------- +--------------------DSASTQK------AEREVTKMVILMVLGFLVAWTPYATVA +AWIFF-NKGA--AFSAQFMAIPAFFSKTSALYNPVIYVLLNKQFRSCMLTTLFC------ +--GKNPLGDEE-SSTVSTSKTEVSS------VS--------------------------- +-----------------------------------PA----------------------- +-- > 6== L11864 1 Carassius auratus blue cone opsin [Biochemistry32,208- --------------------MKQVPEF----------------------HEDF-------- -YIPIPLDI-NNLS--AYSPF----------------LVPQD-------HLGNQGI----- -----FMAMSVFMFFIFIGGASINILTILCTIQFKKLRSHLNYILVNLSIANLFVAIFG-S -PLSFYSFFN-RYFI--FGATACKIEGFLATLGGMVGLWSLAVVAFERWLVICKPLGN-FT -FKTPHAIAGCILPWISALAA-SLPPLFG-W-----SRYIPEGLQCSCGPDWYTTNNKYNN -ESYVMFLFCFCFAVPFGTIVFCYGQLLITL----KL------------------------ ----------------------------------------------------AAKAQA--- +--------------------------------MKQVPEF------------HEDF----- +-------------YIPIP-LDI-NNLS------------AYSPFLVPQD-HLGNQGI--- +--------------FMAMSVFMFFIFIGGASINILTILCTIQFKKLRSHLNYILVNLSIA +NLFVAIFG-SPLSFYSFFN-RYFI--FGATACKIEGFLATLGGMVGLWSLAVVAFERWLV +ICKPLGN-FTFKTPHAIAGCILPWISALAA-SLPPLFG-W-----SRYIPEGLQCSCGPD +WYTTNNKYNNESYVMFLFCFCFAVPFGTIVFCYGQLLITL----KL-------------- +------------------------------------------------------AAKAQA ------------------------------------------------------------ ---------------DSASTQK------AEREVTKMVVVMVLGFLVCWAPYASFSLWIVS- -HRGE--EFDLRMATIPSCLSKASTVYNPVIYVLMNKQFRSCMM-KMVC---------GKN --IEE-DE--ASTSSQVTQVSS------V-------------------------------- ---------------------------------------------APEK------------ ------- +--------------------DSASTQK------AEREVTKMVVVMVLGFLVCWAPYASFS +LWIVS-HRGE--EFDLRMATIPSCLSKASTVYNPVIYVLMNKQFRSCMM-KMVC------ +--GKN-IEEDE--ASTSSQVTQVSS------VA--------------------------- +-----------------------------------PEK---------------------- +-- > 7== M13299 1 human BCP <>[Science232(4747),193-202'86] --------------------MRKMS------------------------EEEF-------- -YL-----F-KNIS--SVGPW----------------DGPQY-------HIAPVWA----- -----FYLQAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFS-V -FPVFVASCN-GYFV--FGRHVCALEGFLGTVAGLVTGWSLAFLAFERYIVICKPFGN-FR -FSSKHALTVVLATWTIGIGV-SIPPFFG-W-----SRFIPEGLQCSCGPDWYTVGTKYRS -ESYTWFLFIFCFIVPLSLICFSYTQLLRAL----KA------------------------ ----------------------------------------------------VAAQQQ--- +--------------------------------MRKMS--------------EEEF----- +-------------YL------F-KNIS------------SVGPWDGPQY-HIAPVWA--- +--------------FYLQAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFG +GFLLCIFS-VFPVFVASCN-GYFV--FGRHVCALEGFLGTVAGLVTGWSLAFLAFERYIV +ICKPFGN-FRFSSKHALTVVLATWTIGIGV-SIPPFFG-W-----SRFIPEGLQCSCGPD +WYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRAL----KA-------------- +------------------------------------------------------VAAQQQ ------------------------------------------------------------ ---------------ESATTQK------AEREVSRMVVVMVGSFCVCYVPYAAFAMYMVN- -NRNH--GLDLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIM-KMVC---------GKA --MTD-ES--DTCSSQKTEVSTVS-STQV-------------------------------- ---------------------------------------------GPN------------- ------- +--------------------ESATTQK------AEREVSRMVVVMVGSFCVCYVPYAAFA +MYMVN-NRNH--GLDLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIM-KMVC------ +--GKA-MTDES--DTCSSQKTEVSTVS-STQVG--------------------------- +-----------------------------------PN----------------------- +-- > 8=opsin, greensensitive human (fragment) S07060 ------------------------------------------------------------ ------------------------------------------------------------ ---------------------------------------------------DLAETVIA-S -TISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLVVCKPFGN-VR -FDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSYPGV -QSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA------------------------ ----------------------------------------------------VAKQQK--- ------------------------------------------------------------ ---------------ESESTQK------AEKEVTRMVVVMVLAFC---------------- +DLAETVIA-STISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLV +VCKPFGN-VRFDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPD +VFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA-------------- +------------------------------------------------------VAKQQK +------------------------------------------------------------ +--------------------ESESTQK------AEKEVTRMVVVMVLAFC---------- ------------------------------------------------------------ ------------------------------------------------------------ ------------------------------------------------------------ ------- +-- > 9== K03494 1 human GCP <>[Science232(4747),193-202'86] --------------------MAQQWSL----------QRLAGRHPQDSYEDST-------- -QSSI-FTY-TNSNS-TRGPF----------------EGPNY-------HIAPRWV----- -----YHLTSVWMIFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIA-S -TISVVNQVY-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWMVVCKPFGN-VR -FDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSYPGV -QSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA------------------------ ----------------------------------------------------VAKQQK--- +--------------------------------MAQQWSLQRLAGRHPQDSYEDST----- +-------------QSSI--FTY-TNSNS-----------TRGPFEGPNY-HIAPRWV--- +--------------YHLTSVWMIFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVA +DLAETVIA-STISVVNQVY-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWMV +VCKPFGN-VRFDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPD +VFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA-------------- +------------------------------------------------------VAKQQK ------------------------------------------------------------ ---------------ESESTQK------AEKEVTRMVVVMVLAFCFCWGPYAFFACFAAA- -NPGY--PFHPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCIL-QLF----------GKK --VDD-GS--ELSSASKTEVSSV---SSV-------------------------------- ---------------------------------------------SPA------------- ------- +--------------------ESESTQK------AEKEVTRMVVVMVLAFCFCWGPYAFFA +CFAAA-NPGY--PFHPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCIL-QLF------- +--GKK-VDDGS--ELSSASKTEVSSV---SSVS--------------------------- +-----------------------------------PA----------------------- +-- > 10== Z68193 1 human Red Opsin <>[] --------------------MAQQWSL----------QRLAGRHPQDSYEDST-------- -QSSI-FTY-TNSNS-TRGPF----------------EGPNY-------HIAPRWV----- -----YHLTSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIA-S -TISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLVVCKPFGN-VR -FDAKLAIVGIAFSWIWSAVW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSYPGV -QSYMIVLMVTCCIIPLAIIMLCYLQVWLAI----RA------------------------ ----------------------------------------------------VAKQQK--- +--------------------------------MAQQWSLQRLAGRHPQDSYEDST----- +-------------QSSI--FTY-TNSNS-----------TRGPFEGPNY-HIAPRWV--- +--------------YHLTSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVA +DLAETVIA-STISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLV +VCKPFGN-VRFDAKLAIVGIAFSWIWSAVW-TAPPIFG-W-----SRYWPHGLKTSCGPD +VFSGSSYPGVQSYMIVLMVTCCIIPLAIIMLCYLQVWLAI----RA-------------- +------------------------------------------------------VAKQQK ------------------------------------------------------------ ---------------ESESTQK------AEKEVTRMVVVMIFAYCVCWGPYTFFACFAAA- -NPGY--AFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCIL-QLF----------GKK --VDD-GS--ELSSASKTEVSSV---SSV-------------------------------- ---------------------------------------------SPA------------- ------- +--------------------ESESTQK------AEKEVTRMVVVMIFAYCVCWGPYTFFA +CFAAA-NPGY--AFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCIL-QLF------- +--GKK-VDDGS--ELSSASKTEVSSV---SSVS--------------------------- +-----------------------------------PA----------------------- +-- > 11== M92036 1 Gecko gecko P521 [PNAS89,6841-6845'92] --------------------MTEAWNV----------AVFAARRSRDD-DDTT-------- -RGSV-FTY-TNTNN-TRGPF----------------EGPNY-------HIAPRWV----- -----YNLVSFFMIIVVIASCFTNGLVLVATAKFKKLRHPLNWILVNLAFVDLVETLVA-S -TISVFNQIF-GYFI--LGHPLCVIEGYVVSSCGITGLWSLAIISWERWFVVCKPFGN-IK -FDSKLAIIGIVFSWVWAWGW-SAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSVELGC -QSFMLTLMITCCFLPLFIIIVCYLQVWMAI----RA------------------------ ----------------------------------------------------VAAQQK--- +--------------------------------MTEAWNVAVFAARRSRDD-DDTT----- +-------------RGSV--FTY-TNTNN-----------TRGPFEGPNY-HIAPRWV--- +--------------YNLVSFFMIIVVIASCFTNGLVLVATAKFKKLRHPLNWILVNLAFV +DLVETLVA-STISVFNQIF-GYFI--LGHPLCVIEGYVVSSCGITGLWSLAIISWERWFV +VCKPFGN-IKFDSKLAIIGIVFSWVWAWGW-SAPPIFG-W-----SRYWPHGLKTSCGPD +VFSGSVELGCQSFMLTLMITCCFLPLFIIIVCYLQVWMAI----RA-------------- +------------------------------------------------------VAAQQK ------------------------------------------------------------ ---------------ESESTQK------AEREVSRMVVVMIVAFCICWGPYASFVSFAAA- -NPGY--AFHPLAAALPAYFAKSATIYNPVIYVFMNRQFRNCIM-QLF----------GKK --VDD-GS--EASTTSRTEVSSVS-NSSV-------------------------------- ---------------------------------------------APA------------- ------- +--------------------ESESTQK------AEREVSRMVVVMIVAFCICWGPYASFV +SFAAA-NPGY--AFHPLAAALPAYFAKSATIYNPVIYVFMNRQFRNCIM-QLF------- +--GKK-VDDGS--EASTTSRTEVSSVS-NSSVA--------------------------- +-----------------------------------PA----------------------- +-- > 12== M62903 1 chicken visual pigment <>[BBRC173,1212-1217'90] --------------------MA-AWEA----------AFAARRRHEE--EDTT-------- -RDSV-FTY-TNSNN-TRGPF----------------EGPNY-------HIAPRWV----- -----YNLTSVWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIA-S -TISVINQIS-GYFI--LGHPMCVVEGYTVSACGITALWSLAIISWERWFVVCKPFGN-IK -FDGKLAVAGILFSWLWSCAW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSDPGV -QSYMVVLMVTCCFFPLAIIILCYLQVWLAI----RA------------------------ ----------------------------------------------------VAAQQK--- +--------------------------------MAA-WEAAFAARRRHEE--EDTT----- +-------------RDSV--FTY-TNSNN-----------TRGPFEGPNY-HIAPRWV--- +--------------YNLTSVWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVA +DLGETVIA-STISVINQIS-GYFI--LGHPMCVVEGYTVSACGITALWSLAIISWERWFV +VCKPFGN-IKFDGKLAVAGILFSWLWSCAW-TAPPIFG-W-----SRYWPHGLKTSCGPD +VFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVWLAI----RA-------------- +------------------------------------------------------VAAQQK ------------------------------------------------------------ ---------------ESESTQK------AEKEVSRMVVVMIVAYCFCWGPYTFFACFAAA- -NPGY--AFHPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIL-QLF----------GKK --VDD-GS--EVST-SRTEVSSVS-NSSV-------------------------------- ---------------------------------------------SPA------------- ------- +--------------------ESESTQK------AEKEVSRMVVVMIVAYCFCWGPYTFFA +CFAAA-NPGY--AFHPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIL-QLF------- +--GKK-VDDGS--EVST-SRTEVSSVS-NSSVS--------------------------- +-----------------------------------PA----------------------- +-- > 13== S75720 1 chicken P-opsin <>[Science267(5203),1502-1506'95] --------------------MS---------------------------SNSS-------- -QAP--------PNG-TPGPF----------------DGPQW------PYQAPQST----- -----YVGVAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCG-S -SVSLSNNIN-GFFV--FGRRMCELEGFMVSLTGIVGLWSLAILALERYVVVCKPLGD-FQ -FQRRHAVSGCAFTWGWALLW-SAPPLLG-W-----SSYVPEGLRTSCGPNWYTGGSNN-- -NSYILSLFVTCFVLPLSLILFSYTNLLLTL----RA------------------------ ----------------------------------------------------AAAQQK--- +--------------------------------MS-----------------SNSS----- +-------------QAP---------PNG-----------TPGPFDGPQWPYQAPQST--- +--------------YVGVAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVA +DLLVTLCG-SSVSLSNNIN-GFFV--FGRRMCELEGFMVSLTGIVGLWSLAILALERYVV +VCKPLGD-FQFQRRHAVSGCAFTWGWALLW-SAPPLLG-W-----SSYVPEGLRTSCGPN +WYTGGSNN--NSYILSLFVTCFVLPLSLILFSYTNLLLTL----RA-------------- +------------------------------------------------------AAAQQK ------------------------------------------------------------ ---------------EADTTQR------AEREVTRMVIVMVMAFLLCWLPYSTFALVVAT- -HKGI--IIQPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLL-EMLCCGY-----QPQR --TGK-AS--PGTPGPHADVTAAGLRNKV-------------------------------- ---------------------------------------------MPAHP---V------- ------- +--------------------EADTTQR------AEREVTRMVIVMVMAFLLCWLPYSTFA +LVVAT-HKGI--IIQPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLL-EMLCCGY--- +-QPQR-TGKAS--PGTPGPHADVTAAGLRNKVM--------------------------- +-----------------------------------PAHPV-------------------- +-- > 14== M17718 1 D.melanogaster Rh3 <>[J.Neurosci.7,1550-1557'87] -----------MESGNVSSSLFGNVST----------ALRPEARL----SA---------- --ETRLLGW--------NVPP----------------EELR--------HIPEHWLTYPEP -PESMNYLLGTLYIFFTLMSMLGNGLVIWVFSAAKSLRTPSNILVINLAFCDFMMMVK--T -PIFIYNSFH-QGYA--LGHLGCQIFGIIGSYTGIAAGATNAFIAYDRFNVITRPMEG--K -MTHGKAIAMIIFIYMYATPW-VVACYTETW-----GRFVPEGYLTSCTFDYLT--DNFDT -RLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA------------------------ ----------------------------------------------------LRDQAKKM- ---------------------------------NVESL----------------------- ------------RSNVDKNKET------AEIRIAKAAITICFLFFCSWTPYGVMSLIGAF- -GDKT--LLTPGATMIPACACKMVACIDPFVYAISHPRYRMELQKRCPWLAL--------N -EKAP-ES-SAVASTSTTQEP-QQ-TTAA-------------------------------- +----------MES--GNV-----------SSSLFGNVSTAL----------RPEA----- +-------------RLSA------ETRLL----------GWNVPPEELR--HIPEHWLTYP +E--------PPESMNYLLGTLYIFFTLMSMLGNGLVIWVFSAAKSLRTPSNILVINLAFC +DFMMMVK--TPIFIYNSFH-QGYA--LGHLGCQIFGIIGSYTGIAAGATNAFIAYDRFNV +ITRPMEG--KMTHGKAIAMIIFIYMYATPW-VVACYTETW-----GRFVPEGYLTSCTFD +YLT--DNFDTRLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA-------------- +------------------------------------------------------LRDQAK +KMN---------------VESLRS------------------------------------ +-------------------NVDKNKET------AEIRIAKAAITICFLFFCSWTPYGVMS +LIGAF-GDKT--LLTPGATMIPACACKMVACIDPFVYAISHPRYRMELQKRCPWLAL--- +---NE-KAPES----SAVASTSTTQEPQQTTAA--------------------------- ------------------------------------------------------------ ------- +-- > 15== X65879 1 Drosophila pseudoobscura Dpse\Rh3 <>[Genetics132(1),193-204'92 -----------MEYHNVSSVL-GNVSS----------VLRPDARL----SA---------- --ESRLLGW--------NVPP----------------DELR--------HIPEHWLIYPEP -PESMNYLLGTLYIFFTVISMIGNGLVMWVFSAAKSLRTPSNILVINLAFCDFMMMIK--T -PIFIYNSFH-QGYA--LGHLGCQIFGVIGSYTGIAAGATNAFIAYDRYNVITRPMEG--K -MTHGKAIAMIIFIYLYATPW-VVACYTESW-----GRFVPEGYLTSCTFDYLT--DNFDT -RLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA------------------------ ----------------------------------------------------LRDQAKKM- ---------------------------------NVDSL----------------------- ------------RSNVDKSKEA------AEIRIAKAAITICFLFFASWTPYGVMSLIGAF- -GDKT--LLTPGATMIPACTCKMVACIDPFVYAISHPRYRMELQKRCPWLAI--------S -EKAP-ES-RAAISTSTTQEQ-QQ-TTAA-------------------------------- +----------MEY--HNV-----------SSVL-GNVSSVL----------RPDA----- +-------------RLSA------ESRLL----------GWNVPPDELR--HIPEHWLIYP +E--------PPESMNYLLGTLYIFFTVISMIGNGLVMWVFSAAKSLRTPSNILVINLAFC +DFMMMIK--TPIFIYNSFH-QGYA--LGHLGCQIFGVIGSYTGIAAGATNAFIAYDRYNV +ITRPMEG--KMTHGKAIAMIIFIYLYATPW-VVACYTESW-----GRFVPEGYLTSCTFD +YLT--DNFDTRLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA-------------- +------------------------------------------------------LRDQAK +KMN---------------VDSLRS------------------------------------ +-------------------NVDKSKEA------AEIRIAKAAITICFLFFASWTPYGVMS +LIGAF-GDKT--LLTPGATMIPACTCKMVACIDPFVYAISHPRYRMELQKRCPWLAI--- +---SE-KAPES----RAAISTSTTQEQQQTTAA--------------------------- ------------------------------------------------------------ ------- +-- > 16== M17730 1 D.melanogaster Rh4 opsin <>[J.Neurosci.7,1558-1566'87] -----------ME------PLCNASEP----------PLRPEAR-----SSGN-------- -GDLQFLGW--------NVPP----------------DQIQ--------YIPEHWLTQLEP -PASMHYMLGVFYIFLFCASTVGNGMVIWIFSTSKSLRTPSNMFVLNLAVFDLIMCLK--A -PIF--NSFH-RGFAIYLGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPMNR--N -MTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFDYLS--DNFDT -RLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKA------------------------ ----------------------------------------------------LREQAKKM- ---------------------------------NVESL----------------------- ------------RSNVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMSLIGAF- -GDKS--LLTQGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGV--------N -EKSG-EI-SSAQST-TTQEQ-QQ-TTAA-------------------------------- +----------ME-------------------PLCNASEPPL----------RPEA----- +-------------R-SSG---NGDLQFL----------GWNVPPDQIQ--YIPEHWLTQL +E--------PPASMHYMLGVFYIFLFCASTVGNGMVIWIFSTSKSLRTPSNMFVLNLAVF +DLIMCLK--APIF--NSFH-RGFAIYLGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNV +ITKPMNR--NMTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFD +YLS--DNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKA-------------- +------------------------------------------------------LREQAK +KMN---------------VESLRS------------------------------------ +-------------------NVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMS +LIGAF-GDKS--LLTQGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGV--- +---NE-KSGEI----SSAQSTTTQEQ-QQTTAA--------------------------- ------------------------------------------------------------ ------- +-- > 17== X65880 1 Drosophila pseudoobscura Dpse\Rh4 <>[Genetics132(1),193-204'92 -----------MD------ALCNASEP----------PLRPEARM----SSGS-------- -DELQFLGW--------NVPP----------------DQIQ--------YIPEHWLTQLEP -PASMHYMLGVFYIFLFFASTLGNGMVIWIFSTSKSLRTPSNMFVLNLAVFDLIMCLK--A -PIFIYNSFH-RGFA--LGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPMNR--N -MTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFDYLS--DNFDT -RLFVGTIFLFSFVVPTLMILYYYSQIVGHVFNHEKA------------------------ ----------------------------------------------------LREQAKKM- ---------------------------------NVESL----------------------- ------------RSNVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMSLIGAF- -GDKS--LLTPGATMIPACTCKLVACIEPFVYAISHPRYRMELQKRCPWLGV--------N -EKSG-EA-SSAQST-TTQEQTQQ-TSAA-------------------------------- +----------MD-------------------ALCNASEPPL----------RPEA----- +-------------RMSSG---SDELQFL----------GWNVPPDQIQ--YIPEHWLTQL +E--------PPASMHYMLGVFYIFLFFASTLGNGMVIWIFSTSKSLRTPSNMFVLNLAVF +DLIMCLK--APIFIYNSFH-RGFA--LGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNV +ITKPMNR--NMTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFD +YLS--DNFDTRLFVGTIFLFSFVVPTLMILYYYSQIVGHVFNHEKA-------------- +------------------------------------------------------LREQAK +KMN---------------VESLRS------------------------------------ +-------------------NVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMS +LIGAF-GDKS--LLTPGATMIPACTCKLVACIEPFVYAISHPRYRMELQKRCPWLGV--- +---NE-KSGEA----SSAQSTTTQEQTQQTSAA--------------------------- ------------------------------------------------------------ ------- +-- > 18== D50584 1 Hemigrapsus sanguineus opsin BcRh2 [J.Exp.Biol.1 --------------------MTNATGP----------QMAYYGAA----SMD--------- ------FGYPEGVSIVDFVRP----------------EIKP--------YVHQHWYNYPPV -NPMWHYLLGVIYLFLGTVSIFGNGLVIYLFNKSAALRTPANILVVNLALSDLIMLTTN-V -PFFTYNCFSGGVWM--FSPQYCEIYACLGAITGVCSIWLLCMISFDRYNIICNGFNG-PK -LTTGKAVVFALISWVIAIGC-ALPPFFG-W-----GNYILEGILDSCSYDYLT--QDFNT -FSYNIFIFVFDYFLPAAIIVFSYVFIVKAIFAHEAA------------------------ ----------------------------------------------------MRAQAKKM- ---------------------------------NVSTL----------------------- ------------RS-NEADAQR------AEIRIAKTALVNVSLWFICWTPYALISLKGVM- -GDTS--GITPLVSTLPALLAKSCSCYNPFVYAISHPKYRLAITQHLPWFCV------HET -ETKS-ND-DSQSNSTVAQDKA--------------------------------------- +--------------------------------MTNATGPQM----------AYYG----- +-------------AASMD-FGYPEGVSI----------VDFVRPEIKP--YVHQHWYNYP +P--------VNPMWHYLLGVIYLFLGTVSIFGNGLVIYLFNKSAALRTPANILVVNLALS +DLIMLTTN-VPFFTYNCFSGGVWM--FSPQYCEIYACLGAITGVCSIWLLCMISFDRYNI +ICNGFNG-PKLTTGKAVVFALISWVIAIGC-ALPPFFG-W-----GNYILEGILDSCSYD +YLT--QDFNTFSYNIFIFVFDYFLPAAIIVFSYVFIVKAIFAHEAA-------------- +------------------------------------------------------MRAQAK +KMN---------------VSTLRS------------------------------------ +--------------------NEADAQR------AEIRIAKTALVNVSLWFICWTPYALIS +LKGVM-GDTS--GITPLVSTLPALLAKSCSCYNPFVYAISHPKYRLAITQHLPWFCV--- +---HE-TETKS-NDDSQSNSTVAQDKA--------------------------------- ------------------------------------------------------------ ------- +-- > 19== D50583 1 Hemigrapsus sanguineus opsin BcRh1 [J.Exp.Biol.1 --------------------MANVTGP----------QMAFYGSG----AAT--------- ------FGYPEGMTVADFVPD----------------RVKH--------MVLDHWYNYPPV -NPMWHYLLGVVYLFLGVISIAGNGLVIYLYMKSQALKTPANMLIVNLALSDLIMLTTN-F -PPFCYNCFSGGRWM--FSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGFNG-PK -LTQGKATFMCGLAWVISVGW-SLPPFFG-W-----GSYTLEGILDSCSYDYFT--RDMNT -ITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAA------------------------ ----------------------------------------------------MRAQAKKM- ---------------------------------NVTNL----------------------- ------------RS-NEAETQR------AEIRIAKTALVNVSLWFICWTPYAAITIQGLL- -GNAE--GITPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCV------HEK -DPND-VE-ENQSSNTQTQEKS--------------------------------------- +--------------------------------MANVTGPQM----------AFYG----- +-------------SGAAT-FGYPEGMTV----------ADFVPDRVKH--MVLDHWYNYP +P--------VNPMWHYLLGVVYLFLGVISIAGNGLVIYLYMKSQALKTPANMLIVNLALS +DLIMLTTN-FPPFCYNCFSGGRWM--FSGTYCEIYAALGAITGVCSIWTLCMISFDRYNI +ICNGFNG-PKLTQGKATFMCGLAWVISVGW-SLPPFFG-W-----GSYTLEGILDSCSYD +YFT--RDMNTITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAA-------------- +------------------------------------------------------MRAQAK +KMN---------------VTNLRS------------------------------------ +--------------------NEAETQR------AEIRIAKTALVNVSLWFICWTPYAAIT +IQGLL-GNAE--GITPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCV--- +---HE-KDPND-VEENQSSNTQTQEKS--------------------------------- ------------------------------------------------------------ ------- +-- > 20== K02320 1 D.melanogaster opsin <>[Cell40,851-858'85] -----------ME---SFAVAAAQLGP----------HFAPLS------------------ -----------NGSVVDKVTP----------------DMAH--------LISPYWNQFPAM -DPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAISDFGIMITN-T -PMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAG-RP -MTIPLALGKM---------------------------YVPEGNLTSCGIDYLE--RDWNP -RSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA------------------------ ----------------------------------------------------MREQAKKM- ---------------------------------NVKSL----------------------- ------------RS-SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVINCMGLF- -KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF------GKV -DDGK-SS-DAQSQATASEAESKA------------------------------------- +---------------MES-----------FAVAAAQLGPHF----------APLS----- +------------------------NGSV----------VDKVTPDMAH--LISPYWNQFP +A--------MDPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAIS +DFGIMITN-TPMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQV +IVKGMAG-RPMTIPLALGKM---------------------------YVPEGNLTSCGID +YLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA-------------- +------------------------------------------------------MREQAK +KMN---------------VKSLRS------------------------------------ +--------------------SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVIN +CMGLF-KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF--- +---GK-VDDGK-SSDAQSQATASEAESKA------------------------------- ------------------------------------------------------------ ------- +-- > 21== K02315 1 D.melanogaster ninaE <>[Cell40,839-850'85] -----------ME---SFAVAAAQLGP----------HFAPLS------------------ -----------NGSVVDKVTP----------------DMAH--------LISPYWNQFPAM -DPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAISDFGIMITN-T -PMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAG-RP -MTIPLALGKIAYIWFMSSIW-CLAPAFG-W-----SRYVPEGNLTSCGIDYLE--RDWNP -RSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA------------------------ ----------------------------------------------------MREQAKKM- ---------------------------------NVKSL----------------------- ------------RS-SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVINCMGLF- -KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF------GKV -DDGK-SS-DAQSQATASEAESKA------------------------------------- +---------------MES-----------FAVAAAQLGPHF----------APLS----- +------------------------NGSV----------VDKVTPDMAH--LISPYWNQFP +A--------MDPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAIS +DFGIMITN-TPMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQV +IVKGMAG-RPMTIPLALGKIAYIWFMSSIW-CLAPAFG-W-----SRYVPEGNLTSCGID +YLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA-------------- +------------------------------------------------------MREQAK +KMN---------------VKSLRS------------------------------------ +--------------------SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVIN +CMGLF-KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF--- +---GK-VDDGK-SSDAQSQATASEAESKA------------------------------- ------------------------------------------------------------ ------- +-- > 22== X65877 1 Drosophila pseudoobscura Dpse\ninaE <>[Genetics132(1),193-204' -----------MD---SFAAVATQLGP----------QFAAPS------------------ -----------NGSVVDKVTP----------------DMAH--------LISPYWDQFPAM -DPIWAKILTAYMIIIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAISDFGIMITN-T -PMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAG-RP -MTIPLALGKIAYIWFMSTIWCCLAPVFG-W-----SRYVPEGNLTSCGIDYLE--RDWNP -RSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA------------------------ ----------------------------------------------------MREQAKKM- ---------------------------------NVKSL----------------------- ------------RS-SEDADKS------AEGKLAKVALVTISLWFMAWTPYLVINCMGLF- -KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF------GKV -DDGK-SS-EAQSQATTSEAESKA------------------------------------- +---------------MDS-----------FAAVATQLGPQF----------AAPS----- +------------------------NGSV----------VDKVTPDMAH--LISPYWDQFP +A--------MDPIWAKILTAYMIIIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAIS +DFGIMITN-TPMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQV +IVKGMAG-RPMTIPLALGKIAYIWFMSTIWCCLAPVFG-W-----SRYVPEGNLTSCGID +YLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA-------------- +------------------------------------------------------MREQAK +KMN---------------VKSLRS------------------------------------ +--------------------SEDADKS------AEGKLAKVALVTISLWFMAWTPYLVIN +CMGLF-KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF--- +---GK-VDDGK-SSEAQSQATTSEAESKA------------------------------- ------------------------------------------------------------ ------- +-- > 23== M12896 1 D.melanogaster Rh2 <>[Cell44,705-710'86] ------MERSHLP---ETPFDLAHSGP----------RFQAQSSG---------------- -----------NGSVLDNVLP----------------DMAH--------LVNPYWSRFAPM -DPMMSKILGLFTLAIMIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFSDFCMMASQ-S -PVMIINFYY-ETWV--LGPLWCDIYAGCGSLFGCVSIWSMCMIAFDRYNVIVKGING-TP -MTIKTSIMKILFIWMMAVFW-TVMPLIG-W-----SAYVPEGNLTACSIDYMT--RMWNP -RSYLITYSLFVYYTPLFLICYSYWFIIAAVAAHEKA------------------------ ----------------------------------------------------MREQAKKM- ---------------------------------NVKSL----------------------- ------------RS-SEDCDKS------AEGKLAKVALTTISLWFMAWTPYLVICYFGLF- -KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPKYRIVLKEKCPMCVF------GNT -DEPKPDA-PASDTETTSEADSKA------------------------------------- +----------MERSHLPE-----------TPFDLAHSGPRF----------QAQS----- +-------------SG---------NGSV----------LDNVLPDMAH--LVNPYWSRFA +P--------MDPMMSKILGLFTLAIMIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFS +DFCMMASQ-SPVMIINFYY-ETWV--LGPLWCDIYAGCGSLFGCVSIWSMCMIAFDRYNV +IVKGING-TPMTIKTSIMKILFIWMMAVFW-TVMPLIG-W-----SAYVPEGNLTACSID +YMT--RMWNPRSYLITYSLFVYYTPLFLICYSYWFIIAAVAAHEKA-------------- +------------------------------------------------------MREQAK +KMN---------------VKSLRS------------------------------------ +--------------------SEDCDKS------AEGKLAKVALTTISLWFMAWTPYLVIC +YFGLF-KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPKYRIVLKEKCPMCVF--- +---GN-TDEPKPDAPASDTETTSEADSKA------------------------------- ------------------------------------------------------------ ------- +-- > 24== X65878 1 Drosophila pseudoobscura Dpse\Rh2 <>[Genetics132(1),193-204'92 ------MERSLLP---EPPLAMALLGP----------RFEAQTGG---------------- -----------NRSVLDNVLP----------------DMAP--------LVNPHWSRFAPM -DPTMSKILGLFTLVILIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFSDFCMMASQ-S -PVMIINFYY-ETWV--LGPLWCDIYAACGSLFGCVSIWSMCMIAFDRYNVIVKGING-TP -MTIKTSIMKIAFIWMMAVFW-TIMPLIG-W-----SSYVPEGNLTACSIDYMT--RQWNP -RSYLITYSLFVYYTPLFMICYSYWFIIATVAAHEKA------------------------ ----------------------------------------------------MRDQAKKM- ---------------------------------NVKSL----------------------- ------------RS-SEDCDKS------AENKLAKVALTTISLWFMAWTPYLIICYFGLF- -KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPNDRLVLKEKCPMCVC------GTT -DEPKPDA-PPSDTETTSEAESKD------------------------------------- +----------MERSLLPE-----------PPLAMALLGPRF----------EAQT----- +-------------GG---------NRSV----------LDNVLPDMAP--LVNPHWSRFA +P--------MDPTMSKILGLFTLVILIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFS +DFCMMASQ-SPVMIINFYY-ETWV--LGPLWCDIYAACGSLFGCVSIWSMCMIAFDRYNV +IVKGING-TPMTIKTSIMKIAFIWMMAVFW-TIMPLIG-W-----SSYVPEGNLTACSID +YMT--RQWNPRSYLITYSLFVYYTPLFMICYSYWFIIATVAAHEKA-------------- +------------------------------------------------------MRDQAK +KMN---------------VKSLRS------------------------------------ +--------------------SEDCDKS------AENKLAKVALTTISLWFMAWTPYLIIC +YFGLF-KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPNDRLVLKEKCPMCVC--- +---GT-TDEPKPDAPPSDTETTSEAESKD------------------------------- ------------------------------------------------------------ ------- +-- > 25== U26026 1 Apis mellifera long-wavelength rhodopsin <>[] --------------------MIAVSGP----------SYEAFSYG----GQA--------- -----RF---NNQTVVDKVPP----------------DMLH--------LIDANWYQYPPL -NPMWHGILGFVIGMLGFVSAMGNGMVVYIFLSTKSLRTPSNLFVINLAISNFLMMFCM-S -PPMVINCYY-ETWV--LGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGLSG-KP -LSINGALIRIIAIWLFSLGW-TIAPMFG-W-----NRYVPEGNMTACGTDYFN--RGLLS -ASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKN------------------------ ----------------------------------------------------MREQAKKM- ---------------------------------NVASL----------------------- ------------RS-SENQNTS------AECKLAKVALMTISLWFMAWTPYLVINFSGIF- -NL-V--KISPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLAC-------AA -EPSS-DA-VSTTSGTTTVTDNEK-SNA--------------------------------- +--------------------------------MIAVSGPSY----------EAFS----- +-------------YGGQARF---NNQTV----------VDKVPPDMLH--LIDANWYQYP +P--------LNPMWHGILGFVIGMLGFVSAMGNGMVVYIFLSTKSLRTPSNLFVINLAIS +NFLMMFCM-SPPMVINCYY-ETWV--LGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNV +IVKGLSG-KPLSINGALIRIIAIWLFSLGW-TIAPMFG-W-----NRYVPEGNMTACGTD +YFN--RGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKN-------------- +------------------------------------------------------MREQAK +KMN---------------VASLRS------------------------------------ +--------------------SENQNTS------AECKLAKVALMTISLWFMAWTPYLVIN +FSGIF-NL-V--KISPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLAC--- +----A-AEPSS-DAVSTTSGTTTVTDNEKSNA---------------------------- ------------------------------------------------------------ ------- +-- > 26== L03781 1 Limulus polyphemus opsin <>[PNAS90,6150-6154'93] ----------------------MANQL----------SYSSLGWP----YQP--------- -----------NASVVDTMPK----------------EMLY--------MIHEHWYAFPPM -NPLWYSILGVAMIILGIICVLGNGMVIYLMMTTKSLRTPTNLLVVNLAFSDFCMMAFM-M -PTMTSNCFA-ETWI--LGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGMAA-AP -LTHKKATLLLLFVWIWSGGW-TILPFFG-W-----SRYVPEGNLTSCTVDYLT--KDWSS -ASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQ------------------------ ----------------------------------------------------LREQAKKM- ---------------------------------NVASL----------------------- ------------RANADQQKQS------AECRLAKVAMMTVGLWFMAWTPYLIISWAGVF- -SSGT--RLTPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLAC------GSG -ESGS-DV-KSEASATTTMEEKPK-IPEA-------------------------------- +----------------------------------MANQLSY----------SSLG----- +-------------WPYQP------NASV----------VDTMPKEMLY--MIHEHWYAFP +P--------MNPLWYSILGVAMIILGIICVLGNGMVIYLMMTTKSLRTPTNLLVVNLAFS +DFCMMAFM-MPTMTSNCFA-ETWI--LGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNV +IVRGMAA-APLTHKKATLLLLFVWIWSGGW-TILPFFG-W-----SRYVPEGNLTSCTVD +YLT--KDWSSASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQ-------------- +------------------------------------------------------LREQAK +KMN---------------VASLRA------------------------------------ +-------------------NADQQKQS------AECRLAKVAMMTVGLWFMAWTPYLIIS +WAGVF-SSGT--RLTPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLAC--- +---GS-GESGS-DVKSEASATTTMEEKPKIPEA--------------------------- ------------------------------------------------------------ ------- +-- > 27== X07797 1 Octopus dofleini rhodopsin <>[FEBS232(1),69-72'88] -------------------------------------MVESTTLV----NQT--------- ------WWY--NPTVD----------------------------------IHPHWAKFDPI -PDAVYYSVGIFIGVVGIIGILGNGVVIYLFSKTKSLQTPANMFIINLAMSDLSFSAINGF -PLKTISAFM-KKWI--FGKVACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPMAASKK -MSHRRAFLMIIFVWMWSIVW-SVGPVFN-W-----GAYVPEGILTSCSFDYLS--TDPST -RSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKE------------------------ ----------------------------------------------------MAAMAKRL- ---------------------------------NAKEL----------------------- ------------R--KAQAGAS------AEMKLAKISMVIITQFMLSWSPYAIIALLAQF- -GPAE--WVTPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQFDEKEC -EDAN-DA-EEEVVASER--GGES-RDAAQMKEMMAMMQKMQAQQAAYQPPPPPQGY--PP -QGYPPQGAYPPPQGYPPQGYPPQGYPPQGYPPQGAPPQVEAPQGAPPQG---VDNQAYQA ------- +---------------------------------------MV----------ESTT----- +-------------LVNQT-WWY--NPTV----------D------------IHPHWAKFD +P--------IPDAVYYSVGIFIGVVGIIGILGNGVVIYLFSKTKSLQTPANMFIINLAMS +DLSFSAINGFPLKTISAFM-KKWI--FGKVACQLYGLLGGIFGFMSINTMAMISIDRYNV +IGRPMAASKKMSHRRAFLMIIFVWMWSIVW-SVGPVFN-W-----GAYVPEGILTSCSFD +YLS--TDPSTRSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKE-------------- +------------------------------------------------------MAAMAK +RLN---------------AKELR------------------------------------- +--------------------KAQAGAS------AEMKLAKISMVIITQFMLSWSPYAIIA +LLAQF-GPAE--WVTPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQ +FDEKE-CEDAN-DAEEEVVASER--GGESRDAAQMKEMMAMMQKMQAQQAAYQPPPPPQG +Y--PPQGYPPQGAYPPPQGYPPQGYPPQGYPPQGYPPQGAPPQVEAPQGAPPQGVDNQAY +QA > 28== X70498 1 Todarodes pacificus rhodopsin [FEBS317(1-2),5-11'93] --------------------------------------MGRDLRD----NET--------- ------WWY--NPSIV----------------------------------VHPHWREFDQV -PDAVYYSLGIFIGICGIIGCGGNGIVIYLFTKTKSLQTPANMFIINLAFSDFTFSLVNGF -PLMTISCFL-KKWI--FGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPMAASKK -MSHRRAFIMIIFVWLWSVLW-AIGPIFG-W-----GAYTLEGVLCNCSFDYIS--RDSTT -RSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKE------------------------ ----------------------------------------------------MAAMAKRL- ---------------------------------NAKEL----------------------- ------------R--KAQAGAN------AEMRLAKISIVIVSQFLLSWSPYAVVALLAQF- -GPLE--WVTPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQFDDKET -EDDK-DA-ETEIPAGESSDAAPS-ADAAQMKEMMAMMQKMQQQQAAY----PPQGYAPPP -QGYPPQGY--PPQGYPPQGYPPQGYPP---PPQGAPPQ-GAPPAAPPQG---VDNQAYQA ------- +----------------------------------------M----------GRDL----- +-------------RDNET-WWY--NPSI----------V------------VHPHWREFD +Q--------VPDAVYYSLGIFIGICGIIGCGGNGIVIYLFTKTKSLQTPANMFIINLAFS +DFTFSLVNGFPLMTISCFL-KKWI--FGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNV +IGRPMAASKKMSHRRAFIMIIFVWLWSVLW-AIGPIFG-W-----GAYTLEGVLCNCSFD +YIS--RDSTTRSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKE-------------- +------------------------------------------------------MAAMAK +RLN---------------AKELR------------------------------------- +--------------------KAQAGAN------AEMRLAKISIVIVSQFLLSWSPYAVVA +LLAQF-GPLE--WVTPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQ +FDDKE-TEDDK-DAETEIPAGESSDAAPSADAAQMKEMMAMMQKMQQQQAAY----PPQG +YAPPPQGYPPQGY--PPQGYPPQGYPPQGYPP---PPQGAPPQ-GAPPAAPPQGVDNQAY +QA > 29== L21195 1 human serotonin 5-HT7 receptor protein 30== L15228 1 rat 5HT-7 serotonin receptor <>[JBC268,18200-18204'93] ------------------------------------------------------------ --MPHLLSGFLEVTASPAPTW----------------DAPPDNVSGCGEQIN--------Y -GRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALADLSVAVAV-M -PFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLGITRPLTYPVR -QNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQDF--------- --GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF--------------------- -----------------------------------P--------GFPR----VQPES---- ----VISL-----------------NGVVKLQ--------KEVEECAN------------- ------LSRLLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLSTARPFI -CGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRPTSRSLLQCQYR--------- ------NINRKLSAAGMHEALKLA------------------------------------- --------------------------------------------ERPERSEFVLQNSDHCG -KKGHDT +----MPHLLSGFLEVTAS-----PAPTW------------DAPPDNVS--GCGEQIN--- +---------YGRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALA +DLSVAVAV-MPFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLG +ITRPLTYPVRQNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQD +F----------GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF----------- +--------------------------------------------P----GFPRVQPESVI +SLNG--------------VVKLQ----------------------KEVEECANLSR---- +--------------LLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLS +TARPFICGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRPTSRSLLQC------ +---QY-RNINR--KLSAAGMHEALKLAER------------------------------- +-----------------------------------PERSEFVL-QNSDHCGKKGHDT--- +-- > 31=p A47425 serotonin receptor 5HT-7 - rat ------------------------------------------------------------ --MPHLLSGFLEVTASPAPTW----------------DAPPDNVSGCGEQIN--------Y -GRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALADLSVAVAV-M -PFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLGITRPLTYPVR -QNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQDF--------- --GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF--------------------- -----------------------------------P--------GFPR----VQPES---- ----VISL-----------------NGVVKLQ--------KEVEECAN------------- ------LSRLLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLSTARPFI -CGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRTTYRSLLQCQYR--------- ------NINRKLSAAGMHEALKLA------------------------------------- --------------------------------------------ERPERSEFVLQNSDHCG -KKGHDT +----MPHLLSGFLEVTAS-----PAPTW------------DAPPDNVS--GCGEQIN--- +---------YGRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALA +DLSVAVAV-MPFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLG +ITRPLTYPVRQNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQD +F----------GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF----------- +--------------------------------------------P----GFPRVQPESVI +SLNG--------------VVKLQ----------------------KEVEECANLSR---- +--------------LLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLS +TARPFICGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRTTYRSLLQC------ +---QY-RNINR--KLSAAGMHEALKLAER------------------------------- +-----------------------------------PERSEFVL-QNSDHCGKKGHDT--- +-- > 32== M83181 1 human serotonin receptor <>[JBC267(11),7553-7562'92] -----------MD-------VLSPG------------QGNNTTSPPAPFETGG-------- -----------NTTGISDVTV---------------------------------------- ---SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVTDLMVSVLV-L -PMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWAITDPIDYVNK -RTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKDH--------- --GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK------------------------ ----------------TVKKVEKTGADTRHGASPAPQPKKS-----------VNGESGSR- ---------NWRLGVESKAGGALCANGAVRQGDDGAALEVIEVHRVGNSKEHLPLPSEAG- --PTPCAPASFERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVALVLPF- -CESSC-HMPTLLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKCKFC--------- ------RQ----------------------------------------------------- +----------MDVLSPG------------QGNNTTSPPAPF----------ETGG----- +-------------NTTGI-----SDVTV-------------------------------- +------------SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVT +DLMVSVLV-LPMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWA +ITDPIDYVNKRTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKD +H----------GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK-------------- +-------------------------TVKKVEKTGADTRHGASPAP---------QPKKS- +-VNGESGSRNWRL-----GVESKAGGALCANGAVRQGDDGAALEVIEVHRVGNSKEHLPL +PSEAG--PTPCAPASFERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVA +LVLPF-CESSC-HMPTLLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKC------ +---KF-CRQ--------------------------------------------------- ------------------------------------------------------------ ------- +-- > 33=p A35181 serotonin receptor class 1A - rat -----------MD-------VFSFG------------QGNNTTASQEPFGTGG-------- -----------NVTSISDVTF---------------------------------------- ---SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVTDLMVSVLV-L -PMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWAITDPIDYVNK -RTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKDH--------- --GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK------------------------ ----------------TVRKVEKKGAGTSLGTSSAPPPKKS-----------LNGQPGSG- ---------DWRRCAENRAVGTPCTNGAVRQGDDEATLEVIEVHRVGNSKEHLPLPSESG- --SNSYAPACLERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVALVLPF- -CESSC-HMPALLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKCKFC--------- ------RR----------------------------------------------------- +----------MDVFSFG------------QGNNTTASQEPF----------GTGG----- +-------------NVTSI-----SDVTF-------------------------------- +------------SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVT +DLMVSVLV-LPMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWA +ITDPIDYVNKRTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKD +H----------GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK-------------- +-------------------------TVRKVEKKGAGTSLGTSSAP---------PPKKS- +-LNGQPGSGDWRR-----CAENRAVGTPCTNGAVRQGDDEATLEVIEVHRVGNSKEHLPL +PSESG--SNSYAPACLERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVA +LVLPF-CESSC-HMPALLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKC------ +---KF-CRR--------------------------------------------------- ------------------------------------------------------------ ------- +-- > 34== L06803 1 Lymnaea stagnalis serotonin receptor <>[PNAS90,11-15'93] -MANFTFGDLALD-------VARMG-----GLASTPSGLRSTGLTTPGLSPTG-------- -----------LVTSDFNDSYGLTGQFINGSHSSRSRDNASANDTSATNMTDDRYWSLTVY -SHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVADLMVAVLV-M -PLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWAVTS-IDYIRR -RSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQDK--------- --GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEETTLVASPKTE -YSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENANGVNSNSSS-- ---------SERLKQIQIETAEAFANGCA----EEASIAMLERQ-CNNGKKISSNDTPYS- -------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIALIGPF- -VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFGKYR--------- ------RGHR--------------------------------------------------- +MANFTFGDLALDVARMG-----GLASTPSGLRSTGLTTPGL----------SPTG----- +-------------LVTSD-----FNDSYGLTGQFINGSHSSRSRDNAS--ANDTSATNMT +DDRYWSLTVYSHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVA +DLMVAVLV-MPLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWA +VTS-IDYIRRRSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQD +K----------GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEE +TTLVASPKTEYSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENAN +GVNSNSSS----------SERLKQIQIETAEAFANGCAEEASIAMLERQ-CNNGKKISSN +DTPYS-------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIA +LIGPF-VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFG------ +---KY-RRGHR------------------------------------------------- ------------------------------------------------------------ ------- +-- > 35=p A47174 serotonin receptor, 5HTlym receptor - great pond snail -MANFTFGDLALD-------VARMG-----GLASTPSGLRSTGLTTPGLSPTG-------- -----------LVTSDFNDSYGLTGQFINGSHSSRSRDNASANDTSATNMTDDRYWSLTVY -SHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVADLMVAVLV-M -PLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWAVTS-IDYIRR -RSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQDK--------- --GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEETTLVASPKTE -YSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENANGVNSNSSS-- ---------SERLKQIQIETAEAFANGCA----EEASIAMLERQ-CNNGKKISSNDTPYS- -------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIALIGPF- -VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFGKYR--------- ------RGHR--------------------------------------------------- +MANFTFGDLALDVARMG-----GLASTPSGLRSTGLTTPGL----------SPTG----- +-------------LVTSD-----FNDSYGLTGQFINGSHSSRSRDNAS--ANDTSATNMT +DDRYWSLTVYSHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVA +DLMVAVLV-MPLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWA +VTS-IDYIRRRSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQD +K----------GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEE +TTLVASPKTEYSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENAN +GVNSNSSS----------SERLKQIQIETAEAFANGCAEEASIAMLERQ-CNNGKKISSN +DTPYS-------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIA +LIGPF-VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFG------ +---KY-RRGHR------------------------------------------------- ------------------------------------------------------------ ------- +-- > 36== X95604 1 Bombyx mori serotonin receptor [InsectBiochem.Mol.Bi --MEGAEGQEELD-------WEAL-------YLRLP--LQNCSWNSTGWEPNW-------- -----------NVTVVPNTTW---------WQASAPFDTPAALVRAAAK------------ ---------AVVLGLLILATVVGNVFVIAAILLERHLRSAANNLILSLAVADLLVACLV-M -PLGAVYEVV-QRWT--LGPELCDMWTSGDVLCCTASILHLVAIALDRYWAVTN-IDYIHA -STAKRVGMMIACVWTVSFFV-CIAQLLG-WKDPDWNQRVSEDLRCVVSQDV--------- --GYQIFATASSFYVPVLIILILYWRIYQTARKRIR------------------------- ---------------------RRRGATARGGVGPPP---------VPAGGALVAGGGSGGI -AAAVVAVIGRPLPTISETTTTGFTNVSS----NNTS---PEKQSCANGLEADPPTTGYGA -VAAAYYPSLVRRKPKEAADSK------RERKAAKTLAIITGAFVACWLPFFVLAILVPT- -CDCE---VSPVLTSLSLWLGYFNSTLNPVIYTVFSPEFRHAFQRLLCGRRV--------- ------RRRRA-------------------------------------------------- ----------------------------------------------PQ------------- ------- +-MEGAEGQEELDWEAL-------YLRLP--LQNCSWNSTGW----------EPNW----- +-------------NVTVV-----PNTTW---------WQASAPFDTPA--ALVRAAAK-- +------------------AVVLGLLILATVVGNVFVIAAILLERHLRSAANNLILSLAVA +DLLVACLV-MPLGAVYEVV-QRWT--LGPELCDMWTSGDVLCCTASILHLVAIALDRYWA +VTN-IDYIHASTAKRVGMMIACVWTVSFFV-CIAQLLG-WKDPDWNQRVSEDLRCVVSQD +V----------GYQIFATASSFYVPVLIILILYWRIYQTARKRIR--------------- +------------------------------RRRGATARGGVGPPP---------VPAGGA +LVAGGGSGGIAAAVVAVIGRPLPTISETTTTGFTNVSSNNTS---PEKQSCANGLEADPP +TTGYGAVAAAYYPSLVRRKPKEAADSK------RERKAAKTLAIITGAFVACWLPFFVLA +ILVPT-CDCE---VSPVLTSLSLWLGYFNSTLNPVIYTVFSPEFRHAFQRLLCG------ +---RR-VRRRR--A---------------------------------------------- +-----------------------------------PQ----------------------- +-- diff -r f0606dfd5195 -r 15974dd17515 test-data/mafft_nwns_result.aln --- a/test-data/mafft_nwns_result.aln Tue Feb 23 07:28:36 2016 -0500 +++ b/test-data/mafft_nwns_result.aln Tue Sep 20 09:30:47 2016 -0400 @@ -1,270 +1,270 @@ CLUSTAL format alignment by MAFFT NW-NS-2 (v7.221) -1== -------------------MNGTE------------------------GDNF-------- -2== -------------------MNGTE------------------------GPNF-------- -3== -------------------MNGTE------------------------GINF-------- -4=p -------------------MNGTE------------------------GKNF-------- -5=p -------------------MNGTE------------------------GNNF-------- -6== -------------------MKQVPEF----------------------HEDF-------- -7== -------------------MRKMS------------------------EEEF-------- +1== --------------------------------MNGTE--------------GDNF----- +2== --------------------------------MNGTE--------------GPNF----- +3== --------------------------------MNGTE--------------GINF----- +4=p --------------------------------MNGTE--------------GKNF----- +5=p --------------------------------MNGTE--------------GNNF----- +6== --------------------------------MKQVPEF------------HEDF----- +7== --------------------------------MRKMS--------------EEEF----- 8=opsin, ------------------------------------------------------------ -9== -------------------MAQQWSL----------QRLAGRHPQDSYEDST-------- -10== -------------------MAQQWSL----------QRLAGRHPQDSYEDST-------- -11== -------------------MTEAWNV----------AVFAARRSRDD-DDTT-------- -12== -------------------MA-AWEA----------AFAARRRHEE--EDTT-------- -13== -------------------MS---------------------------SNSS-------- -14== ----------MESGNVSSSLFGNVST----------ALRPEARL----SA---------- -15== ----------MEYHNVSSVL-GNVSS----------VLRPDARL----SA---------- -16== ----------ME------PLCNASEP----------PLRPEAR-----SSGN-------- -17== ----------MD------ALCNASEP----------PLRPEARM----SSGS-------- -18== -------------------MTNATGP----------QMAYYGAA----SMD--------- -19== -------------------MANVTGP----------QMAFYGSG----AAT--------- -20== ----------ME---SFAVAAAQLGP----------HFAPLS------------------ -21== ----------ME---SFAVAAAQLGP----------HFAPLS------------------ -22== ----------MD---SFAAVATQLGP----------QFAAPS------------------ -23== -----MERSHLP---ETPFDLAHSGP----------RFQAQSSG---------------- -24== -----MERSLLP---EPPLAMALLGP----------RFEAQTGG---------------- -25== -------------------MIAVSGP----------SYEAFSYG----GQA--------- -26== ---------------------MANQL----------SYSSLGWP----YQP--------- -27== ------------------------------------MVESTTLV----NQT--------- -28== -------------------------------------MGRDLRD----NET--------- -29== ---------MMD-------VNSSGRPDLYGHLRSF-LLPEVGRGLPDLSPDGGADPVAGS +9== --------------------------------MAQQWSLQRLAGRHPQDSYEDST----- +10== --------------------------------MAQQWSLQRLAGRHPQDSYEDST----- +11== --------------------------------MTEAWNVAVFAARRSRDD-DDTT----- +12== --------------------------------MAA-WEAAFAARRRHEE--EDTT----- +13== --------------------------------MS-----------------SNSS----- +14== ----------MES--GNV-----------SSSLFGNVSTAL----------RPEA----- +15== ----------MEY--HNV-----------SSVL-GNVSSVL----------RPDA----- +16== ----------ME-------------------PLCNASEPPL----------RPEA----- +17== ----------MD-------------------ALCNASEPPL----------RPEA----- +18== --------------------------------MTNATGPQM----------AYYG----- +19== --------------------------------MANVTGPQM----------AFYG----- +20== ---------------MES-----------FAVAAAQLGPHF----------APLS----- +21== ---------------MES-----------FAVAAAQLGPHF----------APLS----- +22== ---------------MDS-----------FAAVATQLGPQF----------AAPS----- +23== ----------MERSHLPE-----------TPFDLAHSGPRF----------QAQS----- +24== ----------MERSLLPE-----------PPLAMALLGPRF----------EAQT----- +25== --------------------------------MIAVSGPSY----------EAFS----- +26== ----------------------------------MANQLSY----------SSLG----- +27== ---------------------------------------MV----------ESTT----- +28== ----------------------------------------M----------GRDL----- +29== ---------MMDVNSSGRPDLYGHLRSF-LLPEVGRGLPDL----------SPDGGADPV 30== ------------------------------------------------------------ 31=p ------------------------------------------------------------ -32== ----------MD-------VLSPG------------QGNNTTSPPAPFETGG-------- -33=p ----------MD-------VFSFG------------QGNNTTASQEPFGTGG-------- -34== MANFTFGDLALD-------VARMG-----GLASTPSGLRSTGLTTPGLSPTG-------- -35=p MANFTFGDLALD-------VARMG-----GLASTPSGLRSTGLTTPGLSPTG-------- -36== -MEGAEGQEELD-------WEAL-------YLRLP--LQNCSWNSTGWEPNW-------- +32== ----------MDVLSPG------------QGNNTTSPPAPF----------ETGG----- +33=p ----------MDVFSFG------------QGNNTTASQEPF----------GTGG----- +34== MANFTFGDLALDVARMG-----GLASTPSGLRSTGLTTPGL----------SPTG----- +35=p MANFTFGDLALDVARMG-----GLASTPSGLRSTGLTTPGL----------SPTG----- +36== -MEGAEGQEELDWEAL-------YLRLP--LQNCSWNSTGW----------EPNW----- -1== YVP----F-SNKTGLARSPY----------------EYPQY-------YLAEPWK----- -2== YVP----F-SNITGVVRSPF----------------EQPQY-------YLAEPWQ----- -3== YVP----M-SNKTGVVRSPF----------------EYPQY-------YLAEPWK----- -4=p YVP----M-SNRTGLVRSPF----------------EYPQY-------YLAEPWQ----- -5=p YVP----L-SNRTGLVRSPF----------------EYPQY-------YLAEPWQ----- -6== YIPIPLDI-NNLS--AYSPF----------------LVPQD-------HLGNQGI----- -7== YL-----F-KNIS--SVGPW----------------DGPQY-------HIAPVWA----- +1== -------------YVP-----F-SNKTG----------LARSPYEYPQY-YLAEPWK--- +2== -------------YVP-----F-SNITG----------VVRSPFEQPQY-YLAEPWQ--- +3== -------------YVP-----M-SNKTG----------VVRSPFEYPQY-YLAEPWK--- +4=p -------------YVP-----M-SNRTG----------LVRSPFEYPQY-YLAEPWQ--- +5=p -------------YVP-----L-SNRTG----------LVRSPFEYPQY-YLAEPWQ--- +6== -------------YIPIP-LDI-NNLS------------AYSPFLVPQD-HLGNQGI--- +7== -------------YL------F-KNIS------------SVGPWDGPQY-HIAPVWA--- 8=opsin, ------------------------------------------------------------ -9== QSSI-FTY-TNSNS-TRGPF----------------EGPNY-------HIAPRWV----- -10== QSSI-FTY-TNSNS-TRGPF----------------EGPNY-------HIAPRWV----- -11== RGSV-FTY-TNTNN-TRGPF----------------EGPNY-------HIAPRWV----- -12== RDSV-FTY-TNSNN-TRGPF----------------EGPNY-------HIAPRWV----- -13== QAP--------PNG-TPGPF----------------DGPQW------PYQAPQST----- -14== -ETRLLGW--------NVPP----------------EELR--------HIPEHWLTYPEP -15== -ESRLLGW--------NVPP----------------DELR--------HIPEHWLIYPEP -16== GDLQFLGW--------NVPP----------------DQIQ--------YIPEHWLTQLEP -17== DELQFLGW--------NVPP----------------DQIQ--------YIPEHWLTQLEP -18== -----FGYPEGVSIVDFVRP----------------EIKP--------YVHQHWYNYPPV -19== -----FGYPEGMTVADFVPD----------------RVKH--------MVLDHWYNYPPV -20== ----------NGSVVDKVTP----------------DMAH--------LISPYWNQFPAM -21== ----------NGSVVDKVTP----------------DMAH--------LISPYWNQFPAM -22== ----------NGSVVDKVTP----------------DMAH--------LISPYWDQFPAM -23== ----------NGSVLDNVLP----------------DMAH--------LVNPYWSRFAPM -24== ----------NRSVLDNVLP----------------DMAP--------LVNPHWSRFAPM -25== ----RF---NNQTVVDKVPP----------------DMLH--------LIDANWYQYPPL -26== ----------NASVVDTMPK----------------EMLY--------MIHEHWYAFPPM -27== -----WWY--NPTVD----------------------------------IHPHWAKFDPI -28== -----WWY--NPSIV----------------------------------VHPHWREFDQV -29== WAPHLLS---EVTASPAPTW----------------DAPPDNASGCGEQIN--------Y -30== -MPHLLSGFLEVTASPAPTW----------------DAPPDNVSGCGEQIN--------Y -31=p -MPHLLSGFLEVTASPAPTW----------------DAPPDNVSGCGEQIN--------Y -32== ----------NTTGISDVTV---------------------------------------- -33=p ----------NVTSISDVTF---------------------------------------- -34== ----------LVTSDFNDSYGLTGQFINGSHSSRSRDNASANDTSATNMTDDRYWSLTVY -35=p ----------LVTSDFNDSYGLTGQFINGSHSSRSRDNASANDTSATNMTDDRYWSLTVY -36== ----------NVTVVPNTTW---------WQASAPFDTPAALVRAAAK------------ +9== -------------QSSI--FTY-TNSNS-----------TRGPFEGPNY-HIAPRWV--- +10== -------------QSSI--FTY-TNSNS-----------TRGPFEGPNY-HIAPRWV--- +11== -------------RGSV--FTY-TNTNN-----------TRGPFEGPNY-HIAPRWV--- +12== -------------RDSV--FTY-TNSNN-----------TRGPFEGPNY-HIAPRWV--- +13== -------------QAP---------PNG-----------TPGPFDGPQWPYQAPQST--- +14== -------------RLSA------ETRLL----------GWNVPPEELR--HIPEHWLTYP +15== -------------RLSA------ESRLL----------GWNVPPDELR--HIPEHWLIYP +16== -------------R-SSG---NGDLQFL----------GWNVPPDQIQ--YIPEHWLTQL +17== -------------RMSSG---SDELQFL----------GWNVPPDQIQ--YIPEHWLTQL +18== -------------AASMD-FGYPEGVSI----------VDFVRPEIKP--YVHQHWYNYP +19== -------------SGAAT-FGYPEGMTV----------ADFVPDRVKH--MVLDHWYNYP +20== ------------------------NGSV----------VDKVTPDMAH--LISPYWNQFP +21== ------------------------NGSV----------VDKVTPDMAH--LISPYWNQFP +22== ------------------------NGSV----------VDKVTPDMAH--LISPYWDQFP +23== -------------SG---------NGSV----------LDNVLPDMAH--LVNPYWSRFA +24== -------------GG---------NRSV----------LDNVLPDMAP--LVNPHWSRFA +25== -------------YGGQARF---NNQTV----------VDKVPPDMLH--LIDANWYQYP +26== -------------WPYQP------NASV----------VDTMPKEMLY--MIHEHWYAFP +27== -------------LVNQT-WWY--NPTV----------D------------IHPHWAKFD +28== -------------RDNET-WWY--NPSI----------V------------VHPHWREFD +29== AGSWAPHLLS---EVTAS-----PAPTW------------DAPPDNAS--GCGEQIN--- +30== ----MPHLLSGFLEVTAS-----PAPTW------------DAPPDNVS--GCGEQIN--- +31=p ----MPHLLSGFLEVTAS-----PAPTW------------DAPPDNVS--GCGEQIN--- +32== -------------NTTGI-----SDVTV-------------------------------- +33=p -------------NVTSI-----SDVTF-------------------------------- +34== -------------LVTSD-----FNDSYGLTGQFINGSHSSRSRDNAS--ANDTSATNMT +35=p -------------LVTSD-----FNDSYGLTGQFINGSHSSRSRDNAS--ANDTSATNMT +36== -------------NVTVV-----PNTTW---------WQASAPFDTPA--ALVRAAAK-- + + +1== --------------YSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMA +2== --------------FSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVA +3== --------------YRLVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVA +4=p --------------FKILALYLFFLMSMGLPINGLTLVVTAQHKKLRQPLNFILVNLAVA +5=p --------------FKLLAVYMFFLICLGLPINGLTLICTAQHKKLRQPLNFILVNLAVA +6== --------------FMAMSVFMFFIFIGGASINILTILCTIQFKKLRSHLNYILVNLSIA +7== --------------FYLQAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFG +8=opsin, ------------------------------------------------------------ +9== --------------YHLTSVWMIFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVA +10== --------------YHLTSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVA +11== --------------YNLVSFFMIIVVIASCFTNGLVLVATAKFKKLRHPLNWILVNLAFV +12== --------------YNLTSVWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVA +13== --------------YVGVAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVA +14== E--------PPESMNYLLGTLYIFFTLMSMLGNGLVIWVFSAAKSLRTPSNILVINLAFC +15== E--------PPESMNYLLGTLYIFFTVISMIGNGLVMWVFSAAKSLRTPSNILVINLAFC +16== E--------PPASMHYMLGVFYIFLFCASTVGNGMVIWIFSTSKSLRTPSNMFVLNLAVF +17== E--------PPASMHYMLGVFYIFLFFASTLGNGMVIWIFSTSKSLRTPSNMFVLNLAVF +18== P--------VNPMWHYLLGVIYLFLGTVSIFGNGLVIYLFNKSAALRTPANILVVNLALS +19== P--------VNPMWHYLLGVVYLFLGVISIAGNGLVIYLYMKSQALKTPANMLIVNLALS +20== A--------MDPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAIS +21== A--------MDPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAIS +22== A--------MDPIWAKILTAYMIIIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAIS +23== P--------MDPMMSKILGLFTLAIMIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFS +24== P--------MDPTMSKILGLFTLVILIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFS +25== P--------LNPMWHGILGFVIGMLGFVSAMGNGMVVYIFLSTKSLRTPSNLFVINLAIS +26== P--------MNPLWYSILGVAMIILGIICVLGNGMVIYLMMTTKSLRTPTNLLVVNLAFS +27== P--------IPDAVYYSVGIFIGVVGIIGILGNGVVIYLFSKTKSLQTPANMFIINLAMS +28== Q--------VPDAVYYSLGIFIGICGIIGCGGNGIVIYLFTKTKSLQTPANMFIINLAFS +29== ---------YGRVEKVVIGSILTLITLLTIAGNCLVVISVCFVKKLRQPSNYLIVSLALA +30== ---------YGRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALA +31=p ---------YGRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALA +32== ------------SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVT +33=p ------------SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVT +34== DDRYWSLTVYSHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVA +35=p DDRYWSLTVYSHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVA +36== ------------------AVVLGLLILATVVGNVFVIAAILLERHLRSAANNLILSLAVA -1== ----YSALAAYMFFLILVGFPVNFLTLFVTVQHKKLRTPLNYILLNLAMANLFMVLFG-F -2== ----FSMLAAYMFLLIVLGFPINFLTLYVTVQHKKLRTPLNYILLNLAVADLFMVFGG-F -3== ----YRLVCCYIFFLISTGLPINLLTLLVTFKHKKLRQPLNYILVNLAVADLFMACFG-F -4=p ----FKILALYLFFLMSMGLPINGLTLVVTAQHKKLRQPLNFILVNLAVAGTIMVCFG-F -5=p ----FKLLAVYMFFLICLGLPINGLTLICTAQHKKLRQPLNFILVNLAVAGAIMVCFG-F -6== ----FMAMSVFMFFIFIGGASINILTILCTIQFKKLRSHLNYILVNLSIANLFVAIFG-S -7== ----FYLQAAFMGTVFLIGFPLNAMVLVATLRYKKLRQPLNYILVNVSFGGFLLCIFS-V -8=opsin, --------------------------------------------------DLAETVIA-S -9== ----YHLTSVWMIFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIA-S -10== ----YHLTSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIA-S -11== ----YNLVSFFMIIVVIASCFTNGLVLVATAKFKKLRHPLNWILVNLAFVDLVETLVA-S -12== ----YNLTSVWMIFVVAASVFTNGLVLVATWKFKKLRHPLNWILVNLAVADLGETVIA-S -13== ----YVGVAVLMGTVVACASVVNGLVIVVSICYKKLRSPLNYILVNLAVADLLVTLCG-S -14== PESMNYLLGTLYIFFTLMSMLGNGLVIWVFSAAKSLRTPSNILVINLAFCDFMMMVK--T -15== PESMNYLLGTLYIFFTVISMIGNGLVMWVFSAAKSLRTPSNILVINLAFCDFMMMIK--T -16== PASMHYMLGVFYIFLFCASTVGNGMVIWIFSTSKSLRTPSNMFVLNLAVFDLIMCLK--A -17== PASMHYMLGVFYIFLFFASTLGNGMVIWIFSTSKSLRTPSNMFVLNLAVFDLIMCLK--A -18== NPMWHYLLGVIYLFLGTVSIFGNGLVIYLFNKSAALRTPANILVVNLALSDLIMLTTN-V -19== NPMWHYLLGVVYLFLGVISIAGNGLVIYLYMKSQALKTPANMLIVNLALSDLIMLTTN-F -20== DPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAISDFGIMITN-T -21== DPIWAKILTAYMIMIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAISDFGIMITN-T -22== DPIWAKILTAYMIIIGMISWCGNGVVIYIFATTKSLRTPANLLVINLAISDFGIMITN-T -23== DPMMSKILGLFTLAIMIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFSDFCMMASQ-S -24== DPTMSKILGLFTLVILIISCCGNGVVVYIFGGTKSLRTPANLLVLNLAFSDFCMMASQ-S -25== NPMWHGILGFVIGMLGFVSAMGNGMVVYIFLSTKSLRTPSNLFVINLAISNFLMMFCM-S -26== NPLWYSILGVAMIILGIICVLGNGMVIYLMMTTKSLRTPTNLLVVNLAFSDFCMMAFM-M -27== PDAVYYSVGIFIGVVGIIGILGNGVVIYLFSKTKSLQTPANMFIINLAMSDLSFSAINGF -28== PDAVYYSLGIFIGICGIIGCGGNGIVIYLFTKTKSLQTPANMFIINLAFSDFTFSLVNGF -29== GRVEKVVIGSILTLITLLTIAGNCLVVISVCFVKKLRQPSNYLIVSLALADLSVAVAV-M -30== GRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALADLSVAVAV-M -31=p GRVEKVVIGSILTLITLLTIAGNCLVVISVSFVKKLRQPSNYLIVSLALADLSVAVAV-M -32== --SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVTDLMVSVLV-L -33=p --SYQVITSLLLGTLIFCAVLGNACVVAAIALERSLQNVANYLIGSLAVTDLMVSVLV-L -34== SHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVADLMVAVLV-M -35=p SHEHLVLTSVILGLFVLCCIIGNCFVIAAVMLERSLHNVANYLILSLAVADLMVAVLV-M -36== --------AVVLGLLILATVVGNVFVIAAILLERHLRSAANNLILSLAVADLLVACLV-M - . - -1== TVTMYTSMN-GYFV--FGPTMCSIEGFFATLGGEVALWSLVVLAIERYIVICKPMGN-FR -2== TTTLYTSLH-GYFV--FGPTGCNLEGFFATLGGEIGLWSLVVLAIERYVVVCKPMSN-FR -3== TVTFYTAWN-GYFV--FGPVGCAVEGFFATLGGQVALWSLVVLAIERYIVVCKPMGN-FR -4=p TVTFYTAIN-GYFV--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIVVCKPMGS-FK -5=p TVTFYTAIN-GYFA--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIVVCKPMGS-FK -6== PLSFYSFFN-RYFI--FGATACKIEGFLATLGGMVGLWSLAVVAFERWLVICKPLGN-FT -7== FPVFVASCN-GYFV--FGRHVCALEGFLGTVAGLVTGWSLAFLAFERYIVICKPFGN-FR -8=opsin, TISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLVVCKPFGN-VR -9== TISVVNQVY-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWMVVCKPFGN-VR -10== TISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLVVCKPFGN-VR -11== TISVFNQIF-GYFI--LGHPLCVIEGYVVSSCGITGLWSLAIISWERWFVVCKPFGN-IK -12== TISVINQIS-GYFI--LGHPMCVVEGYTVSACGITALWSLAIISWERWFVVCKPFGN-IK -13== SVSLSNNIN-GFFV--FGRRMCELEGFMVSLTGIVGLWSLAILALERYVVVCKPLGD-FQ -14== PIFIYNSFH-QGYA--LGHLGCQIFGIIGSYTGIAAGATNAFIAYDRFNVITRPMEG--K -15== PIFIYNSFH-QGYA--LGHLGCQIFGVIGSYTGIAAGATNAFIAYDRYNVITRPMEG--K -16== PIF--NSFH-RGFAIYLGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPMNR--N -17== PIFIYNSFH-RGFA--LGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNVITKPMNR--N -18== PFFTYNCFSGGVWM--FSPQYCEIYACLGAITGVCSIWLLCMISFDRYNIICNGFNG-PK -19== PPFCYNCFSGGRWM--FSGTYCEIYAALGAITGVCSIWTLCMISFDRYNIICNGFNG-PK -20== PMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAG-RP -21== PMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAG-RP -22== PMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQVIVKGMAG-RP -23== PVMIINFYY-ETWV--LGPLWCDIYAGCGSLFGCVSIWSMCMIAFDRYNVIVKGING-TP -24== PVMIINFYY-ETWV--LGPLWCDIYAACGSLFGCVSIWSMCMIAFDRYNVIVKGING-TP -25== PPMVINCYY-ETWV--LGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNVIVKGLSG-KP -26== PTMTSNCFA-ETWI--LGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNVIVRGMAA-AP -27== PLKTISAFM-KKWI--FGKVACQLYGLLGGIFGFMSINTMAMISIDRYNVIGRPMAASKK -28== PLMTISCFL-KKWI--FGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNVIGRPMAASKK -29== PFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLGITRPLTYPVR -30== PFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLGITRPLTYPVR -31=p PFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLGITRPLTYPVR -32== PMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWAITDPIDYVNK -33=p PMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWAITDPIDYVNK -34== PLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWAVTS-IDYIRR -35=p PLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWAVTS-IDYIRR -36== PLGAVYEVV-QRWT--LGPELCDMWTSGDVLCCTASILHLVAIALDRYWAVTN-IDYIHA - : : * : : :*: : : +1== NLFMVLFG-FTVTMYTSMN-GYFV--FGPTMCSIEGFFATLGGEVALWSLVVLAIERYIV +2== DLFMVFGG-FTTTLYTSLH-GYFV--FGPTGCNLEGFFATLGGEIGLWSLVVLAIERYVV +3== DLFMACFG-FTVTFYTAWN-GYFV--FGPVGCAVEGFFATLGGQVALWSLVVLAIERYIV +4=p GTIMVCFG-FTVTFYTAIN-GYFV--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIV +5=p GAIMVCFG-FTVTFYTAIN-GYFA--LGPTGCAVEGFMATLGGEVALWSLVVLAIERYIV +6== NLFVAIFG-SPLSFYSFFN-RYFI--FGATACKIEGFLATLGGMVGLWSLAVVAFERWLV +7== GFLLCIFS-VFPVFVASCN-GYFV--FGRHVCALEGFLGTVAGLVTGWSLAFLAFERYIV +8=opsin, DLAETVIA-STISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLV +9== DLAETVIA-STISVVNQVY-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWMV +10== DLAETVIA-STISIVNQVS-GYFV--LGHPMCVLEGYTVSLCGITGLWSLAIISWERWLV +11== DLVETLVA-STISVFNQIF-GYFI--LGHPLCVIEGYVVSSCGITGLWSLAIISWERWFV +12== DLGETVIA-STISVINQIS-GYFI--LGHPMCVVEGYTVSACGITALWSLAIISWERWFV +13== DLLVTLCG-SSVSLSNNIN-GFFV--FGRRMCELEGFMVSLTGIVGLWSLAILALERYVV +14== DFMMMVK--TPIFIYNSFH-QGYA--LGHLGCQIFGIIGSYTGIAAGATNAFIAYDRFNV +15== DFMMMIK--TPIFIYNSFH-QGYA--LGHLGCQIFGVIGSYTGIAAGATNAFIAYDRYNV +16== DLIMCLK--APIF--NSFH-RGFAIYLGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNV +17== DLIMCLK--APIFIYNSFH-RGFA--LGNTWCQIFASIGSYSGIGAGMTNAAIGYDRYNV +18== DLIMLTTN-VPFFTYNCFSGGVWM--FSPQYCEIYACLGAITGVCSIWLLCMISFDRYNI +19== DLIMLTTN-FPPFCYNCFSGGRWM--FSGTYCEIYAALGAITGVCSIWTLCMISFDRYNI +20== DFGIMITN-TPMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQV +21== DFGIMITN-TPMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQV +22== DFGIMITN-TPMMGINLYF-ETWV--LGPMMCDIYAGLGSAFGCSSIWSMCMISLDRYQV +23== DFCMMASQ-SPVMIINFYY-ETWV--LGPLWCDIYAGCGSLFGCVSIWSMCMIAFDRYNV +24== DFCMMASQ-SPVMIINFYY-ETWV--LGPLWCDIYAACGSLFGCVSIWSMCMIAFDRYNV +25== NFLMMFCM-SPPMVINCYY-ETWV--LGPLFCQIYAMLGSLFGCGSIWTMTMIAFDRYNV +26== DFCMMAFM-MPTMTSNCFA-ETWI--LGPFMCEVYGMAGSLFGCASIWSMVMITLDRYNV +27== DLSFSAINGFPLKTISAFM-KKWI--FGKVACQLYGLLGGIFGFMSINTMAMISIDRYNV +28== DFTFSLVNGFPLMTISCFL-KKWI--FGFAACKVYGFIGGIFGFMSIMTMAMISIDRYNV +29== DLSVAVAV-MPFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLG +30== DLSVAVAV-MPFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLG +31=p DLSVAVAV-MPFVSVTDLIGGKWI--FGHFFCNVFIAMDVMCCTASIMTLCVISIDRYLG +32== DLMVSVLV-LPMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWA +33=p DLMVSVLV-LPMAALYQVL-NKWT--LGQVTCDLFIALDVLCCTSSILHLCAIALDRYWA +34== DLMVAVLV-MPLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWA +35=p DLMVAVLV-MPLSVVSEIS-KVWF--LHSEVCDMWISVDVLCCTASILHLVAIAMDRYWA +36== DLLVACLV-MPLGAVYEVV-QRWT--LGPELCDMWTSGDVLCCTASILHLVAIALDRYWA + . : : * : : :*: -1== FGNTHAIMGVAFTWIMALAC-AAPPLVG-W-----SRYIPEGMQCSCGPDYYTLNPNFNN -2== FGENHAIMGVAFTWVMALAC-AAPPLVG-W-----SRYIPEGMQCSCGIDYYTLKPEVNN -3== FSATHAMMGIAFTWVMAFSC-AAPPLFG-W-----SRYMPEGMQCSCGPDYYTHNPDYHN -4=p FSSSHAFAGIAFTWVMALAC-AAPPLFG-W-----SRYIPEGMQCSCGPDYYTLNPDYNN -5=p FSSTHASAGIAFTWVMAMAC-AAPPLVG-W-----SRYIPEGIQCSCGPDYYTLNPEYNN -6== FKTPHAIAGCILPWISALAA-SLPPLFG-W-----SRYIPEGLQCSCGPDWYTTNNKYNN -7== FSSKHALTVVLATWTIGIGV-SIPPFFG-W-----SRFIPEGLQCSCGPDWYTVGTKYRS -8=opsin, FDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSYPGV -9== FDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSYPGV -10== FDAKLAIVGIAFSWIWSAVW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSYPGV -11== FDSKLAIIGIVFSWVWAWGW-SAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSVELGC -12== FDGKLAVAGILFSWLWSCAW-TAPPIFG-W-----SRYWPHGLKTSCGPDVFSGSSDPGV -13== FQRRHAVSGCAFTWGWALLW-SAPPLLG-W-----SSYVPEGLRTSCGPNWYTGGSNN-- -14== MTHGKAIAMIIFIYMYATPW-VVACYTETW-----GRFVPEGYLTSCTFDYLT--DNFDT -15== MTHGKAIAMIIFIYLYATPW-VVACYTESW-----GRFVPEGYLTSCTFDYLT--DNFDT -16== MTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFDYLS--DNFDT -17== MTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFDYLS--DNFDT -18== LTTGKAVVFALISWVIAIGC-ALPPFFG-W-----GNYILEGILDSCSYDYLT--QDFNT -19== LTQGKATFMCGLAWVISVGW-SLPPFFG-W-----GSYTLEGILDSCSYDYFT--RDMNT -20== MTIPLALGKM---------------------------YVPEGNLTSCGIDYLE--RDWNP -21== MTIPLALGKIAYIWFMSSIW-CLAPAFG-W-----SRYVPEGNLTSCGIDYLE--RDWNP -22== MTIPLALGKIAYIWFMSTIWCCLAPVFG-W-----SRYVPEGNLTSCGIDYLE--RDWNP -23== MTIKTSIMKILFIWMMAVFW-TVMPLIG-W-----SAYVPEGNLTACSIDYMT--RMWNP -24== MTIKTSIMKIAFIWMMAVFW-TIMPLIG-W-----SSYVPEGNLTACSIDYMT--RQWNP -25== LSINGALIRIIAIWLFSLGW-TIAPMFG-W-----NRYVPEGNMTACGTDYFN--RGLLS -26== LTHKKATLLLLFVWIWSGGW-TILPFFG-W-----SRYVPEGNLTSCTVDYLT--KDWSS -27== MSHRRAFLMIIFVWMWSIVW-SVGPVFN-W-----GAYVPEGILTSCSFDYLS--TDPST -28== MSHRRAFIMIIFVWLWSVLW-AIGPIFG-W-----GAYTLEGVLCNCSFDYIS--RDSTT -29== QNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQDF--------- -30== QNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQDF--------- -31=p QNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQDF--------- -32== RTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKDH--------- -33=p RTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKDH--------- -34== RSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQDK--------- -35=p RSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQDK--------- -36== STAKRVGMMIACVWTVSFFV-CIAQLLG-WKDPDWNQRVSEDLRCVVSQDV--------- - : +1== ICKPMGN-FRFGNTHAIMGVAFTWIMALAC-AAPPLVG-W-----SRYIPEGMQCSCGPD +2== VCKPMSN-FRFGENHAIMGVAFTWVMALAC-AAPPLVG-W-----SRYIPEGMQCSCGID +3== VCKPMGN-FRFSATHAMMGIAFTWVMAFSC-AAPPLFG-W-----SRYMPEGMQCSCGPD +4=p VCKPMGS-FKFSSSHAFAGIAFTWVMALAC-AAPPLFG-W-----SRYIPEGMQCSCGPD +5=p VCKPMGS-FKFSSTHASAGIAFTWVMAMAC-AAPPLVG-W-----SRYIPEGIQCSCGPD +6== ICKPLGN-FTFKTPHAIAGCILPWISALAA-SLPPLFG-W-----SRYIPEGLQCSCGPD +7== ICKPFGN-FRFSSKHALTVVLATWTIGIGV-SIPPFFG-W-----SRFIPEGLQCSCGPD +8=opsin, VCKPFGN-VRFDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPD +9== VCKPFGN-VRFDAKLAIVGIAFSWIWAAVW-TAPPIFG-W-----SRYWPHGLKTSCGPD +10== VCKPFGN-VRFDAKLAIVGIAFSWIWSAVW-TAPPIFG-W-----SRYWPHGLKTSCGPD +11== VCKPFGN-IKFDSKLAIIGIVFSWVWAWGW-SAPPIFG-W-----SRYWPHGLKTSCGPD +12== VCKPFGN-IKFDGKLAVAGILFSWLWSCAW-TAPPIFG-W-----SRYWPHGLKTSCGPD +13== VCKPLGD-FQFQRRHAVSGCAFTWGWALLW-SAPPLLG-W-----SSYVPEGLRTSCGPN +14== ITRPMEG--KMTHGKAIAMIIFIYMYATPW-VVACYTETW-----GRFVPEGYLTSCTFD +15== ITRPMEG--KMTHGKAIAMIIFIYLYATPW-VVACYTESW-----GRFVPEGYLTSCTFD +16== ITKPMNR--NMTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFD +17== ITKPMNR--NMTFTKAVIMNIIIWLYCTPW-VVLPLTQFW-----DRFVPEGYLTSCSFD +18== ICNGFNG-PKLTTGKAVVFALISWVIAIGC-ALPPFFG-W-----GNYILEGILDSCSYD +19== ICNGFNG-PKLTQGKATFMCGLAWVISVGW-SLPPFFG-W-----GSYTLEGILDSCSYD +20== IVKGMAG-RPMTIPLALGKM---------------------------YVPEGNLTSCGID +21== IVKGMAG-RPMTIPLALGKIAYIWFMSSIW-CLAPAFG-W-----SRYVPEGNLTSCGID +22== IVKGMAG-RPMTIPLALGKIAYIWFMSTIWCCLAPVFG-W-----SRYVPEGNLTSCGID +23== IVKGING-TPMTIKTSIMKILFIWMMAVFW-TVMPLIG-W-----SAYVPEGNLTACSID +24== IVKGING-TPMTIKTSIMKIAFIWMMAVFW-TIMPLIG-W-----SSYVPEGNLTACSID +25== IVKGLSG-KPLSINGALIRIIAIWLFSLGW-TIAPMFG-W-----NRYVPEGNMTACGTD +26== IVRGMAA-APLTHKKATLLLLFVWIWSGGW-TILPFFG-W-----SRYVPEGNLTSCTVD +27== IGRPMAASKKMSHRRAFLMIIFVWMWSIVW-SVGPVFN-W-----GAYVPEGILTSCSFD +28== IGRPMAASKKMSHRRAFIMIIFVWLWSVLW-AIGPIFG-W-----GAYTLEGVLCNCSFD +29== ITRPLTYPVRQNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQD +30== ITRPLTYPVRQNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQD +31=p ITRPLTYPVRQNGKCMAKMILSVWLLSASI-TLPPLFG-W-----AQNVNDDKVCLISQD +32== ITDPIDYVNKRTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKD +33=p ITDPIDYVNKRTPRRAAALISLTWLIGFLI-SIPPMLG-WRTPEDRSDPDA---CTISKD +34== VTS-IDYIRRRSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQD +35=p VTS-IDYIRRRSARRILLMIMVVWIVALFI-SIPPLFG-WRDP--NNDPDKTGTCIISQD +36== VTN-IDYIHASTAKRVGMMIACVWTVSFFV-CIAQLLG-WKDPDWNQRVSEDLRCVVSQD + : : : -1== ESYVVYMFVVHFLVPFVIIFFCYGRLLCTV----KE------------------------ -2== ESFVIYMFVVHFTIPMIVIFFCYGQLVFTV----KE------------------------ -3== ESYVLYMFVIHFIIPVVVIFFSYGRLICKV----RE------------------------ -4=p ESYVIYMFVCHFILPVAVIFFTYGRLVCTV----KA------------------------ -5=p ESYVLYMFICHFILPVTIIFFTYGRLVCTV----KA------------------------ -6== ESYVMFLFCFCFAVPFGTIVFCYGQLLITL----KL------------------------ -7== ESYTWFLFIFCFIVPLSLICFSYTQLLRAL----KA------------------------ -8=opsin, QSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA------------------------ -9== QSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA------------------------ -10== QSYMIVLMVTCCIIPLAIIMLCYLQVWLAI----RA------------------------ -11== QSFMLTLMITCCFLPLFIIIVCYLQVWMAI----RA------------------------ -12== QSYMVVLMVTCCFFPLAIIILCYLQVWLAI----RA------------------------ -13== NSYILSLFVTCFVLPLSLILFSYTNLLLTL----RA------------------------ -14== RLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA------------------------ -15== RLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA------------------------ -16== RLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKA------------------------ -17== RLFVGTIFLFSFVVPTLMILYYYSQIVGHVFNHEKA------------------------ -18== FSYNIFIFVFDYFLPAAIIVFSYVFIVKAIFAHEAA------------------------ -19== ITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAA------------------------ -20== RSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA------------------------ -21== RSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA------------------------ -22== RSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA------------------------ -23== RSYLITYSLFVYYTPLFLICYSYWFIIAAVAAHEKA------------------------ -24== RSYLITYSLFVYYTPLFMICYSYWFIIATVAAHEKA------------------------ -25== ASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKN------------------------ -26== ASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQ------------------------ -27== RSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKE------------------------ -28== RSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKE------------------------ -29== -GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF--------------------- -30== -GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF--------------------- -31=p -GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF--------------------- -32== -GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK------------------------ -33=p -GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK------------------------ -34== -GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEETTLVASPKTE -35=p -GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEETTLVASPKTE -36== -GYQIFATASSFYVPVLIILILYWRIYQTARKRIR------------------------- - * : * : +1== YYTLNPNFNNESYVVYMFVVHFLVPFVIIFFCYGRLLCTV----KE-------------- +2== YYTLKPEVNNESFVIYMFVVHFTIPMIVIFFCYGQLVFTV----KE-------------- +3== YYTHNPDYHNESYVLYMFVIHFIIPVVVIFFSYGRLICKV----RE-------------- +4=p YYTLNPDYNNESYVIYMFVCHFILPVAVIFFTYGRLVCTV----KA-------------- +5=p YYTLNPEYNNESYVLYMFICHFILPVTIIFFTYGRLVCTV----KA-------------- +6== WYTTNNKYNNESYVMFLFCFCFAVPFGTIVFCYGQLLITL----KL-------------- +7== WYTVGTKYRSESYTWFLFIFCFIVPLSLICFSYTQLLRAL----KA-------------- +8=opsin, VFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA-------------- +9== VFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYLQVWLAI----RA-------------- +10== VFSGSSYPGVQSYMIVLMVTCCIIPLAIIMLCYLQVWLAI----RA-------------- +11== VFSGSVELGCQSFMLTLMITCCFLPLFIIIVCYLQVWMAI----RA-------------- +12== VFSGSSDPGVQSYMVVLMVTCCFFPLAIIILCYLQVWLAI----RA-------------- +13== WYTGGSNN--NSYILSLFVTCFVLPLSLILFSYTNLLLTL----RA-------------- +14== YLT--DNFDTRLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA-------------- +15== YLT--DNFDTRLFVACIFFFSFVCPTTMITYYYSQIVGHVFSHEKA-------------- +16== YLS--DNFDTRLFVGTIFFFSFVCPTLMILYYYSQIVGHVFSHEKA-------------- +17== YLS--DNFDTRLFVGTIFLFSFVVPTLMILYYYSQIVGHVFNHEKA-------------- +18== YLT--QDFNTFSYNIFIFVFDYFLPAAIIVFSYVFIVKAIFAHEAA-------------- +19== YFT--RDMNTITYNICIFIFDFFLPASVIVFSYVFIVKAIFAHEAA-------------- +20== YLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA-------------- +21== YLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA-------------- +22== YLE--RDWNPRSYLIFYSIFVYYIPLFLICYSYWFIIAAVSAHEKA-------------- +23== YMT--RMWNPRSYLITYSLFVYYTPLFLICYSYWFIIAAVAAHEKA-------------- +24== YMT--RQWNPRSYLITYSLFVYYTPLFMICYSYWFIIATVAAHEKA-------------- +25== YFN--RGLLSASYLVCYGIWVYFVPLFLIIYSYWFIIQAVAAHEKN-------------- +26== YLT--KDWSSASYVVIYGLAVYFLPLITMIYCYFFIVHAVAEHEKQ-------------- +27== YLS--TDPSTRSFILCMYFCGFMLPIIIIAFCYFNIVMSVSNHEKE-------------- +28== YIS--RDSTTRSNILCMFILGFFGPILIIFFCYFNIVMSVSNHEKE-------------- +29== F----------GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF----------- +30== F----------GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF----------- +31=p F----------GYTIYSTAVAFYIPMSVMLFMYYQIYKAARKSAAKHKF----------- +32== H----------GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK-------------- +33=p H----------GYTIYSTFGAFYIPLLLMLVLYGRIFRAARFRIRK-------------- +34== K----------GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEE +35=p K----------GYTIFSTVGAFYLPMLVMMIIYIRIWLVARSRIRKDKFQMTKARLKTEE +36== V----------GYQIFATASSFYVPVLIILILYWRIYQTARKRIR--------------- + * : * : -1== ---------------------------------------------------AAAAQQ--- -2== ---------------------------------------------------AAAQQQ--- -3== ---------------------------------------------------AAAQQQ--- -4=p ---------------------------------------------------AAAQQQ--- -5=p ---------------------------------------------------AAAQQQ--- -6== ---------------------------------------------------AAKAQA--- -7== ---------------------------------------------------VAAQQQ--- -8=opsin, ---------------------------------------------------VAKQQK--- -9== ---------------------------------------------------VAKQQK--- -10== ---------------------------------------------------VAKQQK--- -11== ---------------------------------------------------VAAQQK--- -12== ---------------------------------------------------VAAQQK--- -13== ---------------------------------------------------AAAQQK--- -14== ---------------------------------------------------LRDQAKKM- -15== ---------------------------------------------------LRDQAKKM- -16== ---------------------------------------------------LREQAKKM- -17== ---------------------------------------------------LREQAKKM- -18== ---------------------------------------------------MRAQAKKM- -19== ---------------------------------------------------MRAQAKKM- -20== ---------------------------------------------------MREQAKKM- -21== ---------------------------------------------------MREQAKKM- -22== ---------------------------------------------------MREQAKKM- -23== ---------------------------------------------------MREQAKKM- -24== ---------------------------------------------------MRDQAKKM- -25== ---------------------------------------------------MREQAKKM- -26== ---------------------------------------------------LREQAKKM- -27== ---------------------------------------------------MAAMAKRL- -28== ---------------------------------------------------MAAMAKRL- -29== ----------------------------------P--------GFPR----VEPDS---- -30== ----------------------------------P--------GFPR----VQPES---- -31=p ----------------------------------P--------GFPR----VQPES---- -32== ---------------TVKKVEKTGADTRHGASPAPQPKKS-----------VNGESGSR- -33=p ---------------TVRKVEKKGAGTSLGTSSAPPPKKS-----------LNGQPGSG- -34== YSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENANGVNSNSSS-- -35=p YSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENANGVNSNSSS-- -36== --------------------RRRGATARGGVGPPP---------VPAGGALVAGGGSGGI +1== ------------------------------------------------------AAAAQQ +2== ------------------------------------------------------AAAQQQ +3== ------------------------------------------------------AAAQQQ +4=p ------------------------------------------------------AAAQQQ +5=p ------------------------------------------------------AAAQQQ +6== ------------------------------------------------------AAKAQA +7== ------------------------------------------------------VAAQQQ +8=opsin, ------------------------------------------------------VAKQQK +9== ------------------------------------------------------VAKQQK +10== ------------------------------------------------------VAKQQK +11== ------------------------------------------------------VAAQQK +12== ------------------------------------------------------VAAQQK +13== ------------------------------------------------------AAAQQK +14== ------------------------------------------------------LRDQAK +15== ------------------------------------------------------LRDQAK +16== ------------------------------------------------------LREQAK +17== ------------------------------------------------------LREQAK +18== ------------------------------------------------------MRAQAK +19== ------------------------------------------------------MRAQAK +20== ------------------------------------------------------MREQAK +21== ------------------------------------------------------MREQAK +22== ------------------------------------------------------MREQAK +23== ------------------------------------------------------MREQAK +24== ------------------------------------------------------MRDQAK +25== ------------------------------------------------------MREQAK +26== ------------------------------------------------------LREQAK +27== ------------------------------------------------------MAAMAK +28== ------------------------------------------------------MAAMAK +29== --------------------------------------------P----GFPRVEPDSVI +30== --------------------------------------------P----GFPRVQPESVI +31=p --------------------------------------------P----GFPRVQPESVI +32== -------------------------TVKKVEKTGADTRHGASPAP---------QPKKS- +33=p -------------------------TVRKVEKKGAGTSLGTSSAP---------PPKKS- +34== TTLVASPKTEYSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENAN +35=p TTLVASPKTEYSVVSDCNGCNSPDSTTEKKKRRAPFKSYGCSPRPERKKNRAKKLPENAN +36== ------------------------------RRRGATARGGVGPPP---------VPAGGA 1== ------------------------------------------------------------ @@ -280,158 +280,158 @@ 11== ------------------------------------------------------------ 12== ------------------------------------------------------------ 13== ------------------------------------------------------------ -14== --------------------------------NVESL----------------------- -15== --------------------------------NVDSL----------------------- -16== --------------------------------NVESL----------------------- -17== --------------------------------NVESL----------------------- -18== --------------------------------NVSTL----------------------- -19== --------------------------------NVTNL----------------------- -20== --------------------------------NVKSL----------------------- -21== --------------------------------NVKSL----------------------- -22== --------------------------------NVKSL----------------------- -23== --------------------------------NVKSL----------------------- -24== --------------------------------NVKSL----------------------- -25== --------------------------------NVASL----------------------- -26== --------------------------------NVASL----------------------- -27== --------------------------------NAKEL----------------------- -28== --------------------------------NAKEL----------------------- -29== ---VIAL-----------------NGIVKLQ--------KEVEECAN------------- -30== ---VISL-----------------NGVVKLQ--------KEVEECAN------------- -31=p ---VISL-----------------NGVVKLQ--------KEVEECAN------------- -32== --------NWRLGVESKAGGALCANGAVRQGDDGAALEVIEVHRVGNSKEHLPLPSEAG- -33=p --------DWRRCAENRAVGTPCTNGAVRQGDDEATLEVIEVHRVGNSKEHLPLPSESG- -34== --------SERLKQIQIETAEAFANGCA----EEASIAMLERQ-CNNGKKISSNDTPYS- -35=p --------SERLKQIQIETAEAFANGCA----EEASIAMLERQ-CNNGKKISSNDTPYS- -36== AAAVVAVIGRPLPTISETTTTGFTNVSS----NNTS---PEKQSCANGLEADPPTTGYGA +14== KMN---------------VESLRS------------------------------------ +15== KMN---------------VDSLRS------------------------------------ +16== KMN---------------VESLRS------------------------------------ +17== KMN---------------VESLRS------------------------------------ +18== KMN---------------VSTLRS------------------------------------ +19== KMN---------------VTNLRS------------------------------------ +20== KMN---------------VKSLRS------------------------------------ +21== KMN---------------VKSLRS------------------------------------ +22== KMN---------------VKSLRS------------------------------------ +23== KMN---------------VKSLRS------------------------------------ +24== KMN---------------VKSLRS------------------------------------ +25== KMN---------------VASLRS------------------------------------ +26== KMN---------------VASLRA------------------------------------ +27== RLN---------------AKELR------------------------------------- +28== RLN---------------AKELR------------------------------------- +29== ALNG--------------IVKLQ----------------------KEVEECANLSR---- +30== SLNG--------------VVKLQ----------------------KEVEECANLSR---- +31=p SLNG--------------VVKLQ----------------------KEVEECANLSR---- +32== -VNGESGSRNWRL-----GVESKAGGALCANGAVRQGDDGAALEVIEVHRVGNSKEHLPL +33=p -LNGQPGSGDWRR-----CAENRAVGTPCTNGAVRQGDDEATLEVIEVHRVGNSKEHLPL +34== GVNSNSSS----------SERLKQIQIETAEAFANGCAEEASIAMLERQ-CNNGKKISSN +35=p GVNSNSSS----------SERLKQIQIETAEAFANGCAEEASIAMLERQ-CNNGKKISSN +36== LVAGGGSGGIAAAVVAVIGRPLPTISETTTTGFTNVSSNNTS---PEKQSCANGLEADPP -1== --------------ESASTQK------AEKEVTRMVVLMVIGFLVCWVPYASVAFYIFT- -2== --------------ESATTQK------AEKEVTRMVIIMVIFFLICWLPYASVAMYIFT- -3== --------------ESATTQK------AEKEVTRMVILMVLGFMLAWTPYAVVAFWIFT- -4=p --------------DSASTQK------AEREVTKMVILMVFGFLIAWTPYATVAAWIFF- -5=p --------------DSASTQK------AEREVTKMVILMVLGFLVAWTPYATVAAWIFF- -6== --------------DSASTQK------AEREVTKMVVVMVLGFLVCWAPYASFSLWIVS- -7== --------------ESATTQK------AEREVSRMVVVMVGSFCVCYVPYAAFAMYMVN- -8=opsin, --------------ESESTQK------AEKEVTRMVVVMVLAFC---------------- -9== --------------ESESTQK------AEKEVTRMVVVMVLAFCFCWGPYAFFACFAAA- -10== --------------ESESTQK------AEKEVTRMVVVMIFAYCVCWGPYTFFACFAAA- -11== --------------ESESTQK------AEREVSRMVVVMIVAFCICWGPYASFVSFAAA- -12== --------------ESESTQK------AEKEVSRMVVVMIVAYCFCWGPYTFFACFAAA- -13== --------------EADTTQR------AEREVTRMVIVMVMAFLLCWLPYSTFALVVAT- -14== -----------RSNVDKNKET------AEIRIAKAAITICFLFFCSWTPYGVMSLIGAF- -15== -----------RSNVDKSKEA------AEIRIAKAAITICFLFFASWTPYGVMSLIGAF- -16== -----------RSNVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMSLIGAF- -17== -----------RSNVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMSLIGAF- -18== -----------RS-NEADAQR------AEIRIAKTALVNVSLWFICWTPYALISLKGVM- -19== -----------RS-NEAETQR------AEIRIAKTALVNVSLWFICWTPYAAITIQGLL- -20== -----------RS-SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVINCMGLF- -21== -----------RS-SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVINCMGLF- -22== -----------RS-SEDADKS------AEGKLAKVALVTISLWFMAWTPYLVINCMGLF- -23== -----------RS-SEDCDKS------AEGKLAKVALTTISLWFMAWTPYLVICYFGLF- -24== -----------RS-SEDCDKS------AENKLAKVALTTISLWFMAWTPYLIICYFGLF- -25== -----------RS-SENQNTS------AECKLAKVALMTISLWFMAWTPYLVINFSGIF- -26== -----------RANADQQKQS------AECRLAKVAMMTVGLWFMAWTPYLIISWAGVF- -27== -----------R--KAQAGAS------AEMKLAKISMVIITQFMLSWSPYAIIALLAQF- -28== -----------R--KAQAGAN------AEMRLAKISIVIVSQFLLSWSPYAVVALLAQF- -29== -----LSRLLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLSTARPFI -30== -----LSRLLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLSTARPFI -31=p -----LSRLLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLSTARPFI -32== -PTPCAPASFERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVALVLPF- -33=p -SNSYAPACLERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVALVLPF- -34== ------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIALIGPF- -35=p ------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIALIGPF- -36== VAAAYYPSLVRRKPKEAADSK------RERKAAKTLAIITGAFVACWLPFFVLAILVPT- - * . : +1== --------------------ESASTQK------AEKEVTRMVVLMVIGFLVCWVPYASVA +2== --------------------ESATTQK------AEKEVTRMVIIMVIFFLICWLPYASVA +3== --------------------ESATTQK------AEKEVTRMVILMVLGFMLAWTPYAVVA +4=p --------------------DSASTQK------AEREVTKMVILMVFGFLIAWTPYATVA +5=p --------------------DSASTQK------AEREVTKMVILMVLGFLVAWTPYATVA +6== --------------------DSASTQK------AEREVTKMVVVMVLGFLVCWAPYASFS +7== --------------------ESATTQK------AEREVSRMVVVMVGSFCVCYVPYAAFA +8=opsin, --------------------ESESTQK------AEKEVTRMVVVMVLAFC---------- +9== --------------------ESESTQK------AEKEVTRMVVVMVLAFCFCWGPYAFFA +10== --------------------ESESTQK------AEKEVTRMVVVMIFAYCVCWGPYTFFA +11== --------------------ESESTQK------AEREVSRMVVVMIVAFCICWGPYASFV +12== --------------------ESESTQK------AEKEVSRMVVVMIVAYCFCWGPYTFFA +13== --------------------EADTTQR------AEREVTRMVIVMVMAFLLCWLPYSTFA +14== -------------------NVDKNKET------AEIRIAKAAITICFLFFCSWTPYGVMS +15== -------------------NVDKSKEA------AEIRIAKAAITICFLFFASWTPYGVMS +16== -------------------NVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMS +17== -------------------NVDKSKET------AEIRIAKAAITICFLFFVSWTPYGVMS +18== --------------------NEADAQR------AEIRIAKTALVNVSLWFICWTPYALIS +19== --------------------NEAETQR------AEIRIAKTALVNVSLWFICWTPYAAIT +20== --------------------SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVIN +21== --------------------SEDAEKS------AEGKLAKVALVTITLWFMAWTPYLVIN +22== --------------------SEDADKS------AEGKLAKVALVTISLWFMAWTPYLVIN +23== --------------------SEDCDKS------AEGKLAKVALTTISLWFMAWTPYLVIC +24== --------------------SEDCDKS------AENKLAKVALTTISLWFMAWTPYLIIC +25== --------------------SENQNTS------AECKLAKVALMTISLWFMAWTPYLVIN +26== -------------------NADQQKQS------AECRLAKVAMMTVGLWFMAWTPYLIIS +27== --------------------KAQAGAS------AEMKLAKISMVIITQFMLSWSPYAIIA +28== --------------------KAQAGAN------AEMRLAKISIVIVSQFLLSWSPYAVVA +29== --------------LLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLS +30== --------------LLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLS +31=p --------------LLKHERKNISIFK------REQKAATTLGIIVGAFTVCWLPFFLLS +32== PSEAG--PTPCAPASFERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVA +33=p PSESG--SNSYAPACLERKNERNAEAKRKMALARERKTVKTLGIIMGTFILCWLPFFIVA +34== DTPYS-------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIA +35=p DTPYS-------------RTREKLELK------RERKAARTLAIITGAFLICWLPFFIIA +36== TTGYGAVAAAYYPSLVRRKPKEAADSK------RERKAAKTLAIITGAFVACWLPFFVLA + * . : -1== HQGS--DFGATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTLCC---------GKN -2== HQGS--NFGPIFMTLPAFFAKTASIYNPIIYIMMNKQFRNCMLTSLCC---------GKN -3== NKGA--DFTATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTICC---------GKN -4=p NKGA--DFSAKFMAIPAFFSKSSALYNPVIYVLLNKQFRNCMLTTIFC---------GKN -5=p NKGA--AFSAQFMAIPAFFSKTSALYNPVIYVLLNKQFRSCMLTTLFC---------GKN -6== HRGE--EFDLRMATIPSCLSKASTVYNPVIYVLMNKQFRSCMM-KMVC---------GKN -7== NRNH--GLDLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIM-KMVC---------GKA +1== FYIFT-HQGS--DFGATFMTLPAFFAKSSALYNPVIYILMNKQFRNCMITTLCC------ +2== MYIFT-HQGS--NFGPIFMTLPAFFAKTASIYNPIIYIMMNKQFRNCMLTSLCC------ +3== FWIFT-NKGA--DFTATLMAVPAFFSKSSSLYNPIIYVLMNKQFRNCMITTICC------ +4=p AWIFF-NKGA--DFSAKFMAIPAFFSKSSALYNPVIYVLLNKQFRNCMLTTIFC------ +5=p AWIFF-NKGA--AFSAQFMAIPAFFSKTSALYNPVIYVLLNKQFRSCMLTTLFC------ +6== LWIVS-HRGE--EFDLRMATIPSCLSKASTVYNPVIYVLMNKQFRSCMM-KMVC------ +7== MYMVN-NRNH--GLDLRLVTIPSFFSKSACIYNPIIYCFMNKQFQACIM-KMVC------ 8=opsin, ------------------------------------------------------------ -9== NPGY--PFHPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCIL-QLF----------GKK -10== NPGY--AFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCIL-QLF----------GKK -11== NPGY--AFHPLAAALPAYFAKSATIYNPVIYVFMNRQFRNCIM-QLF----------GKK -12== NPGY--AFHPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIL-QLF----------GKK -13== HKGI--IIQPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLL-EMLCCGY-----QPQR -14== GDKT--LLTPGATMIPACACKMVACIDPFVYAISHPRYRMELQKRCPWLAL--------N -15== GDKT--LLTPGATMIPACTCKMVACIDPFVYAISHPRYRMELQKRCPWLAI--------S -16== GDKS--LLTQGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGV--------N -17== GDKS--LLTPGATMIPACTCKLVACIEPFVYAISHPRYRMELQKRCPWLGV--------N -18== GDTS--GITPLVSTLPALLAKSCSCYNPFVYAISHPKYRLAITQHLPWFCV------HET -19== GNAE--GITPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCV------HEK -20== KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF------GKV -21== KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF------GKV -22== KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF------GKV -23== KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPKYRIVLKEKCPMCVF------GNT -24== KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPNDRLVLKEKCPMCVC------GTT -25== NL-V--KISPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLAC-------AA -26== SSGT--RLTPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLAC------GSG -27== GPAE--WVTPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQFDEKEC -28== GPLE--WVTPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQFDDKET -29== CGTSCSCIPLWVERTFLWLGYANSLINPFIYAFFNRDLRTTYRSLLQCQYR--------- -30== CGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRPTSRSLLQCQYR--------- -31=p CGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRTTYRSLLQCQYR--------- -32== CESSC-HMPTLLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKCKFC--------- -33=p CESSC-HMPALLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKCKFC--------- -34== VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFGKYR--------- -35=p VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFGKYR--------- -36== CDCE---VSPVLTSLSLWLGYFNSTLNPVIYTVFSPEFRHAFQRLLCGRRV--------- +9== CFAAA-NPGY--PFHPLMAALPAFFAKSATIYNPVIYVFMNRQFRNCIL-QLF------- +10== CFAAA-NPGY--AFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCIL-QLF------- +11== SFAAA-NPGY--AFHPLAAALPAYFAKSATIYNPVIYVFMNRQFRNCIM-QLF------- +12== CFAAA-NPGY--AFHPLAAALPAYFAKSATIYNPIIYVFMNRQFRNCIL-QLF------- +13== LVVAT-HKGI--IIQPVLASLPSYFSKTATVYNPIIYVFMNKQFQSCLL-EMLCCGY--- +14== LIGAF-GDKT--LLTPGATMIPACACKMVACIDPFVYAISHPRYRMELQKRCPWLAL--- +15== LIGAF-GDKT--LLTPGATMIPACTCKMVACIDPFVYAISHPRYRMELQKRCPWLAI--- +16== LIGAF-GDKS--LLTQGATMIPACTCKLVACIDPFVYAISHPRYRLELQKRCPWLGV--- +17== LIGAF-GDKS--LLTPGATMIPACTCKLVACIEPFVYAISHPRYRMELQKRCPWLGV--- +18== LKGVM-GDTS--GITPLVSTLPALLAKSCSCYNPFVYAISHPKYRLAITQHLPWFCV--- +19== IQGLL-GNAE--GITPLLTTLPALLAKSCSCYNPFVYAISHPKFRLAITQHLPWFCV--- +20== CMGLF-KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF--- +21== CMGLF-KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF--- +22== CMGLF-KF-E--GLTPLNTIWGACFAKSAACYNPIVYGISHPKYRLALKEKCPCCVF--- +23== YFGLF-KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPKYRIVLKEKCPMCVF--- +24== YFGLF-KI-D--GLTPLTTIWGATFAKTSAVYNPIVYGISHPNDRLVLKEKCPMCVC--- +25== FSGIF-NL-V--KISPLFTIWGSLFAKANAVYNPIVYGISHPKYRAALFAKFPSLAC--- +26== WAGVF-SSGT--RLTPLATIWGSVFAKANSCYNPIVYGISHPRYKAALYQRFPSLAC--- +27== LLAQF-GPAE--WVTPYAAELPVLFAKASAIHNPIVYSVSHPKFREAIQTTFPWLLTCCQ +28== LLAQF-GPLE--WVTPYAAQLPVMFAKASAIHNPMIYSVSHPKFREAISQTFPWVLTCCQ +29== TARPFICGTSCSCIPLWVERTFLWLGYANSLINPFIYAFFNRDLRTTYRSLLQC------ +30== TARPFICGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRPTSRSLLQC------ +31=p TARPFICGTSCSCIPLWVERTCLWLGYANSLINPFIYAFFNRDLRTTYRSLLQC------ +32== LVLPF-CESSC-HMPTLLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKC------ +33=p LVLPF-CESSC-HMPALLGAIINWLGYSNSLLNPVIYAYFNKDFQNAFKKIIKC------ +34== LIGPF-VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFG------ +35=p LIGPF-VDPE--GIPPFARSFVLWLGYFNSLLNPIIYTIFSPEFRSAFQKILFG------ +36== ILVPT-CDCE---VSPVLTSLSLWLGYFNSTLNPVIYTVFSPEFRHAFQRLLCG------ -1== PLGD-DE--SGASTSKTEVSSVS-TSPV-------------------------------- -2== PLGD-DE--ASATASKTE------TSQV-------------------------------- -3== PFGD-EDVSSTVSQSKTEVSSVS-SSQV-------------------------------- -4=p PLGD-DE-SSTVSTSKTEVSS------V-------------------------------- -5=p PLGD-EE-SSTVSTSKTEVSS------V-------------------------------- -6== -IEE-DE--ASTSSQVTQVSS------V-------------------------------- -7== -MTD-ES--DTCSSQKTEVSTVS-STQV-------------------------------- +1== --GKNPLGDDE--SGASTSKTEVSSVS-TSPVS--------------------------- +2== --GKNPLGDDE--ASATASKTE------TSQVA--------------------------- +3== --GKNPFGDEDVSSTVSQSKTEVSSVS-SSQVS--------------------------- +4=p --GKNPLGDDE-SSTVSTSKTEVSS------VS--------------------------- +5=p --GKNPLGDEE-SSTVSTSKTEVSS------VS--------------------------- +6== --GKN-IEEDE--ASTSSQVTQVSS------VA--------------------------- +7== --GKA-MTDES--DTCSSQKTEVSTVS-STQVG--------------------------- 8=opsin, ------------------------------------------------------------ -9== -VDD-GS--ELSSASKTEVSSV---SSV-------------------------------- -10== -VDD-GS--ELSSASKTEVSSV---SSV-------------------------------- -11== -VDD-GS--EASTTSRTEVSSVS-NSSV-------------------------------- -12== -VDD-GS--EVST-SRTEVSSVS-NSSV-------------------------------- -13== -TGK-AS--PGTPGPHADVTAAGLRNKV-------------------------------- -14== EKAP-ES-SAVASTSTTQEP-QQ-TTAA-------------------------------- -15== EKAP-ES-RAAISTSTTQEQ-QQ-TTAA-------------------------------- -16== EKSG-EI-SSAQST-TTQEQ-QQ-TTAA-------------------------------- -17== EKSG-EA-SSAQST-TTQEQTQQ-TSAA-------------------------------- -18== ETKS-ND-DSQSNSTVAQDKA--------------------------------------- -19== DPND-VE-ENQSSNTQTQEKS--------------------------------------- -20== DDGK-SS-DAQSQATASEAESKA------------------------------------- -21== DDGK-SS-DAQSQATASEAESKA------------------------------------- -22== DDGK-SS-EAQSQATTSEAESKA------------------------------------- -23== DEPKPDA-PASDTETTSEADSKA------------------------------------- -24== DEPKPDA-PPSDTETTSEAESKD------------------------------------- -25== EPSS-DA-VSTTSGTTTVTDNEK-SNA--------------------------------- -26== ESGS-DV-KSEASATTTMEEKPK-IPEA-------------------------------- -27== EDAN-DA-EEEVVASER--GGES-RDAAQMKEMMAMMQKMQAQQAAYQPPPPPQGY--PP -28== EDDK-DA-ETEIPAGESSDAAPS-ADAAQMKEMMAMMQKMQQQQAAY----PPQGYAPPP -29== -----NINRKLSAAGMHEALKLA------------------------------------- -30== -----NINRKLSAAGMHEALKLA------------------------------------- -31=p -----NINRKLSAAGMHEALKLA------------------------------------- -32== -----RQ----------------------------------------------------- -33=p -----RR----------------------------------------------------- -34== -----RGHR--------------------------------------------------- -35=p -----RGHR--------------------------------------------------- -36== -----RRRRA-------------------------------------------------- +9== --GKK-VDDGS--ELSSASKTEVSSV---SSVS--------------------------- +10== --GKK-VDDGS--ELSSASKTEVSSV---SSVS--------------------------- +11== --GKK-VDDGS--EASTTSRTEVSSVS-NSSVA--------------------------- +12== --GKK-VDDGS--EVST-SRTEVSSVS-NSSVS--------------------------- +13== -QPQR-TGKAS--PGTPGPHADVTAAGLRNKVM--------------------------- +14== ---NE-KAPES----SAVASTSTTQEPQQTTAA--------------------------- +15== ---SE-KAPES----RAAISTSTTQEQQQTTAA--------------------------- +16== ---NE-KSGEI----SSAQSTTTQEQ-QQTTAA--------------------------- +17== ---NE-KSGEA----SSAQSTTTQEQTQQTSAA--------------------------- +18== ---HE-TETKS-NDDSQSNSTVAQDKA--------------------------------- +19== ---HE-KDPND-VEENQSSNTQTQEKS--------------------------------- +20== ---GK-VDDGK-SSDAQSQATASEAESKA------------------------------- +21== ---GK-VDDGK-SSDAQSQATASEAESKA------------------------------- +22== ---GK-VDDGK-SSEAQSQATTSEAESKA------------------------------- +23== ---GN-TDEPKPDAPASDTETTSEADSKA------------------------------- +24== ---GT-TDEPKPDAPPSDTETTSEAESKD------------------------------- +25== ----A-AEPSS-DAVSTTSGTTTVTDNEKSNA---------------------------- +26== ---GS-GESGS-DVKSEASATTTMEEKPKIPEA--------------------------- +27== FDEKE-CEDAN-DAEEEVVASER--GGESRDAAQMKEMMAMMQKMQAQQAAYQPPPPPQG +28== FDDKE-TEDDK-DAETEIPAGESSDAAPSADAAQMKEMMAMMQKMQQQQAAY----PPQG +29== ---QY-RNINR--KLSAAGMHEALKLAER------------------------------- +30== ---QY-RNINR--KLSAAGMHEALKLAER------------------------------- +31=p ---QY-RNINR--KLSAAGMHEALKLAER------------------------------- +32== ---KF-CRQ--------------------------------------------------- +33=p ---KF-CRR--------------------------------------------------- +34== ---KY-RRGHR------------------------------------------------- +35=p ---KY-RRGHR------------------------------------------------- +36== ---RR-VRRRR--A---------------------------------------------- -1== --------------------------------------------SPA------------- -2== --------------------------------------------APA------------- -3== --------------------------------------------SPA------------- -4=p --------------------------------------------SPA------------- -5=p --------------------------------------------SPA------------- -6== --------------------------------------------APEK------------ -7== --------------------------------------------GPN------------- +1== -----------------------------------PA----------------------- +2== -----------------------------------PA----------------------- +3== -----------------------------------PA----------------------- +4=p -----------------------------------PA----------------------- +5=p -----------------------------------PA----------------------- +6== -----------------------------------PEK---------------------- +7== -----------------------------------PN----------------------- 8=opsin, ------------------------------------------------------------ -9== --------------------------------------------SPA------------- -10== --------------------------------------------SPA------------- -11== --------------------------------------------APA------------- -12== --------------------------------------------SPA------------- -13== --------------------------------------------MPAHP---V------- +9== -----------------------------------PA----------------------- +10== -----------------------------------PA----------------------- +11== -----------------------------------PA----------------------- +12== -----------------------------------PA----------------------- +13== -----------------------------------PAHPV-------------------- 14== ------------------------------------------------------------ 15== ------------------------------------------------------------ 16== ------------------------------------------------------------ @@ -445,52 +445,52 @@ 24== ------------------------------------------------------------ 25== ------------------------------------------------------------ 26== ------------------------------------------------------------ -27== QGYPPQGAYPPPQGYPPQGYPPQGYPPQGYPPQGAPPQVEAPQGAPPQG---VDNQAYQA -28== QGYPPQGY--PPQGYPPQGYPPQGYPP---PPQGAPPQ-GAPPAAPPQG---VDNQAYQA -29== -------------------------------------------ERPERPEFVLQNADYCR -30== -------------------------------------------ERPERSEFVLQNSDHCG -31=p -------------------------------------------ERPERSEFVLQNSDHCG +27== Y--PPQGYPPQGAYPPPQGYPPQGYPPQGYPPQGYPPQGAPPQVEAPQGAPPQGVDNQAY +28== YAPPPQGYPPQGY--PPQGYPPQGYPPQGYPP---PPQGAPPQ-GAPPAAPPQGVDNQAY +29== -----------------------------------PERPEFVL-QNADYCRKKGHDS--- +30== -----------------------------------PERSEFVL-QNSDHCGKKGHDT--- +31=p -----------------------------------PERSEFVL-QNSDHCGKKGHDT--- 32== ------------------------------------------------------------ 33=p ------------------------------------------------------------ 34== ------------------------------------------------------------ 35=p ------------------------------------------------------------ -36== ---------------------------------------------PQ------------- +36== -----------------------------------PQ----------------------- -1== ------ -2== ------ -3== ------ -4=p ------ -5=p ------ -6== ------ -7== ------ -8=opsin, ------ -9== ------ -10== ------ -11== ------ -12== ------ -13== ------ -14== ------ -15== ------ -16== ------ -17== ------ -18== ------ -19== ------ -20== ------ -21== ------ -22== ------ -23== ------ -24== ------ -25== ------ -26== ------ -27== ------ -28== ------ -29== KKGHDS -30== KKGHDT -31=p KKGHDT -32== ------ -33=p ------ -34== ------ -35=p ------ -36== ------ - +1== -- +2== -- +3== -- +4=p -- +5=p -- +6== -- +7== -- +8=opsin, -- +9== -- +10== -- +11== -- +12== -- +13== -- +14== -- +15== -- +16== -- +17== -- +18== -- +19== -- +20== -- +21== -- +22== -- +23== -- +24== -- +25== -- +26== -- +27== QA +28== QA +29== -- +30== -- +31=p -- +32== -- +33=p -- +34== -- +35=p -- +36== -- +