view test-data/sample_output_collection.fasta @ 4:7a5ff5359b13 draft

"planemo upload for repository https://github.com/galaxyproteomics/tools-galaxyp/tree/master/tools/metanovo commit c220dc85d59698a73b0f173d46e269e27264d6d8"
author galaxyp
date Fri, 22 Apr 2022 13:31:08 +0000
parents
children
line wrap: on
line source

>sp|Q61539|ERR2_MOUSE Steroid hormone receptor ERR2 OS=Mus musculus OX=10090 GN=Esrrb PE=1 SV=2
MSSEDRHLGSSCGSFIKTEPSSPSSGIDALSHHSPSGSSDASGGFGIALSTHANGLDSPP
MFAGAGLGGNPCRKSYEDCTSGIMEDSAIKCEYMLNAIPKRLCLVCGDIASGYHYGVASC
EACKAFFKRTIQGNIEYNCPATNECEITKRRRKSCQACRFMKCLKVGMLKEGVRLDRVRG
GRQKYKRRLDSENSPYLNLPISPPAKKPLTKIVSNLLGVEQDKLYAMPPNDIPEGDIKAL
TTLCELADRELVFLINWAKHIPGFPSLTLGDQMSLLQSAWMEILILGIVYRSLPYDDKLA
YAEDYIMDEEHSRLVGLLDLYRAILQLVRRYKKLKVEKEEFMILKALALANSDSMYIENL
EAVQKLQDLLHEALQDYELSQRHEEPRRAGKLLLTLPLLRQTAAKAVQHFYSVKLQGKVP
MHKLFLEMLEAKV
>sp|O88689|PCDA4_MOUSE Protocadherin alpha-4 OS=Mus musculus OX=10090 GN=Pcdha4 PE=1 SV=1
MEFSWGSGQESQRLLLSFLLLAIWEAGNSQIHYSIPEEAKHGTFVGRIAQDLGLELTELV
PRLFRVASKDRGDLLEVNLQNGILFVNSRIDREELCGRSAECSIHLEVIVDRPLQVFHVE
VEVRDINDNPPRFPTTQKNLFIAESRPLDTWFPLEGASDADIGINAVLTYRLSPNDYFSL
EKPSNDERVKGLGLVLRKSLDREETPEIILVLTVTDGGKPELTGSVQLLITVLDANDNAP
VFDRSLYTVKLPENVPNGTLVVKVNASDLDEGVNGDIMYSFSTDISPNVKYKFHIDPVSG
EIIVKGYIDFEECKSYEILIEGIDKGQLPLSGHCKVIVQVEDINDNVPELEFKSLSLPIR
ENSPVGTVIALISVSDRDTGVNGQVTCSLTSHVPFKLVSTFKNYYSLVLDSALDRETTAD
YKVVVTARDGGSPSLWATASVSVEVADVNDNAPVFAQPEYTVFVKENNPPGAHIFTVSAM
DADAQENALVSYSLVERRVGERLLSSYVSVHAESGKVFALQPLDHEELELLRFQVSARDA
GVPALGSNVTLQVFVLDENDNAPTLLEPEAGVSGGIVSRLVSRSVGAGHVVAKVRAVDAD
SGYNAWLSYELQSSEGNSRSLFRVGLYTGEISTTRILDEADSPRQRLLVLVKDHGDPAMI
VTATVLVSLVENGPVPKAPSRVSTSVTHSEASLVDVNVYLIIAICAVSSLLVLTLLLYTA
LRCSTVPSESVCGPPKPVMVCSSAVGSWSYSQQRRQRVCSGEYPPKTDLMAFSPSLSDSR
DREDQLQSAEDSSGKPRQPNPDWRYSASLRAGMHSSVHLEEAGILRAGPGGPDQQWPTVS
SATPEPEAGEVSPPVGAGVNSNSWTFKYGPGNPKQSGPGELPDKFIIPGSPAIISIRQEP
ANNQIDKSDFITFGKKEETKKKKKKKKGNKTQEKKEKGNSTTDNSDQ
>sp|Q486J8|GCST_COLP3 Aminomethyltransferase OS=Colwellia psychrerythraea (strain 34H / ATCC BAA-681) OX=167879 GN=gcvT PE=3 SV=1
MTNKTVLHAKHLASGAKMVDFFGWDMPINYGSQIEEHHAVRTDAGMFDVSHMTIVDVQGA
DAKAFLRRLVINDVAKLATPGKALYTGMLNEEGGVIDDLIIYFFSDTDYRLVVNSATRVK
DLAWMTKQSTGFDITITERPEFGMLAVQGPEAKAKVAKLLTAEQIEAVEGMKPFFGVQVG
DLFIATTGYTGEDGYEIIVPNNSAEDFWQKLLDEGVVPCGLGARDTLRLEAGMNLYGLDM
DETVSPLAANMAWTISWEPTDRDFIGRDVLTAQKAAGDQPKLVGLVLEAKGVLRSHQVVV
TEFGNGEITSGTFSPTLGHSVALARVPRSVKVGDTIEVEMRKKLIKVQVTKPSFVRNGKK
VF
>sp|Q8JGS1|STIL_DANRE SCL-interrupting locus protein homolog OS=Danio rerio OX=7955 GN=stil PE=1 SV=2
MNRVQVDFKGLPAHILENSIAAESLQNTRSSDNVLTPLTFPKSKVALWDPSANGEVVSLH
FSYYRNPRLFLVEKALRLAHRHARQTNKPRFFCFLLGTLAVDSDEEGVTITLDRFDPGRE
QTGCLGKAPTALLPGDILVPCVFEAQHAASSTVHSSEDLNISFKMLQHFCCSKELLELSK
LLTLRAQLSCSENMDRLTFNLSWAAVTLACTLDAVPIRAVPIIPTALARNLSSPAGVTQN
SKRGFLTMDQTRKLLLILESDPKAYTLPLVGIWLSGVTHIHNPLVWAWCLRYLHSSSLQD
KVLSEGGTFLVVLYSLTHRDPEFYQCKPSTGQQQLSFQLLTSRDSLTLYKNVEPSEGRPL
QFELSSENQNQETVLFEEVLSQSVLTGTTLGAASAAPQNKLSISDHDSGVEDEDLSPRPS
PNPHPVSQQTKRVHPLVPELSMVLDGSFLDGSVVNTQGSTPLSHSQSNVHRRNSSPALQG
LSVLRPLVQGSVTKPPPIRRPLTPILSQPKNKLHPNPSQQTPQHSVSRKSLPSMRRSREG
SSASSVSSSSSSSSTKNASPNGSFHQQRQRLSQGFPNKPQLIYSGPPTSGHSSAKKSSSV
PSQTPVPHPSQHRIFHSTPAVNPCNCCTNHPSVAPLYQNNTWQGTPGYPTAVHSPCVFHC
SPETVPPGDHCLSPSRQSLGCRVSPTKSPVCYHSTPPHYSPSSGPCVPTIISNKGLVEQT
PSCQAQCCQVKGSKEPCLDTPMGLLPADAYRMLIDQERQLKLLQLQIQKLLESQSKVPEV
SSEQNAQQQRPNQVPASPPKRTSVSIAVGTGASLFWSTPQETSTHEASSLEWQTETEPKS
GCQNDSTVTSRDRSESACHYSEEHCPGSPQHPTSPQHNTSSGFGVQMFQSPVLGESASMY
YQSQSQSKDLSENREIDDPRFYHELLGQVQSRLQDSVIVEDKVEQDQQSLLKTQSLSPVV
HQSRKPLTTSSIPQTQKTKQPSSPPNQDRVLSATLKQLQQFGVNIDLDSSQEKTTRATVE
SASTLACINPEAVIPRLALSEPVGASIWGPSGSVDLSLEANAIALKYLSDSQLSRLSLGS
QSSSPHSDPSTILLRRPAVEKSNVALSILSPSNMSLATCKYMKKYGLIEGEISSEEEQED
PIQVDSALGCSVQHETSKTISLGQEREEQNTAVLKNITNKPVVNLHTSPIDSQEQILQDL
RPKMQLLLRGGTNSEKENATKRNLIERRSSLTENQRTQEVVDPQGSVGNFLDLSRLRQLP
KLF
>sp|O14108|ETA2_SCHPO DNA-binding protein eta2 OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=eta2 PE=1 SV=1
MMLAIDMTINENQGTRSNLESPTLSCSSKGAMQERDVMFTDHNTFNITNNKSRPGSLMKS
MKRKDVYEFDEDNEFEFEMGSLIHKPSRAHSLGGTSEPVSDDHKDCMEATRQLLENSPLS
SVVVKTCSDHASKRKIARSSSDDSESKVESTNSFNAKKRKDAWTEEHEKWFQARIDELLT
IRSISREQMIEILEDEHAGSRLQGFLESVASFLNRKENSLLKYMRAFFQVAGYEKIDIGS
LAAEEDSQLNFSLEDAQVIQKVVLSYCNNEGVDLQEFGFRMSSSSLRHTNINFLYNELRE
LLPTSISRKGIIRYLKEIYKPLDPKDRNAWEESELKKLYTLVEQEGTRWNSIANKLGTSP
AACMSQWRFVVGTSTQETIDRRKLWTNEEEAKLLDLVKSSYRSSFHTKKMTSLFTHNNHT
TSNIQREIPASDSIAWHSISKKLGTKSPESCRKQYEKTIASYSSNQRQEEDQGKKRKKRK
KKKSKGKRKFYVADSLKLLEHVQRQCGEAISINAIDWKGIVKQMPKWSEEELRAQATNLV
ASVRGWKKTRLSESVRIAITDLKSLPPDV
>sp|Q14934|NFAC4_HUMAN Nuclear factor of activated T-cells, cytoplasmic 4 OS=Homo sapiens OX=9606 GN=NFATC4 PE=1 SV=2
MGAASCEDEELEFKLVFGEEKEAPPLGAGGLGEELDSEDAPPCCRLALGEPPPYGAAPIG
IPRPPPPRPGMHSPPPRPAPSPGTWESQPARSVRLGGPGGGAGGAGGGRVLECPSIRITS
ISPTPEPPAALEDNPDAWGDGSPRDYPPPEGFGGYREAGGQGGGAFFSPSPGSSSLSSWS
FFSDASDEAALYAACDEVESELNEAASRFGLGSPLPSPRASPRPWTPEDPWSLYGPSPGG
RGPEDSWLLLSAPGPTPASPRPASPCGKRRYSSSGTPSSASPALSRRGSLGEEGSEPPPP
PPLPLARDPGSPGPFDYVGAPPAESIPQKTRRTSSEQAVALPRSEEPASCNGKLPLGAEE
SVAPPGGSRKEVAGMDYLAVPSPLAWSKARIGGHSPIFRTSALPPLDWPLPSQYEQLELR
IEVQPRAHHRAHYETEGSRGAVKAAPGGHPVVKLLGYSEKPLTLQMFIGTADERNLRPHA
FYQVHRITGKMVATASYEAVVSGTKVLEMTLLPENNMAANIDCAGILKLRNSDIELRKGE
TDIGRKNTRVRLVFRVHVPQGGGKVVSVQAASVPIECSQRSAQELPQVEAYSPSACSVRG
GEELVLTGSNFLPDSKVVFIERGPDGKLQWEEEATVNRLQSNEVTLTLTVPEYSNKRVSR
PVQVYFYVSNGRRKRSPTQSFRFLPVICKEEPLPDSSLRGFPSASATPFGTDMDFSPPRP
PYPSYPHEDPACETPYLSEGFGYGMPPLYPQTGPPPSYRPGLRMFPETRGTTGCAQPPAV
SFLPRPFPSDPYGGRGSSFSLGLPFSPPAPFRPPPLPASPPLEGPFPSQSDVHPLPAEGY
NKVGPGYGPGEGAPEQEKSRGGYSSGFRDSVPIQGITLEEVSEIIGRDLSGFPAPPGEEP
PA