From the sea urchin genome project
http://annotation.hgsc.bcm.tmc.edu/Urchin/cgi-bin/pubSearchGene.cgi
search was with
P450 as synonym. Found 137 hits
including one reductase.
Nov. 29,
2006 (in progress, expression data
links still being added.
These are not
named yet.)
GLEAN is a gene
prediction program that merged four other gene prediction
algorithm results
from sea urchin. (see p. 14 of the supp. section of the Science
paper (Science
314, 939-952 2006)
A quote from
http://www.systemix.org/sea-urchin/primer.html
"[The] GLEAN
set for example is very conservative and therefore misses many existing
genes."
>GLEAN3_00279
expression
data for GLEAN3_00279
This prediction is short (219 aa),
missing N-terminal half
MKESKDQNGNDVFCADHVVMATWDMFLGGYEATYKTIKWAIAFLIHHPEVQAKIQSEM
TAGFGDRMPTWEDRAKLPYLEATVLETLRQSSMSSMNGPRRTTCDTQVKGYDVAKDTT
VFCNLWWVHHDPKNWKDPMSFRPEHFLTEDGKVSVPRAFLAFSLGSRACIGSQLALMQ
LILLLGGLLQRFNLRQAVGEPLPDLEPGSELVRTVGDYTMCLEVR
>GLEAN3_00436
expression
data for GLEAN_00436
MEINFLGFSFSLTWFLVGAIFIIYICNDLWIRSYWTRHGFKSPKSVPIFGNTLQWTKG
IQHAFLDYFKKQGTVFGVYDFHRPGLCITDLDLLRNIMVKDFGSFTNRWATPLLNKDL
LDALFFSKDDHWKGIRNIITPTFSAAKMKLMSPLINKCADRLLKHLEKHQKIHGNIE
CRELVGAFVMDTIASCAFGLNVDCQENRHHPFVTNAKKAFAGNTFSSPLLRIDYLQLL
LDAQMRDITGNNKNKADGIHGDPEDEAIDEGGLQKKSSSKIRLNESEVISQALIFFLA
GYETTNTTCGYMLYLLATHPDVQDKLVNEIDDVAPEAEDVGYQSISKMPYLEQIFCET
ERIYPPALMTDRVCNEPFDINGFTVPKGMRIFIPIFTIHHDPNLWPDPETYDPDRFSK
ENREKHHPCAWMPFGTGPRNCVGMRFAMMEAKMVIVRILQKYQIETCPQTEIPPKQGT
NGLISPPNGITLRAVKRN
>GLEAN3_00645
expression
data for GLEAN3_00645
This gene is about 75 aa longer
than usual at the N-terminal
MATRLSRFLARARPSRSSGVRPQTFHDAHASSCPLAAVPSTLSPHASTTTPSTPSTTK
TSTTSTTSAPSSISDLPSNNNLSLGVSNGCTFDQAPASINPSERNCTIFDDVKSFKDI
PGPKGWPILGTLGTYLSGKGLERIYDHQIDFTKKYGPIWKERLGTLEFVNLAKPDLVE
VMYRNDSRYPRRLDMKPWKVYRAHRDESLGVLTHEGKEWHRARRTIGRKIMPPREVAK
FTGVINEIVTDMVERLRFVRDTKGEGDGVVPELQNEMYKWSMESIFKLLLETRIGCK
EPIPEKSQGFINAVGQMFASGQIMFLTSPEWFRRWNLKPWRDHEAAWDYIFNFAKKSV
DARLQELATLAEEGDVVEDSGILTHLLASQQLSMKEVYSNACELLLAGVDTTSNTLAW
ALYELSRHPDTQDRLAKEVEGALAAKGRCIPEHDDLPNMPLLKGVIKETLRLYPVVPA
NNRALDKDVTIGGYHIPKGTMIGSLQYVMGRDPEIYPDPDTFLPERWLKGSEAISSEE
KASSFSSLPFGFGPRMCIGRRIAELEMHLALARISTSFKLRYHGTEEVFPAIRGILVP
GSPLKISFRDR
>GLEAN3_00746
expression
data for GLEAN3_00746
Missing the C-terminal
MSRNIGFAHLATRYGGLISLRLGTKLTVVLNDVDSVKQSLVKEADVFTDRNVPIEYSL
ALPLEGSLAFASGDKWRELRTFVAKAFRQFEVEKEKLESVIVRESERLCKAFQDEANK
SSSDSQGFDPHHIINNAVSNIICSMCFGQSYEYSDPEFHHILTVVGNSFRSLSPSGLL
LISRYFYYTPLYREVRHNLAVLQDFIRGIVQRHQETFDPDNTRDIIDMYLAEVRQLKE
DGMDTYLKEENIWRGVFDIFLAASESMTSALRWIFLYLIDNPELQRQV
>GLEAN3_01228
expression
data for GLEAN3_01228
no seq. provided, Scaffold 91975
12823-12912, 11663-11790, 11303-11390, 10483-10647
(minus strand exons) This seq
derived partially from two of those sequences
best hit is to a Fugu partial
sequence like CYP4V
DFVLDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRWAN
RTEKEKLLFCYGDGPRCCVGTSLMKNLI
CYP4V5-like
fragment from Fugu FS:S000209 Scaffold_209 52% to 4V5
Score = 55.8 bits (133), Expect = 1e-11
Identities = 23/40 (57%), Positives =
29/40 (72%)
Query:
1 DFVLDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRW
40
D + G+KVPKG+ IYVS HRDP+VF+
P+SF P RW
Sbjct:
1
DSTIAGFKVPKGHGAIYVSHAVHRDPDVFQQPDSFLPERW 40
SEQUENCE
180 Ciona intestinalis
Score = 212 (74.6 bits), Expect =
6.3e-20, P = 6.3e-20
Identities = 35/68 (51%), Positives = 48/68
(70%)
Query: 4
LDGYKVPKGYAVIYVSSIAHRDPNVFKNPNSFDPTRWAN-RTEKEKLLFCYGDGPRCCVG 62
L+ Y +PK YA++YV+ AHRDP
VF PNSF+P RW +E L+ C+G
GPRCCVG
Sbjct: 310
LESYDLPKDYAIMYVTHAAHRDPEVFPEPNSFNPDRWITCNAGQEDLVLCFGSGPRCCVG 369
Query: 63 TSLMKNLI 70
T L++ ++
Sbjct: 370 TDLIRRIL 377
>GLEAN3_01525
expression
data for GLEAN3_01525
This seq is short on both ends
YSILPKDLKYKREVRAAANEIRAMGRRCILARMDALRRGDQVPRDILTYILQESNNSE
GGIKDFDLEELVDEFFTFFGAGQETTFNLLSFTLLHLGRNPQVMKKLRDEIDTVLKGR
NYVEYSDVSKMKYLTLVLKEALRINPPFGHIHRLLPHEMDLCGYKVPKGSVVMIPVYG
MGRNEKHFKNPEKFDPERFTRDEDSPLFAYIPFSLGARACIGQTFAMEGFQLDTVIR
>GLEAN3_01622
expression
data for GLEAN3_01622
MDLYTFLDPYAFFIMLMTVLVLYTQWFEYPISNVKPLPGPRKWSILGNLPQLIAARFD
INDFHERLRKMYGAVVGYRIYHHNVVVINDMKLAREVANHPDFCGRMRDELHANLSKE
PNAGILTSYGHTWKEHHDVLSGFVTALSKSKQLEDICTIEATKFIKVLTDQTDKDLII
NHYSVQTLAVIFCQVIFGKVLDVNDDIVKEVVDIVMERQRESPSATRRLLPSYVNNLT
GSGAKRRHHLDVQFVALVKTLVGDHTKQTEILIPCFVRRYWDERHEKLQSMEDGSMTS
LTSDNLLWACIDLVTVCVDEASLLLTQSLLLLASHPEVQSKLKEDIRPEAKEQDDQEP
SAYLAAVQLETQRLAPDNPWSTSMMALSDVKLGDFEILQDAVVVANWRYLRQGDDQGW
DEPTQFQPGRFLGEDGSLVEHPQMISFPAGKRACPADAMSSSASSMLIRHICQRFKLS
LPAQTTHPCDKNGLIQRSSLFKTKLRIVPH
>GLEAN3_01792
expression
data for GLEAN3_01792
MVSFYVGPKPVILLNSYPVIFEAFRHPDLQDRPSSRLLQEIMGLKNSGIAFSNGEVWK
EQRKFTHSVFRSLGVGKKSYEDIVSAEMTQLSSVIKEKKSSPFDPSTLFMQTVSNIIC
SIVFGTQYTYDDDGFKKILHLVNLNVQLSGGGGTFLFLPIPGISKIPFGKMKLMTNNV
RSLNAFIQSQIESHVKNWDPENQRDFIDQYLNKLDETKDTVSSFTKLNLTGCIADLFL
AGSDTTTTTLNWCILFMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILE
VQRKGVVVPLGVPHVAAADTSIRGYTIPKGAVIVSNISEVLNSEELWKDSGAFKPERF
LTADGELIKRDELVFFSTGKTKDQYFFFLCLIRGFIDYIYMVTFKDTILQIKFT
>GLEAN3_02371
expression
data for GLEAN3_02731
MSHLYSICVGFVNLSSVFILMAMISLLAVGYFIKTRGSRFPAGPLGVPVLGYLPFLDC
RRLHRSLMSLGRRYGNVFSLKIGAQTVVVLNSADVIRECLVGKATSFDGRPVWMLNKV
NQGRGIANEQPTKKWQAHRKAWSKVTRSLTLHDLDMEGKISHDIARILSVLESSQGHP
IDVSKTVHMALCNIICSLCFGKSFSYNDLDFQHLLKMADKFFCYLSSASAVNFFPILW
YLPLKANKAVAEGYEGIFGFVKNLVEEKQDGVGRMQDRGFIDVCLDETKSKKNCDVVK
MKSKKTSADMPGRNIDRNAETTNDNKNDSISSADSHCHQPCGNGTACKEMDNDDENDV
LETLDEYSRRVICEDVTYVATDMFIGGAETTHAGVMWSIAFMVLYPDIQIRVQKELDD
MVGDQHLPVWSDRHHLPYTQACLQEILRLGNVIPIAIPHVTTRDVTMSNGCHLPQGTT
VLSNLYACHMDPSAWESPREFRTERFLDSEGNPKRFDHFMPFSIGRRMCLGEQLARME
LFLAFTHIFIKYDVSFAPNERQPCLEGRPGITHSPYRFKICAIPRKATKMER
>GLEAN3_02380
expression
data for GLEAN3_02380
MDVFNEQSFIMAKKMEKFADQSEPFNIFPQITYCVLDIICDTAMGKSINAQGEGDNEY
VTAVISMTNLVQERMKKPWFWPDLLYDNIQSGKKHANNLRVLHDMTTKIIKQRLQEPP
RALDGSEEDAVAGKRRRIAFLDLLLQMHREDPSFTLEDIREEVDTFMFEGHDTTAAAA
SWTILMIGRHPEVQTRLHEELDEVFGDSDRPITADDLQKLQYLNCVFKETLRLCPSVP
MIGRDLEEDCVIDGKVVPKGTLVVLGIYALHRDPEQFPDPEKFDPDRFLLENSTKRHP
YAYVPFSAGPRNCIGQKFAMMEDKVILANLMRKFSVQAIQTMEETNPLGELIMRPRDG
IYVKLSRRK
>GLEAN3_02590
expression
data for GLEAN3_02590
MVDTFNASADHLVNYLMERADGLQEVRMYDAFNNITLDIIAKVTFNMELNIIDDPDNP
FPNSILTCLNAIMTSINNPWMQYGLTKKDRKTKADVKAACDFLRGFGRDCVNARLAAK
ARGEELPNDMLTNILQASQDLEGNENFGIDEMLDEFVTFFIAGQETTSNLMSFTLEML
GKHPHVLQKVQDELDEKLGSQMFISFQDMGKLDYLMLVLKESLRMYAPAPVVTRVTGT
EVKASTGLIIPKGSQLNLSPFVMGRMSEYFEDPLVFRPERFVESKHSPYAYFPFALGQ
RSCIGQQFALIEARIIMAKLLQQFKFDLVPGQNWNLSYAVTIRPDDGCRNYVSLRH
>GLEAN3_02656
expression
data for GLEAN3_02656
MSSVVNLLLSVIEYIGIANLLTTITICLVTFWYIRMNYGASFHYRLPPGPPVIPYPFD
FLMALFPSSNRFGRMIFLSKRYGDITCLQLGRRMIIMISSPSLVKELIVKQADVTSNR
SAPPFFTSVMDYKGGIIFSDGEPWQEIRRFSLSALRNFGMGRKGIETRIQEEARNLCN
AFAQHSSEAFDAMHLTNAAVSNIICSITFGRRLEYDDPVFMDMIQRLRNVTSGVGINP
IIRELMMRFPILLKTPLLGKPKANLLGVKNFITKEVIEHIESYHSSDIRDIIDMYLTE
AERLEEEKKKTNEDRTKPYLNKDDIWMCVFDLFLAGTETTSTALLWFLLVMAAKPDVQ
EKVISEIDLVVGRERAPCMEDRKDMPYTDATLTEVLRFRPVAPLGVPHVTAKDVEIGE
YTIPRDTEIMINIIGIHLDPRLWDEPEEFNPHRFLSEDGKTVIKPQDAFTVFGAGRRV
CLGEQLARMEFYLFATTLMQRFRFRLPEGESGDFGFEHCLGTIAPKNFKILAQER
>GLEAN3_02658
expression
data for GLEAN3_02658
MESLKGYVTDWWNDMGIITSLLFGLFGSVIVFGIVRQTFGKREKLPPGPLKIPFPLNI
IASLFSKEGKMDMILSAKRKYGDLVSIEIDGQPKILVGSPELVRELFVKHGDVSSSRA
ASKPLKAMINYTGGIIFGESWQPLRRFSLSALRTFGMGKRSIASKITEEARIFTDGLW
ESQTVEGVNPSGLLNKAISNVICSITFGKRFEYSDPDYIIQVENIKKIMAGGNIPPVL
RIVAQRFPVIFRSPLSKNAKKRLQSLKDFIVQQVDEHDRTFEGDDIRDIVDSFLAECR
RLQEKGEQECYIRKEDIWMSVWELFIAGTETTASALAWFFLFMAAHPDIQEQVFKEIV
TTIGSERAPVYDDRKNMPYTEATLAEVLRHRPVAPFGLPHVADETIHVRDYVIPKGST
ILANILGIHHDPELFPEPSKFDPNRFLSDDGQTFVKNDAYTAFGLGRRSCLGEQLAKM
EFFLFATSVLQRFTIRLPAGATPDYSIGHRMMTLLPSDFYIHLDKR
>GLEAN3_02660
expression
data for GLEAN3_02660
MELNAGFRILLYALLVITSTVGVAFVVFSWLLYRFNGKYGHLPCPRPTSFFFGHLSHI
NAVSKKMDMQTLTLMHQWMRDLGPVYCIRFILQPMVICSNPPTVKVGFDMDINAVENS
DCPFPSAIALALQGMDATFKNPLLMVDPRSKARQYRRSVRDAVKLVRNIGQDCIEKRR
EAKKKGEELPKDILTCILQAEDDVGNGEKIETADLIDEFGTFFVAGQETTSNLLAFTL
LEIGRHKDVAKRLKDEVDEVLGSKSIIEHSDLTKLEYMSRVFKETMRLNPPVSGASRE
LASDVQSCGYVIPKGASVLFLSYLTSRLEEYFDDPLLFNPDRFIPTDETPRHFFAYFP
FAIGQRNCIGQQLALIEAKVILAKLLQSFDFRLEQSQRHIIVSQVTNKPIDGCKNYIS
LRD
>GLEAN3_02831
expression
data for GLEAN3_02831
only half of gene from I-helix on,
missing PERF motif
MSELMPKTAYFCIFIAGIQTTVETLYWAMALLATYPEIQEKIRAEIDDVIGRDRLPTI
NDRGKLPYTEASLYEVLRYSSIAPTALPHATTRDTEFEHFMDGSGTIRDHPPSFLPFG
AGRRGCLGEAVAKADLFLIFTWFLQNYTFSKVQGKESEDILKMIPQTASGRLLLSYEI
MINKRD
>GLEAN3_02832
expression
data for GLEAN3_02832
short at N-term, green = C-helix
MLPSPFHVIHFQVILSSSQHVRKGFLYRLLEPWLGLGLLTSTGQKWFHRRKMLTPTFH
FSILQCFMDVFNEQSTILVKKLEKFADKSETINIFPLVTNCVLDIICDTAMGRCTNAQ
EDGDNEYVQAVGRMGELVIARAKNPLTWPDYLFGKLNAGKEHDKTLKILHDVTDNMIQ
ERLKEPPSVTHEDEDETVARRRKRIAFLDLLLAMHREDASFTLKDIREEVDTFMFEGH
DTTAAAISWAILEIGQHPDIQERLHAELDEVFGDSIRPVTSDDLSRLSYLTRIVKESL
RIIPAVPMVARSLDEDIVLDGKVVPKEAMIMLHIYALHQDPQQFPDPDQFDPDRFLPE
NAEKRHPYAFVPFSAGPRNCIGQKFAMMETKLTLANIFRRFSIESVQTIEGAKPAGQL
ILRPVEGNILVKLSWRK
>GLEAN3_02884
expression
data for GLEAN3_02884
missing N-term half, seems
long on the end
MRILIDLANKSDDSSSKEGLYEAIKNQTIEFLTSGMLSLSSAASTMLYHLAKKPSVLA
KVREELNDYGLLTDYSPLTLNKINQCRYVMQVIKESLRMVPPIGATFRTALKTFELDG
KQIPKGWTVFWSIRETIHLSDSFTDTKNFDPDRFSPQRQEDKVAGRFSMPVFGHGTHS
CIGQNFALLTLRILLIELARTCDFETPNVDKIKFSWIPSPKPKCGMPTRFTRKILVDQ
NENEVSYSSKSSTCPQGEDSRERSEEGVHQRSTTCHAA
>GLEAN3_02898
expression
data for GLEAN3_02898
MDLYTFLDPYAFFIMLMTVLVLYTQWFEYPISNVKPLPGPRTWSILGNLPQLIAARFD
INDFHERLRKMYGAVVGYRIYHHNVVVINDLKLVREVANHPDFCGRMRSKLHANLSKE
PNVGILTSYGHTWKEHHDVLSGFVTALSKSKQLEDICTIEATKFIKVLTDQTDKDLII
HHYSVQTLAVIFCQVIFGKVLDVNDDIVKEVVDIVMERQRESASATRRLLPSYVNNLT
GSGAKRRHQLNVQFVALVKTLVGDHTKQTEILIPCFVRRYWDERHEKLQSMEDGSMTS
LTSDNLLWACIDLVTVCVDEASLLLTQSLLLLASHPEVQSKLKEDICPKAREQDDQEP
SAYLAAVQLETQRLAPDNPWSTSLMALSDVKLADFEILQDAVVVANWRYLRQGDDQDW
DEPTQFQPGRFLGEDGSLVEHPQMISFPAGKRACPADAMSSSASSMLIRHICQRFKLS
LPAQTTHPCDKNGLIQRSSLFKTKLRIVPH
>GLEAN3_02899
expression
data for GLEAN3_02899
C-terminal half only
NGTVSSYDELNTVFSLADIFAAASETTTTTLRWLMLRMVTHPDIQKKIQDEIDDA
VGRDRLPTLLDKPKLPYTEATILEVMRLNFVTPLGVPHQYTGRGELSFHGYDIPE
GAIVFANYWSISRHPKLWDDPEVFRPERFLNEEGKCVKPDELTPFSIGKRVCIGESL
AKMELSLLFTHLLHRFTFTLPDGVPSYSMKDSLGAVNTPVSYQL
>GLEAN3_03038
expression
data for GLEAN3_03038
MGYDQLFNSVTVTTYLVLIITVVLITRYWINTKRESEKVARSGGRPLPGPWGLPVIGN
MLSLDQSGPHLSLMKLAKDYGNVFKIQMGSRPVLVLNGLEATKKALTKQPVVFAGRPD
LFTFKVINETNVYGPSISFSNYSEQWKLHRKLAETSLRHFTAGGQIQVLEKVIESEAQ
ELVRYLKASNKKSGANLKGLINLSVSNIVLWYLLSKRSSYKDKKLIELLGFADKFSKA
TGNGNPVDFLPWLKFIMTKSTRKLKEWQEQFGQMLQGIIEEHHAVYEPGSERDIVDHI
ITTSTKGGDEELKRLGLDGKTLLQIAYDFFGAGYETTSATLEWMLLYMASYPQVQSDV
QGEIDQFVGRGRLPTLNDFDNLPLTQSCLLEIQRHASVVPFAVPHSTTKDTVLDGYFV
PKDMVVFVNLYSAHFDPEVWDQPEAFNARRFLAQDGTLDKDKEKLVVAFGLGRRKCIG
SDLARINIFIYFTTLLHQLNLSCPQGQKVSLESAFGLVRRPVEFNVEVVPRS
>GLEAN3_03231
expression
data for GLEAN3_03231
MIVIASAVGTALCAVYYLKEHYDYKVRKLPPGPTSIPFIGDVLSINPKSPHLSLIELA
KTYGDIFSIKLGSERVVVLNNAELIKSAYRGVDISHRPDLFCMDVLAGGKGFLTCKDE
IQRQMHMKICRQAMRVVNNADMGEKILQEANNLVNHFEAQNEKPFDPQCDLHIASLNI
LCNFVFGERYEHDHPEMREILDFSAEISKLLSPMHPVNAMPWLRHFPNKWFASLFKAK
NQRDRILMKKYVEHVVTYKAGRVRDMLDGLLAETTEAVAENNKQAVALLTPEHIIINM
WLIFFAGSDTVTNTLQWSLLYMAVFPKKQAKAQEELERIIGQSKRLSLEHKSQLPYLQ
AVINEVLRYSSLTILGVPHAAARDTEINGYFIPKGTSVMANFWSVHHNEDAWDQPEEF
LPERFLNDQGNLKEPSQLPHFMPFSVGKRRCLGANVAKAELFLMLGRLLQEFSFEMPQ
DVEADLQGEAAVSLIPKPYKIVAKKRHIIQPETI
>GLEAN3_03232
expression
data for GLEAN3_03232
MDSASQLLLLHPTSVILPYGHSFENFTLNLYDMLSIKTTALQPVSAPHGSMAPILIPL
MIVVSIIVYYCQEWMEFNVDKVPLGPTSLPFIGNVLQLKSKALHLSLTDMAAKFGNVF
SMKLGRQRVVVLNDAQTVKTAYNGPNFTNRPSVFSIEFFVSKGFMACQKKHDFRIHTK
LMKHAFQAVTYTSLGDKLVEEAMDLINQFDSYNGDAFNPREDLNLVSLNILHNIAFGK
RYQKGDTELHEIFDYSSRIMKGVSPVHPVNLIPWLQRYPNPWMKDLSDARDKRDNMLL
RFYEEHKETYVEGEIRDMVDALIKVEQDAEEAGDMETLRLLSPLHIITNIWTIFFAGT
DTIHNALLWVLLYMAVFPEVQRKVQEEIDRVIGDRTPCLEDEQTLPYLCATFYETLRF
SCLSLLGVPHAAVCDTTLNGYHIPAGTQVLANFWAINHDETTWDNPSEFRPERFLNDN
GQLRNMKDFPHFMPFSTGRRACLGKNIGKSGIIVLSACLLQKLNIAVPTGEGAQQPTL
EPEVHFDLVPKQYDIVVTRRDSMRS
>GLEAN3_03606
expression
data for GLEAN3_03606
N-term seems about 150 aa too long
MEAYVTEGTTKEGDDILEEPGHQQFKPEATLLLGDISLECDRGHYLNKDLETCVPCPA
GYYLDVASEICTPCNRGFFQDDQAQTDCKPCKNGTSTRNQGATHSNQCEVLCGPGYFS
STGFSPCMWCEQGSYQSLHGQTRCEKCPLMSTTPDHSGTSIDDCQDKMFDLSATSVFL
GVVIFLLCLWVVRRPKNLPPGPWSLPIIGYNFKPGLIHEAYMDLAKKYGPIFSLRRGS
FVFVVLNDRESITQALVKSGEFFSDRFVPGHFNWAIPDPGKKATISFSNGKPWVDLRK
FSLPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGEPTDLAITFSKATANIICQLLF
SRRYEYSDGEMSKVLKDNEEILSLIPETDLVNIFEPLIHTSNQRYKRYREANFANRDF
IMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFWRVVFDLFAGGTDTTAVVTS
WAILYLSVHADVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATITEVMRIRPILPVS
VPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEPDRFNPDRFLSADGKQFQKN
EAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFEFKLPPNQPNPSMRGANGITMNPE
PYKVCAIER
>GLEAN3_03607
expression
data for GLEAN3_03607
missing N-term, green = C-helix
MYTPCATIAFSNGKPWVDLRRFSLPALRSFGFGKQSLVPQINLEARYLSEEIRDLRGE
PTDLATTFSKATANIICQLLFSQRYEYSDGEMSRILKSMDEFVSLIPETDLVNIFETL
IHTPKYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFW
RVLLDFFIGGTDTTAVITSWAILFLSVHSDVQRKVQAELDTVVGRGRQPNTLDRPDLP
YCNATLTEVMRIRPILPVSVPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEP
YKFNPDRFLSADGKQFQKNEAWMPFGVGRRECVGSQLAKMETFLLFTNLFQQFEFKLP
PNQPNPSMRGANAFTMNPQPYKICAIER
>GLEAN3_03687
expression
data for GLEAN3_03687
missing N-term and C-term (no heme
signature)
MVMTNSKHTKSHVDYDGFQAIFGSRFAGRSLVNETDHSRWSQRRALFNPAFHRQYVGF
GLDLNAMEDPSCPFPVAATKVLQGLQKSLLVPWYSYIPSAHARNFRKEVKEACHLIRE
IGRDCILSRIEAKARGDATPQDILTYLLDASKELQGDQNFGLEEMIDEFVTFFVAGQE
TTGNHLSFTLQQICRYPEVLKKLLIEIEEVLGDKPFVDYSDLPKLEYLMLVMKESMRQ
FPPVSGSTRSLAHEIECCGYTIPKGTRLRVNHFIMGKMEKYFDDPEEFRPERFQVSDE
TYKSRVGVE
>GLEAN3_05160
expression
data for GLEAN3_05160
MAAVEFITENAGTLLVFVTTFTIVLTFLRNRTKTGFNIPGGPTALPFFGNLFSLKPHD
EFLDSLVGWSKEYGSIYSLYLGRKLILIVNGSEAVKELTCKTGLDFADRGESVAFLAL
NPNRLGLFDTLADEEWKRQRKFAHTTLRGMGFGKVSLEPLIQTEARHVISYFKTKVGS
LLDPSVAFSVTSSNIINKLVFNRRVDHGSESGDELVRHVRNLMVSDEGAVLTLFPSLW
PVMRYADSKWMKLYDSWAHIESFFHQEIKIREDRIDACKGELEASDFLTAYLIHRRAD
PEGYSLEALDFVLVDLFLAGTDTVSTTLTWAVLYLVSHPEIQQKIHDEIINEIGDKEV
SMSHRSGLTYTQAALLEVQRLATVAPLIPRESSRDAVLCGHKLPKGTTLLINLWSLHY
DQDVWKDPKQFNPDRFLNDDGTLVKQPESFLPFSAGKRMCMGDQLAKMELFLFFTMIV
QQIKFSLPPGADPLPMDGLLEIARKPKPFQVQVSAWSD
>GLEAN3_05253
expression
data for GLEAN3_05253
missing N-term, green = C-helix,
missing middle of gene from C-helix to EXXR motif
MGHGLVSEVDHEKWFHRRAIMNPAFRRKLQAEVDEVLGQKHTIQYEDLAKLEYMMRVL
KETLRLYSPVGGTTRLTAHPINYNGITIPAKCTVTVMSYVMSRMEEYFDDPLLFNPDR
FKPSEDGSLPRHLYAYFPFSMGQRSCIGQQFALIEARVILAKLLQRFEFRLEQSQSMG
ILDELTAKPMGRCRNYITLRA
>GLEAN3_05439
expression
data for GLEAN3_05439
MGLFVVIVVYVLGLFACLLGIGLLVFVAAILRTEQKFSDLPGPKRDSFLFGNIFSIFE
AKKKLGQPIGLIIYTLSQDHGPVFWIRVFARGHVFASAPEIVKELLLNSRHIKPQSFY
KNFQYVFGARFMGNGLVSEINHEKWFHRRAIMNPAFRRKYLMGMMEKFNAESEKLCEH
LRDKADGRNEVEMLNELNNVTLDVIANVSFSMTTGAVGDPSCPFPAAVTLGLRGMQTI
IDKPWYSFDPRSESRKTRQETREAVNLIRETGRQQIEERIAARQRGDHVPNDILTVIL
DCANDLVGDKDFQMENMIDEFVTIFFAGQETTSNLLAFTIQQLGRNPDVAKKVQAEVD
EVLGQKPYVEYDDLAKLEYMMRVFKETLRLYPPVIGTTRVTAHPVIYKDMTIPAGVSV
SVLSVVMSRMEEYFDDPLLFNPDRFKPQEDENLPRHFYAYIPFSVGQRSCIGQQFAMI
EARVILGKLLQRFEFHLNQSQRLDMLDELTLKPLDKCKNYLTLRA
>GLEAN3_05521
expression
data for GLEAN3_05521
missing the middle of the gene, no
I-helix or EXXR motif
MAVLGTISGLFSMTTVYLMVAVVVIFFIYDYWTHTYFKRLGIPGPPPKPIFGNALDFG
EQPIFELLDEYANKYGPVVGIYMWRRPLLLVTDLDILRDIMVKDYHRFYNKFPMPVSS
GSFDNALFLLRDAQTDRECGETCSYKGYKIEKGMQIWIPAYTIQRDPTYWPDPLKFDP
ERFTKENREGRNPFTWLPFGAGPRICIGMRFALMEAKMAIVRSLQVVHFEISPLTKIP
PDVGLTSLLSKKSSYTLKITQRH
>GLEAN3_05655
expression
data for GLEAN3_05655
MENEYGSNNPIPPRGSRFPAGPLGVPVLGYLPFLDCRRLHRSLMSLGRRYGNVFSLKI
GAQTVVVLNSADVIRECLVQKATSFDGRPVWMLNKVNQGRGIANEQPTKKWQAHRKAW
SKVTRSLTSYDLDMEGKISHDIDRILSVLESSQGHPIDVSKTVHMALCNIICSLCFGK
SFSYNDLDFQHLLKMADKFFRYLSSASAVNFFPILWYLPLKANKAVAEGYEGIFGFVK
NLVEEKEDGVGRTQDRGFIDVCLDDAKSKKNCDGVKMRSEKTFADMPGGNIDRSAETT
ADNKNDSTSSAASHCHRPCGNSRACKEMDDDENDILKTLDEYSRRIICEDVTYVATDM
FIGGAETTHAGVMWAIVFMVLYPDIQIRVQKELDDMIGDQHLPVWSDRHHLPYTQACL
QEILRLGNVIPVAIPHVTTRDVTMSNGCHLPQGTTVLSNLYSCHMDPSAWESPREFRP
ERFLDSEGNPKRFDHFMPFSIGRRMCLGEQLARMELFLAFTHIFIKYDVSLAPNERQP
CLEGRPGITHSPYRFKICAVPRKATKWNGKDSQTSSD
>GLEAN3_05665
expression
data for GLEAN3_05665
sequence starts in I-helix,
missnig N-term
MGFGKVALEPIIQEEVIGDEPVCLSHRGSMPYTEATLMELQRLASVVPIVPHAAFRDS
ELGGYHIPKGTRMWINMWSLFRDPVDWPNPNKCDPHRFLNKDGTKVVHPDSFMPFSAG
RRVCMGEQLAKMELFLFFVSLMQRFKYVFADPDTTPSFEGVVGLNRQPRPYKVKAILR
NK
>GLEAN3_05668
MTVFAPCRNLLEINVDIIQTMMFSVFVTTFLLGCTVSICCAWLLRRPKNLPPGPWSLP
IIGYRAGSGLIHECYASLAQRYGPIFSFRRGPFLVVVLNNREIIQQALVKNADVFSDR
FTPKHVNLRIPASNPNASVVWSSGKPWGDLRKFGLTALRAFGMGKKSIEPHINLEVRY
LCEEIEKLRGHPTNLLTMLNNTTANVICRIVFGRRFEYDDEEFQGILRGFFENFSTIS
GSDLVNVFSFLLYMPWYKDFRRNLSQSKSFIMAQLKLHTSSFEKDNIRDIIDAYLADE
GISKDFTVEEFCRVVLDFFAAGTETTSTALSWAILYLCLYPDVQTKVQSELDTVVGRG
RQPAISDRARLPYCEAALMESMRIRPVLPLSLPHTTSRSVSLGGFTIPQGTTIIPNIW
AAHHDPEVWDDPDDFRPERFLENHDSPTVKKNDSWIPFGLVKYDIFLFFPDKLRSSRV
SGGSASQDGVLSTVQRFEFKLPPDQSTHSKRGISGLTMPPEPYEVCAIER
>GLEAN3_05931
MALWNLAPVTVLVLSVLFSFLGYRCLLCMIAVWTEVKRIVHIRRTLPGAPPHWLFGNV
LQEPGPVTPGFEWHRNMPKSYRRLHVFWAGWKPIVVLNHPESVRNVLNGSVGTVKSEV
YRLFDEWLGQPMATTDGNLWKRHRRLITNSFHFNVLKSHIPKMNQIADTLISVISQRG
VAGELLELHKTTSAFSSDVILQCAFSYESGCQEGVSEYMKSVSTLAGIVMKRSLSPHL
TYDFIFRRSSLYKSWRKEIDVLHRLQEKLISDRREQHRQTGCTEFSKGKDILDTLMLA
KDTEGGLTDREMRDEINSFVFGGIDTTSSALTWLLYLLATHPEYQTKVQEEIDVLFQD
RDPEVCSEDLHRTPFLMKCVKESQRMYSFVTPGRLLTEPLVVDGLTVPAGTELTLFTY
QLHHNPDVWGDDHMTFRPNRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAA
IRIFDKFGFTLVRDSTPVLRVVSVPEDEILLGNTFYNEIYFNLLLVLLQREDLHRTPF
LMKCVKESQRMYSFVSPGRLLTEPLVVDGLTVPAGTELTLFTYQLHHNPDVWGDDHMT
FRPSRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAMIRIFDKFCFTLVRDS
KPVLRVVSVPEDEILLGIHPRSKK
>GLEAN3_06386
MVPCVSRVIRVGFAHLATRYGGLISLRLGTKLTVVLNDVDSVKQSLVKEADVFTDRNV
PIEYSLALPLEGSLAFASGDKWRELRTFVAKAFRQFEVEKEKLESVIVRESERLCKAF
QDEANKSSSDIQGFDPHHIINNAVSNIICSMCFGQSYEYSDPEFHHILTVVGNSFRSL
SPSGLLLVSRYFYYTPLYREVRRNLAVLQDFIRGIVQRHQETFDPDNTRDIIDMYLMS
NSEVGCRVWAPKLDAGVVCRSCMPGLGAEVRYLGAEIGCRDWLPRLGAGVGCRGWVSE
LGLEFGYRGLATRMAADAEIRADILRVIGPTRNPTIKDRDELPLIEATCMEVLRLRPP
APMGGPRQAKRDVEVKGYVIPEGTLVLTNVWYIHQNAENWHEPERFNPSRFLSEDGKE
LVKNPAFMPYGAX
>GLEAN3_06574
MGFTGTEPGYWLLAGVLGLLVAWLVKHLLTRPKNLPPGPTGLPFLGVAWQFVKGGSNP
LALFASWSKKYGEMVSFYVGPKPIIILNSYPVIFEAFRHPDLQDRLSSRILQEIMGLK
NSGIVLSIGEVWKEQRKFTHSVFRSLGVGKKSYEDTVAAEMTQLCSVIKEKKGSPFDP
STHFMQAISNVICSIVFGTQYTYDDAAFNKDVTRGGPRSRRVHVKAPCVSFPVALLGA
NRILDLVNMNVQLAGGGGSFVFLPIPGISKIPFGKMKLFTDNIRTFTAFIQSQIESHE
KNRDPDSQRDFIDQYLNKLDETKDTVSSFTKLNLTVCIADLFSAGSDTTTTTLKWCIL
YMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILESFFGCNPDFYLSKVQ
RKGVVVPLGVPHVAAADTTIRGYTIPKGAIIVSNIFEVLNSEELWKDSGAFEPERFLT
ADGELIKRDELIFFSTGRRVCIGEQLARMETFLGFTSLLHRFTFKKPDDSSTLSFEGV
LGISRNPVPYMTCAVVRE
>GLEAN3_06989
No seq. provided Scaffold 72639
63174-64724 (no introns)
MADFQFLNSMSVVTYVVVIATSILITRYWINTKRKSEKVAKSGGRPLPGPWGIPVFGN
MLSLDQSGPHLPLMKLAKVYGNVFKIQMGSRPVLVLNGLEAIKKALTKQPAVFAGRPD
LFTFNVINETNVYGPSIGFSTYSEQWKLHRKLAEASLRHFTAGGQVQVLEKVVESEAQ
ELVRYLKASYEEKNTNLKGMLSLSVSNILLWYLLSKRSSYEDKKLIELLGFADKFSKA
TGSGNPIDFLPWLRFIMTKHTMKLKEFQKQLGQMLQGIIEEHHAVYEPGSERDIVDHI
ITTSTEIGEEELKRLGLAGNTLLQTVHDFFGAGYETTSATLEWMLLYMASYPQVQSDV
QGEIDKVIGRHRLPTLNDFDNLPLTQSCLLEIQRHASVIPFAIPHSTTKDTVLDEYFV
PKDMVVFVNLYSAHFDPEVWDQPEVFNARRFLAQDGTLDKDKEKLVVAFGLGRRKCIG
SDLARINLFIYFTTLLHQLNFSCPQGQKVSLDSAFALIRRPLEFNVELVPRS
>GLEAN3_07306
MYVFKRKAFVDLARKYGPVFSVRRGPLLFVVLNDRESMKQALVKSGEFFSDRFIPGHL
RWGIPDTEKNASIVWSNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLESRYLADEIR
ALHGEPTDLSALLNKATANVISQLVFGRRYEYDDPEFIGALEALADAFGLVSETDPVI
VFESLVHTPWYKPYRGCIFKLRDFVMSHLHNHQETFQKDNIRDFVDAFLAHDISKDYS
LDTFWRIVLDFFAAGTETTAVVTSWAILYLAVHPDVQKKVQDELDAVVGRGRQPDTSD
RPNLPYCDATLMEIMRIRPVLPVSLPHLTSADVSIGPYTIPKGTIVIPNLWAVHHDPK
EWCDPHLFNPDRFLSADGQTVVKNEAWMPFSIGRRDCLGMQLAKMESFLLFANLFQQF
DFKLPPDQPTPSMRGHTGLSMPPESYQICAIER
>GLEAN3_07335
seq begins at I-helix, missing
N-term half
VNTFLLGGIDTTSSALTWLLYLLATHPEYQTKIQEEVDELFQDREIREVRREDLHRTP
FLMKCVKESQRMYSFVSSSRLLTEPLVVDGLTVPAGTDLAIYPYQLHHNPDVWSDDHM
TFRPSRFDRQNVEGRDPFAFIPFSAGARNCIAQQFALQEIQVAAIRIYDKFGFTLVRD
STPVFRIVSVPEDEILLGIHPRRET
>GLEAN3_07404
MSESKAKSLPGPWGLPILGNLLNLGSDEPHVALMKLAKEYGNVYQIRLGSRPVVVLCG
QEAIRSALVRQPVAFAGRPDLASFQFIRKNTDVTVAFQTNDATWKLHRKLAESSIRFF
VSGQRKSSLESTVTQEVESLINYWTNNNNNDDNLQIDPVNLIKLSVSNVMLWCILGKR
HSLDDSRLREFVSKSDDFAVAVGSGNIADFMPWVRFVTVRTTQNLKKLLNTFRGWFMP
YIEEHRLQYEQDSEKDILDYLVTSSNKLDEADLQQLGVSREALQISTVFDLFGAGFDT
TSAIITWQLYYAIRHPAVIARVQAELDDVVGRDRLPNLTDRDHLPLTQAFLLETFRHS
SVVPFTIPHSTTTDTTLLGYHIPKDTVVFVNLYSVHHDPDTWHRPSDFDPERFIDNET
GLVDKTKADAIMPFGAGRRRCIGAEMGRIQSFLYFAILIHQCHVSSLDKASLDNKPIY
GLTMRPSPFNICVSRR
>GLEAN3_07406
No sequence data provided
>GLEAN3_07409
MIDLLGAGESGFGQFNALTITATLGVVTATLALWTIMRPNRFPPGPRGLPIVGSIFSL
NDSPEVVFGEWAKKYGDIFGFKAGERWMVVLNRQALIKEAVLKQGVDFAGRPDFYSVN
IFTEGFKDIVFSPYSETWKLHRKLAHSALRHFATGKPLQGLISSVYPKVEKKLAMTEG
QPIDPKVLITLIMYNVLAQMCFGLSYELEDPNVTQWMDVNNDVNEKLGLGLAADIFSW
AKYIPTSGPRMIKEITETMFGFLRSQVDEAREHYDPENINDFYSLLLKAQEDARKEGE
NVDKLTDTHIFQTVADIFGAGIQTTVETLYWAMALLVTYPEIQAKIRAEIDDVIGRDR
LPTINDRGNLPYTEASLYEVLRYSSIAPIAVPHATSRDTEFGGYHIPKGTVVFINTHS
MHYDPQEWDQPDKFLPEHFMDGSGTVREHPPSFLPFGAGRRGCLGEAVAKADLFLIFS
WFLQNYTFSKVPGKESEDILNMIPQTAFGRLLLSYEIMINKRD
>GLEAN3_07558
MSVIFKLVITVATLCLLHRVYRCVRRWLRLYRRANYMNRNVPSVEPHWLWGNVFDNPG
IFQAGLDWHIKAVSRCRRMLVYWGFWTPCLIVNHPDTMKDVLKSSTISRRPAGRTGYH
ILGEWLGQGIATSDGPHWRRNRKIVTPFFQSNCVRKHHVPVINKTTDIVLQKLSIKAD
SATPFDVSHLMFNLSGDIIYRVALSHELGCQEGGSPVLKSLNILTEIGTSRILNPIMA
NDFIFRMTSQYNMWKEALEVVNSLSEKLIRERKRMHEETSCETATKSRDLLDTLILST
YSDDKRLSHREIRDEGGKSHLPVLTREVNTCIFGGVDTTGSALTWFFMDMAKHPEHQT
KVQEEIDDALGDHPDDYIDSFDQHTFPYLMQCIRESLRLHAMLMPARGLIEPLVVEGV
TIPPKTGIMIDFYQLHHNPDIWGDDHMEYRPERFDQGNVARREALAFLPFSVGPHQCI
GQRIAMQELEIIGIRVLRRFSLGLVKEAKTKFRIVGKPASEVLLTIKHRQNSSNL
>GLEAN3_08152
MEWIVQSDTSLWLMTGAIALVVMWFVKGLMSRPKNLPPGPVGLPFLGVAWEMMRAKPD
PLALYSSWADQYGDVVSFKVGPMTFVLLNSYEVIVEAFSHPDLNNRPKSQMVEEALGL
ANNGVILANGKVWEEQRKFAHSVFRSLGVGKKSYEDTVATEMDQLRAAIEEKEGSSFD
PNLLFGQAVANIVCSVAFGTQYKYSDVEFRQILNLMNDNMELSSGGGALLFLPLPGIS
KIPFGAPKKMVKNMKEVNKFLLNQIESHEKCLDLDHPRDFIDLYLKKMAETKVLETSS
YSKINLNSAIGDLLFAGTETTTTTLKWCILYMMAHPEMQSRVQRELDHVVGRERLPRL
DDRNDLPYTNAVLIEVQRKGAVAALGVPHVAAEDTSFRGHTIPKGATILSNIWKVLNS
EELWKDSDAFKPDRFLTEDGKLIKREELIFFSVGRRVCLGEQLARMETFLGFTSLLHR
FTFKKPDNSPPLSFEGVLGITRNTMPYLTCALSRD
>GLEAN3_08632
MLAVRWYRQRSRLPGKLPPGPWGLPLIGSIFSLGKQPHLTLMDMAKKYGNVFTLNLAG
QLVVVLNGYESVREALVKKAGVFAGRPHLALTQELTEGQGIVVADYGPIWREQRKFTL
TALRNFGFGKESFEAEIQEEIRYMLAAFKEKDGSSFNSDHIVEIAVSNVVCNFNFGRR
FEYQDPKFQLLVDSIYRFAEIGTNAAAINFFPFLKHLPIPEIREVFEIDKVVRSFLQR
MIEDHKETLDETRQRDFIDAYLLEIKRRQEAMKKEEKISEADASFTETYLFHILNDLI
FAGTETMSSTIRWAILYMIVQPEVQKKVQEELDTVIAPGELPSWRDRNRMPYTEATLM
EIQRMANITALTFPHKTTEDTELAGYSIPKDTHVFINLYSVHVDPTQWKEPEKFMPER
FLSDEGGVIHNPALMPFSAGRRACPGEKLARMELFLFFSSMLRYFTLTPAPENPNPSI
YKNYGISLNPKPYKICVIPRQ
>GLEAN3_09118
seq begins at I-helix, missing
N-term half
MPVSKLQETHRFIHEHLRDYSRMNGSHKAGILTTVETLYWAMALLATYPEIQAKIRVE
IDDVIGRDRLPTINDRGKLPYTEASLYEVLRYSSIVPNALPHATTRDTEFGGYHIPKG
TTVIINTYSMHYDPQEWDQPDKFLPEHFMDGSGTVRDHPPSFLPFGAGRRGCLGEAVA
KADLFLIFTWFMQNYTFSKVPGKESEEILKMIPQTASGRLLLSYEIMINKRD
>GLEAN3_09512
missing N-term green = C-helix
MVSFYVGPKPIIILNSYPVIFEAFRHPDLQDRLSSRLLQEIMGLKNSGIGFSNGEVWK
EQRKFTHSVFRSLGVGKKSYEDTVSAEMTQLSSVIKEKKSSPFDPSTLFMQAVSNIIC
SIIFGTQYTYDDEAFKEILHLVNLNVKLSGGGGSFLFLPIPGISKIPFGKMKLMTNTI
RSLNAFIQSQIESHEKNWDPENQKDFIDQYLNKLDETKDTVSSFTKLNLTGCIADLFL
AGSDTTATTLKWCILFMMAYPQVQSRVQDELDHVVGRERIPRLDDIKDLPYTNAVILE
VQRKGVVVPLGVPHVAAADTTIRGYTIPKGAVIVSNISEVLNSEELWKDSGAFNPERF
LTADGELIKRDELIFFSSGRRACPGEQLARMETFLGFTSLLQRFAFKKPENSLPLSFE
GVLGVSRNPVPYMTCTVVRE
>GLEAN3_09692
missing C-term, no EXXR, PERF or
heme signature
MGFTETEPGYWLLAGVLAGVLGTWLVKHLITRPKNLPPGPTGLPLLGVAWQIYKGGSD
PLALFASWSKKYGEIVSFYVGPKPIVILNSYPVIFEAFRHPDLQDRLSSRLLQEIMGL
KNSGIGFSNGEVWKEQRKFTHSVFRSLGVGKKSYEDTVSAEMIQLSSVIKEKKSSPFD
PSTLFMQAVSNIICSIIFGTQYTYDDDAFKKILHLVNLNVKLSGGGGSFLFLPIPGIS
KIPFGKMKLMTNTIRSLNAFIQSQIESHEKNWDPENQRDFIDQYLNKLDETKDTISSF
NKLNLTGCIADLFLAGSDTTATTLKWCILFMMAYPQVQSRVQDELDHVVGRERIPRLD
DIKDQRLWGLQT
>GLEAN3_09825
missing N-term
MSRVLERMNETLTLIPETDLVNIFEPLIHTSIQRYKRYRESIFSDRDFIMSHLKSHQE
TFQKDNIRDFTDAFLADDISKEFKLEHFWRVLLDFFAGGTDTTAVVTSWAILFLSVHP
DVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIRPVLPVSVPHMTSDNVS
FRGFTIPKGSIIIPNLYAVHHDPKEWNEPDKFNPDRFLSADGKQFQMNDAWMPFGVGR
RDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANGITMKPKPYKICAIER
>GLEAN3_10143
MYTPCATIAFSNGKPWVDLRRFSLPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGE
PTDLATTFSKATANIICQLLFSQRYEYSDGEMSRVLKSMEEFLSLIPETDLVNIFETL
IHTPRYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFELEHFW
RVLLDFFVGGTDTTAVVTSWAILFLSVHPDVQRKVQAELDTVVGRGRQPNTLDRPDLP
YCNATLTEVMRIRPILPVSVPHMTSDNVSFRGFTIPKGSIIVPNLWAVHHDPKEWNEP
YKFNPDRFLSADEKQFQKNEAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFEFKLP
PNQPNPSMRGANGITMNPEPYKVCAIER
>GLEAN3_10246
sequence begins at I-helix,
missing N-term
MFFAATDTIHEVLLWLTLFIATRPDIQLRMRKEIDTVMNSGGSESLYDQRSRLPVVRS
IILEVLRMRPVAPMGVPHVVSDTGSIDGYSIPKGSQIFSNIWGMHNDPDIWKDPSVFN
PDRFLSNTGVLDEVKAKLVIPFSIGPRSCPGEQVARRIIFQTLVSLISRYRISLPAGR
PTPRLVNEERSFTMSPERFKIVFSRIQADPVQLKDLEIPTTE
>GLEAN3_10549
missing C-term, green = C-helix
MVAIELVILVLVLWMVKSFVSKRYGKQYRLPPGPPALPIIGNLPVFMTGKAPYRVMKN
LADKYGSVFSLKIGTTPFVVLNDVDSIREALHHRSVDYAGRPCLYSVSLMSRNNCDIV
FGDYTPGWRLHRRATAAALHKNIRGVGCEAMGIKITREAEYFTQLLMKKAGPVVDIEN
ELNLAIINIISQLTFATRYTLDDPEYQALVDSNQKFSSLLKPGDPIDVFPALKVRGNT
IVIFNCFYLGQHNI
>GLEAN3_10563
MFSLLSTSIILGALVFLICFWFVRRPKNLPPGPWTLPIIGYRFKTGHVHEAYVELAKK
YGPVFSIRRGPFLFVVLNDRESVKQALVKSGEFFSDRSVPGPINWTIPDAKKKASIVW
SNGKSWVDLRKFSLPALRSFGFGKKSLVPQINLEARYLAEEIKALHGEPMDPSAVLNK
ATANVMAQLIFGRRYEYDDAEFIEILQVMVDIFAFVSDNDPVNVFEPLIHTPWYKPYR
EMMIKQNKFIHAQLYFHRETFQKDNIRDFVDAFLADDISKEYTLEDFGRIVLDFFTAG
TDTTAVVTSWALLFLSVHSDVQRKVQNELDAVVGRGRQPDTSDRPNLPYCDATLMEIM
RLRPVVPISLPHMTSADASLGPYTIPKGTIIVPNLWAVHHDPKEWCDPHLFNPNRFLS
ADGQTVVKNEAWMPFSTGRRDCLGMQLAMMETFLLFTNLFQQFEFKLPPDQPTHSMRG
HPGLTMPPESYKICAIER
>GLEAN3_10576
MFFLSSTSVLLGVLGFLFCLWFVRRPKNLPPGPWSLPIIGYRFDSGLIHEAFVDLARK
YGPVFSVRRGPFLFVVLNDRASMKQALVKSGEFFSDRFVPGHIRWGIPDAEKNAAIVW
SNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLEARYLAEEIKALHGEPMDPSALLNK
ATANIIVQLVFGRRYEYDDPEFIGALQAMVDIFALVSDTDPVNVFESLVHTPWYKPYR
DCTFKLRDFIMSHLYNHQETFQKDNIRDFVDAFLAHDISKDYSMDTFWRIVLDFFAAG
TETTAVVTSWAILYLAVYPDVQKKVQNELDAVVGRGRQPNTSDRPNLPYCDATLMEVM
RIRPVLPVALPHMTSADVSIGPYTIPKGTIVIPNLWAVHHDPKEWCDPQLFNPDRFLS
ADGQTVVKNEAWMPFSIGRRDCLGVQLAKMESFLLFTNLFQQFEFKLPPDQPTHSMRG
HPGLSMPPESYKICAIER
>GLEAN3_10719
No seq. provided Scaffold
58259 989-2431 no introns
MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPVLV
LNGFQAIKEALTKQPAIFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET
CLRHFTEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS
KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLL
SGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE
TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH
AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWEKPEVFNPRRFLAQD
GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPDGQKASFDATY
KFPRRPKVFNVQIDPR
>GLEAN3_10720
No seq. provided Scaffold 58259 10218-11660 no introns
nearly identical to GLEAN3_10719
MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPLLV
LNGFQAIKEALTKQPAVFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET
CLRHFSEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS
KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLF
SGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE
TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH
AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWEKPEVFNPRRFLAQD
GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPDGQKASFDATY
KFPRRPKVFNVQIDPR
>GLEAN3_10721
No seq. provided Scaffold 58259 20505-21947 no introns
nearly identical to GLEAN3_10719
MADNQFLNGRPLPGPWGLPVIGNMLSLDQTGPHLSLMKLAKVYGNVFKIHMGNRPVLV
LNGFQAIKEALTKQPAVFAGRPDLFTFKVIDETNVYGPSISFSTYSEQLKLHRKLAET
CLRHFSEGGQAQFLEGVAKGEAEELVRHLKVSKCEGGLDIRRLIRLSVTNMILWYMFS
KRSSYDSKLLNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEYRKKFGKLF
SGVIDEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAFDFFPAGTE
TVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQSCLLEIQRH
AAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWETPEVFNPRRFLAQD
GTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPEGQKASLDATY
KFPRRPKVFNVQIDPR
>GLEAN3_11009
MVFTILGFTVSWPLVGTLVAIIVGYEIWHMTYFKRKGVKGPIPFPVVGHSIDLFQGIH
WKSQEYVRKYGNVFGMFMFSSPVFFISDLDMVKDIAVKSFSKFMNHKTFVLKSRPFDK
ALIGLKDQHWKDVRNVITPTFSAVKMKQLSGLLNESSDTLVKNFGAIQEKEGNVKVFS
LFEAFTMDGIAKCAFGIQVDSQNDPSDPFVTHAKDLMKNRFALKFVIAGIFPRIGKIM
DYFEVSTLPPEVSSFFISVVDQAIAARKESGGSKMKDFLQLMLNAAQENKSEKEMIEE
KPQDEKVEGDLYALIEDQDNKSYFDLKKSKKTTLTKDEIYGQAMLFFLAGYDTINTTL
GFLTYSLATNPEVQDKLIEEIDNETPARDDVGYNSVAKMSYLDNVVCEILRLYPAGLI
IERQCNETHVCNGLTIPKDSQIFFPVYAIHRDPAIWPDPEKFDPERFTKENREGRHPL
AWMPFGAGPRNCIGMRFALMEIKMATVRLLQKYRFETCPETQIPPKFAKMSLKPDDDL
YLRVVERS
>GLEAN3_12080
MAVVLPAGIGVTVCIVALSAFLFYYIARFLRIFLLISKFDGPPAVPIFGNTLQFKSDP
REHFLYDVGVREKYRHQSGGVIRMWFGPLPIVLIYNAVHCEALMNSSRHINKGTEYTF
MEPWLGKGLLTSGGQKWFHRRKLLTPTFHFSILQSFMEVFNEQSMILVKKLGKFADES
ETVNIFPLVTHCVLDIICDTAMGKRTNAQSDNENEYVKAVGRMSHLIVSRIRNFMLWP
DWIYNRTDAGKDHEKALQILHGVTNKVITTKTQMTPNGGTYEDDAVTRKRKRIAFLDL
LLQMHREDATFTLEAIREEVDTFMFEGHDTTAAAVNWALLLIGQHPTVQARLHDEIDQ
VFGDSERPITSDDLSELSYLSCVVKESLRLLPSVPGIGRDLDEDIIVNGKVVPKGASV
FLSIYGIHHDPEQFPDPERFDPDRFLPENSTKRHPFAFIPFSAGPRNCIGQKFAMMED
KVLLINILRRFSIKSLQTLDETKPAGQIVLRPADGNILVKLSHRK
>GLEAN3_12081
MAAVLSAGIGVTVCIVALSAFLFYYIARLLRIFLLISKFDGPPAVPIFGNTLQFKSDP
RDIFLYYQRNFERYRQETGGITRIWYGPLPAVIVYSAKHGEALMNSSHHTSKGIQYKC
LEPWLGKGLLTSSGKKWTHRRKLLTPTFHFSILQNFMEVFNEQSIILVKKLEKFAEKS
ETFNIFPFVTHCVLDIICDTAMGKRTNAQEDSENEYVRAVDSMSRLVVRRMRNPLCWS
DWIYYNMGVGKEHEKSLNVLHKVTNEMIQERLSDIPSNENGTNQDDAVTRKRKRIAFL
DLLLQMHREDATFTLEDIREEVDTFMFEGHDTTSAAVSWALLLIGQHPEVQVRLHDEI
DQVFGESERPVTSDDLSKLPYLSRVIKETLRITPPVPGLSRELDEDIVVDGKVLPKEA
MVIINIYGIHHDPEQFPDPDRFDPDRFLPENSTKRHPFAFIPFSAGPRNCIGQKFAMM
EDKVILINLLRRFSVKSLQTLDEAKPAGLLILRPAEGTILVKLSLRK
>GLEAN3_12148
N-term only
MANSVNIFESVGGVFGEMTLATMILVSTIFVLGIAWAFKSLLGPQNKDIKLPPRLPTG
IPFLGQAVAFNKSPIDFLEDAYEKYGDVFSFTMVGKTFTYLIGSEASALLFNSKNENL
NAEEVYSNLTVPVFGKGVAYDVPNPVSLINLIIDYIRGA
>GLEAN3_12500
missing N-term, seq begins at KYG
motif
MDMAKKYGPIFSLRRGPFLVVVLNDIATIKQAFVKSSEFFQNRFVPGHIRWGVPDTNK
NASIVWSSGKPWKDHRKFSLHALRSFGFGKKSLVPQINLEARYLADEIKNLRGAPSDL
SPTLNKATANVVAQLVFGRRYEYDDLQYIEVLQAMVAIFAGISITDPVVVFESLIHTP
WYKHYRDATFKARDFIMCQLYMHRDTFQKDNIRDYVDAFLADDISKEYTLEEFWRIVL
DLFAAGTETTTLVTSWAILYLAVHPDVQKKVQNELDRVVGRGRQPTTTDRPDLPYCDA
TLMEVMRIRPVLPVLSPRECLGIQLARMESFLLFTNLFQQFEFKLPSDQPTASLRGQP
GLSMHPHPYHVCAIER
>GLEAN3_12926
missing C-term
MGPFVVIVVYVLGLFACLLGIGLLVFVAAILRTEQKFSDLPGPKRDSFLFGNIFSIFE
AKKKLGQPIGLIIYTLSQDHGPVFWIRVFARGHVFASAPEIVKELLLNSRHIKPQSFY
KNFQYVFGARFMGNGLVSEINHEKWFHRRAIMNPAFRRKYLMGMMEKFNAESEKLCEH
LRDKADGRNEVEMLNELNNVTLDVIANVSFSMTTGAVGDPSCPFPAAVTLGLRGMQTI
IDKPWYSFDPRSESRKTRQETREAVNLIRETGRQQIEERIAARQRGDHVPNDILTVIL
DCANDLVGDEDFQMENMIDEFVTIFFAGKLFSAPENCNYYS
>GLEAN3_13039
MDFRWHQLTPYHLLVTALVGLVAAWFVKHLMRLPKNLPPGPTGLPLLGVISKFIGTSD
PLALLSEWADRYGEITSFKLGPQLVVLLNSYDAVAEAFRHPDLQGKPRSRMTEEASGF
GPNAGLLIGAGQSWKEQRKFTLSVFRNLGVGKKSYEDTVAAEMAQLGRAIKETKGSPI
DPNILVGQAVANVICTVIFGTQYKYDDSEFQHLLYLMNFNMSRSGAGGWLFFIPIPGM
SKVPFGVIKAMMTNAKEFYAFIDHHIEAHKKNGGPDIPKDFIDQYLIKLKEMKGTESS
FSHSSLKAVISDLFFAGMETTTTTLKWCILYMMAYPEVQSRVQAEMDQVVGRERLPGL
DDRQNLPYTCAVLMEVQRKGAVVAMGVPHMAAADVTIKGYTIPKGTIIFPNILNVLNN
KNFWDDSDAFTPERFLSDDGTLIKREELIFFSTGRRVCLGEQLARMETFLGFTSLLHR
FTFKKPDNSPPLSFEAVLGGTRSSLPYLTCAIPRDPTENVA
>GLEAN3_13102
Missing N-term, green = C-helix
VTPLFSDDLKDALFNSKDEHWKGVRNIVTPTFSAAKMKLMSPLINKCADRLLKHLEKR
QELHGNIECRELVGAFVMDTIASCAFGLNIDCQENRFGPFVTNAKKAFSGNIFTSPYL
LLVSLFPWTLPILEYLDFNIIDRKSTQFFADIIKKTSAVRRESKNSDSAKRIDYLQLL
LDAQERDITGNNKNKADGIHGDPEDEAIDEGGLQKKSSKKIRLNESEVISQALIFFLA
GYETTNTTCGFLLYFLATNPDVQDKLVDEINDVAPKAEDVGYQSLSKMPYLEQIFCET
ERIYPPAIMIDRVCNEPFNVNGFTVPKGMRIFIPIYTIHRNPNLWPDPETFDPDRFRK
ENREKHHPCAWMPFGTGPRACVGMRFAIMEAKMVIVRILQKYQIETCPQTEMPPKQAT
NGFLSPPNGITLRAVKRN
>GLEAN3_13199
MEKCEISSSQLESSLPYPSSFTSVILCVITLYLVYRLFRSLREKKDDGRRYPPGPKGL
PLIGSLLDLRSDVLMKMKRFADEYGSIYTINLGPTNLVVLNNKDVIKEAFVTNAACMS
HRPQVMFDTLKQSQGIILKQPGDLHRKNRNFLSTGFRMIKLGVPEFSYHLHPHIDNFK
DKLIASKGQPLPICSTSHRLLFEIFTDLMLGEVPDPNSEKVNKLLDILYIALSPNPML
ALVGNLIPNWVKSALSDWIDMADCLAKVTRYLKEHIHQHRETYEGDPTNLINAFLHKE
KLELEKTSKLREESSSGGDSDTEIEKAAIGTSFMVNDQEENEKSLAAILLDVFQAGCT
SVVASLQWLLLDLAIHQDVQDKLRMEVDRNVEQRNQRPSLLLRSDNFPYTKAVMLESI
RLHPPAPLGAPHCAAQDTKLAGYDIDKGTIILPNFLMALQDFNQWEDACDFRPERFLN
EDGTLATRETLGFSIGPRQCPGKLLSNTVISSILTDVIQDFTLHLPPDHNFSLKGNPG
IIYNPQPFSIIAKQRTS
>GLEAN3_14092
MTVLGMISGLFSMTTVYLSGAVVVIFFIYDYWTHTYFKRLGIPGPPPKPIFGNALDFS
KPMYDLLDEYANKYGPVVGIYMMRRPLLLIRDLDMLRDIMVKDYHKFYNKFRVPISSG
NLDNALFLLRDAQWKRVRDIVTPTFTGKKMKMMSGILNEGADTMVEKLLQSCKEDGNI
DSLELFGSFVMDSVASCAFGLKVNSQEDKDNLFITNGKRLFDFIFNSPVFMAVYVIQF
FTDVTEKAVALRESHPSKQRVDFLQLLIDAKNGKSKMASENDKDDIHNKYFEDTGAEE
NISTKTQKYMTREELLAQGMIFLAGGYETTASLLTFASYLLASNPDCQDKLIAEIDDI
APKRDDVGYTTISKMPYLDQVVCEVLRIYPPGPLTDRECGETCTYKGYKIEKGTQILI
PTYTIQHDPTYWPDPLKFDPERFTKANREGRNPFTWLPFGAGPRICIGMRFALMEAKM
ALVRSLQVVRFGISPLTKIPPDLGNTSLLSKKSSYTLKVVQRH
>GLEAN3_14573
MAFTILGFTVSWPLVGTLVAIIVGYEIWHMTYFKRKGVKGPIPFPVVGHSIDLFQGIH
WKTQEYVKKYGNVFGLFMLSSPVFVISDLEMVKDITVKSFSKFMNHKTFVLKSRPFDK
VLTSLKDQHWKDVRNVITPSFSAVKMKQLSVLLNESSDTLVKNFGAIQRREGKVKVFD
LFEAFTMDGIAKCGFGIQVDSQNDPSDPFVTNAKDLMKNTFSLKFLIAATFPGIGKIM
DYYGVSTISPKIASFFLSVVDQAIAARKESGGSKMKDFLQLMLNAAQENKSEKEKPQE
EKVEGDLHALIEDQDNKSYFDLKKSKKTTLTKDEIYGQAIIFFLAGYATIHTTLGFVT
YCLATNPEVQDKLIEEIDNETPTRDDVGYNSIAKMSYLDNVVCEILRLYPASVLVERQ
CNETHVCNGITIPKDSQVMFPVFAIHRDPALWPDPEKFDPERFTKENREGRHPLAWMP
FGAGPRNCIGMRFALMEIKMATVRLLQKYRFETCPETPIPVKFAKMGLKPDDDLYLRV
VERS
>GLEAN3_14843
MDKNLPDVLTDDVLINAVSVTTYLMLIITSILFTRYWINAKRESDKVAKSGGRPLPGP
WGLPVFGSMLSLDPSGPHLSLMKLAKVYGHVFKIQMGSRPVLVLNGLKAIRNALVKQA
VVFAGRPDLLTLKAINSANLFGPSLSFSQYSEEWKLHRKITETSLRHFTAGSQVPFVE
SVAKGEAEELVRYLKVPKCERSVDIPCLVRLSVSNLMLWFMFRKRASYDDKKFIESIN
ILDGLSEAGSGNLVDFLPWLRFFSLDSSTGFLEAQKRFNKSVRGLIDERHELYDPESA
QHVFMDEIIALGHVREEDEFQRLGLAGRTVLQNAFDFFGAGLGTTSATLEWAMVHMAL
FPKAQYEVQCEIDKVVGRDRLPTLNDREYLPLTQSCLLEIQRYAAVTPFGIPHSTTKD
TILDGYFVPKDMVVFVNLYSANFDPEVWDEPEVFNPRRFLAQDGALDDEKMKLLKAFG
LGRRRCVGSGLASINLFIYFSTLLHQLRFSCPEGGEVNLEFRFGITLHPKNLNVHIES
R
>GLEAN3_14874
seq starts at I-helix, missing
N-term half
MIFLSAGYETTAALLTYASYLLASNPDCQDKLIAEIDDIAPKRDDVGYTTISKMPYLD
QVVCEVLRIYPPGILSDRECGETCTYKGYKIEKGLQIWIPTFTIQRDPTHWPDPLKFN
PERFTKANREGRNPFTWLPFGAGPRICIGMRFALMEAKMALVRSLQVVRFEISPLTKI
PPDVGNTSLLSKKSSYTLKIVQRH
>GLEAN3_15096
missing PERF and heme signature at
C-term
MDLDTSFITDVSLVGGALKATFGVLLLYFIYCYFNDNGSQYHHPPGPKGLPFLGSVLS
MIFTRKHPHEVSLKGVAFANGEVWKEQRRLLMSVLKKADVDGISYDDIIVSQAERMLA
EFEKTNGQSFDPRDLIHKTIGNVVMKILTGVTYDYDDLEFSRVMTNTVRVFTLLGEAG
ILASIPALAAIPSPTKWKIIKAWEEIYGFLKEIIERRKARFDTNQKATDFMGSFVKAM
CAKGTEIGPASCDEINLMVSSFDLLLAGWDTISTTLAWGLHTLASYPEAQRRVHQEIK
DVIGLDRLPQFSDHHSMPVTMATIAECMRFRPTLAVHIPHVATQDCKIGGYDIPKGGQ
ITLNMWSKGLSWRATGKN
>GLEAN3_15120
short at the N-term
MVLKANRYREKLGRIPVHLSLLKCVSFILLRLFGPICSITLRVSDSRISTPMKNKIRH
FYSALSSYEFAPSVKKLGDAKIDWGVSGWRPTKEAAGLSRTQHGIVSETPGGSSHISY
FFVSFALITLIAYQNLVQTFQLNQSSFTVVDIAVVVTVSAAVIIVVVVVVVFALLSGI
KRYMRQVIDETLRVSILAPYAARYQDFDVVLGGHVIPKGTPVIHALGVSLQSEKYFPN
PKTFNPENFSAANVKKRPRDAYHPFGFAGRRVCPGQQFAYKEVAIFLAVFIRAFKVKV
VPGQKPEHVHGLVSHLVNKDGGEVWITVEKR
>GLEAN3_15156
sequence starts in I-helix,
missing N-term
MNPDIQEKLINEVDEVTPSRDSVGYSSIATMPYLDQVVCETLRLYPPAGISNRMCNET
FVHNGLMIEKGTQVFIPVYTIHRDPDYWPEPEKFDPDRFTKEAKAKQHPFAWQPFGAG
PRNCIGMRFALMETKMAVVRTLQNYRFETCAETENFWEQTLSGNLY
>GLEAN3_15256
MDSVPLDQLGTLLIVTLLCMVGLRFLISWIRDGKKRKLPPGPTAWPLVGSIPFMIRSK
LPPLDLLTALADDHGDILRLRIMTTEVVILSSYDIIKEAYNHPNLQGRPRIVPIEDYI
GRRNIGVAGSSGPVWREHRQFVFSAFRHFSTGPQNIDHIIHVEARTLVSEVSTLSRKP
FNPHKVLNNAVSNVIAVMLFGKAYNYADDDFNQLQEIINTNLQEAASGGIYLFIPFLA
KVPFSPANKIKQGIKRFTDFTKKIVNEHRHVFEPDHPRDIIDLYLKKIQEDHEGGIES
SFDETNLHVLCGDLFGASTETTNASLKWSIIYMMIHPEVQTRVQEELDRVVGRDRLPE
LDDRASLPFTLATLHEIQRMGHVFPLSIPHACTADMRLADYDIPEGTLVVSNLWRLSR
DSRIWSDPNEFRPDRFLNENGDCIKPEALIPFSIGKRVCIGESIARKEIFVFFSSLLH
RFHFSIPAGGSPPSLQGRLGLVYDTQDYITCASPRT
>GLEAN3_15442
MIETPASSRDEPNLTVISSLLSLTSLTTSSLLLCVLVLVLVLVILTVDRRNSVTPVED
GVTLLPGPRGALATVKVLLGMTKRASRYMFERTKTYGKVFGIGDGPDSYVIIMDYASL
EEAFVTKADLCSDRGGLIRTMLESVLPERGSIFNNGTAWNEVRGFVAKTVKNPDFGAT
TISRCVSETIPTLCSELDALVDGPFDPKRTYQCFCVDVFHRIAFGIGISYENCTVKKF
IDNLDTVVAQTLDPSTLLLTRPSLINSFLFRGYRSKLRSSSSYVKTRAEEVLAKLDAP
EQSSSLIALFVKYIEGIRSSGKASYSEPDDGWVIVYNMFFAATDTIHEVLLWLTLFIA
TRPDIQLRMRKEIDTVMNSGGSESLYDQRSRLPVVRSIILEVLRMRPVAPMGVPHVVS
DTGSIDGYSIPKGSQIFSNIWGMHNDPDIWKDPSAFNPDRFLSNTGVLDEVKAKLVIP
FSIGPRSCPGEQVARRIIFQTLVSLISRYRISLPAGRPTPQLVNEERSFTMSPERFKV
IFSRIQADPVQLKDLEIPTTE
>GLEAN3_15568
missing N-term
MDLFDESATRLRDHLIPLADDKTLVRMLEHFNNVALDVIGKVGFDMDINAVENSDCPF
PSAVALALQGINVAFQNPLTLVDPRSKARQYRRSVRDAVKLVFKETMRLNPPVSGLSR
VLASDVQSCGYVIPKGTTIMFMTYVTSRLEEFFDDPLLFNPDRFIPTDETPRHFFAYY
PFAIGQRNCIGQQIALIESKVILAKLLQSFDFRLEQSQRHIILREVTNKPIDGCKNYI
SLRD
>GLEAN3_16056
missing N-term starts at KYG motif
EYFQKYGKIVGVYNFRKPTLLVSDPDVVRKILVKDFDHFVNRQEAAIKTDLLAGGMLA
AKDERWRKLRSLLSPSFTSVKLKSMVPLMRECAEVLVDIFTKREEEGKTIECKELYGP
YAMDVIGSCAFGLRVNSQTNQDEPFIKYAKRIFNFNVTNPAFLISMLFPWTAPIFNFF
KLGSVPKDIEQFFLGVFEKAIADRQANGDTGKSDFLELMMSAHKEDDDVNIMNDAEDD
IADLAQETSWKPKQKTALNRQEIIGNSITFLLAGYETSSTALGFVSYLLATNQHVQDL
LVAEVKEMAPSQSDVTFETVTKMEYLNMVIMEAMRIYPPGEVLERVCNKEITYNNITI
KKGQYIFIPIWNLQHDSDLWPDPETFDPERFSKDRRGNNHPCGWLPFGIGPHTCIGVR
FAMLEIKVAMVCILQKFRFDTCPETEIPPKLGNLSFLTPPNGITLRITPHSRYNLRSR
>GLEAN3_16152
missing the middle of the gene
between C-helix and heme signature
MEVEVLGNAMSLTSLLVLIALGCIVLLLLNDYWAHGYWKRRGIPFEPPLPIIANMRVF
AKDIPQAAGELRSMVSILEGERWKEVRNILTPTFSAAKMKQMSNLINTNADTLVKYIG
LKQEEKGRIECSEFTKEAKAKQHPLAWQPFGAGPRNCIAMRFALMETKMAVVRVLQNY
RFETCAETEIPPTFGRSGFLTPPNGVTLSAVPRTDKVA
>GLEAN3_16251
MSYRCAVCYCDHEMEADMRVYSRNGRSDLLALFSSWADNYGDIVSFNVGPKLIVLLNS
YEVVVEAFRNPDLQNRPGSQIMKEISGGGKNGVLLSVGDTWREQRKFVHSVFRSLGVG
KKSYEDIVAREMDQLGSAIEEKRGTPFDPNTIFGQAVANIICSIVLGTQYKYDDTDFK
HIIELLIRNIELFGGGAWIFFLPLPGISKIPFGAVKALVSNTRELNTFVRTMIKSHEL
NLDPEHPKDFIDQYLNKLNETKGTDSSFSQANLEAAIKDLFFAGTDTTTNTLNWSILF
MMAHPDIQSRVQTELDQVVGRHRLPKLDDRKNLPYTSAVLMEVQRKGAVLALGVPHVA
AADTQVRGYTIPKGATILSNLYKVLNREDLWGDTNDFRPERFLSKEGEFIKREELIFF
SSGRRMCLGEQLARMETYLGFTSLLQRFTFKKPDHTPPLSFAGVLGGTRNTVPYVTCA
IPRETKI
>GLEAN3_16442
MALLTDSYGAALAIISLVLTLIWYLKNSSSWRYRNLPPGPFGLPILGILPYLDEKNPS
VDVLRMSKRYGPIFGGFLGSHRAVFLNDYDTIKETFARTDDVFSDRPRIAIFELYSDG
QGVACCYLENHWKEQRRFTLRALRNYGMGRLQGSLRNAIDVEVSHLMTAMSKIKKPFD
PRIYVESAVCNITCKMVFGRRYEYEDAQFTEFLHNLEKIFELSGVAGVVNYLSWMKYV
PFSGYHALGDCVRHLENGLFTKECEEHQKVFDPNRSPRDLIEAFLLELHKKQQEKVEN
NGEDMAPTGFNHQQMLHFIADVFAAGTDTTASTLKWSLIHLMKFEDEQRKIQEELDHV
IGRDRMPSLEDKPNLPYTQAFLAESLRYGCAGPLGVPHGAKEDTKLRGYDIPKGTVIV
ANLYAVLYDPKVFPEPDKFNPSRFIDDEGRVKTSLIERAHSQFGIGVPHGAKDDTKLR
GYDIPKGTVIVANLYAVLYDPKVFPEPDKFNPSRFIDEEGRVKTSLIERAHSQFGIGR
RQCVGMDMALMERFVFITNLIHRFNARAPNGPDSVTIDARGGLAFVPMPFEMQLTER
>GLEAN3_16816
starts at C-helix
MKCSIIFEEGKSWSDMRRFFLLVFRKFGMGKRSAGDRVNEEARVLCKEFILAEGKPFD
PFRTVNYAVANIIRSITAGQRFEYSNGRFQQMVQLITDVLGAENFYGPLEMMPWLYFT
PLFTKERNMCNALIGLMKQEVVEHRETVDVDNPRDVIDMCLAEIDRAKDAGEETFFDE
SLLWRTLLDIYLAGTETVATTLTWAMLYMAGYPHIQEKVCQEIDDVIGHEGTPHYDDR
TRMPYTEATIAELLRIRPIAPLGIPHINTESIVFRGYLLPERSIIMMNMVGMHLDPEL
WPDPESFDPTRFLSEDGKATKKPEGHMPFGAGRRICLGEQLAKVELFLFFTNLLQKFT
FRIPDGVLPDYGFGHRTSTLAPKKFDIVASLR
>GLEAN3_17582
MIDLLGAGESGFGQFNALTITATLGVVTATLALWTIVRPKRFPPGPRGLPIVGSIFSM
NDSPEIVFGEWAKRYGDIFGFKAGERWMVVLNRQALIKEAVLKQGVDFAGRPDFYSLE
IFTDGFKDIAFSPYSETWKLHRKLAHTALRHFATGKPLQDLISSVYPKVDKKLAMTEG
QPIDPKVLINLIMYNVLAQMCFGRSYELEDPTVAQWMDTNDDLNEKFGLGLAADMFSW
ARYIPTSGPRMIKELTEKLWGFLRLQVDETREHYDPENINNFYSLLLKAQEDARKEGE
NVDKLTDTHVFQTISDIFGGGYHIPKGTVVMINTHSMHYDPQEWDQPDKFLPEHFMDG
SGTVRDHPPSFLPFGAGRRGCLGEAVAKADLFLIFTWFMQNYTFSRAPGKESEDILKM
IPQTASGRILLSYEIMINKRD
>GLEAN3_17986
MSSSVFYYGFTPFILAGLIAFLLSFILVRSGKKKYKNLPPRGPMEWPVLGGLPSLSGK
DMPHVIIADMAKKYGPIFGMKMGSFYAVILSDYSLVRQAFAKSNDEFSDRPKITMIEN
IVQGKGLIVSYFGQMQTEHRRFSLSALRSLGMGKFKMEETIVDEARQLAMIFTSRQSK
PFVPFHDIVVSVSNIICWLSLGKRFDYNDESFKAILASLFESFEIAEIAGLFNFLPHW
SGKCTTEEQEKSRRSPTCYVEMYTEQMIEAERETPGEHTFTKENMYHAVADLFAAGTE
TTATTLKWALMYMILHQDVQEKIHKELDDVVGRNRLPSLTDRPNLPYTEATLLEIQRF
ATITPLGVPHAPVQDTVLNGYDIPKGTVVLPNLWAIHHEPDLWKNPDEFNPARFLNPD
TKQVVQREELIPFSIGRRRCLGEQLAKVELFIFFTHLLHRFQFTTPEGAPPLTMKARN
GATLNPEEFEICASLIDDVM
>GLEAN3_18242
MPHLRLVDLNSFSRSERRPPSWSPMSSYPDMYSGPQQMLQRCVIVQKDEKGYGLTVTG
DNPVFIQSVKDHGAAYKAGVQTGDRIIKELLTRSKYLKAPDQYRGLKSLYGARTLGNG
LISELNHEVWMKKRALFNPAFHRKYLMGLMNEFNSCSAKLVNHLIPLSDGQTEVVMTK
EFERLTMEIIGKVGFGLEDDIIGNPDSPLCQLFPKVMSGMQSVYRRPLLKYSILPKDI
KYKHEVRAAANEIRAVGRRCILARIDALRRGDQVPQDILTYILQESNNLEGIKDFDLE
DMVDEFVTFFGAGQETTSNLLSFTLLHLGRNPQVLKKLRDEIDTVLKGRNYVEYSEVS
KMKYLTLVLKETLRMNPPVGMLNRVLPYEMDLCGYKVPKGSTVLMPIYGMGRDEKHFK
NPEKFDPERFTRDEDSPLFAYIPFSLGARSCIGQTFAMIEAKVVICKLIQQLEFQLVP
NQSFEFVQEVTLKPKDGCKSYITMRNL
>GLEAN3_18372
MGSLLDLVDTSTALIALLVFLLSLVVTHRFYQNGDDSKSLPPRGPREWPVIGIIPSLV
GDEPYKLLARMSPQYGPVFLTRIGMISTLIINDFDSMKEALVRSGDDFADRPKFGMME
YITKGKGIISAHVGDSQREKRRFGLAALRELGMGKSRLVENISDEIETLCDVFMRYEG
KPFNPAHDITVSVSNIISWIVFGKRFKHNDTAFKRFLEIIHEGIEICESTGAAGFIPM
LQYLPLPVWKKLADNDREWKEFDAVMIAEALQRPASEEPCFVQMYNEEMKRMEQRAKA
TAAGSKASDGTDGIDASEETVIFNEEDMRQTLGDLFAAGTDTTATTLNWAILYLILHP
DVQCKIHEELDNVFGRNRKPSLEDRKYIPYTEAVLLEVQRKATIVPLGVPHAASWDTK
LNGHHIPAKTLILPNIWAVHHDPKIWKNPDEFQPERFLDDKGTTVLQREELIPFCIGR
RKCLGEQLAKMELYYFFTHLLHRFEFRISEGALPPSTVGKPGATRNPMPFSVCAIPRF
S
>GLEAN3_18468
MAGIVSVLSVVVQSVGSPALVVGGITILLYLIYQVFYQGGKKYPPGPTGLPVFGSVFS
LLLTKRQQHEVVNEWSHKYGPIFAFKMLGTRVYVLTDPGLVSDAFHSSPDINDRMSMA
IADEITGRPNNGLVASVGEVWKEHRRFFLTVFKKIDSGVIRFEEIIGSEVERFLKEVE
KKDGESFDPCNAINVGVANIITRLVTGVNYDHEDKDFQHLIDCSDRIFQIMGPAGLFT
LIPILAKIPSPVKTEITNLSLEILKMVRKSIEIHKTEFDPEEPARDFISCYLKEMHRR
KEENDPGTFDDINLLTSSFDALMGGWETSTTAISWMTFTLAAHPEAQRRVRQEILDVV
GSDRFPRFSDRRSMPITEATLAECTRLRPAAAIHIPHIASRDCKLGGYDVTKGSAITT
DLWYLALSPNLWKDPQDFKPERFLDAEGQFNKKLEPLPFGYGKRVCPGEQLARMEIFI
FVTYLLQNFSLTLPEGAPTSNSGKHGLTNKPNPFKINAKKI
>GLEAN3_18515
MANSGYVASSLFFGFQEICYRECIIVLTTFLFVRLLVSRGLFKKSTPPSNPPPPSPTG
LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAVREAFVEKADA
FSDRMVPPLLGEALGKRGNNHLYKHKKTYTRSLIWPAGGGWKKTRKFGHSAFRSMGLG
HRRIEEKVVEEGRILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFV
KIVGDIERFSQNLSLTSLGNVFPLLYYTPLYTDFRRPMRDIISYIKSMVKQHQQNFDP
LNVRDIIDLYIVELKQQREQQIPGTVEENLEWRLSIELYIAGMVTTSDSMLWAILIVA
QHPQLMKEIQKEIDDVIGDERSPKMADQDALPLIVATLMEVSRIRPAVPLGVPRLTSR
DATLQGYHIPKDTVVIMNLWEVLNSPKYWDKPEEFNPYRFLSEDRRSVVNHPAFIPFG
IGLRACLGEKLARMQMFIIFTSLLQRFTFCLADDSRSAGMKGKAGMAYSPLKYDIILK
RRENSD
>GLEAN3_19082
missing N-term
MASHGLTYGNSLFLLFAPSALKDNIRDFVDAFLADDISKEYTLEDFGRIVLDFFTAGT
DTTAVVTSWALLFLSVHSDVQRKVQNELDAVVGRGRQPDTSDRPNLPYCDATLMEIMR
LRPVVPISLPHMTSADASLGPYTIPKGTIVVPNLWAVHHDPKEWCDPYLFNPNRFLSA
DGQTVVKNEAWMPFSTGRRDCLGMQLAMMETFLLFTNLFQQFEFKLPPDQPTHSMRGH
PGLTMPPESYKICATER
>GLEAN3_19464
MFIPADTLDFFRDVNKKTIALRKSSSSGPNSKRVDFLQLLLDAQKQENIDKESKPEEE
EEDDDDIHADLDKHHDVQYHSRGLGKKATLTDNEVFAQGLIFFTAGSETTTTLLGFIA
YSLATNPEAQEKLIDEIDRVTPSRDSVGYKSVATMPYLDQVVCETLRIFPPAAVSNRM
CNETFVHNGLMIEKGTQVFIPVYTIHRDPDYWPEPEKFVPERFTKEAKAKQHPLAWQP
FGAGPRNCIGMRFALMETKMAVVRVFQNYRLETNAQTEIPPKFNRTGFLTPPNGITLT
AVPRTDKVAY
>GLEAN3_19883
MESADAVALTLCGVVLLASLAVWSMSRPRGFPPGPLGVPILGSVFSMTASPEEIFVRW
SKQYGDVFGFKVGQRMIVVINHQDAIREALLKHGIEFAGRPDFFSFGVFTYGFKDIIF
SSYSDTWKLHHKFAHAALRQFASGKRLENLLHGSIAKMEGYLSKMDDSEPVDPKLVIT
LLLYNVMSTMCFGKDYDFLDENLIAWKENNEKLNQEFGMGVAADFFGWAQYLPTPGVK
MVHRMRDELHRFVWDELDSHKKNFDPDNVRDVFDMLLLAQKEAADSGNQDAEKLSDTH
LVLTVSNMFAAGIQNTTETLYWALAFMVEHRDIQSRCQAEIDRAIGRKRVPVVDDRGS
LPYTEATLYEVMRYSSILPIAVPHATTCDVNFRGYRIPKGAVVMFNTYSMHYDPNAWE
SPTSFKPEHFLEDDGSVQQHPSSFLPFGAGRRSCLGEVAAKTDLIIIFTWLLQNYCFK
TVPGKETTSLLRVIENSAVGRNLENYELIMEMRK
>GLEAN3_19898
MAVLEISSIVTIPTLVALFVVVVLYLIYRTNSRTALRHPPGPSELPFIGSIFTLLFTK
LQQHELLFKWSGEYGGIFSFTLAGGRVVVLSDKELIQETLQDPFVNDRIPFPACITIY
GRENTGIAFASGAVWKEQQRFFLTVFRKVNTSVIRFEDIIQTQGQRLLNELDRLEGQL
FDPSNIVTVTISNVILKIVTGKTYEYADPGLLRMATCSDRLADIIGPAGLLGMIPGLA
SLPLPAKRQLTDLMREMIDFIRDHVEEHKANFNPDEPATDFMSCYLKEIANSSGIPDT
TFDETNMLQTCWDAMFPGYETTISAFRWALHLLASHPQVQSRIRQEIFDAVERDRLPA
YTDRHNMPYMESTVVECLRMRPVIPYVAPHIVSRDRKIAGYDVAVGTRITVNLWFLAR
SPTLWEDPDEFRPERFLDENGCYDRSREPLSFSYGRRTCPGDQLARMELFLILSYILQ
RFTLTLPDGAPRNLEGSMGATYKPYQYELRATKNHGARSSASNSVPE
>GLEAN3_19899
MSGDQIKDLVNWLEVVNDWTNKYGDVISFKLLNIRVYVLNNVQMSHDVMGSLSMMGRM
PVPIIKTLTGRHNAGVALSNGDVWKEQRRFFLSVFKRADAKGIRFEEIVGAQAERMLT
EIENTQSARFDPQPIIYTSIGNIITKIVTGITYDYDDDDFHRFMTHSNRVLDLMGPAG
ILASVPILAVLPSPAKRPLTKEWDAMYDFIKKSIASHKSEFNPNQEAADYIESYLHEI
AKRTTENNKGSFDEINLLACSFDLIVAGLETSATSISWSLHTLACYPEAQHRVRQEIF
DVIGQERDPRFSDRHRMPIAMATIAECMRLRPAIDLHIARVAEQDCQVGEYDIPKGSV
VTTNMSHLFLSPSLWEEPLEFRPDRFLTEDGQFDRKREPIFFGTGRRVCPGEQLARTE
IFLFITSILQRFTLSLAEGVPKDLPCYHSITNRPDPYQIHAEKIV
>GLEAN3_20229
missing N-term starts with C-helix
MKTIYNYHDQSFCSWGQKWFHRRKMLTPTFHFSILQSFMDVFNEQSMILVKKLEKFAE
KSETVNIFPLVAYCVLDIICITAMGRSSNAQENSENEYVKAVGRMSELVTIRGKNPLL
WPDFIFDKIKTGKEHKETLKILHGVTNKERLKESPKDTNTDDDDVDVVARTRKRIAFL
DLLLQMHREDASFTLGDIREEVDTFMFEGHDTTAAAAGWAILLIGHHPDVQARLHMEI
DEVFGDSMRPVTSDDLSRLQYLTCVVKETLRLIPSVPSIFRKLDEDIMLDGKVVPKET
LVALSIYGLHQDPDEFPNPERFDPDRFLPVNAEERHPYAYVPFSAGPRNCIGQKFAMM
EDKIILASILRRFSIKSIQTIDEAKPAGQLILRPAEGNMLVNLSRRK
>GLEAN3_20233
missing N-term
MPMEINKMGDAAAKGMFDFLEKILKRHRQAYNPEDEPKDLIEAFLHEEANKKAQNVDV
GSFTHQCLKQVVWDLMLGGIETVSTSLSWYFLFMAQYPDVQAKVQAELDKVIGRDRLP
SLSDRQNLPFTESTSVECYRFCRLINVQIPHRTRCDVKLGGYDIPKGTNVGTNASWLS
SSPSLWENSEEFVPDRFIDKETGLYNSKLEAGLVEFGVGRRVCPGEQLARTELFVIFT
HILHRFTLRLPEGTPITRNGINGLTHHPKPYKITAAPRR
>GLEAN3_20234
MAQVMSTDFLTHNLSTLLTTGLTIILVVLFSLRRKNDSSKKLTHPPGPSGIPVLGNLL
TVAFSKLPQHELINGWSKKYGDVFWFKIFGTGVVIINDKEVMQKTFQNTDVCDRAPFF
ASDYVLGRENNGIAFANGGSWKEQRKFFVSIFRKVDIGVSRLEDIVGGQVKNIVTQVK
NLKGEKFDPAIMVNMSIANIITRIITGVTYDWEDKAFQDIVAYSGKLFDISGPAGLFS
LNPVFARLPLPVNFEIRRLCRTIMDFIQVSIDEHRKNFDPNAPVKDFIGSYLKEMASA
DANNLKSFDDTNMIVSCFDALLGGWETVASTLRWTIYLLSKHPEAQGRMRQEIHDLVG
DERFPTIADRQYLPLVEATIAECMRFIPAVPIHLPHVTSQDCNIGGFDVPKGSAVAAN
LLHFAKSPMFWEDPEVFRPERFLDDAGNFMRGLEPVPFGYGKRQCPGELLARMEMFIF
TTFLVQNFELKLPAGIKDDLQGVLGLTWRPNHYFIEGHPTNANEE
>GLEAN3_20751
missing both N and C-terms, stops
in heme signature
MVDEHGETLDDQHIRDIIDLYILEIREQKKRGDEIVFNETTVWRGIFDMISAGTGTTS
TTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSDREDLPLTMATIMEIQRFRPV
SPLGFPRFTNKDAEVKGYSIPKGTAAIVNIWELMHGTKYWEEPDKFKPSRFLSDDGRS
IIKPDAFIPFGIGRL
>GLEAN3_20752
missing N-term
MVDEHGKTLDDQHIRDIIDLYILEIREQKKRGDEIVFDETTVWRGIFDMISAGTGTTS
TTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSDREDLPLTMATIMEVQRFRPV
SPLGFPRFTNKDAEVQGYSIPKGTAAIVNIWELMHSTKYWEEPDKFKPSRFLSDDGRS
IIKPDAFIPFGIGSRVCLGEILAKMDVFLTFTFLLQRFTFRLPEDSPTN
>GLEAN3_20753
MASSGYVASSIFLGLQEISFRECIVFVMTCLFVRLLVSRGSFKKSTPPSNPPPPSPTG
LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAVREAFVEKADA
FSDRMVPPLLGEALGKRGSLIWPAGGGWKKTRKFGHSAFRSMGLGHRRIEEKVVEEGR
ILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFVKIVGDIERFSQNL
SLTSLGNVFPLLYYTPLYTDFRRPMQDIISYIKSMVKQHQQNFDPLNVRDIIDLYIVE
LKQQREQQIPGTVEENLEWRLSIELYIAGMVTTSDSMLWAILIVAQHPQLMKEIQKEI
DDVIGDERSPKMADQDALPLIVATLMEVSRIRPAVPLGLPRLTSRDATLQGYHIPKDT
GCYSLLYNNYNNINSSSVVIMNLWEVLNSPKYWDKPEEFNPYRFLSEDRRSVVNHPAF
IPFGIGLRACLGEKLARMQMFIIFTSLLQRFTFCLADDSRSAGMKGKAGMAYSPLKYD
IILKRRENSD
>GLEAN3_20754
MEMGLFKVERLRDGLICFAVFLLAKWLLGSKTRKSKLPLPPSPPGLPLIGHLLMFRRD
FRKVFRGLKEKYGSVFSIRVGPQRTVVMSEIDAVKEAFVNQAEAFSDRHMPPLLKQGL
GSAGSFAFDNGPVWKARRRLGLSAMRTFGVGKYSMEQRIIEEAKMLCETIENNQETSF
NPKGLITHSVSSIICGITFGQRFDLNDPEFNDLISRIHSALSSISFFTVGQLFPFLLY
TPLYSKFRNLISSLKTLVTRMVDEHGKTLDDQHIRDIIDLYILEIREQKKRGDEIVFD
ETRVWRGIFDLFAAGTGTTSTTLLWGILLMAAHPDAMEQVQNEIDRVIGDHRPPSMSD
REALPLTMATIMEIQRFRPVSPLGFPRFTNKDAEVKGYSIPKGTAAIVNIWELMHSTK
YWEEPDKFKPSRFLSDDGRSIVKPDAFIPFGIGSRVCLGEILAKMEVFLTFTCLLQRF
TFHLSEDSPTNIDAIPGVALIPYDFLIMAEKRV
>GLEAN3_20756
MANRSYVASSMFLGLQEISFRECIVFVTTCLFVRLLVSRGSFKKSTPPSNPPPPSPTG
LPFFGNLFSFQEDMRHLFRKWTNEIGGIFRLRIGPVMDIVVINDVEAIREAFVEKADA
FSDRMVPPLLGVAFGKRGSLIWPAGDGWKKTRKFGHSAFRSMGLGNRRIEDKIVEEGR
ILVQACEMKEGEPFNPRVLITNAVGNMIACLVTGSRYDYGDPDFVKIVGDIERLSENT
SLTSLGNVFPFLYYTPLYTGFRRPMQDIISYIKSMVKQHQQNFDPLNVRDIIDLYMVE
LKQQREQQIPGVVEENLEWRLVVELFLAGMVTTSDSLLWAILIVAQHPQLMKEIQKEI
DDVIGEERPPKMADQDALPLIVATLMEVSRIRPVTPLALPRLTSRDATVQGYHIPKDT
VVLMNLWEVLNTPKNWDKPEEFNPYRFLSEDRRSVVDHPAFIPFGIGLRACLGEKLAR
MQMFIIFTSLLQKFTFCLADDSRSVGMKGKAGIAYSPLKYDIILKRRENSD
>GLEAN3_20876
MAFMFSVWTFAVPVCLLYLIKKLWYWYNISKSDPNCSAPLPGGSMGWPIIGETFHVLF
QGPSFYEKNFAKFGRVFKTHLFLNPTVRVHGPDNIAKILQGENDIVTSTKPVRMAIEK
WCQSDEVLAANVLRHLSFRVAADVLCGFHYTDEDVTELCQKFDDIMKGIMTIPYDVPF
SRFRRALQARDFIKQKIDENVTRKANLDPEEDTQDAMRILIDLANKSDDSSSKEGLYE
AIKNQTIEFLTSGMLSLSSAASTMLYHLAKKPSVLAKVREELNDYGLLTDYSPLTLNK
INQCRYVMQVIKESLRMVPPIGATFRTALKTFELDGKQIPKGWTVFWSIRETIHLSDS
FTDTKNFDPDRFSPQRQEDKVAGRFSMPVFGHGTHSCIGQNFALLTLRILLIELARTC
DFETPNVDKIKFSWIPSPKPKCGMPTRFTRKILVDQNENEVSYSSKSSTCPQGEDSRE
RSEEGVHQRSTTCHAA
>GLEAN3_21087
MIDLLGDSSLVQVNGLTITATLGVVAISLALWTMVRPRNFPPGPRGLPIVGSLFSLND
NLEKIFTGWIKQYGDIFGFKLGEHWMVVLNHQEIIKEALLKQSVQFAGRPRSPSAEWM
TDGYKDIAFGMYGETWKLHRKLGHTSLRHFATGEALQKLLSTTFPKLEVEVAKTAGKP
VDPKNIISLLLNNVLSEMCFGKRFEMHDPKVLEFIKLNDEMNAEIGMGFPEDVFSILQ
YVPYKKRDIIRGFMNKIKELVIEQIGDHKETHTQGEVTDIYHLLLDAQEKAMAEDSNM
EKLSDEHLLWTIMDIFSAGIQTSTDTLYWALALMTEHPEIADRVRKEIDDVIGGDRIP
VIEDRGSLPYAEATLYEVLRYSSIAPILLPHATTCDTTLRGHAIPKGTTVMMNVYSMH
FDPREWKDPENFCPEHFLAENGKVRDHPPSFLPFGAGRRVCLGEAVAKADLFLIFTWF
MQNYTFSKVPGKESESLTNMIPMSAAGRLLVPYEIMINKRE
>GLEAN3_21185
missing N-term
MLKYDVEFSGRPQFYSAEIASDGFKDIVFSNYSDTWKLHRKLAHSAFRKFATGHKLDD
LVHGVIPQVQQMLTENSGKPFDPKLMITLTLYNVFSKMCFSESYDFSDPRMLGYITMS
QELAEKVGFGLVADYFPWARFLSYNGASYIQRMKQKYMEWTSPQLQRHRETLDPTGIE
TSMQRLYWAIALMTEYPDVQAKIQGEIEVEIGRGRHPVIKDRGNLPYTEATLYEMIRY
STVGPVGVPHATMVDTKFRGYLIPKGTTVLLNQYSMHFDEREWTDPHSFRPEHFLDES
GTTRLHPSSFQPFGTGRRSCLGEAVAKADLFLIFTWLLQNYTFRKVPEYSEENLLKLL
PETAAGRVLQNYKILIEER
>GLEAN3_21251
missing N-term
MPSMQGVQYALDGYIKKYGNCCGFYQFLTPAILVADPDMIKQIMVKHFDRFHNRMMSE
MVREPASRLVQNLKKVCGTDRQGIVELKELFGAYIMDTIGLCAFGLNVDSQKNKDDPF
VLNAKRILLQFRLYITGKIPGVSHILPLLGLTVFPPEVLEFFIAIIDQAVEQRKTNKA
AKTVDFLQLVIDSTKDNACEDEDDDDDATTNGTSSSSHAIDKKRKRQALTRQELTSQA
LAFFVAGYETTTTALCFTAYLLATNPDQQDRLIDEIDRLVPTAEDITYDNLKKLTCLE
NFISESLRLYPPGAL
>GLEAN3_21255
MEVLQQESPGSCLTVWLVFAATALLILYLTWNYTYFLRRGIPGPLPFPVLGNVLDLKE
GVQYALDGYIKKYGNCCGFYQFLTPAILVADPDMIKQIMVKHFDRFHNRMYIPVDNPM
LRSALFLSSDDQWKKIRHTLTPAFSGRKMKLMSEMVREPASRLVQNLRKACGTDRQGI
VELKELFGAYIMDTIGLCAFGLNVDSQKNKDDPFVLNAKRFFSSLDFTSPAMIISMLT
SKIPGVSHILPLLGLTVFPPDVLEFFIAIIDQAVEQRKMNKAAKTVDFLQLVIDSTKD
NACEDEDDDDDATTNGTSSSSHAIDKKRKRQPLTRQELTSQALAFFVAGYETTTTALC
FTAYLLATNPDQQDRLIDEIDRLVPTAEDITYDNLKKLTCLENFISESLRLYPPGALI
NRVCNQTTTINGHVFQAGDSVSLPIWSVHRDPEHWPNPEKFDPDRFNKENAAGRAPLA
WLPFGGGPRFCIGLRFAMMEIRTATVMLLQQFRLEVAPETEIPPVLGKMGFINPPNGI
TLRVVERSK
>GLEAN3_22109
missing N-term
MSDLALGFFSNDFSSLMSSKGSIFGQRWKQLRRFGHRALRHFGINQGSMEDSIIEEGR
LMAEAFEMKEGTQFNPKVLVTNTVGNVITRFVFGFRFEYGDPVFEKLAEDFELSFESF
SITSLGSVFPFLYYTPLYTNFRRPIQDVVSYIKSVVNDHRRSFDPLHLRDVIDLHMAE
VKGQRDENKMAAIEEGWEWKFIFELYAAGLATTTDTLRWAILIAAQHPQLVHEVQKEI
DDVIGGRSPKFSDQSLMPLTGATMLEITRIRPVVPLGVPRCTSRDAVVQGYHIPKNTT
VLMNLWELQTSPKYWKEPEKFNPYRFLSEDRKTIVSHRALNPFGIGLRVCLGEKLARM
QLFIILTSLLQRFTLRLADDSLSVGMEGKAGIAYGPLDYNIVLEKRHV
>GLEAN3_22110
METVKKSSLDFFADYAIYGREVLVFATVLLFVRWILLQWSYNKNSKPSAPPLPYPPGL
PFFGNMFSFRTDMRDILRKWTNDLGGLFRLRIGPMVDIVVISDVDAIQEAFVDQANAF
SERDIPPLLEVAMGTNGSIFGQRWKQLRRFGHRALRHFGINQGSMEDSIIEEGRLMAE
AFEMKEGTQFNPKVLVTNTVGNVITRFVFGFRFEYGDPVFEKLAEDFEHFFESFSITS
LGSVFPFLYYTPLYTNFRRPIQNVVSYIKSVVNDHRRSFDPLHLRDVIDLHMAEVKGQ
RDENKMAAMEEGWEWKFIFELYSAGLATTTDTLRWAILIAAQHPQLVHEVQKEIDDVI
GGRSPKFSDQSLMPLTGATMLEITRIRPVVPLGVPRCTSRDAVVQGYHIPKNTTVLMN
LWELQTSPKYWKEPDKFNPYRFLSEDHKTIVSHRALNPFGIGLRVCLGEKLARMQLFI
IFTSLLQRFTLRLADDSLAVGMEGKAGLAYGPLDYNIVLEKRNV
>GLEAN3_22432
MDLNQMTIYHLLVTALVGLVAVWIVKHLMTRPKNLPPGPTGIPLLGVAFKLIRSRSDL
LALFSSWADKYGDITSFRAGPKLVVHLNSYEVISEAFRHPDLQSRPHSQMLTEMSGVG
QLGLILGVGDSWKEHRKFTFSVFRSLGVGKKSYEDTIAAEMAQLGGAIEEKKGTAFNP
NVLFEQAVANVICSIVFGTQYQYSDTDFQHVLYLLNNIARLAGTGGWAFFIPIPGISK
TPFGFVKQFVNTVDRLLAFSQAQLDSHMKDLDPDHPKDLIDQYLIKLEETQGAASYFD
QLNLKYLISDLFIAGTETTTTTLKWCFIYMMAFPEVQSCVQAELDQVVGRERMPGWQD
RKNLPYTCAVLMEVQRKGAVGAMGVPHVAAADVTIKGYTIPKDTIIFSNIWNVLNNKN
FWNDSDAFTPERFLSEDGTLVKRDDLIIFSTGRRVCIGEQIARMETFLGFTSLLHRFT
FKKPDNSPPLSFDGVLGGTRSCLPYTTCALKRE
>GLEAN3_22590
MDFGISLTTFLLGMVTFLVTLMIVWSRTGRTYANLPPKGPPEWPIIGSLLSLAGEHPP
HQILANMAKKYGPIFSMQMGSFYAVVLSDYSLIKQAFAKSGDDFSDRPKIAMIEHLAE
GKGLISGYNSPAQIEQRRFTFSALRSLGMGKFRMEETISEEIQHLIKSFTERNTQPFI
PFHDITVSVSNVICWLNFGKRFEYNDDDFKGVLEALFGIVEITEIGGVFNFLPFLRFL
PGSGLKKCFKHKAIFDKHMYPLIRKIKQSYDDKFESAECYIHMYADKIAEAAKQNIVP
NLFNDGNMVLAVADLFVAGTETTATTLKWALLYMVVNQDVQKRVQEELDRVVGVDRLP
SLLDKPNLPYTEATLLEIQRKASIVPLGVPHAPVQDTILYGYDIPKGTVVLPNLWAIH
HDPNLWKNPDEFNPDRFLKPDTKHVVQREELIPFSIGRRRCIGEELAKVELYLFFTHL
LCRFHFILPPGAPQPTMEGRSLATFVPHDYSLCAIPRCRDKDFEES
>GLEAN3_22593
MYASVEKMSSSVFYYGFTPFILAGLIAFLLSFILVRSGKKKYKNLPPRGPREWPVLGG
LPSLAGTDMPHVTIADMAKKYGPIFGMKMGSFYAVILSDYSLVRQAFAKSNDEFSDRP
KITMIENLVQGKGLIVSYFGQMQTEHRRFSLSALRSLGMGKFKMEETIVDEARQLATI
FASRQSKPFVPFHDIVVSVSNIICWLSLGKRFDYNDESFKAILASLFESFEIAEVAGL
FNFLPFLRFLPGSGFKRMIALQQNHDRFMKPLVLKVHEEQEKSRRSPTCYVEMYTEQM
IEAERETPGKHTFTKENMYHAVADLFAAGTETTATTLKWALMYMLLHKDVQEKIHKEL
DDVVGRNRLPSLTDRPSLPYTEATLLEIQRFATITPLGVPHAPVQDTVLNGYDIPEGT
VILPNLWAIHHDPDLWKNPDEFNPDRFLSPDTKQVVQREELIPFSIGRRRCLGEQLAK
VELFIFFTHLLHRFQFTTPEGAPPLTMKARNGATLNPEEFEICASLIDDVM
>GLEAN3_23067
MSLLRFLLRISTKANRQTVLLTVASVFLALLTAYRHTLTRTKHGRRRRRLPPGPTNYP
IVGILPFLGRDDPGKSVLELSKKYGPVFYGRLGNFDAIFLNDYESVREAFAKSGDAFS
DRPRFTTFEAWSGGRGVVCCYYQNHWKEQRRFLLKLLRNLGMGRANNSLEDRILQEAV
LLTRSIGETNGESFDPRIPMTMAVSNIICGLTFGKRFDYDNKEFEKFICTLERCFEMT
DIAGPTNYFTLLKYVPFSSLKKMGAMARSLERGLFTREKDAHKLILNTNDCKDMIDFF
LKEVRENESKEGRGEAPNSSFKEEFLPSLIGDIFAAGTDTSACAMYWVMLYSIKYPKY
QRRAQEELDRVVGRHRLPRQSDRPNLPFVVAMLAESLRCSAGGPFGVPHAAVEDTTFR
GYDVPKGTIMIANHYALLHDPEVFGVDTEVFNPCRFFDDDGTVIKSSKTGLAVSRRAC
PGEQLAQSERFIFFSHLIHQFDFSAPDGPDSVSLDSHFGLSRSPFPYKLRAERRPIDL
QLPDITL
>GLEAN3_23068
MKKYGAASPKTLKSTLQFLLRLMGNRRHTTYILALISLVLATISLRIGGQGRRRRRGK
LPPGPMGWPIVGMLPYMDPENPAQSVWDMSKQYGSVFCGRLGSHLAVFINDYETIKEA
FSRKDDAFSDRPRITMFEVYTNGHGVATCYFDNHWKEQRRFAMKAFRHFGVGKPDSVF
EEYLMRESKHLLHAYSRKHVAFNLKPFLDMAACNIISSMTFGNRFSYTERTFQQFLQA
LDDIFHAADVAGVTNYLSFMKYLPFSGYSSLDRRSRELESGLFVREREGHKRTYVASE
TRDLIDAFLHEIRKKKEEGVDSDKTGFSDKMILHFIADMFAAGTDTTSNSTQWLLLFA
AKYEKEQLRVQEELDRVVGRDRLPTLSDRKNLPYTEAFMVESMRCAVGGPFAVPHAAV
RDTEFKGFFIPKGTVIVSNILAVLRDPKMFPEPEEFRPSRFLSEDGNLRKDLIERVSA
QFGVGRRTCIGDQMARAERYIFLTHLIHTYNLSGASGPDSLSLKSHGGLTRNPVPYDV
ILTERTPTVASAKKSHGF
>GLEAN3_23808
ELWNLTYFWRRGIKCSRPLPLLGNILSCTKGFYSYIYTCQQNYGNIFGLYFFRSPLIV
VCDPDVLRDVLVKSFSKFSNHYQFFLKNRPLDQGLLDLRDQRWKDVRSVISPTFSALK
MKQMSPVINECCDVLVTNLKKKQGEDINVNSVFEAFTMDGIAKCAFGLQVDSQNNPDD
PFVKHAKRIMDGTVESPVAMIAGVFPPAAYLFNALDIALFPAETRKFFRDVVVKAIEL
KKRSAGNERRDFLQLMMNAHNDDEHDSRSKVIKDDNDLHNLIEDHQDKHVSNAQNTKT
QLTMDEIFAQAVLFFVAGYETTNVTLGFLAYALATNPDIQDRLIEEVDEVTPTRDSVD
YNSIAKMSLLDMVVCETLRLYPPAVMGDRCCSETHTVNGLTIEKGVQFLYSIYNIQRD
PTLWPEPEKFDPSRFTKENRANRHPFAWIPFGAGPRNCIGMRFALMEIKMAVVRILQK
YRFVPSPKTDIPVKFGTGNTLKPDGGIFLRVEERQ*
>GLEAN3_24078
MLPPNINQNGTQNKLRITIMIDATAFLTRGTAFLAGETIECAITFSCPEATGSGQPEC
LAWASAQLHCQCNVKESKVIIPPQPITRRPSEEWENIQSSSNTSFAASRGLDEATIFE
GANHTDAANPFLEGDRPKNTLLDRAVQILTTITSRKSPSVAFSNGDVWKEQRRFLLSV
FKRADAKGIRFEEIVGAQAERMLTEIEKTKSARFDPQPIINTSIGSIITKIVSGITYE
FDDDDFHRLVTHSNRVFDLLGPAGGILSSVPILAAVPSPAKRQLTKEWDTMYDFIKKN
IASHKSEFNPNQEAADYIESYLHEIAKRTTENNKGSFDEINLLECSFDVLLGGWETSA
TTISWSLHTLACYPEAQNRVRQEIFDVIGQERDPRFSDRHQMPIAMSTIAECTRLRPA
FDFHLPHVAEQDCQVGEYDIPKGSVVTTNMSHLFLSPTLWEEPLEFRPDRFLTEDGQF
DRKREPIIFGTAFHPQSGRRRSKRSPMLPQVHQSTGSLPDTC
>GLEAN3_24113
MFIPRINHFAMEFDWHQLTPYHLLATALVGLMAAWFVKHLMRLPKNLPPGPTGLPLLG
VVSKFFGTSDPLALLSEWADMYGEITSFKIGTQLVVLLNSYDAVAEAFRHPDLQDRPR
CRIKEELSGCGRNAGLLIGVGQSWKEQRKFTLSVFRSLGVGKKSYEDTVAAEMTQLGR
AIKEKKGSPFDPNILLGQAVANVICTIIFGTQYTYSDTEFQQLLSLLNFSMRRSGVGG
WLFFIPIPGISKVPFGVIKTMASNAKEFYAFIDDHIETHKKNGVPDVPRDFIDQYLIK
LKEMKGTKSSFSHHSLKSVIADLFFAGMETTTTTLKWFILYMMAYPEVQSRVQAELDH
VVGRERLPGLSDGKNLPYTCAVIMEVQRKGAVVAMGVPHLASVDVTIRGYTIPKGTII
YPNIWKVMNSKNFWDDSDAFKPERFLSEDGKLTNRQELIYFSTGRRICPGEQLARMET
FLGFTSLLHRFTFKKPDNSPPLSLDAILGGTRSSLPYLTCAIPRDPAENVA
>GLEAN3_24320
missing the middle between KYG and
I-helix motifs
MFDLSATSVFLGGVIFLLCFWVVRRPRNLPPGPWSLPIIGYNFKPGLIHEAFMDLAKK
YGPIFSLRRGSFVFVILNDRESITQALVKSGEFFSDRFVPGHFNWGIPDPGKKGGTDT
TAVVTSWAILYLSVHPEVQRKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIR
PVLPVSIPHMTSDNVSFRGFTIPKGSIIIPNLYAVHHDPKEWNDPDTFNPDRFLSADG
KQFQMNDAWMPFGVGRRDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANG
ITMKPKPYKICAIER
>GLEAN3_24836
MASMAQQILHRHVGTMVTSEHLEDAKPYDEIPGPKGVPFFGSLFDYTGLGPYKLDKLH
EATVDRFRNFGPIFKETLAGVTHVHIIDPTDVRELFRNEGRSPKRTPIDAMVKYRRTR
KRHIGMANTEGEEWQRLRKPVQHLLMKPQSVYAYIPIMEECADDFVTLMKETQDIESQ
EVPEFNHKMQRWTLESVCAIVFDTRMGCLNTGLDNNPEALDMIVSVSNFFDSLKDLTF
SFPFWKFGITTKAWTKFTEAQDYFFGMSKKYATIALDRMKSAVGTSTQSESEGTFLEC
LLARRDFDIDSLCDLMNDLMIAAVETTASTLAFNLYCLAKNPEVQERVFQEINQAIPP
GTKITAQSLQNLPYLKACIKETVRVFPTVDGTNRIPSREIALAGYRIPPDTIVRIHCI
AGLMEEYFPSPEKFKPERWLRGDAECENIDPYLILPFGYGSRMCTGRRMAEQDLYIML
IKIVQNFKIEWHHEEDMELIFRLTNVPDRPAQFRFVSRT
>GLEAN3_25299
missing some seq between pro rich
region and C-helix
MIVDLVRDVFDVGFSSILLGIATTLVLVWMMKWMSTRRRLNLPPGPMAMPWPLNFVMD
LLLLLQAKGGIVFSEGDDWKQHRRFGLAALRNFGMGKKSLENSINTEARILNEVLSGK
IGKPFDPSFVINNAVSNIICAITFGRRFEYSDPKFKEMIQRINLFAAEDPGILSDLPT
FISTARRRNLLEVRKFLEEEINEHKSTFEPSDVRDVIDLYLKEIYRAREAGEKNELDL
TKAWPLVFDFFLAGTETTSTTLLWAFLLMAGHPEVQEKIVAEISSVIGSGATPRYEDR
KLMPYTEATLVEVLRYRPIAPIGVPHRATSDLKVKGYDIVEDVIIVINILHIHHDPKI
WGDPEVFRPERFLTEDGTALIKHEAYMPFGVGRRVCLGEQLAKMELFLFFTNILQQFK
ITMPPGVQPNYDFGHRALTTLVPKAYEILLEER
>GLEAN3_25595
missing C-term after EXXR motif
MNDLSDVQYGDVFSFTMVGKTFTYLIGSEASALLFNSKNENLNAEEVYSNLTVPVFGK
GVAYDVPNPVFVEQKKMLKTGLNIQQFKRHIPLIEEETREYFNRWGDSGEKNLFVALS
ELIILTASRCLHGKEIRSMLDEEVAQLYADLDGGFTHLAWLAPSWIPFPSFLRRDRAH
RKIKQIFYKAIAKRRETGVENDMLQTLIDSKYKSGRPLSDDEIAGMCIGLLLAGQHTS
STTSAWLGFFLARDKEVQDRCNAEQIKVCGDASTEVSYDQLKDMQLLDHCVKEALRLR
PPIMTMMRVAKSPLVSVPIIGPVVPRES
>GLEAN3_25829
MAMILTVLGYVALLICVILGLTIAVLSAAILRTEYVLKDVPTPKNKSFFLGDIKTFGV
WSEKIGMSVANILYLMTKDLGPVYYVRVFWKAHFFASAPEIVKELLLNSRHIKSQEEY
HSFQKLYGERFMGHGLVSEVDHEKWFHKRAIMNPAFRRKYLISLMEEYNSGSEALCQK
LKEKADGKIEVPMLDELNKVTLDVIAKIAFSMDTNAIGDPDCPFPAAITAALQGLQSL
HERPWSTIDPRCISMRRSTRKAVHFIRETGRDQIKERIQARKRGDHVPSDLLNLILDI
ANDLEGDKDFEMENMLDEFVTLFIAGQETTSNLLAFTILQLGRHPDVAKKLQAEIDEV
LGQKHKIQYEDLAKLEYMMRVLKETLRLYSPVGGTTRLTAHPINYNGITIPAKCSLTV
MSYVTSRMEEYFDDPLLFNPDRFKPSEDGSLPRHLYAYFPFSMGQRSCIGQQFALIEA
RVILAKLLQRFEFRLEQSQSMGILDELTAKPMGRCRNYITLRA
>GLEAN3_25830
MAMILTVLGYVALLICVILGLAIAVLSAAILRKEYELKDVPTPKNKSFFLGDIKTIGA
WREKELLLNSRHIKPQEIYHGFQKLYGERFMGHGLVSEVDHEKWFHRRAIMNPAFRRK
YLISLMEEYNSGSEALCQKLKEKADGKIEVPMLDELNKVTLDVIAKIAFSMDTNAIGD
PDCPFPAAITAALQGLQSLHERPWSTMDPRCISMRRSTRKAIHLIRETGRDQIKERIR
ARKRADHVPSDLLNLILDIANDLEGDKDFGMENMLDEFVTLFIAGQETTSNLLAFTIL
QLGRHPDVAKKLQAEVDEVLGQKHTIQYEDLAKLEYMMRVLKETLRLYSPVGGTTRLT
AHPVKYNGITIPAKATVTVMSYVMSRMEEYFDDPLLFNPDRFKPSEDGSLPRHLYAYF
PFSMGQRSCIGQQFALIEARVILAKLLQRFEFRLEQSQSMGILDELTSKPMGRCKNYI
TLRA
>GLEAN3_25863
MGVDPVRDVFDVGSSSILLGIATSLVLVWMVRWMSTRRRLNLPPGPMPVPWPLNIVTE
LLLKLQGTNPFVLLGKLAERYGEVVYIDNGGRRFVFLASPEAVNEVLVKQADITSGRE
DFWELKAVTHYHGGIIFSEGDDWIQHRRFGLAALRNFGMGKKSLENSVNAEARILNEV
LSEKIGKSFDPFFAMNNAVSNIICAITFGRRFEYSDPKFKEMIQRINYFVAEDPGPMS
NLPTFISNARRRNLVAVRKFLEEEVNEHKSTFEPSDVRDVIDLYLKEIYRAREAGEKN
ELDLTKAWPLVFDFFLAGTETTSTTLLWAFLFMAGHPEVQEKIVAEINSVIGSGATPR
FEDRKLMPYTEATLVEVLRYRPIAPTGVPHRATSDLKVKGYAIPDGVNIAVNILYIHH
DPKIWGDPEVFRPERFLTEDGKALIKHEAYMPFGVGRRICLGEQLAKMELFLFFTNIL
QQFKITLPPGVQPNYDFGHRVTTLVPKAYEIILEER
>GLEAN3_25956
missing N-term starts with C-helix
MVLYDLFEVYYREQTTASSVIWENGDKWKSSRRTTLTAFRKSLSSPSTLGATVKSSWQ
HLRVSLQSLNGQHCNPRSEITSAIAEIISSLCFGHCVDVRSDPKIADIMQAQHNISRG
LSGISLGHIFPHLYHTPLYSKLRRDIEQYTQFMREVVKDHKLRLVGNDDGKCKDGNRK
EYITDVTDCYLQQTQHGGKVAVEEEDVWRVMFDLMGAGTETTSETLLWGILYLCLFPD
VQRQVHSEIDENMALYGPDDVINYDQRKSLPYTQATILELLRCACVAPLSLPRVTTRD
LTVLGYSIPRGTEIWVNEWAINRDRNVWNKPHDFNPARFLDDSGRVDPKLKESIISFG
EGRRMCLGRCLAEHMLFVFFTNLVRWFTFRLPEDCRETPNVNEGNFGLTYSPPQFDVI
ITPRVFGN
>GLEAN3_25969
missing N-term
MTLWYMFSKRSSYDSPLQNDILNLTDKAEKPSGSGNPVDFLPWLRFITTKYKTTLQEY
RKKFGKLFSGVIEEHAVLYEPGSERDVIDHLISTGHNSGEEELQRLGLAGKALYQSAF
DFFPAGTETVLATLEWIMVYMVTYPNVQSDVQHEIDQFVGRGRLPTLNDRDNLPLTQS
CLLEIQRHAAAVPFAVPHSTTQDTVLDGYYVPKDMLVFVSLYSAHFDPDVWETPEVFN
PRRFLAQDGTLDEEKTKLAIPFSLGPRRCLGSGVARIQLFIYFTTFLHQLNFKCPEGQ
KASLDATYKFPRRPKVFNVQIDPR
>GLEAN3_26188
missing C-term after EXXR motif
MWRRPLLLVRDLDMLRDIMVKDHHRFYNKFPLPMSSGSFDNALFLLRDAQWKRVHDIM
TPTFTGKKMKMMSGILNEGADDMVEKLLQSCKEDGNIDSLDNGLPYAYSVFTFQCGIY
PSLLYIKIKLNGLKKIVGIIEKRVDFLQLLIDAKNGKSKMANENDEDDIHNKYFKDAG
ADENILTKTQKYMTREELLAQSMVFLVAGYDTTATLLTFASYLLASNPDCQDKLIAEI
DDIAPKRDDVGYTTISKMPYLDQVVCEVLRIYPPAVV
>GLEAN3_26225
MTYFKRKGIKGPSPYPVVGNSLDLFQGIHWTTQDYVKKYGNVFGMFMFSSPVFFISDL
EMVKDITVKSFSKFMNHKTFVLKSRPFDKALTGLKDQHWKDVRNVITPTFSAVKMKQL
SVLLNESSDTLVKNFGAIQEKDGKVKVFGLFEAFTMDGIAKCAFGIQVDSQNDPSDPF
VTNAKDLMKNRFSLSTMIAVTFPRIGKIMDYYGVSTMSPKISSFFISVVDQAIAARKE
SGGSKMKDFLQLMLNAAQENKSEKEMIEEKPQDEKVEGDLHALIEDQGNKSYFDLKKS
KKTTLTKDEIYGQAIIFFLAGYETINTTLGFLTYSLATNPEVQDKLIEEIDNETPTRE
DVGYNSIAKMSYLDNVVCEILRLYPAGVIVERQCNETHVCNGLTIPKDSQVMFPVFAI
HRDPAIWPDPEKFDPERFTKENREGRHPLAWMPFGAGPRNCIGMRFALMEIKMATVRL
LQKYRFETCPETQIPPKFAKMSLKPDDDLYLRVVERS
>GLEAN3_26360
missing N-term starts at C-helix
MGFELLCTKATIAFSNGKPWVDLRRFALPALRSFGFGKQSLVPQINLEARYLSEEIRN
LRGEPTDLATTFSKATANIICQLLFSQRYEYSDGEMSRILKSMDEFLSLIHETDLVNI
FETLIHTPRYKPYRESIFSDRDFIMSHLNSHRETFQKDNIRDFTDAFLADDISKEFEL
EHFWRVLLDLFIGGTDTTASVTSWTILFLSVHPDVQRKVQAELDTVVGRGRQPNTLDR
PDLPYCNATLTEVMRIRPILPVSVPHMTSDDVSFRGFTIPKGSIIIPNLWAVHHDPKE
WNEPDTFNPDRFLSADGKQFQKNEAWMPFGVGRRDCVGSQLAKMETFLLFTNLFQQFE
FKLPPNQPNPSMRGANRFTMNPQPYKICAIER
>GLEAN3_26373
MAMAYRRLRHRLLLPPGPWFLQLRMVYYMLTMRLPCRLEKIYEAQSKKYGKMFSFSLG
SDIKIVISDPDIIREAFGKHRAAFSDRRSPSVYAKSFGGEHNHRILFANGRLWKTSRR
VVAPCLRNIGTWGQQMESIILREADSLTDKLAELGSKPFDPRHHLTRAVANIIHMMVH
GYNLDPDDPRLQQMLDSFQLVVTHPMVTSNTLRYLPFLYYLPTTRPLRQAFQLLKTML
STAVSTQANGYDQSSINTIVKRIMQAAEEDDDVGEDRLKTKEVWRLVYDLMLAGSDTT
SNTLKWAVLVMAYYPDIQKKVHNEIDCVLGERSPCLQDKKSMSYIQATIMEVLRVRPV
LPLALEHLTSETTVLRGYTIPKGAILIPNVWSCHHNPEYWDDPEVFNPGRFLSEDNQS
LGEEPHAFLPFSFGPRKCLGDQLAKNELFLIFAKIMQRFHIDFAPGDRPESFDEGFDK
GLILRPNNMKVRLQPRKIYSHEE
>GLEAN3_26477
METKVLPFDVNSYTLGVAFCTCFLSWVAVKRYRTEAKEESLHSLPPGPPKWPLLGNIP
GMMMAGSSSKYLSDLVSEYGPVYTLYMGPRPVVVLNKYDIMKQALMDGHLLSDRGDLP
ALDDYFKGRGMMMAGSSSKYLSDLVSEYGPVYTLYMGPRPVVVLNKYDIMKQALMDGH
LLSDRGDLPALDDYFKGRGIVGSHYTEYWKNTRSFVIHSLRQFGVDRQSFESVIASQA
DILVEDIRKRGTFDPLLCLQSNVCSVICGVVFGKQYNLDNPRFTDLLETVYKALRAIE
YAAVSSLVPGLHYLPGCSYQKFYACMDKLMRFFREEINAHDETLDPDSPRDVIDMYLS
KIRRASPEDPTYGLFSKTNLEFVVNDLFMAGTETSASVLYWAILLIASNPGIQERLQK
EVDDVTTHCDHVSIADAASMPYTRATIDESLRYASVAISSMRSTTSTMTLGKYRFEVG
TWVMLNYEYAMFDPDHWSEPKRFKPERFLDDHGQYCPDEQLIPFSIGKRNCIGEKLGR
MELFIFFTNIIKNFELVFPDDAPHPSVEAATGGMMRVPNRYSITFKDRTTTAA
>GLEAN3_27152
missing N-term starts with I-helix
MSELMLKFLCFIAGIQTTVETLYWAMALLATYPEIQAKIRAEIDDVIGRDRLPTINDR
GKLVYTEASLYEVLRYSSIVPTNLHATTRDTEFGGYHIPKGTVVLINTHSMHYDPQEW
DQPDKFLPEHFMDGSGTVREHPPSFLPFGAGRRGCLGEAVAKADLFLIFSWFLQNYTF
SKVPGKESEDILNMIPQTAFGRLLISYEIMINKRD
>GLEAN3_27153
MIDLLGAGESGFGQINALTITATLGMVTATLALWTIVRPKRFPPGPRGLPIVGSIYSM
NDSPEVVFGEWAKKYGDIFGFKAGERWIVVLNRQALIKEAVLKQSVGFADRPDFLSLD
IFSDGFKDIAFSPYSETWKLHRKLAHSALRHFATGKPLQDLISSVYPKVEKKLAMTEG
QPIDPKVLITLLMYNVLAQMCFGRSLLLKAQEDARKEGENVDKLTDTHIFQTIADIFT
AGILTTVETLYWAMALLVTYPEIQAKIRVEIDDVIGRDRLPTINDRGKLPYTEASLYE
VLRYSSIVPNALPHATTRDTEFGGYHIPKGTTVIINTYSMHYDPQEWDQPDKFLPEHF
MDGSGTVRDHPPSFLPFGAGRRGCLGEAVAKADLFLIFTWFMQNYTFSKAPGKESEDI
LKMIPQTASGRLLLSYEIMINKRD
>GLEAN3_27613
MHLTCTVVKVVMVVGMLMVMTPNSDALSGKTPPGPGLLGSFTQLKRVVSDFFRSRSAA
EMISEKEGGLYQLNLGLSKICFINSRGFRDEAFLEKYHLFSDRGVESNKKQLPEELMQ
WRGTIRDNGAASDEARAFVNSTLKETSYLRTTVETQIEEAIAFFDDELTGAIPSDLVV
PKMIFREHVLDVMTRIAFDTAGHRNTISGQWQFQKFFEDVDALSNSVLFQPGFISTVL
PNTKVDDYETNLVGVAKYLKERAEEAIYTNRETSLVQRYSDFFLKRESSIAQPEDGWR
LMYDFFTAGVNTVPEVITWDIYYLASDLDLQDKIRQEMKNFGTDVTYDDRTSLPRTMS
FLTEVHRLRPVATLGIPHAASADGTIGGYAIDKETLIIPNHWAMHNDPSDWQSDPQVF
DATRFLKLNGNDLTFDSDKARIVTPFSVGPRSCPGQEVARRTIFLTIANLVQKYTVRF
PEGPVVDVSNVKGSFTMYAPKFEVLLEPRSRPPPQRN
>GLEAN3_27796
missing N-term and some middle seq
starts at C-helix
MDSIASCGFGLKVNSQEDKDDPFVEHVARFFKVSFTSPAFMAVSFMPFLKHIFEYLNI
TLFPRDVIEFFSDVTEKAVALRENNTSEKRADFLQLLIDAKNGRTETANENDDDDIHN
KYFKNAGTEDDVPTKAQKYMTREELLGQAIIFLAAGYETTSALLTLTSYLLATNPDHQ
DKLIAEIDDIAPKRDDVSYSTISKMPYLDQVVCEALRIYPPATLTDRECGETCTYKGI
KIEKGAHIWIPPYTLHHDPHHWPNPTKFDPDRFSKENREGRNPFTWIPFGAGPRICIG
MRFALMETKMALVRSLQTVRFEVSPLTKIPPDIGQSQILSSTASYMLKVVRRH
>GLEAN3_28152
MAIGIGIIAFIMALVAIVTVPLLKFIRLWIVFNKFHGPPAIPLLGNALQFKQDPNELV
EQVSSWVEEYRTKSKGLMRLWLGPFPVLFIYNPHEVEPGKEPTRHIYEESEGKRLGRQ
ITGKKWFHRRKMLTPTFHFAILHNFIEVFDEQSNTLCQKFGEHAQKQDLVNICPLVTL
CVLDIISETAMGKQLHAQDESNNDYVNAVMRMSDIIQQRQKKPWFWPDPLFDNTKSGK
EHAQCLQILHKMTNTLIDERSKEENQKVDSNDNDGSKRRIAFLDLLLKMQRDDPSFTK
SDIREEVDTFMFEGHDTTAALASWAMFLIGHHTRVQKKLHQELDSVFGDSDRPVTADD
LQKLPYLTCVLKETLRIFPSVPIVGRDLQEDCIIDGKLAPRGTLLIIAIGSLHRDPTQ
FPEPLMFDPDRFLPEFSSKRHPFSYVPFSAGPRNCIGQRFALMEDKVLLANVLRCFSL
ESTQSLKDTMPIAELILRPSEGIHMKITRRKGISNLDSKLLQRKQNNQRSRLVTV
>GLEAN3_28699
missing N-term
FVVLNDRESITQALVKSGEFFSDRFVPGHFNWGIPDPGKKATIAHSNGKPWVDLRRFS
LPALRSFGFGKQSLVPQINLEARYLSEEIRNLRGEPTDLLTTFSKATANIICQLLFSQ
RYEYSDGEMSRVLERMNETLTLIPETDLVNIFEPLIHTSIQRYKRYRESLFSDRDFIM
SHLKSHQETFQKDNIRDFTDAYLADDISEEFELEHFWRVLLDFFAGGTDTTAVVTSWA
ILFLSVHPEVQGKVQTELDTVVGRGRQPNTLDRPDLPYCNATLTEVMRIRPVLPVSVP
HMTSDNVSFRGFTIPKGSIIIPNLYAVHHDPKEWNEPDTFNPDRFLSADRKQFQMNDA
WMPFGVGRRDCVGSQLAKMEAFLLFTNLFQQFEFKLPPNQPTPSMCGANGITMKPKPY
KICAIER
>GLEAN3_28922
missing C-term
MGFTETEPGYWLLAGVLGLLVAWLVKHLITGPKNLPPGPTGLPLLGVAWQIYKGGSDP
LALFAAWSKKYGEMVSFFVGPKPIIILNSYPVIFEAFRHPDLQDRLGSRLLQEIIGLK
NSGIAFSNGDVWKEQRKFTHSVFRSLGVGKKSYEDTVSAEMTQLSSVIKEKKGSPFDP
STLFMQAVSNIICSIVFGTQYTYDDEAFKEIPFGKMKLMTNNIRSVNAFIQSQIESHE
KNRDPDNQRDFIDQYLNKLDETKVK
>GLEAN3_28934
MFFLSSTSVLLGVLGFLFCLWFVRRPKNLPPGPWSLPIIGYRFDSGLIHEAFVDLTRK
YGPVFSVRRGPFLFVVLNDRESMKQALVKSGEFFSERFVPGHIRWGIPDAEKNASISW
SNGKPWDDQRKFSLPALRSFGFGKKSLVPQINLEARYLADEIKALHGEPMDLSAVLNK
ATANIIVQLIFSRRYEYDDPEFIGALEAMADIFSLVSDTDPVNVFESLVHTPWYKPYR
DCTSKLRDFIMSHLNNHQETFQKDNIRDFVDAFLAHDISKDYSLDKFWRILLDFFAGG
TETTAVVTSWAILYLAVHPDVQKKVQTELDVVVGRGRQPNTNDRPNLPYCDATLMEIM
RIRPVIPVSLPHMTSADVSLGPYTLPKGTIVIPNLWAVHHDPKEWCDPHLFNPDRFLS
ADGQTVVKNEAWMPFSIGRRDCLGMQLAKMESFLLFANLFQQFDFKLPPDQPTHSMRG
HPGLSMPPESYKICAIER