472 cytochrome P450 sequence pieces from Amphioxus. Very fragmentary, with two haplotypes for many genes.
From the JGI Branchiostoma assembly, Jan 30, 2008.
Search for P450 at 1.0e-5 or less (481 results, some false positives).
This file has clans 7, mito, 19, 20, 26, 46, 51, 74.
CYP7 clan (12 sequences), includes CYP39; no CYP8 sequences found
$$$$$$$$$$
>fgenesh2_pg.scaffold_10000055|Brafl1
41% to CYP7A1
MPCSSCLAVMIKVMIPTTSTDHGWNLYPPVSLCRQGGVTPPTGWVVTPESYAILCPSWQQVFCEQYQCLSAATLLPAGLL
EAPPLFSYYSGARGVVHGGQGPVGHAPPPDSEMFSENVEYSGAITEQELWFEIYCLFQWNKTVFLFRVRGSSSSFEVTQT
NSPDSVRQALITAGLSSTRLRRAGGTCSTRDDYTYIMWKFLITLTSCFCMKRSNKVSPVEDGDVKEETAPGEEEMNRGTT
ECPVVTAQPPMSQPRSSKSSADVLAELRQDGLLPLNTRGESVAFQVPASEPDAPPRRPVKLAKLEETLQERRERVKKEPA
GSRTKLRQQLSDAANRRDEMLQNRSRKLAESSRRAKAKARAAKKERKSTAFVISSVSDTDAIVPRDSEKAQALEKRLSKR
RKRVAKRITAEDMKKQQELAAERRRRSNKVSPVEDGDVEKEPAPGEEEMVRGTTECPVVTAQPPMSQPRSSKSSADVLAE
LRQDGLLPLNTRGESVAFQVPLVKPASEPDAPPRRPVKLAKLEETLQERRERVKKEPAGSRSKLRQQLSDAANRRDEMLQ
NRSRKLAESSRRARAKARAAKKEGKSTAFVISSVSDTDAIVPRDSEKAQALEKRLSKRRKRVAKRITAEDMKKQQELAAE
RRRCHIDRLAYLSTYSK
MVTELLGVCLAVVLVFVLLQVTTRRRRPGEPPLEPGPLPYLGVALEFSRNPLGFITSRWKKYG
DVFTVRLAGHYTTFVLDPHSFTHAIRNSKVLDFRVFSSKIAHRAFGMPIVYGTHRDWVRADSDALYPKELQGQGLEKVTE
VMMNNLQSAMLAATDVKDKWNKGELWSFVYRIMFSASYKTLFGRHKEDEEETARLLHAMEEFQKYDKRFPEIISNVPWWL
MGQTKKRYEYLKSMVSPTELSQRGVSDFIRMRQEIYADGNLSPDEMTGFNFATMWASLSNTVPAAFWTLFYLLKDPVAMD
AVREEVNQILKETGQSLETVKEAGEMLHVTREQLNDMKCLGSAINEALRMCSASIIIRVATEDAELALESGSTFRIRKGD
RVALYPGFLHMDPEVFDDPETFKYDRFLENGMEKTTFYKNGRKLRHYLLPFGHGASMCPGRFFALNEIKQFVTIVVCYFN
MELMEKQTPPKDQSRAGLGTLAPLKECLFRYSLK*
>fgenesh2_pg.scaffold_63000051|Brafl1
38% to CYP7A danio
96% to fgenesh2_pg.scaffold_10000055|Brafl1
MVTELLGVCLAVVLVFVLLQVTTRRRRPGEPPLEPGPLPYLGVALEFSRNPLGFITSRWKKYGDVFTVRLAGHYTTFVLD
PHSFTHAIRNSKVLDFRVFSSKIAHRAFGMPIVYGTHRDWVRADSDALYPKELQGQGLEKVTEVMMTNLQSAMLAATDVK
AEWNKGELWSFVYRIMFSASYKTLFGRHKEDEEETARLLHAMEEFQKYDKRFPEIISNVPW
(gap)
CLMGQTKKRYEYLKSNTVP
AAFWTLFYLLKDPVAMDAVRAEVDQILKETGQSLETVKEAGKMIHVTREQLNDMKCLGSAINEALRMCSASIIIRVATED
AELALESGSTFRVRKGDRVALYPGFLHMDPEVFDDPETFKYDRFLENGMEKTTFYKNGRKLRHYLLPFGHGVSMCPGRFF
ALNEIKQFVTIVVCYFNMELMEKQTPPKDQSRAGLGTLAPLKECLFRYSLK*
>fgenesh2_pg.scaffold_1047000003|Brafl1
only 6 aa diffs to fgenesh2_pg.scaffold_10000055|Brafl1
MVTELLGVCLAVVLVFVLLQVTTRRRRPGEPPLEPGPLPYLGVALEFSRNPLGFITSRWRKYGDVFTVRLAGHYTTFVLD
PHSFTHAIRNS STGGTEDQRSTTCVQIR ASYKTLFGRHKEEKDETALLLHAMEEFQKYDKRFPEIISNVPWWLMGHTKKR
YEYLK
DTCRHDRQCISRHGVGSCCAPRRPIFSPLPVCKSAGQVGDTCQRSGERLAYPTSVGRRQYIFTCPCAEGLQCELF
SGYADIGTCVPVQY*
>estExt_fgenesh2_pg.C_4350040|Brafl1
25% to CYP8b.c, 23% to CYP7B1 human
MGSVLGTLQLLGWNNQMLKPNREEDFVEKNIGFPCRVVTGNKTVQSVFDIDLFKKEEFCFGVVGEVRKDFTEGVCPCILS
NGKIHEKNKGFLMEVIAKAGEDIPPSTALSVLSNISKWGSTPMSDFESKLTDVAADAFLPNIFGESTHFHGEEIRLYRSG
AI AVRLSIVKALTGRNLDEERRAMTSILEKIKTSERYQQLLDLGKSYGLGEKEATAQLLFPVFINGAYGLAAHLVCTFAC
LDTISAEDREELREEALAALKNHRGLTRESLEEMPKIESFVLEVLRFCPNPVFWSTIATCPTTVEYTTDSGEHTLKIEEG
ERVYASSYWALRDPAVFDKPEDFMWRRFLGPEGDALRKHHVTFHGRLTDTPAVNNHMCPGKDVSLSALKGSIAIFNTFFG
WELQEPPFWTGKKLSRGSLPDNEVKIKSFWVQHPE
DLKEIFPSHFQDIVNEVDDVGDIDVLVKTKTGKYSGSGTNSNVYI
RLFDDKGHQSRELQLDVWWKDDFEKGQEGQYKLKDIKVAAPIVKIELFRDGCHPDDDWYCESVSVQLNPDNNGPTYDFPV
NRWIRQNDHVWLSPGGGEPPKDDVNPIDD*
>CYP7 estExt_fgenesh2_pg.C_10470002|Brafl1 40% to CYP7B1
no allele
MISGILAGCLVVLVVAILVQAVGRKRDPNEPPLESGPVPYLGVALQFAMDSLKFIRSRQKKYGDVFTVKLAGKYTTFVLD
PHSYSDVMRQHKILDFKTVGMDIVERGFGTTHFEKTGRAHVLHTADAYFPVHLQGNALDPLTNTMMGHLQTAMLADIGEAA*
$$$$$$$$
>estExt_fgenesh2_pg.C_1950037|Brafl1
27% to CYP7D1, 30% to CYP7B1
MGGVWSNTYGFIKGVTDGVHMMKPEGEHPSVVRTNPGLPVVALMNQDTIHYAINPETYKKEPYSFGPVGVSKDVLRGHCP
SMFSNDEDHRRKKALLVDAYKQGEKSLPSILFNQIKAHFGEWSRLKDVPDFEERVFHIMSETLTEALFGRKIDGQLCFTW
LNGLITEAKTWIPMPSLAWKRRQAIKAIPELLKAIETAPKYRELVQLCHTHGVEVEEGIFTILYGTLFNGCAAQTAAIVS
SVARLHTLSDAEKN
EIIQTTLQVLEKHGGVSEESLGEMKTLESFILEVLRLHPPVFNYWVLARKDLVISPEKENIKVRKG
ERMLGCCFFAQRDGSVFPDPDRFRWNRFLDEQGGQKKHLFFPRGSFTEAADLNSHQCPGQDIGFFMMKTTLSVFLCYCSW
ELKDAPVWSDKPIRVGNPDDPVRLVRFNFRSEQ
AGRALTQGNRLVLIRAQVCLAVWTLTHLSVSRLVLKLDATTMPRNQR
APGSGGLPVSERRTRGHEKEIEAGWERSKFNEFVSDLVSLERSLPDTRPVRCHKAQVLDNLPTTSVIICFCEEAVSTLLR
SVHSVINRSPPHLLKEIILVDDASTAAYLKEDLDTYMSKFPQVKIVHLPEREGLIRARLRGAEIATGDVLTFLDSHIECN
VGWLEPLLDRIGRNRTTVPCPSIDRINDNTFGYEAANENMRGGFNWGMKFDWVSLPPGEDDRRYQDIWSQNEIIKSPTMA
GGLFSIDRRFFWELGGYDPGFQIWGAENLEISFKDIFYALNPHVENEIANAGDVSDRKRMREQLGCKSFQWYIDHVYPEI
TIPDLRAKARGEVKNRAMSLCLDAVYGEKVGAYFCHGEGGQQSFTLRMDDKIMLRWFFSVCLAAGLPIRNHKGAFLLTKK
PCTAPEVIAWNHTKGGPLVDQKTGKCLGVVNLSPEEHLVALRPCNQQRVQDWTFQNYLVDM*
>estExt_fgenesh2_pg.C_3320046|Brafl1
27% to CYP7D1
MGGVWSNTYGFIKGVTDGVHMMKPEGEHPSVVRTNPGLPVVALMNQDTIQYALNPETYKKEPYSFGPV
GVSKDVLRGHCP
SMFSNDEDHRRKKALLVDAYKQGEKSLSSILFNQIKAHFGEWSRLKDVPDFEERVFHIMSETLTEALFGRKIDGQLCFTW
LNGLITEAKTWIPMPSLAWKRRQAIKAIPELLKAIETAPKYRELVQLCHTHGVEVEEGIFTILYGTLFNGCAAQTAAIVS
SVARLHTLSDAEKNEIIQTTLQVLEKHGGVSEESLGDMKTLESFILEVLRLHPPVFNYWVLARKDLVISPEKENIKVCKG
ERMLGCCFFAQRDGSVFPDPDRFRWNRFLDEQGGQKKHLFFPRGSFMEAADLNSHQCPGQDIGFFMMKTTLSVLLCYCSW
ELKDAPVWSDKPIRVGNPDDPVRLVRFNFRSE QAGRALVNTSAKKI*
>estExt_fgenesh2_pg.C_1940045|Brafl1
27% to CYP7D1
MGGVWSDTFGFIKGLVHGPHMMKPEGEHPSVFRANPGVPAVVLLNRDTIQYAFNPETYEKEPYSFGPVCAAKDVVGGHCP
SMFSNDEDHRRKKALLIDVYKQGQKTLPSVFFSQIKAHFEEWSRLEDVPDFEERVFHITSETLTEALFGKKIDGRLCYTW
GNGIPTDFRTWIPIPPAARKRRQAVEVLPALLKAIKETPKYQELVQLCHTHGVEVEEGILTILYGTLFNGCGAQTATIIS
SVACLHTLSDAEKNEIIQTTLQVLEKRGG
ISEESLSEMKTLESFILEVLRLHPPVFNYWALARKDLVISPEKENIKVCKG
ERMVGSCFWAQRDGSVFPDPDRFRWNRFLDEDEQGGQKKHLFFPRGSWTEAADLDSHYCPGQDIGFFILKVLLAVLLGYC
SWELKDAPV WSDNTFRLGNPDDPVRLARFNFRSEQAGRALGIRPDNIAPNAI*
>estExt_fgenesh2_pg.C_510020|Brafl1
30% to CYP4V6
90% to estExt_fgenesh2_pg.C_1940045|Brafl1
87% to estExt_fgenesh2_pg.C_3320046|Brafl1
87% to estExt_fgenesh2_pg.C_1950037|Brafl1
MKPKGEHPSAFRMNNGVPAVVLLTRDTIQYAFNPETYEKDPYSFGPGGVSKDVVRGHCPSMFSNDEDHRRKKALLIDVYK
RGQKTLPSVFFSQIKEHLEEWSRLEDVPDFEERVFHIMSETLTEALFGRKIDGELCFTWLNGLLTDFKTWIPIPSMSRKR
RLAIEALPALLKAIKEAPKYQELVQLCHTHGVEVEEGIFTILYGTLFNGCAAQCAAIVSSVARLHTLSDTEKNDIIQTTL
QVLEKHGG VSEESLGEMKTLESFILEVLRLHPPVFNFWCLARKDLVISPEKENIKVCKGERMVGCCFWAQRDESVFPDPD
RFRWNRFLDEDKQGGQKKHLFFPRGSWTEAPDLDSHQCPGQDIGFFMMKALLAVLLGY CSWELTAAPMWSDKTIRVGNPD
DPVRLARFNFRSEQAGRALGIRPDNIAPNAI*
$$$$$$$$$
>CYP39 amphioxus 49% to CYP39 zebrafish, start MET not certain, 2 choices
MATTIGEHSPGDELYNAFKY
MILFSLCFAFFSWRNIVKKGRPPCMDGWIPWFGCAIDFGKAPLDFIEETKRK
(0)
LGPVFTIVAAGRWMTFVTEPEDITTFFQSPNLDFQKAVQDPVSHT
(1)
ASVSTESFFQHHTKIHDTIKGRLAPANLHSFCSNLWGEFKQQLEQLEHHGKDDLNTLVRR
(2)
CMFAAVVNNLFGAENVPTDKDRIQEFSDIFVKYDADFEYGSQLPPFFLR
(2)
EWAESKKWLLSLFSRSIANMERKETESQ
(0)
TLLQSLTKMVDRPHAPNYALLMLWASQANAVP(0)
MSFWVLAMILSNEDVHAAVKKEVQDNLGSP
(1)
GDEPITEEDLKKLPLLKRCIMETIRLRSPGVITRAVDKPLRIR
(0)
KYIVPKGHLLMMSPYWAHRNPNFFPEPDKFLP (0)
DRWLDADLEKNLFLDGFVGFGGGRYQCPGR
(2)
WFALMEMQMLLAMMIQMFDFKLLGEVPKEVCQNFNYLISIHII*
>fgenesh2_pg.scaffold_124000018|Brafl1
45% to CYP39A1
MATTIGEHSPGDELYNAFKYMILFSLCFAFFSWRNIVKKGRPPCMDGWIPWFGCAIDFGKAPLDFIEETKRKLGPVFTIV
AAGRWMTFVTEPEDITTFFQSPNLDFQKAVQDPVSHTASVSTESFFQHHTKIHDTIKGRLAPANLHSFCSNLWGEFKQQL
EQLEHHGKDDLNTLVRRCMFAAVVNNLFGAENVPTDKDRIQEFSDIFVKYDADFEYGSQLPPFFLRSIANMER
(gap)
KETESQT
LLQSLTKMVDRPHAPNYALLMLWASQANAVPMSFWVLAMILSNEDVHAAVKKEVQDNLGSPGDEPITEEDLKKLPLLKRC
IMETIRLRSPGVITRAVDKPLRIRKYIVPKGHLLMMSPYWAHRNPNFFPEPDKFLPDRWLDADLEKNLFLDGFVGFGGGR
YQCPGRWFALMEMQMLLAMMIQMFDFKLLGEVPKESPLHVVGTQQPVGPCPVEWTKI*
>CYP39A1 fgenesh2_pg.scaffold_124000030|Brafl1 45% to N-term
MATTIGEHSPGDELYNAFKYMILFSLCFAFFSWRNIVKKGRPPCMDGWIPWFGCAIDFGKAPLDFIEETKRKLGPVFTIV
AAGRWMTFVTEPEDITTFFQSPNLDFQKAVQDPVSHTASVSTESFFQHHTKIHDTIKGRLAPANLHSFCSNLWGEFKQQL
EQLEHHGKDDLNTLVRRCMFAAVVNNLFGAENVPTDKDRIQEFSDIFVKYDADFEYGSQLPPFFLREWAESKKWLLSLFS
RSIANMERKETESQTLLQSLTKMVDRPHAPNYALLMLWASQANAVP
(gap)
KYIVPKGHLLMMSPYWAHRNPNFFPEPDKFLP
LSDLDGETRSMVEKMMYDQRQKAMGLPTSDEQKKEDVLKKFMEQHPEMDFSKAKFC*
Mito clan (28 sequences, some duplicates)
$$$$$$$$
>CYP11amphi
mixed seq 43% to Gene C, 35% to Gene B, 34% to Gene D
36% to 27B1 fugu, 38% to 11A1 fugu, 33% to CYP24 fugu, 32% to 27C1 fugu
37% to chicken CYP11A1, 39% to catfish Ictalurus punctatus 11A1
This is a probable CYP11A gene
(2)
EAKPFSALPGPPSVPVLGNFLHMWWEGLLEKEKLNKNHIMFTDFFRQYGPIFR (2)
(2)
LKIVNVDMVSIKDPVAVQELFRKEGKYPARIDIKPWRRYREISGKATGVFLS (2)
(2)
NGKDWQKNRSIMARPMLRPKHVSTYVSNLDTVSADMIKRLRVLQARADGIEV PNISDELFKWALE (1)
(1)
SICTVLFNERMGYLQDNISQDAQDFIQGIHTIFLTTNTVIFPDADVHRFLRTKPWRQSVEAWDTVFRV(1)
(1)
GEKVMVRKLQEALEREERGEGEDDQPNFLAFVNSTGRLTKDEIYSNTIELMGAAIDT (0)
(0)
TSNTLLWTLYELSRRPELQDRLYQEVTQVIGQDKVMTWDHLKDLHLLKAIIKETLR (2) 885
(2)
MYPVVHNVSRLLQEDTVLMGYRLPAK (0)
(1)
TCVVAQVYAMGRDPQLFPDPDEFKPERWLRTGEAHDEINPYSSLPFGFGPRSCL (1)
(1)
GRRVAEVELQLLLAK (0)
(0)
MSQQFVLSQVEPEEISSVAQPLLMPETPLHLRFVDRK*
>fgenesh2_pg.scaffold_28000018|Brafl1
34% to CYP27C1 98% to 11amphi above 6 aa diffs
MMSVPVISGSRQRLSAVVGRAVSPWRPQGHIRVRALVGYRSGLVGPRTVPSPVQTYSTAAVGSTSHHNDDSEAKPFSALP
GPPSVPVLGNFLHMWWEGLLEKEKLNKNHIMFTDFFRQYGPIFRLKIVNVDMVSIKDPVAVQELFRKEGKYPARIDIKPW
RRYREISGKATGVFLSNGKDWQKNRSIMARPMLRPKHVSTYVSNLDTVSADMIKRLRVLQARADGIEVPNISDELFKWAL
ESICTVLFNERMGYLQDNISQDAQDFIQGIHTIFLTTNTVIFPDADVHRFLRTKPWRQSVQAWDTVFRVGEKVMVRKLQE
ALEREERGEGEDDQPNFLAFVNSTGRLTKDEIYSNTIELMGAAIDTTSNTLLWTLYELSRRPELQDRLHQEVTQVIGQDK
VMTWDHLKDLHLLKAIIKETLRMYPVAPNVSRVLQEDTVLMGYMLPAKTCVVAQVYAMGRDPQLFPDPDEFKPERWLRTG
EAHDEINPYSSLPFGFGPRSCLGRRVAEVELQLLLAKMSQQFVLSQVEPEEISSVAQPLLMPETPLHLRFVDRK*
$$$$$$$$$$$
This block is related to Gene B
>Gene B 84% to Gene D, 35% to CYP11 amphi, 33% to Gene C
30% to CYP24 fugu, 30% to 27A3 fugu, 27B fugu, 27C fugu, 30% to 11A fugu
In nr blast the best mammal hit is CYP24 mouse, but Drosophila hits are better.
34% to 49A1 D. melanogaster
MYQLLSAARHQGQSLFRVCRARSLAALKTTYRPQSNKAEESVTYDTAARPFEEIPGPKGLPLIGTALEYTPF(1)
(1)
GQFKMITNLRESFRERTRTYGSIYRERIGPLDLVVISDPKEIEKVFRNE
GRYPERIELASIKVYREIKKLPTGLINL (2)
(2)
NGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVQNFINYVYRWALE (1)
(1)
AISVVVLDKRLGCLTLGDLEPGSDAKLMIDGVNDFFDAFVKLEMSATGL YKYISTPTWRKFAKAVDQFHR (2)
(2)
VAEKLLKEKLAKTTTEDGKPAESDTDFLQSLLSRNDVTFEEAMEMAVDLLSAGIDT
(0)
SGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFR (2)
(2)
VYPTVLNNVRRLDQDIVLSGYVVPAK (0)
(0)
TTILLAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCL (1)
(1)
GRRFAEQELHLGLIR (0)
(0)
IVQNFHVGWAGEDMKQDNRIILAPDRDSFVFSERT*
estExt_gwp.C_8820003|Brafl1 34% to CYP24
2134 6.9e-224 1
>fgenesh2_pg.scaffold_214000064|Brafl1
3 genes fused
31% to CYP24
MYQLLSAARHQGQSLFRVCGARSLAALKTPCRPQSNKAEESVTYDTAARPFEEIPGPKGLPLIGTALEYTPFGQFKMITN
LRESFRERTRTYGSIYRERIGPLDLVVISDPKEIEKVFRNEGRYPERIELASIKVYREIKKLPTGLINLNGPEWQRVRSS
VQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVQNFINYVYRWALEAISVVVLDKRLGCLTLGDLQPGSDA
KLMIDGVNDYFASLVKLEMSATGLYKYISTPTWRKFAKAIDQWHFVAAKLLKEKLAKSATKDGKPAESDTDFLQSLLSRS
DVTFEEAMLMAVDLMAAGIDTSGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFRVYP
TVLNNVRRLDQDIVLSGYVVPAKTTILLAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRF
AEQELHLGLIR
30% to CYP24
SNKAEESVTYDTAARPFEE
IPGPKGLPLIGTALEYTPFGQFKMITNLRGSFRERTRTYGSIYRERIGPL
DLVVISDPTEIEKVFRNEGRYPERIELASIKVYREIKKLPAGLINLNGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTR
DLVDVIRALIGKEESGGQVQNFINYVYRWALEAISVVVLDKRLGCLTLGDLQPGSDAKLMIDGVNDYFASLVKLEMSATG
LYKYVSTPTWRKFAKAIDQWHLVAAKLLKEKLAKTATKDGKPAESDTDFLQSLLSRSDVTFEEAMLMAVDLMAAGIDTSG
NTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFRVYPTVLNNVRRLDRDIVLSGYVVPAK
TTILMAHDVISSLPEYYPEPEVYRPERWLRDDESSSVQPFTLLPFGYGPRMCIDPNKKVRMY
31% to CYP27A1
RLQRAVRHQGQSLFRVCG
ARSLAALKTTVTQTQSTRAEESGVYDTAARPFEEIPGPKGLPFIGTGWDYSPFGRFPIKTNFRDSFRERTRTYGSIYRER
IGPLDLVVISDPKEIGKVFRNEGKYPERPPMGSIKTYREVRKLPTGIANLNGPEWQRVRSSVQKDLMRPKTVGAYASLQD
DVTRDLVDVIRALIGREESGGQVQNFTNYVYRWALEAISVVVLDKRLGCLTLGDLEPGSDAKLMIDGVNDFFDAFVKLEM
SATGLYKYISTPTWRKFAKAVDQFHSVAEKLLKEKLAKTTTEDGKPAESDTDFLQSLLSRNDVTFEEAMEMAVDLLSAGI
DTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVRQETFRIYPTALSNMRTLDRDMVLSGYA
VPAKTIVLMAHDVISSLPEYYPEPEVYRPERWLRDDESSGVQPFTLLPFGYGPRMCIGRRFAEQELHLGLIRIVQNFHVG
WAGEDMKQVHRLILSPDRDTFVFSERT*
>fgenesh2_pg.scaffold_214000063|Brafl1
34% to CYP24
MSLLQRAVRQQGQSLFRVCGVRSLAALKTTYRLQSTRAEESVADDTAARPFEEIPGPKGLPLIGTALEYSPFGRFPIKTN
LRSSYRERTKIFGSIYREKIGPLDLVVISDPKEIEKVFRNEGRYPERLPLESIKAYRELKKLPAGVVNLNGPEWQRVRSS
VQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVQNFINYVYRWALEAISVVVLDKRLGCLTLDDLEPGSDA
KLMIDGVNDFFDSFVVLETSATGLYKYISTPTWRRFEKAIDQWHTVAAKLLKEKLAKGATEEGKPAESDTDFLQSLLSRN
DVTFEEAMMTVVELLAGGIDTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIGDKVLNRMHYLRAVVKETFRVYP
TVPNNLRKLDRDIVLSGYRVPAKTTVFMVDDVISSLPEYYPEPEVYRPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRF
AEQELHLGLIRIVQNFHVGWAGEDMKQVNRMVFAPDRDTFVFSERT*
>fgenesh2_pg.scaffold_214000062|Brafl1
two genes fused
30% to CYP27A3, 31% to CYP24
MQTLFSDWTGFSAFWTGQIFPKTPHTIDDFDSGLGSQSTRAEESVAYDTAARPFEEIPGPKGLPLIGTGLDYAPFGRFPL
KTHLRESFRERTKAYGSIYREKLGPLDLVVISDPKEIEKVFRNEG
(gap)
RNGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTR
DLVDVIRALIGKEGSGGQVQNFTNFVYRWALEAISVVVLDKRLGCLTLDDLEPGSDAKLMIDGVNDFFNAAVKLELSGAG
RLYKYISTPTWRKFANAIDQWHGVAAKLLKEKLTKSAAEDGKPAESDTDFLQSLLSRNDVTFEEAMLMAVDLMAAGIDTT
GNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFRLCPTVGNNIRTLDRDMVLSGYVVPA
KTKIFMAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRFAEQELHLGLIRLAVRHQG
QSLLRVCGARSLAALKPTY
25% to CYP24
RLQSTRAEESVADGTAARPFEEIPGPKGLPLIGTALDYTPFGRFPLKTNFRESFRERTRTYGSIY
REKIGPRELVVISDPKDIQKVYRNEGRYPERPQVDSIKTYREMKKLPAGIVVLNGPEWQRVRSSVQKDLMRPKTVGAYAS
LQDDVTRDLVDVIRALIGKEGSGGQVHNFINYVYRWTLESIGVVVLDKRLGCLTLGDLEPGSDAQLMIGGVNDFFNAFSK
LEMSATGLYKYISTPTWRKFQKAIDQWHTVAAKLLKEKLTQSTIEDGKPAESDTDFLQSLLSRNDVTFEEAMEMALDLL
(gap, missing I-helix) 37% to 27C1
VYPTFLNNVRTLDRDIVLSGYVVPGKTIIIIGNDIISSLSEYYPEPEVYKPERWLRDDEFSSVQPFTLLPFGYGPRMCIGR
RFAEQELHLGLIRIVQNFH
VGWAGEDMKQENRMVFAPDRDTFVFSERT*
>fgenesh2_pg.scaffold_214000072|Brafl1
33% to CYP27A3
MATGRATSRRNGQWGATLAIREINGPEWQR
VRSSVQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKKESGGQVQNFT
NYVYRWALEAISMVVLDKRLGCLTLNDLEPGSDAKLMIDGVNDFFDAFVKLEMSATGLYKYISTPTWRKFAKAFDQWHAV
AEKLLKEKLAKSAAEEGKPAESDTDFLQRLLSSKDITFEEAMMMAVDLMAAGIDTTGNTLMFNLFCLAKNPEAQEKLYRE
IQEVVPAGQPIDDKVLNRMHYLRAVRQETFRFYPTVLSNTRILDRDVVLSGYFVPAKTIVLMAHDVISSLPVYYPEPEVY
KPERWLRGDESSSVQPFALLPFGYGPRMCIGRRLAEQELHLGLIRIVQNFHVGWAGEDMKQNNRIILAPDRDTFVFSART*
>e_gw.882.7.1|Brafl1
RERTKIFGSIYREKIGPLDLVVISDPKEIEKVFRNEGRYPERLPLESIKAYRELKKLPAGVVNLNGPEWQRVRSSVQKDL
MRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVHNFINYVYRWALEAISVVVLDKRLGCLTLDDLEPGSDAKLMID
GVNDFFDSFVVLETSATGLYKYISTPTWRRFEKAIDQWHTVAAKLLKEKLAKSAAEDGKPAESDTNFLQSLLSRSDVTFE
EAMMTVVELLAGGIDTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIGDKVLNRMHYLRAVVKETFRVYPTVPNN
LRKLDRDIVLSGYRVPAKTTVFMVDDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRFAEQEL
HLGLIRVGSFAV*
>estExt_gwp.C_8820003|Brafl1
34% to CYP24
MSLLPRVVRHHGRLFNVCSARSLVTYRSQSTRAEESVAYDTAARPFEKIPGPKGLPLIGTGLDYAPFGRFPLKTHLRESF
RERTKAYGSIYREKLGPLDLVVISDPKEIEKVFRNEGRYPERVQLESVRTYREIKKLPIGVVNLNGPEWQRVRSSVQKDL
MRPKTVGAYASLQDDVTRDLVDVIRALIGKEGSGGQVQNFTNFVYRWALEAISVVVLDKRLGCLTLDDLVPGSDAKLMID
GVNDFFNAAVKLEMSGAGRLYKYISTPTWRKFANAIDQWHGVAAKLLKEKLAKSAAEEGKPAESDTDFLQSLLSRSDVTF
EEAMLMAVDLMAAGIDTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAEQPIDDKVLNRMHYLRAVVKETFRLCPTVGN
NIRTLDRDMVLSGYVVPAKTKIFMAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRFAEQE
LHLGLIRVSFVALFRH*
$$$$$$$$$
>Gene D 84% to Gene B, 34% to CYP11 amphi, 30% to Gene C
31% to CYP24 fugu
MSLLPRVVRHHGRLFNVCSARSLVTYRSQSTRAEESVAYDTAAR
PFEKIPGPKGLPLIGTGLDYAPF (1)
(1)
GRFPIKTNLRDSYRERTKTYGSIYREKIGPRELVVISDPKDIQKVYRNE
GRYPERPQVDSIKTYREMKKLPAGIVVL
(2)
(2)
NGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTRDLVDVIKALIGKEESGGQVHNFINYVYRWTLE (1)
(1)
AISVVVLDKRLGCLTLGDLEPGSDAQMMIGGVNDFFNAFAKLEMSATGL YKYISTPTWRKFQKAIDQWHT (2)
(2)
VAAKLLKEKLTQSTIEDGKPAESDTDFLQSLLSRNDVTFEEAMEMALDLLSAGIDT (0)
(0)
TGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAEQPIDDKVLNRMHYLRAVVKETFR (2)
(2)
LCPTVGNNIRTLDRDMVLSGYVVPAK (0)
(0)
TKIFMAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCI (1)
(1)
GRRFAEQELHLGLIR (0)
(0)
IVQNFHVGWAGEDMKQVNRLVLSPDRDSFVFSARA*
$$$$$$$
>Gene F 61% to Genes D and B
MSRILQIVGRRAAFTQAGLQNVPVWRPLGGRNGRGAASSAAATEQTTVQDGAARPFDEIPGPRGLPFIGTALDYSPF
(1)
(1)
GRFPIHTKMANSTIERYQTYGKIYREKIGLRDMVFVCDPKDIETVFRSDGRLPERPIPESIATYRRLKNKPLGVALL
(2)
(2)
NGEEWFRLRRSVNKDMMRPKAVGAYATMQDEVSRELVGLIQGVVRKGKTAGQVPDFTKLLYKWGLE (1)
(1)
ALSLVVLGKRLGCLTLDQLPEDSDAQRMIGAVNDFFYSFAKLQMSFPLFRYIRTPGWTTFERAMDTVSS (2)
(2)
ITEKMIGERLEKLRQMEEPPDEADFLTSLLSREDMNLDEAIQMSVDLLQGAIDT (0)
(0)
TAHTLVFNLYCLAKNPDAQQKLYEEILEVVPPEQPIDDRVLNKMHYLRAVVKETFR (2)
(2)
MYPTLLSTARTLTRDVVLSGYHVPAK (0)
(0)
TNVMLAQNVISTLPEYYPEPESYIPERWLRTESSNVQSFSLLPFGYGPRMCI (1)
(1)
GRRFAEQELYLGLVR (0)
(0)
IIQNFHVGWDGEDMKQVWRIFNAPDRDTFVFSERKS*
>fgenesh2_pg.scaffold_119000067|Brafl1
29% to CYP11A1
same as Gene F above
MTHTGNADGSVHGIEILANGSLQDKYSLSQGDMDGPIVPVNETITADGVQRNVILVNDQFPGPTLEVMEGAQVVVTVVNE
LLREATSLHFHGMYMRGVPYMDGVPYVTQCPILPMHSFTYRFKAEPAGTHWYHSHLGSQKEDGLYGAFIVHKNSIPTTPS
LPMFLQDWWHDDFNTIDVDSAYMEHRGPGRFFGPWQERGFSFEGTELTALNFKSALINGRGRYNNNSAPLTRFEISSGET
LRFRLINAGAEYTFRVSIDAHSMTVVANDGHDVEPVHVQSILVFPGESYDFEVVGDPSNSGTYWIRAQTLWAGKGPDVEP
EDRLQEVRAILAYDNAPTDEDPNSAMQTCTENSPCRVLNCPFPAFPAGSNTECIYVSDLNSTEEYSMSDESETEEYFFNF
GYQIGSSVNGRKFDTPKKPLIFKAPYDITPCEATCETDGCKCTYMVEIPLGKTIRFVLMDLGVESEGHHLIHLHGYDFRV
LAMGFPVHNETTGRWISQNADIDCGNDNKCNMASWNVTRPNLNYNKPPIRDTVVIPARGYTVIEFRSNNPGFWYFHCHQT
THMNEGMSMIIAEALDKLPALPYGFPTCGDFTGTEKPPGRGRTVAAMEQSVTKVELDHTQLVIIIVISAAMSATIALAAV
GIYNARAKVNAFQRQVVKRSYVVCDQALGPQVLTTDKPLDTRHKPRGIMHLLNAFILPCLCVTMATTQRCTDDVCEFTLV
VRYARTMTHTERDGEVHGIEILTNGSLQDKYSLSQGDMDGPIVPVEETITADGVQRNVIVVNDQFPGPTLEVIEGAQVVV
TVVNNLLREATSLHFHGMYMRGVPYMDGVPYVTQCPILPMHSFTYRFMAEPAGTHWYHSHLGSQKEEGLYGAFIVHKNSM
PTTPSLPMFLQDWWHDDFNNIDVDSAFMEHRGPGRFFVPWQNRGFSFDGNKLSSVRFISALINGRGRYNNNSAPLTRFEI
SPGETLRFRLINAGAEYTFRVSIDAHSMTVVANDGHDVEPVQVQSILVFPGESYDFEVVGDPSNSGTYWIRAQTLWAGKG
PDVEPEDRLQEVRAILAYDNDPTDEDPNSDMQNCTENSPCRVLNCPFPAFPAGSNTECVYVSDLNSTEEYSMPDESETEE
YFFNFGYQIGSSVNGRKFATPKKPLIFKAPYDITPCEATCETDGCTCTYTTEIPLGKTIRFVLMSLGFGSGGHHVIHLHG
YDFRVLAMGFPEYNETTGRWITQNDDINCGDDNKCNMAAWNVARPNLNYNKPPTRDTVVIPARGYTVIEFRSNNPGFWLF
HCHQTTHMKEGMSMIIAEALDKLPALPYGFPTCGDFTGTEKPPGRGRTAAAMEQSVTLVELDNTQLVIIIVVSAAMSATI
ALAAVGIYNARVNKSKEKMIDTP
IVGRRAAFTQAGLQNVPVWRPLGGRNGRGAASSAAATEQTTVQDGAARPFDEIPGPR
GLPFIGTALDYSPFGRFPIHTKMANSTIERYQTYGKIYREKIGLRDMVFVCDPKDIETVFRSDGRLPERPIPESIATYRR
LKNKPLGVALLNGEEWFRLRRSVNKDMMRPKAVGAYATMQDEVSRELVGLIQGVVRKGKTAGQVPDFTKLLYKWGLESLS
LVVLGKRLGCLTLDQLPEDSDAQRMIGAVNDFFYSFAKLQMSFPLFRYIRTPGWTTFERAMDTVSSITEKMIGERLEKLR
QMEEPPDEADFLTSLLSREDMNLDEAIQMSVDLLQGAIDTTAHTLVFNLYCLAKNPDAQQKLYEEILEVVPPEQPIDDRV
LNKMHYLRAVVKETFR
CAINSIMARHRTLHHGHRRKLSFIIPVLLVYVLVSAFLDLTYSGYMAKHVSDGDSHQTITTTEG
TNMTKLLWEGLSRLEQMDQQRANLTEKLKNIAKMANVSEEAIGPWLSQLRPMTIVDAPAGNRTALLTCQDIAEIRISNPM
GKGVTKVVELGNYQGHGVAVKRVLPTVKDVRECKRTIERSGWNKCFVFPNYKLLKEILLLQQLKHPNIVQLLGYCVQNEE
TDENLAEHGVVSVTEMGTKFHVGRARKMDWKMRLKMAIDLASLLDYLEHSPMGSLLMADFKVEQFVWVGGKVKLTDLDDV
SNVERKCAVDSDCWVDKKDVGVPCTNGSCRGLNAKHNMNGAYKTILRHIMVHTGTEETALREDLRSVSISAASLHSRLLQ
LLDKELAIDSPTHR*
$$$$$$$
>Gene G 55% to amphi 11
MFLGLMRCQTPSQTYSTGPQAASHPQLDPP
AKPFSALPEPMKGLPGILKTLVVLCTGGMSRKAQLKSHVVIGQLFQMYGPILR
(2)
NRFGNFDMVNICDPDAAREVFKVEGKYPERLDIAPWRLHREDAGKELAVLLG
(2)
NDKKWHKNRTVVSRPMLRPQSVAAYVLKIDDVATDMLQHIRSVRAGPDGTEVLDLENELFKWALE(1)
SISAVLFNERMGLLQDNIPQDAQDFINGMHDAFDSLTRAMTPDARLHKLLNTKSWQKNKQAWDT
(0)
GEKVMDRQLQRAEERQARGEADDGQLDFLWFISSREKLTKEEIYANAIELMGAAIDT
(0)
TSTTLLWTLYQLCHRPDLQDKLYQEVTQVIGQDEVITYDHLKNLHLFKAVIKETLR
(2)
LHPVAFAITRVIQQDTVLMGYKIPAK
TVVMVSLYDMARDPRLYKNPEEYRPERWLRGAEDYVDTHPYAYLPFGFGTRSCI
(1)
GRRVAETELQVLLAK
(0)
ICQQFVLKQRNPRVIPAMTKGILMPAEKMDICFIERQ*
>e_gw.241.76.1|Brafl1
33% to CYP27C1, 99% to gene G above
FGNFDMVNICDPDAAREVFKVEGKYPERLDIAPWRLHREDAGKELAVLLGNDKKWHKNRTVVSRPMLRPQSVAAYVLKID
DVATDMLQHIRSVRAGPDGTEVLDLENELFKWALESISAVLFNERMGLLQDNIPQDAQDFINGMHDAFDSLTRAMTPDAR
LHKLLNTKSWQKNKQAWDTVFKIGEKVMDRQLQRAEERQARGEADDGQLDFLWFISSREKLTKEEIYANAIELMGAAIDT
TSTTLLWTLYQLCHRPDLQDKLYQEVTQVIGQDEVITYDHLKNLHLFKAVIKETLRLHPVAFAITRVIQQDTVLMGYKIP
AKTVVMVSLYDMARDPRLYKNPEEYRPERWLRGAEDYVDTHPYAYLPFGFGTRSCIGRRVAETELQVLLAKICQQFVLKQ
RNPRVIPAMTKGILMPAEKMDICFIERQ*
>fgenesh2_pg.scaffold_140000032|Brafl1
31% to CYP11A2
90% or more to Gene G above
MIRLCALTQRRSAATIVGRWLDFHRGARAASQGLLRCQTPNQPYSSGPQAASHPQLDPPVKPFSALPEPMKGMPGILKFL
VVLCTGGMSRKAQLKSHMMIGQLFQMYGPILRNRFGNFDMVNTCDPDAAREVFKVEGKYPERLDIAPWRLHREDAGKELA
VLLGNDKKWHKNRTVVSRPMLRPQSVAAYVLKIDDVATDMLQHIRSVRAGPEGTEVLDLENELFKWALESISAVLFNERM
GLLQDNIPQDAQDFINGMHDAFDSLTRAMTPDARLHKLLNTKSWQKNKQAWDTVFKIGEKVMDRQLQRAEERQAR
(gap) 37% to CYP27A.c
GEADDGQLDFLSFISSREKLTKEEIYANAIELMGAAIDT
VNSTSMSITLSQLVTDTVHE
TSTTLLWTLYQLCHRPDLQDKLYQEV
TQVIGQDEVITFDHLKNLHLFKAVIKETLRLHPVAFAITRVIQQDTILMGYEIPAKTVVMVSLYDMARDPRLYKHPEEYR
PERWLRGAEDYVDTHPYAYLPFGFGTRSCIGRRVAETELQVLLAKICQQFVLKQRNPRVIPAMTKGILMPAEKMDICFIE
RQ*
$$$$$
>fgenesh2_pg.scaffold_283000056|Brafl1
29% to CYP24
MSHILKIAGRRTAVRHQLRLPGFWRFCGRQGVRGAATTATAAEQVAPEETVRPFQE
IPGPKGLPFIGTALDYSPFGRFPIHTQLGNSAIERY
KTHGKIYREKLGPGREMVFVCDPKDIGTVFRSDGRLPERPPVNSIATYRKMRKKPPGLGNLMGEDWHR
VRSSVNKEMMRPKSVGAYATMQDDVSREMAELIQTVVRKGDSGGQVDNFMNLMHKWGLESLSLVILGKRMGCLTLDQLAE
DSDAQRMISAVLEFFLYFGKLEMSLPFYRYFSTPAWKKFETAMDTMN
(gap)
SLLSQKDMTLDEAVMMAIELLTGAFESTANTLA
FNLYCLAKNPAAQQKLYEEIMNVVPPGQPIDDRVLNKMSYLRAVFKETSRLYPTIFFNARTLTRDVVLSGYHVP
AKIIQKFHVGWDGEDMKQIYKIFNTPDRDTFIFRERE*
>e_gw.77.176.1|Brafl1
33% to CYP24
93% to fgenesh2_pg.scaffold_283000056|Brafl1 (allele)
KTYGKIYREKLGPGREMVFVCDPRDIGTVFRSDGRLPQRPPVNSLATYRKMRKKPLGLGNLMGEDWHRVRSSVNKEMMRP
KSVGAYATMQDDVSREMAEQIQTVVRKGDSGGQVDNFMNLMHKWGLESLSLVILGKRLDCLTLDQLAEDSDAQRMISAVL
EFFLYFGKLEMSLPLYKYFNTPAWKRFVRALDTMN
RYAICPIQERILTELSKLEEPPQETDFLSN LLSQKDMTLDEAVMM
AIELLTGAFESTANTLAFNLYCLAKNPAAQQKLYEEILEVVPPGQPIDDRVLNKMSYLRAVFKETSRLYPTIFFNARTLT
RDVVLSGYHVPAKTQIIMANNVISTLPEYYPDPEAYIPERWLRTESSAANVQAFALLPFGYGARMCVGRFLPVKNRSVS*
$$$$$$$$$
>fgenesh2_pg.scaffold_191000017|Brafl1
27% to CYP27C1
MGITGVLGRRCDAVMRSGRVFNGQWKCGRSSLRNVGLCILRKSSSTVTNVGMETCVDPTANKTDVAVRPFHEIPGPKGLP
IIGSLWEYTFLGKLDPRRFDEVLWNRYQEYGKIYKEDLGPRGTFVRIADPGDIETVYRNEGRYPHRPSFPLVRESMEAAG
QELLKHRARSESSFNGQGLEWYRTRSAVNRTLLRRSGVALFHPTLNEISDDFLTLLKRSLDENNTVPDITWQIRRHNTEV
AGTTIFGRRPGCLEPDFSGSCQTSEMIKSIDDFFASWLKLEIGFPLTKYLLKDTWNGYMNAHRNILRIVKYHMDLDVEYE
DSRPSVLGYLLSESSLSDTDAAMSAVELFVGGMQSSSHADMFQLYELARHPHVQETIRREVTEALPKGEAVTSAHLHKLP
YLKAFVKETFRFHPVGLLHMRILDRDVVLSGYRVPAHTTIEIPMSVLGRLEELYPQADRFLPERWLRRGPNGFRSRMFSH
VTPFGHGPRACIGRRLAEDKFYIQIAKLVQNFDLHCDEEVGTVTGCFQELSPTPNIRFTPR*
$$$$$$$$
>estExt_GenewiseH_1.C_30140|Brafl1
33% to 11A2, 33% to CYP27A1
MGGWMDKFHLHMQNRWRQYGSIYKENIGPQEIVCMFDPEDVAPVLRAEGRYPRRYAFDSFYLAREIMGHKLGVFLENDEK
WQQYRTVMNKKLLRPQQAAAFTPLMDEAASNFMSYLRRKRDQGGMVTDLQAHLFRWAMESGCTAMFNQHLGLLSEDPPQL
AKDFISSTMAVLDTTNTMMTIPPKVHKALNTKAWKEHLEGWQTSFRVTKQLIEEIMERGLEKESEEDEEIPDLVSYLLSV
KLRPEEVLANIVDVLGGAVDTTSNTMAFTMHTLARHPDIQEKLHDEVMRVAPDHQAPVTQEQVHKMPYLRGVIKEVLRLY
PVAYVFSRVLNHDAVVHGYKIPAGTNLVVCPYVMGRDPNSYDDPEEFRPERWYRENSKSVKAFSWLPFGFGARGCVGRRI
AETEMHLVLIRICQNFLLEQEKDEELVGRIRLVLIPDKSVDLKLIDRN*
>e_gw.29.150.1|Brafl1
32% to CYP27C1
92% to estExt_GenewiseH_1.C_30140|Brafl1
IAQNRWQQYGSIYKENIGPQEIVCMFDPEDVAPVLRAEGRYPRRYAFDSFYLAREIMGHKLGVFLENDEKWQQYRTVMNK
KLLRPQQAAAFTPLMDEAASNFMSYLRRKRDQGGMVTDLQAHLFRWAMESGCTAMFNQHLGLLSEDPPQLAKDFISSTMA
VLDTTNTMMTIPPKPGVKTYCTNVAPGSFLSSLELVFIMERGLKKESEEDEEIPDLVSYLLSVKLRPEEVLANIVDVLGG
AVDTTSNTMAFTMHTLARHPNIQEKLHDEVMRVAPDRQAPVTQEQVHKMPYLRGVIKEALRLYPVAYVFSRVLNHDAVVH
GYKIPAGTNLVVCPYVMGRDPNSYDDPEEFRPERWYRENSKSVKAFSWLPFGFGARGCVGRRIAETEMHLVLIRICQNFL
LEQEKDEELVGRIRLVLIPDKSVDLKLIDRN*
>e_gw.3.68.1|Brafl1
33% to CYP11A2
89% to estExt_GenewiseH_1.C_30140|Brafl1
GQEGATAKPFEAIPGPKGLPLVGTALHAAMGGWMDKFHLHMQNRWRQYGSIYKEIIGPQEIVCMFDPEDVAAVLRAEGRY
PRRHSVDSFYLAREIMGHKLGVLLENDEKWQQYRTVMNKKLLRPQQAAAFTPMMDEAASNFMSYLRRKRDQGGMVTDLQA
HLFRWAMESGCTAMFNQHLGLLSEDPPQLAKDFISCSMAILDTTNTMMTIPPKVHKALNTNAWKEHLEGWQTSFRVTKQL
IEEIMERELKKENEEDEEISDLVSYLLSVKLRPEEVLANIVDVLGGAVDTTSNTMAFTMHTLARHPDIQEKLHDEVMRVA
PDRQAPVTQEQVQKMPYLRGVIKEILRLYPVAYIFSRVLNHDAVVHGYKIPAGTNLVVCPYVMGRDPKSYDNPEEFRPER
WYRENRESVKAFSWLPFGFGARGCVGRRIAETEMHLVLIRICQNFVLEQKKDEELVGRIRLVLIPDKSVDLKLTDRN*
$$$$$$$$$
>fgenesh2_pg.scaffold_410000012|Brafl1
27% to CYP24
MQTRVKATVPTLRETGRYGVGKLHERHLDLHRQYGDICREKLLGREIVHVFSREIAQEVFMQEGRYPGRTVIEPDALYRT
TRGIPLGLLSLQDAEWHRLRRLAQDRILRPAVQSAVLPNMDRIAQEFVMRTDMLRSPGSDVMERNYKDELHLWSLEW
(gap)
KLIFSLPLYKVVPTPTWRKLAAAQDTFFRLSENYIKQVLTDSGDGDPETQDSLLLHLLRKSELSKEEVSATMTDLFQGGIDTT
TNGMMYSLFALAKNPEVQELVCQEIRTHLPEGARVTPEVLGKMKYLKAVIKETFRVCLPGCCRLWPVIFGTARQYDYDVV
LGGYDVPAKTEILVHHRVMCRQDKYFRDPLTFDPTRWLRDEKTPRVPTYLFMPFGHGVRMCIGMLNIILTIRRRFAEQQL
QLLVIRMLQRFHVECEEAELRQVFSLVLLPDRNPRFIFRRRQGETA*
>gw.501.20.1|Brafl1
30% to CYP24
LKKLHESFFERYRQFGKISKETIGNKCFVSVYDPRDIETLFRTEGPNPSWMQLMALGEVRKRLGKPLGMINETGQKWRQL
RYAAQSKLLNPKSVSSFVPVLDEISRDFVEKLRTGRSAATLEPTIDLDAELRKWSLESVVSATLGIRLGCLQKHRQIPDK
DTEDLLQSSDAFLDTWSKLELGPPLYMLYPTKTWRKFLRANELWLSAAGRMIDRSLDRSESERDPLQPEVTLLEHIVTRK
ELTPDDVVMIITELIFAGIESTAVAMTYNLYTMAKNQHVQEKVRREVNAVVGKSGKVTQDALKSLKYVKACIKETSRVLP
AFSMRNRILDKEIVLAGYRVPPNVIIRVLTHVTGQLPEYVVEPDRFAPERWLRDDTTIPKPHPFAVRPFGVGTRSCIGQR
LAEQELGILLAKV
>fgenesh2_pg.scaffold_44000117|Brafl1
87% to gw.501.20.1|Brafl1
MAFAVLMMMAAAVLPNFARSAITLIPMGSTYLPYGFDPAGAPLYGMGDRGAVEQLTYDADNYRIYTVGEARILNVIDISD
PKNAALVYQLQLPGGATDVDSCGRFVAVSIHDDFKVLPGTVLIYSMYDTTRKNMTLLHQIQVGALPDMVKFTKDCMTLVT
CNEGEPGLDESGNFVDPEGSASVIAFQSTNLGQESAPTVRTATFRKFDSLAEEYNSRGVRWTLPMIQVGSEVMEFNLSQT
LEPEYVAYNSDGSKAYIALQENNAIAVLDMATATFDDIYPLGSKYWGTASIDTSNEDGGSLVSRNLKSQRVQKAMNLTSQ
LGCAVFSSIDGLDPENPDKYSSLHLFGGRGFSVWDADDLSLVWDSGDDVERMVAKYYPTIFNSDYDEEFFNSTPAARFDH
RSCKKGPETESLAIGEVDGKTAFFVGNERSSTILVYSLADEDIITPVFQSIHFSGRTDLTWRQAYQDRVVGDIDPEDMRF
VSTRDSPTNSPLLLVAGTVSGTVSVYEVAESDDDGVSTAGKMKRAWLQHLVAKKLGADAVSIGRSGGETSTFSPPVTRY
25% to CYP27B1
RQFGKISKETIGNKTFVSVYDPRDIETLFRTEGPNPSWMQLMALGEVRKRLGKPLGMINETGQKWRQLRYAAQSKLLNPKS
VSSFVPVLDEISRDFVEKLRTGRSAATLEPTIDLDAELRKWSLESVVSATLGIRLGCLQKHRQIPDKDTEDLLQSSDAFL
DTWSKLELGPPLYMLYPTKTWRKFLR
ANELW 38% to CYP27B1
LRVLPAFSMRNRILDKEIVLSGYRVPPNVIIRVLTHVTGQLPEYVVEPD
RFAPERWLRDDTTIPKPHPFAVRPFGVGTRSCIGQRLAEQELGILLAKMIQQFHIE
CDGEMEQIFNIANKPDLSGTFKFTEL*
>Gene C 38% to CYP11 amphi, 34% to Gene E, 34% to Gene B
42% to 27B1 fugu, 38% to 27C1 fugu, 42% to 27A1 fugu (but not first exon)
37% to 11A1 fugu, 36% to CYP24 fugu (best match to CYP27B)
42% to Xenopus trop. 27B1, 41% to Xenopus laevis 27A1
MAQQILRNSSVCSLVRPNSRALVSVAPAATVQQNRPLKEMPGPTNKLGQLWWGFKNRSRMHEAQ (0)
(0)
LEQERKYGRMWQSSFGFNPNVNVAHVALAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNQ (2)
(2)
NGPEWRHLRTAVSKRIMRPKEVPR (2)
(2)
YGDSMNEVVTDMIDRFKDLRDTTGGGKTVPDLTNELYKWAME (1)
(1)
SIATVLFDTRLGCLEREMPEKTQQFIDSIATMFRTAFLVSALKPWMLTYLGLGVWKRHVEAWDVIFSV(1)
(1)
AHENIDRKVLDIDARLSRGEDLVGSFLTYMLTGTDVTKKDLYATVTELLLAGVDT (0)
(0)
TSNTMVWTLYELARHPELQERLHQEVTSVVSPGQIPTVDDVKNMALLKNVIKEILR (2)
(2)
VYPVLPANGRVLDKDIVLDGYNIPKG (0)
(0)
TQFAILHYNMTRDPEVFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCA (1)
(1)
GRRLAEMEMYLVLAR (0)
(0)
LVQTFEVRQLTPGEVVRPVTRALLVPGDPVHLEFIDRP*
>CYP27 40% to 27B1 fugu, 37% to 27C1 fugu, 40% to 27A1 fugu (but not first exon)
35% to 11A1 fugu, 34% to CYP24 fugu (best match to CYP27B)
MAQQILRNSSVCSLVRPNSRALVSVAPAATVQQNRPLKEMPGPTNKLGQLWWGFKNRSRMHEAQ (0)
(0)
LEQERKYGRMWQSSFGFNPNVNVAHVALAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNQ (2)
(2)
NGPEWRHLRTAVSKRIMRPKEVPR (2)
(2)
YGDSMNEVVTDMIDRFKDLRDTTGGGKTVPDLTNELYKWAME (1)
(1)
SIATVLFDTRLGCLEREMPEKTQQFIDSIATMFRTAFLVSALKPWMLTYLGLGVWKRHVEAWDVIFSV(1)
(1)
AHENIDRKVLDIDARLSRGEDLVGSFLTYMLTGTDVTKKDLYATVTELLLAGVDT (0)
(0)
TSNTMVWTLYELARHPELQERLHQEVTSVVSPGQIPTVDDVKNMALLKNVIKEILR (2)
(2)
VYPVLPANGRVLDKDIVLDGYNIPKG (0)
(0)
TQFAILHYNMTRDPEVFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCA (1)
(1)
GRRLAEMEMYLVLAR (0)
(0)
LVQTFEVRQLTPGEVVRPVTRALLVPGDPVHLEFIDRP*
>CYP27 fgenesh2_pg.scaffold_25000096|Brafl1
MAQQILRNSSVCSLVRPNSRALVSVAPAATVQQNRPLKEMPGPTNKLGQLWWGFKNRSRMHEAQLEQERKYGRMWQSSFG
FNPNVNVAHVALAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNHNGPEWRHLRTAVSKRIMRPKEVPRYGDSMNEV
VTDMIDRFKDLRDTTGGGKTVPDLTNELYKWAMESIATVLFDTRLGCLEREMPEKTQQFIDSIATMFKTAFLVSALKPWM
LTYLGLGVWKRHVEAWDVIFSVAHENIDRKVLDIDARLSRGEDLDGSFLTYMLTGTDVTKKDLYATVTELLLAGVDTTSN
TMVWTLYELARHPELQERLHQEVTSVVSPGQIPTVDDVKNMALLKNVIKEILRVYPVLPANGRVLDKDIVLDGYNIPKGT
QFAILHYNMTRDPEVFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCAGRRLAEMEMYLVLARLVQTFEVRQLTPGE
VVRPVTRALLVPGDPVHLEFIDRP*
>CYP27 e_gw.25.105.1|Brafl1
QLEQERKYGRMWQSSFGFNPNVNVAHVSLAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNHNGPEWRHLRTAVSKR
IMRPKEVPRYGDSMNEVVTDMITRFKDLRDTTGGGKTVPDLTNELYKWAMESIATVLFDTRLGCLEREMPEKTQQFIDSI
ATMFRTAFLVSALKPWMLTYLGLGVWKRHVEAWDVIFSVGESHENIDRKVLDIDARLSRGEDLDGSFLTYMLTGTDVTKK
DLYATVTELLLAGVDTTSNTMVWTLYELARHPELQDRLHREVTSVVSPGQIPTVDDVKNMALLKNVIKEILRVYPVLPAN
GRVLDKDIVLDGYSIPKGTQFAILHYNMTRDPEAFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCAGRRLAEMEMY
LVLARLVQTFEVRQLTPGEVVRPVTRALLVPGDPVHLEFIDRP*
CYP19 clan (2 subfamilies)
>CYP19 amphioxus 37% to CYP19 zebrafish ovarian, 38% to brain form
41% to e_gw.484.33.1, so there are two CYP19 subfamilies in Amphioxus
two possible start METs
MLQFLVIESRGSFPLNRSRTRHGITSQIEADGCS
MDTGEGWDVLLVVLLVVLVWYYIRETWTSGIDGIFPP (1)
(1)
GPPYIPLLTPLWTLWVFLHDGIWAATAGYAAKYGDFVRVWLGTEQTFIISR (2)
(2)
ASAAAHVLKSSKYRARFGDPSGLAQIGMNGSGVIFNNDVQSWKFLRFFFVK (1)
(1)
VLDRAAGVSAIATRRQLANIRDIASSNPDGAVDVVTLMRRITLEIGNRLFLGVNIEN (1)
(1)
DLEVVNTINGYFAAWEFFMIRPKVLQLIYPTLYRKHQTAV (2)
(2)
RALQDVVGKLVDKKRAVMNGDEAEEEFSIPKGEHDFAAALIQAQ (0)
(0)
EFGQVSASCVRQCVTEMLLAGPDTMSVHIYFILLHIAEHGLENGILREIREVL (1)
(1)
GDRDPTRDDLSKMVFLDHVIN ESMRARPVVTFVMRHAEEEDHVDGYVIPKG
TNVIINLVAVHQDPRHFP
EPETFDPDHFKEK (0)
(0)
VPSTQFMPFGLGVRSCVGRTIAPLQMKAVLITLLRMYQLSPSRDHQSLEVSRNLSEHPTEPGSMFLYPRLETI*
>estExt_gwp.C_90165|Brafl1
45% to CYP19
96% to assembled seq above
MFSLQECGQVSASCVRQCVTE
MLVAGPDTMSVNIYFILLHIAEHGLENGILREIREVLGDRDPTRDDLSKMVFLDHVINE
SMRARPVVTFVMRHAEEEDHVDGYVIPKGTNVIINLVAVHQDPRHFPEPETFDPDHFKEKVPSTQFMPFGLGVRSCVGRT
IAPLQMKAVLITLLRMYQLSP
>estExt_fgenesh2_pg.C_90115|Brafl1
39% to CYP19a C-term
98% to estExt_gwp.C_90165|Brafl1
MLVAGPDTMSVNIYFILLHIAEHGLESGILREIREVLDRDPTRDDLSKMVFLDHVINESMRTRPVVTFVMRHAEEEDHVD
GYVIPKGTNVIINLVAVHQDPRHFPEPETFDPDHFKEKVPSTQFMPFGLGVRSCVGRTIAPLQMKAVLITLLRMYQLSP
SRDHQSLEVSRNLSEHPTEPGSMFLYPRLETI*
>CYP19 scaffold_484 96% to first 3 exons below on e_gw.484.33.1|Brafl1
290078
MSGVMSVLTEQLQTWSAGLTCVTAVIVTGAALVLTWGGWASGRSVDVP (1) 289935
288972
GPPWLLGFGPLMSFARFIWMGVPVAAAHYGARYGDFVRVWIAGERTYVITR (2) 288820
288346
PSAPWPVLKSTNSCRRFGSRTGLRPIGMYQNGIIWNGDDGWRVLRGFFQK (1) 288197
>CYP19 e_gw.484.33.1|Brafl1 38% to CYP19 human and danio (-) strand
first exon is a guess; no frameshifts exist in e_gw.1098.5.1, so it may be correct
294278
MSGVMSVLTEQLQTWSAGLTCVTAVIVTGAALVLTWGGWASGRSVDVP 294135
293172
GPPWLLGFGPLMSFARFIWMGVPVAAAHYGARYGDFVRVWIAGERTYVITR
(2) 293020
292546
PSAAWHVLKSNNYCRRFGSRTGLSTIGMYQNGIIWNGDDGWRVLRGFFQK
(1) 292397
287888
ALNADTLNRATSAAVDATYRQMGNIAALQQKAADGKIEALDFLRRITLEVTNNLTLGVHIAD
(1) 287703
287339
PDDLVERIVRYFKAWEFFLLRPPIMYLMTPKLYWKHCQAV
(2) 287220
286970
NDLNDAIAELLTNKRQELKTAPPSDKPDFATCLLQAE
(0) 286860
286169
ERGEVSPAHVQQCVLEMLLAGTDTSSVSMYYLLVSVAENPQVELKVLEEMRDIL
(1) 286008
286565 ERGEVSPAHVQQCVLEMVL 286509 (duplicate exon 7 seq)
285823
GERDPTKADLPQLVYLEQVIKEAMRIKPVGPVIMRQAKEDDR
(2) 285695
285428
IDGIETPAGTNIILNLADMHRRQDNFPAPDDFNPQHFDNK
(0) 285309
284605
DFKGEYVPFGTGPKGCIGQFLAMIEMKAIMCTLLRKHHLRAIPGESLEGIETHWDIAQQPVNASYMYFEERN*
284387
>CYP19 e_gw.1098.5.1|Brafl1
95% to e_gw.484.33.1|Brafl1
yellow exon 9 is wrong; exon 9 is in a seq gap
49213
MSGVMYVLTEQLQAWSAGLTCVTAVIVTGAALVLTWGGWASGRSVDVP (1) 49070
48196
GPPWLLGFGPLMSFARFIWMGVPVAAAHYGARYGDFVRVWIAGERTYVITR 48044
47584
PSAAWHVLKSNNYCRRFGSRTGLSTIGMYQNGIIWNGDDGWRVLRGFFQK
47435
47273
ALNADTLNRATSAAVDATYRQMGNIAVLQQKTADGKIEALDFLRRITLEVTNNLTLGVHIAD
(1) 47088
46510
PDDLVERIVRYFKAWEFFLLRPPIMYLMTPKLYWKHCQAV
46397
46144
NDLNDAIAELLTNKRQELKTVPPSDKPDFATCLLQAE
46034
45737 ERGEVSPAHVQQCVLEM 45687 (duplicate exon 7 seq)
45639
ERGEVSPAHVQQCVLEMLLAGTDTSSVSMYYLLVSVAENPQVELKVLEEMRDIL
45478
45352
GERDPTKADLPQLVYLEQVIKEAMRIKPVGPVIMRQAKEDDR
(2) 45227
SVFITIPLLYGNVNISITLYYALTKLLTHPPLQ
44076
DFKGEYVPFGTGPKGCIGQFLAMIEMKAIMCTLLRKYHLRAIPGESLEGIETHWDIAQQPVNASYMYFEERN*
43858
$$$$$$$
CYP20 clan
>CYP20 e_gw.479.56.1|Brafl1 39% to CYP20
MLDYAIFAITFVVFLIAAVLYLYPGSNKITTIPGLEPSDPKDGNMDDVGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS
LAAPELWKQHERAFDRPPLLFKGFEPLWGTMSITYANGVDGRTRRKLYDPSFGHEAMKHYFSIFQELGQEMAKNWASMEG
DQHIPLQAHMLALTTKATTRCSFGDAFKDEKECVQFSRNFNICWCDVEERVNGSHPTEGSPREKKFQEARGKLQATIGRV
VKYRRENPPPPQEQLFIDVLIEGDLPEEQVFGDAITYMVGGFHTTANLLTWALYFIATHEEVEEKLYQELSDVLGKKGEV
TPDNIPQLVYLRQVLDETLRCAVVTPWGARYMDLDAEIGGHIVPAKTPVIHAFGVVLQDERFWPEPNKFDPERFDAENSK
GRHKLAFQPFGSAGGRKCPGYRFTYVETTVFLSILCRQFKLHLVDGQVVKPRHGLVTRPVDEIWITVTKRD*
>CYP20 estExt_GenewiseH_1.C_860218|Brafl1
88% to e_gw.479.56.1|Brafl1
MLDYAIFAITFVVFLIAAVLYLYPGSNKITTIPGLEPSDPKDGNMDDVGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS
LAAPELWKQHERAFDRPPLLFKGFEPMFGAMSITYANSVDGRTRRKLYDPSFGHEALKHYFSIFQELGQEMASKWESTKG
DQHIPLHAHMMALALKTFTRSSFGDSFKDEKECVQFGRNYGICWNDMEERIKGSHPTEGSPREKKFKEALGKLHATIARV
AKYRRENPPPPQEQLFIDVLIEGNLPEEQVLCDAMTFTVGGFHTSGNLLTWALYYIATHEEVEEKLHQELSDVLGKKGEV
TPDNISQLVYLRQVLDESLRCAVIAPWGARYMDLDAEVGGHIVPAKTPVIHAFGVVLQDERIWPEPNKFDPDRFDAENSK
GRHKLAFQPFGFAGGRKCPGYRFAYTWTSVFLSILCRQFKLHLVDGQVVKPCHGFVTRPVDEIWITVTKRD*
>CYP20 fgenesh2_pg.scaffold_86000110|Brafl1
87% to e_gw.479.56.1|Brafl1
MLDYAIFAITFVVFLIATVLYLYPGANKITTIPGLEPSDPKDGNLGDLGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS
LGAPELWKQHERIFDRPRFEPLIGAKSIQYANSVDGRTRRKLYDPSYGHNAMKHYYSIFQELGQEMAKKWESMKGDQHIP
LHAHIIALAMKAITRSSFGDAFKDEKECVQFGRNYDICWNDMEERIKGSYPTEGSPREKKFEEAKGKLHATIARVAKYRR
ENPPPPQEQLFIDVLIEGDLPEEQVLCDAMTYMVGGFHTSGNLLTWALYFIATHEEVEEKLYQELSDVLGKKGEVTPDNI
SQLVYLRQVLDESLRCAVVAPWGARYMDLDAEVGGHIVPAKTPVIHAFGVVLQDERIWPEPNKFDPERFDAENIKGRHKL
AFQPFGFAGGRKCPGYRFTYVETTVFLSILCRQFKFHLVDGQVVTPWHGLVTRPLDEIWITVTKRD*
>CYP20 e_gw.89.28.1|Brafl1
83% to e_gw.479.56.1|Brafl1
MLDYAIFAITFVVFLIATGLYLYPGPNKITTIPGLEPSDPKDGNLGDIGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS
LGAPELWKQHERIFDRPPLLFKGFEPLIGAKSIQYANGLDGRTRRKLYDPSFGHNAMKYYYSIFQELGQEMAQKWESMEG
DQHIPLRAHTIDLTMKAITRCSFGDTFKDEECLQFSRNYDICWDDINERTKGNYPVEGSPREKKFQEALGRLHTTIGRVA
KYRRENPPPPQEQLFIDLLIEGDLPEEQVRAKSHTYWTISSVMTLYHCLLLTWALYFIATHKEVEEKLYQELIDVLGKKE
DVTPDNISQLVYLRQVLDETLRCAVVGPWGARYMDLDIEIGGHIVPAKTPVIHAFGVVLQDERIWPEPNKFDPERFDAES
SKGRHKLAFQPFGFAGGRKCPGYKFSYAETSVFLSILCRQFKLHLVDGQVVTWHGIIMITRPVDEIWITVTKRD*
>CYP20 e_gw.86.147.1|Brafl1
83% to e_gw.479.56.1|Brafl1
MLDYAIFAITFVVFLIAAVLYLYPKSNKITTIPGLEPSDPKDGNLGDVGRAGALHEFLLKLHAEYGDIASFWWGQQLVVS
LGAPELWKQHERIFDRPPLLFKGFEPLIGAMSIQYANHVDGMTRRKLYDPSFGHEAMKHYYSIFQELGQEMAKKWETMEG