472 cytochrome P450 sequence pieces from Amphioxus. Very fragmentary with

two haplotypes for many genes.

From JGI Branchiostoma assembly Jan 30, 2008

Search for P450 at 1.0e-5 or less (481 results, some false positives)

 

This file has clans 7, mito, 19, 20, 26, 46, 51, 74

 

CYP7 clan (12 sequences)  includes CYP39, no CYP8 sequences found

 

$$$$$$$$$$

 

>fgenesh2_pg.scaffold_10000055|Brafl1 41% to CYP7A1

MPCSSCLAVMIKVMIPTTSTDHGWNLYPPVSLCRQGGVTPPTGWVVTPESYAILCPSWQQVFCEQYQCLSAATLLPAGLL

EAPPLFSYYSGARGVVHGGQGPVGHAPPPDSEMFSENVEYSGAITEQELWFEIYCLFQWNKTVFLFRVRGSSSSFEVTQT

NSPDSVRQALITAGLSSTRLRRAGGTCSTRDDYTYIMWKFLITLTSCFCMKRSNKVSPVEDGDVKEETAPGEEEMNRGTT

ECPVVTAQPPMSQPRSSKSSADVLAELRQDGLLPLNTRGESVAFQVPASEPDAPPRRPVKLAKLEETLQERRERVKKEPA

GSRTKLRQQLSDAANRRDEMLQNRSRKLAESSRRAKAKARAAKKERKSTAFVISSVSDTDAIVPRDSEKAQALEKRLSKR

RKRVAKRITAEDMKKQQELAAERRRRSNKVSPVEDGDVEKEPAPGEEEMVRGTTECPVVTAQPPMSQPRSSKSSADVLAE

LRQDGLLPLNTRGESVAFQVPLVKPASEPDAPPRRPVKLAKLEETLQERRERVKKEPAGSRSKLRQQLSDAANRRDEMLQ

NRSRKLAESSRRARAKARAAKKEGKSTAFVISSVSDTDAIVPRDSEKAQALEKRLSKRRKRVAKRITAEDMKKQQELAAE

RRRCHIDRLAYLSTYSK

 

MVTELLGVCLAVVLVFVLLQVTTRRRRPGEPPLEPGPLPYLGVALEFSRNPLGFITSRWKKYG

DVFTVRLAGHYTTFVLDPHSFTHAIRNSKVLDFRVFSSKIAHRAFGMPIVYGTHRDWVRADSDALYPKELQGQGLEKVTE

VMMNNLQSAMLAATDVKDKWNKGELWSFVYRIMFSASYKTLFGRHKEDEEETARLLHAMEEFQKYDKRFPEIISNVPWWL

MGQTKKRYEYLKSMVSPTELSQRGVSDFIRMRQEIYADGNLSPDEMTGFNFATMWASLSNTVPAAFWTLFYLLKDPVAMD

AVREEVNQILKETGQSLETVKEAGEMLHVTREQLNDMKCLGSAINEALRMCSASIIIRVATEDAELALESGSTFRIRKGD

RVALYPGFLHMDPEVFDDPETFKYDRFLENGMEKTTFYKNGRKLRHYLLPFGHGASMCPGRFFALNEIKQFVTIVVCYFN

MELMEKQTPPKDQSRAGLGTLAPLKECLFRYSLK*

>fgenesh2_pg.scaffold_63000051|Brafl1 38% to CYP7A danio

96% to fgenesh2_pg.scaffold_10000055|Brafl1

MVTELLGVCLAVVLVFVLLQVTTRRRRPGEPPLEPGPLPYLGVALEFSRNPLGFITSRWKKYGDVFTVRLAGHYTTFVLD

PHSFTHAIRNSKVLDFRVFSSKIAHRAFGMPIVYGTHRDWVRADSDALYPKELQGQGLEKVTEVMMTNLQSAMLAATDVK

AEWNKGELWSFVYRIMFSASYKTLFGRHKEDEEETARLLHAMEEFQKYDKRFPEIISNVPW

(gap)

CLMGQTKKRYEYLKSNTVP

AAFWTLFYLLKDPVAMDAVRAEVDQILKETGQSLETVKEAGKMIHVTREQLNDMKCLGSAINEALRMCSASIIIRVATED

AELALESGSTFRVRKGDRVALYPGFLHMDPEVFDDPETFKYDRFLENGMEKTTFYKNGRKLRHYLLPFGHGVSMCPGRFF

ALNEIKQFVTIVVCYFNMELMEKQTPPKDQSRAGLGTLAPLKECLFRYSLK*

>fgenesh2_pg.scaffold_1047000003|Brafl1

only 6 aa diffs to fgenesh2_pg.scaffold_10000055|Brafl1

MVTELLGVCLAVVLVFVLLQVTTRRRRPGEPPLEPGPLPYLGVALEFSRNPLGFITSRWRKYGDVFTVRLAGHYTTFVLD

PHSFTHAIRNS STGGTEDQRSTTCVQIR ASYKTLFGRHKEEKDETALLLHAMEEFQKYDKRFPEIISNVPWWLMGHTKKR

YEYLK

DTCRHDRQCISRHGVGSCCAPRRPIFSPLPVCKSAGQVGDTCQRSGERLAYPTSVGRRQYIFTCPCAEGLQCELF

SGYADIGTCVPVQY*

 

>estExt_fgenesh2_pg.C_4350040|Brafl1 25% to CYP8b.c, 23% to CYP7B1 human

MGSVLGTLQLLGWNNQMLKPNREEDFVEKNIGFPCRVVTGNKTVQSVFDIDLFKKEEFCFGVVGEVRKDFTEGVCPCILS

NGKIHEKNKGFLMEVIAKAGEDIPPSTALSVLSNISKWGSTPMSDFESKLTDVAADAFLPNIFGESTHFHGEEIRLYRSG

AI AVRLSIVKALTGRNLDEERRAMTSILEKIKTSERYQQLLDLGKSYGLGEKEATAQLLFPVFINGAYGLAAHLVCTFAC

LDTISAEDREELREEALAALKNHRGLTRESLEEMPKIESFVLEVLRFCPNPVFWSTIATCPTTVEYTTDSGEHTLKIEEG

ERVYASSYWALRDPAVFDKPEDFMWRRFLGPEGDALRKHHVTFHGRLTDTPAVNNHMCPGKDVSLSALKGSIAIFNTFFG

WELQEPPFWTGKKLSRGSLPDNEVKIKSFWVQHPE DLKEIFPSHFQDIVNEVDDVGDIDVLVKTKTGKYSGSGTNSNVYI

RLFDDKGHQSRELQLDVWWKDDFEKGQEGQYKLKDIKVAAPIVKIELFRDGCHPDDDWYCESVSVQLNPDNNGPTYDFPV

NRWIRQNDHVWLSPGGGEPPKDDVNPIDD*

 

>CYP7 estExt_fgenesh2_pg.C_10470002|Brafl1 40% to CYP7B1

no allele

MISGILAGCLVVLVVAILVQAVGRKRDPNEPPLESGPVPYLGVALQFAMDSLKFIRSRQKKYGDVFTVKLAGKYTTFVLD

PHSYSDVMRQHKILDFKTVGMDIVERGFGTTHFEKTGRAHVLHTADAYFPVHLQGNALDPLTNTMMGHLQTAMLADIGEAA*

 

$$$$$$$$

 

>estExt_fgenesh2_pg.C_1950037|Brafl1 27% to CYP7D1, 30% to CYP7B1

MGGVWSNTYGFIKGVTDGVHMMKPEGEHPSVVRTNPGLPVVALMNQDTIHYAINPETYKKEPYSFGPVGVSKDVLRGHCP

SMFSNDEDHRRKKALLVDAYKQGEKSLPSILFNQIKAHFGEWSRLKDVPDFEERVFHIMSETLTEALFGRKIDGQLCFTW

LNGLITEAKTWIPMPSLAWKRRQAIKAIPELLKAIETAPKYRELVQLCHTHGVEVEEGIFTILYGTLFNGCAAQTAAIVS

SVARLHTLSDAEKN EIIQTTLQVLEKHGGVSEESLGEMKTLESFILEVLRLHPPVFNYWVLARKDLVISPEKENIKVRKG

ERMLGCCFFAQRDGSVFPDPDRFRWNRFLDEQGGQKKHLFFPRGSFTEAADLNSHQCPGQDIGFFMMKTTLSVFLCYCSW

ELKDAPVWSDKPIRVGNPDDPVRLVRFNFRSEQ AGRALTQGNRLVLIRAQVCLAVWTLTHLSVSRLVLKLDATTMPRNQR

APGSGGLPVSERRTRGHEKEIEAGWERSKFNEFVSDLVSLERSLPDTRPVRCHKAQVLDNLPTTSVIICFCEEAVSTLLR

SVHSVINRSPPHLLKEIILVDDASTAAYLKEDLDTYMSKFPQVKIVHLPEREGLIRARLRGAEIATGDVLTFLDSHIECN

VGWLEPLLDRIGRNRTTVPCPSIDRINDNTFGYEAANENMRGGFNWGMKFDWVSLPPGEDDRRYQDIWSQNEIIKSPTMA

GGLFSIDRRFFWELGGYDPGFQIWGAENLEISFKDIFYALNPHVENEIANAGDVSDRKRMREQLGCKSFQWYIDHVYPEI

TIPDLRAKARGEVKNRAMSLCLDAVYGEKVGAYFCHGEGGQQSFTLRMDDKIMLRWFFSVCLAAGLPIRNHKGAFLLTKK

PCTAPEVIAWNHTKGGPLVDQKTGKCLGVVNLSPEEHLVALRPCNQQRVQDWTFQNYLVDM*

>estExt_fgenesh2_pg.C_3320046|Brafl1 27% to CYP7D1

MGGVWSNTYGFIKGVTDGVHMMKPEGEHPSVVRTNPGLPVVALMNQDTIQYALNPETYKKEPYSFGPV GVSKDVLRGHCP

SMFSNDEDHRRKKALLVDAYKQGEKSLSSILFNQIKAHFGEWSRLKDVPDFEERVFHIMSETLTEALFGRKIDGQLCFTW

LNGLITEAKTWIPMPSLAWKRRQAIKAIPELLKAIETAPKYRELVQLCHTHGVEVEEGIFTILYGTLFNGCAAQTAAIVS

SVARLHTLSDAEKNEIIQTTLQVLEKHGGVSEESLGDMKTLESFILEVLRLHPPVFNYWVLARKDLVISPEKENIKVCKG

ERMLGCCFFAQRDGSVFPDPDRFRWNRFLDEQGGQKKHLFFPRGSFMEAADLNSHQCPGQDIGFFMMKTTLSVLLCYCSW

ELKDAPVWSDKPIRVGNPDDPVRLVRFNFRSE QAGRALVNTSAKKI*

 

 

>estExt_fgenesh2_pg.C_1940045|Brafl1 27% to CYP7D1

MGGVWSDTFGFIKGLVHGPHMMKPEGEHPSVFRANPGVPAVVLLNRDTIQYAFNPETYEKEPYSFGPVCAAKDVVGGHCP

SMFSNDEDHRRKKALLIDVYKQGQKTLPSVFFSQIKAHFEEWSRLEDVPDFEERVFHITSETLTEALFGKKIDGRLCYTW

GNGIPTDFRTWIPIPPAARKRRQAVEVLPALLKAIKETPKYQELVQLCHTHGVEVEEGILTILYGTLFNGCGAQTATIIS

SVACLHTLSDAEKNEIIQTTLQVLEKRGG ISEESLSEMKTLESFILEVLRLHPPVFNYWALARKDLVISPEKENIKVCKG

ERMVGSCFWAQRDGSVFPDPDRFRWNRFLDEDEQGGQKKHLFFPRGSWTEAADLDSHYCPGQDIGFFILKVLLAVLLGYC

SWELKDAPV WSDNTFRLGNPDDPVRLARFNFRSEQAGRALGIRPDNIAPNAI*

>estExt_fgenesh2_pg.C_510020|Brafl1 30% to CYP4V6

90% to estExt_fgenesh2_pg.C_1940045|Brafl1

87% to estExt_fgenesh2_pg.C_3320046|Brafl1

87% to estExt_fgenesh2_pg.C_1950037|Brafl1

MKPKGEHPSAFRMNNGVPAVVLLTRDTIQYAFNPETYEKDPYSFGPGGVSKDVVRGHCPSMFSNDEDHRRKKALLIDVYK

RGQKTLPSVFFSQIKEHLEEWSRLEDVPDFEERVFHIMSETLTEALFGRKIDGELCFTWLNGLLTDFKTWIPIPSMSRKR

RLAIEALPALLKAIKEAPKYQELVQLCHTHGVEVEEGIFTILYGTLFNGCAAQCAAIVSSVARLHTLSDTEKNDIIQTTL

QVLEKHGG VSEESLGEMKTLESFILEVLRLHPPVFNFWCLARKDLVISPEKENIKVCKGERMVGCCFWAQRDESVFPDPD

RFRWNRFLDEDKQGGQKKHLFFPRGSWTEAPDLDSHQCPGQDIGFFMMKALLAVLLGY CSWELTAAPMWSDKTIRVGNPD

DPVRLARFNFRSEQAGRALGIRPDNIAPNAI*

 

$$$$$$$$$

 

>CYP39 amphioxus 49% to CYP39 zebrafish,  start MET not certain, 2 choices

MATTIGEHSPGDELYNAFKY

MILFSLCFAFFSWRNIVKKGRPPCMDGWIPWFGCAIDFGKAPLDFIEETKRK (0)

LGPVFTIVAAGRWMTFVTEPEDITTFFQSPNLDFQKAVQDPVSHT (1)

ASVSTESFFQHHTKIHDTIKGRLAPANLHSFCSNLWGEFKQQLEQLEHHGKDDLNTLVRR (2)

CMFAAVVNNLFGAENVPTDKDRIQEFSDIFVKYDADFEYGSQLPPFFLR (2)

EWAESKKWLLSLFSRSIANMERKETESQ (0)

TLLQSLTKMVDRPHAPNYALLMLWASQANAVP(0)

MSFWVLAMILSNEDVHAAVKKEVQDNLGSP (1)

GDEPITEEDLKKLPLLKRCIMETIRLRSPGVITRAVDKPLRIR (0)

KYIVPKGHLLMMSPYWAHRNPNFFPEPDKFLP  (0)

DRWLDADLEKNLFLDGFVGFGGGRYQCPGR (2)

WFALMEMQMLLAMMIQMFDFKLLGEVPKEVCQNFNYLISIHII*

>fgenesh2_pg.scaffold_124000018|Brafl1 45% to CYP39A1

MATTIGEHSPGDELYNAFKYMILFSLCFAFFSWRNIVKKGRPPCMDGWIPWFGCAIDFGKAPLDFIEETKRKLGPVFTIV

AAGRWMTFVTEPEDITTFFQSPNLDFQKAVQDPVSHTASVSTESFFQHHTKIHDTIKGRLAPANLHSFCSNLWGEFKQQL

EQLEHHGKDDLNTLVRRCMFAAVVNNLFGAENVPTDKDRIQEFSDIFVKYDADFEYGSQLPPFFLRSIANMER

(gap)

KETESQT

LLQSLTKMVDRPHAPNYALLMLWASQANAVPMSFWVLAMILSNEDVHAAVKKEVQDNLGSPGDEPITEEDLKKLPLLKRC

IMETIRLRSPGVITRAVDKPLRIRKYIVPKGHLLMMSPYWAHRNPNFFPEPDKFLPDRWLDADLEKNLFLDGFVGFGGGR

YQCPGRWFALMEMQMLLAMMIQMFDFKLLGEVPKESPLHVVGTQQPVGPCPVEWTKI*

>CYP39A1 fgenesh2_pg.scaffold_124000030|Brafl1 45% to N-term

MATTIGEHSPGDELYNAFKYMILFSLCFAFFSWRNIVKKGRPPCMDGWIPWFGCAIDFGKAPLDFIEETKRKLGPVFTIV

AAGRWMTFVTEPEDITTFFQSPNLDFQKAVQDPVSHTASVSTESFFQHHTKIHDTIKGRLAPANLHSFCSNLWGEFKQQL

EQLEHHGKDDLNTLVRRCMFAAVVNNLFGAENVPTDKDRIQEFSDIFVKYDADFEYGSQLPPFFLREWAESKKWLLSLFS

RSIANMERKETESQTLLQSLTKMVDRPHAPNYALLMLWASQANAVP

(gap)

KYIVPKGHLLMMSPYWAHRNPNFFPEPDKFLP

LSDLDGETRSMVEKMMYDQRQKAMGLPTSDEQKKEDVLKKFMEQHPEMDFSKAKFC*

 

Mito clan (28 sequences, some duplicates)

 

$$$$$$$$

 

>CYP11amphi mixed seq 43% to Gene C, 35% to Gene B, 34% to gene D

36% to 27B1 fugu, 38% to 11A1 fugu, 33% to CYP24 fugu, 32% to 27C1 fugu

37% to chicken CYP11A1, 39% to catfish Ictalurus punctatus 11A1

This is a probable CYP11A gene

(2) EAKPFSALPGPPSVPVLGNFLHMWWEGLLEKEKLNKNHIMFTDFFRQYGPIFR (2)

(2) LKIVNVDMVSIKDPVAVQELFRKEGKYPARIDIKPWRRYREISGKATGVFLS (2)

(2) NGKDWQKNRSIMARPMLRPKHVSTYVSNLDTVSADMIKRLRVLQARADGIEV PNISDELFKWALE (1)

(1) SICTVLFNERMGYLQDNISQDAQDFIQGIHTIFLTTNTVIFPDADVHRFLRTKPWRQSVEAWDTVFRV(1)

(1) GEKVMVRKLQEALEREERGEGEDDQPNFLAFVNSTGRLTKDEIYSNTIELMGAAIDT (0)

(0) TSNTLLWTLYELSRRPELQDRLYQEVTQVIGQDKVMTWDHLKDLHLLKAIIKETLR (2) 885

(2) MYPVVHNVSRLLQEDTVLMGYRLPAK (0)

(1) TCVVAQVYAMGRDPQLFPDPDEFKPERWLRTGEAHDEINPYSSLPFGFGPRSCL (1)

(1) GRRVAEVELQLLLAK (0)

(0) MSQQFVLSQVEPEEISSVAQPLLMPETPLHLRFVDRK*

 

>fgenesh2_pg.scaffold_28000018|Brafl1 34% to CYP27C1 98% to 11amphi above 6 aa diffs

MMSVPVISGSRQRLSAVVGRAVSPWRPQGHIRVRALVGYRSGLVGPRTVPSPVQTYSTAAVGSTSHHNDDSEAKPFSALP

GPPSVPVLGNFLHMWWEGLLEKEKLNKNHIMFTDFFRQYGPIFRLKIVNVDMVSIKDPVAVQELFRKEGKYPARIDIKPW

RRYREISGKATGVFLSNGKDWQKNRSIMARPMLRPKHVSTYVSNLDTVSADMIKRLRVLQARADGIEVPNISDELFKWAL

ESICTVLFNERMGYLQDNISQDAQDFIQGIHTIFLTTNTVIFPDADVHRFLRTKPWRQSVQAWDTVFRVGEKVMVRKLQE

ALEREERGEGEDDQPNFLAFVNSTGRLTKDEIYSNTIELMGAAIDTTSNTLLWTLYELSRRPELQDRLHQEVTQVIGQDK

VMTWDHLKDLHLLKAIIKETLRMYPVAPNVSRVLQEDTVLMGYMLPAKTCVVAQVYAMGRDPQLFPDPDEFKPERWLRTG

EAHDEINPYSSLPFGFGPRSCLGRRVAEVELQLLLAKMSQQFVLSQVEPEEISSVAQPLLMPETPLHLRFVDRK*

 

$$$$$$$$$$$

 

this block related to gene B

 

>Gene B 84% to Gene D, 35% to CYP11 amphi, 33% to Gene C

30% to CYP24 Fugu, 30% to 27A3 fugu, 27B fugu, 27C fugu, 30% to 11A fugu

in nr blast best mammal hit is CYP24 mouse, but Drosphila hits are better.

34% to 49A1 D. melanogaster

    MYQLLSAARHQGQSLFRVCRARSLAALKTTYRPQSNKAEESVTYDTAARPFEEIPGPKGLPLIGTALEYTPF(1)

(1) GQFKMITNLRESFRERTRTYGSIYRERIGPLDLVVISDPKEIEKVFRNE

    GRYPERIELASIKVYREIKKLPTGLINL (2)

(2) NGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVQNFINYVYRWALE (1)

(1) AISVVVLDKRLGCLTLGDLEPGSDAKLMIDGVNDFFDAFVKLEMSATGL YKYISTPTWRKFAKAVDQFHR (2)

(2) VAEKLLKEKLAKTTTEDGKPAESDTDFLQSLLSRNDVTFEEAMEMAVDLLSAGIDT

(0) SGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFR (2)

(2) VYPTVLNNVRRLDQDIVLSGYVVPAK (0)

(0) TTILLAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCL (1)

(1) GRRFAEQELHLGLIR (0)

(0) IVQNFHVGWAGEDMKQDNRIILAPDRDSFVFSERT*

estExt_gwp.C_8820003|Brafl1   34% to CYP24                    2134  6.9e-224  1

 

>fgenesh2_pg.scaffold_214000064|Brafl1 3 genes fused,

31% to CYP24

MYQLLSAARHQGQSLFRVCGARSLAALKTPCRPQSNKAEESVTYDTAARPFEEIPGPKGLPLIGTALEYTPFGQFKMITN

LRESFRERTRTYGSIYRERIGPLDLVVISDPKEIEKVFRNEGRYPERIELASIKVYREIKKLPTGLINLNGPEWQRVRSS

VQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVQNFINYVYRWALEAISVVVLDKRLGCLTLGDLQPGSDA

KLMIDGVNDYFASLVKLEMSATGLYKYISTPTWRKFAKAIDQWHFVAAKLLKEKLAKSATKDGKPAESDTDFLQSLLSRS

DVTFEEAMLMAVDLMAAGIDTSGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFRVYP

TVLNNVRRLDQDIVLSGYVVPAKTTILLAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRF

AEQELHLGLIR

30% to CYP24

SNKAEESVTYDTAARPFEE

IPGPKGLPLIGTALEYTPFGQFKMITNLRGSFRERTRTYGSIYRERIGPL

DLVVISDPTEIEKVFRNEGRYPERIELASIKVYREIKKLPAGLINLNGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTR

DLVDVIRALIGKEESGGQVQNFINYVYRWALEAISVVVLDKRLGCLTLGDLQPGSDAKLMIDGVNDYFASLVKLEMSATG

LYKYVSTPTWRKFAKAIDQWHLVAAKLLKEKLAKTATKDGKPAESDTDFLQSLLSRSDVTFEEAMLMAVDLMAAGIDTSG

NTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFRVYPTVLNNVRRLDRDIVLSGYVVPAK

TTILMAHDVISSLPEYYPEPEVYRPERWLRDDESSSVQPFTLLPFGYGPRMCIDPNKKVRMY

31% to CYP27A1

RLQRAVRHQGQSLFRVCG

ARSLAALKTTVTQTQSTRAEESGVYDTAARPFEEIPGPKGLPFIGTGWDYSPFGRFPIKTNFRDSFRERTRTYGSIYRER

IGPLDLVVISDPKEIGKVFRNEGKYPERPPMGSIKTYREVRKLPTGIANLNGPEWQRVRSSVQKDLMRPKTVGAYASLQD

DVTRDLVDVIRALIGREESGGQVQNFTNYVYRWALEAISVVVLDKRLGCLTLGDLEPGSDAKLMIDGVNDFFDAFVKLEM

SATGLYKYISTPTWRKFAKAVDQFHSVAEKLLKEKLAKTTTEDGKPAESDTDFLQSLLSRNDVTFEEAMEMAVDLLSAGI

DTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVRQETFRIYPTALSNMRTLDRDMVLSGYA

VPAKTIVLMAHDVISSLPEYYPEPEVYRPERWLRDDESSGVQPFTLLPFGYGPRMCIGRRFAEQELHLGLIRIVQNFHVG

WAGEDMKQVHRLILSPDRDTFVFSERT*

 

>fgenesh2_pg.scaffold_214000063|Brafl1 34% to CYP24

MSLLQRAVRQQGQSLFRVCGVRSLAALKTTYRLQSTRAEESVADDTAARPFEEIPGPKGLPLIGTALEYSPFGRFPIKTN

LRSSYRERTKIFGSIYREKIGPLDLVVISDPKEIEKVFRNEGRYPERLPLESIKAYRELKKLPAGVVNLNGPEWQRVRSS

VQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVQNFINYVYRWALEAISVVVLDKRLGCLTLDDLEPGSDA

KLMIDGVNDFFDSFVVLETSATGLYKYISTPTWRRFEKAIDQWHTVAAKLLKEKLAKGATEEGKPAESDTDFLQSLLSRN

DVTFEEAMMTVVELLAGGIDTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIGDKVLNRMHYLRAVVKETFRVYP

TVPNNLRKLDRDIVLSGYRVPAKTTVFMVDDVISSLPEYYPEPEVYRPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRF

AEQELHLGLIRIVQNFHVGWAGEDMKQVNRMVFAPDRDTFVFSERT*

 

>fgenesh2_pg.scaffold_214000062|Brafl1 two genes fused

30% to CYP27A3 31% to CYP24

MQTLFSDWTGFSAFWTGQIFPKTPHTIDDFDSGLGSQSTRAEESVAYDTAARPFEEIPGPKGLPLIGTGLDYAPFGRFPL

KTHLRESFRERTKAYGSIYREKLGPLDLVVISDPKEIEKVFRNEG

(gap)

RNGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTR

DLVDVIRALIGKEGSGGQVQNFTNFVYRWALEAISVVVLDKRLGCLTLDDLEPGSDAKLMIDGVNDFFNAAVKLELSGAG

RLYKYISTPTWRKFANAIDQWHGVAAKLLKEKLTKSAAEDGKPAESDTDFLQSLLSRNDVTFEEAMLMAVDLMAAGIDTT

GNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIDDKVLNRMHYLRAVVKETFRLCPTVGNNIRTLDRDMVLSGYVVPA

KTKIFMAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRFAEQELHLGLIRLAVRHQG

QSLLRVCGARSLAALKPTY

25% to CYP24

RLQSTRAEESVADGTAARPFEEIPGPKGLPLIGTALDYTPFGRFPLKTNFRESFRERTRTYGSIY

REKIGPRELVVISDPKDIQKVYRNEGRYPERPQVDSIKTYREMKKLPAGIVVLNGPEWQRVRSSVQKDLMRPKTVGAYAS

LQDDVTRDLVDVIRALIGKEGSGGQVHNFINYVYRWTLESIGVVVLDKRLGCLTLGDLEPGSDAQLMIGGVNDFFNAFSK

LEMSATGLYKYISTPTWRKFQKAIDQWHTVAAKLLKEKLTQSTIEDGKPAESDTDFLQSLLSRNDVTFEEAMEMALDLL

(gap, missing I-helix) 37% to 27C1

VYPTFLNNVRTLDRDIVLSGYVVPGKTIIIIGNDIISSLSEYYPEPEVYKPERWLRDDEFSSVQPFTLLPFGYGPRMCIGR

RFAEQELHLGLIRIVQNFH

VGWAGEDMKQENRMVFAPDRDTFVFSERT*

 

>fgenesh2_pg.scaffold_214000072|Brafl1 33% to CYP27A3

MATGRATSRRNGQWGATLAIREINGPEWQR

VRSSVQKDLMRPKTVGAYASLQDDVTRDLVDVIRALIGKKESGGQVQNFT

NYVYRWALEAISMVVLDKRLGCLTLNDLEPGSDAKLMIDGVNDFFDAFVKLEMSATGLYKYISTPTWRKFAKAFDQWHAV

AEKLLKEKLAKSAAEEGKPAESDTDFLQRLLSSKDITFEEAMMMAVDLMAAGIDTTGNTLMFNLFCLAKNPEAQEKLYRE

IQEVVPAGQPIDDKVLNRMHYLRAVRQETFRFYPTVLSNTRILDRDVVLSGYFVPAKTIVLMAHDVISSLPVYYPEPEVY

KPERWLRGDESSSVQPFALLPFGYGPRMCIGRRLAEQELHLGLIRIVQNFHVGWAGEDMKQNNRIILAPDRDTFVFSART*

 

>e_gw.882.7.1|Brafl1

RERTKIFGSIYREKIGPLDLVVISDPKEIEKVFRNEGRYPERLPLESIKAYRELKKLPAGVVNLNGPEWQRVRSSVQKDL

MRPKTVGAYASLQDDVTRDLVDVIRALIGKEESGGQVHNFINYVYRWALEAISVVVLDKRLGCLTLDDLEPGSDAKLMID

GVNDFFDSFVVLETSATGLYKYISTPTWRRFEKAIDQWHTVAAKLLKEKLAKSAAEDGKPAESDTNFLQSLLSRSDVTFE

EAMMTVVELLAGGIDTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAGQPIGDKVLNRMHYLRAVVKETFRVYPTVPNN

LRKLDRDIVLSGYRVPAKTTVFMVDDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRFAEQEL

HLGLIRVGSFAV*

 

>estExt_gwp.C_8820003|Brafl1 34% to CYP24

MSLLPRVVRHHGRLFNVCSARSLVTYRSQSTRAEESVAYDTAARPFEKIPGPKGLPLIGTGLDYAPFGRFPLKTHLRESF

RERTKAYGSIYREKLGPLDLVVISDPKEIEKVFRNEGRYPERVQLESVRTYREIKKLPIGVVNLNGPEWQRVRSSVQKDL

MRPKTVGAYASLQDDVTRDLVDVIRALIGKEGSGGQVQNFTNFVYRWALEAISVVVLDKRLGCLTLDDLVPGSDAKLMID

GVNDFFNAAVKLEMSGAGRLYKYISTPTWRKFANAIDQWHGVAAKLLKEKLAKSAAEEGKPAESDTDFLQSLLSRSDVTF

EEAMLMAVDLMAAGIDTTGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAEQPIDDKVLNRMHYLRAVVKETFRLCPTVGN

NIRTLDRDMVLSGYVVPAKTKIFMAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCIGRRFAEQE

LHLGLIRVSFVALFRH*

 

$$$$$$$$$

 

>Gene D 84% to gene B, 34% to CYP11 amphi, 30% to gene C

31% to CYP24 fugu

    MSLLPRVVRHHGRLFNVCSARSLVTYRSQSTRAEESVAYDTAAR

    PFEKIPGPKGLPLIGTGLDYAPF (1)

(1) GRFPIKTNLRDSYRERTKTYGSIYREKIGPRELVVISDPKDIQKVYRNE

    GRYPERPQVDSIKTYREMKKLPAGIVVL (2)

(2) NGPEWQRVRSSVQKDLMRPKTVGAYASLQDDVTRDLVDVIKALIGKEESGGQVHNFINYVYRWTLE (1)

(1) AISVVVLDKRLGCLTLGDLEPGSDAQMMIGGVNDFFNAFAKLEMSATGL YKYISTPTWRKFQKAIDQWHT (2)

(2) VAAKLLKEKLTQSTIEDGKPAESDTDFLQSLLSRNDVTFEEAMEMALDLLSAGIDT (0)

(0) TGNTLMFNLFCLAKNPEAQEKLYREIQEVVPAEQPIDDKVLNRMHYLRAVVKETFR (2)

(2) LCPTVGNNIRTLDRDMVLSGYVVPAK (0)

(0) TKIFMAHDVISSLPEYYPEPEVYKPERWLRDDESSSVQPFTLLPFGYGPRMCI (1)

(1) GRRFAEQELHLGLIR (0)

(0) IVQNFHVGWAGEDMKQVNRLVLSPDRDSFVFSARA*

 

$$$$$$$

 

>GENE F 61% TO GENES D AND B

    MSRILQIVGRRAAFTQAGLQNVPVWRPLGGRNGRGAASSAAATEQTTVQDGAARPFDEIPGPRGLPFIGTALDYSPF (1)

(1) GRFPIHTKMANSTIERYQTYGKIYREKIGLRDMVFVCDPKDIETVFRSDGRLPERPIPESIATYRRLKNKPLGVALL (2)

(2) NGEEWFRLRRSVNKDMMRPKAVGAYATMQDEVSRELVGLIQGVVRKGKTAGQVPDFTKLLYKWGLE (1)

(1) ALSLVVLGKRLGCLTLDQLPEDSDAQRMIGAVNDFFYSFAKLQMSFPLFRYIRTPGWTTFERAMDTVSS (2)

(2) ITEKMIGERLEKLRQMEEPPDEADFLTSLLSREDMNLDEAIQMSVDLLQGAIDT (0)

(0) TAHTLVFNLYCLAKNPDAQQKLYEEILEVVPPEQPIDDRVLNKMHYLRAVVKETFR (2)

(2) MYPTLLSTARTLTRDVVLSGYHVPAK (0)

(0) TNVMLAQNVISTLPEYYPEPESYIPERWLRTESSNVQSFSLLPFGYGPRMCI (1)

(1) GRRFAEQELYLGLVR (0)

(0) IIQNFHVGWDGEDMKQVWRIFNAPDRDTFVFSERKS*

 

>fgenesh2_pg.scaffold_119000067|Brafl1 CYP29% to CYP11A1

same as gene F above

MTHTGNADGSVHGIEILANGSLQDKYSLSQGDMDGPIVPVNETITADGVQRNVILVNDQFPGPTLEVMEGAQVVVTVVNE

LLREATSLHFHGMYMRGVPYMDGVPYVTQCPILPMHSFTYRFKAEPAGTHWYHSHLGSQKEDGLYGAFIVHKNSIPTTPS

LPMFLQDWWHDDFNTIDVDSAYMEHRGPGRFFGPWQERGFSFEGTELTALNFKSALINGRGRYNNNSAPLTRFEISSGET

LRFRLINAGAEYTFRVSIDAHSMTVVANDGHDVEPVHVQSILVFPGESYDFEVVGDPSNSGTYWIRAQTLWAGKGPDVEP

EDRLQEVRAILAYDNAPTDEDPNSAMQTCTENSPCRVLNCPFPAFPAGSNTECIYVSDLNSTEEYSMSDESETEEYFFNF

GYQIGSSVNGRKFDTPKKPLIFKAPYDITPCEATCETDGCKCTYMVEIPLGKTIRFVLMDLGVESEGHHLIHLHGYDFRV

LAMGFPVHNETTGRWISQNADIDCGNDNKCNMASWNVTRPNLNYNKPPIRDTVVIPARGYTVIEFRSNNPGFWYFHCHQT

THMNEGMSMIIAEALDKLPALPYGFPTCGDFTGTEKPPGRGRTVAAMEQSVTKVELDHTQLVIIIVISAAMSATIALAAV

GIYNARAKVNAFQRQVVKRSYVVCDQALGPQVLTTDKPLDTRHKPRGIMHLLNAFILPCLCVTMATTQRCTDDVCEFTLV

VRYARTMTHTERDGEVHGIEILTNGSLQDKYSLSQGDMDGPIVPVEETITADGVQRNVIVVNDQFPGPTLEVIEGAQVVV

TVVNNLLREATSLHFHGMYMRGVPYMDGVPYVTQCPILPMHSFTYRFMAEPAGTHWYHSHLGSQKEEGLYGAFIVHKNSM

PTTPSLPMFLQDWWHDDFNNIDVDSAFMEHRGPGRFFVPWQNRGFSFDGNKLSSVRFISALINGRGRYNNNSAPLTRFEI

SPGETLRFRLINAGAEYTFRVSIDAHSMTVVANDGHDVEPVQVQSILVFPGESYDFEVVGDPSNSGTYWIRAQTLWAGKG

PDVEPEDRLQEVRAILAYDNDPTDEDPNSDMQNCTENSPCRVLNCPFPAFPAGSNTECVYVSDLNSTEEYSMPDESETEE

YFFNFGYQIGSSVNGRKFATPKKPLIFKAPYDITPCEATCETDGCTCTYTTEIPLGKTIRFVLMSLGFGSGGHHVIHLHG

YDFRVLAMGFPEYNETTGRWITQNDDINCGDDNKCNMAAWNVARPNLNYNKPPTRDTVVIPARGYTVIEFRSNNPGFWLF

HCHQTTHMKEGMSMIIAEALDKLPALPYGFPTCGDFTGTEKPPGRGRTAAAMEQSVTLVELDNTQLVIIIVVSAAMSATI

ALAAVGIYNARVNKSKEKMIDTP IVGRRAAFTQAGLQNVPVWRPLGGRNGRGAASSAAATEQTTVQDGAARPFDEIPGPR

GLPFIGTALDYSPFGRFPIHTKMANSTIERYQTYGKIYREKIGLRDMVFVCDPKDIETVFRSDGRLPERPIPESIATYRR

LKNKPLGVALLNGEEWFRLRRSVNKDMMRPKAVGAYATMQDEVSRELVGLIQGVVRKGKTAGQVPDFTKLLYKWGLESLS

LVVLGKRLGCLTLDQLPEDSDAQRMIGAVNDFFYSFAKLQMSFPLFRYIRTPGWTTFERAMDTVSSITEKMIGERLEKLR

QMEEPPDEADFLTSLLSREDMNLDEAIQMSVDLLQGAIDTTAHTLVFNLYCLAKNPDAQQKLYEEILEVVPPEQPIDDRV

LNKMHYLRAVVKETFR CAINSIMARHRTLHHGHRRKLSFIIPVLLVYVLVSAFLDLTYSGYMAKHVSDGDSHQTITTTEG

TNMTKLLWEGLSRLEQMDQQRANLTEKLKNIAKMANVSEEAIGPWLSQLRPMTIVDAPAGNRTALLTCQDIAEIRISNPM

GKGVTKVVELGNYQGHGVAVKRVLPTVKDVRECKRTIERSGWNKCFVFPNYKLLKEILLLQQLKHPNIVQLLGYCVQNEE

TDENLAEHGVVSVTEMGTKFHVGRARKMDWKMRLKMAIDLASLLDYLEHSPMGSLLMADFKVEQFVWVGGKVKLTDLDDV

SNVERKCAVDSDCWVDKKDVGVPCTNGSCRGLNAKHNMNGAYKTILRHIMVHTGTEETALREDLRSVSISAASLHSRLLQ

LLDKELAIDSPTHR*

 

$$$$$$$

>Gene G 55% to amphi 11

MFLGLMRCQTPSQTYSTGPQAASHPQLDPP

AKPFSALPEPMKGLPGILKTLVVLCTGGMSRKAQLKSHVVIGQLFQMYGPILR (2)

NRFGNFDMVNICDPDAAREVFKVEGKYPERLDIAPWRLHREDAGKELAVLLG (2)

NDKKWHKNRTVVSRPMLRPQSVAAYVLKIDDVATDMLQHIRSVRAGPDGTEVLDLENELFKWALE(1)

SISAVLFNERMGLLQDNIPQDAQDFINGMHDAFDSLTRAMTPDARLHKLLNTKSWQKNKQAWDT (0)

GEKVMDRQLQRAEERQARGEADDGQLDFLWFISSREKLTKEEIYANAIELMGAAIDT (0)

TSTTLLWTLYQLCHRPDLQDKLYQEVTQVIGQDEVITYDHLKNLHLFKAVIKETLR (2)

LHPVAFAITRVIQQDTVLMGYKIPAK

TVVMVSLYDMARDPRLYKNPEEYRPERWLRGAEDYVDTHPYAYLPFGFGTRSCI (1)

GRRVAETELQVLLAK (0)

ICQQFVLKQRNPRVIPAMTKGILMPAEKMDICFIERQ*

>e_gw.241.76.1|Brafl1 33% to CYP27C1, 99% to gene G above

FGNFDMVNICDPDAAREVFKVEGKYPERLDIAPWRLHREDAGKELAVLLGNDKKWHKNRTVVSRPMLRPQSVAAYVLKID

DVATDMLQHIRSVRAGPDGTEVLDLENELFKWALESISAVLFNERMGLLQDNIPQDAQDFINGMHDAFDSLTRAMTPDAR

LHKLLNTKSWQKNKQAWDTVFKIGEKVMDRQLQRAEERQARGEADDGQLDFLWFISSREKLTKEEIYANAIELMGAAIDT

TSTTLLWTLYQLCHRPDLQDKLYQEVTQVIGQDEVITYDHLKNLHLFKAVIKETLRLHPVAFAITRVIQQDTVLMGYKIP

AKTVVMVSLYDMARDPRLYKNPEEYRPERWLRGAEDYVDTHPYAYLPFGFGTRSCIGRRVAETELQVLLAKICQQFVLKQ

RNPRVIPAMTKGILMPAEKMDICFIERQ*

>fgenesh2_pg.scaffold_140000032|Brafl1 31% to CYP11A2

90% or more to gene G above

MIRLCALTQRRSAATIVGRWLDFHRGARAASQGLLRCQTPNQPYSSGPQAASHPQLDPPVKPFSALPEPMKGMPGILKFL

VVLCTGGMSRKAQLKSHMMIGQLFQMYGPILRNRFGNFDMVNTCDPDAAREVFKVEGKYPERLDIAPWRLHREDAGKELA

VLLGNDKKWHKNRTVVSRPMLRPQSVAAYVLKIDDVATDMLQHIRSVRAGPEGTEVLDLENELFKWALESISAVLFNERM

GLLQDNIPQDAQDFINGMHDAFDSLTRAMTPDARLHKLLNTKSWQKNKQAWDTVFKIGEKVMDRQLQRAEERQAR

(gap) 37% to CYP27A.c

GEADDGQLDFLSFISSREKLTKEEIYANAIELMGAAIDT

VNSTSMSITLSQLVTDTVHE

TSTTLLWTLYQLCHRPDLQDKLYQEV

TQVIGQDEVITFDHLKNLHLFKAVIKETLRLHPVAFAITRVIQQDTILMGYEIPAKTVVMVSLYDMARDPRLYKHPEEYR

PERWLRGAEDYVDTHPYAYLPFGFGTRSCIGRRVAETELQVLLAKICQQFVLKQRNPRVIPAMTKGILMPAEKMDICFIE

RQ*

 

$$$$$

 

>fgenesh2_pg.scaffold_283000056|Brafl1 29% to CYP24

MSHILKIAGRRTAVRHQLRLPGFWRFCGRQGVRGAATTATAAEQVAPEETVRPFQE

IPGPKGLPFIGTALDYSPFGRFPIHTQLGNSAIERY

KTHGKIYREKLGPGREMVFVCDPKDIGTVFRSDGRLPERPPVNSIATYRKMRKKPPGLGNLMGEDWHR

VRSSVNKEMMRPKSVGAYATMQDDVSREMAELIQTVVRKGDSGGQVDNFMNLMHKWGLESLSLVILGKRMGCLTLDQLAE

DSDAQRMISAVLEFFLYFGKLEMSLPFYRYFSTPAWKKFETAMDTMN

(Gap)

SLLSQKDMTLDEAVMMAIELLTGAFESTANTLA

FNLYCLAKNPAAQQKLYEEIMNVVPPGQPIDDRVLNKMSYLRAVFKETSRLYPTIFFNARTLTRDVVLSGYHVP

AKIIQKFHVGWDGEDMKQIYKIFNTPDRDTFIFRERE*

>e_gw.77.176.1|Brafl1 33% to CYP24

93% to fgenesh2_pg.scaffold_283000056|Brafl1 (allele)

KTYGKIYREKLGPGREMVFVCDPRDIGTVFRSDGRLPQRPPVNSLATYRKMRKKPLGLGNLMGEDWHRVRSSVNKEMMRP

KSVGAYATMQDDVSREMAEQIQTVVRKGDSGGQVDNFMNLMHKWGLESLSLVILGKRLDCLTLDQLAEDSDAQRMISAVL

EFFLYFGKLEMSLPLYKYFNTPAWKRFVRALDTMN RYAICPIQERILTELSKLEEPPQETDFLSN LLSQKDMTLDEAVMM

AIELLTGAFESTANTLAFNLYCLAKNPAAQQKLYEEILEVVPPGQPIDDRVLNKMSYLRAVFKETSRLYPTIFFNARTLT

RDVVLSGYHVPAKTQIIMANNVISTLPEYYPDPEAYIPERWLRTESSAANVQAFALLPFGYGARMCVGRFLPVKNRSVS*

 

 

$$$$$$$$$

 

>fgenesh2_pg.scaffold_191000017|Brafl1 27% to CYP27C1

MGITGVLGRRCDAVMRSGRVFNGQWKCGRSSLRNVGLCILRKSSSTVTNVGMETCVDPTANKTDVAVRPFHEIPGPKGLP

IIGSLWEYTFLGKLDPRRFDEVLWNRYQEYGKIYKEDLGPRGTFVRIADPGDIETVYRNEGRYPHRPSFPLVRESMEAAG

QELLKHRARSESSFNGQGLEWYRTRSAVNRTLLRRSGVALFHPTLNEISDDFLTLLKRSLDENNTVPDITWQIRRHNTEV

AGTTIFGRRPGCLEPDFSGSCQTSEMIKSIDDFFASWLKLEIGFPLTKYLLKDTWNGYMNAHRNILRIVKYHMDLDVEYE

DSRPSVLGYLLSESSLSDTDAAMSAVELFVGGMQSSSHADMFQLYELARHPHVQETIRREVTEALPKGEAVTSAHLHKLP

YLKAFVKETFRFHPVGLLHMRILDRDVVLSGYRVPAHTTIEIPMSVLGRLEELYPQADRFLPERWLRRGPNGFRSRMFSH

VTPFGHGPRACIGRRLAEDKFYIQIAKLVQNFDLHCDEEVGTVTGCFQELSPTPNIRFTPR*

 

$$$$$$$$

 

>estExt_GenewiseH_1.C_30140|Brafl1 33% to 11A2, 33% to CYP27A1

MGGWMDKFHLHMQNRWRQYGSIYKENIGPQEIVCMFDPEDVAPVLRAEGRYPRRYAFDSFYLAREIMGHKLGVFLENDEK

WQQYRTVMNKKLLRPQQAAAFTPLMDEAASNFMSYLRRKRDQGGMVTDLQAHLFRWAMESGCTAMFNQHLGLLSEDPPQL

AKDFISSTMAVLDTTNTMMTIPPKVHKALNTKAWKEHLEGWQTSFRVTKQLIEEIMERGLEKESEEDEEIPDLVSYLLSV

KLRPEEVLANIVDVLGGAVDTTSNTMAFTMHTLARHPDIQEKLHDEVMRVAPDHQAPVTQEQVHKMPYLRGVIKEVLRLY

PVAYVFSRVLNHDAVVHGYKIPAGTNLVVCPYVMGRDPNSYDDPEEFRPERWYRENSKSVKAFSWLPFGFGARGCVGRRI

AETEMHLVLIRICQNFLLEQEKDEELVGRIRLVLIPDKSVDLKLIDRN*

>e_gw.29.150.1|Brafl1 32% to CYP27C1

92% to estExt_GenewiseH_1.C_30140|Brafl1

IAQNRWQQYGSIYKENIGPQEIVCMFDPEDVAPVLRAEGRYPRRYAFDSFYLAREIMGHKLGVFLENDEKWQQYRTVMNK

KLLRPQQAAAFTPLMDEAASNFMSYLRRKRDQGGMVTDLQAHLFRWAMESGCTAMFNQHLGLLSEDPPQLAKDFISSTMA

VLDTTNTMMTIPPKPGVKTYCTNVAPGSFLSSLELVFIMERGLKKESEEDEEIPDLVSYLLSVKLRPEEVLANIVDVLGG

AVDTTSNTMAFTMHTLARHPNIQEKLHDEVMRVAPDRQAPVTQEQVHKMPYLRGVIKEALRLYPVAYVFSRVLNHDAVVH

GYKIPAGTNLVVCPYVMGRDPNSYDDPEEFRPERWYRENSKSVKAFSWLPFGFGARGCVGRRIAETEMHLVLIRICQNFL

LEQEKDEELVGRIRLVLIPDKSVDLKLIDRN*

 

>e_gw.3.68.1|Brafl1 33% to CYP11A2

89% to estExt_GenewiseH_1.C_30140|Brafl1

GQEGATAKPFEAIPGPKGLPLVGTALHAAMGGWMDKFHLHMQNRWRQYGSIYKEIIGPQEIVCMFDPEDVAAVLRAEGRY

PRRHSVDSFYLAREIMGHKLGVLLENDEKWQQYRTVMNKKLLRPQQAAAFTPMMDEAASNFMSYLRRKRDQGGMVTDLQA

HLFRWAMESGCTAMFNQHLGLLSEDPPQLAKDFISCSMAILDTTNTMMTIPPKVHKALNTNAWKEHLEGWQTSFRVTKQL

IEEIMERELKKENEEDEEISDLVSYLLSVKLRPEEVLANIVDVLGGAVDTTSNTMAFTMHTLARHPDIQEKLHDEVMRVA

PDRQAPVTQEQVQKMPYLRGVIKEILRLYPVAYIFSRVLNHDAVVHGYKIPAGTNLVVCPYVMGRDPKSYDNPEEFRPER

WYRENRESVKAFSWLPFGFGARGCVGRRIAETEMHLVLIRICQNFVLEQKKDEELVGRIRLVLIPDKSVDLKLTDRN*

 

$$$$$$$$$

 

>fgenesh2_pg.scaffold_410000012|Brafl1 27% to CYP24

MQTRVKATVPTLRETGRYGVGKLHERHLDLHRQYGDICREKLLGREIVHVFSREIAQEVFMQEGRYPGRTVIEPDALYRT

TRGIPLGLLSLQDAEWHRLRRLAQDRILRPAVQSAVLPNMDRIAQEFVMRTDMLRSPGSDVMERNYKDELHLWSLEW

(gap)

KLIFSLPLYKVVPTPTWRKLAAAQDTFFRLSENYIKQVLTDSGDGDPETQDSLLLHLLRKSELSKEEVSATMTDLFQGGIDTT

TNGMMYSLFALAKNPEVQELVCQEIRTHLPEGARVTPEVLGKMKYLKAVIKETFRVCLPGCCRLWPVIFGTARQYDYDVV

LGGYDVPAKTEILVHHRVMCRQDKYFRDPLTFDPTRWLRDEKTPRVPTYLFMPFGHGVRMCIGMLNIILTIRRRFAEQQL

QLLVIRMLQRFHVECEEAELRQVFSLVLLPDRNPRFIFRRRQGETA*

 

 

>gw.501.20.1|Brafl1 30% to CYP24

LKKLHESFFERYRQFGKISKETIGNKCFVSVYDPRDIETLFRTEGPNPSWMQLMALGEVRKRLGKPLGMINETGQKWRQL

RYAAQSKLLNPKSVSSFVPVLDEISRDFVEKLRTGRSAATLEPTIDLDAELRKWSLESVVSATLGIRLGCLQKHRQIPDK

DTEDLLQSSDAFLDTWSKLELGPPLYMLYPTKTWRKFLRANELWLSAAGRMIDRSLDRSESERDPLQPEVTLLEHIVTRK

ELTPDDVVMIITELIFAGIESTAVAMTYNLYTMAKNQHVQEKVRREVNAVVGKSGKVTQDALKSLKYVKACIKETSRVLP

AFSMRNRILDKEIVLAGYRVPPNVIIRVLTHVTGQLPEYVVEPDRFAPERWLRDDTTIPKPHPFAVRPFGVGTRSCIGQR

LAEQELGILLAKV

>fgenesh2_pg.scaffold_44000117|Brafl1

87% to gw.501.20.1|Brafl1

MAFAVLMMMAAAVLPNFARSAITLIPMGSTYLPYGFDPAGAPLYGMGDRGAVEQLTYDADNYRIYTVGEARILNVIDISD

PKNAALVYQLQLPGGATDVDSCGRFVAVSIHDDFKVLPGTVLIYSMYDTTRKNMTLLHQIQVGALPDMVKFTKDCMTLVT

CNEGEPGLDESGNFVDPEGSASVIAFQSTNLGQESAPTVRTATFRKFDSLAEEYNSRGVRWTLPMIQVGSEVMEFNLSQT

LEPEYVAYNSDGSKAYIALQENNAIAVLDMATATFDDIYPLGSKYWGTASIDTSNEDGGSLVSRNLKSQRVQKAMNLTSQ

LGCAVFSSIDGLDPENPDKYSSLHLFGGRGFSVWDADDLSLVWDSGDDVERMVAKYYPTIFNSDYDEEFFNSTPAARFDH

RSCKKGPETESLAIGEVDGKTAFFVGNERSSTILVYSLADEDIITPVFQSIHFSGRTDLTWRQAYQDRVVGDIDPEDMRF

VSTRDSPTNSPLLLVAGTVSGTVSVYEVAESDDDGVSTAGKMKRAWLQHLVAKKLGADAVSIGRSGGETSTFSPPVTRY

25% to CYP27B1

RQFGKISKETIGNKTFVSVYDPRDIETLFRTEGPNPSWMQLMALGEVRKRLGKPLGMINETGQKWRQLRYAAQSKLLNPKS

VSSFVPVLDEISRDFVEKLRTGRSAATLEPTIDLDAELRKWSLESVVSATLGIRLGCLQKHRQIPDKDTEDLLQSSDAFL

DTWSKLELGPPLYMLYPTKTWRKFLR

ANELW 38% to CYP27B1

LRVLPAFSMRNRILDKEIVLSGYRVPPNVIIRVLTHVTGQLPEYVVEPD

RFAPERWLRDDTTIPKPHPFAVRPFGVGTRSCIGQRLAEQELGILLAKMIQQFHIE

CDGEMEQIFNIANKPDLSGTFKFTEL*

 

>Gene C 38% to CYP11 amphi, 34% to Gene E, 34% to Gene B

42% to 27B1 Fugu, 38% to 27C1 fugu, 42% to 27A1 fugu (but not first exon)

37% to 11A1 fugu, 36% to CYP24 fugu (Best match to CYP27B)

42% to Xenopus trop. 27B1, 41% to Xenopus laevis 27A1

    MAQQILRNSSVCSLVRPNSRALVSVAPAATVQQNRPLKEMPGPTNKLGQLWWGFKNRSRMHEAQ (0)

(0) LEQERKYGRMWQSSFGFNPNVNVAHVALAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNQ (2)

(2) NGPEWRHLRTAVSKRIMRPKEVPR (2)

(2) YGDSMNEVVTDMIDRFKDLRDTTGGGKTVPDLTNELYKWAME (1)

(1) SIATVLFDTRLGCLEREMPEKTQQFIDSIATMFRTAFLVSALKPWMLTYLGLGVWKRHVEAWDVIFSV(1)

(1) AHENIDRKVLDIDARLSRGEDLVGSFLTYMLTGTDVTKKDLYATVTELLLAGVDT (0)

(0) TSNTMVWTLYELARHPELQERLHQEVTSVVSPGQIPTVDDVKNMALLKNVIKEILR (2)

(2) VYPVLPANGRVLDKDIVLDGYNIPKG (0)

(0) TQFAILHYNMTRDPEVFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCA (1)

(1) GRRLAEMEMYLVLAR (0)

(0) LVQTFEVRQLTPGEVVRPVTRALLVPGDPVHLEFIDRP*

>CYP27 40% to 27B1 Fugu, 37% to 27C1 fugu, 40% to 27A1 fugu (but not first exon)

35% to 11A1 fugu, 34% to CYP24 fugu (Best match to CYP27B)

    MAQQILRNSSVCSLVRPNSRALVSVAPAATVQQNRPLKEMPGPTNKLGQLWWGFKNRSRMHEAQ (0)

(0) LEQERKYGRMWQSSFGFNPNVNVAHVALAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNQ (2)

(2) NGPEWRHLRTAVSKRIMRPKEVPR (2)

(2) YGDSMNEVVTDMIDRFKDLRDTTGGGKTVPDLTNELYKWAME (1)

(1) SIATVLFDTRLGCLEREMPEKTQQFIDSIATMFRTAFLVSALKPWMLTYLGLGVWKRHVEAWDVIFSV(1)

(1) AHENIDRKVLDIDARLSRGEDLVGSFLTYMLTGTDVTKKDLYATVTELLLAGVDT (0)

(0) TSNTMVWTLYELARHPELQERLHQEVTSVVSPGQIPTVDDVKNMALLKNVIKEILR (2)

(2) VYPVLPANGRVLDKDIVLDGYNIPKG (0)

(0) TQFAILHYNMTRDPEVFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCA (1)

(1) GRRLAEMEMYLVLAR (0)

(0) LVQTFEVRQLTPGEVVRPVTRALLVPGDPVHLEFIDRP*

>CYP27 fgenesh2_pg.scaffold_25000096|Brafl1

MAQQILRNSSVCSLVRPNSRALVSVAPAATVQQNRPLKEMPGPTNKLGQLWWGFKNRSRMHEAQLEQERKYGRMWQSSFG

FNPNVNVAHVALAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNHNGPEWRHLRTAVSKRIMRPKEVPRYGDSMNEV

VTDMIDRFKDLRDTTGGGKTVPDLTNELYKWAMESIATVLFDTRLGCLEREMPEKTQQFIDSIATMFKTAFLVSALKPWM

LTYLGLGVWKRHVEAWDVIFSVAHENIDRKVLDIDARLSRGEDLDGSFLTYMLTGTDVTKKDLYATVTELLLAGVDTTSN

TMVWTLYELARHPELQERLHQEVTSVVSPGQIPTVDDVKNMALLKNVIKEILRVYPVLPANGRVLDKDIVLDGYNIPKGT

QFAILHYNMTRDPEVFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCAGRRLAEMEMYLVLARLVQTFEVRQLTPGE

VVRPVTRALLVPGDPVHLEFIDRP*

>CYP27 e_gw.25.105.1|Brafl1

QLEQERKYGRMWQSSFGFNPNVNVAHVSLAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNHNGPEWRHLRTAVSKR

IMRPKEVPRYGDSMNEVVTDMITRFKDLRDTTGGGKTVPDLTNELYKWAMESIATVLFDTRLGCLEREMPEKTQQFIDSI

ATMFRTAFLVSALKPWMLTYLGLGVWKRHVEAWDVIFSVGESHENIDRKVLDIDARLSRGEDLDGSFLTYMLTGTDVTKK

DLYATVTELLLAGVDTTSNTMVWTLYELARHPELQDRLHREVTSVVSPGQIPTVDDVKNMALLKNVIKEILRVYPVLPAN

GRVLDKDIVLDGYSIPKGTQFAILHYNMTRDPEAFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCAGRRLAEMEMY

LVLARLVQTFEVRQLTPGEVVRPVTRALLVPGDPVHLEFIDRP*

 

CYP19 clan (2 subfamilies)

 

>CYP19 amphioxus 37% to CYP19 zebrafish ovarian, 38% to brain form

41% to e_gw.484.33.1 so there are two CYP19 subfamilies in Amphioxus

two possible start METs

    MLQFLVIESRGSFPLNRSRTRHGITSQIEADGCS

    MDTGEGWDVLLVVLLVVLVWYYIRETWTSGIDGIFPP (1)

(1) GPPYIPLLTPLWTLWVFLHDGIWAATAGYAAKYGDFVRVWLGTEQTFIISR (2)

(2) ASAAAHVLKSSKYRARFGDPSGLAQIGMNGSGVIFNNDVQSWKFLRFFFVK (1)

(1) VLDRAAGVSAIATRRQLANIRDIASSNPDGAVDVVTLMRRITLEIGNRLFLGVNIEN (1)

(1) DLEVVNTINGYFAAWEFFMIRPKVLQLIYPTLYRKHQTAV (2)

(2) RALQDVVGKLVDKKRAVMNGDEAEEEFSIPKGEHDFAAALIQAQ (0)

(0) EFGQVSASCVRQCVTEMLLAGPDTMSVHIYFILLHIAEHGLENGILREIREVL (1)

(1) GDRDPTRDDLSKMVFLDHVIN ESMRARPVVTFVMRHAEEEDHVDGYVIPKG

    TNVIINLVAVHQDPRHFP EPETFDPDHFKEK (0)

(0) VPSTQFMPFGLGVRSCVGRTIAPLQMKAVLITLLRMYQLSPSRDHQSLEVSRNLSEHPTEPGSMFLYPRLETI*

 

 

>estExt_gwp.C_90165|Brafl1 45% to CYP19

96% to assembled seq above

MFSLQECGQVSASCVRQCVTE

MLVAGPDTMSVNIYFILLHIAEHGLENGILREIREVLGDRDPTRDDLSKMVFLDHVINE

SMRARPVVTFVMRHAEEEDHVDGYVIPKGTNVIINLVAVHQDPRHFPEPETFDPDHFKEKVPSTQFMPFGLGVRSCVGRT

IAPLQMKAVLITLLRMYQLSP

>estExt_fgenesh2_pg.C_90115|Brafl1 39% to CYP19a C-term

98% to estExt_gwp.C_90165|Brafl1

MLVAGPDTMSVNIYFILLHIAEHGLESGILREIREVLDRDPTRDDLSKMVFLDHVINESMRTRPVVTFVMRHAEEEDHVD

GYVIPKGTNVIINLVAVHQDPRHFPEPETFDPDHFKEKVPSTQFMPFGLGVRSCVGRTIAPLQMKAVLITLLRMYQLSP

SRDHQSLEVSRNLSEHPTEPGSMFLYPRLETI*

 

>CYP19 scaffold_484 96% to first 3 exons below on e_gw.484.33.1|Brafl1

290078 MSGVMSVLTEQLQTWSAGLTCVTAVIVTGAALVLTWGGWASGRSVDVP (1) 289935

288972 GPPWLLGFGPLMSFARFIWMGVPVAAAHYGARYGDFVRVWIAGERTYVITR (2) 288820

288346 PSAPWPVLKSTNSCRRFGSRTGLRPIGMYQNGIIWNGDDGWRVLRGFFQK (1) 288197

 

>CYP19 e_gw.484.33.1|Brafl1 38% to CYP19 human and danio (-) strand

first exon is a guess, no frameshifts exist in e_gw.1098.5.1 so it may be correct

294278 MSGVMSVLTEQLQTWSAGLTCVTAVIVTGAALVLTWGGWASGRSVDVP 294135

293172 GPPWLLGFGPLMSFARFIWMGVPVAAAHYGARYGDFVRVWIAGERTYVITR (2) 293020

292546 PSAAWHVLKSNNYCRRFGSRTGLSTIGMYQNGIIWNGDDGWRVLRGFFQK (1) 292397

287888 ALNADTLNRATSAAVDATYRQMGNIAALQQKAADGKIEALDFLRRITLEVTNNLTLGVHIAD (1) 287703

287339 PDDLVERIVRYFKAWEFFLLRPPIMYLMTPKLYWKHCQAV (2) 287220

286970 NDLNDAIAELLTNKRQELKTAPPSDKPDFATCLLQAE (0) 286860

286169 ERGEVSPAHVQQCVLEMLLAGTDTSSVSMYYLLVSVAENPQVELKVLEEMRDIL (1) 286008

286565 ERGEVSPAHVQQCVLEMVL 286509 (duplicate exon 7 seq)

285823 GERDPTKADLPQLVYLEQVIKEAMRIKPVGPVIMRQAKEDDR (2) 285695

285428 IDGIETPAGTNIILNLADMHRRQDNFPAPDDFNPQHFDNK (0) 285309

284605 DFKGEYVPFGTGPKGCIGQFLAMIEMKAIMCTLLRKHHLRAIPGESLEGIETHWDIAQQPVNASYMYFEERN* 284387

 

>CYP19 e_gw.1098.5.1|Brafl1

95% to e_gw.484.33.1|Brafl1 yellow exon 9 is wrong, exon 9 is in a seq gap

49213 MSGVMYVLTEQLQAWSAGLTCVTAVIVTGAALVLTWGGWASGRSVDVP (1) 49070

48196 GPPWLLGFGPLMSFARFIWMGVPVAAAHYGARYGDFVRVWIAGERTYVITR 48044

47584 PSAAWHVLKSNNYCRRFGSRTGLSTIGMYQNGIIWNGDDGWRVLRGFFQK 47435

47273 ALNADTLNRATSAAVDATYRQMGNIAVLQQKTADGKIEALDFLRRITLEVTNNLTLGVHIAD (1) 47088

46510 PDDLVERIVRYFKAWEFFLLRPPIMYLMTPKLYWKHCQAV 46397

46144 NDLNDAIAELLTNKRQELKTVPPSDKPDFATCLLQAE 46034

45737 ERGEVSPAHVQQCVLEM 45687 (duplicate exon 7 seq)

45639 ERGEVSPAHVQQCVLEMLLAGTDTSSVSMYYLLVSVAENPQVELKVLEEMRDIL 45478

45352 GERDPTKADLPQLVYLEQVIKEAMRIKPVGPVIMRQAKEDDR (2) 45227

      SVFITIPLLYGNVNISITLYYALTKLLTHPPLQ

44076 DFKGEYVPFGTGPKGCIGQFLAMIEMKAIMCTLLRKYHLRAIPGESLEGIETHWDIAQQPVNASYMYFEERN* 43858

 

$$$$$$$

 

CYP20 clan

 

>CYP20 e_gw.479.56.1|Brafl1 39% to CYP20

MLDYAIFAITFVVFLIAAVLYLYPGSNKITTIPGLEPSDPKDGNMDDVGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS

LAAPELWKQHERAFDRPPLLFKGFEPLWGTMSITYANGVDGRTRRKLYDPSFGHEAMKHYFSIFQELGQEMAKNWASMEG

DQHIPLQAHMLALTTKATTRCSFGDAFKDEKECVQFSRNFNICWCDVEERVNGSHPTEGSPREKKFQEARGKLQATIGRV

VKYRRENPPPPQEQLFIDVLIEGDLPEEQVFGDAITYMVGGFHTTANLLTWALYFIATHEEVEEKLYQELSDVLGKKGEV

TPDNIPQLVYLRQVLDETLRCAVVTPWGARYMDLDAEIGGHIVPAKTPVIHAFGVVLQDERFWPEPNKFDPERFDAENSK

GRHKLAFQPFGSAGGRKCPGYRFTYVETTVFLSILCRQFKLHLVDGQVVKPRHGLVTRPVDEIWITVTKRD*

>CYP20 estExt_GenewiseH_1.C_860218|Brafl1

88% to e_gw.479.56.1|Brafl1

MLDYAIFAITFVVFLIAAVLYLYPGSNKITTIPGLEPSDPKDGNMDDVGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS

LAAPELWKQHERAFDRPPLLFKGFEPMFGAMSITYANSVDGRTRRKLYDPSFGHEALKHYFSIFQELGQEMASKWESTKG

DQHIPLHAHMMALALKTFTRSSFGDSFKDEKECVQFGRNYGICWNDMEERIKGSHPTEGSPREKKFKEALGKLHATIARV

AKYRRENPPPPQEQLFIDVLIEGNLPEEQVLCDAMTFTVGGFHTSGNLLTWALYYIATHEEVEEKLHQELSDVLGKKGEV

TPDNISQLVYLRQVLDESLRCAVIAPWGARYMDLDAEVGGHIVPAKTPVIHAFGVVLQDERIWPEPNKFDPDRFDAENSK

GRHKLAFQPFGFAGGRKCPGYRFAYTWTSVFLSILCRQFKLHLVDGQVVKPCHGFVTRPVDEIWITVTKRD*

>CYP20 fgenesh2_pg.scaffold_86000110|Brafl1

87% to e_gw.479.56.1|Brafl1

MLDYAIFAITFVVFLIATVLYLYPGANKITTIPGLEPSDPKDGNLGDLGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS

LGAPELWKQHERIFDRPRFEPLIGAKSIQYANSVDGRTRRKLYDPSYGHNAMKHYYSIFQELGQEMAKKWESMKGDQHIP

LHAHIIALAMKAITRSSFGDAFKDEKECVQFGRNYDICWNDMEERIKGSYPTEGSPREKKFEEAKGKLHATIARVAKYRR

ENPPPPQEQLFIDVLIEGDLPEEQVLCDAMTYMVGGFHTSGNLLTWALYFIATHEEVEEKLYQELSDVLGKKGEVTPDNI

SQLVYLRQVLDESLRCAVVAPWGARYMDLDAEVGGHIVPAKTPVIHAFGVVLQDERIWPEPNKFDPERFDAENIKGRHKL

AFQPFGFAGGRKCPGYRFTYVETTVFLSILCRQFKFHLVDGQVVTPWHGLVTRPLDEIWITVTKRD*

>CYP20 e_gw.89.28.1|Brafl1

83% to e_gw.479.56.1|Brafl1

MLDYAIFAITFVVFLIATGLYLYPGPNKITTIPGLEPSDPKDGNLGDIGRAGSLHEFLLKLHAEYGDIASFWWGQQLVVS

LGAPELWKQHERIFDRPPLLFKGFEPLIGAKSIQYANGLDGRTRRKLYDPSFGHNAMKYYYSIFQELGQEMAQKWESMEG

DQHIPLRAHTIDLTMKAITRCSFGDTFKDEECLQFSRNYDICWDDINERTKGNYPVEGSPREKKFQEALGRLHTTIGRVA

KYRRENPPPPQEQLFIDLLIEGDLPEEQVRAKSHTYWTISSVMTLYHCLLLTWALYFIATHKEVEEKLYQELIDVLGKKE

DVTPDNISQLVYLRQVLDETLRCAVVGPWGARYMDLDIEIGGHIVPAKTPVIHAFGVVLQDERIWPEPNKFDPERFDAES

SKGRHKLAFQPFGFAGGRKCPGYKFSYAETSVFLSILCRQFKLHLVDGQVVTWHGIIMITRPVDEIWITVTKRD*

>CYP20 e_gw.86.147.1|Brafl1

83% to e_gw.479.56.1|Brafl1

MLDYAIFAITFVVFLIAAVLYLYPKSNKITTIPGLEPSDPKDGNLGDVGRAGALHEFLLKLHAEYGDIASFWWGQQLVVS

LGAPELWKQHERIFDRPPLLFKGFEPLIGAMSIQYANHVDGMTRRKLYDPSFGHEAMKHYYSIFQELGQEMAKKWETMEG