Rat
cytochrome P450s
108 Rat P450 sequences. This is a beginning of a revision of
the rat P450s.
I am currently looking for more
members in the 7 gene clusters seen in mouse.
The April 1, 2004 Nature issue
on the rat genome had a figure showing 84 P450s
in the rat on a tree
diagram. I am looking for these
and more. There are some
major nomenclature problems due
to naming the rat genes for the closest match in
the database, usually a mouse gene. This will not work if there is not an
orthologous relationship. Many of the names in Genbank will need
to be changed.
The 4F gene cluster appears to
be conserved with all 9 functional genes occurring
in the same order and
orientation as in the mouse 4f cluster.
4F5 is the ortholog of
4f16, 4F4 is the ortholog of
4f15, 4F1 is the ortholog of 4f14 and 4F6 is the ortholog
of 4f13. The other new rat genes (4F39, 4F17,
4F37, 4F40 and 4F18) will be named
for their ortholog in the
mouse. The pseudogenes are not
conserved.
Gene order and orientation(+/-)
is: 4F39+, 4F17+, 4F5/4f16+, 4F37+, 4F40+,
4F4/4f15+, 4F1/4f14-,
4F6/4f13-, 4F18+
The CYP2ABFGST cluster has 14
full length genes, one complete pseudogene with
a few splice site errors
(2B16P) and 9 small pseudogene fragments.
The gene order is:
2S1-, 2B1+, 2B2+, 2B3+, 2B16P+,
2B14P+, 2B21-, 2B12+, 2BNEW+, 2B15+, 2G1+,
2A3+, 2ANEW+, 2A2+, 2F4+, 2T1+
Only 2S1 and 2B21 are oriented
opposite to the cluster major orientation (+).
2b23 in mouse is also (-) and
these two appear to be in orthologous locations, so
the orientation may be
preserved. In the mouse 2a22 is
oriented opposite to the
other genes, but it was on a
small contig that might be incorrectly oriented.
The rat has three genes between
2B21 and 2G1. The mouse has 2b19
in this
location, so the rat may have
expanded the 2b19 gene to three genes.
If we assume
this is correct, there is a
reasonable orthologous relationship of genes in the rat and mouse clusters.
2S1/2s1, 2B1/2b10, 2B2/2b13,
2B3/2b9, 2B21/2b23, 2B12/2b19, 2BNEW/2b19,
2B15/2b19, 2G1/2g1, 2A3/2a5,
2ANEW/2a22, 2A2/2a12, 2F4/2f2, 2T1/2t4.
Last modified Oct. 12, 2004
D. Nelson
>CYP1A1
X00469
MPSVYGFPAFTSATELLLAVTTFCLGFWVVRVTRTWVPKGLKSP
PGPWGLPFMGHVLTLGKNPHLSLTKLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVK
QGDDFKGRPDLYSFTLIANGQSMTFNPDSGPLWAARRRLAQNALKSFSIASDPTLASS
CYLEEHVSKEAEYLISKFQKLMAEVGHFDPFKYLVVSVANVICAICFGRRYDHDDQEL
LSIVNLSNEFGEVTGSGYPADFIPILRYLPNSSLDAFKDLNKKFYSFMKKLIKEHYRT
FEKGHIRDITDSLIEHCQDRRLDENANVQLSDDKVITIVFDLFGAGFDTITTAISWSL
MYLVTNPRIQRKIQEELDTVIGRDRQPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH
STIRDTSLNGFYIPKGHCVFVNQWQVNHDQELWGDPNEFRPERFLTSSGTLDKHLSEK
VILFGLGKRKCIGETIGRLEVFLFLAILLQQMEFNVSPGEKVDMTPAYGLTLKHARCE
HFQVQMRSSGPQHLQA
>CYP1A2
K02422
MAFSQYISLAPELLLATAIFCLVFWVLRGTRTQVPKGLKSPPGP
WGLPFIGHMLTLGKNPHLSLTKLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVKQGD
DFKGRPDLYSFTLITNGKSMTFNPDSGPVWAARRRLAQDALKSFSIASDPTSVSSCYL
EEHVSKEANHLISKFQKLMAEVGHFEPVNQVVESVANVIGAMCFGKNFPRKSEEMLNL
VKSSKDFVENVTSGNAVDFFPVLRYLPNPALKRFKNFNDNFVLSLQKTVQEHYQDFNK
NSIQDITGALFKHSENYKDNGGLIPQEKIVNIVNDIFGAGFETVTTAIFWSILLLVTE
PKVQRKIHEELDTVIGRDRQPRLSDRPQLPYLEAFILEIYRYTSFVPFTIPHSTTRDT
SLNGFHIPKECCIFINQWQVNHDEKQWKDPFVFRPERFLTNDNTAIDKTLSEKVMLFG
LGKRRCIGEIPAKWEVFLFLAILLHQLEFTVPPGVKVDLTPSYGLTMKPRTCEHVQAW
PRFSK
>CYP1B1
U09540
MATSLSADSPQQLSSLSTQQTILLLLVSVLAIVHLGQWLLRQWR
RKPWSSPPGPFPWPLIGNAASVGRASHLYFARLARRYGDVFQIRLGSCPVVVLNGESA
IHQALVQQGGVFADRPPFASFRVVSGGRSLAFGHYSERWKERRRAAYGTMRAFSTRHP
RSRGLLEGHALGEARELVAVLVRRCAGGACLDPTQPIIVAVANVMSAVCFGCRYNHDD
AEFLELLSHNEEFGRTVGAGSLVDVMPWLQLFPNPVRTIFREFEQINRNFSNFVLDKF
LRHRESLVPGAAPRDMMDAFILSAEKKATGDPGDSPSGLDLEDVPATITDIFGASQDT
LSTALLWLLILFTRYPDVQARVQAELDQVVGRDRLPCMSDQPNLPYVMAFLYESMRFT
SFLPVTLPHATTANTFVLGYYIPKNTVVFVNQWSVNHDPAKWSNPEDFDPARFLDKDG
FINKALASSVMIFSVGKRRCIGEELSKTLLFLFISILAHQCNFKANQNEPSNMSFSYG
LSIKPKSFKIHVSLRESMKLLDSAVEKLQAEEACQ
>CYP2A1-de2b
exon 2 pseudogene Chr1 (-) only 240bp from Cyp2a22 ortholog start Met
82084718
YNAVKEALVDQAEGFSGQGEQA 82084653
>CYP2A1
NP_036824 88% T0 2A2 chr1 (+) Cyp2a22 ortholog
82084958
MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGNYLQLNTKDVYSSITQ 82085134
82085434
LSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGEQATYNTLFKGY 82085595
82088031
GVAFSSGERAKQLRRLSIATLRDFGVGKRGVEERILEEAGYLIKMLQGTC 82088180
82088398
GAPIDPTIYLSKTVSNVISSIVFGERFDYEDTEFLSLLQMMGQMNRFAASPTG 82088556
82089778
QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82089957
82093158
EKNGNSEFHMKNLVMTTLSLFFAGSETVSSTLRYGFLLLMKHPDVE 82093295
82093737
AKVHEEIEQVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82093925
82094440
ATDVFPILGSLMTDPKFFPSPKDFDPQNFLDDKGQLKKNAAFLPFST 82094580
82098022
GKRFCLGDGLAKMELFLLLTTILQNFRFKFPMKLEDINESPKPLGFTRIIPKYTMSFMPI 82098201
>CYP2A2-de2b
exon 2 pseudogene Chr1 (-)
82115528
LKPHWVVVLYEWDAVKEALGDQAEELSG*GEQANL 82115445
>CYP2A2
J04187 Cyp2a12 ortholog
82117349
MLDTGLLLVVILASLSVMFLVSLWQQKIRERLPPGPTPLPFIGNYLQLNMKDVYSSITQ 82117525
82117991
LSERYGPVFTIHLGPRRIVVLYGYDAVKEALVDQAEEFSGRGELPTFNILFKGY 82118152
82123228
GFSLSNVEQAKRIRRFTIATLRDFGVGKRDVQECILEEAGYLIKTLQGTC 82123377
82123595
GAPIDPSIYLSKTVSNVINSIVFGNRFDYEDKEFLSLLEMIDEMNIFAASATG 82123753
82124978
QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82125157
82139054
EKYVNSEFHMNNLVMSSLGLLFAGTGSVSSTLYHGFLLLMKHPDVE 82139191
82139607
AKVHEEIERVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82139795
82140311
GTDVFPIIGSLMTEPKFFPNHKDFNPQHFLDDKGQLKKNAAFLPFSI 82140451
82141451
GKRFCLGDSLAKMELFLLLTTILQNFRFKFPMNLEDINEYPSPIGFTRIIPNYTMSFMPI 82141630
>CYP2A3
J02852 NM_012542 exon 4 in a seq gap in genome seq chr1 (+) Cyp2a5 ortholog
82023007
MLASGLLLVASVAFLSVLVLMSVWKQRKLSGKLPPGPTPLPFIGNYLQLNTEKMYSSLMK 82023186
82023453
ISQRYGPVFTIHLGPRRVVVLCGQEAVKEALVDQAEEFSGRGEQATFDWLFKGY 82023614
82024296
GVAFSSGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIESFRKTN 82024445
GALIDPTFYLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTG
82026488
QLYEMFSSVMKHLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMLE 82026667
82028068
EKKNPNTEFYMKNLVLTTLNLFFAGTETVSTTLRYGFLLLMKHPDIE 82028208
82028659
AKVHEEIDRVIGRNRQAKYEDRMKMPYTEAVIHEIQRFADMIPMGLARRVTKDTKFREFLLPK
82028847
82029417
GTEVFPMLGSVLKDPKFFSNPNDFNPKHFLDDKGQFKKSDAFVPFSI 82029557
82030741
GKRYCFGEGLARMELFLFLTNIMQNFCFKSPQAPQDIDVSPRLVGFATIPPNYTMSFLSR 82030920
>CYP2A3-de1b
exon 1 pseudogene Chr1 (+)
82052140
MLGSRLLLVAVLSCLCVMVFMPVWQQQYRDTIPPG 82052244
>CYP2B3-se1[9]
exon 9 100% match to 2B3 chr1 (+)
81263180
GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR* 81263362
>CYP2B3-se2[1]
duplicate exon 1 100% match Chr1 (-)
81308557
MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ 81308387
>CYP2B1
J00719 Rn.91353 chr1 (+) 1 aa diff to CYP2B1
81344956
MEPSILLLLALLVGFLLLLVRGHPKSRGNFPPGPRPLPLLGNLLQLDRGGLLNSFMQ 81345126
81357886
LREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY 81358047
81358200
GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQ 81358349
81360925
GAPLDPTFLFQCITANIICSIVFGERFDYTDRQFLRLLELFYRTFSLLSSFSS 81361083
81361768
QVFEFFSGFLKYFPGAHRQISKNLQEILDYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81361947
81362389
EKSNHHTVFHHENLMISLLSLFFAGTETSSTTLRYGFLLMLKYPHVA
(1) 81362529
81363958
EKVQKEIDQVIGSHRLPTLDDRSKMPYTDAVIHEIQRFSDLVPIGVPHRVTKDTMFRGYLLPK 81364146
81364315
NTEVYPILSSALHDPQYFDHPDSFNPEHFLDANGALKKSEAFMPFST 81364455
81368014
GKRICLGEGIARNELFLFFTTILQNFSVSSHLAPKDIDLTPKESGIGKIPPTYQICFSAR 81368193
>CYP2B2
J00720 Rn.91353 chr1 (+) 4aa diffs with CYP2B2 14aa diffs to CYP2B1
81423536
MEPSILLLLALLVGFLLLLVRGHPKSRGNFPPGPRPLPLLGNLLQLDRGGLLNSFMQ (0) 81423706
81426789
FREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY
(1) 81426950
81427104
GVFFANGERWKALRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQ
(1) 81427253
81429793
GAPLDPTFLFQCITANIICSIVFGERFDYTDRQFLRLLELFYRTFSLLSSFSSQ 81429954
81430659
VFEFFSGFLKYFPGAHRQISKNLQEILDYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81430835
81431274
EKSNHHTEFHHENLMISLLSLFFAGTETGSTTLRYGFLLMLKYPHVT (1) 81431414
81432829
EKVQKEIDQVIGSHRPPSLDDRTKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK
81433017
81433190
NTEVYPILSSALHDPQYFDHPDTFNPEHFLDADGTLKKSEAFMPFST (1) 81433330
81436959
GKRICLGEGIARNELFLFFTTILQNFSVSSHLAPKDIDLTPKESGIAKIPPTYQICFSAR 81437138
>CYP2B3
M20406 chr1 (+) exon 9 not adjacent to this gene. Found at 81263180-81263359
81486567
MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ (0) 81486737
81514647
LQEKHGDVFTVYFGPRPVVMLCGTQTIREALVDHAEAFSGRGIIAVLQPIMQEY (1) 81514808
81514950
GVSFVNEERWKILRRLFVATMRDFGIGKQSVEDQIKEEAKCLVEELKNHQ (1) 81515099
81516395
GVSLDPTFLFQCVTGNIICSIVFGERFDYRDRQFLRLLDLLYRTFSLISSFSSQ (0) 81516556
81530756
MFEVYSDFLKYFPGVHREIYKNLKEVLDYIDHSVENHRATLDPNAPRDFIDTFLLHMEK (0) 81530932
81531383
EKLNHYTEFHHWNLMISVLFLFLAGTESTSNTLCYGFLLMLKYPHVA (1) 81531523
81536877
EKVQKEIDQVIGSQRVPTLDDRSKMPYTEAVIHEIQRFSDVSPMGLPCRITKDTLFRGYLLPK (0) 81537065
81537233
NTEVYFILSSALHDPQYFEQPDTFNPEHFLDANGALKKCEAFMPFSI (1) 81537373
GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR
>CYP2B32P
pseudogene partial Chr1 (+)
81806528
VLLLLTLIVGFLLFLVSQSQPKTHGHLPPGLCPLPFLGNLLQIKRRGLLNSFMQ 81806689
81808348
AQEKYGDVLTVHPGPRPVVRLCGTDTIREFLFDQAGTFSGQGTVAVLNPVVHGY 81808509
exon
3 missing
81809871
GVPLIPTSFFQRIAANIICSIVFGECFDYKDHQFLHLLDLIYQTFALMAPCPARS 81810035
81810759
VFQLFSGFLKYFPGVHKQISKNLQEILNYIGHSVEKHMATLDPSAPRDFINTYLLHMEN 81810935
81811666
EKSNHHTEFHHQTSVLSHFFDGTETTSTTLCCSFLIMLKYHHVK 81811797
>CYP2B12-de9b
exon 9 Chr1 (-)
81829155
GKFICLGEGIG*NESFIFFTGILQNLSLASPVAPENIDLTPIKSGAGKIPSTYQIHILSR 81829012
>CYP2B12
X63545, S48369, NM_017156 Rn.108913 chr1 (+) 87% to 2b19 possible ortholog
81858238
MEFGVLLLLTLTVGFLLFLVSQSQPKTHGHLPPGPRPLPFLGNLLQMNRRGFLNSFMQ 81858411
81860089
LQEKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGY 81860250
81860393
GVIFATGERWKTLRRFSLVTMKEFGMGKRSVDERIKEEAQCLVEELKKYK 81860542
81860739
GAPLNPTFLFQSIAANTICSIVFGERFDYKDHQFLHLLDLVYKTSVLMGSLSSQ 81860900
81861646
VFELYSGFLKYFPGAHKQIFKNLQEMLNYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81861822
81862554
EKSNHHTEFNHQNLVISVLSLFFAGTETTSTTLRCTFLIMLKYPHVA 81862694
81864745
EKVQKEIDQVIGSHRLPTPDDRTKMPYTDAVIHEIQRFADLTPIGLPHRVTKDTVFRGYLLPK 81864933
81865086
NTEVYPILSSALHDPRYFEQPDTFNPEHFLDANGALKKSEAFLPFST 81865226
81868929
GKRICLGEGIARNELFIFFTAILQNFTLASPVAPEDIDLTPINIGVGKIPSPYQINFLSR 81869108
>CYP2B14P
U33540 exon 1 add Chr1 (+) exons 7,8,9 72% to 2B21 to this pesudogene
81706300
MKPNVLLLLAILLSFLLFLVRGHAKVHGHLPPGPRPLPILGNLLQMDRGGLLQSF 81706464
81728276
EKVQKEIGEVTGSHWFPILYSSKIPNTEAVIPEIQR 81728383
81728385
FSDLSSVVLPQRVTKDTFFQGFLLHK 81728462
81728634
NTEVYPILSSVLHDPQ 81728681
81728681
VLEYPVTFNPEHFLDANGALKKNEAFTPFSR 81728773
>CYP2B21
AF159245 Chr1 (-)
81765226
MDPSVLLLFALFTGFLLLLIRGQGNGYGHLPPGPCPLPLLGNVLQMDRRGLLKSFIQ 81765056
81759108
LRDKYGDVVTVHLGPRPIVMLYGTETIREALVDHAEAFSGRGTVAVVQPIIQDY 81758947
81758804
GMIFANGERWKILRRFSLATMRDFGMGKRSVEERIKEEAQCLVEELKKYK 81758655
81757889
GAPLDPTFHLQCITANIICSIVFGERFDYTDHQFLHLLDLFYEILSLVSSFSSQ 81757728
81749057
VFELFPGFLKYFPGTHRHISKNIEEILNFIGHCVEKHRATLDPSTPRDFIDTYLLRMEK
81748881
81748412
EKLNHHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVA 81748272
81747109
EKVQKEIDQVIGSHRVPTLDDRIKMPYTDAVIHEIQRFSDLVPIGLPHRVTKDTLFRGYLLPK 81746921
81746748
NIEVYPILSSALHDPQYFEHPDTFNPEHFLDANGALKKNEAFLPFST 81746608
81736831
GKRVCLGEGIARNELFLFFTTILQNFSVSSPVSPKDIDLTPKESGFAKIPPTYQICFLSRQLG 81736643
>CYP2B31
86% to 2b19 possible ortholog
81918041
MELGVFLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81918214
81919826
LQEKYGDVFTVHLGPRPVVILCGTDTMREALVDQAEAFSGRGTVAVLHPVVQGY 81919987
81920130
GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK 81920279
81922129
GALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ 81922290
81923031
VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFIDTYLLHMEK 81923207
81923977
EKSNHHTEFHHQNLVISVLSLFFAGTETTSTTLRYSFLIMLKYPHVA 81924117
81926113
EKVQKEIDQVISSHRLPTLDDRIKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK 81926301
81926476
NTEVYPILSSALHDPRYFDHPDTFNPEHFLDANGTLKKSEAFLPFST 81926616
81930286
GKRTCLGEGIARNELFIFFTALLQNFSLASPVAPEDIDLTPINSGAGKIPSPYQINFLSR 81930465
>CYP2B15
D17343 to D17349 86% to 2b19 exons 2-4 in a seq gap in the genome
seq
Chr1 (+)
81945068
MELGVLLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81945241
LQEKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGY
GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK
ALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ
81950148
VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFINTYLLRMEK 81950324
81951073
EKSNHHTEFHHQNLVISVLSLFFTGTETTSTTLRYSFLIMLKYPHVA 81951213
81953132
EKVQKEIDQVIGSHRLPTLDDRTKMPYTDAVIHEIQRFADLIPIGLPHRVTNDTMFLGYLLPK 81953320
81953491
NTEVYPILSSALHDPRYFDHPDTFNPEHFLDVNGTLKKSEAFLPFST 81953631
81957185
GKRICLGEGIAQNELFIFFTAILQNFSLASPVAPEDIDLSPINSGISKIPSPYQIHFLSRCVG 81957373
>CYP2B16P
U33541 to U33546 bad boundary introns 1,5,7 chr1 (+)
81633949
MEPSVLLLLAVLLSFLLLLVRGHAKIHGRLPPGPCPVPLLGNLLQMDRRGLLKSFIQ (?) 81634119
81641847
LQEKYGDVFTVHLGLRPVVVLCGTQTIREALVDHAEAFSGRGTIAGLEPVFQDY (1) 81642008
81642149
GIFFSSGEQWKTLRRFSMATMRDFGMRKKSVEERIKEESQCLVEELKKYQ (1) 81642298
81642886
GAPLDPTFLFQCITSNIICSIVFGECFDYTDHQFLHLLDLMYQTFSLLSSIFSQ (0) 81643047
81645234
VFELFPGVLKYFPGAHRQISRNLHEILDFIGQSVEKHRATLDPNAPRDFIYTYLLHMEK (?) 81645410
81645864
QKSNHYTEFHHWNLLSSVLSLFFAGTETSSTTLRYGFLIMLKYPHIT (1) 81646004
81654168
EKVQKEIDCVIGSHRLPTLDDRSKMPYTEAVIHEIQRFSDLAPIGTPHRVIKDTIFRGYLLPK (?) 81654356
81654524
QNTEVFPILSSVLHDPQYFEQPDIFNLQHFLDANGALKIIEAFLPFST (1) 81654667
81659608
GKRICLGESIARNELFLFFTTILQNFSVSSPVAPKDIDLTPKESGIGRIPQVYQICFLAH* 81659781
>CYP2C6v1_v1-de1b2b3b4b5b
upstream pseudogene 96% identical to seq c
93%
identical to seq upstream of CYP2C6v2 allele (temp name = CYP2Cnewb)
243935799
MDLVMLLVLTLSCLIFLSIWRQSSGRGKLP 243935888
243935888
SGPTPLPIIGNFFHLDLKNITQSLTN 243935965
243937699
FSKVNGSVFTLYFGMKPIVILHGYEAIKEGLIDHGEEFTERGSFPVAEKINKGL 243937860
243938035
GIAFSHGNRWKEIRRFTLMTLQNLGMGKKSIEDRVQEESRCLV 243938163
243939079
GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLVEKLNENIKIVSSPWI* 243939231
243940291
FCSSFPVFIDYCPGSHMTLAKNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLINWKQ 243940467
>CYP2C6_v1
M13711 two aa changes to match many ESTs (lower case mi) due to frameshift
97%
to 2C77 and 2C6v2
243955584
MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS 243955751
243964779
FSKVYGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKD 243964937
243965112
LGIVFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTN 243965264
243966104
GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 243966265
243967336
FCSFFPVLIDYCPGSHTTLAKNVYHIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 243967512
243984646
ENHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKCPEVT 243984786
243989157
AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAmiHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 243989345
243990948
GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 243991088
243992245
GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL 243992424
>CYP2C6P
M18336 J03509 M18774 an alternate splice version of 2C6
exon
8 is skipped and replaced by a cryptic exon just past the true exon 8
The
GT boundary of the true exon 8 are the first two nucleotides of CYP2C6_v3
Cryptic
exon 8
MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTSFSKV 200
201
YGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDLGIVFSHGNRW 380
381
KEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTNGSPCDPTFILGCAPCNVICS 560
561
IIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQFCSFFPVLIDYCPGSHTTLAKNVYHI 740
741
RNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQENHNPHSEFTLENLSITVTDLFGAGTE 920
921
TTSTTLRYALLLLLKCPEVTAKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFID 1100
1101
LIPTNLPHAVTCDIKFRNYLIPK 1169
CYP2C6_v2
CK224594.1 CK224593.1 note: the _v2 means alternative splice version 2
CYP2C6_v3
CK224595.1 CK224596.1 (3 nuc shorter at the joint uses the second AG)
Beginning of exon 7 AGCTAAAG
TCCAGGAAGA GATTGATCGT 243989183
GTGGTTGGCA
AACATCGCAG CCCTTGCATG CAGGACAGGA GCCGCATGCC CTACACAGAT 243989243
GCCATGATTC
ATGAGGTCCA GAGGTTCATT GACCTCATTC CTACCAACCT GCCACATGCG 243989303
GTGACCTGTG
ACATTAAGTT CAGGAACTAC CTAATACCCA AG GT end of exon 7
Beginning
of cryptic exon out of frame agcaggtaa
tagaaactca 243991103
tttccatggt
tccagtgaca tgcagaaccg tggggactta gagtgtgact ctacatgtgc 243991163
tgatagcttg
catctgcatg ataaggagca taattttcat tgtgtatgca ctgtcctgga 243991223
tatgaccacc
ttctttatca gggt end of cryptic exon
normal
exon 9
1328
GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL
>CYP2C6v2-de1b2b3b4b4c5b
upstream pseudogene
EST
CK224599.1 = 100% match with 4 frameshifts) so this is a real gene
clone_lib="RALIUNN03
Sprague-Dawley rat female liver
The
CYP2C6_v1 sequence is also seen in this same mRNA library
This
GNOMON prediction adds two
upstream exons that do not belong to this gene
58596732 MDLVMLLVLTLSCLILLSIWRQSSGRGKHP
58596643 exon 1 frameshift
58596643 SGPTPLPIIGNFFHLDLNNITQSLTS (0)
58596566 exon 1
58594823 FSKVNGSVFTLYFGMKLIVILHGYAATKEGLIDHGEEFTKRGSFPVAEKINKGL (1) exon 2 58594662
58594487 GIAFSHGNRWKEIRRFTLMTLQNLGMGKESIEDRVQEETQCLV*ELRKTN
(1) exon 3 58594338
58593451
GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58593296
58592013
GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58591858
58590797
FCSSFPVFIDYCLGSHMTLA 58590738
58590736
NVYHTRNYILKKIKEHQESLDVTNPHDFIDYDLIKWKQ 58590620
AVSIKRNS
>CYP2C6v2
allele not in figure, 13 aa diffs to CYP2C6_v1 XM_215255 NW_047916
we
are assigning this allele status but it may be a separate gene
(temp
name = CYP2Cnewb)
58578624
MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 58578457
58576741
FSKVYGPVFTLYFGLKPTVILHGYEAVKEALIDHGEEFAERGSFPVVEKINKDL (1) 58576583
58576405
GIAFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDHVQEEARCLVEELRKTN 58576256
58575415
GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKVLSSPWTQ 58575254
58574189
FCSFFPVLIDYCPGSHTTLAKNIYYIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 58574013
58554666
ESHNPHLEFTLENLSVTVTDLFGAGTETTSTTLRYALLLLLKYPEVT 58554526
58534931
AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 58534743
58533131
GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 58532991
58531833
GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 58531654
>CYP2C7
M18335 exons 1,2,3 and 6 are in sequence gaps 93% to 2C7 variant and 2C81
the
yellow labels are from a random Chr1 piece that is similar to the CYP2C7 N-term
differences
with the published 2C7 sequence M18335 are in cyan
MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK
FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMNENVTKGF
GIVFSNGNRWKEMRRFTIMNFRNLGIGKRNIEDRVQEEAQCLVEELRKTK
243849546
GSPCDPSLILNCAPCNVICSITFQNYFDYKDKEMLTFMEKVNENLKIMSSPWMQ
243849385
243847566
VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 243847390
243829444
GSPCDPSLILNCAPCNVICSITFQNHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243829283
this
duplicate exon 4 is not in the right sequence order
ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT
243803857
AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFINFVPTNLPHAVTCDIKFRNYLIPK 243803669
243800623
GTKVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 243800483
243799465
GKRACVGEGLARMQLFLFLTTILQNFNLKSLVHPKDIDTMPVLNGFASLPPTYQLCFIPS 243799286
>CYP2C7 variant unmapped 93% to 2C7
88% to 2C81
3463873
MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK 3464040
3479907
FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF
3480068
3480234 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 3480383
3489182 GSPCDPSLILNCAPCNVICSITFQSHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 3489343
3491162 VCNSFPSLVDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 3491338
3505354
ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 3505494
3406504 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK
3406692
3408304 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 3408444
3409602 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFPGFASLPPFYELCFIPS 3409778
New
frags on the plus strand between 2C7 and 2C6
>CYP2C79-se1[9]
frag q Exon 9 100% to 2C79
243885148
GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI* 243885330
>CYP2C-se6[9]
frag p exon 9 100% to CYP2C82P-de9b
243895387
GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 243895497
>seq
upstream of 2C11
>CYP26A1
AF439720, NM_130408 Chr1 1Mb upstream of CYP2C cluster
242138769
MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPLPPG
TMGFPFFGETLQMVLQ (0) 242138581
242138389
RRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKK (0) 242138165
242137906
VIMQAFNREALQCYVPVIAEEVSGCLEQWLSCGERGLLVYPEV
KRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR
(0) 242137616
242137537
GVKPRNLIHARIEENIRAKIRRLQAAERNAGCKDALQLLIEHSWERGERLDMQ
(0) 242137379
242136717
ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSK (0) 242136583
242136000
GLLCKSHHEDKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELN (0) 242135848
242135595
GYQIPKGWNVIYSICDTHDVADSFTNKEEFNPDRFTSLHPEDTSRFSFIPFGGGLRSCRSKEFAKI
LLKIFTVELARRCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFQGDI* 242135254
>CYP26C1
XM_217935 94% TO 26C1 MOUSE Chr1 1Mb upstream of CYP2C cluster
242151281
MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKG
SMGWPFFGETLHWLVQ (0) 242151079
242150553
GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0) 242150422
242149883
VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVAVYQAAKALTFRMAAR
ILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK
(0) 242149608
242148160
GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0) 242148005
242146368
ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP
DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) 242146051
242144220
GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYI
PFGGGARSCLGQELAQAVLQLLAVELVRTARWELATPAFPVMQTVPIVHPVD
GLLLLFHPLPTLGAGDGSPF* 242143843
>CYP2C11
J02657 72% to CYP2C6_v1
243377899
MDPVLVLVLTLSSLLLLSLWRQSFGRGKLPPGPTPLPIIGNTLQIYMKDIGQSIKK 243378066
243379842
FSKVYGPIFTLYLGMKPFVVLHGYEAVKEALVDLGEEFSGRGSFPVSERVNKGL 243380003
243380160
GVIFSNGMQWKEIRRFSIMTLRTFGMGKRTIEDRIQEEAQCLVEELRKSK 243380309
GAPFDPTFILGCAPCNVICSIIFQNRFDYKDPTFLNLMHRFNENFRLFSSPWLQVCNT
FPAIIDYFPGSHNQVLKNFFYIKNYVLEKVKEHQESLDKDNPRDFIDCFLNKMEQEKH
NPQSEFTLESLVATVTDMFGAGTETTSTTLRYGLLLLLKHVDVTAKVQEEIERVIGRN
RSPCMKDRSQMPYTDAVVHEIQRYIDLVPTNLPHLVTRDIKFRNYFIPKGTNVIVSLS
SILHDDKEFPNPEKFDPGHFLDERGNFKKSDYFMPFSA
243416959
GKRICAGEALARTELFLFFTTILQNFNLKSLVDVKDIDTTPAISGFGHLPPFYEACFIPVQRADSLSSHL*
243417171
>CYP2C24
92% to 2C80, M86678 has alternative splice first exon
no
ESTs have this splice
CK481568.1
matches exons 1,2,3,4
CO565602.1
matched the end of the gene sequence and extends it a little 6 aa
Used
this EST to blast the trace files to find the end of exon 7
MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN CK481568.1 exon
1
QLSCSRKFGLTCGPEAQ
243522306 FTDKLTAKCHSSVSLHIDLPGNLL
243522235 yellow region not P450 seq.
243522073
FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL 243521912
243521366
GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN
243521217
243518830
GSLCDPTFILSCAPSNVICSVVFHNRFDYKDENFLNLMEKLNENFKILNSPWMQ 243518669
VCNALPAFIDYLPGSHNRVIKNFAEI
676
677
KSYILRRVKEHQETLDMDNPRDFIDCFLIKMEQEKHNPRTEFTIEILMATVSDVFVAGSE 856
857
TTSTTLRYGLLLLLKHIEVT
AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRY 1030
gnl|ti|132779224
rts18e73.g from trace files for exon 7
AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK
>CYP2C80
XM_217906.2 GNOMON exon 2 on AC109577.4 in HTGS 92% to 2C24, 73% to 2C11
MGWLSDP
wrong N-term from GNOMON prediction (temp name = CYP2CNEWC)
Correct
N-term possibly in a sequence gap
244632544
FSEVYGPVFTLYFGLKPTVVVYGYEVVKEVLDGEEFSGRGVFPIVTKVNNDL 244632389
this exon 2 does
not match 2C24
244632205
GVIFSNGTKWKELRRFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 244632056
244628281
GSLCDPTFILSCAPSNVICSVIFHNRFDYKDENFLNLMEKFNENFKILNSPWMQ 244628120
244624041
VCNAIPAFIDYLPGSHNKVIKNFAEIKSYILRRVKEHQETLDMDNPRDFIDCFLIKIE 244623868
244620080
QEKHNPCTEFTIQSLVATVTDVFVAGSETTSTTLRYGLLLLLKHTEVT 244619937
244619006
AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK 244618818
244616897
GTDLITSLTSVLHDDKEFPNPEVFDPGHFLDEHGNFKRSDYFMPFSS 244616757
244614348
GKRMCVGEALARMELFLLLTTIVQNFNLKSFVATKDIDTTPLTNTFGCVPPSYQLYFTPR* 244614166
>2C80
EST no ESTs have this splice
CK481568.1
matches second exon
MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN
FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL
GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN
GSLCDPTFILSCAPS
>CYP2C79
XM_219933 minus strand 72% to 2C6_v1 95% to seq e, 100% to seq q (exon 9),
93%
to seq z (exon 5) (temp name = CYP2CNEWD)
244590183
MILGVFLGLFLTCLLLLSLWKQNFQRRNLPPGPTPLPIIGNILQIDLKDISKSLRN 244590016
244575990
FSKVYGPVFTLYFGRKPAVVLHGYEAVKEALIDHGEEFAGRGIFPVAEKFNKNC 244575829
244575612
GVVFSSGRTWKEMRRFSLMTLRNFGMGKRSIEDRVQEEARCLVDELRKTN 244575463
244553851
GVPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLALIDILNENVEILSSPWIQ 244553690
244525726
ICNNFPAIIDYLPGRHRKLLKNFAFAKHYFLAKVIQHQESLDINNPRDFIDCFLIKMEQ 244525550
244524359
EKHNPKTEFTCENLIFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT 244524219
244517844
AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQRYIDLLPTSLPHALTCDMKFRDYFIPK 244517656
244516177
GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDENGKVKKSDYFFPFST 244516037
244496745
GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI 244496566
>CYP2C79-de9b
exon 9 62% to 2C79 2 aa diffs to seq d and seq p minus strand
244491372
G*WICVREDLAQMTLFLFCPTILKNFNLNSQVNPKEL 244491262
interval
between 2C79 and 2C6
>CYP2C6-se1[1:2:3:2:3]
frag n exons 1,2,3 2C6 like pseudogene plus strand exon 2,3 100% to seq m
244044941
MDHTTGTYTLSLILLSL*RQSSGRGKIPPGPTPLPIIDNLLQLDIKNVTQYLAN (0) 244045102
244050420
LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244050581
244050793
XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244050873
frag
m Exons 2,3 2C6 like pseudogene 100% to seq n
244052306
LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244052467
244052679
XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244052759
>CYP2C7-se2[2:3]
frag k exons 2,3 = 100% to 2C7 variant, 2 aa diffs to 2C7
exons
2,3,6,7,9 (6,7 and 9 have 1 aa diff to 2C7)
244064158
FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF 244064319
244064485
GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 244064634
>CYP2C7-se1[6:7:9]
frag j exons 6,7,9 (6,7 and 9 have 1 aa diff to 2C7)
244103321
ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 244103461
244120225
AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFIDFVPTNLPHAVTCDIKFRNYLIPK 244120413
244124319
FLXXXLQNFNLKSLXHPKDIDTMPVLNXXASLPPTYQLCFIPS 244124447
>CYP2C13-se1[6]
frag h 72% to 2C13 exon 6 plus strand 100% to seq s
70%
to 2C12 exon 6 h
244165142
ENGNQQMNYTQEHLATMVTDLL 244165207
244165209
FGGRETLNSTMRFAFLFLMKYPYTT 244165284
>CYP2C22-se1[8]
frag g exon 8 72% to 2C22 minus
strand
244201638
KFDHGNFLDDR 244201606
244201606
GNFK*NDYFMAFLA 244201565
>CYP2C13-se3[1:2:3:2:3:]
frag f Exons 1,2,3,2,3 exon 1 =
66% to 2C13 Minus Strand
exons
2,3 = 57% to 2C13
two
identical copies of exons 2,3 100% to seq v exons 2,3
244215468
SQSFLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 244215328
244214467
FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 244214306
244214137
GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213988
244213484
R*FS*RGWFSIFGKFSKVQ 244213428
244213259
GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213110
>CYP2C82P
frag e Exons 1,4,4,5,6,7,8,9 almost an exact duplicate of seqs w,x,y,z,
exons
6-9 of the wxyz cluster in a seq gap Plus Strand
244218695
MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLINE (0) 244218865
244233879
LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 244234019
244240189
GVPCDPTFILGCAPCNVICSIVFQNHFNYKGQEFLALIDTLNENVEILSSPWIQ 244240350
244265531
ICNNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ 244265707
244266904
KHNPKTEFTCKNLIFTASDLFAAGTETTSPTLRYSLLLLPKYPEV 244267038
244273480
AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQ*YIDLLPTSLPHALTCDMKFRDYFIPK 244273668
244275197
GTTVIASLTSVLYDDKEFPNPEKFDLSHFLDENGKFKKSDYFFPFST 244275337
244286429
GKRICVGEGLAQTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIP 244286605
>CYP2C82P-de9b
frag d Exon 9 identical to seq p
244289962
GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 244290072
>CYP2C77-de1b2b3b4b5b
frag c Pseudogene 96% to 2C6_v1 exons 1-5 with partial deletion of exon 3 Plus
Strand
244337898
MDLVMLLVLTLSCLILLSIWSQSSGRGKLP 244337987
244337987
SGPTPLPIIGNFFHLDLKNITQSLTS 244338064
244339793
FSKVNGSVFTLYFGMKPIVILHGYEAIK*GLIDHREEFTERGSFPVAEKINKGL 244339954
244340129
GIAFSHGNRWKEIRRFTLMTLQNLGMGK 244340212
244341157
GSPCDPTFILGCAPCNVICSIIFQNSFDYKDQDFLSLMEKLNENIKIVSSPWI* 244341318
244342872
FCSSFPVFIDYCPGIHMTLA 244342931
244342933
KNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLIKWKQ 244343049
>CYP2C77
variant of 2C6 13 aa diffs to CYP2C6_v1, 16 aa diffs to 2C6v2
This
gene has three frameshifts
244357850
MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 244358017
244359760
FSKVYGPVFTLYFGMKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDL (1) 244359921
244360096
GIIFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEE 244360230
244360232
MRKTN 244360246
244361085
GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 244361246
244362321
FCSFFPVLIDYCPGSHTTLAKNVYHIRNYL 244362410
244362412
LKKIKEHQESLDVTNPQDFIDYYLIKWKQ 244362498
244381928
ESHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKYPEIT 244382068
244392235
AKVQEEIDRVFGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPM 244392423
244394012
GTTIITSLSSVLHDSKEFPNPEIFDPGHFLDGNGKFKKSDYFMPFSA 244394152
244395307
GKRMFAGEGLA 244395339
244395341
RMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 244395487
>CYP82P-se[1:4:4:5]
frag z Exon 5 minus strand 1 aa diff to seq e
243632036
ICDNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ (0) 243631860
frag
y Exon 4 minus strand 92% to seq e
243654367
GVPCDPTFILGCAPCNVICSIVFQNHFNYKDQEFLALIE 243654251
243654249
LNENVEILSSP*IQ 243654208
frag
x exon 4 minus strand 100% to seq e short exon 4
243659542
LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 243659402
frag
w Exon 1 minus strand 100% to seq e
243675609
MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLIN 243675442
>CYP2C13-se4[1:2:3]
frag v Exon 1 (+) 59% to 2C13
243678671
FLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 243678802
Exon
2 (+) 48% to 2C79
243679647
FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 243679808
Exon
3 (+) 100% to seq f
243679977
GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 243680126
>CYP2C7-se4[8:9]
frag u Exon 8 minus strand exon 8 = 87% to frag 2, 8+9 = 63% to 2C7
243726168
GVMVITSLSSALHDNKEFPNPKRFDPG*FLDRNGNFKKTDYFILFSA 243726028
Exon
9 minus strand 60% to 2C7
243723025
CVGEGLTPIELFLFLTRILQNFNLKHLTHTEAVDTTPVLSRLTSVSPALKLFFIP 243722861
>CYP2C7-se3[8]
frag t Exon 8 minus strand 82% to 2C7
243749788
TIVIT*LTSVLHDSKKFPNPEMLDSGHFLDENGNFKKSEYFMPFSA 243749651
>CYP2C13-se2[6:7]
frag s Exons 6-7 minus strand 72% to 2C12 exon 6 100% to seq h
243766431
ENGNQQMNYTQEHLATMVTDLL 243766366
243766364
FGGRETLNSTMRFAFLFLMKYPYTT 243766290
243760156
XQINEEIGQVIWRHHSPSMLDWSHMIYTNAMVHEVQRYIDLAPNGVVCEVNCDTKYPRDYFIPK 243759968
>CYP2C7-de7b
frag r Exon 7 (+) 100% to seq a CYP2C81-de7b
243792966
RVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 243793151
>CYP2C81
93% to 2C7 28 aa diffs missing exon 1 Plus Strand, 91% to seq j (exons 6,7)
93%
to seq k (exons 2,3)
244672079
FSKTYGPVFTLYLGSQPTVILHGYEAIKKALIDHGEKFSGRGSYPMIENVTKGF 244672240
244672408
GIAFSNGNRWKEIRRFTIMTLRNLGMGKRNIEDRVQEEAQRLVEELRKTK 244672557
244681144
GSPCDPSFILNCAPCNVICSITFQNHFDYKDKEILTFMEKVNENVKIMSSPRMQ 244681305
244683123
VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVEYYLIKQKQ 244683299
244699290
ANHIEQSEYSHENLACSIMDLIGAGTETMSSTLRYALLLLMKYPHVP 244699430
244713313
AKVQEEIDHVIGRYRSPCMQDRSHMPYTDAMIHEVQRFINFVPTNLLHAVTCDIKFRNYLIPK 244713501
244717457
GTKVLTSLTSVLHGSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 244717597
244718606
GKRACVGEGLARMELFLFLTTILQNFKLKSLVHPKDIDTRPVLNGFASLPPTYQFCFIPS 244718785
>CYP2C81-de7b
frag a Exon 7 minus Strand 100% to seq r, 80% to 2C13
244724629
LRVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 244724441
>CYP2C81-de8b
frag 1 Exon 8 93% to 2C7 Plus Strand
244737232
GTTVLTSLTSVLHDSKEFPNPEMFDPGHFLDENRNFKKSDYFMPFSA 244737372
>CYP2C81-de8c
frag 2 Exon 8 76% to 2C13 Plus Strand 87% to seq u
244764239
GMMVITSLSSVLHYNKEFPNPERFDPGYFLDGNGNFKKTDYFILFSA 244764379
>CYP2C81-de1d
frag 3 Exon 1 with frameshift Plus
Strand 85% to seq e 83% TO SEQ w
244783632
MDLVVVL 244783652
244783654
CSVSSLLLFSLWRQSSWRRKLPPGPNPLPIIGNFLQIDLNNLCQSLNN (0) 244783797
>CYP2C81-de6e7e
frag 4 exon 6 70% to 2C13 Plus Strand
244799349
ELELEHLGSMVTDLFFAGIESIRTTMIFALLFLLNTHTSQ 244799468
exon
7 82% to 2C13, 86% to seq r and seq a
244801583
LQNRSHMPYTNAMVHEVQRYSDIVPNNIVHEVTSDTKFRNYFIPK (0) 244801717
>CYP2C81-de1f2f3f
frag 5 Exons 1,2,3 84% to 2C7 variant Minus Strand
244826982
MDLVTFLVLTLSSLILLSLWR*NSRRRKLPPGPTPLLIIGNFLQLDVKNVSQSLTM (0) 244826815
244813456
FSKAYGPVFTLYLGSQPTVILHGYEAVKETLIDHGEEFSGRGSFPMVEKAFKCF 244813295
244813129
GIVFSNGNR*KEIRQFIIMTLQNLGMGKRNIEDHVQEEAQCLVEELRKTK 244812980
very
large gap 244845025-245223024 378kb
>CYP2C13v1
100% first 5 exons
Note
this seq also on 100.0%
Un ++ 17276272 17282257
Exons
6-9 are on
99.1% Un ++ 17323193
17358099 2 aa diffs to 2C13 J02861
CYP2C12
is also on this same contig 99.6% Un
++ 17388090 17446950 2 aa diffs
Minus
Strand HSPs:
245246208
MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN (0) 245246041
245244920
FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ (1) 245244759
245244599
GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN 245244450
245240888
GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ (0) 245240727
245239607
VFNIFPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ 245239431
>CYP2C13-de1b2b
frag 7 Exon 1 76% to 2C13 Minus Strand
245307855
MDPIVVLVLSLSCLLFLSLWRNNSRRGKLPPGPTPLPIIRNYLQLDMKDIC*SLTK (0) 245307688
frag
6 Exon 2 83% to 2C13 Minus Strand
245292652
FSKTYGPVYTLYFGSQPTVLLYGYEALKEALIDYGEAFSGRGRIPIHEKVSKGQ 245292491
>CYP2C22-se2[1:2]
frag 9 Exon 1 61% to 2C22 Minus Strand
245347583
MDLFIILWICFACLSLFFLWNQLHYKEKLPPGPVPLPIVGNILQVNIKSIIKSLNI (0) 245347416
frag
8 Exon 2 79% to 2C22 Minus Strand
245334622
LAKEYGPVFTVYLGMKPTVVLHGHKALKEALIDRANEFSVKMQSSLLSKESQGL (1) 245334461
>CYP2C12
J03786 80% to 2C13
MDPFVVLVLSLSFLLLLYLWRPSPGRGKLPPGPTPLPIFGNFLQ
IDMKDIRQSISNFSKTYGPVFTLYFGSQPTVVLHGYEAVKEALIDYGEEFSGRGRMPV
FEKATKGLGISFSRGNVWRATRHFTVNTLRSLGMGKRTIEIKVQEEAEWLVMELKKTK
GSPCDPKFIIGCAPCNVICSIIFQNRFDYKDKDFLSLIENVNEYIKIVSTPAFQVFNA
FPILLDYCPGNHKTHSKHFAAIKSYLLKKIKEHEESLDVSNPRDFIDYFLIQRCQENG
NQQMNYTQEHLAILVTNLFIGGTETSSLTLRFALLLLMKYPHITDKVQEEIGQVIGRH
RSPCMLDRIHMPYTNAMIHEVQRYIDLAPNGLLHEVTCDTKFRDYFIPKGTAVLTSLT
SVLHARKEFPNPEMFDPGHFLDENGNFKKSDYFMPFSAGKRKCVGEGLASMELFLFLT
TILQNFKLKSLSDPKDIDINSIRSEFSSIPPTFQLCFIPV
>CYP2C13v1
J02861 80% to 2C12
MDPVVVLLLSLFFLLFLSLWRPSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN
FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ
GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN
GSPCDPQFIMGCAPGNVICSIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ
VFNIFPILLDYCPGNHNIYFKNHTWLKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ
ENANQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVT
AKVQEEIDHVIGRHRSPCMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHEVTCDTKFRNYFIPK
GTAVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSA
GKRMCLGESLARMELFLFLTTILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL
>CYP2C13v2
Not in figure probable 2C13 allele NM_138514 7AA DIFFS TO 2C13v1 (98%)
80%
to 2C12 (temp name = CYP2CNEWA)
MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQ
VDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPI
CEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN
GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNI
FPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQENA
NQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVTAKVQEEIDHVIGRH
RSPSMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHDVTCDTKFRNYFIPKGTAVLTSLT
SVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLT
TILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL
>CYP2C22
M58041 61% to 2C79
245425985
MALFIFLGIWLSCLVFLFLWNQHHVRRKLPPGPTPLPIFGNILQVGVKNMSKSMCM 245425818
LAKEYGPVFTMYLGMKPTVVLYGYEVLKEALIDRGEEFSDKMHSSMLSKVSQGL
GIVFSNGEIWKQTRRFSLMVLRSMGMGKRTIENRIQEEVVYLLEALRKTN
GSPCDPSFLLACVPCNLISSVIFQHRFDYSDEKFQKFIENFHTKIEILASPWAQ
LCSAYPVLYYLPGIHNKFLKDVTEQKKFILMEINRHRASLNLSNPQDFIDYFLIKMEKEKHN
EKSEFTMDNLIVTIGDLFGAGTETTSSTIKYGLLLLLKYPEVTAKIQEEITRVIGRHR
RPCMQDRNHMPYTDAVLHEIQRYIDFVPIPLPRKTTQDVEFRGYHIPK
GTSVMACLTSALHDDKEFPNPEKFDPGHFLDEKGNFKKSDYFMAFSA
GRRACIGEGLARMEMFLILTSILQHFILKPLVNPEDIDTTPVQPGLLSLPPPFQLCFIPV
>CYP2C23
X55446 59% to 2C11
MELLGFTTLALVVSVTCLSLLSVWTKLRTRGRLPPGPHPPSHYW
ESTATEPQGHPASLSKLAKEYGPVYTLYFGTSPTVVLHGYDVVKEALLQQGDEFLGRG
PLPIIEDTHKGYGLIFSNGERWKVMRRFSLMTLRNFGMGKRSLEERVQEEAWCLVEEL
QKTKAQPFDPTFILACAPCNVICSILFNDRFQYNDKTFLNLMDLLNKNFQQVNSVWCQ
MYNLWPTIIKYLPGKHIEFAKRIDDVKNFILEKVKEHQKSLDPANPRDYIDCFLSKIE
EEKDNLKSEFHLENLAVCGSNLFTAGTETTSTTLRFGLLLLMKYPEVQAKVHEELDRV
IGRHQPPSMKDKMKLPYTDAVLHEIQRYITLVGSSLPHAVVQDTKFRDYVIPKGTTVL
PMLSSVMLDQKEFANPEKFDPGHFLDKNGCFKKTDYFVPFSLGKRACVGESLARMELF
LFFTTLLQKFSLKTLVEPKDLDIKPITTGIINLPPPYKLCLVPR
>CYP2D1
J02867
MELLNGTGLWSMAIFTVIFILLVDLMHRRHRWTSRYPPGPVPWP
VLGNLLQVDLSNMPYSLYKLQHRYGDVFSLQKGWKPMVIVNRLKAVQEVLVTHGEDTA
DRPPVPIFKCLGVKPRSQGVILASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA
GHLCDAFTAQAGQSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMVKLVEESLTE
VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMALLDNLLAENRTTWDPAQPPRNLTD
AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV
QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRFTSCDIEVQDFVI
PKGTTLIINLSSVLKDETVWEKPHRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPVGQPRPSTHGFFAFPVAPLPYQLCAVVREQGL
>CYP2D
pseudogene Chr 7 ++ 120811066 120811206 2aa diff to 2D2/2D3
exon 8
between
2D1 and 2D3
GTTLIPNLSSLLNDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA
>CYP2D2
X52027 X52455
MGLLIGDDLWAVVIFTAIFLLLVDLVHRHKFWTAHYPPGPVPLP
GLGNLLQVDFENMPYSLYKLRSRYGDVFSLQIAWKPVVVINGLKAVRELLVTYGEDTA
DRPLLPIYNHLGYGNKSKGVVLAPYGPEWREQRRFSVSTLRDFGVGKKSLEQWVTEEA
GHLCDTFAKEAEHPFNPSILLSKAVSNVIASLVYARRFEYEDPFFNRMLKTLKESFGE
DTGFMAEVLNAIPILLQIPGLPGKVFPKLNSFIALVDKMLIEHKKSWDPAQPPRDMTD
AFLAEMQKAKGNPESSFNDENLRLVVIDLFMAGMVTTSTTLSWALLLMILHPDVQRRV
HEEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADIVPTNIPHMTSRDIKFQGFLI
PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVLAGRPRPSTHGVYALPVTPQPYQLCAVAR
>CYP2D3
X52028
MELLAGTGLWPMAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWP
VLGNLLQVDLCNMPYSMYKLQNRYGDVFSLQMGWKPVVVINGLKAVQELLVTCGEDTA
DRPEMPIFQHIGYGHKAKGVVLCTYGPEWREQRRFSVSTLRNFGVGKKSLEQWVTDEA
SHLCDALTAEAGRPLDPYTLLNKAVCNVIASLIYARRFDYGDPDFIKVLKILKESMGE
QTGLFPEVLNMFPVLLRIPGLADKVFPGQKTFLTMVDNLVTEHKKTWDPDQPPRDLTD
AFLAEIEKAKGNPESSFNDANLRLVVNDLFGAGMVTTSITLTWALLLMILHPDVQCRV
QQEIDEVIGQVRHPEMADQAHMPFTNAVIHEVQRFADIVPMNLPHKTSRDIEVQGFLI
PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPTGQPRPSDYGVFAFLLSPSPYQLCAFKR
>CYP2D4
M22331, X52029, X52457 I,T,P
seen in ESTs TDI
or ANV seen in ESTs same lib
MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWP
VLGNLLQIDFQNMPAGFQKLRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTA
DRPPLHFNDQSGFGPRSQGVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEA
RCLCAAFADHSGFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEE
ESGFLPMLLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTD
AFLAEVEKAKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILHPDVQCRV
QQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLI
PKGTTLITNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPTGQPRPSDYGIFGALTTPRPYQLCASPR
>CYP2D18
U48219 S77859 ONLY 5 AA DIFFS probably = 2D4
MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWP
VLGNLLQIDFQNMPAGFQKLRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTA
DRPPLHFNDQSGFGPRSQGVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEA
RCLCAAFADHSGFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEE
ESGFLPMLLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTD
AFLAEVEKAKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILRPDVQCRV
QQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLI
PKGTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPAGQPRPSNYGVFGALTTPRPYQLCASPR
>CYP2D5
X52030 X52458
MELLNGTGLWPMAIFTVIFILLVDLMHRHQRWTSRYPPGPVPWP
VLGNLLQVDPSNMPYSMYKLQHRYGDVFSLQMGWKPMVIVNRLKAVQEVLVTHGEDTA
DRPPVPIFKCLGVKPRSQGVVFASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA
GHLCDAFTAQNGRSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMLTLVEESLIE
VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMAFLDNLLAENRTTWDPAQPPRNLTD
AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV
QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRITSCDIEVQDFVI
PKGTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQHFSFSVPAGQPRPSTLGNFAISVAPLPYQLCAAVREQGH
>CYP2D
pseudogene z chr7:120386407-120386565
exon 9 (+ strand) 73% to 2D3
ACLGEPLTCMELFLFFICLLQSFSFSVKAGQPRPSNHGIFEMPISPSSYQLCA
>CYP2E1
J02627
MAVLGITIALLVWVATLLVISIWKKIYNSWNLPPGPFPLPILGN
IFQLDLKDIPKSFTKLAKRFGPVFTLHLGSRRIVVLHGYKAVKEVLLNHKNEFSGRGD
IPVFQEYKNKGIIFNNGPTWKDVRRFSLSILRDWGMGKQGNEARIQREAQFLVEELKK
TKGQPFDPTFLIGCAPCNVIADILFNKRFDYNDKKCLRLMSLFNENFYLLSTPWIQLY
NNFADYLRYLPGSHRKIMKNVSEIKQYTLEKAKEHLQSLDINCARDVTDCLLIEMEKE
KHSQEPMYTMENVSVTLADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG
PSRVPAVRDRLDMPYMDAVVHEIQRFINLVPSNLPHEATRDTVFQGYVIPKGTVVIPT
LDSLLYDSHEFPDPEKFKPEHFLNENGKFKYSDYFKAFSAGKRVCVGEGLARMELFLL
LSAILQHFNLKSLVDPKDIDLSPVTVGFGSIPPQFKLCVIPRS
>CYP2F4
AF017393 end of exon 5 and exon 6 in seq gap in genome seq chr1 (+)
82269864
MDGVSTAILLLLLAVISLSLTFTSWGKGQLPPGPKPLPILGNLLQLRSQDLLTSLTK 82270034
82270123
LSKDYGSVFTVYLGPRRVIVLSGYQTVKEALVDKGEEFSGRGSYPIFFNFTKGN 82270284
82272477
GIAFSDGERWKILRRFSVQILRNFGMGKRSIEERILEEGSFLLDVLRKTE 82272626
82276791
GKPFDPVFILSRSVSNIICSVIFGSRFDYDDERLLTIIHFINDNFQIMSSPWGE 82276952
82277413
MYNIFPSLLDWVPGPHRRVFRNFGGMKD 82277496
LIARSVREHQDSLDPNSPRDFIDCFLTKMV
QEKQDPLSHFNMDTLLMTTHNLLFGGTETVGTTLRHAFLILMKYPKVQ
82279507
ARVQEEIDCVVGRSRMPTLEDRASMPYTDAVIHEVQRFADVIPMNLPHRVIRDTPFRGFLIPK 82279695
82281147
GTDVITLLNTVHYDSDQFKTPQEFNPEHFLDANQSFKKSPAFMPFSA 82281287
82282297
GRRLCLGEPLARMELFIYLTSILQNFTLHPLVEPEDIDLTPLSSGLGNLPRPFQLCMRIR 82282476
>CYP2G1
M33296 J04715 M34444 chr1 (+)
81996311
MALGGAFSIFMTLCLSCLLILIAWKRTSRGGKLPPGPTPIPFLGNLLQVRIDATFQSFLK 81996490
81997087
LQKKYGSVFTVYFGPRPVVILCGHEAVKEALVDQADDFSGRGEMPTLEKNFQGY 81997248
81998826
GLALSNGERWKILRRFSLTVLRNFGMGKRSIEERIQEEAGYLLEELHKVK 81998975
82001483
GAPIDPTFYLSRTVSNVICSVVFGKRFDYEDQRFRSLMKMINESFVEMSMPWA 82001641
82002014
QLYDMYWGVIQYFPGRHNRLYNLIEELKDFIASRVKINEASFDPSNPRDFIDCFLIKMY 82002190
82004063
QDKSDPHSEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLMKYPEVE 82004206
82005533
AKIHEEINQVIGTHRTPRVDDRAKMPYTDAVIHEIQRLTDIVPLGVPHNVIRDTHFRGYFLPK 82005721
82005841
GTDVYPLIGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNDAFVAFSS 82005981
82007190
GKRICVGEALARMELFLYFTSILQRFSLRSLVPPADIDIAHKISGFGNIPPTYELCFMAR 82007369
>CYP2J3
U39943
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYP
PGPWRLPLVGCLFHLDPKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFT
QMEHNFLNRPVTLLRKHLFNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQ
EEAYHLVEAIKDEGGLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEA
MCLESSMMCQLYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRD
FIDAFLKEMAKYPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ
EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAG
FNLPKGTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSMGKRACLG
EQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL
>CYP2J3
91% to mouse 2j9 exon 8 in a seq gap
116772039
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ
116771830
116767788
FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN 116767791
116766010
GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG 116765861
116765445
GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ 116765284
116760602
LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAK 116760426
116758387
YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ 116758247
116754923
EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPK 116754735
GTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM
116749991
GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 116749815
>CYP2J3P1
U40000
24
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLD 203
204
PKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHL 383
384
FNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQGEAYHLVEAIKDEGGLPFDP 563
564
HFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQLYNIFPRILQYL 743
744
PGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEGRDFIDAFLKEMAKYPDKTTTSFNEEN 923
924 LICSTLDLFFAGTETTSTTLRWALLCMALYPEVQEKMQAEIDRVIGQGRQPNLADRDSMP
1103
1104
YTNAVIHEVQRIGNIIPFNVPRKVAVDTYLAGFNLPKGTMILTNLTALHRDPKEWATPDT 1283
1284
FNPEHFLENGQFKKRESFLPFSM 1352
1415
GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 1591
>CYP2J3P2
U40004
13 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLD
192
193
PKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHL 372
373
FNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEGGLPFDP 552
553
HFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQLYNIFPRILQYL 732
733 PGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAKYPDKTTTSFNEEN
912
913
LICSTLDLFFAGTETTSTTLRWALLCMALYPEVQEKMQAEIDRVIGQGRQPNLADRDSMP 1092
1093
YTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPKG 1206
RDPKEWATPDTFNPEHFLENGQFKKRESFLPFSMGKRACLGEQLARSEL 1348
1349
FIFITSLIQKFTFKPPVNEKVSLQFRMSVTISPVSHRLCAIPRL 1480
>CYP2J5P
exons 1-4 69% to 2j5 mouse now a pseudogene ortholog
116785102
MITSLSSLVTSSWAALLLRTLLLAAVTFLFLAGILRRHRPKDYQPGPWRLPFVGNFFQIDFEQSHLVLQK
116784893
116784415
FAKKYGNVFSLELDRPSVVVVTGQPLIKTKMFTHLEQNFANHFVTSVRKRAIGNN 116784251
116781318
GLITSNGQTWKEKRRFALMTLKNFGLGKKSLEQRMHE*AFHLVEARREEG 116781169
116780474
GQPVDLHLINNAVANVICSITFGGRFEYEDCQFQEMPTLLDEALHV 116780337
>CYP2J4
116734902
MLATAGSLIATIWAALHLRTLLVAALTFLLLADYFKTRRPKNYPPGPWGLPFVGNIFQLDFGQPHLSIQP
116734693
116725983
FVKKYGNIFSLNLGDITSVVITGLPLIKETFTHIEQNILNRPLSVMQERITNKN 116725822
116723426
GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRMQEEAHYLVEAIREEK 116723277
116722875
GKPFNPHFSINNAVSNIICSVTFGERFEYHDSRFQEMLRLLDEVMYLETTMISQ 116722714
116718583
LYNIFPWIMKYIPGSHQTVFRNWEKLKLFVSSMIDDHRKDWNPEEPRDFIDAFLKEMSK 116718407
116716306
YPEKTTSFNEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEVQ 116716169
116713582
EKVQAEIDRVIGQKRAASLADRESMPYTNAVIHEVQRMGNIIPLNVPREVAMDTTLNGFHLPK 116713394
116711364
GTMVLTNLTALHRDPKEWATPDVFNPEHFLENGQFKKRESFLPFSM 116711227
116708412
GKRACLGEQLARSELFIFFTSLMQKFTFKPPTNEKLSLKFRNGLTLSPVTHRICAVPRE* 116708233
>CYP2J4-de6b w
116706163
XXXXXXSFCEENLTCRTLDFLYAGIDTISNRLHWVLLLTCVNPEXX 116706053 exon 6
>CYP2J16-de2b5b9b
x
116691748
KKYGNIFGLNLGDLTSEVITGLLLSKE 116691668 exon 2
116684743
FYDIFPYLMKYIPGITSNCFQKLGKLKLFVSCMTDEHRRDWNPEDPRNFTDALLKEMMK 116684567 exon 5
116677505
GKRACPGEQLARSKLFIFFTALIQKFTF 116677422
116677420
RLGMKSILGLTLSPVTHHI*ALSKQ 116677346 exon 9
>CYP2J16
116664772
MLATVGSLLAKIWSAINFWTLLLTLLTFLLLADYLKNRRPNNYPPGPWRLPFVGNLFQFDLNISHLHLRIQQ
116664557
116654396
FVKKYGNLISLDFGNISVVVITGLPLIKEALINNEQNFLKRPIVPSRYRVFKDN 116654235
116651622
GIFFANVHKWKEQRRFALTMLKNFGLGKKSLEQCIQEEAHHLVEVIGEEK 116651473
116650955
GQPFDPHFRINNAVSNIICSITFGERFEYDDSQFQELLKLADEVICSEASMTSV 116650794
116640170
LYNVFPLIFKYLPGPHQTVFKNWEKLKSIVANMIDRHRKDWNPDEPRDFVDAFLTEMTK 116639994
116638624
YPDKTTTSFNEENLIATTLDLFFAGTETTSTTLRWALLYITLNPEVQ 116638484
116627938
EKVHSEIDRVIGHGRLPSTDDQDAMPYTNAVIHEVLRMGNIIPLNVPREVTADSTLAGFHLPK 116627750
116624337
GKMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSV 116624200
116612610
GKRACPGEKLAKSELFIFFTALMQNFTFKAPTNEKLSLKLRKGLSLYPVSYRICAVPR 116612437
>CYP2J16-de5c6c9c
y
72%
to 2j6 mouse
116604392 LYNIFPWIMNYGPGSHQ 116604342 116604222 exon
5
116604345
SVFRNWEKLKLFVSCMIDNKQRWVP
116604271 exon 5
116602255 YPEKSTSFSQGHLFCSTLNLFRAGSET 116602175 exon 6
116591992
GKRACPGEQMAISELFSFFAAFMQ 116591921 exon 9
116591919
KFTFHLAINEKLRMKFRNGLTLP*SSHLYC 116591830 exon 9
>CYP2J17P
116584536
MLATASCLVANVCSAIPLWTLLLAALSWLPQKQAPQKQPSRALAPAIFGNLFQFDLDVSQLHSGI*PSKK
116584327 exon 1
116581102
FVTKYGNLISLDFGNTSSVIISGLPLIKEALTDM 116581001 exon 2
116580637 EQNLLKCIVLASREHVFKNN 116580578 exon 2 last
half
116570454
LYNVFPFIIKYL
116570419 exon 5
116570408
NQTFFRNWENLNLFVSHMMESHRKDWNPVEPRDFIDAFLTYMTKEDD 116570268 exon 5 last half
116566151
KVHSEIDGVTGHGRPPSTGDRDSMPYTNAVIYEVLRMDNINPLKVPREVTADSTLDEFCLSK 116565966 exon 7
116563406
GTMVLINLTALYRESKEWTTQDTFNPEHFLENGMFKKRESF 116563284 exon 8
116559748
KFTFKPPISEKLSLKFRTGLTLSHVSCRI*SIHR
116559647 exon 9
>CYP2J18P
63%
to 2j6 mouse
116551335 MLGTQDILEAGIWALLH 116551285 exon 1
116551282 RTLLLAAVTFLLLADYLKTGNK 116551217 exon 1
116551217 KKYPWGPCNPPVMNNLFQLDLEQ 116551149 exon 1
116537661
LYNAFLSIMKYHPGSHQ 116537611 exon 5
116537614
SVFRNWEKLIWRMSHIAENHCKG*NPAEL 116537528 exon 5
116537523
REFIDAFLTKMTK 116537485 exon 5
116534551
YPDKTTTNFNEENLICA 116534501 exon 6
116534498
LEFLFARTEITSTTLSWVLLYLSANPGVQ 116534412 exon 6
116529361
LFIFFTSLMQKFTFKPPISEKLILKFRMGLILSPVCH*ICVVPRQ* 116529224 exon 9
>CYP2J10
XM_233199 ortholog of mouse
Cyp2j12
Predicted
GNOMON 86% to 2j12 mouse (LOC313373), mRNA.
2J10
seq specific rev primer
matches 116499966-116499989
forward primer 1 = 116515946 116515968
116516004
MLSTEDTLEAAIRALLHFRTLLLAAVTFLFLANYLKTRRPKNYPPGPWRLPFVGNLFQLDVKQPHVVIQK
116515795
116508667
FVKKYGNLTSLDFGTIPSVVITGLPLIKEAFTNTEQNFLNRPVTPLRKRVFNNN 116508506
116505791
GLIMSNGQTWKEQRRFTMTTLKNFGLGKRSLEQRIQEEANYLVEAIGADK 116505642
116505144
GQPFDPHFKINSAVSNIICSITFGERFEYEDSLFQELLRLLDEASCLESSMMCQ 116504983
116500081
LYNVFPTIIKYLPGSHQTVLRNWEKLKLFISCMMDSHQKDWNPDEPRDFIDAFLTEMAK
116499905
116496152
YRDKTTTSFNKENLIYSTLDLFFAGSETTSNILRWSLLYITTNPEVQ 116496012
116489147
EKVHSEIDRVIGHRRQPSTGDRDAMPYTNAVIHEVLRMGNIIPLNVPREMTADSTLAGFHLPK 116488959
116488244
GTTILTNLTGLHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSM 116488107
116479687
GKRACPGEQLARTELFIFFTALMQNFTFKPPVNETLSLKFRNGLTLAPVSHRICAVPRQ 116479511
>CYP2J13
XM_233198 1455 bp ortholog of mouse Cyp2j13
Predicted
GNOMON Rattus norvegicus similar to CYP2J4 (LOC313372), mRNA.
Missing exon 1 74% to XM_233199, 79% to 2J4
78% to 2J3 90% to 2j13 mouse
116449294
FVKKYGNVISLDLGIMSSVIISSLPLIKEAFSHLDENFINRPIFPLQKHIFNDN 116449133
116446157
GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAHHLVEAIGEEE 116446008
116445630
GQPFDPHFKINNAVSNIICSITFGERFEYHDSQFQELLKLLDKAMYLGTPMMIH 116445469
116440971
LYNMFPWIIKHLPGQHQTLLATWGKLKSYIADIIENHREDWNPAEPRDFIDAFLNEMAK 116440795
116428766
YPDKTTTSFNEENLICSTLDLFLAGTETTSTTLRWAVLYMALYPEVQ 116428626
116426881
EKVQAEIDQVIGQEKHPSLADRDSMPYTNAVVHEIQRMGNIVPLNVPREVAVDTTLAGFHLPK 116426693
116426568
GSVVMTNLTALHMDPKEWATPDVFNPEHFLENGQFKKRDSFLPFSM 116426431
116423270
GKRACLGEQLARSELFIFFTALMQKFTFKPPTNEKLSLKFRLGITISPVSHRICAVPRL 116423094
>CYP2R1
XM_341909
MFQLPGVQTCAGALAGAFLLLLLVLVVRQLLRQRRPAGFPPGPP
RLPFIGNICSLALSADLPHVYMRKQSRVFGEIFSLDLGGISTVVLNGYDVVKECLVHQ
SEIFADRPCLPLFMKMTKMGGLLNSRYGRGWIDHRRLAVNSFHYFGSGQKSFESKILE
ETWSLIDAIETYKGRPFDLKQLITNAVSNITNLILFGERFTYEDTDFQHMIELFSENV
ELAASAPVFLYNAFPWIGILPFGKHQRLFRNADVVYDFLSKLIEKAAVNRKPHLPQNF
VDAYLDEMDKGQNDPLSTFSKENLIFSVGELIIAGTETTTNVLRWAVLFMALYPNIQG
QVHKEIDLIMGHDRRPSWEDKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGY
SIPKGTTVITNLYSVHFDEKYWKDPDMFYPERFLDSSGYFTKKEALIPFSLGRRHCLG
EQLARMEMFLFFTSLLQQFHLHFPHELVPNLKPRLGMTLQPQAYLICAERR
>CYP2S1
(XM_218347 N-term incorrect) CK473647.1 EST with N-term chr1 (-)
no
duplicate exon 4 as in the mouse
81101539
MEAASTWALLLALLLLLLALTLPRTPARGQLPPGPTPLPLLGNLLQLRPGALYSGFLR 81101366
81100365
LSKKYGPVFTVHLGPWRRVVVLVGHDAIREALGGQAEEFSGRGTLATLDKTFDGH 81100201
81097187
GVFFANGERWKQLRKFTLLALRDLGMGKREGEELIQEEVQNLVKAFQRTE 81097038
81095195
GRPFNPSMLLAQATSNVVCSLIFGIRLPYEDKEFQAVIQAASGTLLGISSPWG 81095037
81094957
QAYEMFSWLLQPLPGPHTQLQHHLGTLAAFTIQQVQRHQGRSHTSGPARDVVDAFLQKMA 81094778
81093521
QEKEDPGTEFTEKNLLMTVTYLLFAGTMTIGATIRYALLLLLKYPQVQK 81093375
81090341
KRVREELIQELGPSRTPSLSDRVRLPYTDAVLHEAQRLLALVPMGMPHTVTKTTSFRGYTLPK 81090156
81088138
GTEVFPLIGSVLHDPAVFRNPEEFHPSRFLDDDGRIRKHEAFLPYSL 81087998
81087846
GKRVCLGEGLARAELWLFFTSILQAFSLDTPCPPGDLSLKPAVRGLFNIPPDFQLQVWPTGDQSR 81087652
>CYP2T1
AF368269 (Genbank translation
is incorrect in green region) chr1 (+)
82307700
MVTCEATLLLLLILTLMLMSWGWLAHQARARMQKDLPPGPAPLPLLGNLLQLQSGHLDRVLME 82307888
82308155
LSSRWGPVFTVWLGPRPAVVLSGYAALRDALVLQADAFSGRGSMAVFERFTHGN 82308316
82308634
GIVFSNGPRWRTLRNFALGALKEFGVGTSTIEERILEETACVLDEFQATM
82308783
82309030
GAPFDPRRLLDNAVSNVICTVVFGKRYNYGDPEFLRLLDLFSDNFRIMSSRWGE 82309191
82309932
TYNMFPSFMDWIPGPHHRIFKNFQELRLFISEQIQWHRQSRQTGEPRDFIDCFLEQMDK 82310108
82310183
EHQDPESHFQDETLVMTTHNLFFGGTETTSTTLRYGLLIMLKYPEVA 82310323
82310430
AKVQEELDATVGRTRAPSLADRAHLPYTNAVLHEIQRFISVLPLGLPRALIRDVNLRNHFLHK 82310618
82310840
GTFVIPLLVSAHRDPTQFKDPDHFNPTNFLDDQGEFQNNDAFMPFAP 82310840
82311073
GKRMCLGAGLARSEIFLFLTAILQKFSLLPVGSPADIDLTPQCTGLGNVPPAFQLRLVAR 82311252
>CYP2U1
XM_227677 gc boundary caused missassembly 90% to mouse 2U1
MSSIGGLRPAAGEQPGVGPHLQAVGGALLLCGLAVLLDWVWLQR
QRAGGIPPGPKPRPLVGNFGYLLLPRFLRLHFWLGSGSQTDTVGRHVYLARLARVYGN
IFSFFIGHRLVVVLSDFQSVREALVQQAEVFSDRPRMPLISILTKEKGIVFAHYGPIW
KQQRRFSHSTLRHFGLGKLSLEPRIIEEFAYVKAEMQKHGEAPFSPFPVISNAVSNII
CSLCFGQRFDYTNKEFKKVLDFMSRGLEICLHSQLFLINLCPWFYYLPFGPFKELRQI
ERDITCFLKNIIKEHQESLDANNPQDFIDMYLLHTQEEKDKCKGTNFDEDYLFYIIGD
LFIAGTDTTTNSLLWCLLYMSLNPGVQKKVHEEIERVIGRDRAPSLTDKAQMPYTEAT
IMEVQRLSMVVPLAIPHMTSEKT
(1)
GYSIPKG
TVVLPNLWSIHRDPVIWEKPDDFCPHRFLDDQGQLLKRETFIPFGIGKRVCMGEQLAK
MELFLMFVSLMQSFTFALPEGSEKPIMTGRFGLTLAPHPFNVTVSKR
>CYP2W1
XM_221971 92% to mouse 2W1
MELLVLCVWGILLLLGLWGLLRGCAQDPSMTRQWPPGPRPLPFL
GNLHLLGVTHQDRALMELSERYGPMFTIHLGSQKTVVLSGYEVVREALVGTGHELADR
PPIPIFQLIQRGGGIFFSSGVHWKVARQFTVRTLQSLGIRQPPMVGKVLQELVCLKGQ
LDSYGGQPFPLTLLGWAPCNITFTLLFGQRFDYQDPVFVSLLSLIDQVMVLLGSPGIQ
LFNTFPRLGALLRLHRPVLSKIEEVRTILRTLLEAQRPPLPNGSPARSYVEALLQQGQ
DDPEDMFSEANVLACTLDMVMAGTETTAATLQWAVFLMVKHPHVQGRVQEELDRVLGP
GQLPQPEDQRALPYTSAVLHEVQRYITLLPHVPRCTAADIQLGGYLLPKGTPVIPLLT
SVLLDKTQWETPSQFNPNHFLDAKGCFMKRGAFLPFSTGRRVCVGESLARTELFLLFA
GLLQQYHLLPPPGLSPADLDLRPAPAFTMRPPAQTLRVVPRS
>CYP2AB1
(XM_221297 N-terminal incorrect) AC107471.6 N-term 92% to mouse
189790
MFSLFGGMAFLAGSFLLLKLAALCWRRSHLPPGPFPFPLLGNLWQLNFRLHPNMLFQ (0) 189620
LAQTHGNVFTVWLGSTPIVVLNGFRAVKEALVSNSEQFSGRPLTPFFRD
LFGEKGVICSNGLTWRQQRRFCLTTLRELGLGKQALELQLQHEAAELAEVFHQEQGRA
FDPQVPIIRSTTRVIGALVFGHHFLSEEPIFLELIRAINLGLAFASTTWRRLYDMFPW
ALRYLSGPHQKIFQYHEAVRGFIHHEIIRHKLRTPEAPKDFISCYLSQITKAMDDPVS
TFSEENLIQVVIDLFLGGTDTTATTLHWAIIYLVHHRAIQERVQQELDEVLGTAQAVC
YEDRERLPYTRAVLHEVQRLSSVVAVGAVRQCVTPTWMHGYYVSKGTIILPNLASVLC
DPECWETPHQFNPGHFLDKDGDFVTNEAFLPFSAGHRVCPGEQLARMELFLMFATLLR
TFRFQLPEGSQGLRLEYVFGGTLQPQPQKICAVPRLSSLSPREP
>CYP2AC1
NW_044163.1|Rn9_1523 chromosome 9
3425457
MSGFDFSAILALLGLILILILNIKDFMAKASKRQCPPGPKPWPVIGNLHILNLKRPYQTMLE 3425272
3423187
LSKKYGPIYSIQMGPRKVVVLSGYETVKDALVNYGNQFGERSQVPIFERLFDGK 3423026
3415443
GIAFAHGETWKTMRRFSLSTLRDFGMGKRTIEDTIVVECQHLIQSFESHK 3415294
3412018
GKPFEIKRVLNASVANVIVSMLLGKRFDYEDPQFLRLLTLIGENIKLIGNPSIV 3411857
3410639
LFNIFPILGFLLRSHKKVLRNRDELFSFIRRTFLEHCHNLDKNDPRSFIDAFLVKQQ 3410469
3410029
ENNKSADYFNEENLLALVSNLFTAGTETTAATLRWGIILMMRYPEVQS 3409886
3408812
KVHDEIHKVVGSAQPRIEHRTQMPYTDAVIHEIQRVANILPTSLPHETSTDVVFKNYYIPK 3408627
3406238
GTEVITLLTSVLRDQTQWETPDAFNPAHFLSSKGRFVKKEAFMPFSV 3406098
3402907
GRRMCAGEPLAKMELFLFFTSLMQKFTFQPPPGVSYLDLDLTPDIGFTIQPLPHKICALLRTSAL* 3402710
>CYP3A2
M13646
MDLLSALTLETWVLLAVILVLLYRLGTHRHGIFKKQGIPGPKPL
PFLGTVLNYYKGLGRFDMECYKKYGKIWGLFDGQTPVFAIMDTEMIKNVLVKECFSVF
TNRRDFGPVGIMGKAVSVAKDEEWKRYRALLSPTFTSGRLKEMFPIIEQYGDILVKYL
KQEAETGKPVTMKKVFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKTKKLLRFDFFDP
LFLSVVLFPFLTPIYEMLNICMFPKDSIAFFQKFVHRIKETRLDSKHKHRVDFLQLML
NAHNNSKDEVSHKALSDVEIIAQSVIFIFAGYETTSSTLSFVLYFLATHPDIQKKLQE
EIDGALPSKAPPTYDIVMEMEYLDMVLNETLRLYPIGNRLERVCKKDIELDGLFIPKG
SVVTIPTYALHHDPQHWPKPEEFHPERFSKENKGSIHPYVYLPFGNGPRNCIDMRFAL
MNMKLALTKVLQNFSFQPCKETQIPLKLSRQAILEPEKPIVLKVLPRDAVINGA
>CYP3A
pseudogene 200kb from 3A9
17387311
LYEGRQPVLAITDPDIIKTVLVKECYSTFTNRR 17387213 exon 4
>CYP3A9
U46118
MDLIPNFSMETWLLLVISLVLLYLYGTHSHGIFKKLGIPGPKPL
PFLGTILAYRKGFWEFDKYCHKKYGKLWGLYDGRQPVLAITDPDIIKTVLVKECYSTF
TNRRNFGPVGILKKAISISEDEEWKRIRALLSPTFTSGKLKEMFPIINQYTDMLVRNM
RQGSEEGKPTSMKDIFGAYSMDVITATSFGVNVDSLNNPQDPFVEKVKKLLKFDIFDP
LFLSVTLFPFLTPLFEALNVSMFPRDVIDFFKTSVERMKENRMKEKEKQRMDFLQLMI
NSQNSKVKDSHKALSDVEIVAQSVIFIFAGYETTSSALSFVLYLLAIHPDIQKKLQDE
IDAALPNKAHATYDTLLQMEYLDMVVNETLRLYPIAGRLERVCKTDVEINGVFIPKGT
VVMIPTFALHKDPHYWPEPEEFRPERFSKKNQDNINPYMYLPFGNGPRNCIGMRFALM
NMKVALVRVLQNFSFQPCKETQIPLKLSKQGLLQPEKPLLLKVVSRDETVNGA
>CYP3A
pseudogene z 64% to 3A18
9216052
FGPVGFMKKAVTISEDDEGKRLRPLLSPVFTSGK 9216153
9216386
LWLLHFGLMCSPSSGSVMSVKHLRQEEKGEPIHMKE 9216493
9216722
FSGAYSMNGIAGASFGVNVDSLNN 9216781
XXXXXXXXXXXXXXXXXXXXXXXX
VVLFPFLTQI
>CYP3A18
X79991
MEIIPNLSIETWVLLATSLMLFYIYGTYSHGLFKKLGIPGPKPV
PLFGTIFNYGDGMWKFDDDCYKKYGKIWGFYEGPQPFLAIMDPEIIKMVLVKECYSVF
TNRRCFGPMGFMKKAITMSEDEEWKRLRTILSPTFTSGKLKEMFPLMRQYGDTLLKNL
RREEAKGEPINMKDIFGAYSMDVITGTSFGVNVDSLNNPQDPFVQKAKKILKFQIFDP
FLLSVVLFPFLTPIYEMLNFSIFPRQSMNFFKKFVKTMKKNRLDSNQKNRVDFLQLMM
NTQNSKGQESQKALSDLEMAAQAIIFIFGGYDATSTSISFIMYELATRPNVQKKLQNE
IDRALPNKAPVTYDALMEMEYLDMVVNESLRLYPIATRLDRVSKKDVEINGVFIPKGT
VVTIPIYPLHRNPEYWLEPEEFNPERFSKENKGSIDPYVYLPFGNGPRNCIGMRFALI
SMKLAVIGVLQNFNIQPCEKTQIPLKISRQPIFQPEGPIILKLVSRD
>CYP3A23/CYP3A1
D13912
MDLLSALTLETWVLLAVVLVLLYGFGTRTHGLFKKQGIPGPKPL
PFFGTVLNYYMGLWKFDVECHKKYGKIWGLFDGQMPLFAITDTEMIKNVLVKECFSVF
TNRRDFGPVGIMGKAISVSKDEEWKRYRALLSPTFTSGRLKEMFPVIEQYGDILVKYL
RQEKGKPVPVKEVFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKAKKLLRIDFFDPLF
LSVVLFPFLTPVYEMLNICMFPKDSIEFFKKFVYRMKETRLDSVQKHRVDFLQLMMNA
HNDSKDKESHTALSDMEITAQSIIFIFAGYEPTSSTLSFVLHSLATHPDTQKKLQEEI
DRALPNKAPPTYDTVMEMEYLDMVLNETLRLYPIGNRLERVCKKDVEINGVFMPKGSV
VMIPSYALHRDPQHWPEPEEFRPERFSKENKGSIDPYVYLPFGNGPRNCIGMRFALMN
MKLALTKVLQNFSFQPCKETQIPLKLSRQGLLQPTKPIILKVVPRDEIITGS
>CYP3A62
AB084894 80% to CYP3A9, 78% to Cyp3a13
MDLIPNISLETWMLLATILVLLYLYGTSTHGNFKKLGISGPKPL
PFVGNILAYRHGFWEFDRHCHKKYGDIWGFYEGRQPILAITDPDIIKTVLVKECYSTF
TNRRSFGPAGILKKAITLSEDEEWKRLRTLLSPTFTSGKLKEMFPIINQYADLLVKNV
KHEAEKGNPITMKDIFGAYSMDVITGTSFGVNVDSLNNPQNPFVQKVKKLLKFNFLDP
FFLSVILFPFLTPVFEAFDITVFPKDVMKFFRTSVERMKENRMQEKVKQRLDFLQLMI
NSQSSGDKESHQGLTDVEIVAQSIFFIFAGYETTSSALSFALYLLATHPDLQKKLQDE
IDAALPNKAPVTYDVLVEMEYLDMVLNETLRLFPVGGRLERVCKKDVEINGVFIPKGT
VVMVPTFALHKDPKCWPEPEEFCPERFRKKNQDSINPYIYLPFGNGPRNCIGMRFALM
NMKIALVRVLQNFSFGLCKETQIPLKLRKKGFFQPEKPIILRAVSRD
>CYP3A71P
new pseudogene 78% to 3A2
9497641
LFEWHTPVFAITDREMIKNVLVKECFSVFTNWR 9497543 exon 4
9468424
DLGPMGIMNKSIAF*KDEEWKRYRALLSPMFTSGKLKV 9468311
9468234
MFPIIKLYGDILVKYLRQEAEKGKPVSVKE 9468145
9467886
IFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKTKKFLRLDYFDPLFISV 9467737
9464756
GLFPFLKPIYDMLNISVFPKDSIAFFKNFVYSMKESHLDSKQK 9464631
9453844
YQVDFFQLMMNAHNNSSESHK 9453782
9451670
FPALSDIEIIAQSIIFTFGGYDTTSSTLSFVLYSLATHSDVQKKLQEEIDHALPNK 9451503
9449744
ASPTYDIVMEMEYLDMVFNETLRLYPVTGRLHRMCKKDIELDGVFIPKG
SMVMIPLYPLQHDPQHWPEPEEFRPE 9449520
9447351
RFSKENKCRTGHYVYLPFGNGPRNCLGMRFALMSMKLAVTKVLQNFSFHPCKET 9447190
9444444
QIPLKLSKQVILKPEKPIVLKVVPRDGVING 9444352
>CYP3A73
chr12_random_1.5 (from UCSC browser)
MDLVSALSLETWLLLAIILVLFYR
(2)
FGTRTHGIFKKQGIPGPKPLPFLGTVLNYYR
(0)
GLWKFDMECYKKCGKIWG
(2)
LFDGQTPVFAIMDTEMIKSVLVKECFSVFTNRR
(0)
NIGPVGIMSKSISVAKDEEWKRYRAFLSPTFTSGRLKE
(0)
MFPIIEHYGDILVKYLKQKVEKGKPLAMKE
(2)
VFGAYSMDVITSTSFEVNINSINNPKDPFVEKVKKFQRFDFFDPLFLSV
(1)
VLFPFLTPIYEMLNICLFPKDSVAFFQKFVYRMKQTRLDSKHK
(0)
HRVDFLQLMMNAHNNSKDKVSHK
(1)
ALSDIEIVAQAIIFIFASYETTSSTLSFVLYSLATHPDSQKKLQEEIDRALPNK
(0)
APPTYDTVMEMEYLDMVLNETPRLYPIGYRLERVCKKDIKLDGVFIPKGSVVMIPFYTLQHDPQHWPEPEEFLPER
(2)
FSKENKGSIDPYVYLPFGNGPRNCIGMRFALMNMKLALTKVLQNFSFQLCEETQ
(0)
IPLKLSRQRLFGPEKPIVLKVVPRDAVITGA*
>CYP4A1
M14972 NM_175837 1 AA DIFF
MSVSALSSTRFTGSISGFLQVASVLGLLLLLVKAVQFYLQRQWL
LKAFQQFPSPPFHWFFGHKQFQGDKELQQIMTCVENFPSAFPRWFWGSKAYLIVYDPD
YMKVILGRSDPKANGVYRLLAPWIGYGLLLLNGQPWFQHRRMLTPAFHYDILKPYVKN
MADSIRLMLDKWEQLAGQDSSIEIFQHISLMTLDTVMKCAFSHNGSVQVDGNYKSYIQ
AIGNLNDLFHSRVRNIFHQNDTIYNFSSNGHLFNRACQLAHDHTDGVIKLRKDQLQNA
GELEKVKKKRRLDFLDILLLARMENGDSLSDKDLRAEVDTFMFEGHDTTASGVSWIFY
ALATHPKHQQRCREEVQSVLGDGSSITWDHLDQIPYTTMCIKEALRLYPPVPGIVREL
STSVTFPDGRSLPKGIQVTLSIYGLHHNPKVWPNPEVFDPSRFAPDSPRHSHSFLPFS
GGARNCIGKQFAMSEMKVIVALTLLRFELLPDPTKVPIPLPRLVLKSKNGIYLYLKKL
H
>CYP4A2
M57719 M33938
MGFSVFSPTRSLDGVSGFFQGAFLLSLFLVLFKAVQFYLRRQWL
LKALEKFPSTPSHWLWGHNLKDREFQQVLTWVEKFPGACLQWLSGSTARVLLYDPDYV
KVVLGRSDPKPYQSLAPWIGYGLLLLNGKKWFQHRRMLTPAFHYDILKPYVKIMADSV
SIMLDKWEKLDDQDHPLEIFHYVSLMTLDTVMKCAFSHQGSVQLDVNSRSYTKAVEDL
NNLIFFRVRSAFYGNSIIYNMSSDGRLSRRACQIAHEHTDGVIKTRKAQLQNEEELQK
ARKKRHLDFLDILLFAKMEDGKSLSDEDLRAEVDTFMFEGHDTTASGISWVFYALATH
PEHQERCREEVQSILGDGTSVTWDHLDQMPYTTMCIKEALRLYSPVPSVSRELSSPVT
FPDGRSIPKGIRVTILIYGLHHNPSYWPNPKVFDPSRFSPDSPRHSHAYLPFSGGARN
CIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPMPRLVLKSKNGIHLRLKKLR
>CYP4A8v1
M37828
MSGSALSFTIFPGSILGFLQIATVLTVLLLLLKTAQFYLHRRWL
LRATQQFPSPPSHWFFGHKIPKDQDFQDILTRVKNFPSACPQWLWGSNVRIQVYDPEY
MKLILGRSDPKAHGSYRFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDTLKPYVGIM
ADSVRIMLDKWEQIVGQDSTLEIFQHITLMTLDTIMKCAFSQEGSVQLDRKYKSYIKA
VEDLNNLFFFRVQNMFHQNDFIYSLSSNGRKAHNAWQLAHDYTDQVIKSRKAQLQDEE
ELQKVKQKRRLDFLDILLFARIENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYA
LATNPEHQQGCRKEIQSLLGDGASITWDDLDKMPYTTMCIKEALRIYPPVTAVSRMLS
TPVTFPDGRSLPKGITVMLSFYGLHHNPTVWPNPEVFDPYRFAPESSRHSHSFLPFSG
GARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPIPIPRLVLKSKNGIYLRLKKLQ
>CYP4A8v2
97% TO 4A8 BC081771
MSGSALSFTIFPGSILGFLQIATVLTVLLLLFKTAQFYLHRRWL
LRATQQFPSPPSHWFFGHKIPKDQEFQDILTRVKNFPSACPQWLWGSNVRIQVYDPDY
MKLILGRSDPKSHHSYRFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDTLKPYVGIM
ADSVRIMLDKWEQIVGQDSTLEIFQHITLMTLDTIMKCAFSQEGSVQLDRKYKSYIKA
VEDLNNLSFFRIRNIFHQNDIIYSLSSNGRKARSAWQLAHEHTDQVIKSRKAQLQDEE
ELQKVKQKRRLDFLDILLFARIENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYA
LATNPEHQQGCRKEIQSLLGDGASITWDDLDKMPYTTMCIKEALRIYPPVTAVSRMLS
TPVTFPDGRSLPKGITVMLSFYGLHHNPTVWPNPEVFDPYRFAPESSRHSHSFLPFSG
GARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPIPIPRLVLKSKNGIYLRLKKLQ
>CYP4A8v2-de1b rat
UCSC browser a in fig
CYP4A exon 1 pseudogene
chr5:135545150-135545338 (- strand)
135545338
MSIFELSHITTGFGISGLLQMVSWLGLLLLLLFKAAQYYLHRQWIIKSVQHFPSPPSHWFFGN 135545150
>CYP4A8v2-de5c6c12c rat
UCSC browser b in fig
135596695-135596537 (-
strand) exon 5
SVLLPQNKWE*TISDSLEIFQCASLITLATILMCVFSY*DNVHLN
135596212-135596066 (-
strand) exon 6
HSQTYTQVVGILNNLRNAFPQSDIFYRMTADGHRTKNAFLIAHKHSDFV
135594009-135593884 (-
strand) exon 12
KQFIMNEMKVVITLTLLCFEWLLDPTRVSVSISGFLLNPRMG
>CYP4A8v2-de4d12d rat
UCSC browser c in fig
135628319-135628215 (-
strand) A exon 4
51% to 4A8
LSNDQTWFQHY*HI*TPLLHCDILKSNVRIVADCI
135617441-135617274 (-
strand) B exon 12
75% to 4A2 76% to 4A8
RICIGKQLAMNAQKLAVALTLLQFELLPDPTRVPIPTEKLVLKSKNGIHLHLRKLQ
>CYP4A34P
new pseudogene seq between 4A3 and 4A2 T
65% to 4A2
135837702- 135845966 (+ strand)
MGIFELSHITTVFGISRLLQMVFWLGLLLLLFKAAQYYLRRQWIIKSFQQFPFPPSHWLFGNFLK
135838759
KDQDLQQIRLWVEKFPTACVRWFWGNHACVLIYDPD*MKVILG*S 135838887 aa 65-107
(seq gap)
GYSLLLLNGKKWFQHRQMLTPAFHSDILKPYVGIMA
FSIFLLQDKWEELVGQDCPLEIYQDISLMTMETLINCAFSYQGSVQLE
NSRS*IKAVEDLTHLIHFRVRNGFH*SNIIYNLSSNGGSFHCACQIAHKHKG
DRVIRRRKVQLQSGVELEKIWKKWHLDLLDILLFAQ
EDGKSLSDEDLHAEVDTFMFEGHDTAARGISWIFYALPTHPEHQERCKEEVQSILGDGTSVTW
DHLDQMPYTTMCIKKALRLYPPGPAVSRELSTPVTFPDGCSSPKNSRISVV
IFGLHHNPRL*PNPE
VLDPFRFAPDVPQHTHAFLPFSAGAR
NCIRKLFAMNELKVAVTLTLL*LELLPDPTRVPFLVARTVLKSKIRIYLHLKKLK
>CYP4A34P-de12b
C-term aa 453-508 with one frameshift
135831318
RNCIGEHFAMNELKVAMALTLL 135831383
135831383
QFELLPDPTRIPIPIPRLVLKSKNGIYLHLKKLQ 135831484
>CYP4A3
M33936
MGFSVFTPTRSLDGVSGFFQGAFLLSLFLVLFKAVQFYLRRQWL
LKALEKFPSTPSHWLWGHDLKDREFQQVLTWVEKFPGACLQWLSGSKTRVLLYDPDYV
KVVLGRSDPKASGIYQFLAPWIGYGLLLLNGKKWFQHRRMLTPAFHYGILKPYVKIMA
DSVNIMLDKWEKLDDQDHPLEIFHYVSLMTLDTVMKCAFSHQGSVQLDVNSRSYTKAV
EDLNNLTFFRVRSAFYGNSIIYNMSSDGRLSRRACQIAHEHTDGVIKMRKAQLQNEEE
LQKARKKRHLDFLDILLFAKMEDGKSLSDEDLRAEVDTFMFEGHDTTASGISWVFYAL
ATHPEHQERCREEVQSILGDGTSVTWDHLDQIPYTTMCIKEALRLYPPVPSVSRELSS
PVTFPDGRSIPKGITTTILIYGLHHNPSYWPNPKVFDPSRFSPDSPRHSHAYLPFSGG
ARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPMARLVLKSKNGIHLRLKKLR
>CYP4A33P
135689641-135700825 (+) between 4A8 and 4A2
DKWEQIVGQDSTLEIVQHNTLMTLDTIMKCAFSQEGSVQLDR
KYKSYIKAVGDLNNLSFFRIWNIFHQNDIIYSLSSNGCQANSAC*LAHEHT
DQVIKSRKAQLQDEEELQKVKQKRRLDFLDILLFAR
IENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYALATNPEHQQGCRKEIQSLLGDGASITW
DDLDKMPYTTMCIKEALSIYPPVPSVSRMLSTPVTFPDGCSLPK
GITAVLSFYGHHHNPTL*PNPE
VFDPYRVFPXSSQHSHLFLPFSGGAR
NCIGKQFAMNELKVAIALTLLCLRLLPDPTRIPIPIPRLVLKSKNGIYLHLKKLQ
>CYP4A33P-de4b5b12b rat
UCSC browser
135662099-135662209 (+
strand) F exon 4
LSNDQTWFQH*HILTPLFHYGILKTNVRIIVDSVHEM
135671387-135671512 (+
strand) G exon 5
DISDSLEIFQCASLIALATIMMCAFSYQDNVHLNRSVTSQSF
135676914-135677039 (+
strand) S exon 12
KQFIMNEMKVVITLTLLCFEWLLDPTRVSVSISGFLLNPRMG
>CYP4A33P-de10c11c rat
UCSC browser
135712750-135712827 (+
strand) exons 10
GVLISFSICGLHHNPRLWPNAE
(0)
135713048-135713125 (+
strand) R exon 11
VFDPFRFAPDVLRHTHAFLPFSAGAR
>CYP4B1
M29853
MVLNFLSPSLSRLGLWASVVILMVIVLKLFSLLLRRQKLARAMD
SFPGPPTHWLFGHALEIQKLGSLDKVVSWAQQFPHAHPLWFGQFVGFLNIYEPDYAKA
VYSRGDPKAADVYDFFLQWIGKGLLVLDGPKWFQHRKLLTPGFHYDVLKPYVAIFAES
TRMMLDKWEKKASENKSFDIFCDVGHMALDTLMKCTFGKGDSGLGHRDNSYYLAVSDL
TLLMQQRIDSFQYHNDFIYWLTPHGRRFLRACKIAHDHTDEVIRQRKAALQDEKERKK
IQQRRHLDFLDILLGVRDESGIKLSDAELRAEVDTFMFEGHDTTTSGISWFLYCMALY
PEHQQLCREEVRGILGDQDSFQWDDLAKMTYLTMCMKECFRLYPPVPQVYRQLNKPVT
FVDGRSLPAGSLISLHIYALHRNSTVWPDPEVFDPLRFSPENAAGRHPFAFMPFSAGP
RNCIGQQFAMNEMKVVTALCLLRFEFSLDPSKMPIKVPQLILRSKNGIHLYLKPLASR
SGK
>CYP4F39
UPSTREAM OF 4F5 chr7 (+) 94% to mouse 4f39 = ortholog
13051717
MLPITDYLLYLLGLEKTAFRVYVLSALLLFLLFLLFRLLLQAFKLFS
DFRITCRRLSCFPEPPGRHWLLGHMSM 13051938
13054230
YLPNEKGLQNEKKVLDTMHHIILAWVGPFLPLLVLVHPDYIKPVLGAS 13054373
13064279
AAIAPKDEFFYSFLKPWL 13064332
13064958
GDGLLISKGNKWSRHRRLLTPAFHFDILKPYMKIFNQSVNIMH 13065086
13065271
AKWRRHLAEGSVTSFDMFEHVSLMTLDSLQKCVFSYSSDCQE 13065396
13067956
KLSDYISSIIELSALVVRRQYRLHHYLDFIYYLTADGRRFRQACDTVHNFTTEVIQQRRR
ALRELGAEAWLKAKQGKTLDFIDVLLLAK 13068222
13072211
DEEGKELSDEDIRAEADTFMFE 13072276
13072388
GHDTTSSGLSWALFNLAKYPEYQDKCREEIQEVMKGRELEELDW 13072519
13076560
DDLTQLPFTTMCIKESLRQFPPVTLISRRCTEDIKLPDGRIIPK 13076691
13078724
GIICLVSIYGTHYNPLVWPDSK 13078789
13079451
VYNPYRFDPDIPQQRSPLAFVPFSAGP 13079531
13079947
RNCIGQSFAMAEMRVVVALTLLRFRLSVDRTRKVRRKPELILRTENGLWLNV
EPLPSRAGVPRGPTEPEVQAPPAQA* 13080177
>CYP4F17
= CYP4F19temp AI030199 EST CHR7 13095557
13103056 chr7 (+)
90%
to 4f17 next closest 82%, probable ortholog of 4f17
13095557
MLQLSLSWLGRGPVTVSPWQLLLVVGTSLLLARILAWISAFYDN
YCRLRCFPQPPSRHWFWGHLNL 13095754
13102916
VKNNEEGLQLLAEMSHQFQDIHLCWIGIFYPILRLIHPKFIGPILQA 13103056
13103866
AAAVAPKEMIFYGFLKPWL 13103922
13104011
GDGLLVSAGEKWSRQRRLLTPAFHFDILKPYVKNFNKSVNIMH 13104139
13105431
AKWQRLTAKGSARLDMFEHISLMTLDSLQKCVFSFDSNCQE 13105553
13106281
SPSEYIAAIQELSSLIVKRHHQPFLYMDFLYYLTADGRRFRKACDLVHNFTDAVIRERRR
TLSSQSVDEFLKSKTKSKTLDFIDVLLLAK 13106550
13106863
DEHGKELSDEDIRAEADTFMFG 13106928
13107118
GHDTTASALSWILYNLARHPEHQERCRQEVRELLRDREPEEIEW 13107249
13110034
DDLTQLPFLTMCIKESLRLHPPVTVISRCCTQDVVLPDGRVIPK 13110165
13110226
GNDCIISIFGVHHNPSVWPDPE 13110303
13110457
VYDPFRFDSENPQKRSPLAFIPFSAGP 13110537
13110878
RNCIGQTFAMNEMKVAVALTLLRFRLLPDDKEPRRKPELILRAEGGLWLRVEPLSTGAQ 13111054
>CYP4F5/4f16
13119940 13133265 chr7 (+) 3 aa
diffs to mRNA U39207 90% to 4f16 89% to 4f37
13119940
MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCENCSRLRCFPQSPKRNWFLGHLGT 13120137
13122954
IQSNEEGMRLVTEMGQTFRDIHLCWLGPVIPVLRLVDPAFVAPLLQAP 13123097
13125947
ALVAPKDTTFLRFLKPWL 13126000
13126086
GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNIMH 13126214
13227610
VKWKHLCVEGSAHLEMFENISLMTLDSLQKCLFGFDSNCQE 13227732
13128135
SPSEYISAILELSSLIIKRSQQLFLYLDFLYYRTADGRRFRKACDLVHNFTDAVIRERRR
LLSSQGTDEFLESKTKSKSKTLDFIDVLLLAK
13128410
13129071
DEHGKELSDEDIRAEADTFMFG 13129136
13129327
GHDTTASALSWILYNLARHPEYQERCRQEVWELLRDREPEEIEW 13129458
13132295
DDLAQLPFLTMCIKESLRLHPPAIDLLRRCTQDIVLPDGRVIPK 13132426
13132538
GNICVISIFGIHHNPSVWPDPE 13132603
13132764
VFDPFRFDSENRQKRSPLSFIPFSAGP 13132844
13133089
RNCIGQTFAMNEMKVVVALTLLRFRVLPDDKEPRRKPEIILRAEGGLWLRMEPLSTDTQ 13133265
>CYP4F5?
AF288818 7aa diffs to 4F5 probably same gene
MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCEN
CSRLRCFPQSPKRNWFLGHLGT
IQSNEEGMRLVTEMGQTFRDIHLCWLGPVIPVLRLVDPAFVAPLLQAP
ALVAPKDPTFLHFLKPWL
GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNIMH
AKWKHLCLEGSVRLEMFENISLMTLDSLQKCLFGFDSNCQE
SPSEYISAILELSSLIIKRSQQLFLYLDFLYYRTADGRRFRKACDLLHNFTDAVIRERRR
LLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK
DEHGKELSDEDIRAEADTFMFG
GHDTTASALSWILYNLARHPEYQERCRQEVWELLRDREPEEIEW
DDLAQLPFLTMCIKESLRLHPPAIDLLRRCTRHIVLPDGRVIPK
GNICVISIFGIHHNPSVWPDPE
VFDPFRFDSENRQKRSPLSFIPFSAGP
RNCIGQTFAMNEMKVVVALTLLRFRVLPDDKEPRRKPEIILRAEGGLWLRMEPLSTDTQ
>CYP4F37
94% to 4F5 chr7 (+) 89% to 4f16 88% to 4f37
13149326
MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCENCSRLRCFPQSPKRNWFLGHLGV 13149523
13159662
IQSNEEGMQLVTEMGQTFRDVHLIWLGPVSPVLRLVDPAFVAPLLQAP 13159805
13162623
ALVAPKDPTFLHFLKPWL 13162676
13162768
GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKTFNQSVNIMH 13162896
13164072
AKWKHLCLEGSARLEMFENISLMTLDSLQKCLFGFDSNCQE 13164194
13164825
SPSEYISATLELSSLTRKRSYKLFLYLDFLYYRTADGQRFRKACDLVHSFTDAVIRERRR
LLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK 13165100
13165761
DEHGKELSDEDIRAEADTFMFG 13165826
13165992
GHDTTASALSWILYNLASHPEYQERCRQEVWELLRDREPEEIEW 13166123
13168834
DDLAQLPFLTMCIKESLRLHPPAVDLLRRCTQDIVLPDGRVIPK 13168965
13169077
GNICVISIFGIHHNPSVWPDPE 13169142
13169302
VYDPFRFDPENRQKRSPLSFIPFSAGP 13169382
13169627
RNCIGQTFAMNEVKVAVGLTLLRFRFLPDDKEPRRKPELILRAEGGLWLRVELLSRDTQ 13169803
>CYP4F43P
pseudogene chr7 (+) strand exons 4, 5, 9, 10, 11, 12
13179984
RDGVFLISFDKWNHHHCLLTPAFHFDNLVL 13180073
*VKIFNQSVNIIH
13181413
VSFLKAKWKCLFSEGSACLEIFENLTTLDSLQKCLFSLDSNCQE 13181544
13207589
NDLAQLPFLTMCIKASLQLYPQDTNLICSCT 13207681
*DILLPDG*VIPK
XXXXXXXXXGVHHSPSVWTDPX
13208327
VYYPFPFDSKNPQKISPLAFMPFSVGP 13208407
13208782
RNCKRQTYPMSERKVALVLKLLHFHTIPGEIDPPRQPELILSLEGRLWLLKESLSVG 13208952
>CYP4F44P
pseudogene MISSING EXON 1 AND HALF OF EXON 2 90% to 4f16
13215861
LGPVIPVLRLVDPAFVAPLLQAP 13215929
13219942
ALVAPKDMNFYGFLKPWL 13219995
13220082
GDGLLLSSGDKWNRHRXLTPAFHFDILKPYVKIFNQSVNIMH 13220206
13227610
VKWKHLCVEGSAHLEMFENISLMTLDSLQKCLFGFDSNCQE 13227732
13228571
SPSEYISAILELSSLTIKRSYQLFLYLDFLYYRTADGRRFRKAC
DLVHSFTDAVIRERRRLLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK 13228846
13230087
DEHGKELSDEDIRAEADTFMFG 13230152
13230344
GHDTTASTLSWILYNLARHPEYQESCLQEVWELLRDREPEEIEW 13230475
13239583
DDLAQLPFLTMCIKESLRLHPPAVDLLRRCTQDIVLPDGRVIPK 13239714
13239826
GNICVISIFGIHHNPSVWPDPE 13239891
13240051
VYDPFRFDPESRQKRSPLSFIPFSAGP 13240131
13240378
RNCIGQTFAMNEMKVAVALTLLRFRLLPDDKEPRRKPEIILRAEGGLRLLVEPLSGGA* 13240554
>CYP4F40
91% to 4f40 next closest = 82% probable ortholog of 4f40
13268033
MRHLDLSWLGLGPMSASPWLLLSLVGVSWFLTRCLTQIYTLYAK
CQRLCGFPQPPKRSWFWGHLGM 13268230
13270621
SPPTEEGMKQMTELVATYPQGFMTWLGPIVPLITLCHPDIIRSVLSAS 13270764
13273375
AAVAPKDGIFYSFLKPWL 13273428
13273519
GDGLLVSASDKWSRHRSMLTPAFHFNILKPYVKIFNDSTNIMH 13273647
13275412
AKWLRLASGGSAHLDMFENISLMTLDTLQKCVFSFNSNCQE 13275534
13276605
KPSEYIAAILELSALVVKRNEQLLLHMDLLYRLTPDGRRFYKACHLVHDF
TYAVIQERRRTLPKHGGDDVIKAKAKSKTLDFIDVLLLSK 13276874
13279547
DEDGKELSDEDIRAEADTFMFEG 13279615
13281122
GHDTTASGLSWILYNLAKHPEYQERCRQEVQELLRDRDSEEIEW 13281253
13281122
DDLAQLPFLTMCIKESLRLHPPVTMVSRCCTQDISLPDGRVIPK 13281253
13281331
GIICIINIFATHHNPTVWQDPE 13281396
13281524
VYDPFRFDPENIQARSPLAFIPFSAGP 13281604
13281923
RNCIGQTFAMNEMKVAVALTLLRFRVLPDDKEPRRKPELILRAEDGLWLRVEPLSAQA 13282096
>CYP4F4/4f15
U39206 chr7 (+) strand 92% to 4f15 next closest 83% probable ortholog of 4f15
13293478
MPQLDLSWLGLGPMSASPWLLLLLVGASWLLVRVLTQTYIFYRT
YQHLCDFPQPPKWNWFLGHLGM 13293675
13296360
ITPTEQGLKQVTKLVATYPQGFMTWLGPILPIITLCHPDVIRSVLSA 13296500
13298373
SASVALKEVIFYSFLKPWL 13298429
13298517
GDGLLLSDGDKWSCHRRMLTPAFHFNILKPYVKIFNDSTNIMH 13298645
13301094
AKWQDLASGGSARLDMFKNISLMTLDSLQKCVFSFDSNCQE 13301216
13303748
KPSEYISAILELSALVAKRYQQLLLHTDSLYQLTHNGRRFHKACKLVHNFTDAVIQGRRR
ALPSQHEDDILKAKARSKTLDFIDVLLLTK 13304017
13305908
DEDGKELSDEDIRAEADTFMFE 13305973
13306182
GHDTTASGLSWILYNLARHPEYQERCRQEVRELLRDRESTEIEW 13306313
13307962
DDLAQLPFLTMCIKESLRLHPPVTVISRRCTQDIVLPDGRVIPK 13308093
13308178
GVICIINIFATHHNPTVWPDPE 13308243
13308394
VYDPFRFDPENIKDRSPLAFIPFSAGP 13308474
13308851
RNCIGQTFAMNEMKVALALTLLRFRVLPDDKEPRRKPELILRAEGGLWLRVEPLSTQ 13309021
>CYP4F1/4f14
M94548 chr7 12 exons (-) strand 95% to 4f14 probable ortholog
13600726
MSQLSLSWLGLGPEVAFPWQTLLLFGASWILAQILTQIYAAYRN
FRRLRGFPQPPKRNWLMGHVGM 13600529
13598616
VTPTEQGLKELTRLVGTYPQGFLMWIGPMVPVITLCHSDIVRSILNAS 13598473
13595808
AAVALKDVIFYTILKPWL 13595755
13595666
GDGLLVSAGDKWSRHRRMLTPAFHFNILKPYVKIFNDSTNIMH 13595538
13595386
AKWKRLISEGSSRLDMFEHVSLMTLDSLQKCVFSFDSNCQE 13595264
13593683
KSSEYIAAILELSALVAKRHQQPLLFMDLLYNLTPDGMRFHKACNLVHEFTDAVIRERRR
TLPDQGLDEFLKSKAKSKTLDFIDVLLLTK 13593414
13592552
DEDGKELSDEDIRAEADTFMFE 13592552
13592326
GHDTTASGLSWILYNLANDPEYQERCRQEVQELLRDRDPEEIEW
13592195
13591103
DDLAQLPFLTMCIKESLRLHPPVTVISRCCTQDILLPDGRTIPK 13590972
13590900
GIICLISIFGIHHNPSVWPDPE 13590835
13590677
VYNPFRFDPENIKDSSPLAFIPFSAGP 13590597
13590229
RNCIGQTFAMSEMKVALALTLLRFRLLPDDKEPRRQPELILRAEGGLWLRVEPLTAGAQ 13590053
>CYP4F6/4f13
U39208 chr7 (-) strand 91% to 4f13 probable ortholog
13635825
MLQLSLSRLGMGSLTASPWHLLLLGGASWILARILAWIYTFYDN
CCRLRCFPQPPKPSWFWGHLTL 13635628
13629996
MKNNEEGMQFIAHLGRNFRDIHLSWVGPVYPILRLVHPNVIAPLLQA 13629856
13620199
SAAVAPKEMTLYGFLKPWL 13620143
13620054
GDGLLMSAGEKWNHHRRLLTPAFHFDILKSYVKIFNKSVNTMH 13619926
13618162
AKWQRLTAKGSARLDMFEHISLMTLDSLQKCIFSFDSNCQE 13618040
13616381
SNSEYIAAILELSSLIVKRQRQPFLYLDFLYYLTADGRRFRKACDVVHNFTDAVIRERRS
TLNTQGVDEFLKARAKTKTLDFIDVLLLAK 13616112
13615792
DEHGKGLSDVDIRAEADTFMFG 13615727
13615538
GHDTTASALSWILYNLARHPEYQERCRQEVRELLRDREPEEIEW 13615407
13610183
DDLAQLPFLTMCIKESLRLHPPVLLISRCCSQDIVLPDGRVIPK 13610052
13609958
GNICVISIFGVHHNPSVWPDPE 13609893
13609762
VYNPFRFDPENPQKRSPLAFIPFSAGP 13609682
13609348
RNCIGQTFAMSEIKVALALTLLRFCVLPDDKEPRRKPELILRAEGGLWL
RVEPLSTVTSQLPWDLLAHPPTS 13609133
>CYP4F18
XM_224708 ASSEMBLY
MODIFIED from Genbank entry 77% to 4F1 chr7 (+) strand
92%
to 4f18 probable ortholog, 4f18 is also distant from the 4f cluster in mouse
18197903
MPLLSLSWLGLGHTAASPWLLLLLVGASCLLAYILPQVYAVFEN
SRRLRRFPQPPTRNWLFGHLGL 18198100
18202714
IQSSEEGLLYIQSLSRTFRDVCCWWVGPWHPVIRIFHPAFIKPVILA
18202854
18203794
PASVAPKDRVFYRFLKPWL 18203850
18203937
GDGLLLSTGDKWSRHRHMLTPAFHFNILKPYVKIFNDSTNIMH 18204065
18206788
AKWQRLASQGSARLDMFEHISLMTLDSLQKCVFSFDSNCQE 18206910
18208877
KPSEYITAILELSALVARRHQSLLLYVDLFYHLTRDGMRFRKACRLVHDFTDAVIRERRR
TLPDQGGDDALKAKAKAKTLDFIDVLLLSK 18209146
18211298
DEHGEALSDEDIRAEADTFMFG 18211363
18211538
GHDTTASGLSWILYNLAKHPEYQERCRQEVRELLRDREPEEIEW 18211669
18222463
DDLAQLPFLTMCIKESLRLHPPATAISRCCTQDIMLPDGRVIPK 18222594
18222676
GVICRISIFGTHHNPAVWPDPE 18222741
18223428
VYNPFRFDADNGEGRSPLAFIPFSAGP 18223508
18223827
RNCIGQTFAMSEMKVALALTLLRFRVLPDDKEPRRKPELILRAEGGLWLRVEPLSAGAH 18224003
>CYP4V
XM_341440 extra intron in the middle removed, CK366141.1 EST at boundary
there
is a gc-at boundary. 92% to mouse 4v3
MLWLWLGLSGQKLLLWGAASAVSVAGATVLLNILQMLVSYARKW
QQMRPIPSVARAYPLVGHALFMKPNNTEFFQQIIQYTEEFRHLPIIKLWIGPVPLVAL
YKAENVEVILTSSKQIDKSFMYKFLQPWLGLGLLTS
(2)
TGSKWRARRKMLTPSFHFTILEDFLDVM
NEQANILVNKLEKHVNQEAFNCFFPITLCALDIICETAMGKNIGAQSNGDSEYVRTVY
RMSDMIYRRMKMPWFWFDLWYLMFKEGRDHKKGLKSLHTFTNNVIAERVNARKAEQDC
IGAGRGPLPSKTKRKAFLDLLLSVTDEEGNKLSHEDIREEVDTFMFEGHDTTAAAINW
SLYLLGSNPEVQRKVDKELDDVFGRSHRPVTLEDLKKLKYLDCVIKETLRVFPSVPLF
ARSLSEDCEVAGYKISKGTEAVIIPYALHRDPRYFPDPEEFQPERFFPENSQGRHPYA
YVPFSAGPRNCIGQKFAVMEEKTILACILREFWIESNQKREELGLAGDLILRPNNGIW
IKLKRRHEDDP
>CYP4X1
AF439343, NM_145675
MEASWLENRWARPLHLALVFCLALVLMQAVKLYLRRQRLLRDLR
PFPGPTAHWLLGHQKFLQEDNMEKLDEIVKEYPCAFPCWVGPFQAFFYIYDPDYAKIF
LSRTDPKTQYLHQLMTPFLGRGLLNLDGPRWFQHRCLLTPAFHQDILKPCVDMMAHSV
NMMLDKWEKTWTTQETTIEVFEHINLMTLDIIMKCAFGQETNCQINGTYESYVKATFE
LGEIISSRLYNFWHHHDIIFKLSPKGHCFQELGKVIHQCTEKIIQDRKKTLKDQVNQD
DTQTSQNFLDIVLSAQAGDEKAFSDADLRSEVNTFMWAGHDASAASISWLLYCLALNP
EHQDRCRTEIRSILGDGSSITWEQLDEIPYTTMCIKETLRLIPPIPSISRELSKPLTL
PDGHSLPAGMTVVLSIWGLHHNPAVWKDPKVFDPLRFTKENSEQRHPCAFLPFSSGPR
NCIGQQFAMLELKVAIALTLLRFRVAADLTRPPAFSSHTVLRPKHGIYLHLKKLPEC
>CYP5A1
D28773
MEVLGLLKFEVSGTVVTVTLSVVLLALLKWYSTSAFSRLRKLGI
RHPEPSPFVGNLMFFRQGFWESHLELRERYGPLCGYYLGRRMYIVISDPDMIKEVLVE
NFSNFSNRMASGLEPKLIADSVLMLRDRRWEEVRGALMSAFSPEKLNEMTPLISQACE
LLLSHLKHSAASGDAFDIQRCYCCFTTNVVASVAFGIEVNSQDAPEDPFVQHCQRVFA
FSTPRPLLALILSFPSIMVPLARILPNKNRDELNGFFNTLIRNVIALRDKQTAEERRG
DFLQMVLDAQRSMSSVGVEAFDMVTEALSSAECMGDPPQRCHPTSTAKPLTVDEIAGQ
AFLFLIAGHEITTNTLSFITYLLATHPECQERLLKEVDLFMEKHPAPEYCNLQEGLPY
LDMVVAETLRMYPPAFRFTREAAQDCEVLGQHIPAGSVLEIAVGALHHDPEHWPNPET
FDPERFTAEARLQQKPFTYLPFGAGPRSCLGVRLGLLVVKLTLLQVLHKFRFEACPET
QVPLQLESKSALCPKNGVYVKIVSR
>CYP7A1
J05460
MMTISLIWGIAVLVSCCIWFIVGIRRRKAGEPPLENGLIPYLGC
ALKFGSNPLEFLRANQRKHGHVFTCKLMGKYVHFITNSLSYHKVLCHGKYFDWKKFHY
TTSAKAFGHRSIDPNDGNTTENINNTFTKTLQGDALCSLSEAMMQNLQSVMRPPGLPK
SKSNAWVTEGMYAFCYRVMFEAGYLTLFGRDISKTDTQKALILNNLDNFKQFDQVFPA
LVAGLPIHLFKTAHKAREKLAEGLKHKNLCVRDQVSELIRLRMFLNDTLSTFDDMEKA
KTHLAILWASQANTIPATFWSLFQMIRSPEAMKAASEEVSGALQSAGQELSSGGSAIY
LDQVQLNDLPVLDSIIKEALRLSSASLNIRTAKEDFTLHLEDGSYNIRKDDMIALYPQ
LMHLDPEIYPDPLTFKYDRYLDESGKAKTTFYSNGNKLKCFYMPFGSGATICPGRLFA
VQEIKQFLILMLSCFELEFVESQVKCPPLDQSRAGLGILPPLHDIEFKYKLKH
>CYP7B1
XM_342218, U36992
MEGATTPDAASPGPLSLLGLLFAVTLLLPVLFLLTRRTRRPCEP
PLIKGWIPYLGMALKFWKDPLAFLQTLQRQYGDTFTVLLGGKYITFVLNPFQYQYVMK
NPKQLSFEKFSRRLSAKAFSVKKLLTDDDLSNDIHRGYLLLQGKSLDGLLETMIQEVK
EIFESRLLKLTDWNTARVFDFCSSLVFEITFTTIYGKILAANKKQIISELRDDFLKFD
DHFPYLVSDIPIQLLRNAEFMQKKIIKCLTPEKVAQMQRRSEIVQERQEMLKKYYGHE
EFEIGAHHLGLLWASLANTIPAMFWAMYYLLQHPEAMEVLRDEIDSFLQSTGQKKGPG
ISVHFTREQLDSLVCLESAILEVLRLCSYSSIIREVQEDMDFSSESRSYRLRKGDFVA
VFPPMIHNDPEVFDAPKDFRFDRFVEDGKKKTTFFKGGKKLKSYIIPFGLGTSKCPGR
YFAINEMKLLVIILLTYFDLEVIDTKPIGLNHSRMFLGIQHPDSDISFRYKAKSWRS
>CYP8A1
U53855 Rn.73051
MSWAALLGLLAVLLLLLLLLSRRRARRPGEPPLDLGSIPWLGHA
LEFGKDAASFLTRMKEKHGDIFTVLVGGRYVTVLLDPHSYDTVVWDLRTRLDFHPYAI
FLMERIFDLQLPNFNPSEEKARMKPTLMHKDLQALTEAMYTNLRTVLLGDSTEGGSGW
QEKGLLEFSYSSLLSAGYLTLYGVEASPRTHESQALDRDHSADVFRTFRQLDLMLPKL
ARGSLSVGDKDHACSVKSRLWKLLSPAGLASRADRSSWLESYLRHLEEMGVSEDMQAR
ALVLQLWATQGNMGPTAFWLLLFLLKNPEALDAVHAELKRIVWQAEKPVLQMTALPQK
ILDSMPVLDSVLNETLRLTAAPFITREVMADLALPMADRREFSLRRGDRLLLFPFLSP
QKDPEIYTEPEVFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNQCLGKSYAIN
SIKQFVVLLLTHFDLELVSEDTEVPEFDLSRYGFGLMQPEEDVPIRYRTRL
>CYP8B1
NM_031241, AB009686
MLWGSVLGALLMAVGCLCLSLLPRHRRPWEPPLDKGFVPWLGHT
MAFRKNMFEFLKGMRAKHGDVFTLQLGGQYFTFVMDPLSFGPIIKSTQKVLDFVTYAR
ELVFKVFGYQSMDEDHQMLHVASTKHLMGQGLEDLNRAMLDSLSLVMLGPKGRSLGAR
SWCEDGLFHFCYSILFKAGFLSLFGCTKDKEQDLDEADELFRKFRRFDLLFPRFVYSL
LGPLEWVEVSQLQRLFHQRLSVEQNLEKDGISNWLGFMLRFLRERGMASSMQDKFNFM
MLWASQGNTGPTCFWALLFLLKHQDAMKAVREEATRVLGEARLEAETSFAFTLSALKC
TPVLDSVMEETLRLCATPTLLGVVQEDYVLKMASGQEYQIRRGDKVALFPYLSVHMDP
DIHPEPTTFKYNRFLNPDGTRKVDFYKSGKKIHHYNMPWGSGVSICPGRFFAPSEMKT
FVLLMVMYFDFELVDPDMPVPPIDPRRWGFGTSQPSHEVRFRYRLKPMQ
>CYP11A1
J05156
MLAKGLCLRSVLVKSCQPFLSPVWQGPGLATGNGAGISSTNSPR
SFNEIPSPGDNGWINLYHFLRENGTHRIHYHHMQNFQKYGPIYREKLGNMESVYILDP
KDAATLFSCEGPNPERYLVPPWVAYHQYYQRPIGVLFKSSDAWRKDRIVLNQEVMAPD
SIKNFVPLLEGVAQDFIKVLHRRIKQQNSGKFSGDISDDLFRFAFESITSVVFGERLG
MLEEIVDPESQRFIDAVYQMFHTSVPMLNMPPDLFRLFRTKTWKDHAAAWDVIFSKAD
EYTQNFYWDLRQKRDFSKYPGVLYSLLGGNKLPFKNIQANITEMLAGGVDTTSMTLQW
NLYEMAHNLKVQEMLRAEVLAARRQAQGDMAKMVQLVPLLKASIKETLRLHPISVTLQ
RYIVNDLVLRNYKIPAKTLVQVASYAMGRESSFFPNPNKFDPTRWLEKSQNTTHFRYL
GFGWGVRQCLGRRIAELEMTIFLINVLENFRIEVQSIRDVGTKFNLILMPEKPIFFNF
QPLKQDLGSTMPRKGDTV
>CYP11B1
pseudogene? XM_343261
MALRVTADVWLARPWQCLHRTRALGTTAKVAPKTLKPFEAIPQY
SRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQ
VESILPHRMPLEPWVAHRELRGLRRGVFLL
gap
then two of the next exon
GHDLYPESLKFTHALHSMFTSTTQLILLPKSLTRWTSTQVWKGHFESWDIISEY
GHDLYPESLKFTHALHSMFTSTTQLILLPKSLTRWTSTQVWKGHFESWDIISEY
SHKCIKNVYRELAEGRQKSWSVISEMVAQSTLSMDAIHANSMEIIAEVLTR
TAISLVMTLFELARNPDVQQALQQESLAAEASIAANPQKAISDLPLLRAALKETLR
LYPVGSYLERILNSDLVLQNYHVPAGTFVIIYLYSMGRNPAVFPRPERYMPQRWLERKRS
FQHLAFGFGVRQCLGRRLAEVEMLLLLHH
MPKSFQVETQEKEDVQMAYRFILMPSSIPLLTFRPVS
>CYP11B1
X15431
MALRVTADVWLARPWQCLHRTRALGTTAKVAPKTLKPFEAIPQY
SRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQ
VESILPHRMPLEPWVAHRELRGLRRGVFLL
NGADWRFNRLQLNPNMLSPKAIQSFVPF
VDVVARDFVENLKKRMLENVHGSMSINIQSNMFNYTMEASHFVISGERLGLT GHDLKP
ESVTFTHALHSMFKSTTQLMFLPKSLTRWTSTRVWKEHFDSWDIISEY
VTKCIKNVYR
ELAEGRQQSWSVISEMVAQSTLSMDAIHANSMELIA GSVDT TAISLVMTLFELARNPD
VQQALRQESLAAEASIVANPQKAMSDLPLLRAALKETLRLYPVGSFVERIVHSDLVLQ
NYHVPAGTFVIIYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFGVRQCLGRR
LAEVEMLLLLHH
MLKTFQVETLRQEDMQMVFRFLLMPSSSPFLTFRPVS
>CYP11B2
D00567
MGACDNDFIELHSRVTADVWLARPWQCLHRTRALGTTATLAPKT
LKPFEAIPQYSRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVM
LPEDAEKLHQVESILPRRMHLEPWVAHRELRGLRRGVFLLNGAEWRFNRLKLNPNVLS
PKAVQNFVPMVDEVARDFLEALKKKVRQNARGSLTMDVQQSLFNYTIEASNFALFGER
LGLLGHDLNPGSLKFIHALHSMFKSTTQLLFLPRSLTRWTSTQVWKEHFDAWDVISEY
ANRCIWKVHQELRLGSSQTYSGIVAALITQGALPLDAIKANSMELTAGSVDTTAIPLV
MTLFELARNPDVQQALRQETLAAEASIAANPQKAMSDLPLLRAALKETLRLYPVGGFL
ERILNSDLVLQNYHVPAGTLVLLYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAF
GFGVRQCLGRRLAEVEMLLLLHHMLKTFQVETLRQEDVQMAYRFVLMPSSSPVLTFRP
IS
>CYP11B3
U14907
MALRVTADVWARPWQCLHRTRALGSTATQAPKTLKPFEAIPQYS
RNKWLKMIQILREQSQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQV
ESILPRRMTLESWVAHRELRGLRRGVFLLNGADWRFNRLQLNPNMLSPKAVQSFVPFV
DVVARDFVENLKKRMLENVHGSMSMDIQSNVFNYTMEASHFVISGERLGLTGHDLNPE
SLKFIHALHSMFKSTTQLMFLPKNLTRWTSTQVWKGHFESWDIISEYVTKCIKNVYRE
LAEGRQQSWSVISEMVAQSTLSMDAIHANSMELIAGSVDTTAISLVMTLFELARNPDV
QQALRQESLAAEASIAANPQKAMSDLPLLRAALKETLRLYPIGSSLERIVDSDLVLQN
YHVPAGTLVIIYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFGVRQCLGRRL
AEVEVLLLLHHMLKIFQVETLRQEDVQMAYRFVLMPNPRLVLTIRPVS
>CYP17A1
M31681
MWELVGLLLLILAYFFWVKSKTPGAKLPRSLPSLPLVGSLPFLP
RRGHMHVNFFKLQEKYGPIYSLRLGTTTTVIIGHYQLAREVLIKKGKEFSGRPQMVTQ
SLLSDQGKGVAFADAGSSWHLHRKLVFSTFSLFKDGQKLEKLICQEAKSLCDMMLAHD
KESIDLSTPIFMSVTNIICAICFNISYEKNDPKLTAIKTFTEGIVDATGDRNLVDIFP
WLTIFPNKGLEVIKGYAKVRNEVLTGIFEKCREKFDSQSISSLTDILIQAKMNSDNNN
SCEGRDPDVFSDRHILATVGDIFGAGIETTTTVLKWILAFLVHNPEVKKKIQKEIDQY
VGFSRTPTFNDRSHLLMLEATIREVLRIRPVAPMLIPHKANVDSSIGEFTVPKDTHVV
VNLWALHHDENEWDQPDQFMPERFLDPTGSHLITPTQSYLPFGAGPRSCIGEALARQE
LFVFTALLLQRFDLDVSDDKQLPRLEGDPKVVFLIDPFKVKITVRQAWMDAQAEVST
>CYP19A1
M33986
MFLEMLNPMHYNVTIMVPETVPVSAMPLLLIMGLLLLIRNCESS
SSIPGPGYCLGIGPLISHGRFLWMGIGSACNYYNKMYGEFMRVWISGEETLIISKSSS
MVHVMKHSNYISRFGSKRGLQCIGMHENGIIFNNNPSLWRTVRPFFMKALTGPGLIRM
VEVCVESIKQHLDRLGDVTDNSGYVDVVTLMRHIMLDTSNTLFLGIPLDESSIVKKIQ
GYFNAWQALLIKPNIFFKISWLYRKYERSVKDLKDEIEILVEKKRQKVSSAEKLEDCM
DFATDLIFAERRGDLTKENVNQCILEMLIAAPDTMSVTLYVMLLLIAEYPEVETAILK
EIHTVVGDRDIRIGDVQNLKVVENFINESLRYQPVVDLVMRRALEDDVIDGYPVKKGT
NIILNIGRMHRLEYFPKPNEFTLENFEKNVPYRYFQPFGFGPRSCAGKYIAMVMMKVV
LVTLLKRFHVKTLQKRCIENMPKNNDLSLHLDEDSPIVEIIFRHIFNTPFLQCLYISL
>CYP20A1
NM_199401 XM_237189 BC061716.1
MLDFAIFAVTFLLALVGAVLYLYPASRQASGIPGLTPTEEKDGN
LPDIVNSGSLHEFLVNLHGRYGPVVSFWFGRRLVVSLGTADALKQHFNPNKTLDPFET
MLKSLLGYRSGAGSGSEDHVRRRLYGDAVTAALQSNFPLLLKLSEELLDKWLSYPETQ
HIPLSQHMLGFALKFVTRMVLGDTFEGEQEVIRFQKIHGTVWSEIGKGFLDGSLDKNT
TRKNQYQEALMQLEAILKKIIKERKGGDFSQHTFIDSLVQRNLNEQQILEDSVVFSLA
GCIVTARLCTWAIHFLTTAEEVQKKLHKEVDHVLGKGPITSEKIEQLRYCQQVLCETV
RTAKLTPVSAQLQDIEGKVGPFIIPKETLVLYALGVVLQDASTWPSPHKFDPDRFADE
PVMKVFSSLGFSGTWECPELRFAYVVTTVLVSVLLKKLHLLAVDRQVFEMKYELVTSC
REETWITVSERH
>CYP21
U56853
MLLPGLLLLLLLLLLAGTRWLWGQWKLWKLRLPPLAPGFLHFLQ
PNLPVYLFGLAQKLGPIYRIRLGLQDVVVLNSNKTIEEALIQKWVDFAGRPQILDGKM
NFDLSMGDYSLTWKAHKKLSRSALVLGMRDSMEPLVEQLTQEFCERMRAQAGASVAIH
KEFSLLTCSIISCLTFGDKQDSTLLNATHSCVRDLLKAWNHWSVQILDIIPFLRFFPN
PGLWKLKQFQESRDHIVMQELKRHKDSLVAGQWKDMIDYMLQGVEKQRDARDPGQLHE
RHVHMSVVDLFVGGTETTAATLSWAVAFLLHHPEIQKRLQEELDLKLAPSSQLLYKNR
MQLPLLMATIAEVLRLRPVVPMALPHRATKASSISGYDIPKDTIIIPNIQGANLDEMV
WELPSKFWPDRFLESGKSPRIPTFGCGARVCLGEPLARLEFFVVLARLLQTFTLLPPP
DGTLPSLQPLPYTGINLLIPPFQVRLQPRNLAPQDQGQKSSTG
>CYP21-ps
pseudogene fragment AY091789
LFGLAQKLGPIYRIAWG
DAVVLNSNKTIEEALIQKWVDFTG*PQILDGK
>CYP24A1
X59506
MSCPIDKRRTLIAFLRRLRDLGQPPRSVTSKASASRAPKEVPLC
PLMTDGETRNVTSLPGPTNWPLLGSLLEIFWKGGLKKQHDTLAEYHKKYGQIFRMKLG
SFDSVHLGSPSLLEALYRTESAHPQRLEIKPWKAYRDHRNEAYGLMILEGQEWQRVRS
AFQKKLMKPVEIMKLDKKINEVLADFLERMDELCDERGRIPDLYSELNKWSFESICLV
LYEKRFGLLQKETEEEALTFITAIKTMMSTFGKMMVTPVELHKRLNTKVWQAHTLAWD
TIFKSVKPCIDNRLQRYSQQPGADFLCDIYQQDHLSKKELYAAVTELQLAAVETTANS
LMWILYNLSRNPQAQRRLLQEVQSVLPDNQTPRAEDLRNMPYLKACLKESMRLTPSVP
FTTRTLDKPTVLGEYALPKGTVLTLNTQVLGSSEDNFEDSHKFRPERWLQKEKKINPF
AHLPFGIGKRMCIGRRLAELQLHLALCWIIQKYDIVATDNEPVEMLHLGILVPSRELP
IAFRPR
>CYP26A1
AF439720, NM_130408 Chr1 1Mb upstream of CYP2C cluster
242138769
MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPLPPG
TMGFPFFGETLQMVLQ (0) 242138581
242138389
RRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKK (0) 242138165
242137906
VIMQAFNREALQCYVPVIAEEVSGCLEQWLSCGERGLLVYPEV
KRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR
(0) 242137616
242137537
GVKPRNLIHARIEENIRAKIRRLQAAERNAGCKDALQLLIEHSWERGERLDMQ
(0) 242137379
242136717
ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSK (0) 242136583
242136000
GLLCKSHHEDKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELN (0) 242135848
242135595
GYQIPKGWNVIYSICDTHDVADSFTNKEEFNPDRFTSLHPEDTSRFSFIPFGGGLRSCRSKEFAKI
LLKIFTVELARRCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFQGDI* 242135254
>CYP26B1
AY245532, NM_181087
MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKS
CKLPIPKGSMGFPLIGETGHWLLQGSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENV
RKILLGEHQLVSTEWPRSARVLLGPNTVANSIGDIHRNKRKVFSKIFSHEALESYLPK
IQLVIQDTLRAWSSQPEAINVYQEAQRLTFRMAVRVLLGFSIPEEDLGNLFEVYQQFV
ENVFSLPVDLPFSGYRRGIQARQILQKGLEKAVREKLQCTQGKDYSDALDILIESSKE
HGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPAVLEKLREELRAQGLLHG
GGCPCEGTLRLDMLSGLRYLDCVIKEVMRLFTPVSGGYRTVLQTFELDGFQIPKGWSV
MYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLF
LKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAML
SATV
>CYP26C1
XM_217935 94% TO 26C1 MOUSE
718
MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKGSMGWPFFG 897
898
ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0)
VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVA 1257
1258
VYQAAKALTFRMAARILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK 1422
1705
GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0)
1888
ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP 2064
2065
DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYS 2244
2245
IRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYIPFGGGARSCLGQELAQAVLQ 2424
2425
LLAVELVRTARWELATPAFPVMQTVPIVHPVDGLLLLFHPLPTLGAGDGSPF* 2583
>CYP26C1
XM_217935 94% TO 26C1 MOUSE Chr1 1Mb upstream of CYP2C cluster
242151281
MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKG
SMGWPFFGETLHWLVQ (0) 242151079
242150553
GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0) 242150422
242149883
VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVAVYQAAKALTFRMAAR
ILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK (0) 242149608
242148160
GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0) 242148005
242146368
ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP
DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) 242146051
242144220
GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYI
PFGGGARSCLGQELAQAVLQLLAVELVRTARWELATPAFPVMQTVPIVHPVD
GLLLLFHPLPTLGAGDGSPF* 242143843
>CYP27A1
M38566
MAVLSRMRLRWALLDTRVMGHGLCPQGARAKAAIPAALRDHEST
EGPGTGQDRPRLRSLAELPGPGTLRFLFQLFLRGYVLHLHELQALNKAKYGPMWTTTF
GTRTNVNLASAPLLEQVMRQEGKYPIRDSMEQWKEHRDHKGLSYGIFITQGQQWYHLR
HSLNQRMLKPAEAALYTDALNEVISDFIARLDQVRTESASGDQVPDVAHLLYHLALEA
ICYILFEKRVGCLEPSIPEDTATFIRSVGLMFKNSVYVTFLPKWSRPLLPFWKRYMNN
WDNIFSFGEKMIHQKVQEIEAQLQAAGPDGVQVSGYLHFLLTKELLSPQETVGTFPEL
ILAGVDTTSNTLTWALYHLSKNPEIQEALHKEVTGVVPFGKVPQNKDFAHMPLLKAVI
KETLRLYPVVPTNSRIITEKETEINGFLFPKNTQFVLCTYVVSRDPSVFPEPESFQPH
RWLRKREDDNSGIQHPFGSVPFGYGVRSCLGRRIAELEMQLLLSRLIQKYEVVLSPGM
GEVKSVSRIVLVPSKKVSLRFLQRQ
>CYP27B1
AB001992
MTQAVKLASRVFHRVQLPSQLGSDSVLRSLSDIPGPSTPSFLAE
LFCKGGLSRLHELQVHGAARYGPIWSGSFGTLRTVYVADPALVEQLLRQESHCPERCS
FSSWSEHRRRHQRACGLLTADGEEWQRLRSLLAPLLLRPQAAAGYAGTLDSVVSDLVR
RLRRQRGRGSGLPDLVLDVAGEFYKFGLEGIGAVLLGSRLGCLEAEVPPDTETFIEAV
GSVFVSTLLTMAMPSWLHRLIPGPWARLCRDWDQMFAFAQKHVEQREGEAAVRNQGKP
EEDLPTGHHLTHFLFREKVSVQSIVGNVTELLLAGVDTVSNTLSWALYELSRHPEVQS
ALHSEITGAVNPGSYAHLQATALSQLPLLKAVIKEVLRLYPVVPGNSRVPDRDICVGN
YVIPQDTLVSLCHYATSRDPAQFREPNSFNPARWLGEGPAPHPFASLPFGFGKRSCIG
RRLAELELQMALAQILTHFEVLPEPGALPVKPMTRTVLVPERSIHLQFVDR
>CYP39A1
XM_236983 (INCORRECT END)
AC107523.4
MGIMELFSPIAIAVLGSCVLFLFSRWKNLRGPPCIQGWIPWIGA
GFEFGKAPLEFIEKARIKYGPVFTVFAVGKRMTFVTEEEGINVLLKSKHVDFELAVQR
PLYHTAWIPKNIFFALHEKLYVLMKGKMGTFNTHHFTGQLTEEFHDQLEGLGTHGTMD
LNDFVRYLLYPATLNTLFMKGLFLTDKRKIKEFYQHFKTYDEGFEYGSQLPEWLLRNW
SKSKRWLLALFEKNIGDIKTHGSAGHSETLLQAVLGMVETETRLHSPNYGLVMLWASL
ANAAPIAFWTLAYILSHPDLHRTIVESISSVFGTAGKDKIQVSENDLKKLLLIKWCIL
ESIRLRAPGVITRKVVKPVKILNHTVPSGDLLMLSPFWLHRNPKYFPEPESFKPERWK
EANLDKYIFLDYFMAFGGGKFQCPGR
147499
WFALLEIQLCIILVLYKYECSLLDPLPKQA (1) 147410
146814
SLHLVGVPQPAGKCRIEYKQRV* 146746
>CYP46A1
XM_343108
MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPSFLLG
HLPYFWKKDEACGRVLQDVFLDWAKKYGPVVRVNVFHKTSVIVTSPESVKKFLMSTKY
NKDSKMYRAIQTVFGERLFGQGLVSECDYGRWYKQRRVMDLAFSRSSLVSLMGTFNEK
AEQLMEILEAKADGQTPVSMQDMLTCATIDILAKAAFGMETSMLLGAQKPLSQAVKVM
LEGISASRNTLAKFMPGKRKQLREIRESIRLLRQVGKDWVQRRREALKRGEDVPADIL
TQILKAEEGAQDDEVLLDNFVTFFIAGHETSANHLAFTVMELSRQPEIVARLQAEVDE
VVGSKRHLDYEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL
FSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQMEVKVV
MAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC
>CYP51A1
U17697
MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAKSP
PYIYSPIPFLGHAIAFGKSPIEFLENAYEKYGPVFSFTMVGKTFTYLLGSDAAALLFN
SKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAVFLEQKKILKSGLNIAHFKQYVSIIEK
EAKKYFKSWGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFS
HAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDG
RPLTDDEIAGMLIGLLLAGQHTSSTTSAWMGFFLARDKPLQDKCYLEQKTVCGEDLPP
LTYEQLKDLNLLDRCIKETLRLRPPI MTMMRMAKTPQTV AGYTIPPGHQVCVSPTVNQ
RLKDSWVERLDFNPDRYLQDNPASGEKFAYVPFGAGRHRCIGENFAYVQIKTIWSTML
RLYEFDLINGYFPSVNYTTMIHTPENPVIRYKRR SK
>CYP51P1 pseudogene D87997 92% to CYP51A1 rat
398
MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAESPPCIYSPIPFLGHAI
FGKSPIEFLENAYEKSGPVFSFTMVSKTFTYLLGSDAAALL 696
697 FNSKNEDLNAEEVYGRLTTPVFGKXXXXXXX 768
788
NAVFLEHKKILKSGLNIAHFKQYVSITEKEAKEYFKSWGESGERNVFEALSELIILTASH 967
968
CLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAI 1147
1148
QKRRLSKEPAEDILQTLLDSTYKDGRPLTDDVIAGMLIGLLLAGXXXXXX
1281
TSAWMGFFLVRDKPLQGKCYLEQKAVCGEDLPPLTYE*LKDLNLLDRCIKE 1433
TLRLRPPIMTMMRMAKTPQ
1489
1490
NVAGCTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQGNPASGEKFAYVPFGAGRHH 1669
1670
CIGENFAYVQIKTIWSTMLHLYEFDLINGYFLSVNYTTMIHTPENPVIRYKMK 1828
>CYP51P1 pseudogene XM_234202
MEQVTGGNLLSTLLIACAFTLSLVYLF
RLAVGHMVQLPAGAESPPCIYSPIPFLGHAIAFGKSPIEFLENAYEKSGPVFSFTMVG
KTFTYLLGSDAAALLFNSKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAVFLEHKKILK
SGLNIAHFKQYVSITEKEAKEYFKSWGESGERNVFEALSELIILTASHCLHGKEIRSQ
LNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKE
PAEDILQTLLDSTYKDGRPLTDDVIAGMLIGLLLAGHSAWMGFFLVRDKPLQGKCYLE
QKAVCGEDLPPLTYE*LKDLNLLDRCIKETLRLRPPIMTMMRMAKTPQNVAGCTIPPG
HQVCVSPTVNQRLKDSWVERLDFNPDRYLQGNPASGEKFAYVPFGAGRHHCIGENFAY
VQIKTIWSTMLHLYEFDLINGYFLSVNYTTMIHTPENPVIRYKMK
>CYP51P2 pseudogene D78370
MEQVTGGNLLSTLLIACAFTLSLV
(fs)
NLFRLAVGHMVQLHAGAESPPCIY
SPIPFLGHRIAFGKSPIEFLENAYEKSGPVFSFTMVDKTFTYLLGSDAAALLFNSKNEDL
NAEEVYGRLTTPVFGKGVAYDVPNADFLEHKKLLKSGLNIAHFKQYVSITEKEAKEYFKS
WGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWL
PLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDGRPLTDDVIAGML
IGLLLAGHSAWMGFFLVRDKPLQGKCYLEQKAVCGEALHPLTYE*LKDLNLLDRCIKETL
RLESPPI
(fs)
VTM
(fs)
VRIGQAPSRMCAGCTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQ
GNPASGEKFAYVPFGAGRHHCIGENFAYVQIKTIWSTMLHLYEFDLINGYFLSVNYTTMI
HTPENPVIRYKMK