Rat cytochrome P450s
108 Rat P450 sequences. This is a beginning of a revision of the rat P450s.
I am currently looking for more members in the 7 gene clusters seen in mouse.
The April 1, 2004 Nature issue on the rat genome had a figure showing 84 P450s
in the rat on a tree diagram. I am looking for these and more. There are some
major nomenclature problems due to naming the rat genes for the closest match in
the database, usually a mouse gene. This will not work if there is not an
orthologous relationship. Many of the names in Genbank will need to be changed.
The 4F gene cluster appears to be conserved with all 9 functional genes occurring
in the same order and orientation as in the mouse 4f cluster. 4F5 is the ortholog of
4f16, 4F4 is the ortholog of 4f15, 4F1 is the ortholog of 4f14 and 4F6 is the ortholog
of 4f13. The other new rat genes (4F39, 4F17, 4F37, 4F40 and 4F18) will be named
for their ortholog in the mouse. The pseudogenes are not conserved.
Gene order and orientation(+/-) is: 4F39+, 4F17+, 4F5/4f16+, 4F37+, 4F40+,
4F4/4f15+, 4F1/4f14-, 4F6/4f13-, 4F18+
The CYP2ABFGST cluster has 14 full length genes, one complete pseudogene with
a few splice site errors (2B16P) and 9 small pseudogene fragments.
The gene order is:
2S1-, 2B1+, 2B2+, 2B3+, 2B16P+, 2B14P+, 2B21-, 2B12+, 2BNEW+, 2B15+, 2G1+,
2A3+, 2ANEW+, 2A2+, 2F4+, 2T1+
Only 2S1 and 2B21 are oriented opposite to the cluster major orientation (+).
2b23 in mouse is also (-) and these two appear to be in orthologous locations, so
the orientation may be preserved. In the mouse 2a22 is oriented opposite to the
other genes, but it was on a small contig that might be incorrectly oriented.
The rat has three genes between 2B21 and 2G1. The mouse has 2b19 in this
location, so the rat may have expanded the 2b19 gene to three genes. If we assume
this is correct, there is a reasonable orthologous relationship of genes in the rat and mouse clusters.
2S1/2s1, 2B1/2b10, 2B2/2b13, 2B3/2b9, 2B21/2b23, 2B12/2b19, 2BNEW/2b19,
2B15/2b19, 2G1/2g1, 2A3/2a5, 2ANEW/2a22, 2A2/2a12, 2F4/2f2, 2T1/2t4.
Last modified March 30, 2007
D. Nelson
>Cyp1a1 M26129,X00469
MPSVYGFPAFTSATELLLAVTTFCLGFWVVRVTRTWVPKGLKSP
PGPWGLPFMGHVLTLGKNPHLSLTKLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVK
QGDDFKGRPDLYSFTLIANGQSMTFNPDSGPLWAARRRLAQNALKSFSIASDPTLASS
CYLEEHVSKEAEYLISKFQKLMAEVGHFDPFKYLVVSVANVICAICFGRRYDHDDQEL
LSIVNLSNEFGEVTGSGYPADFIPILRYLPNSSLDAFKDLNKKFYSFMKKLIKEHYRT
FEKGHIRDITDSLIEHCQDRRLDENANVQLSDDKVITIVFDLFGAGFDTITTAISWSL
MYLVTNPRIQRKIQEELDTVIGRDRQPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH
STIRDTSLNGFYIPKGHCVFVNQWQVNHDQELWGDPNEFRPERFLTSSGTLDKHLSEK
VILFGLGKRKCIGETIGRLEVFLFLAILLQQMEFNVSPGEKVDMTPAYGLTLKHARCE
HFQVQMRSSGPQHLQA
>Cyp1a2 K02422
MAFSQYISLAPELLLATAIFCLVFWVLRGTRTQVPKGLKSPPGP
WGLPFIGHMLTLGKNPHLSLTKLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVKQGD
DFKGRPDLYSFTLITNGKSMTFNPDSGPVWAARRRLAQDALKSFSIASDPTSVSSCYL
EEHVSKEANHLISKFQKLMAEVGHFEPVNQVVESVANVIGAMCFGKNFPRKSEEMLNL
VKSSKDFVENVTSGNAVDFFPVLRYLPNPALKRFKNFNDNFVLSLQKTVQEHYQDFNK
NSIQDITGALFKHSENYKDNGGLIPQEKIVNIVNDIFGAGFETVTTAIFWSILLLVTE
PKVQRKIHEELDTVIGRDRQPRLSDRPQLPYLEAFILEIYRYTSFVPFTIPHSTTRDT
SLNGFHIPKECCIFINQWQVNHDEKQWKDPFVFRPERFLTNDNTAIDKTLSEKVMLFG
LGKRRCIGEIPAKWEVFLFLAILLHQLEFTVPPGVKVDLTPSYGLTMKPRTCEHVQAW
PRFSK
>Cyp1b1 U09540
MATSLSADSPQQLSSLSTQQTILLLLVSVLAIVHLGQWLLRQWR
RKPWSSPPGPFPWPLIGNAASVGRASHLYFARLARRYGDVFQIRLGSCPVVVLNGESA
IHQALVQQGGVFADRPPFASFRVVSGGRSLAFGHYSERWKERRRAAYGTMRAFSTRHP
RSRGLLEGHALGEARELVAVLVRRCAGGACLDPTQPIIVAVANVMSAVCFGCRYNHDD
AEFLELLSHNEEFGRTVGAGSLVDVMPWLQLFPNPVRTIFREFEQINRNFSNFVLDKF
LRHRESLVPGAAPRDMMDAFILSAEKKATGDPGDSPSGLDLEDVPATITDIFGASQDT
LSTALLWLLILFTRYPDVQARVQAELDQVVGRDRLPCMSDQPNLPYVMAFLYESMRFT
SFLPVTLPHATTANTFVLGYYIPKNTVVFVNQWSVNHDPAKWSNPEDFDPARFLDKDG
FINKALASSVMIFSVGKRRCIGEELSKTLLFLFISILAHQCNFKANQNEPSNMSFSYG
LSIKPKSFKIHVSLRESMKLLDSAVEKLQAEEACQ
>Cyp2a1-de2b frag e in 2abfgst cluster map exon 2 pseudogene, Chr1 (-) only 240bp from Cyp2a1 start Met
82084718 YNAVKEALVDQAEGFSGQGEQA 82084653
>Cyp2a1 NP_036824, 88% T0 2A2 chr1 (+) Cyp2a22 ortholog
82084958 MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGNYLQLNTKDVYSSITQ 82085134
82085434 LSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGEQATYNTLFKGY 82085595
82088031 GVAFSSGERAKQLRRLSIATLRDFGVGKRGVEERILEEAGYLIKMLQGTC 82088180
82088398 GAPIDPTIYLSKTVSNVISSIVFGERFDYEDTEFLSLLQMMGQMNRFAASPTG 82088556
82089778 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82089957
82093158 EKNGNSEFHMKNLVMTTLSLFFAGSETVSSTLRYGFLLLMKHPDVE 82093295
82093737 AKVHEEIEQVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82093925
82094440 ATDVFPILGSLMTDPKFFPSPKDFDPQNFLDDKGQLKKNAAFLPFST 82094580
82098022 GKRFCLGDGLAKMELFLLLTTILQNFRFKFPMKLEDINESPKPLGFTRIIPKYTMSFMPI 82098201
>Cyp2a2-de2b exon 2 pseudogene Chr1 (-) frag f in 2abfgst cluster map
82115528 LKPHWVVVLYEWDAVKEALGDQAEELSG*GEQANL 82115445
>Cyp2a2 M34392, J04187 Cyp2a12 ortholog
82117349 MLDTGLLLVVILASLSVMFLVSLWQQKIRERLPPGPTPLPFIGNYLQLNMKDVYSSITQ 82117525
82117991 LSERYGPVFTIHLGPRRIVVLYGYDAVKEALVDQAEEFSGRGELPTFNILFKGY 82118152
82123228 GFSLSNVEQAKRIRRFTIATLRDFGVGKRDVQECILEEAGYLIKTLQGTC 82123377
82123595 GAPIDPSIYLSKTVSNVINSIVFGNRFDYEDKEFLSLLEMIDEMNIFAASATG 82123753
82124978 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82125157
82139054 EKYVNSEFHMNNLVMSSLGLLFAGTGSVSSTLYHGFLLLMKHPDVE 82139191
82139607 AKVHEEIERVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82139795
82140311 GTDVFPIIGSLMTEPKFFPNHKDFNPQHFLDDKGQLKKNAAFLPFSI 82140451
82141451 GKRFCLGDSLAKMELFLLLTTILQNFRFKFPMNLEDINEYPSPIGFTRIIPNYTMSFMPI 82141630
>Cyp2a3 M33190 J02852, NM_012542
exon 4 in a seq gap in genome seq chr1 (+) Cyp2a5 ortholog
82023007 MLASGLLLVASVAFLSVLVLMSVWKQRKLSGKLPPGPTPLPFIGNYLQLNTEKMYSSLMK 82023186
82023453 ISQRYGPVFTIHLGPRRVVVLCGQEAVKEALVDQAEEFSGRGEQATFDWLFKGY 82023614
82024296 GVAFSSGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIESFRKTN 82024445
GALIDPTFYLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTG
82026488 QLYEMFSSVMKHLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMLE 82026667
82028068 EKKNPNTEFYMKNLVLTTLNLFFAGTETVSTTLRYGFLLLMKHPDIE 82028208
82028659 AKVHEEIDRVIGRNRQAKYEDRMKMPYTEAVIHEIQRFADMIPMGLARRVTKDTKFREFLLPK 82028847
82029417 GTEVFPMLGSVLKDPKFFSNPNDFNPKHFLDDKGQFKKSDAFVPFSI 82029557
82030741 GKRYCFGEGLARMELFLFLTNIMQNFCFKSPQAPQDIDVSPRLVGFATIPPNYTMSFLSR 82030920
>Cyp2a3-de1b exon 1 pseudogene Chr1 (+)frag d in 2abfgst cluster map
82052140 MLGSRLLLVAVLSCLCVMVFMPVWQQQYRDTIPPG 82052244
>Cyp2b3-se1[9] exon 9 100% match to 2B3, chr1 (+), frag a in 2abfgst cluster map
81263180 GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR* 81263362
>Cyp2b3-se2[1] duplicate exon 1 100% match, Chr1 (-), frag b in 2abfgst cluster map
81308557 MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ 81308387
>Cyp2b1 J00719, D00250 chr1 (+) 1 aa diff to CYP2B1
81344956 MEPSILLLLALLVGFLLLLVRGHPKSRGNFPPGPRPLPLLGNLLQLDRGGLLNSFMQ 81345126
81357886 LREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY 81358047
81358200 GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQ 81358349
81360925 GAPLDPTFLFQCITANIICSIVFGERFDYTDRQFLRLLELFYRTFSLLSSFSS 81361083
81361768 QVFEFFSGFLKYFPGAHRQISKNLQEILDYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81361947
81362389 EKSNHHTVFHHENLMISLLSLFFAGTETSSTTLRYGFLLMLKYPHVA (1) 81362529
81363958 EKVQKEIDQVIGSHRLPTLDDRSKMPYTDAVIHEIQRFSDLVPIGVPHRVTKDTMFRGYLLPK 81364146
81364315 NTEVYPILSSALHDPQYFDHPDSFNPEHFLDANGALKKSEAFMPFST 81364455
81368014 GKRICLGEGIARNELFLFFTTILQNFSVSSHLAPKDIDLTPKESGIGKIPPTYQICFSAR 81368193
>Cyp2b2 J00720-J00728 Rn.91353 chr1 (+) genome has 4aa diffs with CYP2B2 mRNA, 14aa diffs to CYP2B1
81423536 MEPSILLLLALLVGFLLLLVRGHPKSRGNFPPGPRPLPLLGNLLQLDRGGLLNSFMQ (0) 81423706
81426789 FREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEY (1) 81426950
81427104 GVFFANGERWKALRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQ (1) 81427253
81429793 GAPLDPTFLFQCITANIICSIVFGERFDYTDRQFLRLLELFYRTFSLLSSFSSQ 81429954
81430659 VFEFFSGFLKYFPGAHRQISKNLQEILDYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81430835
81431274 EKSNHHTEFHHENLMISLLSLFFAGTETGSTTLRYGFLLMLKYPHVT (1) 81431414
81432829 EKVQKEIDQVIGSHRPPSLDDRTKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK 81433017
81433190 NTEVYPILSSALHDPQYFDHPDTFNPEHFLDADGTLKKSEAFMPFST (1) 81433330
81436959 GKRICLGEGIARNELFLFFTTILQNFSVSSHLAPKDIDLTPKESGIAKIPPTYQICFSAR 81437138
>Cyp2b3 M20406 chr1 (+) exon 9 not adjacent to this gene. Found at 81263180-81263359
81486567 MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ (0) 81486737
81514647 LQEKHGDVFTVYFGPRPVVMLCGTQTIREALVDHAEAFSGRGIIAVLQPIMQEY (1) 81514808
81514950 GVSFVNEERWKILRRLFVATMRDFGIGKQSVEDQIKEEAKCLVEELKNHQ (1) 81515099
81516395 GVSLDPTFLFQCVTGNIICSIVFGERFDYRDRQFLRLLDLLYRTFSLISSFSSQ (0) 81516556
81530756 MFEVYSDFLKYFPGVHREIYKNLKEVLDYIDHSVENHRATLDPNAPRDFIDTFLLHMEK (0) 81530932
81531383 EKLNHYTEFHHWNLMISVLFLFLAGTESTSNTLCYGFLLMLKYPHVA (1) 81531523
81536877 EKVQKEIDQVIGSQRVPTLDDRSKMPYTEAVIHEIQRFSDVSPMGLPCRITKDTLFRGYLLPK (0) 81537065
81537233 NTEVYFILSSALHDPQYFEQPDTFNPEHFLDANGALKKCEAFMPFSI (1) 81537373
GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR
>Cyp2b32-ps pseudogene partial Chr1 (+)
81806528 VLLLLTLIVGFLLFLVSQSQPKTHGHLPPGLCPLPFLGNLLQIKRRGLLNSFMQ 81806689
81808348 AQEKYGDVLTVHPGPRPVVRLCGTDTIREFLFDQAGTFSGQGTVAVLNPVVHGY 81808509
exon 3 missing
81809871 GVPLIPTSFFQRIAANIICSIVFGECFDYKDHQFLHLLDLIYQTFALMAPCPARS 81810035
81810759 VFQLFSGFLKYFPGVHKQISKNLQEILNYIGHSVEKHMATLDPSAPRDFINTYLLHMEN 81810935
81811666 EKSNHHTEFHHQTSVLSHFFDGTETTSTTLCCSFLIMLKYHHVK 81811797
>Cyp2b12-de9b exon 9 Chr1 (-)
81829155 GKFICLGEGIG*NESFIFFTGILQNLSLASPVAPENIDLTPIKSGAGKIPSTYQIHILSR 81829012
>Cyp2b12 X63545, S48369, NM_017156 Rn.108913 chr1 (+) 87% to 2b19 possible ortholog
81858238 MEFGVLLLLTLTVGFLLFLVSQSQPKTHGHLPPGPRPLPFLGNLLQMNRRGFLNSFMQ 81858411
81860089 LQEKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGY 81860250
81860393 GVIFATGERWKTLRRFSLVTMKEFGMGKRSVDERIKEEAQCLVEELKKYK 81860542
81860739 GAPLNPTFLFQSIAANTICSIVFGERFDYKDHQFLHLLDLVYKTSVLMGSLSSQ 81860900
81861646 VFELYSGFLKYFPGAHKQIFKNLQEMLNYIGHIVEKHRATLDPSAPRDFIDTYLLRMEK 81861822
81862554 EKSNHHTEFNHQNLVISVLSLFFAGTETTSTTLRCTFLIMLKYPHVA 81862694
81864745 EKVQKEIDQVIGSHRLPTPDDRTKMPYTDAVIHEIQRFADLTPIGLPHRVTKDTVFRGYLLPK 81864933
81865086 NTEVYPILSSALHDPRYFEQPDTFNPEHFLDANGALKKSEAFLPFST 81865226
81868929 GKRICLGEGIARNELFIFFTAILQNFTLASPVAPEDIDLTPINIGVGKIPSPYQINFLSR 81869108
>Cyp2b14-ps U33540 exon 1 add Chr1 (+) exons 7,8,9 72% to 2B21 to this pesudogene
81706300 MKPNVLLLLAILLSFLLFLVRGHAKVHGHLPPGPRPLPILGNLLQMDRGGLLQSF 81706464
81728276 EKVQKEIGEVTGSHWFPILYSSKIPNTEAVIPEIQR 81728383
81728385 FSDLSSVVLPQRVTKDTFFQGFLLHK 81728462
81728634 NTEVYPILSSVLHDPQ 81728681
81728681 VLEYPVTFNPEHFLDANGALKKNEAFTPFSR 81728773
>Cyp2b21 AF159245 Chr1 (-)
81765226 MDPSVLLLFALFTGFLLLLIRGQGNGYGHLPPGPCPLPLLGNVLQMDRRGLLKSFIQ 81765056
81759108 LRDKYGDVVTVHLGPRPIVMLYGTETIREALVDHAEAFSGRGTVAVVQPIIQDY 81758947
81758804 GMIFANGERWKILRRFSLATMRDFGMGKRSVEERIKEEAQCLVEELKKYK 81758655
81757889 GAPLDPTFHLQCITANIICSIVFGERFDYTDHQFLHLLDLFYEILSLVSSFSSQ 81757728
81749057 VFELFPGFLKYFPGTHRHISKNIEEILNFIGHCVEKHRATLDPSTPRDFIDTYLLRMEK 81748881
81748412 EKLNHHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVA 81748272
81747109 EKVQKEIDQVIGSHRVPTLDDRIKMPYTDAVIHEIQRFSDLVPIGLPHRVTKDTLFRGYLLPK 81746921
81746748 NIEVYPILSSALHDPQYFEHPDTFNPEHFLDANGALKKNEAFLPFST 81746608
81736831 GKRVCLGEGIARNELFLFFTTILQNFSVSSPVSPKDIDLTPKESGFAKIPPTYQICFLSRQLG 81736643
>Cyp2b31 86% to 2b19 possible ortholog
81918041 MELGVFLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81918214
81919826 LQEKYGDVFTVHLGPRPVVILCGTDTMREALVDQAEAFSGRGTVAVLHPVVQGY 81919987
81920130 GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK 81920279
81922129 GALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ 81922290
81923031 VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFIDTYLLHMEK 81923207
81923977 EKSNHHTEFHHQNLVISVLSLFFAGTETTSTTLRYSFLIMLKYPHVA 81924117
81926113 EKVQKEIDQVISSHRLPTLDDRIKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK 81926301
81926476 NTEVYPILSSALHDPRYFDHPDTFNPEHFLDANGTLKKSEAFLPFST 81926616
81930286 GKRTCLGEGIARNELFIFFTALLQNFSLASPVAPEDIDLTPINSGAGKIPSPYQINFLSR 81930465
>Cyp2b15 D17343 to D17349 86% to 2b19 exons 2-4 in a seq gap in the genome
seq Chr1 (+)
81945068 MELGVLLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81945241
LQEKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGY
GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK
GALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ
81950148 VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFINTYLLRMEK 81950324
81951073 EKSNHHTEFHHQNLVISVLSLFFTGTETTSTTLRYSFLIMLKYPHVA 81951213
81953132 EKVQKEIDQVIGSHRLPTLDDRTKMPYTDAVIHEIQRFADLIPIGLPHRVTNDTMFLGYLLPK 81953320
81953491 NTEVYPILSSALHDPRYFDHPDTFNPEHFLDVNGTLKKSEAFLPFST 81953631
81957185 GKRICLGEGIAQNELFIFFTAILQNFSLASPVAPEDIDLSPINSGISKIPSPYQIHFLSRCVG 81957373
>Cyp2b16-ps U33541 to U33546 bad boundary introns 1,5,7 chr1 (+)
81633949 MEPSVLLLLAVLLSFLLLLVRGHAKIHGRLPPGPCPVPLLGNLLQMDRRGLLKSFIQ (?) 81634119
81641847 LQEKYGDVFTVHLGLRPVVVLCGTQTIREALVDHAEAFSGRGTIAGLEPVFQDY (1) 81642008
81642149 GIFFSSGEQWKTLRRFSMATMRDFGMRKKSVEERIKEESQCLVEELKKYQ (1) 81642298
81642886 GAPLDPTFLFQCITSNIICSIVFGECFDYTDHQFLHLLDLMYQTFSLLSSIFSQ (0) 81643047
81645234 VFELFPGVLKYFPGAHRQISRNLHEILDFIGQSVEKHRATLDPNAPRDFIYTYLLHMEK (?) 81645410
81645864 QKSNHYTEFHHWNLLSSVLSLFFAGTETSSTTLRYGFLIMLKYPHIT (1) 81646004
81654168 EKVQKEIDCVIGSHRLPTLDDRSKMPYTEAVIHEIQRFSDLAPIGTPHRVIKDTIFRGYLLPK (?) 81654356
81654524 QNTEVFPILSSVLHDPQYFEQPDIFNLQHFLDANGALKIIEAFLPFST (1) 81654667
81659608 GKRICLGESIARNELFLFFTTILQNFSVSSPVAPKDIDLTPKESGIGRIPQVYQICFLAH* 81659781
>Cyp2c6v1-de1b2b3b4b5b
upstream pseudogene 96% identical to seq c
93% identical to seq upstream of CYP2C6v2 allele (temp name = CYP2Cnewb)
243935799 MDLVMLLVLTLSCLIFLSIWRQSSGRGKLP 243935888
243935888 SGPTPLPIIGNFFHLDLKNITQSLTN 243935965
243937699 FSKVNGSVFTLYFGMKPIVILHGYEAIKEGLIDHGEEFTERGSFPVAEKINKGL 243937860
243938035 GIAFSHGNRWKEIRRFTLMTLQNLGMGKKSIEDRVQEESRCLV 243938163
243939079 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLVEKLNENIKIVSSPWI* 243939231
243940291 FCSSFPVFIDYCPGSHMTLAKNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLINWKQ 243940467
>CYP2C6v1-de1b2b3b4b5b NW_047565.2 based on RGSC v3.4
12022325 MDLVMLLVLTLSCLIFLSIWRQSSGRGKLP 12022414
12026817 FCSSFPVFIDYCPGSHMTLAKNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLINWKQ 12026993
>Cyp2c6v1 seq NW_047565.2 based on RGSC v3.4 = M13711
note 2c77-ps seems to have a duplicate in 2c6, only 13 aa diffs
12042110 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS 12042277
12051305 FSKVYGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKD 12051463
12051638 LGIVFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTN 12051790
12052630 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 12052791
12053862 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 12054038
12071172 ENHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKCPEVT 12071312
12075683 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 12075868
12077474 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 12077614
12078771 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL 12078950
12080882 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL 12081061
>CYP2C6_v1 M13711 two aa changes to match many ESTs (lower case mi) due to frameshift
97% to 2C77 and 2C6v2
243955584 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS 243955751
243964779 FSKVYGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKD 243964937
243965112 LGIVFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTN 243965264
243966104 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 243966265
243967336 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 243967512
243984646 ENHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKCPEVT 243984786
243989157 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAmiHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 243989345
243990948 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 243991088
243992245 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL 243992424
>CYP2C6P M18336 J03509 M18774 an alternate splice version of 2C6
exon 8 is skipped and replaced by a cryptic exon just past the true exon 8
The GT boundary of the true exon 8 are the first two nucleotides of CYP2C6_v3
Cryptic exon 8
MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTSFSKV 200
201 YGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDLGIVFSHGNRW 380
381 KEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTNGSPCDPTFILGCAPCNVICS 560
561 IIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQFCSFFPVLIDYCPGSHTTLAKNVYHI 740
741 RNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQENHNPHSEFTLENLSITVTDLFGAGTE 920
921 TTSTTLRYALLLLLKCPEVTAKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFID 1100
1101 LIPTNLPHAVTCDIKFRNYLIPK 1169
>CYP2C6_v2 CK224594.1 CK224593.1 note: the _v2 means alternative splice version 2
CYP2C6_v3 CK224595.1 CK224596.1 (3 nuc shorter at the joint uses the second AG)
Beginning of exon 7 AGCTAAAG TCCAGGAAGA GATTGATCGT 243989183
GTGGTTGGCA AACATCGCAG CCCTTGCATG CAGGACAGGA GCCGCATGCC CTACACAGAT 243989243
GCCATGATTC ATGAGGTCCA GAGGTTCATT GACCTCATTC CTACCAACCT GCCACATGCG 243989303
GTGACCTGTG ACATTAAGTT CAGGAACTAC CTAATACCCA AG GT end of exon 7
Beginning of cryptic exon out of frame agcaggtaa tagaaactca 243991103
tttccatggt tccagtgaca tgcagaaccg tggggactta gagtgtgact ctacatgtgc 243991163
tgatagcttg catctgcatg ataaggagca taattttcat tgtgtatgca ctgtcctgga 243991223
tatgaccacc ttctttatca gggt end of cryptic exon
normal exon 9
1328 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL
>CYP2C6v2-de1b2b3b4b4c5b upstream pseudogene
EST CK224599.1 = 100% match with 4 frameshifts) so this is a real gene
clone_lib="RALIUNN03 Sprague-Dawley rat female liver
The CYP2C6_v1 sequence is also seen in this same mRNA library
This GNOMON prediction adds two upstream exons that do not belong to this gene
58596732 MDLVMLLVLTLSCLILLSIWRQSSGRGKHP 58596643 exon 1 frameshift
58596643 SGPTPLPIIGNFFHLDLNNITQSLTS (0) 58596566 exon 1
58594823 FSKVNGSVFTLYFGMKLIVILHGYAATKEGLIDHGEEFTKRGSFPVAEKINKGL (1) exon 2 58594662
58594487 GIAFSHGNRWKEIRRFTLMTLQNLGMGKESIEDRVQEETQCLV*ELRKTN (1) exon 3 58594338
58593451 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58593296
58592013 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58591858
58590797 FCSSFPVFIDYCLGSHMTLA 58590738
58590736 NVYHTRNYILKKIKEHQESLDVTNPHDFIDYDLIKWKQ 58590620
AVSIKRNS
>CYP2C6v2 allele not in figure, 13 aa diffs to CYP2C6_v1 XM_215255 NW_047916
we are assigning this allele status but it may be a separate gene
(temp name = CYP2Cnewb)
58578624 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 58578457
58576741 FSKVYGPVFTLYFGLKPTVILHGYEAVKEALIDHGEEFAERGSFPVVEKINKDL (1) 58576583
58576405 GIAFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDHVQEEARCLVEELRKTN 58576256
58575415 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKVLSSPWTQ 58575254
58574189 FCSFFPVLIDYCPGSHTTLAKNIYYIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 58574013
58554666 ESHNPHLEFTLENLSVTVTDLFGAGTETTSTTLRYALLLLLKYPEVT 58554526
58534931 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 58534743
58533131 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 58532991
58531833 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 58531654
>Cyp2c7 M18335 exons 1,2,3 and 6 are in sequence gaps 93% to 2C7 variant and 2C81
the yellow labels are from a random Chr1 piece that is similar to the CYP2C7 N-term
differences with the published 2C7 sequence M18335 are in cyan
MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK
FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMNENVTKGF
GIVFSNGNRWKEMRRFTIMNFRNLGIGKRNIEDRVQEEAQCLVEELRKTK
243849546 GSPCDPSLILNCAPCNVICSITFQNYFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243849385
243847566 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 243847390
243829444 GSPCDPSLILNCAPCNVICSITFQNHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243829283
this duplicate exon 4 is not in the right sequence order
ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT
243803857 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFINFVPTNLPHAVTCDIKFRNYLIPK 243803669
243800623 GTKVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 243800483
243799465 GKRACVGEGLARMQLFLFLTTILQNFNLKSLVHPKDIDTMPVLNGFASLPPTYQLCFIPS 243799286
11915970 GSPCDPSLILNCAPCNVICSITFQNHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 11915809
11936072 GSPCDPSLILNCAPCNVICSITFQNYFDYKDKEMLTFMEKVNENLKIMSSPWMQ 11935911
11885991 GKRACVGEGLARMQLFLFLTTILQNFNLKSLVHPKDIDTMPVLNGFASLPPTYQLCFIPS 11885812
>CYP2C7 variant unmapped 93% to 2C7 88% to 2C81
3463873 MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK 3464040
3479907 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF 3480068
3480234 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 3480383
3489182 GSPCDPSLILNCAPCNVICSITFQSHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 3489343
3491162 VCNSFPSLVDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 3491338
3505354 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 3505494
3406504 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 3406692
3408304 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 3408444
3409602 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFPGFASLPPFYELCFIPS 3409778
New frags on the plus strand between 2C7 and 2C6
>Cyp2c79-se1[9]
frag q Exon 9 100% to 2C79
243885148 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI* 243885330
11971674 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI 11971853
>Cyp2c-se6[9]
frag p exon 9 100% to Cyp2c82-ps-de9b
243895387 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 243895497
11981913 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 11982023
>seq upstream of 2C11
>CYP26A1 AF439720, NM_130408 Chr1 1Mb upstream of CYP2C cluster
242138769 MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPLPPG
TMGFPFFGETLQMVLQ (0) 242138581
242138389 RRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKK (0) 242138165
242137906 VIMQAFNREALQCYVPVIAEEVSGCLEQWLSCGERGLLVYPEV
KRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR (0) 242137616
242137537 GVKPRNLIHARIEENIRAKIRRLQAAERNAGCKDALQLLIEHSWERGERLDMQ (0) 242137379
242136717 ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSK (0) 242136583
242136000 GLLCKSHHEDKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELN (0) 242135848
242135595 GYQIPKGWNVIYSICDTHDVADSFTNKEEFNPDRFTSLHPEDTSRFSFIPFGGGLRSCRSKEFAKI
LLKIFTVELARRCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFQGDI* 242135254
>CYP26C1 XM_217935 94% TO 26C1 MOUSE Chr1 1Mb upstream of CYP2C cluster
242151281 MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKG
SMGWPFFGETLHWLVQ (0) 242151079
242150553 GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0) 242150422
242149883 VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVAVYQAAKALTFRMAAR
ILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK (0) 242149608
242148160 GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0) 242148005
242146368 ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP
DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) 242146051
242144220 GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYI
PFGGGARSCLGQELAQAVLQLLAVELVRTARWELATPAFPVMQTVPIVHPVD
GLLLLFHPLPTLGAGDGSPF* 242143843
>Cyp2c11 J02657 72% to CYP2C6_v1
243377899 MDPVLVLVLTLSSLLLLSLWRQSFGRGKLPPGPTPLPIIGNTLQIYMKDIGQSIKK 243378066
243379842 FSKVYGPIFTLYLGMKPFVVLHGYEAVKEALVDLGEEFSGRGSFPVSERVNKGL 243380003
243380160 GVIFSNGMQWKEIRRFSIMTLRTFGMGKRTIEDRIQEEAQCLVEELRKSK 243380309
GAPFDPTFILGCAPCNVICSIIFQNRFDYKDPTFLNLMHRFNENFRLFSSPWLQVCNT
FPAIIDYFPGSHNQVLKNFFYIKNYVLEKVKEHQESLDKDNPRDFIDCFLNKMEQEKH
NPQSEFTLESLVATVTDMFGAGTETTSTTLRYGLLLLLKHVDVTAKVQEEIERVIGRN
RSPCMKDRSQMPYTDAVVHEIQRYIDLVPTNLPHLVTRDIKFRNYFIPKGTNVIVSLS
SILHDDKEFPNPEKFDPGHFLDERGNFKKSDYFMPFSA
243416959 GKRICAGEALARTELFLFFTTILQNFNLKSLVDVKDIDTTPAISGFGHLPPFYEACFIPVQRADSLSSHL* 243417171
>Cyp2c24 92% to 2C80, M86678 has alternative splice first exon
no ESTs have this splice
CK481568.1 matches exons 1,2,3,4
CO565602.1 matched the end of the gene sequence and extends it a little 6 aa
Used this EST to blast the trace files to find the end of exon 7
gnl|ti|132779224 rts18e73.g from trace files for exon 7
NW_001084774.1 from 50448200 to 50492967 (+) strand
MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN CK481568.1 exon 1
243522073 FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL 243521912
243521366 GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 243521217
243518830 GSLCDPTFILSCAPSNVICSVVFHNRFDYKDENFLNLMEKLNENFKILNSPWMQ 243518669
VCNALPAFIDYLPGSHNRVIKNFAEI 676
677 KSYILRRVKEHQETLDMDNPRDFIDCFLIKME
QEKHNPRTEFTIESLMATVSDVFVAGSETTSTTLRYGLLLLLKHTEVT
AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK
GTDLVTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFKKSDYFMPFST
GKRMCVGEALARMELFLLLTTIVQNFNLKSFVDTKDIDTTPMANTFGRVPPSYQLCFIPR*
Exon 4
11605356 GSLCDPTFILSCAPSNVICSVVFHNRFDYKDENFLNLMEKLNENFKILNSPWMQ 11605195
>2C24 EST no ESTs have this splice
CK481568.1 matches second exon
MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN
FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL
GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN
GSLCDPTFILSCAPS
>Cyp2c80
CYP2C80 91% to 2C24 EST BP503815.1 has N-terminal, NW_001084774.1
50565229 MDAVLVLVLILSSLLLLSFWRQNSERRKLPPGPTPLPIIGNILQIDVKDISKSIKK 50565396
50591470 FSEVYGPVFTLYFGLKPTVVVYGYEVVKEVLDGEEFSGRGVFPIVTKVNNDL 50591625
50591809 GVIFSNGTKWKELRRFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 50591958
50595754 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDENFLNLMEKFNENFKILNSPWMQ 50595915
50599994 VCNAIPAFIDYLPGSHNKVIKNFAEIKSYILRRVKEHQETLDMDNPRDFIDCFLIKIE 50600167
50603953 QEKHNPCTEFTIQSLVATVTDVFVAGSETTSTTLRYGLLLLLKHTEVT 50604096
50605025 AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK 50605213
50607134 GTDLITSLTSVLHDDKEFPNPEVFDPGHFLDEHGNFKRSDYFMPFSS 50607274
50609683 GKRMCVGEALARMELFLLLTTIVQNFNLKSFVATKDIDTTPLTNTFGCVPPSYQLYFTPR* 50609865
>Cyp2c80 XM_217906.2 GNOMON exon 2 on AC109577.4 in HTGS 92% to 2C24, 73% to 2C11
MGWLSDP wrong N-term from GNOMON prediction (temp name = CYP2CNEWC)
Correct N-term possibly in a sequence gap
244632544 FSEVYGPVFTLYFGLKPTVVVYGYEVVKEVLDGEEFSGRGVFPIVTKVNNDL 244632389
this exon 2 does not match 2C24
244632205 GVIFSNGTKWKELRRFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 244632056
244628281 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDENFLNLMEKFNENFKILNSPWMQ 244628120
244624041 VCNAIPAFIDYLPGSHNKVIKNFAEIKSYILRRVKEHQETLDMDNPRDFIDCFLIKIE 244623868
244620080 QEKHNPCTEFTIQSLVATVTDVFVAGSETTSTTLRYGLLLLLKHTEVT 244619937
244619006 AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK 244618818
244616897 GTDLITSLTSVLHDDKEFPNPEVFDPGHFLDEHGNFKRSDYFMPFSS 244616757
244614348 GKRMCVGEALARMELFLLLTTIVQNFNLKSFVATKDIDTTPLTNTFGCVPPSYQLYFTPR* 244614166
>Cyp2c79 XM_219933 minus strand 72% to 2C6_v1 95% to seq e, 100% to seq q (exon 9),
93% to seq z (exon 5) (temp name = CYP2CNEWD)
244590183 MILGVFLGLFLTCLLLLSLWKQNFQRRNLPPGPTPLPIIGNILQIDLKDISKSLRN 244590016
244575990 FSKVYGPVFTLYFGRKPAVVLHGYEAVKEALIDHGEEFAGRGIFPVAEKFNKNC 244575829
244575612 GVVFSSGRTWKEMRRFSLMTLRNFGMGKRSIEDRVQEEARCLVDELRKTN 244575463
244553851 GVPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLALIDILNENVEILSSPWIQ 244553690
244525726 ICNNFPAIIDYLPGRHRKLLKNFAFAKHYFLAKVIQHQESLDINNPRDFIDCFLIKMEQ 244525550
244524359 EKHNPKTEFTCENLIFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT 244524219
244517844 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQRYIDLLPTSLPHALTCDMKFRDYFIPK 244517656
244516177 GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDENGKVKKSDYFFPFST 244516037
244496745 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI 244496566
>2c79 exon 8 closest match is to 2c82-ps only 2 aa diffs
12604370 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQRYIDLLPTSLPHALTCDMKFRDYFIPK 12604182
12602703 GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDENGKVKKSDYFFPFST 12602563
>2c11 exon 8
11502602 GTNVIVSLSSILHDDKEFPNPEKFDPGHFLDERGNFKKSDYFMPFS 11502739
>Cyp2c79-de9b exon 9 62% to 2C79 2 aa diffs to seq d and seq p minus strand
244491372 G*WICVREDLAQMTLFLFCPTILKNFNLNSQVNPKEL 244491262
>Cyp2c79-de9b NW_047565.2 based on RGSC v3.4
12577898 G*WICVREDLAQMTLFLFCPTILKNFNLNSQVNPKEL 12577788
interval between 2C79 and 2C6
>Cyp2c6-se1 [1:2:3:2:3]
>Cyp2c6-se1[1:2:3:2:3] frag n+m exons 1,2,3 2C6 like pseudogene plus strand exon 2,3 100% to seq m
244044941 MDHTTGTYTLSLILLSL*RQSSGRGKIPPGPTPLPIIDNLLQLDIKNVTQYLAN (0) 244045102
244050420 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244050581
244050793 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244050873
frag m Exons 2,3 2C6 like pseudogene 100% to seq n
244052306 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244052467
244052679 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244052759
>CYP2C6-se1[1:2:3:2:3] frag n+m NW_047565.2 (+) strand
12131467 MDHTTGTYTLSLILLSL*RQSSGRGKIPPGPTPLPIIDNLLQLDIKNVTQYLAN (0) 12131628
12136946 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 12137107
12137319 KTFTLMTLQNLRMGKGNIEDHVQE*AQ 12137399
12138832 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 12138993
12139205 KTFTLMTLQNLRMGKGNIEDHVQE*AQ 12139285
>Cyp2c7-se1[2:3:6:7:9]
frag j+k exons 2,3,6,7,9 (6,7 and 9 have 1 aa diff to 2C7)
exons 2,3 = 100% to 2C7 variant, 2 aa diffs to 2C7
Cyp2c7-se2[2:3] = frag k now joined with frag j
Cyp2c7-se2 and Cyp2c7-se1 appear to be parts of a mirror duplication of Cyp2c7
244064158 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF 244064319
244064485 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 244064634
244103321 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 244103461
244120225 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFIDFVPTNLPHAVTCDIKFRNYLIPK 244120413
244124319 FLXXXLQNFNLKSLXHPKDIDTMPVLNXXASLPPTYQLCFIPS 244124447
>CYP2C7-se1[2:3:6:7:9] frag j+k NW_047565.2 (+) strand
12150684 FSKTYGPVFTLYLGSQPTVILHGYEAIKEA
12150774 LIDNGEKFSGRGSYPMIENVTKGF 12150845
12151011 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 12151160
12189847 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 12189987
12206751 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFIDFVPTNLPHAVTCDIKFRNYLIPK 12206939
12210845 FLXXXLQNFNLKSLXHPKDIDTMPVLNXXASLPPTYQLCFIPS 12210973
>CYP2C7? 1 aa diff to frag j (-)
11890383 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFINFVPTNLPHAVTCDIKFRNYLIPK 11890195
11885940 FLTTILQNFNLKSLVHPKDIDTMPVLNGFASLPPTYQLCFIPS 11885812
>Cyp2c13-se1[6]
frag h 72% to 2C13 exon 6 plus strand 100% to seq s
70% to 2C12 exon 6 h
244165142 ENGNQQMNYTQEHLATMVTDLL 244165207
244165209 FGGRETLNSTMRFAFLFLMKYPYTT 244165284
>Cyp2c13-se1[6] NW_047565.2 (+) strand
12251668 ENGNQQMNYTQEHLATMVTDLLF 12251736
12251735 FGGRETLNSTMRFAFLFLMKYPYTT 12251809
>frag s identical to Cyp2c13-se1[6] NW_047565.2
11852957 ENGNQQMNYTQEHLATMVTDLLF 11852889
11852890 FGGRETLNSTMRFAFLFLMKYPYTT 11852816
>Cyp2c22-se1[8]
frag g exon 8 72% to 2C22 minus strand
244201638 KFDHGNFLDDR 244201606
244201606 GNFK*NDYFMAFLA 244201565
>CYP2C22-se1[8] frag g exon 8
12288164 KFDHGNFLDDR 12288132 (-)
12288133 GNFK*NDYFMAFLA 12288092 (-)
>Cyp2c13-se3[1:2:3:2:3]
frag f Exons 1,2,3,2,3 exon 1 = 66% to 2C13 Minus Strand
exons 2,3 = 57% to 2C13
two identical copies of exons 2,3 100% to seq v exons 2,3
244215468 SQSFLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 244215328
244214467 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 244214306
244214137 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213988
244213484 R*FS*RGWFSIFGKFSKVQ 244213428
244213259 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213110
>CYP2C13-se3[1:2:3:2:3:] frag f NW_047565.2
12301994 SQSFLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 12301854 (-)
12300993 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 12300832 (-)
12300663 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 12300514 (-)
12300010 R*FS*RGWFSIFGKFSKVQ 12299954 (-)
12299785 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 12299636 (-)
>Cyp2c82-ps frag e Exons 1,4,4,5,6,7,8,9 almost an exact duplicate of seqs w,x,y,z,
exons 6-9 of the wxyz cluster in a seq gap Plus Strand
also NW_047565.2 12305221 to 12373089
244218695 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLINE (0) 244218865
244233879 LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 244234019
244240189 GVPCDPTFILGCAPCNVICSIVFQNHFNYKGQEFLALIDTLNENVEILSSPWIQ 244240350
244265531 ICNNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ 244265707
244266904 KHNPKTEFTCKNLIFTASDLFAAGTETTSPTLRYSLLLLPKYPEV 244267038
244273480 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQ*YIDLLPTSLPHALTCDMKFRDYFIPK 244273668
244275197 GTTVIASLTSVLYDDKEFPNPEKFDLSHFLDENGKFKKSDYFFPFST 244275337
244286429 GKRICVGEGLAQTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIP 244286605
>Cyp2c82-de9b frag d Exon 9 identical to seq p
244289962 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 244290072
>CYP2C82P-de9b frag d Exon 9 NW_047565.2
12376437 FFTGKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 12376556
>Cyp2c77-ps-de1b2b3b4b5b
frag c Pseudogene 96% to 2C6_v1 exons 1-5 with partial deletion of exon 3 Plus Strand
244337898 MDLVMLLVLTLSCLILLSIWSQSSGRGKLP 244337987
244337987 SGPTPLPIIGNFFHLDLKNITQSLTS 244338064
244339793 FSKVNGSVFTLYFGMKPIVILHGYEAIK*GLIDHREEFTERGSFPVAEKINKGL 244339954
244340129 GIAFSHGNRWKEIRRFTLMTLQNLGMGK 244340212
244341157 GSPCDPTFILGCAPCNVICSIIFQNSFDYKDQDFLSLMEKLNENIKIVSSPWI* 244341318
244342872 FCSSFPVFIDYCPGIHMTLA 244342931
244342933 KNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLIKWKQ 244343049
>Cyp2c77-ps variant of 2C6 13 aa diffs to CYP2C6_v1, 16 aa diffs to 2C6v2
This gene has three frameshifts
244357850 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 244358017
244359760 FSKVYGPVFTLYFGMKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDL (1) 244359921
244360096 GIIFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEE 244360230
244360232 MRKTN 244360246
244361085 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 244361246
244362321 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYL 244362410
244362412 LKKIKEHQESLDVTNPQDFIDYYLIKWKQ 244362498
244381928 ESHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKYPEIT 244382068
244392235 AKVQEEIDRVFGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPM 244392423
244394012 GTTIITSLSSVLHDSKEFPNPEIFDPGHFLDGNGKFKKSDYFMPFSA 244394152
244395307 GKRMFAGEGLA 244395339
244395341 RMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 244395487
2c77-ps seq
12468454 ESHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKYPEIT 12468594
12478761 AKVQEEIDRVFGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPM 12478949
12480538 GTTIITSLSSVLHDSKEFPNPEIFDPGHFLDGNGKFKKSDYFMPFSA 12480678
12481833 GKRMFAGEGLA 12481865
12481867 RMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 12482013
>Cyp2c82-ps-se1[1:4:4:5]
frag z Exon 5 minus strand 1 aa diff to Cyp2c82-ps
243632036 ICDNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ (0) 243631860
NW_047565.2
11718562 ICDNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ 11718386
frag y Exon 4 minus strand 92% to seq e
243654367 GVPCDPTFILGCAPCNVICSIVFQNHFNYKDQEFLALIE 243654251
243654249 LNENVEILSSP*IQ 243654208
11740893 GVPCDPTFILGCAPCNVICSIVFQNHFNYKDQEFLALIE 11740777
11740774 LNENVEILSSP*IQ 11740733
frag x exon 4 minus strand 100% to seq e short exon 4
243659542 LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 243659402
11746068 LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 11745928
frag w Exon 1 minus strand 100% to seq e
243675609 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLIN 243675442
11762135 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLIN 11761968
>Cyp2c13-se4[1:2:3]
frag v Exon 1 (+) 59% to 2C13
243678671 FLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 243678802
Exon 2 (+) 48% to 2C79
243679647 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 243679808
Exon 3 (+) 100% to seq f
243679977 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 243680126
>Cyp2c13-se4[1:2:3] frag v NW_047565.2 identical to frag f
11765188 SQSFLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 11765328 (+)
11766173 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 11766334 (+)
11766503 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 11766652 (+)
>Cyp2c22-se3[8]
new frag 10, identical to frag g
11774859 KFDHGNFLDDR 11774891 (+)
11774890 GNFK*NDYFMAFLA 11774931 (+)
>Cyp2c22-se4[8]
new frag 11, identical to frag g
11776742 KFDHGNFLDDR 11776774 (+)
11776773 GNFK*NDYFMAFLA 11776814 (+)
>Cyp2c7-se4[8:9]
frag u Exon 8 minus strand exon 8 = 87% to frag 2, 8+9 = 63% to 2C7
243726168 GVMVITSLSSALHDNKEFPNPKRFDPG*FLDRNGNFKKTDYFILFSA 243726028
Exon 9 minus strand 60% to 2C7
243723025 CVGEGLTPIELFLFLTRILQNFNLKHLTHTEAVDTTPVLSRLTSVSPALKLFFIP 243722861
11812694 GVMVITSLSSALHDNKEFPNPKRFDPG*FLDRNGNFKKTDYFILFSA 11812554
11809551 CVGEGLTPIELFLFLTRILQNFNLKHLTHTEAVDTTPVLSRLTSVSPALKLFFIP 11809387
>Cyp2c7-se3[8]
frag t Exon 8 minus strand 82% to 2C7
243749788 TIVIT*LTSVLHDSKKFPNPEMLDSGHFLDENGNFKKSEYFMPFSA 243749651
11836314 TIVIT*LTSVLHDSKKFPNPEMLDSGHFLDENGNFKKSEYFMPFSA 11836177
>Cyp2c13-se2[6:7]
frag s Exons 6-7 minus strand 72% to 2C12 exon 6 100% to seq h
243766431 ENGNQQMNYTQEHLATMVTDLL 243766366
243766364 FGGRETLNSTMRFAFLFLMKYPYTT 243766290
243760156 XQINEEIGQVIWRHHSPSMLDWSHMIYTNAMVHEVQRYIDLAPNGVVCEVNCDTKYPRDYFIPK 243759968
11852957 ENGNQQMNYTQEHLATMVTDLLF 11852889
11852890 FGGRETLNSTMRFAFLFLMKYPYTT
11846721 IQINEEIGQVIWRHHSPSMLDWSHMIYTNAMVHEVQRYIDLAPNGVVCEVNCDTKYPRDYFIPK 11846494
>Cyp2c7-de7b frag r Exon 7 (+) 100% to seq a CYP2C81-de7b
243792966 RVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 243793151
11879492 RVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 11879677
>Cyp2c81 93% to 2C7 28 aa diffs missing exon 1 Plus Strand, 91% to seq j (exons 6,7)
93% to seq k (exons 2,3)
244672079 FSKTYGPVFTLYLGSQPTVILHGYEAIKKALIDHGEKFSGRGSYPMIENVTKGF 244672240
244672408 GIAFSNGNRWKEIRRFTIMTLRNLGMGKRNIEDRVQEEAQRLVEELRKTK 244672557
244681144 GSPCDPSFILNCAPCNVICSITFQNHFDYKDKEILTFMEKVNENVKIMSSPRMQ 244681305
244683123 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVEYYLIKQKQ 244683299
244699290 ANHIEQSEYSHENLACSIMDLIGAGTETMSSTLRYALLLLMKYPHVP 244699430
244713313 AKVQEEIDHVIGRYRSPCMQDRSHMPYTDAMIHEVQRFINFVPTNLLHAVTCDIKFRNYLIPK 244713501
244717457 GTKVLTSLTSVLHGSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 244717597
244718606 GKRACVGEGLARMELFLFLTTILQNFKLKSLVHPKDIDTRPVLNGFASLPPTYQFCFIPS 244718785
>Cyp2c81 NW_047565.2, based on RGSC v3.4 (+) strand, missing exon 1, also AAHX01006946.1, and AC120070.5, might be pseudogene
12758605 FSKTYGPVFTLYLGSQPTVILHGYEAIKKALIDHGEKFSGRGSYPMIENVTKGFG 12758769
12758934 GIAFSNGNRWKEIRRFTIMTLRNLGMGKRNIEDRVQEEAQRLVEELRKTK 12759083
12767670 GSPCDPSFILNCAPCNVICSITFQNHFDYKDKEILTFMEKVNENVKIMSSPRMQ 12767831
12769649 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVEYYLIKQKQ 12769825
12785816 ANHIEQSEYSHENLACSIMDLIGAGTETMSSTLRYALLLLMKYPHVP 12785956
12799839 AKVQEEIDHVIGRYRSPCMQDRSHMPYTDAMIHEVQRFINFVPTNLLHAVTCDIKFRNYLIPK 12800027
12803983 GTKVLTSLTSVLHGSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 12804123
12805132 GKRACVGEGLARMELFLFLTTILQNFKLKSLVHPKDIDTRPVLNGFASLPPTYQFCFIPS* 12805314
>Cyp2c81-de7b frag a Exon 7 minus Strand 100% to seq r, 80% to 2C13
244724629 LRVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 244724441
>CYP2C81-de7b frag a Exon 7, NW_047565.2 based on RGSC v3.4 (-) strand
12811155 LRVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 12810967
>Cyp2c81-de8b frag 1 Exon 8 93% to 2C7 Plus Strand
244737232 GTTVLTSLTSVLHDSKEFPNPEMFDPGHFLDENRNFKKSDYFMPFSA 244737372
>CYP2C81-de8b frag 1 Exon 8 NW_047565.2 based on RGSC v3.4 (+) strand
12823758 GTTVLTSLTSVLHDSKEFPNPEMFDPGHFLDENRNFKKSDYFMPFSA 12823898
>Cyp2c81-de8c frag 2 Exon 8 76% to 2C13 Plus Strand 87% to seq u
244764239 GMMVITSLSSVLHYNKEFPNPERFDPGYFLDGNGNFKKTDYFILFSA 244764379
>CYP2C81-de8c frag 2 Exon 8 NW_047565.2 based on RGSC v3.4 (+) strand
12850765 GMMVITSLSSVLHYNKEFPNPERFDPGYFLDGNGNFKKTDYFILFSA 12850905
>Cyp2c81-de3h 71% to CYP2C13-se4[1:2:3] exon 3
3225 FSHKNVWKVNRFFTHTTLKNFRMGKGITKNKVHEEVEWLVKELKK 3359
>Cyp2c81-de1d frag 3 Exon 1 with frameshift Plus Strand 85% to seq e 83% TO SEQ w
244783632 MDLVVVL 244783652
244783654 CSVSSLLLFSLWRQSSWRRKLPPGPNPLPIIGNFLQIDLNNLCQSLNN (0) 244783797
>CYP2C81-de1d frag 3 on map, NW_001084775
6139 SVSSLLLFSLWRQSSWRRKLPPGPNPLPIIGNFLQIDLNNLCQSLNN (0) 6279
>Cyp2c81-de6e7e frag 4 exon 6 70% to 2C13 Plus Strand
244799349 ELELEHLGSMVTDLFFAGIESIRTTMIFALLFLLNTHTSQ 244799468
exon 7 82% to 2C13, 86% to seq r and seq a
244801583 LQNRSHMPYTNAMVHEVQRYSDIVPNNIVHEVTSDTKFRNYFIPK (0) 244801717
>CYP2C81-de6e7e frag 4 on map, NW_001084775
21739 ELELEHLGSMVTDLFFAGIESIRTTMIFALLFLLNTHTSQ 21858
23973 LQNRSHMPYTNAMVHEVQRYSDIVPNNIVHEVTSDTKFRNYFIPK (0) 24107
>CYP2C81-de6e7e frag 4 on map, NW_047565.2 based on RGSC v3.4 (+) strand
12885875 ELELEHLGSMVTDLFFAGIESIRTTMIFALLFLLNTHTSQ 12885994
12888109 LQNRSHMPYTNAMVHEVQRYSDIVPNNIVHEVTSDTKFRNYFIPK 12888243
>Cyp2c81-de4g5g (-) strand, exons 4,5 NW_001084775
91407 GSPCDPQFIMRCTSCNVICSIILQNHFDYEDRIFLA 91300
91312 DFLSLIEIVNESNKILSSPGIQ 91247
90145 VFDAFPLLLDFCP
90051 ENKKSLDVTNPQDFIDCFLIHRRQGNG 89971
>Cyp2c81-de1f2f3f (-) strand, frag 5 on map, NW_001084775
49635 MDLVTFLVLTLSSLILLSLWR*NSRRRKLPPGPTPLLIIGNFLQLDVKNVSQSLTM (0) 49468
36143 FSKAYGPVFTLYLGSQPTVILHGYEAVKETLIDHGEEFSGRGSFPMVEKAFKCF 35982
35816 GIVFSNGNR*KEIRQFIIMTLQNLGMGKRNIEDHVQEEAQCLVEELRKTK 35667
>CYP2C81-de1f2f3f frag 5 Exons 1,2,3 84% to 2C7 variant Minus Strand
244826982 MDLVTFLVLTLSSLILLSLWR*NSRRRKLPPGPTPLLIIGNFLQLDVKNVSQSLTM (0) 244826815
244813456 FSKAYGPVFTLYLGSQPTVILHGYEAVKETLIDHGEEFSGRGSFPMVEKAFKCF 244813295
244813129 GIVFSNGNR*KEIRQFIIMTLQNLGMGKRNIEDHVQEEAQCLVEELRKTK 244812980
very large gap 244845025-245223024 378kb
>Cyp2c13v1 100% first 5 exons
Note this seq also on 100.0% Un ++ 17276272 17282257
Exons 6-9 are on 99.1% Un ++ 17323193 17358099 2 aa diffs to 2C13 J02861
CYP2C12 is also on this same contig 99.6% Un ++ 17388090 17446950 2 aa diffs
Minus Strand HSPs:
245246208 MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN (0) 245246041
245244920 FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ (1) 245244759
245244599 GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN 245244450
245240888 GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ (0) 245240727
245239607 VFNIFPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ 245239431
>Cyp2c13-de1b2b frag 7 Exon 1 76% to 2C13 Minus Strand
245307855 MDPIVVLVLSLSCLLFLSLWRNNSRRGKLPPGPTPLPIIRNYLQLDMKDIC*SLTK (0) 245307688
frag 6 Exon 2 83% to 2C13 Minus Strand
245292652 FSKTYGPVYTLYFGSQPTVLLYGYEALKEALIDYGEAFSGRGRIPIHEKVSKGQ 245292491
>Cyp2c22-se2[1:2] frag 9 Exon 1 61% to 2C22 Minus Strand
245347583 MDLFIILWICFACLSLFFLWNQLHYKEKLPPGPVPLPIVGNILQVNIKSIIKSLNI (0) 245347416
frag 8 Exon 2 79% to 2C22 Minus Strand
245334622 LAKEYGPVFTVYLGMKPTVVLHGHKALKEALIDRANEFSVKMQSSLLSKESQGL (1) 245334461
>Cyp2c12 J03786 80% to 2C13
MDPFVVLVLSLSFLLLLYLWRPSPGRGKLPPGPTPLPIFGNFLQ
IDMKDIRQSISNFSKTYGPVFTLYFGSQPTVVLHGYEAVKEALIDYGEEFSGRGRMPV
FEKATKGLGISFSRGNVWRATRHFTVNTLRSLGMGKRTIEIKVQEEAEWLVMELKKTK
GSPCDPKFIIGCAPCNVICSIIFQNRFDYKDKDFLSLIENVNEYIKIVSTPAFQVFNA
FPILLDYCPGNHKTHSKHFAAIKSYLLKKIKEHEESLDVSNPRDFIDYFLIQRCQENG
NQQMNYTQEHLAILVTNLFIGGTETSSLTLRFALLLLMKYPHITDKVQEEIGQVIGRH
RSPCMLDRIHMPYTNAMIHEVQRYIDLAPNGLLHEVTCDTKFRDYFIPKGTAVLTSLT
SVLHARKEFPNPEMFDPGHFLDENGNFKKSDYFMPFSAGKRKCVGEGLASMELFLFLT
TILQNFKLKSLSDPKDIDINSIRSEFSSIPPTFQLCFIPV
>Cyp2c12-de8b 1 aa diff to Cyp2c81-de8c, (-) strand NW_001084775
127213 GMMVITSLSSVLHYNKEFPNPERFYPGYFLDGNGNFKKT 127088
>Cyp2c13v1 J02861 80% to 2C12
MDPVVVLLLSLFFLLFLSLWRPSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN
FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ
GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN
GSPCDPQFIMGCAPGNVICSIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ
VFNIFPILLDYCPGNHNIYFKNHTWLKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ
ENANQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVT
AKVQEEIDHVIGRHRSPCMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHEVTCDTKFRNYFIPK
GTAVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSA
GKRMCLGESLARMELFLFLTTILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL
>Cyp2c13-de4c5c between 2c12 and 2c13 exons 4,5
202080 GSPCDPQLITGCPPCNIFCAITLKNHFG*RR*DFLTLIHRNNERTKILSSPWLQ 202241
203414 VCNAFPILLDYCPGNH 203461
203494 KNYILEKVKNVKNHWNVTNHQDFTDYFLIQRXX 203586
>CYP2C13v2 Not in figure probable 2C13 allele NM_138514 7AA DIFFS TO 2C13v1 (98%)
80% to 2C12 (temp name = CYP2CNEWA)
MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQ
VDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPI
CEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN
GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNI
FPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQENA
NQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVTAKVQEEIDHVIGRH
RSPSMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHDVTCDTKFRNYFIPKGTAVLTSLT
SVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLT
TILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL
>Cyp2c22 M58041 61% to 2C79
245425985 MALFIFLGIWLSCLVFLFLWNQHHVRRKLPPGPTPLPIFGNILQVGVKNMSKSMCM 245425818
LAKEYGPVFTMYLGMKPTVVLYGYEVLKEALIDRGEEFSDKMHSSMLSKVSQGL
GIVFSNGEIWKQTRRFSLMVLRSMGMGKRTIENRIQEEVVYLLEALRKTN
GSPCDPSFLLACVPCNLISSVIFQHRFDYSDEKFQKFIENFHTKIEILASPWAQ
LCSAYPVLYYLPGIHNKFLKDVTEQKKFILMEINRHRASLNLSNPQDFIDYFLIKMEKEKHN
EKSEFTMDNLIVTIGDLFGAGTETTSSTIKYGLLLLLKYPEVTAKIQEEITRVIGRHR
RPCMQDRNHMPYTDAVLHEIQRYIDFVPIPLPRKTTQDVEFRGYHIPK
GTSVMACLTSALHDDKEFPNPEKFDPGHFLDEKGNFKKSDYFMAFSA
GRRACIGEGLARMEMFLILTSILQHFILKPLVNPEDIDTTPVQPGLLSLPPPFQLCFIPV
>Cyp2c23 X55446 59% to 2C11
MELLGFTTLALVVSVTCLSLLSVWTKLRTRGRLPPGPHPPSHYW
ESTATEPQGHPASLSKLAKEYGPVYTLYFGTSPTVVLHGYDVVKEALLQQGDEFLGRG
PLPIIEDTHKGYGLIFSNGERWKVMRRFSLMTLRNFGMGKRSLEERVQEEAWCLVEEL
QKTKAQPFDPTFILACAPCNVICSILFNDRFQYNDKTFLNLMDLLNKNFQQVNSVWCQ
MYNLWPTIIKYLPGKHIEFAKRIDDVKNFILEKVKEHQKSLDPANPRDYIDCFLSKIE
EEKDNLKSEFHLENLAVCGSNLFTAGTETTSTTLRFGLLLLMKYPEVQAKVHEELDRV
IGRHQPPSMKDKMKLPYTDAVLHEIQRYITLVGSSLPHAVVQDTKFRDYVIPKGTTVL
PMLSSVMLDQKEFANPEKFDPGHFLDKNGCFKKTDYFVPFSLGKRACVGESLARMELF
LFFTTLLQKFSLKTLVEPKDLDIKPITTGIINLPPPYKLCLVPR
>Cyp2d1 J02867
MELLNGTGLWSMAIFTVIFILLVDLMHRRHRWTSRYPPGPVPWP
VLGNLLQVDLSNMPYSLYKLQHRYGDVFSLQKGWKPMVIVNRLKAVQEVLVTHGEDTA
DRPPVPIFKCLGVKPRSQGVILASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA
GHLCDAFTAQAGQSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMVKLVEESLTE
VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMALLDNLLAENRTTWDPAQPPRNLTD
AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV
QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRFTSCDIEVQDFVI
PKGTTLIINLSSVLKDETVWEKPHRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPVGQPRPSTHGFFAFPVAPLPYQLCAVVREQGL
>2d3-de8b CYP2D pseudogene Chr 7 ++ 120811066 120811206 2aa diff to 2D2/2D3 exon 8
between 2D1 and 2D3
GTTLIPNLSSLLNDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA
>Cyp2d2 X52027 X52455
MGLLIGDDLWAVVIFTAIFLLLVDLVHRHKFWTAHYPPGPVPLP
GLGNLLQVDFENMPYSLYKLRSRYGDVFSLQIAWKPVVVINGLKAVRELLVTYGEDTA
DRPLLPIYNHLGYGNKSKGVVLAPYGPEWREQRRFSVSTLRDFGVGKKSLEQWVTEEA
GHLCDTFAKEAEHPFNPSILLSKAVSNVIASLVYARRFEYEDPFFNRMLKTLKESFGE
DTGFMAEVLNAIPILLQIPGLPGKVFPKLNSFIALVDKMLIEHKKSWDPAQPPRDMTD
AFLAEMQKAKGNPESSFNDENLRLVVIDLFMAGMVTTSTTLSWALLLMILHPDVQRRV
HEEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADIVPTNIPHMTSRDIKFQGFLI
PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVLAGRPRPSTHGVYALPVTPQPYQLCAVAR
>Cyp2d3 X52028
MELLAGTGLWPMAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWP
VLGNLLQVDLCNMPYSMYKLQNRYGDVFSLQMGWKPVVVINGLKAVQELLVTCGEDTA
DRPEMPIFQHIGYGHKAKGVVLCTYGPEWREQRRFSVSTLRNFGVGKKSLEQWVTDEA
SHLCDALTAEAGRPLDPYTLLNKAVCNVIASLIYARRFDYGDPDFIKVLKILKESMGE
QTGLFPEVLNMFPVLLRIPGLADKVFPGQKTFLTMVDNLVTEHKKTWDPDQPPRDLTD
AFLAEIEKAKGNPESSFNDANLRLVVNDLFGAGMVTTSITLTWALLLMILHPDVQCRV
QQEIDEVIGQVRHPEMADQAHMPFTNAVIHEVQRFADIVPMNLPHKTSRDIEVQGFLI
PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPTGQPRPSDYGVFAFLLSPSPYQLCAFKR
>Cyp2d4v1 M22331, X52029, X52457 I,T,P seen in ESTs TDI or ANV seen in ESTs same lib
MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWP
VLGNLLQIDFQNMPAGFQKLRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTA
DRPPLHFNDQSGFGPRSQGVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEA
RCLCAAFADHSGFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEE
ESGFLPMLLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTD
AFLAEVEKAKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILHPDVQCRV
QQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLI
PKGTTLITNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPTGQPRPSDYGIFGALTTPRPYQLCASPR
>Cyp2d4v2 CYP2D18X U48219 S77859 ONLY 5 AA DIFFS probably = 2D4
MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWP
VLGNLLQIDFQNMPAGFQKLRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTA
DRPPLHFNDQSGFGPRSQGVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEA
RCLCAAFADHSGFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEE
ESGFLPMLLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTD
AFLAEVEKAKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILRPDVQCRV
QQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLI
PKGTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQRFSFSVPAGQPRPSNYGVFGALTTPRPYQLCASPR
>Cyp2d5 X52030 X52458
MELLNGTGLWPMAIFTVIFILLVDLMHRHQRWTSRYPPGPVPWP
VLGNLLQVDPSNMPYSMYKLQHRYGDVFSLQMGWKPMVIVNRLKAVQEVLVTHGEDTA
DRPPVPIFKCLGVKPRSQGVVFASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA
GHLCDAFTAQNGRSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMLTLVEESLIE
VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMAFLDNLLAENRTTWDPAQPPRNLTD
AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV
QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRITSCDIEVQDFVI
PKGTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEP
LARMELFLFFTCLLQHFSFSVPAGQPRPSTLGNFAISVAPLPYQLCAAVREQGH
>Cyp2d-se2[9] CYP2D pseudogene z chr7:120386407-120386565 exon 9 (+ strand) 73% to 2D3
ACLGEPLTCMELFLFFICLLQSFSFSVKAGQPRPSNHGIFEMPISPSSYQLCA
>Cyp2e1 J02627
MAVLGITIALLVWVATLLVISIWKKIYNSWNLPPGPFPLPILGN
IFQLDLKDIPKSFTKLAKRFGPVFTLHLGSRRIVVLHGYKAVKEVLLNHKNEFSGRGD
IPVFQEYKNKGIIFNNGPTWKDVRRFSLSILRDWGMGKQGNEARIQREAQFLVEELKK
TKGQPFDPTFLIGCAPCNVIADILFNKRFDYNDKKCLRLMSLFNENFYLLSTPWIQLY
NNFADYLRYLPGSHRKIMKNVSEIKQYTLEKAKEHLQSLDINCARDVTDCLLIEMEKE
KHSQEPMYTMENVSVTLADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG
PSRVPAVRDRLDMPYMDAVVHEIQRFINLVPSNLPHEATRDTVFQGYVIPKGTVVIPT
LDSLLYDSHEFPDPEKFKPEHFLNENGKFKYSDYFKAFSAGKRVCVGEGLARMELFLL
LSAILQHFNLKSLVDPKDIDLSPVTVGFGSIPPQFKLCVIPRS
>Cyp2f4 AF017393 end of exon 5 and exon 6 in seq gap in genome seq chr1 (+)
82269864 MDGVSTAILLLLLAVISLSLTFTSWGKGQLPPGPKPLPILGNLLQLRSQDLLTSLTK 82270034
82270123 LSKDYGSVFTVYLGPRRVIVLSGYQTVKEALVDKGEEFSGRGSYPIFFNFTKGN 82270284
82272477 GIAFSDGERWKILRRFSVQILRNFGMGKRSIEERILEEGSFLLDVLRKTE 82272626
82276791 GKPFDPVFILSRSVSNIICSVIFGSRFDYDDERLLTIIHFINDNFQIMSSPWGE 82276952
82277413 MYNIFPSLLDWVPGPHRRVFRNFGGMKD 82277496
LIARSVREHQDSLDPNSPRDFIDCFLTKMV
QEKQDPLSHFNMDTLLMTTHNLLFGGTETVGTTLRHAFLILMKYPKVQ
82279507 ARVQEEIDCVVGRSRMPTLEDRASMPYTDAVIHEVQRFADVIPMNLPHRVIRDTPFRGFLIPK 82279695
82281147 GTDVITLLNTVHYDSDQFKTPQEFNPEHFLDANQSFKKSPAFMPFSA 82281287
82282297 GRRLCLGEPLARMELFIYLTSILQNFTLHPLVEPEDIDLTPLSSGLGNLPRPFQLCMRIR 82282476
>Cyp2g1 M33296 J04715 M34444 chr1 (+)
81996311 MALGGAFSIFMTLCLSCLLILIAWKRTSRGGKLPPGPTPIPFLGNLLQVRIDATFQSFLK 81996490
81997087 LQKKYGSVFTVYFGPRPVVILCGHEAVKEALVDQADDFSGRGEMPTLEKNFQGY 81997248
81998826 GLALSNGERWKILRRFSLTVLRNFGMGKRSIEERIQEEAGYLLEELHKVK 81998975
82001483 GAPIDPTFYLSRTVSNVICSVVFGKRFDYEDQRFRSLMKMINESFVEMSMPWA 82001641
82002014 QLYDMYWGVIQYFPGRHNRLYNLIEELKDFIASRVKINEASFDPSNPRDFIDCFLIKMY 82002190
82004063 QDKSDPHSEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLMKYPEVE 82004206
82005533 AKIHEEINQVIGTHRTPRVDDRAKMPYTDAVIHEIQRLTDIVPLGVPHNVIRDTHFRGYFLPK 82005721
82005841 GTDVYPLIGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNDAFVAFSS 82005981
82007190 GKRICVGEALARMELFLYFTSILQRFSLRSLVPPADIDIAHKISGFGNIPPTYELCFMAR 82007369
>Cyp2j3 U39943
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYP
PGPWRLPLVGCLFHLDPKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFT
QMEHNFLNRPVTLLRKHLFNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQ
EEAYHLVEAIKDEGGLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEA
MCLESSMMCQLYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRD
FIDAFLKEMAKYPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ
EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAG
FNLPKGTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSMGKRACLG
EQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL
>CYP2J3 91% to mouse 2j9 exon 8 in a seq gap
116772039 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ 116771830
116767788 FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN 116767791
116766010 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG 116765861
116765445 GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ 116765284
116760602 LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAK 116760426
116758387 YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ 116758247
116754923 EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPK 116754735
GTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM
116749991 GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 116749815
>Cyp2j3-ps1 U40000
24 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLD 203
204 PKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHL 383
384 FNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQGEAYHLVEAIKDEGGLPFDP 563
564 HFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQLYNIFPRILQYL 743
744 PGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEGRDFIDAFLKEMAKYPDKTTTSFNEEN 923
924 LICSTLDLFFAGTETTSTTLRWALLCMALYPEVQEKMQAEIDRVIGQGRQPNLADRDSMP 1103
1104 YTNAVIHEVQRIGNIIPFNVPRKVAVDTYLAGFNLPKGTMILTNLTALHRDPKEWATPDT 1283
1284 FNPEHFLENGQFKKRESFLPFSM 1352
1415 GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 1591
>Cyp2j3-ps2 U40004
13 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLD 192
193 PKQPHLSLQQFVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHL 372
373 FNKNGLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEGGLPFDP 552
553 HFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQLYNIFPRILQYL 732
733 PGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAKYPDKTTTSFNEEN 912
913 LICSTLDLFFAGTETTSTTLRWALLCMALYPEVQEKMQAEIDRVIGQGRQPNLADRDSMP 1092
1093 YTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPKG 1206
RDPKEWATPDTFNPEHFLENGQFKKRESFLPFSMGKRACLGEQLARSEL 1348
1349 FIFITSLIQKFTFKPPVNEKVSLQFRMSVTISPVSHRLCAIPRL 1480
>Cyp2j5-psexons 1-4 69% to 2j5 mouse now a pseudogene ortholog
116785102 MITSLSSLVTSSWAALLLRTLLLAAVTFLFLAGILRRHRPKDYQPGPWRLPFVGNFFQIDFEQSHLVLQK 116784893
116784415 FAKKYGNVFSLELDRPSVVVVTGQPLIKTKMFTHLEQNFANHFVTSVRKRAIGNN 116784251
116781318 GLITSNGQTWKEKRRFALMTLKNFGLGKKSLEQRMHE*AFHLVEARREEG 116781169
116780474 GQPVDLHLINNAVANVICSITFGGRFEYEDCQFQEMPTLLDEALHV 116780337
>Cyp2j4
116734902 MLATAGSLIATIWAALHLRTLLVAALTFLLLADYFKTRRPKNYPPGPWGLPFVGNIFQLDFGQPHLSIQP 116734693
116725983 FVKKYGNIFSLNLGDITSVVITGLPLIKETFTHIEQNILNRPLSVMQERITNKN 116725822
116723426 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRMQEEAHYLVEAIREEK 116723277
116722875 GKPFNPHFSINNAVSNIICSVTFGERFEYHDSRFQEMLRLLDEVMYLETTMISQ 116722714
116718583 LYNIFPWIMKYIPGSHQTVFRNWEKLKLFVSSMIDDHRKDWNPEEPRDFIDAFLKEMSK 116718407
116716306 YPEKTTSFNEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEVQ 116716169
116713582 EKVQAEIDRVIGQKRAASLADRESMPYTNAVIHEVQRMGNIIPLNVPREVAMDTTLNGFHLPK 116713394
116711364 GTMVLTNLTALHRDPKEWATPDVFNPEHFLENGQFKKRESFLPFSM 116711227
116708412 GKRACLGEQLARSELFIFFTSLMQKFTFKPPTNEKLSLKFRNGLTLSPVTHRICAVPRE* 116708233
>Cyp2j4-de6b frag. w
116706163 XXXXXXSFCEENLTCRTLDFLYAGIDTISNRLHWVLLLTCVNPEXX 116706053 exon 6
>Cyp2j16-de2b5b9b frag. x
116691748 KKYGNIFGLNLGDLTSEVITGLLLSKE 116691668 exon 2
116684743 FYDIFPYLMKYIPGITSNCFQKLGKLKLFVSCMTDEHRRDWNPEDPRNFTDALLKEMMK 116684567 exon 5
116677505 GKRACPGEQLARSKLFIFFTALIQKFTF 116677422
116677420 RLGMKSILGLTLSPVTHHI*ALSKQ 116677346 exon 9
>Cyp2j16
116664772 MLATVGSLLAKIWSAINFWTLLLTLLTFLLLADYLKNRRPNNYPPGPWRLPFVGNLFQFDLNISHLHLRIQQ 116664557
116654396 FVKKYGNLISLDFGNISVVVITGLPLIKEALINNEQNFLKRPIVPSRYRVFKDN 116654235
116651622 GIFFANVHKWKEQRRFALTMLKNFGLGKKSLEQCIQEEAHHLVEVIGEEK 116651473
116650955 GQPFDPHFRINNAVSNIICSITFGERFEYDDSQFQELLKLADEVICSEASMTSV 116650794
116640170 LYNVFPLIFKYLPGPHQTVFKNWEKLKSIVANMIDRHRKDWNPDEPRDFVDAFLTEMTK 116639994
116638624 YPDKTTTSFNEENLIATTLDLFFAGTETTSTTLRWALLYITLNPEVQ 116638484
116627938 EKVHSEIDRVIGHGRLPSTDDQDAMPYTNAVIHEVLRMGNIIPLNVPREVTADSTLAGFHLPK 116627750
116624337 GKMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSV 116624200
116612610 GKRACPGEKLAKSELFIFFTALMQNFTFKAPTNEKLSLKLRKGLSLYPVSYRICAVPR 116612437
>Cyp2j16-de5c6c9c frag y
72% to 2j6 mouse
116604392 LYNIFPWIMNYGPGSHQ 116604342 116604222 exon 5
116604342 VFRNWEKLKLFVSCMIDNKQRWVP 116604271 exon 5
116602255 YPEKSTSFSQGHLFCSTLNLFRAGSET 116602175 exon 6
116591992 GKRACPGEQMAISELFSFFAAFMQ 116591921 exon 9
116591919 KFTFHLAINEKLRMKFRNGLTLP*SSHLYC 116591830 exon 9
>Cyp2j17-ps
116584536 MLATASCLVANVCSAIPLWTLLLAALSWLPQKQAPQKQPSRALAPAIFGNLFQFDLDVSQLHSGI*PSKK 116584327 exon 1
116581102 FVTKYGNLISLDFGNTSSVIISGLPLIKEALTDM 116581001 exon 2
116580637 EQNLLKCIVLASREHVFKNN 116580578 exon 2 last half
116570454 LYNVFPFIIKYL 116570419 exon 5
116570408 NQTFFRNWENLNLFVSHMMESHRKDWNPVEPRDFIDAFLTYMTKEDD 116570268 exon 5 last half
116566151 KVHSEIDGVTGHGRPPSTGDRDSMPYTNAVIYEVLRMDNINPLKVPREVTADSTLDEFCLSK 116565966 exon 7
116563406 GTMVLINLTALYRESKEWTTQDTFNPEHFLENGMFKKRESF 116563284 exon 8
116559748 KFTFKPPISEKLSLKFRTGLTLSHVSCRI*SIHR 116559647 exon 9
>Cyp2j18-ps
63% to 2j6 mouse
116551335 MLGTQDILEAGIWALLH 116551285 exon 1
116551282 RTLLLAAVTFLLLADYLKTGNK 116551217 exon 1
116551217 KKYPWGPCNPPVMNNLFQLDLEQ 116551149 exon 1
116537661 LYNAFLSIMKYHPGSHQ 116537611 exon 5
116537611 VFRNWEKLIWRMSHIAENHCKG*NPAEL 116537528 exon 5
116537523 REFIDAFLTKMTK 116537485 exon 5
116534551 YPDKTTTNFNEENLICA 116534501 exon 6
116534498 LEFLFARTEITSTTLSWVLLYLSANPGVQ 116534412 exon 6
116529361 LFIFFTSLMQKFTFKPPISEKLILKFRMGLILSPVCH*ICVVPRQ* 116529224 exon 9
>Cyp2j10 XM_233199 ortholog of mouse Cyp2j12
Predicted GNOMON 86% to 2j12 mouse (LOC313373), mRNA.
2J10 seq specific rev primer matches 116499966-116499989
forward primer 1 = 116515946 116515968
116516004 MLSTEDTLEAAIRALLHFRTLLLAAVTFLFLANYLKTRRPKNYPPGPWRLPFVGNLFQLDVKQPHVVIQK 116515795
116508667 FVKKYGNLTSLDFGTIPSVVITGLPLIKEAFTNTEQNFLNRPVTPLRKRVFNNN 116508506
116505791 GLIMSNGQTWKEQRRFTMTTLKNFGLGKRSLEQRIQEEANYLVEAIGADK 116505642
116505144 GQPFDPHFKINSAVSNIICSITFGERFEYEDSLFQELLRLLDEASCLESSMMCQ 116504983
116500081 LYNVFPTIIKYLPGSHQTVLRNWEKLKLFISCMMDSHQKDWNPDEPRDFIDAFLTEMAK 116499905
116496152 YRDKTTTSFNKENLIYSTLDLFFAGSETTSNILRWSLLYITTNPEVQ 116496012
116489147 EKVHSEIDRVIGHRRQPSTGDRDAMPYTNAVIHEVLRMGNIIPLNVPREMTADSTLAGFHLPK 116488959
116488244 GTTILTNLTGLHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSM 116488107
116479687 GKRACPGEQLARTELFIFFTALMQNFTFKPPVNETLSLKFRNGLTLAPVSHRICAVPRQ 116479511
>Cyp2j13 XM_233198 1455 bp ortholog of mouse Cyp2j13
Predicted GNOMON Rattus norvegicus similar to CYP2J4 (LOC313372), mRNA.
Missing exon 1 74% to XM_233199, 79% to 2J4 78% to 2J3 90% to 2j13 mouse
116449294 FVKKYGNVISLDLGIMSSVIISSLPLIKEAFSHLDENFINRPIFPLQKHIFNDN 116449133
116446157 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAHHLVEAIGEEE 116446008
116445630 GQPFDPHFKINNAVSNIICSITFGERFEYHDSQFQELLKLLDKAMYLGTPMMIH 116445469
116440971 LYNMFPWIIKHLPGQHQTLLATWGKLKSYIADIIENHREDWNPAEPRDFIDAFLNEMAK 116440795
116428766 YPDKTTTSFNEENLICSTLDLFLAGTETTSTTLRWAVLYMALYPEVQ 116428626
116426881 EKVQAEIDQVIGQEKHPSLADRDSMPYTNAVVHEIQRMGNIVPLNVPREVAVDTTLAGFHLPK 116426693
116426568 GSVVMTNLTALHMDPKEWATPDVFNPEHFLENGQFKKRDSFLPFSM 116426431
116423270 GKRACLGEQLARSELFIFFTALMQKFTFKPPTNEKLSLKFRLGITISPVSHRICAVPRL 116423094
>Cyp2r1 XM_341909
MFQLPGVQTCAGALAGAFLLLLLVLVVRQLLRQRRPAGFPPGPP
RLPFIGNICSLALSADLPHVYMRKQSRVFGEIFSLDLGGISTVVLNGYDVVKECLVHQ
SEIFADRPCLPLFMKMTKMGGLLNSRYGRGWIDHRRLAVNSFHYFGSGQKSFESKILE
ETWSLIDAIETYKGRPFDLKQLITNAVSNITNLILFGERFTYEDTDFQHMIELFSENV
ELAASAPVFLYNAFPWIGILPFGKHQRLFRNADVVYDFLSKLIEKAAVNRKPHLPQNF
VDAYLDEMDKGQNDPLSTFSKENLIFSVGELIIAGTETTTNVLRWAVLFMALYPNIQG
QVHKEIDLIMGHDRRPSWEDKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGY
SIPKGTTVITNLYSVHFDEKYWKDPDMFYPERFLDSSGYFTKKEALIPFSLGRRHCLG
EQLARMEMFLFFTSLLQQFHLHFPHELVPNLKPRLGMTLQPQAYLICAERR
>Cyp2s1 (XM_218347 N-term incorrect) CK473647.1 EST with N-term chr1 (-)
no duplicate exon 4 as in the mouse
81101539 MEAASTWALLLALLLLLLALTLPRTPARGQLPPGPTPLPLLGNLLQLRPGALYSGFLR 81101366
81100365 LSKKYGPVFTVHLGPWRRVVVLVGHDAIREALGGQAEEFSGRGTLATLDKTFDGH 81100201
81097187 GVFFANGERWKQLRKFTLLALRDLGMGKREGEELIQEEVQNLVKAFQRTE 81097038
81095195 GRPFNPSMLLAQATSNVVCSLIFGIRLPYEDKEFQAVIQAASGTLLGISSPWG 81095037
81094957 QAYEMFSWLLQPLPGPHTQLQHHLGTLAAFTIQQVQRHQGRSHTSGPARDVVDAFLQKMA 81094778
81093521 QEKEDPGTEFTEKNLLMTVTYLLFAGTMTIGATIRYALLLLLKYPQVQ 81093378
81090341 KRVREELIQELGPSRTPSLSDRVRLPYTDAVLHEAQRLLALVPMGMPHTVTKTTSFRGYTLPK 81090156
81088138 GTEVFPLIGSVLHDPAVFRNPEEFHPSRFLDDDGRIRKHEAFLPYSL 81087998
81087846 GKRVCLGEGLARAELWLFFTSILQAFSLDTPCPPGDLSLKPAVRGLFNIPPDFQLQVWPTGDQSR 81087652
>Cyp2t1 AF368269 (Genbank translation is incorrect in green region) chr1 (+)
82307700 MVTCEATLLLLLILTLMLMSWGWLAHQARARMQKDLPPGPAPLPLLGNLLQLQSGHLDRVLME 82307888
82308155 LSSRWGPVFTVWLGPRPAVVLSGYAALRDALVLQADAFSGRGSMAVFERFTHGN 82308316
82308634 GIVFSNGPRWRTLRNFALGALKEFGVGTSTIEERILEETACVLDEFQATM 82308783
82309030 GAPFDPRRLLDNAVSNVICTVVFGKRYNYGDPEFLRLLDLFSDNFRIMSSRWGE 82309191
82309932 TYNMFPSFMDWIPGPHHRIFKNFQELRLFISEQIQWHRQSRQTGEPRDFIDCFLEQMDK 82310108
82310183 EHQDPESHFQDETLVMTTHNLFFGGTETTSTTLRYGLLIMLKYPEVA 82310323
82310430 AKVQEELDATVGRTRAPSLADRAHLPYTNAVLHEIQRFISVLPLGLPRALIRDVNLRNHFLHK 82310618
82310840 GTFVIPLLVSAHRDPTQFKDPDHFNPTNFLDDQGEFQNNDAFMPFAP 82310840
82311073 GKRMCLGAGLARSEIFLFLTAILQKFSLLPVGSPADIDLTPQCTGLGNVPPAFQLRLVAR 82311252
>Cyp2u1 XM_227677 gc boundary caused missassembly 90% to mouse 2U1
MSSIGGLRPAAGEQPGVGPHLQAVGGALLLCGLAVLLDWVWLQR
QRAGGIPPGPKPRPLVGNFGYLLLPRFLRLHFWLGSGSQTDTVGRHVYLARLARVYGN
IFSFFIGHRLVVVLSDFQSVREALVQQAEVFSDRPRMPLISILTKEKGIVFAHYGPIW
KQQRRFSHSTLRHFGLGKLSLEPRIIEEFAYVKAEMQKHGEAPFSPFPVISNAVSNII
CSLCFGQRFDYTNKEFKKVLDFMSRGLEICLHSQLFLINLCPWFYYLPFGPFKELRQI
ERDITCFLKNIIKEHQESLDANNPQDFIDMYLLHTQEEKDKCKGTNFDEDYLFYIIGD
LFIAGTDTTTNSLLWCLLYMSLNPGVQKKVHEEIERVIGRDRAPSLTDKAQMPYTEAT
IMEVQRLSMVVPLAIPHMTSEKT (1)
GYSIPKG
TVVLPNLWSIHRDPVIWEKPDDFCPHRFLDDQGQLLKRETFIPFGIGKRVCMGEQLAK
MELFLMFVSLMQSFTFALPEGSEKPIMTGRFGLTLAPHPFNVTVSKR
>Cyp2w1 XM_221971 92% to mouse 2W1
MELLVLCVWGILLLLGLWGLLRGCAQDPSMTRQWPPGPRPLPFL
GNLHLLGVTHQDRALMELSERYGPMFTIHLGSQKTVVLSGYEVVREALVGTGHELADR
PPIPIFQLIQRGGGIFFSSGVHWKVARQFTVRTLQSLGIRQPPMVGKVLQELVCLKGQ
LDSYGGQPFPLTLLGWAPCNITFTLLFGQRFDYQDPVFVSLLSLIDQVMVLLGSPGIQ
LFNTFPRLGALLRLHRPVLSKIEEVRTILRTLLEAQRPPLPNGSPARSYVEALLQQGQ
DDPEDMFSEANVLACTLDMVMAGTETTAATLQWAVFLMVKHPHVQGRVQEELDRVLGP
GQLPQPEDQRALPYTSAVLHEVQRYITLLPHVPRCTAADIQLGGYLLPKGTPVIPLLT
SVLLDKTQWETPSQFNPNHFLDAKGCFMKRGAFLPFSTGRRVCVGESLARTELFLLFA
GLLQQYHLLPPPGLSPADLDLRPAPAFTMRPPAQTLRVVPRS
>Cyp2ab1 (XM_221297 N-terminal incorrect) AC107471.6 N-term 92% to mouse, XM_001059961.1(short)
189790 MFSLFGGMAFLAGSFLLLKLAALCWRRSHLPPGPFPFPLLGNLWQLNFRLHPNMLFQ (0) 189620
LAQTHGNVFTVWLGSTPIVVLNGFRAVKEALVSNSEQFSGRPLTPFFRD
LFGEKGVICSNGLTWRQQRRFCLTTLRELGLGKQALELQLQHEAAELAEVFHQEQGRA
FDPQVPIIRSTTRVIGALVFGHHFLSEEPIFLELIRAINLGLAFASTTWRRLYDMFPW
ALRYLSGPHQKIFQYHEAVRGFIHHEIIRHKLRTPEAPKDFISCYLSQITKAMDDPVS
TFSEENLIQVVIDLFLGGTDTTATTLHWAIIYLVHHRAIQERVQQELDEVLGTAQAVC
YEDRERLPYTRAVLHEVQRLSSVVAVGAVRQCVTPTWMHGYYVSKGTIILPNLASVLC
DPECWETPHQFNPGHFLDKDGDFVTNEAFLPFSAGHRVCPGEQLARMELFLMFATLLR
TFRFQLPEGSQGLRLEYVFGGTLQPQPQKICAVPRLSSLSPREP
>Cyp2ac1 NW_044163.1|Rn9_1523 chromosome 9 XM_001067416.1
XM_236969.3, RGD1564244_predicted
3425457 MSGFDFSAILALLGLILILILNIKDFMAKASKRQCPPGPKPWPVIGNLHILNLKRPYQTMLE 3425272
3423187 LSKKYGPIYSIQMGPRKVVVLSGYETVKDALVNYGNQFGERSQVPIFERLFDGK 3423026
3415443 GIAFAHGETWKTMRRFSLSTLRDFGMGKRTIEDTIVVECQHLIQSFESHK 3415294
3412018 GKPFEIKRVLNASVANVIVSMLLGKRFDYEDPQFLRLLTLIGENIKLIGNPSIV 3411857
3410639 LFNIFPILGFLLRSHKKVLRNRDELFSFIRRTFLEHCHNLDKNDPRSFIDAFLVKQQE 3410466
3410029 ENNKSADYFNEENLLALVSNLFTAGTETTAATLRWGIILMMRYPEVQS 3409886
3408812 KVHDEIHKVVGSAQPRIEHRTQMPYTDAVIHEIQRVANILPTSLPHETSTDVVFKNYYIPK 3408627
3406238 GTEVITLLTSVLRDQTQWETPDAFNPAHFLSSKGRFVKKEAFMPFSV 3406098
3402907 GRRMCAGEPLAKMELFLFFTSLMQKFTFQPPPGVSYLDLDLTPDIGFTIQPLPHKICALLRTSAL* 3402710
>Cyp3a2 M13646
MDLLSALTLETWVLLAVILVLLYRLGTHRHGIFKKQGIPGPKPL
PFLGTVLNYYKGLGRFDMECYKKYGKIWGLFDGQTPVFAIMDTEMIKNVLVKECFSVF
TNRRDFGPVGIMGKAVSVAKDEEWKRYRALLSPTFTSGRLKEMFPIIEQYGDILVKYL
KQEAETGKPVTMKKVFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKTKKLLRFDFFDP
LFLSVVLFPFLTPIYEMLNICMFPKDSIAFFQKFVHRIKETRLDSKHKHRVDFLQLML
NAHNNSKDEVSHKALSDVEIIAQSVIFIFAGYETTSSTLSFVLYFLATHPDIQKKLQE
EIDGALPSKAPPTYDIVMEMEYLDMVLNETLRLYPIGNRLERVCKKDIELDGLFIPKG
SVVTIPTYALHHDPQHWPKPEEFHPERFSKENKGSIHPYVYLPFGNGPRNCIDMRFAL
MNMKLALTKVLQNFSFQPCKETQIPLKLSRQAILEPEKPIVLKVLPRDAVINGA
>Cyp3a9 U46118
MDLIPNFSMETWLLLVISLVLLYLYGTHSHGIFKKLGIPGPKPL
PFLGTILAYRKGFWEFDKYCHKKYGKLWGLYDGRQPVLAITDPDIIKTVLVKECYSTF
TNRRNFGPVGILKKAISISEDEEWKRIRALLSPTFTSGKLKEMFPIINQYTDMLVRNM
RQGSEEGKPTSMKDIFGAYSMDVITATSFGVNVDSLNNPQDPFVEKVKKLLKFDIFDP
LFLSVTLFPFLTPLFEALNVSMFPRDVIDFFKTSVERMKENRMKEKEKQRMDFLQLMI
NSQNSKVKDSHKALSDVEIVAQSVIFIFAGYETTSSALSFVLYLLAIHPDIQKKLQDE
IDAALPNKAHATYDTLLQMEYLDMVVNETLRLYPIAGRLERVCKTDVEINGVFIPKGT
VVMIPTFALHKDPHYWPEPEEFRPERFSKKNQDNINPYMYLPFGNGPRNCIGMRFALM
NMKVALVRVLQNFSFQPCKETQIPLKLSKQGLLQPEKPLLLKVVSRDETVNGA
>3A9-se1[1:2:4]
193 kb downstream of Cyp3a9 on opposite strand
100% match to Cyp3a9
MDLIPNFSMETWLLLVISLVLLYL
YGTHSHGIFKKLGIPGPKPLPFLGTILAYRK
LYDGRQPVLAITDPDIIKTVLVKECYSTFTNRR
>Cyp3a18-se1[5:6:7:8] pseudogene z 64% to 3A18
9216052 FGPVGFMKKAVTISEDDEGKRLRPLLSPVFTSGK 9216153
9216386 LWLLHFGLMCSPSSGSVMSVKHLRQEEKGEPIHMKE 9216493
9216722 FSGAYSMNGIAGASFGVNVDSLNN 9216781
XXXXXXXXXXXXXXXXXXXXXXXX
VVLFPFLTQI
>Cyp3a18 X79991
MEIIPNLSIETWVLLATSLMLFYIYGTYSHGLFKKLGIPGPKPV
PLFGTIFNYGDGMWKFDDDCYKKYGKIWGFYEGPQPFLAIMDPEIIKMVLVKECYSVF
TNRRCFGPMGFMKKAITMSEDEEWKRLRTILSPTFTSGKLKEMFPLMRQYGDTLLKNL
RREEAKGEPINMKDIFGAYSMDVITGTSFGVNVDSLNNPQDPFVQKAKKILKFQIFDP
FLLSVVLFPFLTPIYEMLNFSIFPRQSMNFFKKFVKTMKKNRLDSNQKNRVDFLQLMM
NTQNSKGQESQKALSDLEMAAQAIIFIFGGYDATSTSISFIMYELATRPNVQKKLQNE
IDRALPNKAPVTYDALMEMEYLDMVVNESLRLYPIATRLDRVSKKDVEINGVFIPKGT
VVTIPIYPLHRNPEYWLEPEEFNPERFSKENKGSIDPYVYLPFGNGPRNCIGMRFALI
SMKLAVIGVLQNFNIQPCEKTQIPLKISRQPIFQPEGPIILKLVSRD
>Cyp3a23/3a1 D13912
MDLLSALTLETWVLLAVVLVLLYGFGTRTHGLFKKQGIPGPKPL
PFFGTVLNYYMGLWKFDVECHKKYGKIWGLFDGQMPLFAITDTEMIKNVLVKECFSVF
TNRRDFGPVGIMGKAISVSKDEEWKRYRALLSPTFTSGRLKEMFPVIEQYGDILVKYL
RQEKGKPVPVKEVFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKAKKLLRIDFFDPLF
LSVVLFPFLTPVYEMLNICMFPKDSIEFFKKFVYRMKETRLDSVQKHRVDFLQLMMNA
HNDSKDKESHTALSDMEITAQSIIFIFAGYEPTSSTLSFVLHSLATHPDTQKKLQEEI
DRALPNKAPPTYDTVMEMEYLDMVLNETLRLYPIGNRLERVCKKDVEINGVFMPKGSV
VMIPSYALHRDPQHWPEPEEFRPERFSKENKGSIDPYVYLPFGNGPRNCIGMRFALMN
MKLALTKVLQNFSFQPCKETQIPLKLSRQGLLQPTKPIILKVVPRDEIITGS
>Cyp3a62 AB084894 80% to CYP3A9, 78% to Cyp3a13
MDLIPNISLETWMLLATILVLLYLYGTSTHGNFKKLGISGPKPL
PFVGNILAYRHGFWEFDRHCHKKYGDIWGFYEGRQPILAITDPDIIKTVLVKECYSTF
TNRRSFGPAGILKKAITLSEDEEWKRLRTLLSPTFTSGKLKEMFPIINQYADLLVKNV
KHEAEKGNPITMKDIFGAYSMDVITGTSFGVNVDSLNNPQNPFVQKVKKLLKFNFLDP
FFLSVILFPFLTPVFEAFDITVFPKDVMKFFRTSVERMKENRMQEKVKQRLDFLQLMI
NSQSSGDKESHQGLTDVEIVAQSIFFIFAGYETTSSALSFALYLLATHPDLQKKLQDE
IDAALPNKAPVTYDVLVEMEYLDMVLNETLRLFPVGGRLERVCKKDVEINGVFIPKGT
VVMVPTFALHKDPKCWPEPEEFCPERFRKKNQDSINPYIYLPFGNGPRNCIGMRFALM
NMKIALVRVLQNFSFGLCKETQIPLKLRKKGFFQPEKPIILRAVSRD
>3a62-de11b rat
GenEMBL NW_001084671.1 11932479-11932637
11932637 KKNQDSISPYVYLPFGFRPRNCIGMRFALMNMKVALVRVLQNFSFQPCKEIQL 11932479
>Cyp3a71-ps new pseudogene 78% to 3A2
9497641 LFEWHTPVFAITDREMIKNVLVKECFSVFTNWR 9497543 exon 4
9468424 DLGPMGIMNKSIAF*KDEEWKRYRALLSPMFTSGKLKV 9468311
9468234 MFPIIKLYGDILVKYLRQEAEKGKPVSVKE 9468145
9467886 IFGAYSMDVITSTSFGVNVDSLNNPKDPFVEKTKKFLRLDYFDPLFISV 9467737
9464756 GLFPFLKPIYDMLNISVFPKDSIAFFKNFVYSMKESHLDSKQK 9464631
9453844 YQVDFFQLMMNAHNNSSESHK 9453782
9451670 FPALSDIEIIAQSIIFTFGGYDTTSSTLSFVLYSLATHSDVQKKLQEEIDHALPNK 9451503
9449744 ASPTYDIVMEMEYLDMVFNETLRLYPVTGRLHRMCKKDIELDGVFIPKG
SMVMIPLYPLQHDPQHWPEPEEFRPE 9449520
9447351 RFSKENKCRTGHYVYLPFGNGPRNCLGMRFALMSMKLAVTKVLQNFSFHPCKET 9447190
9444444 QIPLKLSKQVILKPEKPIVLKVVPRDGVING 9444352
>Cyp3a73 chr12_random_1.5 (from UCSC browser)
MDLVSALSLETWLLLAIILVLFYR (2)
FGTRTHGIFKKQGIPGPKPLPFLGTVLNYYR (0)
GLWKFDMECYKKCGKIWG (2)
LFDGQTPVFAIMDTEMIKSVLVKECFSVFTNRR (0)
NIGPVGIMSKSISVAKDEEWKRYRAFLSPTFTSGRLKE (0)
MFPIIEHYGDILVKYLKQKVEKGKPLAMKE (2)
VFGAYSMDVITSTSFEVNINSINNPKDPFVEKVKKFQRFDFFDPLFLSV (1)
VLFPFLTPIYEMLNICLFPKDSVAFFQKFVYRMKQTRLDSKHK (0)
HRVDFLQLMMNAHNNSKDKVSHK (1)
ALSDIEIVAQAIIFIFASYETTSSTLSFVLYSLATHPDSQKKLQEEIDRALPNK (0)
APPTYDTVMEMEYLDMVLNETPRLYPIGYRLERVCKKDIKLDGVFIPKGSVVMIPFYTLQHDPQHWPEPEEFLPER (2)
FSKENKGSIDPYVYLPFGNGPRNCIGMRFALMNMKLALTKVLQNFSFQLCEETQ (0)
IPLKLSRQRLFGPEKPIVLKVVPRDAVITGA*
>Cyp3a85-ps rat
GenEMBL NW_001084671.1 4655562-4666355 (-) strand
86% to 3A2 new gene
(numbering is relative, taken from a larger sequence)
missing first four exons on two different contigs
67133 DLGPLGIMSKAITFSKVEEWKRYRAFLSPTFTSGKLKE 67020
66942 MFPIVEQHGDILVKYLRREDEKGKPAPVKQ 66853
VFGAYSMDEITSTSFGVSVD 66373
66372 FLTNPKDTFVEKTKKLLRFDFFDPLFLSVG
63877 LFPFLTPIYEMLNICMFPKDSIAFLQKFVYRMKETRLDSKHK
62887 HRVDFLQLMLNAHNNSKDEVSHK 62819
60567 ALSDVEIIAQSVTFIFAGYEITSSTLSFVLYSLATYPDIQKKLQEEIDGALPNK 60406
58571 APPTYDIVMEMEYLDMVLNETLRLYPVGNRLERVCKKDIELGGVFIPKGSVV 58416
58415 MIPTYPLQRDPQHWTEPEEFHPER 58344
56749 FSKENKGSIDPYVFLPFGHGPRNCIGMRFALMNMKLALTKVLQNFSFQPCKETQ 56588
53720 IPLKLSIQAILEPEKAIVLKVVPWDAVITGA 53628
>Cyp4a1 M14972 NM_175837 1 AA DIFF
MSVSALSSTRFTGSISGFLQVASVLGLLLLLVKAVQFYLQRQWL
LKAFQQFPSPPFHWFFGHKQFQGDKELQQIMTCVENFPSAFPRWFWGSKAYLIVYDPD
YMKVILGRSDPKANGVYRLLAPWIGYGLLLLNGQPWFQHRRMLTPAFHYDILKPYVKN
MADSIRLMLDKWEQLAGQDSSIEIFQHISLMTLDTVMKCAFSHNGSVQVDGNYKSYIQ
AIGNLNDLFHSRVRNIFHQNDTIYNFSSNGHLFNRACQLAHDHTDGVIKLRKDQLQNA
GELEKVKKKRRLDFLDILLLARMENGDSLSDKDLRAEVDTFMFEGHDTTASGVSWIFY
ALATHPKHQQRCREEVQSVLGDGSSITWDHLDQIPYTTMCIKEALRLYPPVPGIVREL
STSVTFPDGRSLPKGIQVTLSIYGLHHNPKVWPNPEVFDPSRFAPDSPRHSHSFLPFS
GGARNCIGKQFAMSEMKVIVALTLLRFELLPDPTKVPIPLPRLVLKSKNGIYLYLKKL
H
>Cyp4a2 M57719 M33938
MGFSVFSPTRSLDGVSGFFQGAFLLSLFLVLFKAVQFYLRRQWL
LKALEKFPSTPSHWLWGHNLKDREFQQVLTWVEKFPGACLQWLSGSTARVLLYDPDYV
KVVLGRSDPKPYQSLAPWIGYGLLLLNGKKWFQHRRMLTPAFHYDILKPYVKIMADSV
SIMLDKWEKLDDQDHPLEIFHYVSLMTLDTVMKCAFSHQGSVQLDVNSRSYTKAVEDL
NNLIFFRVRSAFYGNSIIYNMSSDGRLSRRACQIAHEHTDGVIKTRKAQLQNEEELQK
ARKKRHLDFLDILLFAKMEDGKSLSDEDLRAEVDTFMFEGHDTTASGISWVFYALATH
PEHQERCREEVQSILGDGTSVTWDHLDQMPYTTMCIKEALRLYSPVPSVSRELSSPVT
FPDGRSIPKGIRVTILIYGLHHNPSYWPNPKVFDPSRFSPDSPRHSHAYLPFSGGARN
CIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPMPRLVLKSKNGIHLRLKKLR
>Cyp4a8v1 M37828
MSGSALSFTIFPGSILGFLQIATVLTVLLLLLKTAQFYLHRRWL
LRATQQFPSPPSHWFFGHKIPKDQDFQDILTRVKNFPSACPQWLWGSNVRIQVYDPEY
MKLILGRSDPKAHGSYRFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDTLKPYVGIM
ADSVRIMLDKWEQIVGQDSTLEIFQHITLMTLDTIMKCAFSQEGSVQLDRKYKSYIKA
VEDLNNLFFFRVQNMFHQNDFIYSLSSNGRKAHNAWQLAHDYTDQVIKSRKAQLQDEE
ELQKVKQKRRLDFLDILLFARIENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYA
LATNPEHQQGCRKEIQSLLGDGASITWDDLDKMPYTTMCIKEALRIYPPVTAVSRMLS
TPVTFPDGRSLPKGITVMLSFYGLHHNPTVWPNPEVFDPYRFAPESSRHSHSFLPFSG
GARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPIPIPRLVLKSKNGIYLRLKKLQ
>Cyp4a8v2 97% TO 4A8v1 BC081771
MSGSALSFTIFPGSILGFLQIATVLTVLLLLFKTAQFYLHRRWL
LRATQQFPSPPSHWFFGHKIPKDQEFQDILTRVKNFPSACPQWLWGSNVRIQVYDPDY
MKLILGRSDPKSHHSYRFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDTLKPYVGIM
ADSVRIMLDKWEQIVGQDSTLEIFQHITLMTLDTIMKCAFSQEGSVQLDRKYKSYIKA
VEDLNNLSFFRIRNIFHQNDIIYSLSSNGRKARSAWQLAHEHTDQVIKSRKAQLQDEE
ELQKVKQKRRLDFLDILLFARIENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYA
LATNPEHQQGCRKEIQSLLGDGASITWDDLDKMPYTTMCIKEALRIYPPVTAVSRMLS
TPVTFPDGRSLPKGITVMLSFYGLHHNPTVWPNPEVFDPYRFAPESSRHSHSFLPFSG
GARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPIPIPRLVLKSKNGIYLRLKKLQ
>Cyp4a8v2-de1b rat
UCSC browser a in fig
CYP4A exon 1 pseudogene chr5:135545150-135545338 (- strand)
135545338 MSIFELSHITTGFGISGLLQMVSWLGLLLLLLFKAAQYYLHRQWIIKSVQHFPSPPSHWFFGN 135545150
>Cyp4a8v2-de5c6c12c rat
UCSC browser b in fig
135596695-135596537 (- strand) exon 5
SVLLPQNKWE*TISDSLEIFQCASLITLATILMCVFSY*DNVHLN
135596212-135596066 (- strand) exon 6
HSQTYTQVVGILNNLRNAFPQSDIFYRMTADGHRTKNAFLIAHKHSDFV
135594009-135593884 (- strand) exon 12
KQFIMNEMKVVITLTLLCFEWLLDPTRVSVSISGFLLNPRMG
>Cyp4a8v2-de4d12d rat
UCSC browser c in fig
135628319-135628215 (- strand) A exon 4
51% to 4A8
LSNDQTWFQHY*HI*TPLLHCDILKSNVRIVADCI
135617441-135617274 (- strand) B exon 12
75% to 4A2 76% to 4A8
RICIGKQLAMNAQKLAVALTLLQFELLPDPTRVPIPTEKLVLKSKNGIHLHLRKLQ
>Cyp4a34-ps new pseudogene seq between 4A3 and 4A2 T
65% to 4A2 135837702- 135845966 (+ strand)
MGIFELSHITTVFGISRLLQMVFWLGLLLLLFKAAQYYLRRQWIIKSFQQFPFPPSHWLFGNFLK
135838759 KDQDLQQIRLWVEKFPTACVRWFWGNHACVLIYDPD*MKVILG*S 135838887 aa 65-107
(seq gap)
GYSLLLLNGKKWFQHRQMLTPAFHSDILKPYVGIMA
FSIFLLQDKWEELVGQDCPLEIYQDISLMTMETLINCAFSYQGSVQLE
NSRS*IKAVEDLTHLIHFRVRNGFH*SNIIYNLSSNGGSFHCACQIAHKHKG
DRVIRRRKVQLQSGVELEKIWKKWHLDLLDILLFAQ
EDGKSLSDEDLHAEVDTFMFEGHDTAARGISWIFYALPTHPEHQERCKEEVQSILGDGTSVTW
DHLDQMPYTTMCIKKALRLYPPGPAVSRELSTPVTFPDGCSSPKNSRISVV
IFGLHHNPRL*PNPE
VLDPFRFAPDVPQHTHAFLPFSAGAR
NCIRKLFAMNELKVAVTLTLL*LELLPDPTRVPFLVARTVLKSKIRIYLHLKKLK
>Cyp4a34-ps-de12b C-term aa 453-508 with one frameshift
135831318 RNCIGEHFAMNELKVAMALTLL 135831383
135831383 QFELLPDPTRIPIPIPRLVLKSKNGIYLHLKKLQ 135831484
>Cyp4a3 M33936
MGFSVFTPTRSLDGVSGFFQGAFLLSLFLVLFKAVQFYLRRQWL
LKALEKFPSTPSHWLWGHDLKDREFQQVLTWVEKFPGACLQWLSGSKTRVLLYDPDYV
KVVLGRSDPKASGIYQFLAPWIGYGLLLLNGKKWFQHRRMLTPAFHYGILKPYVKIMA
DSVNIMLDKWEKLDDQDHPLEIFHYVSLMTLDTVMKCAFSHQGSVQLDVNSRSYTKAV
EDLNNLTFFRVRSAFYGNSIIYNMSSDGRLSRRACQIAHEHTDGVIKMRKAQLQNEEE
LQKARKKRHLDFLDILLFAKMEDGKSLSDEDLRAEVDTFMFEGHDTTASGISWVFYAL
ATHPEHQERCREEVQSILGDGTSVTWDHLDQIPYTTMCIKEALRLYPPVPSVSRELSS
PVTFPDGRSIPKGITTTILIYGLHHNPSYWPNPKVFDPSRFSPDSPRHSHAYLPFSGG
ARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPMARLVLKSKNGIHLRLKKLR
>Cyp4a33-ps 135689641-135700825 (+) between 4A8 and 4A2
DKWEQIVGQDSTLEIVQHNTLMTLDTIMKCAFSQEGSVQLDR
KYKSYIKAVGDLNNLSFFRIWNIFHQNDIIYSLSSNGCQANSAC*LAHEHT
DQVIKSRKAQLQDEEELQKVKQKRRLDFLDILLFAR
IENGSSLSDKDLRAEVDTFMFEGHDTTASGISWIFYALATNPEHQQGCRKEIQSLLGDGASITW
DDLDKMPYTTMCIKEALSIYPPVPSVSRMLSTPVTFPDGCSLPK
GITAVLSFYGHHHNPTL*PNPE
VFDPYRVFPXSSQHSHLFLPFSGGAR
NCIGKQFAMNELKVAIALTLLCLRLLPDPTRIPIPIPRLVLKSKNGIYLHLKKLQ
>Cyp4a33-ps-de4b5b12b rat
UCSC browser
135662099-135662209 (+ strand) F exon 4
LSNDQTWFQH*HILTPLFHYGILKTNVRIIVDSVHEM
135671387-135671512 (+ strand) G exon 5
DISDSLEIFQCASLIALATIMMCAFSYQDNVHLNRSVTSQSF
135676914-135677039 (+ strand) S exon 12
KQFIMNEMKVVITLTLLCFEWLLDPTRVSVSISGFLLNPRMG
>Cyp4a33-ps-de10c11c rat
UCSC browser
135712750-135712827 (+ strand) exons 10
GVLISFSICGLHHNPRLWPNAE (0)
135713048-135713125 (+ strand) R exon 11
VFDPFRFAPDVLRHTHAFLPFSAGAR
>Cyp4b1 M29853
MVLNFLSPSLSRLGLWASVVILMVIVLKLFSLLLRRQKLARAMD
SFPGPPTHWLFGHALEIQKLGSLDKVVSWAQQFPHAHPLWFGQFVGFLNIYEPDYAKA
VYSRGDPKAADVYDFFLQWIGKGLLVLDGPKWFQHRKLLTPGFHYDVLKPYVAIFAES
TRMMLDKWEKKASENKSFDIFCDVGHMALDTLMKCTFGKGDSGLGHRDNSYYLAVSDL
TLLMQQRIDSFQYHNDFIYWLTPHGRRFLRACKIAHDHTDEVIRQRKAALQDEKERKK
IQQRRHLDFLDILLGVRDESGIKLSDAELRAEVDTFMFEGHDTTTSGISWFLYCMALY
PEHQQLCREEVRGILGDQDSFQWDDLAKMTYLTMCMKECFRLYPPVPQVYRQLNKPVT
FVDGRSLPAGSLISLHIYALHRNSTVWPDPEVFDPLRFSPENAAGRHPFAFMPFSAGP
RNCIGQQFAMNEMKVVTALCLLRFEFSLDPSKMPIKVPQLILRSKNGIHLYLKPLASR
SGK
>Cyp4f39 UPSTREAM OF 4F5 chr7 (+) 94% to mouse 4f39 = ortholog
CK475948.1 EST
13051717 MLPITDYLLYLLGLEKTAFRVYVLSALLLFLLFLLFRLLLQAFKLFS
DFRITCRRLSCFPEPPGRHWLLGHMSM 13051938
13054230 YLPNEKGLQNEKKVLDTMHHIILAWVGPFLPLLVLVHPDYIKPVLGAS 13054373
13064279 AAIAPKDEFFYSFLKPWL 13064332
13064958 GDGLLISKGNKWSRHRRLLTPAFHFDILKPYMKIFNQSVNIMH 13065086
13065271 AKWRRHLAEGSVTSFDMFEHVSLMTLDSLQKCVFSYSSDCQE 13065396
13067956 KLSDYISSIIELSALVVRRQYRLHHYLDFIYYLTADGRRFRQACDTVHNFTTEVIQQRRR
ALRELGAEAWLKAKQGKTLDFIDVLLLAK 13068222
13072211 DEEGKELSDEDIRAEADTFMFE 13072276
13072388 GHDTTSSGLSWALFNLAKYPEYQDKCREEIQEVMKGRELEELDW 13072519
13076560 DDLTQLPFTTMCIKESLRQFPPVTLISRRCTEDIKLPDGRIIPK 13076691
13078724 GIICLVSIYGTHYNPLVWPDSK 13078789
13079451 VYNPYRFDPDIPQQRSPLAFVPFSAGP 13079531
13079947 RNCIGQSFAMAEMRVVVALTLLRFRLSVDRTRKVRRKPELILRTENGLWLNV
EPLPSRAGVPRGPTEPEVQAPPAQA* 13080177
>Cyp4f17 = CYP4F19temp AI030199 EST CHR7 13095557 13103056 chr7 (+)
90% to 4f17 next closest 82%, probable ortholog of 4f17
13095557 MLQLSLSWLGRGPVTVSPWQLLLVVGTSLLLARILAWISAFYDN
YCRLRCFPQPPSRHWFWGHLNL 13095754
13102916 VKNNEEGLQLLAEMSHQFQDIHLCWIGIFYPILRLIHPKFIGPILQA 13103056
13103866 AAAVAPKEMIFYGFLKPWL 13103922
13104011 GDGLLVSAGEKWSRQRRLLTPAFHFDILKPYVKNFNKSVNIMH 13104139
13105431 AKWQRLTAKGSARLDMFEHISLMTLDSLQKCVFSFDSNCQE 13105553
13106281 SPSEYIAAIQELSSLIVKRHHQPFLYMDFLYYLTADGRRFRKACDLVHNFTDAVIRERRR
TLSSQSVDEFLKSKTKSKTLDFIDVLLLAK 13106550
13106863 DEHGKELSDEDIRAEADTFMFG 13106928
13107118 GHDTTASALSWILYNLARHPEHQERCRQEVRELLRDREPEEIEW 13107249
13110034 DDLTQLPFLTMCIKESLRLHPPVTVISRCCTQDVVLPDGRVIPK 13110165
13110226 GNDCIISIFGVHHNPSVWPDPE 13110303
13110457 VYDPFRFDSENPQKRSPLAFIPFSAGP 13110537
13110878 RNCIGQTFAMNEMKVAVALTLLRFRLLPDDKEPRRKPELILRAEGGLWLRVEPLSTGAQ 13111054
>Cyp4f5 13119940 13133265 chr7 (+) 3 aa diffs to mRNA U39207 90% to 4f16 89% to 4f37, probable ortholog of 4f16 based on location
13119940 MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCENCSRLRCFPQSPKRNWFLGHLGT 13120137
13122954 IQSNEEGMRLVTEMGQTFRDIHLCWLGPVIPVLRLVDPAFVAPLLQAP 13123097
13125947 ALVAPKDTTFLRFLKPWL 13126000
13126086 GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNIMH 13126214
13227610 VKWKHLCVEGSAHLEMFENISLMTLDSLQKCLFGFDSNCQE 13227732
13128135 SPSEYISAILELSSLIIKRSQQLFLYLDFLYYRTADGRRFRKACDLVHNFTDAVIRERRR
LLSSQGTDEFLESKTKSKSKTLDFIDVLLLAK 13128410
13129071 DEHGKELSDEDIRAEADTFMFG 13129136
13129327 GHDTTASALSWILYNLARHPEYQERCRQEVWELLRDREPEEIEW 13129458
13132295 DDLAQLPFLTMCIKESLRLHPPAIDLLRRCTQDIVLPDGRVIPK 13132426
13132538 GNICVISIFGIHHNPSVWPDPE 13132603
13132764 VFDPFRFDSENRQKRSPLSFIPFSAGP 13132844
13133089 RNCIGQTFAMNEMKVVVALTLLRFRVLPDDKEPRRKPEIILRAEGGLWLRMEPLSTDTQ 13133265
>CYP4F5? AF288818 7aa diffs to 4F5 probably same gene
MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCEN
CSRLRCFPQSPKRNWFLGHLGT
IQSNEEGMRLVTEMGQTFRDIHLCWLGPVIPVLRLVDPAFVAPLLQAP
ALVAPKDPTFLHFLKPWL
GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKIFNQSVNIMH
AKWKHLCLEGSVRLEMFENISLMTLDSLQKCLFGFDSNCQE
SPSEYISAILELSSLIIKRSQQLFLYLDFLYYRTADGRRFRKACDLLHNFTDAVIRERRR
LLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK
DEHGKELSDEDIRAEADTFMFG
GHDTTASALSWILYNLARHPEYQERCRQEVWELLRDREPEEIEW
DDLAQLPFLTMCIKESLRLHPPAIDLLRRCTRHIVLPDGRVIPK
GNICVISIFGIHHNPSVWPDPE
VFDPFRFDSENRQKRSPLSFIPFSAGP
RNCIGQTFAMNEMKVVVALTLLRFRVLPDDKEPRRKPEIILRAEGGLWLRMEPLSTDTQ
>Cyp4f37 94% to 4F5 chr7 (+) 89% to 4f16 88% to 4f37
13149326 MPWLTVSGLDLGSVVTSTWHLLLLGAASWILARILAWTYSFCENCSRLRCFPQSPKRNWFLGHLGV 13149523
13159662 IQSNEEGMQLVTEMGQTFRDVHLIWLGPVSPVLRLVDPAFVAPLLQAP 13159805
13162623 ALVAPKDPTFLHFLKPWL 13162676
13162768 GDGLFLSSGDKWSRHRRLLTPAFHFDILKPYVKTFNQSVNIMH 13162896
13164072 AKWKHLCLEGSARLEMFENISLMTLDSLQKCLFGFDSNCQE 13164194
13164825 SPSEYISATLELSSLTRKRSYKLFLYLDFLYYRTADGQRFRKACDLVHSFTDAVIRERRR
LLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK 13165100
13165761 DEHGKELSDEDIRAEADTFMFG 13165826
13165992 GHDTTASALSWILYNLASHPEYQERCRQEVWELLRDREPEEIEW 13166123
13168834 DDLAQLPFLTMCIKESLRLHPPAVDLLRRCTQDIVLPDGRVIPK 13168965
13169077 GNICVISIFGIHHNPSVWPDPE 13169142
13169302 VYDPFRFDPENRQKRSPLSFIPFSAGP 13169382
13169627 RNCIGQTFAMNEVKVAVGLTLLRFRFLPDDKEPRRKPELILRAEGGLWLRVELLSRDTQ 13169803
>Cyp4f43-ps pseudogene chr7 (+) strand exons 4, 5, 9, 10, 11, 12
13179984 RDGVFLISFDKWNHHHCLLTPAFHFDNLVL 13180073
*VKIFNQSVNIIH
13181413 VSFLKAKWKCLFSEGSACLEIFENLTTLDSLQKCLFSLDSNCQE 13181544
13207589 NDLAQLPFLTMCIKASLQLYPQDTNLICSCT 13207681
*DILLPDG*VIPK
XXXXXXXXXGVHHSPSVWTDPX
13208327 VYYPFPFDSKNPQKISPLAFMPFSVGP 13208407
13208782 RNCKRQTYPMSERKVALVLKLLHFHTIPGEIDPPRQPELILSLEGRLWLLKESLSVG 13208952
>Cyp4f44-ps pseudogene MISSING EXON 1 AND HALF OF EXON 2 90% to 4f16
13215861 LGPVIPVLRLVDPAFVAPLLQAP 13215929
13219942 ALVAPKDMNFYGFLKPWL 13219995
13220082 GDGLLLSSGDKWNRHRXLTPAFHFDILKPYVKIFNQSVNIMH 13220206
13227610 VKWKHLCVEGSAHLEMFENISLMTLDSLQKCLFGFDSNCQE 13227732
13228571 SPSEYISAILELSSLTIKRSYQLFLYLDFLYYRTADGRRFRKAC
DLVHSFTDAVIRERRRLLSSQGVDEFLESKTKSKSKTLDFIDVLLLAK 13228846
13230087 DEHGKELSDEDIRAEADTFMFG 13230152
13230344 GHDTTASTLSWILYNLARHPEYQESCLQEVWELLRDREPEEIEW 13230475
13239583 DDLAQLPFLTMCIKESLRLHPPAVDLLRRCTQDIVLPDGRVIPK 13239714
13239826 GNICVISIFGIHHNPSVWPDPE 13239891
13240051 VYDPFRFDPESRQKRSPLSFIPFSAGP 13240131
13240378 RNCIGQTFAMNEMKVAVALTLLRFRLLPDDKEPRRKPEIILRAEGGLRLLVEPLSGGA* 13240554
>Cyp4f40 91% to 4f40 next closest = 82% probable ortholog of 4f40
13268033 MRHLDLSWLGLGPMSASPWLLLSLVGVSWFLTRCLTQIYTLYAK
CQRLCGFPQPPKRSWFWGHLGM 13268230
13270621 SPPTEEGMKQMTELVATYPQGFMTWLGPIVPLITLCHPDIIRSVLSAS 13270764
13273375 AAVAPKDGIFYSFLKPWL 13273428
13273519 GDGLLVSASDKWSRHRSMLTPAFHFNILKPYVKIFNDSTNIMH 13273647
13275412 AKWLRLASGGSAHLDMFENISLMTLDTLQKCVFSFNSNCQE 13275534
13276605 KPSEYIAAILELSALVVKRNEQLLLHMDLLYRLTPDGRRFYKACHLVHDF
TYAVIQERRRTLPKHGGDDVIKAKAKSKTLDFIDVLLLSK 13276874
13279547 DEDGKELSDEDIRAEADTFMFE 13279612
13281122 GHDTTASGLSWILYNLAKHPEYQERCRQEVQELLRDRDSEEIEW 13281253
13281122 DDLAQLPFLTMCIKESLRLHPPVTMVSRCCTQDISLPDGRVIPK 13281253
13281331 GIICIINIFATHHNPTVWQDPE 13281396
13281524 VYDPFRFDPENIQARSPLAFIPFSAGP 13281604
13281923 RNCIGQTFAMNEMKVAVALTLLRFRVLPDDKEPRRKPELILRAEDGLWLRVEPLSAQA 13282096
>Cyp4f4 U39206 chr7 (+) strand 92% to 4f15 next closest 83% probable ortholog of 4f15
13293478 MPQLDLSWLGLGPMSASPWLLLLLVGASWLLVRVLTQTYIFYRT
YQHLCDFPQPPKWNWFLGHLGM 13293675
13296360 ITPTEQGLKQVTKLVATYPQGFMTWLGPILPIITLCHPDVIRSVLSA 13296500
13298373 SASVALKEVIFYSFLKPWL 13298429
13298517 GDGLLLSDGDKWSCHRRMLTPAFHFNILKPYVKIFNDSTNIMH 13298645
13301094 AKWQDLASGGSARLDMFKNISLMTLDSLQKCVFSFDSNCQE 13301216
13303748 KPSEYISAILELSALVAKRYQQLLLHTDSLYQLTHNGRRFHKACKLVHNFTDAVIQGRRR
ALPSQHEDDILKAKARSKTLDFIDVLLLTK 13304017
13305908 DEDGKELSDEDIRAEADTFMFE 13305973
13306182 GHDTTASGLSWILYNLARHPEYQERCRQEVRELLRDRESTEIEW 13306313
13307962 DDLAQLPFLTMCIKESLRLHPPVTVISRRCTQDIVLPDGRVIPK 13308093
13308178 GVICIINIFATHHNPTVWPDPE 13308243
13308394 VYDPFRFDPENIKDRSPLAFIPFSAGP 13308474
13308851 RNCIGQTFAMNEMKVALALTLLRFRVLPDDKEPRRKPELILRAEGGLWLRVEPLSTQ 13309021
>Cyp4f1 M94548 chr7 12 exons (-) strand 95% to 4f14 probable ortholog
13600726 MSQLSLSWLGLGPEVAFPWQTLLLFGASWILAQILTQIYAAYRN
FRRLRGFPQPPKRNWLMGHVGM 13600529
13598616 VTPTEQGLKELTRLVGTYPQGFLMWIGPMVPVITLCHSDIVRSILNAS 13598473
13595808 AAVALKDVIFYTILKPWL 13595755
13595666 GDGLLVSAGDKWSRHRRMLTPAFHFNILKPYVKIFNDSTNIMH 13595538
13595386 AKWKRLISEGSSRLDMFEHVSLMTLDSLQKCVFSFDSNCQE 13595264
13593683 KSSEYIAAILELSALVAKRHQQPLLFMDLLYNLTPDGMRFHKACNLVHEFTDAVIRERRR
TLPDQGLDEFLKSKAKSKTLDFIDVLLLTK 13593414
13592552 DEDGKELSDEDIRAEADTFMFE 13592552
13592326 GHDTTASGLSWILYNLANDPEYQERCRQEVQELLRDRDPEEIEW 13592195
13591103 DDLAQLPFLTMCIKESLRLHPPVTVISRCCTQDILLPDGRTIPK 13590972
13590900 GIICLISIFGIHHNPSVWPDPE 13590835
13590677 VYNPFRFDPENIKDSSPLAFIPFSAGP 13590597
13590229 RNCIGQTFAMSEMKVALALTLLRFRLLPDDKEPRRQPELILRAEGGLWLRVEPLTAGAQ 13590053
>Cyp4f6 U39208 chr7 (-) strand 91% to 4f13 probable ortholog
13635825 MLQLSLSRLGMGSLTASPWHLLLLGGASWILARILAWIYTFYDN
CCRLRCFPQPPKPSWFWGHLTL 13635628
13629996 MKNNEEGMQFIAHLGRNFRDIHLSWVGPVYPILRLVHPNVIAPLLQA 13629856
13620199 SAAVAPKEMTLYGFLKPWL 13620143
13620054 GDGLLMSAGEKWNHHRRLLTPAFHFDILKSYVKIFNKSVNTMH 13619926
13618162 AKWQRLTAKGSARLDMFEHISLMTLDSLQKCIFSFDSNCQE 13618040
13616381 SNSEYIAAILELSSLIVKRQRQPFLYLDFLYYLTADGRRFRKACDVVHNFTDAVIRERRS
TLNTQGVDEFLKARAKTKTLDFIDVLLLAK 13616112
13615792 DEHGKGLSDVDIRAEADTFMFG 13615727
13615538 GHDTTASALSWILYNLARHPEYQERCRQEVRELLRDREPEEIEW 13615407
13610183 DDLAQLPFLTMCIKESLRLHPPVLLISRCCSQDIVLPDGRVIPK 13610052
13609958 GNICVISIFGVHHNPSVWPDPE 13609893
13609762 VYNPFRFDPENPQKRSPLAFIPFSAGP 13609682
13609348 RNCIGQTFAMSEIKVALALTLLRFCVLPDDKEPRRKPELILRAEGGLWL
RVEPLSTVTSQLPWDLLAHPPTS 13609133
>Cyp4f18 XM_224708 ASSEMBLY MODIFIED from Genbank entry 77% to 4F1 chr7 (+) strand
92% to 4f18 probable ortholog, 4f18 is also distant from the 4f cluster in mouse
18197903 MPLLSLSWLGLGHTAASPWLLLLLVGASCLLAYILPQVYAVFEN
SRRLRRFPQPPTRNWLFGHLGL 18198100
18202714 IQSSEEGLLYIQSLSRTFRDVCCWWVGPWHPVIRIFHPAFIKPVILA 18202854
18203794 PASVAPKDRVFYRFLKPWL 18203850
18203937 GDGLLLSTGDKWSRHRHMLTPAFHFNILKPYVKIFNDSTNIMH 18204065
18206788 AKWQRLASQGSARLDMFEHISLMTLDSLQKCVFSFDSNCQE 18206910
18208877 KPSEYITAILELSALVARRHQSLLLYVDLFYHLTRDGMRFRKACRLVHDFTDAVIRERRR
TLPDQGGDDALKAKAKAKTLDFIDVLLLSK 18209146
18211298 DEHGEALSDEDIRAEADTFMFG 18211363
18211538 GHDTTASGLSWILYNLAKHPEYQERCRQEVRELLRDREPEEIEW 18211669
18222463 DDLAQLPFLTMCIKESLRLHPPATAISRCCTQDIMLPDGRVIPK 18222594
18222676 GVICRISIFGTHHNPAVWPDPE 18222741
18223428 VYNPFRFDADNGEGRSPLAFIPFSAGP 18223508
18223827 RNCIGQTFAMSEMKVALALTLLRFRVLPDDKEPRRKPELILRAEGGLWLRVEPLSAGAH 18224003
>Cyp4v3 XM_341440 extra intron in the middle removed, CK366141.1 EST at boundary
there is a gc-at boundary. 92% to mouse 4v3
MLWLWLGLSGQKLLLWGAASAVSVAGATVLLNILQMLVSYARKW
QQMRPIPSVARAYPLVGHALFMKPNNTEFFQQIIQYTEEFRHLPIIKLWIGPVPLVAL
YKAENVEVILTSSKQIDKSFMYKFLQPWLGLGLLTS (2)
TGSKWRARRKMLTPSFHFTILEDFLDVM
NEQANILVNKLEKHVNQEAFNCFFPITLCALDIICETAMGKNIGAQSNGDSEYVRTVY
RMSDMIYRRMKMPWFWFDLWYLMFKEGRDHKKGLKSLHTFTNNVIAERVNARKAEQDC
IGAGRGPLPSKTKRKAFLDLLLSVTDEEGNKLSHEDIREEVDTFMFEGHDTTAAAINW
SLYLLGSNPEVQRKVDKELDDVFGRSHRPVTLEDLKKLKYLDCVIKETLRVFPSVPLF
ARSLSEDCEVAGYKISKGTEAVIIPYALHRDPRYFPDPEEFQPERFFPENSQGRHPYA
YVPFSAGPRNCIGQKFAVMEEKTILACILREFWIESNQKREELGLAGDLILRPNNGIW
IKLKRRHEDDP
>Cyp4x1 AF439343, NM_145675
MEASWLENRWARPLHLALVFCLALVLMQAVKLYLRRQRLLRDLR
PFPGPTAHWLLGHQKFLQEDNMEKLDEIVKEYPCAFPCWVGPFQAFFYIYDPDYAKIF
LSRTDPKTQYLHQLMTPFLGRGLLNLDGPRWFQHRCLLTPAFHQDILKPCVDMMAHSV
NMMLDKWEKTWTTQETTIEVFEHINLMTLDIIMKCAFGQETNCQINGTYESYVKATFE
LGEIISSRLYNFWHHHDIIFKLSPKGHCFQELGKVIHQCTEKIIQDRKKTLKDQVNQD
DTQTSQNFLDIVLSAQAGDEKAFSDADLRSEVNTFMWAGHDASAASISWLLYCLALNP
EHQDRCRTEIRSILGDGSSITWEQLDEIPYTTMCIKETLRLIPPIPSISRELSKPLTL
PDGHSLPAGMTVVLSIWGLHHNPAVWKDPKVFDPLRFTKENSEQRHPCAFLPFSSGPR
NCIGQQFAMLELKVAIALTLLRFRVAADLTRPPAFSSHTVLRPKHGIYLHLKKLPEC
>Cyp5a1 D28773
MEVLGLLKFEVSGTVVTVTLSVVLLALLKWYSTSAFSRLRKLGI
RHPEPSPFVGNLMFFRQGFWESHLELRERYGPLCGYYLGRRMYIVISDPDMIKEVLVE
NFSNFSNRMASGLEPKLIADSVLMLRDRRWEEVRGALMSAFSPEKLNEMTPLISQACE
LLLSHLKHSAASGDAFDIQRCYCCFTTNVVASVAFGIEVNSQDAPEDPFVQHCQRVFA
FSTPRPLLALILSFPSIMVPLARILPNKNRDELNGFFNTLIRNVIALRDKQTAEERRG
DFLQMVLDAQRSMSSVGVEAFDMVTEALSSAECMGDPPQRCHPTSTAKPLTVDEIAGQ
AFLFLIAGHEITTNTLSFITYLLATHPECQERLLKEVDLFMEKHPAPEYCNLQEGLPY
LDMVVAETLRMYPPAFRFTREAAQDCEVLGQHIPAGSVLEIAVGALHHDPEHWPNPET
FDPERFTAEARLQQKPFTYLPFGAGPRSCLGVRLGLLVVKLTLLQVLHKFRFEACPET
QVPLQLESKSALCPKNGVYVKIVSR
>Cyp7a1 J05460
MMTISLIWGIAVLVSCCIWFIVGIRRRKAGEPPLENGLIPYLGC
ALKFGSNPLEFLRANQRKHGHVFTCKLMGKYVHFITNSLSYHKVLCHGKYFDWKKFHY
TTSAKAFGHRSIDPNDGNTTENINNTFTKTLQGDALCSLSEAMMQNLQSVMRPPGLPK
SKSNAWVTEGMYAFCYRVMFEAGYLTLFGRDISKTDTQKALILNNLDNFKQFDQVFPA
LVAGLPIHLFKTAHKAREKLAEGLKHKNLCVRDQVSELIRLRMFLNDTLSTFDDMEKA
KTHLAILWASQANTIPATFWSLFQMIRSPEAMKAASEEVSGALQSAGQELSSGGSAIY
LDQVQLNDLPVLDSIIKEALRLSSASLNIRTAKEDFTLHLEDGSYNIRKDDMIALYPQ
LMHLDPEIYPDPLTFKYDRYLDESGKAKTTFYSNGNKLKCFYMPFGSGATICPGRLFA
VQEIKQFLILMLSCFELEFVESQVKCPPLDQSRAGLGILPPLHDIEFKYKLKH
>Cyp7b1 XM_342218, U36992
MEGATTPDAASPGPLSLLGLLFAVTLLLPVLFLLTRRTRRPCEP
PLIKGWIPYLGMALKFWKDPLAFLQTLQRQYGDTFTVLLGGKYITFVLNPFQYQYVMK
NPKQLSFEKFSRRLSAKAFSVKKLLTDDDLSNDIHRGYLLLQGKSLDGLLETMIQEVK
EIFESRLLKLTDWNTARVFDFCSSLVFEITFTTIYGKILAANKKQIISELRDDFLKFD
DHFPYLVSDIPIQLLRNAEFMQKKIIKCLTPEKVAQMQRRSEIVQERQEMLKKYYGHE
EFEIGAHHLGLLWASLANTIPAMFWAMYYLLQHPEAMEVLRDEIDSFLQSTGQKKGPG
ISVHFTREQLDSLVCLESAILEVLRLCSYSSIIREVQEDMDFSSESRSYRLRKGDFVA
VFPPMIHNDPEVFDAPKDFRFDRFVEDGKKKTTFFKGGKKLKSYIIPFGLGTSKCPGR
YFAINEMKLLVIILLTYFDLEVIDTKPIGLNHSRMFLGIQHPDSDISFRYKAKSWRS
>Cyp8a1 U53855 Rn.73051
MSWAALLGLLAVLLLLLLLLSRRRARRPGEPPLDLGSIPWLGHA
LEFGKDAASFLTRMKEKHGDIFTVLVGGRYVTVLLDPHSYDTVVWDLRTRLDFHPYAI
FLMERIFDLQLPNFNPSEEKARMKPTLMHKDLQALTEAMYTNLRTVLLGDSTEGGSGW
QEKGLLEFSYSSLLSAGYLTLYGVEASPRTHESQALDRDHSADVFRTFRQLDLMLPKL
ARGSLSVGDKDHACSVKSRLWKLLSPAGLASRADRSSWLESYLRHLEEMGVSEDMQAR
ALVLQLWATQGNMGPTAFWLLLFLLKNPEALDAVHAELKRIVWQAEKPVLQMTALPQK
ILDSMPVLDSVLNETLRLTAAPFITREVMADLALPMADRREFSLRRGDRLLLFPFLSP
QKDPEIYTEPEVFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNQCLGKSYAIN
SIKQFVVLLLTHFDLELVSEDTEVPEFDLSRYGFGLMQPEEDVPIRYRTRL
>Cyp8b1 NM_031241, AB009686
MLWGSVLGALLMAVGCLCLSLLPRHRRPWEPPLDKGFVPWLGHT
MAFRKNMFEFLKGMRAKHGDVFTLQLGGQYFTFVMDPLSFGPIIKSTQKVLDFVTYAR
ELVFKVFGYQSMDEDHQMLHVASTKHLMGQGLEDLNRAMLDSLSLVMLGPKGRSLGAR
SWCEDGLFHFCYSILFKAGFLSLFGCTKDKEQDLDEADELFRKFRRFDLLFPRFVYSL
LGPLEWVEVSQLQRLFHQRLSVEQNLEKDGISNWLGFMLRFLRERGMASSMQDKFNFM
MLWASQGNTGPTCFWALLFLLKHQDAMKAVREEATRVLGEARLEAETSFAFTLSALKC
TPVLDSVMEETLRLCATPTLLGVVQEDYVLKMASGQEYQIRRGDKVALFPYLSVHMDP
DIHPEPTTFKYNRFLNPDGTRKVDFYKSGKKIHHYNMPWGSGVSICPGRFFAPSEMKT
FVLLMVMYFDFELVDPDMPVPPIDPRRWGFGTSQPSHEVRFRYRLKPMQ
>Cyp11a1 J05156
MLAKGLCLRSVLVKSCQPFLSPVWQGPGLATGNGAGISSTNSPR
SFNEIPSPGDNGWINLYHFLRENGTHRIHYHHMQNFQKYGPIYREKLGNMESVYILDP
KDAATLFSCEGPNPERYLVPPWVAYHQYYQRPIGVLFKSSDAWRKDRIVLNQEVMAPD
SIKNFVPLLEGVAQDFIKVLHRRIKQQNSGKFSGDISDDLFRFAFESITSVVFGERLG
MLEEIVDPESQRFIDAVYQMFHTSVPMLNMPPDLFRLFRTKTWKDHAAAWDVIFSKAD
EYTQNFYWDLRQKRDFSKYPGVLYSLLGGNKLPFKNIQANITEMLAGGVDTTSMTLQW
NLYEMAHNLKVQEMLRAEVLAARRQAQGDMAKMVQLVPLLKASIKETLRLHPISVTLQ
RYIVNDLVLRNYKIPAKTLVQVASYAMGRESSFFPNPNKFDPTRWLEKSQNTTHFRYL
GFGWGVRQCLGRRIAELEMTIFLINVLENFRIEVQSIRDVGTKFNLILMPEKPIFFNF
QPLKQDLGSTMPRKGDTV
>Cyp11b1-ps pseudogene? XM_343261
MALRVTADVWLARPWQCLHRTRALGTTAKVAPKTLKPFEAIPQY
SRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQ
VESILPHRMPLEPWVAHRELRGLRRGVFLL
gap then two of the next exon
GHDLYPESLKFTHALHSMFTSTTQLILLPKSLTRWTSTQVWKGHFESWDIISEY
GHDLYPESLKFTHALHSMFTSTTQLILLPKSLTRWTSTQVWKGHFESWDIISEY
SHKCIKNVYRELAEGRQKSWSVISEMVAQSTLSMDAIHANSMEIIAEVLTR
TAISLVMTLFELARNPDVQQALQQESLAAEASIAANPQKAISDLPLLRAALKETLR
LYPVGSYLERILNSDLVLQNYHVPAGTFVIIYLYSMGRNPAVFPRPERYMPQRWLERKRS
FQHLAFGFGVRQCLGRRLAEVEMLLLLHH
MPKSFQVETQEKEDVQMAYRFILMPSSIPLLTFRPVS
>Cyp11b1 X15431
MALRVTADVWLARPWQCLHRTRALGTTAKVAPKTLKPFEAIPQY
SRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQ
VESILPHRMPLEPWVAHRELRGLRRGVFLL NGADWRFNRLQLNPNMLSPKAIQSFVPF
VDVVARDFVENLKKRMLENVHGSMSINIQSNMFNYTMEASHFVISGERLGLT GHDLKP
ESVTFTHALHSMFKSTTQLMFLPKSLTRWTSTRVWKEHFDSWDIISEY VTKCIKNVYR
ELAEGRQQSWSVISEMVAQSTLSMDAIHANSMELIA GSVDT TAISLVMTLFELARNPD
VQQALRQESLAAEASIVANPQKAMSDLPLLRAALKETLRLYPVGSFVERIVHSDLVLQ
NYHVPAGTFVIIYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFGVRQCLGRR
LAEVEMLLLLHH MLKTFQVETLRQEDMQMVFRFLLMPSSSPFLTFRPVS
>Cyp11b2 X52766, D00567
MAMALRVTADVWLARPWQCLHRTRALGTTATLAPKT
LKPFEAIPQYSRNKWLKMIQILREQGQENLHLEMHQAFQELGPIFRHSAGGAQIVSVM
LPEDAEKLHQVESILPRRMHLEPWVAHRELRGLRRGVFLLNGAEWRFNRLKLNPNVLS
PKAVQNFVPMVDEVARDFLEALKKKVRQNARGSLTMDVQQSLFNYTIEASNFALFGER
LGLLGHDLNPGSLKFIHALHSMFKSTTQLLFLPRSLTRWTSTQVWKEHFDAWDVISEY
ANRCIWKVHQELRLGSSQTYSGIVAALITQGALPLDAIKANSMELTAGSVDTTAIPLV
MTLFELARNPDVQQALRQETLAAEASIAANPQKAMSDLPLLRAALKETLRLYPVGGFL
ERILNSDLVLQNYHVPAGTLVLLYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAF
GFGVRQCLGRRLAEVEMLLLLHHMLKTFQVETLRQEDVQMAYRFVLMPSSSPVLTFRP
IS
>Cyp11b3 U14907
MALRVTADVWARPWQCLHRTRALGSTATQAPKTLKPFEAIPQYS
RNKWLKMIQILREQSQENLHLEMHQAFQELGPIFRHSAGGAQIVSVMLPEDAEKLHQV
ESILPRRMTLESWVAHRELRGLRRGVFLLNGADWRFNRLQLNPNMLSPKAVQSFVPFV
DVVARDFVENLKKRMLENVHGSMSMDIQSNVFNYTMEASHFVISGERLGLTGHDLNPE
SLKFIHALHSMFKSTTQLMFLPKNLTRWTSTQVWKGHFESWDIISEYVTKCIKNVYRE
LAEGRQQSWSVISEMVAQSTLSMDAIHANSMELIAGSVDTTAISLVMTLFELARNPDV
QQALRQESLAAEASIAANPQKAMSDLPLLRAALKETLRLYPIGSSLERIVDSDLVLQN
YHVPAGTLVIIYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFGVRQCLGRRL
AEVEVLLLLHHMLKIFQVETLRQEDVQMAYRFVLMPNPRLVLTIRPVS
>Cyp17a1 M31681
MWELVGLLLLILAYFFWVKSKTPGAKLPRSLPSLPLVGSLPFLP
RRGHMHVNFFKLQEKYGPIYSLRLGTTTTVIIGHYQLAREVLIKKGKEFSGRPQMVTQ
SLLSDQGKGVAFADAGSSWHLHRKLVFSTFSLFKDGQKLEKLICQEAKSLCDMMLAHD
KESIDLSTPIFMSVTNIICAICFNISYEKNDPKLTAIKTFTEGIVDATGDRNLVDIFP
WLTIFPNKGLEVIKGYAKVRNEVLTGIFEKCREKFDSQSISSLTDILIQAKMNSDNNN
SCEGRDPDVFSDRHILATVGDIFGAGIETTTTVLKWILAFLVHNPEVKKKIQKEIDQY
VGFSRTPTFNDRSHLLMLEATIREVLRIRPVAPMLIPHKANVDSSIGEFTVPKDTHVV
VNLWALHHDENEWDQPDQFMPERFLDPTGSHLITPTQSYLPFGAGPRSCIGEALARQE
LFVFTALLLQRFDLDVSDDKQLPRLEGDPKVVFLIDPFKVKITVRQAWMDAQAEVST
>Cyp19a1 M33986
MFLEMLNPMHYNVTIMVPETVPVSAMPLLLIMGLLLLIRNCESS
SSIPGPGYCLGIGPLISHGRFLWMGIGSACNYYNKMYGEFMRVWISGEETLIISKSSS
MVHVMKHSNYISRFGSKRGLQCIGMHENGIIFNNNPSLWRTVRPFFMKALTGPGLIRM
VEVCVESIKQHLDRLGDVTDNSGYVDVVTLMRHIMLDTSNTLFLGIPLDESSIVKKIQ
GYFNAWQALLIKPNIFFKISWLYRKYERSVKDLKDEIEILVEKKRQKVSSAEKLEDCM
DFATDLIFAERRGDLTKENVNQCILEMLIAAPDTMSVTLYVMLLLIAEYPEVETAILK
EIHTVVGDRDIRIGDVQNLKVVENFINESLRYQPVVDLVMRRALEDDVIDGYPVKKGT
NIILNIGRMHRLEYFPKPNEFTLENFEKNVPYRYFQPFGFGPRSCAGKYIAMVMMKVV
LVTLLKRFHVKTLQKRCIENMPKNNDLSLHLDEDSPIVEIIFRHIFNTPFLQCLYISL
>Cyp20a1 NM_199401 XM_237189 BC061716.1
MLDFAIFAVTFLLALVGAVLYLYPASRQASGIPGLTPTEEKDGN
LPDIVNSGSLHEFLVNLHGRYGPVVSFWFGRRLVVSLGTADALKQHFNPNKTLDPFET
MLKSLLGYRSGAGSGSEDHVRRRLYGDAVTAALQSNFPLLLKLSEELLDKWLSYPETQ
HIPLSQHMLGFALKFVTRMVLGDTFEGEQEVIRFQKIHGTVWSEIGKGFLDGSLDKNT
TRKNQYQEALMQLEAILKKIIKERKGGDFSQHTFIDSLVQRNLNEQQILEDSVVFSLA
GCIVTARLCTWAIHFLTTAEEVQKKLHKEVDHVLGKGPITSEKIEQLRYCQQVLCETV
RTAKLTPVSAQLQDIEGKVGPFIIPKETLVLYALGVVLQDASTWPSPHKFDPDRFADE
PVMKVFSSLGFSGTWECPELRFAYVVTTVLVSVLLKKLHLLAVDRQVFEMKYELVTSC
REETWITVSERH
>Cyp20a1-ps chr13:18044633-18044858 UCSC browser
part of exons 2,5,6,7 introns removed and part of exon 6 deleted
exon 2 added at end of exon 7
RLYGDAVTAALQSNFPLLrKLSEELLnKWLSYPETQyIPLSQH exons 5,6
FQKIHdTVWSEIGKGFLDGSLDKNTTwKNQYQ exons 6,7
ASRQSSGIPGL exon 2
>Cyp21a1 U56853
MLLPGLLLLLLLLLLAGTRWLWGQWKLWKLRLPPLAPGFLHFLQ
PNLPVYLFGLAQKLGPIYRIRLGLQDVVVLNSNKTIEEALIQKWVDFAGRPQILDGKM
NFDLSMGDYSLTWKAHKKLSRSALVLGMRDSMEPLVEQLTQEFCERMRAQAGASVAIH
KEFSLLTCSIISCLTFGDKQDSTLLNATHSCVRDLLKAWNHWSVQILDIIPFLRFFPN
PGLWKLKQFQESRDHIVMQELKRHKDSLVAGQWKDMIDYMLQGVEKQRDARDPGQLHE
RHVHMSVVDLFVGGTETTAATLSWAVAFLLHHPEIQKRLQEELDLKLAPSSQLLYKNR
MQLPLLMATIAEVLRLRPVVPMALPHRATKASSISGYDIPKDTIIIPNIQGANLDEMV
WELPSKFWPDRFLESGKSPRIPTFGCGARVCLGEPLARLEFFVVLARLLQTFTLLPPP
DGTLPSLQPLPYTGINLLIPPFQVRLQPRNLAPQDQGQKSSTG
>Cyp21a1-ps pseudogene fragment AY091789
LFGLAQKLGPIYRIAWG
DAVVLNSNKTIEEALIQKWVDFTG*PQILDGK
>Cyp24a1 X59506
MSCPIDKRRTLIAFLRRLRDLGQPPRSVTSKASASRAPKEVPLC
PLMTDGETRNVTSLPGPTNWPLLGSLLEIFWKGGLKKQHDTLAEYHKKYGQIFRMKLG
SFDSVHLGSPSLLEALYRTESAHPQRLEIKPWKAYRDHRNEAYGLMILEGQEWQRVRS
AFQKKLMKPVEIMKLDKKINEVLADFLERMDELCDERGRIPDLYSELNKWSFESICLV
LYEKRFGLLQKETEEEALTFITAIKTMMSTFGKMMVTPVELHKRLNTKVWQAHTLAWD
TIFKSVKPCIDNRLQRYSQQPGADFLCDIYQQDHLSKKELYAAVTELQLAAVETTANS
LMWILYNLSRNPQAQRRLLQEVQSVLPDNQTPRAEDLRNMPYLKACLKESMRLTPSVP
FTTRTLDKPTVLGEYALPKGTVLTLNTQVLGSSEDNFEDSHKFRPERWLQKEKKINPF
AHLPFGIGKRMCIGRRLAELQLHLALCWIIQKYDIVATDNEPVEMLHLGILVPSRELP
IAFRPR
>Cyp26a1 AF439720, NM_130408 Chr1 1Mb upstream of CYP2C cluster
242138769 MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPLPPG
TMGFPFFGETLQMVLQ (0) 242138581
242138389 RRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKK (0) 242138165
242137906 VIMQAFNREALQCYVPVIAEEVSGCLEQWLSCGERGLLVYPEV
KRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR (0) 242137616
242137537 GVKPRNLIHARIEENIRAKIRRLQAAERNAGCKDALQLLIEHSWERGERLDMQ (0) 242137379
242136717 ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSK (0) 242136583
242136000 GLLCKSHHEDKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELN (0) 242135848
242135595 GYQIPKGWNVIYSICDTHDVADSFTNKEEFNPDRFTSLHPEDTSRFSFIPFGGGLRSCRSKEFAKI
LLKIFTVELARRCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFQGDI* 242135254
>Cyp26b1 AY245532, NM_181087
MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKS
CKLPIPKGSMGFPLIGETGHWLLQGSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENV
RKILLGEHQLVSTEWPRSARVLLGPNTVANSIGDIHRNKRKVFSKIFSHEALESYLPK
IQLVIQDTLRAWSSQPEAINVYQEAQRLTFRMAVRVLLGFSIPEEDLGNLFEVYQQFV
ENVFSLPVDLPFSGYRRGIQARQILQKGLEKAVREKLQCTQGKDYSDALDILIESSKE
HGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPAVLEKLREELRAQGLLHG
GGCPCEGTLRLDMLSGLRYLDCVIKEVMRLFTPVSGGYRTVLQTFELDGFQIPKGWSV
MYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLF
LKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAML
SATV
>Cyp26c1 XM_217935 94% TO 26C1 MOUSE
718 MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKGSMGWPFFG 897
898 ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0)
VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVA 1257
1258 VYQAAKALTFRMAARILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK 1422
1705 GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0)
1888 ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP 2064
2065 DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYS 2244
2245 IRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYIPFGGGARSCLGQELAQAVLQ 2424
2425 LLAVELVRTARWELATPAFPVMQTVPIVHPVDGLLLLFHPLPTLGAGDGSPF* 2583
>CYP26C1 XM_217935 94% TO 26C1 MOUSE Chr1 1Mb upstream of CYP2C cluster
242151281 MFSWGLSCLSMLGAAGTALLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKG
SMGWPFFGETLHWLVQ (0) 242151079
242150553 GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL (0) 242150422
242149883 VLARVFSRPALEQFVPRLQEALRREVRSWCAAQRPVAVYQAAKALTFRMAAR
ILLGLQLDEARCTELAQTFERLVENLFSLPLDVPFSGLRK (0) 242149608
242148160 GIRARDQLYQHLDEVIAEKLREELTAEPGDALHLIINSARELGRELSVQELK (0) 242148005
242146368 ELAVELLFAAFFTTASASTSLILLLLQHPAAIAKIQQELSAQGLGSPCSCAPRASGSRP
DCSCEPDLSLAVLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) 242146051
242144220 GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGVESEDARGSGGRFHYI
PFGGGARSCLGQELAQAVLQLLAVELVRTARWELATPAFPVMQTVPIVHPVD
GLLLLFHPLPTLGAGDGSPF* 242143843
>Cyp27a1 M38566
MAVLSRMRLRWALLDTRVMGHGLCPQGARAKAAIPAALRDHEST
EGPGTGQDRPRLRSLAELPGPGTLRFLFQLFLRGYVLHLHELQALNKAKYGPMWTTTF
GTRTNVNLASAPLLEQVMRQEGKYPIRDSMEQWKEHRDHKGLSYGIFITQGQQWYHLR
HSLNQRMLKPAEAALYTDALNEVISDFIARLDQVRTESASGDQVPDVAHLLYHLALEA
ICYILFEKRVGCLEPSIPEDTATFIRSVGLMFKNSVYVTFLPKWSRPLLPFWKRYMNN
WDNIFSFGEKMIHQKVQEIEAQLQAAGPDGVQVSGYLHFLLTKELLSPQETVGTFPEL
ILAGVDTTSNTLTWALYHLSKNPEIQEALHKEVTGVVPFGKVPQNKDFAHMPLLKAVI
KETLRLYPVVPTNSRIITEKETEINGFLFPKNTQFVLCTYVVSRDPSVFPEPESFQPH
RWLRKREDDNSGIQHPFGSVPFGYGVRSCLGRRIAELEMQLLLSRLIQKYEVVLSPGM
GEVKSVSRIVLVPSKKVSLRFLQRQ
>Cyp27b1 AB001992
MTQAVKLASRVFHRVQLPSQLGSDSVLRSLSDIPGPSTPSFLAE
LFCKGGLSRLHELQVHGAARYGPIWSGSFGTLRTVYVADPALVEQLLRQESHCPERCS
FSSWSEHRRRHQRACGLLTADGEEWQRLRSLLAPLLLRPQAAAGYAGTLDSVVSDLVR
RLRRQRGRGSGLPDLVLDVAGEFYKFGLEGIGAVLLGSRLGCLEAEVPPDTETFIEAV
GSVFVSTLLTMAMPSWLHRLIPGPWARLCRDWDQMFAFAQKHVEQREGEAAVRNQGKP
EEDLPTGHHLTHFLFREKVSVQSIVGNVTELLLAGVDTVSNTLSWALYELSRHPEVQS
ALHSEITGAVNPGSYAHLQATALSQLPLLKAVIKEVLRLYPVVPGNSRVPDRDICVGN
YVIPQDTLVSLCHYATSRDPAQFREPNSFNPARWLGEGPAPHPFASLPFGFGKRSCIG
RRLAELELQMALAQILTHFEVLPEPGALPVKPMTRTVLVPERSIHLQFVDR
>Cyp39a1 XM_236983 (INCORRECT END) AC107523.4
MGIMELFSPIAIAVLGSCVLFLFSRWKNLRGPPCIQGWIPWIGA
GFEFGKAPLEFIEKARIKYGPVFTVFAVGKRMTFVTEEEGINVLLKSKHVDFELAVQR
PLYHTAWIPKNIFFALHEKLYVLMKGKMGTFNTHHFTGQLTEEFHDQLEGLGTHGTMD
LNDFVRYLLYPATLNTLFMKGLFLTDKRKIKEFYQHFKTYDEGFEYGSQLPEWLLRNW
SKSKRWLLALFEKNIGDIKTHGSAGHSETLLQAVLGMVETETRLHSPNYGLVMLWASL
ANAAPIAFWTLAYILSHPDLHRTIVESISSVFGTAGKDKIQVSENDLKKLLLIKWCIL
ESIRLRAPGVITRKVVKPVKILNHTVPSGDLLMLSPFWLHRNPKYFPEPESFKPERWK
EANLDKYIFLDYFMAFGGGKFQCPGR
147499 WFALLEIQLCIILVLYKYECSLLDPLPKQA (1) 147410
146814 SLHLVGVPQPAGKCRIEYKQRV* 146746
>Cyp46a1 XM_343108
MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPSFLLG
HLPYFWKKDEACGRVLQDVFLDWAKKYGPVVRVNVFHKTSVIVTSPESVKKFLMSTKY
NKDSKMYRAIQTVFGERLFGQGLVSECDYGRWYKQRRVMDLAFSRSSLVSLMGTFNEK
AEQLMEILEAKADGQTPVSMQDMLTCATIDILAKAAFGMETSMLLGAQKPLSQAVKVM
LEGISASRNTLAKFMPGKRKQLREIRESIRLLRQVGKDWVQRRREALKRGEDVPADIL
TQILKAEEGAQDDEVLLDNFVTFFIAGHETSANHLAFTVMELSRQPEIVARLQAEVDE
VVGSKRHLDYEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL
FSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQMEVKVV
MAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC
>Cyp51a1 U17697
MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAKSP
PYIYSPIPFLGHAIAFGKSPIEFLENAYEKYGPVFSFTMVGKTFTYLLGSDAAALLFN
SKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAVFLEQKKILKSGLNIAHFKQYVSIIEK
EAKKYFKSWGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFS
HAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDG
RPLTDDEIAGMLIGLLLAGQHTSSTTSAWMGFFLARDKPLQDKCYLEQKTVCGEDLPP
LTYEQLKDLNLLDRCIKETLRLRPPI MTMMRMAKTPQTV AGYTIPPGHQVCVSPTVNQ
RLKDSWVERLDFNPDRYLQDNPASGEKFAYVPFGAGRHRCIGENFAYVQIKTIWSTML
RLYEFDLINGYFPSVNYTTMIHTPENPVIRYKRR SK
>Cyp51a1-ps1 pseudogene D87997 92% to CYP51A1 rat
398 MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAESPPCIYSPIPFLGHAI
FGKSPIEFLENAYEKSGPVFSFTMVSKTFTYLLGSDAAALL 696
697 FNSKNEDLNAEEVYGRLTTPVFGKXXXXXXX 768
788 NAVFLEHKKILKSGLNIAHFKQYVSITEKEAKEYFKSWGESGERNVFEALSELIILTASH 967
968 CLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAI 1147
1148 QKRRLSKEPAEDILQTLLDSTYKDGRPLTDDVIAGMLIGLLLAGXXXXXX
1281 TSAWMGFFLVRDKPLQGKCYLEQKAVCGEDLPPLTYE*LKDLNLLDRCIKE 1433
TLRLRPPIMTMMRMAKTPQ 1489
1490 NVAGCTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQGNPASGEKFAYVPFGAGRHH 1669
1670 CIGENFAYVQIKTIWSTMLHLYEFDLINGYFLSVNYTTMIHTPENPVIRYKMK 1828
>Cyp51a1-ps1 pseudogene XM_234202
MEQVTGGNLLSTLLIACAFTLSLVYLF
RLAVGHMVQLPAGAESPPCIYSPIPFLGHAIAFGKSPIEFLENAYEKSGPVFSFTMVG
KTFTYLLGSDAAALLFNSKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAVFLEHKKILK
SGLNIAHFKQYVSITEKEAKEYFKSWGESGERNVFEALSELIILTASHCLHGKEIRSQ
LNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKE
PAEDILQTLLDSTYKDGRPLTDDVIAGMLIGLLLAGHSAWMGFFLVRDKPLQGKCYLE
QKAVCGEDLPPLTYE*LKDLNLLDRCIKETLRLRPPIMTMMRMAKTPQNVAGCTIPPG
HQVCVSPTVNQRLKDSWVERLDFNPDRYLQGNPASGEKFAYVPFGAGRHHCIGENFAY
VQIKTIWSTMLHLYEFDLINGYFLSVNYTTMIHTPENPVIRYKMK
>Cyp51-ps2 pseudogene D78370 this has no exact match in the genome
it is probably a poor quality version of Cyp51a1-ps1
MEQVTGGNLLSTLLIACAFTLSLV (fs)
NLFRLAVGHMVQLHAGAESPPCIY
SPIPFLGHRIAFGKSPIEFLENAYEKSGPVFSFTMVDKTFTYLLGSDAAALLFNSKNEDL
NAEEVYGRLTTPVFGKGVAYDVPNADFLEHKKLLKSGLNIAHFKQYVSITEKEAKEYFKS
WGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWL
PLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDGRPLTDDVIAGML
IGLLLAGHSAWMGFFLVRDKPLQGKCYLEQKAVCGEALHPLTYE*LKDLNLLDRCIKETL
RLESPPI (fs)
VTM (fs)
VRIGQAPSRMCAGCTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQ
GNPASGEKFAYVPFGAGRHHCIGENFAYVQIKTIWSTMLHLYEFDLINGYFLSVNYTTMI
HTPENPVIRYKMK