P450s that have appeared since the 1993 P450 nomenclature update. This is part A of the list, covering CYP1 to CYP2. It includes references that were incomplete and duplications of sequences that were already in the update. If a sequence is assigned an accession number that was not in the old update, it is included in this list. This list was last revised on Jan. 31, 2003 (added all human genes and pseudogenes). Compiled by David R. Nelson.

A new format is being designed to make the entries more useful, with links to Genbank and Medline and access to the protein sequence. As time permits, the entries in the 1993 P450 Nomenclature Update will be added to make the listing more comprehensive. For the time being, I will leave the old text format in place below the newer table format, but eventually the text version will be deleted. Any comments are welcome.

Subfamilies: 1A, 1B, 2A, 2B, 2C, 2D, 2E, 2F, 2G, 2H, 2J, 2K, 2L, 2M, 2N, 2P, 2Q, 2R, 2S, 2T, 2U, 2V, 2W, 2X, 2Y, 2Z, 2AA, 2AB, 2AC, 2AD, 2AE, 2AF
Cytochrome P450 Data CYP1 to CYP2 (Under Construction)

| P450 gene | Species | Medline Entry | Comment | Protein Sequence | Genbank Accession |
| CYP1A1 | human | none | 3' UTR | | D12525 D01198 |
| CYP1A1 | human | none | 5' UTR | | D10855 D01150 |
| CYP1A1 | Cavia cobaya | none | | PIR S43414 | D11043 |
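The gene names used throughout this list follow a visible pattern: the prefix CYP, a family number, one or more subfamily letters, and usually a gene number. Fish entries such as CYP1A omit the number, pseudogenes such as CYP1D1P end in P, and mouse genes are written in lower case (Cyp1a1). The sketch below splits names of that shape into their parts. It is only an illustration: the regular expression and the names `CypName` and `parse_cyp_name` are assumptions made for this example, not an official grammar, and variant forms such as CYP1A8PX or CYP1A1v2 are deliberately rejected rather than guessed at.

```python
import re
from typing import NamedTuple, Optional


class CypName(NamedTuple):
    """Parts of a P450 gene name (hypothetical helper type)."""
    family: int            # e.g. 1 in CYP1A1
    subfamily: str         # e.g. "A" in CYP1A1, "AF" in CYP2AF
    gene: Optional[int]    # e.g. 1 in CYP1A1; None for names like CYP1A
    pseudogene: bool       # True when the name ends in P after the gene number


# Assumed pattern: CYP + family digits + subfamily letters,
# optionally followed by a gene number and a trailing P.
_CYP_RE = re.compile(r"^CYP(\d+)([A-Z]+)(?:(\d+)(P)?)?$", re.IGNORECASE)


def parse_cyp_name(name: str) -> CypName:
    """Split a name such as 'CYP1A1' or 'Cyp1a2' into its components."""
    match = _CYP_RE.match(name.strip())
    if match is None:
        raise ValueError(f"not a plain CYP name: {name!r}")
    family, subfamily, gene, pseudo = match.groups()
    return CypName(
        family=int(family),
        subfamily=subfamily.upper(),
        gene=int(gene) if gene is not None else None,
        pseudogene=pseudo is not None,
    )


if __name__ == "__main__":
    for example in ("CYP1A1", "Cyp1a2", "CYP1A", "CYP1D1P", "CYP2AF"):
        print(example, parse_cyp_name(example))
```

Rejecting unrecognised variants with an error keeps the sketch from silently misreading names whose suffixes carry meaning that this pattern does not model.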
1A Subfamily CYP1A1 human GenEMBL D12525 D01198 (650bp) Kawajiri,K., Watanabe,J., Gotoh,O., Tagashira,Y., and Sogawa,K. Structure and drug inducibility of the human cytochrome P-450c gene. Eur. J. Biochem. 159, 219-225 (1986) Kubota,M., Sogawa,K., Kaizu,Y., Sawaya,T., Watanabe,J., Kawajiri,K., Gotoh,O. and Fujii-Kuriyama,Y. Xenobiotic responsive element in the 5'-upstream region of the human P-450c gene. J. Biochem. 110, 232-236 (1991) Hayashi,S.-i., Watanabe,J., Nakachi,K. and Kawajiri,K. Genetic linkage of lung cancer-associated MspI polymorphisms with amino acid replacement in the heme binding region of the human cytochrome P450IA1 gene. J. Biochem. 110, 407-411 (1991) CYP1A1 human GenEMBL D10855 D01150 (4144bp) Kawajiri,K., Watanabe,J., Gotoh,O., Tagashira,Y., and Sogawa,K. Structure and drug inducibility of the human cytochrome P-450c gene. Eur. J. Biochem. 159, 219-225 (1986) Kubota,M., Sogawa,K., Kaizu,Y., Sawaya,T., Watanabe,J., Kawajiri,K., Gotoh,O. and Fujii-Kuriyama,Y. Xenobiotic responsive element in the 5'-upstream region of the human P-450c gene. J. Biochem. 110, 232-236 (1991) Note: these refs are the same as the two earlier accession numbers. CYP1A1 Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 94% to CYP1A1 human, 73% to CYP1A2 human, ortholog of CYP1A1 CYP1A1 Cavia Cobaya (guinea pig) GenEMBL D11043 (2674bp) PIR S43414 (516 amino acids) Ohgiya,S. Ishizaki,K. and Shinriki,N. Molecular cloning of guinea pig CYP1A1: complete primary structure and fast mobility of expressed protein on electrophoresis. Biochim. Biophys. Acta 1216, 237-244 (1993) CYP1A1 rat GenEMBL I00732 (1800bp) Oeda,K., Sakaki,T., Ohkawa,H., Yabusaki,Y., Murakami,H., Nakamura,K. and Shimizu,M. Cytochrome P-450MC gene, expression plasmid carrying the said gene, yeasts transformed with the said plasmid and a process for producing cytochrome P-450MC by culturing the said transformant yeasts. 
Patent: US 4766068-A 1 23-AUG-1988
CYP1A1 rat PIR A93513 (524 amino acids) Yabusaki, Y., Shimizu, M., Murakami, H., Nakamura, K., Oeda, K. and Ohkawa, H. Nucleotide sequence of a full-length cDNA coding for 3-methylcholanthrene-induced rat liver cytochrome P-450MC. Nucleic Acids Res. 12, 2929-2938 (1984)
CYP1A1 rat PIR S45716 (524 amino acids) Omata, Y., Robinson, R.C., Gelboin, H.V., Pincus, M.R., Friedman, F.K. Specificity of the cytochrome P-450 interaction with cytochrome b(5). FEBS Lett. 346, 241-245 (1994)
CYP1A1 rat PIR D60822 (19 amino acids) Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and Oesch, F. Effect of nutritional imbalances on cytochrome P-450 isozymes in rat liver. Biochem. Pharmacol. 37, 3245-3249 (1988)
CYP1A1 hamster GenEMBL D10913 (8700bp) Swiss Q00557 (524 amino acids) Sagami,I., Ohmachi,T., Fujii,H., Kikuchi,H. and Watanabe,M. Hamster cytochrome P-450 IA gene family, P-450IA1 and P-450IA2 in lung and liver: cDNA cloning and sequence analysis J. Biochem. 110, 641-647 (1991)
CYP1A1 hamster PIR JS0746 (524 amino acids) Ohgiya, S., Goda, T., Ishizaki, K., Morimoto, M., Sakamoto,T., Kamataki, T. and Shinriki, N. unpublished (1992)
CYP1A1 rabbit PIR A25143 (464 amino acids) Okino, S.T., Quattrochi, L.C., Barnes, H.J., Osanto, S., Griffin, K.J., Johnson, E.F. and Tukey, R.H. Cloning and characterization of cDNAs encoding 2,3,7,8-tetrachlorodibenzo-p-dioxin-inducible rabbit mRNAs for cytochrome P-450 isozymes 4 and 6. Proc. Natl. Acad. Sci. U.S.A. 82, 5310-5314 (1985)
CYP1A1 Macaca irus (crab eating macaque monkey) GenEMBL D17575 (2602bp) Ohmachi,T., Sagami,I., Kikuchi,H., Fujii,H., Suzaki,Y., Fujiwara,T. and Watanabe,M. Molecular cloning and sequence analysis of cDNA encoding a crab-eating monkey (Macaca irus) cytochrome P-450 unpublished (1993)
CYP1A1 Macaca fascicularis (crab eating macaque monkey) Swiss P33616 (512 amino acids) Komori, M. Kikuchi,O. Kitada,M. Kamataki T. Molecular cloning of monkey 1A1 cDNA and expression in yeast.
Biochim. Biophys. Acta 1131, 23-29 (1992) CYP1A1 Sus scrofa (pig) GenEMBL AB052254 Misaki Kojima Submitted to nomenclature committee Oct. 27, 2000 82% to human CYP1A1, 74% to human 1A2 CYP1A1 Ovis aries (sheep) GenEMBL S79795 (2585bp) Hazinski,T.A., Noisin,E., Hamon,I. and DeMatteo,A. Sheep lung cytochrome P4501A1 (CYP1A1): cDNA cloning and transcriptional regulation by oxygen tension J. Clin. Invest. 96 (4), 2083-2089 (1995) CYP1A1 Bos taurus (cow) See cattle page for details MFSVFGLPIPISATELLLASAVFCL VFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLMLGKNPHVVLSQLSQRYGDVLQIRIG CTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLITNGQSMTFNPDSGPVWAARRRL AQNALKSFSTASDPASSSSCYLEEHVNKEAKYLLGKFQELMSGPGRFDPYRYIVVSVA NVICAICFGRRYDHNDQEFLSLVNLSNEFGEITASGNPSDFIPVLRYLPNTALDLFKD LNQRFYVFVQKIVKEHYKTFEKGHIRDITDSLIEHCQDKRLDENANIQLSDEKIINVV IDLFGAGFDTVTTALSWSLLYLVTSPRVQKKIQEELDTVIGRARRPRLSDRPQLPYLE AFILETFRHSSFVPFTIPHSTTRDSNLNGFYIPKGRCVFVNQWQINHDQKLWEDPSEF RPERFLTADGTINKVLSEKVIIFGLGKRKCIGETIARLEVFLFLAILLHQVEFCVTPG VKVDMTPVYGLTMKYARCEHFQAHMRS CYP1A1 Canis familiaris (dog) AACN010067442.1 Canis familiaris ctg19866850684014, 79% to 1A1 human N-term AACN010089968.1 Canis familiaris ctg19866851895459, 84% to 1A1 C-term full length combined seq = 81% to 1A1 1868 MFRLSIPISASELLLASTVFCLVLWVVKAWQPRLPKGLKSPPGPWGWPLLGNVLTLGKSPHLALS 2062 2063 RLSQRYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGDDFKGRPDLYSFSLVTDGQSLTFS 2242 2243 PDSGPVWAARRRLAQNALKSFSIASDPASSCSCYLEEHVSKEAEVLLSRLQEQMAEVGRF 2422 2423 DPYRYIVVSVANVICAMCFSKRYDHDDQELLSLVNLSNEFGEGVASANPLDFFPILRYLP 2602 2603 NPALDFFKDLNKRFYSFMQKMVKEHYKTFEK 2695 133 GQIRDVTDSLIEHCQDKRLDENANIQLSDEKIVNVVLDLFGA 258 347 GFDTVTTAISWSLLYLVTNPNVQKKIQKEL 436 529 DTVIGRARQPRLSDRPQLPYMEAFILETFRHASFVPFTIPH STTRDTSLSGFYIPKGRCVFVNQWQINHDQ 885 1038 KLWGNPSEFQPERFLTLDGTINKALSEKVILFGLGKRKCIGETIARLEVFLFLAILLQQ 1217 1218 VEFSVPEGTKVDMTPIYGLTMKHARCEHFQVRVRTEGAERSAA* 1349 CYP1A1 horse No accession number Heather Knych Submitted to nomenclature committee Oct. 
14, 2007 80% to CYP1A1 human, 70% to CYP1A2 human
CYP1A1 Macropus eugenii (tammar wallaby) no accession number Ross McKinnon submitted to nomenclature committee 9/7/98 98 amino acid C-terminal fragment is 82% identical to macaque 1A1
CYP1A1 Monodelphis domestica (opossum) UCSC Browser Oct 2006 assembly chr1 23141664-23146346 (-) strand Syntenic with human CYP1A1 adjacent to EDC3 and CYP1A2 73% to 1A1 human, 65% to 1A2 human Built_from_P56591_and_others 489177 - 493862 bp (489.2 Kb) on chromosome fragment scaffold_14927 This transcript is located in sequence: contig_43733
MTSILSLLGFSKSFTVTELLVVSAVFCLVFWIIDSYHQRVPKGFKSPPGPWAWPLIGNVL TLGKNPHLVLTQMREKYGDVMQIQIGSTPVLVLSGLETIRHALVKQGDDFKGRPDLYSFS LILDGESLSFGPDSGEVWAARRKLTQNALKAFSISSSPSSSFCYLEEHVIKEAEYLIQKF QEQKGHFDPVRYIVVSVANVICAICFGQRYDHDDQELLNIVRLSNKFGEVAASGNPVDFI PILRYLPNSKITAFRDLNEKIVAFTQKLVKEHYRKFEKGCIRDITDSLIEHCQEKKLDEN ANIMLSEKKVVNVVIDLFGAGFDTVTTAISWGLMYLVAKPEVQKKIHEELDTVIGRERLP QLSDKTQLPYMEAFILETFRHSSFLPFTIPHSTTRDITLNGFYIPKGRCVFVNQWQINHD PKIWGDPSVFRPERFLSVDGTINKALSEKVIMFGLGKRKCIGETIARWEVFLFLSILLHR MEFSVPSGVKVDLTPVYGLTMKHIPCEHFQTKLRS
CYP1A1 Balaenoptera acutorostrata (Minke whale) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita submitted to nomenclature committee 5/15/98
CYP1A1 Balaenoptera acutorostrata (Minke whale) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 82% to CYP1A1 human, 74% to CYP1A2
CYP1A1 Pusa sibirica (Baikal seal) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 82% to CYP1A1 human, 75% to CYP1A2
CYP1A1 Phocoenoides dalli (Dall's porpoise) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita submitted to nomenclature committee 5/15/98
CYP1A1 Eumetopias jubatus (Steller sea lion) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita clone #1 submitted to nomenclature committee 5/15/98
CYP1A1 Phoca largha (Spotted seal) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi
Fujita submitted to nomenclature committee 5/15/98 CYP1A1 Phoca fasciata (Ribbon seal) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita submitted to nomenclature committee 6/29/99 revised 2/27/01 CYP1A1 Halichoerus grypus (grey seal, gray seal) No accession number Rachel Tilley Submitted to nomenclature committee 3/19/2001 Name grey seal 1 CYP1A1 Phoca groenlandica (harp seal) No accession number Rachel Tilley Submitted to nomenclature committee 3/19/2001 Name harp seal 1 Cyp1a1 mouse GenEMBL K02588 (2619bp) Kimura,S., Gonzalez,F.J. and Nebert,D.W. The murine Ah locus. J. Biol. Chem. 259, 10705-10713 (1984) Cyp1a1 mouse GenEMBL M10021 (8809bp) PIR A24953 (30 amino acids) Gonzalez,F.J., Mackenzie,P.I., Kimura,S. and Nebert,D.W. Isolation and characterization of full-length mouse cDNA and genomic clones of 3-methylcholanthrene-inducible cytochrome P-1-450 and P-3-450 Gene 29, 281-292 (1984) Cyp1a1 mouse GenEMBL X01681 (6214bp) Kimura,S., Gonzalez,F.J. and Nebert,D.W. The murine Ah locus: Comparison of the complete cytochrome P1-450 and P3-450 cDNA nucleotide and amino acid sequences J. Biol. Chem. 259, 10705-10713 (1984) Cyp1a1 mouse GenEMBL M11515 (8850bp) Kimura,S. and Nebert,D.W. Comparison of the mouse P-1-450 gene and flanking sequences from a MOPC 41 plasmacytoma and normal liver. DNA 4, 365-375 (1985) Cyp1a1 mouse GenEMBL M25623 (410bp) Peterson,T.C., Gonzalez,F.J. and Nebert,D.W. Methylation differences in the murine P-1-450 and P-3-450 genes in wild-type and mutant hepatoma cell culture Biochem. Pharmacol. 35, 2107-2114 (1986) Cyp1a1 mouse GenEMBL M33935 (474bp) Jones,J.E. and Nebert,D.W. Transcriptional start site in the mouse Cyp1a1 (cytochrome P-1-450) gene. DNA 8, 527-534 (1989) Cyp1a1 mouse PIR C24406 (24 amino acids) Cheng, K.C., Park, S.S., Krutzsch, H.C., Grantham, P.H., Gelboin, H.V. and Friedman, F.K. Amino-terminal sequence and structure of monoclonal antibody immunopurified cytochromes P-450. 
Biochemistry 25, 2397-2402 (1986) CYP1A1 Xenopus tropicalis (frog) CYP1A2 human GenEMBL M38504 (3149bp) Jaiswal,A.K., Nebert,D.W., McBride,W.O. and Gonzalez,F.J. Human P-3-450: cDNA and complete protein sequence, repetitive Alu sequences in the 3' nontranslated region, and localization of gene to chromosome 15 J. Exp. Pathol. 3, 1-17 (1987) CYP1A2 human GenEMBL U02993 (3293bp) Quattrochi,L.C. and Tukey,R.H. The human cytochrome Cyp1A2 gene contains regulatory elements responsive to 3-methylcholanthrene Mol. Pharmacol. 36, 66-71 (1989) CYP1A2 human PIR A25892 (515 amino acids) Quattrochi, L.C., Pendurthi, U.R., Okino, S.T., Potenza, C. and Tukey, R.H. Human cytochrome P-450 4 mRNA and gene: part of a multigene family that contains Alu sequences in its mRNA. Proc. Natl. Acad. Sci. U.S.A. 83, 6731-6735 (1986) CYP1A2 human PIR A60881 (18 amino acids) Wrighton, S.A., Campanile, C., Thomas, P.E., Maines, S.L., Watkins, P.B., Parker, G., Mendez-Picon, G., Haniu, M., Shively, J.E., Levin, W. and Guzelian, P.S. Identification of a human liver cytochrome P-450 homologous to the major isosafrole-inducible cytochrome P-450 in the rat. Mol. Pharmacol. 29, 405-410 (1986) CYP1A2 Macaca fascicularis (cynomolgus monkey) GenEMBL D86474 Sakuma,T., Hieda,M., Igarashi,T., Ohgiya,S., Nagata,R., Nemoto,N. and Kamataki,T. Molecular cloning and functional analysis of cynomolgus monkey CYP1A2 Biochem. Pharmacol. 56 (1), 131-139 (1998) CYP1A2 Macaca fuscata (Japanese macaque) GenEMBL AB185338 (hold till 7/22/2005) Shizuo Narimatsu Submitted to nomenclature committee 8/28/2004 99% identical to cynomolgus monkey CYP1A2 92.4% to human CYP1A2 CYP1A2 rabbit PIR B27821 (516 amino acids) Kagawa, N., Mihara, K., Sato, R. Structural analysis of cloned cDNAs for polycyclic hydrocarbon-inducible forms of rabbit liver microsomal cytochrome P-450. J. Biochem. 101, 1471-1479 (1987) CYP1A2 dog PIR A60463 (16 amino acids) Ohta, K., Motoya, M., Komori, M., Miura, T., Kitada, M. and Kamataki, T. 
A novel form of cytochrome P-450 in beagle dogs. P-450-D3 is a low spin form of cytochrome P-450 but with catalytic and structural properties similar to P-450d. Biochem. Pharmacol. 38, 91-96 (1989) CYP1A2 Canis familiaris (dog) UCSC Browser chr30:40816888-40821608 (+) strand May 2005 assembly AACN010103563.1 Canis familiaris ctg19866850724666, 90% to 1A2 AACN010517076.1 Canis familiaris ctg19866850724664, 82% to 1A2 human N-term AACN010004324.1 Canis familiaris ctg19866850196532, 86% to 1A2 C-term combined sequence for 1A2 362 MALSQMATELLLASTIFCLILWVVKVWQPRLPKGLKSPPGPWGWPLLGNVLTLGKSPHLALS 177 176 RLSQRYGDVLQIRIGSTPVLVLSSLDTIRQALVRQGDDFKGRPDLYSFSLVT DGQSLTFSPDSGPVWAARRRLAQNALNTFSIASDPASSCSCYLEE 771 770 HVSKEAEALLSRLQEQMAEVGRFDPYNQVLMSVANVIGAMCFGHHFSQRSEEMLPLLMSS 591 590 SDFVETVSSGNPLDFFPILQYMPNSALQRFKNFNQTFVQSLQKIVQEHYQDFDE 429 RSVQDITGALLKHNEKSSRASDGHIPQEKIVNLINDIFGA GFDTVTTAISWSLMYLVANPEIQRKIQKEL DTVIGRARQPRLSDRPQLPLMEAFILEIFRHTSFVPFTIPHS (2) 631 TTKNTTLKGFYIPKECCVFINQWQVNHDQ 717 1789 QVWGDPFAFRPERFLTADGTAINKTLSEKVMLFGMGKRRCIGEVLAKWEIFLFLAILLQ 1968 1969 RLEFSVPAGVRVDLTPIYGLTMKHTRCEHVQARPRFSIK* 2088 CYP1A2 Bos taurus (cow) See cattle page for details MALSQLSPFSAMELLLASAIFCLVFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLTLG KNPHVVLSQLSQRYGDVLQIRIGCTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLVT DGQSMTFNPDSGPVWAARRRLAQNALNTFSVASD PSSSSSCYLEDHVSKEAEALLGKFQELMSGPGRFDPYGHVVASV ANVIGAMCFGQHFPQSSKEMLSLVESSHDFVESASSGNPVDFFPILKYLPNPALQRFK SFNQRFLQFVRKTVQEHYQDFDKNSIQDIIGALFKHSEDNSRASSRLISQEKTVNLVN DLFAAGFDTITTAISWSLMYLVTNPKIQRKIQEELD RVVGRARRPRLSDRPQLPYLES FILETFRHSSFVPFTIPHSTTRDTTLNGFFIPKERCVFINQWQVNHDPKLWGDPSVFR PERFLTSDGTTIDKTASEKVLLFGMGKRRCIGEVMARWEVFLFLAILLQRLEFSVPPG VKVDLTPTYGLTMKHARCEHMQARLRFPIK CYP1A2 Sus scrofa (miniature pig) no accession number Haitao Shang Submitted to nomenclature committee May 23, 2007 86% to 1A2hum, 75% to 1A1hum partial seq. 
CYP1A2 Sus scrofa (miniature pig) GenEMBL CB483208.1
KLWGDPSEFRPERFLTADGTAIHKTMSEEVILFGMGKRRCIGEVLAKWEVFLFLAILLQQ LEFSVPP
CYP1A2 rat PIR B24406 (25 amino acids) Cheng, K.C., Park, S.S., Krutzsch, H.C., Grantham, P.H., Gelboin, H.V. and Friedman, F.K. Amino-terminal sequence and structure of monoclonal antibody immunopurified cytochromes P-450. Biochemistry 25, 2397-2402 (1986)
CYP1A2 rat GenEMBL X01031 (1106bp) PIR A44612 (367 amino acids) Yabusaki, Y., Murakami, H., Nakamura, K., Nomura, N., Shimizu, M., Oeda, K. and Ohkawa, H. Characterization of complementary DNA clones coding for two forms of 3-methylcholanthrene-inducible rat liver cytochrome P-450. J. Biochem. 96, 793-804 (1984)
CYP1A2 rat PIR S26822 (19 amino acids) Botelho, L.H., Ryan, D.E., Yuan, P.M., Kutny, R., Shively, J.E. and Levin, W. Amino-terminal and carboxy-terminal sequence of hepatic microsomal cytochrome P-450d, a unique hemoprotein from rats treated with isosafrole. Biochemistry 21, 1152-1155 (1982)
CYP1A2 rat PIR D60822 (22 amino acids) Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and Oesch, F. Effect of nutritional imbalances on cytochrome P-450 isozymes in rat liver. Biochem. Pharmacol. 37, 3245-3249 (1988)
CYP1A2 rat PIR A61400 (513 amino acids) Woelfel, C.; Platt, K.L.; Dogra, S.; Glatt, H.; Waechter, F.; Doehmer, J. Stable expression of rat cytochrome P450IA2 cDNA and hydroxylation of 17beta-estradiol and 2-aminofluorene in V79 Chinese hamster cells. Mol. Carcinog. 4, 489-498 (1991)
CYP1A2 hamster GenEMBL D10914 (9719bp) Sagami,I., Ohmachi,T., Fujii,H., Kikuchi,H. and Watanabe,M. Hamster cytochrome P-450 IA gene family, P-450IA1 and P-450IA2 in lung and liver: cDNA cloning and sequence analysis J. Biochem. 110, 641-647 (1991)
CYP1A2 Mesocricetus auratus (hamster) GenEMBL M63787 M34446 (1868bp) Lai,T.S. and Chiang, J.Y.L. Cloning and characterization of two major 3-methylcholanthrene inducible hamster liver cytochrome P-450s. Arch. Biochem. Biophys.
283, 429-439 (1990) clone MC4 note: M34446 is incorrectly included in the GenBank entry for CYP2A8 and CYP2A9. M34446 should only be in the CYP1A2 hamster entry. CYP1A2 Cavia cobaya (guinea pig) GenEMBL D50457 (1760bp) Mori,T., Itoh,S., Ohgiya,S., Ishizaki,K. and Kamataki,T. Effect of ascorbic acid on expression of several forms of cytochrome P-450 of guinea pig Unpublished (1995) CYP1A2 Cavia porcellus (guinea pig) GenEMBL U23501 (1757bp) Black,V.H. unpublished 1995 CYP1A2 Monodelphis domestica (opossum) UCSC Browser Oct 2006 assembly chr1 23173195 - 23183937 (+) strand Syntenic with human CYP1A2 adjacent to CYP1A1 and CSK 70% to 1A2, 65% to 1A1 Built_from_Q64391_and_others 451687 - 462429 bp (451.7 Kb) on chromosome fragment scaffold_14927 This transcript is located in sequence: contig_91822 MVSSLLASISISELLLASVIFCLVFWVTRSSHQRVPKGLKSPPGPWAWPLFGNVWTLGKN PHLTLAQLSEKYGDVMKIHIGSTPVIVLSGLETIRQALVKQGEDFKGRPDLYSSTFVADG YSLAFNPDSGEVWAVRRKLAQNALNTFSVSSSPSSSSCYLEEHVNKEVKHLIQKFQELME GVGCFDPYRHIVASVANVISAMCFSQRYEDHKNPEFTTLINASHEFVESATSGNPVDFFP ILRYIPNPQLQRFKEFNQRFLKFLQNTIREHHKAFDENNIQDITGALYKHSQDKAFGNTS SSVPEMLIINLINDIFGAGFDTVTTAISWSLMYLVTNPKVQKKIQQELDTVIGRDRWPLL SDRPQLPFMEAFILEIFRHTSFVPFTIPHSTTRATTLNNFYIPKGTCVFVNQWQTNHDPK LWEDPSVFRPERFLSADGTVNKALSEKVILFGLGKRRCIGETIARWEVFLFLAILLHQIE FSVPSGVKVDMTPTYGLTMKHPRCEHFQARPRFSR CYP1A2 chicken GenEMBL M64537 (884bp) Swiss Q01741 (258 amino acids) Murti,J.R., Adiga,P.R. and Padmanaban,G. Estradiol-17-Beta induces polyaromatic hydrocarbon-inducible cytochrome p-450 in chicken liver Biochem. Biophys. Res. Commun. 
175, 928-935 (1991) Note: previously called 1A2 CYP1A2 Eumetopias jubatus (Steller sea lion) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita clone #2 submitted to nomenclature committee 5/15/98 CYP1A2 Phoca fasciata (Ribbon seal) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita submitted to nomenclature committee 6/28/99 revised 2/27/01 CYP1A1/CYP1A2 chimera Phoca fasciata (Ribbon seal) no accession number Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita submitted to nomenclature committee 6/28/99 on 2/27/01 the authors sent the following message "... we believe that the production of the chimera sequence could be the result of a PCR defect." CYP1A2 Halichoerus grypus (grey seal, gray seal) No accession number Rachel Tilley Submitted to nomenclature committee 3/19/2001 Name grey seal 2 CYP1A2 Phoca groenlandica (harp seal) No accession number Rachel Tilley Submitted to nomenclature committee 3/19/2001 Name harp seal 2 Cyp1a2 mouse GenEMBL K02589 (1893bp) Kimura,S., Gonzalez,F.J. and Nebert,D.W. The murine Ah locus. J. Biol. Chem. 259, 10705-10713 (1984) Cyp1a2 mouse PIR A93512 (513 amino acids) Kimura, S., Gonzalez, F.J. and Nebert, D.W. Mouse cytochrome P-3-450: complete cDNA and amino acid sequence. Nucleic Acids Res. 12, 2917-2928 (1984) Cyp1a2 mouse GenEMBL X01682 (6715bp) Kimura,S., Gonzalez,F.J. and Nebert,D.W. The murine Ah locus: Comparison of the complete cytochrome P1-450 and P3-450 cDNA nucleotide and amino acid sequences J. Biol. Chem. 259, 10705-10713 (1984) Cyp1a2 mouse GenEMBL M25624 (510bp) Peterson,T.C., Gonzalez,F.J. and Nebert,D.W. Methylation differences in the murine P-1-450 and P-3-450 genes in wild-type and mutant hepatoma cell culture Biochem. Pharmacol. 35, 2107-2114 (1986) Cyp1a2 mouse PIR B92495 (513 amino acids) Gonzalez, F.J., Kimura, S. and Nebert, D.W. J. Biol. Chem. 
260, 11884-11889 (1985) Erratum
Cyp1a2 mouse GenEMBL M10022 (8865bp) PIR B24953 (30 amino acids) Gonzalez,F.J., Mackenzie,P.I., Kimura,S. and Nebert,D.W. Isolation and characterization of full-length mouse cDNA and genomic clones of 3-methylcholanthrene-inducible cytochrome P-1-450 and P-3-450 Gene 29, 281-292 (1984)
Cyp1a2 mouse PIR A45955 (42 amino acids) PIR B45955 (39 amino acids) Peterson, T.C., Gonzalez, F.J. and Nebert, D.W. Methylation differences in the murine P-1-450 and P-3-450 genes in wild-type and mutant hepatoma cell culture. Biochem. Pharmacol. 35, 2107-2114 (1986)
Cyp1a2 mouse PIR D24406 (25 amino acids) PIR E24406 (25 amino acids) Cheng, K.C., Park, S.S., Krutzsch, H.C., Grantham, P.H., Gelboin, H.V. and Friedman, F.K. Amino-terminal sequence and structure of monoclonal antibody immunopurified cytochromes P-450. Biochemistry 25, 2397-2402 (1986)
CYP1A2 Balaenoptera acutorostrata (Minke whale) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 82% to CYP1A2 human, 69% to CYP1A1
CYP1A2 Pusa sibirica (Baikal seal) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 80% to CYP1A1 human, 69% to CYP1A2

Fish cytochrome P450s are undergoing a revision to their nomenclature. Initially there appeared to be just one fish 1A gene per species, but Amy Berndtson's work in trout showed that this is not true. Until an adequate nomenclature can be devised, these fish sequences are listed as CYP1A, without a number following the subfamily. This does not affect the mammalian gene designations, though it may affect the chicken sequences.

CYP1A1 Oncorhynchus mykiss (trout) GenEMBL S69278 (5023bp) Berndtson,A.K. and Chen,T.T. Two unique CYP1 genes are expressed in response to 3-methylcholanthrene treatment in rainbow trout. Arch. Biochem. Biophys. 310, 187-195 (1994) Note: published as CYP1A2, but it is more similar to Heilmann's sequence than Berndtson's 1A1 (97.9% identical).
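Many entries in this list, including the trout entry above (97.9% identical), report a percent identity between two sequences. The list does not say how those figures were computed, so the following Python sketch shows only one common convention: the two sequences are assumed to be already aligned to equal length, `-` is assumed to mark a gap, and gapped columns are left out of the comparison. The function name `percent_identity` is made up for this example.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent of identical residues over the ungapped columns of two aligned sequences.

    Assumes seq_a and seq_b come from the same alignment (equal length);
    columns where either sequence has a gap character are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0  # columns with a residue in both sequences
    matches = 0   # compared columns where the residues are identical
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy aligned fragments, invented for illustration only.
    print(percent_identity("ACDE", "ACDF"))
    print(percent_identity("AC-E", "ACDE"))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, so values from this sketch need not reproduce the figures quoted in the entries.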
CYP1A1 Oncorhynchus mykiss (trout) GenEMBL U62797(1697bp) Bailey,G., You,L. and Harttig,U. Cloning, sequencing and functional expression of two trout CYP1A cDNAs in yeast unpublished (1997) incorrectly called 1A2 CYP1A3 Oncorhynchus mykiss (trout) GenEMBL U62796(2401bp) Bailey,G., You,L. and Harttig,U. Cloning, sequencing and functional expression of two trout CYP1A cDNAs in yeast unpublished (1997) incorrectly called 1A1 CYP1A Oncorhynchus mykiss (trout) GenEMBL AF015660 Bailey,G., You,L. and Harttig,U. Cloning,sequencing and aflatoxin B1 metabolism by multiple rainbow trout CYP1A cDNAs expressed in yeast Unpublished 8 amino acid differences with U62797 CYP1A3 Oncorhynchus mykiss (trout) GenEMBL S69277 (5524bp) Berndtson,A.K. and Chen,T.T. Two unique CYP1 genes are expressed in response to 3-methylcholanthrene treatment in rainbow trout. Arch. Biochem. Biophys. 310, 187-195 (1994) Note: published as CYP1A1. This sequence is 96.7% identical to Heilmann's 1A1 sequence. CYP1A1/CYP1A3 chimera Oncorhynchus mykiss (trout) PIR A28789 (522 amino acids) Heilmann, L.J., Sheen, Y.Y., Bigelow, S.W. and Nebert, D.W. Trout P450IA1: cDNA and deduced protein sequence, expression in liver, and evolutionary significance. DNA 7, 379-387 (1988) Published as CYP1A1 note: subsequent analysis has shown that the 5' end of this sequence comes from the 1A3 gene and the switch over occurs between base 271 and base 435 with base 1 as the A of the ATG start codon. CYP1A Pleuronectes platessa (plaice, a fish) GenEMBL X73631 (2411bp) PIR S34184 (521 amino acids) Leaver,M.J., Pirrit,L. and George,S.G. Cytochrome P450 1A1 cDNA from plaice (Pleuronectes platessa) Mol. Marine Biol. Biotechnol. 2, 338-345 (1993) CYP1A Opsanus tau ( oyster toadfish) GenEMBL U14161 (2352bp) Morrison, H.G., Oleksiak, M.F., Cornell, N.W., Sogin,M.L. and Stegeman, J.J. 
Identification of Cytochrome P450 1A genes from two teleost fish, toadfish (Opsanus tau) and scup (Stenotomus chrysops), and phylogenetic analysis of CYP1A genes. Biochem. J. 308, 97-104 (1995)
CYP1A Stenotomus chrysops (scup, a fish) GenEMBL U14162 (1566bp) Morrison, H.G., Oleksiak, M.F., Cornell, N.W., Sogin,M.L. and Stegeman, J.J. Identification of Cytochrome P450 1A genes from two teleost fish, toadfish (Opsanus tau) and scup (Stenotomus chrysops), and phylogenetic analysis of CYP1A genes. Biochem. J. 308, 97-104 (1995)
CYP1A Chaetodon capistratus (four-eye butterfly fish) GenEMBL U19855 (2552bp) Vrolijk,N.H., Lin,C. and Chen,T.T. Characterization and expression of a CYP1A gene from the tropical teleost, Chaetodon capistratus. Unpublished (1995)
CYP1A Dicentrarchus labrax (European sea bass) GenEMBL U78316 (1563bp) Stien,X., Amichot,M., Berge,J.-B. and Lafaurie,M. Molecular cloning of a CYP1A cDNA from the teleost fish Dicentrarchus labrax. Unpublished (1995)
CYP1A1v2 Dicentrarchus labrax (European sea bass) No accession number Alessandra Salvetti Submitted to nomenclature committee 11/26/99 94% identical to U78316, probably an allele
CYP1A Microgadus tomcod (Atlantic tomcod) GenEMBL L41886 (2497bp) L41917 Roy,N.K., Konkle,B.A., Kreamer,G.-L., Grunwald,C. and Wirgin,I.I. Characterization and prevalence of a polymorphism in the 3' untranslated region of cytochrome P4501A1 in cancer-prone Atlantic tomcod Arch. Biochem. Biophys. (1995) In press. Probable frameshift detected by O. Gotoh in the beginning of the sequence.
CYP1A Microgadus tomcod (Atlantic tomcod) GenEMBL L41917 (6837bp) Roy,N.K., Konkle,B. and Wirgin,I.I. Functional characterization of Cytochrome P4501A1 regulatory sequences in cancer-prone Atlantic tomcod. Unpublished (1995)
CYP1A Pagrus major (wild red sea bream) no accession number Mizukami,M., Okauchi,M., Ariyoshi,T. and Kito,H.
The isolation and sequence of cDNA encoding a 3-methylcholanthrene- inducible cytochrome P450 from wild red sea bream, Pagrus major. Marine Biol. 120, 343-349 (1994) CYP1A Sparus aurata (gilthead sea bream) GenEMBL AF011223, AF005719 CYP1A Liza aurata GenEMBL AF022433 Cousinou,M., Lopez-Barea,J. and Dorado,G. CYP1A Liza saliens (leaping mullet) GenEMBL AF072899 Alaattin Sen and Don Buhler submitted to nomenclature committee 96% identical to Liza aurata CYP1A Limanda limanda GenEMBL AJ001724 Robertson,F.E., McPhail,M.E., Rankin,R., Stagg,R.M. and Craft,J.A. CYP1A Platichthys flesus (European flounder) GenEMBL AJ132353 Williams,T.D., Lee,J.S. and Chipman,J.K. The cytochrome P450 1A gene (CYP1A) from European flounder (Platichthys flesus), analysis of regulatory regions and development of a dual luciferase reporter gene assay. Unpublished CYP1A1 Salmo salar (salmon) No accession number Christopher Rees Weiming Li submitted to nomenclature committee Nov. 9, 2001 a second gene is being isolated so this is called 1A1 rather than just CYP1A. This does not imply orthology to the mammalian 1A1, 1A2. The CYP1A gene duplications in fish and mammals occurred independently. CYP1A Anguilla anguilla (European eel) GenEMBL AF420257 Mahata,S.C., Mitsuo,R., Aoki,J.-y., Kato,H. and Itakura,T. Two forms of cytochrome P450 cDNA from 3-methylcholanthrene-treated European eel Anguilla anguilla Fish. Sci. 69 (3), 615-624 (2003) 98% identical to CYP1A9 from Japanese eel (clear ortholog) note: Eels have two CYP1A sequences. This one is 80% identical to Salmo salar CYP1A. CYP1A9 is 77% to the same Salmo CYP1A Therefore, CYP1A9 is a recent duplication in eels that is diverging Away from the parent sequence called CYP1A (no number after A). Called CYPEuMC1 CYP1A Anguilla japonica (Japanese eel) GenEMBL AB015638 Mitsuo,R., Itakura,T. and Sato,M. Cloning and Sequencing of Cytochrome P450 1A Complementary DNA in Eel (Anguilla japonica) Mar. Biotechnol. 
1 (4), 353-358 (1999) 98% identical to CYP1A9 from European eel (clear ortholog) note: Eels have two CYP1A sequences. This one is 81% identical to Salmo salar CYP1A. CYP1A9 is 78% to the same Salmo CYP1A Therefore, CYP1A9 is a recent duplication in eels that is diverging Away from the parent sequence called CYP1A (no number after A). Called CYPJaMC1 CYP1A Danio rerio (zebrafish) GenEMBL AY398333.1, AB078927.1 Gene is on CAAK02015935.1 (exon 1), CAAK02015934 (exons 2-6) MALTILPILGPISVSESLVAIITICLVYLLMRLNRTKIPDGLQK LPGPKPLPIIGNVLEIGNNPHLSLTAMSKCYGPVFQIQIGMRPVVVLSGNDVIRQALL KQGEEFSGRPELYSTKFISDGKSLAFSTDQVGVWRARRKLALNALRTFSTVQGKSPKY SCALEEHISNEGLYLVQRLHSVMKADGSFDPFRHIVVSVANVICGICFGRRHSHDDDE LVRLVNMSDEFGKIVGSGNPADFIPFLRILPSTTMKKFLDINERFSKFMKRLVMEHYDTFDK (0) DNIRDITDSLINHCEDRKLDENSNLQVSDEKIVGIVNDLFGA (1) GFDTISTALSWAVVYLVHYPEVQERLQREL (1) DEKIGKDRTPLLSDRANLPLLESFILEIFRHSSFLPFTIPHC (2) TSKDTSLNGYFIPKDTCVFVNQWQVNHDP (2) ELWKDPSSFIPDRFLTADGTELNKLEGEKVLVFGLGKRRCIGESIGRAEVFLFLAILL QRLKFTGMPGEMLDMTPEYGLTMKHKRCLLRVTPQPVF CYP1A Gobiocypris rarus (a rare minnow) GenEMBL EU106660 Jiayin Dai Submitted to nomenclature committee 4/19/2008 87% to CYP1A Danio CYP1A Callorhinchus milii (elephant shark, Chondrichthyes) Trace file 1573735839 78% to 1A zebrafish 1576735840 these two trace files are mate pairs IRDITDSLIEHCQDKKMDENANIQVSDEKIINIVNDLFGA (1) GFDTITTGLSWAVMYLVLYPDLQKRLQDEI (1) DEKIGKDRSPRLSDRSRLPYTDAFILETFRYSSFLPFTIPHC (2) TTKDTALNGYFIPKNTCVFVNQWQVNHDE (2) CYP1A Petromyzon marinus (sea lamprey) Trace files 1255373015 (DAVV exon +) 1386924597 (DAVV exon +) 1210995499 (DAVV exon +) 1437249679 (TTRD exon +) 1468852008 (TTRD exon +) 1442353648 (TTRD exon +) 1439550570 (ALWDE exon -) mate = 1442736929 = (TTRD exon +) 56% to 1A1 and 1A2 human, 61% to Bos 1A2 N-term part seems to be in a seq gap DAVVGRQRRPSLNDRRQLPFTEAFILEVLRHSSVVPFTIPHS (2) TTRDTVLQGFFIPKDTCIFINQWQVNHDS (2) ALWDEPFAFRPERFLSEDQSSVDRTRAANLLSFGTGKRRCMGEAVARSELFLFLSILLHHL RIRTADGQAPDMSAVYGLSLKHRTCLLLAESRS* CYP1A4 Gallus 
gallus (chicken) GenEMBL X99453 (2098bp) Gilday,D.J., Gannon,M., Yutzey,K., Bader,D. and Rifkind,A. Molecular cloning and expression of two novel avian cytochrome P450 1A enzymes induced by 2,3,7,8-tetrachlorodibenzo-p-dioxin. J. Biol. Chem. 271, 33054-33059 (1996)
CYP1A4 Phalacrocorax carbo (Common Cormorant) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 78% to CYP1A4 chicken, 72% to CYP1A5 chicken, 59% to CYP1A zebrafish
CYP1A5 Gallus gallus (chicken) GenEMBL X99454 (1845bp) Gilday,D.J., Gannon,M., Yutzey,K., Bader,D. and Rifkind,A. Molecular cloning and expression of two novel avian cytochrome P450 1A enzymes induced by 2,3,7,8-tetrachlorodibenzo-p-dioxin. J. Biol. Chem. 271, 33054-33059 (1996)
CYP1A5 Meleagris gallopavo (turkey) No accession number Roger Coulombe, Jr. Submitted to nomenclature committee May 5, 2004 95% to chicken 1A5
CYP1A5 Phalacrocorax carbo (Common Cormorant) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 78% to CYP1A5 chicken, 69% to CYP1A4 chicken, 58% to CYP1A zebrafish
CYP1A5 Corvus macrorhynchos (Jungle crow) No accession number Hisato Iwata submitted to nomenclature committee 4/15/05 75% to 1A5 chicken, 67% to 1A4 chicken
CYP1A6 Xenopus laevis (African clawed frog) GenEMBL AB022087 Fujita,Y. and Ohi,H. Xenopus laevis mRNA for cytochrome P450, cDNA clone MC1 unpublished (1999) In press clone MC1
CYP1A7 Xenopus laevis (African clawed frog) GenEMBL AB022088 Fujita,Y. and Ohi,H.
Xenopus laevis mRNA for cytochrome P450, cDNA clone MC2 unpublished (1999) In press clone MC2 CYP1A8PX human NT_008580.9 Pseudogene 43% identical to 1A2 human Renamed CYP1D1P orthologous to fish 1D1 NT_008580.9|Hs9_8737 chromosome 9 4822084 MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPY 4822260 4822261 LTLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLHKDGEHFAGRPNMHTFSFLAEGKS 4822440 4822441 LSFSVNYGESWKLHKKIASKAL*TFSNAEAKSSTCSCSLEEHVTEEISELVTVFVELTSK 4822620 4822621 NGSFDPRNAITCVVANIVCALCFGKR*DHSDEEFLRIVKTNDDLLKASSAANPADFIPCL 4822800 4822801 HYLPLKIINAPLEFYQALNGFIALHVQDHLATYGK 4822905 (0) 4824790 DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVSDLFGA 4824912 (1) 4829424 GFETVSTCLCWSFLYLIHYPEIQARIQEEI 4829513 (1) 4829611 RPPRFEDRKILPYTEAFVSEVFRHASFLPFTIPHS 4829715 (2) 4832677 TTADTTLNGYFIPRKTCTFINMYQVNHDE 4832763 (2) 4835676 TIWDNHSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITTVLQQFK 4835858 4835859 LKK*PRAKLDLTPTYGLVMRPKLYQLQAELHPSGSSSA* 4835975 CYP1A8PX ortholog Bos taurus (cow) Renamed CYP1D1P orthologous to fish 1D1 See cattle page for details MIFGMAVTSGEVTTSRIILVMVFVFVRELGNKGRKEVFPPGPWSLPIVENLLQLG DHLYFTFMEMRKKYGDVFLIKLGMVPVLVVNGMEMVKEVLLRNGEHFAA*PNV LTFSFLAQ*KSLTFS NYGENWTLHKKIASNALRTFPKAETKSSTRSCLLEKHVIEEVSELVKV FTELTSRSGSFEPRGAITCAMANVV CTLCFGKRYDHSDEEFLRIVKTDHDLLKASSAANPADFIPYF*YLPLRIINAPQEFYHARNQ FIALHIRDHLTT CPQDHIQDITDALINACHNKYAVAKITILNDDEIISTVSDLVGAG FEIISTCIYWSFLYLIYYPEIQVKIQEEI DGNTGMKSPRFENRKILP YTEAFINEIFRHTSFLPFTIPHC (2) TTADTTLNGYFIPRKTCTFINMYQVNHDE (2) TIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMGIQKCL REEVAQNEVFVFITTVLQQLTLKKCPVVKLDLTPTYGLVMKPKPYQLPAEPRSMGSSCS* CYP1A8PX ortholog Xenopus tropicalis (frog) This is not a pseudogene in frogs. It needs a new subfamily name, since it is separate from the CYP1A subfamily. See Xenopus page for seq. Renamed CYP1D1 CYP1A9 Anguilla anguilla (European eel) GenEMBL AF420258 Mahata,S.C., Mitsuo,R., Aoki,J.-y., Kato,H. and Itakura,T.
Two forms of cytochrome P450 cDNA from 3-methylcholanthrene-treated European eel Anguilla anguilla. Fish. Sci. 69 (3), 615-624 (2003) 98% identical to CYP1A9 from Japanese eel (clear ortholog) Note: Eels have two CYP1A sequences. CYP1A is 80% identical to Salmo salar CYP1A. This seq is 77% to the same Salmo CYP1A. Therefore, this is a recent duplication in eels that is diverging away from the parent sequence called CYP1A (no number after A). Called CYPEuMC2 CYP1A9 Anguilla japonica (Japanese eel) GenEMBL AB020414 Mitsuo,R., Itakura,T. and Sato,M. Cloning and Sequencing of Cytochrome P450 1A Complementary DNA in Eel (Anguilla japonica). Mar. Biotechnol. 1 (4), 353-358 (1999) 98% identical to CYP1A9 from European eel (clear ortholog) Note: Eels have two CYP1A sequences. CYP1A is 81% identical to Salmo salar CYP1A. This seq is 78% to the same Salmo CYP1A. Therefore, this is a recent duplication in eels that is diverging away from the parent sequence called CYP1A (no number after A). Called CYPJaMC2 1B Subfamily CYP1B1 human GenEMBL U03688 (5102bp) Sutter,T.R., Tang,Y.M., Hayes,C.L., Wo,Y.-Y.P., Jabs,E.W., Li,X., Yin,H., Cody,C.W. and Greenlee,W.F. Complete cDNA sequence of a human dioxin-inducible mRNA identifies a new gene subfamily of cytochrome P450 that maps to chromosome 2. J. Biol. Chem. 269, 13092-13099 (1994) *** Note: The CYP1B1 gene has been linked to primary congenital glaucoma. *** See April 97 Human Molecular Genetics CYP1B1 human GenEMBL U56438 (12177bp) Tang,Y.M., Wo,Y.-Y.P., Stewart,J., Hawkins,A.L., Griffin,C.A., Sutter,T.R. and Greenlee,W.F. Isolation and characterization of the human cytochrome P450 CYP1B1 gene. J. Biol. Chem.
271, 28324-28330 (1996) CYP1B1 Bos taurus (cow) See cattle page for details MATGLSPDDHLSPTLLSVQQTMLLLLLSVLAAVHVGQWLLRQRRRQPGSAPPGPFAWPLI GNAASMGSAPHLLFARLARRYGDVFQIHLGSCRVVVLNGERAIRQALVHQSAAFADRPPF ASFRLVSGGRSLAFGQYSESWKAQRRAAHSTMRAFSTRQPRGRRVLEGHVVGEVRELVEL LVRRSAGGAFLDPRPLTLVAVANVMSALCFGCRYSHDDAEFLELLSHNEEFGRTVGAGSL VDVLPWLQRFPNPVRTAFREFEQLNRNFSNFVLDKFLRHRESLRPGAAPRDMMDAFIHSA GADSGDGGPRLDVDYVPATVTDIFGASQDTLSTALQWLLVLFTR (2) YSEVQARVQAELDQVVGRHRLPTLEDQPRLPYVMAFLYEAMRFSSFVPVTIPHATTANAS VLGYHIPKDTVVFVNQWSVNHDPVKWSNPEDFDPTRFLDKDGLINKDLTGSVMVFSVGKR RCIGEEISKMQLFLFISILAHQCNFKANPDEPSKMDFNYGLTIKPKSFKINVTLRESMEL LDSAVQKLQVEKECQ* CYP1B1 rat GenEMBL X83867 (2321bp) Battacharyya,K.K., Brake,P.B., Eltom,S.E., Otto,S.A. and Jefcoate,C.R. Identification of a rat adrenal cytochrome P450 active in polycyclic hydrocarbon metabolism as a rat CYP1B1. Demonstration of a unique tissue-specific pattern of hormonal and aryl; hydrocarbon receptor-linked regulation. J. Biol. Chem. 270 11595-11602 (1995) CYP1B1 rat GenEMBL U09540(4964bp) Nigel Walker Walker,N.J., Gastel,J.A., Costa,L.T., Clark,G.C., Lucier,G.W. and Sutter,T.R. Rat CYP1B1: an adrenal cytochrome P450 that exhibits sex-dependent expression in livers and kidneys of TCDD-treated animals. Carcinogenesis 16 (6), 1319-1327 (1995) Cyp1b1 mouse GenEMBL U02479 (317bp) Shen,Z., Wells,R., Liu,J. and Elkind,M.M. Identification of a cytochrome P450 gene by reverse transcription- PCR using degenerate primers containing inosine. Proc. Natl. Acad. Sci. USA 90, 11483-11487 (1993) Note: only 104 amino acids by PCR. Cyp1b1 mouse GenEMBL U03283 (5128bp) Shen,Z., Liu,J., Wells,R.L. and Elkind,M.M. cDNA cloning, sequence analysis, and induction by aryl hydrocarbons of a murine cytochrome P450 gene, Cyp1b1. DNA Cell Biol. 13, 763-769 (1994) Cyp1b1 mouse GenEMBL X78445 (2006bp) Savas,U., Bhattacharyya,K.K., Christou,M., Alexander,D.L. and Jefcoat,C.R. Mouse cytochrome P450EF, representative of a new 1B subfamily of cytochrome P450s. 
Cloning, sequence determination, and tissue expression. J. Biol. Chem. 269, 14905-14911 (1994) CYP1B1 Xenopus tropicalis (frog) See Xenopus page for seq CYP1B1X Fundulus heteroclitus (killifish) GenEMBL AF235140 Celine Godard, Maya Said and John Stegeman Submitted to nomenclature committee Feb. 16, 2000 This seq is a CYP1C2 sequence not CYP1B1 CYP1B1 Platichthys flesus (European flounder) GenEMBL AY304550 68% to 1B1 fugu IKTIFXNFKKLNLEFGEFIRDKVIEHRKTIQSSTTRDMTDALIM ALDKLGDKTELTGGKDYVSPTMGDIFGASQDTLSTALQWIVLILVKYPEMQLRVQQEV DKVVERTRLPSIEDQLQL CYP1B1 Danio rerio (zebrafish) no accession number 66% to 1B1 fugu ctg26141 Length = 651601 4 exons EST BQ419016 494367 MMDVLLALRDLLQLSTRSVLLSLMVCLMLMFRRRQLVPGPFSWPVIGNAAQLGNTP 494534 494535 HFYLSRMAQKYGDVFQIKLGSRNVVVLNGDAIKEALVKKATDFAGRPDFASFRFVSNGKS 494714 494715 MAFGNYTPWWKLHRKVAQSTVRNFSTANIQTKQTFEKHIVSEIGELIRLFLNKSREQQFF 494894 494895 QPHRYLVVSVANTMSAVCFGNRYAYDDAEFQQVVGRNDQFTKTVGAGSMVDVMPWMQYFP 495074 495075 NPIRTLFDQFKELNKEFCAFIELKVSEHRKTISPSHVRDMTDAFIVALDKGLSGGSGVSL 495254 495255 DKEFVPPTISDIF 495293 495379 GASQDTLSTALQWIILLLVR 495438 497442 YPEIQKRLQEDVDRVVDRSRLPTIADQPHLPYLMAFIYEVMRFTSFTPLTIPHS 497603 497604 TTKDTSINGYPIPKDTVIFVNQWSLNHDPTKWDQPEVFNPQRFLDEDGSLNKDLTTNVLI 497783 497784 FSLGKRRCIGEDVSKIQLFLFTSVLVHQCSFKAESTPNMDYEYGLTLKPKPFKVSVTARD 497963 497964 SSDLLDSLVGTSQTPTEKR 498020 CYP1B1 Danio rerio (zebrafish) GenEMBL AF235139 Celine Godard, Maya Said and John Stegeman Submitted to nomenclature committee Feb. 
16, 2000 SQDTLSTALQWIILLLVRYPEIQKRLQEDVDRVVDRSRLPTIAD QPHLPYLMAFIYEAMRFTSFTPLTIPHSTTKDTSINGYPIPKDTVIFVNQWSLNHDPT KWDQPEVF CYP1B1P Danio rerio (zebrafish) No accession number (from trace index) gnl|ti|30343474 zfishB-a1803b07.p1c Length = 630 probable 1B1 pseudogene zebrafish IADQPHLPYMMAFIYEVMRFTSFTP TTNVLIFSLGKRRCIGEDVSKIQLFLFTSVMVHQ*RIKAESTPNMGYVXXXXX LKPKPFKVSVTARDSSDQLISLAGTSQTPTEK CYP1B1 Cyprinus carpio (common carp) GenEMBL AB048942 73% to 1B1 fugu LSTALQWIILLLVRYPEVQKRLQEDVDKVADRSRLPTIADQPHL PYVMAFIYEVMRFTSFVPVTIPYSTTTDTSINGYPIPKDTVIFV CYP1B1 Cyprinus carpio (common carp) No accession number Itakura, T. and El-kady M.A.H. Submitted to nomenclature committee 10/17/2003 Full length sequence 1 aa diff to fragment on AB048942 91% to CYP1B2 carp 64% to 1B1 fugu 53% to 1C1 fugu clone name carp1B1a CYP1B1 Stenella coeruleoalba (striped dolphin) no accession number Celine Godard, Maya Said and John Stegeman submitted to nomenclature committee Nov. 20, 1998 PCR fragment 90% identical to human 1B1 I-helix to PERF motif region CYP1B1 Pusa sibirica (Baikal seal) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 84% to 1B1 human CYP1B1 Pleuronectes platessa (plaice) GenEMBL AJ249074 Michael Leaver submitted to Nomenclature Committee 3/11/99 full length seq. 
MFLQDPPAMDVTLEGIDPVTLRAVLLACVTLLFSLHLWRWLGGQ PSVPGPPGPLAWPLIGNAAEMGKLPHLYLTRMAHKYGNVFQIKLGSRTVVVLNGDSIK QALVKQGTDFAGRPDFASFKYIFDGDSLAFGPFTDWWKVHRRVAQSTVRTFSTGNADT KKTFEHHVLCEFRELLQLFVGKTEQQRFFQPMTYLVVSTANIMSAVCFGKRYAYEDEE FLQVVGRNDQFTQTVGAGSIVDVMPWLQYFPNPIRTIFDNFKKLNLEFGQFIRDKVIE HRKTIQSSTTRDMTDALIVALDKLGDKSELTGGKDYVSPTMGDIFGASQDTLSTALQW IVLILVKYPEMQLRIQQEVDKVVDRTRLPSIEDQLQLPYIMAFVYEVMRFTSFVPLTI PHSTVTDTSIMGYTIPKNTVIFINQWSINHDPALWSHPETFDPQRFLDQNGALNKDLT SSVLIFSLGKRRCIGEELSKMQLFLFTALIAHQCHISPDPARPPKLDYTYGLTLKPCA FSIAVALRGHDMSLLDEATRSSAEEVKGEPSSDSQTKN CYP1B1 Fugu rubripes (Takifugu rubripes) Japanese pufferfish Scaffold_1553 complete gene Scaffold_11030 Scaffold_10662 54% TO 1B1 human 51% to 1B1 mouse AL024920.1 AL015454.1 cosmid 077P23 80% to CYP1B from pleuronectes platessa FC:C013F14aE4 LGU7740.y1 FC:C077P23aC12 AL015446.1 077P23 FC:C077P23aD8 2460 MKVIQEEVSPEAGALLLACATLLVSLQLWRWRRRRPGGCPPGPRAWPIIGNAAQLGHAPHL 2278 2277 YFTRMAQRFGNVFQIKLGSRTVVVLNGDAIKQALVRKGLEFAGRPDFTSFKYISNGHSL 2101 2100 AFGTVTDWWKSHRRVAQSTVRMFSTGNLQTKKTFERHLTCEVRELLHLFLGKTKELQYFQ 1921 1920 PMNYLVVSTANVISAVCFGKRYSYEDEEFQQVVGRNDQFTRTVGAGSIVDVMPWL 1756 1755 QYFPNPVKSIFDNFKRLNKEFSDFIRDKVTEHRKSIRPSSVRDMTDAFIVSLDKLSE 1585 1584 KTGVPLWKDYVIPTVGDVFGASQDTLSTALQWIFLVLVR 1468 (2) 294 YPDMQQRLQEEVDLVVGRQRLPCIEDQQQLPWVMAFIYEVMRFTSFVPLTIPHSTTTDTT 115 114 IMGYTIPKNTIIFINQWSINHDPTIWSHPET 13 FDPNRFLNPSGSLNKDLTSRMLIFSMGKRRCIGEELSKLHLFLFTALIGHQCHITDDPA KPTTMDYNYGLTLKPRGFYVALTLRGDMRLLDEAASRPPAEEPGRGPLADP* CYP1B1 Tetraodon nigroviridis (freshwater pufferfish) No accession number 80% to CYP1B1 fugu missing first 50 aa and last 18 aa FS_CONTIG_703_2 Length = 26665 69 NAAQLGKAPHLYFASRAERYGNVFQIRLGARSVVVLNGDAIRQALVKQGPEFAGRPDFAS 248 249 FGFISDGRSMAFGTATDWWKVHRRVAHSTVRMFSSGNAQTKKAFERHITSEVRELLRLFLRST 437 439 RAQRFFQPLAPLVVSTANVMSAVCFGKRYSYEDEEFQQVVGRNDQFTRTVGAGSVVDVMP 618 619 WLQYFPNPVKTIFDDFKRLNREFNSFIRDKVSEQ 720 722 RKTIQSSSVRDMTDALIASLDRLSAKTGVP 811 812 LWKEYVTPTVGDVFGASQDTLSTALQWIFLVLV 910 1486 
RYPDVQQRLQKEVDQVVGRQRLPCLEDQQQLPWVMAFIYEVMRFTSFMPLTIPHSTTTDT 1665 1666 TIGGYSIPRNTVVFINQWSVNHDPAIWPQPETFDPDRFLNPNGSLNKDLTSSVLIFSLGK 1845 1846 RRCIGEELAKLHLFLFTALMGHQCRLASDPARPPSLDWNYGLTLKPHAFHIAVSLRGDMRLLDQ 2037 CYP1B1 Anguilla japonica (Japanese eel) GenEMBL AB048940 73% to 1B1 fugu LSTALQWIILVLVRFPDIQKQLREEVDKVVDSSRLPSIEDQPRL PYVMAFLYEVMRFTSFIPVTIPHSTTTDTAIQGYRIPKDTVVFI CYP1B1 Oreochromis niloticus (Nile tilapia) GenEMBL AB048944 80% to 1B1 fugu LSTALQWIILILVKYPEIQVRLQQEVDKVVDRSRVPAIEDQQQL PYVMAFIYEVMRFTSFLPLTIPHSTTTDTSIMGYTVPKNTVIFI CYP1B1 Callorhinchus milii (elephant shark, Chondrichthyes) Trace files 1573810313 1573059473 57% to 1B1 zebrafish only 49% to 1C MNAVRVLAGQFTQSMQPVLAVALVVLTLLQVCKWMQQPSEQCRRRPPGPFPWPII GNATQIGKVPHISFSRMARRYGNVFQIKLGSRSVVVLNGEECIREALVRKAEQFSGRPDF ASFNEVSGGRSLAFRSYCDRWKFHRRIAHSTVRAFSTNNPDTKKTFQRHVVGEVQQLSSR RQ CYP1B1 Petromyzon marinus (sea lamprey) Trace files 1172235440, 1468167059, 1466822831, 1172788718, 1373603965, 1464676455 54% to 1B1 zebrafish, 48% to 1C2, 53% to CYP1B3 Petromyzon marinus SSNVVEFALLVALEARRWLLLRRARSSRGPPGPFPWPILGNALQLGSAPHLAMCRMARRY GDVFMMKLGGRPVLVLNGATAIRQALVKQGAD FAGRPAFPSFSVVSDGNSMAFGGYSSLWKMHRCVAQST LRHFSSSGNAEARADLERYV VSEAGALVGIMLERSDGGRYFNPSRLFILAIANVMSALCFGRRYDYDNSEFREIV SRNDKFGRTVGAGSLVDVMPWLLYFPNPVRTAYRDFVALNMEFNAFTRRKVEQHRADFKA GGVPRDITDSLIAAVEVERPRSRSGEALSGRHVSGAVNDIFGASQDTLSTALMWLLMFLV RFPRAQRRVQEEVD RVAGRHRLPCLEDRASLPYTEAFVFETLRYSSFVPV TIPHSTTTDTVIAGYCVPKDTVVFVNQWSSNHDPERWRDPETFEPTRFL DESGTRVDKDLASNVLIFSVGKRRCIGDDISKMQLLLFAAILAHQCSFEADPAQTMT IDKSYGLTLKPMPFEVRARVRDHVLAECFADARRQL* CYP1B3v1 Petromyzon marinus (sea lamprey) Trace 1373790297 first exon 49% to 1B1 fugu, 50% to 1C1 zebrafish 1437356431 mate pair = 1438643165 = C-term of 1223244203 seq 1290968067 52% to Stenotomus chrysops P450 1C1 combined frags 49% to 1B1 zebrafish 45% to 1C2 zebrafish, 39% to 1A1 zebrafish 1223244203, 1473037756, 1427240599, 1446950979 51% to 1B1 1438643165 = extreme C-term = mate pair of 1437356431 whole seq 51% to
1B1 human, 50% to 1B1 fugu, 49% to 1B1 zebrafish MQSTLAILAVNPSRTPTSTASFTSTSTQLSIPSSHLPPPPPPPSIQPSSPAC TLSQLPAHSPSAAASSPAVAAAPLHSLRTLPGPTPWPFVGNSLQLGPMPHLTFQRMASTY GPLFRIRLGSRDVVVLNGDSLVREALVCRGSEFAGRPAFRSFSMVSGGHSV AFGGYCELWRLHRRLAQSTLRAFSTGGTDARR ALDGHVMMEADELLRVMMA SCRRSTAGSVDPAQALVVAVANVRSALCFRRRYWHED AESSSSDRNERSGAAVGAGSVVDVMPW LLRFPNPVRAAFDDIRRANEDLSEFVRDKVRQRRGAAAVVGPGTRSVRDMM DALIAHVDGGAVAGGGAAEAAAGDGEGGEAAGGGRGGGGPRLGASHVEATLCDVFGASQD TLSTGLLWLILLAVRHPEEQARVQGEVDRVVGRTRLPSAADRARMPRTEAFVCEVLRYSS FVPVTIPHATTRDTRLAGYSIPRDTVVFVNQWSVNHDPGVFEEPHAFRPARF LDAEGTALDRALARRVMIFSAGRRRCIGEELSRLELFLFTAVMLHQV DFVAPPGHGPPGTEAVCGGLTLKPKPFSVALVPRGDPLGPGCAPQP* CYP1B3v2 Petromyzon marinus (sea lamprey) Trace files 1468808835, 1424613767, 1489836465 allele of 1223244203? 4 aa diffs and one indel of 1 aa PVRAAFDDFRRANEDL SEFVRDKVRQRRGAAAVVGPGTRSVRDMMDALISHVDGGAVAGGAAEAAAGDGEGGEAAGGERGGGGP RLGASHVEATLCDVFGASQDTLSTGLLWLILLAVRHPEEQARVQGEVDRVVGRTRLPSAA DRARMPRTEAFVCEVLRYSSFVPVTIPHATTRDTRLAGYSIPRDTVVFVNQWSVNHDPGV FEEPHAFRPARFLDAEGTALDRALARRVMIFSAARFRCIGEELSRLELFL CYP1B2X Stenotomus chrysops (scup, a fish) no accession number Celine Godard, Maya Said, and John Stegeman. submitted to nomenclature committee full length 4/21/99 81% identical to scup 1B3 renamed CYP1C1 CYP1B3X Stenotomus chrysops (scup, a fish) no accession number Celine Godard, Maya Said, and John Stegeman. submitted to nomenclature committee Aug. 26, 1998 full length 4/21/99 63% identical to human 1B1 over C-terminal PCR fragment I-helix to heme formerly 1B1, reassigned to CYP1C2 Note: the CYP1B2 and 1B3 names from scup were never published. It now appears that some fish like carp do have two CYP1B sequences, so the CYP1B2 name is going to be used to indicate this fact. 10/20/2003 CYP1B2 Cyprinus carpio (common carp) No accession number Itakura, T. and El-kady M.A.H.
Submitted to nomenclature committee 10/17/2003 Full length sequence 3 aa diffs to fragment on AB048942 91% to CYP1B1 carp 64% to 1B1 fugu 53% to 1C1 fugu clone name carp1B1b CYP1C1 Gallus gallus (chicken) XM_001233594.1 55% to CYP1C2 Fugu, 54% to 1C1 zebrafish MSAMGTPNGAAMAPVLSPHSALLLIAVVLTAI LLLARTRHKATRGQSPPGPFASPLVGNVLQMGRLPHLTFMRMACRYGAIFQLRLGRHRVV VLNGEAAIRRALVGLGTRFAGRPDFPSFGLVSGGRSIAFGGCTPQWRARRRLAHAALRAH STVAEVERHVVAEAGDLVRLFLRHSQGGAYFQPCPLLVVANANVLCALCFGRRYDHADGE FTALLGRNDRFGQTVGAGSLVDVLPWLLRFPNPVRHVYRDFQALNRELHGFVQAKVAQHR QTFDWRAVRDISDVMIASVERGGGSPDGLGPEDVEGAMTDIFGAGQDTTSTALSWIILLL LKHPQVQQDLQAELDRVVGRSRLPTAEDRPHLPLLEAFIYETLRYSSFVPITIPHATTAD VELEGFRIPKGTVVFVNQWSVNHDCSKWPEPQRFDPTRFLDKQQRLDRERAGSVMIFSAG QRRCIGDQLSKLQIFLFTAILLHQCSFHANPAEHLTMDCIHGLALKPLPFTVNVRPRIPL LIQP* CYP1C1 Stenotomus chrysops (scup, a fish) no accession number Celine Godard, Maya Said, and John Stegeman. submitted to nomenclature committee full length 4/21/99 81% identical to scup 1C2 formerly 1B2, reassigned after consultation with the submitters and comparison to the Fugu genomic orthologs (see below) CYP1C1 Danio rerio (zebrafish) GenEMBL CAAK02055884.1 6714 bp gene seq (revised seq shown below) contig NA9599 Length = 11279 78% to 1C1 73% to 1C2 fugu 53% to 1B1 Note: CYP1C probably arose by a retrotransposition of a 1B1 cDNA, since 1C has no introns and it is more similar to 1B1 than 1A MEAEFGLKSSSIMREWSGQVQPALIASFI 3411 ILFFLEACLWVRNLTFKKRLPGPFAWPLVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGC 3232 3231 SDIVVLNGDAAIRKALVQHSTEFAGRPNFVSFQMISGGRSLTFTNYSKQWKTHRKVAQST 3052 3051 LRAFSMANSQTRKTFEQHVVGEAMDLVQKFLRLSADGRHFNPAHEATVAAANVICALCF 2872 2871 GKRYGHDDPEFRTLLGRVNKFGETVGAGSLVDVMPWLQS 2755 2753 FPNPVRSVYQNFKTINKGVFNYVKDKVLQHRDTYDRDVTRDMSDAIIGVIEHGKEST 2583 2582 LTKDFVESTVTDLIGAGQDTVSTAMQWMLLLLVKYPSIQSKLQEQIDKVVGRDRLPSIE 2406 2405 DRCNLAYLDAFIYETMRFTSFVP 2337 2337 VTIPHSTTSDVTIEGLHIPKDTVVFINQWSVNHDPQKWSDPHIFNPSRFLDENGALN 2167 2166 KDLTSSVMIFSTGKRRCIGEQIAKVEVFLFSAILLHQCKFERDPSQDLSMDCSYGLALKP 1987 1986
LHYTISAKLRGKLFGLVSPA* 1924 CYP1C1 Fugu rubripes No accession number Scaffold_3008b comp(8676-10253) no introns complete gene 86% to scup 1C1 75% to scup 1C2 10253 MALDTEFGVKSSSITREWSGQVQPALVASFLFLFCLEACLWVRNLRHKRRL 10100 PGPFAWPVVGNAMQLGQMPHITFAKLAKKYGNVYQIRLGCSNI 9972 9971 VVLNGDQAIHQALIEHSTEFAGRPNFVSFQMISGGRSLTFTNYSKQWKVHRKLAQSSLRA 9792 9791 FSSANKQTKIAFEQHVTAEANELVQAFLRYSTDGRYFDPAHEFTVAAANVMCALCFGKRY 9612 9611 GHDDHEFRCLLKKLNKFGETVGAGSLVDVMPWLQSFPNPVRSLYENFKSLNEEFFNFV 9438 9437 KNKVQEHRESFDPNVTRDMSDAMINVIEERKDGTLSKEFAEATITDLIGAGQDTVS 9270 9269 TVLQWIVLLLVKHPDKQAKLHELMDKVVGQDRLPTTEDRSSLAYLDAFIYETMRFTSFVP 9090 9089 VTIPHSTTSDVTIEGLRIPKDTVVFINQWSVNHDPLKWKDPHVFDPSRFLNENGDLNKDL 8910 8909 TSGVMIFSSGKRRCIGSQIAKVEVFLFAAILLHQCSFESDPSDPLTLDCSYGLTLKP 8739 LRCFVSAKPRGKLLGLVSPA* 8676 CYP1C1 Tetraodon nigroviridis (freshwater pufferfish) No accession number FS_CONTIG_2073_3 Length = 9880 87% to 1C1 70% to 1C2 5630 MALDTEFSVKSSGITREWSGQIQPALVASFLFLFCLEACLWVRNLRQKRRLPGPFAWPV 5806 5807 VGNAMQLGQMPHITFAKLAKKYGNVYQIRLGCSNIVVLNGDQAIXX 5938 5943 QALIQHSTEFAGRPNFVSFQMISGGRSLTFTSYSKQWKAHRKVAQSSLRAFSSANNQTKK 6122 6123 AFEQHVTAEANKLVQTFLHYSTDGKYFDPAHDFTIAAANVMCALCFGKRYGHDDQGVQVP 6302 6303 VNEVGQVWPRTVGAGSLVDVMPWLQSFPNPVRSVYENFKSLNEEFFSFVKNKVSEHRESF 6482 6483 DPNVTRDMSDAMINVIEGRKDSTLTKEFVEATVTDLIGAGQDTISTVMQWIILLLV 6650 6651 KYPDMQAKLHELVDKVVGQDRLPTVEDRSSLAYLDAFIYETMRFTSFVPVTIPHSTTSDV 6830 6831 TIEGLHIPKKDTVVFINQWSVNHDPLKWEG 6919 PHVLGPSRFLDDNGDLKKDLNKGVMIFSSGKRRCIGNQIAK 7041 7053 FLFTAILLHQCSFESNPSDPVTLDCSYGLTLKPLRCFVNAKPRGKLLGVVSPA 7211 CYP1C1 Anguilla japonica (Japanese eel) No accession number Itakura, T. and El-kady M.A.H. 
Submitted to nomenclature committee 10/17/2003 Full length sequence 100% match to frag on AB048941 80% to 1C1 fugu 76% to 1C2 fugu 52% to 1B1 fugu clone name Japanese eel 1C CYP1C1 Anguilla japonica (Japanese eel) GenEMBL AB048941 81% to 1C1 78% to 1C2 fugu VSTLLQWILLLLVKYPHIQAKLQEQIDKVVGRDRLPCMEDKSSL AYLDAFVYETMRFTSFVPVTIPHSTTSDVTIEGVHIPRDTVVFI CYP1C1 Cyprinus carpio (common carp) No accession number Itakura, T. and El-kady M.A.H. Submitted to nomenclature committee 10/17/2003 Full length sequence 2 aa diffs to frag on AB048943 77% to 1C1 fugu 73% to 1C2 fugu 50% to 1B1 fugu clone name carp1C1a CYP1C1 Cyprinus carpio (common carp) GenEMBL AB048943 80% to 1C1 and 1C2 fugu VSTVMQWILLLLVKYPSIQTKLQEQIDKVVGRGRLPSIEDKSNL AYLDAFIYETMRYTSFVPVTIPHSTTSDVTIEGLHIPKDTVVFI CYP1C1 Callorhinchus milii (elephant shark, Chondrichthyes) Trace file 1576746999 57% to 1C2 tetraodon, 53% to 1B1 Pleuronectes, 49% to 1B1 fugu This genomic fragment spans the location of 1B1s only intron w/o an intron therefore this is probably 1C, an intronless gene LVTVRTLYRDFKRLNQEFFGFVSGKVGQRRRTFVPGRTRDMSDAFIAVVDGAAAAGHGLS GEHVEGTVNDVMGAGQDTTSTALGWVLFHLIRHPDVQARLQEEMDRAVGRGRLPGTGDRG RLPYLQAFIHEVCRFTSFVPLTIPHATTSRVTLHGYDLPEDTVVFVNQWSVNHDGAKWKE PETFEPGRFLDPDGSVNRALADSVMIFSAGKRRCLGDQLAKTQMFLFTAILIHQCAFEAN PGDVLSLDCLYGLSLKPLPFKLRVRLRDTYRGVGRQREPPPPPTHTHTQKHSTGQGHTHR DPSPTHTQRERDSQQDRDPTHHTPHRPLSTPVINVRN CYP1C1 Petromyzon marinus (sea lamprey) Trace files 1434207733, 1193330571, 1179606703, 1483258470, 1194048496, 1482130588, 1161783303, 1206198102 1193734487, 1468865778, 1293288933, 1162763713 53% to 1C2 Fugu 48% to 1B1 fugu (no intron so probably 1C) MTAAESMEALPVVAAGGGAQLWDISHPPV LFFLLSALLILLVTLEARKHGRSHQQQQKHSAPDPPGPLGFPIVGNSLQLGPM PHLTLNAMAQRYGAVFRIHLGHEPVVVLTGEEI IHEALVKRGAEFAGRPDFPSFALVSGGNSMSFKTYSELWRVHRRLAHSTLRAF FTGTAATRRVFEGHVRLEAAELCAMLAEATSRAGGCGVDPSEPTVVAVANVISAVCFGKR YEHDDAEFRGLLRNNERFSKTVGAGSVVDVMPWLMRFPNPVRSIFRDFEQMNNEFFAFVQ RKVREHRDSYDPAATPRDMIDALIGHIDGGGGDSDDEGDAADGPSWRWRCARGAPEVGAA 
YVDSTLTDVFGAG QDTMSTSLMWFVLLCAKHPELQADMQRDIDRVVGRERLPRLDDRPQLACVDAFVCEMMRH VSYVPFTIPHATTTDTELNGYRVAKGTVVFVNQWSVNHDPAIWRDPERFDPSRFL DETGAALDRDLARRVMIFSAGKRRCIGYEMAKMQLFLFCSALLH QLSISVPPGHVVSLEGVYGLSLKPKYLSVAFTPREQLLGGRPGEAEE* CYP1C fragment Petromyzon marinus (sea lamprey) Trace file 1483490875 frame3_ORF1 86% to CYP1C1 Petromyzon TRRLAH CTLRALFTGMATTRRVFEGHVRLEAAELCAMLHEQQNRAGGRGIESIERTVVAVANVISA VCFGKRYEHEDAEFRGLLRNNERFSKTLGAGSVLEVIPWIMRFPNPARSIIREFEQMNNE FFALMQRKVREHRDSYDPAATPRDMIDALIGHIDGGGGDSDDEGDAADG QSWRWRCARGAPEVG CYP1C2 Stenotomus chrysops (scup, a fish) no accession number Celine Godard, Maya Said, and John Stegeman. submitted to nomenclature committee Aug. 26, 1998 full length 4/21/99 63% identical to human 1B1 over C-terminal PCR fragment I-helix to heme formerly 1B3, reassigned after consultation with the submitters and comparison to the Fugu genomic orthologs (see below) CYP1C2 Danio rerio (zebrafish) no accession number contig NA2067 Length = 8014 EST CD758525 see zfish41356-444a08.p1c Zfish44625-3160d07.q1k 73% to 1C1 fugu and 74% to 1C2 fugu MAQSDSEFSILKEWSGQIQPALIASFI 1098 ILCCLEACFWVRNITLKKKRLPGPFAWPLVGNAMQLGQMPHITFSKLAKKYGNVYQIRLG 1277 1278 SSDIVVLNGESAIRSALLQHSTEFAGRPNFVSFQYVSGGTSMTFASYSKQWKMHRKIAQS 1457 1458 TIRAFSSANSQTKKSFEKHIVAEAVDLVETFL 1553 KIQHFNPSHELTVAAANIICALCFRKRYGHDDLX (from EST CD758525) (C-terminal inverted) 2818 IKNVLGNVNKFSETVGAGSLVDVMPWLQTFPNPIRSIFQSFKDLNSDFFSFVKGKVVEHRL 2636 2635 SYDPEVIRDMSDAFIGVMDHADEETGLTEAHTEGTVSDLIGAGLDTVSTALNWMLLL 2465 2464 LVKYPSIQSKLQEQIDKVVGRDRLPSIEDRCNLAYLDAFIYETMRFTSFVPVTIPHSTTS 2285 2284 DVTIEGLHIPKDTVVFINQWSVNHDPQKWSDPHIFNPSRFLDENGALDKDLTNSVMIFSI 2105 2104 GRRRCIGDQIAKVEVFLISAILIHQLTFESDPSQDLTLNCSYGLTLKPFDYKISAKPR 1931 1930 GSIVN* 1913 CYP1C2 Fugu rubripes No accession number Scaffold_3008a comp(5208-6770) no introns complete gene 83% to scup 1C2 78% to scup 1C1 6770 MEEDFGVKGSSSITREWSGHVQPALVAFFVFLFCVEACLWAKNLKRRL 6626 PGPFAWPVVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGCSDI 6498 6497
VVLNGARVIRQALIEHSTEFAGRPNFVSFQNVSGGKSMAFTSYSKQWRMHRKIAQSTIRA 6318 6317 FSSANSQTKKVFEQQIVAEATELVEVFLKLGARGQHFNPAHELTVAAANVICALCFGRRY 6138 6137 GHDDQEFRDVLRRIDKFGQTVGAGSLVDVMPWLQSFPNPVRSMFRSFEALNREFFGF 5967 5966 VQLKVEQHRETFDPEVTRDMSDAIISVLEKSDGETALTKDYTEVTMADLIGAGLDTV 5796 5795 STALHWMLLLLVKHPELQSKLHQLIDRVVGRNRLPSIEDRSSLAYLDAFIYETMRFTSFV 5616 5615 PVTIPHSTTSDVTIEGLRIPKDTVVFINQWSVNQDPLMWKDPHVFDPSRFMDEEGSLDRD 5436 5435 LACNVMIFSAGKRRCIGDQIAKVEVFLFFAVLLHQCSFESSADEDLTLNCSYGLTLKPL 5259 5258 DFSITAKLRGKLLKSP* 5208 CYP1C2 Tetraodon nigroviridis (freshwater pufferfish) No accession number 84% to CYP1C2 fugu 73% to CYP1C1 fugu CNS_TRUECNSCONTIG_6508_2 Length = 4645 1369 MEEEFCVEGGSSSIREWSGHIQAALVAFFVFLFCLEARLWAKNL 1501 KRRLPGPFAWPVVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGCSDIVVLNGDRVIRQAL 1680 1681 IQHSTEFAGRPNFVSFQTVSGGKGMTFSSYSKRWKMHRKIAQSTIRAFSSANSQTKENFE 1860 1861 QQIAAEATELVEVFLKLSARGQHFNPEHELTVAAANVICALCFGKRYGHDDAEFRELLHR 2040 2041 VNMFGQTVGAGSLVDVMPWLQSFPNPVRSMFKSFKTLNRQFFGFVQLKLKEHRETFDPKV 2220 2221 TRDMSDAIISVLDRSASEYGLTKDNAEGTVSDLIGAGLDTVSTALHWMLLLLVKHPQ 2391 2392 LQHKLQQLIDQVVGRNRLPSIGDRSSLAYLDAFIYETMRFTSFVPVTIPHSTTSDVTIEG 2571 2572 LRIPKDTVVFINQWSVNHDSLMWTDPHVFDPSRFLDEQGSLNRDLASNVMIFSAGKRRCI 2751 2752 GTQIAKAEIFLFLAILLHQCSFERSAGEEPSLDCSYGLTLKPLDYRITAKLRGKLLKSP 2928 CYP1C2 Fundulus heteroclitus (killifish) GenEMBL AF235140 Celine Godard, Maya Said and John Stegeman Submitted to nomenclature committee Feb. 16, 2000 Formerly named CYP1B1, but reassigned 10/21/2003 CYP1C2 Cyprinus carpio (common carp) No accession number Itakura, T. and El-kady M.A.H. 
Submitted to nomenclature committee 10/17/2003 Full length sequence 5 aa diffs to frag on AB048943 73% to 1C2 fugu 72% to 1C1 fugu 51% to 1B1 fugu clone name carp1C1b CYP1D1P/CYP1A8PX human NT_008580.9 Pseudogene 43% identical to 1A2 human Renamed CYP1D1P orthologous to fish 1D1 NT_008580.9|Hs9_8737 chromosome 9 4822084 MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPY 4822260 4822261 LTLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLHKDGEHFAGRPNMHTFSFLAEGKS 4822440 4822441 LSFSVNYGESWKLHKKIASKAL*TFSNAEAKSSTCSCSLEEHVTEEISELVTVFVELTSK 4822620 4822621 NGSFDPRNAITCVVANIVCALCFGKR*DHSDEEFLRIVKTNDDLLKASSAANPADFIPCL 4822800 4822801 HYLPLKIINAPLEFYQALNGFIALHVQDHLATYGK 4822905 (0) 4824790 DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVSDLFGA 4824912 (1) 4829424 GFETVSTCLCWSFLYLIHYPEIQARIQEEI 4829513 (1) 4829611 RPPRFEDRKILPYTEAFVSEVFRHASFLPFTIPHS 4829715 (2) 4832677 TTADTTLNGYFIPRKTCTFINMYQVNHDE 4832763 (2) 4835676 TIWDNHSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITTVLQQFK 4835858 4835859 LKK*PRAKLDLTPTYGLVMRPKLYQLQAELHPSGSSSA* 4835975 CYP1D1 Macaca mulatta (rhesus monkey) chr15 from UCSC browser 81802360-81816347 92% to human 1D1P MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR YLPLQIINAPREFYRALNGFIALHVQDHLATYDK (0) DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVNDLFGA GFETVSTCLYWSFLYLIHYPEIQAKIQEEI (1) DGNIGLKPPRFEDRKILPYT EAFISEVFRHASFLPFTIPHCNTADTTLNGYFIPRKTCTFINMYQVNHDETIWDNPSLFR PDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQLKLKKCPRAKL DLTPTYGLVMRPKPYQLEAERRSSGSSSASILRLRGGFLTQFRKIDELNLLN* CYP1D1P/CYP1A8PX ortholog Bos taurus (cow) Renamed CYP1D1P orthologous to fish 1D1 See cattle page for details MIFGMAVTSGEVTTSRIILVMVFVFVRELGNKGRKEVFPPGPWSLPIVENLLQLG DHLYFTFMEMRKKYGDVFLIKLGMVPVLVVNGMEMVKEVLLRNGEHFAA*PNV LTFSFLAQ*KSLTFS NYGENWTLHKKIASNALRTFPKAETKSSTRSCLLEKHVIEEVSELVKV FTELTSRSGSFEPRGAITCAMANVV
CTLCFGKRYDHSDEEFLRIVKTDHDLLKASSAANPADFIPYF*YLPLRIINAPQEFYHARNQ FIALHIRDHLTT CPQDHIQDITDALINACHNKYAVAKITILNDDEIISTVSDLVGAG FEIISTCIYWSFLYLIYYPEIQVKIQEEI DGNTGMKSPRFENRKILP YTEAFINEIFRHTSFLPFTIPHC (2) TTADTTLNGYFIPRKTCTFINMYQVNHDE (2) TIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMGIQKCL REEVAQNEVFVFITTVLQQLTLKKCPVVKLDLTPTYGLVMKPKPYQLPAEPRSMGSSCS* CYP1D1P dog UCSC browser chr 1 87915406-87928215 (-) strand 57% to human 1D1P VIAELISKNGNFGLRSVITCVVVVVNVICILCFSMRYD HI*EEFLRIHKMNAHLLETSSEANPADFMPCFLYRPL*IINAYQEFYQAPN*FIALHDHLTTYDN DHI*AIADALINACHNKYGTMEAATINDDEIISTMNGLFGA GLETIAIFLFWGFLF IIHFFQVKTWGWESVRFEHRKIIPYTEASIN*IFRYAPFLPLAIPHC (2) STTEDTVQNGYFIPRKSCTFISMC*INHNQ NIWDNPKLFRSQRFINENRE*KS*EQNVDIWNGTLEVSHRR**RNEICIFITSV CYP1D1P Oryctolagus cuniculus (rabbit) GenEMBL AAGW01268851.1 57% to human 1D1P, only 30% to 1A1 2347 VSVFVRALGSRNRKQVSTAGP*AFSNLFQLGAYPFLI**RGERNRDVFLFTFVVLP 2514 2515 VVVVNGMEMVKKTLLSDGKHFSGRPDMHTIAFLEEGKGLSSFVTHGES*KLYFQCVSNAL 2694 2695 CTFSKVEAK FSTYSCLLEEHITEE ASELMKVFVELTTKSGNFG 2825 2826 LRNAIPWHDQN 2857 IVGALCFGKRYDHNDGKSLSVVK SNGLFKFPSKAKPQ FIPQFHYLPLQIINIP*WL 3030 3031 YQALNQFTDLQVQGHLRMYDK 3093 CYP1D1P Sus scrofa GenEMBL CT232614.1, CT282345.1 77% to human 1D1P only 32% to 1A1 human 376 VFVFVRALRNNGRKQVFPPGSCSFPIIGNLQLGGHPYLTFMEMRKKYGVVFFIKLGVMPV 555 556 LVVNGMEMVKQVLLKGGEHVAGRLHMHTFSFLAKGKSLTFLANYRESCKLCKKIASNAL* 735 736 TFSQEETKSPTCSCFLEEHVVEEVSELVKVFAELTSNSCSFDCRSAI 876 TVVANIVFALCFGKRYDHSDEEFLRIVKT CYP1D1 Otolemur garnettii (small-eared galago) GenEMBL WGS seq. 
AAQR01460136.1 N-terminal 6245 MISHLAITPREVTISLVILVIVFVFLRVLRSKGRKQVSPPGPLSFPIIGNLLQLGEHPYL 6066 6065 TFMEMRRQYGDIFLLRLGTVPVVVVNGVEMVKQVLLKDGEYFAGRPNMHTFSFLAEGKSL 5886 5885 TFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVTEEASELVKVFVELTSKN 5706 5705 GSFNPRSAITCAVANVVCALCFGKRYDHGDEEFLRIVKTNDDLLKASSAANPADFIPCFR 5526 5525 YLPLRIINAPREFYQALNRFIALQVQDHLTTYDK 5424 CYP1D1 Myotis lucifugus (little brown bat) GenEMBL WGS seq AAPE01629621 MULTIPLE FRAMESHIFTS, BUT NO STOPS, MAY BE SEQ ERRORS 13312 MILDKAITPEEVTTSLIILVIVFVFVRALMSKGRRQVSLPGPWSFPLIGNLLQLGDHPFL 13133 13132 TFTEMRKKYGDVFLIKLGMVPVVVVNGMEMVKHVLLKDGEHFAGRPNMHTFSFLAEGKSF 12953 12952 SFSVNYGESWKLHKKIASSALRTFSKAEAKSSTCSCLLEEQVIEEVSELVKVFAELTSKK 12773 12772 GSFEPRNAITCAVANVVCALCFGKRYDHSDEEFIRIVKTNDDLLKASSAANPADFIPCFR 12593 12592 YLPLRIINAPREFYRALNEFITLHVQDHLTTYDK (0) 12491 11217 DHMRDITDALINTCHKKICTTKXXXLNDDE II STVNDIXGA (1) 11131 10594 GFETVSTCLYWSFLYLIYYPEIQARIQEEI (1) 10415 DGNIGLKPPRFEDRKMLPYTEAFINEVFRHASFIPFTIPHC (2) 10293 8366 TTADTTLNGYFIPKNTCTFINMYQVNHDE 8280 5747 TIWDIQS VFSPERFLNENRELNKSLXX 5610 5601 KVLIFGMGIRKCLGEDVARNEVFLFITMVLQQLKLHKCPRAELDLTPTYGLAMKPKPYQL 5422 5421 QAEPRSADSAS* 5386 CYP1D1 Tupaia belangeri (northern tree shrew) GenEMBL WGS seq. AAPY01014831.1 N-terminal 1294 MIFHLAVTPGEVTITLIILVVIFVFVKTLGNKGRKRLSPPGPWSFPIIGNLFQLGDHPYL 1115 1114 TFMEMRKKYGDVFMLRLGMVPVLVVNGMEMVKQVLLKDTEHFAGRPDMHSFSFLAEGKSL 935 934 SFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVTEEVSELVKVFTESTSKN 755 754 GSFDPRNAITCAVANVVCALCFGKRYDHSDKEFLRIIKTNDDLLKASSAANPVDFIPCFR 575 574 YLPLRIINAPREFYRALNKFIALHVQDHITTYDK 473 CYP1D1 Sorex araneus (European shrew) GenEMBL WGS seq.
AALT01503634.1 12376 MIFNVAVNSGDLSTSLIVFVVVFVIVRALGSKGRKQGFPPGPRALPILGNLLQLGDYPYL 12197 12196 TFMEMRKKYGDVFLIRLGMVPVVVVNGMETVKQVLLKDGEKFAGRPKMHTFSFLAEGKSL 12017 12016 SFSVNYGESWKLQKKIASNSLRTFSKAEAKSSSCSCLLEEHVLEEVSELISIFEKLTSEN 11837 11836 GSFDPRNAITCAVANIVCALCFGKRYDHSDEEFLRIVKTNDDILKASSAANPADFIPCFR 11657 11656 YLPLPIVNGPRKFYRALNQFISLHVRDHYTTYDK 11555 9964 QDHIRDITDALISTCQNKYSSKKATLNDDEVISVVNDIFGA 9842 6041 GFETVSTCLYWSFLYLIQYPEIQVKVQEEI 5952 5868 IGLKSPTFEDRKILPYTEAFITEVFRHASFIPLTIPH 5758 2010 TVDTTLNGYFIPKKTCTFINMYQVNHDE 1927 CYP1D1 Echinops telfairi (small Madagascar hedgehog) GenEMBL WGS seq. AAIY01323088.1 1272 MMFDSAAVPGEVTASLLVLVIVFVFIRARESQEGKKIPPPGPWSFPIIGNLLQLGAHPYL 1093 1092 TFMEMRKKYGDVFLIKLGVVPVLVVNGMEMVRRVLARDGEHFAGRPAMHTFSFLAEGKSF 913 912 SFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVAEEVAVLVRAFAELTSTN 733 732 GSFEPRSVITCAVANVVCALCFGKRYEHSDEEFLKVVQTNDELLKASSAANPADFIPCFR 553 552 YLPLRIINAPREFYQALNQFITRHVQDHLTTYDK CYP1D1 Loxodonta africana (African Elephant) GenEMBL WGS seq. AAGU01360158.1 9163 MIFSLAVTPGEATTCLIVLVIVFVFVRALRNRDGKQVSLPGPWSFPIIGNLPQIGDHPYL 8984 8983 TFMEMRKKYGDVFLIRLGMVPVVVVNGMEMVKQVLLKDGEKFAGRPNMHTFSVLAEKKSL 8804 8803 SFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVTEEVSELVKVFAELTSKN 8624 8623 GSFEPRSVITCSVANVVCALCFGKRYEHNDEEFLQIVKTNDELLKASSAANPADFIPCFR 8444 8443 YLPLGVINAPRKFYQALYQFIALHVQDHLTTYDKVRI 8333 6611 QDHIRDITDALINTCHNKHAATKTATLNDDEIINTVGDLFGA 6486 24XX GFETVSTCLYWSFLYLIRYPEIQAKIQEEI DGNIGLKSPRFDDRKILPYTEAFVNEIFRHASFFPFTIPH 2139 CYP1D1 Monodelphis domestica (gray short-tailed opossum) GenEMBL XM_001373076.1 72% to 1D1P human not a pseudogene Built_from_Q9PTY7_and_others 405900 - 420186 bp (405.9 Kb) on chromosome fragment scaffold_15058 This transcript is located in sequence: contig_41044 MFVIETISKEVTISFLVLMIVFIFIRALGNRNKKHMSPPGPRPFPIIGNLLQLGDHPYLTFMEMKKKYG DVFLIKLGMVPVVVVNGTEMVKKGLLKDGENFAGRPHMYTFSFFAEGKSLSFSVNYGESW KLHKKIAMNALRNFSKAEAKSSTCSCVLEEHVTEEASELVKIFSKLSLKQGSFDPKSSIT CAVANVVCALCFGKRYGHFDKEFLRIIKTNEEFLKASSAANPADFIPCFRYLPLRIIHAP 
REFYCQLNHFIEQHVQDHITTFDKNHLRDITDALVSICRDKSATIKTATLSDNEIISTVS DIFGAGFETVSGFLHWSFLYLIYYPEIQAKIHEEIDGIIGFKPPRFKDRKNLPYTEAFIN EIFRHTTFVPFTIPHCTTKDTTLNGYFIPQKTCVFFNMYQVNHDETLWENPDSFQPERFL NEKGEMNKNLVEKVLIFGMGIRKCLGEDVARNEVFIFIVSILQQLKLKKCPEVQLDLTPV YGLVMKPKPYQLIVEPRFHVNSST* CYP1D1 Ornithorhynchus anatinus (duckbill platypus) GenEMBL AAPN01253410.1 16801-19436, AAPN01253411.1 386-472 AAPN01253413.1 1531-1812 74% to 1D1 opossum MIPGELTTSLLMLVIVLISINVLRNRGQKPPSPPGPWALPVIGNLLQLGEHPYLSFIEMR KKYGDVFLIKLGMVPVVVVNGMEPVKRVLFQDGENYAGRPNMHTFSFFANGKSLSFSTNY GDSWKHHKKMAINALKSFSKAEAKSSTCSCLLEEHVCGEVSELVKIFTELTATQGNFDPR GSLTCAVANVVCALCFGKRYEHTDEKFLKVIKINDDLLKASSAVNPADFIPCFRYLPLRV VNAPREYYHMLNQFIMQHVQEHYVTYDE (0) GYLRDITDALISICYDKNSTGKTPILPDDTIISTVNDIFGA (1) GFDTVSTCLNWSFLYLINYPEIQTKIQAEI (1) DGNIGLKPPRFEDRKNLPYTEAFINEIFRHTTFLPFTIPHC (2) TTADTILNGYFIPQKTCVFVNIYQVNHDE (2) TLWEKPDLFRPERFLNENGELNKGLVEKVLIFGLGIRKCLGEDVARNEIFIFITNVLQHL KLEKCSGAQLDLTPVYGLSMKPKPYHIKAEPRF* CYP1D2P Ornithorhynchus anatinus (duckbill platypus) GenEMBL AAPN01177473.1 87% to CYP1D1 Ornithorhynchus processed pseudogene no introns DDTIISTANDIFGAGFDTVSTCLSRRFL*LINYREIQTKIQAEIDGNIGQEPPRFEDRKNLP FTEGFINEIFRHTTFLPFTIPHCTTADISGYFIPQKTCIFVNKYQVNHDETLWENPDLFRPERFLNEN CYP1D1 Anolis carolinensis lizard FG695750.1 FG777243.1 FG739979.1 FG695729 ESTs Genomic AAWZ01004734.1 63% to 1D1P human MFFSTEVSFSEVTITLFVVAAIFISIHMLMKTKRPHPPGPWSLPILGNLLQVEEHPYI 231 SFQRMRKKYGDVFQIKLGMVPVVVVNGLDAVKQVLLRDGESFAGRPDMHTFSFFADGDSM 411 SFSVNYGESWKLQKKIAGRALKLLSKSEAKSSTCSCLLEEHVCDEASELVKILLELSKN 588 GGFDPAAVTTCTAANVVCALCFGKRYNHNDEEFLGVIKLNDDFVKASSAFNPADFIPCLR 768 YLPLPAAKVARTFYRKLNDF 828 VSACVEYHCTTYDK (0) NYVRDITDALINVGNEKKEDGKTAALSDKKIISTVNDIFGA (1) GFSTVSACLLWIYLYLISKPEIQTKIQEEI (1) GLRPPRFDDRKYLHYTEAFINEIFRHCSFLPFTIPHC (2) STTRDAVLNGYYIPQSTCIFINMYQVNHDE (2) RDVWEDPYSFKPERFLNESGELNKSLVEKVLIFGMGIRKCLGEELARNEVFVIITTIL QQLRLEKPPEDKLDLTPMYGLTMSPKPYRLQAALRT* CYP1D1/CYP1A8PX ortholog Xenopus tropicalis (frog) This is not a pseudogene in frogs It 
needs a new subfamily name, since it is Separate from the CYP1A subfamily Renamed CYP1D1 DN053435 DN024870 DN024871 mate pair to DN024870 DN025714.1 51% to CYP1A8P ortholog MESAVKKTLMDMMPMLLKASISFLTVLLVMSILWKKRNSLPGPWAVPI VGNFFQLGDQIHITLTDMRNRYGDVFQIKLGLMPIVVVSGLETVKRVLLKEGENFADRPN FYSFSLFSNGSSMTFSEKYGESWKIHKKIMKNALRNLSNESTNSSNCSCRLEEYVCAEAS DLVQELTDLSAEKVAFDPSQSIVITVANVVCALSFGKRYDHHDKEFLTLIDFNNDLRKA AGGGLLADFIPILRFIPSSSVKALKKFVQSFHSFIAKCVKDHFATFEENNIRDITDA LIQLCKERKSEDKNQLLSDDQIISTVNDIFGAGFDTITSALLWAIFYLLRYPEFQDKIHK EIEEKIGCNRAPRFNDRKDLHYTEAFINEVLRHSSFVPFGLPHCTTMDTKLNGYFLPKGT CVFTNLYQVNHDNTVWKDADMFMPERFLDQNGQIIKSLTEKVLVFGMGVRKCLGEDVARN EMFVIMTIMMQRLKLVKSTKHELDPIPVYGLTLKPKPYYLVAKVRT* CYP1D1 Danio rerio (zebrafish) GenEMBL NM_001007310 5 introns Note: CYP1C has no introns, 1B1 has 1 intron (not shared with 1D1) CYP1A zebrafish has the same five introns 50% to CYP1A7 Xenopus, 49% to mouse Cyp1a1, 46% to 1A zebrafish 41% to 1C2 zebrafish, 36% to 1B1 zebrafish 89108 MNLENISHTATSEVTLILCAFALLLLALHGRRRAPGVPVPPGPRPWPIVG NFLQMEEQVHLSLTNLRVQYGDVFQVKMGSLVVVVLSGYTTIKEALVRQGDA FAGRPDLYTFSAVANGTSMTFSEKYGEAWVLHKKICKNALRTFSQTEPKDSNASCLLE ERICVEAIDMVETLKAQGEEFGDSGIDPVQLLVTSVANVVCTLCFGKRYSHNDKEFLT IVHINNEVLRLFAAGNLADFFPIFRYLPSPSLRKMVEFINRMNNFMERNIMEHLVNFDT (0) 89938 94917 NCIRDITDALIAMCEDRQEDKESAVLSNSQIVHSVIDIFGA (1) 95039 95618 GFDTIITGLQWSLLYLIKFPNIQDKIVQEI (1) 95707 98382 DNQVGMDRLPQFKDRPNMPYTEAFINEVFRHASYMPFTIPHC (2) 98507 98613 TTENITLNGYFIPKDTCVFINQYQVNHDI (2) 98700 101355 EIWDDPESFRPERFLTLSGHLNKSLTEKVMIFGMGIRRCLGDNIARLEM FVFLTTLLHRLHIENVPGQELDLSSTFGLTMKPRPYRIKIIPRN* 101636 CYP1D1 Pimephales promelas (Cyprinid fish) GenEMBL DT309726.1 EST testis About 80% to zebrafish 1D1 69 MYLEEISRTTNVTSGLTLFLCAFALLLLALHGRRRGPGCSFPPGPKPWPLVGNLFQMGEQ 248 249 IHLSLTNLRVQYGDVFQVQMGSLVVVVLSGYSTIKEALVRKGEAFAGRPDLFTFSAVANG 428 429 TSMTFSEKYGEAWVLHKKICRNALRTFSQAEPRDSSASCLLEEHICTEAMEMVKALKEQG 608 609 DK 614 missing some sequence here 614 GNLADFFPIFRYLPSPSLRKMVQHIGRMNSFMECNIREHLITFDRNCIRDITDALIAMSE 
793 794 DRQEDEETAMLSNSQIVHSVIDI 862 CYP1D1 Callorhinchus milii (elephant shark, Chondrichthyes) GenEMBL CW874708.1 CW863449.1 GSS sequences AAVX01473941.1 WGS Trace archive files 1573350467 (exon 5) 1574214913 (exon 6) 1573943089 (exon 2) About 67% to Gasterosteus aculeatus (stickleback) 1D1 PVEPITSTVANVICALCFGKRYEHNDKEFLNIVHTNHEVMRTFASGNVADVFPFFRYLPS PSLKSMIKFVNRLNNFMIKSIQEHYTTFDK GFDTIITGLQWCLLYLIQYPEFQTRIQQEI (1) 144 DEKVGQSRLPRFEDRTLLPFTEAFINEVFRHTTYMPFTIPHC (2) 19 TTASTTLNGYFIPKDTCVFINQYQVNHDE (2) CYP1D1 Oryzias latipes GenEMBL BAAF03028505.1 WGS seq 69% to zebrafish 1D1, only 48% to CYP1A 25653 MLSGTLPIA 25626 ESLSASLSSVTVVLFLIALGLMAIRVQKSRSSPFNVKDDSHLDLTAFPSPPGPTPWPIVG 25447 25446 NLFQMGNQMHLSLTLLRAKHGDVFK (0) 24429 LRLGSLPVVVLSGYNTIRQALVRQGEDFAGRPELFTFSAVADGTSMTFSEKFGPAWLLH 24253 24252 KKLCKNALRSFSQAAPRGSGATCLLEEHVCAEAAEMLEMIREQSAKVELDSEMTDGASKG 24073 24072 VDPVKPLVTSVANVVCALCFGKRYDHNDKEFLTIVNINNEVLKLFAAGNLADFFPVFRYF 23893 23892 PSLSLKELVQYIRRMNGFMERRIEEHMHTFDK (0) 23800 23189 NYIRDITDALIALCEDREKSKEMSLLSDTQIIHSVIDIFGA (1) 23067 22979 GFDTIIAGLQWSLLYLIKFPDVQRRIHQEI (1) 22890 20183 DEHIGSARMPNFSDKSKMPFTEAFIYEVFRHAAYVPFTIPHC (2) 20058 19961 TTRHTTLNGYFIPKDTCVFINQYQVNHDK (2) 19875 19791 DLWGDPEQFCPDRFLGHSGQLNKELTEKVLIFGMGKRRCLGDGFARLEMFVFLATLLHGL 19612 19611 RIENVPGQKLDLGTDFGLTMKPHPYKITVSSRFTEM* 19501 CYP1D1 Gasterosteus aculeatus (stickleback) GenEMBL AANH01001861.1 77% to Oryzias 1D1 54662 MRVTFGIFPIKENTCASLSSVTVVLCLINLLLMALVCRKNHCHNSRLDHTKYPTPPGPT 54486 54485 PWPLVGNLLQMGDQIHLSLTRLRLQYGDVFK (0) 54393 54293 MRLGSLTVVVLSGHNTIRQALVRQGEAFAGRPDLFTFSAVANGTSMTFSEKYGPAWMLHK 54114 54113 KLCKNALRSFSRAEPRESGATCLLEEHVCAEAAEMVEVMYEQAAAEREMGHKVMGI 53946 53945 DPVVPVVTSVANVVCALCFGKRYDYNDKEFLTIVHINNEVLRIFAAGNMADFFPVFRYFP 53766 53765 SPSLRKMVQHIQRMNGFMERSIEEHINTFDK (0) 53673 53010 NYIRDITDALIALCEDREENQDTSLLSKSQIIHTVVDIFGA (1) 52888 52795 GFDTIIAGLQWSLLYLIKYPDIQDRIHQEI (1) 52706 51800 DDHIGIARLPMFSDKPKMPFTEAFMYEVFRHASYVPFTIPHC (2) 51675 51589 TTRNITLNGYFIPKDTCVFINQYQVNHD (2) 
51506 51396 DLWGDPDRFRPARFLGSLGLLNKELTEKVLIFGVGKRRCLGDGLARLEMFVFLTTLLHRT 51217 51216 RIENVPGQQLDLSTDFGLTMKPRPYRITISSRF* 51115 CYP1E1 Ciona intestinalis (sea squirt) JGI Ciona genome ver.2 gene model 131189 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1As, but only about 33% identical to CYP1As Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution MMITAAILLDAGRSFAVPVAFTAVSVLTLYVCLRKRQGIPPGPTAWPLVGNL FSMGRQSHLILESMRKTYGDVFSVYFGSTLVVVVNGKAVEECLSTHSAR (2) YSMRPELHTAQYILEGKSFAFSHIAVSKHKRYRTLAVAVVKQLVNGGGEKTDVAV KHGLQNGTRHSSIEERIFMEAACMCDKLLETSDSPDLKDEILKVITKEL (2) LSEYELDEISRVVENLRNSNEAIMLVNFIPAVRMLWRNGLQKYIQLTQSLNR (2) FFERCIRNRKAQLATVSNGHTEDNGVRLTNGVDCTVKFWQKLKNDPQYEESRVMKV (0) VADLFGARVDTMTVALAWMIVYWSTYQAAQERAQKEIDHFVKNEKRLPR (2) YSERNQLPYTMALIMEVERHCSFVPFTLPHAPAQDTMLNGYLIPKGTMMLISMRSINHDTAVWDSPAQFR (2) PERFLLDQSGGFNSALAEQVMLFGAGRRRCAGEALGRMQIFLYSVLFLRKCTFRR SDKDGHVLPESLAGISLIPQTMCVSISRREADGSKNTEP* CYP1E1 Ciona savignyi (sea squirt) Ortholog of C. intestinalis CYP1E1 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1As, but only about 33% identical to CYP1As Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution 75% identical to C. 
intestinalis CYP1E1 paired_scaffold_63 595236 SICLPITAFALSLIYLHRRKRDNLPPGPFAWPVLGNLLSLRSNSTAALEEIRRTYGDV 595063 595062 YSLYFGSRLVVVVNGKAVEECLSTRSAK 5949795 594724 RFSMRPELFTAQYVLGGKSFAFSHMDVETHRRYRKLAVGVVKELLVSTHERSQPTTMEEV 594545 594544 NRIPPQSIEDQIYAQAKRLCVGLFDIYASNSKSGQLDIRKEIMRRISFEM 594395 594161 LWEHELADLSELVEDLRNSNDATLILNFIPISRYLWKKGLRKYIKINQDLNK 592629 FFSRCFDRRNPHVANGSDCCKSEETCDVLSGIDCVLKLWQQLKDDPQFEENRVMKLVRKLFKCN 592438 591699 VGDLFGANVDTMTVALAWMIVYWSTYHQAQTRAQEEIDRFVETNFHLPRY 591550 591042 RYSDRSQLPFVMALIWEVARHCSFVPFALPHAPVEDTTLNGYLIPSGTVMMISMRSVNHDQTLWDS 590845 590844 PGEFR 590830 590562 PERFISSETGVFNKGLADRVMLFGGGRRRCAGEALARMQLFLFSVSILRSCTIRRVDHS 590386 590385 DVLPD 590371 CYP1F1 Ciona intestinalis (sea squirt) JGI Ciona genome ver.2 gene model 136792 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution MLVQILTATFWTLIP NSFGDLLIYAILVLTIVIYVKSLKRDKEWLALPGPIPW PLVGNA PFLGAEPHKKLLELSL KYGPVYRLKMGGIKTVVLCNAEVVRSALIKQREAFSGRPKFSSYKAVS AGESVVFNDEET LPP WRSH KSKIVRHMHKYTTSIRTRDKVTDLINTECMMMVTELDRISRSKCVNPENVIRM ALANVMCAVCFGNRFEYDNE (0) EFQKLLSMNTEFGAVIELGPIIDAMPWIK (0) VIPKFKKAIADYLKINLQLDTWSRHR (2) VDGVLKTFDNDDVTNVVASMTSEVLEKKSAGESREITESETKTIAALSADILGA GQHTTSTTFFWVINLLLCFPKVLNKLTEEVRSKLGNRLPTLEDRTSLPYMDAVLTE VLRFSSPLSSTIPHSTLKDVKLAGHTIKRGTMVIISQYAVNHDPQNWKNPENFDPERFLTK NEGGEIIFNESLSEKVLAFSIGERKCPGSQLSRMLLFLATTLLVQVSDLSADLERPPT AAAEYGLILRPKHLSIKLTLREHWQRRDSIRA* CYP1F1 Ciona savignyi (sea squirt) Ortholog of C. intestinalis CYP1F1 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution paired_scaffold_56 66% to C. 
intestinalis CYP1F1 957040 VLIYISMVSIVVIYVKSVKRNKEFMALPGPTPWPIVGNAPFLGKQPHKTLLQLSQK 956873 956872 YGPIYRLKMGSVEAVILCDLDVIRCALIKQREVFSGRPKFESYKAVSAGESVVFNDSESL 956693 956692 APWKSHKSKILRHLHKFATSVRTKEKVNNIITTECMLMLQCLHRRSQDGFVDPEDVIRMT 956513 956512 IANVMCAVCYGNRFEYENE 956456 950636 GQHTTSGTFFWVINILLFYPKVLQRITNEVRSKIGERIPTLEDQADLPYVEAFLTEV 950466 949639 VLRFASPLSSTIPHSTTKDTTLKGYKIKRNTMVIISQYSVNHDPKIWRNPEVFDPERFLTRDENTNLVFND 949427 949426 ALAEKVLSFSVGERKCPGSRMSQMVLFLATCLLVHTGTLYPNPDRPPS 949283 949282 PVDDAQYGLILRPEYISMKFLLDKKW 949205 CYP1F2 Ciona intestinalis (sea squirt) JGI Ciona genome ver.2 gene model 143263 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution MDSLVFVLVDTVLVMKYQILLLLVIVYAIKLLAASQSRRLNIPGPYPWPVIGNVIEMGGQPQFSLTNMAK (?) RYGPVYLMKLGTADVLVLNNYEVIKEALLRQRRIFGGRPIFDSFKKISQGLGVVFNSTMT QGDEWMKLKMTIVKHVHRFVSSEETKGYVAHHVQMEAVELVRILTEKCRS SPNEVIFPIEQINLAIANVVCAIMFGHRYQHGNK (0) EFQDLISLNEQFGDVIGSGSQVDVIPWMK (0) IFPKFRNALKVFDFLTNRLNNWMRLR (2) TKEHRLTYKHGVIRDIVDSFIAESIDHPEQSALNDDVIMALTTDVFGA GQDTMSTTMQWVFVYMMHFKECQRK IHAELDSVIGPGELPHISDRRRLPYLEAVMHEIFRHSTFTSTTIPHVTTQDTVLDGHFIP KGILVFINQFGANHDPNHWVDPDKFIPERFLDGKGNLISRPHDRYLLFSTGARKCPG DELSRMLILHFMATMFALCEVSSDPQKPATL DAVYNLSMRPKELRTIVRS RNLPFLKNSVAQMSEADSHVLTVPGETTSFLTSRVESTVPDNQESQFSDNDFEKVDTKIP KRKVFSRPTLTHDDINGNNVRKRGNLHQSAMYRIQLAT* CYP1F2 Ciona savignyi (sea squirt) Ortholog of C. intestinalis CYP1F2 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution paired_scaffold_142 77% to C. 
intestinalis CYP1F2 222183 FRRYGPIYLIKLGTADVLILNNYDVIKEALIRQRGVFSGRPVFESFKKISQ 222031 222030 GRGIVFNSSLTQGAAWQRMKMTIVKHLHRFIASPQTKGFVAGHVQKETVQLVHILSEKCR 221851 221850 SSTNQAIEPVENINLAVANVVCSIMFGHRYQHGNK 221746 219363 LHRTREHRQSYKHGVIRDLVDSFIAESIDKPGQLLNDDVIMALTTDVFGAGQDT 219202 219201 MSTTLQWIFVYMMRFKECQKK 219139 218667 IHAELDSVLKPGSLPQIKDRARLPYLEAVMHEIFRHSTFTTTTIPHVTTEDTVLRGYHLPKET 218479 218478 LIFINQYAANHDPEHWVEPDKFIPERFLDEKGNLISRPHDRYLLFSTGSRKCPGDELSRM 218299 218298 LILYLMANIFTLCEISPDPNQPTTLDAVYTLSMRPKNVKTVVRVR 218164 CYP1F3 Ciona intestinalis (sea squirt) JGI Ciona genome ver.2 gene model 138492 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1Bs and CYP1Cs, but only about 33% identical to vert CYP1s Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution LPYPRGLPIIGNIHQMGNFPHVKLTEWSKQFGDFYRIKMGRYDALVVNGHENIR (2) NCLAKKSAAFAGRPPFETSKLIEEGLSISFSNYS (2) PEWERQKQCTIKALKLYTSGSDKRSTMEETVSSHAKQLAEDLINSADQQ (0) GLVGDLHDTVIYSTTSVSSTICFGRSFTRQDPELKEFLRNFQSFDKAMGASQIINFWPFLKYFPVLGKSFR (0) NLKTYMDQYWNFTLSMLEQHWDTYVPNNMRDLADCLWAQSNQ (0) NRQLTDQQRRIAYGASDAFGAGFDTISAMITWSIFYMAVFPEHQRK (0) IREEIDRLETSMFSLRHHGDVCPYTQAWLYEVLRH ISVSPLLVPHYTVKQVEVNGTMIPAGVVVLFNVAN (0) ADRDTRVWENPEQFEPERFLARDPTTGGARVVASETSKI LNWGAGKRRCPGAELSRHELFIYIANLVKLCYIE QAVEGIEPAIPWPCTPGISTKPKAFRVKVTQR* CYP1F3 Ciona savignyi (sea squirt) Ortholog of C. intestinalis CYP1F3 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution paired_scaffold_3 56% to C. 
intestinalis CYP1F3 LPSPRGLPIIGNVHQLTTSPHVKLSEWAKEFGDLFRIKMGCFDTLVVTGYDNIR (2) TALVKHSVAFAGRPPYETSKLFSNGLSLAFNNY (2) SPAWEKQKRCTVKALKLYTAGPDLQKRNAMEDTASYQANLLVDQLLASVNK (0) DAITNPDEIVHHSATNVISNICFGRSFSKNDPELQKFVSINRAFDRAMGSAQIVNFWPFLKSVPVLGRSYQ NLKAHMDVFWDFVFPNLKEHWKTYNPSNIRDIADCLWYQSH TSSKRDLQRRIASAASDIFGAGYDTTHKVVLWSLFYMAAFPQYQQKV RDIFRVSEVKMY TLRHHGDECPYVQAWIYEVLRHTSLAPILLPHYTTKEVTLNGVRIPAGVV KKYHTIQAHKDPKIWKNPDEFDPGHFLEEDGSKLRSEAVHKLLSWGAGKRRCPGAELSRHE IFVFVTTLVRRAYIGQAVDGVEPAFPWNTTGGISISPDPFRVKITER CYP1F4 Ciona intestinalis (sea squirt) JGI Ciona genome ver.2 gene model 132188 Clusters inside the vertebrate CYP1s on NJ trees Closest to CYP1Bs and CYP1Cs, but only about 29% identical to vert CYP1s Note: the Ciona genome is greatly diverged from the Vertebrate line and seems to be undergoing rapid evolution No ortholog is found in C. savignyi MESVWVVIKWVKETMMSNSSFETIVAVATLLLLLMFVSENWNWLKIPGPI PWPIIGNLGSLKGTKFLSIHEMYKIYGRIFRLKFGRVEAVVLCDVELIKE ALLDRGRSLSGRPQFASYRLVSGCKSVVTNDPRCLREWVNY KSTMVQTLCSISKNNEMKELMNERIGSVLVYMIQELEKGGDGQNFAEDIVTKTVANFLCT VCYGGTYDFNSK (0) EFNNLIEMSRHYTDNLSKSILRDMIPLAE (0) ILPSVNKGRADFAKTSYHLHLWFLKR (2) VEEVIQHFQPNKLNDLASVMVSDLTNDPTENISNITEKDRNSIAAIINDLVQ (1) GYHSLYSMALWVVTYMIKYPEEVKKIENELNEVLDDYLPTLHDQESLPHTMAFINE (0) VLRCRPSLPLAVPHSATEDTKLGGYDISKDTMVVASLYSANRDPKVWANPDQFDPSR FLAKDDLGVTVLDETKVEQVFTFSLGDRKCPGEDIGRSFLFLTTAYLAHTCKLKPDPAK PPTFQTKPGSITRPKDFGVQLNVKKCWLGVFKPDDNEE* 2A Subfamily CYP2A1 rat PIR C41425 (12 amino acids) Imaoka, S., Kamataki, T. and Funae, Y. Purification and characterization of six cytochromes P-450 from hepatic microsomes of immature female rats. J. Biochem. 
102, 843-851 (1987) CYP2A1 rat GenEMBl J02669 1 aa diff to genome seq (lower case) 82084958 MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGN YLQLNTKDVYSSITQLSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGE QATYNTLFKGYGVAFSSGERAKQLRRLSIATLRDFGVGKRGVEERILEEAGYLIKMLQ GTCGAPIDPTIYLSKTVSNVISSIVFGERFDYEDTEFLSLLQMMGQMNRFAASPTGQL YDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE EKNGNSEFHMKNLVMTTLSLFFAGSETVSSTLRYGFLLLMKHPDVEAKVHEEIEQVIG RNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPKaTDVFPI LGSLMTDPKFFPSPKDFDPQNFLDDKGQLKKNAAFLPFSTGKRFCLGDGLAKMELFLL LTTILQNFRFKFPMKLEDINESPKPLGFTRIIPKYTMSFMPI CYP2A1 rat NP_036824 88% T0 2A2 chr1 (+) Cyp2a22 ortholog 82084958 MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGNYLQLNTKDVYSSITQ 82085134 82085434 LSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGEQATYNTLFKGY 82085595 82088031 GVAFSSGERAKQLRRLSIATLRDFGVGKRGVEERILEEAGYLIKMLQGTC 82088180 82088398 GAPIDPTIYLSKTVSNVISSIVFGERFDYEDTEFLSLLQMMGQMNRFAASPTG 82088556 82089778 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82089957 82093158 EKNGNSEFHMKNLVMTTLSLFFAGSETVSSTLRYGFLLLMKHPDVE 82093295 82093737 AKVHEEIEQVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82093925 82094440 GTDVFPILGSLMTDPKFFPSPKDFDPQNFLDDKGQLKKNAAFLPFST 82094580 82098022 GKRFCLGDGLAKMELFLLLTTILQNFRFKFPMKLEDINESPKPLGFTRIIPKYTMSFMPI 82098201 CYP2A1-de2b rat exon 2 pseudogene Chr1 (-) only 240 bp from CYP2A1 start Met frag e in fig below 82084718 YNAVKEALVDQAEGFSGQGEQA 82084653 rat, mouse and human 2ABFGST clusters CYP2A2 rat PIR S26821 (27 amino acids) Matsumoto, T., Emi, Y., Kawabata, S. and Omura, T. Purification and characterization of three male-specific and one female-specific forms of cytochrome P-450 from rat liver microsomes. J. Biochem. 
100, 1359-1371 (1986) CYP2A2 rat J04187 Cyp2a12 ortholog 82117349 MLDTGLLLVVILASLSVMFLVSLWQQKIRERLPPGPTPLPFIGNYLQLNMKDVYSSITQ 82117525 82117991 LSERYGPVFTIHLGPRRIVVLYGYDAVKEALVDQAEEFSGRGELPTFNILFKGY 82118152 82123228 GFSLSNVEQAKRIRRFTIATLRDFGVGKRDVQECILEEAGYLIKTLQGTC 82123377 82123595 GAPIDPSIYLSKTVSNVINSIVFGNRFDYEDKEFLSLLEMIDEMNIFAASATG 82123753 82124978 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82125157 82139054 EKYVNSEFHMNNLVMSSLGLLFAGTGSVSSTLYHGFLLLMKHPDVE 82139191 82139607 AKVHEEIERVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82139795 82140311 GTDVFPIIGSLMTEPKFFPNHKDFNPQHFLDDKGQLKKNAAFLPFSI 82140451 82141451 GKRFCLGDSLAKMELFLLLTTILQNFRFKFPMNLEDINEYPSPIGFTRIIPNYTMSFMPI 82141630 CYP2A2-de2b rat exon 2 pseudogene Chr1 (-) frag f in fig below 82115528 LKPHWVVVLYEWDAVKEALGDQAEELSG*GEQANL 82115445 rat, mouse and human 2ABFGST clusters CYP2A3 rat J02852 NM_012542 exon 4 in a seq gap in genome seq chr1 (+) mouse Cyp2a5 ortholog 82023007 MLASGLLLVASVAFLSVLVLMSVWKQRKLSGKLPPGPTPLPFIGNYLQLNTEKMYSSLMK 82023186 82023453 ISQRYGPVFTIHLGPRRVVVLCGQEAVKEALVDQAEEFSGRGEQATFDWLFKGY 82023614 82024296 GVAFSSGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIESFRKTN 82024445 GALIDPTFYLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTG 82026488 QLYEMFSSVMKHLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMLE 82026667 82028068 EKKNPNTEFYMKNLVLTTLNLFFAGTETVSTTLRYGFLLLMKHPDIE 82028208 82028659 AKVHEEIDRVIGRNRQAKYEDRMKMPYTEAVIHEIQRFADMIPMGLARRVTKDTKFREFLLPK 82028847 82029417 GTEVFPMLGSVLKDPKFFSNPNDFNPKHFLDDKGQFKKSDAFVPFSI 82029557 82030741 GKRYCFGEGLARMELFLFLTNIMQNFCFKSPQAPQDIDVSPRLVGFATIPPNYTMSFLSR 82030920 CYP2A3-de1b rat exon 1 pseudogene Chr1 (+)frag d in fig below 82052140 MLGSRLLLVAVLSCLCVMVFMPVWQQQYRDTIPPG 82052244 rat, mouse and human 2ABFGST clusters Cyp2a4 mouse GenEMBL J04631 (multiple genomic fragments) PIR A30499 (494 amino acids) PIR A33531 (494 amino acids) Lindberg,R., Burkhart,B., Ichikawa,T. and Negishi,M. 
The structure and characterization of type I P-450-15-alpha gene as major steroid 15-alpha-hydroxylase and its comparison with type II P-450-15-alpha gene J. Biol. Chem. 264, 6465-6471 (1989) Cyp2a4 mouse PIR S16067 (494 amino acids) Squires, E.J. and Negishi, M. Reciprocal regulation of sex-dependent expression of testosterone 15-alpha-hydroxylase (P-450-15-alpha) in liver and kidney of male mice by androgen. Evidence for a single gene. J. Biol. Chem. 263, 4166-4171 (1987) Note: 2a-4 and 2a-5 differ at 11 positions. This sequence is 2a-4 like at 9/11 positions. Cyp2a4-de7b mouse GenEMBL AC087157.1 + strand w in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004) detritus exon 7 between Cyp2a4 and Cyp2b9 37037 AKIHEEINQVIGTHRTPRVDDRAKMP 37114 37114 YTDAVIHEIQRLTDIVPLGIPHNVT 37188 37190 RDTHFRGY 37213 Cyp2a5 mouse GenEMBL J04631 (multiple genomic fragments) PIR B30499 (494 amino acids) PIR B33531 (494 amino acids) Lindberg,R., Burkhart,B., Ichikawa,T. and Negishi,M. The structure and characterization of type I P-450-15-alpha gene as major steroid 15-alpha-hydroxylase and its comparison with type II P-450-15-alpha gene J. Biol. Chem. 264, 6465-6471 (1989) Cyp2a5 mouse PIR S16068 (494 amino acids) Squires, E.J. and Negishi, M. Reciprocal regulation of sex-dependent expression of testosterone 15-alpha-hydroxylase (P-450-15-alpha) in liver and kidney of male mice by androgen. Evidence for a single gene. J. Biol. Chem. 263, 4166-4171 (1987) Note: 2a-4 and 2a-5 differ at 11 positions. This sequence is 2a-4 like at 5/11 positions, and 2a-5 like at 6/11 positions Cyp2a4 or 5 mouse PIR S03979 (21 amino acids) Lang, M.A., Juvonen, R., Jaervinen, P., Honkakoski, P. and Raunio, H. Mouse liver P450Coh: genetic regulation of the pyrazole-inducible enzyme and comparison with other P450 isoenzymes. Arch. Biochem. Biophys. 271, 139-148 (1989) CYP2A6 human PIR S17220 (20 amino acids) Maurice, M., Emiliani, S., Dalet-Beluche, I., Derancourt, J. and Lange, R. 
Isolation and characterization of a cytochrome P450 of the IIA subfamily from human liver microsomes. Eur. J. Biochem. 200, 511-517 (1991)
CYP2A6 human PIR A61272 (13 amino acids) Yun, C.H., Shimada, T. and Guengerich, F.P. Purification and characterization of human liver microsomal cytochrome P-450 2A6. Mol. Pharmacol. 40, 679-685 (1991)
CYP2A6v2 human GenEMBL U22027 (7215bp) Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez,F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet. 57, 651-660 (1995)
CYP2A7 human GenEMBL U22029 (2282bp) Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez,F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet. 57, 651-660 (1995)
CYP2A7 baboon (Papio sp.) Swiss P80055 (20 amino acids) PIR S21737 (20 amino acids) Dalet-Beluche,I., Boulenc,X., Fabre,G., Maurel,P. and Bonfils,C. Purification of two cytochrome P450 isozymes related to CYP2A and CYP3A gene families from monkey (baboon, Papio papio) liver microsomes. Cross reactivity with human forms. Eur. J. Biochem. 204, 641-648 (1992) MLASGLLLVALLACLTVMVL 100% to CYP2A7 human
CYP2A7PTX human (retired name, see CYP2A18PN) GenEMBL U22030 (1192bp) Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez,F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet. 57, 651-660 (1995) There are two human pseudogenes of 2A7 on chromosome 19. They are located adjacent to each other. This one is telomeric.
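Note on the "% to" annotations: many entries in this list give a percent identity to a related sequence (for example "100% to CYP2A7 human" for the baboon fragment). The list does not say which alignment tool produced these values, so the following is only a minimal sketch of how such a figure could be checked for two ungapped fragments of equal length. The function name percent_identity is hypothetical, and the dog fragment is simply the first 20 residues of the dog CYP2A13 (NW_876270.1) entry in this list.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of positions with the same residue in two ungapped,
    equal-length peptide fragments (no alignment is performed)."""
    if len(seq_a) != len(seq_b):
        raise ValueError("fragments must have the same length")
    if not seq_a:
        raise ValueError("fragments must not be empty")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# N-terminal 20 residues as given in this list.
baboon_2a7 = "MLASGLLLVALLACLTVMVL"      # CYP2A7 baboon, Swiss P80055
dog_2a13_nterm = "MLASGLLLVALLACLTIIVL"  # first 20 aa of dog CYP2A13

if __name__ == "__main__":
    print(percent_identity(baboon_2a7, dog_2a13_nterm))
```

Whole-protein values such as those quoted in the entries would normally come from a gapped alignment, so this position-by-position count only applies to short fragments that already line up residue for residue.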
CYP2A7PCX human (retired name, see CYP2A18PN) GenEMBL U22044 (1192bp) Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez,F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet. 57, 651-660 (1995) There are two human pseudogenes of 2A7 on chromosome 19. They are located adjacent to each other. This one is centromeric.
CYP2A8 Mesocricetus auratus (hamster) GenEMBL M63788 M34446 M34447 (1771bp) Lai,T.S. and Chiang,J.Y.L. Cloning and characterization of two major 3-methylcholanthrene inducible hamster liver cytochrome P-450s. Arch. Biochem. Biophys. 283, 429-439 (1990) clone MC1 note: M34446 is incorrectly included in this GenBank entry and in the 2A9 entry. M34446 should only be in the CYP1A2 hamster entry.
CYP2A9 Mesocricetus auratus (hamster) GenEMBL M63789 M34446 M34448 (918bp) Lai,T.S. and Chiang,J.Y.L. Cloning and characterization of two major 3-methylcholanthrene inducible hamster liver cytochrome P-450s. Arch. Biochem. Biophys. 283, 429-439 (1990) clone MC1-81 3 prime end note: M34446 is incorrectly included in this GenBank entry and in the 2A8 entry. M34446 should only be in the CYP1A2 hamster entry.
CYP2A9 Syrian hamster GenEMBL D86953 Kurose,K., Tohkin,M., Ushio,F. and Fukuhara,M. Cloning and characterization of Syrian hamster testosterone 7alpha-hydroxylase, CYP2A9. Arch. Biochem. Biophys. 351, 60-65 (1998) clone name P450SH2A-1. 1 amino acid difference with MC1-81 of Lai and Chiang (incomplete seq.)
CYP2A10 rabbit GenEMBL L10236 (1641bp) Swiss Q05555 (494 amino acids) Peng,H.-M., Coon,M.J. and Ding,X. Isolation and heterologous expression of cloned cDNAs for two rabbit nasal microsomal proteins CYP2A10 and CYP2A11 that are related to nasal microsomal cytochrome P-450 form a. J. Biol. Chem.
268, 17253-17260 (1993)
CYP2A10/11 rabbit PIR A31944 (23 amino acids) Ding, X. and Coon, M.J. Purification and characterization of two unique forms of cytochrome P-450 from rabbit nasal microsomes. Biochemistry 27, 8330-8337 (1988)
CYP2A11 rabbit GenEMBL L10237 (2484bp) Swiss Q05556 (494 amino acids) Peng,H.-M., Coon,M.J. and Ding,X. Isolation and heterologous expression of cloned cDNAs for two rabbit nasal microsomal proteins CYP2A10 and CYP2A11 that are related to nasal microsomal cytochrome P-450 form a. J. Biol. Chem. 268, 17253-17260 (1993)
Cyp2a12 mouse GenEMBL L06463 (1665bp) PIR S32491 (492 amino acids) Iwasaki,M., Juvonen,R., Lindberg,R. and Negishi,M.M. Site-directed mutagenesis of mouse steroid 7 alpha-hydroxylase cytochrome P-450 (7 alpha): Role of residue 209 in determining steroid-cytochrome P-450 interaction. Biochem. J. 291, 569-573 (1993) Note: called 7 alpha hydroxylase, but this sequence is very different from CYP7 sequences. It is actually a 2A sequence.
Cyp2a12-de1b2b mouse GenEMBL NW_000310 (52646-53186) also NT_039413.1 - strand note: nuc. numbering same in both. Detritus exons 1 and 2 = s in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004) Between 2a12 and 2f2. Old name Cyp2a20p
53186 MTLS 53175 53173 MLLVAVLTCFIAMITMSVLR*KKLLGKMPPGPTPLPFLGNFLELDTKKFYDSFLRVVGREM 52988 52810 IREWYGPVFTVHLGTYSAVVPWGYDVVKETLVDQAEQFSGRGEQAFLDWFFKGYG 52646
CYP2A13 human GenEMBL U22028 (8778bp) Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez,F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet.
57, 651-660 (1995) CYP2A13 Canis familiaris (dog) XM_541608.2 91% to CYP2A13 human There is a second CYP2A in dog CYP2A25 that is 87% to CYP2A13 This seq is the probable ortholog of CYP2A13 Dog cluster order 2S(-), 2B(+), 2G(+), 2A25(+), 2A13(-), 2F(+), 2T(+) Note: this seq is the same as Seq 2 sent by Tom Rushmore On 6/28/05 except for 3 aa diffs CYP2A13 Canis familiaris (dog) NW_876270.1 43229491-43235490 Joanna Wilson and students submitted to nomenclature committee Feb. 17, 2009 92% to human 2A13 probable ortholog MLASGLLLVALLACLTIIVLMSVWKQRKLGGKLPPGPTPLPFIGNYLQLNTEQMYNSLMKISERYGPVFTIHLGP RPVVVLCGHEAVKEALVDQAEEFSGRGEQATFDWLFKGYGVAFSNGERAKQLRRFSITTLRDFGVGKRGIEERIQ EEAGFLIEALRGTRGAFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSMGQLYEMFYS VMKHLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMQEEQNNPNTEFYLKNLVLTTLNLFF AGTETVSTTLRYGFLLLMKHPDVEAKVHEEIDRVIGKNRQPKFEDRAKMPYTEAVIHEIQRFGDMIPMGVARRVI KDTKFREFLLPKGTEVFPMLGSVLRDAKFFSNPQDFHPQHFLDEKGQFKKSDAFVPFSIGKRYCFGEGLARMELF LFLTTILQNFHFKSPQLPQDIDVSPKHVGFATIPRNYTMSFQPR* CYP2A13 Bos taurus (cow) See cattle page for details 90% to 2A13 86% to 2A7 MLASGLLLVALLACLTIMVLMSVWRQRNLKGKLPPGPTPLPFIGNYLQLNTEQMCNSLMK ISEHYGPVFTV HLGTRQIVVLCGYDAVKEALVDQAEEFSGRGKQATFDWLFKGYGVAFSNGERAKQLRRFS ITTLRDFGVGKRGIEERIQEEAGFLIEAFRGTRS AFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTGQ 1 LYEMFYSVMKYLPGPQQQAFKELQGLEDFIAKKVEQNQRTLDPNSPRDFIDSFLIRMQEEKENPNTEFYRK 177 178 NLVMTTLNLFFAGTETVSTTMRDGFLLLMKHPDVEAKIHEEIDRVIGKNRQPKFEDRAKM 357 358 PYTEAVIHEIQRFGDMIPMGLARRVTKDTKFRDFLLPKGTEVFPMLGSVLRDPKFFSNPR 537 538 DFNPQHFLDEKGQFKKSDAFVPFSIGKRYCFGESLARMELFLFFTTIMQNFRFKSPQS 711 712 PQDINVSPKLVGFATIPPNYTMSFLPR* CYP2A13 frag. Bos taurus (cow) PIR A35704 (18 amino acids) Lazard, D., Tal, N., Rubinstein, M., Khen, M., Lancet, D. and Zupko, K. 
Identification and biochemical analysis of novel olfactory-specific cytochrome P-450IIA and UDP-glucuronosyl transferase. Biochemistry 29, 7433-7440 (1990) MXYLPGPQQQAFKELQGL 1 aa diff to human CYP2A13 and one uncalled amino acid
CYP2A13 horse GenEMBL XM_001499763 Heather Knych Submitted to nomenclature committee Oct. 21, 2007 88% to CYP2A13 human, 89% to dog CYP2A13
CYP2A14 Cricetulus griseus (Chinese hamster) GenEMBL D86954 Fukuhara,M., Kurose,K., Aiba,N., Matsunaga,N., Omata,W., Kato,K. and Kimura,M. A major phenobarbital-inducible P450 isozyme, CYP2A14, in the Chinese hamster liver: purification, characterization, and cDNA cloning. Arch. Biochem. Biophys. 359, 241-248 (1998) clone P450CH2A-2 85% identical to 2A3 and 2a5
CYP2A15 Cricetulus griseus (Chinese hamster) GenEMBL AB022916 Kurose,K., Isozaki,E., Tohkin,M. and Fukuhara,M. Cloning and expression analysis of a new member of the cytochrome P450, CYP2A15 from the Chinese hamster, encoding testosterone 7alpha-hydroxylase. Arch. Biochem. Biophys. 371, 270-276 (1999) 91% identical to CYP2A9
CYP2A16 Mesocricetus auratus (Syrian hamster) GenEMBL D86952 Tohkin,M., Kurose,K., Isozaki,E. and Fukuhara,M. Molecular cloning, heterologous expression, and characterization of a novel member of CYP2A in Syrian hamster. Biochim. Biophys. Acta 1446, 438-442 (1999) 94% identical to CYP2A3
CYP2A17 Cricetulus griseus (Chinese hamster) No accession number Kouichi Kurose Submitted to nomenclature committee 11/29/99 86% identical to CYP2A14
CYP2A18PC human pseudogene AC008537 Hoffman,S.M.G., Nelson,D.R. and Keeney,D.S. Organization, structure and evolution of the CYP2 gene cluster on human chromosome 19. Pharmacogenetics 11, 687-698 (2001) C-terminal part of P450 only. This is the opposite end of the pseudogene CYP2A18PN. This gene appears to be split by a 2B6, 2B7P1 insertion.
CYP2A18PN human pseudogene AC008537 Hoffman S.M.G., Nelson, D.R. and Keeney, D.S.
Organization, structure and evolution of the CYP2 gene cluster on human chromosome 19. Pharmacogenetics 11, 687-698 (2001) N-terminal part of P450 only. This is the opposite end of the pseudogene CYP2A18PC. This gene appears to be split by a 2B6, 2B7P1 insertion. This name replaces the old designations CYP2A7PT and CYP2A7PC. There now seems to be only one copy of this pair in the sequenced human genome.
CYP2A18PN human pseudogene (formerly CYP2A7PT) GenEMBL U22030 (1192bp) Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez,F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet. 57, 651-660 (1995) There are two human pseudogenes of 2A7 on chromosome 19. They are located adjacent to each other. This one is telomeric. Note added 4/10/2001: This gene appears to be split by a 2B6, 2B7P1 insertion. This name replaces the old designations CYP2A7PT and CYP2A7PC. There now seems to be only one copy of this pair in the sequenced human genome.
CYP2A18PN human pseudogene (formerly CYP2A7PC) GenEMBL U22044 (1192bp) Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez,F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet. 57, 651-660 (1995) There are two human pseudogenes of 2A7 on chromosome 19. They are located adjacent to each other. This one is centromeric. Note added 4/10/2001: This gene appears to be split by a 2B6, 2B7P1 insertion. This name replaces the old designations CYP2A7PT and CYP2A7PC. There now seems to be only one copy of this pair in the sequenced human genome.
CYP2A19 Sus scrofa (pig) GenEMBL AB052255 Misaki Kojima Submitted to nomenclature committee Oct.
27, 2000 89% to human CYP2A13 clone name c7 Cyp2a20pX mouse GenEMBL NW_000310 (52646-53186) 53186 MTLS (frameshift) MLLVAVLTCFIAMITMSVLR*KKLLGK MPPGPTPLPFLGNFLELDTKKFYDSFLRVVLGREM (0) 52988 52810 IREWYGPVFTVHLGTYSAVVPWGYDVVKETLVDQAEQFSGRGEQAFLDWFFKGYG 52646 renamed Cyp2a12-de1b2b Cyp2a21-ps mouse GenEMBL NW_000308.1, NW_033707.1, NT_039411.1 93% to Cyp2a5 runs off end NW_000308.1|Mm7_WIFeb01_154 also on NW_033707.1|MmUn_WIFeb01_40262 t in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004) between 2a22 and 2a12 NT_039411.1 + strand seq = 20,879bp runs off end 15607 FFLGKRGIEEHIQEEVGLLIDSFRKTNG 15690 15948 GAFIDTTFYLSRTVSNVISSIIFRDRFDYEDKEFLSLL*MMLGSFQFTATSMGQ 16109 17609 LYEMFSSVMKHLSGPQQQAFKELQGLEDFITKKVEHNQRTLDPNSPRDFIDSFLIRMLE 17785 19308 EKKNPNTEFYMKNLVLTTQNLFFAGTETVSTTLRYGFLLLMKHPDIE 19448 19888 AKVHKEIDWVTGRNWQPKYEDRMKMPYAEAVIHEIQRFADMIPMGLARRVTKDTKFRDFLLPK 20076 20678 GTEVFPMLGSVLKDPKFFFNPKDFNPKHFLDDKGQFKKSDAFVPFSIG 20821 Cyp2a22 mouse GenEMBL NW_000308.1|Mm7_WIFeb01_154 Also on NT_039411.1 - strand 93% to Cyp2a12 between 2a5 and 2a12 NW_000308.1 MLGSGLLLVAILVFLSVMVLVSVWQQKIRGKLPPGPIPLPFIGNYLQLNRKDVYSSITQ 392 LQEHYGPVFTIHLGPRRVVVLYGYDAVKEALEDNAEEFSGRGEQATFNTLFKGYG 834 VTFSNGERAKQLRRFSIATLKDFGLGKRGMEERIQEEAGCLIKMLQGTC 1495 GAPIDPTMYLSKTVSNVISSIVFGDRFNYEDKEFLSLLQMMSQMNQFAASPTGQ 1874 LYDMFHSVMKYLPGPQQQIIKDSHKLEDFMIQKVKHNHSTLDPNSPRGFIDSFLIHMQK 3263 EKNFNSEFHMKNLVMTSLNLFFAGSETVSSLLRYGFLLLMKHPDVE 4834 AKVHEEIDRVIGRNRQPQYEDHMKMPYTQAVIHEIQR 5365 FSNFAPLGIPRRITKDTSFRGFFLPK 5443 GTDVFPIMGSLMIDPKFFSSPKDFNPQHFLDDKGQLKKIPAFLPFSI 6101 GKRSCLGYSLGKMQLFLFFTTILQNFRFKFPRKLEDINESPKPEGFTRIIP 7191 KYTMSFVPI* 7221 Cyp2a22-de1b2b mouse GenEMBL NW_011833.1|MmUn_WIFeb01_20427 between 2a22 and 2a5 93% to Cyp2a12-de1b2b old name = Cyp2a23p u in Figure 2B Nelson et al. 
Pharmacogenetics 14, 1-18 (2004)
MLLVAILTCFIAMITMSVLR*RKVLGKIPPGPTPLPFLGNFLELDTKKFYDSFLRV VLGREM IRELYGPVFTVHLGTHSAVVPWGYDVVKEALVDQAEQFSGRGEQAFLDWFFKDYG
CYP2A23 Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 93% to CYP2A13, 92% to CYP2A6 human, possible ortholog of CYP2A13
CYP2A23 Macaca fascicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP2A#1_27B2 98% to 2A23 Macaca mulatta, 8 aa diffs. Note: 2A23 and 2A24 are very similar to 2A6 and 2A13, but I cannot assign orthologs without mapping data.
CYP2A24 Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 94% to CYP2A6, 93% to CYP2A13 human, possible ortholog of CYP2A6
CYP2A24 Macaca fascicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP2A#2_2-G10 98% to 2A24 Macaca mulatta, 8 aa diffs. Note: 2A23 and 2A24 are very similar to 2A6 and 2A13, but I cannot assign orthologs without mapping data.
CYP2A23/24 Macaca fascicularis (cynomolgus monkey) PIR S36874 (13 amino acids) Ohmori,S., Horie,T., Guengerich,F.P., Kiuchi,M. and Kitada,M. Purification and characterization of two forms of hepatic microsomal cytochrome P450 from untreated cynomolgus monkeys. Arch. Biochem. Biophys.
305, 405-413 (1993) Identical to first 13 aa of CYP2A23 or CYP2A24 MLASGLLLVALLA CYP2A25 Canis familiaris (dog) XM_541607.2, NM_001048027 87% to CYP2A13 human There is a second CYP2A in dog that is 91% to CYP2A13 That seq is the probable ortholog of CYP2A13 Dog cluster order 2S(-), 2B(+), 2G(+), 2A25(+), 2A13(-), 2F(+), 2T(+) Note: this seq is the same as Seq 1 sent by Tom Rushmore On 6/28/05 except for a short frameshifted region CYP2A25 Canis familiaris (dog) NW_876270.1:43197750-43203984 Joanna Wilson and students submitted to nomenclature committee Feb. 17, 2009 88% to human 2A13 MVASGILLVALLTCLTVMVLMSVWRQWKLLEKLPPGPTPLPFIGNYLQLNIQQMSDSFMKISKRYGPVFTIHLGP RRVVVLCGYEAVKEALVDQAEEFSGRGAQATFDTLFKGYGVTFSNGERAKQLRRFSITTLRDFGVGKRGIEERIQ EEAGFLIEALRGTRGAFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSMGQLCEMFHS VIKYLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMQEEQNNPNTEFHLKNLVLTTLNLFF AGTETVSTTLRYGFLLLMKHPDVEAKVHEEIDRVIGKNRQPKFEDRAKMPYTEAVIHEIQRFGDIIPLSLARRVI KDTKFREFLLPKGTEVFPMLGSVLRDAKFFSNPQDFHPQHFLDEKGQFKKSDAFVPFSIGKRYCFGEGLARMELF LFLTTILQNFHFKSPQLPQDIDVSPKLVGLATIPRNYTMSFQPR* 2B Subfamily CYP2B1 or 2 rat PIR A92255 (22 amino acids) B92255 (22 amino acids) Botelho, L.H., Ryan, D.E. and Levin, W. Amino acid compositions and partial amino acid sequences of three highly purified forms of liver microsomal cytochrome P-450 from rats treated with polychlorinated biphenyls, phenobarbital, or 3-methylcholanthrene. J. Biol. Chem. 254, 5635-5640 (1979) CYP2B1 or 2 rat PIR A60822 (20 amino acids) Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and Oesch, F. Effect of nutritional imbalances on cytochrome P-450 isozymes in rat liver. Biochem. Pharmacol. 37, 3245-3249 (1988) CYP2B2 rat GenEMBL S51970 (2946bp) Hoffmann,M., Mager,W.H., Scholte,B.J., Civil,A. and Planta,R.J. Analysis of the promoter of the cytochrome P-450 2B2 gene in the rat. Gene Expr. 2, 353-363 (1992) promoter region, no coding sequence CYP2B2 rat GenEMBL L28169 (1401bp) Shephard,E.E.A. 
unpublished (1993) promoter region CYP2B2 rat GenEMBL I00525 (427bp) White,P.C., Dupont,B. and New,M.I. Genetic probe used in the detection of adrenal hyperplasia Patent: US 4720454-A 3 19-JAN-1988 Includes I-helix region CYP2B3 rat GenEMBL U16209 to U16214 Jean,A., Reiss,A., Desrochers,M., Dubois,S., Trottier,E., Trottier,Y., Wirtanen,L., Adesnik,M., Waxman,D.J. and Anderson,A. Rat liver cytochrome P450 2B3: structure of the CYP2B3 gene and immunological identification of a constitutive P450 2B3-like protein in rat liver. DNA Cell Biol. 13, 781-792 (1994) CYP2B3-se1[9] rat exon 9 100% match to 2B3 chr1 (+)frag a in fig below 81263180 GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR* 81263362 rat, mouse and human 2ABFGST clusters CYP2B3-se2[1] rat duplicate exon 1 100% match Chr1 (-)frag b in fig below 81308557 MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ 81308387 rat, mouse and human 2ABFGST clusters CYP2B4 rabbit GenEMBL L10912 (2026bp) Ryan,R., Grimm,S.W., Kedzie,K.M., Halpert,J.R. and Philpot,R.M. Expression and induction of cytochromes P450 2B and P450 4B, identification of P450 2B-Bx, and functional comparison of four highly related forms of P450 2B. unpublished (1993) CYP2B4 rabbit GenEMBL S64259 (2028bp) PIR S35666 (491 amino acids) Ryan,R., Grimm,S.W., Kedzie,K.M., Halpert,J.R. and Philpot,R.M. Cloning, sequencing, and functional studies of phenobarbital-inducible forms of cytochrome P450 2B and 4B expressed in rabbit kidney Arch. Biochem. Biophys. 304, 454-463 (1993) CYP2B4 rabbit Swiss P00177 PIR S31277 (491 amino acids) S31278 (491 amino acids) PIR S31279 (491 amino acids) Gasser R., Negishi M., Philpot R.M. Primary structures of multiple forms of cytochrome P-450 isozyme 2 derived from rabbit pulmonary and hepatic cDNAs. Mol. Pharmacol. 32, 22-30 (1988) CYP2B5 rabbit CYP2B6 human PIR S04579 (139 amino acids) PIR S04580 (170 amino acids) Miles, J.S.,Spurr, N.K., Gough, A.C., Jowett,T., McLaren, A.W., Brook,J.D. and Wolf, C.R. 
A novel human cytochrome P450 gene (P450IIB): chromosomal localization and evidence for alternative splicing. Nuc. Acids Res. 16, 5783-5795 (1988) CYP2B6 human GenEMBL M29874 Yamano,S., Nhamburo,P.T., Aoyama,T., Meyer,U.A., Inaba,T., Kalow,W., Gelboin,H.V., McBride,O.W. and Gonzalez,F.J. cDNA cloning and sequence and cDNA-directed expression of human P450 IIB1: identification of a normal and two variant cDNAs derived from the CYP2B locus on chromosome 19 and differential expression of the IIB mRNAs in human liver. Biochemistry 28, 7340-7348 (1989) clone name hIIB1 CYP2B6 Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 91% to CYP2B6, probable ortholog of CYP2B6 name changed to reflect orthology formerly CYP2B30 CYP2B6 Macaca fascicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP2B6 3 aa diffs to CYP2B6 Macaca mulatta CYP2B6 Macaca fascicularis (cynomolgus monkey) No accession number Wu Zhicong Submitted to nomenclature committee 10/30/2006 91% to human 2B6, 90% to human 2B7P1 4 amino acid diffs to Yasuhiro Uno's seq CYP2B6 Bos taurus (cow) See cattle page for details MELSMLLLFALLTGLLVLLARGRPKAHGRLPPGPRPLPFLGNLLQMDRKGLLKSFLR FQQKYGDVFTVYLGPRPVVIICGTEAIREALVDQAEVFSGRAKIAVVDPIFQGY GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQDEAQCLVEELRKSQ GALQDPVFYFHSITANIICSIVFGKRFDYRDPEFLRLLELLFQSFVLISSLSSQ LFELYSSFLKYFPGSHRQIYKNLQEINVFIGRSVEQHRETLDPNAPRDFIDCYLLRMEKDKSNPQSQFDHQN LIMSVLSLFFAGTETTSTTLRYGFLLMLKYPHITERIQKEIDQVIGSYR PALDDRAQMPYTDAVIHEIQRFADLIPIGVPHMVTKDTHFRGYILPK GTEVYPVLSSALHESCYFEKPDDFNPDHFLDANGVVKKNDAFMPFSI GKRICLGEGIARIELFLFFTTILQNFSVASPVAPEDIDLTPQESGVGNVPPNYRIQFLPRQRG* CYP2B7P1 human GenEMBL M29873 Yamano,S., Nhamburo,P.T., Aoyama,T., Meyer,U.A., Inaba,T., Kalow,W., Gelboin,H.V., McBride,O.W. and Gonzalez,F.J.
cDNA cloning and sequence and cDNA-directed expression of human P450 IIB1: identification of a normal and two variant cDNAs derived from the CYP2B locus on chromosome 19 and differential expression of the IIB mRNAs in human liver. Biochemistry 28, 7340-7348 (1989) clone name hIIB3 This entry was originally made, then discontinued as 2B7PX because an article by Miles et al. Nuc. Acids Res. 18, 189 (1990) showed evidence of alternative splicing of CYP2B6. I thought that this explained the difference. However, on going back and looking at the sequences and the EST data and mRNAs, there are clearly two different genes in the 2B human subfamily. M29873 has an in-frame stop codon, making it a pseudogene. CYP2B7P Bos taurus (cow) See cattle page for details stop codon same as in human 2B7 PALDDRAQMPYTDTVIHEIQRFADLISIGVSHMDAKDAHF*GYILPK Cyp2b8 rat Cyp2b9 mouse GenEMBL M60267 to M60273, also AH000038 Lakso,M., Masaki,R., Noshiro,M. and Negishi,M. Structures and characterization of sex-specific mouse cytochrome P-450 genes as members within a large family. Duplication boundary and evolution. Eur. J. Biochem. 195, 477-486 (1991) Cyp2b9-de9b mouse GenEMBL XM_145463, XP_145463, NT_039410.1 x in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004) detritus exon 9 between Cyp2a4 and Cyp2b9 old name = Cyp2b25p NT_039410.1 - strand 196560 SGTRICLGEGIARSELFLFFTTILQ 196486 196484 NFSVSSPVAPKDIDITLKESGLAKIPPVYKISFLAH* 196374 Cyp2b10 mouse GenEMBL M21856, PIR A60559 (15 amino acids) Bornheim, L.M. and Correia, M.A. Purification and characterization of a mouse liver cytochrome P-450 induced by cannabidiol. Mol. Pharmacol. 36, 377-383 (1989) Note: the mouse genome has only one sequence for Cyp2b10 and Cyp2b20. They are derived from the same gene. The Cyp2b10 mRNA M21856 appears to contain errors in the sequence. No exact match for it can be found in the mouse genome. This mRNA has an extra exon called exon 8b (27 nucleotides in the heme binding peptide region).
This appears to be an alternative splice variant of this gene. The Cyp2b20 sequence matches the genomic sequence and represents the correct 2b10 sequence. The Cyp2b20 name has been discontinued and Cyp2b10 has been retained since it is the older of the two names. GenEMBL M21856 (sequence Cyp2b10 was based on) MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLLQMDRGGLLKSLIQ LREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVAVVEPTFKEY GVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS QGAPLDPTFLFQCITANVICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQ MFELFSGFLKYFPGAHRQISKNLQELLDYIGHSVERHKATLDPSVPRDFIDIYLLRMEK EKSNQNAEFHHQNLMMSVLSLFFVGTETSSTTLHYGFLLMLKYPHVTEKVQKEIDQVIGS HRLPTLDDRTKMPYSDAVIHEIQRFSDLIPIGVPHRVTKDTLFRGYLLPKNTEVYPIL SSALHDPQYFEQPDSFNPDQFLDANGALKKSEAFLPFST Exon 8b GQIFDQKSV GKRICLGESIARSELFLFFTSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR GenEMBL AK028103 from RIKEN (corrected Cyp2b10/Cyp2b20 sequence) MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLL QMDRGGLLKSFIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVA VVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS QGAPLDPTFLFQCITANIICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMFE LFSGFLKYFPGAHRQISKNLQELLDYIGHSVEKHRATLDPSVPRDFIDIYLLRMEKEK SNQHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIGS HRLPTLDDRTKMPYTDAVIHEIQRFSDLIPIGVPHRVTKDTMFRGYLLPKNTEVYPIL SSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFSTGKRICLGESIARNELFLFF TSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR CYP2B11 Canis familiaris (dog) NW_876270.1: 43114807- Joanna Wilson and students submitted to nomenclature committee Feb. 
17, 2009 78% to human 2B6 MELSVLLLLALLTGLLLLMARGHPKAYGHLPPGPRPLPILGNFLQMDRKGLLKSFLRLQEKYGDVFTVYLGPRRT VMLCGIDAIREALVDNAEAFSGRGKIAVVEPVFQGYGVVFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEA QCLVEELRKTEGVLQDPTFFFHSMTANIICSIVFGKRFGYKDPEFLRLMNLFYVSFALISSFSSQMFELFHSFLK YFPGTHRQVYNNLQEIKAFIARMVEKHRETLDPSAPRDFIDAYLIRMDKEKAEPSSEFHHRNLIDSALSLFFAGT ETTSTTLRYGFLLMLKYPHIAERIYKEIDQVIGPHRLPSLDDRAKMPYTDAVIHEIQRFGDLLPIGVPHMVTKDI CFRGYIIPKGTEVFPILHSALNDPHYFEKPDVFNPDHFLDANGALKKNEAFIPFSIGKRICLGEGIARMELFLFF TTILQNFSVASPMAPEDIDLTPQEIGVGKLPPVYQISFLSR* CYP2B12 rat GenEMBL S48369 X63545 (2528bp) Swiss P33272 (492 amino acids) PIR S27160 (492 amino acids) Friedberg,T., Grassow,M.A., Bartlomowicz-Oesch,B., Siegert,P, Arand,M., Adesnik,M. and Oesch,F. Sequence of a novel cytochrome CYP2B cDNA coding for a protein which is expressed in a sebaceous gland, but not in the liver. Biochem. J. 287, 775-783 (1992) CYP2B12-de9b rat exon 9 Chr1 (-) frag c in fig. below 81829155 GKFICLGEGIG*NESFIFFTGILQNLSLASPVAPENIDLTPIKSGAGKIPSTYQIHILSR 81829012 rat, mouse and human 2ABFGST clusters Cyp2b13 mouse GenEMBL M60352 to M60358, also AH000037, NT_039410.1 Lakso,M., Masaki,R., Noshiro,M. and Negishi,M. Structures and characterization of sex-specific mouse cytochrome P-450 genes as members within a large family. Duplication boundary and evolution. Eur. J. Biochem. 195, 477-486 (1991) Cyp2b13-de1b2b7b mouse GenEMBL NT_039410.1 + strand y in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004) detritus exons 1,2,7 between Cyp2b13 and Cyp2b26-ps 43894 XXXXXXDIFYMGAQPLLVLCGYEV*WEAPVDHSEVFLVYEDKAIIDPSSKKW 44031 ex 1 44377 XXFFVNGKPWNIVN*FLLTTTKDFEWKKRSIDNQIKVETLDLLLEC*KPHGDP 44529 ex 2 48130 LPVFVHWAQKPYTQASIHEIWRYGDFTHIG 48219 ex 7 CYP2B14X rat discontinued number see CYP2B16P CYP2B14P rat GenEMBL U33540 Eric Trottier, Stéphane Dubois, Andréa Jean and Alan Anderson Identification of CYP2B14P and CYP2B16P, two apparent pseudogenes in the rat cytochrome P450 2B (CYP2B) subfamily. 
Biochemical Pharmacology, 52, 963-965 (1996) exon 1, add Chr1 (+) exons 7,8,9 72% to 2B21 to this pseudogene 81706300 MKPNVLLLLAILLSFLLFLVRGHAKVHGHLPPGPRPLPILGNLLQMDRGGLLQSF 81706464 81728276 EKVQKEIGEVTGSHWFPILYSSKIPNTEAVIPEIQR 81728383 81728385 FSDLSSVVLPQRVTKDTFFQGFLLHK 81728462 81728634 NTEVYPILSSVLHDPQ 81728681 81728681 VLEYPVTFNPEHFLDANGALKKNEAFTPFSR 81728773 CYP2B15 rat GenEMBL D17343 to D17349 Nakayama,K., Suwa,Y., Mizukami,Y., Sogawa,K. and Fujii-Kuriyama,Y. Cloning and sequencing of a novel rat cytochrome P450 2B-encoding gene. Gene 136, 333-336 (1993) most similar to 2B12, 89% identical MELGVLLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQLQ EKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGYGVIFANGE RWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYKALLNPTSIFQSIAANIIC SIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQVFELFSGFLKYFPGVHKQISKNLQE ILNYIDHSVEKHRATLDPNTPRDFINTYLLRMEKEKSNHHTEFHHQNLVISVLSLFFTGT ETTSTTLRYSFLIMLKYPHVAEKVQKEIDQVIGSHRLPTLDDRTKMPYTDAVIHEIQRFA DLIPIGLPHRVTNDTMFLGYLLPKNTEVYPILSSALHDPRYFDHPDTFNPEHFLDVNGTL KKSEAFLPFSTGKRICLGEGIAQNELFIFFTAILQNFSLASPVAPEDIDLSPINSGISKI PSPYQIHFLSRCVG CYP2B16P rat GenEMBL U33541 to U33546 Eric Trottier, Stéphane Dubois, Andréa Jean and Alan Anderson Identification of CYP2B14P and CYP2B16P, two apparent pseudogenes in the rat cytochrome P450 2B (CYP2B) subfamily. Biochemical Pharmacology, 52, 963-965 (1996) note: previously called CYP2B14 in 1993 update. This gene has a complete coding sequence but there is a defect in the splice junction in intron 1.
Exon 1 MEPSVLLLLAVLLSFLLLLVRGHAKIHGRLPPGPCPVPLLGNLLQMDRRGLLKSFIQLR Exon 2 EKYGDVFTVHLGLRPVVVLCGTQTIREALVDHAEAFSGRGTIAGLEPVFQDYG Exon 3 IFFSSGEQWKTLRRFSMATMRDFGMRKKSVEERIKEESQCLVEELKKYQG Exon 4 APLDPTFLFQCITSNIICSIVFGECFDYTDHQFLHLLDLMYQTFSLLSSIFSQ Exon 5 VFELFPGVLKYFPGAHRQISRNLHEILDFIGQSVEKHRATLDPNAPRDFIYTYLLHMEK Exon 6 QKSNHYTEFHHWNLLSSVLSLFFAGTETSSTTLRYGFLIMLKYPHI Exon 7 EKVQKEIDCVIGSHRLPTLDDRSKMPYTEAVIHEIQRFSDLAPIGTPHRVIKDTIFRGYLLPK Exon 8 QNTEVFPILSSVLHDPQYFEQPDIFNLQHFLDANGALKIIEAFLPFSTGK Exon 9 TGKRICLGESIARNELFLFFTTILQNFSVSSPVAPKDIDLTPKESGIGRIPQVYQICFLA CYP2B17/2B6 Cercopithecus aethiops (African green monkey) PIR JT0676 (491 amino acids) Ohmori, S.; Sakamoto, Y.; Nakasa, H.; Horie, T.; Saito, K.; Kitada, M. Nucleotide and amino acid sequences of monkey P450 2B gene subfamily. Unpublished 91% to human 2B6 probable ortholog CYP2B18 guinea pig no accession number (437 amino acids) Oguri, K. submitted to nomenclature committee Cyp2b19 mouse GenEMBL AF047529, also NT_039410.1 + strand Diane Keeney, D.S. (1998) The Novel Skin-Specific Cytochrome P450 Cyp2b19 Maps to Proximal Chromosome 7 in the Mouse, near a Cluster of Cyp2 Family Genes. Genomics 53, 417-419. Between 2b23 and 2g1 Cyp2b19-de7b8b9b mouse GenEMBL NT_039410.1 old name = Cyp2b24p v in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004) detritus exons 7,8,9 between 2b19 and 2b23 NT_039410.1 + strand 695673 EKVQKETDQVIGSHQLPTLDDRTKMPYTDTVIHEIQRFSDLAAIDLPHRVTIHTLSQVYLLPK 695861 696036 NTEVYPILSSVLLDP 696080 696083 QYFEQLDCFNPEHFLDANGTLKKSEAFLPFST 696178 702801 GKHVCLGKGIAHNELFLFFPTILQNFPVSVPLAPKDIDITPKESGTGKIPQCTRSAS 702971 Cyp2b20X mouse GenEMBL X99715(1416bp) Damon,M., Fautrel,A., Marc,N., Guillouzo,A. and Corcos,L. Isolation of a new mouse cDNA clone: hybrid form of cytochrome P450 2b10 and NADPH-cytochrome P450 oxidoreductase Biochem. Biophys. Res. Commun. 226 (3), 900-905 (1996) This clone has a part of the NADPH cytochrome P450 reductase on the opposite strand at the end of the P450 sequence. 
note: this sequence was accidentally given the name Cyp2b19. That name is assigned to a mouse keratinocyte P450 cloned by Diane Keeney. The reductase sequence at the end of this gene seems to be a cloning error, because it cannot be found in the genomic DNA sequence. Cyp2b20 has been merged with Cyp2b10. Though the Cyp2b20 sequence is more like the genomic sequence, the Cyp2b10 name has precedence. GenEMBL AF128849 Marc,N., Damon,M., Fautrel,A., Guillouzo,A. and Corcos,L. Isolation of a cyp2b10-like cDNA and of a clone derived from a cyp2b10-like pseudogene Biochem. Biophys. Res. Commun. 258 (1), 11-16 (1999) This sequence is 100% identical to Cyp2b20 and 97% identical to Cyp2b10 MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLL QMDRGGLLKSFIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVA VVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS QGAPLDPTFLFQCITANIICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMFE LFSGFLKYFPGAHRQISKNLQELLDYIGHSVEKHRATLDPSVPRDFIDIYLLRMEKEK SNQHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIGS HRLPTLDDRTKMPYTDAVIHEIQRFSDLIPIGVPHRVTKDTMFRGYLLPKNTEVYPIL SSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFSTGKRICLGESIARNELFLFF TSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR Cyp2b20X mouse GenEMBL AK028103 100% identical to AF128849 Now renamed Cyp2b10 (the corrected sequence) MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLL QMDRGGLLKSFIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVA VVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS QGAPLDPTFLFQCITANIICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMFE LFSGFLKYFPGAHRQISKNLQELLDYIGHSVEKHRATLDPSVPRDFIDIYLLRMEKEK SNQHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIGS HRLPTLDDRTKMPYTDAVIHEIQRFSDLIPIGVPHRVTKDTMFRGYLLPKNTEVYPIL SSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFSTGKRICLGESIARNELFLFF TSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR Cyp2b20p1X mouse GenEMBL AF129405 Marc,N., Damon,M., Fautrel,A., Guillouzo,A. and Corcos,L. Isolation of a cyp2b10-like cDNA and of a clone derived from a cyp2b10-like pseudogene Biochem. Biophys. Res. 
Commun. 258 (1), 11-16 (1999) This sequence is 100% identical to Cyp2b20 from amino acid 64 on This seq is partial, starting at amino acid 60 with a stop codon at amino acid 63. Full length cDNAs AK028103 and AF128849 do not have this stop codon and it is not found in genomic DNA. This probably represents a sequence derived from the Cyp2b10 gene. CYP2B21 rat GenEMBL AF159245 Nicola Brookman Amissah and Peter Swann CYP2B22 Sus scrofa (pig) GenEMBL AB052256 Misaki Kojima Submitted to nomenclature committee Oct. 27, 2000 78% to rabbit CYP2B4 clone name c780 Cyp2b23 mouse NW_000307 618973-640139, also XM_145466 Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman Next to Cyp2b19-de7b8b9b and 2b19 on chr 7 Cyp2b24pX mouse NW_000307 692575-699876 Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman Next to 2b19 on chr 7 Renamed Cyp2b19-de7b8b9b Cyp2b25pX mouse NW_000307 195792-195980 Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman Next to 2b9 on chr 7 Renamed Cyp2b9-de9b Cyp2b26-ps mouse GenEMBL AC087157 22100-26200 Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman Between 2b9 and 2b13 on chr 7 Cyp2b27-ps mouse NW_000303 2122792-2130037 Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman Between 2b13 and 2b28-ps on chr 7 Cyp2b28-ps mouse NW_000303 2064442-2094900 Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman Between 2b27-ps and 2b10 on chr 7 CYP2B29 hamster No accession number Pedro Dominguez Submitted to nomenclature committee Dec. 17, 2002 77% to cyp2b10 CYP2B30X Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. 
Submitted to nomenclature committee 4/22/2004 91% to CYP2B6, probable ortholog of CYP2B6 name changed to reflect orthology = CYP2B6 CYP2B31 rat 86% to 2b19 possible ortholog 81918041 MELGVFLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81918214 81919826 LQEKYGDVFTVHLGPRPVVILCGTDTMREALVDQAEAFSGRGTVAVLHPVVQGY 81919987 81920130 GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK 81920279 81922129 GALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ 81922290 81923031 VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFIDTYLLHMEK 81923207 81923977 EKSNHHTEFHHQNLVISVLSLFFAGTETTSTTLRYSFLIMLKYPHVA 81924117 81926113 EKVQKEIDQVISSHRLPTLDDRIKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK 81926301 81926476 NTEVYPILSSALHDPRYFDHPDTFNPEHFLDANGTLKKSEAFLPFST 81926616 81930286 GKRTCLGEGIARNELFIFFTALLQNFSLASPVAPEDIDLTPINSGAGKIPSPYQINFLSR 81930465 CYP2B32P rat pseudogene partial Chr1 (+) 81806528 VLLLLTLIVGFLLFLVSQSQPKTHGHLPPGLCPLPFLGNLLQIKRRGLLNSFMQ 81806689 81808348 AQEKYGDVLTVHPGPRPVVRLCGTDTIREFLFDQAGTFSGQGTVAVLNPVVHGY 81808509 exon 3 missing 81809871 GVPLIPTSFFQRIAANIICSIVFGECFDYKDHQFLHLLDLIYQTFALMAPCPARS 81810035 81810759 VFQLFSGFLKYFPGVHKQISKNLQEILNYIGHSVEKHMATLDPSAPRDFINTYLLHMEN 81810935 81811666 EKSNHHTEFHHQTSVLSHFFDGTETTSTTLCCSFLIMLKYHHVK 81811797 CYP2B guinea pig Swiss P34033 (20 amino acids) Narimatsu S., Akutsu Y., Matsunaga T., Watanabe K., Yamamoto I., Yoshimura H. Purification of a cytochrome P450 isozyme belonging to a subfamily of P450IIB from liver microsomes of guinea pigs. Biochem. Biophys. Res. Commun. 172, 607-613 (1990) PIR S28205 (31 amino acids) Yamada, H., Kaneko, H., Takeuchi, K., Oguri, K. and Yoshimura,H. Tissue-specific expression, induction, and inhibition through metabolic intermediate-complex formation of guinea pig cytochrome P450 belonging to the CYP2B subfamily. Arch. Biochem. Biophys. 299, 248-254 (1992) Note: These two fragments are identical over the first 20 amino acids. Cyp2b mouse PIR A21630 (25 amino acids) Stupans, I., Ikeda, T., Kessler, D.J. 
and Nebert, D.W. Characterization of a cDNA clone for mouse phenobarbital-inducible cytochrome p-450b. DNA 3, 129-137 (1984) This fragment has one amino acid difference with 2b-9, 2b-10 and 2b-13 Cyp2b mouse GenEMBL M60359 (997bp) Lakso,M., Masaki,R., Noshiro,M. and Negishi,M. Structures and characterization of sex-specific mouse cytochrome P-450 genes as members within a large family. Duplication boundary and evolution. Eur. J. Biochem. 195, 477-486 (1991) N-terminal 57 amino acid fragment very similar to Cyp2b-13. CYP2b scup (fish Stenotomus chrysops) N-terminal fragment (20 amino acids) Klotz et al. Arch. Biochem. Biophys. 249, 326-338 (1986) 2C Subfamily CYP2C1 rabbit GenEMBL D26152 (1695bp) Noshiro,M., Ishida, H. and Okuda, K. unpublished (1993) CYP2C2 rabbit CYP2C3 rabbit CYP2C4 rabbit CYP2C5 rabbit GenEMBL M55664 (2340bp) Pendurthi,U.R., Lamb,J.G., Nguyen,N., Johnson,E.F. and Tukey,R.H. Characterization of the CYP2C5 gene in 21L III/J rabbits: Allelic variations affects the expression of P450IIC5 J. Biol. Chem. 265, 14662-14668 (1990) CYP2C5 rabbit PIR S16715 (143 amino acids) PIR S20227 (145 amino acids) Zhao, J., Leighton, J.K. and Kemper, B. Characterization of rabbit cytochrome P450IIC4 cDNA and induction by phenobarbital of related hepatic mRNA levels. Biochem. Biophys. Res. Commun. 146, 224-231 (1987) CYP2C6 rat PIR A41425 (17 amino acids) Imaoka, S., Kamataki, T. and Funae, Y. Purification and characterization of six cytochromes P-450 from hepatic microsomes of immature female rats. J. Biochem. 
102, 843-851 (1987) rat 2C cluster in chromosome order CYP2C6v1_v1-de1b2b3b4b5b rat upstream pseudogene frag o, 96% identical to seq c 93% identical to seq upstream of CYP2C6v2 allele (temp name = CYP2Cnewb) 243935799 MDLVMLLVLTLSCLIFLSIWRQSSGRGKLP 243935888 243935888 SGPTPLPIIGNFFHLDLKNITQSLTN 243935965 243937699 FSKVNGSVFTLYFGMKPIVILHGYEAIKEGLIDHGEEFTERGSFPVAEKINKGL 243937860 243938035 GIAFSHGNRWKEIRRFTLMTLQNLGMGKKSIEDRVQEESRCLV 243938163 243939079 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLVEKLNENIKIVSSPWI* 243939231 243940291 FCSSFPVFIDYCPGSHMTLAKNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLINWKQ 243940467 CYP2C6v1_v1 rat GenEMBL M13711 two aa changes to match many ESTs (lower case mi) due to frameshift 97% to 2C77 and 2C6v2 243955584 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS 243955751 243964779 FSKVYGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKD 243964937 243965112 LGIVFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTN 243965264 243966104 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 243966265 243967336 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 243967512 243984646 ENHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKCPEVT 243984786 243989157 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAmiHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 243989345 243990948 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 243991088 243992245 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL 243992424 CYP2C6v2-de1b2b3b4b4c5b rat upstream pseudogene EST CK224599.1 = 100% match with 4 frameshifts) so this is a real gene clone_lib="RALIUNN03 Sprague-Dawley rat female liver The CYP2C6_v1 sequence is also seen in this same mRNA library This GNOMON prediction adds two upstream exons that do not belong to this gene 58596732 MDLVMLLVLTLSCLILLSIWRQSSGRGKHP 58596643 exon 1 frameshift 58596643 SGPTPLPIIGNFFHLDLNNITQSLTS (0) 58596566 exon 1 58594823 FSKVNGSVFTLYFGMKLIVILHGYAATKEGLIDHGEEFTKRGSFPVAEKINKGL (1) exon 2 58594662 58594487 GIAFSHGNRWKEIRRFTLMTLQNLGMGKESIEDRVQEETQCLV*ELRKTN (1) exon 3 58594338 
58593451 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58593296 58592013 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58591858 58590797 FCSSFPVFIDYCLGSHMTLA 58590738 58590736 NVYHTRNYILKKIKEHQESLDVTNPHDFIDYDLIKWKQ 58590620 CYP2C6v2 rat allele not in figure, 13 aa diffs to CYP2C6_v1 XM_215255 NW_047916 we are assigning this allele status but it may be a separate gene (temp name = CYP2Cnewb) 58578624 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 58578457 58576741 FSKVYGPVFTLYFGLKPTVILHGYEAVKEALIDHGEEFAERGSFPVVEKINKDL (1) 58576583 58576405 GIAFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDHVQEEARCLVEELRKTN 58576256 58575415 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKVLSSPWTQ 58575254 58574189 FCSFFPVLIDYCPGSHTTLAKNIYYIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 58574013 58554666 ESHNPHLEFTLENLSVTVTDLFGAGTETTSTTLRYALLLLLKYPEVT 58554526 58534931 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 58534743 58533131 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 58532991 58531833 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 58531654 CYP2C6P rat GenEMBL M18336 J03509 M18774 an alternate splice version of 2C6 exon 8 is skipped and replaced by a cryptic exon just past the true exon 8 The GT boundary of the true exon 8 are the first two nucleotides of CYP2C6_v3 Cryptic exon 8 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTSFSKV 200 201 YGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDLGIVFSHGNRW 380 381 KEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTNGSPCDPTFILGCAPCNVICS 560 561 IIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQFCSFFPVLIDYCPGSHTTLAKNVYHI 740 741 RNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQENHNPHSEFTLENLSITVTDLFGAGTE 920 921 TTSTTLRYALLLLLKCPEVTAKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFID 1100 1101 LIPTNLPHAVTCDIKFRNYLIPK 1169 CYP2C6_v2 CK224594.1 CK224593.1 note: the _v2 means alternative splice version 2 CYP2C6_v3 CK224595.1 CK224596.1 (3 nuc shorter at the joint uses the second AG) Beginning of exon 7 AGCTAAAG TCCAGGAAGA GATTGATCGT 243989183 GTGGTTGGCA 
AACATCGCAG CCCTTGCATG CAGGACAGGA GCCGCATGCC CTACACAGAT 243989243 GCCATGATTC ATGAGGTCCA GAGGTTCATT GACCTCATTC CTACCAACCT GCCACATGCG 243989303 GTGACCTGTG ACATTAAGTT CAGGAACTAC CTAATACCCA AG GT end of exon 7 Beginning of cryptic exon out of frame agcaggtaa tagaaactca 243991103 tttccatggt tccagtgaca tgcagaaccg tggggactta gagtgtgact ctacatgtgc 243991163 tgatagcttg catctgcatg ataaggagca taattttcat tgtgtatgca ctgtcctgga 243991223 tatgaccacc ttctttatca gggt end of cryptic exon normal exon 9 1328 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL rat 2C cluster in chromosome order see this link for color coded figure of intron boundaries >interval between 2C6 and 2C77 CYP2C6-se1[1:2:3:2:3] rat frag n exons 1,2,3 2C6 like pseudogene plus strand exon 2,3 100% to seq m 244044941 MDHTTGTYTLSLILLSL*RQSSGRGKIPPGPTPLPIIDNLLQLDIKNVTQYLAN (0) 244045102 244050420 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244050581 244050793 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244050873 frag m Exons 2,3 2C6 like pseudogene 100% to seq n 244052306 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244052467 244052679 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244052759 CYP2C7 rat GenEMBL X12595 (1179bp) Stroem,A., Nilsson,A.G. and Zaphiropoulos,P. 5' flanking sequence of the gene for rat cytochrome p-450f Nucleic Acids Res. 0, 0-0 (1988) rat 2C cluster in chromosome order CYP2C7 rat PIR S24582 (66 amino acids) Stroem, A. unpublished rat 2C cluster in chromosome order CYP2C7 rat PIR A60563 (56 amino acids) Westin, S., Stroem, A., Gustafsson, J.A., and Zaphiropoulos, P.G. Growth hormone regulation of the cytochrome P-450IIC subfamily in the rat: inductive, repressive, and transcriptional effects on P-450f (IIC7) and P-450-PB1 (IIC6) gene expression. Mol. Pharmacol. 38, 192-197 (1990) rat 2C cluster in chromosome order CYP2C7 rat PIR A27425 (23 amino acids) Favreau, L.V., Malchoff, D.M., Mole, J.E. and Schenkman, J.B. 
Responses to insulin by two forms of rat hepatic microsomal cytochrome P-450 that undergo major (RLM6) and minor (RLM5b) elevations in diabetes. J. Biol. Chem. 262, 14319-14326 (1987) rat 2C cluster in chromosome order CYP2C7 rat GenEMBL M18335 exons 1,2,3 and 6 are in sequence gaps 93% to 2C7 variant and 2C81 MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMNENVTKGF GIVFSNGNRWKEMRRFTIMNFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 243849546 GSPCDPSLILNCAPCNVICSITFQNYFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243849385 243847566 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 243847390 243829444 GSPCDPSLILNCAPCNVICSITFQNHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243829283 this duplicate exon 4 is not in the right sequence order ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 243803857 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFINFVPTNLPHAVTCDIKFRNYLIPK 243803669 243800623 GTKVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 243800483 243799465 GKRACVGEGLARMQLFLFLTTILQNFNLKSLVHPKDIDTMPVLNGFASLPPTYQLCFIPS 243799286 CYP2C7-de7b rat frag r Exon 7 (+) 100% to seq a CYP2C81-de7b 243792966 RVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 243793151 CYP2C7 rat variant unmapped 93% to 2C7 88% to 2C81 3463873 MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK 3464040 3479907 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF 3480068 3480234 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 3480383 3489182 GSPCDPSLILNCAPCNVICSITFQSHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 3489343 3491162 VCNSFPSLVDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 3491338 3505354 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 3505494 3406504 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 3406692 3408304 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 3408444 3409602 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFPGFASLPPFYELCFIPS 3409778 CYP2C7-se1[6:7:9] rat frag j exons 6,7,9 (6,7 and 9 have 1 aa diff to 2C7) 244103321 
ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 244103461 244120225 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFIDFVPTNLPHAVTCDIKFRNYLIPK 244120413 244124319 FLXXXLQNFNLKSLXHPKDIDTMPVLNXXASLPPTYQLCFIPS 244124447 CYP2C7-se2[2:3] rat frag k exons 2,3 = 100% to 2C7 variant, 2 aa diffs to 2C7 exons 2,3 244064158 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF 244064319 244064485 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 244064634 CYP2C7-se3[8] rat frag t Exon 8 minus strand 82% to 2C7 243749788 TIVIT*LTSVLHDSKKFPNPEMLDSGHFLDENGNFKKSEYFMPFSA 243749651 CYP2C7-se4[8:9] rat frag u Exon 8 minus strand exon 8 = 87% to frag 2, 8+9 = 63% to 2C7 243726168 GVMVITSLSSALHDNKEFPNPKRFDPG*FLDRNGNFKKTDYFILFSA 243726028 Exon 9 minus strand 60% to 2C7 243723025 CVGEGLTPIELFLFLTRILQNFNLKHLTHTEAVDTTPVLSRLTSVSPALKLFFIP 243722861 CYP2C8 human PIR S15075 (56 amino acids) Ged, C. and Beaune, P. Isolation of the human cytochrome P-450 IIC8 gene: multiple glucocorticoid responsive elements in the 5' region. Biochim. Biophys. Acta 1088, 433-435 (1991) CYP2C8 human GenEMBL Y00498 (1866bp) Kimura,S., Pastewka,J., Gelboin,H.V. and Gonzalez,J. cDNA and amino acid sequences of two members of the human P450IIC gene subfamily Nucleic Acids Res. 15, 10053-10054 (1987) CYP2C8 human PIR S16902 (349 amino acids) Shephard, E.A., Phillips, I.R., Santisteban, I., Palmer, C.N.A. and Povey, S. Cloning, expression and chromosomal localization of a member of the human cytochrome P450IIC gene sub-family. Ann. Hum. Genet. 53, 23-31 (1989) CYP2C8 human no accession number D.C. Zeldin, R.N. Dubois, J.R. Falck, and J.H. Capdevila. Molecular Cloning, Expression, and Characterization of an Endogenous Human Cytochrome P450 Arachidonic Acid Epoxygenase Isoform. Arch. Biochem. Biophys. 
322: 76-86 (1995) CYP2C8-de6b human GenEMBL NT_008769.11|Hs10_8926 detritus exon 6 between 2C9 and 2C8 old name CYP2C60P 8439669 EKDNQPLKFTIENLVGNVPDLFVAGTEMTSTTLRYGLLLLLKHPELT 8439809 CYP2C8 Cercopithecus aethiops (African green monkey) DQ022200.1 Booth-Genthe,C.L., Peteraf,S. and Tang,C. Merck Research Laboratories 92% to human CYP2C8, 78% to human CYP2C19 CYP2C8/2C20 Macaca fascicularis (cynomolgus monkey) GenEMBL S53046 (1901bp) Swiss P33262 (490 amino acids) PIR S28166 (490 amino acids) Komori,M., Kikuchi,O., Sakuma,T., Funaki,J., Kitada,M. and Kamataki,T. Molecular cloning of monkey liver cytochrome P-450 cDNAs: similarity of the primary sequences to human cytochromes P-450. Biochim. Biophys. Acta 1171, 141-146 (1992) Note: As comparisons between primates begin to involve large scale sequencing, the CYP2C20 genes assigned earlier to two Macaca species appear to be orthologous to human CYP2C8. I have acknowledged this by using the 2C8 name. Since both names will be in the literature, both will be kept, but 2C8 is now the preferred name. CYP2C8/2C20 Macaca fascicularis (cynomolgus monkey) PIR A60466 (22 amino acids) Ohi, H., Toratani, S., Komori, M., Miura, T., Kitada, M. and Kamataki, T. Comparative study of cytochrome P-450 in liver microsomes. A form of monkey cytochrome P-450, P-450-MK1, immunochemically cross-reactive with antibodies to rat P-450-male. Biochem. Pharmacol. 38, 361-365 (1989) Note: As comparisons between primates begin to involve large scale sequencing, the CYP2C20 genes assigned earlier to two Macaca species appear to be orthologous to human CYP2C8. I have acknowledged this by using the 2C8 name. Since both names will be in the literature, both will be kept, but 2C8 is now the preferred name. CYP2C8/2C20 Macaca mulatta (rhesus monkey) name change from CYP2C74 No accession number Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 91% to CYP2C8, 78% to CYP2C19, probable ortholog of CYP2C8 formerly CYP2C74.
There are only 3 amino acid differences to Macaca fasicularis (cynomolgus monkey) GenEMBL S53046 Since this is the clear ortholog of that earlier sequence the name has been changed to reflect the orthology. Note: As comparisons between primates begin to involve large scale sequencing, the CYP2C20 genes assigned earlier to two Macaca species appear to be orthologous to human CYP2C8. I have acknowledged this by using the 2C8 name. Since both names will be in the literature, both will be kept, but 2C8 is now the preferred name. CYP2C8 Callithrix jacchus (white-tufted-ear marmoset) GenEMBL AB242600, release date 2006-11-19 Narimatsu, S., Torigoe, F.,Hanioka, N. and Miyata, A. 88% to 2C8 of Cercopithecus aethiops, 87% to 2C8 human 78% to 2C9 human, 77% to 2C18, 77% to 2C19 CYP2C9 human GenEMBL S46963 (1814bp) PIR A48390 (477 amino acids) B48390 (475 amino acids) Ohgiya,S., Komori,M., Ohi,H., Shiramatsu,K., Shinriki,N. and Kamataki,T. Six-base deletion occurring in messages of human cytochrome P-450 in the CYP2C subfamily results in reduction of tolbutamide hydroxylase activity. Biochem. Int. 27, 1073-1081 (1992) CYP2C9 human GenEMBL L16877 to L16883 Goldstein,J.A., Raucy,J.L., Blaisdell,J.A., Faletto,M.B. and Romkes,M. Cloning and expression of complementary DNAs for multiple members of the human cytochrome P450IIC subfamily. Biochemistry 30, 3247-3255 (1991) de Morais,S.M., Schweikl,H., Blaisdell,J.A. and Goldstein,J.A. Gene structure and upstream regulatory regions of human CYP2C9 and CYP2C18. Biochem. Biophys. Res. Commun. 194, 194-201 (1993) CYP2C9 human PIR B61265 (225 amino acids) Srivastava, P.K., Yun, C.H., Beaune, P.H., Ged, C. and Guengerich, F.P. Separation of human liver microsomal tolbutamide hydroxylase and (S)-mephenytoin 4'-hydroxylase cytochrome P-450 enzymes. Mol. Pharmacol. 40, 69-79 (1991) 2C10 has D at position 417 while 2C9 has G. This sequence does not include position 417. 
The only other amino acid difference between 2C9 and 2C10 is at position 358 where 2C9 has Y and 2C10 has C. This sequence has Y at 358. CYP2C9 human PIR S26634 (29 amino acids) PIR S23777 (25 amino acids) Shimada, T., Misono, K.S. and Guengerich, F.P. Human liver microsomal cytochrome P-450 mephenytoin 4-hydroxylase, a prototype of genetic polymorphism in oxidative drug metabolism. J. Biol. Chem. 261, 909-921 (1986) CYP2C9 human PIR S39377 (20 amino acids) Sandhu, P., Baba, T. and Guengerich, F.P. Expression of modified cytochrome P450 2C10 (2C9) in Escherichia coli, purification, and reconstitution of catalytic activity. Arch. Biochem. Biophys. 306, 443-450 (1993) CYP2C9-de1b human GenEMBL NT_008769.11|Hs10_8926 same as AL133513.12, might work for alt splice detritus exon 1 32kb upstream of 2C9 8335895 MDPAVALVLCLSCLFLLSLWRQSSGRGRLLFGPTPLLIIGNILQLDVKDMSKSLTNVSMLYAPL 8336086 CYP2C9-de2c3c human GenEMBL NT_008769.11|Hs10_8926 detritus exons 2,3 between 2C9 and 2C8 old name CYP2C59P 8437561 LSQFSKVYVPVFTVYFDIKLVLELHGYEVVKEALIDHGEEFSGKGIFPVSKKS**G 8437394 8437211 FRIIFSNGKRCKDIWLFLLMTLWNCRMVKRS 8437119 8437115 MEKHVQGEAQCLRQELRRTK 8437058 CYP2C9 Macaca fasicularis (cynomolgus monkey) No accession Wu Zhicong Submitted to nomenclature committee 10/30/2006 93% to human 2C9, 91% to 2C19, 81% to 2C18, 76% to 2C8 CYP2C10X human PIR A61265 (79 amino acids) Srivastava, P.K., Yun, C.H., Beaune, P.H., Ged, C. and Guengerich, F.P. Separation of human liver microsomal tolbutamide hydroxylase and (S)-mephenytoin 4'-hydroxylase cytochrome P-450 enzymes. Mol. Pharmacol. 40, 69-79 (1991) 2C10 has D at position 417 while 2C9 has G. This sequence shows the D at position 417. The only other amino acid difference between 2C9 and 2C10 is at position 358 where 2C9 has Y and 2C10 has C. This sequence does not include the 358 region. The 2C10 gene is in some doubt. Others have searched 100 samples looking for it and have not found it. This gene may not exist. 
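The 2C9/2C10 notes above describe a simple test: look at positions 358 and 417 if the fragment covers them. The sketch below illustrates that test only. The function name `classify_fragment` and the `start` argument (the 1-based position of the fragment's first residue in the full-length protein, assumed known from an alignment) are hypothetical, and the example fragments are synthetic placeholders, not PIR sequences.

```python
# Diagnostic residues as stated in the entries above:
#   position 358: 2C9 has Y, 2C10 has C
#   position 417: 2C9 has G, 2C10 has D
DIAGNOSTIC = {
    358: {"Y": "2C9", "C": "2C10"},
    417: {"G": "2C9", "D": "2C10"},
}


def classify_fragment(fragment, start):
    """Report which form each covered diagnostic position matches.

    fragment -- one-letter amino acid string
    start    -- 1-based position of fragment[0] in the full-length protein
                (an assumption: the caller must supply this)
    Positions outside the fragment are simply absent from the result.
    """
    calls = {}
    for pos, table in DIAGNOSTIC.items():
        idx = pos - start  # 0-based index of this position in the fragment
        if 0 <= idx < len(fragment):
            calls[pos] = table.get(fragment[idx], "neither")
    return calls


# Synthetic fragments (placeholders only):
print(classify_fragment("AAAAAAAAYA", 350))  # covers 358 only
print(classify_fragment("AAAAAAADAA", 410))  # covers 417 only
```

A fragment that covers neither position returns an empty result. This corresponds to entries noted above as not including the 358 or 417 region.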
CYP2C11 rat GenEMBL S68251 (139bp) Habib,S.L., Srikanth,N.S., Scappaticci,F.A., Faletto,M.B., Maccubbin,A., Farber,E., Ghoshal,A.K. and Gurtoo,H.L. Altered expression of cytochrome P450 mRNA during chemical-induced hepatocarcinogenesis and following partial hepatectomy. Toxicol. Appl. Pharmacol. 124, 139-148 (1994) rat 2C cluster in chromosome order CYP2C11 rat PIR A60782 (500 amino acids) Stroem, A., Mode, A., Zaphiropoulos, P., Nilsson, A.G., Morgan, E., Gustafsson, J.A. Cloning and pretranslational hormonal regulation of testosterone 16alpha-hydroxylase (P-450-16alpha) in male rat liver. Acta Endocrinol. 118, 314-320 (1988) rat 2C cluster in chromosome order CYP2C11 rat PIR A60783 (500 amino acids) Zaphiropoulos, P.G., Mode, A., Stroem, A., Husman, B., Andersson, G., Gustafsson, J.A. Sequence and regulation of two growth-hormone-controlled, sex-specific isozymes of cytochrome P-450 in rat liver, P-450-15beta and P-450-16alpha. Acta Med. Scand. Suppl. 723, 161-167 (1988) rat 2C cluster in chromosome order CYP2C11 rat GenEMBL X79081 (2140bp) PIR S44310 (56 amino acids) Strom,A., Eguchi,H., Mode,A., Tollet,P., Stromstedt,P.E. and Gustafson,J. Characterization of the proximal promoter and two silencer elements in the CYP2C gene expressed in rat liver. DNA Cell Biol. 13, 805-819 (1994) rat 2C cluster in chromosome order CYP2C11 rat PIR S26818 (500 amino acids) Matsumoto, T., Emi, Y., Kawabata, S. and Omura, T. J. Biochem. (1986) 100, 1359-1371 Purification and characterization of three male-specific and one female-specific forms of cytochrome P-450 from rat liver microsomes. rat 2C cluster in chromosome order CYP2C11 rat GenEMBL U33173 (1856bp) Yoshioka,H., Morohashi,K., Sogawa,K., Miyata,T., Kawajiri,K., Hirose,T., Inayama,S., Fujii-Kuriyama,Y. and Omura,T. Structural analysis and specific expression of microsomal cytochrome P-450(M-1) mRNA in male rat livers. J. Biol. Chem. 262 (4), 1706-1711 (1987) Erratum: [J Biol Chem 1987 Jun 15;262(17):8438] Biagini,C.
and Celier,C. cDNA-directed expression of two allelic variants of cytochrome P450 2C11 using COS1 and SF21 insect cells. Arch. Biochem. Biophys. 326 (2), 298-305 (1996) rat 2C cluster in chromosome order CYP2C11 rat GenEMBL J02657 72% to CYP2C6_v1 243377899 MDPVLVLVLTLSSLLLLSLWRQSFGRGKLPPGPTPLPIIGNTLQIYMKDIGQSIKK 243378066 243379842 FSKVYGPIFTLYLGMKPFVVLHGYEAVKEALVDLGEEFSGRGSFPVSERVNKGL 243380003 243380160 GVIFSNGMQWKEIRRFSIMTLRTFGMGKRTIEDRIQEEAQCLVEELRKSK 243380309 GAPFDPTFILGCAPCNVICSIIFQNRFDYKDPTFLNLMHRFNENFRLFSSPWLQVCNT FPAIIDYFPGSHNQVLKNFFYIKNYVLEKVKEHQESLDKDNPRDFIDCFLNKMEQEKH NPQSEFTLESLVATVTDMFGAGTETTSTTLRYGLLLLLKHVDVTAKVQEEIERVIGRN RSPCMKDRSQMPYTDAVVHEIQRYIDLVPTNLPHLVTRDIKFRNYFIPKGTNVIVSLS SILHDDKEFPNPEKFDPGHFLDERGNFKKSDYFMPFSA 243416959 GKRICAGEALARTELFLFFTTILQNFNLKSLVDVKDIDTTPAISGFGHLPPFYEACFIPVQRADSLSSHL* 243417171 CYP2C12 rat Swiss B60783 (490 amino acids) Zaphiropoulos, P.G., Mode, A., Stroem, A., Husman, B., Andersson, G., Gustafsson, J.A. Sequence and regulation of two growth-hormone-controlled, sex-specific isozymes of cytochrome P-450 in rat liver, P-450-15beta and P-450-16alpha. Acta Med. Scand. Suppl. 723, 161-167 (1988) rat 2C cluster in chromosome order CYP2C12 rat PIR S26819 (490 amino acids) Matsumoto, T., Emi, Y., Kawabata, S. and Omura, T. J. Biochem. (1986) 100, 1359-1371 Purification and characterization of three male-specific and one female-specific forms of cytochrome P-450 from rat liver microsomes. rat 2C cluster in chromosome order CYP2C12 rat PIR B41425 (19 amino acids) Imaoka, S., Kamataki, T. and Funae, Y. Purification and characterization of six cytochromes P-450 from hepatic microsomes of immature female rats. J. Biochem. 
102, 843-851 (1987) rat 2C cluster in chromosome order CYP2C12 rat GenEMBL J03786 80% to 2C13 MDPFVVLVLSLSFLLLLYLWRPSPGRGKLPPGPTPLPIFGNFLQ IDMKDIRQSISNFSKTYGPVFTLYFGSQPTVVLHGYEAVKEALIDYGEEFSGRGRMPV FEKATKGLGISFSRGNVWRATRHFTVNTLRSLGMGKRTIEIKVQEEAEWLVMELKKTK GSPCDPKFIIGCAPCNVICSIIFQNRFDYKDKDFLSLIENVNEYIKIVSTPAFQVFNA FPILLDYCPGNHKTHSKHFAAIKSYLLKKIKEHEESLDVSNPRDFIDYFLIQRCQENG NQQMNYTQEHLAILVTNLFIGGTETSSLTLRFALLLLMKYPHITDKVQEEIGQVIGRH RSPCMLDRIHMPYTNAMIHEVQRYIDLAPNGLLHEVTCDTKFRDYFIPKGTAVLTSLT SVLHARKEFPNPEMFDPGHFLDENGNFKKSDYFMPFSAGKRKCVGEGLASMELFLFLT TILQNFKLKSLSDPKDIDINSIRSEFSSIPPTFQLCFIPV CYP2C13 rat GenEMBL X79810 (1944bp) Legraverend,C., Eguchi,H., Strom,A., Lahuna,O., Mode,A., Tollet,P., Westin,S. and Gustafsson,J.A. Transactivation of the rat CYP2C13 gene promoter involves HNF-1, HNF-3 and members of the orphan receptor subfamily. Biochemistry 33, 9889-9897 (1994) rat 2C cluster in chromosome order CYP2C13 rat PIR S26820 (30 amino acids) Matsumoto, T., Emi, Y., Kawabata, S. and Omura, T. Purification and characterization of three male-specific and one female-specific forms of cytochrome P-450 from rat liver microsomes. J. Biochem. 
100, 1359-1371 (1986) rat 2C cluster in chromosome order CYP2C13v1 rat 100% first 5 exons Note this seq also on 100.0% Un ++ 17276272 17282257 Exons 6-9 are on 99.1% Un ++ 17323193 17358099 2 aa diffs to 2C13 J02861 CYP2C12 is also on this same contig 99.6% Un ++ 17388090 17446950 2 aa diffs Minus Strand HSPs: 245246208 MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN (0) 245246041 245244920 FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ (1) 245244759 245244599 GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN 245244450 245240888 GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ (0) 245240727 245239607 VFNIFPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ 245239431 CYP2C13v1 rat GenEMBL J02861 80% to 2C12 MDPVVVLLLSLFFLLFLSLWRPSSGRGKLPPGPTPLPIIGNFFQ VDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPI CEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN GSPCDPQFIMGCAPGNVICSIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNI FPILLDYCPGNHNIYFKNHTWLKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ ENANQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVT AKVQEEIDHVIGRH RSPCMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHEVTCDTKFRNYFIPKGTAVLTSLT SVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLT TILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL CYP2C13v2 rat Not in figure probable 2C13 allele NM_138514 7AA DIFFS TO 2C13v1 (98%) 80% to 2C12 (temp name = CYP2CNEWA) MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQ VDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPI CEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNI FPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQENA NQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVTAKVQEEIDHVIGRH RSPSMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHDVTCDTKFRNYFIPKGTAVLTSLT SVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLT TILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL CYP2C13-de1b2b rat frag 7 Exon 1 76% to 2C13 Minus Strand 245307855 MDPIVVLVLSLSCLLFLSLWRNNSRRGKLPPGPTPLPIIRNYLQLDMKDIC*SLTK (0) 
245307688 frag 6 Exon 2 83% to 2C13 Minus Strand 245292652 FSKTYGPVYTLYFGSQPTVLLYGYEALKEALIDYGEAFSGRGRIPIHEKVSKGQ 245292491 CYP2C13-se1[6] rat frag h 72% to 2C13 exon 6 plus strand 100% to seq s 70% to 2C12 exon 6 244165142 ENGNQQMNYTQEHLATMVTDLL 244165207 244165209 FGGRETLNSTMRFAFLFLMKYPYTT 244165284 rat 2C cluster in chromosome order CYP2C13-se2[6:7] rat frag s Exons 6-7 minus strand 72% to 2C12 exon 6 100% to seq h 243766431 ENGNQQMNYTQEHLATMVTDLL 243766366 243766364 FGGRETLNSTMRFAFLFLMKYPYTT 243766290 243760156 XQINEEIGQVIWRHHSPSMLDWSHMIYTNAMVHEVQRYIDLAPNGVVCEVNCDTKYPRDYFIPK 243759968 rat 2C cluster in chromosome order CYP2C13-se3[1:2:3:2:3:] rat frag f Exons 1,2,3,2,3 exon 1 = 66% to 2C13 Minus Strand exons 2,3 = 57% to 2C13 two identical copies of exons 2,3 100% to seq v exons 2,3 244215468 SQSFLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 244215328 244214467 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 244214306 244214137 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213988 244213484 R*FS*RGWFSIFGKFSKVQ 244213428 244213259 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213110 CYP2C13-se4[1:2:3] rat frag v Exon 1 (+) 59% to 2C13 243678671 FLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 243678802 Exon 2 (+) 48% to 2C79 243679647 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 243679808 Exon 3 (+) 100% to seq f 243679977 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 243680126 rat 2C cluster in chromosome order CYP2C14 rabbit CYP2C15 rabbit CYP2C16 rabbit CYP2C17X human discontinued number See CYP2C18/19 CYP2C18 human GenEMBL L16869 to L16876 Swiss P33260 (490 amino acids) Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and Goldstein,J.A. Cloning and expression of complementary DNAs for multiple members of the human cytochrome P450IIC subfamily. Biochemistry 30, 3247-3255 (1991) de Morais,S.M., Schweikl,H., Blaisdell,J.A. and Goldstein,J.A. Gene structure and upstream regulatory regions of human CYP2C9 and CYP2C18. Biochem. 
Biophys. Res. Commun. 194, 194-201 (1993) Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and Goldstein,J.A. Correction: Cloning and expression of complementary DNAs for multiple members of the human cytochrome P450IIC subfamily. Biochemistry 32, 1390-1390 (1993) CYP2C18 human GenEMBL S63419 S63421 S63424 S63426 X56452 (multiple genomic fragments) PIR S45369 (56 amino acids) Ged,C. and Beaune,P. Partial sequence and polymerase chain reaction-mediated analysis of expression of the human CYP2C18 gene Pharmacogenetics 2, 109-115 (1992) CYP2C18 human PIR A61269 (490 amino acids) Furuya, H., Meyer, U.A., Gelboin, H.V. and Gonzalez, F.J. Polymerase chain reaction-directed identification, cloning, and quantification of human CYP2C18 mRNA. Mol. Pharmacol. 40, 375-382 (1991) CYP2C18/19 human GenEMBL M61858 J05326 (1276bp) Swiss P33259 (270 amino acids) Goldstein,J.A., Raucy,J.L., Blaisdell,J.A., Faletto,M.B. and Romkes,M. Cloning and expression of complementary DNAs for multiple members of the human cytochrome P450IIC subfamily Biochemistry 30, 3247-3255 (1991) This sequence named 2C17 was later found to be a splice of 2C18 and 2C19. Therefore, there is no 2C17 sequence. CYP2C18/19 human GenEMBL L07093 (2395bp) Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and Goldstein,J.A. Correction: Cloning and expression of complementary cDNAs for multiple members of the human cytochrome P450IIC subfamily Biochemistry 32, 1390-1390 (1993) CYP2C18 Macaca fascicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 9/29/2005 3 aa diffs to rhesus 2C18, 95% to human 2C18 only 80% to 2C19 complete sequence CYP2C18 Macaca fascicularis (cynomolgus monkey) No accession Wu Zhicong Submitted to nomenclature committee 10/30/2006 96% to 2C18 human, 81% to 2C9, 81% to 2C19, 76% to 2C8 3 amino acid diffs to Uno's seq. CYP2C18 Macaca mulatta (rhesus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 9/29/2005 3 aa diffs to M.
fascicularis 2C18 complete sequence CYP2C19 human Swiss P33261 (490 amino acids) Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and Goldstein,J.A. Cloning and expression of complementary DNAs for multiple members of the human cytochrome P450IIC subfamily. Biochemistry 30, 3247-3255 (1991) CYP2C19 human GenEMBL L31506 (129bp) GenEMBL L31507 (129bp) De Morais,S.M.F., Wilkinson,G.R., Blaisdell,J.A., Nakamura,K., Meyer,U.A. and Goldstein,J.A. The major genetic defect responsible for the polymorphism of S-mephenytoin metabolism in humans J. Biol. Chem. 269, 15419-15422 (1994) CYP2C19 human GenEMBL L32982 (329bp) wild type exon 4 GenEMBL L32983 (329bp) mutant exon 4 De Morais,S.M.F., Wilkinson,G.R., Blaisdell,J.A., Meyer,U.A., Nakamura,K. and Goldstein,J.A. Identification of a new genetic defect responsible for the polymorphism of S-mephenytoin metabolism in Japanese Mol. Pharmacol. 46, 594-598 (1994) CYP2C19 human PIR S38753 (16 amino acids) Wrighton, S.A., Stevens, J.C., Becker, G.W., and van den Branden,M. Isolation and characterization of human liver cytochrome P450 2C19: correlation between 2C19 and S-mephenytoin 4'-hydroxylation. Arch. Biochem. Biophys. 306, 240-245 (1993) CYP2C20/2C8 Macaca fascicularis (cynomolgus monkey) GenEMBL S53046 (1901bp) Swiss P33262 (490 amino acids) PIR S28166 (490 amino acids) Komori,M., Kikuchi,O., Sakuma,T., Funaki,J., Kitada,M. and Kamataki,T. Molecular cloning of monkey liver cytochrome P-450 cDNAs: similarity of the primary sequences to human cytochromes P-450. Biochim. Biophys. Acta 1171, 141-146 (1992) CYP2C8 will be the preferred name for this seq in the future. CYP2C20/2C8 Macaca fascicularis (cynomolgus monkey) PIR A60466 (22 amino acids) Ohi, H., Toratani, S., Komori, M., Miura, T., Kitada, M. and Kamataki, T. Comparative study of cytochrome P-450 in liver microsomes. A form of monkey cytochrome P-450, P-450-MK1, immunochemically cross-reactive with antibodies to rat P-450-male. Biochem. Pharmacol.
38, 361-365 (1989) CYP2C8 will be the preferred name for this seq in the future. CYP2C20/2C8 Macaca mulatta (rhesus monkey) name change from CYP2C74 No accession number Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 91% to CYP2C8, 78% to CYP2C19, probable ortholog of CYP2C8 formerly CYP2C74. There are only 3 amino acid differences to Macaca fasicularis (cynomolgus monkey) GenEMBL S53046 Since this is the clear ortholog of that earlier sequence the name has been changed to reflect the orthology. CYP2C8 will be the preferred name for this seq in the future. CYP2C21 Canis familiaris (dog) NW_876285.1: 8748112-8724707 Joanna Wilson and students submitted to nomenclature committee Feb. 17, 2009 70% to human 2C19 MDLFIVLVICLSCLISFFLWNQNRAKGKLPPGPTPLPIIGNILQINTKNVSKSLSKLAENYGPVFTVYFGMKPTV VLYGYEAVKEALIDRSEEFSGRGHFPLLDWTIQGLGIVFSNGEKWKQTRRFSLTVLRNMGMGKKTVEDRIQEEAL YLVEALKKTNASPCDPTFLLGCAPCNVICSIIFQNRFEYDDKDFLTLLEYFHENLLISSTSWIQLYNAFPLLIHY LPGSHHVLFKNIANQFKFISEKIKEHEESLNFSNPRDFIDYFLIKIEKEKHNKQSEFTMDNLIITIWDVFSAGTE TTSTTLRYGLLVLLKHPDVTAKVQEEIHRVVGRHRSPCMQDRSCMPYTDAVVHEIQRYIDLVPNNLPHSVTQDIK FREYLIPKGTTILTSLTSVLHDEKGFPNPDQFDPGHFLDENGSFKKSDYFMAFSAGKRVCVGEGLARMELFLLLT NILQHFTLKPLVDPKDIDTTPIANGLGATPPSYKLCFVPV* CYP2C22 rat GenEMBL M58041 61% to 2C79 245425985 MALFIFLGIWLSCLVFLFLWNQHHVRRKLPPGPTPLPIFGNILQVGVKNMSKSMCM 245425818 LAKEYGPVFTMYLGMKPTVVLYGYEVLKEALIDRGEEFSDKMHSSM LSKVSQGLGIVFSNGEIWKQTRRFSLMVLRSMGMGKRTIENRIQEEVVYLLEALRKTN GSPCDPSFLLACVPCNLISSVIFQHRFDYSDEKFQKFIENFHTKIEILASPWAQLCSA YPVLYYLPGIHNKFLKDVTEQKKFILMEINRHRASLNLSNPQDFIDYFLIKMEKEKHN EKSEFTMDNLIVTIGDLFGAGTETTSSTIKYGLLLLLKYPEVTAKIQEEITRVIGRHR RPCMQDRNHMPYTDAVLHEIQRYIDFVPIPLPRKTTQDVEFRGYHIPK GTSVMACLTSALHDDKEFPNPEKFDPGHFLDEKGNFKKSDYFMAFSA GRRACIGEGLARMEMFLILTSILQHFILKPLVNPEDIDTTPVQPGLLSLPPPFQLCFIPV rat 2C cluster in chromosome order CYP2C22-se2[1:2] rat frag 9 Exon 1 61% to 2C22 Minus Strand 245347583 MDLFIILWICFACLSLFFLWNQLHYKEKLPPGPVPLPIVGNILQVNIKSIIKSLNI (0) 245347416 frag 8 Exon 2 79% to 
2C22 Minus Strand 245334622 LAKEYGPVFTVYLGMKPTVVLHGHKALKEALIDRANEFSVKMQSSLLSKESQGL (1) 245334461 CYP2C23 rat GenEMBL U04733 (1919bp) Karara,A., Makita,K., Jacobson,H.R., Falck,J.R., Guengerich,F.P., DuBois,R.N. and Capdevila,J.H. Molecular cloning, expression, and enzymatic characterization of the rat kidney cytochrome P-450 arachidonic acid epoxygenase. J. Biol. Chem. 268, 13565-13570 (1993) rat 2C cluster in chromosome order CYP2C23 rat GenEMBL S67064 (265bp) Imaoka,S., Wedlund,P.J., Ogawa,H., Kimura,S., Gonzalez,F.J. and Kim,H.Y. Identification of CYP2C23 expressed in rat kidney as an arachidonic acid epoxygenase. J. Pharmacol. Exp. Ther. 267, 1012-1016 (1993) rat 2C cluster in chromosome order CYP2C23 rat PIR S29817 (20 amino acids) Marie, S.; Roussel, F.; Cresteil, T. Age- and tissue-dependent expression of CYP2C23 in the rat. Biochim. Biophys. Acta 1172, 124-130 (1993) note: This sequence is different from GenEMBL U04733 and S67064 by one amino acid. PIR S13101, SwissProt P24470 and GenEMBL X55446 are all equivalent, but they have a frame shift in the sequence in the region of this 20 amino acid fragment. Amino acids 38-54 are affected. rat 2C cluster in chromosome order CYP2C23 rat GenEMBL X55446 59% to 2C11 MELLGFTTLALVVSVTCLSLLSVWTKLRTRGRLPPGPHPPSHYW ESTATEPQGHPASLSKLAKEYGPVYTLYFGTSPTVVLHGYDVVKEALLQQGDEFLGRG PLPIIEDTHKGYGLIFSNGERWKVMRRFSLMTLRNFGMGKRSLEERVQEEAWCLVEEL QKTKAQPFDPTFILACAPCNVICSILFNDRFQYNDKTFLNLMDLLNKNFQQVNSVWCQ MYNLWPTIIKYLPGKHIEFAKRIDDVKNFILEKVKEHQKSLDPANPRDYIDCFLSKIE EEKDNLKSEFHLENLAVCGSNLFTAGTETTSTTLRFGLLLLMKYPEVQAKVHEELDRV IGRHQPPSMKDKMKLPYTDAVLHEIQRYITLVGSSLPHAVVQDTKFRDYVIPKGTTVL PMLSSVMLDQKEFANPEKFDPGHFLDKNGCFKKTDYFVPFSLGKRACVGESLARMELF LFFTTLLQKFSLKTLVEPKDLDIKPITTGIINLPPPYKLCLVPR CYP2C24 rat GenEMBL S59647 (226bp) GenEMBL S59648 (187bp) GenEMBL S59652 (380bp) Zaphiropoulos,P.G. Differential expression of cytochrome P450 2C24 and transcripts in rat kidney and prostate: evidence indicative of alternative and possibly trans splicing events.
Biochem. Biophys. Res. Commun. 192, 778-786 (1993) rat 2C cluster in chromosome order CYP2C24 rat Swiss P33273 (434 amino acids) PIR PT0435 (302 amino acids) PIR JH0451 (434 amino acids) Zaphiropoulos,P.G. cDNA cloning and regulation of a novel rat cytochrome P450 of the 2C gene subfamily (P450IIC24). Biochem. Biophys. Res. Commun. 180, 645-651 (1991) rat 2C cluster in chromosome order CYP2C24 rat 92% to 2C80, M86678 has alternative splice first exon seen only in M86678 exons 2-4 only 2 aa diffs to 2C24 on M86678 no ESTs contain the yellow region but CK481568.1 covers exons 1,2,3,4 CO565602.1 matched the end of the gene sequence and extends it a little 6 aa Used this EST to blast the trace files to find the end of exon 7 MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN CK481568.1 exon 1 QLSCSRKFGLTCGPEAQ rat repeat seq found in many rat BACs 243522306 FTDKLTAKCHSSVSLHIDLPGNLL 243522235 yellow region not P450 seq. 243522073 FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL 243521912 243521366 GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 243521217 243518830 GSLCDPTFILSCAPSNVICSVVFHNRFDYKDENFLNLMEKLNENFKILNSPWMQ 243518669 VCNALPAFIDYLPGSHNRVIKNFAEI 676 677 KSYILRRVKEHQETLDMDNPRDFIDCFLIKMEQEKHNPRTEFTIEILMATVSDVFVAGSE 856 857 TTSTTLRYGLLLLLKHIEVT gnl|ti|132779224 rts18e73.g from trace files for exon 7 AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK rat 2C cluster in chromosome order CYP2C25 Mesocricetus auratus (Syrian hamster) GenEMBL X63022 (1829bp, incorrectly given as X60322 in Table 3 of the 1993 nomenclature update) Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T. Sex-related difference in the expression of cytochrome P450 in hamsters: cDNA cloning and examination of the expression of three distinct CYP2C cDNAs. Molec. Pharmacol. 45, 228-236 (1994) CYP2C26 Mesocricetus auratus (Syrian hamster) GenEMBL D11435 (1808bp) Swiss P33263 (490 amino acids) Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T.
Sex-related difference in the expression of cytochrome P450 in hamsters: cDNA cloning and examination of the expression of three distinct CYP2C cDNAs. Molec. Pharmacol. 45, 228-236 (1994) CYP2C27 Mesocricetus auratus (Syrian hamster) GenEMBL D11436 (1784bp) Swiss P33264 (490 amino acids) Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T. Sex-related difference in the expression of cytochrome P450 in hamsters: cDNA cloning and examination of the expression of three distinct CYP2C cDNAs. Molec. Pharmacol. 45, 228-236 (1994) CYP2C28 Mesocricetus auratus (Syrian hamster) GenEMBL D11437 (1556bp) Swiss P33265 (490 amino acids) Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T. Sex-related difference in the expression of cytochrome P450 in hamsters: cDNA cloning and examination of the expression of three distinct CYP2C cDNAs. Molec. Pharmacol. 45, 228-236 (1994) Cyp2c29 mouse GenEMBL D17674 (1751bp) also BC013895 Matsunaga,T., Watanabe,K., Yamamoto,I., Negishi, M., Gonzalez,F.J. and Yoshimura, H. cDNA cloning and sequence of CYP2C29 encoding P-450 MUT-2, a microsomal aldehyde oxygenase. Biochim. Biophys. Acta 1184, 299-301 (1994) Cyp2c29 mouse PIR A61268 (16 amino acids) Bornheim, L.M. and Correia, M.A. Purification and characterization of a mouse liver cytochrome P-450 induced by cannabidiol. Mol. Pharmacol. 36, 377-383 (1989) Cyp2c29v2 mouse no accession number Gang Luo and Joyce A. Goldstein clone M2c9k submitted to Nomenclature Committee CYP2C30 rabbit GenEMBL D26153 Noshiro,M., Ishida,H. and Okuda,K. unpublished (1993) CYP2C31 Capra hircus (dwarf goat) GenEMBL X76502 (1185bp) PIR JC2199 (284 amino acids) PIR S39314 (284 amino acids) Zeilmaker,W.M., Van't Klooster,G.A.E., Gremmels-Gerhmann,F.J. Van Miert,A.S.J. and Horbach,G.J.M.J. cDNA and deduced amino acid sequence of a dwarf goat liver cytochrome P450-fragment belonging to the CYP2C gene subfamily. Biochem. Biophys. Res. Commun. 
200, 120-125 (1994) CYP2C32 pig GenEMBL U35733.1 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) most similar to 2C24 Clone name CL1 CYP2C33v1 pig GenEMBL U35837 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name CL7 CYP2C33v2 pig GenEMBL U35838 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name CL8 CYP2C33v3 pig GenEMBL U35839 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name PF1 CYP2C33v4 Sus scrofa (pig) GenEMBL AB052257 Misaki Kojima Submitted to nomenclature committee Oct. 27, 2000 2 amino acids diffs with 2C33v1 and v2 clone name c296 CYP2C34v1 pig GenEMBL U35840.1 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name PF15 CYP2C34v2 pig GenEMBL U35841.1 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. 
Biophys. Res. Commun. 212, 433-441 (1995) Clone name CL6 CYP2C34v3 pig GenEMBL U35842.1 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name Cl12 CYP2C34v4 pig GenEMBL U35843.1 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name Cl13 CYP2C35 pig GenEMBL U35844.1 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name PF11/14 CYP2C36 pig GenEMBL U35845.1 (681bp) Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B Cytochrome P450 genes expressed in porcine ovaries: identification of novel forms, evidence for gene conversion, and evolutionary relationships. Biochem. Biophys. Res. Commun. 212, 433-441 (1995) Clone name PF13 CYP2C37 macaque [name conflict, reassigned to CYP2C43] no accession number S. Ohmori submitted to Nomenclature Committee Cyp2c37 mouse AF047542 NM_010001, also AK005017 Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA. Cloning and expression of murine CYP2Cs and their ability to metabolize arachidonic acid. Arch Biochem Biophys. 357, 45-57 1998. clone M2c10b submitted to Nomenclature Committee Cyp2c38 mouse AF047725 Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA. Cloning and expression of murine CYP2Cs and their ability to metabolize arachidonic acid. Arch Biochem Biophys. 357, 45-57 1998. 
clone M2c13f submitted to Nomenclature Committee Cyp2c39 mouse AF047726 NM_010003 Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA. Cloning and expression of murine CYP2Cs and their ability to metabolize arachidonic acid. Arch Biochem Biophys. 357, 45-57 1998. clone M2c9d submitted to Nomenclature Committee Cyp2c39-ie6b mouse GenEMBL NT_039689.1 Internal exon 6 (duplicate exon) 5895730 ANHIQQAEFSLENLACTINNLFAAGTETTSTSLINARLLFVRDPNVT 5895870 Cyp2c40 mouse AF047727 NM_010004 (NW_000147 exons 2-6 only) Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA. Cloning and expression of murine CYP2Cs and their ability to metabolize arachidonic acid. Arch Biochem Biophys. 357, 45-57 1998. Tsao CC, Foley J, Coulter SJ, Maronpot R, Zeldin DC, Goldstein JA. CYP2C40, a unique arachidonic acid 16-hydroxylase, is the major CYP2C in murine intestinal tract. Mol Pharmacol. 58, 279-87 2000 clone M2c9h submitted to Nomenclature Committee CYP2C41 dog NM_001003334, AF016248 Stephen R. Bai and Joyce A. Goldstein clone M2c9h submitted to Nomenclature Committee MDPVVVLVLCLSCCLLLSLWKQSSRKGKLPPGPTPLPFIGNILQ LDKDINKSLSNLSKAYGPVFTLYFGMKPTVVLHGYDAVKETLIDLGEEFSARGRFPIA EKVSGGHGIIFTSGNRWKEMRRFALTTLRNLGMGKSDLESRVQEEACYLVEELRKTNA LPCDPTFVLGCASCNVICSIIFQNRFDYTDQTLIGFLEKLNENFRILSSPWIQAYNSF PALLHYLPGSHNTIFKNFAFIKSYILEKIKEHQESFDVNNPRDFIDYFLIKMEQEKHN QPLEFTFENLKTIATDLFGAGTETTSTTLRYGLLLLLKHPEVTVKVQEEIDRVIGRHQ SPHMQDRSRMPYTNAVLHEIQRYIDLVPNSLPHAVTCDVKFRNYVIPKGTTILISLSS VLSDEKEFPRPEIFDPAHFLDDSGNFKKSDYFMAFSAGKRICVGEGLARMELFLFLTT ILQKFTLKPLVDPKDIDTTPLASGFGHVPPTYQLCFIPV CYP2C42 pig GenEMBL Z93098 (1307bp) Nissen,P.H., Winteroe,A.K. and Fredholm,M. Characterization and mapping of three porcine genes belonging to the cytochrome P450 superfamily Unpublished clone 10b03 CYP2C42P1 pig GenEMBL Z93100 (1758bp) Nissen,P.H., Winteroe,A.K. and Fredholm,M. 
Characterization and mapping of three porcine genes belonging to the cytochrome P450 superfamily Unpublished clone 15d09 (pseudogene) CYP2C43 Macaca mulatta (rhesus monkey) no accession number Matsunaga T, Ohmori S, Ishida M, Sakamoto Y, Nakasa H, Kitada M. Molecular Cloning of Monkey CYP2C43 cDNA and Expression in Yeast. Drug Metab Pharmacokinet. 2002;17(2):117-24. submitted to Nomenclature Committee [name conflict, formerly CYP2C37 reassigned to CYP2C43] CYP2C43 Macaca fasicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP2C9v1 92% to 2C9 human, 93% to 2C75, 77% to 2C20, 77% to 2C74 99% to rhesus 2C43 Cyp2c44 mouse no accession number Christian Helvig and Jorge H. Capdevila submitted to nomenclature committee Oct. 2, 1998 most similar to CYP2C23 (87% identical) MELLGLPTLALLVLVMSLSLLSVWTKMRTGGRLPPGPTPLPIIGNILQLDLKDIPASLSK LAKEYGPVYTLYFGSWPTVVLHGYDVVKEALLNQGDEFLGRGPLPIIEDSQKGH GIVFSEGERWKLLRRFSLMTLKNFGMGKRSLEERVQEEARCLVEELHKTE AQPFDPTFILACAPCNVICSILFNERFPYNDKTFLNLMDLLNKNFYQLNSIWIQ MYNLWPTIMKYIPGKHREFSKRLGGVKNFILEKVKEHQEFLDPANPRDYIDCFLSKIEE EKHSLKSDFNLENLAICGSNLFTAGTETTSTTLRFGLLLLVKHPEVQ AKVHEELDRVIGRHQPPSMKDKMKLPYTDAVLHEIQRYITLLPSSLPHAVVQDTKFRHYVIPK GTAVFPFLSSILLDQKEFPNPEKFDPGHFLDKNGCFKKTDYFVPFSL GKRSCVGEGLARMELFLFFTTILQKFSLKALVEPKDLDIKPVTTGLFNLPPPYKLRLVPR CYP2C45 gallus gallus (chicken) No accession number Manuel Baader Submitted to nomenclature committee Nov. 22, 1999 57% identical to CYP2C9 CYP2C46 rat No accession number Lars von Buchholtz Submitted to nomenclature committee March 6, 2000 91% to 2C24 CYP2C47 Phascolarctos cinereus (koala) No accession number Ross McKinnon Submitted to nomenclature committee May 25, 2000 60% identical to many 2C sequences CYP2C48 Phascolarctos cinereus (koala) No accession number Brett Jones and Ross McKinnon Submitted to nomenclature committee Nov. 
6, 2000 92% identical to 2C47
CYP2C49 Sus scrofa (pig) GenEMBL AB052258 Misaki Kojima Submitted to nomenclature committee Oct. 27, 2000 92% to 2C35 and 2C34v1, v3, v4 80% to 2C18, 78% to 2C9, 77% to 2C19 and 75% to 2C8 clone name c195
Cyp2c50 mouse GenEMBL BC011222.1, NT_039692 GSS AZ589908 one exon only ESTs AI118193 ue34e02.x1, opposite end = AI098787 ue34e02.y1 AI097740 AI117011 AI119501 AI314482 BF385641 AI528254 AA968308 AI876138 AI097678 AI226027 BF384486 BF659471 AI529923 AI266900 uj08d09.x1, opposite end AI226027 uj08d09.y1, Joyce Goldstein and Cheng-Chung Tsao submitted to nomenclature committee 3/1/2001 94% to 2c37; 75% 2c39,2c29v2; 74% 2c38; 68% 2c40; 53% 2c44 name 2C heart NT_039692 + strand
176707 MDPILVLVFTLSCLFLLSLWRQSSERGKLPPGPTPLPIIGNILQINVKDICQSFTN 176874
177228 LSKVYGPVYTLYLGRKPTVVLHGYEAVKEALVDHGEEFAGRGRLPVFDKATNGM 177389
177552 GIIFSKGNVWKNTRRFSLTTLRNLGMGKRSIEDRVQEEARCLVEELRKTN 177701
177951 GSPCDPTFILGCAPCNVICSIIFQDRFDYKDRDFLNLMEKLNEITKIMSTPWLQ 178112
179211 VCNTFPVLLDYCPGSHNKVFKNYACIKNFLLEKIKEHEESLDVTIPRDFIDYFLINGGQ 179387
183835 ENGNYPLKNRLEHLAITVTDLFSAGTETTSTTLRYALLLLLKYPHVT 183975
185072 AKVQEEIEHVIGKHRRPCMQDRSHMPYTDAMIHEVQRFIDLVPNSLPHEVTCDIKFRNYFIPK 185260
198149 GTNVITSLSSVLRDSKEFPNPEKFDPGHFLDENGKFKKSDYFMPFST 198289
200344 GKRICAGEGLARMELFLFLTSILQNFNLKPLVHPKDIDVTPMLIGLASVPPAFQLCFIPS 200523
Cyp2c51X?
mouse No accession number Joyce Golstein and Cheng-Chung Tsao submitted to nomenclature committee 3/1/2001 69% to 2c29v2; 69% 2c37; 68% 2c38; 67% 2c39; 67% 2c40 no exact hits in nr, htgs, est, gss or sts on 3/5/01 name 2C aorta note: this seq appears to be a combination between 2c52p and 2c69 it may not be a real gene Cyp2c52-ps mouse GenEMBL XM_140720 Joyce Golstein and Cheng-Chung Tsao submitted to nomenclature committee 3/1/2001 78% to 2c51, 70% to 2c29v2, 2c38; 67% to 2c39, 2c37; 61% to 2c40 missing PYTD in K-helix no exact hits in nr, htgs, est, gss or sts on 3/5/01 name 2C kidney, 2C eye sequence shown is from Ensembl mouse version 3 628318 MDPVLVLVLTLSCLLLLS*WRQNSGRGKLPPGPTPLPIIGNILQIDVKNTGQSVGK 628367 630645 FSKVYGPVFTLYFGMKPSVVLHGYEAVKEALVDLGEGFSGRGSFPVAEKASKGL 630806 630954 GIIFSNGMKWKEIRRFSVMT 631013 frameshift 631012 LRNFGMGKRSVEDRVQEEARCLVEELRNGK 631101 636385 XAPCDPTFILGCAPCNVICSIIFQKRFDYKDQTFLNLMDKFNENFRILSTPWIQ 636425 639913 VCNTFPAIIDYFPGSHNQVLKNFSYIKKNYVLEKVKKHQESLDMENPRDFIDCFLIKMKQ 639972 710041 EKHSLQSEFTHESLVATVTDMFGAGTETTSNTLRYGLLLLLKHVDIT 710181 713060 AKVQEEIERVVGRHRSPCVQDRSHM 713134 4 aa deletion and f.s. 713136 AVVHETQRYIVLIPTNLPHSVTCDAKFRNYFIPK 713237 715864 GTTVITSLTSMLHDDKEFPNPEKFDPGYFLDERGNVKKSDYFVPFSA 716004 717828 GKRMCAGEGLTGMELFLFFTIILQNFNLKPLVDVKDIDTTPVVSGFGHVPPLYQARFIPV* 718010 Cyp2c53-ps mouse AC078913.5 seq b assembled from parts 74% to 2c39 Old assembly included some N- and C-term parts not from this gene TNFSKVYGPVFTLYLGMKPTVVLHGYEAVKEALIDHGEEFAVKGIFPLAEKNSK FTLMTLKNLGMGKRNIEDRVQEEAQCLVEELRKTNG SLCDPTFILGCAPCNVTCSIIFQNHFDYKDQDFLSLMEKINENTKIVSTPWIQ LIVYCPGSHKTVPENAYYIEIYILKKIKEHQESLDVTNPLDFIDYYLIKCKQ GAGTETSTTLRYALLLLMTYPEVT Cyp2c53-ps mouse AY227735 NW_000145 Hong Wang, Joyce Goldstein, Darryl Zeldin Between Cyp2c66 and Cyp2c29 on chr 19 Temp name 2CN6 74% to 2c29 note: this is a pseudogene. 
There are three stop codons and the C-helix WXXXR motif is missing MDLISFLMLTLFCLILLSLWSQSSGRGKLPPGPTPVPIIVSLLQLDVKNITQSSTN FSKVYGPVFTLYLGMKPTVVLHGYEAVKEALIDHGEEFAVKGIFPLAEKNSKAL LSGFML*FLFLFV*EFTLMTLKNLGMGKRNIEDRVQEEAQCLVEELRKTN GSLCDPTFILGCAPCNVTCSIIFQNHFDYKDQDFLSLMEKINENTKIVSTPWIQ VVKFSPVLIVYCPGSHKTVPENAYYIEIYILKKIKEHQESLDVTNPLDFIDYYLIKCKQ EYHNHYSELTLKILSTTVTDFFGAGTETTSTTLRYALLLLMTYPEVT AKIQDENDHVVGKHRNLCMQDRSHMPYTFAMIH*VQRFIDLLPTNLPHAVTCDIKFRNYIILK GTAVITSLSSVLHDRKEFLNPEMFDPGHFLDGNGNFKKSDHFMPFSA GKRVCVGEGLACMELFLFLTTALQNFKLKPLVHPKDINTTPVLNGFASVPLFYELCSIPL* Cyp2c54 mouse GenEMBL NT_039692 - strand Darryl Zeldin submitted to nomenclature committee 3/18/2002 clone name N1 92% to 2c50 91% to 2c37 76% to 2c29 73% to 2c38 74% to 2c39 70% to 2c40 67% to 2c55 66% to 2c53p 59% to 2c44 67% to 2c52p 68% to 2c51 160912 MDPILVLVLTLSCLFLLSLWRQSYERGKLPPGPTPLPIIGNILQIDVKDICQSFTN 160745 159630 LSRVYGPVYTLYLGRKPTVVLHGYEAVKEALVDHGDVFAGRGRLPVFDKATNGM 159469 159306 GIGFSNGSVWKNTRHFSLMTLRNLGMGKRSIEDRVQEEARCLVEELRKTN 159157 158708 GSPCDPTFILGCAPCNVICSIIFQDRFDYKDRDFLNLLEKLDEISKILSTPWLQ 158547 157443 VCNTFPALLDYCPGSHNQFFKNYAYIKNFLLEKIREHKESLDVTIPRDFIDYFLIKGAQ 157267 134958 EDDNHPLKNNFEHLAITVTDLFIGGTESMSTTLRYALLLLLKYPHVT 134818 133577 AKVQEEIEHVIGKHRRPCMQDRSHMPYTNAMIHEVQRFIDLVPNNLPHEVTCDIKFRNYFIPK 133389 127646 GTTVITSLSSVLRDSKEFPNPEKFDPGHFLDENGKFKKSDYFMPFST 127506 125732 GKRICAGEGLARMELFLFLTSILQNFNLKPLVHPKDIDITPMLIGLGSVPPAFQLCFIPS 125553 Cyp2c55 mouse GenEMBL NT_039689.1 + strand Darryl Zeldin submitted to nomenclature committee 3/18/2002 clone name N3 71% to 2c29 70% to 2c39 70% top 2c38 69% to 2c37 69% to 2c50 65% to 2c40 58% to 2c44 53% to 2c53p 59% to 2c52p 67% to 2c54 67% to 2c51 5347110 MDPVLVLVLTLSCLLLLSLWRQNSGRGKLPPGPTPFPIIGNILQIDIKNISKSFNY 5347277 5351084 FSKVYGPVFTLYFGSKPTVVVHGYEAVKEALDDLGEEFSGRGSFQIFERINNDL 5351245 5351753 GVIFSNGTKWKELRRFSIMTLRSFGMGKRSIEDRIQEEASCLVEELRKAN 5351902 5358706 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDEKFLNLMERLNENFKILNSPWMQ 5358867 5371382 
VYNALPTLINYLPGSHNKVIKNFTEIKSYILGRVKEHQETLDMDNPRDFIDCFLIKMEQ 5371558 5374359 EKHNPHSEFTIESLMATVTDIFVAGTETTNITLRYGLLLLLKHTEVT 5374499 5375564 AKVQAEIDHVIGRHRSPCMQDRTRMPYTDAMVHEIQRYIDLIPNNVPHAATCNVRFRSYFIPK 5375752 5378482 GTELVTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFKKSDYFMPFSI 5378622 5382398 GKRMCVGEALARTELFLILTTILQNFNLKSLVDTKDIDTTPVANTFGRVPPSYQLYFIPR 5382577 CYP2C56PX human = CYP2C-se1[7] (see below) CYP2C57PX human = CYP2AC1P a new subfamily in mammals (see below) CYP2C58P human NT_008769.11|Hs10_8926 solo exons 1,2,3 between 2C19 and 2C9 same as AL133513.12 an alternative name for this sequence would be CYP2C19-de1b2b3b 8303126 LDLAAVLMLCLSCLLLLSL*TQISGRGKLPSDSTPLQVIESILQMADKDICKSSSNLSTLY 8302944 8296311 SLYFDMKLVLVLHGYEVLKKALIHHGEEFSGKGIFPVSKK 8296192 8295999 IIFSNRKPCKEIWPFLLMTLWNCGVVKRS 8295913 8295911 LGKHVQVEAHCIVWELRRTK 8295852 CYP2C59PX human = CYP2C9-de2c3c (see above) CYP2C60PX human = CYP2C8-de6b (see above) CYP2C61PX human = CYP2C-se2[1:2] (see below) CYP2C62P human AL138921 NT_030059 chromosome 10 50% to 2C8 Chr10q24.31 101999343-102031105 - strand build 33 5Mb upstream of 2C8 LAMCVTCLIFFLVWKKSPSPTPLPTIGNRLQRNPKD CISFQLAKEYSSVYTLYFGSWPTMVFHGYKAVKEALIDQGDKSLGRGHIPIIDDAQKRY TSAQPFDSTFILASAPCNL CSFLFKECFQYKNETFLSLMGLLNENVK TTVLPLLSLVLFSYKQFP GHFLDKNGCFNKTDYFLPFSLGK CYP2C63PX human = CYP2C-se3[1] (see below) CYP2C64PX human = CYP2C-se4[1] (see below) Cyp2c65 mouse AY227733 NW_000145 also NT_039689.1 Hong Wang, Joyce Goldstein, Darryl Zeldin Between Cyp2c55 and Cyp2c66 on chr 19 Temp name 2CN4 93% to Cyp2c66 73% to 2c29 NT_039689.1 + strand 5398093 MVLGVFLGLLLTCLLLLSLWRQNSQRRNLPPGPTPLPIIGNILQLDLKDISKSLRN 5398260 5406366 FSKVYGPVFTLYLGRNPAVVLHGYEAVKEAFTDHGEEFAGRGVFPVFDKFKKNC 5406527 5406732 GVVFSSGRTWKEMRRFSLMTLRNFGMGRRSIEDRIQEEARCLVDELRKTKG 5406884 5409456 EPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLTFLDILNENVEILSSPWIQ 5409614 5410489 ICNNFPAVIDYLPGRHRKLHKNFAFAEHYFLSKVKQHQESLDINNPRDFIDCFLIKMEQ 5410665 5419474 EKHNPKTEFTCENLVFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT 5419614 5424846 
AKVQEEIDCVIGRHRSPCMQDRHSMPYTDAVLHEIQRYIDLLPTSLPHAVTRDVKFREYLIPK 5425034 5427909 GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDERGKFKKSDYFFPFST 5428049 5430603 GKRICVGEGLARAELFLFLTTILQNFNLKSPVDLKDLDTTPVANGFASVPPKFQICFIPI* 5430785 Cyp2c65-de9b mouse GenEMBL NT_039689.1 + strand z in Figure 2D Nelson et al. Pharmacogenetics 14, 1-18 (2004) detritus exon 9 between Cyp2c65 and Cyp2c66 5432237 RS*LYIPPTPGKCICVRDNLAQMKLFLFLTTILYNFNLKSVDPQELDTT 5432383 Cyp2c66 mouse AY227734 NW_000145 Hong Wang, Joyce Goldstein, Darryl Zeldin Between Cyp2c65 and Cyp2c53p on chr 19 Temp name 2CN5 93% to Cyp2c65 73% to 2c29 MVLGVFLGLLLTCLLLLSLWKQNSQRRNLPPGPTPLPIIGNILQLDLKDISKSLRN FSKVYGPVFTLYLGKKPAVVLHGYKAVKEALIDHGEEFAGRGTFPVADKFIRVL GVVFSSGRTWKEMRRFSLMTLRNFGMGKRSIEDRVQEEARCLVDELRKTK GVPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLTFIDILNENVEILSSPWIQ VCNNFPAIIDYLPGRHRKLLKNFDFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ EKHNPKTEFTCENLIFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT AKVQAEIDCVIGRHRSPCMQDRHSMPYTDAVLHEIQRYIDLLPTSLPHAVTRDVKFREYLIPK GTTVIASLTSVLYDDKEFLNPERFDPSHFLDESGKFKKSDYFFPFST GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKDLDTTPVANGFVSVPPKFQICFISI* Cyp2c67 mouse GenEMBL NW_030157.1 (aa 1-274 exons 1-5 minus strand) GenEMBL NW_022459.1 (aa 275-320 exon 6 plus strand) GenEMBL NW_021833.1 (aa 321-431 exons 7-8 plus strand) Part of exon 9 not found GenEMBL NW_020256.1 (aa 469-491 end of exon 9 plus strand) Hong Wang, Joyce Goldstein, Darryl Zeldin Between Cyp2c39 and Cyp2c68 on chr 19 Temp name 2CN7 95% to Cyp2c40 MDPFVVLVLCLSFLLVLSLWRQRSARGNLPPGPTPLPIIGNYHLIDMKDIGQCLTN FSKTYGPVFTLYFGSQPIVVLHGYEAMKEAFIDHGEEFSGRGRFPFFDKVTKGK GIGFSHGNVWKATRVFTINTLRNLGMGKRTIENKVQEEAQWLMKELKKTN GLPCDPQFIIGCAPCNVICSIVFQNRFDYKDKDFLSLIGK VNECTEILSSPGCQIFNAVPILIDYCPGRHNKFFKNHTWIKSYLLEKIKE HEESLDVTNPRDFIDYFLIQRCQKKGIEHMEYTIEHLATLVTDLVFGGTE SLSSTMRFALLLLMKHTHITAKVQEEIDNVIGRHRSPCMQDRNHMPYTNA MVHEVQRYVDLGPISLVHEVTCDTKFRNYFIPKGTQVMTSLTSVLHDSTE FPNPEVFDPGHFLDDNGNFKKSDYFVPFSAGKRICVGESLARMELFLFLT TILQNFKLKPLVDPKDIDMTPKHSGFSKIPPNFQMCFIPVE* Cyp2c68 mouse GenEMBL NW_034810.1 (aa 
1-161 exons 1-3 plus strand) Exon 4 not found GenEMBL NW_012728.1 (aa 215-273 exon 5 minus strand) Exon 6 not found GenEMBL NW_024952.1 (aa 321-383 exon 7, 2 copies on this contig) GenEMBL NW_012306.1 (aa 356-431 part of exon 7 and exon 8) Exon 9 not found Hong Wang, Joyce Goldstein, Darryl Zeldin Between Cyp2c67 and Cyp2c40 on chr 19 Temp name 2CN8 96% to Cyp2c40 1 MDPFVVLVLC LSFLLLLSLW RQRSARGNLP PGPTPLPIIG NYHLIDMKDI 51 GQCLTNFSKI YGPVFTLYFG SQPIVILHGY EAMKEAFIDY GEEFSGRGRI 101 PVFDKVSKGK GIGFSHGNVW KATRVFTVNT LRNLGMGKRT IETKVQEEAQ 151 WLMKELKKTN GSPCDPQFII GCAPCNVICS IVFQNRFDYK DKDFLSLIGK 201 VNECTEILSS PECQIFNAVP ILIDYCPGSH NKFLKNHTWI KSYLLEKIKE 251 HEESLDVTNP RDFVDYFLIQ RRQKNGIEHM DYTIEHLATL VTDLVFGGTE 301 TLSSTMRFAL LLLMKHTHIT AKVQEEIDNV IGRHRSPCMQ DRNHMPYTNA 351 MVHEVQRYID LGPNGVVHEV TCDTKFRNYF IPKGTQVMTS LTSVLHDSTE 401 FPNPEVFDPG HFLDDNGNFK KSDYFVPFSA Cyp2c69 mouse GenEMBL NW_024021.1 (aa 1-56 exon 1 plus strand) GenEMBL NW_009479.1 (aa 57-160 exon 2-3 minus strand) GenEMBL NW_014461.1 (aa 161-214 exon 4 plus strand) Exon 5 not found GenEMBL NW_024085.1 (aa 276-320 exon 6 plus strand) GenEMBL NW_021729.1 (aa 321-491 exons 7-9 plus strand) Hong Wang, Joyce Goldstein, Darryl Zeldin Between Cyp2c40 and Cyp2c37 on chr 19 Temp name 2CN9 95% to Cyp2c40 1 MDPFVVLVLC LSFMLLLSLW RQRSARRNLP PGPTPLPIIG NYHLIDMKDI 51 GQCLTNFSKT YGPVFTLYFG SQPIVVLHGY EAIKEALIDH GEVFSGRGRF 101 PFFDKVSKGK GIGFSHGNVW KATRVFTVNT LRNLGMGKRT IENKVQEEAQ 151 WLMKELKKTN GSPCDPQFII GCAPCNVICS IVFQNRFDYK DKDFLSLIGK 201 VNECTEILSS PGCQIFNAVP ILIDYCPGRH NKFFKNHTWI KSYLLEKIKE 251 HEESLDVTNP RDFIDYFLIQ RRQKNGIEHM EYTIEHLATL VTDLVFGGTE 301 TLSSTMRFAL LLLMKHTHIT AKVQEEIDNV IGRHRSPCMQ DRKHMPYTNA 351 MVHEVQRYVD LGPTSLVHEV TCDTKFRNYF IPKGTQVMTS LSSVLHDSTE 401 FPNPEVFDPG HFLDDNGNFK KSDYFVPFSA GKRICVGESL ARMELFLFLT 451 TILQNFKLKP LVDPKDIDTT PKYSGFSKIP PKFQMCFIPV E* Cyp2c70 mouse AY227736 NW_000148 NP_663474 LOC226105, NT_039692 Hong Wang, Joyce Goldstein, Darryl Zeldin 50kb downstream of Cyp2c50 on chr 19 
Temp name 2CN10 59% to Cyp2c29 MALFIFLGIWLSCFLFLFLWNQHRGRGKLPPGPTPLPIVGNILQVYVKNISKSMGM LAKKYGPVFTVYLGMKPTVVLHGYKAMKEALIDQGDEFSDKTDSSLLSRTSQGL GIVFSNGETWKQTRRFSLMVLRSMGMGKKTIEDRIQEEILYMLDALRKTN GSPCDPSFLLACVPCNVISTVIFQHRFDYNDQTFQDFMENFHRKIEILASPWSQ LCSAYPILYYLPGIHNRFLKDVTQQKKFILEEINRHQKSLDLSNPQDFIDYFLIKMEK EKHNQKSEFTMDNLVVSIGDLFGAGTETTSSTVKYGLLLLLKYPEVT AKIQEEIAHVIGRHRRPTMQDRNHMPYTDAVLHEIQRYIDFVPIPSPRKTTQDVEFRGYHIPK GTSVMACLTSVLNDDKEFPNPEKFDPGHFLDEKGNFKKSDYFVAFSA GRRACIGEGLARMEMFLILTNILQHFTLKPLVKPEDIDTKPVQTGLLHVPPPFELCFIPV Cyp2c71-ps mouse GenEMBL NW_000148 Between 2c69 and 2c37 on chr 19 69% to Cyp2c69 14397 CP*SYNIFF*IIHVLSYLLEKIKENEELMDVTNP*DFIDYFLIQRHQ 14537 exon 5 32761 GTTVLTPLSSVLHDSKEFPNPEMFDPDHFLDGNGNFK*SDYFMPFSAGNR 32910 exon 8 39051 MCMGESLALMELILFLTTILQNF*LKSLVDLKDNNITPVYSGL 39179 39180 F*VPPTFLVCFISV 39221 exon 9 Cyp2c71-de1b mouse GenEMBL NW_000148 x in Figure 2D Nelson et al. Pharmacogenetics 14, 1-18 (2004) detritus exon 1 between Cyp2c71-ps and Cyp2c69 8628 MGPFVVLVLRLSFLLLLSL*RQRSGRGKLPPGLTPCSINGNFLQIDMKDTHQSLTN 8461 exon 1 (in opposite orientation to exons of 2c71-ps) Cyp2c72-ps mouse NW_000145 Hong Wang, Joyce Goldstein, Darryl Zeldin Between 2c29and 2c38 Temp name 2CN11 88% to 2c38, 87% to 2c39 1 MDLITFLVLT LSSLILLLLW RQRSGRGRLP PGPTPFPIIG NFLQIDGKNF 51 SQSLTNFSKA YGPMFTLYLG SQPIAVLHGY EAVKEALIDH GEEFSGRRNI 101 PMAEKINNSL GVIFSNGNRW KEIRHFTLTI LRNLGMGKRN IEDRVQEEAQ 151 CLVEELRKTN Cyp2c73-ps mouse GenEMBL NW_000100.1 Mm14_WIFeb01_281 A chr 14 2C seq 55% to 2C29 27513950 GMGNRTIEDHI*EEACSLVDELRKTNGVRCNSTFILGC 27514063 27514066 PCNVICFIFFFQNRFDYKYQGILNENVEIVSSPWIQICNNFPAIIDHLPERHRKFLEDFAFDK ILVKVIQHQESLNINNPQEFINSFLIEMKQEEYNPKIEFAYENLILTASDMFAAGTETS TTLR*SLLLLFKDP*VTAKVQEETDHVIVRHRSPCIQDKNLMPYTNALLHEIQRYLDLLP T*LYHGKTCCMKFKNCLIYKGIIVIESSTYVLHDDNEFSNPERFDPSHF CYP2C74X Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. 
Submitted to nomenclature committee 4/22/2004 91% to CYP2C8, 78% to CYP2C19, probable ortholog of CYP2C8. Renamed CYP2C20: there are only 3 amino acid differences to Macaca fascicularis (cynomolgus monkey) GenEMBL S53046. Since this is the clear ortholog of that earlier sequence, the name has been changed to reflect the orthology.
CYP2C75 Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 93% to CYP2C9, 92% to CYP2C19, possible ortholog of CYP2C9 94% to 2C43
CYP2C75 Macaca fascicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP2C9v3 2 amino acid differences to 2C75 of Macaca mulatta 93% to 2C9 human, 93% to 2C43, 76% to 2C20 Macaca fascicularis
CYP2C76 Macaca fascicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name Novel_mfCYP2C 72% to 2C18 human, 71% to 2C43, 69% to 2C20 Macaca fascicularis, 71% to 2C75 Macaca mulatta, 69% to 2C74 Macaca mulatta
CYP2C76 Callithrix jacchus (white-tufted-ear marmoset) No accession number Yasuhiro Uno Submitted to nomenclature committee 9/29/2005 83 aa 100% to CYP2C76 Macaca fascicularis covers I-helix region
CYP2C76 Cercopithecus aethiops (African green monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 9/29/2005 N-term 168 aa 100% to CYP2C76 Macaca fascicularis
CYP2C76 Macaca mulatta (rhesus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 9/29/2005 98% to CYP2C76 Macaca fascicularis complete sequence
CYP2C77 rat variant of 2C6 13 aa diffs to CYP2C6v1_v1, 16 aa diffs to 2C6v2 This gene has three frameshifts
244357850 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 244358017
244359760 FSKVYGPVFTLYFGMKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDL (1) 244359921
244360096 GIIFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEE 244360230
244360232 MRKTN 244360246
244361085
GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 244361246 244362321 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYL 244362410 244362412 LKKIKEHQESLDVTNPQDFIDYYLIKWKQ 244362498 244381928 ESHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKYPEIT 244382068 244392235 AKVQEEIDRVFGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPM 244392423 244394012 GTTIITSLSSVLHDSKEFPNPEIFDPGHFLDGNGKFKKSDYFMPFSA 244394152 244395307 GKRMFAGEGLA 244395339 244395341 RMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 244395487 rat 2C cluster in chromosome order CYP2C77-de1b2b3b4b5b rat frag c Pseudogene 96% to 2C6_v1 exons 1-5 with partial deletion of exon 3 Plus Strand 244337898 MDLVMLLVLTLSCLILLSIWSQSSGRGKLP 244337987 244337987 SGPTPLPIIGNFFHLDLKNITQSLTS 244338064 244339793 FSKVNGSVFTLYFGMKPIVILHGYEAIK*GLIDHREEFTERGSFPVAEKINKGL 244339954 244340129 GIAFSHGNRWKEIRRFTLMTLQNLGMGK 244340212 244341157 GSPCDPTFILGCAPCNVICSIIFQNSFDYKDQDFLSLMEKLNENIKIVSSPWI* 244341318 244342872 FCSSFPVFIDYCPGIHMTLA 244342931 244342933 KNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLIKWKQ 244343049 rat 2C cluster in chromosome order CYP2C78 Balaenoptera acutorostrata (Minke whale) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 58-60% to all four CYP2Cs in human CYP2C79 rat GenEMBL XM_219933 minus strand 72% to 2C6_v1 95% to seq e, 100% to seq q (exon 9), 93% to seq z (exon 5) (temp name = CYP2CNEWD) 244590183 MILGVFLGLFLTCLLLLSLWKQNFQRRNLPPGPTPLPIIGNILQIDLKDISKSLRN 244590016 244575990 FSKVYGPVFTLYFGRKPAVVLHGYEAVKEALIDHGEEFAGRGIFPVAEKFNKNC 244575829 244575612 GVVFSSGRTWKEMRRFSLMTLRNFGMGKRSIEDRVQEEARCLVDELRKTN 244575463 244553851 GVPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLALIDILNENVEILSSPWIQ 244553690 244525726 ICNNFPAIIDYLPGRHRKLLKNFAFAKHYFLAKVIQHQESLDINNPRDFIDCFLIKMEQ 244525550 244524359 EKHNPKTEFTCENLIFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT 244524219 244517844 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQRYIDLLPTSLPHALTCDMKFRDYFIPK 244517656 244516177 GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDENGKVKKSDYFFPFST 244516037 244496745 
GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI 244496566 rat 2C cluster in chromosome order CYP2C79-de9b rat exon 9 62% to 2C79 2 aa diffs to seq d and seq p 244491372 G*WICVREDLAQMTLFLFCPTILKNFNLNSQVNPKEL 244491262 rat 2C cluster in chromosome order CYP2C79-se1[9] rat frag q Exon 9 100% to 2C79 243885148 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI* 243885330 CYP2C80 rat GenEMBL XM_217906.2 GNOMON exon 2 on AC109577.4 in HTGS 92% to 2C24, 73% to 2C11 (temp name = CYP2CNEWC) MGWLSDP wrong N-term from GNOMON prediction Correct N-term possibly in a sequence gap 244632544 FSEVYGPVFTLYFGLKPTVVVYGYEVVKEVLDGEEFSGRGVFPIVTKVNNDL 244632389 this exon 2 does not match 2C24 244632205 GVIFSNGTKWKELRRFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 244632056 244628281 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDENFLNLMEKFNENFKILNSPWMQ 244628120 244624041 VCNAIPAFIDYLPGSHNKVIKNFAEIKSYILRRVKEHQETLDMDNPRDFIDCFLIKIE 244623868 244620080 QEKHNPCTEFTIQSLVATVTDVFVAGSETTSTTLRYGLLLLLKHTEVT 244619937 244619006 AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK 244618818 244616897 GTDLITSLTSVLHDDKEFPNPEVFDPGHFLDEHGNFKRSDYFMPFSS 244616757 244614348 GKRMCVGEALARMELFLLLTTIVQNFNLKSFVATKDIDTTPLTNTFGCVPPSYQLYFTPR* 244614166 rat 2C cluster in chromosome order CYP2C81 rat 93% to 2C7 28 aa diffs missing exon 1 Plus Strand, 91% to seq j (exons 6,7) 93% to seq k (exons 2,3) 244672079 FSKTYGPVFTLYLGSQPTVILHGYEAIKKALIDHGEKFSGRGSYPMIENVTKGF 244672240 244672408 GIAFSNGNRWKEIRRFTIMTLRNLGMGKRNIEDRVQEEAQRLVEELRKTK 244672557 244681144 GSPCDPSFILNCAPCNVICSITFQNHFDYKDKEILTFMEKVNENVKIMSSPRMQ 244681305 244683123 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVEYYLIKQKQ 244683299 244699290 ANHIEQSEYSHENLACSIMDLIGAGTETMSSTLRYALLLLMKYPHVP 244699430 244713313 AKVQEEIDHVIGRYRSPCMQDRSHMPYTDAMIHEVQRFINFVPTNLLHAVTCDIKFRNYLIPK 244713501 244717457 GTKVLTSLTSVLHGSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 244717597 244718606 GKRACVGEGLARMELFLFLTTILQNFKLKSLVHPKDIDTRPVLNGFASLPPTYQFCFIPS 244718785 rat 2C cluster in 
chromosome order CYP2C81-de7b rat frag a Exon 7 minus Strand 100% to seq r, 80% to 2C13 244724629 LRVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 244724441 CYP2C81-de8b rat frag 1 Exon 8 93% to 2C7 Plus Strand 244737232 GTTVLTSLTSVLHDSKEFPNPEMFDPGHFLDENRNFKKSDYFMPFSA 244737372 CYP2C81-de8c rat frag 2 Exon 8 76% to 2C13 Plus Strand 87% to seq u 244764239 GMMVITSLSSVLHYNKEFPNPERFDPGYFLDGNGNFKKTDYFILFSA 244764379 CYP2C81-de1d rat frag 3 Exon 1 with frameshift Plus Strand 85% to seq e 83% TO SEQ w 244783632 MDLVVVL 244783652 244783654 CSVSSLLLFSLWRQSSWRRKLPPGPNPLPIIGNFLQIDLNNLCQSLNN (0) 244783797 CYP2C81-de6e7e rat frag 4 exon 6 70% to 2C13 Plus Strand 244799349 ELELEHLGSMVTDLFFAGIESIRTTMIFALLFLLNTHTSQ 244799468 exon 7 82% to 2C13, 86% to seq r and seq a 244801583 LQNRSHMPYTNAMVHEVQRYSDIVPNNIVHEVTSDTKFRNYFIPK (0) 244801717 CYP2C81-de1f2f3f rat frag 5 Exons 1,2,3 84% to 2C7 variant Minus Strand 244826982 MDLVTFLVLTLSSLILLSLWR*NSRRRKLPPGPTPLLIIGNFLQLDVKNVSQSLTM (0) 244826815 244813456 FSKAYGPVFTLYLGSQPTVILHGYEAVKETLIDHGEEFSGRGSFPMVEKAFKCF 244813295 244813129 GIVFSNGNR*KEIRQFIIMTLQNLGMGKRNIEDHVQEEAQCLVEELRKTK 244812980 CYP2C82P rat frag e Exons 1,4,4,5,6,7,8,9 almost an exact duplicate of seqs w,x,y,z, exons 6-9 of the wxyz cluster in a seq gap 244218695 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLINE (0) 244218865 244233879 LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 244234019 244240189 GVPCDPTFILGCAPCNVICSIVFQNHFNYKGQEFLALIDTLNENVEILSSPWIQ 244240350 244265531 ICNNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ 244265707 244266904 KHNPKTEFTCKNLIFTASDLFAAGTETTSPTLRYSLLLLPKYPEV 244267038 244273480 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQ*YIDLLPTSLPHALTCDMKFRDYFIPK 244273668 244275197 GTTVIASLTSVLYDDKEFPNPEKFDLSHFLDENGKFKKSDYFFPFST 244275337 244286429 GKRICVGEGLAQTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIP 244286605 >CYP2C82P-de9b frag d Exon 9 identical to seq p 244289962 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 244290072 rat 2C cluster 
in chromosome order >CYP2C82P-se[1:4:4:5] rat frag z Exon 5 minus strand 1 aa diff to CYP2C82P 243632036 ICDNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ (0) 243631860 frag y Exon 4 minus strand 92% to CYP2C82P 243654367 GVPCDPTFILGCAPCNVICSIVFQNHFNYKDQEFLALIE 243654251 243654249 LNENVEILSSP*IQ 243654208 frag x exon 4 minus strand 100% to CYP2C82P short exon 4 243659542 LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 243659402 frag w Exon 1 minus strand 100% to CYP2C82P 243675609 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLIN 243675442 rat 2C cluster in chromosome order CYP2C83 Cercopithecus aethiops (African green monkey) No accession number Catherine Booth-Genthe Merck Research laboratories 92% to human CYP2C9, 90% to human CYP2C19 cannot tell if this is the ortholog of 2C9 or 2C19 without map information 98% to 2C43 probable ortholog, name may be changed to 2C43 CYP2C84 Phalacrocorax carbo (Common cormorant) No accession number Hisato Iwata submitted to nomenclature committee 5/19/05 81% to 2C45 chicken (possible ortholog), 56% to 2C11 rat CYP2C85 Bos taurus (cow) See cattle page for details MDLPVVLVLCLCCLLLISLWKQSSGKGKLPPGPTPLPILGNILQLDVKDISKSVSN LSKVYGPVFTLYFGMNPLVVLHGYEAVKEALIGLGEEFSGRGSCPVIQRASKGY GVIFSNGKIWKETRRFSLMTLRDFGMGKRSMEDRVQQEACCLVEELRKTD GLPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPEVT AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV* CYP2C86 Bos taurus (cow) See cattle page for details MERLEITTLALVICVTCLVFLFVWKKSHKGLGKLPPGPTPLPIIGNLMQLNLKDIPASLSK LAKQYGPVYTLHLGSQTTVVLHGYEVVKEALIDQGDEFLGRAHFPIIDDTQRGY GLIFSNGDTWKQMRRFSSLMTLRDFGMGKRSLEERIQEEAQFLVEEFRKSE AQPFNPAVTLSCATCNIICSILFNERFHYQDKTLHSLLDLLNENFNRISSLWNQ IYNLWPKLIKPLPGEHRAFSKRLKDVHYFVLEKVKEHQKSLNHNNPRDYIDCFLSRMEQ 
EKQNPESQFHLENLATCGSNLFSAGVETTTATLSYGFLLLMKYPEVQ AKVHEEIDRVIGRTRSPCMKDKMKLPYTEAVLHEIQRYVTLVPSNLPHAVVQDTKFRQYVIPK GTTVLPLLSSILYDCKEFPNPEKFDPGHFLDKNGSFRKTKYFVAFSI GKRACVGEGLAQMELFLFFTTILQNFVLKPLGETKDIETKPIVIGLINMPPPFKLCLIPR* CYP2C87 Bos taurus (cow) See cattle page for details MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNIFQLDVKNISKSLTS LSKVYGPVFTVYFGMKPTVVLHGYEAVKEALIDLGEEFSRRGSFPVIERNVKGH GIVFSNGKTWKETRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTN GLPCDPTFILGCAPCNVICSIIFQNRFDYKDQTFLNLMKTINENIKILGSPWIQ VLNIFPVLLDFFPWSYSYKKLYTNTAYVKNYVLEKTREHQASLDINNPRDFIDCFLIKMEQ EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK GTDILTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFRKSDYFMAFSAGKRVCVGEGLA RMELFLFLTTILQTFTLKSVVDPKDLDTTPAVTGIANVPPPYQLCFIPV* CYP2C87-de2b Bos taurus (cow) 6kb downstream of 2C87 without an intervening exon 1, same orientation LSKVCGPVFTVYFGMKPTVVLHGYEALQEALIDLGEEFSGRYSFPVNEKTRRGH CYP2C88 Bos taurus (cow) See cattle page for details MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNILQLDVKNISKSLTN LSKVYGPVFTVYFGMKPIVVLHGYEAVKEALIDLGEEFSGRGMFPLAERANIVN GILFSNGKTWKEIRRFSLMTLRNFGMGKRSIEDRVQEEACCLVEELRKTN GLPCDPTFILGCAPCNVICSIIFQNRFDYKDPVFLDLMERLNEILRILSSPWVQ VCNNFPALFDYLPGSHNKVLKNVANLKSFVLEKAMEHKASLDINNPRDYIDCFLIRMEQ EKQNQQLEFTLENLTTTVFDLFGAGTETMSTTLRYGLLLLLKHPEVT AKVQEEIDRVIGRHRSPCMQDRSHMPYTDAVVHEIQRYIDLVPSSLPHMVTHDIELRNYIIPK GTGVLVSLTSVLYDDKVFPNPEMFDPGHFLDDSGNFKKSDHFMPFSA GKRICAGESLARMEVFLFLTVILQKFTLKSVVDPKDIDTTPIANGFASVPPPYKLCFIPL CYP2C89 Bos taurus (cow) See cattle page for details XXXXXGPVFTLYFGMKPTVVLHGYEAVKQVLIDQSEEFSGRGSLPVADNINQGL GIVFSNGEIWKQTRRFSLMVLRNMGMGKRTIEHRIQEEALCLVEALKKTN GSPCDPTLLLSCAPCNVICSIIFRNRFEYNDERLLTLIKYFNENSRLVSTPWVE LYNTFPSLLHYFPGSHNTIFKNMTEQRKFILEEIKKHQESLDLNNPQDFIDYFLIKMEK EKHNKHSEFTMDNLITTVWDVFSAGTETTSLTLRYGLLLLLKHPEVT AKVQEEIDRVVGRNRSPCMQDKSCMPYTDAVLHEIQRYIDLVPSSMPHAATQDVKFREYLIPK GTVILTSLTSVLHDDNEFSNPGQFDPGHFLDESGNFKKTDHFMAFSA GKRVCVGEGLARMELFLLLVSILQHFTLKSVVDPKHIDTAPSFKGLISIPPFCEMCFIPV* 1292 CYP2C90 Bos 
taurus (cow) See cattle page for details LSNTYGPVFTVYFGLRPTVVLHGYEAVKEALIDQGEEFSGRGNIPMSQRVNKGY GIIFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEAHCLVEELRKTN GSPCDPTFILGCAPCNVICSIIFQNRFDYTDQNFLNLLDKFNENLQVVSSPWMQ VCNTFPILIDYFPGSHNKLFKNFAYIRSYVLEKVKEHQATLDINNPRDFIDCFLIKMEQ EKHNQEMEFTFENLIASVSDLFGAGTETTSTTLRYGLLMLLKHPEVT AKVQEEIDRVIGRHRSPCMQDRSHMPYMDAVVHEIQRYIDLVPTNLPHAVTRDIKFRNYLIPK GTTVVTSLSSVLHDEKEFPNPKVFDPAHFLDESGNFKKSDYFMAFSA GKRSCVGEGLARMELFLFLTTILQKFTLKSVVDPKDLDTTPVSSGFGHVPPPYQLCFTPL* CYP2C91 Sus scrofa (miniature pig) no accession number Haitao Shang Submitted to nomenclature committee May 23, 2007 Partial seq. differs from known pig sequences 66% to 2C36 frameshift and small deletion pseudogene? CYP2C92 horse No accession number Heather Knych Submitted to nomenclature committee June 25, 2007 83% to CYP2C87 cow, 81% to CYP2C49 pig CYP2C-se1[7] human NT_022154.9|Hs2_22310 2C pseudogene fragment chr 2 old CYP2C56P Chr2q24.3 165142570-165142755 + strand Build 33 1768955 SKVQEETDHAVGRHWRPCMQDRSHMPYTEAMVHEVQRH*PHPTNVPHALTSDIKFRNYLLPK 1769140 CYP2C-se2[1:2] human NT_008583.11|Hs10_8740 Chr10q21.3 66415290-66415135 - strand, Build 33 in MER1_type repeat chromosome 10 pseudogene frag parts of exons 1 and 2 old name = CYP2C61P 1832658 KGKLPHDLTSFLFVGNILQLNSKNLSKSITMLAKDYGPGFTVYFGIKPTVVV 1832813 CYP2C-se3[1] human NT_011512.5|Hs21_11669 chromosome 21 51% to 2C9 chr21q21.2 25740563-25740423 build 33 - strand bracketed by L1 repeats old name = CYP2C63P 12398358 CPSCLILLFLWNGSYAKGKLLPGPIPLPIV*NILPLRSMNTSKSISMVS 12398212 CYP2C-se4[1] human NT_011602.7|HsX_11759 2C pseudogene fragment chr X 57% to 2C8 ChrXq28 147659303-147659476 + strand Build 33 inside MTMR1 intron 3 (myotubularin-related protein 1) old name = CYP2C64P 435396 ASVDLAAVLVLFLSHFLFLSLWKQSSEREKLLPGPTPIRIIGNILELDLKDICKSLSDVN 435575 435576 MLYAPL 435593 Cyp2c-se5[9] mouse GenEMBL NW_000107.1|Mm16_WIFeb01_286 2c exon 9 fragment on chr 16 42687727 PFSTGKLICVGEGLARAELLLLLTTILQNFNLKSPVDLKDLDTIPVANG 42687873 CYP2C-se6[9] rat 
frag p exon 9 100% to CYP2C82P-de9b 243895387 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 243895497
CYP2C rat no accession number (639bp) Zaphiropoulos,P. submitted to nomenclature committee 82% amino acid identity to exon 2 of 2C24
CYP2C rat no accession number (397bp) Zaphiropoulos,P. submitted to nomenclature committee similar to exon 3 of 2C7 possible pseudogene, with stop codon at location of conserved trp.
CYP2C rat PIR B60822 (19 amino acids) Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and Oesch, F. Effect of nutritional imbalances on cytochrome P-450 isozymes in rat liver. Biochem. Pharmacol. 37, 3245-3249 (1988)
CYP2C dog PIR A60465 (33 amino acids) Komori, M., Shimada, H., Miura, T. and Kamataki, T. Interspecies homology of liver microsomal cytochrome P-450. A form of dog cytochrome P-450 (P-450-D1) crossreactive with antibodies to rat P-450-male. Biochem. Pharmacol. 38, 235-240 (1989) Note: probable N-terminal of 2C21 which is missing the N-terminal region
CYP2C horse PIR PN0659 (16 amino acids) Komori, M., Higami, A., Imai, Y., Imaoka, S. and Funae, Y. Purification and characterization of a form of P450 from horse liver microsomes. J. Biochem. 114, 445-448 (1993)
2D Subfamily
CYP2D1 rat PIR A30495 (19 amino acids) Gonzalez, F.J., Matsunaga, T., Nagata, K., Meyer, U.A., Nebert, D.W., Pastewka, J., Kozak, C.A., Gillette, J., Gelboin, H.V. and Hardwick, J.P. Debrisoquine 4-hydroxylase: characterization of a new P450 gene subfamily, regulation, chromosomal mapping, and molecular analysis of the DA rat polymorphism. DNA 6, 149-161 (1987)
CYP2D1 rat PIR S39761 (13 amino acids) Ohishi, N., Imaoka, S., Suzuki, T. and Funae, Y. Characterization of two P-450 isozymes placed in the rat CYP2D subfamily. Biochim. Biophys.
Acta 1158, 227-236 (1993) CYP2D1 rat GenEMBL J02867 chr7: 120808284-120803991 (- strand) MELLNGTGLWSMAIFTVIFILLVDLMHRRHRWTSRYPPGPVPWPVLGNLLQVDLSNMPYS LYKLQHRYGDVFSLQKGWKPMVIVNRLKAVQEVLVTHGEDTA DRPPVPIFKCLGVKPRSQGVILASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA GHLCDAFTAQAGQSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMVKLVEESLTE VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMALLDNLLAENRTTWDPAQPPRNLTD AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRFTSCDIEVQDFVI PKGTTLIINLSSVLKDETVWEKPHRFHPEHFLDAQGNFVKHEAFMPFSA GRRACLGEPLARMELFLFFTCLLQRFSFSVPVGQPRPSTHGFFAFPVAPLPYQLCAVVREQGL CYP2D2 rat GenEMBL X52027 X52455 chr7: 120834409-120830514 (- strand) MGLLIGDDLWAVVIFTAIFLLLVDLVHRHKFWTAHYPPGPVPLPGLGNLLQVDFENMPYS LYKLRSRYGDVFSLQIAWKPVVVINGLKAVRELLVTYGEDTA DRPLLPIYNHLGYGNKSKGVVLAPYGPEWREQRRFSVSTLRDFGVGKKSLEQWVTEEA GHLCDTFAKEAEHPFNPSILLSKAVSNVIASLVYARRFEYEDPFFNRMLKTLKESFGE DTGFMAEVLNAIPILLQIPGLPGKVFPKLNSFIALVDKMLIEHKKSWDPAQPPRDMTD AFLAEMQKAKGNPESSFNDENLRLVVIDLFMAGMVTTSTTLSWALLLMILHPDVQRRV HEEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADIVPTNIPHMTSRDIKFQGFLI PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA GRRACLGEPLARMELFLFFTCLLQRFSFSVLAGRPRPSTHGVYALPVTPQPYQLCAVAR CYP2D3 rat GenEMBL X52028 Chr7: 120817315-120813086 (- strand) MELLAGTGLWPMAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQVDLCNMPYS MYKLQNRYGDVFSLQMGWKPVVVINGLKAVQELLVTCGEDTA DRPEMPIFQHIGYGHKAKGVVLCTYGPEWREQRRFSVSTLRNFGVGKKSLEQWVTDEA SHLCDALTAEAGRPLDPYTLLNKAVCNVIASLIYARRFDYGDPDFIKVLKILKESMGE QTGLFPEVLNMFPVLLRIPGLADKVFPGQKTFLTMVDNLVTEHKKTWDPDQPPRDLTD AFLAEIEKAKGNPESSFNDANLRLVVNDLFGAGMVTTSITLTWALLLMILHPDVQCRV QQEIDEVIGQVRHPEMADQAHMPFTNAVIHEVQRFADIVPMNLPHKTSRDIEVQGFLI PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA GRRACLGEPLARMELFLFFTCLLQRFSFSVPTGQPRPSDYGVFAFLLSPSPYQLCAFKR CYP2D3-de8b rat UCSC browser Chr 7 (+ strand) 120811066-120811206 2aa diff to 2D2/2D3 exon 8 lies between 2D1 and 2D3, a in fig. 
below GTTLIPNLSSLLNDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA rat, mouse and human 2D clusters CYP2D4_v1 rat GenEMBL M22331.1 X52029 ONLY 5 AA DIFFS to CYP2D4_v2 120781146-120776576 (- strand) note: 2D18 is an alternate splice of an untranslated exon of the 2D4 gene. The 5 aa diffs are allelic variation both haplotypes are found in the same library see Supporting document MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQIDFQNMPAGFQK () LRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTADRPPLHFNDQSGFGPRSQ () GVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEARCLCAAFADHS () GFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEEESGFLPM () LLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTDAFLAEVEK () AKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILRPDVQC () RVQQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLIPK () GTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA () GRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSNYGVFGALTTPRPYQLCASPR CYP2D4_v2 rat GenEMBL U48219 S77859 ONLY 5 AA DIFFS to CYP2D4_v1 120781146-120776576 (- strand) note: 2D18 is an alternate splice of an untranslated exon of the 2D4 gene. The 5 aa diffs are allelic variation both haplotypes are found in the same library see Supporting document CYP2D5 rat GenEMBL X52030 X52458 chr7: 120799154-120794726 (- strand) MELLNGTGLWPMAIFTVIFILLVDLMHRHQRWTSRYPPGPVPWPVLGNLLQVDPSNMPYSMYK LQHRYGDVFSLQMGWKPMVIVNRLKAVQEVLVTHGEDTADRPPVPIFKCLGVKPRSQ GVVFASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEAGHLCDAFTAQN GRSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMLTLVEESLIEVSGFIPE VLNTFPALLRIPGLADKVFQGQKTFMAFLDNLLAENRTTWDPAQPPRNLTDAFLAEVEK AKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQR RVQQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRITSCDIEVQDFVIPK GTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA GRRACLGEPLARMELFLFFTCLLQHFSFSVPAGQPRPSTLGNFAISVAPLPYQLCAAVREQGH CYP2D6 human GenEMBL M24499 (1195bp) Manns,M.P., Johnson,E.F., Griffin,K.J., Tan,E.M. and Sullivan,K.F. Major antigen of liver kidney microsomal autoantibodies in idiopathic autoimmune hepatitis is cytochrome P450db1 J. Clin. Invest. 
83, 1066-1072 (1989)

CYP2D6 human GenEMBL A20907 (1768bp)
Genetic assay for cytochrome p450
Patent: WO 9110745-A 13 25-JUL-1991;

CYP2D6 human GenEMBL M33189 (5503bp)
Gonzalez,F.J. unpublished (1990)

Note on the 2D6 locus.
The normal situation is CYP2D8P, CYP2D7P, CYP2D6.
Alleles with an extra pseudogene have been found: CYP2D8P, CYP2D7AP, CYP2D7BP, CYP2D6.
Heim,M.H. and Meyer,U.A. Evolution of a highly polymorphic human gene locus for a drug metabolizing enzyme. Genomics 14, 49-58 (1992)
The 2D7AP sequence is 94.7% identical to CYP2D7P.
The 2D7BP sequence was created by gene conversion between 2D7AP and CYP2D6, and it is named CYP2D8BP below.

CYP2D7P human GenEMBL M33387
The typical human 2D7 pseudogene.
In the 1996 nomenclature this was named CYP2D7P1.

CYP2D7P1 human
Same as CYP2D7P

CYP2D7P2 human
Same as CYP2D7AP

CYP2D7AP human GenEMBL X58467 (13,278bp)
Heim,M.H. and Meyer,U.A. Evolution of a highly polymorphic human gene locus for a drug metabolizing enzyme. Genomics 14, 49-58 (1992)
Note: CYP2D7AP is 94.7% identical to CYP2D7P; both are pseudogenes.
In the 1996 nomenclature this was named CYP2D7P2.

CYP2D7BP human
This is the authors' name for CYP2D8BP below.
In the 1996 nomenclature this was named CYP2D8P2.

CYP2D8P human GenEMBL M33387
The typical human 2D8 pseudogene.
In the 1996 nomenclature this was named CYP2D8P1.

CYP2D8P1 human
Same as CYP2D8P

CYP2D8P2 human
Same as CYP2D7BP and CYP2D8BP

CYP2D8BP human GenEMBL X58468 (13,677bp)
Heim,M.H. and Meyer,U.A. Evolution of a highly polymorphic human gene locus for a drug metabolizing enzyme. Genomics 14, 49-58 (1992)
This gene is called CYP2D7BP by the authors.
Note: CYP2D8BP is a chimeric gene composed of part of CYP2D7AP and part of CYP2D6. There are only 14 base changes in 13,677 base pairs relative to these parents. This gene is different from CYP2D8P. It is a pseudogene.
In the 1996 nomenclature this was named CYP2D8P2.

Cyp2d9 mouse GenEMBL J04471 M24262 (846bp) M24267 (3367bp)
Wong,G., Itakura,T., Kawajiri,K., Skow,L.
and Negishi,M. Gene family of male-specific testosterone 16-alpha-hydroxylase (C-P-450-16-alpha) in mice: Organization, differential regulation, and chromosome location J. Biol. Chem. 264, 2920-2927 (1989) Cyp2d9-de1b2b mouse GenEMBL NT_039621.1 + strand x in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004) exons 1 and 2 8-10kb upstream of 2d9 43879793 MELLTGTDLWSVAIFTVIFILPVDLLHRRQRWTSRCPPGPVPWPVLGNLLQVDLDNMPYSLYK 79981 43880823 XXNRYGDMFSLHMAWKPMVVINGLKAMKEVLLTCGEDTADSPPVPIYEHRGXXXXXX 80969 Cyp2d9-de1c5c6c7c mouse GenEMBL NT_039621.1 + strand y in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004) exons 1,5,6,7 between 2b9 and 2b10 (uup) 43869836 MELLTGTELWPVAIITVIFILLVDLMHYHQLWTSHY 69943 43869943 PPGPVLWPVLGNLLQMDLHNMPHSMYK 70023 43872058 VLNTFPILLCIPGWADKVFPG*STFLTMVDKLVTEPKRT*DPDQPPCDLIDAFLAEMXX 72228 43872341 AKGNPSSNFNDANLRLVVFNLFGAGIVTSSITLTWVLLLMVLHPDVQ 72481 43872703 RLHQETDEVIGHVWWPERQSQX 72765 43872768 LMPYTNAVIHEVQHYTGIIPIPLPHRTSSDIEMQDFLITK 72887 Cyp2d9-de1d6d7d mouse GenEMBL NT_039621.1 - strand z in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004) exons 1,6,7 10kb upstream of Cyp2d9-de1c5c6c7c 43859756 MELLTGTSLWPVAILTVIFILLQDLMHQQKCCTSCYLPGTVLWTLQRNLLQVDLHSMPHSLCK 59568 43858655 AKGNLESSFNDANLSLVVLDQFGTGIVASSVTLTWGLLLTILNPDVQ 58515 43858292 RMQQEIDKVIEHVW*TEMVHQAYMPYTNAAIHEVQRYKDIIPIPLPHRTSSDVEMQDFLITK 58107 Cyp2d10 mouse GenEMBL J04471 M24263 M24265 M24268 (4828bp) Wong,G., Itakura,T., Kawajiri,K., Skow,L. and Negishi,M. Gene family of male-specific testosterone 16-alpha-hydroxylase (C-P-450-16-alpha) in mice: Organization, differential regulation, and chromosome location J. Biol. Chem. 264, 2920-2927 (1989) Cyp2d11 mouse GenEMBL J04471 M24264 M24266 (5661bp) Wong,G., Itakura,T., Kawajiri,K., Skow,L. and Negishi,M. Gene family of male-specific testosterone 16-alpha-hydroxylase (C-P-450-16-alpha) in mice: Organization, differential regulation, and chromosome location J. Biol. Chem. 
264, 2920-2927 (1989) Cyp2d12 mouse no accession number Negishi,M. submitted to nomenclature committee in 1990, but never published. ESTs AI116003 ue25f10.x1 (295-end 2 diffs, 1fs) AI785325 uj40c11.x1 (326-end 1 diff) AI527869 uj30b05.y1 (1-241 4 diffs, 2fs) AA986388 uc82e10.x1 (307-end 4 diffs) Public Cyp2d12 from EST sequences. Places where ESTs do not match Negishi's sequence are shown in (). The EST seq is given. In these sites Y, G, N, A and R are observed in multiple ESTs and they are probably the correct amino acids F at the last variable site is seen twice and S is seen twice so this may be a polymorphic site MELLTGTDLWSVAIFTVIFILLVDLM (Y) RRQSWTSCYPPGPVPWPVL (G) NLLQVDL (N) NMPYSL YKLQNRYGDVFSLQMAWKPMVVINRMKAMKEVLLTCGEDTADRPPVPIFEHLGFKPRSQGMIFAPYGPEWREQ RRFSLSSLRNFGLGRKSLEEWVIKEAGHLCDAFTTQAGQYINPNTMLKK (A) TCNVIASLIFARRFEYED PYLIRMLKVLEDSLTELSGLIPEVINTFPILLHIPRLAD (53 amino acid gap) ENLRMVVIDLFTAGILTTSTTLSWALLLMILHPDVQRRVQQEIDEVIGQVRHPEMADQAHMPYTNAVIHEVQRFGDIVPLHLPRITSRDIEVQDFLIPKGTILLPNMSSVHMDDTVWEKPLRFHPEHFLDAQGHFVKHEAFITFSAG (R) RSCLGEPLARMELFLFFTCLLQRFSFSVPDGQPQPSDHRVF (F) IMVAPSPYQLCAVIREQGH* Cyp2d12-de1b5b6b7b mouse GenEMBL NT_039621.1 - strand detritus exons 1,5,6,7 fragments 7kb upstream of 2d12 v in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004) 44005713 M*LLTGTGLWPVAIFTIIFILLQDLMHHLKLWTSCYPPGTVPWPL 44005579 44003512 NTLPDSPAHPRVA*QVSPGTMTFLTMMDKLVTEQKRTWDPDHPLCNLTDAFLAEMEK 44003342 44003204 AKGSPQSSFKGANLCLVVLDQFDAGIVTTSITLT*GLLLTILNPRVQ 44003064 44002849 RVQQEINKVIGHV**PEMVDQDHMSYSNAVMYEVQHYADIITIPLAHKTFSDVEVQGSLITK 44002664 Cyp2d12-de5c6c7c mouse GenEMBL NT_039621.1 - strand detritus exons 5,6,7 w in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004) 43998271 PRVA*QVSPGTMTFLTMMDKLVTEHKRTWDPGHPLCNLTDAFLAEMEK 33998128 43997989 AKGSPQSSFKGANLCLVVLDQFDAGIVTASITLTWGLLLTILHPGVQS 33997846 43997629 RVQQEINKVIGHVW*PEMVDQDRMSYSNAVMYEVQRYADIITIPLAHKTFSDVEVQGSLITK 33997444 Cyp2d13 mouse no accession number Negishi,M. 
submitted to nomenclature committee in 1990, but never published. no exact matches in the Genbank EST database as of 10/20/97 sequence may be erroneous, or a rare transcript. Cyp2d13 mouse No accession number Brian Libby partial Cyp2d13 gene sequence The top half of the sequence below is from Brian Libby This sequence matches Negishi's except at one amino acid shown in parentheses. The bottom half is from EST BF533324 Dr. Negishi's sequence called "ce" is complete, but still unpublished. (see note to Cyp2d26) Public Cyp2d13 seq from BF533324 EST and Brian Libby. One extra amino acid seen in EST BF533324 is shown as [D]. Two amino acids that do not agree are shown in (). The EST sequence is given at the T and G sites. MELLTGTGLWPVAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQVDLDNMPYSLYKL QNRYGDVFSLQMAWKPVVVISGLKAVREVLVTCGEDTADRPEMPIFQHLGYGEKAKGVVF APYGPEWRELRRFSVSTLRNLGLGKKSLEQWVTEEAGHLCDAFTAQAGSPLDPYTLLNKAVCNV IASLIYARRFEYGDPDFIKMLKILKENMGENTGLFPE (15 amino acid gap) DKVFPGQKTFLTLVNKLVTEHKRTWDP [D] QPPRDLTDAFLAEMEKAKGNPKSSFNEANLRL VVFDLFGAGIVTSSITLTWALLLMILHPDVQRRVQEEIDEVIGQVRCPEMADQAHMPYTNAVIH EVQRFADIVPMNLPHKTSHDIEVQGFLIPKGTTLIPNLSS (T) LKDETVWEKPLRFHPEHFL DAQGHFVKPEAFMPFSAGRRACLGEPL (G) RMELFLFFTCLLQRFSFLVPAGQPQPSDYGIF TFLVSPSPYQLCAFTRDQATN* Cyp2d13 mouse GenEMBL AC087902.4, EST BF533324, NT_039621.1 NT_039621.1 - strand 44100884 MELLTGTGLWPVAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQVDLDNMPYSLYK 44100696 44099867 LQNRYGDVFSLQMAWKPVVVISGLKAVREVLVTCGEDTADRPEMPIFQHLGYGEKAK 44099697 44099412 GVVFAPYGPEWRELRRFSVSTLRNLGLGKKSLEQWVTEEAGHLCDAFTAQAG 44099257 44099169 SPLDPYTLLNKAVCNVIASLIYARRFEYGDPDFIKMLKILKENMGENTGLFPE 44099017 44098352 VLNTFPILLHIPGLADKVFPGQKTFLTLVNKLVTEHKRTWDPDQPPRDLTDAFLAEMEK 44098176 44098036 AKGNPKSSFNEANLRLVVFDLFGAGIVTSSITLTWALLLMILHPDVQ 44097896 44097675 RRVQEEIDEVIGQVRCPEMADQAHMPYTNAVIHEVQRFADIVPMNLPHKTSHDI 44097514 44097515 LEVQGFLIPK 44097486 44097091 GTTLIPNLSSALKDETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSAG 44096948 44095907 
RRACLGEPLARMELFLFFTCLLQRFSFLVPAGQPQPSDYGIFTFLVSPSPYQLCAFTR* 44095731 CYP2D14 bovine GenEMBL S45538 X68013 (1538bp) Swiss Q01361 (487 amino acids) PIR S29295 S37284 (500 amino acids) PIR S29862 (500 amino acids) Tsuneoka,Y., Matsuo,Y., Higuchi,R. and Ichikawa,Y. Characterization of the cytochrome P-450IID subfamily in bovine liver. Nuceotide sequences and microheterogeneity. Eur. J. Biochem. 208, 739-746 (1992). CYP2D14 Bos taurus (cow) See cattle page for details MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPTPLPVLGNLLQVDFEDPRPSFNQ LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPPAVYEHLGYGPRAEG VILARYGDAWREQRRFSLTTLRNFGLGKKSLEQWVTEEASCLCAAFADQA GRPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIIKLLDLTEDGLKEEFNLVRKV VEAVPVLLSIPGLAARVFPAQKAFMALIDELIAEQKMTRDPTQPPRHLTDAFLDEVKE AKGNPESSFNDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSEHGVFAFLVTPAPYQLCAVPR* CYP2D15 dog GenEMBL D17397 (1665bp) Sakamoto,K., Kirita,S., (Aoyama,J., Baba,T. and Matsubara,T.) cDNA cloning and characterization of dog P-450 2D. Arch. Biochem. Biophys. 319, 372-382 (1995) check authors on paper MGLLTGDTLGPLAVAVAIFLLLVDLMHRRRRWATRYPPGPTPVP MVGNLLQMDFQEPICYFSQLQGRFGNVFSLELAWTPVVVLNGLEAVREALVHRSEDTA DRPPMPIYDHLGLGPESQGLFLARYGRAWREQRRFSLSTLRNFGLGRKSLEQWVTEEA SCLCAAFAEQAGRPFGPGALLNKAVSNVISSLTYGRRFEYDDPRLLQLLELTQQALKQ DSGFLREALNSIPVLLHIPGLASKVFSAQKAIITLTNEMIQEHRKTRDPTQPPRHLID AFVDEIEKAKGNPKTSFNEENLCMVTSDLFIAGMVSTSITLTWALLLMILHPDVQRRV QQEIDEVIGREQLPEMGDQTRMPFTVAVIHEVQRFGDIVPLGVPHMTSRDTEVQGFLI PKGTTLITNLSSVLKDEKVWKKPFRFYPEHFLDAQGHFVKHEAFMPFSAGRRVCLGEP LARMELFLFFTCLLQRFSFSVPAGQPRPSDHGVFTFLKVPAPFQLCVEPR CYP2D15 dog AB004268 Tasaki,T., Ito,S., Kamataki,T. and Fujita,S. unpublished CYP2D15 Canis familiaris (dog) NW_876251.1:6772718-6776665 Joanna Wilson and students submitted to nomenclature committee Feb. 17, 2009 the dog genome has a seq gap between exons 3 and 4 with poor quality seq there. 
The C-terminal is also missing, trust the mRNA seq for this CYP. CYP2D16 guinea pig GenEMBL U21486 (1666bp)(500 amino acids) Jiang,Q. Voigt,J.M. and Colby,H. Molecular Cloning and sequencing of a guinea pig cytochrome P4502D (CYP2D16): high level expression in adrenal microsomes. Biochem. Biophys. Res. Commun. 209, 1149-1156 (1995) CYP2D17 Macaca fasicularis ( cynomolgus monkey) GenEMBL U38218(1494bp) Laddison,K.J., Speirs,A., Mankowski,D.C., Tweedie,D. and Lawton,M. Cloning, Sequencing and expression of the cynomolgus monkey liver cytochrome P450 that is orthologous to human CYP2D6. ISSX abstracts number 367 (1995) 94% identity to human 2D6 CYP2D17 Macaca fasicularis (cynomolgus monkey) GenEMBL ESTs BB889442, BB891868, BB878205, BB889386, BB890418, BB890246, BB882021, BB881437 L388 polymorphic with F Three aa differ from U38218 (I297 = M in U38218, N337 = D in U38218, R426 = H in U38218) MELDALVPLAVTVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLG NLLHVDFKNTPYCFDQLRRRFGNVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRP PVPINQVLGFGPRSQGVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACL CAAFTDQAGRPFRPNSLLDKAVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESG FLREVLNAIPLLLRIPGLAGKVLRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFL AEMEKAKGNPESSFNEENLRI VVADLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQE IDN VIGQVRRPEMGDQARMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIELQGFL IPKG TTLFTNLSSVLKDEAVWEKPFRFHPEHFLDAQGR FVKPEAFLPFSAGRRACLGEPLAR MELFLFFTCLLQRFSFSVPAGQPRPSHHGVFAFLVTPSPYELCAVPR CYP2D17 Macaca mulatta (Rhesus monkey) GenEMBL DR774034.1 N-term EST MELDALVPLAVTVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ LRHRFGDVFSLQLAWTPVVVLNGLAAAREALVTCGEDTADRPPVPINQVLGFGPRSQGVFLAR CYP2D17 Macaca nemestrina (pig-tailed macaque) GenEMBL CO774286.1 only 3 aa diffs with 2D17 M. 
fasicularis MELDALVPLAVTVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ LRRRFGNVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGFGPRSQGVF LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQAGRPFRPNSLLDK AVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLREVLNAIPLLLRIPGLAGKV LRSQKVFLTQLDELLTEHRMTWDPXXPPRDLTEAFLGKMEKAKGNPE CYP2D18X rat GenEMBL U48219, S77859 Kawashima,H. and Strobel,H.W. cDNA cloning of a novel rat brain cytochrome P450 belonging to the CYP2D subfamily. Biochem Biophys Res. Commun. 209, 535-540 (1995) Kawashima,H., Sequeira, D.J., Nelson, D.R. and Strobel,H.W. Protein expression and catalytic activity toward imipramine N- demethylation of a novel rat brain cytochrome P450 CYP2D18. Biochem Biophys Res. Commun. submitted note: this gene was cloned and sequenced from two independent libraries. This appears [not] to be a distinct gene from CYP2D4. note: 2D18 is an alternate splice of an untranslated exon of the 2D4 gene. The 5 aa diffs are allelic variation both haplotypes are found in the same library This gene can be distinguished from CYP2D4 as alternative splice variant CYP2D4_v2 CYP2D18X rat GenEMBL U48219 S77859 ONLY 5 AA DIFFS to 2D4 Chr7: 120781146-120776576 (- strand) note: 2D18 is an alternate splice of an untranslated exon of the 2D4 gene. The 5 aa diffs are allelic variation both haplotypes are found in the same library This gene can be distinguished from CYP2D4 as alternative splice variant CYP2D4_v2 CYP2D19 Callithrix jacchus (white-tufted-ear marmoset) GenEMBL D29822 Igarashi,T., Sakuma,T., Isogai,M., Nagata,R. and Kamataki,T. Marmoset liver cytochrome P450s: study for expression and molecular cloning of their cDNAs Arch. Biochem. Biophys. 339 (1), 85-91 (1997) 91% to 2D17, 90% to 2D42 CYP2D20 hamster T. 
Sakuma
95% identical to CYP2D27

CYP2D20 Syrian hamster no accession number
Kouichi Kurose
submitted to nomenclature committee 7/13/99
clone name SH2D3
1 amino acid diff with Sakuma's sequence

CYP2D21 Sus scrofa (miniature pig) GenEMBL D89502
Sakuma,T., Shimojima,T., Miwa,K. and Kamataki,T. Cloning CYP2D21 and CYP3A22 cDNAs from liver of miniature pigs. Drug Metab. Disp. 32, 376-378 (2004)
8 amino acid differences to CYP2D25

Cyp2d22 mouse no accession number
J. Leonard and N. Blume
submitted to nomenclature committee
88% identical to rat 2D4

Cyp2d22 mouse GenEMBL AF221525 NM_019823 frameshift x2 in exon 6, NT_039621.1
NT_039621.1 - strand
43812601 MRLPTGAELWPIAIFTVIFLILVNLMHWRQRWTAHYPPGPMPWPVLGNLLHMDFQNMPAGFQK 12413
43811089 LRGRYGDLFSLQLASESVVVLNGLTALREALVKHSEDTADRPPLHFNDLLGFGPRSQ 10919
43810677 GIVLARYGPAWRQQRRFSVSTMHHFGLGKKSLEQWVTEEARCLCAAFADHTG 10522
43810448 PFSPNTLLDKAVCNVIASLLYACRFEYDDPRFIRLLGLLKETLKE 10314
43809907 FLNVFPMLLRIPGLVGKVFPGKRAFVTMLDELLAEHKTTWDPTQPPRDLTDAFLAEVEK 9731
43809546 AKGNPESSFNDE 9511
43809509 NLRTVVGDLFSAGM 9468
43809466 VTTSTTLSWALMLMILHPDVQ 9404
43809193 RVQQEIDEVIGQVQCPEMADQARMPYTNAVIHEVQRFADILPLGVPHKTSRDIELQGFLIPK 9008
43808581 GTTLITNLSSALKDETVWEKPLCFHPEHFLDAQGHFVKPEAFMPFSA 8441
43808344 GRRSCLGEPLARMELFLFFTCLLQRFSISVPDGQPQPSDHGVFRALTTPCPYQLCALPR 8168

CYP2D23 rabbit no accession number
Yukio Yamamoto
submitted to nomenclature committee
Clone name rabbit 2D/Clone I

CYP2D24 rabbit no accession number
Yukio Yamamoto
submitted to nomenclature committee
Clone name rabbit 2D/Clone II

CYP2D25 Sus scrofa (pig) GenEMBL Y16417, NM_214394
Postlind, H., Axen, E., Bergman, T. and Wikvall, K. (1997) Cloning, structure and expression of a cDNA encoding vitamin D3 25-hydroxylase. Biochem. Biophys. Res. Commun. 241, 491-497.
note: this is a microsomal enzyme, different from the mitochondrial CYP27, which also has vitamin D3 25-hydroxylase activity.
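Note on reading the exon coordinates: several mouse entries in this list (Cyp2d22 above, for example) give each exon as a full start coordinate, the translated peptide, and an end coordinate shortened to its last few digits. The sketch below shows one way to rebuild the full end coordinate and check that the span matches the peptide length. The abbreviation rule is inferred from the listing and is not documented anywhere in it, and the function names are made up for illustration.

```python
def expand_end(start: int, short_end: str, strand: str) -> int:
    """Rebuild a full end coordinate from its abbreviated trailing digits.

    Assumption: the end shares its leading digits with the start, and lies
    below the start on the - strand and above it on the + strand.
    """
    scale = 10 ** len(short_end)
    candidate = (start // scale) * scale + int(short_end)
    # Correct for a carry/borrow across the abbreviated digits.
    if strand == "-" and candidate > start:
        candidate -= scale
    elif strand == "+" and candidate < start:
        candidate += scale
    return candidate


def segment_length(start: int, end: int) -> int:
    """Number of nucleotides covered, counting both endpoints."""
    return abs(start - end) + 1


def check_segment(start: int, short_end: str, strand: str, peptide: str) -> bool:
    """True when the genomic span is exactly three nucleotides per residue."""
    end = expand_end(start, short_end, strand)
    return segment_length(start, end) == 3 * len(peptide)


if __name__ == "__main__":
    # Exon 1 of Cyp2d22 as listed: 43812601 ... 12413 on the - strand.
    exon1 = "MRLPTGAELWPIAIFTVIFLILVNLMHWRQRWTAHYPPGPMPWPVLGNLLHMDFQNMPAGFQK"
    end = expand_end(43812601, "12413", "-")
    print(end, segment_length(43812601, end), check_segment(43812601, "12413", "-", exon1))
```

A failed check does not necessarily mean the entry is wrong: pseudogene and detritus-exon entries contain frameshifts, X placeholders, and partial codons at exon boundaries, so the three-nucleotides-per-residue rule is only expected to hold for clean exons.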
Cyp2d26 mouse GenEMBL NT_039621.1 - strand 68 ESTs see UNIGENE Mm.29064 MGLLVGDDLWAVVIFTAIFLLLVDLVHRRQRWTACYPPGPVPFPGLGNLLQVDFENIPYS FYKLQNRYGNVFSLQMAWKPVVVVNGLKAVRELLVTYGEDTSDRPLMPIYNHIGYGHKSK GVILAPYGPEWREQRRFSVSTLRDFGLGKKSLEQWVTEEAGHLCDAFTKEAEHPFNPSPL LSKAVSNVIASLIYARRFEYEDPFFNRMLKTLKESLGEDTGFVGEVLNAIPMLLHIPGLP DKAFPKLNSFIALVNKMLIEHDLTWDPAQPPRDLTDAFLAEVEKAKGNPESSFNDKNLRI VVIDLFMAGMVTTSTTLSWALLLMILHPDVQRRVHQEIDEVIGHVRHPEMADQARMPYTN AVIHEVQRFADIVPTNLPHMTSRDIKFQDFFIPKGTTLIPNLSSVLKDETVWEKPLRFYP EHFLDAQGHFVKHEAFMPFSAGRRSCLGEPLARMELFLFFTCLLQRFSFSVPDGQPRPSD YGIYTMPVTPEPYQLCAVAR Note: Brian Libby (bjl@jax.org) at The Jackson Laboratory has given his permission to post sequence data he has on the 2d26 gene and a partial Cyp2d13 gene from mouse. He will make the BAC clone available to anyone who wants it. The BAC has at least two and maybe more P450 sequences. I am putting a link to a pdf version of the 2D26 gene sequence file here. It is color coded with additional information, such as sequencing primers and restriction sites. CYP2D26 gene sequence Cyp2d26-de1b7b8b mouse GenEMBL NT_039621.1 - strand 10kb upstream of 2d26, exon 1 aa 1-19, 36-57, exon 7,8 on the edge of the mouse 2d cluster s in Figure 5B Nelson et al. 
Pharmacogenetics 14, 1-18 (2004) NT_039621.1 - strand 44262890 MGLQTGLWPMVISTALFCM 44262834 44262801 YPPSPVPLPELGSLLQVKFENM 44262736 44260947 GHVQKETDGIMGQVWLPQMSHQACMSFT 44260864 44260862 NAMIREV*HFRDTILVNLSHVTFCEIEI*GFXXXX 44260770 44260251 XXXXLITNLSLVLKNEITWEMPSPTPS*TFLESEGHLMKQETFMPXXX 44260129 CYP2D27 syrian hamster no accession number Kouichi Kurose 95% identical to CYP2D20 submitted to nomenclature committee 6/29/99 CYP2D28 syrian hamster no accession number Kouichi Kurose 71% identical to CYP2D27 73% to CYP2D20 clone name SH2D2 submitted to nomenclature committee 7/13/99 CYP2D29 Macaca fuscata (Japanese monkey) GenEMBL AF301911 (release date March 1, 2001) Shizuo Narimatsu, Hiroyuki Hichiya, Shigeo Yamamoto, Kazuo Asaoka Submitted to nomenclature committee Oct. 16, 2000 95% to CYP2D6 CYP2D30 Callithrix jacchus (white-tufted-ear marmoset) GenEMBL AY082602 Hichiya,H., Yamamoto,S., Asaoka,K. and Narimatsu,S. Complementary DNA cloning and characterization of a cytochrome P450 2D enzyme from Marmoset monkey liver Unpublished submitted to nomenclature committee 3/5/02 33 diffrerences to 2D19 also from marmoset. 93% to 2D19, 91% to 2D29, 90% to 2D17 CYP2D31P human NT_022676.10|Hs3_22832 chromosome 3 2D6 pseudogene fragment I-helix 899650 NQENLV*VVIDLFLGGTDTTATTLCWALIHMIQHGAVQ 899537 Cyp2d32-ps mouse GenEMBL XM_194978, NT_039621.1 exons 4,5,6,7,8,9 NT_039621.1 + strand (vvp = old temp. 
name) 43898939 AMSPHNPNHLLDKAICNVIASLIYACRFKYGDPDIIK 33899049 ILKVLKESM*KKIVFIPD 43899746 VLNIFPIVLSISGLGDKVLPGKKVSLAIVDKMLTDXXX 33899850 43899865 TWDPD*SHCDLTDAFLAEMEQ 33899927 43900101 LHLLILHLLGAGIVMSSVTLTWTLLLMI*NPDVQ 33900202 43900439 XXXXEIDKVIGQVWHPEMADQVLMPFTNAVIHEVKCSEDITAMALPHRNSLHSNVQGFLIPK 33900612 43901007 GKSLITNLSSELKDEAIWEKPLCFHPEYFLDAKGHFV*HEPFMAFSE 33901147 43901248 GHQACLREPLACMELFLFFTFLLQRFSFSMSDGQPLPSEYSIYAMPVTPEPCQFCAVVQYQG 33901433 Cyp2d33-ps mouse GenEMBL NT_039621.1 exons 4,5,6,7,8,9 NT_039621.1 + strand 3kb downstream of 2d12 44019279 XXXNPYHLLDKAVCNVIPSLIYACCFNYGDPDNRMLKLLKKKSMKKKIGFISD 44019428 44020071 VLNTFPTLLGISGLAEKVFSGQKTSFTIVNKMFTEH 44020178 44020190 DPDQPPRDLTDAFLAEMEK 44020246 44020381 AKGNSERSFREPNLYLIILDLLGPGIVTSLVTLTWSLLLVIQQPDVQ 44020521 44020745 XXXXEIDKVIG*VWHPEMAD*ILMPFTNVVIHEVKRFEDITAMVLPQRTSPDIDVHGF 44020906 44022181 XXXLIPDLSSMLKDETVWEKPLHFHPKNFLDAQGHFL*FEAFMPFSEG 44022315 44022418 QACLGQPLDQIVLFLFITCLLQCFSFSLPKGQPPPSD*GIYAMPVTPAPSQLCAVVVR*EEQWH 44022609 Cyp2d34 mouse GenEMBL NT_039621.1 85% to 2d10 87% to 2dww/2d11 NT_039621.1 - strand old temp. name = tt 44079756 MELLTGTGLWSVAIFTVIFLILVDLMHRRQHWTSRYPPGPVPWPVLGNLLQVDLDNIPYSLYK 44079568 44077878 LQNRYGDVFSLQMAWKPVVVINGLKAMQEVLLTCGKDTADHPPVPIFEYLGFKSKSQ 44077708 44077439 GVVLASYGPEWREQRQFSVSTLRNFGLGKKSLEEWVTKEAKHLCDAFTARAG 44077284 44077192 QSINPNTMLNNAVCNVIASLIFARRFEYEDPFLIRMLKMREESLKEVTGFIPG 44077037 44076407 VLNTFPILLRIPGLADMVFQSQKTFMAILDNLVTENRTTWDPDQPPRNLADAFLAEIQK 44076231 44076048 AKGNPESSFNDENLCMVVSDLFTAGMVTTSTTLSCALLLMILHPDVQ 44075908 44075711 RRVQQEIDAVIGQVRCPEMADQARMPYTNAVIHEVQRFGDIIPLNIPRITSRDIEVQDFLIPK 44075523 44075229 GTILIPNMSSMLKDETVWEKPLRFYPEHFLDAQGHFVKPEAFMPFSAG 44075086 44074985 RRSCLGEPLARMELFLFFTCLLQRFSFSVPAGQPQPSDHRIFAIPVAPYPYQVCAIMREQGH* 44074797 Cyp2d34-de1b2b7b8b mouse GenEMBl NT_039621.1 detritus exons 1,2,7,8 about 4 kb downstream of 2d34 u in Figure 5B Nelson et al. 
Pharmacogenetics 14, 1-18 (2004) NT_039621.1 - strand 44070344 MELLTGTGL 44070318 44070324 WPVAIFTVIFILLVDLMHRHQHWTSRCPPGPVPWPVLGDLLQVNVYNIPYSLYK 44070163 44069514 LKKSCGDMFSLHMGWKPMVMIKGLKSVQDVLVTCGEDTADCPKIPVFHYI 44069365 44067376 QVQKEIDKVIGQVWHPEMADLGLMPFKKSVIHEVHHFADITAIP 44067245 44066770 QGKSFIPNLCSMLKDETVWEKPLHFHPKHFLDAQGHFVKHEVFMPFSAG 44066624 Cyp2d35-ps mouse GenEMBL NT_039621.1 This seq was assembled from several smaller pieces found earlier NT_039621.1 - strand 44113633 VIWLLTGTGL 44113604 44113610 WPVAIFTVIFILLVDLIHLCQHWTSCYPPGPVPCPVLGNLLQVDLYNMPYSLYK 44113449 44112585 MFSLQMVWKPMVLIKELKSVQDVLVTCGGGTVDRPEIPIFHHIGCGPKAK 44112436 44112148 XXLLASYGPEW*EQRPFSVSILCNFSQGKKFLEQSVTDEAGHICDTFTAQAG 44111999 44111917 SPLKPYTLLDKTLCNVIVSLIYAHRFKYGGPDIIKMLKVLKDNMGGKIGLIPE 44111759 44111115 VLNTFPVLLHIPGLADKVFPGKKTFLTIMDKLVTEHKKIWDLYQPSCDLTGAFLAEMEK 44110939 44110801 AKGNPESSFRESNLCLVVLDLLGDGIVTSSVTLTWGLLLTILHLDVQ 44110661 44110375 MPYTNAVIHEVPCYDDIIPIFLPHRTSSDVEMQDFLITK 44110259 44109226 SVLNDETVWEKSLCFLPDHFLDAQGNFVKPEAFMPFSAG 44109110 44109006 XQACLREPLAHMELFLFFTCLLQHFSFSVPAGQPLLSDYGIYTMPVSPEPYQLCAVVC* 44108833 Cyp2d36-ps mouse GenEMBL NT_039621.1 NT_039621.1 - strand 44142171 MELLTETDLWPVAIFTVIFILLVELMHQCQR*TSFYTPGPVPWPLLGNLLQVDLDNMPYSLYK 44141983 44141174 NHYGDMSSLHMG*KSMVVISGLKAVQDVLVTC 44141079 44139955 GEDTTDCPEIPIFQHIGCGPKAK 44139887 44139615 GVVPAPYGLEWQEQR*FSVSTLCNFGL 44139535 44139535 GKKSLKQWVMEEAGH 44139491 44139399 SPLNPFPLLDKAGLNVSASLIYAHCFE*EDPVIIKMLTVLRK 44139274 44139026 VLNTFSIPLHIRGLADKAFPVQKTFLTIVDKMLTEHKRT*DPDKPP*DLIDAYLAKMKK 44138850 44138722 XXGNPESSFNETNLXX 44138687 44138681 VVLDQLGARIMTISITLT*VLLLMILHPHVQ 44138589 44138362 VGQYINKVISQVWHSGMADQGLMPFINVVIHEVQHFADIIAIPLPHRTSPDIKVLGSLIPK 44138180 44130610 GMNLIPNLSSVFKDNTVWEKPFCFHPEQFLDAQGHFVKHKAFMPFSAG 44130467 44130363 XQACLGDPLACMELFLFFTCILQRFSFSVPAGQPLHSDYGIYAMPVTPEPCQFCLV 44130199 Cyp2d37-ps mouse GenEMBL NT_039621.1 Old temp name = hhp, 3 frameshifts and a stop codon 81% to 2d13 
NT_039621.1 - strand 44151915 MELLTGTGLWPVVIVTVIFILLVDMLHRCQRWTSCCPPDPVPWPVLGNLLQVDLDNMPYNLYK 44151727 44150957 LHNRYGDVFSLQMGWNHMAVINGLKVIQEVLVTCGEDTADRPEMPIFPHLGYGQKAK 44150787 44150509 GVVLAPYGPEWKEQR*FSASTLCNFSLGKKSLEQWVMEEVGHLFDVFTAHA 44150357 44150275 GSPLNPYPLLDKAVCNVIVSLIYAHRFEYGDPDFIKMLKVLKENMGENIGLFSE 44150114 44149452 VLNTFPILLRIPGLADKVFPGQKTFLIMVDKLVTEHKRTWNSDQPPRDLTDAFMAEMEK 44149276 44149137 AKGNPESSFNDANLCLVVLDLLGAATVTTSTTLSWALLLMILHPDVQ 44148997 44148774 QVQQEIDEVIWYVWLPEMADQVCMPFTNAVIHEVQ 44148670 44148653 XXXDIIPITLPHRTSRDIEVWGFLIPK 44148582 44148149 GMTLISNLF 44148123 44148124 SVLKDETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSA 44148011 44147914 GHRSCLGEPLALMELFLFFTCLLQRFSFSMPAGQSLPSDYGIYTMPVTPAPYQLCAVV 44147741 Cyp2d38-ps mouse GenEMBL XP_194978, LOC271298 chr 15 XM_194978, NT_039621.1 - strand 44166184 PVAIFTVILILLVNLMHRLQCWTSRYPPGPVPWLVLGNLLQADLHNMTYNLYK 44166026 44165213 LQNWCGDVFSLQMISKPVVVIKGLNAVGE 44165127 44165125 LLVSCGEGTAEWPEIPIFHHIVCGPKTK 44165042 44164762 GVILAP*GCEWREQR 44164718 44164722 RGSVSILCNFSLGKKSLEQCVMEKAGHICDAFTVQAG 44164612 44164557 SSLNPLSLLDKSLCNVVAYLIYA 44164489 Cyp2d39-ps mouse GenEMBL NT_039621.1 Old temp name jj Cyp2d26 like pseudogene exons 4,5,6,7,8(partial),9 NT_039621.1 - strand 44178330 FDYGDPDIIKMLKALKENKGEKIGMIPH 44178247 44177610 VLNTFPILLHILELADKVFPGQKT 44177539 44177539 ILTMVDKLVIAHKRTGDCEKPHQELTD 44177459 44177454 AFLAEREX 44177434 44177299 AKGNPESSFNDANLCLVVLDLFGGGILTSSITLTWAL*LVILHP 44177168 44176934 RVQQDEVIVHVW*PKMANQANMSYSNAAIHEIQCYADIIPIHLPDRTSLDI*VQGFLLPK 44176755 44176344 GTKIIPNLSSVI 44176309 44175091 GHQVCLGEPLASMELFLFFTCLLQCFSFLVPTG*PQPSNYGIYAMPVTPEPYQLCAVV 44174918 44175055 MELFLFFTCLLQCFSFLV 44175002 note 9kb from rest of N-term at 2d32p Cyp2d40 mouse GenEMBL NT_039621.1 Old temp name = rr 84% to 2d13 NT_039621.1 - strand 44223024 MELLTGTDLWPVAIFTVIFILLVDLLHRRQRWTSRYPPGPVPWPVLGNLLQVDLDNMPYSFYK 44222836 44222037 LQNHYGDMFSLQMGWNAMVIVNGLKAVQEALVTCGEYTADRPEMPIFPHLGYGQKDK 44221867 44221588 
GLVLAPYGPEWQEQRRFSMSTMRNFGLGKKSLEQWVTEEAGHLCDAFTDQA 44221436 44221354 GSPLNPYTLLNKAVCNVIASLIYAHRFKYKDPDFIKMLKVLKENTREKIGLIPE 44221193 44220527 VVKMFPIVLRIPGLADKIFPGQKTFLTMVDKLVTEHKRTWDPDQPPRDLTDAFMAEMET 44220351 44220212 AKGNPESSFNEANLRLVVLDLFGGGIVTTSATLTWALLLMILHPDVQ 44220072 44219854 RRVQEEIDEVIGQARRPEMADQARMPYTNAVIHEVQRFADIAPMTLPHRTSCDIEVQGFLIPK 44219666 44219246 GTTLICNLSSVLKDETVWEKPLRFYPEHFLDAQGHFVKPEAFMPFSA 44219100 44218999 GRRACLGEPLVRMELFLFFTCLLQRFSFSVPDGQPLPSDYGIYSMVVSPAPYQLCAVVR* 44218820 Cyp2d40-de7b9b mouse GenEMBL NT_039621.1 detritus exons 7,9 fragment NT_039621.1 - strand t in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004) 44201031 VQQEINKFIGQVWRPETAVIHEVQCFANITPITLPHRTSCDIEVQGFLTPK 44200879 44200789 PSDYGIYSMPVTLEPYQLCVVVQ 44200721 Cyp2d41-ps mouse GenEMBL NT_039621.1 old temp name = ssp, 82% to 2d13 one stop codon possible pseudogene NT_039621.1 - strand 44241024 MELLTGTDLWPVAIFTVIFILLVDLMHRHQRWTSRYPPGPVLWPVLGNLLQVDLDNMPYSLYK 44240836 44240062 LQNRYGDVFSLKLGRNPMVIVNRLMAVQEVLVTCGENTADRPEMPIFLPPSNGQKAK 44239892 44239602 GLAFAPYGPEWQEQKRFSMSTLRNFGLGKKLLEQ*MTKEAGHLCDAFTAQA 44239450 44239368 GSPLNPYTLLEKAMCNVIASLVYAHCFEYEDPDCIKMLRALKEYMIEKIGLIPEV 44239204 44238543 VKMFPIVLRIPGLADKIFPGQTTFLTMVDKLLTEHKRTWDPDQPPRDLIDAFLAEMEK 44238370 44238242 AKGNPESSFNEANLRQIVLDLFGAGTAPTSTTLSWALLLMILHPDVQ 44238102 44237884 SLVQEEIDEVIGQARRPEMADQARMPYTNAVIHEVQRFADIAPMTLPHRTSCDIEVQGFLIPK 44237696 44237268 QGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSA 44237125 44237024 GRRSCLGESLARMELFLFFTCLLQRFSFSVPDGQPQPSDYGIYSILVSPAPYQLCAVVR 44236848 CYP2D42 Macaca mulatta (rhesus monkey) No accession number Brian A. Carr, Merck & Co. Inc. 
Submitted to nomenclature committee 4/22/2004 93% to CYP2D6, probable ortholog of CYP2D6 CYP2D43 Bos taurus (cow) See cattle page for details 94% to 2D14 cow 5681 MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPMPLPVLGNLLQVDFEDPRPSFNQ LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPQALYKHLGFGPRAEG 6760 7291 VILARYGNAWREQRRFSLSTLRNFGLGKKSLEQWVTEEASCLCAAFADQA 7449 7550 GHPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIVKLLDVMEDGLKEEMKIMRQV 7714 8109 VEAVPVLLSIPGLAAKVVPGQKAFMTLVDELIAEQKMTRDPTQPPRHLTDAFLDEVKE 8288 AKGNPESSFSDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR 8591 8806 RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK 8985 9424 GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA 9603 GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSDHGVFVALVTPAPYQLCAVPR 9843 CYP2D44 Macaca fasicularis (cynomolgus monkey) No accession number ESTs BB890306, BB877128, BB888901, BB887284, BB877988, BB881640 Yasuhiro Uno Submitted to nomenclature committee 9/29/2005 93% to M. mulatta 2D42, 92% to 2D17 M. fasicularis 91% to 2D6 differs from 2D17 another cynomolgus seq. 
complete sequence CYP2D45v1 Xenopus tropicalis (frog) See Xenopus page for seq CYP2D45v2 Xenopus tropicalis (frog) See Xenopus page for seq CYP2D46 Xenopus tropicalis (frog) See Xenopus page for seq CYP2D47 Xenopus tropicalis (frog) See Xenopus page for seq CYP2D48 Xenopus laevis GenEMBL BC077934 56% TO CHICKEN 2D49 MSLLSQLCSFAFGCNVFTLGIICTLCLLLLDYMKRKKPCKNFPP SPPSKPFVGNLLQLNFRNLNNSFKQLSKQYGDVMSLQVFWKPVVVLNGLEVMKEALIQ KSEDTADRPEFHVLEILGFVGNNKAVVLANYGQSWKDLRRFTLSTLRDFGMGKKSLEE RVREEAGYLCAAFQSEQGRPFDPHILLNTAVSNVICSIIFGERFEYDDHTFQKLLCLI EESVKAESGAVPQIIASLPWSSKIPGLAKMFFQPRIRMLKYLQEIIKDHQQTWDSGHT RDFIDAFMLEMEKAKGVKDSNFNEQNLLLTTADLFSAGSETTNTTLRWGLLFMLLYPN VQRKVHEEIDHVIGRTRKPTMGDVLQMPYTNAVIHEIQRYVDIVPLSVPHMTYRDTHI QGFFIPKGVTIMTNLSSVLKDEKAWEKPFQFYPEHFLDRDGKFVKREAFMAFSAGRRV CLGEQLARMELFLFFTTLLQRFSFQIPNGEPSPREDPVFVFLQLPHDYKMCAKVR CYP2D49 Gallus gallus (chicken) chr1:46131304-46140141 ENSGALT00000019412.2 MTLLLWLSSWSNISVLGVFLTVFTILVDFMKRRKKWSRYPPGPMPLPFVG TMPYVNYYNPHLSFEKFRKKFGNIFSLQNCWTNVVVLNGYKTVKEALVNK SEDFADRPYMPVYEHLGYGHKSEGLVLARYGHLWKELRKFTLTTLRNFGM GKKSLEERVTEEAGFLCSAISSEGGHPFDPRFLVNNAVCNVICTITYGER FDYGDKTFKKLLTLFENSLNEEAGFLPQLLNVAPVLLRIPGLPQKIFPCQ KAYVDFTQMLIDKHKETWNPAYIRDFTDAFLKEMAKGKEAEENGFNKSNL TLVTADLLVAGSETTATTLRWAFLFMLLYPEIQSKVHKEIDKVIGRNRPP TMADQVNMPYTNAVIHEVQRFGDVVPMGLPHMTYRDTELQGFFIPKGTTI ITNLTSVLKDETAWKKPNEFYPEHFLNENGQFVRPEAFLPFSAGRRACLG EQLTRMELFIFFTTLMQKFTFVFPEDQPRPREDSHFAFTNSPHPYQLRAV PSITQDQGK CYP2D50 horse No accession number Heather Knych Submitted to nomenclature committee Oct. 
3, 2007 80% to cattle CYP2D14 and CYP2D43 Cyp2d-se1[1:8:9] mouse GenEMBL NT_039621.1 old temp name = xxp about 400,000 bp from the main Cyp2d cluster + strand solo exons 1,8(partial),9 frameshift in exon 1 ortholog to CYP2D-se2[9] rat 43401344 MGLLTS 1361 43401361 LLSVAIFAAIFLLLVDIMQRCQCWATCYLLLLDFQNMPYSLYK 1489 43402076 EETVWEKPLRFHPELFLDAQGHFVKPEAFMPFSA 2177 43402729 GHRSCLGEPLACMKLFLFFTCLLQRFSFSVPDGQPQPSNCGVFPFLVAPSLYQLCAVLLKQGH 2917 CYP2D-se2[9] rat UCSC browser chr7:120386407-120386565 exon 9 (+ strand) 73% to 2D3 ortholog to Cyp2d-se1[1:8:9] mouse ACLGEPLTCMELFLFFICLLQSFSFSVKAGQPRPSNHGIFEMPISPSSYQLCA 2E Subfamily CYP2E1 human PIR A60554 (18 amino acids) Robinson, R.C., Shorr, R.G.L., Varrichio, A., Park, S.S., Gelboin, H.V., Miller, H. and Friedman, F.K. Human liver cytochrome P-450 related to a rat acetone-inducible, nitrosamine-metabolizing cytochrome P-450: identification and isolation. Pharmacology 39, 137-144 (1989) CYP2E1 Macaca fasicularis (cynomolgus monkey) No accession Wu Zhicong Submitted to nomenclature committee 10/30/2006 Only 3 aa diffs to CYP2E1 Macaca mulatta (rhesus monkey) Note: the 2E1 seq from 1992 S55205 differs from this seq at 12 amino acids and a frameshifted region, but this seq matches rhesus monkey at 9/11 sites so this seq is probably more accurate. One site is not included in the shorter S55205 seq. CYP2E1 Macaca fasicularis (monkey) GenEMBL S55205 (1508bp) Swiss P33266 (449 amino acids) PIR S28167 (449 amino acids) Komori,M., Kikuchi,O., Sakuma,T., Funaki,J., Kitada,M. and Kamataki,T. Molecular cloning of monkey liver cytochrome P-450 cDNAs: similarity of the primary sequences to human cytochromes P-450. Biochim. Biophys. Acta 1171, 141-146 (1992) CYP2E1 Macaca mulatta (rhesus monkey) NM_001040213 Brian A. Carr, Merck & Co. Inc. Submitted to nomenclature committee 4/22/2004 94% to CYP2E1, ortholog of CYP2E1 CYP2E1 Mesocricetus auratus (hamster) GenEMBL D17449 (2512bp) Sakuma,T., Takai,M., Yokoi,T. and Kamataki,T. 
Molecular cloning and sequence analysis of hamster CYP2E1. Biochim. Biophys. Acta 1217, 229-231 (1993)

CYP2E1 hamster PIR S27176 (34 amino acids)
Puccini, P., Menicagli, S., Longo, V., Santucci, A. and Gervasi,P.G. Purification and characterization of an acetone-inducible cytochrome P-450 from hamster liver microsomes. Biochem. J. 287, 863-870 (1992)

CYP2E1 rat GenEMBL S48325 (1093bp)
Richardson,T.H., Schenkman,J.B., Turcan,R., Goldfarb,P.S. and Gibson,G.G. Molecular cloning of a cDNA for rat diabetes-inducible cytochrome P450RLM6: hormonal regulation and similarity to the cytochrome P4502E1 gene. Xenobiotica 22, 621-631 (1992)

CYP2E1 rat PIR B27425 (34 amino acids)
Favreau, L.V., Malchoff, D.M., Mole, J.E. and Schenkman, J.B. Responses to insulin by two forms of rat hepatic microsomal cytochrome P-450 that undergo major (RLM6) and minor (RLM5b) elevations in diabetes. J. Biol. Chem. 262, 14319-14326 (1987)

CYP2E1 rat GenEMBL AF061442
Yoo,M. and Shin,S.W. The complete coding sequence of the rat brain cytochrome P450 2E1. Unpublished

Cyp2e1 mouse GenEMBL L11650 (1827bp) Swiss Q05421 (493 amino acids)
Davis,J.F. and Felder,M.R. Mouse ethanol-inducible cytochrome P450 (P450IIE1). Characterization of cDNA clones and testosterone induction in kidney tissue. J. Biol. Chem. 268, 24933-24939 (1993)

Cyp2e1 mouse PIR A21231 (39 amino acids)
Ryskov, A.P., Ivanov, P.L., Kramerov, D.A. and Georgiev, G.P. Mouse ubiquitous B2 repeat in polysomal and cytoplasmic poly (A)+RNAs: unidirectional orientation and 3'-end localization. Nucleic Acids Res. 11, 6541-6558 (1983)
C-terminal 39 amino acids

CYP2E1v1 dog no accession number
Susan M. Lankford and Stephen A. Bai
submitted to nomenclature committee

CYP2E1v2 dog no accession number
Susan M. Lankford and Stephen A. Bai
submitted to nomenclature committee
note: only one amino acid difference with 2E1v1

CYP2E1 Canis familiaris (dog) NW_876287.1: 395882-405665
Joanna Wilson and students
submitted to nomenclature committee Feb.
17, 2009
77% to human CYP2E1
MAALGITVALLVWMATLMLISIWKQIYSRWKLPPGPFPLPIIGNILQVDIKNVPKSLAKLAEQYGPVFTLYLGSQ
RTVVLHGYKAVKEVLLDHKNDLSGRGEVFAFQSHKDRGITFNNGPGWKDTRRLSLSTLRDYGMGKRGNEERIQRE
IPFLLEALRGTRGQPFDPTFLLGFAPFNVIADILFHKHFDYSDQTGLRIQKLFNENFHLLSTGWLQLYNIFPSYL
HYLPGSHRKVLRNVAELKDYSLERVKEHQESLDPTCSRDFTDCLLQELQKERYGTEPWYTLDNIAVTVADLFFAG
TETTSTTLRYGLLILMKYPEVEEKLHEEIDRVIGPSRVPAIKDRLEMPYMDAVVHEIQRFIDLLPSNLPHVANQD
TMFRGYVIPKGTVVIPTLDSVLFDKQEFPDPEKFKPEHFLNENGKFKYSDYFKAFSAGKRVCVGKSLARMELFLF
LSAILQHFNLKSLVDPKDIDLSPCTIGFAKIPPHYKLCVVPRSG*

CYP2E2 rabbit GenEMBL J03726 (multiple genomic fragments) GenEMBL M19162 (multiple genomic fragments) GenEMBL M19163 (multiple genomic fragments)
Khani,S.C., Porter,T.D., Fujita,V.S. and Coon,M.J. Organization and differential expression of two highly similar genes in the rabbit alcohol-inducible cytochrome P-450 subfamily. J. Biol. Chem. 263, 7170-7175 (1988)

CYP2E1 Sus scrofa (pig) GenEMBL AB000885.1
Kimura,M., Kawakami,K., Suzuki,H. and Hamasima,N. Cloning of the pig cytochrome P-450-j gene. Unpublished

CYP2E1 Sus scrofa (pig) GenEMBL AB052259
Misaki Kojima
2 amino acid differences with AB000885.1
Submitted to nomenclature committee Oct. 27, 2000
clone name c469

CYP2E1 Bos taurus (cow) GenEMBL AJ001715
van Raak,M., Natsuhori,M., Ligtenberg,M., Kleij,L., ten Berghe,D., de Groene,E.M., Van Miert,A.S., Witkamp,R.F. and Horbach,G.J.
Isolation of a full length cytochrome P450 (CYP2E) cDNA sequence and its functional expression in V79 cells Unpublished 79% to human 2E1 MAALGITVALLVWMATLLFISIWKHIYSSWKLPPGPFPLPIIGNLLQLDIKNIPKSFTR LAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNN GIIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQ GQPFDPTFVVGFAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQ LYNNFPDYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEM AKERHSVDPMYTLENIAVTVADLLFAGTETTSTTLRYGLLILMKYPEVE EKLHEEIDRVIGPSRIPAVKDRLDMPYLDAVVHEIQRFIDLLPSNLLHEATQDTVFRGYVIPK GTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSA GKRVCVGEGLARMELFLLLAAILQHFNLKSLVDPKDIDLSPIAIGFGKIPPRYKLCLIPRSKV* CYP2E1 horse No accession number Heather Knych Submitted to nomenclature committee Oct. 17, 2007 CYP2E1 Balaenoptera acutorostrata (Minke whale) No accession number Iwata Hisato submitted to nomenclature committee 1/6/05 84% to CYP2E1 cow, 76% to CYP2E1 human 2F Subfamily CYP2F1 human GenEMBL J02906 MDSISTAILLLLLALVCLLLTLSSRDKGKLPPGPRPLSILGNLL LLCSQDMLTSLTKLSKEYGSMYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYP AFFNFTKGNGIAFSSGDRWKVLRQFSIQILRNFGMGKRSIEERILEEGSFLLADVRKT EGEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGELYD ILDPRFPSLLDWVPGPHQRIFQNFKCLRDLIAHSVHDHQASSPRDFIQCFLTKMAEEK EDPLSHFHMDTLLMTTHNLLFGGTKTVSTTLHHAFLALMKYPKVQARVQEEIDLVVGR ARLPALKDRAAMPYTDAVIHEVQRFADIIPMNLPHRVTRDTAFRGFLIPKGTDVITLL NTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAGRRLCLGELLARMELFLYL TAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLRPR CYP2F1 Bos taurus (cow) See cattle page for details LSKEFGAVYTVYLGPRRVVVLSGYQAVKEALVDQAEEFGGRGDYPVFFNFTKGN GIAFSNGDRWKVLRKYSVQILRNFGMGKRTIEERILEEGHFLLEELRKTQ GKPFDPTFVVSRSVSNIICSVIFGSRFDYDDDRPLSIIHLINENFQIMSSPWGE MYNIFPNLLDWVPGPHRRLFKNYGRIKDIIARSVREHQASLDPNSPRDFIDCFLTRWH QEKQDPLSHFFMDTLLMTTHNLLFGGTETVGTTLRHAFRLLMKYPEVQ VRVQEEIDRVVGHERLPTVEDRAAMPYTDAVIHEVQRFADVIPMSLPHRVTRDTNFRGFTIPR GTDVITLLNTVHYDPSQFLKPKEFNPEHFLDANMSFKKSPAFMPFSA GRRLCLGEALARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPYQLCVLAR CYP2F1P human AC008537.3 93% identical to 2F1 
Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H., Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E., Idle,J.R. and Gonzalez, F.J. A genetic polymorphism in coumarin 7-hydroxylation: sequence of the human CYP2A genes and identification of variant CYP2A6 alleles. Am. J. Hum. Genet. 57, 651-660 (1995)
There are two 2F1 genes, and one pseudogene of 2F1 on chromosome 19.
GEPFDPTFVLSRSRSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGE
LYDIFPSLLNWVPGPHQRIFQNFKCLRDLIAHSVHDHQASLDPRSPRDFIHCFLTKMAE
KKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLAFMKYPKVQ
AHVQEEINLVVGHVRLPALKDRAAMPYTDMVIHEVQRFADIIPMNLPHRITRDTAFHGFLIPK
GTDVITLLNTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAG
HRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLHPR

CYP2F1 Canis familiaris (dog) NW_876313.1:NW_876270.1:43272128-43283098
Joanna Wilson and students submitted to nomenclature committee Feb. 17, 2009
86% to human CYP2F1
MDGVSTAILLGLLALAFLFLILNSRGKSQLPPGPRPLPFLGNLLQLRSQDMLTSLTKSKEYGSVYTVHLGPRRVV
VLSGYQAVKEALVDQGEDFSGRGDYPVFFNFTKGNGIAFSNGDRWKVLRRFSVQILRNFGMGKRSIEERILEEGS
FLLAELRKTEGKPFDPTFVLSRSVSNIICSVIFGSRFDYDDERLLTIIRLINDNFQIMSGPWGEQLYNIFPSLLD
WIPGPHRRLFQNFGCMKDLIARSVRDHQDSLDPRCPRDFIDCFLNKMAQEKQDPHSHFHMDTLLMTTHNLIFGGT
ETVGTTLRHAFLVLMKYPKVQARVQEEIDRVVGRARLPALEDRAAMPYTDAVIHEVQRFADVIPMNLPHRVIRDT
PFRGFLLPKGTDIITLLNTVHYDPNQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAGRRLCLGESLARMELFLYL
TAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLRLRTR*

Cyp2f2 mouse GenEMBL M77497, NT_039413.1 + strand Swiss P33267 (491 amino acids)
Ritter J.K., Owens I.S., Negishi M., Nagata K., Sheen Y.Y., Gillette J.R. and Sasame H.A. Mouse pulmonary cytochrome P-450 naphthalene hydroxylase: cDNA cloning, sequence and expression in Saccharomyces cerevisiae. Biochemistry 30, 11430-11437 (1991)

CYP2F3 goat GenEMBL AF016293
Huifen Wang, Diane L. Lanza, and Garold S. Yost. Cloning and expression of CYP2F3, a cytochrome P450 that bioactivates the selective pneumotoxins 3-methylindole and naphthalene submitted

CYP2F4 rat GenEMBL AF017393
R.
Michael Baldwin and Alan Buckpitt submitted to nomenclature committee

CYP2F5 Gorilla gorilla GenEMBL AF372494
Chen,N., Whitehead,S.E., Caillat,A.W., Gavit,K., Isphording,D.R., Kovacevic,D., McCreary,M.B. and Hoffman,S.M. Identification and cross-species comparisons of CYP2F subfamily genes in mammals Mutat. Res. 499 (2), 155-161 (2002)

CYP2F6 Macaca mulatta (rhesus monkey) No accession number
Mike Baldwin
PDF file of nucleotide/amino acid alignment
This file shows polymorphism data. The particular sequence shown is a pseudogene due to a premature stop codon.
PDF file for the sequences of a non-truncated version
PDF files from Mike Baldwin

2G Subfamily

CYP2G1P human GenEMBL S80997, S80998, S80999
Sheng J, Ding X Biochem. Biophys. Res. Commun. 218, 570-574 (1996) Identification of human genes related to olfactory-specific CYP2G1.
2 PCR fragments for a human 2G1 are presented and 2 more PCR fragments from two possible 2G1 pseudogenes are also shown.
86% identical to rat 2G1

CYP2G1P human GenEMBL AC008537 genomic DNA in 93 fragments
Sequence is assembled from fragments and it may need to be revised.
The asterisks indicate intron locations, except the last one, which is a stop codon.
The sequence is 78% identical to rat 2G1.
There is a frameshift after YMGP on the second line.
CYP2G1 is 58-59% identical to some CYP2A sequences so it may actually be a CYP2A sequence.
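The asterisk convention described for the assembled CYP2G1P sequence (asterisks mark intron positions, and the final asterisk is the stop codon) can be unpacked mechanically. The Python sketch below is illustrative only: the function name `split_annotated_protein` and the toy input string are assumptions made for this example and are not part of the listing. It applies only to entries that use this convention; in other entries on this page an internal asterisk marks an in-frame stop codon instead.

```python
def split_annotated_protein(annotated: str):
    """Split a protein string annotated with '*' intron markers.

    Assumed convention (from the CYP2G1P note): every '*' marks an intron
    position, except a trailing '*', which marks the stop codon.
    Returns (exon_peptides, joined_protein, has_stop).
    """
    # Remove spaces and line breaks that separate sequence chunks.
    cleaned = "".join(annotated.split())
    # A trailing asterisk is the stop codon, not an intron marker.
    has_stop = cleaned.endswith("*")
    body = cleaned[:-1] if has_stop else cleaned
    # The remaining asterisks delimit the exon-encoded peptide segments.
    exons = body.split("*")
    return exons, "".join(exons), has_stop


if __name__ == "__main__":
    # Toy string for illustration only; not a sequence from this listing.
    toy = "MELGG*AVTIF LALRL*SCLLI*"
    exons, protein, has_stop = split_annotated_protein(toy)
    print(exons)
    print(protein, has_stop)
```

Returning the joined protein alongside the per-exon pieces makes it easy to compare the assembled sequence against a reference while still keeping the intron positions.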
The 2G subfamily might be absorbed by CYP2A CYP2G1P revised seq AC008537 missing exons 4, 5 and 6 MELGGAVTIFLALRLSCLLILIAWKRMDKAGKLPPGPTPILFLGHLLQVRTDATFQSFMK LREKYSPVFTVYMGP (fs) RPVVVLCGHEAVKEALIDQADEFSGRGELASIKQNFQGHG VALANGERWRILRRFSLTILRDFGMGKQSIKERIQEEASYLLEEFQKTK AKIHEEINQVIGPHRLPRVDDRVKMPYTDVVIHEIQRLVDIVPMGVPHNIIQDTQFRGYLLPK GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSSGRGK RICLGEAMARMELFLYFTSTLQNFSLCSLVPLVDIDITPKLSGFGNITPTYELCLVAR CYP2G1 chimp Not a pseudogene CYP2G1 Bos taurus (cow) See cattle page for details 88% to human pseudogene 2G2P 3860 MELGGAFTIFLALCLSCLLILIAWKRMSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK(0) 4039 4854 LKEKYGPVFTVYMGPRPVVVLCGHEAVKEALVDRADEFSGRGELASVERNFQGH(1)5015 6748 GVALANGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGFLLVELRKTR(1)6897 8738 GARIEPTFFLSRTVSNVISSVVFGSRFDYEDQQFLKLLQMINQSFIEMSTSWAQ (0) 8899 9151 LYDMYSGIMQYLPGRHNRIYYLIEELKDFIASKVKINEASLDPQNPRDFIDCFLIKMHQ(0) 9327 300 DKNNPHTEFNLKNLVLTTLNLFFAGTETVSSTLRYGLLLMMKHPEVE(1)145 997 AKIHEEIDQVIGPHRIPSVDDRAKMPYTDAVIHEIQRLTDIVPMGVPHNVIRDTHFRGYLLPK(0) 1185 1314 GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGHFKKNEAFVPFSS(1) 1454 586 GKRICLGEAMARMELFLYFTSILQNFSLRSLVPPADIDITPKVSGFGNIPPTYELCFMVR(1) 765 CYP2G1 rat GenEMBL M33296 CYP2G1 rabbit PIR B31944 (50 amino acids) Ding, X. and Coon, M.J. Purification and characterization of two unique forms of cytochrome P-450 from rabbit nasal microsomes. Biochemistry 27, 8330-8337 (1988) Cyp2g1 mouse GenEMBL L81171, NM_013809, NT_039410.1 Hua, Z., Zhang, Q.Y., Su, T., Lipinskas, T.W., Ding, X. cDNA cloning, heterologous expression, and characterization of mouse CYP2G1, an olfactory-specific steroid hydroxylase. Arch. Biochem. Biophys. 
340, 208-214 (1997) 94.9% identical to rat CYP2G1 CYP2G1 Canis familiaris (dog) chr1:115782146-115791970 UCSC broswer May 2005 assembly 90% to human 2G2P MELGGAFTIFLALSLSCLLILIAWKRNSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK LREKYGPIFTVYMGPRPVVVLCGHEAVKEALVDRADEFSGRGELASIERNFQGH GVALANGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGFLLEELRKTK GSPIEPTFFLSRTVSNVISSVVFGSRFDYEDKQFLKLLQMINESFIEMSTPWAQ LYDMYSGIMQYLPGRHNRIYYLIEELKDFIASRVKINEASLDPQNPRDFIDCFLIKMHQ DTNNPHTEFNLKNLVLTTLNLFFAGTETVSFTLRYGLLLMMKHPEVE AKIHEEIDQVIGPHRIPSVDDRAKMPYTDAVIHEIQRLTDIVPMGVPHNVIRDTHFRGYLLPK GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSS GKRICLGEAMARMELFLYFTSILQNFSLHSLVPPADIDITPRVSGFGNIPPTYELCLKAR CYP2G2P human AC008962 comp(28700-40696) seq of gene has two in frame stop codons MEMGGAVTIFLALCLSCLLILIAWK*MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK LREKYSPVFTVYMGPRPVVVLCGHEAVKEALVDQADEFSGRGELASIKQNFQGHG VALANGERWRIL*RFPLTILRDFGMGKRSIEERIQEEASYLLEEFRKTK GAPIDPIFLLSRTVSNVISSVVFRSRFDYEDKQFLNLLRLINESFIEMSTPWAQ LYDMYSGIMQYLPGRHNLIYYLVEELKDFIASRVKINEASFDPQNPRDFIDCFLIKMH QDKNNPRTEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLMKHPEVE AKIHEEINQVIGPHRLPRVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNLIRDTQFRGYLLPK GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSSGR GKRICLGEAMDRMELFLYFTSTLQNFSLHSLVPPVDIDITPKLSGFGNIPPTYELCLVAR* CYP2G2 Macaca mulatta (rhesus monkey) Note this does not look like a pseudogene exon 2 = trace archive file 456149111 MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK (0) LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGH (1) GVALANGERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTK (1) GAPIDPTFLLSRTVSNVISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQ (0) LYDMYSGIMQYLPGRHNRVYYLIEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQ (0) DKNNPRTEFNLKNLVLTALNLFFAGTETVSSTLRYGFLLLMKHPEVE (1) ARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNVIRDTQFRGYLLPK (0) GTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNEAFVPFSS (1) GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLSGFGNIPPTYELCLVAR 2H Subfamily CYP2H1 chicken PIR D44107 (22 amino acids) Nakai, K., Ward, A.M., Gannon, M. 
and Rifkind, A.B. Beta-naphthoflavone induction of a cytochrome P-450 arachidonic acid epoxygenase in chick embryo liver distinct from the aryl hydrocarbon hydroxylase and from phenobarbital-induced arachidonate epoxygenase. J. Biol. Chem. 267, 19503-19512 (1992)

CYP2H2 chicken PIR E44107 (25 amino acids)
Nakai, K., Ward, A.M., Gannon, M. and Rifkind, A.B. Beta-naphthoflavone induction of a cytochrome P-450 arachidonic acid epoxygenase in chick embryo liver distinct from the aryl hydrocarbon hydroxylase and from phenobarbital-induced arachidonate epoxygenase. J. Biol. Chem. 267, 19503-19512 (1992)

2J Subfamily

CYP2J1 rabbit GenEMBL D90405
Kikuta, Y., Sogawa, K., Haniu, M., Kinosaki, M., Kusunose, E., Nojima, Y., Yamamoto, S., Ichihara, K., Kusunose, M. and Fujii-Kuriyama, Y. A novel species of cytochrome P-450 (P-450ib) specific for the small intestine of rabbits. J. Biol. Chem. 266, 17821-17825 (1991)

CYP2J2 human GenEMBL U37143 (1876bp)
Wu, S., Moomaw, C., Tomer, K.B., Capdevila, J.H., Falck, J.R., and Zeldin, D.C. Molecular Cloning and Expression of CYP2J2, a Human Cytochrome P450 Arachidonic Acid Epoxygenase Highly Expressed in Heart. J. Biol. Chem., 271: 3460-3468 (1996)

CYP2J2 Macaca fascicularis (cynomolgus monkey) No accession number
Yasuhiro Uno Submitted to nomenclature committee 1/11/2005
Clone name mfCYP2J2_2-B5
94% to 2J2 human

CYP2J2 Canis familiaris (dog) NW_876313.1:19927114-19956047
Joanna Wilson and students submitted to nomenclature committee Feb.
17, 2009 78% to human CYP2J2
MLAAVGSLAATLWAVLHLRTLLLGAVAFLFFADFLKRRRPKNYPPGPVPLPFVGNFFHLDFEQSHLKLQRFVKKY
GNVFSVQMGDMPLVVVTGLPLIKEVLVDQNQVFVNRPITPIRERVFKNSGLIMSSGQIWKEQRRFTLATLKNFGL
GRKSIEERIQEEAHHLIQAIEEENGQPFNPHFKINNAVSNIICSITFGKRFEYQDEQFQELLRLLDEVTCLETSM
RCQLYNVFPWIIKFLPGPHQKLFNDWEKLKLFIAHMTENHRRDWNPAEPRDFIDAYLKEMEKGNATSSFHEENLI
YSTLDLFFAGTETTSTTLRWGLLYLALNPEIQEKVQAEIDRVIGQSQLPGLAVRESMPYTNAFIHEVQRMGNIVP
LNVPREVTGDTTLAGYYLPKGTVIVTNLTALHRDPAEWATPDTFNPEHFLENGQFKKREAFLPFSIGKRVCIGEQ
LARSELFIFFTSLVQRFTFRPPDNEKLSLEFRTGLTISPVSHRLRAIPRS*

CYP2J3 rat GenEMBL U39943 (1778bp)
Wu, S., Murphy, E., Gabel, S., Chen, W., Tomer, K.B., Foley, J., Steenbergen, C., Falck, J.R., Moomaw, C.R., and Zeldin, D.C. Molecular Cloning, Expression, and Functional Significance of a Cytochrome P450 Highly Expressed in Rat Heart Myocytes. submitted.
91% to mouse 2j9
exon 8 in a seq gap
UCSC browser chr5 shown below
116772039 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ 116771830
116767788 FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN 116767791
116766010 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG 116765861
116765445 GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ 116765284
116760602 LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAK 116760426
116758387 YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ 116758247
116754923 EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPK 116754735
GTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM
116749991 GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 116749815

CYP2J3P1 rat GenEMBL U40000 (1909bp)
Wu, S., Murphy, E., Gabel, S., Chen, W., Tomer, K.B., Foley, J., Steenbergen, C., Falck, J.R., Moomaw, C.R., and Zeldin, D.C. Molecular Cloning, Expression, and Functional Significance of a Cytochrome P450 Highly Expressed in Rat Heart Myocytes. submitted.
Not a true pseudogene, but an alternative splice variant of CYP2J3
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ
FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN
GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQGEAYHLVEAIKDEG
GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ
LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEGRDFIDAFLKEMAK
YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ
EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPRKVAVDTYLAGFNLPK
GTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM (GC boundary, retains intron)
GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL

CYP2J3P2 rat GenEMBL U40004
Wu, S., Murphy, E., Gabel, S., Chen, W., Tomer, K.B., Foley, J., Steenbergen, C., Falck, J.R., Moomaw, C.R., and Zeldin, D.C. Molecular Cloning, Expression, and Functional Significance of a Cytochrome P450 Highly Expressed in Rat Heart Myocytes. submitted.
Not a true pseudogene, but an alternative splice variant of CYP2J3
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ
FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN
GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG
GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ
LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAK
YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ
EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPKG (small deletion)
RDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM
GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKVSLQFRMSVTISPVSHRLCAIPRL

CYP2J4 rat GenEMBL L81170 (1826bp)
Zhang,Q.-Y., Ding,X., Kaminsky,L.S. cDNA cloning, heterologous expression, and characterization of rat intestinal CYP2J4 Arch. Biochem. Biophys.
340, 270-278 (1997) UCSC browser chr5 shown below 116734902 MLATAGSLIATIWAALHLRTLLVAALTFLLLADYFKTRRPKNYPPGPWGLPFVGNIFQLDFGQPHLSIQP 116734693 116725983 FVKKYGNIFSLNLGDITSVVITGLPLIKETFTHIEQNILNRPLSVMQERITNKN 116725822 116723426 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRMQEEAHYLVEAIREEK 116723277 116722875 GKPFNPHFSINNAVSNIICSVTFGERFEYHDSRFQEMLRLLDEVMYLETTMISQ 116722714 116718583 LYNIFPWIMKYIPGSHQTVFRNWEKLKLFVSSMIDDHRKDWNPEEPRDFIDAFLKEMSK 116718407 116716306 YPEKTTSFNEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEVQ 116716169 116713582 EKVQAEIDRVIGQKRAASLADRESMPYTNAVIHEVQRMGNIIPLNVPREVAMDTTLNGFHLPK 116713394 116711364 GTMVLTNLTALHRDPKEWATPDVFNPEHFLENGQFKKRESFLPFSM 116711227 116708412 GKRACLGEQLARSELFIFFTSLMQKFTFKPPTNEKLSLKFRNGLTLSPVTHRICAVPRE* 116708233 CYP2J4-de6b rat UCSC browser chr5: 116706163-116706053 (- strand) exon 6, frag w in fig. below 116706163 XXXXXXSFCEENLTCRTLDFLYAGIDTISNRLHWVLLLTCVNPEXX 116706053 rat, mouse and human 2J cluster Cyp2j5 mouse GenEMBL U62294 (1886bp), NT_039263.1 J. Ma and D.C. Zeldin, unpublished. clone JM-6 CYP2J5P rat UCSC browser Chr5: 116785102-116780337 (- strand) exons 1-4 69% to 2j5 mouse now a pseudogene ortholog 116785102 MITSLSSLVTSSWAALLLRTLLLAAVTFLFLAGILRRHRPKDYQPGPWRLPFVGNFFQIDFEQSHLVLQK 116784893 116784415 FAKKYGNVFSLELDRPSVVVVTGQPLIKTKMFTHLEQNFANHFVTSVRKRAIGNN 116784251 116781318 GLITSNGQTWKEKRRFALMTLKNFGLGKKSLEQRMHE*AFHLVEARREEG 116781169 116780474 GQPVDLHLINNAVANVICSITFGGRFEYEDCQFQEMPTLLDEALHV 116780337 Cyp2j5-de2b mouse GenEMBL NT_039263.1|Mm4_39303_30 detritus exon 2 q in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7613530 FVKKYGNLFSLELDSISVEVVSGLL 7613456 7613456 LIKEMFTHLDHNFVNRPVSAIQKHV 7613382 Cyp2j5-de9b mouse GenEMBL NT_039263.1|Mm4_39303_30 detritus exon 9 r in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7603742 GK*ACPGEHLAISELFIIFTDLM*NFTFKAPINQKLSLS 763626 7603626 FRNGLTLSPVSYHICAVPQQ* 7603564 Cyp2j6 mouse GenEMBL U62295 (2046bp) NT_039263.1 J. Ma and D.C. Zeldin, unpublished. 
clone JM-15 Cyp2j6-de6b mouse GenEMBL NT_039263.1|Mm4_39303_30 detritus exon 6 fragment s in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7513690 TGFNKENLTCDTLDLLSGGIDTTSNGVHWVLLYRSVNKE 7513574 Cyp2j7 mouse GenEMBL XM_143894.1, NT_039263.1|Mm4_39303_30, AF218856 D.C. Zeldin, unpublished. Cyp2j7-de9b mouse GenEMBL NT_039263.1|Mm4_39303_30 from old Cyp2jzzp w in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7177505 GKGACLGKQLAMSQLFIFFTSLMQKSTFKPPINENLSLKFTMSP 7177374 7177375 LSPVSHHIYAVPRQ 7177334 Cyp2j7-de9c mouse GenEMBL NT_039263.1|Mm4_39303_30 from old Cyp2jzzp x in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7157638 GNRACPGEQLAMIELFIFFTALMQKCTFKSTVNEKLGLKIRLDLPLSPVSHHICAVPRQ 7157462 Cyp2j7-de9d mouse GenEMBL NT_039263.1|Mm4_39303_30 from old Cyp2jzzp y in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7138888 GKRTCHGKQLARSELFIFFTALMHIFTLNPPISKKLSLKFSMGLAFSPVSH*ICVVPTQ 7138712 Cyp2j8 mouse GenEMBL NT_039263.1|Mm4_39303_30 AF218857 AI429871 vv77f02.y1 69-184 (EST), AA760476 vv77f02.r1 69-227 (EST), AZ393698 283-329 (GSS), AI606765 vv77f02.x1 330-476 (EST) AZ057726 422-463 (GSS), XM_131520.1 (from nr) AL772157.1 htgs AC102925.1 D.C. Zeldin, unpublished. clone WQ4-1 Cyp2j8-de2b mouse GenEMBL NT_039263.1|Mm4_39303_30 detritus exon 2 t in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7429084 LEKYGNNFSLILGD*TLVVITELLLTKEACIHMEQNILNHPATFIQECNSKK 7428929 Cyp2j8-de9b mouse GenEMBL NT_039263.1|Mm4_39303_30 detritus exon 9 u in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7417728 ERLIRSKIFSFTLSLKMKSSIYMEVFSFKP 7417639 Cyp2j8-de9c mouse GenEMBL NT_039263.1|Mm4_39303_30 detritus exon 9 v in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7414356 EQLARSEMFIFFIALMEKFTFKASVNEKLSLKFRMGFNLPQVSHNICAVPRY* 7414198 Cyp2j9 mouse GenEMBL NT_039263.1|Mm4_39303_30 AK018422 lung, also AF336850 D.C. Zeldin, unpublished. 
clone WQ24-1 CYP2J10 rat GenEMBL XM_233199 Yu Z, Huse LM, Adler P, Graham L, Ma J, Zeldin DC, Kroetz DL. Mol Pharmacol 2000 May;57(5):1011-20 Increased CYP2J expression and epoxyeicosatrienoic acid formation in spontaneously hypertensive rat kidney. ortholog of mouse Cyp2j12 Predicted by GNOMON 86% to 2j12 mouse (LOC313373), mRNA. 2J10 seq specific rev primer matches 116499966-116499989 forward primer 1 = 116515946 116515968 116516004 MLSTEDTLEAAIRALLHFRTLLLAAVTFLFLANYLKTRRPKNYPPGPWRLPFVGNLFQLDVKQPHVVIQK 116515795 116508667 FVKKYGNLTSLDFGTIPSVVITGLPLIKEAFTNTEQNFLNRPVTPLRKRVFNNN 116508506 116505791 GLIMSNGQTWKEQRRFTMTTLKNFGLGKRSLEQRIQEEANYLVEAIGADK 116505642 116505144 GQPFDPHFKINSAVSNIICSITFGERFEYEDSLFQELLRLLDEASCLESSMMCQ 116504983 116500081 LYNVFPTIIKYLPGSHQTVLRNWEKLKLFISCMMDSHQKDWNPDEPRDFIDAFLTEMAK 116499905 116496152 YRDKTTTSFNKENLIYSTLDLFFAGSETTSNILRWSLLYITTNPEVQ 116496012 116489147 EKVHSEIDRVIGHRRQPSTGDRDAMPYTNAVIHEVLRMGNIIPLNVPREMTADSTLAGFHLPK 116488959 116488244 GTTILTNLTGLHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSM 116488107 116479687 GKRACPGEQLARTELFIFFTALMQNFTFKPPVNETLSLKFRNGLTLAPVSHRICAVPRQ 116479511 Cyp2j11 mouse GenEMBL XM_131521, AC091461.3 Unigene Mm.26915, NT_039263.1 Joan Graves, Hong Wang, and Darryl Zeldin Clone name CYP2JA Cyp2j12 mouse GenEMBL XM_143892 (genbank entry missing part of exon 4) NT_039263.1|Mm4_39303_30 Cyp2j13 mouse GenEMBL NT_039263.1|Mm4_39303_30 Map view locus LOC230459 Joan Graves, Hong Wang, and Darryl Zeldin Clone name CYP2JC CYP2J13 rat GenEMBL XM_233198 1455 bp ortholog of mouse Cyp2j13 Predicted GNOMON Rattus norvegicus similar to CYP2J4 (LOC313372) mRNA. 
Missing exon 1 74% to XM_233199, 79% to 2J4 78% to 2J3 90% to 2j13 mouse 116449294 FVKKYGNVISLDLGIMSSVIISSLPLIKEAFSHLDENFINRPIFPLQKHIFNDN 116449133 116446157 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAHHLVEAIGEEE 116446008 116445630 GQPFDPHFKINNAVSNIICSITFGERFEYHDSQFQELLKLLDKAMYLGTPMMIH 116445469 116440971 LYNMFPWIIKHLPGQHQTLLATWGKLKSYIADIIENHREDWNPAEPRDFIDAFLNEMAK 116440795 116428766 YPDKTTTSFNEENLICSTLDLFLAGTETTSTTLRWAVLYMALYPEVQ 116428626 116426881 EKVQAEIDQVIGQEKHPSLADRDSMPYTNAVVHEIQRMGNIVPLNVPREVAVDTTLAGFHLPK 116426693 116426568 GSVVMTNLTALHMDPKEWATPDVFNPEHFLENGQFKKRDSFLPFSM 116426431 116423270 GKRACLGEQLARSELFIFFTALMQKFTFKPPTNEKLSLKFRLGITISPVSHRICAVPRL 116423094 Cyp2j13de1X mouse Detritus exon 1 7kb downstream of 2j13 (exon 8) Note: this is an early and incorrect nomenclature for Cyp2j13-de8b Cyp2j13-de8b mouse GenEMBL NT_039263.1|Mm4_39303_30 detritus exon 8 ABOUT 7000BP DOWNSTREAM OF 2J13 z in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004) 7025751 GSVVLTNLTALQVDPKD*ATPDVVIPEHFLKNGEF*KGESFLPFSIG 7025611 >Cyp2j14-ps mouse GenEMBL NT_039263.1|Mm4_39303_30 exons 3,4,9 7377737 XXXXXSNGQTWKEQKRFALMILKNFELGKKSLEQHIQEEANHLLEAMGEEK 7377600 7376950 GQPFDPHY 7376927 7376925 VSNIICFITFGDHFEYDDNKFQELLKLTDETLCSEASMMLV 7376803 7353938 GKRSCPGEQMAISELFIFFT 7353879 7353880 LFTQKFTFSPPVNEKLKFKNGLTLSPVSHHICAVPRQ* 7353767 >Cyp2j15-ps mouse GenEMBL NT_039263.1|Mm4_39303_30 exons 3,4,5,9 7271792 GFI*SSSQIWKD*RFILMTLKHFGLGKILVHLMQGESCCHLVGA 7271661 7271288 GQHSDLHFIINNAVCNIIFSVTFDCFLETHDCRFQEMLKLMDEFICLETTMLHQ 7271127 7245486 LYNVFPHLMKYILVSLQTVFRN 7245421 7245421 RGKLKLLASCMIDKHVRDWNPD*PRDFIDVFFKEMMK 7245311 7232303 GKRACHGEQLARSELFIF*TALIQKFVFKVPVNEKLSLKFRLGFPLPPVNHHIYAVPRD* 7232124 CYP2J16-de2b5b9b rat UCSC browser (- strand) frag x in figure below 116691748 KKYGNIFGLNLGDLTSEVITGLLLSKE 116691668 exon 2 116684743 FYDIFPYLMKYIPGITSNCFQKLGKLKLFVSCMTDEHRRDWNPEDPRNFTDALLKEMMK 116684567 exon 5 116677505 GKRACPGEQLARSKLFIFFTALIQKFTF 116677422 116677420 RLGMKSILGLTLSPVTHHI*ALSKQ 
116677346 exon 9 rat, mouse and human 2J cluster CYP2J16 rat UCSC browser (- strand) 116664772 MLATVGSLLAKIWSAINFWTLLLTLLTFLLLADYLKNRRPNNYPPGPWRLPFVGNLFQFDLNISHLHLRIQQ 116664557 116654396 FVKKYGNLISLDFGNISVVVITGLPLIKEALINNEQNFLKRPIVPSRYRVFKDN 116654235 116651622 GIFFANVHKWKEQRRFALTMLKNFGLGKKSLEQCIQEEAHHLVEVIGEEK 116651473 116650955 GQPFDPHFRINNAVSNIICSITFGERFEYDDSQFQELLKLADEVICSEASMTSV 116650794 116640170 LYNVFPLIFKYLPGPHQTVFKNWEKLKSIVANMIDRHRKDWNPDEPRDFVDAFLTEMTK 116639994 116638624 YPDKTTTSFNEENLIATTLDLFFAGTETTSTTLRWALLYITLNPEVQ 116638484 116627938 EKVHSEIDRVIGHGRLPSTDDQDAMPYTNAVIHEVLRMGNIIPLNVPREVTADSTLAGFHLPK 116627750 116624337 GKMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSV 116624200 116612610 GKRACPGEKLAKSELFIFFTALMQNFTFKAPTNEKLSLKLRKGLSLYPVSYRICAVPR 116612437 rat, mouse and human 2J cluster CYP2J16-de5c6c9c rat UCSC browser (- strand) 72% to 2j6 mouse, frag y in fig below 116604392 LYNIFPWIMNYGPGSHQ 116604342 116604222 exon 5 116604345 SVFRNWEKLKLFVSCMIDNKQRWVP 116604271 exon 5 116602255 YPEKSTSFSQGHLFCSTLNLFRAGSET 116602175 exon 6 116591992 GKRACPGEQMAISELFSFFAAFMQ 116591921 exon 9 116591919 KFTFHLAINEKLRMKFRNGLTLP*SSHLYC 116591830 exon 9 rat, mouse and human 2J cluster CYP2J17P rat UCSC browser (- strand) 116584536 MLATASCLVANVCSAIPLWTLLLAALSWLPQKQAPQKQPSRALAPAIFGNLFQFDLDVSQLHSGI*PSKK 116584327 exon 1 116581102 FVTKYGNLISLDFGNTSSVIISGLPLIKEALTDM 116581001 exon 2 116580637 EQNLLKCIVLASREHVFKNN 116580578 exon 2 last half 116570454 LYNVFPFIIKYL 116570419 exon 5 116570408 NQTFFRNWENLNLFVSHMMESHRKDWNPVEPRDFIDAFLTYMTKEDD 116570268 exon 5 last half 116566151 KVHSEIDGVTGHGRPPSTGDRDSMPYTNAVIYEVLRMDNINPLKVPREVTADSTLDEFCLSK 116565966 exon 7 116563406 GTMVLINLTALYRESKEWTTQDTFNPEHFLENGMFKKRESF 116563284 exon 8 116559748 KFTFKPPISEKLSLKFRTGLTLSHVSCRI*SIHR 116559647 exon 9 CYP2J18P rat UCSC browser (- strand) 63% to 2j6 mouse 116551335 MLGTQDILEAGIWALLH 116551285 exon 1 116551282 RTLLLAAVTFLLLADYLKTGNK 116551217 exon 1 116551217 KKYPWGPCNPPVMNNLFQLDLEQ 116551149 exon 1 
116537661 LYNAFLSIMKYHPGSHQ 116537611 exon 5 116537614 SVFRNWEKLIWRMSHIAENHCKG*NPAEL 116537528 exon 5 116537523 REFIDAFLTKMTK 116537485 exon 5 116534551 YPDKTTTNFNEENLICA 116534501 exon 6 116534498 LEFLFARTEITSTTLSWVLLYLSANPGVQ 116534412 exon 6 116529361 LFIFFTSLMQKFTFKPPISEKLILKFRMGLILSPVCH*ICVVPRQ* 116529224 exon 9 Cyp2jbbpX mouse XM_143896 Map view locus LOC230464 exons 3-4 and exon 9 temporary placeholder name for Cyp2j14-ps Cyp2jzzpX mouse Map view locus LOC230460 3 C-term fragments ABOUT 19KB APART temporary placeholder name note this is an old name for Cyp2j7-de9b, Cyp2j7-de9c, Cyp2j7-de9d CYP2J19 Gallus gallus (chicken) NW_060417.1 weakly like a CYP2J, 52% to 2J2 human BI390850.1 EST all the best hits are CYP2Js 12644 MDFRFWPISQLGKLNVSMLLVVLVMFLLIIDFVRKRRPRNFPPGPQLFPLVGTIVDLRQPLHLEMQK 12444 10910 LTARYGNIFSVQFGGLTFVVVSGYQMVREALVHQAEIFADRPHIPLLQEIFRGF 10749 10125 GLISSNGHIWRQQRKFVSATLKSIAVSFESKVQEESRYLVEAMEEEK 9985 8514 GQPFDPHYKINSAVSNIICSITFGNRFNYHDSNFQELLHLLAETLLLIGSFWGQ 8299 7615 LYNAFPLIMRWLPGPFRKIFRHWEKLQRFVRGVIAKHKEDLDQSDLGDYIDCYLKEIEK 7439 7077 CKGDTNSYFHEENLLCSTLDLFLTGTETTATAIRWALLYMAAYPHIQ 6937 6401 EKVQLEIDAVIGQCRQPTMEDKEHMPYTSAVLSEVLRMGNIVPLGVPRMSTNDTTLAGFHVPK 6213 5285 GTTLMTSLTSIMFDKNVWETPDTFNPEHFLENGQYRRREAFLPFSA 5148 4669 GKRACPGEQLARTELFIFFTALLQKFTFQAPSATVLSFAFTLSLTRCPKPFQLCALPR 4496 CYP2J20 Gallus gallus (chicken) NW_060417.1 weakly like a CYP2J, 52% to 2J2 human This sequence joins with the rest of the gene on NW_060416.1|Gga8_WGA225_1 joined by EST BI064782.1 (part of a 6 gene CYP2J cluster) 1641 MLRFLWDSISLQMLFIFLLVFLLVSDYMKRRKPKDFPPGPFSFPFLGNVQFMFAKDPVVAIQK 1453 943 FIEKHGDIFRTQVGSMSFVIVNGLPLIKEALVTQGENFMDRPEFPTNTEFFNKF 782 574 GLVSSNGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTDAFRDEQ 425 GNPFNPHLKVNNAVSNIICSVTFGNRFEYHDEDFQNLLRLMNETAILQGKIMSQ 15531671 LYNFFPSVIKYFPGSHQTVIKNGRLMKRFVCKKISKHKEDLSPSESRDFIDSYLQEMAK 15531495 15531239 PNGRDFCEDNLVACTLDLFFAGTETTSTTIRWALLYMAIYPEIQ 15531108 15530636 
ARVQAEIDAVIGQARQPSLEDRSNMPYTNAVIHEVQRKGNIIPFNVPRQAVKDTVLAGFRVPK 15530448 15529975 GTILIPNLSSVMFDMKEWETPHSFNPGHFLKDGQFWKREAFMPFSI 15529838 15529096 GKRACLGELLARAELFLFFTALLQKFTFQAPPDTILDLKFTHGMTLAPQPYMICAVPR 15528923 CYP2J21 Gallus gallus (chicken) NW_060416.1|Gga8_WGA225_1 (part of a 6 gene CYP2J cluster) 15526022 MLRFLWDSISLQMLFVFLLVFLLVSDYMKKRKPKDFPPGPFSFPFLGNMEFIIAKDPVAVTEK 15525834 15525310 FIEKHGDIFSTQVGSMSFVIVNGLPLIKEALVTQGENFMDRPEFPINTEFLNKF 15524941 GLVFSNGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTDAFRDEQ 15524792 15523650 GNPFNPHLKVNNAVSNIICSVTFGNRFEYHDEDFQNLLRLMDETVTLQGEPMSQ 15523489 15522627 LYAFFPSIIKYFPGSHQTVLKNEKLMKRFVCKKISKHKEDLSPSESRDFIDSYLQEMAK 15522451 15522209 KPNGSDFCEDNMVSCTLDLFFAGTETTSTTIRWALLYMAIYPEIQ 15522075 15521605 ARVQAEIDAVIGQARQPSLEDRSNMPYTNAVIHEVQRKGNIIPFNXXXXXXXXXXXXXXXXXX 15521471 15521289 XXLLIPNLSSVMSYKKQWETPHSFNPGHFLKDGQFWNREAFMPFSI 15521158 15520424 GKRACLGELLARAELFLFFTSLLQKFTFQAPPDTILDLKFTVGITLAPQPYKICAVPR 15520251 CYP2J22 Gallus gallus (chicken) NW_060416.1|Gga8_WGA225_1 (part of a 6 gene CYP2J cluster) 15518269 MLRFLWDSISLQMLFVFLLVFLLVSDYMKKRKPKDFPPGPFALPFLGNVQLMVAKDPVSTVQK 15518081 15517552 XXEKHGDIFSMQVGSMSFVIVNGLQMIKEALVTQGENFMDRPEFPMNAEVFNKF 15517403 15517205 GLLSSNGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTDAFRDEQ 15517056 15515960 GNPFNPHLKINNAVSNVICSITFGNRFEYHDEDFQNLLRLMDETVTLHGKIMSQ 15515799 15514587 LYTFFPSIVKYLPGSHQTVIKNGKLMKDFVCNVISKHKEDLNPSESRDFIDSYLQEMAK 15514411 15514166 PDSSDFCEDNLVSCTLDLFFAGTETTSTTIRWALLFMAMYPEIQ 15514035 15513576 ARVQAEIDAVIGQARQPSLEDRNNMPYTNAVIHEVQRKGNIIPFNALRLTVKDTVLAGFRVSK 15513388 15512873 GTILIPNLSSVMYDKKEWETPHSFNPGHFLKDGQFWKREAFMPFSI 15512736 15512011 GKRACLGELLARAELFLFFTSLLQKFTFQAPPDTILDFKFTMGITLAPRPYKICAVPR 15511838 CYP2J23 Gallus gallus (chicken) NW_060416.1|Gga8_WGA225_1 (part of a 6 gene CYP2J cluster) 15510424 MLRFLWDSISLQMLFVFLLVFLLVSDYMKRRKPKDFPPSPFSFPFLGNVQFMFAKDPVVATQK 15510236 15509668 XXEKLGDIFSMQAGSQSFVIVNGLPLIKEALVTQGENFMDRPEIPLDTDIFSKL 15509519 15509300 
GLISSSGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTEAFRDEQ 15509151 15508915 GNPFNPHLKINNAVSNIICSVTFGNRFEYHDENFQTLLRLMDETVTLHEKIMSQ 15508754 15508232 LYNAFPSIVKYLPGSHQTIFKNWRLMKDFVNEKISKHKEDLNPSESRDFIDSYLQEMAK 15508056 15507812 PSGSEFHEENLVACALDLLFAGTETTSTTIRWALLFMAVYPEIQ 15507681 15507221 AHVQAEIDAVIGQARQPALEDRNNMPYTNAVIHEVQRKGNIIPFNVPRQAVKDTVLAGFRVPK 15507033 15506561 GTILIPNLSSVMYDKKEWETPHSFNPGHFLKDGQFWKREAFMPFSI 15506424 15505718 GKRACLGELLARAELFLFFTSLLQKFTFQAPPDTILDFKFTMGITLAPRPYKICAVPR 15505545 CYP2J24P Gallus gallus (chicken) NW_060416.1|Gga8_WGA225_1 (part of a 6 gene CYP2J cluster) 15504220 DSMKRQWLNFFKSIVGQQQLHCADYMKRRKPKDFPPSPFSFPFLGNV*FMFAKDPVVATQK 15504038 15503534 IIEEHGDIFSMQVGTQSFVIVNGLPLIKEALVTQGENFMDRPEIPMNAEVFSKL 15503385 15503168 GLLSSNGHL*KQQRRFTLTTL*NLGLGKRSLEERIQKECQFLTDAFRDEQ 15503019 15501515 GNPFNPHLKVNNAVSNVICSITFGNWFEYHDKDFQNLLQLMDETATFYGKIMNQ 15501354 gap 15501024 PNGSDFCGDNLVLCTLDLFFAGTETTSTTIRWALLFMAIYPEIQ 15500893 gap 15498733 GKRACLGELLARVEIFLFFTSLLQKFTFQAPPDTILDVKFTMGITLAPQPYKICAVPR 15498560 CYP2J25 Phalacrocorax carbo (Common cormorant) No accession number Hisato Iwata submitted to nomenclature committee 5/19/05 78% to 2J23, 76% to 2J22, 70% to 2J21, 75% to 2J20 55% to 2J19 CYP2J26 Bos taurus (cow) See cattle page for details MLEALGSLVAALWTTLRPGIVLLGAFVFLLFADFLKRQHPKNYPPGPLRLPFIGNFFHLDLGKGILVPQQ VVKKYGNIIRLDFGVIHFIVITGLPYIKEALVNQEQNFVNRPMIPLQKHIFNNK GLVRSNGQVWKEQRRFTLTTLRNFGLGRKSLEERIQEEVTYLIQAIGEEN GQPFDPHFIINNAVSNIICSITFGERFDYKDDQFQELLRLLDEILCIQASVCCQ LYNAFPRIMNFLPGSHHTLFRKWEKLKMFVANVIENHRKDWNPAEARDFIDAYLQEIEK 11676 HKGNATSSFDDENLICSTLDLFLAGTETTSTTLRWGLLFMALNPEIQ 14705 EKVQAEIDRVLGQSQKVSTASRESMPYTNAVIHEVQRMGNIVPMNVPREVTVDTVLAGYH 15236 LVKGTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRESLTSSPASYRLCAIPRA* 25310 CYP2J27 Bos taurus (cow) See cattle page for details MLEALGSLAAALWAALRPGTVLLGAVVFLFLDDFLKRRRPKNYPPGPPPLPEVGNFFQLDFDKAHLSLQR FVKKYGNVFSVDFGIFRSVLITGLPLIKEALVHQDQNFANRPLIPIEKRIFNNK 37352 
GLIMSNGHVWKEQRRFALTTLRNFGLGKKSLEERIQEEAAYLIQEIGEEN 39667 GQPFDPHFTINNAVSNIICSITFGERFDYQDDQFQELLRLFDEMMHLRTSTCCQ 40221 LYNIFPRIMSFLPGPQHALFSKWEKLKMFIAGVVENHKRDWNPAEARDFIDAYLQEIEK 42145 HKGNATSCFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 43949 EKVQAEIDRVLGQSQKPSMAARESMPYTNAVIHEVLRMGNILPLNVPREVTVDTVLAGYRLPK GTMVTTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRMSMTLSPLSHRLCAIPRA* CYP2J27-ie5b Bos taurus (cow) See cattle page for details extra internal exon 5 LSNVFPRIMNFLPGPQHTLFSKWEKLKMFIAGVIENHKRDWNPAEARDFVDAY 41591 CYP2J28 Bos taurus (cow) See cattle page for details MLEALGSLAAALWAALRPGTVLLGAIVFLLLTDLLNRRRPKNYPPGPPRLPFVGNFFQLDFEQGHLSLQR FVKKYGNLFSLEFGDLPSVVITGLPLIKEVLVYQDQNFVNRPISPIRERVFKKN GLIMSNGHIWKEQRRFSLTALRNFGLGRKSLEERIQEEVAYLIQAIGEEK GQPFNPHFKINNAVSNIICSITFGERFDYQDDQFQELLRLLDEVTYLETTVWCQ LYNVFPRIMNFLPGPHQMLFSNWRKLKMFVARVIENHKRDWNPAEARDFIDAYLQETEK HKGNAASSFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 716 EKVQAEIDKVLDESQQPSMATRESMPYTNAVIHEVQRMGNILPLNVPREVTVDTVLAGYHLPK GTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKREAFLPFSI GKRMCLGEQLARTELFIFFTSLLQKFTFRPPEHEELSLKFRMGLTLSPVSHCLCAVPRA* CYP2J29 Bos taurus (cow) See cattle page for details MLSSLAAALWAALRPGTVLLGAVAFLFFADFLKRRRPKNFPPGPAGLPFVGNSFQLDPEKVHLTLQQ FVKKYGNVFSLDFGTFPSILITGLPLIKEALVHQGENFSKRPVMPLQERIFNTK GLIMSSGHIWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQMIREEN GKPFDPHFIINNAVSNIICSITFGERFDYQDSQFRELLRLLDEVLNLHTSLCCQ LYSVFPRIMNFVPGPHQTLFSNLEKLKMFVAEMIENHKRDWNPAEARDFIDAYLQEIEK 8435 HKGGDASSFREENLIYSTLDLFLAGTETTSTSLRWGLLYMALNPEIQ 5634 EKVQAEIDRVLGQSQQPSTAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 5455 GTVVVTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI 2548 GKRMCLGEQLARAELFIFFTSLLQKFTFRPPENEKLSLKFRVSLTLAPISHRLCAVPRG* CYP2J30 Bos taurus (cow) See cattle page for details MLEALSSLATALWAALRPDTVLLGTLAFLLFVDFLKRRHPKNYPPGPPGLPFVGNLFQLDPEKVPLVLHQ FVKKYGNVFSLDFGTVPSVLITGLPLIKEVLVHQGQIFSNRPIVPLQEHIINNK GLIMSSGQLWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQTIREEN GQPFDPHLTINNAVSNIICSITFGERFDYQDDQFQELLRMLDEILNLQTSMCCQ 
LYNVFPRIMNFLPGPHQALFSNMEKMKMFVARMIENHKRDWNPAEARDFIDAYLQEIEK HKGDATSSFQEENLIYNTLDLFLAGTETTSTSLRWGLLFMALNPEIQ EKVQAEIDRVLGQSQQPSMAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 15084 GTMVMTNLTALHRDPTEWATPDTFNPEHFLENGQFKKRESFLPFSI 12265 GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEQLSLKFRVSLTLAPVSHRLCAVPRG* CYP2J31P Bos taurus (cow) See cattle page for details MGAAAFLFVVHLKRRRGKNYPPGPPGLPFLGNFFHLDLKQLHLSLQQ IVKKYGNMISLEMGGFSTVFFKWIAQNQRSPCLPGPKLVNHPIQRIQENIFKKH 5343 GLIMSNGHIWKEQRRSALTTLRNFGLGRKILEECIQEEAAYLIQTVGEEN 8001 XQPFDPHFTINNAVSNIVCSIAFGELFDYQDSXXQELLRLMDEAMYLQTSVRCRV 8538 LYNFFARIMNFLPGPHQTLFIKWEKLNMFIDSVIENHRRDWNPAEPRDFTDA 15856 GMWMCPGEQLARTELFIFFTSLLQKFTFRPPGDEKLSLQFRVSLTISSVSHWLC 16020 CYP2J32v1 pig BW982013.1 CB287444.1, Z84061.1, BE014607.1 97% to CJ016505.1, 80% to 2J27 cow, ALGSLAEALWTALRPSTILLGAVAFLFFADFLKKRRPKNYPPGPPRLPFIGNLFHLDLDK GHLSLQRFVKKYGNVFSLDFGALSSVVITGLPFIKEAFVHQDKNFSNRPIVPIQQRVFKD KGVVMSNGQVWKEQRRFALTTLRNFGLGKKSLEERIQEEAQYLIQAIGEENGQPFNPHFK INNAVSNIICSITFGERFDYQDNQFQELLKLLDEVMCLQTSVWCQIYNIIPWIMKFLPGP HQTLFSNWEKLKMFVAHVIENHRRDWNPAEARDFIEAYLQEIEKHTGDATSSFQEENLICS TLDLFVAGTDTTSTTLRWGLLYMALYPEIQEKVQAEIDRVLGQLQQPSSSARESMPYTNA CYP2J32v2 pig CJ016505.1 NRPTVPIQQRVFKDKGVVMSNGQVWKEQRRFALTTLRNSGLGKKSLEERIQEEAQYLIQA IGEENGQPFNPRFKINNAVSNIICSITFGERFDYQDDQFQELLKLLDEVMCLQTSVWCQI YNIIPWIMKFLPGPHQTLFSNWEKLKMFVAHVIENHRRDWNPAEARDFIDAYLQEIEKHK GDATSSFQEENLICSTLDLFVAGTETTSTTLRWGLLYMALYPEIQEK VQAEIDRVLGXLQQPSTAARESMPYTNA CYP2J33 pig BP170090.1 CK453810.1, BW982704.1, DB811462.1 DB817476.1, DY414727.1 DY418828.1 85% to CJ016505.1 80% to 2J28 cow MTQALGSLAEALWTALHPSTLLLGAVTFLFFADFLKKRRPKNYPPGPLRLPFVGNLFHLD FEKAHLSLQRFVKKYGNIFSLDLCALSAVVVTGLPLIKEVLVHQNQKFANRPILPIQDRV FKNKGVVTSSGQVWKEQRRFTLTTLRNFGLGKKSLEERIQEEAQYLIQAIGEENGQPFNP QFKISNAVSNIICSITFGKRFDYQDDQFQELLRLLREVTHLQTLLWCQLFNVFPRIMKFL PGPHQTLFSDWEKLEMFIARVIENHRRDWNPAEARDFIDAYLQ EIEKNKGNATSSFHEENLICSTLDLLFPG TDTTLITLRWGLLYMALHPEIQEKVQAEIDRVLGQSQQPSTAARESMPYTNAVIHEVQRM GNIIPLNVPREVAEDTTLAGYHLPKGTMVLTNLTAL 
HRDPAEWATPNIFNPEHFLENGKFKKREAFLPFSIGKRACLGEQLARTELFVFFTSLLQK FSFRPPDNEKLSLKFRVGLTLSPVTYCICAVPRA* CYP2J34 pig BW981916.1, CJ028862.1, BW967356.1, CJ025847.1, BP142154.1 BP168104.1, CJ025026.1, BW967863.1, 83% to BW982013.1, 80% to 2J28 cow MTPALGFLAEALWTALRPSTLLLGAVAFLFFADFLKRRSPKNYPPGPPRLPFLGNFFHLD VEKGHLALQRFVKEYGNIISLDSSVFSSVVITGLPLIKEAFVHQDQHFANRPMIPTQERV FKKNGLIMSNGQVWKEQRRFALTTLRNFGLGKKSLEERIQEEAQYLIQAIGEENGQPFNP HFKINNAVSNIICSITFGKRFDYQDDRFQELLRLLDEVTCQHTSVQVQLYNMFPRIMKFL PGPHQTLFSNWEKLQIFVACVIENHKRDWNPAEARDFIDAYLQEIEKHKGNATSSFQEEN LIFTTLDLFFAGTETTSTTLRWGLLYMALYPE CYP2J35 pig BW960287.1, BI359857.1 75% to 2J28 cow MLGAVGFLAEVFGTALGPSALLLSAVAFLFVADILKRWRPKNYPPGPLRLPFVGNFLHLD FEQWHLSLQRFVKKYGNVLSLDLGAFSSVVITGLPLIKEALVHQDQNFVNRPINLNQV FQKNGLIMSNGQVWKEQRRFALTTLRNFGLGKKSLEERIQEEAQYLIQAVREENGQPFDP HFKINNAVSNIICSITFGERFDYQDDQFQELLRLLDEVTCL PKLVRVQLFNVFPRIMKLLPGPHQIIFSNREKLRMF IARVIENHRRDWNPAEARDFIDAYLREIEKGSSPSVFNEENLICSTLDLFFAGTETTS TTL CYP2J36 Anolis carolinensis (anole lizard) scaffold 23 3305369-3326894 (-) strand (small gap in exon 8) 55% to CYP2J2, 43% TO CYP2C8 3358582 MWFHAFAIFWETISLQVILGFLATFLLLTDYVKRRRPRGFPPGPIPLPFLGNLLSYDAKKPHLYNQK 3358382 3357138 LVAIYGNVFSLQLGNIHIVFLNGLQAVKEALINQGESFLDRPKVPITYDVSKTF 3356977 3351644 GVITSNGQTWKQQRRFVMSTLRNFGLGKTYLEERIQEESRFLVAAIEDEK 3351495 3348890 GQPFDPYHQINNAVSNVICSVTFGNRFDYHDSDFQKLLHLLDETGVFLRNIWSH 3348729 3347734 LYNAFPSLMRRLPGPHQTYFKNWEQLKSFVRKIIEKHKEDWNPLKTKDFIDAYLNEMAK 3347558 3346355 FKENASSTFHMENLLQSTLDLFVAGTETTSATLHWAVLYMAVYPEIQ 3346215 3343877 AKVQAEIDSVIGQSHLPAMADRDNMPYTNAVIHEIQRRSSIVVVNAPRLTANDTQVAGFHLPK 3343689 3337326 xxxxxxxLTSILFDKNEWETPNVFNPNHFLKNGQFMKREAFVPFST 3337210 3335618 GKRACPGEQMAKMELFLVFTTLLQKFTFQAPKGVKLSLDSKTGHVLKPKPYQICAISR* 3335442 CYP2J37P Anolis carolinensis (anole lizard) scaffold 23 3305369-3326894 (-) strand pseudogene 57% to CYP2J2, 43% TO CYP2R1 3326894 MLCHCFAVFWEALSLKIVFVFLFTFLIIADYIRQRRPRGFPPGPRPLPFVGNLFSVDITKPHLSSEK 3326694 3325276 
FMEIYGKIFSLQLGKFPFVIVNGLQLVKEALIHQNENFVDRPILPIIYDHSKTF 3325115 3322787 GLIMSNGLSWKQQRRFALSTLRNFGLGKRSLEEQIQEESRFLVGAIEDEK 3322638 3320225 GQPFDSHYQINNAVSNVICSVTFGKCFDYHDSQFQKLLHLLDEMGNVQAGFWGM 3320064 3309149 AYNTFPALMKLLPGPHQTVFKNWDQLKSFVRKIIEKHQNWNPLETRDFIDAYLNEIAK 3308976 3308595 LKD*ASSSFHMENLLQ*TIDLFIAGTETETTSATLRWAVLYMAIYPDIQ 3308449 3307295 GKVQAEIDSVIGQSRSLTMADRDSLPYTNAVIHEIQRMGNILPFSAPRVAVNDTRLAGFYLPK 3307107 3305985 GTILLPNLTSLLFDKDEWDTPNKFNPNHFLKDGQFMKREAFIPFSI 3305848 3305545 GKRSCLGEQLARMELFLFFTTLMQKFTFQAPNGLRLSLDFKIGNALSPKPYKICAISR* 3305369 CYP2J38 Anolis carolinensis (anole lizard) scaffold 23 3277211-3297585 (-) strand 57% to CYP2J2, 43% TO CYP2C18 3297585 MLFHCFAVFWETLSLKAVLVFLATFLIVADYVRRIHSRGFPPGPMPLPFVGNLLHLDAEKPHFSTQK (0) 3297385 3295355 LADIYGNVFSLQLGNRHFVFVNGLEIVKEVLIHHGENFLDRPKFPIISDHAKTL 3295194 3294395 GLVMSNGLPWKQQRRFALSTLRNFGLGKRSLEERIQEESRFLAGAIENEK 3294246 3288794 GQPFDPHYQINNAVSNVICSITFGNRFDYHDSQFQKLLHLLNETGIIQRSIWAQ 3288633 3286768 LYNIFPALMKQLPGPHQTIFKNWEQLKYFVRTIIKKHQENRNPLETRDFIDAYLNEMTK 3286592 3285518 FKENVSSSFHMENLLQSALDLFIAGTETTSTTLRWALLYMAIYPEIQ 3285378 3282591 ERVQSEIDSVIGQSRPPAMTDRDNLPYTNAVIHEIQRISNILPLNVPRLTTNNTEIAGFHLPK 3282403 3280566 GTILICNLTSVLFDKDEWDTPKKFNPNHFLSNGQFRIREAFVPFSA 3280429 3277387 GKRACLGERLARMELFLFFTALIQKFSFQAPKGVELSLDFKMSLTLSPNQYHICAVSR* 3277211 CYP2J pig BF191621.1, BX914614.2, BQ601924.1 85% to 2J30 cow possible end of 2J34 or 2J35 GQSQQPSIAARECMPYTNA VIHEVQRMGNIIPMNVPREAAEGTTLAGYHLPKGTMVLTNL TALHRDPAEWTTPDRFNPEHFLENGQFKKREAFLPFSIGKRACLGEQLARTELFVFFTSL LQKFTFRPPDNEKLSLKFRMGLTLSPVTYRICAVPRA 2K Subfamily CYP2K1 Onchorhynchus mykiss (rainbow trout) GenEMBL L11528 (1853bp) PIR S45644 (504 amino acids) Buhler,D.R., Yang,Y.-H., Dreher,T.W., Miranda,C.L. and Wang,J.-L. Cloning and sequencing of the major rainbow trout constitutive cytochrome P450 (P450 2K1): Identification of a new P450 gene subfamily and its expression in mature rainbow trout liver and trunk kidney. Arch. Biochem. Biophys. 
312, 45-51 (1994)

CYP2K1v2 Oncorhynchus mykiss (rainbow trout)
GenEMBL AF045052
Buhler,D.R.
note: 98.6% identical to 2K1; may be an allele (5L1FL)
submitted to nomenclature committee

CYP2K1v3 Oncorhynchus mykiss (rainbow trout)
GenEMBL AF045053
Buhler,D.R.
note: 98.4% identical to 2K1; may be an allele (5L6FL)
submitted to nomenclature committee

CYP2K2 Fundulus heteroclitus (killifish)
John Stegeman
submitted to nomenclature committee

CYP2K3 Oncorhynchus mykiss (rainbow trout)
GenEMBL AF043551
Buhler,D.R.
(5L7FL) 96.5% identical to 2K1

CYP2K4 Oncorhynchus mykiss (rainbow trout)
GenEMBL AF043296
Yang,Y.-H., Andersson,T.B., Ryu,B.-W., Wang,J.-L. and Buhler,D.R.
CYP2K4: A New Cytochrome P450 Isoform from Male Trunk Kidney of Post-Spawning Rainbow Trout.
Unpublished
kid8 from kidney

CYP2K5 Oncorhynchus mykiss (rainbow trout)
GenEMBL AF151524
Buhler,D.R.
80% identical to 2K1
clone name KM2-2 from sexually mature male trunk kidney library

CYP2K6 Danio rerio (zebrafish)
No accession number
Wang-Buhler, J.L., Yang, Y.H., Lee, S.J. and Buhler, D.R.
Submitted to nomenclature committee 6/16/2000

CYP2K7 Danio rerio (zebrafish)
GenEMBL AI722500 EST
88% to CYP2K6
Full length translation of this EST allowing frameshifts
INNLFGAGXDTTVTTLRWGLLLFAKYPEIQAKVHDEIDSVIGERQPVPDDRKNLPYTDAVIHEIQRFADILPIG
LLRQTSCDVHLNGYLIKKGTSVFPLIASVLRDENEWETPDSFNPKHFLNKQGQFVKKDAFMPFGAGRRLCIGES
LARMELFLFFTSLLQHFCFTPPPGVSEDELDLTPVVGFTLSPMPHKLCAVKRF*

CYP2K7 Danio rerio (zebrafish)
No accession number
Donald R. Buhler
EST AI722087 fd19b07.y1, AI722500 fd19b07.x1, BF157099 fl60g01.y1
Submitted to nomenclature committee 2/10/2001
503 amino acids, 76% to 2K6, 59% to CYP2K4, CYP2K5

CYP2K8 Danio rerio (zebrafish)
No accession number
Yea-Huey Yang, Jun-Lan Wang-Buhler and Donald R.
Buhler EST 78% to CYP2K5 clone name F2R Submitted to nomenclature committee 7/1/2000 CYP2K9 Fugu rubripes (pufferfish) No accession number Scaffold_12487 3037 MIEDLFESSTSGFLMVAIVSLLLLQ LCFSFISREKRKDLPGPEALPLLGNLHQLDLKRLDCHLVQ 3231 (0) 3299 LSQKYGPIFRVYLASKKVVVLAGYTAVKQALVNQAEDFGEREIFPIFHDFNKGN 3460 (1) 3527 GILFTNGDQWKEMRRFALMTLKDFGMGKRTIEEKIIKECQYLIEAFEQHQ 3676 (1) GEAFSNAQVISYATSNIISAIMYGRRFDYKDPTFQAMIERDHEVIHLTGSPSIQ (0) IYNIFPWLGPFLKTWRYIMKKVEINIESTRRIIGEMKETRNP GTCRCFVDAFLIHKENQE (0) 4483 ESDVNAHYYHEDNLLHCAMNLFGAGTDTTATTLQWGLLYITKYPHIQ 4623 (1) 4692 DGVQEELRRVVGNRQVRVEDRKNLPYMEAVIHETQRMANIVPMSLPHRTS*DTFQGYVIKK (0?) GTMVIPLLTSVLYDESQWEKPHTFNPAHFLDDEGRFVRRDAFMPFSA 5095 (1) 5164 GRRMCLGEGLARMELFLFFASLLQHFRFKPAPGVSEDSLDLTPVVGITLNPLTHKLRAISRF* 5352 CYP2K10 Fugu rubripes (pufferfish) No accession number Scaffold_19693 296 SKKYGPVFKVHFGPRKVVVLAGHKTVKEALVGNAEQFGDRDISPIFYDMNQGHG 457 LKU76565.x1 missing exon 3 and part of exon 4 2727 YATSNIISSIVYGSRFDYDDPRFINMVNRVNEVIRLTGSAPIQ (0) LYNIFPGLANWIKNRQLLLKQVAMNLRDMTDLIQQLKDTLNPGVCRGFVDCFLLRKQKAV (0) 2184 DSGVIDSLYNEKNLLYSLSNLFGAGTDTTATTLRWGLLLMAKYPRIQG QVQQELSMVVGNRRVCVEDRKNLPYVDAV 1813 1812 IHEIQRLGNIAPMAVPHKTARDVEFRGYFIEK 1717 1286 GTTVFPLLTSVLYDENEWETPHTFNPSHFLDKDGKFIKRDAFMPFSA 1146 1063 GRRLCLGEGLAKMEIFLFFTSLLQQFRFTPPPGVGEDELDLTPVVGFTLSPSPHKLCAIPRQ* CYP2K11 Fugu rubripes (pufferfish) No accession number Scaffold_10791 missing exons 1-4 about 176 aa 5891 VYDLFPWIGPLVNNKKLFQSLFAANKKQNLQLFAAAKEMLNPQMCRSFVDSFLARQQILE 5721 (0) 4989 KSGTNVHFHDENLMSTVMNLFNAGTDTTATTLRWGLLLMAKYPLIQ (1?) 
4750 DQVQEELRRVIGSRQVQVEDRKSLPFTDAVIHETQRLANIVPMALPHKTSQDVTLQGFFIEK 4571 (0)
GTTVYPLLTSVLYDETEWEKPLNFYPAHFLDKDGKFVKREAFLPFSA 4355 (1)
4287 GRRICLGEGLAKMELFIFFSTLLQHFRFRPPPGVSEDHLDLTPRVGLTLNPSAHKLCAVSCL* 3999

CYP2K12P Fugu rubripes (pufferfish)
No accession number
Scaffold_3103 Length = 27036
59% to scaf 10791
Heme junction missing the conserved Gly, no upstream seq found
With these defects and a frameshift this is probably a pseudogene
LKB99171.x1 50% TO 2C37
17897 DQVQEELSRVIG 17862 frameshift
17860 SRQVQEGDRKNLSFTNAVIHETQSGHVALTSLPHVTNQDIIFRGHFLKKG 17711 (1)
17388 NYMEDTASVASVLLEETEWEHPHTFYPSHFLEKDRKFVKRDAFLPFSA 17242 (1)
17176 ISRACPGETLARVELFIFLVTLLQHFCFTLAPGVSPDELHVTPSIGSNHSPVAYRLCTVSCM* 16988

CYP2K13P Fugu rubripes (pufferfish)
No accession number
Scaffold_12487
pseudogene frag of 2K9
660 VRVEDRKNLPYMEAVIHETQRMANIVPMSLPHRTSXX frameshift RHLPGIRHQK

CYP2K14P Fugu rubripes (pufferfish)
No accession number
Scaffold_13436b Length = 4942
pseudogene of 2K9 = LGW56404.x1 50% to 2A7
two partial genes in this contig both on minus strand
Scaffold_13436b pseudogene of Scaffold_12487
(fs) = frameshift
3958 VRVEDRKNLPYMEAVIHETQRMANIVPMSLPHRTSRD (fs) RHLLX (fs) GIRH (fs) QK (1?)
alternative frame RDTSFSGDTSSKRFTALFELAHVYV GTMVIPL (fs) LTSVLYDESQWEKPHTFNPAHFLDDEGRFVRRDAFMPFSA (1) GRRMCLGE (deletion 3 nuc) RMELF (insertion 12 nuc) LFF (deletion 33 nuc) VSVDSLDLTPVVGITLNPLTHNLRAISRF* 3368 CYP2K15P Fugu rubripes (pufferfish) No accession number Scaffold_13758 pseudogene 41% to LKB99171.x1 50% TO 2C37 Length = 5303 FC:C094J16aF1, FC:C007E01aF1 pseudogene 740 KGRITQRHFHDEKLMMTVSSHLAAGTHLDTYTALRQEPLVMAK*PEVQ 883 exon 6 (1) 52% to 2K11 Exons 7 and 8 deleted 1284 (1) GLRSCPGEG*SRMKLFIFIVILLQHLCFSSSPVLMEEDLELKTVLGSILNPINCVLFVGRER* 1472 exon 9 48% to 2K9 CYP2K16 seq.c Danio rerio (zebrafish) ctg12742 68% to 2K8 57491 MAFLDALLHVSSTGTLICFLLLLLVAYLLFLRSQSDENEPPGPKPLPLLGNLLMLDVNKPHLSLCE 57294 52779 MAKQFGPVFKVYFGPKKVVVLAGYKAVKQALVNYAEAFGDREIMPLFHDFTKGH 52618 52022 GIIFANGESWREMRRFALTNLRDFGMGKKKIEEKIIEETCHLREEFEKFX 51876 50840 GKPFETAQLMNYAASSVISSIVYGRRFEYTDPQLRTMVDRANESVRLSGSASVQ 50679 50581 LYNMFPFLGPLLKNWRQLMKNLHLDIEEISELVNGLHQTLNHQDLRGFVDSFLVRKQX 50411 50317 DQDSGEKDSHFHEQNLIYTVGNLFVAGTDTTSTTLRWSLLLMAKYPHIQ 50171 43796 DRVQEEIDQVIGGRQPVSEDRKNLPYTDAVIHETQRLANIVPMSIPHMTSSDITFNGYFIKK 43614 43440 GTCIFPLLTSVLWDEDEWETPHIFNPNHFLDEQGRFVKRDAFMPFSA 43300 42178 GRRICLGESLARMELFLFFTSLLQYFRFTPPPGVSEDELELTPAVGFTLNPIAHKLCAVKR 41996 CYP2K17 seq.d Danio rerio (zebrafish) ctg12742 BI427723 zfishC-a1846d04.p1c zfishC-a1146b02.p1c 66780 MAVVESLLHFSSAGTLLGTLLLLLVFYRLSRDSEFQKKRKDPPGPKPIPLLGNLLTLDLSRPFDSLCE 66577 63586 LSKTYGNVYQVFLGPKKVVVLIGHKTVKEALVNYADEFGERDITPIFRXXXXXX 63443 63238 GILFSNGESWKEMRRFAISNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFX 63092 62992 GKPFDTTQPVNYAVSNIISSIVYGSRFEYTDPRFTEMVDRANENIRVSGSVSMX 62834 62747 LYNIFPWLGLFLNSKRTVVRNMLKNRAEFMKLITGLQETLNIHDRRGFVDSFLIRKQX 62577 60380 XXXXGKKDSYFHAENLLMTVGNLFAAGTDTTGTTLRWGLMLMAKYPQIQ 60246 60158 XRVQEEIDRVIGGRQPVVEDRKKLPYTDAVIHEIQRLANIVPMNLPHVTSCDVTFNGYFIKK 59976 59893 GTTVIPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSA 59753 59675 GRRVCLGESLARMELFLFFASLLQSYRFTTPPGVSEDELDLKGTVGVTLNPSPHKLCAIKRF 59490 CYP2K18 seq.e Danio rerio 
(zebrafish) ctg12742 MISSING FIRST TWO INTRONS EXON 3 IS DUPL. MAY BE A PSEUDOGENE 93% to 2K19, 91% to 2K21 zfishK-a1004a03.p1c (100% over 29aa) also matches 2K19, 2K20 78359 MAVVESLLQFASTGTLLGALLLFLVLYLVSSGSESQKEGKEPPGPKPLPLVGNLLTLDLTRPFDT 78165 78164 FFKLSKTYGNVFQVYLGPEKAVVLVGYKTVKEALVNYAEEFGDREIGPGFSIMNDEH 77912 77911 GILFSNGENWKEMRRFALSNLADFGMGKRRSEEK 75750 GILFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKF 75604 75522 GKPFDTTQPVNYAVSNIISSIVYGSRFEYTDPQFTEMVDRANENVRVGGSISMX 75364 75253 LYDIFPWLGPFLKNKRIIVENIIQSRVQMTKLITALLETLNPNDPRGFVDSFLIRKXX 75086 74916 XQKSGKKDSYFHEENLMMTVTNLFIAGTDTTGTTLRWGLMLMAKYPHIQ 74773 74686 XRVQEEIDRVIGGRQPVVEDRKKLPYTDAVIHEIQRLANIVPLSLPHRTTSDITFNGYFIKK 74504 74413 GTTVVPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVRRDAFMPFSA 74273 73457 GRRVCLGESLARMELFLFFTSLLQSYRFTTPPGVSEDELDLKGIVGITLNPSPHKLCAIRR 73275 CYP2K19 seq.f Danio rerio (zebrafish) ctg12742 91% to 2K21 AF221128 (1 aa diff) zfishC-a678c11.p1c (near perfect) 90000 MAVVESLLQFASTGTLLAALLLFLVLYLVSSGSQKEGKEPPGPKPLPLLGNLLTLDLTRAFDTFFE 89803 (0) 89722 LSKTYGNVFQVFLGPRKTVVLVGYKTVKEALVNYAEQFGDREIGPGFRIMNDEH 89561 (1) 89232 GILFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFE 89086 (1) 89004 GKPFDTTQPVNYAVSNIISSIVYGSRFEYTDPQFTEMVDRANENVRVGGSVSMW 88843 (0) 88707 FHEMFPWVGPFLKSKRIIVENIIQSRAQMTKLITALLETLNPNDPRGFVDSFLTRKLSDE 88528 (0) 88365 KSGKKDSYFHEENLIMTVTNLFVAGTDTTGTTLRWGLMLMAKYPQIQ 88225 (1) 88137 DRVQEEIDRVIGGRQPVVEDRKKLPYTDAVIHEIQRLANIVPLSLPHKTTSDITFNGYFIKK 87952 (0) 87861 GTTVVPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFIPFSA 87721 (1) 84188 GRRVCLGESLARMELFLFFTSLLQSYRFTTPPGVSEDELDLKGIVGITLNPSPHKLCAIRRS* 84000 CYP2K19 Danio rerio (zebrafish) GenEMBL AL919697 Tseng, H.-P., Wang-Buhler, J.-L., Hu, C.-H., Hseu, T.-H., Peng, J.R. and Buhler, D.R. Submitted to nomenclature committee Oct. 
14, 2004 JR8 CYP2K20 seq.g Danio rerio (zebrafish) ctg12742 88% to 2K19 and 2K21 zfishC-a1699d01.q1c (100% over 57aa) AF221128 (1 aa diff) zfishC-a678c11.p1c (near perfect) zfishC-a1101c09.q1c (100% over 39aa) 104280 MAVVESLLQFASTSALLGALLLLLVLYLASSGSTSQKEGKEPPGPKPLPLVGNLLTLDLTRSFDTFFE 104077 103997 LSKTYGNIFQVFLGHRKTVVLVGYKTVKEALVNYAEVFGDREIGPGFKXXXXX 103854 102358 GILFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFX 102212 102123 GKPFDTTQPVNYAVSNIISSIVYGSRFEYIDPRFTEMVARANENVRVGGSFSMX 101965 101852 IYNIFPWLGPFLKNRAVVVKNITQNRAEKKKLITALLETLNPHDPRGFVDSFLIHKXX 101685 101522 XQKSGKKDSYFHEENLMLTVANLFAAGTDTTGTTLRWGLMLMAKYPHIQ 101379 101300 DRVQEEIDRVIGGRQPVVDDRKKLPYTDAVIHEIQRLANIVPLSLPHRTTSDITFNGYFIKK 101118 97108 GTTVVPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSA 96968 92153 GRRVCLGESLARMELFLFFTSLLQSYRFTTPPGVSEDALDLKGIVGITLNPSPHKLCAIRR 91971 >CYP2K21 seq.h Danio rerio (zebrafish) ctg12742 91% to 2K19 zfishB-a619a12.q1c (near perfect) 112093 MAVVESLLQFASTGTLLGALLLFLVLYLVSSGSGSQKEGKEPPGPKPLPLLGNLLTLDLTRAFDTFFE 111890 111821 LSKTYGNIFQVYLGPKKTVVLVGYKTVKEALVNHAEAFGDREIGPSFRIMNDXX 111666 109983 GIVFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFX 109837 109744 GKPFDTTEPVNYAVSNIISSIVYGSRFEYTDPQFTEMVDRANENVRVGGSISMX 109586 109441 LYNMFPWLGPFLKNKRIVVRNIIQSRAQMTKLITALLETLNPNDPRGFVDSFLIHKXX 109274 109110 XQKSGKKNSYFHNENLMMNVANLFVAGTDTTGTTLRWGLMLMAKYPQIQ 108967 108879 XRVQEEIDRVIGGRQPAVEDRKKLPYTDAVIHEIQRFANIVPLNLPHTTSCDITFNGYFIKK 108697 108484 GTTVIPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSA 108344 107905 GRRICLGESLARMELFLFFTSLLQSYRFTTPPGVSEDELDLKGIVGITLNPSPHKLCAIRR 107723 >CYP2K21-de1 seq.i Danio rerio (zebrafish) ctg12742 PSEUDOGENE PARTIAL EXON 1? 113358 MAAVETLLQFASTGSLLSALLLLLVWYLVSSESTYQKKGKEPPGPKPLPLLGNLLT 113191 >CYP2K22 Danio rerio (zebrafish) ctg11670 zfishC-a643a08.p1c MISSING EXON 6 GREATER THAN 95% to 2K7. 
9aa diffs in the first exon, only 3 aa diffs in the rest 33920 MALVAALLPGLGFTVSTILAFLLLFLVISYFFSSKDKGKYPPGPKPLPVLGNLHILDLKNTYMSLWK 34120 37393 LSKQYGPVYTVHMGPRTVVVLSGYKVVKEALVNLSEEFGERDISPIFQDFNEGY 37554 37635 GIVFSNGENWKEMRRFALSNLRDFGMGKKRSEELITEEIKYLKEEIERFX 37781 39367 GKPFETKLPLAMAISNVIALIVYSIRFEYNSPKFHRAIVRANENAKLVGSPSVQ 39528 42486 LYNMFPWLRLFVANQKRVVDNVQESFKQIGEIVNGLKKTLNPQSPRGIVDKFLIQQQK 42659 45851 AKVHDEIDSVIGERQPVPDDRKNLPYTDAVIHEIQRFADILPIGLLRQTSCDVHLNGYLIKK 46036 46115 GTSVFPLIASVLRDENEWETPDSFNPKHFLNKQGQFVKKDAFMPFGA 46255 49040 GRRLCIGESLARMELFLFFTSLLQHFCFTPPPGVSEDELDLTPVVGFTLSPMPHKLCAVKRF 49225 CYP2K23 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr XI (-) strand 9794341-9797707 Joanna Wilson and students submitted to nomenclature committee Nov. 6, 2007 61% to Fugu 2K11, 65% to 2K10 MSLFGDFVVYLCSSTSTFLGAVVLLLVLYLVSNSLTRRELRKVPPGPSPLPLLGNLLQLDLKRPYVTLCELSKKH GSVFTVYLGTSRVVVLAGYKAVKEALVNHREEFGDRDISPIFYDLNHGHGILFANGESWKEMRRFALTNLRDFGM GKQLSEHKILEECQYLMEVFEKHQGTEFIYTASPVNYATSNIISAIVYGSRFEYNDPQFMSMVERSNESISVVGS VQIQLYNMFPKLVSWTKKRQLLLNNLTRTVRDVKELILHLKDTLHPQFCRGLVDCFLIQMQKDEEARVNTHYNEK NLIFTVTNLFSAGTDTTATTLRWSLLLMAKYPHIQDQVQEELSRVVGSRQVRVEDRRNLPYTDAVIHETQRLANI VPLAIPHKTSRDVTFQGFFISAGTTVIPLLTSVLRDESEWESPNSFNPSHFLDTEGKFIRRDAFMPFSAGSRACP GESLARMELFLFFTSLLQRFRFTPPPGVKEDDLDLTPAVGFTLTPSPHELCAVSCEGIQNEKII* CYP2K24 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr XI (+) strand 9720129-9723291 Joanna Wilson and students submitted to nomenclature committee Nov. 
6, 2007 59% to 2K10 MLMLEDLFLSYVTVALMLVLMCILVSLFFRSKDKRREPPGPQPLPLLGNLLQMDLKRLDRSLVD (0) LSKKYGSVFTVHLGPQKVVVLAGYKTVKQALVNHAVEFGERRIPQFGNDLMLSDSYR (2) KGIFFANGESWKEMRRFALSNLKDFGMGR KAAEDKIIEEIQYLIEVFERHE (1) GQPFSTGQPMNYAVSNIICSIVYGSRFEYRDKDFKLMVDRANENIQLAGS PSVLLFDMYPGIFHWASNRMRLKRNVFENHKRIKQLIGHLQETFNVELCRGFVDSFLAQKKKLEDSGITDSYYNI ENLVSTVGNLFSGGTDTTSSTLRWGLLLMAKYPRIQYQVQEELSRVVGSRQVRVEDRRNLPYTDAVIHETQRLAN VVPLAIPHKTSQDVTFQGFFIKGGTTVFPLLTSVHHDESEWESPNSFNPSHFLDTEGKFIRRDAFMPFSAGRRAC PGESLARMELFLFFTSLLQLFRFTPPPGVKEDDLDLTPVVGFTLTPSPHELCAVSREGIQNE* CYP2K25 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr XI (-) strand 9676173-9679867 Joanna Wilson and students submitted to nomenclature committee Nov. 6, 2007 59% to Fugu 2K10, 52% to 2K8 Danio MENLFLQLNSTTILLGTVGILLLLYVFLTNFDHKRKEPPGPRPLPLFGNLLHLNLKSFHMTLYELSKKYGSVFSV HLGPQKVVVLAGYKTVKQALVNHAVEFGERYVSPTGHDLSNGIVFGNGESWKEMRRFALTNLRDFGMGKKAAEDK IIEEIQYLFEVFDRHQGQPFNTGQSMNYAVSNIICSIVYGSRFEYSDEEFRLMVDRVNYNIRLAGSPSAKLFDMY PWLFQWTSNRKRLTRNVTENRNQIKRLIGRLQETLNVHMCRGFVDSFLAHKQKLEDLKITDSHYNMENLVSTVSN LFAAGTNTSGTTLRWGLLLMAKYPHIQGKVQEELSRVVGNRQVRAKDRMNLPFADAVIHETQRFANVLPVTIAHK TSTDVTFQGYFIKKGTTVFPLMTSVLWDESEWETPRTFNPAHFLDKDGKFFKRDALMPFGAGRRACPGESLARME LFLFFTSFLQRFRFTPPPGIKEDDLDLTPAVGLTLAPSPHELCAVSREGIQNE* CYP2K26 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr XVIII (-) strand 12862313-12864957 Joanna Wilson and students submitted to nomenclature committee Nov. 
6, 2007 73% to Fugu 2K11 see EST DN708008.1 MGIVDQVLESSSSASLLGVLLVLLLVYLASSFSLGSPKDRKEPPGPTPLPLIGNLLQLDLKRPYNTLLKLSKKYG SVFTVYMGPEKVVVLAGYKTVKEALVNRAEEFGDRQAMLIIREFNQGHGVIWSNGDSWKDMRRFALTNLRDFGMG KRASEDKIIEECEHLIEVFKKHK (1) GEPFDTTQPMNYAVSNIICSIVYGSRFEYDDPQFTSLVDRTNRTIQLV GSPSIQLYNLFPWIGKWIANRNEVETLITANKKQNLQLFSRLKETLNPLMCRGFVDAFLVRKQNLEESKNTNSHF NDDNLMQTVLNLFAAGTDTTATTLRWGLLFMVKNPKIQ (1 GC boundary) DRVREELSEVVGSRQVQVEDRKKLPFTDAVIHETQRLANIVP MAIPHKTTQDVTFQGHFIKKGTTVFPLLTSVLYDESEWEEPHSFHPAHFLDADGKFIKRDAFMPFSAGRRVCLGE SLARMELFIFFSTLLQRFRFTAPPGVSVEDLDLTPRVGFTLNPSTHKLCAVPCV* CYP2K27 Oryzias latipes (medaka) chr8:11128109:11132739: (-) strand Joanna Wilson and students submitted to nomenclature committee Jan. 25, 2008 66% to Fugu 2K10 MDLLMPLVSSPTTVIGAVFLLLVLYLASAGSTSRDLGKDPPGPRPLPLLGNLLQLDPRRPHKALCELSKSYG PVFTVYFGIQKVVVLAGYKTVKEALVNNAEEFGDRDITPMFQDMNKGHGILFANGESWKELRRFALTTLRDFGMG KRIAEEKILEECDYLIQGLEKHQGRKFDLTCPLNYATSNIISSIVYGSRFDYDDPRFRNLVSRANETIRINGHPL THLYNMFPRWFRWIKNRKIILNNVEMTVKDVKDLVKHLKETLNPSVCRGFVDCFLIKKQKEEDSCVKESHFTEQN LVFSVSNLFAAGTDTTATTLRWGLLLMAKYPHIQDKVHEELAKVLGGRQVRVDDRKNLPYADAVIHEIQRVANII PMSIPHKTNRDVTFHGYLIQKGTTVIPLLASVLNDENEWESPHTFNPHHFLSKEGKFVKRDAFMPFSAGRRACLG ESLAKMELFLFFTSLLQRFHFTPPPGVSEEELDLTPAMGFVLAPSSHELCAVSLQ* CYP2K28 Oryzias latipes (medaka) Chr8: 11120126:11125947: (-) strand Joanna Wilson and students submitted to nomenclature committee Jan. 
25, 2008 62% to Fugu 2K19 MIQYIFRFMPASVSLMWVIVGVLVLLFLYFQLSFFNWREPPGPRPLPLLGNLFQVDLKRLDQSLFDLSKKYGPVF VVNFGPKKVVVLAGYRTVKQALVNQAKEFGNREVTPIFYDFNKEHGILFANGESWNEMRRFALSTLRDFGMGKRI SEQNIIEECRWLIEELEKLQGKPFDNTHTISYAVSNVLSGLMFGKRFDYQDPLLQAIVDRDNEIIYLTGTVSILL YNMFPWLGPWLKNWKTLMKNMEAAKTDMKKIIAELKDTLDPDTRRCFVDAFLTQKQNLKEVNGSHYHDDNLLYTV MNLFAAGTDTTATTIEWCLLFMAKYPHIQERVQEELNWVVGSRQVRIEDRKNLPFTDAVIHESQRLANIAPMAIP HTTSKDVTFQGYFIKKGTTVLPLLTSVLYDESEWESPRTFNPSHFLDKEGKFLKRGAFMPFSAGRRVCLGESLAR MDIFLFFTSLLQHFSFTPPPGVSEDELDLTPVVGFTLSPQPQGLCAVRRQ* CYP2K29 Oryzias latipes (medaka) Chr24: 11283779:11289362: (+) strand Joanna Wilson and students submitted to nomenclature committee Jan. 25, 2008 68% to Fugu 2K11 MQILDFFQSYSSVSLVGILAVLVLYFISQFIFNSEQHGQEPPGPRPLPIIGNLMQIDLKRPYKTLEEFSKTYGPV FTVFFGGEKVVVLAGYKTVKNALVNHDEEFGERAIPPIIQELNKGLGVLWSNGDIWRDIRRFALTNLRDFGMGKK ACEDKITEECQYLLEVFKKFKGNAFDTTKPLNYAVSNIICSMVYGSRFEYDDPKFTSMVDRTNRNIQLSGSPTLQ AYNMVPWLFKWVASRREVHECAAANRKQNQSIFSHLKETLNPQMCRGFVDAFLVKGQTLEKSGVTNSAFNDENLL MTVIHLFAAGTETTSTTLRWGLLLMAKYPKIQDQVQDELRRVIGDRMVQVSDRKNLPFTDAVIHEIQRLASIVPT ALPHKTSKDVTFQGYFIKKGTTVFPLLTSVLHDANEWEKPHTFYPAHFLDKDGKFVKREAFIPFSAGRRICLGES LARMELFMFFTTLLQNFCFTPPPGVSKEELSLTPCGGITVGPVPHKLCAVPCSE* CYP2K30 Oryzias latipes (medaka) Chr24: 11290118:11301397: (+) strand Joanna Wilson and students submitted to nomenclature committee Jan. 
25, 2008
63% to Fugu 2K11
MGVWDTLLPSLSPSSLLGAGVLLLLVFLFCPHRTSSQKHRKEPPGPTPIPILGNLHQLDLKRPDQTFMKFAKKYG
SVFTVYMGPKKTVVLTGYKTMKEALVNYAEEFGEREAPTVAKEAHLDCGVVWANGASWREMRRFALSTLRDFGMG
KRACEDKIIPECHSLLKEIRKFQGEAFDPTLIINSAVCNVICSMVYGTRFEYDDPDFRTILSRTMKGIQLLGSPG
VQLHNLFPRIGRLFLSASKQINQIFTANKNYHLKLLKETFTPHTCKSIADAFQLRQQEEDGFPNSHFHDANILVT
IMNLFTAGTETTAATLRWALLFMAKYPKIQDQVQEELSRVMEGRQVTVEDRQRLPFTDAVIHETQRKANIIPLSL
LHRTSQDVTFKGFFIEKGTTVIPVLTSVLYDENEWEKPNIFYPAHFLSKDGKFLKRDAFMPFSAGRRLCLGESLA
RMELFLFFSTLLQHFRIAPPLGVSEEELDLTPRPGGTLSPQPHKLCLVSLK*

2L Subfamily

CYP2L1 Panulirus argus (spiny lobster)
GenEMBL U44826 (1601bp)
James, M.O., Boyle, S.M., Trapido-Rosenthal, H., Carr, W.E. and Shiverick, K.T.
cDNA and protein sequence of a major form of P450, CYP2L, in the hepatopancreas of the spiny lobster Panulirus argus.
Arch. Biochem. Biophys. 329, 31-38 (1996)

CYP2L2 spiny lobster
no accession number
Sean Boyle and Margaret O. James
submitted to nomenclature committee

2M Subfamily

CYP2M1 Oncorhynchus mykiss (rainbow trout)
GenEMBL U16657
Yang,Y.H., Wang,J.L. and Buhler,D.R.
cDNA cloning and characterization of a novel cytochrome P450 from rainbow trout.
Abstracts of the VII International Congress of Toxicology, Vol. 7, No. 1, 10-P-2 (1995)
Yang,Y.H., Wang,J.L., Miranda, C.L. and Buhler,D.R.
CYP2M1: cloning, sequencing, and expression of a new cytochrome P450 from rainbow trout liver with fatty acid (omega-6)-hydroxylation activity.
Arch. Biochem. Biophys. 352, 271-280 (1998)
Note: 42% identical to CYP2K1

2N Subfamily

CYP2N1 Fundulus heteroclitus (killifish)
John Stegeman
submitted to nomenclature committee

CYP2N2 Fundulus heteroclitus (killifish)
John Stegeman
submitted to nomenclature committee

CYP2N3 Stenotomus chrysops (scup)
No accession number
Agnes Knorr, Andrew McArthur, John Stegeman
Submitted to nomenclature committee Nov.
3, 2000 73% to 2N1 CYP2N4 Chaetodon mertensii (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 CYP2N5 Chaetodon punctatofasciatus (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 CYP2N6 Chaetodon auriga (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 CYP2N7 Chaetodon xanthurus (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 CYP2N8 Chaetodon plebius (butterfly fish) No accession number Bryan DeBusk Submitted to nomenclature committee July 19, 2001 CYP2N9 Fugu rubripes (pufferfish) No accession number Scaffold_3261a 9342 MWLWDLVLWLRLTGFLLPVLIVLLIIMYSLRQKDPPNFPPGPPALPLLGNIFNIEAKQPHLYLTK 9148 (0) LADVYGSVFCIRLGRHKTVFVSGWKMVKEAIVTQADSFVDRPYSPMATRIYSGNS LKG95403.y1 AGLFFSNGHVWRKQRRFAMATLRSFGLANGSMELSICEESRHLQEAMESQK LKG95403.y1 8235 GEPFDPVPLLNNAVANIICQIVFGRRFDYTDHMFQRMLHHLTEMAYLEGSIWAL 8074 (0) 7991 LYDSFPALMKHLPGPHNGIFSSSSSLQGFIWREIQRHKSDLDPSNPRDYIDAFLIEEG 7818 (0) 7743 NGNNQLGFEERNLVLCCLDLFLAGSETTSKTLQWGLIYLIRKPHIQ 7606 (1) EKVQVEIDRPIGRTRQPTMADRPNLPYTDAVIHEIQRMGNIVPLNGPSNGCQGTRPWRGYFIPK (0) GTSVMPNLTSVLFDKNEWETPDTFNPEHFLDAEGKFVRREAFLPFSA 7246 (1) 7162 GRRACLGEGLARMELLLFFVSLCQRFHFSTLDRVELSTEGITGATRTPYPFKIYAQVR* 6986 CYP2N10 Fugu rubripes (pufferfish) No accession number Scaffold_3261b 13883 MWLYSVLSWDFTSLLLFFFVLILFANYLKNRDPPNFPPGPFAFPIVGNFFTMDSKNLHLYFNK 13695 (0) 12557 LADVHGNVFSFRLGGDKMVCVSGHKMVKEAIVTQADNFVDRPYDPISARVYGGQT 12393 (1) DGLFQSNGEVWKRQRRFALSTLRNFGLGKNILEQSICEEAQHLLEEMRSHG 12153 (1) GKPFNPARLFNNTVSNIICQLVMGKRFEYSDHKFQMLLKYLSEVLVLEGSFWGQ 11913 (0) 11814 LYEAFPSVMKHLPGPHNKVFSHFNHLKDFMNEEIQNHKKDLDHNNPRDYIDAFIIEMEK 11638 (0) NKDTNLGFTETNLAMCSLDLFIAGTETTATTLLWDLVYLINNPDIQ 11413 (1) 11290 GKVQAEIDQVIGQNRQPTMADRPNLPYTDAVIHEIQRMGNIVPLNGPRMAAKDTTLGGYFIPK 11102 (0) 11018 GTSLMPILTSVLFDKNEWETPDKFNPGHFLDAEGKFKKREALLPFSA (1) 
GKRVCLGEGLAKMELFLFFVSLFQNFTFFVPGGAELNTEGITGTTRVPHPFEILARPR* 10619 CYP2N11 Fugu rubripes (pufferfish) No accession number Scaffold_3261c MWPLQLLLDFDIRALLLFISVLLLIGDYFRYKNPPNFPPGPMSLPFVGSFFSVDSKHPHNYFIQ (0) 18495 MAELYGKLFSIRLGSGKIVFACGYKMVKEAIVTQADNFVDRPFNAFGDRIYMGQR 18331 (1) 18251 DGLFQNNGEVWKRQQHFALSTLRNFGLGKNILEQSICEEAQHLLEEMRSHG (1) GKPFDPASLFTRAVSNIICQLVMGKRFEYSDHKFQMLLKYLSELLVLEGSFWGQ 17859 (0) LYQAFPSVMKHLPGPHNKVFSHYNHLKDFMNEEIQNHKKNLNHNNPRDYIDAFIIEMEK (0) 17498 NKDTNLGFTETNLVLCSLDLFLAGTQTTATTLLWALVYLINNPDIQ 17364 (1) 16988 EKVQAEIDQVIGQTRQPTMADRPNLPYTDAVIHEIQRMGNIVPLNASRMAAKDTTLGGYFIPK 16800 (0) GTSLLPILTSVLFDKNEWETPDKFNPGHFLDAEGKFKKREAFLPFSA (1) 16492 GKRVCLGEGLVKMELFLFFVSLFQKFSYSVSGGAELSTEGITGITRVPHPFEIHTRPRSF* 16310 CYP2N12X Fugu rubripes (pufferfish) No accession number Scaffold_3261d Renamed CYP2AD1 22960 XCLNIHTGIALSNGYMWKKQRKFAHTHLRYFGEGQKLLENHIQMESKFMCEAFKDEQ 22811 (1? Bad boundary) 22727 GKPFDPQYTITNAVGNIISALVFGHRFEYSDASFRRILELDNEAVVLAGSARTQ 22566 (0) 22482 LYDSFPSLMKHLPGPHQTVHANYGKITDFLKKEVDKHMEEWNPEDPRDYVDTYLSEMEK 22306 (0) 21959 MNQDPQGGFNVETLLICILDLIEAGTESAATTLRWGLVFILNYPDVQ 21819 (1) 21739 EKVQEEIDRVIGQSRQPAMADRPNMPYTDAVIHEIQRFANVVPAGFPKMATKDTTVGGYFIPK 21551 (0) 21462 GLAITTMLSSVLFDKNEWETPDVFNPNHFLDSEGRFRKRDAFIPFSA 21322 (1) 21218 GKRVCIGENLAKMELFLFFTSILQHFNLSPVPGQMPSLEGILGFTYSPQPFRMIVAPR* 21042 CYP2N13 Danio rerio (zebrafish) CYP2N14 Micropterus salmoides (largemouth bass) No accession number Alex J. McNally submitted to nomenclature committee May, 31, 2005 74% to 2N10 CYP2N15 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr VIII (+) strand 19111307-19114904 Joanna Wilson and students submitted to nomenclature committee Nov. 
6, 2007 69% to 2N11 see ESTs CD506195.1, CD504080.1, CD507761.1 the genome assembly is missing the lower case region MWLFHFLLGFDLKGLFLFMVVFFIIADIFKNRNPANYPPGPLSLPIVGNffsverkhphiyftk LADIYGNVFSVRL GRNKTVFVSGYKMVKEAIVTQADNFVDRPDNAMADRVYSGDSGGLFMSNGETWKRQRRFALSTLRSFGLGKSTME QSICEEIRHLQEEIEKEKGEPFNPASLFNNAVSNIICQLVMGRRFDYCDHNFQSMLTYLCEILRLQGSVWGLLYD SFPRVMKHLPGSHNKIFSHYDSLLDFMNKEVESHKKDLDHSDPGDYIDAFIIEMEKHNESDLGFTEANLALCSLD LFLAGSETTSTTLLWALVYLMKYPDIQDKVQVEIDGVIGRSRQPSMADRPNLPYTEAVLHEIQRMGNIVPLNGAR MATKHTTLGGYLIPKGTTVMPSLTSVLFDKTEWETPHTFNPGHFLGAEGKFVRREAFLPFSAGKRVCPGEGLAKM ELFLFLVGLLQKFSFSVPDGVELSTEGITGVTRVPHPFKVYAKAR* CYP2N16 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr VIII (+) strand 19116076-19119924 Joanna Wilson and students submitted to nomenclature committee Nov. 6, 2007 77% to 2N9, 62% to Fugu 2N10 MSLCGFLLRFGPPEFLLLFFAFLLLVCFWAKKDPPNFPPGPPSLPFLGNIFNIESKQPHIYLTKLADVYGNVFCI RLGRHRTVFVSGWKMVKEAIVTQADHFVDRPYSPMVTRIYSGNSGLFFSNGKVWRRQRRFAMSTLRTFGLANSSM EQSICEESRHLQEALEKEKGEPFDPVPLINNAVANIICQIVFGRRFDYTDHNFQSMLRNLTDMAYLEGSIWALLY DAFPAVMKHVPGPHNGIFRSSRSLEASIRAEIERHKLDLDPTNPRDYIDLFLIEEKHSKNRDLGFDEGNLVLCCL DLFLAGSETTSKTLQWGLVYLIKSPHIQVQAEIDGVIGPTRHPTMADRPNLPFTDAVIHEIQRVGNVVPLNGLRM AAKDTTLGGYFIPKGTSVMANLTSVLFDPAEWEKPDSFHPAHFLDAGGRFVRREAFLPFSAGKRACLGEGLARAE LFLFFVTLLQKYHFTTLEGVELRGDGVIGATRTPHPFKVYAEAR* CYP2N17 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr XVI (-) strand 2228495-2232907 Joanna Wilson and students submitted to nomenclature committee Nov. 6, 2007 51% to Fugu 2N9, 71% to 2N12, see ESTs DT966028.1, DW631570.1 CYP2N18 Oryzias latipes (medaka) Chr4: 28082010:28087962: (-)strand Joanna Wilson and students submitted to nomenclature committee Jan. 
25, 2008 67% to Fugu 2N11 MWLDSFLLSFDLKALVLFIFLFLLIADWIKHRKPANFPPGPLGLPFVGNFLTIDGKHPHIYFSKMAESYGNVFSV RLGSQATVFVSGYKMVKEALVTQAENFVDRPFSEIGGRFYEGNSNGLFFSNGEKWKKQRRFALSTLRTFGLGKNT MEQSICEEIRHLQQQIENEKGGPFSPAGLFNNAVSNIICQLVMGKRFDYDDNNFQVMMKYISEAVQLEGSIWGIL YESFPGLMKHLPGSHNKIFRNYKIVQDFLAQEIKIHKQDLDPNNPRDYIDSFIIEMEKHQNSDLGFNDANLAFCS LDLFVAGTETTSTTLMWALIYLIKHPDVQVKVQQEIDRVIGQNRLPSMADRPNLPYTDAVVHEIQRIGNIVPLNG LRVAAKDTTLGGYFIPKGTALMPMLTSVLFDKTEWETPDTFNPEHFLDADGKFVKKEAFLPFSAGKRVCLGEGLA RMELFLFLVGLLQKFSFSVPEGVELSTEGITGTTRVPHPYKVYAKVR* CYP2N19 Oryzias latipes (medaka) Chr4: 28070384:28074070: (-) strand Joanna Wilson and students submitted to nomenclature committee Jan. 25, 2008 74% to Fugu 2N9 MWLCVWCQWCGLTGTLFFIFAVFFVLCLVKQKDPPHFPPGPPALPVLGNIFSIDSKQPHIYLTKLADVYGNVFCI RLGRHKTVFVTGWKTVKEALVTQADNFVDRPYSPMVTRIYGGNSAGLFFSNGSVWKRQRRFAMTMLRTFGAAKSS TEQSICEESRHLLEAMEMEGGEPFDPVPLLNKAVSNIICQIVFGRRFDYSDTDFQAMLTNLTDMAYLEGSVWALL YDAFPALMKYLPGPHNSIFSSSKSLETTIRREINRHKQDLDPSNPRDYIDKFLMEERHNRKIHSGFEEENLVLCC LDLFLAGSETTSKTLQWGLIYLITNPHIQDKVQAEMDRVVGHSRQPTTADRTNMPYTDAVIHEIQRMGNIVPLNG LRMAAKDTTLGGYIIPKGTAVMPNLTSVLFDKTEWETPDNFNPEHFLDADGKLLRKEAFLPFSAGRRACLGEGLA RMELFLFFVTLFQRFHFSAAAGVELRTEGIIGATRTPHPFQIIAKPR* 2P Subfamily CYP2P1 Fundulus heteroclitus (killifish) John Stegeman submitted to nomenclature committee CYP2P2 Fundulus heteroclitus (killifish) GenEMBL AF117342 John Stegeman submitted to nomenclature committee CYP2P3 Fundulus heteroclitus (killifish) GenEMBL AF117343 John Stegeman submitted to nomenclature committee CYP2P4 Fugu rubripes (pufferfish) No accession number Scaffold_3261e MEAILSTLGLEWMDGRTILIFLLVFVLLADYIKNRVPSNFPPGPWPLPLIGDLHRINPSRLHLQFAE (0) 24760 FAGKYGNIFSLRLFGGRVVVLNGYKTVREALVEKGENFVDRPLIPLFEAFAGNR 24924 (1) 24994 GLVISNGNPWKHQRRFALHTLRNFGIGKKSLEPSIQQECHYLAEAFAQHKG 25156 gap missing exon 4 26236 VYNTFPWLLKWLPGTHQTIFSEIKTVINFVDLKIQEHKRNFDPSSLRDYIDCFLAEMGE 26412 (0) 26493 KEDVESGFDMKNLSICTMDLFGAGTETTTTTLQWGLLYMIYYPHIQ 26630 (1) seq runs off end of contig missing exons 7,8 
and 9 CYP2P fragment Fugu rubripes (pufferfish) No accession number probably exon 7 of 2P4 Fc:c161F04y1 LPC.61739.y1 Fc:c161E03y1 LPC.61451.y1 60% to 2J9 71% to 2P1 KVYAEISAVIGSSREPSITDRDNMPYTNAVIHEMQRMANIIPLNVVHMASSDTTIxxxxxxx CYP2P fragment Fugu rubripes (pufferfish) No accession number Scaffold_2841 probably exons 8 and 9 of 2P4 80% to LPC61680 66% to 2D9 Length = 29344 LPC61680.x1 LPC22842.y1 LPC61776.x1 LPC61672.x1 Fc:c161P11x1 Fc:c161P09x1 66% to 2D9 LPC61488.x1 64% to 2d9 Fc:c161O11x1 93% to LPC61680 probably same gene 67% to CYP2K, upstream sequence runs off scaffold 62% to 2Z2 over 106 aa 59% to 2K10 over 108 aa 60% to 2N12 over 100 aa 80% to 2P2 179 GTIIMPTLNSVLHDESMWETPHSFNPQHFLDQDGKFRKREAFLPFSA 319 442 GKRVCLGEQLARMELFLFFTSLLQRFSFSMADGEQPSLDFQLGGARFPKPYRLRAILR* 618 CYP2P5P Fugu rubripes (pufferfish) No accession number pseudogene fragment Fc:c060E24y1 LPC.22843.y1 56% TO 2W1 PKG TO HEME 70% to scaf 2841 exon 8 GTIVVPTLNSVLPDESVWETPHSLDPPLFLDL*RXFRVREAFLPFFA CYP2P6 Danio rerio (zebrafish) ctg24224.g NEW 77% TO 2p9 1209157 MDLLHIYEWIDIKAVLFFACVFLLLSNYIQNKTPKNFPPGPWPLPIIGNLYHIDFNKIHLEVEK 1209348 1209657 LSEKYGSVVSVHLFGQRTVILNGYKQVKEVYIQQGDNVADRPELPMIHDIAGDN 1209818 1209977 GLVAPSGYKWKQQRRFALSTLRNFGLGKKSLEPSINLECHYLNEAISNEN 1210126 1210235 GRPFDPHLLLNNAISNVICVLVFGNRFDYSDHHFQTLLNNINEAMYLDGTIWAQ 1210396 1210482 LYNSHPRIMRLLPGPHKKNITLWNKVIDFARERVKEHRVDYDPSNPRDYVDCFLAEMEK 1210658 1210736 LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLSWSLLYMIKYPEIQ 1210876 1212110 AKVQEEIDRVIGSSRQPSVSDRDNMPYTNAVIHEIQRFGNIAALNLPRAAVKDIQVGKYLIPK 1212298 1212390 GTIVIGNLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1212527 1213310 GKRVCLGEQLARMELFLFFTSLLQHFTFSSPAGVEPSFNYKLGTTRAPKPFKLCAVSR 1213483 CYP2P7 Danio rerio (zebrafish) ctg24224.h 81% to 2p9, 62% to 2P3 (Fundulus) 1214731 MDVLQFYKWLDIKTVLVFLVVFLFLSDYIRNKSPKNFPPGPWSLPFIGHIHHIEHKKVHLQFLK 1214922 1216466 FAEKYGKIFSIRLFGPRIVVLDGYKLVKEVYLQQGDNLADRPILPMFYDITEDK 1216627 1217670 GLIGSNGYKWKHQRRFALSTFRTFGLGKKSLEPSILLECSCLNDAFSNEQ 1217819 1217891 
XPFDPRLLLNNAVSNVICALVFSNRFDYSDHHFQTLLKHINEVLYLEGTVWAQ 1218046 1218134 LYNFFPWLMRRLPGPHQKIFVLLNKVIDFVREKVNEHRVDYDPSNPRDYIDCFLAEMEK 1218310 1218399 LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLYWGLLYIIKYPEIQ 1218539 1218632 AKVQQEIDAVVGGSRQPSVSDRDNMPYTNAVIHEIQRMGNIVPLNVFRITVEDTQIGEYSIPK 1218820 1218907 GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1219044 1219144 GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGGTHSPQPYKLCAVPR 1219317 CYP2P8 Danio rerio (zebrafish) ctg24224.i 90% TO 2p9 1221362 MDLWYLYEWIDIKSILIFLCVFLLLGDYIKNKAPKNFPPGPWSLPIIGDLHHIDNSKIHLQFTK 1221553 1221722 FAERYGNIFSFRLFGPRIVVLNGYNLVKEVYIKQGDNLADRPVLPLFYEIIGDK 1228152 1221974 GLILSSGYKWKHQRRFALSTLRNFGLGKKSLEPSINVECGFLNEAISNEQ 1222123 1222203 GRPFDPRLLLNNAVSNVICVLVFGNRFDYSDHHFQTLLKNISEAVYLEGSICNQ 1222364 1224317 LYNMFPWLMERLPGPHKTIITLWRKVTDFVREKVNEHRVDYDPSNPRDYIDCFLTEMEK 1224493 1224582 LKDDTAAGFDVENLCICSLDLFVAGTETTSTTLYWGLLYMIKYPEIQ 1224722 1224812 AKVQEEIDAVVGGSRQPSVSDRDNMPYTNAVIHEIQRMGNIAPINLARSTSEDTQIGNYSIPK 1225000 1225184 GTMVTSNLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1225321 1225421 GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKMGGTHCPKPFKLCAVPR 1225594 CYP2P8-de7,8 Danio rerio (zebrafish) ctg24224.j EXONS 7,8 pseudogene 1226868 PSVSDRDNMPYTNSVIHEIQSIGNIGPLNVFGITVK 1226975 1227088 GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1227225 CYP2P9 Danio rerio (zebrafish) ctg24224.k 98% (7 AA DIFFS) TO 2p9 1227637 MDLWYLYEWIDIKSILIFLCVFLLLGDYIKNKAPKNFPPGPWSLPIIGDLHHIDNSKIHLQFTK 1227831 1227991 FAERYGNIFSLRLFGPRIVVLNGYNLVKEVYIKQGDNLADRPVLPLFYEIIGDK 1228131 1228249 GIVLSSGYKWKHQRRFALSTLRNFGLGKKSLEPSINLECGFLNEAISNEQ 1228398 1228473 GQPFDPRLLLNNAVSNVICVLVFGNRFDYSDHHFQTLLKHINEAIYLEGGICAQ 1228634 1228798 LYNMFPWLMQRLPGSHKKVITLWKKVIDFIRQKVNEHKVDHDPLNPRDYIDCFLAEMEK 1228974 1229073 LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLYWGLLYMMKYPVIQ 1229210 1229290 AKVQEEIDRVVGGSRHPSVSDRDNMPYTNAVIHEIQRMGNIIPINVTRTTSEDIRIGKYSVPK 1229475 1229629 GTMVTSNLTSVLFDESEWETPHSFNPGHFLNAEGKFRRRDAFLPFSL 1229769 1229866 
GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGATHCPQPYKLCAVPR 1230039 CYP2P9 Danio rerio (zebrafish) GenEMBL BC056816, NM_200620 61% to CYP2P3 zfishK-a583c07.p1c zfishC-a1218e09.p1ca MDLWDLYEWIDIKSILIFLCVFLLLGDYIKNKAPKNFPPGPWSLPIIGDLHHIDNSKIHLQFTK FAERYGNIFSLRLFGPRIVVLNGYNLVKEVYIKQGDNLADRPVLPLFYEIIGDK GIVLSSGYKWKHQRRFALSTLRNFGLGKKSLEPSINLECGFLNEAISNEQ GQPFDPRLLLNNAVSNVICVLVFGNRFDYSDHHFQTLLKHINEAIYLEGGICAQ LYNMFPWLMQRLPGSHKKVITLWKKVIDFIRQKVNEHRVDHDPLNPRDYIDCFLAEMDK LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLYWGLLYMMKYPGIQ AKVQEEIDRVVGGSRQPSVSDRDNMPYTNAVIHEIQRMGNIIPINVTRTTSEDIRIGKYSVPK GTMVTSNLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFSL GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGATHCPQPYQLCAVPR CYP2P10 Danio rerio (zebrafish) GenEMBL BC049521, NM_201511 84% to CYP2p9 zfishG-a2632g08.q1c MDMFYFYEWVDIKSILIFLCVFLLLSDYIKNKAPKNFPPGPWSL PFIGDLHHIDPNKIHLQFTEFAEKYGKIFSFRLFGSRIVVLNGYNLVKEVYTQQGDNL ADRPTLPITSAIIGDNRGLVASSGYKWKHQRRFALTTLRNFGLGKKNLELSINFECGF LNEAISNEQGRPFNPRLLLNNAVSNVICVLVFGNRFEYSDHHFQNLLNKINESVYLEG SIFVHLYNMFPWLMQLLPGPHKKLITLWQRVTDFVREKVNEHRVDYDPSSLRDYIDCF LAEMEKHKDDTAAGFDVENLCMCTLDLFVAGTETTSTTLYWGLLYMIKYPEIQAKVQQ EIDAVVGSSRQPSGSDRDNMPYTNAVIHEIQRMGNIIPLNVVRTTSEDTRIEKYSIPK GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFSLGKRVCLGEQLA RMELFLFFSSVLQRFTFSPPAGVEPSLDFKMGFTRCPKPYKLCAVPR CYP2P10 Danio rerio (zebrafish) ctg24224.l 3 AA DIFFS TO 2P10 1232262 MDMFYFYEWVDIKSILIFLCVFLLLSDYIKNKAPKNFPPGPWSLPFIGDLHHIDPNKIHLQFTE 1232411 1233540 FAEKYGKIFSFRLFGSRIVVLNGYNLVKEVYTQQGDNLADRPTLPITSAIIGDNR 1233677 1233779 GLVASSGYKWKHQRRFALTTLRNFGLGKKNLELSINFECGFLNEAISNEQ 1233928 1234024 GRPFNPRLLLNNAVSNVICVLVFGNRFEYSDHHFQNLLNKINESVYLEGSIFVH 1234173 1237098 LYNMFPWLMQLLPGPHKKLITLWQRVTDFVREKVNEHRADYDQSSLRDYIDCFLAEMEK 1237274 1237383 HKDDTAAGFDVENLCMCTLDLFVAGTETTSTTLYWGLLYMIKYPEIQ 1237523 1237610 AKVQQEIDAVVGSSRQPSGSDRDNMPYTNAVIHEIQRMGNIIPLNVVRTTSEDTRIEKYSIPK 1237798 1239047 GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFSL 1239184 1239282 GKRVCLGEQLARMELFLFFTSVLQRFTFSPPAGVEPSLDFKMGFTRCPKPYKLCAVPR 1239455 
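Annotations such as "3 AA DIFFS TO 2P10" and "98% (7 AA DIFFS)" in the entries above compare two peptide sequences position by position. The Python sketch below shows one way such a count could be reproduced for two segments that are already aligned and of equal length. The function names are hypothetical, and the gap-free comparison is an assumption: the percentages in this list may have come from alignment tools that handle gaps differently. The two test strings are the fifth coding segment of CYP2P10 as given in the BC049521 entry and in the ctg24224.l entry.

```python
def list_differences(seq_a, seq_b):
    """Return (1-based position, residue_a, residue_b) for each mismatch.

    Assumes the two peptides are already aligned without gaps.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [(i + 1, a, b)
            for i, (a, b) in enumerate(zip(seq_a, seq_b))
            if a != b]


def percent_identity(seq_a, seq_b):
    """Percent of aligned positions that carry the same residue."""
    mismatches = len(list_differences(seq_a, seq_b))
    return 100.0 * (len(seq_a) - mismatches) / len(seq_a)


# Same segment of CYP2P10 from the cDNA entry and from the genomic contig entry.
cdna_segment = "LYNMFPWLMQLLPGPHKKLITLWQRVTDFVREKVNEHRVDYDPSSLRDYIDCFLAEMEK"
contig_segment = "LYNMFPWLMQLLPGPHKKLITLWQRVTDFVREKVNEHRADYDQSSLRDYIDCFLAEMEK"

differences = list_differences(cdna_segment, contig_segment)
identity = percent_identity(cdna_segment, contig_segment)
```

In this segment the comparison reports two substitutions (V to A and P to Q). The third difference noted for the contig entry lies in the final segment, where FFSSVLQ in the cDNA reads FFTSVLQ in the contig.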
CYP2P10-de9 Danio rerio (zebrafish) ctg24224.m 3 AA DIFFS TO 2P10 1242741 MELFLFFSSLLYF 1242779 1242772 FTFSLPADVKPSLGYKMGAHTVP 1242840 CYP2P-se1 Danio rerio (zebrafish) ctg24224.n solo exon (pseudogene) 1243476 MDMLHFYEWIDIKSILIFVCVFLLLSDFIKNKTPKNFPPGPWSLPIIGDIHHIDPSKLHLQLSE 1243667 CYP2P fragment Atlantic salmon GenEMBL BI468047 EST00457 77% to CYP2P10 1 DPSSPRDFIDCFLNEIEKCEDDTRAGFNLENLSFCTLDLFVAGTETTSTTLYWGLLFMIN 180 181 YPEIQAKVQAEIDAVVRSSRQPSMEDRDSMPYTDAVIHETQRMGNIIPLNVSRMATKDTE 360 361 VGGYTIPKNTIVLGTLQSILFDESEWETPHTFNPGHFLDQEGKFRKRDAFLPFSLGKRVC 540 541 PXEQLAKMELFLFFTSLLQRFTFFSPPGVEPSL 639 CYP2P11 Micropterus salmoides (largemouth bass) No accession number David Barber Submitted to nomenclature committee 5/21/04 73% to CYP2P3 CYP2P12 Oryzias latipes (medaka) chr4 28112615:28120754 (+) Joanna Wilson and students submitted to nomenclature committee Jan. 25, 2008 61% to Zebrafish 2P10, 69% to CYP2P3 MEGITSVLGLEWVDTWTILIFLFVFLLLSDFLANRRPKNFPPGPHSLPFIGDLHRIQPARLHVQFTEFAEKYGNV FSLHLLGERTVILNGYKQVKEALVQQGDDFVDRPTIPLFVDTIDNKGIVMSNGNSWKQQRRFALHTLRNFGLGKK TMETYIQNECHYITQTFADKQGKPFDAQFLINNAVSNIICCLVFGERFEYSDQEYQKILRNLNDLLILEGSVSAM LYNMFPWLMKRLPGPHQKIFSLTRKIIDFVKIKINEHKGNFDPSAPEDYIDSFLIEMEKVNKDSGFDIDNMCICT MDLFLAGTETTTTTLYWGLLYMIYYPDIQGKVHAEIDAVIGSSRQPSMADKESMPYTDAVIHEIQRMGDIVPQGV FRQANRDTTLDKYTIPKGTIIVPALHSVLHDESMWDNPHSFDPKNFLDKDGKFCKREAFNPFGAGKRVCLGEQLA RMELFLFFTSLFQRFSFSAPTGEQLSLESRMGATRCPKPFRVIAAPR* CYP2P13 Oryzias latipes (medaka) chr4 28123180:28130065 (+) Joanna Wilson and students submitted to nomenclature committee Jan.
25, 2008 63% to Zebrafish 2P10, 75% TO CYP2P3 MEAITAVLGFEWIDSRSLLIFLFVFLLLSDYLANRRPKNFPPGPHSLPFIGDLHRINPSRLHLQLTEFAEKYGNV FSLHLFGERAVILNGHKHVKEALVQRGDDFVDRPSIPLFEQFYSNKGIVVSNGYPWKQQRRFALHTLRNFGLGKK TMEKYMQEECRYLTEAFGEYKVKPFNAQALINNAVSNIICCLVFGERYEYSDKQYQQILQDINEIMILQGGFAAQ LFNSFPWLMKKLPGPHQKILTLLAKLIDFAKVKISEHKENLDPSSPKDYIDSFLIEMAQNENQESSFDISNLCMC TLDLFIAGTETTTTTLHWGLLYMIYYADIQEKVQAEIDAVIGSSRQPSMADKENMPYTDAVIHEIQRMGNILPLG VLRMASKDTTLDKYTIPKGTMIIPTLNSVLHDESMWETPHSFNPKHFLDKDGKFRKREAFNPFGAGKRVCLGEQL ARMELFLFFTSLLQRFSFSAPAGEQPSLENRMGATRCPKPYRLCAVPR* 2Q Subfamily CYP2Q1 Xenopus laevis (african clawed frog) GenEMBL D50560 (2237bp) Ohi, H., Sugata, E., Fujita, Y., Saito, H., Saguchi, K., Murayama, N. and Higuchi, S. Cloning and expression analysis of a cDNA coding for a dexamethasone-inducible cytochrome P450 in Xenopus laevis Biochem. Mol. Biol. Internatl., 45, 689-697 (1998). Saito, H., Ohi, H., Sugata, E., Murayama, N., Fujita,Y. and Higuchi,S. Purification and characterization of a cytochrome P450 from liver microsomes of Xenopus laevis Arch. Biochem. 
Biophys., 345, 56-64 (1997) CYP2Q2 Xenopus tropicalis (frog) See Xenopus page for seq CYP2Q3 Xenopus tropicalis (frog) See Xenopus page for seq 2R Subfamily CYP2R1 human AC018795.4 also AC025730 AC025748 Mikael Oscarson submitted to nomenclature committee 9/4/98 missing N-terminal (approximately 80 amino acids) Unigene entry Hs.16846 ESTs AA058765 zk65e06.r1, AA099882 zl90c08.r1, AA115448 zl04h11.r1 AI280096 qh85e09.x1, AA732048 nz87c04.s1, AA449325 zx06e11.s1, AI221745 qg93e12.x1, AA088847 zl90c08.s1, AA235247 zs37b03.s1, AA115449 zl04h11.s1, AI431661 tg74h07.x1, AI376519 te59a09.x1, T83549 yd44f12.r1, T91507 ye20c08.s1, R11612 yf47e10.r1, T91536 ye20c08.r1, AA449583 zx06e11.r1, T83719 yd65h05.r1 AA663042 MWKLWRAEEGAAALGGALFLLLFALGVRQLLKQRRPMGFPPGPPGLPFIGNIY SLAASSELPHVYMRKQSQVYGE IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSR YGRGWVDHRRLAVNSFRYFGYGQKSFESKILEETKFFNDAIETYKGRPFDFKQLITNAVS NITNLIIFGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFR NAAVVYDFLSRLIEKASVNRKPQLPQHFVDAYLDEMDQGKNDPSSTFSKENLIFSVGELI IAGTETTTNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKCKMPYTEAVLHEV LRFCNIVPLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDS SGYFAKKEALVPFSLGRRHCLGEHLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMT LQPQPYLICAERR CYP2R1 Macaca mulatta (rhesus monkey) partial IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL ICAERR CYP2R1 Bos taurus (cow) See cattle page for details MWEPHSAEAFVAALGGVFFLLLFALGVRQLLKQRRPSGFPPGPSGLPFIGNIYSLAASAELPHVYMKKQSQVYGE (0) IFSLDLGGISAVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMG (1) GLLNSRYGRGWVDHRKLAVNSFRCFGYGQKSFESKILEETKFFIDAVETYNGSPFDLKQLV
TNAVSNITNLVIFGERFTYEDTDFQHMIELFSENVELAASATVFLYNAFPWIGILPFGKH QQLFRNAAVVYDFLSRLIEKASINRKPQLPQHFVDAYLDEMERSKNDPSSTFSKENLIFS VGELIIAGTETTTNVLRWAVLFMALYPNIQ (1) GQVQKEIDLIIGPSGKPSWDEKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGYSI PKGTTVITNLYSVHFDEKYWRDPEIFYPERFLDSSGHFAKKEALIPFSL (1) GRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPNLKPRLGMTLQPQPYLICAERR* CYP2R1 Sus scrofa (miniature pig) no accession number Haitao Shang Submitted to nomenclature committee May 23, 2007 95% to human 2R1 partial seq. CYP2R1 Sus scrofa (miniature pig) BW980853.1, BG732954.1, BI359965.1 95% to human 2R1, lower case = cow seq MWEPPGAEVFPAALGGVL 2 FLLLFALGVRQLLKQRRPSGFPPGPSGLPFIGNIYSLAASAELPHIYMKKQSQVYGEIFS 181 182 LDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVDHRR 361 362 LAVNSFRSFGYGQKSFESKILEETKFFMDAIETYSSRPFDFKQLITNAVSNITNLIIFGE 541 542 RFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNAAVVYDFLS 721 722 RLIEKASINRKPQSPQHFVDAYLDEMDQGEKDPSSTFSKENLIFSVGELIIAGTETTTNV 901 902 LRWAILFMALYPNIQGR 952 vqkeidliigpsgkpswdekckmpyteavlhevlrfcnivplgifhatsedavvrgysi pkgttvitnlysvhfdekywrdpeifyperfldssghfakkealipfsl (1) GRRHCLGEQLARMEMFLFFTALLQRFHLHFPHelvpnlkprlgmtlqpqpylicaerr* CYP2R1 Canis familiaris (dog) NW_876313.1:37769697-37744500 Joanna Wilson and students submitted to nomenclature committee Feb. 
17, 2009 93% to human CYP2R1 MRGPPGAEACAAGLGAALLLLLFVLGVRQLLKQRRPAGFPPGPSGLPFIGNIYSLAASGELAHVYMRKQSRVYGE IFSLDLGGISAVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVDHRKLAVNSFRCFGYG QKSFESKILEETNFFIDAIETYKGRPFDLKQLITNAVSNITNLIIFGERFTYEDTDFQHMIELFSENVELAASAS VFLYNAFPWIGIIPFGKHQQLFRNAAVVYDFLSRLIEKASINRKPQSPQHFVDAYLNEMDQGKNDPSCTFSKENL IFSVGELIIAGTETTTNVLRWAILFMALYPNIQGQVQKEIDLIMGPTGKPSWDDKCKMPYTEAVLHEVLRFCNIV PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRNPEIFYPERFLDSSGYFAKKEALVPFSLGKRHCLG EQLARMEMFLFFTALLQRLHFPHGLVPDLKPRLGMTLQPQPYLICAERR* CYP2r1 mouse GenEMBL XM_146091.1 1 MLELPGARACAGALAGALLLLLFVLVVRQLLRQRRPAGFPPGPPRLPFVGNICSLALSAD 180 181 LPHVYMRKQSRVYGE IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLL 540 541 NSRYGRGWIDHRRLAVNSFHYFGSGQKSFESKILEETWSLIDAIETYKGGPFDLKQLITN 720 721 AVSNITNLILFGERFTYEDTDFQHMIELFSENVELAASAPVFLYNAFPWIGILPFGKHQR 900 901 LFRNADVVYDFLSRLIEKAAVNRKPHLPHHFVDAYLDEMDQGQNDPLSTFSKENLIFSVG 1080 1081ELIIAGTETTTNVLRWAILFMALYPNIQGQVHKEIDLIVGHNRRPSWEYKCKMPYTEAVL 1260 1261HEVLRFCNIVPLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWKDPDMFYPERF 1440 1441LDSNGYFTKKEALIPFSLGRRHCLGEQLARMEMFLFFTSLLQQFHLHFPHELVPNLKPRL 1620 1621GMTLQPQPYLICAERR 1668 CYP2R1 chicken XM_420996 Gnomon prediction seems too long 80% to human 2R1 MGPAAGDAEPEAAAGGGPWLL LALPPLLLLFALVVRQLLKQRRPPGFPPGPAGLPLIGNIHSLGAEQPHVYMRRQSQIH GQIFSLDLGGISAIVLNGYDAVKECLVHQSEIFADRPSFPLFKKLTNMGGLLNSKYGR GWTEHRKLAVNTFRTFGYGQRSFEHKISEESVFFLDAIDTYKGRPFDLKHLITNAVSN ITNLIIFGERFTYEDTEFQHMIEIFSENIELAASASVFLYNAFPWIGILPFGKHQQLF KNAAEVYDFLHKLIERVSENRKSQSPRHFIDAYLDEMDCNKNDPESTYSRENLIFSVG ELIIAGTETTTNVLRWAVLFMALYPNIQGHVQKEIDLVIGPNKMPALEEKCKMPYTEA VLHEVLRFCNIVPLGIFHATSKDTVVRGYSIPEGTTVITNLYSVHFDEKYWNNPEVFF PERFLDSNGQFVKKDAFIPFSLGRRHCLGEQLARMELFLFFTSLLQRFHLRFPHGGIP DLKPRLGMTLQPQPYLICAERR CYP2R1 Xenopus tropicalis (frog) See Xenopus page for seq CYP2R1 Danio rerio (zebrafish) CYP2R1 Fugu rubripes (pufferfish) No accession number Scaffold_7138 69% to human 2R1 MVPAQSPPLVPPSRDQALLGLACLTVAFLAVLLVRQLVK 
QRRPPGFPPGPSPIPIIGNIMSLATEPHVFLKKQSEVHGQ (0) IFSIDLGGILTVVLNGYDCIRECLYNQSEVFADRPSLPLFKKMTKMG 12808 12701 GLLNCKYSKGWIEHRKLACNSFRYFGSGQRLFERKISEECMFLVDAIDQHKGKAFNPKHL 12522 12521 VTNAVSNITNLIIFGQRFTYDDHNFQHMIELFSENVELAVSGWALLYNAFPWIEYLPFGK 12342 12341 HQKLFFNAAEVYDFLLRVTKEFSQGRVPHMPRHYVDAYLDELERNAGDPNSSFSYENLIY 12162 12161 SVGELIIAGTETTTNTLRWAMLYMALYPNIQG RVHREIDSVLANGRMPTLEDKQKMPYVEAVLHEVLRFCNIVPLGIFRATSQDA 11802 11801 NVNGYTIPKGTMVITNLYSVHFDEKYWSDPGVFSPQRFLDANGNFVRREAFLPFSLG 11631 11535 GRRQCLGEQLARMEMFLFFTTLLQRFHLQFPVGTIPTIAPKLGMTLQPKPYSICAVRR 11362 HQKSLISVTTPCHK* 11317 CYP2R1 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr II (+) strand 9716095-9718823 Joanna Wilson and students submitted to nomenclature committee Nov. 6, 2007 88% to Fugu 2R1 MVSIKAQSLVPVSCAQALLGVVCLAVALLAFLLVRQLVKQRRPPGFPPGPSPIPVIGNIFSLATEPHVFLKRQSE VHGQIFSLDLGGILTVVLTGYDCVRECLYNQGEVFADRPSLPLFKKMTKMGGLLNCKYGKGWIEHRKLACNSFRY FGSGQKQFERKISEECMFFVDAIDEHKGKPFNPKHLVTNAVSNITNLIIFGQRFTYDDRNFQHMIEIFSENVELA VSGWALLYNAFPWIEYVPFGKHQKLFRNAAEVYDFLQEVIQSFSQGRVPHSPRHYVDAYLDDLERSAGAPDSSFS YENLIYSVGELIIAGTETTTNTLRWAMLYMALYPNIQERVHREIDSVLANERAPTLEDKQKMPYVEAVLHEVLRF CNIVPLGIFRATSQEAKVNGYTIPKGTMVITNLYSVHCDEKYWNDPGAFSPQRFLDSNGNFVRREAFLPFSLGRR CCLGEQLARMEMFLFFTTLLQRFHLQFPAGSIPTVTPKLGMTLQPKPYSICAVRRQQKSPCFGDTPYPN* CYP2R1 Oryzias latipes (medaka) chr3 17795604:17802282 Joanna Wilson and students submitted to nomenclature committee Jan. 
25, 2008 87% to Fugu 2R1 MVSLTAASVVPVSRAMALLSVGCLAAALMAYLLVRQLVKQRRPPGFPPGPSPIPIIGNIFSLATEPHVFLKRQSE VHGQIFSLDLGGIMTVVLNGYDCVKECLYHQSEVFADRPSLPLFKKMTKMGGLLNSKYGKGWNDHRKLACNSFRY FGSGLRLFERKISEECMFFVDAIDEHKGKPFNPKHLVTNAVSNITNLIIFGQRFTYDDRDFQHMIELFSENVELA VSGWALLYNAFPWIEYMPFGKHQKLFRNAMEVYDFLLEVIKRFSHGRVPHVPRHYVDAYLDELEQNSGDPSSSFS YENLIYSVGELIIAGTETTTNTLRWAMLYMALYPNIQERVHREIDSVLTNGRAPTLEDKHKMPFVEAVLHEILRF CNIVPLGIFRATSQEAKVNGYTIPKGTMVITNLYSVHFDEKYWNEPGVFSPQRFLDSSGNFVRREAFLPFSLGKR HCLGEQLARMEMFLFFTTLLQRFHLQFPPGTVPTVTPKLGMTLQPKHYSICAIRRQQKVPNS* CYP2R2P Fugu rubripes (pufferfish) No accession number Fc:c104I03x1 LPC.39565.x1 77% to fugu 2R1 MAY BE PSEUDOGENE OF scaf 7138 exon 8 201 DSVLANGRMPTLEDKQKMPYVEAVLHEVLRFCNIVPLGIFRATS*DANVNGYTIPKGTM 220 221 VITNLYSWHFYEKNWSKTGAFSHPKCLWDAHGHFCEWLMASMPGSFG 518 CYP2R3P Fugu rubripes (pufferfish) No accession number Fc:c068L08y2 LPC.26046.y2 67% to fugu 2R1 exon 8 possible pseudogene fragment LYYTKIXTVLARVEIPTLEDKQKMPYLEAVLPEVLRFCDIVPLGLFRATSAGADVNGFTIPGGAVLIAILCSGRF 2S Subfamily CYP2S1 human GenEMBL AF335278 AC011510 ESTs T84852, AA315278, AA300981 and AA301039 AA316621, AA496320, AA422150 Rylander, T., Neve, E.P.A., Ingelman-Sundberg, M. and Oscarson, M. Identification and tissue distribution of the novel human cytochrome P450 2S1 (CYP2S1) Biochem. Biophys. Res. Commun. 281, 529-535 (2001) There is no UNIGENE entry for any of these ESTs. 52% identical to CYP2B subfamily members, 50% with CYP2A members and 50% with CYP2G1.
AC011510 one exon per line 78% to mouse 2s1 49% to 2B6 47% to 2A13 MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMR LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQ TYEMFSWFLRPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ EEQNPGTEFTNKNMLMTVIYLLFAGTMTVSTTVGYTLLLLMKYPHVQ KWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ GTEVFPLLGSILHEPNIFKHPEEFNPDRFLDADGRFRKHEAFLPFSL GKRVCLGEGLAKAEVFLFFTTILQAFSLESPCPPDTLSLKPTVSGLFNIPPAFQLQVRPTDLHSTTQTR CYP2S1 Macaca mulatta (rhesus monkey) AC011510 exons 2,3 from CO649282.1, gene fragmented on multiple scaffolds MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMR (0) LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH (1) GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE (1) GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQ (0) TYEMCSWFLWPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ (0) EEQNPDTEFTNKNMLMTVIYLLFAGTMTVSATVGYTLLLLMKYPHVQ (1) KRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ (0) GTEVFPLLGSILHDPSIFKHPEEFNPDHFLDADGRFRKHEAFLPFSL (1) GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISGLFNIPPAFQLQVRPTDLHSTTQTT* CYP2S1 Bos taurus (cow) See cattle page for details MEAAGTWALLLLLLLLVVTLVLPATWDRGHLPPGPTPLPLLGNLLQLRPGALYLGLLR LSKKYGPVFTVYLGPWRRVVVLVGHEAVQEALGGQAEEFSGRGTVATLDGTFDSH GVFFSNGERWRQLRKFTTLALRDLGMGKREGEELIQAEARCLVEALQGTK GRPFDPSLLLAQATCNIICSLVFDLRLPYDNEEFQAVVRAAGGIAVGVSSPWGQ TYEMFSRFLQRLPGPHTQLLRHLGTVAAFAAQQVWQHKGSLGTSGPVRDLVDAFLLKMAK EKQDPNTEFTAKNLLMTVVYLLFAGTVTVSTTIRYTLLLLLKYPQVQ ERVQEELMRELGAGQRPSLGDRARLPYTDAVLHEAQRLLALVPMGIPRALTKTTRFRGYTLPQ GTEVFPLLGSILHDPAVFEEPKEFNPGRFLDADGKFKKHEAFLPFSL GKRVCLGEGLARTELFLLFTAILQAFSLEGPCPLGALSLQPAISGLFNIPQAFQLQFRPR* CYP2S1 Sus scrofa (pig) DT323081.1 85% to CYP2S1 cow MEAAGTWALLLVLVLLLLLALALPGIRTGGHLPPGPAPLPLL GNLLQLRPGAL YLGLMRLSKKYGPVFTVYLGPWRRVVVLVGREAVQEALGGQAEEFSGRGMVATLDGTFDS 
HGVFFSSGERWRQLRKVTMLALRDLGMGKREGEELIQAEAQRLVEEIRGTKGRPLDPSLL LAQATSNIICSLIFGRRFPYDNEEFQAVVRAAGGTVVGVSSPWGQTYEMFSRVLQYLPGP HTQLLGHLGTLAAFAVQQV CYP2S1 Canis familiaris (dog) NW_876270.1: 43044442-43033913 Joanna Wilson and students submitted to nomenclature committee Feb. 17, 2009 80% to human CYP2S1 MEAAGTWTLLLALLLLLLLLALARPRTRGHLPPGPPPLPLLGNLLQLRPGALYSGLLRLSKKYGPVFTVYLGPWR RVVVLVGHEAVQEALGGQAEEFSGRGMLATLDGTFGGHGVFFSNGERWRQLRRLTTLALRDLGMGKREGEELIQA EAQSLVEAFQGTVGRPFDPSLLLAQATSNIICSLTFGLRFPYEDKEFQAVVQAAGGTVLGVSSPWGQTYEMFSWL LQHLPGPHTQLLSHLSVLATFAVQQVQRHKESLDTSGPPHDVVDAFLLKMAKEEQDPNTELTDKNLLMTVIYLLF AGTVTVSTTVRYTLLLLLKYPQVQERVREELSRELGAGRAPGLGDRARLPYTDAVLHEAQRLLALVPMGVPRALA RTTCFRGYTLPQGTEVFPLLGSVLHDPEIFDEPEEFNPDRFLDADGRFQKQEAFLPFSLGKRICLGEGLAHAELF LLLTTILQAFSLESPSPPGALSLQPAVSGLFNIPPAFQLRVRP* Cyp2s1 mouse GenEMBL AA967201 ua50f06.r1 80% IDENTICAL TO HUMAN CYP2S1 AC073725.2, AC087155.1, NT_039407.1 AA967201 ua50f06.r1 80% IDENTICAL TO HUMAN CYP2S1 AA562979 vl64a09.r1 AA543966 vj69d06.r1 AA472776 vg94b11.r1 AI481433 vg94b11.x1 NT_039407.1 - strand 1933418 MEAASTWALLLALLLLLLLLSLTLFRTPARGYLPPGPTPLPLLGNLLQLRPGALYSGLLR 1933239 1931966 LSKKYGPVFTVYLGPWRRVVVLVGHDAVREALGGQAEEFSGRGTLATLDKTFDGHG 1931799 1928473 GVFFANGERWKQLRKFTLLALRDLGMGKREGEELIQAEVQSLVEAFQKTE 1928324 1925993 GRPFNPSMLLAQATSNVVCSLVFGIRLPYDDKEFQAVIQAASGTLLGISSPWGQ 1925832 1925752 AYEMFSWLLQPLPGPHTQLQHHLGTLAAFTIQQVQKHQGRFQTSGPARDVVDAFLLKMAQ 1925573 1924579 EKQDPGTEFTEKNLLMTVTYLLFAGTMTIGATIRYALLLLLRYPQVQ 1924439 1922453 QRVREELIQELGPGRAPSLSDRVRLPYTDAVLHEAQRLLALVPMGMPHTITRTTCFRGYTLPK 1922265 1920451 GTEVFPLIGSILHDPAVFQNPGEFHPGRFLDEDGRLRKHEAFLPYSL 1920311 1920154 GKRVCLGEGLARAELWLFFTSILQAFSLETPCPPGDLSLKPAISGLFNIPPDFQLRVWPTGDQSR* 1919957 >Cyp2s1-ie4b mouse GenEMBL NT_039407.1 + strand 2s internal exon 4 partial duplication z in Figure 2B Nelson et al. 
Pharmacogenetics 14, 1-18 (2004) 1927805 QAASGTLIGISSP*GQ 1927852 2T Subfamily CYP2T1 rat No accession number Lars von Buchholtz Submitted to nomenclature committee 3/6/2000 73% to CYP2T2P CYP2T2P human GenEMBL AC008537 RAQMRGSLPPRPRPLPLLGNL QLQSGGLDRALHSLSGRWGRVFTVRLGPRPAVGLCGYAALRDALVLQADA VSGRGSMAVFERFTRGNGILFSNRPCWWTLRNFALGALKKFGLGTRTVEA RVLEEAACLLDEFQATIGAPFDPVRLLDNAVSNVICSLVFGNRYRYGDPE FLRLLNLFSDNFCIISSRWGESLMDWLPGPHHRIFRNFSE LRVISEQIQRHWQMRQPAEPRDFIDCLTRWVRHGQQDPESHFQE*TSVM TTHFFFGVTETTSTTLCYGLLILLKYLEVAAKVQELDPVVGWRPAPSL DYRVCLPYANAVLLEIQCFISVVPLGLPRTLTLDTHLHSHCLPKGTFVIP LLVTAHRDPTQFKDPDCFNPTNFLDKGKFQGNDAFMPFAPAKQMCLG TGLAHSGIFLFLTATLQRFCLLPVVRPGTINLTQCTGLGSVPPDFQLQPVAC CYP2T2 Canis familiaris (dog) chr1:115897947-115901169 UCSC browser May 2005 assembly 78% to mouse Cyp2t4 MFTALLLLLLLLLLLALARRSWGAQGTRTQGALPPGPTPLPLLGNLLQLESRRLDRALME (0) LSGRWGPVFTVRLGPRPAVVLCGYSALRDALVLQADAFSGRGAMAVFERFTHGN GIVFSNGLRWRTLRNFALGALKEFGLGTRTIEERILEEAACLLGEFQATT GAPFDPRRLLGNAVSNVICSVVFGNRYGYEDPEFQRLLDLFNDNFRIMSSRWGE MYNVFPTLLDWLPGPHHRIFQNFTELRVFISEQIQRHQQTRQPGKPRDFIDCFLDQMDK EQNDPESHFQEETLVMTTHNLFFGGTETTSTTLRYGLLILLKYPEVA AKVQAELDAVVGQSRTPRLGDREHLPYTNAVLHEIQRFISVLPLGLPRALTRDTHLHGYFLPK GTFVIPLLVSSHRDPTQFKDPDCFNPTNFLDDKGEFQTNDAFMPFAP GKRMCLGAGLARSEIFLFFTAILQRFCLLPVGNPANIDLSPQCTGLGNIPPAFQL RLVAR CYP2T2P ortholog Bos taurus (cow) See cattle page for details MMISGIIALSLLVLLLAPARWGWGARSTQRQGALPPRATPLRLLGSLLQLRIWRPGPCTHG LSGRCGPVFTVCLGQCPVVVLCRYAALRDALVLQADAFSGRGAMAVFKRFTRGN GIAFSKGPRWPTLRNFALGALKEFGLGTQTIEERVLEEAACLLGDFQATGG GAPFDPQRLLDNAVSNVICSVVLGNHYGYEDMEFLRLLDLFNDNFRIMSSRWGE XXXXXSLLDWLPGLHH*IFRNFAXLRVFISQQIQLHQQTR*SGKPHDFIDXXXXXXX GTENPESHFQAETLAMTMHNLFFGXXETTSTTLRYGLILLKYSFVA AKVQAELDDMVGRMCAPTLEDREHLPYTNTVLHEIQCFISVVPFGLPSALTCDTHLRGYFLPK GTFVIPLLVSTHWVPTQFKNPECFNPTNFLNDQGEFQSNAFTPFAL GTCLGAGLAPTDIFLFLTSILLRFFLLPVGSHSDTDLTPQCTGLGNVPPAFQLRLVAR* CYP2T3P human GenEMBL AC008962 C-terminal missing RAQMRGSLPPRPRLLPLLGNLQLQSGGLDRALHS 
LSGRWGRVFTVRLGPRPAVVLCGYAALRDALVLQADAVSGRGSMAVFERFTRGN GILFSNRPCWWTLRNFALGAPKKFGLDTRTIEARVLDEAACLLDEFQATI GAPFDPVRLLGNAVSSVTCSCLREPLWLRGPGVPEVLNLLSDNFRIMSSKWGE SLMDWLPGPHHQIFQNFSELQVFISEQIQQHWHMRQPAEPRDFIDCLARWVRHG QQDPESHFQEETLVMTMQLFFFFFFGGTETTSTTLC AKGQELDPVVGQRPVPSPD DHVQWPYTNAVLLEIQRFISVVPRTLTLDTHLHSHCLAKG Cyp2t4 mouse GenEMBl NT_039413.1 + strand 157707 MVTCLALLLLLLILMLLLWWGGVVRRQAQMQKDLPPGPAPLPLLGNLLQLQSGDLDRVLME 157889 158219 LSSHWGPVFTVWLGPLPAVVLCGYEALRDALVLQADAFSGRGAMAVFDRFTCGN 158380 158742 GIVFSNGPRWHSLRNFALGVLRELGVGRSTIEDRILEEAACVLDEFQATM 158891 159103 GAPFDPQQLLDSAVSNVICTVVFGKRYDYGDPEFRRLLNLFSDNFCIMSSRWAE 159264 159884 IYNMFPSFMDWIPGPHNRIFKNFQELRLFISEQIQWHWQSRQTGEPRDFIDCFLDQMDK 160060 160137 EQQDLESHFQDETLVMTTHDLFFGGTETTSTTLRYGLLIMLKYPEVA 160277 160379 AKVQEELDATVGRTWAPRIEDRARLPYTNAVLHEIQRFISVLPLGLPRALTRDVNLKNHFLHK 160567 160818 GTFVIPLLVSAHRDPTQFKDPDHFNPTNFLDDHGEFQNNDAFMPFAL 160958 161048 GKRMCLGAGLARSEIFLFLTAILQKFSLLPVGSPANINLNPQCTGLGNVPPAFQLRLVAR* 161230 2U Subfamily CYP2U1 human AC025090, (AC000016 has C-term) 41% to 2N1 new CYP2 subfamily MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGI 77036 PPGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSI 76863 76862 FSFFIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVT 76734 105008 GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPF 105160 105161 SIISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPF 105340 105341 GPFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEE 105517 105518 YLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ 105622 107396 KVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSENT 107554 109370 LQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGIG 109540 KRVCMGEQLAKMELFLMFVSLMQSFAFALPEDSKKPLLTGRFGLTLAPHPFNITISRR CYP2U1 Macaca mulatta (rhesus monkey) note gc boundary between exons 7,8 MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEK (1) 
GVVFAHYGPIWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSI ISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGP FKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLF YIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ (1) EKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSGNT (1) VLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGI (1) GKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNITISRR* CYP2U1 Bos taurus (cow) See cattle page for details MASPGLPQPPTEDAAWPLRLLHAPPGLLRLDPTGGALLLLVLAALLGWSW LWRLPERGIPPGPAPWPVVGNFGFVLLPRFLRRKSWPYRRARNGGMNASGQGVQLLLADL GRVYGNIFSFFIGHYLVVVLNDFHSVREALVQQAEVFSDRPRVPLTSIMTKGKGIVFAHY GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFRYVKEEMQKHGDAPFNPFPIVNNAVSN IICSLCFGRRFDYTNSEFKQMLTFMSRALEVCLNTQLLLVNICSWLYNLPFGPFKELRQI EKDLTLFLKKIIKDHRESLDVENPQDFIDMYLLHVEEEKKNNSNSGFDEDYLFYIIGDLF IAGTDTTTNSLLWCLLYMSLHPNIQEKIHEEIARVIGADRAPSLTDKAQMPYTEATIMEV QRLSTVVPLSIPHMTSEKT VLQGFTIPKGTIILPNLWSVHRDPAIWE KPNDFYPDRFLDDQGQLIKKETFIPFGI GKRVCMGEQLAKMELFLMFVSLMQSFTFVLPKDSKPILTGKYGLTLAPHPFNIIISKR CYP2U1 Canis familiaris (dog) NW_8762971.1:28366254- 28348146 Joanna Wilson and students submitted to nomenclature committee Feb. 
17, 2009 75% to human CYP2U1 WLHRRTPVAAAGGAAGAGGHSSARGPQLLLADLARAYGAVFSFFIGRHLVVVLSDFRSVRAALVQQAEIFSDRPR VPLVSLVTKEKGIVFAHYGPVWKQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKEEMQKHGEDPFNPFPIVNNA VSNIICSLCFGQRFDYTNSEFKKMLRLMSRALEICLNSQLLLVNICSWLYYLPFGPFKELRQIEKDITTFLKKII KDHKESLNVENPQDFIDMYLLQVEEERKNNSNSSFNEDYLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDIQEK VQEEIERVIGADRVPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSEKTLQGYTIPKGTVILPNLWSVHRDP AIWEKPDDFYPNRFLDDQGQLIKKETFIPFGIGKRVCMGEQLAKMELFLMFVSLMQSFTFALPKDSKKPILTGRY GLTLAPHPFNIVISKR* Cyp2u1 mouse GenEMBL AK018458 16 days embryo lung cDNA about 78% MSSLG DQRPAAGEQPGARLHVRA TGGALLLCLLAVLLGWVWLRRQRACGI PPGPKPRPLVGNFGHLLVPRFLRPQFWLGS GSQTDTVGQHVYLARMARVYGNI FSFFIGHRLVVVLSDFHSVREALVQQAEVFSDRPRMPLISIMT KEKGIVFAHY GPIWKQQRRFSHSTLRHFGLGKLSLEPRIIEEFAYVKEAMQKHGEAPFSPF PIISNAVSNIICSLCFGQRFDYTNKEFKKVLDFMSRGLEICLHSQLFLINICPWFYYLPF GPFKELRQIERDISCFLKNIIREHQESLDASNPQDFIDMYLLHMEEEQGASRRSSFDED YLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ KKVHEEIERVIGCDRAPSLTDKAQMPYTEATIMEVQRLSMVVPLAIPHMTSEKT VLQGFTIPKGTVVLINLWSVHRDPAIWEKPDDFCPHRFLDDQGQLLKRETFIPFGIG CYP2U1 Xenopus tropicalis (frog) See Xenopus page for seq CYP2U1 Danio rerio (zebrafish) CYP2U1-de1b Danio rerio (zebrafish) CYP2U1 Fugu rubripes (pufferfish) No accession number Scaffold_8899 56% to human 2U1 MMSLSWLQSLSSSILTLVIMIILHHLFKCYQKRHGFANIPPGPKPWPVVGNFGGFL (0) NAAAVLTELAKVYGNVYSIYVGSQLVVVLNGYKVVRDALSNHPDVFSDRPDIPA ISIMTKISGIVFAPYGPLWQKHRRFCLSTLRNFGLGRLGLEPCIVEGLTNIKTELLRLE EESGGAGVDPAPVISNAVSNVICSLVLGHRFNHDDQEFRSMLRLMDRGLEICVNSPAVLI NVFPLLYHLPFGVFRELRQVERDITAFL KRFIANHQETLDPNNPRDLTDMYLKEISARREAGDVDSGFTED YLFYIIGDLFIAGTDTTANSVLWVILYMASYPDIQ KVQAEIDGVVGPLRTPSLSDKGKLPFTEAAIMEVQRLTTVVPLAIPHMTSET EFMGYTIPKGTVVLPNLWSVHRDPTEWDDPDSFDPTRFLDEDGTLLRKECFIPFGIG RRVCMGAQLAKMELFLTVTNLLQTFHFRLPEGAPRPPLQGRFGLTLAPCPYTVCINPR CYP2U1 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr IX (-) strand 8019744-8022277 Joanna Wilson and students submitted to nomenclature committee Nov. 
6, 2007 73% to Fugu 2U1 MASLSWPSGADLSRVDVVALLLASLLLALCLFDVHRRRRDLANIPPGPTPWPLVGNLGFSLVPALFRRRFGEKPV DKNAMVLLTERAAVYGNVYSMFVGSQLMVVLNGYEAVKDALSNHPEVFSDRPDIPAITIMTKRKGIVFAPYGPVW RKQRKFCHTTLRSFGLGKLSLEPCIQQGLTTVKTELLHLSKKSGATGVDPAPLISNAVSNVICSLILGQRFHHED RQFRSMLDLMDRGLEICVSSPAVLINVFPLLYYWPFGVFRELRRVEGDITAFLKRIIATHRETLDPDNPRDLVDM YLMEMSAQQAAGEEDSSFTEDYLFYIIGDLFIAGTDTTANSVLWVLLYMVLHPDIQDKVQTEMDEVVGTHRTPSS TDKGSLPFTEATIMEVQRMTVAVPLAIPHMASETTEFRGYTIPKGTVIVPNLWSVHRDPTVWDEPDRFNPARFLD EEGQLLRKECFIPFGIGRRVCMGEQLAKTELFLTVTSLLQAFRFRLPEGAPPPSLTGRFGLTLAPCPYAVCVSPR G* CYP2U1 Oryzias latipes (medaka) chr1 20316302:20324749 Joanna Wilson and students submitted to nomenclature committee Jan. 25, 2008 66% to Fugu 2U1 MVSSSFGLIWSSVLSLSNLLTSLLFLLVYYLVRFYQKQRTIYKNIPPGPKPWPVVGNFGNFFVPPSVRTKIAGQP NSTNAIEIEALRQQATVFGNIHSLFIGGQLIVVLHGFHLIRDALLNQPEVFSDRPDIPLVTILTKRKGIVFAPYG PVWRKQRKFCHTTLRSFGLGKLSLEPCIQRGLAGVKAELLRLNEERGSAGVDPATLIGNSVSNVICSLILGQCFH HHDVEFRTMIRLMEHGLKICINSPAVLINIFPLLYYLPFGVFKELRQVERDITAFLKRIIAKHRDTLDPDNPRDL TDMYLIEMLTQQAAGEEDSSFTDDYLFYVIGDLFIAGTDTTTNSILWFLLYMILHPDVQDKAQAEIDGVVGKHRV PSVTDKGSLPFTEATIMEVQRLHSVVPLAIPHMTSETTVFRGYTIPKGTVIFPNLWSVHRDPTLWEDADSFNPSR FLDNEGNLLRKEYFIPFGIGRRVCMGEQLAKMELFLTVTTLLQAFKFRHPEGNPPPTVKERFGLTMAPCPFSVCV TPRGGPNLNP* 2V Subfamily CYP2V1 Danio rerio (zebrafish) GenEMBL AB026158 Ohta,M., Saitou,T., Yoshizaki,G. and Otsuki,A. Identification of a Cytochrome P450 (CYP2) cDNA for Zebrafish. Also found as an EST from Yea-Huey Yang, Jun-Lan Wang-Buhler and Donald R. Buhler Submitted to nomenclature committee 7/1/2000 Note: AB026158 has at least 2 frameshifts and some other probable errors. Buhler's sequence seems to be more accurate. CYP2V1 Danio rerio (zebrafish) No accession number Tseng, H.-P., Wang-Buhler, J.-L., Yang, Y.-H., Hu, C.-H., Buhler, D.R.
submitted to nomenclature committee 12/08/2003 51% to CYP2Z2 clone name YH-F4-FL 2W Subfamily CYP2W1 human GenEMBL AC073957.3 chromosome 7 clone RP11-449P15 40% to 2F1 MALLLLLFLGLLGLWGLLCACAQDPSPAARWAPGLRPLPLVGNLHLLRLSQQDRSLME LSERYGPVFTVHLGRQKTVVLTGFEAVKEALAGPGQELADRP PIAIFQLIQRGGGIFFSSGARWRAARQFTVRALHSLGVGREPVADKILQELKCLSGQL DGYRGRPFPLALLGWAPSNITFALLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQL FNVHPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHVCPGDPVCSYVDALIQQGQG DDPEGLFAEANAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQGRVQEELDRVLGP GRTPRLEDQQALPYTSAVLHEVQRFITLLPHVPRCTAADTQLGGFLLPKGTPVIPLLT SVLLDETQWQTPGQFNPGHFLDANGHFVKREAFLPFSA GRRVCVGERLARTELFLLFAGLLQRYRLLPPPGVSPASLDTTPARAFTMRPRPRALCAVPRP* CYP2W1 Macaca mulatta rhesus monkey AC073957.7 chromosome 7 LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGG (1) GIFFSSGARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYR (1) GQPFPLALLGWAPSNITFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQ (0) LFNVYPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ (0) GDDPEGLFAEDNAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQ (1) GRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFITLLPHVPRCTATDMQLGGFLLPK (0) GTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFVKQEAFLPFSA (1) GRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMRPRAQALCAVPRP CYP2W1 Bos taurus (cow) See cattle page for details Partial seq. LGKQYGPVFTVHLGHQKTVVLTGYEAVKEALVGTGQELAGRPPIAIFQLINGGG (1) GVFFSSGPRWRAARQLTVRALHGLGVGRAPVANKVLQELRCLTAQLDSYE (1) GRPFPLALLRWAPSNITFTLLFGQRFDYRDPVFLSLLGLVDEVMVLLGKPSVQ (0) LFNLYPRLVALLQLHRPVLRKIEEVRAILRALLEARRHRTPPRGPQQSYLDALIQQGQ (0) XXXXX XXXXXXXXXXXXXXXXPRPEDVHALPYTNAVLHEVQRFITLLPHAPRCTVANTQLGPYLLPK GTPVLALLNSVLLDETQWKTPRQFNPGHFLDANGRFVKRPAFLPFSA CYP2W1 Canis familiaris (dog) NW_876319.1: 293563-287849 Joanna Wilson and students submitted to nomenclature committee Feb. 
17, 2009 83% to human CYP2W1 MALLLLGILLLLGLWGLLRTCTRTPSSASRWPPGPRPLPLIGNLHLLRVSQQDQSLMELSEQYGPVFTVHLGRQK TVVLAGYEAVREALVGTGPELADRPPIAIFQLIQGGGGIFFSSGARWRAARQFTIRTLHGLGVGRGPMADNVLQE LRCLMGQLDCYRGQPFPLALLGWAPSNITFTLLFGRRFDYQDPVFVSLLSLIDEVMVLLGTPSLQLFNIYPWLGA LFQLHRPVLRKIEEVRAILRTLLKARRPSMPGGGPVQSYMDALIQQGQGKDPQGLFAEANMVACTLDMVMAGTET TSATLQWAALLMGKHPSVQCRVQEELDRVLGPGRAPQLEDQRSLPYTNAVLHEVQRFITLLPHVPRCMAADTQLG GYLLPKGTPVIPLLSSVLLDKTQWETPRQFNPGHFLDAEGRFVKRAAFLPFSAGRRVCVGESLARSELFLLFAGL LHRYRLLPPPGLSPDALDTTPAPAFTMRPPAQALCAVPRPGGYDQGDWGRV*
The following cDNA, AK000366.1, has been reported from Japan in a project to identify full-length cDNAs. It is a part of the 2W1 gene. The reported sequence shown below is not full length: it is missing the N-terminal exon and the C-terminal exon. If one translates the sequence upstream of the ATG shown below, one finds the N-terminal exon sequence as shown above, but only about 7 amino acids are recovered before the sequence runs out. Similarly, if the genomic clone is searched downstream of the end of the cDNA, a clear heme-binding sequence is found and another exon is identified. The last exon has a problem: it is too long if allowed to run until it hits a natural stop codon. In another frame, however, there is a sequence LCAVPRP* that is identical to the end of CYP2D6, and this sequence is at the right location to be the end of the 2W1 gene. I suspect there is a frameshift between the heme-binding region and the LCAVPRP* sequence. I have shown the 2W1 gene with this frameshift, though its exact location is uncertain.
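The reasoning in the note above (one motif in one reading frame, the expected C-terminal sequence in another) can be illustrated with a small three-frame translation. The Python sketch below is not the procedure used to annotate CYP2W1. The function names are hypothetical, and the DNA string is synthetic: it was built for this example from arbitrarily chosen codons for the peptide FSAGRR, one inserted base, and codons for LCAVPRP followed by a stop. It is not the real genomic sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, frame):
    """Translate one forward reading frame (0, 1 or 2); unknown codons give X."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def find_motif(dna, motif):
    """Return (frame, residue offset) for each forward frame containing motif."""
    hits = []
    for frame in range(3):
        offset = translate(dna, frame).find(motif)
        if offset != -1:
            hits.append((frame, offset))
    return hits


# Synthetic example only: FSAGRR in frame 0, one extra base, then LCAVPRP*.
upstream = "TTCTCTGCTGGTCGTCGT"          # codons for F S A G R R
extra_base = "A"                         # hypothetical single-base insertion
downstream = "CTGTGTGCTGTTCCTCGTCCTTAA"  # codons for L C A V P R P stop
dna = upstream + extra_base + downstream

heme_hits = find_motif(dna, "FSAGRR")
tail_hits = find_motif(dna, "LCAVPRP*")
```

When the two motifs are found in different frames, as in this constructed case, a frameshift somewhere between them is one explanation. The comparison does not locate the shift, which matches the caution expressed in the note.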
Cyp2w1 MOUSE GenEMBL XM_144624 WHOLE mRNA PARTS from GSS AZ515172 AZ329864 AZ983190 BH076787 MALLLLGVWGILLLLGLWGLLQGCTRSPSLAPRWPPGPRPLPFL GNLHLLGVTQQDRALMELSERYGPMFTIHLGSQKTVVLSGYEVVREALVGTGHELADR PPIPIFQHIQRGGGIFFSSGARWRAGRQFTVRTLQSLGVQQPSMVGKVLQELACLKGQ LDSYGGQPLPLALLGWAPCNITFTLLFGQRFDYQDPVFVSLLSLIDQVMVLLGSPGIQ LFNTFPRLGAFLRLHRPVLSKIEEVRTILRTLLETRRPPLPTGGPAQSYVEALLQQGQ EDDPEDMFGEANVLACTLDMVMAGTETTAATLQWAVFLMVKHPHVQGRVQEELDRVLG PGQLPQPEHQRALPYTSAVLHEVQRYITLLPHVPRCTAADIQLGGYLLPKGTPVIPLL TSVLLDKTQWETPSQFNPNHFLDAKGRFMKRGAFLPFSAGRRVCVGKSLARTELFLLF AGLLQRYRLLPPPGLSPADLDLRPAPAFTMRP (end may be frameshifted) PAQTFSYDSVYSGAKAAYPYVEVGSWPFIWHHGAEGVSAQCSGPTLS 2X Subfamily CYP2X1 Ictalurus punctatus (catfish) GenEMBL AF315346.1 Schlenk,D., Furnes,B. and Zhou,X. Isolation and cloning of a new P450 2 family gene from Ictalurus Punctatus. Unpublished 42% to 2N2 CYP2X2 Fugu rubripes (pufferfish) No accession number Scaffold_4007 MVTSVILLCLGVVVLVLLLRSQRPKNFPPGPPVLPLLGSILELALDNPLQDFER (0) 12453 LRKKYGNVYSLFLGTRPAVVISGLKNIKEALVTKGSDFSGRPQDMILSI 12629 possible frameshift DAIKTN (1) 13208 VIMQDYNLVWKEHRRFALTTMRNFGMGKTSMEDRIHGEIEYIVNTLEKNN (1) GKTLSPHLMFHNAASNIICQVLFGTRYEYDDHFIREIVRCFTENAKISNGPWAM (0) LYDSIPLVRYLPLPFKNAFKNVE (0) TAENLVKDLFVEHKKTRMSGDPRDFVDCYFDELDK (0) RGKDRSSFSENMLTMYALDLHFAGTDTTSNTLLTGFLYLMNYPHIQ (1) ERCHQEIDKVLQDNETVTYDARNQMPYMQ (0) 15630 AVIHEVQRVANTVPLSVFHCTTKDTEFMGYSIPK 15731 (0) 15853 GTLIIPHLASVLKEEGQWKFPNEFNPDNFLNDDGEFVKPEAFMP 15984 (frameshift) XST (1) 16100 GPRVCLGEGLARMELFLIIVTLLHKFQFIWPEDAGEPDYTPIFGATQTPKPYRMKIQLRK* 16282 CYP2X3 Fugu rubripes (pufferfish) No accession number Scaffold_10845 missing exons 1-4 off the end of the scaffold 7527 LYDSFPAVRYLPLPFKRGFEMFK 7450 (0) 7381 MSHERYLEMFVETKKTRVPGKPRHFVDAYMDELEK 7277 (0) 7193 RGDEAFFSEDQLCAIILDLHFAGTDTTANTLLSGLLYLMKYPHIQ 7057 (1) 6289 EYCQQEIDKVMQGKNEVSFEDRVQMPYVQ 6203 (0) 6105 AVIHEIQRTANTVPLSVFHCTTRDTELMGYSIPK 6004 (0) exon 9 5617 GTLIIPNLSSVLNEKGQWKSSHEFNPENFLNENGEFVQPEAFMPFST 5477 (1) 5244 
GPRVCLGEGLARMELFIILVSLLRKFRFIWPEDAEEPDLTPVFGVTQTPKPYSLKVQVRSRC* 5056 CYP2X4X Fugu rubripes (pufferfish) discontinued name No accession number FE:EFRy002apsE4 EST exons 10 and 11 Length = 458 395-496 51% to 2D6 87% to Scaffold_10845 (CYP2X3) Note: this EST is not in the current Fugu databases and appears to have been removed. It may have been a poor quality sequence of CYP2X3 (March 2, 2005) SSPKGTIIIPNLSSVLNEKGQWKCPHEFHPGNFLNENGEFVKPEAFVPFST GPRVCLGEGLARMELFIILVTLLRRFKFIWPEDAEEPDLTPIFGLTQTPKPYRLKVQIRSSFK* CYP2X5P Fugu rubripes (pufferfish) No accession number Scaffold_3538 57% to FE:EFRy002apsE4 51% to 2D6 Length = 26272 61% to 2X2 59% to scaf 10845 (CYP2X3) first 8 exons missing off end of scaffold E in EXXR motif missing, one bad boundary, no exon 11 found Possible pseudogene 25728 (0) PGIHKVQRIANTVPLNVQYCTMKETQLMAHLLPR 25627 exon 9 bad boundary 25349 (0) ETLIIQNLNSRQNEEGQWKFPHKSRPENFLNDQGEFVKTEDFMLFSA 25209 (1) exon 10 CYP2X6 Danio rerio (zebrafish) ctg22265.a 66% to CYP2X1 708019 MLGSSLLVVICILLIFFLIRVKKPKNFPPGPPPVPIFGNLLQLNLANPLKDFEK 707858 707784 FAEKYGEIFSLYTGSRPAVILNSFAVIKEALVTKAQDFSGRPQDFMISHATENKGN 707617 705571 IVLADYGPVLKGHRRFALMTMRNFGLGKQSMEERILGEISHVVDYLDKNA GKRVDPHIMFHNVASNVISLLLFGCRFDYNSEFLQCYIQLINEISKIINGPWNM 705149 703459 IYDTFPLLRILPLPFKKAFDHVKVIKSMNLKLIDEHKSTRVPGEPRDFIDCYLDELDK 703286 703161 GKNCVSTFSEDKLLMSIMDLHFAGTDTISNTLLTAFLYLMNHPEVQ 703024 702766 VKCQQEIDDVLEGKDQVTYEDRHNMPYTLAVIHEVQRVANTVPLSVFHCTTRDTELMGYSIPK 702578 702500 GTIIIPNLTRVLKEEGQWKFPYEFNPANFLNEQGQFEKPEAFIPFST 702360 701098 GLRMCLGEGLARMELFLIFVTLLRRFQFVWPEDAGKPDYTPVFGLTLTPKPYRMHIRRRETVKQ* 700904 CYP2X7 Danio rerio (zebrafish) ctg22265.b CYP2X1 Missing C-term BC053412 AI959373 fd08g05.y1 CK030199 AI959373 zfishC-a2684d06.q1c ctg11087 = BC053412 FILLS IN exons 3,4 in a GAP IN ctg22265 718880 MLEVSVLILICIFLVFFLIRIKRPKNFPPGPPPVPIFGNLLQINMVDPLKEFER 718719 718641 LAEKYGNIFSLYTGSKPAVFLNNFEVIKEALVTKAQDFSGRPQDLMISHL 718492 TGNKG 670408 VVLADYGPLWKDHRRFALMTLRNFGLGKQSMEERILGEISHIVDFLDKNT 
670557 670648 GKTVDPQIMFHNIASNVINLVLFGCRFDYNNEFLRGYIQRIAENLRILNGPWNM 670809 717005 IYDTFPLLRILPLPFKKAFDNVKIIKSMNRKLIDEHKSTRVPGQPRDFIDCYLDELDK 716832 716723 VKNCVST 716703 716703 FSEDQLIMNIMDMSFAGTDTTSNTLLAAFLYLMNHPDVQ () 716439 VKCQQEIDDVLEGKDQVTYEDRHNMPYTLAVIHEVQRVANIVPLSVLHCTTRDTELMGYSIPK 716251 () 716170 GTVIIPNLTVVLKEEGQWKFPHEFNPANFLNEQGQFEKPEAFIPFST 716030 713418 GPRVCLGEGLARMELFLIFVTLLRRF 713341 QFVWPEDAGKPDYTPVFGLTMTPKPYRMHIRRRNTVKQ CYP2X7-de9a Danio rerio (zebrafish) ctg22265.c CYP2X1 pseudogene? C-term 92% to 2X.b zfish41361-135c06.q1c zfish45283253h10.q1k zfish43795-291e06.p1c 720930 LGEGLARMELFLVFVTLLRRFQFVWLEDAGKPDYTPVFRHTMTPKPYRMHIRRR 720769 CYP2X7-de9b Danio rerio (zebrafish) ctg22265.d CYP2X1 pseudogene? C-term 87% to 2X.b 727710 GPRVCLGEGLARMELFLVFVTLLRRYKFVWPRDAGKPDYTPVFGITMTPKPYRMLIRRRDTVQ 727519 CYP2X8 Danio rerio (zebrafish) ctg21275 87% to 2X.a 1267572 MLGSSVLVLICILLVFLLIRIQRPKNFPPGPSPLPIFGNLLHFNLANPLKEFER 1267411 1267339 FAEKYGNIFSLYTGSRPAVFLNSFAVIKEALVTKAQDFSGRPQDFMISHLTECKGN 1267172 1263893 VVLADYGPLWKDHRRFALMTLRNFGLGKQSMEERILGEISHVVGYLDKNI 1263744 1263630 GKTVDPQVMFHNVASNVISLVLFGRRFDYNSETLQCYIQLITEISKILNGPWNM 1263469 1262072 IYDTLPFLRILPLPFKKGFDHVKVLKGMNLKLIDEHKSTRVPGKPRDFIDCYLDELDK 1261899 1261775 RKNEVSTFSEDQLLMYILDLYFAGTDTTSNTLLTAFLYLMNHPEVQ 1261638 1261335 VKCQQEIDDVLEGKDQVSYEDRDNMPYTLAVIHEVQRVANTVPLSVFHCTTRDTELMGYSIPK 1261147 1261074 GTLIIPNLTIVLKEEGQWKFPHEFNPANFLNEQGQFEKPEAFIPFST 1260934 1269368 GPRVCLGEGLARMELFLVMVTLLRRFQFVWPNDAGKPDYTP 1269244 VYGVTLTPQPYRMHIKRRETVRX 1269179 CYP2X9 Danio rerio (zebrafish) ctg9731 exons 1-4 67% to 2X6 first 39 aa = 2X6 100%, FRAMESHIFT IN EXON 3 66640 MLGSSLLVVICILLIFFLIRVKKPKNFPPGPPPVPIFGNMLQLNINNPLKDFER 66479 66305 LANRYGNIYSLYFGSKPWVVLNGFEALKEALVTKAVDFAGRPQDLMVNRVTKGGGE 66138 65961 VILSDYGPSWKE HRRFALMTLRNFGLGKQSMEERILGEVSHIIDKLEKR 65819 65727 GTAFDPQTMFHNAASNIICIVLFGSRYDYDDEFLKLFIHLYTENAKIANGPWAM 65566 ctg21275 exons 5-9 77% to 2X.b trace CF996180 joins these two contigs 1272272 
IYDTFPMFRYLPLPFRKAFANASKARELSTQLVEEHKKTWVPGEPRDFIDCYLDELDK 1272099 1271302 RGNDGSSFSEAQLILYVLDLHFAGTDTTSNTLLTGFLYLMTHPEVQ 1271165 1269858 AKCQQEIDDVLEDKDQASYEDRHSMPYTQAVIHEVQRVANTVPLSVFHCTTKDTELMGYNIPK 1269670 1269601 GTFVIPNLGSALKEEGQWKFPHEFNPANFLNEQGEFEKPEAFVPFSA 1269461 1259306 GPRVCLGEGLACTELFLVFVTLLRRYKFVWPRDAGKPDYTPVFGITMTPKPYRMHIRWRNTVKQ 1259115 CYP2X10 Danio rerio (zebrafish) ctg24117.a 55% to 2X.b 57088 MLTALVLLCLGAFLLYLQLRIRRPKNFPPGPAPVPIFGNLLQLNRISPLKDFD 56930 (0) 56850 KFAQHYGSIYGIYIGSQPAVVLTGQKMIKEALITQAAEFSGRSNNMMFSHVTGGK 56686 (1) 56439 GVIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESA 56287 (1) 56129 GKSIDPQHLYHQAASNIIASIIFGSRFNYKDAYFQTLITSVEDLTKITIGPWAM 55968 (0) 55706 LYEIAPVLRIFPLPFQKAFQYFEQITKHVLKVVEEHKTSRVAGEPRDLIDCYLEEMEN 55533 (0) 55465 KSDHRTSFDESQMVTLLFDLFIAGTETTSNTLRTLTLYLMTYTHIQ 55328 (1) 54962 EQCQREIDEVLGARDHVTYEDRNAMHFVQAVIHEGQRVADIAPLSMFHSAKTDTQLRGYSIPK 54774 (0) 54580 GTIIIPYLSSSLREESQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSA 54440 (1) 54228 GPRVCLGENLARMELFLILVTVLRRFRLVWPKDAGEPDFTYIYGGTQSVKPYRVIVEPRMHGEACKFVD* 54019 CYP2X10 Danio rerio (zebrafish) GenEMBL AY825256 Tseng, H.-P., Corely-Smith, C., Hu, C.-H., Wang-Buhler, J.-L., Hseu, T.-H., and Buhler, D.R. Submitted to nomenclature committee Oct. 
14, 2004 Clone 898HuHP MLTALVLLCLGAFLLYLQLRIRRPKNFPPGPAPVPIFGNLLQLN RISPLKDFDKFAQHYGSIYGIYIGSQPAVVLTGQKMIKEALITQAAEFSGRSNNMMFS HVTGGKGVIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESAG KSIDPQHLYHQAASNIIASIIFGSRFNYKDAYFQTLITSVEDLTKITIGPWAMLYEIA PVLRIFPLPFQKAFQYFEQITKHVLKVVEEHKTSRVAGEPRDLIDCYLEEMENKSDHR TSFDESQMVTLLFDLFIAGTETTSNTLRTLTLYLMTYTHIQEQCQREIDEVLGARDHV TYEDRNAMHFVQAVIHEGQRVADIAPLSMFHSAKTDTQLRGYSIPKGTIIIPYLSSSL REESQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSAGPRVCLGENLARMELFLILVTVL RRFRLVWPKDEGEPDFTYIYGGTQSVKPYRVIVEPRMHGEACKFVD CYP2X11 Danio rerio (zebrafish) ctg24117.b zfishI-a36g12.q1c EXONS 1-7 85% to CYP2 Length = 544 80259 MLTALVLLCLGAFLLYLQVRIRRPKDFPPGPAPVPFFGNLLQLNRINPIKDLDK 80420 80510 FAQHYGSIYGIYIGSKPAVVLTGQKMIKEALITQAAEFAGRPNHMMISHITRSKGS 80677 80848 VIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESA 80997 82938 GKSIDPQHLYHQAASNIIASVIFGSRFNYKDEYFQTLIQTMEKLTKIAIGTWAM 83099 83317 LYEIAPVLRIFPLPFWKAFHYFEKITRHSLKVVEEHKKSFVAGEPKDLIDCYLEEMKK 83490 83572 RADQRTTFDEAQMVTLLFDLYLAGTETTSNTLRTLTLF 83685 88976 EQCQREIDEVLGARDHVTYEDRNDMHFVQAVIHEGQRVADIVPLNVFHTARTDTQLRGYSIPK 89164 92540 GTIIIPYLSSSLREESQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSA 92680 92791 GPRVCLGENLARMELFLILVTVLRKFRLVWPKDAGEPDFTYIYGGTQSLKPYPMIVKLR 92967 CYP2X11-de1 Danio rerio (zebrafish) ctg24117.c EXON 1 94557 MLTALVLLCLGAFLLYLQLRIRRPRNFPPGPAPVPIFGNLLQLNHINPIKDLDK 94718 CYP2X12 Danio rerio (zebrafish) GenEMBL AY825257 EST partial seq CN509498.1 Tseng, H.-P., Corely-Smith, C., Hu, C.-H., Wang-Buhler, J.-L., Hseu, T.-H., and Buhler, D.R. Submitted to nomenclature committee Oct. 14, 2004 Clone s898HuHP full length seq. 
91% to 2X10 MLTALVLLCLGAFLLYLQLRIRRPRNFPPGPAPVPIFGNLLQLN HINPIKDLDKFAQHYGSIYGIYIGSKPAVVLTGQKMIKEALITQGAEFAGRSNKMMVS HVTRSKGVIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESAG KPIDPQHLYHQAASNIIASIIFRSRFDYQDEYFQTLITTMEKLTKIAIGPWAMLYEIA PVLRIFPLPFHKAFQYFEQITNHVLKVVEEHKTSRVAGEPRDLIDCYLEEMNRRSDKH TTFDESQMVTLLFDLFIAGTETTSNTLRTLTLYLMTYTHIQEQCQREIDEVLGARDHV TYEDRNAMHFVQAVIHEGQRVADIVPLSMFHTARTDTQLRGYSIPKGTIIIPYLSSSL REEGQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSAGPRVCLGENLARMELFLILVTVL RKFRLVWPKDAEEPDFTYIYGGTQSLKPYPMIVKLRTPGETHEYAK CYP2X13 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr XIX (-) strand 19940206-19948532 Joanna Wilson and students submitted to nomenclature committee Nov. 6, 2007 72% to Fugu 2X2 MFASIILLLICIVFIVIQLKSRRPKNFPPGPPVWPILGNILDLSLENPLKDFERLRKTYGNVYSLFLGPKPVVVI NEMKTIKEALVTKGVDFAGRPQDLLINDSSERELVMTDYGSSWKEQRRFALMNLRNFGMGKDSMEERIHGEIQYT VDTLEKSIGKSFSPQNMFHNAASNIICQVLFGKRFEYEDETIKTVVQCFTENAKIANGPWAMIYDSFPLIRSLPL PFRRAFKNVETCRKIAKSLMNEHKQTRVPGEPRDFVDCYLDRLDK (0) PGDRSSFSEAQLTMYILDLHFAGTDTTSNTLLTGFLYLMNYPHVQ (1) EPVFKYGNMIFKYFFI ERCQQEIDMVLEGKDQASSEDRNNMPYVQ (0) AVIHEFQRVANTVPLSIFHSTTKDTELNGYSIPKGTLIIPNLT SVLNEEGQWKFPNEFNPENFLNDQGEFVKPEAFMPFSAGPRMCLGEGLARMELFLFTVTLLRKFKFIWPEDAGEP DFTPVYGVTLTPKPYRMKVQLRVSQKIPH* CYP2X14 Oryzias latipes (medaka) chr6 21423000:21438000 Joanna Wilson and students submitted to nomenclature committee Jan. 
25, 2008 67% to Fugu 2X2 MFVSLILLWLCICILFLQLKPRRPKNFPPGPPVLPMLGNLLHLSLDNPLKDFDRLRNSYGNVYSLFLGPKPAVII NGFKAMKEAMVIKATDFAGRPQDLFVNDVSKRKGVILADYGESWRDHRRFALMTLRNFGLGKKSMEERISEEIQH TIKTLENNIGKLFSPQIMFHNAASNIICQVLFGKRFEYDDEIIKTIVQCFTRNSKIANGPWAMIYDSIPLIRKLP LPFREAFKNAEICVDVGTHLVNEHKETRIPGKPRDFVDCYLDEMEKVRGDDSSFSEDQLIIYALDLHFAGTDTTS NTLLTGFFYLINYPHIQDKCQQEIDRVLEEKQQVTFEDRHNMPYMQAVIHEVQRIANTVPLSVFHSTTKETELMG YTIPKGTMIIQNMGSVLREDGQWKFPHDFNPENFLNEKGEFVKPEAFMPFSAGPRMCLGEGLARMELFIIMVTLL RKFKFTWPEDAGEPDFTPVYGVTLTPKPYFMKVQLRSKP* CYP2X fragment a Fugu rubripes (pufferfish) No accession number Scaffold_9193 Length = 9721 51% to scaf 4007 possible exon 1 of 2X3 or 2X4 LGL47087.y1 Length = 725 2 family N-term exon 1 333 MLVSLALLLAAAFGLWVFFQIQRPKNFPPGPPPIPLFGNLLEIQLDNPIADLER 172 (0) CYP2X fragment b Fugu rubripes (pufferfish) No accession number possible exon 2 of 2X3 or 2X4 LED83776.x1 75% to scaf 4007 exon 2 not in new version of fugu databases LAKRYGNVYGLFLGSRPAVVINGVSAL 2Y Subfamily CYP2Y1 Fugu rubripes (pufferfish) No accession number Scaffold_39a from an early version of the genome 12087 MDLTVMLLTATLLLVVLWILNAHTRKHTRLPPGPRGIPVLGNLLQLDKKAPFKSLLK 11917 (0) 11768 LSENYGPVLTVALGPQRTVVLVGYEAVKDALVDHADDFTGRGPVPFLMKVTRGY 11607 (1) 11166 GLAISNGERWRQLRRFTLTTLRDFGMGRKGMEEWIQEESKHLVTRIKSTE 11011 (1) 10937 GAPFDPTFFLSCTVSNVICCLVFGQ 10869 frameshift 10867 GFSYDDEHFLSLLHIISETIQFGSSASGL 10781 (0) 10700 MYNLFPRLMEWLPGRHREMFGKIEKVRAFTMEKIEEHQDTLDPSSPRDYIDCFLMRLQQ 10524 (0) 10452 EKPQPNTEFNYDNLVSTVLNLYLAGTETTSSTIRYALNVLIRHPKIQ 10312 (1) 10187 EKMQEDIDSVIGQGRCPYVEDRKSLPFTDAVLHEIQRYLDMIPFSIPHYALQDISFRGYTIPK 9999 (0) 9924 DTLIIPLLHSVLKDDKMWETPGSFN 9850 frameshift 9850 PLQHFVDGNGSFKKNPAFLPFSAG 9779 (1) 9687 GKRACVGESVARHGDIPSLSSHLVQHFTLSX 9595 frameshift 9593 PGGPDSVDLTPEYSSFANVPRKYKIIATPRCNKRLCIVI* 9471 GenEMBL CAAB01000830 WGS section of Genbank 25-JUL-2002 Note: the frameshift in exon 7 did not exist in the earlier version above This is probably a sequence error 19218 
MDLTVMLLTATLLLVVLWILNAHTRKHTRLPPGPRGIPVLGNLLQLDKKAPFKSLLK (0) 19048 18899 LSENYGPVLTVALGPQRTVVLVGYEAVKDALVDHADDFTGRGPVPFLMKVTRGY (1) 18738 18297 GLAISNGERWRQLRRFTLTTLRDFGMGRKGMEEWIQEESKHLVTRIKSTE (1) 18148 18074 GAPFDPTFFLSCTVSNVICCLVFGQRFSYDDEHFLSLLHIISETIQFGSSASGL (0) 17913 17832 MYNLFPRLMEWLPGRHREMFGKIEKVRAFTMEKIEEHQDTLDPSSPRDYIDCFLMRLQQ (0) 17656 17585 EKPQPNTEFNYDNLVSTVLNLYLAGTETTSSTIRYALNVLIRHPKIQ (1) 17445 17323 EKMQEDIDSVIGQGRCPYVEDRKSLPFTDAVLHEIQRYLDMIPFSIPHYALQDISFRG 17147 17145 YTIPK (0) 17131 17056 DTLIIPLLHSVLKDDKMWETPGSFNPQHFLDGNGSFKKNPAFLPFSA (1) 16916 16823 GKRACVGESLARMEIFLFVVSLVQHFTLSCPGGPDSVDLTPEYSSFANVPRKYKIIATPRWQ* 16635 CYP2Y2 Fugu rubripes (pufferfish) No accession number Scaffold_39b from an early version of the genome 15595 MEFSVTLILAGLVLAFFWFILQKRKYNLPPGPTTLPLVGNLPQLDKKQPFKSFTE 15431 (0) 15356 LSKSYGPVMTLYLGWQRTVVLTGYEVVKEALVDQAEDFTGRGPLPFLLKATNGY 15195 (1) 15078 GLGISNGERWRQLRRFTLSTLRDFGMGRKGMEEWIQEESKHLTARIKTLK 14944 (1) 14815 VKPFDPTFLLGCTVSNVICCMVFGERFSYDDKQFLELLRVIAEVLRFNSSFLGQ 14654 (0) 14549 MYNVFPWILEHLPGPQHTMFSHVNFLREFIKKKIQEHKESLDPSSPRDYIDTFLIRMEQ 14373 (0) 14282 EKNLPNTEFHYENLVSTVLNLFLAGTETTSSTLRYALGVLIKHPNVQ 14142 (1) 14046 EEMQREIDNVVRQDQCPKMEDRKSLPFTDAVIHEVQRFLDIVPFGLPHYALKDITFRGYSIPK 13858 (0) 13775 GTVIIPLLHSVLKGDQWETPWAFNPKHFLDQNGSFKKNSAFLPFSA 13638 (1) 13390 GKRSCVGESLARMELFIFLVTLLKDFTFSCIEGPDSISLNPQYSGFANLPRNYEIVATPR* 13208 GenEMBL CAAB01000830 WGS section of Genbank 25-JUL-2002 22434 MEFSVTLILAGLVLAFFWFILQKRKYNLPPGPTTLPLVGNLPQLDKKQPFKSFTE (0) 22270 22195 LSKSYGPVMTLYLGWQRTVVLTGYEVVKEALVDQAEDFTGRGPLPFLLKATNGY (1) 22034 21935 GLGISNGERWRQLRRFTLSTLRDFGMGRKGMEEWIQEESKHLTARIKTLK (1) 21786 21654 VKPFDPTFLLGCTVSNVICCMVFGERFSYDDKQFLELLRVIAEVLRFNSSFLGQ (0) 21493 21388 MYNVFPWILEHLPGPQHTMFSHVNFLREFIKKKIQEHKESLDPSSPRDYIDTFLIRMEQ (0) 21212 21121 EKNLPNTEFHYENLVSTVLNLFLAGTETTSSTLRYALGVLIKHPNVQ (1) 20981 20885 EEMQREIDNVVRQDQCPKMEDRKSLPFTDAVIHEVQRFLDIVPFGLPHYALKDITFRGYSIPK (0) 20697 20614 
GTVIIPLLHSVLKGDQWETPWAFNPKHFLDQNGSFKKNSAFLPFSA (1) 20477 20296 GKRSCVGESLARMELFIFLVTLLKDFTFSCIEGPDSISLNPQYSGFANLPRNYEIVATPR* 20047 CYP2Y3 Danio rerio (zebrafish) GenEMBL ESTs CK016257, CK869788, CK706387, CB891035 Zebrafish blast server May 04 sequence NA1608 62% to 2Y1 and 64% to 2Y2 45% to CYP2B6 45% to 2B3 30425 MELSSSLLLVLVLTVLMLIRWRRKENGLSLPPGPLALPLIGNLLTLDKSAPFKSFMK 30595 (0) 32151 WRKTYGSVMTVHLGPQRMVVLVGYETVKEALVDQAEDFAPRAPIAFMNRIVKGY (1?) GLAISNGERWRQLRRFTLTTLRDFGMGRKQMEQWIQEESRYLLKSFEETK 32519 (1?) 32651 SKPVDPTFFFSRTVSNVICSLVFGQRFDYEDKNFLQLLQIISKLLRFLSSPWGQ 32812 (0?) 33063 LYNIFPQVMERFSSRHHAILKDVENIRTFIRNKVKEHEQRLDFSDPSDFIDCFLIRLTQ 33239 (0?) 33356 EKDKRKLDTEFHKDNLMATVLNLFVAGTETTSTTLRYALMLLIKHPQIQ 33502 (1?) 34553 EQMQREIDRVIGQNRIPTMEDRKSLPFTDAVIHEVQRYMDIVPLSLPHYAMKDITFRGYKIPK 34741 (0) 34907 DTVIIPMLHSVLRDEGQWETPWTFNPEHFLDSNGNFKKNPAFMPFSA 35047 (1) 35424 GKRSCVGESLARMELFLFTVSLLQKFTFSSPNGPDGIDLSPELSSFANMPRFYELIASPR* 35606 CYP2Y4 Danio rerio (zebrafish) GenEMBL EST AL916779 Zebrafish blast server May 04 sequence NA1608 42397 MELSSSLLLVLVLTVLMLIRWRRKENGLSLPPGPLALPLIGNLLTVETSAPFKSFMK 42567 (0) missing exon 2 XXXXTNGERWRQTERFTLTTLRDFGMGRKRMEQWIQEESRYLLKSFEETK SKPVDPLFFMSRAVSNVICSLVFGQRFDYEDKNFLQLLQIISNLMRFASSPWGQ LYNIFPKVMEILPGRHHTMFGEIDDLKSSIMTII 44325 44326 KEHEENLDPSDPKDFIDCFLIRLNQ (0?) QEKHNPDT 44524 44525 EFHKENMFATSLNLFTAGTETTSTTLRYALMLLIKHPHIQ 44989 EQMQREIDCVIGQNRIPTMEDRKSLPFTDAVIHEVQRCLDIAPLNVPHYALKDITFRGYKIPK 45177 (0) DTVIIPMLHSVLRD 45348 45349 EGHWETPWTFNPEHFLDSNGNFKKNPAFMPFSA 45447 (1) 46751 GKRVCVGESLARMEIFLFIVSLLQKFSFSSPNGPDSIDPSPELSSFGNMPRLYELIASPR 46930 CYP2Y5 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr I (-) strand 16588689-16592714 Joanna Wilson and students submitted to nomenclature committee Nov. 
6, 2007 68% to Fugu 2Y1, 70% to 2Y2 MDFSATVFLAGLILALLWLFGVKNRRKYLLPPGPFALPLIGNLPQLDKNAPFKSILKFSETHGPVMTVHLGWQRV VFLVGYDAVKEALVDQGDDFTGRGPLPFLMKVTKGYGLAISNGERWRQLRRFSLSTLRDFGMGRKGMEVWIQEES RHLRARMESFKASPFNPRFLLSRTVSNVICCLVFGERFGYEDKKFLHLLNTISEVLDFLNSPVGQLYNIFPWLMG HLPGSQHACFAKAEKLREFIETKIHQHKATLDPSSPRDFIDCFLIRINQEKDNPKTEFHYENLISTVLNLFLAGT ETTSSTIRFALSVLIKYPNIQEKMQTEIDGVIGQSCVPSMENRKSLPFTDAVIHEVQRFLDIVPFSIPHYALHDI SFRGYTIPKDTMIIPMLHSVLKEERNWATPQSFNPQHFLDQNDNFKKNPSFLPFSAGKRACVGESLARMELFIFL VSLLQNFTFSSTGGPDSINLIPEYSSFANLPRTYQIIATPR* CYP2Y6 Oryzias latipes (medaka) chr 13 2357422:2368485 Joanna Wilson and students submitted to nomenclature committee Jan. 25, 2008 68% to Fugu 2Y1 MDLSTSLILVVLTTVLLWLLNRRNSRKQHLPPGPPALPLIGNLLQLDKKRPFRTIVELSKTHGPVMTIYMGWQRA VALVGYDAVKEALVDQADDFVGRAPLPFLYRATRGYGIGISNGERWRQLRRFALTTLRDFGMGRKGMEQWIQEES RHIRAKINTFKGKPFDPTFILSCTVSNVICCLVYGERFNYDDKQFLELLQIISEVPRFNSSPMGAMYNLFPWLME RLPGRQHTIFGYIEDIRKFAKNKIQEHKDKLDPSSPRDFIDCFLLRMDQEKDNPTSEFHYENLLAMVLNLFLAGT ETTSSTIRYALSVLIKHPKIQEKMQEEIDSVIGRERCPSMEERKSLPFTDAVIHEVQRFMDLTPFSLPHYSLKDI SFRGYTIPKDTMIFPMLHSVLREDKLWSSPWSFNPQNFLDQNGNFKKNPGFVPFSAGKRACVGESLARMELFLFI VSFLQDFTFSAPNGPDSINLVPEYSSLANLPRRYELIATPR 2Z Subfamily CYP2Z1 Fugu rubripes (pufferfish) No accession number Scaffold_2993a MGLIVSVFGSHADWSISTLLLFTAVFILMVNWIRNRRPPSFPPGPWTLPVVGNMHNLAHHRMHLNLME (0) 16293 LAETYGNVFSIQLGQEWMVVLNGPTILKEALVNQGDSVADRPNLQLIIDSCHGL (1) 16785 GLGFSSGHLWKQQRQFAISTLRYFGSGSKSLEPVVLEEFAHCAKQFSEFK 16937 (1) 17023 GKPFAPQLMFYNIVTNIICSLVFGHRFEYGDKNFEKLMNSFGRCLQIEASVCAQ 17184 (0) 17262 LYNSFPRLMGCLPGPHQTVKRIYQNIRDFIREEMKEHKKGLDPSTPRDYIDCYLNKIKK 17435 (0) XXXXXXXXXXXNL (fs) VIX (fs) VWNX (fs) FVPX (fs) TNT (fs) TTFTYRWLFLFMA (fs) KYPEMQ (1) 17899 EKVQAEIDEVIGQSRRATMDDCVNMPYTNAVIHESLRMGNVVPLSLLHATGRDIQLEGYTIPK 18087 (0) 18158 GTTVIANLTSALFDKNEWETPFAFNPGHFLDEEGRFRKRTAFLPFSA (1) 18388 GRRLCLGENLARMMLFLFFTSFMQDFTISFPAGVSPAMEYHHFGVTLAPHPFDICAVSR* 18567 CYP2Z2 Fugu rubripes (pufferfish) No accession number Scaffold_2993b 
MHWIFDLIGSFLAGDFKSLLFFLLIFILTADYLRNRRSGSFPPGPMAIPIIGNMLSLDRSRTHESLTQ (0) 21437 LAETYGNVYSLRTGQTWMVVVNSFKVVREALVTHGESVSDRPDLPLQDEIAHGK 21273 (1) 20946 GVISSNGHLWKQQRRFALSTLRLFGFGKKSLEPFITDEFTHCANIFRSYK 20815 (1) 20726 GKPLPPHLILNNVVSNIICSLVFGHRFEYGDKNFKNLIKLFDQSLQIEASVWAE 20565 (0) 20473 LYNSFPLLMKHVPGPHQTVKKIWNEVKDFVRNELKEHRKNWDPSDPRDYIDCYLREIQX 20300 (?) gap at boundary 19990 SGQSDSTFDEENLVICVMDLFVPGSETTSTTLRWAFLYMAKYPEIQ (1) 19748 EKVQAEIDRVVGQSRPLTMDDRVNLPYTDAVLHEIQRFGNIVPLSLPHVTNKAIQLEGYNIPK 19560 (0) 19470 GIMIIPNLTSALFDKNEWETPCTFNPGHFLDNEGKFRKRAAFIPFSA 19330 (1) 19220 GKRLCLGENLARMELFLFFTSFMQHFTFSMPAGVKPDMSFRFGVTLAPKPYEICAIPR* 19044 CYP2Z3 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr VIII (-) strand 15162832-15165857 Joanna Wilson and students submitted to nomenclature committee Nov. 6, 2007 51% to Fugu 2N9, 71% to CYP2Z2 probable ortholog of CYP2Z2 MDSIFSICGSYFTLDVKSFLLFAVVFLLSADYIKNRRPGSFPPGPPALPIVGHIFNLDYKRVHVSLTQ LAGRYGDVYSLRMGHRWMVVLNGITVLKEALVTQGDSLADRPDLPLQHDIAHGL GVIFSNGNTWKQQRRFALSALRHFGFGK KSLEPVILDEFTYCVKDFNSHKGKPFDPHLIVNNVVSNVICSLVFGHRFEYGDEKFLKLMKWFGDALELEASIWA QLYNSFPVLMRRLPGPHKDLQHIWNNVKDFIGVELKEHKQNWDPSDQRDYIDCYLNEIQTGQADNTFDEENLVLC VLDLFLAGSETTSTTLRWAFLYMVKYPEIQAKVQAEIDRVIGQSRLPSMEDRANMPYTDAVIHEVQRMANIVPLS LPHITSKDIQLGGYTIPKGVTIIPNLTSVLFDKNEWETPDTFNPGHFLNAEGKFVKSAAFIPFSAGKRLCLGENL AKMELFLFFTSFMQRFTFSMPPGVKPVMDFRFGITLAPFPYEVCVTSR* CYP2Z4 Gasterosteus aculeatus (three-spined stickleback) UCSC browser Chr VIII (-) strand 15162832-15165857 Joanna Wilson and students submitted to nomenclature committee Nov. 
6, 2007
missing exon found in ESTs DN671369.1, DW642948.1
revised seq
59% to 2Z2
MDQLSGVSSTWLWLDGRSLLLFTLVVLVTAEYLRARRPSGFPPGPWPFPLVGNMFSLDPSNVHGDMTK (0)
LAEKYGKVYSLKMGPLWSVVLNGLSAVQEGLAEGDYANGRPDFAIHSDVLPEL (1)
GIVFSNGH
SWKQQRRFALITLKYFGVGKKSLESSILEEFIHASKEIASHEGKPFKPNVLMRNAVSNIICALVFGHRFEYSNEK
FQKMLTLLDNGTRIEASIWAQMYNAFPVLMRRLPGPHRTLQGIYGEILDLIKTEVDQHREDFNPSEPRDFIDCYL
NEMEKVADAGFNEDNLLMCSFDLFGAGTETTSTTLLWAFLYMAKYPEIQAKVQAEVGRVIGPSRQPSMKDRANMP
YTDAVIHEVQRIGNIVPLSLPHITSRDVQLGGYTIPKGVTIIPNLTSVLFDKNEWETPDTFNPGHFLNEEGKFVK
PAAFIPFSAGRLCLGENLARMELFLFFSSFMQRFSWSMPAGVEPLLKPRFGITLSPEPYEICAISR*
CYP2Z5 Oryzias latipes (medaka)
chr4 31513782:31524077
Joanna Wilson and students submitted to nomenclature committee Jan. 25, 2008
70% to Fugu 2Z2
MDLFSSTIGLMLEWDLKSLLLFLSVFIITADYIKNRRPLSFPPGPPGLPILGNIFTVDVGRPHESFSKLAAEYGD
LYSLRFGQRWTVVLNGHKALKEALVTKGDSVVDRPHLPLQDEIAKGLGVIFSNGANWTEQRRFALSTLRYFGFGK
KSLEPVILNEFAHCAEELKRFKGEPLDPHLIINNTVSNIICHLVFGHRFNYGDKKFKKLMLLFDRALQIEASIWA
QLYNSFTLIMRCLPGPHKTLQHIWREVQDFIGEELKEHKKSWDPSDARDYIDFYLTEIQKTKGQEGSTFDEENLI
MCVLDLFVAGSETTSTTLRWAFLYMAKYPEIQEKVQAEIHKVIGKSRPPCMEDRAELPYTDAVIHEVQRIGNIVP
LSLPHATNKDVQLGGFTIPKGVLIIPNLTSVLFDEKEWETPHAFNPGHFLNKDGKFVKRGAFIPFSAGKRLCLGE
NLARMELFLFFTSFMQHFSFSMPAGVEPVLDYRAGLTLAPKPYKICVQASSEK*
2AA Subfamily
CYP2AA1 Danio rerio (zebrafish)
GenEMBL AF497969
Afonso Bainy and John Stegeman
74% to 2AA2
submitted to nomenclature committee 4/5/02
MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRINSYKFRFPPGPT
PLPFVGNLPHFLKSPMEFIRSMPQYGEMTTIFFGRKPVIMLNTIQLAKEAYVQDAFSG
RPAIPLFDWITNGLGIVMVTFNNSWRQQRRFALHTLRNFGLGKKTVEDRVLEESRYLI
AEMLKEEGKSMNPQHALQNAISNIICSIVFGDRFEYDNKRFEYLLKTLNENIMLAGSA
AGQIFNLVPFIKHFPGPHQKIKQNADELLGFIRDEAKEHKQTLDPDSPRDFIDAYLLE
IEKQKSSKDSTFHEENLVVSASDLFLAGTDTTETTIRWGLINLIQNPDVQERCHEEIV
RVLGYDRLPSMDDRDKLPYTLATVYEIQRCANIAPNVMHQTILPTRLHGYDIPQGTII
LTNLAAIFSNKDHWKHPDAFNPENFLDENGHFSKPESFIPFSLGPRVCLGETLARTEL
FLFITALLQRIRFSWPPDAKPIDMDGIMGLVRSPQTFNVVCHSRDNVK
CYP2AA1 Danio rerio (zebrafish)
Chr 23
2AA1 partial seq missing exons 7-9 (broken gene may
indicate incorrect genome assembly here)
1 66 + Chr:23 38951974 38952171 - 345 2AA1 2 diffs
211044 MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRIHSYKGRFPPGPTPLPFVGNLPHFLKSPMEFIRS 210850
62 117 + Chr:23 38949574 38949741 - 453 2AA1 1 diff
208602 MPQYGEMTTIFFGRKPVIMLNTIQLAKEAYVQDVFSGRPAIPLFDWITNGL 208450
116 214 + Chr:23 38945155 38945454 - 439 2AA1 1 diff
204324 GIVMVTFNNSWRQQRRFALHTLRNFGLGKKTVEDRVLEESRHLIAEMLKEE 204172
168 268 + Chr:23 38944809 38945129 - 471 2AA1 100%
204002 GKSMNPQHALQNAISNIICSIVFGDRFEYDNKRFEYLLKTLNENIMLAGSAAGQ 203841
222 395 + Chr:23 38944376 38944861 - 529 2AA1 100%
203734 IFNLVPFIKHFPGPHQKIKQNADELLGFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 203561
Chr:23 38944462 38944602 - 2AA1 1 diff
203475 QKPSKDSTFHEENLVVSASDLFLAGTDTTETTIRWGLINLIQNPDVQ 203335
3 exon fragment (exons 7, 8, 9), 2AA1-like sequence. This gene is broken by an insertion of 2AA8 exons 1-8. I think the 2AA8 sequence needs to be moved to reunite the 2AA1 fragments and make a whole 2AA1 and a whole 2AA8.
318 388 + Chr:23 38934350 38934556 - 550 2AA1 2 diffs
ERCHEEIVRVLGYDRLPSMDDRDKLPYTLATVYEIQRCANIAPNVMHQTILPTKLHGYNIPQ 193223
359 440 + Chr:23 38931225 38931455 - 446 2AA1 1 diff
GTIILTNLAAIFSNKEHWKHPDAFNPENFLDENGHFSKPESFIPFSL 190116
436 498 + Chr:23 38927597 38927785 - 549 2AA1 100%
186658 GPRVCLGETLARTELFLFITALLQRIRFSWPPDAKPIDMDGIMGLVRSPQTFNVVCHSRDNVK 186470
8 aa diffs to original Stegeman sequence AF497969, 3kb upstream of 2AA10
211044 MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRIHSYKGRFPPGPTPLPFVGNLPHFLKSPMEFIRS 210850
208602 MPQYGEMTTIFFGRKPVIMLNTIQLAKEAYVQDVFSGRPAIPLFDWITNGL 208450
204324 GIVMVTFNNSWRQQRRFALHTLRNFGLGKKTVEDRVLEESRHLIAEMLKEE 204172
204002 GKSMNPQHALQNAISNIICSIVFGDRFEYDNKRFEYLLKTLNENIMLAGSAAGQ 203841
203734 IFNLVPFIKHFPGPHQKIKQNADELLGFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 203561
203475 QKPSKDSTFHEENLVVSASDLFLAGTDTTETTIRWGLINLIQNPDVQ 203335
ERCHEEIVRVLGYDRLPSMDDRDKLPYTLATVYEIQRCANIAPNVMHQTILPTKLHGYNIPQ 193223
GTIILTNLAAIFSNKEHWKHPDAFNPENFLDENGHFSKPESFIPFSL 190116
186658
GPRVCLGETLARTELFLFITALLQRIRFSWPPDAKPIDMDGIMGLVRSPQTFNVVCHSRDNVK 186470 CYP2AA2 Danio rerio (zebrafish) AI657973 fc19c11.y1, AI958603 fc94a10.y1, AI544967 fb69h12.y1 BI887677 AI444248 fb40e01.y1 zfishC-a1385b03.q1c zfishC-a2172h09.q1c zfishG-a67c10.q1c these last three are from the zebrafish blast server 48% to 2J1 74% to CYP2AA1 intron phases from closely related zebrafish genomic sequences MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFNRL (0) exon 1 MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPAIDWTSNGC (1) exon 2 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE (1) exon 3 GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ (0) exon 4 IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK (0) exon 5 QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ (1) exon 6 ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ (0) exon 7 GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGV (1) exon 8 GPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFSIICCSRDTKE* exon 9 CYP2AA3v1 Danio rerio (zebrafish) BC055136 ctg14330 Zv3 05/2004 zfishC-a1177h12.q1c Z35723-a631b05.p1c zfishI-a76h10.q1c 131529 MFTALLKLDLASVGLTLFLGLIILVLFEIFRIHSYKRRTPPGPTPLPFVGTIPHFLKNPLGFIR 131720 131812 MSQYGDMSTMYLGRKPAIFLNTIQLAKETLVQDTFSGKPYLPVIEWISKGFG 131967 132158 GIAMVTFNHSWRQQRRFALHTLKNFGLGKKSVEDRVLEESRYLIAEM LKDEGKPVDPHHPIQNAVSNIICSIVFGDRFEYNNKRFEYLLKMLNETIMLAGSASGR IFNLVPFIKHFPGPHQKIKKNTDDLLIFLGDEVEEHRKTLDPGSPRDFIDAYLLEIEK QKFNKDSTFHEGNLLASAGDLFMAGTDTTETTIRWGLLLLIQNPDVQERCHEEIVRVL GYDRLPSMNDRDRLPYTLATVHEIQRYGNIIPILIHETILPTKLQGYSIPQGTTIVTN IQAIFSSKDHWKHPDTFNPENFLEDGHFIKPESFIMFSLGPRSCLGEMLARTELFLFI TSLLQRIHFSWPPDAKPIDMDGIFGLVHSPQTFNVICRSRDTK CYP2AA3v1 Danio rerio (zebrafish) GenEMBL AL923007 Tseng, H.-P., Wang-Buhler, J.-L., Hu, C.-H., Hseu, T.-H., Peng, J.R. and Buhler, D.R. Submitted to nomenclature committee Oct. 
14, 2004
JR12
Note: multiple ESTs and mRNAs support both 2AA3v1 and 2AA3v2, even though they have only 6-8 aa differences.
CYP2AA3v2 Danio rerio (zebrafish)
GenEMBL CK698285.1 EST and UCSC genomic seq.
CYP2AA3v2 Danio rerio (zebrafish)
ctg14330 (7 aa diffs in the last four exons to 2AA3v1)
131529 MFTALLKLDLASVGLTLFLGLIILVLFEIFRIHSYKRRTPPGPTPLPFVGTIPHFLKNPLGFIRS 131723 (0)
131812 MSQYGDMSTMYLGRKPAIFLNTIQLAKETLVQDTFSGKPYLPVIEWISKGF 131964 (1)
132158 GIAMVTFNHSWRQQRRFALHTLKNFGLGKKSVEDRVLEESRYLIAEMLKDE
133974 GKPVDPHHPIQNAVSNIICSIVFGDRFEYNNKRFEYLLKMLNETIMLAGSASGR 134135 (0)
136328 IFNLVPFIKHFPGPHQKIKKNTDDLLIFLGDEVEEHRKTLDPGSPRDFIDAYLLEIEK 136501 (0)
136597 QKFNKDSTFHEGNLLASAGDLFMAGTDTTETTIRWGLLFLIQNPDVQ 136739 (1)
ERCHEEIVQVLGYDRLPSMDDRDRLPYTLATVHEIQRYGNIIPILIHETILPTKLQGYSIPQ 140163 (0)
143088 GTTIVTNIQAIFSSKDHWKHPDSFNPENFLEDRHFIKPESFIMFSL 143225 (1)
143308 GPRSCLGEILARTELFLFITSLLQRIHFSWPPDAKPIDMDGIFGLVHSPQAFNVICRSRDTK 143493
CYP2AA4 Danio rerio (zebrafish)
ctg14330 77% to 2AA1 missing exons 1,2 dup exons 3,8
zfishB-a33e04.q1c zfishB-a46b05.q1c zfishC-a2901c10.p1c zfishK-a149h03.q1c
AI266900 (exons 1,2,3)
This is an older version of the sequence; use the newer version below.
MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFNRL
MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPVIDWTSNGC
716 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSIESRVLEESQYLFAELLKDE 868
1337 GKSMNPQHALQNAVSNIICSIVFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 1498
4286 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLEIEK 4459
QKSNKDSTFQEENLIGSAIDLFFAGTDSTATSIRWGLLFLIQNPDVQ 4684
5758 ERCHEEIVQVLGYDRLPCMDDCDRLPYTHATVHEIQRFAKTVPFGVFHETIWPTKLHGFDIPQ 5946
6575 GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESYIPFSL 6715
8861 GLRACIGESLVRTELFLFATVLLQRIHFSWPPNAKPIDMDGIMGLVHSPQTFNVICRSRDTK 9046
CYP2AA4-ie3 Danio rerio (zebrafish)
ctg14330 dup exon 3
1089 QRRFALHTLRNFGLGKKTVEDRVLEESQYLIAEMLKDEG
CYP2AA4-ie8 Danio rerio (zebrafish)
ctg14330 dup exon 8
6375 GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESY 6500
CYP2AA4 Danio rerio (zebrafish)
Chr 23 96% exons 4,9 do not match older version of 2AA4 above CK697338.1 only has three diffs in exon 4 to this seq EB851360.1 matches exon 3 and exon 4 to YDNK 100% EB982730.1 matches exon 4 and part of exon 5 with 1 diff near the end There is EST support for this exon 4 in context. No ESTs match the old exons 4 or 9 No exact match for the old exons 4 or 9 is found in the new assembly 275732 GEPVNPHHALQNAVSNIFCSIMFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 275571 278301 MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFNRL 278107 278025 MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPVIDWTSNGC 277873 276540 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSIESRVLEESQYLFAELLKDE 276388 GEPVNPHHALQNAVSNIFCSIMFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 275571 272893 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLEIEK 272720 272635 QKSNKDSTFQEENLIGSAIDLFFAGTDSTATSIRWGLLFLIQNPDVQ 272495 271423 ERCHEEIVQVLGYDRLPCMDDCDRLPYTHATVHEIQRFAKTVPFGVFHETIWPTKLHGFDIPQ 271235 270806 GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESYIPFSL 270666 268511 GLRACIGESLVRTELFLFATVLLQRIHFSWPPDAKPLDMDGIVGIVRYPQTFSIICCSRDSKK 268323 Chr:23 39019324 39019347 - 2AA4 100% 278301 MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGF 278116 62 117 + Chr:23 39018997 39019164 - 315 2AA4 100% 278025 MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPVIDWTSNGC 277873 116 168 + Chr:23 39017512 39017670 - 375 2AA4 100% 276540 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSIESRVLEESQYLFAELLKDE 276388 55 222 + Chr:23 39016695 39017207 - 347 zfishB-a496h01.q1c 100% GEPVNPHHALQNAVSNIFCSIMFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 275571 222 292 + Chr:23 39013805 39014020 - 422 2AA4 100% 272893 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLEIEK 272720 273 326 + Chr:23 39013622 39013774 - 305 2AA4 100% 272635 QKSNKDSTFQEENLIGSAIDLFFAGTDSTATSIRWGLLFLIQNPDVQ 272495 327 388 + Chr:23 39012362 39012550 - 394 2AA4 100% 271423 ERCHEEIVQVLGYDRLPCMDDCDRLPYTHATVHEIQRFAKTVPFGVFHETIWPTKLHGFDIPQ 271235 388 440 + Chr:23 39011775 39011936 - 406 2AA.e 100% 270806 
GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESYIPFSL 270666 433 498 + Chr:23 39009450 39009644 - 408 new 84% to 2AA4 268511 GLRACIGESLVRTELFLFATVLLQRIHFSWPPDAKPLDMDGIVGIVRYPQTFSIICCSRDSKK 268323 Chr:23 39003564 39003746 - 2AA5 exon 9 2 aa diffs 5.7kb downstream 262622 GPRSCLGETLAKTELFLFITSLLQRIRFSWLPDAKPLDMDGIMGIVRYPQPFSIICCSRDTK 262437 CYP2AA5X Danio rerio (zebrafish) ctg14330 90% to 2AA2 This sequence discontinued since it is probably an incorrect assembly of 2AA9 77910 MFTALLKLDLAFVGLTLFLGLIFLVVFEISRIYSYKCRFPPGPTPLPFVGNLPHLLKKPMEFIRX 78101 78196 LSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAGRPHLPIIEWITKGL 78348 78622 GIVMVTFNHSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE 78774 78980 GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ 79141 IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK 81107 QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ 81249 85877 ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ 86065 86197 GTIIMTNLAAILSDKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGVG 86340 95083 PRSCLGETLAKTELFLFITSLLQRIRFSWLPDAKPLYMDGIMGIVRYPQPFIIICCSRDTK 95265 CYP2AA6 Danio rerio (zebrafish) NA16005 Exons 4-7, 9 fd54c03.y1 AW019538 = fd54c03.x1 AI658337 fc21h01.y1 fc21h01.x1 CA473712 73% to 2AA1 MLAALLKLDLSSVGLSLFLGLIFLVLFEIFRIHSCKGRFPPGPTPLPFVGSIPHFLNNPMGFIKS LSQYGEMTTVYPGRKPAIILNTLQLMKEALVQNGSSFSGRPPVPVFNWVTDGY GIVMATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISEFLKAE 4444 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENIIQAGSLVGQ 4605 (0) 7506 VFNLVPIIKHVPGPHQKIYQNGQAFKSFIRESVKEHRQTLDPDSPRDFIDAYLLEMEK 7679 7891 QKSTQDSTFHEDNMVMAVGDLFLAGSDTTATTIRWGLIYLTQNPDVQ 8031 10145 ERCHEEIVRVLGFDRLPSMDDRDRLPYTLATVHEFQRCANLVPTGVPHETTQATKLRGYDIPQ 10334 10407 GTQILINLTEILTNKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 10547 GPRACLGETLAKAELFLFVTSLLQRIRFSWPTGEKLPDMNGIFGIVRSPKPFNIICHSRGSKH* 10813 CYP2AA6-ie6 Danio rerio (zebrafish) NA16005 Duplicate exon 6 (3 aa diffs) 9660 QKSTQDSTFHEDNMVMSVGDLFFAGTDTTATTIRWGLLYLTQNPDVQ 9800 CYP2AA6 Chr:23 39065253 39065285 - 324158 
MLAALLKLDLSSVGLSLFLGLIFLVLFEIFRIHSCKGRFPPGPTPLPFVGSIPHFLNNPMGFIKS 324045 62 148 + Chr:23 39064791 39065066 - 308 2AA6 100% 323927 LSQYGEMTTVYPGRKPAIILNTLQLMKEALVQNGSSFSGRPPVPVFNWVTDGY 323769 52 168 + Chr:23 39062897 39063256 - 349 2AA6 100% GIVMATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISEFLKAE 321773 168 222 + Chr:23 39056644 39056808 - 320 2AA6 100% 315681 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENIIQAGSLVGQ 315520 221 279 + Chr:23 39053573 39053749 - 389 2AA6 100% 312619 VFNLVPIIKHVPGPHQKIYQNGQAFKSFIRESVKEHRQTLDPDSPRDFIDAYLLEMEK 312446 266 326 + Chr:23 39053221 39053439 - 322 2AA6 100% QKSTQDSTFHEDNMVMAVGDLFLAGSDTTATTIRWGLIYLTQNPDVQ 312094 325 431 + Chr:23 39051558 39051914 - 321 2AA6 1 diff 310781 ERCHEEIVRVLGFDRLPSMDDRDRLPYTHATVHEFQRCANL 389 459 + Chr:23 39051431 39051646 - 342 2AA6 100% 310519 GTQILINLTEILTNKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 310379 433 494 + Chr:23 39051255 39051440 - 390 2AA6 100% 310304 GPRACLGETLAKAELFLFVTSLLQRIRFSWPTGEKLPDMNGIFGIVRSPKPFNIICHSR 310128 CYP2AA7 Danio rerio (zebrafish) NA16005 Exons 1-7 83% to 2AA1 96% (6 diffs) to AI964243 EST269357 zfishG-a606c02.p1c AI964243 probably = AI964242 and BQ605503 17072 MFAALLKLDLASVGLTLFLGLIFLVLFEIFRINSYKGRFPPGPTPLPFVGNLPHSLKNPMEFIRS 17266 17365 MPQYGEMTTMYLGRRPAIVLNTIQLAKEAFVQDAFSGRPFLPVMDWVANGL 17517 17817 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKSVEDRVLEESRYLIAEILKGE 17969 19246 GKPMSPHHPIQNAVSNIICSIVFGDRFDYDNKRFEYLLELLNENFVLTGSAAGQ 19407 19622 IFNLAPFIKHFPGPHQKIKQNANELLAFIQGEVKEHKKTLDPDSPRDFIDAYLLEIEK 19795 QKSNKDSTFHEGNLAISTADLFLAGTDTTSTTIRWGLLFLTQNPDVQ 20038 20933 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRCANLVPFGVIHETIQPTKLRGYDIPQ 21121 GTVVMTNLAAILSDKEHWKHPDTFNPENFLDENGHFSKPESFIPFSL GPRFCLGETLAKMELFLFITSLLQRIRFSSPPDAKPIDMDGIMGIVRYPQPFSIICCSRDTKE* 1 66 + Chr:23 39045108 39045305 - 304 2AA7 100% 304178 MFAALLKLDLASVGLTLFLGLIFLVLFEIFRINSYKGRFPPGPTPLPFVGNLPHSLKNPMEFIRS 303984 62 117 + Chr:23 39044857 39045024 - 373 2AA7 100% 303885 MPQYGEMTTMYLGRRPAIVLNTIQLAKEAFVQDAFSGRPFLPVMDWVANGL 303733 117 168 + Chr:23 
39044405 39044560 - 398 2AA7 100% 303433 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKSVEDRVLEESRYLIAEILKGE 303281 168 232 + Chr:23 39042937 39043131 - 374 2AA7 100% 302004 GKPMSPHHPIQNAVSNIICSIVFGDRFDYDNKRFEYLLELLNENFVLTGSAAGQ 301843 197 289 + Chr:23 39042555 39042800 - 468 2AA7 100% IFNLAPFIKHFPGPHQKIKQNANELLAFIQGEVKEHKKTLDPDSPRDFIDAYLLEIEK 301455 277 326 + Chr:23 39042339 39042488 - 316 2AA7 100% 301352 QKSNKDSTFHEGNLAISTADLFLAGTDTTSTTIRWGLLFLTQNPDVQ 301212 327 388 + Chr:23 39041256 39041444 - 461 2AA7 100% 300317 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRCANLVPFGVIHETIQPTKLRGYDIPQ 300129 388 440 + Chr:23 39039665 39039826 - 384 2AA6 like 2 diffs 298696 GTVVMTNLAAILSDKEHWKHPDTFNPENFLDKNGQFSKPESFIPFSL 298556 last exon is missing in genome assembly, use EST seq CYP2AA8 Danio rerio (zebrafish) NA3313 78% to 2AA7 zfishC-a402h10.p1c zfishC-a440h04.p1c Chr 23 (probably an assembly error, since this gene breaks 2AA1 in half) 540 MFSALLKLDLAFAGMTLILSLIFMFLLEIFRIHSFKSRFPPGPSPLPFVGNLPVFLKNPMEFIRS 734 811 LSQYGEMTTIYLGRKPTIMLNTVQLAKEVLIQDAFAGKPSLPVLDWVSNGL 963 1198 GIVMVTFNHSWRQQRRFALHTLRNFGLGRKSVESRVLEESQYLIAELLKKK 1350 1544 GKSVNPHHALQNAFSNVICSIVFGDRFDYDDKRFEHFLEILGKSMILTGSTAGQ 1705 3903 IFNFAPIIKHFPGPHQKIKKNADELSGFFQHEVKEHKKTLDPGSPRDYIDAYLLEMEK 4076 QKSNKDSTFHDENLIGSTTDLFVAGSDSTATTFRWGLLFLIQNPDVQ 4304 4703 ERCHKEIVQVLGYDRLPSMEDRDRLPYTLATVHEIQRCANLAPFGLIHETIQPTKLQGYDLPR 4891 GTTIIVNLTAIFSNKENWKHPDTFNPENFLDESGQFSKHESFIPFSL 5173 8867 GVRVCLGETLARTELFLFITALLQRIRFSLPPDAKPMDMDGILSVLRYPQNFSFICCSRDTKE 9055 CYP2AA9v1 Danio rerio (zebrafish) GenEMBL AY825258, AL922288 ESTs AI544967.1, CK708594.1 EST BI887677 matches 2AA2 with 1 diff and 2AA9 with 2 diffs Tseng, H.-P., Wang-Buhler, J.-L., Hu, C.-H., Hseu, T.-H., Peng, J.R. and Buhler, D.R. Submitted to nomenclature committee Oct. 
14, 2004
JR11
94% to 2AA5
MFTALLKVDLASVGLTLFLGLIFLVVFEIFRIYSYKCRFPPGPT
PLPFVGNLPHLLKKPMEFIRSLSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAG
RPHLPIIEWITKGLGIVMVTFNNSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLI
AEMLKDEGRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSA
AGQIFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLE
IEKQKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQERCHEEIV
QVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETAQPTKLRGYNIPQGTI
IMTNYTAIFSNKEHWKHPDTFNPENFLDENGHFSKPKCFIAFGVGPRICLGDTLAKTA
LFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFCIICCSRDTKE
CYP2AA9v2 Danio rerio (zebrafish)
Chr 23 98% (7 diffs) to 2AA9v1 possible haplotype seq
Note: 2AA2 has only 3 aa diffs with 2AA9v2 from aa 122 to aa 499. Only 1 diff in exons 4-9. There are 4 aa diffs to 2AA9v1 in the same region. However, ESTs EB965911.1 and CF416995.1 match the 2AA2 seq over the first 200 aa (EB965911.1 100% and CF416995.1 3 aa diffs), so 2AA1 is supported as distinct from 2AA9.
2AA9v2 is 100% to 2AA5 in exons 1-7 but differs in exons 8,9. No ESTs match CYP2AA5 exons 8,9. Genomic seq for 2AA5 exon 9 is found with 2 aa diffs at Chr:23 39003564-39003746, 55kb away. This was probably an error in an earlier assembly of contig ctg14330. In this contig exon 8 has 4 aa diffs from 2AA9 exon 8 in a close region, possibly seq errors. I think 2AA5 may not exist and that 2AA9 is the correct version of this gene.
231378 MFTALLKLDLAFVGLTLFLGLIFLVVFEISRIYSYKCRFPPGPTPLPFVGNLPHLLKKPMEFIRS 231184 231092 LSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAGRPHLPIIEWITKGL 230940 230666 GIVMVTFNHSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE 230514 230308 GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ 230147 228436 IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK 228263 228179 QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ 228039 224651 ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ 224463 224331 GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGV 224191 GPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFCIICCSRDxx 217157 1 66 + Chr:23 38972308 38972505 - 287 2AA5 100% 231378 MFTALLKLDLAFVGLTLFLGLIFLVVFEISRIYSYKCRFPPGPTPLPFVGNLPHLLKKPMEFIRS 231184 62 117 + Chr:23 38972064 38972231 - 312 2AA5 100% 231092 LSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAGRPHLPIIEWITKGL 230940 116 168 + Chr:23 38971638 38971796 - 379 2AA5 100% 230666 GIVMVTFNHSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE 230514 168 222 + Chr:23 38971271 38971435 - 387 2AA5 100% 2AA2 100% 230308 GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ 230147 222 295 + Chr:23 38969345 38969563 - 386 2AA5 100% 2AA2 100% 228436 IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK 228263 274 326 + Chr:23 38969166 38969324 - 304 2AA5 100% 2AA2 100% 228179 QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ 228039 327 388 + Chr:23 38965590 38965778 - 434 2AA5 100% 2AA2 100% 224651 ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ 224463 388 440 + Chr:23 38965300 38965461 - 362 2AA2 100% 224331 GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGV 224191 412 495 + Chr:23 38958284 38958565 - 404 NA54442 100%, 1 AA DIFF WITH 2AA2 217336 GPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFCIICCSRD 217157 Chr:23 39003564 39003746 - 2AA5 exon 9 2 aa diffs 262619 PRSCLGETLAKTELFLFITSLLQRIRFSWLPDAKPLDMDGIMGIVRYPQPFSIICCSRDTK 262437 CYP2AA10 Danio rerio (zebrafish) Chr 23 (see below) 85% to CYP2AA1 183623 
MLAALLKLDLASVGLTLFLGLIFLVLFEIFRIYSYKGRFPPGPTPLPFVGNLPHLLKNPMGFKRS 179490 LSEYGGLATVFIGRKPAISINTIQLAKEALVQDVFSGRPALPIFDWISHGL 179338 177849 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKTVEDRVLEESQYLIAEMLKDE 177697 177562 GKSMNPQHALQNAVSNIICSIVFGDRFEYDNKRFEYLLKILNENIMLTGSAAGQ 177401 176529 IYNLVPFIKHFPGPHQKIKQNADDLFNFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 176353 176306 QKFNKDSTFHEEHLVVSTSDLFLAGTDTTETTIRWGLIYLIQNPDVQ 176130 174181 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRYGNIAPKLLHETIRRTKLHGYDIPQ 173996 GTTIIANFTAMFSDKELWKHPDAFNPENFLDENGQFSKPEYFFPFSL GPRACLGETLARTELFLFITSLLQRIRFSWPPNAKPIDMDGIVGIVRSPEPFNIICHSRDxx* 170736 1 110 + Chr:23 38924403 38924750 - 318 new 5 diffs to 2AA7 183623 MLAALLKLDLASVGLTLFLGLIFLVLFEIFRIYSYKGRFPPGPTPLPFVGNLPHLLKNPMGFKRS 62 117 + Chr:23 38920462 38920629 - 327 2AA.g 100% 179490 LSEYGGLATVFIGRKPAISINTIQLAKEALVQDVFSGRPALPIFDWISHGL 179338 116 206 + Chr:23 38918752 38918979 - 417 2AA.g 100% 177849 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKTVEDRVLEESQYLIAEMLKDE 177697 168 229 + Chr:23 38918486 38918689 - 438 2AA1 like 2 diffs 177562 GKSMNPQHALQNAVSNIICSIVFGDRFEYDNKRFEYLLKILNENIMLTGSAAGQ 177401 178 302 + Chr:23 38917411 38917794 - 507 2AA.e 100% 176529 IYNLVPFIKHFPGPHQKIKQNADDLFNFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 176353 269 326 + Chr:23 38917257 38917439 - 363 2AA.e 100% 176306 QKFNKDSTFHEEHLVVSTSDLFLAGTDTTETTIRWGLIYLIQNPDVQ 176130 327 388 + Chr:23 38915123 38915308 - 451 2AA.e 4 diffs 174181 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRYGNIAPKLLHETIRRTKLHGYDIPQ 173996 367 445 + Chr:23 38913823 38914083 - 351 2AA.f 100% GTTIIANFTAMFSDKELWKHPDAFNPENFLDENGQFSKPEYFFPFSL 430 495 + Chr:23 38911863 38912060 - 475 new 85% to 2AA1 GPRACLGETLARTELFLFITSLLQRIRFSWPPNAKPIDMDGIVGIVRSPEPFNIICHSRD 170736 CYP2AA10-de8b9b Danio rerio (zebrafish) Chr 23 (see below) 87% to 2AA3v1 162569 GTAIVTNFEAIFSSKDHWKHPDAFNPKNFLEDEHFSKLESFIAFSL 162346 xxRSCLGEMLVRTELFLFITSLLQRIHFSWPPDAKPIDMDGIMGLVHSPQTFNVICRSRD 162173 389 440 + Chr:23 38903541 38903696 - 321 new 80% to 2AA3 162569 
GTAIVTNFEAIFSSKDHWKHPDAFNPKNFLEDEHFSKLESFIAFSL 438 495 + Chr:23 38903300 38903473 - 439 new 89% to 2AA3 162346 xxRSCLGEMLVRTELFLFITSLLQRIHFSWPPDAKPIDMDGIMGLVHSPQTFNVICRSRD 162173 CYP2AA11 Danio rerio (zebrafish) Chr 23 (see below) 86% to CYP2AA6 293503 MLVGLLKLDLSSVGLSLFLGLFCLALFEICRIRIYKGRYPPGPTPLPFVGTIPHFLKNPMGFIRS 293309 293192 LSQYGEMTTVYLGRKPAIILNALQLMKEAFVQNGSSFSGRPPVPVLTWVNQGY 293034 289501 GIIMAMDGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISELLRVE 289349 288521 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENLDLAGSFAGQ 288360 284034 MVNLVPIIKNLPGPHQKIYQNGEEFKSFIRESVKAHRETLDPDSPRDFIDAYLLEMEK 283861 283191 QKSTQDSTFHEDNMVMSVGDLFFAGTDTTATTIRWGLLYLTQNPDVQ 283051 ERCHDEIVQVLGFDCFPSMDDRDQLPYTLATVHEIQRCANVAPSGVPHQTTKPIKLRGYDIPQ 282518 282449 GTQILINLMGILANKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 282309 GPRACLGETLAKTELFLFVTSLLQRIRFSWPTGEKWPDMNRILSVIRSPEPFNIICYSRDS 282053 1 66 + Chr:23 39034433 39034630 - 282 2AA.d 100% 293503 MLVGLLKLDLSSVGLSLFLGLFCLALFEICRIRIYKGRYPPGPTPLPFVGTIPHFLKNPMGFIRS 293309 62 138 + Chr:23 39034071 39034331 - 285 NEW 86% to 2AA6 293192 LSQYGEMTTVYLGRKPAIILNALQLMKEAFVQNGSSFSGRPPVPVLTWVNQGY 293034 117 178 + Chr:23 39030437 39030628 - 317 NA1642 100% 289501 GIIMAMDGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISELLRVE 289349 168 224 + Chr:23 39029478 39029648 - 333 NA1642 100% 5 diffs to 2AA6 288521 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENLDLAGSFAGQ 288360 221 279 + Chr:23 39024988 39025164 - 363 new 89% to 2AA6 284034 MVNLVPIIKNLPGPHQKIYQNGEEFKSFIRESVKAHRETLDPDSPRDFIDAYLLEMEK 283861 273 326 + Chr:23 39024178 39024339 - 321 CYP2AA6-ie6 100% 283191 QKSTQDSTFHEDNMVMSVGDLFFAGTDTTATTIRWGLLYLTQNPDVQ 283051 312 396 + Chr:23 39023630 39023878 - 423 new 79% to 2AA6 ERCHDEIVQVLGFDCFPSMDDRDQLPYTLATVHEIQRCANVAPSGVPHQTTKPIKLRGYDIPQ 282518 388 465 + Chr:23 39023343 39023579 - 348 2AA6 3 diffs 282449 GTQILINLMGILANKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 282309 429 496 + Chr:23 39023180 39023374 - 378 new 83% to 2AA.a GPRACLGETLAKTELFLFVTSLLQRIRFSWPTGEKWPDMNRILSVIRSPEPFNIICYSRDS 282053 
CYP2AA12 Danio rerio (zebrafish) Chr 23 (see below) 83% to 2AA6 358763 MFASLLKLDLASVGLTLFLGLIFLVVFEIFRIRSYSGRFPPGPTPLPFVGTIPHFLKDSMGFIRS 358569 84% to 2AA7 358463 LSQYGEMTTVYLGRKPAMVLNTLQVIKEAIVQNGTSSSGRPSIPILTWITEGY 358305 GIVLATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLVPEMLKLE 353402 353113 GKPFDPQHAIQNAVSNIICSIVFGDRFEYDNKRFEYLLEIIKENINQAGSLIGQ 352952 352042 VFNLIPIIKHFPGPHQKIYQNAEELKSFIRESTKSHRETLDPDSPRDFIDAYLLEMEK 351869 351605 QKSSQDSSFHEDNMVMSVADLFLAGSDTTATTIRWGLIYLTQNPDVQ 351465 335727 ERCHEEIVRVLGYDRLPCMDDRDRLPYTLATVHELQRCGNIVPSSVPHETTQPMKLRGYDIPQ 335539 335464 GTQMLINLSDILANKEHWKHPDTFNPENFLDDKGHFYRPEAFLPFSL 335324 GPRVCLGETLAKTELFLFITSLLQRIRFSWPTGEKWPNMDGIVSVVRSPEPFKIICHSR 335067 Chr:23 39099777 39099809 - 358763 MFASLLKLDLASVGLTLFLGLIFLVVFEIFRIRSYSGRFPPGPTPLPFVGTIPHFLKDSMGFIRS 358569 84% to 2AA7 62 117 + Chr:23 39099429 39099602 - 274 2AA.d 100% 358463 LSQYGEMTTVYLGRKPAMVLNTLQVIKEAIVQNGTSSSGRPSIPILTWITEGY 358305 100 174 + Chr:23 39094517 39094729 - 346 2AA.d 1 diff GIVLATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLVPEMLKLE 353402 167 233 + Chr:23 39094031 39094243 - 355 2AA.d 100% 353113 GKPFDPQHAIQNAVSNIICSIVFGDRFEYDNKRFEYLLEIIKENINQAGSLIGQ 352952 221 293 + Chr:23 39092960 39093172 - 421 2AA.d 100% 352042 VFNLIPIIKHFPGPHQKIYQNAEELKSFIRESTKSHRETLDPDSPRDFIDAYLLEMEK 351869 270 326 + Chr:23 39092592 39092762 - 325 2AA.d 100% 351605 QKSSQDSSFHEDNMVMSVADLFLAGSDTTATTIRWGLIYLTQNPDVQ 351465 324 388 + Chr:23 39076666 39076866 - 431 2AA.a 100% 335727 ERCHEEIVRVLGYDRLPCMDDRDRLPYTLATVHELQRCGNIVPSSVPHETTQPMKLRGYDIPQ 335539 389 486 + Chr:23 39076256 39076591 - 327 2AA.a 100% 335464 GTQMLINLSDILANKEHWKHPDTFNPENFLDDKGHFYRPEAFLPFSL 335324 429 494 + Chr:23 39076194 39076382 - 390 2AA.a 100% GPRVCLGETLAKTELFLFITSLLQRIRFSWPTGEKWPNMDGIVSVVRSPEPFKIICHSR 335067 Chr:23 38974638 38974682 - 233555 QTFSIICCSRNTKE* 233511 (pseudogene piece after the gene) 2AB Subfamily CYP2AB1P human GenEMBL NT_022676.10|Hs3_22832 also AC068644.15 chr3q27.1 185030751-185015757 - strand build 33 old name = 
2D31P NT_005962.297 (genescan predicted protein has errors) 75% to 2ab1 mouse which is a functional gene MLSLLSGLALLAISFLLLKLGTFCWDRSCLPPGPLPFPILGNLWQLCFQLHPETLLQ LAQSVFTVWVGPIPVAVLSGFQVVKEALVSNSEQFSGRSLTPLFQDLFGER GIICSSGHTWRQKRRFCLVMI*GLGL GKLALEVQLQKEAAELAEAFRQEQGRPFDPQVSIVRST VRVIGALVFGHHFLLEDPIFQELTQAIDFGLAFVSTVWRRLYDVFPWALC HLPGPHQEIFRYQEVVLSLIHQEITRHKLRAPEAPRDFISCYLAQISK AMDDPVSTFNQENLV*VVIDLFLGGTDTTATTLCWALIHMIQHGAVQG TVQLELDEVLGAAPVVCYEDRKRLPYTX AVLHDVQRLSSVMAMGAVRQCVTSTRVCSYPVSK GTIILPNLASVLYDPECWETPRQFNPGHFSDKDGNFVANEAFLPFSAGHRVYPAD QLAQMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTWQPQPQEICAVPR CYP2AB1P Bos taurus (cow) See cattle page for details MCPLLIWLGLLAASFLLLKFSIIYWERNHLPPDPFPFPILGNPWQLSFQLHPATLLQ LAQTHGHVFTVWVGPTPVVVLCSFQA KEALVSHSEQLSGWPLTPLFQDLAGERG GVICSSGRTRRQ*RRFCLAALQGLG*GPLALELRLQEEAAGLVEAFHWEQ GGPFDPQAPIVRSTARVTGALVFGRHFLSEDPFFQELI*ATNFGLAFXXXXXX QLNDLFPWAFRCLPGPYREMFRYQKAVRGYIHREIMRHKLRTSEAPKDFISCYLAQIIK ATDDPVSTFNEENLIQVVVGLFLGGTDTTGTTLYWVLIYMIQYGAIQS ERVQQELVTVLGTSGAICYKDHEQLPHICTLLHEAQRLSSVA*V AVCQCVTSTHVHGHPVPK GTIILPNLAAVLCDPECWRTSRQFNPGHFLDKDGNFVVRDIFPPFSA GHQMCLGD*LAQMKLLLMFATLLGTFSFQLPGRSPGLRLEYNFGGTRKPLPQKIYAVSRLNCPHPGPREEVL* CYP2AB1P Canis familiaris (dog) AACN010195735.1 exons 8,9 75% to cyp2ab1 mouse KRELPPGSFPFPSENPWQLSFQLYPETL (N-term fragment) 1543 GTIILPNLASVLLDPECWETPQQFNPGLFLDMGGNFLVNEAFLPFSA 1683 GHQVGPGDHLALMELFLMFANPFRTFWFQLPEGSLG*DLQYIWGTL*PQPQKICAVP 1941 CYP2AB1 Monodelphis domestica (short-tailed opossum) XM_001374342 Added N-term and removed some C-term seq and internal seq from the prediction 61% to CYP2AB1P human MFSLATGLAILATSFLLLR MLAFFLARTQFPPGPCPLPILGNLLQLLSPGACYPTLLPLTRKY GSIFTVWLGSTPVVVLNGFQAVKDALVTHSEDFADRPVTPLFEDLFGDKGIISTSGHA WQQQRRFGLITLRALGMGKKVLEQRLQEEAQYLVEIFHRQNGTSFDPHVPIVRAAANV ICALVFGHRFPHGDPFFQELMKAIDFGLAFVNTIWRR (0) LYDAFPW LLRQLPGPHRKIFRYQEIVKSLICQEIERHKQRVPEDLEDFISCYLAQITKRKDDPAS TFDEENLIQVIIDLFLGGTETTATTLRWALLYMIHHRDVQGKVQQELDTVLGPSRVIS FKDRKLLPYTNAVLHEVQRFCSVISVGAVRKCGTATTVQGFPIQKGTIVLPNLASVLC 
DPEHWETPWQFNPGHFLDGEGNFVIHEAFLPFSAGHRVCLGELLAKVELFLVFAHLLR EFRLRAPAGASTNERDYILWGTKQPRPYDICASPRLGRFQGGPRKDRLEAAEMQREGG TDQ* Cyp2ab1 mouse GenEMBL NW_000107.1 39% to Cyp2j5 new subfamily in Cyp2 EST BY749683.1 B6-derived CD11 +ve dendritic cells, rat ortholog XM_221297.1 91% NW_000107.1|Mm16_WIFeb01_286 MFSLFSGMAFLAGSCLLLKLATLCWRRSHLPPGPFPFPLLGNLWQLNFQLHPNMLFQ LAQTHGSVFTVWLGSTPIVVLSGFRAVKEALVSNSEQFSGRPLTPFFRDLFGEKG VICSNGLTWRQQRRFCLTTLRELGLGKQALEVQLQHEAAELAKVFLQEEGRA FDPQIPIIRSTTRVIGTLVFGHHFLSEEPIFLELIQAINLGLAFASTIWRR LYDMFPWALRHLSGPHQKIFQYHEAVRGFIRHEIIRHKLRTAEAPKDFINCYLSQITK AIDDPVSTFSEENLIQVVIDLFLGGTDTTATTLHWALIYLVHHRAIQG RVQQELDEMLGAAQTICYEDRERLPYTRAVLHEVQRLSSVVAVGAVRQCVTSTWMHGYYVPK GTIILPNLASVLYDPECWESPHQFNPGHFLDKDGNFVANEAFLPFSA GHRVCPGEQLARMELFLMFATLLRTFQFQLPEGSQDLGLEYVFGGTLQPQPQKICAVLR CYP2AB1 rat XM_221297 N-terminal incorrect, AC107471.6 N-term 92% to mouse 189790 MFSLFGGMAFLAGSFLLLKLAALCWRRSHLPPGPFPFPLLGNLWQLNFRLHPNMLFQ (0) 189620 LAQTHGNVFTVWLGSTPIVVLNGFRAVKEALVSNSEQFSGRPLTPFFRD LFGEKGVICSNGLTWRQQRRFCLTTLRELGLGKQALELQLQHEAAELAEVFHQEQGRA FDPQVPIIRSTTRVIGALVFGHHFLSEEPIFLELIRAINLGLAFASTTWRRLYDMFPW ALRYLSGPHQKIFQYHEAVRGFIHHEIIRHKLRTPEAPKDFISCYLSQITKAMDDPVS TFSEENLIQVVIDLFLGGTDTTATTLHWAIIYLVHHRAIQERVQQELDEVLGTAQAVC YEDRERLPYTRAVLHEVQRLSSVVAVGAVRQCVTPTWMHGYYVSKGTIILPNLASVLC DPECWETPHQFNPGHFLDKDGDFVTNEAFLPFSAGHRVCPGEQLARMELFLMFATLLR TFRFQLPEGSQGLRLEYVFGGTLQPQPQKICAVPRLSSLSPREP CYP2AB1 Gallus gallus (chicken) chr9:15,039,303-15,044,379 (+) 49% to human 2AB1P, 51% to mouse, 54% to Xenopus This seq is named 2AB1 since it is the most like the single Xenopus sequence. 
18744 MLGIVELFVALVASLLILQFLKLQWMRSQLPPGPVPLPIIGNLWLLDFKLRRETLAK 18914 19663 LTNIYGNIYTVWMGQTPVVVLNGYKAVKDAIVTHSEETSGRPLTPFYRDMMGEK 19824 19958 GIFLTSGHTWKQQRRFGMTIIRSLGFGKNNLEHQIQTEASHLLHIFANTK 20107 21368 GRPFNPRTSIVHAIANIICAVVFGHRFSSEDESFSKLIKAVYFVIYFQATIWGR (0) 21529 21710 MYDAFPWLMHRFPGPHQKVFAYNNFMHNLVMNEIQMHEREKAGDPQDLIDFYLTQIAK 21884 22115 TKDDPTSTFNKDNMVQTVVDLLLGGTETTSTTLLWALLYMVQYPEIQ 22255 22786 ERVQREIEAVLEPSHVISYEDRKRLPYTNAVIHETLRYSNITSVGVPRLCVRNTTLLGFHIKK 22974 23285 GTLVLPNLHSVVYDSDHWATPCKFDPNHFLDVDGNFVNKEAFLPFSA 23425 23644 GHRVCLGEQMARVELFIFFTNLLRAFTFQLPEGVKEINPEYVLGAILQPHPYEICAVPR 23820 CYP2AB2 Gallus gallus (chicken) XM_422750 2 P450s fused together during annotation error chr9:15,031,052-15,037,949 (-) MGINVLSPPEKNSEFYHVLFLLGLQFLRLQWRSRRFPPGPIPFP IIGSIWWINFRADHGSLKKLAKAYGNICTLWLGHKPIVVLYGFKAVKDGLTTNSEDVS GRLQTYLFNRFSSGKGTAEFQWMEHRVLYLKQEWLNWFLPASYPSKHRGTRIGSLQTS PMGSSEKSIGLEQLSERDHRISWWEKPEHQRRFGIATLRKLGMGNKGMERGIQAEARH LVEFFRSKDGRAVDPSFPIVHAVSNVICAVVFGHRFSLQDETFRRLMEAYNGIVAFGN SYFYYTKNVPNSTYDEENMLQSVFDLFLGGSETTATTLRWALLYMVAYPDIQEKVQKE LDAVLGSSHQIDYEDRKKLPYTNAVIHEIIRFSSIILITIPRQAVKDTTVLGYQVPKG TIIMANIDSTLFDPEYWETPHQFNPGHFLDKDGNFVIREAFLAFSAGHRVCLGEVMAK MELFIIFCSLLQIFKFTPPEGDKEINLSFVFGSTMKPHPYKL CAVLR CYP2AB3 Gallus gallus (chicken) XM_422750 2 P450s fused together during annotation error chr9:15,022,695-15,027,829 (-) 46% to mouse 2ab1 7270 MLAVSAVLVCLAASLLLVQFLGMQWKRRQLPPGPAPFPLFGNLLQMKFQIHHDILXX 7106 MASMYGNIFTLWLTGTPVVVLHGY 6690 6689 QAVKEGMTAHAEEVAGRPLSRAFRLMTNGN 6618 6266 GVMFSNGHLWKQQRRFGLLTMRKMGVGKQNQECQIQEEAHHLVQYLRNTK 6117 5699 GKPLDPAVPVTHTVSNVICALILGHRFSIEDKRFLRLVEAVDDISAFANSVSFY 5538 4840 VHDQVPWIATHFLTRCKKALASIDTMRALLEEEIGSHKGKVDENQDFIGYYLDQMAK 4670 4111 SKEDAGATYDKANLLQTIFDLFLAGTETTATTLRWALLYMVAYPDVQ 3971 3128 KKVHKELDAVLGSSRLICYKDRKNLPYTNAVIHEIQRYSNIVLIALPRYTVKDTELLGFPIPK 2946 DTIVLVNID 2769 2768 SVLSDPEKWETPDQFNPGHFLDKDGNFVHREAFLPFSI 2655 2354 GHRACMGELLARLELFIIFCTLLQAFTFTLPDGVNEVSTKFVFSS 2178 2177 TKKPPPHQICAIPR 2136 CYP2AB4 Gallus 
gallus (chicken) XM_426708 seq was added to mRNA translation to correct it chr9:15,009,527-15,018,429 (-) MNPVKAAAMLSINQVMIALVVFLLVMQFLKLQRARRCLPPGPIP LPVLGTLLQLNFQINRDVLMKLAKTYGNVFTLWFGWAPVIILNGFQAVKDGMTTHPED VSGRLVSPFFRAMAKGKGIMLATGHMWKQQRRFALKTLRNLGLGKRGLEQRVQEEALH LLEFFASLKEKPLDPYYPLIHSVSNVICAVVYGHRFSRGDETFHELIRATEHIFKFGG SLLHHLYEIFPWLMCRLPGPHKKALSCYDILSSFTRREIREHKEREIPDEPRDFIDFY LAHIEKSGDEPKSTYNEENMVYSINDLFLGGSETTSTTLNWGLLYMVAYPDVQEKVQK ELDAVLGPSQMICYEHRRKVPYTNAVIHEIQRFSNIISIGMPRVCVRNTTLLGFPLKK GSIVLPNIASSLYDP EHWETPRQFNPAHFLDKDGNFVSQEAFLPFSIGHRVCLGEHLARTELFIFFANLLRAF TFQLPEGVTTINTEPIFGGTLQPHPYKVCAIPR CYP2AB1 Xenopus laevis (African clawed frog) GenEMBL BC074149.1 46% to 2AB1P hum, 49% to mouse, 54% to chicken MSFTQETWSLQQILLAFLVCVIAVKYIKMRWAA RSLPPGPTPLPLIGNLWALRFKLHPKTLRKIAVSYGDIYTLWLGHTPLVVLSGCRSVRNG LISHSEELSGRPVDGLMQALTNERGIGSTNGHTWKQQRRFGLMTLRNLGLGKRGLESRIQ EEAQCLVESLAAKNGEPVNPSDLIVHAVANVISAVVFGHRFSIEDPTFQEMVRCNGCIVT NLGTAWGRIYDAFPWLMRFV PGPHQSSFAAMAYLTAFIKKEIKLHELNGPNEQPQDLIEY YLAQIAKTKHEPDNTFDEANMIQTVI DLFIAGTETTATSLQWALLYMVAFPEIQKKVQEE LDTVLDGSQLAYYEDKKRLPFTNAVIHEVQRYGNIASVGMLRSCIRKVTVNGYQLEKNTM VLPNLDSVLHDQHQWETPYKFNPNHFLDKNGNFCTSEAFLPFSAGHRVCLGEQLARFELL IFFTTLLRRFNIELPEGITEVNTKYVFKMTLQPHPYEICAVPR* CYP2AB1 Xenopus tropicalis (frog) GenEMBL CX984262.1 CX984263.2 ESTs scaffold_535:154,346-161,099 131 MSFTQDTWSFQQILLALLVCVITIKYIKMKWAAKNLPPGPTPLPLLGNLWALRFKLHP 304 305 KTLRKMAKSYGDIYTLWLGHTPLVVLSGCKSVRNGLITHSEELSGRPVDGFMTALTNERG 484 485 IGTTNGHTWKQQRRFGLMTLRNLGLGKRGLESRIQEEAQCLVESLAAKNGEPINPSDLIV 664 665 LAVANVISAVVFGHRFSIEDPTFQEMVKCNSSLVSGLGTAWGRMYDAFPWLMRYV 829 PGPHQKSFAAIDYLAAFIKKEIKLHEINSSKDDPQDMIDYYLTQIEK (0) TKHELDTTFDEENMIQVVI 893 DLFIAGTETTAISLSGALLYMVAFPEIQKKVQKELDTVLDGSPLAYYEDRKKLPFTNAVI 714 713 HEVQRYGNIASVGIPRSCIRKVTVNGYQLNKNTIVLPNLDSVLHDQRQWETPYKFNPNHF 534 533 LDKNGDFCTNEAFLPFSAGHRVCLGEQLARFELFIFFTTILRRFSIELPKGVTEVNTDYV 354 353 FKMTLQPHPYEICAIPR 303 2AC Subfamily CYP2AC1P human AC022650 6p12.3 41% to 2C9 pseudogene 2 in frame stops 68% to rat 
CYP2AC1 (XM_236969.1) functional gene old name CYP2C57P GIAFSHGETWKTMRRFSLTTLRNFGMGEWIIEDTIIEECQNLIQ NMFLVLGFLLKSHKTILRNRDELFSFIRMAFLDHHHKLDKNDPRNFTDVFLVTQQE ENDTFADYFSDKKLVTLVNNLFTTGTETTASTLHWGILLVMRYPEVQS KVHNEITKVVVSAQS*LAHRTQMTHTDAVI*EVQRFANILPTSLSHATTTNIFKNYCIPK GTEVIILLASVARDQAQWEKPDTFNPEHFLNSKEKFIKREAFLPF CYP2AC1P Bos taurus (cow) See cattle page for details 67% to rat 2AC1 MSGFESSFILPILSLILIFILNIKIVMTKASKQHFPPVPRPLPIIGNLHILNLKRPYQTMLE (0) LSQKYGSIYSIQIGPRKVAVLxGYETVKDVLVNHTDQFGEWFHVPISERLFEGK GIFFSHSDTSKIIRFTLTTSQNFGMGKKALEDTIIGESQHLIRNFETDKG GKPFEVKTLTNASVANINVSVLLGKGFDYQNTPFLRLLTLIDQSVKLIVSPPTA LFNMFPVLRFLLKTYKNILRNKDELFSFIRMTFLHHHHKLDKNDPRSLTDAFLVRQQE DTSTDYFNDDTLVVLVNNLFAAGTESMVSTLCWGILFMSRYPEIQS KVHDEIAKVMGSTQP*MAH*TQMPYTDAVILEVQRFADILPTGLPRATTTNTIFKNNYIPK GTEVIFLLTSVL*DQTQWENPATFNPEHFLDSIEKFIKKEAFISFSV (1) SPL*CAGESLAKMELLLFFMSLLQKFTFQPPPGVSHLDLDPTRDTGVVIQPMPHKIRALPRA CYP2AC1 Canis familiaris (dog) XM_847513.1 MSGFDSSIILPILSLLLIFLLNIKIFMTKASKQHFPPGPR PLPIIGNLHILNlkrpyqtmleLSQKYGSIYSIQMGPKKVVVLSGYETVKDALVNYGD QFGERSQVPIFERLFEGKGIVFSHGETWKTMRRFSLATLRNFGMGKRIIEDTIIEECQ HLIWSFESHR GKPFEVKTVMNASVANVIVSVLLGKRFDYQDTQFLRLLTLIGENVKLI GGPRIA LFNMFPVLGFLLKSHKTVLRNRDELFAFIRMTFLDHQHKFDKNDPRSFIDAF LVRQQE EKDTSTTYFSDENLVALVSNLFAAGTETTATTLCWALLLMMRYPEVQKKVCD EITKVVGSAQPRITHRTQMPYTDAVIHEVQRFANILPTGLPHATTTNVMFKNYYIPKG TEVITLLTSVLRDQTQWEKPDTFNPNHFLSSTGKFIKKEAFMPFSLGRRMCAGESLAK MELFLFFTSLMQKFTFQPPPGVSHLDLDLTPDIGFTTRPMPHKICALLRA* CYP2AC1 Monodelphis domestica (short-tailed opossum) XM_001369570.1 MSNGGHSLVPQMSIEFWEQRPTQGANIYHGHYPPGPKPLPVIGN LHILNLKRPYQTMLELSKKYGPIFSLRMGPKTVVVLSGYETVKDALVNYSEQFGERAR IPIFERIFEGKGIVFSHGENWKITRRFSLTTLRNFGMGKRVIEERILEECHHLIQVFE SHQGKPFEISTIMSASVANIIVSILFGKRFDYKDPQFLRLLHLIGENIRLAGGPSITI FNMFPVLGFLLQDLKRVLRNRDELFSFIRTTFLKHLRKLDKNDQRSFIDAFLIKQQEK DKSDDYFNNDNLVALVSNLFAAGTETTSSTLRWGILLMMKYPEIQKKVHNEITEVIGS AQPRIEHRTQMPYTDAVIHEIQRFSNILPMNLSRETTTDVIFKNYYIPKGTEVITLLT SVLQDQTQWEKPCTFHPQHFLTKEGKFIKRDAFLPFSAGQRMCAGESLAKMELFLFFT 
SLLQKFTFCPSPGVSNSDLDLTPDIGFTTRPQPYKICALPYF Cyp2ac1-ps mouse GenEMBL NW_000130.1|Mm17_WIFeb01_308 MISSING EXON 2 probably in a seq gap Rat ortholog is 80% identical MSGFDFSAMLALLGLSLILILHINVFMAKASKHQSPPGRKSWPVIGNLHIXXXXXXXXXXXX GIAYAHGKCWKTMRRFSLTTLRNFLMGKRIIEDTIVTECQHLIQCFESHK GLVLGM*RLLKASIANVIVSVLLGKWFDYQDSQFLRLLTLIGENMKLIGNPSIV LLNMFPILGFLLRSKKKVLRNRVELFSFIRMAFLEHCHNRNKSDPRSLIDAFLVRQQG ENNTSANHFNEENLLALVSNLFTARTKTTASTLHWGIILMMLYPEVQS 556747 KVRGEIIKVVGSAQPRIEHRIQMPYTDTVIHEIE (fs) RVANILPTSLFHETTTDVAFKNYYIPK GTEIITLLTSVLQDQTQWEASDAFDPAHFLSPKGTFVKKESFVPFSW 561380 GCHMCAGEPLAKMELFLFFTSLMQKFIFQSPxx (fs) VSHLDLDLTPDIGFIMQSQPHKICALVRASAL CYP2AC1 Rattus norvegicus (rat) GenEMBL NW_044163.1|Rn9_1523 genomic ortholog to 2ac1 chromosome 9 3425457 MSGFDFSAILALLGLILILILNIKDFMAKASKRQCPPGPKPWPVIGNLHILNLKRPYQTMLE 3425272 3423187 LSKKYGPIYSIQMGPRKVVVLSGYETVKDALVNYGNQFGERSQVPIFERLFDGK 3423026 3415443 GIAFAHGETWKTMRRFSLSTLRDFGMGKRTIEDTIVVECQHLIQSFESHK 3415294 3412018 GKPFEIKRVLNASVANVIVSMLLGKRFDYEDPQFLRLLTLIGENIKLIGNPSIV 3411857 3410639 LFNIFPILGFLLRSHKKVLRNRDELFSFIRRTFLEHCHNLDKNDPRSFIDAFLVKQQ 3410469 3410029 ENNKSADYFNEENLLALVSNLFTAGTETTAATLRWGIILMMRYPEVQS 3409886 3408812 KVHDEIHKVVGSAQPRIEHRTQMPYTDAVIHEIQRVANILPTSLPHETSTDVVFKNYYIPK 3408627 3406238 GTEVITLLTSVLRDQTQWETPDAFNPAHFLSSKGRFVKKEAFMPFSV 3406098 3402907 GRRMCAGEPLAKMELFLFFTSLMQKFTFQPPPGVSYLDLDLTPDIGFTIQPLPHKICALLRTSAL* 3402710 CYP2AC1 Xenopus laevis (African clawed frog) GenEMBL CB558367.1 NICHD_XGC_Kid1 Xenopus laevis cDNA clone CB559919.1 NICHD_XGC_Kid1 Xenopus laevis cDNA clone BJ030802.1 NIBB Mochii normalized Xenopus neurula cDNA clone 61% identical to rat 2ac1 from PPGP to end MFLGDPVTVLLTVVLCLILANLLYGRKRNNFKNF PPGPKPLPVIGNINIINLKRPYLTYLELWKKYGPVFSIQIGGQKMVVLCGYETVKDALVNYAEEFSERPK IPIFRDISKEYGVLFSHGENWKVMRRFTLSTLRDFGMGRSSIEDRINEECDFLVEK FKSYKGKPFENTMIINAAVANIIVSIILGHRFDYQDPIFLRLMSLINENIRLSGSPTVML YNVFPSVMRWLPGSHKTIAKNAAENQR FIRKTFTKHRDKLDVNDQRTLVDAFLVKQQEKNVNVQYFHDENLTMIVSNLFAAGMETT 
SSTIRWGLLLMMKYPEIQKNVQNEIEKVIGQSQPQTEHRKSMPYTDAVLHEIQRFGNIVP MNLPHATAQDVTFRGYFLPKGTFVIPLLTSVLYDQTHFEKPHEFYPQHFLDSEGNFVKNE AFLPFSAGKRSCAGENLAKTELFLFFTSLLQNFTFQASSGRRT* CYP2AC1 Phalacrocorax carbo (Common cormorant) No accession number Hisato Iwata submitted to nomenclature committee 5/19/05 61% to CYP2AC1 rat 76% to 2AC1 chicken 70% to 2AC2 chicken CYP2AC1 Gallus gallus (chicken) NW_060338.1|Gga3_WGA147_1 chr 3 XM_420052.1, BG641890.1 EST BU120706.1 3967773 MDWASVVPVGLLMILILLLILKTQDFWRSQGKFPPGPQPLPIIGNLHIMDLKKIGQTMLQ (0) 3967952 3968877 LSETYGPVFTVQMGMRKVVVLSGYDTVKEALVNHADAFVGRPKIPIVEKAGKGK 3969038 3969203 GVVFSSGENWKVMRRFTLTTLRDFGMGKKAIEDYVVEEYGYLADVIESQK 3969352 3970285 GKPLEMTHLMNSAVANVIVSILLGKRFEYEDPTFKRLVSLINENMRLFGSPSVS 3970446 3971108 LYNMFPILGPFLKDNKSFLENVKEVNDFIKVTFTKYLQVLDK 3971233 3971234 NDQRSFIDAFLVKQQE 3971281 3971703 QNEKANKFFDDENLTEVVRNLFTAGMDTTATTLRWGLLLMMKYPEIQ 3971843 3971973 KKVQEEIDRVIGSNPPRTE 3972029 HRTKMPY 3972269 TDAVIHEIQRFANILPLNLPHETTMDVTIKGYFIPK 3972376 3972609 GTYIIPLLNSVLQDKTQWEKPCSFHPEHFLNSEGKFVKKDAFIPFSA 3972749 3973027 GRRICAGETLAKMELFLFFTSLLQRFTFQPPPGISSSDLDLSAPPRFVIAPVTHEVCAVSRS 3973212 CYP2AC2 Gallus gallus (chicken) NW_060338.1|Gga3_WGA147_1, chr 3 BG710846.1 EST, XM_420053.1 3974997 MALVFILTFLFIMKIGGLWSNHWRKNFPPGPRALPIIGNLHLFDLKRPYRTYLQ 3975158 3976589 LSKEYGPVFSVQMGQRKIVVISGYETVKEALINQADAFAERPKIPIFEDLTRGN 3976750 3977081 GIVFAHGENWKVMRRFTLTTLRDFGMGKRAIEDRIVEEYGYLIDNVGSQE 3977230 3977626 GKPFDASKIINAAVANIIVSILLGKRFDYKDSRFIRLQHLTNESMRLAGKPLVT 3977787 3978987 MYNIFPYLGFLLRANKTLLKNRDEFHAYVKATFLENLKTLDKNDQRSFIDAFLVKQQE 3979160 3979765 EKSITNGYFHNGNLLSLVSNLFTAGVETISTTLNWSFLLMLKYPEIQSKVQ 3979917 3980773 EEIEQVIGSNPPRIEHRTQMPYTDAVIHEVQRFANILPLDLPHETAEDVTLKDYFIPK 3980946 3981123 GTYIIPLLTSVLRDKSQWEKPDMFYPEHFLDSKGKFVKKDAFMPFSA 3981263 3982308 GRRICAGETLAKMELFLFFTSLLQRFTFQPPPGVSSSDLDLSPAISFNVVPKPYKICAVARS 3982493 2AD Subfamily CYP2AD1 Fugu rubripes (pufferfish) No accession number Scaffold_805, Old Scaffold_3261d Formerly CYP2N12 92399 
MILQKIFAYMDFSSWVLLIFLVLLITDVIRNWTPHNFPPGPWAMPFVGNIFTGVDFRTIEK (0) 92217 92102 LSQKYGPVFSLRRGNTRTVFINGYKMVKEALVSQLDSFEDRPVVPLFHVVFKGI (1) 91941 91785 GIALSNGYMWKKQRKFAHTHLRYFGEGQKLLENHIQMESKFMCEAFKDEQ 91633 (1 gc boundary ?) 91552 GKPFDPQYTITNAVGNIISALVFGHRFEYSDASFRRILELDNEAVVLAGSARTQ 91391 (0) 91307 LYDSFPSLMKHLPGPHQTVHANYGKITDFLKKEVDKHMEEWNPEDPRDYVDTYLSEMEK 91131 (0) 90784 MNQDPQGGFNVETLLICILDLIEAGTESAATTLRWGLVFILNYPDVQ 90644 (1) 90564 EKVQEEIDRVIGQSRQPAMADRPNMPYTDAVIHEIQRFANVVPAGFPKMATKDTTVGGYFIPK 90376 (0) 90287 GLAITTMLSSVLFDKNEWETPDVFNPNHFLDSEGRFRKRDAFIPFSA 90147 (1) 90043 GKRVCIGENLAKMELFLFFTSILQHFNLSPVPGQMPSLEGILGFTYSPQPFRMIVAPR* 89867 CYP2AD2 Danio rerio (zebrafish) GenEMBL AF248042 Tanguay R.L. 75% to 2AD3 MILHLIYDSFDFKSWIIFFVVFLIIAEMIKNRTPSNYPPGPWPL PFLGTVFTKMDFKNINKLAKVYGKVFSLRVGSEKMIIVSGYKMVKEALVTQNDSFVLR PPVPLFHKVYKGIGLTMSNGYIWRSHRRFAASHLRTFGEGKKNLELGIQQECVYLCDA FKAEKEPFNPIFILHGAVSNTVACLTFGQRFDYNDEWYQEILRLDNQCVQLAGSPRVQ LYNAFPKLLDYLPGPHQKVFSNYKKITQSLKDEIIKHREDWDPANPRDFIDNYLTEME KKKSDPQAGFNIESLIISCLDIVEAGTETGATTLRWGLLFMIKFPEIQKKVQAEIDRV IGQSRQPCLDDRVNMPYTEAVLHEIQRFGDVVPLGFPKQAAVDTKIGNYFIPKGTSIT TNLSSVLHDPNEWETPDTFNPGHFLDKNGQFRKRDAFLPFSAGKRACVGELLARNVLF LFFTSLLQQFTLSKCPGEEPSLEGEIWFTYAPAPFRISVSVR CYP2AD3 Danio rerio (zebrafish) No accession number Tseng, H.-P., Wang-Buhler, J.-L., Yang, Y.-H., Hu, C.-H., Buhler, D.R. submitted to nomenclature committee 12/08/2003 75% to CYP2AD2 60% to CYP2AD1 clone name YH-B1-FL CYP2AD4 Oryzias latipes GenEMBL BJ494553 EST 70% to CYP2AD1 CYP2AD5 Gasterosteus aculeatus GenEMBL CD499490 EST 67% to CYP2AD1 CYP2AD6 Danio rerio (zebrafish) CYP2AD7 Oryzias latipes (medaka) chr4 28086098:28094682 Joanna Wilson and students submitted to nomenclature committee Jan. 
25, 2008
61%ID to Zebrafish 2AD2
73% to 2AD1 (FORMERLY 2N12)
probable GC boundary based on mRNA EF546460.1
MIFQALFDRMDFNSWLVFGFVLLLLIDIVKTWKPPKFPPGPLSVPFLGNVFTGVDFKTMEKLSQDFGPVFSLRRG
SERMVFISGYKMVKEALVTQLDSFVDRPIVPLFHVVFKGLGIALSNGYLWKKQRKFANAHLRYFGEGQKSLERYI
EIESNFLCDAFKEEQ (1 GC boundary) GRPFNPHYLITNAVGNIISSVVFGHRFEYSDPSFRKVLELDNEAVVLSGSARTQLYDAFP
SLLNYLPGPHQTVHANYREIVCFLRKEIEKHQEEWNPEDPRDYIDVYLSEMEKTKQDPQAGFNIETLVVSTLDLI
EAGTETTATTLRWGLMFMLHHPEIQEKVQEEIDRVIGQSRQPAMSDRPNLPYTDAVIHEIQRMGNIVPLGFPKMA
SKDTTLGGYFIPKGTPITTILSSVLFDKNEWETPHVFNPGHFLDSEGRFLKKEAFLPFSAGKRMCLGEHLAKMEL
FLFFSTLLQRFTFKPVPGEMPSLEGVLGFTHSPEEFRFLALPR*

2AE Subfamily

CYP2AE1 Danio rerio (zebrafish)
NA7219
zfishG-a147a09.q1c
zfishG-a1551g08.q1c
Z35723-a848d07.q1c
49% to 2P6 48% to 2N13 46% to 2V1 46% to 2AD2
28876 MSSVFSQLIGQWLDVQGFLIFLCVLLLVKHFRDVYSKNMPPGPFPLPFVGNLTNIGFSDP 28715
28714 LGSFQR 28697 (0)
28473 IAEKYGDVCTLYLGTKPCILMTGYDTLKEAFVEQADIFTDRPYFPIVDKLGN 28336 (1?)
26270 AGLIMSSGHMWRQQRRFALATLKYFGVGKKTLENAILQECRFLCDSLQAER 26118
25139 GLPFDPQHLVTNAVSNIICGLVFGHRFEYDDHQFHLMQTYINNILQLPISNWGR 24978
24700 LYNEFPTLMSLLPGKHQTAFASMSKLQPFLKEEITKHQQDREPSSPRDYIDCYLEEIEK 24524
21648 QCKDSDAEFTEENLMFCVVDLFGAGTETTSNTLRWALAFMVKYPDVQ 21508
21386 EKVQSEIDQVIGQTRQPLMDDRTNLPYTYAVIHEIQRFANIVTFTPPRVANKDTTVGGQLIPK 21198
18506 GVIVLPMLKPILLDKKEYSTPYDFNPDHFLDQNGKFLKKENFIPFSI 18366
14291 GKRMCPGEQLAGMELFLFFISLMQHFTFLPPEGETLSLKIFLAIASAPAPFRI 14133
KAVPRQCDNTAS*

CYP2AE1-de9 Danio rerio (zebrafish)
NA7219
extra exon 9 6kb downstream of 2AE1
8074 GKRMCPGEQLARMELFLFFISLMQHFTFLPVEGQKLSLKGTTSVSSAPQPFQI 7916

2AF Subfamily

CYP2AF1 Phalacrocorax carbo (Common cormorant)
No accession number
Hisato Iwata
submitted to nomenclature committee 5/19/05
45% to 2C11 rat
this is a new vertebrate subfamily