P450s that have appeared since the 1993 P450 nomenclature update.
      This is part A of the list covering CYP1 to CYP2
      This includes references that were incomplete and duplications
      of sequences that were already in the update.  If a sequence 
      is assigned an accession number that was not in the old update
      it is included in this list.  
      This list was last revised on Jan. 31, 2003. 
      Added all human genes and pseudogenes

      Compiled by David R. Nelson

      A new format is being designed to make the entries more useful, with links to 
      Genbank and Medline and access to the protein sequence.  As time permits the   
      entries in the 1993 P450 Nomenclature Update will be added to make the  
      listing more comprehensive.  For the time being, I will leave the old text 
      format in place below the newer table format, but eventually the text version 
      will be deleted.  Any comments are welcome.

1A Subfamily

1B Subfamily

2A Subfamily

2B Subfamily

2C Subfamily

2D Subfamily

2E Subfamily

2F Subfamily

2G Subfamily

2H Subfamily

2J Subfamily

2K Subfamily

2L Subfamily

2M Subfamily

2N Subfamily

2P Subfamily

2Q Subfamily

2R Subfamily

2S Subfamily

2T Subfamily

2U Subfamily

2V Subfamily

2W Subfamily

2X Subfamily

2Y Subfamily

2Z Subfamily

2AA Subfamily

2AB Subfamily

2AC Subfamily

2AD Subfamily

2AE Subfamily

2AF Subfamily

Updated on March 5, 1999

Cytochrome P450 Data CYP1 to CYP2 (Under Construction)

P450 gene  Species                    Medline Entry  Comment  Protein Sequence  Genbank Accession
CYP1A1     human                      Kawajiri 1986  none     3' UTR            D12525 D01198
CYP1A1     human                      Kubota 1991    none     3' UTR            D12525 D01198
CYP1A1     human                      Hayashi 1991   none     3' UTR            D12525 D01198
CYP1A1     human                      Kawajiri 1986  none     5' UTR            D10855 D01150
CYP1A1     human                      Kubota 1991    none     5' UTR            D10855 D01150
CYP1A1     Cavia cobaya (guinea pig)  Ohgiya 1993    none     Get Seq           D11043 PIR S43414


1A Subfamily


CYP1A1      human
            GenEMBL D12525 D01198 (650bp)
            Kawajiri,K., Watanabe,J., Gotoh,O., Tagashira,Y., and Sogawa,K.
            Structure and drug inducibility of the human cytochrome P-450c
            gene.
            Eur. J. Biochem. 159, 219-225 (1986)

            Kubota,M., Sogawa,K., Kaizu,Y., Sawaya,T., Watanabe,J.,
            Kawajiri,K., Gotoh,O. and Fujii-Kuriyama,Y.
            Xenobiotic responsive element in the 5'-upstream region
            of the human P-450c gene.
            J. Biochem. 110, 232-236 (1991)
      
            Hayashi,S.-i., Watanabe,J., Nakachi,K. and Kawajiri,K.
            Genetic linkage of lung cancer-associated MspI polymorphisms
            with amino acid replacement in the heme binding region of
            the human cytochrome P450IA1 gene.
            J. Biochem. 110, 407-411 (1991)

CYP1A1      human
            GenEMBL D10855 D01150 (4144bp)
            Kawajiri,K., Watanabe,J., Gotoh,O., Tagashira,Y., and Sogawa,K.
            Structure and drug inducibility of the human cytochrome P-450c
            gene.
            Eur. J. Biochem. 159, 219-225 (1986)

            Kubota,M., Sogawa,K., Kaizu,Y., Sawaya,T., Watanabe,J.,
            Kawajiri,K., Gotoh,O. and Fujii-Kuriyama,Y.
            Xenobiotic responsive element in the 5'-upstream region
            of the human P-450c gene.
            J. Biochem. 110, 232-236 (1991)
            Note: these references are the same as those listed for the two
            earlier accession numbers (D12525 and D01198).

CYP1A1      Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            94% to CYP1A1 human, 73% to CYP1A2 human, ortholog of CYP1A1
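
Many entries in this list report a percent identity to a reference sequence
(for example "94% to CYP1A1 human").  The list does not say how those values
were calculated, so the following is only a minimal sketch of one common
definition: identical residues divided by aligned, non-gap columns.  The
function name percent_identity and the use of "-" as the gap character are
assumptions made for illustration, and the input must already be aligned.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over columns where neither sequence has a gap.

    Both arguments are rows of an existing pairwise alignment, so they
    must have the same length.  Gaps are assumed to be written as "-".
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Keep only columns in which both sequences carry a residue.
    pairs = [(a, b) for a, b in zip(aligned_a, aligned_b)
             if a != "-" and b != "-"]
    if not pairs:
        return 0.0
    matches = sum(1 for a, b in pairs if a == b)
    return 100.0 * matches / len(pairs)
```

Other conventions exist (for example dividing by the length of the shorter
sequence), so values computed this way may differ slightly from those quoted
in the entries.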

CYP1A1      Cavia cobaya (guinea pig)
            GenEMBL D11043 (2674bp)
            PIR S43414 (516 amino acids)
            Ohgiya,S., Ishizaki,K. and Shinriki,N.
            Molecular cloning of guinea pig CYP1A1: complete primary structure 
            and fast mobility of expressed protein on electrophoresis.
            Biochim. Biophys. Acta 1216, 237-244 (1993)
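
The accession lines in the text entries follow a loose pattern: a database
label (GenEMBL, PIR or Swiss), one or more accession numbers, and an optional
length in parentheses.  As a rough illustration only, the sketch below pulls
those pieces out of a single line.  The regular expression and the helper name
parse_accession_line are assumptions, not part of the original listing, and
they only cover lines that name one database.

```python
import re

# Hypothetical pattern for lines such as "GenEMBL D12525 D01198 (650bp)"
# or "PIR S43414 (516 amino acids)".
LINE_RE = re.compile(
    r"^\s*(?P<db>GenEMBL|PIR|Swiss)\s+(?P<accs>[A-Z0-9. ]+?)\s*"
    r"(?:\((?P<length>\d+)\s*(?P<unit>bp|amino acids)\))?\s*$"
)

def parse_accession_line(line):
    """Return database, accession list, length and unit, or None if no match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    return {
        "db": m.group("db"),
        "accessions": m.group("accs").split(),
        "length": int(m.group("length")) if m.group("length") else None,
        "unit": m.group("unit"),
    }
```

Lines that combine two databases, such as "GenEMBL D10913 (8700bp) Swiss
Q00557 (524 amino acids)", and lines such as "No accession number" return
None in this sketch and would need separate handling.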

CYP1A1      rat
            GenEMBL I00732 (1800bp)
            Oeda,K., Sakaki,T., Ohkawa,H., Yabusaki,Y., Murakami,H.,
            Nakamura,K. and Shimizu,M.
            Cytochrome P-450MC gene, expression plasmid carrying the said gene,
            yeasts transformed with the said plasmid and a process for producing
            cytochrome P-450MC by culturing the said transformant yeasts.
            Patent: US 4766068-A 1 23-AUG-1988

CYP1A1      rat
            PIR A93513 (524 amino acids)
            Yabusaki, Y., Shimizu, M., Murakami, H., Nakamura, K., Oeda,
            K. and Ohkawa, H.
            Nucleotide sequence of a full-length cDNA coding for
            3-methylcholanthrene-induced rat liver cytochrome P-450MC.
            Nucleic Acids Res. 12, 2929-2938 (1984)

CYP1A1      rat
            PIR S45716 (524 amino acids)
            Omata, Y., Robinson, R.C., Gelboin, H.V., Pincus, M.R.,
            Friedman, F.K.
            Specificity of the cytochrome P-450 interaction with
            cytochrome b(5).
            FEBS Lett.  346, 241-245 (1994)

CYP1A1      rat
            PIR D60822 (19 amino acids)
            Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and
            Oesch, F.
            Effect of nutritional imbalances on cytochrome P-450 isozymes
            in rat liver.
            Biochem. Pharmacol. 37, 3245-3249 (1988)

CYP1A1      hamster
            GenEMBL D10913 (8700bp) Swiss Q00557 (524 amino acids)
            Sagami,I., Ohmachi,T., Fujii,H., Kikuchi,H. and Watanabe,M.
            Hamster cytochrome P-450 IA gene family, P-450IA1 and P-450IA2 in
            lung and liver: cDNA cloning and sequence analysis
            J. Biochem. 110, 641-647 (1991)

CYP1A1      hamster
            PIR JS0746 (524 amino acids)
            Ohgiya, S., Goda, T., Ishizaki, K., Morimoto, M., Sakamoto,T.,
            Kamataki, T. and Shinriki, N.
            unpublished (1992)

CYP1A1      rabbit
            PIR A25143 (464 amino acids)
            Okino, S.T., Quattrochi, L.C., Barnes, H.J., Osanto, S.,
            Griffin, K.J., Johnson, E.F. and Tukey, R.H.
            Cloning and characterization of cDNAs encoding 2,3,7,
            8-tetrachlorodibenzo-p-dioxin-inducible rabbit mRNAs for
            cytochrome P-450 isozymes 4 and 6.
            Proc. Natl. Acad. Sci. U.S.A. 82, 5310-5314 (1985)

CYP1A1      Macaca irus (crab eating macaque monkey)
            GenEMBL D17575 (2602bp)
            Ohmachi,T., Sagami,I., Kikuchi,H., Fujii,H., Suzaki,Y., Fujiwara,T.
            and Watanabe,M.
            Molecular cloning and sequence analysis of cDNA encoding a
            crab-eating monkey (Macaca irus) cytochrome P-450
            unpublished (1993)

CYP1A1      Macaca fascicularis (crab eating macaque monkey)
            Swiss P33616 (512 amino acids)
            Komori,M., Kikuchi,O., Kitada,M. and Kamataki,T.
            Molecular cloning of monkey 1A1 cDNA and expression in yeast.
            Biochim. Biophys. Acta 1131, 23-29 (1992)

CYP1A1      Sus scrofa (pig)
            GenEMBL AB052254
            Misaki Kojima
            Submitted to nomenclature committee Oct. 27, 2000
            82% to human CYP1A1, 74% to human 1A2

CYP1A1      Ovis aries (sheep)
            GenEMBL S79795 (2585bp)
            Hazinski,T.A., Noisin,E., Hamon,I. and DeMatteo,A.
            Sheep lung cytochrome P4501A1 (CYP1A1): cDNA cloning and
            transcriptional regulation by oxygen tension
            J. Clin. Invest. 96 (4), 2083-2089 (1995)

CYP1A1      Bos taurus (cow)
            See cattle page for details
MFSVFGLPIPISATELLLASAVFCL
VFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLMLGKNPHVVLSQLSQRYGDVLQIRIG
CTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLITNGQSMTFNPDSGPVWAARRRL
AQNALKSFSTASDPASSSSCYLEEHVNKEAKYLLGKFQELMSGPGRFDPYRYIVVSVA
NVICAICFGRRYDHNDQEFLSLVNLSNEFGEITASGNPSDFIPVLRYLPNTALDLFKD
LNQRFYVFVQKIVKEHYKTFEKGHIRDITDSLIEHCQDKRLDENANIQLSDEKIINVV
IDLFGAGFDTVTTALSWSLLYLVTSPRVQKKIQEELDTVIGRARRPRLSDRPQLPYLE
AFILETFRHSSFVPFTIPHSTTRDSNLNGFYIPKGRCVFVNQWQINHDQKLWEDPSEF
RPERFLTADGTINKVLSEKVIIFGLGKRKCIGETIARLEVFLFLAILLHQVEFCVTPG
VKVDMTPVYGLTMKYARCEHFQAHMRS

CYP1A1       Canis familiaris (dog)
             AACN010067442.1 Canis familiaris ctg19866850684014, 
             79% to 1A1 human N-term
             AACN010089968.1 Canis familiaris ctg19866851895459, 
             84% to 1A1 C-term
             full length combined seq = 81% to 1A1 
1868 MFRLSIPISASELLLASTVFCLVLWVVKAWQPRLPKGLKSPPGPWGWPLLGNVLTLGKSPHLALS 2062
2063 RLSQRYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGDDFKGRPDLYSFSLVTDGQSLTFS 2242
2243 PDSGPVWAARRRLAQNALKSFSIASDPASSCSCYLEEHVSKEAEVLLSRLQEQMAEVGRF 2422
2423 DPYRYIVVSVANVICAMCFSKRYDHDDQELLSLVNLSNEFGEGVASANPLDFFPILRYLP 2602
2603 NPALDFFKDLNKRFYSFMQKMVKEHYKTFEK 2695
 133 GQIRDVTDSLIEHCQDKRLDENANIQLSDEKIVNVVLDLFGA 258
 347 GFDTVTTAISWSLLYLVTNPNVQKKIQKEL 436
 529 DTVIGRARQPRLSDRPQLPYMEAFILETFRHASFVPFTIPH 
     STTRDTSLSGFYIPKGRCVFVNQWQINHDQ 885
1038 KLWGNPSEFQPERFLTLDGTINKALSEKVILFGLGKRKCIGETIARLEVFLFLAILLQQ 1217
1218 VEFSVPEGTKVDMTPIYGLTMKHARCEHFQVRVRTEGAERSAA* 1349
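
The combined dog sequence above is written as peptide fragments flanked by
contig coordinates, with "*" marking the stop codon.  A minimal sketch for
turning such lines into one continuous protein string is given below.  The
helper name join_fragments is hypothetical, and the code assumes that every
run of capital letters on a line is sequence and that everything else
(numbers, "*", markers in parentheses) can be dropped.

```python
import re

def join_fragments(lines):
    """Concatenate peptide fragments, dropping coordinates and the stop mark."""
    parts = []
    for line in lines:
        # Runs of capital letters are residues; digits are contig
        # coordinates and "*" is the stop codon, so neither is kept.
        parts.extend(re.findall(r"[A-Z]+", line))
    return "".join(parts)
```

For example, join_fragments(["2603 NPALDFFKDLNKRFYSFMQKMVKEHYKTFEK 2695",
" 133 GQIRDVTDSLIEHCQDKRLDENANIQLSDEKIVNVVLDLFGA 258"]) returns the two
fragments joined end to end.  The order of the input lines is taken as given;
the sketch does not check the coordinates.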

CYP1A1       horse 
             No accession number
             Heather Knych
             Submitted to nomenclature committee Oct. 14, 2007
             80% to CYP1A1 human, 70% to CYP1A2 human

CYP1A1       Macropus eugenii (tammar wallaby)
             no accession number
             Ross McKinnon
             submitted to nomenclature committee 9/7/98
             98 amino acid C-terminal fragment is 82% identical to macaque 1A1

CYP1A1       Monodelphis domestica (opossum)
             UCSC Browser Oct 2006 assembly chr1 23141664- 23146346 (-) strand
             Syntenic with human CYP1A1 adjacent to EDC3 and CYP1A2
             73% to 1A1 hum 65% to 1A2 hum Built_from_P56591_and_others
             489177 - 493862 bp (489.2 Kb) on chromosome fragment scaffold_14927
             This transcript is located in sequence: contig_43733
MTSILSLLGFSKSFTVTELLVVSAVFCLVFWIIDSYHQRVPKGFKSPPGPWAWPLIGNVL
TLGKNPHLVLTQMREKYGDVMQIQIGSTPVLVLSGLETIRHALVKQGDDFKGRPDLYSFS
LILDGESLSFGPDSGEVWAARRKLTQNALKAFSISSSPSSSFCYLEEHVIKEAEYLIQKF
QEQKGHFDPVRYIVVSVANVICAICFGQRYDHDDQELLNIVRLSNKFGEVAASGNPVDFI
PILRYLPNSKITAFRDLNEKIVAFTQKLVKEHYRKFEKGCIRDITDSLIEHCQEKKLDEN
ANIMLSEKKVVNVVIDLFGAGFDTVTTAISWGLMYLVAKPEVQKKIHEELDTVIGRERLP
QLSDKTQLPYMEAFILETFRHSSFLPFTIPHSTTRDITLNGFYIPKGRCVFVNQWQINHD
PKIWGDPSVFRPERFLSVDGTINKALSEKVIMFGLGKRKCIGETIARWEVFLFLSILLHR
MEFSVPSGVKVDLTPVYGLTMKHIPCEHFQTKLRS

CYP1A1      Balaenoptera acutorostrata  (Minke whale)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            submitted to nomenclature committee 5/15/98

CYP1A1     Balaenoptera acutorostrata (Minke whale)
           No accession number
           Iwata Hisato
           submitted to nomenclature committee 1/6/05 
           82% to CYP1A1 human, 74% to CYP1A2

CYP1A1     Pusa sibirica (Baikal seal)
           No accession number
           Iwata Hisato
           submitted to nomenclature committee 1/6/05 
           82% to CYP1A1 human, 75% to CYP1A2

CYP1A1      Phocoenoides dalli (Dall's porpoise)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            submitted to nomenclature committee  5/15/98

CYP1A1      Eumetopias jubatus (Steller sea lion)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            clone #1
            submitted to nomenclature committee 5/15/98

CYP1A1      Phoca largha (Spotted seal)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            submitted to nomenclature committee 5/15/98

CYP1A1      Phoca fasciata (Ribbon seal)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            submitted to nomenclature committee 6/29/99 revised 2/27/01

CYP1A1      Halichoerus grypus (grey seal, gray seal)
            No accession number
            Rachel Tilley
            Submitted to nomenclature committee 3/19/2001
            Name grey seal 1

CYP1A1      Phoca groenlandica (harp seal)
            No accession number
            Rachel Tilley
            Submitted to nomenclature committee 3/19/2001
            Name harp seal 1

Cyp1a1      mouse
            GenEMBL K02588 (2619bp)
            Kimura,S., Gonzalez,F.J. and Nebert,D.W.
            The murine Ah locus.
            J. Biol. Chem. 259, 10705-10713 (1984)

Cyp1a1      mouse
            GenEMBL M10021 (8809bp)
            PIR A24953 (30 amino acids)
            Gonzalez,F.J., Mackenzie,P.I., Kimura,S. and Nebert,D.W.
            Isolation and characterization of full-length mouse cDNA and
            genomic clones of 3-methylcholanthrene-inducible cytochrome
            P-1-450 and P-3-450
            Gene 29, 281-292 (1984)

Cyp1a1     mouse
            GenEMBL X01681 (6214bp)
            Kimura,S., Gonzalez,F.J. and Nebert,D.W.
            The murine Ah locus: Comparison of the complete cytochrome P1-450
            and P3-450 cDNA nucleotide and amino acid sequences
            J. Biol. Chem. 259, 10705-10713 (1984)

Cyp1a1     mouse
            GenEMBL M11515 (8850bp)
            Kimura,S. and Nebert,D.W.
            Comparison of the mouse P-1-450 gene and flanking sequences from a
            MOPC 41 plasmacytoma and normal liver.
            DNA 4, 365-375 (1985)

Cyp1a1     mouse
            GenEMBL M25623 (410bp)
            Peterson,T.C., Gonzalez,F.J. and Nebert,D.W.
            Methylation differences in the murine P-1-450 and P-3-450 genes in
            wild-type and mutant hepatoma cell culture
            Biochem. Pharmacol. 35, 2107-2114 (1986)

Cyp1a1     mouse
            GenEMBL M33935 (474bp)
            Jones,J.E. and Nebert,D.W.
            Transcriptional start site in the mouse Cyp1a1 (cytochrome P-1-450) gene.
            DNA 8, 527-534 (1989)

Cyp1a1     mouse
            PIR C24406 (24 amino acids) 
            Cheng, K.C., Park, S.S., Krutzsch, H.C., Grantham, P.H.,
            Gelboin, H.V. and Friedman, F.K.
            Amino-terminal sequence and structure of monoclonal antibody
            immunopurified cytochromes P-450.
            Biochemistry 25, 2397-2402 (1986)

CYP1A1      Xenopus tropicalis (frog)

CYP1A2      human
            GenEMBL M38504 (3149bp)
            Jaiswal,A.K., Nebert,D.W., McBride,W.O. and Gonzalez,F.J.
            Human P-3-450: cDNA and complete protein sequence, repetitive Alu
            sequences in the 3' nontranslated region, and localization of gene
            to chromosome 15
            J. Exp. Pathol. 3, 1-17 (1987)

CYP1A2      human
            GenEMBL U02993 (3293bp)
            Quattrochi,L.C. and Tukey,R.H.
            The human cytochrome Cyp1A2 gene contains regulatory elements
            responsive to 3-methylcholanthrene
            Mol. Pharmacol. 36, 66-71 (1989)

CYP1A2      human
            PIR A25892 (515 amino acids)
            Quattrochi, L.C., Pendurthi, U.R., Okino, S.T., Potenza, C. and
            Tukey, R.H.
            Human cytochrome P-450 4 mRNA and gene: part of a multigene
            family that contains Alu sequences in its mRNA.
            Proc. Natl. Acad. Sci. U.S.A. 83, 6731-6735 (1986)

CYP1A2      human
            PIR A60881 (18 amino acids)
            Wrighton, S.A., Campanile, C., Thomas, P.E., Maines, S.L.,
            Watkins, P.B., Parker, G., Mendez-Picon, G., Haniu, M.,
            Shively, J.E., Levin, W. and Guzelian, P.S.
            Identification of a human liver cytochrome P-450 homologous
            to the major isosafrole-inducible cytochrome P-450 in the rat.
            Mol. Pharmacol. 29, 405-410 (1986)

CYP1A2      Macaca fascicularis (cynomolgus monkey)
            GenEMBL D86474
            Sakuma,T., Hieda,M., Igarashi,T., Ohgiya,S., Nagata,R., Nemoto,N.
            and Kamataki,T.
            Molecular cloning and functional analysis of cynomolgus monkey
            CYP1A2
            Biochem. Pharmacol. 56 (1), 131-139 (1998)

CYP1A2      Macaca fuscata (Japanese macaque)
            GenEMBL AB185338 (hold till 7/22/2005)
            Shizuo Narimatsu 
            Submitted to nomenclature committee 8/28/2004
            99% identical to cynomolgus monkey CYP1A2 
            92.4% to human CYP1A2

CYP1A2      rabbit
            PIR B27821 (516 amino acids)
            Kagawa, N., Mihara, K., Sato, R.
            Structural analysis of cloned cDNAs for polycyclic
            hydrocarbon-inducible forms of rabbit liver microsomal
            cytochrome P-450.
            J. Biochem. 101, 1471-1479 (1987) 

CYP1A2      dog
            PIR A60463 (16 amino acids)
            Ohta, K., Motoya, M., Komori, M., Miura, T., Kitada, M. and
            Kamataki, T.
            A novel form of cytochrome P-450 in beagle dogs. P-450-D3 is
            a low spin form of cytochrome P-450 but with catalytic and
            structural properties similar to P-450d.
            Biochem. Pharmacol. 38, 91-96 (1989)

CYP1A2       Canis familiaris (dog)
             UCSC Browser chr30:40816888-40821608 (+) strand May 2005 assembly
             AACN010103563.1 Canis familiaris ctg19866850724666, 
             90% to 1A2
             AACN010517076.1 Canis familiaris ctg19866850724664, 
             82% to 1A2 human N-term
             AACN010004324.1 Canis familiaris ctg19866850196532, 
             86% to 1A2 C-term
             combined sequence for 1A2
 362 MALSQMATELLLASTIFCLILWVVKVWQPRLPKGLKSPPGPWGWPLLGNVLTLGKSPHLALS 177
 176 RLSQRYGDVLQIRIGSTPVLVLSSLDTIRQALVRQGDDFKGRPDLYSFSLVT
     DGQSLTFSPDSGPVWAARRRLAQNALNTFSIASDPASSCSCYLEE 771
770  HVSKEAEALLSRLQEQMAEVGRFDPYNQVLMSVANVIGAMCFGHHFSQRSEEMLPLLMSS 591
590  SDFVETVSSGNPLDFFPILQYMPNSALQRFKNFNQTFVQSLQKIVQEHYQDFDE 429
     RSVQDITGALLKHNEKSSRASDGHIPQEKIVNLINDIFGA
     GFDTVTTAISWSLMYLVANPEIQRKIQKEL
     DTVIGRARQPRLSDRPQLPLMEAFILEIFRHTSFVPFTIPHS (2)
 631 TTKNTTLKGFYIPKECCVFINQWQVNHDQ 717
1789 QVWGDPFAFRPERFLTADGTAINKTLSEKVMLFGMGKRRCIGEVLAKWEIFLFLAILLQ 1968
1969 RLEFSVPAGVRVDLTPIYGLTMKHTRCEHVQARPRFSIK* 2088

CYP1A2      Bos taurus (cow)
            See cattle page for details
MALSQLSPFSAMELLLASAIFCLVFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLTLG
KNPHVVLSQLSQRYGDVLQIRIGCTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLVT
DGQSMTFNPDSGPVWAARRRLAQNALNTFSVASD
PSSSSSCYLEDHVSKEAEALLGKFQELMSGPGRFDPYGHVVASV
ANVIGAMCFGQHFPQSSKEMLSLVESSHDFVESASSGNPVDFFPILKYLPNPALQRFK
SFNQRFLQFVRKTVQEHYQDFDKNSIQDIIGALFKHSEDNSRASSRLISQEKTVNLVN
DLFAAGFDTITTAISWSLMYLVTNPKIQRKIQEELD
RVVGRARRPRLSDRPQLPYLES
FILETFRHSSFVPFTIPHSTTRDTTLNGFFIPKERCVFINQWQVNHDPKLWGDPSVFR
PERFLTSDGTTIDKTASEKVLLFGMGKRRCIGEVMARWEVFLFLAILLQRLEFSVPPG
VKVDLTPTYGLTMKHARCEHMQARLRFPIK

CYP1A2    Sus scrofa (miniature pig) 
          no accession number
          Haitao Shang
          Submitted to nomenclature committee May 23, 2007
          86% to 1A2hum, 75% to 1A1hum
          partial seq.

CYP1A2    Sus scrofa (miniature pig) 
          GenEMBL CB483208.1
KLWGDPSEFRPERFLTADGTAIHKTMSEEVILFGMGKRRCIGEVLAKWEVFLFLAILLQQ 
LEFSVPP

CYP1A2      rat
            PIR B24406 (25 amino acids)
            Cheng, K.C., Park, S.S., Krutzsch, H.C., Grantham, P.H.,
            Gelboin, H.V. and Friedman, F.K.
            Amino-terminal sequence and structure of monoclonal antibody
            immunopurified cytochromes P-450.
            Biochemistry 25, 2397-2402 (1986)

CYP1A2      rat
            GenEMBL X01031 (1106bp) PIR A44612 (367 amino acids)
            Yabusaki, Y., Murakami, H., Nakamura, K., Nomura, N.,
            Shimizu, M., Oeda, K. and Ohkawa, H.
            Characterization of complementary DNA clones coding for two
            forms of 3-methylcholanthrene-inducible rat liver
            cytochrome P-450.
            J. Biochem. 96, 793-804 (1984)

CYP1A2      rat
            PIR S26822 (19 amino acids)
            Botelho, L.H., Ryan, D.E., Yuan, P.M., Kutny, R., Shively,
            J.E. and Levin, W.
            Amino-terminal and carboxy-terminal sequence of hepatic
            microsomal cytochrome P-450d, a unique hemoprotein from
            rats treated with isosafrole.
            Biochemistry 21, 1152-1155 (1982)

CYP1A2      rat
            PIR D60822 (22 amino acids)
            Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and
            Oesch, F.
            Effect of nutritional imbalances on cytochrome P-450 isozymes
            in rat liver.
            Biochem. Pharmacol. 37, 3245-3249 (1988)

CYP1A2      rat
            PIR A61400 (513 amino acids)
            Woelfel, C.; Platt, K.L.; Dogra, S.; Glatt, H.; Waechter, F.;
            Doehmer, J.
            Stable expression of rat cytochrome P450IA2 cDNA and
            hydroxylation of 17beta-estradiol and 2-aminofluorene in
            V79 Chinese hamster cells.
            Mol. Carcinog. 4, 489-498 (1991) 

CYP1A2      hamster
            GenEMBL D10914 (9719bp)
            Sagami,I., Ohmachi,T., Fujii,H., Kikuchi,H. and Watanabe,M.
            Hamster cytochrome P-450 IA gene family, P-450IA1 and P-450IA2 in
            lung and liver: cDNA cloning and sequence analysis
            J. Biochem. 110, 641-647 (1991)

CYP1A2        Mesocricetus auratus (hamster)
             GenEMBL M63787 M34446 (1868bp)
             Lai,T.S. and Chiang, J.Y.L.
             Cloning and characterization of two major 3-methylcholanthrene inducible hamster 
             liver cytochrome P-450s.
             Arch. Biochem. Biophys. 283, 429-439 (1990)
             clone MC4
             note: M34446 is incorrectly included in the GenBank entry
             for CYP2A8 and CYP2A9. M34446 should only be in the CYP1A2 hamster entry.

CYP1A2      Cavia cobaya (guinea pig)
            GenEMBL D50457 (1760bp)
            Mori,T., Itoh,S., Ohgiya,S., Ishizaki,K. and Kamataki,T.
            Effect of ascorbic acid on expression of several forms of
            cytochrome P-450 of guinea pig
            Unpublished (1995)

CYP1A2      Cavia porcellus (guinea pig)
            GenEMBL U23501 (1757bp)
            Black,V.H.
            unpublished 1995

CYP1A2       Monodelphis domestica (opossum)
             UCSC Browser Oct 2006 assembly chr1 23173195 - 23183937 (+) strand
             Syntenic with human CYP1A2 adjacent to CYP1A1 and CSK
             70% to 1A2, 65% to 1A1 Built_from_Q64391_and_others
             451687 - 462429 bp (451.7 Kb) on chromosome fragment scaffold_14927
             This transcript is located in sequence: contig_91822
MVSSLLASISISELLLASVIFCLVFWVTRSSHQRVPKGLKSPPGPWAWPLFGNVWTLGKN
PHLTLAQLSEKYGDVMKIHIGSTPVIVLSGLETIRQALVKQGEDFKGRPDLYSSTFVADG
YSLAFNPDSGEVWAVRRKLAQNALNTFSVSSSPSSSSCYLEEHVNKEVKHLIQKFQELME
GVGCFDPYRHIVASVANVISAMCFSQRYEDHKNPEFTTLINASHEFVESATSGNPVDFFP
ILRYIPNPQLQRFKEFNQRFLKFLQNTIREHHKAFDENNIQDITGALYKHSQDKAFGNTS
SSVPEMLIINLINDIFGAGFDTVTTAISWSLMYLVTNPKVQKKIQQELDTVIGRDRWPLL
SDRPQLPFMEAFILEIFRHTSFVPFTIPHSTTRATTLNNFYIPKGTCVFVNQWQTNHDPK
LWEDPSVFRPERFLSADGTVNKALSEKVILFGLGKRRCIGETIARWEVFLFLAILLHQIE
FSVPSGVKVDMTPTYGLTMKHPRCEHFQARPRFSR

CYP1A2      chicken
            GenEMBL M64537 (884bp)
            Swiss Q01741 (258 amino acids)
            Murti,J.R., Adiga,P.R. and Padmanaban,G.
            Estradiol-17-Beta induces polyaromatic hydrocarbon-inducible
            cytochrome p-450 in chicken liver
            Biochem. Biophys. Res. Commun. 175, 928-935 (1991)
            Note: previously called 1A2

CYP1A2      Eumetopias jubatus (Steller sea lion)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            clone #2
            submitted to nomenclature committee 5/15/98

CYP1A2      Phoca fasciata (Ribbon seal)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            submitted to nomenclature committee 6/28/99 revised 2/27/01

CYP1A1/CYP1A2 chimera  Phoca fasciata (Ribbon seal)
            no accession number
            Ikuko Teramitsu, Yukio Yamamoto, and Shoichi Fujita
            submitted to nomenclature committee 6/28/99
            on 2/27/01 the authors sent the following message
            "... we believe that the production of the chimera 
            sequence could be the result of a PCR defect."

CYP1A2      Halichoerus grypus (grey seal, gray seal)
            No accession number
            Rachel Tilley
            Submitted to nomenclature committee 3/19/2001
            Name grey seal 2

CYP1A2      Phoca groenlandica (harp seal)
            No accession number
            Rachel Tilley
            Submitted to nomenclature committee 3/19/2001
            Name harp seal 2

Cyp1a2      mouse
            GenEMBL K02589 (1893bp)
            Kimura,S., Gonzalez,F.J. and Nebert,D.W.
            The murine Ah locus.
            J. Biol. Chem. 259, 10705-10713 (1984)

Cyp1a2     mouse
            PIR A93512 (513 amino acids)
            Kimura, S., Gonzalez, F.J. and Nebert, D.W.
            Mouse cytochrome P-3-450: complete cDNA and amino acid
            sequence.
            Nucleic Acids Res. 12, 2917-2928 (1984)

Cyp1a2     mouse
            GenEMBL X01682 (6715bp)
            Kimura,S., Gonzalez,F.J. and Nebert,D.W.
            The murine Ah locus: Comparison of the complete cytochrome P1-450
            and P3-450 cDNA nucleotide and amino acid sequences
            J. Biol. Chem. 259, 10705-10713 (1984)

Cyp1a2     mouse
            GenEMBL M25624 (510bp)
            Peterson,T.C., Gonzalez,F.J. and Nebert,D.W.
            Methylation differences in the murine P-1-450 and P-3-450 genes in
            wild-type and mutant hepatoma cell culture
            Biochem. Pharmacol. 35, 2107-2114 (1986)

Cyp1a2     mouse
            PIR B92495 (513 amino acids)
            Gonzalez, F.J., Kimura, S. and Nebert, D.W.
            J. Biol. Chem. 260, 11884-11889 (1985)
            Erratum

Cyp1a2     mouse
            GenEMBL M10022 (8865bp)
            PIR B24953 (30 amino acids)
            Gonzalez,F.J., Mackenzie,P.I., Kimura,S. and Nebert,D.W.
            Isolation and characterization of full-length mouse cDNA and
            genomic clones of 3-methylcholanthrene-inducible cytochrome
            P-1-450 and P-3-450
            Gene 29, 281-292 (1984)

Cyp1a2     mouse
            PIR A45955 (42 amino acids) PIR B45955 (39 amino acids)
            Peterson, T.C., Gonzalez, F.J. and Nebert, D.W.
            Methylation differences in the murine P-1-450 and P-3-450
            genes in wild-type and mutant hepatoma cell culture.
            Biochem. Pharmacol. 35, 2107-2114 (1986)

Cyp1a2     mouse
            PIR D24406 (25 amino acids) PIR E24406 (25 amino acids)
            Cheng, K.C., Park, S.S., Krutzsch, H.C., Grantham, P.H.,
            Gelboin, H.V. and Friedman, F.K.
            Amino-terminal sequence and structure of monoclonal antibody
            immunopurified cytochromes P-450.
            Biochemistry 25, 2397-2402 (1986)

CYP1A2     Balaenoptera acutorostrata (Minke whale)
           No accession number
           Iwata Hisato
           submitted to nomenclature committee 1/6/05 
           82% to CYP1A2 human, 69% to CYP1A1

CYP1A2     Pusa sibirica (Baikal seal)
           No accession number
           Iwata Hisato
           submitted to nomenclature committee 1/6/05 
           80% to CYP1A1 human, 69% to CYP1A2

The nomenclature of fish cytochrome P450s is being revised.  Initially there 
appeared to be just one fish 1A gene per species, but Amy Berndtson showed in trout 
that this is not true.  Until an adequate nomenclature can be devised, these fish 
sequences are listed as CYP1A, without a number following the subfamily.  This does 
not affect the mammalian gene designations, though it may affect the chicken 
sequences.

CYP1A1      Oncorhynchus mykiss (trout)
            GenEMBL S69278 (5023bp)
            Berndtson,A.K. and Chen,T.T.
            Two unique CYP1 genes are expressed in response to 
            3-methylcholanthrene treatment in rainbow trout.
            Arch. Biochem. Biophys. 310, 187-195 (1994)
            Note: published as CYP1A2, but it is more similar to Heilmann's sequence
            than Berndtson's 1A1 (97.9% identical).

CYP1A1      Oncorhynchus mykiss (trout)
            GenEMBL U62797 (1697bp)
            Bailey,G., You,L. and Harttig,U.
            Cloning, sequencing and functional expression of two trout CYP1A
            cDNAs in yeast
            unpublished (1997)
            incorrectly called 1A2

CYP1A3      Oncorhynchus mykiss (trout)
            GenEMBL U62796 (2401bp)
            Bailey,G., You,L. and Harttig,U.
            Cloning, sequencing and functional expression of two trout CYP1A
            cDNAs in yeast
            unpublished (1997)
            incorrectly called 1A1

CYP1A      Oncorhynchus mykiss (trout)
           GenEMBL AF015660
           Bailey,G., You,L. and Harttig,U.
           Cloning, sequencing and aflatoxin B1 metabolism by multiple rainbow
           trout CYP1A cDNAs expressed in yeast
           Unpublished
           8 amino acid differences with U62797

CYP1A3      Oncorhynchus mykiss (trout)
            GenEMBL S69277 (5524bp)
            Berndtson,A.K. and Chen,T.T.
            Two unique CYP1 genes are expressed in response to 
            3-methylcholanthrene treatment in rainbow trout.
            Arch. Biochem. Biophys. 310, 187-195 (1994)
            Note: published as CYP1A1.  This sequence is 96.7% identical to
            Heilmann's 1A1 sequence.

CYP1A1/CYP1A3 chimera      Oncorhynchus mykiss (trout)
            PIR A28789 (522 amino acids)
            Heilmann, L.J., Sheen, Y.Y., Bigelow, S.W. and Nebert, D.W.
            Trout P450IA1: cDNA and deduced protein sequence, expression
            in liver, and evolutionary significance.
            DNA 7, 379-387 (1988)
            Published as CYP1A1
            note:  subsequent analysis has shown that the 5' end of this sequence
            comes from the 1A3 gene and the switch over occurs between base 271 
            and base 435 with base 1 as the A of the ATG start codon.
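
The note above gives the switch-over in nucleotide positions, counted from the
A of the ATG start codon.  To relate those positions to the protein, they can
be converted to codon numbers.  The sketch below shows the arithmetic; it
assumes the positions refer to the uninterrupted coding sequence, and the
function name codon_index is introduced here for illustration only.

```python
def codon_index(base):
    """1-based codon number containing a 1-based base (base 1 = A of ATG)."""
    if base < 1:
        raise ValueError("base positions start at 1")
    # Bases 1-3 form codon 1, bases 4-6 form codon 2, and so on.
    return (base - 1) // 3 + 1

first = codon_index(271)  # 91
last = codon_index(435)   # 145
print(first, last)
```

Under that assumption, the switch from 1A3 to 1A1 sequence falls somewhere
between codon 91 and codon 145 of the published protein.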

CYP1A       Pleuronectes platessa (plaice, a fish)
            GenEMBL X73631 (2411bp) PIR S34184 (521 amino acids)
            Leaver,M.J., Pirrit,L. and George,S.G.
            Cytochrome P450 1A1 cDNA from plaice (Pleuronectes platessa) 
            Mol. Marine Biol. Biotechnol. 2, 338-345 (1993)

CYP1A       Opsanus tau (oyster toadfish)
            GenEMBL U14161 (2352bp)
            Morrison, H.G., Oleksiak, M.F., Cornell, N.W., Sogin,M.L. and 
            Stegeman, J.J.
            Identification of Cytochrome P450 1A genes from two teleost fish, toadfish
            (Opsanus tau) and scup (Stenotomus chrysops), and phylogenetic analysis 
            of CYP1A genes.
            Biochem. J. 308, 97-104 (1995)

CYP1A       Stenotomus chrysops (scup, a fish)
            GenEMBL U14162 (1566bp)
            Morrison, H.G., Oleksiak, M.F., Cornell, N.W., Sogin,M.L. and 
            Stegeman, J.J.
            Identification of Cytochrome P450 1A genes from two teleost fish, toadfish
            (Opsanus tau) and scup (Stenotomus chrysops), and phylogenetic analysis 
            of CYP1A genes.
            Biochem. J. 308, 97-104 (1995)

CYP1A       Chaetodon capistratus (four-eye butterfly fish)
            GenEMBL U19855 (2552bp)
            Vrolijk,N.H., Lin,C. and Chen,T.T.
            Characterization and expression of a CYP1A gene from the tropical
            teleost, Chaetodon capistratus.
            Unpublished 1995

CYP1A       Dicentrarchus labrax (European sea bass)
            GenEMBL U78316 (1563bp)
            Stien,X., Amichot,M., Berge,J.-B. and Lafaurie,M.
            Molecular cloning of a CYP1A cDNA from the teleost fish
            Dicentrarchus labrax.
            Unpublished (1995)

CYP1A1v2    Dicentrarchus labrax (European sea bass)
            No accession number
            Alessandra Salvetti
            Submitted to nomenclature committee 11/26/99
            94% identical to U78316 probably an allele

CYP1A       Microgadus tomcod (Atlantic tomcod)
            GenEMBL L41886 (2497bp) L41917
            Roy,N.K., Konkle,B.A., Kreamer,G.-L., Grunwald,C. and Wirgin,I.I.
            Characterization and prevalence of a polymorphism in the 3'
            untranslated region of cytochrome P4501A1 in cancer-prone Atlantic tomcod
            Arch. Biochem. Biophys. (1995) In press
            Probable frameshift detected by O. Gotoh at the beginning of the sequence.

CYP1A       Microgadus tomcod (Atlantic tomcod)
            GenEMBL  L41917 (6837bp)
            Roy,N.K., Konkle,B. and Wirgin,I.I.
            Functional characterization of Cytochrome P4501A1 regulatory
            sequences in cancer-prone Atlantic tomcod.
            Unpublished (1995)

CYP1A       Pagrus major (wild red sea bream)
            no accession number
            Mizukami,M., Okauchi,M., Ariyoshi,T. and Kito,H.
            The isolation and sequence of cDNA encoding a 3-methylcholanthrene-
            inducible cytochrome P450 from wild red sea bream, Pagrus major.
            Marine Biol. 120, 343-349 (1994)

CYP1A      Sparus aurata (gilthead sea bream)
            GenEMBL AF011223, AF005719

CYP1A       Liza aurata 
            GenEMBL AF022433
            Cousinou,M., Lopez-Barea,J. and Dorado,G.

CYP1A      Liza saliens (leaping mullet)
           GenEMBL AF072899
           Alaattin Sen and Don Buhler
           submitted to nomenclature committee
           96% identical to Liza aurata

CYP1A      Limanda limanda
           GenEMBL AJ001724
           Robertson,F.E., McPhail,M.E., Rankin,R., Stagg,R.M. and Craft,J.A.

CYP1A       Platichthys flesus (European flounder)
            GenEMBL AJ132353
            Williams,T.D., Lee,J.S. and Chipman,J.K.
            The cytochrome P450 1A gene (CYP1A) from European flounder
            (Platichthys flesus), analysis of regulatory regions and
            development of a dual luciferase reporter gene assay.
            Unpublished

CYP1A1      Salmo salar (salmon)
            No accession number
            Christopher Rees and Weiming Li
            submitted to nomenclature committee Nov. 9, 2001
            a second gene is being isolated so this is called 1A1 
            rather than just CYP1A.  This does not imply orthology to the 
            mammalian 1A1, 1A2.  The CYP1A gene duplications in fish and mammals 
            occurred independently.

CYP1A      Anguilla anguilla (European eel)
           GenEMBL AF420257
           Mahata,S.C., Mitsuo,R., Aoki,J.-y., Kato,H. and Itakura,T.
           Two forms of cytochrome P450 cDNA from 3-methylcholanthrene-treated
           European eel Anguilla anguilla
           Fish. Sci. 69 (3), 615-624 (2003)
           98% identical to CYP1A9 from Japanese eel (clear ortholog)
           note: Eels have two CYP1A sequences.  This one is 80% identical to
           Salmo salar CYP1A.  CYP1A9 is 77% to the same Salmo CYP1A.
           Therefore, CYP1A9 is a recent duplication in eels that is diverging
           away from the parent sequence called CYP1A (no number after A).
           Called CYPEuMC1 

CYP1A      Anguilla japonica (Japanese eel)
           GenEMBL AB015638
           Mitsuo,R., Itakura,T. and Sato,M.
           Cloning and Sequencing of Cytochrome P450 1A Complementary DNA in
           Eel (Anguilla japonica)
           Mar. Biotechnol. 1 (4), 353-358 (1999)           
           98% identical to CYP1A9 from European eel (clear ortholog)
           note: Eels have two CYP1A sequences.  This one is 81% identical to
           Salmo salar CYP1A.  CYP1A9 is 78% to the same Salmo CYP1A.
           Therefore, CYP1A9 is a recent duplication in eels that is diverging
           away from the parent sequence called CYP1A (no number after A).
           Called CYPJaMC1 

CYP1A      Danio rerio (zebrafish)
           GenEMBL AY398333.1, AB078927.1
           Gene is on CAAK02015935.1 (exon 1), CAAK02015934 (exons 2-6)
MALTILPILGPISVSESLVAIITICLVYLLMRLNRTKIPDGLQK
LPGPKPLPIIGNVLEIGNNPHLSLTAMSKCYGPVFQIQIGMRPVVVLSGNDVIRQALL
KQGEEFSGRPELYSTKFISDGKSLAFSTDQVGVWRARRKLALNALRTFSTVQGKSPKY
SCALEEHISNEGLYLVQRLHSVMKADGSFDPFRHIVVSVANVICGICFGRRHSHDDDE
LVRLVNMSDEFGKIVGSGNPADFIPFLRILPSTTMKKFLDINERFSKFMKRLVMEHYDTFDK (0)
DNIRDITDSLINHCEDRKLDENSNLQVSDEKIVGIVNDLFGA (1)
GFDTISTALSWAVVYLVHYPEVQERLQREL (1)
DEKIGKDRTPLLSDRANLPLLESFILEIFRHSSFLPFTIPHC (2)
TSKDTSLNGYFIPKDTCVFVNQWQVNHDP (2)
ELWKDPSSFIPDRFLTADGTELNKLEGEKVLVFGLGKRRCIGESIGRAEVFLFLAILL
QRLKFTGMPGEMLDMTPEYGLTMKHKRCLLRVTPQPVF

CYP1A      Gobiocypris rarus (a rare minnow)
           GenEMBL EU106660
           Jiayin Dai
           Submitted to nomenclature committee 4/19/2008
           87% to CYP1A Danio

CYP1A      Callorhinchus milii (elephant shark, Chondrichthyes)
           Trace file 1573735839 78% to 1A zebrafish
           1576735840  these two trace files are mate pairs
IRDITDSLIEHCQDKKMDENANIQVSDEKIINIVNDLFGA (1)
GFDTITTGLSWAVMYLVLYPDLQKRLQDEI (1)
DEKIGKDRSPRLSDRSRLPYTDAFILETFRYSSFLPFTIPHC (2)
TTKDTALNGYFIPKNTCVFVNQWQVNHDE (2)
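
Note: the percent-identity figures quoted throughout this list (for example
"78% to 1A zebrafish") come from sequence comparisons.  The sketch below is
only an illustration of the idea, assuming a simple ungapped, position-by-
position comparison of two equal-length peptide fragments.  It is not the
method used to produce the numbers in this list, and the function name
percent_identity is hypothetical.  The two fragments are the zebrafish and
elephant shark exon peptides shown above, which appear to correspond.

```python
def percent_identity(a: str, b: str) -> tuple[int, int, float]:
    """Return (matches, length, percent) for two equal-length peptides.

    Assumption: the fragments are already aligned with no gaps, so
    residue i of `a` is compared directly with residue i of `b`.
    """
    if not a or len(a) != len(b):
        raise ValueError("fragments must be non-empty and of equal length")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches, len(a), 100.0 * matches / len(a)


# Exon fragments copied from the zebrafish and elephant shark entries.
zebrafish = "TSKDTSLNGYFIPKDTCVFVNQWQVNHDP"
shark = "TTKDTALNGYFIPKNTCVFVNQWQVNHDE"

matches, length, pct = percent_identity(zebrafish, shark)
print(f"{matches}/{length} identical ({pct:.1f}%)")
```

A single conserved exon will usually score higher than the whole protein, so
the value printed here should not be expected to match the figure quoted for
the full comparison.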

CYP1A     Petromyzon marinus  (sea lamprey)
          Trace files
          1255373015 (DAVV exon +)
          1386924597 (DAVV exon +)
          1210995499 (DAVV exon +)
          1437249679 (TTRD exon +)
          1468852008 (TTRD exon +)
          1442353648 (TTRD exon +)
          1439550570 (ALWDE exon -) mate = 1442736929 = (TTRD exon +)
          56% to 1A1 and 1A2 human, 61% to Bos 1A2
          N-term part seems to be in a seq gap
DAVVGRQRRPSLNDRRQLPFTEAFILEVLRHSSVVPFTIPHS (2)
TTRDTVLQGFFIPKDTCIFINQWQVNHDS (2)
ALWDEPFAFRPERFLSEDQSSVDRTRAANLLSFGTGKRRCMGEAVARSELFLFLSILLHHL
RIRTADGQAPDMSAVYGLSLKHRTCLLLAESRS*

CYP1A4      Gallus gallus (chicken)
            GenEMBL X99453(2098bp)
            Gilday,D.J., Gannon,M., Yutzey,K., Bader,D. and Rifkind,A.
            Molecular cloning and expression of two novel avian cytochrome P450
            1A enzymes induced by 2,3,7,8-tetrachlorodibenzo-p-dioxin.
            J. Biol. Chem. 271, 33054-33059 (1996)

CYP1A4      Phalacrocorax carbo (Common Cormorant)
            No accession number
            Iwata Hisato
            submitted to nomenclature committee 1/6/05 
            78% to CYP1A4 chicken, 72% to CYP1A5 chicken, 59% to CYP1A zebrafish

CYP1A5      Gallus gallus (chicken)
            GenEMBL X99454(1845bp)
            Gilday,D.J., Gannon,M., Yutzey,K., Bader,D. and Rifkind,A.
            Molecular cloning and expression of two novel avian cytochrome P450
            1A enzymes induced by 2,3,7,8-tetrachlorodibenzo-p-dioxin.
            J. Biol. Chem. 271, 33054-33059 (1996)

CYP1A5      Meleagris gallopavo (turkey) 
            No accession number
            Roger Coulombe, Jr.
            Submitted to nomenclature committee May 5, 2004
            95% to chicken 1A5

CYP1A5      Phalacrocorax carbo (Common Cormorant)
            No accession number
            Iwata Hisato
            submitted to nomenclature committee 1/6/05 
            78% to CYP1A5 chicken, 69% to CYP1A4 chicken, 58% to CYP1A zebrafish

CYP1A5      Corvus macrorhynchos (Jungle crow)
            No accession number
            Hisato Iwata
            submitted to nomenclature committee 4/15/05 
            75% to 1A5 chicken 67% to 1A4 chicken

CYP1A6      Xenopus laevis (African clawed frog)
            GenEMBL AB022087
            Fujita,Y. and Ohi,H.
            Xenopus laevis mRNA for cytochrome P450, cDNA clone MC1
            unpublished(1999) In press
            clone MC1

CYP1A7      Xenopus laevis (African clawed frog)
            GenEMBL AB022088
            Fujita,Y. and Ohi,H.
            Xenopus laevis mRNA for cytochrome P450, cDNA clone MC2
            unpublished(1999) In press
            clone MC2

CYP1A8PX     human
            NT_008580.9 
            Pseudogene 43% identical to 1A2 human
            Renamed CYP1D1P, orthologous to fish 1D1
NT_008580.9|Hs9_8737 chromosome 9 
4822084 MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPY 4822260
4822261 LTLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLHKDGEHFAGRPNMHTFSFLAEGKS 4822440
4822441 LSFSVNYGESWKLHKKIASKAL*TFSNAEAKSSTCSCSLEEHVTEEISELVTVFVELTSK 4822620
4822621 NGSFDPRNAITCVVANIVCALCFGKR*DHSDEEFLRIVKTNDDLLKASSAANPADFIPCL 4822800
4822801 HYLPLKIINAPLEFYQALNGFIALHVQDHLATYGK 4822905 (0)
4824790 DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVSDLFGA 4824912 (1)
4829424 GFETVSTCLCWSFLYLIHYPEIQARIQEEI 4829513 (1)
4829611 RPPRFEDRKILPYTEAFVSEVFRHASFLPFTIPHS 4829715 (2)
4832677 TTADTTLNGYFIPRKTCTFINMYQVNHDE 4832763 (2)
4835676 TIWDNHSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITTVLQQFK 4835858
4835859 LKK*PRAKLDLTPTYGLVMRPKLYQLQAELHPSGSSSA* 4835975
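
Note: many sequence blocks in this list carry genomic coordinates before and
after the residues and, on some lines, a trailing digit in parentheses.  The
sketch below shows one way such lines might be split into their parts.  It is
a hedged illustration only: the names SeqLine, parse_seq_line and strand are
hypothetical, and reading the parenthesized digit as an intron-phase marker is
an assumption, since the list itself does not define it.  Lines whose first
coordinate is larger than the second (as in some Fugu entries) are treated
here as reverse-strand hits, which is also an assumption.

```python
import re
from typing import NamedTuple, Optional


class SeqLine(NamedTuple):
    start: Optional[int]   # leading coordinate, if present
    residues: str          # one-letter amino acids; '*' marks a stop
    end: Optional[int]     # trailing coordinate, if present
    marker: Optional[int]  # digit in "(n)", assumed to be an intron phase


# Optional number, uppercase residues (or '*'), optional number, optional "(n)".
_LINE = re.compile(r"^\s*(\d+)?\s*([A-Z*]+)\s*(\d+)?\s*(?:\((\d)\))?\s*$")


def parse_seq_line(line: str) -> Optional[SeqLine]:
    """Parse one sequence line; return None for anything else."""
    m = _LINE.match(line)
    if m is None:
        return None
    start, residues, end, marker = m.groups()
    return SeqLine(
        int(start) if start else None,
        residues,
        int(end) if end else None,
        int(marker) if marker else None,
    )


def strand(rec: SeqLine) -> Optional[str]:
    """Guess orientation from the coordinates; None if either is missing."""
    if rec.start is None or rec.end is None:
        return None
    return "+" if rec.start <= rec.end else "-"


rec = parse_seq_line(
    "4824790 DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVSDLFGA 4824912 (1)"
)
print(rec.start, rec.end, len(rec.residues), rec.marker, strand(rec))
```

Joining the residues fields of consecutive parsed lines gives the plain
peptide without coordinates or markers.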

CYP1A8PX ortholog  Bos taurus (cow)
            Renamed CYP1D1P orthologous to fish 1D1
            See cattle page for details
MIFGMAVTSGEVTTSRIILVMVFVFVRELGNKGRKEVFPPGPWSLPIVENLLQLG
DHLYFTFMEMRKKYGDVFLIKLGMVPVLVVNGMEMVKEVLLRNGEHFAA*PNV
LTFSFLAQ*KSLTFS
NYGENWTLHKKIASNALRTFPKAETKSSTRSCLLEKHVIEEVSELVKV
FTELTSRSGSFEPRGAITCAMANVV
CTLCFGKRYDHSDEEFLRIVKTDHDLLKASSAANPADFIPYF*YLPLRIINAPQEFYHARNQ
FIALHIRDHLTT
CPQDHIQDITDALINACHNKYAVAKITILNDDEIISTVSDLVGAG
FEIISTCIYWSFLYLIYYPEIQVKIQEEI
DGNTGMKSPRFENRKILP
YTEAFINEIFRHTSFLPFTIPHC (2)
TTADTTLNGYFIPRKTCTFINMYQVNHDE (2)
TIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMGIQKCL
REEVAQNEVFVFITTVLQQLTLKKCPVVKLDLTPTYGLVMKPKPYQLPAEPRSMGSSCS*

CYP1A8PX ortholog  Xenopus tropicalis (frog)
           This is not a pseudogene in frogs.
           It needs a new subfamily name, since it is 
           separate from the CYP1A subfamily.
           See Xenopus page for seq
           Renamed CYP1D1

CYP1A9     Anguilla anguilla (European eel)
           GenEMBL AF420258
           Mahata,S.C., Mitsuo,R., Aoki,J.-y., Kato,H. and Itakura,T.
           Two forms of cytochrome P450 cDNA from 3-methylcholanthrene-treated
           European eel Anguilla anguilla
           Fish. Sci. 69 (3), 615-624 (2003)
           98% identical to CYP1A9 from Japanese eel (clear ortholog)
           note: Eels have two CYP1A sequences.  CYP1A is 80% identical to
           Salmo salar CYP1A.  This seq is 77% to the same Salmo CYP1A.
           Therefore, this is a recent duplication in eels that is diverging
           away from the parent sequence called CYP1A (no number after A).
           Called CYPEuMC2 
           

CYP1A9     Anguilla japonica (Japanese eel)
           GenEMBL AB020414
           Mitsuo,R., Itakura,T. and Sato,M.
           Cloning and Sequencing of Cytochrome P450 1A Complementary DNA in
           Eel (Anguilla japonica)
           Mar. Biotechnol. 1 (4), 353-358 (1999)           
           98% identical to CYP1A9 from European eel (clear ortholog)
           note: Eels have two CYP1A sequences.  CYP1A is 81% identical to
           Salmo salar CYP1A.  This seq is 78% to the same Salmo CYP1A.
           Therefore, this is a recent duplication in eels that is diverging
           away from the parent sequence called CYP1A (no number after A).
           Called CYPJaMC2

1B Subfamily


CYP1B1      human
            GenEMBL  U03688 (5102bp)
            Sutter,T.R., Tang,Y.M., Hayes,C.L., Wo,Y.-Y.P., Jabs,E.W.,
            Li,X., Yin,H., Cody,C.W. and Greenlee,W.F.
            Complete cDNA sequence of a human dioxin-inducible mRNA
            identifies a new gene subfamily of cytochrome P450 that maps to
            chromosome 2.
            J. Biol. Chem. 269, 13092-13099 (1994)

*** Note: The CYP1B1 gene has been linked to primary congenital glaucoma. ***
See the April 1997 issue of Human Molecular Genetics.

CYP1B1      human
            GenEMBL U56438 (12177bp)
            Tang,Y.M., Wo,Y.-Y.P., Stewart,J., Hawkins,A.L., Griffin,C.A.,
            Sutter,T.R. and Greenlee,W.F.
            Isolation and characterization of the human cytochrome P450 CYP1B1
            gene.
            J. Biol. Chem. 271, 28324-28330 (1996)

CYP1B1      Bos taurus (cow)
            See cattle page for details
MATGLSPDDHLSPTLLSVQQTMLLLLLSVLAAVHVGQWLLRQRRRQPGSAPPGPFAWPLI
GNAASMGSAPHLLFARLARRYGDVFQIHLGSCRVVVLNGERAIRQALVHQSAAFADRPPF
ASFRLVSGGRSLAFGQYSESWKAQRRAAHSTMRAFSTRQPRGRRVLEGHVVGEVRELVEL
LVRRSAGGAFLDPRPLTLVAVANVMSALCFGCRYSHDDAEFLELLSHNEEFGRTVGAGSL
VDVLPWLQRFPNPVRTAFREFEQLNRNFSNFVLDKFLRHRESLRPGAAPRDMMDAFIHSA
GADSGDGGPRLDVDYVPATVTDIFGASQDTLSTALQWLLVLFTR (2)
YSEVQARVQAELDQVVGRHRLPTLEDQPRLPYVMAFLYEAMRFSSFVPVTIPHATTANAS
VLGYHIPKDTVVFVNQWSVNHDPVKWSNPEDFDPTRFLDKDGLINKDLTGSVMVFSVGKR
RCIGEEISKMQLFLFISILAHQCNFKANPDEPSKMDFNYGLTIKPKSFKINVTLRESMEL
LDSAVQKLQVEKECQ*

CYP1B1       rat
            GenEMBL X83867 (2321bp)
            Bhattacharyya,K.K., Brake,P.B., Eltom,S.E., Otto,S.A. and Jefcoate,C.R.
            Identification of a rat adrenal cytochrome P450 active in polycyclic hydrocarbon 
            metabolism as a rat CYP1B1.  Demonstration of a unique tissue-specific pattern of 
            hormonal and aryl hydrocarbon receptor-linked regulation.
            J. Biol. Chem. 270, 11595-11602 (1995)

CYP1B1      rat
            GenEMBL U09540(4964bp)
            Nigel Walker
            Walker,N.J., Gastel,J.A., Costa,L.T., Clark,G.C., Lucier,G.W. and
            Sutter,T.R.
            Rat CYP1B1: an adrenal cytochrome P450 that exhibits sex-dependent
            expression in livers and kidneys of TCDD-treated animals.
            Carcinogenesis 16 (6), 1319-1327 (1995)

Cyp1b1     mouse
            GenEMBL U02479 (317bp)
            Shen,Z., Wells,R., Liu,J. and Elkind,M.M.
            Identification of a cytochrome P450 gene by reverse transcription-
            PCR using degenerate primers containing inosine.
            Proc. Natl. Acad. Sci. USA 90, 11483-11487 (1993)
            Note: only 104 amino acids by PCR. 

Cyp1b1     mouse
            GenEMBL U03283 (5128bp)
            Shen,Z., Liu,J., Wells,R.L. and Elkind,M.M.
            cDNA cloning, sequence analysis, and induction by aryl hydrocarbons
            of a murine cytochrome P450 gene, Cyp1b1.
            DNA Cell Biol. 13, 763-769 (1994)

Cyp1b1     mouse
           GenEMBL X78445 (2006bp)
           Savas,U., Bhattacharyya,K.K., Christou,M., Alexander,D.L. and 
           Jefcoate,C.R.
           Mouse cytochrome P450EF, representative of a new 1B subfamily of 
           cytochrome P450s. Cloning, sequence determination, and tissue
           expression.
           J. Biol. Chem. 269, 14905-14911 (1994)

CYP1B1     Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP1B1X    Fundulus heteroclitus (killifish)
           GenEMBL AF235140
           Celine Godard, Maya Said and John Stegeman
           Submitted to nomenclature committee Feb. 16, 2000
           This seq is a CYP1C2 sequence not CYP1B1

CYP1B1     Platichthys flesus (European flounder)
           GenEMBL AY304550 
           68% to 1B1 fugu
IKTIFXNFKKLNLEFGEFIRDKVIEHRKTIQSSTTRDMTDALIM
ALDKLGDKTELTGGKDYVSPTMGDIFGASQDTLSTALQWIVLILVKYPEMQLRVQQEV
DKVVERTRLPSIEDQLQL

CYP1B1     Danio rerio (zebrafish)
           no accession number
           66% to 1B1 fugu ctg26141 Length = 651601 
           4 exons
           EST BQ419016
494367 MMDVLLALRDLLQLSTRSVLLSLMVCLMLMFRRRQLVPGPFSWPVIGNAAQLGNTP 494534
494535 HFYLSRMAQKYGDVFQIKLGSRNVVVLNGDAIKEALVKKATDFAGRPDFASFRFVSNGKS 494714
494715 MAFGNYTPWWKLHRKVAQSTVRNFSTANIQTKQTFEKHIVSEIGELIRLFLNKSREQQFF 494894
494895 QPHRYLVVSVANTMSAVCFGNRYAYDDAEFQQVVGRNDQFTKTVGAGSMVDVMPWMQYFP 495074
495075 NPIRTLFDQFKELNKEFCAFIELKVSEHRKTISPSHVRDMTDAFIVALDKGLSGGSGVSL 495254
495255 DKEFVPPTISDIF 495293
495379 GASQDTLSTALQWIILLLVR  495438
497442 YPEIQKRLQEDVDRVVDRSRLPTIADQPHLPYLMAFIYEVMRFTSFTPLTIPHS 497603
497604 TTKDTSINGYPIPKDTVIFVNQWSLNHDPTKWDQPEVFNPQRFLDEDGSLNKDLTTNVLI 497783
497784 FSLGKRRCIGEDVSKIQLFLFTSVLVHQCSFKAESTPNMDYEYGLTLKPKPFKVSVTARD 497963
497964 SSDLLDSLVGTSQTPTEKR 498020

CYP1B1     Danio rerio (zebrafish)
           GenEMBL AF235139 
           Celine Godard, Maya Said and John Stegeman
           Submitted to nomenclature committee Feb. 16, 2000
SQDTLSTALQWIILLLVRYPEIQKRLQEDVDRVVDRSRLPTIAD
QPHLPYLMAFIYEAMRFTSFTPLTIPHSTTKDTSINGYPIPKDTVIFVNQWSLNHDPT
KWDQPEVF

CYP1B1P    Danio rerio (zebrafish)
           No accession number (from trace index)
           gnl|ti|30343474 zfishB-a1803b07.p1c Length = 630
           probable 1B1 pseudogene zebrafish
IADQPHLPYMMAFIYEVMRFTSFTP
TTNVLIFSLGKRRCIGEDVSKIQLFLFTSVMVHQ*RIKAESTPNMGYVXXXXX
LKPKPFKVSVTARDSSDQLISLAGTSQTPTEK

CYP1B1     Cyprinus carpio (common carp) 
           GenEMBL AB048942
           73% to 1B1 fugu
LSTALQWIILLLVRYPEVQKRLQEDVDKVADRSRLPTIADQPHL
PYVMAFIYEVMRFTSFVPVTIPYSTTTDTSINGYPIPKDTVIFV

CYP1B1     Cyprinus carpio (common carp)
           No accession number
           Itakura, T. and El-kady M.A.H.
           Submitted to nomenclature committee 10/17/2003
           Full length sequence 1 aa diff to fragment on AB048942
           91% to CYP1B2 carp  64% to 1B1 fugu 53% to 1C1 fugu
           clone name carp1B1a

CYP1B1     Stenella coeruleoalba (striped dolphin)
           no accession number
           Celine Godard, Maya Said and John Stegeman
           submitted to nomenclature committee Nov. 20, 1998
           PCR fragment 90% identical to human 1B1 I-helix to PERF motif region

CYP1B1    Pusa sibirica (Baikal seal)
          No accession number
          Iwata Hisato
          submitted to nomenclature committee 1/6/05 
          84% to 1B1 human
     
CYP1B1     Pleuronectes platessa (plaice)
           GenEMBL AJ249074
           Michael Leaver
           submitted to Nomenclature Committee 3/11/99
           full length seq.
MFLQDPPAMDVTLEGIDPVTLRAVLLACVTLLFSLHLWRWLGGQ
PSVPGPPGPLAWPLIGNAAEMGKLPHLYLTRMAHKYGNVFQIKLGSRTVVVLNGDSIK
QALVKQGTDFAGRPDFASFKYIFDGDSLAFGPFTDWWKVHRRVAQSTVRTFSTGNADT
KKTFEHHVLCEFRELLQLFVGKTEQQRFFQPMTYLVVSTANIMSAVCFGKRYAYEDEE
FLQVVGRNDQFTQTVGAGSIVDVMPWLQYFPNPIRTIFDNFKKLNLEFGQFIRDKVIE
HRKTIQSSTTRDMTDALIVALDKLGDKSELTGGKDYVSPTMGDIFGASQDTLSTALQW
IVLILVKYPEMQLRIQQEVDKVVDRTRLPSIEDQLQLPYIMAFVYEVMRFTSFVPLTI
PHSTVTDTSIMGYTIPKNTVIFINQWSINHDPALWSHPETFDPQRFLDQNGALNKDLT
SSVLIFSLGKRRCIGEELSKMQLFLFTALIAHQCHISPDPARPPKLDYTYGLTLKPCA
FSIAVALRGHDMSLLDEATRSSAEEVKGEPSSDSQTKN

CYP1B1     Fugu rubripes (Takifugu rubripes) Japanese pufferfish
           Scaffold_1553 complete gene Scaffold_11030 Scaffold_10662   
           54% to 1B1 human 51% to 1B1 mouse
           AL024920.1 AL015454.1 cosmid 077P23 
           80% to CYP1B from Pleuronectes platessa
           FC:C013F14aE4 LGU7740.y1 FC:C077P23aC12 
           AL015446.1 077P23 FC:C077P23aD8
2460 MKVIQEEVSPEAGALLLACATLLVSLQLWRWRRRRPGGCPPGPRAWPIIGNAAQLGHAPHL 2278
2277 YFTRMAQRFGNVFQIKLGSRTVVVLNGDAIKQALVRKGLEFAGRPDFTSFKYISNGHSL 2101
2100 AFGTVTDWWKSHRRVAQSTVRMFSTGNLQTKKTFERHLTCEVRELLHLFLGKTKELQYFQ 1921
1920 PMNYLVVSTANVISAVCFGKRYSYEDEEFQQVVGRNDQFTRTVGAGSIVDVMPWL 1756
1755 QYFPNPVKSIFDNFKRLNKEFSDFIRDKVTEHRKSIRPSSVRDMTDAFIVSLDKLSE 1585
1584 KTGVPLWKDYVIPTVGDVFGASQDTLSTALQWIFLVLVR 1468 (2)
 294 YPDMQQRLQEEVDLVVGRQRLPCIEDQQQLPWVMAFIYEVMRFTSFVPLTIPHSTTTDTT 115
 114 IMGYTIPKNTIIFINQWSINHDPTIWSHPET 13
     FDPNRFLNPSGSLNKDLTSRMLIFSMGKRRCIGEELSKLHLFLFTALIGHQCHITDDPA
     KPTTMDYNYGLTLKPRGFYVALTLRGDMRLLDEAASRPPAEEPGRGPLADP*

CYP1B1     Tetraodon nigroviridis (freshwater pufferfish)
           No accession number
           80% to CYP1B1 fugu missing first 50 aa and last 18 aa
           FS_CONTIG_703_2 Length = 26665
  69 NAAQLGKAPHLYFASRAERYGNVFQIRLGARSVVVLNGDAIRQALVKQGPEFAGRPDFAS 248
 249 FGFISDGRSMAFGTATDWWKVHRRVAHSTVRMFSSGNAQTKKAFERHITSEVRELLRLFLRST 437
 439 RAQRFFQPLAPLVVSTANVMSAVCFGKRYSYEDEEFQQVVGRNDQFTRTVGAGSVVDVMP 618
 619 WLQYFPNPVKTIFDDFKRLNREFNSFIRDKVSEQ 720
 722 RKTIQSSSVRDMTDALIASLDRLSAKTGVP 811
 812 LWKEYVTPTVGDVFGASQDTLSTALQWIFLVLV 910
1486 RYPDVQQRLQKEVDQVVGRQRLPCLEDQQQLPWVMAFIYEVMRFTSFMPLTIPHSTTTDT 1665
1666 TIGGYSIPRNTVVFINQWSVNHDPAIWPQPETFDPDRFLNPNGSLNKDLTSSVLIFSLGK 1845
1846 RRCIGEELAKLHLFLFTALMGHQCRLASDPARPPSLDWNYGLTLKPHAFHIAVSLRGDMRLLDQ 2037

CYP1B1     Anguilla japonica (Japanese eel)
           GenEMBL AB048940 
           73% to 1B1 fugu
LSTALQWIILVLVRFPDIQKQLREEVDKVVDSSRLPSIEDQPRL
PYVMAFLYEVMRFTSFIPVTIPHSTTTDTAIQGYRIPKDTVVFI

CYP1B1     Oreochromis niloticus (Nile tilapia)
           GenEMBL AB048944 
           80% to 1B1 fugu
LSTALQWIILILVKYPEIQVRLQQEVDKVVDRSRVPAIEDQQQL
PYVMAFIYEVMRFTSFLPLTIPHSTTTDTSIMGYTVPKNTVIFI

CYP1B1     Callorhinchus milii (elephant shark, Chondrichthyes)
           Trace files 1573810313  1573059473   
           57% to 1B1 zebrafish only 49% to 1C
MNAVRVLAGQFTQSMQPVLAVALVVLTLLQVCKWMQQPSEQCRRRPPGPFPWPII
GNATQIGKVPHISFSRMARRYGNVFQIKLGSRSVVVLNGEECIREALVRKAEQFSGRPDF
ASFNEVSGGRSLAFRSYCDRWKFHRRIAHSTVRAFSTNNPDTKKTFQRHVVGEVQQLSSR
RQ

CYP1B1     Petromyzon marinus  (sea lamprey)
           Trace files 1172235440, 1468167059, 1466822831, 1172788718, 
           1373603965, 1464676455
           54% to 1B1 zebrafish, 48% to 1C2, 53% to CYP1B3 Petromyzon marinus
SSNVVEFALLVALEARRWLLLRRARSSRGPPGPFPWPILGNALQLGSAPHLAMCRMARRY
GDVFMMKLGGRPVLVLNGATAIRQALVKQGAD
FAGRPAFPSFSVVSDGNSMAFGGYSSLWKMHRCVAQST
LRHFSSSGNAEARADLERYV
VSEAGALVGIMLERSDGGRYFNPSRLFILAIANVMSALCFGRRYDYDNSEFREIV
SRNDKFGRTVGAGSLVDVMPWLLYFPNPVRTAYRDFVALNMEFNAFTRRKVEQHRADFKA
GGVPRDITDSLIAAVEVERPRSRSGEALSGRHVSGAVNDIFGASQDTLSTALMWLLMFLV
RFPRAQRRVQEEVD RVAGRHRLPCLEDRASLPYTEAFVFETLRYSSFVPV
TIPHSTTTDTVIAGYCVPKDTVVFVNQWSSNHDPERWRDPETFEPTRFL
DESGTRVDKDLASNVLIFSVGKRRCIGDDISKMQLLLFAAILAHQCSFEADPAQTMT
IDKSYGLTLKPMPFEVRARVRDHVLAECFADARRQL*

CYP1B3v1   Petromyzon marinus  (sea lamprey)
           Trace 1373790297 first exon 49% to 1B1 fugu, 50% to 1C1 zebrafish
           1437356431 mate pair = 1438643165 = C-term of 1223244203 seq
           1290968067  52% to Stenotomus chrysops P450 1C1
           combined frags 49% to 1B1 zebrafish
           45% to 1C2 zebrafish, 39% to 1A1 zebrafish
           1223244203, 1473037756, 1427240599, 1446950979  51% to 1B1
           1438643165 = extreme C-term = mate pair of 1437356431
           whole seq 51% to 1B1 human, 50% to 1B1 fugu, 49% to 1B1 zebrafish
MQSTLAILAVNPSRTPTSTASFTSTSTQLSIPSSHLPPPPPPPSIQPSSPAC
TLSQLPAHSPSAAASSPAVAAAPLHSLRTLPGPTPWPFVGNSLQLGPMPHLTFQRMASTY
GPLFRIRLGSRDVVVLNGDSLVREALVCRGSEFAGRPAFRSFSMVSGGHSV
AFGGYCELWRLHRRLAQSTLRAFSTGGTDARR  ALDGHVMMEADELLRVMMA
SCRRSTAGSVDPAQALVVAVANVRSALCFRRRYWHED
AESSSSDRNERSGAAVGAGSVVDVMPW
LLRFPNPVRAAFDDIRRANEDLSEFVRDKVRQRRGAAAVVGPGTRSVRDMM
DALIAHVDGGAVAGGGAAEAAAGDGEGGEAAGGGRGGGGPRLGASHVEATLCDVFGASQD
TLSTGLLWLILLAVRHPEEQARVQGEVDRVVGRTRLPSAADRARMPRTEAFVCEVLRYSS
FVPVTIPHATTRDTRLAGYSIPRDTVVFVNQWSVNHDPGVFEEPHAFRPARF
LDAEGTALDRALARRVMIFSAGRRRCIGEELSRLELFLFTAVMLHQV
DFVAPPGHGPPGTEAVCGGLTLKPKPFSVALVPRGDPLGPGCAPQP*

CYP1B3v2   Petromyzon marinus  (sea lamprey)
           Trace files 1468808835, 1424613767, 1489836465
           allele of 1223244203?  4 aa diffs and one indel of 1aa
PVRAAFDDFRRANEDL
SEFVRDKVRQRRGAAAVVGPGTRSVRDMMDALISHVDGGAVAGGAAEAAAGDGEGGEAAGGERGGGGP
RLGASHVEATLCDVFGASQDTLSTGLLWLILLAVRHPEEQARVQGEVDRVVGRTRLPSAA
DRARMPRTEAFVCEVLRYSSFVPVTIPHATTRDTRLAGYSIPRDTVVFVNQWSVNHDPGV
FEEPHAFRPARFLDAEGTALDRALARRVMIFSAARFRCIGEELSRLELFL

CYP1B2X    Stenotomus chrysops (scup, a fish)
           no accession number
           Celine Godard, Maya Said, and John Stegeman.
           submitted to nomenclature committee full length 4/21/99
           81% identical to scup 1B3 
           renamed CYP1C1 

CYP1B3X    Stenotomus chrysops (scup, a fish)
           no accession number
           Celine Godard, Maya Said, and John Stegeman.
           submitted to nomenclature committee Aug. 26, 1998 full length 4/21/99
           63% identical to human 1B1 over C-terminal PCR fragment 
           I-helix to heme
           formerly 1B1, reassigned to CYP1C2

Note: the CYP1B2 and 1B3 names from scup were never published.
It now appears that some fish like carp do have two CYP1B sequences, so the
CYP1B2 name is going to be used to indicate this fact. 10/20/2003

CYP1B2     Cyprinus carpio (common carp)
           No accession number
           Itakura, T. and El-kady M.A.H.
           Submitted to nomenclature committee 10/17/2003
           Full length sequence 3 aa diffs to fragment on AB048942
           91% to CYP1B1 carp  64% to 1B1 fugu 53% to 1C1 fugu
           clone name carp1B1b

CYP1C1     Gallus gallus (chicken)
           XM_001233594.1
           55% to CYP1C2 Fugu, 54% to 1C1 zebrafish
MSAMGTPNGAAMAPVLSPHSALLLIAVVLTAI
LLLARTRHKATRGQSPPGPFASPLVGNVLQMGRLPHLTFMRMACRYGAIFQLRLGRHRVV
VLNGEAAIRRALVGLGTRFAGRPDFPSFGLVSGGRSIAFGGCTPQWRARRRLAHAALRAH
STVAEVERHVVAEAGDLVRLFLRHSQGGAYFQPCPLLVVANANVLCALCFGRRYDHADGE
FTALLGRNDRFGQTVGAGSLVDVLPWLLRFPNPVRHVYRDFQALNRELHGFVQAKVAQHR
QTFDWRAVRDISDVMIASVERGGGSPDGLGPEDVEGAMTDIFGAGQDTTSTALSWIILLL
LKHPQVQQDLQAELDRVVGRSRLPTAEDRPHLPLLEAFIYETLRYSSFVPITIPHATTAD
VELEGFRIPKGTVVFVNQWSVNHDCSKWPEPQRFDPTRFLDKQQRLDRERAGSVMIFSAG
QRRCIGDQLSKLQIFLFTAILLHQCSFHANPAEHLTMDCIHGLALKPLPFTVNVRPRIPL
LIQP*

CYP1C1     Stenotomus chrysops (scup, a fish)
           no accession number
           Celine Godard, Maya Said, and John Stegeman.
           submitted to nomenclature committee full length 4/21/99
           81% identical to scup 1C2 
           formerly 1B2, reassigned after consultation with the submitters
           and comparison to the Fugu genomic orthologs (see below)

CYP1C1     Danio rerio (zebrafish)
           GenEMBL CAAK02055884.1 6714 bp gene seq (revised seq shown below)
           contig NA9599  Length = 11279
           78% to 1C1 73% to 1C2 fugu 53% to 1B1
           Note: CYP1C probably arose by a retrotransposition of a 1B1 cDNA,
           since 1C has no introns and it is more similar to 1B1 than 1A.
     MEAEFGLKSSSIMREWSGQVQPALIASFI
3411 ILFFLEACLWVRNLTFKKRLPGPFAWPLVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGC 3232
3231 SDIVVLNGDAAIRKALVQHSTEFAGRPNFVSFQMISGGRSLTFTNYSKQWKTHRKVAQST 3052
3051 LRAFSMANSQTRKTFEQHVVGEAMDLVQKFLRLSADGRHFNPAHEATVAAANVICALCF 2872
2871 GKRYGHDDPEFRTLLGRVNKFGETVGAGSLVDVMPWLQS 2755 
2753 FPNPVRSVYQNFKTINKGVFNYVKDKVLQHRDTYDRDVTRDMSDAIIGVIEHGKEST 2583
2582 LTKDFVESTVTDLIGAGQDTVSTAMQWMLLLLVKYPSIQSKLQEQIDKVVGRDRLPSIE 2406
2405 DRCNLAYLDAFIYETMRFTSFVP 2337
2337 VTIPHSTTSDVTIEGLHIPKDTVVFINQWSVNHDPQKWSDPHIFNPSRFLDENGALN 2167
2166 KDLTSSVMIFSTGKRRCIGEQIAKVEVFLFSAILLHQCKFERDPSQDLSMDCSYGLALKP 1987
1986 LHYTISAKLRGKLFGLVSPA* 1924

CYP1C1     Fugu rubripes 
           No accession number
           Scaffold_3008b comp(8676-10253) no introns complete gene
           86% to scup 1C1 75% to scup 1C2
10253 MALDTEFGVKSSSITREWSGQVQPALVASFLFLFCLEACLWVRNLRHKRRL
10100 PGPFAWPVVGNAMQLGQMPHITFAKLAKKYGNVYQIRLGCSNI 9972
9971  VVLNGDQAIHQALIEHSTEFAGRPNFVSFQMISGGRSLTFTNYSKQWKVHRKLAQSSLRA 9792
9791  FSSANKQTKIAFEQHVTAEANELVQAFLRYSTDGRYFDPAHEFTVAAANVMCALCFGKRY 9612
9611  GHDDHEFRCLLKKLNKFGETVGAGSLVDVMPWLQSFPNPVRSLYENFKSLNEEFFNFV 9438
9437  KNKVQEHRESFDPNVTRDMSDAMINVIEERKDGTLSKEFAEATITDLIGAGQDTVS 9270
9269  TVLQWIVLLLVKHPDKQAKLHELMDKVVGQDRLPTTEDRSSLAYLDAFIYETMRFTSFVP 9090
9089  VTIPHSTTSDVTIEGLRIPKDTVVFINQWSVNHDPLKWKDPHVFDPSRFLNENGDLNKDL 8910
8909  TSGVMIFSSGKRRCIGSQIAKVEVFLFAAILLHQCSFESDPSDPLTLDCSYGLTLKP 8739
      LRCFVSAKPRGKLLGLVSPA* 8676

CYP1C1     Tetraodon nigroviridis (freshwater pufferfish)
           No accession number
           FS_CONTIG_2073_3 Length = 9880
           87% to 1C1 70% to 1C2
5630 MALDTEFSVKSSGITREWSGQIQPALVASFLFLFCLEACLWVRNLRQKRRLPGPFAWPV 5806
5807 VGNAMQLGQMPHITFAKLAKKYGNVYQIRLGCSNIVVLNGDQAIXX 5938
5943 QALIQHSTEFAGRPNFVSFQMISGGRSLTFTSYSKQWKAHRKVAQSSLRAFSSANNQTKK 6122
6123 AFEQHVTAEANKLVQTFLHYSTDGKYFDPAHDFTIAAANVMCALCFGKRYGHDDQGVQVP 6302
6303 VNEVGQVWPRTVGAGSLVDVMPWLQSFPNPVRSVYENFKSLNEEFFSFVKNKVSEHRESF 6482
6483 DPNVTRDMSDAMINVIEGRKDSTLTKEFVEATVTDLIGAGQDTISTVMQWIILLLV 6650
6651 KYPDMQAKLHELVDKVVGQDRLPTVEDRSSLAYLDAFIYETMRFTSFVPVTIPHSTTSDV 6830
6831 TIEGLHIPKKDTVVFINQWSVNHDPLKWEG
6919 PHVLGPSRFLDDNGDLKKDLNKGVMIFSSGKRRCIGNQIAK 7041
7053 FLFTAILLHQCSFESNPSDPVTLDCSYGLTLKPLRCFVNAKPRGKLLGVVSPA 7211

CYP1C1     Anguilla japonica (Japanese eel)
           No accession number
           Itakura, T. and El-kady M.A.H.
           Submitted to nomenclature committee 10/17/2003
           Full length sequence 100% match to frag on AB048941
           80% to 1C1 fugu 76% to 1C2 fugu  52% to 1B1 fugu
           clone name Japanese eel 1C

CYP1C1     Anguilla japonica (Japanese eel)
           GenEMBL AB048941 
           81% to 1C1 78% to 1C2 fugu
VSTLLQWILLLLVKYPHIQAKLQEQIDKVVGRDRLPCMEDKSSL
AYLDAFVYETMRFTSFVPVTIPHSTTSDVTIEGVHIPRDTVVFI

CYP1C1     Cyprinus carpio (common carp)
           No accession number
           Itakura, T. and El-kady M.A.H.
           Submitted to nomenclature committee 10/17/2003
           Full length sequence 2 aa diffs to frag on AB048943
           77% to 1C1 fugu 73% to 1C2 fugu 50% to 1B1 fugu
           clone name carp1C1a

CYP1C1     Cyprinus carpio (common carp) 
           GenEMBL AB048943
           80% to 1C1 and 1C2 fugu
VSTVMQWILLLLVKYPSIQTKLQEQIDKVVGRGRLPSIEDKSNL
AYLDAFIYETMRYTSFVPVTIPHSTTSDVTIEGLHIPKDTVVFI

CYP1C1     Callorhinchus milii (elephant shark, Chondrichthyes)
           Trace file 1576746999   
           57% to 1C2 tetraodon, 53% to 1B1 Pleuronectes, 49% to 1B1 fugu
           This genomic fragment spans the location of 1B1's only intron
           without containing an intron; therefore this is probably 1C,
           an intronless gene
LVTVRTLYRDFKRLNQEFFGFVSGKVGQRRRTFVPGRTRDMSDAFIAVVDGAAAAGHGLS
GEHVEGTVNDVMGAGQDTTSTALGWVLFHLIRHPDVQARLQEEMDRAVGRGRLPGTGDRG
RLPYLQAFIHEVCRFTSFVPLTIPHATTSRVTLHGYDLPEDTVVFVNQWSVNHDGAKWKE
PETFEPGRFLDPDGSVNRALADSVMIFSAGKRRCLGDQLAKTQMFLFTAILIHQCAFEAN
PGDVLSLDCLYGLSLKPLPFKLRVRLRDTYRGVGRQREPPPPPTHTHTQKHSTGQGHTHR
DPSPTHTQRERDSQQDRDPTHHTPHRPLSTPVINVRN

CYP1C1     Petromyzon marinus  (sea lamprey) 
           Trace files 1434207733, 1193330571,
           1179606703, 1483258470, 1194048496, 1482130588, 1161783303, 1206198102
           1193734487, 1468865778, 1293288933, 1162763713
           53% to 1C2 Fugu  48% to 1B1 fugu (no intron so probably 1C)
MTAAESMEALPVVAAGGGAQLWDISHPPV
LFFLLSALLILLVTLEARKHGRSHQQQQKHSAPDPPGPLGFPIVGNSLQLGPM
PHLTLNAMAQRYGAVFRIHLGHEPVVVLTGEEI
IHEALVKRGAEFAGRPDFPSFALVSGGNSMSFKTYSELWRVHRRLAHSTLRAF
FTGTAATRRVFEGHVRLEAAELCAMLAEATSRAGGCGVDPSEPTVVAVANVISAVCFGKR
YEHDDAEFRGLLRNNERFSKTVGAGSVVDVMPWLMRFPNPVRSIFRDFEQMNNEFFAFVQ
RKVREHRDSYDPAATPRDMIDALIGHIDGGGGDSDDEGDAADGPSWRWRCARGAPEVGAA
YVDSTLTDVFGAG
QDTMSTSLMWFVLLCAKHPELQADMQRDIDRVVGRERLPRLDDRPQLACVDAFVCEMMRH
VSYVPFTIPHATTTDTELNGYRVAKGTVVFVNQWSVNHDPAIWRDPERFDPSRFL
DETGAALDRDLARRVMIFSAGKRRCIGYEMAKMQLFLFCSALLH
QLSISVPPGHVVSLEGVYGLSLKPKYLSVAFTPREQLLGGRPGEAEE*

CYP1C fragment  Petromyzon marinus  (sea lamprey) 
           Trace file 1483490875 
           frame3_ORF1   86% to CYP1C1 Petromyzon
TRRLAH
CTLRALFTGMATTRRVFEGHVRLEAAELCAMLHEQQNRAGGRGIESIERTVVAVANVISA
VCFGKRYEHEDAEFRGLLRNNERFSKTLGAGSVLEVIPWIMRFPNPARSIIREFEQMNNE
FFALMQRKVREHRDSYDPAATPRDMIDALIGHIDGGGGDSDDEGDAADG
QSWRWRCARGAPEVG

CYP1C2     Stenotomus chrysops (scup, a fish)
           no accession number
           Celine Godard, Maya Said, and John Stegeman.
           submitted to nomenclature committee Aug. 26, 1998 full length 4/21/99
           63% identical to human 1B1 over C-terminal PCR fragment 
           I-helix to heme
           formerly 1B3, reassigned after consultation with the submitters
           and comparison to the Fugu genomic orthologs (see below)

CYP1C2     Danio rerio (zebrafish)
           no accession number
           contig NA2067  Length = 8014 EST CD758525 
           see zfish41356-444a08.p1c Zfish44625-3160d07.q1k
           73% to 1C1 fugu and 74% to 1C2 fugu
     MAQSDSEFSILKEWSGQIQPALIASFI
1098 ILCCLEACFWVRNITLKKKRLPGPFAWPLVGNAMQLGQMPHITFSKLAKKYGNVYQIRLG 1277
1278 SSDIVVLNGESAIRSALLQHSTEFAGRPNFVSFQYVSGGTSMTFASYSKQWKMHRKIAQS 1457
1458 TIRAFSSANSQTKKSFEKHIVAEAVDLVETFL 1553
     KIQHFNPSHELTVAAANIICALCFRKRYGHDDLX (from EST CD758525)
     (C-terminal inverted)
2818 IKNVLGNVNKFSETVGAGSLVDVMPWLQTFPNPIRSIFQSFKDLNSDFFSFVKGKVVEHRL 2636
2635 SYDPEVIRDMSDAFIGVMDHADEETGLTEAHTEGTVSDLIGAGLDTVSTALNWMLLL 2465
2464 LVKYPSIQSKLQEQIDKVVGRDRLPSIEDRCNLAYLDAFIYETMRFTSFVPVTIPHSTTS 2285
2284 DVTIEGLHIPKDTVVFINQWSVNHDPQKWSDPHIFNPSRFLDENGALDKDLTNSVMIFSI 2105
2104 GRRRCIGDQIAKVEVFLISAILIHQLTFESDPSQDLTLNCSYGLTLKPFDYKISAKPR 1931
1930 GSIVN* 1913
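
Note: the coordinate columns in entries like the one above can be sanity-checked, because each residue (and a terminal `*`) should account for three nucleotides of contig sequence, whichever strand the segment lies on. The sketch below is only an illustration. It assumes lines of the form `start SEQUENCE end`, and the function names are hypothetical and not part of the original list. A mismatch does not have to mean an error, since an exon boundary can split a codon.

```python
import re

# Assumed layout: "<start> <residues> <end>"; '*' is a stop, X an unknown residue.
# All names below are illustrative, not taken from the original listing.
LINE_RE = re.compile(r"^\s*(\d+)\s+([A-Za-z*]+)\s+(\d+)\s*$")

def parse_coord_line(line):
    """Return (start, residues, end), or None if the line has another layout."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    return int(m.group(1)), m.group(2), int(m.group(3))

def span_matches(line):
    """True when the nucleotide span equals 3 x the number of residues."""
    parsed = parse_coord_line(line)
    if parsed is None:
        return False
    start, residues, end = parsed
    # abs() covers inverted (minus strand) segments, where start > end.
    return abs(end - start) + 1 == 3 * len(residues)

def is_inverted(line):
    """True when the coordinates run downward, i.e. the segment is inverted."""
    parsed = parse_coord_line(line)
    return parsed is not None and parsed[0] > parsed[2]

# Three lines copied from the zebrafish CYP1C2 entry.
lines = [
    "1098 ILCCLEACFWVRNITLKKKRLPGPFAWPLVGNAMQLGQMPHITFSKLAKKYGNVYQIRLG 1277",
    "2464 LVKYPSIQSKLQEQIDKVVGRDRLPSIEDRCNLAYLDAFIYETMRFTSFVPVTIPHSTTS 2285",
    "1930 GSIVN* 1913",
]
for text in lines:
    print(span_matches(text), is_inverted(text))
```

Lines without both coordinates (for example the EST-derived line) are simply reported as not matching.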

CYP1C2     Fugu rubripes 
           No accession number
           Scaffold_3008a comp(5208-6770) no introns complete gene
           83% to scup 1C2 78% to scup 1C1
6770 MEEDFGVKGSSSITREWSGHVQPALVAFFVFLFCVEACLWAKNLKRRL
6626 PGPFAWPVVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGCSDI 6498
6497 VVLNGARVIRQALIEHSTEFAGRPNFVSFQNVSGGKSMAFTSYSKQWRMHRKIAQSTIRA 6318
6317 FSSANSQTKKVFEQQIVAEATELVEVFLKLGARGQHFNPAHELTVAAANVICALCFGRRY 6138
6137 GHDDQEFRDVLRRIDKFGQTVGAGSLVDVMPWLQSFPNPVRSMFRSFEALNREFFGF 5967
5966 VQLKVEQHRETFDPEVTRDMSDAIISVLEKSDGETALTKDYTEVTMADLIGAGLDTV 5796
5795 STALHWMLLLLVKHPELQSKLHQLIDRVVGRNRLPSIEDRSSLAYLDAFIYETMRFTSFV 5616
5615 PVTIPHSTTSDVTIEGLRIPKDTVVFINQWSVNQDPLMWKDPHVFDPSRFMDEEGSLDRD 5436
5435 LACNVMIFSAGKRRCIGDQIAKVEVFLFFAVLLHQCSFESSADEDLTLNCSYGLTLKPL 5259
5258 DFSITAKLRGKLLKSP* 5208 
           
CYP1C2    Tetraodon nigroviridis (freshwater pufferfish)
           No accession number
           84% to CYP1C2 fugu 73% to CYP1C1 fugu
           CNS_TRUECNSCONTIG_6508_2 Length = 4645
1369 MEEEFCVEGGSSSIREWSGHIQAALVAFFVFLFCLEARLWAKNL
1501 KRRLPGPFAWPVVGNAMQLGQMPHITFSKLAKKYGNVYQIRLGCSDIVVLNGDRVIRQAL 1680
1681 IQHSTEFAGRPNFVSFQTVSGGKGMTFSSYSKRWKMHRKIAQSTIRAFSSANSQTKENFE 1860
1861 QQIAAEATELVEVFLKLSARGQHFNPEHELTVAAANVICALCFGKRYGHDDAEFRELLHR 2040
2041 VNMFGQTVGAGSLVDVMPWLQSFPNPVRSMFKSFKTLNRQFFGFVQLKLKEHRETFDPKV 2220
2221 TRDMSDAIISVLDRSASEYGLTKDNAEGTVSDLIGAGLDTVSTALHWMLLLLVKHPQ 2391
2392 LQHKLQQLIDQVVGRNRLPSIGDRSSLAYLDAFIYETMRFTSFVPVTIPHSTTSDVTIEG 2571
2572 LRIPKDTVVFINQWSVNHDSLMWTDPHVFDPSRFLDEQGSLNRDLASNVMIFSAGKRRCI 2751
2752 GTQIAKAEIFLFLAILLHQCSFERSAGEEPSLDCSYGLTLKPLDYRITAKLRGKLLKSP 2928

CYP1C2     Fundulus heteroclitus (killifish)
           GenEMBL AF235140
           Celine Godard, Maya Said and John Stegeman
           Submitted to nomenclature committee Feb. 16, 2000
           Formerly named CYP1B1, but reassigned 10/21/2003

CYP1C2     Cyprinus carpio (common carp)
           No accession number
           Itakura, T. and El-kady M.A.H.
           Submitted to nomenclature committee 10/17/2003
           Full length sequence 5 aa diffs to frag on AB048943
           73% to 1C2 fugu 72% to 1C1 fugu 51% to 1B1 fugu
           clone name carp1C1b

CYP1D1P/CYP1A8PX     human
            NT_008580.9 
            Pseudogene 43% identical to 1A2 human
            Renamed CYP1D1P orthologous to fish 1D1
NT_008580.9|Hs9_8737 chromosome 9 
4822084 MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPY 4822260
4822261 LTLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLHKDGEHFAGRPNMHTFSFLAEGKS 4822440
4822441 LSFSVNYGESWKLHKKIASKAL*TFSNAEAKSSTCSCSLEEHVTEEISELVTVFVELTSK 4822620
4822621 NGSFDPRNAITCVVANIVCALCFGKR*DHSDEEFLRIVKTNDDLLKASSAANPADFIPCL 4822800
4822801 HYLPLKIINAPLEFYQALNGFIALHVQDHLATYGK 4822905 (0)
4824790 DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVSDLFGA 4824912 (1)
4829424 GFETVSTCLCWSFLYLIHYPEIQARIQEEI 4829513 (1)
4829611 RPPRFEDRKILPYTEAFVSEVFRHASFLPFTIPHS 4829715 (2)
4832677 TTADTTLNGYFIPRKTCTFINMYQVNHDE 4832763 (2)
4835676 TIWDNHSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITTVLQQFK 4835858
4835859 LKK*PRAKLDLTPTYGLVMRPKLYQLQAELHPSGSSSA* 4835975
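
Note: in pseudogene translations such as the one above, `*` inside the sequence marks an in-frame stop, while a `*` at the very end is the normal terminator. A minimal sketch for listing the internal stops follows. The function name is hypothetical, and the input is assumed to be the residue string only, with coordinates already removed.

```python
def internal_stops(translation):
    """Return 0-based positions of '*' that are not the final character."""
    seq = "".join(translation.split())  # drop spaces and line breaks
    return [i for i, c in enumerate(seq[:-1]) if c == "*"]

# First and last translated segments of the human CYP1D1P entry.
first_segment = "MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPY"
last_segment = "LKK*PRAKLDLTPTYGLVMRPKLYQLQAELHPSGSSSA*"

print(internal_stops(first_segment))
print(internal_stops(last_segment))
```

Positions are counted within the string that is passed in, so segments must be joined first if positions in the whole protein are wanted.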

CYP1D1      Macaca mulatta (rhesus monkey)
            chr15  from UCSC browser 81802360-81816347
            92% to human 1D1P
MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL
TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL
SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN
GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR
YLPLQIINAPREFYRALNGFIALHVQDHLATYDK (0)
DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVNDLFGA
GFETVSTCLYWSFLYLIHYPEIQAKIQEEI (1)
DGNIGLKPPRFEDRKILPYT
EAFISEVFRHASFLPFTIPHCNTADTTLNGYFIPRKTCTFINMYQVNHDETIWDNPSLFR
PDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQLKLKKCPRAKL
DLTPTYGLVMRPKPYQLEAERRSSGSSSASILRLRGGFLTQFRKIDELNLLN*
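
Note: the "% to" figures in this list compare one sequence with another. The list does not state how the percentages were computed, so the sketch below is only an assumption: a plain count of identical positions over two segments that are already aligned and of equal length, with no gaps. The function name is hypothetical.

```python
def percent_identity(a, b):
    """Percent of identical positions between two pre-aligned, equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("empty alignment")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)

# Corresponding 30-residue segments copied from the human CYP1D1P
# and rhesus CYP1D1 entries in this list.
human_segment = "GFETVSTCLCWSFLYLIHYPEIQARIQEEI"
rhesus_segment = "GFETVSTCLYWSFLYLIHYPEIQAKIQEEI"

print(round(percent_identity(human_segment, rhesus_segment), 1))  # 93.3
```

A value for one short segment will generally differ from the whole-sequence figure quoted in an entry.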

CYP1D1P/CYP1A8PX ortholog  Bos taurus (cow)
            Renamed CYP1D1P orthologous to fish 1D1
            See cattle page for details
MIFGMAVTSGEVTTSRIILVMVFVFVRELGNKGRKEVFPPGPWSLPIVENLLQLG
DHLYFTFMEMRKKYGDVFLIKLGMVPVLVVNGMEMVKEVLLRNGEHFAA*PNV
LTFSFLAQ*KSLTFS
NYGENWTLHKKIASNALRTFPKAETKSSTRSCLLEKHVIEEVSELVKV
FTELTSRSGSFEPRGAITCAMANVV
CTLCFGKRYDHSDEEFLRIVKTDHDLLKASSAANPADFIPYF*YLPLRIINAPQEFYHARNQ
FIALHIRDHLTT
CPQDHIQDITDALINACHNKYAVAKITILNDDEIISTVSDLVGAG
FEIISTCIYWSFLYLIYYPEIQVKIQEEI
DGNTGMKSPRFENRKILP
YTEAFINEIFRHTSFLPFTIPHC (2)
TTADTTLNGYFIPRKTCTFINMYQVNHDE (2)
TIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMGIQKCL
REEVAQNEVFVFITTVLQQLTLKKCPVVKLDLTPTYGLVMKPKPYQLPAEPRSMGSSCS*

CYP1D1P    dog
           UCSC browser chr 1 87915406-87928215 (-) strand
           57% to human 1D1P
VIAELISKNGNFGLRSVITCVVVVVNVICILCFSMRYD
HI*EEFLRIHKMNAHLLETSSEANPADFMPCFLYRPL*IINAYQEFYQAPN*FIALHDHLTTYDN
DHI*AIADALINACHNKYGTMEAATINDDEIISTMNGLFGA
GLETIAIFLFWGFLF
IIHFFQVKTWGWESVRFEHRKIIPYTEASIN*IFRYAPFLPLAIPHC (2)
STTEDTVQNGYFIPRKSCTFISMC*INHNQ
NIWDNPKLFRSQRFINENRE*KS*EQNVDIWNGTLEVSHRR**RNEICIFITSV

CYP1D1P    Oryctolagus cuniculus (rabbit)
           GenEMBL AAGW01268851.1
           57% to human 1D1P, only 30% to 1A1
2347 VSVFVRALGSRNRKQVSTAGP*AFSNLFQLGAYPFLI**RGERNRDVFLFTFVVLP 2514
2515 VVVVNGMEMVKKTLLSDGKHFSGRPDMHTIAFLEEGKGLSSFVTHGES*KLYFQCVSNAL 2694
2695 CTFSKVEAK
     FSTYSCLLEEHITEE
     ASELMKVFVELTTKSGNFG 2825
2826 LRNAIPWHDQN
2857 IVGALCFGKRYDHNDGKSLSVVK
     SNGLFKFPSKAKPQ
     FIPQFHYLPLQIINIP*WL 3030
3031 YQALNQFTDLQVQGHLRMYDK 3093

CYP1D1P    Sus scrofa
           GenEMBL CT232614.1, CT282345.1
           77% to human 1D1P only 32% to 1A1 human
376  VFVFVRALRNNGRKQVFPPGSCSFPIIGNLQLGGHPYLTFMEMRKKYGVVFFIKLGVMPV  555
556  LVVNGMEMVKQVLLKGGEHVAGRLHMHTFSFLAKGKSLTFLANYRESCKLCKKIASNAL*  735
736  TFSQEETKSPTCSCFLEEHVVEEVSELVKVFAELTSNSCSFDCRSAI  876
     TVVANIVFALCFGKRYDHSDEEFLRIVKT

CYP1D1     Otolemur garnettii (small-eared galago)
           GenEMBL WGS seq. AAQR01460136.1 N-terminal
6245  MISHLAITPREVTISLVILVIVFVFLRVLRSKGRKQVSPPGPLSFPIIGNLLQLGEHPYL  6066
6065  TFMEMRRQYGDIFLLRLGTVPVVVVNGVEMVKQVLLKDGEYFAGRPNMHTFSFLAEGKSL  5886
5885  TFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVTEEASELVKVFVELTSKN  5706
5705  GSFNPRSAITCAVANVVCALCFGKRYDHGDEEFLRIVKTNDDLLKASSAANPADFIPCFR  5526
5525  YLPLRIINAPREFYQALNRFIALQVQDHLTTYDK 5424

CYP1D1     Myotis lucifugus (little brown bat) 
           GenEMBL WGS seq AAPE01629621
           MULTIPLE FRAMESHIFTS, BUT NO STOPS, MAY BE SEQ ERRORS
13312  MILDKAITPEEVTTSLIILVIVFVFVRALMSKGRRQVSLPGPWSFPLIGNLLQLGDHPFL  13133
13132  TFTEMRKKYGDVFLIKLGMVPVVVVNGMEMVKHVLLKDGEHFAGRPNMHTFSFLAEGKSF  12953
12952  SFSVNYGESWKLHKKIASSALRTFSKAEAKSSTCSCLLEEQVIEEVSELVKVFAELTSKK  12773
12772  GSFEPRNAITCAVANVVCALCFGKRYDHSDEEFIRIVKTNDDLLKASSAANPADFIPCFR  12593
12592  YLPLRIINAPREFYRALNEFITLHVQDHLTTYDK (0)  12491
11217  DHMRDITDALINTCHKKICTTKXXXLNDDE II STVNDIXGA (1) 11131
10594  GFETVSTCLYWSFLYLIYYPEIQARIQEEI (1)
10415  DGNIGLKPPRFEDRKMLPYTEAFINEVFRHASFIPFTIPHC (2) 10293
 8366  TTADTTLNGYFIPKNTCTFINMYQVNHDE  8280
 5747  TIWDIQS VFSPERFLNENRELNKSLXX  5610
 5601  KVLIFGMGIRKCLGEDVARNEVFLFITMVLQQLKLHKCPRAELDLTPTYGLAMKPKPYQL  5422
 5421  QAEPRSADSAS*  5386

CYP1D1     Tupaia belangeri (northern tree shrew)
           GenEMBL WGS seq. AAPY01014831.1 N-terminal
1294  MIFHLAVTPGEVTITLIILVVIFVFVKTLGNKGRKRLSPPGPWSFPIIGNLFQLGDHPYL  1115
1114  TFMEMRKKYGDVFMLRLGMVPVLVVNGMEMVKQVLLKDTEHFAGRPDMHSFSFLAEGKSL  935
934   SFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVTEEVSELVKVFTESTSKN  755
754   GSFDPRNAITCAVANVVCALCFGKRYDHSDKEFLRIIKTNDDLLKASSAANPVDFIPCFR  575
574   YLPLRIINAPREFYRALNKFIALHVQDHITTYDK  473

CYP1D1     Sorex araneus (European shrew)
           GenEMBL WGS seq. AALT01503634.1 
12376  MIFNVAVNSGDLSTSLIVFVVVFVIVRALGSKGRKQGFPPGPRALPILGNLLQLGDYPYL  12197
12196  TFMEMRKKYGDVFLIRLGMVPVVVVNGMETVKQVLLKDGEKFAGRPKMHTFSFLAEGKSL  12017
12016  SFSVNYGESWKLQKKIASNSLRTFSKAEAKSSSCSCLLEEHVLEEVSELISIFEKLTSEN  11837
11836  GSFDPRNAITCAVANIVCALCFGKRYDHSDEEFLRIVKTNDDILKASSAANPADFIPCFR  11657
11656  YLPLPIVNGPRKFYRALNQFISLHVRDHYTTYDK  11555
 9964  QDHIRDITDALISTCQNKYSSKKATLNDDEVISVVNDIFGA  9842
 6041  GFETVSTCLYWSFLYLIQYPEIQVKVQEEI  5952
 5868  IGLKSPTFEDRKILPYTEAFITEVFRHASFIPLTIPH  5758
 2010  TVDTTLNGYFIPKKTCTFINMYQVNHDE  1927

CYP1D1     Echinops telfairi (small Madagascar hedgehog)
           GenEMBL WGS seq. AAIY01323088.1
1272  MMFDSAAVPGEVTASLLVLVIVFVFIRARESQEGKKIPPPGPWSFPIIGNLLQLGAHPYL  1093
1092  TFMEMRKKYGDVFLIKLGVVPVLVVNGMEMVRRVLARDGEHFAGRPAMHTFSFLAEGKSF  913
912   SFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVAEEVAVLVRAFAELTSTN  733
732   GSFEPRSVITCAVANVVCALCFGKRYEHSDEEFLKVVQTNDELLKASSAANPADFIPCFR  553
552   YLPLRIINAPREFYQALNQFITRHVQDHLTTYDK

CYP1D1    Loxodonta africana (African Elephant)
          GenEMBL WGS seq. AAGU01360158.1
9163  MIFSLAVTPGEATTCLIVLVIVFVFVRALRNRDGKQVSLPGPWSFPIIGNLPQIGDHPYL  8984
8983  TFMEMRKKYGDVFLIRLGMVPVVVVNGMEMVKQVLLKDGEKFAGRPNMHTFSVLAEKKSL  8804
8803  SFSVNYGESWKLHKKIASNALRTFSKAEAKSSTCSCLLEEHVTEEVSELVKVFAELTSKN  8624
8623  GSFEPRSVITCSVANVVCALCFGKRYEHNDEEFLQIVKTNDELLKASSAANPADFIPCFR  8444
8443  YLPLGVINAPRKFYQALYQFIALHVQDHLTTYDKVRI  8333
6611  QDHIRDITDALINTCHNKHAATKTATLNDDEIINTVGDLFGA  6486
24XX  GFETVSTCLYWSFLYLIRYPEIQAKIQEEI
      DGNIGLKSPRFDDRKILPYTEAFVNEIFRHASFFPFTIPH  2139

CYP1D1     Monodelphis domestica (gray short-tailed opossum)
           GenEMBL XM_001373076.1
           72% to 1D1P human
           not a pseudogene Built_from_Q9PTY7_and_others
           405900 - 420186 bp (405.9 Kb) on chromosome fragment scaffold_15058
           This transcript is located in sequence: contig_41044
MFVIETISKEVTISFLVLMIVFIFIRALGNRNKKHMSPPGPRPFPIIGNLLQLGDHPYLTFMEMKKKYG
DVFLIKLGMVPVVVVNGTEMVKKGLLKDGENFAGRPHMYTFSFFAEGKSLSFSVNYGESW
KLHKKIAMNALRNFSKAEAKSSTCSCVLEEHVTEEASELVKIFSKLSLKQGSFDPKSSIT
CAVANVVCALCFGKRYGHFDKEFLRIIKTNEEFLKASSAANPADFIPCFRYLPLRIIHAP
REFYCQLNHFIEQHVQDHITTFDKNHLRDITDALVSICRDKSATIKTATLSDNEIISTVS
DIFGAGFETVSGFLHWSFLYLIYYPEIQAKIHEEIDGIIGFKPPRFKDRKNLPYTEAFIN
EIFRHTTFVPFTIPHCTTKDTTLNGYFIPQKTCVFFNMYQVNHDETLWENPDSFQPERFL
NEKGEMNKNLVEKVLIFGMGIRKCLGEDVARNEVFIFIVSILQQLKLKKCPEVQLDLTPV
YGLVMKPKPYQLIVEPRFHVNSST*

CYP1D1     Ornithorhynchus anatinus (duckbill platypus)
           GenEMBL AAPN01253410.1 16801-19436, AAPN01253411.1 386-472
           AAPN01253413.1 1531-1812
           74% to 1D1 opossum
MIPGELTTSLLMLVIVLISINVLRNRGQKPPSPPGPWALPVIGNLLQLGEHPYLSFIEMR
KKYGDVFLIKLGMVPVVVVNGMEPVKRVLFQDGENYAGRPNMHTFSFFANGKSLSFSTNY
GDSWKHHKKMAINALKSFSKAEAKSSTCSCLLEEHVCGEVSELVKIFTELTATQGNFDPR
GSLTCAVANVVCALCFGKRYEHTDEKFLKVIKINDDLLKASSAVNPADFIPCFRYLPLRV
VNAPREYYHMLNQFIMQHVQEHYVTYDE (0)
GYLRDITDALISICYDKNSTGKTPILPDDTIISTVNDIFGA (1)
GFDTVSTCLNWSFLYLINYPEIQTKIQAEI (1)
DGNIGLKPPRFEDRKNLPYTEAFINEIFRHTTFLPFTIPHC (2)
TTADTILNGYFIPQKTCVFVNIYQVNHDE (2)
TLWEKPDLFRPERFLNENGELNKGLVEKVLIFGLGIRKCLGEDVARNEIFIFITNVLQHL
KLEKCSGAQLDLTPVYGLSMKPKPYHIKAEPRF*

CYP1D2P    Ornithorhynchus anatinus (duckbill platypus)
           GenEMBL AAPN01177473.1
           87% to CYP1D1 Ornithorhynchus
           processed pseudogene no introns
DDTIISTANDIFGAGFDTVSTCLSRRFL*LINYREIQTKIQAEIDGNIGQEPPRFEDRKNLP
FTEGFINEIFRHTTFLPFTIPHCTTADISGYFIPQKTCIFVNKYQVNHDETLWENPDLFRPERFLNEN

CYP1D1     Anolis carolinensis lizard
           FG695750.1 FG777243.1 FG739979.1 FG695729 ESTs
           Genomic AAWZ01004734.1 
           63% to 1D1P human
MFFSTEVSFSEVTITLFVVAAIFISIHMLMKTKRPHPPGPWSLPILGNLLQVEEHPYI 231
SFQRMRKKYGDVFQIKLGMVPVVVVNGLDAVKQVLLRDGESFAGRPDMHTFSFFADGDSM 411
SFSVNYGESWKLQKKIAGRALKLLSKSEAKSSTCSCLLEEHVCDEASELVKILLELSKN 588
GGFDPAAVTTCTAANVVCALCFGKRYNHNDEEFLGVIKLNDDFVKASSAFNPADFIPCLR 768
YLPLPAAKVARTFYRKLNDF 828
VSACVEYHCTTYDK (0)
NYVRDITDALINVGNEKKEDGKTAALSDKKIISTVNDIFGA (1)
GFSTVSACLLWIYLYLISKPEIQTKIQEEI (1)
GLRPPRFDDRKYLHYTEAFINEIFRHCSFLPFTIPHC (2)
STTRDAVLNGYYIPQSTCIFINMYQVNHDE (2)
RDVWEDPYSFKPERFLNESGELNKSLVEKVLIFGMGIRKCLGEELARNEVFVIITTIL
QQLRLEKPPEDKLDLTPMYGLTMSPKPYRLQAALRT*

CYP1D1/CYP1A8PX ortholog  Xenopus tropicalis (frog)
           This is not a pseudogene in frogs
           It needs a new subfamily name, since it is 
           Separate from the CYP1A subfamily
           Renamed CYP1D1
           DN053435 DN024870 
           DN024871 mate pair to DN024870
           DN025714.1
           51% to CYP1A8P ortholog
MESAVKKTLMDMMPMLLKASISFLTVLLVMSILWKKRNSLPGPWAVPI
VGNFFQLGDQIHITLTDMRNRYGDVFQIKLGLMPIVVVSGLETVKRVLLKEGENFADRPN
FYSFSLFSNGSSMTFSEKYGESWKIHKKIMKNALRNLSNESTNSSNCSCRLEEYVCAEAS
DLVQELTDLSAEKVAFDPSQSIVITVANVVCALSFGKRYDHHDKEFLTLIDFNNDLRKA
AGGGLLADFIPILRFIPSSSVKALKKFVQSFHSFIAKCVKDHFATFEENNIRDITDA
LIQLCKERKSEDKNQLLSDDQIISTVNDIFGAGFDTITSALLWAIFYLLRYPEFQDKIHK
EIEEKIGCNRAPRFNDRKDLHYTEAFINEVLRHSSFVPFGLPHCTTMDTKLNGYFLPKGT
CVFTNLYQVNHDNTVWKDADMFMPERFLDQNGQIIKSLTEKVLVFGMGVRKCLGEDVARN
EMFVIMTIMMQRLKLVKSTKHELDPIPVYGLTLKPKPYYLVAKVRT*

CYP1D1     Danio rerio (zebrafish)
           GenEMBL NM_001007310 5 introns
           Note: CYP1C has no introns, 1B1 has 1 intron (not shared with 1D1)
           CYP1A zebrafish has the same five introns
           50% to CYP1A7 Xenopus, 49% to mouse Cyp1a1, 46% to 1A zebrafish
           41% to 1C2 zebrafish, 36% to 1B1 zebrafish
 89108 MNLENISHTATSEVTLILCAFALLLLALHGRRRAPGVPVPPGPRPWPIVG
       NFLQMEEQVHLSLTNLRVQYGDVFQVKMGSLVVVVLSGYTTIKEALVRQGDA
       FAGRPDLYTFSAVANGTSMTFSEKYGEAWVLHKKICKNALRTFSQTEPKDSNASCLLE
       ERICVEAIDMVETLKAQGEEFGDSGIDPVQLLVTSVANVVCTLCFGKRYSHNDKEFLT
       IVHINNEVLRLFAAGNLADFFPIFRYLPSPSLRKMVEFINRMNNFMERNIMEHLVNFDT (0) 89938
 94917 NCIRDITDALIAMCEDRQEDKESAVLSNSQIVHSVIDIFGA (1) 95039
 95618 GFDTIITGLQWSLLYLIKFPNIQDKIVQEI (1) 95707
 98382 DNQVGMDRLPQFKDRPNMPYTEAFINEVFRHASYMPFTIPHC (2) 98507
 98613 TTENITLNGYFIPKDTCVFINQYQVNHDI (2) 98700
101355 EIWDDPESFRPERFLTLSGHLNKSLTEKVMIFGMGIRRCLGDNIARLEM
       FVFLTTLLHRLHIENVPGQELDLSSTFGLTMKPRPYRIKIIPRN* 101636

CYP1D1   Pimephales promelas (Cyprinid fish)
         GenEMBL DT309726.1 EST testis 
         About 80% to zebrafish 1D1
69   MYLEEISRTTNVTSGLTLFLCAFALLLLALHGRRRGPGCSFPPGPKPWPLVGNLFQMGEQ  248
249  IHLSLTNLRVQYGDVFQVQMGSLVVVVLSGYSTIKEALVRKGEAFAGRPDLFTFSAVANG  428
429  TSMTFSEKYGEAWVLHKKICRNALRTFSQAEPRDSSASCLLEEHICTEAMEMVKALKEQG  608
609  DK  614
missing some sequence here
614  GNLADFFPIFRYLPSPSLRKMVQHIGRMNSFMECNIREHLITFDRNCIRDITDALIAMSE  793
794  DRQEDEETAMLSNSQIVHSVIDI  862

CYP1D1   Callorhinchus milii (elephant shark, Chondrichthyes)
         GenEMBL CW874708.1 CW863449.1 GSS sequences
         AAVX01473941.1 WGS
         Trace archive files 1573350467 (exon 5) 1574214913 (exon 6)
         1573943089 (exon 2)
         About 67% to Gasterosteus aculeatus (stickleback) 1D1
     PVEPITSTVANVICALCFGKRYEHNDKEFLNIVHTNHEVMRTFASGNVADVFPFFRYLPS
     PSLKSMIKFVNRLNNFMIKSIQEHYTTFDK

     GFDTIITGLQWCLLYLIQYPEFQTRIQQEI (1)
144  DEKVGQSRLPRFEDRTLLPFTEAFINEVFRHTTYMPFTIPHC (2)  19
     TTASTTLNGYFIPKDTCVFINQYQVNHDE (2)

CYP1D1   Oryzias latipes
         GenEMBL BAAF03028505.1 WGS seq
         69% to zebrafish 1D1, only 48% to CYP1A
25653 MLSGTLPIA
25626 ESLSASLSSVTVVLFLIALGLMAIRVQKSRSSPFNVKDDSHLDLTAFPSPPGPTPWPIVG 25447
25446 NLFQMGNQMHLSLTLLRAKHGDVFK (0)
24429 LRLGSLPVVVLSGYNTIRQALVRQGEDFAGRPELFTFSAVADGTSMTFSEKFGPAWLLH 24253
24252 KKLCKNALRSFSQAAPRGSGATCLLEEHVCAEAAEMLEMIREQSAKVELDSEMTDGASKG 24073
24072 VDPVKPLVTSVANVVCALCFGKRYDHNDKEFLTIVNINNEVLKLFAAGNLADFFPVFRYF 23893
23892 PSLSLKELVQYIRRMNGFMERRIEEHMHTFDK (0) 23800
23189 NYIRDITDALIALCEDREKSKEMSLLSDTQIIHSVIDIFGA (1) 23067
22979 GFDTIIAGLQWSLLYLIKFPDVQRRIHQEI (1) 22890
20183 DEHIGSARMPNFSDKSKMPFTEAFIYEVFRHAAYVPFTIPHC (2) 20058
19961 TTRHTTLNGYFIPKDTCVFINQYQVNHDK (2) 19875
19791 DLWGDPEQFCPDRFLGHSGQLNKELTEKVLIFGMGKRRCLGDGFARLEMFVFLATLLHGL 19612
19611 RIENVPGQKLDLGTDFGLTMKPHPYKITVSSRFTEM* 19501

CYP1D1   Gasterosteus aculeatus (stickleback)
         GenEMBL AANH01001861.1
         77% to Oryzias 1D1
54662 MRVTFGIFPIKENTCASLSSVTVVLCLINLLLMALVCRKNHCHNSRLDHTKYPTPPGPT 54486
54485 PWPLVGNLLQMGDQIHLSLTRLRLQYGDVFK (0) 54393
54293 MRLGSLTVVVLSGHNTIRQALVRQGEAFAGRPDLFTFSAVANGTSMTFSEKYGPAWMLHK 54114
54113 KLCKNALRSFSRAEPRESGATCLLEEHVCAEAAEMVEVMYEQAAAEREMGHKVMGI 53946
53945 DPVVPVVTSVANVVCALCFGKRYDYNDKEFLTIVHINNEVLRIFAAGNMADFFPVFRYFP 53766
53765 SPSLRKMVQHIQRMNGFMERSIEEHINTFDK (0) 53673
53010 NYIRDITDALIALCEDREENQDTSLLSKSQIIHTVVDIFGA (1) 52888
52795 GFDTIIAGLQWSLLYLIKYPDIQDRIHQEI (1) 52706
51800 DDHIGIARLPMFSDKPKMPFTEAFMYEVFRHASYVPFTIPHC (2) 51675
51589 TTRNITLNGYFIPKDTCVFINQYQVNHD (2) 51506
51396 DLWGDPDRFRPARFLGSLGLLNKELTEKVLIFGVGKRRCLGDGLARLEMFVFLTTLLHRT 51217
51216 RIENVPGQQLDLSTDFGLTMKPRPYRITISSRF* 51115

CYP1E1   Ciona intestinalis (sea squirt)
         JGI Ciona genome ver.2 gene model 131189
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1As, but only about 33% identical to CYP1As
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
MMITAAILLDAGRSFAVPVAFTAVSVLTLYVCLRKRQGIPPGPTAWPLVGNL
FSMGRQSHLILESMRKTYGDVFSVYFGSTLVVVVNGKAVEECLSTHSAR (2)
YSMRPELHTAQYILEGKSFAFSHIAVSKHKRYRTLAVAVVKQLVNGGGEKTDVAV
KHGLQNGTRHSSIEERIFMEAACMCDKLLETSDSPDLKDEILKVITKEL (2)
LSEYELDEISRVVENLRNSNEAIMLVNFIPAVRMLWRNGLQKYIQLTQSLNR (2)
FFERCIRNRKAQLATVSNGHTEDNGVRLTNGVDCTVKFWQKLKNDPQYEESRVMKV (0)
VADLFGARVDTMTVALAWMIVYWSTYQAAQERAQKEIDHFVKNEKRLPR (2)
YSERNQLPYTMALIMEVERHCSFVPFTLPHAPAQDTMLNGYLIPKGTMMLISMRSINHDTAVWDSPAQFR (2)
PERFLLDQSGGFNSALAEQVMLFGAGRRRCAGEALGRMQIFLYSVLFLRKCTFRR
SDKDGHVLPESLAGISLIPQTMCVSISRREADGSKNTEP*

CYP1E1   Ciona savignyi (sea squirt)
         Ortholog of C. intestinalis CYP1E1 
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1As, but only about 33% identical to CYP1As
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
         75% identical to C. intestinalis CYP1E1
         paired_scaffold_63
595236 SICLPITAFALSLIYLHRRKRDNLPPGPFAWPVLGNLLSLRSNSTAALEEIRRTYGDV 595063
595062 YSLYFGSRLVVVVNGKAVEECLSTRSAK 594979
594724 RFSMRPELFTAQYVLGGKSFAFSHMDVETHRRYRKLAVGVVKELLVSTHERSQPTTMEEV 594545
594544 NRIPPQSIEDQIYAQAKRLCVGLFDIYASNSKSGQLDIRKEIMRRISFEM 594395
594161 LWEHELADLSELVEDLRNSNDATLILNFIPISRYLWKKGLRKYIKINQDLNK 
592629 FFSRCFDRRNPHVANGSDCCKSEETCDVLSGIDCVLKLWQQLKDDPQFEENRVMKLVRKLFKCN 592438
591699 VGDLFGANVDTMTVALAWMIVYWSTYHQAQTRAQEEIDRFVETNFHLPRY 591550
591042 RYSDRSQLPFVMALIWEVARHCSFVPFALPHAPVEDTTLNGYLIPSGTVMMISMRSVNHDQTLWDS 590845
590844 PGEFR 590830
590562 PERFISSETGVFNKGLADRVMLFGGGRRRCAGEALARMQLFLFSVSILRSCTIRRVDHS 590386
590385 DVLPD 590371

CYP1F1   Ciona intestinalis (sea squirt)
         JGI Ciona genome ver.2 gene model 136792
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
MLVQILTATFWTLIP
NSFGDLLIYAILVLTIVIYVKSLKRDKEWLALPGPIPW PLVGNA PFLGAEPHKKLLELSL
KYGPVYRLKMGGIKTVVLCNAEVVRSALIKQREAFSGRPKFSSYKAVS AGESVVFNDEET
LPP WRSH KSKIVRHMHKYTTSIRTRDKVTDLINTECMMMVTELDRISRSKCVNPENVIRM
ALANVMCAVCFGNRFEYDNE (0) 
EFQKLLSMNTEFGAVIELGPIIDAMPWIK (0)
VIPKFKKAIADYLKINLQLDTWSRHR (2)
VDGVLKTFDNDDVTNVVASMTSEVLEKKSAGESREITESETKTIAALSADILGA 
GQHTTSTTFFWVINLLLCFPKVLNKLTEEVRSKLGNRLPTLEDRTSLPYMDAVLTE 
VLRFSSPLSSTIPHSTLKDVKLAGHTIKRGTMVIISQYAVNHDPQNWKNPENFDPERFLTK
NEGGEIIFNESLSEKVLAFSIGERKCPGSQLSRMLLFLATTLLVQVSDLSADLERPPT
AAAEYGLILRPKHLSIKLTLREHWQRRDSIRA*

CYP1F1   Ciona savignyi (sea squirt)
         Ortholog of C. intestinalis CYP1F1
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
         paired_scaffold_56 66% to C. intestinalis CYP1F1
957040 VLIYISMVSIVVIYVKSVKRNKEFMALPGPTPWPIVGNAPFLGKQPHKTLLQLSQK 956873
956872 YGPIYRLKMGSVEAVILCDLDVIRCALIKQREVFSGRPKFESYKAVSAGESVVFNDSESL 956693
956692 APWKSHKSKILRHLHKFATSVRTKEKVNNIITTECMLMLQCLHRRSQDGFVDPEDVIRMT 956513
956512 IANVMCAVCYGNRFEYENE 956456
950636 GQHTTSGTFFWVINILLFYPKVLQRITNEVRSKIGERIPTLEDQADLPYVEAFLTEV 950466
949639 VLRFASPLSSTIPHSTTKDTTLKGYKIKRNTMVIISQYSVNHDPKIWRNPEVFDPERFLTRDENTNLVFND 949427
949426 ALAEKVLSFSVGERKCPGSRMSQMVLFLATCLLVHTGTLYPNPDRPPS 949283
949282 PVDDAQYGLILRPEYISMKFLLDKKW 949205

CYP1F2   Ciona intestinalis (sea squirt)
         JGI Ciona genome ver.2 gene model 143263
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
MDSLVFVLVDTVLVMKYQILLLLVIVYAIKLLAASQSRRLNIPGPYPWPVIGNVIEMGGQPQFSLTNMAK (?)
RYGPVYLMKLGTADVLVLNNYEVIKEALLRQRRIFGGRPIFDSFKKISQGLGVVFNSTMT
QGDEWMKLKMTIVKHVHRFVSSEETKGYVAHHVQMEAVELVRILTEKCRS
SPNEVIFPIEQINLAIANVVCAIMFGHRYQHGNK (0)
EFQDLISLNEQFGDVIGSGSQVDVIPWMK (0)
IFPKFRNALKVFDFLTNRLNNWMRLR (2)
TKEHRLTYKHGVIRDIVDSFIAESIDHPEQSALNDDVIMALTTDVFGA
GQDTMSTTMQWVFVYMMHFKECQRK
IHAELDSVIGPGELPHISDRRRLPYLEAVMHEIFRHSTFTSTTIPHVTTQDTVLDGHFIP 
KGILVFINQFGANHDPNHWVDPDKFIPERFLDGKGNLISRPHDRYLLFSTGARKCPG 
DELSRMLILHFMATMFALCEVSSDPQKPATL
DAVYNLSMRPKELRTIVRS
RNLPFLKNSVAQMSEADSHVLTVPGETTSFLTSRVESTVPDNQESQFSDNDFEKVDTKIP
KRKVFSRPTLTHDDINGNNVRKRGNLHQSAMYRIQLAT*

CYP1F2   Ciona savignyi (sea squirt)
         Ortholog of C. intestinalis CYP1F2
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
         paired_scaffold_142 77% to C. intestinalis CYP1F2
222183 FRRYGPIYLIKLGTADVLILNNYDVIKEALIRQRGVFSGRPVFESFKKISQ 222031
222030 GRGIVFNSSLTQGAAWQRMKMTIVKHLHRFIASPQTKGFVAGHVQKETVQLVHILSEKCR 221851
221850 SSTNQAIEPVENINLAVANVVCSIMFGHRYQHGNK 221746
219363 LHRTREHRQSYKHGVIRDLVDSFIAESIDKPGQLLNDDVIMALTTDVFGAGQDT 219202
219201 MSTTLQWIFVYMMRFKECQKK 219139
218667 IHAELDSVLKPGSLPQIKDRARLPYLEAVMHEIFRHSTFTTTTIPHVTTEDTVLRGYHLPKET 218479
218478 LIFINQYAANHDPEHWVEPDKFIPERFLDEKGNLISRPHDRYLLFSTGSRKCPGDELSRM 218299
218298 LILYLMANIFTLCEISPDPNQPTTLDAVYTLSMRPKNVKTVVRVR 218164

CYP1F3   Ciona intestinalis (sea squirt)
         JGI Ciona genome ver.2 gene model 138492
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1Bs and CYP1Cs, but only about 33% identical to vert CYP1s
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
LPYPRGLPIIGNIHQMGNFPHVKLTEWSKQFGDFYRIKMGRYDALVVNGHENIR (2)
NCLAKKSAAFAGRPPFETSKLIEEGLSISFSNYS (2)
PEWERQKQCTIKALKLYTSGSDKRSTMEETVSSHAKQLAEDLINSADQQ (0)
GLVGDLHDTVIYSTTSVSSTICFGRSFTRQDPELKEFLRNFQSFDKAMGASQIINFWPFLKYFPVLGKSFR (0)
NLKTYMDQYWNFTLSMLEQHWDTYVPNNMRDLADCLWAQSNQ (0)
NRQLTDQQRRIAYGASDAFGAGFDTISAMITWSIFYMAVFPEHQRK (0)
IREEIDRLETSMFSLRHHGDVCPYTQAWLYEVLRH
ISVSPLLVPHYTVKQVEVNGTMIPAGVVVLFNVAN (0)
ADRDTRVWENPEQFEPERFLARDPTTGGARVVASETSKI
LNWGAGKRRCPGAELSRHELFIYIANLVKLCYIE
QAVEGIEPAIPWPCTPGISTKPKAFRVKVTQR*

CYP1F3   Ciona savignyi (sea squirt)
         Ortholog of C. intestinalis CYP1F3
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1Bs and CYP1Cs, but only about 33% identical to CYP1Bs
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
         paired_scaffold_3 56% to C. intestinalis CYP1F3
LPSPRGLPIIGNVHQLTTSPHVKLSEWAKEFGDLFRIKMGCFDTLVVTGYDNIR
(2) TALVKHSVAFAGRPPYETSKLFSNGLSLAFNNY
(2) SPAWEKQKRCTVKALKLYTAGPDLQKRNAMEDTASYQANLLVDQLLASVNK (0)
DAITNPDEIVHHSATNVISNICFGRSFSKNDPELQKFVSINRAFDRAMGSAQIVNFWPFLKSVPVLGRSYQ
NLKAHMDVFWDFVFPNLKEHWKTYNPSNIRDIADCLWYQSH
TSSKRDLQRRIASAASDIFGAGYDTTHKVVLWSLFYMAAFPQYQQKV
RDIFRVSEVKMY
TLRHHGDECPYVQAWIYEVLRHTSLAPILLPHYTTKEVTLNGVRIPAGVV
KKYHTIQAHKDPKIWKNPDEFDPGHFLEEDGSKLRSEAVHKLLSWGAGKRRCPGAELSRHE
IFVFVTTLVRRAYIGQAVDGVEPAFPWNTTGGISISPDPFRVKITER

CYP1F4   Ciona intestinalis (sea squirt)
         JGI Ciona genome ver.2 gene model 132188
         Clusters inside the vertebrate CYP1s on NJ trees
         Closest to CYP1Bs and CYP1Cs, but only about 29% identical to vert CYP1s
         Note: the Ciona genome is greatly diverged from the 
         Vertebrate line and seems to be undergoing rapid evolution
         No ortholog is found in C. savignyi
MESVWVVIKWVKETMMSNSSFETIVAVATLLLLLMFVSENWNWLKIPGPI
PWPIIGNLGSLKGTKFLSIHEMYKIYGRIFRLKFGRVEAVVLCDVELIKE
ALLDRGRSLSGRPQFASYRLVSGCKSVVTNDPRCLREWVNY
KSTMVQTLCSISKNNEMKELMNERIGSVLVYMIQELEKGGDGQNFAEDIVTKTVANFLCT
VCYGGTYDFNSK (0)
EFNNLIEMSRHYTDNLSKSILRDMIPLAE (0)
ILPSVNKGRADFAKTSYHLHLWFLKR (2)
VEEVIQHFQPNKLNDLASVMVSDLTNDPTENISNITEKDRNSIAAIINDLVQ (1)
GYHSLYSMALWVVTYMIKYPEEVKKIENELNEVLDDYLPTLHDQESLPHTMAFINE (0)
VLRCRPSLPLAVPHSATEDTKLGGYDISKDTMVVASLYSANRDPKVWANPDQFDPSR
FLAKDDLGVTVLDETKVEQVFTFSLGDRKCPGEDIGRSFLFLTTAYLAHTCKLKPDPAK
PPTFQTKPGSITRPKDFGVQLNVKKCWLGVFKPDDNEE*

2A Subfamily

CYP2A1      rat
            PIR C41425 (12 amino acids)
            Imaoka, S., Kamataki, T. and Funae, Y.
            Purification and characterization of six cytochromes P-450
            from hepatic microsomes of immature female rats.
            J. Biochem. 102, 843-851 (1987)

CYP2A1      rat
            GenEMBL J02669
            1 aa diff to genome seq (lower case)
82084958 MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGN
YLQLNTKDVYSSITQLSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGE
QATYNTLFKGYGVAFSSGERAKQLRRLSIATLRDFGVGKRGVEERILEEAGYLIKMLQ
GTCGAPIDPTIYLSKTVSNVISSIVFGERFDYEDTEFLSLLQMMGQMNRFAASPTGQL
YDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE
EKNGNSEFHMKNLVMTTLSLFFAGSETVSSTLRYGFLLLMKHPDVEAKVHEEIEQVIG
RNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPKaTDVFPI
LGSLMTDPKFFPSPKDFDPQNFLDDKGQLKKNAAFLPFSTGKRFCLGDGLAKMELFLL
LTTILQNFRFKFPMKLEDINESPKPLGFTRIIPKYTMSFMPI
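
Note: the entry above marks its single difference from the genome sequence in lower case. A short sketch for locating such marked residues and confirming them against a second sequence is given below. The function names are hypothetical, and the two strings are assumed to be aligned and of equal length.

```python
def lowercase_sites(seq):
    """Return (0-based index, residue) for every lower-case letter."""
    return [(i, c) for i, c in enumerate(seq) if c.islower()]

def differences(a, b):
    """Return (index, residue in a, residue in b) where the strings differ, ignoring case."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b))
            if x.upper() != y.upper()]

# Fragment around the lower-case residue in the J02669 entry, and the same
# stretch from the genomic CYP2A1 entry in this list (joined across the
# boundary between two of its coordinate lines).
cdna_fragment = "FRGFFLPKaTDVFPI"
genome_fragment = "FRGFFLPKGTDVFPI"

print(lowercase_sites(cdna_fragment))
print(differences(cdna_fragment, genome_fragment))
```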

CYP2A1      rat
            NP_036824 88% to 2A2 chr1 (+) Cyp2a22 ortholog
82084958 MLDTGLLLVVILASLSVMLLVSLWQQKIRGRLPPGPTPLPFIGNYLQLNTKDVYSSITQ 82085134
82085434 LSERYGPVFTIHLGPRRVVVLYGYDAVKEALVDQAEEFSGRGEQATYNTLFKGY 82085595
82088031 GVAFSSGERAKQLRRLSIATLRDFGVGKRGVEERILEEAGYLIKMLQGTC 82088180
82088398 GAPIDPTIYLSKTVSNVISSIVFGERFDYEDTEFLSLLQMMGQMNRFAASPTG 82088556
82089778 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82089957
82093158 EKNGNSEFHMKNLVMTTLSLFFAGSETVSSTLRYGFLLLMKHPDVE 82093295
82093737 AKVHEEIEQVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82093925
82094440 GTDVFPILGSLMTDPKFFPSPKDFDPQNFLDDKGQLKKNAAFLPFST 82094580
82098022 GKRFCLGDGLAKMELFLLLTTILQNFRFKFPMKLEDINESPKPLGFTRIIPKYTMSFMPI 82098201

CYP2A1-de2b rat
            exon 2 pseudogene Chr1 (-) only 240 bp from CYP2A1 start Met
frag e in fig below
82084718 YNAVKEALVDQAEGFSGQGEQA 82084653
rat, mouse and human 2ABFGST clusters

CYP2A2      rat
            PIR S26821 (27 amino acids)
            Matsumoto, T.,  Emi, Y.,  Kawabata, S. and Omura, T.
            Purification and characterization of three male-specific and
            one female-specific forms of cytochrome P-450 from rat
            liver microsomes.
            J. Biochem. 100, 1359-1371 (1986)

CYP2A2      rat
            J04187 Cyp2a12 ortholog
82117349 MLDTGLLLVVILASLSVMFLVSLWQQKIRERLPPGPTPLPFIGNYLQLNMKDVYSSITQ 82117525
82117991 LSERYGPVFTIHLGPRRIVVLYGYDAVKEALVDQAEEFSGRGELPTFNILFKGY 82118152 
82123228 GFSLSNVEQAKRIRRFTIATLRDFGVGKRDVQECILEEAGYLIKTLQGTC 82123377 
82123595 GAPIDPSIYLSKTVSNVINSIVFGNRFDYEDKEFLSLLEMIDEMNIFAASATG 82123753 
82124978 QLYDMFHSVMKYLPGPQQQIIKVTQKLEDFMIEKVRQNHSTLDPNSPRNFIDSFLIRMQE 82125157
82139054 EKYVNSEFHMNNLVMSSLGLLFAGTGSVSSTLYHGFLLLMKHPDVE 82139191 
82139607 AKVHEEIERVIGRNRQPQYEDHMKMPYTQAVINEIQRFSNLAPLGIPRRIIKNTTFRGFFLPK 82139795 
82140311 GTDVFPIIGSLMTEPKFFPNHKDFNPQHFLDDKGQLKKNAAFLPFSI 82140451 
82141451 GKRFCLGDSLAKMELFLLLTTILQNFRFKFPMNLEDINEYPSPIGFTRIIPNYTMSFMPI  82141630

CYP2A2-de2b rat
            exon 2 pseudogene Chr1 (-) frag f in fig below
82115528 LKPHWVVVLYEWDAVKEALGDQAEELSG*GEQANL 82115445
rat, mouse and human 2ABFGST clusters

CYP2A3      rat
            J02852 NM_012542 exon 4 in a seq gap in genome seq chr1 (+) 
            mouse Cyp2a5 ortholog
82023007 MLASGLLLVASVAFLSVLVLMSVWKQRKLSGKLPPGPTPLPFIGNYLQLNTEKMYSSLMK 82023186
82023453 ISQRYGPVFTIHLGPRRVVVLCGQEAVKEALVDQAEEFSGRGEQATFDWLFKGY 82023614
82024296 GVAFSSGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIESFRKTN 82024445
         GALIDPTFYLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTG
82026488 QLYEMFSSVMKHLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMLE 82026667
82028068 EKKNPNTEFYMKNLVLTTLNLFFAGTETVSTTLRYGFLLLMKHPDIE 82028208
82028659 AKVHEEIDRVIGRNRQAKYEDRMKMPYTEAVIHEIQRFADMIPMGLARRVTKDTKFREFLLPK 82028847
82029417 GTEVFPMLGSVLKDPKFFSNPNDFNPKHFLDDKGQFKKSDAFVPFSI 82029557
82030741 GKRYCFGEGLARMELFLFLTNIMQNFCFKSPQAPQDIDVSPRLVGFATIPPNYTMSFLSR 82030920

CYP2A3-de1b rat
            exon 1 pseudogene Chr1 (+)frag d in fig below
82052140 MLGSRLLLVAVLSCLCVMVFMPVWQQQYRDTIPPG 82052244
rat, mouse and human 2ABFGST clusters

Cyp2a4      mouse
            GenEMBL J04631 (multiple genomic fragments)
            PIR A30499 (494 amino acids) PIR A33531 (494 amino acids)
            Lindberg,R., Burkhart,B., Ichikawa,T. and Negishi,M.
            The structure and characterization of type I P-450-15-alpha gene as
            major steroid 15-alpha-hydroxylase and its comparison with type II
            P-450-15-alpha gene
            J. Biol. Chem. 264, 6465-6471 (1989)

Cyp2a4      mouse
            PIR S16067 (494 amino acids)
            Squires, E.J. and Negishi, M.
            Reciprocal regulation of sex-dependent expression of
            testosterone 15-alpha-hydroxylase (P-450-15-alpha) in liver
            and kidney of male mice by androgen. Evidence for a single gene.
            J. Biol. Chem. 263, 4166-4171 (1988)
            Note: 2a-4 and 2a-5 differ at 11 positions.  This sequence is 2a-4 like at
            9/11 positions.
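
Note: a count like "2a-4 like at 9/11 positions" can be reproduced by finding the positions where the two reference sequences differ and then scoring a query at those positions only. The sketch below assumes three aligned, equal-length strings. The function name and the short toy sequences are hypothetical and are not the real Cyp2a4 or Cyp2a5 sequences.

```python
def classify_at_diagnostic_sites(ref_a, ref_b, query):
    """Return (number of sites where the references differ,
    sites where query matches ref_a, sites where query matches ref_b)."""
    if not (len(ref_a) == len(ref_b) == len(query)):
        raise ValueError("all three sequences must have the same length")
    sites = [i for i, (a, b) in enumerate(zip(ref_a, ref_b)) if a != b]
    like_a = sum(1 for i in sites if query[i] == ref_a[i])
    like_b = sum(1 for i in sites if query[i] == ref_b[i])
    return len(sites), like_a, like_b

# Hypothetical toy data: the references differ at three positions.
ref_a = "MLTSGAVKLF"
ref_b = "MLASGVVKIF"
query = "MLTSGVVKLF"

print(classify_at_diagnostic_sites(ref_a, ref_b, query))  # (3, 2, 1)
```

A query residue that matches neither reference at a diagnostic site is counted for neither, so the two counts need not add up to the number of sites.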

Cyp2a4-de7b  mouse
            GenEMBL AC087157.1 + strand
            w in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            detritus exon 7 between Cyp2a4 and Cyp2b9
37037 AKIHEEINQVIGTHRTPRVDDRAKMP 37114
37114 YTDAVIHEIQRLTDIVPLGIPHNVT 37188
37190 RDTHFRGY 37213

Cyp2a5      mouse
            GenEMBL J04631 (multiple genomic fragments)
            PIR B30499 (494 amino acids) PIR B33531 (494 amino acids)
            Lindberg,R., Burkhart,B., Ichikawa,T. and Negishi,M.
            The structure and characterization of type I P-450-15-alpha gene as
            major steroid 15-alpha-hydroxylase and its comparison with type II
            P-450-15-alpha gene
            J. Biol. Chem. 264, 6465-6471 (1989)

Cyp2a5      mouse
            PIR S16068 (494 amino acids)
            Squires, E.J. and Negishi, M.
            Reciprocal regulation of sex-dependent expression of
            testosterone 15-alpha-hydroxylase (P-450-15-alpha) in liver
            and kidney of male mice by androgen. Evidence for a single gene.
            J. Biol. Chem. 263, 4166-4171 (1988)
            Note: 2a-4 and 2a-5 differ at 11 positions.  This sequence is 2a-4 like at
            5/11 positions, and 2a-5 like at 6/11 positions
            
Cyp2a4 or 5 mouse
            PIR S03979 (21 amino acids)
            Lang, M.A., Juvonen, R., Jaervinen, P., Honkakoski, P. and
            Raunio, H.
            Mouse liver P450Coh: genetic regulation of the
            pyrazole-inducible enzyme and comparison with other P450
            isoenzymes.
            Arch. Biochem. Biophys. 271, 139-148 (1989)

CYP2A6      human
            PIR S17220 (20 amino acids)
            Maurice, M., Emiliani, S., Dalet-Beluche, I., Derancourt, J.
            and Lange, R.
            Isolation and characterization of a cytochrome P450 of the
            IIA subfamily from human liver microsomes.
            Eur. J. Biochem. 200, 511-517 (1991)

CYP2A6      human
            PIR A61272 (13 amino acids)
            Yun, C.H., Shimada, T. and Guengerich, F.P.
            Purification and characterization of human liver microsomal
            cytochrome P-450 2A6.
            Mol. Pharmacol. 40, 679-685 (1991)

CYP2A6v2    human
            GenEMBL U22027(7215bp)
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)

CYP2A7      human
            GenEMBL U22029(2282bp)
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)

CYP2A7      baboon (Papio sp.)
            Swiss P80055 (20 amino acids) PIR S21737 (20 amino acids)
            Purification of two cytochrome P450 isozymes related to CYP2A
            and CYP3A gene families from monkey (baboon, Papio papio)
            liver microsomes. Cross reactivity with human forms.
            Dalet-Beluche I., Boulenc X., Fabre G., Maurel P., Bonfils C.
            Eur. J. Biochem. 204, 641-648 (1992)
            MLASGLLLVALLACLTVMVL 
            100% to CYP2A7 human
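
            Note: the percent values in this list presumably come from sequence
            alignments.  For short N-terminal peptides a simple ungapped,
            position-by-position comparison gives a comparable figure.  The sketch
            below is only an illustration of that idea, not the method used for
            this list.  The function name is hypothetical; the two peptides are the
            baboon fragment above and the first 20 residues of the cow CYP2A13
            sequence given later in this list.

```python
def percent_identity(a: str, b: str) -> float:
    """Ungapped positional identity over the shorter of the two sequences."""
    n = min(len(a), len(b))
    if n == 0:
        raise ValueError("empty sequence")
    matches = sum(1 for x, y in zip(a[:n], b[:n]) if x == y)
    return 100.0 * matches / n


baboon_fragment = "MLASGLLLVALLACLTVMVL"   # baboon N-terminal peptide (this entry)
cow_2a13_nterm = "MLASGLLLVALLACLTIMVL"    # first 20 residues of cow CYP2A13

# One substitution (V/I at position 17) in 20 residues
print(percent_identity(baboon_fragment, cow_2a13_nterm))  # 95.0
```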

CYP2A7PTX   human (retired name see CYP2A18PN)
            GenEMBL U22030(1192bp)
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)
            There are two human pseudogenes of 2A7 on chromosome 19.  They are 
            located adjacent to each other.  This one is telomeric.

CYP2A7PCX   human (retired name see CYP2A18PN)
            GenEMBL U22044(1192bp)
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)
            There are two human pseudogenes of 2A7 on chromosome 19.  They are 
            located adjacent to each other.  This one is centromeric.

CYP2A8       Mesocricetus auratus (hamster)
             GenEMBL M63788 M34446 M34447 (1771bp)
             Lai,T.S. and Chiang, J.Y.L.
             Cloning and characterization of two major 3-methylcholanthrene inducible hamster 
             liver cytochrome P-450s.
             Arch. Biochem. Biophys. 283, 429-439 (1990)
             clone MC1 note: M34446 is incorrectly included in this GenBank entry
             and in the 2A9 entry. M34446 should only be in the CYP1A2 hamster entry.

CYP2A9       Mesocricetus auratus (hamster)
             GenEMBL M63789 M34446 M34448 (918bp)
             Lai,T.S. and Chiang, J.Y.L.
             Cloning and characterization of two major 3-methylcholanthrene inducible hamster 
             liver cytochrome P-450s.
             Arch. Biochem. Biophys. 283, 429-439 (1990)
             clone MC1-81 3 prime end 
             note: M34446 is incorrectly included in this GenBank entry
             and in the 2A8 entry. M34446 should only be in the CYP1A2 hamster entry.

CYP2A9      Syrian hamster
            GenEMBL D86953
            Kurose,K., Tohkin,M., Ushio,F. and Fukuhara,M.
            Cloning and characterization of syrian hamster testosterone
            7alpha-hydroxylase, CYP2A9
            Arch. Biochem. Biophys. 351, 60-65 (1998)
            clone name P450SH2A-1
            1 amino acid difference with MC1-81 of Lai and Chiang (incomplete seq.)

CYP2A10     rabbit
            GenEMBL L10236 (1641bp) Swiss Q05555 (494 amino acids)
            Peng,H.-M., Coon,M.J. and Ding,X.
            Isolation and heterologous expression of cloned cDNAs
            for two rabbit nasal microsomal proteins CYP2A10 and CYP2A11
            that are related to nasal microsomal cytochrome P-450 form a.
            J. Biol. Chem. 268, 17253-17260 (1993)

CYP2A10/11  rabbit
            PIR A31944 (23 amino acids)
            Ding, X. and Coon, M.J.
            Purification and characterization of two unique forms of
            cytochrome P-450 from rabbit nasal microsomes.
            Biochemistry 27, 8330-8337 (1988)

CYP2A11     rabbit
            GenEMBL L10237 (2484bp) Swiss Q05556 (494 amino acids)
            Peng,H.-M., Coon,M.J. and Ding,X.
            Isolation and heterologous expression of cloned cDNAs
            for two rabbit nasal microsomal proteins CYP2A10 and CYP2A11
            that are related to nasal microsomal cytochrome P-450 form a.
            J. Biol. Chem. 268, 17253-17260 (1993)

Cyp2a12     mouse
            GenEMBL L06463 (1665bp) PIR S32491 (492 amino acids)
            Iwasaki,M., Juvonen,R., Lindberg,R. and Negishi,M.M.
            Site-directed mutagenesis of mouse steroid 7 alpha-
            hydroxylase cytochrome P-450 (7 alpha): Role of residue
            209 in determining steroid-cytochrome P-450 interaction.
            Biochemical J. 291, 569-573 (1993)
            Note: called 7 alpha hydroxylase, but this sequence is very
            different from CYP7 sequences.  It is actually a 2A sequence.

Cyp2a12-de1b2b  mouse
            GenEMBL NW_000310 (52646-53186) also NT_039413.1 - strand
            note: nuc. numbering same in both
            detritus exons 1 and 2 = s in Figure 2B 
            Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            Between 2a12 and 2f2
            Old name Cyp2a20p
53186 MTLS 53175
53173 MLLVAVLTCFIAMITMSVLR*KKLLGKMPPGPTPLPFLGNFLELDTKKFYDSFLRVVLGREM 52988
52810 IREWYGPVFTVHLGTYSAVVPWGYDVVKETLVDQAEQFSGRGEQAFLDWFFKGYG 52646
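
            Note: the numbers flanking each translated segment are nucleotide
            coordinates on the contig, so an inclusive span should cover three
            nucleotides per residue shown (a stop written as * counts as one
            residue).  The sketch below is a simple consistency check along those
            lines.  The function name is hypothetical and the check assumes the
            segment has no frameshift or partial codon at its ends.

```python
def span_matches_peptide(start: int, end: int, peptide: str) -> bool:
    """True if the inclusive nucleotide span equals 3 nt per translated residue.

    Only the absolute distance is used, so the check works for segments
    listed on either strand (start > end for the minus strand).
    """
    span_nt = abs(start - end) + 1
    return span_nt == 3 * len(peptide)


# Two of the minus-strand segments listed for Cyp2a12-de1b2b
print(span_matches_peptide(53186, 53175, "MTLS"))
print(span_matches_peptide(
    52810, 52646,
    "IREWYGPVFTVHLGTYSAVVPWGYDVVKETLVDQAEQFSGRGEQAFLDWFFKGYG"))
```

            A False result would point to a dropped residue, a frameshift, or a
            mistyped coordinate in the entry.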

CYP2A13     human
            GenEMBL U22028(8778bp)
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)

CYP2A13     Canis familiaris (dog)
            XM_541608.2
            91% to CYP2A13 human 
            There is a second CYP2A in dog CYP2A25 that is 87% to CYP2A13
            This seq is the probable ortholog of CYP2A13
            Dog cluster order 2S(-), 2B(+), 2G(+), 2A25(+), 2A13(-), 2F(+), 2T(+)
            Note: this seq is the same as Seq 2 sent by Tom Rushmore
            on 6/28/05 except for 3 aa diffs

CYP2A13    Canis familiaris (dog)
           NW_876270.1 43229491-43235490
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           92% to human 2A13 probable ortholog
MLASGLLLVALLACLTIIVLMSVWKQRKLGGKLPPGPTPLPFIGNYLQLNTEQMYNSLMKISERYGPVFTIHLGP
RPVVVLCGHEAVKEALVDQAEEFSGRGEQATFDWLFKGYGVAFSNGERAKQLRRFSITTLRDFGVGKRGIEERIQ
EEAGFLIEALRGTRGAFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSMGQLYEMFYS
VMKHLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMQEEQNNPNTEFYLKNLVLTTLNLFF
AGTETVSTTLRYGFLLLMKHPDVEAKVHEEIDRVIGKNRQPKFEDRAKMPYTEAVIHEIQRFGDMIPMGVARRVI
KDTKFREFLLPKGTEVFPMLGSVLRDAKFFSNPQDFHPQHFLDEKGQFKKSDAFVPFSIGKRYCFGEGLARMELF
LFLTTILQNFHFKSPQLPQDIDVSPKHVGFATIPRNYTMSFQPR*

CYP2A13     Bos taurus (cow)
            See cattle page for details
            90% to 2A13 86% to 2A7
MLASGLLLVALLACLTIMVLMSVWRQRNLKGKLPPGPTPLPFIGNYLQLNTEQMCNSLMK
ISEHYGPVFTV
HLGTRQIVVLCGYDAVKEALVDQAEEFSGRGKQATFDWLFKGYGVAFSNGERAKQLRRFS
ITTLRDFGVGKRGIEERIQEEAGFLIEAFRGTRS
AFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTGQ
1 LYEMFYSVMKYLPGPQQQAFKELQGLEDFIAKKVEQNQRTLDPNSPRDFIDSFLIRMQEEKENPNTEFYRK 177
178 NLVMTTLNLFFAGTETVSTTMRDGFLLLMKHPDVEAKIHEEIDRVIGKNRQPKFEDRAKM 357
358 PYTEAVIHEIQRFGDMIPMGLARRVTKDTKFRDFLLPKGTEVFPMLGSVLRDPKFFSNPR 537
538 DFNPQHFLDEKGQFKKSDAFVPFSIGKRYCFGESLARMELFLFFTTIMQNFRFKSPQS 711
712 PQDINVSPKLVGFATIPPNYTMSFLPR*

CYP2A13 frag. Bos taurus (cow)
            PIR A35704 (18 amino acids)
            Lazard, D., Tal, N., Rubinstein, M., Khen, M., Lancet, D. and
            Zupko, K.
            Identification and biochemical analysis of novel olfactory-specific
            cytochrome P-450IIA and UDP-glucuronosyl transferase
            Biochemistry 29, 7433-7440 (1990)
            MXYLPGPQQQAFKELQGL
            1 aa diff to human CYP2A13 and one uncalled amino acid
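
            Note: a peptide fragment with an uncalled residue (X) can be placed on
            a full-length sequence by sliding it along the protein and ignoring
            the X position when counting differences.  The sketch below shows one
            way this might be done.  The function name is hypothetical, and the
            two regions are excerpts of the cow and dog CYP2A13 sequences given in
            this list (the human sequence is not reproduced here).

```python
def best_window(fragment: str, protein: str, wildcard: str = "X"):
    """Slide fragment along protein; return (mismatches, 0-based offset).

    Positions where the fragment holds the wildcard never count as
    mismatches.  The earliest window with the fewest mismatches wins.
    """
    best = None
    for i in range(len(protein) - len(fragment) + 1):
        window = protein[i:i + len(fragment)]
        mism = sum(1 for f, p in zip(fragment, window)
                   if f != wildcard and f != p)
        if best is None or mism < best[0]:
            best = (mism, i)
    return best


fragment = "MXYLPGPQQQAFKELQGL"                      # PIR A35704 peptide
cow_region = "LYEMFYSVMKYLPGPQQQAFKELQGLEDFIAKKVEQ"  # excerpt, cow CYP2A13
dog_region = "LYEMFYSVMKHLPGPQQQAFKELQGLEDFITKKVEQ"  # excerpt, dog CYP2A13

print(best_window(fragment, cow_region))  # (0, 8)
print(best_window(fragment, dog_region))  # (1, 8)
```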

CYP2A13      horse 
             GenEMBL XM_001499763
             Heather Knych
             Submitted to nomenclature committee Oct. 21, 2007
             88% to CYP2A13 human, 89% to dog CYP2A13

CYP2A14      Cricetulus griseus (Chinese hamster)
             GenEMBL D86954 
             Fukuhara,M., Kurose, K., Aiba, N., Matsunaga, N., Omata, W., Kato, K.,
             and Kimura, M.
             A Major Phenobarbital-Inducible P450 Isozyme, CYP2A14, in the
             Chinese Hamster Liver: Purification, Characterization, and cDNA 
             Cloning.
             Arch. Biochem. Biophys. 359, 241-248 (1998)
             clone P450CH2A-2 85% identical to 2A3 and 2a5

CYP2A15      Cricetulus griseus (Chinese hamster)
             GenEMBL AB022916
             Kouichi Kurose, Emi Isozaki, Masahiro Tohkin, and Morio Fukuhara
             Cloning and expression analysis of a new member of the cytochrome 
             P450, CYP2A15 from the Chinese hamster, encoding testosterone 7alpha-
             hydroxylase.
             Archives of Biochemistry and Biophysics (1999) Vol. 371 pp270-276
             91% identical to CYP2A9

CYP2A16      Mesocricetus auratus (Syrian hamster)
             GenEMBL D86952
             Masahiro Tohkin, Kouichi Kurose, Emi Isozaki, and Morio Fukuhara
             Molecular cloning, heterologous expression, and characterization of 
             a novel member of CYP2A in Syrian hamster.
             Biochimica et Biophysica Acta (1999) Vol.1446 pp438-442
             94% identical to CYP2A3

CYP2A17      Cricetulus griseus (Chinese hamster)
             No accession number
             Kouichi KUROSE
             86% identical to CYP2A14
             submitted to nomenclature committee 11/29/99

CYP2A18PC   human pseudogene
            AC008537 
            Hoffman S.M.G., Nelson, D.R. and Keeney, D.S.
            Organization, structure and evolution of the CYP2 gene cluster
            on human chromosome 19.
            Pharmacogenetics 11, 687-698 (2001)
            C-terminal part of P450 only.  This is the opposite end of the 
            pseudogene CYP2A18PN.  This gene appears to be split by a 2B6, 2B7P1 
            insertion. 

CYP2A18PN   human pseudogene
            AC008537 
            Hoffman S.M.G., Nelson, D.R. and Keeney, D.S.
            Organization, structure and evolution of the CYP2 gene cluster
            on human chromosome 19.
            Pharmacogenetics 11, 687-698 (2001)
            N-terminal part of P450 only.  This is the opposite end of the 
            pseudogene CYP2A18PC.  This gene appears to be split by a 2B6, 2B7P1 
            insertion.  This name replaces the old designations CYP2A7PT
            and CYP2A7PC.  There now seems to be only one copy of this pair
            in the sequenced human genome.

CYP2A18PN   human pseudogene (formerly CYP2A7PT)
            GenEMBL U22030(1192bp)
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)
            There are two human pseudogenes of 2A7 on chromosome 19.  They are 
            located adjacent to each other.  This one is telomeric.
            Note added 4/10/2001 This gene appears to be split by a 2B6, 2B7P1 
            insertion.  This name replaces the old designations CYP2A7PT
            and CYP2A7PC.  There now seems to be only one copy of this pair
            in the sequenced human genome.

CYP2A18PN   human pseudogene (formerly CYP2A7PC)
            GenEMBL U22044(1192bp)
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)
            There are two human pseudogenes of 2A7 on chromosome 19.  They are 
            located adjacent to each other.  This one is centromeric.
            Note added 4/10/2001 This gene appears to be split by a 2B6, 2B7P1 
            insertion.  This name replaces the old designations CYP2A7PT
            and CYP2A7PC.  There now seems to be only one copy of this pair
            in the sequenced human genome.

CYP2A19     Sus scrofa (pig)
            GenEMBL AB052255
            Misaki Kojima
            Submitted to nomenclature committee Oct. 27, 2000
            89% to human CYP2A13
            clone name c7

Cyp2a20pX    mouse
            GenEMBL NW_000310 (52646-53186)
      53186 MTLS (frameshift) MLLVAVLTCFIAMITMSVLR*KKLLGK
            MPPGPTPLPFLGNFLELDTKKFYDSFLRVVLGREM (0) 52988
      52810 IREWYGPVFTVHLGTYSAVVPWGYDVVKETLVDQAEQFSGRGEQAFLDWFFKGYG 52646
            renamed Cyp2a12-de1b2b

Cyp2a21-ps  mouse
            GenEMBL NW_000308.1, NW_033707.1, NT_039411.1
            93% to Cyp2a5 
            runs off end NW_000308.1|Mm7_WIFeb01_154 also on 
            NW_033707.1|MmUn_WIFeb01_40262
            t in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            between 2a22 and 2a12
NT_039411.1 + strand seq = 20,879bp runs off end
15607 FFLGKRGIEEHIQEEVGLLIDSFRKTNG 15690
15948 GAFIDTTFYLSRTVSNVISSIIFRDRFDYEDKEFLSLL*MMLGSFQFTATSMGQ 16109
17609 LYEMFSSVMKHLSGPQQQAFKELQGLEDFITKKVEHNQRTLDPNSPRDFIDSFLIRMLE 17785
19308 EKKNPNTEFYMKNLVLTTQNLFFAGTETVSTTLRYGFLLLMKHPDIE 19448
19888 AKVHKEIDWVTGRNWQPKYEDRMKMPYAEAVIHEIQRFADMIPMGLARRVTKDTKFRDFLLPK 20076
20678 GTEVFPMLGSVLKDPKFFFNPKDFNPKHFLDDKGQFKKSDAFVPFSIG 20821

Cyp2a22     mouse
            GenEMBL NW_000308.1|Mm7_WIFeb01_154
            Also on NT_039411.1 - strand
            93% to Cyp2a12
            between 2a5 and 2a12
NW_000308.1
MLGSGLLLVAILVFLSVMVLVSVWQQKIRGKLPPGPIPLPFIGNYLQLNRKDVYSSITQ 392
LQEHYGPVFTIHLGPRRVVVLYGYDAVKEALEDNAEEFSGRGEQATFNTLFKGYG 834
VTFSNGERAKQLRRFSIATLKDFGLGKRGMEERIQEEAGCLIKMLQGTC 1495
GAPIDPTMYLSKTVSNVISSIVFGDRFNYEDKEFLSLLQMMSQMNQFAASPTGQ 1874
LYDMFHSVMKYLPGPQQQIIKDSHKLEDFMIQKVKHNHSTLDPNSPRGFIDSFLIHMQK 3263
EKNFNSEFHMKNLVMTSLNLFFAGSETVSSLLRYGFLLLMKHPDVE 4834
AKVHEEIDRVIGRNRQPQYEDHMKMPYTQAVIHEIQR 5365
FSNFAPLGIPRRITKDTSFRGFFLPK 5443
GTDVFPIMGSLMIDPKFFSSPKDFNPQHFLDDKGQLKKIPAFLPFSI 6101
GKRSCLGYSLGKMQLFLFFTTILQNFRFKFPRKLEDINESPKPEGFTRIIP 7191
KYTMSFVPI* 7221

Cyp2a22-de1b2b  mouse
            GenEMBL NW_011833.1|MmUn_WIFeb01_20427
            between 2a22 and 2a5
            93% to Cyp2a12-de1b2b
            old name = Cyp2a23p 
            u in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
MLLVAILTCFIAMITMSVLR*RKVLGKIPPGPTPLPFLGNFLELDTKKFYDSFLRV
VLGREM
IRELYGPVFTVHLGTHSAVVPWGYDVVKEALVDQAEQFSGRGEQAFLDWFFKDYG

CYP2A23     Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            93% to CYP2A13, 92% to CYP2A6 human, possible ortholog of CYP2A13

CYP2A23     Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 1/11/2005
            Clone name mfCYP2A#1_27B2 
            98% to 2A23 Macaca mulatta 8 aa diffs
            note 2A23 and 2A24 are very similar to 2A6 and 2A13, but I
            cannot assign orthologs without mapping data.

CYP2A24     Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            94% to CYP2A6, 93% to CYP2A13 human, possible ortholog of CYP2A6

CYP2A24     Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 1/11/2005
            Clone name mfCYP2A#2_2-G10
            98% to 2A24 Macaca mulatta 8 aa diffs
            note 2A23 and 2A24 are very similar to 2A6 and 2A13, but I
            cannot assign orthologs without mapping data.

CYP2A23/24  Macaca fascicularis (cynomolgus monkey)
            PIR S36874 (13 amino acids)
            Ohmori, S., Horie, T., Guengerich, F.P., Kiuchi, M. and Kitada, M.
            Purification and characterization of two forms of hepatic microsomal 
            cytochrome P450 from untreated cynomolgus monkeys.
            Arch. Biochem. Biophys. 305, 405-413 (1993)
            Identical to first 13 aa of CYP2A23 or CYP2A24
            MLASGLLLVALLA

CYP2A25     Canis familiaris (dog)
            XM_541607.2, NM_001048027
            87% to CYP2A13 human 
            There is a second CYP2A in dog that is 91% to CYP2A13
            That seq is the probable ortholog of CYP2A13
            Dog cluster order 2S(-), 2B(+), 2G(+), 2A25(+), 2A13(-), 2F(+), 2T(+)
            Note: this seq is the same as Seq 1 sent by Tom Rushmore
            on 6/28/05 except for a short frameshifted region

CYP2A25    Canis familiaris (dog)
           NW_876270.1:43197750-43203984
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           88% to human 2A13 
MVASGILLVALLTCLTVMVLMSVWRQWKLLEKLPPGPTPLPFIGNYLQLNIQQMSDSFMKISKRYGPVFTIHLGP
RRVVVLCGYEAVKEALVDQAEEFSGRGAQATFDTLFKGYGVTFSNGERAKQLRRFSITTLRDFGVGKRGIEERIQ
EEAGFLIEALRGTRGAFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSMGQLCEMFHS
VIKYLPGPQQQAFKELQGLEDFITKKVEQNQRTLDPNSPRDFIDSFLIRMQEEQNNPNTEFHLKNLVLTTLNLFF
AGTETVSTTLRYGFLLLMKHPDVEAKVHEEIDRVIGKNRQPKFEDRAKMPYTEAVIHEIQRFGDIIPLSLARRVI
KDTKFREFLLPKGTEVFPMLGSVLRDAKFFSNPQDFHPQHFLDEKGQFKKSDAFVPFSIGKRYCFGEGLARMELF
LFLTTILQNFHFKSPQLPQDIDVSPKLVGLATIPRNYTMSFQPR*

2B Subfamily

CYP2B1 or 2 rat
            PIR A92255 (22 amino acids) B92255 (22 amino acids)
            Botelho, L.H., Ryan, D.E. and Levin, W.
            Amino acid compositions and partial amino acid sequences of
            three highly purified forms of liver microsomal cytochrome
            P-450 from rats treated with polychlorinated biphenyls,
            phenobarbital, or 3-methylcholanthrene.
            J. Biol. Chem. 254, 5635-5640 (1979)

CYP2B1 or 2 rat
            PIR A60822 (20 amino acids)
            Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and
            Oesch, F.
            Effect of nutritional imbalances on cytochrome P-450 isozymes
            in rat liver.
            Biochem. Pharmacol. 37, 3245-3249 (1988)

CYP2B2      rat
            GenEMBL S51970 (2946bp)
            Hoffmann,M., Mager,W.H., Scholte,B.J., Civil,A. and Planta,R.J.
            Analysis of the promoter of the cytochrome P-450 2B2 gene in the 
            rat.
            Gene Expr. 2, 353-363 (1992)
            promoter region, no coding sequence

CYP2B2      rat
            GenEMBL L28169 (1401bp)
            Shephard,E.E.A.
            unpublished (1993)
            promoter region

CYP2B2      rat
            GenEMBL I00525 (427bp)
            White,P.C., Dupont,B. and New,M.I.
            Genetic probe used in the detection of adrenal hyperplasia
            Patent: US 4720454-A 3 19-JAN-1988
            Includes I-helix region

CYP2B3      rat 
            GenEMBL U16209 to U16214
            Jean,A., Reiss,A., Desrochers,M., Dubois,S., Trottier,E., Trottier,Y.,
            Wirtanen,L., Adesnik,M., Waxman,D.J. and Anderson,A.
            Rat liver cytochrome P450 2B3: structure of the CYP2B3 gene and 
            immunological identification of a constitutive P450 2B3-like protein in
            rat liver.
            DNA Cell Biol. 13, 781-792 (1994)

CYP2B3-se1[9] rat
            exon 9 100% match to 2B3 chr1 (+)frag a in fig below
81263180 GKRMCLGEGIARSELFLFFTTILQNYSVSSPVDPNTIDMTPKESGLAKVAPVYKICFVAR* 81263362
rat, mouse and human 2ABFGST clusters

CYP2B3-se2[1] rat
            duplicate exon 1 100% match Chr1 (-)frag b in fig below
81308557 MDTSVLLLLAVLLSFLLFLVRGHAKVHGHLPPGPRPLPLLGNLLQMDRGGFRKSFIQ 81308387
rat, mouse and human 2ABFGST clusters

CYP2B4      rabbit
            GenEMBL L10912 (2026bp)
            Ryan,R., Grimm,S.W., Kedzie,K.M., Halpert,J.R. and
            Philpot,R.M.
            Expression and induction of cytochromes P450 2B and P450 4B,
            identification of P450 2B-Bx, and functional comparison of four
            highly related forms of P450 2B.
            unpublished (1993)

CYP2B4      rabbit 
            GenEMBL S64259 (2028bp) PIR S35666 (491 amino acids)
            Ryan,R., Grimm,S.W., Kedzie,K.M., Halpert,J.R. and Philpot,R.M.
            Cloning, sequencing, and functional studies of
            phenobarbital-inducible forms of cytochrome P450 2B and 4B
            expressed in rabbit kidney
            Arch. Biochem. Biophys. 304, 454-463 (1993)

CYP2B4      rabbit
            Swiss P00177 PIR S31277 (491 amino acids) S31278 (491 amino acids)
            PIR S31279 (491 amino acids)
            Gasser R., Negishi M., Philpot R.M.
            Primary structures of multiple forms of cytochrome P-450 isozyme 2
            derived from rabbit pulmonary and hepatic cDNAs.
            Mol. Pharmacol. 32, 22-30 (1988)

CYP2B5      rabbit

CYP2B6      human
            PIR S04579 (139 amino acids) PIR S04580 (170 amino acids)
            Miles, J.S., Spurr, N.K., Gough, A.C., Jowett, T., McLaren, A.W.,
            Brook,J.D. and Wolf, C.R.
            A novel human cytochrome P450 gene (P450IIB): chromosomal
            localization and evidence for alternative splicing.
            Nuc. Acids Res. 16, 5783-5795 (1988)

CYP2B6      human
            GenEMBL M29874
            Yamano,S., Nhamburo,P.T., Aoyama,T., Meyer,U.A., Inaba,T.,
            Kalow,W., Gelboin,H.V., McBride,O.W. and Gonzalez,F.J.
            cDNA cloning and sequence and cDNA-directed expression of human
            P450 IIB1: identification of a normal and two variant cDNAs derived
            from the CYP2B locus on chromosome 19 and differential expression
            of the IIB mRNAs in human liver.
            Biochemistry 28, 7340-7348 (1989)
            clone name hIIB1

CYP2B6      Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            91% to CYP2B6, probable ortholog of CYP2B6
            name changed to reflect orthology formerly CYP2B30

CYP2B6      Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 1/11/2005
            Clone name mfCYP2B6
            3 aa diffs to CYP2B6 Macaca mulatta

CYP2B6      Macaca fascicularis (cynomolgus monkey)
            No accession 
            Wu Zhicong
            Submitted to nomenclature committee 10/30/2006
            91% to human 2B6, 90% to human 2B7P1
            4 amino acid diffs to Yasuhiro Uno's seq

CYP2B6      Bos taurus (cow)
            See cattle page for details
MELSMLLLFALLTGLLVLLARGRPKAHGRLPPGPRPLPFLGNLLQMDRKGLLKSFLR
FQQKYGDVFTVYLGPRPVVIICGTEAIREALVDQAEVFSGRAKIAVVDPIFQGY
GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQDEAQCLVEELRKSQ
GALQDPVFYFHSITANIICSIVFGKRFDYRDPEFLRLLELLFQSFVLISSLSSQ
LFELYSSFLKYFPGSHRQIYKNLQEINVFIGRSVEQHRETLDPNAPRDFIDCYLLRMEKDKSNPQSQFDHQN
LIMSVLSLFFAGTETTSTTLRYGFLLMLKYPHITERIQKEIDQVIGSYR
PALDDRAQMPYTDAVIHEIQRFADLIPIGVPHMVTKDTHFRGYILPK
GTEVYPVLSSALHESCYFEKPDDFNPDHFLDANGVVKKNDAFMPFSI
GKRICLGEGIARIELFLFFTTILQNFSVASPVAPEDIDLTPQESGVGNVPPNYRIQFLPRQRG*

CYP2B7P1    human
            GenEMBL M29873
            Yamano,S., Nhamburo,P.T., Aoyama,T., Meyer,U.A., Inaba,T.,
            Kalow,W., Gelboin,H.V., McBride,O.W. and Gonzalez,F.J.
            cDNA cloning and sequence and cDNA-directed expression of human
            P450 IIB1: identification of a normal and two variant cDNAs derived
            from the CYP2B locus on chromosome 19 and differential expression
            of the IIB mRNAs in human liver.
            Biochemistry 28, 7340-7348 (1989)
            clone name hIIB3
            This entry was originally made then discontinued as 2B7PX because an article by      
            Miles et al. Nuc. Acids Res. 18, 189 (1990) showed evidence of alternative splicing 
            of CYP2B6.  I thought that this explained the difference.  However, on going back 
            and looking at the sequences and the EST data and mRNAs, there are clearly two 
            different genes in the 2B human subfamily.  M29873 has an in frame stop codon, 
            making it a pseudogene.

CYP2B7P     Bos taurus (cow)
            See cattle page for details
            stop codon same as in human 2B7
PALDDRAQMPYTDTVIHEIQRFADLISIGVSHMDAKDAHF*GYILPK

Cyp2b8      rat

Cyp2b9      mouse
            GenEMBL M60267 to M60273, also AH000038
            Lakso,M., Masaki,R., Noshiro,M. and Negishi,M.
            Structures and characterization of sex-specific mouse cytochrome
            P-450 genes as members within a large family. Duplication boundary
            and evolution
            Eur. J. Biochem. 195, 477-486 (1991)

Cyp2b9-de9b mouse
            GenEMBL XM_145463, XP_145463, NT_039410.1
            x in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            detritus exon 9 between Cyp2a4 and Cyp2b9
            old name = Cyp2b25p
NT_039410.1 - strand 
196560 SGTRICLGEGIARSELFLFFTTILQ 196486
196484 NFSVSSPVAPKDIDITLKESGLAKIPPVYKISFLAH* 196374

Cyp2b10     mouse
            GenEMBL M21856, PIR A60559 (15 amino acids)
            Bornheim, L.M. and Correia, M.A.
            Purification and characterization of a mouse liver cytochrome
            P-450 induced by cannabidiol.
            Mol. Pharmacol. 36, 377-383 (1989)
            Note: the genome of mouse has only one sequence for Cyp2b10 and Cyp2b20.
            They are derived from the same gene.  The Cyp2b10 mRNA M21856 appears
            to contain errors in the sequence.  No exact match for it can be found
            in the mouse genome.  This mRNA has an extra exon called exon 8b
            (27 nucleotides in the heme binding peptide region).  This appears to
            be an alternative splice variant of this gene.  The Cyp2b20 sequence
            matches the genomic sequence and represents the correct 2b10 sequence.
            The Cyp2b20 name has been discontinued and Cyp2b10 has been retained
            since it is the older of the two names.
GenEMBL M21856 (sequence Cyp2b10 was based on)
MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLLQMDRGGLLKSLIQ
LREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVAVVEPTFKEY
GVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS
QGAPLDPTFLFQCITANVICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQ
MFELFSGFLKYFPGAHRQISKNLQELLDYIGHSVERHKATLDPSVPRDFIDIYLLRMEK
EKSNQNAEFHHQNLMMSVLSLFFVGTETSSTTLHYGFLLMLKYPHVTEKVQKEIDQVIGS
HRLPTLDDRTKMPYSDAVIHEIQRFSDLIPIGVPHRVTKDTLFRGYLLPKNTEVYPIL
SSALHDPQYFEQPDSFNPDQFLDANGALKKSEAFLPFST
Exon 8b GQIFDQKSV
GKRICLGESIARSELFLFFTSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR

GenEMBL AK028103 from RIKEN (corrected Cyp2b10/Cyp2b20 sequence)
MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLL
QMDRGGLLKSFIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVA
VVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS
QGAPLDPTFLFQCITANIICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMFE
LFSGFLKYFPGAHRQISKNLQELLDYIGHSVEKHRATLDPSVPRDFIDIYLLRMEKEK
SNQHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIGS
HRLPTLDDRTKMPYTDAVIHEIQRFSDLIPIGVPHRVTKDTMFRGYLLPKNTEVYPIL
SSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFSTGKRICLGESIARNELFLFF
TSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR
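
            Note: the suspected errors in M21856 can be located by comparing it
            residue by residue with AK028103 over a region where the two are the
            same length (that is, before the exon 8b insertion).  The sketch below
            does this for the first 57 residues only, as an illustration.  The
            function name is hypothetical; a full comparison would need a proper
            alignment to handle exon 8b.

```python
def substitutions(a: str, b: str):
    """List (1-based position, residue in a, residue in b) where a and b differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length; align them first")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]


# First 57 residues of each sequence as listed above
m21856_nterm = "MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLLQMDRGGLLKSLIQ"
ak028103_nterm = "MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLLQMDRGGLLKSFIQ"

print(substitutions(m21856_nterm, ak028103_nterm))  # [(55, 'L', 'F')]
```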

CYP2B11    Canis familiaris (dog)
           NW_876270.1: 43114807-
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           78% to human 2B6
MELSVLLLLALLTGLLLLMARGHPKAYGHLPPGPRPLPILGNFLQMDRKGLLKSFLRLQEKYGDVFTVYLGPRRT
VMLCGIDAIREALVDNAEAFSGRGKIAVVEPVFQGYGVVFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEA
QCLVEELRKTEGVLQDPTFFFHSMTANIICSIVFGKRFGYKDPEFLRLMNLFYVSFALISSFSSQMFELFHSFLK
YFPGTHRQVYNNLQEIKAFIARMVEKHRETLDPSAPRDFIDAYLIRMDKEKAEPSSEFHHRNLIDSALSLFFAGT
ETTSTTLRYGFLLMLKYPHIAERIYKEIDQVIGPHRLPSLDDRAKMPYTDAVIHEIQRFGDLLPIGVPHMVTKDI
CFRGYIIPKGTEVFPILHSALNDPHYFEKPDVFNPDHFLDANGALKKNEAFIPFSIGKRICLGEGIARMELFLFF
TTILQNFSVASPMAPEDIDLTPQEIGVGKLPPVYQISFLSR*

CYP2B12     rat 
            GenEMBL S48369 X63545 (2528bp) Swiss P33272 (492 amino acids)
            PIR S27160 (492 amino acids)
            Friedberg,T., Grassow,M.A., Bartlomowicz-Oesch,B., Siegert,P,
            Arand,M., Adesnik,M. and Oesch,F.
            Sequence of a novel cytochrome CYP2B cDNA coding for a 
            protein which is expressed in a sebaceous gland, but not in the liver.
            Biochem. J. 287, 775-783 (1992)

CYP2B12-de9b rat
            exon 9 Chr1 (-) frag c in fig. below
81829155 GKFICLGEGIG*NESFIFFTGILQNLSLASPVAPENIDLTPIKSGAGKIPSTYQIHILSR 81829012
rat, mouse and human 2ABFGST clusters

Cyp2b13     mouse
            GenEMBL M60352 to M60358, also AH000037, NT_039410.1
            Lakso,M., Masaki,R., Noshiro,M. and Negishi,M.
            Structures and characterization of sex-specific mouse cytochrome
            P-450 genes as members within a large family. Duplication boundary
            and evolution.
            Eur. J. Biochem. 195, 477-486 (1991)

Cyp2b13-de1b2b7b mouse
            GenEMBL NT_039410.1 + strand
            y in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            detritus exons 1,2,7 between Cyp2b13 and Cyp2b26-ps
43894 XXXXXXDIFYMGAQPLLVLCGYEV*WEAPVDHSEVFLVYEDKAIIDPSSKKW 44031 ex 1
44377 XXFFVNGKPWNIVN*FLLTTTKDFEWKKRSIDNQIKVETLDLLLEC*KPHGDP 44529 ex 2
48130 LPVFVHWAQKPYTQASIHEIWRYGDFTHIG 48219 ex 7

CYP2B14X    rat 
            discontinued number see CYP2B16P

CYP2B14P    rat
            GenEMBL U33540
            Eric Trottier, Stéphane Dubois, Andréa Jean and Alan 
            Anderson 
            Identification of CYP2B14P and CYP2B16P, two apparent pseudogenes in 
            the rat cytochrome P450 2B (CYP2B) subfamily.
            Biochemical Pharmacology, 52, 963-965 (1996)
            exon 1, add Chr1 (+) exons 7,8,9 72% to 2B21 to this pseudogene
81706300 MKPNVLLLLAILLSFLLFLVRGHAKVHGHLPPGPRPLPILGNLLQMDRGGLLQSF 81706464
81728276 EKVQKEIGEVTGSHWFPILYSSKIPNTEAVIPEIQR 81728383
81728385 FSDLSSVVLPQRVTKDTFFQGFLLHK 81728462
81728634 NTEVYPILSSVLHDPQ 81728681 
81728681 VLEYPVTFNPEHFLDANGALKKNEAFTPFSR 81728773

CYP2B15     rat
            GenEMBL D17343 to D17349
            Nakayama,K., Suwa,Y., Mizukami,Y., Sogawa,K. and Fujii-
            Kuriyama, Y. 
            Cloning and sequencing of a novel rat cytochrome P450 2B-encoding 
            gene.
            Gene 136, 333-336 (1993)
            most similar to 2B12, 89% identical
MELGVLLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQLQ
EKYGDVFTVHLGPRPVVILCGTDTIREALVDQAEAFSGRGTVAVLHPVVQGYGVIFANGE
RWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYKALLNPTSIFQSIAANIIC
SIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQVFELFSGFLKYFPGVHKQISKNLQE
ILNYIDHSVEKHRATLDPNTPRDFINTYLLRMEKEKSNHHTEFHHQNLVISVLSLFFTGT
ETTSTTLRYSFLIMLKYPHVAEKVQKEIDQVIGSHRLPTLDDRTKMPYTDAVIHEIQRFA
DLIPIGLPHRVTNDTMFLGYLLPKNTEVYPILSSALHDPRYFDHPDTFNPEHFLDVNGTL
KKSEAFLPFSTGKRICLGEGIAQNELFIFFTAILQNFSLASPVAPEDIDLSPINSGISKI
PSPYQIHFLSRCVG

CYP2B16P    rat
            GenEMBL U33541 to U33546
            Eric Trottier, Stéphane Dubois, Andréa Jean and Alan Anderson 
            Identification of CYP2B14P and CYP2B16P, two apparent pseudogenes in the rat      
            cytochrome P450 2B (CYP2B) subfamily.
            Biochemical Pharmacology, 52, 963-965 (1996)
            note: previously called CYP2B14 in 1993 update.  This gene has a complete
            coding sequence but there is a defect in the splice junction in intron 1.
Exon 1 MEPSVLLLLAVLLSFLLLLVRGHAKIHGRLPPGPCPVPLLGNLLQMDRRGLLKSFIQLR
Exon 2 EKYGDVFTVHLGLRPVVVLCGTQTIREALVDHAEAFSGRGTIAGLEPVFQDYG
Exon 3 IFFSSGEQWKTLRRFSMATMRDFGMRKKSVEERIKEESQCLVEELKKYQG
Exon 4 APLDPTFLFQCITSNIICSIVFGECFDYTDHQFLHLLDLMYQTFSLLSSIFSQ
Exon 5 VFELFPGVLKYFPGAHRQISRNLHEILDFIGQSVEKHRATLDPNAPRDFIYTYLLHMEK
Exon 6 QKSNHYTEFHHWNLLSSVLSLFFAGTETSSTTLRYGFLIMLKYPHI
Exon 7 EKVQKEIDCVIGSHRLPTLDDRSKMPYTEAVIHEIQRFSDLAPIGTPHRVIKDTIFRGYLLPK
Exon 8 QNTEVFPILSSVLHDPQYFEQPDIFNLQHFLDANGALKIIEAFLPFSTGK
Exon 9 TGKRICLGESIARNELFLFFTTILQNFSVSSPVAPKDIDLTPKESGIGRIPQVYQICFLA

CYP2B17/2B6 Cercopithecus aethiops (African green monkey)
            PIR JT0676 (491 amino acids)
            Ohmori, S.; Sakamoto, Y.; Nakasa, H.; Horie, T.; Saito, K.; Kitada, M.
            Nucleotide and amino acid sequences of monkey P450 2B gene
            subfamily.
            Unpublished
            91% to human 2B6 probable ortholog

CYP2B18     guinea pig
            no accession number (437 amino acids)
            Oguri, K. 
            submitted to nomenclature committee

Cyp2b19     mouse
            GenEMBL AF047529, also NT_039410.1 + strand
            Keeney, D.S. (1998) The Novel Skin-Specific Cytochrome P450 
            Cyp2b19 Maps to Proximal Chromosome 7 in the Mouse, near a Cluster of 
            Cyp2 Family Genes.
            Genomics 53, 417-419.
            Between 2b23 and 2g1

Cyp2b19-de7b8b9b mouse
            GenEMBL NT_039410.1
            old name = Cyp2b24p 
            v in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            detritus exons 7,8,9 between 2b19 and 2b23
NT_039410.1 + strand 
695673 EKVQKETDQVIGSHQLPTLDDRTKMPYTDTVIHEIQRFSDLAAIDLPHRVTIHTLSQVYLLPK 695861
696036 NTEVYPILSSVLLDP 696080
696083 QYFEQLDCFNPEHFLDANGTLKKSEAFLPFST 696178
702801 GKHVCLGKGIAHNELFLFFPTILQNFPVSVPLAPKDIDITPKESGTGKIPQCTRSAS 702971

Cyp2b20X    mouse
            GenEMBL X99715 (1416bp)
            Damon,M., Fautrel,A., Marc,N., Guillouzo,A. and Corcos,L.
            Isolation of a new mouse cDNA clone: hybrid form of cytochrome P450
            2b10 and NADPH-cytochrome P450 oxidoreductase
            Biochem. Biophys. Res. Commun. 226 (3), 900-905 (1996)
            This clone has a part of the NADPH cytochrome P450 reductase on the
            opposite strand at the end of the P450 sequence.
            note: this sequence was accidentally given the name Cyp2b19.  That 
            name is assigned to a mouse keratinocyte P450 cloned by Diane Keeney.
            The reductase sequence at the end of this gene seems to be a cloning 
            error, because it cannot be found in the genomic DNA sequence.
            Cyp2b20 has been merged with Cyp2b10.  Though the Cyp2b20 sequence 
            is more like the genomic sequence, the Cyp2b10 name has precedence.

            GenEMBL AF128849
            Marc,N., Damon,M., Fautrel,A., Guillouzo,A. and Corcos,L.
            Isolation of a cyp2b10-like cDNA and of a clone derived from a
            cyp2b10-like pseudogene
            Biochem. Biophys. Res. Commun. 258 (1), 11-16 (1999)
            This sequence is 100% identical to Cyp2b20 and 97% identical to   
            Cyp2b10
MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLL
QMDRGGLLKSFIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVA
VVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS
QGAPLDPTFLFQCITANIICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMFE
LFSGFLKYFPGAHRQISKNLQELLDYIGHSVEKHRATLDPSVPRDFIDIYLLRMEKEK
SNQHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIGS
HRLPTLDDRTKMPYTDAVIHEIQRFSDLIPIGVPHRVTKDTMFRGYLLPKNTEVYPIL
SSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFSTGKRICLGESIARNELFLFF
TSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR

Cyp2b20X    mouse  
            GenEMBL AK028103 100% identical to AF128849
            Now renamed Cyp2b10 (the corrected sequence)
MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLL
QMDRGGLLKSFIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVA
VVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKS
QGAPLDPTFLFQCITANIICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMFE
LFSGFLKYFPGAHRQISKNLQELLDYIGHSVEKHRATLDPSVPRDFIDIYLLRMEKEK
SNQHTEFHHQNLMMSVLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIGS
HRLPTLDDRTKMPYTDAVIHEIQRFSDLIPIGVPHRVTKDTMFRGYLLPKNTEVYPIL
SSALHDPQYFEQPDSFNPDHFLDANGALKKSEAFLPFSTGKRICLGESIARNELFLFF
TSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR

Cyp2b20p1X  mouse
            GenEMBL AF129405
            Marc,N., Damon,M., Fautrel,A., Guillouzo,A. and Corcos,L.
            Isolation of a cyp2b10-like cDNA and of a clone derived from a
            cyp2b10-like pseudogene
            Biochem. Biophys. Res. Commun. 258 (1), 11-16 (1999)
            This sequence is 100% identical to Cyp2b20 from amino acid 64 on.
            This sequence is partial, starting at amino acid 60 with a stop codon
            at amino acid 63.  Full length cDNAs AK028103 and AF128849 do not
            have this stop codon and it is not found in genomic DNA.
            This probably represents a sequence derived from the Cyp2b10 gene.

CYP2B21     rat
            GenEMBL AF159245
            Nicola Brookman Amissah and Peter Swann

CYP2B22     Sus scrofa (pig)
            GenEMBL AB052256
            Misaki Kojima
            Submitted to nomenclature committee Oct. 27, 2000
            78% to rabbit CYP2B4
            clone name c780

Cyp2b23     mouse
            NW_000307 618973-640139, also XM_145466
            Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman
            Next to Cyp2b19-de7b8b9b and 2b19 on chr 7

Cyp2b24pX   mouse 
            NW_000307 692575-699876
            Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman
            Next to 2b19 on chr 7
            Renamed Cyp2b19-de7b8b9b

Cyp2b25pX   mouse
            NW_000307 195792-195980
            Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman
            Next to 2b9 on chr 7
            Renamed Cyp2b9-de9b

Cyp2b26-ps  mouse
            GenEMBL AC087157 22100-26200
            Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman
            Between 2b9 and 2b13 on chr 7

Cyp2b27-ps  mouse
            NW_000303 2122792-2130037
            Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman
            Between 2b13 and 2b28-ps on chr 7

Cyp2b28-ps  mouse
            NW_000303 2064442-2094900
            Haoyi Wang, Kyle Donley, Diane Keeney, Susan Hoffman
            Between 2b27-ps and 2b10 on chr 7

CYP2B29     hamster
            No accession number
            Pedro Dominguez
            Submitted to nomenclature committee Dec. 17, 2002
            77% to cyp2b10

CYP2B30X    Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            91% to CYP2B6, probable ortholog of CYP2B6
            name changed to reflect orthology = CYP2B6

CYP2B31     rat
            86% to 2b19 possible ortholog
81918041 MELGVFLLLTFTVGFLLLLASQNRPKTHGHLPPGPRPLPFLGNLLQMNRRGLLRSFMQ 81918214
81919826 LQEKYGDVFTVHLGPRPVVILCGTDTMREALVDQAEAFSGRGTVAVLHPVVQGY 81919987
81920130 GVIFANGERWKILRRFSLVTMRNFGMGKRSVEERIKEEAQCLVEELKKYK 81920279
81922129 GALLNPTSIFQSIAANIICSIVFGERFDYKDHQFLRLLDLIYQTFSLMGSLSSQ 81922290
81923031 VFELFSGFLKYFPGVHKQISKNLQEILNYIDHSVEKHRATLDPNTPRDFIDTYLLHMEK 81923207
81923977 EKSNHHTEFHHQNLVISVLSLFFAGTETTSTTLRYSFLIMLKYPHVA 81924117
81926113 EKVQKEIDQVISSHRLPTLDDRIKMPYTDAVIHEIQRFADLAPIGLPHRVTKDTMFRGYLLPK 81926301
81926476 NTEVYPILSSALHDPRYFDHPDTFNPEHFLDANGTLKKSEAFLPFST 81926616
81930286 GKRTCLGEGIARNELFIFFTALLQNFSLASPVAPEDIDLTPINSGAGKIPSPYQINFLSR 81930465

CYP2B32P    rat
            pseudogene partial Chr1 (+)
81806528 VLLLLTLIVGFLLFLVSQSQPKTHGHLPPGLCPLPFLGNLLQIKRRGLLNSFMQ 81806689
81808348 AQEKYGDVLTVHPGPRPVVRLCGTDTIREFLFDQAGTFSGQGTVAVLNPVVHGY 81808509
exon 3 missing
81809871 GVPLIPTSFFQRIAANIICSIVFGECFDYKDHQFLHLLDLIYQTFALMAPCPARS 81810035
81810759 VFQLFSGFLKYFPGVHKQISKNLQEILNYIGHSVEKHMATLDPSAPRDFINTYLLHMEN 81810935
81811666 EKSNHHTEFHHQTSVLSHFFDGTETTSTTLCCSFLIMLKYHHVK 81811797

CYP2B       guinea pig
            Swiss P34033 (20 amino acids)
            Narimatsu S., Akutsu Y., Matsunaga T., Watanabe K., Yamamoto I.,
            Yoshimura H.
            Purification of a cytochrome P450 isozyme belonging to a subfamily of 
            P450IIB from liver microsomes of guinea pigs.
            Biochem. Biophys. Res. Commun. 172, 607-613 (1990)
            PIR S28205 (31 amino acids)
            Yamada, H., Kaneko, H., Takeuchi, K., Oguri, K. and Yoshimura,H.
            Tissue-specific expression, induction, and inhibition through
            metabolic intermediate-complex formation of guinea pig
            cytochrome P450 belonging to the CYP2B subfamily.
            Arch. Biochem. Biophys. 299, 248-254 (1992)
            Note:  These two fragments are identical over the first 20 amino acids.

Cyp2b       mouse
            PIR A21630 (25 amino acids)
            Stupans, I., Ikeda, T., Kessler, D.J. and Nebert, D.W.
            Characterization of a cDNA clone for mouse
            phenobarbital-inducible cytochrome p-450b.
            DNA 3, 129-137 (1984)
            This fragment has one amino acid difference with 2b-9, 2b-10 and 2b-13

Cyp2b       mouse
            GenEMBL M60359 (997bp)
            Lakso,M., Masaki,R., Noshiro,M. and Negishi,M.
            Structures and characterization of sex-specific mouse cytochrome
            P-450 genes as members within a large family. Duplication boundary
            and evolution.
            Eur. J. Biochem. 195, 477-486 (1991)
            N-terminal 57 amino acid fragment very similar to Cyp2b-13.

CYP2b       scup (fish Stenotomus chrysops)
            N-terminal fragment (20 amino acids)
            Klotz et al. Arch. Biochem. Biophys.  249, 326-338 (1986)

2C Subfamily

CYP2C1      rabbit 
            GenEMBL D26152 (1695bp)
            Noshiro,M., Ishida, H. and Okuda, K.
            unpublished (1993)

CYP2C2      rabbit 

CYP2C3      rabbit 

CYP2C4      rabbit 

CYP2C5      rabbit
            GenEMBL M55664 (2340bp)
            Pendurthi,U.R., Lamb,J.G., Nguyen,N., Johnson,E.F. and Tukey,R.H.
            Characterization of the CYP2C5 gene in 21L III/J rabbits: Allelic
            variation affects the expression of P450IIC5
            J. Biol. Chem. 265, 14662-14668 (1990)

CYP2C5      rabbit
            PIR S16715 (143 amino acids) PIR S20227 (145 amino acids)
            Zhao, J., Leighton, J.K. and Kemper, B.
            Characterization of rabbit cytochrome P450IIC4 cDNA and
            induction by phenobarbital of related hepatic mRNA levels.
            Biochem. Biophys. Res. Commun. 146, 224-231 (1987)

CYP2C6      rat
            PIR A41425 (17 amino acids)
            Imaoka, S., Kamataki, T. and Funae, Y.
            Purification and characterization of six cytochromes P-450
            from hepatic microsomes of immature female rats.
            J. Biochem. 102, 843-851 (1987)
rat 2C cluster in chromosome order

CYP2C6v1_v1-de1b2b3b4b5b rat
upstream pseudogene frag o, 96% identical to seq c
93% identical to seq upstream of CYP2C6v2 allele (temp name = CYP2Cnewb)
243935799 MDLVMLLVLTLSCLIFLSIWRQSSGRGKLP 243935888
243935888 SGPTPLPIIGNFFHLDLKNITQSLTN 243935965
243937699 FSKVNGSVFTLYFGMKPIVILHGYEAIKEGLIDHGEEFTERGSFPVAEKINKGL 243937860
243938035 GIAFSHGNRWKEIRRFTLMTLQNLGMGKKSIEDRVQEESRCLV 243938163
243939079 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLVEKLNENIKIVSSPWI* 243939231
243940291 FCSSFPVFIDYCPGSHMTLAKNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLINWKQ 243940467

CYP2C6v1_v1  rat
             GenEMBL M13711 
two aa changes to match many ESTs (lower case mi) 
due to frameshift 97% to 2C77 and 2C6v2
243955584 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS 243955751
243964779 FSKVYGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKD 243964937
243965112 LGIVFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTN 243965264
243966104 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 243966265
243967336 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 243967512
243984646 ENHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKCPEVT 243984786
243989157 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAmiHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 243989345
243990948 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 243991088
243992245 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL 243992424

CYP2C6v2-de1b2b3b4b4c5b rat
upstream pseudogene 
EST CK224599.1 = 100% match (with 4 frameshifts) so this is a real gene
clone_lib="RALIUNN03 Sprague-Dawley rat female liver 
The CYP2C6_v1 sequence is also seen in this same mRNA library
This GNOMON prediction adds two upstream exons that do not belong to this gene
58596732 MDLVMLLVLTLSCLILLSIWRQSSGRGKHP 58596643 exon 1 frameshift
58596643 SGPTPLPIIGNFFHLDLNNITQSLTS (0) 58596566 exon 1
58594823 FSKVNGSVFTLYFGMKLIVILHGYAATKEGLIDHGEEFTKRGSFPVAEKINKGL (1) exon 2 58594662
58594487 GIAFSHGNRWKEIRRFTLMTLQNLGMGKESIEDRVQEETQCLV*ELRKTN (1) exon 3 58594338
58593451 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58593296
58592013 GSPCDPTFILGCAPCNVICSIIFQNCFDYKDQDFLSLMEKLNENIKIVSSPW 58591858
58590797 FCSSFPVFIDYCLGSHMTLA 58590738
58590736 NVYHTRNYILKKIKEHQESLDVTNPHDFIDYDLIKWKQ 58590620

CYP2C6v2  rat
allele not in figure, 13 aa diffs to CYP2C6_v1 XM_215255 NW_047916 
we are assigning this allele status but it may be a separate gene
(temp name = CYP2Cnewb)
58578624 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 58578457
58576741 FSKVYGPVFTLYFGLKPTVILHGYEAVKEALIDHGEEFAERGSFPVVEKINKDL (1) 58576583
58576405 GIAFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDHVQEEARCLVEELRKTN 58576256
58575415 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKVLSSPWTQ 58575254
58574189 FCSFFPVLIDYCPGSHTTLAKNIYYIRNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQ 58574013
58554666 ESHNPHLEFTLENLSVTVTDLFGAGTETTSTTLRYALLLLLKYPEVT 58554526
58534931 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 58534743
58533131 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 58532991
58531833 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 58531654

CYP2C6P   rat
          GenEMBL M18336 J03509 M18774 
an alternate splice version of 2C6
exon 8 is skipped and replaced by a cryptic exon just past the true exon 8
The GT boundary of the true exon 8 is the first two nucleotides of CYP2C6_v3
Cryptic exon 8
     MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTSFSKV 200
 201 YGPVFTLYFGTKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDLGIVFSHGNRW 380
 381 KEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEELRKTNGSPCDPTFILGCAPCNVICS 560
 561 IIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQFCSFFPVLIDYCPGSHTTLAKNVYHI 740
 741 RNYLLKKIKEHQESLDVTNPRDFIDYYLIKWKQENHNPHSEFTLENLSITVTDLFGAGTE 920
 921 TTSTTLRYALLLLLKCPEVTAKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFID 1100
1101 LIPTNLPHAVTCDIKFRNYLIPK 1169

CYP2C6_v2 CK224594.1 CK224593.1 note: the _v2 means alternative splice version 2

CYP2C6_v3 CK224595.1 CK224596.1 (3 nuc shorter at the joint uses the second AG)
               Beginning of exon 7 AGCTAAAG TCCAGGAAGA GATTGATCGT  243989183
GTGGTTGGCA AACATCGCAG CCCTTGCATG CAGGACAGGA GCCGCATGCC CTACACAGAT  243989243
GCCATGATTC ATGAGGTCCA GAGGTTCATT GACCTCATTC CTACCAACCT GCCACATGCG  243989303
GTGACCTGTG ACATTAAGTT CAGGAACTAC CTAATACCCA AG GT end of exon 7
Beginning of cryptic exon out of frame       agcaggtaa tagaaactca  243991103
tttccatggt tccagtgaca tgcagaaccg tggggactta gagtgtgact ctacatgtgc  243991163
tgatagcttg catctgcatg ataaggagca taattttcat tgtgtatgca ctgtcctgga  243991223
tatgaccacc ttctttatca gggt    end of cryptic exon
normal exon 9
1328 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLHPKDIDTTPVFNGFASLPPFYELCFIPL
rat 2C cluster in chromosome order
see this link for color coded figure of intron boundaries

>interval between 2C6 and 2C77

CYP2C6-se1[1:2:3:2:3] rat
frag n exons 1,2,3 2C6 like pseudogene plus strand exon 2,3 100% to seq m
244044941 MDHTTGTYTLSLILLSL*RQSSGRGKIPPGPTPLPIIDNLLQLDIKNVTQYLAN (0) 244045102
244050420 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244050581
244050793 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244050873
frag m Exons 2,3 2C6 like pseudogene 100% to seq n
244052306 LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL 244052467
244052679 XXXXXXXXXXXXXKTFTLMTLQNLRMGKGNIEDHVQE*AQ 244052759
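
Note on the "100% to seq m" comparison: the percent values in this list were
presumably taken from sequence alignments.  As a rough illustration only, the
sketch below computes identity for two ungapped fragments of equal length, which
is enough to confirm that the exon 2 peptides of frag n and frag m above are the
same string.  The function name is hypothetical, no alignment is performed, and
X and * characters are compared like any other character.

```python
def percent_identity(seq_a, seq_b):
    """Percent of positions carrying the same character.

    Assumes the two sequences are already in register and of equal,
    non-zero length; no gaps are introduced and no alignment is done.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length (no alignment is done)")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


# Exon 2 peptides copied from the frag n and frag m lines of this entry.
frag_n_exon2 = "LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL"
frag_m_exon2 = "LSKVHGPVLTLYFWMKSNVVLHVDEAVNEDLIDHGE*FAVRRSIPLAEKLIKAL"

if __name__ == "__main__":
    print(percent_identity(frag_n_exon2, frag_m_exon2))
```

For fragments of unequal length, or with insertions and deletions, a proper
alignment tool would be needed before any percentage is meaningful.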

CYP2C7      rat
            GenEMBL X12595 (1179bp)
            Stroem,A., Nilsson,A.G. and Zaphiropoulos,P.
            5' flanking sequence of the gene for rat cytochrome p-450f
            Nucleic Acids Res. 0, 0-0 (1988)

CYP2C7      rat
            PIR S24582 (66 amino acids)
            Stroem, A.
            unpublished

CYP2C7      rat
            PIR A60563 (56 amino acids)
            Westin, S., Stroem, A., Gustafsson, J.A., and Zaphiropoulos, P.G.
            Growth hormone regulation of the cytochrome P-450IIC
            subfamily in the rat: inductive, repressive, and
            transcriptional effects on P-450f (IIC7) and P-450-PB1
            (IIC6) gene expression.
            Mol. Pharmacol. 38, 192-197 (1990)

CYP2C7      rat
            PIR A27425 (23 amino acids)
            Favreau, L.V., Malchoff, D.M., Mole, J.E. and Schenkman, J.B.
            Responses to insulin by two forms of rat hepatic microsomal
            cytochrome P-450 that undergo major (RLM6) and minor
            (RLM5b) elevations in diabetes.
            J. Biol. Chem. 262, 14319-14326 (1987)

CYP2C7     rat
           GenEMBL M18335 
exons 1,2,3 and 6 are in sequence gaps 93% to 2C7 variant and 2C81
MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK  
          FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMNENVTKGF   
          GIVFSNGNRWKEMRRFTIMNFRNLGIGKRNIEDRVQEEAQCLVEELRKTK       
243849546 GSPCDPSLILNCAPCNVICSITFQNYFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243849385
243847566 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 243847390
243829444 GSPCDPSLILNCAPCNVICSITFQNHFDYKDKEMLTFMEKVNENLKIMSSPWMQ 243829283 
this duplicate exon 4 is not in the right sequence order
          ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT
243803857 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFINFVPTNLPHAVTCDIKFRNYLIPK 243803669
243800623 GTKVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 243800483
243799465 GKRACVGEGLARMQLFLFLTTILQNFNLKSLVHPKDIDTMPVLNGFASLPPTYQLCFIPS 243799286

CYP2C7-de7b     rat
frag r Exon 7 (+) 100% to seq a CYP2C81-de7b
243792966 RVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 243793151

CYP2C7     rat
variant unmapped 93% to 2C7 88% to 2C81
3463873 MDLVTFLVLTLSSLILLSLWRQSSRRRKLPPGPTPLPIIGNFLQIDVKNISQSLTK 3464040 
3479907 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF   3480068
3480234 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK       3480383
3489182 GSPCDPSLILNCAPCNVICSITFQSHFDYKDKEMLTFMEKVNENLKIMSSPWMQ   3489343
3491162 VCNSFPSLVDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVDYYLIKQKQ 3491338
3505354 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 3505494
3406504 AKVQEEIDRVVGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPK 3406692
3408304 GTTIITSLSSVLHDSKEFPDPEIFDPGHFLDGNGKFKKSDYFMPFSA 3408444
3409602 GKRMCAGEGLARMELFLFLTTILQNFKLKSVLQPKDIDTTPVFPGFASLPPFYELCFIPS 3409778

CYP2C7-se1[6:7:9] rat
frag j exons 6,7,9 (6,7 and 9 have 1 aa diff to 2C7) 
244103321 ANNIEQSEYSHENLTCSIMDLIGAGTETMSTTLRYALLLLMKYPHVT 244103461
244120225 AKVQEEIDRVIGRHRSPCMQDRKHMPYTDAMIHEVQRFIDFVPTNLPHAVTCDIKFRNYLIPK 244120413
244124319 FLXXXLQNFNLKSLXHPKDIDTMPVLNXXASLPPTYQLCFIPS 244124447

CYP2C7-se2[2:3] rat
frag k exons 2,3 = 100% to 2C7 variant, 2 aa diffs to 2C7 exons 2,3
244064158 FSKTYGPVFTLYLGSQPTVILHGYEAIKEALIDNGEKFSGRGSYPMIENVTKGF 244064319
244064485 GIVFSNGNRWKEMRRFTIMTFRNLGIGKRNIEDRVQEEAQCLVEELRKTK 244064634

CYP2C7-se3[8]   rat
frag t Exon 8 minus strand 82% to 2C7
243749788 TIVIT*LTSVLHDSKKFPNPEMLDSGHFLDENGNFKKSEYFMPFSA 243749651

CYP2C7-se4[8:9] rat
frag u Exon 8 minus strand exon 8 = 87% to frag 2, 8+9 = 63% to 2C7
243726168 GVMVITSLSSALHDNKEFPNPKRFDPG*FLDRNGNFKKTDYFILFSA 243726028
Exon 9 minus strand 60% to 2C7
243723025 CVGEGLTPIELFLFLTRILQNFNLKHLTHTEAVDTTPVLSRLTSVSPALKLFFIP 243722861

CYP2C8      human
            PIR S15075 (56 amino acids)
            Ged, C. and Beaune, P.
            Isolation of the human cytochrome P-450 IIC8 gene: multiple
            glucocorticoid responsive elements in the 5' region.
            Biochim. Biophys. Acta 1088, 433-435 (1991)

CYP2C8      human
            GenEMBL Y00498 (1866bp)
            Kimura,S., Pastewka,J., Gelboin,H.V. and Gonzalez,J.
            cDNA and amino acid sequences of two members of the human P450IIC
            gene subfamily
            Nucleic Acids Res. 15, 10053-10054 (1987)

CYP2C8      human
            PIR S16902 (349 amino acids)
            Shephard, E.A., Phillips, I.R., Santisteban, I., Palmer,
            C.N.A. and Povey, S.
            Cloning, expression and chromosomal localization of a member
            of the human cytochrome P450IIC gene sub-family.
            Ann. Hum. Genet. 53, 23-31 (1989)

CYP2C8      human
            no accession number
            D.C. Zeldin, R.N. Dubois, J.R. Falck, and J.H. Capdevila. 
            Molecular Cloning, Expression, and Characterization of an Endogenous Human
            Cytochrome P450 Arachidonic Acid Epoxygenase Isoform.
            Arch. Biochem. Biophys. 322: 76-86 (1995)

CYP2C8-de6b human 
            GenEMBL NT_008769.11|Hs10_8926
            detritus exon 6 between 2C9 and 2C8
            old name CYP2C60P
            8439669 EKDNQPLKFTIENLVGNVPDLFVAGTEMTSTTLRYGLLLLLKHPELT 8439809

CYP2C8      Cercopithecus aethiops (African green monkey)
            DQ022200.1
            Booth-Genthe,C.L., Peteraf,S. and Tang,C.
            Merck Research Laboratories
            92% to human CYP2C8, 78% to human CYP2C19

CYP2C8/2C20  Macaca fascicularis (cynomolgus monkey)
            GenEMBL S53046 (1901bp) Swiss P33262 (490 amino acids)
            PIR S28166 (490 amino acids)
            Komori,M., Kikuchi,O., Sakuma,T., Funaki,J., Kitada,M.
            and Kamataki,T.
            Molecular cloning of monkey liver cytochrome P-450 cDNAs:
            similarity of the primary sequences to human cytochromes P-450.
            Biochim. Biophys. Acta 1171, 141-146 (1992)
            Note: As comparisons between primates begin to involve large
            scale sequencing, the CYP2C20 genes assigned earlier 
            to two Macaca species appear to be orthologous to human CYP2C8.  
            I have acknowledged this by using the 2C8 name.
            Since both names will be in the literature, both will be 
            kept, but 2C8 is now the preferred name.

CYP2C8/2C20   Macaca fascicularis (cynomolgus monkey)
            PIR A60466 (22 amino acids)
            Ohi, H., Toratani, S., Komori, M., Miura, T., Kitada, M. and
            Kamataki, T.
            Comparative study of cytochrome P-450 in liver microsomes. A
            form of monkey cytochrome P-450, P-450-MK1,
            immunochemically cross-reactive with antibodies to rat
            P-450-male.
            Biochem. Pharmacol. 38, 361-365 (1989)
            Note: As comparisons between primates begin to involve large
            scale sequencing, the CYP2C20 genes assigned earlier 
            to two Macaca species appear to be orthologous to human CYP2C8.  
            I have acknowledged this by using the 2C8 name.
            Since both names will be in the literature, both will be 
            kept, but 2C8 is now the preferred name.

CYP2C8/2C20  Macaca mulatta (rhesus monkey) name change from CYP2C74
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            91% to CYP2C8, 78% to CYP2C19, probable ortholog of CYP2C8
            formerly CYP2C74.  There are only 3 amino acid differences to 
            Macaca fascicularis (cynomolgus monkey) GenEMBL S53046
            Since this is the clear ortholog of that earlier sequence 
            the name has been changed to reflect the orthology.
            Note: As comparisons between primates begin to involve large
            scale sequencing, the CYP2C20 genes assigned earlier 
            to two Macaca species appear to be orthologous to human CYP2C8.  
            I have acknowledged this by using the 2C8 name.
            Since both names will be in the literature, both will be 
            kept, but 2C8 is now the preferred name.

CYP2C8      Callithrix jacchus (white-tufted-ear marmoset)
            GenEMBL AB242600, release date 2006-11-19
            Narimatsu, S., Torigoe, F., Hanioka, N. and Miyata, A.
            88% to 2C8 of Cercopithecus aethiops, 87% to 2C8 human
            78% to 2C9 human, 77% to 2C18, 77% to 2C19

CYP2C9      human
            GenEMBL S46963 (1814bp) PIR A48390 (477 amino acids)
            B48390 (475 amino acids)
            Ohgiya,S., Komori,M., Ohi,H., Shiramatsu,K., Shinriki,N. and
            Kamataki,T.
            Six-base deletion occurring in messages of human cytochrome P-450
            in the CYP2C subfamily results in reduction of tolbutamide
            hydroxylase activity.
            Biochem. Int. 27, 1073-1081 (1992)

CYP2C9      human
            GenEMBL L16877 to L16883
            Goldstein,J.A., Raucy,J.L., Blaisdell,J.A., Faletto,M.B. 
            and Romkes,M.
            Cloning and expression of complementary DNAs for multiple 
            members of the human cytochrome P450IIC subfamily.
            Biochemistry 30, 3247-3255 (1991)

            de Morais,S.M., Schweikl,H., Blaisdell,J.A. and Goldstein,J.A.
            Gene structure and upstream regulatory regions of human 
            CYP2C9 and CYP2C18.
            Biochem. Biophys. Res. Commun. 194, 194-201 (1993)

CYP2C9      human
            PIR B61265 (225 amino acids)
            Srivastava, P.K., Yun, C.H., Beaune, P.H., Ged, C. and
            Guengerich, F.P.
            Separation of human liver microsomal tolbutamide hydroxylase
            and (S)-mephenytoin 4'-hydroxylase cytochrome P-450
            enzymes.
            Mol. Pharmacol. 40, 69-79 (1991)
            2C10 has D at position 417 while 2C9 has G.  This sequence does not 
            include position 417.  The only other amino acid difference between 2C9 
            and 2C10 is at position 358 where 2C9 has Y and 2C10 has C.  This 
            sequence has Y at 358.

CYP2C9      human
            PIR S26634 (29 amino acids) PIR S23777 (25 amino acids)
            Shimada, T., Misono, K.S. and Guengerich, F.P.
            Human liver microsomal cytochrome P-450 mephenytoin
            4-hydroxylase, a prototype of genetic polymorphism in
            oxidative drug metabolism.
            J. Biol. Chem. 261, 909-921 (1986)

CYP2C9      human
            PIR S39377 (20 amino acids)
            Sandhu, P., Baba, T. and Guengerich, F.P.
            Expression of modified cytochrome P450 2C10 (2C9) in
            Escherichia coli, purification, and reconstitution of
            catalytic activity.
            Arch. Biochem. Biophys. 306, 443-450 (1993)

CYP2C9-de1b human
            GenEMBL NT_008769.11|Hs10_8926 
            same as AL133513.12, might work for alt splice
            detritus exon 1 32kb upstream of 2C9
8335895 MDPAVALVLCLSCLFLLSLWRQSSGRGRLLFGPTPLLIIGNILQLDVKDMSKSLTNVSMLYAPL 8336086

CYP2C9-de2c3c human
            GenEMBL NT_008769.11|Hs10_8926 
            detritus exons 2,3 between 2C9 and 2C8
            old name CYP2C59P
8437561 LSQFSKVYVPVFTVYFDIKLVLELHGYEVVKEALIDHGEEFSGKGIFPVSKKS**G 8437394
8437211 FRIIFSNGKRCKDIWLFLLMTLWNCRMVKRS 8437119
8437115 MEKHVQGEAQCLRQELRRTK 8437058

CYP2C9      Macaca fascicularis (cynomolgus monkey)
            No accession 
            Wu Zhicong
            Submitted to nomenclature committee 10/30/2006
            93% to human 2C9, 91% to 2C19, 81% to 2C18, 76% to 2C8

CYP2C10X     human
            PIR A61265 (79 amino acids)
            Srivastava, P.K., Yun, C.H., Beaune, P.H., Ged, C. and
            Guengerich, F.P.
            Separation of human liver microsomal tolbutamide hydroxylase
            and (S)-mephenytoin 4'-hydroxylase cytochrome P-450
            enzymes.
            Mol. Pharmacol. 40, 69-79 (1991)
            2C10 has D at position 417 while 2C9 has G.  This sequence shows the D at 
            position 417.  The only other amino acid difference between 2C9 and 2C10
            is at position 358 where 2C9 has Y and 2C10 has C.  This sequence does 
            not include the 358 region.
            The existence of the 2C10 gene is in doubt.  Others have searched 100 samples
            for it without finding it, so this gene may not exist.

CYP2C11     rat
            GenEMBL S68251 (139bp)
            Habib,S.L., Srikanth,N.S., Scappaticci,F.A., Faletto,M.B.,
            Maccubbin,A., Farber,E., Ghoshal,A.K. and Gurtoo,H.L.
            Altered expression of cytochrome P450 mRNA during chemical-induced
            hepatocarcinogenesis and following partial hepatectomy
            Toxicol. Appl. Pharmacol. 124, 139-148 (1994)

CYP2C11     rat
            PIR A60782 (500 amino acids)
            Stroem, A., Mode, A., Zaphiropoulos, P., Nilsson, A.G.,
            Morgan, E., Gustafsson, J.A.
            Cloning and pretranslational hormonal regulation of
            testosterone 16alpha-hydroxylase (P-450-16alpha) in male
            rat liver.
            Acta Endocrinol. 118, 314-320 (1988)

CYP2C11     rat
            PIR A60783 (500 amino acids)
            Zaphiropoulos, P.G., Mode, A., Stroem, A., Husman, B.,
            Andersson, G., Gustafsson, J.A.
            Sequence and regulation of two growth-hormone-controlled,
            sex-specific isozymes of cytochrome P-450 in rat liver,
            P-450-15beta and P-450-16alpha.
            Acta Med. Scand. Suppl.  723, 161-167 (1988)

CYP2C11     rat
            GenEMBL X79081 (2140bp) PIR S44310 (56 amino acids)
            Strom,A., Equchi,H., Mode,A., Tollet,P., Stromstedt,P.E. and
            Gustafson,J.
            Characterization of the proximal promoter and two silencer elements
            in the CYP2C gene expressed in rat liver.
            DNA Cell Biol. 13, 805-819 (1994)

CYP2C11     rat
            PIR S26818 (500 amino acids)
            Matsumoto, T., Emi, Y., Kawabata, S. and Omura, T.
            Purification and characterization of three male-specific and
            one female-specific forms of cytochrome P-450 from rat
            liver microsomes.
            J. Biochem. 100, 1359-1371 (1986)

CYP2C11     rat
            GenEMBL U33173 (1856bp)
            Yoshioka,H., Morohashi,K., Sogawa,K., Miyata,T., Kawajiri,K.,
            Hirose,T., Inayama,S., Fujii-Kuriyama,Y. and Omura,T.
            Structural analysis and specific expression of microsomal
            cytochrome P-450(M-1) mRNA in male rat livers.
            J. Biol. Chem. 262 (4), 1706-1711 (1987)
            Erratum: [J Biol Chem 1987 Jun 15;262(17):8438]

            Biagini,C. and Celier,C.
            cDNA-directed expression of two allelic variants of cytochrome P450
            2C11 using COS1 and SF21 insect cells.
            Arch. Biochem. Biophys. 326 (2), 298-305 (1996)

CYP2C11     rat
            GenEMBL J02657 
            72% to CYP2C6_v1
243377899 MDPVLVLVLTLSSLLLLSLWRQSFGRGKLPPGPTPLPIIGNTLQIYMKDIGQSIKK 243378066
243379842 FSKVYGPIFTLYLGMKPFVVLHGYEAVKEALVDLGEEFSGRGSFPVSERVNKGL 243380003
243380160 GVIFSNGMQWKEIRRFSIMTLRTFGMGKRTIEDRIQEEAQCLVEELRKSK 243380309
GAPFDPTFILGCAPCNVICSIIFQNRFDYKDPTFLNLMHRFNENFRLFSSPWLQVCNT
FPAIIDYFPGSHNQVLKNFFYIKNYVLEKVKEHQESLDKDNPRDFIDCFLNKMEQEKH
NPQSEFTLESLVATVTDMFGAGTETTSTTLRYGLLLLLKHVDVTAKVQEEIERVIGRN
RSPCMKDRSQMPYTDAVVHEIQRYIDLVPTNLPHLVTRDIKFRNYFIPKGTNVIVSLS
SILHDDKEFPNPEKFDPGHFLDERGNFKKSDYFMPFSA
243416959 GKRICAGEALARTELFLFFTTILQNFNLKSLVDVKDIDTTPAISGFGHLPPFYEACFIPVQRADSLSSHL*
243417171

CYP2C12     rat
            Swiss B60783 (490 amino acids)
            Zaphiropoulos, P.G., Mode, A., Stroem, A., Husman, B.,
            Andersson, G., Gustafsson, J.A.
            Sequence and regulation of two growth-hormone-controlled,
            sex-specific isozymes of cytochrome P-450 in rat liver,
            P-450-15beta and P-450-16alpha.
            Acta Med. Scand. Suppl. 723, 161-167 (1988) 
rat 2C cluster in chromosome order

CYP2C12     rat
            PIR S26819 (490 amino acids)
            Matsumoto, T., Emi, Y., Kawabata, S. and Omura, T.
            Purification and characterization of three male-specific and
            one female-specific forms of cytochrome P-450 from rat
            liver microsomes.
            J. Biochem. 100, 1359-1371 (1986)
rat 2C cluster in chromosome order

CYP2C12     rat
            PIR B41425 (19 amino acids)
            Imaoka, S., Kamataki, T. and Funae, Y.
            Purification and characterization of six cytochromes P-450
            from hepatic microsomes of immature female rats.
            J. Biochem. 102, 843-851 (1987)
rat 2C cluster in chromosome order

CYP2C12     rat
            GenEMBL J03786 
            80% to 2C13 
MDPFVVLVLSLSFLLLLYLWRPSPGRGKLPPGPTPLPIFGNFLQ
IDMKDIRQSISNFSKTYGPVFTLYFGSQPTVVLHGYEAVKEALIDYGEEFSGRGRMPV
FEKATKGLGISFSRGNVWRATRHFTVNTLRSLGMGKRTIEIKVQEEAEWLVMELKKTK
GSPCDPKFIIGCAPCNVICSIIFQNRFDYKDKDFLSLIENVNEYIKIVSTPAFQVFNA
FPILLDYCPGNHKTHSKHFAAIKSYLLKKIKEHEESLDVSNPRDFIDYFLIQRCQENG
NQQMNYTQEHLAILVTNLFIGGTETSSLTLRFALLLLMKYPHITDKVQEEIGQVIGRH
RSPCMLDRIHMPYTNAMIHEVQRYIDLAPNGLLHEVTCDTKFRDYFIPKGTAVLTSLT
SVLHARKEFPNPEMFDPGHFLDENGNFKKSDYFMPFSAGKRKCVGEGLASMELFLFLT
TILQNFKLKSLSDPKDIDINSIRSEFSSIPPTFQLCFIPV

CYP2C13     rat
            GenEMBL X79810 (1944bp)
            Legraverend,C., Eguchi,H., Strom,A., Lahuna,O., Mode,A.,
            Tollet,P., Westin,S. and Gustafsson,J.A.
            Transactivation of the rat CYP2C13 gene promoter involves HNF-1, 
            HNF-3 and members of the orphan receptor subfamily. 
            Biochemistry 33, 9889-9897 (1994)
rat 2C cluster in chromosome order

CYP2C13     rat
            PIR S26820 (30 amino acids)
            Matsumoto, T., Emi, Y., Kawabata, S. and Omura, T.
            Purification and characterization of three male-specific and
            one female-specific forms of cytochrome P-450 from rat
            liver microsomes.
            J. Biochem. 100, 1359-1371 (1986)
rat 2C cluster in chromosome order

CYP2C13v1   rat
100% first 5 exons
Note this seq also on 100.0%    Un  ++   17276272  17282257
Exons 6-9 are on       99.1%    Un  ++   17323193  17358099 2 aa diffs to 2C13 J02861
CYP2C12 is also on this same contig 99.6%    Un  ++   17388090  17446950 2 aa diffs
Minus Strand HSPs:
245246208 MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTN (0) 245246041
245244920 FSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQ (1) 245244759
245244599 GIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN 245244450
245240888 GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQ (0) 245240727
245239607 VFNIFPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ 245239431

CYP2C13v1   rat
             GenEMBL J02861 
             80% to 2C12
MDPVVVLLLSLFFLLFLSLWRPSSGRGKLPPGPTPLPIIGNFFQ
VDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPI
CEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN
GSPCDPQFIMGCAPGNVICSIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNI
FPILLDYCPGNHNIYFKNHTWLKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQ
ENANQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVT
AKVQEEIDHVIGRH
RSPCMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHEVTCDTKFRNYFIPKGTAVLTSLT
SVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLT
TILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL

CYP2C13v2    rat
Not in figure; probable 2C13 allele NM_138514, 7 aa diffs to 2C13v1 (98%)
80% to 2C12 (temp name = CYP2CNEWA)
MDPVVVLLLSLFFLLFLSLWRLSSGRGKLPPGPTPLPIIGNFFQ
VDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPI
CEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTN
GSPCDPQFIMGCAPGNVICCIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNI
FPILLDYCPGNHNIYLKNYTWVKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQENA
NQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVTAKVQEEIDHVIGRH
RSPSMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHDVTCDTKFRNYFIPKGTAVLTSLT
SVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLT
TILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL

CYP2C13-de1b2b rat
frag 7 Exon 1 76% to 2C13 Minus Strand
245307855 MDPIVVLVLSLSCLLFLSLWRNNSRRGKLPPGPTPLPIIRNYLQLDMKDIC*SLTK (0) 245307688
frag 6 Exon 2 83% to 2C13 Minus Strand
245292652 FSKTYGPVYTLYFGSQPTVLLYGYEALKEALIDYGEAFSGRGRIPIHEKVSKGQ 245292491

CYP2C13-se1[6] rat
frag h 72% to 2C13 exon 6 plus strand 
100% to seq s 70% to 2C12 exon 6
244165142 ENGNQQMNYTQEHLATMVTDLL 244165207
244165209 FGGRETLNSTMRFAFLFLMKYPYTT 244165284
rat 2C cluster in chromosome order

CYP2C13-se2[6:7] rat
frag s Exons 6-7 minus strand 72% to 2C12 exon 6 100% to seq h
243766431 ENGNQQMNYTQEHLATMVTDLL 243766366
243766364 FGGRETLNSTMRFAFLFLMKYPYTT 243766290
243760156 XQINEEIGQVIWRHHSPSMLDWSHMIYTNAMVHEVQRYIDLAPNGVVCEVNCDTKYPRDYFIPK 243759968
rat 2C cluster in chromosome order

CYP2C13-se3[1:2:3:2:3:] rat
frag f Exons 1,2,3,2,3  exon 1 = 66% to 2C13 Minus Strand 
exons 2,3 = 57% to 2C13
two identical copies of exons 2,3 100% to seq v exons 2,3
244215468 SQSFLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 244215328
244214467 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 244214306
244214137 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213988
244213484                                    R*FS*RGWFSIFGKFSKVQ 244213428
244213259 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 244213110

CYP2C13-se4[1:2:3] rat
frag v Exon 1 (+) 59% to 2C13
243678671 FLLLLSLSSQISSKGKLPLDPTSLPILGYFF*VLMKDICQSLIN 243678802
Exon 2 (+) 48% to 2C79
243679647 FLKTSGPLYTQHFSLQPAVVFCGYAAVKGAFVDHSR*FS*RGWFSIFGKFSKVQ 243679808
Exon 3 (+) 100% to seq f
243679977 GIGFSHKNVWKVKRFFTLITLKNLHMGNDNIKNKVQEEAQCLVKELKKIN 243680126
rat 2C cluster in chromosome order

CYP2C14     rabbit

CYP2C15     rabbit

CYP2C16     rabbit

CYP2C17X    human
            discontinued number     See CYP2C18/19

CYP2C18     human
            GenEMBL L16869 to L16876 Swiss P33260 (490 amino acids)
            Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and 
            Goldstein,J.A.
            Cloning and expression of complementary DNAs for multiple 
            members of the human cytochrome P450IIC subfamily.
            Biochemistry 30, 3247-3255 (1991)

            de Morais,S.M., Schweikl,H., Blaisdell,J.A. and Goldstein,J.A.
            Gene structure and upstream regulatory regions of human 
            CYP2C9 and CYP2C18.
            Biochem. Biophys. Res. Commun. 194, 194-201 (1993)

            Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and 
            Goldstein,J.A.
            Correction: Cloning and expression of complementary DNAs 
            for multiple members of the human cytochrome P450IIC subfamily.
            Biochemistry 32, 1390-1390 (1993)

CYP2C18     human
            GenEMBL S63419 S63421 S63424 S63426 
            X56452 (multiple genomic fragments) PIR S45369 (56 amino acids)
            Ged,C. and Beaune,P.
            Partial sequence and polymerase chain reaction-mediated analysis of
            expression of the human CYP2C18 gene
            Pharmacogenetics 2, 109-115 (1992)

CYP2C18     human
            PIR A61269 (490 amino acids)
            Furuya, H., Meyer, U.A., Gelboin, H.V. and Gonzalez, F.J.
            Polymerase chain reaction-directed identification, cloning,
            and quantification of human CYP2C18 mRNA.
            Mol. Pharmacol. 40, 375-382 (1991)

CYP2C18/19  human
            GenEMBL M61858 J05326 (1276bp) Swiss P33259 (270 amino acids)
            Goldstein,J.A., Raucy,J.L., Blaisdell,J.A., Faletto,M.B. and
            Romkes,M.
            Cloning and expression of complementary DNAs for multiple members
            of the human cytochrome P450IIC subfamily
            Biochemistry 30, 3247-3255 (1991)
            This sequence named 2C17 was later found to be a splice of 2C18 and 
            2C19.  Therefore, there is no 2C17 sequence.

CYP2C18/19  human
            GenEMBL L07093 (2395bp)
            Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and
            Goldstein,J.A.
            Correction: Cloning and expression of complementary DNAs for
            multiple members of the human cytochrome P450IIC subfamily
            Biochemistry 32, 1390-1390 (1993)

CYP2C18     Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 9/29/2005
            3 aa diffs to rhesus 2C18, 95% to human 2C18, only 80% to 2C19
            complete sequence

CYP2C18     Macaca fascicularis (cynomolgus monkey)
            No accession 
            Wu Zhicong
            Submitted to nomenclature committee 10/30/2006
            96% to 2C18 human, 81% to 2C9, 81% to 2C19, 76% to 2C8
            3 amino acid diffs to Uno's seq.

CYP2C18     Macaca mulatta (Rhesus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 9/29/2005
            3 aa diffs to M. fascicularis 2C18
            complete sequence

CYP2C19     human
            Swiss P33261 (490 amino acids)
            Romkes,M., Faletto,M.B., Blaisdell,J.A., Raucy,J.L. and 
            Goldstein,J.A.
            Cloning and expression of complementary DNAs for multiple 
            members of the human cytochrome P450IIC subfamily.
            Biochemistry 30, 3247-3255 (1991)

CYP2C19     human
            GenEMBL L31506 (129bp)
            GenEMBL L31507 (129bp)
            De Morais,S.M.F., Wilkinson,G.R., Blaisdell,J.A., Nakamura,K.,
            Meyer,U.A. and Goldstein,J.A.
            The major genetic defect responsible for the polymorphism of
            S-mephenytoin metabolism in humans
            J. Biol. Chem. 269, 15419-15422 (1994)

CYP2C19     human
            GenEMBL L32982 (329bp) wild type exon 4
            GenEMBL L32983 (329bp) mutant exon 4
            De Morais,S.M.F., Wilkinson,G.R., Blaisdell,J.A., Meyer,U.A.,
            Nakamura,K. and Goldstein,J.A.
            Identification of a new genetic defect responsible for the
            polymorphism of S-mephenytoin metabolism in Japanese
            Mol. Pharmacol. 46, 594-598 (1994)

CYP2C19     human
            PIR S38753 (16 amino acids)
            Wrighton, S.A., Stevens, J.C., Becker, G.W., and van den Branden,M.
            Isolation and characterization of human liver cytochrome P450
            2C19: correlation between 2C19 and S-mephenytoin
            4'-hydroxylation.
            Arch. Biochem. Biophys. 306, 240-245 (1993)

CYP2C20/2C8  Macaca fascicularis (cynomolgus monkey)
            GenEMBL S53046 (1901bp) Swiss P33262 (490 amino acids)
            PIR S28166 (490 amino acids)
            Komori,M., Kikuchi,O., Sakuma,T., Funaki,J., Kitada,M.
            and Kamataki,T.
            Molecular cloning of monkey liver cytochrome P-450 cDNAs:
            similarity of the primary sequences to human cytochromes P-450.
            Biochim. Biophys. Acta 1171, 141-146 (1992)
            CYP2C8 will be the preferred name for this seq in the future.

CYP2C20/2C8  Macaca fascicularis (cynomolgus monkey)
            PIR A60466 (22 amino acids)
            Ohi, H., Toratani, S., Komori, M., Miura, T., Kitada, M. and
            Kamataki, T.
            Comparative study of cytochrome P-450 in liver microsomes. A
            form of monkey cytochrome P-450, P-450-MK1,
            immunochemically cross-reactive with antibodies to rat
            P-450-male.
            Biochem. Pharmacol. 38, 361-365 (1989)
            CYP2C8 will be the preferred name for this seq in the future.

CYP2C20/2C8  Macaca mulatta (rhesus monkey) name change from CYP2C74
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            91% to CYP2C8, 78% to CYP2C19, probable ortholog of CYP2C8
            formerly CYP2C74.  There are only 3 amino acid differences to 
            Macaca fascicularis (cynomolgus monkey) GenEMBL S53046
            Since this is the clear ortholog of that earlier sequence 
            the name has been changed to reflect the orthology.
            CYP2C8 will be the preferred name for this seq in the future.

CYP2C21    Canis familiaris (dog)
           NW_876285.1: 8748112-8724707
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           70% to human 2C19
MDLFIVLVICLSCLISFFLWNQNRAKGKLPPGPTPLPIIGNILQINTKNVSKSLSKLAENYGPVFTVYFGMKPTV
VLYGYEAVKEALIDRSEEFSGRGHFPLLDWTIQGLGIVFSNGEKWKQTRRFSLTVLRNMGMGKKTVEDRIQEEAL
YLVEALKKTNASPCDPTFLLGCAPCNVICSIIFQNRFEYDDKDFLTLLEYFHENLLISSTSWIQLYNAFPLLIHY
LPGSHHVLFKNIANQFKFISEKIKEHEESLNFSNPRDFIDYFLIKIEKEKHNKQSEFTMDNLIITIWDVFSAGTE
TTSTTLRYGLLVLLKHPDVTAKVQEEIHRVVGRHRSPCMQDRSCMPYTDAVVHEIQRYIDLVPNNLPHSVTQDIK
FREYLIPKGTTILTSLTSVLHDEKGFPNPDQFDPGHFLDENGSFKKSDYFMAFSAGKRVCVGEGLARMELFLLLT
NILQHFTLKPLVDPKDIDTTPIANGLGATPPSYKLCFVPV*

CYP2C22     rat
            GenEMBL M58041 
            61% to 2C79
245425985 MALFIFLGIWLSCLVFLFLWNQHHVRRKLPPGPTPLPIFGNILQVGVKNMSKSMCM 245425818
LAKEYGPVFTMYLGMKPTVVLYGYEVLKEALIDRGEEFSDKMHSSM
LSKVSQGLGIVFSNGEIWKQTRRFSLMVLRSMGMGKRTIENRIQEEVVYLLEALRKTN
GSPCDPSFLLACVPCNLISSVIFQHRFDYSDEKFQKFIENFHTKIEILASPWAQLCSA
YPVLYYLPGIHNKFLKDVTEQKKFILMEINRHRASLNLSNPQDFIDYFLIKMEKEKHN
EKSEFTMDNLIVTIGDLFGAGTETTSSTIKYGLLLLLKYPEVTAKIQEEITRVIGRHR
RPCMQDRNHMPYTDAVLHEIQRYIDFVPIPLPRKTTQDVEFRGYHIPK
GTSVMACLTSALHDDKEFPNPEKFDPGHFLDEKGNFKKSDYFMAFSA
GRRACIGEGLARMEMFLILTSILQHFILKPLVNPEDIDTTPVQPGLLSLPPPFQLCFIPV
rat 2C cluster in chromosome order

CYP2C22-se2[1:2] rat
frag 9 Exon 1 61% to 2C22 Minus Strand
245347583 MDLFIILWICFACLSLFFLWNQLHYKEKLPPGPVPLPIVGNILQVNIKSIIKSLNI (0) 245347416
frag 8 Exon 2 79% to 2C22 Minus Strand
245334622 LAKEYGPVFTVYLGMKPTVVLHGHKALKEALIDRANEFSVKMQSSLLSKESQGL (1) 245334461

CYP2C23     rat
            GenEMBL U04733 (1919bp)
            Karara,A., Makita,K., Jacobson,H.R., Falck,J.R.,
            Guengerich,F.P., DuBois,R.N. and Capdevila,J.H.
            Molecular cloning, expression, and enzymatic characterization of the 
            rat kidney cytochrome P-450 arachidonic acid epoxygenase.
            J. Biol. Chem. 268, 13565-13570 (1993)
rat 2C cluster in chromosome order

CYP2C23     rat
            GenEMBL S67064 (265bp)
            Imaoka,S., Wedlund,P.J., Ogawa,H., Kimura,S., Gonzalez,F.J. 
            and Kim,H.Y.
            Identification of CYP2C23 expressed in rat kidney as an arachidonic 
            acid epoxygenase.
            J. Pharmacol. Exp. Ther. 267, 1012-1016 (1993)
rat 2C cluster in chromosome order

CYP2C23     rat
            PIR S29817 (20 amino acids)
            Marie, S.; Roussel, F.; Cresteil, T.
            Age- and tissue-dependent expression of CYP2C23 in the rat.
            Biochim. Biophys. Acta 1172, 124-130 (1993) 
            note: This sequence is different from GenEMBL U04733 and S67064
            by one amino acid. PIR S13101, SwissProt P24470 and GenEMBL 
            X55446 are all equivalent, but they have a frame shift in the sequence 
            in the region of this 20 amino acid fragment. Amino acids 38-54 are affected.
rat 2C cluster in chromosome order

CYP2C23     rat
            GenEMBL X55446
            59% to 2C11
MELLGFTTLALVVSVTCLSLLSVWTKLRTRGRLPPGPHPPSHYW
ESTATEPQGHPASLSKLAKEYGPVYTLYFGTSPTVVLHGYDVVKEALLQQGDEFLGRG
PLPIIEDTHKGYGLIFSNGERWKVMRRFSLMTLRNFGMGKRSLEERVQEEAWCLVEEL
QKTKAQPFDPTFILACAPCNVICSILFNDRFQYNDKTFLNLMDLLNKNFQQVNSVWCQ
MYNLWPTIIKYLPGKHIEFAKRIDDVKNFILEKVKEHQKSLDPANPRDYIDCFLSKIE
EEKDNLKSEFHLENLAVCGSNLFTAGTETTSTTLRFGLLLLMKYPEVQAKVHEELDRV
IGRHQPPSMKDKMKLPYTDAVLHEIQRYITLVGSSLPHAVVQDTKFRDYVIPKGTTVL
PMLSSVMLDQKEFANPEKFDPGHFLDKNGCFKKTDYFVPFSLGKRACVGESLARMELF
LFFTTLLQKFSLKTLVEPKDLDIKPITTGIINLPPPYKLCLVPR

CYP2C24     rat
            GenEMBL S59647 (226bp)
            GenEMBL S59648 (187bp)
            GenEMBL S59652 (380bp)
            Zaphiropoulos,P.G.
            Differential expression of cytochrome P450 2C24 and transcripts
            in rat kidney and prostate: evidence indicative of alternative
            and possibly trans splicing events.
            Biochem. Biophys. Res. Commun. 192, 778-786 (1993)
rat 2C cluster in chromosome order

CYP2C24     rat
            Swiss P33273 (434 amino acids) PIR PT0435 (302 amino acids)
            PIR JH0451 (434 amino acids)
            Zaphiropoulos,P.G.
            cDNA cloning and regulation of a novel rat cytochrome P450 of the 2C 
            gene subfamily (P450IIC24).
            Biochem. Biophys. Res. Commun. 180, 645-651 (1991)
rat 2C cluster in chromosome order

CYP2C24     rat
92% to 2C80, M86678 has alternative splice first exon seen only in M86678 
exons 2-4 only 2 aa diffs to 2C24 on M86678
no ESTs contain the yellow region but CK481568.1 covers exons 1,2,3,4
CO565602.1 matched the end of the gene sequence and extends it by 6 aa
Used this EST to blast the trace files to find the end of exon 7
MDPVLVLVLTLSCLLLLSLWRQSSGRGKLPPGPTPLPIIGNILQIDVKDISKSFTN CK481568.1 exon 1
          QLSCSRKFGLTCGPEAQ rat repeat seq found in many rat BACs
243522306 FTDKLTAKCHSSVSLHIDLPGNLL 243522235 yellow region not P450 seq.
243522073 FSKIYGPVFTLYFGPKPTVVVHGYEAVKEALDDLGEEFSGRGSFPIVERMNNGL 243521912
243521366 GVIFSNGTKWKELRHFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 243521217
243518830 GSLCDPTFILSCAPSNVICSVVFHNRFDYKDENFLNLMEKLNENFKILNSPWMQ 243518669
    VCNALPAFIDYLPGSHNRVIKNFAEI 676
677 KSYILRRVKEHQETLDMDNPRDFIDCFLIKMEQEKHNPRTEFTIEILMATVSDVFVAGSE 856
857 TTSTTLRYGLLLLLKHIEVT
gnl|ti|132779224 rts18e73.g from trace files for exon 7
AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK
rat 2C cluster in chromosome order

CYP2C25      Mesocricetus auratus (Syrian hamster)
            GenEMBL X63022 (1829bp, incorrectly given as X60322 in Table 3
            of the 1993 nomenclature update)
            Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T.
            Sex-related difference in the expression of cytochrome P450 in
            hamsters: cDNA cloning and examination of the expression of three 
            distinct CYP2C cDNAs.
            Molec. Pharmacol. 45, 228-236 (1994)

CYP2C26     Mesocricetus auratus (Syrian hamster)
            GenEMBL D11435 (1808bp) Swiss P33263 (490 amino acids)
            Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T.
            Sex-related difference in the expression of cytochrome P450 in
            hamsters: cDNA cloning and examination of the expression of three 
            distinct CYP2C cDNAs.
            Molec. Pharmacol. 45, 228-236 (1994)

CYP2C27     Mesocricetus auratus (Syrian hamster)
            GenEMBL D11436 (1784bp) Swiss P33264 (490 amino acids)
            Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T.
            Sex-related difference in the expression of cytochrome P450 in
            hamsters: cDNA cloning and examination of the expression of three 
            distinct CYP2C cDNAs.
            Molec. Pharmacol. 45, 228-236 (1994)

CYP2C28     Mesocricetus auratus (Syrian hamster)
            GenEMBL D11437 (1556bp) Swiss P33265 (490 amino acids)
            Sakuma,T., Masaki,K., Itoh,S., Yokoi,T. and Kamataki,T.
            Sex-related difference in the expression of cytochrome P450 in
            hamsters: cDNA cloning and examination of the expression of three 
            distinct CYP2C cDNAs.
            Molec. Pharmacol. 45, 228-236 (1994)

Cyp2c29     mouse
            GenEMBL D17674 (1751bp) also BC013895
            Matsunaga,T., Watanabe,K., Yamamoto,I., Negishi, M.,
            Gonzalez,F.J. and Yoshimura, H. 
            cDNA cloning and sequence of CYP2C29 encoding P-450 MUT-2,
            a microsomal aldehyde oxygenase.
            Biochim. Biophys. Acta 1184, 299-301 (1994) 

Cyp2c29     mouse
            PIR A61268 (16 amino acids)
            Bornheim, L.M. and Correia, M.A.
            Purification and characterization of a mouse liver cytochrome
            P-450 induced by cannabidiol.
            Mol. Pharmacol. 36, 377-383 (1989)

Cyp2c29v2   mouse
            no accession number
            Gang Luo and Joyce A. Goldstein
            clone M2c9k
            submitted to Nomenclature Committee

CYP2C30     rabbit
            GenEMBL D26153
            Noshiro,M., Ishida,H. and Okuda,K. 
            unpublished (1993)

CYP2C31     Capra hircus (dwarf goat)
            GenEMBL X76502 (1185bp) PIR JC2199 (284 amino acids) 
            PIR S39314 (284 amino acids)
            Zeilmaker,W.M., Van't Klooster,G.A.E., Gremmels-Gehrmann,F.J.,
            Van Miert,A.S.J. and Horbach,G.J.M.J.
            cDNA and deduced amino acid sequence of a dwarf goat liver 
            cytochrome P450-fragment belonging to the CYP2C gene subfamily.
            Biochem. Biophys. Res. Commun. 200, 120-125 (1994)

CYP2C32     pig
            GenEMBL U35733.1 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            most similar to 2C24
            Clone name CL1

CYP2C33v1   pig
            GenEMBL U35837 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name CL7

CYP2C33v2   pig
            GenEMBL U35838 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name CL8

CYP2C33v3   pig
            GenEMBL U35839 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name PF1

CYP2C33v4   Sus scrofa (pig)
            GenEMBL AB052257 
            Misaki Kojima
            Submitted to nomenclature committee Oct. 27, 2000
            2 amino acid diffs with 2C33v1 and v2
            clone name c296

CYP2C34v1   pig
            GenEMBL U35840.1 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name PF15

CYP2C34v2   pig
            GenEMBL U35841.1 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name CL6

CYP2C34v3   pig
            GenEMBL U35842.1 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name Cl12

CYP2C34v4   pig
            GenEMBL U35843.1 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name Cl13

CYP2C35     pig
            GenEMBL U35844.1 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name PF11/14

CYP2C36     pig
            GenEMBL U35845.1 (681bp)
            Zaphiropoulos,P.G., Skantz,A., Eliasson,M. and Ahlberg,M.B.
            Cytochrome P450 genes expressed in porcine ovaries: identification of novel 
            forms, evidence for gene conversion, and evolutionary relationships.
            Biochem. Biophys. Res. Commun. 212, 433-441 (1995)
            Clone name PF13

CYP2C37     macaque [name conflict, reassigned to CYP2C43]
            no accession number
            S. Ohmori
            submitted to Nomenclature Committee

Cyp2c37     mouse
            AF047542 NM_010001, also AK005017
            Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA.
            Cloning and expression of murine CYP2Cs and their ability to 
            metabolize arachidonic acid.
            Arch Biochem Biophys. 357, 45-57 1998.
            clone M2c10b
            submitted to Nomenclature Committee

Cyp2c38     mouse
            AF047725
            Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA.
            Cloning and expression of murine CYP2Cs and their ability to 
            metabolize arachidonic acid.
            Arch Biochem Biophys. 357, 45-57 1998.
            clone M2c13f
            submitted to Nomenclature Committee

Cyp2c39     mouse
            AF047726 NM_010003
            Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA.
            Cloning and expression of murine CYP2Cs and their ability to 
            metabolize arachidonic acid.
            Arch Biochem Biophys. 357, 45-57 1998.
            clone M2c9d
            submitted to Nomenclature Committee

Cyp2c39-ie6b mouse
            GenEMBL NT_039689.1
            Internal exon 6 (duplicate exon)
5895730 ANHIQQAEFSLENLACTINNLFAAGTETTSTSLINARLLFVRDPNVT 5895870

Cyp2c40     mouse
            AF047727 NM_010004 (NW_000147 exons 2-6 only)
            Luo G, Zeldin DC, Blaisdell JA, Hodgson E, Goldstein JA.
            Cloning and expression of murine CYP2Cs and their ability to 
            metabolize arachidonic acid.
            Arch Biochem Biophys. 357, 45-57 1998.
            Tsao CC, Foley J, Coulter SJ, Maronpot R, Zeldin DC, Goldstein JA.
            CYP2C40, a unique arachidonic acid 16-hydroxylase, is the major CYP2C 
            in murine intestinal tract.
            Mol Pharmacol. 58, 279-87 2000
            clone M2c9h
            submitted to Nomenclature Committee

CYP2C41     dog
            NM_001003334, AF016248
            Stephen R. Bai and Joyce A. Goldstein
            clone M2c9h
            submitted to Nomenclature Committee
MDPVVVLVLCLSCCLLLSLWKQSSRKGKLPPGPTPLPFIGNILQ
LDKDINKSLSNLSKAYGPVFTLYFGMKPTVVLHGYDAVKETLIDLGEEFSARGRFPIA
EKVSGGHGIIFTSGNRWKEMRRFALTTLRNLGMGKSDLESRVQEEACYLVEELRKTNA
LPCDPTFVLGCASCNVICSIIFQNRFDYTDQTLIGFLEKLNENFRILSSPWIQAYNSF
PALLHYLPGSHNTIFKNFAFIKSYILEKIKEHQESFDVNNPRDFIDYFLIKMEQEKHN
QPLEFTFENLKTIATDLFGAGTETTSTTLRYGLLLLLKHPEVTVKVQEEIDRVIGRHQ
SPHMQDRSRMPYTNAVLHEIQRYIDLVPNSLPHAVTCDVKFRNYVIPKGTTILISLSS
VLSDEKEFPRPEIFDPAHFLDDSGNFKKSDYFMAFSAGKRICVGEGLARMELFLFLTT
ILQKFTLKPLVDPKDIDTTPLASGFGHVPPTYQLCFIPV

CYP2C42     pig
            GenEMBL Z93098 (1307bp)
            Nissen,P.H., Winteroe,A.K. and Fredholm,M.
            Characterization and mapping of three porcine genes belonging to
            the cytochrome P450 superfamily
            Unpublished
            clone 10b03

CYP2C42P1   pig
            GenEMBL Z93100 (1758bp)
            Nissen,P.H., Winteroe,A.K. and Fredholm,M.
            Characterization and mapping of three porcine genes belonging to
            the cytochrome P450 superfamily
            Unpublished
            clone 15d09 (pseudogene)

CYP2C43     Macaca mulatta (rhesus monkey)
            no accession number
            Matsunaga T, Ohmori S, Ishida M, Sakamoto Y, Nakasa H, Kitada M.
            Molecular Cloning of Monkey CYP2C43 cDNA and Expression in Yeast.
            Drug Metab Pharmacokinet. 2002;17(2):117-24.
            submitted to Nomenclature Committee
            [name conflict, formerly CYP2C37 reassigned to CYP2C43]

CYP2C43     Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 1/11/2005
            Clone name mfCYP2C9v1
            92% to 2C9 human, 93% to 2C75, 77% to 2C20, 77% to 2C74
            99% to rhesus 2C43

Cyp2c44    mouse
           no accession number 
           Christian Helvig and Jorge H. Capdevila
           submitted to nomenclature committee Oct. 2, 1998
           most similar to CYP2C23 (87% identical)
MELLGLPTLALLVLVMSLSLLSVWTKMRTGGRLPPGPTPLPIIGNILQLDLKDIPASLSK
LAKEYGPVYTLYFGSWPTVVLHGYDVVKEALLNQGDEFLGRGPLPIIEDSQKGH
GIVFSEGERWKLLRRFSLMTLKNFGMGKRSLEERVQEEARCLVEELHKTE
AQPFDPTFILACAPCNVICSILFNERFPYNDKTFLNLMDLLNKNFYQLNSIWIQ
MYNLWPTIMKYIPGKHREFSKRLGGVKNFILEKVKEHQEFLDPANPRDYIDCFLSKIEE
EKHSLKSDFNLENLAICGSNLFTAGTETTSTTLRFGLLLLVKHPEVQ
AKVHEELDRVIGRHQPPSMKDKMKLPYTDAVLHEIQRYITLLPSSLPHAVVQDTKFRHYVIPK
GTAVFPFLSSILLDQKEFPNPEKFDPGHFLDKNGCFKKTDYFVPFSL
GKRSCVGEGLARMELFLFFTTILQKFSLKALVEPKDLDIKPVTTGLFNLPPPYKLRLVPR

CYP2C45    Gallus gallus (chicken)
           No accession number
           Manuel Baader
           Submitted to nomenclature committee Nov. 22, 1999
           57% identical to CYP2C9

CYP2C46     rat 
            No accession number 
            Lars von Buchholtz
            Submitted to nomenclature committee March 6, 2000 
            91% to 2C24

CYP2C47     Phascolarctos cinereus (koala) 
            No accession number 
            Ross McKinnon
            Submitted to nomenclature committee May 25, 2000
            60% identical to many 2C sequences

CYP2C48     Phascolarctos cinereus (koala) 
            No accession number 
            Brett Jones and Ross McKinnon
            Submitted to nomenclature committee Nov. 6, 2000
            92% identical to 2C47 

CYP2C49     Sus scrofa (pig)
            GenEMBL AB052258 
            Misaki Kojima
            Submitted to nomenclature committee Oct. 27, 2000
            92% to 2C35 and 2C34v1, v3, v4
            80% to 2C18, 78% to 2C9, 77% to 2C19 and 75% to 2C8
            clone name c195

Cyp2c50     mouse
            GenEMBL BC011222.1, NT_039692
            GSS AZ589908 one exon only
            ESTs AI118193 ue34e02.x1, opposite end = AI098787 ue34e02.y1 
            AI097740 AI117011 AI119501 AI314482 BF385641 AI528254
            AA968308 AI876138 AI097678 AI226027 BF384486 BF659471 AI529923
            AI266900 uj08d09.x1, opposite end AI226027 uj08d09.y1,
            Joyce Goldstein and Cheng-Chung Tsao
            submitted to nomenclature committee 3/1/2001
            94% to 2c37; 75% 2c39,2c29v2; 74% 2c38; 68% 2c40; 53% 2c44
            name 2C heart
NT_039692 + strand
176707 MDPILVLVFTLSCLFLLSLWRQSSERGKLPPGPTPLPIIGNILQINVKDICQSFTN 176874
177228 LSKVYGPVYTLYLGRKPTVVLHGYEAVKEALVDHGEEFAGRGRLPVFDKATNGM 177389
177552 GIIFSKGNVWKNTRRFSLTTLRNLGMGKRSIEDRVQEEARCLVEELRKTN 177701
177951 GSPCDPTFILGCAPCNVICSIIFQDRFDYKDRDFLNLMEKLNEITKIMSTPWLQ 178112
179211 VCNTFPVLLDYCPGSHNKVFKNYACIKNFLLEKIKEHEESLDVTIPRDFIDYFLINGGQ 179387
183835 ENGNYPLKNRLEHLAITVTDLFSAGTETTSTTLRYALLLLLKYPHVT 183975
185072 AKVQEEIEHVIGKHRRPCMQDRSHMPYTDAMIHEVQRFIDLVPNSLPHEVTCDIKFRNYFIPK 185260
198149 GTNVITSLSSVLRDSKEFPNPEKFDPGHFLDENGKFKKSDYFMPFST 198289
200344 GKRICAGEGLARMELFLFLTSILQNFNLKPLVHPKDIDVTPMLIGLASVPPAFQLCFIPS 200523
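
Many entries in this list give each exon as a line of the form
"start  sequence  end", with genomic coordinates flanking the translated
residues. A minimal sketch of how such lines could be parsed and sanity
checked is shown here. It assumes the coordinates are inclusive nucleotide
positions and that descending coordinates indicate the minus strand; the
listing does not state either point explicitly. The function name
parse_exon_line is hypothetical.

```python
import re

# Matches lines of the form "<start> <one-letter residues> <end>".
EXON_LINE = re.compile(r"^\s*(\d+)\s+([A-Z*]+)\s+(\d+)\s*$")


def parse_exon_line(line):
    """Return a dict describing one coordinate-flanked exon line, or None."""
    match = EXON_LINE.match(line)
    if match is None:
        return None  # annotated or unnumbered lines are skipped
    start, seq, end = int(match.group(1)), match.group(2), int(match.group(3))
    span_nt = abs(end - start) + 1  # assumption: coordinates are inclusive
    return {
        "start": start,
        "end": end,
        # assumption: descending coordinates mean the minus strand
        "strand": "+" if end >= start else "-",
        "residues": len(seq),
        "span_nt": span_nt,
        # three nucleotides per residue (a '*' stop also spans one codon)
        "consistent": span_nt == 3 * len(seq),
    }


# Exon 1 of Cyp2c50 (plus strand) and exon 1 of Cyp2c54 (minus strand).
plus = parse_exon_line(
    "176707 MDPILVLVFTLSCLFLLSLWRQSSERGKLPPGPTPLPIIGNILQINVKDICQSFTN 176874"
)
minus = parse_exon_line(
    "160912 MDPILVLVLTLSCLFLLSLWRQSYERGKLPPGPTPLPIIGNILQIDVKDICQSFTN 160745"
)
```

Lines that carry trailing notes (for example "frameshift") do not match the
pattern and return None, so they need to be inspected by hand. A line whose
span is not three times its residue count may reflect a frameshift, a split
codon at an exon boundary, or a transcription error in the coordinates.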

Cyp2c51X?   mouse
            No accession number
            Joyce Goldstein and Cheng-Chung Tsao
            submitted to nomenclature committee 3/1/2001
            69% to 2c29v2; 69% 2c37; 68% 2c38; 67% 2c39; 67% 2c40
            no exact hits in nr, htgs, est, gss or sts on 3/5/01
            name 2C aorta
            note: this seq appears to be a combination between 2c52p and 2c69
            it may not be a real gene

Cyp2c52-ps  mouse
            GenEMBL XM_140720
            Joyce Goldstein and Cheng-Chung Tsao
            submitted to nomenclature committee 3/1/2001
            78% to 2c51, 70% to 2c29v2, 2c38; 67% to 2c39, 2c37; 61% to 2c40
            missing PYTD in K-helix
            no exact hits in nr, htgs, est, gss or sts on 3/5/01
            name 2C kidney, 2C eye
sequence shown is from Ensembl mouse version 3
628318 MDPVLVLVLTLSCLLLLS*WRQNSGRGKLPPGPTPLPIIGNILQIDVKNTGQSVGK 628367
630645 FSKVYGPVFTLYFGMKPSVVLHGYEAVKEALVDLGEGFSGRGSFPVAEKASKGL 630806
630954 GIIFSNGMKWKEIRRFSVMT 631013  frameshift                        
631012 LRNFGMGKRSVEDRVQEEARCLVEELRNGK 631101                        
636385 XAPCDPTFILGCAPCNVICSIIFQKRFDYKDQTFLNLMDKFNENFRILSTPWIQ 636425
639913 VCNTFPAIIDYFPGSHNQVLKNFSYIKKNYVLEKVKKHQESLDMENPRDFIDCFLIKMKQ 639972
710041 EKHSLQSEFTHESLVATVTDMFGAGTETTSNTLRYGLLLLLKHVDIT 710181     
713060 AKVQEEIERVVGRHRSPCVQDRSHM 713134  4 aa deletion and f.s.         
713136 AVVHETQRYIVLIPTNLPHSVTCDAKFRNYFIPK 713237                        
715864 GTTVITSLTSMLHDDKEFPNPEKFDPGYFLDERGNVKKSDYFVPFSA 716004       
717828 GKRMCAGEGLTGMELFLFFTIILQNFNLKPLVDVKDIDTTPVVSGFGHVPPLYQARFIPV* 718010


Cyp2c53-ps  mouse
            AC078913.5 seq b assembled from parts 74% to 2c39 
            Old assembly included some N- and C-term parts not from this gene
TNFSKVYGPVFTLYLGMKPTVVLHGYEAVKEALIDHGEEFAVKGIFPLAEKNSK

FTLMTLKNLGMGKRNIEDRVQEEAQCLVEELRKTNG
SLCDPTFILGCAPCNVTCSIIFQNHFDYKDQDFLSLMEKINENTKIVSTPWIQ
LIVYCPGSHKTVPENAYYIEIYILKKIKEHQESLDVTNPLDFIDYYLIKCKQ
GAGTETSTTLRYALLLLMTYPEVT

Cyp2c53-ps  mouse 
            AY227735 NW_000145
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            Between Cyp2c66 and Cyp2c29 on chr 19
            Temp name 2CN6
            74% to 2c29
            note: this is a pseudogene.  There are three stop codons 
            and the C-helix WXXXR motif is missing
MDLISFLMLTLFCLILLSLWSQSSGRGKLPPGPTPVPIIVSLLQLDVKNITQSSTN
FSKVYGPVFTLYLGMKPTVVLHGYEAVKEALIDHGEEFAVKGIFPLAEKNSKAL
LSGFML*FLFLFV*EFTLMTLKNLGMGKRNIEDRVQEEAQCLVEELRKTN
GSLCDPTFILGCAPCNVTCSIIFQNHFDYKDQDFLSLMEKINENTKIVSTPWIQ
VVKFSPVLIVYCPGSHKTVPENAYYIEIYILKKIKEHQESLDVTNPLDFIDYYLIKCKQ
EYHNHYSELTLKILSTTVTDFFGAGTETTSTTLRYALLLLMTYPEVT
AKIQDENDHVVGKHRNLCMQDRSHMPYTFAMIH*VQRFIDLLPTNLPHAVTCDIKFRNYIILK
GTAVITSLSSVLHDRKEFLNPEMFDPGHFLDGNGNFKKSDHFMPFSA
GKRVCVGEGLACMELFLFLTTALQNFKLKPLVHPKDINTTPVLNGFASVPLFYELCSIPL*

Cyp2c54     mouse
            GenEMBL NT_039692 - strand
            Darryl Zeldin
            submitted to nomenclature committee 3/18/2002
            clone name N1
            92% to 2c50 91% to 2c37 76% to 2c29 73% to 2c38 74% to 2c39 
            70% to 2c40 67% to 2c55 66% to 2c53p 59% to 2c44 67% to 2c52p 
            68% to 2c51
160912 MDPILVLVLTLSCLFLLSLWRQSYERGKLPPGPTPLPIIGNILQIDVKDICQSFTN 160745
159630 LSRVYGPVYTLYLGRKPTVVLHGYEAVKEALVDHGDVFAGRGRLPVFDKATNGM 159469
159306 GIGFSNGSVWKNTRHFSLMTLRNLGMGKRSIEDRVQEEARCLVEELRKTN 159157
158708 GSPCDPTFILGCAPCNVICSIIFQDRFDYKDRDFLNLLEKLDEISKILSTPWLQ 158547
157443 VCNTFPALLDYCPGSHNQFFKNYAYIKNFLLEKIREHKESLDVTIPRDFIDYFLIKGAQ 157267
134958 EDDNHPLKNNFEHLAITVTDLFIGGTESMSTTLRYALLLLLKYPHVT 134818
133577 AKVQEEIEHVIGKHRRPCMQDRSHMPYTNAMIHEVQRFIDLVPNNLPHEVTCDIKFRNYFIPK 133389
127646 GTTVITSLSSVLRDSKEFPNPEKFDPGHFLDENGKFKKSDYFMPFST 127506
125732 GKRICAGEGLARMELFLFLTSILQNFNLKPLVHPKDIDITPMLIGLGSVPPAFQLCFIPS 125553

Cyp2c55     mouse
            GenEMBL NT_039689.1 + strand
            Darryl Zeldin
            submitted to nomenclature committee 3/18/2002
            clone name N3
            71% to 2c29 70% to 2c39 70% to 2c38 69% to 2c37 69% to 2c50 
            65% to 2c40 58% to 2c44 53% to 2c53p 59% to 2c52p 67% to 2c54
            67% to 2c51
5347110 MDPVLVLVLTLSCLLLLSLWRQNSGRGKLPPGPTPFPIIGNILQIDIKNISKSFNY 5347277
5351084 FSKVYGPVFTLYFGSKPTVVVHGYEAVKEALDDLGEEFSGRGSFQIFERINNDL 5351245
5351753 GVIFSNGTKWKELRRFSIMTLRSFGMGKRSIEDRIQEEASCLVEELRKAN 5351902
5358706 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDEKFLNLMERLNENFKILNSPWMQ 5358867
5371382 VYNALPTLINYLPGSHNKVIKNFTEIKSYILGRVKEHQETLDMDNPRDFIDCFLIKMEQ 5371558
5374359 EKHNPHSEFTIESLMATVTDIFVAGTETTNITLRYGLLLLLKHTEVT 5374499
5375564 AKVQAEIDHVIGRHRSPCMQDRTRMPYTDAMVHEIQRYIDLIPNNVPHAATCNVRFRSYFIPK 5375752
5378482 GTELVTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFKKSDYFMPFSI 5378622
5382398 GKRMCVGEALARTELFLILTTILQNFNLKSLVDTKDIDTTPVANTFGRVPPSYQLYFIPR 5382577

CYP2C56PX   human = CYP2C-se1[7] (see below)

CYP2C57PX   human = CYP2AC1P a new subfamily in mammals (see below)

CYP2C58P    human
            NT_008769.11|Hs10_8926 
            solo exons 1,2,3 between 2C19 and 2C9 
            same as AL133513.12
            an alternative name for this sequence would be CYP2C19-de1b2b3b
8303126 LDLAAVLMLCLSCLLLLSL*TQISGRGKLPSDSTPLQVIESILQMADKDICKSSSNLSTLY 8302944 
8296311 SLYFDMKLVLVLHGYEVLKKALIHHGEEFSGKGIFPVSKK 8296192
8295999 IIFSNRKPCKEIWPFLLMTLWNCGVVKRS 8295913
8295911 LGKHVQVEAHCIVWELRRTK 8295852

CYP2C59PX   human = CYP2C9-de2c3c (see above)

CYP2C60PX   human = CYP2C8-de6b (see above)

CYP2C61PX   human = CYP2C-se2[1:2] (see below)

CYP2C62P    human
            AL138921 NT_030059 chromosome 10 50% to 2C8
            Chr10q24.31 101999343-102031105 - strand build 33
            5Mb upstream of 2C8
LAMCVTCLIFFLVWKKSPSPTPLPTIGNRLQRNPKD
CISFQLAKEYSSVYTLYFGSWPTMVFHGYKAVKEALIDQGDKSLGRGHIPIIDDAQKRY
TSAQPFDSTFILASAPCNL
CSFLFKECFQYKNETFLSLMGLLNENVK
TTVLPLLSLVLFSYKQFP
GHFLDKNGCFNKTDYFLPFSLGK

CYP2C63PX   human = CYP2C-se3[1] (see below)

CYP2C64PX   human = CYP2C-se4[1] (see below)

Cyp2c65     mouse 
            AY227733 NW_000145 also NT_039689.1
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            Between Cyp2c55 and Cyp2c66 on chr 19
            Temp name 2CN4
            93% to Cyp2c66 73% to 2c29
NT_039689.1 + strand
5398093 MVLGVFLGLLLTCLLLLSLWRQNSQRRNLPPGPTPLPIIGNILQLDLKDISKSLRN 5398260
5406366 FSKVYGPVFTLYLGRNPAVVLHGYEAVKEAFTDHGEEFAGRGVFPVFDKFKKNC 5406527
5406732 GVVFSSGRTWKEMRRFSLMTLRNFGMGRRSIEDRIQEEARCLVDELRKTKG 5406884
5409456 EPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLTFLDILNENVEILSSPWIQ 5409614
5410489 ICNNFPAVIDYLPGRHRKLHKNFAFAEHYFLSKVKQHQESLDINNPRDFIDCFLIKMEQ 5410665
5419474 EKHNPKTEFTCENLVFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT 5419614
5424846 AKVQEEIDCVIGRHRSPCMQDRHSMPYTDAVLHEIQRYIDLLPTSLPHAVTRDVKFREYLIPK 5425034
5427909 GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDERGKFKKSDYFFPFST 5428049
5430603 GKRICVGEGLARAELFLFLTTILQNFNLKSPVDLKDLDTTPVANGFASVPPKFQICFIPI* 5430785

Cyp2c65-de9b mouse
            GenEMBL NT_039689.1 + strand
            z in Figure 2D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            detritus exon 9 between Cyp2c65 and Cyp2c66
5432237 RS*LYIPPTPGKCICVRDNLAQMKLFLFLTTILYNFNLKSVDPQELDTT 5432383
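
Pseudogene fragments in this list use names such as Cyp2c65-de9b (a detritus
exon 9), CYP2C19-de1b2b3b (exons 1, 2 and 3) and CYP2C-se2[1:2] (a solo
fragment covering parts of exons 1 and 2). The sketch below splits such
names into their parts. The reading of the trailing letter as a copy label
and of the number after "se" as a serial number is an assumption made for
illustration, and the function name parse_fragment_name is hypothetical.

```python
import re

# Parent name, then "-de" (detritus exon) or "-se" (solo exon), then the rest.
NAME = re.compile(r"^([A-Za-z0-9]+)-(de|se)(.*)$")


def parse_fragment_name(name):
    """Split a detritus/solo exon name into parent, kind and exon numbers."""
    match = NAME.match(name)
    if match is None:
        return None  # ordinary gene names have no -de/-se suffix
    parent, kind, rest = match.groups()
    if kind == "de":
        # e.g. "1b2b3b" -> exons 1, 2, 3, each tagged with the letter "b"
        pairs = re.findall(r"(\d+)([a-z])", rest)
        return {
            "parent": parent,
            "kind": "detritus",
            "exons": [int(number) for number, _ in pairs],
            "copies": [letter for _, letter in pairs],
        }
    # e.g. "2[1:2]" -> serial "2", exons 1 and 2
    bracket = re.match(r"(\d*)\[([\d:]+)\]$", rest)
    if bracket is None:
        return None
    return {
        "parent": parent,
        "kind": "solo",
        "serial": bracket.group(1) or None,
        "exons": [int(number) for number in bracket.group(2).split(":")],
    }


detritus = parse_fragment_name("CYP2C19-de1b2b3b")
solo = parse_fragment_name("CYP2C-se2[1:2]")
```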

Cyp2c66     mouse 
            AY227734 NW_000145
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            Between Cyp2c65 and Cyp2c53p on chr 19
            Temp name 2CN5
            93% to Cyp2c65 73% to 2c29
MVLGVFLGLLLTCLLLLSLWKQNSQRRNLPPGPTPLPIIGNILQLDLKDISKSLRN
FSKVYGPVFTLYLGKKPAVVLHGYKAVKEALIDHGEEFAGRGTFPVADKFIRVL
GVVFSSGRTWKEMRRFSLMTLRNFGMGKRSIEDRVQEEARCLVDELRKTK
GVPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLTFIDILNENVEILSSPWIQ
VCNNFPAIIDYLPGRHRKLLKNFDFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ
EKHNPKTEFTCENLIFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT
AKVQAEIDCVIGRHRSPCMQDRHSMPYTDAVLHEIQRYIDLLPTSLPHAVTRDVKFREYLIPK
GTTVIASLTSVLYDDKEFLNPERFDPSHFLDESGKFKKSDYFFPFST
GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKDLDTTPVANGFVSVPPKFQICFISI*

Cyp2c67     mouse 
            GenEMBL NW_030157.1 (aa 1-274 exons 1-5 minus strand)
            GenEMBL NW_022459.1 (aa 275-320 exon 6 plus strand)
            GenEMBL NW_021833.1 (aa 321-431 exons 7-8 plus strand)
                                Part of exon 9 not found
            GenEMBL NW_020256.1 (aa 469-491 end of exon 9 plus strand)
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            Between Cyp2c39 and Cyp2c68 on chr 19
            Temp name 2CN7
            95% to Cyp2c40
MDPFVVLVLCLSFLLVLSLWRQRSARGNLPPGPTPLPIIGNYHLIDMKDIGQCLTN
FSKTYGPVFTLYFGSQPIVVLHGYEAMKEAFIDHGEEFSGRGRFPFFDKVTKGK
GIGFSHGNVWKATRVFTINTLRNLGMGKRTIENKVQEEAQWLMKELKKTN
GLPCDPQFIIGCAPCNVICSIVFQNRFDYKDKDFLSLIGK
VNECTEILSSPGCQIFNAVPILIDYCPGRHNKFFKNHTWIKSYLLEKIKE
HEESLDVTNPRDFIDYFLIQRCQKKGIEHMEYTIEHLATLVTDLVFGGTE
SLSSTMRFALLLLMKHTHITAKVQEEIDNVIGRHRSPCMQDRNHMPYTNA
MVHEVQRYVDLGPISLVHEVTCDTKFRNYFIPKGTQVMTSLTSVLHDSTE
FPNPEVFDPGHFLDDNGNFKKSDYFVPFSAGKRICVGESLARMELFLFLT
TILQNFKLKPLVDPKDIDMTPKHSGFSKIPPNFQMCFIPVE*

Cyp2c68     mouse 
            GenEMBL NW_034810.1 (aa 1-161 exons 1-3 plus strand)
                                          Exon 4 not found
            GenEMBL NW_012728.1 (aa 215-273 exon 5 minus strand)
                                            Exon 6 not found
            GenEMBL NW_024952.1 (aa 321-383 exon 7, 2 copies on this contig)
            GenEMBL NW_012306.1 (aa 356-431 part of exon 7 and exon 8)
                                            Exon 9 not found
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            Between Cyp2c67 and Cyp2c40 on chr 19
            Temp name 2CN8
            96% to Cyp2c40
  1  MDPFVVLVLC LSFLLLLSLW RQRSARGNLP PGPTPLPIIG NYHLIDMKDI 
 51  GQCLTNFSKI YGPVFTLYFG SQPIVILHGY EAMKEAFIDY GEEFSGRGRI 
101  PVFDKVSKGK GIGFSHGNVW KATRVFTVNT LRNLGMGKRT IETKVQEEAQ 
151  WLMKELKKTN GSPCDPQFII GCAPCNVICS IVFQNRFDYK DKDFLSLIGK 
201  VNECTEILSS PECQIFNAVP ILIDYCPGSH NKFLKNHTWI KSYLLEKIKE 
251  HEESLDVTNP RDFVDYFLIQ RRQKNGIEHM DYTIEHLATL VTDLVFGGTE 
301  TLSSTMRFAL LLLMKHTHIT AKVQEEIDNV IGRHRSPCMQ DRNHMPYTNA 
351  MVHEVQRYID LGPNGVVHEV TCDTKFRNYF IPKGTQVMTS LTSVLHDSTE 
401  FPNPEVFDPG HFLDDNGNFK KSDYFVPFSA 

Cyp2c69     mouse 
            GenEMBL NW_024021.1 (aa 1-56 exon 1 plus strand)
            GenEMBL NW_009479.1 (aa 57-160 exon 2-3 minus strand)
            GenEMBL NW_014461.1 (aa 161-214 exon 4 plus strand)
                                            Exon 5 not found
            GenEMBL NW_024085.1 (aa 276-320 exon 6 plus strand)
            GenEMBL NW_021729.1 (aa 321-491 exons 7-9 plus strand)
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            Between Cyp2c40 and Cyp2c37 on chr 19
            Temp name 2CN9
            95% to Cyp2c40
  1  MDPFVVLVLC LSFMLLLSLW RQRSARRNLP PGPTPLPIIG NYHLIDMKDI 
 51  GQCLTNFSKT YGPVFTLYFG SQPIVVLHGY EAIKEALIDH GEVFSGRGRF 
101  PFFDKVSKGK GIGFSHGNVW KATRVFTVNT LRNLGMGKRT IENKVQEEAQ 
151  WLMKELKKTN GSPCDPQFII GCAPCNVICS IVFQNRFDYK DKDFLSLIGK 
201  VNECTEILSS PGCQIFNAVP ILIDYCPGRH NKFFKNHTWI KSYLLEKIKE 
251  HEESLDVTNP RDFIDYFLIQ RRQKNGIEHM EYTIEHLATL VTDLVFGGTE 
301  TLSSTMRFAL LLLMKHTHIT AKVQEEIDNV IGRHRSPCMQ DRKHMPYTNA 
351  MVHEVQRYVD LGPTSLVHEV TCDTKFRNYF IPKGTQVMTS LSSVLHDSTE 
401  FPNPEVFDPG HFLDDNGNFK KSDYFVPFSA GKRICVGESL ARMELFLFLT 
451  TILQNFKLKP LVDPKDIDTT PKYSGFSKIP PKFQMCFIPV E*
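
Some entries, such as Cyp2c68 and Cyp2c69, give the sequence in numbered rows
of ten-residue blocks instead of one exon per line. A small sketch for
joining such rows into one continuous string follows. It assumes the leading
number on each row is the 1-based position of the first residue on that row;
the function name join_numbered_blocks is hypothetical.

```python
def join_numbered_blocks(lines):
    """Concatenate numbered ten-residue blocks into one sequence string."""
    residues = ""
    for line in lines:
        fields = line.split()
        if not fields or not fields[0].isdigit():
            continue  # skip blank lines and header/comment lines
        offset = int(fields[0])
        # assumption: the row number is the position of its first residue
        if offset != len(residues) + 1:
            raise ValueError(
                f"row starts at {offset}, expected {len(residues) + 1}"
            )
        residues += "".join(fields[1:])
    return residues


# First two rows of the Cyp2c69 entry.
rows = [
    "  1  MDPFVVLVLC LSFMLLLSLW RQRSARRNLP PGPTPLPIIG NYHLIDMKDI ",
    " 51  GQCLTNFSKT YGPVFTLYFG SQPIVVLHGY EAIKEALIDH GEVFSGRGRF ",
]
joined = join_numbered_blocks(rows)
```

The offset check raises an error when a row is missing or a block is
shorter than expected, which helps catch copy-and-paste damage in the
numbered layout.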

Cyp2c70     mouse 
            AY227736 NW_000148 NP_663474 LOC226105, NT_039692
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            50kb downstream of Cyp2c50 on chr 19
            Temp name 2CN10
            59% to Cyp2c29
MALFIFLGIWLSCFLFLFLWNQHRGRGKLPPGPTPLPIVGNILQVYVKNISKSMGM
LAKKYGPVFTVYLGMKPTVVLHGYKAMKEALIDQGDEFSDKTDSSLLSRTSQGL
GIVFSNGETWKQTRRFSLMVLRSMGMGKKTIEDRIQEEILYMLDALRKTN
GSPCDPSFLLACVPCNVISTVIFQHRFDYNDQTFQDFMENFHRKIEILASPWSQ
LCSAYPILYYLPGIHNRFLKDVTQQKKFILEEINRHQKSLDLSNPQDFIDYFLIKMEK
EKHNQKSEFTMDNLVVSIGDLFGAGTETTSSTVKYGLLLLLKYPEVT
AKIQEEIAHVIGRHRRPTMQDRNHMPYTDAVLHEIQRYIDFVPIPSPRKTTQDVEFRGYHIPK
GTSVMACLTSVLNDDKEFPNPEKFDPGHFLDEKGNFKKSDYFVAFSA
GRRACIGEGLARMEMFLILTNILQHFTLKPLVKPEDIDTKPVQTGLLHVPPPFELCFIPV

Cyp2c71-ps  mouse 
            GenEMBL NW_000148 
            Between 2c69 and 2c37 on chr 19
            69% to Cyp2c69
14397 CP*SYNIFF*IIHVLSYLLEKIKENEELMDVTNP*DFIDYFLIQRHQ 14537 exon 5
32761 GTTVLTPLSSVLHDSKEFPNPEMFDPDHFLDGNGNFK*SDYFMPFSAGNR 32910 exon 8
39051 MCMGESLALMELILFLTTILQNF*LKSLVDLKDNNITPVYSGL 39179
39180 F*VPPTFLVCFISV 39221 exon 9

Cyp2c71-de1b  mouse 
            GenEMBL NW_000148 
            x in Figure 2D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            detritus exon 1 between Cyp2c71-ps and Cyp2c69
8628 MGPFVVLVLRLSFLLLLSL*RQRSGRGKLPPGLTPCSINGNFLQIDMKDTHQSLTN 8461
exon 1 (in opposite orientation to exons of 2c71-ps)

Cyp2c72-ps  mouse 
            NW_000145
            Hong Wang, Joyce Goldstein, Darryl Zeldin
            Between 2c29 and 2c38
            Temp name 2CN11
            88% to 2c38, 87% to 2c39
       1  MDLITFLVLT LSSLILLLLW RQRSGRGRLP PGPTPFPIIG NFLQIDGKNF 
      51  SQSLTNFSKA YGPMFTLYLG SQPIAVLHGY EAVKEALIDH GEEFSGRRNI 
     101  PMAEKINNSL GVIFSNGNRW KEIRHFTLTI LRNLGMGKRN IEDRVQEEAQ 
     151  CLVEELRKTN

Cyp2c73-ps  mouse
            GenEMBL NW_000100.1 Mm14_WIFeb01_281
            A chr 14 2C seq 55% to 2C29 
27513950 GMGNRTIEDHI*EEACSLVDELRKTNGVRCNSTFILGC 27514063
27514066 PCNVICFIFFFQNRFDYKYQGILNENVEIVSSPWIQICNNFPAIIDHLPERHRKFLEDFAFDK
ILVKVIQHQESLNINNPQEFINSFLIEMKQEEYNPKIEFAYENLILTASDMFAAGTETS
TTLR*SLLLLFKDP*VTAKVQEETDHVIVRHRSPCIQDKNLMPYTNALLHEIQRYLDLLP
T*LYHGKTCCMKFKNCLIYKGIIVIESSTYVLHDDNEFSNPERFDPSHF

CYP2C74X    Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            91% to CYP2C8, 78% to CYP2C19, probable ortholog of CYP2C8
            renamed CYP2C20.  There are only 3 amino acid differences to 
            Macaca fascicularis (cynomolgus monkey) GenEMBL S53046
            Since this is the clear ortholog of that earlier sequence 
            the name has been changed to reflect the orthology.

CYP2C75     Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            93% to CYP2C9, 92% to CYP2C19, possible ortholog of CYP2C9
            94% to 2C43

CYP2C75     Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 1/11/2005
            Clone name mfCYP2C9v3
            2 amino acid differences to 2C75 of Macaca mulatta
            93% to 2C9 human, 93% to 2C43, 76% to 2C20 Macaca fascicularis

CYP2C76     Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 1/11/2005
            Clone name Novel_mfCYP2C
            72% to 2C18 human, 71% to 2C43, 69% to 2C20 Macaca fascicularis, 
            71% to 2C75 Macaca mulatta, 69% to 2C74 Macaca mulatta

CYP2C76     Callithrix jacchus (white-tufted-ear marmoset)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 9/29/2005
            83 aa 100% to CYP2C76 Macaca fascicularis 
            covers I-helix region

CYP2C76     Cercopithecus aethiops (African green monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 9/29/2005
            N-term 168 aa 100% to CYP2C76 Macaca fascicularis

CYP2C76     Macaca mulatta (rhesus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 9/29/2005
            98% to CYP2C76 Macaca fascicularis
            complete sequence

CYP2C77    rat
variant of 2C6 13 aa diffs to CYP2C6v1_v1, 16 aa diffs to 2C6v2
This gene has three frameshifts
244357850 MDLVMLLVLTLTCLILLSIWRQSSGRGKLPPGPIPLPIIGNIFQLNVKNITQSLTS (0) 244358017
244359760 FSKVYGPVFTLYFGMKPTVILHGYEAVKEALIDHGEEFAERGSFPVAEKINKDL (1) 244359921
244360096 GIIFSHGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEARCLVEE 244360230
244360232 MRKTN 244360246
244361085 GSPCDPTFILGCAPCNVICSIIFQNRFDYKDQDFLNLMEKLNENMKILSSPWTQ 244361246
244362321 FCSFFPVLIDYCPGSHTTLAKNVYHIRNYL 244362410
244362412 LKKIKEHQESLDVTNPQDFIDYYLIKWKQ 244362498 
244381928 ESHNPHSEFTLENLSITVTDLFGAGTETTSTTLRYALLLLLKYPEIT 244382068
244392235 AKVQEEIDRVFGKHRSPCMQDRSRMPYTDAMIHEVQRFIDLIPTNLPHAVTCDIKFRNYLIPM 244392423
244394012 GTTIITSLSSVLHDSKEFPNPEIFDPGHFLDGNGKFKKSDYFMPFSA 244394152
244395307 GKRMFAGEGLA 244395339
244395341 RMELFLFLTTILQNFKLKSVLQPKDIDTTPVFHGFASLPPFYELCFIPL 244395487
rat 2C cluster in chromosome order

CYP2C77-de1b2b3b4b5b rat
frag c Pseudogene 96% to 2C6_v1 exons 1-5 with partial deletion of exon 3 Plus Strand
244337898 MDLVMLLVLTLSCLILLSIWSQSSGRGKLP 244337987
244337987 SGPTPLPIIGNFFHLDLKNITQSLTS 244338064
244339793 FSKVNGSVFTLYFGMKPIVILHGYEAIK*GLIDHREEFTERGSFPVAEKINKGL 244339954
244340129 GIAFSHGNRWKEIRRFTLMTLQNLGMGK 244340212
244341157 GSPCDPTFILGCAPCNVICSIIFQNSFDYKDQDFLSLMEKLNENIKIVSSPWI* 244341318
244342872 FCSSFPVFIDYCPGIHMTLA 244342931
244342933 KNVYHTRNYILKKIKEHQESLDVTNPHDFIDYYLIKWKQ 244343049
rat 2C cluster in chromosome order

CYP2C78    Balaenoptera acutorostrata (Minke whale)
           No accession number
           Hisato Iwata
           submitted to nomenclature committee 1/6/05 
           58-60% to all four CYP2Cs in human

CYP2C79  rat
         GenEMBL XM_219933 
minus strand 72% to 2C6_v1 95% to seq e, 100% to seq q (exon 9), 
93% to seq z (exon 5) (temp name = CYP2CNEWD)
244590183 MILGVFLGLFLTCLLLLSLWKQNFQRRNLPPGPTPLPIIGNILQIDLKDISKSLRN 244590016
244575990 FSKVYGPVFTLYFGRKPAVVLHGYEAVKEALIDHGEEFAGRGIFPVAEKFNKNC 244575829
244575612 GVVFSSGRTWKEMRRFSLMTLRNFGMGKRSIEDRVQEEARCLVDELRKTN 244575463
244553851 GVPCDPTFILGCAPCNVICSIVFQNRFDYKDQEFLALIDILNENVEILSSPWIQ 244553690
244525726 ICNNFPAIIDYLPGRHRKLLKNFAFAKHYFLAKVIQHQESLDINNPRDFIDCFLIKMEQ 244525550
244524359 EKHNPKTEFTCENLIFTASDLFAAGTETTSTTLRYSLLLLLKYPEVT 244524219
244517844 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQRYIDLLPTSLPHALTCDMKFRDYFIPK 244517656
244516177 GTTVIASLTSVLYDDKEFPNPEKFDPSHFLDENGKVKKSDYFFPFST 244516037
244496745 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI 244496566
rat 2C cluster in chromosome order

CYP2C79-de9b rat
exon 9 62% to 2C79 2 aa diffs to seq d and seq p
244491372 G*WICVREDLAQMTLFLFCPTILKNFNLNSQVNPKEL 244491262
rat 2C cluster in chromosome order

CYP2C79-se1[9] rat
frag q Exon 9 100% to 2C79
243885148 GKRICVGEGLARTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIPI* 243885330

CYP2C80 rat
        GenEMBL XM_217906.2 GNOMON exon 2 on AC109577.4 in HTGS 
92% to 2C24, 73% to 2C11 (temp name = CYP2CNEWC)
MGWLSDP wrong N-term from GNOMON prediction
Correct N-term possibly in a sequence gap
244632544 FSEVYGPVFTLYFGLKPTVVVYGYEVVKEVLDGEEFSGRGVFPIVTKVNNDL 244632389
          this exon 2 does not match 2C24
244632205 GVIFSNGTKWKELRRFSLMTLRNFGMGKRSIEDRIQEEASCLVEELRKTN 244632056
244628281 GSLCDPTFILSCAPSNVICSVIFHNRFDYKDENFLNLMEKFNENFKILNSPWMQ 244628120
244624041 VCNAIPAFIDYLPGSHNKVIKNFAEIKSYILRRVKEHQETLDMDNPRDFIDCFLIKIE 244623868
244620080 QEKHNPCTEFTIQSLVATVTDVFVAGSETTSTTLRYGLLLLLKHTEVT 244619937
244619006 AKVQEEIDHVIGRHRRPCMQDRTRMPYTDAMVHEIQRYINLIPNNVPHAATCNVRFRNYVIPK 244618818
244616897 GTDLITSLTSVLHDDKEFPNPEVFDPGHFLDEHGNFKRSDYFMPFSS 244616757
244614348 GKRMCVGEALARMELFLLLTTIVQNFNLKSFVATKDIDTTPLTNTFGCVPPSYQLYFTPR* 244614166
rat 2C cluster in chromosome order

CYP2C81 rat
93% to 2C7 28 aa diffs missing exon 1 Plus Strand, 91% to seq j (exons 6,7)
93% to seq k (exons 2,3)
244672079 FSKTYGPVFTLYLGSQPTVILHGYEAIKKALIDHGEKFSGRGSYPMIENVTKGF 244672240
244672408 GIAFSNGNRWKEIRRFTIMTLRNLGMGKRNIEDRVQEEAQRLVEELRKTK 244672557
244681144 GSPCDPSFILNCAPCNVICSITFQNHFDYKDKEILTFMEKVNENVKIMSSPRMQ 244681305
244683123 VCNSFPSLIDYFPGTHHKIAKNINYMKSYLLKKIEEHQESLDVTNPRDFVEYYLIKQKQ 244683299
244699290 ANHIEQSEYSHENLACSIMDLIGAGTETMSSTLRYALLLLMKYPHVP 244699430
244713313 AKVQEEIDHVIGRYRSPCMQDRSHMPYTDAMIHEVQRFINFVPTNLLHAVTCDIKFRNYLIPK 244713501
244717457 GTKVLTSLTSVLHGSKEFPNPEMFDPGHFLDENGNFKKSDYFLPFSA 244717597
244718606 GKRACVGEGLARMELFLFLTTILQNFKLKSLVHPKDIDTRPVLNGFASLPPTYQFCFIPS 244718785
rat 2C cluster in chromosome order

CYP2C81-de7b   rat
frag a Exon 7 minus Strand 100% to seq r, 80% to 2C13
244724629 LRVQEEIDQVIGRNPSPCMQDRSHMPYTNAMVHEVQR*SNIVPNNIVYEVTCDTKFRNYFIPK 244724441

CYP2C81-de8b   rat
frag 1 Exon 8 93% to 2C7 Plus Strand
244737232 GTTVLTSLTSVLHDSKEFPNPEMFDPGHFLDENRNFKKSDYFMPFSA 244737372

CYP2C81-de8c   rat
frag 2 Exon 8 76% to 2C13 Plus Strand 87% to seq u
244764239 GMMVITSLSSVLHYNKEFPNPERFDPGYFLDGNGNFKKTDYFILFSA 244764379

CYP2C81-de1d   rat
frag 3 Exon 1 with frameshift Plus Strand 85% to seq e 83% to seq w
244783632 MDLVVVL 244783652
244783654 CSVSSLLLFSLWRQSSWRRKLPPGPNPLPIIGNFLQIDLNNLCQSLNN (0) 244783797

CYP2C81-de6e7e rat
frag 4 exon 6 70% to 2C13 Plus Strand
244799349 ELELEHLGSMVTDLFFAGIESIRTTMIFALLFLLNTHTSQ 244799468
exon 7 82% to 2C13, 86% to seq r and seq a
244801583 LQNRSHMPYTNAMVHEVQRYSDIVPNNIVHEVTSDTKFRNYFIPK (0) 244801717

CYP2C81-de1f2f3f rat
frag 5 Exons 1,2,3 84% to 2C7 variant Minus Strand
244826982 MDLVTFLVLTLSSLILLSLWR*NSRRRKLPPGPTPLLIIGNFLQLDVKNVSQSLTM (0) 244826815
244813456 FSKAYGPVFTLYLGSQPTVILHGYEAVKETLIDHGEEFSGRGSFPMVEKAFKCF 244813295
244813129 GIVFSNGNR*KEIRQFIIMTLQNLGMGKRNIEDHVQEEAQCLVEELRKTK 244812980

CYP2C82P rat
frag e Exons 1,4,4,5,6,7,8,9 almost an exact duplicate of seqs w,x,y,z, 
exons 6-9 of the wxyz cluster in a seq gap
244218695 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLINE (0) 244218865
244233879        LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 244234019
244240189 GVPCDPTFILGCAPCNVICSIVFQNHFNYKGQEFLALIDTLNENVEILSSPWIQ 244240350
244265531 ICNNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ 244265707
244266904 KHNPKTEFTCKNLIFTASDLFAAGTETTSPTLRYSLLLLPKYPEV 244267038
244273480 AKVQEEIDHVIGRHRSPCMQDRHHMPYTDAVLHEIQ*YIDLLPTSLPHALTCDMKFRDYFIPK 244273668
244275197 GTTVIASLTSVLYDDKEFPNPEKFDLSHFLDENGKFKKSDYFFPFST 244275337
244286429 GKRICVGEGLAQTELFLFLTTILQNFNLKSPVDLKELDTNPVANGFVSVPPKFQICFIP 244286605

CYP2C82P-de9b frag d Exon 9 identical to seq p
244289962 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 244290072

rat 2C cluster in chromosome order

CYP2C82P-se[1:4:4:5] rat
frag z Exon 5 minus strand 1 aa diff to CYP2C82P
243632036 ICDNFPAIIDYLPGRHRKLLKKFAFAKHYFLAKVIQHKESLDINNPRDFIDCFLIKMEQ (0) 243631860
frag y Exon 4 minus strand 92% to CYP2C82P
243654367 GVPCDPTFILGCAPCNVICSIVFQNHFNYKDQEFLALIE 243654251
243654249 LNENVEILSSP*IQ 243654208
frag x exon 4 minus strand 100% to CYP2C82P short exon 4
243659542 LILSYASCNVICSITFQNRFDYKDKEILTLMEKVNENVKIMSSPWIQ 243659402
frag w Exon 1 minus strand 100% to CYP2C82P
243675609 MDPVVVLMPSFSSLLLLSLWRQNSWRRKLPPGPNPLPIIGSFLQIDLNDLCQSLIN 243675442
rat 2C cluster in chromosome order

CYP2C83     Cercopithecus aethiops (African green monkey)
            No accession number 
            Catherine Booth-Genthe
            Merck Research Laboratories
            92% to human CYP2C9, 90% to human CYP2C19
            cannot tell if this is the ortholog of
            2C9 or 2C19 without map information
            98% to 2C43 probable ortholog, name may be changed to 2C43

CYP2C84   Phalacrocorax carbo (Common cormorant)
          No accession number
          Hisato Iwata
          submitted to nomenclature committee 5/19/05 
          81% to 2C45 chicken (possible ortholog), 56% to 2C11 rat

CYP2C85     Bos taurus (cow)
            See cattle page for details
MDLPVVLVLCLCCLLLISLWKQSSGKGKLPPGPTPLPILGNILQLDVKDISKSVSN
LSKVYGPVFTLYFGMNPLVVLHGYEAVKEALIGLGEEFSGRGSCPVIQRASKGY
GVIFSNGKIWKETRRFSLMTLRDFGMGKRSMEDRVQQEACCLVEELRKTD
GLPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ
LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ
EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPEVT
AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK
GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST
GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV*

CYP2C86     Bos taurus (cow)
            See cattle page for details
MERLEITTLALVICVTCLVFLFVWKKSHKGLGKLPPGPTPLPIIGNLMQLNLKDIPASLSK
LAKQYGPVYTLHLGSQTTVVLHGYEVVKEALIDQGDEFLGRAHFPIIDDTQRGY
GLIFSNGDTWKQMRRFSSLMTLRDFGMGKRSLEERIQEEAQFLVEEFRKSE
AQPFNPAVTLSCATCNIICSILFNERFHYQDKTLHSLLDLLNENFNRISSLWNQ
IYNLWPKLIKPLPGEHRAFSKRLKDVHYFVLEKVKEHQKSLNHNNPRDYIDCFLSRMEQ
EKQNPESQFHLENLATCGSNLFSAGVETTTATLSYGFLLLMKYPEVQ
AKVHEEIDRVIGRTRSPCMKDKMKLPYTEAVLHEIQRYVTLVPSNLPHAVVQDTKFRQYVIPK
GTTVLPLLSSILYDCKEFPNPEKFDPGHFLDKNGSFRKTKYFVAFSI
GKRACVGEGLAQMELFLFFTTILQNFVLKPLGETKDIETKPIVIGLINMPPPFKLCLIPR*

CYP2C87     Bos taurus (cow)
            See cattle page for details
MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNIFQLDVKNISKSLTS
LSKVYGPVFTVYFGMKPTVVLHGYEAVKEALIDLGEEFSRRGSFPVIERNVKGH
GIVFSNGKTWKETRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTN
GLPCDPTFILGCAPCNVICSIIFQNRFDYKDQTFLNLMKTINENIKILGSPWIQ
VLNIFPVLLDFFPWSYSYKKLYTNTAYVKNYVLEKTREHQASLDINNPRDFIDCFLIKMEQ
EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT
AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK
GTDILTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFRKSDYFMAFSA
GKRVCVGEGLARMELFLFLTTILQTFTLKSVVDPKDLDTTPAVTGIANVPPPYQLCFIPV*

CYP2C87-de2b   Bos taurus (cow)
            6kb downstream of 2C87 without an intervening exon 1, same orientation
LSKVCGPVFTVYFGMKPTVVLHGYEALQEALIDLGEEFSGRYSFPVNEKTRRGH

CYP2C88     Bos taurus (cow)
            See cattle page for details
MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNILQLDVKNISKSLTN
LSKVYGPVFTVYFGMKPIVVLHGYEAVKEALIDLGEEFSGRGMFPLAERANIVN
GILFSNGKTWKEIRRFSLMTLRNFGMGKRSIEDRVQEEACCLVEELRKTN
GLPCDPTFILGCAPCNVICSIIFQNRFDYKDPVFLDLMERLNEILRILSSPWVQ
VCNNFPALFDYLPGSHNKVLKNVANLKSFVLEKAMEHKASLDINNPRDYIDCFLIRMEQ
EKQNQQLEFTLENLTTTVFDLFGAGTETMSTTLRYGLLLLLKHPEVT
AKVQEEIDRVIGRHRSPCMQDRSHMPYTDAVVHEIQRYIDLVPSSLPHMVTHDIELRNYIIPK
GTGVLVSLTSVLYDDKVFPNPEMFDPGHFLDDSGNFKKSDHFMPFSA
GKRICAGESLARMEVFLFLTVILQKFTLKSVVDPKDIDTTPIANGFASVPPPYKLCFIPL

CYP2C89     Bos taurus (cow)
            See cattle page for details
     XXXXXGPVFTLYFGMKPTVVLHGYEAVKQVLIDQSEEFSGRGSLPVADNINQGL
     GIVFSNGEIWKQTRRFSLMVLRNMGMGKRTIEHRIQEEALCLVEALKKTN
     GSPCDPTLLLSCAPCNVICSIIFRNRFEYNDERLLTLIKYFNENSRLVSTPWVE
     LYNTFPSLLHYFPGSHNTIFKNMTEQRKFILEEIKKHQESLDLNNPQDFIDYFLIKMEK
     EKHNKHSEFTMDNLITTVWDVFSAGTETTSLTLRYGLLLLLKHPEVT
     AKVQEEIDRVVGRNRSPCMQDKSCMPYTDAVLHEIQRYIDLVPSSMPHAATQDVKFREYLIPK
     GTVILTSLTSVLHDDNEFSNPGQFDPGHFLDESGNFKKTDHFMAFSA
     GKRVCVGEGLARMELFLLLVSILQHFTLKSVVDPKHIDTAPSFKGLISIPPFCEMCFIPV* 1292

CYP2C90     Bos taurus (cow)
            See cattle page for details
LSNTYGPVFTVYFGLRPTVVLHGYEAVKEALIDQGEEFSGRGNIPMSQRVNKGY
GIIFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEAHCLVEELRKTN
GSPCDPTFILGCAPCNVICSIIFQNRFDYTDQNFLNLLDKFNENLQVVSSPWMQ
VCNTFPILIDYFPGSHNKLFKNFAYIRSYVLEKVKEHQATLDINNPRDFIDCFLIKMEQ
EKHNQEMEFTFENLIASVSDLFGAGTETTSTTLRYGLLMLLKHPEVT
AKVQEEIDRVIGRHRSPCMQDRSHMPYMDAVVHEIQRYIDLVPTNLPHAVTRDIKFRNYLIPK
GTTVVTSLSSVLHDEKEFPNPKVFDPAHFLDESGNFKKSDYFMAFSA
GKRSCVGEGLARMELFLFLTTILQKFTLKSVVDPKDLDTTPVSSGFGHVPPPYQLCFTPL*

CYP2C91   Sus scrofa (miniature pig) 
          no accession number
          Haitao Shang
          Submitted to nomenclature committee May 23, 2007
          Partial seq. differs from known pig sequences 66% to 2C36 
          frameshift and small deletion 
          pseudogene?

CYP2C92   horse
          No accession number
          Heather Knych
          Submitted to nomenclature committee June 25, 2007
          83% to CYP2C87 cow, 81% to CYP2C49 pig
 
CYP2C-se1[7] human
            NT_022154.9|Hs2_22310 
            2C pseudogene fragment chr 2
            old CYP2C56P
            Chr2q24.3 165142570-165142755 + strand Build 33
1768955 SKVQEETDHAVGRHWRPCMQDRSHMPYTEAMVHEVQRH*PHPTNVPHALTSDIKFRNYLLPK 1769140

CYP2C-se2[1:2] human
            NT_008583.11|Hs10_8740 
            Chr10q21.3 66415290-66415135 - strand, Build 33 in MER1_type repeat
            chromosome 10 pseudogene frag parts of exons 1 and 2
            old name = CYP2C61P
1832658 KGKLPHDLTSFLFVGNILQLNSKNLSKSITMLAKDYGPGFTVYFGIKPTVVV 1832813

CYP2C-se3[1] human
            NT_011512.5|Hs21_11669 
            chromosome 21 51% to 2C9
            chr21q21.2 25740563-25740423 build 33 - strand bracketed by L1 repeats
            old name = CYP2C63P
12398358 CPSCLILLFLWNGSYAKGKLLPGPIPLPIV*NILPLRSMNTSKSISMVS 12398212

CYP2C-se4[1] human
            NT_011602.7|HsX_11759 
            2C pseudogene fragment chr X 57% to 2C8
            ChrXq28 147659303-147659476 + strand Build 33
            inside MTMR1 intron 3 (myotubularin-related protein 1)
            old name = CYP2C64P
435396 ASVDLAAVLVLFLSHFLFLSLWKQSSEREKLLPGPTPIRIIGNILELDLKDICKSLSDVN 435575
435576 MLYAPL 435593

Cyp2c-se5[9] mouse
            GenEMBL NW_000107.1|Mm16_WIFeb01_286
            2c exon 9 fragment on chr 16
42687727 PFSTGKLICVGEGLARAELLLLLTTILQNFNLKSPVDLKDLDTIPVANG 42687873

CYP2C-se6[9] rat
frag p exon 9 100% to CYP2C82P-de9b
243895387 GKWICVREDLAQMTLFLFCPTILKNFNLKSQVNPKEL 243895497

CYP2C       rat
            no accession number (639bp)
            Zaphiropoulos,P.
            submitted to nomenclature committee
            82% amino acid identity to exon 2 of 2C24

CYP2C       rat
            no accession number (397bp)
            Zaphiropoulos,P.
            submitted to nomenclature committee
            similar to exon 3 of 2C7 
            possible pseudogene, with stop codon at location of conserved trp.

CYP2C       rat
            PIR B60822 (19 amino acids)
            Amelizad, Z., Narbonne, J.F., Wolf, C.R., Robertson, L.W. and
            Oesch, F.
            Effect of nutritional imbalances on cytochrome P-450 isozymes
            in rat liver.
            Biochem. Pharmacol. 37, 3245-3249 (1988)

CYP2C       dog
            PIR A60465 (33 amino acids)
            Komori, M., Shimada, H., Miura, T. and Kamataki, T.
            Interspecies homology of liver microsomal cytochrome P-450. A
            form of dog cytochrome P-450 (P-450-D1) crossreactive with
            antibodies to rat P-450-male.
            Biochem. Pharmacol. 38, 235-240 (1989)
            Note: probable N-terminal of 2C21 which is missing the N-terminal region

CYP2C       horse
            PIR PN0659 (16 amino acids)
            Komori, M., Higami, A., Imai, Y., Imaoka, S. and Funae, Y.
            Purification and characterization of a form of P450 from
            horse liver microsomes.
            J. Biochem. 114, 445-448 (1993)

2D Subfamily

CYP2D1      rat
            PIR A30495 (19 amino acids)
            Gonzalez, F.J., Matsunaga, T., Nagata, K., Meyer, U.A.,
            Nebert, D.W., Pastewka, J., Kozak, C.A., Gillette, J.,
            Gelboin, H.V. and Hardwick, J.P.
            Debrisoquine 4-hydroxylase: characterization of a new P450
            gene subfamily, regulation, chromosomal mapping, and
            molecular analysis of the DA rat polymorphism.
            DNA 6, 149-161 (1987)

CYP2D1      rat
            PIR S39761 (13 amino acids)
            Ohishi, N., Imaoka, S., Suzuki, T. and Funae, Y.
            Characterization of two P-450 isozymes placed in the rat
            CYP2D subfamily.
            Biochim. Biophys. Acta 1158, 227-236 (1993)

CYP2D1      rat
            GenEMBL J02867
            chr7: 120808284-120803991 (- strand)
MELLNGTGLWSMAIFTVIFILLVDLMHRRHRWTSRYPPGPVPWPVLGNLLQVDLSNMPYS
LYKLQHRYGDVFSLQKGWKPMVIVNRLKAVQEVLVTHGEDTA
DRPPVPIFKCLGVKPRSQGVILASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEA
GHLCDAFTAQAGQSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMVKLVEESLTE
VSGFIPEVLNTFPALLRIPGLADKVFQGQKTFMALLDNLLAENRTTWDPAQPPRNLTD
AFLAEVEKAKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQRRV
QQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRFTSCDIEVQDFVI
PKGTTLIINLSSVLKDETVWEKPHRFHPEHFLDAQGNFVKHEAFMPFSA
GRRACLGEPLARMELFLFFTCLLQRFSFSVPVGQPRPSTHGFFAFPVAPLPYQLCAVVREQGL

CYP2D2      rat
            GenEMBL X52027 X52455
            chr7: 120834409-120830514 (- strand)
MGLLIGDDLWAVVIFTAIFLLLVDLVHRHKFWTAHYPPGPVPLPGLGNLLQVDFENMPYS
LYKLRSRYGDVFSLQIAWKPVVVINGLKAVRELLVTYGEDTA
DRPLLPIYNHLGYGNKSKGVVLAPYGPEWREQRRFSVSTLRDFGVGKKSLEQWVTEEA
GHLCDTFAKEAEHPFNPSILLSKAVSNVIASLVYARRFEYEDPFFNRMLKTLKESFGE
DTGFMAEVLNAIPILLQIPGLPGKVFPKLNSFIALVDKMLIEHKKSWDPAQPPRDMTD
AFLAEMQKAKGNPESSFNDENLRLVVIDLFMAGMVTTSTTLSWALLLMILHPDVQRRV
HEEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADIVPTNIPHMTSRDIKFQGFLI
PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA
GRRACLGEPLARMELFLFFTCLLQRFSFSVLAGRPRPSTHGVYALPVTPQPYQLCAVAR

CYP2D3      rat
            GenEMBL X52028
            Chr7: 120817315-120813086 (- strand)
MELLAGTGLWPMAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQVDLCNMPYS
MYKLQNRYGDVFSLQMGWKPVVVINGLKAVQELLVTCGEDTA
DRPEMPIFQHIGYGHKAKGVVLCTYGPEWREQRRFSVSTLRNFGVGKKSLEQWVTDEA
SHLCDALTAEAGRPLDPYTLLNKAVCNVIASLIYARRFDYGDPDFIKVLKILKESMGE
QTGLFPEVLNMFPVLLRIPGLADKVFPGQKTFLTMVDNLVTEHKKTWDPDQPPRDLTD
AFLAEIEKAKGNPESSFNDANLRLVVNDLFGAGMVTTSITLTWALLLMILHPDVQCRV
QQEIDEVIGQVRHPEMADQAHMPFTNAVIHEVQRFADIVPMNLPHKTSRDIEVQGFLI
PKGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA
GRRACLGEPLARMELFLFFTCLLQRFSFSVPTGQPRPSDYGVFAFLLSPSPYQLCAFKR

CYP2D3-de8b rat
            UCSC browser Chr 7 (+ strand) 120811066-120811206 
            2aa diff to 2D2/2D3 exon 8
            lies between 2D1 and 2D3, a in fig. below
            GTTLIPNLSSLLNDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA
rat, mouse and human 2D clusters

CYP2D4_v1   rat
            GenEMBL M22331.1 X52029
            ONLY 5 AA DIFFS to CYP2D4_v2
            120781146-120776576 (- strand)
            note: 2D18 is an alternate splice of an untranslated exon of 
            the 2D4 gene.  The 5 aa diffs are allelic variation
            both haplotypes are found in the same library
            see Supporting document
MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQIDFQNMPAGFQK
LRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTADRPPLHFNDQSGFGPRSQ
GVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEARCLCAAFADHS
GFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEEESGFLPM
LLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTDAFLAEVEK
AKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILRPDVQC
RVQQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLIPK
GTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA
GRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSNYGVFGALTTPRPYQLCASPR

CYP2D4_v2   rat
            GenEMBL U48219 S77859 
            ONLY 5 AA DIFFS to CYP2D4_v1 
            120781146-120776576 (- strand)
            note: 2D18 is an alternate splice of an untranslated exon of 
            the 2D4 gene.  The 5 aa diffs are allelic variation
            both haplotypes are found in the same library
            see Supporting document

CYP2D5      rat
            GenEMBL X52030 X52458
            chr7: 120799154-120794726 (- strand)
MELLNGTGLWPMAIFTVIFILLVDLMHRHQRWTSRYPPGPVPWPVLGNLLQVDPSNMPYSMYK
LQHRYGDVFSLQMGWKPMVIVNRLKAVQEVLVTHGEDTADRPPVPIFKCLGVKPRSQ
GVVFASYGPEWREQRRFSVSTLRTFGMGKKSLEEWVTKEAGHLCDAFTAQN
GRSINPKAMLNKALCNVIASLIFARRFEYEDPYLIRMLTLVEESLIEVSGFIPE
VLNTFPALLRIPGLADKVFQGQKTFMAFLDNLLAENRTTWDPAQPPRNLTDAFLAEVEK
AKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQR
RVQQEIDEVIGQVRCPEMTDQAHMPYTNAVIHEVQRFGDIAPLNLPRITSCDIEVQDFVIPK
GTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSA
GRRACLGEPLARMELFLFFTCLLQHFSFSVPAGQPRPSTLGNFAISVAPLPYQLCAAVREQGH

CYP2D6      human
            GenEMBL M24499 (1195bp)
            Manns,M.P., Johnson,E.F., Griffin,K.J., Tan,E.M. and Sullivan,K.F.
            Major antigen of liver kidney microsomal autoantibodies in
            idiopathic autoimmune hepatitis is cytochrome P450db1
            J. Clin. Invest. 83, 1066-1072 (1989)

CYP2D6      human
            GenEMBL A20907 (1768bp)
            Genetic assay for cytochrome p450
            Patent: WO 9110745-A 13 25-JUL-1991;

CYP2D6      human
            GenEMBL M33189 (5503bp)
            Gonzalez,F.J.
            unpublished (1990)

Note on the 2D6 locus.  The normal situation is CYP2D8P, CYP2D7P, CYP2D6
            Alleles with an extra pseudogene have been found
            CYP2D8P, CYP2D7AP, CYP2D7BP, CYP2D6
              Heim,M.H. and Meyer,U.A.
              Evolution of a highly polymorphic human gene locus for 
              a drug metabolizing enzyme.
              Genomics 14,49-58 (1992)
            The 2D7AP sequence is 94.7% identical to CYP2D7P
            The 2D7BP sequence is created by gene conversion between 
            2D7AP and CYP2D6 and it is named CYP2D8BP below.

CYP2D7P     human 
            GenEMBL M33387
            The typical human 2D7 pseudogene
            In the 1996 nomenclature this was named CYP2D7P1

CYP2D7P1    human
            Same as CYP2D7P

CYP2D7P2    human 
            Same as CYP2D7AP

CYP2D7AP    human
            GenEMBL X58467 (13,278bp)
            Heim,M.H. and Meyer,U.A.
            Evolution of a highly polymorphic human gene locus for 
            a drug metabolizing enzyme.
            Genomics 14,49-58 (1992)
            Note: CYP2D7AP is 94.7% identical to CYP2D7P, both are 
            pseudogenes. In the 1996 nomenclature this was named CYP2D7P2

CYP2D7BP    human 
            This is the authors' name for CYP2D8BP below
            In the 1996 nomenclature this was named CYP2D8P2

CYP2D8P     human
            GenEMBL M33387
            The typical human 2D8 pseudogene
            In the 1996 nomenclature this was named CYP2D8P1

CYP2D8P1    human
            Same as CYP2D8P

CYP2D8P2    human 
            Same as CYP2D7BP and CYP2D8BP

CYP2D8BP    human
            GenEMBL X58468 (13,677bp)
            Heim,M.H. and Meyer,U.A.
            Evolution of a highly polymorphic human gene locus for 
            a drug metabolizing enzyme.
            Genomics 14,49-58 (1992)
            This gene is called CYP2D7BP by the authors
            Note: CYP2D8BP is a chimeric gene composed of part of 
            CYP2D7AP and part of CYP2D6.  There are only 14 base 
            changes in 13,677 base pairs relative to these parents.
            This gene is different from CYP2D8P.  It is a pseudogene.
            In the 1996 nomenclature this was named CYP2D8P2
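
The synonyms for the human 2D7/2D8 pseudogenes listed above can be collected
in a small lookup table.  This is only a sketch of how the names in these
entries map onto the 1996 nomenclature; the dictionary name NAME_1996 and the
helper to_1996_name are hypothetical and are not part of any existing tool.

```python
# Older or author-assigned names mapped to the 1996 nomenclature names,
# taken from the human CYP2D7/CYP2D8 pseudogene entries in this list.
NAME_1996 = {
    "CYP2D7P": "CYP2D7P1",   # the typical human 2D7 pseudogene
    "CYP2D7AP": "CYP2D7P2",  # extra pseudogene, 94.7% identical to CYP2D7P
    "CYP2D7BP": "CYP2D8P2",  # authors' name for CYP2D8BP
    "CYP2D8BP": "CYP2D8P2",  # chimera of CYP2D7AP and CYP2D6
    "CYP2D8P": "CYP2D8P1",   # the typical human 2D8 pseudogene
}


def to_1996_name(name):
    """Return the 1996 name for a known synonym, else the upper-cased input."""
    key = name.strip().upper()
    return NAME_1996.get(key, key)


if __name__ == "__main__":
    for old in ("CYP2D7BP", "CYP2D8P", "CYP2D6"):
        print(old, "->", to_1996_name(old))
```

Names that are not in the table, such as CYP2D6, are returned unchanged apart
from case.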

Cyp2d9      mouse
            GenEMBL J04471 M24262 (846bp) M24267 (3367bp)
            Wong,G., Itakura,T., Kawajiri,K., Skow,L. and Negishi,M.
            Gene family of male-specific testosterone 16-alpha-hydroxylase
            (C-P-450-16-alpha) in mice: Organization, differential regulation,
            and chromosome location
            J. Biol. Chem. 264, 2920-2927 (1989)

Cyp2d9-de1b2b  mouse
            GenEMBL NT_039621.1 + strand
            x in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            exons 1 and 2  8-10kb upstream of 2d9 
43879793 MELLTGTDLWSVAIFTVIFILPVDLLHRRQRWTSRCPPGPVPWPVLGNLLQVDLDNMPYSLYK 79981
43880823 XXNRYGDMFSLHMAWKPMVVINGLKAMKEVLLTCGEDTADSPPVPIYEHRGXXXXXX 80969

Cyp2d9-de1c5c6c7c  mouse
            GenEMBL NT_039621.1 + strand
            y in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            exons 1,5,6,7 between 2d9 and 2d10 (uup)
43869836 MELLTGTELWPVAIITVIFILLVDLMHYHQLWTSHY 69943
43869943 PPGPVLWPVLGNLLQMDLHNMPHSMYK 70023
43872058 VLNTFPILLCIPGWADKVFPG*STFLTMVDKLVTEPKRT*DPDQPPCDLIDAFLAEMXX 72228
43872341 AKGNPSSNFNDANLRLVVFNLFGAGIVTSSITLTWVLLLMVLHPDVQ 72481
43872703 RLHQETDEVIGHVWWPERQSQX 72765
43872768 LMPYTNAVIHEVQHYTGIIPIPLPHRTSSDIEMQDFLITK 72887

Cyp2d9-de1d6d7d  mouse
            GenEMBL NT_039621.1 - strand
            z in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
            exons 1,6,7 10kb upstream of Cyp2d9-de1c5c6c7c
43859756 MELLTGTSLWPVAILTVIFILLQDLMHQQKCCTSCYLPGTVLWTLQRNLLQVDLHSMPHSLCK 59568
43858655 AKGNLESSFNDANLSLVVLDQFGTGIVASSVTLTWGLLLTILNPDVQ 58515
43858292 RMQQEIDKVIEHVW*TEMVHQAYMPYTNAAIHEVQRYKDIIPIPLPHRTSSDVEMQDFLITK 58107

Cyp2d10    mouse
            GenEMBL J04471 M24263 M24265 M24268 (4828bp)
            Wong,G., Itakura,T., Kawajiri,K., Skow,L. and Negishi,M.
            Gene family of male-specific testosterone 16-alpha-hydroxylase
            (C-P-450-16-alpha) in mice: Organization, differential regulation,
            and chromosome location
            J. Biol. Chem. 264, 2920-2927 (1989)
            
Cyp2d11    mouse
            GenEMBL J04471 M24264 M24266 (5661bp)
            Wong,G., Itakura,T., Kawajiri,K., Skow,L. and Negishi,M.
            Gene family of male-specific testosterone 16-alpha-hydroxylase
            (C-P-450-16-alpha) in mice: Organization, differential regulation,
            and chromosome location
            J. Biol. Chem. 264, 2920-2927 (1989)
            
Cyp2d12     mouse
            no accession number
            Negishi,M.
            submitted to nomenclature committee in 1990, but never published.
            ESTs AI116003 ue25f10.x1 (295-end 2 diffs, 1fs) AI785325 uj40c11.x1 
            (326-end 1 diff) AI527869 uj30b05.y1 (1-241 4 diffs, 2fs) AA986388 
            uc82e10.x1 (307-end 4 diffs)
Public Cyp2d12 sequence assembled from ESTs.  Places where the ESTs do not match
Negishi's sequence are shown in (); the EST residue is given.  At these sites Y,
G, N, A and R are each observed in multiple ESTs and are probably the correct
amino acids.  At the last variable site F is seen twice and S is seen twice, so
this may be a polymorphic site.
MELLTGTDLWSVAIFTVIFILLVDLM (Y) RRQSWTSCYPPGPVPWPVL (G) NLLQVDL (N) NMPYSL
YKLQNRYGDVFSLQMAWKPMVVINRMKAMKEVLLTCGEDTADRPPVPIFEHLGFKPRSQGMIFAPYGPEWREQ
RRFSLSSLRNFGLGRKSLEEWVIKEAGHLCDAFTTQAGQYINPNTMLKK (A) TCNVIASLIFARRFEYED
PYLIRMLKVLEDSLTELSGLIPEVINTFPILLHIPRLAD 
(53 amino acid gap)
ENLRMVVIDLFTAGILTTSTTLSWALLLMILHPDVQRRVQQEIDEVIGQVRHPEMADQAHMPYTNAVIHEVQRFGDIVPLHLPRITSRDIEVQDFLIPKGTILLPNMSSVHMDDTVWEKPLRFHPEHFLDAQGHFVKHEAFITFSAG (R) RSCLGEPLARMELFLFFTCLLQRFSFSVPDGQPQPSDHRVF (F) IMVAPSPYQLCAVIREQGH*
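
To reuse an annotated sequence such as the one above, the parenthesized
residues have to be unwrapped and the spaces removed.  The following is a
minimal sketch, assuming the annotation always has the form "(X)" with a single
upper-case residue; flatten_annotated is a hypothetical helper name.  Gap lines
such as "(53 amino acid gap)" should be left out of the input, so positions are
counted within each contiguous piece rather than along the full protein.

```python
import re

# Either one residue wrapped in parentheses, or one plain residue / stop (*).
TOKEN_RE = re.compile(r"\(([A-Z])\)|([A-Z*])")


def flatten_annotated(text):
    """Unwrap "(X)" markers and drop whitespace.

    Returns the plain sequence and the 1-based positions (within that
    sequence) of the residues that were marked with parentheses.
    """
    residues = []
    marked_positions = []
    for marked, plain in TOKEN_RE.findall(text):
        if marked:
            residues.append(marked)
            marked_positions.append(len(residues))
        else:
            residues.append(plain)
    return "".join(residues), marked_positions


if __name__ == "__main__":
    seq, sites = flatten_annotated("FILLVDLM (Y) RRQSWTSCYPPGPVPWPVL (G) NLLQVDL")
    print(seq)
    print(sites)
```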

Cyp2d12-de1b5b6b7b mouse
           GenEMBL NT_039621.1 - strand
           detritus exons 1,5,6,7  fragments 7kb upstream of 2d12 
           v in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
44005713 M*LLTGTGLWPVAIFTIIFILLQDLMHHLKLWTSCYPPGTVPWPL 44005579
44003512 NTLPDSPAHPRVA*QVSPGTMTFLTMMDKLVTEQKRTWDPDHPLCNLTDAFLAEMEK 44003342
44003204 AKGSPQSSFKGANLCLVVLDQFDAGIVTTSITLT*GLLLTILNPRVQ 44003064
44002849 RVQQEINKVIGHV**PEMVDQDHMSYSNAVMYEVQHYADIITIPLAHKTFSDVEVQGSLITK 44002664

Cyp2d12-de5c6c7c mouse
           GenEMBL NT_039621.1 - strand
           detritus exons 5,6,7
           w in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
43998271          PRVA*QVSPGTMTFLTMMDKLVTEHKRTWDPGHPLCNLTDAFLAEMEK 43998128
43997989 AKGSPQSSFKGANLCLVVLDQFDAGIVTASITLTWGLLLTILHPGVQS 43997846
43997629 RVQQEINKVIGHVW*PEMVDQDRMSYSNAVMYEVQRYADIITIPLAHKTFSDVEVQGSLITK 43997444

Cyp2d13     mouse
            no accession number
            Negishi,M.
            submitted to nomenclature committee in 1990, but never published.
            no exact matches in the Genbank EST database as of 10/20/97
            sequence may be erroneous, or a rare transcript.

Cyp2d13     mouse
            No accession number
            Brian Libby
            partial Cyp2d13 gene sequence
            The top half of the sequence below is from Brian Libby
            This sequence matches Negishi's except at one amino acid
            shown in parentheses.  The bottom half is from EST BF533324
            Dr. Negishi's sequence called "ce" is complete, but still 
            unpublished. (see note to Cyp2d26)
Public Cyp2d13 seq from BF533324 EST and Brian Libby. One extra amino acid 
seen in EST BF533324 is shown as [D].  Two amino acids that do not agree 
are shown in ().  The EST sequence is given at the T and G sites.
MELLTGTGLWPVAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQVDLDNMPYSLYKL
QNRYGDVFSLQMAWKPVVVISGLKAVREVLVTCGEDTADRPEMPIFQHLGYGEKAKGVVF
APYGPEWRELRRFSVSTLRNLGLGKKSLEQWVTEEAGHLCDAFTAQAGSPLDPYTLLNKAVCNV
IASLIYARRFEYGDPDFIKMLKILKENMGENTGLFPE
(15 amino acid gap)
DKVFPGQKTFLTLVNKLVTEHKRTWDP [D] QPPRDLTDAFLAEMEKAKGNPKSSFNEANLRL
VVFDLFGAGIVTSSITLTWALLLMILHPDVQRRVQEEIDEVIGQVRCPEMADQAHMPYTNAVIH
EVQRFADIVPMNLPHKTSHDIEVQGFLIPKGTTLIPNLSS (T) LKDETVWEKPLRFHPEHFL
DAQGHFVKPEAFMPFSAGRRACLGEPL (G) RMELFLFFTCLLQRFSFLVPAGQPQPSDYGIF
TFLVSPSPYQLCAFTRDQATN*

Cyp2d13     mouse
            GenEMBL AC087902.4, EST BF533324, NT_039621.1 
NT_039621.1 - strand
44100884 MELLTGTGLWPVAIFTVIFILLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQVDLDNMPYSLYK 44100696
44099867 LQNRYGDVFSLQMAWKPVVVISGLKAVREVLVTCGEDTADRPEMPIFQHLGYGEKAK 44099697
44099412 GVVFAPYGPEWRELRRFSVSTLRNLGLGKKSLEQWVTEEAGHLCDAFTAQAG 44099257
44099169 SPLDPYTLLNKAVCNVIASLIYARRFEYGDPDFIKMLKILKENMGENTGLFPE 44099017
44098352 VLNTFPILLHIPGLADKVFPGQKTFLTLVNKLVTEHKRTWDPDQPPRDLTDAFLAEMEK 44098176
44098036 AKGNPKSSFNEANLRLVVFDLFGAGIVTSSITLTWALLLMILHPDVQ 44097896
44097675 RRVQEEIDEVIGQVRCPEMADQAHMPYTNAVIHEVQRFADIVPMNLPHKTSHDI 44097514
44097515 LEVQGFLIPK 44097486
44097091 GTTLIPNLSSALKDETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSAG 44096948
44095907 RRACLGEPLARMELFLFFTCLLQRFSFLVPAGQPQPSDYGIFTFLVSPSPYQLCAFTR* 44095731
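
Exon lines of the form "start peptide end" can be sanity-checked by comparing
the nucleotide span with three times the number of residues.  The sketch below
assumes both coordinates are written in full (not abbreviated to their last
digits) and that the span is inclusive; parse_exon_line and codon_remainder are
hypothetical helper names.  A remainder of 0 means one codon per residue; small
non-zero values can arise where a codon is split across an intron or where X
placeholders pad the peptide.

```python
import re

# "<start> <peptide> <end>", peptide made of residue letters and stop (*).
LINE_RE = re.compile(r"^(\d+)\s+([A-Za-z*]+)\s+(\d+)$")


def parse_exon_line(line):
    """Split one exon line into (start, peptide, end)."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError("not an exon coordinate line: %r" % line)
    return int(match.group(1)), match.group(2), int(match.group(3))


def codon_remainder(line):
    """Return the inclusive span in nt minus 3 nt per listed residue."""
    start, peptide, end = parse_exon_line(line)
    span = abs(start - end) + 1  # works for + and - strand entries
    return span - 3 * len(peptide)


if __name__ == "__main__":
    print(codon_remainder("44097515 LEVQGFLIPK 44097486"))
```

A large remainder usually points to a mistyped coordinate rather than to a real
feature of the gene.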

CYP2D14     bovine
            GenEMBL S45538 X68013 (1538bp) Swiss Q01361 (487 amino acids)
            PIR S29295 S37284 (500 amino acids) PIR S29862 (500 amino acids)
            Tsuneoka,Y., Matsuo,Y., Higuchi,R. and Ichikawa,Y.
            Characterization of the cytochrome P-450IID subfamily in
            bovine liver.  Nucleotide sequences and microheterogeneity.
            Eur. J. Biochem. 208, 739-746 (1992).

CYP2D14     Bos taurus (cow)
            See cattle page for details
MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPTPLPVLGNLLQVDFEDPRPSFNQ
LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPPAVYEHLGYGPRAEG
VILARYGDAWREQRRFSLTTLRNFGLGKKSLEQWVTEEASCLCAAFADQA
GRPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIIKLLDLTEDGLKEEFNLVRKV
VEAVPVLLSIPGLAARVFPAQKAFMALIDELIAEQKMTRDPTQPPRHLTDAFLDEVKE
AKGNPESSFNDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR
RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK
GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA
GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSEHGVFAFLVTPAPYQLCAVPR*

CYP2D15     dog
            GenEMBL D17397 (1665bp)
            Sakamoto,K., Kirita,S., (Aoyama,J., Baba,T. and Matsubara,T.)
            cDNA cloning and characterization of dog P-450 2D.
            Arch. Biochem. Biophys. 319, 372-382 (1995)
            check authors on paper
MGLLTGDTLGPLAVAVAIFLLLVDLMHRRRRWATRYPPGPTPVP
MVGNLLQMDFQEPICYFSQLQGRFGNVFSLELAWTPVVVLNGLEAVREALVHRSEDTA
DRPPMPIYDHLGLGPESQGLFLARYGRAWREQRRFSLSTLRNFGLGRKSLEQWVTEEA
SCLCAAFAEQAGRPFGPGALLNKAVSNVISSLTYGRRFEYDDPRLLQLLELTQQALKQ
DSGFLREALNSIPVLLHIPGLASKVFSAQKAIITLTNEMIQEHRKTRDPTQPPRHLID
AFVDEIEKAKGNPKTSFNEENLCMVTSDLFIAGMVSTSITLTWALLLMILHPDVQRRV
QQEIDEVIGREQLPEMGDQTRMPFTVAVIHEVQRFGDIVPLGVPHMTSRDTEVQGFLI
PKGTTLITNLSSVLKDEKVWKKPFRFYPEHFLDAQGHFVKHEAFMPFSAGRRVCLGEP
LARMELFLFFTCLLQRFSFSVPAGQPRPSDHGVFTFLKVPAPFQLCVEPR

CYP2D15      dog
             AB004268 
             Tasaki,T., Ito,S., Kamataki,T. and Fujita,S.
             unpublished

CYP2D15    Canis familiaris (dog)
           NW_876251.1:6772718-6776665
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           the dog genome has a seq gap between exons 3 and 4 
           with poor quality seq there. The C-terminal is also missing, 
           trust the mRNA seq for this CYP.

CYP2D16      guinea pig
             GenEMBL U21486 (1666bp)(500 amino acids)
             Jiang,Q. Voigt,J.M. and Colby,H.
             Molecular Cloning and sequencing of a guinea pig cytochrome P4502D     
             (CYP2D16): high level expression in adrenal microsomes.
             Biochem. Biophys. Res. Commun. 209, 1149-1156 (1995)

CYP2D17     Macaca fascicularis (cynomolgus monkey)
            GenEMBL U38218 (1494bp)
            Laddison,K.J., Speirs,A., Mankowski,D.C., Tweedie,D. and Lawton,M.
            Cloning, Sequencing and expression of the cynomolgus monkey liver 
            cytochrome P450 that is orthologous to human CYP2D6.
            ISSX abstracts number 367 (1995)
            94% identity to human 2D6

CYP2D17     Macaca fascicularis (cynomolgus monkey)
            GenEMBL ESTs BB889442, BB891868, BB878205, 
            BB889386, BB890418, BB890246, BB882021, BB881437
            L388 polymorphic with F 
            Three aa differ from U38218 (I297 = M in U38218,
            N337 = D in U38218, R426 = H in U38218) 
MELDALVPLAVTVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLG
NLLHVDFKNTPYCFDQLRRRFGNVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRP
PVPINQVLGFGPRSQGVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACL
CAAFTDQAGRPFRPNSLLDKAVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESG
FLREVLNAIPLLLRIPGLAGKVLRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFL
AEMEKAKGNPESSFNEENLRI VVADLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQE
IDN VIGQVRRPEMGDQARMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIELQGFL IPKG
TTLFTNLSSVLKDEAVWEKPFRFHPEHFLDAQGR FVKPEAFLPFSAGRRACLGEPLAR
MELFLFFTCLLQRFSFSVPAGQPRPSHHGVFAFLVTPSPYELCAVPR

CYP2D17     Macaca mulatta (Rhesus monkey)
            GenEMBL DR774034.1
            N-term EST
MELDALVPLAVTVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ
LRHRFGDVFSLQLAWTPVVVLNGLAAAREALVTCGEDTADRPPVPINQVLGFGPRSQGVFLAR

CYP2D17     Macaca nemestrina (pig-tailed macaque)
            GenEMBL CO774286.1 
            only 3 aa diffs with 2D17 M. fascicularis
MELDALVPLAVTVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ
LRRRFGNVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGFGPRSQGVF
LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQAGRPFRPNSLLDK
AVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLREVLNAIPLLLRIPGLAGKV
LRSQKVFLTQLDELLTEHRMTWDPXXPPRDLTEAFLGKMEKAKGNPE

CYP2D18X    rat
            GenEMBL U48219, S77859
            Kawashima,H. and Strobel,H.W.
            cDNA cloning of a novel rat brain cytochrome P450 belonging to the 
            CYP2D subfamily.
            Biochem Biophys Res. Commun. 209, 535-540 (1995)
            Kawashima,H., Sequeira, D.J., Nelson, D.R. and Strobel,H.W.
            Protein expression and catalytic activity toward imipramine
            N-demethylation of a novel rat brain cytochrome P450 CYP2D18.
            Biochem Biophys Res. Commun. submitted
            note: this gene was cloned and sequenced from two independent 
            libraries.
            This appears [not] to be a distinct gene from CYP2D4.
            note: 2D18 is an alternate splice of an untranslated exon of 
            the 2D4 gene.  The 5 aa diffs are allelic variation
            both haplotypes are found in the same library
            This gene can be distinguished from CYP2D4 as alternative splice
            variant CYP2D4_v2

CYP2D18X    rat
            GenEMBL U48219 S77859 
            ONLY 5 AA DIFFS to 2D4 
            Chr7: 120781146-120776576 (- strand)
            note: 2D18 is an alternate splice of an untranslated exon of 
            the 2D4 gene.  The 5 aa diffs are allelic variation
            both haplotypes are found in the same library
            This gene can be distinguished from CYP2D4 as alternative splice
            variant CYP2D4_v2

CYP2D19     Callithrix jacchus (white-tufted-ear marmoset)
            GenEMBL D29822
            Igarashi,T., Sakuma,T., Isogai,M., Nagata,R. and Kamataki,T.
            Marmoset liver cytochrome P450s: study for expression and molecular
            cloning of their cDNAs
            Arch. Biochem. Biophys. 339 (1), 85-91 (1997)
            91% to 2D17, 90% to 2D42

CYP2D20     hamster
            T. Sakuma 
            95% identical to CYP2D27

CYP2D20     Syrian hamster
            no accession number
            Kouichi Kurose
            submitted to nomenclature committee 7/13/99
            clone name SH2D3
            1 amino acid diff with Sakuma's sequence

CYP2D21     Sus scrofa (miniature pig)
            GenEMBL D89502 
            Sakuma,T., Shimojima,T., Miwa,K. and Kamataki,T.
            Cloning CYP2D21 and CYP3A22 cDNAs from liver of miniature pigs
            Drug Metab. Disp. 32, 376-378 (2004)
            8 amino acid differences to CYP2D25

Cyp2d22     mouse
            no accession number
            J. Leonard and N. Blume
            submitted to nomenclature committee
            88% identical to rat 2D4

Cyp2d22     mouse
            GenEMBL AF221525 NM_019823 frameshift x2 in exon 6, NT_039621.1  
NT_039621.1 - strand
43812601 MRLPTGAELWPIAIFTVIFLILVNLMHWRQRWTAHYPPGPMPWPVLGNLLHMDFQNMPAGFQK 12413
43811089 LRGRYGDLFSLQLASESVVVLNGLTALREALVKHSEDTADRPPLHFNDLLGFGPRSQ 10919
43810677 GIVLARYGPAWRQQRRFSVSTMHHFGLGKKSLEQWVTEEARCLCAAFADHTG 10522
43810448 PFSPNTLLDKAVCNVIASLLYACRFEYDDPRFIRLLGLLKETLKE 10314
43809907 FLNVFPMLLRIPGLVGKVFPGKRAFVTMLDELLAEHKTTWDPTQPPRDLTDAFLAEVEK 9731
43809546 AKGNPESSFNDE 9511
43809509 NLRTVVGDLFSAGM 9468
43809466 VTTSTTLSWALMLMILHPDVQ 9404
43809193 RVQQEIDEVIGQVQCPEMADQARMPYTNAVIHEVQRFADILPLGVPHKTSRDIELQGFLIPK 9008
43808581 GTTLITNLSSALKDETVWEKPLCFHPEHFLDAQGHFVKPEAFMPFSA 8441
43808344 GRRSCLGEPLARMELFLFFTCLLQRFSISVPDGQPQPSDHGVFRALTTPCPYQLCALPR 8168

CYP2D23    rabbit
           no accession number
           Yukio Yamamoto 
           submitted to nomenclature committee
           Clone name rabbit 2D/Clone I

CYP2D24    rabbit
           no accession number
           Yukio Yamamoto 
           submitted to nomenclature committee
           Clone name rabbit 2D/Clone II

CYP2D25    Sus scrofa (pig)
           GenEMBL Y16417, NM_214394
           Postlind, H., Axen, E., Bergman, T. and Wikvall, K. (1997)
           Cloning, structure and expression of a cDNA encoding vitamin D3 25-hydroxylase.
           Biochem. Biophys. Res. Commun. 241, 491-497.
           note: this is a microsomal enzyme different from the mitochondrial CYP27
           which also has vitamin D3 25-hydroxylase activity.

Cyp2d26      mouse 
           GenEMBL NT_039621.1 - strand
           68 ESTs see UNIGENE Mm.29064
MGLLVGDDLWAVVIFTAIFLLLVDLVHRRQRWTACYPPGPVPFPGLGNLLQVDFENIPYS
FYKLQNRYGNVFSLQMAWKPVVVVNGLKAVRELLVTYGEDTSDRPLMPIYNHIGYGHKSK
GVILAPYGPEWREQRRFSVSTLRDFGLGKKSLEQWVTEEAGHLCDAFTKEAEHPFNPSPL
LSKAVSNVIASLIYARRFEYEDPFFNRMLKTLKESLGEDTGFVGEVLNAIPMLLHIPGLP
DKAFPKLNSFIALVNKMLIEHDLTWDPAQPPRDLTDAFLAEVEKAKGNPESSFNDKNLRI
VVIDLFMAGMVTTSTTLSWALLLMILHPDVQRRVHQEIDEVIGHVRHPEMADQARMPYTN
AVIHEVQRFADIVPTNLPHMTSRDIKFQDFFIPKGTTLIPNLSSVLKDETVWEKPLRFYP
EHFLDAQGHFVKHEAFMPFSAGRRSCLGEPLARMELFLFFTCLLQRFSFSVPDGQPRPSD
YGIYTMPVTPEPYQLCAVAR

Note: Brian Libby (bjl@jax.org) at The Jackson Laboratory has given his 
permission to post sequence data he has on the 2d26 gene and a partial Cyp2d13 
gene from mouse.  He will make the BAC clone available to anyone who wants it.  
The BAC has at least two and maybe more P450 sequences.  I am putting a link to 
a pdf version of the 2D26 gene sequence file here.  It is color coded with 
additional information, such as sequencing primers and restriction sites. 
CYP2D26 gene sequence

Cyp2d26-de1b7b8b mouse
           GenEMBL NT_039621.1 - strand
           10kb upstream of 2d26, exon 1 aa 1-19, 36-57, exon 7,8
           on the edge of the mouse 2d cluster
           s in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
NT_039621.1 - strand
44262890 MGLQTGLWPMVISTALFCM 44262834
44262801 YPPSPVPLPELGSLLQVKFENM 44262736
44260947 GHVQKETDGIMGQVWLPQMSHQACMSFT 44260864
44260862 NAMIREV*HFRDTILVNLSHVTFCEIEI*GFXXXX 44260770
44260251 XXXXLITNLSLVLKNEITWEMPSPTPS*TFLESEGHLMKQETFMPXXX 44260129

CYP2D27    Syrian hamster
           no accession number
           Kouichi Kurose
           95% identical to CYP2D20
           submitted to nomenclature committee 6/29/99

CYP2D28    Syrian hamster
           no accession number
           Kouichi Kurose
           71% identical to CYP2D27 73% to CYP2D20
           clone name SH2D2
           submitted to nomenclature committee 7/13/99

CYP2D29    Macaca fuscata (Japanese monkey)
           GenEMBL AF301911 (release date March 1, 2001)
           Shizuo Narimatsu, Hiroyuki Hichiya, Shigeo Yamamoto, Kazuo Asaoka
           Submitted to nomenclature committee Oct. 16, 2000
           95% to CYP2D6

CYP2D30    Callithrix jacchus (white-tufted-ear marmoset)
           GenEMBL AY082602 
           Hichiya,H., Yamamoto,S., Asaoka,K. and Narimatsu,S.
           Complementary DNA cloning and characterization of a cytochrome P450
           2D enzyme from Marmoset monkey liver
           Unpublished
           submitted to nomenclature committee 3/5/02
           33 differences to 2D19 also from marmoset.
           93% to 2D19, 91% to 2D29, 90% to 2D17

CYP2D31P   human
           NT_022676.10|Hs3_22832 chromosome 3 
           2D6 pseudogene fragment I-helix
899650 NQENLV*VVIDLFLGGTDTTATTLCWALIHMIQHGAVQ 899537

Cyp2d32-ps mouse
           GenEMBL XM_194978, NT_039621.1
           exons 4,5,6,7,8,9 NT_039621.1 + strand (vvp = old temp. name)
43898939 AMSPHNPNHLLDKAICNVIASLIYACRFKYGDPDIIK 43899049
         ILKVLKESM*KKIVFIPD 
43899746 VLNIFPIVLSISGLGDKVLPGKKVSLAIVDKMLTDXXX 43899850
43899865 TWDPD*SHCDLTDAFLAEMEQ 43899927
43900101 LHLLILHLLGAGIVMSSVTLTWTLLLMI*NPDVQ 43900202
43900439 XXXXEIDKVIGQVWHPEMADQVLMPFTNAVIHEVKCSEDITAMALPHRNSLHSNVQGFLIPK 43900612
43901007 GKSLITNLSSELKDEAIWEKPLCFHPEYFLDAKGHFV*HEPFMAFSE 43901147
43901248 GHQACLREPLACMELFLFFTFLLQRFSFSMSDGQPLPSEYSIYAMPVTPEPCQFCAVVQYQG 43901433

Cyp2d33-ps mouse
           GenEMBL NT_039621.1
           exons 4,5,6,7,8,9 NT_039621.1 + strand 3kb downstream of 2d12
44019279 XXXNPYHLLDKAVCNVIPSLIYACCFNYGDPDNRMLKLLKKKSMKKKIGFISD 44019428
44020071 VLNTFPTLLGISGLAEKVFSGQKTSFTIVNKMFTEH 44020178 
44020190 DPDQPPRDLTDAFLAEMEK 44020246
44020381 AKGNSERSFREPNLYLIILDLLGPGIVTSLVTLTWSLLLVIQQPDVQ 44020521
44020745 XXXXEIDKVIG*VWHPEMAD*ILMPFTNVVIHEVKRFEDITAMVLPQRTSPDIDVHGF 44020906
44022181 XXXLIPDLSSMLKDETVWEKPLHFHPKNFLDAQGHFL*FEAFMPFSEG 44022315
44022418 QACLGQPLDQIVLFLFITCLLQCFSFSLPKGQPPPSD*GIYAMPVTPAPSQLCAVVVR*EEQWH 44022609

Cyp2d34   mouse 
          GenEMBL NT_039621.1
          85% to 2d10 87% to 2dww/2d11 NT_039621.1 - strand
          old temp. name = tt
44079756 MELLTGTGLWSVAIFTVIFLILVDLMHRRQHWTSRYPPGPVPWPVLGNLLQVDLDNIPYSLYK 44079568 
44077878 LQNRYGDVFSLQMAWKPVVVINGLKAMQEVLLTCGKDTADHPPVPIFEYLGFKSKSQ 44077708
44077439 GVVLASYGPEWREQRQFSVSTLRNFGLGKKSLEEWVTKEAKHLCDAFTARAG 44077284
44077192 QSINPNTMLNNAVCNVIASLIFARRFEYEDPFLIRMLKMREESLKEVTGFIPG 44077037
44076407 VLNTFPILLRIPGLADMVFQSQKTFMAILDNLVTENRTTWDPDQPPRNLADAFLAEIQK 44076231
44076048 AKGNPESSFNDENLCMVVSDLFTAGMVTTSTTLSCALLLMILHPDVQ 44075908
44075711 RRVQQEIDAVIGQVRCPEMADQARMPYTNAVIHEVQRFGDIIPLNIPRITSRDIEVQDFLIPK 44075523
44075229 GTILIPNMSSMLKDETVWEKPLRFYPEHFLDAQGHFVKPEAFMPFSAG 44075086
44074985 RRSCLGEPLARMELFLFFTCLLQRFSFSVPAGQPQPSDHRIFAIPVAPYPYQVCAIMREQGH* 44074797

Cyp2d34-de1b2b7b8b mouse
           GenEMBL NT_039621.1
           detritus exons 1,2,7,8 about 4 kb downstream of 2d34 
           u in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
NT_039621.1 - strand
44070344 MELLTGTGL 44070318
44070324 WPVAIFTVIFILLVDLMHRHQHWTSRCPPGPVPWPVLGDLLQVNVYNIPYSLYK 44070163
44069514 LKKSCGDMFSLHMGWKPMVMIKGLKSVQDVLVTCGEDTADCPKIPVFHYI 44069365
44067376 QVQKEIDKVIGQVWHPEMADLGLMPFKKSVIHEVHHFADITAIP 44067245
44066770 QGKSFIPNLCSMLKDETVWEKPLHFHPKHFLDAQGHFVKHEVFMPFSAG 44066624

Cyp2d35-ps mouse
           GenEMBL NT_039621.1
           This seq was assembled from several smaller pieces found earlier
NT_039621.1 - strand
44113633 VIWLLTGTGL 44113604
44113610 WPVAIFTVIFILLVDLIHLCQHWTSCYPPGPVPCPVLGNLLQVDLYNMPYSLYK 44113449
44112585 MFSLQMVWKPMVLIKELKSVQDVLVTCGGGTVDRPEIPIFHHIGCGPKAK 44112436
44112148 XXLLASYGPEW*EQRPFSVSILCNFSQGKKFLEQSVTDEAGHICDTFTAQAG 44111999
44111917 SPLKPYTLLDKTLCNVIVSLIYAHRFKYGGPDIIKMLKVLKDNMGGKIGLIPE 44111759
44111115 VLNTFPVLLHIPGLADKVFPGKKTFLTIMDKLVTEHKKIWDLYQPSCDLTGAFLAEMEK 44110939
44110801 AKGNPESSFRESNLCLVVLDLLGDGIVTSSVTLTWGLLLTILHLDVQ 44110661
44110375 MPYTNAVIHEVPCYDDIIPIFLPHRTSSDVEMQDFLITK 44110259
44109226 SVLNDETVWEKSLCFLPDHFLDAQGNFVKPEAFMPFSAG 44109110
44109006 XQACLREPLAHMELFLFFTCLLQHFSFSVPAGQPLLSDYGIYTMPVSPEPYQLCAVVC* 44108833

Cyp2d36-ps  mouse
           GenEMBL NT_039621.1
NT_039621.1 - strand
44142171 MELLTETDLWPVAIFTVIFILLVELMHQCQR*TSFYTPGPVPWPLLGNLLQVDLDNMPYSLYK 44141983
44141174 NHYGDMSSLHMG*KSMVVISGLKAVQDVLVTC 44141079
44139955 GEDTTDCPEIPIFQHIGCGPKAK 44139887
44139615 GVVPAPYGLEWQEQR*FSVSTLCNFGL 44139535
44139535 GKKSLKQWVMEEAGH 44139491
44139399 SPLNPFPLLDKAGLNVSASLIYAHCFE*EDPVIIKMLTVLRK 44139274
44139026 VLNTFSIPLHIRGLADKAFPVQKTFLTIVDKMLTEHKRT*DPDKPP*DLIDAYLAKMKK 44138850
44138722 XXGNPESSFNETNLXX 44138687
44138681 VVLDQLGARIMTISITLT*VLLLMILHPHVQ 44138589
44138362 VGQYINKVISQVWHSGMADQGLMPFINVVIHEVQHFADIIAIPLPHRTSPDIKVLGSLIPK 44138180
44130610 GMNLIPNLSSVFKDNTVWEKPFCFHPEQFLDAQGHFVKHKAFMPFSAG 44130467
44130363 XQACLGDPLACMELFLFFTCILQRFSFSVPAGQPLHSDYGIYAMPVTPEPCQFCLV 44130199

Cyp2d37-ps mouse
           GenEMBL NT_039621.1
           Old temp name = hhp, 3 frameshifts and a stop codon 81% to 2d13 
NT_039621.1 - strand
44151915 MELLTGTGLWPVVIVTVIFILLVDMLHRCQRWTSCCPPDPVPWPVLGNLLQVDLDNMPYNLYK 44151727
44150957 LHNRYGDVFSLQMGWNHMAVINGLKVIQEVLVTCGEDTADRPEMPIFPHLGYGQKAK 44150787
44150509 GVVLAPYGPEWKEQR*FSASTLCNFSLGKKSLEQWVMEEVGHLFDVFTAHA 44150357
44150275 GSPLNPYPLLDKAVCNVIVSLIYAHRFEYGDPDFIKMLKVLKENMGENIGLFSE 44150114
44149452 VLNTFPILLRIPGLADKVFPGQKTFLIMVDKLVTEHKRTWNSDQPPRDLTDAFMAEMEK 44149276
44149137 AKGNPESSFNDANLCLVVLDLLGAATVTTSTTLSWALLLMILHPDVQ 44148997
44148774 QVQQEIDEVIWYVWLPEMADQVCMPFTNAVIHEVQ 44148670
44148653 XXXDIIPITLPHRTSRDIEVWGFLIPK 44148582
44148149 GMTLISNLF 44148123
44148124 SVLKDETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSA 44148011
44147914 GHRSCLGEPLALMELFLFFTCLLQRFSFSMPAGQSLPSDYGIYTMPVTPAPYQLCAVV 44147741

Cyp2d38-ps mouse 
           GenEMBL XP_194978, LOC271298 chr 15 XM_194978, NT_039621.1 - strand
44166184 PVAIFTVILILLVNLMHRLQCWTSRYPPGPVPWLVLGNLLQADLHNMTYNLYK 44166026
44165213 LQNWCGDVFSLQMISKPVVVIKGLNAVGE 44165127 
44165125 LLVSCGEGTAEWPEIPIFHHIVCGPKTK 44165042
44164762 GVILAP*GCEWREQR 44164718
44164722 RGSVSILCNFSLGKKSLEQCVMEKAGHICDAFTVQAG 44164612 
44164557 SSLNPLSLLDKSLCNVVAYLIYA 44164489

Cyp2d39-ps mouse 
           GenEMBL NT_039621.1 
           Old temp name jj 
           Cyp2d26 like pseudogene exons 4,5,6,7,8(partial),9 
NT_039621.1 - strand
44178330 FDYGDPDIIKMLKALKENKGEKIGMIPH 44178247
44177610 VLNTFPILLHILELADKVFPGQKT 44177539
44177539 ILTMVDKLVIAHKRTGDCEKPHQELTD 44177459
44177454 AFLAEREX 44177434
44177299 AKGNPESSFNDANLCLVVLDLFGGGILTSSITLTWAL*LVILHP 44177168
44176934 RVQQDEVIVHVW*PKMANQANMSYSNAAIHEIQCYADIIPIHLPDRTSLDI*VQGFLLPK 44176755
44176344 GTKIIPNLSSVI 44176309
44175091 GHQVCLGEPLASMELFLFFTCLLQCFSFLVPTG*PQPSNYGIYAMPVTPEPYQLCAVV 44174918
44175055 MELFLFFTCLLQCFSFLV 44175002 note 9kb from rest of N-term at 2d32p

Cyp2d40 mouse
           GenEMBL NT_039621.1 
           Old temp name = rr 84% to 2d13 
NT_039621.1 - strand
44223024 MELLTGTDLWPVAIFTVIFILLVDLLHRRQRWTSRYPPGPVPWPVLGNLLQVDLDNMPYSFYK 44222836
44222037 LQNHYGDMFSLQMGWNAMVIVNGLKAVQEALVTCGEYTADRPEMPIFPHLGYGQKDK 44221867
44221588 GLVLAPYGPEWQEQRRFSMSTMRNFGLGKKSLEQWVTEEAGHLCDAFTDQA 44221436
44221354 GSPLNPYTLLNKAVCNVIASLIYAHRFKYKDPDFIKMLKVLKENTREKIGLIPE 44221193
44220527 VVKMFPIVLRIPGLADKIFPGQKTFLTMVDKLVTEHKRTWDPDQPPRDLTDAFMAEMET 44220351
44220212 AKGNPESSFNEANLRLVVLDLFGGGIVTTSATLTWALLLMILHPDVQ 44220072
44219854 RRVQEEIDEVIGQARRPEMADQARMPYTNAVIHEVQRFADIAPMTLPHRTSCDIEVQGFLIPK 44219666
44219246 GTTLICNLSSVLKDETVWEKPLRFYPEHFLDAQGHFVKPEAFMPFSA 44219100
44218999 GRRACLGEPLVRMELFLFFTCLLQRFSFSVPDGQPLPSDYGIYSMVVSPAPYQLCAVVR* 44218820

Cyp2d40-de7b9b mouse
           GenEMBL NT_039621.1
           detritus exons 7,9 fragment NT_039621.1 - strand
           t in Figure 5B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
44201031 VQQEINKFIGQVWRPETAVIHEVQCFANITPITLPHRTSCDIEVQGFLTPK 44200879
44200789 PSDYGIYSMPVTLEPYQLCVVVQ 44200721

Cyp2d41-ps mouse
           GenEMBL NT_039621.1
           old temp name = ssp, 82% to 2d13 one stop codon possible pseudogene NT_039621.1 - strand
44241024 MELLTGTDLWPVAIFTVIFILLVDLMHRHQRWTSRYPPGPVLWPVLGNLLQVDLDNMPYSLYK 44240836
44240062 LQNRYGDVFSLKLGRNPMVIVNRLMAVQEVLVTCGENTADRPEMPIFLPPSNGQKAK 44239892
44239602 GLAFAPYGPEWQEQKRFSMSTLRNFGLGKKLLEQ*MTKEAGHLCDAFTAQA 44239450
44239368 GSPLNPYTLLEKAMCNVIASLVYAHCFEYEDPDCIKMLRALKEYMIEKIGLIPEV 44239204
44238543 VKMFPIVLRIPGLADKIFPGQTTFLTMVDKLLTEHKRTWDPDQPPRDLIDAFLAEMEK 44238370
44238242 AKGNPESSFNEANLRQIVLDLFGAGTAPTSTTLSWALLLMILHPDVQ 44238102
44237884 SLVQEEIDEVIGQARRPEMADQARMPYTNAVIHEVQRFADIAPMTLPHRTSCDIEVQGFLIPK 44237696
44237268 QGTTLIPNLSSVLKDETVWEKPLRFHPEHFLDAQGHFVKPEAFMPFSA 44237125
44237024 GRRSCLGESLARMELFLFFTCLLQRFSFSVPDGQPQPSDYGIYSILVSPAPYQLCAVVR 44236848

CYP2D42     Macaca mulatta (rhesus monkey)
            No accession number
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            93% to CYP2D6, probable ortholog of CYP2D6

CYP2D43     Bos taurus (cow)
            See cattle page for details
            94% to 2D14 cow
5681 MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPMPLPVLGNLLQVDFEDPRPSFNQ
     LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPQALYKHLGFGPRAEG 6760
7291 VILARYGNAWREQRRFSLSTLRNFGLGKKSLEQWVTEEASCLCAAFADQA 7449
7550 GHPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIVKLLDVMEDGLKEEMKIMRQV 7714
8109 VEAVPVLLSIPGLAAKVVPGQKAFMTLVDELIAEQKMTRDPTQPPRHLTDAFLDEVKE 8288
     AKGNPESSFSDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR 8591
8806 RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK 8985
9424 GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA 9603
     GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSDHGVFVALVTPAPYQLCAVPR 9843

CYP2D44     Macaca fascicularis (cynomolgus monkey)
            No accession number
            ESTs BB890306, BB877128, BB888901, BB887284, BB877988, BB881640
            Yasuhiro Uno
            Submitted to nomenclature committee 9/29/2005
            93% to M. mulatta 2D42, 92% to 2D17 M. fascicularis, 91% to 2D6
            differs from 2D17, another cynomolgus seq.
            complete sequence

CYP2D45v1  Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP2D45v2  Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP2D46    Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP2D47    Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP2D48    Xenopus laevis
           GenEMBL BC077934  
           56% to chicken 2D49
MSLLSQLCSFAFGCNVFTLGIICTLCLLLLDYMKRKKPCKNFPP
SPPSKPFVGNLLQLNFRNLNNSFKQLSKQYGDVMSLQVFWKPVVVLNGLEVMKEALIQ
KSEDTADRPEFHVLEILGFVGNNKAVVLANYGQSWKDLRRFTLSTLRDFGMGKKSLEE
RVREEAGYLCAAFQSEQGRPFDPHILLNTAVSNVICSIIFGERFEYDDHTFQKLLCLI
EESVKAESGAVPQIIASLPWSSKIPGLAKMFFQPRIRMLKYLQEIIKDHQQTWDSGHT
RDFIDAFMLEMEKAKGVKDSNFNEQNLLLTTADLFSAGSETTNTTLRWGLLFMLLYPN
VQRKVHEEIDHVIGRTRKPTMGDVLQMPYTNAVIHEIQRYVDIVPLSVPHMTYRDTHI
QGFFIPKGVTIMTNLSSVLKDEKAWEKPFQFYPEHFLDRDGKFVKREAFMAFSAGRRV
CLGEQLARMELFLFFTTLLQRFSFQIPNGEPSPREDPVFVFLQLPHDYKMCAKVR

CYP2D49    Gallus gallus (chicken)
           chr1:46131304-46140141
           ENSGALT00000019412.2
MTLLLWLSSWSNISVLGVFLTVFTILVDFMKRRKKWSRYPPGPMPLPFVG
TMPYVNYYNPHLSFEKFRKKFGNIFSLQNCWTNVVVLNGYKTVKEALVNK
SEDFADRPYMPVYEHLGYGHKSEGLVLARYGHLWKELRKFTLTTLRNFGM
GKKSLEERVTEEAGFLCSAISSEGGHPFDPRFLVNNAVCNVICTITYGER
FDYGDKTFKKLLTLFENSLNEEAGFLPQLLNVAPVLLRIPGLPQKIFPCQ
KAYVDFTQMLIDKHKETWNPAYIRDFTDAFLKEMAKGKEAEENGFNKSNL
TLVTADLLVAGSETTATTLRWAFLFMLLYPEIQSKVHKEIDKVIGRNRPP
TMADQVNMPYTNAVIHEVQRFGDVVPMGLPHMTYRDTELQGFFIPKGTTI
ITNLTSVLKDETAWKKPNEFYPEHFLNENGQFVRPEAFLPFSAGRRACLG
EQLTRMELFIFFTTLMQKFTFVFPEDQPRPREDSHFAFTNSPHPYQLRAV
PSITQDQGK

CYP2D50    horse
           No accession number
           Heather Knych
           Submitted to nomenclature committee Oct. 3, 2007
           80% to cattle CYP2D14 and CYP2D43

Cyp2d-se1[1:8:9] mouse
           GenEMBL NT_039621.1
           old temp name = xxp 
           about 400,000 bp from the main Cyp2d cluster
           + strand solo exons 1,8(partial),9  frameshift in exon 1 
           ortholog to CYP2D-se2[9] rat
43401344 MGLLTS 1361
43401361 LLSVAIFAAIFLLLVDIMQRCQCWATCYLLLLDFQNMPYSLYK 1489
43402076 EETVWEKPLRFHPELFLDAQGHFVKPEAFMPFSA 2177
43402729 GHRSCLGEPLACMKLFLFFTCLLQRFSFSVPDGQPQPSNCGVFPFLVAPSLYQLCAVLLKQGH 2917

CYP2D-se2[9] rat
             UCSC browser chr7:120386407-120386565  
             exon 9 (+ strand) 73% to 2D3
             ortholog to Cyp2d-se1[1:8:9] mouse
ACLGEPLTCMELFLFFICLLQSFSFSVKAGQPRPSNHGIFEMPISPSSYQLCA

2E Subfamily

CYP2E1      human
            PIR A60554 (18 amino acids)
            Robinson, R.C., Shorr, R.G.L., Varrichio, A., Park, S.S.,
            Gelboin, H.V., Miller, H. and Friedman, F.K.
            Human liver cytochrome P-450 related to a rat
            acetone-inducible, nitrosamine-metabolizing cytochrome
            P-450: identification and isolation.
            Pharmacology 39, 137-144 (1989)

CYP2E1      Macaca fascicularis (cynomolgus monkey)
            No accession 
            Wu Zhicong
            Submitted to nomenclature committee 10/30/2006
            Only 3 aa diffs to CYP2E1 Macaca mulatta (rhesus monkey)
            Note: the 2E1 seq from 1992 S55205 differs from this
            seq at 12 amino acids and a frameshifted region, but this 
            seq matches rhesus monkey at 9/11 sites so this seq is probably more 
            accurate.  One site is not included in the shorter S55205 seq.

CYP2E1      Macaca fascicularis (monkey)
            GenEMBL S55205 (1508bp) Swiss P33266 (449 amino acids)
            PIR S28167 (449 amino acids)
            Komori,M., Kikuchi,O., Sakuma,T., Funaki,J., Kitada,M.
            and Kamataki,T.
            Molecular cloning of monkey liver cytochrome P-450 cDNAs:
            similarity of the primary sequences to human cytochromes P-450.
            Biochim. Biophys. Acta 1171, 141-146 (1992)

CYP2E1      Macaca mulatta (rhesus monkey)
            NM_001040213
            Brian A. Carr, Merck & Co. Inc.
            Submitted to nomenclature committee 4/22/2004
            94% to CYP2E1, ortholog of CYP2E1

CYP2E1      Mesocricetus auratus (hamster)
            GenEMBL D17449 (2512bp)
            Sakuma,T., Takai,M., Yokoi,T. and Kamataki,T.
            Molecular cloning and sequence analysis of hamster CYP2E1
            Biochim. Biophys. Acta 1217, 229-231 (1993)

CYP2E1      hamster
            PIR S27176 (34 amino acids)
            Puccini, P., Menicagli, S., Longo, V., Santucci, A. and Gervasi,P.G.
            Purification and characterization of an acetone-inducible
            cytochrome P-450 from hamster liver microsomes.
            Biochem. J. 287, 863-870 (1992)

CYP2E1      rat
            GenEMBL S48325 (1093bp)
            Richardson,T.H., Schenkman,J.B., Turcan,R., Goldfarb,P.S.
            and Gibson,G.G.
            Molecular cloning of a cDNA for rat diabetes-inducible
            cytochrome P450RLM6:hormonal regulation and similarity to 
            the cytochrome P4502E1 gene.
            Xenobiotica 22, 621-631 (1992)

CYP2E1      rat
            PIR B27425 (34 amino acids)
            Favreau, L.V., Malchoff, D.M., Mole, J.E. and Schenkman, J.B.
            Responses to insulin by two forms of rat hepatic microsomal
            cytochrome P-450 that undergo major (RLM6) and minor
            (RLM5b) elevations in diabetes.
            J. Biol. Chem. 262, 14319-14326 (1987)

CYP2E1      rat
            GenEMBL AF061442
            Yoo,M. and Shin,S.W.
            The complete coding sequence of the rat brain cytochrome P450 2E1
            Unpublished

Cyp2e1      mouse
            GenEMBL L11650 (1827bp) Swiss Q05421 (493 amino acids)
            Davis,J.F. and Felder,M.R.
            Mouse ethanol-inducible cytochrome P450 (P450IIE1).
            Characterization of cDNA clones and testosterone 
            induction in kidney tissue.
            J. Biol. Chem. 268, 24933-24939 (1993)

Cyp2e1      mouse
            PIR A21231 (39 amino acids)
            Ryskov, A.P., Ivanov, P.L., Kramerov, D.A. and Georgiev, G.P.
            Mouse ubiquitous B2 repeat in polysomal and cytoplasmic
            poly(A)+RNAs: unidirectional orientation and 3'-end localization.
            Nucleic Acids Res. 11, 6541-6558 (1983)
            C-terminal 39 amino acids

CYP2E1v1    dog
            no accession number
            Susan M. Lankford and Stephen A. Bai
            submitted to nomenclature committee

CYP2E1v2    dog
            no accession number
            Susan M. Lankford and Stephen A. Bai
            submitted to nomenclature committee
            note: only one amino acid difference with 2E1v1

CYP2E1     Canis familiaris (dog)
           NW_876287.1: 395882-405665
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           77% to human CYP2E1
MAALGITVALLVWMATLMLISIWKQIYSRWKLPPGPFPLPIIGNILQVDIKNVPKSLAKLAEQYGPVFTLYLGSQ
RTVVLHGYKAVKEVLLDHKNDLSGRGEVFAFQSHKDRGITFNNGPGWKDTRRLSLSTLRDYGMGKRGNEERIQRE
IPFLLEALRGTRGQPFDPTFLLGFAPFNVIADILFHKHFDYSDQTGLRIQKLFNENFHLLSTGWLQLYNIFPSYL
HYLPGSHRKVLRNVAELKDYSLERVKEHQESLDPTCSRDFTDCLLQELQKERYGTEPWYTLDNIAVTVADLFFAG
TETTSTTLRYGLLILMKYPEVEEKLHEEIDRVIGPSRVPAIKDRLEMPYMDAVVHEIQRFIDLLPSNLPHVANQD
TMFRGYVIPKGTVVIPTLDSVLFDKQEFPDPEKFKPEHFLNENGKFKYSDYFKAFSAGKRVCVGKSLARMELFLF
LSAILQHFNLKSLVDPKDIDLSPCTIGFAKIPPHYKLCVVPRSG*

CYP2E2      rabbit
            GenEMBL J03726 (multiple genomic fragments)
            GenEMBL M19162 (multiple genomic fragments)
            GenEMBL M19163 (multiple genomic fragments)
            Khani,S.C., Porter,T.D., Fujita,V.S. and Coon,M.J.
            Organization and differential expression of two highly similar
            genes in the rabbit alcohol-inducible cytochrome P-450 subfamily
            J. Biol. Chem. 263, 7170-7175 (1988)

CYP2E1      Sus scrofa (pig)
            GenEMBL AB000885.1
            Kimura,M., Kawakami,K., Suzuki,H. and Hamasima,N.
            Cloning of the pig cytochrome P-450-j gene
            Unpublished

CYP2E1      Sus scrofa (pig)
            GenEMBL AB052259
            Misaki Kojima
            2 amino acid differences with AB000885.1
            Submitted to nomenclature committee Oct. 27, 2000
            clone name c469

CYP2E1      Bos taurus (cow)
            GenEMBL AJ001715
            van Raak,M., Natsuhori,M., Ligtenberg,M., Kleij,L., ten Berghe,D.,
            de Groene,E.M., Van Miert,A.S., Witkamp,R.F. and Horbach,G.J.
            Isolation of a full length cytochrome P450 (CYP2E) cDNA sequence
            and its functional expression in V79 cells
            Unpublished
            79% to human 2E1
MAALGITVALLVWMATLLFISIWKHIYSSWKLPPGPFPLPIIGNLLQLDIKNIPKSFTR
LAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNN
GIIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQ
GQPFDPTFVVGFAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQ
LYNNFPDYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEM
AKERHSVDPMYTLENIAVTVADLLFAGTETTSTTLRYGLLILMKYPEVE
EKLHEEIDRVIGPSRIPAVKDRLDMPYLDAVVHEIQRFIDLLPSNLLHEATQDTVFRGYVIPK
GTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSA
GKRVCVGEGLARMELFLLLAAILQHFNLKSLVDPKDIDLSPIAIGFGKIPPRYKLCLIPRSKV*

CYP2E1       horse 
             No accession number
             Heather Knych
             Submitted to nomenclature committee Oct. 17, 2007

CYP2E1     Balaenoptera acutorostrata (Minke whale)
           No accession number
           Iwata Hisato
           submitted to nomenclature committee 1/6/05 
           84% to CYP2E1 cow, 76% to CYP2E1 human

2F Subfamily

CYP2F1      human
            GenEMBL J02906
MDSISTAILLLLLALVCLLLTLSSRDKGKLPPGPRPLSILGNLL
LLCSQDMLTSLTKLSKEYGSMYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYP
AFFNFTKGNGIAFSSGDRWKVLRQFSIQILRNFGMGKRSIEERILEEGSFLLADVRKT
EGEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGELYD
ILDPRFPSLLDWVPGPHQRIFQNFKCLRDLIAHSVHDHQASSPRDFIQCFLTKMAEEK
EDPLSHFHMDTLLMTTHNLLFGGTKTVSTTLHHAFLALMKYPKVQARVQEEIDLVVGR
ARLPALKDRAAMPYTDAVIHEVQRFADIIPMNLPHRVTRDTAFRGFLIPKGTDVITLL
NTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAGRRLCLGELLARMELFLYL
TAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLRPR

CYP2F1      Bos taurus (cow)
            See cattle page for details
LSKEFGAVYTVYLGPRRVVVLSGYQAVKEALVDQAEEFGGRGDYPVFFNFTKGN
GIAFSNGDRWKVLRKYSVQILRNFGMGKRTIEERILEEGHFLLEELRKTQ
GKPFDPTFVVSRSVSNIICSVIFGSRFDYDDDRPLSIIHLINENFQIMSSPWGE
MYNIFPNLLDWVPGPHRRLFKNYGRIKDIIARSVREHQASLDPNSPRDFIDCFLTRWH
QEKQDPLSHFFMDTLLMTTHNLLFGGTETVGTTLRHAFRLLMKYPEVQ
VRVQEEIDRVVGHERLPTVEDRAAMPYTDAVIHEVQRFADVIPMSLPHRVTRDTNFRGFTIPR
GTDVITLLNTVHYDPSQFLKPKEFNPEHFLDANMSFKKSPAFMPFSA
GRRLCLGEALARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPYQLCVLAR 

CYP2F1P     human
            AC008537.3 93% identical to 2F1
            Fernandez-Salguero,P., Hoffman,S.M., Cholerton,S., Mohrenweiser,H.,
            Raunio,H., Rautio,A., Pelkonen,O., Huang,J.D., Evans,W.E.,
            Idle,J.R. and Gonzalez, F.J.
            A genetic polymorphism in coumarin 7-hydroxylation: sequence of the
            human CYP2A genes and identification of variant CYP2A6 alleles.
            Am. J. Hum. Genet. 57, 651-660 (1995)
            There are two 2F1 genes, and one pseudogene of 2F1 on chromosome 19.  
GEPFDPTFVLSRSRSNIICSVLFGSRFDYDDERLLTIIRLINDNFQIMSSPWGE
LYDIFPSLLNWVPGPHQRIFQNFKCLRDLIAHSVHDHQASLDPRSPRDFIHCFLTKMAE
KKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLAFMKYPKVQ
AHVQEEINLVVGHVRLPALKDRAAMPYTDMVIHEVQRFADIIPMNLPHRITRDTAFHGFLIPK
GTDVITLLNTVHYDPSQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAG
HRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLCLHPR

CYP2F1     Canis familiaris (dog)
           NW_876313.1:NW_876270.1:43272128-43283098
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           86% to human CYP2F1
MDGVSTAILLGLLALAFLFLILNSRGKSQLPPGPRPLPFLGNLLQLRSQDMLTSLTKSKEYGSVYTVHLGPRRVV
VLSGYQAVKEALVDQGEDFSGRGDYPVFFNFTKGNGIAFSNGDRWKVLRRFSVQILRNFGMGKRSIEERILEEGS
FLLAELRKTEGKPFDPTFVLSRSVSNIICSVIFGSRFDYDDERLLTIIRLINDNFQIMSGPWGEQLYNIFPSLLD
WIPGPHRRLFQNFGCMKDLIARSVRDHQDSLDPRCPRDFIDCFLNKMAQEKQDPHSHFHMDTLLMTTHNLIFGGT
ETVGTTLRHAFLVLMKYPKVQARVQEEIDRVVGRARLPALEDRAAMPYTDAVIHEVQRFADVIPMNLPHRVIRDT
PFRGFLLPKGTDIITLLNTVHYDPNQFLTPQEFNPEHFLDANQSFKKSPAFMPFSAGRRLCLGESLARMELFLYL
TAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPFQLRLRTR*

Cyp2f2      mouse
            GenEMBL M77497, NT_039413.1 + strand
            Swiss P33267 (491 amino acids)
            Ritter J.K., Owens I.S., Negishi M., Nagata K., Sheen Y.Y.,
            Gillette J.R. and Sasame H.A.
            Mouse pulmonary cytochrome P-450 naphthalene hydroxylase: cDNA 
            cloning, sequence and expression in Saccharomyces cerevisiae.
            Biochemistry 30, 11430-11437 (1991)

CYP2F3      goat
            GenEMBL AF016293
            Huifen Wang, Diane L. Lanza, and Garold S. Yost.  
            Cloning and expression of CYP2F3, a cytochrome P450 that bioactivates  
            the selective pneumotoxins 3-methylindole and naphthalene
            submitted

CYP2F4      rat
            GenEMBL AF017393
            R. Michael Baldwin and Alan Buckpitt
            submitted to nomenclature committee

CYP2F5      Gorilla gorilla
            GenEMBL AF372494
            Chen,N., Whitehead,S.E., Caillat,A.W., Gavit,K., Isphording,D.R.,
            Kovacevic,D., McCreary,M.B. and Hoffman,S.M.
            Identification and cross-species comparisons of CYP2F subfamily
            genes in mammals
            Mutat. Res. 499 (2), 155-161 (2002)

CYP2F6      Macaca mulatta (rhesus monkey)
            No accession number
            Mike Baldwin
            Pdf file of nucleotide/amino acid alignment
            This file shows polymorphism data
            The particular sequence shown is a pseudogene due to 
            a premature stop codon.
            PDF file for the sequences of a non-truncated version
            Pdf files from Mike Baldwin

2G Subfamily

CYP2G1P     human
            GenEMBL S80997, S80998, S80999
            Sheng J, Ding X
            Identification of human genes related to olfactory-specific CYP2G1.
            Biochem. Biophys. Res. Commun.  218, 570-574 (1996)
            2 PCR fragments for a human 2G1 are presented and 2 more PCR 
            fragments from two possible 2G1 pseudogenes are also shown.
            86% identical to rat 2G1

CYP2G1P     human
            GenEMBL AC008537 genomic DNA in 93 fragments
            Sequence is assembled from fragments and it may need to be revised
            The * symbols indicate intron locations, except the last one, which
            is a stop codon. The sequence is 78% identical to rat 2G1. 
            There is a frameshift after YMGP on the second line.
            CYP2G1 is 58-59% identical to some CYP2A sequences so it may actually 
            be a CYP2A sequence.  The 2G subfamily might be absorbed by CYP2A
CYP2G1P revised seq AC008537 missing exons 4, 5 and 6 
MELGGAVTIFLALRLSCLLILIAWKRMDKAGKLPPGPTPILFLGHLLQVRTDATFQSFMK
LREKYSPVFTVYMGP (fs) RPVVVLCGHEAVKEALIDQADEFSGRGELASIKQNFQGHG
VALANGERWRILRRFSLTILRDFGMGKQSIKERIQEEASYLLEEFQKTK
AKIHEEINQVIGPHRLPRVDDRVKMPYTDVVIHEIQRLVDIVPMGVPHNIIQDTQFRGYLLPK
GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSSGRGK
RICLGEAMARMELFLYFTSTLQNFSLCSLVPLVDIDITPKLSGFGNITPTYELCLVAR

CYP2G1      chimp
            Not a pseudogene

CYP2G1      Bos taurus (cow)
            See cattle page for details
            88% to human pseudogene 2G2P
3860 MELGGAFTIFLALCLSCLLILIAWKRMSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK(0) 4039
4854 LKEKYGPVFTVYMGPRPVVVLCGHEAVKEALVDRADEFSGRGELASVERNFQGH(1)5015
6748 GVALANGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGFLLVELRKTR(1)6897
8738 GARIEPTFFLSRTVSNVISSVVFGSRFDYEDQQFLKLLQMINQSFIEMSTSWAQ (0) 8899
9151 LYDMYSGIMQYLPGRHNRIYYLIEELKDFIASKVKINEASLDPQNPRDFIDCFLIKMHQ(0) 9327
300  DKNNPHTEFNLKNLVLTTLNLFFAGTETVSSTLRYGLLLMMKHPEVE(1)145
997  AKIHEEIDQVIGPHRIPSVDDRAKMPYTDAVIHEIQRLTDIVPMGVPHNVIRDTHFRGYLLPK(0) 1185
1314 GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGHFKKNEAFVPFSS(1) 1454
586  GKRICLGEAMARMELFLYFTSILQNFSLRSLVPPADIDITPKVSGFGNIPPTYELCFMVR(1) 765

CYP2G1      rat 
            GenEMBL M33296

CYP2G1      rabbit
            PIR B31944 (50 amino acids)
            Ding, X. and Coon, M.J.
            Purification and characterization of two unique forms of
            cytochrome P-450 from rabbit nasal microsomes.
            Biochemistry 27, 8330-8337 (1988)

Cyp2g1      mouse 
            GenEMBL L81171, NM_013809, NT_039410.1
            Hua, Z., Zhang, Q.Y., Su, T., Lipinskas, T.W., Ding, X.
            cDNA cloning, heterologous expression, and characterization of
            mouse CYP2G1, an olfactory-specific steroid hydroxylase.
            Arch. Biochem. Biophys. 340, 208-214 (1997) 
            94.9% identical to rat CYP2G1

CYP2G1      Canis familiaris (dog)
            chr1:115782146-115791970 UCSC browser May 2005 assembly
            90% to human 2G2P
MELGGAFTIFLALSLSCLLILIAWKRNSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK
LREKYGPIFTVYMGPRPVVVLCGHEAVKEALVDRADEFSGRGELASIERNFQGH
GVALANGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGFLLEELRKTK
GSPIEPTFFLSRTVSNVISSVVFGSRFDYEDKQFLKLLQMINESFIEMSTPWAQ
LYDMYSGIMQYLPGRHNRIYYLIEELKDFIASRVKINEASLDPQNPRDFIDCFLIKMHQ
DTNNPHTEFNLKNLVLTTLNLFFAGTETVSFTLRYGLLLMMKHPEVE
AKIHEEIDQVIGPHRIPSVDDRAKMPYTDAVIHEIQRLTDIVPMGVPHNVIRDTHFRGYLLPK
GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSS
GKRICLGEAMARMELFLYFTSILQNFSLHSLVPPADIDITPRVSGFGNIPPTYELCLKAR

CYP2G2P     human
            AC008962 comp(28700-40696) seq of gene has two in frame stop codons
MEMGGAVTIFLALCLSCLLILIAWK*MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK
LREKYSPVFTVYMGPRPVVVLCGHEAVKEALVDQADEFSGRGELASIKQNFQGHG
VALANGERWRIL*RFPLTILRDFGMGKRSIEERIQEEASYLLEEFRKTK
GAPIDPIFLLSRTVSNVISSVVFRSRFDYEDKQFLNLLRLINESFIEMSTPWAQ
LYDMYSGIMQYLPGRHNLIYYLVEELKDFIASRVKINEASFDPQNPRDFIDCFLIKMH
QDKNNPRTEFNLKNLVLTTLNLFFAGTETVSSTLRYGFLLLMKHPEVE
AKIHEEINQVIGPHRLPRVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNLIRDTQFRGYLLPK
GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGRFKKNEAFVPFSSGR
GKRICLGEAMDRMELFLYFTSTLQNFSLHSLVPPVDIDITPKLSGFGNIPPTYELCLVAR*
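
The count of in-frame stop codons noted for this entry can be checked directly from the listed peptide. This is a minimal sketch, assuming that "*" always denotes a stop codon in the lines passed in and that a "*" at the very end is the normal terminator; internal_stop_positions is a hypothetical helper name.

```python
from typing import List


def internal_stop_positions(lines: List[str]) -> List[int]:
    """Return 1-based positions of '*' characters other than a terminal one."""
    seq = "".join(line.strip() for line in lines)
    # A '*' as the last character is treated as the normal stop codon.
    body = seq[:-1] if seq.endswith("*") else seq
    return [i + 1 for i, aa in enumerate(body) if aa == "*"]


# Short fragments taken from the CYP2G2P lines above, for illustration only.
fragments = ["LLILIAWK*MNKAGK", "ERWRIL*RFPLT", "ELCLVAR*"]
stops = internal_stop_positions(fragments)
```

Applied to the full set of sequence lines, the length of the returned list gives the number of premature stops. Entries that use "*" for other purposes, such as marking intron locations, would need different handling.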

CYP2G2      Macaca mulatta (rhesus monkey)
            Note this does not look like a pseudogene
            exon 2 = trace archive file 456149111
MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK (0)
LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGH (1)
GVALANGERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTK (1)
GAPIDPTFLLSRTVSNVISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQ (0)
LYDMYSGIMQYLPGRHNRVYYLIEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQ (0)
DKNNPRTEFNLKNLVLTALNLFFAGTETVSSTLRYGFLLLMKHPEVE (1)
ARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNVIRDTQFRGYLLPK (0)
GTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNEAFVPFSS (1)
GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLSGFGNIPPTYELCLVAR
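
Entries laid out one exon per line with a trailing marker can be split into exon peptides and markers. The following is a minimal sketch, assuming the trailing "(0)" or "(1)" records the phase of the intron that follows the exon and that the last exon carries no marker; split_exons is a hypothetical helper name.

```python
import re
from typing import List, Optional, Tuple

# Assumed layout: peptide, optional "(digit)" marker, nothing else on the line.
_PHASE_RE = re.compile(r"^\s*([A-Za-z*]+)\s*(?:\((\d)\))?\s*$")


def split_exons(lines: List[str]) -> List[Tuple[str, Optional[int]]]:
    """Return (peptide, intron phase or None) for each line that matches."""
    exons = []
    for line in lines:
        m = _PHASE_RE.match(line)
        if m is None:
            continue  # header or comment line with several words
        phase = int(m.group(2)) if m.group(2) is not None else None
        exons.append((m.group(1), phase))
    return exons


# Second and last exon lines of the rhesus CYP2G2 entry, plus its header.
example = split_exons([
    "CYP2G2      Macaca mulatta (rhesus monkey)",
    "LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGH (1)",
    "GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLSGFGNIPPTYELCLVAR",
])
```

Joining the first element of each tuple gives the continuous protein sequence. A comment line consisting of a single word would be mistaken for a peptide by this pattern, so the input should be limited to sequence lines where possible.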

2H Subfamily

CYP2H1      chicken
            PIR D44107 (22 amino acids)
            Nakai, K., Ward, A.M., Gannon, M. and Rifkind, A.B.
            Beta-naphthoflavone induction of a cytochrome P-450
            arachidonic acid epoxygenase in chick embryo liver distinct
            from the aryl hydrocarbon hydroxylase and from
            phenobarbital-induced arachidonate epoxygenase.
            J. Biol. Chem. 267, 19503-19512 (1992)

CYP2H2      chicken
            PIR E44107 (25 amino acids)
            Nakai, K., Ward, A.M., Gannon, M. and Rifkind, A.B.
            Beta-naphthoflavone induction of a cytochrome P-450
            arachidonic acid epoxygenase in chick embryo liver distinct
            from the aryl hydrocarbon hydroxylase and from
            phenobarbital-induced arachidonate epoxygenase.
            J. Biol. Chem. 267, 19503-19512 (1992)

2J Subfamily

CYP2J1      rabbit 
            GenEMBL D90405 
            Kikuta, Y., Sogawa, K., Haniu, M., Kinosaki, M., Kusunose, E., Nojima, Y.,   
            Yamamoto, S., Ichihara, K., Kusunose, M. and Fujii-Kuriyama, Y. 
            A novel species of cytochrome P-450 (P-450ib) specific for the small intestine 
            of rabbits.
            J. Biol. Chem. 266, 17821-17825 (1991)

CYP2J2      human
            GenEMBL U37143 (1876bp)
            Wu, S., Moomaw, C., Tomer, K.B., Capdevila, J.H., Falck, J.R., 
            and Zeldin, D.C. 
            Molecular Cloning and Expression of CYP2J2, a Human Cytochrome P450  
            Arachidonic Acid Epoxygenase Highly Expressed in Heart. 
            J. Biol. Chem., 271: 3460-3468 (1996)

CYP2J2      Macaca fascicularis (cynomolgus monkey)
            No accession number
            Yasuhiro Uno
            Submitted to nomenclature committee 1/11/2005
            Clone name mfCYP2J2_2-B5
            94% to 2J2 human

CYP2J2     Canis familiaris (dog)
           NW_876313.1 :19927114-19956047
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           78% to human CYP2J2
MLAAVGSLAATLWAVLHLRTLLLGAVAFLFFADFLKRRRPKNYPPGPVPLPFVGNFFHLDFEQSHLKLQRFVKKY
GNVFSVQMGDMPLVVVTGLPLIKEVLVDQNQVFVNRPITPIRERVFKNSGLIMSSGQIWKEQRRFTLATLKNFGL
GRKSIEERIQEEAHHLIQAIEEENGQPFNPHFKINNAVSNIICSITFGKRFEYQDEQFQELLRLLDEVTCLETSM
RCQLYNVFPWIIKFLPGPHQKLFNDWEKLKLFIAHMTENHRRDWNPAEPRDFIDAYLKEMEKGNATSSFHEENLI
YSTLDLFFAGTETTSTTLRWGLLYLALNPEIQEKVQAEIDRVIGQSQLPGLAVRESMPYTNAFIHEVQRMGNIVP
LNVPREVTGDTTLAGYYLPKGTVIVTNLTALHRDPAEWATPDTFNPEHFLENGQFKKREAFLPFSIGKRVCIGEQ
LARSELFIFFTSLVQRFTFRPPDNEKLSLEFRTGLTISPVSHRLRAIPRS*

CYP2J3     rat
            GenEMBL U39943 (1778bp)
            Wu, S., Murphy, E., Gabel, S., Chen, W., Tomer, K.B., Foley, J., 
            Steenbergen, C., Falck, J.R., Moomaw, C.R., and Zeldin, D.C. 
            Molecular Cloning, Expression, and Functional Significance of a Cytochrome 
            P450 Highly Expressed in Rat Heart Myocytes. submitted.
            91% to mouse 2j9 exon 8 in a seq gap
            UCSC browser chr5 shown below
116772039 MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ 116771830
116767788 FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN 116767791 
116766010 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG 116765861
116765445 GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ 116765284
116760602 LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAK 116760426
116758387 YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ 116758247
116754923 EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPK 116754735
          GTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM 
116749991 GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL 116749815

CYP2J3P1    rat
            GenEMBL U40000 (1909bp)
            Wu, S., Murphy, E., Gabel, S., Chen, W., Tomer, K.B., Foley, J., 
            Steenbergen, C., Falck, J.R., Moomaw, C.R., and Zeldin, D.C. 
            Molecular Cloning, Expression, and Functional Significance of a Cytochrome 
            P450 Highly Expressed in Rat Heart Myocytes. submitted.
            Not a true pseudogene, but an alternative splice variant of CYP2J3
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ
FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN
GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQGEAYHLVEAIKDEG
GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ
LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEGRDFIDAFLKEMAK
YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ
EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPRKVAVDTYLAGFNLPK
GTMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM (GC boundary, retains intron)
GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKLSLQFRMSVTISPVSHRLCAIPRL
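
The listed splice-variant sequence can be compared line by line with the CYP2J3 entry above to locate residue differences. This is a minimal sketch, assuming the two lines being compared are already aligned and of equal length; residue_differences is a hypothetical helper name.

```python
from typing import List, Tuple


def residue_differences(a: str, b: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, residue in a, residue in b) for each mismatch."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length; align them first")
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Third sequence line of the CYP2J3 and CYP2J3P1 entries, respectively.
cyp2j3_line3 = "GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG"
cyp2j3p1_line3 = "GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQGEAYHLVEAIKDEG"
line3_diffs = residue_differences(cyp2j3_line3, cyp2j3p1_line3)
```

Positions are counted within the compared line, not within the full protein. Lines of unequal length, such as those around a deletion, would need a real alignment step first.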

CYP2J3P2    rat
            GenEMBL U40004
            Wu, S., Murphy, E., Gabel, S., Chen, W., Tomer, K.B., Foley, J., 
            Steenbergen, C., Falck, J.R., Moomaw, C.R., and Zeldin, D.C. 
            Molecular Cloning, Expression, and Functional Significance of a Cytochrome 
            P450 Highly Expressed in Rat Heart Myocytes. submitted.
            Not a true pseudogene, but an alternative splice variant of CYP2J3
MLVTAGSLLGAIWTVLHLRILLLAAVTFLFLADFLKHRRPKNYPPGPWRLPLVGCLFHLDPKQPHLSLQQ
FVKKYGNVLSLDFANIPSVVVTGMPLIKEIFTQMEHNFLNRPVTLLRKHLFNKN
GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAYHLVEAIKDEG
GLPFDPHFNINKAVSNIICSVTFGERFEYHDSQFQEMLRLLDEAMCLESSMMCQ
LYNIFPRILQYLPGSHQTLFSNWRKLKLFISDIIKNHRRDWDPDEPRDFIDAFLKEMAK
YPDKTTTSFNEENLICSTLDLFFAGTETTSTTLRWALLCMALYPEVQ
EKMQAEIDRVIGQGRQPNLADRDSMPYTNAVIHEVQRIGNIIPFNVPREVAVDTYLAGFNLPKG
(small deletion)
RDPKEWATPDTFNPEHFLENGQFKKRESFLPFSM
GKRACLGEQLARSELFIFITSLIQKFTFKPPVNEKVSLQFRMSVTISPVSHRLCAIPRL

CYP2J4      rat
            GenEMBL L81170 (1826bp)
            Zhang,Q.-Y., Ding,X., Kaminsky,L.S.
            cDNA cloning, heterologous expression, and characterization of rat
            intestinal CYP2J4
            Arch. Biochem. Biophys. 340, 270-278 (1997)
            UCSC browser chr5 shown below
116734902 MLATAGSLIATIWAALHLRTLLVAALTFLLLADYFKTRRPKNYPPGPWGLPFVGNIFQLDFGQPHLSIQP 116734693
116725983 FVKKYGNIFSLNLGDITSVVITGLPLIKETFTHIEQNILNRPLSVMQERITNKN 116725822
116723426 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRMQEEAHYLVEAIREEK 116723277
116722875 GKPFNPHFSINNAVSNIICSVTFGERFEYHDSRFQEMLRLLDEVMYLETTMISQ 116722714
116718583 LYNIFPWIMKYIPGSHQTVFRNWEKLKLFVSSMIDDHRKDWNPEEPRDFIDAFLKEMSK 116718407
116716306 YPEKTTSFNEENLICSTLDLFFAGTETTSTTLRWALLYMALYPEVQ 116716169
116713582 EKVQAEIDRVIGQKRAASLADRESMPYTNAVIHEVQRMGNIIPLNVPREVAMDTTLNGFHLPK 116713394
116711364 GTMVLTNLTALHRDPKEWATPDVFNPEHFLENGQFKKRESFLPFSM 116711227
116708412 GKRACLGEQLARSELFIFFTSLMQKFTFKPPTNEKLSLKFRNGLTLSPVTHRICAVPRE* 116708233

CYP2J4-de6b rat
            UCSC browser chr5: 116706163-116706053 (- strand)
            exon 6, frag w in fig. below
116706163 XXXXXXSFCEENLTCRTLDFLYAGIDTISNRLHWVLLLTCVNPEXX 116706053 
[Figure: rat, mouse and human 2J cluster]

Cyp2j5      mouse
            GenEMBL U62294 (1886bp), NT_039263.1
            J. Ma and D.C. Zeldin, unpublished.
            clone JM-6

CYP2J5P     rat
            UCSC browser Chr5: 116785102-116780337 (- strand)
            exons 1-4 69% to 2j5 mouse 
            now a pseudogene ortholog
116785102 MITSLSSLVTSSWAALLLRTLLLAAVTFLFLAGILRRHRPKDYQPGPWRLPFVGNFFQIDFEQSHLVLQK 116784893
116784415 FAKKYGNVFSLELDRPSVVVVTGQPLIKTKMFTHLEQNFANHFVTSVRKRAIGNN 116784251
116781318 GLITSNGQTWKEKRRFALMTLKNFGLGKKSLEQRMHE*AFHLVEARREEG 116781169
116780474 GQPVDLHLINNAVANVICSITFGGRFEYEDCQFQEMPTLLDEALHV 116780337

Cyp2j5-de2b mouse
            GenEMBL NT_039263.1|Mm4_39303_30 
            detritus exon 2
            q in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7613530 FVKKYGNLFSLELDSISVEVVSGLL 7613456
7613456 LIKEMFTHLDHNFVNRPVSAIQKHV 7613382

Cyp2j5-de9b  mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           detritus exon 9
           r in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7603742 GK*ACPGEHLAISELFIIFTDLM*NFTFKAPINQKLSLS 7603626
7603626 FRNGLTLSPVSYHICAVPQQ* 7603564

Cyp2j6      mouse
            GenEMBL U62295 (2046bp) NT_039263.1
            J. Ma and D.C. Zeldin, unpublished.
            clone JM-15

Cyp2j6-de6b mouse 
            GenEMBL NT_039263.1|Mm4_39303_30
            detritus exon 6 fragment 
            s in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7513690 TGFNKENLTCDTLDLLSGGIDTTSNGVHWVLLYRSVNKE 7513574

Cyp2j7      mouse
            GenEMBL XM_143894.1, NT_039263.1|Mm4_39303_30, AF218856
            D.C. Zeldin, unpublished.

Cyp2j7-de9b mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           from old Cyp2jzzp 
           w in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7177505 GKGACLGKQLAMSQLFIFFTSLMQKSTFKPPINENLSLKFTMSP 7177374
7177375 LSPVSHHIYAVPRQ 7177334

Cyp2j7-de9c mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           from old Cyp2jzzp 
           x in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7157638 GNRACPGEQLAMIELFIFFTALMQKCTFKSTVNEKLGLKIRLDLPLSPVSHHICAVPRQ 7157462 

Cyp2j7-de9d mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           from old Cyp2jzzp 
           y in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7138888 GKRTCHGKQLARSELFIFFTALMHIFTLNPPISKKLSLKFSMGLAFSPVSH*ICVVPTQ 7138712

Cyp2j8      mouse
            GenEMBL NT_039263.1|Mm4_39303_30
            AF218857 AI429871 vv77f02.y1 69-184 (EST),
            AA760476 vv77f02.r1 69-227 (EST), AZ393698 283-329 (GSS), AI606765 
            vv77f02.x1 330-476 (EST) AZ057726 422-463 (GSS), XM_131520.1 (from nr) 
            AL772157.1 htgs AC102925.1
            D.C. Zeldin, unpublished.
            clone WQ4-1

Cyp2j8-de2b mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           detritus exon 2 
           t in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7429084 LEKYGNNFSLILGD*TLVVITELLLTKEACIHMEQNILNHPATFIQECNSKK 7428929

Cyp2j8-de9b mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           detritus exon 9 
           u in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7417728 ERLIRSKIFSFTLSLKMKSSIYMEVFSFKP 7417639

Cyp2j8-de9c mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           detritus exon 9 
           v in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7414356 EQLARSEMFIFFIALMEKFTFKASVNEKLSLKFRMGFNLPQVSHNICAVPRY* 7414198

Cyp2j9      mouse
            GenEMBL NT_039263.1|Mm4_39303_30 AK018422 lung, also AF336850
            D.C. Zeldin, unpublished.
            clone WQ24-1

CYP2J10     rat
            GenEMBL XM_233199  
            Yu Z, Huse LM, Adler P, Graham L, Ma J, Zeldin DC, Kroetz DL.
            Increased CYP2J expression and epoxyeicosatrienoic acid formation in 
            spontaneously hypertensive rat kidney.
            Mol Pharmacol 2000 May;57(5):1011-20
            ortholog of mouse Cyp2j12
            Predicted by GNOMON 86% to 2j12 mouse (LOC313373), mRNA.
            2J10 seq specific rev primer matches 116499966-116499989
            forward primer 1 = 116515946 116515968
116516004 MLSTEDTLEAAIRALLHFRTLLLAAVTFLFLANYLKTRRPKNYPPGPWRLPFVGNLFQLDVKQPHVVIQK 116515795
116508667 FVKKYGNLTSLDFGTIPSVVITGLPLIKEAFTNTEQNFLNRPVTPLRKRVFNNN 116508506
116505791 GLIMSNGQTWKEQRRFTMTTLKNFGLGKRSLEQRIQEEANYLVEAIGADK 116505642
116505144 GQPFDPHFKINSAVSNIICSITFGERFEYEDSLFQELLRLLDEASCLESSMMCQ 116504983
116500081 LYNVFPTIIKYLPGSHQTVLRNWEKLKLFISCMMDSHQKDWNPDEPRDFIDAFLTEMAK 116499905
116496152 YRDKTTTSFNKENLIYSTLDLFFAGSETTSNILRWSLLYITTNPEVQ 116496012
116489147 EKVHSEIDRVIGHRRQPSTGDRDAMPYTNAVIHEVLRMGNIIPLNVPREMTADSTLAGFHLPK 116488959
116488244 GTTILTNLTGLHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSM 116488107
116479687 GKRACPGEQLARTELFIFFTALMQNFTFKPPVNETLSLKFRNGLTLAPVSHRICAVPRQ 116479511

Cyp2j11     mouse
            GenEMBL XM_131521, AC091461.3 Unigene Mm.26915, NT_039263.1
            Joan Graves, Hong Wang, and Darryl Zeldin
            Clone name CYP2JA

Cyp2j12     mouse
            GenEMBL XM_143892 (genbank entry missing part of exon 4)
            NT_039263.1|Mm4_39303_30

Cyp2j13     mouse
            GenEMBL NT_039263.1|Mm4_39303_30
            Map view locus LOC230459
            Joan Graves, Hong Wang, and Darryl Zeldin
            Clone name CYP2JC

CYP2J13     rat
            GenEMBL XM_233198 1455 bp 
            ortholog of mouse Cyp2j13
            Predicted GNOMON Rattus norvegicus similar to CYP2J4 (LOC313372) mRNA.
            Missing exon 1; 74% to XM_233199, 79% to 2J4,
            78% to 2J3, 90% to 2j13 mouse
116449294 FVKKYGNVISLDLGIMSSVIISSLPLIKEAFSHLDENFINRPIFPLQKHIFNDN 116449133
116446157 GLIFSSGQTWKEQRRFALMTLRNFGLGKKSLEQRIQEEAHHLVEAIGEEE 116446008
116445630 GQPFDPHFKINNAVSNIICSITFGERFEYHDSQFQELLKLLDKAMYLGTPMMIH 116445469
116440971 LYNMFPWIIKHLPGQHQTLLATWGKLKSYIADIIENHREDWNPAEPRDFIDAFLNEMAK 116440795
116428766 YPDKTTTSFNEENLICSTLDLFLAGTETTSTTLRWAVLYMALYPEVQ 116428626
116426881 EKVQAEIDQVIGQEKHPSLADRDSMPYTNAVVHEIQRMGNIVPLNVPREVAVDTTLAGFHLPK 116426693
116426568 GSVVMTNLTALHMDPKEWATPDVFNPEHFLENGQFKKRDSFLPFSM 116426431
116423270 GKRACLGEQLARSELFIFFTALMQKFTFKPPTNEKLSLKFRLGITISPVSHRICAVPRL 116423094

Cyp2j13de1X  mouse
            Detritus exon 1 7kb downstream of 2j13 (exon 8)
            Note: this is an early and incorrect nomenclature for Cyp2j13-de8b

Cyp2j13-de8b mouse
            GenEMBL NT_039263.1|Mm4_39303_30 
           detritus exon 8, about 7000 bp downstream of 2j13
            z in Figure 5D Nelson et al. Pharmacogenetics 14, 1-18 (2004)
7025751 GSVVLTNLTALQVDPKD*ATPDVVIPEHFLKNGEF*KGESFLPFSIG 7025611

Cyp2j14-ps mouse
           GenEMBL NT_039263.1|Mm4_39303_30 
           exons 3,4,9
7377737 XXXXXSNGQTWKEQKRFALMILKNFELGKKSLEQHIQEEANHLLEAMGEEK 7377600
7376950 GQPFDPHY 7376927
7376925 VSNIICFITFGDHFEYDDNKFQELLKLTDETLCSEASMMLV 7376803
7353938 GKRSCPGEQMAISELFIFFT 7353879
7353880 LFTQKFTFSPPVNEKLKFKNGLTLSPVSHHICAVPRQ* 7353767

Cyp2j15-ps mouse
           GenEMBL NT_039263.1|Mm4_39303_30
           exons 3,4,5,9 
7271792 GFI*SSSQIWKD*RFILMTLKHFGLGKILVHLMQGESCCHLVGA 7271661
7271288 GQHSDLHFIINNAVCNIIFSVTFDCFLETHDCRFQEMLKLMDEFICLETTMLHQ 7271127
7245486 LYNVFPHLMKYILVSLQTVFRN 7245421 
7245421 RGKLKLLASCMIDKHVRDWNPD*PRDFIDVFFKEMMK 7245311
7232303 GKRACHGEQLARSELFIF*TALIQKFVFKVPVNEKLSLKFRLGFPLPPVNHHIYAVPRD* 7232124

CYP2J16-de2b5b9b  rat
           UCSC browser (- strand) frag x in figure below
116691748 KKYGNIFGLNLGDLTSEVITGLLLSKE 116691668 exon 2
116684743 FYDIFPYLMKYIPGITSNCFQKLGKLKLFVSCMTDEHRRDWNPEDPRNFTDALLKEMMK 116684567 exon 5
116677505 GKRACPGEQLARSKLFIFFTALIQKFTF 116677422
116677420 RLGMKSILGLTLSPVTHHI*ALSKQ 116677346 exon 9
rat, mouse and human 2J cluster

CYP2J16    rat
           UCSC browser (- strand)
116664772 MLATVGSLLAKIWSAINFWTLLLTLLTFLLLADYLKNRRPNNYPPGPWRLPFVGNLFQFDLNISHLHLRIQQ 116664557
116654396 FVKKYGNLISLDFGNISVVVITGLPLIKEALINNEQNFLKRPIVPSRYRVFKDN 116654235
116651622 GIFFANVHKWKEQRRFALTMLKNFGLGKKSLEQCIQEEAHHLVEVIGEEK 116651473
116650955 GQPFDPHFRINNAVSNIICSITFGERFEYDDSQFQELLKLADEVICSEASMTSV 116650794
116640170 LYNVFPLIFKYLPGPHQTVFKNWEKLKSIVANMIDRHRKDWNPDEPRDFVDAFLTEMTK 116639994
116638624 YPDKTTTSFNEENLIATTLDLFFAGTETTSTTLRWALLYITLNPEVQ 116638484
116627938 EKVHSEIDRVIGHGRLPSTDDQDAMPYTNAVIHEVLRMGNIIPLNVPREVTADSTLAGFHLPK 116627750
116624337 GKMILTNLTALHRDPKEWATPDTFNPEHFLENGQFKKRDSFLPFSV 116624200
116612610 GKRACPGEKLAKSELFIFFTALMQNFTFKAPTNEKLSLKLRKGLSLYPVSYRICAVPR 116612437
rat, mouse and human 2J cluster

CYP2J16-de5c6c9c   rat
           UCSC browser (- strand)
           72% to 2j6 mouse, frag y in fig below
116604392 LYNIFPWIMNYGPGSHQ 116604342 116604222 exon 5
116604345 SVFRNWEKLKLFVSCMIDNKQRWVP 116604271 exon 5
116602255 YPEKSTSFSQGHLFCSTLNLFRAGSET 116602175 exon 6
116591992 GKRACPGEQMAISELFSFFAAFMQ 116591921 exon 9
116591919 KFTFHLAINEKLRMKFRNGLTLP*SSHLYC 116591830 exon 9
rat, mouse and human 2J cluster

CYP2J17P   rat
           UCSC browser (- strand)
116584536 MLATASCLVANVCSAIPLWTLLLAALSWLPQKQAPQKQPSRALAPAIFGNLFQFDLDVSQLHSGI*PSKK 116584327 exon 1
116581102 FVTKYGNLISLDFGNTSSVIISGLPLIKEALTDM 116581001 exon 2
116580637 EQNLLKCIVLASREHVFKNN 116580578 exon 2 last half
116570454 LYNVFPFIIKYL 116570419 exon 5
116570408 NQTFFRNWENLNLFVSHMMESHRKDWNPVEPRDFIDAFLTYMTKEDD 116570268 exon 5 last half
116566151 KVHSEIDGVTGHGRPPSTGDRDSMPYTNAVIYEVLRMDNINPLKVPREVTADSTLDEFCLSK 116565966 exon 7
116563406 GTMVLINLTALYRESKEWTTQDTFNPEHFLENGMFKKRESF 116563284 exon 8
116559748 KFTFKPPISEKLSLKFRTGLTLSHVSCRI*SIHR 116559647 exon 9

CYP2J18P    rat
            UCSC browser (- strand)
            63% to 2j6 mouse
116551335 MLGTQDILEAGIWALLH 116551285 exon 1
116551282 RTLLLAAVTFLLLADYLKTGNK 116551217 exon 1 
116551217 KKYPWGPCNPPVMNNLFQLDLEQ 116551149 exon 1
116537661 LYNAFLSIMKYHPGSHQ 116537611 exon 5 
116537614 SVFRNWEKLIWRMSHIAENHCKG*NPAEL 116537528 exon 5 
116537523 REFIDAFLTKMTK 116537485 exon 5
116534551 YPDKTTTNFNEENLICA 116534501 exon 6
116534498 LEFLFARTEITSTTLSWVLLYLSANPGVQ 116534412 exon 6
116529361 LFIFFTSLMQKFTFKPPISEKLILKFRMGLILSPVCH*ICVVPRQ* 116529224 exon 9

Cyp2jbbpX   mouse 
            XM_143896 
            Map view locus LOC230464 
            exons 3-4 and exon 9
            temporary placeholder name for Cyp2j14-ps

Cyp2jzzpX   mouse 
            Map view locus LOC230460 
            3 C-term fragments about 19 kb apart
            temporary placeholder name
            note this is an old name for Cyp2j7-de9b, Cyp2j7-de9c, Cyp2j7-de9d

CYP2J19     Gallus gallus (chicken) 
            NW_060417.1 weakly like a CYP2J, 52% to 2J2 human  
            BI390850.1 EST all the best hits are CYP2Js
12644 MDFRFWPISQLGKLNVSMLLVVLVMFLLIIDFVRKRRPRNFPPGPQLFPLVGTIVDLRQPLHLEMQK  12444
10910 LTARYGNIFSVQFGGLTFVVVSGYQMVREALVHQAEIFADRPHIPLLQEIFRGF  10749
10125 GLISSNGHIWRQQRKFVSATLKSIAVSFESKVQEESRYLVEAMEEEK  9985
8514  GQPFDPHYKINSAVSNIICSITFGNRFNYHDSNFQELLHLLAETLLLIGSFWGQ  8299
7615  LYNAFPLIMRWLPGPFRKIFRHWEKLQRFVRGVIAKHKEDLDQSDLGDYIDCYLKEIEK  7439
7077  CKGDTNSYFHEENLLCSTLDLFLTGTETTATAIRWALLYMAAYPHIQ  6937
6401  EKVQLEIDAVIGQCRQPTMEDKEHMPYTSAVLSEVLRMGNIVPLGVPRMSTNDTTLAGFHVPK  6213
5285  GTTLMTSLTSIMFDKNVWETPDTFNPEHFLENGQYRRREAFLPFSA  5148
4669  GKRACPGEQLARTELFIFFTALLQKFTFQAPSATVLSFAFTLSLTRCPKPFQLCALPR  4496

CYP2J20     Gallus gallus (chicken) 
            NW_060417.1 weakly like a CYP2J, 52% to 2J2 human  
            This sequence joins with the rest of the gene on 
            NW_060416.1|Gga8_WGA225_1
            joined by EST BI064782.1
            (part of a 6 gene CYP2J cluster)
    1641  MLRFLWDSISLQMLFIFLLVFLLVSDYMKRRKPKDFPPGPFSFPFLGNVQFMFAKDPVVAIQK  1453
    943   FIEKHGDIFRTQVGSMSFVIVNGLPLIKEALVTQGENFMDRPEFPTNTEFFNKF  782
    574   GLVSSNGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTDAFRDEQ  425
          GNPFNPHLKVNNAVSNIICSVTFGNRFEYHDEDFQNLLRLMNETAILQGKIMSQ
15531671  LYNFFPSVIKYFPGSHQTVIKNGRLMKRFVCKKISKHKEDLSPSESRDFIDSYLQEMAK  15531495
15531239  PNGRDFCEDNLVACTLDLFFAGTETTSTTIRWALLYMAIYPEIQ  15531108
15530636  ARVQAEIDAVIGQARQPSLEDRSNMPYTNAVIHEVQRKGNIIPFNVPRQAVKDTVLAGFRVPK  15530448
15529975  GTILIPNLSSVMFDMKEWETPHSFNPGHFLKDGQFWKREAFMPFSI  15529838
15529096  GKRACLGELLARAELFLFFTALLQKFTFQAPPDTILDLKFTHGMTLAPQPYMICAVPR  15528923

CYP2J21     Gallus gallus (chicken) 
            NW_060416.1|Gga8_WGA225_1
            (part of a 6 gene CYP2J cluster)
15526022  MLRFLWDSISLQMLFVFLLVFLLVSDYMKKRKPKDFPPGPFSFPFLGNMEFIIAKDPVAVTEK  15525834
15525310  FIEKHGDIFSTQVGSMSFVIVNGLPLIKEALVTQGENFMDRPEFPINTEFLNKF
15524941  GLVFSNGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTDAFRDEQ  15524792
15523650  GNPFNPHLKVNNAVSNIICSVTFGNRFEYHDEDFQNLLRLMDETVTLQGEPMSQ  15523489
15522627  LYAFFPSIIKYFPGSHQTVLKNEKLMKRFVCKKISKHKEDLSPSESRDFIDSYLQEMAK  15522451
15522209  KPNGSDFCEDNMVSCTLDLFFAGTETTSTTIRWALLYMAIYPEIQ  15522075
15521605  ARVQAEIDAVIGQARQPSLEDRSNMPYTNAVIHEVQRKGNIIPFNXXXXXXXXXXXXXXXXXX  15521471
15521289  XXLLIPNLSSVMSYKKQWETPHSFNPGHFLKDGQFWNREAFMPFSI  15521158
15520424  GKRACLGELLARAELFLFFTSLLQKFTFQAPPDTILDLKFTVGITLAPQPYKICAVPR  15520251

CYP2J22     Gallus gallus (chicken) 
            NW_060416.1|Gga8_WGA225_1
            (part of a 6 gene CYP2J cluster)
15518269  MLRFLWDSISLQMLFVFLLVFLLVSDYMKKRKPKDFPPGPFALPFLGNVQLMVAKDPVSTVQK  15518081
15517552  XXEKHGDIFSMQVGSMSFVIVNGLQMIKEALVTQGENFMDRPEFPMNAEVFNKF  15517403
15517205  GLLSSNGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTDAFRDEQ  15517056
15515960  GNPFNPHLKINNAVSNVICSITFGNRFEYHDEDFQNLLRLMDETVTLHGKIMSQ  15515799
15514587  LYTFFPSIVKYLPGSHQTVIKNGKLMKDFVCNVISKHKEDLNPSESRDFIDSYLQEMAK  15514411
15514166  PDSSDFCEDNLVSCTLDLFFAGTETTSTTIRWALLFMAMYPEIQ  15514035
15513576  ARVQAEIDAVIGQARQPSLEDRNNMPYTNAVIHEVQRKGNIIPFNALRLTVKDTVLAGFRVSK  15513388
15512873  GTILIPNLSSVMYDKKEWETPHSFNPGHFLKDGQFWKREAFMPFSI  15512736
15512011  GKRACLGELLARAELFLFFTSLLQKFTFQAPPDTILDFKFTMGITLAPRPYKICAVPR  15511838

CYP2J23     Gallus gallus (chicken) 
            NW_060416.1|Gga8_WGA225_1
            (part of a 6 gene CYP2J cluster)
15510424  MLRFLWDSISLQMLFVFLLVFLLVSDYMKRRKPKDFPPSPFSFPFLGNVQFMFAKDPVVATQK  15510236
15509668  XXEKLGDIFSMQAGSQSFVIVNGLPLIKEALVTQGENFMDRPEIPLDTDIFSKL  15509519
15509300  GLISSSGHLWKQQRRFTLTTLRNFGLGKRSLEERIQEECRFLTEAFRDEQ  15509151
15508915  GNPFNPHLKINNAVSNIICSVTFGNRFEYHDENFQTLLRLMDETVTLHEKIMSQ  15508754
15508232  LYNAFPSIVKYLPGSHQTIFKNWRLMKDFVNEKISKHKEDLNPSESRDFIDSYLQEMAK  15508056
15507812  PSGSEFHEENLVACALDLLFAGTETTSTTIRWALLFMAVYPEIQ  15507681
15507221  AHVQAEIDAVIGQARQPALEDRNNMPYTNAVIHEVQRKGNIIPFNVPRQAVKDTVLAGFRVPK  15507033
15506561  GTILIPNLSSVMYDKKEWETPHSFNPGHFLKDGQFWKREAFMPFSI  15506424
15505718  GKRACLGELLARAELFLFFTSLLQKFTFQAPPDTILDFKFTMGITLAPRPYKICAVPR  15505545

CYP2J24P    Gallus gallus (chicken) 
            NW_060416.1|Gga8_WGA225_1
            (part of a 6 gene CYP2J cluster)
15504220    DSMKRQWLNFFKSIVGQQQLHCADYMKRRKPKDFPPSPFSFPFLGNV*FMFAKDPVVATQK  15504038
15503534  IIEEHGDIFSMQVGTQSFVIVNGLPLIKEALVTQGENFMDRPEIPMNAEVFSKL  15503385
15503168  GLLSSNGHL*KQQRRFTLTTL*NLGLGKRSLEERIQKECQFLTDAFRDEQ  15503019
15501515  GNPFNPHLKVNNAVSNVICSITFGNWFEYHDKDFQNLLQLMDETATFYGKIMNQ  15501354
          gap
15501024  PNGSDFCGDNLVLCTLDLFFAGTETTSTTIRWALLFMAIYPEIQ  15500893
          gap
15498733  GKRACLGELLARVEIFLFFTSLLQKFTFQAPPDTILDVKFTMGITLAPQPYKICAVPR  15498560

CYP2J25   Phalacrocorax carbo (Common cormorant)
          No accession number
          Hisato Iwata
          submitted to nomenclature committee 5/19/05 
          78% to 2J23, 76% to 2J22, 70% to 2J21, 75% to 2J20
          55% to 2J19

CYP2J26     Bos taurus (cow)
            See cattle page for details
MLEALGSLVAALWTTLRPGIVLLGAFVFLLFADFLKRQHPKNYPPGPLRLPFIGNFFHLDLGKGILVPQQ
VVKKYGNIIRLDFGVIHFIVITGLPYIKEALVNQEQNFVNRPMIPLQKHIFNNK
GLVRSNGQVWKEQRRFTLTTLRNFGLGRKSLEERIQEEVTYLIQAIGEEN
GQPFDPHFIINNAVSNIICSITFGERFDYKDDQFQELLRLLDEILCIQASVCCQ
LYNAFPRIMNFLPGSHHTLFRKWEKLKMFVANVIENHRKDWNPAEARDFIDAYLQEIEK 11676
HKGNATSSFDDENLICSTLDLFLAGTETTSTTLRWGLLFMALNPEIQ 14705
EKVQAEIDRVLGQSQKVSTASRESMPYTNAVIHEVQRMGNIVPMNVPREVTVDTVLAGYH 15236
LVKGTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRESLTSSPASYRLCAIPRA* 25310

CYP2J27     Bos taurus (cow)
            See cattle page for details
MLEALGSLAAALWAALRPGTVLLGAVVFLFLDDFLKRRRPKNYPPGPPPLPEVGNFFQLDFDKAHLSLQR 
FVKKYGNVFSVDFGIFRSVLITGLPLIKEALVHQDQNFANRPLIPIEKRIFNNK 37352
GLIMSNGHVWKEQRRFALTTLRNFGLGKKSLEERIQEEAAYLIQEIGEEN 39667
GQPFDPHFTINNAVSNIICSITFGERFDYQDDQFQELLRLFDEMMHLRTSTCCQ 40221
LYNIFPRIMSFLPGPQHALFSKWEKLKMFIAGVVENHKRDWNPAEARDFIDAYLQEIEK 42145
HKGNATSCFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 43949
EKVQAEIDRVLGQSQKPSMAARESMPYTNAVIHEVLRMGNILPLNVPREVTVDTVLAGYRLPK 
GTMVTTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI 
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRMSMTLSPLSHRLCAIPRA*

CYP2J27-ie5b     Bos taurus (cow)
            See cattle page for details
            extra internal exon 5
LSNVFPRIMNFLPGPQHTLFSKWEKLKMFIAGVIENHKRDWNPAEARDFVDAY 41591

CYP2J28     Bos taurus (cow)
            See cattle page for details
MLEALGSLAAALWAALRPGTVLLGAIVFLLLTDLLNRRRPKNYPPGPPRLPFVGNFFQLDFEQGHLSLQR
FVKKYGNLFSLEFGDLPSVVITGLPLIKEVLVYQDQNFVNRPISPIRERVFKKN
GLIMSNGHIWKEQRRFSLTALRNFGLGRKSLEERIQEEVAYLIQAIGEEK
GQPFNPHFKINNAVSNIICSITFGERFDYQDDQFQELLRLLDEVTYLETTVWCQ
LYNVFPRIMNFLPGPHQMLFSNWRKLKMFVARVIENHKRDWNPAEARDFIDAYLQETEK
HKGNAASSFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 716 
EKVQAEIDKVLDESQQPSMATRESMPYTNAVIHEVQRMGNILPLNVPREVTVDTVLAGYHLPK 
GTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKREAFLPFSI
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPEHEELSLKFRMGLTLSPVSHCLCAVPRA*

CYP2J29     Bos taurus (cow)
            See cattle page for details
MLSSLAAALWAALRPGTVLLGAVAFLFFADFLKRRRPKNFPPGPAGLPFVGNSFQLDPEKVHLTLQQ
FVKKYGNVFSLDFGTFPSILITGLPLIKEALVHQGENFSKRPVMPLQERIFNTK
GLIMSSGHIWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQMIREEN
GKPFDPHFIINNAVSNIICSITFGERFDYQDSQFRELLRLLDEVLNLHTSLCCQ
LYSVFPRIMNFVPGPHQTLFSNLEKLKMFVAEMIENHKRDWNPAEARDFIDAYLQEIEK 8435
HKGGDASSFREENLIYSTLDLFLAGTETTSTSLRWGLLYMALNPEIQ 5634     
EKVQAEIDRVLGQSQQPSTAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 5455
GTVVVTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI 2548
GKRMCLGEQLARAELFIFFTSLLQKFTFRPPENEKLSLKFRVSLTLAPISHRLCAVPRG*

CYP2J30     Bos taurus (cow)
            See cattle page for details
MLEALSSLATALWAALRPDTVLLGTLAFLLFVDFLKRRHPKNYPPGPPGLPFVGNLFQLDPEKVPLVLHQ
FVKKYGNVFSLDFGTVPSVLITGLPLIKEVLVHQGQIFSNRPIVPLQEHIINNK
GLIMSSGQLWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQTIREEN
GQPFDPHLTINNAVSNIICSITFGERFDYQDDQFQELLRMLDEILNLQTSMCCQ
LYNVFPRIMNFLPGPHQALFSNMEKMKMFVARMIENHKRDWNPAEARDFIDAYLQEIEK
HKGDATSSFQEENLIYNTLDLFLAGTETTSTSLRWGLLFMALNPEIQ
EKVQAEIDRVLGQSQQPSMAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 15084
GTMVMTNLTALHRDPTEWATPDTFNPEHFLENGQFKKRESFLPFSI 12265
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEQLSLKFRVSLTLAPVSHRLCAVPRG*

CYP2J31P    Bos taurus (cow)
            See cattle page for details
MGAAAFLFVVHLKRRRGKNYPPGPPGLPFLGNFFHLDLKQLHLSLQQ 
IVKKYGNMISLEMGGFSTVFFKWIAQNQRSPCLPGPKLVNHPIQRIQENIFKKH 5343
GLIMSNGHIWKEQRRSALTTLRNFGLGRKILEECIQEEAAYLIQTVGEEN 8001
XQPFDPHFTINNAVSNIVCSIAFGELFDYQDSXXQELLRLMDEAMYLQTSVRCRV 8538
LYNFFARIMNFLPGPHQTLFIKWEKLNMFIDSVIENHRRDWNPAEPRDFTDA
15856 GMWMCPGEQLARTELFIFFTSLLQKFTFRPPGDEKLSLQFRVSLTISSVSHWLC 16020

CYP2J32v1   pig 
            BW982013.1, CB287444.1, Z84061.1, BE014607.1
            97% to CJ016505.1, 80% to 2J27 cow
ALGSLAEALWTALRPSTILLGAVAFLFFADFLKKRRPKNYPPGPPRLPFIGNLFHLDLDK
GHLSLQRFVKKYGNVFSLDFGALSSVVITGLPFIKEAFVHQDKNFSNRPIVPIQQRVFKD
KGVVMSNGQVWKEQRRFALTTLRNFGLGKKSLEERIQEEAQYLIQAIGEENGQPFNPHFK
INNAVSNIICSITFGERFDYQDNQFQELLKLLDEVMCLQTSVWCQIYNIIPWIMKFLPGP
HQTLFSNWEKLKMFVAHVIENHRRDWNPAEARDFIEAYLQEIEKHTGDATSSFQEENLICS
TLDLFVAGTDTTSTTLRWGLLYMALYPEIQEKVQAEIDRVLGQLQQPSSSARESMPYTNA

CYP2J32v2   pig 
            CJ016505.1
NRPTVPIQQRVFKDKGVVMSNGQVWKEQRRFALTTLRNSGLGKKSLEERIQEEAQYLIQA
IGEENGQPFNPRFKINNAVSNIICSITFGERFDYQDDQFQELLKLLDEVMCLQTSVWCQI
YNIIPWIMKFLPGPHQTLFSNWEKLKMFVAHVIENHRRDWNPAEARDFIDAYLQEIEKHK
GDATSSFQEENLICSTLDLFVAGTETTSTTLRWGLLYMALYPEIQEK
VQAEIDRVLGXLQQPSTAARESMPYTNA

CYP2J33     pig 
            BP170090.1, CK453810.1, BW982704.1, DB811462.1,
            DB817476.1, DY414727.1, DY418828.1; 85% to CJ016505.1,
            80% to 2J28 cow
MTQALGSLAEALWTALHPSTLLLGAVTFLFFADFLKKRRPKNYPPGPLRLPFVGNLFHLD
FEKAHLSLQRFVKKYGNIFSLDLCALSAVVVTGLPLIKEVLVHQNQKFANRPILPIQDRV
FKNKGVVTSSGQVWKEQRRFTLTTLRNFGLGKKSLEERIQEEAQYLIQAIGEENGQPFNP
QFKISNAVSNIICSITFGKRFDYQDDQFQELLRLLREVTHLQTLLWCQLFNVFPRIMKFL
PGPHQTLFSDWEKLEMFIARVIENHRRDWNPAEARDFIDAYLQ
EIEKNKGNATSSFHEENLICSTLDLLFPG
TDTTLITLRWGLLYMALHPEIQEKVQAEIDRVLGQSQQPSTAARESMPYTNAVIHEVQRM
GNIIPLNVPREVAEDTTLAGYHLPKGTMVLTNLTAL
HRDPAEWATPNIFNPEHFLENGKFKKREAFLPFSIGKRACLGEQLARTELFVFFTSLLQK
FSFRPPDNEKLSLKFRVGLTLSPVTYCICAVPRA*

CYP2J34     pig  
            BW981916.1, CJ028862.1, BW967356.1, CJ025847.1, BP142154.1,
            BP168104.1, CJ025026.1, BW967863.1
            83% to BW982013.1, 80% to 2J28 cow
MTPALGFLAEALWTALRPSTLLLGAVAFLFFADFLKRRSPKNYPPGPPRLPFLGNFFHLD
VEKGHLALQRFVKEYGNIISLDSSVFSSVVITGLPLIKEAFVHQDQHFANRPMIPTQERV
FKKNGLIMSNGQVWKEQRRFALTTLRNFGLGKKSLEERIQEEAQYLIQAIGEENGQPFNP
HFKINNAVSNIICSITFGKRFDYQDDRFQELLRLLDEVTCQHTSVQVQLYNMFPRIMKFL
PGPHQTLFSNWEKLQIFVACVIENHKRDWNPAEARDFIDAYLQEIEKHKGNATSSFQEEN
LIFTTLDLFFAGTETTSTTLRWGLLYMALYPE

CYP2J35     pig 
            BW960287.1, BI359857.1 
            75% to 2J28 cow
MLGAVGFLAEVFGTALGPSALLLSAVAFLFVADILKRWRPKNYPPGPLRLPFVGNFLHLD
FEQWHLSLQRFVKKYGNVLSLDLGAFSSVVITGLPLIKEALVHQDQNFVNRPINLNQV
FQKNGLIMSNGQVWKEQRRFALTTLRNFGLGKKSLEERIQEEAQYLIQAVREENGQPFDP
HFKINNAVSNIICSITFGERFDYQDDQFQELLRLLDEVTCL
PKLVRVQLFNVFPRIMKLLPGPHQIIFSNREKLRMF
IARVIENHRRDWNPAEARDFIDAYLREIEKGSSPSVFNEENLICSTLDLFFAGTETTS
TTL

CYP2J36   Anolis carolinensis (anole lizard)
          scaffold 23 3305369-3326894 (-) strand
          (small gap in exon 8) 
          55% to CYP2J2, 43% to CYP2C8
3358582 MWFHAFAIFWETISLQVILGFLATFLLLTDYVKRRRPRGFPPGPIPLPFLGNLLSYDAKKPHLYNQK 3358382
3357138 LVAIYGNVFSLQLGNIHIVFLNGLQAVKEALINQGESFLDRPKVPITYDVSKTF 3356977
3351644 GVITSNGQTWKQQRRFVMSTLRNFGLGKTYLEERIQEESRFLVAAIEDEK 3351495
3348890 GQPFDPYHQINNAVSNVICSVTFGNRFDYHDSDFQKLLHLLDETGVFLRNIWSH 3348729
3347734 LYNAFPSLMRRLPGPHQTYFKNWEQLKSFVRKIIEKHKEDWNPLKTKDFIDAYLNEMAK 3347558
3346355 FKENASSTFHMENLLQSTLDLFVAGTETTSATLHWAVLYMAVYPEIQ 3346215
3343877 AKVQAEIDSVIGQSHLPAMADRDNMPYTNAVIHEIQRRSSIVVVNAPRLTANDTQVAGFHLPK 3343689
3337326 xxxxxxxLTSILFDKNEWETPNVFNPNHFLKNGQFMKREAFVPFST 3337210
3335618 GKRACPGEQMAKMELFLVFTTLLQKFTFQAPKGVKLSLDSKTGHVLKPKPYQICAISR* 3335442

CYP2J37P   Anolis carolinensis (anole lizard)
           scaffold 23 3305369-3326894 (-) strand
           pseudogene 
           57% to CYP2J2, 43% to CYP2R1
3326894 MLCHCFAVFWEALSLKIVFVFLFTFLIIADYIRQRRPRGFPPGPRPLPFVGNLFSVDITKPHLSSEK 3326694
3325276 FMEIYGKIFSLQLGKFPFVIVNGLQLVKEALIHQNENFVDRPILPIIYDHSKTF 3325115
3322787 GLIMSNGLSWKQQRRFALSTLRNFGLGKRSLEEQIQEESRFLVGAIEDEK 3322638
3320225 GQPFDSHYQINNAVSNVICSVTFGKCFDYHDSQFQKLLHLLDEMGNVQAGFWGM 3320064
3309149 AYNTFPALMKLLPGPHQTVFKNWDQLKSFVRKIIEKHQNWNPLETRDFIDAYLNEIAK 3308976
3308595 LKD*ASSSFHMENLLQ*TIDLFIAGTETETTSATLRWAVLYMAIYPDIQ 3308449
3307295 GKVQAEIDSVIGQSRSLTMADRDSLPYTNAVIHEIQRMGNILPFSAPRVAVNDTRLAGFYLPK 3307107
3305985 GTILLPNLTSLLFDKDEWDTPNKFNPNHFLKDGQFMKREAFIPFSI 3305848
3305545 GKRSCLGEQLARMELFLFFTTLMQKFTFQAPNGLRLSLDFKIGNALSPKPYKICAISR* 3305369

CYP2J38   Anolis carolinensis (anole lizard) 
          scaffold 23 3277211-3297585 (-) strand
          57% to CYP2J2, 43% to CYP2C18
3297585 MLFHCFAVFWETLSLKAVLVFLATFLIVADYVRRIHSRGFPPGPMPLPFVGNLLHLDAEKPHFSTQK (0) 3297385
3295355 LADIYGNVFSLQLGNRHFVFVNGLEIVKEVLIHHGENFLDRPKFPIISDHAKTL 3295194
3294395 GLVMSNGLPWKQQRRFALSTLRNFGLGKRSLEERIQEESRFLAGAIENEK 3294246
3288794 GQPFDPHYQINNAVSNVICSITFGNRFDYHDSQFQKLLHLLNETGIIQRSIWAQ 3288633
3286768 LYNIFPALMKQLPGPHQTIFKNWEQLKYFVRTIIKKHQENRNPLETRDFIDAYLNEMTK 3286592
3285518 FKENVSSSFHMENLLQSALDLFIAGTETTSTTLRWALLYMAIYPEIQ 3285378
3282591 ERVQSEIDSVIGQSRPPAMTDRDNLPYTNAVIHEIQRISNILPLNVPRLTTNNTEIAGFHLPK 3282403
3280566 GTILICNLTSVLFDKDEWDTPKKFNPNHFLSNGQFRIREAFVPFSA 3280429
3277387 GKRACLGERLARMELFLFFTALIQKFSFQAPKGVELSLDFKMSLTLSPNQYHICAVSR* 3277211

CYP2J       pig 
            BF191621.1, BX914614.2, BQ601924.1 
            85% to 2J30 cow
            possible end of 2J34 or 2J35
GQSQQPSIAARECMPYTNA
VIHEVQRMGNIIPMNVPREAAEGTTLAGYHLPKGTMVLTNL
TALHRDPAEWTTPDRFNPEHFLENGQFKKREAFLPFSIGKRACLGEQLARTELFVFFTSL
LQKFTFRPPDNEKLSLKFRMGLTLSPVTYRICAVPRA

2K Subfamily

CYP2K1      Oncorhynchus mykiss (rainbow trout)
            GenEMBL L11528 (1853bp) PIR S45644 (504 amino acids)
            Buhler,D.R., Yang,Y.-H., Dreher,T.W., Miranda,C.L. and 
            Wang,J.-L.
            Cloning and sequencing of the major rainbow
            trout constitutive cytochrome P450 (P450 2K1): Identification
            of a new P450 gene subfamily and its expression in mature 
            rainbow trout liver and trunk kidney.
            Arch. Biochem. Biophys. 312, 45-51 (1994)

CYP2K1v2    Oncorhynchus mykiss (rainbow trout)
            GenEMBL AF045052
            Buhler,D.R.
            note: 98.6% identical to 2K1 may be an allele (5L1FL)
            submitted to nomenclature committee

CYP2K1v3    Oncorhynchus mykiss (rainbow trout)
            GenEMBL AF045053
            Buhler,D.R.
            note: 98.4% identical to 2K1 may be an allele (5L6FL)
            submitted to nomenclature committee

CYP2K2      Fundulus heteroclitus (killifish)
            John Stegeman
            submitted to nomenclature committee

CYP2K3      Oncorhynchus mykiss (rainbow trout)
            GenEMBL AF043551
            Buhler,D.R.
            (5L7FL) 96.5% identical to 2K1

CYP2K4      Oncorhynchus mykiss (rainbow trout)
            GenEMBL AF043296
            Yang,Y.-H., Andersson,T.B., Ryu,B.-W., Wang,J.-L. and Buhler,D.R.
            CYP2K4: A New Cytochrome P450 Isoform from Male Trunk Kidney of
            Post-Spawning Rainbow Trout.
            Unpublished
            kid8 from kidney

CYP2K5      Oncorhynchus mykiss (rainbow trout)
            GenEMBL AF151524
            Buhler,D.R.
            80% identical to 2K1
            clone name KM2-2 from sexually mature male trunk kidney library

CYP2K6      Danio rerio (zebrafish)
            No accession number
            Wang-Buhler, J.L., Yang, Y.H., Lee, S.J. and Buhler, D.R.
            Submitted to nomenclature committee 6/16/2000

CYP2K7      Danio rerio (zebrafish)
            GenEMBL AI722500 EST 88% to CYP2K6
Full length translation of this EST allowing frameshifts
INNLFGAGXDTTVTTLRWGLLLFAKYPEIQAKVHDEIDSVIGERQPVPDDRKNLPYTDAVIHEIQRFADILPIG
LLRQTSCDVHLNGYLIKKGTSVFPLIASVLRDENEWETPDSFNPKHFLNKQGQFVKKDAFMPFGAGRRLCIGES
LARMELFLFFTSLLQHFCFTPPPGVSEDELDLTPVVGFTLSPMPHKLCAVKRF*

CYP2K7      Danio rerio (zebrafish)
            No accession number
            Donald R. Buhler
            EST AI722087 fd19b07.y1, AI722500 fd19b07.x1, BF157099 fl60g01.y1
            Submitted to nomenclature committee 2/10/2001
            503 amino acids, 76% to 2K6, 59% to CYP2K4, CYP2K5 

CYP2K8      Danio rerio (zebrafish)
            No accession number
            Yea-Huey Yang, Jun-Lan Wang-Buhler and Donald R. Buhler
            EST 78% to CYP2K5 clone name F2R
            Submitted to nomenclature committee 7/1/2000

CYP2K9      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_12487
3037 MIEDLFESSTSGFLMVAIVSLLLLQ
     LCFSFISREKRKDLPGPEALPLLGNLHQLDLKRLDCHLVQ 3231 (0)
3299 LSQKYGPIFRVYLASKKVVVLAGYTAVKQALVNQAEDFGEREIFPIFHDFNKGN 3460 (1)
3527 GILFTNGDQWKEMRRFALMTLKDFGMGKRTIEEKIIKECQYLIEAFEQHQ 3676 (1)
     GEAFSNAQVISYATSNIISAIMYGRRFDYKDPTFQAMIERDHEVIHLTGSPSIQ (0)
     IYNIFPWLGPFLKTWRYIMKKVEINIESTRRIIGEMKETRNP
     GTCRCFVDAFLIHKENQE (0)
4483 ESDVNAHYYHEDNLLHCAMNLFGAGTDTTATTLQWGLLYITKYPHIQ 4623 (1)
4692 DGVQEELRRVVGNRQVRVEDRKNLPYMEAVIHETQRMANIVPMSLPHRTS*DTFQGYVIKK (0?)
     GTMVIPLLTSVLYDESQWEKPHTFNPAHFLDDEGRFVRRDAFMPFSA 5095 (1)
5164 GRRMCLGEGLARMELFLFFASLLQHFRFKPAPGVSEDSLDLTPVVGITLNPLTHKLRAISRF* 5352


CYP2K10     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_19693
296  SKKYGPVFKVHFGPRKVVVLAGHKTVKEALVGNAEQFGDRDISPIFYDMNQGHG 457 LKU76565.x1
     missing exon 3 and part of exon 4
2727 YATSNIISSIVYGSRFDYDDPRFINMVNRVNEVIRLTGSAPIQ (0)
     LYNIFPGLANWIKNRQLLLKQVAMNLRDMTDLIQQLKDTLNPGVCRGFVDCFLLRKQKAV (0)
2184 DSGVIDSLYNEKNLLYSLSNLFGAGTDTTATTLRWGLLLMAKYPRIQG
     QVQQELSMVVGNRRVCVEDRKNLPYVDAV 1813
1812 IHEIQRLGNIAPMAVPHKTARDVEFRGYFIEK 1717
1286 GTTVFPLLTSVLYDENEWETPHTFNPSHFLDKDGKFIKRDAFMPFSA 1146
1063 GRRLCLGEGLAKMEIFLFFTSLLQQFRFTPPPGVGEDELDLTPVVGFTLSPSPHKLCAIPRQ* 


CYP2K11     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_10791
missing exons 1-4 about 176 aa
5891 VYDLFPWIGPLVNNKKLFQSLFAANKKQNLQLFAAAKEMLNPQMCRSFVDSFLARQQILE 5721 (0)
4989 KSGTNVHFHDENLMSTVMNLFNAGTDTTATTLRWGLLLMAKYPLIQ (1?)
4750 DQVQEELRRVIGSRQVQVEDRKSLPFTDAVIHETQRLANIVPMALPHKTSQDVTLQGFFIEK 4571 (0)
     GTTVYPLLTSVLYDETEWEKPLNFYPAHFLDKDGKFVKREAFLPFSA 4355 (1)
4287 GRRICLGEGLAKMELFIFFSTLLQHFRFRPPPGVSEDHLDLTPRVGLTLNPSAHKLCAVSCL* 3999

CYP2K12P    Fugu rubripes (pufferfish)
            No accession number
            Scaffold_3103 Length = 27036 59% to scaf 10791 
Heme junction missing the conserved Gly, no upstream seq found 
With these defects and a frameshift this is probably a pseudogene 
LKB99171.x1 50% to 2C37
17897 DQVQEELSRVIG 17862 frameshift
17860 SRQVQEGDRKNLSFTNAVIHETQSGHVALTSLPHVTNQDIIFRGHFLKKG 17711 (1)
17388 NYMEDTASVASVLLEETEWEHPHTFYPSHFLEKDRKFVKRDAFLPFSA 17242 (1)
17176 ISRACPGETLARVELFIFLVTLLQHFCFTLAPGVSPDELHVTPSIGSNHSPVAYRLCTVSCM* 16988

CYP2K13P    Fugu rubripes (pufferfish)
            No accession number
            Scaffold_12487 
pseudogene frag of 2K9
660 VRVEDRKNLPYMEAVIHETQRMANIVPMSLPHRTSXX frameshift
    RHLPGIRHQK 

CYP2K14P    Fugu rubripes (pufferfish)
            No accession number
            Scaffold_13436b Length = 4942 
pseudogene of 2K9
= LGW56404.x1 50% to 2A7 
two partial genes in this contig both on minus strand
Scaffold_13436b pseudogene of Scaffold_12487; (fs) = frameshift
3958 VRVEDRKNLPYMEAVIHETQRMANIVPMSLPHRTSRD (fs) RHLLX (fs) GIRH (fs) QK (1?)
     alternative frame                  RDTSFSGDTSSKRFTALFELAHVYV
     GTMVIPL (fs) LTSVLYDESQWEKPHTFNPAHFLDDEGRFVRRDAFMPFSA (1)
     GRRMCLGE (deletion 3 nuc) RMELF (insertion 12 nuc) LFF (deletion 33 nuc)
     VSVDSLDLTPVVGITLNPLTHNLRAISRF* 3368

CYP2K15P    Fugu rubripes (pufferfish)
            No accession number
            Scaffold_13758
pseudogene 
41% to LKB99171.x1 50% to 2C37 Length = 5303
FC:C094J16aF1, FC:C007E01aF1 pseudogene
740 KGRITQRHFHDEKLMMTVSSHLAAGTHLDTYTALRQEPLVMAK*PEVQ 883 exon 6 (1) 52% to 2K11
    Exons 7 and 8 deleted
1284 (1) GLRSCPGEG*SRMKLFIFIVILLQHLCFSSSPVLMEEDLELKTVLGSILNPINCVLFVGRER* 1472 exon 9 48% to 2K9

CYP2K16 seq.c Danio rerio (zebrafish)
          ctg12742 68% to 2K8
57491 MAFLDALLHVSSTGTLICFLLLLLVAYLLFLRSQSDENEPPGPKPLPLLGNLLMLDVNKPHLSLCE 57294
52779 MAKQFGPVFKVYFGPKKVVVLAGYKAVKQALVNYAEAFGDREIMPLFHDFTKGH 52618
52022 GIIFANGESWREMRRFALTNLRDFGMGKKKIEEKIIEETCHLREEFEKFX 51876
50840 GKPFETAQLMNYAASSVISSIVYGRRFEYTDPQLRTMVDRANESVRLSGSASVQ 50679
50581 LYNMFPFLGPLLKNWRQLMKNLHLDIEEISELVNGLHQTLNHQDLRGFVDSFLVRKQX 50411
50317 DQDSGEKDSHFHEQNLIYTVGNLFVAGTDTTSTTLRWSLLLMAKYPHIQ 50171
43796 DRVQEEIDQVIGGRQPVSEDRKNLPYTDAVIHETQRLANIVPMSIPHMTSSDITFNGYFIKK 43614
43440 GTCIFPLLTSVLWDEDEWETPHIFNPNHFLDEQGRFVKRDAFMPFSA 43300
42178 GRRICLGESLARMELFLFFTSLLQYFRFTPPPGVSEDELELTPAVGFTLNPIAHKLCAVKR 41996

CYP2K17 seq.d Danio rerio (zebrafish)
           ctg12742 BI427723 zfishC-a1846d04.p1c zfishC-a1146b02.p1c
66780 MAVVESLLHFSSAGTLLGTLLLLLVFYRLSRDSEFQKKRKDPPGPKPIPLLGNLLTLDLSRPFDSLCE 66577
63586 LSKTYGNVYQVFLGPKKVVVLIGHKTVKEALVNYADEFGERDITPIFRXXXXXX 63443
63238 GILFSNGESWKEMRRFAISNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFX 63092
62992 GKPFDTTQPVNYAVSNIISSIVYGSRFEYTDPRFTEMVDRANENIRVSGSVSMX 62834
62747 LYNIFPWLGLFLNSKRTVVRNMLKNRAEFMKLITGLQETLNIHDRRGFVDSFLIRKQX 62577
60380 XXXXGKKDSYFHAENLLMTVGNLFAAGTDTTGTTLRWGLMLMAKYPQIQ 60246
60158 XRVQEEIDRVIGGRQPVVEDRKKLPYTDAVIHEIQRLANIVPMNLPHVTSCDVTFNGYFIKK 59976
59893 GTTVIPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSA 59753
59675 GRRVCLGESLARMELFLFFASLLQSYRFTTPPGVSEDELDLKGTVGVTLNPSPHKLCAIKRF 59490

CYP2K18 seq.e Danio rerio (zebrafish)
           ctg12742 missing first two introns; exon 3 is duplicated. May be a pseudogene
93% to 2K19, 91% to 2K21 zfishK-a1004a03.p1c (100% over 29aa) also matches 2K19, 2K20
78359 MAVVESLLQFASTGTLLGALLLFLVLYLVSSGSESQKEGKEPPGPKPLPLVGNLLTLDLTRPFDT 78165
78164 FFKLSKTYGNVFQVYLGPEKAVVLVGYKTVKEALVNYAEEFGDREIGPGFSIMNDEH 77912
77911 GILFSNGENWKEMRRFALSNLADFGMGKRRSEEK 
75750 GILFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKF 75604
75522 GKPFDTTQPVNYAVSNIISSIVYGSRFEYTDPQFTEMVDRANENVRVGGSISMX 75364
75253 LYDIFPWLGPFLKNKRIIVENIIQSRVQMTKLITALLETLNPNDPRGFVDSFLIRKXX 75086
74916 XQKSGKKDSYFHEENLMMTVTNLFIAGTDTTGTTLRWGLMLMAKYPHIQ 74773
74686 XRVQEEIDRVIGGRQPVVEDRKKLPYTDAVIHEIQRLANIVPLSLPHRTTSDITFNGYFIKK 74504
74413 GTTVVPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVRRDAFMPFSA 74273
73457 GRRVCLGESLARMELFLFFTSLLQSYRFTTPPGVSEDELDLKGIVGITLNPSPHKLCAIRR 73275

CYP2K19 seq.f Danio rerio (zebrafish)
ctg12742 91% to 2K21 AF221128 (1 aa diff) zfishC-a678c11.p1c (near perfect)
90000 MAVVESLLQFASTGTLLAALLLFLVLYLVSSGSQKEGKEPPGPKPLPLLGNLLTLDLTRAFDTFFE 89803 (0)
89722 LSKTYGNVFQVFLGPRKTVVLVGYKTVKEALVNYAEQFGDREIGPGFRIMNDEH 89561 (1)
89232 GILFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFE 89086 (1)
89004 GKPFDTTQPVNYAVSNIISSIVYGSRFEYTDPQFTEMVDRANENVRVGGSVSMW 88843 (0)
88707 FHEMFPWVGPFLKSKRIIVENIIQSRAQMTKLITALLETLNPNDPRGFVDSFLTRKLSDE 88528 (0)
88365 KSGKKDSYFHEENLIMTVTNLFVAGTDTTGTTLRWGLMLMAKYPQIQ 88225 (1)
88137 DRVQEEIDRVIGGRQPVVEDRKKLPYTDAVIHEIQRLANIVPLSLPHKTTSDITFNGYFIKK 87952 (0)
87861 GTTVVPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFIPFSA 87721 (1)
84188 GRRVCLGESLARMELFLFFTSLLQSYRFTTPPGVSEDELDLKGIVGITLNPSPHKLCAIRRS* 84000

CYP2K19 Danio rerio (zebrafish)
        GenEMBL AL919697
        Tseng, H.-P., Wang-Buhler, J.-L., Hu, C.-H., 
        Hseu, T.-H., Peng, J.R. and Buhler, D.R. 
        Submitted to nomenclature committee Oct. 14, 2004
        JR8

CYP2K20 seq.g Danio rerio (zebrafish)
         ctg12742 88% to 2K19 and 2K21 zfishC-a1699d01.q1c (100% over 57aa)
AF221128 (1 aa diff) zfishC-a678c11.p1c (near perfect) zfishC-a1101c09.q1c (100% over 39aa)
104280 MAVVESLLQFASTSALLGALLLLLVLYLASSGSTSQKEGKEPPGPKPLPLVGNLLTLDLTRSFDTFFE 104077
103997 LSKTYGNIFQVFLGHRKTVVLVGYKTVKEALVNYAEVFGDREIGPGFKXXXXX 103854
102358 GILFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFX 102212
102123 GKPFDTTQPVNYAVSNIISSIVYGSRFEYIDPRFTEMVARANENVRVGGSFSMX 101965
101852 IYNIFPWLGPFLKNRAVVVKNITQNRAEKKKLITALLETLNPHDPRGFVDSFLIHKXX 101685
101522 XQKSGKKDSYFHEENLMLTVANLFAAGTDTTGTTLRWGLMLMAKYPHIQ 101379
101300 DRVQEEIDRVIGGRQPVVDDRKKLPYTDAVIHEIQRLANIVPLSLPHRTTSDITFNGYFIKK 101118
 97108 GTTVVPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSA 96968
 92153 GRRVCLGESLARMELFLFFTSLLQSYRFTTPPGVSEDALDLKGIVGITLNPSPHKLCAIRR 91971

CYP2K21 seq.h Danio rerio (zebrafish)
        ctg12742 91% to 2K19 zfishB-a619a12.q1c (near perfect)
112093 MAVVESLLQFASTGTLLGALLLFLVLYLVSSGSGSQKEGKEPPGPKPLPLLGNLLTLDLTRAFDTFFE 111890
111821 LSKTYGNIFQVYLGPKKTVVLVGYKTVKEALVNHAEAFGDREIGPSFRIMNDXX 111666
109983 GIVFSNGENWKEMRRFALSNLRDFGMGKRGSEEKIIEEIHHLKGEFDKFX 109837
109744 GKPFDTTEPVNYAVSNIISSIVYGSRFEYTDPQFTEMVDRANENVRVGGSISMX 109586
109441 LYNMFPWLGPFLKNKRIVVRNIIQSRAQMTKLITALLETLNPNDPRGFVDSFLIHKXX 109274
109110 XQKSGKKNSYFHNENLMMNVANLFVAGTDTTGTTLRWGLMLMAKYPQIQ 108967
108879 XRVQEEIDRVIGGRQPAVEDRKKLPYTDAVIHEIQRFANIVPLNLPHTTSCDITFNGYFIKK 108697
108484 GTTVIPLLTSVLKDESEWEKPNSFYPEHFLDEKGQFVKRDAFMPFSA 108344
107905 GRRICLGESLARMELFLFFTSLLQSYRFTTPPGVSEDELDLKGIVGITLNPSPHKLCAIRR 107723

CYP2K21-de1 seq.i Danio rerio (zebrafish)
        ctg12742 pseudogene, partial exon 1?
113358 MAAVETLLQFASTGSLLSALLLLLVWYLVSSESTYQKKGKEPPGPKPLPLLGNLLT 113191 

CYP2K22 Danio rerio (zebrafish)
         ctg11670 zfishC-a643a08.p1c missing exon 6; greater than 95% to 2K7. 9 aa diffs
in the first exon, only 3 aa diffs in the rest
33920 MALVAALLPGLGFTVSTILAFLLLFLVISYFFSSKDKGKYPPGPKPLPVLGNLHILDLKNTYMSLWK 34120
37393 LSKQYGPVYTVHMGPRTVVVLSGYKVVKEALVNLSEEFGERDISPIFQDFNEGY 37554
37635 GIVFSNGENWKEMRRFALSNLRDFGMGKKRSEELITEEIKYLKEEIERFX 37781
39367 GKPFETKLPLAMAISNVIALIVYSIRFEYNSPKFHRAIVRANENAKLVGSPSVQ 39528
42486 LYNMFPWLRLFVANQKRVVDNVQESFKQIGEIVNGLKKTLNPQSPRGIVDKFLIQQQK 42659

45851 AKVHDEIDSVIGERQPVPDDRKNLPYTDAVIHEIQRFADILPIGLLRQTSCDVHLNGYLIKK 46036
46115 GTSVFPLIASVLRDENEWETPDSFNPKHFLNKQGQFVKKDAFMPFGA 46255
49040 GRRLCIGESLARMELFLFFTSLLQHFCFTPPPGVSEDELDLTPVVGFTLSPMPHKLCAVKRF 49225

CYP2K23    Gasterosteus aculeatus (three-spined stickleback)
           UCSC browser Chr XI (-) strand 9794341-9797707
           Joanna Wilson and students
           submitted to nomenclature committee Nov. 6, 2007
           61% to Fugu 2K11, 65% to 2K10
MSLFGDFVVYLCSSTSTFLGAVVLLLVLYLVSNSLTRRELRKVPPGPSPLPLLGNLLQLDLKRPYVTLCELSKKH
GSVFTVYLGTSRVVVLAGYKAVKEALVNHREEFGDRDISPIFYDLNHGHGILFANGESWKEMRRFALTNLRDFGM
GKQLSEHKILEECQYLMEVFEKHQGTEFIYTASPVNYATSNIISAIVYGSRFEYNDPQFMSMVERSNESISVVGS
VQIQLYNMFPKLVSWTKKRQLLLNNLTRTVRDVKELILHLKDTLHPQFCRGLVDCFLIQMQKDEEARVNTHYNEK
NLIFTVTNLFSAGTDTTATTLRWSLLLMAKYPHIQDQVQEELSRVVGSRQVRVEDRRNLPYTDAVIHETQRLANI
VPLAIPHKTSRDVTFQGFFISAGTTVIPLLTSVLRDESEWESPNSFNPSHFLDTEGKFIRRDAFMPFSAGSRACP
GESLARMELFLFFTSLLQRFRFTPPPGVKEDDLDLTPAVGFTLTPSPHELCAVSCEGIQNEKII*

CYP2K24    Gasterosteus aculeatus (three-spined stickleback)
           UCSC browser Chr XI (+) strand 9720129-9723291
           Joanna Wilson and students
           submitted to nomenclature committee Nov. 6, 2007
           59% to 2K10
MLMLEDLFLSYVTVALMLVLMCILVSLFFRSKDKRREPPGPQPLPLLGNLLQMDLKRLDRSLVD (0)
LSKKYGSVFTVHLGPQKVVVLAGYKTVKQALVNHAVEFGERRIPQFGNDLMLSDSYR (2)
KGIFFANGESWKEMRRFALSNLKDFGMGR
KAAEDKIIEEIQYLIEVFERHE (1)
GQPFSTGQPMNYAVSNIICSIVYGSRFEYRDKDFKLMVDRANENIQLAGS
PSVLLFDMYPGIFHWASNRMRLKRNVFENHKRIKQLIGHLQETFNVELCRGFVDSFLAQKKKLEDSGITDSYYNI
ENLVSTVGNLFSGGTDTTSSTLRWGLLLMAKYPRIQYQVQEELSRVVGSRQVRVEDRRNLPYTDAVIHETQRLAN
VVPLAIPHKTSQDVTFQGFFIKGGTTVFPLLTSVHHDESEWESPNSFNPSHFLDTEGKFIRRDAFMPFSAGRRAC
PGESLARMELFLFFTSLLQLFRFTPPPGVKEDDLDLTPVVGFTLTPSPHELCAVSREGIQNE*

CYP2K25    Gasterosteus aculeatus (three-spined stickleback)
           UCSC browser Chr XI (-) strand 9676173-9679867
           Joanna Wilson and students
           submitted to nomenclature committee Nov. 6, 2007
           59% to Fugu 2K10, 52% to 2K8 Danio
MENLFLQLNSTTILLGTVGILLLLYVFLTNFDHKRKEPPGPRPLPLFGNLLHLNLKSFHMTLYELSKKYGSVFSV
HLGPQKVVVLAGYKTVKQALVNHAVEFGERYVSPTGHDLSNGIVFGNGESWKEMRRFALTNLRDFGMGKKAAEDK
IIEEIQYLFEVFDRHQGQPFNTGQSMNYAVSNIICSIVYGSRFEYSDEEFRLMVDRVNYNIRLAGSPSAKLFDMY
PWLFQWTSNRKRLTRNVTENRNQIKRLIGRLQETLNVHMCRGFVDSFLAHKQKLEDLKITDSHYNMENLVSTVSN
LFAAGTNTSGTTLRWGLLLMAKYPHIQGKVQEELSRVVGNRQVRAKDRMNLPFADAVIHETQRFANVLPVTIAHK
TSTDVTFQGYFIKKGTTVFPLMTSVLWDESEWETPRTFNPAHFLDKDGKFFKRDALMPFGAGRRACPGESLARME
LFLFFTSFLQRFRFTPPPGIKEDDLDLTPAVGLTLAPSPHELCAVSREGIQNE*

CYP2K26    Gasterosteus aculeatus (three-spined stickleback)
           UCSC browser Chr XVIII (-) strand 12862313-12864957
           Joanna Wilson and students
           submitted to nomenclature committee Nov. 6, 2007
           73% to Fugu 2K11 see EST DN708008.1
MGIVDQVLESSSSASLLGVLLVLLLVYLASSFSLGSPKDRKEPPGPTPLPLIGNLLQLDLKRPYNTLLKLSKKYG
SVFTVYMGPEKVVVLAGYKTVKEALVNRAEEFGDRQAMLIIREFNQGHGVIWSNGDSWKDMRRFALTNLRDFGMG
KRASEDKIIEECEHLIEVFKKHK (1)
GEPFDTTQPMNYAVSNIICSIVYGSRFEYDDPQFTSLVDRTNRTIQLV
GSPSIQLYNLFPWIGKWIANRNEVETLITANKKQNLQLFSRLKETLNPLMCRGFVDAFLVRKQNLEESKNTNSHF
NDDNLMQTVLNLFAAGTDTTATTLRWGLLFMVKNPKIQ  (1 GC boundary)
DRVREELSEVVGSRQVQVEDRKKLPFTDAVIHETQRLANIVP
MAIPHKTTQDVTFQGHFIKKGTTVFPLLTSVLYDESEWEEPHSFHPAHFLDADGKFIKRDAFMPFSAGRRVCLGE
SLARMELFIFFSTLLQRFRFTAPPGVSVEDLDLTPRVGFTLNPSTHKLCAVPCV*
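
Several entries above end an exon line with a number in parentheses, such as (0), (1) or (2), sometimes followed by a short note such as "GC boundary". The listing does not define this tag; reading it as the phase of the intron that follows the exon is an interpretation. Under that assumption, a small Python sketch for separating the peptide from the tag could look like this (the name `split_phase` is hypothetical, and only lines without flanking coordinates are handled):

```python
import re

# Peptide followed by "(0)", "(1)" or "(2)", optionally with a note inside the parentheses.
PHASE_TAG = re.compile(r"^(?P<pep>[A-Za-z*]+)\s*\((?P<phase>[012])[^)]*\)\s*$")

def split_phase(line):
    """Return (peptide, phase); phase is an int, or None if the line has no tag."""
    text = line.strip()
    m = PHASE_TAG.match(text)
    if m:
        return m.group("pep"), int(m.group("phase"))
    return text, None

# Three lines taken from the CYP2K26 entry.
exon_lines = [
    "KRASEDKIIEECEHLIEVFKKHK (1)",
    "NDDNLMQTVLNLFAAGTDTTATTLRWGLLFMVKNPKIQ  (1 GC boundary)",
    "DRVREELSEVVGSRQVQVEDRKKLPFTDAVIHETQRLANIVP",
]
for line in exon_lines:
    print(split_phase(line))
```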

CYP2K27    Oryzias latipes (medaka)
           chr8:11128109:11132739: (-) strand
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           66% to Fugu 2K10
MDLLMPLVSSPTTVIGAVFLLLVLYLASAGSTSRDLGKDPPGPRPLPLLGNLLQLDPRRPHKALCELSKSYG
PVFTVYFGIQKVVVLAGYKTVKEALVNNAEEFGDRDITPMFQDMNKGHGILFANGESWKELRRFALTTLRDFGMG
KRIAEEKILEECDYLIQGLEKHQGRKFDLTCPLNYATSNIISSIVYGSRFDYDDPRFRNLVSRANETIRINGHPL
THLYNMFPRWFRWIKNRKIILNNVEMTVKDVKDLVKHLKETLNPSVCRGFVDCFLIKKQKEEDSCVKESHFTEQN
LVFSVSNLFAAGTDTTATTLRWGLLLMAKYPHIQDKVHEELAKVLGGRQVRVDDRKNLPYADAVIHEIQRVANII
PMSIPHKTNRDVTFHGYLIQKGTTVIPLLASVLNDENEWESPHTFNPHHFLSKEGKFVKRDAFMPFSAGRRACLG
ESLAKMELFLFFTSLLQRFHFTPPPGVSEEELDLTPAMGFVLAPSSHELCAVSLQ*

CYP2K28    Oryzias latipes (medaka)
           Chr8: 11120126:11125947: (-) strand
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           62% to Fugu 2K19
MIQYIFRFMPASVSLMWVIVGVLVLLFLYFQLSFFNWREPPGPRPLPLLGNLFQVDLKRLDQSLFDLSKKYGPVF
VVNFGPKKVVVLAGYRTVKQALVNQAKEFGNREVTPIFYDFNKEHGILFANGESWNEMRRFALSTLRDFGMGKRI
SEQNIIEECRWLIEELEKLQGKPFDNTHTISYAVSNVLSGLMFGKRFDYQDPLLQAIVDRDNEIIYLTGTVSILL
YNMFPWLGPWLKNWKTLMKNMEAAKTDMKKIIAELKDTLDPDTRRCFVDAFLTQKQNLKEVNGSHYHDDNLLYTV
MNLFAAGTDTTATTIEWCLLFMAKYPHIQERVQEELNWVVGSRQVRIEDRKNLPFTDAVIHESQRLANIAPMAIP
HTTSKDVTFQGYFIKKGTTVLPLLTSVLYDESEWESPRTFNPSHFLDKEGKFLKRGAFMPFSAGRRVCLGESLAR
MDIFLFFTSLLQHFSFTPPPGVSEDELDLTPVVGFTLSPQPQGLCAVRRQ*

CYP2K29    Oryzias latipes (medaka)
           Chr24: 11283779:11289362: (+) strand
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           68% to Fugu 2K11
MQILDFFQSYSSVSLVGILAVLVLYFISQFIFNSEQHGQEPPGPRPLPIIGNLMQIDLKRPYKTLEEFSKTYGPV
FTVFFGGEKVVVLAGYKTVKNALVNHDEEFGERAIPPIIQELNKGLGVLWSNGDIWRDIRRFALTNLRDFGMGKK
ACEDKITEECQYLLEVFKKFKGNAFDTTKPLNYAVSNIICSMVYGSRFEYDDPKFTSMVDRTNRNIQLSGSPTLQ
AYNMVPWLFKWVASRREVHECAAANRKQNQSIFSHLKETLNPQMCRGFVDAFLVKGQTLEKSGVTNSAFNDENLL
MTVIHLFAAGTETTSTTLRWGLLLMAKYPKIQDQVQDELRRVIGDRMVQVSDRKNLPFTDAVIHEIQRLASIVPT
ALPHKTSKDVTFQGYFIKKGTTVFPLLTSVLHDANEWEKPHTFYPAHFLDKDGKFVKREAFIPFSAGRRICLGES
LARMELFMFFTTLLQNFCFTPPPGVSKEELSLTPCGGITVGPVPHKLCAVPCSE*

CYP2K30    Oryzias latipes (medaka)
           Chr24: 11290118:11301397: (+) strand
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           63% to Fugu 2K11
MGVWDTLLPSLSPSSLLGAGVLLLLVFLFCPHRTSSQKHRKEPPGPTPIPILGNLHQLDLKRPDQTFMKFAKKYG
SVFTVYMGPKKTVVLTGYKTMKEALVNYAEEFGEREAPTVAKEAHLDCGVVWANGASWREMRRFALSTLRDFGMG
KRACEDKIIPECHSLLKEIRKFQGEAFDPTLIINSAVCNVICSMVYGTRFEYDDPDFRTILSRTMKGIQLLGSPG
VQLHNLFPRIGRLFLSASKQINQIFTANKNYHLKLLKETFTPHTCKSIADAFQLRQQEEDGFPNSHFHDANILVT
IMNLFTAGTETTAATLRWALLFMAKYPKIQDQVQEELSRVMEGRQVTVEDRQRLPFTDAVIHETQRKANIIPLSL
LHRTSQDVTFKGFFIEKGTTVIPVLTSVLYDENEWEKPNIFYPAHFLSKDGKFLKRDAFMPFSAGRRLCLGESLA
RMELFLFFSTLLQHFRIAPPLGVSEEELDLTPRPGGTLSPQPHKLCLVSLK*

2L Subfamily

CYP2L1      Panulirus argus (spiny lobster)
            GenEMBL U44826 (1601bp)
            James, M.O., Boyle, S.M., Trapido-Rosenthal, H., Carr, W.E.
            and Shiverick, K.T.
            cDNA and protein sequence of a major form of P450, CYP2L,
            in the hepatopancreas of the spiny lobster Panulirus argus.
            Arch. Biochem. Biophys. 329, 31-38 (1996)

CYP2L2      spiny lobster
           no accession number
           Sean Boyle and Margaret O. James
           submitted to nomenclature committee

2M Subfamily

CYP2M1      Oncorhynchus mykiss (rainbow trout)
            GenEMBL U16657
            Yang,Y.H., Wang,J.L. and Buhler,D.R.
            cDNA cloning and characterization of a novel cytochrome P450 from rainbow 
            trout.
            Abstracts of the VII International Congress of Toxicology, 
            Vol. 7, No. 1, 10-P-2 (1995)

            Yang,Y.H., Wang,J.L., Miranda, C.L. and Buhler,D.R.
            CYP2M1: cloning, sequencing, and expression of a new
            cytochrome P450 from rainbow trout liver with fatty acid
            (omega-6)-hydroxylation activity.
            Arch. Biochem. Biophys. 352, 271-280 (1998)
            Note: 42% identical to CYP2K1

2N Subfamily

CYP2N1      Fundulus heteroclitus (killifish)
            John Stegeman
            submitted to nomenclature committee

CYP2N2      Fundulus heteroclitus (killifish)
            John Stegeman
            submitted to nomenclature committee

CYP2N3      Stenotomus chrysops (scup)
            No accession number
            Agnes Knorr, Andrew McArthur, John Stegeman
            Submitted to nomenclature committee Nov. 3, 2000
            73% to 2N1

CYP2N4      Chaetodon mertensii (butterfly fish)
            No accession number
            Bryan DeBusk
            Submitted to nomenclature committee July 19, 2001

CYP2N5      Chaetodon punctatofasciatus (butterfly fish)
            No accession number
            Bryan DeBusk
            Submitted to nomenclature committee July 19, 2001

CYP2N6      Chaetodon auriga (butterfly fish)
            No accession number
            Bryan DeBusk
            Submitted to nomenclature committee July 19, 2001

CYP2N7      Chaetodon xanthurus (butterfly fish)
            No accession number
            Bryan DeBusk
            Submitted to nomenclature committee July 19, 2001

CYP2N8      Chaetodon plebeius (butterfly fish)
            No accession number
            Bryan DeBusk
            Submitted to nomenclature committee July 19, 2001

CYP2N9      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_3261a
9342 MWLWDLVLWLRLTGFLLPVLIVLLIIMYSLRQKDPPNFPPGPPALPLLGNIFNIEAKQPHLYLTK 9148 (0)
     LADVYGSVFCIRLGRHKTVFVSGWKMVKEAIVTQADSFVDRPYSPMATRIYSGNS LKG95403.y1
     AGLFFSNGHVWRKQRRFAMATLRSFGLANGSMELSICEESRHLQEAMESQK LKG95403.y1
8235 GEPFDPVPLLNNAVANIICQIVFGRRFDYTDHMFQRMLHHLTEMAYLEGSIWAL 8074 (0)
7991 LYDSFPALMKHLPGPHNGIFSSSSSLQGFIWREIQRHKSDLDPSNPRDYIDAFLIEEG 7818 (0)
7743 NGNNQLGFEERNLVLCCLDLFLAGSETTSKTLQWGLIYLIRKPHIQ 7606 (1)
     EKVQVEIDRPIGRTRQPTMADRPNLPYTDAVIHEIQRMGNIVPLNGPSNGCQGTRPWRGYFIPK (0)
     GTSVMPNLTSVLFDKNEWETPDTFNPEHFLDAEGKFVRREAFLPFSA 7246 (1)
7162 GRRACLGEGLARMELLLFFVSLCQRFHFSTLDRVELSTEGITGATRTPYPFKIYAQVR* 6986

CYP2N10     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_3261b
13883 MWLYSVLSWDFTSLLLFFFVLILFANYLKNRDPPNFPPGPFAFPIVGNFFTMDSKNLHLYFNK 13695 (0)
12557 LADVHGNVFSFRLGGDKMVCVSGHKMVKEAIVTQADNFVDRPYDPISARVYGGQT 12393 (1)
      DGLFQSNGEVWKRQRRFALSTLRNFGLGKNILEQSICEEAQHLLEEMRSHG 12153 (1)
      GKPFNPARLFNNTVSNIICQLVMGKRFEYSDHKFQMLLKYLSEVLVLEGSFWGQ 11913 (0)
11814 LYEAFPSVMKHLPGPHNKVFSHFNHLKDFMNEEIQNHKKDLDHNNPRDYIDAFIIEMEK 11638 (0)
      NKDTNLGFTETNLAMCSLDLFIAGTETTATTLLWDLVYLINNPDIQ 11413 (1)
11290 GKVQAEIDQVIGQNRQPTMADRPNLPYTDAVIHEIQRMGNIVPLNGPRMAAKDTTLGGYFIPK 11102 (0)
11018 GTSLMPILTSVLFDKNEWETPDKFNPGHFLDAEGKFKKREALLPFSA (1)
      GKRVCLGEGLAKMELFLFFVSLFQNFTFFVPGGAELNTEGITGTTRVPHPFEILARPR* 10619

CYP2N11     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_3261c
      MWPLQLLLDFDIRALLLFISVLLLIGDYFRYKNPPNFPPGPMSLPFVGSFFSVDSKHPHNYFIQ (0)
18495 MAELYGKLFSIRLGSGKIVFACGYKMVKEAIVTQADNFVDRPFNAFGDRIYMGQR 18331 (1)
18251 DGLFQNNGEVWKRQQHFALSTLRNFGLGKNILEQSICEEAQHLLEEMRSHG (1)
      GKPFDPASLFTRAVSNIICQLVMGKRFEYSDHKFQMLLKYLSELLVLEGSFWGQ 17859 (0)
      LYQAFPSVMKHLPGPHNKVFSHYNHLKDFMNEEIQNHKKNLNHNNPRDYIDAFIIEMEK (0)
17498 NKDTNLGFTETNLVLCSLDLFLAGTQTTATTLLWALVYLINNPDIQ 17364 (1)
16988 EKVQAEIDQVIGQTRQPTMADRPNLPYTDAVIHEIQRMGNIVPLNASRMAAKDTTLGGYFIPK 16800 (0)
      GTSLLPILTSVLFDKNEWETPDKFNPGHFLDAEGKFKKREAFLPFSA (1)
16492 GKRVCLGEGLVKMELFLFFVSLFQKFSYSVSGGAELSTEGITGITRVPHPFEIHTRPRSF* 16310

CYP2N12X    Fugu rubripes (pufferfish)
            No accession number
            Scaffold_3261d
            Renamed CYP2AD1
22960 XCLNIHTGIALSNGYMWKKQRKFAHTHLRYFGEGQKLLENHIQMESKFMCEAFKDEQ 22811 (1? Bad boundary)
22727 GKPFDPQYTITNAVGNIISALVFGHRFEYSDASFRRILELDNEAVVLAGSARTQ 22566 (0)
22482 LYDSFPSLMKHLPGPHQTVHANYGKITDFLKKEVDKHMEEWNPEDPRDYVDTYLSEMEK 22306 (0)
21959 MNQDPQGGFNVETLLICILDLIEAGTESAATTLRWGLVFILNYPDVQ 21819 (1)
21739 EKVQEEIDRVIGQSRQPAMADRPNMPYTDAVIHEIQRFANVVPAGFPKMATKDTTVGGYFIPK 21551 (0)
21462 GLAITTMLSSVLFDKNEWETPDVFNPNHFLDSEGRFRKRDAFIPFSA 21322 (1)
21218 GKRVCIGENLAKMELFLFFTSILQHFNLSPVPGQMPSLEGILGFTYSPQPFRMIVAPR* 21042

CYP2N13  Danio rerio (zebrafish)

CYP2N14  Micropterus salmoides (largemouth bass)
         No accession number
         Alex J. McNally
         submitted to nomenclature committee May 31, 2005
         74% to 2N10

CYP2N15  Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr VIII (+) strand 19111307-19114904
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         69% to 2N11
         see ESTs CD506195.1, CD504080.1, CD507761.1
         the genome assembly is missing the lower case region
MWLFHFLLGFDLKGLFLFMVVFFIIADIFKNRNPANYPPGPLSLPIVGNffsverkhphiyftk
LADIYGNVFSVRL
GRNKTVFVSGYKMVKEAIVTQADNFVDRPDNAMADRVYSGDSGGLFMSNGETWKRQRRFALSTLRSFGLGKSTME
QSICEEIRHLQEEIEKEKGEPFNPASLFNNAVSNIICQLVMGRRFDYCDHNFQSMLTYLCEILRLQGSVWGLLYD
SFPRVMKHLPGSHNKIFSHYDSLLDFMNKEVESHKKDLDHSDPGDYIDAFIIEMEKHNESDLGFTEANLALCSLD
LFLAGSETTSTTLLWALVYLMKYPDIQDKVQVEIDGVIGRSRQPSMADRPNLPYTEAVLHEIQRMGNIVPLNGAR
MATKHTTLGGYLIPKGTTVMPSLTSVLFDKTEWETPHTFNPGHFLGAEGKFVRREAFLPFSAGKRVCPGEGLAKM
ELFLFLVGLLQKFSFSVPDGVELSTEGITGVTRVPHPFKVYAKAR*
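
The CYP2N15 entry marks the part of the protein that is missing from the genome assembly in lower case. A minimal Python sketch for locating such lower-case stretches in a sequence string is shown below; the helper name `lowercase_runs` is an assumption for illustration.

```python
import re

def lowercase_runs(seq):
    """Return (start, end, text) for each lower-case run; 0-based, end exclusive."""
    return [(m.start(), m.end(), m.group()) for m in re.finditer(r"[a-z]+", seq)]

# First line of the CYP2N15 sequence as listed.
first_line = "MWLFHFLLGFDLKGLFLFMVVFFIIADIFKNRNPANYPPGPLSLPIVGNffsverkhphiyftk"
for start, end, text in lowercase_runs(first_line):
    print(start, end, text)
```

The same helper applies to entries where lower case marks residues borrowed from another species, since it only reports where the case changes.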

CYP2N16  Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr VIII (+) strand 19116076-19119924
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         77% to 2N9, 62% to Fugu 2N10
MSLCGFLLRFGPPEFLLLFFAFLLLVCFWAKKDPPNFPPGPPSLPFLGNIFNIESKQPHIYLTKLADVYGNVFCI
RLGRHRTVFVSGWKMVKEAIVTQADHFVDRPYSPMVTRIYSGNSGLFFSNGKVWRRQRRFAMSTLRTFGLANSSM
EQSICEESRHLQEALEKEKGEPFDPVPLINNAVANIICQIVFGRRFDYTDHNFQSMLRNLTDMAYLEGSIWALLY
DAFPAVMKHVPGPHNGIFRSSRSLEASIRAEIERHKLDLDPTNPRDYIDLFLIEEKHSKNRDLGFDEGNLVLCCL
DLFLAGSETTSKTLQWGLVYLIKSPHIQVQAEIDGVIGPTRHPTMADRPNLPFTDAVIHEIQRVGNVVPLNGLRM
AAKDTTLGGYFIPKGTSVMANLTSVLFDPAEWEKPDSFHPAHFLDAGGRFVRREAFLPFSAGKRACLGEGLARAE
LFLFFVTLLQKYHFTTLEGVELRGDGVIGATRTPHPFKVYAEAR*

CYP2N17  Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr XVI (-) strand 2228495-2232907
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         51% to Fugu 2N9, 71% to 2N12, see ESTs DT966028.1, DW631570.1

CYP2N18    Oryzias latipes (medaka)
           Chr4: 28082010:28087962: (-) strand
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           67% to Fugu 2N11
MWLDSFLLSFDLKALVLFIFLFLLIADWIKHRKPANFPPGPLGLPFVGNFLTIDGKHPHIYFSKMAESYGNVFSV
RLGSQATVFVSGYKMVKEALVTQAENFVDRPFSEIGGRFYEGNSNGLFFSNGEKWKKQRRFALSTLRTFGLGKNT
MEQSICEEIRHLQQQIENEKGGPFSPAGLFNNAVSNIICQLVMGKRFDYDDNNFQVMMKYISEAVQLEGSIWGIL
YESFPGLMKHLPGSHNKIFRNYKIVQDFLAQEIKIHKQDLDPNNPRDYIDSFIIEMEKHQNSDLGFNDANLAFCS
LDLFVAGTETTSTTLMWALIYLIKHPDVQVKVQQEIDRVIGQNRLPSMADRPNLPYTDAVVHEIQRIGNIVPLNG
LRVAAKDTTLGGYFIPKGTALMPMLTSVLFDKTEWETPDTFNPEHFLDADGKFVKKEAFLPFSAGKRVCLGEGLA
RMELFLFLVGLLQKFSFSVPEGVELSTEGITGTTRVPHPYKVYAKVR*

CYP2N19    Oryzias latipes (medaka)
           Chr4: 28070384:28074070: (-) strand
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           74% to Fugu 2N9
MWLCVWCQWCGLTGTLFFIFAVFFVLCLVKQKDPPHFPPGPPALPVLGNIFSIDSKQPHIYLTKLADVYGNVFCI
RLGRHKTVFVTGWKTVKEALVTQADNFVDRPYSPMVTRIYGGNSAGLFFSNGSVWKRQRRFAMTMLRTFGAAKSS
TEQSICEESRHLLEAMEMEGGEPFDPVPLLNKAVSNIICQIVFGRRFDYSDTDFQAMLTNLTDMAYLEGSVWALL
YDAFPALMKYLPGPHNSIFSSSKSLETTIRREINRHKQDLDPSNPRDYIDKFLMEERHNRKIHSGFEEENLVLCC
LDLFLAGSETTSKTLQWGLIYLITNPHIQDKVQAEMDRVVGHSRQPTTADRTNMPYTDAVIHEIQRMGNIVPLNG
LRMAAKDTTLGGYIIPKGTAVMPNLTSVLFDKTEWETPDNFNPEHFLDADGKLLRKEAFLPFSAGRRACLGEGLA
RMELFLFFVTLFQRFHFSAAAGVELRTEGIIGATRTPHPFQIIAKPR*

2P Subfamily

CYP2P1      Fundulus heteroclitus (killifish)
            John Stegeman
            submitted to nomenclature committee

CYP2P2      Fundulus heteroclitus (killifish)
            GenEMBL AF117342
            John Stegeman
            submitted to nomenclature committee

CYP2P3      Fundulus heteroclitus (killifish)
            GenEMBL AF117343
            John Stegeman
            submitted to nomenclature committee

CYP2P4      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_3261e
      MEAILSTLGLEWMDGRTILIFLLVFVLLADYIKNRVPSNFPPGPWPLPLIGDLHRINPSRLHLQFAE (0)
24760 FAGKYGNIFSLRLFGGRVVVLNGYKTVREALVEKGENFVDRPLIPLFEAFAGNR 24924 (1)
24994 GLVISNGNPWKHQRRFALHTLRNFGIGKKSLEPSIQQECHYLAEAFAQHKG 25156
      gap missing exon 4
26236 VYNTFPWLLKWLPGTHQTIFSEIKTVINFVDLKIQEHKRNFDPSSLRDYIDCFLAEMGE 26412 (0)
26493 KEDVESGFDMKNLSICTMDLFGAGTETTTTTLQWGLLYMIYYPHIQ 26630 (1)
seq runs off end of contig missing exons 7,8 and 9

CYP2P fragment Fugu rubripes (pufferfish)
                No accession number
probably exon 7 of 2P4 
Fc:c161F04y1 LPC.61739.y1 Fc:c161E03y1 LPC.61451.y1 60% to 2J9 
71% to 2P1
KVYAEISAVIGSSREPSITDRDNMPYTNAVIHEMQRMANIIPLNVVHMASSDTTIxxxxxxx

CYP2P fragment Fugu rubripes (pufferfish)
            No accession number
            Scaffold_2841 probably exons 8 and 9 of 2P4
80% to LPC61680  66% to 2D9 Length = 29344
LPC61680.x1 LPC22842.y1 LPC61776.x1 LPC61672.x1 Fc:c161P11x1 Fc:c161P09x1 
66% to 2D9 LPC61488.x1 64% to 2d9 Fc:c161O11x1 93% to LPC61680 
probably same gene 67% to CYP2K, upstream sequence runs off scaffold
62% to 2Z2 over 106 aa 59% to 2K10 over 108 aa 60% to 2N12 over 100 aa 
80% to 2P2
179 GTIIMPTLNSVLHDESMWETPHSFNPQHFLDQDGKFRKREAFLPFSA 319
442 GKRVCLGEQLARMELFLFFTSLLQRFSFSMADGEQPSLDFQLGGARFPKPYRLRAILR* 618

CYP2P5P    Fugu rubripes (pufferfish)
            No accession number
pseudogene fragment Fc:c060E24y1 LPC.22843.y1 
56% TO 2W1 PKG TO HEME 70% to scaf 2841 exon 8
GTIVVPTLNSVLPDESVWETPHSLDPPLFLDL*RXFRVREAFLPFFA

CYP2P6  Danio rerio (zebrafish)
ctg24224.g NEW 77% TO 2p9
1209157 MDLLHIYEWIDIKAVLFFACVFLLLSNYIQNKTPKNFPPGPWPLPIIGNLYHIDFNKIHLEVEK 1209348
1209657 LSEKYGSVVSVHLFGQRTVILNGYKQVKEVYIQQGDNVADRPELPMIHDIAGDN 1209818
1209977 GLVAPSGYKWKQQRRFALSTLRNFGLGKKSLEPSINLECHYLNEAISNEN 1210126
1210235 GRPFDPHLLLNNAISNVICVLVFGNRFDYSDHHFQTLLNNINEAMYLDGTIWAQ 1210396
1210482 LYNSHPRIMRLLPGPHKKNITLWNKVIDFARERVKEHRVDYDPSNPRDYVDCFLAEMEK 1210658
1210736 LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLSWSLLYMIKYPEIQ 1210876
1212110 AKVQEEIDRVIGSSRQPSVSDRDNMPYTNAVIHEIQRFGNIAALNLPRAAVKDIQVGKYLIPK 1212298
1212390 GTIVIGNLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1212527
1213310 GKRVCLGEQLARMELFLFFTSLLQHFTFSSPAGVEPSFNYKLGTTRAPKPFKLCAVSR 1213483

CYP2P7  Danio rerio (zebrafish)
ctg24224.h  81% to 2p9,  62% to 2P3 (Fundulus)
1214731 MDVLQFYKWLDIKTVLVFLVVFLFLSDYIRNKSPKNFPPGPWSLPFIGHIHHIEHKKVHLQFLK 1214922
1216466 FAEKYGKIFSIRLFGPRIVVLDGYKLVKEVYLQQGDNLADRPILPMFYDITEDK 1216627
1217670 GLIGSNGYKWKHQRRFALSTFRTFGLGKKSLEPSILLECSCLNDAFSNEQ 1217819
1217891 XPFDPRLLLNNAVSNVICALVFSNRFDYSDHHFQTLLKHINEVLYLEGTVWAQ 1218046
1218134 LYNFFPWLMRRLPGPHQKIFVLLNKVIDFVREKVNEHRVDYDPSNPRDYIDCFLAEMEK 1218310
1218399 LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLYWGLLYIIKYPEIQ 1218539
1218632 AKVQQEIDAVVGGSRQPSVSDRDNMPYTNAVIHEIQRMGNIVPLNVFRITVEDTQIGEYSIPK 1218820
1218907 GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1219044
1219144 GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGGTHSPQPYKLCAVPR 1219317

CYP2P8  Danio rerio (zebrafish)
ctg24224.i   90% TO 2p9
1221362 MDLWYLYEWIDIKSILIFLCVFLLLGDYIKNKAPKNFPPGPWSLPIIGDLHHIDNSKIHLQFTK 1221553
1221722 FAERYGNIFSFRLFGPRIVVLNGYNLVKEVYIKQGDNLADRPVLPLFYEIIGDK 1228152
1221974 GLILSSGYKWKHQRRFALSTLRNFGLGKKSLEPSINVECGFLNEAISNEQ 1222123
1222203 GRPFDPRLLLNNAVSNVICVLVFGNRFDYSDHHFQTLLKNISEAVYLEGSICNQ 1222364
1224317 LYNMFPWLMERLPGPHKTIITLWRKVTDFVREKVNEHRVDYDPSNPRDYIDCFLTEMEK 1224493
1224582 LKDDTAAGFDVENLCICSLDLFVAGTETTSTTLYWGLLYMIKYPEIQ 1224722
1224812 AKVQEEIDAVVGGSRQPSVSDRDNMPYTNAVIHEIQRMGNIAPINLARSTSEDTQIGNYSIPK 1225000
1225184 GTMVTSNLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1225321
1225421 GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKMGGTHCPKPFKLCAVPR 1225594

CYP2P8-de7,8  Danio rerio (zebrafish)
ctg24224.j  EXONS 7,8 pseudogene
1226868 PSVSDRDNMPYTNSVIHEIQSIGNIGPLNVFGITVK 1226975
1227088 GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFS 1227225

CYP2P9  Danio rerio (zebrafish)
ctg24224.k   98% (7 AA DIFFS) TO 2p9
1227637 MDLWYLYEWIDIKSILIFLCVFLLLGDYIKNKAPKNFPPGPWSLPIIGDLHHIDNSKIHLQFTK 1227831
1227991 FAERYGNIFSLRLFGPRIVVLNGYNLVKEVYIKQGDNLADRPVLPLFYEIIGDK 1228131
1228249 GIVLSSGYKWKHQRRFALSTLRNFGLGKKSLEPSINLECGFLNEAISNEQ 1228398
1228473 GQPFDPRLLLNNAVSNVICVLVFGNRFDYSDHHFQTLLKHINEAIYLEGGICAQ 1228634
1228798 LYNMFPWLMQRLPGSHKKVITLWKKVIDFIRQKVNEHKVDHDPLNPRDYIDCFLAEMEK 1228974
1229073 LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLYWGLLYMMKYPVIQ 1229210
1229290 AKVQEEIDRVVGGSRHPSVSDRDNMPYTNAVIHEIQRMGNIIPINVTRTTSEDIRIGKYSVPK 1229475
1229629 GTMVTSNLTSVLFDESEWETPHSFNPGHFLNAEGKFRRRDAFLPFSL 1229769
1229866 GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGATHCPQPYKLCAVPR 1230039

CYP2P9  Danio rerio (zebrafish)
        GenEMBL BC056816, NM_200620 61% to CYP2P3 
        zfishK-a583c07.p1c zfishC-a1218e09.p1ca
MDLWDLYEWIDIKSILIFLCVFLLLGDYIKNKAPKNFPPGPWSLPIIGDLHHIDNSKIHLQFTK
FAERYGNIFSLRLFGPRIVVLNGYNLVKEVYIKQGDNLADRPVLPLFYEIIGDK
GIVLSSGYKWKHQRRFALSTLRNFGLGKKSLEPSINLECGFLNEAISNEQ
GQPFDPRLLLNNAVSNVICVLVFGNRFDYSDHHFQTLLKHINEAIYLEGGICAQ
LYNMFPWLMQRLPGSHKKVITLWKKVIDFIRQKVNEHRVDHDPLNPRDYIDCFLAEMDK
LKDDTAAGFDVENLCICTLDLFVAGTETTSTTLYWGLLYMMKYPGIQ
AKVQEEIDRVVGGSRQPSVSDRDNMPYTNAVIHEIQRMGNIIPINVTRTTSEDIRIGKYSVPK
GTMVTSNLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFSL
GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGATHCPQPYQLCAVPR
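
The genomic CYP2P9 entry is annotated as 98% (7 AA DIFFS) to the cDNA-derived CYP2P9 sequence. Counts of this kind can be reproduced with a position-by-position comparison when the two sequences have the same length. The Python sketch below assumes no insertions or deletions, so no alignment step is needed; the function names are hypothetical. It compares only the last exon of the two CYP2P9 entries.

```python
def aa_differences(a, b):
    """List (position, residue_a, residue_b) for mismatches; positions are 1-based."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length for an ungapped comparison")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]

def percent_identity(a, b):
    """Percent of identical positions between two equal-length sequences."""
    diffs = aa_differences(a, b)
    return 100.0 * (len(a) - len(diffs)) / len(a)

# Last exon of CYP2P9: genomic contig entry vs. cDNA-derived entry.
genomic = "GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGATHCPQPYKLCAVPR"
cdna = "GKRVCLGEQLARMELFLFFSSLLQRFTFSPPAGVEPSLDYKLGATHCPQPYQLCAVPR"

for pos, g, c in aa_differences(genomic, cdna):
    print(f"position {pos}: {g} -> {c}")
print(f"{percent_identity(genomic, cdna):.1f}% identical over {len(genomic)} aa")
```

Running the same comparison over every exon and summing the mismatches gives the per-gene difference count; sequences of unequal length would need a proper alignment first.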

CYP2P10  Danio rerio (zebrafish)
         GenEMBL BC049521, NM_201511 84% to CYP2p9 zfishG-a2632g08.q1c
MDMFYFYEWVDIKSILIFLCVFLLLSDYIKNKAPKNFPPGPWSL
PFIGDLHHIDPNKIHLQFTEFAEKYGKIFSFRLFGSRIVVLNGYNLVKEVYTQQGDNL
ADRPTLPITSAIIGDNRGLVASSGYKWKHQRRFALTTLRNFGLGKKNLELSINFECGF
LNEAISNEQGRPFNPRLLLNNAVSNVICVLVFGNRFEYSDHHFQNLLNKINESVYLEG
SIFVHLYNMFPWLMQLLPGPHKKLITLWQRVTDFVREKVNEHRVDYDPSSLRDYIDCF
LAEMEKHKDDTAAGFDVENLCMCTLDLFVAGTETTSTTLYWGLLYMIKYPEIQAKVQQ
EIDAVVGSSRQPSGSDRDNMPYTNAVIHEIQRMGNIIPLNVVRTTSEDTRIEKYSIPK
GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFSLGKRVCLGEQLA
RMELFLFFSSVLQRFTFSPPAGVEPSLDFKMGFTRCPKPYKLCAVPR

CYP2P10  Danio rerio (zebrafish)
ctg24224.l   3 AA DIFFS TO 2P10
1232262 MDMFYFYEWVDIKSILIFLCVFLLLSDYIKNKAPKNFPPGPWSLPFIGDLHHIDPNKIHLQFTE 1232411
1233540 FAEKYGKIFSFRLFGSRIVVLNGYNLVKEVYTQQGDNLADRPTLPITSAIIGDNR 1233677
1233779 GLVASSGYKWKHQRRFALTTLRNFGLGKKNLELSINFECGFLNEAISNEQ 1233928
1234024 GRPFNPRLLLNNAVSNVICVLVFGNRFEYSDHHFQNLLNKINESVYLEGSIFVH 1234173
1237098 LYNMFPWLMQLLPGPHKKLITLWQRVTDFVREKVNEHRADYDQSSLRDYIDCFLAEMEK 1237274
1237383 HKDDTAAGFDVENLCMCTLDLFVAGTETTSTTLYWGLLYMIKYPEIQ 1237523
1237610 AKVQQEIDAVVGSSRQPSGSDRDNMPYTNAVIHEIQRMGNIIPLNVVRTTSEDTRIEKYSIPK 1237798
1239047 GTLVIGSLTSVLFDESEWETPHSFNPGHFLDAEGKFRRRDAFLPFSL 1239184
1239282 GKRVCLGEQLARMELFLFFTSVLQRFTFSPPAGVEPSLDFKMGFTRCPKPYKLCAVPR 1239455

CYP2P10-de9  Danio rerio (zebrafish)
ctg24224.m   3 AA DIFFS TO 2P10
1242741 MELFLFFSSLLYF 1242779
1242772 FTFSLPADVKPSLGYKMGAHTVP 1242840

CYP2P-se1 Danio rerio (zebrafish)
ctg24224.n solo exon (pseudogene)
1243476 MDMLHFYEWIDIKSILIFVCVFLLLSDFIKNKTPKNFPPGPWSLPIIGDIHHIDPSKLHLQLSE 1243667

CYP2P fragment  Atlantic salmon
                GenEMBL BI468047 EST00457 
                77% to CYP2P10
1   DPSSPRDFIDCFLNEIEKCEDDTRAGFNLENLSFCTLDLFVAGTETTSTTLYWGLLFMIN 180
181 YPEIQAKVQAEIDAVVRSSRQPSMEDRDSMPYTDAVIHETQRMGNIIPLNVSRMATKDTE 360
361 VGGYTIPKNTIVLGTLQSILFDESEWETPHTFNPGHFLDQEGKFRKRDAFLPFSLGKRVC 540
541 PXEQLAKMELFLFFTSLLQRFTFFSPPGVEPSL 639

CYP2P11  Micropterus salmoides (largemouth bass)
         No accession number
         David Barber
         Submitted to nomenclature committee 5/21/04
         73% to CYP2P3

CYP2P12    Oryzias latipes (medaka)
           chr4 28112615:28120754  (+)
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           61% to Zebrafish 2P10, 69% to CYP2P3
MEGITSVLGLEWVDTWTILIFLFVFLLLSDFLANRRPKNFPPGPHSLPFIGDLHRIQPARLHVQFTEFAEKYGNV
FSLHLLGERTVILNGYKQVKEALVQQGDDFVDRPTIPLFVDTIDNKGIVMSNGNSWKQQRRFALHTLRNFGLGKK
TMETYIQNECHYITQTFADKQGKPFDAQFLINNAVSNIICCLVFGERFEYSDQEYQKILRNLNDLLILEGSVSAM
LYNMFPWLMKRLPGPHQKIFSLTRKIIDFVKIKINEHKGNFDPSAPEDYIDSFLIEMEKVNKDSGFDIDNMCICT
MDLFLAGTETTTTTLYWGLLYMIYYPDIQGKVHAEIDAVIGSSRQPSMADKESMPYTDAVIHEIQRMGDIVPQGV
FRQANRDTTLDKYTIPKGTIIVPALHSVLHDESMWDNPHSFDPKNFLDKDGKFCKREAFNPFGAGKRVCLGEQLA
RMELFLFFTSLFQRFSFSAPTGEQLSLESRMGATRCPKPFRVIAAPR*

CYP2P13    Oryzias latipes (medaka)
           chr4 28123180:28130065 (+)
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           63% to Zebrafish 2P10, 75% TO CYP2P3
MEAITAVLGFEWIDSRSLLIFLFVFLLLSDYLANRRPKNFPPGPHSLPFIGDLHRINPSRLHLQLTEFAEKYGNV
FSLHLFGERAVILNGHKHVKEALVQRGDDFVDRPSIPLFEQFYSNKGIVVSNGYPWKQQRRFALHTLRNFGLGKK
TMEKYMQEECRYLTEAFGEYKVKPFNAQALINNAVSNIICCLVFGERYEYSDKQYQQILQDINEIMILQGGFAAQ
LFNSFPWLMKKLPGPHQKILTLLAKLIDFAKVKISEHKENLDPSSPKDYIDSFLIEMAQNENQESSFDISNLCMC
TLDLFIAGTETTTTTLHWGLLYMIYYADIQEKVQAEIDAVIGSSRQPSMADKENMPYTDAVIHEIQRMGNILPLG
VLRMASKDTTLDKYTIPKGTMIIPTLNSVLHDESMWETPHSFNPKHFLDKDGKFRKREAFNPFGAGKRVCLGEQL
ARMELFLFFTSLLQRFSFSAPAGEQPSLENRMGATRCPKPYRLCAVPR* 

2Q Subfamily

CYP2Q1      Xenopus laevis (African clawed frog)
            GenEMBL D50560 (2237bp)
            Ohi, H., Sugata, E., Fujita, Y., Saito, H., Saguchi, K., Murayama, N.
            and Higuchi, S.
            Cloning and expression analysis of a cDNA coding for a
            dexamethasone-inducible cytochrome P450 in Xenopus laevis
            Biochem. Mol. Biol. Internatl., 45, 689-697 (1998).
            Saito, H., Ohi, H., Sugata, E., Murayama, N., Fujita,Y. and Higuchi,S.
            Purification and characterization of a cytochrome P450 from liver
            microsomes of Xenopus laevis
            Arch. Biochem. Biophys., 345, 56-64 (1997)

CYP2Q2     Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP2Q3     Xenopus tropicalis (frog)
           See Xenopus page for seq

2R Subfamily

CYP2R1     human
           AC018795.4 also AC025730 AC025748
           Mikael Oscarson
           submitted to nomenclature committee 9/4/98
           missing N-terminal (approximately 80 amino acids)
           Unigene entry Hs.16846
           ESTs AA058765 zk65e06.r1, AA099882 zl90c08.r1, AA115448 zl04h11.r1
           AI280096 qh85e09.x1, AA732048 nz87c04.s1, AA449325 zx06e11.s1,
           AI221745 qg93e12.x1, AA088847 zl90c08.s1, AA235247 zs37b03.s1,
           AA115449 zl04h11.s1, AI431661 tg74h07.x1, AI376519 te59a09.x1,
           T83549 yd44f12.r1, T91507 ye20c08.s1, R11612 yf47e10.r1,
           T91536 ye20c08.r1, AA449583 zx06e11.r1, T83719 yd65h05.r1
           AA663042
MWKLWRAEEGAAALGGALFLLLFALGVRQLLKQRRPMGFPPGPPGLPFIGNIY
SLAASSELPHVYMRKQSQVYGE 
IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSR
YGRGWVDHRRLAVNSFRYFGYGQKSFESKILEETKFFNDAIETYKGRPFDFKQLITNAVS
NITNLIIFGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFR
NAAVVYDFLSRLIEKASVNRKPQLPQHFVDAYLDEMDQGKNDPSSTFSKENLIFSVGELI
IAGTETTTNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKCKMPYTEAVLHEV
LRFCNIVPLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDS
SGYFAKKEALVPFSLGRRHCLGEHLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMT
LQPQPYLICAERR 

CYP2R1     Macaca mulatta (rhesus monkey)
           partial
IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE
HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII
FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD
FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT
TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV
PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK
EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL
ICAERR

CYP2R1      Bos taurus (cow)
            See cattle page for details
MWEPHSAEAFVAALGGVFFLLLFALGVRQLLKQRRPSGFPPGPSGLPFIGNIYSLAASAELPHVYMKKQSQVYGE (0)
IFSLDLGGISAVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMG (1)
GLLNSRYGRGWVDHRKLAVNSFRCFGYGQKSFESKILEETKFFIDAVETYNGSPFDLKQLV
TNAVSNITNLVIFGERFTYEDTDFQHMIELFSENVELAASATVFLYNAFPWIGILPFGKH
QQLFRNAAVVYDFLSRLIEKASINRKPQLPQHFVDAYLDEMERSKNDPSSTFSKENLIFS
VGELIIAGTETTTNVLRWAVLFMALYPNIQ (1)
GQVQKEIDLIIGPSGKPSWDEKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGYSI
PKGTTVITNLYSVHFDEKYWRDPEIFYPERFLDSSGHFAKKEALIPFSL (1)
GRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPNLKPRLGMTLQPQPYLICAERR* 

CYP2R1    Sus scrofa (miniature pig) 
          no accession number
          Haitao Shang
          Submitted to nomenclature committee May 23, 2007
          95% to human 2R1
          partial seq.

CYP2R1    Sus scrofa (miniature pig) 
          BW980853.1, BG732954.1, BI359965.1 
          95% to human 2R1, lower case = cow seq
    MWEPPGAEVFPAALGGVL
2   FLLLFALGVRQLLKQRRPSGFPPGPSGLPFIGNIYSLAASAELPHIYMKKQSQVYGEIFS 181
182 LDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVDHRR 361
362 LAVNSFRSFGYGQKSFESKILEETKFFMDAIETYSSRPFDFKQLITNAVSNITNLIIFGE 541
542 RFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNAAVVYDFLS 721
722 RLIEKASINRKPQSPQHFVDAYLDEMDQGEKDPSSTFSKENLIFSVGELIIAGTETTTNV 901
902 LRWAILFMALYPNIQGR 952
    vqkeidliigpsgkpswdekckmpyteavlhevlrfcnivplgifhatsedavvrgysi
    pkgttvitnlysvhfdekywrdpeifyperfldssghfakkealipfsl (1)
    GRRHCLGEQLARMEMFLFFTALLQRFHLHFPHelvpnlkprlgmtlqpqpylicaerr*

CYP2R1     Canis familiaris (dog)
           NW_876313.1:37769697-37744500
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           93% to human CYP2R1
MRGPPGAEACAAGLGAALLLLLFVLGVRQLLKQRRPAGFPPGPSGLPFIGNIYSLAASGELAHVYMRKQSRVYGE
IFSLDLGGISAVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVDHRKLAVNSFRCFGYG
QKSFESKILEETNFFIDAIETYKGRPFDLKQLITNAVSNITNLIIFGERFTYEDTDFQHMIELFSENVELAASAS
VFLYNAFPWIGIIPFGKHQQLFRNAAVVYDFLSRLIEKASINRKPQSPQHFVDAYLNEMDQGKNDPSCTFSKENL
IFSVGELIIAGTETTTNVLRWAILFMALYPNIQGQVQKEIDLIMGPTGKPSWDDKCKMPYTEAVLHEVLRFCNIV
PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRNPEIFYPERFLDSSGYFAKKEALVPFSLGKRHCLG
EQLARMEMFLFFTALLQRLHFPHGLVPDLKPRLGMTLQPQPYLICAERR*

CYP2r1      mouse
            GenEMBL XM_146091.1
1   MLELPGARACAGALAGALLLLLFVLVVRQLLRQRRPAGFPPGPPRLPFVGNICSLALSAD 180
181 LPHVYMRKQSRVYGE 
    IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLL 540
541 NSRYGRGWIDHRRLAVNSFHYFGSGQKSFESKILEETWSLIDAIETYKGGPFDLKQLITN 720
721 AVSNITNLILFGERFTYEDTDFQHMIELFSENVELAASAPVFLYNAFPWIGILPFGKHQR 900
901 LFRNADVVYDFLSRLIEKAAVNRKPHLPHHFVDAYLDEMDQGQNDPLSTFSKENLIFSVG 1080
1081 ELIIAGTETTTNVLRWAILFMALYPNIQGQVHKEIDLIVGHNRRPSWEYKCKMPYTEAVL 1260
1261 HEVLRFCNIVPLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWKDPDMFYPERF 1440
1441 LDSNGYFTKKEALIPFSLGRRHCLGEQLARMEMFLFFTSLLQQFHLHFPHELVPNLKPRL 1620
1621 GMTLQPQPYLICAERR 1668

CYP2R1     chicken
           XM_420996 Gnomon prediction seems too long
           80% to human 2R1
MGPAAGDAEPEAAAGGGPWLL
LALPPLLLLFALVVRQLLKQRRPPGFPPGPAGLPLIGNIHSLGAEQPHVYMRRQSQIH
GQIFSLDLGGISAIVLNGYDAVKECLVHQSEIFADRPSFPLFKKLTNMGGLLNSKYGR
GWTEHRKLAVNTFRTFGYGQRSFEHKISEESVFFLDAIDTYKGRPFDLKHLITNAVSN
ITNLIIFGERFTYEDTEFQHMIEIFSENIELAASASVFLYNAFPWIGILPFGKHQQLF
KNAAEVYDFLHKLIERVSENRKSQSPRHFIDAYLDEMDCNKNDPESTYSRENLIFSVG
ELIIAGTETTTNVLRWAVLFMALYPNIQGHVQKEIDLVIGPNKMPALEEKCKMPYTEA
VLHEVLRFCNIVPLGIFHATSKDTVVRGYSIPEGTTVITNLYSVHFDEKYWNNPEVFF
PERFLDSNGQFVKKDAFIPFSLGRRHCLGEQLARMELFLFFTSLLQRFHLRFPHGGIP
DLKPRLGMTLQPQPYLICAERR

CYP2R1     Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP2R1      Danio rerio (zebrafish)

CYP2R1      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_7138
            69% to human 2R1
      MVPAQSPPLVPPSRDQALLGLACLTVAFLAVLLVRQLVK
      QRRPPGFPPGPSPIPIIGNIMSLATEPHVFLKKQSEVHGQ (0)
      IFSIDLGGILTVVLNGYDCIRECLYNQSEVFADRPSLPLFKKMTKMG 12808
12701 GLLNCKYSKGWIEHRKLACNSFRYFGSGQRLFERKISEECMFLVDAIDQHKGKAFNPKHL 12522
12521 VTNAVSNITNLIIFGQRFTYDDHNFQHMIELFSENVELAVSGWALLYNAFPWIEYLPFGK 12342
12341 HQKLFFNAAEVYDFLLRVTKEFSQGRVPHMPRHYVDAYLDELERNAGDPNSSFSYENLIY 12162
12161 SVGELIIAGTETTTNTLRWAMLYMALYPNIQG
      RVHREIDSVLANGRMPTLEDKQKMPYVEAVLHEVLRFCNIVPLGIFRATSQDA 11802
11801 NVNGYTIPKGTMVITNLYSVHFDEKYWSDPGVFSPQRFLDANGNFVRREAFLPFSLG 11631
11535 GRRQCLGEQLARMEMFLFFTTLLQRFHLQFPVGTIPTIAPKLGMTLQPKPYSICAVRR 11362
      HQKSLISVTTPCHK* 11317

CYP2R1   Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr II (+) strand 9716095-9718823
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         88% to Fugu 2R1
MVSIKAQSLVPVSCAQALLGVVCLAVALLAFLLVRQLVKQRRPPGFPPGPSPIPVIGNIFSLATEPHVFLKRQSE
VHGQIFSLDLGGILTVVLTGYDCVRECLYNQGEVFADRPSLPLFKKMTKMGGLLNCKYGKGWIEHRKLACNSFRY
FGSGQKQFERKISEECMFFVDAIDEHKGKPFNPKHLVTNAVSNITNLIIFGQRFTYDDRNFQHMIEIFSENVELA
VSGWALLYNAFPWIEYVPFGKHQKLFRNAAEVYDFLQEVIQSFSQGRVPHSPRHYVDAYLDDLERSAGAPDSSFS
YENLIYSVGELIIAGTETTTNTLRWAMLYMALYPNIQERVHREIDSVLANERAPTLEDKQKMPYVEAVLHEVLRF
CNIVPLGIFRATSQEAKVNGYTIPKGTMVITNLYSVHCDEKYWNDPGAFSPQRFLDSNGNFVRREAFLPFSLGRR
CCLGEQLARMEMFLFFTTLLQRFHLQFPAGSIPTVTPKLGMTLQPKPYSICAVRRQQKSPCFGDTPYPN*

CYP2R1     Oryzias latipes (medaka)
           chr3 17795604:17802282
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           87% to Fugu 2R1
MVSLTAASVVPVSRAMALLSVGCLAAALMAYLLVRQLVKQRRPPGFPPGPSPIPIIGNIFSLATEPHVFLKRQSE
VHGQIFSLDLGGIMTVVLNGYDCVKECLYHQSEVFADRPSLPLFKKMTKMGGLLNSKYGKGWNDHRKLACNSFRY
FGSGLRLFERKISEECMFFVDAIDEHKGKPFNPKHLVTNAVSNITNLIIFGQRFTYDDRDFQHMIELFSENVELA
VSGWALLYNAFPWIEYMPFGKHQKLFRNAMEVYDFLLEVIKRFSHGRVPHVPRHYVDAYLDELEQNSGDPSSSFS
YENLIYSVGELIIAGTETTTNTLRWAMLYMALYPNIQERVHREIDSVLTNGRAPTLEDKHKMPFVEAVLHEILRF
CNIVPLGIFRATSQEAKVNGYTIPKGTMVITNLYSVHFDEKYWNEPGVFSPQRFLDSSGNFVRREAFLPFSLGKR
HCLGEQLARMEMFLFFTTLLQRFHLQFPPGTVPTVTPKLGMTLQPKHYSICAIRRQQKVPNS*

CYP2R2P     Fugu rubripes (pufferfish)
            No accession number
Fc:c104I03x1 LPC.39565.x1 77% to fugu 2R1 MAY BE PSEUDOGENE OF scaf 7138 exon 8
201 DSVLANGRMPTLEDKQKMPYVEAVLHEVLRFCNIVPLGIFRATS*DANVNGYTIPKGTM 220
221 VITNLYSWHFYEKNWSKTGAFSHPKCLWDAHGHFCEWLMASMPGSFG 518

CYP2R3P     Fugu rubripes (pufferfish)
            No accession number
Fc:c068L08y2 LPC.26046.y2 67% to fugu 2R1 exon 8 possible pseudogene fragment
LYYTKIXTVLARVEIPTLEDKQKMPYLEAVLPEVLRFCDIVPLGLFRATSAGADVNGFTIPGGAVLIAILCSGRF

2S Subfamily

CYP2S1     human
           GenEMBL AF335278 AC011510
           ESTs T84852, AA315278, AA300981 and AA301039
           AA316621, AA496320, AA422150
           Rylander, T., Neve, E.P.A., Ingelman-Sundberg, M. and Oscarson, M.
           Identification and tissue distribution of the novel human cytochrome 
           P450 2S1 (CYP2S1)
           Biochem. Biophys. Res. Commun. 281, 529-535 2001
           There is no UNIGENE entry for any of these ESTs
           52% identical to CYP2B subfamily members, 50% to CYP2A 
           members and 50% to CYP2G1.
AC011510 one exon per line 78% to mouse 2s1 49% to 2B6 47% to 2A13
MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMR
LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH
GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE
GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQ
TYEMFSWFLRPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ
EEQNPGTEFTNKNMLMTVIYLLFAGTMTVSTTVGYTLLLLMKYPHVQ
KWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ
GTEVFPLLGSILHEPNIFKHPEEFNPDRFLDADGRFRKHEAFLPFSL
GKRVCLGEGLAKAEVFLFFTTILQAFSLESPCPPDTLSLKPTVSGLFNIPPAFQLQVRPTDLHSTTQTR

CYP2S1     Macaca mulatta (rhesus monkey)
           AC011510
           exons 2,3 from CO649282.1, gene fragmented on multiple scaffolds
MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMR (0)
LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH (1)
GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE (1)
GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQ (0)
TYEMCSWFLWPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ (0)
EEQNPDTEFTNKNMLMTVIYLLFAGTMTVSATVGYTLLLLMKYPHVQ (1)
KRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ (0)
GTEVFPLLGSILHDPSIFKHPEEFNPDHFLDADGRFRKHEAFLPFSL (1)
GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISGLFNIPPAFQLQVRPTDLHSTTQTT*
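
Entries written one exon per line, such as the one above, can be joined into a single protein string. The following is a minimal sketch, assuming the trailing "(0)" or "(1)" on a line is an intron-phase annotation and not part of the sequence. The function name join_exons is hypothetical.

```python
import re

# Trailing "(n)" at the end of an exon line; assumed to be an intron phase.
PHASE = re.compile(r"\s*\((\d)\)\s*$")

def join_exons(lines):
    """Concatenate exon lines into one peptide and collect the phase marks."""
    parts, phases = [], []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        match = PHASE.search(line)
        if match:
            phases.append(int(match.group(1)))
            line = line[:match.start()]
        parts.append(line.strip())
    return "".join(parts), phases

# The last two exon lines of the rhesus CYP2S1 entry above.
protein, phases = join_exons([
    "GTEVFPLLGSILHDPSIFKHPEEFNPDHFLDADGRFRKHEAFLPFSL (1)",
    "GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISGLFNIPPAFQLQVRPTDLHSTTQTT*",
])
print(protein)
print(phases)
```

The phase list is returned separately so that the joined sequence can be pasted into a search tool without annotation characters.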

CYP2S1      Bos taurus (cow)
            See cattle page for details
MEAAGTWALLLLLLLLVVTLVLPATWDRGHLPPGPTPLPLLGNLLQLRPGALYLGLLR
LSKKYGPVFTVYLGPWRRVVVLVGHEAVQEALGGQAEEFSGRGTVATLDGTFDSH 
GVFFSNGERWRQLRKFTTLALRDLGMGKREGEELIQAEARCLVEALQGTK
GRPFDPSLLLAQATCNIICSLVFDLRLPYDNEEFQAVVRAAGGIAVGVSSPWGQ
TYEMFSRFLQRLPGPHTQLLRHLGTVAAFAAQQVWQHKGSLGTSGPVRDLVDAFLLKMAK
EKQDPNTEFTAKNLLMTVVYLLFAGTVTVSTTIRYTLLLLLKYPQVQ
ERVQEELMRELGAGQRPSLGDRARLPYTDAVLHEAQRLLALVPMGIPRALTKTTRFRGYTLPQ
GTEVFPLLGSILHDPAVFEEPKEFNPGRFLDADGKFKKHEAFLPFSL
GKRVCLGEGLARTELFLLFTAILQAFSLEGPCPLGALSLQPAISGLFNIPQAFQLQFRPR*

CYP2S1      Sus scrofa (pig)
            DT323081.1 
            85% to CYP2S1 cow
MEAAGTWALLLVLVLLLLLALALPGIRTGGHLPPGPAPLPLL
GNLLQLRPGAL
YLGLMRLSKKYGPVFTVYLGPWRRVVVLVGREAVQEALGGQAEEFSGRGMVATLDGTFDS
HGVFFSSGERWRQLRKVTMLALRDLGMGKREGEELIQAEAQRLVEEIRGTKGRPLDPSLL
LAQATSNIICSLIFGRRFPYDNEEFQAVVRAAGGTVVGVSSPWGQTYEMFSRVLQYLPGP
HTQLLGHLGTLAAFAVQQV

CYP2S1     Canis familiaris (dog)
           NW_876270.1: 43044442-43033913
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           80% to human CYP2S1
MEAAGTWTLLLALLLLLLLLALARPRTRGHLPPGPPPLPLLGNLLQLRPGALYSGLLRLSKKYGPVFTVYLGPWR
RVVVLVGHEAVQEALGGQAEEFSGRGMLATLDGTFGGHGVFFSNGERWRQLRRLTTLALRDLGMGKREGEELIQA
EAQSLVEAFQGTVGRPFDPSLLLAQATSNIICSLTFGLRFPYEDKEFQAVVQAAGGTVLGVSSPWGQTYEMFSWL
LQHLPGPHTQLLSHLSVLATFAVQQVQRHKESLDTSGPPHDVVDAFLLKMAKEEQDPNTELTDKNLLMTVIYLLF
AGTVTVSTTVRYTLLLLLKYPQVQERVREELSRELGAGRAPGLGDRARLPYTDAVLHEAQRLLALVPMGVPRALA
RTTCFRGYTLPQGTEVFPLLGSVLHDPEIFDEPEEFNPDRFLDADGRFQKQEAFLPFSLGKRICLGEGLAHAELF
LLLTTILQAFSLESPSPPGALSLQPAVSGLFNIPPAFQLRVRP*

Cyp2s1         mouse
            GenEMBL AA967201 ua50f06.r1 80% IDENTICAL TO HUMAN CYP2S1
            AC073725.2, AC087155.1, NT_039407.1
AA967201 ua50f06.r1 80% IDENTICAL TO HUMAN CYP2S1
AA562979 vl64a09.r1
AA543966 vj69d06.r1
AA472776 vg94b11.r1
AI481433 vg94b11.x1
NT_039407.1 - strand 
1933418 MEAASTWALLLALLLLLLLLSLTLFRTPARGYLPPGPTPLPLLGNLLQLRPGALYSGLLR 1933239
1931966 LSKKYGPVFTVYLGPWRRVVVLVGHDAVREALGGQAEEFSGRGTLATLDKTFDGHG 1931799
1928473 GVFFANGERWKQLRKFTLLALRDLGMGKREGEELIQAEVQSLVEAFQKTE 1928324
1925993 GRPFNPSMLLAQATSNVVCSLVFGIRLPYDDKEFQAVIQAASGTLLGISSPWGQ 1925832
1925752 AYEMFSWLLQPLPGPHTQLQHHLGTLAAFTIQQVQKHQGRFQTSGPARDVVDAFLLKMAQ 1925573
1924579 EKQDPGTEFTEKNLLMTVTYLLFAGTMTIGATIRYALLLLLRYPQVQ 1924439
1922453 QRVREELIQELGPGRAPSLSDRVRLPYTDAVLHEAQRLLALVPMGMPHTITRTTCFRGYTLPK 1922265
1920451 GTEVFPLIGSILHDPAVFQNPGEFHPGRFLDEDGRLRKHEAFLPYSL 1920311
1920154 GKRVCLGEGLARAELWLFFTSILQAFSLETPCPPGDLSLKPAISGLFNIPPDFQLRVWPTGDQSR* 1919957
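
The coordinate-flanked exon lines above can be checked for internal consistency. The following is a minimal sketch, assuming the two flanking numbers are inclusive nucleotide coordinates and that every residue, including a terminal "*", corresponds to one codon. The function name check_exon_line is hypothetical.

```python
import re

# "start PEPTIDE end": two integers flanking a run of residue letters.
EXON_LINE = re.compile(r"^\s*(\d+)\s+([A-Z*]+)\s+(\d+)\s*$")

def check_exon_line(line):
    """Return strand, residue count and whether span == 3 * residues.

    Returns None for lines that are not coordinate-flanked exon lines.
    """
    match = EXON_LINE.match(line)
    if match is None:
        return None
    start, residues, end = int(match.group(1)), match.group(2), int(match.group(3))
    span = abs(end - start) + 1          # inclusive coordinates
    return {
        "strand": "+" if end >= start else "-",
        "residues": len(residues),
        "span": span,
        "consistent": span == 3 * len(residues),
    }

# First exon of the mouse Cyp2s1 entry above.
line = ("1933418 MEAASTWALLLALLLLLLLLSLTLFRTPARGYLPPGPTPLPLLGNLLQLRPGALYSGLLR "
        "1933239")
print(check_exon_line(line))
```

A line that fails the check is not necessarily wrong: entries with noted frameshifts or split codons at exon boundaries would be expected to deviate.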

Cyp2s1-ie4b mouse
           GenEMBL  NT_039407.1 + strand 2s 
           internal exon 4 partial duplication
           z in Figure 2B Nelson et al. Pharmacogenetics 14, 1-18 (2004)
1927805 QAASGTLIGISSP*GQ 1927852

2T Subfamily

CYP2T1    rat
          No accession number
          Lars von Buchholtz
          Submitted to nomenclature committee 3/6/2000 
          73% to CYP2T2P

CYP2T2P   human
          GenEMBL AC008537
RAQMRGSLPPRPRPLPLLGNL
QLQSGGLDRALHSLSGRWGRVFTVRLGPRPAVGLCGYAALRDALVLQADA 
VSGRGSMAVFERFTRGNGILFSNRPCWWTLRNFALGALKKFGLGTRTVEA 
RVLEEAACLLDEFQATIGAPFDPVRLLDNAVSNVICSLVFGNRYRYGDPE 
FLRLLNLFSDNFCIISSRWGESLMDWLPGPHHRIFRNFSE 
LRVISEQIQRHWQMRQPAEPRDFIDCLTRWVRHGQQDPESHFQE*TSVM 
TTHFFFGVTETTSTTLCYGLLILLKYLEVAAKVQELDPVVGWRPAPSL 
DYRVCLPYANAVLLEIQCFISVVPLGLPRTLTLDTHLHSHCLPKGTFVIP 
LLVTAHRDPTQFKDPDCFNPTNFLDKGKFQGNDAFMPFAPAKQMCLG 
TGLAHSGIFLFLTATLQRFCLLPVVRPGTINLTQCTGLGSVPPDFQLQPVAC

CYP2T2     Canis familiaris (dog)
           chr1:115897947-115901169 UCSC browser May 2005 assembly
           78% to mouse Cyp2t4
MFTALLLLLLLLLLLALARRSWGAQGTRTQGALPPGPTPLPLLGNLLQLESRRLDRALME (0)
LSGRWGPVFTVRLGPRPAVVLCGYSALRDALVLQADAFSGRGAMAVFERFTHGN
GIVFSNGLRWRTLRNFALGALKEFGLGTRTIEERILEEAACLLGEFQATT
GAPFDPRRLLGNAVSNVICSVVFGNRYGYEDPEFQRLLDLFNDNFRIMSSRWGE
MYNVFPTLLDWLPGPHHRIFQNFTELRVFISEQIQRHQQTRQPGKPRDFIDCFLDQMDK
EQNDPESHFQEETLVMTTHNLFFGGTETTSTTLRYGLLILLKYPEVA
AKVQAELDAVVGQSRTPRLGDREHLPYTNAVLHEIQRFISVLPLGLPRALTRDTHLHGYFLPK
GTFVIPLLVSSHRDPTQFKDPDCFNPTNFLDDKGEFQTNDAFMPFAP
GKRMCLGAGLARSEIFLFFTAILQRFCLLPVGNPANIDLSPQCTGLGNIPPAFQL
RLVAR

CYP2T2P ortholog      Bos taurus (cow)
            See cattle page for details
MMISGIIALSLLVLLLAPARWGWGARSTQRQGALPPRATPLRLLGSLLQLRIWRPGPCTHG
LSGRCGPVFTVCLGQCPVVVLCRYAALRDALVLQADAFSGRGAMAVFKRFTRGN
GIAFSKGPRWPTLRNFALGALKEFGLGTQTIEERVLEEAACLLGDFQATGG
GAPFDPQRLLDNAVSNVICSVVLGNHYGYEDMEFLRLLDLFNDNFRIMSSRWGE
XXXXXSLLDWLPGLHH*IFRNFAXLRVFISQQIQLHQQTR*SGKPHDFIDXXXXXXX
GTENPESHFQAETLAMTMHNLFFGXXETTSTTLRYGLILLKYSFVA
AKVQAELDDMVGRMCAPTLEDREHLPYTNTVLHEIQCFISVVPFGLPSALTCDTHLRGYFLPK
GTFVIPLLVSTHWVPTQFKNPECFNPTNFLNDQGEFQSNAFTPFAL
GTCLGAGLAPTDIFLFLTSILLRFFLLPVGSHSDTDLTPQCTGLGNVPPAFQLRLVAR*

CYP2T3P   human
          GenEMBL AC008962 C-terminal missing
RAQMRGSLPPRPRLLPLLGNLQLQSGGLDRALHS
LSGRWGRVFTVRLGPRPAVVLCGYAALRDALVLQADAVSGRGSMAVFERFTRGN
GILFSNRPCWWTLRNFALGAPKKFGLDTRTIEARVLDEAACLLDEFQATI
GAPFDPVRLLGNAVSSVTCSCLREPLWLRGPGVPEVLNLLSDNFRIMSSKWGE  
SLMDWLPGPHHQIFQNFSELQVFISEQIQQHWHMRQPAEPRDFIDCLARWVRHG
QQDPESHFQEETLVMTMQLFFFFFFGGTETTSTTLC
AKGQELDPVVGQRPVPSPD
DHVQWPYTNAVLLEIQRFISVVPRTLTLDTHLHSHCLAKG

Cyp2t4 mouse
           GenEMBL NT_039413.1 + strand
157707 MVTCLALLLLLLILMLLLWWGGVVRRQAQMQKDLPPGPAPLPLLGNLLQLQSGDLDRVLME 157889 
158219 LSSHWGPVFTVWLGPLPAVVLCGYEALRDALVLQADAFSGRGAMAVFDRFTCGN 158380
158742 GIVFSNGPRWHSLRNFALGVLRELGVGRSTIEDRILEEAACVLDEFQATM 158891
159103 GAPFDPQQLLDSAVSNVICTVVFGKRYDYGDPEFRRLLNLFSDNFCIMSSRWAE 159264
159884 IYNMFPSFMDWIPGPHNRIFKNFQELRLFISEQIQWHWQSRQTGEPRDFIDCFLDQMDK 160060
160137 EQQDLESHFQDETLVMTTHDLFFGGTETTSTTLRYGLLIMLKYPEVA 160277
160379 AKVQEELDATVGRTWAPRIEDRARLPYTNAVLHEIQRFISVLPLGLPRALTRDVNLKNHFLHK 160567
160818 GTFVIPLLVSAHRDPTQFKDPDHFNPTNFLDDHGEFQNNDAFMPFAL 160958
161048 GKRMCLGAGLARSEIFLFLTAILQKFSLLPVGSPANINLNPQCTGLGNVPPAFQLRLVAR* 161230

2U Subfamily

CYP2U1    human
AC025090, (AC000016 has C-term) 41% to 2N1 new CYP2 subfamily

MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGI
77036 PPGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSI 76863
76862 FSFFIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVT 76734
105008 GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPF 105160
105161 SIISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPF 105340
105341 GPFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEE 105517
105518 YLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ 105622
107396 KVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSENT 107554
109370 LQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGIG 109540
KRVCMGEQLAKMELFLMFVSLMQSFAFALPEDSKKPLLTGRFGLTLAPHPFNITISRR

CYP2U1     Macaca mulatta (rhesus monkey)
           note gc boundary between exons 7,8
MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP 
PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF 
FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEK (1)
GVVFAHYGPIWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSI
ISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGP
FKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLF
YIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ (1)
EKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSGNT (1)
VLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGI (1)
GKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNITISRR*

CYP2U1      Bos taurus (cow)
            See cattle page for details
MASPGLPQPPTEDAAWPLRLLHAPPGLLRLDPTGGALLLLVLAALLGWSW
LWRLPERGIPPGPAPWPVVGNFGFVLLPRFLRRKSWPYRRARNGGMNASGQGVQLLLADL
GRVYGNIFSFFIGHYLVVVLNDFHSVREALVQQAEVFSDRPRVPLTSIMTKGKGIVFAHY
GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFRYVKEEMQKHGDAPFNPFPIVNNAVSN
IICSLCFGRRFDYTNSEFKQMLTFMSRALEVCLNTQLLLVNICSWLYNLPFGPFKELRQI
EKDLTLFLKKIIKDHRESLDVENPQDFIDMYLLHVEEEKKNNSNSGFDEDYLFYIIGDLF
IAGTDTTTNSLLWCLLYMSLHPNIQEKIHEEIARVIGADRAPSLTDKAQMPYTEATIMEV
QRLSTVVPLSIPHMTSEKT
VLQGFTIPKGTIILPNLWSVHRDPAIWE
KPNDFYPDRFLDDQGQLIKKETFIPFGI
GKRVCMGEQLAKMELFLMFVSLMQSFTFVLPKDSKPILTGKYGLTLAPHPFNIIISKR

CYP2U1     Canis familiaris (dog)
           NW_8762971.1:28366254-28348146
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           75% to human CYP2U1
WLHRRTPVAAAGGAAGAGGHSSARGPQLLLADLARAYGAVFSFFIGRHLVVVLSDFRSVRAALVQQAEIFSDRPR
VPLVSLVTKEKGIVFAHYGPVWKQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKEEMQKHGEDPFNPFPIVNNA
VSNIICSLCFGQRFDYTNSEFKKMLRLMSRALEICLNSQLLLVNICSWLYYLPFGPFKELRQIEKDITTFLKKII
KDHKESLNVENPQDFIDMYLLQVEEERKNNSNSSFNEDYLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDIQEK
VQEEIERVIGADRVPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSEKTLQGYTIPKGTVILPNLWSVHRDP
AIWEKPDDFYPNRFLDDQGQLIKKETFIPFGIGKRVCMGEQLAKMELFLMFVSLMQSFTFALPKDSKKPILTGRY
GLTLAPHPFNIVISKR*

Cyp2u1 mouse 
           GenEMBL AK018458 16 days embryo lung cDNA about 78% 
      MSSLG DQRPAAGEQPGARLHVRA        TGGALLLCLLAVLLGWVWLRRQRACGI
      PPGPKPRPLVGNFGHLLVPRFLRPQFWLGS     GSQTDTVGQHVYLARMARVYGNI
      FSFFIGHRLVVVLSDFHSVREALVQQAEVFSDRPRMPLISIMT
      KEKGIVFAHY
      GPIWKQQRRFSHSTLRHFGLGKLSLEPRIIEEFAYVKEAMQKHGEAPFSPF
      PIISNAVSNIICSLCFGQRFDYTNKEFKKVLDFMSRGLEICLHSQLFLINICPWFYYLPF
      GPFKELRQIERDISCFLKNIIREHQESLDASNPQDFIDMYLLHMEEEQGASRRSSFDED
      YLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ
      KKVHEEIERVIGCDRAPSLTDKAQMPYTEATIMEVQRLSMVVPLAIPHMTSEKT
      VLQGFTIPKGTVVLINLWSVHRDPAIWEKPDDFCPHRFLDDQGQLLKRETFIPFGIG

CYP2U1     Xenopus tropicalis (frog)
           See Xenopus page for seq

CYP2U1      Danio rerio (zebrafish)

CYP2U1-de1b Danio rerio (zebrafish)

CYP2U1      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_8899
            56% to human 2U1
MMSLSWLQSLSSSILTLVIMIILHHLFKCYQKRHGFANIPPGPKPWPVVGNFGGFL (0) 
NAAAVLTELAKVYGNVYSIYVGSQLVVVLNGYKVVRDALSNHPDVFSDRPDIPA 
ISIMTKISGIVFAPYGPLWQKHRRFCLSTLRNFGLGRLGLEPCIVEGLTNIKTELLRLE 
EESGGAGVDPAPVISNAVSNVICSLVLGHRFNHDDQEFRSMLRLMDRGLEICVNSPAVLI 
NVFPLLYHLPFGVFRELRQVERDITAFL
KRFIANHQETLDPNNPRDLTDMYLKEISARREAGDVDSGFTED
YLFYIIGDLFIAGTDTTANSVLWVILYMASYPDIQ
KVQAEIDGVVGPLRTPSLSDKGKLPFTEAAIMEVQRLTTVVPLAIPHMTSET
EFMGYTIPKGTVVLPNLWSVHRDPTEWDDPDSFDPTRFLDEDGTLLRKECFIPFGIG
RRVCMGAQLAKMELFLTVTNLLQTFHFRLPEGAPRPPLQGRFGLTLAPCPYTVCINPR

CYP2U1   Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr IX (-) strand 8019744-8022277
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         73% to Fugu 2U1
MASLSWPSGADLSRVDVVALLLASLLLALCLFDVHRRRRDLANIPPGPTPWPLVGNLGFSLVPALFRRRFGEKPV
DKNAMVLLTERAAVYGNVYSMFVGSQLMVVLNGYEAVKDALSNHPEVFSDRPDIPAITIMTKRKGIVFAPYGPVW
RKQRKFCHTTLRSFGLGKLSLEPCIQQGLTTVKTELLHLSKKSGATGVDPAPLISNAVSNVICSLILGQRFHHED
RQFRSMLDLMDRGLEICVSSPAVLINVFPLLYYWPFGVFRELRRVEGDITAFLKRIIATHRETLDPDNPRDLVDM
YLMEMSAQQAAGEEDSSFTEDYLFYIIGDLFIAGTDTTANSVLWVLLYMVLHPDIQDKVQTEMDEVVGTHRTPSS
TDKGSLPFTEATIMEVQRMTVAVPLAIPHMASETTEFRGYTIPKGTVIVPNLWSVHRDPTVWDEPDRFNPARFLD
EEGQLLRKECFIPFGIGRRVCMGEQLAKTELFLTVTSLLQAFRFRLPEGAPPPSLTGRFGLTLAPCPYAVCVSPR
G*

CYP2U1     Oryzias latipes (medaka)
           chr1 20316302:20324749
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           66% to Fugu 2U1
MVSSSFGLIWSSVLSLSNLLTSLLFLLVYYLVRFYQKQRTIYKNIPPGPKPWPVVGNFGNFFVPPSVRTKIAGQP
NSTNAIEIEALRQQATVFGNIHSLFIGGQLIVVLHGFHLIRDALLNQPEVFSDRPDIPLVTILTKRKGIVFAPYG
PVWRKQRKFCHTTLRSFGLGKLSLEPCIQRGLAGVKAELLRLNEERGSAGVDPATLIGNSVSNVICSLILGQCFH
HHDVEFRTMIRLMEHGLKICINSPAVLINIFPLLYYLPFGVFKELRQVERDITAFLKRIIAKHRDTLDPDNPRDL
TDMYLIEMLTQQAAGEEDSSFTDDYLFYVIGDLFIAGTDTTTNSILWFLLYMILHPDVQDKAQAEIDGVVGKHRV
PSVTDKGSLPFTEATIMEVQRLHSVVPLAIPHMTSETTVFRGYTIPKGTVIFPNLWSVHRDPTLWEDADSFNPSR
FLDNEGNLLRKEYFIPFGIGRRVCMGEQLAKMELFLTVTTLLQAFKFRHPEGNPPPTVKERFGLTMAPCPFSVCV
TPRGGPNLNP*

2V Subfamily

CYP2V1   Danio rerio (zebrafish)
         GenEMBL AB026158
         Ohta,M., Saitou,T., Yoshizaki,G. and Otsuki,A.
         Identification of a Cytochrome P450(CYP2) cDNA for Zebrafish
         Also found as an EST from Yea-Huey Yang, Jun-Lan Wang-Buhler and 
         Donald R. Buhler Submitted to nomenclature committee 7/1/2000
         Note: AB026158 has at least 2 frameshifts and some 
         other probable errors.  Buhler's sequence seems to be more 
         accurate.

CYP2V1   Danio rerio (zebrafish)
         No accession number
         Tseng, H.-P., Wang-Buhler, J.-L., Yang, Y.-H.,
         Hu, C.-H., Buhler, D.R.
         submitted to nomenclature committee 12/08/2003
         51% to CYP2Z2 
         clone name YH-F4-FL

2W Subfamily

CYP2W1   human
         GenEMBL AC073957.3 chromosome 7 
         clone RP11-449P15 40% to 2F1
MALLLLLFLGLLGLWGLLCACAQDPSPAARWAPGLRPLPLVGNLHLLRLSQQDRSLME 
LSERYGPVFTVHLGRQKTVVLTGFEAVKEALAGPGQELADRP
PIAIFQLIQRGGGIFFSSGARWRAARQFTVRALHSLGVGREPVADKILQELKCLSGQL
DGYRGRPFPLALLGWAPSNITFALLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQL
FNVHPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHVCPGDPVCSYVDALIQQGQG
DDPEGLFAEANAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQGRVQEELDRVLGP
GRTPRLEDQQALPYTSAVLHEVQRFITLLPHVPRCTAADTQLGGFLLPKGTPVIPLLT
SVLLDETQWQTPGQFNPGHFLDANGHFVKREAFLPFSA
GRRVCVGERLARTELFLLFAGLLQRYRLLPPPGVSPASLDTTPARAFTMRPRPRALCAVPRP*

CYP2W1    Macaca mulatta (rhesus monkey)
          AC073957.7 chromosome 7
LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGG (1)
GIFFSSGARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYR (1)
GQPFPLALLGWAPSNITFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQ (0)
LFNVYPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ (0)
GDDPEGLFAEDNAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQ (1)
GRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFITLLPHVPRCTATDMQLGGFLLPK (0)
GTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFVKQEAFLPFSA (1)
GRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMRPRAQALCAVPRP

CYP2W1      Bos taurus (cow)
            See cattle page for details
            Partial seq.
LGKQYGPVFTVHLGHQKTVVLTGYEAVKEALVGTGQELAGRPPIAIFQLINGGG (1)
GVFFSSGPRWRAARQLTVRALHGLGVGRAPVANKVLQELRCLTAQLDSYE (1)
GRPFPLALLRWAPSNITFTLLFGQRFDYRDPVFLSLLGLVDEVMVLLGKPSVQ (0)
LFNLYPRLVALLQLHRPVLRKIEEVRAILRALLEARRHRTPPRGPQQSYLDALIQQGQ (0)
XXXXX
XXXXXXXXXXXXXXXXPRPEDVHALPYTNAVLHEVQRFITLLPHAPRCTVANTQLGPYLLPK
GTPVLALLNSVLLDETQWKTPRQFNPGHFLDANGRFVKRPAFLPFSA

CYP2W1     Canis familiaris (dog)
           NW_876319.1: 293563-287849
           Joanna Wilson and students
           submitted to nomenclature committee Feb. 17, 2009
           83% to human CYP2W1
MALLLLGILLLLGLWGLLRTCTRTPSSASRWPPGPRPLPLIGNLHLLRVSQQDQSLMELSEQYGPVFTVHLGRQK
TVVLAGYEAVREALVGTGPELADRPPIAIFQLIQGGGGIFFSSGARWRAARQFTIRTLHGLGVGRGPMADNVLQE
LRCLMGQLDCYRGQPFPLALLGWAPSNITFTLLFGRRFDYQDPVFVSLLSLIDEVMVLLGTPSLQLFNIYPWLGA
LFQLHRPVLRKIEEVRAILRTLLKARRPSMPGGGPVQSYMDALIQQGQGKDPQGLFAEANMVACTLDMVMAGTET
TSATLQWAALLMGKHPSVQCRVQEELDRVLGPGRAPQLEDQRSLPYTNAVLHEVQRFITLLPHVPRCMAADTQLG
GYLLPKGTPVIPLLSSVLLDKTQWETPRQFNPGHFLDAEGRFVKRAAFLPFSAGRRVCVGESLARSELFLLFAGL
LHRYRLLPPPGLSPDALDTTPAPAFTMRPPAQALCAVPRPGGYDQGDWGRV*


The following cDNA AK000366.1 has been reported from Japan in a project to 
identify full-length cDNAs.  It is part of the 2W1 gene.  The reported 
sequence shown below is not full length: it is missing the N-terminal 
exon and the C-terminal exon.  If one translates the sequence upstream of 
the ATG shown below, one finds the N-terminal exon sequence as shown 
above, but only about 7 amino acids' worth before the sequence runs out.  
Similarly, if the genomic clone is searched downstream of the end of the 
cDNA, a clear heme-binding sequence is found and another exon is 
identified.  This last exon has a problem: it is too long if allowed to 
run until it hits a natural stop codon.  However, in another frame there 
is a sequence LCAVPRP* that is identical to the end of CYP2D6, and it is 
at the right location to be the end of the 2W1 gene.  I suspect there is 
a frameshift between the heme-binding region and the LCAVPRP* sequence.  
I have shown the 2W1 gene with this frameshift, though its exact location 
is uncertain.
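
The frame comparison described above can be sketched in code. The following is a minimal illustration, assuming the standard genetic code and a forward-strand DNA string. The toy sequence is invented for the example and is not taken from AK000366.1 or the genomic clone, and the function names are hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna, frame):
    """Translate dna in the given forward frame (0, 1 or 2); unknown codons give X."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)

def frames_with_motif(dna, motif):
    """Return (frame, residue index) for each forward frame containing motif."""
    hits = []
    for frame in range(3):
        position = translate(dna, frame).find(motif)
        if position != -1:
            hits.append((frame, position))
    return hits

# Toy example: two leading bases push the motif into frame 2.
toy = "AA" + "CTGTGTGCTGTTCCTCGTCCTTGA"
print(frames_with_motif(toy, "LCAVPRP*"))
```

If the heme-binding region and the C-terminal motif turn up in different frames of the same exon, that is consistent with a frameshift somewhere between them, though it does not locate it.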

Cyp2w1     MOUSE 
           GenEMBL XM_144624 WHOLE mRNA
           PARTS from GSS AZ515172 AZ329864 AZ983190 BH076787
MALLLLGVWGILLLLGLWGLLQGCTRSPSLAPRWPPGPRPLPFL
GNLHLLGVTQQDRALMELSERYGPMFTIHLGSQKTVVLSGYEVVREALVGTGHELADR
PPIPIFQHIQRGGGIFFSSGARWRAGRQFTVRTLQSLGVQQPSMVGKVLQELACLKGQ
LDSYGGQPLPLALLGWAPCNITFTLLFGQRFDYQDPVFVSLLSLIDQVMVLLGSPGIQ
LFNTFPRLGAFLRLHRPVLSKIEEVRTILRTLLETRRPPLPTGGPAQSYVEALLQQGQ
EDDPEDMFGEANVLACTLDMVMAGTETTAATLQWAVFLMVKHPHVQGRVQEELDRVLG
PGQLPQPEHQRALPYTSAVLHEVQRYITLLPHVPRCTAADIQLGGYLLPKGTPVIPLL
TSVLLDKTQWETPSQFNPNHFLDAKGRFMKRGAFLPFSAGRRVCVGKSLARTELFLLF
AGLLQRYRLLPPPGLSPADLDLRPAPAFTMRP (end may be frameshifted)
PAQTFSYDSVYSGAKAAYPYVEVGSWPFIWHHGAEGVSAQCSGPTLS

2X Subfamily

CYP2X1   Ictalurus punctatus (catfish)
         GenEMBL AF315346.1
         Schlenk,D., Furnes,B. and Zhou,X.
         Isolation and cloning of a new P450 2 family gene from Ictalurus
         punctatus.
         Unpublished
         42% to 2N2

CYP2X2      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_4007
      MVTSVILLCLGVVVLVLLLRSQRPKNFPPGPPVLPLLGSILELALDNPLQDFER (0)
12453 LRKKYGNVYSLFLGTRPAVVISGLKNIKEALVTKGSDFSGRPQDMILSI 12629
      possible frameshift DAIKTN (1)
13208 VIMQDYNLVWKEHRRFALTTMRNFGMGKTSMEDRIHGEIEYIVNTLEKNN (1)
      GKTLSPHLMFHNAASNIICQVLFGTRYEYDDHFIREIVRCFTENAKISNGPWAM (0)
      LYDSIPLVRYLPLPFKNAFKNVE (0)
      TAENLVKDLFVEHKKTRMSGDPRDFVDCYFDELDK (0)
      RGKDRSSFSENMLTMYALDLHFAGTDTTSNTLLTGFLYLMNYPHIQ (1)
      ERCHQEIDKVLQDNETVTYDARNQMPYMQ (0)
15630 AVIHEVQRVANTVPLSVFHCTTKDTEFMGYSIPK 15731 (0)
15853 GTLIIPHLASVLKEEGQWKFPNEFNPDNFLNDDGEFVKPEAFMP 15984 (frameshift) XST (1)
16100 GPRVCLGEGLARMELFLIIVTLLHKFQFIWPEDAGEPDYTPIFGATQTPKPYRMKIQLRK* 16282

CYP2X3      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_10845
missing exons 1-4 off the end of the scaffold
7527 LYDSFPAVRYLPLPFKRGFEMFK 7450 (0)
7381 MSHERYLEMFVETKKTRVPGKPRHFVDAYMDELEK 7277 (0)
7193 RGDEAFFSEDQLCAIILDLHFAGTDTTANTLLSGLLYLMKYPHIQ 7057 (1)
6289 EYCQQEIDKVMQGKNEVSFEDRVQMPYVQ 6203 (0)
6105 AVIHEIQRTANTVPLSVFHCTTRDTELMGYSIPK 6004 (0) exon 9
5617 GTLIIPNLSSVLNEKGQWKSSHEFNPENFLNENGEFVQPEAFMPFST 5477 (1)
5244 GPRVCLGEGLARMELFIILVSLLRKFRFIWPEDAEEPDLTPVFGVTQTPKPYSLKVQVRSRC* 5056

CYP2X4X     Fugu rubripes (pufferfish) discontinued name
            No accession number
FE:EFRy002apsE4 EST exons 10 and 11
Length = 458 395-496 51% to 2D6 87% to Scaffold_10845 (CYP2X3)
Note: this EST is not in the current Fugu databases and appears to have been 
removed. It may have been a poor quality sequence of CYP2X3 (March 2, 2005)
SSPKGTIIIPNLSSVLNEKGQWKCPHEFHPGNFLNENGEFVKPEAFVPFST
GPRVCLGEGLARMELFIILVTLLRRFKFIWPEDAEEPDLTPIFGLTQTPKPYRLKVQIRSSFK*

CYP2X5P     Fugu rubripes (pufferfish)
            No accession number
Scaffold_3538 57% to FE:EFRy002apsE4 51% to 2D6 Length = 26272 
61% to 2X2 59% to scaf 10845 (CYP2X3)
first 8 exons missing off end of scaffold
E in EXXR motif missing, one bad boundary, no exon 11 found
Possible pseudogene
25728 (0) PGIHKVQRIANTVPLNVQYCTMKETQLMAHLLPR 25627 exon 9 bad boundary
25349 (0) ETLIIQNLNSRQNEEGQWKFPHKSRPENFLNDQGEFVKTEDFMLFSA 25209 (1) exon 10

CYP2X6      Danio rerio (zebrafish)
            ctg22265.a 66% to CYP2X1
708019 MLGSSLLVVICILLIFFLIRVKKPKNFPPGPPPVPIFGNLLQLNLANPLKDFEK 707858
707784 FAEKYGEIFSLYTGSRPAVILNSFAVIKEALVTKAQDFSGRPQDFMISHATENKGN 707617
705571 IVLADYGPVLKGHRRFALMTMRNFGLGKQSMEERILGEISHVVDYLDKNA 
       GKRVDPHIMFHNVASNVISLLLFGCRFDYNSEFLQCYIQLINEISKIINGPWNM 705149
703459 IYDTFPLLRILPLPFKKAFDHVKVIKSMNLKLIDEHKSTRVPGEPRDFIDCYLDELDK 703286
703161 GKNCVSTFSEDKLLMSIMDLHFAGTDTISNTLLTAFLYLMNHPEVQ 703024
702766 VKCQQEIDDVLEGKDQVTYEDRHNMPYTLAVIHEVQRVANTVPLSVFHCTTRDTELMGYSIPK 702578
702500 GTIIIPNLTRVLKEEGQWKFPYEFNPANFLNEQGQFEKPEAFIPFST 702360
701098 GLRMCLGEGLARMELFLIFVTLLRRFQFVWPEDAGKPDYTPVFGLTLTPKPYRMHIRRRETVKQ* 700904

CYP2X7      Danio rerio (zebrafish)
            ctg22265.b CYP2X1 Missing C-term 
            BC053412 AI959373 fd08g05.y1 CK030199 AI959373
            zfishC-a2684d06.q1c
            ctg11087 = BC053412 FILLS IN exons 3,4 in a GAP IN ctg22265
718880 MLEVSVLILICIFLVFFLIRIKRPKNFPPGPPPVPIFGNLLQINMVDPLKEFER 718719
718641 LAEKYGNIFSLYTGSKPAVFLNNFEVIKEALVTKAQDFSGRPQDLMISHL 718492 TGNKG
670408 VVLADYGPLWKDHRRFALMTLRNFGLGKQSMEERILGEISHIVDFLDKNT 670557
670648 GKTVDPQIMFHNIASNVINLVLFGCRFDYNNEFLRGYIQRIAENLRILNGPWNM 670809
717005 IYDTFPLLRILPLPFKKAFDNVKIIKSMNRKLIDEHKSTRVPGQPRDFIDCYLDELDK 716832
716723 VKNCVST 716703 716703 FSEDQLIMNIMDMSFAGTDTTSNTLLAAFLYLMNHPDVQ ()
716439 VKCQQEIDDVLEGKDQVTYEDRHNMPYTLAVIHEVQRVANIVPLSVLHCTTRDTELMGYSIPK 716251 ()
716170 GTVIIPNLTVVLKEEGQWKFPHEFNPANFLNEQGQFEKPEAFIPFST 716030
713418 GPRVCLGEGLARMELFLIFVTLLRRF 713341
       QFVWPEDAGKPDYTPVFGLTMTPKPYRMHIRRRNTVKQ

CYP2X7-de9a Danio rerio (zebrafish)
            ctg22265.c CYP2X1 pseudogene? C-term 92% to 2X.b zfish41361-135c06.q1c        
            zfish45283253h10.q1k zfish43795-291e06.p1c
720930 LGEGLARMELFLVFVTLLRRFQFVWLEDAGKPDYTPVFRHTMTPKPYRMHIRRR 720769

CYP2X7-de9b Danio rerio (zebrafish)
            ctg22265.d CYP2X1 pseudogene? C-term 87% to 2X.b
727710 GPRVCLGEGLARMELFLVFVTLLRRYKFVWPRDAGKPDYTPVFGITMTPKPYRMLIRRRDTVQ 727519

CYP2X8 Danio rerio (zebrafish)
        ctg21275 87% to 2X.a
1267572 MLGSSVLVLICILLVFLLIRIQRPKNFPPGPSPLPIFGNLLHFNLANPLKEFER 1267411
1267339 FAEKYGNIFSLYTGSRPAVFLNSFAVIKEALVTKAQDFSGRPQDFMISHLTECKGN 1267172
1263893 VVLADYGPLWKDHRRFALMTLRNFGLGKQSMEERILGEISHVVGYLDKNI 1263744
1263630 GKTVDPQVMFHNVASNVISLVLFGRRFDYNSETLQCYIQLITEISKILNGPWNM 1263469
1262072 IYDTLPFLRILPLPFKKGFDHVKVLKGMNLKLIDEHKSTRVPGKPRDFIDCYLDELDK 1261899
1261775 RKNEVSTFSEDQLLMYILDLYFAGTDTTSNTLLTAFLYLMNHPEVQ 1261638
1261335 VKCQQEIDDVLEGKDQVSYEDRDNMPYTLAVIHEVQRVANTVPLSVFHCTTRDTELMGYSIPK 1261147
1261074 GTLIIPNLTIVLKEEGQWKFPHEFNPANFLNEQGQFEKPEAFIPFST 1260934
1269368 GPRVCLGEGLARMELFLVMVTLLRRFQFVWPNDAGKPDYTP
1269244 VYGVTLTPQPYRMHIKRRETVRX 1269179

CYP2X9 Danio rerio (zebrafish)
        ctg9731 exons 1-4 67% to 2X6 first 39 aa = 2X6 100%, FRAMESHIFT IN EXON 3
66640 MLGSSLLVVICILLIFFLIRVKKPKNFPPGPPPVPIFGNMLQLNINNPLKDFER 66479
66305 LANRYGNIYSLYFGSKPWVVLNGFEALKEALVTKAVDFAGRPQDLMVNRVTKGGGE 66138
65961 VILSDYGPSWKE  HRRFALMTLRNFGLGKQSMEERILGEVSHIIDKLEKR 65819
65727 GTAFDPQTMFHNAASNIICIVLFGSRYDYDDEFLKLFIHLYTENAKIANGPWAM 65566
ctg21275 exons 5-9 77% to 2X.b trace CF996180 joins these two contigs
1272272 IYDTFPMFRYLPLPFRKAFANASKARELSTQLVEEHKKTWVPGEPRDFIDCYLDELDK 1272099
1271302 RGNDGSSFSEAQLILYVLDLHFAGTDTTSNTLLTGFLYLMTHPEVQ 1271165
1269858 AKCQQEIDDVLEDKDQASYEDRHSMPYTQAVIHEVQRVANTVPLSVFHCTTKDTELMGYNIPK 1269670
1269601 GTFVIPNLGSALKEEGQWKFPHEFNPANFLNEQGEFEKPEAFVPFSA 1269461
1259306 GPRVCLGEGLACTELFLVFVTLLRRYKFVWPRDAGKPDYTPVFGITMTPKPYRMHIRWRNTVKQ 1259115

CYP2X10 Danio rerio (zebrafish)
        ctg24117.a 55% to 2X.b
57088 MLTALVLLCLGAFLLYLQLRIRRPKNFPPGPAPVPIFGNLLQLNRISPLKDFD 56930 (0)
56850 KFAQHYGSIYGIYIGSQPAVVLTGQKMIKEALITQAAEFSGRSNNMMFSHVTGGK 56686 (1)
56439 GVIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESA 56287 (1)
56129 GKSIDPQHLYHQAASNIIASIIFGSRFNYKDAYFQTLITSVEDLTKITIGPWAM 55968 (0)
55706 LYEIAPVLRIFPLPFQKAFQYFEQITKHVLKVVEEHKTSRVAGEPRDLIDCYLEEMEN 55533 (0)
55465 KSDHRTSFDESQMVTLLFDLFIAGTETTSNTLRTLTLYLMTYTHIQ 55328 (1)
54962 EQCQREIDEVLGARDHVTYEDRNAMHFVQAVIHEGQRVADIAPLSMFHSAKTDTQLRGYSIPK 54774 (0)
54580 GTIIIPYLSSSLREESQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSA 54440 (1)
54228 GPRVCLGENLARMELFLILVTVLRRFRLVWPKDAGEPDFTYIYGGTQSVKPYRVIVEPRMHGEACKFVD* 54019

CYP2X10 Danio rerio (zebrafish)
        GenEMBL AY825256
        Tseng, H.-P., Corley-Smith, C., Hu, C.-H., Wang-Buhler, J.-L.,
        Hseu, T.-H., and Buhler, D.R. 
        Submitted to nomenclature committee Oct. 14, 2004
        Clone 898HuHP
MLTALVLLCLGAFLLYLQLRIRRPKNFPPGPAPVPIFGNLLQLN
RISPLKDFDKFAQHYGSIYGIYIGSQPAVVLTGQKMIKEALITQAAEFSGRSNNMMFS
HVTGGKGVIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESAG
KSIDPQHLYHQAASNIIASIIFGSRFNYKDAYFQTLITSVEDLTKITIGPWAMLYEIA
PVLRIFPLPFQKAFQYFEQITKHVLKVVEEHKTSRVAGEPRDLIDCYLEEMENKSDHR
TSFDESQMVTLLFDLFIAGTETTSNTLRTLTLYLMTYTHIQEQCQREIDEVLGARDHV
TYEDRNAMHFVQAVIHEGQRVADIAPLSMFHSAKTDTQLRGYSIPKGTIIIPYLSSSL
REESQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSAGPRVCLGENLARMELFLILVTVL
RRFRLVWPKDEGEPDFTYIYGGTQSVKPYRVIVEPRMHGEACKFVD

CYP2X11 Danio rerio (zebrafish)
         ctg24117.b zfishI-a36g12.q1c EXONS 1-7 85% to CYP2 Length = 544
80259 MLTALVLLCLGAFLLYLQVRIRRPKDFPPGPAPVPFFGNLLQLNRINPIKDLDK 80420
80510 FAQHYGSIYGIYIGSKPAVVLTGQKMIKEALITQAAEFAGRPNHMMISHITRSKGS 80677
80848 VIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESA 80997
82938 GKSIDPQHLYHQAASNIIASVIFGSRFNYKDEYFQTLIQTMEKLTKIAIGTWAM 83099
83317 LYEIAPVLRIFPLPFWKAFHYFEKITRHSLKVVEEHKKSFVAGEPKDLIDCYLEEMKK 83490
83572 RADQRTTFDEAQMVTLLFDLYLAGTETTSNTLRTLTLF 83685
88976 EQCQREIDEVLGARDHVTYEDRNDMHFVQAVIHEGQRVADIVPLNVFHTARTDTQLRGYSIPK 89164
92540 GTIIIPYLSSSLREESQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSA 92680
92791 GPRVCLGENLARMELFLILVTVLRKFRLVWPKDAGEPDFTYIYGGTQSLKPYPMIVKLR 92967

CYP2X11-de1 Danio rerio (zebrafish)
             ctg24117.c EXON 1 
94557 MLTALVLLCLGAFLLYLQLRIRRPRNFPPGPAPVPIFGNLLQLNHINPIKDLDK 94718

CYP2X12 Danio rerio (zebrafish)
        GenEMBL AY825257 EST partial seq CN509498.1
        Tseng, H.-P., Corley-Smith, C., Hu, C.-H., Wang-Buhler, J.-L.,
        Hseu, T.-H., and Buhler, D.R. 
        Submitted to nomenclature committee Oct. 14, 2004
        Clone s898HuHP full length seq.
        91% to 2X10 
MLTALVLLCLGAFLLYLQLRIRRPRNFPPGPAPVPIFGNLLQLN
HINPIKDLDKFAQHYGSIYGIYIGSKPAVVLTGQKMIKEALITQGAEFAGRSNKMMVS
HVTRSKGVIMADYGESWREHRRFALTTLRNFGLGKKSMEQRILEEVKHICLLLEESAG
KPIDPQHLYHQAASNIIASIIFRSRFDYQDEYFQTLITTMEKLTKIAIGPWAMLYEIA
PVLRIFPLPFHKAFQYFEQITNHVLKVVEEHKTSRVAGEPRDLIDCYLEEMNRRSDKH
TTFDESQMVTLLFDLFIAGTETTSNTLRTLTLYLMTYTHIQEQCQREIDEVLGARDHV
TYEDRNAMHFVQAVIHEGQRVADIVPLSMFHTARTDTQLRGYSIPKGTIIIPYLSSSL
REEGQWKFPHEFNPQNFLNEKGEFVKNDAFMPFSAGPRVCLGENLARMELFLILVTVL
RKFRLVWPKDAEEPDFTYIYGGTQSLKPYPMIVKLRTPGETHEYAK

CYP2X13  Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr XIX (-) strand 19940206-19948532
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         72% to Fugu 2X2
MFASIILLLICIVFIVIQLKSRRPKNFPPGPPVWPILGNILDLSLENPLKDFERLRKTYGNVYSLFLGPKPVVVI
NEMKTIKEALVTKGVDFAGRPQDLLINDSSERELVMTDYGSSWKEQRRFALMNLRNFGMGKDSMEERIHGEIQYT
VDTLEKSIGKSFSPQNMFHNAASNIICQVLFGKRFEYEDETIKTVVQCFTENAKIANGPWAMIYDSFPLIRSLPL
PFRRAFKNVETCRKIAKSLMNEHKQTRVPGEPRDFVDCYLDRLDK (0)
PGDRSSFSEAQLTMYILDLHFAGTDTTSNTLLTGFLYLMNYPHVQ (1)
EPVFKYGNMIFKYFFI
ERCQQEIDMVLEGKDQASSEDRNNMPYVQ (0)
AVIHEFQRVANTVPLSIFHSTTKDTELNGYSIPKGTLIIPNLT
SVLNEEGQWKFPNEFNPENFLNDQGEFVKPEAFMPFSAGPRMCLGEGLARMELFLFTVTLLRKFKFIWPEDAGEP
DFTPVYGVTLTPKPYRMKVQLRVSQKIPH*

CYP2X14    Oryzias latipes (medaka)
           chr6 21423000:21438000
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           67% to Fugu 2X2
MFVSLILLWLCICILFLQLKPRRPKNFPPGPPVLPMLGNLLHLSLDNPLKDFDRLRNSYGNVYSLFLGPKPAVII
NGFKAMKEAMVIKATDFAGRPQDLFVNDVSKRKGVILADYGESWRDHRRFALMTLRNFGLGKKSMEERISEEIQH
TIKTLENNIGKLFSPQIMFHNAASNIICQVLFGKRFEYDDEIIKTIVQCFTRNSKIANGPWAMIYDSIPLIRKLP
LPFREAFKNAEICVDVGTHLVNEHKETRIPGKPRDFVDCYLDEMEKVRGDDSSFSEDQLIIYALDLHFAGTDTTS
NTLLTGFFYLINYPHIQDKCQQEIDRVLEEKQQVTFEDRHNMPYMQAVIHEVQRIANTVPLSVFHSTTKETELMG
YTIPKGTMIIQNMGSVLREDGQWKFPHDFNPENFLNEKGEFVKPEAFMPFSAGPRMCLGEGLARMELFIIMVTLL
RKFKFTWPEDAGEPDFTPVYGVTLTPKPYFMKVQLRSKP*

CYP2X fragment a    Fugu rubripes (pufferfish)
               No accession number 
               Scaffold_9193  Length = 9721 51% to scaf 4007
possible exon 1 of 2X3 or 2X4 
LGL47087.y1 Length = 725 2 family N-term exon 1
333 MLVSLALLLAAAFGLWVFFQIQRPKNFPPGPPPIPLFGNLLEIQLDNPIADLER 172 (0)

CYP2X fragment b     Fugu rubripes (pufferfish)
                     No accession number 
possible exon 2 of 2X3 or 2X4 
LED83776.x1 75% to scaf 4007 exon 2 not in new version of fugu databases
LAKRYGNVYGLFLGSRPAVVINGVSAL

2Y Subfamily

CYP2Y1      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_39a from an early version of the genome
12087 MDLTVMLLTATLLLVVLWILNAHTRKHTRLPPGPRGIPVLGNLLQLDKKAPFKSLLK 11917 (0)
11768 LSENYGPVLTVALGPQRTVVLVGYEAVKDALVDHADDFTGRGPVPFLMKVTRGY 11607 (1)
11166 GLAISNGERWRQLRRFTLTTLRDFGMGRKGMEEWIQEESKHLVTRIKSTE 11011 (1)
10937 GAPFDPTFFLSCTVSNVICCLVFGQ 10869 frameshift
10867 GFSYDDEHFLSLLHIISETIQFGSSASGL 10781 (0) 
10700 MYNLFPRLMEWLPGRHREMFGKIEKVRAFTMEKIEEHQDTLDPSSPRDYIDCFLMRLQQ 10524 (0)
10452 EKPQPNTEFNYDNLVSTVLNLYLAGTETTSSTIRYALNVLIRHPKIQ 10312 (1)
10187 EKMQEDIDSVIGQGRCPYVEDRKSLPFTDAVLHEIQRYLDMIPFSIPHYALQDISFRGYTIPK 9999 (0)
9924  DTLIIPLLHSVLKDDKMWETPGSFN 9850 frameshift
9850  PLQHFVDGNGSFKKNPAFLPFSAG 9779 (1)
9687  GKRACVGESVARHGDIPSLSSHLVQHFTLSX 9595 frameshift
9593  PGGPDSVDLTPEYSSFANVPRKYKIIATPRCNKRLCIVI* 9471

GenEMBL CAAB01000830 WGS section of Genbank 25-JUL-2002
Note: the frameshift in exon 7 did not exist in the earlier version above
This is probably a sequence error
19218 MDLTVMLLTATLLLVVLWILNAHTRKHTRLPPGPRGIPVLGNLLQLDKKAPFKSLLK (0) 19048
18899 LSENYGPVLTVALGPQRTVVLVGYEAVKDALVDHADDFTGRGPVPFLMKVTRGY (1) 18738
18297 GLAISNGERWRQLRRFTLTTLRDFGMGRKGMEEWIQEESKHLVTRIKSTE (1) 18148
18074 GAPFDPTFFLSCTVSNVICCLVFGQRFSYDDEHFLSLLHIISETIQFGSSASGL (0) 17913
17832 MYNLFPRLMEWLPGRHREMFGKIEKVRAFTMEKIEEHQDTLDPSSPRDYIDCFLMRLQQ (0) 17656
17585 EKPQPNTEFNYDNLVSTVLNLYLAGTETTSSTIRYALNVLIRHPKIQ (1) 17445
17323 EKMQEDIDSVIGQGRCPYVEDRKSLPFTDAVLHEIQRYLDMIPFSIPHYALQDISFRG 17147
17145 YTIPK (0) 17131
17056 DTLIIPLLHSVLKDDKMWETPGSFNPQHFLDGNGSFKKNPAFLPFSA (1) 16916
16823 GKRACVGESLARMEIFLFVVSLVQHFTLSCPGGPDSVDLTPEYSSFANVPRKYKIIATPRWQ* 16635

CYP2Y2      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_39b from an early version of the genome
15595 MEFSVTLILAGLVLAFFWFILQKRKYNLPPGPTTLPLVGNLPQLDKKQPFKSFTE 15431 (0)
15356 LSKSYGPVMTLYLGWQRTVVLTGYEVVKEALVDQAEDFTGRGPLPFLLKATNGY 15195 (1)
15078 GLGISNGERWRQLRRFTLSTLRDFGMGRKGMEEWIQEESKHLTARIKTLK 14944 (1)
14815 VKPFDPTFLLGCTVSNVICCMVFGERFSYDDKQFLELLRVIAEVLRFNSSFLGQ 14654 (0)
14549 MYNVFPWILEHLPGPQHTMFSHVNFLREFIKKKIQEHKESLDPSSPRDYIDTFLIRMEQ 14373 (0)
14282 EKNLPNTEFHYENLVSTVLNLFLAGTETTSSTLRYALGVLIKHPNVQ 14142 (1)
14046 EEMQREIDNVVRQDQCPKMEDRKSLPFTDAVIHEVQRFLDIVPFGLPHYALKDITFRGYSIPK 13858 (0)
13775 GTVIIPLLHSVLKGDQWETPWAFNPKHFLDQNGSFKKNSAFLPFSA 13638 (1)
13390 GKRSCVGESLARMELFIFLVTLLKDFTFSCIEGPDSISLNPQYSGFANLPRNYEIVATPR* 13208

GenEMBL CAAB01000830 WGS section of Genbank 25-JUL-2002
22434 MEFSVTLILAGLVLAFFWFILQKRKYNLPPGPTTLPLVGNLPQLDKKQPFKSFTE (0) 22270
22195 LSKSYGPVMTLYLGWQRTVVLTGYEVVKEALVDQAEDFTGRGPLPFLLKATNGY (1) 22034
21935 GLGISNGERWRQLRRFTLSTLRDFGMGRKGMEEWIQEESKHLTARIKTLK (1) 21786
21654 VKPFDPTFLLGCTVSNVICCMVFGERFSYDDKQFLELLRVIAEVLRFNSSFLGQ (0) 21493
21388 MYNVFPWILEHLPGPQHTMFSHVNFLREFIKKKIQEHKESLDPSSPRDYIDTFLIRMEQ (0) 21212
21121 EKNLPNTEFHYENLVSTVLNLFLAGTETTSSTLRYALGVLIKHPNVQ (1) 20981
20885 EEMQREIDNVVRQDQCPKMEDRKSLPFTDAVIHEVQRFLDIVPFGLPHYALKDITFRGYSIPK (0) 20697
20614 GTVIIPLLHSVLKGDQWETPWAFNPKHFLDQNGSFKKNSAFLPFSA (1) 20477
20296 GKRSCVGESLARMELFIFLVTLLKDFTFSCIEGPDSISLNPQYSGFANLPRNYEIVATPR* 20047

CYP2Y3      Danio rerio (zebrafish)
            GenEMBL ESTs CK016257, CK869788, CK706387, CB891035
            Zebrafish blast server May 04 sequence NA1608
            62% to 2Y1 and 64% to 2Y2, 45% to CYP2B6, 45% to 2B3
30425 MELSSSLLLVLVLTVLMLIRWRRKENGLSLPPGPLALPLIGNLLTLDKSAPFKSFMK 30595 (0)
32151 WRKTYGSVMTVHLGPQRMVVLVGYETVKEALVDQAEDFAPRAPIAFMNRIVKGY (1?)
      GLAISNGERWRQLRRFTLTTLRDFGMGRKQMEQWIQEESRYLLKSFEETK 32519 (1?)
32651 SKPVDPTFFFSRTVSNVICSLVFGQRFDYEDKNFLQLLQIISKLLRFLSSPWGQ 32812 (0?)
33063 LYNIFPQVMERFSSRHHAILKDVENIRTFIRNKVKEHEQRLDFSDPSDFIDCFLIRLTQ 33239 (0?)
33356 EKDKRKLDTEFHKDNLMATVLNLFVAGTETTSTTLRYALMLLIKHPQIQ 33502 (1?)
34553 EQMQREIDRVIGQNRIPTMEDRKSLPFTDAVIHEVQRYMDIVPLSLPHYAMKDITFRGYKIPK 34741 (0)
34907 DTVIIPMLHSVLRDEGQWETPWTFNPEHFLDSNGNFKKNPAFMPFSA 35047 (1)
35424 GKRSCVGESLARMELFLFTVSLLQKFTFSSPNGPDGIDLSPELSSFANMPRFYELIASPR* 35606

CYP2Y4      Danio rerio (zebrafish)
            GenEMBL EST AL916779 
            Zebrafish blast server May 04 sequence NA1608
42397 MELSSSLLLVLVLTVLMLIRWRRKENGLSLPPGPLALPLIGNLLTVETSAPFKSFMK 42567 (0)
      missing exon 2
      XXXXTNGERWRQTERFTLTTLRDFGMGRKRMEQWIQEESRYLLKSFEETK
      SKPVDPLFFMSRAVSNVICSLVFGQRFDYEDKNFLQLLQIISNLMRFASSPWGQ
      LYNIFPKVMEILPGRHHTMFGEIDDLKSSIMTII 44325
44326 KEHEENLDPSDPKDFIDCFLIRLNQ (0?)
      QEKHNPDT 44524 44525 EFHKENMFATSLNLFTAGTETTSTTLRYALMLLIKHPHIQ 
44989 EQMQREIDCVIGQNRIPTMEDRKSLPFTDAVIHEVQRCLDIAPLNVPHYALKDITFRGYKIPK 45177 (0)
      DTVIIPMLHSVLRD 45348 45349 EGHWETPWTFNPEHFLDSNGNFKKNPAFMPFSA 45447 (1)
46751 GKRVCVGESLARMEIFLFIVSLLQKFSFSSPNGPDSIDPSPELSSFGNMPRLYELIASPR 46930

CYP2Y5   Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr I (-) strand 16588689-16592714
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         68% to Fugu 2Y1, 70% to 2Y2
MDFSATVFLAGLILALLWLFGVKNRRKYLLPPGPFALPLIGNLPQLDKNAPFKSILKFSETHGPVMTVHLGWQRV
VFLVGYDAVKEALVDQGDDFTGRGPLPFLMKVTKGYGLAISNGERWRQLRRFSLSTLRDFGMGRKGMEVWIQEES
RHLRARMESFKASPFNPRFLLSRTVSNVICCLVFGERFGYEDKKFLHLLNTISEVLDFLNSPVGQLYNIFPWLMG
HLPGSQHACFAKAEKLREFIETKIHQHKATLDPSSPRDFIDCFLIRINQEKDNPKTEFHYENLISTVLNLFLAGT
ETTSSTIRFALSVLIKYPNIQEKMQTEIDGVIGQSCVPSMENRKSLPFTDAVIHEVQRFLDIVPFSIPHYALHDI
SFRGYTIPKDTMIIPMLHSVLKEERNWATPQSFNPQHFLDQNDNFKKNPSFLPFSAGKRACVGESLARMELFIFL
VSLLQNFTFSSTGGPDSINLIPEYSSFANLPRTYQIIATPR*

CYP2Y6     Oryzias latipes (medaka)
           chr 13 2357422:2368485
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           68% to Fugu 2Y1
MDLSTSLILVVLTTVLLWLLNRRNSRKQHLPPGPPALPLIGNLLQLDKKRPFRTIVELSKTHGPVMTIYMGWQRA
VALVGYDAVKEALVDQADDFVGRAPLPFLYRATRGYGIGISNGERWRQLRRFALTTLRDFGMGRKGMEQWIQEES
RHIRAKINTFKGKPFDPTFILSCTVSNVICCLVYGERFNYDDKQFLELLQIISEVPRFNSSPMGAMYNLFPWLME
RLPGRQHTIFGYIEDIRKFAKNKIQEHKDKLDPSSPRDFIDCFLLRMDQEKDNPTSEFHYENLLAMVLNLFLAGT
ETTSSTIRYALSVLIKHPKIQEKMQEEIDSVIGRERCPSMEERKSLPFTDAVIHEVQRFMDLTPFSLPHYSLKDI
SFRGYTIPKDTMIFPMLHSVLREDKLWSSPWSFNPQNFLDQNGNFKKNPGFVPFSAGKRACVGESLARMELFLFI
VSFLQDFTFSAPNGPDSINLVPEYSSLANLPRRYELIATPR

2Z Subfamily

CYP2Z1      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_2993a
      MGLIVSVFGSHADWSISTLLLFTAVFILMVNWIRNRRPPSFPPGPWTLPVVGNMHNLAHHRMHLNLME (0)
16293 LAETYGNVFSIQLGQEWMVVLNGPTILKEALVNQGDSVADRPNLQLIIDSCHGL (1)
16785 GLGFSSGHLWKQQRQFAISTLRYFGSGSKSLEPVVLEEFAHCAKQFSEFK 16937 (1)
17023 GKPFAPQLMFYNIVTNIICSLVFGHRFEYGDKNFEKLMNSFGRCLQIEASVCAQ 17184 (0)
17262 LYNSFPRLMGCLPGPHQTVKRIYQNIRDFIREEMKEHKKGLDPSTPRDYIDCYLNKIKK 17435 (0)
      XXXXXXXXXXXNL (fs) VIX (fs) VWNX (fs) FVPX (fs) TNT (fs) TTFTYRWLFLFMA 
      (fs) KYPEMQ (1)
17899 EKVQAEIDEVIGQSRRATMDDCVNMPYTNAVIHESLRMGNVVPLSLLHATGRDIQLEGYTIPK 18087 (0)
18158 GTTVIANLTSALFDKNEWETPFAFNPGHFLDEEGRFRKRTAFLPFSA (1)
18388 GRRLCLGENLARMMLFLFFTSFMQDFTISFPAGVSPAMEYHHFGVTLAPHPFDICAVSR* 18567

CYP2Z2      Fugu rubripes (pufferfish)
            No accession number
            Scaffold_2993b
      MHWIFDLIGSFLAGDFKSLLFFLLIFILTADYLRNRRSGSFPPGPMAIPIIGNMLSLDRSRTHESLTQ (0)
21437 LAETYGNVYSLRTGQTWMVVVNSFKVVREALVTHGESVSDRPDLPLQDEIAHGK 21273 (1)
20946 GVISSNGHLWKQQRRFALSTLRLFGFGKKSLEPFITDEFTHCANIFRSYK 20815 (1)
20726 GKPLPPHLILNNVVSNIICSLVFGHRFEYGDKNFKNLIKLFDQSLQIEASVWAE 20565 (0)
20473 LYNSFPLLMKHVPGPHQTVKKIWNEVKDFVRNELKEHRKNWDPSDPRDYIDCYLREIQX 20300 (?) gap at boundary
19990 SGQSDSTFDEENLVICVMDLFVPGSETTSTTLRWAFLYMAKYPEIQ (1)
19748 EKVQAEIDRVVGQSRPLTMDDRVNLPYTDAVLHEIQRFGNIVPLSLPHVTNKAIQLEGYNIPK 19560 (0)
19470 GIMIIPNLTSALFDKNEWETPCTFNPGHFLDNEGKFRKRAAFIPFSA 19330 (1)
19220 GKRLCLGENLARMELFLFFTSFMQHFTFSMPAGVKPDMSFRFGVTLAPKPYEICAIPR* 19044

CYP2Z3   Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr VIII (-) strand 15162832-15165857
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         51% to Fugu 2N9, 71% to CYP2Z2, probable ortholog of CYP2Z2
MDSIFSICGSYFTLDVKSFLLFAVVFLLSADYIKNRRPGSFPPGPPALPIVGHIFNLDYKRVHVSLTQ
LAGRYGDVYSLRMGHRWMVVLNGITVLKEALVTQGDSLADRPDLPLQHDIAHGL
GVIFSNGNTWKQQRRFALSALRHFGFGK
KSLEPVILDEFTYCVKDFNSHKGKPFDPHLIVNNVVSNVICSLVFGHRFEYGDEKFLKLMKWFGDALELEASIWA
QLYNSFPVLMRRLPGPHKDLQHIWNNVKDFIGVELKEHKQNWDPSDQRDYIDCYLNEIQTGQADNTFDEENLVLC
VLDLFLAGSETTSTTLRWAFLYMVKYPEIQAKVQAEIDRVIGQSRLPSMEDRANMPYTDAVIHEVQRMANIVPLS
LPHITSKDIQLGGYTIPKGVTIIPNLTSVLFDKNEWETPDTFNPGHFLNAEGKFVKSAAFIPFSAGKRLCLGENL
AKMELFLFFTSFMQRFTFSMPPGVKPVMDFRFGITLAPFPYEVCVTSR*

CYP2Z4   Gasterosteus aculeatus (three-spined stickleback)
         UCSC browser Chr VIII (-) strand 15162832-15165857
         Joanna Wilson and students
         submitted to nomenclature committee Nov. 6, 2007
         missing exon found in ESTs DN671369.1, DW642948.1
         revised seq 59% to 2Z2
MDQLSGVSSTWLWLDGRSLLLFTLVVLVTAEYLRARRPSGFPPGPWPFPLVGNMFSLDPSNVHGDMTK (0)
LAEKYGKVYSLKMGPLWSVVLNGLSAVQEGLAEGDYANGRPDFAIHSDVLPEL (1)
GIVFSNGH
SWKQQRRFALITLKYFGVGKKSLESSILEEFIHASKEIASHEGKPFKPNVLMRNAVSNIICALVFGHRFEYSNEK
FQKMLTLLDNGTRIEASIWAQMYNAFPVLMRRLPGPHRTLQGIYGEILDLIKTEVDQHREDFNPSEPRDFIDCYL
NEMEKVADAGFNEDNLLMCSFDLFGAGTETTSTTLLWAFLYMAKYPEIQAKVQAEVGRVIGPSRQPSMKDRANMP
YTDAVIHEVQRIGNIVPLSLPHITSRDVQLGGYTIPKGVTIIPNLTSVLFDKNEWETPDTFNPGHFLNEEGKFVK
PAAFIPFSAGRLCLGENLARMELFLFFSSFMQRFSWSMPAGVEPLLKPRFGITLSPEPYEICAISR*

CYP2Z5     Oryzias latipes (medaka)
           chr4 31513782:31524077
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           70% to Fugu 2Z2
MDLFSSTIGLMLEWDLKSLLLFLSVFIITADYIKNRRPLSFPPGPPGLPILGNIFTVDVGRPHESFSKLAAEYGD
LYSLRFGQRWTVVLNGHKALKEALVTKGDSVVDRPHLPLQDEIAKGLGVIFSNGANWTEQRRFALSTLRYFGFGK
KSLEPVILNEFAHCAEELKRFKGEPLDPHLIINNTVSNIICHLVFGHRFNYGDKKFKKLMLLFDRALQIEASIWA
QLYNSFTLIMRCLPGPHKTLQHIWREVQDFIGEELKEHKKSWDPSDARDYIDFYLTEIQKTKGQEGSTFDEENLI
MCVLDLFVAGSETTSTTLRWAFLYMAKYPEIQEKVQAEIHKVIGKSRPPCMEDRAELPYTDAVIHEVQRIGNIVP
LSLPHATNKDVQLGGFTIPKGVLIIPNLTSVLFDEKEWETPHAFNPGHFLNKDGKFVKRGAFIPFSAGKRLCLGE
NLARMELFLFFTSFMQHFSFSMPAGVEPVLDYRAGLTLAPKPYKICVQASSEK*

2AA Subfamily

CYP2AA1     Danio rerio (zebrafish)
            GenEMBL AF497969
            Afonso Bainy and John Stegeman
            74% to 2AA2 
            submitted to nomenclature committee 4/5/02
MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRINSYKFRFPPGPT
PLPFVGNLPHFLKSPMEFIRSMPQYGEMTTIFFGRKPVIMLNTIQLAKEAYVQDAFSG
RPAIPLFDWITNGLGIVMVTFNNSWRQQRRFALHTLRNFGLGKKTVEDRVLEESRYLI
AEMLKEEGKSMNPQHALQNAISNIICSIVFGDRFEYDNKRFEYLLKTLNENIMLAGSA
AGQIFNLVPFIKHFPGPHQKIKQNADELLGFIRDEAKEHKQTLDPDSPRDFIDAYLLE
IEKQKSSKDSTFHEENLVVSASDLFLAGTDTTETTIRWGLINLIQNPDVQERCHEEIV
RVLGYDRLPSMDDRDKLPYTLATVYEIQRCANIAPNVMHQTILPTRLHGYDIPQGTII
LTNLAAIFSNKDHWKHPDAFNPENFLDENGHFSKPESFIPFSLGPRVCLGETLARTEL
FLFITALLQRIRFSWPPDAKPIDMDGIMGLVRSPQTFNVVCHSRDNVK

CYP2AA1     Danio rerio (zebrafish)
            Chr 23
2AA1 partial seq missing exons 7-9 (broken gene may indicate incorrect genome assembly here)
	1 	66 	+ 	Chr:23 	38951974 	38952171 	- 	345 2AA1 2 diffs
211044 MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRIHSYKGRFPPGPTPLPFVGNLPHFLKSPMEFIRS 210850
	62 	117 	+ 	Chr:23 	38949574 	38949741 	- 	453 2AA1 1 diff	
208602 MPQYGEMTTIFFGRKPVIMLNTIQLAKEAYVQDVFSGRPAIPLFDWITNGL 208450
	116 	214 	+ 	Chr:23 	38945155 	38945454 	- 	439 2AA1 1 diff
204324 GIVMVTFNNSWRQQRRFALHTLRNFGLGKKTVEDRVLEESRHLIAEMLKEE 204172
	168 	268 	+ 	Chr:23 	38944809 	38945129 	- 	471 2AA1 100%
204002 GKSMNPQHALQNAISNIICSIVFGDRFEYDNKRFEYLLKTLNENIMLAGSAAGQ 203841
	222 	395 	+ 	Chr:23 	38944376 	38944861 	- 	529 2AA1 100%	
203734 IFNLVPFIKHFPGPHQKIKQNADELLGFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 203561
	                      Chr:23 	38944462 	38944602 	-          2AA1 1 diff
203475 QKPSKDSTFHEENLVVSASDLFLAGTDTTETTIRWGLINLIQNPDVQ 203335

3 exon fragment (exons 7,8,9) of a 2AA1-like sequence.  This gene is broken by an insertion of 2AA8 exons 1-8.
I think the 2AA8 sequence needs to be moved to reunite the 2AA1 fragments and make a whole 2AA1 and
a whole 2AA8.

	318 	388 	+ 	Chr:23 	38934350 	38934556 	- 	550 2AA1 2 diffs
       ERCHEEIVRVLGYDRLPSMDDRDKLPYTLATVYEIQRCANIAPNVMHQTILPTKLHGYNIPQ 193223
	359 	440 	+ 	Chr:23 	38931225 	38931455 	- 	446 2AA1 1 diff	
       GTIILTNLAAIFSNKEHWKHPDAFNPENFLDENGHFSKPESFIPFSL 190116
	436 	498 	+ 	Chr:23 	38927597 	38927785 	- 	549 2AA1 100%
186658 GPRVCLGETLARTELFLFITALLQRIRFSWPPDAKPIDMDGIMGLVRSPQTFNVVCHSRDNVK 186470

8 aa diffs to original Stegeman sequence AF497969, 3kb upstream of 2AA10
211044 MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRIHSYKGRFPPGPTPLPFVGNLPHFLKSPMEFIRS 210850
208602 MPQYGEMTTIFFGRKPVIMLNTIQLAKEAYVQDVFSGRPAIPLFDWITNGL 208450
204324 GIVMVTFNNSWRQQRRFALHTLRNFGLGKKTVEDRVLEESRHLIAEMLKEE 204172
204002 GKSMNPQHALQNAISNIICSIVFGDRFEYDNKRFEYLLKTLNENIMLAGSAAGQ 203841
203734 IFNLVPFIKHFPGPHQKIKQNADELLGFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 203561
203475 QKPSKDSTFHEENLVVSASDLFLAGTDTTETTIRWGLINLIQNPDVQ 203335
       ERCHEEIVRVLGYDRLPSMDDRDKLPYTLATVYEIQRCANIAPNVMHQTILPTKLHGYNIPQ 193223
       GTIILTNLAAIFSNKEHWKHPDAFNPENFLDENGHFSKPESFIPFSL 190116
186658 GPRVCLGETLARTELFLFITALLQRIRFSWPPDAKPIDMDGIMGLVRSPQTFNVVCHSRDNVK 186470
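
Difference counts such as "2 diffs" or "8 aa diffs" can be reproduced by comparing two sequences position by position. The sketch below is a minimal Python example, assuming the two sequences are already aligned without gaps and have equal length. The percent values quoted in this list may come from gapped alignments, so this simple ungapped measure will not always agree with them. The function names are hypothetical. The two strings are the first 44 residues of the AF497969 entry and of the genomic exon 1 given above.

```python
def count_diffs(a: str, b: str) -> int:
    """Count mismatched positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y)


def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity; an assumption, not the method used for this list."""
    if not a:
        raise ValueError("sequences must not be empty")
    return 100.0 * (len(a) - count_diffs(a, b)) / len(a)


cdna = "MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRINSYKFRFPPGPT"     # AF497969, first 44 aa
genomic = "MLVGLVKLDLASVGLTLFLGLIFLVLFEIFRIHSYKGRFPPGPT"  # Chr 23 exon 1, first 44 aa

print(count_diffs(cdna, genomic))  # 2 (N/H at position 33, F/G at position 37)
```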

CYP2AA2     Danio rerio (zebrafish)
            AI657973 fc19c11.y1, AI958603 fc94a10.y1, AI544967 fb69h12.y1
            BI887677 AI444248 fb40e01.y1
            zfishC-a1385b03.q1c zfishC-a2172h09.q1c zfishG-a67c10.q1c
            these last three are from the zebrafish blast server
            48% to 2J1 74% to CYP2AA1
            intron phases from closely related zebrafish genomic sequences
MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFNRL (0) exon 1
MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPAIDWTSNGC (1) exon 2
GIIMATFNNSWKQQRRFALHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE (1) exon 3
GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ (0) exon 4
IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK (0) exon 5
QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ (1) exon 6
ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ (0) exon 7
GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGV (1) exon 8
GPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFSIICCSRDTKE* exon 9
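
Exon-by-exon listings like the one above can be joined into a single protein sequence while keeping the intron phases. In the usual convention, a phase 0 intron falls between two codons, a phase 1 intron after the first base of a codon, and a phase 2 intron after the second. The following is a minimal Python sketch, assuming each line has the form `SEQUENCE (phase) exon N`, with the phase omitted on the last exon. The `assemble` function is a hypothetical helper.

```python
import re

# Assumed layout: peptide, optional "(phase)", then the literal "exon N".
EXON = re.compile(
    r"^\s*(?P<seq>[A-Z*]+)\s+(?:\((?P<phase>[012])\)\s+)?exon\s+(?P<num>\d+)\s*$"
)


def assemble(lines):
    """Return (protein, phases) with exons ordered by their exon number."""
    exons = []
    for line in lines:
        m = EXON.match(line)
        if m is None:
            continue  # skip header or comment lines
        phase = m.group("phase")
        exons.append(
            (int(m.group("num")), m.group("seq"), None if phase is None else int(phase))
        )
    exons.sort(key=lambda e: e[0])
    protein = "".join(seq for _, seq, _ in exons)
    phases = [phase for _, _, phase in exons]
    return protein, phases


lines = [
    "GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGV (1) exon 8",
    "GPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFSIICCSRDTKE* exon 9",
]
protein, phases = assemble(lines)
print(len(protein), phases)
```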

CYP2AA3v1  Danio rerio (zebrafish)
           BC055136 ctg14330 Zv3 05/2004 zfishC-a1177h12.q1c Z35723-a631b05.p1c
zfishI-a76h10.q1c
131529 MFTALLKLDLASVGLTLFLGLIILVLFEIFRIHSYKRRTPPGPTPLPFVGTIPHFLKNPLGFIR 131720
131812 MSQYGDMSTMYLGRKPAIFLNTIQLAKETLVQDTFSGKPYLPVIEWISKGFG 131967
132158 GIAMVTFNHSWRQQRRFALHTLKNFGLGKKSVEDRVLEESRYLIAEM
LKDEGKPVDPHHPIQNAVSNIICSIVFGDRFEYNNKRFEYLLKMLNETIMLAGSASGR
IFNLVPFIKHFPGPHQKIKKNTDDLLIFLGDEVEEHRKTLDPGSPRDFIDAYLLEIEK
QKFNKDSTFHEGNLLASAGDLFMAGTDTTETTIRWGLLLLIQNPDVQERCHEEIVRVL
GYDRLPSMNDRDRLPYTLATVHEIQRYGNIIPILIHETILPTKLQGYSIPQGTTIVTN
IQAIFSSKDHWKHPDTFNPENFLEDGHFIKPESFIMFSLGPRSCLGEMLARTELFLFI
TSLLQRIHFSWPPDAKPIDMDGIFGLVHSPQTFNVICRSRDTK

CYP2AA3v1 Danio rerio (zebrafish)
        GenEMBL AL923007 
        Tseng, H.-P., Wang-Buhler, J.-L., Hu, C.-H., 
        Hseu, T.-H., Peng, J.R. and Buhler, D.R. 
        Submitted to nomenclature committee Oct. 14, 2004
        JR12
        Note: multiple ESTs and mRNAs support both 2AA3v1 and 2AA3v2
        Even though they only have 6-8 aa differences

CYP2AA3v2  Danio rerio (zebrafish)
           GenEMBL CK698285.1 EST and UCSC genomic seq.

CYP2AA3v2  Danio rerio (zebrafish)
           ctg14330 (7 aa diffs in the last four exons to 2AA3v1)
131529 MFTALLKLDLASVGLTLFLGLIILVLFEIFRIHSYKRRTPPGPTPLPFVGTIPHFLKNPLGFIRS 131723(0)
131812 MSQYGDMSTMYLGRKPAIFLNTIQLAKETLVQDTFSGKPYLPVIEWISKGF 131964 (1)
132158 GIAMVTFNHSWRQQRRFALHTLKNFGLGKKSVEDRVLEESRYLIAEMLKDE
133974 GKPVDPHHPIQNAVSNIICSIVFGDRFEYNNKRFEYLLKMLNETIMLAGSASGR 134135 (0)
136328 IFNLVPFIKHFPGPHQKIKKNTDDLLIFLGDEVEEHRKTLDPGSPRDFIDAYLLEIEK 136501 (0)
136597 QKFNKDSTFHEGNLLASAGDLFMAGTDTTETTIRWGLLFLIQNPDVQ 136739 (1)
       ERCHEEIVQVLGYDRLPSMDDRDRLPYTLATVHEIQRYGNIIPILIHETILPTKLQGYSIPQ 140163 (0)
143088 GTTIVTNIQAIFSSKDHWKHPDSFNPENFLEDRHFIKPESFIMFSL 143225 (1)
143308 GPRSCLGEILARTELFLFITSLLQRIHFSWPPDAKPIDMDGIFGLVHSPQAFNVICRSRDTK 143493

CYP2AA4 Danio rerio (zebrafish)
        ctg14330 77% to 2AA1 missing exons 1,2 dup exons 3,8
zfishB-a33e04.q1c zfishB-a46b05.q1c zfishC-a2901c10.p1c zfishK-a149h03.q1c
AI266900 (exons 1,2,3)
This is an older version of the sequence, use the newer version below
     MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFNRL
     MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPVIDWTSNGC
 716 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSIESRVLEESQYLFAELLKDE 868
1337 GKSMNPQHALQNAVSNIICSIVFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 1498
4286 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLEIEK 4459
     QKSNKDSTFQEENLIGSAIDLFFAGTDSTATSIRWGLLFLIQNPDVQ 4684
5758 ERCHEEIVQVLGYDRLPCMDDCDRLPYTHATVHEIQRFAKTVPFGVFHETIWPTKLHGFDIPQ 5946
6575 GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESYIPFSL 6715
8861 GLRACIGESLVRTELFLFATVLLQRIHFSWPPNAKPIDMDGIMGLVHSPQTFNVICRSRDTK 9046

CYP2AA4-ie3 Danio rerio (zebrafish)
            ctg14330 dup exon 3
1089 QRRFALHTLRNFGLGKKTVEDRVLEESQYLIAEMLKDEG

CYP2AA4-ie8 Danio rerio (zebrafish)
            ctg14330 dup exon 8
6375 GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESY 6500

CYP2AA4 Danio rerio (zebrafish)
        Chr 23
96% exons 4,9 do not match older version of 2AA4 above
CK697338.1 only has three diffs in exon 4 to this seq
EB851360.1 matches exon 3 and exon 4 to YDNK 100%
EB982730.1 matches exon 4 and part of exon 5 with 1 diff near the end
There is EST support for this exon 4 in context.
No ESTs match the old exons 4 or 9
No exact match for the old exons 4 or 9 is found in the new assembly	
275732 GEPVNPHHALQNAVSNIFCSIMFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 275571
278301 MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGFNRL 278107
278025 MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPVIDWTSNGC 277873
276540 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSIESRVLEESQYLFAELLKDE 276388
       GEPVNPHHALQNAVSNIFCSIMFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 275571
272893 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLEIEK 272720
272635 QKSNKDSTFQEENLIGSAIDLFFAGTDSTATSIRWGLLFLIQNPDVQ 272495
271423 ERCHEEIVQVLGYDRLPCMDDCDRLPYTHATVHEIQRFAKTVPFGVFHETIWPTKLHGFDIPQ 271235
270806 GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESYIPFSL 270666
268511 GLRACIGESLVRTELFLFATVLLQRIHFSWPPDAKPLDMDGIVGIVRYPQTFSIICCSRDSKK 268323

	                      Chr:23 	39019324 	39019347 	-          2AA4 100%
278301 MFGALVKLDLSSVGLTLLLGLILLVLFEIFRIHSYKSRFPPGPTPLPFVGNLPHLLRDPMGF 278116   
	62 	117 	+ 	Chr:23 	39018997 	39019164 	- 	315 2AA4 100%	
278025 MAQYGEMSTMYLGKKPAIVLNTIQVAKEALVQEAFAGRPCLPVIDWTSNGC 277873
	116 	168 	+ 	Chr:23 	39017512 	39017670 	- 	375 2AA4 100%	
276540 GIIMATFNNSWKQQRRFALHTLRNFGLGKKSIESRVLEESQYLFAELLKDE 276388
	55 	222 	+ 	Chr:23 	39016695 	39017207 	- 	347 zfishB-a496h01.q1c 100%	
       GEPVNPHHALQNAVSNIFCSIMFGERFDYDNKRLGYLLKILNENMMLTGSAIGQ 275571
	222 	292 	+ 	Chr:23 	39013805 	39014020 	- 	422 2AA4 100%	
272893 IFNLAPFIKHFPGPHQKIKKNSNELYSFIEDEVEEHRKTLDPVSPRDFIDAYLLEIEK 272720
	273 	326 	+ 	Chr:23 	39013622 	39013774 	- 	305 2AA4 100%	
272635 QKSNKDSTFQEENLIGSAIDLFFAGTDSTATSIRWGLLFLIQNPDVQ 272495
	327 	388 	+ 	Chr:23 	39012362 	39012550 	- 	394 2AA4 100%	
271423 ERCHEEIVQVLGYDRLPCMDDCDRLPYTHATVHEIQRFAKTVPFGVFHETIWPTKLHGFDIPQ 271235
	388 	440 	+ 	Chr:23 	39011775 	39011936 	- 	406 2AA.e 100%	
270806 GTMIMTNLAAIFSSKEHWKHPDTFNPENFLDENGHFSKPESYIPFSL 270666
433 	498 	+ 	Chr:23 	39009450 	39009644 	- 	408 	new 84% to 2AA4
268511 GLRACIGESLVRTELFLFATVLLQRIHFSWPPDAKPLDMDGIVGIVRYPQTFSIICCSRDSKK 268323
	                      Chr:23 	39003564 	39003746 	-           2AA5 exon 9 2 aa diffs 5.7kb downstream
262622 GPRSCLGETLAKTELFLFITSLLQRIRFSWLPDAKPLDMDGIMGIVRYPQPFSIICCSRDTK 262437
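
The tab-separated hit lines interleaved with the exon sequences can be parsed and sanity-checked in the same spirit. This is a minimal Python sketch under stated assumptions: the first two columns are taken to be amino acid positions in the query, columns five and six chromosome coordinates, and the eighth column presumably an alignment score. None of these column meanings is stated in the list itself. `Hit`, `parse_hit` and `spans_agree` are hypothetical names.

```python
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    query_start: int   # assumed: first query residue
    query_end: int     # assumed: last query residue
    chrom: str
    chrom_start: int
    chrom_end: int
    strand: str
    score: int         # assumed meaning of the eighth column
    label: str


def parse_hit(line: str) -> Optional[Hit]:
    """Parse a full hit line; return None for lines missing the query columns."""
    tokens = line.split()
    if len(tokens) < 8:
        return None
    try:
        return Hit(
            int(tokens[0]), int(tokens[1]), tokens[3],
            int(tokens[4]), int(tokens[5]), tokens[6],
            int(tokens[7]), " ".join(tokens[8:]),
        )
    except ValueError:
        return None


def spans_agree(hit: Hit) -> bool:
    """True if the genomic span is exactly 3 nt per query residue."""
    residues = hit.query_end - hit.query_start + 1
    nucleotides = hit.chrom_end - hit.chrom_start + 1
    return nucleotides == 3 * residues


hit = parse_hit("\t62 \t117 \t+ \tChr:23 \t39018997 \t39019164 \t- \t315 2AA4 100%\t")
print(hit.label, spans_agree(hit))
```

A hit whose genomic span is longer than three nucleotides per residue is not necessarily wrong, since a hit may extend past an exon boundary, so a `False` result is only a cue to inspect the line.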

CYP2AA5X Danio rerio (zebrafish)
        ctg14330 90% to 2AA2
        This sequence discontinued since it is probably an incorrect 
        assembly of 2AA9
77910 MFTALLKLDLAFVGLTLFLGLIFLVVFEISRIYSYKCRFPPGPTPLPFVGNLPHLLKKPMEFIRX 78101
78196 LSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAGRPHLPIIEWITKGL 78348
78622 GIVMVTFNHSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE 78774
78980 GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ 79141
      IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK
81107 QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ 81249
85877 ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ 86065
86197 GTIIMTNLAAILSDKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGVG 86340
95083 PRSCLGETLAKTELFLFITSLLQRIRFSWLPDAKPLYMDGIMGIVRYPQPFIIICCSRDTK 95265

CYP2AA6 Danio rerio (zebrafish)
        NA16005 Exons 4-7, 9  fd54c03.y1 AW019538 = fd54c03.x1 AI658337 
fc21h01.y1 fc21h01.x1 CA473712 73% to 2AA1
      MLAALLKLDLSSVGLSLFLGLIFLVLFEIFRIHSCKGRFPPGPTPLPFVGSIPHFLNNPMGFIKS
      LSQYGEMTTVYPGRKPAIILNTLQLMKEALVQNGSSFSGRPPVPVFNWVTDGY
      GIVMATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISEFLKAE
 4444 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENIIQAGSLVGQ 4605 (0)
 7506 VFNLVPIIKHVPGPHQKIYQNGQAFKSFIRESVKEHRQTLDPDSPRDFIDAYLLEMEK 7679
 7891 QKSTQDSTFHEDNMVMAVGDLFLAGSDTTATTIRWGLIYLTQNPDVQ 8031
10145 ERCHEEIVRVLGFDRLPSMDDRDRLPYTLATVHEFQRCANLVPTGVPHETTQATKLRGYDIPQ 10334
10407 GTQILINLTEILTNKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 10547
      GPRACLGETLAKAELFLFVTSLLQRIRFSWPTGEKLPDMNGIFGIVRSPKPFNIICHSRGSKH* 10813

CYP2AA6-ie6 Danio rerio (zebrafish)
            NA16005 Duplicate exon 6 (3 aa diffs)
 9660 QKSTQDSTFHEDNMVMSVGDLFFAGTDTTATTIRWGLLYLTQNPDVQ 9800

CYP2AA6
	                       Chr:23 39065253       39065285 	-
324158 MLAALLKLDLSSVGLSLFLGLIFLVLFEIFRIHSCKGRFPPGPTPLPFVGSIPHFLNNPMGFIKS 324045
	62 	148 	+ 	Chr:23 	39064791 	39065066 	- 	308 2AA6 100%	
323927 LSQYGEMTTVYPGRKPAIILNTLQLMKEALVQNGSSFSGRPPVPVFNWVTDGY 323769
	52 	168 	+ 	Chr:23 	39062897 	39063256 	- 	349 2AA6 100%	
       GIVMATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISEFLKAE 321773
	168 	222 	+ 	Chr:23 	39056644 	39056808 	- 	320 2AA6 100%	
315681 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENIIQAGSLVGQ 315520
	221 	279 	+ 	Chr:23 	39053573 	39053749 	- 	389 2AA6 100%
312619 VFNLVPIIKHVPGPHQKIYQNGQAFKSFIRESVKEHRQTLDPDSPRDFIDAYLLEMEK 312446
	266 	326 	+ 	Chr:23 	39053221 	39053439 	- 	322 2AA6 100%
       QKSTQDSTFHEDNMVMAVGDLFLAGSDTTATTIRWGLIYLTQNPDVQ 312094
	325 	431 	+ 	Chr:23 	39051558 	39051914 	- 	321 2AA6 1 diff	
310781 ERCHEEIVRVLGFDRLPSMDDRDRLPYTHATVHEFQRCANL
        389 	459 	+ 	Chr:23 	39051431 	39051646 	- 	342 2AA6 100%	
310519 GTQILINLTEILTNKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 310379
	433 	494 	+ 	Chr:23 	39051255 	39051440 	- 	390 	2AA6 100%
310304 GPRACLGETLAKAELFLFVTSLLQRIRFSWPTGEKLPDMNGIFGIVRSPKPFNIICHSR 310128

CYP2AA7 Danio rerio (zebrafish)
        NA16005 Exons 1-7 83% to 2AA1 96% (6  diffs) to AI964243 EST269357 zfishG-a606c02.p1c
AI964243 probably = AI964242 and BQ605503 
17072 MFAALLKLDLASVGLTLFLGLIFLVLFEIFRINSYKGRFPPGPTPLPFVGNLPHSLKNPMEFIRS 17266
17365 MPQYGEMTTMYLGRRPAIVLNTIQLAKEAFVQDAFSGRPFLPVMDWVANGL 17517
17817 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKSVEDRVLEESRYLIAEILKGE 17969
19246 GKPMSPHHPIQNAVSNIICSIVFGDRFDYDNKRFEYLLELLNENFVLTGSAAGQ 19407
19622 IFNLAPFIKHFPGPHQKIKQNANELLAFIQGEVKEHKKTLDPDSPRDFIDAYLLEIEK 19795
      QKSNKDSTFHEGNLAISTADLFLAGTDTTSTTIRWGLLFLTQNPDVQ 20038
20933 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRCANLVPFGVIHETIQPTKLRGYDIPQ 21121
      GTVVMTNLAAILSDKEHWKHPDTFNPENFLDENGHFSKPESFIPFSL
      GPRFCLGETLAKMELFLFITSLLQRIRFSSPPDAKPIDMDGIMGIVRYPQPFSIICCSRDTKE*

	1 	66 	+ 	Chr:23 	39045108 	39045305 	- 	304 2AA7 100%
304178 MFAALLKLDLASVGLTLFLGLIFLVLFEIFRINSYKGRFPPGPTPLPFVGNLPHSLKNPMEFIRS 303984 
	62 	117 	+ 	Chr:23 	39044857 	39045024 	- 	373 2AA7 100%
303885 MPQYGEMTTMYLGRRPAIVLNTIQLAKEAFVQDAFSGRPFLPVMDWVANGL 303733
	117 	168 	+ 	Chr:23 	39044405 	39044560 	- 	398 2AA7 100%
303433 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKSVEDRVLEESRYLIAEILKGE 303281
	168 	232 	+ 	Chr:23 	39042937 	39043131 	- 	374 2AA7 100%	
302004 GKPMSPHHPIQNAVSNIICSIVFGDRFDYDNKRFEYLLELLNENFVLTGSAAGQ 301843
	197 	289 	+ 	Chr:23 	39042555 	39042800 	- 	468 2AA7 100%
       IFNLAPFIKHFPGPHQKIKQNANELLAFIQGEVKEHKKTLDPDSPRDFIDAYLLEIEK 301455
	277 	326 	+ 	Chr:23 	39042339 	39042488 	- 	316 2AA7 100%	
301352 QKSNKDSTFHEGNLAISTADLFLAGTDTTSTTIRWGLLFLTQNPDVQ 301212
	327 	388 	+ 	Chr:23 	39041256 	39041444 	- 	461 2AA7 100%	
300317 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRCANLVPFGVIHETIQPTKLRGYDIPQ 300129
	388 	440 	+ 	Chr:23 	39039665 	39039826 	- 	384 2AA6 like 2 diffs	
298696 GTVVMTNLAAILSDKEHWKHPDTFNPENFLDKNGQFSKPESFIPFSL 298556	
last exon is missing in genome assembly, use EST seq

CYP2AA8 Danio rerio (zebrafish)
        NA3313 78% to 2AA7 zfishC-a402h10.p1c zfishC-a440h04.p1c
        Chr 23 (probably an assembly error, since this gene breaks 2AA1 in half)
 540 MFSALLKLDLAFAGMTLILSLIFMFLLEIFRIHSFKSRFPPGPSPLPFVGNLPVFLKNPMEFIRS 734
 811 LSQYGEMTTIYLGRKPTIMLNTVQLAKEVLIQDAFAGKPSLPVLDWVSNGL 963
1198 GIVMVTFNHSWRQQRRFALHTLRNFGLGRKSVESRVLEESQYLIAELLKKK 1350
1544 GKSVNPHHALQNAFSNVICSIVFGDRFDYDDKRFEHFLEILGKSMILTGSTAGQ 1705
3903 IFNFAPIIKHFPGPHQKIKKNADELSGFFQHEVKEHKKTLDPGSPRDYIDAYLLEMEK 4076
     QKSNKDSTFHDENLIGSTTDLFVAGSDSTATTFRWGLLFLIQNPDVQ 4304
4703 ERCHKEIVQVLGYDRLPSMEDRDRLPYTLATVHEIQRCANLAPFGLIHETIQPTKLQGYDLPR 4891
     GTTIIVNLTAIFSNKENWKHPDTFNPENFLDESGQFSKHESFIPFSL 5173
8867 GVRVCLGETLARTELFLFITALLQRIRFSLPPDAKPMDMDGILSVLRYPQNFSFICCSRDTKE 9055

CYP2AA9v1  Danio rerio (zebrafish)
        GenEMBL AY825258, AL922288 ESTs AI544967.1, CK708594.1
        EST BI887677 matches 2AA2 with 1 diff and 2AA9 with 2 diffs
        Tseng, H.-P., Wang-Buhler, J.-L., Hu, C.-H., 
        Hseu, T.-H., Peng, J.R. and Buhler, D.R. 
        Submitted to nomenclature committee Oct. 14, 2004
        JR11
        94% to 2AA5
MFTALLKVDLASVGLTLFLGLIFLVVFEIFRIYSYKCRFPPGPT
PLPFVGNLPHLLKKPMEFIRSLSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAG
RPHLPIIEWITKGLGIVMVTFNNSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLI
AEMLKDEGRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSA
AGQIFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLE
IEKQKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQERCHEEIV
QVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETAQPTKLRGYNIPQGTI
IMTNYTAIFSNKEHWKHPDTFNPENFLDENGHFSKPKCFIAFGVGPRICLGDTLAKTA
LFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFCIICCSRDTKE

CYP2AA9v2  Danio rerio (zebrafish)
           Chr 23
98% (7 diffs) to 2AA9v1 possible haplotype seq
Note 2AA2 has only 3 aa diffs with 2AA9v2 from aa 122 to aa 499. Only 1 diff in exons 4-9.
There are 4 aa diffs to 2AA9v1 in same region
However, ESTs EB965911.1 and CF416995.1 match 2AA2 seq over the first 200 aa
EB965911.1 100% and CF416995.1 3 aa diffs so 2AA1 is supported
As distinct from 2AA9
2AA9v2 is 100% to 2AA5 in exons 1-7 but differs in exons 8,9
no ESTs match CYP2AA5 exons 8,9.  Genomic seq for 2AA5 exon 9 is found 
with 2 aa diffs at Chr:23 	39003564-39003746 55kb away.  This was probably an error 
in an earlier assembly of contig ctg14330.  In this contig exon 8 has 4 aa diffs
from 2AA9 exon 8 in a close region, possibly seq errors.  I think 2AA5
may not exist but 2AA9 is the correct version of this gene.
231378 MFTALLKLDLAFVGLTLFLGLIFLVVFEISRIYSYKCRFPPGPTPLPFVGNLPHLLKKPMEFIRS 231184
231092 LSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAGRPHLPIIEWITKGL 230940
230666 GIVMVTFNHSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE 230514
230308 GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ 230147
228436 IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK 228263
228179 QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ 228039
224651 ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ 224463
224331 GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGV 224191
       GPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFCIICCSRDxx 217157

	1 	66 	+ 	Chr:23 	38972308 	38972505 	- 	287 2AA5 100%
231378 MFTALLKLDLAFVGLTLFLGLIFLVVFEISRIYSYKCRFPPGPTPLPFVGNLPHLLKKPMEFIRS 231184
	62 	117 	+ 	Chr:23 	38972064 	38972231 	- 	312 2AA5 100%	
231092 LSQYGEMTTMYLGRKPAIVLNTYQVAKEALVQEAFAGRPHLPIIEWITKGL 230940
	116 	168 	+ 	Chr:23 	38971638 	38971796 	- 	379 2AA5 100% 
230666 GIVMVTFNHSWKQQRRFAQHTLRNFGLGKKSLESRVLEESQYLIAEMLKDE 230514
	168 	222 	+ 	Chr:23 	38971271 	38971435 	- 	387 2AA5 100% 2AA2 100%
230308 GRPMNPQHAVQNALSNIICSIVFGDRFDYNNKRFEYLLKILNESIILTGSAAGQ 230147
	222 	295 	+ 	Chr:23 	38969345 	38969563 	- 	386 2AA5 100% 2AA2 100%
228436 IFNFAPIIKHFPGPHQMINENANEVYSFVRHEVEEHRKTLDPGSPRDFIDGYLLEMEK 228263
	274 	326 	+ 	Chr:23 	38969166 	38969324 	- 	304 2AA5 100% 2AA2 100%
228179 QKSNKDSTFHEDNLITTTVDLFLAGSDSTSSSIRWGLLFLIQNPDVQ 228039
	327 	388 	+ 	Chr:23 	38965590 	38965778 	- 	434 2AA5 100% 2AA2 100%	
224651 ERCHEEIVQVLGYDRLPCMDDRDRLPYTLATVHEIQRCGNIAPFGLFHETVQPTKLRGYNIPQ 224463
	388 	440 	+ 	Chr:23 	38965300 	38965461 	- 	362 2AA2 100% 
224331 GTIIMTNYTAIFSNKEHWKHPDTFNPENFLDENGQFSKPKCFIAFGV 224191
	412 	495 	+ 	Chr:23 	38958284 	38958565 	- 	404 NA54442 100%, 1 AA DIFF WITH 2AA2
217336 GPRICLGDTLAKTALFLFITSLLQRIRFSLPPDAKPMDMDGILSIIRYPETFCIICCSRD 217157
	                      Chr:23 	39003564 	39003746 	-           2AA5 exon 9 2 aa diffs
262619 PRSCLGETLAKTELFLFITSLLQRIRFSWLPDAKPLDMDGIMGIVRYPQPFSIICCSRDTK 262437

CYP2AA10  Danio rerio (zebrafish)
          Chr 23 (see below)
          85% to CYP2AA1
183623 MLAALLKLDLASVGLTLFLGLIFLVLFEIFRIYSYKGRFPPGPTPLPFVGNLPHLLKNPMGFKRS
179490 LSEYGGLATVFIGRKPAISINTIQLAKEALVQDVFSGRPALPIFDWISHGL 179338
177849 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKTVEDRVLEESQYLIAEMLKDE 177697
177562 GKSMNPQHALQNAVSNIICSIVFGDRFEYDNKRFEYLLKILNENIMLTGSAAGQ 177401
176529 IYNLVPFIKHFPGPHQKIKQNADDLFNFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 176353
176306 QKFNKDSTFHEEHLVVSTSDLFLAGTDTTETTIRWGLIYLIQNPDVQ 176130
174181 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRYGNIAPKLLHETIRRTKLHGYDIPQ 173996
       GTTIIANFTAMFSDKELWKHPDAFNPENFLDENGQFSKPEYFFPFSL
       GPRACLGETLARTELFLFITSLLQRIRFSWPPNAKPIDMDGIVGIVRSPEPFNIICHSRDxx* 170736

	1 	110 	+ 	Chr:23 	38924403 	38924750 	- 	318 new 5 diffs to 2AA7	
183623 MLAALLKLDLASVGLTLFLGLIFLVLFEIFRIYSYKGRFPPGPTPLPFVGNLPHLLKNPMGFKRS
	62 	117 	+ 	Chr:23 	38920462 	38920629 	- 	327 2AA.g 100%	
179490 LSEYGGLATVFIGRKPAISINTIQLAKEALVQDVFSGRPALPIFDWISHGL 179338
	116 	206 	+ 	Chr:23 	38918752 	38918979 	- 	417 2AA.g 100% 
177849 GIIMVTFNHSWRQQRRFALHTLRNFGLGKKTVEDRVLEESQYLIAEMLKDE 177697
	168 	229 	+ 	Chr:23 	38918486 	38918689 	- 	438 2AA1 like 2 diffs	
177562 GKSMNPQHALQNAVSNIICSIVFGDRFEYDNKRFEYLLKILNENIMLTGSAAGQ 177401
	178 	302 	+ 	Chr:23 	38917411 	38917794 	- 	507 2AA.e 100%	
176529 IYNLVPFIKHFPGPHQKIKQNADDLFNFIRDEAKEHKQTLDPDSPRDFIDAYLLEIEK 176353
	269 	326 	+ 	Chr:23 	38917257 	38917439 	- 	363 2AA.e 100%
176306 QKFNKDSTFHEEHLVVSTSDLFLAGTDTTETTIRWGLIYLIQNPDVQ 176130
	327 	388 	+ 	Chr:23 	38915123 	38915308 	- 	451 2AA.e 4 diffs	
174181 ERCHEEIVQVLGYDRLPSMDDRDKLPYTLATVHEIQRYGNIAPKLLHETIRRTKLHGYDIPQ 173996
	367 	445 	+ 	Chr:23 	38913823 	38914083 	- 	351 2AA.f 100%	
       GTTIIANFTAMFSDKELWKHPDAFNPENFLDENGQFSKPEYFFPFSL
	430 	495 	+ 	Chr:23 	38911863 	38912060 	- 	475 new 85% to 2AA1
       GPRACLGETLARTELFLFITSLLQRIRFSWPPNAKPIDMDGIVGIVRSPEPFNIICHSRD 170736

CYP2AA10-de8b9b  Danio rerio (zebrafish)
          Chr 23 (see below)
          87% to 2AA3v1 
162569 GTAIVTNFEAIFSSKDHWKHPDAFNPKNFLEDEHFSKLESFIAFSL
162346 xxRSCLGEMLVRTELFLFITSLLQRIHFSWPPDAKPIDMDGIMGLVHSPQTFNVICRSRD 162173

	389 	440 	+ 	Chr:23 	38903541 	38903696 	- 	321 new 80% to 2AA3	
162569 GTAIVTNFEAIFSSKDHWKHPDAFNPKNFLEDEHFSKLESFIAFSL
	438 	495 	+ 	Chr:23 	38903300 	38903473 	- 	439 new 89% to 2AA3
162346 xxRSCLGEMLVRTELFLFITSLLQRIHFSWPPDAKPIDMDGIMGLVHSPQTFNVICRSRD 162173

CYP2AA11  Danio rerio (zebrafish)
          Chr 23 (see below)
          86% to CYP2AA6
293503 MLVGLLKLDLSSVGLSLFLGLFCLALFEICRIRIYKGRYPPGPTPLPFVGTIPHFLKNPMGFIRS 293309 
293192 LSQYGEMTTVYLGRKPAIILNALQLMKEAFVQNGSSFSGRPPVPVLTWVNQGY 293034
289501 GIIMAMDGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISELLRVE 289349
288521 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENLDLAGSFAGQ 288360
284034 MVNLVPIIKNLPGPHQKIYQNGEEFKSFIRESVKAHRETLDPDSPRDFIDAYLLEMEK 283861
283191 QKSTQDSTFHEDNMVMSVGDLFFAGTDTTATTIRWGLLYLTQNPDVQ 283051
       ERCHDEIVQVLGFDCFPSMDDRDQLPYTLATVHEIQRCANVAPSGVPHQTTKPIKLRGYDIPQ 282518
282449 GTQILINLMGILANKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 282309
       GPRACLGETLAKTELFLFVTSLLQRIRFSWPTGEKWPDMNRILSVIRSPEPFNIICYSRDS 282053

	1 	66 	+ 	Chr:23 	39034433 	39034630 	- 	282 2AA.d 100%
293503 MLVGLLKLDLSSVGLSLFLGLFCLALFEICRIRIYKGRYPPGPTPLPFVGTIPHFLKNPMGFIRS 293309 
	62 	138 	+ 	Chr:23 	39034071 	39034331 	- 	285 NEW 86% to 2AA6	
293192 LSQYGEMTTVYLGRKPAIILNALQLMKEAFVQNGSSFSGRPPVPVLTWVNQGY 293034
	117 	178 	+ 	Chr:23 	39030437 	39030628 	- 	317 NA1642 100%	
289501 GIIMAMDGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLISELLRVE 289349
	168 	224 	+ 	Chr:23 	39029478 	39029648 	- 	333 NA1642 100% 5 diffs to 2AA6
288521 GKPFNPQHAIHNAAANIICSIVFGDRFDYDNKSFTYLLEIIKENLDLAGSFAGQ 288360
	221 	279 	+ 	Chr:23 	39024988 	39025164 	- 	363 new 89% to 2AA6	
284034 MVNLVPIIKNLPGPHQKIYQNGEEFKSFIRESVKAHRETLDPDSPRDFIDAYLLEMEK 283861
	273 	326 	+ 	Chr:23 	39024178 	39024339 	- 	321 CYP2AA6-ie6 100%	
283191 QKSTQDSTFHEDNMVMSVGDLFFAGTDTTATTIRWGLLYLTQNPDVQ 283051
	312 	396 	+ 	Chr:23 	39023630 	39023878 	- 	423 new 79% to 2AA6	
       ERCHDEIVQVLGFDCFPSMDDRDQLPYTLATVHEIQRCANVAPSGVPHQTTKPIKLRGYDIPQ 282518
	388 	465 	+ 	Chr:23 	39023343 	39023579 	- 	348 2AA6 3 diffs	
282449 GTQILINLMGILANKEHWKHPDTFNPENFLDDKGHFFKPEAFLPFSL 282309
	429 	496 	+ 	Chr:23 	39023180 	39023374 	- 	378 new 83% to 2AA.a
       GPRACLGETLAKTELFLFVTSLLQRIRFSWPTGEKWPDMNRILSVIRSPEPFNIICYSRDS 282053

CYP2AA12  Danio rerio (zebrafish)
          Chr 23 (see below)
          83% to 2AA6 
358763 MFASLLKLDLASVGLTLFLGLIFLVVFEIFRIRSYSGRFPPGPTPLPFVGTIPHFLKDSMGFIRS 358569  84% to 2AA7
358463 LSQYGEMTTVYLGRKPAMVLNTLQVIKEAIVQNGTSSSGRPSIPILTWITEGY 358305
       GIVLATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLVPEMLKLE 353402
353113 GKPFDPQHAIQNAVSNIICSIVFGDRFEYDNKRFEYLLEIIKENINQAGSLIGQ 352952
352042 VFNLIPIIKHFPGPHQKIYQNAEELKSFIRESTKSHRETLDPDSPRDFIDAYLLEMEK 351869
351605 QKSSQDSSFHEDNMVMSVADLFLAGSDTTATTIRWGLIYLTQNPDVQ 351465
335727 ERCHEEIVRVLGYDRLPCMDDRDRLPYTLATVHELQRCGNIVPSSVPHETTQPMKLRGYDIPQ 335539
335464 GTQMLINLSDILANKEHWKHPDTFNPENFLDDKGHFYRPEAFLPFSL 335324
       GPRVCLGETLAKTELFLFITSLLQRIRFSWPTGEKWPNMDGIVSVVRSPEPFKIICHSR 335067

	                      Chr:23 	39099777 	39099809 	-
358763  MFASLLKLDLASVGLTLFLGLIFLVVFEIFRIRSYSGRFPPGPTPLPFVGTIPHFLKDSMGFIRS 358569  84% to 2AA7
	62 	117 	+ 	Chr:23 	39099429 	39099602 	- 	274 2AA.d 100%	
358463 LSQYGEMTTVYLGRKPAMVLNTLQVIKEAIVQNGTSSSGRPSIPILTWITEGY 358305
	100 	174 	+ 	Chr:23 	39094517 	39094729 	- 	346 2AA.d 1 diff	
       GIVLATFGHSWRQQRRFALHTLRNFGLGKKSVEERVTEESGYLVPEMLKLE 353402
	167 	233 	+ 	Chr:23 	39094031 	39094243 	- 	355 2AA.d 100%
353113 GKPFDPQHAIQNAVSNIICSIVFGDRFEYDNKRFEYLLEIIKENINQAGSLIGQ 352952
	221 	293 	+ 	Chr:23 	39092960 	39093172 	- 	421 2AA.d 100%	
352042 VFNLIPIIKHFPGPHQKIYQNAEELKSFIRESTKSHRETLDPDSPRDFIDAYLLEMEK 351869
	270 	326 	+ 	Chr:23 	39092592 	39092762 	- 	325 2AA.d 100%	
351605 QKSSQDSSFHEDNMVMSVADLFLAGSDTTATTIRWGLIYLTQNPDVQ 351465
	324 	388 	+ 	Chr:23 	39076666 	39076866 	- 	431 2AA.a 100%	
335727 ERCHEEIVRVLGYDRLPCMDDRDRLPYTLATVHELQRCGNIVPSSVPHETTQPMKLRGYDIPQ 335539
	389 	486 	+ 	Chr:23 	39076256 	39076591 	- 	327 2AA.a 100%
335464 GTQMLINLSDILANKEHWKHPDTFNPENFLDDKGHFYRPEAFLPFSL 335324
	429 	494 	+ 	Chr:23 	39076194 	39076382 	- 	390 	2AA.a 100%
       GPRVCLGETLAKTELFLFITSLLQRIRFSWPTGEKWPNMDGIVSVVRSPEPFKIICHSR 335067
Chr:23 	38974638 	38974682 	- 	
233555 QTFSIICCSRNTKE* 233511  (pseudogene piece after the gene)

2AB Subfamily

CYP2AB1P    human
            GenEMBL NT_022676.10|Hs3_22832 also AC068644.15
            chr3q27.1 185030751-185015757 - strand build 33
            old name = 2D31P
            NT_005962.297 (genescan predicted protein has errors)
            75% to 2ab1 mouse which is a functional gene
MLSLLSGLALLAISFLLLKLGTFCWDRSCLPPGPLPFPILGNLWQLCFQLHPETLLQ
LAQSVFTVWVGPIPVAVLSGFQVVKEALVSNSEQFSGRSLTPLFQDLFGER
GIICSSGHTWRQKRRFCLVMI*GLGL
GKLALEVQLQKEAAELAEAFRQEQGRPFDPQVSIVRST
VRVIGALVFGHHFLLEDPIFQELTQAIDFGLAFVSTVWRRLYDVFPWALC
HLPGPHQEIFRYQEVVLSLIHQEITRHKLRAPEAPRDFISCYLAQISK 
AMDDPVSTFNQENLV*VVIDLFLGGTDTTATTLCWALIHMIQHGAVQG
TVQLELDEVLGAAPVVCYEDRKRLPYTX
AVLHDVQRLSSVMAMGAVRQCVTSTRVCSYPVSK
GTIILPNLASVLYDPECWETPRQFNPGHFSDKDGNFVANEAFLPFSAGHRVYPAD
QLAQMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTWQPQPQEICAVPR

CYP2AB1P    Bos taurus (cow)
            See cattle page for details
MCPLLIWLGLLAASFLLLKFSIIYWERNHLPPDPFPFPILGNPWQLSFQLHPATLLQ
LAQTHGHVFTVWVGPTPVVVLCSFQA
KEALVSHSEQLSGWPLTPLFQDLAGERG
GVICSSGRTRRQ*RRFCLAALQGLG*GPLALELRLQEEAAGLVEAFHWEQ
GGPFDPQAPIVRSTARVTGALVFGRHFLSEDPFFQELI*ATNFGLAFXXXXXX
QLNDLFPWAFRCLPGPYREMFRYQKAVRGYIHREIMRHKLRTSEAPKDFISCYLAQIIK
ATDDPVSTFNEENLIQVVVGLFLGGTDTTGTTLYWVLIYMIQYGAIQS
ERVQQELVTVLGTSGAICYKDHEQLPHICTLLHEAQRLSSVA*V
AVCQCVTSTHVHGHPVPK
GTIILPNLAAVLCDPECWRTSRQFNPGHFLDKDGNFVVRDIFPPFSA
GHQMCLGD*LAQMKLLLMFATLLGTFSFQLPGRSPGLRLEYNFGGTRKPLPQKIYAVSRLNCPHPGPREEVL*

CYP2AB1P   Canis familiaris (dog)
           AACN010195735.1 
           exons 8,9 75% to cyp2ab1 mouse 
KRELPPGSFPFPSENPWQLSFQLYPETL (N-term fragment)

1543 GTIILPNLASVLLDPECWETPQQFNPGLFLDMGGNFLVNEAFLPFSA 1683
GHQVGPGDHLALMELFLMFANPFRTFWFQLPEGSLG*DLQYIWGTL*PQPQKICAVP 1941

CYP2AB1   Monodelphis domestica (short-tailed opossum)
          XM_001374342 
          Added N-term and removed some C-term seq 
          and internal seq from the prediction
          61% to CYP2AB1P human
MFSLATGLAILATSFLLLR
MLAFFLARTQFPPGPCPLPILGNLLQLLSPGACYPTLLPLTRKY
GSIFTVWLGSTPVVVLNGFQAVKDALVTHSEDFADRPVTPLFEDLFGDKGIISTSGHA
WQQQRRFGLITLRALGMGKKVLEQRLQEEAQYLVEIFHRQNGTSFDPHVPIVRAAANV
ICALVFGHRFPHGDPFFQELMKAIDFGLAFVNTIWRR (0)
LYDAFPW
LLRQLPGPHRKIFRYQEIVKSLICQEIERHKQRVPEDLEDFISCYLAQITKRKDDPAS
TFDEENLIQVIIDLFLGGTETTATTLRWALLYMIHHRDVQGKVQQELDTVLGPSRVIS
FKDRKLLPYTNAVLHEVQRFCSVISVGAVRKCGTATTVQGFPIQKGTIVLPNLASVLC
DPEHWETPWQFNPGHFLDGEGNFVIHEAFLPFSAGHRVCLGELLAKVELFLVFAHLLR
EFRLRAPAGASTNERDYILWGTKQPRPYDICASPRLGRFQGGPRKDRLEAAEMQREGG
TDQ*

Cyp2ab1    mouse
           GenEMBL NW_000107.1
           39% to Cyp2j5; new subfamily in Cyp2; EST BY749683.1 
           B6-derived CD11 +ve dendritic cells, rat ortholog XM_221297.1 91%
NW_000107.1|Mm16_WIFeb01_286
MFSLFSGMAFLAGSCLLLKLATLCWRRSHLPPGPFPFPLLGNLWQLNFQLHPNMLFQ
LAQTHGSVFTVWLGSTPIVVLSGFRAVKEALVSNSEQFSGRPLTPFFRDLFGEKG
VICSNGLTWRQQRRFCLTTLRELGLGKQALEVQLQHEAAELAKVFLQEEGRA
FDPQIPIIRSTTRVIGTLVFGHHFLSEEPIFLELIQAINLGLAFASTIWRR
LYDMFPWALRHLSGPHQKIFQYHEAVRGFIRHEIIRHKLRTAEAPKDFINCYLSQITK
AIDDPVSTFSEENLIQVVIDLFLGGTDTTATTLHWALIYLVHHRAIQG
RVQQELDEMLGAAQTICYEDRERLPYTRAVLHEVQRLSSVVAVGAVRQCVTSTWMHGYYVPK
GTIILPNLASVLYDPECWESPHQFNPGHFLDKDGNFVANEAFLPFSA
GHRVCPGEQLARMELFLMFATLLRTFQFQLPEGSQDLGLEYVFGGTLQPQPQKICAVLR

CYP2AB1   rat
          XM_221297 N-terminal incorrect, AC107471.6 N-term 
          92% to mouse
189790 MFSLFGGMAFLAGSFLLLKLAALCWRRSHLPPGPFPFPLLGNLWQLNFRLHPNMLFQ (0) 189620
LAQTHGNVFTVWLGSTPIVVLNGFRAVKEALVSNSEQFSGRPLTPFFRD
LFGEKGVICSNGLTWRQQRRFCLTTLRELGLGKQALELQLQHEAAELAEVFHQEQGRA
FDPQVPIIRSTTRVIGALVFGHHFLSEEPIFLELIRAINLGLAFASTTWRRLYDMFPW
ALRYLSGPHQKIFQYHEAVRGFIHHEIIRHKLRTPEAPKDFISCYLSQITKAMDDPVS
TFSEENLIQVVIDLFLGGTDTTATTLHWAIIYLVHHRAIQERVQQELDEVLGTAQAVC
YEDRERLPYTRAVLHEVQRLSSVVAVGAVRQCVTPTWMHGYYVSKGTIILPNLASVLC
DPECWETPHQFNPGHFLDKDGDFVTNEAFLPFSAGHRVCPGEQLARMELFLMFATLLR
TFRFQLPEGSQGLRLEYVFGGTLQPQPQKICAVPRLSSLSPREP

CYP2AB1    Gallus gallus (chicken)
           chr9:15,039,303-15,044,379 (+)
           49% to human 2AB1P, 51% to mouse, 54% to Xenopus
           This seq is named 2AB1 since it is the most like the 
           single Xenopus sequence.
18744 MLGIVELFVALVASLLILQFLKLQWMRSQLPPGPVPLPIIGNLWLLDFKLRRETLAK 18914
19663 LTNIYGNIYTVWMGQTPVVVLNGYKAVKDAIVTHSEETSGRPLTPFYRDMMGEK 19824
19958 GIFLTSGHTWKQQRRFGMTIIRSLGFGKNNLEHQIQTEASHLLHIFANTK 20107
21368 GRPFNPRTSIVHAIANIICAVVFGHRFSSEDESFSKLIKAVYFVIYFQATIWGR (0) 21529
21710 MYDAFPWLMHRFPGPHQKVFAYNNFMHNLVMNEIQMHEREKAGDPQDLIDFYLTQIAK 21884
22115 TKDDPTSTFNKDNMVQTVVDLLLGGTETTSTTLLWALLYMVQYPEIQ 22255
22786 ERVQREIEAVLEPSHVISYEDRKRLPYTNAVIHETLRYSNITSVGVPRLCVRNTTLLGFHIKK 22974
23285 GTLVLPNLHSVVYDSDHWATPCKFDPNHFLDVDGNFVNKEAFLPFSA 23425
23644 GHRVCLGEQMARVELFIFFTNLLRAFTFQLPEGVKEINPEYVLGAILQPHPYEICAVPR 23820
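
The numbers flanking each exon in entries like the one above appear to be nucleotide coordinates on the genomic sequence, so each peptide segment should span three nucleotides per residue. The Python sketch below checks this for the first three exons of chicken CYP2AB1. The function name `exon_span` is illustrative, and reading the flanking numbers as inclusive coordinates is an assumption, not something stated in the listing.

```python
def exon_span(line):
    """Return (nucleotides spanned, 3 * residues) for one 'start PEPTIDE end' line."""
    fields = line.split()
    start, peptide, end = int(fields[0]), fields[1], int(fields[-1])
    # abs() lets the same check work for minus-strand entries (start > end).
    return abs(end - start) + 1, 3 * len(peptide)

# First three exons of chicken CYP2AB1, copied from the entry above.
exons = [
    "18744 MLGIVELFVALVASLLILQFLKLQWMRSQLPPGPVPLPIIGNLWLLDFKLRRETLAK 18914",
    "19663 LTNIYGNIYTVWMGQTPVVVLNGYKAVKDAIVTHSEETSGRPLTPFYRDMMGEK 19824",
    "19958 GIFLTSGHTWKQQRRFGMTIIRSLGFGKNNLEHQIQTEASHLLHIFANTK 20107",
]

results = [exon_span(e) for e in exons]
for nt, expected in results:
    print(nt, expected, "ok" if nt == expected else "check")
```

Exons that begin or end inside a codon can differ from the expected length by one or two nucleotides, so a mismatch is a prompt to look at the boundary rather than proof of an error.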

CYP2AB2    Gallus gallus (chicken)
           XM_422750 (two P450s fused together by an annotation error)
           chr9:15,031,052-15,037,949 (-)
MGINVLSPPEKNSEFYHVLFLLGLQFLRLQWRSRRFPPGPIPFP
IIGSIWWINFRADHGSLKKLAKAYGNICTLWLGHKPIVVLYGFKAVKDGLTTNSEDVS
GRLQTYLFNRFSSGKGTAEFQWMEHRVLYLKQEWLNWFLPASYPSKHRGTRIGSLQTS
PMGSSEKSIGLEQLSERDHRISWWEKPEHQRRFGIATLRKLGMGNKGMERGIQAEARH
LVEFFRSKDGRAVDPSFPIVHAVSNVICAVVFGHRFSLQDETFRRLMEAYNGIVAFGN
SYFYYTKNVPNSTYDEENMLQSVFDLFLGGSETTATTLRWALLYMVAYPDIQEKVQKE
LDAVLGSSHQIDYEDRKKLPYTNAVIHEIIRFSSIILITIPRQAVKDTTVLGYQVPKG
TIIMANIDSTLFDPEYWETPHQFNPGHFLDKDGNFVIREAFLAFSAGHRVCLGEVMAK
MELFIIFCSLLQIFKFTPPEGDKEINLSFVFGSTMKPHPYKL
CAVLR

CYP2AB3    Gallus gallus (chicken)
           XM_422750 (two P450s fused together by an annotation error)
           chr9:15,022,695-15,027,829 (-)
           46% to mouse 2ab1 
7270 MLAVSAVLVCLAASLLLVQFLGMQWKRRQLPPGPAPFPLFGNLLQMKFQIHHDILXX 7106
     MASMYGNIFTLWLTGTPVVVLHGY 6690
6689 QAVKEGMTAHAEEVAGRPLSRAFRLMTNGN 6618
6266 GVMFSNGHLWKQQRRFGLLTMRKMGVGKQNQECQIQEEAHHLVQYLRNTK 6117
5699 GKPLDPAVPVTHTVSNVICALILGHRFSIEDKRFLRLVEAVDDISAFANSVSFY 5538
4840 VHDQVPWIATHFLTRCKKALASIDTMRALLEEEIGSHKGKVDENQDFIGYYLDQMAK 4670
4111 SKEDAGATYDKANLLQTIFDLFLAGTETTATTLRWALLYMVAYPDVQ 3971
3128 KKVHKELDAVLGSSRLICYKDRKNLPYTNAVIHEIQRYSNIVLIALPRYTVKDTELLGFPIPK 2946
     DTIVLVNID 2769
2768 SVLSDPEKWETPDQFNPGHFLDKDGNFVHREAFLPFSI 2655
2354 GHRACMGELLARLELFIIFCTLLQAFTFTLPDGVNEVSTKFVFSS 2178
2177 TKKPPPHQICAIPR 2136

CYP2AB4    Gallus gallus (chicken)
           XM_426708 seq was added to mRNA translation to correct it
           chr9:15,009,527-15,018,429 (-)
MNPVKAAAMLSINQVMIALVVFLLVMQFLKLQRARRCLPPGPIP
LPVLGTLLQLNFQINRDVLMKLAKTYGNVFTLWFGWAPVIILNGFQAVKDGMTTHPED
VSGRLVSPFFRAMAKGKGIMLATGHMWKQQRRFALKTLRNLGLGKRGLEQRVQEEALH
LLEFFASLKEKPLDPYYPLIHSVSNVICAVVYGHRFSRGDETFHELIRATEHIFKFGG
SLLHHLYEIFPWLMCRLPGPHKKALSCYDILSSFTRREIREHKEREIPDEPRDFIDFY
LAHIEKSGDEPKSTYNEENMVYSINDLFLGGSETTSTTLNWGLLYMVAYPDVQEKVQK
ELDAVLGPSQMICYEHRRKVPYTNAVIHEIQRFSNIISIGMPRVCVRNTTLLGFPLKK
GSIVLPNIASSLYDP
EHWETPRQFNPAHFLDKDGNFVSQEAFLPFSIGHRVCLGEHLARTELFIFFANLLRAF
TFQLPEGVTTINTEPIFGGTLQPHPYKVCAIPR

CYP2AB1     Xenopus laevis (African clawed frog)
            GenEMBL BC074149.1 
            46% to 2AB1P human, 49% to mouse, 54% to chicken
MSFTQETWSLQQILLAFLVCVIAVKYIKMRWAA
RSLPPGPTPLPLIGNLWALRFKLHPKTLRKIAVSYGDIYTLWLGHTPLVVLSGCRSVRNG
LISHSEELSGRPVDGLMQALTNERGIGSTNGHTWKQQRRFGLMTLRNLGLGKRGLESRIQ
EEAQCLVESLAAKNGEPVNPSDLIVHAVANVISAVVFGHRFSIEDPTFQEMVRCNGCIVT
NLGTAWGRIYDAFPWLMRFV PGPHQSSFAAMAYLTAFIKKEIKLHELNGPNEQPQDLIEY
YLAQIAKTKHEPDNTFDEANMIQTVI DLFIAGTETTATSLQWALLYMVAFPEIQKKVQEE
LDTVLDGSQLAYYEDKKRLPFTNAVIHEVQRYGNIASVGMLRSCIRKVTVNGYQLEKNTM
VLPNLDSVLHDQHQWETPYKFNPNHFLDKNGNFCTSEAFLPFSAGHRVCLGEQLARFELL
IFFTTLLRRFNIELPEGITEVNTKYVFKMTLQPHPYEICAVPR*

CYP2AB1     Xenopus tropicalis (frog)
            GenEMBL CX984262.1 CX984263.2 ESTs
            scaffold_535:154,346-161,099
131  MSFTQDTWSFQQILLALLVCVITIKYIKMKWAAKNLPPGPTPLPLLGNLWALRFKLHP  304
305  KTLRKMAKSYGDIYTLWLGHTPLVVLSGCKSVRNGLITHSEELSGRPVDGFMTALTNERG  484
485  IGTTNGHTWKQQRRFGLMTLRNLGLGKRGLESRIQEEAQCLVESLAAKNGEPINPSDLIV  664
665  LAVANVISAVVFGHRFSIEDPTFQEMVKCNSSLVSGLGTAWGRMYDAFPWLMRYV  829
     PGPHQKSFAAIDYLAAFIKKEIKLHEINSSKDDPQDMIDYYLTQIEK (0) 
     TKHELDTTFDEENMIQVVI
893  DLFIAGTETTAISLSGALLYMVAFPEIQKKVQKELDTVLDGSPLAYYEDRKKLPFTNAVI  714
713  HEVQRYGNIASVGIPRSCIRKVTVNGYQLNKNTIVLPNLDSVLHDQRQWETPYKFNPNHF  534
533  LDKNGDFCTNEAFLPFSAGHRVCLGEQLARFELFIFFTTILRRFSIELPKGVTEVNTDYV  354
353  FKMTLQPHPYEICAIPR  303

2AC Subfamily

CYP2AC1P    human
            AC022650, 6p12.3, 41% to 2C9, pseudogene with 2 in-frame stops 
            68% to rat CYP2AC1 (XM_236969.1) functional gene
            old name CYP2C57P 
GIAFSHGETWKTMRRFSLTTLRNFGMGEWIIEDTIIEECQNLIQ
NMFLVLGFLLKSHKTILRNRDELFSFIRMAFLDHHHKLDKNDPRNFTDVFLVTQQE
ENDTFADYFSDKKLVTLVNNLFTTGTETTASTLHWGILLVMRYPEVQS
KVHNEITKVVVSAQS*LAHRTQMTHTDAVI*EVQRFANILPTSLSHATTTNIFKNYCIPK
GTEVIILLASVARDQAQWEKPDTFNPEHFLNSKEKFIKREAFLPF
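
The two in-frame stops noted for this pseudogene can be confirmed by counting the `*` characters in the translation, assuming `*` marks a stop codon as is conventional in protein translations. A minimal Python sketch, with variable names chosen for illustration:

```python
# Translated fragments of human CYP2AC1P, copied from the entry above.
fragments = [
    "GIAFSHGETWKTMRRFSLTTLRNFGMGEWIIEDTIIEECQNLIQ",
    "NMFLVLGFLLKSHKTILRNRDELFSFIRMAFLDHHHKLDKNDPRNFTDVFLVTQQE",
    "ENDTFADYFSDKKLVTLVNNLFTTGTETTASTLHWGILLVMRYPEVQS",
    "KVHNEITKVVVSAQS*LAHRTQMTHTDAVI*EVQRFANILPTSLSHATTTNIFKNYCIPK",
    "GTEVIILLASVARDQAQWEKPDTFNPEHFLNSKEKFIKREAFLPF",
]

# Count stop symbols across all fragments.
stops = sum(fragment.count("*") for fragment in fragments)
print(stops)
```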

CYP2AC1P    Bos taurus (cow)
            See cattle page for details
            67% to rat 2AC1
MSGFESSFILPILSLILIFILNIKIVMTKASKQHFPPVPRPLPIIGNLHILNLKRPYQTMLE (0)
LSQKYGSIYSIQIGPRKVAVLxGYETVKDVLVNHTDQFGEWFHVPISERLFEGK
GIFFSHSDTSKIIRFTLTTSQNFGMGKKALEDTIIGESQHLIRNFETDKG
GKPFEVKTLTNASVANINVSVLLGKGFDYQNTPFLRLLTLIDQSVKLIVSPPTA
LFNMFPVLRFLLKTYKNILRNKDELFSFIRMTFLHHHHKLDKNDPRSLTDAFLVRQQE
DTSTDYFNDDTLVVLVNNLFAAGTESMVSTLCWGILFMSRYPEIQS
KVHDEIAKVMGSTQP*MAH*TQMPYTDAVILEVQRFADILPTGLPRATTTNTIFKNNYIPK
GTEVIFLLTSVL*DQTQWENPATFNPEHFLDSIEKFIKKEAFISFSV (1)
SPL*CAGESLAKMELLLFFMSLLQKFTFQPPPGVSHLDLDPTRDTGVVIQPMPHKIRALPRA

CYP2AC1   Canis familiaris (dog)
          XM_847513.1 
MSGFDSSIILPILSLLLIFLLNIKIFMTKASKQHFPPGPR
PLPIIGNLHILNlkrpyqtmleLSQKYGSIYSIQMGPKKVVVLSGYETVKDALVNYGD
QFGERSQVPIFERLFEGKGIVFSHGETWKTMRRFSLATLRNFGMGKRIIEDTIIEECQ
HLIWSFESHR GKPFEVKTVMNASVANVIVSVLLGKRFDYQDTQFLRLLTLIGENVKLI
GGPRIA
LFNMFPVLGFLLKSHKTVLRNRDELFAFIRMTFLDHQHKFDKNDPRSFIDAF
LVRQQE EKDTSTTYFSDENLVALVSNLFAAGTETTATTLCWALLLMMRYPEVQKKVCD
EITKVVGSAQPRITHRTQMPYTDAVIHEVQRFANILPTGLPHATTTNVMFKNYYIPKG
TEVITLLTSVLRDQTQWEKPDTFNPNHFLSSTGKFIKKEAFMPFSLGRRMCAGESLAK
MELFLFFTSLMQKFTFQPPPGVSHLDLDLTPDIGFTTRPMPHKICALLRA*

CYP2AC1   Monodelphis domestica (short-tailed opossum)
          XM_001369570.1 
MSNGGHSLVPQMSIEFWEQRPTQGANIYHGHYPPGPKPLPVIGN
LHILNLKRPYQTMLELSKKYGPIFSLRMGPKTVVVLSGYETVKDALVNYSEQFGERAR
IPIFERIFEGKGIVFSHGENWKITRRFSLTTLRNFGMGKRVIEERILEECHHLIQVFE
SHQGKPFEISTIMSASVANIIVSILFGKRFDYKDPQFLRLLHLIGENIRLAGGPSITI
FNMFPVLGFLLQDLKRVLRNRDELFSFIRTTFLKHLRKLDKNDQRSFIDAFLIKQQEK
DKSDDYFNNDNLVALVSNLFAAGTETTSSTLRWGILLMMKYPEIQKKVHNEITEVIGS
AQPRIEHRTQMPYTDAVIHEIQRFSNILPMNLSRETTTDVIFKNYYIPKGTEVITLLT
SVLQDQTQWEKPCTFHPQHFLTKEGKFIKRDAFLPFSAGQRMCAGESLAKMELFLFFT
SLLQKFTFCPSPGVSNSDLDLTPDIGFTTRPQPYKICALPYF

Cyp2ac1-ps mouse
           GenEMBL NW_000130.1|Mm17_WIFeb01_308 
           MISSING EXON 2 probably in a seq gap 
           Rat ortholog is 80% identical
MSGFDFSAMLALLGLSLILILHINVFMAKASKHQSPPGRKSWPVIGNLHIXXXXXXXXXXXX 

GIAYAHGKCWKTMRRFSLTTLRNFLMGKRIIEDTIVTECQHLIQCFESHK
GLVLGM*RLLKASIANVIVSVLLGKWFDYQDSQFLRLLTLIGENMKLIGNPSIV 
LLNMFPILGFLLRSKKKVLRNRVELFSFIRMAFLEHCHNRNKSDPRSLIDAFLVRQQG
ENNTSANHFNEENLLALVSNLFTARTKTTASTLHWGIILMMLYPEVQS 556747
KVRGEIIKVVGSAQPRIEHRIQMPYTDTVIHEIE (fs) RVANILPTSLFHETTTDVAFKNYYIPK
GTEIITLLTSVLQDQTQWEASDAFDPAHFLSPKGTFVKKESFVPFSW 561380
GCHMCAGEPLAKMELFLFFTSLMQKFIFQSPxx (fs) VSHLDLDLTPDIGFIMQSQPHKICALVRASAL

CYP2AC1    Rattus norvegicus (rat) 
           GenEMBL NW_044163.1|Rn9_1523 
           genomic ortholog to 2ac1 chromosome 9
3425457 MSGFDFSAILALLGLILILILNIKDFMAKASKRQCPPGPKPWPVIGNLHILNLKRPYQTMLE 3425272 
3423187 LSKKYGPIYSIQMGPRKVVVLSGYETVKDALVNYGNQFGERSQVPIFERLFDGK 3423026
3415443 GIAFAHGETWKTMRRFSLSTLRDFGMGKRTIEDTIVVECQHLIQSFESHK 3415294
3412018 GKPFEIKRVLNASVANVIVSMLLGKRFDYEDPQFLRLLTLIGENIKLIGNPSIV 3411857
3410639 LFNIFPILGFLLRSHKKVLRNRDELFSFIRRTFLEHCHNLDKNDPRSFIDAFLVKQQ 3410469
3410029 ENNKSADYFNEENLLALVSNLFTAGTETTAATLRWGIILMMRYPEVQS 3409886
3408812 KVHDEIHKVVGSAQPRIEHRTQMPYTDAVIHEIQRVANILPTSLPHETSTDVVFKNYYIPK 3408627
3406238 GTEVITLLTSVLRDQTQWETPDAFNPAHFLSSKGRFVKKEAFMPFSV 3406098
3402907 GRRMCAGEPLAKMELFLFFTSLMQKFTFQPPPGVSYLDLDLTPDIGFTIQPLPHKICALLRTSAL* 3402710

CYP2AC1    Xenopus laevis (African clawed frog)
           GenEMBL CB558367.1 NICHD_XGC_Kid1 Xenopus laevis cDNA clone
           CB559919.1 NICHD_XGC_Kid1 Xenopus laevis cDNA clone
           BJ030802.1 NIBB Mochii normalized Xenopus neurula cDNA clone 
           61% identical to rat 2ac1 from PPGP to end
MFLGDPVTVLLTVVLCLILANLLYGRKRNNFKNF
PPGPKPLPVIGNINIINLKRPYLTYLELWKKYGPVFSIQIGGQKMVVLCGYETVKDALVNYAEEFSERPK
IPIFRDISKEYGVLFSHGENWKVMRRFTLSTLRDFGMGRSSIEDRINEECDFLVEK
FKSYKGKPFENTMIINAAVANIIVSIILGHRFDYQDPIFLRLMSLINENIRLSGSPTVML
YNVFPSVMRWLPGSHKTIAKNAAENQR 
FIRKTFTKHRDKLDVNDQRTLVDAFLVKQQEKNVNVQYFHDENLTMIVSNLFAAGMETT
SSTIRWGLLLMMKYPEIQKNVQNEIEKVIGQSQPQTEHRKSMPYTDAVLHEIQRFGNIVP
MNLPHATAQDVTFRGYFLPKGTFVIPLLTSVLYDQTHFEKPHEFYPQHFLDSEGNFVKNE
AFLPFSAGKRSCAGENLAKTELFLFFTSLLQNFTFQASSGRRT*

CYP2AC1   Phalacrocorax carbo (Common cormorant)
          No accession number
          Hisato Iwata
          submitted to nomenclature committee 5/19/05 
          61% to CYP2AC1 rat 76% to 2AC1 chicken 70% to 2AC2 chicken

CYP2AC1   Gallus gallus (chicken)
          NW_060338.1|Gga3_WGA147_1 chr 3 
          XM_420052.1, BG641890.1 EST BU120706.1
3967773 MDWASVVPVGLLMILILLLILKTQDFWRSQGKFPPGPQPLPIIGNLHIMDLKKIGQTMLQ (0) 3967952
3968877 LSETYGPVFTVQMGMRKVVVLSGYDTVKEALVNHADAFVGRPKIPIVEKAGKGK  3969038
3969203 GVVFSSGENWKVMRRFTLTTLRDFGMGKKAIEDYVVEEYGYLADVIESQK  3969352
3970285 GKPLEMTHLMNSAVANVIVSILLGKRFEYEDPTFKRLVSLINENMRLFGSPSVS  3970446
3971108 LYNMFPILGPFLKDNKSFLENVKEVNDFIKVTFTKYLQVLDK  3971233
3971234 NDQRSFIDAFLVKQQE  3971281
3971703 QNEKANKFFDDENLTEVVRNLFTAGMDTTATTLRWGLLLMMKYPEIQ  3971843
3971973 KKVQEEIDRVIGSNPPRTE  3972029
        HRTKMPY
3972269        TDAVIHEIQRFANILPLNLPHETTMDVTIKGYFIPK  3972376
3972609 GTYIIPLLNSVLQDKTQWEKPCSFHPEHFLNSEGKFVKKDAFIPFSA  3972749
3973027 GRRICAGETLAKMELFLFFTSLLQRFTFQPPPGISSSDLDLSAPPRFVIAPVTHEVCAVSRS  3973212

CYP2AC2   Gallus gallus (chicken)
          NW_060338.1|Gga3_WGA147_1, chr 3 
          BG710846.1 EST, XM_420053.1
3974997 MALVFILTFLFIMKIGGLWSNHWRKNFPPGPRALPIIGNLHLFDLKRPYRTYLQ  3975158
3976589 LSKEYGPVFSVQMGQRKIVVISGYETVKEALINQADAFAERPKIPIFEDLTRGN  3976750
3977081 GIVFAHGENWKVMRRFTLTTLRDFGMGKRAIEDRIVEEYGYLIDNVGSQE  3977230
3977626 GKPFDASKIINAAVANIIVSILLGKRFDYKDSRFIRLQHLTNESMRLAGKPLVT  3977787
3978987 MYNIFPYLGFLLRANKTLLKNRDEFHAYVKATFLENLKTLDKNDQRSFIDAFLVKQQE  3979160
3979765 EKSITNGYFHNGNLLSLVSNLFTAGVETISTTLNWSFLLMLKYPEIQSKVQ  3979917
3980773 EEIEQVIGSNPPRIEHRTQMPYTDAVIHEVQRFANILPLDLPHETAEDVTLKDYFIPK  3980946
3981123 GTYIIPLLTSVLRDKSQWEKPDMFYPEHFLDSKGKFVKKDAFMPFSA  3981263
3982308 GRRICAGETLAKMELFLFFTSLLQRFTFQPPPGVSSSDLDLSPAISFNVVPKPYKICAVARS  3982493

2AD Subfamily

CYP2AD1     Fugu rubripes (pufferfish)
            No accession number
            Scaffold_805, Old Scaffold_3261d
            Formerly CYP2N12
92399 MILQKIFAYMDFSSWVLLIFLVLLITDVIRNWTPHNFPPGPWAMPFVGNIFTGVDFRTIEK (0) 92217
92102 LSQKYGPVFSLRRGNTRTVFINGYKMVKEALVSQLDSFEDRPVVPLFHVVFKGI (1) 91941
91785 GIALSNGYMWKKQRKFAHTHLRYFGEGQKLLENHIQMESKFMCEAFKDEQ 91633 (1 gc boundary ?)
91552 GKPFDPQYTITNAVGNIISALVFGHRFEYSDASFRRILELDNEAVVLAGSARTQ 91391 (0)
91307 LYDSFPSLMKHLPGPHQTVHANYGKITDFLKKEVDKHMEEWNPEDPRDYVDTYLSEMEK 91131 (0)
90784 MNQDPQGGFNVETLLICILDLIEAGTESAATTLRWGLVFILNYPDVQ 90644 (1)
90564 EKVQEEIDRVIGQSRQPAMADRPNMPYTDAVIHEIQRFANVVPAGFPKMATKDTTVGGYFIPK 90376 (0)
90287 GLAITTMLSSVLFDKNEWETPDVFNPNHFLDSEGRFRKRDAFIPFSA 90147 (1)
90043 GKRVCIGENLAKMELFLFFTSILQHFNLSPVPGQMPSLEGILGFTYSPQPFRMIVAPR* 89867

CYP2AD2     Danio rerio (zebrafish)
            GenEMBL AF248042
            Tanguay R.L.
            75% to 2AD3 
MILHLIYDSFDFKSWIIFFVVFLIIAEMIKNRTPSNYPPGPWPL
PFLGTVFTKMDFKNINKLAKVYGKVFSLRVGSEKMIIVSGYKMVKEALVTQNDSFVLR
PPVPLFHKVYKGIGLTMSNGYIWRSHRRFAASHLRTFGEGKKNLELGIQQECVYLCDA
FKAEKEPFNPIFILHGAVSNTVACLTFGQRFDYNDEWYQEILRLDNQCVQLAGSPRVQ
LYNAFPKLLDYLPGPHQKVFSNYKKITQSLKDEIIKHREDWDPANPRDFIDNYLTEME
KKKSDPQAGFNIESLIISCLDIVEAGTETGATTLRWGLLFMIKFPEIQKKVQAEIDRV
IGQSRQPCLDDRVNMPYTEAVLHEIQRFGDVVPLGFPKQAAVDTKIGNYFIPKGTSIT
TNLSSVLHDPNEWETPDTFNPGHFLDKNGQFRKRDAFLPFSAGKRACVGELLARNVLF
LFFTSLLQQFTLSKCPGEEPSLEGEIWFTYAPAPFRISVSVR

CYP2AD3     Danio rerio (zebrafish)
            No accession number
            Tseng, H.-P., Wang-Buhler, J.-L., Yang, Y.-H.,
            Hu, C.-H., Buhler, D.R.
            submitted to nomenclature committee 12/08/2003
            75% to CYP2AD2 60% to CYP2AD1
            clone name YH-B1-FL

CYP2AD4     Oryzias latipes
            GenEMBL BJ494553 EST
            70% to CYP2AD1

CYP2AD5     Gasterosteus aculeatus
            GenEMBL CD499490 EST
            67% to CYP2AD1

CYP2AD6     Danio rerio (zebrafish)

CYP2AD7    Oryzias latipes (medaka)
           chr4 28086098:28094682
           Joanna Wilson and students
           submitted to nomenclature committee Jan. 25, 2008
           61% identical to zebrafish 2AD2
           73% to 2AD1 (FORMERLY 2N12) 
           probable GC boundary based on mRNA EF546460.1
MIFQALFDRMDFNSWLVFGFVLLLLIDIVKTWKPPKFPPGPLSVPFLGNVFTGVDFKTMEKLSQDFGPVFSLRRG
SERMVFISGYKMVKEALVTQLDSFVDRPIVPLFHVVFKGLGIALSNGYLWKKQRKFANAHLRYFGEGQKSLERYI
EIESNFLCDAFKEEQ (1 GC boundary)
GRPFNPHYLITNAVGNIISSVVFGHRFEYSDPSFRKVLELDNEAVVLSGSARTQLYDAFP
SLLNYLPGPHQTVHANYREIVCFLRKEIEKHQEEWNPEDPRDYIDVYLSEMEKTKQDPQAGFNIETLVVSTLDLI
EAGTETTATTLRWGLMFMLHHPEIQEKVQEEIDRVIGQSRQPAMSDRPNLPYTDAVIHEIQRMGNIVPLGFPKMA
SKDTTLGGYFIPKGTPITTILSSVLFDKNEWETPHVFNPGHFLDSEGRFLKKEAFLPFSAGKRMCLGEHLAKMEL
FLFFSTLLQRFTFKPVPGEMPSLEGVLGFTHSPEEFRFLALPR*

2AE Subfamily

CYP2AE1     Danio rerio (zebrafish)
            NA7219 zfishG-a147a09.q1c zfishG-a1551g08.q1c Z35723-a848d07.q1c 
            49% to 2P6 48% to 2N13 46% to 2V1 46% to 2AD2
28876 MSSVFSQLIGQWLDVQGFLIFLCVLLLVKHFRDVYSKNMPPGPFPLPFVGNLTNIGFSDP 28715
28714 LGSFQR 28697 (0)
28473 IAEKYGDVCTLYLGTKPCILMTGYDTLKEAFVEQADIFTDRPYFPIVDKLGN 28336 (1?)
26270 AGLIMSSGHMWRQQRRFALATLKYFGVGKKTLENAILQECRFLCDSLQAER 26118
25139 GLPFDPQHLVTNAVSNIICGLVFGHRFEYDDHQFHLMQTYINNILQLPISNWGR 24978
24700 LYNEFPTLMSLLPGKHQTAFASMSKLQPFLKEEITKHQQDREPSSPRDYIDCYLEEIEK 24524
21648 QCKDSDAEFTEENLMFCVVDLFGAGTETTSNTLRWALAFMVKYPDVQ 21508
21386 EKVQSEIDQVIGQTRQPLMDDRTNLPYTYAVIHEIQRFANIVTFTPPRVANKDTTVGGQLIPK 21198
18506 GVIVLPMLKPILLDKKEYSTPYDFNPDHFLDQNGKFLKKENFIPFSI 18366
14291 GKRMCPGEQLAGMELFLFFISLMQHFTFLPPEGETLSLKIFLAIASAPAPFRI 14133
      KAVPRQCDNTAS*

CYP2AE1-de9 Danio rerio (zebrafish)
            NA7219 
            extra exon 9 6kb downstream of 2AE1
8074 GKRMCPGEQLARMELFLFFISLMQHFTFLPVEGQKLSLKGTTSVSSAPQPFQI 7916

2AF Subfamily

CYP2AF1   Phalacrocorax carbo (Common cormorant)
          No accession number
          Hisato Iwata
          submitted to nomenclature committee 5/19/05 
          45% to 2C11 rat
          this is a new vertebrate subfamily