P450s that have appeared since the 1993 P450 nomenclature update. Part C covering CYP10 to CYP69 Animal P450 names continue at CYP301 in this file Lower eukaryote names continue at CYP501-699 in this file Lower eukaryote names have used up all 3 digit names and now Continue at CYP5001. ALL Drosophila melanogaster P450S ARE NOW INCLUDED This includes list references that were incomplete and duplications of sequences that were already in the update. If a sequence is assigned an accession number that was not in the old update it is included in this list. Some expressed sequence tags (ESTs) are also included from humans. This list last revised Aug. 24, 2009 (adding in all new fungal CYP names) This list was last revised on Dec. 31, 2007. Added all Neurospora P450s see 527A1-553A1 Added CYP51 Trypanosoma brucei Added Fugu P450s, added CYP505A3 added CYP18 Spodoptera Added all human genes and pseudogenes Added 104 Anopheles genes Revised May 13, 2003 named all remaining Dictyostelium P450 genes Many sequences have been added without updating this list (May 11, 2005) Added names for Nectria haematococca (158 genes/pseudogenes) Feb. 16, 2006 Added all Phanerochaete chrysosporium (white rot fungi) genes Aug. 29, 2007 Compiled by David R. Nelson There are 80 C. elegans P450s listed here, surpassing the known mouse and human complements. The C. elegans genome is now about 90% finished sequence and 9% unfinished (one strand or one chemistry). About 1Mb is not cloned at three telomeres and two internal sites. A special issue of Science appeared in Dec. 98 even though the genome is not completely done. The amount of sequence in the Blast searchable database at Washington Univ. is 140Mb, more than the 100Mb size of the genome. Therefore, we can guess that this set includes all the P450 genes in C. elegans, but the distrubution is not even. Most P450 genes (43 genes) are on chromosome V. see additional info on C. elegans P450s see this list. To see the actual sequences go to the C. elegans sequence file. So far we are missing CYP11A, CYP11B, CYP17, CYP19, CYP21, CYP24 and CYP27A and CYP27B. Does C. elegans make steroids? The present evidence would suggest not. Does C. elegans have mitochondrial P450s? There is one probable mitochondrial P450 in C. elegans on cosmid ZK177 named CYP44. It was thought to be incomplete but now a full length sequence has been assembled from the genomic sequence. 10A Subfamily 11A Subfamily 11B Subfamily 12A Subfamily 12B Subfamily 13A Subfamily 13B Subfamily 14A Subfamily 16A Subfamily 17A Subfamily 18A Subfamily 19A Subfamily 21A Subfamily 22A Subfamily 23A Subfamily 24A Subfamily 25A Subfamily 26A Subfamily 27A Subfamily 27B Subfamily 28A Subfamily 29A Subfamily 30A Subfamily 31A Subfamily 32A Subfamily 33A Subfamily 33B Subfamily 33C Subfamily 33D Subfamily 33E Subfamily 34A Subfamily 35A Subfamily 35B Subfamily 35C Subfamily 35D Subfamily 36A Subfamily 37A Subfamily 37B Subfamily 40A Subfamily 41A Subfamily 42A Subfamily 43A Subfamily 44A Subfamily 45A Subfamily 46A Subfamily 51A Subfamily 52A Subfamily 52B Subfamily 52C Subfamily 52D Subfamily 52E Subfamily 52F Subfamily 53A Subfamily 53B Subfamily 54A Subfamily 55A Subfamily 56A Subfamily 57A Subfamily 58A Subfamily 59A Subfamily 60A Subfamily 60B Subfamily 61A Subfamily 62A Subfamily 63A Subfamily 64A Subfamily 65A Subfamily 66A Subfamily 67A Subfamily 10A Subfamily CYP10A1 Lymnaea stagnalis (pond snail) GenEMBL S46130 (1870bp) PIR JX0225 (545 amino acids) AAB23599.1 Teunissen,Y., Geraerts,W.P., van Heerikhuizen,H., Planta,R.J. and Joosse,J. Molecular cloning of a cDNA encoding a member of a novel cytochrome P450 family in the mollusc Lymnaea stagnalis. J. Biochem. 112, 249-252 (1992) Mitochondrial clan 1 MAIMKKFIHH SLKQLIKPNL TSTKRVVSTS PRKEQGVAAI SLEPSEMAQC PFRKSIDTFT 61 ETTNAVKAPG MTEVQPFERI PGPKGLPIVG TLFDYFKKDG PKFSKMFEVY RQRALEFGNI 121 YYEKVGHFHC VVISSPGEYS RLVHAERQYP NRREMVPIAY YRKQKGFDLG VVNSQGEEWY 181 RQRTVVSKKM LKLAEVSNFS TQMGEVSDDF VKRLSHVRDS HGEIPALERE LFKWAMESIG 241 TFLFEERIGC LGQETSPMAQ TFIANLEGFF KTLQPLMYNL PTYKLWSTKL WKQFENYSDN 301 VIDIGRSLVE KKWHPCKMEV TQNLHLISYL VNNGSMSTKE VTGLIVDLML AAVETTSSAT 361 VWCLYNLAKN PQVQEKLFQE ITEAQAKNNG TISAEDLCKL PMVKAVVKET LRLYPITYST 421 SRNIAEDMEL GGYTIPAGTH VQANLYGMYR DPSLFPEPEG ILPERWLRMN GSQMDATIKS 481 TSQLVWGHGA RMCLGRRIAE QEMHITLSKI IQNFTLSYNH DDVEPILNTM LTPDRPVRIE 541 FKPRQ CYP10A2 Biomphalaria glabrata (bloodfluke planorb, carries schistosomiasis) ESTs EV821686, EV821681, EV817279 Probable mitochondrial P450 LFDYFKKDGPRFNKI FQVQQQRSAQYGEIYYEKIADFESVIESSPEEYNRLERSEGKYPCRREMA PMAYYNRKKGYALGIVNSQGKDWYNLRTVVSKKMLKLSEVS YGMFHNPDYFPEPETFLPERWLKENKMDATIKSASQLVWGHGARMCLGRRLAEQELHI TIAKIIENFQLNYHGDEVQPILNTTMVPDRPVNIQFSRRQISKDTVFNNTH* CYP10A3 Lottia gigantea (owl limpet) JGI Protein ID:233081 53% to CYP10A1 MSSLIPKTFAKIAVTALPEIATKRVVSTSVAHKQSARVIEVDPLTLGTC PFRSTFEKVESIVNPIEPKVEPQELTKETGIKSFEHVPGPK GLPILGSLLDYFKKDGLRFDK MFEAFKVRSIEFGPVYK ESIGPIDTVVISDPAEYAKVIRADGKFPNRKEMEPMAF YREQKGIGLGLVNSQGEEWHRQRSAVIKKML MLNEVQDFSQPMNVVANDFITHLSSSRDASGE IQQLDKQIFKWAMESIGTFLFEERIGCLNEK PTELAQEFIDTLQKYFRLMQQLMYNLPMYKVFRTKKWQEFEK LSDRIMEVGREFVDKKIE KLQNADPSTSQEKGAFLTHLISQKS LSTSDVTSNAVDLLSAAVETTSNASLWCLYNLATN PEAQQTMFNEINTVLPNKEDVTPQALSKLPYVKAVLK ETFRKYPITYATSRFLPENLEVGGYNIPAG THVQANLYGMYSD AKIFPEPEKFKPERWLRESKMDSQT KSLSNLIWGHGARMCIGRRFAEQEMHIMLTK IIQNFKLEYHHEPVEPTLNT VMTPDRPVQIKFVPRS CYP10B1 Capitella capitata (Gallery worm, polychaete worm, annelid) JGI Protein ID:143013 46% to CYP10A1 MKGGQVVRKEKKLKPFKDIPGPKG LPVIGTLLEFTKKDGLKFNK MFEVMTSRSKEFGPVYK ERIGMIESVIVSDPHEYAKVIQVDGKHPHRIELFPMVH YRQKKKMALGTVNAQGEEWYRGRMVLSKVML KPKEVQDYVIAQSDVGNDFIRHMKSIVSSDGQ LISFEREIFKWALESICVVLFEERIGCFPNP PTEKAQEFITNLIGFFKYMQPLMYNFPMYKIYPTKTWRQYEK HGDIVVGIG LDMVRKVKKESKSEFLTYLLAQKS LSPVEANSHAVDLMMGAVETTANSMMWMLYCLSNF PDAQRKLQEEIDKIVPQDERITPEVLSKMKYTKACLK ETFRLFPITFATSRMIHEEIELGGYHIPKG THCQANLWGMGRD PDYFSDPLTFKPERWLRDIQTLEHH NPHAVLPFGHGARMCIGLRFAEQEIYIAVAK LMKHFTVKHEGELLPVLNT VMTPDRPVNFRFLLRTKPALSGLGDTKKPIAVELK CYP10B2 Capitella capitata (Gallery worm, polychaete worm, annelid) JGI Protein ID:134490 87% to CYP10B1, 47% to CYP10A1 KLKPFKDTPGPK GLPVIGTLLEFTKKDGLKFNK MFEVMTSRSKEFGSVYK ERIGFLESVIVSDPDEYTKVIKVDGKYPHRIEMLPMVH YRQKKKMALGTVNAQGEEWYRARMVLSKVML KPKEVQEYVIAQSDVGNDFIKHMKSIASNDGQLLN FEREIFKWALESICVVLFEERIGCFPNP PTERAQEFIANLIGFFKYMQPLMYNLPVYKIYPTKTWRKYEK HGDTVFGIGLDMV TKVFLPGEKESKSEFLTYLLAQKS LSPVEANSHAVDLMIGAVETTANSMMWMLYCLSNF PEVQRKLQDEIDATLPQDEHITPEILSRLKYTRACLK ETFRLYPVTFATSRMIHKDIELGGYHIPKG THCQANLWGMGRD PDYFSDPLTFKPERWQRDTQTLEYH NPHAVLPFGHGARMCIGRRFAEQEIYIAVAK LMKHFTVEHVGHLHPILIT VMIPDRPVNFRFRPRTKPER CYP10C1 Helobdella robusta (leech, annelid worm) JGI Protein ID:67702 48% to CYP10B2, 41% to CYP10A1 MRKYIIQIISRNKIHNFNNYDRIGNNILKV NCVSTSAAAFDNTDLLVSNANVLTENKLAELKSFESLPGPK GLPLFGTLLSYVRPGGLSFRK MFKVMDMRAKQYGSVYK ETIGSNLTSVIVSDMEEYLKVIRADGKMPNRIELEPFKY FRERNNFDIGIVNSQGLQWLEKRQILGPKIM ALSEILDHVGPINDVTMDFLKRMDEKLLDRDGLFK NLERNLFTWALEAVGTYLFEARIGCLNEP IPPFADMFITHLSGMFREMQGLMYGPPIYKYISTRNWKRFED HCYKIM EAGMTLIEETSSSSALLTFLLSQKS MSASSVNLTSIDVMIGAIETTSNSFTWMLYCLARN PEVQEKLFEEVLQYLPNKDSPVTREVIRKMDYVRACLK ETLRFYPITFATSRKLASEIELKGFRIPKG THVQANLWTIGRD ERLFPHPEKFNPDRWLRKSHEVSHLA TALSHLPFGHGPRMCIGQRLAEMEIYTMVSK LIRNFKVECAEKGEVQPILMT VLTPDRPMMLKFKRR 11A Subfamily CYP11A1 human PIR A48733 (239 amino acids) Matteson, K.J., Chung, B.C., Urdea, M.S. and Miller, W.L. Study of cholesterol side-chain cleavage (20,22 desmolase) deficiency causing congenital lipoid adrenal hyperplasia using bovine-sequence P450scc oligodeoxyribonucleotide probes. Endocrinology 118, 1296-1305 (1986) CYP11A1 human GenEMBL M14565 Chung BC, Matteson KJ, Voutilainen R, Mohandas TK, Miller WL. (1986) Human cholesterol side-chain cleavage enzyme, P450scc: cDNA cloning, assignment of the gene to chromosome 15, and expression in the placenta. Proc Natl Acad Sci U S A. 83, 8962-8966. First complete human CYP11A1 cDNA CYP11A1 rabbit GenEMBL S59219 (1336bp) PIR A49189 (445 amino acids) Yang,X., Iwamoto,K., Wang,M., Artwhol,J., Mason,J.I. and Pang,S. Inherited congenital adrenal hyperplasia in the rabbit is caused by a deletion in the gene encoding cytochrome P450 cholesterol side chain cleavage enzyme. Endocrinol. 132, 1977-1982 (1993) CYP11A1 bovine PIR A42033 (18 amino acids) Pikuleva, I.A., Lapko, A.G., Chashchin, V.L. Functional reconstitution of cytochrome P-450-scc with hemin activated with Woodward's reagent K. Formation of a hemeprotein cross-link. J. Biol. Chem. 267, 1438-1442 (1992) CYP11A1 Bos taurus (cow) See cattle page for details MLARGLPLRSALVKACPPI LSTVGEGWGHHRVGTGEGAGISTKTPRPYSEIPSPGDNGWLNLYHFWREKGSQRIHFRHI ENFQKYGPIYREKLGNLESVYIIHPEDVAHLFKFEGSYPERYDIPPWLAYHRYYQKPIGV LFKKSGTWKKDRVVLNTEVMAPEAIKNFIPLLNPVSQDFVSLLHKRIKQQGSGKFVGDIK EDLFHFAFESITNVMFGERLGMLEETVNPEAQKFIDAVYKMFHTSVPLLNVPPELYRLFR TKTWRDHVAAWDTIFNKAEKYTEIFYQDLRRKTEFRNYPGILYCLLKSEKMLLEDVKANI TEMLAGGVNTTSMTLQWHLYEMARSLNVQEMLREEVLNARRQAEGDISKMLQMVPLLKAS IKETLRLHPISVTLQRYPESDLVLQDYLIPAKTLVQVAIYAMGRDPAFFSSPDKFDPTRW LSKDKDLIHFRNLGFGWGVRQCVGRRIAELEMTLFLIHILENFKVEMQHIGDVDTIFNLI LTPDKPIFLVFRPFNQDPPQA* CYP11A1 Sus scrofa (pig) GenEMBL X13768 (see also NM_214427) Mulheron GW, Stone RT, Miller WL, Wise T. (1989) Nucleotide sequence of cytochrome P-450 cholesterol side-chain cleavage cDNA isolated from porcine testis. Nucleic Acids Res. 17, 1773 CYP11A1 Sus scrofa (pig) GenEMBL L34259 (2376bp), NM_214427 Urban,R.J., Shupnik,M.A. and Bodenburg,Y.H. Insulin-like growth factor-I increases expression of the porcine P-450 cholesterol side chain cleavage gene through a GC-rich domain. J. Biol. Chem. 269, 25761-25769 (1994) MLARGLALRSVLVKGCQPFLSAPRECPGHPRVGTGEGACISTKT PRPFSEIPSPGDNGWINLYRFWKEKGTQKIHYHHVQNFQKYGPIYREKLGNLESVYII DPEDVALLFKFEGPNPERYNIPPWVAYHQHYQKPVGVLLKKSGAWKKDRLVLNTEVMA PEAIKNFIPLLDTVSQDFVGVLHRRIKQQGSGKFSGDIREDLFRFAFESITNVIFGER LGMLEEIVDPEAQKFIDAVYQMFHTSVPMLNLPPDLFRLFRTKTWRDHVAAWDTIFNK AEKYTQNFYWDLRRKREFNNYPGILYRLLGNDKLLSEDVKANVTEMLAGGVDTTSMTL QWHLYEMARSLNVQEMLREEVLNARRQAQGDTSKMLQLVPLLKASIKETLRLHPISVT LQRYLVNDLVLRDYMIPAKTLVQVAVYAMGRDPAFFSNPGQFDPTRWLGKERDLIHFR NLGFGWGVRQCVGRRIAELEMTLFLIHILENFKVELQHFSDVDTIFNLILMPDKPIFL VFRPFNQDPLQA CYP11A1 goat GenEMBL D50058 (1825bp) Okuyama,E., Okazaki,T., Furukawa,A., Wu,R.-F. and Ichikawa,Y. Molecular cloning and nucleotide sequences of cDNA clones of sheep and goat adrenolcortical cytochrome P450scc. unpublished (1995) CYP11A1 sheep GenEMBL D50057 (1825bp) Okuyama,E., Okazaki,T., Furukawa,A., Wu,R.-F. and Ichikawa,Y. Molecular cloning and nucleotide sequences of cDNA clones of sheep and goat adrenolcortical cytochrome P450scc. unpublished (1995) CYP11A1 bovine PIR S29644 (21 amino acids) Tsujita, M. and Ichikawa, Y. Substrate-binding region of cytochrome P-450(scc) (P-450 XIA1). Identification and primary structure of the cholesterol binding region in cytochrome P-450(scc). Biochim. Biophys. Acta 1161, 124-130 (1993) Cyp11a1 mouse GenEMBL J05511, NM_019779, AF195119 Rice,D.A., Kirkman,M.S., Aitkin,L.D., Mouw,A.R., Schimmer,B.P. and Parker,K.L. Analysis of the promoter region of the gene encoding mouse cholesterol side-chain cleavage enzyme J. Biol. Chem. 265, 11713-11720 (1990) MLAKGLSLRSVLVKGCQPFLSPTWQGPVLSTGKGAGTSTSSPRS FNEIPSPGDNGWLNLYHFWRESGTQKIHYHQMQSFQKYGPIYREKLGTLESVYIVDPK DASILFSCEGPNPERFLVPPWVAYHQYYQRPIGVLFKSSDAWKKDRIVLNQEVMAPGA IKNFVPLLEGVAQDFIKVLHRRIKQQNSGNFSGVISDDLFRFSFESISSVIFGERMGM LEEIVDPEAQRFINAVYQMFHTSVPMLNLPPDFFRLLRTKTWKDHAAAWDVIFNKADE YTQNFYWDLRQKRDFSQYPGVLYSLLGGNKLPFKNIQANITEMLAGGVDTTSMTLQWN LYEMAHNLKVQEMLRAEVLAARRQAQGDMAKMVQLVPLLKASIKETLRLHPISVTLQR YTVNDLVLRNYKIPAKTLVQVASFAMGRDPGFFPNPNKFDPTRWLEKSQNTTHFRYLG FGWGVRQCLGRRIAELEMTILLINLLENFRIEVQNLRDVGTKFSLILMPENPILFNFQ PLKQDLGPAVTRKDNTVN CYP11A1 Xenopus tropicalis See Xenopus pages for seq CYP11A1 Danio rerio (zebrafish) See xebrafish pages for seq CYP11A1 Oncorhynchus mykiss (rainbow trout) GenEMBL S57305 (1789bp) Swiss Q07217 (514 amino acids) PIR S32197 (514 amino acids) Takahashi,M., Tanaka,M., Sakai,N., Adachi,S., Miller,W.L. and Nagahama,Y. Rainbow trout ovarian cholesterol side-chain cleavage cytochrome P450 (P450scc). cDNA cloning and mRNA expression during oogenesis. FEBS Lett. 319, 45-48 (1993) CYP11A1 Fugu rubripes (pufferfish) No accession number Scaffold_1630 37515 MARWSVWRSPVVLPLSRMEVPMTGARHSSTMPVARQTYSDSSSFV 37381 37380 RSFNDIPGLWKNGVANLYNFWKLDGFRNLHHIMVQNFNTFGPIYR 37246 (2) 37065 EKIGYYESVNIINPEDAAILFKAEGHYPKRLKVEAWTSYRDYRNRKYGVLLK 36904 (2) 36822 NGEEWRCNRVLLNKEVISPKVLENFVPLLDEVGNDFVVRVHKKIARSGQNKWTTDLSQELFKYALES 36628 (1) 36506 VSSVLYGERLGLFLDYIDPEAQHFIDCISLMFKTTSPML YIPPALLRKVGAKVWRDHVEAWDGIFNQ 36300 (1 expected) bad boundary 36208 ADRCIQNIYRRLRQETGPSKKYPGVLASLLLRDKLSIEDIKASITELMAGGVDT 36047 (0) 35975 TSITLLWTLYELARHPNLQEELRAEVAAARTESQGDMLEMLKRIPLVKGALKETLR 35808 (2) 35731 LHPVAVSLQRYIAEDIIIQNYHIPAG 35654 (0) 35568 TLVQLGLYAMGRDPKVFFRPEQYQPSRWLRSETHYFKSLGFGFGPRQCLGRRIAEAEMQLFLIH 35377 (0) 35297 MLENFRVEKQRHMEVQSTFELILLPDKPIILTLKPLSS* 35181 CYP11A1 Dasyatis americana (southern stingray) GenEMBL U63299(4619bp) Nunez,B.S. and Trant,J.M. Isolation of the cDNA encoding the interrenal form of cholesterol side chain cleavage cytochrome P450 of the southern stingray (Dasyatis americana). unpublished (1996) CYP11A2 Danio rerio (zebrafish) See xebrafish pages for seq 11B Subfamily CYP11B1 human GenEMBL J05140 M32863 (1482bp) M32878 (2633bp) M32879 (1155bp) Mornet,E., Dupont,J., Vitek,A. and White,P.C. Characterization of two genes encoding human steroid 11-beta- hydroxylase (P-450-11-beta). J. Biol. Chem. 264, 20961-20967 (1989) CYP11B1 human GenEMBL D16155 (156bp) Naiki,Y., Shizuta,Y., Kawamoto,T., Yasushiro,M., Miyahara,K., Toda,K., Tadao,O. and Imura,H. A nonsense mutation (TGG(116Arg)-TAG(stop)) in CYP11B1 causes steroid 11beta-hydroxylase deficiency. J. Clin. Endocrinol. Metab. 77, 1677-1682 (1993) CYP11B1 human GenEMBL D10169 D90428 (2085bp) Kawamoto,T., Mitsuuchi,Y., Toda,K., Yokoyama,Y., Miyahara,K., Miura,S., Ohnishi,T., Ichikawa,Y., Nakao,K., Imura,H., Ulick,S. and Shizuta,Y. Role of steroid 11beta-hydroxylase and steroid 18-hydroxylase in the biosynthesis of glucocorticoids and mineralocorticoids in humans. Proc. Natl. Acad. Sci. USA 89, 1458-1462 (1992) CYP11B1 human PIR S29068 (30 amino acids) Kawamoto, T., Mitsuuchi, Y., Toda, K., Miyahara, K., Yokoyama, Y., Nakao, K., Hosoda, K., Yamamoto, Y., Imura, H. and Shizuta, Y. Cloning of cDNA and genomic DNA for human cytochrome P-450 (11-beta). FEBS Lett. 269, 345-349 (1990) CYP11B1 Papio hamadryas ursinus (chacma baboon) GenEMBL U52085(228bp) Hampf,M., Swart,A. and Swart,P. Expression of Papio ursinus steroid 11beta hydroxylase unpublished (1996) CYP11B1 rat GenEMBL D10107 (1528bp) S58847 PIR B46040 (499 amino acids) Matsukawa,N., Nonaka,Y., Higaki,J., Nagano,M., Mikami,H., Ogihara,T. and Okamoto,M. Dahl's salt-resistant normotensive rat has mutations in cytochrome P450 (11 beta), but the salt-sensitive hypertensive rat does not. J. Biol. Chem. 268, 9117-9121 (1993) Note: only 1 amino acid difference with 11B1 at position 84 E (normal) changed to G (seen in 11B2 and 11B3) D10107 has five differences with D11354 CYP11B1 rat GenEMBL S58858 (330bp) PIR JX0251(499 amino acids) Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y., Okamoto,M., Nawata,H. and Omura,T. Three forms of rat CYP11B genes: 11 beta-hydroxylase gene, aldosterone synthase gene, and a novel gene. J. Biochem. 113, 144-152 (1993) Note: Fig. 3 has 6 errors. 1,2) aldo-46 amino acids 2 and 3 are incorrectly translated gctctc = AL not HS. 3) 11B3 codon at 559-561 incorrectly translated gac = D not N. 4,5,6) 11beta-62, 11B1 and 11B3 codon at 964-966 tcc = Ser not Pro. CYP11B1 rat GenEMBL S58849, D14086 to D14091 PIR A46039 (499 amino acids) Mukai,K., Imai,M., Shimada,H. and Ishimura,Y. Isolation and characterization of rat CYP11B genes involved in late steps of mineralo- and glucocorticoid syntheses. J. Biol. Chem. 268, 9130-9137 (1993) CYP11B1 rat GenEMBL S63899 (596bp) Mukai,K., Imai,M., Shimada,H., Okada,Y., Ogishima,T. and Ishimura,Y. Structural differences in 5'-flanking regions of rat cytochrome P-450aldo and P-450(11) beta genes Biochem. Biophys. Res. Commun. 180, 1187-1193 (1991) CYP11B1 rat GenEMBL D11354 (1528bp) PIR A46040 (499 amino acids) Matsukawa,N., Nonaka,Y., Higaki,J., Nagano,M., Mikami,H., Ogihara,T. and Okamoto,M. Dahl's salt-resistant normotensive rat has mutations in cytochrome P450 (11 beta), but the salt-sensitive hypertensive rat does not. J. Biol. Chem. 268, 9117-9121 (1993) Note: This sequence is the wild type DS rat sequence. The mutant DR rat has five amino acids differences accession number D10107 CYP11B1 Cavia porcellus (guinea pig) GenEMBL Z69785(2028bp) Bulow,H.E., Mobius,K., Bahr,V. and Bernhardt,R. Molecular cloning and functional expression of the cytochrome P450 11B-hydroxylase of the guinea pig. Biochem. Biophys. Res. Commun. 221, 304-312 (1996) CYP11B1 bovine PIR JX0151 (503 amino acids) Kirita, S., Hashimoto, T., Kitajima, M., Honda, S., Morohashi, K. and Omura, T. Structural analysis of multiple bovine P-450(11beta) genes and their promoter activities. J. Biochem. 108, 1030-1041 (1990) CYP11B1 Bos taurus (cow) See cattle page for details MALWAKARVRMAGPWLSLHEARLLGTRGAAAPKAVLPFEAMPRC PGNKWMRMLQIWKEQSSENMHLDMHQTFQELGPIFRYDVGGRHMVFVMLPEDVERLQQ ADSHHPQRMILEPWLAYRQARGHKCGVFLLNGPQWRLDRLRLNPDVLSLPALQKYTPL VDGVARDFSQTLKARVLQNARGSLTLHIAPSVFRYTIEASTLVLYGERLGLLTQQPNP DSLNFIHALEAMLKSTVQLMFVPRRLSRWMSTNMWREHFEAWDYIFQYANRAIQRIYQ ELALGHPWHYSGIVAELLMRADMTLDTIKANTIDLTAGSVDTTAFPLLMTLFELARNP EVQQAVRQESLVAEARISENPQRAITELPLLRAALKETLRLYPVGITLEREVSSDLVL QNYHIPAGTLVKVLLYSLGRNPAVFARPESYHPQRWLDRQGSGSRFPHLAFGFGVRQC LGRRVAEVEMLLLLHHVLKNFLVETLEQEDIKMVYRFILMPSTLPLFTFRAIQ CYP11B1 pig GenEMBL D38590(1671bp) Sun,T., Zhao,Y., Nonaka,Y. and Okamoto,M. Cloning and expression of cytochrome P450(11 beta) of porcine adrenal cortex. J. Steroid Biochem. Mol. Biol. 52, 227-232 (1995) MAIWAKAEAWLAGPWLALNRARTLGTRAVLAPKGVLPFEAIPQF PGKKWMRVLQLWREQGFENNHLEMHQTFQELGPIFRFDVGGRNMVLVMLPEDVERCQK VEGLHPQRDVPGPWLAYRHLRGHKCGVFLLNGPTWRLDRLQLNPGVLSLQAMQKFTPL VDGVARDFSQALRARVMQNARGSLTLDIKPSIFRYTIEASNLVLFGERLGLLAHQPNP ESLDFIHALEVMFKSTVQLMFMPRSLSRWTSTGTWKEHFEAWDCIFQYANKAIQRLYQ ELTLGHPWHYSGVVAELLTHANMTVDAIKANSIDLTAGSVDTTAYPLLMTLFELARNP EVQQALRQESLAAAARISENPQKAITELPLLRAALKETLRLYPVGIFLDRCVTSDLVL QNYHIPAGTLVKVLLYSLGRNPAVFARPERYHPQRWLDNQGSGTRFPHLAFGFGMRQC LGRRLAQVEMLLLLHHVLKNFLVETLVQEDIKMIYRFIMTPSTLPLLTFRAIS CYP11B1 sheep GenEMBL L34337 (1639bp) Boon,W.C., Roche,P.J., Hammond,V.E., Jeyaseelan,K., Crawford,R.J. and Coghlan,J.P. Cloning and expression analysis of a cytochrome P450-11-beta cDNA in sheep. unpublished (1994) CYP11B1 Ovis aries (sheep) GenEMBL L28716 (2300bp) Anwar,A., Jeyaseelan,K. and Coghlan,J.P. Molecular cloning and characterization of the ovine CYP11B1 promoter. Biochem. Mol. Biol. Int. 33, 1169-1178 (1994) CYP11B1 Ovis ammon (sheep) GenEMBL L47569(9027bp) Anwar,A., Jeyaseelan,K. and Coghlan,J.P. Characterization of an ovine 11-beta hydroxylase (cyp11b) gene. unpublished (1995) Cyp11b1 mouse PIR A41552 (500 amino acids) Domalik, L.J., Chaplin, D.D., Kirkman, M.S., Wu, R.C., Liu, W., Howard, T.A., Seldin, M.F., Parker, K.L. Different isozymes of mouse 11beta-hydroxylase produce mineralocorticoids and glucocorticoids. Mol. Endocrinol. 5, 1853-1861 (1991) Cyp11b1 mouse PIR A32210 (42 amino acids) Mouw, A.R., Rice, D.A., Meade, J.C., Chua, S.C., White, P.C., Schimmer, B.P. and Parker, K.L. Structural and functional analysis of the promoter region of the gene encoding mouse steroid 11beta-hydroxylase. J. Biol. Chem. 264, 1305-1309 (1989) Cyp11b1 mouse GenEMBL J04451 80% to mouse 11B2 64% to human 11B1 and 11B2 80% to 11B1 rat 78% to 11B2 rat. sequence below is from Ensembl mouse version 3 15.75000001-76000000 mouse chr 15 add 75 million to get global location 306415 MALRVTTDVWLARPWQCLHRTRALGTTATLAPKTLQPFEAIPQYSRNKWLKMIQILREQG 306356 306355 QENLHLEMHQVFRELGPIFR 306176 (?) 305773 HSVGKTQIVFVTLPEDVEKLYQVESTHPCRMPLESWIVHRELRGLGRGVFLL 305618 (?) 304234 NGPEWYFNRLQLNPNVLSPKAVQKFVPLVDGIARDFVDNLKKKMLESVHGSFSMDFQSSV 304175 304174 FNYTIE 304157 (?) 303918 ASHFVLFGERLGLIGRDLSPDSLKFLHTLHSMFKTTTQLLYLPRSLTRWTSTRVWKENLE 303859 303858 SWDFISEYG 303826 (?) 303446 TKCIKNVYRELAEGRPQSWSVTAELVAERTLSMDAIQANSMELIAGSTDT 303413 (0?) 301935 TSTPLVMTFFELARNPDVQQALRQESLAAEASIAANPQKAMSDLPLLRAALKETLR 301768 301535 LYPVGTFLERILSSDLVLQNYHVPAG 301458 301370 TVLNVNLYSMGRNPAVFPRPERYMPQRWLERKRSFKHLAFGFGVRQCLGRRLAEAEM 301314 301313 MLLLHH 301182 (?0) 300923 VLKSFHVETQEK 300888 300887 EDVRMAYRFVLMPSSSPLLTFRPVN 300705 CYP11B1 Xenopus tropicalis See Xenopus pages for seq CYP11B1 Danio rerio (zebrafish) See xebrafish pages for seq CYP11B2 human GenEMBL J05140 M32864 (1809bp) M32880 (3088bp) M32881 (1101bp) Mornet,E., Dupont,J., Vitek,A. and White,P.C. Characterization of two genes encoding human steroid 11-beta- hydroxylase (P-450-11-beta). J. Biol. Chem. 264, 20961-20967 (1989) CYP11B2 human GenEMBL D90429 D10170 (2114bp) Kawamoto,T., Mitsuuchi,Y., Toda,K., Yokoyama,Y., Miyahara,K., Miura,S., Ohnishi,T., Ichikawa,Y., Nakao,K., Imura,H., Ulick,S. and Shizuta,Y. Role of steroid 11beta-hydroxylase and steroid 18-hydroxylase in the biosynthesis of glucocorticoids and mineralocorticoids in humans. Proc. Natl. Acad. Sci. USA 89, 1458-1462 (1992) CYP11B2 human GenEMBL D13752(6910bp) Kawamoto,T., Mitsuuchi,Y., Ohnishi,T., Ichikawa,Y., Yokoyama,Y., Sumimoto,H., Toda,K., Miyahara,K., Kuribayashi,I., Nakao,K., Hosoda,K., Yamamoto,Y., Imura,H. and Shizuta,Y. Cloning and expression of a cDNA for human cytochrome P-450aldo as related to primary aldosteronism. Biochem. Biophys. Res. Commun. 173 (1), 309-316 (1990) CYP11B2 human GenEMBL S77398 S77401 S77403 S77406 S77409 (genomic sequences) Shizuta,Y., Kawamoto,T., Mitsuuchi,Y., Miyahara,K., Rosler,A., Ulick,S. and Imura,H. Inborn errors of aldosterone biosynthesis in humans. Steroids 60 (1), 15-21 (1995) CYP11B2 Bos taurus (cow) See cattle page for details MALWAKARVRMAGPWLSLHEARLLGTRGAAAPKAVLPFEAMPRCPGNKWMRMLQIWKE QSSENMHLDMHQTFQELGPIFRYDVGGRHMVFVMLPEDVERLQQADSHHPQRMILEPWLA YRQARGHKCGVFLLNGPQWRLDRLRLNPDVLSLPALQKYTPLVDGVARDFSQTLKARVLQ NARGSLTLDIAPSVFRYTIEASTLVLYGERLGLLTQQPNPDSLNFIHALEAMLKSTVQLM FVPRRLSRWMSTNMWREHFEAWDYIFQYANRAIQRIYQELALGHPWHYSGIVAELLMRAD MTLDTIKANTIDLTAGSVDTTAFPLLMTLFELARNPEVQQAVRQESLVAEARISENPQRA ITELPLLRAALKETLRLYPVGITLEREVSSDLVLQNYHIPAGTLVKVLLYSLGRNPAVFA RPESYHPQRWLDRQGSGSRFPHLAFGFGVRQCLGRRVAEVEMLLLLHHVLKNFLVETLEQ EDIKMVYRFILMPSTLPLFTFRAIQ* CYP11B2 Sus scrofa (pig) ESTs CV872108.1, BI341717.1 93% to 11B1 pig MAIWAKAEAWLAGPWLALHRARTLGTRAALAPKAVLPFEAIPQCPGNKWMRVLQLWREQG FENIHLDMHQTFQELGPIFRFDVGGRHMVLVMLPEDVERLQKVEGLHPQRMFLEPWLAYR QLRGHKCGVFLLNGPTWRLDRLQLNPGVLSLQAMQKFTPLVDGVARDFSQALRARVMQNA RGSLTLDIKPSIFRYTIEASNLVLFGERLGLLAHQPNPES CYP11B2 rat GenEMBL S58859 (353bp) PIR JX0252 (500 amino acids) Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y., Okamoto,M., Nawata,H. and Omura,T. Three forms of rat CYP11B genes: 11 beta-hydroxylase gene, aldosterone synthase gene, and a novel gene. J. Biochem. 113, 144-152 (1993) CYP11B2 rat GenEMBL S58850 D14092 to D14097 PIR B46039 (500 amino acids) Mukai,K., Imai,M., Shimada,H. and Ishimura,Y. Isolation and characterization of rat CYP11B genes involved in late steps of mineralo- and glucocorticoid syntheses. J. Biol. Chem. 268, 9130-9137 (1993) CYP11B2 rat GenEMBL S63898 (594bp) Mukai,K., Imai,M., Shimada,H., Okada,Y., Ogishima,T. and Ishimura,Y. Structural differences in 5'-flanking regions of rat cytochrome P-450aldo and P-450(11) beta genes. Biochem. Biophys. Res. Commun. 180, 1187-1193 (1991) CYP11B2 rat GenEMBL S64136 (3001bp) PIR JN0615 (506 amino acids) GenEMBL U14908 (3000bp) Zhou,M. and Gomez-Sanchez,C.E. Cloning and expression of a rat cytochrome P-450 11 beta-hydroxylase/aldosterone synthase (CYP11B2) cDNA variant. Biochem. Biophys. Res. Commun. 194, 112-117 (1993) Erratum: Biochem. Biophys. Res. Commun. 196, 1018 (1993) CYP11B2 rat PIR A34281 (20 amino acids) Swiss P30099 (510 amino acids) Ogishima, T., Mitani, F., Ishimura, Y. Isolation of aldosterone synthase cytochrome P-450 from zona glomerulosa mitochondria of rat adrenal cortex. J. Biol. Chem. 264, 10935-10938 (1989) Note: sequence is from N-terminal after signal sequence CYP11B2 rat Swiss P30099 (510 amino acids) GenEMBL D00567 (2824bp) Swiss P30100 (500 amino acids) GenEMBL D00568 (2705bp) Matsukawa,N., Nonaka,Y., Ying,Z., Higaki,J., Ogihara,T. and Okamoto,M. Molecular cloning and expression of cDNAs encoding rat aldosterone synthase: variants of cytochrome P-450 11beta Biochem. Biophys. Res. Commun. 169, 245-252 (1990) CYP11B2 hamster GenEMBL S73810 (1503bp) LeHoux,J.G., Mason,J.I., Bernard,H., Ducharme,L., LeHoux,J., Veronneau,S. and Lefebvre,A. The presence of two cytochrome P450 aldosterone synthase mRNAs in the hamster adrenal. J. Steroid Biochem. Mol. Biol. 49, 131-137 (1994) CYP11B2 cavia porcellus (domestic guinea pig) GenEMBL AF018569 Buelow,H.E. and Bernhardt,R. Molecular Cloning and Functional Expression of the Guinea Pig Aldosterone Synthase. Unpublished CYP11B2 Rana catesbeiana (bullfrog) GenEMBL D10984 (1919bp) Nonaka,Y., Takemori,H., Halder,S.K., Sun,T., Ohta,M., Hatano,O., Takakusa,A. and Okamoto,M. Frog Cytochrome P-450 (11 beta, aldo), a single enzyme involved in the final steps of glucocorticoid and mineralocorticoid biosynthesis. Eur. J. Biochem. 229, 249-256 (1995) Cyp11b2 mouse GenEMBL S85260 (2804bp) Domalik,L.J., Chaplin,D.D., Kirkman,M.S., Wu,R.C., Liu,W.W., Howard,T.A., Seldin,M.F. and Parker,K.L. Different isozymes of mouse 11 beta-hydroxylase produce mineralocorticoids and glucocorticoids. Mol. Endocrinol. 5, 1853-1861 (1991) Cyp11b2 mouse genEMBL S85260 68% to human 11B2 67% to human 11b1 94% to 11B2 rat 81% to 11B1 rat sequence below is from Ensembl mouse version 3 15.75000001-76000000 mouse chr 15 add 75million to get global location 321034 MALRVTADVWLARPWQCLHRTRALGTTATLAPKTLQPFEAIPQYSRNKWLKMIQILREQG 320975 320974 QENLHLEMHQVFRELGPIFR 320795 (?) 320391 HSVGKTQIVSVMLPEDAEKLHQVESMLPRRMHLEPWVAHRELRGLRRGVFLL 320238 (?) 318803 NGPEWRLNRLRLNRNVLSPKAVQKFVPMVDMVARDFLETLKEKVLQNARGSLTM 318762 318761 DVQQSLFNYTIE 318606 (?) 318477 ASNFALFGERLGLLGHDLSPGSLKFIHALHSMFKSTS 318487 318486 QLLFLPKSLTRWTSTRVWKEHFDAWDVISEYG (?) 318391 318000 NRCIWKVHQELRLGSSQTYSGIVAELISQGSLPLDAIKA 318004 318003 NSMELTAGSVDT 317848 (?) 316981 TAIPLVMTLFELARNPDVQKALRQESLAAEASIAANPQKAMSDLPLLRAALKETLR 316814 (?) 316581 LYPVGGFLERILSSDLVLQNYHVPAG 316504 (0?) 316416 TLVLLYLYSMGRNPAVFPRPERYMPQRWLERKRSFQHLAFGFG 316396 316395 VRQCLGRRLAEVEMMLLLHH 316228 (0?) 315961 ILKTFQVETLRQEDVQMAYRFVLMPSSSPVLTFRPVS 315851 CYP11B3 rat PIR JX0253 (498 amino acids) Nomura,M., Morohashi, K.-i., Kirita, S., Nonaka, Y., Okamoto,M., Nawata,H. and Omura,T. Three forms of rat CYP11B genes: 11 beta-hydroxylase gene, aldosterone synthase gene, and a novel gene. J. Biochem. 113, 144-152 (1993) CYP11B3 rat GenEMBL U14907 (1497bp) Zhou,M., Gomez-Sanchez,E.P., Foecking,M. and Gomez-Sanchez,C.E. Cloning of the cytochrome P-450 CYP11B3 complementary DNA in the rat. unpublished CYP11B3 rat GenEMBL S59144 D14098 to D14103 Mukai,K., Imai,M., Shimada,H. and Ishimura,Y. Isolation and characterization of rat CYP11B genes involved in late steps of mineralo- and glucocorticoid syntheses. J. Biol. Chem. 268, 9130-9137 (1993) Note: only one amino acid difference with Nomura's 11B3 CYP11B3 rat GenEMBL U17082(1497bp) Mellon,S.H., Bair,S.R. and Monis,H. P450c11B3 mRNA, transcribed from a third P450c11 gene, is expressed in a tissue-specific, developmentally, and hormonally regulated fashion in the rodent adrenal and encodes a protein with both 11-hydroxylase and 18-hydroxylase activities. J. Biol. Chem. 270, 1643-1649 (1995) Note Genbank has a seq from rat named CYP11B4 D14108 This is part of the CYP11B8P pseudogene (see below) CYP11B4 cattle Kirita et al. 1990 J. Biochem (Tokyo) 108, 1030-1041. Listed in 1996 nomenclature update As having the same coding region sequence But different flanking regions CB11beta-20 CYP11B5P cattle GenEMBL JU0316 Kirita et al. 1990 J. Biochem (Tokyo) 108, 1030-1041. CB11beta-1 pseudogene CYP11B6P cattle GenEMBL JU0317 Kirita et al. 1990 J. Biochem (Tokyo) 108, 1030-1041. CB11beta-3 pseudogene CYP11B7P cattle GenEMBL JU0318, D00458, D00459 Hashimoto et al. 1989 J. Biochem (Tokyo) 105, 676-679. LambdaB11beta(15-1) and (15-2) Kirita et al. 1990 J. Biochem (Tokyo) 108, 1030-1041. CB11beta-21 pseudogene CYP11B8P rat pseudogene GenEMBL D14104 to D14108 Mukai,K., Imai,M., Shimada,H. and Ishimura,Y. Isolation and characterization of rat CYP11B genes involved in late steps of mineralo- and glucocorticoid syntheses. J. Biol. Chem. 268, 9130-9137 (1993) Note: authors call this sequence 11B4 12A Subfamily CYP12A1 Musca domestica (housefly) GenEMBL U86618 Rene Feyereisen MIKYKQYSRAIVALRQRGAQQYSTNVTNASQPDVKATTTTTISP EWQEAKPFEEMPSMNSWPIIKNMLPWGKYGKMEPTQFLMALRDDMGPIVRTAAFMGRP PTVITHNPHDFEMVFRNEGIWPIRPGGDAQMYHRTVLREDFFQGVTGLVSVNGEKWGN FRSTVNPVLMQPKNVRLYLNKMAQVNDEFMARIRQIRDPETLEVPASFQEEMNRWTLE SVSVVALDKQLGLITTNRDNPDLKKLIGLLNDFFELGQKIEFGLPFWKYIKTPTFKLF MKTLDGLLEIGNKYVNEAIDRLEAERQSGVPEKPENEKSVLEKLIKIDRKIATVMAID MILAGVDTTSTTFTALLLCLAKNPEKQEKLREEIRQILPRKDSQFEPSSLNHIPYTRA CIKEALRMYPLTLGNARILANDTVLSGYRVPKGTLVSMISTGLLQDDNHYTKAKEYLP ERWMRPTKEETEDSATCPHALKASSPFIYLPFGFGPRSCVGRRIVEMELELGIARLVR NFRIEFNYPTENAFKFKLINVPNIPLKFKFTDVEN CYP12A2 Musca domestica (housefly) GenEMBL U94698 Rene Feyereisen CYP12A3 Musca domestica (housefly) GenEMBL U94699 Rene Feyereisen cyp12a4 Drosophila melanogaster GenEMBL AC006091 85973-87865 also AC015190 AC008141 chromosome 3 clone BACR48G05 (D475) MLKVRSALSLIQSQKATLSLATQKHTEYFKILLYIYINKLYYQRWQTNVATAEAREDSEW LQAKPFEQIPRLNMWALSMKMSMPGGKYKNMELMEMFEAMRQDYGDIFFMPGIMGNPPFL STHNPQDFEVVFRNEGVWPNRPGNYTLLYHREEYRKDFYQGVMGVIPTQGKPWGDFRTVV NPVLMQPKNVRLYYKKMSQVNQEFVQRILELRDPDTLEAPDDFIDTINRWTLESVSVVAL DKQLGLLKNSNKESEALKLFHYLDEFFIVSIDLEMKPSPWRYIKTPKLKRLMRALDGIQE VTLAYVDEAIERLDKEAKEGVVRPENEQSVLEKLLKVDRKVATVMAMDMLMAGVDTTSST FTALLLCLAKNPEKQARLREEVMKVLPNKNPEFTEASMKNVPYLRACIKESQRLHPLIVG NARVLARDAVLSGYRVPAGTYVNIVPLNALTRDEYFPQASEFLPERWLRSPKDSESKCPA NELKSTNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRNFNVEFNYPTENAFRSALIN LPNIPLKFKFIDLPN Cyp12a4 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser cyp12a5 Drosophila melanogaster GenEMBL AC006091 83449-85296 also AC015190 AC008141 chromosome 3 clone BACR48G05 (D475) 76% identical to other AC006091 SEQ. 58% TO 12A1, 12A2 MLKGRIALNILQSQKPIVFSASQQ*RWQTNVPTAEIRNDPEWLQAKPFEE IPKANILSLFAKSALPGGKYKNLEMMEMIDALRQDYGNIIFLPGMMGRDG LVMTHNPKDFEVVFRNEGVWPFRPGSDILRYHRTVYRKDFFDGVQGIIPS QGKSWGDFRSIVNPVLMQPKNVRLYFKKMSQVNQEFIKEIRDASTQEVPG NFLETINRWTLESVSVVALDKQLGLLRESGKNSEATKLFKYLDEFFLHSA DLEMKPSLWRYFKTPLLKKMLRTMDSVQEVTLKYVDEAIERLEKEAKEGV VRPEHEQSVLEKLLKVDKKVATVMAMDMLMAGVDTTSSTFTALLLCLAKN PEKQARLREEVMKVLPNKDSEFTEASMKNVPYLRACIKESQRVYPLVIGN ARGLTRDSVISGYRVPAGTIVSMIPINSLYSEEYFPKPTEFLPERWLRNA SDSAGKCPANDLKTKNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRN FNVEFNHSTKNAFRSALINLPNIPLKFKFKFTDVPN* Cyp12a5 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP12A6 Drosophila wassermani no accession number Tina Yee and Phil Danielson 59% to 12A2 56% to 12A1 N-terminal does not match known P450 sequences may be in error CYP12A7 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee March 17, 2005 revised to full length June 27, 2005 Clone name Lc_CYP12A2 59% to 12A5 CYP12A8 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee March 17, 2005 Clone name Lc_CYP12A1 60% to 12A5 67% to 12A7 (365 aa) CYP12A9 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee March 17, 2005 Revised seq Dec 12, 2007 Old Clone name Lc_CYP12A3 New clone name Luce0104J9 (408 aa) 74% identical to Musca Domestica CYP12A2 only 1 aa diff to earlier shorter version CYP12A10 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee March 17, 2005 Clone name Lc_CYP12A4 57% to 12A4 65% to 129A9 (126 aa) CYP12A11 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee Dec. 18, 2007 clone name Luce0102F03 62% identical to CYP12A2 and 57% identical to Dm CYP12a5, 63% to 12A7, 72% to 12A8 partial seq, 64% to 12A9 partial, 67% to 12A10 partial seq. 12B Subfamily CYP12B1 Drosophila acanthoptera no accession number Phil Danielson Ac40 submitted to nomenclature committee Cyp12b2 Drosophila melanogaster GenEMBL AC018326 7227-9141 also AC004345 AC004657 77% identical to 12b1 Cyp12b2 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp12c1 Drosophila melanogaster GenEMBL AC009385 comp(57935-59646) also AC012807 Cyp12c1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp12d1 Drosophila melanogaster GenEMBL AC008187 comp(84371-86114) Cyp12d1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP12D2 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee Dec. 18, 2007 clone name Lce0054K20 53% identity to Dm CYP12d1 probable ortholog of Cyp12d1 Cyp12e1 Drosophila melanogaster GenEMBL AC018294 2070-3932 Cyp12e1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP12F1 Anopheles gambiae (malaria vector) No accession number Submitted by Rene Feyereisen July 5, 2002 Low 40% range with other CYP12s 60% to 12F2 CYP12F1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPl3r4 CYP12F2 Anopheles gambiae (malaria vector) No accession number Submitted by Rene Feyereisen July 5, 2002 Low 40% range with other CYP12s 60% to 12F1 CYP12F2 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPl3r3 CYP12F3 Anopheles gambiae (malaria vector) No accession number Submitted by Rene Feyereisen July 5, 2002 Low 40% range with other CYP12s 60% to 12F2 CYP12F3 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPl3r2 CYP12F4 Anopheles gambiae (malaria vector) No accession number Submitted by Rene Feyereisen July 5, 2002 Low 40% range with other CYP12s 55% to 12F2 CYP12F4 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPl3r1 CYP12F5 Aedes aegypti (yellow fever mosquito) CYP12F6 Aedes aegypti (yellow fever mosquito) CYP12F7 Aedes aegypti (yellow fever mosquito) CYP12F8 Aedes aegypti (yellow fever mosquito) CYP12F9 Culex_quinquefasciatus (southern house mosquito) No accession number Osamu Komagata submitted to nomenclature committee 7/25/07 sequence 15 68% to 12F7 Aedes, 60% to 12F2 CYP12F9 Culex pipiens CYP12F10 Culex pipiens CYP12F11 Culex pipiens CYP12F12 Culex pipiens CYP12F13 Culex pipiens CYP12F14 Culex pipiens CYP12G1 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee March 17, 2005 Clone name Lc_CYP12C 51% to 12C1 (310 aa) CYP12H1 Tribolium castaneum (red flour beetle) GenEMBL XP_966937, DT790363.1 EST 37% to 12c1, 38% to 12A4 mito clan, 42% to 12F2 MSTGSENVISQKKLDHYKNLNALENSDVMPEGWDKAKPFESLPGPKPIPFFGN IWRFLVPKIGDFYGIDFLDLHKIFYEQYGDISILKGMINDPIVVIFNPKDYETLFRNEGIWPIRKGLQSF NEYRKSRKDIFVNAGLLLEEGENWFKIRSVVNPILMQPRNISQYTDKMNSVADELVNNITNLLEESENGE MPENFHNELYKWSLESVGLVTLDTHLGCLKNDLDEDSEPQKLIRSTLEMFKLMHKLDVLPSFHNYVSTPS WRKFVQVMDFIVETNMKYVNQVLDKLEKGETTSTEIPSVLEKLLKVDRNIAVTMSIDMMIAGIDTTGRIL GAALYFLGKNHEAQEKLRSEAISLLQTKDQAVTSEVLKKAPYLKAVIKETTRLAPIGIGNLRTTVKNLVL GGYQIPKGTDVVTSNLVLCTNDEYFSRAKEFIPERWLSTTSGELSKKNTNPFIFAPFGYGPRSCVGKRLA SLELEVALLKIIRNFELDWPHEDMVFKTKMLYGMTEPLKIHVRSV CYP12J1 Leptinotarsa decemlineata (Colorado potato beetle) No accession number Jianhua Zhang Submitted to nomenclature committee April 2, 2006 Clone name P12F5113 59 amino acids from heme to end 42% to 12A4, 57% to CYP12H1 XP_966937 5 aa diffs to CYP12J2 58% to XM_961844.1 PREDICTED: Tribolium castaneum 37% to 12c1, 38% to 12A4, 42% to 12f2 CYP12J2 Leptinotarsa decemlineata (Colorado potato beetle) No accession number Jianhua Zhang Submitted to nomenclature committee April 2, 2006 Clone name P12F231134 59 amino acids from heme to end 43% to 12A4, 57% to CYP12H1 XP_966937 91% to 12J1 CYP12J3 Ips pini (pine engraver, bark beetle) No accession number Pamela Sandstrom Submitted to nomenclature committee on April 26, 2006 Full length sequence 60% to 12J1 Colorado potato beetle (C-term piece) 44% to 12H1 Tribolium castaneum clone name EST 28H12 CYP12K1 Nasonia vitripennis (jewel wasp) See wasp page CYP12L1 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee Dec. 18, 2007 clone name Luce0117K18 46% identical to M. domestica CYP12A2 13B Subfamily CYP13B1 Caenorhabditis elegans (nematode worm) GenEMBL Z54269 (19692bp) and Z92827 (C29F7) F02C12.5 also C29F7 has the end of this sequence Formerly CYP16A1 but renamed after full length sequence sorted with CYP13 CYP13B2 Caenorhabditis elegans (nematode worm) GenEMBL Z81565 K06G5 14A Subfamily CYP14A1 Caenorhabditis elegans (nematode worm) GenEMBL Z50742 (20856bp) Wilson,R., Ainscough,R., Anderson,K., Baynes,C., Berks,M., Bonfield,J., Burton,J., Connell,M., Copsey,T., Cooper,J., Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A., Fulton,L., Gardner,A., Green,P., Hawkins,T., Hillier,L., Jier,M., Johnston,L., Jones,M., Kershaw,J., Kirsten,J., Laister,N., Latreille,P., Lightning,J., Lloyd,C., McMurray,A., Mortimore,B., O'Callaghan,M., Parsons,J., Percy,C., Rifken,L., Roopra,A., Saunders,D., Shownkeen,R., Smaldon,N., Smith,A., Sonnhammer,E., Staden,R., Sulston,J., Thierry-Mieg,J., Thomas,K., Vaudin,M., Vaughan,K., Waterston,R., Watson,A., Weinstock,L., Wilkinson-Sproat,J. and Wohldman,P. 2.2 Mb of contiguous nucleotide sequence from chromosome III of C. elegans. Nature 368 (6466), 32-38 (1994) K09A11.2 CYP14A2 Caenorhabditis elegans (nematode worm) GenEMBL Z50742 (20856bp) see CYP14A1 for reference. K09A11.3 CYP14A3 Caenorhabditis elegans (nematode worm) GenEMBL Z50742 (20856bp) see CYP14A1 for reference. K09A11.4 CYP14A4 Caenorhabditis elegans (nematode worm) GenEMBL Z50742 (20856bp) Z70212(R04D3.1 continuation of Z50742) see CYP14A1 for reference. after K09A11.5 partial sequence at end of cosmid continues on cosmid R04D3 CYP14A5 Caenorhabditis elegans (nematode worm) GenEMBL U64847 F08F3.7 CYP15A1 Diploptera punctata (cockroach) GenEMBL AY509244 Helvig,C., Koener,J.F., Unnithan,G.C. and Feyereisen,R. CYP15A1, the cytochrome P450 that catalyzes epoxidation of methyl farnesoate to juvenile hormone III in cockroach corpora allata Unpublished CYP15A1 Tribolium castaneum (red flour beetle) GenEMBL XP_970303 44% to CYP15B1 Anopheles, 48% to CYP15B1 Aedes note: CYP15A1, B1 and C1 are all probably orthologs. Each species seems to have only a single CYP15. MLFFVTLVISLVLLFLILDTIKPRRYPPGPKWLPIVGNFLEFRRRLSEIGYHHLVWKEFSEEYGDVVGLK MGRNLVVAVFGAEAVKEVLTREEFDGRPDGFFFRLRTFGKRLGIVFSDGQFWQKQRKFSMQHLRNFGFGR KEMEEKIEEETKDLIAVFKKQCSEPIWMHTAFDVSVLNVLWAMMAGERFNINDERLRKLLKIVHDAFRLT DMSGGMLNQLPFLRFIAPETCGYNQLVDVLVRMWEFLQETISEHRKTLCSSHARDLIDAFLQKMDIQSDS SFTDDQLMSLCLDLFMAGSETTSNTLGFSVVYMLQFPEVQKKVQDEMDEVVGRNRWPTLQDRIKLKYTEA VLMEIQRRANIPPLGIAHRATRDVSLFDYRIPEGTIVLTSLYSVHMDHKFWKDPLAFRPERFLNKEGNLE VDEKYFAPFGYGKRRCLGESLAKANYFLFFTALLHNFYLEKDCDGPEPQLEGYDGVTISPKPFRAKLIPRTD CYP15A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph15, 49% to 15A1 Tribolium Pediculus genome site CYP15A1 Nasonia vitripennis (jewel wasp) See wasp page CYP15A1 Reticulitermes flavipes (termite) No accession number Michael Scharf Submitted to nomenclature committee Feb. 19, 2009 Clone name RfCyp15A1-2 69% to Diploptera punctata CYP15A1 58% to CYP15A1 Tribolium, 48% to 15B1 Anopheles CYP15B1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPj2l3 47% identical to 15A1, though this is probably the ortholog of CYP15A1 CYP15B1 Aedes aegypti (yellow fever mosquito) CYP15B1 Culex pipiens quinquefasciatus (southern house mosquito) XM_001864994 locus_tag="CpipJ_CPIJ014944 72% to CYP15B1 Aedes aegypti MRKPAFFPPGPAWFPLIGCGYRVFKLVNVHKFYHLMWDALARRY GPVVGLKLGRDRVVIVSGLAAIREFYSKEAFNGRPNGFFFRIRSFNKRLGLVFNDGAD WDVQRRFSVKTLKSLGMGRTGMVHSLEKETEELIHHLRKLSRGGRTIQMHSAFDIAVL NVIWTLLAGNRFKLDDKRLAWICDTIHNSFRVIDMSGGVLNQCPPIRHICPNKSGFKP LVNLLNPLLNFINDTISNIKANPECPERHNCLISSYLEELQIDNPHPSFSNEQLLSLC LDLFQAGSETTSNTLGFGVAHVMHHPNVIAKIHQELDNVVGRYRLPTLDDRPSLPYLE ATICEIQRIANVAPLAIAHRTTSPVQLGQYVIPRDTIAMVSLYSIHMDRQYWGDPEVF RPERFLNESEDKLIQHEYFLPFGAGKRRCLGEALAKSNLFLFFAAFMHAFIVEPANPG QLPPLEGFDGITLSPQPYQVRLKERLI CYP15B1 Culex pipiens CPIJ014944 CYP15C1 Bombyx mori (silkworm) BAAB01071346.1 BAAB01157546.1 BAAB01036905.1 BAAB01022841.1 BAAB01157214.1BAAB01068330.1 40% to CYP15 note: CYP15A1, B1 and C1 are all probably orthologs. Each species seems to only have a single CYP15. (not found in Drosophila) See silkworm page for sequence CYP15D1X Daphnia pulex (water flea) Confidential renamed CYP370A1 CYP15D2X Daphnia pulex (water flea) Confidential renamed CYP370A2 CYP15D3PX Daphnia pulex (water flea) Confidential renamed CYP370A3P CYP15D4X Daphnia pulex (water flea) Confidential renamed CYP370A4 CYP15D5X Daphnia pulex (water flea) Confidential renamed CYP370A5 CYP15D6X Daphnia pulex (water flea) Confidential renamed CYP370A6 CYP15D7X Daphnia pulex (water flea) Confidential renamed CYP370A7 CYP15D8X Daphnia pulex (water flea) Confidential renamed CYP370A8 CYP15D9X Daphnia pulex (water flea) Confidential renamed CYP370A9 CYP15D10X Daphnia pulex (water flea) Confidential renamed CYP370A10 CYP15D11X Daphnia pulex (water flea) Confidential renamed CYP370A11 CYP15D12X Daphnia pulex (water flea) Confidential renamed CYP370A12 CYP15D13X Daphnia pulex (water flea) Confidential renamed CYP370A13 CYP15E1X Daphnia pulex (water flea) Confidential renamed CYP370B1 CYP15E2X Daphnia pulex (water flea) Confidential renamed CYP370B2 CYP15F1 Reticulitermes flavipes (termite) No accession number Michael Scharf Submitted to nomenclature committee Feb. 19, 2009 Clone name RfCyp15A1-1 43% to CYP15A1 Tribolium, 38% TO CYP15B1 Anopheles 16A Subfamily CYP16A1X Caenorhabditis elegans (nematode worm) GenEMBL Z54269 number retired This sequence is now CYP13B1 17A Subfamily CYP17 human GenEMBL M14564 Chung,B.C., Picado-Leonard,J., Haniu,M., Bienkowski,M., Hall,P.F., Shively,J.E. and Miller,W.L. (1987) Cytochrome P450c17 (steroid 17 alpha-hydroxylase/17,20 lyase): cloning of human adrenal and testis cDNAs indicates the same gene is expressed in both tissues Proc. Natl. Acad. Sci. U.S.A. 84, 407-411. First human CYP17A1 cDNA CYP17 human GenEMBL M19489 Picado-Leonard J, Miller WL. (1987) Cloning and sequence of the human gene for P450c17 (steroid 17 alpha-hydroxylase/17,20 lyase): similarity with the gene for P450c21. DNA. 6. 439-448. First human CYP17A1 gene CYP17 human GenEMBL NM_000102 MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQMATL DIASNNRKGIAFADSGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH NGQSIDISFPVFVAVTNVISLICFNTSYKNGDPELNVIQNYNEGIIDNLSKDSLVDLV PWLKIFPNKTLEKLKSHVKIRNDLLNKILENYKEKFRSDSITNMLDTLMQAKMNSDNG NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWTLAFLLHNPQVKKKLYEEIDQ NVGFSRTPTISDRNRLLLLEATIREVLRLRPVAPMLIPHKANVDSSIGEFAVDKGTEV IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSVSYLPFGAGPRSCIGEILARQ ELFLIMAWLLQRFDLEVPDDGQLPSLEGIPKVVFLIDSFKVKIKVRQAWREAQAEGST CYP17 human EST GenEMBL Z19875 (235bp) UK-HGMP (United Kingdom human genome mapping project) covers amino acids 270-348 when translating the complementary strand. The fragment goes through at least 6 frame shifts. sequence ID AAAAWEO CYP17 human EST GenEMBL Z20209 (248bp) UK-HGMP (United Kingdom human genome mapping project) covers amino acids 265-349 when translating the complementary strand. The fragment goes through at least 8 frame shifts. sequence ID AAABPSZ CYP17 human GenEMBL S85459 (556bp) Biason,A., Mantero,F., Scaroni,C., Simpson,E.R. and Waterman,M.R. Deletion within the CYP17 gene together with insertion of foreign DNA is the cause of combined complete 17 alpha-hydroxylase/17,20-lyase deficiency in an Italian patient Mol. Endocrinol. 5, 2037-2045 (1991) CYP17A1 Macaca fasicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP17A1 94% to human, missing first 9 aa CYP17 rat PIR A31359 (507 amino acids) Namiki, M., Kitamura, M., Buczko, E. and Dufau, M.L. Rat testis P-450-17-alpha cDNA: the deduced amino acid sequence, expression and secondary structural configuration. Biochem. Biophys. Res. Commun. 157, 705-712 (1988) CYP17 rat PIR D41425 (16 amino acids) Imaoka, S., Kamataki, T. and Funae, Y. Purification and characterization of six cytochromes P-450 from hepatic microsomes of immature female rats. J. Biochem. 102, 843-851 (1987) CYP17 rat GenEMBL S50146 Z11902 (3345bp) PIR S20655 (97 amino acids) Nason,T.F., Han,X.G. and Hall,P.F. Cyclic AMP regulates expression of the rat gene for steroid 17 alpha-hydroxylase/C17-20 lyase P-450 (CYP17) in rat Leydig cells. Biochim. Biophys Acta 1171, 73-80 (1992) Note: have not been able to download S50146 from GCG or NCBI. CYP17 rat GenEMBL X69816 (7556bp) Givens,C.R., Zhang,P., Bair,S.R. and Mellon,S.H. Transcriptional regulation of rat cytochrome P450c17 expression in mouse Leydig MA-10 and adrenal Y-1 cells: identification of a single protein that mediates both basal and cAMP-induced activities. DNA Cell Biol. 13, 1087-1098 (1994) CYP17 rat PIR S24316 (97 amino acids) Zhang, P., Nason, T.F., Han, X.G. and Hall, P.F. Gene for 17-alpha-hydroxylase/C(17-20) lyase P-450: complete nucleotide sequence of the porcine gene and 5' upstream sequence of the rat gene. Biochim. Biophys. Acta 1131, 345-348 (1992) Cyp17 mouse genEMBL M64863 AC079419.1 Mm.1262 MWELVGLLLLILAYFFWPKSKTPNAKFPRSLPFLPLVGSLPFLP RRGHMHANFFKLQEKYGPIYSLRLGTTTAVIVGHYQLAREVLVKKGKEFSGRPQMVTL GLLSDQGKGVAFADSSSSWQLHRKLVFSTFSLFRDDQKLEKMICQEANSLCDLILTYD GESRDLSTLIFKSVINIICTICFNISFENKDPILTTIQTFTEGIVDVLGHSDLVDIFP WLKIFPNKNLEMIKEHTKIREKTLVEMFEKCKEKFNSESLSSLTDILIQAKMNAENNN TGEGQDPSVFSDKHILVTVGDIFGAGIETTSSVLNWILAFLVHNPEVKRKIQKEIDQY VGFSRTPSFNDRTHLLMLEATIREVLRIRPVAPLLIPHKANIDSSIGEFAIPKDTHVI INLWALHHDKNEWDQPDRFMPERFLDPTGSHLITPTPSYLPFGAGPRSCIGEALARQE LFIFMALLLQRFDFDVSDDKQLPCLVGDPKVVFLIDPFKVKITVRQAWKDAQVEVST CYP17 hamster no accession number Cloutier,M., Fleury,A., Courtemanche,J., Ducharme,L. Mason,J.I. and Lehoux,J.G. Cloning and expression of hamster adrenal cytochrome P450c17 cDNA. Ann. N.Y. Acad. Sci. 774, 294-296 (1995) CYP17 Cavia (guinea pig) GenEMBL S75277(1732bp) Tremblay,Y., Fleury,A., Beaudoin,C., Vallee,M. and Belanger,A. Molecular cloning and expression of guinea pig cytochrome P450c17 cDNA (steroid 17 alpha-hydroxylase/17,20 lyase): tissue distribution, regulation, and substrate specificity of the expressed enzyme. DNA Cell Biol. 13 (12), 1199-1212 (1994) CYP17 Cavia porcellus (guinea pig) PIR S52756 (508 amino acids) Huang,Y., Voigt,J.M. and Colby,H.D. unpublished CYP17 pig GenEMBL Z11854 to Z11856 GenEMBL S40341 (1858bp) PIR S24233 (501 amino acids) PIR S30074 (501 amino acids) Zhang,P. Nason,T.F., Han,X.G. and Hall,P.F. Gene for 17 alpha-hydroxylasae/C(17-20) lyase P-450: complete nucleotide sequence of the porcine gene and 5' upstream sequence of the rat gene. Biochim. Biophys. Acta 1131, 345-348 (1992) CYP17 Sus scrofa (pig) GenEMBL U41519 to U41525 Conley,A.J., Graham-Lorence,S.E., Kagimoto,M., Lorence,M.C., Murry,B.A., Oka,K., Sanders,D. and Mason,J.I. Nucleotide sequence of a cDNA encoding porcine testis 17 alpha-hydroxylase cytochrome P-450. Biochim. Biophys. Acta 1130, 75-77 (1992) MWVLLVFFLLTLTYLFWPKTKGSGAKYPRSLPVLPVVGSLPFLP RRGHQHMNFFKLQDKYGPIFSFRLGSKTTVVIGDHQLAKEVLLKKGKEFSGRPRVMTL DILSDNQKGIAFADHGTSWQLHRKLALSTFSLFKGGNLKLENIINQEIKVLCDFLATR NGESIDLAQPLSLAMTNIVSFICFNFSFKKGDPALQAIVNFNDGILDAVGKEILYDMF PGIRILPSQTLENMKQCVRMRNELLREILENRKENYSRNSITNLLDIMIQAKTNAESN TGGPDHNLKLLSDRHMLATVADIFGAGVETSASVVKWIVAFLLHYPLLRKKIQDAIDQ NIGFNRAPSISDRNQLVLLEATIREVLRFRPVSPTLIPHRAIIDSSIGEFTIDKDTDV VVNLWALHHNEKEWHRPDLFMPERFLDPTGTQLISPSLSYLPFGAGPRSCVGEMLARQ ELFLFTAGLLQRFDLELPDDGQLPCLVGNPSLVLQIDPFKVKIKERQAWKEAHTEGSTS CYP17 bovine GenEMBL M64646 (1725bp) Zuber,M.X., John,M.E., Okamura,T., Simpson,E.R. and Waterman,M.R. Bovine adrenocortical cytochrome P-450-17-alpha: Regulation of gene expression by ACTH and elucidation of primary sequence J. Biol. Chem. 261, 2475-2482 (1986) CYP17A1 Bos taurus (cow) See cattle page for details MWLLLAVFLLTLAYLFWPKTKHS 116 GAKYPRSLPSLPLVGSLPFLPRRGQQHKNFFKLQEKYGPIYSFRLGSKTTVMIGHHQLAR 295 296 EVLLKKGKEFSGRPKVATLDILSDNQKGIAFADHGAHWQLHRKLALNAFALFKDG 460 461 NLKLEKIINQEANVLCDFLATQHGEAIDLSEPLSLAVTNIISFICFNFSFKNEDP 625 626 ALKAIQNVNDGILEVLSKEVLLDIFPVLKIFPSKAMEKMKGCVQTRNELLNEILEKCQEN 805 806 FSSDSITNLLHILIQAKVNADNNNAGPDQDSKLLSNRHMLATIGDIFGAGVETTTSVIK 982 983 WIVAYLLHHPSLKKRIQDDIDQIIGFNRTPTISDRNRLVLLEATIREVLRIRPVAPTLIP 1162 1163 HKAVIDSSIGDLTIDKGTDVVVNLWALHHSEKEWQHPDLFMPERFLDPTGTQLISPSL 1336 1337 SYLPFGAGPRSCVGEMLARQELFLFMSRLLQRFNLEIP 1450 DDGKLPSLEGHASLVLQIKPFKVKIEVRQAWKEAQAEGSTP* CYP17 Ovis aries (sheep) GenEMBL L40335 (1728bp) Murry,B.A., Swart,P. and Mason,J.I. Cloning and expression of ovine cytochrome P-450 17-alpha hydroxylase/c17-20 lyase. Unpublished (1995) CYP17 Equus caballus (horse) GenEMBL D30688(1906bp) D13818 Hasegawa,T., Mukoyama,H., Yoshida,S. and Takahashi,M. Molecular cloning and nucleotide sequence of equine testicular cytochrome P-450 steroid 17alpha-hydroxylase/C17,20-lyase messenger ribonucleic acid. Biol. Reprod. Mono. 1, 615-622 (1995) CYP17 Equus caballus (horse) GenEMBL D88184(6217bp) Hasegawa,T. Exon/Intron structure of Equine P450c17. unpublished (1996) CYP17A1 Xenopus tropicalis See Xenopus pages for seq CYP17A1 Danio rerio (zebrafish) See xebrafish pages for seq CYP17 Oncorhynchus mykiss (rainbow trout) GenEMBL S50356 Sakai,N., Tanaka,M., Adachi,S., Miller,W.L. and Nagahama,Y. Rainbow trout cytochrome P-450c17 (17 alpha-hydroxylase/17-20 lyase) cDNA cloning, enzymatic properties and temporal pattern of ovarian P-450c17 mRNA expression during oogenesis. FEBS Lett. 301, 60-64 (1992) Identical to X65800 CYP17 Oncorhynchus mykiss (rainbow trout) GenEMBL X65800 (2287bp) Swiss P30437 (514 amino acids) Sakai,N., Tanaka,M., Adachi,S., Miller,W.L. and Nagahama,Y. Rainbow trout cytochrome P-450c17 (17 alpha-hydroxylase/17,20-lyase). cDNA cloning, enzymatic properties and temporal pattern of ovarian P-450c17 mRNA expression during oogenesis FEBS Lett. 301, 60-64 (1992) Identical to S50356 CYP17 Oryzias latipes (medaka) GenEMBL D87121(2421bp) Kobayashi,D., Matsuyama,M., Tanaka,M., Fukada,S. and Nagahama,Y. Structural analysis of medaka P-450c17 and expression in the ovarian follicle. unpublished (1996) CYP17 Oryzias latipes (medaka) GenEMBL D87122(2302bp) Kobayashi,D., Tanaka,M., Fukada,S. and Nagahama,Y. Presence of a Novel Cytochrome P-450c17 Transcripts in Medaka Gonads unpublished (1996) note: this sequence is missing exon 6 otherwise identical to D87121 CYP17 Squalus acanthias (dogfish) GenEMBL S77384 (1964bp) Trant,J. Isolation and characterization of the cDNA encoding the spiny dogfish shark (Squalus acanthias) form of cytochrome P450c17 J. Exp. Zool. 272, 25-33 (1995) CYP17 Ictalurus punctatus (channel catfish) GenEMBL AF063837 Trant,J.M., Berard,C., Byrne,B.J. and Wunder,J. Isolation and heterologous expression of the cDNA encoding the cytochrome P450 17-hydroxylase from the channel catfish (Ictalurus punctatus) Unpublished (1998) CYP17A1 Fugu rubripes (pufferfish) No accession number Scaffold_4175 80% to Oryzias latipes 17, 79% to trout 17 73% to catfish 61% to dogfish 48% to other CYP17 from Fugu MDWVLFVYAFSAVNLALLALHLKFRTPASGPRGPPRLPALPLIGSLLSLRSPHPPHVLFKE LQGKYGQTYSLMMGSHRVIIVNHHAHAKEVLLKKGKIFAGRPRS 11667 (0) 11572 VTTDVLSRDGKDIAFGDYSATWRFHRKIVHGALCMFGEGSASIEKI 11435 (1) 10871 ICAEAASLCSILSEAWTAGLALDLSPELTRAVTNVICSLCFSSSYRRGDAEFEAMLHYSQ 10692 10691 GIVDTVAKDSLVDIFPCLQ 10635 (0) IFPNADLRLLKRCVSVRDKLLQKEYDKHK (0) AAYSDHVQRDLLDALLRAKCSAENNNTTGINAESVGLTDDHLLMT VGDIFGAGVETTTTVMKWAITYLIHHPQ 9765 (0) 9637 IQSRIQEELDSRVGMDRSPQLSDRGSLPYLEATIREVLRIRPVAPLFIPHVALSDT 9473 (2) 9369 SIGDFAVKKGTRVVINLWSLHHDEKEWENPERFDP 9265 (1) GRFLNSEGTGLVIPSSSYLPFGAGVRVCLGEALAK 8558 8557 MELFLFLSWILQRFTLTVPSGHSLPSLEGKFGVVLQPTKYKVNATPRPGWEGKCKACWN* 8378 CYP17A2 Danio rerio (zebrafish) See xebrafish pages for seq CYP17A2 Fugu rubripes (pufferfish) No accession number Scaffold_8086 50% to Oryzias latipes 17, 48% to trout 17 50% to catfish 49% to dogfish 48% to other CYP17 from Fugu 1198 MVTVGSFLIFRRPVRGSEPGSEAGPPRVKVPCISWVPVLGSLPWLRGGRPLHLIFTQLSYR 1380 (2) 1600 YGPLFALYLGPHLTVVVNNHQHAREVLLLRGKDFAGRPRM 1719 (0) 1797 VTTDLLTRGGKDIAFSDYCPLWKSHRRLVQNSFTLFGEGTSRLQDM 1934 (1) 2061 VLAAVDSLCEELLSMEGRGFDPAPAVTRAVTNVVCMLVFSATYRHGDSELQEVLRYNDGI 2240 2241 VQTIAGGGLVDIYPWMK 2291 (0) 2372 VFPNKTLSKLKACIAVRDRLLTHKLEEHK 2458 (0) 2622 ATLTDNQPRDLLDALLMGQVGRGRRKGSGRVEEDIITEDHVLMTAAEAFGAGVETTSTTLLWILAYLLHHPQ 2837 (0) 2925 VQERVQKELDDHVGSERPVRVSDRARLTYLDCVINEGMRIRPVSPVLIPHTAMTDSR 3074 (2) 3870 IGGHHISRGTRVLVNMWSIHHDSAHWDKPDLFNP 3971 (1) 4519 DRFRDHQGQRVTPSCFLPFGAGPRVCVGESLARLELFLFLSSLLQRMSFRLPNGA 4692 4693 SPPDLQGRMGVVLQPVPYKVVVTPRVG* 4776 CYP17 fragment a Fugu rubripes (pufferfish) No accession number Fc:c028I22x2 LPC.10549.x2 Length = 1007 61% to 17A1 Fugu GAPRATTDNHHAHAKEARPKKGKKAAGRPRK exon 1 ATTDGASRDGTDTAEGDHSATRRDQR exon 2 CYP17 fragment b Fugu rubripes (pufferfish) No accession number Fc:c028I22x1 LPC.10549.x1 Length = 894 78% to Fc:c028I22x2 65% to 17A1 Fugu KEQQAEHGQTTSRMMGSHRGNTDNQHAHAKEARQKKGKKGAGRPRK exon 1 ATTDGRSRDGTDIAEGDHSATWRHHRKIVQRARRRNGEGSAPSEKI exon 2 C-helix 18A Subfamily Cyp18a1 Drosophila melanogaster GenEMBL S66112 (63bp) Hurban,P. and Thummel,C. Isolation and characterization of fifteen ecdysone-inducible drosophila genes reveal unexpected complexities in ecdysone regulation. Molec. Cell Biol. 13, 7101-7111 (1993) Note: very short ecdysone inducible fragment in the heme binding region about 2/3 of amino acids are identical to 2D sequences. called Eig 17-1 dimethylnitrosamine demethylase Cyp18a1 Drosophila melanogaster GenEMBL U44753(2539bp) Bassett,M.H, Waterman,M.R., McCarthy,J.L. and Sliter,T.J. Cloning and characterization of CYP18 in Drosophila melanogaster: identification of an insect member of a new cytochrome P450 family. unpublished (1996) complete sequence from the Eig 17-1 fragment Cyp18a1 Drosophila melanogaster GenEMBL AC012164 114600-117669 also AC015216 Cyp18a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP18A1 Spodoptera littoralis (cotton leafworm) No accession number Lyndsay Davies Submitted to nomenclature committee 2/22/02 61% identical to CYP18 from Drosophila since there is only one CYP18 seq in Drosophila this new sequence will be called CYP18 without a subfamily or number like CYP18A1. If more CYP18s are found in a single species this may have to change. CYP18A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroup13.1a 60% to 18A1 Drosophila melanogaster. CYP18A1 Bombyx mori (silkworm) AU005208 EST BAAB01190855.1 BAAB01187772.1 60% to CYP18A1 Drosophila See silkworm page for sequence CYP18A1 Tribolium castaneum (red flour beetle) GenEMBL XP_968393 58% to 18A1 Drosophila melanogaster MFVYSGLVLWNFLAEELSTKVLAVFLMVLFLVRLVQMLVQMLKEASALPPGPWGLPILGSLPFLKGDLHL HFRDLTHKYGSLISTRLGSQLIVVLSDYKMIRDAFRKEEFTGRPITEFTTLLDGYGVINTAGKLWKDQRR FLHDGLRHFGMSYIGSRKTQMENRIMREVEEFLSVLTARKDTPIDLNPVLAVSLSNVICDILMSVRFSHN DERFKRFMFLIDEGFKLFSSLEASFFIPILKYLPGQRQTREKIAKNRAEMAQFLQETIEEHRKSFDPSHL RDLLDTYLYEIQKADEEGTGDHLFEGKDHDRQMQQIMGDLFSAGMETIKSSLQWAVLFMLHHPEVMKAVQ EELDQVVGRRRLPKLEDLPYLPVTESTMLEVLRISSIVPMGTTHAPTRDLKLNGFHLPRHAQIVPLLHSV HMDPSLWHEPERFNPSRFINAEGKVVKPEYFLPFGVGRRMCLGEILARMEIFSFFSSLLHSFDICVPTGE TLPSLKGVAGVTISPNAFRVCLKPRPMEWDSMGTIRPAGSH CYP18A1 Aedes aegypti (yellow fever mosquito) CYP18A1 Culex pipiens CPIJ001038 CYP18A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph18, 56% to 18A1 Apis Pediculus genome site CYP18A1 Nasonia vitripennis (jewel wasp) See wasp page CYP18A1 Daphnia pulex (water flea) Confidential CYP18B1 Bombyx mori (silkworm) BAAB01081335.1 BAAB01007952.1 BAAB01142048.1 BAAB01138082.1 BAAB01045663.1 39% to CYP18A1 Bombyx See silkworm page for sequence CYP18B2 Manduca sexta No accession number Yannick Pauchet Submitted to nomenclature committee April 7, 2009 confidential 19A Subfamily CYP19 human GenEMBL S52034 (142bp) S52789 (106bp) S52793 (149bp) S52794 (125bp) Harada,N. A unique aromatase (P-450AROM) mRNA formed by alternative use of tissue-specific exons 1 in human skin fibroblasts. Biochem. Biophys. Res. Commun. 189, 1001-1007 (1992) CYP19 human GenEMBL D14473 (295bp) S59092 S59095 S59171 Toda,K and Shizuta,Y. Molecular cloning of a cDNA showing alternative splicing of the 5'-untranslated sequence of mRNA for human aromatase P-450. Eur. J. Biochem. 213, 383-389 (1993) CYP19 human GenEMBL D13391 (2238bp) Katsumi,T. and Shizuta,Y. Identification and characterization of cis-acting regulatory elements for the expression of the human aromatase cytochrome P-450 gene. J. Biol. Chem. 269, 8099-8107 (1994) CYP19 human GenEMBL D21240 (794bp) D21241 (3231bp) Harada,N., Utsumi,T. and Takagi,Y. Tissue-specific expression of the human aromatase cytochrome P-450 gene by alternative use of multiple exons 1 and promoters, and switching of tissue-specific exons 1 in carcinogenesis. Proc. Natl. Acad. Sci. U.S.A. 90 (23), 11312-11316 (1993) CYP19 human GenEMBL X55983 (669bp) Toda,K., Miyahara,K., Kawamoto,T., Ikeda,H., Sagara,Y. and Shizuta,Y. Characterization of a cis-acting regulatory element involved in human- aromatase P-450 gene expression. Eur, J. Biochem. 205, 303-309 (1992) exon 1 CYP19 human GenEMBL S71536 (792bp) Toda,K., Simpson,E.R., Mendelson,C.R., Shizuta,Y. and Kilgore,M.W. Expression of the gene encoding aromatase cytochrome P450 (CYP19) in fetal tissues Mol. Endocrinol. 8, 210-217 (1994) CYP19 human GenEMBL M32245 (840bp) Harada,N., Yamada,K., Saito,K., Kibe,N., Dohmae,S. and Takagi,Y. Structural characterization of the human estrogen synthetase (aromatase) gene. Biochem. Biophys. Res. Commun. 166, 365-372 (1990) CYP19 human GenEMBL D29757 (875bp) PIR PC2041 (45 amino acids) Honda,S.-I., Harada,N. and Takagi,Y. Novel exon 1 of the aromatase gene specific for aromatase transcripts in human brain. Biochem. Biophys. Res. Commun. 198, 1153-1160 (1994) CYP19 human GenEMBL M22246 (2966bp) Harada,N. Cloning of a complete cDNA encoding human aromatase: immunochemical identification and sequence analysis Biochem. Biophys. Res. Commun. 156, 725-732 (1988) CYP19 human GenEMBL L21982 (1166bp) Mahendroo,M.S., Mendelson,C.R. and Simpson,E.R. Tissue-specific and hormonally-controlled alternative promoters regulate aromatase cytochrome P450 gene expression in human adipose tissue J. Biol. Chem. 268, 19463-19470 (1993) CYP19 human GenEMBL S85356 (1384bp) Means,G.D., Kilgore,M.W., Mahendroo,M.S., Mendelson,C.R. and Simpson,E.R. Tissue-specific promoters regulate aromatase cytochrome P450 gene expression in human ovary and fetal tissues. Mol. Endocrinol. 5, 2005-2013 (1991) CYP19 human GenEMBL S96437 (971bp) Kilgore,M.W., Means,G.D., Mendelson,C.R. and Simpson,E.R. Alternative promotion of aromatase P-450 expression in the human placenta. Mol. Cell. Endocrinol. 83, R9-R16 (1992) CYP19 human PIR A40542 (48 amino acids) Mahendroo, M.S., Means, G.D., Mendelson, C.R. and Simpson, E.R. Tissue-specific expression of human P-450-AROM. The promoter responsible for expression in adipose tissue is different from that utilized in placenta. J. Biol. Chem. 266, 11276-11281 (1991) CYP19 Macaca fuscata (Japanese macaque) GenEMBL S79807(369bp) Yamada-Mouri,N., Hirata,S., Hayashi,M. and Kato,J. Analysis of the expression and the first exon of aromatase mRNA in monkey brain. J. Steroid Biochem. Mol. Biol. 55 (1), 17-23 (1995) CYP19 rat GenEMBL S59505 (639bp) Fitzpatrick,S.L. and Richards,J.S. cis-acting elements of the rat aromatase promoter required for adenosine 3',5'-monophosphate induction in ovarian granulosa cells and constitutive expression in R2C Leydig cells. Molec. Endocrinol. 7, 341-354 (1993) Note: promoter CYP19 rat GenEMBL Z11815 (590bp) Hickey,G.J., Krasnow,J.S., Beattie,W.G. and Richards,J.S. Aromatase cytochrome P450 in rat ovarian granulosa cells before and after luteinization: Adenosine 3',5'-monophosphate-dependent and independent regulation. Cloning and sequencing of rat aromatase cDNA and 5' genomic DNA Mol. Endocrinol. 4, 3-12 (1990) CYP19 Oryctolagus cuniculus (rabbit) GenEMBL Z68271(1783bp) Delarue,B., Mittre,H., Feral,C., Benhaim,A. and Leymarie,P. Rapid sequencing of rabbit aromatase cDNA using RACE PCR without cloning. C. R. Acad. Sci. III, Sci. Vie 319, 663-670 (1996) CYP19 Oryctolagus cuniculus (rabbit) GenEMBL Z70302(1455bp) Delarue,B., Mittre,H. and Leymarie,P. Expression des transcrits codant pour l'aromatase de lapin dans differents tissus. unpublished (1996) Cyp19 mouse Swiss P28649 (503 amino acids) GenEMBL D00659 (2420bp) Terashima,M., Toda,K., Kawamoto,T., Kuribayashi,I., Ogawa,Y., Maeda,T. and Shizuta,Y. Isolation of a full-length cDNA encoding mouse aromatase P450 Arch. Biochem. Biophys. 285, 231-237 (1991) MFLEMLNPMQYNVTIMVPETVTVSAMPLLLIMGLLLLIWNCESS SSIPGPGYCLGIGPLISHGRFLWMGIGSACNYYNKMYGEFMRVWISGEETLIISKSSS MFHVMKHSHYISRFGSKRGLQCIGMHENGIIFNNNPSLWRTIRPFFMKALTGPGLVRM VEVCVESIKQHLDRLGEVTDTSGYVDVLTLMRHIMLDTSNMLFLGIPLDESAIVKKIQ GYFNAWQALLIKPNIFFKISWLYRKYERSVKDLKDEIAVLVEKKRHKVSTAEKLEDCM DFATDLIFAERRGDLTKENVNQCILEMLIAAPDTMSVTLYFMLLLVAEYPEVEAAILK EIHTVVGDRDIKIEDIQNLKVVENFINESMRYQPVVDLVMRRALEDDVIDGYPVKKGT NIILNIGRMHRLEYFPKPNEFTLENFEKNVPYRYFQPFGFGPRGCAGKYIAMVMMKVV LVTLLRRFQVKTLQKRCIENIPKKNDLSLHPNEDRHLVEIIFSPRNSDKYLQQ CYP19 pig no accession number Corbin, C.J., Khalil, M.W. and Conley, A.J. Functional ovarian and placental isoforms od porcine aromatase. Mol. Cell. Endocrinol. 113, 29-37 (1995) CYP19 Sus scrofa (pig) GenEMBL L15471 (454bp) Ko,Y., Choi,I., Green,M.L., Simmen,F.A. and Simmen,R.C. Transient expression of the cytochrome P450 aromatase gene in elongating porcine blastocysts is correlated with uterine insulin-like growth factor levels during peri-implantation development. Mol. Reprod. Dev. 37 (1), 1-11 (1994) Note: This is only a fragment of 80 amino acids, including helix K and the EXXR conserved sequence. CYP19 Sus scrofa (pig) GenEMBL U52141(1133bp) Choi,I., Collante,W., Simmen,R.C.M. and Simmen,F.A. Molecular cloning of multiple forms of cytochrome p450 aromatase and their developmental expression in porcine blastocysts, endometrium, and placenta. Unpublished (1997) CYP19 Sus scrofa (pig) GenEMBL U52142(1584bp) Choi,I., Collante,W., Simmen,R.C.M. and Simmen,F.A. Molecular cloning of multiple forms of cytochrome p450 aromatase and their developmental expression in porcine blastocysts, endometrium, and placenta. Unpublished (1997) CYP19 Sus scrofa (pig) GenEMBL U37309 (417bp) Choi,I., Simmen,R.C. and Simmen,F.A. Molecular cloning of cytochrome P450 aromatase complementary deoxyribonucleic acid from periimplantation porcine and equine blastocysts identifies multiple novel 5'-untranslated exons expressed in embryos, endometrium, and placenta. Endocrinology 137 (4), 1457-1467 (1996) CYP19 Sus scrofa (pig) GenEMBL U37311(2470bp) Choi,I., Simmen,R.C. and Simmen,F.A. Molecular cloning of cytochrome P450 aromatase complementary deoxyribonucleic acid from periimplantation porcine and equine blastocysts identifies multiple novel 5'-untranslated exons expressed in embryos, endometrium, and placenta. Endocrinology 137 (4), 1457-1467 (1996) CYP19 Sus scrofa (pig) GenEMBL U57510 Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A. Molecular cloning and structural characterization of porcine cytochrome p450 aromatase chromosomal genes: evidence for the existence of multiple, closely related genes that encode developmental and tissue-specific isoforms of aromatase. unpublished (1997) CYP19 Sus scrofa (pig) GenEMBL U57517(287bp) U57518(287bp) U57519(358bp) Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A. Molecular cloning and structural characterization of porcine cytochrome p450 aromatase chromosomal genes: evidence for the existence of multiple, closely related genes that encode developmental and tissue-specific isoforms of aromatase. unpublished (1997) CYP19 Sus scrofa (pig) GenEMBL U57520(517bp) U57521(495bp) Choi,I., Collante,W., Simmen,R.C.M., Troyer,D. and Simmen,F.A. Molecular cloning and structural characterization of porcine cytochrome p450 aromatase chromosomal genes: evidence for the existence of multiple, closely related genes that encode developmental and tissue-specific isoforms of aromatase. unpublished (1997) CYP19 bovine GenEMBL S66248 (2104bp) Hinshelwood,M.M., Corbin,C.J., Tsang,P.C. and Simpson,E.R. Isolation and characterization of a cDNA insert encoding bovine aromatase cytochrome P450. Endocrinology 133, 1971-1977 (1993) Note two amino acid differences with Z32741 CYP19 bovine GenEMBL M64646 (1725bp) Zuber,M.X., John,M.E., Okamura,T., Simpson,E.R. and Waterman,M.R. Bovine adrenocortical cytochrome P-450-17-alpha: Regulation of gene expression by ACTH and elucidation of primary sequence. J. Biol. Chem. 261, 2475-2482 (1986) CYP19 bovine GenEMBL Z32741 (4226bp) PIR S44210 (503 amino acids) Vanselow,J. and Furbass,R. Aromatase cytochrome P450 gene and pseudogene unpublished (1994) CYP19 bovine GenEMBL Z69241 to Z69250 (genomic sequences) Furbass,R. and Vanselow,J. unpublished (1996) CYP19P bovine GenEMBL Z32813 (1006bp) Vanselow,J. and Furbass,R. Aromatase cytochrome P450 gene and pseudogene unpublished (1994) CYP19A1 Bos taurus (cow) See cattle page for details MLLEVLNPRHYNVTSMVSEVVPIASIAILLLTGFLLLVWNYEDTSSIPGPSYFLGIGPLI SHCRFLWMGIGSACNYYNKMYGEFMRVWVCGEETLIISKSSSMFHVMKHSHYISRFGSKL GLQFIGMHEKGIIFNNNPALWKAVRPFFTKALSGPGLVRMVTICADSITKHLDRLEEVCN DLGYVDVLTLMRRIMLDTSNMLFLGIPLDESAIVVKIQGYFDAWQALLLKPDIFFKISWLCRKYEKSV 927 KDLKDAMEILIEEKRHRISTAEKLEDSIDFATELIFAEKRGELTRENVNQC 1079 1080 ILEMLIAAPDTMSVSVFFMLFLIAKHPQVEEAIIREIQTVVGERDIRIDDMQKLKVVEN 1256 1257 FINESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNLGRMHRLEFFPKPNEFTLE 1430 1431 NFAKNVPYRYFQPFGFGPRACAGKYITMVMMKVVLVTLLRR 1553 FHVQTLQGRCVEKMQKKNDLSLHPDETRDRLEMIFTPRNSDKCLER* CYP19 Equus caballus (horse) GenEMBL U37313 (458bp) Choi,I., Simmen,R.C. and Simmen,F.A. Molecular cloning of cytochrome P450 aromatase complementary deoxyribonucleic acid from periimplantation porcine and equine blastocysts identifies multiple novel 5'-untranslated exons expressed in embryos, endometrium, and placenta. Endocrinology 137 (4), 1457-1467 (1996) CYP19 chicken (three different strains of chicken) GenEMBL M73277 to M73285, M73286 to M73294 M73295 to M73303 PIR A41063 (495 amino acids) Matsumine,H., Herbst,M., Ou,S.-H.I., Wilson,J.D. and McPhaul,M.J. Aromatase mRNA in the extragonadal tissues of chickens with the henny-feathering trait is derived from a distinctive promoter structure that contains a segment of a retroviral long terminal repeat. J. Biol. Chem. 266, 19900-19907 (1991) CYP19 Coturnix coturnix japonica (Japanese quail) GenEMBL S46949 (692bp) PIR A48977(230 amino acids) Harada,N., Yamada,K., Foidart,A. and Balthazart,J. Regulation of aromatase cytochrome P-450 (estrogen synthetase) transcripts in the quail brain by testosterone. Brain Res. Mol. Brain Res. 15, 19-26 (1992) Note: only three amino acid differences with chicken. CYP19 Coturnix coturnix (quail) GenEMBL D50336(4351bp) Kudo,T., Yamamoto,H., Sato,S. and Sutou,S. Comparison of 5' upstream regions of chicken and quail aromatase gene. Unpublished (1995) CYP19 Poephila guttata (zebra finch) GenEMBL S75898(3188bp) Shen,P., Campagnoni,C.W., Kampf,K., Schlinger,B.A., Arnold,A.P. and Campagnoni,A.T. Isolation and characterization of a zebra finch aromatase cDNA: in situ hybridization reveals high aromatase expression in brain Brain Res. Mol. Brain Res. 24 (1-4), 227-237 (1994) CYP19A1 Xenopus tropicalis See Xenopus pages for seq CYP19 Ictalurus punctatus (channel catfish) GenEMBL S75715(2102bp) Trant,J. Isolation and characterization of the cDNA encoding the channel catfish (Ictalurus punctatus) form of cytochrome P450arom Gen. Comp. Endocrinol. 95 (2), 155-168 (1994) CYP19 Onchorynchus mykiss (rainbow trout) Tanaka, M., Telecky, T.M., Fukada, S., Adachi,S., Chen, S. and Nagahama, Y. Cloning and sequence analysis of the cDNA encoding P-450 aromatase (P450arom) from a rainbow trout (Onchorynchus mykiss) ovary, relationship between the amount of P450arom mRNA and the production of oestradiol-17-beta in the ovary. J. Mol. Endocrin. 8, 53-61 (1992) CYP19 Carassius auratus (goldfish) GenEMBL U18974(2939bp) Gelinas,D.M., Pitoc,G.A. and Callard,G.V. Isolation of goldfish brain aromatase cDNA and analysis of expression during the reproductive cycle and after steroid treatment in vivo. unpublished (1996) CYP19 Oryzias latipes (medaka) GenEMBL D82968(1851bp) Tanaka,M., Fukada,S., Matsuyama,M. and Nagahama,Y. Structure and promoter analysis of the cytochrome P-450 aromatase gene of the teleost fish, medaka (Oryzias latipes) J. Biochem. 117 (4), 719-725 (1995) CYP19 Tilapia nilotica (Cichlid fish) GenEMBL U72071(1804bp) Chang,X.T., Kobayashi,T., Nakamura,M., Kajura,H. and Nagahama,Y. Isolation and characterization of cDNA encoding the tilapia (Oreochromis niloticus) cytochrome P450 aromatase (P450arom): Changes in P450arom mRNA, protein and enzyme activity in ovarian follicles during oogenesis. J. Mol. Endocrinol. (1996) In press CYP19 Haplochromis burtoni (cichlid fish) GenEMBL AF114716 White,R.B. Aromatase expression during social change in the cichlid fish, Haplochromis burtoni. Unpublished CYP19 Paralichthys olivaceus (Japanese Flounder) GenEMBL AB017182 Kitano,T., Takamune,K., Kobayashi,T., Nagahama,Y. and Abe,S. Suppression of P450 Aromatase (P450arom) Gene Expression in Sex-Reversed Males Produced by Rearing Genetically Female Larvae at High Water Temperature during a period of Sex Differentiation in Japanese Flounder (Paralichthys olivaceus) Unpublished (1998) CYP19A1 Fugu rubripes (pufferfish) No accession number Scaffold_7098 65% to AF183906 ovary form of CYP19 from Zebrafish 60% to AF183908 brain form of CYP19 from Zebrafish 61% to other Fugu CYP19 this is probably the ovary form 9466 MAAVGLDAEVLVSVSPNATEAESPGSSAGTRALIILTCLLLLVWSHTEKKSVP 9308 (1) 9242 SLLGPSFCLGFGPLLTYVRFIWTGIGTASNYYNKKYGDIVRVWVNGEETLVISR 9081 (2) 8985 ASAVHHVLKSRQYTSRFGSKQGLSCIGMNERGIIFNNNVTEWRKIRGYFTK 8830 (1) 8759 ALTGPAVQNTVEVCNSSTQAHLDRLEDLAQVDVLSLLRCTVVDISNRLFLDIPIN 8595 (1) 8499 EKELLLKIHKYFDTWQTVLIKPDIYFKFGWIHQKHKTAA 8392 (2) 8296 RELQEAIEGLVEQKRRDLEQADKLENINFTAELLFAQ 8186 (0) 8084 NHGELSAENVMQCVLEMVIAAPDTLSVSLFFMLLLLKQNPDVELQLLQEIDAVVGK (0 expected, bad boundary) RQLQNGDLQKLRVLETFINECLRFHPV 7719 7718 VDFTMRRSLSDDVIEGYRVPKGTNIILNTGHMHRTEFFLRPTEFCLQNFEKN 7563 (0) APRRYFQPFGSGPRACVGKHIAMVMMKSILVTLLSQYSVCPHEGLT 7327 7326 LDCLPQTNNLSQQPVEHQEEAQQLSMRFLPRQRGSWQTV* 7207 CYP19A1 Danio rerio (zebrafish) GenEMBL AF183906 Chiang,E.F.L., Yan,Y.L., Guiguen,Y., Postlethwait,J. and Chung,B.C. Two Cyp19 (P450 Aromatase) Genes on Duplicated Zebrafish Chromosomes Are Expressed in Ovary or Brain Mol. Biol. Evol. 18 (4), 542-550 (2001) Called CYP19a expressed in ovary CYP19A2 Fugu rubripes (pufferfish) No accession number Scaffold_4200 65% to both brain and ovary forms of CYP19 from Zebrafish 61% to other Fugu CYP19 this is probably the brain form 2918 MKPKETLNITASGPFTPLPLPLMMMMMMLLLMMMMLFLTWNRPQRQHVP (1) 3451 GPLFLAGLGPLLSYCRFMWTGIGTACNFYNNKYGSLVRVWINGEETLILSR (2) 3674 SSAVYHVLRSAHYTARFGSRAGLECIGMEGQGVIFNSDVQLWRRARVYFSK (1) 3960 ALTGPGLQRTVGVCVTSTAKHLDCLVDMTDASGHVDALNLLRAIVVDISNRLFLRVPLN 4136 (1) EKDLLTKIHNYFETWQAVLIKPDIFFKIGWLFDKHRRAA 4319 (2) QELQDTMAALLKVKRKLVHEAEKLDDVLDFATELILAQ (0) EAGEFSADNVRQCVLEMVIAAPDTLSISLFFMLMLLKQHPDVELRIVEELSTvsrt (0) egeENIDYQRLKVMESFINESMRFHPVVDFTMRKALEDDTIEGIRIRKGTNIILNIGLMHKTE 5039 5040 FFPKPREFSLTNFEQT (0) VPSRFFQPFGCGPRSCVG 5219 5220 KHIAMVMMKAILATLLSRYTVCPRHGCTLTSIRQTNNLSQQPVEDEHSLAMRFIPRTIQSPS* 5408 CYP19A2 Danio rerio (zebrafish) GenEMBL AF183908 Chiang,E.F.L., Yan,Y.L., Guiguen,Y., Postlethwait,J. and Chung,B.C. Two Cyp19 (P450 Aromatase) Genes on Duplicated Zebrafish Chromosomes Are Expressed in Ovary or Brain Mol. Biol. Evol. 18 (4), 542-550 (2001) Called CYP19b expressed in brain CYP19 Trachemys scripta (red-eared slider turtle) no accession number Crews,D. Temperature-dependent sex determination: the interplay of steroid hormones and temperature. Zool. Sci. 13, 1-13 (1996) CYP20 human GenEMBL AC011737.8 chromosome 2 clone RP11-33N4, AC011737.8 chr 2 (missing exons 12,13) AC080075.2 (missing exons 1,7,8) MLDFAIFAVTFLLALVGAVLYLYP (0) ASRQAAGIPGITPTEEK (2) DGNLPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTS (1) DPFETMLKSLLRYQSGGGSVSENHMRKKLYENGVTDSLKSNFALLLK (0) LSEELLDKWLSYPETQHVPLSQHMLGFAMKSVTQMVMGSTFEDDQEVIRFQKNHGT (0) VWSEIGKGFLDGSLDKNMTRKKQYED (1) ALMQLESVLRNIIKERKGRNFSQHIFIDSLVQGNLNDQQ (0) ILEDSMIFSLASCIITAK (1) LCTWAICFLTTSEEVQKKLYEEINQVFGNGPVTPEKIEQLR (2) YCQHVLCETVRTAKLTPVSAQLQDIEGKIDRFIIPRE (0) TLVLYALGVVLQDPNTWPSPHK (2) genomic seq stops here the rest is cDNA FDPDRFDDELVMKTFSSLGFSGTQECPELR (2) intron site based on fish genomic DNA FAYMVTTVLLSVLVKRLHLLSVEGQVIETKYELVTSSREEAWITVSKRY CYP20 mouse GenEMBL AK020848 adult retina cDNA plus ESTs for C-term MLDFAIFAVTFLLALVGAVLYLYPASRQASGIPGLTPTEEKDGN LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTTDVLKQHFNPNKTSDPFET MLKSLLGYQSGGGSAGEDHVRRKLYGDAVTASLHSNFPLLLQLSEELLDKWLSYPETQ HIPLSQHMLGFALKFVTRMVLGSTFEDEQEVIRFQKIHG TVWSEIGKGFLDGSLDKNTTRKKQYQEALMQLESTLKKIIKERKGGNFRQHT FIDSLTQGKLNEQQILEDCVVFSLASCIITAR LCTWTIHFLTTTGEVQKK LCKEIDQVLGEGPITSEKIEQLSYCQQVLFETVRTAKLTPVSARLQDIEGKVGPFVIPKE 360 TLVLYALGVVLQDPSTWPLPHRFDPDRFADEPVMKVFSSLGFSGTWECPELXFAYMVTAV 540 LVSVLLEKLRLLAVDRQVVEMKYELVTSAREEAWITVSKRH* CYP20A1 Bos taurus (cow) See cattle page for details MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGN LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTLDPFET MLKSLLRYQSDSGNVSENHMRKKLYENGVTNCLRSNFALLIKLSEELLDKWLSYPESQ HVPLCQHMLGFAMKSVTQMVMGSTFEDEQEVIRFQKNHGTVWSEIGKGFLDGSLDKST TRKKQYEDALMQLESILKKIIKERKGRNFSQHIFIDSLVQGNLNDQQILEDTMIFSLA SCMITAKLCTWAVCFLTTYEEIQKKLYEEIDQVLGKGPITSEKIEELRYCRQVLCETV RTAKLTPVSARLQDIEGKIDKFIIPRETLVLYALGVVLQDPGTWSSPYKFDPERFDDE SVMKTFSLLGFSGTRECPELRFAYMVTAVLLSVLLRRLHLLSVEGQVIETKYELVTSS KEEAWITVSKRY CYP20A1 Sus scrofa (miniature pig) no accession number Haitao Shang Submitted to nomenclature committee May 23, 2007 partial sequence 92% to human 20A1, full seq. is not known Ortholog of human CYP20A1 CYP20A1 Sus scrofa (pig) DT324969.1, BG894896.1, BI336725.1, BP449309.1 Trace file gnl|ti|861177001 lower case = cow seq. MLDFAIFAV TFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGNLPDIVNSGSLHEFLVNLHERYGPVV SFWFGRRLVVSLGTVDVLKQHINPNKTSDPFETMLKSLLRYQSDSGNMSENHMRKKLYEN GVTNSLQSNFALLLKLSEELLDKWLAYPESQHVPLCQHMLGFAMKSVTQMVMGSTFEDEQ EVIRFQKNHGTVWSEIGKGFLDGSLDK NHMRKKLYENGV TNCLRSNFALLIKLSEELLDKWLSYPESQ HVPLCQHMLGFAMKSVTQMVMGSTFEDEQEVIRFQKNHGTVWSEIGKGFLDGSLDKSTTRK KQYEDALMQLESILKKI ikerkgrnfsqhifidslvqgnlndqqiledtmifsla scmitaklctwavcflttyeeiqkklyeeidqvlgkgpitsekieelrycrqvlcetv RTAKLTPVSARLQDIEGKIDKFIIPRETLVLYALGVVLQDPGTWSSPYK FDPERFDDESVMKTFSLLGFSGPQEC PELGWRYMVTTVLWSVLLRRRHLLPVEGQVIETNYDLVTSS XEEAGITVSKRFRPGILLPWLRKLTI* CYP20 Gallus gallus (chicken) BU454844 603215388F1 CSEQRBN14 Gallus gallus cDNA clone ChEST201o21 5'. BU111630 603125978F1 CSEQCHL13 Gallus gallus cDNA clone ChEST95k19 5' BU356654 603474052F1 CSEQCHN70 Gallus gallus cDNA MLDFAIFAVTFLLILVGAVLYLYP ASRQASGIPGLAPTDDK DGNLPDIIASRSLHEFLVNLHEKYGPLVSFWFGRRLVVSLGSIDLLKQHVNPNRSS DPFEMMLKSFLRYQSSLNGDTGESHLRRKLYESGVSKSLQSNLALIQK LSEELLAKWLSLPEAQHIPLCQHMLGFAMKSVTQTAMGSSFEDDQEVIRFRRHHDA IWSEIGKGFLDGSLDKNATRKKLYED ALKEMESTLRKVITGTPRQIIQQAFIDTLLQGNLSDQQ ILEDTMIFSLAGCIITAN LCTWAVYFLTTSEDVQQNLCKEVDHVLGKGPITHEKIEQLR YCRQVLCETVRTAKLTPIAAQLQELEGRVDQHTVPKE TLVLYALGVMLQDSSSWPSPYK FDPERFSEDSAMTNFSLLGFSGSQECPELR FAYMVATVLLSILVRKLYLHPVKGQVMETKYELVTSPKEEAWITVSKRS* CYP20 Xenopus laevis (African clawed frog) BJ037591.1 NIBB Mochii normalized Xenopus neurula library Xenopus laevis cDNA clone XL040p15 5' BQ725586 BQ725586.1 AGENCOURT_8103453 NICHD XGC Emb2 Xenopus laevis cDNA clone 95% identical to tropicalis seq. 74% to chicken BC044111.1 mRNA, complete cds Length = 2417 175 MLDFAIFAITFLLILVGAVLYLYPSSRQACGIPGLAPTEEKDGNLQDIVNSGS 333 334 LHEFLVNLHERFGPVASFWFGRRLVVSLGSLDLLKQHINPNKTSDPFQTMLKSLLGYQSG 513 514 VIGEAAESHVQKKLYENGITKALHSNFSVIIKLSEELLAKWGTYPQSQHVPLCQHMLGFA 693 694 MKSVTQTAMGSSFEDDQEVIHFRRNHDAIWSEIGKGFLDGSIERSPSRKKLYEDALMEME 873 874 TVLKKTIKERKGKNPGRHVFLDSLLQGNLSDKQVLEDSMIFSLAGCVITANLCTWAIYFL 1053 1054 TTSEEVQDKLYKEVNRVIGKGPITMDKLEQLSYCRQILCETVRTASLTPISARLQELEGR 1233 1234 VDQHIIPKETLVLYALGVVLQDNTAWPLAYRFDPDRFDDETAKQSLSLLGLSGSQECPEL 1413 1414 RFAYMVAMVLLCVLVRKLNLLPVKGQVMETKYELVTSPKEEAWITVSKRS 1563 CYP20 Xenopus tropicalis From JGI blast server http://aluminum.jgi-psf.org/prod/bin/runBlast.pl?db=xenopus1&dump=1 And from Sanger http://www.sanger.ac.uk/Projects/X_tropicalis/blast_server.shtml AL848968.1 Xenopus tropicalis EST, clone TEgg007o12 5' AL870093 AL870093.1 Xenopus tropicalis EST, clone TEgg120l22 5' 60% to Fugu CYP20, 70% to human CYP20, 62% to zebrafish, 30% to ciona CYP20 like seq 72% to chicken MLDFAIFAITFLLILVGAVLYLYP exon 1 SSRQACGIPGLAPTEEK exon 2 DGNLQDIVNSGSLHEFLVNLHERFGPVASFWFGRRLVVSLGSLDLLKQHINPNKTS exon 3 DPFQMMLKSLLGYQSGVIGEAAESHVQKKLFENGIIKALHSNFSVVIK exon 4 LSEDLLAKWLTYPQSQHVPLCQHMLGFAMKSVTQTAMGSSFDDDQEVIHFRRNHDA exon 5 IWSEIGKGFLDGSIERSPNRKKLYED exon 6 ALMEMETVLKKAIKERKVKNPGRHVFVDSLLQGNLSDKQ exon 7 VLEDSMIFSLAGcvitan exon 8 VCTWAIYFLTTSEEVQDKLFKEVTRVIGKGPITMDKLEQLS exon 9 YCRQILCETVRTASLTPISARLQELEGRVDQHIIPKE exon 10 tlvlyalgvvlqdntawplayr exon 11 X. laevis FDPDRFNDETAKQSLTLLGFSGSQECPELR exon 12 FAYMVAMVLLSVLVRKLHLLPVKGQVMETKYELVTSPKEEAWITVSKRS exon 13 CYP20 Fugu rubripes (pufferfish) No accession number Scaffold_486 59% TO CYP20 human MLDFAIFAVTFVIVLVGAVLYLYP (0) SSRRASGIPGLNPTDEK (2) 11654 DGNLQDIVGRGSLHEFLVSLHQEFGPVASFWFGSRPVVSLGSLQQLRQHINPNHST 11487 (1) DSFETMLKSLLGYHSGGGGASTDSIIRKKVYQGAIDTTLKNNFPLVLK (0) LVDELVGKWKSFPEDQHTPLCAHQLVLAMKTITQLALGESFSEDARVIAFRKNHDV (0) IWSEIGKGYMDGSLEKSTSRKGHYEK (1) ALSEMESTLLSVVKERKSQRNKSVFVDSLIQSTLTERQ IMEDCMVFMLAGCAITAN VCIWALHFLSTSEEVQDRLYKEFEEVLGSSPVSLEKIPQLR YCQQVLNETLRTAKLTPIAARLQEVEGKVDQHLIPKE SLVIYALGVILQDSDTWNAPYR FDPDRFEEESVKKSFHLLGFSGSQTCPELR FAYTVATVLLSVLVRQLKLHRLKDTLMEVRSELVSTPRDETWITFNLRN* CYP20 Tetraodon nigroviridis (freshwater pufferfish) >FS_CONTIG_2529_2 Length = 13425 from http://fugu.hgmp.mrc.ac.uk/blast/ >FS_CONTIG_2529_1 Length = 1009 from http://fugu.hgmp.mrc.ac.uk/blast/ MLDFAIFAVTFVVILVGAVLYLYP SSRRASGVPGLNPTDEK XXXXXXXXXXXDIVARGSVQEFLVSLHQEFGPVASFWFGSRPVVSLGSLQQLQQHANPNRSS DSFETMLKSLLGYHSGGGGASTENIIRKKVYQGAIDATLKNNFPLVLK LVDELVGKWTSSPEDQHTPLCAHQLVLAMKSITQLALGESFSQDARVVSFRRNYDA IWSEVGKGFMDGSLERSTSRKGRYEX ALSEMEATLLSVVKDRKSQRKTSVLVDTLLQSTLTDRQ IMEDCMVFTLAGCAITAN VCIWALHFLSSYEDVQDRLHQELEEVLGSGSVSLEKIPQLR YCQQVLNETVRTAKLTPVAAGLQEVEGKVDQHLIPKE TLVIYALGVILQDSHTWDAPCR FHPDRFEEESVRKSFRLLGFSGSQTCPELR VAYTVATVLLSAVVRQLRLHRLEDTLVEVRSELVSTPREETWITFSRRN CYP20 Danio rerio (zebrafish) Assembled CYP20 from zebrafish ESTs BQ259821 faa04d08.y1 zebrafish fin day3 regeneration BM185037 fv16g05.x2 zebrafish adult brain BG985721 2543 NICHD Zebrafish normalized I BM070720 fu98e06.y1 zebrafish adult brain cDNA CA472036 AGENCOURT_10739799 NCI_CGAP_ZKid1 cDNA (kidney?) BI981894 fu52c12.y1 zebrafish adult brain Danio rerio cDNA BM083017 fu28a11.y1 Campbell zebrafish ovary cDNA MLDFAIFAVTFVIILIGAVLYLYPSSRRASGVPGLNPTEEKDGNLQDIVNKGSLHEFLVG LHDEFGSVASFWFGARPVVSLGAVNQLRQHINPNWTTDSFETMLKSLLGYQSGSGVGLTE SMMRKKVYEGAINKTLENNFPLLLQQVEELVDKWASYPKSQHTPLCAHLLGLAMKAVTQL AMGSRFRDDAEVIRFRKNHEAIWSQ IGKGYLDGSLEKSSSRKAHYESALAEMESVLKSVAKQRPGQGSSQSFVNYLLQANLTER QVMEDGMVFTLAGCVITANLCIWAVHFLSVSEAVQDRLYHELVEVLGDEPVSLEKIPQLR YCQQVLNETVRTAKLTPVAARLQEVEGKVDQHIIPKETLVIYALGVVLQDADTWSLPYRF NPDRFAEESVMKSFSLLGFSGSQACPELRFAYTVATVLL STLVRRLRMHRVDGQVVEARYELVTTPKDDTWITVSKRN* CYP20 Danio rerio (zebrafish) >ctg10765 genomic contig CYP20 74% to fugu 9501 MLDFAIFAVTFVIILIGAVLYLYP (0) 9572 SSRRASGVPGLNPTEEK (2?) DGNLQDIVNKGSLPEFLVGLH DEFGSVASFWFGARPVVSLGAVNQLRQHINPNWT (1) 10024 12291 TDSFETMLKSLLGYQSGSGVGLTESMMRKKVYEGAINKTLENNFPLLLQ (0) 12439 12929 QVEELVDKWASYPKSQHTPLCAHFL 13003 frameshift 13003 GLAMKAVTQLAMGSRFRDDAEVIRFRKNHEA (0) 13095 15738 IWSEIGKGYLDGSLEKSSSRKAHYES (1?) 15815 AC boundary istead of AG 15897 ALAEMESVLKSVAKQRPGQGSSQSFVNYLLQANLTERQ (0) 16010 16583 VMEDGMVFTLAGCVITAN (1) 16636 17689 LCIWAVHFLSVSEAVQDRLYHELVEVLGDELVSLEKIPQLR (2) 17811 19293 YCQQVLNETVRTAKLTPVAARLQEVEGKVDQHVIPKE (0) 19403 21269 TLVIYALGVVLQDADTWSLPYR (2) 21334 21425 FNPDRFAEESVMKSFSLLGFSGSQACPELR (2) 21514 FAYTVATVLLSTLVRRLRMHRVDGQVVEARYELVTTPKDDTWITVSKRN* from wz10135.3 CYP20 Oryzias latipes (medaka fish) BJ524824.1 MF01SSB cDNA Oryzias latipes cDNA BJ003683.1 BJ003683 MF01SSA cDNA Oryzias latipes cDNA MLDFAIFAVTFVVILVGAVLYLYPSSRRASGVPGLFPTDEKDGNLQDIVDRGSLHEFLV GLHEQFGPVASFWFGRQPVVSLGSVDPLRQHINPNHTTDSFETMLKSLLGYQAGAGGGAN ESVMRKKLYESAINNALKNSFPAVLKVAEELVDKWSSVPEDQHIPLCAHLLGLALKTV TQLALGERFKDDAEVISFRKNHEAIWSEIGKGYMDGSLEKSSSRKRHYE SA LSDMEATLLAV AKDRKAQRRQTA FVDALLQSGLTERQIMEDCMVFTLAGC VITANLCIWALHFLSTAEDVQEKLCQEVEDLFGSDPVSLDRIPQLKYCQQVLNE TVRTAKLTPVAARLXEVEGKVGQHVIPKETLVIYALG VVLQDADTWSTPYRFDPDRFQDESARKSFCLLGFSGSQTCPELRFAYTVATVLLATLVRRLKLRPLK About 26 aa missing at end CYP20 Salmo salar (Atlantic salmon) GenEMBL CA063128.1 ssalrgb509318 mixed_tissue Salmo salar cDNA. GenEMBL CB516811.1 ssalrgb509318_rev mixed_tissue Salmo salar cDNA. GenEMBL CB513409.1 ssalrgb531212_rev mixed_tissue Salmo salar cDNA. BG935303.1 SL1-0624 Atlantic Salmon liver Salmo salar cDNA clone SL1-0624 MLDFAIFAVTFVIFLVGAVLYLYPSSRSASGIPGLNPTEEKDGNLQDIVNRGSLHEF LASLHGQFGPVASFWFGGRPVVSLGSVDQLRQHINPNRTTDSFETMLKSLLGYQSGTGGG ATEAVMRKKLYESAVNNTLEKNFPMLLKLVEELVGKWQSFPKDQHTPLCAHLLGLAMKAVTQ 18 amino acids missing here RKNHEAIWSEIGKGYLDGSMEKSSIRKEHYESA LAEMETVLMSVAKDRKGQRSQTAFVDTLLQSNLTERQVME DSMVFTLAGCVITANLCIWAVHFLST SEDVQEKLHQELEDVLGSEPVSLDK IPQLRYFQQVLNETVRTAKLTPIAARLQENEGKVD QHIIPKETLVIYALGVVLQDADTWSCPYKFDPDRFTEDSARKSFSLLGFSGNQACPELRF AYTVATVVLSTVVRQLKLYQVKGQVVEARSELVSTPKDDTWITVSRRS* CYP20 Ciona intestinalis SEQUENCE 186, 187 CYP20 ortholog 29% to fugu CYP20 CYS = HIS MLEVHVTITAVAVLAVILLLFLIGLLFYIYPSSN (?) KETTVPGPAVSDIR (2) DGNRSDIDRAGSLHQYLCNLHAEHGDLASFHFDKQLIISIADSKYFAPRSQVFQLP (1) DFLYEFWEPMIGKESFVFSQGVDVFKRRDNIDFKVDKKVKEFYAKIF (0) RLLAIELEEKWRKQPEDQHIPLTQHLKGLATKAVCQMMFGESFADDERVIKLYKAWDK (0) CWNEVMAASKMEPTKSESDEKK (1) SIKELRSLMEKAIEERWKKSKHLDGITLLDGLLASSNSSVSPLSSEQLLCDVITMFTMGVQVTNA (1) VLCWSVYFIAAHDDVQTKLRNKLNEVESLDIMPQSLNDVK (2) YMQSILKESIRCSGVFSYSGRVEDIDITVGDHLLPKG (0) TPIIEALMFPMQSDAFESPDM (2) FNPKNFSEATKSVNPLTYIPFAFGGSRHKPQDS (2) LVYLLVSITVASLFRNLRISLAGKEVDPQSANYESFVALPSTEEVWVTVGGKLDVETVGENN* CYP20 Ciona savignyi sequence for CYP20 ortholog 72% to C. intestinalis seq 186 MLEVHVSITAVAILVLIILLFLIGLLVYIYPVI (2) KETTVPGPDVSDAK (2) DGNKSDIERAGSLHAYLCNLHADFKHEGAVSFHHGKQLVISISNPKFFAPRSQVFQLP (1) SFLYEYWRPLIGEESFLFAKDIDVFRRRDYLDFKVNRNVKEFYAKNF (0) LAVELETKWEKQPEDQHIPLTQHLKGLATKAVCQMMFGDNYADGTKVLELYRAWDK (0) CWNEVMAASKMEPTKSESVEKK (1) SISELRKLMENAIEERWNKSKSLDGITLLDGLLSSQTSDSKDQLTSEQLLCDVITMFTMGVQITTA (1) VLCWSCYFVASNKEVQRKLKNQLNESKSLDIMPETLMDME (2) YMQGILKESVRCSGMFSFAGRVEDIDITVQNHLIPKG (0) TPILEAIMTEMRSEAFSCPDV (2) FNPKNFTKDNNHLNPLTFIPFAFGGSRHKPEES (2) LVYLLAAVTVSSLFRNLQVSLAGKEIEPFAANFESFVAMPSTEEIWVTVSKEKEVENVG* CYP20 Halocynthia roretzi (tunicate, Urochordate) AV385510 51% to CYP20 in Ciona KYMKMVLKESLRSGCVFSWAGRQQDIDINVGGNLITKYTPVIEAIGTAMTDEGNFSDPSE FDPKRFSEDNDEVINPLDYIPFGFGGSRKAPQNSLVFLLASIVASTLIRNLKVCIAGDSS QNQPDPDFSAFVSRPSKEELWVTVSALGKKKAETVG* CYP20 Molgula tectiformis (tailless ascidian, tunicate, Urochordate) CJ366548.1 EST N-term MDVQITLATLGVFIVLFVLFLLGFLFYLYPGSNNQSTIPGPNPSDAKDGNVSDLQ RVGSLQKYLEELHSEYGDIIAFWHQSNQIVSIADLDEFAQTNQVFERPDILFSSWIPLIG KDSFIFEKDVKNFQRLDML CYP20 Branchiostoma floridae (amphioxus) There are 8 CYP20s in amphioxus (see the amphioxus page) CYP20 Saccoglossus kowalevskii (Acorn worm, hemichordate) FF488963, FF488605, FF525072, FF503042, FF480950, FF429742 38% to Paracentrotus lividus, 30% to CYP20 human 33% to sponge CYP38A1 MTALVIIVVVLLVVVVAVFYLGTLFGSKQEQMLARIPGMNPTNSED GNMPDITVAGSLHEFLVKLHTKYGHVATFWYGKTHVISLASAKAFKDTNKLFDRPAVLFNFVKPLIGPD SIQYANEEDGKRRRHDYDQSFSYDAIKNYYSKFYEATDAVVAKIESLPSDEHINITEHMS LLVLKALSHATYGDFFKDDEFSITMLRHYETAMAILNQDISKDTEAGQ RFAQALKTWHDFIRAMIQHRRDNPPNDDDWTFIDVLMAKSSSEEELISDATSYF IAGYHTTAFMMVWTFYYMCENQEVQEKIYQEIIEVIGKDEQVNYVNLESLKYMRCVFD ESMRCSVLAPFAARVNMHADMVVQGYTIPKGTPVVHALGVVLMDDEIWSEPERFIPERFL PXNVSTRHPLSFQPFGFAGKRKCPGYRIAYAEGAVFLATLCRK FKFQLVEGQTIERKYGFVTIPSNDIWVTVTKRD* CYP20A2 Saccoglossus kowalevskii (Acorn worm, hemichordate) FF425228.1 45% to CYP20A1 Saccoglossus N-term region FFQFVKPLFGENSIQFANGNYGKQRRKLHDASFTHQAVKSYYPVFQEKTEIFIRKIMAPP PEEHIPLSEYVSVFVMKSLMSALFGQNVENDEQTVTMLRHYETVMLVLNKKMSANTEKKN TFHIALNAWHEFIRKIILKRRNEPSDE CYP20 Strongylocentrotus purpuratus (purple sea urchin) GenEMBL XM_787803.1, XM_793215.1 AAGJ02125342.1 (middle piece, two exons that were missing) 85% to Paracentrotus lividus 37% to CYP20 Saccoglossus kowalevskii (acorn worm) 35% to sponge CYP38A1 MLDFAIFAVTFIILLVGLLIYIYP (0) TTPQKTTTVPGLEPSDPV (2) KGNLDEIGDAGSLHQFLTKLHAEHGDIASFYFTDQLCVSITSPELFKEHQAVFNRP (1) ALLFKLFEPLITPDSIQYANGGDGRKRRDLTDRCFGFQALQNFIGVFNK (0) ITEQLVKKISALPPGEHISLHGTGLGLAIKGIMLTSFGDYFQDDKSITSFRKDYDF (0) VWGEMEARLDGNMPDEGSNRVKTFEE (1) 1967 ARERMYATIQAVIKFRRSNPPSQDKEVFIDVLLKADYPPGRLQADLLTYVIGGFHTSGN (1) 1791 359 LFAWALFFLAEDDKIQEKLYNHVIDIVGKTGEVSMEDIAEMT (2) 234 YMRQVIDETLRVSILAPYAARYQDFDVVLGGHVIPKG (0) TPVIHALGVSLQSEKYFPNPKT (2) FNPENFSAANVKKRPRDAYHPFGFAGRRVCPGQQ (2) FAYKEVAIFLAVFIRAFKVKVVPGQKPEHVHGLVSHLVNKDGGEVWITVEKR* CYP20 Paracentrotus lividus (common urchin, Deuterostomes) AM536943, AM531351, AM584793, AM534834, AM554260 85% to CYP20 Strongylocentrotus purpuratus 32% to human CYP20 (best match in human) first 11 aa identical 37% to sponge CYP38A1 CYP20 has a highly conserved N-terminal MLDFAIFAVTFVLLLVGLLIYIYPTTPQKSTTVPGLEASDPVKGNLDEIGDAGSMHQFLTKLHED HGDIASFYFTDQLCVSIASPELFKEHQAVFDRPPLLFKLFEPLITPDSIQYANGSDGRKR RDLTDRCFGFQALQNFIGIFNKITEQLVEKISALPPGEHVPLYSTGLGLAIKGIMKTSFG DYFKDDKSINSFRRDYDYVWAEMEARLDGSMPEPES EREKKFDEARERLYDTIRGVIKVRRSNPPSQDKEVFIDVLLKADYPPGKLLADL VSYSVGGFHTSGNLFAWALYFLAQDEEIQEKLYKHVIDIVGKTDEVGMEHIAKMTYMR QVIDETLRVSILAPYAARFQDFDVVLGGHVIPKGTPVVHALGVSLQSEKYFPNVKVFNPE NFSPANVKKRPRDAFQPFGFAGRRVCPGQQYAYKEVSI FLAVFLRAFKIKLVPGQKTEHVHGLVSHLVSKDGGDVWFTVEKR* CYP20 Rhipicephalus appendiculatus (brown ear tick, arthropod, arachnid, ecdysozoa) CD784670.1 35% to CYP20 Danio, 39% to CYP20 human N-term MLDFAIFAVCFVVFLLALVLYLYPSSAKQTTIPGLEPSDKKEG NVGDIVQAGGLQNFLISLHKEHGPIASFWIGTKLVVSIGKADLFKTQSHVFDKPAELFVL YRDVMGAGSIFFANGAEARKRRRLIDEVLTGKSLDKFLGPIEKLCSEVVMHLKDTPDDEH VPVYQYMYALCMKISTRLLFGEYFFDDMEVLKFSRNFELCIKELEE CYP20 Ixodes scapularis (blacklegged tick, arthropod, arachnid, ecdysozoa) EW872941.1 EW945756.1 N and C-term with a gap in the middle MLEFAIFAVCFVVFLLMLVVYLYPGSSKQT TIPGIDPSDPKEGNVGDIVQAGGLKNFLLSLHKDHGPITSFWMGTKLVISIGDAELFKTQ THIFDKPADLFELYRDLVGADSILFANGAEARQRRRRLDDVLTGDRINDFVGPLQKMCAE VVDRWESLPAEEHVPIYQYMYALCMKTCTRLLFGDYFFDDKHVLSSAGPLSCVSVLQRCC ACPRPDSLDQSMR MTWALYFLATH LKLQDFMQLSNLQKTIKETLRTATVEPWTARCQDVDVEIAGHVIPKKTPVIQALGVVLHE EDHWKVPHRFDSSRFNEKFPELAYCPFGFAGMRLCPGKDLSVLHMSVFLVTLCHRL KLRLIQDQVVTPTFALITKTDDEVWLTLERRRP* CYP20 Amblyomma cajennense (tick) EC780049.1 EST N-term MLDFAIFAVCFVVFLVALVVYLYPSSTKQTTIPGLGPSDAKEGNVGDIVQA GGLQNFLVSFYKEHGPIASFWIGTKLVVSIGQGELFKTQSHVFDKPADLFVLYREVMGTG SIFFANGAEARKRRKLIDDVLTGQRLDKFTLPIEKLCSEVVA CYP20 Helobdella robusta (leech, annelid worm) JGI gene model fgenesh4_pg.C_scaffold_12000405 [Helro1:168041] Helro1/scaffold_12:3739808-3743927 28% to CYP20 Saccoglossus kowalevskii, 29% to Strongylocentrotus purpuratus CYP20, 30% to Paracentrotus lividus CYP20 33% to Amphioxus CYP20 MLDYLIFAATVVVFLILAVIYLYPGSKKSIDIPGPEPVDPLEGNYPDIKACGSLHEFLVDLHETYGPICS FWAGPQLCVSVASPELFSEQANSFNRPVELFKAQTLLWGENSPFVLNGAEGRKKHAFLTGIFNEKYASCS LKLEHEILKKQTASWLKWPNEQFFPLYEKISGFVVKFLLSSHFGYDAENSKDVNVIKGNLDNVWQEVGHY MDDTSDADGLRDIRFEQYKKKLKSFISDALILRKTLPTVSDPAKLPIDHVLQLSEEEAVSILLLLFTNAY MKYCSLFTWMGYFLALEPHVQDQVRKELGSIPIEEINLANCNEFKYFKKVFQETLRCANIVSHTARIQNT DIVIQNSKIPAN (0) TPVIQSLGVVSSNPTIFHEPDK (2) FNPDRHASKNTPCSSFAS 3747683 QYFGFAGKRICPASSIVMSQAALFFAHVLECFSLTLAPDQVVSSHYGLFA 3747832 3747833 KPKEEIWLSV 3747862 ENINVRRVS* CYP20 Haementeria depressa (leech, annelid worm) CN807321.1 EST 67% to CYP20 Helobdella robusta NAYLKYSSLLTWMGYFLALEPDVQHKLRKELGLVSVSDLSSVNINKFKYFQNTLKETLRC AKIVPYTARIHNTDIVIKKDKIPANTPIIHALGVVSSDPAIFPEPDEFNPERFDSKNKPN TFACQSFGFAGKRKCPASSTVASHAALFYSHLLDCFELKLAPDQIVSSHHGLNAKPKEEI WLSLEM CYP20 Hirudo medicinalis (medical leech, annelid worm) EY484183.1 EST VEADKLNMQDMXRLDYLQLVTKEVLRCAGIVPFTARCQNTDSQLESHRIPAG (0) TPVLHAIGVVSNDSQIFDNPEE () FNPERFKNVNPSQFAFHPFGFAGQRTCPGSQVAIAQVATFYAHVLRSFELELAPNQ LAAPNHGFYVRPKEEVWLTLKARL* CYP20 Eisenia fetida (earthworm, annelid worm) EH670454.1, EH670439.1 ESTs ANKMKYLNQVLKETLRVARIATFAARVHHSDVVLGGYQVPAEMPILHAIGLSLHDEEVFP HPLKFDPDRFSVENEKHLPPFSFEPFGFSGKRKCPGYNFSYAEATVYLSLLIRSFKIELA FDQVAESKFGFVTKIKDDLWVIARPM* CYP20A1 Capitella sp. I (polychaete worm, annelid) JGI gene model e_gw1.28.81.1 [Capca1:145552] + e_gw1.28.94.1 [Capca1:145544] Capca1/scaffold_28:376236-381170 And Capca1/scaffold_28:374918-376952 upstream region with N-term 45% to Amphioxus CYP20, 41% to Strongylocentrotus purpuratus CYP20 34% to Helobdella robusta CYP20 MIQFLLFAVFFLVVLVVTVIYLYSGSGSRKSTIPGPEPLDDLEGNNAD IKASGSLHEFLMDIHADYGPMASFLFGPQLVVSICCPDLYKQYLKTFDRPASLFEFLEPLITANSIQFAN GAEGRMRHSAYMRSFGDSMVKNYYPMLQ KFASDLTRSLQLEAKETHIPLRKYMFAFAIKAMLRGAYGKYF DSDEEIHKLRAAYDVCWTDLEHRLLGDLPVEGSKRDERFEKNKKIMYDIMQKVLDDRRSNPPENDERCFV DALIDLDVSEDVKKSDMLTVLIAGFHTTGLLLTWCLYYLAKHEEIQDKLLGEIEAVLGKDGVVKADN VHQ LPYLRQVLNESLRCSVLAPFAARVQSTDVEIGGHTIPANTPVLMALGVTLCNDEFYPDPEIFDPERFTEE NMKNRPSLAFEPFGFGKRKCLGYKFSMAEFSVLIPTLIRHLKVELVPGQVVTPMFGLVTTPTDEIWVTVS ERK* CYP20A1P Capitella sp. I (polychaete worm, annelid, Lophotrochozoa) JGI gene model fgenesh1_pg.C_scaffold_1378000002 [Capca1:185650] Capca1/scaffold_1378:17781-19064 Pseudogene 51% to the N-term and 72% to the C-term of the whole gene in Capitella & = frameshift MIPIIYLLVVILVVIVLAALFVQAGNSQIRKSTIPGPKPRNKHEGNLADIRAIGSLHQFLLNLHGNFGPI ASFHYGAQMVVSISSPDLYKKYLKTFDRPPVLFEFPMISKHSLLLTNGQDAHIRREAYLHTFSDVAIKQQ YPMLQG (deletion of 179 aa) LHQIP& FLRQVMNESVRCSVLGAVAGRVQAVDVEVGGHQIPAH (0) TPVVMAIGVTHVNEEFYPAPERFDPGRFTEENVKSRPSLSFNPFGFGKRKCLGYKFSNAEFYVF ITTLLRCLKVELVPGQVVTPLYRVVTTPADEVWVTLSPRY* CYP20 Lottia gigantea jgi|Lotgi1|82740|gw1.84.105.1 37% to CYP20 Danio EST FC700835.1 MLDFAIFAVTFFMMLVGAVIYLMPSSRKATSVPGLDPTTKEDGNLSDIGRAGSLHEF LLDLHGRFGDIASFWMGRQLVISIASPELFKQHQNVFDRPVELFKVFEPVIGERSIQYCN GNDGRARRQTYDKSFSHHELFSYSPKFQEVADNLVKTLENKVKEEHLSLNQYMIAYAFKT VMNSLYGDAMDDEKDIIAMRGHYDIMWSEMEKRIADPTVPEDGSPRAISLNLALKEMRKI VQNIIDFRKKNGKKSEDYLLIDRIVEHMKTEDGIFSEILTFL IGGFHTAGNLLTWAVYFL ATHEEVEKKLLKEIKNVLGDDDISADNYDQLKYTKQIIDETMRCAVVAPWAARYQDFDSE LGGHKIPKKTPVIHALGVSLQDERQWPQPKIFDPERFNEENIKKRPHLAFSPFGFAGRRV CPGRKFAYLESAICLVSLFRRFKFHLVEGQVVQPVFGLVTHSEDDIWVTVSKR* CYP20 Aplysia californica (Califonia sea hare,mollusc, Lophotrochozoa) EB277115.1, EB214176.1, EB282241.1 MLDFVIFAVTFLTVLLVAIIWLYPSSKKVTTIPGVDPSSK EDGNFSDITRAGSLHEFLLDLHSQFGDIAGFWWGQTYVVSIASPELFKAYANVFDRPTELFRF IEPLISMQSLQFTNGSDGRMRRKAYDRPFSHENLKKHFAEFQTIAGIVEQKWSKKSESDH IPLGEDMALFSVRATLLTLMGDSFQ (gap) VGALHTSAYMLTWAMYFLATH xxxxxxxxxVQVLGQKDVVTDSNYGQLRYLYQVLQETLRCAVVAPWGARVQDFDTELGGHKVPTK TPVIHALGVVLQNEKLCPLPKRFDPDRFSTENSKDQPTYAFSPFGFAG KRTCPGYKFANRGSTVLIATLIKKFEVSMWSDQVVKPVFGLVTRPHEEILLKVSRRK* CYP20 Crassostrea virginica (Oyster, mollusc, Lophotrochozoa) EH649216.1 N-term MLDFVIFAVTFVVALLIAVIYLYPGSKRITTIPGPDLTTKED GNLADLARAGSLHEYLLDLHAQYGDITAFWMGQELVVSICSPDLFKQHSTVFDRPVSLFK MFEPLIGEKSIQYANKADGRNRRKLYDPVLSSDYLHHYYTAMQEVSDSLVKKWSSLPKEE HVPVAQYMIAQALKMA CYP20 Mytilus californianus (mussel, mollusc, Lophotrochozoa) ES406577.1 EST EGGARFQDF DSELGGHKIPKNTPVIHALGVTQQDEAIWPVPNRFDPDRFSEENSKKRQSLAFQPFGFAG KRICPGYRFAYAEATVCFVTLLRKFKFNMVEGQVITPVHGLVAHPSEEVWITIDKR* CYP20 Nematostella vectensis (starlet sea anemone, Cnidarian) XM_001634458 matches genomic seq NZ_ABAV01008690.1 and ABAV01008690.1 40% to human 31% to CYP20 HUMAN missing N-term 14466 (2) aga NGNLLDVKKAGSLHEYLLLLHKTYGPVASFMWGTINVVSVADPTLWKE LRSLFDRP (1) 14639 15646 PELFKLFEPLIGKKSIQYANGVHGRKLRQLFDRGFSHQAVKDYYQ (2) SFIE(0) 15792 15931 (2) DCCSDWFFLCPQVAEKVANQWVIKSSDEHIPIVESMLMLAIRAIGKAGM GKRFSAEKDVQAFMNAYRT (0) 16134 17805 TWEEMEACLNGSLSTKGSKRDQRFQN (1) 17882 18216 DMEVMRKMIRETISARRNQRTSSPEPLFIDSVLDCDFIDEKEVRK 18350 18790 EDDMISFFVGGFHTTGN (1) 18840 19760 MMSWCFYYLAIHPEIQEKVHQELIDVLGNDAIDPQAASELQ (2) 19882 20039 FCRQVLDETLRCAVVAPWGARVQYDHDLQIGEYVVPKG (0) 20152 20626 TPIITPFGVVLQDPELYPEPQM (2) 20691 21349 FDPDRFSPEAMKELPSHAYEP 21411 21412 FGFAGKRKCPGWRFSIAEGLVFLSVFVRRFKIRLVPGQKVEPVYGLVTTPKEELWVAVEKRK* 21600 CYP20 Hydra magnipapillata (hydra, Cnidarian) BP508840 BP508840 ESTs ABRM01022129.1 ABRM01019337.1 genomic 36% to CYP20 C-term 7448 aga LGNLGSLTFDGGIHKFLVENHKRLGPMFSFYWGKELAVSLACPILFKEVATLFNRP (1) 7615 9076 VNQFKLFLPLIGKECVQYANEDRGKKLRQINDRHFSHSSIKSYFKEFKQ (0) 9222 YFLLKVGAELCSKWSSSENTRIPLNQNMLSIAFKTIVKTSFG agt YTKQVIDEVMRIAVLAPYAARYSDYDIIVDGHLIPKK (0) TPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGF AGKRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWIKVLRRKNI* CYP20/CYP38A1 Suberite domuncula (sponge) GenEMBL Y17816 (1789bp) Mueller,W.E.G., Wiens,M., Batel,R., Steffen,R., Borojevic,R. and Custodio,M.R. Establishment of a primary cell culture from a sponge: Primmorphs from Suberites domuncula Mar. Ecol. Prog. Ser. In press Name change to CYP20 is recommended MLDFVIFAITAVAGLIGILLFFYFSRSTETKPVSSASPTSTIPR WSAPPADIEKGDLDVMMKKHGSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFK ESAVFVNRPVELFVGFEPLITPFSIQYANDEDWVQRSKCLYHTLKGDDLKSYFHHFVQ IAQEEESLWSSYTSDKEVSLTKEVFPMTIKGIARTCFGDIFKDENELSKMAESYHVCW RTMEEGVPEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFIDSMLQNYDSE DKIIADAISFMVGGFHTSGYMFTWMLWYLSSHPESQDRLRTEIERETGGERGDRLKEY SLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKT VWENTDSWDPDRFSPNGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIV PVEGQTVIQVHGLVTEPKDDIKIYIRSRKED CYP20 Schmidtea mediterranea (freshwater planarian) NZ_AAWT01057969 34% to Paracentrotus lividus 33% to Strongylocentrotus purpuratus 32% to Saccoglossus kowalevskii 25% to human CYP20 from SmedGD database Missing first exon 23008(2)aga KPKQIVIIPGLQKSDPK(2) 22918FGNLLEIKNSGSLHQFLSDLHLKFGAVATFYWNDQKIVSVCSNDLFKSVQDLFDRPSLI FGNFHEFIGENCLQYANGNEGQNRRKQYDKAFQHDMIIR(2)22625 18726 YISTIENVGKSNLEEWHGSKVTVDSIEHWAYIMAVKSILKACIIPEEQSPEYCDTIITLYKS VFIGLEKKFVECKVLSDEEEIQFQKDKNQFYSIIREIINKRLDFLENSDLENEKLLLIDY IILYHRELNRNLIENEVLIYLIGGFHTT(1) 18274 13764(1)GNALSWCIYYLAKHPKCQEKLFTEISKRQIQLQNTSECYKLILD LKYLKACIEETLRLSQLAPYAARVSDEDRYLSNYEVPANTPIILALGVSLKDEKI FNNPDQFNPERFVDSNFPSFAFVPFGFAGKRKCPGER(2)13357 13304 FSYLELSIWLIQLVGKYRFTLTNPHQTVNKVYGLVTRMSENVSVQFGERSDVL*13143 21A Subfamily CYP21A1P human GenEMBL M13935 (3206bp) White,P.C., New,M.I. and Dupont,B. Structure of human steroid 21-hydroxylase genes. Proc. Natl. Acad. Sci. U.S.A. 83, 5111-5115 (1986) 97% to 21A2 NT_033167.1|Hs6_33343 329415 MLLLGLLLLLPLLAGARLLWNWWKLRSLHLLPLAPGFLHLLQPDLPIYLLGLTQKFGPIYRLHLGLQ 329215 329117 DVVVLNSKRTIEEAMVKKWADFAGRPEPLTCK 329022 LVSKNYPDLSL 328702 XXWSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCE 328592 RMRAQPGTPVAIEEEFSLLTCSINCYLTFGDKIK 328383 328294 EDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLR 328193 328091 FFPNPGLRRLKQAIEKRDHNEEKQLRQHK 327835 ESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLEGHLHMAAVDLLIGGTETTANTLSWAVV 327654 FLLHHPE 327634 IQQRL*EELDHELGPGASSSRVPYKDRARLPLLNATIAEVLRLWPVV 327292 PLALPHRTTRPS 327257 SISGYDIPEGTVIIPNLQGAHLDETVWERPHEFWP 327069 326971 DRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDALPSLQPLPH 326791 CSVILKMQPFQVRLQPRGMGAHSPGQNQ 326708 CYP21A1P human with congenital adrenal hyperplasia GenEMBL M26857 (4034bp) X05445 Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C., Porter,R.R. and Campbell,R.D. Molecular characterization of the HLA-linked steroid 21-hydroxylase B gene from an individual with congenital adrenal hyperplasia. EMBO J. 6, 1653-1661 (1987) CYP21A1P human GenEMBL S60612 (426bp) Collier,S., Tassabehji,M. and Strachan,T. A de novo pathological point mutation at the 21-hydroxylase locus: implications for gene conversion in the human genome. Nature Genetics 3, 260-265 (1993) CYP21A2 human GenEMBL M21544 M21545 M21547 to M21550 M23224 M23225 M31022 M31023 (ten segments) Globerman,H., Amor-Gueret,M., Parker,K.L., New,M.I. and White,P.C. Nonsense mutation causing steroid 21-hydroxylase deficiency J. Clin. Invest. 82, 139-144 (1988) CYP21A2 human GenEMBL M12792 M23280 (5141bp) Higashi,Y., Tanae,A., Inoue,H., Hiromasa,T., and Fujii-Kuriyama,Y. Aberrant splicing and missense mutations cause steroid 21- hydroxylase [P-450(C21)] deficiency in humans: possible gene conversion products. Proc. Natl. Acad.Sci USA 85, 7486-7490 (1988) CYP21A2 human GenEMBL X54940 Partanen,J. and Campbell,R.D. Rapid characterisation of mutant P450c21 Genes by PCR unpublished (1993) CYP21A2 human GenEMBL X58898 to X58908 PIR S26485 (97 amino acids) PIR S26484 (371 amino acids) PIR S29670 (495 amino acids) PIR S29671 (372 amino acids) PIR S26584 (371 amino acids) PIR S29673 (372 amino acids) Helmberg,A., Tabarelli,M., Dobler,G. and Kofler,R. Identification of molecular defects causing congenital adrenal hyperplasia by cloning of PCR-amplified 21-hydroxylase genes unpublished (1992) CYP21A2 human with congenital adrenal hyperplasia GenEMBL M28548 (4044bp) X05449 Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C., Porter,R.R. and Campbell,R.D. Molecular characterization of the HLA-linked steroid 21-hydroxylase B gene from an individual with congenital adrenal hyperplasia. EMBO J. 6, 1653-1661 (1987) Note: mutant gene, 2 amino acid differences CYP21A2 human with congenital adrenal hyperplasia GenEMBL M26856 X05448 (4042bp) Rodrigues,N.R., Dunham,I., Yu,C.Y., Carroll,M.C., Porter,R.R. and Campbell,R.D. Molecular characterization of the HLA-linked steroid 21-hydroxylase B gene from an individual with congenital adrenal hyperplasia. EMBO J. 6, 1653-1661 (1987) Note: normal gene CYP21A1 Bos taurus (cattle) GenEMBL M11267, M13545 Chung BC, Matteson KJ, Miller WL. (1986) Structure of a bovine gene for P-450c21 (steroid 21-hydroxylase) defines a novel cytochrome P-450 gene family. Proc Natl Acad Sci U S A. 83, 4243-4247. First cattle CYP21A1 gene CYP21A1 Bos taurus (cow) See cattle page for details MVLAGLLLLLTLLSGAHLLWGRWKLRNLHLP 234 PLVPGFLHLLQPNLPIHLLSLTQKLGPVYRLRLGLQEVVVLNSKRTIEEAMIRKWVDFAG 413 414 RPQIPSYKLVSQRCQDISLGDYSLLWKAHKKLTRSALLLGTRSSMEPWVD 563 564 QLTQEFCERMRVQAGAPVTIQKEFSLLTCSIICYLTFGNKEDTLVHAFHDCVQDLMKT 737 738 WDHWSIQILDMVPFLRFFPNPGLWRLKQAIENRDHMVEKQLTRHKESMVAGQWRD 902 903 MTDYMLQGVGRQRVEEGPGQLLEGHVHMSVVDLFIGGTETTASTLSWAVAFLLHHPEIQ 1079 1080 RRLQEELDRELGPGASCSRVTYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSIF 1259 1260 GYDIPEGMVVIPNLQGAHLDETVWEQPHEFRPDRFLEPGANPSALAFGCGARVC 1421 1422 LGESLARLELFVVLLRLLQAFTLLPPPVGALPSLQPDPYCGVNLKVQPFQVRLQ 1583 PRGVEAGAWESASAQ* CYP21A1 pig S53049 M83939 MVLVWLLLLLTLLAGARLLWGQWKLRNLHLPPLVPGFLHLLQPNLPIYLLGLTQRLGPIY RLRLGLQDVVVLNSKRTIEEALVRKWVDFAGRPQIPSYKLASQHCPDISLGDYSLFWKAH KKLTRSALLLGVRSSMEPRVEQLTQEFCERMRAQAGTPVTIQKEFSVLTCSIICCLTFGD KEDTLVHALHDCVQDLMKTWEHWSIQILDMVPFLRFFPSPGLRRLKQAIENRDHLVEKQL RRHKESMVAGQWRDMLDYMLQEAGRQRVEEGQGQLLEGHVHMSVVDLFIGGTETTANTLS WAVVYLLHHPEIQWRLQEELDRELGPGAAGSRVPYKDRARLPLLNATIAEVLRLRPVVPL ALPHRATRPSSIFGYDIPEGTVVIPNLQGAHLDETVWEQPHEFRPDRFLAPGANPSALAF GCGARVCLGEPLARLELFVVLVQLLQAFTLLPPEGALPSLQPHPHSGINLKVQPFQVRLQ PRGGRGEGPGPR CYP21 rat GenEMBL U56853(1964bp) Zhou,M.Y., Vila,M.C., Gomez-Sanchez,E.P. and Gomez-Sanchez,C.E. Cloning of two alternatively spliced 21-hydroxylase cDNAs in the rat adrenal. unpublished (1996) MLLPGLLLLLLLLLLAGTRWLWGQWKLWKLRLPPLAPGFLHFLQPNLPVYLFGLAQKLGP IYRIRLGLQDVVVLNSNKTIEEALIQKWVDFAGRPQILDGKMNFDLSMGDYSLTWKAHKK LSRSALVLGMRDSMEPLVEQLTQEFCERMRAQAGASVAIHKEFSLLTCSIISCLTFGDKQ DSTLLNATHSCVRDLLKAWNHWSVQILDIIPFLRFFPNPGLWKLKQFQESRDHIVMQELK RHKDSLVAGQWKDMIDYMLQGVEKQRDARDPGQLHERHVHMSVVDLFVGGTETTAATLSW AVAFLLHHPEIQKRLQEELDLKLAPSSQLLYKNRMQLPLLMATIAEVLRLRPVVPMALPH RATKASSISGYDIPKDTIIIPNIQGANLDEMVWELPSKFWPDRFLESGKSPRIPTFGCGA RVCLGEPLARLEFFVVLARLLQTFTLLPPPDGTLPSLQPLPYTGINLLIPPFQVRLQPRN LAPQDQGQK CYP21 rat GenEMBL U56854 alternatively spliced form Cyp21 mouse genEMBL AF049850 Mm.18845 MLLPGLLLLLLLLAGTRWLWGQWKLRKLHLPPLAPGFLHFLQPN LPIYLLGLTQKLGPIYRIRLGMQ DVVVLNSNRTIEEALIQKWVDFAGRPHMLN GKMDL DLSLGDYSLMWKAHKKLSRSALMLGMRDSMEPLIEQLTQEFCERMRAQAGTPVAIHKE FSFLTCSIISCLTFGDKDSTLVQTLHDCVQDLLQAWNHWSIQILTIIPLLRFLPNPGL QKLKQIQESRDHIVKQQLKRHKDSLVAGQWKDMIDYMLQGVEKQRDGKDEERLHEGHV HMSVVDLFIGGTETTATTLSWAVAFLLHHPEIQKRLQEELDLKLGPGSQLLYRNRMQL PLLMATIAEVLRLRPVVPLALPHRATRASSISGYDIPKDMVIIPNIQGANLDEMVWEL PSKFWPDRFLEPGKNPRTPSFGCGARVCLGEPLARLELFVVLARLLQAFTLLPPPDGT LPSLQPQPYAGINLPIPPFQVRLQPRNLAPQDQGERP Cyp21a2-ps mouse GenEMBL AF030001 MLLPGLLLLLLLLAGTRWLWGQWKFRKLHLLPLAPGFLHFLQPNLPIYLLGLTQKLGPIYRIRLGLQ Gap 30 aa missing GKMDLDLSLGDYSLMWKAHKKLSRSALMLGMLDSMEPLIEQLTQEFCE RMRAQAGTPVAIHKEFSFLTCSIISCLTFGDK DSTLVQTLHDCVQDLLQAWNHWSIQILTIIPLLR FFPNPGLQKLKQIQESRDHIIKQQLKQHK DSLVAGQWKDMIDYMLQEVEKQRDGKDKEQLHEGHVHMSVVDLFIGGTETTATTLSWAVAFLLHHPE IQK*LQEELDLKLGPGSQLLYRNRMQLPLLMATIAEVLHLRPVVPLALPHRATKAS SISGYNIPKDMVIIPNIQGANLDEMVWELPSKFWP DRFLEPGKNPRTPSFGCGARVCLGEPLARLELFVVLARLLQAFTLLPPPDGTLPSLQPQL YAGINLLIPPFQVRLQPRNLAAQELGEKTLTG* CYP21 pig GenEMBL S53049 M83939 (4792bp) Swiss Q02390 (492 amino acids) PIR S28169 (492 amino acids) Burghelle-Mayeur,C., Geffrotin,C. and Vaiman,M. Sequences of the swine 21-hydroxylase gene (Cyp21) and a portion of the opposite-strand overlapping gene of unkown function previously described in human. Biochim. Biophys. Acta 1171, 153-161 (1992) CYP21 sheep PIR A43349 (497 amino acids) B43349 (479 amino acids) Crawford, R.J., Hammond, V.E., Connell, J.M. and Coghlan, J.P. The structure and activity of two cytochrome P450c21 proteins encoded in the ovine adrenal cortex. J. Biol. Chem. 267, 16212-16218 (1992) B43349 is truncated early after alternate splicing CYP21 Xenopus tropicalis See Xenopus pages for seq CYP21A1 Danio rerio (zebrafish) See xebrafish pages for seq CYP21 Fugu rubripes (pufferfish) No accession number Scaffold_15 44% to CYP21 human over 454 amino acids this is the first fish CYP21 sequence found MCSFLFCSSFSAPPAVKRNLPLSLWQLPLRPSSPPIPGPPCRFLIGNMTE 28230 LMHDHLPIHLTNLAKRYGNIYRLKCGNTT 28144 (1) 28058 AMIVLNSSDIIREALVKKWSDFAGRAVSYT 27969 (1) 27888 ADIVSGGGRNISLGDYTEEWKALRRLVHGALQRCCKHSLHNVIERQALQLRK 27754 (0) VLVDYRGGAVDLSEDFTVAASNVIITLVFGKE (0) YDKSSSELQQLHRCLNEIVALWGSTWISALDTFPLLR (0) KFPNPVFSRLLREVSRRDEIIRKHLNQFK (0) CVLCCAQSEGHRRTDVITGSLLEG (0) 26896 VLTDMHVHMATVDLLIGGSETTAAWLNWTVAFLLHRPE (0) FQTKVYEELCTVLEGRYPKYSDRQRLPILCSLIHEVLRLRPVAPLAVPHKAIRDS (2) SIAGYFIPRNTIIIPNLFGAHHDPEVWSDPYSFKP (1) 26238 ERFLEGGGGSTRALIPFGGGARLCLGETVAKMELFLFTAYLLRDFCFVLPDSEAPLPDLR 26059 26058 GVASVVLKIKSFTVIARPRTGP* 25990 CYP21 Tetraodon nigroviridis (freshwater pufferfish) GenEMBL AL281449.1 C0BG094DE12LP1 G Tetraodon nigroviridis genomic clone 094J24 T7.Length = 895 AL233853.1 C0BG007BE04XD1 G Tetraodon nigroviridis genomic clone 007I08 T7.Length = 1079 86% to Fugu CYP21 MGCIFFFFYLPFSAPPAVKRSLLQSLCGLLHRPSSPSIPGPPCRFLIGNMTE LMQDHLPIHLTDLAKRYGNIYRLKCGNTS AMVVLSSGDVIREALVKKWSDFAGRSVSYT ADIVSGGGRTISLGDYTEEWKAHRRLVHSAL (frameshift) ERCXKQSLHDVIERQALQLRK Missing exon 4 and part of exon 5 GSAWISALDTFPLLR KFPNPVFSRLLREVTRRDEIIRKHLNQYK CVLCCVQSQDNKSTDVITGSLLEG VLTDVHVHMATVDLLIGGTETTAAWLNWTVAFLLHRPE IQTKVYEELCTVLEGRYPKYSDRHRLPVLCSLVHEVLRLRPVAPLAVPHKAVRDS SIAGYFIPKNTIIIPNLFGAHHDPXVWPDPYSFXX Missing exon 11 22A Subfamily CYP22A1 Caenorhabditis elegans (nematode worm) GenEMBL U39648 (28581 bp) AF407572, NM_171699, NM_171698 cosmid T13C5.1 Jia,K., Albert,P.S. and Riddle,D.L. DAF-9, a cytochrome P450 regulating C. elegans larval development and adult longevity Development 129 (1), 221-231 (2002) daf-9 mutant This gene may be in the pathway to synthesize a ligand for DAF-12, a nuclear receptor. Daf-9 has a role in larval development. DAF comes from abnormal DAuer Formation 23A Subfamily CYP23A1 Caenorhabditis elegans (nematode worm) GenEMBL U39472 (42282 bp) see CYP14A1 for reference cosmid B0304.3 24A Subfamily CYP24 rat GenEMBL L04608 to L04619, S52625 to S52636 Ohyama,Y., Noshiro,M., Eggertsen,G., Gotoh,O., Kato,Y., Bjorkhem,I. and Okuda,K. Structural characterization of the gene encoding rat 25-hydroxyvitamin D3 24-hydroxylase. Biochemistry 32, 76-82 (1993) CYP24 rat GenEMBL Z28351 (1419bp) Hahn,C.N., Kerry,D.M., Omdahl,J.L. and May,B.K. Identification of a vitamin D responsive element in the promoter of the gene for the rat 25-hydroxyvitamin D3 24-hydroxylase Nuc. Acids Res. 22, 2410-2416 (1994) CYP24 rat GenEMBL X59506 (3209bp) Ohyama,Y., Noshiro,M. and Okuda,K. Cloning and expression of cDNA encoding 25-hydroxyvitamin D3 24- hydroxylase. FEBS Lett. 278, 195-198 (1991) CYP24 rat GenEMBL D17792 (?bp) Ohyama, Y., Ozono,K., Uchida,M., Shinki,T., Kato,S., Suda,T., Yamamoto,O., Noshiro,M. and Kato,Y. Identification of a vitamin D-responsive element in the 5'-flanking region of the rat 25-hydroxyvitamin D3 24-hydroxylase gene. J. Biol. Chem. 269, 10545-10550 (1994) CYP24 rat GenEMBL U03112 (?bp) Zierold,C., Darwish,H.M. and DeLuca,H.F. Identification of a vitamin D-response element in the rat calcidiol (25-hydroxyvitamin D3) 24-hydroxylase gene. Proc. Natl. Acad.Sci. USA 91, 900-902 (1994) CYP24 human GenEMBL S67623 (776bp) Labuda,M., Lemieux,N., Tihy,F., Prinster,C. and Glorieux,F.H. Human 25-hydroxyvitamin D 24-hydroxylase cytochrome P450 subunit maps to a different chromosomal location than that of pseudovitamin D-deficient rickets. J. Bone Miner. Res. 8, 1397-1406 (1993) CYP24 human GenEMBL L13286 (3254bp) Chen,K.-S., Prahl,J.M. and DeLuca,H.F. Isolation and expression of 1,25-dihydroxyvitamin D3 24-hydroxylase cDNA. Proc. Natl. Acad. Sci. USA 90, 4543-4547 (1993) CYP24 human NM_000782 MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVC PLTAGGETQNAAALPGPTSWPLLASLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLG SFESVHLGSPCLLEALYRTESVPQRLEIKPWKAYRDYRKEGYGLLILEGEDWQRVRSA FQKKLMKPGEVMKLDNKINEVLADFMGRIDELCDERGHVEDLYSELNKWSFESICLVL YEKRFGLLQKNAGDEAVNFIMAIKTMMSTFGRMMVTPVELHKSLNTKVWQGHTLAWDT IFKSVKACIDNRLEKYSQQPSADFLCDIYHQNRLSKKELYAAVTELQLAAVETTANSL MWILYNLSRNPQVQQKLLKEIQSVLPENQRPREEDLRNMPYLKACLKESMRLTPGVPF TTRTLDKATVLGEYALPKGTVLMLNTQVLGSSEDNFEDSSQFRPERWLQEKEKINPFA HLPFGVGKRMCIGRRLAELQLHLALCWIVRKYDIQATDNEPVEMLHSGTLVPSRELPI AFCQR CYP24A1 Bos taurus (cow) See cattle page for details 6035 MSSPISKSRSLAAFLQQLRSLGQPPRPVTSTACSPRRPREVPLCPVMEPGEAQDAATLP 6211 6212 GPTKWPLLGSLLEILWKGGLKKQHDTL (0) 6511 VEYHKKYGKIFRMKLGSFDSVHLGSPCLLEALYRTESAHPQRLEIKPWKAYRDYREEGYGLLIL(2) 6702 7484 EGEDWQRVRSAFPKKLMKPVEIMKLDDKINE(0) 7573 4559 VLADFMGRIDELCDERGRIEDLYTELNKWSFE (1) 2100 SICLVLYEKRFGLLQKNAGEEALNFIMAVKT (0) 1035 MMSMFGKMMVTPVELHRSLNTRVWQAHTQAWDTIFRS (1) 925 1303 VKSCVDNRLEKYSEQPSMDFLCDIYHHNQLSKKELYAAVTELQLAAVET (0) 1449 TANSLMWILYNLSRNPHVQQKLFKEIQSVLPENQLPRAEDLRNMPYLKACLKESMR LNPTVPFTTRTLDKAMVLGEYALPKG 5393 TVLVLNTHVLGSSEENFEESSQFRPERWLQDKKNLNPFAHLPFGVGKRMCVGRRLAELQL 5572 5573 HLALCW (0) 5602 IVRKYDVVATDLEPVETLHLGNLVPGRQLPVAFCQR CYP24A1 pig NM_214075 MSSPISKSGSLAALLQQLRSCGQPPRPVTSTACVPRRSKEVPLC PMEQPGETQDAAXAGSTRWPLLGSLLEILWKGGLKKQHDTLAEYHRKYGKIFRMKLGS FDSVHLGSPCLLEALYRTESAHPQRLEIKPWKAYRDYRKEGYGLLILEGEDWQRVRSA FQKKLMKPMEVMKLDNKINEVLADFMSRIDELCDERGCIEDLYSELNKWSFESICLVL YEKRFGLLQKNAGEEALNFITAIKTMMSTFGKMMVTPVELHKNLNTKVWQAHTLAWDT IFKSVKSCIDHRLEKYAEQPSADFLCDIYHHNQLSKKELYASVTELQLAAIETTANSL MWILYNLSRNPHVQQKLLKEIQSVLPENQMPRAEDLRNMPYLKACLKESMRLTPSVPF TTRTLDKAMVLGEYALPKGTVLMLNTYVLGTNEENFEDSSQFRPERWLQEKKKINPFA HLPFGVGKRMCIGRRLAELQLHLALCRIIRKYAIVATDSEPVGMLHLGILDPTRQLPI AFCQR Cyp24 mouse GenEMBL D89669, AC084066.1, Mm.6575 MSCPIDKRRPLIAFLRRLRDLGQPPRSVTSKAHVKRAPKEVPLC PLMTDGETRNVTSLPGPTNWPLLGSLLEIFWKGGLKKQHDTLAEYHKKYGQIFRMKLG SFDSVHLGSPSLLEALYRTESAHPQRLEIKPWKAYRDHRNEAYGLMILEGQEWQRVRS AFQKKLMKPVEIMKLDKKINEVLADFMGQIDELRDERGRIQDLYSELNKWSFESICLV LYEKRFGLLQKDTEEEALTFIAAIKTMMSTFGKMMVTPVELHKRLNTKVWQAHTLAWD TIFKSVKPCIDHRLERYSQQPGADFLCDIYQQDHLSKKELYAAVTELQLAAVETTANS LMWILYNLSRNPQVQQRLLREIQSVLPDNQTPRAEDVRNMPYLKACLKESMRLTPSVP FTTRTLDKPTVLGEYTLPKGTVLTLNTQVLGSSEDNFEDADKFRPERWLEKEKKINPF AHLPFGVGKRMCIGRRLAELQLHLALCWIIQKYNIVATDSEPVEMLHLGILVPSRELP IAFCPR CYP24A1 Xenopus tropicalis See Xenopus pages for seq CYP24A1 Danio rerio (zebrafish) See xebrafish pages for seq CYP24 Fugu rubripes (pufferfish) No accession number Scaffold_4128 MLWRLRGALTLPPELTVLDAIPGPTNWPLVGSLFELLRKGGLTRQHEAL VDYHKKFGKIFRLKLGSFESVHIGAPCLLESLYRTEGSYPQRLEIKPWTAYRDMRDEAYGLLIL EGKDWQRVRRAFQQKLMKPTEVVKLDRKINE VLEDFVSRIGKTNIGGKIEDLYFELNKWSFES ICLVLYDKRFGLLQDKVNEEAMNFITAVKT MMSTFGLMMVTPVELHKSLNTKTWQDHTAAWDRIFST AKVYIDKKLKRNSVIAPDDLIGDILHQSRLSKKELYAAITELQIGGVET 8608 TANSMLWAIFNLSRNPGAQRRLLEEIRTVVPPEQDPCGEHIKSMPYLKACLKESMR 8441 (2) 7830 ISPSVPFTSRTLDKDTVLGDYAIPKG 7753 TVLMINSHALGSSEDYFDDGKKFKPERWLREHGTINPFAH VPFGIGKRMCIGRRLAELQMSLFLQLVRDFE IVATDNEPLDVIHSGLLVPNRELPVAFIKR 25A Subfamily CYP25A1 Caenorhabditis elegans (nematode worm) GenEMBL Z66495 ( 40145bp) see CYP14A1 for reference cosmid C36A4.1 CYP25A2 Caenorhabditis elegans (nematode worm) GenEMBL Z66495 ( 40145bp) see CYP14A1 for reference cosmid C36A4.2 CYP25A3 Caenorhabditis elegans (nematode worm) GenEMBL Z66495 ( 40145bp) see CYP14A1 for reference cosmid C36A4.3 CYP25A4 Caenorhabditis elegans (nematode worm) GenEMBL Z66495 ( 40145bp) see CYP14A1 for reference cosmid C36A4.6 CYP25A5 Caenorhabditis elegans (nematode worm) GenEMBL AF038613 see CYP14A1 for reference F42A6 CYP25A6P Caenorhabditis elegans (nematode worm) GenEMBL U50072 K06B9.1 missing C-terminal 26A Subfamily Cyp26a1 mouse No accession number Jim Ray submitted to nomenclature committee Note: new family in mammals, homolog to human ESTs R51129 and R21282 Cyp26a1 mouse No accession number Martin Petkovich submitted to nomenclature committee Note: new family in mammals, homolog to human ESTs R51129 and R21282 Cyp26a1 mouse GenEMBL Y12657 Fujii, H., Sato, T., Kaneko, S., Gotoh, O., Fujii-Kiriyama, Y., Osawa, K., Kato, S. and Hamada, H. Metabolic inactivation of retinoic acid by a novel P450 differentially expressed in developing mouse embryos. EMBO J. 16, 4163-4173 (1997) Note: new family in mammals, homolog to human ESTs R51129 and R21282 Cyp26a1 mouse GenEMBL Y12657 NM_007811 EST AA239785 no UNIGENE entry MGLPALLASALCTFVLPLLLFLAALKLWDLYCVSSRDRSCALPL PPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL GEHRLVSVHWPASVRTILGAGCLSNLHDSSHKQRKKVIMQAFSREALQCYVLVIAEEV SSCLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPGPAGGGEDEQQLVEAFEEMT RNLFSLPIDVPFSGLYRGVKARNLIHARIEENIRAKIRRLQATEPDGGCKDALQLLIE HSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREEIKSKG LLCKSNQDNKLDMETLEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGW NVIYSICDTHDVADIFTNKEEFNPDRFIVPHPEDASRFSFIPFGGGLRSCVGKEFAKI LLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTYFQGDI CYP26A1 human GenEMBL NM_000783 White,J.A., Beckett-Jones,B., Guo,Y.D., Dilworth,F.J., Bonasoro,J., Jones,G. and Petkovich,M. cDNA cloning of human retinoic acid-metabolizing enzyme (hP450RAI) identifies a novel family of cytochromes P450 J. Biol. Chem. 272 (30), 18538-18541 (1997) Note: new family in mammals, equal to human ESTs R51129 and R21282 CYP26A1 human NM_000783 MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPL PPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL GDDRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEV GSSLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMT RNLFSLPIDVPFSGLYRGMKARNLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIE HSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKSKG LLCKSNQDNKLDMEILEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGW NVIYSICDTHDVAEIFTNKEEFNPDRFMLPHPEDASRFSFIPFGGGLRSCVGKEFAKI LLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFHGEI CYP26A1 Bos taurus (cow) See cattle page for details MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSSRDRSCALPL PPGTMGFPFFGETLQMVLQREKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL GEHRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKK (0) VIMQAFSREALQCYVPVIAEEVGNYLEQWLSCGERGLLVYPQVKRLMFRIAMRILLGCE SRLASGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR GLKARDLIHARIEENIRAKIRRLPAAEAGGGCKDALQLLIEHSWERGERLDMQ ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKNK GLLCKGNQDNKLDVEI LEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVADIFT 182 183 NKEEFNPDRFLLPHPEDASRFSFIPFGG GLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDDLPARFTRFQGEI* 449 CYP26A1 Sus scrofa (miniature pig) no accession number Haitao Shang Submitted to nomenclature committee May 23, 2007 partial sequence 92% to human 26A1, full seq. constructed from ESTs Ortholog of human CYP26A1 CYP26A1 Sus scrofa (pig) Pig ESTs CN161524.1, CN162920.1, CF179212.1, DY434075.1, CF180567.1, CN165481.1, 93% to human 26A1 MGLPALLASVLCTFVLPLLLFLAAIKLWDLYCVSSRDRSCTLPLPPGTM GFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVS VHWPASVRTILGSGCLSNL HDSSHKQRKKVITQAFSREALQCYVPVIAEEVDSCLEQWLSCGERGLLVYPQVKR LMFRI AMRILLGCEPRLASGGEAEQQLVEAFEEMTRNLFSLPIDVPSSGLYRGLKARNLIHARIE ENIRAKICGLRAAGAGGGCKDALQLLIEHSWERGERLDMQALKQSSTELLFGGHETTASA ATSLITYLGLYPHVLQKVREELKSKGLLCKGNQDNKLDMEILEQLKYIGCVIKETLRLNP PVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDAADIFTNKEEFNPDRFLLPHPEDAS RFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWRLLNGPPTMKTSPTVYPVDDLPA RFTRFQGEI* CYP26A1 zebra fish GenEMBL U68234 White, J.A., Guo, Y.-D., Baetz, K., Beckett-Jones, B., Bonasoro, J. Hsu, K.E., Dilworth, F.E., Jones, G. and Petkovich, M. Identification of the retinoic acid-inducible all trans retinoic acid 4-hydroxylase. J. Biol. Chem. 271, 29922-29927 (1996) Note: new family in vertebrates, homolog to human ESTs R51129 and R21282 CYP26A1 Xenopus tropicalis See Xenopus pages for seq CYP26A1 Fugu rubripes (pufferfish) No accession number Scaffold_12575 67% to 26A1 human missing C-term zebrafish seq shown in lower case 2144 MAVSALLATFLCTIVLPLLLFLVTVKLWEVY (frameshift) VIRERDSACP (frameshift) SPLPPG (frameshift) TMGLPFIGETLQLILQ (0) 1689 RRKFLRMKRQKYGYIYRTHLFGNPTVRVTGANNVRHILLGEHRLVAV 1552 1551 QWPASVRTILGSDTLSNVHGAQHKTKKK 1465 (0) 1230 AIMQAFSREALEFYIPAMQHEVQAAVQEWLAKDSCVLVY PEMKRLMFRIAMQILLGFQLE 1051 (frameshift) 1049 QIKTDEQKLVEAFEEMIKNLFSLPIDMPFSGLYR 948 (0) 784 GLKARNFIHAKIEENIKRKLRESNSDSKCRDALQQLIDSSKKSGQVLSMQ 635 (0) 548 VLKESATELLFGGHETTASTATSLIMFLGLNPEVLDKLRHELSDKVMHKGF 396 (1) 329 LDLRSLNLETLEQLKYTSCVIKETLRMNPPVPGGFRVALKTFELG 195 (0) 100 GYQIPKGWNVIYSICDTHDVAEIFP 26 (frameshift) 24 NKEDFQPE 1 end of Scaffold_12575 RFMMKNCGDSSRFQYIPFGGG srmcvgkefakvllkiflveltqhcnwilsngpptmktgptiypvdnlptkftsyvrn CYP26A1 Xenopus laevis GenEMBL AF057566 Hollemann,T., Chen,Y., Grunz,H. and Pieler,T. Regionalized metabolic activity establishes boundaries of retinoic acid signalling. EMBO J. 17, 7361-7372 (1998) CYP26B1 human GenEMBL AC007002 Nelson, D.R. A second CYP26 P450 in humans and zebrafish: CYP26B1 Archives of Biochemistry and Biophysics 371, 345-347 (1999) MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVSNS IGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQ KLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRR GIQARQILQKGLEKAIREKLQCTQGKDYLDALDLLIESSKEHGKEMTMQELKDGTLELIF AAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLD CVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDP DRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRI TLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV CYP26B1 Bos taurus (cow) See cattle page for details MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRML LGPNTVSNSIGDIHRNKR VFSKIFSHEALESYMPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLG FSLPVDLPFSGYRRGIQARQTLQKG 77 LEKAIREKLQCTQGKDYSDALDIFIESSKEHGKEMTMQELKDGTLELIFAAYAT 238 239 TASASTSLIMQLLKHPAVLEKLREELRAKGLLHSGGCPCEGTLRLDTLSGLHYLDCVIKE 418 419 VMRLFTPVSGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFG 592 593 QARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVL 700 AVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPETEAMLSATV* CYP26B1 pig AK238570.1, CJ034657.1 aa 88-142 LLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVANSIGDIHRNKR Cyp26b1 mouse GenEMBL AC022779.3 AW047279 GSS AZ741670, AZ369731, BM936625.1 MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ (0) GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILLGEHQLVSTEWPRSARVLLGPNTVANSIGDIHRNKRK (0) VFSKIFSHEALESYLPKIQLVIQDTLRAWSSQPEAINVYQEAQRLTFRMAVRVLLGFSI PEEDLGHLFEVYQQFVENVFSLPVDLPFSGYRR (?) GIQARQILQKGLEKAIREKLQCTQGKDYSDALDILIESSKEHGKEMTMQELK DGTLELIFAAYATTASASTSLIMQLLK HPAVLEKLREELRAQGLLHGGGCPCEGTLRLDTLSSLRYLDCVIKEVMRLFTPVSGGYRTVLQTFELD (?) GFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLP FGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV CYP26B1 Xenopus tropicalis See Xenopus pages for seq CYP26B1 Danio rerio (zebrafish) See xebrafish pages for seq CYP26B1 Fugu rubripes (pufferfish) No accession number Scaffold_4267 74% to 26B1 human 8448 MLFDSFDLVSALATLAACLVSMALLLAVSQQLWQLRWTATRDRNCKLPMPKGSMGFPFIGETCHWLLQ 8651 17930 GSGFHASRRQKYGNVFKTHLLGRPLIRVTGAENIRKVLMGEHTLVTVDWPQSTSTLLGPNSLA 18118 18119 NSIGDIHRKKRK 18154 1487 VFAKVFSHEALESYLPKIQQVIQESLRVWSSNPEPINVYR 1606 (2) 1689 ESQRLSFTMAVRVLLGFRVSEEEMKHLFSTFQDFVDNLFSLPIDLPFSGYRK 1844 (0) 1921 GIRARDTLQKSIEKAIREKPLCSQGKDYSDALDVLMESAKENGSELTMQELK 2076 (0) exon 4 2851 ESTIELIFAAFATTASASTSLIMQLLRHPPVLERLREELRARGL exon 5,6 fused LHNGCLCPEGELRLDTIVSLKYLDCVIKEVLRLFTPVSGAYRTAMQTFELD 3135 (0) exon 5,6 fused 3271 GVQIPKGWSVMYSIRDTHDTSTVFKDVDVFDPDRFSQERGEDKEGRFHYLPFGGGVRSCLGKQLA exon 7 TLFLRILAIELASTSRFELATRQFPRVITVPVVHPVDGLKVKFYGLDSNQNEIMAKSEELLGAAV* 3663 exon 7 CYP26C1 human GenEMBL AL358613.11 May 2, 2001 522 amino acids, 6 exons, (0) = phase 0 intron 52% to 26B1 human, also 15 amino acid insertion in exon 5 vs. 26B1 MFPWGLSCLSVLGAAGTALLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLVQ (0) GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAVGEPHRRRRK (0) VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDASKALTFRMAARILLGLRL DEAQCATLARTFEQLVENLFSLPLDVPFSGLRK (0) GIRARDQLHRHLEGAISEKLHEDKAAEPGDALDLIIHSARELGHEPSMQELK (0) ESAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGSEGPPPD CGCEPDLSLAALGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD (0) GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRLHYIPFGGGARSCLG QELAQAVLQLLAVELVRTARWELATPAFPAMQTVPIVHPVDGLRLFFHPLTPSVAGNGLCL* CYP26C1 Bos taurus (cow) See cattle page for details 67 MLPWGLSCLSALGAVGTALLGAGLLLSLAQHLWTLRWTLSRDRASALPLPKGSMGWPFFGETLHWLVQ (0) GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTVLLGEHRLVRSQWPQSAHILLGSHTLLGAVGESHRQRRK (0) 5955 ILARAFSRAALECYVPRLQRALRREVRSWCAARGPVAVYEAAKALTFRMAARILLGLR 6134 6135 LDEAQCSELARTFEQFVENLFSLPLDVPFSGLRK (0) 6251 GIRARDQLHRHLEEAIAQKLLEDKTAVEPGDALDGIIHSTRELGHELSVQELK (0) 46 ESAVELLFAAFSTTASASTSLVLLLLQHPAAIAKIQQELAAQGLGRACGCAPAASGGG 225 226 AGPPPDCGCEPDLSLAALGHLRYVGCVVKEVLRLLPPVSGGYRTALRTFELD (0) 381 GYQIPKGWNVMYSIRDTHETAAVYRSPPEGF DPERFGTAGDDALGAAGRFHYIPFGGGARSCLGQELAQTVLQLLAVELVRTARWELATPA 2322 2321 FPAMQTVPIVHPVDGLRLFFHPLASPAAQDGQRS* CYP26C1 mouse GenEMBL AC110212.1 84% to 26C1 human exon 5 ELAVELLFAAFFTTASASTSLILLLLQHPAAITKIQQELSAQGLGRACTCTPRASGSPPDCGCEPDLSLAMLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD Cyp26c1 mouse GenEMBL XM_140712.1 May 17, 2002 also AC110212.4 1 MISWGLSCLSVLGAAGTTLLCAGLLLGLAQQLWTLRWTLSRDWASTLPLPKGSMGWPFFG 180 181 ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRL 338 339 VLARVFSRSSLEQFVPRLQGALRREVRSWCAAQRPVA 447 448 VYQAAKALTFRMAARILLGLQLDEARCTELAHTFEQLVENLFSLPLDVPFSGLRKGIRAR 627 628 DQLYEHLDEAVAEKLQEKQTAEPGDALLLIINSARELGHEPSVQELK ELAVELL 987 988 FAAFFTTASASTSLILLLLQHPAAITKIQQELSAQGLGRACTCTPRASGSPPDCGC 1155 1156 EPDLSLAMLGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDT 1335 1336 HETAAVYRSPPEGFDPERFGVESGDARGSGGRFHYIPFGGGARSCLGQELAQAVLQLLAV 1515 1516 ELVRTARWELATPAFPVMQTVPIVHPVDGLLLFFHPLPTSGAGDGLPF* CYP26C1 Xenopus tropicalis See Xenopus pages for seq CYP26C1 Danio rerio (zebrafish) See xebrafish pages for seq CYP26C1 Fugu rubripes (pufferfish) No accession number Scaffold_11741 Equally similar to 26B1 and 26C1 human But C-terminal is 68% to 26C1 while 58% to 26B1 Lower case region very poor match may not be correct exon structure here. 6362 MLGLVSALATALTTLLLLLLLLALTRQLWSFRWSLTRDRRCELPLPKGSMGWPLVGETFQWLFQ 6171 (0) 5571 GSNFHISRRKRHGNVFKTHLLGKPLVRVTGAENIRKILLGEHSLVCTQWPQSTRIILGPN 5392 5391 ALVNSIGELHKRKRK 5347 (0) 4963 ILAKVFSRKALESYLPRLQEVIKCEIAKWCAEPGSVDVYAATRSLTFRIAIGVLLGLHL 4781 4780 EEERIDYLAQIFGQLMSNLFSLPIDAPFSGLRK (0) 3972 GIKARKILHANMEKIIEKKMERQQEEEEYRDAFDYMLSTSKEQGQQISIQELK 3814 (0) 3581 ETAVELIFAAHSTTASAATSLVLQLLHHPEVVERVRVELEAQKLcynslnlpsqa 3417 (1) 3377 ctfpqsqchasnLSLDKLNQLHYIDCVIKEVLRFLPPVSGGYRTALQTFELD 3222 (0) 2770 GYQIPKGWTVMYSIRDTHETAEIFQNPELFDPDRFVTAQVESRSSRFSYVPFGGGVR 2600 2599 SCVGKELAQIILKTLTIELIRTCKWTLATEKFPKMQTVPIVHPVNGLHVNFMYKNLHEIDH* 2414 27A Subfamily CYP27A1 human Swiss Q02318 (531 amino acids) Cali J.J., Russell D.W. Characterization of the human sterol-27-hydroxylase: A mitochondrial P450 that catalyzes multiple oxidation reactions in bile acid biosynthesis. J. Biol. Chem. 266, 7774-7778 (1991) CYP27A1 human GenEMBL X59812 (2107bp) Guo,Y., Strugnell,S., Back,D.W. and Jones,G. Transfected human liver cytochrome P-450 hydroxylates vitamin D analogs at different side-chain positions Proc. Natl. Acad. Sci. U.S.A. 90, 8668-8672 (1993) CYP27A1 human NM_000784 MAALGCARLRWALRGAGRGLCPHGARAKAAIPAALPSDKATGAP GAGPGVRRRQRSLEEIPRLGQLRFFFQLFVQGYALQLHQLQVLYKAKYGPMWMSYLGP QMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTYGPFTTEGHHWYQLRQA LNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSDMAQLFYYFALEAIC YILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWN AIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPREAMGSLPELLM AGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMPLLKAVLKE TLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESFQPHRWL RNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPETGEL KSVARIVLVPNKKVGLQFLQRQC CYP27A1 Macaca fasicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP27A1_18D1 100% to AB125162, 97% to human CYP27A1 Macaca fasicularis (cynomolgus monkey) GenEMBL AB125162 Kusuda,J., Osada,N., Tanuma,R., Hirata,M., Sugano,S. and Hashimoto,K. Isolation and characterization of cDNA for macaque neurological disease genes Unpublished, partial seq CYP27A1 Bos taurus (cow) See cattle page for details MNRMGALGSARLRWALLGRRAALPGLGSFGARAKAAIPSALPAAQAAEAPGTGPGDRRLRSLD ELSGPGQLRLLFQLLVQGYVLHLHQLQVLNKAKYGPIWINRVGPQMHVHLASAPLLEQVM RQEGKYPVRDDMKLWKEHRDQQGLSYGPFTTMGEQWYRLRQTLNQRMLKPAEAALYTDAL NEVINDFMDQLKQLRAESASGDHVPDIAHQFYFFALEAISYILFEKRIGCLERSIPKDTE TFVRSVGLMFHNSLFVTFLPTWTRPLLPFWKRYLDGWNTIFSFGKKLIDQKLEEIEAQLK TENPEKTQISGYLHFLLTSG QLSPREAEGSLPELLLAGVDTTSNTLTWALYHLSKNPEIQAALHKEVVGVVPAGQVPQHK 961 DLARMPLLKAVLKETLRLYPVVPVNSRVVVDKEIEVGGFLFPKNTQFVLCHYVISRDPDI 1141 YPEPDSFQPQRWLRKNQPDALKTQHPFGSVPFGYGVRACLGRRIAELEMQLLLTRLIQHYE 1324 VVLAPETGEVTSVARIVLVPNKKVGLRFLQRQS* CYP27A1 Sus scrofa (miniature pig) no accession number Haitao Shang Submitted to nomenclature committee May 23, 2007 partial sequence 3 amino acid differences to complete seq AK232936.1 Ortholog of human CYP27A1 CYP27A1 pig AK232936.1 MAALGCARLRWALLGTRVPFPVIGSQGARAKAAIPTALPAAQTAEAPGTGPGDRRLRTLKELSGPGRLRLLFQLLAQGYVLHLHQLQVLNKAKYGPMWINYIGPQIHVNVASAPLLEQVMRQEGKYPVRNDMDLWKEHRDLQGLTYGPFTTEGESWYRLRQALNQRMLKPADAALYTNALNEVVDDFMVRLNQLRAESTLGDHVPDIAHQFYFFALEAICYILFEKHIGCLERSIPQDTKAFVRSVGFMFQNSLYATFLPKWTRPLLPYWRRYLDGWNTIFSFGKKLIDQKLEEIEAQLQAGGPEEPRISGYLHLLLTSGQLSPREAEGSLPELLMAGVDTTSNTLTWALYHLSKNPEIQAALHEEVVGMVPAGRVPQYKDLAHMPLLKAVLKETLRLYPVVPVNSRVTDKEIEVGGFLFPKNTQFVLCHYVVSRDPSIFPEPDRFSPKRWLRKNQTDALKGQHPFGSVPFGYGVRGCLGRRIAELEMQLLLTRLIQQYEVTLAPETGEVQSVARIVLVPNKKVGLRFLQRQG CYP27A1 rat GenEMBL M73231 (2300bp) Shayiq,R.M. and Avadhani,N.G. Sequence complementarity between the 5'-terminal regions of mRNAs for rat mitochondrial cytochrome P-450c27/25 and a growth hormone- inducible serine protease inhibitor: a possible gene overlap. J. Biol. Chem. 267, 2421-2428 (1992) CYP27A1 rat GenEMBL U17363, U17369 to U17376 genomic sequence Mullick, J., Addya,S., Sucharov,C. and Avadhani,N.G. Localization of a transcription promoter within the second exon of the cytochrome P-450c27/25 gene for the expression of the major species of two-kilobase mRNA. Biochemistry 34, 13729-13742 (1995) Cyp27a1 mouse GenEMBL 8 ESTs follow AI286988 AK004977.1 Mm.26793 MAAWSRTRLRWTLLDPRVVGRGLCPQGARAKATIPAALQAQESTEGPGTGQDRPRLRSP AELPGTGTLQFLFQLFLQGYVLHLPDLQVLNKTKYGPMWTTSFGTYTNVNLASAPLLEQV MRQEGKYPIRDHMDQWKDHRDHKGLTYGIFIAQGEQWYHLRQALKQRLLKPDEAALYTDA LNEVISDFITRLDQVRAESESGDQVPDMAHLLYHLALEAITYILFEKRIGCLKPSIPEDT AAFIRSVAIMFQNSVYITFLPKWTRPLLPFWKRYLNGWDNIFSFGKKLID EKVQELKAQL QETGPDGVRVSGYLHFLLTNELLSTQETIGTFPELLLAGVDT TSNTLTWALYHLSKSPEI QEALHKEVTGVVPFGKVPQHKDFAHMPLLKAVIKETLRLYPVVPTNSRIITEKETEINGF LFPKNTQFVLCHYVVSRDPSVFPEPNSFQPHRWLRKKEADNPGILHPFGSVPFGYGVRSC LGRRIAELEMQLMLSRLVQKYEIALAPGMGEVKTVSRIVLVPSKKVRLHFLQRQ CYP27A1 rabbit PIR A90152 (21 amino acids) Dahlbaeck, H. Characterization of the liver mitochondrial cytochrome P-450 catalyzing the 26-hydroxylation of 5beta-cholestane-3alpha, 7alpha,12alpha-triol. Biochem. Biophys. Res. Commun. 157, 30-36 (1988) PIR A90155 (21 amino acids) Dahlbaeck, H. Biochem. Biophys. Res. Commun. (1989) 159:370 CYP27A1 pig no accession number Kjell Wikvall 77% identical to human CYP27A1X Xenopus tropicalis See Xenopus CYP27A8, CYP27A9 and CYP27A10 CYP27A1 Fugu rubripes (pufferfish) No accession number Scaffold_3437 46% to 27A1 MLLPIVLIPQHFCAFPLHVCLPLCKL MWKKGSWLLRPLAWARSCSGGAGRVSTVPASPPNLRFKPRTVEDLPRVSLWELLYRLTFQGFHSHMHELQ (0) 136 ILEKGRYGPIYRNGMNAVSVSTAKLLGEVLRNDDKFPNRGDMSIWKEYRDLRGYGYGPFTE 321 (2) 536 KDERWYNLRAVLNKRMLRPKDALQYGDTIGEVVTDFIRRIYFLRQRSPTGDVVTDLNNELYHFSLE 733 (1) 816 AIASILFETRLGCLEEEIPTGTQDFINAISQMFSNNFQVFLMPKWSRGVLPYWRRYVAGWDGIFSF 1013 (1) 1206 ATRLIDRKMEFIQQHLDNNQNVEGEYLTYLLSNTQMSIKDVYGSVSELLLAGVDT 1370 (0) 1465 TSNTLTWTLHLLSKYPQCQEILFKEVSTSVPADRAPSAEEVTRMPYLRAVVKESLR 1632 (2) 1756 MFPVIPMNGRILADKDVMIGGYQFSKN 1836 (0) 1949 TAFNFSHYAIGRDEDTFPEPATFMPERWLQDSHNRPNAFGAIAFGFGVRGCVGRRIAELEMYSFLCH 2149 (0) 2308 LMRHFEIKPDPKMGELKSVCRTVLIPDKPVSLRFLDRGSGHAA* 2439 CYP27A1 Tetraodon nigroviridis chr2:17054633-17057366 at UCSC browser One frameshift at & Adjacent to BCS1L and ZNF142 as in humans MEGLIFGNGSVSLYYRFLERNSNILFFFKYSPLREQ MWWKGSRLLPRPLVGPRVYSREAGQVFTAPTPPSKPRTLEDLPRVSLWELLYRLTIQGFHSRTHELQ (0) ILEKERHGPIYRNGMKAVSICTATLLEEVLRNDDKFPNRGDMSIWKEYRDL RGYGYGPFTE (2) NEDRWYNLRVVLNKRMLHPRDALQYGGTLSEVVTDFIGRIYLLRQRT & PTGDVVADLNNELYRFSLE (1) AIASILFETRLGCLEEKIPTGTQDFISAISQMFSNNLQVFLMPRWSRGLLPYWRRYVAGWDGIFSF (1) ATKLIDRKMEFIQQRLDDNQNVEGEYLTYLLANAQMSIKDVYGSISELLLAGVDT (0) TSNCLTWTLHLLSEYPQCQEILFKEVSTSVPAGRAPSAEEVTKMPYLRAVVKESLR (2) MFPVVPMNGRVVADRDVMIGGYHFSKN (0) TAFNFSHYAISRDEHTFPEAARFLPERWLRDGHRRGGPNAFGTIPFGFGVRGCVGRRIAELEMYLFLFH (0) LMRHFEIKPDPQMGELKSICRTVLIPDKPLSLYFLDRRSGHAA* CYP27A2 Fugu rubripes (pufferfish) chrUn:122717928-122721737 on UCSC browser or Scaffold_697 41% to 27A1 mouse, 49% to 27A3 Fugu, 42% to 27A1 Fugu 38% to 27C1 Fugu adjacent to MAP3K2 and SUMO1 note: MAP3K2 is next to ERCC3 in humans that is adjacent to CYP27C1 Fugu 27C1 is adjacent to ERCC3 Therefore CYP27A2 and CYP27C1 had a common origin 29906 MASFTALRCAAIGARNSALRPATLPSRNLNLQATSEAANLKGIADLPGPNTYKILYWLFVKGYGERSHLLQ 30118 (0) 30735 GKLKNIYGPMWRWKLGPYDFVSVASPELIARVIQQEGRYPVRVQLPHWKEYRDLRGQAYGLHVE 30926 (2) 31027 TGPEWSRLRSALNPRMLKLREVVAFSPVIHEVVGDLLRRIEFLRSGSQDGVTVSDIASELYKFGFE (1) 31334 GISAILFETRLGCLGEKVDPNVQRFISGVNDMLSLSDITYLFPRWTRSFVPVWKRFAQAWDDISDV 31531 (1) 31614 ASSLIDRRIAEIDARVANGQSVEGLYLTYLLSSDKMSRAEISTCITDLLLGGVDT 31778 (0) 32763 TSNTLSWALYHLAKDPVAQDRLYDEVNSVCPNHHQPTTDDLANMPFLKAVIKEVLR 32930 (2) 33004 LYPVVHQNARFISENDVILNDYWFPKK (0) TQFHLCHYSVCHDETQFKHAERFLPERWLRHSAPLSGYYQHHPYSFIPFGVGVRACVGKRVAELEMYFALTR (0) LMQSYNVQLEDDTMVVEPRTRTLLIPSKPINLRFLRRPGEQRC* CYP27A2P Tetraodon nigroviridis (freshwater pufferfish) GSCT00017845001_prot length=460 chr3:953742-956387 UCSC browser adjacent to MAP3K2 and SUMO1 frameshift and 11 aa deletion at &1 frameshift at &2 73% to CYP27A2 Fugu note in zebrafish, MAP3K2 is adjacent to CYP27C1, ERCC3 MASVAALRSAPIRPRARFPAALPRKELNLQATTRPAGLKGLEDLPGPGLG RVLSWLFAEGYAEKSHLLQAQLRSLYGPMWRWTLGPYDLVGVAAPELIAG IIQQEGRYPLRAPLPHWKDYRDLRGQAYGLHVDTGPEWSRLRSALNPKML KLQEVATFAPVVHSVVGDLLRRLEFLRSGSQDGVTVSDVASELYKFGFEG ISAILFETRLGCLDQKVDPNVQRFITAIGDMLSTSDFAYLVPRWARSLVP VWKRFVQAWDDISDVASWLIDKRIAEIEAQVAGGRSVEGLYLTHLLSSNK LSRAEIYTCITELLLGGVDTTSNTLSWALYHLAQDPGAQQRL &1 GEPTTEDLASMPFLKAVIKEVLR (2) LYPVVPSNARVITENDVILGGYWFPKG (0) TLYHLCHYTISHDASQFVHVEKFLPERWLRGGASQSRCYQHHPYSSIPFG VGVRACLGRRVAD &2 LEMHFALTR (0) LMQAYEVRLKDETEVVEAKTRTLLVPSKPINLRFLARA* CYP27A3 Fugu rubripes (pufferfish) No accession number Scaffold_6002 Length = 16767 = LGS139924.x1 57% to 27A1 I-helix = LGS125183.x1 Cyp27a1 First 16 aa and 49-81 supported by EST from AU050037 Paralichthys olivaceus aa 49 on also supported by AW343479 zebrafish EST in pufferfish 27A3 is between EAF2 and SLC15A2 these two genes are adjacent in humans but 27A3 is gone. MFRNRLLTVGLRASVPHREGLHRTAVNYAGARRRHASSAATEITEHNVR QKTMEDLGGPSFLTTLNWLFLKGYLPKTQQMQ (0) 4619 VEHSKIYGPLWKSKYGPMVVVNVASADLIEQVLRQEGRHPVRTDMPHWRRYRALRNQAHGPLTE 4428 MGAKWQRIRSILNPRMLKPQHVSSYGITINDVVTDFLEKLVWLRAKDGGGVMVNDVAGELYKFAFE GISSVLFESRMGCLNDEVPEETQKFIYSVGEMFRLSAVVVLFPQSVWPYLPLWKKFVAAWDYLFKV (1) 1891 AEQMVQKKMEEIQNKVDLHQDVEGAYLTHLLLSEKMTVTEILGSITELLLAGVDT 1715 1661 TSNTISWALYQLAQNPSIQDQLYHEVRSVCPGNKMPDSDDIAQMPYLKAVIRETLR LYPVVPGNARVTVDKEIVVGGYLFPKQ (0) 754 TLFHLCHYCVSHDENIFPNSRVFQPERWLRGREEKSKQHPFGSVPFGFGVRACLGRRV AELEMYLLLSR (0) LIRRFEVRPDPNGAEVKPITRTLLCPATPINLQFLDRGAQRAPGPAAGASL* CYP27A fragment a Fugu rubripes (pufferfish) No accession number a = exon 4 of 27A3 LPC42076.x1 39% to 27A1 202-276 79% to LPC42075.x1 gene duplication? Exon 4 GISSVLFESRMGCLNDEVPEETQKFIYSVGEMFRLSAVVVLFPQSVWPYLPLWKKFVAAWDYLFKV CYP27A fragment b Fugu rubripes (pufferfish) No accession number Similar to exon 4 of 27A3 LPC42075.x1 35% to 27A1 79% to LPC42076.x1 gene duplication? Exon 4 GISSVLFESRLGCLNDEVPEETRRVIYSVGEMCRLSAVVVLFPQSGWPYLPVWARLGAAGDYLLQF CYP27A3 Tetraodon nigroviridis (freshwater pufferfish) chr2:18232734-18236941 UCSC browser between EAF2 and SLC15A2 MLRSRLLSAGPARQGLRRAAASSAGARPRHAAPASSGVAGGQKTIDDLGGPSFLTTLNWLFVKGFLPKMQQMQ (0) VEQRKIYGPLWKSKYGPLTVVNVASADLIEQVLRQEGRHPVRTDMPHWRRYRELRNQAHGPLTE (2) MGAKWLRIRSILNPRMLKPKHVSSYASTINQVVTDFIQRVLWLGAQAGGGVMVHDVAGELYKFAFE (1) GICSVLFENRMGCLNEEVPEETQKFIFSVGEMFRLSPLVVLFPKYMWPYLPFWKQFVQAWDYLFKV (1) AEELVHKKMEEIQNKVDLQQSVEGAYLTHLLLSEKMSVTEILGSITELLLAGVDT (0) TSNTISWSLYQLAQNPDVQERLYQEVSAVCPGDQLPNSDHIAQMPYLKAVIRETLR (2) LYPVVPGNARVTVDNEIMVGDYLFPKQ (0) TLFHLCHYCVSHDESVFLNSHAFQPERWLRGTEEKSKQHPFGSVPFGFGVRACLGRRVAELEMYLLLSR (0) LIRRFEVRPDPAGTEVKPITRTLLCPAKPINLQFLDRGAGQTPRPGAGASPEPL* CYP27A3 Danio rerio (zebrafish) NP_001116749 4 aa diffs to seq on chr9:16358844-16371887 UCSC browser EST DR721514.1, DN833321.1, DT066990.1, CD283382.1, CR928406.1, CD599037.1, DT059155.1 These cover the whole seq with about 2 aa diffs 69% to CYP27A3 Fugu probable ortholog adjacent to EAF2 in both MFSNSLLRTGLFLCKQDCSGIKCLTSMFLNARTTSTLIGGDKQKTMDDLD GPSFLTSLYWLFGKGYFQTTHQMQIEHSKIYGPLWKSKYGPLVIVNVASA DLIEQVLRQEGRHPIRTDMPHWRGYRKLRNHAYGPLTEMGPEWQRIRSIL NPRMLKPKHVSNYTNAINGVVSDFIEKMAKLKTTKGNDVMVYDVAGELYK FAFEGISSVLFETRMGCLNDEIPEETQKFIFSVGEMFRLSPIVILFPKFL WPYMPFWKHFVAVWDHLFKVADELVQQKMTEIQEKVKTGSPVEGEYLTHL LISEQMSFTEVLGSITELLLAGVDTTSNTISWALYHLAREPEIQQKLHEE VVSVCPGDKTPSSDDITRMPLLKAIVRETLRLYPVVPGNARVVAENEIVV GDHLFPKNTLFHLCHFAVSYDETVFPDPFAFLPQRWIREQKQLSQHPFGS VPFGFGIRACLGRRVAELEMYLLLSRLIKRYEVRPDPSGKTVKPITRTLL VPATSIDLQFIDR CYP27A3 Oryzias latipes (medaka) chr21:6587265-6596897 UCSC browser adjacent to EAF2 MLRRPLLTATKRGASLLPRDDATKVAAAAGGAPSMLLAELRRHQTSPAVGVDKVKT LDDLGGPSFLTTLNWLFLKGYFKKTQQLQ (0) IEHRNIYGPIWKSKYGPLVVVNVASADLIEQVLRQEGKYPVRSQMPHWRSYRELRSKSHGPLTE (2) LGAKWQRIRSILNPRMLKPVHVSSYTNTLNDVVTDFIERISWLRATGDQGITVNDLTGELYKFAFE (1) GICSVLFEARMGCMNEVVPEETQKFIFSVGEMFRLSPIVVLFPKVFWPYLPAWKQFVATWDHLFSV (1) AEDLVKKKMEEIQQDVETNKEVEGAYLTHLLLSDQMTVTEILGSMTELLLAGVDT (0) TSNTISWALYHLAKEPEIQTKLYREVIDVCPGDKVPTSSDISQMPYLKAVIRETLR (2) LYPVVPSNARVTVENDIIVGGYVFQKQ (0) TLFHLCHYAVSHDESIFPDPHTFLPQRWLRGLDDKIKQHPFGSVPFGFGVRACLGRRVAELEMYLLLSR (0) LLKSYEVRPDPAGTEVKPITRTLLCPAEPINLQFLDRKA* CYP27A3 Gasterosteus aculeatus (three-spined stickleback) chrUn:42545325-42552979 UCSC browser adjacent to EAF2 note EAF2 and SLC15A2 are present in lizard but no CYP27A3 is between them (lost on the way to tetrapods) MLSRSTASLGLRLQRNATRSLLAALSPGPHGG RHASASTSSTVTAHSSPLKTMEQLGGPGLLTTLNWLFVKGYFQTTQQMQ (0) IEHSRIYGPLWKSKYGPLVVVNVASAQLIEQVLRQEGRLPVRTDMPHWRSYRELRNQAHGPLTE MGVKWQRIRSILNPRMLKPKHVSSYACTINEVVGDFVRRAAWLRETGGGGLMVNDLTAELYKFAFE GICSVLFETRMGCMNEVVPEETQKFIFSVGEMFRLSPILVLFPKSFWPYTPFWKQFVAAWDHLFKV VAEELVQQKMDEIQEKVHLDQSVEGAYLTHLLLSEQMTVTEILGSITELLLAGVDT TSNTIAWALYQLAKQPEIQEQLYQEVIGVCPGDKVPSSEDIAQMPLLKAIIRETLR MYPVVPGNARVSVESEIVVGDHLFPKN TLFHLCHYAVSYDESIFPDAHSFLPDRWLRGGEDKPRLHPFGSVPFGFGIRACLGRRVA ELEMYLLLSR LMQRFEVRPDPAGAPVKPITRTLLCPATPINLQFVDRRPSLC* CYP27A3 Osmerus mordax (Rainbow Smelt) EL545126 EST 75% to 27A3 zebrafish N-term MIGKCLLKIGLRACSQENCGIIRVTTTLNNVRDMRRGSSTAA TTVIAGNDKQKTMDDLDGPSFMTSLYWLFGKGYFQTTQQLQIEHSKIYGPLWKSKYGPLV IVNVASAELIEQVLRQEGRHPVRTDMPHWRAYRELRNQAYGPLTEMGAKWQRIRSILNPR MLKPKHVSAYTNAINEVVTDFIDKVAWLRDTRGGGVMVNDLSGELYKFAFEGICSVLFET RMGCLNEKVPEETQKFIFSVGEMFRLSPIVILFPKSVWPYLPFWKHFVVVWDHLFKIAEE LVH CYP27A4 Danio rerio (zebrafish) BX321915.3, BC055637 60% to 27A1 fugu Temp name CYP27A.d 2 aa diffs to chr9:32649613-32658960 UCSC browser 65501 MAVCFALSSAERLGIGLGFLRPTAAATGFRRAAGNAAAASVSVQDGHRK LKTEADLPEIKIFTMLYRLLFKGYLSRMHELQ (0) LYEKQLYGPLYKINVGNFQSVALNSVDLLEELLRKDEKFPSRGDMTLWTEYRDMKGIGYGPFTE EGEKWYKLRAVLNKRMLHPKDSVQYGDVVNAVITDFIKRIYYLREMSPTGDLVSNLTNKLYRFSLE GISSILFETRIGCLEKEIPAETQHFINSIAQMLTYSMPVVFMPNWTRNYLPFWQRYIGGWDGIFK FAGKMIDMKMEALQKRVDANQEVAGEYLTYLLSNGKMSSKDVYGSVSELLLAGVDT TSNTMLFALYHLSKDPETQDILYQEVTNVLKDDRIPTAQEVNTMSYLKAVIKETLR LYPVVPMNARLIAENEVVIGGHLFPKN (0) TTFTLCHYAISRDEKVFPEPQKFKPERWLRDGRTRPNPFGSIPFGFGVRGCVGRRIAELEMYLALAR LIKLFEIRPDPTVGEVRSLNRTVLAPDRKVNLHLVERKKTEA* 56151 CYP27A5 Danio rerio (zebrafish) BX321915.3 Temp name CYP27A.c 100% to chr9:32660646-32665857 59% to 27A1 fugu 54468 MAVCFALSSAERLGWCFLRPTAAATGFRSAAGNAAAASVSVQDGHRKLKTEADLPEI 54298 54297 KIFTMLYQMLFKGYLNSVHELQ 54232 (0) LYQKQVYGPLWKINAGNLQGISISSVELLEELLRKDEKYPCRGYMTLWTEHRDLRGISYGPFTE 53897 () EGEKWYKLRAVLNKRMLHPKDSVQYGDVVNAVITDFIKRIYYLREMSPTGDLVSNLTNELYRFSLE XIASILFETRIGCLEKEIPAETQKFINFIAQMFTYSMHVALLPNWTRKYFPFWQKYIDGWDGIFKF GTKMIDMKMEAIQKRVDTNQDVAGEYLTYLLSNGKISRKDVYGSVSELLLAGVDS TSNTMLWALYLLSKDPEAQETLYQEVTKVLKDDRIPTAEEVNSMPFLKAVIKETLR LYPVVPVNSRLIAETEVVIGEYFFPKK (0) TTFTLCHYVISRDEKVFPEPQKFKPERWLRDGRTRPNPFGSIPFGFGVRACVGRRIAELEMHLALAR LIKLFEMRPDPTVGEVKANFRSVLVPNKKVNLHFVERQKTET* 49254 CYP27A6 Danio rerio (zebrafish) BX321915.3 Temp name CYP27A.b 100% to chr9:32668191-32674283 (+) 61% to 27A1 fugu 46923 MAVCFALSSAERLGFGFLRPTAAATGFRRAAGNAAAASVSVQDGHRKLKTEA 46768 46767 DLPEIKIFTMLYQMLFKGYLNSIHELQ 46687 (0) 46408 IYQKQVYGPLSKINSGNLQGISISSVELLEELLRKDEKYPCRGYMTLWTEHRDLRGIGYGPFTE 46217 () EGEKWYKLRAVLNKRMLHPKDSLQYGDVVNAVITDFIKRIYYLREMSPTGDLVSNLTNELYRFSLE XIASILFETRIGCLEKEIPVETQEFINSIAQMFTYSMHVALLPNWTRNYFPFWQKYIDGWDGIFKF GTKMIDMKMEAIQKRVDTNQEVAGEYLTYLLSSGKMSSKDVYGSVSELLLAGVDT 1183 TSNTMLWALYLLSKDPAAQETLYQEVTKVLKDDRIPTAEEVNSMPFLKAVIKETLR 1000 LYPVVPVNSRLIAETEVVIGEYFFPKK (0) TTFNLCHYAISHDEKVFPEPQKFKPERWLRDGRTRPNPFGSIPFGFGVRACVGRRIAELEMHLALAR 40957 LIKLFEMRPDPTVGEVKANFRSVLVPNKKVNLHFVERQKTET* 40849 CYP27A7 Danio rerio (zebrafish) BX321915.3 Temp name CYP27A.a 100% to UCSC browser chr9 32687123-32697779 (+) strand 59% to 27A1 fugu zfishG-a34c11.p1c 28992 MAVSFALSSAERLGWCFLRPTTAATGFRRAAGNSAAASVSVQDGHRKLKTEA 27837 27836 DLPEIKIFTMLYQMLFKGYLNSVHELQ 27756 (0) 25958 LYQKQVYGPLWKINAGNLQGISITSVELLEELLRKDEKYPCRGYMTLWTEHRDLRGISYGPFTE 25767 () EGEKWYKLRAVLNKRMLHPKDSLQYGDVVNAVITDFIKRIYYLREMSPTGDLVSNLTNELYRFSLE GIASILFETRIGCLEKEIPAETQEFINSIAQMFTYNMHVALLPNWTRNYLPFWQKYIDGWDGIFKF GTKMINLKMEAIQTRLDTNQEVAGEYLTYLLSSGKMSCKDVYGSVSEVLLAGVDT TSNTMLWALYLLSKDPAAQETLHQEVTKVLKGDRIPTAEEVNSMPFLKAVIKETLR LYPVVPVNSRLIAESEVIIGEYLFPKK (0) TTFNLFHYAISHDEKVFPEPQKFKPERWLRDGRTRPNPFGSIPFGFGVRACVGRRIAELEMHLALAR LIKLFEMRPDPTVGEVKANFRSVLVPNKKVNLHFVERQKTET* 17333 CYP27A8-de4b5b Xenopus tropicalis (frog) pseudogene exons 4 and 5 only scaffold_77:1644664-1645797 (-) strand UCSC browser FETRIGCLDKQIPPETQKFIDSIGYMFKNSVYVTFLPHWTRGILPYWDRYIEGWDNIFDFGKR LIDKKMSEIQSRLDKGEEVEGEYLTYLLSSANLTMGEVYGSVCELLLAGVDT CYP27A8 Xenopus tropicalis (frog) DR832386.1 CX969640.1 DR852196.1 old CYP27A1 Xenopus best X.tropicalis hit to CYP27A1 hum in ESTdb best match in human = CYP27A1 55%, probably a CYP27A1 ortholog scaffold_77:1626758-1638467 (-) complete in UCSC browser Trace archive 570051728(+), walked to 411568263(+) 494948503(+) 54% to 27A7 zebrafish, 50% to 27A3 zebrafish, 50% to 27A1 human this seq is near BCS1L and ZNF142 like human CYP27A1 MPSASKLGFLPLGRCRWLLHTGRGVSVSQGRAVAGAAVGAVGEEKKMKTF EDLPGPSLLTNIYWVFLRGYILYTHELQAIYKKNYGPMWKST LGRYKTVNIADVDILETVLRQEGKYPMRSDMEVWKEHRRQRDLSLGPFTEEGHKWHTLRS VLNKRMLKPAEAMLYTGVVNEVVTDFLVRLEEMRSETPSGDMVNDIPNALYRFAFEGISY ILFETRIGCLEKQIPVETQRFIDSIGAMLKNSIFVTIFPPWTNNLLPYYKRYMDSWDNIF AFGNKLINEKMKKIEARLERDEEVQGEYLTYLISSGKLTDKEIYGSVAELLLAGVDTTSN TLSWALYHLAREPEIQNALYQEVIGVVPGQNIPTSEDISSMPLLRAVIKETLR LYPVVPTNSRVAVEKAITIGDYYFPKD TLIALHHYHISRDEKNFPESDKFIPQRWFRESRVKNNPFSSIPFGYGVRACVGRRI AELEMHMCLSR () IIKKYEVRPDPSGAEIKSMARIVLTPHKPINLRFLPR CYP27A9 Xenopus tropicalis (frog) AL787054.2 DT421274.1 CR427561.1 BC094536.1 note there are three adjacent CYP27A genes in Xenopus 50% to 27A1 human, 42% to 27B1 human and 38% to 27C1 human 52% to 27A5 zebrafish, 51% to 27A3 zebrafish scaffold_77:1590575-1599619, 11 aa diffs in first half exon 6 missing in genome assembly MRKGCHALLWKTCWANVQTGRE KATLGVAGAVAEQEKKLKMSTDLPGPSTLNILYWVFLRGYVFESHKLQVIWKKRYGPLW KTCIGSHRLVNVASPELLETLLRQEGKYPMRTDMFMWKEHRDLQDFSYGPLTEEGHRWHT LRRVLNQRMLKPKEAVRYTESFNDVVTDLLVVIKEITAQSPNRTTVDGVANLMYKFAFES ICTVLFETRIGCLKKEIPPETEKFINSIAIMLENQTRMEKLPRWTRGIFPYWRRFVEGWD NIFIYGKKLIDKKMEEIEGRLKRGEEVEGEYLTYLLSSGKLSMEEICGSVAELLQAGVDTTSNT LTWALYQLARNPEIQHNLHQEVIGVTPGDTIPDSEAIARMPLLRAVIKETLRLYPVVPEN GRVVTEKDVILNDYIIPKNSQFVLCHYALSRDETQFPEPDRFLPERWLR DSGMKHHPFSSIPFGYGVRACAGRRIAELEMHLALSRIIKMFQVVPDPELGEVGTKNRTV LVSSRPINLQFIER* CYP27A10 Xenopus tropicalis (frog) CX393015 DR862140.1 DN030991.1 49% to 27A1 human scaffold_77:1580461-1588306 complete in UCSC browser 53% to 27A7 zebrafish, 48% to 27A3 zebrafish MIAQRLQTGA QALLQQSCRASVQTVRKKATLGVSGATVVEGKTLKTLDDLPGPSPLKLLYWIFLRGYLFR THELQVIFRKTYGPMWKMSDRQHAMVTVASPDLLESLLRKEGKYPTRADMFIMREHRDLR GHSYGPVTEEGHQWHRIRTILNQRMLKPRETVVYAGSMNEVVSDLLLKIKELTAQSSSGT QVNGVAELMYKFAFESICTVLFETRLGCLNKEILPETQKFIDSIGIMLEHLTMLTRLPQW TKGILPYWGRYIEAWDTIFDFGKKLIDKKMEDIEGRLKRGEEVEGEYLTYLLSSGKLSME EVYGSVVELLQAGVDTTSN TLTWALYQLSRNPEIQNNLYQEVIRVIPGETIPDSEAIARMPLLKAVIKETLRL FPVVPENARMINEKEVTIKDYVFPVKTQFILGHYAISRDETTFPEADRF LPERWLRDSGMKHHPFGSIPFGYGVRACVGRRIAELEMHLALSRIIKMFQVIPDPDLGEV GAKNRAVLVANRPVNLRFIERQPRPE* CYP27A11 Petromyzon marinus (Lamprey) Contig1559:9792-24354 at UCSC browser, EST FD706879.1 adjacent to RQCD1 which is near CYP27A1 human (about 7 genes away) This is probably the ortholog of CYP27A1 51% to 27A3 zebrafish, 46% to 27A1 human, 48% to 27A5, 27A6 zebrafish MAAWWTRVTRLRPLSLLLRAAPPGFARVAGTETRSRSWTETRAETRALRWAH GSVAPVAAPVAAPRGAEHSRSPDELTGPSFFTIIYWVLCRNNNAHLLQ (0) VECKNRYGPMWKSKFGPYMVISVADADLIEQVIRQEGKYPIRFDMKNW TDYRDLRGYAYGPFTE NGSKWNQLRSVLNQKMLRPRDMVAYVKPINEVVTDLLARLHMLSASNGGMVNNLANEFYKFAFEGTEAIL FEKRLGCLEQSVPEETQVFIDAVRTVFIHSEKMALIPKWIRMILP LWKHYCNAWDVIFSF GMKEIDKKMIKISEMVARGEPVEGEYLTYLLSNGKLTEHEIHGSIAELLMAGIDT (0) TSNTMSWVLYHLSREPHLQERLHQEVEGVCPGNTIPNVQDLAKMSFLKAVIKEILR (2) LYPVVPSNSRLIVEGGTKVGGYVFPKG () TLFQLCHYAVSHNPEIFPDPEAFKPERWLRKGTMLAHPPFGSIPFGFGVRACVGRRVAELEMYLTLSR () LIKEFEVQWDPSQKDILPKTSTVIIPSESINLKFIARK 27B Subfamily CYP27B1 rat GenEMBL AF000139 St-Arnaud,R., Messerlian,S., Moir,J.M., Omdahl,J.L. and Glorieux,F.H. The 25-hydroxyvitamin D 1-alpha-hydroxylase gene maps to the pseudovitamin D-deficiency rickets (PDDR) disease locus J. Bone Miner. Res. 12, 1552-1559 (1997) previously named CYP40. This sequence has errors in it. CYP27B1 rat GenEMBL AB001992 Cloning and expression of rat 25-hydroxyvitamin D3-1-alpha- hydroxylase cDNA . Shinki, T., Shimada, H., Wakino, S., Anazawa, H., Hayashi, M., Saruta, T., DeLuca, H.F. and Suda, T. Proc. Natl. Acad. Sci. USA 94, 12920-12925 (1997) Note: this sequence has been called CYP27B1 in this paper. The name CYP40 was given in May 1997 based on the sequence from John Omdahl that was not completely accurate at the time of submission for a name. It was necessary to revise the name CYP40 to CYP27B1. CYP27B1 mouse GenEMBL AB006034 Takeyama, K-i., Kitanaka, S., Sato, T., Kobori, M., Yanagisawa, J. and Kato, S. 25-hydroxyvitamin D3 1 alpha hydroxylase and vitamin D synthesis. Science 227, 1827-1830 (1997) Cyp27b1 mouse GenEMBL AB006034 AC084293.2 Mm.6216 LEADSPNTDMTQAVKLASRVFHRIHLPLQLDASLGSRGSESVLR SLSDIPGPSTLSFLAELFCKGGLSRLHELQVHGAARYGPIWSGSFGTLRTVYVADPTL VEQLLRQESHCPERCSFSSWAEHRRRHQRACGLLTADGEEWQRLRSLLAPLLLRPQAA AGYAGTLDNVVRDLVRRLRRQRGRGSGLPGLVLDVAGEFYKFGLESIGAVLLGSRLGC LEAEVPPDTETFIHAVGSVFVSTLLTMAMPNWLHHLIPGPWARLCRDWDQMFAFAQRH VELREGEAAMRNQGKPEEDMPSGHHLTHFLFREKVSVQSIVGNVTELLLAGVDTVSNT LSWTLYELSRHPDVQTALHSEITAGTRGSCAHPHGTALSQLPLLKAVIKEVLRLYPVV PGNSRVPDRDIRVGNYVIPQDTLVSLCHYATSRDPTQFPDPNSFNPARWLGEGPTPHP FASLPFGFGKRSCIGRRLAELELQMALSQILTHFEVLPEPGALPIKPMTRTVLVPERS INLQFVDR The mouse and probably rat have lost 27C1 due to a chromosome rearrangement between BIN1 and ERCC3 It seems that 27C1 was broken and lost in this event. In fugu CYP27C1 is on the minus strand of scaffold 106 from 31916-36680 the neigboring gene, 2624 bp away, is ercc3 at 39305-43553 minus strand (also found in human 39kb away) Therefore, this linkage is very old. The next gene in the series ERCC3, CYP27C1 is BIN1 in human. In mouse chr 1 is syntenic to human chr 2 at ERCC3, but BIN1 is on chr 18, implying a chromosome break. CYP27B1 human GenEMBL AB005989 cDNA sequence Takeyama,K., Kitanaka,S., Sato,T., Kobori,M., Yanagisawa,J. and Kato,S. 25-Hydroxyvitamin D3 1alpha-hydroxylase and vitamin D synthesis Science 277, 1827-1830 (1997) CYP27B1 human GenEMBL AB005990 gene sequence Murayama,A., Kitanaka,S., Takeyama,K. and Kato,S. Human 25-hydroxyvitamin D3 1alpha-hydroxylase gene Unpublished (1997) CYP27B1 human GenEMBL AB005038 cDNA sequence AB006987 gene sequence Monkawa,T., Yoshida,T., Wakino,S., Shinki,T., Anazawa,H., Deluca,H.F., Suda,T., Hayashi,M. and Saruta,T. Molecular cloning of cDNA and genomic DNA for human 25-hydroxyvitamin D3 1alpha-hydroxylase Biochem. Biophys. Res. Commun. 239, 527-533 (1997) CYP27B1 human GenEMBL AF020192 cDNA sequence Fu,G.K., Lin,D., Zhang,M.Y., Bikle,D.D., Shackleton,C.H., Miller,W.L. and Portale,A.A. Cloning of human 25-hydroxyvitamin D-1alpha-hydroxylase and mutations causing vitamin D-dependent rickets type 1 Mol. Endocrinol. 11, 1961-1970 (1997) CYP27B1 human GenEMBL AF0027152 gene sequence Fu,G.K., Portale,A.A. and Miller,W.L. Complete structure of the human gene for the vitamin D 1alpha- hydroxylase, P450c1alpha. DNA Cell Biol. 16, 1499-1507 (1997) CYP27B1 human NM_000785 MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPS TPSFLAELFCKGGLSRLHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEG PRPERCSFSPWTEHRRCRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNN VVCDLVRRLRRQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDT ETFIRAVGSVFVSTLLTMAMPHWLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAA MRNGGQPEKDLESGAHLTHFLFREELPAQSILGNVTELLLAGVDTVSNTLSWALYELS RHPEVQTALHSEITAALSPGSSAYPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPD KDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPTPHPFASLPFGF GKRSCMGRRLAELELQMALAQ ILTHFEVQPEPGAAPVRPKTRTVLVPERSINLQFLDR CYP27B1 Bos taurus (cow) See cattle page for details MTQTLKFASRVFHRVRCPELGASLGSRGSESAPRVLADIPGPSTPGFLAELFCKGGLSRLHELQ (0) VQGAARFGPVWLASFGTVRTVYLAAPTLVEQLLRQEGPRPERCSFSPWTEHRRRRQRACGLLTA (2) EGEEWQRLRSLLAPLLLRPQAAARYAGTLHGVVRDLVRRLRRQRGLGAGPPSLVRDVAGEFYKFGLE (1) GIAAVLLGSRLGCLEAEVPPDTETFIRAVGSVFVSTLLTMAMPSWLHRVVPGPWDRLCRDWDQMFAF (1) AQQHVEQREAEVAMRNQSEKSEEDMGPGAHLTYFLLQKELPAASILGNVTELLLAGVDT (0) VSNTLSWALYELSRHPEIQTALHAEITAALGPGSSTQPSATALSQLPLLKAVVKEVLR (2) LYPVVPGNSRVPDRDICVGEYIIPKN (0) TLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPAPHPFASLPFGFGKRSCVGRRLAELELQMALAQ (0) ILIHFEVQPEPGSAPVRPMTRTVLVPERSINLQFVDR* CYP27B1 pig DQ295065 MTQTLKLASRVFHRVCRAPELGSRGSDSAPRGLADLPGPSTPGF LAELFCKGGLSRLHELQVQGAARFGPVWLASFGKVRTVYVAAPTLVEQLLRQEGPLPE RCSFSPWTEHRRRRQRSCGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLHDVVQD LVRRLRSQRGLGAGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEPEVPPDTETFI RAVGSVFVSTLLTMAMPSWLHRLVPGPWARLCRDWDQMFAFAQEHVERREAEAAMKSQ GKPEEDLGSGAHLTYFLFREELPAPSILGNVTELLLAGVDTVSNTLSWALYELSRHPE VQMALHSEIKTALGPSSSAHPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIC VGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGECPAPHPFASLPFGFGKRS CMGRRLAELELQMALAQILIHFEVQPEPGSAPIRPMTRTVLVPERSINLQFVDR CYP27B1 Xenopus tropicalis NM_001006906.1 scaffold_67:1250110-1259521 in UCSC browser this gene is between MARCH9 and METTL1 LIKE human 27B1 54% to human 63% to 27B1 zebrafish, 48% to 27A3 zebrafish, 54% to 27B1 human MAQTLKLGSSRSSQLFRGLQELWAETVLKNSEKVIKGHKSLADM PGPSTVSFISDLFCRRGLARLHELQLEGKAKFGPVWKASFGPILTVHVAEPSLIEQVL RQEGKHPIRSDLSSWKDYRQCRGHSYGLLTAEGEEWQQFRSILGKHMLKPKEVEAYSD VLNDVVGDLIKKINYQRSQNQNNVVKDIAKEFYMFGLEGISSVLFESRIGCLEPTVPK ETEKFIQSINTMFVMTLLTMAMPKFLHKIFRKPWQKFCESWDYMFAFAKGHIDKRMKD VAQKLAQGEKVEGKYLTYYLAQEKIPMKSIYGNVTELLLAGVDTISSTLSWSLYELAQ HPDIQSAVYSEVEEILQGKQIPSPSDVARMPLLKAVVKEVLRLYPVIPGNARVVADRD IQVGDYIIPKKTLITLCHYATSRDENVFSNPNEFQPDRWLKKEDTHHPYASLPFGFGK RSCIGRRIAELEVYLALARILSHFEVKPEQPGSLVMPMTRTLLVPEKEINLQFLER CYP27B1 Danio rerio (zebrafish) chr11:1675158-1683318 72% TO 27B1 FUGU 568 MMMVLHQALKVTGRSALPLLRFAERWADVIRAPPTPQVKTLEQMPGPSPARFIRDLFMKRGFS 380 379 RLHQLQ 362 () 337 LEGRQKYGPMWKASFGPILTVHVAEPELIQQVLRQEGQHPMRSELSSWKDYRALRGEG 164 163 YGLLT 149 () 158 SEGEEWQCVRSLLSKHMLRPQAVEAYDGALNAVVSDLLQKLKLRSQESSSRIVSDISAEF 337 338 YRFGLEG 358 () 281 ISSVLFESRIGCLDAVVPVETERFIQSINTMFVMTLLTMAMPQWLHRLLPKPWDTFC 451 452 RCWDVMFEF 478 () 46 AKGHIDQRLQQEKQKLECGEQLEGRYLTYFLSQAGLPLTSVYSNVTELLLAGVDT 210 592 ISSTLSWSLYELSRHPDVQTALRXEVLSVMKGRRSVPQASDVAAMPLLKAVVKEIL 425 () 345 RLYPVIPANARVINKDIEVGGYVIPKN 265 () 84579 TLITLCHYATSRDP 84620 84620 QQFRDPDSFRPQRWGDRSDRSHPYATVPFGVGKRSCIGRRIAELEVYLALSR 84775 85167 ILMHFTVEPARQDDTVHPMTRTLLVPERQIDLRFTER* 85280 CYP27B1 Fugu rubripes (pufferfish) No accession number Scaffold_470 52% to 27B1 human MLQQALRVSCRSASPLVKWMERWAECASARPQAVKPLGDMPGPSVASFAWDLFAKRGLSRLHELQ (0) LEGVRRYGPMWKASFGPILTVHVADPALIEQVLRKEGQHPMRSDLSSWKDYRRLRGHHYGLLTS (2) 51430 EGEEWQSIRSLLGKHMLRPKAVEAYDQTLNSVVDDLITKLRLRRSSQGLVTDIASEFYRFGLE 51630 (1) 51726 GVSSVLFESRIGCLDKIVPEETERFIQCINTMFVMTLLTMAMPSWMHQLFPKPWNVFCQCWDYMFDF (1) AKGHIDQRMAAEAEKIARGEEVEGRYLTYFLSRTSLPMKTVYSNVTELLLAGVDT (0) 52280 ISSTLSWSLYELSRHQAVQASLREEVLSVLGGRRVPTAADVAQMPLLKATIKEVLR 52444 (2) 52527 LYPVIPANARVITERDIQVGGYLIPKN 52610 (0) 52697 TLITLCHYATSRDPAVFPRPDEFLPQRWLNKEQSHHPYASVPFGVGKRSCIGRRIAELELYLAVAR 52894 (0) 53237 ILLEFDIKPDPEGISVKPMTRTLLVPENVINLQFTER* 53347 CYP27C1 human GenEMBL AC027142 BM562765 BI459427 ESTs 43% identical to 27A1 assembled gene intron starting with QIH ending in VDT is from Celera's data CRA_Gene|hCG42613 /len=10487. This Celera sequence is still missing the C-terminal. Probable last exon is now found in AC027142. AG Intron boundary is in the same location as CYP26B1. Stop codon is one codon away from 26B1s stop codon. Length is preserved from cys to intron. (n) = intron phase, 9 exons 1 85452 MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPPG 85634 61 62 85635 GGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHEIQ (0) 85748 99 100 39574 QKHTREYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA (2) 39371 163 164 43984 EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSME (1) 43787 229 230 41743 GVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFC 41564 290 291 41563 RSWDGLFKFS 41534 300 (1) 301 QIHVDNKLRDIQYQMDRGRRVSGGLLTYLFLSQALTLQEIYANVTEMLLAGVDT (0) 354 (Celera sequence) 355 110201 TSFTLSWTVYLLARHPEVQQTVYREIVKNLGERHVPTAADVPKVPLVRALLKETLR (2) 110034 410 411 108566 LFPVLPGNGRVTQEDLVIGGYLIPKG (0) 108489 436 437 108006 TQLALCHYATSYQDENFPRAKEFRPERWLRKGDLDRVDNFGSIPFGHGVRSCIGRRIAELEIHLVVIQ (0) 107794 504 505 102503 LLQHFEIKTSSQTNAVHAKTHGLLTPGGPIHVRFVNRK* 102619 542 The mouse and probably rat have lost 27C1 due to a chromosome rearrangement between BIN1 and ERCC3 It seems that 27C1 was broken and lost in this event. In fugu CYP27C1 is on the minus strand of scaffold 106 from 31916-36680 the neigboring gene, 2624 bp away, is ercc3 at 39305-43553 minus strand (also found in human 39kb away) Therefore, this linkage is very old. The next gene in the series ERCC3, CYP27C1 is BIN1 in human. In mouse chr 1 is syntenic to human chr 2 at ERCC3, but BIN1 is on chr 18, implying a chromosome break. CYP27C1 Bos taurus (cow) See cattle page for details MALLARILKARPSPAAGRTGLLGGWDAGQPRLASSGLSARARAEDKGAGRPGASQAGGR AEGPRSLAAMPGPRTLANLVEFFGKDGFSRIHEIQ 297 QKHTREYGKIFKPHFGPQFVVSVADRDLVAQVLRAEGASPQRANMGSWQEYRDLRXRS 476 477 TGLISA 494 EGEQWLKMRSVLRQRILKPKDVAIFAGEINQVIADLIKRIYFLKSQAEDGDTVTNINDLFFKYSME GVATILYESRLGCLGNSIPQLTADYIEALALMFCTFKTSMYAGAIPRWLRLLIPKPWQEFCRSWDGLFEFS QIHVDNKLRDIRCQMERGERVRGGLLTYLFLSQELTLEEIYANMTEMLLAGVDT TSFTLSWAVYLLARHPEVQQALYREIVRNLGERHVPTAADVPKVPLVRALLKETLR LFPVLPGNGRVTQEDLIVGGYLIPRG TQLALCHYATSYEDENFPRAKEFRPERWLRQGNLRRVDNFGSIPFGYGARSCIGRRIAELEIHLLVIQ LLQHFEIKTSPWTKTVHAKTHGLLMPGEPIHVRFVNRK CYP27C1 Gallus gallus (chicken) GenEMBL BU235126 BU209688 Partial seq missing N and C-terminals 78% to Silurana 27C1 82% to human 27C1 PGLSGSQSLEVRSGAENKAARPGELLEPSPQLGRV KSLHEMPGPNTLYNLYEFFWKDGFGRIHE IQQKHTQEYGKIFKSHFGPQFVVSIADRDMVAQVLRSEGRAPQRANMESWQEYRDLRGR ATGLISAEGEQWLKMRSVLRQKILKPKDVAVYSGGVNEVITDLIKRIYTLRSQEEDGETV TNVNNLFFKYSMEGVATILYECRLGCLENNVPQQTVEYIEALELMFSMFKTTMYAGAIPR WLRPFIPKPWREFCRSWDGLFKFSQIHVDNKLKSIQS QLDQGEEVNGGLLTYLLVSKELTLEEIYA NMTEMLLAGVDTTSFTLSWAIYMLAKHPEVQQRVYEEIINKLGKDQAPVNRDVPKLPLIR AVLKETLR CYP27C1 Silurana tropicalis (frog) GenEMBL BQ392731 AL629634 AL595312 74% to human 27C1 not counting the divergent N-terminal 87% to Xenopus 27C1 MAALGQLLRGSARLEGLARSFHRFPGAQAAGQALEHEQAEGVLGATVKGSPMVKN LKEMPGPSTMANLVEFFWRDGFGRIQEIQQKHARQYGRIFKSHFGPQFVVSIADKDMVAQ VLRAERDAPQRANMESWHEYRELRGRSTGLISAEGEKWLNMRSVLRQKILRPRDVAMYSG GVNEVVEDLVKRIRKLRVQESDGLTVTNVNDLYFKYSMEAIATILYECRLGCLDDQIPQQ TKEYIEALELMFSMFKTTMYAGAIPKWLRPLIPKPWREF CRSWDGLFKFSQIHVDDRLRQIESQLEKGEEVQGGVLTHLLLSKELDLEEIYA NMTEMLLAGVDTTSFTLSWATYLLAKNPGIQEAVYQQIVQNFGKDQVPTAEDVPKMPLVRAVVKETL RLFPVLPGNGRVTQDDLVVGGYFIPKGTQLALCHYSTSYDAECFPAAEEFRPERWIRSGN LERKENFGSIPFGYGIRSCIGRRVAELEMHLLLIQLLQ NLEIKPSPQTTTVLPKTHGLLCPGGKINVRFVDRQ* CYP27C1 Xenopus laevis (African clawed frog) GenEMBL BJ086834 CA982419 BQ385474 BJ069739 87% to Silurana 27C1 MAVLGHLLKGSARLEGLARGFHQFPKIQAAGQALEQEQAEGEL GARAKEAPMM KSLKDMPGPSTLANLVEFFWRDGFGRIHEIQQKHTRQYGRIFKSHFGPQF VVSIADKDLVAQVIRAERDAPQRANMESWHEYRELRGRSTGLISAEGEKWLNMRSVLRQK ILRPRDVAMYTGGVNEVIGDLVKKIHKLRAQESDGLTVTNVNDLYFKYSMEAIATVLYEC RLGCLDDIIPQQTKEYIAALELMFSMFKTTMYAGGLPKWLRHHNFPNPWKEF 53aa gap see Silurana seq HMTEMLLAGVDTTSFTLSWATYLLAKNPSIQESVYQQIVQNLGKDQVPTAEDVTKIPMVR AVVKETLRLFPVLPGNGRVTQDDLVLDGYLIPKGTQLALCHYSTSYDDKCFPGAEEFKPE RWIRSGYLERKENFGSIPFGNGIRSCIGRRVAELEIHLLLIQL LQNFEIKLSPETMTVLPKTHGLLCPGEKINVRFLDRQ* CYP27C1 Xenopus tropicalis scaffold_893:261736-267439 exon 3 missing in a gap and last half not on this scaffold N-term adjacent to ERCC3 as in zebrafish and human 27C1 scaffold_4119:30-1794 has second half of gene 100% match NM_001011341.1 short at N-term (58 aa) CX494774.2 68% to human from refseq database 70% to 27C1 zebrafish MAALGQLLRGSARLEGLARSFHRFPGAQAAGQALEHEQAEGVLGATVKGSPMVKNLKE MPGPSTMANLVEFFWRDGFGRIQEIQQKHARQYGRIFKSHFGPQ FVVSIADKDMVAQVLRAERDAPQRANMESWHEYRELRGRSTGLISAEGEKWLNMRSVL RQKILRPRDVAMYSGGVNEVVEDLVKRIRKLRVQESDGLTVTNVNDLYFKYSMEAIAT ILYECRLGCLDDQIPQQTKEYIEALELMFSMFKTTMYAGAIPKWLRPLIPKPWREFCR SWDGLFKFSQIHVDDRLRQIESQLEKGEEVQGGVLTHLLLSKELDLEEIYANMTEMLL AGVDTTSFTLSWATYLLAKNPGIQEAVYQQIVQNFGKDQVPTAEDVPKMPLVRAVVKE TLRLFPVLPGNGRVTQDDLVVGGYFIPKGTQLALCHYSTSYDAECFPAAEEFRPERWI RSGNLERKENFGSIPFGYGIRSCIGRRVAELEMHLLLIQLLQNFEIKPSPQTTTVLPK THGLLCPGGKINVRFVDRQ CYP27C1 Danio rerio (zebrafish) ctg14633 zebrafish 70% TO FUGU CYP27C1 chr6:7837526-7850296 MARKNLLQESCRQLLIQTHGLHKSVASGSLEIAAHSQADLKEESAVSPAEEVQ KAARVKSLKEMPGPSTVANLLEFFYRDGFSRIHEIQ 46230 MEHAKKYGKIFKSRFGPQFVVSIADRDMVAQV 48886 48887 LRSESATPQRGNMESWKEYRDLRGRSTGLISA 48982 EGDEWLKMRSVLRQLIMRPRDVAVFSSDVT 50683 50683 DVVADLVKRVKTLRSQQDDSQTVLNINDLFFKYAME 50790 GVATILYETRLGCLENEIPKMSQEYITALHLMFSSFKTTMYAGAIPKW 52654 52655 LRPIIPKPWEEFCSSWDGLFKFS 52723 52813 IHVDKRLSEIKKQMEKSEEIKGGLLTHMLVTREMNLEEIYANMTEMLLAGVDT 52971 55090 TSFTLSWSTYLLARHPTIQQQIFEEVDRVLGGRVPTGEDVPYLPLIRGLVKETLR 55254 () LFPVLPGNGRVTHDDLIVGGYLIPKG 55452 () 58126 TQLALCHYSTSMDEENFPRPEEFRPDRWIRKDASDRVDNFGSIPFGYGIRSCVGRRIAEL 58305 58306 EMHLALTQ (0) LLQNFHIEVSPQTTEVHAKTHGLLCPGASINLRFTDRK* 58539 CYP27C1 Fugu rubripes (pufferfish) chrUn:4630123-4635029 UCSC browser Scaffold_1410 75% to 27C1 human adjacent to ERCC3 MSVMNKLTTTCWTNFYGDRRNKQMLFVLRCLHKSATSGTFGVAREEPLPERLITSTDATKKRLPKT 19233 LAEMPGPGTISNLFEFFWRDGFSRIHEIQ 19319 (0) 20076 IEHSKMYGKIFKSRFGPQLVVSVADRDLVAEVLRAE GVAPQRANMESWHEYRDMRRRSTGLISA 20261 (2) EGEDWLRMRSVLRQLIMRPRDVAVFSDDVSEVVDEVVDDLIKR IVCLRSQSSDGTTICNINDLFFKYAME (1) 20741 GIAAILYECRLGCLSQKIPQETEDYIDALHLMFSSFKTTMYAGAIPKWLRPV 20911 20912 FPKPWEEFCDSWDGLFRF 20968 (1) SVHVDKRLKQIESQLQRGEKVTGGLLTYMLVAKEMSVEEIYANVTEMLLAGvdt (0) 23004 TSFTLSWASYLLARHPDVQQQIHAEVMRVLGSEKVATAEDVQHLPFIRGLVKETLR 23183 (2) LFPVLPGNGRITQDDMVLGGYFIPKG 23623 TQLALCHYSTSLDDENFPSSLEFRPDRWIRKHSSDRLDNFGSIPFGYGIRSCIGKRI 23793 23794 AELEMHLALIR (0) 23998 IIQKFHVCVSPLTTDVKAKTHGLLCPGAPINLQFIDREI* 24117 CYP27C1 Tetraodon nigroviridis chrUn_random:116718046-116719086 UCSC browser N-term exons 1 and 2 only, adjacent to ERCC3 63% to Fugu CYP27C1 rest of gene in a sequence gap MSVMNKLTTVCWRNFQSDCHHRQMRFALRCLHKSATSGAFGAAREEAVPERLITPKDAAQKK SGKSLAEMPGPGTVSNLIEFFWRDGFSRIHEIQ (0) MEHSKMYGKIFKSRFGPQLVVSVADRDLVAEVLRAEGVAPQRGNME SWHEYRDLRGRSTGLISA (2) CYP27C1 mouse and rat Mouse is missing this sequence in blast searches. In humans it lies between BIN1 and ERCC3 on chr 2. This region does not contain 27C1 in mouse (chr 18) unless there is a sequence gap. Blast of the rat genome (Rat Genome Sequencing Consortium Assembly Posted date: Dec 4, 2002) also had no hits, so 27C1 may have been lost in rodent evolution. It is present in human, Fugu, Xenopus, Silurana (another frog) and chicken. The absence of 27C1 may be a useful synapomorphy for comparing rodents to other mammals. CYP27D1 Petromyzon marinus (Lamprey) Contig17808:3625-10857 (-) strand missing N-term exon in a sequence gap This sequence is more like CYP27A3 or CYP27B1 47% to CYP27A3 Fugu IIHHKRFGSMWTSYTGREMFVCLADPDLIEQLMRQEGPYPVRVIMSAWDDYLMMRGHSKGLLTQ (2) TGADWKRMREVLNQRMLKPREVMQFADPLNAVVTELMGHVRARRGNGAATAVFPLPDVAQLLYRFAFE () GVTTVLFETRMGSLEETVPKETQDFIRAIESMLSTTVHVMALPRWTRPLLPFWGRFVQGWD GSITAATRLINVKLAEVKDCISRGQEVRGEYLTYLLASDRLSLPEIYSNTTDILLAGVDT () TSNTLSWSLYHLAQMPALQEALCREVNAVVPGDAVPGIDEVARMPLLKAVIKETLR () LYPVVFQNARCIVDDGVVLGGYSLPKG (0) TNLSLCQYVLSMDGDVFPNPREFQPQRWLRGKDRAAWRHGSGGEGEEGAAAADDAATSEK THPFAYIPFGFGLRGCVGRRIAELEMHLCLAR (0) FVKTFEMMPEPGAKKVLPKTRGVLVPSTPINLLLRERL Traces from next contig upstream All of these are in a long repeat sequence Cannot find a mate pair seq that is outside of the repeat. 1193719280, 1255501914, 1472630360, 1470299497, 1467953899, 1466704577, 1443851513, 1447257494, 1196739470, 1266761373, 1466575034 1466508070, 1266008803, 1193867804, 1443531934, 1468578376 Similar gene exon 2, trace 1467681227 Petromyzon DLNIRRNSYDHLEQLKLQCGGILTSYTGREMFVCLADPYLIEQLMRQEGPYPVRVI MSAWHDYLMMRGHSKGLLTH (mate = 1468976488 exon 5) TSNTLS*SLYH AQMPALQEAWCREVNAVV PGDAAPGIDEVARMTLLKAVIKETLR Vertebrates CYP27E1 Branchiostoma floridae (Amphioxus) 40% to 27B1 Fugu, 37% to 27C1 fugu, 40% to 27A1 fugu (but not first exon) 35% to 11A1 fugu, 34% to CYP24 fugu (Best match to CYP27B) MAQQILRNSSVCSLVRPNSRALVSVAPAATVQQNRPLKEMPGPTNKLGQLWWGFKNRSRMHEAQ (0) LEQERKYGRMWQSSFGFNPNVNVAHVALAEQLMRQEGKYPKRIEVNFMQQYRDLRGYSYGLLNQ (2) NGPEWRHLRTAVSKRIMRPKEVPR (2) YGDSMNEVVTDMIDRFKDLRDTTGGGKTVPDLTNELYKWAME (1) SIATVLFDTRLGCLEREMPEKTQQFIDSIATMFRTAFLVSALKPWMLTYLGLGVWKRHVEAWDVIFSV(1) AHENIDRKVLDIDARLSRGEDLVGSFLTYMLTGTDVTKKDLYATVTELLLAGVDT (0) TSNTMVWTLYELARHPELQERLHQEVTSVVSPGQIPTVDDVKNMALLKNVIKEILR (2) VYPVLPANGRVLDKDIVLDGYNIPKG (0) TQFAILHYNMTRDPEVFEEPDRFNPDRWTRMGTEKVNTFSSVPFGFGPRQCA (1) GRRLAEMEMYLVLAR (0) LVQTFEVRQLTPGEVVRPVTRALLVPGDPVHLEFIDRP* CYP27F1 Strongylocentrotus purpuratus XP_790546.2 CX682858.1 EST CYP27B like 40% to 27A in Xenopus Missing C-terminal 1 MATRLSRFLA RARPSRSSGV RPQTFHDAHA SSCPLAAVPS TLSPHASTTT PSTPSTTKTS 61 TTSTTSAPSS ISDLPSNNNL SLGVSNGCTF DQAPTSINPS ERNCTIFDDV KSFKDIPGPK 121 GWPILGTLGT YLSGKGLERI YDHQIDFTKK YGPIWKERLG TLEFVNLAKP DLVEVMYRND 181 SRYPRRLDMK PWKVYRAHRD ESLGVLTHEG KEWHRARRTI GRKIMPPREV AKFTGVINEI 241 VTDMVERLRF VRDTKGEGDG VVPELQNEMY KWSKGSIFKL LLETRIGCXK EPIPEKSQGF 301 INAVGQMFAS GQIMFLTSPE WFRRWNLKPW RDHEAAWDYI FNFAKKSVDA RLQELATLAE 361 EGDVVEDSGI LTHLLASQQL SMKEVYSNAC ELLLAGVDTT SNTLAWALYE LSRHPDTQDR 421 LAKEVEGALA AKGRCIPEHD DLPNMPLLKG VIKETLR LYPVVPANNRVLDKDVTIGGYHIPKG 28A Subfamily CYP28A1 Drosophila mettleri GenEMBL U89746 (1895bp) Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C. Molecular cloning of a family of xenobiotic-inducible drosophilid cytochrome p450s: evidence for involvement in host-plant allelochemical resistance. Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997) note: 369mt, full length, one of 109 seqs submitted to Nomenclature committee CYP28A2 Drosophila mettleri GenEMBL U89747 (1764bp) Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C. Molecular cloning of a family of xenobiotic-inducible drosophilid cytochrome p450s: evidence for involvement in host-plant allelochemical resistance. Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997) note: 43mt, full length, one of 109 seqs submitted to Nomenclature committee CYP28A3 Drosophila nigrospiracula (a desert dwelling Drosophilid) GenEMBL U91565 (451bp) Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C. Molecular cloning of a family of xenobiotic-inducible drosophilid cytochrome p450s: evidence for involvement in host-plant allelochemical resistance. Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997) 54 amino acids CYP28A4 Drosophila hydei (a desert dwelling Drosophilid) GenEMBL U91566 (366bp) Danielson,P.B., MacIntyre,R.J. and Fogleman,J.C. Molecular cloning of a family of xenobiotic-inducible drosophilid cytochrome p450s: evidence for involvement in host-plant allelochemical resistance. Proc. Natl. Acad. Sci. U.S.A. 94, 10797-10802 (1997) 54 amino acids Cyp28a5 Drosophila melanogaster GenEMBL AC018242 6065-8082 also AC001660 Cyp28a5 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP28B1 Musca domestica (housefly) no accession number Nannan Liu 44% identical to 28A1 over 498 amino acids submitted to nomenclature committee 7/29/99 Cyp28c1 Drosophila melanogaster GenEMBL AC014191 comp(8254-10002) also AL133495 Cyp28c1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp28d1 Drosophila melanogaster GenEMBL AC017780 comp(28385-30262) also AC009355 AC008324 AC008327 Cyp28d1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp28d2 Drosophila melanogaster GenEMBL AC017780 comp(31545-33456) AC008324 CYP28E1 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee March 17, 2005 Clone name Lc_CYP28D 48% to 28D1 (148 aa) 29A Subfamily CYP29A1 Caenorhabditis elegans (nematode worm) GenEMBL Z69787 and AL022276 (Y102F5) C44C10.2 (with modifications) note: GeneFinder translation is incorrect in some regions. First exon probably ends at 11181 coding for 35 amino acids as does the first exon of CYP29A2. There is probably a sequence error in the stop codon at 11155-11157. 2nd exon probably runs from 11227-11293. 11400-11518 is the third exon. 12465 to 12979 should be one exon. There is probably a sequence error in the stop codon at 12879-12881. some segments around I-helix missed by genefinder. N-terminal of genefinder translation is probably a different gene. Matches F23A7.3, but this gene does not have a P450 sequence downstream. CYP29A2 C. elegans GenEMBL Z74043 T19B10.1 nearly the whole gene of wEST00713, CEMSH91R, missing 5' end CYP29A2 C. elegans GenEMBL Z74040 T19B10.1 5' end of gene from Z74043 CYP29A3 C. elegans no accession number Y108G3 contig 541 CYP29A4 C. elegans GenEMBL Z99102 B0331.1 30A Subfamily CYP30 Mercenaria mercenaria (northern quahog, a clam) GenEMBL AF014795 (1628bp) Brown, D., Clark,G.C. and Van Beneden,R.J. A novel cytochrome P450 from the clam, Mercenaria mercenaria Unpublished 31A Subfamily CYP31A1P C. elegans GenEMBL Z68213 C01F6.3 missing 25 amino acids at C-helix plus in frame stop codon at ERK* CYP31A2 C. elegans GenEMBL Z68336 and Z92789 F22B3 and H02I12 This gene is definitely different than CYP31A3, there are 8 amino acids differences and the introns are divergent. CYP31A3 C. elegans no accession number T16C6 this sequence revised according to Y17G9.contig 61 cosmid T16C6 lies 3' of E04A4 and 5' of R11E3. Y17G9 contig 61 covers this region. E04A4 ends at 29981 of Y17G9 contig 61 R11E3 starts at 49037 of Y17G9 contig 61 this sequence lies between 41545-45037 CYP31A4P C. elegans no accession number Pseudogene related to CYP31A sequences 44849-45028 of Y17G9.contig61 C-TERMINAL exon fragment, This sequence is inside the last intron of CYP31A3 CYP31A5P C. elegans CYP31B1 Ips paraconfusus (California fivespined ips, bark beetle) No accession number Dezene Huber Submitted to nomenclature committee May 19, 2005 54% to CYP31A2 C. elegans over 456aa missing 5' end (no signal peptide) possible nematode contamination, not an insect-like sequence 32A Subfamily CYP32A1 C. elegans GenEMBL U53148 C26F1.2 CYP32B1 C. elegans 33A Subfamily CYP33A1 C. elegans GenEMBL U55365 C12D5.7 33B Subfamily CYP33B1 C. elegans GenEMBL U50311 C25E10.2 33C Subfamily CYP33C1 Caenorhabditis elegans (nematode worm) GenEMBL AF039053 C45H4.a near 35k CYP33C2 Caenorhabditis elegans (nematode worm) GenEMBL AF039053 C45H4.b near 38k also on Y10C10 contig 60. Probable end of 33C2 gene is on contig 57, but the intron ends of these two contigs do not overlap yet. CYP33C3 Caenorhabditis elegans (nematode worm) GenEMBL AF016676 F41B5.b 5k to 7k CYP33C3 Caenorhabditis elegans (nematode worm) GenEMBL D35162 EST 5 prime read with heme signature 3 prime fragment of this clone yk18a10 = D32508 (not in coding region) this is a portion of clone YK824 CYP33C4 Caenorhabditis elegans (nematode worm) GenEMBL AF016438 F44C8 whole gene of EST CEL10E1 CYP33C4 Caenorhabditis elegans GenEMBL M88882 (501bp) EST CEL10E1 2nd frame contains PPGP and KKYG from N-terminal region of many P450s. CYP33C5 Caenorhabditis elegans (nematode worm) GenEMBL AF016676 F41B5.c CYP33C6 Caenorhabditis elegans (nematode worm) GenEMBL AF016676 F41B5.e CYP33C7 Caenorhabditis elegans (nematode worm) GenEMBL AF016676 F41B5.d CYP33C8 Caenorhabditis elegans (nematode worm) GenEMBL AF003385 R08F11 CYP33C9 Caenorhabditis elegans (nematode worm) GenEMBL AF016449 C50H11 CYP33C10P Caenorhabditis elegans (nematode worm) GenEMBL AF016676 F41B5.a contains first two exons split by 3700bp and no C-terminal sequence. CYP33C11 Caenorhabditis elegans (nematode worm) no accession number Y49C4 contig 103 CYP32C12P C. elegans 33D Subfamily CYP33D1 Caenorhabditis elegans (nematode worm) GenEMBL Z92804 K05D4.4 (previously found on F10A3 and F11A5 at early stages of sequencing) CYP33D2X Caenorhabditis elegans (nematode worm) GenEMBL Z92830 F11A5 this sequence is really the same gene as CYP33D1 CYP33D3 Caenorhabditis elegans (nematode worm) GenEMBL Z81487 (C54E10), AL021470 (Y17D7A.4) and Z98877 (Y69H2) GenEMBL AL020988 (Y80D3) 33E Subfamily CYP33E1 Caenorhabditis elegans (nematode worm) GenEMBL U61945 C49C8.4 CYP33E2 Caenorhabditis elegans (nematode worm) GenEMBL U61952 F42A9.5 CYP33E3P Caenorhabditis elegans (nematode worm) GenEMBL U61952 F42A9.4 missing C-terminal 3' neighbor is C49C8 (with CYP33E1) 34A Subfamily CYP34A1 Caenorhabditis elegans (nematode worm) GenEMBL Z81119 and AL022301 (Y75B12) T10H4.10 There are two P450s listed in this clone as one gene This is the first one CYP34A2 Caenorhabditis elegans (nematode worm) GenEMBL Z81119 and AL022301 (Y75B12) T10H4.11 There are two P450s listed in this clone as one gene This is the second one CYP34A3 Caenorhabditis elegans (nematode worm) GenEMBL Z81047 and AL022301 (Y75B12) C41G6.1 CYP34A4 Caenorhabditis elegans (nematode worm) GenEMBL AF068712 T09H2.1 CYP34A5 Caenorhabditis elegans (nematode worm) GenEMBL AF039050 B0213.10 CYP34A6 Caenorhabditis elegans (nematode worm) GenEMBL AF039050 B0213.11 CYP34A7 Caenorhabditis elegans (nematode worm) GenEMBL AF039050 B0213.12 CYP34A8 Caenorhabditis elegans (nematode worm) GenEMBL AF039050 B0213.14 CYP34A9 Caenorhabditis elegans (nematode worm) GenEMBL AF039050 B0213.15 CYP34A10 Caenorhabditis elegans (nematode worm) GenEMBL AF039050 B0213.16 35A Subfamily CYP35A1 Caenorhabditis elegans (nematode worm) GenEMBL U97008 C03G6 near 19k CYP35A2 Caenorhabditis elegans (nematode worm) GenEMBL U97008 C03G6 near 22k CYP35A3 Caenorhabditis elegans (nematode worm) no accession number K09D9 CYP35A4 Caenorhabditis elegans (nematode worm) GenEMBL AF016418 C49G7.8 CYP35A5 Caenorhabditis elegans (nematode worm) GenEMBL AF039049 K07C6.5 [and part of F40C5 contig 6 9-578 plus strand Note: newest version of F40C5 does not contain this P450. probably an error corrected by the sequencers] NOTE: K07C6 IS NOW IN GENBANK AND IT CONTAINS 4 P450S It is part of a contig with T09H2 and B0213 that has 13 P450s in a cluster 6 P450s on B0213 appear to have inserted themselves into a cluster of three olfactory receptor-like genes. K09D9 is the next clone 3' and it has at least one more P450. C49G7 is next with one more P450 Contig order K07C6 T09H2 B0213 K09D9 C49G7 35B Subfamily CYP35B1 Caenorhabditis elegans (nematode worm) GenEMBL AF039049 K07C6.4, [F40C5 contigs 5 and 14 Note: newest version of F40C5 does not contain this P450. probably an error corrected by the sequencers] CYP35B2 Caenorhabditis elegans (nematode worm) GenEMBL AF039049 K07C6.3 [and F40C5 contig 15 Note: newest version of F40C5 does not contain this P450. probably an error corrected by the sequencers] CYP35B3 Caenorhabditis elegans (nematode worm) GenEMBL AF039049 K07C6.2 35C Subfamily CYP35C1 Caenorhabditis elegans (nematode worm) GenEMBL Z77652 C06B3.3 35D Subfamily CYP35D1 Caenorhabditis elegans (nematode worm) GenEMBL Z83105 F14H3 near 20k CYP35D2P Caenorhabditis elegans (nematode worm) GenEMBL Z83105 F14H3 near 14k only N-terminal is present 36A Subfamily CYP36A1 Caenorhabditis elegans (nematode worm) GenEMBL Z83220 C34B7.3 37A Subfamily CYP37A1 Caenorhabditis elegans (nematode worm) GenEMBL Z81493 (F01D5) and Z92851 (Y39G8) 37B Subfamily CYP37B1 Caenorhabditis elegans (nematode worm) GenEMBL Z93381 and Z93389 F28G4 and T13F3 CYP38A1 Suberite domuncula (sponge) GenEMBL Y17816 (1789bp) Mueller,W.E.G., Wiens,M., Batel,R., Steffen,R., Borojevic,R. and Custodio,M.R. Establishment of a primary cell culture from a sponge: Primmorphs from Suberites domuncula Mar. Ecol. Prog. Ser. In press most similar to the CYP4 family MLDFVIFAITAVAGLIGILLFFYFSRSTETKPVSSASPTSTIPR WSAPPADIEKGDLDVMMKKHGSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFK ESAVFVNRPVELFVGFEPLITPFSIQYANDEDWVQRSKCLYHTLKGDDLKSYFHHFVQ IAQEEESLWSSYTSDKEVSLTKEVFPMTIKGIARTCFGDIFKDENELSKMAESYHVCW RTMEEGVPEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFIDSMLQNYDSE DKIIADAISFMVGGFHTSGYMFTWMLWYLSSHPESQDRLRTEIERETGGERGDRLKEY SLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKT VWENTDSWDPDRFSPNGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIV PVEGQTVIQVHGLVTEPKDDIKIYIRSRKED" CYP39 human GenEMBL EST R07010 R11279 and UNIGENE entry Hs.25121 covers the C-terminal part of a P450. The 2 ESTs with coding regions are not found in UNIGENE, but the opposite end of EST R11279 = R11221 and it is in UNIGENE with 13 EST sequences all from the 3 prime noncoding region. This sequence is most like CYP4A11, but the percent identity is only 39%. Since this is the most conserved region of P450s, the sequence must be in a new family. More sequence is known form the mouse homolog. ENLLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIPSGDLLMLSPFWLHRNPKYFPEPELFKPERWEKGKFRRKHSFL GTASWA FGAGSSQCPGKVFALLEVQVC CYP39A1 human AC008104 AL035670 note heme region exon corrected 1/18/02 MELISPTVIIILGCLALFLLLQRKNLRRPPCIKGWIPWIGVGFEFGKAPLEFIEKARIK YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYRT ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVR HLLYPVTVNMLFNKSLFSTNKKKIKEFHQYFQVYDEDFEYGSQLPECLLR NWSKSKKWFLELFEKNIPDIKACKSAKDNSM TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP VAFWTLAYVLSHPDIHKAIMEGISSVFGKAG KDKIKVSEDDLENLLLIKWCVLETIRLKAPGVITRKVVKPVEIL NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKPERW KKANLEKHSFLDCFMAFGSGKFQCPARW FALLEVQMCIILILYKYDCSLLDPLPKQ SYLHLVGVPQPEGQCRIEYKQRI CYP39A1 Macaca fasicularis GenEMBL AB220544.1 CYP39A1 Canis familiaris (dog) GenEMBL XM_538946.2 CYP39 mouse GenEMBL ESTs AA096922, AA606237 Note: this sequence has a poorly conserved I Helix consensus QFKTYDEGFEYGSQLPEWLLRNWSKSKRWLLALFEKNIGNIKAHGSAGHS GTLLQAILEVVETETRQYSPNYGLVVLWAALANAPPIAFWTLGYILSHPDIHRTVL ESISSVFGTAGKDKIKVSEDDLKKLLIIKWCILESVRLRAPGVITRKVVK PVKILNHTVPSGDLLMLSPFWLHRNPKYFPEPESFKPERWKEANLDKYIF LDYFMAFGGRKFQCPGKWFALLEIQLCIILVLYKYECSLLDPL Cyp39 mouse GenEMBL NM_018887.1 AF237981 39a1 (oxysterol 7alpha-hydroxylase) 72% to human MGIMELFSPIAIAVLGSCVLFLFSRLKNLLGPPCIQGWIPWIGAGLEFGKAPLEFIEKARIKY GPVFTIFAMGNRMTFVSEEEGINVLLKSEHVDFESAVQSPVYHTAWIPKNVFSALHERLY ALMKGKMGTFNTHHFTGPLTEELHEQLEGLGTHGTMDLNDFVRYLLYPATLNTLFKKGLF LTDKRTIKEFYQQFKTYDEGFEYGSQLPEWLLRNWSKSKRWLLALFEKNIGNIKAHGSA GHSGTLLQAILEVVETETRQYSPNYGLVVLWAALANAPPIAFWTLGYILSHPDIHRTVL ESISSVFGTAGKDKIKVSEDDLKKLLIIKWCILESVRLRAPGVITRKVVKPVKILNHTVP SGDLLMLSPFWLHRNPKYFPEPESFKPERWKEANLDKYIFLDYFMAFGGGKFQCPGRW FALLEIQLCIILVLYKYECSLLDPLPKQSSRHLVGVPQPAGKCRIEYKQRA* CYP39A1 Bos taurus (cow) See cattle page for details 192 MEFISPTVIIILSCVAVLLFLQWKNLRRPPCIRGWIPWIGAGFEFGKTPLEFIEKARIK YGPVFTVIVMGTRMTFVTEEEGINVFLKSKEINFELAVQNPVYHT ASIAKNIFLKLHEKLYITVKGKMGIFNLYKFTGQLTEELQEQLQNLGTHGTTDLNKFMR HLLYPVTVNILFKKGLFPTDERKIREFHQHFQAYDEGFEYGSQLPECLLR NWSKSKKWLLALFEKNIPDIKTHKSAKENYP TVMQAVLDLLEMEANEQKSPNYGLLLLWASLSHTVP VAFWTFAFVLSHPNIHRTIMEGISSVFGTAG KDKIKVSEDDLKKLPLIKWCILETIRLRAPGVIARKVLKPVKIL DYTVPSGDLLMLSPFWLHRNPKYFPEPDLFKPERW KEANLEKHAFLDCFMAFGSGKYQCPGRW LALLEIQICIILIFYYYDCSLLDPLPKQ SSLHLVGVQQPEGRCRIQFKQRK* CYP39A1 Sus scrofa (miniature pig) no accession number Haitao Shang Submitted to nomenclature committee May 23, 2007 partial sequence 80% to human CYP39A1, complete seq constructed from ESTs Ortholog of human CYP39A1 CYP39A1 pig ESTs BP441358.1, BP458693.1, CV872464.1, DN131248.1, CB287479.1 DN131598.1, CN156146.1, DN115951.1, BP166812.1, DN116306.1, CN154029.1, MEFISPAVIIILGCVTLLLFLQWKNLRGPPCIGGWIPWIGAGFEFGKAPLEFIEKARIKY GPIFTVLAMGTRMTFVTEEEGMNVFLKSKEVNFELAVQNPVYRTASIPKNIFLKLHEKLC VMMKGKMGTFNLYQFTGQLTEELHKQLENLGTHGTMDLNNLVRHLLYPVTVNVLFK KGLFPTNERKIREFHQHFQAYDEGFEYGSQ LPERLLRNWSKSKNWLLALFEKNI PDIKTY KSAKDHSQTIMQDILDLLEMETNEQMSPNYGLLLLWASLSNAVPVTFWTLAFILSHPNIH RTILEGISSVFGTTGKDKI EVSEDNLKKLPLIQWCILETIRLRAPGVITRKVLKP VKILNYTVPSGDLLMLSPFWLHRNPKYFPEPDLFKPERWKKANLEKHAFLDWFMAFGTGKYQCP GRWLALLEIQMCVILIFYYYDCTLLDPLPKQSSLHLVGVQQPEGQCRVEFKQRK* CYP39A1 Gallus gallus (chicken) GenEMBL BG712800 BG710422 BU226060 AI979980 BU247492 BU226060 XM_420064.1 57% TO 39A1 HUMAN MAAAVLLAALLAVAVVAWVLFPPSEP RRPPCIGSRVPCLGAAF RLARAPLEFIERARAEHGPIFTVFALGKRFTFVTEDEGAEVFFTSKDLNFEQAVQRVV ANAVSVPAEAFYQNHGRLYSMMKGKMGPSNLHMFTGTLCKELHEHMAHLGTAGTGDLMEL VRHIMYTAVVNTLFGKGACPTSHSELREFEEHFWKYDEDFEYASQMPECFLRNWSKSKKW LLKLFEKVVSDAERTNPSETTSKTLLQHLLDNLQGK HLAPNYGLLMLWASQANAVPIAFWTLVFILSSP SVYKKVMEDLTSVFGNAGKDEIEVSEEDLKNLPYIKWCTLEAIRLRSPGAITKKVIKPIR IQSFTIPAGDMLMLSPYWLHRNPKYFPDPEMFKPDRWKEANLEKNAFLDGFVAFGGG NHQCPGRWFAIMEIQLLVALFLYKYEFVLLDAVPKESPLHLVGTQQPLTPFRVQYKGWE* CYP39A1 Xenopus tropicalis See Xenopus pages for seq CYP39A1 Xenopus laevis GenEMBL BC106433.1 CYP39A1 Danio rerio (zebrafish) GenEMBL NM_001030189.1 CYP39A1 Salmo salar (Atlantic salmon) GenEMBL DW563555.1 CB512602.1 ESTs EVVTLVLSLVILAISAHLLFAGNYPNAPPCIKGWIPWFGVAFEFGKSPLTFISQARDKYGPVFT 193 194 VVAAGKRLTFVTLHEDFRTFFMSKDVDFEQAVQEPVHNTASISKDSFYKFHPACNTLIKG 373 374 RLTPGNVAQLTDHLCEEFNDHLET 1 LGDQGSGGLSELVRAVMYPAVMSNLLGKYNSPGSPFTMEQFKEKFAIYDEGFEYGSQLPD 180 181 MFLREWASSKCWLLSLLGNMVVKAEDDETSSESGNRTLLQHLATLITDKFLPNYGLL 351 352 MLWASLANAIPITFWAVAFILSNPTVYQTAMEQINAALKDQDTRKTKVTAEELQQMPYVK 531 532 WCILEAIRLRAPGAITRRVVRPLRIQNYIIPPGDLLMVSPYWAHRNPHYFPEPEEFKPER 711 712 WEKADLVKN 738 CYP39A1 Oncorhynchus mykiss (trout) GenEMBL BX298944.3 cDNA 25 LVLFLVVLAISAHLLFGGNYPNAPPCIKGWIPWFGVAFEFGKSPLTFISQARDKYGPVFT 204 205 VVAAGKRLTFVTLHEDFRTFFMSKDVDFEQAVQEPVHNTASISKESFYKFHPACNTLIKG 384 385 RLTPGNVAQLTDHLCEEFNDHLETLGDQGSGGLNELVRAVMYPAVMSNLLGKYNSPGSPF 564 565 TMEQFREKFAIYDEGFEYGSQLPDXVLRXWASSKCWLLSLLGNMXCQAEDDETSSESGNR 744 745 TFTAHLV 765 40A Subfamily CYP40X rat, mouse, human (name changed to CYP27B1) Note: This family was created based on the rat sequence AF000139 submitted to the Nomenclature Committee in May 1997. This was the only sequence available at the time and it was about 38% identical to rat CYP27. The sequence was considered a new family and named CYP40. Since then, there have been additional sequences determined for the rat mouse and human homologs. The rat sequence AB001992 differs from the AF000139 sequence at 34 amino acids, mostly in two regions that are probably frameshifts. The sequence AB001992 is 42.3% identical to the rat CYP27A sequence. The mouse and human sequences are also more than 40% identical to the CYP27A sequence. Therefore, they belong in the CYP27 family as a new subfamily CYP27B1. Since there only appears to be one gene in each species, all three species orthologs will be called CYP27B1. CYP40 has been retired and this is indicated by the X in CYP40X. 41A Subfamily CYP41 Boophilus microplus (southern cattle tick) GenEMBL U92732 (414bp) Crampton,A.L., Miller,C., Baxter,G.D. and Barker,S.C. Expressed sequenced tags and new genes from the cattle tick, Boophilus microplus. Exp. Appl. Acarol. 22 (3), 177-186 (1998) EST is from the middle of the P450 whole sequence known but still confidential. 42A Subfamily CYP42 C. elegans GenEMBL AL020988 (Y80D3), M89401 (EST cm08B12) 43A Subfamily CYP43 C. elegans GenEMBL AF026203 E03E2.1 44A Subfamily CYP44A1 C. elegans GenEMBL U21321 CELZK177 only mitochodrial-like P450 in C. elegans MRRSIRNLAENVEKCPYSPTSSPNTPPRTFSEIPGPREIPVIGN IGYFKYAVKSDAKTIENYNQHLEEMYKKYGKIVKENLGFGRKYVVHIFDPADVQTVLA ADGKTPFIVPLQETTQKYREMKGMNPGLGNLNGPEWYRLRSSVQHAMMRPQSVQTYLP FSQIVSNDLVCHVADQQKRFGLVDMQKVAGRWSLESAGQILFEKSLGSLGNRSEWADG LIELNKKIFQLSAKMRLGLPIFRLFSTPSWRKMVDLEDQFYSEVDRLMDDALDKLKVN DSDSKDMRFASYLINRKELNRRDVKVILLSMFSDGLSTTAPMLIYNLYNLATHPEALK EIQKEIKEDPASSKLTFLRACIKETFRMFPIGTEVSRVTQKNLILSGYEVPAGTAVDI NTNVLMRHEVLFSDSPREFKPQRWLEKSKEVHPFAYLPFGFGPRMCAGRRFAEQDLLT SLAKLCGNYDIRHRGDPITQIYETLLLPRGDCTFEFKKL CYP44A1 Caenorhabditis briggsae XM_001679715.1 85% to CYP44A1 ortholog MRRSIKILAENLENCPYTSATTTAPPRRFSEIPGPREIPVIGNS GNLKYAIGTDAETIENYNKHLEEMYNKYGKIVKENLGFGRKYVVHVFDPADAQTVFAA DGKTPFIVPLQETTQKYREMKGMNPGLGNLNGPEWYRLRSSIQHAMMRPQSVQTYLPF SQKVSDDLVRHVAEEQIRFGNANMQKVAGRWSLESAGQILFEKSLESLGNRSEWADGL IELNKKIFQLSAKMRLGLPIFRLFSTPSWRKMVKLEDEFYAEVDRLMDDALDNLKISE SDSQNMRFASYLINRKELNRRDVKVILLSMFSDGLSTTAPMLIYNLYNIATHPEALRK IQEEIKEDPTSSKLPYLRACIKETFRMFPIGTEVSRITQKDLTLSGYLIPAGTAVDIN TNILMRNMVLFSDSPHEFKPERWLEKSKSVHPFAFLPFGFGPMCAGRRFAEQDLLTSL AKLCANFDIRHRGEPITQIYETLLLPRGNCEFEFKKL CYP44A1 Caenorhabditis brenneri strain PB2801 ABEG01003567.1 85% to CYP44A1 ortholog 10467 MRRSIKKLAEHVEKCPFNGNPTKSSVTRRFSEIPGPREIPVFGNAGNFKYAIGS (1) 10628 10675 DAKTIESYNIHLEEMYRKYGKIVKENLGFGRKYVVHVFDP (1) 10794 10859 ADAQTVLAADGKTPFIVPLQETTQKYREMKGMNPGLGNL (2) 10975 11023 NGPEWYRLRSSVQHAMMRPQSVQT (2) 11094 11141 YLPFSQKVSDDLISHVAKVQLRFGH 11215 11216 VNMQKVAGRWSLESAGQILFEKSLGSLGDRCEWADSLIELNKKIFQLSAK (2) 11365 11413 MRLGLPIFRLFSTPSWRKMVELEDQFYAEVDRLMDNALDKLKVNDSDNQNMRFASYLIN 11589 11590 RKELNRRDVKVILLSMFSDGLST (0) 11661 11703 TAPMLIYNLYNIAAHPEAQREIRKEIQDDPSSSKLPFLRACIKETFRMFPIGTEVSRITQ 11882 11883 KDLILSGFDVPAGTAVDINTNILMR (2) 11957 12004 DEVLFSDSPRIFKPQRWLEKSKEVHPFAFLPFGFGPRMCAGRRFAEQDLLTSLAK 12168 12169 LCANFEICHRGEPITQIYETLLLPRGNCEFEFRKL* 12276 CYP44A1 Caenorhabditis remanei strain PB4641 AAGD02002953.1 83% to CYP44A1 ortholog 11277 MRKSLQKFAEKCPFSATSVKTSVPRRFSEIPGPLELPIIGNIGNFKYAVRS (1) 11429 11477 DSKTIEGYNHHLEEMYHKYGKIVKENLGFGRKHVIHLFDP (1) ADAQIVLAADGKTPFIVPLQETTQKYREMKGMNPGLGNL (2) 11761 11802 NGPEWYRLRSSIQHAMMRPQSVQT (2) YLPFSQIVSEELVKHVAKEQVRFGH 11990 11991 VNMQKVAGRWSLESAGQILFEKSLGSLGDRSEWADGLIELNKKIFQLSAK (2) 12140 12187 MRLGFPLFRLFSTPSWKKMVELEDRFYAEVDRLMDDALDKLTVKDSD () 12379 SQNMRFASYLINQKELNRRDVKVILLSMFSDGLST (0) 12483 12530 TAPMLIYNLYNIAAHPDAQHKIQKEIKEDPTSTKLPFLRSCIKETFRMFPIGTEVSRITQ 12709 12710 KDLILSGFHVPSGTAVDINTNILMR (2) 12784 12829 NEVLFSDSPHEFKPQRWLEKSKDVHPFTFLPFGFGPRMCAGRRFAEQDLLTSLAK 12993 12994 LCANFDIHHRGEPITQIYETLLLPRGNCEFEFRKL 13098 CYP44A1 Caenorhabditis japonica chrUn:100501692-100508750 From UCSC browser MRTSLRKFAENVQKCPFPEKSSDIPARRFSEIPGPREVPFFGSTENLKFAIKS (1) DAQTIENYNEHLAEMYKNYGKIVKENLGFGRKCVVHLFDP (1) VDAQTVFAADGKTPFIVPLQETTQKYREMKGMNPGLGNL (2) NGPEWYRLRSSIQHAMMRPQSVQT (2) YLPFSQKVSQDLVSHVASEQARFGAVNMQKVAGRWSLESAGQILFEKSL GSLAENSEWADGLIELNKRIFQLSAK (2) MRLGLPIFRFFPTPSWRKMVQLEDEFYAEVDRLMDDALDKLKVLDIVEN (2) SKNLRFASYLINRKELNRRDVKVILLSMFTDGLST (0) TAPMLIYNLFNIATHPVAQERIRKELEK (1) DTTKLPFLRACIKETFRMFPIGTEVSRITQKDLVLSGFEIPAGTPVDINTNVLMR (2) NTVLFADDPLIFKPERWLDKSAATHHPFAFLPFGFGPRMCAGRRFAEQDLLTSLAKLCAKFEIRHVGEPVTQIYETLLLPRGNCKFEFASV* CYP44A1 Ascaris suum CB101937.1 44% to CYP44A1 aa 134-170 possible ortholog NGDEWYRLRSSVQKAMMRPQAVRQYLPAVNRVADELLQFI CYP44B1 Lottia gigantea (owl limpet) JGI Protein ID:163695 74% to CYP44B2, 34% to CYP44A1 C. briggsae 35% to CYP44A1 C. remanei MWGRNVKHAANY LKRCSISTSLGISRSEGRAVNSNDVSDYNQALPFEDIPGPRG IKYLGTLHHYMTGRLDKYKYQD ILEEYFSKYGPIFKETLF GSTIVHVCDPKMFEVIYQDEGKYPEIEPLVEPSQKFR GEGDLSLGLGNSNGEKWYNLRAIVQKSML KPAEVSSHLQIINEIANDFVDKTLHKGYS DDFLLDTSIWKLESGGALSFGKRPGYLECGSA QEMEAMKLVKMVDNTFTLATDLKFSFPLYKYIKTPAYKA LLDSEQYINDVTSVYLSETLKHIETEL ERTDDFNDIQNTYRFLYSLLGHKN LSKKDMEVIVLSMFIDGLRTVVPVLISVLFCLSQS PRVQEKLFTEIYNVLEDRKYIAASDIAQLPYLKACVK ETFRLYPIGLEISRINQNDMVLGEYKIPSG TKLNLNNFVLNRD PNIFANPEAFVPERWLRGSEHAVDA HPYLLLPFGRGPRMCVGRRFAEQDLYVLISK LIQNCEPHWTGEKYEHQFKI LVYPACPVKFQFKPRQSSSK CYP44B2 Lottia gigantea (owl limpet) JGI Protein ID:163698 74% to CYP44B1, 37% to 44A1 C. briggsae MWRNFQKNFFSVCGKVNR IKRCWISTSLGISRSEGRAVNSNDVSDYNQALPFEDIPGPQG IKYLGTLPHYMTGRLDKYKYQD ILEEYFSKYGPIFKETLF DKTVVHVCHPDMIKVVYQDEGKYPEIEPLMEPTQKSR CDEGMSLGLGNTNDEQWYRLRSAVQKHMM RPGEVSNYLPQVNEVANEFVDKTMRKGYS DDFILDASIWSLESSGYISFGKRLGYMEGGSE KEAEGVKVVKKNDEAFTLATDLKFSLPFYQQLQTPKYKAWHEAE NYTTKVASVYFENVLKFLETHL DSSSEPKKVQNTYRFLYSLLGHKN LSKKDMEVIVLSMFIDGLRTVVPVLISVLFCLSQS PRVQEKLFTEIYNVLEDRKYIAASDIAQMPYLKACVK ETFRLHPNALEISRVTKNNMVLGGYQIPSG SKLNINAFNMYRN SDIFENPQNYCPERWLRGWEHAVDV HPYLLLPFGRGPRMCVGRRFAEQDLYVLISK FIQNCEPQWRGSEFHQQYKI LMYPDRPVKFEFKSR CYP44C1 Capitella capitata (Gallery worm, polychaete worm, annelid) estExt_Genewise1Plus.C_850046 [Capca1:171547] 39% to CYP44B1, 37% to 44A1 C. briggsae MATRSLLQQQPSARVKPFSLIPGPRGLPYLGTLLDYKTGKYAPSSFDKAFRANHQKYGKIFKETIAGSTK VHLCDPDYAQRLFQLEGKRPHTPPLLETSKMYRKINRLSLGLGNMNGEKWSRSRKAVGHLLMKPKSVSQF LPHINGCVDDFITKIHHLRDEEGLVENFSNEIKMWTLESTANICFETRLGALNGTAAQSDFIQRMVNAMD SMLEQALKLRFSFGWWRLLPTPAWQKLYDCEDFFFSNAQRIVNEAISKMDALLESGDFVEGHSYPFLSYL LGREELSLEDVRIISMAMFSDGMITTSPTVASQLYCLATNPEVQDKVYEEVLRVVGIKTKEITSGHLEEL SYLKSCIKEGFRFFPIGTEISRIVPTDITIGGYLIPKGTHVEVNTNMLLQSPQYFHDAHRFIPERWTRDG SASNVHPYLVRPFSCGPRMCPGKRIAEQEMLTFIAKLLRHYRVEWRESHEMSQRYRILLAPDCSADFYFI PRNN* 45A Subfamily CYP45 Homarus americanus (American lobster) GenEMBL AF065892 (1581bp) Snyder,M.J. Identification of a new cytochrome P450 family, CYP45, from the lobster, Homarus americanus, and expression following hormone and xenobiotic exposures Arch. Biochem. Biophys. (1998) In press 46A Subfamily CYP46 human GenEMBL NM_006668 Lund EG, Guileyardo JM and Russell DW. cDNA cloning of cholesterol 24-hydroxylase, a mediator of cholesterol homeostasis in the brain. Proc. Natl. Acad. Sci. U.S.A. 96, 7238-7243 (1999) 32% identity with Drosophila 4D2 ESTs H06539, H51951, R36281 mouse homolog EST AA096922 MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPS FLLGHLPCFWKKDEVGGRVLQDVFLDW AKKYGPVVRVNVFHKTSVIVTSPESVK KFLMSTKYNKDSKMYRALQTVFGER LFGQGLVSECNYERWHKQRRVIDLAFSRSSLVSLMETFNEKAEQLVEILEAKADGQTPVSMQDMLTYTAMDILAK AAFGMETSMLLGAQKPLSQAVKLMLEGITASRNTLAK FLPGKRKQLREVRESIRFLRQVGRDWVQRRREALKRGEEVPADILTQILK AEEGAQDDEGLLDNFVTFFIA GHETSANHLAFTVMELSRQPEIVAR LQAEVDEVIGSKRYLDFEDLGRLQYLSQ VLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL FSTYVMGRMDTYFEDPLTFNPDRFGPGAPK PRFTYFPFSLGHRSCIGQQFAQ MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC Cyp46 mouse GenEMBL AF094479 NM_010010 ESTs AA096922, R75217 no UNIGENE entry MSPGLLLLGSAVLLAFGLCCTFVHRARSRYEHIPGPPRPSFLLG HLPYFWKKDEDCGRVLQDVFLDWAKKYGPVVRVNVFYKTSVIVTSPESVKKFLMSTKY NKDSKMYRALQTVFGERLFGQGLVSECDYGRWYKQRKVMDLAFSRSSLVSLMETFNEK AEQLVEILEAKADGQTPVSMQDMLTCATIDILAKAAFGMETSMLLGAQKPLSQAVKVM LEGISASRNTLAKFMPGKRKQLREIRESIRLLRQVGKDWVQRRREALKRGEDMPADIL TQILKAEEGAQDDEVLLDNFVTFFIAGHETSANHLAFTVMELSRQPEIVARLQAEVDE VVGSKRHLDYEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLL FSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQMEVKVV MAKLLQRIEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC CYP46A1 Bos taurus (cow) See cattle page for details MSPGLLMLLGSAVLVVFGLCCTFVHRARSRYEHIPGPPRPS (2) FLLGHLPYFWKKDEVCGRVLQDVFLDW (2) AKKYGPVVRVNVFHKTSVIVTSPESVK (0) KFLMSTKYNKDSKMYHAIQTVFGER (1) LFGQGLVSECXYERWHKQRRIMDLAFSRSSLVGLMGTFNEKAEQLVEILEAQ 198 199 ADGQTPVSMQDMLTCATMDILAKAAFGMETSMLLGAQKPLSRKVKLILEGISASRNTLAK 378 379 FMPGKWKQLXETRESVRFLRQVGKEWVXRXRXALQRGEDVPADILTQILKAEEGAQD 367 DEILLDNFVTFFIAGHETSANHLAFTVMELSRQPEILARLQAEVDEVIGSKRHLDCEDLG 188 187 RLQYLSQVLKESLR 146 LYPPAWGTFRLLEEETLIDGVRVPGNTPLL FSTYVMGRMDTYFEDPLTNPDRFGPGAPK PKFTYFPFSLGPRSCIGQQFAQ MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLQPRGWQPAPPPPPC CYP46A1 pig BI359892.1, BF193580.1, BF190788.1, lower case = cow seq. mspgllmllgsavlvvfglcctfvhrar SRYEHIPGPPRPSFLLGHLPYFWKK KDEVCGRVLQDVFLDWAKKYGPVVRVNVFHKTSVIVTSPESVKRFLMSTKYNKDS KMYHAIQTVFGERLFGQGLVSECDYERWHKQRRVMDLAFSRSSLVSLMGIFNEKAEQLVE ILEAQADGQTPVSMQDMLTCTTMDILAKAAFGIETSMLLGAQKPLSRKVKLIL egisasrntlakfmpgkwkqlxetresvrflrqv GKDWVQRRREALQRGEDVPADILTQILKAEEGAQD DEILLDNFVTFFIAGHETSANHLAFTVMELSRQPEILARLQAEVDEVIGSKRHLDCEDLG RLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLLFSTYVMGRMDTYFEDP LTFNPDRFSPKAPKPKFTYFPFSLGPRSCIGQQFAQMEVKVVMAKLLQRLEFRLVPGQSFGLQE Qatlkpldpvlctlqprgwqpapppppc CYP46A1 Xenopus tropicalis See Xenopus pages for seq CYP46A1 Danio rerio (zebrafish) Seq zebrafish pages for seq CYP46 Fugu rubripes (pufferfish) No accession number Scaffold_4537 60% to 46 human MGVFNLIFGWISQASIFLLLLLFIALLGYCMYIKYTHMKYDHIPGPPRDS (2) FFSGHSSKLLDIMKDDGVVHDMFLKW (2) AETYGPVYKIYFLHHVMVFVSCPETTK (0) EMLMSPKYTKDKFLHNRIGSLFGQR (2) FLGNGLVTVRDHEKWYKQRRIMDPAFSSL (2) YLRSLMGNFNETADKLMDKLSEIADNKTTANMLHLVNCVTMEVLAK (0) VAFGVDLDLLRKSSPFPRAVELCLKGMVFSIRDTFFM (0) LNPKNWSFIREVRGACRLLRQTGAQWIQQRKTAMRNGEVPKDILTQIIKSAGK (1) EEIMTQEDEEFMLDNFLTFFIA (1) GQETTANQLGFCIMELGRHPDILER (2) VKKEVDEAIGMKQDISYDDLGHLGYLSQ (0) VLKETLRLYPTAPGTSRDLKEDMVIGGVHVPGGVVCV (0) FSSYGMGRMETFFKDPLKFDPDRFDPDAPK (2) PYYCYFPFSLGPRSCLGQNFAQ (0) MEAKVVMAKLIQRFDFTLLPGQSFDILDNGTLRPKSGVLCSLRHRDHKK* CYP46A2P Fugu rubripes (pufferfish) CYP46A2 Danio rerio (zebrafish) Seq zebrafish pages for seq CYP46A3P Fugu rubripes (pufferfish) 46A-se1[12:13:14] human NT_004424.11|Hs1_4581 chromosome 1 CYP46 pseudogene fragment Old name = CYP46A4P 2405597 DPLTFNPYRFGPGAPKPRFTYFPFSLGHHSCIGQQFAQMEVKVVMAKLLQRLEFQLVPGP 2405776 2405777 RFGLQ*QATLKPLDPELCTLRPRGWQPAAPPPRC 2405878 Cyp47a1x Drosophila melanogaster GenEMBL AC005556, AC004516, AC004426 intron exon boundaries approximate renamed Cyp4aa1 following CYP4Z1 CYP48A1 Trichogramma cacciae (a parasitoid wasp) GenEMBL AF207950 BBRC 268, 677-682 (2000) Cloning and expression of cytochrome P450 genes belonging to the CYP4 family and to a novel family, CYP48, in two hymenopteran insects, Trichogramma cacciae and Apis mellifera by S. Tares, J.B. Bergé and M. Amichot. 123 amino acid sequence from I-helix to PERF region 45% identical to CYP47A1, but sure to drop into the mid to low 30% range for a full length sequence clone name 16Tc 66% to CYP4AB22 TAAAMTFTIMLLAENEEAQDKARAEVTQIFDRCDGKIEMQDIQD MTYLEWCVKEALRLYPPVSTMTRTMSEDLQLSKDFLVPAGAEVIFHLYDTHRDPNFWE DPDKFDPDRFSPERSHGRHPFSYL CYP48A2v1 Trichogramma cacciae (a parasitoid wasp) GenEMBL AF207951 BBRC 268, 677-682 (2000) Cloning and expression of cytochrome P450 genes belonging to the CYP4 family and to a novel family, CYP48, in two hymenopteran insects, Trichogramma cacciae and Apis mellifera by S. Tares, J.B. Bergé and M. Amichot. 124 amino acid sequence from I-helix to PERF region 40% identical to CYP47A1, but sure to drop into the mid to low 30% range for a full length sequence clone name 15Tc 72% to CYP4AB10 TGMAMTFTLMLLAENKEAQTKARAEVTQVMEKYNGNLNTSSLQE LPYLERCIKESLRLFPPVATLLRYTADELQLKNALVPADSHIMIHLYDTHRDANYWPN PDVFDPDRFLPERSANRHTFAYV CYP48A2v2 Trichogramma cacciae (a parasitoid wasp) GenEMBL AF207952 BBRC 268, 677-682 (2000) Cloning and expression of cytochrome P450 genes belonging to the CYP4 family and to a novel family, CYP48, in two hymenopteran insects, Trichogramma cacciae and Apis mellifera by S. Tares, J.B. Bergé and M. Amichot. 124 amino acid sequence from I-helix to PERF region 40% identical to CYP47A1, but sure to drop into the mid to low 30% range for a full length sequence. 58% identical to CYP48A1 clone name 21Tc ony two amino acid differences with CYP48A2v1 71% to CYP4AB10 TGMAMTFTLMLLAENKEAQTKARAEVTQVMEKYNGNLNTSSLQE LPYLERCIKESLRLFPPVATFLRYTADVLQLKNALVPADSHIMIHLYDTHRDANYWPN PDVFDPDRFLPERSANRHTFAYV CYP49A1 Drosophila melanogaster GenEMBL AC017930 48452-54739 AC007398 AC007418 ESTs AA941795 AI258819 AA697999 mitochondrial P450 fits in the mitochondrial clan Cyp49a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP49A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs2l1 CYP49A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroupUn.423a 45% to 49A1 Drosophila. Note this sequence is named in the same subfamily as other CYP49s since there seems to be only one orthologous sequence per species and it does not make sense to name orthologs with different subfamily names. CYP49A1 Bombyx mori (silkworm) BAAB01133495.1 BAAB01150577.1 BAAB01091102.1 BAAB01198981.1 BAAB01133297.1 BAAB01093561.1 44% to CYP301A1 CYP49 and CYP301 are nearly subfamilies of a single family See silkworm page for sequence CYP49A1 Tribolium castaneum (red flour beetle) GenEMBL XP_970738 63% to 49A1 Drosophila, 47% to CYP301A1 Tribolium MSLSRKLLVPKRALAVAQRAYSTDRPYSTAIMPELLDELAIPEHVERVEATPRPYSAIPGPKELPLIGNA WRFAPIIGQYKIQELDKVMWSLNRDYGRIVKVGGLIGHPDLLFVFNGDDIEKVFRMEEAMPHRPSMPSLH YYKQILKKDFFDGNAGVIGVHGPKWEEFRKKVQHALLPPQIAKKYIEPLDVIAGDFLHRMEDMLDENQEL PNHFLSEIYKWALESVARVSLDTRLGCLEPNLSQNSESQRIINSINTFFWNVAEVELKMPVWRVYKNRSF KKYIGALEDFRTLCLKHIHKSMEKMQEKNFDEIKEENISIVERILLKTDNPKLAAVLALDLLLVGVDTTS IAAASTIYQLSQNPEKQQKLFDELQQVLPENDSKIDVSIQDKMPYLKACIKETLRMYPVIIGNGRSLQTD TVIAGYKVPKGTHVIFPHLVVSNSEDYFHEPHRFLPERWLKTENACPMHKKIHPFVTLPFGYGRRSCLGR RFAEAELQILLAKIFRKYKVEYNYGPLSYKITPTYVPEQPLKFKLVKRE CYP49A1 Aedes aegypti (yellow fever mosquito) CYP49A1 Culex pipiens CYP49A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph49, 58% to CYP49A1 Aedes, only 44% to CYP303A1 Aedes, 58% to 49A1 Anopheles gambiae Pediculus genome site CYP49A1? Helicoverpa armigera (cotton bollworm) No accession number Heiko Vogel submitted to nomenclature committee Feb. 6, 2008 Contig_262 only 26% to CYP49A1 unidentified mito clan N-term All best hits are CYP49A1 and this protein is not well conserved CYP49A2 Bombyx mori (silkworm) BAAB01014940.1 BAAB01083735.1 BAAB01056249.1 51% to CYP49A1 Drosoph. CYP49 and CYP301 are nearly subfamilies of a single family See silkworm page for sequence Note this is the last two digit number for animal P450s (except CYP51). The animal P450s will continue at CYP301A1. This leaves 200 possible animal families before reaching CYP501 and higher that are reserved for lower eukayotes. 51A Subfamily A note on nomenclature. CYP51s were originally all called CYP51, because only one gene was found per species and they all seemed to be in this one conserved family. However, rice had many CYP51s in at least two sequence groups, so subfamilies have been designated for CYP51s. These are not the typical subfamilies, but only one subfamily is created for each major taxonomic group. CYP51A for animals, CYP51B for bacteria. CYP51C for Chromista, CYP51D for Dictyostelium, CYP51E for Euglenozoa, CYP51F for fungi. Those groups with only one CYP51 per species are all called by one name: CYP51A1 is for all animal CYP51s since they are orthologous. The same is true for CYP51B, C, D, E and F. CYP51G (green plants) and CYP51Hs (monocots only so far) have individual sequence numbers. CYP51A1 human GenEMBL U23942 (3172bp) Stromstedt,M., Rozman,D. and Waterman,M.R.(1995) The ubiquitously expressed human CYP51 encodes lanosterol 14 alpha demethylase, a cyochrome P450 whose expression is regulated by oxysterols. Arch. Biochem. Biophys. 329, 73-81 (1996) unpublished CYP51A1 human GenEMBL U51684 to U51692 (genomic sequence) Rozman,D., Stromstedt,M., Tsui,L.-C., Scherer,S.W. and Waterman,M.R. Structure and mapping of the human lanosterol 14-alpha demethylase gene (CYP51) encoding the cytochrome P450 involved in cholesterol biosynthesis; comparison of exon/intron organization with other mammalian and fungal CYP genes. Genomics (1996) In press CYP51A1 human GenEMBL D55653 (3085bp) Aoyama,Y., Funae,Y, Noshiro,M., Horiuchi,T. and Yoshida,Y. Occurrence of a P450 showing high homology to yeast lanosterol 14- demethylase (P450DM) in rat liver. Biochem. Biophys. Res. Commun. 201, 1320-1326 (1994) CYP51A1 Macaca fasicularis (cynomolgus monkey) No accession number Yasuhiro Uno Submitted to nomenclature committee 1/11/2005 Clone name mfCYP51A1_7H11 98% to CYP51 human CYP51A1 Bos taurus (cow) See cattle page for details 22188 MLDLLQAGGSVLGQAMEQVTGGNLASMLLIACAFTLSLVYLFRLAVGHLAPPLPTGA (0) 22018 20194 KSPPYIVSPIPFLGHAIAFGKSPIEFLEDAYEK (0) 20096 17795 YGPVFSFTMVGKTFTYLLGSEAAALLFNSKNEDLNAEEVYSRLTTPVFGKGVAYDVPNT 17619 16395 VFLEQKKMLKSGLNIAHFRQHVSIIEKETKEYFKSWGESGEK 16270 14644 NLFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFR 14468 13586 RRDRAHREIKNIFYKAIQKRRESGEKIDDILQTLLESTYK 13467 13076 DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCFLEQKTVCGENLPPLTYDQ (0) 12882 11489 LKDLNLLDRCIKETLRLRPPIMTMMRLAKTP 11397 10597 QTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLEDSPASGEKFAYVPFGA (1?) 10427 7020 GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPEKPIIRYKRRSK* 6841 CYP51A1 pig AB042982, AB009988, NM_214432 MVLLGLLQAGGSVLGQAMEQVTGVNLLSSLLLACAFTLILVYLF RQAIGHLAPLPAGAKSPPYIFSPIPFLGHAIAFGKSPIEFLENAYEKYGPVFSFTMVG KTFTYLLGSDAAALLFNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNPVFLEQKKMLK SGLNIAHFRQHVSIIEKETKEYFQSWGESGERNLFEALSELIILTASHCLHGKEIRSQ LNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRQSEE KIDDILQTLLDSTYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQ EKCYLEQKTVCGEDLPPLTYDQLKDLNLLDRCIKETLRLRPPIMTMMRMAKTPQTVAG YTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGEKFAYVPFGAGRHRCI GENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVIRYKRRSK CYP51P1 human no accession number Rozman, D. Stromstedt, M. and Waterman, M.R. The three human cytochrome P450 lanosterol 14a-demethylase (CYP51) genes reside on chromosomes 3, 7, and 13: Structure of two retrotransposed pseudogenes, association with a line-1 element, and evolution of the human CYP51 family. Archives of Biochemistry and Biophysics 333, 466-474 (1996). processed pseudogene on chromosome 3 CYP51P1 human processed pseudogene U36926 5 in frame stops MAAAAGMMLLGLLQAGG*VLGQAMEEVAGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQL TAGAKSPPYIFSPVPFLGHAIAFGKSPTEFLENAYGNYGPVFSFIMVGKAFTYLLGSDAA ALLFNSKNEVLNAEDVYSRLTTPVFG*GVAYDVPNPVFLEQKKTLKSGLNIAHFK*HVSI IEKETKEYFESWGESGEKNVFAALSELIILTASHYLHGKEIRSQHNEKVAQLYADLDGGF SHAAWLLPGWLPLPCFRRRDRAHQEIKDIFYKAIQKRRQSQEKIDDILQTLLDATYKDGR PLTDDEVAGMLTGLLLAEQHTSSTSA*MGFFLARDKTLQEKCYLEQKTVCGENLPPLTY DQLKDLNLLDRCIKETLRLRHPVMIMMRMARIPKTVAGYTIPPGHQVCVSPTVNQRLKDS WVEHLDFNPDRYL*DNPASREKFAYVPFGAGHHGCTGENFAYVQIKTIWSTMLRLYEFDL IDGYFPTVNYTTMIHTPENPVIHYK*RSK CYP51P2 human no accession number Rozman, D. Stromstedt, M. and Waterman, M.R. The three human cytochrome P450 lanosterol 14a-demethylase (CYP51) genes reside on chromosomes 3, 7, and 13: Structure of two retrotransposed pseudogenes, association with a line-1 element, and evolution of the human CYP51 family. Archives of Biochemistry and Biophysics, 333: 466-474 (1996) processed pseudogene on chromosome 13 CYP51P2 human processed pseudogene U40053 MAAAAGMMLLGLLQAG GSVLGQAMEEVTGGNLLSMLLIACTFTLSLVYLFRLAAGHLVQLPAGAKSPPYVFSPVP FPGHAIAFGKSPVEFLE NAYEKYGPVFSFTMVGKTFTYL LGSDAAALLFNSKNEDQNAEDVYSHLTTPVFGKGVAYDVPNPVFLEQKKMLKSGLNKAHF KQHVSL EKETKEYFQSWGESGEKNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFS HAAWLLPGWLPLPSFRCRDRAHWEIKDIFYKAIQKRRQSQEKIDDILQTLLDATYKDGRP LTDDEVAGMLIGLLLAGQHSSSTTSAWMDFFLARDKTLQEKCYLEQKTVCGENLPPLTYD QLKGLNLLDRCIKETLRLRPPIMIMMRMARTPQTVVGYTIPPGHQVCVSPTVNQRPKDSW VERLDFNPDCYLQDNPASGEKFAYVPFGAGCHR*IGENFAYVQIKTIWSTMLRLYEFDLI DGYFPIVNYTTMIHTPENPLIHYKRRSK CYP51P3 human NT_025741.8|Hs6_25897 chromosome 6 CYP51P3 In an intron of the SASH1 gene 20447678 MLGLVQMSRSALGQLVEWVAGESDSLLSMLLISCAFILSLVCFATIVTTWPSCQLVQNAH 20447857 20447858 HMFSLPLHSLGMPYIWEKLN*ISRKCI*EVWAYEKYGPVCSFSVVSKTFT 20448007 20447859 ICFLSHYIPWACHTFGKS*IEFLESAYEKYGHMRSMDLYVVFLW*ARHLLLERDETAL 20448032 20448347 LFNSKTKDAEDVYSHLRTPAFGKGVECDMPNPAFLGQEKMLKSSLKVAHFRQQVSI 20448514 20448515 TEKERNTFKLGRKQRKKLCEALSELII 20448595 FDS*PWFTWKGNQKSTQ*EVAQLCADVSGGFKPLA 20448698 20448699 WLRPRWLPLRGCRSSYKAH*EHNYCL*GNPETQKPEEKIQGILQTLLDTTDKGEHLLAD 20448875 20448876 EEVTGLLIRLFSGQHTSSTTGA*MSCFVARDETLHEKCYLKQKTVRGEDLPSLTYDLLK 20449052 20449426 VIGRTIPAGHQMGLSLTVNQGFQDTWVELVDPDQ*LQDISTGEK 20449557 20449566 GGGRHHIGKNFAHVQIKTVWSTLFHLYEFDFIDGYFPTVNYKTVVHHTPKNPVITYK*R 20449742 CYP51P3 Cavia porcellus (Guinea pig) cavPor3_dna scaffold_60:1743956-1873082 at UCSC browser 32% to 51 human in orthologous position to CYP51P3 pseudogene inside SASH1 gene 60284 VVAEVGWLLLGFWQVSRLVVRWVVEQVTGASPCPY*PSPGPLSWAGLSASHL* 60126 60125 SPGTAAGGTESPSYLSLCSIPWAH*GCALAFW*CPAEFLENRV*EEWACLEFLWGEEEIY 59946 59945 FLVESDAAALLFKSKTEELNEEVHS 59871 CVPASGFQEG 59841 59545 VFIVETETKE*SNVGEEVEKIMYMSNFHSQS*F*KEIRSPSMWPWCSSYGA 59393 59392 IWSC*PFT 59368 WLLLGWQPWPRFSAATELTEASRTLSVRPSQSADTHEI*KKSSDSLHPLLKSTDKTGCPV 59189 59188 ACDEEAGMLT*LSFSR*LTSMTGF*MELIVARQNASRKRDL*QETVCRKALLPVTCDQL 59012 59011 RALNLLAHCIRQTSRPQ*DSSDCGKVSLFFQDSDYVFQPTGLES*WMH 58868 58867 ALKSLGDKFAHMPFRARHHCTGE 58799 58799 NFAYVQTRTV*FATLHWCGFDLINRFFPTINYTTMIRIPENLV 58671 CYP51A1 rat GenEMBL D29962 (1924bp) PIR JC2334 (430 amino acids) GenEMBL D55681 (2268bp) Aoyama,Y., Funae,Y, Noshiro,M., Horiuchi,T. and Yoshida,Y. Occurrence of a P450 showing high homology to yeast lanosterol 14- demethylase (P450DM) in rat liver. Biochem. Biophys. Res. Commun. 201, 1320-1326 (1994) Note: This is the first time a single P450 family is represented in plants and animals. The N-terminal is more conserved than the C- terminal. CYP51A1 rat GenEMBL U17697 (2383bp) Sloane,D.L., So,O., Leung,R., Scarafia,L.E., Saldou,N., Jarnagin,K. and Swinney,D.C. Molecular Cloning and Functional Expression of Rat Lanosterol 14 alpha-Demethylase Gene (1995) In press CYP51A1 rat Aoyama, Y. et al . Sterol 14-demethylase P450 (P45014DM) is one of the most ancient and conseved P450 species. The Journal of Biochemistry 119: 926-933 (1996) includes a report on a processed pseudogene from CYP51 in rats Cyp51 mouse GenEMBL NM_020010 AF166266 Mm.24155 MEQVTGGNLLSTLLIACAFTLSLVYLFRLAVGHMVQLPAGAKSP PHIYSPIPFLGHAIAFGKSPIEFLENAYEKYGPVFSLTMVGKTFTYLLGSDAAALLFN SKNEDLNAEEVYGRLTTPVFGKGVAYDVPNAIFLEQKKIIKSGLNIAHFKQYVPIIEK EAKEYFQSWGESGERNVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFT HAAWLLPAWLPLPSFRRRDRAHREIKNIFYKAIQKRRLSKEPAEDILQTLLDSTYKDG RPLTDEEISGMLIGLLLAGQHTSSTTSAWMGFFLAKDKPLQEKCYLEQKAVCGEDLPP LTYDQLKDLNLLDRCIKETLRLRPPIMTMMRMAKTPQTVAGYTIPPGHQVCVSPTVNQ RLKDSWAERLDFNPDRYLQDNPASGEKFAYVPFGAGRHRCVGENFAYVQIKTIWSTML RLYEFDLINGYFPTVNYTTMIHTPENPVIRYKRRSK CYP51P3 Cavia porcellus (Guinea pig) cavPor3_dna scaffold_60:1743956-1873082 at UCSC browser 32% to 51 human in orthologous position to CYP51P3 pseudogene inside SASH1 gene 60284 VVAEVGWLLLGFWQVSRLVVRWVVEQVTGASPCPY*PSPGPLSWAGLSASHL* 60126 60125 SPGTAAGGTESPSYLSLCSIPWAH*GCALAFW*CPAEFLENRV*EEWACLEFLWGEEEIY 59946 59945 FLVESDAAALLFKSKTEELNEEVHS 59871 CVPASGFQEG 59841 59545 VFIVETETKE*SNVGEEVEKIMYMSNFHSQS*F*KEIRSPSMWPWCSSYGA 59393 59392 IWSC*PFT 59368 WLLLGWQPWPRFSAATELTEASRTLSVRPSQSADTHEI*KKSSDSLHPLLKSTDKTGCPV 59189 59188 ACDEEAGMLT*LSFSR*LTSMTGF*MELIVARQNASRKRDL*QETVCRKALLPVTCDQL 59012 59011 RALNLLAHCIRQTSRPQ*DSSDCGKVSLFFQDSDYVFQPTGLES*WMH 58868 58867 ALKSLGDKFAHMPFRARHHCTGE 58799 58799 NFAYVQTRTV*FATLHWCGFDLINRFFPTINYTTMIRIPENLV 58671 CYP51A1 Gallus gallus (chicken) DKFZ426_22O10R1 from http://www.ri.bbsrc.ac.uk/cgi-bin/est-blast/blast.pl Roslin Institute chicken EST project 603372085F1>603372085F1 cloneID='ChEST280i22' Chondrocytes from http://www.chick.umist.ac.uk/ 603582971F1>603582971F1 cloneID='ChEST534n18' stage_36_trunks from http://www.chick.umist.ac.uk/ data from the UMIST/Nottingham/Dundee universities chicken EST sequencing project. 82% to human CYP51 81% to Fugu CYP51 MLSLLEVGGSLLERAAVGNPLSLLLAASAFALSLGYLFQLGYRRHVGADRTNH PPHIPSSIPFLGHAIAFGKSPIEFLENAYDKYGPVFSFTMVGK TFTYLLGSDAAALLFNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNVVFLEQKKMLKTGL NIAQFKQHVTLIEEETKEYFKAWGESGERNLFEAFSELIILTASHCLHGKEIRSLLNEKV AQLYADLDGGFTHAAWLLPAWLPLPSFRRRDRAHRAIKNIFYKVIQKRRSSEEKEDDMLQ TLLDASYKDGRPLTD DEIAGMLIGLLLAGQHTSSTTSAWLGFFIARDKAIQEQCYAEQKAVCGDDLPPLTYDQLK DLSLLDRCLKETLRLRPPIMTIMRLAKTPQTVAGYSIPPGHQVCVSPTVNQRLKDSWKDA LDFKPDRYLRDNPAAGEKFAYIPFGAGRHRCIGENFAYVQIKTIWSTLLRLYEFDLVDGY FPSINYTTMIHTPNNPVIRYKRRSL* CYP51A1 Xenopus laevis (African clawed frog) No accession number Hiroaki Ohi, Yoshiaki Fujita 78-80% identical to mammalian CYP51s submitted to nomenclature committee July 26, 2000 CYP51A1 Xenopus tropicalis See Xenopus pages for seq CYP51A1 Danio rerio (zebrafish) Seq zebrafish pages for seq CYP51A1 Fugu rubripes (pufferfish) No accession number Scaffold_437 78% to 51 mouse 67511 MSAHLYEMSSKLIGDTVGRVHD 67445 NLTTVVLAASFITLSLGYVSKLLLRQSFVTDA (0) KHPPYI 67266 67265 PSCIPFLGHAISFGKSPIEFLENAYEK 67185 66989 YGPVFSFTMVGSTFTYLLGSDAAALLFNSKNEDLNAEDVYSRLTTPVFGKGVAYDVPNP 66813 IFLEQKKMLKTGLNIAHFKEHVKIIEAETRE 66633 66632 YFQRWGDSGER 66600 66525 DLFEALSELIILTASSCLHGKEIRSMLDERVAQLYADLDGGFTHAA 66388 WLLPGWLPLPSF 66277 RKRDRAHREIKKIFFKVIEKRRRSGENTDDILQTLVDATYK DGRPLSDDEIGGMLIGLLLAGQHTSSTTSAWMGFFMARDRRLQ 65918 65917 ERCYAEQKAACGEDLPPLSFDQ 65852 LKDLSLLEGCLKETLRLRPPIMTMMRMARSPQ TAAGYTIPVGHQVCVSPTVNHRLGDAWEQRLEFKPDRYLDDNPAAGEKFAYI 64964 64963 PFGAG RHRCIGENFAYVQIKTIWSTLLRMYEF 64784 64783 DLVDGYFPTINYTTMIHTPHNPVIRYKRRHE* 64688 CYP51A1 Monosiga brevicola (choanoflagellate) single celled sister group to all animals) From JGI site estExt_fgenesh1_pg.C_250046 [Monbr1:38433] 51% to CYP51A1 of human MDKLPAAVVPYAEAAQEALVSLHETLGRPSTTTYLATSAVALGIWKYIRGNYLRPAKAPPKVPSQVPWLG CIFAFGQSPIEFMIDCYKKYGPVYSFVMFGTEVTYLLGSEASSRFWSTHNDVLNAEDLYANITVPVFGEG VAYAVEHKIFSEQKQMAKEGLTIDRFKAYTSMIEKETNGFIERWGQTGTIDFFDNMARMIIYTATRCLHG NETREDFDEDVAKLYHALDGGFTPQAWFFPPWLPLPSFRRRDRAHRELKERFYKIIDRRRQKAEEGTQTD LMHTFMTTPYKNVEDGRHLTTDEVSGMMIALLMAGQHTSSTVSSWLTCFITTTPGLEEKLYQEQVELFKR RPGPLSYEHINEMPLLWACIRETLRLRPPIMSIMRRAREDYKVTVNGVEYVIPKGSQVCVSPTVNGRLED EWEDPNTFNPYRFLKEEDGKLVVTEGEQITKGGKFKWVPFGAGRHRCIGFGFAQVQIRCIMSTILRKYKL EMVSGKLPPINYTTMIHTPTEPIVRYTRR* CYP51A1 Caenorhabditis elegans (nematode worm/ mammalian contamination) no accession number F54E5 contig 00150 this fragment's amino acid translation is identical to human sequences U51688, AC000120 and rat sequence AB004091. DVFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAA WLLPGWLPLPSFR The human sequences U23942, U40053 and D55653 and the rat sequences U17697, D55681, D87997, D78370 all have N instead of D at the first amino acid. This may be a polymorphism in this gene. It is very unlikely that this sequence is from C. elegans and it most likely represents a contamination by a mammalian sequence. CYP51B1 Mycobacterium tuberculosis GenEMBL Z80226 This is named in the CYP51 family even though it is in bacteria It always clusters with the other CYP51 sequences. The activity of this enzyme has been demonstrated to be a 14 alpha demethylase with preference for obtusifoliol. Mike Waterman presented the crystal structure of this protein at a recent meeting in Moscow in July 2000. See the bacterial nomenclature page for more CYP51B sequences. CYP51C1 Phaeodactylum tricornutum (diatom) GenEMBL BI307668.1 estExt_Genewise1.C_chr_310065|Phatr2 at JGI note: one of the few P450s from diatoms (stramenopiles) another is CYP97E1 from Skeletonema costatum The CYP51C subfamily is for CYP51s in the Chromista (Stramenopiles/heterokonts) 55% to Ectocarpus 51C1 MIVALVVVTGLTALWFFLLRPRTGGKHAPPVVTSSPLVPIPVIGHIVEFFRSPHFMMQRCLKDHGKVFTIPI FHKRLTFL IGPEAQEIFFKANDDVLSQSEVYDFTRPVFGNDVVYDASKKNRQVQFQTMANGLRTARLKAYIPKIEQETRAYIKNWGDA GQIDLLKALSELTILTASRCLHGEDVRTHIFKEVQELYHDLDHGLTPLTVFWPTAPSQAHRKRDVARKEMVRLFTKVIEE RQLHPERSDGTDILSLFMDIKYKDGSAVTMDQVTGLLIALLFAGQHTSCITSTWTSLFIANDPTLVSRILAEQKTVLGGD LNKPIEYEDLQNMELLHNCMREALRMCPTFIMILRKAERDVKIQSEGKSYVIPQGDMVVVSPTVSMRMKETFADPDTYDP DRFAAPREEHKQPYAYMGFGGGLHSCMGQNFAFLQVKTIISVLLREYELERVEPGMPDIGYDDMVVGPKGDCTVRYRKRV QS* CYP51C1 Thalassiosira pseudonana (diatom) No accession From JGI genome project scaffold 68 48% to 51G1 Arabidopsis 66% to CYP51C Phaeodactylum tricornutum 55% to Ectocarpus 51C1 See stramenopile pages for a tree This N-term is revised 1242369 MPFGFGSSTGPSDNLLIGLTCLAIFSLFYAFFVVRPRTKGGPHAPPVVTSSPVSSLPVVG TIVEFGKSPVKMVQRCYEDYGPVFTVPV 1242106 FHKRLTFLIGPEAQEPFFKAPDEVLSQN EVYGFMKPVFGPGIVYDASKKNRQVQFQSMANGLRTARLKGYTAKIERETRQYLESWGES GELDLFHALSELTILTASRCLHGDDVRENLFKEVSELYHDLDQGLTPLTVFFPNAPTKSH MKRNAARAKMVELFSKVIKNRRDNPDVQHSDGTDILSIFMDVKYKDGSNITDEQVTGLLI ALLFAGQHTSCITSTWTSLFILNNPAILKRIIAEQNDVFGSQPDADVDYKMVNEDMPLLH NSMKEALRLCPPLILLIRYALKDVKVKAAGKDYTIPKGDMVLISPSVGMRIPEVFKEPNT FDPDRFGPDREEDKSSPFAYMGFGGGMHSCMGQNFAFVQVKTILSVLFREFELEMVSETM PDIDYEAMVVGPKGDCRVRYKRRQ* CYP51C1 Ectocarpus siliculosus (brown algae) Genoscope Ectocarpus siliculosus brown algae project ESTs LQ0AAA4YI04FM1.SCF, LQ0AAA7YJ01FM1.SCF LQ0AAB54YL14FM1.SCF, LQ0AAB79YH22FM1.SCF LQ0AAB69YF16FM1.SCF, LQ0AAB65YF19FM1.SCF LQ0AAA9YO17FM1.SCF, LQ0AAB58YH11FM1.SCF LQ0AAB55YN02FM1.SCF, LQ0AAB70YE14FM1.SCF LQ0AAB83YM18FM1.SCF, LQ0AAB2YA05FM1.SCF 59% to Diatom CYP51C1 Thalassiosira pseudonana (ortholog) about 46% to Chlamydomonas CYP51G1 Ectocarpus sctg_6 1176507-1185087 CYP51D1 Dictyostelium discoideum (cellular slime mold) GenEMBL AU033519 47% identical to S. bicolor CYP51 all the best matches are to CYP51 note: CYP51D is a subfamily for CYP51s from Dictyostelium and other Mycetozoa MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN KSLGQSGQLADLYPNWKKVSIQFHSFSQISHYHHSKRETLPVLKSRHFPSIIQERRRSTD DSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEYFD ETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKGKT IPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFGIL QIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK* CYP51E1 Trypanosoma brucei (African sleeping sickness parasite) GenEMBL AZ217433 AQ948529 AQ950690 AL492728 AL481001 AZ217732 AQ948526 AQ647531 AL480043 AQ660405 AL495885 AZ212935 AL481816 Assembled from overlapping GSS fragments 33% to mammalian CYP51s Note: CYP51E is a subfamily for CYP51s of Euglenozoa Also AAHB01000003.1 T. brucei chromosome 11, comp (1400971-1402470) MLLEVAIFLLTALALYSFYFVKSFNVTRPTDPPVYPVTVPILGHIIQFGKSPLGFMQECKRQL KSGIFTINIVGKRVTIVGDPHEHSRFFLPRNEVLSPREVYSFMVPVFGEGVAYAAPYPRMREQ LNFLAEELTIAKFQNFVPAIQHEVRKFMAANWDKDEGEINLLEDCSTMIINTACQCLFGEDLRKRLDARRFAQ LLAKMESSLIPAAVFLPILLKLPLPQSARCHEARTELQKILSEIIIARKEEEVNKDSSTSDLLSGLLSAVYRDGTPMSLHEV CGMIVAAMFAGQHTSSITTTWSMLHLMHPANVKHLEALRKEIEEFPAQLNYNNVM DEMPFAERCARESIRRDPPLLMLMRKVMADVKVGSYV VPKGDIIACSPLLSHHDEEAFPEPRRWDPERDEKVEGAFIGFGAGVHKCIGQKFGLLQVK TILATAFRSYDFQLLRDEVPDPDYHTMVVGPTASQCRVKYIRRKAAAA* CYP51E1 Trypanosoma brucei (African sleeping sickness parasite) GenEMBL AF363026 Joubert,B.M., Nguyen,L.N., Matsuda,S.P.T. and Buckner,F.S. Cloning and functional characterization of a Trypanosoma brucei lanosterol 14 alpha-demethylase gene Mol. Biochem. Parasitol. 117 (1), 115-117 (2001) CYP51E1v1 Trypanosoma cruzi (Chagas disease parasite) GenEMBL AY283022.1 Buckner,F.S., Joubert,B.M., Boyle,S.M., Eastman,R.T., Verlinde,C.L. and Matsuda,S.P. Cloning and analysis of Trypanosoma cruzi lanosterol 14alpha-demethylase Mol. Biochem. Parasitol. 132 (2), 75-81 (2003) Submitted by Sean M. Boyle for Fred Buckner Oct 25, 2001 83% to CYP51 of T. brucei clone name CL13 CYP51-1 allele (only 3aa diffs to allele 2) MFIEAIVLALTALILYSVYSVKSFNTTRPTDPPVYPVTVPFLGH IVQFGKNPLEFMQRCKRDLKSGVFTISIGGQRVTIVGDPHEHSRFFSPRNEILSPREV YTIMTPVFGEGVAYAAPYPRMREQLNFLAEELTIAKFQNFVPAIQHEVRKFMAENWKE DEGVINLLEDCGAMIINTACQCLFGEDLRKRLNARHFAQLLSKMESSLIPAAVFMPWL LRLPLPQSARCREARAELQKILGEIIVAREKEEASKDNNTSDLLGGLLKAVYRDGTRM SLHEVCGMIVAAMFAGQHTSTITTSWSMLHLMHPKNKKWLDKLHKEIDEFPAQLNYDN VMDEMPFAERCVRESIRRDPPLLMVMRMVKAEVKVGSYVVPKGDIIACSPLLSHHDEE AFPNPRLWDPERDEKVDGAFIGFGAGVHKCIGQKFALLQVKTILATAFREYDFQLLRD EVPDPDYHTMVVGPTLNQCLVKYTRKKKLPS CYP51E1v2 Trypanosoma cruzi (Chagas disease parasite) GenEMBL AY283023.1 Buckner,F.S., Joubert,B.M., Boyle,S.M., Eastman,R.T., Verlinde,C.L. and Matsuda,S.P. Cloning and analysis of Trypanosoma cruzi lanosterol 14alpha-demethylase Mol. Biochem. Parasitol. 132 (2), 75-81 (2003) Submitted by Sean M. Boyle for Fred Buckner Oct 25, 2001 4 amino acid difference to CYP51E1v1 CYP51-2 allele, clone name WTB9 MFIEAIVLALTALILYSVYSVKSFNTTRPTDPPVYPVTVPFLGH IVQFGKNPLEFMQRCKRELKSGVFTISIGGQRVTIVGDPHEHSRFFSPRNEILSPREV YTIMTPVFGEGVAYSAPYPRMREQLNFLAEELTIAKFQNFVPAIQHEVRKFMAENWKK DEGVINLLEDCGAMIINTACQCLFGEDLRKRLNARHFAQLLSKMESSLIPAAVFMPWL LRLPLPQSARCREARAELQKILGEIIVAREKEEASKDNNTSDLLGGLLKAVYRDGTRM SLHEVCGMIVAAMFAGQHTSTITTSWSMLHLMHPKNKKWLDKLHKEIDEFPAQLNYDN VMDEMPFAERCVRESIRRDPPLLMVMRMVKAEVKVGSYVVPKGDIIACSPLLSHHDEE AFPNPRLWDPERDEKVDGAFIGFGAGVHKCIGQKFALLQVKTILATAFREYDFQLLRD EVPDPDYHTMVVGPTLNQCLVKYTRKKKLPS CYP51E1 Trypanosoma congolense Sanger Center congo469h11.p1k, congo931d01.p1k, congo605h04.p1k MLLEIVIFLLTAAALYAVYFVKSFNTTCVTDPPVYPVVLPFVGHILQFGRNPLEFMRKCK RELGSGIFTINILGKRVTIVGDPHEHSRFFLPRNEVLSPREVYSFMVPVFGEGVAYAASY PRMREQLNFLAEELTIAKFQNFVPAIQHEVRKFMVAHWDKEEGEINLLEDCSTMIINTAC QCLFGEDLRRRLDARRFARLLAMMESSLIPAAVFAPFLLRLPLPQSARCHEA RAELQQILSEIVISRKKEEVNRDSSTSDLLSGLLNAVY RDGTPMSLHEVCGMIVAAMFAGQHTSSITT TWSMLHLMHPSNKRHLETLRGEVEEFPAQLNYNNVMDEMPF AERCARESIRRDPPLLMLMRKV MSDVKVGPYVVPKGDIIACSPLLSHHDEEAFPDPRHWDPEREEKVEGAFIGFG AGVHKCIGQKFGLLQVKTVLATVFRDYDFQLLRDQVPDPDYHTMVVGPTYSQCRVKYIRRKVVTA* CYP51E1 Trypanosoma vivax Contig7098.1 86% to 51E1 T. brucei MLLEVITFLLTVLVLYSVYFVVSFNVTRPTDPPVRH VALPFLGHIIPFGKDPLGFMLECKRKLRSGVFTINIMGKRVTVVGDPREHSRFFLPRNEV LSPREVYAFMVPVFGEGVAYAAPYPRMREQLNFLAEELTIAKFQNFVPAIQHEARKFLSE NWNKDEGQVNLLEDCSAMIINTACQCLFGEDLRKRLDARRFAQLLAKMESSLIPAAVFMP SLLLLPLPQSACCRDAREELQLILSEIILARKREEVNKDSGTSDLLSGLLSAVYRDGTPM SLHEVCGMIVAAMFAGQHTSTITTTWSLLHLMHPSNRKHLETLRKEIEEFPAQLNYSNVM DEMPFAERCARESIRRDPPLIMLMRKVLADVKVGSYVVPKGDIIACSPLLSHHDEEAFPD PRRWDPEREEVIEGAFIGFGAGVHKCIGQKFGLLQVKTILATVLRDYDFELPRDEVPEPD YHTMVVGPTSSQCRVRYIRRKKPAS* CYP51E1 Leishmania major (Leishmaniasis parasite) LM7_11c.1.Contig2 L. major Friedlin chromosome 7_11c GenEMBL AZ048391 Also GenEMBL AZ048391 partial seq 76% to T. brucei CYP51E1 123423 MIGEFFLLLTAGLALYGWYFCKSFNTTRPTDPPVVHGAMPFVGHIIQFGKDPLDFMLNAK 123244 123243 KKYGGVFTMNICGNRVTVVGDVHQHNKFFTPRNEILSPREVYSFMVPVFGEGVAYAAPY 123067 123066 PRMREQLNFLAEELTVAKFQNFAPSIQHEVRKFMKANWNKDEGEINILDDCSAMIINTAC 122887 122886 QCLFGEDLRKRLDARQFAQLLAKMESCLIPAAVFLPWILKLPLPQSYRCRDARAELQDI 122710 122709 LSEIIIAREKEEAQKDSNTSDLLASLLGAVYRDGTRMSQHEVCGMIVAAMFAGQHTSTIT 122530 122529 TTWSLLHLMDPRNKRHLAKLHQEIDEFPAQLNYDNVMEEMPFAEQCARESIRRDPPLIML 122350 122349 MRKVLKPVQVGKCVVPEGDIIACSPLLSHQDEEAFPNPREWNPERNMKLVDGAFCGFGAG 122170 122169 VHKCIGEKFGLLQVKTVLATVLRDYDFELLGPLPEPNYHTMVVGPTASQCRVKYIRKKA 121993 121992 AA 121987 CYP51E1 Leishmania infantum Sanger Center Contig3290 8620 MIGELLLLLAAGLALYGWYFCKSFNTTRPTDPPVVHGTTPFVGHIIQFGKDPLGFMLKAK 8799 8800 KKYGGIFTMNICGNRITVVGDVHQHSKFFTPRNEILSPREVYSFMVPVFGEGVAYAAPYP 8979 8980 RMREQLNFLAEELTVAKFQNFAPSIQHEVRKFMKANWNKDEGEINILDDCSAMIINTACQ 9159 9160 CLFGEDLRKRLDARQFAQLLAKMESCLIPAAVFLPWILKLPLPQSYRCRDARAELQDILS 9339 9340 EIIIAREKEEAQKDTNTSDLLAGLLGAVYRDGTRMSQHEVCGMIVAAMFAGQHTSTITTT 9519 9520 WSLLHLMDPRNKRHLAKLHQEIDEFPAQLNYDNVMEEMPFAEQCARESIRRDPPLVMLMR 9699 9700 KVLKPVQVGKYVVPEGDIIACSPLLSHQDEEAFPNPREWNPERNMKLVDGAFCGFGAGVH 9879 9880 KCIGEKFGLLQVKTVLATVLRDYDFELLGPLPEPNYHTMVVGPTASQCRVKYIKKKKAA 10056 CYP51E1 Euglena gracilis GenEMBL EC678643.1 EC676958.1 EC676055.1 (N-term ESTs) EC672115.1 (C-term EST) 51% to CYP51G1 Arabidopsis MKLGVALAGAAALIAAPVVIVAFLRPKRGTAPLVPAPPVVGSLFDFMRSPLKFV KAAHEKYGDCFTVNICHKRLTFLIGPQAHQKFFKPNDQQLDQAKVYQFNVPVFGPGVVFD ADLDRRLQQFGFIQKALKPHLMKTYVRQMVEEAEDYFSSWDQEGIIDLREKLSELVIMTA SRCLMGKEVREQLFAEVARLYHDLDEGMQPISVFFPYLPIEKHRRRDIARLEMVKLFSKV LEQR NPDVYDPDRFAEPRMEDKKNGPFAFISFGGGRHTCLGEHFGMLQVKVIVAVLLQHFDLEL IEGLPEPDYAAMIVGPKPIHIKYRRVPKRF* Note: CYP51F is a subfamily for fungal CYP51 sequences CYP51F1 Saccharomyces cerevisiae (yeast) GenEMBL U10555 (24,431bp) Johnston,M., Andrews,S., Brinkman,R., Cooper,J., Ding,H., Dover,J., Du,Z., Favello,A., Fulton,L., Gattung,S., Geisel,C., Kirsten,J., Kucaba,T., Hillier,L., Jier,M., Johnston,L., Keppler,D., Langston,Y., Latreille,P., Louis,E., Macri,C., Mardis,E., Mouser,L., Nhan,M., Rifken,L., Riles,L., St.Peter,H., Thornton,L., Trevaskis,E., Vaudin,M., Vaughan,K., Vignati,D., Wilcox,L., Willis,A., Wilson,R., Wohldman,P. and Waterston,R. The complete sequence of Saccharomyces cerevisiae chromosome VIII Science 265, 2077-2082 (1994) YHR007c reverse complement of 20,961-22,553, ERG11 gene CYP51F1 Saccharomyces cerevisiae (yeast) GenEMBL T17568 (45bp) Burns,N., Grimwade,B., Ross-Macdonald,P.B., Choi,E.-Y., Finberg,K., Roeder,G.S. and Snyder,M. Large-Scale Analysis of Gene Expression, Protein Localization and Gene Disruption in Saccharomyces cerevisiae Genes Dev. 8, 1087-1105 (1994) CYP51F1 Saccharomyces paradoxus Spar_10379 98% to 51F1 S. cerevisiae see fungal pages for seq CYP51F1 Saccharomyces mikatae Smik_9639 96% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Saccharomyces kudriavzevii Skud_Contig2067.14 96% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Saccharomyces bayanus Sbay_23211 93% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Saccharomyces castellii Scas_Contig699.20 78% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Saccharomyces kluyveri Sklu_Contig2436.11 79% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Kluyveromyces waltii kwal_154-g13.1 77% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Kluyveromyces polysporus Kpol_1004p61 81% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Kluyveromyces lactis 76% to CYP51F1 Saccharomyces kluyveri see fungal pages for seq CYP51F1 Ashbya gossypii ATCC 10895 NP_984259.1 71% to CYP51F1 S. cerevisiae, see fungal pages for seq CYP51F1 Yarrowia lipolytica CAG82748.1 63% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F1 Penicillium italicum GenEMBL Z49750 (2053bp) Nistelrooy,H.G.M., Van den Brink,H.M., Kan,J.A.L., Gorcom,R.F.M. and Waard,M.A. Isolation and molecular characterisation of the gene encoding eburicol 14-alpha-demethylase (CYP51) from Penicillium italicum Unpublished (1995) CYP51F1 Uncinula necator (grape powdery mildew fungus) GenBank U72657 (1960bp) U83840 (1756bp) U72658 (1575bp) Delye,C., Laigret,F. and Corio-Costet,M.F. Cloning and sequence analysis of the eburicol 14alpha-demethylase gene of the obligate biotrophic grape powdery mildew fungus Gene 195 (1), 29-33 (1997) note: a mutation causing resistance to an inhibitor triadimenol is Y136F Delye,C., Laigret,F. and Corio-Costet,M.F. A mutation in the 14 alpha-demethylase gene of Uncinula necator that correlates with resistance to a sterol biosynthesis inhibitor. Applied and Environ. Microbiol. 63, 2966-2970 (1997) CYP51F1 Schizosaccharomyces pombe GenEMBL Z54096 (12168bp) Hunt,S. Devlin,K. and Churcher,C.M. unpublished (1995) CYP51F1 Schizosachharomyces japonicus SJAG_03750 78% to Schizosachharomyces pombe CYP51A1 see fungal pages for seq CYP51F1 Schizosachharomyces octosporus SOCG_01956 see fungal pages for seq CYP51F1 Pneumocystis carinii AY228706 contig_349-snap.1 from Fungal Genome DB contig_349-snap.2 from Fungal Genome DB 58% to CYP51F1 S. pombe see fungal pages for seq CYP51F1 Ustilago maydis GenEMBL Z48164 (1874bp) Hargreaves,J.A. and Keon,J.P.R. The sterol 14 alpha demethylase (ERG11) gene from Ustilago maydis. Unpublished (1994) CYP51F1 Ustilago maydis GenEMBL XM_401277 locus_tag="UM03662.1 CYP51F1 Puccinia graminis f. sp. tritici PGTG_07202 revised 55% to CYP51F1 Malassezia globosa with a short seq gap. see fungal pages for seq CYP51F1 Malassezia globosa MGL_2415 67% to CYP51F1 Ustilago maydis, see fungal pages for seq CYP51F1 Sporobolomyces roseus JGI model e_gw1.1.480.1 55% to CYP51F1 Ustilago maydis, see fungal pages for seq CYP51F1 Issatchenkia orientalis GenEMBL S75391 (1242bp) Burgener-Kairuz,P. Zuber,J.P., Buchman,T.G., Bille,J. and Rossier,M. Rapid detection and identification of Candida albicans and Torulopsis (Candida) glabrata in clinical specimens by species-specific nested PCR amplification of a cytochrome P-450 lanosterol-alpha-demethylase (L1A1) gene fragment. J. Clin. Microbiol. 32, 1902-1907 (1994) CYP51F1 Kluyveromyces marxianus GenEMBL X97682 (338bp) Morace,G., Posteraro,B., Sanguinetti,M., Lo Cascio,G. and Fadda,G. Identification of various medically important Candida species in clinical specimens by using PCR-REA. Unpublished 88% to CYP51F1 Kluyveromyces lactis CYP51F1 Erysiphe graminis f. sp. hordei eburicol GenEMBL AF052515 (2167bp) Delye,C., Bousset,L. and Corio-Costet,M.F. PCR cloning and detection of point mutations in the eburicol 14alpha-demethylase (CYP51) gene from Erysiphe graminis f. sp. hordei, a 'recalcitrant' fungus. Curr. Genet. 34, 399-403 (1998) CYP51F1 Candida tropicalis GenEMBL M23673 Chen,C., Kalb,V.F., Turi,T.G. and Loper,J.C. Primary structure of the cytochrome P450 lanosterol 14 alpha-demethylase gene from Candida tropicalis DNA 7 (9), 617-626 (1988) CYP51F1 Candida glabrata GenEMBL S75389 (1242bp) Burgener-Kairuz,P. Zuber,J.P., Buchman,T.G., Bille,J. and Rossier,M. Rapid detection and identification of Candida albicans and Torulopsis (Candida) glabrata in clinical specimens by species-specific nested PCR amplification of a cytochrome P-450 lanosterol-alpha-demethylase (L1A1) gene fragment. J. Clin. Microbiol. 32, 1902-1907 (1994) CYP51F1 Candida glabrata S75389 GenPept CAG58795.1 see fungal pages for seq CYP51F1 Candida parapsilosis GenEMBL AF019902 (677bp) Okhravi,N., Adamson,P., Mant,R., Matheson,M.M., Midgley,G., Towler,H.M. and Lightman,S. Polymerase chain reaction and restriction fragment length polymorphism mediated detection and speciation of Candida spp causing intraocular infection. Invest. Ophthalmol. Vis. Sci. 39 (6), 859-866 (1998) CYP51F1 Candida parapsilosis CPAG_03310 75% to CYP51F1 Candida tropicalis see fungal pages for seq CYP51F1 Candida albicans GenEMBL X13296, orf19_922 Lai,M.H. and Kirsch,D.R. Nucleotide sequence of cytochrome P450 L1A1 (lanosterol 14 alpha-demethylase) from Candida albicans Nucleic Acids Res. 17, 804 (1989) CYP51F1 Candida guilliermondii GenEMBL X97680 (336bp) Morace,G., Sanguinetti,M., Posteraro,B., Cascio,G.L. and Fadda,G. Identification of various medically important Candida species in clinical specimens by PCR-restriction enzyme analysis J. Clin. Microbiol. 35 (3), 667-672 (1997) CYP51F1 Candida guilliermondii PGUG_03415.1 71% to CYP51F1 Candida lusitaniae see fungal pages for seq CYP51F frag. Candida guilliermondii X97680 this seq is 94% to Candida albicans but only 66% to full length CYP51A1 for C. guilliermondii This is a different gene MTDQEIANLLIVILMGGQHTSTSAWFLLHLGEKPHLQDVIYQEV VELLKEKGGDLNDLTYEDLQKLPSVNNTIKETLRMHMPLHSTFRKVTNPLRSPNKYMS SHTVYMFKFL CYP51F1 Candida dubliniensis P450L1A1, partial. GenEMBL AJ012573 Morace,G., Posteraro,B., Sanguinetti,M. and Fadda,G. Identification of various medically important yeast by Reverse Cross Blot Hybridization Unpublished MTDQEIANLLIGILMGGQHTSASTSAWFLLHLGEKPHLQDAIYQ EVVELLKEKGGDLNDLTYEDLQKLPSVTNTIKETLRMHMPLHSIFRKVKNPLRIPETNYVVPRGHYVLVSPG" CYP51F1 Candida dublinensis 93% to CYP51F1 Candida albicans, cdub_2-g61.1, GenEMBL AJ012573 see fungal pages for seq CYP51F1 Candida lusitaniae CLUG_04932.1 72% to CYP51F1 Candida tropicalis see fungal pages for seq CYP51F1 Lodderomyces elongisporus LELG_03738 77% to CYP51F1 Candida parapsilosis see fungal pages for seq CYP51F1 Pichia anomala GenEMBL AF019903 Okhravi,N., Adamson,P., Mant,R., Matheson,M.M., Midgley,G., Towler,H.M. and Lightman,S. Polymerase chain reaction and restriction fragment length polymorphism mediated detection and speciation of Candida spp causing intraocular infection Invest. Ophthalmol. Vis. Sci. 39 (6), 859-866 (1998) CYP51F1 Pichia stipitis JGI model e_gww1.7.1.351.1 71% TO CYP51F1 Debaryomyces hansenii see fungal pages for seq CYP51F1 Debaryomyces hansenii GenPept CAG88416.1 75% to CYP51F1 Candida guilliermondii see fungal pages for seq CYP51F1 Neurospora crassa No accession number Neurospora crassa sequence contig 1.721 (supercontig 99) Lower case = EST support 49163 MGILQVVAGPLSQQFSQLGTVSQIGVAIASFLFVAVVLNVLQQFLFKKPNEPPLVFHWFP 49342 49343 LIGSTITYGMDPPRFFKENREK (0) YGDCFTFILLGKKT 49522 49523 TVYVGPKGNDFILNGKIRDVNAEEIYTVLTTPVFGKDVVYDCPNsklmeqkk 49678 (0) 49741 fmkialtteafrqyvpiisdevtsylkrtadfkgksgivdippkmaqitiftashalqg 49917 49918 keirdkfdetladlyhdldmgfspinfmlhwaplpwnnrrdyaqrtvakiymdtikerra 50097 50098 rgetgaqdimwhlmnstykgdvpvpdheiahmmiallmagqhsssstsswimlrlasr 50271 50272 pdimedlyneqvknlgadlppltyedlaklplhaaivkETLRLHAPIHSIMRAVKTPMPV 50451 50452 PGTKYVIPTDHVLLAAPGVSATDESYFPQPDLWEPHRWEKDSPLAPTIVRNVPSEEDDEK 50631 50632 IDYGYGLVSKGANSPYLPFGAGRHRCIGEQFANVQLQTILAIIVRNFKFRNVDGSDkvig 50811 50812 tdyaslfsrplepakiywerregcql* 50892 CYP51F1 Neurospora discreta JGI gene model estExt_fgenesh3_kg.C_20410 98% to CYP51F1 N. crassa see fungal pages for seq CYP51F1 Aspergillus nidulans AACD01000145 (WGS section) AN8283.2 51 clan MGLVSLVLDNVCERCSALSVWALSGLGLLSVIIIAVVLNVLRQILFKNPN EPPVVFHWFPFIGSTISYGIDPYKFFFNCRAQYGDIFTFVLLGKKTTVYL GTKGNDFILNGKLKDVCAEEVYSPLTTPVFGRHVVYDCPNAKLMEQKKFV KYGLTSDALRSYVQLITAEVEDFAQKSSVFQNAKGVFDVSRTIAEITIYT ASRSLQGKEVRDKFDSTFAELYHDLDMGFAPINFMLPYAPLPHNRKRDAA QRKMAETYMEIIKERRKSGEKKDSEDMVWNLMSCVYKNGTPLSDEEIAHM MIALLMAGQHSSSSTLSWILLHLARHPEIVEELYQEQLKVLGSDMHMTYD DLQKLELHSKIIKETLRIHAPIHSIIRAVKSPMPVPGTSYVIPTSHNVLS SPGVTARSDEFFPNPLKWDPHRWDSNPIANSTEDEEKIDYGYGLVSKGTN SPYLPFGAGRHRCIGEQFAYVQLITVTAALVRLFKFDTVSESDKSSVPET DYSSLFSRPAGKCFVQYEKRNVTTKA* CYP51F1 Magnaporthe grisea AACU01001110 cont2.837 MG04432.4 (version4) 75% to CYP51 MGLLQDTTGPLVDAFYQLGTGAQVGVAFVSFIFLSVFFHVAQQIFFKNPH EPPVVFSWFPVVGSTVTYGKDPPQFFRDMAKKYGNIFTFILLGKKTTVYI GTEGNEFILNGKLRDVNAEEIYGPMTTPVFGKDVVYDCPNAKLMEQKKFM KIALTTEAFRSYVPIIADEVSSYLKRTPAFKGPSGVVNIPPKMAEITIFT ASHALQGKEIRDQFDETLADLYHDLDMGFHPVNFKLHWLPLPRNIRRDKA QKTIAKIYMDTIQRRRAKGKDSEAKDMMYHLMNSTYKNGTPVPDHEIAHM MIALLMAGQHSSSSTSSWIMLRLASRPDIMEELYQEQVRALGADLPPLRY EDLANLPLHLAVIKETLRLHAPINSILRAVKQDLPVPGTNYVIAKDTTVL AAPGYSAGDPNHFPEPELWEPHRWEADSRLAPRISMSNDNDEEEKIDYGY GLVSKGTTSPYLPFGAGRHRCIGEHFANVQLQTIVAMIVREFKFRNVDGS GKVVGTNYASLFSRPEEPAKIYWERR* CYP51F1 Fusarium graminearum AACM01000048 (WGS section) FG01000.1 FGcontig1.48_scaffold1 64% to Erysiphe graminis 72% to M. grisea 51F1 MGLLQELAGHPLAQQFQELPLGQQVGIGFAVFLVLSVVLNVLNQLLFRNP NEPPMVFHWFPFVGSTITYGMDPPTFFRENRAKHGDVFTFILLGKKTTVA VGPAGNDFILNGKLKDVCAEEIYTVLTTPVFGKDVVYDCPNAKLMEQKKF MKIALTTEAFRSYVPIISSEVRDYFKRSPDFKGKSGIADIPKKMAEITIF TASHALQGSAIRSKFDESLAALYHDLDMGFTPINFMLHWAPLPWNRKRDH AQRTVAKIYMDTIKERRAKGNNESEHDMMKHLMNSTYKNGIRVPDHEVAH MMIALLMAGQHSSSSTSSWIMLRLAQYPHIMEELYQEQVKNLGADLPPLT YEDLAKLPLNQAIVKETLRLHAPIHSIMRAVKSPMPVPGTKYVIPTSHTL LAAPGVSATDSAFFPNPDEWDPHRWEADSPNFPRMASKGEDEEKIDYGYG LVSKGSASPYLPFGAGRHRCIGEHFANAQLQTIVAEVVREFKFRNVDGGH TLIDTDYASLFSRPLEPANIHWERRQ* CYP51F1 Fusarium oxysporum FOXG_00394 92% to CYP51F1 Fusarium graminearum see fungal pages for seq CYP51F1 Fusarium verticillioides 98% to CYP51F1 Fusarium oxysporum FVEG_01123 see fungal pages for seq CYP51F1 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_2000426 Necha1/scaffold_2:1348428-1350174 genome annotation in progress 89% to Fusarium graminearum CYP51F1 FG01000.1 AACM01000048 FGcontig1.48_scaffold1 This gene model seems correct DRN 2/4/06 MGLLHEIAGHPLAQQFQELPLGQQVGIGFGAIVVLSVILNILSQILFVNPNEPPMVFHWFPFIGSTVTYG MDPPRFFKENRAK (0) FGDVFTFVLLGKKTTVAVGPSGNDFILNGKLKDVCAEEIYTVLTTPVFGKDVVYDCPNAKLMEQKK (0) FMKIALTTEAFRSYVPIISGEVRDYFKKSADFKGKTGIVDIPKKMAEITIFTASHALQGSV IRSKFDESLAALYHDLDMGFTPINFMLHWAPLPWNRKRDHAQRTVAKIYMDTIRERRIKDNDDSEHDMMK HLMNSTYKNGTPVPDHEIAHMMIALLMAGQHSSSSTSSWIMLRLAQYPQIMEELYQEQVKALGADLPPLK FEDLAKLPLNQAIIKETLRLHAPIHSIMRAVKSPMPVPGTKYVIPTSHTLLAAPGVSASDPAYFPNPDEW DPHRWEAGSPNAPTIARNNAEEEEKIDYGYGLVSKGSASPYLPFGAGRHRCIGEHFANVQLQTIVAEVVR EFKFSNVDGGNTLIGTDYASLFSRPLEPANIRWERRQ* CYP51F1 Aspergillus fumigatus GenEMBL AAK73660.1 also XP_749134.1, EAL87096.1 14-alpha sterol demethylase 79% to 51F1 A. nidulans MGLIAFILDGICKHCSTQSTWVLVGIGLLSILAVSVIINVLQQLLFKNPHEPPVVFHWFPFIGSTISYGI DPYKFFFDCRAKYGDIFTFILLGKKTTVYLGTKGNDFILNGKLRDVCAEEVYSPLTTPVFGRHVVYDCPN AKLMEQKKFVKYGLTSDALRSYVPLITDEVESFVKNSPAFQGHKGVFDVCKTIAEITIYTASRSLQGKEV RSKFDSTFAELYHNLDMGFAPINFMLPWAPLPHNRKRDAAQRKLTETYMEIIKARRQAGSKKDSEDMVWN LMSCVYKNGTPVPDEEIAHMMIALLMAGQHSSSSTASWIVLRLATRPDIMEELYQEQIRVLGSDLPPLTY DNLQKLDLHAKVIKETLRLHAPIHSIIRAVKNPMAVDGTSYVIPTSHNVLSSPGVTARSEEHFPNPLEWN PHRWDENIAASAEDDEKVDYGYGLVSKGTNSPYLPFGAGRHRCIGEQFAYLQLGTITAVLVRLFRFRNLP GVDGIPDTDYSSLFSKPLGRSFVEFEKRESATKA CYP51F1 Neosartorya fischeri NRRL 181 XM_001261294.1, NFIA_024690 98% to CYP51F1 A. fumigatus = ortholog 63% to CYP51F2 A. fumigatus MGLIAFILDGICKHCSTQSTWVLVGVGLLSILAVSVIANVLQQL LFKNPHEPPVVFHWFPFIGSTISYGIDPYKFFFDCRAKYGDIFTFILLGKKTTVYLGT KGNDFILNGKLRDVCAEEVYSPLTTPVFGRHVVYDCPNAKLMEQKKFVKYGLTSDALR SYVPLITDEVESFVKNSPAFQGHKGVFDVCKTIAEITIYTASRSLQGKEVRSKFDSTF AELYHNLDMGFAPINFMLPWAPLPHNRKRDAAQRKLTETYMEIIKARRQTGSKKDSED MVWNLMSCVYKNGTPVPDEEIAHMMIALLMAGQHSSSSTASWIVLRLATRPDIMEELY QEQIRVLGSDLPPLTFDNLQKLDLHAKVIKETLRLHAPIHSIIRAVKNPMAVDGTSYV IPTSHNVLSSPGVTARSEEHFPNPLEWDPHRWDEDIAASAEDDEKVDYGYGLVNKGTN SPYLPFGAGRHRCIGEQFAYLQLGTITAVLVRLFKFRNLPGVDGVPDTDYSSLFSKPL GRSFVEFEKRESATKA CYP51F1 Aspergillus oryzae GenEMBL BAE60239.1 79% to CYP51F1 Aspergillus nidulans, 60% to CYP51F4 MGILAVILDSVCERCSGSSLWMLSTVALLSILVVSVVINVLRQLLFKNYKEPPLVFHWFPFIGSTISYGM DPYRFFFNCREKYGDIFTFVLLGKKTTVYLGTKGNDFILNGKLRDVCAEEVYSPLTTPVFGRHVVYDCPN AKLMEQKKFVKFGLTSDALRSYVRLITEEVEDFVQKSSALQGPNGVFDVCKTIAEITIYTASRSLQGKEV RSRFDSTFAELYHDLDMGFAPINFMLPWAPLPHNRKRDAAQKRMTETYMEIIKERRKAGSKKDSEDMVWN LMSCMYKDGTPVPDEEIAHMMIALLMAGQHSSSSTAAWIVLHLAASPEITEELYQEQLRILGHDMPPLTY ENLQKLDLHAKVIKETLRIHAPIHSIIRAVKNPMPVEGTPYVIPTSHNVLSSPGVTARSEEHFPDPLEWK PHRWDEAIAVSSEDEEKVDYGYGLVTKGTNSPYLPFGAGRHRCIGEQFAYVQLGAITAALVRLFKFSNLP GVQTLPDTDYSSLFSKPLGNSKIQFEKREPVTKA CYP51F1 Aspergillus flavus AFL2G_06478 100% to CYP51F1 Aspergillus oryzae CYP51F1 Aspergillus niger estExt_fgenesh1_pm.C_150008|Aspni1 84% to CYP51F1 MGLLAVVLDSLCERCSNTSVLVVLGFGLLSLLAVSVIFNILRQLFFKNPNEPPLVFHWFPFIGSTISYGMDPYKFFFDCR AKYGDIFTFILLGKKTTVYLGTRGNDFILNGKLRDVCAEEVYSPLTTPVFGRNVVYDCPNAKLMEQKKFVKFGLTSDALR SYVRLITGEVENFVEHSAAFKGSSGVFDVCKTIAEITIYTASRSLQGKEVRSRFDSTFAELYHDLDMGFAPINFMLPWAP LPHNRKRDAAQKKMTETYMEIIKERRNGGNKKDSEDMVWNLMSCIYKDGTPVPDEEIAHMMIALLMAGQHSSSSTASWIV LHLARNPQIMEELYAEQIRVLGSDLPPLTYDNLQKLDLHAKVIKETLRIHAPIHSIIRAVKNPMPVDGTPYVIPTSHNVL SSPGVTARSEEYFPNPLKWDPHRWDETIAASAEEEDQIDYGYGLVSKGTNSPYLPFGAGRHRCIGEQFAYVQLGAITAAL VRLFKFRNLPNVKDIPETDYSSLFSKPAGKSIIQFEKRATTIKS* CYP51F1 Aspergillus clavatus NRRL 1 XM_001273213.1, ACLA_005420 89% to 51F1 S. fumigatus 61% to 51F2 A. fumigatus MGLITDILDGVCKYCSTQSIWMLGGVGLLSLLAVAVVVNVLRQL LFKNPNEPPLVFHWFPFIGSTISYGIDPYKFFFDCRAKYGDIFTFILLGKKTTVYLGT KGNDFILNGKLRDVCAEEVYSPLTTPVFGRHVVYDCPNAKLMEQKKFVKFGLTSDALR SYVHLITDELESFVKSSSAFQGPKGVFDVCKTIAEITIYTASRSLQGKEVRNKFDSTF AELYHDLDMGFAPINFMLPWAPLPHNRKRDAAQRKLTETYMEIIKARRQAGNEKDSED MVWNLMSCVYKNGVPVPDEEIAHMMIALLMAGQHSSSSTAAWIVLRLATRPDIMEELY QEQLRVLGSDLPPLTYDSLQKLDLHAKVIKETLRLHAPIHSIIRAVKNPMPVEGTPYV IPTSHNVLSSPGVTARSEEHFADPLEWNPHRWDEHIVENDEDEEKIDYGYGLVNKGTN SPYLPFGAGRHRCIGEQFAYVQLGTITAGLARLFKFRNLPDIEGIPDTDYSSLFSKPL GKSVVQFEKREPALKA CYP51F1 Aspergillus terreus NIH2624 XM_001212028.1, ATEG_02850.1 MGVFADIIGTLCEHCSTLSIWALGGAVLLAVFVLSVVINVLRQL LFKNPHEPPVVFHWVPWIGSTIRYGIDPYKFFFECRAKYGDIFTFVLLGKKTTVYLGT KGNDFILNGKLRDVCAEEVYSPLTTPVFGRHVVYDCPNAKLMEQKKFVKFGLTSDALR SYARLITAEVDDFVKNSPTFQQPKGTFDVCKTIAEITIYTASRSLQGKEVRNRFDSTF AEMYHDLDMGFAPINFVLPWAPLPHNRKRDAAQKKMAETYMEIIKERRQAGGKKDSED MVWNLMSCVYKDGTPLPDEEIAHMMIALLMAGQHSSSSTASWIVLRLAENPEITEELY QEQLRVLGSDLPPLTWDSLQKLDLHAKVIKETLRIHAPIHSIMRAVKNPMPVEGTPYV IPTSHNVLSSPGVTARSEEYFSEPLKWDPHRWDETAAPSTEEEEETIDYGYGLVTKGT NSPYLPFGAGRHRCIGEQFAYVQLGAITAALVRLFKFSNPPGANKLPDTDYSSLFSKP LGKSYVQFEKRETNTKA CYP51F1P Aspergillus terreus ATEG_10302.1 revised 71% to CYP51F1 Aspergillus terreus see fungal pages for seq CYP51F1 Mycosphaerella graminicola Fungal P450 Database Mgr032 63% to CYP51F1 Aspergillus fumigatus see fungal pages for seq CYP51F1 Mycosphaerella fijiensis JGI model e_gw1.3.1317.1 81% to CYP51F1 Mycosphaerella graminicola see fungal pages for seq CYP51F1 Uncinocarpus reesii UREG_07804.1 78% to CYP51F1 Ajellomyces capsulatus see fungal pages for seq CYP51F1 Coccidioides immitis CIMG_07469.2 89% to CYP51F1 Uncinocarpus reesii see fungal pages for seq CYP51F1 Ajellomyces capsulatus NAm1 (Histoplasma capsulatum) XM_001540158.1 63% to CYP51F2 A. fumigatus 74% to CYP51F1 A. fumigatus MGLLADVVARVCAHCSTLSPWAFLLTGSATLIVLSVIINVLHQL LWKNPNEPPVVFHWFPIIGSTISYGIDPYKFFLNCREKYGDIFTFVLLGKKTTVFLGT KGNDFILNGKLKDVCAEEVYSPLTTPVFGSHVVYDCPNSKLMEQKKFVKYGLTSESLR SYVTLITDEFNQYIKTSPAFQGDKGVLDVCKSIAEITIYTASRSLQGKEVRSKFDSSF AQLYHDLDMGFTPINFMLPWAPLPHNRKRDAAQQKMTKIYTDIIRQRREAGGKKDSED MVWNLMSCVYKDGTPLPDIEIAHMMIALLMAGQHSSSSTFSWIILRLASCPHIIEELY EEQKQVLGEDLPPLTYETLQKLNLNSHVIRETLRIHAPIHSILRAVKSPMPVDGTRYT IPTTHNLLAAPGVTSRLPEHFPNPMTWDPHRWENPAMAPEGDQSDEKLDYGYGLVSKG ANSPYLPFGSGRHRCIGEQFAYVQLGTLLVAIVRQLKLKKLDGETGVPATDYSSLFSK PLGKPLVAWERRNPTQK CYP51F1 Histoplasma capsulatum G217B, teleomorph: Ajellomyces capsulatus HCB02684.1 model is a fusion protein, top part deleted 96% to CYP51F1 Ajellomyces capsulatus NAm1 MGLLADVVARVCAHCSTLSLWALLLAGSASFIVLSVIINVLHQLLWKNPNEPPVVFHWFPIIGSTISYGIDPYKFFLNCR EKYGDIFTFVLLGKKTTVFLGTKGNDFILNGKLKDVCAEEVYSPLTTPVFGRHVVYDCPNSKLMEQKKFVKYGLTSESLR SYVTLITDEFNQYIKTSPAFQGDKGVLDVCKSISEITIYTASRSLQGKEVRSKFDSSFAQLYHDLDMGFSPINFMLPWAP LPHNRKRDAAQQKMTKIYTDIIRQRREAGGKKDSEDMVWNLMSCVYKDGTPLPDVEIAHMMIALLMAGQHSSSSTLSWII LRLASCPHIIEELYEEQKQVLGEDLPPLTYETLQKLNLNSHVIRETLRIHAPIHSILRAVKSPMPVDGTRYTIPTTHNLL AAPGVTSRLPEHFPNPMTWDPHRWENPAMAQEEDQSGEKLDYGYGLVSKGANSPYLPFGSGRHRCIGEQFAYVQLGTLLA AIVRQLKLKKLDGETGVPATDYSPLGKPLVAWERRNPTQK CYP51F1 Venturia inaequalis strain Ent27 GenEMBL AF227920 14 alpha-demethylase MGLLSPLLAXLPGSDRSWLFYTLASFGFTVAIVAANLVKQLLFS NPNEPPVVFHWFPFFGNTVVYGIDPIKFFAECKEKHGDIFTFILLGRKTTVYIGTKGN EFILNGKQSHVNAEEIYSPLTTPVFGSDVVYDCPNSKLMEQKKFVKYGLTTE ALKSYV TLIQQEVEDYTKRYPQFKGEKGSFDVCASMAEITIFTASRSLQGKEVRDKFDASFADL FHDLDMGFSPINFMLPWAPLPHNRRRDAANKKMTETYLEIIQSRKAEGVKKDSEDMIW NLMQCVYKNGTPIPDKEIAHMMIALLMAGQHSSSSTSSWILLRLATRPDIQEELYQEQ IRVCGADLPPLQYEDLARMPLHNQIIKETLRMHSPIHSILRAVKQPMPVEGTPYTIPT SHVLLAAPIASGGSPMYFPAPEKWEPHRWDEGSGGTNISGGENGGEEKEDYGYGLITK GASSPYLPFGAGRHRCIGEQFAYMQLNTVLATQVREFKFSLREGESFPKTDFSSLFSG PLRPAWLNWERREKSS CYP51F1 Phanerochaete chrysosporium (white rot fungus) Scaffold_44, JGI gene model ug.2.6.1 54% to Ustilago maydis CYP51F1 MSLSQYGPIAGLVGQAYDALASMSTSRLVLFLLINIPILSV LPKDKSLPPVVWHWFPWFGSAAAYGEDPIKFFFDCKEK YGNVFTFILMGRKVTVALTPAGNNFIMGGKHTTFSAEEVYG GLTTPVFGKDVVYDCPNELLMEQKKFVKFGLSTENFRQYVGMIEEEVLQFMRNDASFK IYQMNDINEWGAFDVLKVMSEITILTASRTLQGKEVRANITKDYAQVYNDLDGGFTPLHF MFPNLPLESYRKRDAAHKKISDFYISIIRKRRENPGQ EEHDMIAALMNQKYRVGRPLKDHEIAHIMIALLMAGQHTSSATGSWALLHIADRPDVA EALYEEQVKHFRQSDGSWRTPEYEELKELPVLDSVIRETLRIHPPIH SIMRAVREDVVVPPTLAAPSEDGRYVIPKGHVVLSSAAISQVDPMLWKNANDWDPSRWSD PEGVAAQAYKQYDDAEGAKVDFGFGLVSKGTDSPYQPFGAGRHRCIGEQFAYLQLGTIISTFVR HVEMRLPETGVPPPNYHVRSHTRILRLVCADVASADDDHTPEGASQHSLQAA* CYP51F1 Cryptococcus neoformans var. neoformans B-3501A chromosome 1 56% to CYP51F1 Phanerochaete chrysosporium AAEY01000001 CNBA0300 ESTs gb|CF192489.1 gb|CF192488.1, gb|CF191439.1 EAL23379.1 see fungal pages for seq CYP51F1 Rhizopus oryzae (Zygomycete fungus, Mucoromycotina) RO3G_11790.1 46% to CYP51F1 Aspergillus fumigatus = CYP51F1 see fungal pages for seq CYP51F1P Rhizopus oryzae (Zygomycete fungus, Mucoromycotina) RO3G_08504.1 revised 60% TO CYP51F1 Rhizopus oryzae see fungal pages for seq CYP51F1 Phycomyces blakesleeanus (Zygomycete fungus, Mucoromycotina) JGI model estExt_fgeneshPB_pg.C_10567 73% to CYP51F1 Rhizopus oryzae see fungal pages for seq CYP51F1 Batrachochytrium dendrobatidis JEL423 (chytrid) BDEG_01891 43% to CYP51F1 Rhizopus oryzae note this species has no obvious CYP61 sequence see fungal pages for seq CYP51F2 Aspergillus nidulans AACD01000029 (WGS section) AN1901.2 64% to 51F2, 60% to 51F1 MLSPTLFSAYVLVGAIVAVLVNVIRQIFFRNKNEPPMVFHWVPFVGSTIS YGMNPYKFFFSCREKYGDIYTFVMLGKKMTVYMGVKGNDFILNGKLKDLN AEEVYSPLTTPVFGSDVVYDCPNSKLMEQKKFIKFGLSQSALESHVPLIE KEVLDYIKTSPRFKGDSGVLDAPAAMAELTIYTAGSALQGKEVRKKLTAE FADLFHDLEMGFTPINFILPWAPLPQNRKRDIAHARMRETYMEIINQRRK NPDAQDHDMIWNLMHSTYKNGNPVPDKEIAHIMITLLMAGQHSSSSISAW ILLRLASEPQILEELYQEQLANLKRDPRTGAFEPLQYKDLDLLPLHQNVI KETLRVHLSIHSILRKVKNPIPVPDTPYIIPTSHTLLASPGATALSDEYF PNANMWDPHRWENQRPDKEDEEGELIDYGYGAVSKRMSSPYLPFGGGRHR CIGEKFAYVNLGVIVATIVRNLKLYNVDGKTGVPATDYSSMFMGPMKPAV VGWERRFPARS* CYP51F2 Magnaporthe grisea AACU01001139 cont2.870 MG04628.4 (version4) 62% to CYP51 Penicillium MAFFFPSAPVWVYSAGAALLFIIGSIILNFIWQQLPRPKSEPPLVFHWLP FIGNAVSYGMDPYRFYSQCREKHGDVFTFVLFGRRMTVFLGVQGNDFILN GKLQDLNAEEIYSPLTTPVFGSDIIYDCPNSKLMEQKKFVKFGLTQKALD SYVPLIEREVLDYIESSPVFQAGNHGIVDIPSMMAEITIFTASRTLQGPE VRKKLTGEFARLYHDLDLGFRPINFLAPWAPLPQNRRRDVAHARMRDVYM DLINKRRRQKDDQEEEEEAEPDMIRHLMGSCVYKNGQALPDKEIAHMMIT LLMAGQHSSSSSSAWIMLRLASRPDIAEEVYQEVQRLGHASLQHSDLDKL PLLANVVKETLRVHSSIHSIMRKVKRPMRIPGSDYVVTPGKVLVSAPIMT HLDEEHFRDARAWEPHRWDDAVDAQDDEIVDYGYGATSKGTKSPYLPFGA GRHRCIGEKFAYLNLAAIVSTLVRNFKFSTLDGKATVPPTDYTSMFSRPM QPATVRWERRSPKTA* CYP51F2 Fusarium graminearum AACM01000179 (WGS section) FG04092.1 FGcontig1.179_scaffold2 60% to CYP51F1 Penicillium 68% to Mg CYP51F2 54% to Mg 51F1 MFHLLIYPLWVLVALFAVIIANLLYQQLPRRPDEPPLVFHWFPFFGNAVA YGLDPCGFFEKCREKHGDVFTFILFGRKIVACLGVDGNDFVLNSRLQDAN AEEVYGPLTIPVFGSDVVYDCPNSKLMEQKKFVKFGLTQKALESHVQLIE REVLDYVETDPSFSGRTSTIDVPKAMAEITIFTASRSLQGEEVRRKLTAE FAALYHDLDLGFRPVNFLFPWLPLPHNRKRDAAHIKMREVYMDIINDRRK GGIRTEDGTDMIANLMGCTYKNGQPVPDKEIAHMMITLLMAGQHSSSSAS SWIVLHLASSPDITEELYQEQLVNLSVNGALPPLQYSDLDKLPLLQNVVK ETLRVHSSIHSILRKVKRPMQVPNSPYTITTDKVIMASPTVTAMSEEYFE NAKTWNPHRWDNRAKEEVDTEDVIDYGYGAVSKGTKSPYLPFGAGRHRCI GEKFAYVNLGVIVATLVRNFRLSTIDGRPGVPETDYTSLFSRPAQPAFIR WERRKKI* CYP51F2 Fusarium oxysporum FOXG_11545 83% to CYP51F2 Fusarium graminearum see fungal pages for seq CYP51F2 Fusarium verticillioides FVEG_10277 94% to CYP51F2 Fusarium oxysporum see fungal pages for seq CYP51F2 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_7000156 Necha1/scaffold_7:419968-421550 80% to Fusarium graminearum CYP51F2 57% to CYP51F1 like seq fgenesh1_pg.scaffold_2000426, genome annotation in progress Note only one intron, not two as seen in CYP51F1 MLLLVWYPVLAIGAFLGIILIHALRQKLFRKAGEPPLIFHWLPFIGNAVSYGLDPCNFFMKCREK (0) HGDVFTFVLFGRKIVCCLGVEGNDFVLNSRLQDANAEEIYGPLTIPVFGSDVVYDCPNSKFMEQKKFVKFGLTQK ALESHVQLIEREVLDYIHAVPAFSGSEGTVDISNAMAEITIFTAARSLQGEEVRRKLTAEFAALYHDLDL GFKPVNFLFPWAPLPHNRKRDAAHAKMRDIYMDIINKRRAEGGDLGEYSDMIANLMGCTYKNGQPVPDKE IAHMMITLLMAGQHSSSSASSWIMFRLASRPDIAEELYQEQLTHLSVDGYLPPLQHSDLDKLTLLQNVVK ETLRVHSSIHSILRKVKTPMQAPGTQYTITTDKVILASPTVTALSEEYFSNARTWDPHRWDNKEKEEVAG DDVVDYGYGKVSKGTKSPYLPFGAGRHRCIGEKFAYVNLGVIVATMVRNFKLSTMDGKSGVPATDYNSLF SRPVQPAYIRWERRKA* CYP51F2v1 Aspergillus fumigatus strain = ATCC 36607 GenEMBL AAF32372.1 also AAK73659.1 cytochrome P450 sterol 14 alpha-demethylase variant MVPMLWLTAYMAVAVLTAILLNVVYQLFFRLWNRTEPPMVFHWVPFLGSTISYGIDPYKFFFACREKYGD IFTFILLGQKTTVYLGVQGNEFILNGKLKDVNAEEVYSPLTTPVFGSDVVYDCPNSKLMEQKKFIKYGLT QSALESHVPLIEKEVLDYLRDSPNFQGSSGRMDISAAMAEITIFTAARALQGQEVRSKLTAEFADLYHDL DKGFTPINFMLPWAPLPHNKKRDAAHARMRSIYVDIINQRRLDGDKDSQKSDMIWNLMNCTYKNGQQVPD KEIAHMMITLLMAGQHSSSSISAWIMLRLASQPKVLEELYQEQLANLGPAGPDGSLPPLQYKDLDKLPFH QHVIRETLRIHSSIHSIMRKVKSPLPVPGTPYMIPPGRVLLASPGVTALSDEHFPNAGCWDPHRWENQAT KEQENDEVVDYGYGAVSKGTSSPYLPFGAGRHRCIGEKFAYVNLGVILATIVRHLRLFNVDGKKGVPETD YSSLFSGPMKPSIIGWEKRSKNTSK CYP51F2v2 Aspergillus fumigatus Af293 GenEMBL XP_752137.1 also EAL90099.1 14-alpha sterol demethylase variant 74% to 51F2 A. nidulans 62% to 51F1 A. nidulans only 5 aa diffs to CYP51F2v1, may be a stran variant MVPMLWLTAYMAVAVLTAILLNVVYQLFFRLWNRTEPPMVFHWVPYLGSTISYGIDPYKFFFACREKYGD IFTFILLGQKTTVYLGVQGNEFILNGKLKDVNAEEVYSPLTTPVFGSDVVYDCPNSKLMEQKKFIKYGLT QSALESHVPLIEKEVLDYLRDSPNFQGSSGRVDISAAMAEITIFTAARALQGQEVRSKLTAEFADLYHDL DKGFTPINFMLPWAPLPHNKKRDAAHARMRSIYVDIITQRRLDGEKDSQKSDMIWNLMNCTYKNGQQVPD KEIAHMMITLLMAGQHSSSSISAWIMLRLASQPKVLEELYQEQLANLGPAGPDGSLPPLQYKDLDKLPFH QHVIRETLRIHSSIHSIMRKVKSPLPVPGTPYMIPPGRVLLASPGVTALSDEHFPNAGCWDPHRWENQAT KEQENDKVVDYGYGAVSKGTSSPYLPFGAGRHRCIGEKFAYVNLGVILATIVRHLRLFNVDGKKGVPETD YSSLFSGPMKPSIIGWEKRSKNTSK CYP51F2 Neosartorya fischeri NRRL 181 XM_001267337.1, NFIA_109350 97% to CYP51F2 Aspergillus fumigatus MVPMLLLTAYMAVAMLTAILLNVVYQLFFRLWNRTEPPMVFHWV PFLGSTISYGIDPYKFFFACREKYGDIFTFILLGQKTTVYLGVQGNEFILNGKLKDVN AEEVYSPLTTPVFGSDVVYDCPNSKLMEQKKFIKYGLTQSALESHVPLIEKEVLDYLR NSPNFQGSSGQVDISAAMAEITIFTAARALQGQEVRSKLTAEFADLYHDLDKGFTPIN FMLPWAPLPHNKKRDAAHARMRSIYIDIINQRRLDGEKDSQKSDMIWNLMNSTYKNGQ QVPDKEIAHMMITLLMAGQHSSSSISAWIMLRLASQPKVLEELYQEQLANLGPVGPDG SLPPLQYKDLDKLPFHQHVVRETLRLHSSIHSLMRKVKSPLPVPGTPYMIPPGRVLLA SPGVTALSDEHFPNAGCWDPHRWENQAAKEQENDEVVDYGYGAVSKGTSSPYLPFGAG RHRCIGEKFAYVNIGVILATIVRHLRLFNVDGKKGVPETDYSSLFSGPMKPSIIGWEK RSKDTSK CYP51F2 Aspergillus oryzae GenEMBL BAE57417.1 77% to CYP51F4, 75% to CYP51F2 Aspergillus nidulans MASFTLVSAYAAAGLLAIIVLNLLRQLLFRNKTDPPLVFHWIPFLGSTVTYGMDPYAFFFSCRQKYGDIF TFILLGRKITVYLGIQGNEFILNGKLKDVNAEEIYSPLTTPVFGSDIVYDCPNSKLMEQKKFIKFGLTQA ALESHVPLIEKEVLDYLKTSPNFKGTSGRVEITDAMAEITIFTAGRALQGEEVRKKLTTEFADLYHDLDR GFTPINFMLPWAPLPRNRKRDAAHARMREIYMDIINERRKNPDRETSDMIWNLMHCTYKNGQPLPDKEIA HMMITLLMAGQHSSSSISSWIMLRLASEPAVMEELYQEQITKLSPDGRTLPPLQYRDLDLLPLHQNLIKE TLRLHLSIHSLMRKVKNPMPVPGTPYVVPADHVLLASPGVTALSDEYFPNASRWDPHRWENRVEKEDEED IVDYGYGTVSKGTSSPYLPFGAGRHRCIGEKFAYVNLGVIVATMARHMKLFNVDGKKGVPATDYSSMFSG PSKPAIIGWERRFPEKS CYP51F2 Aspergillus flavus AFL2G_02771 99% to CYP51F2 Aspergillus oryzae, 1 aa diff see fungal pages for seq CYP51F2 Aspergillus clavatus NRRL 1 XM_001271578.1, ACLA_046180 81% to CYP51F2 Aspergillus fumigatus MLSLTLFGLYLVSATAVVILVNVVYQHLFRLRNRTEPPMVFHWI PFIGSTITYGIDPCKFFFACREKYGNIFTFILLGQKVTVYLGVEGNEFILNGKLKDVN AEEVYSPLTTPVFGSDVVYDCPNSKFMEQKKFIKFGLTQSALEAHVPLIEKEVLDYLE TSPRFQGTSGLVDIAAAMAEITIFTAARALQGEEVRSKLTAEFADLYHDLDRCFTPVN FMFPWAPLPRNKKRDAAHVRMRAIYVDIINQRRRDGGEDTQKSDMIWNLMNCSYKNGQ QVPDKEIAHMMITLLMAGQHSSSSIGSWIMLRLASQPEVLEKLYQEQLDKLAQGGPGS NLRPLQYKDLELLPYHQHVIRETLRLHSSIHSILRKVKNTLSVPGTSYVIPPGRVLLA SPGVTALSDEHFPNASRWDPQRWENQTTKEDSGEMVDYGYGAVSKGTASPYLPFGAGR HRCIGEKFAYVNIGVILATLVRHLRLSNMDGKEGVPATDYSSLFSGPMKPSIIQWGKR SNDLSK CYP51F2 Aspergillus terreus NIH2624 XM_001215095.1, ATEG_05917.1 78% to CYP51F2 Aspergillus fumigatus MSLLTITSYAAVGILALIAWNVIRQLLFQNKSEPPVVFHWIPFL GSTISYGIDPYAFFASCRQKYGDIFTFILLGQKTTVYLGVQGNEFILNGKLKDVNAEE VYSPLTTPVFGSDVVYDCPNAKLMEQKKFIKYGLTQAALESHVQLIEREVLDYLRTSP NFHGASGVVDISAAMAELTIFTAGRALQGEEVRRKLTAEFADLYHDLDKGFTPINFML PWAPLPHNRKRDAAHARMRAIYMDIIRERRAHKNPDQVEAGSDSDMIWNLMRCTYKDG QPVPDKEIAHMMITLLMAGQHSSSSISAWIFLRLASEPRVLEELYQEQVASLGKPDAD GVFPPLQFRDLDRLPLHQNVVKETLRLHSSIHSIMRKVKNPLPVPGTPYVVPTSHVLL ASPGVTATSDEYFPNASRWDPHRWENQAEPDDDGEMVDYGYGRVSKGTASPYLPFGGG RHRCIGEKFAYINLGVIVATVVRHLKLANVDGRKGVPQTDYSSLFSGPVKPAIIGWER RFPTA CYP51F2 Aspergillus niger CBS 513.88 XM_001394187.1, CAK40582.1 80% to CYP51F2 Aspergillus fumigatus MALLAVAGVYAFAALLVAIVLNVTRQLLFRNEKEPPVVFHWIPF LGSTISYGMDPYTFFFSCRKKYGDIFTFVLLGQKTTVYLGVQGNDFILNGKLKDVSAE EVYSPLTTPVFGSDVVYDCPNSKLMEQKKFIKFGLTQAALESHVQLIEKETLDYLRDS PRFNGASGVIDIPAAMAEITIYTAARALQGEEVRKKLTAEFAELYHDLDKGFSPINFM LPWAPLPHNQKRDAAHARMREIYTDIINERRKNPDEEKSDMIWNLMHCTYKSGQPVPD KEIAHMMITLLMAGQHSSSSISSWIMLRLASEPQVLEELYQEQLASLSNRNGVFEPLQ YQDLDKLPFLQSVIKETLRIHSSIHSIMRKVKNPLPVPGTSYIIPEDHVLLASPGVTA LSDEYFPNATRWDPHRWENQPDKEEDGEMVDYGYGSVSKGTASPYLPFGAGRHRCIGE KFAYVNLGVIIATIVRHLKLFNVDGRKGVPGTDYSTLFSGPMKPAIVGWERRFPDHSK GSLN CYP51F2 Aspergillus niger (from JGI) estExt_fgenesh1_pm.C_40090|Aspni1 80% to CYP51F2 Aspergillus oryzae, 66% to 51F1 Penicillium italicum 4 aa diffs to Aspergillus niger CBS 513.88 MALLAVAGVYAFAALLVAIVLNVTRQLLFRNEKEPPVVFHWIPFLGSTISYGMDPYTFFFSCRKKYGDIFTFVLLGQKTT VYLGVQGNDFILNGKLKDVSAEEVYSPLTTPVFGSDVVYDCPNSKLMEQKKFIKFGLTQAALESHVQLIEKETLDYLRDS PRFNGASGVIDIPAAMAEITIYTAARALQGEEVRKKLTAEFAELYHDLDKGFSPINFMLPWAPLPHNRKRDAAHARMREI YTDIINERRKNPDEEKSDMIWNLMHCTYKSGQPVPDKEIAHMMITLLMAGQHSSSSISSWIMLRLASEPQVLEELYQEQL ASLSNRNGVFEPLQYQDLDKLPFLQSVIKETLRIHSSIHSIMRKVKNPLPVPGTSYIIPEDHVLLASPGVTALSDEYFPN ATRWDPHRWENQPDKEEDGEMVDYGYGSVSKGTASPYLPFGAGRHRCIGEKFAYVNLGVIIATIVRHLKLFNVDGRKGVP GTDYSTLFSGPMKPAIVGWERRFPDNIKGSMN* CYP51F2 Ajellomyces capsulatus NAm1 XM_001540591.1 70% to CYP51F2 Aspergillus fumigatus MSWFTIGTFGVLAFVAAVTLNLVYQLLFRFLNKTRPPLVFHWIP FLGSTIRYRTDPYKFFFSCRQKEASRLFQHGDIFTFVLLGRPTTVYLGIKGNEFILNG KLKDVNAEEVYGPLTTPVFGPDVVYDCPNSKLIEQKKFIKYGLTQAALESHVPLIEKE VMDYLDSSPNFLGASGEVDISAVMAEITIFTAGSALQGEEVRSKLTTEFAVLYHDLDK GFSPINFMLPWAPLPHNKKRDAAHTLMHAIYLDIIEKRRRAARNANGSQTQDMIENLM QCTYKNGQKLPDKEIANIMISLLMAGQHSSSTTSSWIMLHLASEPAVVEQLYQEQLDN LPRTGPNGSLGPLQYGDLNRLPLHRNVIRETLRLHTSIHSLLRKVMNPMPVAGTPYVI PPSHVVLSAPGVTALSDEYFPNPTMWDPNRWETQEPKVDEKEDMVDYGYGTISKGTSS PYLPFGAGRHRCIGEKFAYVNLTVIVAIMVRHLQFSNIGGKTGVPRTDYSSMFSAPMK PARINWKRRTAKSG CYP51F2 Histoplasma capsulatum G217B 94% to CYP51F2 Ajellomyces capsulatus NAm1 two identical sequences ABBT01000205.1 Ajellomyces capsulatus G217B 481620-479996 (-) ABBT01000207.1 Ajellomyces capsulatus G217B 78633-77009 (-) HCB05685.1 identical to HCB06140.1 see fungal pages for seq CYP51F2 Penicillium digitatum strain GL-01 DQ355161.1, also AJ439081 (5 aa diffs) 67% to CYP51F2 Aspergillus fumigatus MDLVPLVTGQIKCIAYYTTGLVLASIVLNVIKQLVFYNRKEPPV VFHWIPFIGSTVAYGMDPYQFFFASRAKYGNIFTFILLGKKTTVYLGVEGNEFILNGK LKDVNAEEIYGKLTTPVFGSDVVYDCPNSKLMEQKKFIKYGLSQEALESYVPLIADEI SSYIKSSPSFKGQSGTIDLVPAMAEITTFTAARTLQGEEVRSKLTTEFAKLFHDLDLG FTPINFMLPWAPLPQNRKRDRAHRRMREIYVDIIQARREAGEEANDNGRDKTKGTDMI SNLMRCVYRDGTPIPDKEIAHLMITLLMAGQHSSASISCWILLRLASQPEMTEKLFAE QVNNLGADLPLLQYKDLDKLPLHRNVIKETLRLHSSIHTLMRKVKNPMPVPGTDFVIP PSHTLLSSPGVTARDERHFRDPLRWDPHRWGSRVEAEDSSDTVDYRYGAVSKGTRSPY LPFGAGRHRCIGEKFAYLNLGVIIATLLREFRFFNPEGMEGVPDTDYSSLFSRPMQPA TVRWEVRS CYP51F3 Fusarium graminearum AACM01000457 (WGS section) FG11024.1 FGcontig1.457_scaffold8 50% to 51F1 N.crassa (third CYP51 in this genome) MESLYETLRTLPLSVSIPLTTSIIIILSIVTNVVKQLWFPNPHRPPVVFH IFPFIGSTVQYGIDPYAFFFDCRDKYGDCFTFILLGKSTTVFLGPKGNDF ILNGKHADLNAEDVYGKLTTPVFGEEVVYDCSNARFMDQKRLLKLGLTTD SLRCYIPKFVKEVEDYVKNSPYFKGDTGIVNITEVMAEITIYTASGSLLG NEVRSMFDSTFATLYRHLDDGFQPINFVMPGLPLPQNFRRNHARKVMEKL FSDIISKRRETGNQGDETDMIWMLMNAQYKDGEPLPDHHAARMLIAILMG GQHNTAVSGAWLLLNLAHKPHLVQELYEEQTQVLGSPQEPLTWENLQKLT LNGQVIKETLRLHSPIHSILRQVKSPMRVPGTEWVVPPSHTLLSSPGTMA RSEEFFPRPSEWDPHRWDKIEPLVKTAEDGQTVDYGFGVMSKSVSSPYLP FGAGRHRCVGENYAYAQLGAIVATFIRLVHIEQPDPKAPLPAPDYSSMFS RPMNPAEIRWRRRETVE* CYP51F3 Fusarium oxysporum FOXG_13138 87% to CYP51F3 Fusarium graminearum note: gc boundary at DQKR see fungal pages for seq CYP51F3 Fusarium verticillioides FVEG_12391 95% to CYP51F3 Fusarium oxysporum see fungal pages for seq CYP51F3 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_6000765 Necha1/scaffold_6:2267196-2268844 81% to Fusarium graminearum CYP51F3 52% to fgenesh1_pg.scaffold_2000426, Note two exons, but the second exon is not seen in CYP51F1 or CYP51F2 MESLYGSFRALPLSISIPFTVGLVVVIAIARNVIGQLYFPDRNRPPVVFHVFPFIGSSVQYGIDPYKFFF DCKAKYGDCFTFILLGKPTTVFLGQKGNEFILNGKHADLNAEEVYGKLTTPVFGKEVVYDCSNARFMDQKR (0) LLKLGLTTEALRAYIPKFVKEVEDYVNTSPYFKGETGQVNITEVMAEITIYTASGSLLGNEVRSMFDST FALLYRHLDDGFQPINFVMPWLPLPQNRRRDHARNVMEGLFANIIRQRREQGNQGDETDMIWTLMDAQYK DGTVLPDIHVARLLIAILMGGQHNTAASGAWLFLNLAHKPHLVEEMYQEQLEVLGSPPAPLTWENVQKLT LNGQVIKETLRLHSPIHSIMREVKSPMPVPGTEWVVPPSHTLLASPGTLARSEDFFPRPLEWDPHRWDKA DPVEGPVDDSQKVDYGFGMMSKAVSSPYLPFGAGRHRCVGEHYAYAQLGAIVATFVRLCHMEQVDPKAPI PATDYS (1) SMFSRPMNPAVIKWRRRKPE* CYP51F4 Aspergillus oryzae GenEMBL BAE63554.1 72% to 51F2 Aspergillus nidulans, 77% to 51F2 Aspergillus oryzae 60% to 51F1 Aspergillus oryzae MSWPRIGAYALLAFVAIMALNVTYQFLFRMLNKTRPPLVFHWIPFIGSTIHYGTDPYGFFFSCREKYGDI FTFILLGRPTTVYLGTQGNEFILNGKLKDVNAEEVYSPLTTPVFGSDVVYDCPNSKLIEQKKFIKFGLSQ AALEAHVPLIEKEVEDYLAMSPNFHGTSGEVDIPAAMAEITIFTAGSALQGEEVRSKLTTEFAVLYHDLD KGFTPINFMLPWAPLPHNKKRDAAHARMRAIYIDIINKRRNAGDNVPEKLDMIGNLMQCTYKNGQPLPDK EIAHIMITLLMAGQHSSSSISSWIMLRLASQPAVVEELYQEQLANLERTGPNGSLAPLQYKDFDNLPLHQ NVIRETLRLHSSIHSLLRKVKNPLPVPGTPYVIPTSHVLLAAPGVTALSDEYFPNAMAWDPHRWETQAPQ ENNKDDIVDYGYGAMSKGTSSPYLPFGAGRHRCIGEKFAYLNLAVIVATMVRHLRFSNLDGQTGVPDTDY SSLFSGPMKPARIRWERRAAKSG CYP51F5 Rhizopus oryzae (Zygomycete fungus, Mucoromycotina) RO3G_16595.1 62% to CYP51F1 Rhizopus oryzae 42% to CYP51F1 Ustilago maydis in the CYP51 family 45% to CYP51F1 S. pombe see fungal pages for seq CYP51F6 Kluyveromyces polysporus Kpol_1023p61 65% to CYP51F1 S. cerevisiae see fungal pages for seq CYP51F7 Uncinocarpus reesii UREG_00593.1 58% to CYP51F Uncinocarpus reesii from PPVV on 62% to Aspergillus terreus CYP51F1 model is short. Seq. runs into a gap, missing last exon see fungal pages for seq CYP51F7 Coccidioides immitis CIMG_00573.2 84% to CYP51F7 Uncinocarpus reesii (ortholog to second CYP51 in this species) CIMG_00573.2 CYP51G1 Triticum aestivum (wheat) Francis Durst CYP51BLE submitted to nomenclature committee note: CYP51G subfamily is for Green plants and their close relatives see under plants for more 51G sequences CYP51H1 Oryza sativa (rice) note: the CYP51H subfamily is for non-typical CYP51s in Green plants and their close relatives see under plants for more 51H sequences 52A Subfamily CYP52A1 Candida tropicalis GenEMBL M24894 M15945 Sanglard,D. and Loper,J.C. Characterization of the alkane-inducible cytochrome P450 (P450alk) gene from the yeast Candida tropicalis: identification of a new P450 gene family. Gene 76, 121-136 (1989) CYP52A2 Candida tropicalis Swiss P30607 (522 amino acids) GenEMBL M63258 (5745bp) Seghezzi,W., Sanglard,D. and Fiechter,A. Characterization of a second alkane-inducible cytochrome P450p-encoding gene, CYP52A2, from Candida tropicalis Gene 106, 51-60 (1991) CYP52A3 Candida maltosa GenEMBL D12475 D01168 (2047bp) Ohkuma,M., Hijiki,T., Tanimoto,T., Schunck,W.H., Muller,H.G., Yano,K. and Takagi,M. Evidence that more than one gene encodes n-alkane-inducible cytochrome P-450s in Candida maltosa, found by two-step gene disruption. Agric. Biol. Chem. 55, 1757-1764 (1991) Note: This is CYP52A3-b, one of two alleles in C. maltosa. CYP52A3 Candida maltosa GenEMBL S77461 (2047bp) Swiss P24458 (523 amino acids) Ohkuma,M., Hikiji,T., Tanimoto,T., Schunck,W.H., Muller,H.G., Yano,K. and Takagi,M. Evidence that more than one gene encodes n-alkane-inducible cytochrome P-450s in Candida maltosa, found by two-step gene disruption Agric. Biol. Chem. 55, 1757-1764 (1991) CYP52A4 Candida maltosa GenEMBL X51932, S64322 (1787bp) Schunck,W.H., Vogel,F., Gross,B., Kargel,E., Mauersberger,S., Kopke,K., Gengnagel,C. and Muller,H.G. Comparison of two cytochromes P-450 from Candida maltosa: primary structures, substrate specificities and effects of their expression in Saccharomyces cerevisiae on the proliferation of the endoplasmic reticulum. Eur. J. Cell Biol. 55, 336-345 (1991) CYP52A5 Candida maltosa GenEMBL D12714 Ohkuma,M. GenEMBL X55881 Ohkuma,M., Muraoka,S., Tanimoto,T., Fujii,M., Ohta,A. and Takagi,M. CYP52 (cytochrome P450alk) multigene family in Candida maltosa: identification and characterization of eight members. DNA Cell Biol. 14, 163-173 (1995) CYP52A6 Candida tropicalis GenEMBL Z13010 (2501bp) Swiss P30608 (524 amino acids) Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and Fiechter,A. Identification and characterization of additional members of the cytochrome P450 gene family CYP52 of Candida tropicalis. DNA Cell Biol. 11, 767-780 (1992) CYP52A7 Candida tropicalis GenEMBL Z13011 (2087bp) Swiss P30609 (507 amino acids) Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and Fiechter,A. Identification and characterization of additional members of the cytochrome P450 gene family CYP52 of Candida tropicalis. DNA Cell Biol. 11, 767-780 (1992) CYP52A8 Candida tropicalis GenEMBL Z13012 (1929bp) Swiss P30610 (517 amino acids) Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and Fiechter,A. Identification and characterization of additional members of the cytochrome P450 gene family CYP52 of Candida tropicalis. DNA Cell Biol. 11, 767-780 (1992) CYP52A9 Candida maltosa GenEMBL D26160 (283bp) Ohkuma,M. unpublished ALK5-B allele of CYP52A9 ALK5-A CYP52A9 Candida maltosa PIR JS0723 ALK5-A CYP52A10 Candida maltosa CYP52A11 Candida maltosa GenEMBL D26159 (815bp) Ohkuma,M. unpublished ALK8-B allele of CYP52A11 ALK8-A Note on CYP52A12-CYP52A20 and CYP52D2 from Candida tropicalis. These sequences seem to be orthologs of other Candida tropicalis sequences. 52A12 seems to be an ortholog of 52A6 52A13 and 52A14 seem to be orthologs of 52A2 52A15 and 52A16 seem to be orthologs of 52A1 52A17 and 52A18 seem top be orthologs of 52A8 52A19 and 52A20 seem to be orthologs of 52A7 the last 8 listed appear in what might be allelic pairs 94-96% identical to each other. It seems probable that these actually are the orthologs of those other sequences and this Candida tropicalis strain is different from the other. CYP52D2 matches Candida maltosa 52D1 and was not observed in C. tropicalis before. CYP52A12 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 1p 90% to CYP52A6 CYP52A13 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 2p might be an allele of CYP52A14 (96% identical) 86% to CYP52A2 CYP52A14 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 3p might be an allele of CYP52A13 (96% identical) 87% to CYP52A2 CYP52A15 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 4p might be an allele of CYP52A16 (95% identical) 85% to CYP52A1 CYP52A16 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 5p might be an allele of CYP52A15 (95% identical) 84% to CYP52A1 CYP52A17 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 7p might be an allele of CYP52A18 (94% identical) 78% to CYP52A8 CYP52A18 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 8p might be an allele of CYP52A17 (94% identical) 77% to CYP52A8 CYP52A19 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 9p might be an allele of CYP52A20 (95% identical) 75% to CYP52A7 CYP52A20 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 10p might be an allele of CYP52A19 (95% identical) 76% to CYP52A7 CYP52A21 Candida albicans GenEMBL XM_705082 (one aa diff to the Kim seq), orf19_10 XM_705076 (one aa diff to the Kim seq) Y14766 (3 aa diffs to the Kim seq) Donghak Kim and Paul Ortiz de Montellano Submitted to nomenclature committee July 21, 2006 72% to CYP52A11 (ALK8) of Candida maltosa CYP52A22 Candida albicans SC5314 79% to CYP52A4 orf19_7512 see Fungal pages for seq CYP52A23 Candida albicans SC5314 78% to CYP52A5 orf19_7513 see Fungal pages for seq CYP52A24 Candida albicans SC5314 78% to CYP52A3 orf19_5728 partial sequence see Fungal pages for seq CYP52A25 Candida dublinensis cdub_593-g99.1 79% to CYP52A3 Candida maltosa see Fungal pages for seq CYP52A26 Candida dublinensis cdub_14-g136.1 89% to CYP52A21 Candida albicans see Fungal pages for seq CYP52A27 Candida dublinensis cdub_14-g720.1 94% to CYP52A23 Candida albicans see Fungal pages for seq CYP52A28 Candida dublinensis cdub_14-g719.1 93% to CYP52A22 Candida albicans see Fungal pages for seq CYP52A29 Candida lusitaniae CLUG_03984.1 55% to CYP52A13 Candida tropicalis see Fungal pages for seq CYP52A30 Candida lusitaniae CLUG_04851.1 55% to CYP52A23 Candida albicans see Fungal pages for seq CYP52A31 Candida lusitaniae CLUG_04098.1 52% to CYP52A18 Candida tropicalis see Fungal pages for seq CYP52A32 Candida parapsilosis CPAG_03577 71% to CYP52A5 Candida maltosa see Fungal pages for seq CYP52A33 Candida parapsilosis CPAG_04394 73% to CYP52A5 Candida maltosa see Fungal pages for seq CYP52A34 Candida parapsilosis CPAG_04395 71% to CYP52A5 Candida maltosa see Fungal pages for seq CYP52A35 Candida parapsilosis CPAG_05481 65% to CYP52A21 Candida albicans see Fungal pages for seq CYP52A36 Candida parapsilosis CPAG_05482 65% to CYP52A21 Candida albicans see Fungal pages for seq CYP52A37 Candida parapsilosis CPAG_05523 58% to CYP52A12 Candida tropicalis see Fungal pages for seq CYP52A38 Candida parapsilosis CPAG_02285 74% to CYP52A25 Candida dublinensis see Fungal pages for seq CYP52A39 Candida guilliermondii PGUG_01238.1 66% to CYP52A14 Candida tropicalis see Fungal pages for seq CYP52A40 Candida guilliermondii PGUG_05855.1 60% to CYP52A25 Candida dublinensis see Fungal pages for seq CYP52A41 Candida guilliermondii PGUG_04005.1 52% to CYP52A12 Candida tropicalis see Fungal pages for seq CYP52A42 Candida guilliermondii PGUG_05670.1 62% to CYP52A23 Candida albicans see Fungal pages for seq CYP52A43 Debaryomyces hansenii CAG85755.1 54% to CYP52A3 Candida maltosa see Fungal pages for seq CYP52A44 Debaryomyces hansenii CAG85832.1 63% to CYP52A25 Candida dublinensis see Fungal pages for seq CYP52A45 Debaryomyces hansenii CAG88380.1 62% to CYP52A14 Candida tropicalis see Fungal pages for seq CYP52A46 Debaryomyces hansenii CAG88381.1 63% to CYP52A14 Candida tropicalis see Fungal pages for seq CYP52A47 Debaryomyces hansenii CAG88382.1 65% to CYP52A14 Candida tropicalis see Fungal pages for seq CYP52A48 Lodderomyces elongisporus LELG_04957 73% to CYP52A38 Candida parapsilosis see Fungal pages for seq CYP52A49 Lodderomyces elongisporus LELG_03506 72% to CYP52A37 Candida parapsilosis see Fungal pages for seq CYP52A50 Lodderomyces elongisporus LELG_03505 71% to CYP52A37 Candida parapsilosis see Fungal pages for seq CYP52A51 Lodderomyces elongisporus LELG_03309 75% to CYP52A35 Candida parapsilosis see Fungal pages for seq CYP52A52 Lodderomyces elongisporus LELG_00044 70% to CYP52A5 Candida maltosa see Fungal pages for seq CYP52A53 Pichia stipitis JGI model e_gww1.3.1.449.1 65% to CYP52A40 Candida guilliermondii see Fungal pages for seq CYP52A54 Pichia stipitis JGI model e_gww1.3.1.459.1 63% to CYP52A44 Debaryomyces hansenii see Fungal pages for seq CYP52A55 Pichia stipitis JGI model fgenesh1_pm.C_chr_3.1000237 63% to CYP52A14 Candida tropicalis see Fungal pages for seq CYP52A56 Pichia stipitis JGI model fgenesh1_pm.C_chr_8.1000042 59% to CYP52A38 Candida parapsilosis see Fungal pages for seq CYP52A57 Pichia stipitis JGI model e_gww1.3.1.1392.1 62% to CYP52A39 Candida guilliermondii see Fungal pages for seq 52B Subfamily CYP52B1 Candida tropicalis GenEMBL Z13013 (2442bp) Swiss P30611 (506 amino acids) Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and Fiechter,A. Identification and characterization of additional members of the cytochrome P450 gene family CYP52 of Candida tropicalis. DNA Cell Biol. 11, 767-780 (1992) 52C Subfamily CYP52C1 Candida tropicalis GenEMBL Z13014 (1622bp) Swiss P30612 (505 amino acids) Seghezzi,W., Meili,C., Ruffiner,R., Kuenzi,R., Sanglard,D. and Fiechter,A. Identification and characterization of additional members of the cytochrome P450 gene family CYP52 of Candida tropicalis. DNA Cell Biol. 11, 767-780 (1992) CYP52C2 Candida maltosa D12718 CYP52C3 Candida albicans SC5314 60% to CYP52C2 orf19_6574 see Fungal pages for seq CYP52C4 Candida dublinensis cdub_588-g100.1 94% to CYP52C3 Candida albicans (ortholog) see Fungal pages for seq CYP52C5 Candida parapsilosis CPAG_01250 48% to CYP52C3 Candida albican There is one 52C seq per Candida species except this species has two. They all seem to be orthologs even though the percent identity is not high see Fungal pages for seq CYP52C6 Candida parapsilosis CPAG_01665 50% to CYP52C2 Candida maltosa 53% to to CYP52C5 this species see Fungal pages for seq CYP52C7 Lodderomyces elongisporus LELG_05768 54% to CYP52C6 Candida parapsilosis 48% to CYP52C3 Candida albicans see Fungal pages for seq 52D Subfamily CYP52D1 Candida maltosa CYP52D2 Candida tropicalis No accession number David Craft Cognis Corp. Submitted to nomenclature committee June 26, 2001 Clone name 6p 61% to CYP52D1 52E Subfamily CYP52E1 Candida apicola GenEMBL X76225 (1560bp) PIR S38894 (519 amino acids) Lottermoser,K. unpublished (1994) CYP52E2 Candida apicola GenEMBL X87640 (2203bp) Lottermoser,K., Schunck,W.H. and Asperger,O. Cytochromes P450 of the sophorose lipid-producing yeast Candida apicola: heterogeneity and polymerase chain reaction-mediated cloning of two genes Yeast 12 (6), 565-575 (1996) CYP52E3 Candida bombicola, alternative name Starmerella bombicola EU552420 Inge Van Bogaert Submitted to nomenclature committee Dec. 7, 2007 Gene2 92% to 52E2, 82% to 52E1 52F Subfamily CYP52F1 Yarrowia lipolytica GenEMBL AB010388 (3578bp), GenPept CAG80007.1 Iida,T., Ohta,A. and Takagi,M. Cloning and characterization of an n-alkane-inducible cytochrome P450 gene essential for n-decane assimilation by Yarrowia lipolytica. Yeast 14 (15), 1387-1397 (1998) gene for ALK1 see Fungal pages for seq CYP52F2 Yarrowia lipolytica gene for ALK2 GenEMBL AB010389, GenPept CAG77659.1 Iida,T., Ohta,A. and Takagi,M. Yarrowia lipolytica cytochrome P450 gene, complete cds unpublished see Fungal pages for seq CYP52F3 Yarrowia lipolytica gene for ALK3 GenEMBL AB010390, GenPept CAG79910.1 Iida,T., Ohta,A. and Takagi,M. Yarrowia lipolytica cytochrome P450 gene, complete cds unpublished see Fungal pages for seq CYP52F4 Yarrowia lipolytica gene for ALK4 GenEMBL AB010391, GenPept CAG83106.1 Iida,T., Ohta,A. and Takagi,M. Yarrowia lipolytica cytochrome P450 gene, complete cds unpublished see Fungal pages for seq CYP52F5 Yarrowia lipolytica gene for ALK5 GenEMBL AB010392, GenPept CAG83107.1 Iida,T., Ohta,A. and Takagi,M. Yarrowia lipolytica cytochrome P450 gene, complete cds unpublished see Fungal pages for seq CYP52F6 Yarrowia lipolytica gene for ALK6 GenEMBL AB010393, GenPept CAG82620.1 Iida,T., Ohta,A. and Takagi,M. Yarrowia lipolytica cytochrome P450 gene, complete cds unpublished see Fungal pages for seq CYP52F7 Yarrowia lipolytica gene for ALK7 GenEMBL AB010394, GenPept CAG84028.1 Iida,T., Ohta,A. and Takagi,M. Yarrowia lipolytica cytochrome P450 gene, complete cds unpublished see Fungal pages for seq CYP52F8 Yarrowia lipolytica gene for ALK8 GenEMBL AB010395, GenPept CAG82058.1 Iida,T., Ohta,A. and Takagi,M. Yarrowia lipolytica cytochrome P450 gene, complete cds unpublished see Fungal pages for seq CYP52F9 Yarrowia lipolytica CAG84211.1 64% to CYP52F3 see Fungal pages for seq CYP52F10 Yarrowia lipolytica CAG82791.1 71% to CYP52F1 see Fungal pages for seq CYP52F11 Yarrowia lipolytica CAG83401.1 69% to CYP52F2 see Fungal pages for seq CYP52G1 Aspergillus nidulans AN6057.1 38% to 539A1 46% to 52F1 52 clan see Fungal pages for seq CYP52G2 Aspergillus fumigatus Af293 GenEMBL XP_755288.1 also EAL93250.1 60% to 52G1 MGLPLLTIAAVAVALRLLWTLAARWQHAQNARRLGCGSIPLYPSDPLGISVLKETLAADKAKKLLPLVER RVALMSDREGRYVTTFRFRQMGRENYFTTDPKNIQAILATQFKDFELGAPRRQALHPLLGAGIFSSDGEE WFHARGLLRPQFTRDQISDLDLEERHVQKAMQAMPVVNGKWTDAVDIQSIFFRLTIDSATEFLFGESVES QLSALNGGQTPVDTFPYYFDKSQWYAAQRARFEKLYWIVNNKECRHAQNEVHAYVDRVVHKALKAAREGK LANPNKPSQYVFLHALVSVTQDPIELRSQLLNILLAGRDTTASLLSWTVLMLARHPAEFHKLRQTIVDEF GTYDQPRNITFAALKSCQYLQYCLNETLRLFPVVPGNRRSATRDTTLPRGGGSDGTQPIYIRKGQTVVYN VHILHRRKDIWGPDAEEFKPSRWVDRKVGWDYVPFNGGPRICIGQQFALTEAGYVLVRLLQRFDAIEDMQ PHLEIRHSLNLTSAPADNVTVRLREAA CYP52G2 Neosartorya fischeri NFIA_085030 97% to CYP52G2 Aspergillus fumigatus = ortholog see Fungal pages for seq CYP52G3 Aspergillus oryzae GenEMBL BAE65166.1, AP007171.1 58% to CYP52G4, 63% to 52G1 14 P450 genes and 2 pseudogenes on this contig MIPLLILAALAVVFRLVWSIFTSLRHAQNARKWHCGAIPTYPGDILGINTLKEVLRADKEKLIPVLSAQR VETMTAREGRYVSTFRLRQMGRESIFTSDPKNMQAILATQFKEFELGSLRRNSLHPLLGSGIFSTDGEAW SRSRSLLRPQFTRDQVSDLDLEERHVQKAMAGMLADPATKWTPEIDIQSIFFRLTIDSATEFLFGDSVES QTAALSGSRIIEDKFPSYFDRGQWYAAQRARFEKLYWIVNNKESRETDRFVHAYVDRFIDAALAAVKEGK IDPEKRNSDHYVFLHGLTTATQDPVELRSQLLNILLAGRDTTASLLSWCVLLLARHPDIFQKLRNTILAD FGDYRNPRNITFSSLKSCRYLHYFMNEVLRLYPIVPGNRRVALKDTTLPRGGGPDGSEPVYVRKGQPVVY SVFVTHRRKDIWGADAEVFNPDRWEDLKVGWEYLPFNGGPRICIGQQFALTEAGYVLVRLLQRFDQIVDA RPEREIRFNATLTSAPWENVIVRLREGA CYP52G3 Aspergillus flavus AFL2G_05445 99% to CYP52G3 Aspergillus oryzae see Fungal pages for seq CYP52G4 Aspergillus oryzae GenEMBL BAE55312.1 54% to 52G1 MSDARTTLMETWFTPALVAAIVAARLLFSLYSAWRHAQRARSLNCQEAPLYPSRDPFGVATLLETLRADR DKFLPKLSQKRVDLISSQQNRYVSTFRVRQAGRENFFTVDPKNIQAMLATQFNDFFLGDMRRNAGAPVIR SGIFVSDGADWSHSRSLIRPQFTRTQINNLELEERHVQNALRAMPTQSNGWTSEVDIQTILFRLTLDSAT EFLFGKSCNSQLTALEKDAGEMSDSFLNSFDRCAWYLAARLRFERLYWIVNNKEFRECTRVVHELVDGYV HAALQRAQQAEKPADPERGSHYVFVDALTATTQDPNQLRDECLNVLIAGCDTTASLLSWNILLLARHPDI FQRLRKEIIERFGTYSEPRDINFSSLKSCQYLQHFINETLRLHPVVPFNRRCANKDTTLPRGGGKDGNSP VYLQKGQPVLYSSYVLQRRKDIWGEDAEEFNPDRWYGRKAIWEHIPFSGGPRTCIGQQFAITNTSFVLVR LLQRFDSIEDVYPEREIRYGVTLTNCPADRVTVRMHQAEL CYP52G4 Aspergillus flavus 99% to CYP52G4 Aspergillus oryzae AFL2G_00228 see Fungal pages for seq CYP52G5P Nectria haematococca e_gw1.8.757.1 Necha1/scaffold_8:999970-1000262 old name = CYP-un1 pseudogene PSEUDOGENE very short piece 57% to 538A3, 54% to 584D1, 61% to 52A3, 68% to 52G2 DESLRLYPVEPINARYANKDTTLPRGGGKDGNSPILIPKGSSTAFSVHIIHRRKDIRGPD ANEFKPERWEGRRVGWEYVP () FNGGPRICIG**FALIGASYVTVRLLQRFDKMESLEKDAVVRHNLKLINYVANGVKVRLHAAS CYP52G6 Aspergillus niger fgenesh1_pg.C_scaffold_15000225|Aspni1 57% to CYP52G2 see Fungal pages for seq CYP52G7 Aspergillus niger e_gw1.7.657.1|Aspni1 67% to CYP52G3 see Fungal pages for seq CYP52G8 Aspergillus clavatus ACLA_081330 83% to CYP52G2 see Fungal pages for seq CYP52G9 Aspergillus terreus NIH2624 GenPept EAU33222.1 ATEG_06678 CH476602 complement(join(1144551..1145711,1145779..1146177)) 62% to CYP52G2 see Fungal pages for seq CYP52G10 Aspergillus terreus ATEG_07540.1 68% to CYP52G4 see Fungal pages for seq CYP52H1 Aspergillus nidulans AN7131.1 45% to 584B1 52 clan see Fungal pages for seq CYP52H2 Aspergillus fumigatus Af293 GenEMBL XP_746567.1 also EAL84529.1 cytochrome P450 alkane hydroxylase 72% to 52H1 MLAPAVALFAAVYTTFLVLRYFQQLYKHRKQARSLRCQPPADGEAGFLGIASFLRLRKAAKEKRWIELIA EQYGKYGNTFTQTIFGQPLVSTIEPENLKALLATQFNDFELGTRHREFYPLLGDGIFTLDGAGWSHARAL LRPQFTRDQVADLDLMDGHVSRLIDLVPKDGSAFDIQRLFFLLTIDSATHFLFGESVGSLHAGTDTGLLG RSAVGNAEGFAEAFNKAQEYLAARSRAVVFYWMINPKEFRDANKRVHEVVDHYVQLALESRRNPEKKQPG GRYIFAEALAAENDNPKVLRDNMLNILLAGRDTTASLLSSTFFYLARHPNVWIKLRQVIVDAFGDAEHPK EEITQTKLKDIPYLRYVLNEVLRLLPPVPANFRVANKDTTLPVGGGPDQKSPVYIRKGTIVTYSVYAMHR RTDFYGPDANEFRPERWEENGKRGWEYLPFNGGPRICLGQQYALTEASFTIVKLLQRFDRIENGDPDLIE PIQQTNLTLAHDRGVYIRLYSSKAL CYP52H2 Neosartorya fischeri NFIA_029600 95% to CYP52H2 Aspergillus fumigatus = ortholog see Fungal pages for seq CYP52H3 Aspergillus oryzae GenEMBL BAE64862.1, AP007171.1 71% to 52H1, 48% to 584G1 14 P450 genes and 2 pseudogenes on this contig MLAQIAPLFGALCITFLVLQYLQRLYQQRKRAQSLGCQPAAQGPSGIFGIRSFVRLLNEVRHKRWVEYIA GQYGRYGNTYTQKALGKWMVSTIEPENIKALLATQFNDFGLGTRHREFYPLLGDGIFTLDGPGWSHARGL LRPQFTRDQVADLELMDGHISRMIDLIPQDGSSFDIQRLFFLLTIDSATHFLFGESVGALESSNSASLLG RSSVGSAQGFAEAFGTAQDYLTTRSRAMHFYWMVNPKEFREANQRVHEVVDHYVQLAIQSKNNPDKKSDR YIFAEALAADNDDPKVLRDNMLNILVAGRDTTASLLSSAFFYLSRNQDVWKKLRQTIIDEFGDSQNPKGE ITQAKLKDIPYLRYVLNEVLRLLPPVPLNFRVAAKDTSLPVGGGPDGRSPVFIPKGQVVAYSVYAMHRRT DLYGPDSHSFRPERWEENGRRGWDYLPFNGGPRICLGQQYALTEASYTLVKLVQRFDTLECADPELKQPA ILSTLTMSHDRGVKVRLSSSVPKCWFNQ CYP52H3 Aspergillus flavus AFL2G_05134 100% to CYP52H3 Aspergillus oryzae see Fungal pages for seq CYP52H4 Aspergillus niger estExt_GeneWisePlus.C_80246|Aspni1 73% to CYP52H2 see Fungal pages for seq CYP52H5 Aspergillus clavatus ACLA_054640 84% to CYP52H2 Aspergillus fumigatus see Fungal pages for seq CYP52H6 Aspergillus terreus ATEG_02198.1 75% to CYP52H5 see Fungal pages for seq CYP52J1P Aspergillus fumigatus Af293 GenEMBL XP_746816.1 also EAL84778.1 alkane hydroxylase predicted 42% to CYP52F1, in 52 clan insertion of 11 C bases causes frameshift = & MTNTDIAFGHQHGCELPPELTKNWPLGLDRIKELWTANTEGHLLAFLCSVAEKYEPGNSITQYFLFGPRA FHILQPESVEAILSTNFKDYGFGARAAIFAPRLGNGIFTQEGPAWRHSRDLLRKQFSRVQNRILEHFHEH VDNMVARLPLDGVVDLQPLFFNLTLDIATALLFGRSVYSLLAGIDQDADNGLFAEGFNIAQDGLAKRFRI APWHFLYNPPGFRKACGDVHRFVEQYIDQLDLENSEDLDDKTYGFIKRVAHESASRQHLRDQLLNVLLAG RDTTACCLSWTFRLLVHHEQAMIRLREEIASVMGDSAHPTKEQIRKMPYLSCVIKE SLRLYPPP & PPVPLNNREAIRTTILPTGGGPDADRPILVRKGELVVFFQYVNSRKKNIYGPDADCFRP ERWETGELDHIGWAYFPFNGGPRQCLGENFALMEVSYTVVRLLQTFSSIILPKGQPIEPVGSERQRLTLV LSSADGCKVQIHS CYP52J1 Neosartorya fischeri NFIA_113870 95% to CYP52J1 Aspergillus fumigatus = ortholog see Fungal pages for seq CYP52J2 Uncinocarpus reesii UREG_00942.1 71% to 52J1 in overlapping parts see Fungal pages for seq CYP52K1 Aspergillus oryzae GenEMBL BAE66393.1 44% to 52A5, 44% TO CYP52J1 MLTLSFSALLGIALAWAIHRLAWKYSRLKAICVKRDNKLATQYGCEQPPRLRNWWPLGIDRLIQIWTADS EQRLMDLFTFPFKDVGYTLEQKFLGTIAFGTIDPENLEAIMTNINVFSFGLRRHILFPLLGDGIFTQEGK SWRHSRELLRPQFTRQHYRDLNIFRPHVDRLLHYLTGNGAATDLQPLFFRLTLDTTTEYLFGKSVNSLMP GEATKGQSFANHFDMAQNYVVQRFRLLDLYWLIGGPKFWRSCSAVHRFIDEIIDTRMDSKEKDKDCNSKS IFFDAVARDSRTRRDLRDQLTNVLLAGRDTTACLLSWTFHCLARHPDVLGRLNVEIASVVGTNTDLTRDD FKRMPYLDQVLREVLRLYPPVPVNTRTAHKTTILPTGGGKDGTKPFMVREGENVAFCVYAMHRREDLYGP DAGQFRPERWDEDLPLFQNERTATWGYLPFNGGPRACLGQDFGFVEAAYTVVRILQKYPIVKPDMYKGDI QGRKWLGWSSHQPEGIEMVTEERQKMTIVLSLGDGCRVTLGR CYP52K1P Aspergillus flavus AFL2G_11704 97% to CYP52K1 Aspergillus oryzae small 6 aa deletion and frameshift = & see Fungal pages for seq CYP52L1 Graphium sp. ATCC 58400 GenEMBL AY438638.1 Skinner,K.M., Cone,M. and Ciuffetti,L.M. Characterization and cloning of an alkane inducible cytochrome P450 monooxygenase from a filamentous fungi, Graphium sp. Unpublished alkane monooxygenase P-450 (ALK1) gene 41/65 top blast hits to a complete set of named fungal P450s are CYP52 sequences. The others are from CYP538, CYP539, CYP584, CYP585 related families in the CYP52 clan of fungal P450s. 45% to CYP52G2 Aspergillus fumigatus MMDPFYLLGGCCVFGIVYKSVATLYVNYVHGKTARELGCKPLPK ERTRFFGLDILRREMKADREMYLPSDIMTRFEEVGATTYEYQMLNEKHIATCDPLVIQ TVLATQFKEFTFGNRPWGTVLGNGIFSIDNIGDNKEWLHTRAMLKPQFLRNQVSDLEL EEGHVRNFMSVLKVGDDGWTPKVNLQHMFFNFTLDSATEFLFGKSADCQLLFAPGGDP KGVSKPLRAFLKALDVAKETVNTKWFLGNNGWIADSPTFRKNCAYVNDFMDALIAKTK AELEAKPKGSKPSGGREKYHFLHAMLDETDDKVELRGQALNILLAGRETTASLLGWLW YYLARHPIEFQKLRAAVLKDFGTAENPKPMSFESLKACDQLQYCNNEILRLFPLVSYN SRMALTDTTLPRGGGPDGNSPIFVKKGQQIVYHPHAMHRRKDIWGQDADEFRPDRWKT LRPGWEYIPFNGGGRICMGQQFALTEASYLTVRLLQRFDRIENLDPNPVLKQKFRIVN TPGEGVLVRLHEAAAGA CYP52M1 Candida bombicola alternative name Starmerella bombicola EU552419 Inge Van Bogaert Submitted to nomenclature committee Dec. 7, 2007 gene1 45% to CYP52A3, 45% to CYP52E1 CYP52N1 Candida bombicola alternative name Starmerella bombicola EU552421 Inge Van Bogaert Submitted to nomenclature committee Dec. 7, 2007 Gene3 45% to CYP52M1, 45% to CYP52A15 CYP52P1 Aspergillus niger gw1.4.1682.1|Aspni1 46% to CYP52K1 see Fungal pages for seq CYP52P2 Mycosphaerella graminicola Mgr076 53% to CYP52P1 A. niger, 47% to 52K1 (missing N-term) yellow from JGI model e_gw1.8.895.1|Mycgr3 see Fungal pages for seq CYP52Q1 Aspergillus niger e_gw1.11.186.1|Aspni1 45% to CYP52H3, 41% to CYP52P1 see Fungal pages for seq CYP52R1 Mycosphaerella graminicola Mgr006 51% to CYP52G2 Aspergillus fumigatus see Fungal pages for seq CYP52R2 Mycosphaerella fijiensis fgenesh1_pm.C_scaffold_45000005 60% to CYP52R1 Mycosphaerella graminicola see Fungal pages for seq CYP52S1 Yarrowia lipolytica CAG81974.1 42% to 52H3, 44% to 52F1 see Fungal pages for seq CYP52T1 Aspergillus terreus ATEG_03903.1 48% to CYP52G2 see Fungal pages for seq 53A Subfamily CYP53A1 Aspergillus niger GenEMBL X52521 van Gorcom,R.F., Boschloo,J.G., Kuijvenhoven,A., Lange,J., van Vark,A.J., Bos,C.J., van Balken,J.A., Pouwels,P.H. and van den Hondel,C.A. Isolation and molecular characterisation of the benzoate-para-hydroxylase gene (bphA) of Aspergillus niger: a member of a new gene family of the cytochrome P450 superfamily Mol. Gen. Genet. 223, 192-197 (1990) CYP53A2 Aspergillus parasiticus GenEMBL AC005991 Lewis,J., Kupfer,D., Keller,N. and Roe,B.A. Aspergillus parasiticus Cosmid Clone ap0 Unpublished 57% to CYP53A3 27896 MLLDLYLVKCLFCPL 27941 VPIVALGLLVAYYVAGYLKQWHLHDLPGPFIAGFSRIWLILQVRQGYRSLVVNDLHRRCG 28120 28121 KIVRLAPNHISVADESAIQAIYGHGNGFLK 28210 intron 28267 SNFYNAFLNVDWSIFTTRSRAEHTRKRKIVSHAFSARSLAQVEQYAHNTMELLVRQWQ 28440 28441 KMVDSQEGLDDPYAVIDARVWCNYLTFDIIGDLAFGAPFGMLERGNAIVSMRKALEDAAV 28620 28621 TLDAVEVLNHRGDVSAALGICPDLIPYAKWLPDLFFRQGAEAIANVADVAGAAVDRRLKM 28800 28801 NTSMTEKRGDLLAHLIDAEDQAGAKLGYRELTGEAVTLIAAGSDTSSSTLCALLYWVSTT 28980 28981 PRVLWKLQNVLDEVIPVDVEVPYLAMVKKI 29070 intron 29117 PSLQWVIWEALRIHSTFGQGLPREVPPERGPVEICSHTFYPGDVLSVPGYTMHHSVDIWG 29296 29297 IDVEDFVPERWDPCRLTQRQKDSFIPFSEGPRACIGRNLAEMEFFVGCATLFRLFEFRVE 29476 29477 GQGPLEVREGWLRKPVSLQVGIRRRYLDVRSS* 29575 CYP53A3 Aspergillus nidulans or Emericella nidulans GenEMBL AY048583, AN7589.1 Fraser,J.A., Davis,M.A. and Hynes,M.J. Submitted to nomenclature committee Oct. 3, 2000 benzoate-para-hydroxylase (bzuA) gene 82% identical to CYP53A1 MITDFLTPENITPERIALALLGLLAAYYVVPYLQTWRLSDIPAP GLAAWTNFWLLLQTRLGHRFISVDNAHKKYGKLVRIAPRHISIADDAAIQAVYGHGNG FLKSDFYDAFVSIRRGLFNTRDRAEHTRKRKTVSHTFSAKSIGQFEQYIHHNIENLVK QLTRISNLQRNPKNGYATVDALNWFNFVAFDIIGDLAFGAPFGMLDKGQDIAEMRKSP DSPPQYVQAVEVLNRRGEVSATLGCYPALKPFAKYLPDRFFRDGLEAVENLAGIAIAC VNERLKPEVMANNTRVDLLARLMEGKDANGNKLGRAELTAEALTQLIAGSDTTSNTSC AILYYCLRTPGVIDKLHKVLDEAIPQDVEVPTHAMVKEIPYLQWVIWETMRIHSTSAM GLPREIPEGNPPVEISGHIFKPGDILSVPTYTIHHSKEIWGADADEFIPERWAPERLT ARQKAAFIPFSTGPRACVGRNVAEMELLVICSTVFRMFDWELQQKGPMETREGFLRKP LGLTVGVKRRTIV CYP53A4 Neurospora crassa AABX01000266.1 cont3.41 NCU01086.1 (version3) 9a61.tfa_50cg@9A61 65% to 53A1 over 417 aa note the automated assembly of the gene is missing the KYG fragment seen in 53A1. See my assembly for corrected seq. Neurospora crassa sequence contig 1.248 (supercontig 26) 14619 MAIISLLMSPWAPVVLLAGVAFYYLVPYFVTYSALRKIPSPFPAQFTDLWLL SVCRRGNRYQRVDELHKKLGPVVRIQPNHVSICDDAAIPTIYGHGNGF LNDFYDAFVSIRRGLFNTRDRAEHTRKRKIVSHTFSAKSVQQFEPYMHSNLELFVKQWDSMIKNSKNPDKAAHLD CLEWFNYLAFDVIGDLSFGQPFGMLSSGADMAEIRSSPDAAPIYAPAIEILNRRGEVSATLGIHPALKPFAKYLP DPFFTKGLAAVENLAGIAIACVKSRLDNPPPVTRKDLLQRLMEGRDEKGEPLGREELTAEALTQLIAGSDTTSNS SCALLFHAVRTPGVMQKLQAELDANIPPEVDVPTYDMVKELPYLEAVINEVLRFHSTSGIGLPRQIPHDASQGVH IQGYYLPPGTVLSVPTYSIHHSKEIWGPDADEFKPERWERLTARQKNAFIPFSHGPRSCVGRNVAEMEMKLIVAT WARRYEVKLLQDYMDTREGFLRKPLGLKVGLKLRK 13029 CYP53A4 Neurospora discreta JGI gene model estExt_Genewise1Plus.C_10226 97% to CYP53A4 N. crassa see Fungal pages for seq CYP53A5 Magnaporthe grisea MG07977.4 AACU01000984.1 cont2.1481 runs off end, changed C-term AACU01000983 cont2.1480 = N-term 69% to CYP53A4 (split on two contigs) see Fungal pages for seq CYP53A6 Fusarium graminearum FG08079.1 AACM01000324 FGcontig1.324_scaffold5 see Fungal pages for seq CYP53A7 Fusarium graminearum FG08086.1 AACM01000324 FGcontig1.324_scaffold5 see Fungal pages for seq CYP53A8 Fusarium graminearum FG10451.1 AACM01000435 FGcontig1.435_scaffold7 see Fungal pages for seq CYP53A10 Nectria haematococca (Fusarium solani group) 83% to Fusarium graminearum CYP53A7 possible ortholog fgenesh1_pg.scaffold_37000003 Necha1/scaffold_37:6771-8451 59% to fgenesh1_pg.scaffold_12000228, Note: this seq has two more introns than the CYP53A8 ortholog This gene model seems correct DRN 2/4/06 see Fungal pages for seq CYP53A11 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_12000228 Necha1/scaffold_12:587439-589032 84% to Fusarium graminearum CYP53A8 possible ortholog 64% TO 53A1 Aspergillus niger This gene model seems correct DRN 2/4/06 see Fungal pages for seq CYP53A12 Aspergillus fumigatus Af293 GenEMBL XP_755858.1 also EAL93820.1 join(<2165575..2165876,2165928..2166734, 2166796..>2167246) benzoate 4-monooxygenase 85% to CYP53A1, 79% to 53A3 seq looks correct MITDLLSLQNAGLILLGLIAVYYVIPYLQKWHLHDIPSPRFAAFSNLWLLLQARRGRRFLKVDEAHKKYG KLVRIAPKHVSIADDAAIQAIYGHGNGFLKADFYDAFVSIRRGLFNTRDRAEHTRKRKTVSHTFSMKSIG QFEQYIHQNVELFVQQWTKLAKLNGNPRSGYATIDALNWFNYLAFDIIGDLAFGAPFGMLEKGKDIAEMR KTPDSEPTYVQAVEVLNRRGEVSATLGCLPRLIPYAKYLPDRFFKDGVQAVENLAGIAVARVNERLKPEV MEKNTRVDLLSRLMEGKDSNGNKLGREELTAEALTQLIAGSDTTSNTTCAILYWCMSTPGVIPKLQKVLD EAIPDDVDVPTHAMVKDIPYLQWVIWETMRIHSTSAMGLPREIPPGNPPVTISGHTFYPGDVVSVPSYTI HRSKEIWGPDAEKFVPERWDPARLTARQKAAFIPFSTGPRACVGRNVAEMELLVMTGTIFRLFEFEMQQD GPMETREGFLRKPLGLIVGMKRRAVHASV CYP53A12 Neosartorya fischeri NFIA_090480 96% to CYP53A12 Aspergillus fumigatus = ortholog see Fungal pages for seq CYP53A13 Aspergillus oryzae GenEMBL BAE60506.1 58% TO CYP53A14, 81% to 53A3 MIAELLTPTGAAYVLTAAVIVYYILPYLQLWRLRDIPSPGFAAFSNLWLMLQYRKGNRFVTVDNAHKKYG KLVRIAPRHVSIADDEAIQAIYGHGNGFLKADFYDAFVSIRRGLFNTRDRAEHTRKRKTVSHTFSMKSIG QFEQYIHGNAELFVKQWNRIADTQSNPKTGYATIDALNWFNYLAFDIIGDLAFGAPFGMLEKGQDIAEMR KSPNDKPSYVQAVEVLNRRGEVSATLGACPSLIPWAKYIPDRFFRDGLEAVENLAGIAVARVNERLRPEV MANNTRVDLLARLMEGKDSNGNKLGREELTAEALTQLIAGSDTTSNTSCAILYWCLRTPGVIEKLHKVLD ESIPKDVDVPVHAMVKDIPYLQWVIWETMRIHSTSAMGLPREIPAGNPPVTISGHTFYPGDVVSVPTYTI HRSKEIWGPDAEQFVPERWDPKRLTARQKAAFIPFSTGPRACVGRNVAEMELLVIVGTVFRLFDFEIQQD GPMETREGFLRKPLGLMVGMKRRSVAV CYP53A13 Aspergillus flavus AFL2G_03223 99% to CYP53A13 Aspergillus oryzae, 1 aa diff, N-term extension deleted see Fungal pages for seq CYP53A14 Aspergillus oryzae BAE56637.1 56% to 53A3 MDGDSCPSCCSTCTSLNVFLCPHIPILTLGLLLVYYVTGYLKRWHLRDIPGPFIAGFSRIWLIVQVRQGY RSLVVHDLHRRYGKIVRLAPNHISIADESAIQAIYGHGNGFLETDFYNAFLNVDWSIFTTRSRAEHTRKR KIVSHAFSARSLAQVEQHAHNNMEHLVRQWRKMIDSEEGLDDPYAVIDARVWCNYLTFDIIGDLAFGAPF GMLERENATVSMRKAPENPEVTLDAVEVLNHRGDVSAAFGICPDLIPYAKWLPDPFFRQGAEAIANLAGV AGAAVDRRLKMDTSMTEKRGDLLALLIDAEDQAGAKLGHRELTGEAVTLIAAGSDTSSSTLCALLYWVSS TPRVLWKLQNVLDEVIPVDIEVPYLAMVKKITYLQWVIWEALRIHSTFGQGLPREVPPERGPVEICGHTF YPGDVLSVPGYTMHHSADIWGIDVEDFVPERWDPRRLTQRQKDSFIPFSEGPRACIGRNLAEMELFVGCA TLFRLFEFRVEGQGPLKVRERWLRKPVSLQVGIRRRYLDARSS CYP53A15 Cochliobolus lunatus EU597483 Nada Krasevec Submitted to nomenclature committee July 10, 2007 65% to CYP53A1 CYP53A16 Mycosphaerella graminicola Mgr055 64% to CYP53A11 Nectria haematococca see Fungal pages for seq CYP53A17 Uncinocarpus reesii UREG_05168.1 74% to CYP53A1 Aspergillus niger model has some missing seq (added back) see Fungal pages for seq CYP53A18 Coccidioides immitis CIMG_09108.2 90% to CYP53A17 see Fungal pages for seq CYP53A19 Fusarium oxysporum FOXG_04952 90% to CYP53A8 see Fungal pages for seq CYP53A19 Fusarium verticillioides FVEG_03183 98% to CYP53A19 Fusarium oxysporum = ortholog see Fungal pages for seq CYP53A20 Fusarium oxysporum FOXG_02776 92% to CYP53A7 see Fungal pages for seq CYP53A20 Fusarium verticillioides FVEG_01611 97% to CYP53A20 Fusarium oxysporum = ortholog see Fungal pages for seq CYP53A21 Aspergillus clavatus ACLA_074210 90% to CYP53A12 see Fungal pages for seq CYP53A22 Aspergillus terreus ATEG_07877.1 85% to CYP53A12 see Fungal pages for seq CYP53A23 Mycosphaerella fijiensis estExt_fgenesh1_pm.C_10348 88% to CYP53A16 Mgr055 Mycosphaerella graminicola see Fungal pages for seq 53B Subfamily CYP53B1 Rhodotorula minuta (a fungi) GenEMBL D63703(2766bp) Fukuda,H., Nakamura,K., Shibuya,K., Tanase,S., Gotoh,O., Ogawa,T. and Fujii,T. Nucleotide sequence of gene for cytochrome P450rm from Rhodotorula minuta IFO 1102. Unpublished (1995) CYP53B2 Sporobolomyces roseus estExt_fgenesh1_pg.C_30339 57% to CYP53B1 Rhodotorula minuta see Fungal pages for seq CYP53B3 Puccinia graminis f. sp. Tritici PGTG_08085 + PGTG_08086 revised 54% to CYP53B2 from LNRYP see Fungal pages for seq CYP53C1 Ustilago maydis GenEMBL XM_397620.1 hypothetical protein UM00005.1 in the CYP53 clan part B 54% to 53A1 54% to 53B1 CYP53C2 Phanerochaete chrysosporium AY321584 55% to CYP53C1 my Scaffold_164a JGI gene models ug.1.19.1 and pc.1.261.1 MAVIEALTQLDLKSWLLLIPALAIVAHILVWLLDPHGIRSYPGP LLAKFSDAWLGYVAAQGHRSEVVHDLHKQYGTFVRIAPNHLSIADPDALQVVYGHGTG TLKSNFYDAFVSIQRGLFNTRSRSEHARKRKIVSHIFSQKSVLEFEPHVRLYVKQLIQ QWDRLYEAGAKGLSGDDGEGGWRGRNGRVWLDCLPWYNYLAFDIIGDLAFGAPFGMLL AARDAAPVAVDHEQAMASYGKEKSEVQYIPAVQVINDRGTYSASLGVLPPWMRPIVKL FPWFRRGQKAVKQLAGIAVAAVAQRLTTPTDRVDLLGKLQEGRDDDGNLMGKEELTAE ALTQLIAGSDTTSNSSCAITYYLAKYPDAQRKLQQELDEALGSDDEPVSTFDQVKRLP YLQAVIDEALRIHSTSGIGLPRLVPKGGMTVCGRFFPEGTVLSVPTYTIHRDEEVWGK DPEVFRPERWFEQDKNAVQKTYNPFSFGPRSCIGRNLANMELLIIVSSILRRYDFVLE DPDKPFDTMEGFLRKPVECVVGIRRRTL CYP53D1 Fusarium oxysporum FOXG_14732 48% to CYP53A10 see Fungal pages for seq 54A Subfamily CYP54A1 Neurospora crassa GenEMBL X15033 Neurospora crassa sequence contig 1.1450 (supercontig 261) The translation below is more accurate than the one in Genbank MALTQLAHSASALLPLVLPSIAAILAFAIFQRYFAPNPLSNLPIVGEEYHGYEKKRQAYLTKAKDLYLEGYTKFKHGLFRIVTPNR NSVIVVSPRFLDELKKLPDDVVSFDAANDDSMHTKYTLIPTHEPILPHTIKTSLTPSLPRLNPQLSEEVQAFSQEIAPLMSSSPSD WAPININSKLLRIVAKVSGRVFIGPELCHDERYLEAAVGYTVSVMEAQRAVERMNPWVRPIAAWRLKEVRKLAQMERDATAFLRPV VEARREKQRKGEEKDNDMLQWLMDSADQGQGKQHGKWGEDTTTTRKLARLQLAISFAAIHTTTLVTTHAVYSLAADPKLQATLREE IQSVLQEHKGVFNTSALQAMKKTDSFLKETMRFHPLGQTSFNRKVLRTFALSNGQVIPKGSTIEVPNYAVSRDPEAYPNPDVPDPL RFYNLRNEAREKGEAEQGASGQFVSVNKEFLTFGYGRHACPGRFFAANEIKMILANLVMTYEMGLVEGETERYRDWDIAAGTIPDP TKDVMFRKL* CYP54A1 Neurospora discreta JGI gene model scaffold_13-snap.17.1 93% to CYP54A N. crassa see Fungal pages for seq CYP54B1 Magnaporthe grisea MG01391.4 45% to CYP54A1 AACU01000043 cont2.256 AI068604 revised to fill in gap see Fungal pages for seq CYP54C1 Fusarium graminearum FG11536.1 AACM01000473 FGcontig1.473_scaffold10 see Fungal pages for seq CYP54C2 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_20000135 Necha1/scaffold_20:362069-363960 75% to Fusarium graminearum CYP54C1 probable ortholog small gap after GYQK? (5 aa), this seq looks right, maybe the 54C seq is too long here. Note LTIEQ can be removed from 54C and the intron boundary is preserved. This gene model seems correct see Fungal pages for seq CYP54C3 Cryphonectria parasitica CB690595.1 60% to CYP54C2 N-term only, 55% TO 54C1 MATSTTSSLSETLSERFLVIVAAVLACALAYLGPKIISIIRLAALPVIGTE LGNEEKRRQAYLQGARKLYYAAYDNFKNGLFKGAARLTTSRNANVIVVSPKFLPELNKLP DSVVSMEAAVDDAMETKYTKIESHVPIIPHTVTGKLTPSLTRLNPTIARETAEALEL EMPLREFTNWQEVNIHEKLLRIVGMXSGR CYP54C4 Cryphonectria parasitica CB686707.1 CB688482.1 55% TO CYP54C1 TRPMPTCEDWTAVKINSVLLRIVAKASGRIFVGPELCHSEEYLEAAIRYTVEVIGAANAV SNVPPWLRSFKASKLPEVQRLHERRKHAIKFMQPVVESRKDLDQKPDDLLQWLIDNEGNL GDMSTWKLARTQLALSFAAIHTSTVVSTNVFYTLAVMPNEVILELRDEIRSVLGENNGNFT SSALQSMKKVDSFIKETMRYYPFANHSFERKVMRTFTLSNGQVIPAGVILECSTAHNQDD EVFPDASRFDPWRFSKLQEQEKEDGMDGAARHQMVSVTPNHLTFGYGRHACPGRFFAINX IKMIIGTFLLNYDIKNIDGVK* CYP54C5 Fusarium oxysporum FOXG_01238 82% to CYP54C1 see Fungal pages for seq CYP54C5 Fusarium verticillioides FVEG_00258 revised 90% to CYP54C5 Fusarium oxysporum = ortholog see Fungal pages for seq 55A Subfamily CYP55A1 Fusarium oxysporum GenEMBL D14517 (2598bp) Tomura,D., Obika,K., Fukamizu,A. and Shoun,H. Nitric oxide reductase cytochrome P450 (P450nor) gene, CYP 55, of the fungus Fusarium oxysporum containing a putative FNR site in its upstream flanking region. unpublished (1993) CYP55A1 Fusarium oxysporum PIR A40401 (403 amino acids) Kizawa, H., Tomura, D., Oda, M., Fukamizu, A., Hoshino, T., Gotoh, O., Yasui, T., Shoun, H. Nucleotide sequence of the unique nitrate/nitrite-inducible cytochrome P-450 cDNA from Fusarium oxysporum. J. Biol. Chem. 266, 10632-10637 (1991) Note: This sequence is probably a bacterial sequence related to the 105 family. CYP55A1 Fusarium oxysporum GenEMBL D14517 (2598bp) Park, S.-Y., Shimizu, H., Adachi, S.-i., Nakagawa,A., Tanaka, I., Nakahara, K., Shoun, H., Obayashi, E., Nakamura, H., Iizuka, T. and Shiro, Y. Crystal structure of nitric oxide reductase from dentrifying fungus Fusarium oxysporum. Nature Structural Biology 4, 827-832 (1997) CYP55A1v2 Fusarium oxysporum FOXG_12350 96% to CYP55A1v1 see Fungal pages for seq CYP55A1 Fusarium verticillioides FVEG_10773 96% to CYP55A1v2 Fusarium oxysporum see Fungal pages for seq CYP55A2 Cylindrocarpon tonkinense (a denitrifying fungus) GenEMBL D78511 Shoun,H. nor1 CYP55A3 Cylindrocarpon tonkinense (a denitrifying fungus) GenEMBL D78512 Shoun,H. nor2 CYP55A4 Trichosporon cutaneum (a basidiomycete fungus) GenEMBL AB052733 Zhang,L., Takaya,N., Kitazume,T., Kondo,T. and Shoun,H. Purification and cDNA cloning of nitric oxide reductase cytochrome P450nor (CYP55A4) from Trichosporon cutaneum Eur. J. Biochem. 268 (11), 3198-3204 (2001) Submitted to nomenclature committee Oct. 11, 2000 P450nor 66% to CYP55A1 CYP55A5v1 Aspergillus oryzae GenEMBL AB055659 Kaya,M., Hata,Y., Kawato,A., Abe,Y. and Akita,O. Aspergillus oryzae CYP55A5 gene for cytochrome P450nor Submitted to nomenclature committee Jan. 23, 2001 60% to 55A1 CYP55A5v1 Aspergillus oryzae GenEMBL BAC01275.1 cytochrome P450nor MNSEPVYPRFPFARPSGDEPPAEFHRLLRECPVSRVELWDGSHPWLVVKHKDVCEVLTDPRLSKVRQRDG FPEMSPGGKAAARNRPTFVDMDAPDHMHQRSMVSAFFNDEYVESRLPFIRDTVQHYLDRLIRAGKDGKEV DLVKHFALPIPSHIIYDILGIPIEDFEYLSGCDATRTNGSSTAAAAQAANKEILEYLERLVDKKTTNPSH DVISTLVIQQLKPGHIEKLDVVQIAFLLLVAGNATVVSMIALGVVTLLEHPDQLSRLLEDPSLSNLFVEE LCRFHTASALATRRVATVDIELRGQKIRAGEGIIASNQAANRDPEVFPDPDTFDMFRKRGPEEALGFGYG DHRCIAEMLARAELETVFSTLFQTLPSLKLAIPKSEIQWTPPTRDVGIVGLPVTWDRD CYP55A5v2 Aspergillus oryzae GenEMBL BAE61721.1 6 aa diffs to CYP55A5v1 MQLEPVYPRFPFARPSGDEPPAEFHRLLRECPVSRVELWDGSHPWLVVKHKDVCEVLTDPRLSKVRQRDG FPEMSPGGKAAARNRPTFVDMDAPDHMHQRSMVSAFFNDAYVESRLPFIRDTVQYYLDRLIRAGEDGKEV DLVKHFALPIPSHIIYDILGIPIEDFEYLSGCDATRTNGSSTAAAAQAANKEILEYLERLVDKKTTNPSH DVISTLVIQQLKPGHIEKLDVVQIAFLLLVAGNATVVSMIALGVVTLLEHPDQLSRLLEDPSLSNLFVEE LCRFHTASALATRRVATVDIELRGQKIRAGEGIIASNQAANRDPEVFPDPDTFDMFRKRGPEEALGFGYG DHRCIAEMLARAELETVFSTLFQTLPSLKLAVPKSEIQWTPPTRDVGIVGLPVTWDRD CYP55A5 Aspergillus flavus AFL2G_09432 99% to CYP55A5v2 Aspergillus oryzae, 2 aa diffs Supercontig 11: 146268-147914 (-) strand see Fungal pages for seq CYP55A6 Neurospora crassa AABX01000354.1 cont3.356 NCU06137.1 (version3) Neurospora crassa sequence contig 1.183 (supercontig 22) 1c8.tfa_90cg@1C8 70% to CYP55A1 EST h8b02ne.f1 in lower case mgaltenlefpfrrasgldpppeyawlrktdpvtqvrlfdgslawlvtkyhdvCQVATDERLSKERTRPGFPELA AGGKEAAKNRATFVDMDPPKHMQYRGMVQPIFEMEHVKELEPYIQKTVDDLLERMKNMGCEGGPVDLVQNFALPVPSY (?) IIYTILGVPVEDLDFLTQQNSIRTNGSSTAREASAANQ (?) ELLNYLRRLAEQRLQEPKDDLISKLMTEQVKAGKLDTADAVQMAFLLLVAGNATMVNMIAL (0) GIVTLFLNPSQLEELKRDPSVVPAFVEELSRFHTASAMAIRRVAKEDL EIRGKTIKKGEGVIASNLSANRDEEVFKDADKFDMHREWPVKELGFGWGEHRCIAEFLAKAELKAVF (?) ETIFKKLPDLKIAVPFDEIEYTPLDRDVGIVKLPVTW CYP55A6 Neurospora discreta JGI gene model estExt_fgenesh3_pm.C_80311 95% to CYP55A6 N. crassa see Fungal pages for seq CYP55A7 Trichosporon domesticus SBUG 752 (a basidiomycete fungus) GenEMBL AY044231 and AY044232 Dirk Patzak Submitted to nomenclature committee March 4, 2002 124 amino acid N-terminal fragment 77% to 55A4 CYP55A8 Fusarium graminearum FG11585.1 AACM01000475.1 FGcontig1.475_scaffold11 see Fungal pages for seq CYP55A9 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_30000098 Necha1/scaffold_30:272059-273639 92% to CYP55A2 Cylindrocarpon tonkinense probable ortholog genome annotation in progress 79% to CYP55A8 Fusarium graminearum This gene model seems correct DRN 2/5/06 see Fungal pages for seq CYP55A10 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_28000090 [Necha1:94366] Necha1/scaffold_28:245547-247185 96% to CYP55A3 Cylindrocarpon tonkinense probable ortholog genome annotation in progress Note: the very high similarity suggests a very close relationship between the two species. This is higher than the ortholog pair matches between F. graminearum and N. haematococca This seq is originally of bacterial origin (P450 B type not E type) similar to CYP105s This gene model seems correct DRN 2/4/06 see Fungal pages for seq CYP55A11 Nectria haematococca (Fusarium solani group) estExt_fgenesh1_pg.C_880004 (better model) Necha1/scaffold_88:11555-13495 58% to CYP55A3 Cylindrocarpon tonkinense 69% to AB226066.1 Aspergillus oryzae cDNA 58% to fgenesh1_pg.scaffold_28000090 54% to CYP55A8 Fusarium graminearum same as fgenesh1_pg.scaffold_88000004 which is too long at SNAQAHPAP between MHQ and RQM Note: This gene has one extra intron at the N-terminal that only codes for 3 amino acids This gene model seems correct DRN 2/5/06 see Fungal pages for seq CYP55A12P Aspergillus niger gw1.11.1089.1|Aspni1 48% to CYP55A5v1 aa 53-257, this is a pseudogene see Fungal pages for seq CYP55A13 Uncinocarpus reesii UREG_06342.1 71% to CYP55A5v1 Aspergillus oryzae see Fungal pages for seq CYP55A14 Histoplasma capsulatum HCB04560.1 73% to CYP55A13 see Fungal pages for seq CYP55A15P Fusarium oxysporum FOXG_14888 74% to CYP55A2 Cylindrocarpon tonkinense see Fungal pages for seq CYP55A16P Fusarium oxysporum FOXG_17502 83% to CYP55A11 see Fungal pages for seq CYP55A17 Aspergillus terreus ATEG_06140.1 73% to CYP55A13 Uncinocarpus reesii see Fungal pages for seq CYP55A18 Trichophyton rubrum ESTs DW700172.1, DW689200.1, DW681558.1 72% to CYP55A14 Histoplasma capsulatum see Fungal pages for seq CYP55B1 Chlamydomonas reinhardtii (green algae) See Chlamydomonas pages CYP55C1 Aspergillus niger fgenesh1_pm.C_scaffold_5000072|Aspni1 40% to CYP55A1, 40% to CYP55B1 see Fungal pages for seq 56A Subfamily CYP56A1 Saccharomyces cerevisiae GenEMBL X55713(2265bp) PIR S13502 (489 amino acids) Briza, P., Breitenbach, M., Ellinger, A. and Segall, J. Isolation of two developmentally regulated genes involved in spore wall maturation in Saccharomyces cerevisiae. Genes Dev. 4, 1775-1789 (1990) CYP56A2 Saccharomyces paradoxus NRRL Y-17217 GenEMBL AABY01000059.1 91% to CYP56A1 46799 MELFKLLGLTLFLTISYIAFAIIVPPLNFPKNIPTIPFYVVFLPVVFPIDQTELYDLYIR 46620 46619 ESMEKYGAVKFFFGSRWNILVSRSEYLAQIFKDEDTFAKSGNQKKIPYSALAAYTGDNVI 46440 46439 SAHGAVWRDYRNAVTNGLQHFDDTPIFKNAKLLCTLIKRRLQEGQLSIQMGPLSQRLALD 46260 46259 NISQVALGFDFGTLKHEKNAFHEHLIRIKKQIFHPFFLTFPFLDLLPIPSRKKAFKDVAS 46080 46079 FRELLVKRVQDELVNTYKFEQTTFAASDLIRAHNNEIIDYRQLTDNIVIILVAGHENPQL 45900 45899 LFSTSLYLLAKYSNVWQERLRQEVNGVTDPKILADLPLLNAFLLEAVRLYPPLSTIINRC 45720 45719 TTRTCKLGPEIVIPKGVYVGYNNFGTSHDPRTWGATADDFKPERWGSDIETIRKNWRMAK 45540 45539 NRCSVTGFHGGRRACLGEKLALTEMRITLAEMLKQFRWGLDPEWEEKLTPAGPLCPLNLK 45360 45359 LKFTENITE* 45330 CYP56A3 Saccharomyces bayanus MCYC 623 GenEMBL AACA01000139.1 83% to CYP56A1 23165 MSILQLIGFILLIIISYIAFVVVVPPLNFPRNIPTIPFYVVFLPVIFPIDQTELYDLYIR 22986 22985 EPMEKYGAVKIFFGSRWNILVSRSEYLAQIFKDEDTFAKSGNQKKIPYSALAAYTGDNVI 22806 22805 SAHGKVWRDYRNAVTNGLQHFDDAPISKNAKTFCNLIKKRAQNGESSIQMGPLSQRLALD 22626 22625 NISQVALGFDFATLTQDKNALHELLVHIKKQIFHPFFLAFPFLDLLPIPSRRKAFRDVAS 22446 22445 FRELLVKRVQDQLINNYKFEQTTFAASDLIRAHNNEVIDYKQLSDNIVIILVAGHENPQL 22266 22265 LFNTSLYLLAKYSTEWQNSLRQEVESVTNSKALADLPLLNAFLFEAVRLYPPLSTIINRC 22086 22085 TTKLCKLGADIVIPKDVYVGYNNFGTSHDPKNWGPTANDFRPERWGSDIETIRKNWRIAK 21906 21905 NRCTVTGFHGGRRACLGEKLALTEMRITLAEMLRQLEWTLDPEWEEKLTPAGPLCPFNLK 21726 21725 LNFKERLPS* 21696 CYP56A4 Saccharomyces castellii NRRL Y-12630 GenEMBL AACF01000075.1 63% to CYP56A1 43945 MPALRVILSLLIVLVGYVIFKVVWPPLNFPRNIPTIPCYVVFLPIIFDIDQVDIYNTYIK 43766 43765 VQMEKYGAVKIFFGSRWNILVSKPEYLAQIFKEEDVFAKSGNQKKIPYSAIAAYTGDNII 43586 43585 SAHGDVWRKYRSIMTNGLQHFKEEPFICNAKKFCGLLRKNRGPTG--NVLMGPLIQRLTL 43412 43411 DNISQVVLGFNFGTLSEETNELHQHLIKIKKQIFHPFFLTFPFFDMLPIPSRKKAFQDIS 43232 43231 KFREVLVSRVKNDLINNYKFEQTSYVSSDLIKAYNNNIIDYKQLTDNIVILLVAGHENPQ 43052 43051 LLLSNCLYLLAKYPS-WQKLIWKETQNISNYKDLNELPLFNSFLFEVVRLYPPLNIIINR 42875 42874 KTNKKCRMGQDIIVPKNTYVGYHNFGVTHNRNVWGNTAEEFDPRRWGTNIADIMIAWRSS 42695 42694 KNSCKLTSFHGGRRACLGEKLGFVEMRISIFEIIQQFELTLPKEWKEKMTPAGPLCPYNL 42515 42514 QLNIKSREKDVGVCE*42467 CYP56A5 Candida glabrata strain CBS138 GenEMBL CR380952.1, GenPept CAG59007.1 57% to CYP56A1 MLNILVFGFICLVLCWILTICFPPISFPNNIPTIPFYSVFIPQY YNIDQVQFYNLYIREKLEKYGAVKFYFGSRWNILVSRPEFLNNIFKNEDTFAKSGNQQ KIPYSVLAAYTGDNVISAHGVNWQKYRKPIKDGLQHFDIGILLKNAMKFCELIKEDMR PFKYHCNFKIIPYIQRLTLDNICRVGLGFEFGAIDEDNNSLHRQLIQIKKQIFDPFYL NFPMFDRLPIPSRLLAFQNVENFRSSLVDKVQKQLIKTYKFEQANTSGSALVRAYNNG ELTYKQLTDNIVILLVAGHENPQLFISNLIYLLGKYHDTWQVDIRNEILNNEENNLSE LPLLNSFLYECLRYYPPLSVIINRKTTKRCMLGPGIVVPKDTYVGYHNYSTCHDSNFW GHTSDIFDPTRWGKDIETINKAWKTTKNNCILNTFHGGKRACLGEKLVFVSTRIIIAE FLSSFEWKLSPLWVEQMTHAGPLCPKDLALDIRVRDTSMQIQKLNKYL CYP56A6 Ashbya gossypii (= Eremothecium gossypii) GenEMBL AE016819, NP_985947.1 54% to CYP56A1 MTANIVLVLLGGLVLNYIRSIVLPPWGFPRNIPTIPFYVTFLTT LFDIDQKELFELYIRKPMEKYGAVKIYFGNRWNILVSKPEFLAQMFKDEDTFAKSGNH IKIPYSILALYTGDNVISSHGLAWKKFRGALTQGLQFFDPSPLSANAEKFIGFIERDM KKNNGEVLMPALIQRLALANIAQIALGFDIGTLDPERPSKLQQQLEEVKKHIFHPLYM NFSFLDRLPIPSRIRARQQVERFRSDLLQEVRRNLIINYKYEQTSYAASDLIRVYEHG EITEKQLTDNIVILMVAGHENPQLLLTTCLYMLAKYPEWQARLRQQAVTLQDDLLNDS VEFNQFLFETVRVLPPLGQIINRKTSHKCKLGPEIVLPKDTYVGYQVYGTGTSTQVWG PDAAEFKPERWGSTNATVHETWRRSKNTGAMGAFHGGRRACLGEKMGLMETRVALAHM LRKLEWTLSPNWKDRVTPAGPLCPFMLKLQFTRIQEPLAAELTA CYP56A7 Kluyveromyces lactis NRRL Y-1140 GenEMBL XM_452498, KLLA0C06743g 49% to CYP56A1, 53% to 56A6 MLSLVLGIISSGILLSFLCLVCFPIGLPWSLPTIPIYVSFLPIL TDMDQLEIFERYIRHPMEKYGAVKIFFANRWNILTSKPEFLNIMFRDEDTFAKSGNQK KIPYSLLAKYTGDNVISAHGKVWMKYRSCIQNQLNVFDSKPLIDNAQLFVKLLKKSIE DNTEGTVLLPPLIQRLTLANIAKIALGFDIGTLKFDEHDNICRSRLHEKLNQVKQQIF KPVFLSFPFLDLLPIKSRQRAKIDIIQFRDMLLDTVYKNLIHNYKFEQTNNAAAGLIR EWKLQGITDEQLKDNLVIILVAGHENPQLLLSTLFYLLAKHQNWQYAIREELLSVDRN EDILNSFKLTAFIYEALRMFPPLGQIINRRTTRKCQLGPGIIIDKDVYCGYNVYGTGT ATSVWGETAKQFIPERWGENSNELANNWKKHKYDASMSAFHGGRRSCLGEKLALMEMK YVLYYVLMNFHIELHPQWKEKMTPAGPICPKMLKVKLTILKDPQDQRSDSLYSSAGET LSENI CYP56A8 Saccharomyces mikatae Smik_4197 90% to CYP56A2 Saccharomyces paradoxus partial seq see Fungal pages for seq CYP56A9 Saccharomyces kudriavzevii AACI02001147.1, AACI02000561.1 85% to CYP56A3 see Fungal pages for seq CYP56A10 Saccharomyces kluyveri AACE03000001.1 Sklu_Contig1634.3 Cyan region is an insertion 60% to CYP56A3 Saccharomyces bayanus w/o the insert see Fungal pages for seq CYP56A11 Kluyveromyces waltii kwal_162-g20.1 59% to CYP56A10 Saccharomyces kluyveri similar insertion as in 56A10 see Fungal pages for seq CYP56A12 Kluyveromyces polysporus Kpol_364p9 62% to CYP56A3 Saccharomyces bayanus see Fungal pages for seq CYP56B1 Aspergillus nidulans AN2706.1 43% to CYP56A1 547 clan revised 7/18/07 MLVTLTVLFLGLVGILASFLTYLFTPPQFFPKGLPTIPFYYTLIPLLNQT LKRAPTDQVTLYHRYLSTPLRTHGAVKLFFGGRWNILITKPSYIAEVLKN EDLYAKSGNQKKIPHSVLAQYTGDNIISSHGENWKLYSSIFKPGLQRDYD PSGIWRNASLLVQMISQDIKRKSPVDINPLMQRYALANLSEVLLGTTFDT LQKPAASLHAFQLLIKPKIFDPIFLNFPVLDYLPLQTREEARKLVTRFTD ELIETVRKGHTTCDHEKEHTRNLGCRLLYACESGLFTEMQLRHNMISAFL AGHENPQLLLVSSLFLLAEHPEMQESLRAEISALNDLEPAYNALSALPLL TSTIYEVLRLYPPISQLINRRTTAPTLLGGEIPIPAGTYVGYNAYATNRD IGFWGPDANEFKPSRWGNTMEEINALFRRANAKGAFISFHGGRRTCLGQR FALLEGRVTLAKLLMCVRWEIDPSWQRGMTPAGPLYARNLQLRFSNISGAGKAGAS CYP56B2 Uncinocarpus reesii UREG_00550.1 59% to CYP56B1 Aspergillus nidulans see Fungal pages for seq CYP56B3 Coccidioides immitis CIMG_00523.2 80% to CYP56B2 Uncinocarpus reesii see Fungal pages for seq CYP56C1 Aspergillus oryzae GenEMBL AP007164.1c third P450 of six on this accession CDS join(1160825..1161472,1161531..1162415,1162505..1162558) 47% to CYP56B1 Aspergillus nidulans 41% to CYP56A1 S. cerevisiae MVMHGLCIIIGKALAVGILFLSICIFSRIVPPRYPRNIPAVPFW VTLLSLVRDIDQEDIYRRHIQKPLQTHGAIKIFFAGQWNLLIQRSSYLNEIFRNEDVY QKSGNQKKIPHSVLAEFL (1?) GDNVISSRGTTWRLYRDIITPG LQGHFDAGLIAANAEELCSSLLAFQNTVGNCGVPVQDLLQQFTIANVSQVLLQANHKP TKGDESSLHQLQLAVKREIFKPIFMNFPVLDRLGKLIPCRVRARNVVEQFSAALQYGV RHGQGTPNASNLGARLMAARDGGVLTGKQFRDNLNVLFVAGQENPQLLLISMLYLLAKHP (0) DVQSRLRREIDACSTLDPSNVAFSELPYVTCVIYESLRLLP PISQLINRRTSQDVILGNQIYIPKGTYVGYNCYSTNRDPAVWGPTADEFRPERWGQSN TEISQCYRQRRARAEFVSFHGGSRACLGEKFALLEARVALFVLVSRLSWSLDPEWPDR KTPAGPLYPRALRLIFTERK CYP56C1 Aspergillus flavus AFL2G_06061 98% to CYP56C1 Aspergillus oryzae see Fungal pages for seq CYP56C2 Uncinocarpus reesii UREG_04911.1 60% to CYP56C1 Aspergillus oryzae see Fungal pages for seq CYP56C3 Aspergillus clavatus ACLA_064250 53% to CYP56C2 see Fungal pages for seq CYP56D1 Clavispora lusitaniae ATCC 42720 GenEMBL AAFT01000066.1, CLUG_02306.1 46% to CYP56A1 MLIDIIISNISVSFWCLCFFFLSWKIFRIIYPPFDFPKNIPTIPFYAAFVSSFLGWDQNK VYAHYYRENLEKYGAAKIYFASRWNILVTKPEYLAQIFKQNEIFEKSGNQHKIPYSLLAE YTGDNIISTGNKYWKLYRKAVTNSIQFPNLEPLQTNATVLIENLSAASDNGYFPIADILQ AYTLNNVGDCVIGTNFHKSDNFDISTSLKKVKQEIFRPLFLNFPLLDLLPFPSRIRARKL VRSFKRNYCNKILNELNEENKGRLGHNLATFWKQGDITEKQFQDNAMIAMIAGHENPQLL LTSLIYVIAKYPEIQRQLRSEMNSSNYDVKEQIPLLKAVVYETIRLFPPLGLIVNRKTRK NVILGNGIFIPKGTYVGYNNFGTQRDRNYWGDDADAFRPERWGNDGKKIEEKYAEAKSKC TLPAFHGRSRACLGEKFALAEVRKAVIEILQNFEISLDPSWKEQLTPAGPICPLKLSVML KKVCSEHEHNI* CYP56E1 Candida tropicalis MYA-3404 GenEMBL AAFN01000070.1 45% to CYP56A1 MIMFDIVKYTLIGLFSYLLYVILDIVLPPFNFPKNIPTIPFYVSFLGAYTNLDQRDI YNLYLREKLEKYGAVKIYFASRWNILITRPEYLLEMFRNEDVYSKRGNHLKIPGSVMATY TGDNIISAHGELWKLYREVIAKSIQFPDFEPITKNTKSLLEIIDGMIDSDKNHAIIPITD LFQKYSLANVTESILGVNFKVLEGDQSIMHQKIKYVKLQIFKPFFLNFPYFDSFPIPSRL QARKEVINFRNWYGQSIIDKHDPQLPNSAATKLVDGLMQEKLTEKQFLDNAIIVMIAGHE NPLLLMLSLMFVAAKYPKVQEAIRSEIDPTKPYLHSVIYETLRMYPPLGLIINRYTTRPT KLGNIVIPKGVYCGYNNFGTGRDRNVWGPDSDEFKPERWGRDNIEEINRNYANAKRSAEL PAFHGRKRACLGEKYALYEVKELLTSILGHYKVTLDASWKEKITPAGPISPFGLKVKFEK LIVA* CYP56E2 Candida albicans SC5314 GenEMBL AACQ01000054.1 45% to CYP56A1 missing N-term runs off the end of the contig whole seq on orf19_554 151305 DFLLEMFKNEDVFAKSGNHVKIPNSVLATYTGDNIISAHGELWKLYRDVVAQSIQFPDLK 151126 151125 PISKNTQKLLRFLDNEMGSANQATISVTDILQKYSLANVCESVLGVNFNVLDDKQSLMHE 150946 150945 KIKYVKLQIFNPIFLNFPYFDNFPIPSRLKARREVIGFRKWYGQSLIEKYNLQLPN 150778 150777 SAATKLVDSLIKEKLTEKQFLDNAIILMIAGHENPLLLMLSLLYVVSKYPQVQELIRNE 150601 150600 TETTKPYLHSVIYETLRMYPPLGLIINRCTTRITKLGNIVIPKNVYCGYNN 150448 150447 FGTGRDRNVWGSDADIFKPERWGLEIDEINKKFTLAKRSAELPAFHGRKRACLGEKYALF 150268 150267 EVKQFLLAILGEYKVSLDPNWKEQLTPAGPISPLRLKLNFEKLTVS* 150127 CYP56E3 Candida dublinensis cdub_14-g334.1 96% CYP56E2 Candida albicans (ortholog) see Fungal pages for seq CYP56E4 Candida parapsilosis CPAG_05515 65% to CYP56E1 Candida tropicalis see Fungal pages for seq CYP56E5 Lodderomyces elongisporus LELG_03266 runs of EE probably in an intron seq 59% to CYP56E2 Candida albicans see Fungal pages for seq CYP56E6 Pichia stipitis JGI model e_gwh1.2.1.1913.1 57% to CYP56E4 Candida parapsilosis see Fungal pages for seq 57A Subfamily CYP57A1v1 Nectria haematococca MP VI (Fusarium solani group) GenEMBL L20976 (3276bp) S70757 (1373bp) PIR S45583 (515 amino acids) Maloney,A.P. and VanEtten,H.D. A gene from the fungal plant pathogen Nectria haematococca that encodes the phytoalexin-detoxifying enzyme pisatin demethylase defines a new cytochrome P450 family. Mol. Gen. Genet. 243, 506-514 (1994) PDAT9 pisatin demethylase CYP57A1v2 Nectria haematococca (Fusarium solani group) no accession number Kevin McCkluskey and Hans Vanetten, Univ. Arizona personal communication pda4 pisatin demethylase Note 98% identical to CYP57A1v1, probable allele CYP57A2 Nectria haematococca (Fusarium solani group) GenEMBL X73145 (2750) Reimmann,C. and VanEtten,H.D. Cloning and characterization of PDA6-1 a fungal gene encoding a cytochrome P-450 which can detoxify the phytoalexin pisatin from garden pea. Gene 146, 221-226 (1994) high similarity to CYP57A1 CYP57A3 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_15000311 Necha1/scaffold_15:835216-837380 61% to CYP57A1 see Fungal pages for seq CYP57A4 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_30000136 Necha1/scaffold_30:357529-359333 63% to fgenesh1_pg.scaffold_15000311, 57% to 57A1 This gene model seems correct DRN 2/5/06 see Fungal pages for seq CYP57B1 Nectria haematococca (Fusarium solani group) gw1.57.27.1 Necha1/scaffold_57:81298-82977 43% to CYP57A3 in CYP57 family, model short at N-TERM see Fungal pages for seq CYP57B2 Nectria haematococca (Fusarium solani group) 83% to CYP57B1 see Fungal pages for seq CYP57B3 Aspergillus oryzae GenEMBL BAE55311.1 56% to 57B2, 55% to 57B1 MIGTVLDTVLGNPQGAVGGLFVGSFVIFWVVPFLYNLFFSPLRNVPGPFWARFTILWEFSQLMKGRSHEE YIKLHKKYGPVVRVSPKRYSVIDPQDVKKIYGFGADFPKSEFYDSLGDPKNIFTVRDNEDHKDRRRKVAS LYTMSSMVAYEDAVDRMTTLCIKKMTDLAASRKLISIPKFMQFYAFDVIGEITFDQNFGMMENMGDTQGI IKEIHAMNNTIGIMGLLPELQSTWRAAQKVLFSGTAMGKLTEYIWSQYWKHRNTNSGAKQKSQYDTFLRK VLELEAAHKVGRMNVLDSCGSNIGAGSDTTGITLSAALWYIYRNPDKLAKLRHEIDTMTAEGRISDPVTY KQAQEMPYLNAVIKETLRVHPGVGTILARVVPKGGITLTGGYYVPEGTHIGTNAWPLHYSETVYGPDADQ YRPERWLEDKPQPDYRDSMMFAFGSGSRTCIGRNISLLEITKVLPQIVRKFDLKFEEKKDPWDAWCAWFV YPKYKCWIEPRKPVENVA CYP57B3 Aspergillus flavus AFL2G_00227 revised 100% to CYP57B3 Aspergillus oryzae see Fungal pages for seq CYP57B4 Fusarium oxysporum FOXG_01386 81% to CYP57B2 see Fungal pages for seq CYP57C1 Mycosphaerella fijiensis fgenesh1_pg.C_scaffold_2000556 revised N-term 51% to CYP57B3 Aspergillus oryzae see Fungal pages for seq 58A Subfamily CYP58A1 Fusarium sporotrichioides (filamentous fungus) GeEMBL U22462 PIR S57337 (520 amino acids) Hohn.T.M., Desjardins,A.E. and McCormick,S.P. The Tri4 gene of Fusarium sporotrichioides encodes a cytochrome P450 monooxygenase involved in tricothecene biosynthesis. Mol. Gen. Genet. 248, 95-102 (1995) Note: also called TOX4 CYP58A2 Myrothecium roridum, complete cds. GenEMBL AF009417 (4058bp) Trapp,S.C., Hohn,T.M., McCormick,S. and Jarvis,B.B. Characterization of the gene cluster for biosynthesis of macrocyclic trichothecenes in Myrothecium roridum Mol. Gen. Genet. 257 (4), 421-432 (1998) (TRI4) gene CYP58A3 Fusarium graminearum CYP58B1 Aspergillus parasiticus No accession number Ken Ehrlich Submitted to nomenclature committee 12/30/2003 40% to 58A1, 37% to 58A2 on the border of the CYP58 family gene name CypA in aflatoxin biosynthesis CYP58B1 Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 join(3110..3314,3377..4218,4278..4388,4464..4630,4700..4871) gene="aflU" MASNTVYTSLIGLLVALTVRSIYRVYFHPLRKIPGPKIAAITHL YQHYYDAVKGGKYIWKLDELHRKYGPIVRFNPNEVHIQDSHYYHHIYAGGAKKQDKDP GFPAVPLFPGVTVTTIKHNHHRLRRGIIKSFFSKQYVTGLEHVIQSKVNLLASRFTEA YRHGTVLDLKYVFAALTSDLTTHYVYGTNLNHLAEPDFKNDFLAGMDSVGPWIPVLLV FGRLLKLARYLPACLVPAGEFLHLWTLSERRVGEILDSQDNGTMGDQKTLLQAMATAD VSEEEKTATRLQMETLNIIAGGTETTARALAVGVFHLAHKPSLLLQLRDELRTVMPFP DSSASWTQLEQLPYLAGVVNESLRLSFGFIIRSARVYPNDPLVYEDLVIPPGTPISQS AYFVCMDPSIFPQPEDFNPDRWVQAAREGNNLHRYLIVFSKGSRHCLGINFALAEIYL AIATIARRFDLVPYQTTVEQLQMKRDLGFAAPEKGPFTVRAKVTGLAD CYP58B2P Aspergillus flavus NRRL3357 pseudogene GenEMBL AAIH01000612.1, AFL2G_07229 missing first 29 aa, two frameshifts and one stop codon, bad boundary at 1273 76 LRKIAGPKIAPITHLY*HYYDAVKGGKYIWKLDELHRKY (1) 192 255 GPIVRFNPNEVHIQDSHYYHHIYAGGAKKQDKDPEFPAVPLFPGVTVTTIKHNHHRLRRG 434 435 IIKSFFSKKYVTGLEYVIQSEVNLLASRFTEAYHHGTVLDLKYVFAALTSDLTTHYVYGTNLKX 623 626 LAEPDFKNDFLAGMDSVDPWIPVLLVFGRLLKLAGYLPACLVSAGEFLHLWT 781 783 LSERRVGEILDPQDNGTMGDQKTLLQAMATADVPEEEKTATRLQMETLNIIAGGTETTARALAV 974 975 GVFHLAHKPSLLLQLRDELRTVMPFPDSLASCTQLEQLPYSYL (0) 1103 1163 AGVVNESLRLAFGFIIRPARVYPNDPLVNEDLVIPFP (?) 1273 1362 TPISQSAYFVCMDPSIFSQPDDFNPDRWVQAARDGNKLHRYLIVLSKGSRHCLGIN (2) 1529 1599 FALAEIYLAIATVARRFDLVPYQTTVEQLQMKRDLGFTAPEKGPFTVRAKVTGLAD* 1769 CYP58B2P Aspergillus oryzae Supercontig 10: 19609-20001 + 95% to CYP58B2P Aspergillus flavus even pseudogenes are well conserved see Fungal pages for seq CYP58B3 Aspergillus nomius isolate AN13137 AY510454.1 8295-10054 92% to CYP58B1 Aspergillus parasiticus gene CypA of aflatoxin biosynthesis gene model revised MVSKTVSTALIGLLVALILRSVYRVFFHPLRKIPGPKIAAITHL YQHYYDAVQGGKYIWKLDELHRKYGPVIRFNPNEVHIQDSHYYHQIYAGGAKKQDKDP GFPAVPLFPEVTVTTIKHHHHRLRRGIIKSFFSKQYVMGLEHVIQSKVNLLASRLTEA YRHDTVLDLKYVFAALTSDLTTHYVYGTNLSHLTEPDFKNDFLAGMDSVGRWIPVLLV FGRLLKLARYLPACLVPAGEFLHLWTLSERRVGEILGSKNNGTMGDRKTLLQAMATAD VPEQEKTATRLQMETLNIIAGGTETTARALAVGVFHLANKPSLLLQLRDELRTVMPFP DSTASWTQLEQLPFL (0) AGVVNESLRLSFGFIIRSARVYPNDPLVYEDLVIPPG (0) TPISQSAYFVCMDPSIFPHPEDFNPDRWVQA ACEGHNLHRYMIVFSKGSRHCLGINFALAEIYLAIATVARRFDLVPYQTTVEQLQMKR DLGFAAPEKGPFTVQAKVTGLAD* CYP58C1 Aspergillus nidulans see Fungal pages for seq CYP58D1 Aspergillus nidulans AACD01000172.1 Aspergillus nidulans FGSC A4 revised see Fungal pages for seq CYP58D2 Aspergillus oryzae RIB40 GenEMBL BAE58658.1 AP007157.1b (second P450 of 8 on this accession) 73% to CYP58D1 complement(join(191351..191525,191581..191747, 191803..191910,191973..192829,192878..193118)) MGVISVVTFLASAKEHFIISILLLFPIALILRTIYRLYIDPLHH IPGPKLAAISHLYEFYHDVVRGGLFIWEIEKMHQEY GPIVRINPREVHIKDPYFYDELYAPAHGWRDKDAKSVEIFSSPNA LVSTVDHHTHRMRRKLLTSFFSRRSIERIEPVIHE SLSKFLDSLITAYEEDSVVELIDRLQALTGHVITQYAYGEDYGLHEPQNIGKGIVKVV QEGTEQIHLHRFFPLIQRFLRLIPSFFMTQLFPARAAMYDLLHGVRKKSIEVLQQKDV CTPTERTTMFHALTAPEVPPEERTLQRLEDEGLVLFAAGTETTATTLGVAIFHILSDP MVLTKLRKELEQVMPTPEGLATWRELEKLPYLNGVIHEALRFSGLAMRQQRVSPTEVI KYKDYAIPPGTPVSMLQYFLHTDPALFSDPEKFYPERWMLAAERKESLSRFLVTFGKG TRSCIGMNLAYAELHTALAAIVRRFDLELYETTAEDIRFVRDKLLPRAKNGPWRVRVK VVGIRKD CYP58D2 Aspergillus flavus AFL2G_03972 99% to CYP58D2 Aspergillus oryzae see Fungal pages for seq CYP58D3 Aspergillus fumigatus Af293 GenEMBL XP_753152.1 also AAHF01000003.1 trichodiene oxygenase predicted 71% to 58D1 N-term part of broken gene MRRKLLNNFFSRRSIENIEPVIHNSLSRFLDGLTTAYKEGSVVELIDRLQGLTGDVITEYAYGENYGLQD PENYGKGIVKVVQEGTDQIHFFRFFPLVSQLLRIIPSWFMGQVFPARAAIYQLLDGVRKQSILALQQKNV STAPQRMTIFQALTAPEVPPEERNLQRLQDEGLVLFAAGTETTATTLAVALFYN CYP58D3 Aspergillus fumigatus Af293 GenEMBL XP_753171.1 also AAHF01000003.1 73% to 58D1 C-term part of broken gene YILNDKRVLSKLRRELEQI MPTPQHPTTWRELEKLPYLTAVIREALRFSGIVMRQQRISPTEALKYKDIVVPPGTPVSMIARFVHMDPT IFPDPTRFHPERWMLAAGREEGLSKFLVTFSRGNRGCIGMNLAYAELYNTLAAVVRRFDLELNQTTPENI RFVRDKLLPRSKDGSWTVRVKVVGIREQ CYP58E1 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_14000019 Necha1/scaffold_14:58063-59829 48% to CYP58A3, 46% to CYP58A2 46% to 58A2 also 48% to 579A2, check 579A2 vs 58A2 = 65% Note 58A2 and 579A2 are same family and same subfamily, already revised names 579A2 = 58A3, not yet changed on blast server This gene model seems correct DRN 2/5/06 Note this gene is near scaffold_14000017 see Fungal pages for seq CYP58E2 Aspergillus oryzae GenEMBL BAE59845.1 61% TO 58E1, 47% TO 58A2, 41% to 58C1, 46% TO 58A3 MLDTISESRPNGLATLGVVVATFATYLALKALYNLYLHPLRKFPGPKLAAIGPYYEFYYDVMKDGMFLWE MERMHQVYGPIVRVNANEIHIRDPHYYSTVYAGNHRSTDKYHDAVAAFSVPQASLATIHHKVHRLRRSIL NPYFSKAAVTRLESAINERIERMCSRLEETMHYGQVVDLDAGFAALTADIVTTYFYGQNFDYLGNEGFKF QVRDAILGLIQFYHFTRFFPWIANTIKKLPIPIMRLIHPGAAYLVSSQEEIKDSIRASLDKGNKADAKSV IVQALEDPTIPPQERTLDRLGDEGTTIIFAGTETTARALSVGMFHILNNKTILKKLREELDTLPGVSSGV YSHVQLECLPYLTGAVQESLRLSHGPAIRLPRVANDKALKYGDYIIPPGTPVSLCTVLVHQDPCIFPDPH RFDPERWVKASKEGVNLDKFIAAFTKGTRQCLGINLAYAEIYLTFAKLIRTFNMEIYDTTVDDLTVHHIR LTGAPKQGTGEVKVKVTEKIL CYP58E2 Aspergillus flavus AFL2G_06877 revised 100% to CYP58E2 Aspergillus oryzae see Fungal pages for seq CYP58E3 Neosartorya fischeri NFIA_042080 53% to CYP58E1 see Fungal pages for seq CYP58E3 Aspergillus flavus NRRL3357 AAIH02002155.1 Identical to CYP58E3 Neosartorya fischeri (partial seq, upstream part in a seq gap) see Fungal pages for seq CYP58E3 Aspergillus terreus NIH2624 AAJN01000124.1, ATEG_04417.1 93% to CYP58E3 = ortholog see Fungal pages for seq CYP58F1 Aspergillus oryzae GenEMBL BAE64028.1 46% TO 58E2, 40% to 58C1, 45% to 58D1, 44% to 58B1 46% to 58A3 MALLVILGIGFVTYLVFLSIYRLYFHPLSHIPGPKLAAITHGYEFYHNIIRGGLFIWELERLHEVYGPII RINPREVHIKDPDYYDDIYASSARKREKDPVLVAQFGLEGSGFSAIDQETHHQRRAPVAKFFSKRAIENQ EHLIRDSLNKLVGHFQEACRAHTVVSLDAAFAALTSDVIHQYAYGFNPNNLDKEGFNAKVRDGINGLFQL AHLLYFFPILQTLMNVMPLPLLQVLNPPAFALASQKKELYEQGAAALEKANSKAGSGTLIDTLAAPSMPE HMRSAHRLMNEGFALVIGGTETTARSLALGAWHLFTREDIRNKLREELKQVMPTPDSQPTWNDLEKLPYM SGVISESLRLSTGIANRSSRVAPTEALVYKDYTIPPGTPVSETNYFTLMDPEIFSDPHAFDPERWIRAAA KGQRLDRYLVNFSKGSRMCVGLNLAYAELFLVIATIVRRFDIELYETPKSNIDFARDFGTPYPDKGNFSV RALVTGVISE CYP58F1 Aspergillus flavus AFL2G_09007 99% to CYP58F1 Aspergillus oryzae see Fungal pages for seq CYP58F2 Aspergillus niger fgenesh1_pm.C_scaffold_7000005|Aspni1 60% to CYP58F1 see Fungal pages for seq CYP58F3 Aspergillus terreus ATEG_04068.1 68% to CYP58F1 see Fungal pages for seq CYP58G1 Aspergillus oryzae GenEMBL BAE64022.1 40% to 58C1, 41% to 58D1, 39% to 58B1, 35% to 58A3 39% to 52E2, 41% to 58F1 revised 3/18/2009 MNLPLIWIGLFTAVSYLVIRSIYRLYFHPLSNFPGPKLAAVTHLYEFYYDVVKGGKFIWEMQRMHDQY GPIVRINPREIHIKDPYYFDPIYTSKGQAKDPYIVRTFATPLSTAATVEHDRHRYRRDLVNPFFSKRSVMGVDYIV QDKVDKVCKRLTQVHERGTVVSLDDLFAALTADVISHYAYGESLGFLDTENLKNEFRDAVASAGLLCHFA RFFFVVSMVAETMPALVEWMQPSSKGLWEAKRMIEQMARSSLEKDHEKNANSRKTIFDALCAESVRPEER TVARVRDEAMVVFGAGTETTARVLATGSYYLYRDKPRLEKLRAEIETVMPDSTDHVSLTQLESLPYLTAV INESLRMAHSVTMRLPRISPTPLAYKDYIIPPG TPVSQSVYFMHMDPTLFPNPDSFNPERWLEASSKGERL TKFLVPFSKGSRICLGMNLAYSELYQMFATLVRCFDLEIQTPPESVRITRDFIIGLPDDADYLKVHSLVT NAL CYP58G1 Aspergillus flavus AFL2G_09004 99% to CYP58G1 Aspergillus oryzae see Fungal pages for seq CYP58H1 Aspergillus oryzae GenEMBL BAE56719.1 43% to 58B1, 44% to 58C1, 42% to 58D1, 38% to 58A3 42% to 58G1, 40% to 58E2, 42% to 58F1 MDEFTISAGSLCLLSLVVGFIIQSVYRLFFHPLRKFPGPKLGAISHLVQGLCFANNPKGPVVRINPRELH INDPYYYEQIYAGSSRIREKDPRFIGVFTTPLPMVATVGHEHHRIRRGLLSSYFSRRALKKAELIIDQKV DRLLVRFHSAFKCHAVLPLQRVFAALAADIVSEYCYGASQGYLEQKVFQNQMIDAVNYVMSMCHINKSIP IIPKLLRCVPVGLMEKLGLQMADVIGVRNLIRRQAAKSLDKEWLSHDTNMLSKNVFDAIAAADVAPQEKT LRRLEEEGAALFGAGIETTARALTVAMFHLISDETMIRKLRDELKQVMLSPASRPTWAELEQLPYLTGVV NESLRLSFGLVARSPRVSPIESLAYGEYVIPPG (0) TPVSQSAYFVHMNPQV FPEPESFNPERWIKAAEKGQYLSRFLVAFSKGSRQCLGMNLAYAELYLTLARIVRLVDMKLVGTTIDNIR VGRDLGHPAPKAGNFKVKVEVMGIASKS CYP58H1 Aspergillus flavus AFL2G_07403 98% to CYP58H1 Aspergillus oryzae see Fungal pages for seq CYP58J1 Aspergillus oryzae GenEMBL BAE62179.1 41% to 58C1 46% to 58C1, 44% to 58D1, 41% to 58B1, 40% to 58A3, 43% to 58E2, 42% to 58F1, 41% to 58G1, 41% to 58H1 MDSPRLWTGLAGLITYLIVISVYRLFSHPLRNIPGPKLAAVTHLYEWYYDLFLGGKYLFEIERMHERYGP IVRINPPEIHINDPKYYDEIYASGTHRRNKDAEFVSFTGLLLSSASTTDHDLHRYRRGLMNNFFSKKSVR GISYFVEEKVHNLMQRFEAFYRCNKVVRLDDAFAAMTSDVITHYCYGKSWDYLDYANLRTDVRKAVRDLT CSVHFNRIFPIFLAVLKKLPLRWLYAIHPGRSVVLDIQKTIYEQSAEAIHGDKYKIGHNDAVDKHKTIYD QLTDPSIPAEERSLQRLQDEGLLLISVGTETTARALTTACFHIASDDQLRTRLREELRTVLPTPTSSVTW SELEKLPYL (0) TGTVNESLRLGGFLTTRSPRIAPDEPLTYKEYTIPPS (0) TPVSSSSYFGHKNPNIFPEPEKFSPERWISAGRNNDHLFKYITSFSRGSRICAGMNLAF LELYMTLAYFVRRFDVELVDTTVEDMKIVRDMRVGFTHRGEPTVYGRIVRVYED CYP58J1 Aspergillus flavus AFL2G_06095 98% to CYP58J1 Aspergillus oryzae see Fungal pages for seq CYP58K1 Aspergillus oryzae GenEMBL BAE63021.1 41% to 58J1, 38% to 58A1, 40% to 58D2, 39% to 58C1, 38% to 58B1, 42% to 58J1, 37% to 58H1, 37% to 58G1, 39% to 58F1, 39% to 58E2 MVMDQAWTWMALVAIAGLFWITRIFYRLYFHPLAKIPGPKLAAASHLLEFYYDVILGGKFLFQVEKMHQK YGPIVRINPKEVHIIDPTFYNEIYASGMRKRDKYEGFVRSLAADESTVSTVGSEKHRFRRSILQNFFSKR SVMEFSSAIGERVEKLMRRLEVFEKTQTPVALDVVFSALTSDMITYICYGKDWKFLDHKDFNCDIHQAGV DFANFFHFNRFFPWVFMTLRALSPRMLALLIPGRAATFKFQESLLKHAIEMAANEQSDAPSKETEKSRPN VISNLINPSIPYMERSRRRLEDEVITILVAGTEAPVKVLSMAMYYLGSEPAIGEKLRAELKTILPARTST ATYAELEKLPYLHGVVYESLRISDSVIARFPRIAPTETLRYKDHILPPGTPMSCSSYFISRNHDLFPNPE KFDPERWIHAAEKGENLKQHLTAFTKGSRICLGINLTIAELFLTIAHMCRRYHILLHNTEPEDVCTTSDL LAGYTRRGVLKVHAKLKAVRE CYP58L1 Aspergillus niger e_gw1.12.119.1|Aspni1 49% to CYP58D1, 49% to e_gw1.8.187.1 see Fungal pages for seq CYP58M1 Aspergillus niger e_gw1.8.187.1|Aspni1 51% to CYP58K1 see Fungal pages for seq CYP58M2 Neosartorya fischeri NFIA_057570 55% to CYP58M1 Note: this seq has no ortholog in A. fumigatus see Fungal pages for seq CYP58M3 Aspergillus terreus ATEG_02828.1 55% to CYP58M1 see Fungal pages for seq CYP58M4 Aspergillus terreus ATEG_03531.1 57% to CYP58M1 see Fungal pages for seq CYP58N1 Aspergillus niger e_gw1.9.613.1|Aspni1 42% to CYP58C1, 39% to e_gw1.8.187.1 see Fungal pages for seq CYP58P1 Aspergillus clavatus ACLA_013370 49% to CYP58H1 see Fungal pages for seq CYP58Q1 Aspergillus terreus ATEG_02820.1 49% to CYP58M1, 48% to CYP58P1 see Fungal pages for seq CYP58-un1 Aspergillus niger gw1.19.191.1|Aspni1 43% to CYP58J1 pseudogene fragment C-term, 52% to CYP58M1 TPMGSSSDFIRRDPDNFPSLNNFTSTRWLMENGITIHSTRYLVSFSKGSRGRIEM () NLPYFELYLTVAYLVR 59A Subfamily CYP59A1 Emericella nidulans GenEMBL L27825 (4310bp) Keller,N.P., Kantz,N.J. and Adams,T.H. Aspergillus nidulans verA is reqired for production of the mycotoxin sterigmatocystin Appl. Environ. Microbiol. 60, 1444-1450 (1994) gene stcS note: C-terminal missing Aspergillus nidulans (same as Emericella nidulans, Emericella is the teleomorph sexual phase of the fungal life cycle. GenEMBL U34740 (505 amino acids) Brown, D.W., Keller, N.P. and Adams, T.H. complete sequence of CYP59 Proc. Natl. Acad. Sci. USA 93, 1418-1422 (1996) CYP59A1 Aspergillus nidulans AACD01000132.1 stcS L27825 revised 7/20/07 MPSFSLLTGHFGALKQTIDGMPPNATLHSIMLKLSQKFRSGMFY INMWPFSGTWLVVATPSGAAQIQSLNLSKPNILRRPLETITGGPSLMSMHGETWKRWR ALFNPGFNPNYLIGLAPLIADEVVVFCEQLRQKARTGTVFQLEPLTLRLTVDTICSVT LYVVTPVGRWPFLTPDLEIHSSTTKLRTTPLPQRCNGRSNGPRLELPSTPLRRYLTVR PLVMWYNNRLMNRFIDQEVDRAYREQSGRQSKSVISLALRDYMKEKDGSLEDFKRRVA PQLRVFLFAGRDTTSSTLLYAFYLLSRHPEALAKVRLEHDQVFGPYHQQVHEKIHQDA KLLNQLPYTTAVLKETLRLFPPSASMREGGPGVEITDDNGQVYPTAGCNVWTLTVALH HNSAHWAEAESFIPERWLVGSDHPLYPAKGAWRAFEFGPRSCIGQTLAMLELRVALAM TLREFDIAPAYDKWDHIYPNDAVKEFNGHRAYQAEKGGGGAHPADGMPCLVTFRV CYP59A3 Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 join(43951..44429,44479..45478) gene="aflN" function="conversion of versicolorin A to demethylsterigmatocystin 70% to CYP59A1 CYP59A4 Aspergillus oryzae GenEMBL BAE59520.1 also BAE71324.1 72% to 59A1 MPEFQLLAGHFGTLKKTIQGMPSDATLHSIMLKISQQFPSGIFYINMWPFSGTWMIVSTPSAASQIQKLN LSKPAILRRPLEMVTGGPSMMSMHGETWKKWRALFNPGFNPAYIIGLAPNISDEVAIFCAQLRKIAQQGE VFPLESLTTRLTVDSICSVVLDTQLHHQIKDHPLATALQRQIDWTSFGTTFNPFKRYLTIRPLVLWYNNK FMDRIIDGEVDRAYCTPPGHPSKSVISLALREYLQEQASNNSTRSLAEFKRLVAPQLRVFLFAGRNTTSS TLIYTYYLLAQHPEALAKIRAEHGDVLGADPAEAQGRIKEDVQLLNKLPYTTAVIKETLRLFPPSASMRE GRPDAEIIGEDGQRYPTVGCNVWTLTVALHHNSDYWDQVENFIPERWLVGPEDPLYPVKGAWRAFEFGPR SCIGQTLAMLELRIALAMTIRQFDITPAYDEWDSIHPATTAREVNGHRAYQAERGAGGAHPADGFPCRVK ERC CYP59A5 Aspergillus oryzae GenEMBL BAE48804.1 92% to 59A4, 71% to 59A1 MPEFKLLAGHFGTLKKTIQGMPSDATLHSIMLKISKQFQSGIFYINMWPFSGTWMIVSTPSAATQIQKLN LTKPAILRQPPETVTGGPSMMTMHGETWKKWRALFNPGFNPAYIIGLAPNITDEVATFCAQLRKKAQQGE VFPLESLTTRLTVNSICSVVLDTQLHHQIKDYPLATALQRQINWTSFGTTFNPLKRYFTIRPLVLWYNNK VMDRIIGGEVDRAYRTPPDHPSKSVISLALREYLQEQASSNSTRSLAEFKRLVAPQLRVFLFAGRNTTSS TLIYTYYLLAQHPDILAKIRAEHEDVLGVNPEEVQGRIKEDAQLLNKLPYTTAVIKETLRLFPPSASMRE GRPDAEIIGENGQRYPTVGCNVWTLTVALHHNSDHWNQVESFTPERWLVGPEDPLYPVKGAWRAFEFGPR SCIGQTLAMLELRIALAMTIRQFDITPAYDEWDSIHPATTSKEVNGHRAYQAERGAGGAHPADGFPCRVK ERF CYP59A5 Aspergillus flavus AFL2G_07219 96% to CYP59A5 Aspergillus oryzae see Fungal pages for seq CYP59A6 Aspergillus nomius isolate AN13137 AY510454.1 join(49221..49702,49768..50767) 94% to CYP59A5 Aspergillus flavus gene VerA of aflatoxin biosynthesis MPEFKLLAGHFGTLKKTIQGMPSDATLHSIMLKISQQFPSGIFY INMWPFSGTWMIVSAPSAASQIQKLNLTKPAILRRPLETVTGGPSMMSMHGETWKKWR ALFNPGFNPAYIIGLAPNIADEVATFCTQLRKKAQQAEVFPLESLTTRLTVDTICSVV LDTQLHHQIKDHPLATALQRQIDWTSFGTTFNPVKRYLTIRPLVLWYNNKVMDRIIDS EVDRAYRTPPDRPSKSVISLALREYLKEQSSSSSTRSLAEFKRLVAPQLRVFLFAGRN TTSSTLIYTYYLLAQHPEVLARIRAEHEDVLGGDPADAQGRIKEDVQLLNKLTYTTAV IKETLRLFPPSASMREGRPDAEIIGEDGQRYPTVGCNVWTLTVALHHNSDHWQQVESF IPERWLVGPEDPLYPVKGAWRAFEFGPRSCIGQTLAMLELRIALAMTIRQFDITPAYD EWDSIHPATTAREVNGHRAYQAERGAGGAHPADGFPCRVKERC* CYP59A7 Aspergillus ochraceoroseus strain SRRC 1432 AY092402.3 join(9134..9615,9669..10671) gene StcS/AflN in aflatoxin/sterigmatocystin biosynthesis 76% to CYP59A5 Aspergillus flavus MPEFKLLAGHFAALKQTIQGMPPNATLHSIMLKLSQQFPSGIFY INLWPFSGTWMIVATMTAASQIQSLNLTKPAILRRPLETITGGPSLISMHGETWKRWR ALFNPGFNPAYLIGLAPNIADEVAVFRNQLRKKAQQGDIFSLEPLTLRLTVDTICSVA LDARLFHQQKDHPLAVALQRQIEWASFGTTFNPVKRYLTIRPLVLWYNSRLMNRLIGK EVDRVHNARPGYSSKSVISLALKQYLKEQSNGDTRQSLEAFKRQVAPQLRVFLFAGRD TTSSTLLYTFYLLAKHPEILERVRREHDEVFGTDIKKAQSCIAEDPQLLNKLPYTLAV IKETLRLFPPSASMREGRPDVEIVGDDGRRYPTAGCNVWTLTVALHHNSNHWPEVESF IPERWLVGHEDPLYPAKGAWRAFEFGPHSCIGQTLALLELRIALAMTLREFNITPAYD EWDRLHPRDTIKEVNGHRAYQMEKGGGGAHPADGLPCRVTLRTT* CYP59B1X Fusarium graminearum (Gibberella zeae PH-1) GenEMBL AACM01000117.1 52362-54017 (+) strand, one intron renamed CYP5071A1 CYP59C1 Aspergillus nidulans AN5766.2 44% to 59A1 586clan? Revised 7/18/07 MAVTNLVLTLSIPLLTVIGYGLYCGFQHRRKINELRKRGIPMPKDWSWLT GHLLSLKKYTDRLPADAHVLLPTHELAVEFADTEMFLMDTWPVFPALVMV YDPDAALQISTKYNLPKSAIFPSLMHPITGGPSMISMNDAEWKKWRSIFN PGFSAGNMVDQVSTVVDSVQVFCDILREKAGTGLVHLDDLTTRLTMEVIL KVTLDMDSNYQRSDNEMVHALNTITKWHSFWDPRVRANPLRPLVQKYYGR VMDRCIRKELDKRFAEMQQGQRSASTSKRVKSVIALALEAYLADYHQKDG KASKLDDNFAQYATHQIRLFLFAGNDTTSSSIVYVYHMLSKHPKALARVR QEHDRIFGTLPSAAPQLLKSNPALLNQCPYTLAVIKETLRLFPPASTTRE GRDGVTLTDRLGNSYPLGHTIGAEIIHPTIHKNPRLWPQAEEFIPERWLV DPGHELYPSPAAWRPFEHGPRNCIGQTLVYNEMRIVIVMTARTFNIRPAY DEWDAMHAAKEGLL (0) TVHGERAYQTEKAGTHPADGYPCHVALCSGSPIS* CYP59C2 Aspergillus niger e_gw1.2.1390.1|Aspni1 63% to CYP59C1 see Fungal pages for seq CYP59D1 Aspergillus oryzae GenEMBL BAE66084.1 AP007175.1 55% TO 59D2, 41% to 59A1, 41% to 59C1 470087 MVIAVSDLVGSYGARVALVLALVIVLRFLQEMLKVRLLFYRLRKQ GLPMPKWNFAAGNLQMLPDLMKRHP KGSQQSEAFTLLSYEFASSDNCFYIDVWPFTKPLLVVNSPDLAVQACQTYALPKPPVLAKFFNPFAGGPS IFTTNGPEWKRNRGLFNPAFSTSNILQHTPHI 470671 VEEAEEYVEILREHARKGDTFTLDKMTCDYVLDIIGRVAM (2) KARLHSQRGRNPVAAALRSQVEWHCQDEQMNPFIRWNPMRPIMQWCNGRTM NQYIGAELDKRYEAWTQNKPSTRANSIIDIVLAEYMSTRPVRAALDPEFKSWAT IQLRTFLFAGHDSTAATIVYSIYLLSKHPEILSKVRTEHDEVFGSDISAAAGILKQHPEL INRLPYTLAVIKETLRLFPAASALREGQPGVYLQDKNGTKYPTEGLCIWIIHGGIQRNPN YWPDPHAFKPERWLVGPDHPLYPPKGAWRPFEQGLRDCIGQALALLDVKITLVLTLREFD FQDQYAHWDRLHPRSGPKTVFGERAYQIPQGGSHPVDGLPCRVSLRNQITK* CYP59D1 Aspergillus flavus AFL2G_11439 99% to CYP59D1 Aspergillus oryzae see Fungal pages for seq CYP59D2 Aspergillus oryzae GenEMBL BAE58153.1, AP007155.1 37% to 59A1, 37% to 59C1 2825205 MGANTLGWDGLTSRIAVAVATVCLTSFVYKLIKMRLMFYRLRKKGL (0) 2825068 2825008 PTPPWNPILGNLAVMAQLQKKWPSDSREAESFALLSTEAPGCEAGFYVDVWPFSIP MLVVTSPALAVQACQTYDLPKPDVLQPFINPMAGGSDNLFVSNGAHWKQARELFNHGFSMAAAMSHMTYI LEEAQVFVQMLKDHARKGDTFSLDALTCRYVMDIIGNVA LNTRFRFQEQHNPIAAAMRDTIELECGIETS NFLSRWNPRRLYRQWQNGRTMDYLIGVELDKRYKEWRETAKSSSHPRTQSIMDMVIAEYMKTRPQAQQQQ ELDPEFKRWATIQIRLFLFVGHDSEATTIIYSLYLLSKNPGVLIKVRAEHDRVFGAGVSSAYDVLTDHPE KINQLSYTHAVIKETLRLFPPANGLRGGLPGVSLRDEQGRIFPTEGCAIWIVHTAVHRNPSSWPQPHAFI PDRWLVEPGHPLYPPAGGWRPFEQGPRNCIGQNISLLGIKASLAMLVRQFDFHDAYAEYDRLHPSTGLKT MFGERAYMIQKGAGHPAQGFPCKVTLR CYP59D2 Aspergillus flavus 99% to CYP59D2 Aspergillus oryzae see Fungal pages for seq CYP59D3 Aspergillus niger e_gw1.12.714.1|Aspni1 63% to CYP59D2 see Fungal pages for seq CYP59E1 Mycosphaerella graminicola Mgr034 39% to 59C1, 9/10 top hits are CYP59, but only 34% to 59D3 N-term is short see Fungal pages for seq CYP59E2 Mycosphaerella fijiensis e_gw1.7.674.1 61% to CYP59E1 Mycosphaerella graminicola see Fungal pages for seq 60A Subfamily CYP60A1 Aspergillus parasiticus GenEMBL L40839 (1266bp) L41149 (1284bp) Yu,J., Chang,P.-k., Cary,J.W., Wright,M., Bhatnagar,D., Cleveland,T.E., Payne,G.A. and Linz,J.E. Comparative mapping of aflatoxin pathway gene clusters in Aspergillus parasiticus and aspergillus flavus. Applied Environ. Microbiol. 61, 2365-2371 (1995) gene ord1 missing N-terminal (approximately 93 amino acids missing) CYP60A2 Aspergillus nidulans GenEMBL U34740 (507 amino acids) Brown, D.W., Keller, N.P. and Adam, T.H. Proc. Natl. Acad. Sci. USA 93, 1418-1422 (1996) gene stcF CYP60A2 Aspergillus nidulans U34740 AN7818.1 53 clan, note the CYP60 family falls inside the much larger CYP65 family that has grown stcF revised 7/19/07 MILPLILVLYLLSTAAYRLWLHPVRNYPGPCWWAVWRVPYLKGT IRGTIVRDIQRLHNQYGPVVRIAPDELSYITPEAAKPIYTSSPEFPKDPMLLPPFHNG APGILAADYAHHRRYRRLLASAFSEKGLRAQQGMIQSHIDRLMTRLQGNCSSGSLDMT VWFNWATFDIIGDLAFGEPFGCLERMETNPWIASIQGNVKSIPILNALRRYRLDRLIE FLAPPRLLEMRRRNAQFTAEKVDRRLKHATTTRGDLWDSVLADPPDGEPPMSRAEMVS NASAIVLAGSETSATTLSGCLWLLLTNPEYLQQLTERIRARFSTATVIDAQTVTQIQG LQAVLDESLRLYPAVPMQSNRIVPPPGARLAGSWVPGGTSVAVQQFAACRSPTNFHRP DEFIPERWEKEGEFINDRREASQPFSIGPRNCIGRQLALAEMRLILVHLLWHFDIELD RRRMENMDWMAVQGI WILWDKKPLWVVLKNRST* CYP60A3 Aspergillus parasiticus GenEMBL U62774 Yu,J., Chang,P.K., Cary,J.W., Bhatnagar,D. and Cleveland,T.E. avnA, a gene encoding a cytochrome P-450 monooxygenase, is involved in the conversion of averantin to averufin in aflatoxin biosynthesis in Aspergillus parasiticus. Appl. Environ. Microbiol. 63, 1349-1356 (1997) CYP60A3 Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 complement(join(46528..46863,46931..48025,48072..48128)) gene="aflG" function="conversion of averantin to 5'-hydroxyaverantin 70% to CYP60A2 note CYP60A1 and CYP60A3 are essentially the same sequence the CYP60A1 sequence is incomplete and has 4 amino acid differences. CYP60A4v1X Aspergillus flavus strain SRRC 141 GenEMBL AF106960 Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and Payne,G.A. Molecular characterization of an aflatoxin B2 producing mutant strain of Aspergillus flavus Unpublished verB gene 85% identical to 60B2 note: error in name assignment, this gene is CYP60B4v1 MYFLSLPALAIVIPVGYVLFQLGYNLFFHPLRGYPGPLLWRASS LPWKIALLRGTMHHDLMRYHQKYGDTVRIKPDEISYANAQAWRDIHAHVPGRPEFLKD PVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGLRTQESTIVQYADLLVEVLRE VANTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDQGDFWDKVLVKSADDN QRGDGMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKIKKATAEIRKSFASPE DIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPAGGATVCGRFVPEETRVGVSHL GAYFADYNFTHADKFIPERHLQKTEEPYKYDNYGAYQPWSVGLRNCIGRNLAYAEVRL TLAKLLWHFDFTLDVGKTGNFLDQKIWSIWAKRELYMFMKTRGTSSSSPQ CYP60A4v2X Aspergillus flavus strain SRRC 1007 GenEMBL AF106959 Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and Payne,G.A. Molecular characterization of an aflatoxin B2 producing mutant strain of Aspergillus flavus Unpublished verB gene 99% identical to 60A4v1 note: error in name assignment, this gene is CYP60B4v2 CYP60A4v3X Aspergillus parasiticus strain RH1 GenEMBL AF106958 Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and Payne,G.A. Molecular characterization of an aflatoxin B2 producing mutant strain of Aspergillus flavus Unpublished verB gene 97% identical to 60A4v1 note: error in name assignment, this gene is CYP60B4v3 CYP60A4v3X Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 complement(join(48496..48846,48901..50052)) gene="aflL" function="conversion of versicolorin B to versicolorin A" note: error in name assignment, this gene is CYP60B4v3 CYP60A5 Aspergillus oryzae GenEMBL BAE71325.1, BAC45242.1, BAE59521.1 95% to CYP60A3 MGGDGWPSDGHILLLIVLTVLTPPSLALYRLWIHPLRSYPGPRWWAIWRGPYILSNIRGNLVRDLQRLHQ QFGHVVRIAPNELSFIVPEAASPIYTSNPEFPKDPMHLPPFHNGTPGILAADHAHHRRYRRLLAFSFSDK GLRQQRSLIERSVNLLITRLHENCGQGPLDLTLWFNWATFDIIGDLAFGNSFGCLDNVQTHPWISSIQGN VKLIPILNAFRRYRLDGLLQLLGSRKLLEQRRRNAQFTTDQVDRRLKNSSTPRGDIWDAVLAQKPDGEPP MSREEMISNTSAIVLAGSETSATLLSGCTWLLLKNPGHLHQLTSRIRSQFTHASEIDSQSVSRVEGLQAI LEESLRLYPPVPMQSNRIVPQSGAYIAGGWVPGGTSVGLQQFVACRSSSNFHRPEEFLPERWQGQGEFAH DRREVSQPFSIGPRNCIGRQLAYVETRLILVKLLWHFDLRLDTTRMKDTDWLAEQGIWILWDKNPLWVNL EPRNE CYP60A5 Aspergillus flavus AFL2G_07217 revised 98% to CYP60A5 Aspergillus oryzae Bottom HALF (two adjacent P450s in this gene model CYP60B3 and CYP60A5) see Fungal pages for seq CYP60A6 Aspergillus nomius isolate AN13137 AY510454.1 complement(join(51845..52180,52258..53352,53400..53456)) 92% to CYP60A3 gene AvnA of aflatoxin biosynthesis MGVDGWLSDGHFRLLILLIVLTPPSLAVYRLWIHPLRSYPGPRW WAIWRGPYILSNTRGSLVRDLQRLHQQFGPVVRIAPNELSFIAPEAAAPIYTSNPEFS KDPMHLPPFHNGTPGILAADHAHHRRYRRLLAFSFSDKGLRQQRGLIERSVNLLITQF HENCGQGPLDLTLWFNWATFDIIGDLAFGDSFGCLDNVQTHPWIASIQGNVKLIPILN GLRRYRLDGLLRLLGSRKLLEQRRRNAQFTTDQVDRRLQNSSTPRGDIWDAVLAQKLD GEPPMSRAEMISNASAIVLAGSETSATLLSGCTWLLLKNPEHLHQLTSRIRSEFSHAS EIDSQSVSRVEGLQAVLEESLRLYPPVPMQSNRIVPSTGAHIAGGWVPGGTSVGLQQF VACRSPSNFHRPDEFLPERWQGQGEFAHDRREVSQPFSIGPRNCIGRQLAYVEMRLIL VKLLWHFDLRLDTTRMKGTDWLAEQGIWILWDKKPLWVTLEPRDK CYP60A7 Aspergillus ochraceoroseus strain SRRC 1432 AY092402.3 complement(join(29656..30030,30172..31266,31338..31385)) gene AflG in aflatoxin/sterigmatocystin biosynthesis 79% to CYP60A3 Aspergillus parasiticus avnA gene model revised at one intron boundary MDVLRSFPLQLLGLRDFLILGSLGAYRLWLHPLRSYPGPRWWAL WRVPYLQSTIRGTIVRDLQRLHEQYGPVVRIAPDELSFINPEAAKPIYTSNPEFSKDP MHLPPFHNGVPGILAADHAHHRRYRRLLAFSFSDKGLKAQQGMIQQYIDQLIVRLHEN CSTGPVDMTLWYNWATFDIIGDLAFGESFGCLEKVETHPWIASIQGNVKAIPILNALR RYRLSGVLGLLAPRNLLEMRRRNAQFTADKVDRRLQHPGTARGDLWDSVLADNPNGES PMSRDEMVSNASAIVLAGSETSATLLSGCTWLLLKNPEYLNTLTERVRSQFARAADID AQTVTQVEGLQAVLEESLRLYPPVPMQSNRIVPPQGAQIAGMWVPGG (0) TSVAVQQFVACRSADNFHRPNEFLPQRWQGQGEFANDRREASQPFSIGPRNCIGRQLAYAEMRL ILVHVLWHFNLRLDQGRMKDADWLAEQGIWILWDKNHCGLCWNRGREKVEEDDAT* 60B Subfamily CYP60B1 Aspergillus nidulans GenEMBL U34740 (501 amino acids) AN7813.2, revised 7/18/07 Brown, D.W., Keller, N.P. and Adam, T.H. Proc. Natl. Acad. Sci. USA 93, 1418-1422 (1996) gene stcL MAFLSLPILTALGA VVYVLFQLVYNLYFHPLRDYPGPLLW RASSLPWKLTLLRGTMHHDLMRHHQTYGDTVRIKPDEISYANGQAWRDIHAHVPGRPE FLKDPVRLPLAPNGVMSILVSDTRNHARFRSLFGHAFSDKGLRAQEPTIARYADLLVE VLREVADTGKSVEMVRYFNMAIFDSIGALSFGESFDSLRNRELHPWVDTIHKNLKSVA ISHVLRSMGVEFLAPYLMPAELRGKRQENYTYAIEKLKKRMQKTGDQGDFWDRVIVKS ADGNQSGDGMSYGEMINNAAVMVVAGSETTSSALCGCTYLLCKFDKMDKAVAEVRGAF AAADQIDLVSVSRLPYLTAVIDETLRMYPSVPGQPPRVVPEGGAIVCGRFVPAETRVG VSHLGAYYAPYNFSHADKFIPERHLAGAKLEEPFRHDNYAAYQPWSVGVRNCIGRNLA YAEVRLTLAKLLWHFDISLDEERTGNFLDQKIWSIWAKRELYLEIRTREF CYP60B2 Aspergillus ochraceoroseus strain SRRC 1432 GenEMBL AY092402.3 complement(join(46781..47113,47181..48332)) Cary,J.W., Beltz,S.B., Bennett,C.A. and Klich,M.A. Molecular Characterization of the Aflatoxin Biosynthetic Cluster from Aspergillus ochraceoroseus. Unpublished function="desaturation of aflatoxin precursor VER B to VER A 84% to CYP60B1 Aspergillus nidulans StcL probable ortholog of CYP60B1 gene AflL/StcL MALPTLPVLAVLIGASYILVQLVYNLYFHPLRDYPGPLLWRASS LPWKFTLLRGTMHHDLMRFHQKYGDTVRIKPDEISFANAQAWRDIHAHVPGRPEFLKD PVRLPLAPIGVMSILVSVTRFLARFRSVFGLAFSVKGLRAQEPTIVQYADLLVEVLRE VADTGKSVEMVHYFNMAIFDSIGALSFGESFDSLKNRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYTYAIEKLKRRMQMTGDQGDFWDRVIVKSADGN QSGDGMSPGEMLNNAAVMVVAGSETTSSALCGTTYLLCQSGKMEKAVAEIRNAFPTPD KIDLVTVSHLPYLTAVIDETLRMYPAVPGQPPRVVPAGGATVCGKFLPAETRVGVSHL GAYYADYNFTRPEKFIPERHLQKMEEPFKHDNYGAYQPWSVGVRNCIGRNLAYAEVRL TLAKLLWHFDITLDEAKTGNFLDQKIWSIWANGELYMSFKARKV CYP60B3 Aspergillus oryzae GenEMBL BAE71326.1, BAC45243.1, BAE59522.1 81% TO 60B1 probable ortholog of CYP60B1 MYFLSLPALAIIVPVGYVLLHLGYNLFFHPLRGYPGPLLWRASSLPWKIALLRGTMHHDLMRFHQKYGDT VRVKPDEISYANAQAWRDIHAHVPGRPEFLKDPVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGL RTQESTIVQYADLLVEVLREMANTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNL KSVAISHVLRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDLGDFWDKVLVKSADDNQRGD GMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKIEKATAEIRKSFASPEDIDLISVSHLPYLAAV IDETLRMYPAVPGQPPRVVPAGGATVCGRFVPEETRVGVSHLGAYFADYNFTHADKFIPERHLQKTEEPY KYDNYGAYQPWSVGLRNCIGRNLAYAEVRLTLAKLLWHFDFTLDVGKTGNFLDQKIWSIWAKRELYMFIK TRGTSSSSPQ CYP60B3v1 Aspergillus flavus AFL2G_07217 revised 97% to CYP60B3 Aspergillus oryzae TOP HALF (two P450s in this gene model CYP60B3 and CYP60A5 see Fungal pages for seq CYP60B3v2 Aspergillus flavus strain SRRC 141 GenEMBL AF106960 Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and Payne,G.A. Molecular characterization of an aflatoxin B2 producing mutant strain of Aspergillus flavus Unpublished verB gene 85% identical to 60B2 97% to 60B3v1 ortholog 99% to CYP60B3v3 Aspergillus flavus strain SRRC 1007 formerly CYP60B4v1 MYFLSLPALAIVIPVGYVLFQLGYNLFFHPLRGYPGPLLWRASS LPWKIALLRGTMHHDLMRYHQKYGDTVRIKPDEISYANAQAWRDIHAHVPGRPEFLKD PVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGLRTQESTIVQYADLLVEVLRE VANTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDQGDFWDKVLVKSADDN QRGDGMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKIKKATAEIRKSFASPE DIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPAGGATVCGRFVPEETRVGVSHL GAYFADYNFTHADKFIPERHLQKTEEPYKYDNYGAYQPWSVGLRNCIGRNLAYAEVRL TLAKLLWHFDFTLDVGKTGNFLDQKIWSIWAKRELYMFMKTRGTSSSSPQ CYP60B3v3 Aspergillus flavus strain SRRC 1007 GenEMBL AF106959 formerly CYP60B4v2 MYFLSLPALAIVIPVGYVLFQLGYNLFFHPLRGYPGPLLWRASS LPWKIALLRGTMHHDLMRYHQKYGDTVRIKPDEISYANAQAWRDIHAHVPGRPEFLKD PVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGLRTQESTIVQYADLLVEVLRE VANTGCSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDQGDFWDKVLVKSADDN QRGDGMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKVEKATGEIRKSFASPE DIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPAGGATVCGRFVPEETRVGVSHL GAYFADYNFTHADKFIPERHLQKTEEPYKYDNYGAYQPWSVGLRNCIGRNLAYAEVRL TLAKLLWHFDFTLDVGKTGNFLDQKIWSIWAKRELYMFIKTRGTSSSSPQ CYP60B3 Aspergillus parasiticus strain RH1 GenEMBL AF106958 formerly CYP60B4v3 MYFLSLPSLVIVIPVGYLLFHLGYNLFFHPLRGYPGPLLWRASS LPWKIALLRGTMHHDLMRFHQKYGDTVRIKPDEISYANAQAWRDIHAHVPGRPEFLKD PVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGLRTQESTIVQYADLLVEVLRE VADTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDQGDFWDKVLVKSADDN QRGDGMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKIEKATAEIRKSFASPE DIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPASGATVCGRFVPEETRVGVSHL ATYFADYNFTHADKFIPERHLQKTEEPFKYDNYGAYQPWSVGLRNCIGRNLAYAEVRL TLAKLLWHFDFTLDVDKTGNFLDQKIWSIWAKRELYMFIKTRGTSSSSPQ CYP60B3 Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 formerly CYP60B4v3 MYFLSLPSLVIVIPVGYLLFHLGYNLFFHPLRGYPGPLLWRASS LPWKIALLRGTMHHDLMRFHQKYGDTVRIKPDEISYANAQAWRDIHAHVPGRPEFLKD PVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGLRTQESTIVQYADLLVEVLRE VADTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDQGDFWDKVLVKSADDN QRGDGMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKIEKATAEIRKSFASPE DIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPASGATVCGRFVPEETRVGVSHL ATYFADYNFTHADKFIPERHLQKTEEPFKYDNYGAYQPWSVGLRNCIGRNLAYAEVRL TLAKLLWHFDFTLDVDKTGNFLDQKIWSIWAKRELYMFIKTRGTSSSSPQ CYP60B3 Aspergillus nomius AY510454 complement(join(53898..54266,54322..55473)) gene VerB of aflatoxin biosynthesis 94% to CYP60B3 Aspergillus flavus MYFLSLSVLALVTLVSYVLFHLGYNLFLHPLRGYPGPLLWRASS LPWKIALLRGTMHHDLMRYHQKYGDTVRIKPDEISYANAEAWRDIHAHVPGRPEFLKD PVRLPLAPNGVMSILVSDTRNHARFRSLFGHAFSDKGLRTQESTILQYADLLVEVLRE VADTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMRMEGDQGDFWDKVLVKSADDN QRGDGMSAGEMLNNAAVMVVAGSETTASALSGSMYLLCLSGKIEKATAEIRKSFASPE EIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPAGGATVCGKFVPEETRVGVSHL GAYFADYNFTHADKFIPERHLQKTEEPFKYDNYGAYQPWSVGIRNCIGRNLAYAEVRL TLAKILWHFDFTLDVDKTGNFLDQKIWSIWAKRELYMTLKSRDTSCSWPSTSSSPQ* CYP60B4v1X Aspergillus flavus strain SRRC 141 GenEMBL AF106960 Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and Payne,G.A. Molecular characterization of an aflatoxin B2 producing mutant strain of Aspergillus flavus Unpublished verB gene 85% identical to 60B2 note: error in name assignment, this gene was erroneously called CYP60A4v1 probable ortholog of CYP60B1 note: Asp. oryzae and Asp. flavus are very closely related species This seq is 97% identical to CYP60B3 Their P450 genes are nearly identical So this is the ortholog/ same gene as 60B3v2 MYFLSLPALAIVIPVGYVLFQLGYNLFFHPLRGYPGPLLWRASS LPWKIALLRGTMHHDLMRYHQKYGDTVRIKPDEISYANAQAWRDIHAHVPGRPEFLKD PVRLPLAPNGVMSILVSDTKNHARFRSLFGHAFSDKGLRTQESTIVQYADLLVEVLRE VANTGRSAEMVYYFNMAIFDSIGALSFGESFDSLKSRQLHPWVDAIHKNLKSVAISHV LRSMGIEFLTPYVLPKELRGKRQENYSYAVEKLNKRMKMEGDQGDFWDKVLVKSADDN QRGDGMSAGEMLNNAAVMVVAGSETTASALSGAMYLLCLSGKIKKATAEIRKSFASPE DIDLISVSHLPYLTAVIDETLRMYPAVPGQPPRVVPAGGATVCGRFVPEETRVGVSHL GAYFADYNFTHADKFIPERHLQKTEEPYKYDNYGAYQPWSVGLRNCIGRNLAYAEVRL TLAKLLWHFDFTLDVGKTGNFLDQKIWSIWAKRELYMFMKTRGTSSSSPQ CYP60B4v2X Aspergillus flavus strain SRRC 1007 GenEMBL AF106959 Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and Payne,G.A. Molecular characterization of an aflatoxin B2 producing mutant strain of Aspergillus flavus Unpublished verB gene 99% identical to 60B4v1 note: error in name assignment, this gene was erroneously called CYP60A4v2 probable ortholog of CYP60B1 renamed CYP60B3v3 CYP60B4v3X Aspergillus parasiticus strain RH1 GenEMBL AF106958 Bhatnagar,D., Cary,J.W., Ehrlich,K.C., Cleveland,T.E. and Payne,G.A. Molecular characterization of an aflatoxin B2 producing mutant strain of Aspergillus flavus Unpublished verB gene 97% identical to 60B4v1 note: error in name assignment, this gene was erroneously called CYP60A4v3 probable ortholog of CYP60B1 renamed CYP60B3 CYP60B4v3X Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 complement(join(48496..48846,48901..50052)) gene="aflL" function="conversion of versicolorin B to versicolorin A" note: error in name assignment, this gene was erroneously called CYP60A4v4 probable ortholog of CYP60B1 renamed CYP60B3 61A Subfamily Note: all fungi have CYP61 a C-22 sterol desaturase in the ergosterol biosynthesis pathway. These will just be called CYP61 without subfamilies assigned. A few older names were 61A1-61A5. The sequential numbers may not be needed, except in rare cases where there are more than one CYP61 in a single species. CYP61A1 Saccharomyces cerevisiae (yeast) GenEMBL Z49211 (30,011bp) PIR S54015 Lye,G. and Churcher,C.M. Unpublished 1995 Barrell,B., Rajandream,M.A. and Walsh,S.V. S.cerevisiae chromosome XIII cosmid 9711 complement of 3155-4771. CYP61A1 Saccharomyces paradoxus 98% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Saccharomyces mikatae 97% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Saccharomyces castellii 83% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Saccharomyces bayanus 93% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Saccharomyces kluyveri 79% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Kluyveromyces waltii 82% to CYP61A1 S. cerevisiae C-term part only See the fungal pages for seq CYP61A1 Yarrowia lipolytica GenPept CAG84120.1 64% to CYP61A1 Saccharomyces paradoxus See the fungal pages for seq CYP61A1 Ashbya gossypii ATCC 10895 NP_982999.1 75% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Pichia stipitis JGI model estExt_genewise1_human.C_chr_6.10410 82% to CYP61A1 Debaryomyces hansenii See the fungal pages for seq CYP61A1 Debaryomyces hansenii CAG86871.1 80% to CYP61A1 Candida lusitaniae See the fungal pages for seq CYP61A1 Lodderomyces elongisporus 84% to CYP61A2 C.albicans CYP61A2/61A1 C.albicans GenEMBL AL033396 comp(39932..41485) gene="Ca35A5.10c" 68.9% identical to 61A1 note: most fungi have onlt one CYP61 gene so this is probably the ortholog of CYP61A1 but for historical reasons it was named CYP61A2 CYP61A1 Candida guilliermondii 79% to CYP61A1 Candida dublinensis See the fungal pages for seq CYP61A1 Candida parapsilosis 79% to CYP61A2 C.albicans See the fungal pages for seq CYP61A1 Candida lusitaniae 78% to CYP61A2 C.albicans See the fungal pages for seq CYP61A1 Candida dublinensis 96% to CYP61A2 C.albicans See the fungal pages for seq CYP61A1 Candida glabrata CBS138 GenPept CAG62657.1 81% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Kluyveromyces polysporus 86% to CYP61A1 S. cerevisiae See the fungal pages for seq CYP61A1 Kluyveromyces lactis 80% to CYP61A1 Saccharomyces kluyveri See the fungal pages for seq CYP61A1 Uncinocarpus reesii 75% to CYP61A1 Aspergillus fumigatus See the fungal pages for seq CYP61A1 Histoplasma capsulatum G217B 77% to CYP61A1 Uncinocarpus reesii ABBT01000205.1 Ajellomyces capsulatus G217B 1837477-1839379 (+) ABBT01000210.1 Ajellomyces capsulatus G217B 225529- 227431 (+) Two identical sequences. Note 51F2 also has two identical sequences One of them is also on ABBT01000205.1 and the other is on ABBT01000207.1 See the fungal pages for seq CYP61A1 Coccidioides immitis 90% to CYP61A1 Uncinocarpus reesii See the fungal pages for seq CYP61A1 Fusarium graminearum FG01959.1 AACM01000104 FGcontig1.104_scaffold1 See the fungal pages for seq CYP61A1 Nectria haematococca (Fusarium solani group) 88% to CYP61A1 Fusarium graminearum See the fungal pages for seq CYP61A1 Magnaporthe grisea See the fungal pages for seq CYP61A1 Ustilago maydis GenEMBL XM_397965.1 in the 61 clan 49% to CYP61 N.crassa See the fungal pages for seq CYP61A1 Sporobolomyces roseus JGI gene model estExt_fgenesh1_pm.C_40050 57% to CYP61A1 Ustilago maydis, See the fungal pages for seq CYP61A1 Malassezia globosa 74% to CYP61A1 Ustilago maydis See the fungal pages for seq CYP61A1 Aspergillus nidulans AN4042.1 66% to 61A1 51 clan revised 7/19/07 MAEINGSFVSPAADATVFPQVFQPAGLIADFLNGLTLWKTLATLFALAVV YDQFRYIYLKGAIVGPAWKLPFMGPFLQSVNPKFHEYKAKWDSGELSCVS VFHKFVVIASTRDMSRKIFNSPTYVKPCVVDAAHKLLGKTNWVFLDGKEH VDFRKGLNNLFTRQALSCYLPRMEEVYNDYYARFLKKSKNNNYKPTPWMP EFRDLMCAVSCRTFVGHYISDEAIDKISVDYYNITAALELVNFPIILPFT KTWYGKKAADMVLDEFAKCAAKSRARMAAGGEISCIMDAWIKAQLDSAKY REKIAKGIEVDSSEKPPQVLRDFTDYEVSQTIFTFLFASQDATSSACTWL FQLMADRPEILDKVREENLRLRNGDVNAPLTMDLLDSMTYTRAVVKETLR YRPPVIMVPYIAKKDFPITDKITVAKGSMIIPSVYPATRDEEAYPNADSF DPDRWITGTAEQHPKNFLIFGTGPHYCLGQTYAVLNLMAMIGKASMEMDW VHTPTPQSEEIKVFATIFP (0) DDCLLTFRPRA CYP61A1 Aspergillus oryzae GenEMBL AP007155.1, BAE58068.1 MANVTGSFVSPSADATVVPQLFQPSGLLGSLLGDFNVWKGLLTLFIAAVIYDQ (0?) FRYFYQKGSIVGPRWKLPFMGPFLQSVNPKFHEYKAKWDSGELSCVSVFHK (2?) FVVIASTRDMSRKIFNSPAYVKPCVVDSAHKLLGEDNWVFLDGKDHVEFRKGLNGL FTRSALSCYLPRQEETFNQYFKHFLEKSKANGYKPTPWMPEFRELMTAISCRTFVGHY MTDEVIQKINDDYYLITAALELVNFPIILPFTKTWYGKKAADMVMEEFAKCAAKSKAR MAAGGEVSCIMDAWVKAQQVSAKYREDVAKGIPAEKPPQLLRDFTDEEIAKTVFTFLF ASQDATSAASTWLFQLMADRPEVLEKVREENVRLRNGDINAPITMELLDQMEYTRAVV KETLRYRPPVIMVPYLVKKDFPITEKITVLKGSMIIPSVWPATHDEEAYPNPDTFDPD RWITGTAEQNPKNWLVFGTGPHYCLGQTYAQLNLMAMIGKASMEMTWEHTTTPKSEDI KVFATIFPQ (0?) DDCLLTFRPRA CYP61A1 Aspergillus flavus 100% to CYP61A1 Aspergillus oryzae CYP61A1 Aspergillus fumigatus Af293 GenEMBL XP_750145.1 cytochrome P450 sterol C-22 desaturase 81% to 61A1 Aspergillus oryzae MANVNGSFVSPSADATISPQLFYNVDSLSAVLNGFTFWKALATLFFAAVIYDQLRYFYLKGSLVGPTFKL PFMGPFLQSVNPKFHEYKAKWDSGELSCVSVFHKFVVIASTRDMSRKIFNSPAYVKPCVVDIAHKLLGPD NWVFLDGKEHVEFRKGLNGLFTRSALSSYLPVMEECYNKYYKYFLEKSKANDYKPEPWMPEFRELMCAVS CRTFVGHYMTDAAIKKIADDYYMITAALELVNFPFILPFTKAWYGKKASDMVLEEFSNCAAKSKAHMAAG GEITCIMDAWVKAQQDSAKYNEKIAKGLPVEDSEKPSHLLREFTDYEIAQTVFTLLFASQDATSAACTWL FQLVADRPDVLEKIREENLRVRNGNINAPLTMDLLDEMKYTRAVVRETLRYRPPVIMVPYLVKKDFPITD SITVSKGSMIIPSVWPATHDPEAYPNPDSFDPDRWITGDAEKQAKNFLVFGTGPHYCLGQTYAQLNLIAM IGKASLEMDWEHAPTPKSEDIKVFATIFPEV CYP61A1 Neosartorya fischeri 99% to CYP61A1 Aspergillus fumigatus = ortholog See the fungal pages for seq CYP61A1 Gibberella moniliformis 7600 AAIM02000091.1 95% TO FUSARIUM GRAMINEARUM 61A1 99% to 61A1 F. oxysporum also called Fusarium verticillioides FVEG_07284 MEAVNATSSGFSSVLAGTKYVNVALPPQVEYVIEAVSNAGVWTWVFTFIALCVAYDQ (1) 350695 IAYIIRKGPIEGPAMKLPFIGPFLDSMDPRFDGYHAKWSSGPLSCVSIFHK (2) 350847 350776 FVVIASTRDMARKVFNSPAYVKPTVVDVAPKLLGHDNWVFLDGKAHVDF 351054 351055 RKGLNGLFTRKALESYLPGQEEAYNTYFKHFLKMTKDAGGKPVPFMHEFREVMCAVSCRT 351234 351235 FVGHYISDEAVTKIAEDYYLITAALELVNLPVILPYTKSWYGKKAADMVLAEFSKCAA 351408 351409 KSKVRMAAGGEVTCIMDAWVLSMIQSERWREAEEKGEGHTVEKPAPLLRMFNDYEISQTI 351588 351589 FTFLFASQDATSSAATWLFQVTAQRPDVLDRVREENIKVRNGDPNAPITMDQLESLTYTR 351768 351769 AVVRELLRWRPPVIMVPYVTKKAFPLTENYTVPK (1) GSMLI 351948 351949 PTTFMALHDPEVYDNPSHFDPERYYSGDAEEKGSKNYLVFGTGPHYCLGQVYAQLNLALM 352128 352129 IGKASVMLDWKHHATPKSEEIKVFATIFPM 352218 DDCPLTFEERKW* CYP61A1 Fusarium oxysporum 99% to 61A1 Gibberella moniliformis See the fungal pages for seq CYP61A1 Aspergillus clavatus NRRL NZ_AAKD03000006.1 87% TO ASPERGILLUS FUMIGATUS CYP61A1, 52% TO CYP61A6 A. clavatus 468671 MANINGSFVSPSADATLSPQFFQSAEFLPAILNGFTIWKALVTLFIAAVIYDQ (1) 468513 LRYFYLKGSLVGPTFKLPFMGPFLQSVNPK 468315 468314 FHEYKAKWDSGELSCVSVFHK (2) FVVIASTR 468135 468134 DMSRKIFNSPAYVKPCVVDVAHKLLGADNWVFLDGKDHVEFRKGLNGLFTRSALSCYLPE 467955 467954 MEECYNQYYKRFLKKSKANQYKPEPWMPEFRELMCAVSCRTFVGHYITDEAVQKIADDYY 467775 467774 MITAALELVNFPFILPFTKAWYGKKASDMVLQEFSNCAAKSKARMAAGGEISCIMDAWVK 467595 467594 SQLDSAKYREKIANGVPPEEAGKPSHLLRDFTDYEIAQTIFTFLFASQDATSAACTWLFQ 467415 467414 LMADRPEILDKVREENVRVRNGDITAPLSMDLLDQMTYTRAVVKETLRYRPPVIMVPYLV 467235 467234 KKDFPITDSVTVAKGSMIIPSVWPATHDPEAYPNPDSFDPDRWITGDADKHPKNWLVFGT 467055 467054 GPHYCLGQTYAQLNLMAMIGKASMEMDWVHTPTPESEDIKVFATIFPQ 466911 DDCHLTFRPRA* CYP61A1 Aspergillus niger JGI gene model estExt_GeneWisePlus.C_12230|Aspni1 84% to CYP61A1 See the fungal pages for seq CYP61A1 Aspergillus terreus 87% TO CYP61A1 Aspergillus niger 70% to CYP61A1 Aspergillus terreus 55% to CYP61A6 Aspergillus terreus ATEG_03815.1 MATVNGSFVSPSADATIVPQLFAPSGLIGSVLEGFTVWKALLTLFLAAVIYDQLRYFWLKGSIVGPAMKLPFMGPFLQSV NPKFHEYKAKWDSGELSCVSVFHKFVVIASTRDMSRKIFNSPAYVKPCVVDAAHKLLGADNWVFLDGRDHVDFRKGLNGL FTRQALSCYLPRMEEVYNDYYARFLKKSKDVNYKPVPWMPEFRELMCAVSCRTFVGHYITDEAVQKIADDYYLITAALEL VNFPIILPYTKTWYGKKAADMVLEEFSKCAAKSKARMAAGGDISCIMDAWVKQQLDSARYRENVAKGVPVDENEKPAQVL RDFTDYEIAQTIFTFLFASQDATSAACTWLFQLMADRPEVLDKVREENLRVRNGDVNAPLTMDVLESLTYTRAVVKETLR YRPPVIMVPYLVKKDFPVTDKITVSKGSMIIPSVWPATHDEEAYPNADSFDPDRWITGTAEQQSKNWLVFGTGPHYCLGQ NYATLNLMAMIGKASMEMNWEHTPTPQSEDIKVFATIFPQDDCLLTFRPRA CYP61A1 Mycosphaerella graminicola 67% to CYP61A1 A. oryzae See the fungal pages for seq CYP61A1 Mycosphaerella fijiensis JGI gene model estExt_Genewise1.C_20563 80% to CYP61A1 Mycosphaerella graminicola See the fungal pages for seq CYP61A1 Chaetomium globosum CBS 148.51 NZ_AAFU01000895.1 AAFU01000895.1 84% TO N. CRASSA 61A1 MAANATTSPLATIKYGAASVAPQLEYVIDYVSNASTLSILATILATLVVYDQ (1) 6856 FRYILNKGSIAGPAWKMPFIGPFLQSMNPKFEEYYAKWLSGPLSCVSVFHK (2) 7008 7135 FVVIASTRDMARKVFNSPAFVKPCVVDVAHKLLGADNWVFLDGKAHVEF 7215 7216 RKGLNGLFTRRALEIYLPGQEEVYNRYFKEFVATTKQAGGEPVPFMTHFREVITAVSCRT 7395 7396 FVGHYITDEAVKKIADDYYLITAALELVNFPIILPYTKTWYGKKAADMVLAEFAKCAA 7569 7570 KSKARMAAGGEPNCIMDAWIIQMIQSQRWRDAQAKGDTEGVEKPTHILRDFTDYEISQTV 7749 7750 FTFLFASQDATSSAATWLFQIMAQRPDVLDRVREENLNVRNGDINAAVNMDQLESMTYTR 7929 7930 AVVRELLRYRPPVLMVPYVVKKPFPITETYTAPK (1) GSMVIPT 8109 8110 TYMALRDPEVYDRPDEFDPERYYTGDAEVKGMKNYLVFGTGPHYCLGQQYAQLNLALMVG 8289 8290 KASLLLDWKHHATPKSEEIKVFATIFPM 8373 DDCPLTFEDRKW* CYP61A1 Phanerochaete chrysosporium (white rot fungus) Scaffold_75 JGI gene model ug.78.18.1 62% to CYP61A1 Usilago maydis the sequence GYVLQYCLVLYAMLTHRR seems to be an insertion and it may need to be removed MASSQAAFPSTLSDSSRHSTDSPAFIGLLPTGSWF YTTAAILLSLLVIEQSVYRYKKRHLPGDKWTIPLIGKF ADSMKPTMEGYMKQWNSGALSAISVFNV RFIVMASTTEYARKILNSPTYAEPCLVHSAKQIILPDNW VFLTGKEHVEYRRGLNLLFTRKAL GYVLQYCLVLYAMLTHRR SLYLGIQDVITRKHFAKW LADAAKDPSAKPIMMTARELNMETSLRVFCGNHIPEHGAKEISDKYWMITVALELVNFPL AIPGTKVYNAIQARKAAMKWLELAARKSKESVAAGNPPECMLEEWVTILNDPAYKGR REFSDHEMAMVVFSFLFASQDAMSSGLIYGFQHLADHPEVLAKVREEQER VRGGDYEKPLTLEMMDEMPYLRAMVKETLRVKPPVTMVPYKTTKAFPISQDYTVPSGSMV IPSFYNSLHDPAVFPDPDRFMPERWLDPNGSANTNPRNYLVFGSGPHKCIGLEYAMMNIA LVLANAAVLMNWEHELTPQSDKVQIIATLFPQDGCKLKFSPRQHA* CYP61A1 Cryptococcus neoformans var. neoformans B-3501A chromosome 6 AAEY01000030 CNBF1100 ESTs gb|CF191501.1|CF191501,gb|CF188802.1| CF188802, gb|CF194425.1|CF194425 EAL20298.1 61% to CYP61A1 Ustilago maydis See the fungal pages for seq CYP61A1 Rhizopus oryzae (Zygomycete fungus, Mucoromycotina) 65% to P450-8 = CYP61 C-22 sterol desaturase 53% to CYP61A3 S. pombe P450-10_var1 See the fungal pages for seq CYP61A3/61A1 S. pombe GenEMBL Z98974 join(10345..10419,10514..12001) gene="SPAC19A8.04" 51% identical to CYP61A1 note: most fungi have onlt one CYP61 gene so this is probably the ortholog of CYP61A1 but for historical reasons it was named CYP61A3 CYP61A1 Schizosaccharomyces japonicus 80% to S. pombe CYP61A3 (misnamed, should be CYP61A1) See the fungal pages for seq CYP61A1 Schizosachharomyces octosporus See the fungal pages for seq CYP61A4/61A1 Botrytis cinerea (a plant-pathogenic fungus infecting over 200 plant species) GenEMBL AL111744 AL111746 Bitton,F., Levis,C., Fortini,D., Pradier,J.M. and Brygoo,Y. 58% to S. pombe 61A3 note: most fungi have onlt one CYP61 gene so this is probably the ortholog of CYP61A1 but for historical reasons it was named CYP61A4 CYP61A5/61A1 Neurospora crassa No accession number Neurospora crassa sequence contig 1.384 (supercontig 48) note: most fungi have onlt one CYP61 gene so this is probably the ortholog of CYP61A1 but for historical reasons it was named CYP61A5 4488 MASNATSFTSPLASAKLGTVAVPPQLEY 4404 VIDTISQASGWTIVFTLLAVLVAWDQ (0) VKYVLNKGTIA 4225 4224 GPSWKIPFIGPFLESLDPKFEEYYAKWLSGPLSCVSVFH (2) 4108 4037 KFVVIASTRDMARKVFNSPSYVKPCVVDVAHKLLGADNWVFLDGPAHVEFRKGLNGLFTR 3858 3857 RALEIYLPGQEEVYNRYFKSFLEITKNAGGKPVPWMPHFREVITAVSCRSFVGHYISDEA 3678 3677 VKKIADDYYLITAALELVNFPIIIPYTKTWYGKKAADMVLAEFAKCAAKSKVRMAAGGEA 3498 3497 TCIMDAWIKQMIDSKKWREAVASGNTEGLEKPTHLLRDFTDYEISQTIFTFLFASQDATS 3318 3317 SAATWLFQIMAQRPDVLDRVREENLKVRNGDIHAPINMEQLESMTYTRAVVRELLRYRPP 3138 3137 VLMVPYLVKKAFPITEDYTIPKGSMVVPTTYMALRDPEVYPNPDYFDPERYYSGDAEAKG 2958 2957 AKNYLVFGVGPHYCLGQHYAQLNLALFVGKASLLLDWKHHPTPLSEEIKVFATIFPKVSL 2778 DRSCCSACRDFFLLTLYRMTVPSLSRSASGKQRGFWVWTSI* CYP61A1 Neurospora discreta JGI gene model estExt_fgenesh3_kg.C_120065 98% to CYP61A5 N. crassa ortholog of CYP61A1 See the fungal pages for seq CYP61A6 Fusarium graminearum A second CYP61 gene in Fusarium graminearum CYP61A6 Nectria haematococca (Fusarium solani group) 79% to CYP61A6 Fusarium graminearum (ortholog) only 56% to 61A1 F. graminearum 55% to CYP61A1 Nectria See the fungal pages for seq CYP61A6 Aspergillus oryzae GenEMBL AP007155.1, BAE57461.1 73% to CYP61A6 Nectria haematococca MDAKMATEIFNTTSAQPESPQSWAYKTSQLQGALPDFLAEWSAWQYVVTFLVGLVLYDQ (1?) VLYLKRKGALPGPTFKIPLMGPFIQAIHPTFDGYLRQWASGPLSCVSIFHK (2?) 672882 FVVLASDRDLAHKVFKSPAYAEPCLVPVARDIIGHKAWVFLQGRDHAEYRRGLV 673043 673044 PLFTSRAIATYLPVQERVLDDYYNQFVAATKANQGKPMAFMTLFREINCALSCRTFFGDY 673223 673224 ISDDQVKKIADDFYLATDALDLVNIPLAMYVPFTKTWLGKRTADAVHRVFAECAARCKAN 673403 673404 MAKGATPTCIVDHWVLHMMESNRYRERVAAGETNLEKPKNMIREFTNEEISDTLFTFLFA 673583 673584 SQDASSSATTWLFQILAQRPDVLDKLREENLAARGGDRNKPFDLPMLESLTYTNAVIKEL 673763 673764 LRYKPPVILVPYLATKDFPVTPDYTVPKGSMIIPSCYPALHDPEAYPNPDVFDPERWISG 673943 673944 DAESKTKNWLVFGAGPHDCLARKYVPLSLAGMIGKASLELDWEHHPTPRSEEIRVFATLFPM (0) 674129 DGCNLVFKRRP* CYP61A6 Aspergillus flavus NRRL3357 AAIH01000781.1 Only 1 aa diff to 61A6 of Asp. Oryzae, 73% to CYP61A6 Nectria haematococca MDAKMATEIFNTTSAQPESPQSWAYKTSQLQGALPDFLAEWSAWQYVVTFLVGLVLYDQ VLYLKRKGALPGPTFKIPLMGPFIQAIHPTFDGYLRQWASGPLSCVSIFHR (2) 6085 FVVLASDRDLAHKVFKSPAYAEPCLVPVARDIIGHKAWVFLQGRDHAEYRRGLV 5924 5923 PLFTSRAIATYLPVQERVLDDYYNQFVAATKANQGKPMAFMTLFREINCALSCRTFFGDY 5744 5743 ISDDQVKKIADDFYLATDALDLVNIPLAMYVPFTKTWLGKRTADAVHRVFAECAARCKAN 5564 5563 MAKGATPTCIVDHWVLHMMESNRYRERVAAGETNLEKPKNMIREFTNEEISDTLFTFLFA 5384 5383 SQDASSSATTWLFQILAQRPDVLDKLREENLAARGGDRNKPFDLPMLESLTYTNAVIKEL 5204 5203 LRYKPPVILVPYLATKDFPVTPDYTVPKGSMIIPSCYPALHDPEAYPNPDVFDPERWISG 5024 5023 DAESKTKNWLVFGAGPHDCLARKYVPLSLAGMIGKASLELDWEHHPTPRSEEIRVFATLF 4844 4843 PM (0) 4838 DGCNLVFKRRP* CYP61A6 Gibberella moniliformis GenEMBL AAIM01002518.1 81% to CYP61A6 Nectria haematococca also called Fusarium verticillioides FVEG_08786 29769 MTSIVSPRHEAAISNASSTLKEDISNG 29688 NGLLSSYLNGYSNAQILLAILVVLIAYDQCMYLWRKGPIAGPAFKIPFMGPFIRALYPKF 29509 29508 DHYLAQWASGPLSCVSVFHK (2) 29456 29396 FVVLASDRDIAHKVFKSPTYAKPCIVPMAETLLRPNAWVFLQGKAHTEYRKGLNGLFVN 29220 29219 KALSTYLPVQEKVYDDYFGRFVAASEANKGKPMAFMRLFREINCALSCRTFFGDYISQDA 29040 29039 VEKIADDFYEVTAALELVNVPLSVYVPFTKCWKGKRTADAVLAEFAKCAAACKANMTSGA 28860 28859 EPRCIVDQWVLHMMESKKYNDRIAAGEEGVEKPRNLIREFTDDEIGQTMFTFLFASQDAS 28680 28679 SSATTWLFQVLAQRPDVLDRLREENLAVRNGNRHQPFELSMLESLPYTNAVIKELLRYRP 28500 28499 PVIFVPYEATKSFPVTPKYTVSKGTLIVPTCYPALHDPQAYPNPETFDPDRWITGDAESK 28320 28319 TKNWLVFGAGPHDCLARKYVPLTMAAMIGKASLELDWVHHATSRSEEIRVFATLFPE (0) 28149 28084 DECQLVFTKRE* 28049 CYP61A6 Trichoderma reesei GenEMBL AAIL01000244.1 75% to CYP61A6 Nectria haematococca MATIIEGSMPISFESTINATQHGTSGLVESLWSSCSAASNWQIAITALLVLVAYDQ (1) 10081 YLYISRKGSIAGPTFKIPLIGPFLQAIYPKFDAYLAQWASGPLSCVSVFHK (2) 9929 9869 FVVLASDRDLAHKVFKSPTFVRPCLVPMAINIMRPTAWVFLNGKAHAEFRRGLTGLFTN 9693 9692 KALSTYLPVQEKVYADYFNRFVEHSKANGGKPTKFMGWFREINCALSCRTFFGDYISQEA 9513 9512 VKRIADDFYLITDALELVNIPLSIHVPFTKCWRGKRTADAVLEEFTKCAAACKVNMASGA 9333 9332 QPTCIVDQWVQHMFESKRYNEAVAAGVEGLEKPTNLIREFSDEEIAQTLFTFLFASQDAS 9153 9152 SSATTWLFQILAQRPDVLDRLRAENLAARGGDKNRPFELEMLESLTYTNAVVKELLRYRP 8973 8972 PVIFVPYLALKPFPVTPDYTVPKNAMIIPSCYPALHDPAAYPNPDVFDPERWITGDAETK 8793 8792 TKNWLVFGAGAHDCLARRYVPLTMAAMIGKASLELDWVHHATERSEEIRVFATLFPM (0)8625 DECPLVFTRRE* 8517 CYP61A6 Aspergillus terreus NIH2624 GenEMBL AAJN01000026.1 76% to CYP61A6 Nectria haematococca MNVKMQADMANSTQPDRQIWHVTTHWQGKVSDSFSGWSACQYIVTILLGVIFYDQ (1) 44968 VMYIKRKGSIAGPTFKMPLMGPFLQAINPKFEEYLAQWASGPLSCVSVFHK (2)45120 45171 FVVLASDRDLAHKVFKSPSYAEPCLVPIAKDILGPKAWVFLQGKAHAEYRRGLTPLFT 45344 45345 NNALATYLPVQDKVFATYFDKFVAASAANNGRPMEFMTMFREINCALSCRTFFGDYISQD 45524 45525 AVKKIADDFYLVTAALELVNIPLSMYVPYTKPWLGKRTADAVHAEFARCAAACKANMASG 45704 45705 AAPTCIVDHWVLHMMESNRYRERVAAGEANVEKPTNLIREFTNEEIGETLFTFLFASQDA 45884 45885 SSSATTWLFQVLAQRPDVLDRVRDENLAARGGDKSKPFDLPMLEGLTYTNAVVKELLRHR 46064 46065 PPVIFVPYLATKNFPVTPSYTVPKGSMIIPSCYPALHDPAVYPDPDVFEPERWISGDAES 46244 46245 KGKNWLVFGAGAHECLAKRYVPLSMAAMIGKAALELDWTHHATERSEEIRVFATLFPM (0) 46418 DGCQLVFRKRS* 46499 CYP61A6 Chaetomium globosum CBS GenEMBL AAFU01000675.1 148.51 74% to CYP61A6 Nectria haematococca 796 MAADHVNASSSTPELLMSSSGLASGSSSWQFRDGVVGITAEWSAWQYVITLLLGLVVYDQ (1) 975 1031 VMYLYRKGSITGPRFKVPLTGPFIQALHPKFESYLAQWASGPLSCVSVFHK (2) 1183 1242 FVVLASDRELAHKVFKSPTYTEPCIVPLAKDILGHKAWVFLQGKAHAEYRRGLTPLFT 1415 1416 NKAMATYLPAQEKVFADYFDKFVAASQANVNQPLPFMTLFREINCALSCRTFFGDYISPA 1595 1596 AVKKIADDYYQATAALELVNVPFSIYIPFTKPWRGKRTADAVHAEFAKCAAACKRNMATG 1775 1776 AAPTCIVDHWVRHMFESNRYRERVAAGETDVEKPSNLIREFTDVEISETLFTFLFASQDA 1955 1956 SSSATTWLFQILAQRPDVLDRLREENLAARGGDRNKPFDLAMLESLTYTAAVVKELLRYR 2135 2136 PPVIFVPYLATKAFPITPDYTVPKGAMVIPSCYPALHDPKVYPNPDTFDPDRWITGDAES 2315 2316 KTKNWLVFGAGAHDCLARRYVPLTMAAMIGKAALELNWKHVATEKSEEIRVFATLFPM (0) 2489 DGAQLVFEKRP* 2579 CYP61A6 Aspergillus clavatus NRRL GenEMBL AAKD02000011.1 77% to CYP61A6 Nectria haematococca MDTEIANVSRAAAQSGSPQAWT 669433 PQPSSGQNTLLDCLAGWSTWQYLVTFLLGIVVYDQ (1) 669274 VMYLKRKGSIAGPAFKIPLMGPFLQAIHPKFDAYLAQWASGPLSCVSVFHK (2) 669122 669053 FVVLASDRDLAHKVFKSPSYAEPCIVPIAKDIIGHKSWVFLQGKAHAEYR 668904 668903 RGLTPLFTNKAMATYLPVQEKVFTDYFDRFVTASQANRGRPMAFMSMFREINCALSCRTF 668724 668723 FGDYISQKAVKRIADDFYLATAALELVNIPLSMYIPFTKTWLGKRTADAVHVEFAKCAAA 668544 668543 CKANMATGAKPTCIVDHWVLHMMESKRYHERVAAGETDVEKPTNLIREFTNEEIGETLFT 668364 668363 FLFASQDASSSATTWLFQILAQRPDVLDRLREENLAARGGDRNRPFDLPMLESLSYTNAV 668184 668183 IKELLRHRPPVIFVPYLATKNFPVTPDYTVPKGSMVIPSCWPALHDPDVYPNPEVFEPER 668004 668003 WISGDAESKTKNWLVFGAGPHECLAKRYVPLSMAAMIGKAALELEWTHHATEKSEEIRVF 667824 667823 ATLFPM (0) DGCQLVFSKRV* 667698 CYP61A6 Fusarium oxysporum 96% to CYP61A6 Gibberella moniliformis See the fungal pages for seq CYP61A7 Aspergillus niger JGI gene model estExt_fgenesh1_pm.C_40133|Aspni1 74% to CYP61A5 See the fungal pages for seq CYP61A8 Rhizopus oryzae (Zygomycete fungus, Mucoromycotina) 93% to P450-10_var1 53% to CYP61A3 S. pombe in CYP61 family See the fungal pages for seq CYP61A9 Rhizopus oryzae (Zygomycete fungus, Mucoromycotina) 50% to CYP61A3 S. pombe = CYP61 C-22 sterol desaturase See the fungal pages for seq CYP61A10 Aspergillus terreus NIH2624 NW_001471411.1 AAJN01000002.1 72% TO 61A1 A. ORYZAE, 55% TO A. TERREUS 61A6 70% TO CYP61A1 Aspergillus niger 70% to CYP61A1 Aspergillus terreus 55% to CYP61A6 Aspergillus terreus formerly CYP61A1, but a better match was found in A. terreus to CYP61A1 247642 MDFISKNTSPIIQKFQSPGIVAIFGSDISIWMVLLTVLITAVVYDQ (1) 247780 247843 FKYIRLKGTIAGPSWKIPFMGPFLESMDPKLEGYMAKWRSGELSCVSIFHK (2) 247997 248055 FVVIASTREMSCKIFNSPMFVKPCVVDSAHKLLGKDNWVFLDGKEHVDFRKGLNLLFTR 248228 248229 SALACYLPRQEEVFDMYFERFVARSAANNYKPLPWMPEFRKLMCTISCRTFVGHYISESA 248408 248409 LDKIAHDYYLITAALELVNFPIVLPFTKSWYGKRAADMVLREFSQCAAKSKLHIASGGEI 248588 248589 TCIMDAWVKAQLDSAAYRKNVAAGIPCEKPSHLLRDFTDYEIAQTVFTLLFASQDAN 248759 248760 SSAITWLFQIMADRPDVLDKLREENLRARNGDCNARLSMELFDKLPYTRAVVRETLRYRP 248939 248940 PVILVPYIAKKDFPITEKYTIAKGSMVLPSIWPATHDAEAYPNPDSFDPDRWITGTAEQH 249119 249120 PKNFLVFGTGPHYCLGQTYVQMNLMAIIGKASLALDWKHHITPDSEQIKVFATIFPQ (0) 249290 DDCLLTFRRQPGS* 62A Subfamily CYP62A1 Aspergillus nidulans GenEMBL U34740 L39121 (9392bp) Brown,D.W., Yu,J.H., Kelkar,H.S., Fernandes,M., Nesbitt,T.C., Keller,N.P., Adams,T.H. and Leonard,T.J. Twenty-five coregulated transcripts define a sterigmatocystin gene cluster in Aspergillus nidulans Proc. Natl. Acad. Sci. U.S.A. 93, 1418-1422 (1996) gene stcB CYP62A1 Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 complement(join(62552..62810,62866..64025,64093..64200)) gene="aflV 69% to CYP62A1 of Aspergillus nidulans (ortholog) note the earlier 62A1 sequence is missing some internal amino acids CYP62A1 Aspergillus oryzae GenEMBL BAE71332.1, BAE59529.1, BAC20333.1 66% to CYP62A1 Aspergillus nidulans, ortholog MTNTASRELIRAIEHVPLTWWFLAVGGAWIVSKIVKILQTAYFSPLRKVPGPWYARLTSARLAWASFANN RIYYVQSLHEKYGPIVLIGPEEVDIADPVAAKQIHRMGSGFVKAPFYKLLSPGPVDNIFNFRDAKLHSTR RKLYAKGFTLNSLRQQWEPTIRNIVALTVERIRHDAQQGEAEILGWWTLMANETVCKLTFNGGHDTVRNG TKDPFVLMLERRMGDLAHLLQHFAPPLYYLGRLLGRAEPRLHDVFFSQETMFEAGKHVVAIARSARDAEG DRNLFVKALAAGDLESKIGCLNDTEIITDAGALLLAGSDPTALSLTYLIWCVLNRPKLQADLESEVAGLQ GDITDAACADLPTLNAVICESLRLYGPAPGSMPRSPPPDGATLCGYYIPPSAVVVTQNWSLHGNPKVWKD PHTFDHTRWLPGSSLSEEAKMSFNPFGQGARQCLGIHLGWMQLRLATALFFRRCPGAKLAPSTTPESMVM IDSFIAGMPKARRCAIQL CYP62A1 Aspergillus flavus 99% to CYP62A1 Aspergillus oryzae See fungal pages for sequence CYP62A1 Aspergillus nomius AY510454 complement(join(68188..68446,68501..69660,69727..69834)) gene CypX of aflatoxin biosynthesis 92% to CYP62A1 Aspergillus flavus MINTPPRDLIRAIEQVPLTWWFLTVGGAWILSTIVK ILQTAYFS PLRKVPGPWYARLTSARLAWASFANNRIYYVQSLHDKYGPIVLIGPEEVDIADPGAAK QIHRMGSGFVKAPFYKLLSPGPVDNIFNFRDAKLHSNRRKLYAKGFTLNSLRQEWEPT IRNIVALTVEKIRHDAQQGEAEILGWWTLMANETVCKLTFNGGHDTVRNGTKDPFVLM LERRMGDLAHLLQHFTPPLYYLGRLLARGVPRLHDVFFSQEMMFEAGKSVVAIARGAR GGEGDRNLFVKALAAGDFESKIGGLNDTEIITDAGALLLAGSDPTALSLTFLIWCVLN RPKLQAELESEVAGLPEDITDAACAELPIMNAVIHESLRLYGPAPGSMPRSPPPDGAT LCGYYIPPSAVVVTQNWSLHGNPKVWKDPHSFDHTRWLPGTSLSEEAKMSFNPFGQGA RQCLGIHLGWMQLRLATALFFRRCPGAKLAPSTTPETMVMIDSFIAGMPKARRCAIQL* CYP62A1 Aspergillus ochraceoroseus strain SRRC 1432 AY092402.3 join(18386..18418,18503..19650,19718..19979) gene StcB/AflV in aflatoxin/sterigmatocystin biosynthesis 80% to CYP62A1 Aspergillus nidulans N-terminal exon modified to be more like Podospora anserina EST CU876063 MDFFTSLGSTQWLSILVYVGGLWLLVKVFQVSL (0) TLRIAYFSPLSRIPGPWYARLTGLRLSWSVFAN NRIHYVHKLHQDYGPIVRIGPQEVDVADPALGREMHRMGSGFTKAPFYALLSPAPVDN IFNFRDAKLHAARRKLYAREFTLQSLRKEWEPTIREITKMTVQRIKHDALNGEAEIMG WWTLMANEVVCRLTFGGGAGTVAKGVKEPFVLMLERRMGDLAHLLKHFAPPGYYLGRM LAHVIPQLHDVFYSQEKMFAAGGGVVSRARAVKRDGDDSGPSNLFNKALEAGSLTDTD IITDAGALLLAGSDPTAISLTFLLWCVLSRPQIQHEVEAEVAQLEGEITDAACEGLPI LNAVINESLRLYGAAPGCMPRSPPPGGATLGGYFLPADTVVVTQNWSLQRDPNIWENA HSFDHTRLPREIRMTEQAKIAFNPFGHGARQCLGIHLGRVEMRLATAIFFRECAGARL SESVTEESMQVVDSFIAGVPRDRRCTIRLG* CYP62B1 Fusarium graminearum FG05806.1 AACM01000233 FGcontig1.233_scaffold3 See fungal pages for sequence CYP62B2 Nectria haematococca (Fusarium solani group) Necha1/scaffold_23:519115-520695 66% to CYP62B1 Fusarium graminearum (probable ortholog) See fungal pages for sequence CYP62B3 Fusarium oxysporum 76% to CYP62B1 Fusarium graminearum See fungal pages for sequence CYP62B3 Fusarium verticillioides 90% to CYP62B3 Fusarium oxysporum = ortholog See fungal pages for sequence CYP62B Fusarium sporotrichioides fragment BI188839 EST 88% to 62B1 PRVVPGKGATLAGHYLPAGTVVSTQAYTVHRDESIWPNALRFDGYRFMDKSRVTATQKSA LSPFGAGSRV CIGMHLAYMELRLGAALFFRECRGVKLGVEMTDEMMAMENHFLIAPKAHKCMVKL* CYP62C1 Aspergillus nidulans AN6414.1 49% to 62B1 53 clan See fungal pages for sequence CYP62C2 Aspergillus flavus NRRL3357 GenEMBL AAIH01000225.1 63% TO 62C1 1 AA diff to Aspergillus oryzae CYP62C2 MEWQIYLGIAFALWIVQ (0) 958 LYTAFTSPLRRVPGPLYTVLTRLPLKLASLTGNRIYFVHELHRKYGPIVRIAPDDVDISS 1137 1138 LAEFREIHRAGSPFLKSKWYEKFVPSKRSGVFTMRDPKEHAARRKLFARPFSKSELRRT 1314 1315 WEPAVREKVQLAVSQIQRELKAVGKSDLLKWWTFLATDVSGQLMFGESFNMLQLGK (0)1482 1535 KNQYINVLESTMMGSGIGAELPLVAWLGRHIPLSSFQNMFRATDYLFQYGQRAVTNSRTT 1714 1715 SNASRNIFAGMVYESEKGDGIITDEEVVLEAGNLIVAGSDTTAITLTYLIWAVLSQP 1885 1886 KLQRELEEEVNSLSADFDDSALEELPLLNAVIMEALRLYGAAPGALPRETPEGGAKFCGY 2065 2066 FIPQGMTVTTQAYSIHRDGDIYPDPERFDVSRWLKTETSASELAKKAFSPFGAGGRICLG 2245 2246 IHLAWMELRLATAQFFRECAGVRLAPSATWENMRPVNYFLISPRGHQCEVECNY* 2410 CYP62C2 Aspergillus oryzae GenEMBL BAE55279.1, AP007151.1 475900 MEWQIYLGIAFALWIVQ (0) 475950 476022 LYTAFTSPLRRVPGPLYTVLTRLPLKLASLTGNRIYFVHELHRKYGPIVRIAPDEVDISSLAE FREIHRAGSPFLKSKWYEKFVPSKRSGVFTMRDPKEHAARRKLFARPFSKSELRRTWEPAVREKVQLAVS QIQRELKAVGKSDLLKWWTFLATDVSGQLMFGESFNMLQLGKKNQYINVLESTMMGSGIGAELPLVAWLG RHIPLSSFQNMFRATDYLFQYGQRAVTNSRTTSNASRNIFAGMVYESEKGDGIITDEEVVLEAGNLIVAG SDTTAITLTYLIWAVLSQPKLQRELEEEVNSLSADFDDSALEELPLLNAVIMEALRLYGAAPGALPRETP EGGAKFCGYFIPQGMTVTTQAYSIHRDGDIYPDPERFDVSRWLKTETSASELAKKAFSPFGAGGRICLGI HLAWMELRLATAQFFRECAGVRLAPSATWENMRPVNYFLISPRGHQCEVECNY CYP62C3 Aspergillus niger JGI gene model estExt_fgenesh1_pg.C_150110|Aspni1 67% to CYP62C2 See fungal pages for sequence CYP62D1 Nectria haematococca (Fusarium solani group) fgenesh1_pg.scaffold_70000012 Necha1/scaffold_70:35727-37407 43% to 62B1, 43% to 62C1, 37% to 62A1 See fungal pages for sequence CYP62E1 Phaeosphaeria nodorum SN15 GENEMBL AAGI01000353.1 46% TO 62C1, 46% TO 62B1, 42% TO 62D1, 39% TO 62A1 MAFVDVVTANKPLLVALI (0) FVYVGLTCPTRHLPGPWYTRFTHYRLKRAVVTGQRIFYIDALHKQYGPIVRLSPTEV 956 GVADLDAFKEIHKIGTKYMKSEWYLRLANFPKAGVFTMLDPREHGARRKLLSRSFSRSY 1133 LVENWEPAVREKALLVVTKIKVDAMQSTADVYNWWMLLASDVSAHLAFGESFRMLETGH (0) 1307 ANQFIRVLKKLTMGAGIMVEMPFLRLLRFVPINAVQEMFNANEFILTGAGRAVEMARSRT 1542 GESNIFAKVIEDCEKEGEGHIDDMDVRIEAMNIIIAGTDTTGVTLTYLTWAVLQQPQ 1713 LQSALEVETAGLRENFTESDLINLPLLNATIEETLRLYGAAPSSLPRVVPQGGTRFSGHY 1893 IPQGVTVDTQAYTFHRDPRIWSDPLTFDPQRWISSQEGYTAEALSPDAKTAFHPFGVGAR 2073 SCIGIHLARMELRYAVAFFFRECRGIRLAPSTTLESMEFENFFLIAPKAHRCDITFQQ* CYP62F1 Mycosphaerella fijiensis JGI gene model e_gw1.7.207.1 revised at N-term 40% to CYP62B2 Nectria haematococca 41% to CYP62B1 Fusarium graminearum See fungal pages for sequence 63A Subfamily CYP63A1 Phanerochaete chrysosporium (white rot fungus) GenEMBL AF005475 (208 amino acids) Kullman,S.W. and Matsumura,F. Identification of a novel cytochrome P-450 gene from the white rot fungus Phanerochaete chrysosporium Appl. Environ. Microbiol. 63, 2741-2746 (1997) CYP63A1 Phanerochaete chrysosporium (white rot fungus) No accession number Jagjit Yadav Submitted to nomenclature committee Dec. 15, 2000 Clone name Pc-1 Aside from intron editing, this sequence and AF005475 have only 10 aa differences clustered in one region and are probably the same gene JGI gene model ug.20.36.1 MGLTQAQRLVLGQLARLVAPALAVCVLLAAARRTQLVRAPVWADALIALIAIPLFHVGRA HWRYARLARKAARLGAALPPRWEGKLPGSVDVLQLVDEAYRRGFLSDYFYEKFGELGHTY NFYVLWDMDYCTEDAAVIKAVLATDFNNWVKGERFDSYMHSVLGTGVFNADGELWKFHRS MTRPFFARERITDFETFNRHAEEAILKMKERLREGFAVDFADLISRFTLDAATEFLFGAC VHSLAGALPYPHGAPAHLHTTRARIPADDFAAAFRAAQDAVSHRARLVWLWPWFELARSR TDTPMRTVDRYLTPIIERALAMSRAAKQAPQGEKEEVADGETLLDHLARYTTDPTILHDE ILNIMIAGRDTTGGTLTFVIYFLTQHPDVLQRLRQEILDVVGPSNLPTYDDIKQMKYLRA VLNETQRLYPPVPWNMRYAVEDSIVPNSEPEGKPWFIPAGASVSYSVHCMHRRKDYWGPD AEEFDPDRFLDERLHKYLTPNPFIFLPFNAGPRICLGQQFAYNEMSFFLVKLLQTFEDIS FERDAFEPNALPPAEWAKFPGRKGKEKFWPRAHLTLYSEGGMWVKMREAQAMGQVA CYP63A2 Phanerochaete chrysosporium (white rot fungus) No accession number Jagjit Yadav Submitted to nomenclature committee Dec. 15, 2000 Clone name Pc-2 JGI gene model ug.20.35.1 85% to 63A3, 59% identical to 63A1 MLVSVDALALRTLVYELTYLLYPAVPTAAALILLQRFGNVWLPTWTIVLLSLCNVPVAHR ILVWLKDGRAARKAASMGAILPPRLKGRWPGSIDLLRQLTQTFETGFLSEMLWGYMHVLG QTFEVYILWDSNYVTSDANVIKTILATDFDNFVKGEKLDVCVRPVLGTGVFNSDGEMWKF HRSMTRPFFTRERISHFDLFDRHADATMAKMKARLAEGFAVDFQDLISRFTLDSATEFLF GQCVHSLASVLPYPHDAPAHLQTTGASRTEDFARAFAEAQDAVSFRLRMGWLWPWFELFG SRTKAPMAVVDAFLDPILRDAVARADKIKRENGGRVPEVKGEIEEDETLLDHLVNVVQTK ILHDEVLNIMIAGRDTTGGTLTSAVYFLSQYPEVLRRLREEILEKVGPTRRPTYDDIREM KYLRAFINETLRLYPAVPWNVRYPVKDTTIPGPHPDKPYFIPANTPVSYSVHCMHRRTDY WGPDAEAFDPDRFLDARVQRYLTPNPFIFLPFNAGPRICLGQQFAYNEMSFFVIRLLQHF DEVQLCEDALAPDCRVPDAWRGAPGRKGVERFWPKAHLTLYAKGGLWVKMREASTSEAVV CYP63A3 Phanerochaete chrysosporium (white rot fungus) JGI geme model ug.20.34.1 MPSSIDFPDRLVLRVIAYELVFLFYPAVPAAAGLVLLRRLTDIWLPTWAIVLLSVCSLPVVHGLSIWR 26138 26137 NHWRAARKAARMGAVLPPRLKGRWPGSIDLLMRLTDAFETGFMS (1) 26006 25949 DLLWEYMHTIGQTFEVYVLWDSNYVTSDANVVK (0) 25851 25697 AILATDFTSFVK (1) 25662 25698 GKKFDVCMRSVLGTGVFNSD (1) 25639 GDMWK (2) FHRTMTRPFFTRERISHFDLFDRHA (1) 25433 25379 DDAMAKMKARFAEGYAVDFQ DLISRFTLDSATEFLFGQCVHSLASVLPYPHNAPAHLQ 25206 25205 TTSASAAEDFARAFAEAQTVLNFRIRMGWLWPWFELFGSRTKAPMAVVDAFLDPILKAAV 25026 25025 ERADQIKHENGGKVPEAKEEIDEDETLLDHLVKYTN (1) 24918 24855 DPKILHDEVLNIMIAGRDT (0) 24805 24748 TAGTLTSAVYFLSQYPEVLRRLREEILEKVGPTRRPTYDDIREMKYLRAFIN (1) ETLRLYPAV (2) 24515 PWNVR (2) 24380 YPVKDTTIPGPEPDKPYFIPANTP (2) 24318 24258 VSYSVHCMHRRTDYWGPD AEAFDPDRFLDARVQRYLTPNPFIFLPFNAGPRICLGQ 24091 24090 QFAYNEMSFFVIRLLQHFDEVQLCEDALAPDCRVPDAWRGAPGRKGVERFWAKAHLTLYAK (0) 23908 GGLWVKMREAPTSEAV* 23800 CYP63A4 Phanerochaete chrysosporium (white rot fungus) JGI geme model pc.151.16.1 PC-4 MALPPGLQYLLPQLPLLLAPPAAVLLAAHAARAFAGTAAPAWALALACVLSWPVALTALV QLRAHRVAREAAARGARLPPAVEARYPGGVDLMRRNNSEVEEHIPGYRLSEFGRQYGWTY NFRMLFQDRVRGRPRPRPPGRILATDFTSYEKGAVFSAQMKSLLGTGVFNADGDLWKFHR AMTRPFFSRDRISHFDVFDRHAEDALKLAKARLSEGVPIDWQDLVSRFTLDSATEFLFGQ DVRSLSAPLPHPPTAPQAQHDTHDAEHPANRFAHAFLQAQLASARRSRYTAAWPLWEFWE NKVEKHTRVMDEFIQPLLRDALARKAKGADAQAEEAVADGETLLEHLVKLTDDPQIIHDE TLNILLAGRDTTAITLTMAGYMLAEHPDILQRLRKEILDTVGTRRPTYDDIRDMKYLRAF INEVLRMYPPVPFNVRFSTAPTVWPSPEGDFYVPAGTRCMYSVFVMHRRKDLWGPDADKF DPDRFLDERLGKYLTPNPFIFLPFNAGPRICLGQQFAYNETSFMLIRLLQRVSKIELHPE VSPQSVAPPGWAASSISDGKDKVVFKSHLTMYVQGGLWVTMQFENPEEH CYP63B1 Phanerochaete chrysosporium (white rot fungus) JGI geme models genscan.57.18.1 and genewise.57.16.1 PC-7 MPHPFSRYRLRVFGDFVRIVLAPSFVFWSAVQILKLRLGLLSPAAWLTFLFAASYARVQY RGFLQRQEARRRGGVLPPEVVGRWPGNIDILIKLGKASLTAYPGSFYLDLFEEYQSTTLN LKLLWSDLVRCLSFCRLSAVLKTLSQIITMDEEHIKHILTTGFNHFWRGRRQKERMYAPS GASRRHDTDSQGDVSQEWKKHRALARPFFARDRISDFDLFEKYAGATLGILGGLAGRGAA VDVQDLYARFTLDAAAEFLFGERLDTLHGALPVAGQAKLGSKGAATDDAFGAFVRAFEAS QDIITTRQVRGYFWPVRELFQDKVAPHAAVIGAFLEPIVQRTLDRKAKMRAAGVSPTTEH DTFLDYLADHTEDPKVIRDQLLNILMAGRDTTACLLTYVTYVMAMYPDIMQKMRQEVLHV CGHDAPNFEKLKALRYVHAVLNETLRVFPPVPMNVREVRARGVVLPHADPTYAAAPAPLY VPGGTVVMYLPVLTQRNTALWGDDADVFDPDRWLDARLRRFTENPMMYTPFSGGPRICIG QNYARNEATYLLVRLLQQFDAVALAPEAQPAGSLPPPEWRHARGRAAEERIWPAYAITLY VKVRLSLQWLYC CYP63C1 Phanerochaete chrysosporium (white rot fungus) JGI geme model pc.101.32.1 PC-5 MELHPRQYRLRFLLDVLRAIVWPQLVFNAALYLAGFHPGAFLRVVASVLAVPLLGTVRTA ISQRRNKIQAGAALGAKEVPCVRGKWPGNLDIVLGFVRSLKEAYLMQFLDDLFREYDCKT LNMRLLWEDQIWTIDEAHVRYMLAGPGFEWFHKGYYWQERMESFLGNGIFNRWAQRAIAR PWFVKDRISDLNIFDRHTTTTLALISEFVDRREAFDAQDLFARFTLDSASEFLFGRCLDT LHGTLPVAGRAKMGPKGTAIEDAFGSFARAFEDVQVQIARRTRIGKPWPLFELFTDKTAP SVAVIHDWLRPIVHEALAKKSAASAEKESGEDSTFLSHLANSTDDPQDIAYSVLNMLLAG RDTTASVLSFVVYFLALHPHVTEKLRAEILQAYGPDGRPSVEDMKDLKYVRAVLNETMRL FPPVPMNLRLSDAHPRIFPASGSAPKYYVAPRTVILYSIFLVQRRTDLWGADALEFRPER WLEPATARLLADHPFAFTPFHAGPRLCLGQNFAYNEMTFFIVRLLQRVSGFELAPDAQPE GSLPPARWKYGEGRQAVEKIWPASSVTTFIKVSLASMPCCGERWLKRRRQGGLWVRAVPA CYP63C2 Phanerochaete chrysosporium (white rot fungus) JGI geme model pc.101.28.1 PC-6, partial only 387 amino acids, needs more work QRAIARPWFAKDRISDLNIFDRHTSTTLALIADFADRREAFDAQDLFARFTLDSASEFLF GKCAETLHGTLPVAGRAKLGPKGSSVEDEFGSFAWAFEELFHDKTAKHRKVIQDWLQPIV REALHSKAAAARGEDTGEGTFLSHLTKTTDDPQDIAYSILNMLLAGRDTTAAALSFTVYL LALHPEVVEKLRAEVVQAYGSDGRPSVEDMKSLKYLRAVLNETMRLFPPVPLNIRTSDDT PRVFPASAGAPKYYVPPRTPVVYSSVIIQRRKDLWGADALDFRPERWLEPETARRLAENP FMFMPFHAGPRLCLGQNFAYNEMSFFVVRLLQRVAALELAPDAQPEGSLPPARWKNGEGR QAVEKIWPGSSVTTYIKVSSTRSRPCG 64A Subfamily CYP64A1 Aspergillus flavus GenEMBL U81806(2760bp) Prieto,R. and Woloshuk, C.P. ord1, an oxidoreductase gene involved in the conversion of O-methylsterigmatocystin to aflatoxin in Aspergillus flavus unpublished CYP64A1 Aspergillus parasiticus GenEMBL AF017151 Yu,J., Chang,P.K., Ehrlich,K.C., Cary,J.W., Montalbano,B., Dyer,J.M., Bhatnagar,D. and Cleveland,T.E. Characterization of the critical amino acids of an Aspergillus parasiticus cytochrome P-450 monooxygenase encoded by ordA that is involved in the biosynthesis of aflatoxins B1, G1, B2, and G2 Appl. Environ. Microbiol. 64 (12), 4834-4841 (1998) 97% identical to Aspergillus flavus CYP64A1 (0rtholog) CYP64A1 Aspergillus parasiticus aflatoxin pathway gene cluster GenEMBL AY371490 join(57543..57738,57815..57944,58000..58191,58256..58407, 58457..58701,58759..59030,59089..59488) gene="aflQ" function="conversion of O-methylsterigmatocystin to aflatoxin B1 and aflatoxin G1 and dihydro-O-methylsterigmatocystin to aflatoxin B2 and aflatoxin G2" CYP64A1 Aspergillus oryzae GenEMBL BAE71330.1, BAE59527.1 99% to CYP64A1 Aspergillus flavus MIYSIIICAGALLGLWILEKLLAPKDTRPPLPPGPWRKPIIGNLTDFPPKGTPEWLFWAKHQERYGPMSS LEVMGQTIIMINDAQLGIEIMHKKSALSQMIPDAPFAHMAGWGMSLATERNRQAWKTIRANMKQEIGTRR AISTFHSKMEIGIRRFLLRTLDNPDDLRFHIRKEANAFMMDVAYGYTIAPHGKDELYDLTQQSVRQFSHI FSPGEWSVNFFPILRYVPSWFPGASFQIKAAEYKWTIERMTMVPYLWIKDQVARGCSRPSILLRLLQKGH YESGSHQEQVLVWTNAEFVMGGSDTTVSAVSSLFVAMALYPEVQRKAREELDRVVGPTTLATFEHRSQLP FIDALVKEVFRWHPASPLGAPHITQEDQIWDGYLLPKGALLLPNIWTFTHDPSVYHDPMVFKPERFLEGK DSPPETDPMKFVFGFGRRICPGRFVTDEKLFLIACHAVSCFFISPKDPGAPEPDWLPGVISQPGTFDLNV VPRSPAHEELIRSIETDHPWKNADATDISRFMARNQMI CYP64A1 Aspergillus nomius AY510454 join(63157..63352,63429..63558,63614..63805,63870..64021, 64071..64315,64373..64644,64705..65104) gene OrdA of aflatoxin biosynthesis 93% to CYP64A1 Aspergillus flavus MIIGLIICAGALLGLWLLEKLLAPKDTHPPLPPGPWRKPIIGNL TDFPPKGTPEWLFWAKHQGRYGPISSLEVMGQTIIMINDAQLGIEIMHKKSALSQMIP DAPFAHMAGWGMSLATERNKQAWKTIRANMKQEIGTRRAISTFHLKMEIGIRRFLLRT LDSPDDLRFNIRKEANAFMMDVAYGYTIAPHGKDELYDLTQRSVRQFSHIFSPGEWSV NFFPILRYVPSWFPGASFQVKAAEYKRTIEKMTMVPYLWIKDQVARDCCRPSVLLRLL QKGHYESGSQEEQVLVWTNAEFVMGGSDTTVSAVSSFFVAMALYPEVQRKAREELDRV VGPTTLATFEHRSQLPFIDALVKEAFRWHPASPLGAPHITQEDQIWDGYLLPKGALLL PNIWAFTHDPSVYQDPMVFKPERFLEGNGSPPETDPMKFVFGFGRRICPGRFLTDEKL FLLACHTLSCFLVSPKDPAAPEPDWLPGVISQPSPFGLNVVPRSPAHEALIRSIETDH PWKNADATDISRFMARNEMI* 65A Subfamily CYP65A1 Fusarium sporotrichioides GenEMBL AF011355 Alexander, N., Hohn, T.M., and McCormick, S.P. (1998) The TRI11 gene of Fusarium sporotrichioides encodes a cytochrome P450 monooxygenase required for C-15 hydroxylation in trichothecene biosynthesis. Appl.Environ.Microbiol. 64(1), In Press The TRI11 gene catalyzes the hydroxylation of isotrichodermin to 15- decalonectrin submitted to nomenclature committee CYP65A2 Fusarium graminearum FG03540.1 AACM01000159 FGcontig1.159_scaffold2 See fungal pages for sequence CYP65B1 Neurospora crassa AABX01000364.1 cont3.489 Neurospora crassa sequence contig 1.742 (supercontig 102) 42% to CYP65A1 MNLSHLLPLGIPLPSPTSLIGIFLIG (0) TSILVSSILLSAIYNLTLHPLARHPGPFFHRASILPYLYRQITGTLPCSILDFHARYGPVVRISPNALSFADPQAWKDIY 26703 26702 GHRPHGEEEFAKLNLFYRIKGSPPSLLSETKEAHGTLRKLMAHGFSDRSMRAQEGIIGGYVK (1) EEQDERVPLDMVSWYNWTTFDIIGDLAFGEPFGCLEKAEYDPWVDAVGKSVRFGCVMFA LRLLGLEDWVCPLVRKLSGNARRFHRKRTMDKLQRRVKLTKERPDFLEGLLQKREEW (0) GIDMDALAANASLLIVAGSETTATLLAGATYMLLRDPEAMKKLTEEVRSTFKSEEEITL 25711 25710 SSVGNLEYMLACLNEAMRLYPPVPIGMPRVVPKGGAKVAGTFVPEG (0) TVVAVWHWATSHNEQHFVEPFEFHPERWMQDPRFANDRLD 25372 25371 AVQPFSVGPRNCIGR (2) SLAIAEMRLILTKVVYNFDMKLANPEKDWLDQHIYTLWDKHPLPVYLTPIERNGV* CYP65B1 Neurospora discreta JGI gene model scaffold_9-snap.228.1 85% to CYP65B1 N. crassa gene model does not appear to be assembled correctly. One low complexity sequence removed. mRNA may be needed to decide this assembly. See fungal pages for seq CYP65B2 Magnaporthe grisea MG06544.4 55% to 65B1 new N-term AACU01000188 cont2.1218 See fungal pages for seq CYP65C1 Neurospora crassa AABX01000034.1 cont3.25 Neurospora crassa sequence contig 1.96 (supercontig 12) 45% to 65B1 MPVIDKLAGVTGLSGLHLLLGAVGVT (0) ILLYIIGSAIYNVFFHPLASYPGPLLQRASPIPFSIRHALGVQPFHTQALHDKYG PVVRIAPNHLSFTDVRAWRDIFGHIAGSRHGQEELAKFKS FVRPIDAQPIHIISAGREKHGQLRRALANSFSDASMRGQGPLIGKYIDLLLHKLH EQGQEGKVPLNATSWYNSVTFDVAGDLIFGMSFGALDQNGNHSWLQYILGSLSSLAPMAA LSYAGLHWLVQLLWRAKGAEVFRKSMESVDLMLKERLKMPVDRNDLFEGLVQRQEKL (0) GLSFEELASNAWLLVL (2) AGADTTATTLSGTTYLLTQHPEVLKRVTQEVRSSFKSVEEIDISSVNRLTYMLAVLNEA 64919 64918 MRLYPPVTSLTARIVPRGGAHVLGEYLPEG (0) 64756 TIVDIQQWSMNHSKENWVDPWAFNPARFLDDKETAREKGNVLDALQTFSVGPRNCIGR (2) 64580 NLAYAEMRWILARILFDFDLRGAPTNQNWIERQKAYGLWDRIPLDVYFEPARERK* CYP65C1 Neurospora discreta JGI gene model estExt_Genewise1.C_41538, gene model incorrect. Added I-helix 87% to CYP65C1 N. crassa see fungal pages for seq CYP65D1 Magnaporthe grisea MG03832.4 47% to 65B1 partial seq. length 53916 runs off end, changed C-term AACU01001030.1 cont2.747 N-term on AACU01001031.1 cont2.748 seems to be missing some N-terminal seq. see fungal pages for seq CYP65E1 Magnaporthe grisea MG03830.4 37% to 60A3 AACU01001030 cont2.747 see fungal pages for seq CYP65F1 Magnaporthe grisea MG08498.4 42% to 65A1 AACU01001266 cont2.1600 see fungal pages for seq CYP65G1 Magnaporthe grisea MG00023.4 39% to CYP65A1 41% to 65B1 AACU01001352 cont2.7 see fungal pages for seq CYP65H1 Magnaporthe grisea MG10954.4 39% to 65B1 40% to 65A1 41% to MG05390.4 AACU01001824.1 cont2.2124 see fungal pages for seq CYP65J1 Magnaporthe grisea MG10070.4 39% to 65B1 , 40% to 60A2 AACU01001328 cont2.1928 see fungal pages for seq CYP65J2 Alternaria solani Alt3 BAD83682 69% to 65J1 Mg PKS for alternapyrone is next to three P450s called alt1, 2, 3 Chem Biol. 2005 Dec;12(12):1301-9. 1 MALSSALDSL WHQLDQLLSL INRNIITGII VLPVLYVLLK VIYNLYLSPL AGYPGPKLWA 61 VSRLPWNRAN MKGRISWKIR ELHDKYGPVV RIAPDELSYT TSGAWKKIYG QRNPEFVKAL 121 DGRGIAPASI GGQRSLMTEH QDKHLRLRRA IDPAFSQRAL REQESYFQDH SDNLVQKLKQ 181 RCKDGPLDMT TWYNLVAFDI VSDLAFGEPS GCVNNPDQPW IQAILARAKA IVWFQLAVQY 241 GFMGLLNWMT PKYVTESRKK HIAMTEAKLK ARVEAKNPGK DFMSYILEND EKLNHLELVM 301 LSSNFIVAGS GTSAGGMSGL TYLLLRNPDK LEKLKQEIRG LFKSRADMTL QAVTSCKYLR 361 ACLNEGMRLY PPTPGSLPRF VPGKGEMIEG KWVPGGYAVG VNQLAAGHSE RNFKKAREFH 421 PERWLDEPDS EFKDDDRSAV QPFSYGQRGC IGRSMAYAEM SLTMAKLVWY FDWELDEPDN 481 DWWNQQGTYL VWEKLPLQVK LTPVSDVVE CYP65K1 Magnaporthe grisea MG09945.4 poor match 41% to MG02982.4 AACU01001747.1 cont2.1897 see fungal pages for seq CYP65L1 Magnaporthe grisea MG02982.4 42% to CYP65B1 38% to 65A1 AACU01001589 cont2.600 see fungal pages for seq CYP65M1 Magnaporthe grisea MG00651.4 37% to 65B1 AACU01001445 cont2.120 see fungal pages for seq CYP65N1X Magnaporthe grisea Name changed to 563B1 see fungal pages for seq CYP65P1 Magnaporthe grisea MG07983.4 48% to 65B1 partial RUNS OFF END AACU01000985 cont2.1482 C-TERM HALF see fungal pages for seq CYP65Q1 Fusarium graminearum FG07765.1 AACM01000318 FGcontig1.318_scaffold4 see fungal pages for seq CYP65R1 Fusarium graminearum FG03191.1 AACM01000148 FGcontig1.148_scaffold2 see fungal pages for seq CYP65R2 Fusarium oxysporum 82% to CYP65R1 frameshift at TLQ/KMH see fungal pages for seq CYP65R2 Fusarium verticillioides 95% to CYP65R2 Fusarium oxysporum = ortholog see fungal pages for seq CYP65S1 Fusarium graminearum= Gibberella zeae GenEMBL AACM01000323 complement(join(168239..168404,168457..168623, 168682..168761,168816..169001,169050..169836, 169886..169960)) locus_tag="FG08023.1" sequence revised added back GMSRPEIIENSSLLIV to gene model MAIISTDLASGTSLGGGLVLLAAAVVLFVTGRAIYYVFFHPLAK IPGPKLYAATQLPYLYHLTRGQWVYRLKELHEQYGPVVRYTYNDVSFITADAYKTIYG HKTGGAREFGKDFRLYRQGRPAPSLIVTKHEDHKRQRKLLSHAFSHKALSEQNDLLMH YVNLFIEKLTQKAREGESVDMVAWYNFATFDLIGHLALGQPFGCLESGKYHPWVTKIF SSIKGISFSQAAIRLGLHNWISLITPAHVKKATKEHFEFTEHVAGARLDVKDNNSKDFMSYILRYNDER GMSRPEIIENSSLLIV AGSETTATLLSGVTYQLLTNPEAYKKLVTEIRLAFATEEDITAAAVD KQTYLLAVLSEGFRMYPPVPGALGRTIPKGGDFVEGHWFPENVGVSVPQLPAYHSARN FKDPEKFVPERWMDDPRYSNDCRAVVQPFSMGPRDCIGKNLAYLEMRLLLTRLLWKFD AELMPESREWKEQKIFTLWEKGSLHVKLTEVVREKN CYP65S2 Nectria haematococca (Fusarium solani group) JGI gene model e_gw1.7.949.1 short on N-term Necha1/scaffold_7:62397-63948 54% to CYP65S1 Fusarium graminearum see fungal pages for seq CYP65S3 Nectria haematococca (Fusarium solani group) JGI gene model fgenesh1_pg.scaffold_4000060 Necha1/scaffold_4:143263-145075 56% to CYP65S1 Fusarium graminearum 60% to 65S2 see fungal pages for seq CYP65S4 Fusarium oxysporum 58% to CYP65S1 Fusarium graminearum see fungal pages for seq CYP65S4 Fusarium verticillioides 94% to CYP65S4 Fusarium oxysporum see fungal pages for seq CYP65T1 Aspergillus nidulans AN7522.1 38% to 65A1 53 clan see fungal pages for seq CYP65T2 Neosartorya fischeri NRRL 181 AAKE02000011.1 DS027697.1 complement(627606..628748) 57% to 65T1 MMFVILAAASLLYISYILLLAIKTIYFHPLSHIPGPKWRIGIPLLHHLS 628603 AIRGRLDLDISSWHEKYGDVVRSGPDEVTFITAQAWKDIYGHGHRQLPKVQISAINGKDI 628424 628423 FSANDVDHARFRKALSHAFSAKGLQAQERIVTQYIDKLIQRLKWFAESGTAADMGKW 628253 YNLATFDLIGDLAFGEPFGGLDSA EYHHWVATMFGFVKSIPFLRALHRYPVVFKVILAFHPGSLMEMRSKQVEHAKATVQKRL SNSAAGRGDFMDSMLRHQGDKGGLSLDELEENANVLILAGSETTAALLSG VTYWLLRTPDALDKVMREVRAAFALERDITFNQVTAKLPYMLACLNEAFRLYPPVPGGLQ RWTEVPTWISGYRVPAN (0) TKVSVHQLSAYSSLKNFHQADRFIPERWLPEAIEDPASPFFSDNRAVFQPFSIGPRNCLG RNLAYTEMRVILARVLWTFDLTLCEESRDWKDQKIFVIWERGPLMCKLAMRDDRGDCQGGR* CYP65T2P Aspergillus fumigatus Af293 XP_748490.1 also AAHF01000010.1 58% to 65T1 Nterm aa 52-168 plus C-term part, 84% to 65T2 name revised from 65T3P SLRGRLDLDLRSWHEKYDGVVRPGPDEVTFITAQAWKDIYGHGHLQLPKVQISTINGK NIFATNDVDHARFRKALSHAFSAKGLQAQECLVTRYIDKRIERLKGFTESGTAADMGKW 425909 *NLATFDLIGDVAFGEPFGGLD AYSSPRNFHQADRSIPERWLPQVIENPASPFFLDNRAVFQPFSIRPRNCLGRNLAYTEMR MILARGLWTFXX LCEESRDWTDR CYP65T4 Aspergillus oryzae GenEMBL BAE59716.1 68% to 65T1, 45% TO CYP65AF1 MPFNILLWPVIAFFLYSLLTAIWYIYFHPLRRIPGPKSWIIFPIMRHASAIRGRFDIDMRQHHAVYGPVV RFGRDEVSFITAEAWKDIYGHGHQQLPKVLSSASNMLDIISSNDTDHSRYRKALSHAFSAKGLQAQEPLL NSYVDKLIERLKGIAESNLPADMVKWYNLTTFDIIGDLAFGEPFGGLDNSEYHHWVSTIFESIKAIPFLK LKDAYPLAFKAILGLIPKGIMEARKRQLEHSRITVQKRLQTSSSYNRGDFMDSMLRNRGEKDSLNDSELE ANSNILIIAGSETTATLLSGATYWILRNPEALAKLTDEVRSVMKSEPEITAQKASAELPYMLACFDEAFR LYPPVPTGLQRRTLVPTRISGCDIPAGTKVSVHQSAAYWSSTNFHAPDRFIPERWLPEAKSDPSSPFYSD NRGVVQPFSTGPRNCIGRNLAYAEMRVILARVLWNFDLELCEESTQWSDQKAYTLWEKPPLMCRLKLRES FRG CYP65T4 Aspergillus flavus 99% to CYP65T4 Aspergillus oryzae see fungal pages for seq CYP65T5 Aspergillus terreus NIH2624 NW_001471196 68% to 65T4, 62% to CYP65T1 MALQEFLPVVPAISVLYIFLLAIWNLYFH PLRHIPGPRSWIAFPIMRHISASRGRLDSDMRRFHAQYGGAVRLAPAEVSFITPDAWKTI YGHGHTQLPKVQTSESKGLDIISSEGPNHTRHRKALAHAFSARGLQAQEPLVRGYVDKLI ERLKEFAESQLQVDMVKWYNLTTFDLIGDLAFGESFGGLDNSRVHSWVSTIFRSVKVLPF VRITDTYPILIPLLMALLPKSLRTARRDQTNYSKETVHKRLANTAAHGRGDFMDSMLRHR GEKDGLSDRELEENASILIIAGSETTATLLSGVTYWLLRSPEALAKVTDEVRSTFQTEGE ITLQDVGARLPYMLACLDEAFRMYPPVPCALERRVLTPIVIAGYNIPPG (0) TVVSVHQSAAYCSPANFHRPQDYIPERWLPDAKSNPSSPYFSDQRDVLQPFSVGPRNCIG KNLAYAEMRLILARVLWNFDLELCEESLHWKDQKSYLLWDKPPLMCKLKQRI* CYP65T6 Aspergillus niger JGI gene model e_gw1.4.330.1|Aspni1 65% to CYP65T4 see fungal pages for seq CYP65T7 Aspergillus niger JGI gene model fgenesh1_pg.C_scaffold_3000290|Aspni1 68% to CYP65T4 see fungal pages for seq CYP65T8 Aspergillus clavatus 69% to CYP65T2 Neosartorya fischeri see fungal pages for seq CYP65 fragment Aspergillus ochraceus DQ054597 C-term piece after heme signature Involved in ochratoxin synthesis See DQ054596 for a second P450 in this pathway This fragment is 60% identical to CYP65T5 C-term FLRAQGTASERNLAYAEMRLIAVKLLWNFDLAFEEECEGWDNQK SYNIWEKDPLKVKLTP CYP65U1 Aspergillus nidulans AN6466.2 37% to 65B1 53 clan revised 7/18/07 MGVYLFSSSLWATLLLAILIYRLVRITYNLYFHPLSGIPGPSAWAASRLP FVYALIRGTIIQDFQKLHEQYGPIIRIAPNEVTFAHPEAYTDIFQPRPEK SQGQFLKDPLWWARQPGHPDSLLSVINPERHAQMRRVLSPGFTARALRKQ EPFVQKYVNLLVAQLQDLVSKSPSAHINMTPWFNYTTFDIFGDLGFGESF DCLQHSRYHPWIALLFDSVKAAGFVIATRYYPLIEAVLLKCVPPSMRKIQ RDHYQQIVDKVQRRLGWELQRPDFMSYVIDERGGLRLDAGELYATFMILT TAGSETTATALTGTLNYLVNHSPESLQRLENEIRGAFSGLDAITLDAVRN LPFLNAVIQEGLRLCPPIPWVLPRLVPQGGSMICGTWLPGGTPVSIQAYT LNRDPTLFNRATSFLPERWLASSTSDSSSCFFNDQRQVVQPFSTGPRACL GQHLAWAEMRLILAKLVWVFDFGAIDGQGVNWEDLRTYLLVERKPIN VRISSRTV* CYP65U2 Aspergillus niger JGI gene model e_gw1.3.403.1|Aspni1 61% to CYP65U1 see fungal pages for seq CYP65V1 Nectria haematococca (Fusarium solani group) JGI gene model gw1.24.106.1 model short on N-term Necha1/scaffold_24:57028-58643 49% to CYP65B1 N. crassa see fungal pages for seq CYP65V2 Aspergillus oryzae GenEMBL BAE61720.1, AP007163.1 53% to 65V1, 43% to 65B1 revised 3/20/2009 with a frameshift = & MMLNSINIPQNAISWAVACAAA (0) 198235 ATFCLASSVVYNIFFHPLARYPGPLSHAASRVPYFIRVFKGTLPFD MLDFHTRYGDVVRIAPDELAFSHPDAWKDIMGHSKGGLYLEKASWFYRP LEYDPPHIFNEGRQAHSLLRRQLAHSFSDKGMRDQEPMIRGYVDLLLQRLREMGGREDTVDISAWYNYTT FDIIGDLSFGESFGCLKGSNYDEWIESIFRAFPFITALQALSFTPLVKEVLLALVPPSMRDRRERLRDFT RQKMLRRMAIAEERPDLIEGLLKKKEEL (0) GLTIDKLTA & AETLVVAGSETTATLLSGVTYLLLKNPEAYQ 199191 RLREEVRSTFSSEDDINLSSAAKLPYMLACVEEALRMYPPGA IGPPRVTPKGGAEILGARIPGRTYVSIHHWALYRREEYFTDPHTFHPERFLGDPRFSNDRHNVLQPFHIG PRSCLGRNLAYSEMRLILALLFFNFDMELSDDSQDWIQQKNVIMWQRGPLKVHLTYIHRNSA CYP65V2 Aspergillus flavus 99% to CYP65V2 Aspergillus oryzae see fungal pages for seq CYP65V3P Aspergillus oryzae GenEMBL BAE55600.1, AP007151.1 Gene model has wrong C-term, revised about 60% to CYP65V2, 48% to 65B2, 54% to 65V1 one stop codon and one frameshift in last exon MSTAALVNAKSVLIATISA (0) PIIYMLGMAIYNVYFHPLNRFPGPVSHAISRIPYFYRAVRGTLPFD MLKLHERYGDIVRIAPDELAFSHPDAWKDIFGHKNGEPEMAKAAWFYRPLDDPLHIVNEDTDEHRRLRRQMAHGF SEKSMRAQEPIIRKYVDLSLEKLHQSCKNGSLVILDWYNFTTFDIIGDLAFGEPFGCLEGCNYDQWIKGI FKGAYLGSFMQALSFVPRLKSTLLLLVPKSMQEAHQGHKALTKAKMLRRAAITEKRPDLIDGLLRNKDEL 1455183 KLGLDKLIANAEILIIGGSETTASLLSGVTYLLLQNPNAYETLKNEVRSKFNHQEEIN 1455356 LISVNKLSYILA CLDEALRMYPPIANGLPRVCPKEGSWVLGEYIPGKTVLSIHQWALYRREKHFKDPNTYHPERFLSSPEFL DDRRDAFQPFHTGPRNCLGRN (2) LAYS*MRLILALVIFNFDMRLPQRAKDWIRQRNYLMWDKGPLKVYLRPRTSI*1455965 CYP65V3P Aspergillus flavus 98% to CYP65V3P see fungal pages for seq CYP65V4 Aspergillus niger JGI gene model e_gw1.11.216.1|Aspni1 70% to CYP65V3P see fungal pages for seq CYP65V5 Histoplasma capsulatum ABBT01000064.1 Ajellomyces capsulatus G217B 55% to CYP65V4 see fungal pages for seq CYP65V6 Aspergillus terreus 57% to CYP65V5 see fungal pages for seq CYP65W1 Nectria haematococca (Fusarium solani group) JGI gene model e_gw1.5.258.1 Necha1/scaffold_5:405215-406714 49% to CYP65B1 N. crassa see fungal pages for seq CYP65W2 Aspergillus niger fgenesh1_pg.C_scaffold_14000165|Aspni1 53% to CYP65W1 see fungal pages for seq CYP65X1 Aspergillus oryzae RIB40 genomic DNA, SC023 GenEMBL AP007157.1c, BAE58707.1 join(331674..331760,331832..332607,332687..333038, 333095..333264,333318..333474) third P450 gene in a set of 8 on this accession 42% to CYP65S2, 40% to CYP65S3, 42% to 65A1 MALAAEYFSHALSQPYGSSFWVAVAASLVFICYITCDTVYNLCL SPLSSFPGPRLWAVSNIPRQLSILGGRSHLKMLALHHRYGPVVRVGPSELSFNSPQGF RDIYGFRRGQPQFQKDPKMYGSPLTGISNSIGGHVDSDTHSRHRRLLSHSFSERSLRE QEGIVVYYVDLFIQRLRERTSVNKIHRAEEDLKSWFNFTTFDIIGDLMFAETFDCLKD SQLHPWIALMFNNVKGIAFLGVLNEYSLFRKMQGALLPKALKQKMLENHKLCAQKADR RLQKGASRPDFVSMQTQSCKHMGPLLFFFFMAANTDSITIAGSETTASLLSGCLFYLC KHKYIMDQLNKEIRTTFSKDEEITSSKCFNLSYLNAVLKESLRLYPPVAASLLRLVPK GGCTIDGHFVPEDVTVSTHHYASYRDAANFTFPEQFIPERWLGTDNRFDSDRKDVVQP FSLGPRDCLGKNLAHMEMRLILSKLLFNFDIHLTPESENWGQQKMFIVWDKPALMVRL TDRFA CYP65X1 Aspergillus flavus 99% to CYP65X1 Aspergillus oryzae, 48% to CYP65AU1 see fungal pages for seq CYP65Y1 Aspergillus fumigatus Af293 GenEMBL XP_746900.1 49% to CYP65V1, 88% to CYP65Y1 Neosartorya fischeri revised 3/12/2009 added N-term, one stop codon MGVGSDKG*VVCYVITQAVYNVFLHPLSRFPGPLLHRASRLAYVYRLIGGTLTLDALE MHQRYGDIVRVAPDELFFAHPDAWSDIHKKTGREMDKAPWFYRPLEQSPRCIVDESHEQHSRLRRQMAPG FSEKSMRDQEPIIRGYVDLLLQRLLEKSNDGQPVVISDWFNYTTFDIIGDLVFGEPFECLGSSKYDGWIK SIFESLRPLTIVQALSFFPWLKRIALDTVPKNLQGDIKQQQQQTEAKMRRRMAVTASRGDLMEGLLKKKD ELHLSTEELIGNAQVLILAGSETTASLLSGVIYLLLQNAVAYRQLVDEVRSTFNSEKDINFVSVSQLSYM LACLNEALRIYPPVANGLPRVVPKGGAQILGQYIPEQTYVAIPQWALYHREQYFAEPNDFHPERFLGDPK FANDRRDALQPFSVGPRNCLGRNLAYAEMRLILALVIFHFDMKIDPDCRDWIQQRNFTLWQKPPLKVYLK PVARKSEPNQV CYP65Y1 Neosartorya fischeri 88% to CYP65Y1 Aspergillus fumigatus = ortholog see fungal pages for seq CYP65Y2P fragment Histoplasma capsulatum G217B C-term pseudogene of CYP65Y sequence 67% to 65Y1 ABBT01000195.1 Ajellomyces capsulatus G217B see fungal pages for seq CYP65Z1 Aspergillus fumigatus Af293 GenEMBL XP_751828.1 41% to 65R1 Fg, 44% to 65S3 MAPWNHDIFDRLMLSAILSWSILLQSQIANYRRRTLTRRQGCVYLIGHSIYAIYFHPLSRYPGPKIAAIS PIALLVWEIRGKVHSKVKHLHDRYGDVVRIGPNALVYRAPRAWKEIYGYRTKKGQRTFQKDPSLYVPTPN GVHAIITAAESDHIRMRRLLAHAFSDRALREQESLLHFYADLLVQKLHENLNHSHSEVVDIARWYNFTTF DLIGDLAFGEPFQCLKDSKYHWWVSIMLDAVKLSVYLKVLWFFPILSPLTKLLVPRYLHQRREASFQLTV EKVSRRLHRQTNRPDFISYILRHKDDENRMSRQEIDANAATFVLAGSETTAALLSGCTFYLLKNPHIYRR LVTEIRSRFQHPSEIRLSSIATLSYLNAVLEESLRIYPPIPAMLPRLVPEEGAMINGEYVPAGTSVSMSL WSTFHSSGNFHNPDSFVPERWLASPEEEASRCFTMDKKEAFQPFSYGPRNCLGQHLANAEMRLILAKVFW HSDMELCPESENWADQESYNLWSRPPLMVMISRANGRNYEDNMV CYP65Z1 Neosartorya fischeri NRRL 181 XM_001267015 94% to CYP65Z1 name revised from CYP65Z3 see fungal pages for seq CYP65Z2 Neosartorya fischeri NRRL 181 XP_001260852.1 56% to CYP65Z1 Aspergillus fumigatus Af293 76% to CYP65Z5 Aspergillus clavatus MSPISIVLSILVAVLLGQAIYNIYLHPLRKFPGPKLAACSNICY IWWTISGEIHAKLKELHDQYGDVVRISPSTLVYRSASAWKDIYGHRKHGASSFIKDPE YYIPSPNGGHILTAGDADHARQRRLLSHAFSEKAMREQESLVQHYVDLLVEKLRAECA ASRSTVDMMKWYNYTTFDIIGDLAFGEPFDCLRDNRYHPWVKMVFQSVKMSVYLRLIH LYPLLESLVKRLLPKKAAKMRNEHWRMSKDKVSRRLDLQTSRPDFMSCILRHNDERGM TRQEIEANAGVLILAGSETTATLLSGCTFYLLKHPEKYNKLVQEIRGAFQKQDDITFL SVARLPYLHAVLEESLRLYPPVPGYLPRKVPMGGASIDGQYVPGGVSVSVVPFSASRA KSNFAEPESFIPERWLENRDRRFETDNREASQPFSFGPRNCLGKNLAYAEMRLIVAKL LWNFDMTLHEDYRDWDDQTSYIIWQKPSLMVELKAVR CYP65Z2P Aspergillus fumigatus 87% to CYP65Z2 Neosartorya fischeri supercontig 2 3641455-3643345 region see fungal pages for seq CYP65Z4 Aspergillus clavatus 81% to CYP65Z1 Neosartorya fischeri see fungal pages for seq CYP65Z5 Aspergillus clavatus 76% to CYP65Z2 Neosartorya fischeri see fungal pages for seq CYP65AA1 Aspergillus fumigatus Af293 GenEMBL XP_749877.1 43% to 65S2, 40% to 65Z1 MSFLMESSAVGQPSPLSFSNTLNALPGALVLYAALHSLYNIYLHSLSRFPGPKLWQTSYIFRHIASIRGT LDVSIKAFHTEYGPVVRYSPDELSFISAEAWRDIYGFRENALPKDPSFYGLIQLSRDKSPSIFTVDQDHH PRVRKALSYAFSEKALHDQEPFVKHYVDLLIQRLRGIADAEDNRVDLVKWYNFTTFDIIGDLAIGRSFDC LQDSAYHSWVDAFWKSIKMISPYARAMATYTDVPRLLRLFAPRALKEARLRRLQYVGVHMEERLARGILR DKPDFISYILRSKGTADELTDGEVEANVSFLLLAGTETTATALSGTTYYLLKNPEGLRKATAEVRSAYNS EDEITFATTAERLPYMQACLTEGLRIYPSGPIAAPRRTPRGTVTWIAGHPVPPVGGASNSPLNFHRQADF IPERWLSPSTMDPASPFRRDNRAASQPFSVGPRNCLGKAFALNEMQVILARMLWNFDMKLLPQSDGWERQ RIFTLWDKGPLMVELNEVRSSLHNQLFLQVCMVVCMVVHQSRSLQRAVHSSGPARNLTG CYP65AA1P Neosartorya fischeri NFIA_074000 C-term is missing 92% to CYP65AA1 Aspergillus fumigatus downstream gene is a polyketide synthase both genes are broken and fused at YNSE/PVEP PVEP is about aa 410 in the A. terreus PKS gene XP_001215858 The high conservation (92%) suggests an assembly error rather than a deletion creating a pseudogene. CYP65AA1 in A fumigatus is upstream of a short PKS gene on AAHF01000007 CYP65AB1 Aspergillus oryzae GenEMBL BAE56591.1 41% to 60A2, 39% to 60A5, 40% to 60B3 43% to 65S1, 40% to 65A2, 41% to 65B1, 44% to 65H1 47% to 65S3 Supercontig 5: 63753-65569 (+) strand Revised 3/18/2009 MLFSLGPLTIVYGLVIFVVAKTIYNLYLHPLRSYPGPLLARATRWYYSYYVKIGLLPQKTKELHDQYGPC VRIAPDELSYNTAEAWEDICGHRTGQRTESFEKDLTFFPPAPNGVDSI IVAKDDVHRRFRRLLSHPMSDKA LGSQQEIITGYVDQLIHELRQRSERSEVVDMVRWFNFTSFDILGDLAFGESFGCLGSGLMHPWIELIFTS IKSVMDMQIIRRIPGLFSLILTIAGLQQKQDLQEQFMFCQKKARERYTKETTRPDFMTYILRATEEKGMT PEEIEANAQILIMAGSETTASALSGTLFYLLKNSMAMQKLRQEIHATFQAEAEITMRSTQSMEYLHAVLQ EAMRVYPPVPCTFPRTTPPGGAMVCGRFVPGGYIVGVNQLAAMTSEKNFKDPLKFIPERWCGDERYQEDS RKAYQPFSYGPRNCLGKNLAYAEMRLVLTRLLWNFEFDLLEESKDWHAKQKVWMMWDKGDLKVRLKPLRH CYP65AB1 Aspergillus flavus Supercontig 8: 73235-75051 (+) strand see fungal pages for seq CYP65AB2 Aspergillus clavatus 82% to CYP65AB1 Aspergillus oryzae see fungal pages for seq CYP65AC1 Aspergillus oryzae GenEMBL BAE59505.1 AY510452.1 58% TO CYP65AC2, 45% to 65U1 MSQFAREIVRNAIYNTSSPDADSVSLRKATTTVLLIGVT (0) YCILVGIYRVTLHPLAKYPGPKLAAVTRLWHSYHLCTGDIVSVLSRAHEAYGPVLRIAPD EVLFISSRAWDDIYGARPGKPE MDKDTPLYKGPTAPHSIVTVDGELHRFYRRLLAKGFSDAALREQEPVIQRNINLLVEKLHKEVAAGKTPE MTAWFNYATFDLIGELAFGETYGCLENSHYHPWVEMILEVMKLRAMTHAVGYYPWIFHILMWFVPKSLRE KFVTHRRYTHDKVQRRMDQKIHYKDLTTNLVDPQNGLERYEIDGNCSTLIIAGSETTATALSATLYFLTQ NENAKRKVIGEIRTTFNNAGDINSISVNQLKYLSACMNEALRIFPPGPAVFPRRVPQGGDFIDGHWIPGG TQVGIAHYCINRSRRNFVDPDKFIPERWLGDPTYQTDDRHAVQPFSYGPRNCIAHNLARLEMRLVLARLI WEFDWELAPGSERWEEEALVFNVWSTKPLMIKFTPVAR CYP65AC2 Aspergillus oryzae GenEMBL BAE54634.1, AP007150.1 40% to 60A2 (CYP60 is in the 65 family) revised 3/20/2009 MTLSAESSQLPWALGNIVQQVHGIPIWGTVLTILVLVVL (0) HYAIYAVYNITFHPLAKYPGPKVAAMTRIVYTSHLLSGRLVEFLHQTHLKYGAVVGVAPN EVIFTSSQAWNDIYNVRQGAPE MSKDTPLYQSLGTPPTIAEAGHDLHRRYRRLLAKGFSEAGLREQEPVIQQKISVLIKQLHAATAKGTTPE MTSWFNFFTFDLISELTFGESFKCQENGRFHPWILMVTQSIKFRAIIQALGYYPLLFKLFMGLIPKSYQE AFRDHQKLTSENVQRRIDKKVDYTDLASNLIDPKHNLERYEIDGNCAVLIVAGSETTTTALSATMYYLTQ NPEAKAKVIKEVRSSFSSAEEITAISVNQLKYLPACFNEAMRKLPPAPAVFTRRVPREGAYIAGNCIPGG THVGMCHFATNNSSLNFKDPEKYIPERWLGDPEYEDDARAAMQVFSVGPRNCIGQNLARLELRLLLSRVI WEFDWELDSASMDWDKDMPVYLSWGMKPLKFHFAPVVR CYP65AC2 Aspergillus flavus 98% to CYP65AC2 Aspergillus oryzae see fungal pages for seq CYP65AC3P Aspergillus nidulans 67% to CYP65AC1 AN10101.3 (partial) ETYGPILRIAPGEVLFTSSSAWDDIYGARPGKPEMDKXXXXX KGPIAPHSIVAVDGELHRFLRRLLAREFSDVKLREQEPVNQRSINLLIEKLHDEVAAGKTPEMTAMFN YATFDLIGELAFGETYGX MENWRYHPWVKMIFYVMKLRALTHAVGYCSWVFPILXXXX PQVLRD*FVVHRKYTTYDKVQRRKDRKIHYWELTTNLFDPQTGLER XXXXXXXXXXXXXXXXXX STALSATLYFLTQNEDAKRKVIGEXXXXXXXX GDIKSISVTQX KYLHTRVNETLQIFPPRSAVFPRRVPLGGDWILGGTKVGSVHCCISCCRWNFV DPDKFIPERWLSAPAY*NDDRRAMQAFKSSSLHFS*SLRLKMRLVLARLICEFDLELA PGSE*WEGALVFNVWGTKPFKI CYP65AD1 Aspergillus oryzae GenEMBL BAE55230.1 45% to 65V1, 43% to CYP65AF1 MFPCCLVPTISTSQPVEVSIAAIASWSYNVFFHPLSNYPGPRLAAATRLWYAWHCAKGSLPFAIHKLHLR YGDVVRVAPDELSYIHPDGWNEIYGHRPGQSEIPKDPSFYSSALSSPEGIFRAPRDRHGYIRRQMSHGFS EKSMREQEDTIRHYADLMISYLSTQANGPKENVVDFTRWYNYFTFDVMGQLVFGESFNCLQSSGFHPWVS IIFDSIRYNVFVRCTQFWPWLSPVIRRFIPKSFQRRKIEQQALSREKANYRKTIHDGRNDLVANLLKPDS GVTDLEYQSTVQTLIVAGSETTASLLCGVTFHLLNNPEKLEKAVKEVRSEFDSADKISFVSVNKLHYLLA CLNEALRVYPPVADGFPRRTGSNVEVINGQPVPPNTAIRMTHWATYRSPRNFVRPNEYLPERWLGNAPGF EKDHKNALQPFHVGPRNCIGRNLAYMEMRLLLALVLWNFDLELYPASKLWDKQRVYNLWEKPELKVKVLP RKI CYP65AD1 Aspergillus flavus see fungal pages for seq CYP65AD2 Aspergillus terreus AAJN01000140.1 60% to CYP65AD1 Aspergillus flavus see fungal pages for seq CYP65AE1 Aspergillus oryzae GenEMBL BAE60010.1 40% to CYP65AF1, 39% to 65S2 MEALPTYLESFWHSGGGTMGISILVMLSTFLALGTIFVYRIWLHPLSGFPGPKCCSVSSIPVAWAQLRGR NHEFVSSLHRKYGSVVRISPSELSFISGAAWNDIYSRSKGRPALERDRTFFNDMLVDPETITMANEATHS RIRRAMAPAFSPRALLEQEPIIQANIKLLMDKLEARAGSGGAPTDLRAWFNYTTFDLIGDLAFGESFGCL ATSTCHEWVQFVLDHFYTSTLLHVVHRFHPFNRVLAALLPKSLIEKRKAHDSMTLTKVHRRLEVQGRRND FTQHLLDAAEAGTLSSREVEKQASVLILAGSETTSVALTFAIYLVLTNKPVLDQLNDELHSTFKEEQEIN LLSVNQLKYLHAVIQEALRFCPPISNGFPRQTPPQGATVDGMFIPGKTVVNINHWAAYRSPRNFTLPEQF LPERWLGDPRFDEDAKDVFQPFSVGPRNCIGKKFAYDSMKLILAKFLWRFKPTLLDKSRSWLAHQPTFVS FHQPPLLVDLEIKGSDAFPVRE CYP65AF1 Aspergillus oryzae GenEMBL BAE65144.1, AP007171.1 46% to 65T1 50% to 65S2 14 P450 genes and 2 pseudogenes on this contig MLSFLPLPGPFSSPVGTVGFYLGLGVLL (0) SILYLFAVIIYNVYFHPLAKFPGPKSYAATRIPYFQALLGGQIGQAIKDLHQK YGEVVRIAPNELSFIDGEAWKAIYGTRPGHKQKPKDVRYYPPTAGGVPSIVISNDEDHSRFRRTLSHAFS ETSLRVQEPLVNSYIDLLIQRLHEHCEAGNKPLDMVAWYNFTTFDIIGDLAFGEPFNCLQNSAYHKWVSM IFSNIRYGTYGNLARRFPGSKFLLRLITPTRIANGRNWHIELTKEKVKDRLAKSNDRMDFYGHILKQKDT ERAMTFDEMVTNGSTLIVAGSETTATLLSAVTFYLLKNERVLSKLQQEIRASFESEKDITVTGCNQLEYL NAVLTEGLRIFPPTPTGLPRIVDADGDMIAGKWVPGGTIVSIPHLAAFHSASNFTEPESFIPERFLGDPR FANDSKTVLQPFSFGPRNCIGRNLANAEMRLILARVLYNFDLELDERSENWSRQETYILWNKPGLYVRLR PRVGI CYP65AF1 Aspergillus flavus 98% to CYP65AF1 Aspergillus oryzae see fungal pages for seq CYP65AG1 Aspergillus oryzae GenEMBL BAE60004.1 45% to CYP65AF1, 39% to 65U1, 47% to 655Z1 MSRRFPIQVAQSEIPPIAWGLWRMVYRLWLHPLSGYPGPRLAAVSNLPYFAWTCTGNLHLRLQELHKVYG DVIRIRPNALTYRTPEAWTDIYGHRKPGTLPFSKDPEFFMPAQAGSSHMINANEKDHTRQKRLLNHAFSE RSLRQQEHLIMGYIDLFIQRLRGQARMGAETVNMEEWLNFLTFDIIGDLAFGEPFGCLQNSEYHPWVATI FKSIKTGAILRALNIYPILLGFIRRFLPKSLVQKRIAHYQMSKDRVTRRLQTETSRPDFISYILKYNDDR GMSTPEIEMNAALLIQAGSETTATVLAACLYFLQKNAACHRRLVQDIRSAFTQETDINFLSAAQLPYMNG VIEESLRLFPPAPGIGPRVVPKGGARICGRYVPGGVSVSVGHYSTFRSARNFTRPNEFLPQRWLDRDAES EFASDQTMALQPFSYGPRACIGRNLAYAEMRTILAKILWHFDVQLDERSADWANSKSYIVWEKGPLWLKL HPRNVPQETD CYP65AG2 Aspergillus niger JGI gene model e_gw1.3.1457.1|Aspni1 57% to CYP65AG1 see fungal pages for seq CYP65AG3 Aspergillus terreus 64% to CYP65AG1 see fungal pages for seq CYP65AH1 Gibberella moniliformis (also Fusarium verticillioides) AF155773.4 fumonisin biosynthetic gene cluster, FUM2 47% to 65L1, 45% to CYP65R1 MYQITAICDARWWLCLAFTCMAMLALFSFGVVMYNIFFHRASKF PGPTIAGATSFRYHWAMSTGNVAPWLREQHARYGEVVRIAPDMISYVSPDAWKDIYAY KPGEKEQNGIDWTIPSRDDDVPSMFSEPNDAEHNRVRRLFLPAFSDRALKQQEPLLSK YSDQLVHLIRRGIDDNRDQEFDAVKLYNFTTFDIMGDLTFGEPLGLLKNSSYSEWVQN LFRDIKTAGIFLFIFDFPPLPWLVKKFSPPSIQRAHEIHKQHTVDRVNRRLEKGLDRP DIWNLVLSQPEGRGLTHPQMHANADIFMIAGTETTATLLSGLTYLLLKNPEKLQRLVE EIRGSFGSIEELTVENLARLPYLSACLSEGLRCYPPVPIGPSRVTPKTGGQVLGERVP GRVRLTIPQCAAYYSDLNFKDPYSFIPERWLPGTGYESDRKGILQPFLIGPRNCIGQN LAYHEMRLILCKLLWHYDIELCPDKGNWLLDQKMYIFWEKRPLMVKFTKARK CYP65AJ1 Alternaria solani Alt1 BAD83680 48% to 65J1 Mg PKS for alternapyrone is next to three P450s called alt1, 2, 3 Chem Biol. 2005 Dec;12(12):1301-9. 1 MPAASELERR TKRSKSRNKI SKPIANMLSV IAFSICISPI VYFLIRSIYY LVFHPLSDYP 61 GPKLWAISRI PWNYVNLQGD LAWRIRDMHL HYNSSVIRIA PDELSYTSST ALKKIDGTPP 121 PREFLKCLDG RGIAPAVVNR RRSIVTETPE RHTILRRALQ PAFSERALRD QEDFFRDHTD 181 RLIAQLRKPQ YGVTEQNILR WFALLSFDIM SDLAFGQPAG CLDLVDEPWL GVIGSRVKSI 241 VWYQFAVYYR IEWILKWIMP KAAMEARKRH QALTLQKVQR RIEEERSGKR EGKKRDFMSY 301 ILGNDKENLS NMDLFGMASA FIVAGSNTTT YTMTAFTFFV CRDSEVYAKV IAEVRDKFAS 361 DTDITMVAAG DLPYLKACIE ETMRLSPPTP SALPRWVLEG GEEIDGKWVP GGVTVGVHNL 421 AACHVPWNWH RPLEFIPERW LQTKEGEFTH DDRALHARFL MVDIMSLTCF LSSMAMNEMR 481 LALAKLFWNF DISLSRNSGN WWITQKSYLV WEKKPLMVTI KPRH CYP65AK1 Aspergillus niger JGI gene model fgenesh1_pg.C_scaffold_9000016|Aspni1 42% to CYP65R1, 34% to fgenesh1_pg.C_scaffold_5000711 see fungal pages for seq CYP65AL1 Aspergillus niger JGI gene model fgenesh1_pg.C_scaffold_5000711|Aspni1 42% to CYP65Z2, 43% to CYP65Z3 see fungal pages for seq CYP65AM1 Aspergillus niger JGI gene model estExt_Genewise1.C_31719|Aspni1 45% to CYP65AF1, top 28 hits are CYP65 or CYP60 see fungal pages for seq CYP65AM2 Mycosphaerella graminicola 65% to CYP65AM1 A. niger see fungal pages for seq CYP65AM3 Aspergillus clavatus 92% t0 65AM1 see fungal pages for seq CYP65AM4 Mycosphaerella fijiensis 62% to CYP65AM2 Mycosphaerella graminicola JGI gene model estExt_fgenesh1_pm.C_10205 see fungal pages for seq CYP65AN1 Aspergillus niger JGI gene model gw1.16.109.1|Aspni1 36% to CYP65Z1 top 26 hits are to CYP65 or CYP60 see fungal pages for seq CYP65AP1 Aspergillus niger JGI gene model gw1.10.1058.1|Aspni1 40% to CYP65AF1, 60% to fgenesh1_pg.C_scaffold_1001144 Top 27 hits are CYP65 or CYP60 (CYP5118 also in this set, may need name change = 41% to CYP65Z2) see fungal pages for seq CYP65AP2 Aspergillus niger JGI gene model fgenesh1_pg.C_scaffold_1001144|Aspni1 40% to CYP65Z2, 60% to gw1.10.1058.1 see fungal pages for seq CYP65AP3 Histoplasma capsulatum G217B ABBT01000035.1 Ajellomyces capsulatus G217B 68% TO CYP65AP1 see fungal pages for seq CYP65AP4P Aspergillus terreus some parts are 76% to CYP65AP1 Aspergillus niger see fungal pages for seq CYP65AQ1 Aspergillus niger JGI gene model gw1.10.1062.1|Aspni1 37% to CYP65Z2 top 36 hits are CYP65 see fungal pages for seq CYP65AR1 Aspergillus niger JGI gene model gw1.11.201.1|Aspni1 36% to CYP65AH1, top 21 hits are CYP65 see fungal pages for seq CYP65AS1 Mycosphaerella graminicola 39% to CYP65E1, all top hits except CYP5117A1 are CYP65 sequences. This may mean that CYP5117 should be renamed. N-term from JGI model estExt_Genewise1Plus.C_chr_11525|Mycgr3 see fungal pages for seq CYP65AT1 Mycosphaerella graminicola JGI gene model e_gw1.3.1373.1|Mycgr3 40% to 65L1, 41% TO 65Z2 see fungal pages for seq CYP65AU1 Coccidioides immitis 47% to CYP65X1 see fungal pages for seq CYP65AV1 Histoplasma capsulatum 39% to CYP65AZ2, 40% TO CYP65AF1 see fungal pages for seq CYP65AW1 Fusarium oxysporum 44% to CYP65A1 Fusarium sporotrichioides see fungal pages for seq CYP65AW1 Fusarium verticillioides 90% to CYP65AW1 Fusarium oxysporum = ortholog see fungal pages for seq CYP65AX1 Fusarium oxysporum 42% to CYP65B1 see fungal pages for seq CYP65AX1 Fusarium verticillioides 94% to CYP65AX1 Fusarium oxysporum see fungal pages for seq CYP65AY1 Aspergillus flavus 40% to CYP65S4 Fusarium verticillioides Note: there is no ortholog of this gene in A. oryzae see fungal pages for seq CYP65AY2 Penicillium chrysogenum Wisconsin 54-1255 AM920421.1 61% to CYP65AY1 MQLLEWVVALTGLVWHTQGSFTPSLEQSTNLVQ 364847 WIISHLCLAIYNVFFHPLRNYPGPKLDAATGLVYVYHMVRGNSCKYLAGLHEKYGEVVRA 365026 365027 GPNEISYMTVSANKTIFGNKTTEKMSFEKNPAVYIQGEGTAQNILFASTGEHPRFKKLMS 365206 365207 PAFSEQAIKEQEPTIQQYTSIMIDALRNNRSGEACYPDANGIANIGAWCNFLIFDILSCL 365386 365387 SFGKPIGCLQMADYHEWANVIFGAMKHSHFLQCAHRLKPYHRLLEKLIPGDISGPYETHM 365566 365567 QNVRKNLRERETMENLARADFASFIPKGMSEDELVDNVNILATAGSETTATTLSSLFY 365740 365741 YLTHNPDSYQKLVDEVRSTFSSEEEITFNAVASLKYLKAVIQETFRIHPSVPVGLHRITP 365920 365921 KAGSYIDGKWVPG 365959 365982 QTWVSVALLAAYRSPRYWKQPEDFLPERWLEDPEFASDNRQIWAPFSIGPRKCIGIN (2) 366182 LAYLNMRLIVARLLWNFDFDPQPDNVDPHELKEYGVWQG 366356 QVPLNLKIRDARA* 366398 CYP65AZ1 Aspergillus oryzae GenEMBL BAE63025.1 39% to 65S2, 41% to 65Z2 formerly CYP5118A1 see fungal pages for seq CYP65BA1 Aspergillus terreus 43% to CYP65AF1 see fungal pages for seq CYP65BB1 Mycosphaerella fijiensis JGI gene model e_gw1.18.265.1 revised at N-term 46% to CYP65AF1 Aspergillus oryzae 45% to CYP563A2 Botrytis cinerea 44% to CYP65S2 Nectria haematococca 43% to CYP65Z2 Neosartorya fischeri see fungal pages for seq CYP65-un1 Magnaporthe grisea MG09636.4 poor match 39% to MG02982.4 AACU01001669 cont2.1850 PSEUDOGENE see fungal pages for seq CYP65-un2 Aspergillus flavus 45% to CYP65L1 see fungal pages for seq CYP65-un2 Aspergillus oryzae 45% to CYP65L1 100% to CYP65-un2 Aspergillus flavus Supercontig 26: 405493-406148 (-) strand see fungal pages for seq CYP65-un3 Aspergillus terreus 36% to CYP65AQ1 one frameshift, possible CYP65 pseudogene 30/31 top hits are CYP65 see fungal pages for seq CYP65-un4 Mycosphaerella fijiensis JGI gene model e_gw1.3.1109.1 46% to CYP65-un2 Aspergillus flavus see fungal pages for seq CYP65-un5 Mycosphaerella fijiensis JGI gene model e_gw1.8.672.1 45% to CYP65AS1 Mycosphaerella graminicola see fungal pages for seq 66A Subfamily CYP66A1 Agaricus bisporus (a mushroom) GenEMBL Z82021 (932bp) AW444023.1 adds C-term De Groot, P.W.J., Schaap, P.J., Van Griensven, L.J.L. and Visser, J. Isolation of develpmentally regulated genes from the edible mushroom Agaricus bisporus. unpublished Note: CYP66 is most similar to CYP64 WSYINSIVANSLWNSVKERMEAGTAKPCIATAMLEDLLDDDSAE SKEEETVRRGACANGFLGGADTTVSLVTSFFMAMALYPDVQKKAQAELDQVLGGRLPE FSDRPSLPYVNALLKESERWQPVFPLAIAHMSSNADEYDGYYIPKGTYVIGNAWSILH DPEFYKDPLVFNPDRFLKDGEIDPSVRDPNVASFGFGRRICPGRFFPDASLYSTVTHV LTVFDIKPNLDENGKEIGIKPDMTDGLLSIPM CYP66B1 Agaricus bisporus (edible mushroom) GenEMBL ABI271707 50% to CYP66A1 partial mRNA QLKSVRTFIRNVMESPDEFSEWIHFYTSSSIMEIIYGMKAKPED PYVDNAKKAIEGFNEAAVPGKFLVETFPVMKHIPSWFPGAGWKRQALFWRDVNREVRV RPFNLVKDQVNEGTATRSVCRTLIGNLPDSTAPDRIVKENIAIDTCAVSFIGAAETSH SAARVFFMAMLMNPEVQKKGQAELDKVLNGRLPEPNDGPNLPYISAMVKETLRWQLVL PLAVPHVAIEADEYNGYYIPKGTIVFGNSWTFMHDPEVYKDPESYMPERFLKDGKLDS SIRDPSTAVFGYGRRICPGRYFALNALYLMIAHTLAVFDIKPALDENDNEKEFKADVT GGMISQPVPFQCMIVPRSKAAADLIQNSDLME 67A Subfamily CYP67A1 Uromyces fabae (fungus) GenEMBL U81793(1849bp) Hahn,M. and Mendgen,K. Characterization of in planta induced rust genes isolated from a haustorium-specific cDNA library. unpublished (1997) CYP67A2 Puccinia graminis f. sp. tritici 76% to CYP67A1 Uromyces fabae see fungal pages for seq CYP68A1 Gibberella fujikuroi (a rice pathogenic fungus producing plant hormones(gibberellins)) GenEMBL Y15277 Malonek S, Bomke C, Bornberg-Bauer E, Rojas MC, Hedden P, Hopkins P, Tudzynski B. (2005) Distribution of gibberellin biosynthetic genes and gibberellin production in the Gibberella fujikuroi species complex. Phytochemistry. 2005 Jun;66(11):1296-1311. Malonek S, Rojas MC, Hedden P, Gaskin P, Hopkins P, Tudzynski B. (2005) Functional characterization of two cytochrome P450 monooxygenase genes, P450-1 and P450-4, of the gibberellic acid gene cluster in Fusarium proliferatum (Gibberella fujikuroi MP-D). Appl Environ Microbiol. 71, 1462-1472. called P450I or P450-1 GA14-synthase CYP68A1 Gibberella proliferatum (teleomorph sexual form) Fusarium proliferatum (anamorph asexual form) GenEMBl AJ628021 Malonek S, Bomke C, Bornberg-Bauer E, Rojas MC, Hedden P, Hopkins P, Tudzynski B. (2005) Distribution of gibberellin biosynthetic genes and gibberellin production in the Gibberella fujikuroi species complex. Phytochemistry. 2005 Jun;66(11):1296-1311. Malonek S, Rojas MC, Hedden P, Gaskin P, Hopkins P, Tudzynski B. (2005) Functional characterization of two cytochrome P450 monooxygenase genes, P450-1 and P450-4, of the gibberellic acid gene cluster in Fusarium proliferatum (Gibberella fujikuroi MP-D). Appl Environ Microbiol. 71, 1462-1472. Ortholog to Gibberella fujikuroi 68A1 GA14-synthase MTNHSSSYYYEFYKDHSHTFRRSMSENTLISSCLALATCAILLS IQWLKPQPLIMVNGRKFGELSNVRAKRDFTFGARQLLEKGFKMSPDKPFRIMGDVGEL HILPPKYAYEVRNNEKLSFTMAAFKWFYAHLPGFEGFREGTNESHIMKLVARHQLTHQ LTLVTGAVSEECALVLKDVYTDSPEWHDITAKDANMKFMARITFRVFLGKEMCRNPQW LRITSTYAVIAFRAVEELRLWPSWLRPVVQWFMPHCTQSRALVQEARDLINPLLERRR EEKAEAERTGEKVTYNDAVEWLDDLAREKGVGYDPACAQLSLSVAALHSTTDFFTQVM FDIAQNPELIEPLREEIISVLGKQGWSKNSLYNLKLMDSVLKESQRLKPIAIASMRRF TTHNVELSDGVILPKNKLTLVSAHQHWDPEYYKDPLKFDGYRFFNMRREPGKESKAQL VSATPDHMGFGYGLHACPGRFFASEEIKIALSHILLKYDFKPVEGSSMEPRKYGLNMN ANPTAKLSVRRRKEEIAI CYP68B1 Gibberella fujikuroi (a rice pathogenic fungus producing plant hormones(gibberellins)) GenEMBL Y15278 Malonek S, Bomke C, Bornberg-Bauer E, Rojas MC, Hedden P, Hopkins P, Tudzynski B. (2005) Distribution of gibberellin biosynthetic genes and gibberellin production in the Gibberella fujikuroi species complex. Phytochemistry. 2005 Jun;66(11):1296-1311. called P450II 38% identical to CYP68A1 CYP68C1 Fusarium sporotrichioides (fungi) GenEMBL AY040587 (not released yet) Andrew Peplow Isaac Meek Clone name TRI1 Submitted to nomenclature committee July 25, 2001 Low 40% range with 68A1 and 68B1, 44% to 68D1 CYP68C2 Fusarium graminearum FG00071.1 AACM01000004 FGcontig1.4_scaffold1 See fungal pages for sequence CYP68D1 Neurospora crassa GenEMBL AL355928.2 Neurospora crassa sequence contig 1.317 (supercontig 36) lovA sequence below not annotated in genbank entry 46% to 68C1 MANPSATPSSIPSWMERLDIKSITDPSATPFSYLVTAFLLAVVVYSLQGPRFPKNIKHLN PKGPLEFSDTRPKKEFVYGSRQMLANWFKANPNKPCRVISDFGEAIVLPPRMANEIKNDD RLSFTRWTYK (phase 0 intron) AFHGHLPGFEGFGEASRESHIVQEVIMRDLTKYLN (phase 1 intron) KVTEPLAQETSMAMEANLPKAANGEWSTINLRSKILPIVARISSRVFLGEELCRNEEWLK VTQQYTIDGFGAAEDLRLWPAALRPIVHWFLPSCQRARADVRVARSILDPVLKKRRQEKA ANGGKAEHDDAIEWFERTAKGKYYDPAVAQLVLSLVAIHTTSDLTCQVMTNLMQNPEFIA PLREEMIQVLSEGGWKKTSLYNMKLLDSVIKESQRVKPTGVASMRRYAEKDVTLSDGTFI PKGGFVAVSAHDMWNSEVYEQAEKWDGRRFLRMRETPGAGKENVAQLVSTAPEHLGFGHG QHACPGRFFAANEIKIALVHLLLNYEWRLPEGSDPKIRTFGFSMGVDPSLKVEYKGRQPEIEL* CYP68D1 Neurospora discreta JGI gene model estExt_fgenesh2_pg.C_110241 92% to CYP68D1 N. crassa See fungal pages for gene model CYP68D2 Nectria haematococca (Fusarium solani group) JGI gene model e_gw1.16.35.1 Necha1/scaffold_16:246007-247770 55% to 68D1, 48% to 68C2 See fungal pages for gene model CYP68D3 Aspergillus niger JGI gene model e_gw1.6.1239.1|Aspni1 60% to CYP68D1 See fungal pages for gene model CYP68E1 Magnaporthe grisea MG10527.4 with C-term extension = accidental fusion AACU01001764 cont2.2024 See fungal pages for gene model CYP68F1 Magnaporthe grisea MG00300.4 36% to 68A1 35% to 68D1 AACU01001402 cont2.70 See fungal pages for gene model CYP68frag Magnaporthe grisea MG10029.4 AACU01001319 cont2.1919 62% to CYP68A1 C-TERM fragment RUNS OFF END VACTPTHFGFGHGVHACPGRFFAANEVKIALAHILLKYDWKLA EGSEKLEPVERGLNYVVNPNIKFLIRRRKEELDLESLEF* CYP68G1 Magnaporthe grisea MG03834.4 40% to 68B1 AACU01001031 cont2.748 See fungal pages for gene model CYP68H1 Magnaporthe grisea MG02294.4 41% to 68B1 39% to 68D1 AACU01000444 cont2.449 See fungal pages for gene model CYP68H2 Magnaporthe grisea MG11075.4 38% to 68B1 AACU01001898 cont2.2223 See fungal pages for gene model CYP68H3 Magnaporthe grisea MG00024.4 41% to CYP68B1 41% to 68D1 AACU01001352 cont2.7 See fungal pages for gene model CYP68J1 Fusarium graminearum FG02672.1 AACM01000132 FGcontig1.132_scaffold1 See fungal pages for gene model CYP68J2 Fusarium graminearum FG04717.1 AACM01000196 FGcontig1.196_scaffold3 See fungal pages for gene model CYP68J3 Fusarium oxysporum 68% to CYP68J1 See fungal pages for sequence CYP68J3 Fusarium verticillioides 95% to CYP68J3 Fusarium oxysporum = ortholog, 60% to CYP68J4 See fungal pages for sequence CYP68J4 Fusarium oxysporum 70% to CYP68J2 Fusarium graminearum See fungal pages for sequence CYP68J4 Fusarium verticillioides 83% to CYP68J4 Fusarium oxysporum probable ortholog See fungal pages for sequence CYP68K1 Fusarium graminearum FG11002.1 AACM01000457 FGcontig1.457_scaffold8 See fungal pages for gene model CYP68L1 Aspergillus nidulans AN8530.1 45% to 68A1 54 clan See fungal pages for gene model CYP68L2 Aspergillus oryzae GenEMBL BAE66422.1 59% to 68L1, 47% TO CYP68Q1 MLGDLFESVLTNNSVTLLITVAVAAIALHLSSPKSNLPLVNDKKPWEFRFTKARKRFLANAHNLIKAGLA KAPAFRIVTGNGKRVVLDAKYANELRSHDDLSFGLHIADNFHAHIRGFQPFKQGSNDDEIYQNAVRMKLT QSLGNLTQPLVDETLVALQTYWTNDTNWHAIPLKSNIQKVVAQLSSRIFLGDQICRNPNWLRIAVDYTVD AFKAAEELRLWPKAFRAIVALFLPSCRKIRAELQEAQDIIRPVLDARRKDKQAALSAGKEPERYNDAMQW LEECAKGRSYEPAFGQLTFSVAAIHTTSDMLTQVLYDLCGRDALIQALREEVITVVQEEGWTKPTLYKLK LMDSVLKESQRLKPISVVSMQRVATADLRLSDGTFIPKGTSLAVSSDRMWDSEIYPNPLEFDGYRFLKLR ELPGHETSAQVVSPSPEHMGFGFGRHACSGRFFAINEVKIALCHILLKYEFKLADGSVPRATKFGFSFTS DPTTKLMIRRRQEETVL CYP68L2 Aspergillus flavus 97% to CYP68L2 Aspergillus oryzae See fungal pages for sequence CYP68L3 Histoplasma capsulatum G217B ABBT01000111.1 Ajellomyces capsulatus G217B 382612-384420 (-) strand 57% to CYP68L2 See fungal pages for sequence CYP68L4P Aspergillus clavatus AAKD03000015.1 117585-116726 (-) strand mid region is 63% to 68L2 See fungal pages for sequence CYP68L5P Aspergillus fumigatus Af293 XP_752004.1 also EAL89966.1 Internal fragment, needs revision Pseudogene fragment mid region aa 183-260 62% to CYP68L3 Histoplasma capsulatum 74% to CYP65L5P Neosartorya fischeri formerly CYP5088A1 KICRNPDWLRITVGYTVDTF & LPAENLQLWPRLIRPVVAKF & LPPCRKICWELKEATEIITPVLEERQ CYP68L5P Neosartorya fischeri 62% to CYP68L3 Supercontig 569: 1883929-1885384 (+) strand MAASRIATCMKDKMLTDISTQLFRLDP ISLAGVLAISGLFFLLTLPSKA & SIPMVNVKKLFQIRSSHIQKRYLDACKLSKTGSRK () APSFRVHSDNGIKTMLAPEYANEFHAHPALRFGRAITLEFHSD & FLGD KICRNPDWLLITVDYTVDTFFAAEDLRLWPRLIRPIVANFLPSCRKIRRELKEA PVLEERR DLEAARKANQPQKRYHDAIQLMEECAKGQHYPPAEAQMAFSIVAIHTTSDMLT QVLLDLCANPEVIQPLREEIVAVNQEE*WKKTALYKLKLLDS VLKESQRMKPVNI ASMRRLALEDIKLSDGTLIPKGSSILVSSRKMWDPDAYPNPELFNPYRFLRLRETPGHET SAQLVWPSPDLGFGYGKHACPGRFFTANEVKIALCHIILKYNFNLPDGYTPTVRKRGIS LSADPSAKLVIRRRQEEIAL* CYP68L6 Aspergillus terreus 63% to CYP68L3 See fungal pages for sequence CYP68L7 Aspergillus terreus 54% to CYP68L6 See fungal pages for sequence CYP68M1X Aspergillus nidulans Name revised to CYP5073A1 CYP68N1 Nectria haematococca (Fusarium solani group) JGI gene model e_gw1.41.77.1 Necha1/scaffold_41:90805-92550 48% to CYP68A1 Fusarium graminearum, 52% to CYP58D1 N. crassa See fungal pages for sequence CYP68N2P Nectria haematococca (Fusarium solani group) JGI gene model fgenesh1_pg.scaffold_36000012 Necha1/scaffold_36:51153-51671 48% to 68J2 C-term only, 64% to 68N1, pseudogene See fungal pages for sequence CYP68P1 Aspergillus fumigatus Af293 GenEMBL XP_756138.1 also EAL94100.1 41% to 68N1, 39% to 68A1 MDFLQRILGILSSEDVAPALFASSISVLFLILSYQVLYPPSKYSAFPTINGTRWLALSNRRILAEFVTDA QGLMRNGLEKYDIFRIISSIGPMTILHPKYTDEIHNDRQLNFMAVLAKEMFPNYPGFDLFREGTDGSTVL QDAVKFGSSRCLGKSTQLLSDETSTLLQKLWGDEPEWHEVTAKSSVHDIIAHLSALLFYGPELCSHKEWL EVTDEYASVGFLAARQLRLWPPILRPIAQWFLPACRRLRYLASRTRGLIEPVIAARQKEKAICYSHGRQP PVYDDAIEWTERAAKGRPYDAAMSPLLFSINALHTTTDLLTQVILDLSTQPDLIVALRQEILSVKPQQNG WKNASLNQLLLMDSAIKESQRLKPTESILMRRYAMDDLTLADGNKIPKGTVLGIPIFGMRDPKIYVDPDM YDGYRFQKMRDKPGFENKCQLVSTSPWHLGFGHGIHACPGRFLAAVQVKIILCYIVAKYDFKLAGGAPPK VQSVGIELISDTEARLAVRRRQEMVIGLE CYP68Q1 Aspergillus oryzae GenEMBL BAE64916.1, AP007171.1 47% TO 68L2, 45% to 68L1, 48% to 68D1 14 P450 genes and 2 pseudogenes on this contig MDHFNLAGPESNTSITSLEWLGIKNSFTGSHWAHITGLSELHPTGFLCLIATLIIGIVHLTRGPKPTVLP VVNPPGTFELTANRVKKEWLVDARQIIRRGFEKFPGKPFNMIAADVGLTTVLPPEYASEIRNNPSLSFVA FMAHLFFSELPGFEPTREGMFDNDIGITVVHKYLTVNLARITEPLSREATAALKDIFTDNSEWHDANLKA INLALVARLSSRIFLGEELCRNEEWLKITVNYTVDVMKAAERLRRVPGPLRRIVHWFLPEAQKCRDEVKR AGKVIRPVLEKRRREKATMESEGKEALQYNDAIEWFEQMAKSQGTSYDPEVVQLFLSTVAIHTTSDLLTV VMADLARNPEIIEPLREEISSVLRDGGWKKTSLTDMKLLDSVLKESLRLKPIAVVSMRRVAMDHLKLSDG TFLPKGTKMAVSSHRMWDPDVYENPEQWDGFRYVNLRETPGQDKHAQFVSTSERHLGFGHGKHACPGRFF ASSELKVALCHILMKYDFELAPGTVVQHRYSGASYYADPAIRVMLRRRNVALPSWFER CYP68Q2P Aspergillus clavatus ~73% to 68Q1 Aspergillus oryzae pseudogene adjacent to CYP5117A2P on opposite strand See fungal pages for sequence CYP68R1 Aspergillus terreus AF141924.1, XM_001209275.1 lovastatin biosynthesis gene cluster gene="lovA" function="involved in lovastatin production" 39% to 68D1 N. crassa 73% to CYP68R2 gene mlcC Penicillium citrinum ML-236B (compactin) biosynthetic gene cluster MTVDALTQPHHLLSLAWNDTQQHGSWFAPLVTTSAGLLCLLLYL CSSGRRSDLPVFNPKTWWELTTMRAKRDFDANAPSWIESWFSQNDKPIRFIVDSGYCT ILPSSMADEFRKMKELCMYKFLGTDFHSHLPGFDGFKEVTRDAHLITKVVMNQFQTQA PKYVKPLANEASGIITDIFGDSNEWHTVPVYNQCLDLVTRTVTFIMVGSKLAHNEEWL DIAKHHAVTMAIQARQLRLWPVILRPLVHWLEPQGAKLRAQVRRARQLLDPIIQERRA ERDACRAKGIEPPRYVDSIQWFEDTAKGKWYDAAGAQLAMDFAGIYGTSDLLIGGLVD IVRHPHLLEPLRDEIRTVIGQGGWTPASLYKLKLLDSCLKESQRVKPVECATMRSYAL QDVTFSNGTFIPKGELVAVAADRMSNPEVWPEPAKYDPYRYMRLREDPAKAFSAQLEN TNGDHIGFGWHPRACPGRFFASKEIKMMLAYLLIRYDWKVVPDEPLQYYRHSFSVRIH PTTKLMMRRRDEDIRLPGSL CYP68R2 Penicillium citrinum BAC20565.1, AB072893 gene mlcC compactin biosynthetic gene cluster join(AB072893.1 :15823..16167, AB072893.1:16234..16405,AB072893.1:16469..17508)" note="putative ML-236B biosynthetic gene" 73% to lovA CYP68R1 note: compactin is 6 desmethyl lovastatin 1 MLGQVLLTVE SYQWVSTPQA LVAVAVLLSL IAYRLRGRQS ELQVYNPKKW WELTTMRARQ 61 DFDTYGPSWI EAWFSKNDKP LRFIVDSGYC TILPSSMADE FRKIKDMCMY KFLADDFHSH 121 LPGFDGFKEI CQDAHLVNKV VLNQLQTQAP KYTKPLATLA DATIAKLFGK SEEWQTAPVY 181 SNGLDLVTRT VTLIMVGDKI CHNEEWLDIA KNHAVSVAVQ ARQLRVWPML LRPLAHWFQP 241 QGRKLRDQVR RARKIIDPEI QRRRAEKAAC VAKGVQPPQY VDTMQWFEDT ADGRWYDVAG 301 AQLAMDFAGI YASTDLFVGA LVDIARHPDL IQPLRQEIRT VIGEGGWTPA SLFKLKLLDS 361 CMKETQRIKP VECATMRSTA LRDITLSNGL FIPKGELAAV AADRMNNPDV WENPENYDPY 421 RFMRMREDPD KAFTAQLENT NGDHIGFGWN PRACPGRFFA SKEIKILLAH ILIQYDVKPV 481 PGDDDKYYRH AFSVRMHPTT KLMVRRRNED IPLPHDRC CYP68R3 Monascus pilosus DQ176595.1, ABA02241.1 monacolin K (lovastatin) biosynthetic gene cluster possible GC boundary at exon 2/3 DNKggc 86% to lovA CYP68R1 41447 MTVPTDTVSRRLQSLAWSDIKQHAPWLPSSRTLVSGFLCLILLQILYSRGRKSDLRVYNP 41626 41627 KKWWELTTMRAKREFDANAPAWIEAWFSKNDQPLRFIVDSGYCTILPSSMADEFRKMKEL 41806 41807 CMYKFLGT (0) 41830 41908 DFHSHLPGFDGFKEVTRDAHLITKVVMNQFQTQAAKYTKPLADEASATIADIFGDNK (1) 42078 42144 EWHTAPVYNECLDLVTRTVTFIMVGDKLAHNEEWLDIAKHHAVTMAIQARQLRLWPVILR 42323 42324 PIVHWLEPQGAKLRAQVRRARQLLEPIIQERRAEKAKCLAQGIEPPRYVDSIQWFEDTAK 42503 42504 GQWYDAAGAQLAMDFAGIYGTSDLMIGGLVDIVRHPHLIEPLRNEIRTVIGEEGWTPASL 42683 42684 YKLKLLDSCLKESQRVKPVECATMRSYALQNVTFSNGTFVPKGELVAVAADRMSNPEVWP 42863 42864 EPKKYDPYRYMRLREDPDKAFSAQLENTNGNHIGFGWHPRACPGRFFASKEIKIMLAFLL 43043 43044 IRYDWKLVPNEPLQYYRHSFSVRIHPATKLMMRRRDEDL* 43163 CYP68S1 Aspergillus niger JGI gene model e_gw1.4.292.1|Aspni1 51% to CYP68L2 See fungal pages for sequence CYP68T1 Aspergillus niger e_gw1.3.1460.1|Aspni1 51% to CYP68L2, 42% to e_gw1.4.292.1 See fungal pages for sequence CYP68U1 Mycosphaerella graminicola 47% to CYP68N1 Nectria haematococca, 48% to CYP68D3 A. Niger N-term from JGI model fgenesh1_pm.C_chr_10000062|Mycgr3 See fungal pages for sequence CYP68V1 Histoplasma capsulatum G217B 50% to CYP68N1 See fungal pages for sequence CYP68W1 Aspergillus clavatus 42% to 68D1 See fungal pages for sequence CYP68X1 Neosartorya fischeri 47% to CYP68V1 Histoplasma capsulatum Note: this seq does not have an ortholog in A. fumigatus See fungal pages for sequence CYP69A1 Gibberella fujikuroi (a rice pathogenic fungus producing plant hormones(gibberellins)) GenEMBL Y15279 Malonek S, Bomke C, Bornberg-Bauer E, Rojas MC, Hedden P, Hopkins P, Tudzynski B. (2005) Distribution of gibberellin biosynthetic genes and gibberellin production in the Gibberella fujikuroi species complex. Phytochemistry. 2005 Jun;66(11):1296-1311. called P450III a distant match to CYP54 CYP97E1 Skeletonema costatum (diatom) GenEMBL AF459441 Yang,S., Wu,R.S.S., Mok,H.O.L., Zhang,Z.P. and Kong,R.Y.C. Identification of a novel cytochrome P450 cDNA, CYP97E1, from the marine diatom Skeletonema costatum (Bacillariophyceae) J. Phycol. 39 (3), 555-560 (2003) 52% to 97B3, 51% to 97B2, 50% to 97B1 submitted to nomenclature committee Jan 28, 2001 this is the first full length sequence from a diatom See stramenopile/diatom pages for sequence and a tree CYP97E2 Thalassiosira pseudonana (diatom) No accession From JGI genome project See stramenopile pages for sequence and a tree CYP97E3 Ectocarpus siliculosus (brown algae) Genoscope Ectocarpus siliculosus brown algae project EST LQ0AAB42YI07FM1.SCF (C-term) 59% to CYP97E1 and CYP97E2 876 amino acids is very long for a P450 Ectocarpus sctg_63 344994-350246 CYP97E4 Phaeodactylum tricornutum (diatom) estExt_Genewise1.C_chr_50056|Phatr2 (at JGI) 69% to 97E2 and 97E1 this gene model was automatically generated at JGI I have not checked it, it may have some errors MRSSDYSRAPRWESFVVLAWATASAVLVGNVFQQALPQPVQAFGPALVPHLSRRCDLIQRRVSSDLEDVDVDVDNDAILG SNNIPRRLRIQGRGIPTRKDPIQPLDSMTYESDLIKTWEQDPSRQKGFDWEIEKLRRYFAGLRMRDDGVWVRQPSFFDFL VSKSRSDPGNAPRPVGLVDVVKLVLTNSLTSVGLGPALGMAAVPNAVIQKYEGSFFSFIKGVLGGDLQTLAGGPLFLLLN KYFEVYGPIFNLSFGPKSFLVVSDPVMARHVLRETSPDQYCKGMLAEILDPIMGKGLIPADPATWKVRRRAIVPSFHKRW LNRMITLFAERAEILADDLQPKSAKGQVVDMEERFCSVTLDIIGKAVFNYDFGSVTDESPIIKAVYRVLREAEHRSSSFI PYWNLPYADQWMGGQVEFRKDMTMLDDILADLINKAVSTRREASIEELEKRENEDDPSLLRFLVGMRGEDLSSMVLRDDL MTMLIAGHETTAAMLTWTLFELSRGDPGLLKEVQAEVRTVLKGKERPDYDDIVAMKKLRYSLIEALRLYPEPPLLIRRAR TEDNLPAGSSDLKSGVKVLRGTDMFISTWNLHRSPDLWENPEVFDPTRWDRPFNNAGIPGWSGYNPDKVSGLYPSENAAD FAFLPFGGGQRKCVGDQFAMMEATVTMALMIKKYDFDFAIPAEDVGMKTGATIHTMNGLMMRARQVNENEPVQSAEGYWE MQHLKRGLNANGRPYTTEEEAVWQTSERLSHKKEEEKPNGEGGCPMHKM* CYP97F1 Thalassiosira pseudonana (diatom) No accession From JGI genome project See stramenopile pages for sequence and a tree CYP97F2 Euglena gracilis GenEMBL EC678021.1 EST 60% to 97F1 of Thalassiosira pseudonana (diatom) 51% to CYP97B2 Glycine max, 50% to 97B3 Arabidopsis 52% to Pinus taeda CYP97B9, 50% to 97B4, 49% to 97B3 only 40% to 97A3 and 39% to 97C1 Arab. This is the CYP97B ortholog If euglena only has one CYP97, then CYP97B May be the oldest CYP97 subfamily IASALKNRDPADIADLEARDYDEMDNPSLLRILVDMRGEETTNQQLRDDLITLLIAGHET TASALTWAIFELVQQPELLQRVQREVDEVLGDRLPTMDDIRGLLLCRLCIAESLRMYPEP PLLIRRVLDDLTLPKGATAKFEATLKRATDIFLAIYNIHHDGRFWPNPDFYDPERFLRPY KNPEIPEWKGYDPEGWKGRLYPDEVSSDYAFMPFGAGPRRCLGDVFATLEGTVALAMVLR RFDFTFAAPTAQPD CYP97F3 Phaeodactylum tricornutum (diatom) e_gw1.27.30.1|Phatr2 (at JGI) 75% to CYP97F1 this gene model was automatically generated at JGI I have not checked it, it may have some errors MQVGKSGEEISFGDSANVLRTNIEQIYGGFPSLDGCPLAEGELADIGDGTMFIGLQNYYRNYGSPYKLCFGPKSFLVISD PVQAKHILKDANTNYDKGVLAEILEPIMGKGLIPADPETWSIRRRQIVPAFHKAWLEHIVGLFGYCNQPLIDTLNKRVDG DGKVEMESLFCSVALDIIGLSVFNYEFGSVTQESPVIKAVYSALVEAEHRSMTPAPYWNLPLANQLVPRLRKFNSDLKLL NDVLDDLITRAKQTRTVEDIEELENRNYNEVQDPSLLRFLVDMRGADIDNKQLRDDLMTMLIAGHETTAAVLTWALFELT KNPEIMKELQDEIDEVVGDRMPNYEDIKKMKFLRLVVAETLRMYPEPPLLIRRCRTPDELPQGAGREAKVIRGMDIFMAV YNIHRDERFWPSPDTFDPLRFTRSHSNPDVPGWAGFDPKKWEGKLYPNEVASDFAFLPFGGGARKCVGDEFAILEATVTL AMVLRRFEFSFDESKFEGKDDILSSAQGLNHPVGMRTGATIHTRNGLHLVVEKRGVPK* CYP97F4 Ectocarpus siliculosus (brown algae) Genoscope Ectocarpus siliculosus brown algae project ESTs LQ0AAA4YK08FM1.SCF, LQ0AAB23YL07FM1.SCF LQ0AAB3YI17FM1.SCF, LQ0AAB85YI22FM1.SCF 60% to CYP97F1 diatom, 62% to 97F3 Phaeodactylum Ectocarpus sctg_63 359933-352442 Note: two digit names for lower eukaryotes end here and continue as three digit names at CYP501. Animal P450 families cover CYP1-49 and CYP301-499 are reserved for new animal P450 families. Cyp301a1 Drosophila melanogaster GenEMBL AC017275 comp(35594-38192) also AC007356 possible mitochondrial P450 fits in the mitochondrial clan. Cyp301a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP301A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs2l3 CYP301A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroupUn.423b 66% to 301A1 Drosophila melanogaster. CYP301A2 Drosophila arizonae no accession number from Tina Yee and Phil Danielson submitted to nomenclature committee 6/29/99 84% identical to Cyp301a1 clone name DU36az CYP301A1 Bombyx mori (silkworm) BAAB01030444.1 BP117781 BAAB01134599.1 BAAB01155686.1 BAAB01102862.1 BAAB01093671.1 BAAB01132355.1 BAAB01006099.1 65% to Drosophila 301A1, 64% to Apis 301A1 CYP49 and CYP301 are nearly subfamilies of a single family See silkworm page for sequence CYP301A1 Helicoverpa armigera (cotton bollworm) No accession number Heiko Vogel submitted to nomenclature committee Feb. 6, 2008 Contig_268 51% to CYP301A1, probable ortholog CYP301A1 Tribolium castaneum (red flour beetle) GenEMBL XP_974014 75% to 301A1 bee, 66% to CYP301A1 Drosophila MVATSKFPQAAKIVKNITRSRSQTVTPAFCPHLENLNEAKPYSEIPGPKPIPILGNTWRLLPIIGQYDVS DVAKLSELFYKEYGKIVKLSGLVGRPDLLFIYDANEIEKIYRQEGPTPFRPSMPCLVRYKSVVRKDFFGD VGGVVGVHGEPWRRFRSTVQKPILQVQTVKKYIEPIENVTNYFIQRMMEMQNDDQEMPSDFDNEIHKWAL ECIGRVSLDARLGCLDPNLPPDSEPQKIIEAAKYALRNVAILELKFPFWRYFPTSLWTRYVQNMDYFVEI CMKHIDAAMERLKSKTVLDEKDLSLVERILANEPDPKTAYILALDLILVGIDTISMAVCSILYQLATRPE QQEKMYQELKRVMPDPNTPLNTKLLDQMDFLKAFVKEVLRVYSTVIGNGRTLQQDTIIHGYRIPKGVQVV FPTLITGSMEEFVSQSNEFIPERWMKKSSSDYKIHPFASLPYGYGARMCLGRRFADLEIQVLLAKLIRSY KLEYHHEPLEYKVTFMYAPDGPLRFKVIKREE CYP301A1 Aedes aegypti (yellow fever mosquito) CYP301A1 Culex pipiens CYP301A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph301-2, 73% to 301A1 Tribolium Pediculus genome site CYP301A1 Nasonia vitripennis (jewel wasp) See wasp page CYP301B1 Tribolium castaneum (red flour beetle) GenEMBL XP_974040 43% to 49A1 Drosophila, mito clan 60% to 301B1 bee, 48% to 301A1 bee MKLLRAAKRVISSSSQQQLGVISAIEENHCKEWADAVPYEEVPGPKPLPFLGNTWRFIPFIGDFQIEHID KVSKKLYEKYGKIVKMQGLLGRPDMLFLFDPDEIEKVFRQEDTLPYRPSMPSLNYYKHVHRKEFFGDNCG VIAVHGDRWQNFRTKVNQIMLQPRVTRMYVKSIETTSQELVDRIEVIKNKKSEVPDDFLNELHKWSLESI AKIALDVRLGCLDPNAHPDTQTLINAINTFFMNVPILELKIPFWKIWNTPTFKEYLHALDTIREITLKHV DIALHKLSQDDCEELSVLQRVLKMENDAKTASNLALDMFLVGVDTTSNAVASILYQLSLHQEKQAKLYEE IASLLPCKSTQISHEKLDQMHYLKACIKETMRMYPVVIGNGRCTTRDRVIGGYQIPKGVQVIFQHCVISN LEEYFPRSGEFLPERWLKNSELFRNHHPFASLPFGFGKRMCLGRRFADLEIQTVIAKVVRNFKIEYHHDK LDYHIHPMYTPYGPLRLTFIRRQ CYP301B1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph301-1, 61% to 301B1 Tribolium, 58% to 301B1 Apis, insert between PERF and Heme not in Tribolium Pediculus genome site CYP301B1 Nasonia vitripennis (jewel wasp) See wasp page CYP301C1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph12, 39% to Tribolium 301A1, 39% to 301A1 Pediculus (including Ns at N-term) and 39% to 301B1 Pediculus 39% to CYP12H1 Tribolium Pediculus genome site CYP302A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs2l2 Note: in Drosophila Cyp302a1 is the disembodied gene (dib), a C22 hydroxylase in the ecdysteroid biosynthetic pathway from cholesterol Cyp302a1 Drosophila melanogaster GenEMBL AC015396 31954-33705 Michael B. O'Connor Submitted to nomenclature committee 11/17/99 30% to Cyp12b1 other best matches are 12 family members the fly gene is called disembodied Dib see <A HREF="http://flybase.bio.indiana.edu:82/.bin/fbidq.html?FBrf0101392">abstract</A> Cyp302a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Gene name disembodied dib CYP302A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroupUn.2216 First 3 exons 45% to 302A1 Drosophila melanogaster. disembodied ortholog CYP302A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroupUn.2216 Last 4 exons 51% to 302A1 Drosophila melanogaster. disembodied ortholog it is highly probable that these to partial sequences are from the same gene (missing about 68 amino acids) Cyp302a1 Bombyx mori (silkworm) BAAB01119552.1 BAAB01040509.1 BAAB01022270.1 CK534186 BAAB01132138.1 46% to 302A1 Anoph. Probable ortholog See silkworm page for sequence Cyp302a1 Bombyx mori (silkworm) GenEMBL AB198340 Niwa, R, Sakudoh, T, Namiki, T. Saida, K, Fujimoto, Y, Kataoka, H. The ecdysteroidogenic P450 Cyp302a1/disembodied from the silkworm Bombyx mori, is transcriptionally regulated by prothoracicotropic hormone. Insect Molecular Biology 2005 in press CYP302A1 Manduca sexta (tobacco hornworm) GenEMBL DQ357067 Rewitz,K.F., Rybczynski,R., Warren,J.T. and Gilbert,L.I. Identification, characterization and developmental expression of Halloween genes encoding P450 enzymes mediating ecdysone biosynthesis in the tobacco hornworm, Manduca sexta Insect Biochem. Mol. Biol. 36 (3), 188-199 (2006) MYRLLNNYKCVPCRKFIQRNYTMEVSHNIIQKEPQKESLLHLND IPGPKCYPVIGTLHKYLPFIGDYDAEALDKIAWLNYRRYGCLVREVPGVNVIHVYDPD DIEAVFRQDDRFPARRSHLAMYHYRKNKPQVYNTGGILSTNGPEWWRLRSTFQKNFTS PQSVKAHVECTDDIAKEFVQWVHNRKFRCNEDFLIYLNRLNLEIIGAVAFNERFDSFS PQEQHSKSRSSKIIQAAFGSNSGVMKLDKGFLWKFFKTPLYRKLAQSQEYLEKVSTEI LMKKMKFFESNESDNDKSLLSSFLQQPNIDQKDIIGMMVDILMAAIDTTSYTTSFALY HIARNPECQEKIFTEVETLLPEKDSSITPNVISKAVYTRSSIKESLRLNPVSVGIGRL LQKDITLKGYFIPKDTVIVTQNMVSCRLPQYVREPLLFKPERWLRGSSEYENIHPFLS LPFGFGPRSCIARRLAEQNMCIILIRLIRDFQIQWKGRELGIRTLLINKPDQPVTLSF VPRLPRRL CYP302A1 Helicoverpa armigera (cotton bollworm) No accession number Heiko Vogel submitted to nomenclature committee Feb. 6, 2008 Contig_264 49% to CYP302A1, probable ortholog CYP302A1 Tribolium castaneum (red flour beetle) GenEMBL XP_974252 47% to 302a1 Drosophila, 49% to 302A1 bee, 49% to 302A1 Anopheles disembodied ortholog MCLFSKKSVHYLLTIRTRNYYYRGRFLTTAVDPPKSFKEIPGPLSLPLVGTLYLYFPFIGRYQFDRLHKN ALKNFQLYGPIIREEIVPGEHIVWLGDPDDIAKMFHTEGTYPYRKSHLTLEKYRLDRPHIYNSGGLLPTN GPEWSRIRKVFQKGLSGPTEALSFIKGSDDVISEWLDTRFKKIHKETSNMDFLQELSRLFLELIGVAAFD IRFQSFHDDELDPCSKSTKLLESAFVTNSTILKTDNGPQLWRKFETPAYRRLRKAQELMESVAIDLVALK LSTFKEKTSNPPTLLERYLASASLDFKDIIGVVCDFLLAGMDTTTYSSSFLLYHLATNPSTQDALYEEAC RLLPNPAAPLTTEKYKQAEYAKCAVKESLRLRPISIGVGRQLTTDVVFSGYKVPSGTVVVTLNQVLSRME KYFPEPDSFKPERWMKNDPSYVQTHPYLVIPFGHGQRSCIARRFAEQNMVILILKLARKYKLRWNGSEID SKSLLINKPDGPILLSFEPR CYP302A1v1 Aedes aegypti (yellow fever mosquito) CYP302A1v2 Aedes aegypti (yellow fever mosquito) CYP302A1 Culex pipiens CYP302A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph302, 52% to 302A1 Tribolium, 54% to 302A1 Apis ortholog to dib disembodied Pediculus genome site CYP302A1 Nasonia vitripennis (jewel wasp) See wasp page CYP302A1 Daphnia pulex (water flea) Confidential CYP302A1 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee Dec. 18, 2007 clone name Lce0063I12 68% identical to D. pseudoobscura CYP302a1 ortholog only 183 aa Cyp303a1 Drosophila melanogaster GenEMBL AC017306 12933-14444 no introns Cyp303a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP303A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs3r4 CYP303A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroupUn.2253 43% to 303A1 Drosophila melanogaster. Note this sequence is named in the same subfamily as other CYP303As since there seems to be only one orthologous sequence per species and it does not make sense to name orthologs with different subfamily names. CYP303A1 Bombyx mori (silkworm) BAAB01045446.1 BAAB01180906.1 BP126135 BAAB01031702.1 45% to 303A1 Probable ortholog See silkworm page for sequence CYP303A1 Tribolium castaneum (red flour beetle) GenEMBL XP_966541 42% to 303A1 Dm, 47% to 303A1 Anopheles, 51% to 303A1 Aedes MLLAIGFFVIILGLLAYLDTKKPKNYPPGPSWLPIIGSAHTIAEWRKKAGYLYKATAEM AEKYGPVIGLRVGKDLIVVxxxxxxxx EFLASDDLAGRPTGAFFDMRTWGERR (1) GVLLTDSDFWQEQRRFILRQLREFGFG RKNMSSMIEEETAIMVESYQKMLAGKKSVTLNMESAFNIHILNTLWMMMAGIRYSPEDKNLKSLQDILGR LFSTVDMVGAPFSQFPILRFIAPEISGYKLYLETHLQIWEFLYDELKRHKETWDENNLRDFMDVYLKMLN SPERKESFCEKQLLAICLDMFMAGSETTSKSLGFCFLNLIMYPDVQKKAQQEIDAVVGRNRLPSLEDRPQ MPYMECVVLEALRMFSGRAFTVPHRAFKDTHLNGYIIPKDAMVIANMHGTLMGPESGVEDPEIFKPERYL KDGKITLADTYLPFGFGKHRCLGETLAKANVFLFTASLLQHFNFSIVPGSPPTTECIDGVTPGPLPFKAL VTPRE CYP303A1 Aedes aegypti (yellow fever mosquito) CYP303A1 Culex pipiens CYP303A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph303, 52% to 303A1 Aedes Pediculus genome site CYP303A1 Nasonia vitripennis (jewel wasp) See wasp page Cyp304a1 Drosophila melanogaster GenEMBL AC008359 comp(80887-82836) also AC014292 AC009393 Cyp304a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP304B1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPg2r1 CYP304B2 Aedes aegypti (yellow fever mosquito) CYP304B3 Aedes aegypti (yellow fever mosquito) CYP304B4 Culex pipiens CYP304B5 Culex pipiens CYP304B6 Culex pipiens CYP304B7 Culex pipiens CYP304C1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPg2r2 CYP304C1 Aedes aegypti (yellow fever mosquito) CYP304C1 Culex pipiens CYP304E1 Tribolium castaneum (red flour beetle) GenEMBL XP_973180 37% to 304a1 Dm, 40% to 304B3 Aedes, 38% to 304B1 Anopheles AAJJ01000062.1, DT790944.1 DN650948.1 ESTs MLGFLTLLVTFVTLLVFLFYKFSKERPENFPPGPPSLPIWGSYWFLLKENYNLLHLSFETLGKRYKTDIL GFFLGNAPTVVTLSYELTKEMLSRDEFMGRNDTIIVQTRSRGSPKGIFFTDGPVWKQQRRFALRHMRDYG FGRRSENMENLAHQETNYLIDFLTKEPSQGDLDVCKKKGLVLVPDFMYGSLINTVMLVLTSRHFEHQEVR KYAKAAFRFMKSEDSTGGAITITPWLRYIAPTFFGFTSAVDDNDMLIDFMK (0) GLIEEHVETFSDDHHRDFI DDYISESIKQGIEID (1) 23621 YLQLALITLDFMFPSPVGVGHTLSFYFAHLINNPEIQTKIQEEIDQVVGRSRLPS LDDRKDLPYFEASIRENLRLMPVTPLGVTRRCVKDTMLGGHFIPKDSFVLANLWTAHRDPRIWDDPEEFR PERFLDSDGKLLRKDYTIGFGAGKRLCAGETYARNTMFLIMSGLLQNFTFKSPYNRPIDLKDVLPGVTLS LKETWISAVPR CYP304F1 Helicoverpa armigera (cotton bollworm) No accession number Heiko Vogel submitted to nomenclature committee Feb. 6, 2008 Contig_236 42% to CYP304B2 or B3, 39% to CYP304B1, 37% to CYP304C1, 41% to CYP304E1 Tribolium, 40% to CYP304A1 HAH003134 100% to Contig_236 except a short internal seq Contig_226 1 diff to Contig_236, contig 226 and 236 agree in Region where HAH003134 is different (frameshift?) CYP304F2 Zygaena filipendulae (zygaenid moth, Lepidoptera) No accession number Mika Zagrobelny Larsen Submitted to nomenclature committee Feb. 4, 2009 Clone name Zygae_c481 63% to CYP304F1 Helicoverpa armigera (cotton bollworm) CYP304F3 Manduca sexta No accession number Yannick Pauchet Submitted to nomenclature committee April 7, 2009 confidential Cyp305a1 Drosophila melanogaster GenEMBL AC018013 comp(34574-37224) Cyp305a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP305A1 Tribolium castaneum (red flour beetle) GenEMBL XP_970235 41% to 305A1 Dm, 46% to 305A5 Aedes, 40% to 305B1 B. mori MWLAVLLFFVFVVTYVIQQVKKPPNFPPGPYWLPFVGNLPQLKKLSKKLGGQHLALSHLAKEYNTNVLGL KLGKDYVVTVFTYPLIRDVLISEEFEGRPDNFFLRLRCMGQKRGITCTEGEEWNTLRNFVNRHLRNLGFG KKPMEKMIQDEICDILTILKKDGSDIQVDKFLAPCVLSVIWTLITGEKISRENNQLDELLDLFNLRSKAF DMTGGTLTQYPWLRYFLPEWSGFNLIKSVNTSLKNLFMKYIREHKDGWFEGRNEDLIYRYISEMKTNTEA SKYFTDDHLVMVCVDLFIGGAQTTSRTLGFAFLMMIMYPEVQKKVQEQIDKHYDKNSSIEYSDRYKLPYV EAVLLETIRYRYVVPIGGPRRVTKNTTLDGYYLPKNTTVLISFYSINNDPENWQNPEIFNPERFLDEKGS LLPDEKLIPFALGRRRCVGEILAKNCIFLLFVEILRRYNVSLAPGSKPPTGKPIAGITLSPESYRVKFTERSL CYP305A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph305, 45% to 305A1 Anopheles gambiae Pediculus genome site CYP305A2 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPj2l1 CYP305A3 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPj2l2 CYP305A4 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPj2l4 CYP305A5 Aedes aegypti (yellow fever mosquito) XM_001654548 MIVLVLTSVLIIAFSYWLLQELRRPPNYPPGPQWLPFIGNTPFV RKLARASGGQHLAFEALSKQYNSPVIGLKLGREYVVVALQYPAVREVHSKEEFDGRPD NFFLRLRTMGTRLGVTCTDGPFWAEHRNFVVRHLRQAGYGRQPMQLQIQNELNELIGI IRDMDGEPVWPGSILPTSVINVLWTFTTGSRIPRDDQRLARLLKLLQDRSKAFDMSGG VLSQLPWLRHIAPEWTGYNLLKRFNQELHEFFMIIVERHHQEYHEEKCSDDLIYAFIK EMKDRKDDPSSTFTDLQLTMIILDIFIAGSQTTSITIDLAFMMLTMHTDIQKKIHAEI DENLHQDEMPQQNDRTSLPYTEAFLLEVQRFFHIVPVSGPRRALSDCTLGGYQIPKNT TILMGLRTVHMDPEHWGDPECFRPERFLSPDGKIITTERLIPFGLGRRRCLGESLARA CMFTFLVGILQKFSLRQPANCSEKPSPKLLPGITLSPKPYKVIFEPR CYP305A6 Aedes aegypti (yellow fever mosquito) XM_001654549 MITLVLSSVVIVSFIFWLWQDLQRPPNFPPGPKWLPFFGNTLLI RNLARISGGQHLAFEALSKQYKSPVIGLKLGREHVVVALQYPAVHEALTKEAFDGRPD NFFIRLRTMGTRLGITFTDGPFWTEHNSFVVRHLRQAGYGRQPMQLQIQNELNELIGI IRDLDSEPVWPGSILPTSVINVLWTFTTGSRIPRDDQRLTRLLKLLQDRSKAFDMSGG ILSQLPWLRHIAPEWTGYNLINRFNQEIHEFFKATIEKHHQDYTEEKCSDDLIYAFIK EMKERKDDPCSTFTDVQLSMIILDIFIAGSQTTSTTIDIALMILAMNTEIQRKIYAEI DDNFHPDEIPDQNCRTNLQYTEAFLLEVMRLYQIAPIGGPRRALSDCTLGGYRIPRNT TILMGLHTVQMDPDHWGDPENFRPERFIGPDGKIINTERLIPFGLGRRRCLGDSLARS CMFTFLVGILQKFSLRLPDSLEGPSLKLTPGITLSPKPYKVVFEPRLK CYP305A7 Culex pipiens CYP305A8 Culex pipiens CYP305A9 Culex pipiens CYP305A10 Culex pipiens CYP305A11 Culex pipiens CYP305A12P Anopheles gambiae (malaria mosquito) GenEMBL XM_001230354.2 (seq below is expanded on both ends) Partial cDNA sequence 97% identical to CYP305A3 and CYP305A4 Chromosome UNKN: 41,132,263-41,133,123 reverse strand. This transcript is a product of gene AGAP012937 The genomic region 66760 bp upstream of this seq is missing the N-term 140 aa, so this is a pseudogene. GLGRQAMHEQIQTELTELLDVLDTRAEQPLWPGSILALS VINVLWTIVTGSRVAREDDRLQRLLELLQERSKVFDMAGGTLNQLPWLRFIAPEWSGYNL VRRFNKQLTKFFYPTIEEHKGDFTEDKAVDDLIYAYIKEMRDREGTSNNGNNFTEVQLTM IILDIFIAGGQTTSATLDLALMMMIVHPEVQDKVHREIDSQLEPNAIPHYDDRLKLPYVE AFLLEVQRYFSIAPVNGPRRAVVDCTLGGYRVPKDTTVLMGLRNVHMDPEHWGDPEVFRP ERFLNEQRQIVNTERLLPFGQGKRRCLGETLARSCLFTFFVGVMKRF CLMGTDEGSEDGTSMAPSLTLKPGITLSAKPYHVVSQPRR CYP305B1 Bombyx mori (silkworm) GenEMBL AB044900 Nobuyoshi Katagiri, Okitsugu Yamashita 39% identical to 305a1 CYP305B1 Helicoverpa armigera (cotton bollworm) No accession number David G. Heckel submitted to nomenclature committee Feb. 21, 2005 HaCypZ010a-v1Pep after revision 61% to 305B1 CYP305B1 Helicoverpa armigera (cotton bollworm) No accession number Heiko Vogel submitted to nomenclature committee Feb. 6, 2008 Contig_201 38% to CYP305A3 (note: original CYP305C1 seq had an error in the mid region name changed to CYP305B1 after revision) CYP305C1X Helicoverpa armigera (cotton bollworm) No accession number Vladimir Grubor submitted to nomenclature committee 12/30/2003 55% to CYP305B1 324 aa, seq #7 revised sequence name changed to 305B1 CYP305D1 Nasonia vitripennis (jewel wasp) See wasp page Cyp306a1 Drosophila melanogaster GenEMBL AC012373 154223-156412 also AC015216 AC012164 AC012376 phantom phm gene Cyp306a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser phantom phm gene CYP306A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs2r6 Ortholog of phantom phm gene CYP306A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroup13.1b 46% to 306A1 Drosophila. Note this sequence is named in the same subfamily as other CYP306As since there seems to be only one orthologous sequence per species and it does not make sense to name orthologs with different subfamily names. Ortholog of phantom phm gene CYP306A1 Bombyx mori (silkworm) BAAB01049676.1 BAAB01150476.1 BAAB01016115.1 BAAB01136488.1 BAAB01075467.1 AB162964 Niwa,R., Matsuda,T., Yoshiyama,T., Namiki,T., Mita,K., Fujimoto,Y. and Kataoka,H. CYP306A1, a cytochrome P450 enzyme, is essential for ecdysteroid biosynthesis in the prothoracic glands of Bombyx and Drosophila J. Biol. Chem. (2004) In press See silkworm page for sequence Ortholog of phantom phm gene CYP306A1 Manduca sexta (tobacco hornworm) GenEMBL DQ357066 Rewitz,K.F., Rybczynski,R., Warren,J.T. and Gilbert,L.I. Identification, characterization and developmental expression of Halloween genes encoding P450 enzymes mediating ecdysone biosynthesis in the tobacco hornworm, Manduca sexta Insect Biochem. Mol. Biol. 36 (3), 188-199 (2006) MDLFFIWLVTFVAGFWICKKIKEWQNLPPGPWGLPILGYLPFID PRQPHITLTKLSKLYGPIYGLKMGNIYAVVLSDHKLIRDAFAKENFSGRAPLYLTHGI MHGNGIICAEGGLWKDQRKLITTWLKSFGMSKHSALRDKLEKRIASGVYELLDSIEKS PKEPIDLPHMVSNSLGNVVNEIIFGFKFPSDDKTWQWLRQIQEEGCHEMGVAGVVNFL PFVRFFSPTIRKTIQMLIRGQAQTHTLYASVIARRRNMLGLEKPKGAEHLPHSNLFNE YPDGKLKCIKYSKHAPNTEHIFDPKTLIPTEGECILDSFLLEQKKRFENGDETARYMT DEQLHFLLADMFGAGLDTTSVTLAWFLLYMALYPDEQEIVRKEILSEYPEECEVESSR LPRLMAAIYETQRIRSIVPVGIPHGCLEDTYLGNYKIPKGAMVIPLQWALHMDPDVWE DSEEFKPSRFLAPDGSLLKPQEFIPFQTGKRMCPGDELSRMLASGFIARLFRRKRVRL ASIPSQEDMQGTVGVTLAPPRVQYFCDPI CYP306A1 Manduca sexta No accession number Yannick Pauchet Submitted to nomenclature committee April 7, 2009 confidential CYP306A1 Helicoverpa armigera (cotton bollworm) No accession number Heiko Vogel submitted to nomenclature committee Feb. 6, 2008 Contig_197 79% to CYP306A1 Bombyx HAH005766 100% to Contig_197 CYP306A1 Tribolium castaneum (red flour beetle) GenEMBL XP_968477, protein and AAJJ01000667.1 genomic 46% to 306A1 Aedes, 49% to Apis 306A1 Ortholog of phantom phm gene MAPLWSVLLSFLLFIIAYTWLRRSRNLPPGPWNLPVIGYLAWLNPQFPYLTLTSLSKKYGPIYGLYLGSI YTVVISDAKLIKKTFNKDASSGRAPLHLTHGIMKGFGLICAQGDLWKDQRKFVHNTLRTLGASKVSPNRP TMEALILHHVSDLVQHIKSLGESVTLDPLDSLRHSLGSAINQMVFGKCWSRDDATWKWLQHLQEEGTKHI GVAGPLNFLPLLRFLPKSKQTINFLTEGIRDTHQIYREIIEEQKNSSNETVSNVIQAFLAEKEKRKNEDS VKFYNDQQFHYLLADIFGASLDTTLTTLRWYVLYMAVHQDVQKKVRSLLNDLTLEQIAMVPYFEATIAEV QRIRPVVPVGIPHGSVEELEIAQYKVPPGTMIVPLQWAVHMDANIWDEPEVFKPERFINEEGKFFKPEAF IPFQAGKRMCVGDELARMFLYLFGAALVKNFAISCMGEVDLTGDCGITLTPKPHELIFTSL CYP306A1 Aedes aegypti (yellow fever mosquito) CYP306A1 Culex pipiens CYP306A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph306, phm phantom 51% to 306A1 Apis Pediculus genome site CYP306A1 Nasonia vitripennis (jewel wasp) See wasp page CYP306A1 Daphnia pulex (water flea) Confidential Cyp307a1 Drosophila melanogaster GenEMBL AC014810 comp(54284-52654) also AC007840 spook spo (see Flybase) 307a1 AC014810 comp(54284-52654) also AC007840 revised 9/8/2000 sequence revised between PERF and heme, no intron is present MLAALIYTILAILLSVLATSYICIIYGVKRRVLQPVKTKNSTEINHNAYQKYTQAPGPRP WPIIGNLHLLDRYRDSPFAGFTALAQQYGDIYSLTFGHTRCLVVNNLELIREVLNQNGKV MSGRPDFIRYHKLFGGERSNSLALCDWSQLQQKRRNLARRHCSPREFSCFYMKMSQIGCE EMEHWNRELGNQLVPGEPINIKPLILKACANMFSQYMCSLRFDYDDVDFQQIVQYFDEIF WEINQGHPLDFLPWLYPFYQRHLNKIINWSSTIRGFIMERIIRHRELSVDLDEPDRDFTD ALLKSLLEDKDVSRNTIIFMLEDFIGGHSAVGNLVMLVLAYIAKNVDIGRRIQEEIDAII EEENRSINLLDMNAMPYTMATIFEVLRYSSSPIVPHVATEDTVISGYGVTKGTIVFINNY VLNTSEKFWVNPKEFNPLRFLEPSKEQSPKNSKGSDSGIES DNEKLQLKRNIPHFLPFSIGKRTCIGQNLVRGFGFLV VVNVMQRYNISSHNPSTIKISPESLALPADCFPLVLTPREKIGPL Cyp307a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser spook spo (see Flybase) D. pseudoobscura ortholog of Cyp307a1 Jie Shen no introns MSAVYSVLAILLAIIITSYIRILYKSRRRVCVAIRNGKSSTESEQIYEQAPGPRPWPIIG NLNVLDKYRDNPFEGFTELAKKYGDIYSLTFGHTRCLVVNNLELIREVLNQNGKVVSGRP DFLRYHKLFGGERSNSLALCDWSQLQQKRRNMARRHCSPRESSSFYMQMSQIGCEEMEHF IREIDAKMVPGEPFDVKTFLLRACANMFSQYMCSQRFDYDDEEFHKIVQYFDEIFWEINQ GHPLDFLPWLYPFYRRHLNKIVHWSSTIRSFIMDRIISHRELDIDVDEPDSDFTDALLKS LVEDKDVSRNTIIFMLEDFIGGHSAVGNLVMLALAYIAKNPQVGRRIQDEVDAVSSKGQR KINLLDMHQMPYTMATIFEVLRYSSSPIVPHVATEDLVISGHGVTAGTIVFINNYVLNRS REHWEDSHQFEPQRFLEPSTEGRRDSRGSDSGIESDQKEFQLRKNIPHFLPFSIGKRTCI GQNLVRGFGFLLLANLLQGYNVSSGDPTTIRITPASLALPASCFPLLLTRRA* CYP307A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs1x4 Ortholog of spook spo (see Flybase) CYP307A1 Bombyx mori (silkworm) BAAB01102325.1 BAAB01020228.1 51% to 307A1 probable ortholog See silkworm page for sequence Ortholog of spook spo (see Flybase) CYP307A1 Manduca sexta GenEMBL DQ899315 Ono,H., Rewitz,K.F., Shinoda,T., Itoyama,K., Petryk,A.F., Rybczynski,R., Jarcho,M., Warren,J.T., Marques,G., Shimell,M.J., Gilbert,L.I. and O'Connor,M.B. Spook and Spookier code for stage-specific components of the ecdysone biosynthetic pathway in Diptera Dev. Biol. (2006) 298, 555-70. MSVLLIIAVAVYAIYKYFSKTKILWRKSNKYGKEEVEVLREAPG PFPLPVVGSLHLLGRHESPFQAFTALAKEYGDIYSIKLGSAKCLIVNNLALIREVLNQ NGNFVAGRPDFLRFHKLFAGDRNNSLALCDWSNLQLRRRNLARRHCGPKQHTDFYARI GSVATLESYELVQTLKSITQTTEQSINLKPLLMTSAMNMFTNYMCSVRFDENDSEFRK VVDHFDEIFWEINQGYAVDFLPWLAPFYKKHMDKLSGWSQDIRSFILSRIVEQREMNL DMEGPEKDFLDGLLRVLHEDPNVDRNTIIFMLEDFLGGHSSVGNLVMLCLTAVARDPE IGRKIRAEIEGVTKGKRPVALTDRSNLPYTEAAILECLRYASSPIVPHVATENAKING YGVEKGTVIFINNYVLNNSEQYWNNPENFDPSRFLEKTKVRTRRNSQCDSGMESDSER SPSKETETEVYSVKKNIPHFIPFSIGKRTCIGQTLVTTMSFTMFANIIQEFDVGVENL DDLIQKPACVALPKDTYNMYLIPRKQ CYP307A1 Helicoverpa armigera (cotton bollworm) No accession number Heiko Vogel submitted to nomenclature committee Feb. 6, 2008 Contig_269 73% to CYP307A1 Bombyx, 56% to CYP307A1 Anopheles HAH014257 100% to Contig_269 Submitted sequences contain an intron CYP307A1 Tribolium castaneum (red flour beetle) GenEMBL AAJJ01000951.1 genomic 55% to 307A1 Anopheles Ortholog of spook spo gene MLALLCLCVVLLVWWFSRPKKSPSTI 33251 PGPRPWPLIGSMHLLAGHETPFQAFTALSRVYGDIFSIHLGSASCVVVNNFKLIKEVLIA 33430 33431 KGGDFGGRPDFARFHKLFGGDRNN (1) 33502 34320 SLALCDWSSLQKTRRSIARTYCSPRFTSLQYDRVNNVGEEELKSFLHQLDQLPHGQPC 34493 34494 NVKPAVLMVCANMFTQYMCSTSFAYEDKGFQKIVRYFDEIFWEINQGYAVDFLPWLLPVY 34673 34674 TGHMKKISNWATEIRQFILSRIIDKHRATLDTNSPPRDFTDALLMHLEEDPNMNWQHIIF 34853 34854 ELEDFLGGHSAIGNLVMVTLAAVVDHPEVAKRIQEEVDQVTGGTRCPNLFDKAAMPYTE 35030 35031 ATILETLRTASSPIVPHVASKDTEIDGHEVSKGTIVFINNYELNQGDAYWDEPGLFKPER 35210 35211 FLSSTGNIVKPAHFIPFSTGKRTCIGQRLVQCFSFVVLATLLQYYDVSTKESVK 35372 35373 VQPGCVAVPPDCFKLVLTPRK* 35438 CYP307A1 Aedes aegypti (yellow fever mosquito) CYP307A1 Culex pipiens CYP307A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph307-2, spo spook 66% to 307A1 Tribolium Pediculus genome site CYP307A1 Nasonia vitripennis (jewel wasp) See wasp page CYP307A1 Daphnia pulex (water flea) Confidential CYP307A1 Lucilia cuprina (Australian blowfly) No accession number Zhenzhong Chen Submitted to nomenclature committee Dec. 18, 2007 clone name Luce0093H16 60% identical to Dm CYP307a1 ortholog Cyp307a2 Drosophila melanogaster GenEMBL AC019383 comp(3284-3706) AL078186 This sequence was originally predicted as a pseudogene, but recent work shows it is a functional gene called spookier (spok) Ono H, Rewitz KF, Shinoda T, Itoyama K, Petryk A, Rybczynski R, Jarcho M, Warren JT, Marques G, Shimell MJ, Gilbert LI, O'Connor MB. Spook and Spookier code for stage-specific components of the ecdysone biosynthetic pathway in Diptera. Dev Biol. 2006 Oct 15;298(2):555-70. This is a new member of the halloween genes AC019383 comp(3284-3706) AL078186 AC019474 exon 2 MLTSVFYVLFAIAITIILISYVFLLLKCKQKAFVVIGLLYQEKKYQCFDQAPGPHPWPII GNINLLGRFQYNPFYGFGTLTKKYGDIYSLSLGHTRCIVVNNVDLIKEVLNKNGKYFGGR PDFFRYHKLFGGDRNN (1) 663 SLALCDWSQLQQKRRNLARRHCSPRESSSYFSKMSEIGGLEVNQLLDQLTNISSGYPCDV 842 843 KPLILAASANMFCQYMCSVRFNYSDKGFQKIIEYFDEIFWEINQGYSFDYIPWLVPFYCN 1022 1023 HISRIVHWSASIRKFILERIVNHRESNININEPDKDFTDALLKSLKEDKNVSRNTIIFML 1202 1203 EDFIGGHSAVGNLVMLALAYIAKNPTIALHIRNEVDTVSAKGIRRICLYDMNVMPYTMAS 1382 1383 ISEVLRYSSSPIVPHVAMEDTVIKGFGVRKGTIVFINNYVLNMSESFWNHPEQFDPERFL 1562 1563 ENNFTNNKESGLKCDDNKRTEFIRKNDNDGSTKSKKYGKQNLNNK 1697 1698 LLKKSIPQFLPFSVGKRTCIGQSLVRGFGFLLLANIIQNYNVNSADFSKIKLEKSSI 1868 1869 ALPKKCFKLSLRPRT* 1916 Cyp307a2 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Contig3286_Contig7811B.fa.9 66% to D. pseudoobscura ortholog of 307A1 first exon 67% to 307A2 55% to 307A1 so this is the ortholog of 307A2 sequence revised between PERF and heme (no intron), not a pseudogene MFFSVCYLIIAIGITAIVISYILIIFKCNEKATVRIGLSNPEKAIRKFGQ APGPQPWPIIGNLDLLGRFKSNPFEGLGALTKQYGDIYSLTLGHSRCLVV NSLDLIREVLNKNGKFIGGRPDFIRYHKLFGGDRNN (1) SLALCDWSNLQQKRRNLARRH CSPRESSPYFNKMSHIGCNEINHLMRELNKTQPGKPFDIKKLILGASANM FCQYMCSIRFDYNDKEFRRIVRYFDEIFWEINQGYSLDFLPWMLPFYKNH TQKIVQWSTTIRKFILDRVIHQREEMLDLDEPENDFTDALLRSLKEDQNV SRNTIIFMLEDFIGGHSAVGNLVMLALAYIAKNPMIGRRIEHEADIVTNK GRRRINLYDMDAMPYTMATIFEVLRYSSSPIVPHVAMEDTAIAGFGVTRG TIIFINNYMLNMSEKYWNSPDHFEPERFLDENYRRNIWKAVKCNDN RRGSEGSDSGIEFEKDTHVHKIIIDKLTSTKNSKKNKNIFNDKCGY QLKKNIPHFLPFSIGKRTCIGQNLVRGFGFVLLANIVQSYNITSADISKININH ESLALPADCFPLVLTPRA* CYP307B1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs3r2 Note: CYP307B sequences have been named spookiest Ono H, Rewitz KF, Shinoda T, Itoyama K, Petryk A, Rybczynski R, Jarcho M, Warren JT, Marques G, Shimell MJ, Gilbert LI, O'Connor MB. Spook and Spookier code for stage-specific components of the ecdysone biosynthetic pathway in Diptera. Dev Biol. 2006 Oct 15;298(2):555-70. CYP307B1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroup14.9 55% to 307B1 Anopheles. Note this sequence is given the same name as Anopheles 307B1 since there seems to be only one orthologous sequence and it does not make sense to name orthologs with different names. Note: CYP307B sequences have been named spookiest Ono H, Rewitz KF, Shinoda T, Itoyama K, Petryk A, Rybczynski R, Jarcho M, Warren JT, Marques G, Shimell MJ, Gilbert LI, O'Connor MB. Spook and Spookier code for stage-specific components of the ecdysone biosynthetic pathway in Diptera. Dev Biol. 2006 Oct 15;298(2):555-70. CYP307B1 Tribolium castaneum (red flour beetle) GenEMBL AAJJ01001163.1 genomic 53% to 307B1 Anopheles, 55% to 307B Aedes, 51% to bee 307B1 Note: CYP307B sequences have been named spookiest Ono H, Rewitz KF, Shinoda T, Itoyama K, Petryk A, Rybczynski R, Jarcho M, Warren JT, Marques G, Shimell MJ, Gilbert LI, O'Connor MB. Spook and Spookier code for stage-specific components of the ecdysone biosynthetic pathway in Diptera. Dev Biol. 2006 Oct 15;298(2):555-70. 13728 MFFTQFTLYLIAFVVLATVFAIAEAMRTRQGKY 13629 RPPSPLKLPIIGHLHLMSGYQVPYQAFTSLSKKFGSVFGLQLGYVKCVVINGQKNIREAL 13450 13449 VTKGHHFDSRPNFERYQQLFSGNKEN (1) 13372 13324 SLAFCNWSTTQKTRRDMLKAYTFPRAFTNRFYTLEELVSSGTRSVVAALSTNTTSTKSV 13148 13147 ISRHCANIFTRHFCSKSFDVDEATFVEMVDNYDEIFYEVNQGYAADFLPFLLPFHEKNLK 12968 12967 RVNGLTHKIRDFVLDHIIEGRFDSFDVQAEPDDYVESLIKYVKSGESPQLSWDSALFALE 12788 12787 DIIGGHSAVANFLVKLFAFLAKEPQVQENIHKEIDRVLGTKEVSISHRTLLPYTEAT 12617 12616 IFEAIRLIASPIVPRVANRDSSIN (1) 12545 12489 GLEIEKGTVIFLNNYDLSMSPQLWDEPEKFKPERFIRE 12376 12375 ERLVKPEHFLPFGGGRRSCMGYKMVQLVSFGILGGIMQNFTIGADDTSYTVPVGSLALP 12199 12198 KDTFTFKFVRR* 12163 CYP307B1 Aedes aegypti (yellow fever mosquito) Note: CYP307B sequences have been named spookiest Ono H, Rewitz KF, Shinoda T, Itoyama K, Petryk A, Rybczynski R, Jarcho M, Warren JT, Marques G, Shimell MJ, Gilbert LI, O'Connor MB. Spook and Spookier code for stage-specific components of the ecdysone biosynthetic pathway in Diptera. Dev Biol. 2006 Oct 15;298(2):555-70. CYP307B1 Culex pipiens CYP307B1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph307-1, spookiest 47% to 307B1 Apis Pediculus genome site Cyp308a1 Drosophila melanogaster GenEMBL AC015216 72624-74273 also AC012164 Cyp308a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp309a1 Drosophila melanogaster GenEMBL AC019891 comp(47437-49308) also AC009909 Cyp309a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp309a2 Drosophila melanogaster GenEMBL AC009909 45541-45982 also AC019977 Cyp309a2 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp310a1 Drosophila melanogaster GenEMBL AC017690 comp(12877-14650) Cyp310a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp311a1 Drosophila melanogaster GenEMBL AC014186 2270-3910 Cyp311a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp312a1 Drosophila melanogaster GenEMBL AC015424 57845-59744 also AC009369 AC007573 AC010003 Cyp312a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp313a1 Drosophila melanogaster GenEMBL AC017740 comp(167857-169728) also AC007809 AC007928 Cyp313a2 Drosophila melanogaster GenEMBL AC017371 4100-5929 AC007724 AL057969 AL067059 Cyp313a3 Drosophila melanogaster GenEMBL AC017371 1890-3720 also AC007724 Cyp313a4 Drosophila melanogaster GenEMBL AC017336 153877-155807 also AC007752 Cyp313a4 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp313a5 Drosophila melanogaster GenEMBL AC017371 6352-8145 also AC007724 Cyp313a6 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser This is a paralog of 313a4, D. pseudoobscura has two genes To D. melanogasters one Cyp313a4 Cyp313b1 Drosophila melanogaster GenEMBL AC007571 comp(104264-106046) also AC013225 Cyp313b1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp313c1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp313c2 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp314a1 Drosophila melanogaster GenEMBL AC012699 22585-24723 also AC006496 shade shd gene Cyp314a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser shade shd gene CYP314A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs2r3 Ortholog of shade shd gene CYP314A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroup5.25 43% to 314A1 Drosophila. Note this sequence is named in the same subfamily as other CYP314s since there seems to be only one orthologous sequence per species and it does not make sense to name orthologs with different subfamily names. Ortholog of shade shd gene CYP314A1 Acyrthosiphon pisum (pea aphid) GenEMBL CF588143 48% to 314A1 Dros. Pseudoobscura aa 126-362 Ortholog of shade shd gene CYP314A1 Tribolium castaneum (red flour beetle) GenEMBL XP_972699, CB336479 CB336480 EST 42% to 314A1 bee, 39% to 314A1 Aedes Ortholog of shade shd gene MFEKIFQSLDVTSLLIIAIFFLFLEYRPPWWYRNNDCKKGVKLIPGPLALPGLGTTWIFFFGGFSFNRLH LYYENMYKRYGPVMKEEYWCNIPVINLFEKREIVKVLKAGGKYPLRPPVEAVAHYRRSRPDRYASIGLVN EQGEAWYDLRSTLTPALTSPKTITSFLPEAQQIADDWCNLLKLSRDKNGRVSNLNYIADRLGLELTCALV LGRRMGFLLPGAETETGEKLAEAVRQHFLGTRDTYFGFPFWKLFPTPAYKTLIKSEASIYELALELINSA NESTKESAVFQSVIQAEIDEREKIAAIIDFISAGIHTMKNSLLFLLHLIGQDLQIQKKIIEDSTKSYSKA CVTETFRLLPTANALGRILEEEMELGGYRLSAGTVVVCHFGIACRDERNFPDASKFKPERWLDDDKVQTA TNSLFLLTPFGAGRRICPGKRFIEHILPLLLESTVNSFEIQAEEELELQFEFLVTPKGQMPMVFKDRV CYP314A1 Aedes aegypti (yellow fever mosquito) CYP314A1 Culex pipiens CYP314A1 Culex_quinquefasciatus (southern house mosquito) No accession number Osamu Komagata submitted to nomenclature committee 7/25/07 sequence 26 62% to CYP314A1 Anoph., 76% to 314A1 Aedes Ortholog of shade shd gene a halloween gene in the Mito clan CYP314A1 Bombyx mori (silkworm) BP179750 BP179443 BAAB01069738.1 BAAB01063661.1 BAAB01099804.1 BAAB01099804.1 BAAB01122030.1 BAAB01134118.1 49% to 314A1 probable ortholog See silkworm page for sequence Ortholog of shade shd gene CYP314A1 Manduca sexta (tobacco hornworm) GenEMBL DQ372988 Rewitz,K.F., Rybczynski,R., Warren,J.T. and Gilbert,L.I. Developmental expression of Manduca shade, the P450 mediating the final step in molting hormone synthesis Mol. Cell. Endocrinol. 247 (1-2), 166-174 (2006) MSLPGVFLFSYYMKNVWSTPTPLLDWSCVPTLVLLVALVVVTVT AVLTRAVDTKNPSRLPGPQPLPFLGTRWLFWSRYKMNKLHEAYEDMFRRYGPVFAETT PGGATIVSIADREALDAVLKAPAKRPYRPPTEIVQVYRRSRPDRYASTGLVNEQGDKW LHLRRHLTVELTSPNTMQGFMPELNSICEDFLDLLQSSRQANGTVHGFDQLTNRMGLE SVCGLMLGSRLGFLERRMSGRAATLAAAVKTHFRAQRDSYYGAPLWKFAPTTLYKTFV RSEETIHTVVSELMEEAKARTNGSAKDDGMQEIFLRILENPALDMRDKKAAVIDFITA GIETLANSLVFLLYLLSGRPDWQHTIRSELPSCSRLSADDLSAAPSVRAAINEAFRLL PTAPFLARLLDTPMTLGGHKLPAGTFVLAHTGAACRREENFWRAHEFLPERWSSSRQP HAASLVAPFGRGRRMCPGKRFVELELHLLLAKIMQRWRVEFDGELDVQFDFLLSPKSP VTLRLVEW CYP314A1 Manduca sexta No accession number Yannick Pauchet Submitted to nomenclature committee April 7, 2009 confidential CYP314A1 Pediculus humanus humanus (body louse) No accession number Reed Johnson and Barry Pittendrigh Submitted to nomenclature committee March 14, 2007 Ph314, 51% to 314A1 Apis ortholog to shd shade Pediculus genome site CYP314A1 Nasonia vitripennis (jewel wasp) See wasp page CYP314A1 Daphnia magna (water flea) GenEMBL AB257771 Yoshinori Ikenaka Submitted to nomenclature committee 8/28/06 CYP family-1, full length sequence 41% to Cyp314a1 Drosophila, 40% to Apis CYP314A1 probable ortholog so the same subfamily name is used CYP314A1 Daphnia pulex (water flea) Confidential Cyp315a1 Drosophila melanogaster GenEMBL AC015141 229-2538 also AC008307 AC007725 AC007648 shadow sad gene Cyp315a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser shadow sad gene CYP315A1 Anopheles gambiae (malaria mosquito) GenEMBL Submitted by Christelle Abgrall, Hilary Ranson and Rene Feyereisen See the paper: Evolution of supergene families associated with insecticide resistance. Ranson, H., Claudianos, C., Ortelli, F., Abgrall, C., Hemingway, J. Sharakhova, M.V., Unger, M.F., Collins, F.H. and Feyereisen, R. Science 298, 179-181 (2002) Anopheles map name CYPs1x1 Note: in Drosophila Cyp315a1 is the shadow gene (sad), a C2 hydroxylase in the ecdysteroid biosynthetic pathway from cholesterol Ortholog of shadow sad gene CYP315A1 Apis mellifera (honeybee) WGS section genbank Submitted by May Berenbaum 2/11/04 AmGroupUn.1189 38% to 315A1 Drosophila. Note this sequence is named in the same subfamily as other CYP315s since there seems to be only one orthologous sequence per species and it does not make sense to name orthologs with different subfamily names. Ortholog of shadow sad gene CYP315A1 Bombyx mori (silkworm) AB167737 BAAB01119643.1 Niwa,R., Matsuda,T., Yoshiyama,T., Namiki,T., Mita,K., Fujimoto,Y. and Kataoka,H. CYP306A1, a cytochrome P450 enzyme, is essential for ecdysteroid biosynthesis in the prothoracic glands of Bombyx and Drosophila J. Biol. Chem. (2004) In press probable ortholog See silkworm page for sequence Ortholog of shadow sad gene CYP315A1 Manduca sexta (tobacco hornworm) GenEMBL DQ357068 Rewitz,K.F., Rybczynski,R., Warren,J.T. and Gilbert,L.I. Identification, characterization and developmental expression of Halloween genes encoding P450 enzymes mediating ecdysone biosynthesis in the tobacco hornworm, Manduca sexta Insect Biochem. Mol. Biol. 36 (3), 188-199 (2006) MHRMTRLLSKQQKLLIFQRNAASAECIRTDLTINEMPHPKSMPI LGTKLEFFAAGGGKKLHEYIDNRHKQLGSIFCENLGGSADLVFISDPTLMKTLFLNLE GKYPAHILPDPWILYEKLYGSKRGLFFMNGEEWLNNRRIMNKHLLKEDSEKWLNDPVK ATIKSFINNWKTRAEQGNFIPDLETEFYRLSTDVIIAILLGSNSSIKTSKQYEMLLCM FSESVKNIFQTTTKLYALPVTWCQRINLKVWRDFKECVDMSLFLAHKIVTEILNRRHE NDGLIKRLCEDKMSDEDIIRIVADFVIAAGDTTAYTSLWTLLLMAKNKDYVNNELPMK DINNIKHVVKEAMRLYPVAPFLTRILPKESILGPYKLNEGTPVIASIYTSGRDINNFS RPEEFLPYRWDRNDPRKKELMNHNPSASLPFALGSRSCIGKKIAMLQITELMSQIVKN FHLECLNKTPVNILTSQVLVPDKNIDIQVSLYDSSKLNKNECWL CYP315A1 Manduca sexta No accession number Yannick Pauchet Submitted to nomenclature committee April 7, 2009 confidential CYP315A1 Tribolium castaneum (red flour beetle) GenEMBL XP_970122 41% to 315A1 Drosophila Ortholog of shadow sad gene MLFRRVLAQLSTRPKSSLVSANTVLHFDEIPSPKGLPLVGTTLALIAHGSTPKLHDYVDKRHKQLGPIFK EKLGPVSAVFVADPDEIRAVFDHEGKHPLHVLPDAWVAYNQMYGCPRGLFFMDGANWWYHRRIMNRLLLK GDFRWIEGACECVSDKLIDSLMGESDYCGNLEATLYKWSLDVIVSILLGSGSYSQLCGELEPKVQKLAQV THLVFQTSAKLALIPASFASKFKIPQWRRFVDSVDNALAQANALVDTLIEKKPHSDGLLPKLLAEQITLE DIKRIIVDLVLAAGDTTAVAMEWMLYLVAKSPQIQEKLRRNPDFVKHVFKETLRLYPVAPFLTRILPEDA ILGGYGVPKGTLVVMSIYTSGRDGRYFKNPGLFQPERWDRKDEFYSSEMKKASLPFAMGLRACVGRKVAE TQLQMTLLKIVNKFEVELGNQREVEIVLKMVAVPSEPLLLKFKKL CYP315A1 Aedes aegypti (yellow fever mosquito) CYP315A1 Culex pipiens CYP315A1 Nasonia vitripennis (jewel wasp) See wasp page CYP315A1 Daphnia pulex (water flea) Confidential Cyp316a1 Drosophila melanogaster GenEMBL AC017388 35538-37158 also AC010113 AC010557 AC010112 Cyp316a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser Cyp317a1 Drosophila melanogaster GenEMBL AC020477 126550-128106 no introns also AC009844 AL109349 Cyp317a1 Drosophila pseudoobscura See D. pseudoobscura page on this server for sequence Obtained from UC Santa Cruz Browser CYP317A2 Lucilia cuprina (Australian blowfly)