last modified 1/17/2001 302 entries AC002329 AC007127 comp(82241-84323) CYP51A1 no ESTs MDWDYYTLLKTSVAIIIVFVVAKLITSSKSKKKTSVVPLPPVLQAWPPFIGSLIRFMKGP IVLLREEYPKLGSVFTVKLLHKNITFLIGPEVSSHFFNAYESELSQKEIYKFNVPTFGPG VVFDVDYPVRMEQFRFFSSALKGQQLRGYVDQMTKETEDYFSKWGESGEVDLKAELERLI TLTASRCLLGREVRDQLFDDVAPLFHDLDKGMQPISVIFPKLPIPAHNCRDRARGKIAKI FSNIIATRKRSGDKSENDMLQCFIDSKYKDGRETTESEVTGLLIAGLFAGQHTSSITATW TGAYLIQNKHWWSAALDEQKKLIGKHGDKIDYDVLSEMDFLFRSAKEALRLHPPKILLLR TVHSDFTVTTREGKQYEIPKGHIVATSPAFANRLPHVYKDPENFDPDRFSKEREEDKAAG SCSYISLGAGRHECPGGSFAFLQIKAVWCHLLRNFELELVSPFPEINWNALVVGAKGNVM VRYKRRPFS* AC007296 comp(52507-54167) CYP51A2 (76% to CYP51) ESTs AA720360, AA651059, N65031 no GSSs BE522875 M29H10STM developing seed cDNA clone M29H10 5'.Length = 356 BE529044 M73C16STM developing seed cDNA clone 600038258R1 5'Length = 243 BE530647 M78M23STM developing seed cDNA clone 600040195R1 5'. BE527494 M68F24STM developing seed cDNA clone 600036391R1 5'Length = 250 MELDSENKLLKTGLVIVATLVIAKLIFSFFTSDSKKKRLPPTLKAWPPLVGSLIKFLKGP IIMLREEYPKLGSVFTVNLVHKKITFLIGPEVSAHFFKASESDLSQQEVYQFNVPTFGPG VVFDVDYSVRQEQFRFFTEALRVNKLKGYVDMMVTEAEDYFSKWGESGEVDIKVELERLI ILTASRCLLGREVRDQLFDDVSALFHDLDNGMLPISVLFPYLPIPAHRRRDRAREKLSEI FAKIIGSRKRSGKTENDMLQCFIESKYKDGRQTTESEVTGLLIAALFAGQHTSSITSTWT GAYLMRYKEYFSAALDEQKNLIAKHGDKIDHDILSEMDVLYRCIKEALRLHPPLIMLMRA SHSDFSVTARDGKTYDIPKGHIVATSPAFANRLPHIFKDPDTYDPERFSPGREEDKAAGA FSYIAFGGGRHGCLGEPFAYLQIKAIWSHLLRNFELELVSPFPEIDWNAMVVGVKGNVMV RYKRRQLS* AC002340 T11J7.14 CYP71A12 complement(join(56204..56533,56649..56933, 57039..57530,57703..58089) chr 2 MEMILMVSLCLTTLITLFLLKQFLKRTANKVNLPPSPWRLPLIGNLHQLSLHPHRSLHSL SLRYGPLMLLHFGRVPILVVSSGEAAQEVLKTHDLKFANRPRSKAVHGLMNGGRDVVFGP YGEYWRQMKSVCILNLLTNKMVASFEKIREEELNEMIKKLEKASSSSSSENLSELFVTLP SDVTSRIALGRKHSEDETARDLKKRVRQIMELLGEFPIGDYVPALAWIDRINGFNARIKE VSQGFSDLMDKVVQEHLEAGNHKEDFVDILLSIESEKSIGFQAQRDDIKFMILDMFIGGT STSSTLLEWIMTELIRNPNVMKKLQDEIRSTIRPHGSYIKEKDVENMKYLKAVIKEVFRV HPPLPLILPRLLSEDVKVKGYNIAAGTEVIINAWAIQRDPAIWGPDAEEFKPERHLDSTL 
DYHGKDLNFIPFGSGRRICPGINLALGLVEVTVANLVGRFDWRAEAGPNGDQPDLTEAFG LDVCRKFPLIAFPSSVI AC002340 T11J7.16 CYP71A13 complement(join(66627..66956,67250..67534, 67629..68120,68320..68706)) chr 2 EST H76866 (3 diffs) AI992420 MEMILSISLCLTTLITLLLLRRFLKRTATKVNLPPSPWRLPVIGNLHQLSLHPHRSLRSL SLRYGPLMLLHFGRVPILVVSSGEAAQEVLKTHDHKFANRPRSKAVHGLMNGGRDVVFAP YGEYWRQMKSVCILNLLTNKMVESFEKVREDEVNAMIEKLEKASSSSSSENLSELFITLP SDVTSRVALGRKHSEDETARDLKKRVRQIMELLGEFPIGEYVPILAWIDGIRGFNNKIKE VSRGFSDLMDKVVQEHLEASNDKADFVDILLSIEKDKNSGFQVQRNDIKFMILDMFIGGT STTSTLLEWTMTELIRSPKSMKKLQDEIRSTIRPHGSYIKEKEVENMKYLKAVIKEVLRL HPSLPMILPRLLSEDVKVKGYNIAAGTEVIINAWAIQRDTAIWGPDAEEFKPERHLDSGL DYHGKNLNYIPFGSGRRICPGINLALGLAEVTVANLVGRFDWRVEAGPNGDQPDLTEAIG IDVCRKFPLIAFPSSVV AF069716 CYP71A14 BAC F6A4, Chromosome IV, 10571-13777 MEMIIISLCLATILALLLLKQFLNRTYTAKVNLPPSPWRVPVIGNLHQLSLHPHRSLRSL SHRYGPLMLLHFGRVPVLVVSSSDVAHDLMKTHDLKVANRPQLKVVEKIFNGGREMVFSP YGEYWRQIKSVCIVNLLNKKKVQSFEKVREEEISEMMERVEKASSDSSPLNLSELLLTLT SDVTSRVSLGRKYSKEESMSDFKIQMRKITELVGGFPVGEYIPCLAWIDKLRGVDEKAEE VSKAFGDLMEKVLQEHLDATDKPTLDFVDVLLSLERHERNGVQIRRSDIKFLILDMFLAG TETTYALLEWIMTELIRHPECMKKLQDEIRAKATKLILYISEEDVEDMKYLKAVVKEVLR LHPPLPLLVPRELSEDIKLKGYDIAAGTQVIINAWAIQRDTMTWGIDAEEFRPERHLDSL VDFRGTNFEFIPFGSGRRICPGIGFAMALVEVTLANLVNRFNWRMEVQHSGDEYDLAEST GLDVCRKFPLIVFPSPA* AF069716 CYP71A15 BAC F6A4, Chromosome IV, like 71A13 16004- 18550 ESTs BE528853 M72N09STM developing seed cDNA clone 600038039R1 5' Length = 255 BE528877 M72N15STM developing seed cDNA clone 600038063R1 5' Length = 257 BE530492 M77N03STM developing seed cDNA clone 600040015R1 5' Length = 263 MEIIIISLCLATILAFLLLKPLLNRTVAKDNLPPSPWRVPVIGNLHQLSLHPHRSLRSLS HRYGPLMLLHFGRVPILVVSSSDVAHDLMKTHDLKVANRPRLKVIETILNGGREVVFSPY GDYWRQIKTVCVVHLLNKKMVQSFAKVREEERSVMMEKVEKASSDSSPLNLSKLLITLTS DVASRVSFGKKHSNEASMSDFKNQVRKITELVGGFPVSEYIPCLAWIDQIRGLYNRAEEV SKIFGDLMDKVVQEHLDATNKPTKDFVDILLSFERQSKDGIEVRRSDIKFIILDIFLGGT TTTNSLLEWTMTELIRHPECMKKLQDEIRGDATNLTIYRSHEEVEDMKYLKAVIKEGLRL 
HPPFPLLVLRLLTQDVKLKGYDIAAGTQVITNAWAIQRDIVTWGIDAEEFRPERHLDSPL DFRGTNFEYIPFGSGRRICPGIGFAMALVEVTLANLVNRFNWRMDARLSGDEYDLAEATG IDVCRKFPLIVFPSNA* AB022210 CYP71A16 64% identical to 71A14 comp(26631-24211) ESTs AA395422, T43806 MEMMILISLCLTTFLTILLFFKSLLKRPNSNLPPSPWRLPVIGNLHQLSLHPHRALSSLS ARHGPLMLLRFGRVPVLIVSSADVAHDVMKTHDLKFANRPITKSAHKISNGGRDLVFAPY GEYWRNVKSLCTIHLLSNKMVQSSEKRREEEITLLMETLEEASLSSSSVNLSKLITNMVS DIMGKVVLGKKYSGEEGTIDVKTITKSFLDAVGLSPVGEYIPSLAWIGKITGSDGKLEKI TKQFGDFIEKVLQEHEDTTADKETPDFVDMLLTIQRDETAQCQLDKSDLKVIIFEMFLGS TTTTSAVIEWAMTRLMRNPECLKKLQDEIRSVSKMNSYVSGKEVENMNYLKAVIKEVLRL HPPLPLLVPRLLSEDVKLKGYDITAGTQVIINAWAIQRDTATWGSDAQEFRPERHFDSTW DFVGRNFKYIPFGAGRRLCPGIGLGSVMASVTLANLVKRFDWRVEDGPSGYDKPDLVEGA GIDVCRKFPLVVFPSSA* CYP71A17P pseudogene on AB017065 19000-21000 region 64% identical to 71A14 RKTLELSGYFPLGEYIPGLAXIEXGMDDKVEEASKEVVRLHPPFPIILXXXXXXDIKFKG YDIAAGTLVITNAWAIXRDTTILGADAKEFRPERHLDLLVEFKGQGYMYTPFGSGRRRCP GICFG AC007296 83847-85677 CYP71A18 86% identical to 71A12 NO ESTs no GSSs MEMTLMVSLCLTTLLTLLLLKKFLKRTAKKVNLPPSPWRIPVIGNLHQLSLHPHRSLHSL SLRYGPLMLLHFGRVPILVVSSSEAAHEILKTHDLKFANRPKSKAVHGLMNGGRDVVFGP YGEYWRQMKSVCILNLLTNKMVASFEKVREEEVNAMMEKLEKASCSSSAENLSELFVTLT SDVTSRVSLGKKYWEDETAGGLKKRVRQIMELLREFPIGDYVPALAWIDRINGFNSKIVE VSRAYSDLMEKVVQEHLEAGEHKADFVNILLSIEKEKNNGFKVQRNDIKFMILDMFIGGI STSSTLLEWIMTELIRNPECMKKLQNEIRSTIRPHGSYIKEKEVENMRYLKAVIKEVFRV HPPLPLILPRLLTEDVKVKGYDIAAGTEVLINAWSIHRDPAIWGPDAEEFKPERHLDSTL DYHGQDLKYIPFGSGRRICPGINLAMGLVEVTLANLVGRFDWSVDPGPNGDQPDLAEDFG LDVCRKNPLIAFPSSVA* AL049608 CYP71A19 CDS at4g13290 join(11443..11829,12066..12542,12626..12907,13106..13432) gene T9E8.30 64% identical to 71A16 no ESTs no GSSs MEIILVTLCLTTLLALLLLKSILKRTTTNNLNLPPSPWRLPVIGNLHQLSLNTHRSLRSL SLRYGPLMLLHFGRTPVLIVSSADVAHDILKTYDVICANRPKTKVIDKILRGGRDVAFAP YGEYWKQMKSICIQNLLSNKMVRSYKKIREDEIKLMIEKVENASSCSPPSPVNLSQLFMT LTNDIICRAALGRKYSSKEDGIDVENIVRAFSALVGEFPIGEYIPSLSWIDKIRGQDHKM EEVDKRFDEFLERVVKEHEDANKDTRSDLVDTLLTIQSDKSALKLIIWDMFLAGTATSLS 
FLEWAMTELMRNPKVMKKLQEEIRSSSRQGLFVTEKEAEKMDYLQAVIKEALRLRPPAPL MVPRVFSEDVTLKGYNIPAGTQVIINAWAIQRDTTTWGIDAEEFRPERHLDSILDFQGQD FKFIPFGSGKRICPGIGFTSALIGVTLANIVKRFNWRMDVEPQRVQHDLTEATGLVVFRK FPLIAIPSSA* AL049608 CYP71A20 CDS at4g13310 join(21221..21598,21753..22253,22331..22612,23485..23811) gene T9E8.50 88% identical to CYP71A19 no ESTs no GSSs MILITLCLTTLLALLLKSILKRTATKNFNLPPSPWRLPVIGNLHQLSLHTHRSLRSLSLR YGPLMLLHFGRTPVLIVSSADVAHDVMKTHDLVCANRPKTKVVDKILSGGRDVAFAPYGE YWRQMKSICIQNLLNNKMVRSYEKIREEEIKRMIEKLEKASCSSSPSPVNLSQILMTLTN DIICRVALGRKYSGKKDGIDVENIVRTFAALLGEFPVGEYIPSLSWIDRIRGLDHKMEVV DKRFDEFLERVVKEHEEADKETRSDLVDKLLTIQSDKTGQFELEKSALKLIIWDMFLAGT ATTLSFLEWAMTELMRNPKVMKKLQEEIRSSSPQDLFVTEKEAEKMNYLQAVIKEALRLR PPAPLLVPRVLSEDVKLKGYNIPAGTQVIVNAWAIQRDTTTWGTDAEEFKPERHLDTNLD FQGQDFKFIPFGSGKRICPGIGFTSALIGVTLANIVKRFNWRMDVEPQRVQHDLTEATGL VVFRKFPLIAIPSSA CYP71A21 AL049659 comp(61576..62187,62279..63139) T29H11.160 also U31288 fragment and GSS B25921 no ESTs MESMTMIILQSLIIFITILFFKKQKRGKKSNTPRSPPRLPLIGNLHQLGHHPHRSLCSLS HRYGPLMLLHLGRVPVLVVSSADVARDILKTHDRVFASRPRSKLFEKLFYDGRDVAFAPY GEYWRQIKSVCVLRLLSNKMVTSFRNVRQEEISLMMEKIQKSSSLQVNVSELLGSLTNDV ISRIALGRKYSGETDSKELMKRLMMLMGEFSVGTYVPWLGWIDWISGLDGQLNKTGNDLD EFLEKVVQDHVDGDGQRTDFVDVLLRIQREKSIGFEIDRLCIKAIVLDVLVAGTDSSYAL MDWAMTELLRHPECLRTLQEEVRTICKGNLSVSEEDIQNMSYLKAVIKETTRLHPPLPLL APHESIQDVILGDYHIPAGTQVMINAWAIGREAATWGPDAEKFRPERHLDSSVDFRGHNF ELVPFGAGRRICPAISFAVVLIEVALANFVHRYDWKLPEDSKENQTNVAESTGMVIHRLF PLYAIASSTT CYP71A22 AL049659 comp (64631..65242,65328..66188) T29H11.170 EST N38590 MESMIRIILLSLIIFITILFFIKQKKGKKSNTPASPPRLPLIGNLHQLGRHPHRSLCSLS NRYGPLMLLRFGLVPVLVVSSADVARDILKTYDRVFASRPRSKIFEKIFYEARDVALAPY GEYWRQMKSVCVLHLLTNKMVRSFRNVRQEEISLMMEKIQKSSSLQVNLSELLGSLTNDV ISRVALGRKYSDETDFKELMKRLTKLLGEFCVGTYVPWLAWIDWISGLDGQLKKTGNDLD EFLEKVVQDHEDGDAQRTDFVDVLLRIQREKSVGFEIDRLSIKAIILDVVVGGTDTSYAL MEWAMTELLHRPECLNRLQEEVRTICKGNSSVSEDDIKDMNYLKAVIKETMRLHPPLPLM VPHESTQDVRLGDYHIPAGTQVMINAWAIGREAATWGPDAEKFRPERHLNSSVDFRGHNF 
ELIPFGAGRRICPAISFAVILIEVTLANLVHRYDWRLPEEYIEDQTNVAESTGMVIHRLF PLYAIVSSTT CYP71A23 AL049659 comp (67262..67861,68005..68856) T29H11.180 MILFLCLIILFIITILFFKKHKTVNKIINFPSPPRLPLIGNLHQLSQHPHRSLCYLSHRY GPLMLLHFGSVPVIVASTAEAARDVLKTHDRVFASRPRSKIFEKLLYKSRNMASAPYGEY WRQMKSVSVLHLLSNKMVRSFQDVRQEEITLMMETIRKSSSKPVNLSKILSSLTNDVICR VALGRKYGVGTDFKELIDRLMRQLGTFTIGSYVPWLAWTDWVSGLEARLEKTANDFDKLL ERIVQDHEDGDGDKTDFVDVLLAAQRDKSFGFDIDRLSIKAIVLDAFVGGTDTSSTLVEW EMTELLRHPTCLKKLQEEVRTICKGKSSVSEDDIQGMEYLKAVVKEALRLHPPVPLMVPH QSTQDVRLRDNHIPAGTQVIVNLWAVGREAATWGPDANEFRPERHLESPSDFRGQDFELI PFGAGRRMCPGISFAVVLNEVVLANLVHGFDWQSIDDETDVAESIGSVIRRMHPLYVIPS STT CYP71A24 AL049659 comp(70246..71784) T29H11.190 GB translation may have retained intron before I-helix MMMMIILLLCSIILITILFFKKQSRGKKSNAPPSPPRLPLIRNLHQLGRHPHRSLCSLSH RYGPLMLLHFGSVPVLVVSSADAAKDVLKTHDRVFASRPRSKIFDKIFYNGRDVALAPYG EYWRQMKSVCVLHLFSNKMVRSFRDVRQEEISLMIEKIRISSSLRINLSEILVNLTNNVI CRVALGRKYGGKTDFKDLMKRLTRLLGEFSVGSYVSWLAWIDWIRGLDGQLIKISNDLDE FLERVVQDHVDGDGHKNDFVDFLLTIEREKSVGFEIDRLSIKAIILDVFVGDMDTTYTLL EWAMTELLCHHECLDRLQEEVRMVCKDKSGVSEDDLQDMKYLKAVIKETLRLHPPLPLMV PHESTHDVKLRDYHIPAGTHVMINAWAIGREAATWGPDAEEFRPERHLNSYVDYRGQDTE LVPFGAGRRICPAISFAVVLDEVVLANLVHQFDWTLPEESTEYQTDVAESTGMAVHRMFP LFAMTT CYP71A25 AL049659 comp (73216..73830,73929..74786) T29H11.200 BE524239 M47F11STM developing seed cDNA clone M47F11 5' Length = 405 MMMMIILLWSIIFMTILFLKKQLSGKKGKTPPSPPGLPLIGNLHQLGRHTHRSLCDLSRR YGPLMLLHLGRVPVLIVSSADMAQEILKTHDQAFANRPRSKLSQKLLYNNRDVASAPYGE YWRQMKSVCVIHLLSNKMVRSFRDVREEEITLMMAKIRKSSSLPFNVSKVLECLTNDVIC RVALGRKYGGETDFKKLTDRLSELLGTFSIGSFVPWLAWVDWIRGWDAQLDKMGKDLDDF FEKVVQDHEDGDRRDGTDLIDALLRVKREKSPGFEIERVSIKAITLDVFVGGSDTSFTLL EWAMTELLRHPKSLNRLQEEVRTICKGKSRVSEDDIQGMKYLKAVIKEALRLHPPFPMMA PHESTEDVKLRDYHIPAGTQVMMNAWAIGREVATWGPDAEEFKPERHLDTSVDFRGQNFE LLPFGAGRRICPAVSFAVVLNEVVLANLVHGFDWKLPEESKEDKTDVAESSGFSVHREFP LYAVASPYLT CYP71A26 AL049659 comp(76207..76818,76952..77809) T29H11.210 MMIMFFLLCSIIFVVTIIIFRKQKRGKKRNTLPSPPGLPLIGNLHQLGRHPHRSLCSLSH 
RYGPLMLLHFGRVPVLVVSSAELARDVLKTHDRVFASRPRSKIFEKLLYDKHDVASAPYG EYWRQMKSVCVLHLFSNKMVRSFREVREEEISLMMEKIRKSISLPVNLSKILVSLTNDVI CKVALGRKYGGETDFKELMERLNKLLGTFSVGSYVPWLAWIDWIRGLDCQLEKTANDVDK FFERVVQDHVDGNRDMTDFVDVLLAIQRDKTVGFEINRVSIKAIVMNVFVGGTDTSSTLM EWAMTELLRHPKCLKRLQEEVRTICKDKSSVSEEEIQNMSYLKAVIKEALRLHPPLPLMV PHESTQDVRLGDHHIPAGTQVLINAWAIGREAATWGPDVEEFRPERHLDSSVDYRGQAFE LIPFGSGRRICPAISFAVVLNEVVLANLVHRFDWRLSVESTEDQTEVAESTGIAIHRMFP LYAIASNTT AL022224 CYP71A27 F1C12.160 (parts from two genes) 61% identical to 71A20 at4g20240 two genes fused MEMILISLCLTTLLAFLFLKPLLKRITTTKPKLPPSPWRLPVIGNLHQLGPNPHRYLHSL SLRYGPLMLLHFGRVPVLVVSCPDVTNDIMKTHDLKFANRPKSKAINIFMEGGRDIIFGP YGEDWKSMKSLGVVHLLNNKMVRSFENLREEEIKVMTEKLEEASSSSSSVNLSKLLMTLT NDIICRITLGRKYNEEEGGIDIKNLVMTSSEFFGKFFFGDFIPSLAWIDWISGIDDKMKD INNKLDCFLDSMVQEHVDADHKEPSDFIDMLLLIQKDKTKRFKFDRSDLILILKDMFFSG TATTASQLEWTMTELMRHPECMKKLQDEINSFSTHNLNVTEKEVEKMNYLHCVIKEGLRL HPSGPLLFRLPSEDVQLKGYDISAGTHVIINAWALQRNPAIWGLDANEYRPERHFGTNLD FNRTDSKFVPFGAGRRLCPGIGFSLVLSKLALANLVKRFNWRLKVGPVGDDKPDLAEASG IDVCRKFPLIVFPSIAHTH* AL022224 CYP71A28 F1C12.160 (parts from two genes) 60% identical to 71A20 GSS B25676 at4g20240 two genes fused MILISLCFTTFLAFLFLNPLLKRTTTTKPNQPPSPWRLPVIGYLHQLSLHPHRSFRSLSL RLWSAHAPSLRSCPYPRFSKYISSSDVAHDVMKTHDLKFANRPKTKAVDIIINGGRDVAF SSYGEYWRQMKSLCTVHLLGRQMVRSFEKVREEEITSVIWKLEKQSSSSLPVNLSDLLLN MSNDVICRIAVGRKYSREENTSDFENQLRKVMELLGAFPVGDYIPGLAWIDKVRGLDRKM EEVSKTFVEFLERVVQEHVDEGENKETFDFVDILLIQLEKTNEFELERSDIRLIILEFFL GGTTTTFTAIDWAMTLVVRHPESMKKLQEEIQTYSRNKLYVPEEEVENMKYLKAVIKEVF RLHPPGPLSIPRQLSEDVFINAWAIHRDTEKWGPYAEEFKPERHLDLPLNFQGQDFNFIP FGSGRRLCPDIDFATMLIEVGLANFVYRFNWRVETRPLGDDDGLKQTGMEVCHKFPLIAF PSLASFTI* D78605 71B2 AC007357 31915-33510 BE523880 M43A8STM developing seed cDNA clone M43A8 5' Length = 390 MTILLCFFLVSLLTIVSSIFLKQNKTSKFNLPPSPSSLPIIGNLHHLAGLPHRCFHKLSI KYGPLVFLRLGSVPVVVISSSEAAEAVLKTNDLECCSRPKTVGSGKLSYGFKDITFAPYG EYWREVRKLAVIELFSSKKVQSFRYIREEEVDFVVKKVSESALKQSPVDLSKTFFSLTAS IICRVALGQNFNESGFVIDQDRIEELVTESAEALGTFTFSDFFPGGLGRFVDWLFQRHKK 
INKVFKELDAFYQHVIDDHLKPEGRKNQDIVTLILDMIDKQEDSDSFKLNMDNLKAIVMD VFLAGIDTSAVTMIWAMTELIRNPRVMKKAQgSIRTTLGLKKERITeEDLGKVEYLNHIL KETFRLHPALPFVVPRETMSHIKIQGYDIPPKTQIQLNVWTIGRDPKRWNDPEEFNPERF ANSSVDFRGQHFDLLPFGSGRRICPGMPMAIASVELALMNLLYYFDWSMPDGTKGEDIDM EEAGNISIVKKIPLQLVPVQRY AB024038 COMP(43851-45465) CYP71B3 also D78602 no ESTs no GSS MSILLYFFFLPVILSLIFMKKFKDSKRNLPPSPPKLPIIGNLHQLRGLFHRCLHDLSKKH GPVLLLRLGFIDMVVISSKEAAEEVLKVHDLECCTRPKTNASSKFSRDGKDIAFAPYGEV SRELRKLSLINFFSTQKVRSFRYIREEENDLMVKKLKESAKKKNTVDLSQTLFYLVGSII FRATFGQRLDQNKHVNKEKIEELMFEVQKVGSLSSSDIFPAGVGWFMDFVSGRHKTLHKV FVEVDTLLNHVIDGHLKNPEDKTNQDRPDIIDSILETIYKQEQDESFKLTIDHLKGIIQN IYLAGVDTSAITMIWAMAELVKNPRVMKKAQEEIRTCIGIKQKERIEEEDVDKLQYLKLV IKETLRLHPPAPLLLPRETMADIKIQGYDIPRKTILLVNAWSIGRNPELWENPEEFNPER FIDCPMDYKGNSFEMLPFGSGRKICPGIAFGIATVELGLLNLLYYFDWRLAEEDKDIDME EAGDATIVKKVPLELVPIIH AB024038 comp(78016-79595) 71B4 also D78603 EST T21833, AI100749 GSS B12539, B10642 MVSLLSFFLLLLVPIFFLLIFTKKIKESKQNLPPGPAKLPIIGNLHQLQGLLHKCLHDLS KKHGPVMHLRLGFAPMVVISSSEAAEEALKTHDLECCSRPITMASRVFSRNGKDIGFGVY GDEWRELRKLSVREFFSVKKVQSFKYIREEENDLMIKKLKELASKQSPVDLSKILFGLTA SIIFRTAFGQSFFDNKHVDQESIKELMFESLSNMTFRFSDFFPTAGLKWFIGFVSGQHKR LYNVFNRVDTFFNHIVDDHHSKKATQDRPDMVDAILDMIDNEQQYASFKLTVDHLKGVLS NIYHAGIDTSAITLIWAMAELVRNPRVMKKAQDEIRTCIGIKQEGRIMEEDLDKLQYLKL VVKETLRLHPAAPLLLPRETMADIKIQGYDIPQKRALLVNAWSIGRDPESWKNPEEFNPE RFIDCPVDYKGHSFELLPFGSGRRICPGIAMAIATIELGLLNLLYFFDWNMPEKKKDMDM EEAGDLTVDKKVPLELLPVIRISL D78601 CYP71B5 AL132969 79823-81516 region clone T04D02 EU Chr 3 project EST AI997548 MSIFLCFLLLLPLSLIFLKKLLPSKGKLPPGPKGLPIIGNLHQFGRFLHKSLHKISQEYG PVMLLHFGVVPVIIVSSKEGAEEVLKTHDLETCSRPKTVGSGLFTYNFKDIGFAPYGENW REMRKIAVSELFSQKKLKSFRYIREDESQLLVRKVSKSALETPTSSVNLRKVIFTFAASI ICRLSFGQNFCDFVDMETVEELVLESETNLGSLAFADFLPAGWIIDRISGQHSTVMKAFS KLTNFFELVIDDHLKSGKIEDHSDIISVMLDMINKPTEVGSYKVTDDHLKGLMSDVFLAG VNAGSITMIWTMTELSRHPRVMRKLQEEIRAALGPNKEKITEEDLEKVEYLKMVIEEAFR LHPPAPLLLPRLTMSDINIQGYSIPKNTMIQINTYTIGRDPKNWTKPDEFIPERFVDNPI 
EYKGQHFELLPFGAGRRVCPGMATGITIVELGLLSLLYFFDWSLPNGMTTKDIDMEEDGA FVIAKKVSLELVPTLHRW D78604 CYP71B6 EST F15479, F14199 GSSs B27880, B28631 identical to AC005967 join(40855..41760,41949..42554) F27D4.9 MSLFSFPISTELLPWLLLLLIPPLLIFFLLRSPKNLPPGPPRLPILGNIHQLGSLPHRSL RDLSLKYGPVITVYLGSVRTVVVHSPETAEEVLKLHDSECCTRPKLSITKSFFYDGLGLG FTKWGDYYRDVRKLCVLELFSVKRANSFRNIREEELSRLVNSFSDSASSGSSVDLTANLA KFVASFTCRMAFGLSFQGSGMDNETFLELFTEANRVIGKFAAADIFPGFGWILDRISGLD SSRRKSFQDLDTFYQKAIVDHREKKKTEDREDLIDVLLKLQSQETKLGSSRITDTHIRAI IMDLFVAGVDTSVITLDWTMAELSRHPRVMKKVQAEIREHVGDKGIVTYDDLEALVYMKM VIKETWRLHAPSPILIPREAMTNFKIKGYDIYPGTRIHVNAWAIGRNPDVWKDPDEFIPE RFVDSNVETKGTSFELLPFGSGRRGCPAMYVGLSTVEYTLANLLYHFDWKATEEVSVEEA PGLTSHRKHPLHLVPVNVINRKL X97864 71B7 also AC007357 39975-41560 MSILLCFLCLLPVFLVSLSILSKRLKPSKWKLPPGPKTLPIIGNLHNLTGLPHTCFRNLS QKFGPVMLLHFGFVPVVVISSKEGAEEALKTQDLECCSRPETVATRMISYNFKDIGFAPY GEEWKALRKLVVMELLNTKKFQSFRYIREEENDLLIKKLTESALKKSPVNLKKTLFTLVA SIVCRLAFGVNIHKCEFVDEDNVADLVNKFEMLVAGVAFTDFFPGVGWLVDRISGQNKTL NNVFSELDTFFQNVLDDHIKPGRQVSENPDVVDVMLDLMKKQEKDGESFKLTTDHLKGII SDIFLAGVNTSAVTLNWAMAELIRNPRVMKKVQDEIRTTLGDKKQRITEQDLSQVHYFKL VVKEIFRLHPAAPLLLPRETMSHVKIQGYDIPVKTQMMINIYSIARDPKLWTNPDEFNPD RFLDSSIDYRGLNFELLPFGSGRRICPGMTLGITTVELGLLNLLYFFDWVVPVGKNVKDI NLEETGSIIISKKTTLELVPLVHH* AB011485 CYP71B8 GSS B18782 exact match for 44 amino acids MIKSILLCFFFLFPLLLTLFKKLLPSKWKLPPGPTGLPIIGNLHQAWKLLHSSFHKLSLE HGPVMLLRFGVVPMVVISSKEAAKQVLKSRDLETCSRPKLVANGLFTRNFKDIAFAQYGE DWREMKKLVGLELFNPKKHKFFRYIREEEGDLLVKKLSKSSQTQTLVDLRKAFFSFTAGI IFRVSFGQNFRECDFIDMDRLEELVQESETNVFSFAFTDFFPTGLGWLVDRISGQHSRIE KAFSKLTKFFQHVIDEELKIGQSQDHSNLVSSMLDMINRSTEYGSFKITSDHLIAMMTDI VLGGVNAGTITMIWTMTELTRHPRVMKKLREEIRATLGPNKERITEEDLEKVEYLKLVIK ETFRLHPPGPFLLPRQVMSDIEIQGYHIPKNAHIKISTYAIGRDPKCWTNPEEFNPERFA NTSINYKGQHYELLPFGAGRRSCPGMTLGITILELGLLNILYYFDWSLPNGMTIKDIDME EDGALTIAKKVPLELIPTLPASLCIK AC004136 CYP71B9 47% IDENTICAL TO 71B1 MATIWFLSLLFLCCILLAAFKHKKRRTNQQQPPSPPGFPIIGNLHQLGELPHQSLWSLSK TYGPVMLLKLGSVPTVVVSSSETAKQVLKINDLHCCSRPSLAELSYNYLDIAFSPFDDYW 
KELRRICVQELFSAKRVHSIQPIKEEEVRKLIVSATESASQKSPVNLSEKFLDLTVSVIC KAAFSLDFHTSVLNNDGFDKLIHDAFLFLGSFSASNFFPNGGWIIDWLTGLQRRREKSVK DLDVFYQQMFDLHKQENKQGVEDFVDLLLKLEKEETVLGYGKLTRNHVKAILMNVLLGAI NTSAMTMTWAMAELIRNPRVMKKVQSEIRNQMINKSVITLDDIDHLPYLKMVIKETWRLH PPVPLLLPREVMSEFEINGYKIQPKTLLYVNVWAIGRDPDSWKDADMFYPERFMDNNIDA KGQNFELLPFGSGRRICPGMYMGTTMVEFGLANMLYQFDWEVPDGMVVEDIDMEESPGLA VGKKNELLLVPVKYLGH* AB019233 CYP71B10 71% identical to AC004136 MTVLWFVSLILLISILLVAVKHSKRRWVRQPPSPPGLPIIGNLHQLGELPHQSLCKLSKK YGPVMLLKLGRVPTVIVSTPETAKQVLKDYDLHCCSRPSLEGTRKLSYNYLDIAFSRFDD YWKELRKLCVEELFCNKRINSIQPIKEAEMEKLIDSIAESASQKTLVNLSDTFLSLNVNV ICKAVFGVNFQGTVLNNDKFQDLVHEALEMLGSFSASDFFPYVGWIVDWFTGLHARRERS VRDLDAFYEQMIDLHLQKNREESEDDFVDLLLRLEKEEAVLGYGKLTRNHIKAILMNILL GGINTSAITMTWAMAELIRNPRVMKKVQSEIRAQIGKNNKTRIISLDEINHLSYLNMVIK ETCRLHPVAPLLVPREVISEFKINGYTIQPKTRLHVNVWAIGRDPEIWKDPEEFLPERFM DCDIDVKGQDYELLPFGSGRRICPAVYMGITTVEFGLANLLYHFDWKLPEGVAVEDIYMD EASGLTSHKKHDLLLVP CYP71B11 AC006259 (94-702) partial seq. AC005964 (66557- 68105) whole seq. 
MSLWYIIVAFVFFSTIIIVRNTRKTKKNLPPGPPRLPIIGNLHQLGSKPHSSMFKLSEKY GPLMALRFGSVSTVVASTPETVKEVLKTFDAECCSRPYMTYPARLTYNLKDIGFCPYTKY WREVRKMTVVELYTAKRVQSFQHTRKEEVASLVDFITQAASLEKPVNLNTKLMKLSGSVI CRVVFGINLKGSKLENLYEEVIQGTMEVVGSFAAADYFPIIGRIIDRITGLHSKCEKIFK AMDAFFDQSIKHHLEDESIKDDIIDLLLKMERGEIELGEFQLTRDNTKGILFNILNAGID TSAQVMTWVMTYLISNPRVMKKAQAEVREVIKNKDDIIEEDIERLEYLKMVVKETFRVLP LVPLLIPREASKDVKIGGYDIPKKTWIHVNIWAIHRNPNVWKDPEAFIPERFMDNQIDYK GLNFEFLPFGSGRRMCPGIGMGMALVHLTLINLLYRFDWKLPEGMEVEDVDLEESYGLVC PKKVPLQLIPVLTQWS* CYP71B12 AC006259 (4286-6376) AC005964 (72227-73777) MSLWYIIVAFVFFSSMIIVRIIRKTKKNLPPGPPRLPIIGNLHQLGSKPHRSMFKLSETY GPLMSLKFGSVSTVVASTPETVKEVLKTFDVECCSRPNMTYPARVTYNLKDLCFSPYSKY WREVRKMTVVELYTAKRVQSFQHTRKEEVAALVDFIKQAASLEKPVNLNKKLMKLSGSVI CRVAFGINLQGSKLENTYEEVIQGTVELVGSFAAADYFPVVGRIIDRITGLHSKCEKLFK AMDAFFDQSIKHHLEDEIIKDDIIDLLLKMERGETTLGEFQLTRDHTKGILANILNAGID TSAQVMTWVMTYLISNPRVLKKAQAEVREVIKHKDDIIEEDIERLQYLKMVIKETFRINP LVPLLIPREASKDVKIGGYNIPKKTWIHVNIWAIHRNPNVWKDPEAFIPERFMDSQIDYK GLNFELLPFGSGRRICPGIGMGMALVHLTLINLLYRFDWKLPEGMKVADVDLEESYGLVC PKKIPLQLIPVL CYP71B13 AC006259 (9294-10829) AC005964 (76695-78230) MSLWYIIVVFVFFASIFIAKNTRKTKKNLPPGPPRLPIIGNLHQLGSKPHRSMFKLSEKY GPLVYLKLGKVPSVVASTPETVKDVLKTFDKDCCSRAFLTYPARISYNLKDLAFAPYSKY WKAVRKMTVVELYTAKRVKSFRNIREEEVASFVEFIKHSASLEEIVNLNQTLVKLSGSVI CRVGFGINLEGSKLENTYEEVIHGTMEVLGSFAASDYFPVIGGIIDRITGLHNKCEKVFK GTDSFFDHCIKHHLEDGGSKDDIVDLLLKVERGEIGLGEFQFTRNHTKGILLDILLAGVD TSGHTITWVMTHLIKNPRVMKKAQAEVREVIKNKDNITEEDIEGLEYLKMVVKETLRINP LVPLLTPREASKDVKIGGYNIPKKTWIHVNIWAIHRNPNVWKDPEAFIPERFMDNQIDYK GLNFELLPFGSGRRICPGIGMGMALIHLTLINLLYRFDWKLPEGMEVEDVDLEESYGLVC PKKVPLELIPVL CYP71B14 AC006259 comp(30932-32475) MIWWFIVGASFFFAFILIAKDTRTTKKNLPPGPPRLPIIGNLHQLGSKPQRSLFKLSEKY GSLMSLKFGNVSAVVASTPETVKDVLKTFDAECCSRPYMTYPARVTYNFNDLAFSPYSKY WREVRKMTVIELYTAKRVKSFQNVRQEEVASFVDFIKQHASLEKTVNMKQKLVKLSGSVI CKVGFGISLEWSKLANTYEEVIQGTMEVVGRFAAADYFPIIGRIIDRITGLHSKCEKVFK EMDSFFDQSIKHHLEDTNIKDDIIGLLLKMEKGETGLGEFQLTRNHTKGILLNVLIAGVD 
TSGHTVTWVMTHLIKNPRVMKKAQAEVREVIKNKDDITEEDIERLEYLKMVIKETLRINP LVPLLIPREASKYIKIGGYDIPKKTWIYVNIWAVQRNPNVWKDPEVFIPERFMHSEIDYK GVDFELLPFGSGRRMCPGMGLGMALVHLTLINLLYRFDWKLPEGMNIEDVDLEESYGLVC PKKVPLQLIPVL AB016889 CYP71B15 Arabidopsis thaliana genomic DNA, chromosome 3, P1 clone: MDJ14, MSVFLCFLVLLPLILIFLNVLKPSKYKLPPGPKKLPIIGNLHQRRTLHPRNRRNLAEMYG PVALLQYGFVPVVAISSKEAAEEVLKINDLECCSRPEAAGMRATFYNFKDIGMAPFGDEW SLMRKLSVVELFSVKKLQSFKYIIEEENNLCVKKLSEFATRQSPVNLERAIFTLVGNIVC RIGYGINLYECDFFEADRVVDLVLKAEAVIRETVFSDFFPGRIGRFIDCISGQNRRLKNN FSVVDTFFQNVLNEHLKPGRESSTIVDLMIDMKKKQENDGDALKFTTDHLKGMISDIFVA GIGGVAGITLWGMTELIRNPRVMKKVQDEIRTTLGDKKERIKEEDLNQLHYFKLVVKETL RLHPTTPLLLPRQTMSHIKIQGYDVPAKTQILVNVYAMGRDPKLWENADEFNPDRFLDSS VDFKGKNYEFIPFGSGRRICPGMTMGTILVEMALLNLLYFFDWGLAKQEEAKEIINGEEN FLAFFQVLHH AB024038 comp(13279-14849) CYP71B16 84% identical to CYP71B17 AB024038 15932-17496 no ESTs no GSSs MAISLLCLFLITLVSLIFVVKKIKHSKWDLPPSPPTFPVIGNLHQVGELPHRSFQRLAER TGHVMLLHFGFVPVTVISSREAAEEVLRTHDLKCCTRPKLVGSRLISRGFKDISFTPYGE EWRERRKFLVRELFCFKKVQYFGYIVEEECNLLVKKLTESAVGRPPVDLSKSLFWLAASI LFRIAFGQSFHDNKFIDEDKIDELIFETETAQASFTCSDFFPIAGLGWLADWISGKHRWL NNVFFKLDALFQRVIDDHSDPGRWKDHKDIVDVMLDVMHKQGKDDSLRLTIDHIKGFLTN IIIAGIDTGALTMIWAMTELARNPELMKNVQGEIRDSFGNNKERITKEDLNKVPFLNMVI KETFRLHPVAPLLLPRETMTHIKVQGYDIPPKRRILVNTWAIGRDPTLWINPEEFNPERF INNPVDYRGQHFELLPFGSGRRICPGMGLGITIVELGLLNLLYFFDWRAPDGMTHKDIDT EEAGILTVVKKVPLKLVPVRVQ* AB024038 comp(15932-17496) CYP71B17 84% identical to CYP71B16 AB024038 13279-14849 no ESTs no GSSs MAISLLCLFLITFVSLTIVGCKIKRSIWNLPPSPPKLPVIGNLHQVGELPHRSFRRLAER TGHVMLLHLGFVPVTVISSREAAEEVLRTHDLDCCSRPNLVGSRLISRGFKDLNFTPYGE EWKERRRFLVGELFCSKKLQSFIYIKEVECNFLVKKLSESAVDQSPVDLSKTLFWLAASI LFRVAFGQSFHESEFTDTDKIDELVFETETAQGSFTCSDFFPIAGLGWLVDWISGQHKRL NDVFLKLDALLQHVIDDHSNPGRSKDHKDIVDVMLDVMHKQGKDDSLRLTIDHIKGLLTN IIIAGIDTGALTMIWTMTELARNPEIMKKVQGEIRDRLGNNRERITKEDLDKVPFLNLVI KETFRLHPVAPLLLPRETMAHVKVQGYDIPPKRRILVNAWAIGRDPKLWTDPEEFKPERF IDSPVDYRGQHFELLPFGSGRRICPGMAMGMATLELGLLNLLYFFDWKLPDGMSHKDIDT EEAGTLTVVKKVHLKLVPVRVP* 
AB024038 comp(18089-19609) CYP71B18 ESTs AA650762, T44224, AI099821 (2 diffs) T44855 (3 diffs) GSS B25335 80% identical to CYP71B17 AB024038 15932-17496 missing an internal 21 amino acid fragment and frame shift after first 6 N-terminal amino acids. may be a pseudogene MAISLLCLFLIAFASLIFLGKKIKRSKWNFPPSPPKFPVIGNLHQLGELPHRSLQRLAEK TGDVMLIHLGFVPVTVISSKEAAEEVLRTHDLNCCSRPNLVGTRLLSRGFKDVGFTPYGE EWKERRKFLVREFFSLKKVQSFGYIREEECNFLVKKLLESAVDQSPVDLSKTLFWLTASV MFRVAFGQSFHESEFIDKDKVDELVFEAETAQASFTCSDFFPIAGLGWLVDCISGQHKRL KDVFFKLDALFQ (frame shift here, missing 21 amino acids) MLDVMQKQRKDDSLKLTINNIKGILTNIFLGGIDTGALTMIWAMTELARNPEVMKKVQGE IRDRLGRNKERITEEDINKVPYLNLVIKETFRLHHPVPLLLPRETMAHIKVQGYDIPPKR RILVNAWAIGRDPKLWTNPEEFNPERFINSPVDYRGQHFELLPFGSGRRICPGISMGMAT VELGLLNLLYFFDWKLPDGMTHKDIDTEEVGTLTIVKKVPLKLLPVRFQ* AI993108 701495348 91% to 71B18 probable new sequence NIFLGRIDTGALTMIWAMTELARNPGVMKKVQGEIRDRLRRNKERVTEEDNNKVPYLNLV VKETFRLHYPVPLLLPRETMAHIKVQGYDIPPKRRILVNAWAIGRDPKLWTNPREFDPGR FIDSPVDYKGQHFELLPFGSGR AB024038 comp(21194-22760) CYP71B19 ESTs H76241 MAISFLCVFLITFVSLIFFAKKIKRSKWNLPPSPPKFPVIGNLHQIGELPHRSLQHLAER YGPVMLLHFGFVPITVVSSREAAEEVLRTHDLDCCSRPKLVGTRLLSRDFKDIGFTPYGN EWKARRKFALRELFCLKKVQSFRHIREEECNFLVKQLSESAVDRSPVDLSKSLFWLTASI LFRVALGQNFHESDFIDKEKIEELVFEAETALASFTCSDFFPVAGLGWLVDWFSGQHKRL NDVFYKLDALFQHVIDDHLNPGRSKEHEDIIDSMLDVIHKQGEDSSLELTIDHIKGFLAN IFLAGIDTGAITMIWAVTELVKNPKLIKKVQGDIREQLGSNKERITEEDIEKVPYLKMVI KETFRLHPAAPLILPRETMAHIKVQGYDIPPKRRILVNVSAIGRDPKLWTNPKEFDPERF MDSFVDYRGQHYELLPFGSGRRICPGMPMGIAAVELGLLNLLYFFDWKLPDGMTHKDIDT EEAGTLTIVKKVPLKLVPVRVQ* AB024038 comp(26044-27630) CYP71B20 ESTs T04186, T45449, T21193, AA585920 (1 diff) T20987 (3 diffs) no GSSs 94% identical to CYP71B19 AB024038 21864-22760 MAISFLCFCLITLASLIFFAKKIKHLKWNLPPSPPKFPVIGNLHQIGELPHRSLQHLAER YGPVMLLHFGFVPVTVVSSREAAEEVLRTHDLDCCSRPKLVGTRLLSRNFKDVCFTPYGN EWKARRKFALRELFCLKKVQSFRHIREEECNFLVKQLSESAVNRSPVDLSKSLFWLTASI FFRVALGQNFHESNFIDKEKIEELVFEAETALASFTCSDFFPVAGLGWLVDWFSGQHKRL 
NDVFYKLDALFQHVIDDHLNPGRSKEHEDIIDSMLDAIHKEGKDSSLELIIDHIKGFLAN IFLAGIDTGALTMIWAMTELVKNPKLIKKVQGEIREQLGSNKARITEEDIDKVPYLKMVI KETFRLHPAAPLILPRETMAHIKVQGYDIPPKRRILVNVSAIGRDPKLWTNPEEFDPERF MDSSVDYRGQHYELLPFGSGRRICPGMPMGIAAVELGLLNLLYFFDWKLPDGMTHKDIDT EEAGTLTIVKKVPLQLVPVRVQ* AB024038 comp(31112-32720) CYP71B21 88% identical to 71B22 no ESTs no GSSs MSIFLCFLLLLPLFLVFYKRLLPSKGKLPPGPISLPIIGNLHQLGKSLHRSFYKLSQEYG PVMFLRFGVVPVVVFSTKEAAEEVLKTHDLETCTRPKLSATGLFTYNFKDIGFAQYGEDW REMRKLAMLELFSSKKLKAFRYIREEESELLVKKVTESAQTQTLVDLRKALFSYTASIVC RLAFGQNFHECDFVDMDKVEELVLESETNLGSFAFIDFFPAGLGWAIDRISGQHSRLHKA FARLSNFFQHVIDDHLKPWQSEDHSDIVGVMLDMINKESKVGSFKVTYDHLKGVMSDVFL AGVNAGAITMIWALTELTRHPRVMKKLQQEIRELLGDNKEKITEQDLEKVHYLKLVIQET FRLHPPAPLLLPRETMSDVKIQGYNIPKNTMIEINTYAIGRDPNCWTNPNEFIPERFVDS PIDYKGQHFELLPFGGGRRICPGMATGMTIVELGLLNVLYFFDWSLPYGMAIADINMEEA GAFVIAKKVPLELVPVLHY* AB024038 36984-38609 CYP71B22 clone: MTC11 N0 ESTS No GSSs 88% identical to CYP71B21 AB024038 (31112-32720) MSISLYFLLLLPLFLIFFKKLSPSKGKLPPGPLGLPIIGNLHQLGKSLHRSFHKLSQNYG PVMFLHFGVVPVVVVSTREAAEEVLKTHDLETCTRPKLTATKLFSYNYKDIGFAQYGDDW REMRKLAMLELFSSKKLKAFRYIREEESEVLVNKLSKSAETRTMVDLRKALFSYTASIVC RLAFGQNFHECDFVDMDKVEDLVLESETNLGSFAFTDFFPAGLGWVIDRISGQHSELHKA FARLSNFFQHVIDDHLKPGQSQDHSDIIGVMLDMINKESKVGSFQVTYDHLKGVMSDVFL AGVNAGAITMIWAMTELARHPRVMKKLQQEIREILGDNKEKITEQDLEKVHYLKLVIEET FRLHPPAPLLLPRETMSDLKIQGYNIPKNTMIEINTYSIGRDPNCWENPNDFNPERFIDS PVEYKGQHYELLPFGAGRRICPGMATGITIVELGLLNVLYFFDWSLPDGMKIEDIDMEEA GAFVVAKKVPLELIPTPHQW* AB024038 comp(40969-42839) CYP71B23 71% ident to 71B7 no ESTs no GSSs MSIFLCFLLLLLLLLVTIIFTRKSQSSKLKLPPGPPKLPIIGNLHYLNGLPHKCLLNLWK IHGPVMQLQLGYVPLVVISSNQAAEEVLKTHDLDCCSRPETIASKTISYNFKDIGFAPYG EEWRALRKLAVIELFSLKKFNSFRYIREEENDLLVKKLSEASEKQSPVNLKKALFTLSAS IVCRLAFGQNLHESEFIDEDSMEDLASRSEKIQAKFAFSNFFPGGWILDKITGQSKSLNE IFADLDGFFNQVLDDHLKPGRKVLETPDVVDVMIDMMNKQSQDGSFKLTTDHIKGIISDI FLAGVNTSATTILWAMTELIRNPRVMKKVQDEVRTVLGEKRDRITEQDLNQLNYFKLVIK ETFRLHPAAPLLLPREAMAKIKIQGYDIPEKTQIMVNVYAIGRDPDLWENPEEFKPERFV 
DSSVDYRGLNFELLPFGSGRRICPGMTMGIATVELGLLNLLYFFDWGLPEGRTVKDIDLE EEGAIIIGKKVSLELVPTRRQ* AB024038 COMP(45837-47529) CYP71B24 80% identical to 71B3 no ESTs GSS AQ011201 clone F23D14 (5 diffs/178 = 97%) B96063 4 diffs/126 = 97% these may be a closely related gene. MSILLYFIALLSLIIIKKIKDSNRNLPPSPLKLPVIGNLYQLRGLFHKCLHDLSKKHGPV LLLRLGFLDMVVISSTEAAEEALKVHDLECCTRPITNVTSKLWRDGQDIGLAPYGESLRE LRKLSFLKFFSTTKVRSFRYIREEENDLMVKKLKEAALKKSSVDLSQTLFGLVGSIIFRS AFGQRFDEGNHVNAEKIEDLMFEVQKLGALSNSDLFPGGLGWFVDFVSGHNKKLHKVFVE VDTLLNHIIDDHLKNSIEEITHDRPDIIDSLLDMIRKQEQGDSFKLTIDNLKGIIQDIYL AGVDTSAITMIWAMAELVKNPRVMKKVQDEIRTCIGIKQNEKIEEDDVDKLQYLKLVVKE TLRLHPAAPLLLPRETMSQIKIQGYNIPSKTILLVNVWSIGRDPKHWKNPEEFNPERFID CPIDYKGNSFEMLPFGSGRRICPGIAFAIATVELGLLNLLYHFDWRLPEEDKDLDMEEAG DVTIIKKVPLKLVPVLHH* AB024038 comp(75254-76832) CYP71B25 74% identical to 71B4 no ESTs no GSSs MAILQSFLLLLSLPFLFTLIYTKKMKESKRNLPPGPAKLPIVGNLHQLQGMVHRCLHELS KKHGPVMHLQLGFVPLVLISSSEAAEEALKTHDIECCTRPNTNAARVFSRNNKNIGLGAY SDEWRELRKVAVREYFSVKKVQSFRYVREEENHLMVKKLRDLALKQSPVDLSKTLFCLAA STVFRPVFGQSFSDNKHFSEEKIEELVFEAQKSLTFKFSDLFPIPGLGWFIGFVSGQHKG LHKVFIEVDNFLNHMIDDHQKQNQPQDRSDIVGSLLDMIHNQEQDKSFKLTIDHLKGITQ DIFLAGIDTSAITMIWAMAELVNNPRVMKKVQDEIRSCIGIKKERIEEEDVGKLQYLKLV IKETLRLHPAAPLLLPRETMADIKIQGYDIPRKTLLLVSAWSLGRDPKYWKNPEEFNPER FIDCPVDYKGHSFEFLPFGSGRRFCPGMASAIATIELTLLNLLYFFDWKLPEEMKDMNME ESGDVTIVKKVPLELLPVLYH* AB024038 comp(80407-82076) CYP71B26 68% identical to 71B9 very similar to Z33952, Z34037 (91% identical) but different gene MDSIWILSLLFFIIFLLLAAFKRKNHGKHRRIPSPPGFPIIGNLHQLGELQHQSLWKLSK KYGPVMLLKLGKVPTLILSSSETAKQALRDYDLHCCSRPSLAELSYNNLDMSSSPYNEYW KELRKLCSQELFSANKIQSIQPIKDEEVKKVIDSIAESSSLKNPVNLSKTFLALTTSVVC KAAFGVSFEGSVLNSDRFNKLVRDTFEMLGSFSASDFIPYVGWIIDKFNGLQGWRKKSFR DLDAFYEQIFDLHKEEKEVGSEDLVDVLLRLEKEEIVVGNGKLTRNHIKAILMNILLGGI DTSAITMTWAMAELAKNPRVMKKVQAEIRNQIKNKERISFDDTDKLEYLKMVIKETWRLH PPTPLLLPRDVITEFEINGYTIPAKTRLHVNVWAIGRDPDTWKDPEMFLPERFNDSNIDA KGQNFELLSFGSGRRICPGLYMGTTMVEFGLANMLYHFDWKLPEGMVVEDIDMEEAPGLT VSKKSELVLVPVKYLDH* AC007357 
28180-29804 CYP71B27 B11396 B21019 no ESTs MAILFCFFLVSLVTLVSSIFFKQIKNTKFNLPPSPPSLPIIGNLHHLTGLPHRCYHKLSI KYGPVILLHLGFVPVVVISLSEAAEAVLKTHDLECCSRPKTVGTGKLSYGFKDISFVPYS EYWREMRKLAVTELFSLKKVQSYRYIRGEEIEFMVKKLSESALKQSPVDLGSTFFSLTAS IIFRVALGRNFHESGSIIDQEGIEELVTAATTAIGSFTFSDFFPSGLGRFLDCLFRTQTN INKVSEKLDAFYQHVIDdHLKPSTLDSSGDIVALMLDMIKKKGHKDDFKLNVDNIKAVLM NIFLAGIDTGAITMIWAMTELVKKPLVMKRAQENIRGVLGLKRDRItEEDLCKFDCLKHI VKETLRLHPPVPFLVPRETISHIKIQGYDIPPKTQIQVNVWTIGRDPKrWTDPEEFRPER FANTCVDFRGQHFDFLPFGSGRRICPAISMAIATVELGLMNLLDFFDWRLPDGMKVEDID IEEAGNVTVVKKLLIYLVPLQRH* AC007357 34549-36103 CYP71B28 T20906 T76255 T14112 N65665 MSVFLCFLCLLPLILIFLKNLKPSKWKLPPGPKKLPIIGNLHQRRELHPRNSRNLSEKYG PIVFLRYGFVPVVVISSKEAAEEVLKTHDLECCSRPETVGTRAISYNFKDIGFAPYGEDW RTMRKLSVVELFSSKKLQSFRYIREEENDLCVKKLSDLASRRSLVNLEKTLFTLVGSIVC RIGFGINLRECEFVDEDSIDDLVHKSEDVIRNSIFSDFFPGLMGRLIEWIFSERKRLNRL YSEVDTFFQNILDDHLKPGRESSDIIDVMIDMMKKQEKEGDSFKFTTDHLKGMISDIFLA GVGTSSTTLIWAMTELIRNPRVMKKVQDEIRTTLGDKKERITEEDLNQLHYFKLMVKEIF RLHPAAPLLLPRETLSHVKIQGYDIPAKTQIMINAYAIARDPKLWTNPDEFNPDRFLDSS IDYRGLNFELLPFGSGRRICPGMTMGIAIVELGLLNLLYFFDWGLPEKEEAKEIITGNEV ALDLVQVFLH* B21125 GSS fragment very similar to 71B28 PSKWKFPPGPXELPIIXNLHQRRDLHPRTVRDLSEKYGPIVFLRYGIVPVVVISSKEAAE EVLKTHDLECCSRPETVGPRAISYNFKDIGFAPYGEDWRTMRKLSVVELISSKK AC007357 36686-38241 CYP71B29 GSS B21027 is probably the same gene MAIILCFLILLPLILIFLKKLKHSKWKLPPGPPKLPFIGNLHQLQELPPRNLNHKYGPVI LLRFGFVPLVVISSKEAAEEVLKIHDLECCSRPETAGTRKISYNFKDIGFAPYGEEWKAM RKLSVVELFTAKKHQYFRSIREEENDLLVKKLTELALTRSPVNLKKTLFTLVGSIVCRIG FGFNLHECEFIDENSISDLVDKSEILEMTSMFSDFFPGGIGRFIDWISGQNKRFDNVFSD LDTFFQNILDYHLKPGRKVADSSDIIDVVINMIKKQEKDGDSFKLTTDHLKGMISDIFLA GVSTSASTLIWAITELVRNRKVMKKVQEEIRTTLGDKKERITEQDLTNLHYFKLVVKEIF RLHPAVPFLLPRETLSHVKIQGYDIPAKTQIMINVYAIARDPKLWTNPDEFNPDRFLDSS IDYKGLNFELLPFGSGRRICPGMTMGITLVEFALLNLLYFFDWGLPEKEEADKIITGNGV AVGLVQVLLH* >CYP71B30P Clone T04D02 from the EU chr 3 project 82204-83671 missing N-term 80 AA not found between end of 71B5 and start of this sequence may be a pseudogene 
AL132958 82204-83671 also found on AL132966 no ESTs no GSSs LFQFSVSMFFYLYLSSSAKEVLKTHDLETCTRPKMVTTGLFSYNFKDIGFTQYGEEWGEM KKLIGLELFSPKTQKSFRYIREEESDLLVKKISKSAQTQTLVDLGKPLFSFTAGIIFRLA FGQNFRKCDFIDRERVEELVVESETNVGTLAFSDFFPTGLGWLIDRLSGQSSRMNKVFSK LTNLFEHVIDDHLKTEQHQDHSNLISAMLNMINKPTNIGSFKITSDHLKGVMSDVFLAGV NAGVITMIWTMTELTRHPRVMKKLQKKIRATLGSNKERITEEDLEKVEYLKMVIEETFRL HPPAPLLLPRLTMSDVTIQGYNIPKNTMIEINTYTIGRDPKCWTNPEEFIPERFSNTSIN YKGQHFELLPFGAGRRSCPGMSLGITILELGLLNLLYFFDWRLPNGMTIADIDMEEVGAL NIAKKVPLELVPTLHHS* CYP71B31 Clone T04D02 from the EU chr 3 project 84593-86302 GSSs AL092788 (clone T10J6) AL081333 (clone F2M19) No ESTs AL132958 84593-86302 also found on AL132966 MSMFLGLLFLFPLFFILFKNLLPPRKKLPPGPTGLPLIGNLHQLGRLLHSSLHKLSLEHG PVMLVRWGVVPMAVFSSNEAAKEVLKTHDLETCNRPKLVANGLFTHGYKDIGFTQYGEEW REMKKFVGLELFSPKKHKSFRYIREEEGDLLVKKISNYAQTQTLVDLRKSLFSYTASIIF REAFGQNFRECDYINMDKLEELVQETETNVCSLAFTDFFPRGLGWLVDRISGQHSRMNIA FSKLTTFFEDVIDELLKTKQLDDHSDLVTAMLDVINRPRKFGSLKITYDHLIAMMSDVVL AGVNAGTVTMIWTMTELTRHPRVMKKLQEEIRATLGPNKERITEEDLEKVEYLNLVIKES FRLHPPAPLLLPRETMSDIEIQGYHIPKNAHVKINTYAIGRDPKRWTNPEEFNPERFLNT SINYKGQHYELLPFGAGRRNCPGMTLGITILELGLLNILYYFDWSLPSGMTIKDIDMEED GALNIAKKVPLQLVPTLP* CYP71B32 Clone T04D02 from the EU chr 3 project 87497-89156 two in frame stop codons may be a pseudogene AL132958 chromosome 3, BAC clone T4D2 also found on AL132966 MSIFLLCILFLFPLFQIIFKKLLPSKSKLPPGPTGLPIIGNLHQLGRSLHSVFHKLSL*Y GPVMLLRFGVVPVVVFSSKEAAKEVLKTHDLDTCTRPKLVANGLFSRNFKDIGFTQYGED WREMKKLVGLELFSPKKHKSFRYIREEEGDLLVKKISNSAQTQTLIDLRKASFSFTAGTI FRLAFGQNFHQCDFMDMDRLEELVLEAETNGCILALTDFLPTGLGWLVDRISGRHSRMNK AFSKLTNLFQHVIHEQLKIGQLQDHSNLVSAMLDMIIRPSKFSSFNITSNHLIAMMSDVV LAGVNAGTITMVWIVTELIRHPRVMRKLQEEIRATLGSNKERIT*EDLQKVEYLNMVIKE TFRLHPPSPLLLPRETMSDIEIQGYHIPKNALIRINTYTIGRDLKCWSNPERFLNTSINY KGQDYKLLPFGAGRRSCPGMNLGITILELGLLNILYFFDWSFPNGMTIEDIDMEENGALN KTLELIPTLPSALVNKSIR* CYP71B33 Comp(479-2126) AP001298 one in frame stop may be a pseudogene no ESTs no GSSs 76% TO 71B26 NEARLY IDENTICAL TO Z34126 (2 DIFFS) 83% TO 
Z27299 MVNILILSLLFLVILLLLTAFKRAKRRQHRPIPSPPGFPIIGNLHQLGELPHQSLWKLSK KYGPVMLLNLGKVPTVILSSSETAKQALRDYDLHCCSRPSLAELSYNNRDISFSPYNDYW KELRKLCTQELFSAKNIHSTQHIKDEEVKKLIVSIAESASLKTPINLNKKCLALTVSIVC RIAFGVSFEGTVLSNDSFNKLIREAFEMLGSFSASDFIPYIGWIIDRFTGLQGRREKSV* DLDAFYEQIIDLHKEEKEVGSEDFVDLLLKLEKEETVVGNNNFTRNNVKAILMDILLAGI ETSAGTMAWTMAELAKNPLVMRKVQSEIRNKFENRELISFEDIEQLHYLKTVIKETWRLH PPAPLLLPREVMSEFEINGYTMQPKTQIHVNVWAIGRDPNTWKDPEEFLPERFIDSNIDT KGQNFELLPFGGGRRMCPAMYMGTTMVEFGLANLLYHFDWKLPEGMKVEDMDMEEAPGLT VNKKNDLLLVPVKYLDN* CYP71B34 comp(2977-4644) AP001298 86% TO 71B26 ESTs Z33952, Z34037, Z34791, two diffs with Z34126 MTNIWLLSLIFVICILVAVFNHKNRRNYQRTPPSPPGCPIIGNLHQLGELPHQSLWKLSK KYGPVMLLKLGRVPTVIVSSSETAKQALKIHDLHCCSRPGFAELSYNYLDIAFSPYDDYW KEVRKLAVQELFSSKQVHSIQPIKDEEVKKLIDSISESAAQKTPINLNKTLLALTVSVVC RTAFSVNFEGTVLNSERFNNIVREALEMLGSFSASDFIPYVGRIIDLLTGLQGRRERSMR DLDAFYEQMFDLHKQKKEEGSEDFVDLLLRLEKEEAVLGNDKLTRNHIKAILMDVLLAGM DTSAITMTWAMAELAKNPRVMKKVQSEIRSQIKNKERISFDDTDKLEYLKMVIKETWRLH PTTPLLIPREAMSEFEINGYTIPVKTRLHVNVWAIGRDPDTWKDPEVFLPERFTDNNIDA KGQHFELLPFGGGRRMCPAVYMGTTMVEFGLANLLYHFDWKLPEGMKVDDIDMEEAPGLT VNKKNELILVPTKFLDP* check for this end PTKFLDPKNNVITWNI* CYP71B35 Comp(4867-6557) AP001298 81% TO 71B26 no ESTs no GSSs MAHIWLLPLIFLVCILLAVFNHKKHPKYRQFPCPPGFPIIGNLHQIGELPHQTLWKLSKK YGPVMHLMLGRVPTVVVSSSDTARQVLRVHDLHCCTRPSLSELSYNYLDIAFSPYDDYWK EVRKLCVQELFSTKQVHSIQPIKDEEVKKMIDSIAESASQKNPVNLNNKCLELTVSVVCR TAFGVSFEGTVLNSDRFNKIVREALEMLGSFSAADFIPYVGWIIDVLTGLQGRRERSKRD LNAFFEQMFDLHKEGKKEGNEDFVDLLLRLEKEEAVLGNDKLTRNHIKAILLDVLLAGID TSAITMTWAMTELARNPRVMKKVQSEIRTQMGNRSMISFEDMDQLEYLKMVIKETWRLHP TTPLLLPREAMSEFDINGYTIPVKTRLHVNVWAIGRDPDTWKDPEVFLPERFMDNNIDAK GQHFELLPFGGGRRICPAIYMGTTMVEFGLANLLYHFDWKLPEGVEVKDIDVEEAPGLTV NKKNELLLVPEMRRSCG* CYP71B36 comp(8161-9842) AP001298 78% TO 71B26 No ESTs no GSSs MATILFLSLLFLSCILLAAFTHKKRQQHQRKPPSPPGFPIIGNLHQLGELPHQSLWRLSK KYGHVMLLKFGSIPTVVVSSSETAKQVLKIHDLHCCSRPSLALSYNYLDIAFSPFDDYWK ELRRICVQELFSVKRVQSFQPIKEDEVKKLIDSVSESASQGTPVNLSEKFTSLTVRVTCK 
ATFGVNFQGTVLNSDRFEKLIHDTYLFLGSFSASDYFPNGGWIIDWLTGLHGQRERSVRA LDAFYEQMFDLHKQGNKEGVEDFVDLLLRLEKEETVIGYGKLTRNHIKAILMNVLIGGIG TSAITMTWAMTELMRNPRVMKKVQSEIRNQIGKKSMITLDDIDQLHYLKMVINETWRLHP PSPFLIPRQVMSEFELNDYVIPVKTRLYVNVWAIGRDPDTWKDPEEFLPERFVNSSIDAK GQHFELLPFGSGRRMCPAMYMGTTMVEFGLANMLYHFDWKIPVGMVAEDIDLEESPGLNA SKKNELVLVPLKYLDH CYP71B37 comp(10654-12314) AP001298 100% IDENTICAL TO Z27299 96% to B22554 BE524870 M55C8STM developing seed cDNA clone M55C8 5' Length = 409 MATIWFLPLLFLSCLLLAALRLKKRRQHQRKPPSPPGFPIIGNLHQLGELPHQSLWSLSK KYGPVMLLKFGSIPTVVVSSSETAKQALKIHDLNCCSRPSLAALSYNYLDIVFSPFNDYW KELRRMCVQELFSPKQVHLIQPIREEEVKKLMNSFSESAAQKTPVNLSEKLASLTVGVIC KAAFGVSFQGTVLNSDNFDKLIHDAFLFLGSFSASDYFPNVGWIIDWLTGLQGQRERSVR GLDAFYEQMFDLHKQGNKEGVEDFVDLLLKLEKEETVLGYGKLTRNHIKAVLMNVLLGGI GTSAITMTWAMTELMRNPRVMKKVQSEIRNQIGGKSMICLDDIDQLHYLKMVINETWRLH PPAPLLVPREVMSEFEINGYTIPAKTRLYVNVWGIGRDPDTWKDPEEFLPERFVNSNIDA KGQNFELLPFGSGRRMCPAMYMGTTMVEFGLANLLYHFDWKLPEGMVVEDIDMEESPGLN ASKKNELVLVPRKYLN >CYP71B38 AL353865 chromosome 3, BAC clone T10D17 79% to 71B5 29822 MSIFLCFLLLLPLSLILFKKLLPSKGKLPPGPIGLPIIGNLHQLGKLLYKSFHKISQEYGPV 29637 29636 VLLRLGVVPVIVVSSKEGAEEVLKTHDLETCTRPKTAATGLFTYNFKDIGFAPFGDDWRE 29457 29456 MRKITTLELFSVKKLKSFRYIREEESELLVKKISKSVDETQNSSVDLRKVLFSFTASIIC 29277 29276 RLAFGQNFHQCDFVD 29232 MEKVEELVLESEANLGTFAFADFFPGGWLIDRISGQHSRVNKAFYK 29011 29010 LTNFYKHVIDDHLKTGQPQDHSDIVSVMLDMINKPTKADSFKVTYDHLKGVMS 28855 28753 DIFLAGVNGGANTMIWTLTELSRHPRVMKKLQEEIRAMLGPNKERITEEDLEKV 28556 28555 EYLKLVMVETFRLHPPAPLLLPRLTMSDIKIQGYNIPKNTMIQINTYAIGRDPKYWKQPG 28376 28375 EFIPERFLDSPIDYKGQHFELLPFGAGRRICPGMATGITMVELGLLNLLYFFDWSLPNGM 28196 28195 TIEDIDMEEDEGFAIAKKVPLVLIQTSHRW 28106 AB023038.1 CYP72A7 join(40525..40798,40924..41141,41228..41475,41550..41922, 42038..42463) MSFSVVAALPVLVAVVVLWTWRIVKWVWIKPKMLESSLKRQGLTGTPYTPLVGDIKRNVD MMMEARSKPINVTDDITPRLLPLALKMLNSHGKTFFIWIGPLPTIVITNPEQIKEVFNKV NDFEKASTFPLIRLLAGGLASYKGDKWASHRRIINPAFHLEKIKNMIPAFYHCCSEVVCQ WEKLFTDKESPLEVDVWPWLVNMTADVISHTAFGSSYKEGQRIFQLQGELAELIAQAFKK 
SYIPGSRFYPTKSNRRMKAIDREVDVILRGIVSKREKAREAGEPANDDLLGILLESNSEE SQGNGMSVEDVMKECKLFYFAGQETTSVLLVWTMVLLSHHQDWQARAREEVMQVLGENNK PDMESLNNLKQMTMIFNEVLRLYPPVAQLKRVVNKEMKLGELTLPAGIQIYLPTILVQRD TELWGDDAADFKPERFRDGLSKATKNQVSFFPFGWGPRICIGQNFAMLEAKMAMALILQK FSFELSPSYVHAPQTVMTTRPQFGAHLILHKL AB023038.1 CYP72A8 join(42938..43226,43333..43550,43632..43879,43950..44313, 44385..44813) MSDTKISAVAVAAAVVVVTTVTVWIWKGLNVAWLRPKKNEAYLKRQGLSGTPFTFLVGDI KREASMVEQEKSRPINLTDDYTHRVMPLIQQTVKDHGKTSYMWMGPIASVIVTKPEHIKD VLNRVYDFPKPPVHPIVELFATGVALYEGEKWSKHRKIINPSFHLEKLKIMIPAFYESCS EMISKWEKLVTEQGSSNEIDVWPYLGDLTSDVISRTAFGSSYEEGKRIFELQEEQGRRVL KALELAFIPGFRFLPTKNNLRMRQINKEVKSRLREIIMKRQRGMDTGEAPKNDLLGILLE SNSGDHGMSIEDVVEECRLFHFAGQETTAVLLVWTMIMLSHHQKWQDQAREEILKVIGKN NKPNFDALSRLKTMSMILNEVLRLYPPGILLGRTVEKETKLGEDMTLPGGAQVVIPVLMV HRDPELWGEDVHEFNPERFADGISKATKNQVSFLPFGWGPRFCPGQNFALMEAKMALVLI LQRFSFELSPSYTHAPHTVLTLHPQFGAPLIFHML AB023038.1 45458-47366 CYP72A9 join(45293..45610,45873..46031,46142..46389,46497..46866, 46944..47369) Genbank entry has extra seq at N-term BE525861 M63N24STM developing seed cDNA clone 600034395R1 5'Length = 233 MEIVIASLALVVVLWCIWRILEWVWLKPKMLESYLRRQGLVGTRYTPLVGDVRRSFSMLK EARSKPMKPTDDLISLVMPYSFHMLNTYGKTFFTWSGPIPAITIMNPQLIKEVYNKFYDF EKTHTFPLTSLLTDGLANADGDKWVKHRKIINPAFHFEKIKNMVPTFYKSCIEVMCEWEK LVSDKGSSCELDVWPWIVNMTGDVISRTAFGSSYKEGQRIFILQAELAHLIILALGKNYI PAYXHFPTKNNRRMKTIVKEIQVILRGIISHREKARDAGEAPSDDLLGILLKSNSEQSKG NGLNMEEIMEECKLFYFAGQETTSVLLAWTMVLLSQHQDWQARAREEVMQVFGHNKPDLQ GINQLKVMTMIIYEVLRLYPPVIQMNRATHKEIKLGDMTLPGGIQVHMPVLLIHRDTKLW GDDAAEFKPERFKDGIAKATKNQVCFLPFGWGPRICIGQNFALLEAKMALALILQRFSFE LSPSYVHSPYRVFTIHPQCGAHLILHKL AB023038.1 CYP72A10 join(47816..48092,48192..48409,48497..48747,48892..49237, 49322..49747) MEISVACVTVSVVVVVVSWWVWRTLQWVWFKPKMLESYLRRQGLAGTPYTPLIGDLKRNV NMLTEATSKPIKLTEDITPRVLPHPFQMLKTHGRTFFTWLGPKPTITIMDPELIKEVFNK VYDYPKAQTFLLGRLIATGIINYDGDKWAKHRRIINPAFHIEKIKNMVPAFHQSCSDVVG EWSKLVSDKGSSSCEVDVWPWLVSMTGDVISRTAFGSSYKEGQRIFELQAELVHLILQAF 
WKVYIPGYRYLPTKSNRRMKAAAREIQVILKGIVNKRLRAREAGKAAPNDDLLGILLESN LGQAKGNGMSTEDVMEECKLFYFAGQETTSVLLVWAMVLLSHHQDWQARAREEVKQVFGD KEPDTECLSQLKVMTMILYEVLRLYPPVTHLTRAIDKEMKLGDLTLPAGVHISLPIMLVQ RDPMLWGTDAAEFKPERFKDGLSKATKSQVSFFPFAWGPRICIGQNFAMLEAKMAMALIL QTFTFELSPSYVHAPQTVVTIHPQFGAHLILRKL AB023038.1 CYP72A11 join(50166..50442,50543..50760,50866..51113,51248..51617, 51700..52125) MEISVASVTVSVAVVVVSWWVWRTLQWVWFKPKMLESYLRRQGLAGTPYTPLVGDLKKNF SMRAEARSKPINLTDDITPRIVPYPLQMLKTHGRTFFTWFGAIPTITIMDPEQITEVLNK VYDFQKAHTFPLGRLIATGVLSYDGDKWAKHRRIINPAFHLEKIKNMVPAFHQSCSEIVC KWDKLVSDKESSCEVDVWPGLVSMTADVISRTAFGSSCVEGQRIFELQAELAQLIIQTVR KAFIPGYSYLPTKGNRRMKAKAREIQVILRGIVNKRLRAREAGEAPNDDLLGILLESNLG QTKGNGMSTEDLMEECKLFYFVGQETTSVLLVWTMVLLSQHQDWQARAREEVKQVFGDKE PDAEGLNQLKVMTMILYEVLRLYPPIPQLSRAIHKEMELGDLTLPGGVLINLPILLVQRD TELWGNDAGEFKPDRFKDGLSKATKNQASFFPFAWGSRICIGQNFALLEAKMAMALILQR FSFELSPSYVHAPYTVFTIHPQFGAPLIMHKL AB023038 lone exon 5 405-503 52192-52485 CYP72A12P GSS B24203 ISLPIMLVQRDTMMWEFKPERFKDGLSKATKSHVSFFPFAWGPKICIGQNFAMLEAKMAM ALILQRFSFELSPSYVHAPQIVVTIHPQFGAHLILRKL AB023038.1 CYP72A13 join(52920..53196,53299..53516,53613..53860,53997..54366, 54446..54871) BE529090 M73J04STM developing seed cDNA clone 600038313R1 5'Length = 254 BE526012 M64O21STM developing seed cDNA clone 600034588R1 5'Length = 246 MEISVASVTVSVAVVVVSWWVWRTLQRVWLKPKMLESYLRRQGLAGTPYTPLVGDLKRNF SMLAEARSKPINLTDDITPRIVPYPLQMLKTHGRTFFTWFGPIPTITIMDPEQIKEVFNK VYDFQKAHTFPLGRLIAAGLVSYDGDKWTKHRRIINPAFHLEKIKNMVPAFHQSCSEIVG EWDKLVTDKQSSCEVDIWPWLVSMTADVISRTAFGSSYKEGQRIFELQAELAQLIIQAFR KAIIPGYRYFPTKGNRRMKAAAREIKFILRGIVNKRLRAREAGEAPSDDLLGILLESNLG QTKGNGMSTEELMEECKLFYFAGQETTTVLLVWTMVLLSQHQDWQARAREEVKQVFGDKE PDAEGLNQLKVMTMILYEVLRLYPPVVQLTRAIHKEMQLGDLTLPGGVQISLPILLIQRD RELWGNDAGEFKPDRFKDGLSKATKNQVSFFPFAWGPRICIGQNFALLEAKMAMTLILRK FSFELSPSYVHAPYTVLTTHPQFGAPLILHKL AB023038.1 CYP72A14 join(62438..62714,62831..63048,63150..63397,63534..63903, 63997..64422) MEISVSSVTFSLAVVVVSWWVWRTLKWVWFTPKMLERSLRRQGLSGTSYTPLIGDFKKMI 
SMFIEATSKPIKPTDDITPRVMPHPLQMLKTHGRTNLTWFGPIPTITIMDPEQIKEVFNK VYDFQKAHTFPLSKILGTGLVSYDGDKWAQHRRIINPAFHLEKIKNMVHVFHESCSELVG EWDKLVSDKGSSCEVDVWPGLTSMTADVISRTAFGSSYREGHRIFELQAELAQLVMQAFQ KFFIPGYSYLPTKGNRRMKTAAREIQDILRGIINKRERARESGEAPSEDLLGILLESNLG QTEGNGMSTEDMMEECKLFYLAGQETTSVLLVWTMVLLSQHQDWQARAREEVKQVFGDKQ PDTEGLNQLKVMTMILYEVLRLYPPVVQLTRAIHKEMKLGDLTLPGGVQISLPVLLVHRD TELWGNDAGEFKPERFKDGLSKATKNQVSFFPFAWGPRICIGQNFTLLEAKMAMSLILQR FSFELSPSYVHAPYTIITLYPQFGAHLMLHKL AB023038.1 CYP72A15 join(65370..65646,65763..65980,66066..66313,66403..66772, 66845..67270) MEISVASVTISVVLAVVSWWIWRTLQWVWFKPKMLEHYLRRQGLAGTPYTPLVGDLKKNF TMLSEARSKPLKLTDDISPRVVPYPLQMFKTYGRTYFTWFGPIPTITIMDPEQIKEVFNK VYDFQKPHTFPLATIIAKGLANYDGDKWAKHRRIINPAFHIEKIKNMVPAFHQSCREVVG EWDQLVSDKGSSCEVDVWPGLVSMTADVISRTAFGSSYKEGQRIFELQAELAQLIIQAFR KAFIPGYSYLPTKSNRRMKAAAREIQVILRGIVNKRLRAREAGEAPSDDLLGILLESNLR QTEGNGMSTEDLMEECKLFYFAGQETTSVLLVWTMVLLSQHQDWQARAREEVKQVFGDKE PDAEGLNQLKQMTMILYEVLRLYPPVTQLTRAIHKELKLGDLTLPGGVQISLPILLVQHD IELWGNDAAEFNPDRFKDGLSKATKSQVSFFPFAWGPRICIGQNFALLEAKMAMALILRR FSFEISPSYVHAPYTVITIHPQFGAQLIMHKL AI993723 701497314 91% to 72A13 probable new seq.C-term TLPGGVQLTLPILLIQRDRDLWGNDAGESKPDRLKDGLSKATKNQVSFFPFAWXPRTCIG QNFALLEAKMAMTLILRKFSFELSPSYVHAPYTVLTTHPQFGAQLMLHKL* AW004264 701669957 89% to 72A15 may be new P450 sequence C-term QHDIELWGNDAAEVKPDRVKDGLSKATKSQVSFFPFAWGPRICIGQNFALLEAKRARALI LRRFSFEISPS*CHAPDTGITSHPQFGAQLIMHKR* AC003105 F18A8.8 CYP72B also B29839, B62694, B28185 GSS fragments MEEESSSWFIPKVLVLSVILSLVIVKGMSLLWWRPRKIEEHFSKQGIRGPPYHFFIGNVK ELVGMMLKASSHPMPFSHNILPRVLSFYHHWRKIYGATFLVWFGPTFRLTVADPDLIREI FSKSEFYEKNEAHPLVKQLEGDGLLSLKGEKWAHHRKIISPTFHMENLKLLVPVVLKSVT DMVDKWSDKLSENGEVEVDVYEWFQILTEDVISRTAFGSSYEDGRAVFRLQAQQMLLCAE AFQKVFIPGYRFFPTRGNLKSWKLDKEIRKSLLKLIERRRQNAIDGEGEECKEPAAKDLL GLMIQAKNVTVQDIVEECKSFFFAGKQTTSNLLTWTTILLSMHPEWQAKARDEVLRVCGS RDVPTKDHVVKLKTLSMILNESLRLYPPIVATIRRAKSDVKLGGYKIPCGTELLIPIIAV HHDQAIWGNDVNEFNPARFADGVPRAAKHPVGFIPFGLGVRTCIGQNLAILQAKLTLAVM IQRFTFHLAPTYQHAPTVLMLLYPQHGAPITFRRLTNHED 
AC007651 CYP72C1 88180-91153 AC051629 chromosome I clone F6I1 45% identical to 72A9 no ESTs no GSSs BE522655 M27F12STM developing seed cDNA clone M27F12 5'Length = 396 WVWRAVNWVWLRPKRMEKYLKIQGFSGNSYIILMGDMR 387 3 diffs with 72C1 MLEIITVRKVFLIGFLILILNWVWRAVNWVWLRPKRLEKYLKKQGFSGNSYRILMGDMRE SNQMDQVAHSLPLPLDADFLPRMMPFLHHTVLKHGKKCFTWYGPYPNVIVMDPETLREIM SKHELFPKPKIGSHNHVFLSGLLNHEGPKWSKHRSILNPAFRIDNLKSILPAFNSSCKEM LEEWERLASAKGTMELDSWTHCHDLTRNMLARASFGDSYKDGIKIFEIQQEQIDLGLLAI RAVYIPGSKFLPTKFNRRLRETERDMRAMFKAMIETKEEEIKRGRGTDKN (FRAME SHIFT) QRLLFSMLASNTKTIKEQGPDSGLSLDDLIDDCKAFYLAGQNVTSSLFVWTLVALSQHQD WQNKARDEISQAFGNNEPDFEGLSHLKVVTMILHEVLRLYSPAYFTCRITKQEVKLERFS LPEGVVVTIPMLLVHHDSDLWGDDVKEFKPERFANGVAGATKGRLSFLPFSSGPRTCIGQ NFSMLQAKLFLAMVLQRFSVELSPSYTHAPFPAATTFPQHGAHLIIRKL* U71080 CYP73A5, U93215 genomic seq. MDLLLLEKSLIAVFVAVILATVISKLRGKKLKLPPGPIPIPIFGNWLQVGDDLNHRNLVD YAKKFGDLFLLRMGQRNLVVVSSPDLTKEVLLTQGVEFGSRTRNVVFDIFTGKGQDMVFT VYGEHWRKMRRIMTVPFFTNKVVQQNREGWEFEAASVVEDVKKNPDSATKGIVLRKRLQL MMYNNMFRIMFDRRFESEDDPLFLRLKALNGERSRLAQSFEYNYGDFIPILRPFLRGYLK ICQDVKDRRIALFKKYFVDERKQIASSKPTGSEGLKCAIDHILEAEQKGEINEDNVLYIV ENINVAAIETTLWSIEWGIAELVNHPEIQSKLRNELDTVLGPGVQVTEPDLHKLPYLQAV VKETLRLRMAIPLLVPHMNLHDAKLAGYDIPAESKILVNAWWLANNPNSWKKPEEFRPER FFEEESHVEANGNDFRYVPFGVGRRSCPGIILALPILGITIGRMVQNFELLPPPGQSKVD TSEKGGQFSLHILNHSIIVMKPRNC* X92510 CYP74A, also AB007647, Y12636| ESTs: AA394958, N65720 MASISTPFPISLHPKTVRSKPLKFRVLTRPIKASGSETPDLTVATRTGSKDLPIRNIPGN YGLPIVGPIKDRWDYFYDQGAEEFFKSRIRKYNSTVYRVNMPPGAFIAENPQVVALLDGK SFPVLFDVDKVEKKDLFTGTYMPSTELTGGYRILSYLDPSEPKHEKLKNLLFFLLKSSRN RIFPEFQATYSELFDSLEKEAFPLRESGFRRFQRRNRLLFLGSSFLRDESRRYKLKADAP GLITKWVLFNLHPLLSIGLPRVIEEPLIHTFSLPPALVKSDYQRLYEFLRIRGEILVEAD KLGISREEATHNLLFATCFNTWGGMKILFPNMVKRIGRAGHQVHNRLAEEIRSVIKSNGG ELTMGAIEKMELTKSVVYECLRFEPPVTAQYGRAKKDLVIESHDAAFKVKAGEMLYGYQP LATRDPKIFDRADEFVPERFVGEEGEKLLRHVLWSNGPETETPTVGNKQCAGKDFVVLVA RLFVIEIFRRYDSFDIEVGTSPLGSSVNFSSLRKASF* Z97339 CYP74B2 at4g15440 MAATSPRPPPSTSLTSQQPPSPPSQLPLRTMPGSYGWPLVGPLSDRLDFQGPDKFFRTRA 
EKYKSTVFRTNIPPTFPFFGNVNPNIVAVLDVKSFSHLFDMDLVDKRDVLIGDFRPSLGF YGGVRVGVYLDTTEPKHAQIKGFAMETLKRSSKVWLQELRSNLNIFWGTIESEISKNGAA SYIFPLQRCIFSFLCASLAGVDASVSPDIAENGWKTINTWLALQVIPTAKLGVVPQPLEE ILLHTWPYPSLLIAGNYKKLYNFIDENAGDCLRLGQEEFGLTRDEAIQNLLFVLGFNAYG GFSVFLPSLIGRITGDNSGLQERIRTEVRRVCGSGSDLNFKTVNEMELVKSVVYETMRFS PPVPLQFARARKDFQISSHDAVFEVKKGELLCGYQPLVMRDANVFDEPEEFKPDRYVGET GSELLNYLYWSNGPQTGTPSASNKQCAAKDIVTLTASLLVADLFLRYDTITGDSGSIKAV VKAK* AL133421 AB005249 CYP75B1 chromosome 5, BAC clone F13G24.190 join(67348..67797,68521..68961,69040..69210,69291..69770) no ESTs no GSSs AF241643 (exon 1) AF241644 (exon 2) AF241645 (exon 3) AF241646 (exon 4) AF241647 (exon 1, mutant) AF155171 BE522904 M30B1STM developing seed cDNA clone M30B1 5' Length = 360 BE524863 M55C12STM developing seed cDNA clone M55C12 5' Length = 411 BE527312 M68E17STM developing seed cDNA clone 600036167R1 5' Length = 247 BE524171 M46G4STM developing seed cDNA clone M46G4 5' Length = 378 BE525201 M59G6STM developing seed cDNA clone M59G6 5' Length = 379 MATLFLTILLATVLFLILRIFSHRRNRSHNNRLPPGPNPWPIIG NLPHMGTKPHRTLSAMVTTYGPILHLRLGFVDVVVAASKSVAEQFLKIHDANFASRPP NSGAKHMAYNYQDLVFAPYGHRWRLLRKISSVHLFSAKALEDFKHVRQEEVGTLTREL VRVGTKPVNLGQLVNMCVVNALGREMIGRRLFGADADHKADEFRSMVTEMMALAGVFN IGDFVPSLDWLDLQGVAGKMKRLHKRFDAFLSSILKEHEMNGQDQKHTDMLSTLISLK GTDLDGDGGSLTDTEIKALLLNMFTAGTDTSASTVDWAIAELIRHPDIMVKAQEELDI VVGRDRPVNESDIAQLPYLQAVIKENFRLHPPTPLSLPHIASESCEINGYHIPKGSTL LTNIWAIARDPDQWSDPLAFKPERFLPGGEKSGVDVKGSDFELIPFGAGRRICAGLSL GLRTIQFLTATLVQGFDWELAGGVTPEKLNMEESYGLTLQRAVPLVVHPKPRLAPNVY GLGSG AC003680 F17K2.9 CYP76C1 also D78600 MDIISGQALLLLFCFILSCFLIFTTTRSGRISRGATALPPGPPRLPIIGNIHLVGKHPHR SFAELSKTYGPVMSLKLGSLNTVVIASPEAAREVLRTHDQILSARSPTNAVRSINHQDAS LVWLPSSSARWRLLRRLSVTQLLSPQRIEATKALRMNKVKELVSFISESSDREESVDISR VAFITTLNIISNILFSVDLGSYNAKASINGVQDTVISVMDAAGTPDAANYFPFLRFLDLQ GNVKTFKVCTERLVRVFRGFIDAKIAEKSSQNNPKDVSKNDFVDNLLDYKGDESELSISD IEHLLLDMFTAGTDTSSSTLEWAMTELLKNPKTMAKAQAEIDCVIGQNGIVEESDISKLP 
YLQAVVKETFRLHTPVPLLIPRKAESDAEILGFMVLKDTQVLVNVWAIGRDPSVWDNPSQ FEPERFLGKDMDVRGRDYELTPFGAGRRICPGMPLAMKTVSLMLASLLYSFDWKLPKGVL SEDLDMDETFGLTLHKTNPLHAVPVKKRANIN AC003680 F17K2.10 CYP76C2 MDIIFEQALFPLFCFVLSFFIIFFTTTRPRSSRKVVPSPPGPPRLPIIGNIHLVGRNPHH SFADLSKTYGPIMSLKFGSLNTVVVTSPEAAREVLRTYDQILSSRTPTNSIRSINHDKVS VVWLPPSSSRWRLLRKLSATQLFSPQRIEATKTLRENKVKELVSFMSESSEREEAVDISR ATFITALNIISNILFSVDLGNYDSNKSGVFQDTVIGVMEAVGNPDAANFFPFLGFLDLQG NRKTLKACSERLFKVFRGFIDAKLAEKSLRDTNSKDVRERDFVDVLLDLTEGDEAELNTN DIVHLLLDLFGAGTDTNSSTVEWAMAELLRNPETMVKAQAEIDCVIGQKGVVEESDISAL PYLQAVVKETFRLHPAAPLLVPRKAESDVEVLGFMVPKDTQVFVNVWAIGRDPNVWENSS RFKPERFLGKDIDLRGRDYELTPFGAGRRICPGLPLAVKTVPLMLASLLYSFDWKLPNGV GSEDLDMDETFGLTLHKTNPLHAVPVKKRGRN AC003680 F17K2.11 CYP76C3 MSLPLYFLLTLFFFFFATAKTRRSSSTGTLPPGPPILPLVGNIFQLGFNPHRSLAAFSKT YGPIMSLKLGRLTAVVISSPEAAKEALRTHDHVMSARTFNDALRAFDHHKHSIVWIPPSA RWRLKKTITKYLLSPQNLDAIQSLRMRKVEELVSLVNEFRERGEAIDLARASFVTSFNII SNALFSVDLATYDSNSSSYEFHNTVVHLTDIAGIPNVGDYFQYMRFLDLQGTRKKAVLCI EKLFRVFQEFIDARLAKRFSRTEKEPKEASSIDMLDSLLDLTQQNEAELTMNDLKHLLLD VFVAGTDTNSSTMEWAMTELFRSTEKMVKAQSEIRQVIGQNGFVQESDIPSLPYLQAIVK ETLRLHPAAPLIPRKSESDVQIMGFLVPKNTQVVVNVWAIGRDASVWENPMKFEPERFLL RETDVKGRDFELIPFGSGRRMCPGISMALKTMHMVLASLLYSFDWKLQNGVVPGNIDMSE TFGLTLHKAKSLCAVPVKKPTISSSY AC003680 F17K2.8 CYP76C4 MDIISGQALFLLFCFISSCFLISTTARSRRSSGRAATLPPGPPRLPIIGNIHQVGKNPHS SFADLAKIYGPIMSLKFGCLNSVVITSPEAAREVLRTHDQILSGRKSNDSIRCFGHEEVS VIWLPPSSARWRMLRKLSVTLMFSPQRTEATKALRMKKVQELVSFMNESSERKEAVDISR ASYTTVLNIISNILFSVDLGSYDSKKSNEFQDTVIGAMEAAGKPDAANYFPFMGFLDLQG NRKAMRGLTERLFRVFRGFMDAKIAEKSLGNYSKDVSNRDFLDSLLILNEGDEAELDNND IEHLLLDMFTAGTDTSSSTLEWAMAELLRNPKTMVKAQAEMDRVLGQNSVVQESDISGLP YLQAVVKETFRLHPAAPLLVPRKAESDVEVLGFMVPKDTQVLVNVWAIGRDPSVWENPSQ FEPERFMGKDIDVKGRDYELTPFGGGRRICPGLPLAVKTVSLMLASLLYSFDWKLPNGVV SEDLDMDETFGITLHRTNTLYAIPVKKQTIN AC010164 chromosome I clone F14M2, WORKING DRAFT SEQUENCE, 76C5 93429-96444 also on AC069299 from 49100-52100 region MDIISGQPLFLIFCFILSCLLFFTTARSRRSPCQLSKSPPGPPRLPIIGNIHLVGKNPHH 
SFTDLSKTYGPVMSLKLGCLNSVVIASRDAVREVLKTHDQILSGRYISEATKSNNHHEFS VGWIHPSSSRFFRMLRKLSATQLFSPQCIQATKALRMKKVQELVNFLSESCEREEAVDIS HVSFVTALNIISNILFSVNLGSYDSKNSSAFQEMVIGYQESIGNPDLANFFPFMRFLDLQ GNSKKMRESSGRLLQVFREFYDARIVEKSSRSVEKDVSSKDFLDVLIDLQQGDETEINI DEIEHLLLDMFVAGTDTNSSTVEWAMAELLGNPKTMTKVQDEINHVIGQNGDFQESDISK LPYLKAVVKETFRLHPAAPFLLQRKAETNVEILGFTVLKDSQVLVNVWAIGRDPLVWENP THFEPERFLGKEIDVKGTDYELTPFGAGRRICPGLPLAMKTVHLMLASLLYTFEWKLPNG VGSEDLDMEETFGLTVHKTNPLLAVPLKKRA AC010164 chromosome I clone F14M2, WORKING DRAFT SEQUENCE, 76C6 comp(88950-90642) also on AC069299 54900-56600 region MDIISGQPMFLLFCFILSCFLFFTAARSRRSPTQVLSKSPPGPPRLPIIGNIHLVGKNPH HSFTDLSKTYGPVMSLKLGYLNSVVITSRDAAREVLKAHDQILSGRYITQATKSNNHHEF SVGWIHPSSPRFRXVEEMTFTQLFSPQRIEATKALRMKKVQELVNFLSESSERGEAVDIS RASFVTALNIISNILFSVNLGSYDSKNSSAFQEMVIGYMESIGNPDVSNFFPFMRLLDLQ GNSKKMKEYSGKLLQVFREFYDARILENSSRIDEKDVSSRDFLDALIDLQQGDESEINID EIEHLLLDMFLAGTDTNSSTVEWAMTELLGNPKTMTKVQDEINRVIRQNGDVQESHISKL PYLQAVIKETFRLHPAAPFLLPRKAERDVDILGFHVPKDSHVLVNVWAIGRDPNVWENPT QFEPERFLGKDIDVKGTNYELTPFGAGRRICPGLPLALKTVHLMLASLLYTFEWKLPNGV GSEDLDMGETFGLTVHKTNPLLACL* CYP76C7 58% to 76C3 clone T27I15 contig 5 European Chr. 
3 project ESTs BE524967 M57A12STM developing seed cDNA clone M57A12 5' Length = 320 BE523420 M36G1STM developing seed cDNA clone M36G1 5' Length = 339 7496 MDIVAIVLSLLFIFFLFFFFYTTGKSCPGGAKNPPGPSKLSLLRNILQTVEKPHR 7332 7331 SLADLSRIYGSVMSFKLGCLTTVVISSPETAKEVLKTHDHVLSYRVSSDPVRAAGHHELS 7152 7151 LLWIPPLARWR 7119 6789 FLRKITRNQLFSTQRLEATSAIRTRKVQELMNFVNKCCERREAVNISRASFITSLNIIS 6610 6609 NALFSTNLANFDDSKTFHDFQNVVIRMMEISGKPNLADFFPFLGFLDLQGARKEARLLMH 6430 6429 KLFRVFQGFIDTKRSSTSRNNNDMLDSLLDIAHKKESELDDNNIKHLLL 6283 DLFLAGVDTSSSAVEWAMAELLRNPKMIVKVQEEIRQVIGLKGTVQDLDIVKLP 6047 6046 YLQAVVKESLRLHPPAPFLVPRKSESDDVQIFEFLIPKNTQ 5912 5798 VLVNVWAIGRDPNVWKNPTQFEPERFLGRGIDVKGNHFELIPFGAGRRICPG 5619 5618 MPLAFRIMHLVLASLLYGFDWEYQNGVVPENVDMNEAFGATLHKAEPLCIVPIKKRV* 5451 CYP76C8P clone T27I15 contig 5 50% to 76C1 two in frame stops and missing C-term and small deletion near EXXR motif mean this is probable pseudogene EXXR is also ETLL no R present. This is always an R. MKHTIAL 4993 ALLLILSFLFTYLILVAVKSRRRRSSSLALVPGPPGPSGWPIIGNLLQIIGKAPHRSFA 4817 4816 DLSRVYGPVMSLRLGSLTKVVISSPDAAREVL*TLDHVLSGQTFSKTVRAFGHHEVSVAW 4637 4636 LPSTSSRWR 4610 4581 VPIRLWILATKLFSREHHKATKSVRSKKAKELVTF 4444 4466 IIQRGERGFSVDIARTC 4386 4501 FITSLNVDSNVVFSIDLGSYDP 4322 4321 RASTELQDSLSRMMKTMGKPNLANYFPSLEFLDLQGIRKEMKVMCSERLFQVFQGLIDA 4145 4144 RIAERSSQIGPRVDALRGDLLDSVIDLIQEEGSEVDMNDIKHFLCV 4013 3769 RTCFIARTETNSTIVEWTLAELLHNPIAMANTKVETNFIVGQ 3653 small gap here 2632 AVVTETLLLHPPSLFLIPHKFESNIEILGYPIPENTQILVNNWAIERDSIIWENAERFKP 2453 2452 *RFLGRGIETIGKNFEMILFGAGLRICPGISLALRIAPLMLASLIYSFEWYPKNVK 2285 missing C-terminal exon CYP76G1 Clone F08J02 from the EU Chr 3 project comp(64780-66639) 43% to 76A6, 38% to 76B1, 42% to 76C2, 40% to 76D1, 41% to 76E1, 41% to 76F1 AL132969 chromosome 3, BAC clone F8J2 1-514 66639-64780 MINQLTKNELIGLFTSIAVLIYVTCLFYTKRCRTRLPPGPNPWPVIGNIFQLAGLPPHDS LTKLSRRHGPIMTLRIGSMLTVVISSSEVAREIFKKHDAALAGRKIYEAMKGGKSSDGSL ITAQYGAYWRMLRRLCTTQFFVTRRLDAMSDVRSRCVDQMLRFVEEGGQNGTKTIDVGRY 
FFLMAFNLIGNLMFSRDLLDPDSKRGSEFFYHTGKVMEFAGKPNVADFFPLLRFLDPQGI RRKTQFHVEKAFEIAGEFIRERTEVREREKSDEKTKDYLDVLLEFRGGDGVDEEPSSFSS RDINVIVFEMFTAGTDTTTSTLEWALAELLHNPRTLTKLQTELRTYFKSSNQKLQEEDLP NLPYLSAVIMETLRLHPPLPFLVPHKAMSTCHIFDQYTIPKETQVLVNVWAIGRDPKTWI DPIMFKPERFISDPDARDFKGQDYEFLPFGSGRRMCPALPLASRVLPLAIGSMVRSFDWA LENGLNAEEMDMGERIGITLKKAVPLEAIPIPYRGT* CYP77A4 AL162972 25962-27500 clone T1E3 also on AL162875 T32M21 GSS B97241 (65% identical to 77A3 N-term) AL082681 BAC F5F6. Rest of sequence will be on SP6 end of clone F5F6, AL083526 SP6 end of BAC F7B10 MFPLISFSPTSLDFTFFAIIISGFVFIITRWNSNSKKRLNLPPG PPGWPVVGNLFQFARSGKPFFEYAEDLKKTYGPIFTLRMGTRTMIILSDATLVHEALI QRGALFASRPAENPTRTIFSCNKFTVNAAKYGPVWRSLRRNMVQNMLSSTRLKEFGKL RQSAMDKLIERIKSEARDNDGLIWVLKNARFAAFCILLEMCFGIEMDEETIEKMDEIL KTVLMTVDPRIDDYLPILAPFFSKERKRALEVRREQVDYVVGVIERRRRAIQNPGSDK TASSFSYLDTLFDLKIEGRKTTPSNEELVTLCSEFLNGGTDTTGTAIEWGIAQLIANP EIQSRLYDEIKSTVGDDRRVDEKDVDKMVFLQAFVKELLRKHPPTYFSLTHAVMETTT LAGYDIPAGVNVEVYLPGISEDPRIWNNPKKFDPDRFMLGKEDADITGISGVKMIPFG VGRRICPGLAMATIHVHLMLARMVQEFEWCAHPPGSEIDFAGKLEFTVVMKNPLRAMV KPRI CYP77A9 AL162875 87274-88803 T32M21_230 no GSSs no ESTs MLDISNYVFFFFTLVTILLSCLVYILTRHSHNPKCANLPPGPKG WPVVGNLLQFARSGKQFFEYVDEMRNIYGPIFTLKMGIRTMIIISDANLAHQALIERG AQFATRPAETPTRKIFSSSDITVHSAMYGPVWRSLRRNMVQNMLCSNRLKEFGSIRKS AIDKLVEKIKSEAKENDGLVWVLRNARFAAFCILLDMCFGVKMEEESIEKMDQMMTEI LTAVDPRIHDYLPILTPFYFKERKNSLELRRKLVQFVVGFIEKRRLAIRNLGSDKTAS SFAYLDTLFDLRVDGRETSPSDEDLVTLCSEFLNAGTDTTGTAIEWGIAELISNPKIQ SRLYDEIKSTVGDDRTVEEKDLNKMVFLQAFVKELLRRHPPTYFTLTHGVTEPTNLAG YDIPVGANVEFYLPGISEDPKIWSKPEKFDPDRFITGGEDADLTGVAGVKMMPFGIGR RICPGLGMAVVHVELMLSRMVQEFEWSSYPPESQVDFTGKLVFAVVMKNPLRARVKARV CYP77A5P pseudogene AP000414 chromosome 3 clone MIE15 ECYLPGISEDPKIWSKP*QFEPNRFIQDADLTGVAGVKMIPFGIGRGICPRL AWNDKLVFAVVMKNPLRSKNVFDCIIIFFFTTRQFNSQFPKFAEIXDXR YVGAV* 67% identity for the top part with frame shift at gap CYP77A2 EIFLPGISDDPNLWSEPEKFDPDRFYLGKEDADITGVSGVKMIPFGMGRRICPGL E +LPGIS+DP +WS+P +F+P+RF +DAD+TGV+GVKMIPFG+GR ICP L Sbjct: 13981 
ECYLPGISEDPKIWSKP*QFEPNRFI---QDADLTGVAGVKMIPFGIGRGICPRL CYP77A2 DFTEKLEFTVVMKNTLRAK 172 + +KL F VVMKN LR+K Sbjct: 14104 AWNDKLVFAVVMKNPLRSK 14190 pseudogene remnant of CYP77A partial sequence with several frameshifts and a stop codon in frame identical match to EST H36337 clone 178A11T7 Query: 53 EAWNDKLVFAVVMKNPLRSK 72 +AWNDKLVFAVVMKNPLRSK Sbjct: 2 KAWNDKLVFAVVMKNPLRSK 61 whole EST translated = KAWNDKLVFAVVMKNPLRSKNVFDCIIIFFFTTRQFNSQFPKFAEIXDXR YVGAV*NSKPLTGDXXREECXKEKNPXPERGSALTFSLSVXQGFDXXQXX XX The first part of this EST matches AP000414 exactly for 51 amino acids. The EST suggests that a gene is made into mRNA that matches part of this sequence exactly. Opposite end of this sequence is AA651240 clone 178A11XP 3', mRNA sequence matches AP000414 from 16129-16750 = 3 prime UTR CYP77A6 AC011560 71181-72725 chromosome III clone IGF-F13M14 63% to 77A3 no introns exact to B30234 GSS AC013428 chromosome III clone F18K10 1-513 65425-66963 MSFLSFSPISLDLLSSNVFLTILAIIISGLLKTITYRKHNSNHLNLPPGPPGWPVIGNLF QFTRSGKQFFEYVEDLVKIYGPILTLRLGTRTMIIISDASLAHEALIERGAQFATRPVET PTRKIFSSSEITVHSAMYGPVWRSLRRNMVQNMLSSNRLKEFGSVRKSAMDKLIERIKSE ARDNDGLVWVLQNSRYAAFCVLLDICFGVEMEEESIEKMDQLMTAILNAVDPKLHDYLPI LTPFNYNERNRALKLRRELVDFVVGFIEKRRKAIRTATVSSFSYLDTLFDLRIIEGSETT PSDEDLVTLCSEFLNAGTDTTGAAIEWGIAELIANPEIQSRLYDEIKSTVGDRAVDERDV DKMVLLQAVVKEILRRHPPTYFTLSHGVTEPTTLSGYNIPVGVNIEFYLPGISEDPKIWS EPKKFDPDRFLSGREDADITGVAGVKMMPFGVGRRICPGMGMATVHVHLMIARMVQEFEW LAYPPQSEMDFAGKLVFAVVMKKPLRAMVRPRVQ* CYP77A7 AC011560 72998-74939 chromosome III clone IGF-F13M14 no introns exact with GSSs B25606, B24360, AL093147 ESTs AI995272 W43241 AC013428 chromosome III clone F18K10 1-513 67642-69180 MSILSFPHTFFNISPSLFYTILISSLVLLILTRRSAKSKIVKLPPGPPGWPVVGNLFQFA RSGKQFYEYVDDVRKKYGPIYTLRMGSRTMIIISDSALVHDVLIQRGPMFATRPTENPTR TIFSSNTFTVNASAYGPVWRSLRKNMVQNMLSSIRFREFGSLRQSAMDKLVERIKSEAKD NDGLVWVLRNARFAAFCILLEMCFGIEMDEESILNMDQVMKKVLITLNPRLDDYLPILAP FYSKERARALEVRCEQVDFIVKLIERRRRAIQKPGTDKTASSFSYLDTLFDLKTEGRITT
PSNEELVSLCSEFLNGGTDTTGTAIEWGIAQLIVNPEIQSRLYDEIKSTVGDREVEEKDV DKMVFLQAVVKEILRKHPPTYFTLTHSVTEPTTVAGYDVPVGINVEFYLPGINEDPKLWS DPKKFNPDRFISGKEEADITGVTGVKMMPFGIGRRICPGLAMATVHVHLMLAKMVQEFEW SAYPPESEIDFAGKLEFTVVMKKPLRAMVKPRV* CYP77A8P AB010077 Query: 26 IISGLLKTITYRKHN 40 II+ L K TY+KHN Sbjct: 51066 IITLLGKAKTYQKHN 51110 Query: 167 KSAMDKLIERIKSEARDND 185 KS +DK +E+IKSE ++ND Sbjct: 56196 KSTIDKFVEKIKSEVKNND 56252 Query: 195 RYAAFCVLLDICFGVEMEEESIEKMDQLMTAILNAVDP 232 R+ A C+LL + F +EMEEESIEKM Q+M IL VDP Sbjct: 56264 RFTALCILLAMSFEIEMEEESIEKMYQMMAEILIVVDP 56377 Query: 233 KLHDYLPI 240 KLH YLPI Sbjct: 75123 KLHRYLPI 75146 CYP77B1 AC007296 comp(89516-91048) 45% to 77A3 sequences no introns ESTs Z30775, Z30776 noGSSs MDLTDVIIFLFALYFINLWWRRYFSAGSSQCSLNIPPGPKGWPLVGNLLQVIFQRRHFVF LMRDLRKKYGPIFTMQMGQRTMIIITDEKLIHEALVQRGPTFASRPPDSPIRLMFSVGKC AINSAEYGSLWRTLRRNFVTELVTAPRVKQCSWIRSWAMQNHMKRIKTENVEKGFVEVMS QCRLTICSILICLCFGAKISEEKIKNIENVLKDVMLITSPTLPDFLPVFTPLFRRQVREA RELRKTQLECLVPLIRNRRKFVDAKENPNEEMVSPIGAAYVDSLFRLNLIERGGELGDEE IVTLCSEIVSAGTDTSATTLEWALFHLVTDQNIQEKLYEEVVGVVGKNGVVEEDDVAKMP YLEAIVKETLRRHPPGHFLLSHAAVKDTELGGYDIPAGAYVEIYTAWVTENPDIWSDPGK FRPERFLTGGDGVDADWTGTRGVTMLPFGAGRRICPAWSLGILHINLMLARMIHSFKWIP VPDSPPDPTETYAFTVVMKNSLKAQIRSRT*
>CYP78A5 Susan Carr Zondlo submitted, now on AC027656 comp(32530-34190) EST T45256
MSPEAYVLFFNSFNLVTFEAFASVSLIIATVAFLLSPGGLAWAWTGSSKSRVSIPGPSGS
LSVFSGSNPHRVLAALAKRFKASPLMAFSVGFSRFVISSEPETAKEILSSSAFADRPVKE
SAYELLFHRAMGFAPYGEYWRNLRRISSTHLFSPRRIASFEGVRVGIGMKMVKKIKSLVT
SDACGEVEVKKIVHFGSLNNVMTTVFGESYDFDEVNGKGCFLERLVSEGYELLGIFNWSD
HFWFLRWFDFQGVRKRCRALVSEVNTFVGGIIEKHKMKKGNNLNGEENDFVDVLLGLQKD
EKLSDSDMIAVLWEMIFRGTDTVAILVEWVLARMVLHQDIQDKLYREIASATSNNIRSLS
DSDIPKLPYLQAIVKETLRLHPPGPLLSWARLAIHDVHVGPNLVPAGTIAMVNMWSITHN
AKIWTDPEAFMPERFISEDVSIMGSDLRLAPFGSGRRVCPGKAMGLATVHLWIGQLIQNF
EWVKGSCDVELAEVLKLSMEMKNPLKCKAVPRNVGFA*
AC005819 AC006418 CYP78A6 MATKLESSLIFALLSKCSVLSQTNLAFSLLAVTIIWLAISLFLWTYPGGPAWGKYLFGRL ISGSYKTGNVIPGPKGFPLVGSMSLMSSTLAHRRIADAAEKFGAKRLMAFSLGETRVIVT CNPDVAKEILNSPVFADRPVKESAYSLMFNRAIGFAPHGVYWRTLRRIASNHLFSTKQIR RAETQRRVISSQMVEFLEKQSSNEPCFVRELLKTASLNNMMCSVFGQEYELEKNHVELRE MVEEGYDLLGTLNWTDHLPWLSEFDPQRLRSRCSTLVPKVNRFVSRIISEHRNQTGDLPR DFVDVLLSLHGSDKLSDPDIIAVLWEMIFRGTDTVAVLIEWILARMVLHPDMQSTVQNEL DQVVGKSRALDESDLASLPYLTAVVKEVLRLHPPGPLLSWARLAITDTIVDGRLVPAGTT AMVNMWAVSHDPHVWVDPLEFKPERFVAKEGEVEFSVLGSDLRLAPFGSGRRICPGKNLG FTTVMFWTAMMLHEFEWGPSDGNGVDLSEKLRLSCEMANPLPAKLRRRR AB016893 CYP78A7 GSS AQ011443 F27J20 B25471 MELMNLASKETSYWMIALPAGFGSQNLHDVSTLGYLFLAVVFLSIVTWALAGGGGVAWKN GRNRLGRVAIPGPRGIPVFGSLFTLSRGLAHRTLAAMAWSRANTEIMAFSLGSTPVIVAS EPNIAREILMSPHFADRPVKQSAKSLMFSRAIGFAPNGTYWRMLRRIASTHLFAPRRILA HEAGRQLDCAEMVKAVSVEQNGAGSVVLRKHLQLAALNNIMGSVFGRRYDPLAQKEDLDE LTSMVREGFELLGAFNWSDYLPWLGYFYDSIRLNQRCSDLVPRIRTLVKKIIDEHRVSNS EKKRDIGDFVDVLLSLDGDEKLQEDDMIAVLWEMIFRGTDTTALLTEWTMAELVLNPNVQ TKLRDEILTAVGDGADGDVADADLAKLPYLNAVVKETLRLHPPGPLLSWARLSTSDVQLS NGMVIPKGTTAMVNMWAITHDQTVWSDPLKFDPERFTGNADMDIRGGDLRLAPFGAGRRV CPGKNMGLATVTRWVAELVRRFEWGQDQTEPVDLGEVLKLSCEMEHPLRAVV AC007323 CYP78A8 comp(15232-17051) 62% identical to 78A6 no ESTs no GSSs AC023628 94000-96000 region chromosome I clone F6F3 CYP78A8 MRTEIESLWVFALASKFNIYMQQHFASLLVAIAITWFTITIVFWSTPGGPAWGKYFFTRR FISLDYNRKYKNLIPGPRGFPLVGSMSLRSSHVAHQRIASVAEMSNAKRLMAFSLGDTKV VVTCHPAVAKEILNSSVFADRPVDETAYGLMFNRAMGFAPNGTYWRTLRRLGSNHLFNPK QIKQSEDQRRVIATQMVNAFARNPKSACAVRDLLKTASLCNMMGLVFGREYELESNNNLE SECLKGLVEEGYDLLGTLNWTDHLPWLAGLDFQQIRFRCSQLVPKVNLLLSRIIHEQRAA TGNFLDMLLSLQGSEKLSESDMVAVLWEMIFRGTDTVAVLVEWVLARIVMHPKVQLTVHD ELDRVVGRSRTVDESDLPSLTYLTAMIKEVLRLHPPGPLLSWARLSITDTSVDGYHVPAG TTAMVNMWAIARDPHVWEDPLEFKPERFVAKEGEAEFSVFGSDLRLAPFGSGKRVCPGKN LGLTTVSFWVATLLHEFEWLPSVEANPPDLSEVLRLSCEMACPLIVNVSSRRKIM* CYP78A9 AB036059 and contig 31 of clone F21F14 from the EU chr 3 project 8955-10728 numbering will change as contigs are joined EST AI995549 
MATKLDTSSLLLALLSKCSLLTQTNLALSLLVASLASLALSLFFWSHPGGPAWGKYFLHR RRQTTVIPGPRGLPFVGSMSLMSNTLAHRCIAATAEKFRAERLMAFSLGETRVIVTCNPD VAKEILNSPVFADRPVKESAYSLMFNRAIGFAPYGVYWRTLRKIASNHLFSPKQIKRSET QRSVIANQIVKCLTKQSNTKGLCFARDLIKTASLNNMMCSVFGKEYELEEEHEEVSELRE LVEEGYDLLGTLNWTDHLPWLSEFDPQRIRSRCSNLVPKVNRFVNRIISDHREQTRDSPS DFVDVLLSLDGPDKLSDPDIIAVLWEMIFRGTDTVAVLIEWILARMVLHPDIQSTVHNEL DQIVGRSRAVEESDVVSLVYLTAVVKEVLRLHPPGPLLSWARLAITDTIIDGRRVPAGTT AMVNMWAIAHDPHVWENPLEFKPERFVAKEGEVEFSVLGSDLRLAPFGSGRRVCPGKNLG LTTVTFWTATLLHEFEWLTPSDEKTVDLSEKLRLSCEMANPLAAKLRPRRSFSV AC016662 CYP78A10 chromosome I clone IGF-F2P9 also AC079678 F9E11 64% to 78A2 comp(102654-104343) MTIDMYLSFASRSGSSPFPSLELCLSIFLFISLFVFWLTPGGFAWALYKARFHTRPESKTGPAI PGPSGLPIFGLLLAFVNNALTHRILANIADTCKAKALMAFSVGSTRFVITSEPETAKELL NSSAFADRPVKESAYELLFDRAMGFAPFGDYWRELRRISSTHLFSPKRIFSSGESRRKIG QNMVGEIKNAMECYGEVHIKKILHFGSLNNVMSSVFGKTYNFNEGIVYSKESNELEHLVS EGYELLGIFNWSDHFPGMRWLDLQGVRRRCRSLVGRVNVFVGKIINDHKSKRSLRDNPEE STYDDDFVDVLLGMHGNSKLSDSDMIAVLWEMIFRGTDTVAILLEWILARMVLHPDIQAK AQAEIDCIVGDSGRQVTDSDLPKLPYVRAIVKETLRMHPPGPLLSWARLSIHDTQIGTHF IPAGTTAMVNMWAITHDEKVWPEAHEYKPERFLGAQESNNFPIMGSDLRLAPFGAGRRVC PGKSMGLATVELWLAQLLGSYKWVSCGEVDLSETLKLSLEMKNTLVCKAIPRG* AB010692 CYP79A2 comp(11000-13200 region) also AF245302 about 53% identical to 79A1 MLDSTPMLAFIIGLLLLALTMKRKEKKKTMLISPTRNLSLPPGPKSWPLIGNLPEILGRN KPVFRWIHSLMKELNTDIACIRLANTHVIPVTSPRIAREILKKQDSVFATRPLTMGTEYC SRGYLTVAVEPQGEQWKKMRRVVASHVTSKKSFQMMLQKRTEEADNLVRYINNRSVKNRG NAFVVIDLRLAVRQYSGNVARKMMFGIRHFGKGSEDGSGPGLEEIEHVESLFTVLTHLYA FALSDYVPWLRFLDLEGHEKVVSNAMRNVSKYNDPFVDERLMQWRNGKMKEPQDFLDMFI IAKDTDGKPTLSDEEIKAQVT*ELMLATVDNPSNAAEWGMAEMINEPSIMQKAVEEIDRV VGKDRLVIESDLPNLNYVKACVKEAFRLHPVAPFNLPHMSTTDTVVDGYFIPKGSHVLIS RMGIGRNPSVWDKPHKFDPERHLSTNTCVDLNESDLNIISFSAGRRGCMGVDIGSAMTYM LLARLIQGFTWLPVPGKNKIDISESKNDLFMAKPLYAVATPRLAPHVYPT* note Kazusa gives different N-terminal MKGSIHDSAITGLEAQLMASKRGLIEA followed by RKE... 
for clone K18I23.1 AB025602 14755-16760 CYP79A3P Stop codon in Heme signature and frameshifts at K-helix and after Heme signature probable pseudogene but alternative C-terminal lies beyond. 88% identical to 79A2 MFDDSTSMLAFILGLLLLSLTMKRKETKKTMLMTSHTRNLSLPPGPKSWPLVGNLPEILG RNKPVFRWIHSLMEELNTDIACIRLANTHVIPVTSPRIAREILKKQDSVFAIRPLTMGTE YCSRGYLTIAVQSQGEQWKKMRRVVASHVTSKKSFKLMLEKRTEEADNLVRYINNRCVKN RGNGNGLAVIDLRFVVRQYSGNVARKMMFGIRHFGKGSEDESGPGSEEIEHVESLFTVLT HLYAFALSDYVPWLRFLDLEGHEKVVADAMRNVSKYNDPFVDERLMQWRNGKMKEPQDFL DMFIMAKDTNGKPTLSEEEIKAQVTELMLATVDNPSNAAEWAMAEMINEPSIMQKAVEEI DRVVGRHRFVLESDLPSLTYVKACVKETFRLHPVAPFNLPHMSTADAVVDGYFIPKGSHM LISRMGIGRNPNVWDKPLKFNPERHLGNNSCVELKEPNLNIISFSA*RRGCMGADIGAAM TYMLLARLIQGFTWSPVPGESKIDISESKSDLFMAKPLHAVATPRLDPHVYPT* AB025602 22090-22536 CYP79A4P Lone C-terminal exon note: this exon may join with N-terminal exons above to make a complete P450 gene. The natural last exon may be skipped. same as GSS B61504 LKEAFRLHHVAPFNLPHMSTTDVVVDGYFIPKGSHKLISRMGIGRNPKVWDKPLKFDPER HLSNNTCVELKEPDLNIITFGAERRGCMGADIGSAMTYMLVARLIQGFTWSPVPSESKID ISVSKSDLFMAKPLNAVVTPRLDPNVYPA* B19924 GSS fragment 70% identical to B61499 expand 62% identical to 79B2 FCIAAIDXPAHNMEWTLGEMLXKPEIFRKALEELDEVVWRDRLVPESRHTNSFY AL035708 T5J17.120 CYP79B2 complete sequence at4g39950 MNTFTSNSSDLTTTATETSSFSTLYLLSTLQAFVAITLVMLLKKLMTDPNKKKPYLPPGP TGWPIIGMIPTMLKSRPVFRWLHSIMKQLNTEIACVKLGNTHVITVTCPKIAREILKQQD ALFASRPLTYAQKILSNGYKTCVITPFGDQFKKMRKVVMTELVCPARHRWLHQKRSEEND HLTAWVYNMVKNSGSVDFRFMTRHYCGNAIKKLMFGTRTFSKNTAPDGGPTVEDVEHMEA MFEALGFTFAFCISDYLPMLTGLDLNGHEKIMRESSAIMDKYHDPIIDERIKMWREGKRT QIEDFLDIFISIKDEQGNPLLTADEIKPTIKELVMAAPDNPSNAVEWAMAEMVNKPEILR KAMEEIDRVVGKERLVQESDIPKLNYVKAILREAFRLHPVAAFNLPHVALSDTTVAGYHI PKGSQVLLSRYGLGRNPKVWADPLCFKPERHLNECSEVTLTENDLRFISFSTGKRGCAAP ALGTALTTMMLARLLQGFTWKLPENETRVELMESSHDMFLAKPLVMVGDLRLPEHLYPTV K AC006592 CYP79B3 comp(103303-105685) BAC F14M13 85% IDENTICAL TO 79B2 no ESTs no GSSs also AC007168 comp(2059..2691,3443..4441)gene T26C19.1 first 23 amino acids may not
be true N-terminal it may start at MDT MQKQLAYITKHILIVSPSPSSLQMDTLASNSSDLTTKSSLGMSSFTNMYLLTTLQALAAL CFLMILNKIKSSSRNKKLHPLPPGPTGFPIVGMIPAMLKNRPVFRWLHSLMKELNTEIAC VRLGNTHVIPVTCPKIAREIFKQQDALFASRPLTYAQKILSNGYKTCVITPFGEQFKKMR KVIMTEIVCPARHRWLHDNRAEETDHLTAWLYNMVKNSEPVDLRFVTRHYCGNAIKRLMF GTRTFSEKTEADGGPTLEDIEHMDAMFEGLGFTFAFCISDYLPMLTGLDLNGHEKIMRES SAIMDKYHDPIIDERIKMWREGKRTQIEDFLDIFISIKDEAGQPLLTADEIKPTIKELVM AAPDNPSNAVEWAIAEMINKPEILHKAMEEIDRVVGKERFVQESDIPKLNYVKAIIREAF RLHPVAAFNLPHVALSDTTVAGYHIPKGSQVLLSRYGLGRNPKVWSDPLSFKPERHLNEC SEVTLTENDLRFISFSTGKRGCAAPALGTAITTMMLARLLQGFKWKLAGSETRVELMESS HDMFLSKPLVLVGELRLSEDLYPMVK AC006592 CYP79B4P 2 LONE EXONS comp(101836-101952) and comp(102489-102605) also AC007168 RILILFGTGKRGCASLALGMVITTMIFVRLLQCFKWKLN (intron) QTHVELMESSLDIILSKPLVMVGELRLSDSQMIFTQWSS* AC002986 CYP79C1 comp(75-1680) also AC007202 comp(77381- 79610) clone T8K14 MDYYLNNVIFSVVVTFSITLNIVFLIKSVVARFLGRRKKLPPCPRGFPIIGNLVGMLKNR PTSKWIVRVMNDMKTDIACFRFGRVHVIVITSDVIAREVVREKDAVFADRPDSYSAEYIS GGYNGVVFDEYGERQMKMKKVMSSELMSTKALNLLLKVRNLESDNLLAYVHNLYNKDESK TKHGAVVNVRDIVCTHTHNVKMRLLFGRRHFKETTMDGSLGLMEKEHFDAIFAALDCFFS FYVADYYPFLRGWNLQGEEAELREAVDVIARYNKMIIDEKIELWRGQNKDYNRAETKNDV PMIKDWLDILFTLKDENGKPLLTPQEITHLSDLDVVGIDNAVNVIEWTLAEMLNQREILE KAVEEIDMVVGKERLVQESDVPNLNYVKACCRETLRLHPTNPFLVPHMARHDTTLAGYFI PKGSHILVSRPGVGRNPKTWDEPLIYRPERHITGNEVVLTEPDLRLVSFGTGRRGCVGAK LGTSMIVTLLGRLLQGFDWTIPPGTTDRVELVESKENLFMANPLMACVKPRLDPNMYPKL WTGPA* AC008051 CYP79C2 1899-4148 chromosome 1, 52% to 79C1 ALSO ON AC073943 note: this gene has four potential frameshifts and it may be a pseudogene (corrected on AC073943) no ESTs no GSSs MTIITNPSSFTLVLISITLVLALARRFSRFMKPKGQLPPGPRGWPIVGNMLQMIINRPAH LWIHRVMEELQTDIACYRFARFHVITVTSSKIAREVLREKDEVLADRSESYASHLISHGY KNISFSSYGENWKLVKKVMTTKLMSPTTLSKTLGYRNIEADNIVTYVYNLCQLGSVRKP INVRDTILTYCHAVMMRMMFGQRHFDEVVENGGLGPKEKEHMDAIYLALDCFFSFNLTNYI PFLRGWNVRKAETEVREAVHIINICNDPIIQERIHLWRKKGGKQMEEDWLDILITLKDDQ GMHLFTFDEIRAQCKEINLATIDNTMNNVEWTIAEMLNHPEILEKATNELDIIVGKDRLV 
QESDISQLNYIKACSKESFRLHPANVFMPHHVAREDTTLAGYFVPK (phase 1 intron) GSQILVSRLGLGRNPKIWDEPNAFKPERYLDGHVEKSLGVTLMEPDMRFVTFGTGRRSCPG TKIGTSMTIMLLARLIQGFEWTLPIGKSSVELISAESNLFMAKPLLACAKPRLAPSLYPKIQI* AC008051 CYP79C3P pseudogene fragments 5709-6541 5709 VFMPYHVAREDTILSG 5756 6191 GSQILVSRLGLGRNPKIWDEPNAFKPERYLDGHVEKSLGVTLMEPDMRFVTFGTG RRSCPGTKIGTSMTIMLLARLIQGFEWTLPIGKSSVELISAESNLFMAKPLLACA KPRLAPS 6541 AC008051 CYP79C4P pseudogene fragments 7210-7435 7210 NIATTDNTMNNVEWTIAPMMNHPEILEKATNELDK*TGQNRLVQES 7335 QLNYIKACSKESFRFLP 7397 KKSFRLHPANV 7435 AC008051 CYP79C5P pseudogene fragments 9049-9895 9049 VFMPYHVAREDTILSG 9096 9525 GSQILVSRLGLGRNPKIWDEPNMFKTERHLDGHVKNSLGLTLLEPDMRFVTFGTG HRSCPTTKIGTSMTIMSLARLLQGFEWTLPNGKIQLELISAESNLFIAKPLL 9845 9854 EPILAPSLYPKIQI 9895 AC006341 CYP79F1 46% identical to 79C1 less with other 79s F14189, F14190 63871..66127 /gene="F3O9.21" CYP79F1 MMSFTTSLPYPFHILLVFILSMASITLLGRILSRPTKTKDRSCQLPPGPPGWPILGNLPE LFMTRPRSKYFRLAMKELKTDIACFNFAGIRAITINSDEIAREAFRERDADLADRPQLFI METIGDNYKSMGISPYGEQFMKMKRVITTEIMSVKTLKMLEAARTIEADNLIAYVHSMYQ RSETVDVRELSRVYGYAVTMRMLFGRRHVTKENVFSDDGRLGNAEKHHLEVIFNTLNCLP SFSPADYVERWLRGWNVDGQEKRVTENCNIVRSYNNPIIDERVQLWREEGGKAAVEDWLD TFITLKDQNGKYLVTPDEIKAQCVEFCIAAIDNPANNMEWTLGEMLKNPEILRKALKELD EVVGRDRLVQESDIPNLNYLKACCRETFRIHPSAHYVPSHLARQDTTLGGYFIPKGSHIH VCRPGLGRNPKIWKDPLVYKPERHLQGDGITKEVTLVETEMRFVSFSTGRRGCIGVKVGT IMMVMLLARFLQGFNWKLHQDFGPLSLEEDDASLLMAKPLHLSVEPRLAPNLYPKFRP AC006341 CYP79F2 89% identical to 79F1 B20058, B61499 CDS join(60240..61208,61386..61659,61953..62290) /gene="F3O9.20" CYP79F2 MMMKISFNTCFQILLGFIVFIASITLLGRIFSRPSKTKDRCRQLPPGRPGWPILGNLPEL IMTRPRSKYFHLAMKELKTDIACFNFAGTHTITINSDEIAREAFRERDADLADRPQLSIV ESIGDNYKTMGTSSYGEHFMKMKKVITTEIMSVKTLNMLEAARTIEADNLIAYIHSMYQR SETVDVRELSRVYGYAVTMRMLFGRRHVTKENMFSDDGRLGKAEKHHLEVIFNTLNCLPG FSPVDYVDRWLGGWNIDGEEERAKVNVNLVRSYNNPIIDERVEIWREKGGKAAVEDWLDT FITLKDQNGNYLVTPDEIKAQCVEFCIAAIDNPANNMEWTLGEMLKNPEILRKALKELDE VVGKDRLVQESDIRNLNYLKACCRETFRIHPSAHYVPPHVARQDTTLGGYFIPKGRNPKI 
WKDPLAYEPERHLQGDGITKEVTLVETEMRFVSFSTGRRGCVGVKVGTIMMAMMLARFLQ GFNWKLHRDFGPLSLEEDDASLLMAKPLLLSVEPRLASNLYPKFRP D78606 CYP81D1 AB026661 comp(8484-10672) MEETNIRVVLYSIFSLIFLIISFKFLKPKKQNLPPSPPGWLPIIGHLRLLKPPIHRTLRS FSETLDHNDGGGVMSLRLGSRLVYVVSSHKVAAEECFGKNDVVLANRPQVIIGKHVGYNN ANMIAAPYGDHWRNLRRLCTIEIFSTHRLNCFLYVRTDEVRRLISRLSRLAGTKKTVVEL KPMLMDLTFNNIMRMMTGKRYYGEETTDEEEAKRVRKLVADVGANTSSGNAVDYVPILRL FSSYENRVKKLGEETDKFLQGLIDDKRGQQETGTTMIDHLLVLQKSDIEYYTDQIIKGII LIMVIAGTNTSAVTLEWALSNLLNHPDVISKARDEIDNRVGLDRLIEEADLSELPYLKNI VLETLRLHPATPLLVPHMASEDCKIGSYDMPRGTTLLVNAWAIHRDPNTWDDPDSFKPER FEKEEEAQKLLAFGLGRRACPGSGLAQRIVGLALGSLIQCFEWERVGNVEVDMKEGVGNT VPKAIPLKAICKARPFLHKIIS* AL035601 F6G17.10 CYP81D2 complete sequence GSS AQ250603 Z99707 gene C7A10.10 C-term fragment <1-560 at4g37360 MEALMLIFTFCFIVLSLIFLIGRIKRKLNLPPSPAWALPVIGHLRLLKPPLHRVFLSVSQ SLGDAPIISLRLGNRLLFVVSSHSIAEECFTKNDVILANRQTTISTKHISYGNSTVVSAS YSEHWRNLRRIGALEIFSAHRLNSFSSIRRDEIRRLIGRLLRNSSYGFTKVEMKSMFSDL TFNNIIRMLAGKCYYGDGKEDDPEAKRVRTLIAEAMSSSGPGNAADYIPILTWITYSETR IKKLAGRLDEFLQGLVDEKREGKEKKENTMVDHLLCLQETQPEYYMDRIIKGTMLSLIAG GTDTTAVTLEWALSSLLNNPEVLNKARDEIDRMIGVDRLLEESDIPNLPYLQNIVSETLR LYPAAPMLLPHVASKDCKVGGYDMPRGTMLLTNAWAIHRDPLLWDDPTSFKPERFEKEGE AKKLMPFGLGRRACPGSGLAQRLVTLSLGSLIQCFEWERIGEEEVDMTEGPGLTMPKARP LEAMCRARDFVGKILPDSS Z99707 CYP81D3 CDS join(978..1475,1630..2016,2114..2731) gene C7A10.20 at4g37340 MEGQTLIFTFLFISLSLTFIIGRIKRRPNLPPSPSWALPVIGHLRLLKPPLHRVFLSVSE SLGDAPIISLRLGNRLVFVVSSHSLAEECFTKNDVVLANRFNSLASKHISYGCTTVVTAS YGDHWRNLRRIGAVEIFSAHRLNSFSSIRRDEIHRLIACLSRNSSLEFTKVEMKSMFSNL TFNNIIRMLAGKCYYGDGAEDDPEAKRVRELIAEGMGCFGAGNTADYLPILTWITGSEKR IKKIASRLDEFLQGLVDERREGKEKRQNTMVDHLLCLQETQPEYYTDNIIKGIMLVSNIL TGTDTSAVTLEWTLSALLNHPQILSKARDEIDNKVGLNRLVEESDLSHLPYLQNIVSESL RLYPASPLLVPHVASEDCKVGGYHMPRGTMLLTNAWAIHRDPKIWDDPTSFKPERFEKEG EAQKLLGFGLGRRACPGSGLAQRLASLTIGSLIQCFEWERIGEEEVDMTEGGGGVIMPKA IPLVAMCKARPVVGKILNESA* Z99707 CYP81D4 CDS join(3115..3615,4090..4464,4535..5137) gene C7A10.30 at4g37330 
MDLNQILILSFLSLFTLAIFLLTRSKRKLNLPPSPAISLPVIGHLHLLKPPLHRTFLSLS KSIGNAPVFHLRLGNRLVYVISSRSIAEECFTKNDVVLANRPKFTISKHLGYNATYLLSA SYGDHWRNLRRIAAVEIFSTHRLNSFLYIRKDEIRRLISHLSRDSLHGFVEVEMKTLLTN LASNTTIRMLAGKRYFGEDNDDAKLVKNLVSEAVTSAGAGNPIDYLSILRWVSSYEKRIK NLGNRFDTFLQKLVDEKRAEKEKGETMIDHLLALQDIQPDYYTDVIIKGIILTLIIAGTD TSSVTLEWAMSNLLNHPEILKKARMEIDEKVGLDRLVDESDIVNLSYLQSIVLETLRMYP AVPLLLPHLSSEDCKVGGYDIPSGTMVLTNAWAMHRDPEVWEDPEIFKPERFEKEGEAEK LISFGMGRRACPGAGLAHRLINQALGSLVQCFEWERVGEDFVDMTEDKGATLPKAIPLRA MCKARSIVDKLI* Z99707 CYP81D5 CDS join(5994..6494,6880..7263,7340..7942) gene C7A10.40 Note: C-terminal revised from earlier sequence at4g37320 AF184833 inhibitor tagged site ITS66 EST AI999699 MDLTQILLLSFLFLTISIKLLLTKSNRKPNLPPSPAYPLPVIGHLHLLKQPVHRTFHSIS KSLGNAPIFHLRLGNRLVYVISSHSIAEECFTKNDVVLANRPDIIMAKHVGYNFTNMIAA SYGDHWRNLRRIAAVEIFSSHRISTFSSIRKDEIRRLITHLSRDSLHGFVEVELKSLLTN LAFNNIIMMVAGKRYYGTGTEDNDEAKLVRELIAEIMAGAGSGNLADYLPSINWVTNFEN QTKILGNRLDRVLQKLVDEKRAEKEKGQTLIDHLLSFQETEPEYYTDVIIKGIILALVLA GTDTSSVTLEWAMSNLLNHPEILEKARAEIDDKIGSDRLVEESDIVNLHYLQNIVSETLR LYPAVPLLLPHFSSDECKVAGYDMPRRTLLLTNVWAMHRDPGLWEEPERFKPERFEKEGE ARKLMPFGMGRRACPGAELGKRLVSLALGCLIQSFEWERVGAELVDMTEGEGITMPKATP LRAMCKARAIVGKTI AC002391 T20D16.15 CYP81D6 MHVSIQHIYKFLDLNMETYFLLLSLAFFLFISVKLLFGKRLSKFNLPPSPARPLPLIGHL HLLKLPLHRTFLSFSQSLGGAPIFCLRLGNRLTVVVSSYSIAEECFTKNDIVFANRPELI LGKHIEYNSTTMTSAPYGDHWRNLRRIATLEIFASHKLNGFLSVRKDEIRQLLLRLSKNS RHGFAKVEMRHLLFELTINNVFRMVAGKRFYGEGTEQDEVAQQVRHLMDEIVTSAGAGNA ADYLPIMRWFTNFEKRVKNLAIRIDKFLQSLVDEKRADKEKGTTMIDHLLSLQESQPDYY TDVTLKGIIIVMIIAGSETIAWTLEWAMLNVLNHPEVLKKARTEIDTKIGFDRLMDEADT KNLPYLQWIVLETLRLHPAAPTNVPHSTSEDCMLAGYDVPRGSMLLVNIWSMHRDPSIWE DPEMFKPERFKNEKLNQKLLSFGMGRRACPGYGLAHRVVSLTLGSMVQCFEWQRIGQEYV DNSEDKTVVLMRPTTPLLAMCKARPIVHEILDAFA* AC002391 T20D16.18 CYP81D7 MITPKLELLFFIIMFHVNSRFRYKLSYNFELPCSVQHIYKFFDLMETHFLILSLAFLFFI SLKLLFGKRHSKFNLPPSPARPLPFIGHLHLLKQPLHRTFLSFSQSLGDAPIFSLRLGNH LTVVVSSYSIAEECFTKNDIVLANRPKFILGKHIEYNFTTMTSAPYGDHWRNLRRIGTLE 
IFSSHKLNGFLSVRKDEIRHLLLRLSKNSQHGFAKVEMRQLFYDLTINNILRMVAGKRFY GEGTEQDEVARRVTQLIDEIVYRAGVGNAADYIPILRWITDFEKGVKELASRVDEFLQSL VDERRVHKQKGNTMMDHLLSLQETQPDYYTDVTLKGIIIVMILAGTETLAGTLEWAMLNL LNHPEVLEKARTEIDTEVGFDRLMDEADTKNLPYLQWIVLETLRLYPVAPTNIPHMTSDD CILAGYDVPRGSMLLVNVWSMHRDPSIWEAPEMFKPERFKNEKLNQKLLSFGFGRRACPG VGLAHRLMSLALGSMVQCFEWQRIGEEYVDTREEPMAMMRPATPLLAMCKARPIVHKFLD AFA* AL035601 F6G17.20 CYP81D8 ESTs AA586064, N38687, T41596, H76015 at4g37370 METKTLIFSILFVVLSLIYLIGKLKRKPNLPPSPAWSLPVIGHLRLLKPPIHRTFLSLSQ SLNNAPIFSLRLGNRLVFVNSSHSIAEECFTKNDVVLANRPNFILAKHVAYDYTTMIAAS YGDHWRNLRRIGSVEIFSNHRLNSFLSIRKDEIRRLVFRLSRNFSQEFVKVDMKSMLSDL TFNNILRMVAGKRYYGDGVEDDPEAKRVRQLIADVVACAGAGNAVDYLPVLRLVSDYETR VKKLAGRLDEFLQGLVDEKREAKEKGNTMIDHLLTLQESQPDYFTDRIIKGNMLALILAG TDTSAVTLEWALSNVLNHPDVLNKARDEIDRKIGLDRLMDESDISNLPYLQNIVSETLRL YPAAPMLLPHVASEDCKVAGYDMPRGTILLTNVWAIHRDPQLWDDPMSFKPERFEKEGEA QKLMPFGLGRRACPGSGLAHRLINLTLGSLIQCLEWEKIGEEVDMSEGKGVTMPKAKPLE AMCRARPSVVKIFNESV CYP81D10 74% to 81D2 AC013288 14837-27604 chromosome I clone T12I7 ALSO ON AC074025 GSSs B22310 B21261 MEDETQTLILITILILTLITLTFLIKKLKPKPNLPPSPPFSLPIIGHLRLLKPPLHRLFL TISKSLNDSPIFSLRLGNKLVFVVSSHSIVEECFTKNDVVLANRPKSIAS*HISYNYASM VSAPYSEHWRNLRRIGAVEIFSNHRLNSFYTIRRDEIRRLIARLSRSPNASLFAKVEMNS MLSNLAFNNIIRMVTGKCYYGDGAEDDPEAKRVRQLIAEAMSCFGAGHAADHLPMLRWIT DFERRVKKIAARLEFFQRLVDEKRVAKEKKENTMIDHLLSLQVSQPEYYTDHTIKGTMLS LILAGTDTSAVTLEWALSSLLNNPEVLKKVRDEIDNQIGLDRLLEESDIPNLPYLQNIVS ETLRLYPAGPLLVPHISSEDCKVGGYDMPCGTMLLVNVWAIHRDPRLWDDPASFKPERFE KEGETHKLLTFGLGRRACPGSGLARRLVSLSLGSLIQCFEWERIGEEEVDMTEGGGLTMP RAIPLVAMCRARAFVGKIPHESG* AP002057 CYP81D11 genomic DNA, chromosome 3, BAC clone T19N8, comp(8441-10208) 67% to 81D2 ESTs T14177 Z29010, Z29011, Z26103, H77232, F14118 METIYLILSLFFFFIFLSLKLLFGPRPRKLNLPPSPSRPFPIIGHLHLLKLPLHRRFLSL SESLNNAKIFSLSLGSRLVFVVSSHAVAEECFTKNDVVLANRPEFLVGKHIGYNSTTMVG AAYGDSWRNLRRIGTIEIFSSLRLNSFVSIRQDEIRRLIICLAKNSQH (phase 0 intron) GFVKVEMKPLFMCLTINNIIRMVAGKRFYGDGTENDNEAKHVRQLIAEVVVSGGAGNAAD 
YFPILRYVTNYEKHVKKLAGRVDEFLQSLVNEKRVEKVKGNTMIDHLLSLQETQPDYYT DVIIKGIIL (phase 0 intron) VMILAGTDTSAGTLEWAMSNLLNHPEVLRKAKTEIDDQIGVDRLVEEQDIVKLPYLQHIVS ETLRLYPVAPMLLPHLASEDCIVDGYDVPRGTIILVNAWAIHRDPKLWEEPEKFKPERFE KKGEDKKLMPFGIGRRSCPGSGLAQRLVTLALGSLVQCFEWERVEEKYLDMRESEKGTTM RKATSLQAMCKARPIVHKVLDASCP* D78607 CYP81F1 AL035601 F6G17.80 at4g37430 MLYFILLPLLFLVISYKFLYSKTQRFNLPPGPPSRPFVGHLHLMKPPIHRLLQRYSNQYG PIFSLRFGSRRVVVITSPSLAQESFTGQNDIVLSSRPLQLTAKYVVYNHTTVGTAPYGDH WRNLRRMCSQEILSSHRLIIFQHIRKDEILRMLTRLSRYTQTSNESNDFTHIELEPLLSD LTFNNIVRMVTGKRYYGDDVNNKEEAELFKKLVYDIAMYSGANHSADYLPILKLFGNKFE KEVKAIGKSMDDILQRLLDECRRDKEGNTMVNHLISLQQQQPEYYTDVIIKGLMMSMMLA GTETSAVTLEWAMANLLRNPEVLEKARSEIDEKIGKDRLIDESDIAVLPYLQNVVSETFR LFPVAPFLIPRSPTDDMKIGGYDVPRDTIVMVNAWAIHRDPEIWEEPEKFNPDRYNDGCG SDYYVYKLMPFGNGRRTCPGAGLGQRIVTLALGTLIQCFEWENVKGEEMDMSESTGLGMR KMDPLRAMCRPRPIMSKLLL AB019233 CYP81F2 about 70% identical to 81F1 EST T88150 MDYVLIVLPLALFLIAYKFLFSSKTQGFNLPPGPTPFPIVGHLHLVKPPVHRLFRRFAEK YGDIFSLRYGSRQVVVISSLPLVRESFTGQNDVILTNRPHFLTAKYVAYDYTTIGTAAYG DHWRNLRRICSLEILSSNRLTGFLSVRKDEIRRLLTKLSREYDGRVVELEPLLADLTFNN IVRMVTGRRYYGDQVHNKEEANLFKKLVTDINDNSGASHPGDYLPILKVFGHGYEKKVKA LGEAMDAFLQRLLDECRINGESNTMVSHLLSLQLDQPKYYSDVIIKGLMLSMMLAGTDTA AVTLEWAMANLLKKPEVLKKAKAEIDEKIGEERLVDEPDIANLPYLQNIVSETFRLCPAA PLLVPRSPSEDLKIGGYDIPRGTIVLVNAWAIHRDPRLWDEPEKFMPERFEDQEASKKLM VFGNGRRTCPGATLGQRMVLLALGSLIQCFDWEKVNGEDVDMTENPGMAMRKLVQLRAVC HKRPIM AL035601 F6G17.50 CYP81F3 EST AA712784 at4g37400 MFYYVIVLPLALFLLAYKLFFTSKTKRFNLPPSPPYSLPILGHHNLLKPPVHRLFHRLSK THGPIFSLQFGSRRAVVISSSSLATQCFTGQNDIILSNRPCFLTAKYVAYNYTTVGTAPY GDHWRNLRRICSLEILSSNRLTNFLHIRKDEIHRMLTRLSRDVNKEIELEPLLSDLTFNN IVRMVTGKRYYGDEVHNEEEANVFKKLVADINDCSGARHPGDYLPFMKMFGGSFEKKVKA LAEAMDEILQRLLEECKRDKDGNTMVNHLLSLQQNEPEYYTDVTIKGLMLGMMIAGTDTS AVTLEWAMSSLLNHPEALEKAKLEIDEKIGQERLIDEPDIANLPYLQNIVSETFRLYPAA PLLVPRSPTEDIKVGGYDVPRGTMVMVNAWAIHRDPELWNEPEKFKPERFNGGEGGGRGE DVHKLMPFGNGRRSCPGAGLGQKIVTLALGSLIQCFDWQKVNGEAIDMTETPGMAMRKKI PLSALCQSRPIMSKLQAHLKG AL035601 F6G17.60 CYP81F4 ESTs T46441 
AA598000 AI099973 65% identical to 81F2 at4g37410 MFNYVIILPLALFLLAYKFFFTSKKQRYYLPPSPSYSLPILGHHLLIKPPVHRLFHRLSN IHGPIFYLRLGSRRAVVISSSSLARECFTGQNDVIVSNRPRFLTSKYIAYNYTTIATTSY GDHWRNLRRICSLEIVSSKRLANFLHIRKEEIQRMLTRLSRDARVGKEVELESILYDLTF NNIVRMVTGKIYYGDDVSDKEEAELFKKLFTFITTNSGARHPGEYLPFMKIFGGSFEKEV KAAAKVIDEMLQRLLDECKSDKDGNTMVNHLLSLQQDDPEYYTDIIIKGLMLGIMVASSE TSALTIEWAMASLLNHPKVLDKVKLEIDEIIGQDRLIEESDIANLPYLQNVVSETLRLHP AAPVLVPRSTAEDIKIGGYDVPRDTMVMVNAWAIHRDPDLWTEPERFNPERFNGGEGEKD DVRMLIAFGSGRRICPGVGLAHKIVTLALGSLIQCFDW KKVNEKEIDMSEGPGMAMRMMV PLRALCKTRPIMNKLPAYTKV AB007645 CYP81G1 clone K8K14.3 from Kazusa MIDLFLLALMAGFIAVAYVFRSKQKKNLPPNPVGFPVIGHLHLLKEPVHRSLRDLSRNLG IDVFILRLGSRRAVVVTSASAAEEFLSQQNDVVFANRPLATLTEYMGYNNTLVSTAPYGE HWRRLRRFCAVDILSTARLRDFSDIRRDEVRAMIRKINVELVTSGGSVRLKLQPFLYGLT YNILMSMVAGKREEDEETKEVRKLIREVFDFAGVNYVGDFLPTLKLFDLDGYRKRAKKLA SKLDKFMQKLVDEHRKNRGKAELEKTMITRLLSLQESEPECYTDDIIKGLVQVMLLAGTD TTAVTLEWAMANLLNHPEVLRKLKTELNEVSKEGRVFEESDTGKCPYLNNVISETLRLFP AAPLLVPHASSTDCEVAGFDIPRRTWLFINAWAIQRDPNVWDDPETFKPERFESETHRGK FLPFGIGRRACPGMGLAQLVLSLALGSLIQCFDWERDNDVAVDMSEGKGLTMPKSVPLVA KCKSLPILDKLVL* Z99707 CYP81H1 GSSs B26848, B28869 at4g37310 CDS join(8851..9360,9980..10369,10876..11532) gene C7A10.50 BE529494 M74L22STM Arabidopsis developing seed Arabidopsis thaliana cDNA clone 600038786R1 5' Length = 224 BE522141 M23H4STM Arabidopsis developing seed Arabidopsis thaliana cDNA clone M23H4 5' Length = 472 same as BE529494 MDCILLILTTLVAIFIVKIVLLVTKPNKNLPPSPNICFPIIGHLHLLKKPLLHRTLSHLS HSLGPVFSLRLGSRLAVIISSPTAAEECFLTKNDIVLANRPRFIMGKYVAYDYTSMVTAP YGDHWRNLRRITALEVFSTNRLNASAEIRHDEVKMLLQKLHDLSVERPAKVELRQLLTGL TLNVIMRMMTGKRFFEEDDGGKAGISLEFRELVAEILELSAADNPADFLPALRWFDYKGL VKRAKRIGERMDSLLQGFLDEHRANKDRLEFKNTMIAHLLDSQEKEPHNYSDQTIKGLIL MMVVGGTDTSALTVEWAMSNLLNHPQILETTRQNIDTQMETSSSRRLLKEEDLVNMNYLK NVVSETLRLYPVAPLMVPHVPSSDCVIGGFNVPRDTIVLVNLWAIHRDPSVWDDPTSFKP ERFEGSDQFGHYNGKMMPFGLGRRACPGLSLANRVVGLLLGSMIQCFEWESGSGGQVDMT EGPGLSLPKAEPLVVTCRTREMASELLFFGSEPSNKNV* AL353995 CYP81K1 chromosome 5, BAC 
clone F12B17 comp(9841..11343) EST AI997093 MEDLWFIFFSFLTATLIFFITKKFLWSLNSKLPPSPTPLPIIGH LHLIKKYPLPQALRHLSSNYGPVLFLKFGCRNVLTLSSPDSIEECFTNHDVTLANRPK TITSDHFSYGYKNFGFAPYGDLWRTLRRLSTLEVFSSASLQKNSSIRNEEVSNLCLII FRLSRDSRIVDLKYQFTLLTAHIMLRLVSGKRGVKKSDPESEKRFLDDFKLRFFSSMS MNVCDYFPVLRWIGYKGLEKRVIDMQRMRDEYLQRLIDDIRMKNIDSSGSVVEKFLKL QESEPEFYADDVIKGIIVLMFNGGTDTSPVAMEWAVSLLLNHPDKLEKLREEIKSNVK HKGLIQDSDLSSLPYLRCVIYETLRLYPAAPLLLPHCSSKRFNLGNYEIPENIMLLVN AWAVHRDGELWEEANVFKPERFEGFVGDRDGFRFLPFGVGRRACPAAGLGMRVVSLAV GALVQCFEWEKVEAGDIDMRPVFGVAMAKAEPLVALPKPWSEMVPILSQL AL353995 CYP81K2 chromosome 5, BAC clone F12B17 comp(11529..11633,11733..11837,12108..13583) ESTs AI997093 MENLWVIFLLTATLTFFITKFLWSQNPKLPPSPTPLPIIGHLHL INKYPLPQALHHLSSNYGPVLFLKFGCREVLILSSPDSIEECFTNHDIALANRPKTIT SDHFSYGYKNFGFAPYGDLWRTLRRLSTLEVFSSVSLQKNSSIRTEEVSNLCSSLFRF SGGNSCKVDLRYQFTLLTAHVMLRLVSGNRGVNESDSESETKFLDEFKSRFFSSLSSM NVCDYFPVLRWIGYKGLENRVIEIQRMRDEYLQRLIDDIRMKKFDSTGSVVEKFLKLQ ESEPEFYSDDVIKGIVVLMFNAGSDTSPVTMEWAMALLLNHPDKLDKVREEIKSNVKH KGIIQDSDLSSLPYLRCVIYETLRLHPAAPILPPHCSSKRFNLGNYEIPENTVLLVNA WAVHRDGELWEEADVFKPERFEEFVGDRDGFRFLPFGVGRRACPAAGLAMRVVSLAVG ALVQCFEWEKVEKEDIDMRPAFSVAMDRAEPLIALLKPCPEMVPILSQL AL021636 F10N7.200 CYP82C2 also AL049607 89014-90961 at4g31970 MDTSLFSLFVPILVFVFIALFKKSKKPKHVKAPAPSGAWPIIGHLHLLSGKEQLLYRTLG KMADQYGPAMSLRLGSSETFVVSSFEVAKDCFTVNDKALASRPITAAAKHMGYDCAVFGF APYSAFWREMRKIATLELLSNRRLQMLKHVRVSEISMVMQDLYSLWVKKGGSEPVMVDLK SWLEDMSLNMMVRMVAGKRYFGGGSLSPEDAEEARQCRKGVANFFHLVGIFTVSDAFPKL GWFDFQGHEKEMKQTGRELDVILERWIENHRQQRKVSGTKHNDSDFVDVMLSLAEQGKFS HLQHDAITSIKSTCLALILGGSETSPSTLTWAISLLLNNKDMLKKAQDEIDIHVGRDRNV EDSDIENLVYIQAIIKETLRLYPAGPLLGHREAIEDCTVAGYNVRRGTRMLVNVWKIQRD PRVYMEPNEFRPERFITGEAKEFDVRGQNFELMPFGSGRRSCPGSSLAMQVLHLGLARFL QSFDVKTVMDMPVDMTESPGLTIPKATPLEILISPRLKEGLYV AL021636 F10N7.220 CYP82C3 also AL049607 81769-83693 at4g31950 MDTSLFSLFVSILVFVFIALFKKSKKPKYVKAPAPSGAWPIIGHLHLLGGKEQLLYRTLG KMADHYGPAMSLRLGSSETFVGSSFEVAKDCFTVNDKALASLMTAAAKHMGYVFWLEMRK 
IAMIELLSNRALQMLNNVRVSEISMGVKDLYSLWVKKGGSEPVMVDLKSWLEDMIANMIM RMVAGKRYFGGGGAESSEHTEEARQWRKGIAKFFHLVGIFTVSDAFPKLGWLDLQGHEKE MKQTRRELDVILERWIENHRQQRKVSGTKHNDSDFVDVMLSLAEQGKLSHLQYDANTCIK TTCLALILGGSETSPSTLTWAISLLLNNKDMLKKVQDEIDIHVGRDRNVEDSDIKNLVYL QAIIKETLRLYPAAPLLGHREAMEDCTVAGYNVPCGTRLIVNVWKIQRDPKVYMEPNEFR PERFITGEAKDFDVRGQNFELMPFGSGRRSCPGPSLAMQMLHLGLARFLHSFEVKTVLDR PVDMSESPGLTITKATPLEVLINPRLKRELFV AL021636 F10N7.230 CYP82C4 also AL049607 78646-80569 at4g31940 MDTSLFSLFVPILVFVFIALFKKSKKPKYVKAPAPSGAWPIIGHLHLLGGKEQLLYRTLG KMADHYGPAMSLQLGSNEAFVVSSFEVAKDCFTVNDKALASRPMTAAAKHMGYNFAVFGF APYSAFWREMRKIATIELLSNRRLQMLKHVRVSEITMGVKDLYSLWFKNGGTKPVMVDLK SWLEDMTLNMIVRMVAGKRYFGGGGSVSSEDTEEAMQCKKAIAKFFHLIGIFTVSDAFPT LSFFDLQGHEKEMKQTGSELDVILERWIENHRQQRKFSGTKENDSDFIDVMMSLAEQGKL SHLQYDANTSIKSTCLALILGGSDTSASTLTWAISLLLNNKEMLKKAQDEIDIHVGRDRN VEDSDIENLVYLQAIIKETLRLYPAGPLLGPREAMEDCTVAGYYVPCGTRLIVNVWKIQR DPKVYMEPNEFRPERFITGEAKEFDVRGQNFELMPFGSGRRSCPGSSLAMQVLHLGLARF LHSFDVKTVMDMPVDMSENPGLTIPKATPLEVLISPRIKEELFV AL031369 CYP82F1 comp(52551..52924,53217.53496,53597.54526) gene= F13D4.120 GSSs B27454, B29081 matches with 2 diffs. 
B20125 (same BAC end as B27454) AC018722 Arabidopsis thaliana chromosome II CDS complement(join(42245..42618,42911..43190,43291..44220)) MDLIMLFLLSALFIFPVLILIKSRLRPKNKKSTAPMVPGAWPLLGHLHLFDTVNPTHVTF GAMADVYGPVFMAKLGSIKVMIINSKEVAKEIYTVHDKLLERPELTASKLLGYNDSFLTF SPYGLYWREIRKIAVSELFSTSGVDMHMVSRAREADLAFRALYVRWEKRGKPKEGVLVDM KQEFIDLTANISLMMVSGKRYFGENPNCEVKEARRCGKLIREFLDYFALFLLSDVAPVLG FLDWKTKRGMKRTAKGLDKVAEGWIEEHKNKRSDHGRSENDYLDILIKILGQDKIPGLSD THTKIKALCLNLVLAGSETAIVVLVWAVSLLLNNPHVLRKAQEELDSKIGKERVVEELDI KDLVYLQAIVKETFRLYPPVPLVAYRAVVEDFDIAFCKCHVPAGTQLMVSAWKIHRDPNV WSNPEQFEPERFLTSNRELDVGGQSYKFFPFGLGRRSCPAIPLGMRMVHYLLVRFLHSFD LARPSSQDVDMTESNGLVNHKATPLEVNIIPRLHKSLYEVDHIGTDN CYP82G1 or 82D2 check vs 82D1 chris steele A48 AB026647 comp(36473-38300) no ESTs no GSSs clone MJL12 MTFLFSTLQLSLFSLALVIFGYIFLRKQLSRCEVDSSTIPEPLGALPLFGHLHLLRGKKL LCKKLAAMSQKHGPIFSLKLGFYRLVVASDPKTVKDCFTTNDLATATRPNIAFGRYVGYN NASLTLAPYGDYWRELRKIVTVHLFSNHSIEMLGHIRSSEVNTLIKHLYKGNGGTSIVKI DMLFEFLTFNIILRKMVGKRIGFGEVNSDEWRYKEALKHCEYLAVIPMIGDVIPWLGWLD FAKNSQMKRLFKELDSVNTKWLHEHLKKRSRNEKDQERTIMDLLLDILPEDIVISGHVRD VIVKATIL*ALTLTGSDSTSITLTWAVSLLLNNPAALEAAQEEIDNSVGKGRWIEESDIQ NLKYLQAIVKETHRLYPPAPLT*GIREAREDCFVGGYRVEKGTRLLVNIWKLHRDPKIWP DPKTFKPERFMEDKSQCEKSNFEYIPFGSGRRSCPGVNLGLRVVHFVLARLLQGFELHKV SDEPLDMAEGPGLALPKINPVEVVVMPRLDPKLYSLL* U18929 CYP83A1 AL035528 F18A5.160 ESTs AA389789, T88085 at4g13770 MEDIIIGVVALAAVLLFFLYQKPKTKRYKLPPGPSPLPVIGNLLQLQKLNPQRFFAGWAK KYGPILSYRIGSRTMVVISSAELAKELLKTQDVNFADRPPHRGHEFISYGRRDMALNHYT PYYREIRKMGMNHLFSPTRVATFKHVREEEARRMMDKINKAADKSEVVDISELMLTFTNS VVCRQAFGKKYNEDGEEMKRFIKILYGTQSVLGKIFFSDFFPYCGFLDDLSGLTAYMKEC FERQDTYIQEVVNETLDPKRVKPETESMIDLLMGIYTEQPFASEFTVDNVKAVILDIVVA GTDTAAAAVVWGMTYLMKYPQVLKKAQAEVREYMKEKGSTFVTEDDVKNLPYFRALVKET LRIEPVIPLLIPRACIQDTKIAGYDIPAGTTVNVNAWAVSRDEKEWGPNPDEFRPERFLE KEVDFKGTDYEFIPFGSGRRMCPGMRLGAAMLEVPYANLLLSFNFKLPNGMKPDDINMDV MTGLAMHKSQHLKLVPEKVNKYM* D78598 CYP83A2 also T75944 EST at4g31500 MDLLLIIAGLVAAAAFFFLRSTTKKSLRLPPGPKGLPIIGNLHQMEKFNPQHFLFRLSKL 
YGPIFTMKIGGRRLAVISSAELAKELLKTQDLNFTARPLLKGQQTMSYQGRELGFGQYTA YYREMRKMCMVNLFSPNRVASFRPVREEECQRMMDKIYKAADQSGTVDLSELLLSFTNCV VCRQAFGKRYNEYGTEMKRFIDILYETQALLGTLFFSDLFPYFGFLDNLTGLSARLKKAF KELDTYLQELLDETLDPNRPKQETESFIDLLMQIYKDQPFSIKFTHENVKAMILDIVVPG TDTAAAVVVWAMTYLIKYPEAMKKAQDEVRSVIGDKGYVSEEDIPNLPYLKAVIKESLRL EPVIPILLHRETIADAKIGGYDIPAKTIIQVNAWAVSRDTAAWGDNPNEFIPERFMNEHK GVDFKGQDFELLPFGSGRRMCPAMHLGIAMVEIPFANLLYKFDWSLPKGIKPEDIKMDVM TGLAMHKKEHLVLAPTKHI U38416 CYP84A1 also AL022141, F23E13.110 at4g36220 MESSISQTLSKLSDPTTSLVIVVSLFIFISFITRRRRPPYPPGPRGWPIIGNMLMMDQLT HRGLANLAKKYGGLCHLRMGFLHMYAVSSPEVARQVLQVQDSVFSNRPATIAISYLTYDR ADMAFAHYGPFWRQMRKVCVMKVFSRKRAESWASVRDEVDKMVRSVSCNVGKPINVGEQI FALTRNITYRAAFGSACEKGQDEFIRILQEFSKLFGAFNVADFIPYFGWIDPQGINKRLV KARNDLDGFIDDIIDEHMKKKENQNAVDDGDVVDTDMVDDLLAFYSEEAKLVSETADLQN SIKLTRDNIKAIIMDVMFGGTETVASAIEWALTELLRSPEDLKRVQQELAEVVGLDRRVE ESDIEKLTYLKCTLKETLRMHPPIPLLLHETAEDTSIDGFFIPKKSRVMINAFAIGRDPT SWTDPDTFRPSRFLEPGVPDFKGSNFEFIPFGSGRRSCPGMQLGLYALDLAVAHILHCFT WKLPDGMKPSELDMNDVFGLTAPKATRLFAVPTTRLICAL* >AF267140.1 CYP84A4 ferulate-5-hydroxylase (CYP84) gene, partial Length = 1044 >CYP84A4 P1;AT5g04330 from MIPS (no Genbank accession number yet) ferulate-5-hydroxylase complete sequence (no ESTs) MLLKPYPPGP KGLPVIGNIL MMNQFNHRGL AKLSRIYGGL LHLRLGFSHI FVVSSPDIAR QVLQVQDHVF SNRPTTIAIR YLTYGGSDLA FCNYGPFWRR MRKLYVMMLF SRKRAESWVS VDEEVHKSVR LVASNVGKPL NICKLAFSLS RDITFRAAFG SSSSTSDESR LDEFLEIIQE FSKLFGEFNV ADYVPSWLSW IDPQGINGRV EKARKSLDGF IESVIDDHLH KKKREHDNVD EETDMVDQLL AFYEEEVKVN NSVTKINLDN IKGIIMDVMF GGTETVALAI EWVLTEILRS PENMKRVQDE LTSVVGLDRW RVEDTHLEKL TFLKCILKET LRLHPPFPLL LHETVKDTEI SGYFIPKGSR VMVNTYALGR DPNSWSDPES FNPGRFLNPI APDLKGNNFE FVPFGSGRRS CPGMQLGLYA FELAVAHLLH CFTWSLPDGM NPGDVDTVEG PGLTVPKAIP LVAVPTTRLL CPIVVS* Sequence AT5g04330 was extracted from sa_e_22 Fragment (39390-38758,40295-39468(C)) AB009048 CYP85A1 comp(57427-60263) GSS: B09951 MGAMMVMMGLLLIIVSLCSALLRWNQMRYTKNGLPPGTMGWPIFGETTEFLKQGPNFMRN 
QRLRYGSFFKSHLLGCPTLISMDSEVNRYILKNESKGLVPGYPQSMLDILGTCNMAAVHG SSHRLMRGSLLSLISSTMMRDHILPKVDHFMRSYLDQWNELEVIDIQDKTKHMAFLSSLT QIAGNLRKPFVEEFKTAFFKLVVGTLSVPIDLPGTNYRCGIQARNNIDRLLRELMQERRD SGETFTDMLGYLMKKEGNRYPLTDEEIRDQVVTILYSGYETVSTTSMMALKYLHDHPKAL QELRAEHLAFRERKRQDEPLGLEDVKSMKFTRAVIYETSRLATIVNGVLRKTTRDLEING YLIPKGWRIYVYTREINYDANLYEDPLIFNPWRWMKKSLESQNSCFVFGGGTRLCPGKEL GIVEISSFLHYFVTRYRWEEIGGDELMVFPRVFAPKGFHLRISPY >CYP85A2 AP002060 82% to Arabidopsis 85A1 Chr III clone T20F20 MGIMMMILGLLVIIVCLCTALLRWNQMRYSKKGLPPGTMGWPIF GETTEFLKQGPDFMKNQRLRYGSFFKSHILGCPTIVSMDAELNRYILMNESKGLVAGY PQSMLDILGTCNIAAVHGPSHRLMRGSLLSLISPTMMKDHLLPKIDDFMRNYLCGWDD LETVDIQEKTKHMAFLSSLLQIAETLKKPEVEEYRTEFFKLVVGTLSVPIDIPGTNYR SGFQARNNIDRLLTELMQERKESGETFTDMLGYLMKKEDNRYLLTDKEIRDQVVTILY SGYETVSTTSMMALKYLHDHPKALEELRREHLAIRERKRPDEPLTLDDIKSMKFTRAV IFETSRLATIVNGVLRKTTHDLELNGYLIPKGWRIYVYTREINYDTSLYEDPMIFNPW RWMEKSLESKSYFLLFGGGVRLCPGKELGISEVSSFLHYFVTKYRWEENGEDKLMVFP RVSAPKGYHLKCSPY K19M22.6 CYP86A1 GSS: AL088478 BAC end T2A13 also X90458 BE520915 M15G10STM developing seed cDNA clone M15G10 5' Length = 305 BE527724 M69E18STM developing seed cDNA clone 600036667R1 5' Length = 255 BE520916 M15G10TTM developing seed cDNA clone M15G10 3' Length = 208 MEALNSILTGYAVAALSVYALWFYFLSRRLTGPKVLPFVGSLPYLIANRSRIHDWIADNL RATGGTYQTCTMVIPFVAKAQGFYTVTCHPKNVEHILKTRFDNYPKGPMWRAAFHDLLGQ GIFNSDGDTWLMQRKTAALEFTTRTLRQAMARWVNGTIKNRLWLILDRAVQNNKPVDLQD LFLRLTFDNICGLTFGKDPETLSLDLPDNPFSVAFDTATEATLKRLLYTGFLWRIQKAMG IGSEDKLKKSLEVVETYMNDAIDARKNSPSDDLLSRFLKKRDVNGNVLPTDVLQRIALNF VLAGRDTSSVALSWFFWLVMNNREVETKIVNELSMVLKETRGNDQEKWTEEPLEFDEADR LVYLKAALAETLRLYPSVPQDFKYVVDDDVLPDGTFVPRGSTVTYSIYSIGRMKTIWGED CLEFRPERWLTADGERFETPKDGYKFVAFNAGPRTCLGKDLAYNQMKSVASAVLLRYRVF PVPGHRVEQKMSLTLFMKNGLRVYLQPRGEVLA CYP86A2 AF013293 AF195115 at4g00360 BE526773 M66C24STM developing seed cDNA clone 600035490R1 5' Length = 246 BE521721 M21B5STM developing seed cDNA clone M21B5 5' Length = 411 MDVSNTMLLVAVVAAYWLWFQRISRWLKGPRVWPVLGSLPGLIEQRDRMHDWITENLRAC 
GGTYQTCICAVPFLAKKQGLVTVTCDPKNIEHMLKTRFDNYPKGPTWQAVFHDFLGQGIF NSDGDTWLFQRKTAALEFTTRTLRQAMGRWVNRGIKLRFCPILETAQNNYEPVDLQDLIL RLTFDNICGLAFGKDTRTCAPGLPENGFASAFDRATEASLQRFILPEFLWRLKKWLGLGL EVSLSRSLGEIDGYLDAVINTRKQELLSQRESGVQRHDDLLSRFMKKKDQSYSETFLRHV ALNFILAGRDTSSVALSWFFWLITTHPTVEDKIVREICSVLIETRGTDVSSWTAEPLEFD EVDRLVYLKAALSETLRLYPSVPEDSKHVVNDDILPDGTFVPAGSSVTYSIYAAGRMKST WGEDCLEFKPERWISPDDGKFVNHDQYRFVAFNAGPRICLGKDLAYLQMKTIAAAVLLRH RLTVAPGHKVEQKMSLTLFMKNGLLVNVHKRDLEVMMKSLVPKERNDVVVLNGKCNGGIG EGVAVNAAVAVAV* B19916 GSS fragment like CYP86A GLLKEEEEFWKGQERIIITGMHSPGFVEHPFQTTPHLVRKELLKVMESFAKSXEAFDLQD VFLRLTFDIICLAVXGADPETLAIDLP B12879 GSS fragment mid region aa 142-218 similar to 86A TVRHPGVRWTNGGSKIRVGDILATDEDNAGPDDLQEILIGLTLDDMRRLALCEDTGTCAR GHPENGDASALDXATSA B09868 GSS fragment aa 157-207 mid region like 86A RLRQARGRGVDRGXXIRLCRVRARVQDKAEPCDVQELVERVTLDDICGLAFGKERGTWLR GYPESG CYP86A4 Y09581 AC061957 65912-67583 chromosome 1 clone F22L4 MEISNAMLLVAIVTGYWLWFKRISRWLKGPRVWPLLGSLPGLIEQRDRMHEWITENLRAC GGTYQTCIFAVPFLAKKQGLVTVTCDPKNLEHMLKTRFDNYPKGPTWQSVFHDLLGQGIF NSDGDTWLFQRKTAALEFTTRTLRQAMGRWVNRGIKLRFCPILAAAQDNAEPVDLQDLIL RLTFDNICGLAFGKDTRTCAPGLPENGFASAFDRATEASLQRFIIPKFMWKLKKWLGLGL EVSLSRSLGEIDEYLAAVINTRKQELMSQQESGTHQRHDDLLSRFMMKKTESYSDTFLQH VALNFILAGRDTSSVALSWFFWLITMHPTVEDKIVREICSVLIETRGTDDVASWTEVPLG FDEIDRLVYLKAAISETLRLYPSVPEDSKHVENDDVLPDGTFVPAGSSLTYSIYAAGRMK STWGEDCLEFNPERWISPIDGKFINHDQYRFVAFNAGPRICLGKDLAYLQMKTIAAAVLL RHRLTVVPGHKVEQKMSLTLFMKNGLLVNLYKR*
B09869 GSS fragment N-TERM to mid region CYP86A-like
86A4-like; only 2 diffs from RTLRQ onward
RMQDWITETLRGGGGTYRTCFLPGPFLGXKQGLVTVNVEPRNLEHMFKTRFDNTLKVLTC
NPFFHDLLVQGIFNSDRDTWFFQRKTAAVEFTTRTLRQAMGRWVNRGIKLRFCPILATAQ
DNAEPVDLQDLIXRLTFDNICGLAFGKDTRTCAPGLPENGFASAFDRATEASLQRFVIPK
FMWKLKKWLKLGLEVSLSRS
CYP86A7 AC011622 43171-44742 clone F24D7 EST F14165 MDGSTAAIILTLIVTYIIWFVSLRRSYKGPRVWPLVGSLPALITNAHRMHDFIADNLRMC GGTYQTCIFPIPFLAKKQGHVTVTCDPKNLEHILKTRFDNYPKGPSWQSVFHDLLGDGIF NSDGDTWRFQRKTAALEFTTRTLRQAMARWVDRAIKNRLVPILESARSRAEPIDLQDVLL RLTFDNICGLTFGKDPRTLSPEFPENGFAVAFDGATEATLQRFIMPEFIWKIRKWLRLGL EDDMSRSISHVDNYLSEIINTRKLELLGQQQDESRHDDLLSRFMKKKESYSDKYLKYVAL NFILAGRDTSSVAMSWFFWLVSLNPRVEEKIINEICTILIKTRDTNVSKWTDEPLTFDEI DQLVYLKAALSETLRLYPSVPEDSKFVVANDVLPDGTFVPSGSNVTYSIYSVGRMKFIWG EDCLEFKPERWLEESRDEKCNQYKFVAFNAGPRICLGKDLAYLQMKSITASILLRHRLTV APGHRVEQKMSLTLFMKFGLKMDVHKRDLTLPVEKVVNEMRKK* AC004665 CYP86A8 61000-63000 Z17399 AC005397 109407..110149 AC005861 77% identical to 86A2 GSS: AL089168 BAC end T3F17 MEISTALMILSAITAYFLWLTFISRCLKGPRVWPILGSLPGLIENCERMHDWISDNLRAC SGTYQTCICAIPFLAKKQGLVTVTCDPRNLEHILKNRFDNYPKGPTWQAVFHDLLGQGIF NSDGDTWLFQRKTAALEFTTRTLRQAMARWVNRAIKLRFLPILENARLGSEPIDLQDLLL RLTFDNICGLTFGKDPRTCAPGLPVNTFAVAFDRATEASLQRFILPEILWKFKRWLRLGL EVSLTRSLVQVDNYLSEIITTRKEEMMTQHNNGKHHDDLLSRFIKKKESYSDETLQRVAL NFILAGRDTSSVALSWFFWLITQHPAIEDKILREICTVLVETRGDDVALWTDEPLSCEEL DRLVFLKAALSETLRLYPSVPEDSKRAVKDDVLPDGTFVPAGSSITYSIYSAGRMKSTWG EDCLEFKPERWISQSDGGRFINHDPFKFVAFNAGPRICLGKDLAYLQMKSIASAVLLRHR LTVVTGHKVEQKMSLTLFMKYGLLVNVHERDLTAIAADLRECKSNVVNDGVGNGVSS* AB007648 CYP86B1 BE524312 M48E10STM developing seed cDNA clone M48E10 5' Length = 348 BE529089 M73H04STM developing seed cDNA clone 600038312R1 5' Length = 253 BE525954 M64G05STM developing seed cDNA clone 600034520R1 5' Length = 256 BE529637 M75C18STM developing seed MNFNSSYNLTFNDVFFSSSSSSDPLVSRRLFLLRDVQILELLIAIFVFVAIHALRQKKYQ GLPVWPFLGMLPSLAFGLRGNIYEWLSDVLCLQNGTFQFRGPWFSSLNSTITCDPRNVEH LLKNRFSVFPKGSYFRDNLRDLLGDGIFNADDETWQRQRKTASIEFHSAKFRQLTTQSLF ELVHKRLLPVLETSVKSSSPIDLQDVLLRLTFDNVCMIAFGVDPGCLGPDQPVIPFAKAF EDATEAAVVRFVMPTCVWKFMRYLDIGTEKKLKESIKGVDDFADEVIRTRKKELSLEGET TKRSDLLTVFMGLRDEKGESFSDKFLRDICVNFILAGRDTSSVALSWFFWLLEKNPEVEE KIMVEMCKILRQRDDHGNAEKSDYEPVFGPEEIKKMDYLQAALSEALRLYPSVPVDHKEV 
VQEDDVFPDGTMLKKGDKVIYAIYAMGRMEAIWGKDCLEFRPERWLRDGRFMSESAYKFT AFNGGPRLCLGKDFAYYQMKSTAAAIVYRYKVKVVNGHKVEPKLALTMYMKHGLMVNLIN RSVSEIDQYYAKSFDEGYIN* AL357612.1 CYP86B2 CDS complement(join(80435..80836,81200..82450)) gene= T22D6_190 chromosome V 78% to 86B1 also on AL392174.1 F8L15 MNLSFSSYNLTATAVSSSCFGSDELLVSRRLFSLRDVQILELFI AFFVFATIHSLRQKKHQGMPVWPLVGMLPSLISAVRSNIYEWLSDVLISQNGTFRFRG PWFSTLNCVVTCDPRNVEHLLKTRFSIYPKGSYFRETMQDLLGDGIFNTDDGTWQRQR KAASVEFHSAKFRQLTSQSLHELVHNRLLPVLETSGKIDLQDILLRLTFDNVCMIAFG VDPGCLSPKLPEIPFAKAFEDATEATVVRFVMPKFVWKLMRSLNLGTEKKLKESINGV DDFAEEVIRTRKKEMSLETEIAKRPDLLTIFMGLRDENGQKFSDKFLRDICVNFILAG RDTSSVALSWFFWLIEKNPEVEEKIMMGICKILEQRVDHGDTKKNMEYEPVFRPEEIK KMDYLQAALSETLRLYPSVPVDHKEVLEDDVFPDGTKLKKGEKVIYAIYAMGRMETIW GKDCREFKPERWLRDGRYMSESAYKFTAFNGGPRLCLGKDFAYYQMRYVAAAIIYRYK VRVDDKGGHKVEPKMALTMYMKHGLKVNMVKRSVSEIDHYYC AC000103 CYP86C1 MNVLISAVVWVYTHLRLSDVALALVGLFLLSYLREKLVSKGGPVMWPVLGIIPMLALNKH DLFTWCTRCVVRSGGTFHYRGIWFGGAYGIMTADPANVEHILKTNFKNYPKGAFYRERFR DLLEDGIFNADDELWKEERRVAKTEMHSSRFLEHTFTTMRDLVDQKLVPLMENLSTSKRV FDLQDLLLRFTFDNICISAFGVYPGSLETGLPEIPFAKAFEDATEYTLARFLIPPFVWKP MRFLGIGYERKLNNAVRIVHAFANKTVRERRNKMRKLGNLNDYADLLSRLMQREYEKEED TTRGNYFSDKYFREFCTSFIIAGRDTTSVALVWFFWLVQKHPEVEKRILREIREIKRKLT TQETEDQFEAEDFREMVYLQAALTESLRLYPSVPMEMKQALEDDVLPDGTRVKKGARIHY SVYSMGRIESIWGKDWEEFKPERWIKEGRIVSEDQFKYVVFNGGPRLCVGKKFAYTQMKM VAAAILMRYSVKVVQGQEIVPKLTTTLYMKNGMNVMLQPRDW* AB024038 1-1074 CYP86C2 ESTs Z26059 Z26058 no GSS 59% identical to 86C1 Z26059 and AP000601 provide the N-terminal sequence MSLIDRISGHLCLQDVSIALLGLFLFGCLQAKLTNKGEPVQWPVFGITPEFFLHVHDMFG WVTRCLTKSRGTFPYQGIWFSGSYGAMTSVPANIEYMLKTNFKNFPKGEFYKERFRDLLE DGIFNADDESWKEQRRIIITEMHSTRFVDHSFQTTRDLIERKLLKVMESFSKSQEAFDLQ EILLRLTFDNICIAGLGDDPGTLDDDLPHVPFAKAFEEATESTLFRFMIPPFVWKPMKFF DIGYEKGLREAVETVHNFIDKMVVERIAMLKDQGTLANSKSDVLSRLIQIESHKRGDEND RFTAKFFRQFCTSFILAGRDTSSVALTWFFWLITKHPEVETKILHEIREILNQREKNKYK LDETGEKESSRHFTVKELNDMVYLQAALSESLRLYPPIPMEMKQATEEDVFPDGTFLRKG SRVYFSVYAMGRMESIWGKDCEMFKPERWIQGGQYVSDDQFKYVVFNAGPRLCLGKTFAY 
LQMKMVAASILLNYSIKVDQDHVVVPRVTTTLYMKHGLKVRITPRSLEEKKQDS* AC007357 comp(51369-52951)CYP86C3 76% to 86C2 no ESTs no GSSs MPLTERVYNHLCLSDVSLALLGLFVFCCVREKVTKKLGPTIWPVFGITPEFFFHRNDVYG WATRCLKKCRGTFLYNGIWLGGSYGAVTCVPANVEYMLKTNFKNFPKGAFFKERFNDLLE DGIFNADAESWKEQRRIIITEMHSTRFVEHSFQTTQDLVRKKLLKVMESFARSQEAFDLQ DVLLRLTFDNICIAGLGDDPGTLDSDLPLVPFAQAFEEATESTMFRFMIPPFIWKPLKFF DIGYEKGLRKAVDVLHEFVDKMVVDRICKLKEEGTLGNRSDVLSRIIEIESHKTTDEKDP STIKFFRQFCTSFILAGRDTSSVALTWFFWVIQKHPEVENKIIREISEILRQRGDSPTSK NESLFTVKELNDMVYLQAALSETMRLYPPIPMEMKQAIEDDVFPDGTFIRKGSRVYFATY AMGRMESIWGKDCESFKPERWIQSGNFVNDDQFKYVVFNAGPRLCLGKTFAYLQMKTIAA SVLSRYSIKVAKDHVVVPRVTTTLYMRHGLKVTISSKSLEEKIHVQD* AC007357 comp(54696-56285) CYP86C4 like 86C2 78% no ESTs no GSSs MSLTERLYNHLSLFDLLLSLLGLFVFCCLREKLTNKRGPMLWPVFGITLEFFFHINDVYG WVTRSLKKCRGTFLYRGVWLDGSYGAVTCVPANVEYMLKTNFKNFPKGTFFKSRFNDLLE EGIFNADDDSWKEQRRIIITEMHSTGFVEHSFQTTQHLVRKKLLKVMESFAKSQEAFDLQ DVFLRLTFDIICLAGLGADPETLAVDLPQVPFAKAFEEATESTLFRFMIPPFIWKPMRFL DTGYEKGLRIAVGVVHGFVDKMIVDRICELKEEETLDNRSDVLSRIIQIESHKRENEIDP STIRFFRQFCTSFILAGRDTSSVALSWFCWVIQKHPEVENKIICEIREILRQRGDSPTSK NESLFTVKELNNMVYLQAALSETLRLFPPIPMEMKQAIEDDVLPDGTFVRKGSRVYFSIY AMGRMESIWGKDCEIFRPERWIQAGKFVSDDQFKYVVFNAGPRLCIGKTFAYLQMKMIAA SVLLRYSIKVVQDHVIAPRVTTNLYMKYGLKVTITPRSLEEKKLESCSM* >CYP704B1 AC073178.3 68191-70140 chromosome I clone IGF-F10D13 39% to 86A1, 38% to 86A7, 38% to 86A8, 38% to 94D1 ESTs AA728483 MSLCLVIACMVTSWIFLHRWGQRNKSGPKTWPLVGAAIEQLTNFDRMHDWLVEYLYNSRT VVVPMPFTTYTYIADPINVEYVLKTNFSNYPKGETYHSYMEVLLGDGIFNSDGELWRKQR KTASFEFASKNLRDFSTVVFKEYSLKLFTILSQASFKEQQVDMQELLMRMTLDSICKVGFGVEI GTLAPELPENHFAKAFDTANIIVTLRFIDPLWKMKKFLNIGSEALLGKSIKVVNDFTYSV IRRRKAELLEAQISPTNNNNNNNNKVKHDILSRFIEISDDPDSKETEKSLRDIVLNFVIA GRDTTATTLTWAIYMIMMNENVAEKLYSELQELEKESAEATNTSLHQYDTEDFNSFNEKV TEFAGLLNYDSLGKLHYLHAVITETLRLYPAVPQDPKGVLEDDMLPNGTKVKAGGMVTYV PYSMGRMEYNWGSDAALFKPERWLKDGVFQNASPFKFTAFQAGPRICLGKDSAYLQMKMA MAILCRFYKFHLVPNHPVKYRMMTILSMAHGLKVTVSRRS AC025417 82132-84239 CYP87A2 complete sequence ESTs H77048, N37614 and AI995175 
MWALLIWVSLLLISITHWVYSWRNPKCRGKLPP GSMGFPLLGESIQFFKPNKTSDIPPFIKERVK RYGPIFKTNLVGRPVIVSTDADLSYFVFNQEGRCFQSWYPDTFTHIFGKKNVGSLHGFMY KYLKNMVLTLFGHDGLKKMLPQVE MTANKRLELWSNQYSVELKDATAS MIFDLTAKKLISHDPDKSSENLRANFVAFIQGLISFPFDIPGTAYHKCLQ GRAKAMKMLRNMLQERRENPRKNPSDFFDYVIEEIQ KEGTILTEEIALDLMFVLLFASFETTSLALTLAIKFLSDDPEVLKRLTEEHE TILRNREDADSGLTWEEYKSMTYTFQFINGTARLANIVPAIFRKALRDIKFKDYTIPAGW AVMVCPPAVHLNPEMYKDPLVFNPSRWEGSKVTNASKHFMAFGGGMRFCVGTDFTKLQMA AFLHSLVTKYRWEEIKGGNITRTPGLQFPNGYHVKLHKKRD* AC010926 CYP87A3P chromosome I clone IGF-F28P22 GSS B08375 B10696 This appears to be a pseudogene remnant with two small exons and a frame shift This accession number is now in version 10 and this does not contain this sequence Anymore. The sequence is still in AC010795 64524 NFMAFGDGMRLCIGADFS 64585 KLQTAVFLHSSVTKYRWEETSGGNILRTPGLQ 64764 64765 FPNGYHVKLKKK 64800 AC010795 87A3P chromosome I BAC F16M19 genomic sequence, complete sequence Length = 100512 Query: 98 HFMAFGGGMRFCVGTDFTKLQ 118 +FMAFG GMR C+G DF+K+Q Sbjct: 40865 NFMAFGDGMRLCIGADFSKVQ 40803 Query: 116 KLQMAAFLHSLVTKY----------------------------RWEEIKGGNITRTPGLQ 147 KLQ A FLHS VTKY RWEE GGNI RTPGLQ Sbjct: 40804 KLQTAVFLHSSVTKYSLKLLLFHLLS*DN*WFLIL*FCFLDFIRWEETSGGNILRTPGLQ 40625 Query: 148 FPNGYHVKLHKK 159 FPNGYHVKL KK Sbjct: 40624 FPNGYHVKLKKKEI* 40589 NFMAFGDGMRLCIGADFSKVQKLQTAVFLHSSVTKYRWEETSGGNILRTPGLQFPNGYHVKLKKKEI* AC000098 CYP88A3 also B12705 and B12104 GSS fragments AND EST N96065, BE523418 M36G11STM developing seed cDNA clone M36G11 5' Length = 420 BE526247 M64N15STM developing seed cDNA clone 600034863R1 5' Length = 239 BE527416 M68B02STM developing seed cDNA clone 600036301R1 5' Length = 274 MAETTSWIPVWFPLMVLGCFGLNWLVRKVNVWLYESSLGENRHYLPPGDLGWPFIGNMLS FLRAFKTSDPDSFTRTLIKRYGPKGIYKAHMFGNPSIIVTTSDTCRRVLTDDDAFKPGWP TSTMELIGRKSFVGISFEEHKRLRRLTAAPVNGHEALSTYIPYIEENVITVLDKWTKMGE FEFLTHLRKLTFRIIMYIFLSSESENVMDALEREYTALNYGVRAMAVNIPGFAYHRALKA RKTLVAAFQSIVTERRNQRKQNILSNKKDMLDNLLNVKDEDGKTLDDEEIIDVLLMYLNA 
GHESSGHTIMWATVFLQEHPEVLQRAKAEQEMILKSRPEGQKGLSLKETRKMEFLSQVVD ETLRVITFSLTAFREAKTDVEMNGYLIPKGWKVLTWFRDVHIDPEVFPDPRKFDPARWDN GFVPKAGAFLPFGAGSHLCPGNDLAKLEISIFLHHFLLKYQVKRSNPECPVMYLPHTRPT DNCLARISYQ* AC005700 CDS 88A4 gene="T32F6.4" AL088126 BAC end T1G2 AL088480 BAC T2A14 MTETGLILMWFPLIILGLFVLKWVLKRVNVWIYVSKLGEKKHYLPPGDLGWPVIGNMWSF LRAFKTSDPESFIQSYITRYGRTGIYKAHMFGYPCVLVTTPETCRRVLTDDDAFHIGWPK STMKLIGRKSFVGISFEEHKRLRRLTSAPVNGPEALSVYIQFIEETVNTDLEKWSKMGEI EFLSHLRKLTFKVIMYIFLSSESEHVMDSLEREYTNLNYGVRAMGINLPGFAYHRALKAR KKLVAAFQSIVTNRRNQRKQNISSNRKDMLDNLIDVKDENGRVLDDEEIIDLLLMYLNAG HESSGHLTMWATILMQEHPMILQKAKEEQERIVKKRAPGQKLTLKETREMVYLSQVIDET LRVITFSLTAFREAKSDVQMDGYIIPKGWKVLTWFRNVHLDPEIYPDPKKFDPSRWEGYT PKAGTFLPFGLGSHLCPGNDLAKLEISIFLHHFLLKYRVERSNPGCPVMFLPHNRPKDNC LARITRTMP B11817 GSS FRAGMENT N-TERMINAL LIKE CYP88 MAETTNWIPVWFPLMVLGCFGLNWLVTKVNVWLYDSTLGETGTILQPVELGWPFIGNMLS FLRAFK B11581 GSS FRAGMENT PROBABLE CYP88 PSEUDOGENE KEVFLR*VVDKTLRVITFSLTAFRKAKTDVEMNG*LIPNGWNVLTWF*DVPLDPEVWPNS REFDPARWEDGVLYQTRVRXLPLGAGSHLLPGI*SGPRSNISIFLHHFSPQY U61231 CYP89A2 this sequence is probably same gene as CYP89A8X but this one has sequence errors. use AC006193 sequence. BE521069 M16F9STM developing seed cDNA clone M16F9 5' Length = 391 BE521070 M16F9XP developing seed cDNA clone M16F9 3' Length = 389 MEIWLLILASLSGSLLFHLLLRRRNSSSPPLPPDPNFLPFLGTLQWLREGLGGLESYLRS VHHRLGPIVTLRITSRPAIFVADRSLTHEALVLNGGPSMRTVHHLPVISKIVDEHNISSG SYGATWRLLRRNITSEILHPSRVRSYSHARHWVLEILFERFRNHGGDEPIVLIHHLHYAM FALLVLMCFGDKLDEKQIKQVEFIQRLQLLSLSRFNIFNIWPKFTKLILRKRWQEFLQIR RQQRDILLPLIRARRRSLKREKRSEEEDKKDYVQSYVDTLLDLELPEENRKLNEEDIMNL CSEFLTAGTDTTATALQWIMANLVKYPEIQERLHEEIKSVVGRSKGGGEEDVEKMPYLKA LVLEGLRRHPPGHFLLPHSVTEDTVLGGYKVPKNGTINFMVAEIGRDPVEWEEPMAFKPE RFMGEEEAVDLTGSRGIKMMPFGAGRRICPGIGLAMLHLEYYVANMVREFQWKEVEGHEV DLTEKLEFTVVMKHPLKALAVPR* AC006193 CYP89A2 corrected = CYP89A8X comp(66014-67477+) 94% identical to 89A2 probably same gene with seq. 
errors in U61231, since 3 prime and 5 prime untranslated sequences are nearly identical and two longer regions with differences are due to frame shifts MEIWLLILASLSGSLLLHLLLRRRNSSSPPLPPDPNFLPFLGTLQWLREGLGGLESYLRS VHHRLGPIVTLRITSRPAIFVADRSLTHEALVLNGAVYADRPPPAVISKIVDEHNISSGS YGATWRLLRRNITSEILHPSRVRSYSHARHWVLEILFERFRNHGGEEPIVLIHHLHYAMF ALLVLMCFGDKLDEKQIKEVEFIQRLQLLSLTKFNIFNIWPKFTKLILRKRWQEFLQIRR QQRDVLLPLIRARRKIVEERKRSEQEDKKDYVQSYVDTLLDLELPEENRKLNEEDIMNLC SEFLTAGTDTTATALQWIMANLVKYPEIQERLHEEIKSVVGEEAKEVEEEDVEKMPYLKA VVLEGLRRHPPGHFLLPHSVTEDTVLGGYKVPKNGTINFMVAEIGRDPVEWEEPMAFKPE RFMGEEEAVDLTGSRGIKMMPFGAGRRICPGIGLAMLHLEYYVANMVREFQWKEVQGHEV DLTEKLEFTVVMKHPLKALAVPRRCH AB010073 CYP89A3 no ESTs MATWFHIFCVSFTVNFLLYLFFRRTNNNLPLPPNPNFFPMPGPFQWLRQGFDDFYSYLRS IHHRLGPIISLRIFSVPAIFVSDRSLAHKALVLNGAVFSDRPPALPTGKIITSNQHTISS GSYGATWRLLRRNLTSEILHPSRVKSYSNARRSVLENLCSRIRNHGEEAKPIVVVDHLRY AMFSLLVLMCFGDKLDEEQIKQVEFVQRRELITLPRFNILNVFPSFTKLFLRKRWEEFLT FRREHKNVLLPLIRSRRKIMIESKDSGKEYIQSYVDTLLDLELPDEKRKLNEDEIVSLCS EFLNAGTDTTATTLQWIMANLVRNQEIQKRLYEEIKSVIGEEEEKEIEEEEMKKMPYLKA VVLEGLRLHPPGHLLLPHRVSEDTELGGYRVPKKGTFNINVAMIGRDPTVWEEPMEFKPE RFIGEDKEVDVTGSRGIKMMPFGAGRRICPGIGSAMLHLEYFVVNLVKEFEWKEVEGYEV DLSEKWEFTVVMKYPLKALAVTRRKEKTHIVMA* AC005897 CYP89A4 AC007045 10850-12400 BAC "F23M2" MEIWLLILGSLFLSLLLNLLLFRRRDSSSLPLPPDPNFFPFLGTLQWLRQGLGGLNNYLR SVHHRLGPIITLRITSRPAIFVADRSLAHQALVLNGAVFADRPPAAPISKIISSNQHNIS SSLYGATWRLLRRNLTSEILHPSRVRSYSHARRWVLEILFDRFGKSRGEEPIVVVDHLHY AMFALLVLMCFGDKLDEKQIKQVEYVQRRQLLGFSRFNILNLWPKFTKLILRKRWEEFFQ MRREQHDVLLPLIRARRKIVEERKNRSSEEEEDNKVYVQSYVDTLLELELPDEKRKLNED EIVSLCSEFLNGGTDTTATALQWIMANLVKNPEIQKRLYEEIKSVVGEEAKEVEEEDAQK MPYLKAVVMEGLRRHPPGHFVLPHSVTEDTVLGGYKVPKKGTINFMVAEIGRDPMVWEEP MAFKPERFMGEEEAVDITGSRGIKMMPFGAGRRICPGIGLAMLHLEYYVANMVREFEWKE VQGHEVDLTEKFEFTVVMKHSLKALAVLRRS AC006193 CYP89A5 comp(51700-53166+) T42576 92% identical to CYP89A4 EST H77059 AI992955 AV534518 GSS AL087677 MEIWLLILGSLFLSLLLNLLFFRLRDSSSLPLPPDPNYFPFIGTIQWLRQGLGGLNNYLR 
SVHHRLGPIITLRITSRPSIFVADRSLAHQALVLNGAVFADRPPAAPISKIISSNQHNIS SCLYGATWRLLRRNLTSEILHPSRVRSYSHARRWVLEILFDRFGKNGRGEEPIVVVDHLH YAMFALLVLMCFGDKLDEKQIKQVEYVQRRQLLGFSRFNILNLWPKFTKLILRKRWEEFF QMRREQHDVLLPLIRARRKIVEERKNRSSEEEEDNKEYVQSYVDTLLELELPDEKRKLNE DEIVSLCSEFLNGGTDTTATALQWIMANLVKNPDIQKRLYEEIKSVVGEEANEVEEEDAQ KMPYLEAVVMEGLRRHPPGHFVLPHSVTEDTVLGGYKVPKNGTINFMVAEIGRDPKVWEE PMAFKPERFMEEAVDITGSRGIKMMPFGAGRRICPGIGLAMLHLEYYVANMVREFDWKEV QGHEVDLTEKLEFTVVMKHPLKALAVPRRSH* AC006193 CYP89A6 comp(55495-56982+) MEIWLLILGSLFLSLLVNHLLFRRRDSFSSLPLPPDPNFFPFIGTLKWLRKGLGGLDNYL RSVHHHLGPIITLRITSRPAIFVTDRSLAHQALVLNGAVFADRPPAESISKIISSNQHNI SSCLYGATWRLLRRNLTSEILHPSRLRSYSHARRWVLEILFGRFGKNRGEEPIVVVDHLH YAMFALLVLMCFGDKLDEKQIKQVEYVQRRQLLGFSRFNILTLWPKFTKLIYRKRWEEFF QMQSEQQDVLLPLIRARRKIVDERKKRSSEEEKDNKEYVQSYVDTLLDVELPDEKRKLNE DEIVSLCSEFLNAGTDTTATALQWIMANLVKNPEIQRRLYEEIKSIVGEEAKEVEEQDAQ KMPYLKAVVMEGLRRHPPGHFVLPHSVTEDTVLGGYKVPKKGTINFMVAEIGRDPKVWEE PMAFKPERFMEEAVDITGSRGIKMMPFGAGRRICPGIGLAMLHLEYYVANMVREFEWQEV QGHEVDLTEKLEFTVVMKHPLKALAVPRRSH* AC006193 CYP89A7 comp(58358-59827+) MEIWLLILGSLSLSLLLNLLFFRLRDSSSLPLPPAPNFFPFLGTLQWLRQGLGGFNNYVR SVHHRLGPIITLRITSRPAIFVADGSLAHQALVLNGAVFADRPPAAPISKILSNNQHTIT SCLYGVTWRLLRRNITEILHPSRMKSYSHVRHWVLEILFDRLRKSGGEEPIVVFDHLHYA MFAVLVLMCFGDKLDEKQIKQVEYVQRQMLLGFARYSILNLCPKFTKLILRKRWEEFFQM RREQQDVLLRLIYARRKIVEERKKRSSEEEEENKEYVQSYVDTLLDVELPDEKRKLNEDE IVSLCSEFLIAGSDTTATVLQWIMANLVKNQEIQERLYEEITNVVGEEAKVVEEKDTQKM PYLKAVVMEALRRHPPGNTVLPHSVTEDTVLGGYKVPKKGTINFLVAEIGRDPKVWEEPM AFKPERFMGEEEAVDITGSRGIKMMPFGAGRRICPGIGLAMLHLEYYVANMVREFQWKEV EGHEVDLTEKVEFTVIMKHPLKAIAVPRRSH* CYP89A9 AC009895 37255-38905 T21P5 54% to 89A4 ESTs T76749, T44110, T45663, T14186, T21361, AA394479 MEITTIIFLIISSLTFSIFLKLIFFFSTHKLPPGPPRFPVIGNIIWLKKNNFSDFQGVLR DLASRHGPIITLHVGSKPSIWVTDRSLAHQALVQNGAVFSDRSLALPTTKVITSNQHDIH SSVYGSLWRTLRRNLTSEILQPSRVKAHAPSRKWSLEILVDLFETEQREKGHISDALDHL RHAMFYLLALMCFGEKLRKEEIREIEEAQYQMLISYTKFSVLNIFPSVTKFLLRRKWKEF LELRKSQESVILRYVNARSKETTGDVLCYVDTLLNLEIPTEEKEGGKKRKLSDSEIVSLC 
SEFLNAATDPTATSMQWIMAIMVKYPEIQRKVYEEMKTVFAGEEEEREEIREEDLGKLSY LKAVILECLRRHPPGHYLSYHKVTHDTVLGGFLIPRQGTINFMVGEMGRDPKIWEDPLTF KPERFLENGEACDFDMTGTREIKMMPFGAGRRMCPGYALSLLHLEYYVANLVWKFEWKCV EGEEVDLSEKQQFITMVMKNPFKANIYPRRK* X87367 CYP90A1 Also AB005237, X87368 GSS: AL087931 BAC F15D22 BE528852 M72L09STM developing seed clone 600038038R1 5' Length = 254 BE527890 M69F17STM developing seed cDNA clone 600036867R1 5' Length = 239 MAFTAFLLLLSSIAAGFLLLLRRTRYRRMGLPPGSLGLPLIGETFQLIGAYKTENPEPFI DERVARYGSVFMTHLFGEPTIFSADPETNRFVLQNEGKLFECSYPASICNLLGKHSLLLM KGSLHKRMHSLTMSFANSSIIKDHLMLDIDRLVRFNLDSWSSRVLLMEEAKKITFELTVK QLMSFDPGEWSESLRKEYLLVIEGFFSLPLPLFSTTYRKAIQARRKVAEALTVVVMKRRE EEEEGAERKKDMLAALLAADDGFSDEEIVDFLVALLVAGYETTSTIMTLAVKFLTETPLA LAQLKEEHEKIRAMKSDSYSLEWSDYKSMPFTQCVVNETLRVANIIGGVFRRAMTDVEIK GYKIPKGWKVFSSFRAVHLDPNHFKDARTFNPWRWQSNSVTTGPSNVFTPFGGGPRLCPG YELARVALSVFLHRLVTGFSWVPAEQDKLVFFPTTRTQKRYPIFVKRRDFAT* CYP90B1 AF044216 clone T03A05 from EU Chr 3 project GSS B77695 T28K14 AL132979 chromosome 3, BAC clone T3A5 comp(28999-26096) 8 exons MFETEHHTLLPLLLLPSLLSLLLFLILLKRRNRKTRFNLPPGKSGWPFLGETIGYLKPYT ATTLGDFMQQHVSKYGKIYRSNLFGEPTIVSADAGLNRFILQNEGRLFECSYPRSIGGIL GKWSMLVLVGDMHRDMRSISLNFLSHARLRTILLKDVERHTLFVLDSWQQNSIFSAQDEA KKFTFNLMAKHIMSMDPGEEETEQLKKEYVTFMKGVVSAPLNLPGTAYHKALQSRATILK FIERKMEERKLDIKEEDQEEEEVKTEDEAEMSKSDHVRKQRTDDDLLGWVLKHSNLSTEQ ILDLILSLLFAGHETSSVAIALAIFFLQACPKAVEELREEHLEIARAKKELGESELNWDD YKKMDFTQCVINETLRLGNVVRFLHRKALKDVRYKGYDIPSGWKVLPVISAVHLDNSRYD QPNLFNPWRWQQQNNGASSSGSGSFSTWGNNYMPFGGGPRLCAGSELAKLEMAVFIHHLV LKFNWELAEDDQPFAFPFVDFPNGLPIRVSRIL Z99708 AB008097 CYP90C1 and B25656 AQ011208 GSS fragments at4g36380 MQPPASAGLFRSPENLPWPYNYMDYLVAGFLVLTAGILLRPWLWFRLRNSKTKDGDEEED NEEKKKGMIPNGSLGWPVIGETLNFIACGYSSRPVTFMDKRKSLYGKVFKTNIIGTPIII STDAEVNKVVLQNHGNTFVPAYPKSITELLGENSILSINGPHQKRLHTLIGAFLRSPHLK DRITRDIEASVVLTLASWAQLPLVHVQDEIKKMTFEILVKVLMSTSPGEDMNILKLEFEE FIKGLICIPIKFPGTRLYKSLKAKERLIKMVKKVVEERQVAMTTTSPANDVVDVLLRDGG DSEKQSQPSDFVSGKIVEMMIPGEETMPTAMTLAVKFLSDNPVALAKLVEENMEMKRRKL 
ELGEEYKWTDYMSLSFTQNVINETLRMANIINGVWRKALKDVEIKGYLIPKGWCVLASFI SVHMDEDIYDNPYQFDPWRWDRINGSANSSICFTPFGGGQRLCPGLELSKLEISIFLHHL VTRYSWTAEEDEIVSFPTVKMKRRLPIRVATVDDSASPISLEDH z99708 90C1 join(177418.177527,178800.179124,179230.179379, 179473.179733,180222.180311,180454.180532, 180605.180711, 180834.180964,181326.181446) /gene="C7A10.980" also AL022141 F23E13.220 fragment of 90C1 gene runs off end of cosmid MIPNGSLGWPVIGETLNFIACGYSSRPVTFMDKRKSLYGKVFKTNIIGTPIIISTDAEVN KVVLQNHGNTFVPAYPKSITELLGENSILSINGPHQKRLHTLIGAFLRSPHLKDRITRDI EASVVLTLASWAQLPLVHVQDEIKKMTFEILVKVLMSTSPGEDMNILKLEFEEFIKGLIC IPIKFPGTRLYKSLKAKERLIKMVKKVVEERQVAMTTTSPANDVVDVLLRDGGDSEKQSQ PSDFVSGKIVEMMIPGEETMPTAMTLAVKFLSDNPVALAKLVEENMEMKRRKLELGEEYK WTDYMSLSFTQNVINETLRMANIINGVWRKALKDVEIKGYLIPKGWCVLASFISVHMDED IYDNPYQFDPWRWDRINGSANSSICFTPFGGGQRLCPGLELSKLEISIFLHHLVTRYSWT AEEDEIVSFPTVKMKRRLPIRVATVDDSASPISLEDH CYP90D1 AP001307 comp(43183-45689) 51% to 90C ESTs = AI994541, R30379, R30380 MDTSSSLLFFSFFFFIIIVIFNKINGLRSSPASKKKLNDHHVTSQSHGPKFPHGSLGWPV IGETIEFVSSAYSDRPESFMDKRRLMYGRVFKSHIFGTATIVSTDAEVNRAVLQSDSTAF VPFYPKTVRELMGKSSILLINGSLHRRFHGLVGSFLKSPLLKAQIVRDMHKFLSESMDLW SEDQPVLLQDVSKTVAFKVLAKALISVEKGEDLEELKREFENFISGLMSLPINFPGTQLH RSLQAKKNMVKQVERIIEGKIRKTKNKEEDDVIAKDVVDVLLKDSSEHLTHNLIANNMID MMIPGHDSVPVLITLAVKFLSDSPAALNLLTVEENMKLKSLKELTGEPLYWNDYLSLPFT QKVITETLRMGNVIIGVMRKAMKDVEIKGYVIPKGWCFLAYLRSVHLDKLYYESPYKFNP WRWQERDMNTSSFSPFGGGQRLCPGLDLARLETSVFLHHLVTRFWIAEEDTIINFPTVHM KNKLPIWIKRI* AB010697 CYP93D1 MOJ9, comp(15190-16802) note July 16, 1999: This sequence lies adjacent to CYP712A2 on clone MOJ9. The names of these two sequences were accidentally swapped. This has now been corrected. 
93D1 starts with MVDLQY CYP712A2 starts with MFTLNLDMNYTS MVDLQYFSVIILVCLGITVLIQAITNRLRDRLPLPPSPTALPIIGHIHLLGPIAHQALHK LSIRYGPLMYLFIGSIPNLIVSSAEMANEILKSNELNFLNRPTMQNVDYLTYGSADFFSA PYGLHWKFMKRICMVELFSSRALDSFVSVRSEELKKLLIRVLKKAEAEESVNLGEQLKEL TSNIITRMMFRKMQSDSDGGEKSEEVIKMVVELNELAGFFNVSETFWFLKRLDLQGLKKR LKNARDKYDVIIERIMEEHESSKKNATGERNMLDVLLDIYEDKNAEMKLTRENIKAFIMN IYGGGTDTSAITVEWALAELINHPEIMKKAQQEIEQVVGNKRVVEESDLCNLSYTQAVVK ETMRLHPGGPIFVRESDEECAVAGFRIPAKTRVIVNVWAIGRDSNQWEDPLEFRPERFEG SEWKVMSEKMMSFGAGRRSCPGEKMVFRFVPIILAAIIQCFELKVKGSVDMDEGTGSSLP RATPLVCVPVAKEATQSFSLLEPNVNF* AB007649 CYP94B1 510 aa GSS AL081067 BAC end F2E21 MEMLNAIILILFPIIGFVLIFSFPTKTLKAKTASPSNPTSYQLIGSILSFNKNRHRLLQW YTDLLRLSPSQTITVDLLFGRRTIITANPENVEHILKTNFYNFPKGKPFTDLLGDLLGGG IFNSDGELWSSQRKLASHEFTMRSLREFTFEILREEVQNRLIPVLSSAVDCGETVDFQEV LKRFAFDVVCKVSLGWDPDCLDLTRPVPELVKAFDVAAEISARRATEPVYAVWKVKRFLN VGSEKRMREAIKTVHLSVSEIIRAKKKSLDIGGDVSDKQDLLSRFLAAGHGEEAVRDSVI SFIMAGRDTTSAAMTWLFWLLSQNDDVETKILDELRNKGSLGLGFEDLREMSYTKACLCE AMRLYPPVAWDSKHAANDDILPDGTPLKKGDKVTYFPYGMGRMEKVWGKDWDEFKPNRWF EEEPSYGTKPVLKSVSSFKFPVFQAGPRVCIGKEMAFTQMKYVVGSVLSRFKIIPVCNNR PVFVPLLTAHMAGGLKVKIKRREQCDSMYI* CYP94B2 AC010797 clone F28J7 31152-32636 AC011664 clone F5A18 95514-97001 chromosome III no ESTs no GSSs 55% to 94B1 note this seq was seen on AC011624 clone T18B3 15623-16993, but it is missing from a revised version of that sequence AC011624.5 and it was probably wrong, since that required the gene to be in two different places on the same chromosome MEASTFILLLVLVLLLVSAGKHVIYSCRNSTPKTYPVIGCLISFYTNRNRLLDWYTELL TESPSRTVVIRRLAARRTVVTANPSNVEYILKTNFDNYPKGKPFTEILGDFLGN GIFNVDGNLWLKQRRLATHDFTPKSLREYVTVLRNEVEKELLAFLNAAAEDSQPFDLQE LLRRFTFNIVCIVFLGIDRCTLNPSSPVSEFDRAFQTASAVSAGRGSAPLSFVWKFKRLV GFGSEKELRKAVGEVHNCVDEIIRDKKRKPANQDFLSRLIVAGESDETVRDMVI SIIMAGRDTTSAVATRLFWLITGHEETEHDLVSEIRSVKEEITGGFDYESL KKLSLLKACLCEVMRLYPPVPWDSKHALTDDRLPDGTLVRAGDRVTYFPYGMGRMEELWG EDWDEFKPNRWAESYDKTCCRVLKKVNPFKFPVFQAGPRVCLGEEMAYVQMKYIVASILD 
RFEIEPIPTDKPDFVPMLTAHMAGGMQVRVHRRDPSLS CYP94B3 AL133315 Clone T08P19 from the EU Chr 3 project comp(10683-12101) 80% to 94B1 55% to 94B2 ESTs AI996574, H76099 MAFLLSFLILAFLITIIFFLSSSSTKKVQENTTYGPPSYPLIGSILSFNKNRHRLLQWYT ELLRLSPSQTILVPLLGNRRTIITTNPLNVEYILKTNFFNFPKGKPFTDLLGDLLGGGIF NVDGHSWSSQRKLASHEFSTRSLRSFAFEVLKDEVENRLVPVLSTAADVGTTVDLQDVLK RFAFDVVCKVSLGWDPDCLDLTRPVNPLVEAFDTAAEISARRATEPIYAVWKTKRVLNVG SERKLREAIRTVHVLVSEIVRAKKKSLEIGTGAEAKQDLLSRFLAAGHNGEAVRDMVISF IMAGRDTTSAAMTWLFWLLTENDDVERKILEEVDPLVSLGLGFEDLKEMAYTKACLCEAM RLYPPVSWDSKHAANDDVLPDGTRVKRGDKVTYFPYGMGRMETLWGTDSEEFNPNRWFDS EPGSTRPVLKPISPYKFPVFQAGPRVCVGKEMAFMQMKYVVGSVLSRFEIVPVNKDRPVF VPLLTAHMAGGLKVKIKRRSHILNNV* CYP94C1 AC005824 complement(68817..70304)gene="F15K20.21" 45% identical to 94A1 and 94B1 BE528245 M70D23STM developing seed cDNA clone 600037290R1 5' Length = 264 MLLIISFTIVSFFFIIIFSLFHLLFLQKLRYCNCEICHAYLTSSWKKDFINLSDWYTHLL RRSPTSTIKVHVLNSVITANPSNVEHILKTNFHNYPKGKQFSVILGDLLGRGIFNSDGDT WRFQRKLASLELGSVSVRVFAHEIVKTEIETRLLPILTSFSDNPGSVLDLQDVFRRFSFD TISKLSFGFDPDCLRLPFPISEFAVAFDTASLLSAKRALAPFPLLWKTKRLLRIGSEKKL QESINVINRLAGDLIKQRRLTGLMGKNDLISRFMAVVAEDDDEYLRDIVVSFLLAGRDTV AAGLTGFFWLLTRHPEVENRIREELDRVMGTGFDSVTARCDEMREMDYLHASLYESMRLF PPVQFDSKFALNDDVLSDGTFVNSGTRVTYHAYAMGRMDRIWGPDYEEFKPERWLDNEGK FRPENPVKYPVFQAGARVCIGKEMAIMEMKSIAVAIIRRFETRVASPETTETLRFAPGLT ATVNGGLPVMIQERS AC023279 comp(55362-56858) CYP94D1 formerly CYP86D1 March 6 2000 F12K21 chr I GSS B97734, AL084336 BAC end F8K5 85% identical to AQ010792 MALLIFIFLLCFPISIFFIFFTKKSSSEFGFKSYPIVGSFPGLVNNRHRFLDWTVETLSR CPTQTAIFRRPGKQQLIMTANPSNVEYMLKTKFESFPKGQQFTSVLEDFLGHGIFNSDGD MWWKQRKTASYEFSTKSLRDFVMSNVTVEINTRLVPVLVEAATTGKLIDLQDILERFAFD NICKLAFNVDCACLGHDGAVGVNFMRAFETAATIISQRFRSVASCAWRIKKKLNIGSERV LRESIATVHKFADEIVRNRIDQGRSSDHKEDLLSRFISKEEMNSPEILRDIVISFILAGR DTTSSALSWFFWLLSMHPEVEDKILQELNSIRARTGKRIGEVYGFEHLKMMNYLHAAITE SLRLYPPVPVDIKSCAEDNVLPDGTFVGKGWAITYNIFAMGRMESIWGKDCDRFDPERWI DETNGCFRGEDPSKFPAFHAGPRMCVGKDMAYIQMKSIVAAVLERFVVEVPGKERPEILL SMTLRIKGGLFARVQERS* 
AL163972.1|ATT5P19 CYP94D2 Arabidopsis thaliana DNA chromosome 3, BAC clone T5P19 Length = 101539 88% identical to 94D1 89747 MEHFILISLLCFPIVIFFIFFPKNSSSEFGFKSYPIVGSLPGLVNNRHRFLDWTVETLSR 89926 89927 CPTQTAIFRRPGKLQFVMTANPANVEYMLKTKFESFPKGERFISILEDFLGRGIFNSDGE 90106 90107 MWWKQRKTASYEFSTKSLRDFVMSNVTVEINTRLVPVLAE 90286 90287 AATNGKLIDLQDILERFAFDNICKLAFNVDSACLGDDGAAGVNFMQAFETAATIISQRFQ 90466 90467 SVISYSWKIKKKLNIGSERVLRESIMIVHKFADEIVRNRIEQGKVSDHKEDLLSRFISKE 90646 90647 EMNSPEILRDIVISFILAGRDTTSSALSWFFWLLSMHPEVKDKILQELNSIRERTGKRIG 90826 90827 EVYGFEDLKLMNYLHAAITESLRLYPPVPVDTMSCAEDNVLPDGTFIGKDWGISYNAYAM 91006 91007 GRMESIWGKDCDRFDPERWIDETNGGFRGENPYKFPAFHAGPRMCLGKEMAYIQMKSIVA 91186 91187 AVLERFVVEVPGKKERPEILMSVTLRIRGGLNVRVQERS 91303 CYP94D3P AC024226 comp(6890-7005) formerly 86D2P GSSs AL080732 F1L6, AQ010792 F20J3 Missing N-terminal 116 amino acids also seen on AC069553 in region 62377-62009 minus strand SDGKIWWKQRKTTSYEFSTKSLHEFVMSNVTVE (intron) SSRLVRVFEEAATTGKLIDIQDILERFAFDNICKIAFNVN(frame shift) IGDDRAAGVNFMRAFETVATIISQWFQSVVSYLWKIKKKLNIGSERVLRELIVTV (19 amino acid deletion compared to 94D1) HKEDLLSRFISKEEMNSLEILRDIVI (frame shift) SFILEGRDTTSSALSWFFWLLSMHPEVEDKILKELKLIRARTRKQIREVYGIEDLKLMNY LHAAITESLRLYPHVPVDTMSCVEDNVLLDGTFIGKSWRISYNAYAMGRMESIWGKDCER FDPERWIDETNGCLEVRILLSF(frame shift) FRGENPFKFPAF (Missing 8 amino acids in the heme binding signature sequence including Cys) GNGMASIQMKSIDADVLERFVVEAPGKKEHPDILLSITLRMKGGFFVRVQERS* AC002391 T20D16.19 CYP96A1 MALITLLEVSISLLFFSFLYGYFLISKKPHRSFLTNWPFLGMLPGLLVEIPRVYDFVTEL LEASNLTYPFKGPCFGGLDMLITVDPANIHHIMSSNFANYPKGTEFKKIFDVLGDGIFNA DSELWKDLRKSAQSMMTHQDFQRFTLRTIMSKLEKGLVPLLDYVAEKKQVVDLQDVFQRF TFDTSFVLATGVDPGCLSTEMPQIEFARALDEAEEAIFFRHVKPEIVWKMQRFIGFGDEL KMKKAHSTFDRVCSKCIASKRDEITNGVINIDSSSKDLLMCYMNVDTICHTTKYKLLNPS DDKFLRDMILSFMLAGRDTTSSALTWFFWLLSKNPKAITKIRQEINTQLSPRTNDFDSFN AQELNKLVYVHGALCEALRLYPPVPFQHKSPTKSDVLPSGHRVDASSKIVFCLYSLGRMK SVWGEDASEFKPERWISESGRLIHVPSFKFLSFNAGPRTCLGKEVAMTQMKTVAVKIIQN 
YEIKVVEGHKIEPVPSIILHMKHGLKVTVTKRSNLL* AL021636 F10N7.20 CYP96A2 AL021811 F10M6.190 intact at4g32170 MASISLLQVSVAILCFLILHYFLFKKPHGRYPRNWPVLRMLPAMLKALHRIYDYSVKILE TSDLTFPFKGPRFTGMDMLLTVDPANIHHIMSSNFSNYIKGPEFQDVFDVLGDSFITTDS ELWKNMRKSYQAMLHSQEFQRFSMSTMTSKLKYGLVPLLNHFAEEGTTLDLQSVFGRFTF DTIFILVTGSDPRSLSIEMPEDEFAKALDDVGEGILYRHFKPRFLWKLQNWIGFGQEKKL TEANATFDRVCAKYISAKREEIKRSQGTSNGGSQDLLTSFIKLDTTKYKLLNPSDDKFLR DNILAFILAGRDTTATALSWFFWLLSENPHVVAKIHQEININTDLSRTGNSQENVDKLVY LHGALCEAMRLYPPVSFGRKSPIKSDVLPSGHKVDANSKIIICLYALGRMRAVWGEDASQ FKPERWISENGGIKHEPSFKFLSFNAGPRTCLGKHLAMTQMKIVAVEILRNYDIKVLQGQ KIVPALGFILSMKHGLQITVTKRCSA AC004512 96A3 MAIVIGLLEIFTAFVFFIFFQCLLLHKKTPKPLLTNWPALGMLPGLLLQVPRIYDWITEV LEATDMTFCFKGPCLSGMDILLTVDPVNIHYILSSNFANYPKGMEFKKIFEVVGDSIFNV DSGLWEDMRNSSHAIFSNQDFQMFWVSTSVRKLRQGLVPILENAADKNILVDLQDLFQRF LFDTSLILMTGYDPKCLSVEMPKVEFGDAVDGVSDGVFYRHVKPVFLWRLQYLIGVGVEK RLKRGLAVFDQLLEKIITAKREEINSHGTHHPSRGEAIDVLTYYMTMDTTKYKYLEPSDD RFIKDTILGFLIAARDTTSSALTWFFWLMSKNPEAINKIRQEVNKKMPRFDPADLEKLVY LHGAVCETLRLYPPVPFNHKSPAKPDVLPSGHRVDEKWKIVISMYALGRMKSVWGDDAED FRPERWISDSGRLKHEPSYKFLAFNAGPRACLGKKLTFLQMKTVAAEIIRNYDIKVVEGH KTEPVPSVLFRMQHGLKVNITRI* AI999275 701555244 75% to 96A3 new P450 sequence DVLPSGHKVDANSKIVICIYALGRMRSVWGEDALDFKPERWISDNGGLRHEPSYKFMAFN SGPRTCLGKNLALLQMKMVALEIIRNYDFKVIEGHKVEPIPSILLRMKHGLKV AB019226 CYP96A4 78% identical to 96A3 B96393 GSS fragment MAMIIGLLEIFIAFIFFFVYQCFSLHKKTPKHMVMNWPVLGMLPGVLFQIPRIYDFVTEA LEAENMTGCFIGPWLSGTDILLTVDPVNIQYILSSNFVNYPKGKKFNKIFEFLGDGIFNV DSGLWEDMRNSSHAIFSHQDFQSFSVSTSVSKLSQGLVPILDNAVEKHILVDLQDLFQRF LFDTSSTLMAGYDPKSLSVEMPKVEFADAMDGVADAMFYRHLKPAFLWSIQSWIGVGIEK KMRRGLDVFDQMLGKIISAKREEIKNHGIHDSKGEAMDVLTYYMTIDTTKYKHLKPSNDK FIRDTILGLVIAARDTTSSALTWFFWLLSKNPEAMTKIRQEINKKMPKFDPADLDKLVYL DGAVCETLRLYPSVPFNHKSPAKPDVLPSGHKVDKNWRVVIPIYSLGRMKSVWGDDAEDF RPERWISDSGMLRQESSYKFLAFNAGPRTCLGKRLTFLQMKTVAVEIIRNYDIKVVEGHK PKPVPSVLLRMQHGLKVSVTKI B98666 GSS fragment 96% identical to 96A4 N-terminal MAMIIGLLEIFIAFMFVFVCQCFSLHKKTPKHMVMNWPVRGMLPGVLFQIRRIYDFATEA 
LEAENMTGCFIGPWLSGTDILLTVDPVNMXYILSSNFVNYPKGKKFNKIFE CYP96A5 AC007019 F7D8.23 MAYVGLVEVFIALLVFFFFHFLIHKKSHQITPRNWPVLGMLPGVLVMLHRINDYVAEILE VSNLTFAFKGPWFSGMNMLITADPSNIQHVFSSNFSNYDKGPEFKEMFDFLGNGIFTADS KLWEDMRKSALVVLSHQGFQSFSLRTITCKIKNGLVPVLDHFAEANTVFDLQDVFQRLAF DVTLTLVTGCDSSSLSIEMPKNEYAKAMDDAEEVVVYRHVKPVVLWKLQNWIGLGEEKKM KEANAAFDRSCAKYISAKREEIISHHSNIGGEAHAEDLLSVYMNLDISKYELLNPNDDNF LKDIIKSFMLAGRDAIATTLTWFFWLLSKNPEAVTKIRQEINTNLPGSGMSLDADKLNKM VYLHGALCESLRLYAPIPFERKTPIKQDVLPSGHMVDKNWKILFSVYALGRMRSVWGQDA SEFKPERWISERNGGLKHEPSFKFFVFNSGPRNCLGKNLSFLQMKTVAVEIIRNYDIKVV EGHKIEPASSIILHMKHGLKVTVSKRGLVS CYP96A6P AB025623 62% identical to 96A2 missing N-terminal PEVESAKALDTIVEGIMYRHVKPRFSWKLQSWIGVGMEKKMIEAGAIFDRVCGKYISARR EEVKRSQVNNNDHFIRDSHANLLTSHIKLDTTQYQLLDPINDKFLRDNVFALLLAGRDTT ASALTWFFWFLSENPLVVTKIRQEIDMNLPRSCSGQERPSCDPMEYLNKDVLKQDFGLYL WGGGGRMRAVWGEDALEFKPERWISETGELAFNAGSRICHGKQRAMVQMKIVAVEILQNY DIKVANGQKFEPDTSLILKMKHGFKVKINKRCSS* AC007519 48335-49896 CYP96A7 MASIGLYEAFIGFLCFLISFYFLVKKPFRYLLIKKTHQSYPWNWPVLGMLPGILLRLQRI YDYSVEFLENSNMTFQFKGPWFVGMDILATADPTNIHYIMSSNFSNYIKGPIFHEIFEAF GDGIINTDAELWRDWRNASQLIFNHQRYQNFSASTTKSKVNDGLVPLFNVDLQDVFQRFM FDITFIFLTGTDPRSLSIEMPEIEFAKALDDVGDVIVYRHITPRFVWKLQKWIGIGTEKK MSKANATLDRVCEKLIAAKREELRSQGIIDNANGDSEDLLTSHIMLDATKYELLNPNDDK FLRDFTVSFMAAGRDSTSSALTWFFWNLTENPNVLSKILQEINTNLPRTGSDQDMSSYLN KLVYLHGALNESMRLYPPIPFERKSPIKDDVLPSGHKVKSNLNIMIFIYAMGRMKTIWGE DAMEFKPERWLTSSGLRHEPSYKFFSFNAGPRTCLGKNLAMNLMKTVIVEILQNYEIKIV SGQKIEAKPGLVLHMKHGLKVTMTMKCSS AC007519 56507-58060 CYP96A8 MASIGLYEAFIGFLCFLISFYFLVKKPFSYLLIKKTLQSYPWNWPVLGMLPGVLLRLQRI YDCSVEVLENSNMTFQFKGPWFVGMDVLATVDPANIHHIMSSNFSNYIKGPIFHEIFEAF GDGIINTDAELWRDWRNASQLIFNHQRYQNFSASTTKTKVNDGLVPLFNHFANEEIVVDL EDVFQRFMYDITFIFITGTDPRSLSIEMPEVEFSKALDDVGDAIVHRHITPRFVWKLQKW IGIGTEKKMLKAHATFDRVCEKIIAAKREELGSQGITYNSNGEREDLLTSFIKLDATKYE VLKPSHDKFLRDFTIGFMAAGRDSTASTLTWFFWNLSKNPNVLTKILQEINTNLPRTGSD QDMSSYLNKLVYLHGALSESMRLYPPIPFQRKSPIKEDVLPSGHKVKSNINIMIFIYAMG RMKTIWGEDAMEFKPERWISETGGVRHEPSYKFLSFNAGPRTCLGKNLAMNLMKTVIVEI 
LQNYEIKIVSGQKIEPKPGLILHMKHGLKVTMTKKCSS CYP96A9 AL078620 54378-55928 F23K16.110 at4g39480 MALVSLLEISIAFFCFLIFRIFLISKKPHRSFLTNWPLLGMLPGLLTVLPRVYDFITEVL EDGNLNYLFIGPFLGGIDMLFTVDPANIHHIMSSNFANYPKGTEFKKLFDVLGDGIFNAD SDLWKDLRKSSQSMMNHPDFQRFSLATSLSKLEKGLVPLLDHVAKEKLVVDLQDVFQRFT FDTTFVLATGYDPGCLSVEMPEIEFARALDDAEEAIFYRHFKPEVVWKMQRLIGVGAELK LKRAHAIFDRVCSKCIASKRDEISQGIDSSSSKDLLMSSINVDTTKYKLLNPSDDRFLRD TILSFMLAGRDTTGSALTWFFWLLCNNQEAMTKIRQEINTNLFPRNKTDDGSVSYDSDSF NPQEVKKLVYLHGAVCEALRLYPPVPFNHKSPAKPDVLPSGHKVKANSRILFCLYSLGRM KSVWGEDAMEFKPERWISESGRSVHEPSYKFLSFNAGPRTCLGKEVAMTQMKTVAVKIIQ NYDINVVEGHKIKPAPSVILHMKHGLKVTVSKRCLS CYP96A10 AL078620 57169-58608 F23K16.120 at4g39490 MVFISLFEISIAFFCFLLFRHFLINKKTHRLCPTNWPFFG MIPGLLVEIHRVYDFITEILEVTNLTYPCTGPCFANLDMLVTVDPANIHHIMSSNFANYP KGPEFKKLFDILGDGIFNADSELWKDLRKSAQSMMMNPEFQKFSLATSLKKLEKGLVPLL DHVAKEKLAVDLQDMFQRFTFDTTFVLATGYDPGCLSVEMPEVEFARALDDAEEAIFFRH VKPEIFWRLQGLLGLGDEKKMTKARSTLDRVCSKYIAIKRDEVSRGTNNVDSHSKDLLTS YMNLDTTKYKLLNPSDERFLRDTILTFMLAGRDTTGSGLTWFFWLLIKNPEVIAKIRQEI NTALFQRSKVDDDASNNNDSDSFSPQELKKLVYLHGAICESLRLYPPVPFQHKSPTKPDV LPSGHKVDANSKILFCLYSLGRMKSVWGEDALEFKPERWISESGNSVHEPSYKFLSFNAG PRTCLGKEVAMMQMKSVAVKIIQNYEMKIVEGQQIEPAPSVILHMKHGLKVTVTKRCLV CYP96A11 AL078620 comp(58770-60179) F23K16.130 at4g39500 MASSISFLEASVAIFCFLILHYLFKTTTYNGRFPRNWPVLG MLPCLLVVLHRIYDYIVEILEISDLTFSFKGPWFAGMDTLLTVDPANIHHMMNSNFSNYI KGSDFKEVFDVFGDGIITTDSELWKNLRKSYQEMLHHQAFQSFSLSTTRSKLKDGLVPLL NHFAEEGTTVDLQDVLGRFTFDTILILITGSDPRSLSIEMHEDELAKALDVVGEGIFFRH VKPKFLWKLQNWMGFGHEKKMIEANATFDRVCAKYISDKRGEIIRSQRFSDISYGEPEDL LSSFMKLDTTKYNLLNPSDDKFLRDTILAFILAGRDTTASALTWFFWLLSENAQVVSKIR QEIINTNPSKNGNGQENLDKLVYLHGALCEAMRLYPPVSFGRKSPIKSDVLPSGHKVQAN SKIIICLYALGRMRAVWGDDALEFKPERWVSDKGSLRHEPSFKFLSFNSGPRTCLGKHLA MTQMKMVAVEILHNYEIKVIKGQKIKPVLGFILSMKHGLRITITKRCPY CYP96A12 AL078620 comp(60765-62291) F23K16.140 at4g39510 MASVSLLDVAIAIICLFFFYYMYFKKPHGQVFRNWPVIGMLPGFLMVLHRIYNFGVEALE MSHLTFLFKGPWFAEMDMLFTVDPANIHYILSSNFSNYTKGADFKEVFDVFGEMIFSSDS 
ELWKNQRKAAQFMLNHQGFQKLSLSATRSKLYDGLVPLFNQCCEEEKVVDLQQVFQRFTF DTTFFIVTGFDPKSLSIEMPEVEYAKALDDLGEGIFYRHIKPKFFWKLQNRFGLGQEKRM TEADATFDRVSAKYILAKREEIRSQGIDHHANGESEDLLTSHIKLDTTKYELLNPSDDKF LRDTILAFNLAGRDTTSSALSWFFWLLSENPQVVTKIRKEIIDKNISKDGRNGQENLDKL VYLHAALYESMRLYPPVAFQRKSPIKPDVLPSGHKVEANSVIIIFLFALGRMRAVWGEDA TEFKPERWVSESGGLRHAPSFKFLSFNAGPRTCPGKQLAMTLMKTVVVEILQNYDIDVIK GQKIEPEPGLMLHMKHGLRVTITKRCSA CYP96A13 NOTE FRAME SHIFT IN MID REGION AL162973.1|ATF9G14 Arabidopsis thaliana DNA chromosome 5, BAC clone F9G14 (ESSA project) Length = 104769 Query: 3 IVIGLLEIFTAFVFFIFFQCLLLHKKTPKPLLTNWPALGMLPGLLLQVPRIYDWITEVLE 62 +++ + +F A + F ++LH L NWP LGMLPGLLL+ RIYD+ EVLE Sbjct: 69530 LLMASISLFEASIAIFCF--IILHF-----FLGNWPVLGMLPGLLLEFHRIYDFSVEVLE 69688 Query: 63 ATDMTFCFKGPCLSGMDILLTVDPVNIHYILSSNFANYPKGMEFKKIFEVVGDSIFNV-D 121 +D+TF FKGP +GMD+L TVDP NIH+I+SSNF+NY KG +FK++F+V GD I D Sbjct: 69689 NSDLTFPFKGPWFTGMDMLFTVDPTNIHHIMSSNFSNYTKGPDFKQVFDVFGDGILTTDD 69868 Query: 122 SGLWEDMRNSSHAIFSNQDFQMFWVSTS 149 S LW++++ +S + ++Q FQ +ST+ Sbjct: 69869 SELWKNLKKASLVMLNHQGFQSLSMSTT 69952 Query: 153 LRQGLVPILENAADKNILVDLQDLFQRFLFDTSLILMTGY-DPKCLSVEMPKVEFGDAVD 211 L+ GLVP+ + A+ +VDLQD+F+RF+FDT+L+ +TG DP+ LS+EMP+VEF A+D Sbjct: 69960 LKDGLVPLFNHFAENGTVVDLQDVFKRFMFDTTLVTVTGSADPRSLSIEMPEVEFAKALD 70139 Query: 212 GVSDGVFYRHVKPVFLWRLQYLIGVGVEKRLKRGLAVFDQLLEKIITAKREEINSHGTHH 271 V +G+ +RHV+P LW+LQ +G G EK+ + A +Q K I KREE S G + Sbjct: 70140 HVGEGIMHRHVRPRLLWKLQKCVGFGQEKKFSKADATLNQACAKYILEKREETRSQGFDY 70319 Query: 272 PSRG-EAIDVLTYYMTMDTTKYKYLEPSDDRFIKDTILGFLIAARDTTSSALTWFFWLMS 330 S G E+ D+LTY++ +DTTKY+ L PSDD+F++DTIL F++A RDTT+SALTWFFWL+ Sbjct: 70320 HSNGSESEDILTYHIKIDTTKYELLNPSDDKFLRDTILAFVLAGRDTTASALTWFFWLLL 70499 Query: 331 KNPEAINKIRQEVN-----KKMPRFDPAD-LEKLVYLHGAVCETLRLYPPVPFNHKSPAK 384 +NP+ + KIRQE+N ++ P +P + L LVYLHGA+ E +RLYPPVPF SP K Sbjct: 70500 ENPQVVTKIRQEINTSNGGQEKPSCEPMEYLNNLVYLHGALYEAMRLYPPVPFERMSPIK 70679 Query: 385 PDVLPSGHRVDEKWKIVISMYALGRMKSVWGDDAEDFRPERWISDSGRLKHEPSYKFLAF 444 PDVLPSGH+VD 
KI+I +YALGRM++VWG+DA +F+PERW+S++ L+HEPS+KFLAF Sbjct: 70680 PDVLPSGHKVDSSMKILIFIYALGRMRAVWGEDASEFKPERWLSETTSLRHEPSFKFLAF 70859 Query: 445 NAGPRACLGKKLTFLQMKTVAAEIIRNYDIKVVEGH-KTEPVPSVLFRMQHGLKVNITR 502 NAGPR+C+GK+L MK V EI++NYDIKVV+G K EP P + RM+HGL+V +T+ Sbjct: 70860 NAGPRSCIGKQLAMTLMKIVVVEILQNYDIKVVKGQKKIEPAPGPILRMKHGLRVTLTK 71036 69536 MASISLFEASIAIFCFIILHFFLGNWPVLGMLPGLLLEFHRIYDFSVEVLE 69688 69689 NSDLTFPFKGPWFTGMDMLFTVDPTNIHHIMSSNFSNYTKGPDFKQVFDVFGDGILTTDD 69868 69869 SELWKNLKKASLVMLNHQGFQ note the translation in the genbank entry does not agree with this. They have a small intron and I have a frameshift instead. SLSMSTT 69952 69960 LKDGLVPLFNHFAE NGTVVDLQDVFKRFMFDTTLVTVTGSADPRSLSIEMPEVEFAKALD 70139 70140 HVGEGIMHRHVRPRLLWKLQKCVGFGQEKKFSKADATLNQACAKYILEKREETRSQGFDY 70319 70320 HSNGSESEDILTYHIKIDTTKYELLNPSDDKFLRDTILAFVLAGRDTTASALTWFFWLLL 70499 70500 ENPQVVTKIRQEINTSNGGQEKPSCEPMEYLNNLVYLHGALYEAMRLYPPVPFERMSPIK 70679 70680 PDVLPSGHKVDSSMKILIFIYALGRMRAVWGEDASEFKPERWLSETTSLRHEPSFKFLAF 70859 70860 NAGPRSCIGKQLAMTLMKIVVVEILQNYDIKVVKGQKKIEPAPGPILRMKHGLRVTLTKKC* 71045 CDS join(69536..69932,70000..71045) /gene="F9G14_210" /note="strong similarity to cytochrome P450 homolog, Arabidopsis thaliana, PIR:T09367" /codon_start=1 /product="cytochrome P450-like protein" /protein_id="CAB86044.1" /db_xref="GI:7413565" /translation="MASISLFEASIAIFCFIILHFFLGNWPVLGMLPGLLLEFHRIYD FSVEVLENSDLTFPFKGPWFTGMDMLFTVDPTNIHHIMSSNFSNYTKGPDFKQVFDVF GDGILTTDDSELWKNLKKASLVMLNHQGFQ K NGTVVDLQDVFKRFMFDTTLVTVTGSA DPRSLSIEMPEVEFAKALDHVGEGIMHRHVRPRLLWKLQKCVGFGQEKKFSKADATLN QACAKYILEKREETRSQGFDYHSNGSESEDILTYHIKIDTTKYELLNPSDDKFLRDTI LAFVLAGRDTTASALTWFFWLLLENPQVVTKIRQEINTSNGGQEKPSCEPMEYLNNLV YLHGALYEAMRLYPPVPFERMSPIKPDVLPSGHKVDSSMKILIFIYALGRMRAVWGED ASEFKPERWLSETTSLRHEPSFKFLAFNAGPRSCIGKQLAMTLMKIVVVEILQNYDIK VVKGQKKIEPAPGPILRMKHGLRVTLTKKC AC026480 CYP96A14P 9579-10061 Pseudogene fragment chr I clone F15E12 no more downstream seq in 80000 bp, GSSs: B22314 B20346 9579 
MALIGLIEAFIAFVCFLIFYYFLIKKPYSYILIKISQSGLWNWPVLGMSPGALMRLP RIYDFSVDLLESNLTFHFKGPWFAGIDILATADSVNINHIYYRGPELREIFGPFGDGIINSD SELWRNLKKATQVIFNHQKYQKFSTSTTRSKLKLGLVPLFN 10061 AC079733 CYP96A15 ESTs AV555110 AV564547 AV563099 AV567476 AV557432 AV563463 AV556553 AV566162 AV564064 AV566685 AV561889 AI999275 AV532082 76322 MAMLGFYVTFIFFLVCLFTYFFLQKKPQGQPILKNWPFLRMLPGMLHQIPRIYDWTVEVLEATNL 76128 76127 TFYFKGPWLSGTDMLFTADPRNIHHILSSNFGNYPKGPEFKKIFDVLGEGILTVDFELW 75951 75950 EEMRKSNHALFHNQDFIELSVSSNKSKLKEGLVPFLDNAAQKNIIIELQDVFQRFMFDT 75774 75773 SSILMTGYDPMSLSIEMLEVEFGEAADIGEEAIYYRHFKPVILWRLQNWIGIGLERK 75603 75602 MRTALATVNRMFAKIISSRRKEEISRAKTEPYSKDALTYYMNVDTSKYKLLKPNKDKFIR 75423 75422 DVIFSLVLAGRDTTSSVLTWFFWLLSKHPQVMAKLRHEINTKFDNEDLEKLVYL 75261 75260 HAALSESMRLYPPLPFNHKSPAKPDVLPSGHKVDANSKIVICIYALGRMRSVWGEDALDF 75081 75080 KPERWISDNGGLRHEPSYKFMAFNSGPRTCLGKNLALLQMKMVALEIIRNYDFKVIEGHK 74901 74900 VEPIPSILLRMKHGLKVTVTKKI* 74829 CYP97A3 AC079041 9278-12286 ESTs AI995768, AV560628 most exon boundaries supported by ESTs from Arabidopsis, Medicago truncatula, tomato and Glycine max. C-term agrees with ESTs but not genomic sequence. ESTs are right. This sequence is 82% to 97A1 from Cicer arietinum.
It was accidentally named CYP97D1 MAMAFPLSYTPTITVKPVTYSRRSNFVVFSSSSNGRDPLEENSVPNGVKSLEKLQ EEKRRAELSARIASGAFTVRKSS (phase 2 intron) FPSTVKNGLSKIGIPSNVLDFMFDWTGSDQDYPK VPEAKGSIQAVRNEAFFIPLYELFLTYGGIFRLTFGPK (PHASE 0 INTRON) SFLIVSDPSIAKHILKDNAKAYSK (PHASE 0 INTRON) GILAEILDFVMGKGLIPADGEIWRRRRRAIVPALHQK (PHASE 0 INTRON) YVAAMISLFGEASDRLCQKLDAAALKGEEVEMESLFSRLTLDIIGKAVFNYDFDSLTNDTGVIE (PHASE 0 INTRON) AVYTVLREAEDRSVSPIPVWDIPIWKDISPRQRKVATSLKLINDTLDDLIATCK (phase 0 intron) RMVEEEELQFHEEYMNERDPSILHFLLASGDDVI (phase 0 intron) VSSKQLRDDLMTMLIAGHETSAAVLTWTFYLLTT (phase 0 intron) EPSVVAKLQEE (phase 0 intron) VDSVIGDRFPTIQDMKKLKYTTRVMNE (phase 0 intron) SLRLYPQPPVLIRRSIDNDILGEYPIKR (phase 2 intron) GEDIFISVWNLHRSPLHWDDAEKFNPERWPLDGPNPNETNQNFS (phase 2 intron) YLPFGGGPRKCIGDMFASFE (Phase 0 intron) NVVAIAMLIRRFNFQIAPGAPP (phase 0 intron) VKMTTGATIHTTEGLKLTVTKRTKPLDIHPYRYFQWILTG* Z97337 CYP97B3 and B28642, B09856 GSS fragments at4g15110 MAFPAAATYPTHFQGGALHLGRTDHCLFGFYPQTISSVNSRRASVSIKCQSTEPKTNGNI LDNASNLLTNFLSGGSLGSMPTAEGSVSDLFGKPLFLSLYDWFLEHGGIYKLAFGPKAFV VISDPIIARHVLRENAFSYDKGVLAEILEPIMGKGLIPADLDTWKLRRRAITPAFHKLYL EAMVKVFSDCSEKMILKSEKLIREKETSSGEDTIELDLEAEFSSLALDIIGLSVFNYDFG SVTKESPVIKAVYGTLFEAEHRSTFYFPYWNFPPARWIVPRQRKFQSDLKIINDCLDGLI QNAKETRQETDVEKLQERDYTNLKDASLLRFLVDMRGVDIDDRQLRDDLMTMLIAGHETT AAVLTWAVFLLSQNPEKIRKAQAEIDAVLGQGPPTYESMKKLEYIRLIVVEVLRLFPQPP LLIRRTLKPETLPGGHKGEKEGHKVPKGTDIFISVYNLHRSPYFWDNPHDFEPERFLRTK ESNGIEGWAGFDPSRSPGALYPNEIIADFAFLPFGGGPRKCIGDQFALMESTVALAMLFQ KFDVELRGTPESVELVSGATIHAKNGMWCKLKRRSK* B67502 and B26976 GSS fragments C-helix region like 97C1 YDKGLVAEVSQFLFGDGFAVAGGDRWKARRKAVGPSLHR >emb|AL132958.1|ATT4D2 Arabidopsis thaliana DNA chromosome 3, BAC clone T4D2 Length = 92611 Identities = 19/26 (73%), Positives = 21/26 (80%) Frame = +2 Query: 3 YDKGLVAEVSQFLFGDGFAVAGGDRW 28 Y KGLVAEVS+FLFG GFA+A G W Sbjct: 17297 YAKGLVAEVSEFLFGSGFAIAEGPLW 17374 CYP97C1 clone T04D02 from the EU Chr 3 project 17165-19444 N-terminal similar to Glycine max EST AI736606 
MESSLFSPSSSSYSSLFTAKPTRLLSPKPKFTFSIRSSIEKPKP KLETNSSKSQSWVSPDWLTTLTRTLSSGKNDESGIPIANAKLDDVADLLGGALFLPLY KWMNEYGPIYRLAAGPRNFVIVSDPAIAKHVLRNYPKYAKGLVAEVSEFLFGSGFAIA EGPLWTARRRAVVPSLHRRY LSVIVERVFCKCAERLVENVAALCKRRNSCKYERNFSSQMTLDVIGLSLFNYNFDSLTTD SPVIEAVYTALKEAELRSTDLLPYWKIDALCKIVPRQVKAEKAVTLIRETVEDLIAKCKE IVEREGERINDEEYVNDADPSILRFLLASREEVSSVQLRDDLLSMLVAGHETTGSVLTWT LYLLSNSSALRKAQEEVDRVLEGRNPAFEDIKELKYITRCINESMRLYPHPPVLIRRAQV PDILPGNYKVNTGQDIMISVYNIHRSSEVWEKAEEFLPERFDIDGAIPNETNTDFKFIPF SGGPRKCVGDQFALMEAIVALAVFLQRLNVELVPDQTISMTTGATIHTTNVCQWIVYEGE PKVKTRIYVFMIIDWCEWTCFIVLLRNNHKKMNYGK* AC002409 CYP98A3 MSWFLIAVATIAAVVSYKLIQRLRYKFPPGPSPKPIVGNLYDIKPVRFRCYYEWAQSYGP IISVWIGSILNVVVSSAELAKEVLKEHDQKLADRHRNRSTEAFSRNGQDLIWADYGPHYV KVRKVCTLELFTPKRLESLRPIREDEVTAMVESVFRDCNLPENRAKGLQLRKYLGAVAFN NITRLAFGKRFMNAEGVVDEQGLEFKAIVSNGLKLGASLSIAEHIPWLRWMFPADEKAFA EHGARRDRLTRAIMEEHTLARQKSSGAKQHFVDALLTLKDQYDLSEDTIIGLLWDMITAG MDTTAITAEWAMAEMIKNPRVQQKVQEEFDRVVGLDRILTEADFSRLPYLQCVVKESFRL HPPTPLMLPHRSNADVKIGGYDIPKGSNVHVNVWAVARDPAVWKNPFEFRPERFLEEDVD MKGHDFRLLPFGAGRRVCPGAQLGINLVTSMMSHLLHHFVWTPPQGTKPEEIDMSENPGL VTYMRTPVQAVATPRLPSDLYKRVPYDM* CYP98A8 AC011765 comp(67700-69163) no ESTs no GSSs MDLLLISLTTIIIAAYMQNLRRRGSNIPPGPPTRFLVGNLHQLKPLWTQSFSEWSQTYGP IISVWLGSQLAVVVSSSDLAKQVLRDKDYQLCNRHRTARMTQNGSDLIWSDYGAHYVKMR KLCTLELFSLKSIECFRSMREMEVSSMVKSIFNDFMSDDQKPVVLRNYLDSVALNIVSRL VIGKTFEPKDGREFRSIVERETRLPGATKMLDYTVWLKRLSSWFTSDKAFMKHMARKRNW FKRAVMDEVYGGRDQKCFVQSLLELKEKDELTEETVMGLVWNMLTAGADTTAITIEWAMA EMIRCPTVKEKVQDELDSVVGSGRLMSDADIPKLPFLQCVLKEALRLHPPTPLMLPHKAS ESVQVGGYKVPKGATVYVNVQAIARDPANWSNPDEFRPERFLVEETDVKGQDFRVLPFGS GRRVCPAAQLSLNMMTLALGSLLHCFSWTSSTPREHIDMTEKPGLVCYMKAPLQALASSR LPQELYL* CYP98A9 AC011765 comp(70394-71887) GSS AL080882 EST AA651146 MIIYLISLLPIIVATLMLYQRWWRSNIPPGPKPKFLLGNLHQMKPLWTHSFSEWSETYGP IISVWIGSQLTVVVSSSDLARQVLRDKDHQLSNRHRIARMTQTGTDLVWSDYSPHYVKLR KLCTLELFSLKSIENFRSLREMEARSMVVSILKDLMSNSGDDQERKPVIVRKYLAAVVLN TISRLMIGKEFGSEEGKEFKAIVEKEHLLSGSGTILDHVWWLKWVSSWFFSDKEFLAHKD 
RRTKWFRGAIMVEEDIEIEDHRGFVRKLLVLKEQKELSEETVGGLVWNMLTAGADTTAVV IEWAMAEMIKCPTVQEKAQQELDSVVGSERLMTESDIPILPYLQCVVKEALRLHPSTPLM LPHKASETVWVGGYKVPKGATVYVNVQAIGRDPANWINPYEFRPERFLQEETDVKGRDFR VLPFGSGRRMCPAAQLSMNLMTLVMGNLLHCFSWSSPVPGERIDMSENPGLLCNMRTPLQ ALALPRAAARAIPLPLD* H36457 EST extreme C-terminal of 98A-like sequence 51% identical to 98A3 WSPPVPGERIDMXQXPGLLCNMRTPLQALALPRXPQXQYLYNKFQSM CYP701A3 Genbank AF047719-AF047721 GA3 GSS B61390 also AF149413 comp(107853-110058) BE528748 M72F06STM developing seed cDNA clone 600037919R1 5' Length = 269 MAFFSMISILLGFVISSFIFIFFFKKLLSFSRKNMSEVSTLPSVPVVPGFPVIGNLLQLK EKKPHKTFTRWSEIYGPIYSIKMGSSSLIVLNSTETAKEAMVTRFSSISTRKLSNALTVL TCDKSMVATSDYDDFHKLVKRCLLNGLLGANAQKRKRHYRDALIENVSSKLHAHARDHPQ EPVNFRAIFEHELFGVALKQAFGKDVESIYVKELGVTLSKDEIFKVLVHDMMEGAIDVDW RDFFPYLKWIPNKSFEARIQQKHKRRLAVMNALIQDRLKQNGSESDDDCYLNFLMSEAKT LTKEQIAILVWETIIETADTTLVTTEWAIYELAKHPSVQDRLCKEIQNVCGGEKFKEEQL SQVPYLNGVFHETLRKYSPAPLVPIRYAHEDTQIGGYHVPAGSEIAINIYGCNMDKKRWE RPEDWWPERFLDDGKYETSDLHKTMAFGAGKRVCAGALQASLMAGIAIGRLVQEFEWKLR DGEEENVDTYGLTSQKLYPLMAIINPRRS* AC001229 BAC F5I14.20 CYP702A1 also AC007234 comp(19575- 21432) MVEVYELLTVMVSLIVVKLFHWIYQSKNPKPNEKLPPGSMGFPIIGETFEFMKPHDAFQF PTFIKERIIRYGPIFRTSLFGAKVIISTDIELNMEIAKTNHAPGLTKSIAQLFGENNLFF QSKESHKHVRNLTFQLLGSQGLKLSVMQDIDLLTRTHMEEGARRGCLDVKEISSKILIEC LAKKVTGDMEPEAAKELALCWRCFPSGWFRFPLNLPGTGVYKMMKARKRMLHLLKETILK KRASGEELGEFFKIIFEGAETMSVDNAIEYIYTLFLLANETTPRILAATIKLISDNPKVM KELHREHEGIVRGKTEKETSITWEEYKSMTFTQMVINESLRITSTAPTVFRIFDHEFQVG SYKIPAGWIFMGYPNNHFNPKTYDDPLVFNPWRWEGKDLGAIVSRTYIPFGAGSRQCVGA EFAKLQMAIFIHHLSRDRWSMKIGTTILRNFVLMFPNGCEVQFLKDTEVDNSSGSNPDCC NG* Z97338 seq. (a) compl. 
95383-97405 CYP702A2 at4g15300 MVEFHELLTVMVSLFLVKIFHWVYQWRNPKTNGKLPPGSMGFPFIGETFEFFKPHDALQF STFIKDRVLRHGPVFRTSLFGDKAIISMDMELNLEMAKANSVPGVTKSVIRLFGENNLFL QSKESHKHVRNLTFQLLGPQGLKSRMIEDVDLLARTYMEEGARNGYLDVKETSSKILIGC LAKKVMGEMEPEAAKELALCWRYFQSGWFRFFLNLPGTGVYKMMKARKRMMKLLRKTVLT KRASGEELGEFFNIIFGEMEGEGETMSVENAVEYIYTFFLVANETTPRILAATVKFISDH PKVKQELQREHEEIVRGKAEKEGGLTWEDYKSMHFTQMVINESLRIISTAPTVLRVLEHD FQVGDYTIPAGWTFMGYPHIHFNSEKYEDPYAFNPWRWEGKDLGAIVSKTFIPFGAGRRL CVGAEFAKMQMAVFIHHLFRYRWSMKSGTTIIRSFMLMFPGGCDVQISEDTEVDK* Z97338 seq. (b) 101369-104695 CYP702A3 at4g15310 MSEIYVLALSVVFSLIVVKLCHWVYQWSNPKCKGKLPPGSMGFPIIGETFEFMTPFDISL VVSPYLKKRISRYGSKVFRTSLFGAKVIVSIDPDVNMEIAKASSQLRATESVTRIFGENN PFLQSKEIHKYVRNLTSRFVGPEGLKTRLIHDIDNLLRNDVENGARNGSFDVREATIKMV GELIAKKIMGETESEAVKELGLCWSAFRTSWFQFSYNIPGTTVYRLVKARRKAAKLLKAL ILKKKASKEGLGDFLDIIFDEMEKDGTALDIDKAVNLIFVFFILSQETTPGVQGAVVKLV ADHPSVMEELQREHEAIVQNRADKDTGVTWEEYKSMTFTHMVIKESLRFTSTQPTVHRIP DQDVQIGDYTLPAGWLFFGIPQVHFDEEKYDDPLTFNPWRWQGKDINSTVSREYMPFGAG GTHCVGSEFAKLIIAILLHHLSRFRWSLDPKTEVLRRYTLVFPAGCVVHISKESE* Z97338 seq. (d) 126846-126941 CYP702A4P lone exon MESGAVKELGLCWVDFRTNWFNFSYNTPGTTVYRVVKV* Z97338 seq. (h) 168806-171661 CYP702A5 and GSSs B24420, AQ249305 MVEVYEFWAVIVSLIVVKLCHWIYQWKNPKGNGKLPPGSMGYPIIGETFEFMKLHDAIQL PTFVKEKLLRHGPVFRTSLFGGKVIISTDIGLNMEIAKTNHIPGMPKSLERLFGATNLFV NKDTHKHARSLTNQFLGSQALKLRMIQDIDFLARTHMKEGARKGCLDVKETASKIVIECL SKKVMGEMEPEAAKELTLCWTFFPRDWFRFAWNFPGTGVYRIVKARNRMMKVIKETVVKK RASGKKLGEFFETIFGDTESVTMSIEIATEYIFTLFVLANETTPGVLAATIKLISDNPKV MQELRREHEGIVQDKIKKDETADLTWEDYKSMTFTQMVINESLRITSTVPTVLRIIDHEI QFGDYTIPAGWIFMGYPYVHFNPEKYDDPLAFNPWRWKGKDLSTIVSKTYLPFGSGTRLC VGAEFVKLQMAIFIHHLFRYRWSMKAETTLLRRFILVLPRGSDVQIS* Z97338 seq. 
(i) 172246-175058 CYP702A6 BE521486 M19H9STM developing seed cDNA clone M19H9 5' Length = 379 BE521487 M19H9XP developing seed cDNA clone M19H9 3' Length = 327 MVDLYEFWAVIVSLIVVKLCHSIYQWRNPKSNGELPPGSMGYPIIGETFEFMKPHDAIQL PTFVKEKVLRHGPVFRTSLFGGKVIISTDIGLNMEIAKTNHIPGMPKSLARLFGANNLFV NKDTHKHARSLTNQFLGSQALKLRMLQDIDFLVRTHLKEGARKGSLDIKETTSKIIIECL AKKVMGEMEPDAAKELTLCWTFFPREWFGFAWNIPGTGVYRMVKARNRMMKVLKETVLKK RASGEELGDFFKTIFGDTERGVKTISLESATEYIFTLFLLANETTPAVLAATIKLISDHP KVMQELQREHEGIVRDKIEKNEKADLTWEDYKSMTFTQMVINESLRITSTVPTVLRIIDH EFQFGEYTIPAGWIFMGYPYVHFNAEKYDDPLAFNPWRWKGKDLSAIVSRTYIPFGSGSR LCVGAEFVKLKMAIFIHHLSRYRWSMKTE