rice chr 1 P450s in annotated list in order on the chr [46 genes]

from http://rgp.dna.affrc.go.jp/rgp/complete-chr/chr1/chr1-complete.html

and details from:

http://ricegaas.dna.affrc.go.jp/chr1-bin/search_table.pl

 

This list is taken from the chromosome 1 annotations found by a

keyword search for P450.  Not all P450s on chr 1 were annotated in this table.

 

Of the 46 genes annotated here 12 agree 100% with my annotations

2 are fusions of two P450s, 2 more are fusions to other genes

71T2 711A2 and 711A3 are split into two genes each

 

Three pseudogenes are represented as intact genes by

creative splicing to avoid frameshifts and stop codons,

and an artificial choice of N and C-termini to finish the gene.

 

The pseudogenes CYP94D8P, CYP715B3P, 71AA1P, 71AA4P, 72A36P, 76H12P

are missed in this annotation, but the annotation does not

cover pseudogenes.

 

CYP734A6, CYP71AA3, CYP71C18, CYP71C19 are also missed

 

Gene No. : 1-1_001 CYP715B2P

8174..8624 , 8639..8844 , 8948..9019 (-)

>1-1_001

MAEGDEWARHRCIVAPAFSATNLNDMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVR

NAAEIIAKASFSIAADDATVFHK AAGDAVPLHAVPLASLLHIRADRATYEAWKLGRKIDA

LLLDIIESRRRCEGGGRKTTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPE

WRAAVREEVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEVVRG

KR

 

>AP003727.3 $P CYP715B2P chromosome 1 clone:P0672D08 Pseudogene fragment

missing N and C-terminal and part of I-helix 39% to 715A1

NRMPMFGRGRVMAEGDEWARHRCIVAPAFSATNLN

DMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVRNAAEIIAKASFSIAADDATVFHK (frameshift)

VRLVSVPLASLLHIRADRATYEAWKLGRKIDALLLDIIESRRRCEGGGRK

TTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPEWRAAVRE this is missing from AP004123

EVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEV

 

Gene No. : 1-2_239 CYP96D1 100% agreement

1216403..1217523 , 1217636..1218047 (+)

>1-2_239

MGPLWTFILLYPEIFLAIICFFWFSLFRPIRQRQKSNLPVNWPVFGMLPFLVQNLHYIHD

KVADVLREAGCTFMVSGPWFLNMNFLITCDPATVNHCFNANFKNYPKGSEFAEMFDILGD

GLLVADSESWEYQRRMAMYIFAARTFRSFAMSTITRKTGSVLLPYLDHMAKFGSEVELEG

VFMRFSLDVTYSTVFAADLDCLSVSSPIPVFGQATKEAEEAVLFRHVIPPSVWKLLRLLN

VGTEKKLTNAKVVIDQFIYEEIAKRKAQASDGLQGDILSMYMKWSIHESAHKQKDERFLR

DTAVGFIFAGKDLIAVTLTWFFYMMCKHPHVEARILQELKGLQSSTWPGDLHVFEWDTLR

SAIYLQAALLETLRLFPATPFEEKEALVDDVLPNGTKVSRNTRIIFSLYAMGRIEGIWGK

DCMEFKPERWVSKSGRLRHEPSYKFLSFNTGPRSCLGKELSLSNMKIIVASIIHNFKVEL

VEGHEVMPQSSVILHTQNGMMVRLKRRDAA

 

Gene No. : 1-2_240 CYP96E1

1220501..1221994 , 1226247..1226672 (+)

>1-2_240

MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPVVGALPAIVANAGRVHD

WVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANFGNYPKGEEFAAMFDVLGG

GIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGGGLVPLLDGVAASGAAVDLQD

VFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDAEEVLFYRHVAPVPWLRLQSYLK

IGHYKKMAKAREVLDASIAELIALRRERKAADANATGDADLLTAYLACQDEIGMDGAAFD

AFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPGVEARILAELRAHPPSPTGAELKRLVY

LHAALSESLRLYPPVPFEHKAAARPDTLPSGAAVGPTRRVIVSLYSMGRMEAVWGKGCEE

FRPERWLTPAGRFRHERSCKFAAFNVGPRTCLGRDLAFAQMKAVVAAVVPRFRVAAAAAP

PRPKLSIILHMRDGLKVK RRDPVQGGGGHRRGHHHEICRCPQSGSGELDKATAMAADEEE

EVAPNLVFVTIQLPPSSSSSPLKTTQQLDGEGEELIGVQPKEEDRRLEEEEGGGVAADLA

VSRGPSRQACRCTGQESRAVGRGRKQGERRPEGEGICRR

 

>AP002484b $F CYP96E1 CDS  80463..81980 43% to 96A1

MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPV

VGALPAIVANAGRVHDWVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANF

GNYPKGEEFAAMFDVLGGGIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGG

GLVPLLDGVAASGAAVDLQDVFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDA

EEVLFYRHVAPVPWLRLQSYLKIGHYKKMAKAREVLDASIAELIALRRERKAADANAT

GDADLLTAYLACQDEIGMDGAAFDAFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPG

VEARILAELRAHPPSPTGAELKRLVYLHAALSESLRLYPPVPFEHKAAARPDTLPSGA

AVGPTRRVIVSLYSMGRMEAVWGKGCEEFRPERWLTPAGRFRHERSCKFAAFNVGPRT

CLGRDLAFAQMKAVVAAVVPRFRVAAAAAPPRPKLSIILHMRDGLKVK VHRRQED*

 

 

Gene No. : 1-2_394 CYP90D2

2035743..2035993 , 2036078..2036405 , 2036438..2036531 , 2036850..2037055 , 2037375..2037623 , 2038901..2039086 , 2039597..2039718 , 2043035..2043131 (+)

 

>1-2_394

MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAAAAARLPPGSFGWPVVG

ETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVTADAEVSRFVLQSDARAFV

PWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHLKSQLTADMRRRLSPALSSFP

DSSLLHVQHLAKS LLDEIEWVDELEEEQSGWAWASAGVRAHVRAQMRMERNVIARNGDEM

QMQ VVFEILVRGLIGLEAGEEMQQLKQQFQEFIVGLMSLPIKLPGTRLYRSLQAKKKMAR

LIQRIIREKRARRAAASPPRDAIDVLIGDGSDELTDELISDNMIDLMIPAEDSVPVLITL

AVKFLSECPLALHQLE VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCVFVYFRSVH

LDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIFLHHLVTSFRW

VAEEDHIVNFPTVRLKRGMPIRVTAKEDDD

 

>AP003244 $F CYP90D2 59% to 90D1

CDS join(30874..31124,31209..31536,32037..32186,32506..32754,

33832..33921,34032..34217,34728..34849,38166..38262)

AQ157843 64% identical to AQ290163 75% TO 90C1 AT HEME BINDING REGION

C97894 Rice callus Oryza sativa cDNA clone C0085_11A, mRNA sequence

extreme C-term 71% to CYP90C1 opp end = C97895 (probably 3 prime untranslated)

MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAA

AAARLPPGSFGWPVVGETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVT

ADAEVSRFVLQSDARAFVPWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHL

KSQLTADMRRRLSPALSSFPDSSLLHVQHLAKS VVFEILVRGLIGLEAGEEMQQLKQQ

FQEFIVGLMSLPIKLPGTRLYRSLQAKKKMARLIQRIIREKRARRAAASPPRDAIDVL

IGDGSDELTDELISDNMIDLMIPAEDSVPVLITLAVKFLSECPLALHQLE EENIQLKR

RKTDMGETLQWTDYMSLSFTQH VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCV

FVYFRSVHLDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIF

LHHLVTSFRWVAEEDHIVNFPTVRLKRGMPIRVTAKEDDD

 

Gene No. : 12_245 CYP94E2 100% agreement

1349356..1350990 (-)

>12_245

MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLD

WATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLG

RGLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQ

DVLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAM

RLANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGG

ELAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSR

HADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAV

RAGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRACLGR

EMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKREDDAAQ

QKLT

 

>AP003735.2b $F CYP94E2 genomic DNA, chromosome 1, BAC clone:B1147A04, complete

61% to AP003735 4872-6534

8263 MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLDW 8445

8446 ATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLGR 8625

8626 GLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQD 8805

8806 VLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAMR 8985

8986 LANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGGE 9165

9166 LAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSRH 9345

9346 ADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAVR 9525

9526 AGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRAC 9693

9694 LGREMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKR 9864

     EDDAAQQKLT* 9897

 

Gene No. : 12_246 CYP94E1 100% agreement

1352719..1354368 (-)

>12_246

MDSSYLHLVLPAAAAAVVVAVVLLLSLWRRCQTTSNHRPQANPILGNLVAFLANGHRFLD

WSTGLLAAAPASTMQVHGPLGLGYCGVATASPDAVEHMLRASFHNYVDKGDRVRDAFADL

LGDGLFLANGRLWRLQRKLAASSFSPRLLRLFAGRVVLDQLRRRLLLFFDAAADARRVFD

LQDVLKRFAFDNICSVAFGVDRDDSSPSSSSPSRLEAGGDGRDDAFFAAFDDAIDISFGR

ILHPTTLAWKAMKLLDVGSERRLRQAIGVVDEYVTAIMESKQRCSDSEEESDLLSRFTAA

MMEEDGGNELGAMFDSPEAKRRFLRDTVKTFVLAGKDTTSSALTWLFWFLAANPECERRV

YEEVTALRGDTAGDERDDGYEELKRMHYLHAAITETMRLYPPVPLASRVAAADDVLPDGT

VVRAGWFADYSSYAMGRMPQLWERDCGEFRPERWLDGGGGGGGRFVAVDAARYPVFHAGP

RSCLGKEMAYVQMKAVAAAVVRRFSVEVVPAAAANAPPSPPPHETAVTLRMKGGLRVLLT

RRRGVLSHA

 

Gene No. : 12_317 CYP71AA2 100% agreement

1719193..1720104 , 1720260..1720934 (+)

>12_317

MAGIMDSTTASYYTTLLCGALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHC

LLGSLPHHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTAS

IDIVFAPFGKHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVS

ELVKIMTNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSL

RTTKRVHEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVS

AVLFDLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHY

LQLVIKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNE

FRPERFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETD

THELDMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE

 

this pseudogene not annotated

>AP004326.2d $P CYP71AA1P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence Gene 4 pseudogene

81031 DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift

81213 QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift

81304  KNTAIFVNTWALGR 81345 frameshift

81344 KIKNTGLMQVSSGLKY

81393 SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566

81567 LDMTEANGITTHRRIDIWLEATPFVPR 81647

 

this gene not annotated

>AP004326.2b $F CYP71AA3 genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Gene 2 no good matches in NR 79% to AP004326.2c

71860 MAGIVDTAAFCT

71896 LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069

72070 YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243

72244 SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408

72409 MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588

72589 VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747

72748 RFHRDGGLGITLTKEIVSAVLF 72813 (0)

73327 DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497

73498 QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677

73678 RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857

73858 CKLDMRETHGVTARRRTELLLKATPLYT* 73944

 

this pseudogene not annotated

>AP004326.2a $P CYP71AA4P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Length = 102983

4 genes 71B like

Gene 1 pseudogene 71 family

67989 LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion

68060 RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227

68228 GARP 68239 frameshift with small deletion

68238 RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396

68397 DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)

68886 GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion

69023 YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202

69203 F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382

69383 RMELDMTESAGLTASRLTDLFG* 69451

 

Gene No. : 1-3_149 CYP710A5 this seq has long N-term

677956..679547 , 683244..683265 (-)

>1-3_149

MAIPGSKEHKCNCLSRSSRAFSTPRT MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALY

MLIEQLSYHRKKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFG

RFTVFIRDSELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNF

TPRALSTYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTE

KARERFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEP

ECLLDYLMQETVREIDEATAAGL PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL

DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIAVE

AFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLAFGAG

PHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYLKQR

 

>AP002092a $F CYP710A5 CDS complement(54602..56137) 60% TO 710A1

THIS SEQ IS THE SAME AS AP002093a CDS complement(98133..99668)

MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALYMLI EQLSYHR

KKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFGRFTVFIRD

SELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNFTPRALS

TYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTEKARE

RFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEPEC

LLDYLMQETVREI

DEATAAGL this may be too long vs arab. Check for intron

PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL

DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIA

VEAFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLA

FGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYL

KQR*

 

Gene No. : 1-3_150 CYP710A6 100% agreement

684165..685703 (-)

>1-3_150

MVESFHGLVVVDLRTAAPLLATAVALYILIEQLSYHRKKGSMPGPPLVVVPFLGSVTHLF

RDPVGFWDLQATRASKSGAGLTADFLFGRLMVFIRDSELSRRVFANVRADAFHLVGHPFG

KKLFGDHNLIYMVGKEHKDLRRRIAPNFTPRALSTYAVIQQRVILSHLRRWIDRSVANGG

KAEPIRVPCRDMNLETSQTVFVGPYLTVETRERFDRDYNLFNHGFITLPIDLPGSAFRRA

RLAVPRLKHILEDCARQSKQRMRGGGEPECLVDYLMQETVREIDEAAAAGLPPPPHTSDM

ETGNLLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVAALWSPESGEPITAEMMT

EMKYTQAVAREVVRYWPPGPVVPHIAGEAFQLTEQYTIPKGTIVFPSVYESSFQGFPDAG

TFDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALLASLIDFRRERT

EGCDVPVYMPTIVPRDGCVVHLKQRCAKLPSF

 

Gene No. : 1-3_155 CYP710A7 100% agreement

704151..705665 (-)

>1-3_155

MVDSLLYGLLDLRMAAPLLAAAVALYVLVEQLSYHRKKGSLPGPPLVVPFIGSATHMIRD

PTGFWEMQAARARKSGVGFTADFLAGKFTIFIRDSELSNRVFANVRPDAFFVIGHPFGKK

LFGDHNLIYLFGDDHKDLRRRMATNFTPRALSTYAAIQQRGIVSHLRRWLDRSAANGGKA

EPIRVPCRDMNLETSQTVFAGPYLTEEARERFKSDYNLFNVGLLAFPVDLPGLAFRRARQ

AVARLVRMLRDCARESKARMRAGGEPECLVDYWMQETVREIDEAKAAGLPPPAHISDDEE

IGGFLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVASLWSPDSGEPITADKIAE

MKYTKAVAREVVRHRPPATLMPHIALQNFQLTESYTIPKGTLVLPSMYESSFQGFHDPDA

FDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTE

GCDVPVYMPTMVPRDGCVVYLKQR

 

Gene No. : 1-3_159 CYP710A8 100% agreement

723800..725326 (-)

>1-3_159

MAAVVDFLDLRAAAPFVVAALAFYFLVEQLSYHRKKGPLPGPPLVVPFVGSVAHMIRDPT

GFWDAQAARARKSGAGLAADFLIGRFVVFIRDSELSHRVFANVRPDAFHLIGHPFGKKLF

GDHNLIYMFGEDHKDLRRRIAPNFTPRALSTYAAIQQRVILSHLRRWLDRSAANGGKAEP

IRVPCRDMNLETSQTVFAGPYLTKEAREKFERDYNFFNVGLMALPVDLPGFAFRSARLGV

ARLVRTLGECARASKARMRAGGEPECLVDFWMQETVREIDEAKAAGKPPPAHTDDEELGG

FLFDFLFAAQDASTSSLCWAVSALDSHPDVLAGVRAEVASLWSPESGEPITAEKIAEMKY

TQAVAREVVRHRPPATLVPHIAGEEFQLTEWYTIPKGTIVFPSVYESSFQGFPEPDTFDP

ERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVLFMALFVSVVDFRRDRTEGCD

EPVYMPTIVPRDSCTVYLKQRCAKFPSF

 

Gene No. : 1-3_343 CYP71T1 100% agreement

1694984..1695988 , 1696589..1697260 (+)

>1-3_343

MELSSSLAAVLHSPLFLLAALLLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGH

LPLLGSLPHRKLRSMAEAHGPVMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRM

AERLIYGRDMVFAPYGEFWRQARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGV

RGGGETVNLSDLLMSYANGVISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGE

FVPWLAWVDKLMGLDAKAARISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDH

RDFVDVLLDVSEVEEGAGAGEVLLFDTVAIKAIILDMIAAATDTTFTTLEWAMAELINHP

PVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELRLLRAVVKETLRLHAPVPLLVPRETVE

DTELLGYRVPARTRVIINVWAIGRDAAAWGDRAEEFVPERWLDGGGEEVEYAAQLGQDFR

FVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDWELPPHADGAAAATAARLDMGELFGLS

MRMKTTLNLVAKPWSSDV

 

Gene No. : 1-3_344 CYP71T2 exon 1 only + C-term extension

1699829..1700905 (+)

>1-3_344

MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLPLPPSPPGVPLLGH

LPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRTRDLAFASRPRVRM

SERLFYGRDMAFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQEVAALLDRVRRRCGG

GGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFADFEGLLGTMTVGEF

VPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQAVGDGEADADHRDFVD

VMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL VRTPLVVVLLTCRSADATVDYFLWNQT

 

Gene No. : 1-3_345 CYP71T2 exon 2 only + N-term extension

1702184..1702900 (+)

>1-3_345

MNERFIEQ DMMAAGTDSSFTTTEWVMAELINHPRVMRKLQDEIRAVVGTSSASAAAAATG

GGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVEDTELLGYRIPARTRVIINVWA

IGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDSRFVPFGAGRRGCPGAGFAALS

VELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLSVRLKADLNLVAKPWSPGAS

 

>AP003434.1b $F CYP71T2 chromosome 1, PAC clone:P0452F10, complete = AA754300

AA754300      42% IDENTICAL TO 71A14   1/98 I-HELIX 43% to 703A2

39698 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP

39839 LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018

40019 RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195

40196 VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375

40376 DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552

40553 VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)

42074 DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169

42170 QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349

42350 DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529

42530 RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709

42710 VRLKADLNLVAKPWSPGAS* 42769

 

Gene No. : 1-3_346 CYP71T3 100% agreement

1708142..1709101 , 1711273..1711902 (+)

>1-3_346

MAVSLVVVVVVVIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHL

LGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAER

LLYGGRDVAFAPYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVD

LVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLG

WVDALNGMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVN

ETDMDAGVQLGTIEIKAIILDMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVG

ITSHITEDHLDRLPYLKAVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAW

TIGRDQATWGEHAEEFIPERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALA

SLLYNFDWETRVVDRRSEFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP

 

Gene No. : 1-3_348 CYP71T4 extension of exon 2 not correct

1718250..1719233 , 1719660..1720331 (+)

>1-3_348

MAVSLLPAVLVLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLP

LLGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRP

RMAMAELLLYGGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVR

AAAADVVVDLSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEP

MGELLPWFWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDF

VDVLLDVNETDKDAGIQLGTVEIKAIIM LICFLLHGHEQ DMFVGGSDTTTTMMAWTMAEL

INHPRAMRKAQNEIWAVVGNTSHVTKDHVDKLPYLKAVFKETLRLHPPLPLLIPREPPAD

TQILGYTIPAHTRVVINAWAIGRDAAAWGQQPDEFSPEKFLNSTIDYKGQDFELLPFGAG

RRGCPGIVFGVSAMEIALASLLYHFDWEAAATDHRRRGSQAWALPVDMSEVNGIAVHLKY

GLHVVAKPRMP

 

>AP003434.1d $F CYP71T4 chromosome 1, PAC clone:P0452F10, complete like 71A

58119 MAVSLLPAVL

58149 VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328

58329 LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508

58509 GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685

58686 LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859

58860 FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036

59037 VNETDKDAGIQLGTVEIKAIIM 59102 (0)

59562 DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717

59718 LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897

59898 PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077

60078 TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200

 

Gene No. : 3-2_172 CYP709D1 missing end of exon 4

907473..907752 , 907912..908135 , 908724..908977 , 909301..909613 ,

910180..910611 (+)

>3-2_172

MDVPSVVIPILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRM

KAAAAADEVAAGAHSHDFIPIVLPQHSKWATDHGKTFLYWLGAVPAVSLGRVEQVKQVLL

ERTGSFTKNYMNANLEALLGKGLILANGEDWERHRKVVHPAFNHDKLKFMSVVMAESVES

MVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKEVYQAQKELQELAF

SSSLDVPALVFLRGNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARA

LEREGNGLVLTTQEIIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLTV NM

VLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEFNPA

RFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVHAPM

EAITLRPRFGLPVVLRNLQG

 

>AP003258.2 $F CYP709D1 genomic DNA, chr 1, PAC clone:P0463A02, complete 46% to 709B2

N-term runs off end of contig identical to AP003764.2 (has N-term)

       MLKSTIELYIFTTAIAKKSLHSQTKHKSKMDVPSVVIP

151039 ILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRMKAAAAADEVA 150857

150856 AGAHSHDFIPIVLPQHSKWATDHG (1)

       KTFLYWLGAVPAVSLGRVEQVKQVLLERTGSFTKNYMNANLEA 150497

150496 LLGKGLILANGEDWERHRKVVHPAFNHDKLK 150404 (0)

149815 FMSVVMAESVESMVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKE 149639

149638 VYQAQKELQELAFSSSLDVPALVFLR 149561 (2)

149237 GNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARALEREGNGLVLTTQE 149055

149054 IIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLT VCGDAIPTPDMANRLKL 148875 (0)

148359 VNMVLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEF 148180

148179 NPARFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVH 148000

147999 APMEAITLRPRFGLPVVLRNLQG* 147928

 

Gene No. : 4_108 CYP71K1 100% agreement

580030..580668 , 580739..581656 (-)

>4_108

MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWALPVIGHLHHVAGALPHR

AMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAFATRPITPTGKVLMADSVG

VVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAVAALTTPGAT

AAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLERRMKLLPAQCLPDLFPSSRAAML

VSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAEEDLLDVLLRIQSQDKTNPALTNDN

IKTVIIDMFVASSETAATSLQWTMSELMRNPRVMRKAQDEVRRALAIAGQDGVTEESLRD

LPYLHLVIKESLRLHPPVTMLLPRECRETCRVMGFDVPEGVMVLVNAWAIGRDPAHWDSP

EEFAPERFEGVGAADFKGTDFEYIPFGAGRRMCPGMAFGLANMELALAALLYHFDWELPG

GMLPGELDMTEALGLTTRRCSDLLLVPALRVPLRDHER

 

Gene No. : 8-1_133 CYP76M14 100% agreement

696859..698433 (-)

>8-1_133

MEKSSELWLLWAVFSASLVFLYLTIRRRSGAGAGGKPPLPPGPTPLPLIGNLLDLRGGVI

HDKLAALARVYGPVMMIKLGLNDAVIISSRDAAREAFTRYDRHLAARAIPDTFRANGFHE

RSAVFLPSSDERWKALRGIQGTHIFTPRGLAAVRPVRERKVRDIIAYFRDHAGEELVIRQ

AIHTGVLNLVSSSFFSMDIAGMGSETARELREHVDEIMTVFAQPNVSDYFPFLRRLDLQG

LRRSTKRRFDRIFSILDDIVERRLVDRGERGGEGGASSNSSKSKHQYDGGDFLDALLELM

VTGKMERDDVTAMLFEAFVAGGDTVAFTLEWVMADLLRNPPVMAKLRAELDDVLGGKDQS

AIEEHDAGRLPYLQAVLKESMRLHSVGPLLHHFAAEDGVVVGGYAVPRGATVLFNTRAIM

RDPAAWERPEEFAPERFLAREGKAPVDFRGKEADFIPFGSGRRLCPGIPLAERVMPYILA

LMLREFEWRLPDGVSPEELDVSEKFMSVNVLAVPLKAVPVKVIN

 

Gene No. : 8-1_562 CYP72A31P pseudogene, not intact

3057089..3057514 , 3057625..3058009 , 3058183..3058256 (-)

>8-1_562

MLYTPYHKEMYMSVLLTSHGSNLPM SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEG

ESTKDDLLGILLESNTKHMEENGQSSQGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLL

SIHPEWQDHAREEIMGLFRKNKPDYEGLSRLKIVTMIFYEVLRLHPPFIEIGWKTYKEME

IGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEFKPERFSEGISKASKDPGAFLPFGWGPR

ICIGQNFALLESKMALCLILQRLEFELAPSYTHAPHTMVTLHPMHGAQMKVRAI

 

>AP003278 $P CYP72A31P chromosome 1, PAC clone:P0518F01, similar to 72A22 missing N-term half

AP003330.1 chromosome 1 clone B1085F01 CYP72A like

Pseudogene, no N-term in 9000bp upstream until next p450 ends near 22400

31539 SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEGESTKDDLLGILLESNTKHMEENGQSS 31718

31719 QGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLLSIHPEWQDHAREEIMGLFRKNKPDYE 31898

31899 GLSRLKI

32030 VTMIFYEVLRLHPPFIEIGWKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEF 32209

32210 KPERFSEGISKASKDPGAFLPFGWGPRICIGQNFALLESKMALCLILQRLEFELAPSYTH 32389

32390 APHTMVTLHPMHGAQMKVRAI 32452 or frameshift after KVR to

      SYMIISDYSVFYYYNSWL* (compare with end of 72A33)

 

Gene No. : 8-1_564 CYP72A32 end of gene is incorrect, missing heme signature

3066867..3066964 , 3067301..3067529 , 3067638..3068022 , 3068485..3068729 , 3069130..3069681 (-)

>8-1_564

MVLGGWLLMWAPASSPTILVAFGLLFGLVLAWQAGLQLHRLWWRPRRLEKALRARGLRGS

SYRFLTGDLAEESRRRKEAWARPLPLRCHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGP

TPEVHVTDPELAKVVMSNKFGHFEKIRFQALSKLLPQGLSYHEGEKWAKHRRILNPAFQL

EKLKLMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR

RIFELQGELFERVMKSVEKIFIPGYMYLPTENNRKMHQINKEIESILRSMIGKRMQAMKE

GESTKDDLLGILLESNMRHTEENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILL

LSMHPEWQDRARKEILGLFGKNKPEYDGLNNLKIVTMILYEVLRLYPPFIELKRRTYKEM

KIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGISKASKDP VYGVIDFCDT

FDRLSYPLRFLMYDMVNFLQCV

 

>AP003278a $F CYP72A32 19863-22437 chromosome 1, PAC clone:P0518F01, similar to 72A22

AP003330.1 50023-47446 chromosome 1 clone B1085F01, CYP72A like 536aa

AP004738.1 Oryza sativa chromosome 6 clone OSJNBa0090D06 chrom. conflict

50023 MVLGGWLLMWAPASSPTILVAFGLLFG

49942 LVLAWQ AGLQLHRLWWRPRRLEKALRARGLRGSSYRFLTGDLAEESRRRKEAWARPLPLR 49763

49762 CHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGPTPEVHVTDPELAKVVMSNKFGHFEKIR 49583

49582 FQALSKLLPQGLSYHEGEKWAKHRRILNPAFQLEKLK 49472 (0)

49071 LMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR 48904

48903 RIFELQGELFERVMKSVEKIFIPGYM 48826 (2)

48363 YLPTENNRKMHQINKEIESILRSMIGKRMQAMKEGESTKDDLLGILLESNMRHT 48202

48201 EENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILLLSMHPEWQDRARKEILGLFG 48022

48021 KNKPEYDGLNNLKI (0)

      VTMILYEVLR 47842

47841 LYPPFIELKRRTYKEMKIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGIS 47662

47661 KASKDP GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPTYTHAPHTMITLHP 47482

47481 MHGAQIKIRAI* 47446

 

Gene No. : 8-1_565 CYP72A33 end is wrong and two exon boundaries disagree

3071048..3071076 , 3075238..3075367 , 3075707..3075777 , 3076421..3076649 , 3076758..3077142 , 3077951..3078246 , 3078684..3079205 (-)

>8-1_565

MWAPASSPTILAAFGLVGLVLAWQ AGLQLHRLWWRPRRLEKALRARGLRGSRYRFLTGDL

AEEGRRRKEAWARPLPLRCHDIAPRVEPFLHGAVGVGAAHGKPRITWFGPTPEVHVADPE

LARVVLSNKFGHFEKVSFPELSKLIPQGLSAHEGEKWAKHRRILNPVFQLEKLK SILFLY

LIIEMSSENVQ LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFG

SSYLEGRRIFELQGELFERVIKSIQKMFIPG YM YLPTENNRKMHQMNKEIESILRGMIGK

RMQAMKEGESTKDDLLGILLESNTRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVL

LTWTMLLLSMHPEWQDRAREEILGLFGKNKPDYDGLSRLKIVTMILYEVLRLYPPFIELT

RKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPERFSEGISKASKDP VEV

PRRMEIHRSFDLPRDNIQMSQNRSPCGAPPSSSYRQIRYPEQLADEPIRLSPPDGNGDIL

DLRGIKLEHFASS

 

>AP003278b $F CYP72A33 chromosome 1, PAC clone:P0518F01, 82% to 72A22

AP003330.1 59493-56536 chromosome 1 clone B1085F01, CYP72A like 516aa

N-term does not match in both, 3278 has MVLGGGWLSMWAPASSPTILAAFGLVGLVLAWQ

before the AGLQ seq.

59493 MVLEGK AGLQLHRLWWRPRRLEKALRARGLRGSRYRFL

      TGDLAEEGRRRKEAWARPLPLRCHDIAPRVEP 59284

59283 FLHGAVGVGAAHGKPRITWFGPTPEVHVADPELARVVLSNKFGHFEKVSFPELSKLIPQG 59104

59103 LSAHEGEKWAKHRRILNPVFQLEKLK 59026 (0)

58537 LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFGSSYLEG 58373

58372 RRIFELQGELFERVIKSIQKMFIPG 58298 (2)

57483 YLPTENNRKMHQMNKEIESILRGMIGKRMQAMKEGESTKDDLLGILLESN 57334

57333 TRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVLLTWTMLLLSMHPEWQDRAREEIL 57154

57153 GLFGKNKPDYDGLSRLKI (0) VTMIL 56977

56976 YEVLRLYPPFIELTRKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPER 56800

56799 FSEGISKASKDP GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELATSYTHVPHT 56620

56619 IISLHPMHGAQIKVKSYMTISDYSVFY* 56536

 

 

Gene No. : 8-2_301 CYP72A17 N-term extension, C-term is wrong,

missing middle exons 2 and 3

1594648..1594915 , 1598386..1598464 , 1598535..1598717 , 1601636..1601742 , 1601972..1602084 , 1602475..1602841 , 1604468..1604841 , 1604937..1605347 , 1605464..1605643 (+)

>8-2_301

MEPSTRTRRLQPNRAPLGRYGEGGSRRIRRRRGDQNRILEAIPRWRGARGFVQQQQHQEK

GGGFGGEERRRQERRRQQEQKGNSTSSMGFLSTCAYGYLGRVDLQNSVHQSTCTVSTTSS

ASSHICFLYINLRIMSISIENDVNDNCDRNSNGGNGNGSIGNDNINTTTSESRAFCGFSF

LRPYRAVRYLRDLQPYILSSIQSASRVPPPEAPPSLPACSFGRSRPPPSLISNLSTVDHA

DAGDASPAYKREKEA MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAA

QMLEWAWLAPRRMERALRAQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPR

VAPLLHRALEEH ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMD

YYSDEDGKSSKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQ

VFGRNKPDINGVSRLKV VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPV

LFIHRDAAAWGHDAGEFDPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKV

ALGMILQRFAFELSPAYAHAPYTVLTLHPQHGVP NTFADKHWKLHVPGIRHSEISISDMK

AEHYLLTTAAAVPCIVEVYEYPYSISRVFSWSS

 

>CYP72A17 $F AP002839 Oryza sativa genomic DNA, chromosome 1 36553-39431

AG025591.1 strain ND3008 PCR from rice genomic DNA clone T8121T.Length = 401

AG025107.1 strain NC2542 PCR from rice genomic DNA clone T5184T.Length = 504

AU071192 very similar to AQ050520 = 72A17

AP002744 CYP72A17 join(109468..109819,110022..110245,110529..110781,

111446..111819,111915..112346)

MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAAQMLEWAWLAPRRMERALR

AQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPRVAPLLHRALEEH (phase 1 intron)

GRVSFTWFGPMPRVTITDPDLVREVLSNKFGHF

EKTKLATRLSKLLVGGLVILHGEKWVKHRRIMNPAFHAEKLK (phase 0 intron)

RMLPAFSASCSELIGRWENAVAASVGKAELDIWPDFQNLSGDVISRAAFGVRHHEGRQ

IFLLQAEQAERLVQSFRSNYIPGLS (phase 2 intron)

LLPT ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMDYYSDEDGKS

SKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQVFGRNKPDI

NGVSRLKV (phase 0 intron)

VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPVLFIHRDAAAWGHDAGEF

DPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKVALGMILQRFAFELSPAY

AHAPYTVLTLHPQHGVP VRLRRL*

 

Gene No. : 8-2_302 CYP72A18 C-term extension is wrong. 

1605912..1606000 , 1606047..1606200 , 1606390..1606478 , 1607050..1607241 , 1607596..1607992 , 1608401..1608779 , 1609428..1609672 , 1609767..1609987 , 1610630..1610930 (-)

>8-2_302

MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQGIRGNRYRLFT

GDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEHGKPSFTWFGPTPRVMISDPE

SIREVMSNKFGHYGKPKPTRLGKLLASGVVSYEGEKWAKHRRILNPAFHHEKIKRMLPVF

SNCCTEMVTRWENSMSIEGMSEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESA

ERIIQAFRTIFIPGYWFLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG

LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLSMHPEWQERAR

EEVLHHFGRTTPDYDSLSRLKIVTMILYEVLRLYPPVVFLTRRTYKEMELGGIKYPAEVT

LMLPILFIHHDPDIWGKDAGEFNPGRFADGISNATKYQTSFFPFGWGPRICIGQNFALLE

AKMAICTILQRFSFELSPSYIHAPFTVITLHPQH VKYITTQSLHSDTSHCENRAGLLGTG

RYVPQDFHQICGSQEQNPFFVIASSLQPTPLGFDRRLKGLVMLIENPVIPLLPNRPGNQG

CDGVESPSVWQAAIVELIGHPTSIEIHQPHEVTTSRRRQRPPNRLVLELHPFQFPQTETD

EGIGLLGEVIQGVDCRPQIEIFEDCLDP

 

>CYP72A18 $F AP002839 Oryza sativa genomic DNA, chromosome 1 44993-41630

AU100789.1 Rice callus Oryza sativa cDNA clone C50810.Length = 419 C-term

AU102126.1 Rice callus cDNA clone C10756.Length = 571

AZ130306.1 OSJNBb0103O04r CUGI Rice BAC genomicLength = 320

C26802 36% TO 72  8/97 N-TERMINAL 19-67 REGION opposite end = C96903

C96903, C97406 58% IDENTICAL TO 72 C-TERM 65% to AQ050520

C96799, C28139 219-340 REGION 55% IDENTICAL TO 72 opposite end = C97406

D22332        48% TO 72     12/93  7/98 C-HELIX 89-191 REGION

AU081507.1 Rice callus Oryza sativa cDNA clone C12518_12Z.Length = 581

C26235        36% IDENTICAL TO 72     8/97 AMINO ACIDS 89-216 REGION

AP002744 complement(join(114545..114970,115379..115757,

116406..116650,116745..116965,117608..117908))

D21882        53% TO 72   5/93  7/98 245-352 REGION = 72A18

MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQG

IRGNRYRLFTGDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEH (phase 1 intron)

GKPSFTWFGPTPRVMISDPESIREVMSNKFGHYGKPKPTRLGKLLASGVV

SYEGEKWAKHRRILNPAFHHEKIK (phase 0 intron)

RMLPVFSNCCTEMVTRWENSMSIEGM

SEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESAERIIQAFRTIFIPGYW (phase 2 intron)