Rice chr 1
P450s in the annotated list, in order along the chr [46 genes], from
http://rgp.dna.affrc.go.jp/rgp/complete-chr/chr1/chr1-complete.html
and details from:
http://ricegaas.dna.affrc.go.jp/chr1-bin/search_table.pl
This list is taken from the chromosome 1 annotations found by a keyword search for P450. Not all P450s on chr 1 were annotated in this table.
Of the 46 genes annotated here, 12 agree 100% with my annotations.
2 are fusions of two P450s, and 2 more are fusions to other genes.
71T2, 711A2 and 711A3 are each split into two genes.
Three pseudogenes are represented as intact genes by creative splicing to avoid frameshifts and stop codons, and by an artificial choice of N- and C-termini to finish the gene.
The pseudogenes CYP94D8P, CYP715B3P, 71AA1P, 71AA4P, 72A36P and 76H12P are missed in this annotation, but the annotation does not cover pseudogenes.
CYP734A6, CYP71AA3, CYP71C18 and CYP71C19 are also missed.
Gene No. : 1-1_001 CYP715B2P
8174..8624 , 8639..8844 , 8948..9019 (-)
>1-1_001
MAEGDEWARHRCIVAPAFSATNLNDMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVR
NAAEIIAKASFSIAADDATVFHK AAGDAVPLHAVPLASLLHIRADRATYEAWKLGRKIDA
LLLDIIESRRRCEGGGRKTTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPE
WRAAVREEVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEVVRG
KR
>AP003727.3
$P CYP715B2P chromosome 1 clone:P0672D08 Pseudogene fragment missing N and C-terminal and part of I-helix 39% to 715A1
NRMPMFGRGRVMAEGDEWARHRCIVAPAFSATNLN
DMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVRNAAEIIAKASFSIAADDATVFHK (frameshift)
VRLVSVPLASLLHIRADRATYEAWKLGRKIDALLLDIIESRRRCEGGGRK
TTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPEWRAAVRE (this is missing from AP004123)
EVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEV
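Each gene model in this list gives its exons as inclusive `start..end` pairs followed by a strand marker, for example `8174..8624 , 8639..8844 , 8948..9019 (-)`. A minimal Python sketch of reading such a string and totalling the covered bases follows. The function names `parse_exons` and `coding_length` are illustrative only and are not part of either annotation site.

```python
import re

def parse_exons(coord_text):
    """Parse 'start..end , start..end (+)' into ([(start, end), ...], strand)."""
    strand_match = re.search(r"\(([+-])\)", coord_text)
    strand = strand_match.group(1) if strand_match else None
    # Every 'number..number' pair is taken as one exon.
    exons = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", coord_text)]
    return exons, strand

def coding_length(exons):
    """Total bases covered by the exons; both end coordinates are inclusive."""
    return sum(end - start + 1 for start, end in exons)

exons, strand = parse_exons("8174..8624 , 8639..8844 , 8948..9019 (-)")
total = coding_length(exons)
print(exons, strand, total, total % 3)
```

A total that is not a multiple of 3 would be a quick hint that a model's exon boundaries need a second look, assuming the listed coordinates cover the coding sequence only.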
Gene No. : 1-2_239 CYP96D1 100% agreement
1216403..1217523 , 1217636..1218047 (+)
>1-2_239
MGPLWTFILLYPEIFLAIICFFWFSLFRPIRQRQKSNLPVNWPVFGMLPFLVQNLHYIHD
KVADVLREAGCTFMVSGPWFLNMNFLITCDPATVNHCFNANFKNYPKGSEFAEMFDILGD
GLLVADSESWEYQRRMAMYIFAARTFRSFAMSTITRKTGSVLLPYLDHMAKFGSEVELEG
VFMRFSLDVTYSTVFAADLDCLSVSSPIPVFGQATKEAEEAVLFRHVIPPSVWKLLRLLN
VGTEKKLTNAKVVIDQFIYEEIAKRKAQASDGLQGDILSMYMKWSIHESAHKQKDERFLR
DTAVGFIFAGKDLIAVTLTWFFYMMCKHPHVEARILQELKGLQSSTWPGDLHVFEWDTLR
SAIYLQAALLETLRLFPATPFEEKEALVDDVLPNGTKVSRNTRIIFSLYAMGRIEGIWGK
DCMEFKPERWVSKSGRLRHEPSYKFLSFNTGPRSCLGKELSLSNMKIIVASIIHNFKVEL
VEGHEVMPQSSVILHTQNGMMVRLKRRDAA
Gene No. : 1-2_240 CYP96E1
1220501..1221994 , 1226247..1226672 (+)
>1-2_240
MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPVVGALPAIVANAGRVHD
WVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANFGNYPKGEEFAAMFDVLGG
GIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGGGLVPLLDGVAASGAAVDLQD
VFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDAEEVLFYRHVAPVPWLRLQSYLK
IGHYKKMAKAREVLDASIAELIALRRERKAADANATGDADLLTAYLACQDEIGMDGAAFD
AFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPGVEARILAELRAHPPSPTGAELKRLVY
LHAALSESLRLYPPVPFEHKAAARPDTLPSGAAVGPTRRVIVSLYSMGRMEAVWGKGCEE
FRPERWLTPAGRFRHERSCKFAAFNVGPRTCLGRDLAFAQMKAVVAAVVPRFRVAAAAAP
PRPKLSIILHMRDGLKVK RRDPVQGGGGHRRGHHHEICRCPQSGSGELDKATAMAADEEE
EVAPNLVFVTIQLPPSSSSSPLKTTQQLDGEGEELIGVQPKEEDRRLEEEEGGGVAADLA
VSRGPSRQACRCTGQESRAVGRGRKQGERRPEGEGICRR
>AP002484b
$F CYP96E1 CDS 80463..81980 43% to 96A1
MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPV
VGALPAIVANAGRVHDWVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANF
GNYPKGEEFAAMFDVLGGGIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGG
GLVPLLDGVAASGAAVDLQDVFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDA
EEVLFYRHVAPVPWLRLQSYLKIGHYKKMAKAREVLDASIAELIALRRERKAADANAT
GDADLLTAYLACQDEIGMDGAAFDAFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPG
VEARILAELRAHPPSPTGAELKRLVYLHAALSESLRLYPPVPFEHKAAARPDTLPSGA
AVGPTRRVIVSLYSMGRMEAVWGKGCEEFRPERWLTPAGRFRHERSCKFAAFNVGPRT
CLGRDLAFAQMKAVVAAVVPRFRVAAAAAPPRPKLSIILHMRDGLKVK VHRRQED*
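For CYP96E1 the gene model (>1-2_240) and the curated sequence (>AP002484b) read the same up to ...GLKVK and then diverge. A short Python sketch for locating such a divergence point follows. It is only a plain prefix comparison, not an alignment, and the helper names `clean` and `divergence_point` are hypothetical.

```python
def clean(seq):
    """Drop whitespace and a trailing stop marker '*' from a protein string."""
    return "".join(seq.split()).rstrip("*")

def divergence_point(model, curated):
    """Return the number of leading residues shared by the two sequences."""
    a, b = clean(model), clean(curated)
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

# Tails of the two CYP96E1 sequences, starting at the same residue.
model_tail = "PRPKLSIILHMRDGLKVK RRDPVQ"
curated_tail = "PRPKLSIILHMRDGLKVK VHRRQED*"
shared = divergence_point(model_tail, curated_tail)
print(shared, clean(model_tail)[:shared])
```

Because the comparison stops at the first mismatch, it suits C-terminal extension cases like this one; models with a wrong N-terminus or missing internal exons would need a real alignment instead.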
Gene No. : 1-2_394 CYP90D2
2035743..2035993 , 2036078..2036405 , 2036438..2036531 , 2036850..2037055 , 2037375..2037623 ,
2038901..2039086 , 2039597..2039718 , 2043035..2043131 (+)
>1-2_394
MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAAAAARLPPGSFGWPVVG
ETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVTADAEVSRFVLQSDARAFV
PWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHLKSQLTADMRRRLSPALSSFP
DSSLLHVQHLAKS LLDEIEWVDELEEEQSGWAWASAGVRAHVRAQMRMERNVIARNGDEM
QMQ VVFEILVRGLIGLEAGEEMQQLKQQFQEFIVGLMSLPIKLPGTRLYRSLQAKKKMAR
LIQRIIREKRARRAAASPPRDAIDVLIGDGSDELTDELISDNMIDLMIPAEDSVPVLITL
AVKFLSECPLALHQLE VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCVFVYFRSVH
LDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIFLHHLVTSFRW
VAEEDHIVNFPTVRLKRGMPIRVTAKEDDD
>AP003244
$F CYP90D2 59% to 90D1
CDS join(30874..31124,31209..31536,32037..32186,32506..32754,
33832..33921,34032..34217,34728..34849,38166..38262)
AQ157843 64% identical to AQ290163 75% TO 90C1 AT HEME BINDING REGION
C97894 Rice callus Oryza sativa cDNA clone C0085_11A, mRNA sequence
extreme C-term 71% to CYP90C1 opp end = C97895 (probably 3 prime untranslated)
MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAA
AAARLPPGSFGWPVVGETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVT
ADAEVSRFVLQSDARAFVPWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHL
KSQLTADMRRRLSPALSSFPDSSLLHVQHLAKS VVFEILVRGLIGLEAGEEMQQLKQQ
FQEFIVGLMSLPIKLPGTRLYRSLQAKKKMARLIQRIIREKRARRAAASPPRDAIDVL
IGDGSDELTDELISDNMIDLMIPAEDSVPVLITLAVKFLSECPLALHQLE EENIQLKR
RKTDMGETLQWTDYMSLSFTQH
VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCV
FVYFRSVHLDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIF
LHHLVTSFRWVAEEDHIVNFPTVRLKRGMPIRVTAKEDDD
Gene No. : 12_245 CYP94E2 100% agreement
1349356..1350990 (-)
>12_245
MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLD
WATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLG
RGLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQ
DVLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAM
RLANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGG
ELAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSR
HADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAV
RAGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRACLGR
EMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKREDDAAQ
QKLT
>AP003735.2b
$F CYP94E2 genomic DNA, chromosome 1, BAC clone:B1147A04, complete
61% to AP003735 4872-6534
8263
MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLDW 8445
8446
ATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLGR 8625
8626
GLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQD 8805
8806
VLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAMR 8985
8986
LANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGGE 9165
9166
LAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSRH 9345
9346
ADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAVR 9525
9526
AGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRAC 9693
9694
LGREMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKR 9864
EDDAAQQKLT* 9897
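Several curated records, like the CYP94E2 block above, interleave nucleotide coordinates with the protein sequence. A rough Python sketch for recovering the bare protein from such a block follows. It keeps only whitespace-separated tokens made entirely of the 20 standard amino-acid letters or `*`. This filter is an assumption of the sketch and is crude: an uppercase comment word built only from those letters (for example `THIS`) would slip through, so it is meant for coordinate-numbered lines rather than free-text notes.

```python
import re

# Tokens consisting only of standard one-letter residue codes, or the stop marker.
RESIDUES = re.compile(r"^[ACDEFGHIKLMNPQRSTVWY*]+$")

def protein_from_numbered_block(lines):
    """Join residue tokens, dropping coordinates and marks such as '(0)'."""
    parts = []
    for line in lines:
        for token in line.split():
            if RESIDUES.match(token):
                parts.append(token)
    return "".join(parts)

block = [
    "9694",
    "LGREMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKR 9864",
    "EDDAAQQKLT* 9897",
]
protein = protein_from_numbered_block(block)
print(protein)
```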
Gene No. : 12_246 CYP94E1 100% agreement
1352719..1354368 (-)
>12_246
MDSSYLHLVLPAAAAAVVVAVVLLLSLWRRCQTTSNHRPQANPILGNLVAFLANGHRFLD
WSTGLLAAAPASTMQVHGPLGLGYCGVATASPDAVEHMLRASFHNYVDKGDRVRDAFADL
LGDGLFLANGRLWRLQRKLAASSFSPRLLRLFAGRVVLDQLRRRLLLFFDAAADARRVFD
LQDVLKRFAFDNICSVAFGVDRDDSSPSSSSPSRLEAGGDGRDDAFFAAFDDAIDISFGR
ILHPTTLAWKAMKLLDVGSERRLRQAIGVVDEYVTAIMESKQRCSDSEEESDLLSRFTAA
MMEEDGGNELGAMFDSPEAKRRFLRDTVKTFVLAGKDTTSSALTWLFWFLAANPECERRV
YEEVTALRGDTAGDERDDGYEELKRMHYLHAAITETMRLYPPVPLASRVAAADDVLPDGT
VVRAGWFADYSSYAMGRMPQLWERDCGEFRPERWLDGGGGGGGRFVAVDAARYPVFHAGP
RSCLGKEMAYVQMKAVAAAVVRRFSVEVVPAAAANAPPSPPPHETAVTLRMKGGLRVLLT
RRRGVLSHA
Gene No. : 12_317 CYP71AA2 100% agreement
1719193..1720104 , 1720260..1720934 (+)
>12_317
MAGIMDSTTASYYTTLLCGALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHC
LLGSLPHHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTAS
IDIVFAPFGKHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVS
ELVKIMTNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSL
RTTKRVHEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVS
AVLFDLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHY
LQLVIKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNE
FRPERFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETD
THELDMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE
this pseudogene not annotated
>AP004326.2d
$P CYP71AA1P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 4 pseudogene
81031
DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift
81213
QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift
81304
KNTAIFVNTWALGR 81345 frameshift
81344
KIKNTGLMQVSSGLKY
81393
SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566
81567
LDMTEANGITTHRRIDIWLEATPFVPR 81647
this gene not annotated
>AP004326.2b
$F CYP71AA3 genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 2 no good matches in NR 79% to AP004326.2c
71860
MAGIVDTAAFCT
71896
LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069
72070
YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243
72244
SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408
72409
MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588
72589
VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747
72748
RFHRDGGLGITLTKEIVSAVLF 72813 (0)
73327
DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497
73498
QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677
73678
RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857
73858
CKLDMRETHGVTARRRTELLLKATPLYT* 73944
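The marks (0), (1) and (2) that follow an exon in the curated records appear to denote intron phase (written out as "phase 1 intron" in the CYP72A17 and CYP72A18 entries). Under the assumption that the listed exons cover only coding sequence, beginning at the start codon, the phase of each intron is the cumulative coding length before it, modulo 3. A small Python sketch follows; the function name `intron_phases` is illustrative.

```python
def intron_phases(exons, strand="+"):
    """Phase of the intron after each exon except the last.

    exons: list of inclusive (start, end) pairs in genomic coordinates.
    On the minus strand the exons are read from the highest coordinate down.
    """
    ordered = sorted(exons)
    if strand == "-":
        ordered = ordered[::-1]
    phases, total = [], 0
    for start, end in ordered[:-1]:
        total += end - start + 1      # inclusive exon length
        phases.append(total % 3)      # 0 = intron falls between codons
    return phases

# Gene model 12_317 (CYP71AA2): 1719193..1720104 , 1720260..1720934 (+)
print(intron_phases([(1719193, 1720104), (1720260, 1720934)], "+"))  # [0]
```

The first exon of the CYP71AA2 model is 912 bases long, so its single intron comes out as phase 0, which matches the (0) mark on the corresponding intron of its neighbour CYP71AA3.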
this pseudogene not annotated
>AP004326.2a
$P CYP71AA4P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Length = 102983
4 genes 71B like
Gene 1 pseudogene 71 family
67989
LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion
68060
RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227
68228
GARP 68239 frameshift with small deletion
68238
RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396
68397
DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)
68886
GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion
69023
YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202
69203
F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382
69383
RMELDMTESAGLTASRLTDLFG* 69451
Gene No. : 1-3_149 CYP710A5 this seq has long N-term
677956..679547 , 683244..683265 (-)
>1-3_149
MAIPGSKEHKCNCLSRSSRAFSTPRT
MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALY
MLIEQLSYHRKKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFG
RFTVFIRDSELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNF
TPRALSTYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTE
KARERFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEP
ECLLDYLMQETVREIDEATAAGL
PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL
DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIAVE
AFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLAFGAG
PHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYLKQR
>AP002092a
$F CYP710A5 CDS
complement(54602..56137) 60% TO 710A1
THIS SEQ IS THE SAME AS AP002093a CDS complement(98133..99668)
MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALYMLI
EQLSYHR
KKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFGRFTVFIRD
SELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNFTPRALS
TYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTEKARE
RFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEPEC
LLDYLMQETVREI
DEATAAGL (this may be too long vs Arabidopsis; check for intron)
PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL
DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIA
VEAFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLA
FGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYL
KQR*
Gene No. : 1-3_150 CYP710A6 100% agreement
684165..685703 (-)
>1-3_150
MVESFHGLVVVDLRTAAPLLATAVALYILIEQLSYHRKKGSMPGPPLVVVPFLGSVTHLF
RDPVGFWDLQATRASKSGAGLTADFLFGRLMVFIRDSELSRRVFANVRADAFHLVGHPFG
KKLFGDHNLIYMVGKEHKDLRRRIAPNFTPRALSTYAVIQQRVILSHLRRWIDRSVANGG
KAEPIRVPCRDMNLETSQTVFVGPYLTVETRERFDRDYNLFNHGFITLPIDLPGSAFRRA
RLAVPRLKHILEDCARQSKQRMRGGGEPECLVDYLMQETVREIDEAAAAGLPPPPHTSDM
ETGNLLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVAALWSPESGEPITAEMMT
EMKYTQAVAREVVRYWPPGPVVPHIAGEAFQLTEQYTIPKGTIVFPSVYESSFQGFPDAG
TFDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALLASLIDFRRERT
EGCDVPVYMPTIVPRDGCVVHLKQRCAKLPSF
Gene No. : 1-3_155 CYP710A7 100% agreement
704151..705665 (-)
>1-3_155
MVDSLLYGLLDLRMAAPLLAAAVALYVLVEQLSYHRKKGSLPGPPLVVPFIGSATHMIRD
PTGFWEMQAARARKSGVGFTADFLAGKFTIFIRDSELSNRVFANVRPDAFFVIGHPFGKK
LFGDHNLIYLFGDDHKDLRRRMATNFTPRALSTYAAIQQRGIVSHLRRWLDRSAANGGKA
EPIRVPCRDMNLETSQTVFAGPYLTEEARERFKSDYNLFNVGLLAFPVDLPGLAFRRARQ
AVARLVRMLRDCARESKARMRAGGEPECLVDYWMQETVREIDEAKAAGLPPPAHISDDEE
IGGFLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVASLWSPDSGEPITADKIAE
MKYTKAVAREVVRHRPPATLMPHIALQNFQLTESYTIPKGTLVLPSMYESSFQGFHDPDA
FDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTE
GCDVPVYMPTMVPRDGCVVYLKQR
Gene No. : 1-3_159 CYP710A8 100% agreement
723800..725326 (-)
>1-3_159
MAAVVDFLDLRAAAPFVVAALAFYFLVEQLSYHRKKGPLPGPPLVVPFVGSVAHMIRDPT
GFWDAQAARARKSGAGLAADFLIGRFVVFIRDSELSHRVFANVRPDAFHLIGHPFGKKLF
GDHNLIYMFGEDHKDLRRRIAPNFTPRALSTYAAIQQRVILSHLRRWLDRSAANGGKAEP
IRVPCRDMNLETSQTVFAGPYLTKEAREKFERDYNFFNVGLMALPVDLPGFAFRSARLGV
ARLVRTLGECARASKARMRAGGEPECLVDFWMQETVREIDEAKAAGKPPPAHTDDEELGG
FLFDFLFAAQDASTSSLCWAVSALDSHPDVLAGVRAEVASLWSPESGEPITAEKIAEMKY
TQAVAREVVRHRPPATLVPHIAGEEFQLTEWYTIPKGTIVFPSVYESSFQGFPEPDTFDP
ERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVLFMALFVSVVDFRRDRTEGCD
EPVYMPTIVPRDSCTVYLKQRCAKFPSF
Gene No. : 1-3_343 CYP71T1 100% agreement
1694984..1695988 , 1696589..1697260 (+)
>1-3_343
MELSSSLAAVLHSPLFLLAALLLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGH
LPLLGSLPHRKLRSMAEAHGPVMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRM
AERLIYGRDMVFAPYGEFWRQARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGV
RGGGETVNLSDLLMSYANGVISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGE
FVPWLAWVDKLMGLDAKAARISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDH
RDFVDVLLDVSEVEEGAGAGEVLLFDTVAIKAIILDMIAAATDTTFTTLEWAMAELINHP
PVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELRLLRAVVKETLRLHAPVPLLVPRETVE
DTELLGYRVPARTRVIINVWAIGRDAAAWGDRAEEFVPERWLDGGGEEVEYAAQLGQDFR
FVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDWELPPHADGAAAATAARLDMGELFGLS
MRMKTTLNLVAKPWSSDV
Gene No. : 1-3_344 CYP71T2 exon 1 only + C-term extension
1699829..1700905 (+)
>1-3_344
MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLPLPPSPPGVPLLGH
LPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRTRDLAFASRPRVRM
SERLFYGRDMAFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQEVAALLDRVRRRCGG
GGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFADFEGLLGTMTVGEF
VPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQAVGDGEADADHRDFVD
VMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL VRTPLVVVLLTCRSADATVDYFLWNQT
Gene No. : 1-3_345 CYP71T2 exon 2 only + N-term extension
1702184..1702900 (+)
>1-3_345
MNERFIEQ DMMAAGTDSSFTTTEWVMAELINHPRVMRKLQDEIRAVVGTSSASAAAAATG
GGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVEDTELLGYRIPARTRVIINVWA
IGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDSRFVPFGAGRRGCPGAGFAALS
VELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLSVRLKADLNLVAKPWSPGAS
>AP003434.1b
$F CYP71T2 chromosome 1, PAC clone:P0452F10, complete = AA754300
AA754300 42% IDENTICAL TO 71A14 1/98 I-HELIX 43% to 703A2
39698
MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP
39839
LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018
40019
RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195
40196
VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375
40376
DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552
40553
VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)
42074
DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169
42170
QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349
42350
DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529
42530
RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709
42710
VRLKADLNLVAKPWSPGAS* 42769
Gene No. : 1-3_346 CYP71T3 100% agreement
1708142..1709101 , 1711273..1711902 (+)
>1-3_346
MAVSLVVVVVVVIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHL
LGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAER
LLYGGRDVAFAPYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVD
LVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLG
WVDALNGMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVN
ETDMDAGVQLGTIEIKAIILDMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVG
ITSHITEDHLDRLPYLKAVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAW
TIGRDQATWGEHAEEFIPERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALA
SLLYNFDWETRVVDRRSEFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP
Gene No. : 1-3_348 CYP71T4 extension of exon 2 not correct
1718250..1719233 , 1719660..1720331 (+)
>1-3_348
MAVSLLPAVLVLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLP
LLGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRP
RMAMAELLLYGGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVR
AAAADVVVDLSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEP
MGELLPWFWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDF
VDVLLDVNETDKDAGIQLGTVEIKAIIM LICFLLHGHEQ DMFVGGSDTTTTMMAWTMAEL
INHPRAMRKAQNEIWAVVGNTSHVTKDHVDKLPYLKAVFKETLRLHPPLPLLIPREPPAD
TQILGYTIPAHTRVVINAWAIGRDAAAWGQQPDEFSPEKFLNSTIDYKGQDFELLPFGAG
RRGCPGIVFGVSAMEIALASLLYHFDWEAAATDHRRRGSQAWALPVDMSEVNGIAVHLKY
GLHVVAKPRMP
>AP003434.1d
$F CYP71T4 chromosome 1, PAC clone:P0452F10, complete like 71A
58119
MAVSLLPAVL
58149
VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328
58329
LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508
58509
GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685
58686
LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859
58860
FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036
59037
VNETDKDAGIQLGTVEIKAIIM 59102 (0)
59562
DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717
59718
LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897
59898
PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077
60078
TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200
Gene No. : 3-2_172 CYP709D1 missing end of exon 4
907473..907752 , 907912..908135 , 908724..908977 , 909301..909613 , 910180..910611 (+)
>3-2_172
MDVPSVVIPILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRM
KAAAAADEVAAGAHSHDFIPIVLPQHSKWATDHGKTFLYWLGAVPAVSLGRVEQVKQVLL
ERTGSFTKNYMNANLEALLGKGLILANGEDWERHRKVVHPAFNHDKLKFMSVVMAESVES
MVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKEVYQAQKELQELAF
SSSLDVPALVFLRGNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARA
LEREGNGLVLTTQEIIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLTV
NM
VLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEFNPA
RFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVHAPM
EAITLRPRFGLPVVLRNLQG
>AP003258.2
$F CYP709D1 genomic DNA, chr 1, PAC clone:P0463A02, complete 46% to 709B2
N-term runs off end of contig identical to AP003764.2 (has N-term)
MLKSTIELYIFTTAIAKKSLHSQTKHKSKMDVPSVVIP
151039
ILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRMKAAAAADEVA 150857
150856 AGAHSHDFIPIVLPQHSKWATDHG (1)
KTFLYWLGAVPAVSLGRVEQVKQVLLERTGSFTKNYMNANLEA
150497
150496 LLGKGLILANGEDWERHRKVVHPAFNHDKLK
150404 (0)
149815
FMSVVMAESVESMVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKE 149639
149638 VYQAQKELQELAFSSSLDVPALVFLR
149561 (2)
149237
GNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARALEREGNGLVLTTQE 149055
149054
IIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLT VCGDAIPTPDMANRLKL 148875 (0)
148359
VNMVLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEF
148180
148179
NPARFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVH 148000
147999 APMEAITLRPRFGLPVVLRNLQG* 147928
Gene No. : 4_108 CYP71K1 100% agreement
580030..580668 , 580739..581656 (-)
>4_108
MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWALPVIGHLHHVAGALPHR
AMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAFATRPITPTGKVLMADSVG
VVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAVAALTTPGAT
AAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLERRMKLLPAQCLPDLFPSSRAAML
VSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAEEDLLDVLLRIQSQDKTNPALTNDN
IKTVIIDMFVASSETAATSLQWTMSELMRNPRVMRKAQDEVRRALAIAGQDGVTEESLRD
LPYLHLVIKESLRLHPPVTMLLPRECRETCRVMGFDVPEGVMVLVNAWAIGRDPAHWDSP
EEFAPERFEGVGAADFKGTDFEYIPFGAGRRMCPGMAFGLANMELALAALLYHFDWELPG
GMLPGELDMTEALGLTTRRCSDLLLVPALRVPLRDHER
Gene No. : 8-1_133 CYP76M14 100% agreement
696859..698433 (-)
>8-1_133
MEKSSELWLLWAVFSASLVFLYLTIRRRSGAGAGGKPPLPPGPTPLPLIGNLLDLRGGVI
HDKLAALARVYGPVMMIKLGLNDAVIISSRDAAREAFTRYDRHLAARAIPDTFRANGFHE
RSAVFLPSSDERWKALRGIQGTHIFTPRGLAAVRPVRERKVRDIIAYFRDHAGEELVIRQ
AIHTGVLNLVSSSFFSMDIAGMGSETARELREHVDEIMTVFAQPNVSDYFPFLRRLDLQG
LRRSTKRRFDRIFSILDDIVERRLVDRGERGGEGGASSNSSKSKHQYDGGDFLDALLELM
VTGKMERDDVTAMLFEAFVAGGDTVAFTLEWVMADLLRNPPVMAKLRAELDDVLGGKDQS
AIEEHDAGRLPYLQAVLKESMRLHSVGPLLHHFAAEDGVVVGGYAVPRGATVLFNTRAIM
RDPAAWERPEEFAPERFLAREGKAPVDFRGKEADFIPFGSGRRLCPGIPLAERVMPYILA
LMLREFEWRLPDGVSPEELDVSEKFMSVNVLAVPLKAVPVKVIN
Gene No. : 8-1_562 CYP72A31P pseudogene, not intact
3057089..3057514 , 3057625..3058009 , 3058183..3058256 (-)
>8-1_562
MLYTPYHKEMYMSVLLTSHGSNLPM
SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEG
ESTKDDLLGILLESNTKHMEENGQSSQGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLL
SIHPEWQDHAREEIMGLFRKNKPDYEGLSRLKIVTMIFYEVLRLHPPFIEIGWKTYKEME
IGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEFKPERFSEGISKASKDPGAFLPFGWGPR
ICIGQNFALLESKMALCLILQRLEFELAPSYTHAPHTMVTLHPMHGAQMKVRAI
>AP003278
$P CYP72A31P chromosome 1, PAC clone:P0518F01, similar to 72A22 missing N-term half
AP003330.1 chromosome 1 clone B1085F01 CYP72A like
Pseudogene, no N-term in 9000bp upstream until next p450 ends near 22400
31539
SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEGESTKDDLLGILLESNTKHMEENGQSS
31718
31719
QGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLLSIHPEWQDHAREEIMGLFRKNKPDYE 31898
31899 GLSRLKI
32030
VTMIFYEVLRLHPPFIEIGWKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEF 32209
32210
KPERFSEGISKASKDPGAFLPFGWGPRICIGQNFALLESKMALCLILQRLEFELAPSYTH 32389
32390 APHTMVTLHPMHGAQMKVRAI 32452
or frameshift after KVR to SYMIISDYSVFYYYNSWL* (compare with end of 72A33)
Gene No. : 8-1_564 CYP72A32 end of gene is incorrect, missing heme signature
3066867..3066964 , 3067301..3067529 , 3067638..3068022 , 3068485..3068729 , 3069130..3069681 (-)
>8-1_564
MVLGGWLLMWAPASSPTILVAFGLLFGLVLAWQAGLQLHRLWWRPRRLEKALRARGLRGS
SYRFLTGDLAEESRRRKEAWARPLPLRCHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGP
TPEVHVTDPELAKVVMSNKFGHFEKIRFQALSKLLPQGLSYHEGEKWAKHRRILNPAFQL
EKLKLMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR
RIFELQGELFERVMKSVEKIFIPGYMYLPTENNRKMHQINKEIESILRSMIGKRMQAMKE
GESTKDDLLGILLESNMRHTEENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILL
LSMHPEWQDRARKEILGLFGKNKPEYDGLNNLKIVTMILYEVLRLYPPFIELKRRTYKEM
KIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGISKASKDP VYGVIDFCDT
FDRLSYPLRFLMYDMVNFLQCV
>AP003278a
$F CYP72A32 19863-22437 chromosome 1, PAC clone:P0518F01, similar to 72A22
AP003330.1 50023-47446 chromosome 1 clone B1085F01, CYP72A like 536aa
AP004738.1 Oryza sativa chromosome 6 clone OSJNBa0090D06 chrom. conflict
50023 MVLGGWLLMWAPASSPTILVAFGLLFG
49942 LVLAWQ
AGLQLHRLWWRPRRLEKALRARGLRGSSYRFLTGDLAEESRRRKEAWARPLPLR 49763
49762
CHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGPTPEVHVTDPELAKVVMSNKFGHFEKIR 49583
49582
FQALSKLLPQGLSYHEGEKWAKHRRILNPAFQLEKLK 49472 (0)
49071
LMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR 48904
48903 RIFELQGELFERVMKSVEKIFIPGYM 48826
(2)
48363
YLPTENNRKMHQINKEIESILRSMIGKRMQAMKEGESTKDDLLGILLESNMRHT 48202
48201
EENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILLLSMHPEWQDRARKEILGLFG 48022
48021 KNKPEYDGLNNLKI (0)
VTMILYEVLR 47842
47841
LYPPFIELKRRTYKEMKIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGIS 47662
47661 KASKDP
GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPTYTHAPHTMITLHP 47482
47481
MHGAQIKIRAI* 47446
Gene No. : 8-1_565 CYP72A33 end is wrong and two exon boundaries disagree
3071048..3071076 , 3075238..3075367 , 3075707..3075777 , 3076421..3076649 , 3076758..3077142 ,
3077951..3078246 , 3078684..3079205 (-)
>8-1_565
MWAPASSPTILAAFGLVGLVLAWQ
AGLQLHRLWWRPRRLEKALRARGLRGSRYRFLTGDL
AEEGRRRKEAWARPLPLRCHDIAPRVEPFLHGAVGVGAAHGKPRITWFGPTPEVHVADPE
LARVVLSNKFGHFEKVSFPELSKLIPQGLSAHEGEKWAKHRRILNPVFQLEKLK SILFLY
LIIEMSSENVQ
LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFG
SSYLEGRRIFELQGELFERVIKSIQKMFIPG YM YLPTENNRKMHQMNKEIESILRGMIGK
RMQAMKEGESTKDDLLGILLESNTRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVL
LTWTMLLLSMHPEWQDRAREEILGLFGKNKPDYDGLSRLKIVTMILYEVLRLYPPFIELT
RKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPERFSEGISKASKDP VEV
PRRMEIHRSFDLPRDNIQMSQNRSPCGAPPSSSYRQIRYPEQLADEPIRLSPPDGNGDIL
DLRGIKLEHFASS
>AP003278b
$F CYP72A33 chromosome 1, PAC clone:P0518F01, 82% to 72A22
AP003330.1 59493-56536 chromosome 1 clone B1085F01, CYP72A like 516aa
N-term does not match between the two clones; AP003278 has MVLGGGWLSMWAPASSPTILAAFGLVGLVLAWQ before the AGLQ seq.
59493
MVLEGK AGLQLHRLWWRPRRLEKALRARGLRGSRYRFL
TGDLAEEGRRRKEAWARPLPLRCHDIAPRVEP
59284
59283
FLHGAVGVGAAHGKPRITWFGPTPEVHVADPELARVVLSNKFGHFEKVSFPELSKLIPQG 59104
59103 LSAHEGEKWAKHRRILNPVFQLEKLK 59026 (0)
58537
LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFGSSYLEG 58373
58372 RRIFELQGELFERVIKSIQKMFIPG 58298
(2)
57483
YLPTENNRKMHQMNKEIESILRGMIGKRMQAMKEGESTKDDLLGILLESN 57334
57333 TRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVLLTWTMLLLSMHPEWQDRAREEIL
57154
57153 GLFGKNKPDYDGLSRLKI (0) VTMIL
56977
56976
YEVLRLYPPFIELTRKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPER 56800
56799 FSEGISKASKDP
GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELATSYTHVPHT 56620
56619
IISLHPMHGAQIKVKSYMTISDYSVFY* 56536
Gene No. : 8-2_301 CYP72A17 N-term extension, C-term is wrong, missing middle exons 2 and 3
1594648..1594915 , 1598386..1598464 , 1598535..1598717 , 1601636..1601742 , 1601972..1602084 ,
1602475..1602841 , 1604468..1604841 , 1604937..1605347 , 1605464..1605643 (+)
>8-2_301
MEPSTRTRRLQPNRAPLGRYGEGGSRRIRRRRGDQNRILEAIPRWRGARGFVQQQQHQEK
GGGFGGEERRRQERRRQQEQKGNSTSSMGFLSTCAYGYLGRVDLQNSVHQSTCTVSTTSS
ASSHICFLYINLRIMSISIENDVNDNCDRNSNGGNGNGSIGNDNINTTTSESRAFCGFSF
LRPYRAVRYLRDLQPYILSSIQSASRVPPPEAPPSLPACSFGRSRPPPSLISNLSTVDHA
DAGDASPAYKREKEA
MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAA
QMLEWAWLAPRRMERALRAQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPR
VAPLLHRALEEH ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMD
YYSDEDGKSSKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQ
VFGRNKPDINGVSRLKV VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPV
LFIHRDAAAWGHDAGEFDPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKV
ALGMILQRFAFELSPAYAHAPYTVLTLHPQHGVP NTFADKHWKLHVPGIRHSEISISDMK
AEHYLLTTAAAVPCIVEVYEYPYSISRVFSWSS
>CYP72A17
$F AP002839 Oryza sativa genomic DNA, chromosome 1 36553-39431
AG025591.1 strain ND3008 PCR from rice genomic DNA clone T8121T. Length = 401
AG025107.1 strain NC2542 PCR from rice genomic DNA clone T5184T. Length = 504
AU071192 very similar to AQ050520 = 72A17
AP002744 CYP72A17 join(109468..109819,110022..110245,110529..110781,
111446..111819,111915..112346)
MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAAQMLEWAWLAPRRMERALR
AQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPRVAPLLHRALEEH (phase 1 intron)
GRVSFTWFGPMPRVTITDPDLVREVLSNKFGHF
EKTKLATRLSKLLVGGLVILHGEKWVKHRRIMNPAFHAEKLK
(phase 0 intron)
RMLPAFSASCSELIGRWENAVAASVGKAELDIWPDFQNLSGDVISRAAFGVRHHEGRQ
IFLLQAEQAERLVQSFRSNYIPGLS
(phase 2 intron)
LLPT
ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMDYYSDEDGKS
SKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQVFGRNKPDI
NGVSRLKV (phase 0 intron)
VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPVLFIHRDAAAWGHDAGEF
DPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKVALGMILQRFAFELSPAY
AHAPYTVLTLHPQHGVP VRLRRL*
Gene No. : 8-2_302 CYP72A18 C-term extension is wrong.
1605912..1606000 , 1606047..1606200 , 1606390..1606478 , 1607050..1607241 , 1607596..1607992 ,
1608401..1608779 , 1609428..1609672 , 1609767..1609987 , 1610630..1610930 (-)
>8-2_302
MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQGIRGNRYRLFT
GDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEHGKPSFTWFGPTPRVMISDPE
SIREVMSNKFGHYGKPKPTRLGKLLASGVVSYEGEKWAKHRRILNPAFHHEKIKRMLPVF
SNCCTEMVTRWENSMSIEGMSEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESA
ERIIQAFRTIFIPGYWFLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG
LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLSMHPEWQERAR
EEVLHHFGRTTPDYDSLSRLKIVTMILYEVLRLYPPVVFLTRRTYKEMELGGIKYPAEVT
LMLPILFIHHDPDIWGKDAGEFNPGRFADGISNATKYQTSFFPFGWGPRICIGQNFALLE
AKMAICTILQRFSFELSPSYIHAPFTVITLHPQH VKYITTQSLHSDTSHCENRAGLLGTG
RYVPQDFHQICGSQEQNPFFVIASSLQPTPLGFDRRLKGLVMLIENPVIPLLPNRPGNQG
CDGVESPSVWQAAIVELIGHPTSIEIHQPHEVTTSRRRQRPPNRLVLELHPFQFPQTETD
EGIGLLGEVIQGVDCRPQIEIFEDCLDP
>CYP72A18
$F AP002839 Oryza sativa genomic DNA, chromosome 1 44993-41630
AU100789.1 Rice callus Oryza sativa cDNA clone C50810. Length = 419 C-term
AU102126.1 Rice callus cDNA clone C10756. Length = 571
AZ130306.1 OSJNBb0103O04r CUGI Rice BAC genomic Length = 320
C26802 36% TO 72 8/97 N-TERMINAL 19-67 REGION opposite end = C96903
C96903, C97406 58% IDENTICAL TO 72 C-TERM 65% to AQ050520
C96799, C28139 219-340 REGION 55% IDENTICAL TO 72 opposite end = C97406
D22332 48% TO 72 12/93 7/98 C-HELIX 89-191 REGION
AU081507.1 Rice callus Oryza sativa cDNA clone C12518_12Z. Length = 581
C26235 36% IDENTICAL TO 72 8/97 AMINO ACIDS 89-216 REGION
AP002744 complement(join(114545..114970,115379..115757,
116406..116650,116745..116965,117608..117908))
D21882 53% TO 72 5/93 7/98 245-352 REGION = 72A18
MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQG
IRGNRYRLFTGDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEH
(phase 1 intron)
GKPSFTWFGPTPRVMISDPESIREVMSNKFGHYGKPKPTRLGKLLASGVV
SYEGEKWAKHRRILNPAFHHEKIK (phase 0 intron)
RMLPVFSNCCTEMVTRWENSMSIEGM
SEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESAERIIQAFRTIFIPGYW
(phase 2 intron)