rice chr 1
P450s in annotated list in order on the chr [46 genes]
from
http://rgp.dna.affrc.go.jp/rgp/complete-chr/chr1/chr1-complete.html
and
details from:
http://ricegaas.dna.affrc.go.jp/chr1-bin/search_table.pl
This list
is taken from the chromosome 1 annotations found by a
keyword
search for P450. Not all P450s on
chr 1 were annotated in this table.
Of the 46 genes
annotated here 12 agree 100% with my annotations
2 are
fusions of two P450s, 2 more are fusions to other genes
71T2 711A2
and 711A3 are split into two genes each
Three
pseudogenes are represented as intact genes by
creative
splicing to avoid frameshifts and stop codons,
and an
artificial choice of N and C-termini to finish the gene.
The
pseudogenes CYP94D8P, CYP715B3P, 71AA1P, 71AA4P, 72A36P, 76H12P
are missed
in this annotation, but the annotation does not
cover
pseudogenes.
CYP734A6, CYP71AA3,
CYP71C18, CYP71C19 are also missed
Gene No. :
1-1_001 CYP715B2P
8174..8624
, 8639..8844 , 8948..9019 (-)
>1-1_001
MAEGDEWARHRCIVAPAFSATNLNDMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVR
NAAEIIAKASFSIAADDATVFHK AAGDAVPLHAVPLASLLHIRADRATYEAWKLGRKIDA
LLLDIIESRRRCEGGGRKTTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPE
WRAAVREEVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEVVRG
KR
>AP003727.3
$P CYP715B2P chromosome 1 clone:P0672D08 Pseudogene fragment
missing N
and C-terminal and part of I-helix 39% to 715A1
NRMPMFGRGRVMAEGDEWARHRCIVAPAFSATNLN
DMIGVMEETTSKMLGEWSDIVALGHSCIDIEKGVVRNAAEIIAKASFSIAADDATVFHK (frameshift)
VRLVSVPLASLLHIRADRATYEAWKLGRKIDALLLDIIESRRRCEGGGRK
TTTTDLLWLLLAGNEASAAAERKLTTALALSWTLLMLATHPEWRAAVRE this is missing from
AP004123
EVEEVTGWSGPMDAAAMGKLTKMGCMLNEVLRLYPPSPNVQRPAACDAEV
Gene No. :
1-2_239 CYP96D1 100%
agreement
1216403..1217523
, 1217636..1218047 (+)
>1-2_239
MGPLWTFILLYPEIFLAIICFFWFSLFRPIRQRQKSNLPVNWPVFGMLPFLVQNLHYIHD
KVADVLREAGCTFMVSGPWFLNMNFLITCDPATVNHCFNANFKNYPKGSEFAEMFDILGD
GLLVADSESWEYQRRMAMYIFAARTFRSFAMSTITRKTGSVLLPYLDHMAKFGSEVELEG
VFMRFSLDVTYSTVFAADLDCLSVSSPIPVFGQATKEAEEAVLFRHVIPPSVWKLLRLLN
VGTEKKLTNAKVVIDQFIYEEIAKRKAQASDGLQGDILSMYMKWSIHESAHKQKDERFLR
DTAVGFIFAGKDLIAVTLTWFFYMMCKHPHVEARILQELKGLQSSTWPGDLHVFEWDTLR
SAIYLQAALLETLRLFPATPFEEKEALVDDVLPNGTKVSRNTRIIFSLYAMGRIEGIWGK
DCMEFKPERWVSKSGRLRHEPSYKFLSFNTGPRSCLGKELSLSNMKIIVASIIHNFKVEL
VEGHEVMPQSSVILHTQNGMMVRLKRRDAA
Gene No. :
1-2_240 CYP96E1
1220501..1221994
, 1226247..1226672 (+)
>1-2_240
MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPVVGALPAIVANAGRVHD
WVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANFGNYPKGEEFAAMFDVLGG
GIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGGGLVPLLDGVAASGAAVDLQD
VFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDAEEVLFYRHVAPVPWLRLQSYLK
IGHYKKMAKAREVLDASIAELIALRRERKAADANATGDADLLTAYLACQDEIGMDGAAFD
AFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPGVEARILAELRAHPPSPTGAELKRLVY
LHAALSESLRLYPPVPFEHKAAARPDTLPSGAAVGPTRRVIVSLYSMGRMEAVWGKGCEE
FRPERWLTPAGRFRHERSCKFAAFNVGPRTCLGRDLAFAQMKAVVAAVVPRFRVAAAAAP
PRPKLSIILHMRDGLKVK RRDPVQGGGGHRRGHHHEICRCPQSGSGELDKATAMAADEEE
EVAPNLVFVTIQLPPSSSSSPLKTTQQLDGEGEELIGVQPKEEDRRLEEEEGGGVAADLA
VSRGPSRQACRCTGQESRAVGRGRKQGERRPEGEGICRR
>AP002484b
$F CYP96E1 CDS 80463..81980 43% to 96A1
MELLPWLLGFVVKYPEIMASAACFLLLFCRFRRRSKRIPTNWPV
VGALPAIVANAGRVHDWVTEFLRAAAMSHVVEGPWGSPGDVLITADPANVAHMFTANF
GNYPKGEEFAAMFDVLGGGIFNADGESWSFQRRKAHALLSDARFRAAVAASTSRKLGG
GLVPLLDGVAASGAAVDLQDVFMRLTFDLTAMFVFGVDPGCLAADFPTVPFAAAMDDA
EEVLFYRHVAPVPWLRLQSYLKIGHYKKMAKAREVLDASIAELIALRRERKAADANAT
GDADLLTAYLACQDEIGMDGAAFDAFLRDTTLNLMVAGRDTTSSALTWFFWLLSNHPG
VEARILAELRAHPPSPTGAELKRLVYLHAALSESLRLYPPVPFEHKAAARPDTLPSGA
AVGPTRRVIVSLYSMGRMEAVWGKGCEEFRPERWLTPAGRFRHERSCKFAAFNVGPRT
CLGRDLAFAQMKAVVAAVVPRFRVAAAAAPPRPKLSIILHMRDGLKVK VHRRQED*
Gene No. :
1-2_394 CYP90D2
2035743..2035993
, 2036078..2036405 , 2036438..2036531 , 2036850..2037055 , 2037375..2037623 ,
2038901..2039086 , 2039597..2039718 , 2043035..2043131 (+)
>1-2_394
MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAAAAARLPPGSFGWPVVG
ETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVTADAEVSRFVLQSDARAFV
PWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHLKSQLTADMRRRLSPALSSFP
DSSLLHVQHLAKS LLDEIEWVDELEEEQSGWAWASAGVRAHVRAQMRMERNVIARNGDEM
QMQ VVFEILVRGLIGLEAGEEMQQLKQQFQEFIVGLMSLPIKLPGTRLYRSLQAKKKMAR
LIQRIIREKRARRAAASPPRDAIDVLIGDGSDELTDELISDNMIDLMIPAEDSVPVLITL
AVKFLSECPLALHQLE VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCVFVYFRSVH
LDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIFLHHLVTSFRW
VAEEDHIVNFPTVRLKRGMPIRVTAKEDDD
>AP003244
$F CYP90D2 59% to 90D1
CDS
join(30874..31124,31209..31536,32037..32186,32506..32754,
33832..33921,34032..34217,34728..34849,38166..38262)
AQ157843
64% identical to AQ290163 75% TO 90C1 AT HEME BINDING REGION
C97894
Rice callus Oryza sativa cDNA clone C0085_11A, mRNA sequence
extreme
C-term 71% to CYP90C1 opp end = C97895 (probably 3 prime untranslated)
MVSAAAGWAAPAFAVAAVVIWVVLCSELLRRRRRGAGSGKGDAA
AAARLPPGSFGWPVVGETLEFVSCAYSPRPEAFVDKRRKLHGSAVFRSHLFGSATVVT
ADAEVSRFVLQSDARAFVPWYPRSLTELMGKSSILLINGALQRRVHGLVGAFFKSSHL
KSQLTADMRRRLSPALSSFPDSSLLHVQHLAKS VVFEILVRGLIGLEAGEEMQQLKQQ
FQEFIVGLMSLPIKLPGTRLYRSLQAKKKMARLIQRIIREKRARRAAASPPRDAIDVL
IGDGSDELTDELISDNMIDLMIPAEDSVPVLITLAVKFLSECPLALHQLE EENIQLKR
RKTDMGETLQWTDYMSLSFTQH
VITETLRLGNIIGGIMRKAVRDVEVKGHLIPKGWCV
FVYFRSVHLDDTLYDEPYKFNPWRWKEKDMSNGSFTPFGGGQRLCPGLDLARLEASIF
LHHLVTSFRWVAEEDHIVNFPTVRLKRGMPIRVTAKEDDD
Gene No. :
12_245 CYP94E2 100%
agreement
1349356..1350990
(-)
>12_245
MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLD
WATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLG
RGLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQ
DVLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAM
RLANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGG
ELAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSR
HADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAV
RAGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRACLGR
EMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKREDDAAQ
QKLT
>AP003735.2b
$F CYP94E2 genomic
DNA, chromosome 1, BAC clone:B1147A04, complete
61% to
AP003735 4872-6534
8263
MDGTLAPLLLLLLLFLPALLLYLRRRPAAASRINNNHCPHPNPVLGNALPFLRNRHRFLDW 8445
8446
ATDLLAAAPTSTIEVRGALGLGSGVATANPAVVDHFLRASFPNYVKGARFAVPFEDLLGR 8625
8626
GLFAADGRLWALQRKLASYSFSSRSLRRFSARVLRAHLHRRLVPLLDAAAGSGEAVDLQD 8805
8806
VLGRFGFDNICNVAFGVESSTLLEGGDRRHEAFFAAFDAAVEISVARVFHPTTLVWRAMR 8985
8986
LANVGSERRMRDAIRVIDEYVMAIVASEERLRLRRGEDEREHEQHLLSRFAASMEEEGGE 9165
9166
LAAMFGSPGAKRRFLRDVVVSFVMAGKDSTSSALTWLFWLLAANPRCERRVHEEVSSSRH 9345
9346
ADPRRADAGEDGHGDGYDELRRMHYLHAAISEAMRLYPPVPIDSRVAVAADALPDGTAVR 9525
9526
AGWFADYSAYAMGRMPQLWGEDCREFRPERWLSDGGEFVAVDAARYPVFHAGPRAC 9693
9694
LGREMAYVQMKAVAAAVIRRFAVEPVQAPASMETPPACEVTTTLKMKGGLLVRIRKR 9864
EDDAAQQKLT* 9897
Gene No. :
12_246 CYP94E1 100%
agreement
1352719..1354368
(-)
>12_246
MDSSYLHLVLPAAAAAVVVAVVLLLSLWRRCQTTSNHRPQANPILGNLVAFLANGHRFLD
WSTGLLAAAPASTMQVHGPLGLGYCGVATASPDAVEHMLRASFHNYVDKGDRVRDAFADL
LGDGLFLANGRLWRLQRKLAASSFSPRLLRLFAGRVVLDQLRRRLLLFFDAAADARRVFD
LQDVLKRFAFDNICSVAFGVDRDDSSPSSSSPSRLEAGGDGRDDAFFAAFDDAIDISFGR
ILHPTTLAWKAMKLLDVGSERRLRQAIGVVDEYVTAIMESKQRCSDSEEESDLLSRFTAA
MMEEDGGNELGAMFDSPEAKRRFLRDTVKTFVLAGKDTTSSALTWLFWFLAANPECERRV
YEEVTALRGDTAGDERDDGYEELKRMHYLHAAITETMRLYPPVPLASRVAAADDVLPDGT
VVRAGWFADYSSYAMGRMPQLWERDCGEFRPERWLDGGGGGGGRFVAVDAARYPVFHAGP
RSCLGKEMAYVQMKAVAAAVVRRFSVEVVPAAAANAPPSPPPHETAVTLRMKGGLRVLLT
RRRGVLSHA
Gene No. :
12_317 CYP71AA2 100%
agreement
1719193..1720104
, 1720260..1720934 (+)
>12_317
MAGIMDSTTASYYTTLLCGALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHC
LLGSLPHHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTAS
IDIVFAPFGKHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVS
ELVKIMTNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSL
RTTKRVHEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVS
AVLFDLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHY
LQLVIKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNE
FRPERFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETD
THELDMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE
this
pseudogene not annotated
>AP004326.2d
$P CYP71AA1P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 4 pseudogene
81031
DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift
81213
QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift
81304
KNTAIFVNTWALGR 81345 frameshift
81344
KIKNTGLMQVSSGLKY
81393
SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566
81567
LDMTEANGITTHRRIDIWLEATPFVPR 81647
this gene
not annotated
>AP004326.2b
$F CYP71AA3 genomic
DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 2 no
good matches in NR 79% to AP004326.2c
71860
MAGIVDTAAFCT
71896
LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069
72070
YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243
72244
SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408
72409
MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588
72589
VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747
72748
RFHRDGGLGITLTKEIVSAVLF 72813 (0)
73327
DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497
73498
QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677
73678
RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857
73858
CKLDMRETHGVTARRRTELLLKATPLYT* 73944
this
pseudogene not annotated
>AP004326.2a
$P CYP71AA4P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Length =
102983
4 genes
71B like
Gene 1
pseudogene 71 family
67989
LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion
68060
RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227
68228
GARP 68239 frameshift with small deletion
68238
RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396
68397
DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)
68886
GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion
69023
YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202
69203
F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382
69383
RMELDMTESAGLTASRLTDLFG* 69451
Gene No. :
1-3_149 CYP710A5 this
seq has long N-term
677956..679547
, 683244..683265 (-)
>1-3_149
MAIPGSKEHKCNCLSRSSRAFSTPRT
MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALY
MLIEQLSYHRKKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFG
RFTVFIRDSELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNF
TPRALSTYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTE
KARERFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEP
ECLLDYLMQETVREIDEATAAGL
PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL
DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIAVE
AFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLAFGAG
PHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYLKQR
>AP002092a
$F CYP710A5 CDS
complement(54602..56137) 60% TO 710A1
THIS SEQ IS THE SAME AS AP002093a CDS complement(98133..99668)
MRTSTDPSGSIESFHGLVHLRTAAPLLAAAVALYMLI
EQLSYHR
KKGSMPGAPLVVPFLGSAAHLIRDPVGFWDVQAALARKSGAGLAADFLFGRFTVFIRD
SELSHRVFANVRADAFHVVSHPFGKKLFGEHNLVYLVGEEHKDLRRRIAPNFTPRALS
TYAVIQQRVIISHLRRWLDRSASNGGKAEPIRVPCRDMNLETSQTVFVGPYLTEKARE
RFDRDYNLFNVGFITLPVDLPGFAFRRARLAGARLMHTLGDCARQSRQRMLGGGEPEC
LLDYLMQETVREI
DEATAAGL this may be too long vs arab. Check for
intron
PPPPHTSDVEVGALLFGFLFAAQDASTSSLCWAVSAL
DSHPNVLARVRAEVAALWSPESGEPITAEMMSAMKYTQAVAREVVRYHPPATLVPHIA
VEAFQLTAQYTIPKGTMVFPSVYESSFQGFQDADAFDPDRFFSEARREDVVYKRNFLA
FGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTEGCDVPVYMPTMVPRDGCVVYL
KQR*
Gene No. :
1-3_150 CYP710A6 100%
agreement
684165..685703
(-)
>1-3_150
MVESFHGLVVVDLRTAAPLLATAVALYILIEQLSYHRKKGSMPGPPLVVVPFLGSVTHLF
RDPVGFWDLQATRASKSGAGLTADFLFGRLMVFIRDSELSRRVFANVRADAFHLVGHPFG
KKLFGDHNLIYMVGKEHKDLRRRIAPNFTPRALSTYAVIQQRVILSHLRRWIDRSVANGG
KAEPIRVPCRDMNLETSQTVFVGPYLTVETRERFDRDYNLFNHGFITLPIDLPGSAFRRA
RLAVPRLKHILEDCARQSKQRMRGGGEPECLVDYLMQETVREIDEAAAAGLPPPPHTSDM
ETGNLLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVAALWSPESGEPITAEMMT
EMKYTQAVAREVVRYWPPGPVVPHIAGEAFQLTEQYTIPKGTIVFPSVYESSFQGFPDAG
TFDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALLASLIDFRRERT
EGCDVPVYMPTIVPRDGCVVHLKQRCAKLPSF
Gene No. :
1-3_155 CYP710A7 100%
agreement
704151..705665
(-)
>1-3_155
MVDSLLYGLLDLRMAAPLLAAAVALYVLVEQLSYHRKKGSLPGPPLVVPFIGSATHMIRD
PTGFWEMQAARARKSGVGFTADFLAGKFTIFIRDSELSNRVFANVRPDAFFVIGHPFGKK
LFGDHNLIYLFGDDHKDLRRRMATNFTPRALSTYAAIQQRGIVSHLRRWLDRSAANGGKA
EPIRVPCRDMNLETSQTVFAGPYLTEEARERFKSDYNLFNVGLLAFPVDLPGLAFRRARQ
AVARLVRMLRDCARESKARMRAGGEPECLVDYWMQETVREIDEAKAAGLPPPAHISDDEE
IGGFLFDFLFAAQDASTSSLCWAVSALDSHPDVLARVRAEVASLWSPDSGEPITADKIAE
MKYTKAVAREVVRHRPPATLMPHIALQNFQLTESYTIPKGTLVLPSMYESSFQGFHDPDA
FDPERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVIFMALFVSLVDFRRERTE
GCDVPVYMPTMVPRDGCVVYLKQR
Gene No. :
1-3_159 CYP710A8 100% agreement
723800..725326
(-)
>1-3_159
MAAVVDFLDLRAAAPFVVAALAFYFLVEQLSYHRKKGPLPGPPLVVPFVGSVAHMIRDPT
GFWDAQAARARKSGAGLAADFLIGRFVVFIRDSELSHRVFANVRPDAFHLIGHPFGKKLF
GDHNLIYMFGEDHKDLRRRIAPNFTPRALSTYAAIQQRVILSHLRRWLDRSAANGGKAEP
IRVPCRDMNLETSQTVFAGPYLTKEAREKFERDYNFFNVGLMALPVDLPGFAFRSARLGV
ARLVRTLGECARASKARMRAGGEPECLVDFWMQETVREIDEAKAAGKPPPAHTDDEELGG
FLFDFLFAAQDASTSSLCWAVSALDSHPDVLAGVRAEVASLWSPESGEPITAEKIAEMKY
TQAVAREVVRHRPPATLVPHIAGEEFQLTEWYTIPKGTIVFPSVYESSFQGFPEPDTFDP
ERFFSEARREDVVYKRNFLAFGAGPHQCVGQRYALNHLVLFMALFVSVVDFRRDRTEGCD
EPVYMPTIVPRDSCTVYLKQRCAKFPSF
Gene No. :
1-3_343 CYP71T1 100%
agreement
1694984..1695988
, 1696589..1697260 (+)
>1-3_343
MELSSSLAAVLHSPLFLLAALLLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGH
LPLLGSLPHRKLRSMAEAHGPVMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRM
AERLIYGRDMVFAPYGEFWRQARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGV
RGGGETVNLSDLLMSYANGVISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGE
FVPWLAWVDKLMGLDAKAARISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDH
RDFVDVLLDVSEVEEGAGAGEVLLFDTVAIKAIILDMIAAATDTTFTTLEWAMAELINHP
PVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELRLLRAVVKETLRLHAPVPLLVPRETVE
DTELLGYRVPARTRVIINVWAIGRDAAAWGDRAEEFVPERWLDGGGEEVEYAAQLGQDFR
FVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDWELPPHADGAAAATAARLDMGELFGLS
MRMKTTLNLVAKPWSSDV
Gene No. :
1-3_344 CYP71T2 exon 1
only + C-term extension
1699829..1700905
(+)
>1-3_344
MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLPLPPSPPGVPLLGH
LPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRTRDLAFASRPRVRM
SERLFYGRDMAFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQEVAALLDRVRRRCGG
GGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFADFEGLLGTMTVGEF
VPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQAVGDGEADADHRDFVD
VMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL VRTPLVVVLLTCRSADATVDYFLWNQT
Gene No. :
1-3_345 CYP71T2 exon 2
only + N-term extension
1702184..1702900
(+)
>1-3_345
MNERFIEQ DMMAAGTDSSFTTTEWVMAELINHPRVMRKLQDEIRAVVGTSSASAAAAATG
GGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVEDTELLGYRIPARTRVIINVWA
IGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDSRFVPFGAGRRGCPGAGFAALS
VELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLSVRLKADLNLVAKPWSPGAS
>AP003434.1b
$F CYP71T2 chromosome
1, PAC clone:P0452F10, complete = AA754300
AA754300 42% IDENTICAL
TO 71A14 1/98 I-HELIX 43% to
703A2
39698
MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP
39839
LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018
40019
RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195
40196
VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375
40376
DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552
40553
VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)
42074
DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169
42170
QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349
42350
DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529
42530
RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709
42710
VRLKADLNLVAKPWSPGAS* 42769
Gene No. :
1-3_346 CYP71T3 100%
agreement
1708142..1709101
, 1711273..1711902 (+)
>1-3_346
MAVSLVVVVVVVIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHL
LGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAER
LLYGGRDVAFAPYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVD
LVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLG
WVDALNGMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVN
ETDMDAGVQLGTIEIKAIILDMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVG
ITSHITEDHLDRLPYLKAVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAW
TIGRDQATWGEHAEEFIPERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALA
SLLYNFDWETRVVDRRSEFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP
Gene No. :
1-3_348 CYP71T4
extension of exon 2 not correct
1718250..1719233
, 1719660..1720331 (+)
>1-3_348
MAVSLLPAVLVLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLP
LLGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRP
RMAMAELLLYGGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVR
AAAADVVVDLSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEP
MGELLPWFWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDF
VDVLLDVNETDKDAGIQLGTVEIKAIIM LICFLLHGHEQ DMFVGGSDTTTTMMAWTMAEL
INHPRAMRKAQNEIWAVVGNTSHVTKDHVDKLPYLKAVFKETLRLHPPLPLLIPREPPAD
TQILGYTIPAHTRVVINAWAIGRDAAAWGQQPDEFSPEKFLNSTIDYKGQDFELLPFGAG
RRGCPGIVFGVSAMEIALASLLYHFDWEAAATDHRRRGSQAWALPVDMSEVNGIAVHLKY
GLHVVAKPRMP
>AP003434.1d
$F CYP71T4 chromosome
1, PAC clone:P0452F10, complete like 71A
58119
MAVSLLPAVL
58149
VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328
58329
LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508
58509
GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685
58686
LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859
58860
FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036
59037
VNETDKDAGIQLGTVEIKAIIM 59102 (0)
59562
DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717
59718
LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897
59898
PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077
60078
TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200
Gene No. :
3-2_172 CYP709D1
missing end of exon 4
907473..907752
, 907912..908135 , 908724..908977 , 909301..909613 ,
910180..910611
(+)
>3-2_172
MDVPSVVIPILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRM
KAAAAADEVAAGAHSHDFIPIVLPQHSKWATDHGKTFLYWLGAVPAVSLGRVEQVKQVLL
ERTGSFTKNYMNANLEALLGKGLILANGEDWERHRKVVHPAFNHDKLKFMSVVMAESVES
MVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKEVYQAQKELQELAF
SSSLDVPALVFLRGNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARA
LEREGNGLVLTTQEIIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLTV
NM
VLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEFNPA
RFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVHAPM
EAITLRPRFGLPVVLRNLQG
>AP003258.2
$F CYP709D1 genomic
DNA, chr 1, PAC clone:P0463A02, complete 46% to 709B2
N-term
runs off end of contig identical to AP003764.2 (has N-term)
MLKSTIELYIFTTAIAKKSLHSQTKHKSKMDVPSVVIP
151039
ILVVLVSRLLTSALVHLLWKPYAITKLFRGQGITGPKYRLFVGSLPEIKRMKAAAAADEVA 150857
150856 AGAHSHDFIPIVLPQHSKWATDHG (1)
KTFLYWLGAVPAVSLGRVEQVKQVLLERTGSFTKNYMNANLEA
150497
150496 LLGKGLILANGEDWERHRKVVHPAFNHDKLK
150404 (0)
149815
FMSVVMAESVESMVQRWQSQIQQAGNNQVELDLSRELSELTSDVITRSAFGSSHEEGKE 149639
149638 VYQAQKELQELAFSSSLDVPALVFLR
149561 (2)
149237
GNTRAHQLVKKSRTMLMEIIEGRLAKVEAAEAGYGSDLLGLMLEARALEREGNGLVLTTQE 149055
149054
IIDECKTFFFAGQDTTSNHLVWTMFLLSSNAQWQDKLREEVLT VCGDAIPTPDMANRLKL 148875 (0)
148359
VNMVLLESLRLYSPVVIIRRIAGSDIDLGNLKIPKGTVLSIPIAKIHRDRDVWGPDADEF
148180
148179
NPARFKNGVSRAASYPNALLSFSQGPRGCIGQTFAMLESQIAIAMILQRFEFRLSPSYVH 148000
147999 APMEAITLRPRFGLPVVLRNLQG* 147928
Gene No. :
4_108 CYP71K1 100%
agreement
580030..580668
, 580739..581656 (-)
>4_108
MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWALPVIGHLHHVAGALPHR
AMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAFATRPITPTGKVLMADSVG
VVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAVAALTTPGAT
AAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLERRMKLLPAQCLPDLFPSSRAAML
VSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAEEDLLDVLLRIQSQDKTNPALTNDN
IKTVIIDMFVASSETAATSLQWTMSELMRNPRVMRKAQDEVRRALAIAGQDGVTEESLRD
LPYLHLVIKESLRLHPPVTMLLPRECRETCRVMGFDVPEGVMVLVNAWAIGRDPAHWDSP
EEFAPERFEGVGAADFKGTDFEYIPFGAGRRMCPGMAFGLANMELALAALLYHFDWELPG
GMLPGELDMTEALGLTTRRCSDLLLVPALRVPLRDHER
Gene No. :
8-1_133 CYP76M14 100%
agreement
696859..698433
(-)
>8-1_133
MEKSSELWLLWAVFSASLVFLYLTIRRRSGAGAGGKPPLPPGPTPLPLIGNLLDLRGGVI
HDKLAALARVYGPVMMIKLGLNDAVIISSRDAAREAFTRYDRHLAARAIPDTFRANGFHE
RSAVFLPSSDERWKALRGIQGTHIFTPRGLAAVRPVRERKVRDIIAYFRDHAGEELVIRQ
AIHTGVLNLVSSSFFSMDIAGMGSETARELREHVDEIMTVFAQPNVSDYFPFLRRLDLQG
LRRSTKRRFDRIFSILDDIVERRLVDRGERGGEGGASSNSSKSKHQYDGGDFLDALLELM
VTGKMERDDVTAMLFEAFVAGGDTVAFTLEWVMADLLRNPPVMAKLRAELDDVLGGKDQS
AIEEHDAGRLPYLQAVLKESMRLHSVGPLLHHFAAEDGVVVGGYAVPRGATVLFNTRAIM
RDPAAWERPEEFAPERFLAREGKAPVDFRGKEADFIPFGSGRRLCPGIPLAERVMPYILA
LMLREFEWRLPDGVSPEELDVSEKFMSVNVLAVPLKAVPVKVIN
Gene No. :
8-1_562 CYP72A31P
pseudogene, not intact
3057089..3057514
, 3057625..3058009 , 3058183..3058256 (-)
>8-1_562
MLYTPYHKEMYMSVLLTSHGSNLPM
SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEG
ESTKDDLLGILLESNTKHMEENGQSSQGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLL
SIHPEWQDHAREEIMGLFRKNKPDYEGLSRLKIVTMIFYEVLRLHPPFIEIGWKTYKEME
IGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEFKPERFSEGISKASKDPGAFLPFGWGPR
ICIGQNFALLESKMALCLILQRLEFELAPSYTHAPHTMVTLHPMHGAQMKVRAI
>AP003278
$P CYP72A31P chromosome 1, PAC clone:P0518F01, similar to 72A22 missing N-term
half
AP003330.1
chromosome 1 clone B1085F01 CYP72A like
Pseudogene,
no N-term in 9000bp upstream until next p450 ends near 22400
31539
SLPIENNRKMHQINKEIESILRGLIGKRMQAMKEGESTKDDLLGILLESNTKHMEENGQSS
31718
31719
QGLTIKDIVEECKLFYFAGAETTSVLLTWTMLLLSIHPEWQDHAREEIMGLFRKNKPDYE 31898
31899 GLSRLKI
32030
VTMIFYEVLRLHPPFIEIGWKTYKEMEIGGVTYPAGVSIKIPVLFIHHDPDSWGSDVHEF 32209
32210
KPERFSEGISKASKDPGAFLPFGWGPRICIGQNFALLESKMALCLILQRLEFELAPSYTH 32389
32390 APHTMVTLHPMHGAQMKVRAI 32452 or frameshift
after KVR to
SYMIISDYSVFYYYNSWL* (compare with end of 72A33)
Gene No. :
8-1_564 CYP72A32 end of
gene is incorrect, missing heme signature
3066867..3066964
, 3067301..3067529 , 3067638..3068022 , 3068485..3068729 , 3069130..3069681 (-)
>8-1_564
MVLGGWLLMWAPASSPTILVAFGLLFGLVLAWQAGLQLHRLWWRPRRLEKALRARGLRGS
SYRFLTGDLAEESRRRKEAWARPLPLRCHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGP
TPEVHVTDPELAKVVMSNKFGHFEKIRFQALSKLLPQGLSYHEGEKWAKHRRILNPAFQL
EKLKLMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR
RIFELQGELFERVMKSVEKIFIPGYMYLPTENNRKMHQINKEIESILRSMIGKRMQAMKE
GESTKDDLLGILLESNMRHTEENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILL
LSMHPEWQDRARKEILGLFGKNKPEYDGLNNLKIVTMILYEVLRLYPPFIELKRRTYKEM
KIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGISKASKDP VYGVIDFCDT
FDRLSYPLRFLMYDMVNFLQCV
>AP003278a
$F CYP72A32
19863-22437 chromosome 1, PAC clone:P0518F01, similar to 72A22
AP003330.1
50023-47446 chromosome 1 clone B1085F01, CYP72A like 536aa
AP004738.1
Oryza sativa chromosome 6 clone OSJNBa0090D06 chrom. conflict
50023 MVLGGWLLMWAPASSPTILVAFGLLFG
49942 LVLAWQ
AGLQLHRLWWRPRRLEKALRARGLRGSSYRFLTGDLAEESRRRKEAWARPLPLR 49763
49762
CHDIAPRIEPFLHDAVVRPEQHYGKPCITWLGPTPEVHVTDPELAKVVMSNKFGHFEKIR 49583
49582
FQALSKLLPQGLSYHEGEKWAKHRRILNPAFQLEKLK 49472 (0)
49071
LMLPVFSACCEELISRWMGAIGSDGSYEVDCWPELKSLTGDVISRTAFGSSYLEGR 48904
48903 RIFELQGELFERVMKSVEKIFIPGYM 48826
(2)
48363
YLPTENNRKMHQINKEIESILRSMIGKRMQAMKEGESTKDDLLGILLESNMRHT 48202
48201
EENSQSSQGLTIKDIMEECKLFYFAGADTTSVLLTWTILLLSMHPEWQDRARKEILGLFG 48022
48021 KNKPEYDGLNNLKI (0)
VTMILYEVLR 47842
47841
LYPPFIELKRRTYKEMKIGGVTYPAGVIINLPVLFIHHDLKIWGSDVHEFKPERFSEGIS 47662
47661 KASKDP
GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELAPTYTHAPHTMITLHP 47482
47481
MHGAQIKIRAI* 47446
Gene No. :
8-1_565 CYP72A33 end is
wrong and two exon boundaries disagree
3071048..3071076
, 3075238..3075367 , 3075707..3075777 , 3076421..3076649 , 3076758..3077142 ,
3077951..3078246 , 3078684..3079205 (-)
>8-1_565
MWAPASSPTILAAFGLVGLVLAWQ
AGLQLHRLWWRPRRLEKALRARGLRGSRYRFLTGDL
AEEGRRRKEAWARPLPLRCHDIAPRVEPFLHGAVGVGAAHGKPRITWFGPTPEVHVADPE
LARVVLSNKFGHFEKVSFPELSKLIPQGLSAHEGEKWAKHRRILNPVFQLEKLK SILFLY
LIIEMSSENVQ
LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFG
SSYLEGRRIFELQGELFERVIKSIQKMFIPG YM YLPTENNRKMHQMNKEIESILRGMIGK
RMQAMKEGESTKDDLLGILLESNTRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVL
LTWTMLLLSMHPEWQDRAREEILGLFGKNKPDYDGLSRLKIVTMILYEVLRLYPPFIELT
RKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPERFSEGISKASKDP VEV
PRRMEIHRSFDLPRDNIQMSQNRSPCGAPPSSSYRQIRYPEQLADEPIRLSPPDGNGDIL
DLRGIKLEHFASS
>AP003278b
$F CYP72A33 chromosome
1, PAC clone:P0518F01, 82% to 72A22
AP003330.1
59493-56536 chromosome 1 clone B1085F01, CYP72A like 516aa
N-term
does not match in both, 3278 has MVLGGGWLSMWAPASSPTILAAFGLVGLVLAWQ
before the
AGLQ seq.
59493
MVLEGK AGLQLHRLWWRPRRLEKALRARGLRGSRYRFL
TGDLAEEGRRRKEAWARPLPLRCHDIAPRVEP
59284
59283
FLHGAVGVGAAHGKPRITWFGPTPEVHVADPELARVVLSNKFGHFEKVSFPELSKLIPQG 59104
59103 LSAHEGEKWAKHRRILNPVFQLEKLK 59026 (0)
58537
LMLPVFSACCEELISRWMGSIGSDGSYEVDCWPEFKSLTGDVISRTAFGSSYLEG 58373
58372 RRIFELQGELFERVIKSIQKMFIPG 58298
(2)
57483
YLPTENNRKMHQMNKEIESILRGMIGKRMQAMKEGESTKDDLLGILLESN 57334
57333 TRHMEVNGQSNQGLTIKDIMEECKLFYFAGADTTSVLLTWTMLLLSMHPEWQDRAREEIL
57154
57153 GLFGKNKPDYDGLSRLKI (0) VTMIL
56977
56976
YEVLRLYPPFIELTRKTYKEMEIGGITYPAGVIINLPVMFIHHDPEIWGSDVHEFKPER 56800
56799 FSEGISKASKDP
GAFLPFGWGPRICIGQNFALLEAKMALCLILQRLEFELATSYTHVPHT 56620
56619
IISLHPMHGAQIKVKSYMTISDYSVFY* 56536
Gene No. :
8-2_301 CYP72A17 N-term extension, C-term is wrong,
missing
middle exons 2 and 3
1594648..1594915
, 1598386..1598464 , 1598535..1598717 , 1601636..1601742 , 1601972..1602084 ,
1602475..1602841 , 1604468..1604841 , 1604937..1605347 , 1605464..1605643 (+)
>8-2_301
MEPSTRTRRLQPNRAPLGRYGEGGSRRIRRRRGDQNRILEAIPRWRGARGFVQQQQHQEK
GGGFGGEERRRQERRRQQEQKGNSTSSMGFLSTCAYGYLGRVDLQNSVHQSTCTVSTTSS
ASSHICFLYINLRIMSISIENDVNDNCDRNSNGGNGNGSIGNDNINTTTSESRAFCGFSF
LRPYRAVRYLRDLQPYILSSIQSASRVPPPEAPPSLPACSFGRSRPPPSLISNLSTVDHA
DAGDASPAYKREKEA
MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAA
QMLEWAWLAPRRMERALRAQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPR
VAPLLHRALEEH ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMD
YYSDEDGKSSKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQ
VFGRNKPDINGVSRLKV VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPV
LFIHRDAAAWGHDAGEFDPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKV
ALGMILQRFAFELSPAYAHAPYTVLTLHPQHGVP NTFADKHWKLHVPGIRHSEISISDMK
AEHYLLTTAAAVPCIVEVYEYPYSISRVFSWSS
>CYP72A17
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 36553-39431
AG025591.1
strain ND3008 PCR from rice genomic DNA clone T8121T.Length = 401
AG025107.1
strain NC2542 PCR from rice genomic DNA clone T5184T.Length = 504
AU071192
very similar to AQ050520 = 72A17
AP002744
CYP72A17 join(109468..109819,110022..110245,110529..110781,
111446..111819,111915..112346)
MGIGIGIGIGIGIGTGTGAALPFGEASPWSLLGGAVAALLLVWAAQMLEWAWLAPRRMERALR
AQGLRGTQYRFLHGDLTEDLRLVTAARSKPVPMDRPHDFIPRVAPLLHRALEEH (phase 1 intron)
GRVSFTWFGPMPRVTITDPDLVREVLSNKFGHF
EKTKLATRLSKLLVGGLVILHGEKWVKHRRIMNPAFHAEKLK
(phase 0 intron)
RMLPAFSASCSELIGRWENAVAASVGKAELDIWPDFQNLSGDVISRAAFGVRHHEGRQ
IFLLQAEQAERLVQSFRSNYIPGLS
(phase 2 intron)
LLPT
ENNRRMKAIDREIKSILRGIIEKRQKATKNGEASKDDLLGLLLQSNMDYYSDEDGKS
SKGMTVEEIIDECKLFYFAGMETTAVLLTWTMVALSMHPEWQDRAREEILQVFGRNKPDI
NGVSRLKV (phase 0 intron)
VTMVLHEVLRLYPPVVMMNRRTYKEIELGGVRYPAGVMLSLPVLFIHRDAAAWGHDAGEF
DPGRFAEGVARACKDPGAGAFFPFSWGPRICIGQNFALLEAKVALGMILQRFAFELSPAY
AHAPYTVLTLHPQHGVP VRLRRL*
Gene No. :
8-2_302 CYP72A18 C-term
extension is wrong.
1605912..1606000
, 1606047..1606200 , 1606390..1606478 , 1607050..1607241 , 1607596..1607992 ,
1608401..1608779 , 1609428..1609672 , 1609767..1609987 , 1610630..1610930 (-)
>8-2_302
MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQGIRGNRYRLFT
GDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEHGKPSFTWFGPTPRVMISDPE
SIREVMSNKFGHYGKPKPTRLGKLLASGVVSYEGEKWAKHRRILNPAFHHEKIKRMLPVF
SNCCTEMVTRWENSMSIEGMSEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESA
ERIIQAFRTIFIPGYWFLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG
LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLSMHPEWQERAR
EEVLHHFGRTTPDYDSLSRLKIVTMILYEVLRLYPPVVFLTRRTYKEMELGGIKYPAEVT
LMLPILFIHHDPDIWGKDAGEFNPGRFADGISNATKYQTSFFPFGWGPRICIGQNFALLE
AKMAICTILQRFSFELSPSYIHAPFTVITLHPQH VKYITTQSLHSDTSHCENRAGLLGTG
RYVPQDFHQICGSQEQNPFFVIASSLQPTPLGFDRRLKGLVMLIENPVIPLLPNRPGNQG
CDGVESPSVWQAAIVELIGHPTSIEIHQPHEVTTSRRRQRPPNRLVLELHPFQFPQTETD
EGIGLLGEVIQGVDCRPQIEIFEDCLDP
>CYP72A18
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 44993-41630
AU100789.1
Rice callus Oryza sativa cDNA clone C50810.Length = 419 C-term
AU102126.1
Rice callus cDNA clone C10756.Length = 571
AZ130306.1
OSJNBb0103O04r CUGI Rice BAC genomicLength = 320
C26802 36%
TO 72 8/97 N-TERMINAL 19-67 REGION
opposite end = C96903
C96903,
C97406 58% IDENTICAL TO 72 C-TERM 65% to AQ050520
C96799,
C28139 219-340 REGION 55% IDENTICAL TO 72 opposite end = C97406
D22332 48%
TO 72 12/93 7/98 C-HELIX 89-191 REGION
AU081507.1
Rice callus Oryza sativa cDNA clone C12518_12Z.Length = 581
C26235 36%
IDENTICAL TO 72
8/97 AMINO ACIDS 89-216 REGION
AP002744
complement(join(114545..114970,115379..115757,
116406..116650,116745..116965,117608..117908))
D21882 53%
TO 72 5/93 7/98 245-352 REGION = 72A18
MLMMLGAASQWILAAAAAAAVAALLWLAVSTLEWAWWTPRRLERALRAQG
IRGNRYRLFTGDVPENVRLNREARKKPLPLGCHDIIPRVLPMFSKAVEEH
(phase 1 intron)
GKPSFTWFGPTPRVMISDPESIREVMSNKFGHYGKPKPTRLGKLLASGVV
SYEGEKWAKHRRILNPAFHHEKIK (phase 0
intron)
RMLPVFSNCCTEMVTRWENSMSIEGM
SEVDVWPEFQNLTGDVISKTAFGSSYEEGRRIFQLQAESAERIIQAFRTIFIPGYW
(phase 2 intron)
FLPTKNNRRLREIEREVSKLLRGIIGKRERAIKNGETSNGDLLG
LLVESNMRESNGKAELGMTTDEIIEECKLFYFAGMETTSVLLTWTLIVLS
MHPEWQERAREEVLHHFGRTTPDYDSLSRLKI (phase
0 intron)
VTMILYEVLRLYPPVVFL
TRRTYKEMELGGIKYPAEVTLMLPILFIHHDPDIWGKDAGEFNPGRFADG
ISNATKYQTSFFPFGWGPRICIGQNFALLEAKMAICTILQRFSFELSPSY
IHAPFTVITLHPQH GAQIKLKKI*
Gene No. :
8-2_305 CYP72A19 one
exon boundary does not agree
1617645..1618070
, 1618263..1618587 , 1618765..1619009 , 1619127..1619606 (-)
>8-2_305
MVYGLLGLALLWQVHRLLVRLWWQPRRLERALRAQGVRGTSYRFLTGDLKDYGRLSKEAW
ARPLPLRCHDIAPRVAPFVHRTIAEHGKACLSWFGPIPKVTIADAEIAKDVLSNKMGHFE
KLKFPVLSKLLADGVANYEGEKWAKHRRILNPAFHLEKLKLMLPAFSACCEELVGRWAAS
LGSDGSNEIDVWPEMQSLTGDVISRTAFGSSYLEGRRIFQLQAEQQELFMGAIQKISIPG
YMSLPTKNNRRMYQIKNEVESIIRDLVQKRMHAMKDGERTKDDLLGILLESSTRHADENG
HSGPGMTIEEVMEECKVFYFAGMETTAILLTWTMVVLSMHPEWQHRAREEV TMILYEVLR
LYPPGIGFVRQTYKEMEIGGVKYPAGVMIELPLLFIHHDPDIWGSDVNEFKPERFAEGIS
RASNDHGAFFPFGWGPRICMGQNFALLEAKMALCMILQRFEFELAPSYTHAPHIVLMLRP
MHGAPIKLRAI
>CYP72A19
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 comp(53699-51708)
AU100635.1
Rice callus clone C10787.Length = 594
AG024141.1
strain ND3053 PCR from ricegenomic clone ND3053_0_734_1A.Length = 374
AP002744
complement(join(124623..125048,125181..125565,
125743..125987,126105..126584))
this annotation missing first 10 aa
MDPTSVPWSSMVYGLLGLALLWQVHRLLVRLWWQPRRLERALRAQGVRGTSYRFLTGDLKDYGRLS
KEAWARPLPLRCHDIAPRVAPFVHRTIAEHGKACLSWFGPIPKVTIADAEIAKDVLSNKMGHFEKLKFPVLS
KLLADGVANYEGEKWAKHRRILNPAFHLEKLK (phase
0 intron)
LMLPAFSACCEELVGRWAASLGSDGSNEIDVWPEMQSLTGDVISRTAFG
SSYLEGRRIFQLQAEQQELFMGAIQKISIPGYM
(phase 2 intron)
SLPTKNNRRMYQIKNEVESIIRDLVQKR
MHAMKDGERTKDDLLGILLESSTRHADENGHSGPGMTIEEVMEECKVFYFAGMETTAILL
TWTMVVLSMHPEWQHRAREEV LSLFQKNKLDYEGLSKLKT
(intron joint not correct missing one
base?)
VTMILYEVLRLYPPGIGFVRQTYKEMEIGGVKYPAG
VMIELPLLFIHHDPDIWGSDVNEFKPERFAEGISRASNDHGAFFPFGWGPRICMGQNFAL
LEAKMALCMILQRFEFELAPSYTHAPHIVLMLRPMHGAPIKLRAI
Gene No. :
8-2_308 CYP72A20 two
boundaries do not agree
1622500..1622830
, 1623331..1623551 , 1623658..1623905 , 1624026..1624341 , 1624513..1624941 (+)
>8-2_308
MEEATG MEVGVSLEAGKPAAAPWGMLYYGVPALLVLGALYRAAERCWLGPRRVAGALQGQ
GLRGTAYRFPAGDLPENARRSKEARAKPMPPCHDIVPRVAPLLQDIVKEYGNVCITWFGT
TPRVVIAEPELVKDILSNKFGHFEKFTLKSLGKLIALGLASYEGEKWARHRRILNPAFHL
EKLKHMLPAFSTCCSEMIDRWDSKLAGSDGPFELDIWQEFQNLTGDVISRTAFGSSFMEG
RRIFQLQEEQADRIIKTIQYIYIPGYL YFPTENNRRMKENSREIEGLLRGIIEKRSRAVE
NGELSGDDLLGLMLKSNMDSGEPSNLRMSTEDVIEECKLFYFAGMETTSVLLTWTLVVLS
MHPEWQHRAREE VTMILHEVLRLYPPAVTLSRRTFKEIQIGGITYPAGVGLELPIILIHH
NTDVWGKDAHEFKPERFADGISKATKTNQQAFFPFGWGPRICIGQNFAMLEAKMALCVIL
QNFEFQLSPSYTHAPYASVTLHPQHGAQIILTRL
>CYP72A20
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 56581-59004
AP002744
join(129496..129808,130309..130529,130636..130883, 131004..131919)
MEVGVSLEAGKPAAAPWGMLYYGVPALLVLGALYRAAERCWLGPRRVAGALQGQGLRGTAYRFPAGDL
PENARRSKEARAKPMPPCHDIVPRVAPLLQDIVKEY
(PHASE 1 INTRON)
GNVCITWFGTTPRVVIAEPELVKDILSNKFGHFEKFTLKSLGKLIALGLASYEGEKWARH
RRILNPAFHLEKLK (phase 0 intron)
HMLPAFSTCCSEMIDRWDSKLAGSDGPFELDIWQEFQNLTGDVISRTAFGSSFMEGRRI
FQLQEEQADRIIKTIQYIYIPGYL (phase 2 intron)
VSCR
YFPTENNRRMKENSREIEGLLRGIIEKRSRAVENGELSGDDLLGLMLKSNMDSGE
PSNLRMSTEDVIEECKLFYFAGMETTSVLLTWTLVVLSMHPEWQHRAREEV LSAFGRDKP
NFDGLSRLKT (intron joint not correct as in gene 3)
VTMILHEVLRLYPPAVTLSRRTFKEIQIGGITYPAGVGLELPIILIHHNTDVWGKDAHEFKPERFADGISKAT
KTNQQAFFPFGWGPRICIGQNFAMLEAKMALCVILQNFEFQLSPSYTHAPYASVTLHPQH
GAQIILTRL*
Gene No. :
8-2_309 CYP72A21 C-term
missing
1627909..1628439
, 1628545..1628789 , 1628910..1629294 , 1629402..1629592 , 1629673..1629694 (+)
>8-2_309
MVLGAWLMSPASVPWSLLAYGVLGLVLLWQAGRLLHSLWWRPRRLELALRAQGLRGTRYR
FLTGDLGEHGRLNREAWARPLPLRCHDIAPRVAPFLHNAVREHGSACFTWFGPTPKVTIT
DPDLAKGVLSNKFGHFEKPKFPTLTKLFSDSLANHEGEKWVKHRRILNPAFHLEKLKLML
PAFSACCEELVSKWMESLGSDGSYEVDVWPEMQILTGDVISRTAFGSSYLEGRRIFQLQA
EQTERLLKCMQKIVIPGYMSLPTKNNRKMHQIKKETDSILRGLVDKRMQAMKEGECTKDD
LLGLLLESNMRHTEEDGQSNHGLTIEEVIEECKLFYFAGMETTSVLLTWTILLLSMHPEW
QDRAREEILGLFGKNKPEYEGLSRLKIVTMILYEVLRLYPPAVTFTRKTYKQMEIGGVTY
PAGVIVELPVLLIHHDPNIWGSDAHEFKPDR SKLRIA
>CYP72A21
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 61993-63890
AZ045374.1
nbeb0080P16f CUGI Rice BAC genomic Length = 843
AQ857269.1
nbeb0005G04r CUGI Rice BAC genomic Length = 855
AQ865258.1
nbeb0025C03f CUGI Rice BAC genomicLength = 738
AP002744
join(134887..135417,135523..135767,135888..136272,
136380..136805)
this annotation adds first 7 aa
AQ050520,
AQ272173, AQ159375, AU031882 (opposite end = D24685)
AQ575977
nbxb0088K02r
D24685.1
RICR2374A Rice root cDNA clone R2374_1A.Length = 419 = 72A
MVLGAWLMSPASVPWSLLAYGVLGLVLLWQAGRLLHSLWWRPRRLELALRAQGLRGTRYRFLTGD
LGEHGRLNREAWARPLPLRCHDIAPRVAPFLHNAVREHGSACFTWFGPTPKVTITDPDLA
KGVLSNKFGHFEKPKFPTLTKLFSDSLANHEGEKWVKHRRILNPAFHLEKLK
(phase 0 intron)
LMLPAFSACCEELVSKWMESLGSDGSYEVDVWP
EMQILTGDVISRTAFGSSYLEGRRIFQLQAEQTERLLKCMQKIVIPGYM
(phase 2 intron)
SLPTKNNRKMHQIKKETDSILRGLVDKRMQA
MKEGECTKDDLLGLLLESNMRHTEEDGQSNHGLTIEEVIEECKLFYFAGMETTSVLLTWT
ILLLSMHPEWQDRAREEILGLFGKNKPEYEGLSRLKI
(PHASE 0 intron)
VTMILYEVLRLYPPAVTFTRKTYKQMEIGGVTYPAGVIVELPVLLIHHDPNIWGSDAHEF
KPDR
FVEGISKASKNPGAFLPFGWGPRICIGQNFALLEAKMALCMILQCFKLELMPSYTH
APYSMVTLRPMHGAQIKLRAI*
Gene No. :
8-2_311 fusion of CYP72A22
missing C-term and CYP72A23
1632372..1632902
, 1633047..1633300 , 1633441..1633645 , 1633936..1634126 , 1636460..1636722 ,
1638075..1638625 , 1638719..1638963 , 1639103..1639487 , 1639603..1640028 (+)
>8-2_311
MVLGAGLRCPASVPWSSLAYGLLGLVLLWQGGRLLHRLWWRPRRLELALRAQGLRGTRYR
FLTGDLGEHGRLNREAWARPLPLRCHDIAPRVAPFLHSSVREHGKACFSWFGPIPKVTIA
NPDLAKDVLSNKFGHLEKHKFQGLTKLLSDGVASHEGEKWVKHRRILNPAFHLEKLKTLQ
RMLPAFSTCCEELISRWMESLGSEGSYEVDVWPEMQSLTGDVISRTAFGSSYLEGRRIFQ
LQAEQAERLLKCVQKIIIPGYMSLPTKNNRKMHQIKKEIDSILRGLIGKRMQAMREGEST
KDDLLGLLLESNMRHTAEHGQSSQGLTIEE
VTMILYEVLRLYPPAVTLTRQTYKQIEIGG
VTYPAGVIIELPLLLIHSDPDIWGSDVHKFNPER RLKVWAFVSDVRGHWASASPAQSWSR
GWRCPLLPAAATSPAAHLAPAAAGGASLLRDSPAGAEREAPLAAAAAAAGRGVGRVGRRR
GESTA MVFGELFSRASLPPPWSLLAYGLVGPVLLWQAGRLLDRLWWRPRRLERALRAQGL
RGTAYRFLLGDLREFGRLNEEAWSSAPLPLGCHDIVPRVTPFVHRNVRDNGRPCCFSWFG
PIPSVTITDPAQVRDVLSNKLGHFEKPKLPALTKLLADGLTSHDGEKWVKHRRIMNPAFH
LEKLKLMLPAFSTCCEELVGKWMDSLGPDGSCELDVWPEMQSLTGDVISRTAFGSSYSEG
RRIFQLQTEQAELFIGAIQKFVIPGYMYLPTKKNRRMRRINSEVESILRGIIGKRMQAIA
EGESTNDDLLGLLLESNMRHADENGRSSPGMTTEDVIEECKLFYFAGMETTSVLLTWTMV
VLSMHPEWQDRAREEVLGLFGRDKPEYEGLSRLKTVTMVLYEVLRLYPPAIVFSRKTYKE
MEIGGVVYPRGVILELPVLFIHHDREIWGRDVHEFRPERFAEGISRASNDRGAFLPFGWG
PRVCIGQNFALLEAKMALCMILQRFEFELAASYTHAPHTVMTLHPMHGAQMKLRMI
>CYP72A22
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 66435-68424
AP002744
join(139350..139880,140025..140278,140419..140806,
140914..141339)
MVLGAGLRCPASVPWSSLAYGLLGLVLLWQGGRLLHRLWWRPRRLELALRAQGLRGTRYRFLTGDL
GEHGRLNREAWARPLPLRCHDIAPRVAPFLHSSVREHGKACFSWFGPIPKVTIANPDLAKDVLSNK
FGHLEKHKFQGLTKLLSDGVASHEGEKWVKHRRILNPAFHLEKLK
(phase 0 intron)
RMLPAFSTCCEELISRWMESLGSEGSYEVDVWPEMQSL
TGDVISRTAFGSSYLEGRRIFQLQAEQAERLLKCVQKIIIPGYM
(phase 2 intron)
SLPTKNNRKMHQIKKEIDSILRGLIGKRMQAMREGESTKDDLLGLLLESNMRHTAEHGQSS
QGLTIEE VIEECKLFYFAGMETTSVLLTWTMLLLSMHPEWQDHAREEILGLFGKNKPEYE
GLSRLKI
(intron joint not correct)
VTMILYEVLRLYPPAVTLTRQTYKQIEIGGVTYPAGVIIELPLLLIHSDPDIWGSDVHKF
NPER
FAEGISKASKDPGAFLPFSWGPRICIGQNFALLETKMALCMILQHLELELALSYTH
APQSIITLRPTHGAQIKLRAI*
>CYP72A23
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 72149-74091
AP002744
join(145064..145603,145697..145941,146081..146465,
146581..147006)
AU067870
Rice callus Oryza sativa cDNA clone C10320_12Z, CYP72 like Nterm
AU067871
AU067869 very similar to AQ050520 K-helix to heme
MVFGELFSRASLPPPWSLLAYGLVGPV
LLWQAGRLLDRLWWRPRRLERALRAQGLRGTAYRFLLGDLREFGRLNEEAWSSAPLPLGC
HDIVPRVTPFVHRNVRDNGRPCCFSWFGPIPSVTITDPAQVRDVLSNKLGHFEKPKLPAL
TKLLADGLTSHDGEKWVKHRRIMNPAFHLEKLK
(PHASE 0 INTRON)
LMLPAFSTCCEELVGKWMDSLGPDGSCELDVWPEMQSLTGDVISRTAFGSSYSEGR
RIFQLQTEQAELFIGAIQKFVIPGYM (PHASE 2
INTRON)
YLPTKKNRRMRRINSEVESILRGIIGKRMQAIAEGESTNDDLLGLLLESNMRHADENGRS
SPGMTTEDVIEECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLGLFGRDKPEY
EGLSRLKT (PHASE 0 INTRON)
VTMVLYEVLRLYPPAIVFSRKTYKEMEIGGVVYPRGVILELPVLFIHH
DREIWGRDVHEFRPERFAEGISRASNDRGAFLPFGWGPRVCIGQNFALLEAKMALCMILQ
RFEFELAASYTHAPHTVMTLHPMHGAQMKLRMI*
Gene No. :
8-2_319 CYP72A24 C-term
incorrect
1675901..1676207
, 1678114..1678334 , 1678425..1678672 , 1678819..1679209 , 1679296..1679700 ,
1680310..1680444 (+)
>8-2_319
MG MVVFAAGDERPLMLVWAAVAGAVLAWCAVRAMEWAWWRPRRLERALRAQGLRGTPYRS
PAGDAPLNVQLSAEARARTMPLGCHDVVPRAMPLFHQAMKEHGKVSITWFGPVPRVTITK
PELVREVLSNKFGHFEKLKFGRFQRLLHNGLGSHEGEKWAKHRRIINPAFHLEKLKRMLP
AFAACCTELVDKWEGLAKGGDEPYEVDVWPEMQSLTGDVISRAAFGSSYLEGKRIFQLQG
EQIELIVATMNKIHIPGYIHLPTKSNRRMKQIAAEIEGMLKRIIAKRESALKAGEASSDD
DLLGLLLESNLDHSKGNGGAASSGISIDDVIGECKLFYFAGMETTSVLLTWTMVVLSMHP
EWQDRAREEVLHVFGSRAPDYDGLSRLRIVTMVLYEVLRLYTPLTALQRKTYKPMELGGV
RYPAGVVLTLPLLCVHHDKDVWGADADEFRPERFAEGISKASREAPAFFPFGWGPRICIG
QNFALLEAKMGLSMILQRFSFDLSPSYTHAPFPVGLLQPEHGAQ RCLVRRHLHRLMEPVS
HWSPSPSSHLFNGYNEQPSDVPSSPRHL
>CYP72A24
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 109970-113787
AQ864347.1
nbeb0023A03r CUGI Rice BAC genomicLength = 735
MVVFAAGDERPLMLVWAAVAGAVLAWCAVRAMEWAWWRPRRLERALRAQGLRGTPYRSPA
GDAPLNVQLSAEARARTMPLGCHDVVPRAMPLFHQAMKEH
(PHASE 1 INTRON)
GKVSITWFGPVPRVTITKPELVREVLSNKFGHFEKLKFGRFQRLLHNGLGSHEGEKWAKH
RRIINPAFHLEKLK(PHASE 0 INTRON)
RMLPAFAACCTELVDK
WEGLAKGGDEPYEVDVWPEMQSLTGDVISRAAFGSSYLEGKRIFQLQGEQIELIVATMNK
IHIPGYI (PHASE 2 INTRON)
HLPTKSNRRMKQIAAEIEGMLKRIIAKRESALKAGEASSDDDLLGLLLESNLDHSKGNGGA
ASSGISIDDVIGECKLFYFAGMETTSVLLTWTMVVLSMHPEWQDRAREEVLHVFGSRAPD
YDGLSRLRI (PHASE 0 INTRON)
VTMVLYEVLRLYTPLTALQRKTYKPMELGGVRYPAGVVLTLPLLCVHHDKDVWGADADEF
RPERFAEGISKASREAPAFFPFGWGPRICIGQNFALLEAKMGLSMILQRFSFDLSPSYTH
APFPVGLLQPEHGAQ VRLTRLN*
Gene No. :
8-2_320 CYP72A25 one intron joint does not agree
1681409..1681715
, 1681748..1682031 , 1682190..1682427 , 1682557..1682927 , 1683004..1683429 (+)
>8-2_320
MEIVDGASPPLHPWSLLLYALGALAALWWAWRALDRFWLRPRRLGRALRSQGLRGTDYRF
PSGYLKEFARLLAAALAAPMPPLSHDVASRALPFELAAIKQHAAYTHAPANVRDQTEVAW
NAGNVCVTWFGPEVRVIVSDPKLFREILANKNGRFGKQKSILWVQNLLADGLTSHQGEKW
VAHRRIMNHAFHLEKLKRMLPAFAACSSELISRWQDSVGADGAQEIDVWPEFQNLTGDVI
SRSAFGSSFSEGRRIFQLQSEQARNVMKMAKALYFP ELNRRTKANAREVRELLKGIITKR
ESAMKDGHAVNDDLLGLLLETNIKESQEAGSSKPTMTTKDIIEELKLLYFAGSDTTAVLL
TWTMVLLSMHPEWQDRAREEVLRVFGKNSPDFEGINHLKVVTMILHEVLRLYPPILLLGR
EAYEETELGGVTYPPGVTFALPIAGIHHDPDVWGEDVGEFKPERFAEGVSRASKDSPALV
PFSWGPRICVGQNFALLEAKMALSMILQRFSFGLSPSYTHAPFPIPTLQPQHGAQIKLTK
L
>CYP72A25
$F AP002839 Oryza
sativa genomic DNA, chromosome 1 115472-117492
AG021553.1
strain NC0134 PCR from rice genomic DNA clone NC0134_0_102_1A.
Length =
636 AG021553
AG023207.1
strain NC2780 PCR from rice clone NC2780_0_701_1A.Length = 474
MEIVDGASPPLHPWSLLLYALGALAALWWAWRALDRFWLRPRRLGRALRSQGLRGTDYRFPSGYLKEFARLL
AAALAAPMPPLSHDVASRALPFELAAIKQH (PHASE 1
INTRON)
GNVCVTWFGPEVRVIVSDPKLFREILANKNGRFGKQKSILWVQNLLADGLTSHQGEKWVA
HRRIMNHAFHLEKLK (PHASE 0 INTRON)
VQRMLPAFAACSSELISRWQDSVGADGAQEIDVWPEFQNLTGDVISRSAFGSSFSEGRRI
FQLQSEQARNVMKMAKALYFP GYR (PHASE 2 INTRON
FLPT
ELNRRTKANAREVRELLKGIITKRESAMKDGHAVNDDLLGLLLETNIKESQEAGSS
KPTMTTKDIIEELKLLYFAGSDTTAVLLTWTMVLLSMHPEWQDRAREEVLRVFGKNSPDF
EGINHLKV (PHASE 0 INTRON)
VTMILHEVLRLYPPILLLGREAYEETELGGVTYPPGVTFALPIAGIHHDPDVWGEDV
GEFKPERFAEGVSRASKDSPALVPFSWGPRICVGQNFALLEAKMALSMILQRFSFGLSPS
YTHAPFPIPTLQPQHGAQIKLTKL*
Gene No. :
9-2_186 CYP706C2 N-term
extension, one boundary does not agree
947923..948172
, 949734..950614 , 950768..951400 (+)
>9-2_186
MVVWVVQAHCGLMAVTRDSGYGSGDRQVGLARRRSAWAAQRRRHRSEHQAAVQFSTRSSR
HGRVVDSGAAARQWLNGYRRLAAIMQP
MATANLLYAALLVPTVLYLAVTRRRSRRLPPGP
VGLPLVGSLPFIDPNLHTYFASLAAKHGPILSIRLGSKVDIVVNSAQLAREVLRDQDSVF
ANRVMLDAGDAVSFGGAQNIVGNPLGPMWRLLRRVCVQEMMSPAGLASVHGLRRREFRST
LRYLHSKPGEPVDVGAQMFLNTMNVITGTMWGGTIGSESERSAVGSEFRGLVAEVTELLG
TPNVSDLFPVLKPFDLQGIRRKMERLRSRFDLLFTKIIQQRMRSQQDGGEMTTDFLECLL
KMEKEGSDGKTTFTMDN EMVVGGTDTTSNSVEWIMAELLQNPQVLNKVQQELDSIVGRDA
VVEESHLPQLHYLRMVIKETLRLHPPVPLLVPHSPSAAATVGGYHVPEGCRVLINVWAIQ
RNPLVWNKPLDFNPDRFARDGGHKGDFTGSQLDYLPFGSGRRMCAGMAMGEKVMVYSVAM
LLQAFDWKLPQGVQLDLSEKFGIVMKKATPLVAIPTPRLSKPELYYS
>AP003378.1a
$F CYP706C2 chromosome
1 clone P0047E11, 49% to 706A5
no
ortholog in indica on 9/6/02
28850
MATANLLYAALLVPTVLYLAVTRRRSRRLPPGPVGLPLVGSLPFIDPNLHTYFASLAAKH
29029
29030
GPILSIRLGSKVDIVVNSAQLAREVLRDQDSVFANRVMLDAGDAVSFGGAQNIVGNPLGP 29209
29210
MWRLLRRVCVQEMMSPAGLASVHGLRRREFRSTLRYLHSKPGEPVDVGAQMFLNTM 29377
29378
NVITGTMWGGTIGSESERSAVGSEFRGLVAEVTELLGTPNVSDLFPVLKPFDLQGIRRKM 29557
29558
ERLRSRFDLLFTKIIQQRMRSQQDGGEMTTDFLECLLKMEKEGSDGKTTFTMDN VKGFLL 29737 (0)
EMVVGGTDTTSNSVE 29917
29918
WIMAELLQNPQVLNKVQQELDSIVGRDAVVEESHLPQLHYLRMVIKETLRLHPPVPLLVP 30097
30098
HSPSAAATVGGYHVPEGCRVLINVWAIQRNPLVWNKPLDFNPDRFARDGGHKGDFTGSQL 30277
30278
DYLPFGSGRRMCAGMAMGEKVMVYSVAMLLQAFDWKLPQGVQLDLSEKFGIVMKKATPLV 30457
30458 AIPTPRLSKPELYYS* 30505
Gene No. :
9-2_191 CYP711A2 only
finds N-term
979355..979581
, 979692..979844 , 981484..981559 (+)
>9-2_191
MEISTVLGAILAEYAVTLVAMAVGFLVVGYLYEPYWKVRHVPGPVPLPLIGHLHLLAMHG
PDVFSVLTRKYGPIFRFHMGRQPLVMVADAELCKEVGVKKFKNFPNRSMPSPITNSPVHQ
KGLFFTS LALSALIPSSMLVSEDEWMRRNWG
>AP003254a
$F CYP711A2 60% to
711A1 no indica ortholog found on 9/6/02
CDS join(126724..126950,127061..127213,130152..131133,
131374..131706)
MEISTVLGAILAEYAVTLVAMAVGFLVVGYLYEPYWKVRHVPGP
VPLPLIGHLHLLAMHGPDVFSVLTRKYGPIFRFHMGRQPLVMVADAELCKEVGVKKFK
NFPNRSMPSPITNSPVHQKGLFFTS GSRWTTMRN MILSIYQPSHLATLIPSMESCIER
AAENLEGQEEINFSKLSLSFTTDVLGQAAFGTDFGLSKKLASSDDDEDTRKIAADTCA
EAKASSEFIKMHVHATTSLKMDMSGSLSIIVGQLLPFLHEPFRQVLKRLRWTADHEID
RVNLTLGRQLDRIVAERTAAMKRDPAALQQRKDFLSVMLTARESNKSSRELLTPDYIS
ALTYEHLLAGSATTAFTLTTALYLVAKHPEVEEKLLREIDGFGPRDRVPTAEDLQTKF
PYLDQ
VLKEAMRYYPSSPLIARELNQQLEIGGY
PLPKGTWVWMAPGVLGKDPKNFPEPEVFRPERFDPNGEEEKRRHPYALFPFGIGPRAC
IGQKFAIQEMKLSAIHFYRHYVFRPSPSMESPPEFVYSIVSNFKNGAKLQVIKRHI
Gene No. :
9-2_192 CYP711A2 finds
rest of 711A2 with retained intron seq.
982811..983764
, 984005..984337 (+)
>9-2_192
MILSIYQPSHLATLIPSMESCIERAAENLEGQEEINFSKLSLSFTTDVLGQAAFGTDFGL
SKKLASSDDDEDTRKIAADTCAEAKASSEFIKMHVHATTSLKMDMSGSLSIIVGQLLPFL
HEPFRQVLKRLRWTADHEIDRVNLTLGRQLDRIVAERTAAMKRDPAALQQRKDFLSVMLT
ARESNKSSRELLTPDYISALTYEHLLAGSATTAFTLTTALYLVAKHPEVEEKLLREIDGF
GPRDRVPTAEDLQTKFPYLDQ ARQTLPRRVYIVRHEFLTAHRARMQ VLKEAMRYYPSSPL
IARELNQQLEIGGYPLPKGTWVWMAPGVLGKDPKNFPEPEVFRPERFDPNGEEEKRRHPY
ALFPFGIGPRACIGQKFAIQEMKLSAIHFYRHYVFRPSPSMESPPEFVYSIVSNFKNGAK
LQVIKRHI
Gene No. :
9-2_195 CYP711A3 only
finds N-term
994882..995108
, 995221..995373 , 996465..996477 (+)
>9-2_195
MEIISTVLGSTAEYAVTLVAMAVGLLLLGYLYEPYWKVRHVPGPVPLPFIGHLHLLAMHG
PDVFTVLARKYGPVFRFHMGRQPLVMVADAELCKEVGVKKFKSIPNRSMPSAIANSLINQ
KGLCFT SKPK
>AP003254b
$F CYP711A3 82% to seq
on same contig
CDS
join(142251..142477,142590..142742,145800..146613,
146730..146825,146909..147244)
no indica
ortholog found on 9/6/02
MEIISTVLGSTAEYAVTLVAMAVGLLLLGYLYEPYWKVRHVPGP
VPLPFIGHLHLLAMHGPDVFTVLARKYGPVFRFHMGRQPLVMVADAELCKEVGVKKFK
SIPNRSMPSAIANSLINQKGLCFT RGSRWTALRN MIISIYQPSHLASLIPTMQSCIEC
VSKNLDGQEDITFSDLALGFATDVIGQAAFGTDFGLSKISASSNDDDIDKIATDTSAE
AKASSEFIRMHVHATTSLKMDLSGSLSIIIGQLLPFLQEPFRQVLKRIPWTADHEIDH
VNLALGGQMDKIVAERAAAMERDQAAPHAQQRKDFLSVVLAARESNKSWRELLTPDYI
SALTYEHLLAGSATTAFTLSTVLYLVSKHPEVEEKLLREIDGFGPHDHAPTAEDLQTK
FPYLDQ VVKESMRFYFLSPLIA RETCEQVEIGGYALPKGTWVWLAPGVLAKDPKNFPE
PEVFRPERFDPNGEEEKRRHPYAFIPFGIGPRACIGQKFSIQEIKLSVIHLYRNYVFR
HSPSMESPLEFQYSIVCNFKYGVKLR VIKRHTA
Gene No. :
9-2_196 CYP711A3 finds
rest of 711A3 with retained intron seq.
and
incorrect C-term extension and some bad intron boundaries
998459..999299
, 999407..999456 , 999540..999851 , 1000430..1000886 ,
1001963..1002132
(+)
>9-2_196
MIISIYQPSHLASLIPTMQSCIECVSKNLDGQEDITFSDLALGFATDVIGQAAFGTDFGL
SKISASSNDDDIDKIATDTSAEAKASSEFIRMHVHATTSLKMDLSGSLSIIIGQLLPFLQ
EPFRQVLKRIPWTADHEIDHVNLALGGQMDKIVAERAAAMERDQAAPHAQQRKDFLSVVL
AARESNKSWRELLTPDYISALTYEHLLAGSATTAFTLSTVLYLVSKHPEVEEKLLREIDG
FGPHDHAPTAEDLQTKFPYLDQ ACMPFFDIGRIYGLRLWSP RETCEQVEIGGYALPKGTW
VWLAPGVLAKDPKNFPEPEVFRPERFDPNGEEEKRRHPYAFIPFGIGPRACIGQKFSIQE
IKLSVIHLYRNYVFRHSPSMESPLEFQYSIVCNFKYGVKLR LSLRLIVCTSTQTHSPTLI
HLGGDGNERGAVAEPPKQEAEASLTNPIEPETEATRSLPITAAHGKTVESTDCAPPTATI
WTLALALATTGSQPPSALTADASKDSVRHILTIVTLSTPTVAPRALPLMMVTASLHPLST
TSSPPPTSHVGLVVVTALWFVSQHGEDDPNLDAVTSSVYLAKSASTIKGIRLVTVYLEVT
CRIQDFGKD
Gene No. :
9-2_197 CYP711A4
missing exon 4 and C-term stops too soon
compare
end to 711A3
1003560..1003783
, 1004069..1004221 , 1004743..1005550 , 1006236..1006505 (+)
>9-2_197
MDISEVLGATAEWAVTLVAMAVGLLVVAYLYEPYRKVWHVPGPVPLPLIGHLHLLAMHGP
DVFSVLARKHGPVFRFHMGRQPLIIVADAELCKEVGVKKFKSIPNRSMPSPIANSPIHKK
GLFFIRGPRWTSMRNMIISIYQPSHLASLIPTMESCIQRASKNLDGQKEITFSDLSLSLA
TDVIGLAAFGTDFGLSKLPVTPDDSNIDKIAADTSVEAKASSEFIKMHMHATTSLKMDLS
GSLSILVGMLLPFLQEPFRQVLKRIPGMGDYKIDRVNRALKTHMDSIVAEREAAMEHDLA
ASQQRKDFLSVVLTARESNKSSRELLTPDYISALTYEHLLAGSTTTAFTLSTVLYLVAKH
PEVEEKLLKEIDAFGPRYCVPMADDLQTKFPYLDQ GTWVWLAPGVLAKDPKNFPEPEIFR
PERFDPNGEEERRRHPYAFIPFGIGPRVCIGQKFSIQEIKLSMIHLYRHYVFRHSPSMES
PLEF
>AP003378.1b
$F CYP711A4 chromosome
1 clone P0047E11, 64% to 711A one in frame stop
continues
AP003254 contig = AQ859680.1 nbeb0013M03f BAC genomic
AQ860765.1
nbeb0015B04f CUGI Rice BAC genomic
82666
MDISEVLGATAEWAVTLVAMAVGLLVVAYLYEPYRKVWHVPGPVPLPLIGHLHLLAMHG 82842
82843 PDVFSVLARKHGPVFR 82890 (2)
83176
FHMGRQPLIIVADAELCKEVGVKKFKSIPNRSMPSPIANSPIHKKGLFFIR 83322 (2)
83850
GPRWTSMRNMIISIYQPSHLASLIPTMESCIQRASKNLDGQKEITFSDLSLSLAT 84014
84015
DVIGLAAFGTDFGLSKLPVTPDDSNIDKIAADTSVEAKASSEFIKMHMHATTSLKMDLS 84191
84192
GSLSILVGMLLPFLQEPFRQVLKRIPGMGDYKIDRVNRALKTHMDSIVAEREAAMEHDLA 84371
84372
ASQQRKDFLSVVLTARESNKSSRELLTPDYISALTYEHLLAGSTTTAFTLSTVLYLVAKH 84551
84552 PEVEEKLLKEIDAFGPRYCVPMADDLQTKFPYLDQ (0)
VVKESMRFYIMSPLLARETLEQVEIGGYVLPK 84848 (0)
85342
GTWVWLAPGVLAKDPKNFPEPEIFRPERFDPNGEEERRRHPYAFIPFGIGPRVCIGQKF
85518
85519 SIQEIKLSMIHLYRHYVFRHSPSMESPLEF*FAIICDFKYGVKLQAIKRHHA*
85677
Gene No. :
9-2_453 CYP72A35
missing part of an exon
2374696..2375121
, 2376055..2376433 , 2377089..2377252 , 2377612..2377832 , 2378579..2378867 (-)
>9-2_453
MLGEAASPWSLAGAGAAVALLWLCAWTLQWAWWTPRRLERALRAQGLRGTRYRLFIGDVA
ENGRLNREAASRPLPLGSHDVVPRVMPFFCNVLKEHGKLSFVWTGPKPFVIIRDPDLARE
ILSNKSGNFAKQTTAGIAKFVVGGVVTYEGEKWAKHRRILNPAFHQEKIKRMLPVFLACC
TKMITRWVNSMSSEGISELDVWDEFQNLTGDVISRTAFGSSYQEG YLPIENNRRIREIDQ
EIRTILRGIIVKRDKAVRNGEGSNDDLLGLLVESNMRQSNEKEDVGMSIEDMIEECKLFY
AAGSETTSMLLTWTLILLSMHPEWQEQAREEVMHHFGRTTPDHDGLSRLKIVTMILHEVL
RLYPPVVFLQRTTHKEIELGGIKYPEGVNFTLPVLSIHHDPSIWGQDAIKFNPERFANGV
SKATKFQTAFFSFAWGPRICLGQSFAILEAKMALATILQSFSFELSPSYTHAPHTVLTLQ
PQYGSPIKLKKL
>AP002899
$F CYP72A35 52% to
72A14 = AQ161379
complement(join(51630..52055,52989..53367,53942..54186,
54546..54766,55513..55801))
MLGEAASPWSLAGAGAAVALLWLCAWTLQWAWWTPRRLERALRA
QGLRGTRYRLFIGDVAENGRLNREAASRPLPLGSHDVVPRVMPFFCNVLKEHGKLSFV
WTGPKPFVIIRDPDLAREILSNKSGNFAKQTTAGIAKFVVGGVVTYEGEKWAKHRRIL
NPAFHQEKIKRMLPVFLACCTKMITRWVNSMSSEGISELDVWDEFQNLTGDVISRTAF
GSSYQEG WRIFQLQEEQAKRVLKAFQRIFIPGYWY LPIENNRRIREIDQEIRTILRGI
IVKRDKAVRNGEGSNDDLLGLLVESNMRQSNEKEDVGMSIEDMIEECKLFYAAGSETT
SMLLTWTLILLSMHPEWQEQAREEVMHHFGRTTPDHDGLSRLKIVTMILHEVLRLYPP
VVFLQRTTHKEIELGGIKYPEGVNFTLPVLSIHHDPSIWGQDAIKFNPERFANGVSKA
TKFQTAFFSFAWGPRICLGQSFAILEAKMALATILQSFSFELSPSYTHAPHTVLTLQP
QYGSPIKLKKL
Gene No. :
9-4_219 CYP94D13
splices out a stop codon near PERF motif
1139526..1140716
, 1140846..1141055 (+)
>9-4_219
MEFSSSSTSLFLLLSILPLLYFLCQRNDPKKQPHAHGLKSYPVVGIVPHFTKNNDRFLEF
TTEIMKRSPTQTMSFKALGLTGGGVITANPANVEYTLKTNFGNYPKGELAVSMVVDFLGH
GIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLDVQD
VLERFAFDNICQVAFDEDPACLAEDSMASPQSAEFMRAFNDAQIAVRDRFMSPVKSLWRF
KRLFNMEPERRMREALATIHGFAERIVRERRERGKAGLARSDDFLSRFAASGEHSDESLR
DVVTNFLLAGRDTTSSALTWFFWVLSGRPDVEDKIVREIHAVRRASGSTSDATFSFDELR
DMQYLHAAITESMRLYPPVAMDTHSCKEDDFLPDGTF SPFKYPVFHAGPRMCLGKEMADI
QMKSIVASVLERFSLQYAGGEGHPGLVLSVTLRMKGGLPMQVATRG
>AP003232.1h
$F CYP94D13 chromosome
1 clone P0034E02 one in frame stop 53% to 94D2
100994
MEFSSSSTSLFLLLSILPLLYFLCQRNDPKKQPHAHGLKSYPVVGIVPHFTKNNDRF
100824
100823 LEFTTEIMKRSPTQTMSFKALGLTGGGVITANPANVEYTLKTNFGNYPKGELAVSMVVDF
100644
100643
LGHGIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLD 100464
100463
VQDVLERFAFDNICQVAFDEDPACLAEDSMASPQSAEFMRAFNDAQIAVRDRFMSPVKSL 100284
100283
WRFKRLFNMEPERRMREALATIHGFAERIVRERRERGKAGLARSDDFLSRFAASGEHSDE 100104
100103
SLRDVVTNFLLAGRDTTSSALTWFFWVLSGRPDVEDKIVREIHAVRRASGSTSDATFSFD 99924
99923 ELRDMQYLHAAITESMRLYPPVAMDTHSCKEDDFLPDGTF VGKGWLVTY*AYAMARVEDI
99744
99743 WGADCEEFRPERWLDEVGAFRPE SPFKYPVFHAGPRMCLGKEMADIQMKSIVASVLERFS
99564
99563 LQYAGGEGHPGLVLSVTLRMKGGLPMQVATRG* 99465
Gene No. :
9-4_222 fusion of CYP94D12
and CYP94D11
1149455..1150963
, 1154325..1155779 (+)
>9-4_222
MEFSSSSTSLFLLLSILPLLYFLCQRHDPKKQPHAHGLKSYPVVGTLPHFAKNKDRFLEF
ITEIMKRSPTHTLSFKALGLTGGVITANPANVEYTLKTNFGNYPKGELAVSMLVDFLGHG
IFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLDVQDV
LERFAFDNICHVAFDEDPACLAEDSMASPQSAKFMRAFSDAQNAVMDRFMSPVKSRWRFK
RLFNMEPERQMREALATIHGFAERIVRERRERGEAGLARSDDFLSRFAASGDHSDESLRD
VVTNFLIAGRDTTSTALTWFFWLLSGRPDVEDKIVREIHAVRRASGGTGDPTFNLDELRD
MQYLHAAITESMRLYPPVAMDSHSCKEDDFLPDGTFVGKGWFVSYSAYAMARVEDIWGAD
CEEFRPERWLDEAGAFRPESPFKYPVFHAGPRMCLGKEMAYIQMKSIVASVLERFSLRYA
GGEGHPGFVLWLTLRMKGGLPMQ DTKKQPAGSNGLKSYPVVGTLPHFAKNRHRFLEWSTD
VMKRSPTHTMTFKALGLTGGVITANVANVEHILKTNFSNYPKGELSVSLLEDLLGHGIFN
SDGEQWLWQRKAASYEFNQRSLRSFVVDTVRFEVVERLLPLLEWARRDGRTLDVQDVLER
FAFDNICHVVFHEDPACLAEDSMVSSQSAEFIRACSDAQNAIIARFMSPVKSLWRVKRLF
NLDPERRMRDALTTIHGYADRIVRERRARGEAGLARSDDFLSRFAAGGEHSDESLRDVVT
NFLIAGRDSTSSALTWFFWLVSSRPDVEDKIVHEIRAVRSASSSGGTSSATFSFDELRDM
HYLHAAITESMRLYPPVHLDTHSCKEDDFLPDGTFVGKGWLVTYCAYAMGRVEDIWGADC
EEFRPERWLDEAGAFRPDSPFKYPIFHAGPRMCLGKEMAYIQMKSIVACVLEQFSLRYAG
GDGHPGFVLWSTLRMEGGLPMQVTTRE
>AP003232.1g
$F CYP94D12 chromosome
1 clone P0034E02
AZ135305.1
OSJNBb0115G10r CUGI Rice BAC genomic 53% to 94D2
AZ131112.1
OSJNBb0104L22f CUGI Rice BAC genomic
91065
MEFSSSSTSLFLLLSILPLLYFLCQRHDPKKQPHAHGLKSYPVVGTLPHFAKNKDRF
90895
90894
LEFITEIMKRSPTHTLSFKALGLTGGVITANPANVEYTLKTNFGNYPKGELAVSMLVDFL 90715
90714
GHGIFNSDGEQWQWQRKAASYEFNKRSLRNFVVDTVRSEVVERLLPLLERAERDGRTLDV 90535
90534 QDVLERFAFDNICHVAFDEDPACLAEDSMASPQSAKFMRAFSDAQNAVMDRFMSPVKSRW
90355
90354
RFKRLFNMEPERQMREALATIHGFAERIVRERRERGEAGLARSDDFLSRFAASGDHSDES 90175
90174
LRDVVTNFLIAGRDTTSTALTWFFWLLSGRPDVEDKIVREIHAVRRASGGTGDPTFNLDE 89995
89994
LRDMQYLHAAITESMRLYPPVAMDSHSCKEDDFLPDGTFVGKGWFVSYSAYAMARVEDIW 89815
89814
GADCEEFRPERWLDEAGAFRPESPFKYPVFHAGPRMCLGKEMAYIQMKSIVASVLERFSL 89635
89634 RYAGGEGHPGFVLWLTLRMKGGLPMQ VTTRG* 89539
>AP003232.1f
$F CYP94D11 chromosome
1 clone P0034E02
86279
MKFSSTSTPLFILLLPFLPLLYFLYLYQ DTKKQPAGSNGLKSYPVVGTLPHFAKNRHRFLEWST
86088
86087
DVMKRSPTHTMTFKALGLTGGVITANVANVEHILKTNFSNYPKGELSVSLLEDLLGHGIF 85908
85907
NSDGEQWLWQRKAASYEFNQRSLRSFVVDTVRFEVVERLLPLLEWARRDGRTLDVQDVLE 85728
85727
RFAFDNICHVVFHEDPACLAEDSMVSSQSAEFIRACSDAQNAIIARFMSPVKSLWRVKRL 85548
85547 FNLDPERRMRDALTTIHGYADRIVRERRARGEAGLARSDDFLSRFAAGGEHSDESLRDVV
85368
85367
TNFLIAGRDSTSSALTWFFWLVSSRPDVEDKIVHEIRAVRSASSSGGTSSATFSFDELRD 85188
85187
MHYLHAAITESMRLYPPVHLDTHSCKEDDFLPDGTFVGKGWLVTYCAYAMGRVEDIWGAD 85008
85007
CEEFRPERWLDEAGAFRPDSPFKYPIFHAGPRMCLGKEMAYIQMKSIVACVLEQFSLRYA 84828
84827 GGDGHPGFVLWSTLRMEGGLPMQVTTRE* 84741
Gene No. :
9-4_224 CYP94D10 with
C-term extension
putative cyst nematode resistance protein (partial) see NM_192720.1
1159307..1160838
, 1161725..1161756 , 1161873..1162141 , 1162201..1162359 , 1162605..1162652 (+)
>9-4_224
MELSPISASLLLILILLAFLPLLYFLYMHQDPKKKPRIHGLKSYPVVGTLPHIIKNKHRF
LKWSTSIMKCSPTNTMSYKALGLTGGVITANPANVEHILKTNFDNYPKGKLTVSMLEDFL
GHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEIVKRLLPLLEQAGLDGRTLDL
QDVLERFAFDNICLVAFGEDPACLTKERMAAPQSAEFMRAFNDAQNAILARFNSPAKSLW
RVKKLFNMEPERRMREALATIHGFAERIVRERRERGEAGLARGDDFLSRFAASGEHSDES
LRDVVTNFVLAGRDTTSSALTWFFWIVSGRPDVEDRVVREIRAVRASSGSTDATFSFDEL
REKHYLHAAITESMRLYPPVAIDTHSCKEDDFLPDGTFVGKGWLVMYSAYAMGRMEGIWG
ADCEEYRPERWLDEAGAFRPESTFKYPVFNAGPRICIGKEMAYIQMKSIVACVLEKFSLR
YASDANERPRSVLSLTLRMKWGLPMKVTIR NDIHNYRKWGDDQWLLDLGL PTDVGWTIEL
IDQLVTVWSAIQTIELTEHEDDQISWKLTSHGQYTAASAYNAQLLGTTANNFNNLIWKPW
APRKCKTFAWLIHMSLMSSGPGDGSTPPRRLLLHKENMGNPSGMDGVQQYSPQPLTAGFV
GVRM FSPSIFNEIGGSPIR
>AP003232.1e
$F CYP94D10 chromosome
1 clone P0034E02 51% to 94D1 4 diffs with BE230752
3 diffs
with AQ509587 78% to AZ135305
81213
MELSPISASLLLILILLAFLPLLYFLYMHQDPKKKPRIHGLKSYPVVGTLPHIIKNK 81043
81042
HRFLKWSTSIMKCSPTNTMSYKALGLTGGVITANPANVEHILKTNFDNYPKGKLTVSML 80866
80865 EDFLGHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEIVKRLLPLLEQAGLDGR
80686
80685
TLDLQDVLERFAFDNICLVAFGEDPACLTKERMAAPQSAEFMRAFNDAQNAILARFNSPA 80506
80505
KSLWRVKKLFNMEPERRMREALATIHGFAERIVRERRERGEAGLARGDDFLSRFAASGEH 80326
80325
SDESLRDVVTNFVLAGRDTTSSALTWFFWIVSGRPDVEDRVVREIRAVRASSGSTDATF 80149
80148
SFDELREKHYLHAAITESMRLYPPVAIDTHSCKEDDFLPDGTFVGKGWL 80002
80001
VMYSAYAMGRMEGIWGADCEEYRPERWLDEAGAFRPESTFKYPVFNAGPRICIGKE 79834
79833 MAYIQMKSIVACVLEKFSLRYAS 79765
DANERPRSVLSLTLRMKWGLPMKVTIR K* 79678
Gene No. :
9-4_225 CYP94D9 100%
agreement
1164142..1165689
(+)
>9-4_225
MELSSISASLLLILPLLPLLYFLYLHQDPKKQPRAHGLKSYPVVGTLPHFIKHKNHILEW
SAGVLKRSPMHTMSFKALGLTGGVFTANPANVEHMLKTNFGNYVKGEAIITMLEDFLGRG
IFNSDGEKWLWQRKATSYEFSKRTLRNFVVDTVQFEVIERLLPLLERAGRDGRTLDVQSV
LERFAFDNICRVVFDEDPACLAKDSVASPHIAEFMGACNDAQNAILARFNSPIKSLWRVK
RLFNIESERRLREALATIHAYTDRIIRERRERGEARGDDFLSRFAAGDKHSDESLHDVIT
NLVLAGRETTASALTWFFWLVSGRPDVEDNIVREIRAVRRASSSNGVTSGAAFSPHELRD
MHYLHAAITESMRLYPPVSLDTYVCKEEDFLPDGTFVGKGWQVTYCAYAMARVEDIWGTD
CEEFRPERWLDEAGVFRPESSFKYPVFHGGPRMCLGKEIAYIQMKSIVSCVFDRFTLRYT
GGEGHPGLVTSLALRMEGGLPMQVLLTNRGQAVSC
this
annotation missed CYP94D8P
>AP003232.1c
$P CYP94D8P chromosome 1 clone P0034E02 pseudogene fragments
no indica
ortholog found on 9/6/02
67707
FT*VFNDAQNTIVSRFLS*VKSL*RFKRLFNMEPKRQMWEALAR 67576
67575
HDPATPSGSFSKHNN*SLREVVTSFLLA 67492
67313
TFVGKGWLVIYYAYAMRYVEDIRGSDCEEFRLEQWMNKAGVF*PKSSFK 67167
67164
FEYPIFYIGQRMCLGKEMTYIHGFGFTALEKLMQRVA 67054
Gene No. :
9-4_229 CYP94D7 with
C-term extension
1179800..1181300
, 1181481..1181899 (+)
>9-4_229
MELSSTSASLLLILLLTLVYFLYLHQDPKKKPRTHGLKSYPVVGTLPHFINNKDRFLEWS
TGVMKRSPTHTMSFKELGLTGGVITANPANVEHILKANFGNYPKGELAVSLLEDFLGHGI
FNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEVVERLLPLLEYAGRHGRTLDVQDVL
ERFAFDNICRVAFDEDPACLTEESMAAPQSAEFMRAFNDAQNAILDRFNSPAKSLWRIKK
LFNMEPERRMRDSLATIHGYAERIVRERRERREARLERRDDFLSRFAASGEHSDESLRDV
VTNFILAGRDTTSSALTWFFWLLSGRPDVEDKIVREIRAVRQSSAGSEGTRGATFSLDEL
RDMQYLHAAITESMRLYPPVPFDTHSCKEEEFLPDGTFAGKGWLVTYCAYAMGRVEDIWG
ADCEEFRPERWLDEAGAFRPESTFKYPVFHAGPRMCLGKEMAYIQMKSIVACVLEQFSLR
YAGDAKGHPGLVVALTLRME ATLFGIHVSSAAVFGAPSEMRWARVVGYRIADLQEKLFAR
VDGLPMARPHRANLNTSSQPKSMPWRSGVGTGDKAEQLIQPLVLGVHRFILSPSPLINDE
WARIWNVSNSVGRIGERTCFFFRMRKRVSGLVFFANTQR
>AP003232.1b
$F CYP94D7 chromosome
1 clone P0034E02
BE230752.1
99MJ354 Rice Seedling cDNA clone 99MJ354 65% to 94C1
60720 MELSSTSASLLLILLLTLVYFLYLHQDPKKKPRTHGLKSYPVVGTLPHFINNKDRF
60553
60552
LEWSTGVMKRSPTHTMSFKELGLTGGVITANPANVEHILKANFGNYPKGELAVSLLEDFL 60373
60372
GHGIFNSDGEQWLWQRKAASYEFNKRSLRNFVVDTVRFEVVERLLPLLEYAGRHGRTLDV 60193
60192
QDVLERFAFDNICRVAFDEDPACLTEESMAAPQSAEFMRAFNDAQNAILDRFNSPAKSLW 60013
60012
RIKKLFNMEPERRMRDSLATIHGYAERIVRERRERREARLERRDDFLSRFAASGEHSDES 59833
59832
LRDVVTNFILAGRDTTSSALTWFFWLLSGRPDVEDKIVREIRAVRQSSAGSEGTRGATFS 59653
59652
LDELRDMQYLHAAITESMRLYPPVPFDTHSCKEEEFLPDGTFAGKGWLVTYCAYAMGRVE 59473
59472 DIWGADCEEFRPERWLDEAGAFRPESTFKYPVFHAGPRMCLGKEMAYIQMKSIVACVLEQ
59293
59292 FSLRYAGDAKGHPGLVVALTLRME GGLPMKVTIRE* 59185
Gene No. :
9-4_234 fusion of CYP94D6
(wrong N-term and splices out a frameshift)
and some
unknown gene plus a ribonuclease
E -related seq
1196669..1196829
, 1197128..1197950 , 1198031..1198583 , 1200577..1200837 , 1201013..1202439 (+)
>9-4_234
MPWTTTMDANEDGRCRRRTGVGAGGGGLTTAVPGGEDDGATNGRDRVARNKGN
STRKQPR
ADGLKAYPIVGILPHFVRNQHRLLEWSAGVVARCPTHTMSFNFKGFGLIAGAITGNPANV
EHIVKTNFQNYSKGEYVVSVMEDFLGHGIFNSDGDQWLWQRKAASYEFNKRSLRNFVVDT
VRSEVVDRLLPLLTRAERDGRTLDEQDVLERFAFDNVCCVAFDEDPACLTEEGMGTNART
EFLRAFNDAQNILMARFMSPVEWAWRAKRLLDLEPERRMREALATIHGYADRIVRERRER
GAAGLARKDDFLSRFAATGKHSDESLRD PDVEDRIAREIRAVRASSGSTDAAAFSFDELR
EMHYLHAAITEAMRLYPPVAMDSHCCQNDDVLPDGTFVGKGWQVTYSAYAMARLEELWGA
DCEEFRPERWLDEDGVFRPESPFKYPVFHGGPRMCLGKEMAYIQMKSIAACVFERFSFRF
VGGEGRPGLVFSVTLRMEGGLPMRVKKRRDSV SNEEAEVKVSDAETAAESAAGEAEADDV
EEEETKELRRNGVAAAPIGDALRERRPPPIAAAEILPEWSERAAIGETRASAMFGESRCD
GALLHNQPPDLIAVGRRRRRMDPSRPLLGRGALITSSAHAAAALLLVAFLFLTLRNLPIS
LSPPTAALTPTTSHLEQQDQASCDTTSTLDCADPQLFHLMMRRAIDAFPDVHFNRFGRPV
PGDPPSSSCDMAWRARSTASANYKDYRRFSVARDPVTCAYSVTSIGEYHSGPLARKPRRG
GTNATAPPPPPALSRSQFAAGKYLSYLGGGDRCKPMPHYLRSLLCSIAEARYLNRTLVLD
LSVCLAAAYAGGMPEEGKRLAFYIDIEHLQSVVGIVEHKRFWEDWDKWGAQGQLGVRIIE
DSRVAPTKFSKSRDPLIVRKFGDVEPGNYWYNVCEGEAEHVLRPPQGAIRTAPSLMDIVD
GIISRMQVDFDSVHVGGNDGNLRRRIEERLNGG
GRQVYVAGEGINVVLLDALKAKYSSVH
YLDAFEELWARDSKWFLEMKRLNGGVPVEFDGYMRELVDREVFLKGKKKVEVLV
>AP003232.1a
$F CYP94D6 chromosome
1 clone P0034E02
AZ135359.1
OSJNBb0115O14f CUGI Rice BAC genomicLength = 843 50% to 94D2
43934
MGTELSLTSSVPLLLLLLFPLLCFLCLRHG STRKQPRADGLKAYPIVGILPHFVRNQHRL 43755
43754
LEWSAGVVARCPTHTMSFNFKGFGLIAGAITGNPANVEHIVKTNFQNYSKGEYVVSVMED 43575
43574
FLGHGIFNSDGDQWLWQRKAASYEFNKRSLRNFVVDTVRSEVVDRLLPLLTRAERDGRTL 43395
43394
DEQDVLERFAFDNVCCVAFDEDPACLTEEGMGTNARTEFLRAFNDAQNILMARFMSPVEW 43215
43214
AWRAKRLLDLEPERRMREALATIHGYADRIVRERRERGAAGLARKDDFLSRFAATGKHSD 43035
43034 ESLRDV X 43017 frameshift
43014
TNFVLAGRDTTSSALTWFFWLVSGQ PDVEDRIAREIRAVRASSGSTDAAAFSFDELREMH
42835
42834
YLHAAITEAMRLYPPVAMDSHCCQNDDVLPDGTFVGKGWQVTYSAYAMARLEELWGADCE 42655
42654
EFRPERWLDEDGVFRPESPFKYPVFHGGPRMCLGKEMAYIQMKSIAACVFERFSFRFVG 42478
42477 GEGRPGLVFSVTLRMEGGLPMRVKKRRDSV C* 42382
Gene No. :
9-5_074 CYP73A35P
erroneously converts pseudogene into a complete gene
386581..387133
, 387239..387391 , 387488..387651 , 387728..387815 ,
387886..388247
(-)
>9-5_074
MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPVPVFGNWLQVGDDLNHRNLAA
LARRFGDIFLLRMGQRNLVVVSSPPLAREVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFT
V GTAPGGRPRPRPWWTASAPTPPPPPAAHD ALNGERSRLAQSFEYNYGDFIPILRPFLRG
YLRICEEVKETRLKLFKDFFLEERK KLASTKAMDN NGLKCAIDHILEAQQKGEINEDNVL
YIVENINVA GTRASVQ VNHGEIQEKLRRELDTVLGPGRQITEPDTHRLPYLQAVVKETLR
LRMAIPLLVPHMNLRDAELAGYGIPAESKVLVNAWYLANDPGRWRRPEEFRPERFLEEER
NVEANGNDFRYLPSGAGRRSCPGIVLALPILGVTIGRLVQNFELLPPPGKDRVDTTEKGG
QFSLHILKHSTIVAKPRAF
>CYP73A35P $P Oryza sativa (rice) GenEMBL AP003446.1 March 29, 2001
AP003302.1 Feb. 21, 2001 Both sequences have the
same frameshifts
MDLLFVERLLVGLLAAAVVAIAVSKLRGRKLRLPPGPTPVPVFGNWLQVGDDLNHRNLAA
LARRFGDIFLLRMGQRNLVVVSSPPLAREVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFT
V
YGDHWRKMRRIMTVPFFTGKVVQRHRAGWEAEAAAVVDGLRADPAAA 168 (25 nuc. deletion and
frameshift in both sequences)
175
RRRLQLMMYSNVYRIMFDRRFESADDPLFLRLK ALNGERSRLAQSFEYNYGDFIPILRPF
LRGYLRICEEVKETRLKLFKDFFLEERK (2?)
NGLKCAIDHILEAQQKGEINEDNVLYIVENINVA (1)
AIETTLWSMEWAIAEL
(2 nuc. insertion and frameshift in both seqs.)
VNHGEIQEKLRRELDTVLGPGRQITEPDTHRLPYLQAVVKETLRLRMAIPLLVPHMNLRD
AELAGYGIPAESKVLVNAWYLANDPGRWRRPEEFRPERFLEEERNVEANGNDFRYLPSGA
GRRSCPGIVLALPILGVTIGRLVQNFELLPPPGKDRVDTTEKGGQFSLHILKHSTIVAKPRAF*
Gene No. :
9-6_096 CYP86A9 with
N-term extension (may be correct)
486465..488102
(+)
>9-6_096
MATDGGVLQLHPYA
MAAAAVALASAYMVWFWALSRRLSGPRMWPLVGSLPSVVLNRARVH
DWIADNLRATGDAATYQTCILPLPFLARRQGLVTVTCNPRNLEHILRARFDNYPKGPMWQ
ASFHDLLGQGIFNSDGETWLIQRKTAALEFTTRTLRQAMARWANRSIKYRLWRILDDHCN
AAASVDLQDLLLRLTFDNICGLTFGKDPETLSPGLPENPFANAFDEATEATMQRFLFPSL
LWRIKKAFGVGSERSLRDSLAVVDRHMTETIAARKATPSDDLLSRFMKKRDSKGKAFPED
VLQWIALNFVLAGRDTSSVALSWFFWTLMQRRDVERKVVLEIASVLRETRGDDTARWTEE
PLNFDELERLVYLKAALTETLRLYPSVPQDSKYVVADDVLPDGTVVPAGSAITYSIYSVG
RMESIWGKDCAEFRPERWLSADGSRFEPVKDAYRFVAFNGGPRTCLGKDLAYLQMKSIAS
AVLLRNSVELVPGHKVEQKMSLTLFMKNGLRVHVKPRDIASYVEPSEPAPPQGSLVIPTT
TAAAA
>AP003442.1
$F CYP86A9 chromosome
1 clone B1096A10 72% to 86A1
AQ954084.1
nbeb0054A03r CUGI Rice BAC genomicLength = 403 57% to 86A1
D41651 71% to 86A1 almost identical to
AP003442
49790 MAAAAVALASAYMVWFWALSRRLSGPRMWPLVGSLPSVVLNRARVHDWIADNLRATGDAATYQTCILPLPFLARRQGLV
50026
50027
TVTCNPRNLEHILRARFDNYPKGPMWQASFHDLLGQGIFNSDGETWLIQRKTAALEFTTR 50206
50207
TLRQAMARWANRSIKYRLWRILDDHCNAAASVDLQDLLLRLTFDNICGLTFGKDPETLSP 50386
50387
GLPENPFANAFDEATEATMQRFLFPSLLWRIKKAFGVGSERSLRDSLAVVDRHMTETIAA 50566
50567
RKATPSDDLLSRFMKKRDSKGKAFPEDVLQWIALNFVLAGRDTSSVALSWFF 50722
50723 WTLMQRRDVERKVVLEIASVLRETRGDDTARWTEEPLNFDELERLVYLKAALTETLR
50893
50894
LYPSVPQDSKYVVADDVLPDGTVVPAGSAITYSIYSVGRMESIWGKDCAEFRPERWLSAD 51073
51074
GSRFEPVKDAYRFVAFNGGPRTCLGKDLAYLQMKSIASAVLLRNSVELVPGHKVEQKM 51247
51248
SLTLFMKNGLRVHVKPRDIASYVEPSEPAPPQGSLVIPTTTAAAA* 51385
Gene No. :
9-6_137 fusion of CYP94C3
with an RNA helicase
and additional unknown seq.
738785..738862
, 738881..739121 , 741031..741099 , 741481..741552 , 741601..741712 ,
741741..741829 , 742152..742215 , 742298..742474 , 743125..743304 ,
743588..743853 , 743896..743951 , 744169..744316 , 744369..744639 ,
744737..744758 , 745157..745287 , 746102..746147 , 746280..747800 (-)
>9-6_137
MGVVEAEALHGAVEALAGSLQPHVATAFFVFSACTVALAALLAVVRLRPPWWCDCTVCEA
FLTASWAGEFDNLCDWYAHLLRTSPAQTVHVHVLRNVLTANPVTVDHVLRARFDNYPKGA
PFSAILADFLGRGIFNVDGDAWLFQRKLAAAELASPALRAFAARVVASELRCRLIPLLHS
ASREGNGKVLDLQDMFRRFAFDSICKISFGLDPGCLELSMPVSTLVEAFDTASTLSARRA
TVPMQIIWRLKRFLNVGDERKLRDAVRLVDALAAEVIRQRRKLGGAATGSDLLSRFMGSI
DDDKYLRDIVVSFMLAGRDTIASALTAFFLLLSDHPEVATAIRDEVARVTGDGNRTMAAT
FDKLKDMHYVHAAMYESMRLFPPVQFDSKFAAGDDTLPDGTVVAKGTRVTYHAYAMGRME
SVWGPDCAEFRPERWLRDGRFVPESPYRYPVFQAGVRVCIGKELALMEMKAVIVAVVRSF
DIEAIARSSRRPKFAPGLTATFAGGLPV TCIMSIMIDDSVTTRTKLGKFHGASEEGRPEI
STCQKVRMPSANKTPTPRLIPFIGCMGQLATAGAWDRASGKVAELVICPIYANLPAELQA
KIFEPAPAGARKVVLATNIADAETPYNPRTAMESFLVAPVSRASAEQRACRSGIQVRVAA
AVASAAAAASATAVIDETLGVLYRSNRVLIPICFGWNCNLTTKENEHERLLVLVDRDSDT
IGAYVCNPTTRRHDGAFIAFDPAVSQTRRLACGVVGRTPTLDLLIPESSSGEDELAPSPE
ERTLLLRVFSVSHAAAASASQVVGCSCQVRLRRRALTTRRPHKMDQVNLQCVQQKQFLVR
HFHCKLVAVNGFCRHPFRLVPSVDFSSAENDNACARIQDSGNHRAQPSPSDRSRQLFFFL
HEGLVRTGASVPIGSPGTRRGVDSGRSSPAVQPTGTKASGGRCRRHRRWLLSSCCLGPCR
RTQVEIFCGDAKFFSFRSDASTFIFGRPLQAHLSTSTTEAELQATVVGGWRRQQIDEKDL
LFFLLMRRGPFLQNWLYPSKPCYIRHSALLLVGILRACSDAVTEKREHIGTAMVNGVAPP
PPTSGAAPPLSSSDRPSPHPLSRLSCDGSGDGDRGWIRPRWWRPRTDLAAAAPSDPSRLP
PTVDLTALTVSAPSPERRWAREPVAAESSDADNGNDGGGE
>AP003289
$F CYP94C3 55% to 94C1
CDS complement(91864..93438)
MGVVEAEALHGAVEALAGSLQPHVATAFFVFSACTVALAALLAV
VRLRPPWWCDCTVCEAFLTASWAGEFDNLCDWYAHLLRTSPAQTVHVHVLRNVLTANP
VTVDHVLRARFDNYPKGAPFSAILADFLGRGIFNVDGDAWLFQRKLAAAELASPALRA
FAARVVASELRCRLIPLLHSASREGNGKVLDLQDMFRRFAFDSICKISFGLDPGCLEL
SMPVSTLVEAFDTASTLSARRATVPMQIIWRLKRFLNVGDERKLRDAVRLVDALAAEV
IRQRRKLGGAATGSDLLSRFMGSIDDDKYLRDIVVSFMLAGRDTIASALTAFFLLLSD
HPEVATAIRDEVARVTGDGNRTMAATFDKLKDMHYVHAAMYESMRLFPPVQFDSKFAA
GDDTLPDGTVVAKGTRVTYHAYAMGRMESVWGPDCAEFRPERWLRDGRFVPESPYRYP
VFQAGVRVCIGKELALMEMKAVIVAVVRSFDIEAIARSSRRPKFAPGLTATFAGGLPV
RVRRRRARASGHNPPI
NOT IN
THIS ANNOTATION
>AP006237.3
CYP734A6 (japonica cultivar-group) genomic DNA, chromosome 1, BAC
clone:OSJNBb0008D07
Length =
156874
complement(join(69916..70392,70471..70870,71055..71529,72013..72430))
72340
MGWWGWAAAAAAAAAWVAV
72373
KVLEVLWWRPRRVEEHFARQGITGPRYRFLVGCVREMVALMVAASAKPMPPPYRSHNVLP 72194
72193
RVLAFYHHWKKIYGNPPPPPLLLNSILSQKQQPRTRRWQVAVVGERFAPGRYDIDMMAAL (1) 72014
71530
GSTFLIWFGPTPRLAIADPELIREVLLARADRFDRYESHPMVRQLEGEGLVSLRGDKWAH 71351
71350
HRRVLTPAFHMDNLRLLLPCVGMTVLDMADKWRAMAEADKSGEVEIDVSDWFQVVTEDAI 71171
71170
TRTAFGRSYEDGKVVFKLQAQLMAFASEAFRKVFIPGYR (2) 71054
70869
FLPTKKNTSSWKLDKEIRKNLVTLIGRRQEAGDDEKLDGCAKDLLGLMINAAASS 70705
70704
NGGKRSALPVSPITVNDIVEECKTFFFAGKQTTSNLLTWAIVVLAMHPEWQERARQEV 70531
70530
LDVCGADGVPSREQLAKLKT (0)
70392
LGMILNETLRLYPP 70351
70350
AVATVRRAKADVELGGYLRIPRDTELLIPIMAVHHDARLWGPDAAQFNPARFAGGVARAA 70171
70170
RHPAAFIPFGLGARMCIGQNLAILEAKLTVAVILHRFEFRLSARYVHAPTVLMLLHPQYG 69991
69990
APIVFRPRSSSQPTCEKMNPLTSS* 69916
NOT IN
THIS ANNOTATION
>AP004232.1
$F CYP71C18 chromosome
1 clone OSJNBa0051H17 like CYP71C4
90% to
aaaa01002989.1 version 4 in Genbank does not allow for
frameshift
and skips beginning of heme signature
probably
not an ortholog no 99% match in indica 9/5/02
56483
MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 56650
56651
PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815
56816
THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995
56996
E 56998 (0)
57929
VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105
58106
VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285
58286
DQDEMDFVDVLLLQERGITRDHLKAIL 58366 (0)
58462
DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641
58642
VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821
58822
RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift
58899
RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078
59079
VEYKGSVQDSAVIL* 59123
NOT IN THIS ANNOTATION
>AP004233.1
$F CYP71C19 chromosome
1 clone OSJNBa0065J17 50% to CYP71C4
= AQ857130
duplicate of the AP004232 gene at 27203-29726
probably
not ortholog only 91%
21862
MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP
21694
PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR 21518
21517
THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344
21343
E 21335 (0)
VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN
SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS
KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAIL
(0)
20004
DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825
19824
VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645
19644
RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA 19465
19464
IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339
NOT IN
THIS ANNOTATION
>AP003142.2
$P CYP72A36P chromosome 1, PAC clone:P0435H01 probable pseudogene
53% to
72A15 86% to AP002899
5354
YLPIENNRRIREIY*EIRKILRGLIVKGDKAIRNGENTNDDLLGLLVESNMRQSNERE 5181
5180
EVGMSIED 5157
1115
IIEECRLFYFAGSETTS 1065 frameshift
1065
MLLT*TLIMLSMHPEWQERAREEVMHHFRRTTPDHDGLSRLKIVHM 928
NOT IN
THIS ANNOTATION
>AP003261.1
$P CYP76H12P chromosome 1 clone P0471B04,
N-terminal
exon of a CYP75 like sequence 70% identical to X70824 but no other part of
the gene
is found. Identical to AP003227
87336-87082 almost identical to AP003214
132381
MELAALCTDPVVLSAAFLCLLLNLSLCSYRPPSPGGGRRLPLGPPGVPVLGALPLVGPAPHADLASLARK 132172
132171
YGPIMYLKMGTCGVV 132127