584 sequences - 18 from other species = 566
cottonwood sequences
Last modified April 1, 2005 D. Nelson
Some names revised on 8/24/2006, mostly minor
changes like v1 to P1 etc.
<CYP51 Clan, 2 sequences, both full length,
95% identical.
>CYP51G1
Scaff LG_I (-)4925909-4924104
84% to Arab. 51G1 95% to Scaff LG_III CYP51G5
seq.
fgenesh1_pm.C_LG_I000188|Poptr1 gene model correct
FKBP-type peptidyl-prolyl cis-trans isomerase
downstream
$
4925909 MTGDTDNKFLNVGLLIIATLLVAKLISALIMPRSQKRLPPVMKGWPLIGG
4925760
4925759
LIRFLKGPIVMLREEYPKLGSVFTVNLVNRKITFLIGPEVSAHFFKASEV 4925610
4925609
DLSQQEVYQFNVPTFGPGVVFDVEYSIRQEQFRFFTEALRVNKLKGYVDQ 4925460
4925459 MVVEAE 4925442 (0)
4925102
DYFLKWGDSGVVDLKYELEHLIILTASRCLLGREVRDKLFDDVAALFHD 4924956
4924955
LDNGMLPVSVLFPYLPIPAHRRRDRARKKLAEIFANIINSRKLASKSEND 4924806
4924805
MLQCFIDSKYKDGRPTTESEITGLLIAALFAGQHTSSITSTWTGAYLLRH 4924656
4924655
NEYLSAVLEEQKNLMKKHGNKVDHDILSEMDVLYRCIKEALRLHPPLIML 4924506
4924505
LRSSHSDFSVTTRDGKEYDIPKGHIVATSPAFANRLPHVFKDPDSYDPDR 4924356
4924355
FAYGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWSHLLRNFELE 4924206
4924205
LISPFPEIDWNAMVVGVKDKVMVRYKRRELSVN* 4924104
>CYP51G5
Scaff LG_III (+)14476000-14477787
83% to Arab. 51G1 95% to Scaff LG_I CYP51G1
seqF.
eugene3.00031308|Poptr1 gene model correct
FKBP-type peptidyl-prolyl cis-trans isomerase
downstream
$
14476000
MTKDTDNKFLNVGLLILATLLVAKLISALIMPRSQKRLPPVMKGWPLIGG 14476149
14476150
LIRFLKGPIVMLREEYPKLGSVFTVNLANWKITFLIGPEVSAHFFKASEA 14476299
14476300
DLSQQEVYQFNVPTFGPGVVFDVDYSIRQEQFRFFTESLRVSKLKGYVDQ 14476449
14476450
MVVEAE 14476467 (0)
14476789
DYFSKWGDSGVVDIKYELEHLIILTASRCLLGREVRDKLFDDVSALFHD 14476935
14476936
LDNGMLPISVLFPYLPIPAHRRRDRARKKLAEIFANIINSRKLAGKSEND 14477085
14477086
MLQCFIDSKYKDGRPTTESEITGLLIAALFAGQHTSSITSTWTGAYLLRH 14477235
14477236
NEYLSAVLEEQKNLMKKHGNKVDQDILSEMGVLHRCIKEALRLHPPLIML 14477385
14477386
LRSSHSDFSVTTRDGKEYDIPKGHIVATSPAFANRLPHVFKDPERYDPDR 14477535
14477536
FAAGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWSHLLRNFEFE 14477685
14477686
LISPFPETDWNAMVVGVKDKVMVRYKRRELSVN* 14477787
<CYP71
clan 22 families, 71, 73, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 89,
92,
93, 98, 701, 703, 705, 706, 712, 736
66
sequences
29 nearly
complete CYP71 like sequences and some related partial seqs.
<71B
subfamily sequences (10 seqs all named)
three full
length sequences 71B38, 71B40 and 71B41 are all about 97% identical
this is
too similar for the genome duplication date.
>CYP71B41-de1b
LG_VIII (-) 12482829-12482572
71B like 100% to LG_VIII.4 LG_VIII.10
LG_VIII.16
eugene3.00081725|Poptr1 N-term only
$
12482829
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12482680
12482679
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSA 12482572
>CYP71B44P
LG_I
(+) 19811612-19812590
71B
like pseudogene 51% to 71B36
53% to 71B41
fgenesh1_pg.C_LG_I001954
[Poptr1:64602] model short
exon 1
$
19811612
MACHDPLIMWSLPLVLFFSLLMFLLIRKKQNKQQIPPTPPRLPIIGNLHQLGDLS 19811776
19811777
QRSLWQLSKKYGPVILLKLGAVPAVVISSAEAAKEVLKTNDLHACSRPLL 19811926
19811927
AGTGRLSYNYSDVSFTYTYGDYWRKM*KICVLELCSARRVQSFLF 19812061
19812135 IREEEVALLIDTISAYSFSATPVDLSEKILSFTANITCRAAFGK
19812266
19812267
SFQEIKGFDGKRFEEVIREASAILASFSAADFFPKDGWIIERLTG 19812401
19812402 LLHSRLERSFRELDVLYRRVIDDHIKLEE
19812485
19812486
EEKEDIVGGPLKL*RDQTEFGTIQLTHDHIKAKLM 19812590 (0)
>CYP71B40v1
LG_VIII.4 (-) 12489221-12487004
71B like 53%
to 71B36 97%
to LG_VIII.16
eugene3.00081726|Poptr1 gene model short at N-term
$
12489221
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12489072
12489071
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDVAF 12488922
12488921 CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKILTLELFSLKRVQSFRF
12488772
12488771
IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12488622
12488621
FDRDKFHEVVHDTVAVVGSISADESIPYLGWIVDRLTGHRARTERVFHEV 12488472
12488471
DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKEQTELGASQFTKDNIKAILL 12488316 (0)
12487621
NLFMAGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 12487472
12487471
DQLEYLRMVIKETLRLHPPGPLLIPRETMSHCKVSGHNIYPKMLVQINVW 12487322
12487321
AIGRDPRYWKDPEEFFPERFLDRSIDYKGQSFEYLPFGSGRRICPGMHMG 12487172
12487171 SITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVP
12487022
12487021 VNYLP* 12487004
>CYP71B40v2
scaffold_994 (+) 9952-10569
CYP71B40 100% match exon 2
fgenesh1_pg.C_scaffold_994000003|Poptr1 duplicate seq
$
9952 NLFMAGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 10101
10102 DQLEYLRMVIKETLRLHPPGPLLIPRETMSHCKVSGHNIYPKMLVQINVW
10251
10252
AIGRDPRYWKDPEEFFPERFLDRSIDYKGQSFEYLPFGSGRRICPGMHMG 10401
10402
SITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVPVNYLP* 10569
>CYP71B41
LG_VIII.16 (-) 12479032-12476806
71B like 54%
to 71B36
eugene3.00081724|Poptr1 gene model short at N-term
$
12479032
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12478883
12478882
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDLAF 12478733
12478732
CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKIVTLELFSLKRVQSFRF 12478583
12478582
IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12478433
12478432
FDRDKFHEVVHDTEAVAGSISADESIPYLGWIVDRLTGHRARTERVFHEL 12478283
12478282
DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKEQTELGASQFTKDNIKAILL 12478127 (0)
12477423 NLFLGGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI
12477274
12477273
DQLEYLRMVIKETLRLHPPAPLLIPRETMSHCKVSGHNIYPKMLVQINVW 12477124
12477123
AIGRDPTYWKDPEEFFPERFLDSSIDYKGQSFEYLPFGSGRRICPGMHMG 12476974
12476973
FITMEIILANLLYCFDWVYPDGMKKEDINMEEKAGVSLTTSKKTPLILVP 12476824
12476823 VNYLQ* 12476806
>CYP71B38
LG_VIII.10 (-) 12547429-12545187
71B like 54%
to 71B36 97% to LG_VIII.4
fgenesh1_pg.C_LG_VIII001676|Poptr1 gene model short on
the N-term
$
12547429
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12547280
12547279
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDLAF 12547130
12547129
CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKVLTLELFSLKRVQSFRF 12546980
12546979
IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12546830
12546829
FDRDKFHEVVHDTEAVAGSISADESIPYLGWIVDRLTGHRARTERVFHEL 12546680
12546679
DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKDQTELGASQFTKDNIKAILL 12546524 (0)
12545804
NLFLGGVDTISLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 12545655
12545654
DQLEYLRMVIKETLRLHPPAPLLITRETMSHCKVSGHNIYPKMLVQINVW 12545505
12545504 AIGRDPTYWKDPEEFFPERFLDSSIDYKGQSFEYLPFGSGRRICPGMHMG
12545355
12545354
FITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVP 12545205
12545204 VNYLQ* 12545187
>CYP71B43P
LG_X.28 (-) 6738703-6736564
71B like 52%
to 71B36 86% to LG_VIII.4
fgenesh1_pg.C_LG_X000579|Poptr1 gene model wrong 2
frameshifts
possible pseudogene or two seq. errors.
$
6738703 MALYAVPLWLPLILLLPLLLLFMKRMKDAGQSEQLLPPGPP 6738581
6738581
KLPILGNLHQLSSLPHQSMWHLSKKYGPVMLLRLGQIPTVVISSAEAA 6738438
6738437 REVLKVHDLAFCSRPLLSGAGRLTYNYLDIAFSPYSDHWRNMRKIVTLEL
6738288
6738287
FSLKRVQSFRFIREEEVGFLVNSLSESSALAAPVDLTQKVYALVANITFR 6738138
6738137 VAYGFDYRGTTFDRDRFHEVVHDTEAVVGSISADEYVPYLG 6738015
6738015
MIVDWLTGHRARMERVFHELDTFFQHVIDNHLKPGRIKDHDDMIDV 6737878
6737877 LLRIEKEQTELGASQFTSDNIKAVLL 6737800
(0)
6737179
NLFLGGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGKKGRVTEGDV 6737030
6737029
DQLEYLRMVIKETLRLHPPAPLLLPRETMSHCIVSGYNIYPKTLVHVNVW 6736880
6736879 AIGRDPKYWRDPEEFFPE 6736826
6736827
RFLDSSCDFNGQSFEYLPFGSGRRICPGIHMGSITVEIILSNLLHCFDWI 6736678
6736677
LPHGMQKEDINMEEKAGVSLAPSKKTPVILVPVNYLQ* 6736564
>CYP71B42P
LG_VIII.30 (-) 12460686-12458958
71B like pseudogene 95% to LG_VIII.31
54% to 71B24
eugene3.00081722|Poptr1 gene model short, frameshifts
$
12460686 MAFYILPLALLLLLLFPLPLILKKKQQ 12460608
12460608 KLYVLELFSLK 12460576
12460577
RVQSFRFIREEVISLLMNSISQSSSPATPVNLTQMLYSVFASIVFGMALG 12460428
12460427 KSFQGSDFHNERCRKSIHEAE 12460365 (small deletion
here)
12460365
GHNRMLITGYDARIERVFLELDTLFQQVIDDHLKAERKERKEDIINVLLKMERDQT 12460198
12460197 ESSAPQLTKYNIKAVIL 12460147 (0)
12459572
NIFLGGVDTGAITVIWATAEIARNPIIMKKAQEEIRSSVGQKGRATEERT 12459423
12459422
DELQYLKMVIKETLRLHPPAPLLLPRETMSRCQINGYDIYLKTLIQVNAW 12459273
12459272
AIGRDPEYWRDSEEFFPERFVDSPIDYKGQRFEFLPFGSGRRVCPGILMG 12459123
12459122 VTMVELALANLLHCFDWKLPNAV 12459054
12459053 AINMEEAAGLTISKKNPLFLVRINYPQQAQPD 12458958 sequence gap
here
>CYP71B39P
LG_VIII.31 (-) 12534486-12532739
71B like pseudogene 95% to
LG_VIII.30
eugene3.00081733|Poptr1 gene model short, frameshifts
$
12534486 LLLLLLFPLPLILKKKQQ 12534433
12534433 KLYVLELFSLK 12534401
12534402
RVQSFRFIREEVISLLMNSISQSSSPATPVNLTQMLYSVFASIVFRMALG 12534253
12534252
KSFQGSDFHNERCRKAIHEAE 12534190 (small deletion here)
12534190
GHNRMLITGYDARIERVFLELDTLFQQVIDDHLKAERKERKEDIINVLLKMERDQT 12534023
12534022 ESSAPQLTKYNIKAVIL 12533972 (0)
12533397
NIFLGGVDTGAITVIWATAEIARNPIIMKKAQEEIRSSVGQKGRATEERT 12533248
12533247
DELQYLKMVIKETLRLHPPAPLLLPRETMSRCQINGYDIYLKTLIQV 12533107
12533107
NAWAIGRDPEYWRDSEEFFPERFVDSPIDYKGQRFEFFPFGSGRRVCPGI 12532958
12532957 VMGVTMVELALINLLYCFDWKLPNAV
12532880
12532879 DINMEEAAGLTISKKMPLFLVPINYPQRAQPDKMSRTSLLSKHTCS*
12532739
>CYP71B45P
LG_X (-) 6735316-6733923
71 like pseudogene new 75% to CYP71B42P
fgenesh1_pg.C_LG_X000578|Poptr1 71 like I-helix + C-term (pseudogene?)
downstream of fgenesh1_pg.C_LG_X000579
$
6735316
ILSLGRTPTLVVSSAEAARAVLKTHDLDCCCRPRLSGSGRLTYNHVYVAF 6735167
6735166
APYGDYW*EMRKLFVLEPFILKRVQSFRFITGEVARIMNSIPQSSS 6735029
6734867 PYAG*ILDKVTDHHARIERVLH 6734802
6734482 NIFLGGVHAGAITVIWALEELAWNPRTMKKAQDEIRNSVGKKGRLAEESI
6734333
6734332 DEL 6734324
6734322 TLVIKETLRWQ 6734290
6734288
PPAPLLLPRETMSHCKINGYHIYPKILIQINV*AIGSDPTYWNDS*EFF 6734142
6734141
PERFVDSSID*KGQHFEFLPFGSGRRGCPGILMGVTMVELALANLLYCLDW 6733989
6733988 KSAKAIDINMEEAAGLTISKKM 6733923
<71D
subfamily 40 sequences all named
>CYP71D38-de2b
scaffold_710 (-) 1341-893
pseudogene 89% to 726B1 J-helix to end
eugene3.07100001|Poptr1 gene model short
$
1341 VIKNPRVLEKAQKEGRQVFND 1279
1276
LGTIPDETSLHDSKFLKLIIKETLRLHPPAPLMIPIECRKRYNVNGYDTHVKSKVLINAWAIGRDP 1079
1078 NYWNELERFYPERFINVSTDFKGIDFEFIPFGAGKRMCPGKL 953
953
VKRIRDLKLIPV 918
916 SYRSLVG* 893
>CYP71D38
scaffold_710 (-) 19864-18128
726A like 50% to 726A1 euphorbia
eugene3.07100003|Poptr1
$
19864
MLISLPVFLTILLVISILWTWTKFIKSNKSSSNPPPGPWKLPFIGNLHQL 19715
19714
VHPLPHHRMRDLAKKFGPVMQLQVGEVSTVIISSSEAAKEVMKTHEINFV 19565
19564
ERPHLLAASVLFYNRKDIAFAPYGEYWRQLRKISILELLSAKRVRSFKSI 19415
19414
REEEVSNFIASIYSKEGSPINLSRMIFSLENGITARTSIGNKCKNHEGFL 19265
19264
PIVEELAEALGGLNMIDIFPSSKFLYMVSRVRSRLERMHREADEILESII 19115
19114
SERRANSALASKMGKNEEDDLLGVLLNLQDHGNLEFQLTTSTIKAVIL 18971 (0)
18748
EMFSGGGDTSSTALEWAMSELIKNPRVMEKAQKEVRQVFNDLGTIPDETS 18599
18598
LHDLKFLKLIIKETLRLHPPVPLIPRECRKRCDVNGYDIHVKSKVLINAW 18449
18448
AIGRDPNCWNEPERFYPERFINVSTDFKGSDFEFIPFGAGKRMCPGMLFA 18299
18298
TANTEFPLAQMLYHFDWKPAGGLKPENLDMTESFGGAVKRKQDLKLIPIS 18149
18148 YRSLVG* 18128
>CYP71D38-de2c
scaffold_710 (-) 21263-20928
pseudogene
79% to 726B1 C-term
$
21263 NDLGTILDETS 21231
21213 LKLITEETLRLHPSAPLIPREWRKRCQVNGYD
21118
21117 NIHVKSKVLINAWA 21076
21085 CMGMLFAIA 21059
21059
HHFDWKPIDGLKPENLDMTESLGGATKRKRDLKLIFISYRSLVG 20928
>CYP71D23P
LG_XV.1 (+) 7113294-7114921
71D like possible
pseudogene 46% to 71D10 57% to LG_VII.22
fgenesh1_pg.C_LG_XV000688|Poptr1 gene model wrong 2 frameshifts
and a stop codon
$
7113294
MEQHFPLFAIFLTFLLFIFMVLRMRKKSETNKYLTTNPPPGPWKLPLVGN 7113443
7113444
IHHVAGHQIHHRFTDLARKYGPVMQILLGEVRFVVISSRETAKEVMKTNE 7113593
7113594
NIIVDRPDGVIPRIVFYNGKAISFTPYGEYWKQLRKSCSSKLLSPQCVRS 7113743
7113744
LIRSTMEEEVSDFVTSISSKEGSPINLSKMLFTLTFGLISRVILGKKGKN 7113893
7113894
QALLSSIEEWKQGGAGFDVADIFPSFKLFHSLGWARSKFVRQHQEIGEML 7114043
7114044
ETVINERRASKIRTKTSEHEIEEDFLDVLVNMQHALR 7114154
7114156 NLEFTNDNIKAILL 7114197 (0)
7114292 EFFLAGSDSSSAVMEWAMSEMLKNPRHMKRAQKEVRVVFTKMGNDDETRL
7114441
7114442
HELKYLQLIIKETTRLHPPAPLILRACREACKINGHDIPDRSNVMINAWA 7114591
7114592 IGRDPTYWNEA* 7114627
7114628
KFNPERFLDSSIDYMGTNFEFIPFGAGKRKCPGMAFGLAIVEMALAKLLY 7114777
7114778
IFDWKLCDGVKNEDLNMKEDTALGSTVKRKHELYLIPIPYHPSSPAK* 7114921
LG_VII.22 (+) 5888964-5891028
71D like 50%
to 71D4, 75% to LG_VII.12
fgenesh1_pg.C_LG_VII000682|Poptr1 gene model correct
$
5888964
MEFPILLASLLFIFAVLRLWKKSKGNGSTLALPPGPWKLPLIGNIHQLAG 5889113
5889114 SLPHHCLTDLAKKYGPVMQLQIGEVSTVVVSSGEAAKEVMKTHEINFVER
5889263
5889264
PCLLVANIMFYNRKNIGFAPYGDYWRQMRKVCTLELFSAKRVRSFRSVRE 5889413
5889414
EEVSNFIRNIYAKAGSPINLSKMMLDLSNGVIARTSIGKKSKNQEAFLPI 5889563
5889564
IEDVAEALAGLNIVDVFPSAKFLYMISKLRSRLERSHIEADEILENIINE 5889713
5889714
RRASKEERKTDQDNEVEVLLDVLLNLQNQGNLEFPLTTDSIKAIIV 5889851 (0)
5890402
EMFGAGSETTSTLLEWSMSEMLKNPRVMKKAQEEVRQVFSDSENV 5890536
5890537
DETGLQNLKFLKLIIKETLRLHPPISLIPRECSKTCEINGYVIQAKSKVI 5890686
5890687
INAWAIGRDSNDWTEAEKFYPERFQDSSIDYKGTNFEFIPFGAGKRMCPG 5890836
5890837
MLFGIGNAELLLARLLYHFDWKLSSGAALEDLDMNEAFGGTVKKKHYLNL 5890986
5890987 IPIPYGPCPLPVE* 5891028
>CYP71D42
LG_VII.3 (-) 5804810-5802796
71D like 49%
to 71D4 99% to LG_VII.27
note these two gene names were both assigned to the same
sequence
eugene3.00070713|Poptr1 gene model seems correct
$
5804810 MDQVFQFIYILIVPFLLLIFPVLRLWKKSQGNNSSTPPPPPGPWKLPLIGNLHQLL 5804643
5804642
GSLPHQVLRDMANKYGPVMQLQIGEVPTVIISSPEAAKEAIKTHEINFVD 5804493
5804492
RPCLLVAKVMFYNSKDIAFAPYGDYWRQMKKVCVLELLSAKRVKSFRSIR 5804343
5804342
EEEVSNFMRTIYSKAGSPINLSKMMFDLLNGITARASVGKKYKHQEAFLP 5804193
5804192
IIEQVIEAMGGTNIADVFPSSKLLYMISRFRSRLERSHQDADVILENIIY 5804043
5804042
EHRVRREVAKTDEESEAEDLLDVLLNLQNHGDLGFPLTTDSIKATIL 5803902 (0)
5803416
ELFTAGSDSSSTLMEWTMSEMLRNPRVMRKAQEEVRQVFSNTEDVDETCL 5803267
5803266
HNLEFLKLIIKETLRLHPPAPFIPRECNKTCEINGYVIQAKSKVMINAWA 5803117
5803116
IGRDSDHWTEAEKFYPERFLDSSIDYMGTNFEFIPFGAGKRMCPGILFGI 5802967
5802966
ATVELPLAQLLYHFDWKLPNGDLSEDLDMNEVFVGTVRRKHQLNVIPIPF 5802817
5802816 YPSPLQ* 5802796
>CYP71D43
LG_VII.12 (-) 5753477-5751477
71D like 48% to 71D4
94% to LG_VII.3
estExt_fgenesh1_pg_v1.C_LG_VII0660|Poptr1 gene model seems
correct
$
5753477 MEQVFQFIQILVPFLLLIFTVLRLWKKSQGNNSSTPPPP
5753361
5753360
PPPPPGPWKLPLIGNLHQLLGSLPHQVLRDMANKYGPVMQLQIGEVPTVI 5753211
5753210
ISSPEAAKEAMKTQEINFVDRPCLLVAKVMYYNSKDIGFAPYGDYWRQMK 5753061
5753060
KVCVLELLSAKRVKSFRSIREEEVSNFIRAIYSRAGSPINLSKMMFDLLN 5752911
5752910
GITARASVGKKYKHQEAFLPIIEQVIEAVGGTNIADVFPSSKLLYMISRF 5752761
5752760
RSRLERSHQDADVILENIIYEHRVRREVAKTDEESEAEDLLDVLLNLQNH 5752611
5752610 GDLGFPLTTDSIKATIL 5752560 (0)
5752097
ELFAGGSDTSSTLMEWTMSEMFRNPRVMRKAQEEVRQVFSNTENVDETCL 5751948
5751947
HNLEFLKLIIKETLRLHPPVPFIPRECNKTCEINGYVIQAKSRVMINAWA 5751798
5751797
IGRDSDHWTEAEKFYPERFLDSSIDYKGTNFDFIPFGAGKRMCPGILFGI 5751648
5751647
ATVELPLAQLLYHFDWKLPNGDLLEDLDMNEVFGGTVRRKHQLNLIPIPF 5751498
5751497 YPSPLQ* 5751477
>CYP71D24P
scaffold_228 (-) 35436-34671
71D like 2 copies exon 1 pseudogene
eugene3.02280006|Poptr1
$
35436 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQLASSATMP 35266
35266 HYLCAHWARKYG
35231
35234 RTPPLGPWKLPLIGNIHQLASSATMP 35157
35180 LASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVIISSPDAAKEVLKT
35040
35039
QEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRKACIWGLFSATRK 34890
34889 LSFRSIREEEVSNLISSIRSKEGSPINLRELLLDLSNETITRTSIGKKCK
34740
34739 NKARFLHTIEQVSKSVGGVNIFL 34671
>CYP71D25Pv1
scaffold_228 (-) 39599-38903
71D like 62% to LG_VII.22 41% to 71D4
exon 1 pseudogene
eugene3.02280007|Poptr1 100% to scaffold_228 (-)
34668-35436
$
39559 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQ
39413
39412
LASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVIISSPDAAKEVLKT 39272
39271
QEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRKACIWGLFSATRK 39122
39121
LSFRSIREEEVSNLISSIRSKEGSPINLRELLLDLSNETITRTSIGKKCK 38972
38971 NKARFLHTIEQVSKSVGGVNIFL 38903
>CYP71D25Pv2
scaffold_1911 (+) 7175-8184
71V like 37% to 71V5
fgenesh1_pg.C_scaffold_1911000001|Poptr1 contains a
duplication from exon 1
identical in seq to each other in overlap region
almost identical to 71D25P 2 aa diffs probable duplicate sequence
$
7175 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQLASSATMPHYLCAHWAKKYG 7381
7381 TPPPGPWKLPLIGNIHQLASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVII 7536
7537
SSPDAAKEVLKTQEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRK 7686
7687 ACIWGLFSATRKLSFRSIREEEVSNLISSIRSKAGSPINLRELLLDLSNE
7836
7837
IITRTSIGKKCKNKARFLHTIEQVSKSVGGVNIVDLFPSARLVHMISNMT 7986
7987
SSLQRLHEETDQMLEDIINERRASRVEKKTGENKIEAGDDLLDVLLNLQD 8136
8137 DGNFKVKTDSIKSIIL 8184 (0)
>CYP71D36Pv1
LG_I (-) 29491328-29490406
71D like EXXR 61% TO CYP71D26
eugene3.00012614|Poptr1 mid
regioN to EXXR 71D like
29491328 DTVDVLLNL*GQADLEFTLTTKNIKAIIL 29491242 (0)
29490942 DMFVAGSETSSRTVEWAK 29490889
29490889 TELAKHPKVMEKAQAEARQVFANVDEAGLHKLDHLQLLIKETL 29490761
29490759 NIPPIPLLFPRESKEACKITGYDMPAQSKVTVNIWGIGREPINWTEPERFYP
29490604
29490603 ERFLDSSMDYKGIDFKFIPFGAG 29490535
29490534 ILFGMATYVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS*
29490406
>CYP71D36Pv2
scaffold_1517 (-) 9207-8703
3 aa diffs to 71D36P, duplicate seq.
eugene3.15170002|Poptr1
gene model wrong
$
9207 LNEMAKHPKVMEKAQAKARQVFANVDEAGLHKLDHLQLLIKETL 9076
9074 NIPPIPLLFPRESKKACKITGYDMPAQSKVTVNIWGIGREPINWTEPERFYP
8919
8918 ERFLDSSMDYKGIDFKFIPFGA 8853
8852 GILFGMATVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS*DEK*SLH
8703
>CYP71D35P
LG_I (-) 29497891-29497622
71B like pseudogene no model exists
98%
to scaffold 1517, 93% to scaffold 6967, 47% to 71B18
$
29497891 KVTVNIWRIGREPINWTEPER 29497829
29497826 FYPERFLDSSMDYKGIDFKFIPFGAGILFGMAT
29497728
29497726
TVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS 29497622
>CYP71D36Pv3
scaffold_6967 (-) 2023-1561
100% to 71D36Pv2
eugene3.69670001|Poptr1 gene model wrong
$
2023 LNEMAKHPKVMEKAQAKARQVFANVDEAGLHKLDHLQLLIKETL 1892
1890
NIPPIPLLFPRESKKACKITGYDMPAQSKVTVNIWGIGREPINW 1759
1758
TEPERFYPERFLDSSMDYKGIDFKFIPFGAGILFGMATVVLPLAQLLCFL 1609
1608 DWIPPNGLRSADLVTS 1561
>CYP71D34
LG_I.29 (-) 29516258-29514652
71D like new 52% to 71D10 93% to LG_I.2
eugene3.00012615|Poptr1 gene model short one frameshift
$
29516258
MEQLQTPPSLVLLPSLLFIFMVLRMLKKSKSKDLTPNLPPGPRKLPVIGN 29516109
29516108 LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMKVHD
29515959
29515958 INFAHRPHLPVGQIIFYNCTDIATA 29515884
29515882
AAYGDYWRQLRKVSILELLSPKRVQSFRSIREEEVSSLIGSISSSAGSII 29515733
29515732
NLSRMLFSVAYNITTRAAFSKLRKEEEIFVPLVQGIIQVGAGFNISDLFP 29515583
29515582 SIKLIPWITGMRSRMERLHQEADRILESIINDHRARKAEGNSSNESKADN
29515433
29515432 LVDVLLDLQEHGNLDFSLTTDNIKAVIL
29515349 (0)
29515263
DIFIAGTETSSTILQWAMSELLKHPEVMEKAQTEVREAFGKDGSVGELNY 29515114
29515113
LKMVIKETMRLHPPLPLLLPRECREECGINGYNIPIKSRVLVNVWAIGRD 29514964
29514963 SDYWVEAERFHPERFLDSSIDYKGVNFEFTPFGAGRR
29514853
29514852
MCPGILFGISNVDLLLANLLYHFDWKLPGDMEPESLDMSEAFGATVRR 29514709
29514708 KNALHLTPILHHPHPVRS* 29514652
scaffold_11610
(-) 1395-784
71D LIKE 96% to 71D34
eugene3.116100001|Poptr1 gene model wrong exon 2 only
$
1395
DLFIAGTETSSTILEWAMSELLKHPEVMEKAQTEVREVFGKDGSVGELNY 1246
1245
LKMVIRETMRLHPPLPLLLPRECREECGINGYNIPIKSRVLVNVWAIGRD 1096
1095
SNYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRRMCPGILFGISNVD 946
945 LLLANLLYHFDWKLPGDMKLESLDMSEAFGATVRRKNALHLTPILHQPHPVRS*
784
>CYP71D33P
LG_I
(-) 29524379-29524047
71B
like pseudogene
eugene3.00012616
[Poptr1:550175] model short
$
29524379
VWAIGRDSDYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRR 29524248
29524247
MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGATVRR 29524104
29524103 KNALHLTPILHHPHPVRS* 29524047
>CYP71D32P
LG_I
(-) 29530020-29529344
71B
like pseudogene
eugene3.00012617
[Poptr1:550176]
$
29530020
MEQPQIPSCLVLLPSLLFIFMVLRMLKKSKTKDLTPNLPPGPRKLPVIGN 29529871
29529870
LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMK 29529730
29529676 VWAIGRDSNYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRR
29529545
29529544
MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGATVRR 29529401
29529400 KKALHLTPILHHPHPVRS* 29529344
>CYP71D31P
LG_I (-) 29532948-29532496
71B like pseudogene C-term
eugene3.00012618|Poptr1
$
29532948
VIRETMRLHPPLPLLLPRECREECGINGYNIXIKSRVLVNAWAIGRDSNY 29532799
29532798 WVEAERFHPERFLDSSIDYKGVNFEFTPFGAGRR
29532697
29532696
MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGAAVRR 29532553
29532552 KNALHLTPILHHPHPVRS* 29532496
>CYP71D30P
LG_I
(-) 29540735-29539563
71B
like PSEUDOGENE 2 models are from same gene
eugene3.00012620
[Poptr1:550179] gene start
eugene3.00012619
[Poptr1:550178] gene end
$
29540735
MEQLQIPTSLVLLSSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGN 29540586
29540585 LHQLFCSLPHHRLR 29540544
(sequence gap)
29539973
LLPRECREECGINGYNIPIKSRVLVNVWAIGRDSNYWVEAERFQPERFLD 29539824
29539823 SSIDYKGVNFEFTPFGAGRR 29539764
29539763
MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGATVRR 29539620
29539619 KNALHLTPILHHPHPVRS* 29539563
>CYP71D29
LG_I.6 (-) 29575643-29574039
71D like 54% to 71D11 93% to LG_I.29
eugene3.00012625
[Poptr1:550184] gene model wrong gene has one frameshift
$
29575643
MEQLQIPTSLVLLSSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGN 29575494
29575493 LHQLFGSLPHHRLRDWP 29575443
29575443 EKHGPIMHLQLGQVQTIVISSPETAEQVMKVHDINFAHR
29575327
29575326
PHLLAAQIIFYNCTDIATAAYGDYWRQLRKISILELLSPKRVQSFRSIR 29575180
29575179
EEEVSSLIGSISSSAGSIVNLSRMLFSVAYNITTRAAFSKLRKEEEIFVP 29575030
29575029
LVQGIIQVGAGFNVGDLFPSIKLLPWISGMRSRMERLHQEADRILESIIK 29574880
29574879 EHRARKAEGNSSNESKADDLVDVLLDLQEHGNLDFSLTTDNIKAVIL
29574739 (0)
29574653
DLFIAGTETSSTILEWAMSELLKHPEVMEKAQTEVREVFGKDGSVGELNY 29574504
29574503
LKMVIRETMRLHPPLPLLIPRECREECGINGYNIPIKSRVLVNVWAIGRD 29574354
29574353
SNYWVEAERFQPERFLDSSIDYKGVNFEFTPFGAGRRRMCPGIMFGISNV 29574204
29574203
DLLLANLLYHFDWKLPGDMKPESLDMSEAFGAAVRRKNALHLTPILHHPH 29574054
29574053 PVRS* 29574039
>CYP71D29
scaffold_1813 (+) 7926-8444
71D29 100% match duplicate seq.
grail3.1813000101|Poptr1 most of exon 2
$
7926 LKHPEVMEKAQTEVREVFGKDGSVGELNYLKMVIRETMRLHPPLPLLIPR
8075
8076
ECREECGINGYNIPIKSRVLVNVWAIGRDSNYWVEAERFQPERFLDSSID 8225
8226
YKGVNFEFTPFGAGRRRMCPGIMFGISNVDLLLANLLYHFDWKLPGDMKP 8375
8376 ESLDMSEAFGAAVRRKNALHLTP 8444
>CYP71D29-se3[1]
scaffold_1517 (-) 4845-4513
97% to 71D29 100% to scaffold_19234
eugene3.15170001|Poptr1
$
4845
MEQLQIPTSLVLLPSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGN 4696
4695
LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMKVHD 4546
4545 INFAHRPHLLV 4513 (seq gap here)
>CYP71D29-se1[1]
scaffold_19234 (-) 368-1
95% to 71D29 N-term no gene model at
JGI
$
368
MEQLQIPTSLVLLPSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGNLHQLFG 201
200
SLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMKVHDINFAHRPHLLVGQIIFYNCTDI 3
>CYP71D29-se2[1]
scaffold_18933 (+) 920-1111
71B like N-term no gene model at JGI
may be same gene as scaffold_19234 1 aa
diff to 71D29
$
920 MEQLQIPTSLVLLPSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGNLHQLFG
1087
1088 SLPHHRLR 1111
>CYP71D46P
scaffold_724 (+)
13518-14366
CYP71D46P 97% to 71D29 exon 1
eugene3.07240004|Poptr1 missing end of exon 1 and all of exon
2
$
13518
MEQLQIPTSLVLLSSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGN 13667
13668
LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMKVHD 13817
13818
INFAHRPHLLVGQIIFYNCTDIATAAYGDYWRQLRKISIVELLSPKRVQS 13967
13968 FRSIREEEVSSLIGSISSSAGSIINLSRMLFSVAYNITTRAAFSKLRKEE
14117
14118
EIFVPLVQGIIQVGAGFNIGDLFPSIKLLPWITGMRSRMERLHQEADRIL 14267
14268 ESIIKEHRARKAEGNSSNESKADDLVDVLL
14357
>CYP71D45P
scaffold_724 (+) 1454-2065
2 aa diffs to 71D31P 95% to 71D29 exon
2
eugene3.07240001|Poptr1 C-term
note exon 2 precedes exon 1 on scaf_724, could be a
nearly intact gene if rearranged
$
1454
DLFIAGTETSSTILEWAMSELLKYPEVMEKAQTEVREVFGKNGSVGELNY 1603
1604
LNMVIRETMRLHPPLHLLLPRECREECGINGYNIPIKSRVLVNAWAIGRD 1753
1754
SNYWVEAERFHPERFLDSSIDYKGVNFEFTPFGAGRRMCPGILFGISNVD 1903
1904
LLLANLLYHFDWKLPGDMKPESLDMSEAFGAAVRRKNALHLTPILHHPHPVTS* 2065
LG_I.2 (-) 29606262-29604787
71D like new 55% to 71D11 93% to LG_I.29
eugene3.00012629|Poptr1 gene model short with one frameshift
$
29606262 MEQLQIPSSLVLLPSLLFIFMVLRMLKKSKTKDLTPNLPPGPRKLPVIGN
29606113
29606112 LHQLFGSLPHHRLRDWP 29606062
29606062
EKQGPIMHLQLGQVQTIVISSPETAEQVIKVHDINFAHR 29605946
29605945
PHVLAAQIIFYNCTDIATAAYGDYWRQLQKISILELLSPKRVQSFRSIR 29605799
29605798 EEEVSSLIGSISSSAGSIVNLSRMLFSVAYNITTRAAFSKLRKEEEIFVP
29605649
29605648
LVQGIMQVGAGFNISDLFPSIKLLPWITGMRSRMERLHQEADRILESIIK 29605499
29605498
EHRARKAEGNSSNESKVDDLVDVLLDLQEHGNLDFSLTTDNIKAVIL 29605358 (0)
29605272
DLFIAGTETSSTILEWAMSELLKHPEVMEKAQTEVREVFGKDGSVGELNY 29605123
29605122 LKMVIRETMRLHPPLPLLFPRECREECGINGYNIPIKSRVLVNVWAIGRD
29604973
29604972
SNYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRR 29604862
29604861 MCPGILFGISNVDLLLANLLYHFDW 29604787
(sequence gap)
>CYP71D28v2
scaffold_2240 (-) 487-3
CYP71D28v2 1 aa diff to
71D28, duplicate seq.
fgenesh1_pg.C_scaffold_2240000001|Poptr1
$
487
MEQLQIPSSLVLLPSLLFIFMVLRMLKKSKTKDLTPNLPPGPRKLPVIGN 338
337 LHQLFGSLPHHRLRDWP 287
287
EKQGPIMHLQLGQVQTIVISSPETAEQVIKVHDINFAHRPHVLAAQIIFY 138
137
NCTDIATAAYGDYWRQLQKISILELLSAKRVQSFRSIREEEVSSL 3
>CYP71D26
scaffold_228.17 (-) 105939-104306
71D like 54%
to 71D4 60% to LG_I.29
fgenesh1_pg.C_scaffold_228000012|Poptr1 gene model wrong at
N-term
$
105939 MEHQFASTILVTILVTSISYVILWIWKKSKV 105847
105846
RNSNLNLPPVPSQLPLIGNMHNLVGSLPHHRFRDMAKKYGPVMHLRLGEV 105697
105696 THVLISSAETAKEVMKTHDLIFAQRPAPIAAKILSYNCMDIAFAPYGDYW
105547
105546
RMLRKLCVLELLSAKRVRSFRSIREEEVWRVVRSISSSAWSPVNFSRMIS 105397
105396
SLTYCITSRAAFGKICKGEDVFIPAVKEANKAAGGYSLADLYPSIKLLSV 105247
105246
ISGMRLTLEKIHARLDKILQEIINEHRSKKEMAAKTGADEEEHDLVDVLL 105097
105096 GIQDQGDTEFSLTDNNIKAIIL 105031 (0)
104929
DLFVAGTDTSSTTVVWAMSEMVKHPRVMKKAQEEVRQVFGDKGTVDE 104789
104788
AGLHELNYLKLAIKETFRLHPPVPLLLPRESREDCKINGYDIPIKSKVIV 104639
104638
NVSAIGRDPTYWNEPERFYPERFLDNSIEYKGTDFELLPFGAGRKMCPGI 104489
104488 LFGTVNVELPLAQLLFHFDWNLPKGPKPEDLDMSEVFGAVVTRKNDLCLI
104339
104338 PIPHHPLPGN* 104306
LG_VIII.13 (+) 5139033-5140582
71D like 51%
to 71D4 59% to LG_I.29
fgenesh1_pg.C_LG_VIII000713|Poptr1 gene model short,
missing N-term
5139033 SDEEAAKEVMKTHDVTFAQRPYFLVSDIISYNSTNIAFSPFGDYWRQVRK
5139182
5139183 ICILELLRAKRVQSFQAIREEEVSNLISSINYNAGLPINLTKLLYTISFD
5139332
5139333 STSRASFGKKSKDHEAFKSVMEEIMEVSKSFIISDIFLSIKLLHLISGTR
5139482
5139483 QKLKILHQKADQILESIINEDRAREAPSNEIEADDLVHVLLNLLGHGKLE
5139632
5139633
FPLTTDNIKSVNL 5139671 (0)
5139985 DMFLGGTETSSTVLDWAIAGLLRNPRVMKKAQAEVRQVFCTAGNVDETDL
5140134
5140135 EKLKYLELVVKETLRLHPPLSLLLPRESREDCEINGFKIPAKIKVVINVW
5140284
5140285
AIGRDPAYWNEPEK 5140326
5140328 FHPERFHDSLIDYNGANFEYIPFGAGRRMCPGISFGIANVEYPLAHLL
5140471
5140472 YHFNWKLPNGLKPENLDMTEVFGVAVRRSALDSHFV* 5140582
>CYP71D22
scaffold_122.5 (+) 199976-202441
71D like 45%
to 71D10 54% to LG_XI.14
eugene3.01220022|Poptr1 gene model seems correct
199976 MEWQLPSFSALSTFLLFMTFLLLKIFKEPKTNHNSGRNPPPGPKALRIIG
200125
200126 NLHQLGGGPSLLIRLRELAERYGPIMLLQVGEVPTIIISSPELAQEVMKT
200275
200276 HESCFDERPPFFAGNVYFYGNRDLIFAPYGDYWKQLRKIVTMEVLSPIRV
200425
200426 RTFRATREEEVASLIRTISSQQGSAINLSQILFSFTYSIISRISVGRNSK
200575
200576 NQKEFATIVKDFSTISKELSLAAGGANVVDLYPSQKLLHMFSWRKFRLGR
200725
200726 EHKKANKILERLIKERKASKRDKEIAENEVEDLLDVLLNLQLTVGLDSPL
200875
200876 TDECVKALLL 200905 (0)
201818 DMFAGGGDTTLTVLEWAMSELMKNPRVREKAQKEVRALFNDVGYIDESNV
201967
201968 HELQFLNLTLKETLRLHPPLCVYPRECKVNCKVAGYDLEAKTRVLINAWM
202117
202118 IGRDPKYWTEPEKFYPERFLDCSTDYKGANFEFLPFGSGKRICPGMAFGI
202267
202268 ATVELPLARLLLHFDWKIPNGIKPEDFDMSEIVSASVTRKNDIVLIPVTC
202417
202418 YDPPVKG* 202441
>CYP71D22P dup.
scaffold_122 (+) 221737-222360
71B like 100% match to scaffold_122.5
exon 2
fgenesh1_pg.C_scaffold_122000024|Poptr1 duplication,
probable error in assembly
$
221737
DMFAGGGDTTLTVLEWAMSELMKNPRVREKAQKEVRALFNDVGYIDESNV 221886
221887
HELQFLNLTLKETLRLHPPLCVYPRECKVNCKVAGYDLEAKTRVLINAWM 222036
222037
IGRDPKYWTEPEKFYPERFLDCSTDYKGANFEFLPFGSGKRICPGMAFGI 222186
222187 ATVELPLARLLLHFDWKIPNGIKPEDFDMSEIVSASVTRKNDIVLIPVTC
222336
222337 YDPPVKG* 222360
>CYP71D37P
LG_XI (+) 12873790-12874128
71B like PSEUDOGENE EXXR 86% TO
LG_XI.14
eugene3.00111064|Poptr1
$
12873790 NDLGTILDETSRRN 12873831
12873831 LLKLKLITEETLRLHPSAPLIPREWRKRCQVNGYDNIHVKSKVLINAWA 12873977
(deletion)
12873977 MLFAIA 12873994 (deletion)
12873994
HHFDWKPIDGLKPENLDMTESLGGATKRKRDLKLIFISYRSLVG* 12874128
>CYP71D38
LG_XI.8 (+) 12875195-12876930
71D like 52%
to 71D9
93% to LG_XI.14
54% to scaffold_122.5
fgenesh1_pg.C_LG_XI001081|Poptr1 gene
model seems correct
$
12875195
MLISLPVFLTILLVISILWTWTKFIKSNKSSSNPPPGPWKLPFIGNLHQL 12875344
12875345
VHPLPHHRMRDLAKKFGPVMQLQVGEVSTVIISSSEAAKEVMKTHEINFV 12875494
12875495 ERPHLLAASVLFYNRKDIAFAPYGEYWRQLRKISILELLSAKRVRSFKSI
12875644
12875645
REEEVSNFIASIYSKEGSPINLSRMIFSLENGITARTSIGNKCKNHEGFL 12875794
12875795
PIVEELAEALGGLNMIDMFPSSKFLYMVSRFRSRLERMHREADEILESII 12875944
12875945
SERRANSALASKMGKNEEDDLLGVLLNLQDHGNLEFQLTTSTIKAVIL 12876088 (0)
12876310 EMFSGGGDTSSTALEWAMSELIKNPRVMEKAQKEVRQVFNDLGTIPDETS
12876459
12876460
LHDLKFLKLIIKETLRLHPPVPLIPRECRKRCDVNGYDIHVKSKVLINAW 12876609
12876610
AIGRDPNCWNEPERFYPERFINVSTDFKGSDFEFIPFGAGKRMCPGMLFA 12876759
12876760
TANTEFPLAQMLYHFDWKPAGGLKPENLDMTESFGGAVKRKQDLKLIPIS 12876909
12876910 YRSLVG* 12876930
>CYP71D46P
LG_VI (+) 6134143-6134717
71D like 52% to 71D38 exon 2
eugene3.00060769|Poptr1
$
6134143 HMFLAGSDTT*FFEWALLEMIRNPRVMTRAQKEVREVGN 6134259
6134264 DESGLREVKYVKLIIKETLRLLPPVALLPRECRQSCKTQGYDIHEK
6134401
6134400 KNKAMINVWAMGRDPGYWIEPEKFYPE 6134480
6134480
RFLTCISDYISTDFEFLPFGA*RRMCPGLLLGKTTGRVATSHLLYHFD*ELPN 6134638
6134637 MDMTEAFSSVIGRKHDLIVIPIPFNS* 6134717
LG_XI.14 (+)
12884309-12886030
71D like 51%
to 71D9 93% to LG_XI.8 54% to scaffold_122.5
eugene3.00111066|Poptr1 gene model short
12884309 MLLSLPVFLTILLVISILWTWTKLIKSNKSSSNPPPGPWKLPFIGNLHQL
12884458
12884459 VHPLPHHRLRDLAKKFGPVMQLQVGEVSTVIISSSEAAKEVV 12884584
12884584 KTHEINFVERPHLLAASVLFYNRKDIAFAPYGEYWRQLRKIS
12884709
12884710 ILELLSAKRVRSFKSIREEEVSNLITSIYSKEGSPINLSRMIFSLENGIT
12884859
12884860 ARTSIGNKCKNQEAFLPIVDELTEAL 12884937
12884938 GGFNMIDIFPSSKFIYMVSRVRSRLERMHREADEILESIISERRANSALA
12885087
12885088 SKMDKNEEDDLLGVLLNLQDHGNLEFQLTTSAIKAIIL 12885201
(0)
12885413 MFSGGGDTSSTALEWAMSELVKNPRVMEKAQKEVRQVFNDIGTIPDEASL
12885562
12885563 HDLKFLKLIIKETLRLHPSGPLIPRECRKRCNVNGYDIHVKSKVLINAWA
12885712
12885713 IGRDPNYWNEPERFYPDRFINVSTDFKGSDFEFIPFGAGKRMCPGMLFAI
12885862
12885863
ANIEFPLAQMLYHFDWKPADGLKPEDLDMTESLGGTVKRKRDLKLIPISYRSLVG* 12886030
>CYP71D40P
LG_XI.18 (+) 12892840-12894465
71D like pseudogene 48% to 71D4 95% to 71D39
eugene3.00111067|Poptr1 gene model short may = scaf_4500 on
the end
$
12892840
MLLSLPVFLTILLVISILWTWTKLIKSNKSSSNPPPGPWKLPFIGNLHQL 12892989
12892990 VHPLPHHRLRDLAKKFGPVMQLQVGEVSTVIISSSEAAKEV
12893112
12893112
DVNFVERPHLLAASVLFYNRKDIAFAPYGEYWRQLRKISILELLSAKRVR 12893261
12893262
SFKSIREEEVSNLITSIHSQEGSPINLSRMIFSLENGITARTSIGNKCKN 12893411
12893412 QEAFLPIVDELTEAL 12893456
12893458 VTGGFNMIDIFPSSKFIYMVSRVRSRLERMHREADEILESIISERRANSA
12893607
12893608
LASKMDKNEEDDLLGVLLNLQDHGNLEFQLTTSAIKAIIL 12893727 (0)
12893963
EMFSGGGDTSSTALEWAMSELIKNPRVMEKAQKEVRQVFNDLGTIPDETS 12894112
12894113
LHDLKFLKLIIKETLRLHPPVPLIPRECRKRYNVNGYDTHVKSKVLINAW 12894262
12894263
AIGRDPNYWNELERFYPERFINVSTDFKGIDFEFIPFGAGKRMCPGKL 12894406
(deletion)
12894406 VKRKRDLKLIPISYRSLVG* 12894465
>CYP71D40P
scaffold_4500 (-) 3903-3733
100% match to 71D40P
$
3903
VLINAWAIGRDPNYWNELERFYPERFINVSTDFKGIDFEFIPFGAGKRMCPGKLFAM 3733
<CYP71AN new
subfamily 6 sequences
>CYP71AN1
LG_XVI.15 (+) 13154281-13155936
71B like 46%
to 71AH1 98% to CYP71AN2
96%
to LG_XVI.24
eugene3.00161321|Poptr1 gene model short at
N-term one stop codon
$
13154281 MT*LLYFQQTWQEIRPKIGLNYLVFFLIFLSFILFLFKLTRSRKLNLP
13154424
13154425
PSPPKLPVIGNIHHLGTLPHRSLQALSEKYGPLMLLHMGHVPTLIVSSAE 13154574
13154575
AASEIMKTHDIVFANRPQTTAASIFFHGCVDVGFAPFGEYWRKVRKISVQ 13154724
13154725
ELLGPKTVQSFHHVREEEAAGLIDKIRFACHSGTSVNISEMLISVSSDIV 13154874
13154875 SRCVLGRKADKEGGNSKFGELTRTFMVQLTAFSFGDLFPYLGWMDTLTGL
13155024
13155025
IPRLKATSRALDSFLDQVIEEHRSLESDGDRCAQTDFLQALLQLQKNGKL 13155174
13155175 DVQLTRDNIIAVVL 13155216 (0)
13155325
DMFVGGTDTSSTMMEWAIAELVRNQTIMRKAQEEVRRIVGKKSKVEANDI 13155474
13155475 EEMGYLKCIIKETLRLHPAAPLLVPRETSASFELGGYYIPPKTRVLVNAF
13155624
13155625
AIQRDPSFWDRPDEFLPERFENNPVDFKGQDFQFIPFGSGRRGCPGALFG 13155774
13155775
VTAVEFMIANLLYWFDWRLPDGATQEELDMSEICGMTAYKKTPLLLVPSL 13155924
13155925 YSP* 13155936
>CYP71AN2v1
LG_XVI.26 (+) 13161865-13163520
71A like 44%
to 71A12 97% to CYP71AN3
45% to 71AH1
eugene3.00161323|Poptr1 gene model short at N-term
$
13161865 MTELLYFQQTWQEIRPKIGLNYFVFFLIFLSFILFLFKLTTSRKLNLP 13162008
13162009
PSPPKLPVIGNIHHFGTLPHRSLQALSEKYGPLMLLHMGHVPTLIVSSAE 13162158
13162159
AASEIMKTHDIVFANRPQTTAASIFFHGCVDVGFAPFGEYWRKVKKISVQ 13162308
13162309
ELLGPKTVQSFHHVREEEAAGLIDKIRFACHSGTSVNISEMLISVSSDIV 13162458
13162459
SRCVLGRKADKEGGNSKFGELTRTVMVQLTAFSFGDLFPYLGWMDTLTGL 13162608
13162609
IPRLKATSRALDSFLDQVIEEHRSLESDGDRCAQTDFLQALLQLQKNGKL 13162758
13162759 DVQLTRDNIIAVVL 13162800 (0)
13162909
DMFVGGTDTSSTMMEWAIAELVRNQTIMRKAQEEVRRIVGKKSKVEANDI 13163058
13163059
EEMGYLKCIIKETLRLHPAAPLLVPRETSASFELGGYYIPPKTRVLVNAF 13163208
13163209
AIQRDPSFWDRPDEFLPERFENNPVDFKGQDFQFIPFGSGRRGCPGALFG 13163358
13163359
VTAVEFMIANLLYWFDWRLPDGATQEELDMSEICGMTAYKKTPLLLVPSL 13163508
13163509 YSP* 13163520
>CYP71AN2v2
scaffold_1240 (-) 6280-5579
71AN like 98% to 71AN2, 3 aa diffs
duplicate seq.
eugene3.12400001|Poptr1
$
6280 MT*LLYFQQTWQEIRPKIGLNYLVFFLIFLSFILFLFKLTRSRKLNLPPS
6131
6130
PPKLPVIGNIHHFGTLPHRSLQALSEKYGPLMLLRMGHVPTLIVSSAEAA 5981
5980
SEIMKTHDIVFANRPQTTAASIFFHGCVDVGFAPFGEYWRKVKKISVQEL 5831
5830
LGPKTVQSFHHVREEEAAGLIDKIRFACHSGTSVNISEMLISVSSDIVSR 5681
5680 CVLGRKADKEGGNSKFGELTRTFMVQLTAFSFGD
5579
>CYP71AN3
LG_XVI.24 (+) 13169467-13171129
71A like 44%
to 71A12
97% to CYP71AN2, 45% to 71AH1
eugene3.00161324|Poptr1 gene model short at N-term
$
13169467 MTELLYFQQTWQEIRPKIGLNYFVFFLIFLSFILFLFKLTTSRKLNLP 13169620
13169621 PSPPKLPVIGNIHHFGTLPHRSLQALSEKYGPLMLLHMGHVPTLIVSSAE
13169770
13169771
AASEIMKTHDIVFANRPQTTAASIFFHGCVDVGFAPFGEYWRKVRKISVQ 13169920
13169921
ELLGPKTVQSFHYVREEEAAGLIDKIRFACHSGTSVNLSEMLISVSNDIV 13170070
13170071
SRCVVGRKADKEGGNSKFGELTRTVMVQLTAFSFGDLFPYLGWMDTLTGL 13170220
13170221
IPRLKATSRTLDSLLDQVIEEHRSLESDGDRCAQTDFLLALLQLQKNGKL 13170370
13170371 DVQLTRDNIIAVVL 13170412 (0)
13170518
DMFVGGTDTSSTMMEWAIAELVRNQTIMRKAQEEVRRIVGKKSKVEANDI 13170667
13170668
EEMGYLKCIIKETLRLHPPAPLLVPRETSASVELGGYFIPPKTRVIVNAF 13170817
13170818
AIQRDPSFWDRPDEFLPERFENNPVDFKGQDFQFIPFGSGRRGCPGALFG 13170967
13170968
VTAVEFMIANLLYWFDWRLPDGATQEELDMSEICGMTAYKKTPLLLVPSL 13171117
13171118 YSP* 13171129
>CYP71AN4
LG_XII.23 (-) 10752455-10550851
71A like 44%
to 71A22 65% to CYP71AN5
45% to 71AH1
fgenesh1_pg.C_LG_XII000892|Poptr1 gene model seems
correct
$
10752455
MDPSIFLHQYWLELDRTILFSVLVLPFLAFCTIYFIKSIQTDKLNLPPSP 10752306
10752305
WKLPLIGNLHQVGRLPHRSLRTLSEKYGPLMLLHLGSSPALIVSSAETAK 10752156
10752155 EILKTHDKAFLDKPQTRAGDALFYGSSDIAFCSYGNYWRQAKKVCVLELL
10752006
10752005
SQRRVQAFQFAREEEVGKMVEKIQISCLSKVAIDLGAAFLTISNDILSRS 10751856
10751855
AFGRTYEEVDGQQLGELWRTAMDLIGEFCFKDFFPLLGWMDVITGLVSKL 10751706
10751705
KRTSKALDAFLDQVIEEHLVSRTEDDISDKKDLVDILLRIQKNGMTDIDL 10751556
10751555 SRDNLKAILM 10751526 (0)
10751444
DMFLGATDTTATTMEWAMAELVNNPSAMKKVQEEVRGVVGEKSKVEEI 10751301
10751300 DIDQMDFLKCIVK
ETLRLHPPLFIGRRTSASLELEGYHIPANLKVLINAW 10751151
10751150
AIQRDPKLWDSPEEFIPERFANKSVDFKGQNHQFIPFGAGRRGCPGIAFA 10751001
10751000
VVEVEYVLANILYWFDWEFPEGITAEDLDMSEVFTPVIRKKSPLRLVPVA 10750851
10750850 HFPKTICN* 10750824
>CYP71AN5
LG_XV.21 (-) 6427585-6426017
71B like 46%
to 71B2
65% to CYP71AN4, 47% to 71J1
fgenesh1_pg.C_LG_XV000592|Poptr1 gene model seems
correct
$
6427585 MIMLNPLLCPFLLLSLLFLLRLVKRDKLNLPPSPPKLPIIGNLHQLGRLH
6427436
6427435
RSLRALSSKYGPLMLLHFGKVPTLIVSSAEVAHEVMKTHDVAFAGRPQTR 6427286
6427285
AADVLFYGCVDVAFCPYGEYWRQVKKICVLELLSQKRVQAFQFVREEEVA 6427136
6427135
NMVEKVRLSCLNGAAVDLSDMFLSVSNNIISRSALGRVYENEGCDESFGG 6426986
6426985
LSRKAIDLIASFCFKDMFHLLGWMDTLTGLVAGLKHTSKALHNFLDQVIE 6426836
6426835
EHESLMNNDESDMKDIVDILLDLQKNGTLDIDLTRENLKAILM 6426707 (0)
6426622
DMFVGGTDTTAAAMEWAMAELVKNPIVMKKAQEEVRRVVGKKSKLCE 6426482
6426481
KHINEMVYLKCVLKESLRLHAPAMIARETSEAVKLQGYDIPPKTRVLINA 6426332
6426331
WAIQRDPKQWERSEEFIPERFTNISVDFKGQHNQFMPFGGGRRLCPGLSF 6426182
6426181
AVIEAEMVLANLLYWFDWNIPHGGNPEDMDMSESHTLIIRKKTPLVLVPV 6426032
6426031 MLSP* 6426017
<CYP71AP
new subfamily about equally similar to 71B and 71D (5 sequences, all named)
LG_XII.7 (+)
10772698-10774358
71B like 44%
to 71B2 98% to CYP71AP2
fgenesh1_pm.C_LG_XII000331|Poptr1 gene model short at
N-term
$
10772698
MSLLQWLKECSKPTLFVVTIFLVVVLKFLMKDKLKKRKLNLPPSPAKLPI 10772847
10772848
IGNLHQLGNMPHISLRGLAKKYGPIIFLQLGEIPTVVISSAGLAKEVLKT 10772997
10772998
HDLVLSSRPQLFSAKHLLYGCTDIAFAPYGAYWRNIRKICILELLSAKRV 10773147
10773148
RSYSYVREEEVARLIRRIAESYPGITNLSSMIALYTNDVLCRVALGRDFS 10773297
10773298
GGGEYDRHGFQKMFDDFQALLGGFSLGDYFPSMEFVHSLTGMKSKLQYTF 10773447
10773448 RRFDQFFDEVIAEHRSSKGKQEEKKDLVDVLLDIQKDGSSEIPLTMDNIKAVIL
10773609 (0)
10773747
DMFAGGTDTTFITLDWAMTELIMNPHVMEKAQAEVRSVVGDRRVVQE 10773887
10773888
SDLPRLNYMKAVIKEILRLHPAAPVLLPRESLEDVIIDGYNIPAKTRIYV 10774037
10774038
NVWGMGRDPELWENPETFEPERFMGSGIDFKGQDFELIPFGAGRRICPAI 10774187
10774188
TFGIATVEIALAQLLHSFDWKLPPGLEAKDIDNTEAFGISMHRTVPLHVIAKPHFD* 10774358
>CYP71AP2v1
LG_XII.9 (+) 10777737-10779397
71B like 43%
to 71B2 98% to CYP71AP1
eugene3.00120825|Poptr1 gene model correct
$
10777737
MSLLQWLKECSKPTLFVVTIFLVVVLKFLMKEKLKKRKLNLPPSPAKLPI 10777886
10777887
IGNLHQLGNMPHISLRGLAKKYGPIIFLQLGEIPTVVISSAGLAKEVLKT 10778036
10778037
HDLVLSSRPQLFSAKHLLYGCTDIAFAPYGAYWRNIRKICILELLSAKRV 10778186
10778187
RSYSYVREEEVARLIRRIAESYPGITNLSSMIALYANDVLCRVALGRDFS 10778336
10778337 GGGEYDRHGFQKMLDNFQALLGGFSLGDYFPSMEFVHSLTGMKSKLQYTF
10778486
10778487
RRFDQFFDEVIAEHRNSKGKQEEKKDLVDVLLDIQKDGSSEIPLTMDNIK 10778636
10778637 AVIL 10778648 (0)
10778786
DMFAGGTDTTFITLDWAMTELIMNPHVMEKAQAEVRSVVGDRRVVQE 10778926
10778927
SDLPRLNYMKAVIKEILRLHPAAPVLLPRESLEDVIIDGYNIPAKTRIYV 10779076
10779077
NVWGMGRDPELWENPETFEPERFMGSGIDFKGQDFELIPFGAGRRSCPAI 10779226
10779227
TFGIATVEIALVQLLHSFDWKLPPGLEAKDIDNTEAFGVSLHRTVPLHVI 10779376
10779377 AKPHFN* 10779397
>CYP71AP2v2
scaffold_9416 (-) 764-3
1aa diff to 71AP2 duplicate seq. see
LG_XII
fgenesh1_pg.C_scaffold_9416000001|Poptr1 gene model
short, runs off the end
$
764
MSLLQWLKECSKPTLFVVTIFLVVVLKFLMKEKLKKRKLNLPPSPAKLPI 615
614
IGNLHQLGNMPHISLRGLAKKYGPIIFLQLGEIPTVVISSAGLAKEVLKT 465
464
HDLVLSSRPQLFSAKHLLYGCTDIAFAPYGAYWRNIRKICILELLSAKRV 315
314 RSYSYVREEEVARLIRRIAESYPGITNLSSMIALYTNDVLCRVALGRDFS
165
164
GGGEYDRHGFQKMLDNFQALLGGFSLGDYFPSMEFVHSLTGMKSKLQYTF 15
14 RRFD 3
>CYP71AP3
LG_XII.11 (+) 10791887-10793547
71B like 42%
to 71B2 95% to CYP71AP1
fgenesh1_pm.C_LG_XII000332|Poptr1 gene model wrong on
N-term
$
10791887
MALLQWLKECSKPTLLVVTIFLVVVLKFLMKEKLKKRKLNLPPSPAKLPI 10792036
10792037
IGNLHQLGNMPHISLRGLAKKYGPIIFLQLGEIPTVVISSAGLAKEVLKT 10792186
10792187
HDLVLSSRPQLFSAKHLFYGCTDIVFAPYGAYWRNIRKICILELLSAKRV 10792336
10792337 HWYSFVREEEVARLIRRIAESYPGITNLSSMIALYANDVLCRIALGKDFS
10792486
10792487
GGGEYDRHGFQKMLDDYQALLGGFSLGDYFPSMEFVHSLTGMKSKLQYTF 10792636
10792637
RRFDQFFDEVIAEHRSSKGKQEEEKDLVDVLLDIQKDGSSEIPLTMDNIK 10792786
10792787 AVIL 10792798 (0)
10792936
DMFAAGTDTNFITLDWAMTELIMNPHVMEKAQAEVRSVVGDRRV 10793067
10793068
VQESDLRRLNYMKAVIKEIFRLHPAAPVLVPRESLEDVVIDGYNIPAKTR 10793217
10793218
IYVNVWGMGRDPELWENPETFEPERFMGSGIDFKGQDFELIPFGAGRRSC 10793367
10793368
PAITFGVATVEIALAQLLHSFDWKLPPGLEAKDIDNTEAFGISMHRTVPL 10793517
10793518 HVIAKPHFD* 10793547
>CYP71AP4
LG_XV.20 (+) 6453904-655556
71B like 44%
to 71B2 90% to CYP71AP1
eugene3.00150650|Poptr1 gene model seems correct
$
6453904
MAFLQWLKESSSPTLLFVTIFLLVALKFLVKGKLKNSKLNLPPSPAKLPI 6454053
6454054
IGNLHQLGNMPHISLRWLAKKYGPIIFLQLGEIPTVVISSVRLAKEVLKT 6454203
6454204
HDLVLSSRPQLFSAKHLFYGCTDIAFAPYGAYWRNIRKICILELLSAKRV 6454353
6454354
QWYSFVREEEVARLIHRIAESYPGTTNLSKMIGLYANDVLCRVALGRDFS 6454503
6454504
GGGEYDRHGFQKMLDDYQALLGGFSLGDYFPSMEFVHSLTGMKSKLQHTV 6454653
6454654
RRFDQFFDKVITEHQNSEGKQEEKKDLVDVLLDIQKDGSSEMPLTMDNIK 6454803
6454804 AVIL 6454815 (0)
6454945
DMFAAGTDTTFITLDWTMTELIMNPQVMEKAQAEVRSVVGDRIVVQESDL 6455094
6455095
PRLHYMKAVIKEIFRLHPAVPVLVPRESLEDVIIDGYNIPAKTRIYVNVW 6455244
6455245
GMGRDPELWENPETFEPERFMGSSIDFKGQDFELIPFGAGRRSCPAITFG 6455394
6455395 IATVEIALAQLLHSFDWELPPGIKAQDIDNTEAFGISMHRTVPLHVIAKP
6455544
6455545 HFN* 6455556
<CYP71AQ
new subfamily one sequence
>CYP71AQ1
LG_XV.25 (+) 6442643-6444377
71A like 49% to 71AJ1 47% to 71A12, 47% to 71A26
fgenesh1_pg.C_LG_XV000594|Poptr1 gene model seems
correct
$
6442643
MILHPYSLACLLFIFVTKWFFFNSARNKNLPPSPLKIPVVGNLLQLGLYP 6442792
6442793
HRSLQSLAKRHGPLMLLHLGNAPTLVVSSADGAHEILRTHDVIFSNRPDS 6442942
6442943
SIARRLLYDYKDLSLALYGEYWRQIRSICVAQLLSSKRVKLFHSIREEET 6443092
6443093
ALLVQNVELFSSRSLQVDLSELFSELTNDVVCRVSFGKKYREGGSGRKFK 6443242
6443243
KLLEEFGAVLGVFNVRDFIPWLGWINYLTGLNVRVEWVFKEFDRFLDEVI 6443392
6443393
EEFKANRVGVNEDKMNFVDVLLEIQKNSTDGASIGSDSIKAIIL 6443524 (0)
6443766
DMFAAGTDTTHTALEWTMTELLKHPEVMKKAQDEIRRITGSKIS 6443897
6443898
VTQDDVEKTLYLKAVIKESLRLHPPIPTLIPRESTKDVKVQGYDILAKTR 6444047
6444048
VIINAWAIGRDPSSWENPDEFRPERFLESAIDFKGNDFQFIPFGAGRRGC 6444197
6444198
PGTTFASSVIEITLASLLHKFNWALPGGAKPEDLDITEAPGLAIHRKFPL 6444347
6444348 VVIATPHSF* 6444377
<CYP71 two unassigned
pseudogenes about 9kb apart
>CYP71-un1 potri
LG_XIV (-) 9223410-9223207
71 like 50% to 71AN4
57% to BM407381.1 potato roots EST
eugene3.00141134|Poptr1
$
9223410
DIFSGGAGTTTTTIEWARLELMKSPRVMEKEQAELRQAFKGKSKVEEVDI 9223261
9223260 ENLDYLKAIIK*TLCLHP 9223207
>CYP71-un2 potri
LG_XIV (-) 9234477-9232794
71 like 42% to 71D40P
eugene3.00141136|Poptr1
$
9234477
MAMRRASSISHTLLQLHCSFPALSTFAIQIFLFMLVKYWKKCKTSKLPLI 9234328
9234327 GNLHQLNGGPLPHHGTTELSKKYGPAMQLQ
9234238
(sequence not identified here)
9233132 GMFAAGSDTTVTTIEWAMSELLSGPGGLDRAQTEV*QVFEGEN*
9233001
9233000 KSRTLGNQIIRDQLSKNLFRLHPPVPLLPREVTENQWAYDTREKQNDYNV
9232851
9232850 WAISRDPQQGIDANSFQPE 9232794
3 more weak pseudogenes
similar to the CYP71 family
>CYP71-un3
potri
LG_VI (-) 1075479-1074052
71 like pseudogene 40% to 71D26
$
1075479 SSPKMAGEVLKTRDIIFA* 1075423
1075422 RPERLASKILKFRIKDIVFSL* 1075357
1075356 GGYWRQMRKICTMELLSPK 1075300
XXXXXXXXXX
1075277
DEVSKLIKSIQAFTRRAMDFNEKIIFLTSVITCKTTLGN*CKD* 1075146
1075145 DAMISLTGEGSHLA* 1075101
1075100
GFNIVDLYPSLEFLLAIGIKLKLKKVLDQINTTLGSIINEHKEKLKGNIE 1074951
1074950 AVEEDLVDVLL 1074918
XXXXXXXXXXXXXXXXXXXXXXXXXXXX
1074462 TISSSTIIDWAMTEMMRNPRVLKKS*AEIRQALK*
1074358
1074357 NKTITEADIQELNYLKSVIK* 1074295
1074294 TMRLHPPIPLLLLIESREIC* 1074232
1074231
IDRYVTPIKTKVMVNAWAKMRDPEYWQNTENFIPKILNS 1074115
1074114 NTTLDFIGTNFTYMPFRVGKR 1074052
>CYP71-un4 potri
LG_XVI (+) 158690-159330
71 like pseudogene 47% to LG_VI (-) 1074348
40%
to 71D26
eugene3.00160030|Poptr1 gene model wrong
$
158690 LIEFVRASAGRSMEFTEEVFFLTRV 158764
158812
DTMISLTKERSLLAGGFNVVDDLYPSLECLQGVVGMKAEKVLAQINQILDNI 158967
158968 NNEHKEMGNSETIEEDLVDMLVRLQEDGTFKCPIE
159072
159072 IFNIKVSL* 159098
159246 DMLFAGTGVHRM 159281
159280 WAMKEIMKNPRVVKKAQ 159330
>CYP71-un5
potri
LG_XI (-) 12867601-12867395
71D like pseudogene no gene model at
JGI
53%
to CYP71D31P
$
12867601
ENDHFEYIPFGSGRRVCPYGVAIVEVTLASFLYLFEWELSSGMVPENLDM 12867452
12867451 DEAFGIAFRRKNNMCLIPI 12867395
<CYP73
family 4 sequences
>CYP73A42
LG_XIII (-)12825303-12821368
85% to 73A5, 2 introns, 96% TO 73A43
10989681-10992498
eugene3.00131281|Poptr1 gene model correct 98%
to 73A13 98% to 73A16
surrounding genes do not match with 73A43, not
from a genome duplication
$
12825303 MDLLLLEKTLLGSFVAILVAILVSKLRGKRFK
12825208
12825207 LPPGPIPVPVFGNWLQVGDDLNHRNLTDLAKKFGDIFLLRMGQRNLVVVS
12825058
12825057
SPDLSKEVLHTQGVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRI 12824908
12824907
MTVPFFTNKVVQQYRYGWEEEAAQVVEDVKKNPEAATNGIVLRRRLQLMM 12824758
12824757
YNNMYRIMFDRRFESEDDPLFNKLKALNGERSRLAQSFDYNYGDFIPILR 12824608
12824607
PFLRGYLKICQEVKERRLQLFKDYFVDERK 12824518 (2)
12823513
KLASTKNMNNEGLKCAIDHILDAQKKGEINEDNVLYIVENINVA 12823382 (1)
12821967
AIETTLWSIEWGIAELVNHPEIQKKLRHELDTLLGP 12821860
12821859
GHQITEPDTYKLPYLNAVIKETLRLRMAIPLLVPHMNLHDAKLGGFDIPA 12821710
12821709
ESKILVNAWWLANNPAHWKNPEEFRPERFLEEEAKVEANGN 12821587
12821586
DFRYLPFGVGRRSCPGIILALPILGITLGRLVQNFELLPPPGQSKIDTAE 12821437
12821436
KGGQFSLHILKHSTIVAKPRSF* 12821368
>CYP73A43
LG_XIX (+)10989681-10992498
84% to 73A5, 2 introns, 96% TO 73A42
12825303-12821368
estExt_Genewise1_v1.C_LG_XIX2612|Poptr1 gene
model corrrect
RNA helicase upstream
$
10989681
MDLLLLEKTLLGSFVAILVAILVSQLRGKRFKLPPGPLPVPVFG 10989812
10989813
NWLQVGDDLNHRNLTDLAKKFGDILLLRMGQRNLVVVSSPDLAKEVLHTQ 10989962
10989963
GVEFGSRTRNVVFDIFTGKGQDMVFTVYGEHWRKMRRIMTVPFFTNKVVQ 10990112
10990113
QYRYGWEEEAAQVVEDVKKNPEAATHGIVLRRRLQLMMYNNMYRIMFDRR 10990262
10990263
FESEEDPLFNKLKALNGERSRLAQSFDYNYGDFIPILRPFLRGYLKICKE 10990412
10990413
VKERRLQLFKDYFVEERK 10990466 (2)
10990867
KLGSTKSMSNEGLKCAIDHILDAQKKGEINEDNVLYIVENINVA 10990998 (1)
10991899
AIETTLWSIEWGIAELVNHPEIQKKLRDELDTVLGPGHQITEPDTNKLPY 10992048
10992049
LNAVIKETLRLRMAIPLLVPHMNLHDAKLGGFDIPAESKILVNAWWLANN 10992198
10992199
PAKWKNPEEFRPERFFEEEAKVEANGNDFRYLPFGVGRRSCPGIILALPI 10992348
10992349
LGITLGRLVQNFELLPPPGQSKIDTSEKGGQFSLHILKHSTIVAKPRSF* 10992498
>CYP73A44
Scaffold_164 (+)432328-434026
65% to 73A5, 1 intron 66% to 73A42 and 73A43
79% to
73A27 tobacco
eugene3.01640067 |Poptr1 gene model correct
Carbamoyl-phosphate
synthetase upstream, Nuclear exosomal RNA helicase downstream
$
432328 MASFVTKSMGFTLLAVASVSCIKFACPNLSTYFSPLPISVILPLLPLIVYLFSSVFTK 432501
432502
SSTGDLPPGPVSYPMFGNWLQVGNDLNHRLLASMSQTYGPVFLLKLGSKN 432651
432652
LAVVSDPELANQVLHTQGVEFGSRPRNVVFDIFTGNGQDMVFTIYGEHWR 432801
432802
KMRRIMTLPFFTNKVVNQYSTSWEQEMDLVVDDLRANEKVRTEGIVIRKR 432951
432952
LQLMLYNIMYRMMFDAKFQSQEDPLFVQATRFNSERSRLAQSFEYNYGDF 433101
433102
IPWLRPFLRGYLNKCRDLQQRRLAFFNNYYIEKRR 433206 (2)
433292
KIMAANGEKHKVSCAMDHIIQAQMKGEISEENVLYIVENINVAAIETTL 433438
433439
WSMEWAIAELVNHPTVQRKIRDEIRAVLKGSPVTESNLHELPYLQATIKE 433588
433589
TLRLHTPIPLLVPHMNLEEAKLGGFTIPKESKVVVNAWWLANNPEWWEKP 433738
433739
SEFRPERFLEEERDTEAIVGGKVDFRFLPFGVGRRSCPGIILAMPILGLI 433888
433889
VARLVSNFEMIAPPGMEKIDVSEKGGQFSLHIASHSTVVFKPIKA* 434026
>CYP73A45P
LG_VI (-)5146670-5145983
pseudogene 82% to 73A44
eugene3.00060651|Poptr1 gene model short
eugene3.00060650|Poptr1
Carbamoyl-phosphate
synthetase upstream, Nuclear exosomal RNA helicase downstream
$
5146670
SCAMDHMIHAQMKGVTSEENVLYIAENINVAGIETALWS 5146554
5146553
TAELVNHPTVQKKIRDEITTVLKGKPVTESNLHELPY*QATIKET 5146419
5146416
LHAPIPLLVPRMNLEEAKLGGFTIPKESKVVVNAWWLSNNPDW*EKPSEF 5146267
5146266
RPERVLEEDRDTESVVGGKVDFRFASY 5146186
5146189
LPFGVGRRSCPGIILAMPIMGLVNARLVSNFEMKAPPATGKID 5146061
5146060 ASEEGGQFSLHIANHSAVVFDPIKA*
5145983
<CYP75
family 3 sequences
>CYP75A13
LG_I (+) 19972937-19975122
80% to 75A8 87% to CYP75A12
flavonoid
3',5'-hydroxylase
fgenesh1_pg.C_LG_I001972
[Poptr1:64620]
gene
duplication with Cpn10 upstream and calcium/calmodulin kinase dowstream
$
19972937
MDVDPVLPGKLTLAALLFFISYQFTGSFIRKLLHRYPPGPRGWPIIGAIP 19973086
19973087
LLGDMPHVTLAKMAKKHGPVMYLKMGTRDMVVASNPDAARAFLKTLDLNF 19973236
19973237
SNRPIDGGPTHLAYNAQDMVFADYGPRWKLLRKLSNLHMLGGKALEDWAP 19973386
19973387 VRVTELGHMLRAMCEASRKGDPVVVPEMLTYAMANMIGQIILSRRVFVTK
19973536
19973537
GSESNEFKDMVVELMTSGGFFNIGDFIPSVAWMDLQGIERGMKKLHRRFD 19973686
19973687
VLLTKMIEDHSATSHERKGKPDFLDVLMANQENSDGARLCLTNIKALLL 19973833 (0)
19974493
DLFTAGTDTSSSVIEWALAEMLKNQSILKRAQEEMDQVIGRNRRLVESDI 19974642
19974643
PKLPYLQAVCKETFRKHPSTPLNLPRIADQACEVNGYYIPKGARLSVNIW 19974792
19974793
AIGRDPDVWDNPEVFTPERFFTEKYAKINPRGNDFELIPFGAGRRICAGA 19974942
19974943
RMGIVLVEYILGTLVHSFDWKLPEDVDLNMDEVFGLALQKAVPLSAMVSP 19975092
19975093 RLEPNAYLA* 19975122
>CYP75A12
LG_IX (-) 6112639-6110882
87% to CYP75A13
fgenesh1_pm.C_LG_IX000390|Poptr1
gene
duplication with Cpn10 upstream and calcium/calmodulin kinase dowstream
$
6112639 MVLLWELTMAALFFFINYLLTRCLIRKLSTRQL 6112541
6112540
PPGPRGWPIIGAIPVLGAMPHAALAKMAKQYGPVMYLKMGTCNMVVASTP 6112391
6112390
DAARAFLKTLDLNFSNRPPNAGATHLAYNAQDMVFADYGPRWKLLRKLSN 6112241
6112240
LHMLGGKALEDWAHVRVSELGHMLRAMCEASRKGEPVVVPEMLTYAMANM 6112091
6112090
IGQIILSRRVFVTKGSESNEFKDMVVELMTSAGLFNVGDYIPSVAWMDLQ 6111941
6111940
GIERGMKRLHRRFDVLLTKMMEEHIATAHERKGKPDFLDVLMANQENLDG 6111791
6111790 EKLSFTNIKALLL 6111752 (0)
6111511
NLFTAGTDTSSSIIEWSLAEMLKNPRILKQAQDEMDQVIGRNRRLEESDI 6111362
6111361
PKLPYLQAICKETFRKHPSTPLNLPRIADQACEVNGYYIPKGTRLSVNIW 6111212
6111211
AIGRDPDVWDNPLDFTPERFFSEKYAKINPQGNDFELIPFGAGRRICAGT 6111062
6111061 RMGIVLVQYILGTLVHSFDWKLPKDVELNMDEVFGLALQKAVPLSAMVTP
6110912
6110911 RLEPNAYLA* 6110882
>CYP75B12
LG_XIII (-) 6200373-6197990
75B like 75% to 75B1
estExt_fgenesh1_pg_v1.C_LG_XIII0709|Poptr1
$
6200373
MSPLILYSALLAIFVYCLLQLRSLRDRHGKPLPPGPKPWPLVGNLPHLGP 6200224
6200223
MPHHSMAALAKTYGPLMHLRFGFVDVVVAASASVAAQFLKVHDSNFSSRP 6200074
6200073
PNSGAKHIAYNYQDLVFAPYGPRWRMLRKISSVHLFSAKSLDDFRHIRQ 6199927 (0)
6199385
EEVAVLTGALTRSGPTTPVNLGQLLNVCTANALGRVMLGRRVFGDGSGD 6199239
6199238
GDPKADEFKSMVVEVMVLAGVFNIGDFVPALEWLDLQGVAAKMKKLHKRF 6199089
6199088
DAFLTNIVEEHKTSSSTASVRSEKHTDLLSTLIALKEQQDVDGEEGKLTD 6198939
6198938 TEIKALLL 6198915 (0)
6198643
LQNMFTAGTDTSSSTVEWAIAELIRHPDILAQVKQELDSVVGRDRLVTEL 6198494
6198493
DLAQLTYLQAVVKETFRLHPSTPLSLPRIAAESCEIGGYHIPKGSTVLVN 6198344
6198343 VWAIARDPDVWTKPLEFRPERFLPGGDKADVDVKGNDFELIPFGAGRRIC
6198194
6198193
AGMSLGLRMVQLLTATLIHAFDWDLADGLVPEKLNMDEAYGLTLQRADPL 6198044
6198043 MVHPRPRLSPKVYRTPN* 6197990
<CYP76
family 17 sequences
>CYP76A8
LG_VI (-) 6443611-6441801
76C like 49%
to 76G1
52% to scaffold_28 (+) 3043896
eugene3.00060817|Poptr1 gene model seems correct
$
6443611
MEWLWPSNLSISLSLFSLALLSLLLLRAKSSQKRHPPGPSGWPIFGNLFD 6443462
6443461
LGSMPHRTLTDMRQKYGNVIWLRLGAMNTMVILSAKAATEFFKNHDLSFA 6443312
6443311
DRTITETMRAHGYDQGSLALAPYGSYWRVLRRLVTVDMIVTKRINETASI 6443162
6443161
RRKCVDDMLQWIEEESCKVGKAAGIHVSRFVFLMTFNMLGNLMLSRDLLD 6443012
6443011
PESKVGSEFFDAMMGLMEWSGHANLADFFPWLRRLDLQGLRKNMERDLGK 6442862
6442861
AMEIASKFVKERVEDKIVTSDSRKDFLDVLLEFRGSGKDEPDKLSERDVN 6442712
6442711 IFIL 6442700 (0)
6442415 EIFLAGSETTSSTVEWALTELLCNPESMIKVKAELAQVVRASKKVEES
6442272
6442271
DMENLPFLQAVVKETLRLHPPIPFLVPRRAMQDTNFMGYDIPKNTQVLVN 6442122
6442121
AWAIGRDPDAWDDPSCFMPERFIGKRVDYRGQDLEFIPFGAGRRMCAGVP 6441972
6441971
LAHRVLHLILGSLLHHFDWEFEANVNPASVDKKDRMGITVRKSEPLMAVP 6441822
6441821 KRFNKA* 6441801
>CYP76A9P
scaffold_28 (+) 3042728-3044399
76G like pseudogene 41% to 76G1
52% to LG_VI (-) 6442139
eugene3.00280293|Poptr1 gene model short missing EXXR and
heme signature
$
3042728 MESAWNLLAG*GLFS 3042782
3042784 LKQRKLRVDTKQQQPGPPAWPVF 3042852
3042853
GNIFDLGAIPHQTLYKLKEKYGPVIWLKLGYTNTLVIQSAETAAGLFKNH 3043002
3043003
DLAFSDRKVLLVFTAHNYYQGSLALGWYGPNWRMLQ 3043110
3043113
SLLNKAHDKLITKQIDQTAVLRQKCIDDMIRYIEEDVAEAQAQGESGEIK 3043262
3043263 GAHYLFLMTFNLIGNLVLSRDLVNPRSKDGHKFYDAMNNVMKRAGTRNVA
3043412
3043413
EFLTFLKWLDPQGIMRNMVQDMRQTMRIVEKFVKERTEEWKSGRKKTNDF 3043562
3043563 LDALLEHEGDEKDGPDVISDQNRLIIIL
3043646 (0)
3043773
EMSFGGSETTSTAMEWAMTELLRNPMVMG*ATEELHQVVGPK* 3043901
3043902 KVEESDIDQLPYLQPVVKETLRLHPVIPLLLPQNTLEDTNFMGHLIPKDT
3044051
3044052
QVFAKA*GIGRDPDSWEDPMSFKPERFLGSNIEYRGQNFEFIPFGSGR* 3044198
3044199
ICVGMLLAHRVVLLGLASLLHCFDWELGSNYAPGTIDVNERMGLTVQKLI 3044348
3044349 PLKAKPKQIGRMINVK* 3044399
>CYP76A9P-de2b
scaffold_28 (+) 3040250-3041175
76
like two C-TERM fragments
$
3040250 MASLLHSFDWEISSGTNPETLD 3040315
3041175 TLDMNWWMGITVRKLVPLYAIPRKI 3041249
>CYP76F3
LG_Ia (-) 8580345-8578719
47% to 76C4, 71% to LG_III (+) 11354190
fgenesh1_pm.C_LG_I000361 [Poptr1:48778]
gene model wrong at N-term
$
8580345 MESLINLLLCVLFTFVL 8580295
8580294
VKILHFIARGSKTESSGKLPPGPAALPIIGSLLDLGDKPHKSLARLAKTH 8580145
8580144
GPLMSLKLGQITTIVISSPTLAKEVLQKHDVSFSNRTIPDALRAHKHHEL 8579995
8579994
GLPWVPIAMRWRNLRKVCNSYIFTNQKLDANQDLRRKKIQELVALVQEHC 8579845
8579844
LAGEAMDIGQAAFTTALNALSNSIFSLNLSDSNSETASQLKEVVGGIMEE 8579695
8579694
AGKPNLADYFPVLRRIDLQGIKRRMTIHFGKILNIFDGIVNERLQLRKMQ 8579545
8579544 GYVPVNDMLDTLLTISEDNNEDIMETSQIKHLFL
8579443 (0)
8579324 DLFAAGTDTTSSTLEWAMAELLHNPRTLSIARTELEQTIGKGSLIEE
8579184
8579183
SDIVRLPYLQAVIKETFRLHPAVPLLLPRKAGENVEISGYTIPKGAQLFV 8579034
8579033
NAWAIGRDPSLWEDPESFVPERFLGSDIDARGRNFELIPFGAGRRICPGL 8578884
8578883
PLAMRMLHMMLGSLIHSFDWKLENGVTPESMDMEDKFGITLGKARSLRAV 8578734
8578733 PIQL* 8578719
>CYP76F4
LG_III (+) 11353007-11354677
76C like 47% to 76C4
71% to LG_I (-)
8580324
estExt_Genewise1_v1.C_LG_III0204|Poptr1 gene model seems
correct
$
11353007
MNFFISVLLYFLLTFAVIQSLDYILRRSKRKSGKLPPGPSRLPIVGNLLD 11353156
11353157 LGDKPHKSLAKLAKTHGQLMSLKLGQVTTIVVSSATMAKEVLQKHDLTFC
11353306
11353307
NRTVVDAVRALDHHEAGIAWLPVATRWRNLRKICNSHIFTAQKLDANQDL 11353456
11353457
RRKKVQDLLAEVQERCLVGEAVDLRQAAFTATLNALSNTVLSLDLTDLSS 11353606
11353607
DIAREFKEHISCIMDEAGKPNLVDYFPLLRRIDPQGIRRRTAIHFGKVFD 11353756
11353757
LFDRLIIERLQLRKVKGYIPLDDMLDTLLTISEVNNEEMDATRIKHFFL 11353903 (0)
11354058
DLFGAGTDTTSSTLEWAMAELLHSPKTLLKARAELERTIGEGNLLEESDI 11354207
11354208
TRLPYLQAVIKETLRLHPAVPFLLPHKAGADAEIGGFTVPKNAQVLVNVW 11354357
11354358 AIGRDPSMWEDPNSFVPERFLESGIDHRGQNFEFIPFGSGRRICPGLPLA
11354507
11354508
MRMLPLMLGSLILSFDWKLADGVTPENLNMDDKFGLTLLKAQPLRAIPIT 11354657
11354658 RELKHG* 11354677
>CYP76G-se1[2]
scaffold_256 (+) 532-1167
76G like exon 2 only
97% to scaffold_256 (+) 69529
eugene3.02560001|Poptr1
$
532 EMFTAGTDTTTSTLEWAMAELLRNPKVMKTVQSELRSTIGLNKKLEDKDI 681
682 ENLPYLKAVIRETLRLHPPLPFLVPHMAMNPCKMLGYYIPKETTILVNVW 831
832 AIGRDSKTWDDPLVFKPERFLEPNMVDYKGRHFEFIPFGSGRRMCPAMPL 981
982 ASRVLPLALGSLLLSFDWILPVGLKPEDMDMTEKIGITLRKSVPLKVIPT 1131
1132 PYKGSSNHYGF* 1167
>CYP76G-se2[1]
scaffold_256 (+) 26660-27136
76G like exon 1 fragment
1 aa diff to scaffold_256 6
(+) 34579
eugene3.02560004|Poptr1
$
26660
MDYEIAGLVLAVLLWVAWAVVTQRRYRRFEEQGQLPPGPRPLPVVGNIFL 26809
26810
LGWAPHESFANLARVHGPIMTIWLGSMCNVVISSSEVAREMFKNHDAVLA 26959
26960
GRKIYEAIRGDFGNEGSIITAQYGPHWRMLRRLCTTEFFVSSRLDAMQGARTRVHRWHA 27136
>CYP76G2
scaffold_256 (+) 32430-34839
76G like 66% to 76G1
97% to scaffold_256 (+) 69529
fgenesh1_pg.C_scaffold_256000007|Poptr1 gene model correct
$
32430 MDYEIAGLVLAVLLWVAWAVVTQRRYRRFEEQGQLPPGPRPLPVVGNIFL
32579
32580
LGWAPHESFANLARVHGPIMTIWLGSMCNVVISSSEVAREMFKNHDAVLA 32729
32730
GRKIYEAIRGDFGNEGSIITAQYGPHWRMLRRLCTTEFFVTSRLDAMQGA 32879
32880
RTRCIDGMLQYIEDDSANGTSAIDLGRYFFLMAFNLIGNLMFSKDLLDPK 33029
33030 SEKGAKFFQHAGIVLELAGKPNMADFFPILRWLDPQGVRRKTQFHVARAF
33179
33180
EIAGGFIKERTESTQKENSRDDKRKDYLDVLLEFRGDGVEEPSRFSSTTI 33329
33330 NAIVL 33344 (0)
34204
EMFTAGTDTTTSTLEWAMAELLRNPNVMKTVQSELRSTIGPNKKLEDKDI 34353
34354
ENLPYLKAVIRETLRLHPPLPFLVSHMAMNPCKMLGYYVPKETTILVNVW 34503
34504 AIGRDSKTWDDPLVFKPERFLEPNMVDYKGRHFEFIPFGSGRRMCPAMPL
34653
34654
ASRVLHLALGSLLLSFDWILPDGLKPEDMDMTEKIGITLRKNVPLKVIPT 34803
34804 PYKGSSHHYGF* 34839
>CYP76G3
scaffold_256 (+) 67375-69789
76G like 66%
to 76G1
97% to scaffold_256 (+) 34579
eugene3.02560008|Poptr1 gene model correct
$
67375
MDYEIAGLVLAVLLWVAWAVVTERRYRRFEEQGQLPPGPRPLPVVGNIFL 67524
67525
LGWAPHESFANLARVHGPIMTIWLGSMCNVVISSSEVAREMFKNHDAVLA 67674
67675
GRKIYEAIRGDFGNEGSIITAQYGPHWRMLRRLCTTEFFVTSRLDAMQGA 67824
67825
RTRCIDGMLQYIEDDSANGTSAIDLGRYIFLMAFNLIGNLMFSKDLLDPK 67974
67975
SEKGAKFFQHAGKVVELAGKPNMADCFPILRWLDPQGIRRKTQFHVARAF 68124
68125
EIAGGFIKERTESTQKENSRDDKRKDYLDVLLEFRGDGVEEPSRFSSTTI 68274
68275 NAIVL 68289 (0)
69154
EMFTAGTDTTTSTLEWAMAELLHNPKVMKTVQSELRSTIGPNKKLEDKDI 69303
69304 ENLPYLKAVIRETLRLHPPLPFLVPHMAMNPCKMLGYYIPKETTILVNVW
69453
69454
AIGRDSKTWDDPLVFKPERFLEPNMVDYKGRHFEFIPFGSGRRMCPAMPL 69603
69604
ASRVLYLALGSLLLSFDWILPDGLKPEDMDMTEKIGITLRKSVPLKVIPT 69753
69754 PYKGSSNHDGF* 69789
>CYP76G4
scaffold_256 (+) 78586-81678
76G like 68%
to 76G1
97% to scaffold_256 (+) 100990
fgenesh1_pg.C_scaffold_256000012|Poptr1 gene model seems
correct
$
78586
MDYEIAGLVLAVLLWVAWAVVTERRYRRSEEQGQLPPGPRPLPVVGNIFQ 78735
78736
LGWAPHESFTNLARVHGPIMTIWLGSMCNVVISSSEVAREMFKNHDAVLA 78885
78886 GRKIYEAMKGDFGNEGSIITAQYGPHWRMLRRLCTTEFFVTSRLDAMQGA
79035
79036
RTRCIDGMLQYIEDGSANGTRAIDLGRYIFLMAFNLIGNLMFSKDLLDPK 79185
79186
SEKGAKFFQHAGKVTELAGKPNMADFLPILRWLDPQGIRRKTQFHVARAF 79335
79336
EIAGGFIKERTESVQKENSRDDKRKDYLDVILEFRGDGVEEPSRFSSTTI 79485
79486 NVIVF 79500 (0)
81043 EMFTAGTDTTTSTLEWAMAELLHNPKVLKTVQSELRSTIGPNKKLEDKDV
81192
81193
ENLPYLKAVIRETLRLHPPLPFLVPHMAMNPCKMLGYYVPKETTILVNVW 81342
81343
AIGRDSKTWDDPLVFKPERFLEANMVDYKGRHFEFIPFGSGRRMCPAMPL 81492
81493
ASRVLPLALGSLLLSFDWILPDGLKPENMDMTEKIGITLRKSVPLKVIPT 81642
81643 PYKGSSNHDGF* 81678
>CYP76G5
scaffold_256 (+) 98560-101256
76G like 68% to 76G1
97% to scaffold_256 (+) 81418
eugene3.02560012|Poptr1 gene model correct
$
98560 MDYEIAGLVLAVLLWVAWAVVTERRYRRSEEQGQLPPGPRPLPVVGNIFQ
98709
98710 LGWAPHESFTNLARVHGPIMTIWLGSMCNVVISSSEVAREMFKNHDAVLA
98859
98860 GRKIYEAMKGDFGNEGSIITAQYGPHWRMLRRLCTTEFFVTSRLDAMQGA
99009
99010 RTRCIDGMLQYIEDGSANGTSAIDLGRYIFLMAFNLIGNLMFSKDLLDPK
99159
99160 SEKGAKFFQHAGKVMELAGKPNMADFLTILRWLDPQGIRRKTQFHVARAF
99309
99310 EIAGGFIKERTESMQKENSRDDKRKDYLDVLLEFRGDGVEEPSRFSSTTI
99459
99460 NVIVF 99474 (0)
100624
EMFTAGTDTTTSTLEWAMAELLRNPKVLKTVQSELRSTIGPNKKLEDKDI 100773
100774
ENLPYLKAVIRETLRLHPPLPFLVPHMAMNPCKMLGYYIPKETTILVNVW 100923
100924
AIGRDSKTWDDPLVFKPERFLESNMVDYKGRHFEFIPFGSGRRMCPAMPL 101073
101074 ASRVLPLALGSLLLSFDWILPEGLKPEDMDMTEKMGITLRKSVPLKVIPT
101223
101224 PYKRSSDHYGF 101256
>CYP76T1
LG_II (+) 11243028-11244627
76C like 51% to 76C4
97% to LG_II (+) 11302436
eugene3.00021389 [Poptr1:552074] gene
model short at N-term, frameshift
$
11243028
MEYLFYLLLISFCWACLHVLNASVLLRRKSGCTVLPPA 11243141
11243141
PRQLPIIGNILALGDKPHRTLAKLSQTYGPLMTLKLGRITTIVISSP 11243281
11243282
NIAKEALQKHDQALSSRTVPDALHVQYYNYHKNSMIWLPASTQWKFLRKL 11243431
11243432
TATQMFTSQRLDASRALRGKKVQELLEYVHEKCNNGHAVDVGRSVFTTVL 11243581
11243582
NLISNTFFSLDVTNYNSDLSQEFSNLVVGFLEQIGKPNIADYFPILRLVD 11243731
11243732
PQGIRRKTNNYLKRLTQIFDSIINERTRLRSSSVASKASHDVLDALLILA 11243881
11243882 KENNTELSSTDIQVLLI 11243932 (0)
11244034
DFFIAGTDTTSSTVEWAMTELLLNPDKMVKAKNELQQVEGPVQESDISKC 11244183
11244184
PYLQAIVKETFRLHPPSPFLPRKAVSEVEMQGFTVPKNAQVLITIWAIGR 11244333
11244334
DPAIWPEPNSFKPERFLECQADVKGRDFELIPFGAGRRICPGLPLGHKMV 11244483
11244484
HLTLASLIHSFDWKIADDLTPEDIDMSETFGFTLHKSEPLRAIPMKT* 11244627
>CYP76T2
LG_II (+) 11302436-11304039
76C like new 50% to 76C4
96% to LG_II (+) 11371594
eugene3.00021394|Poptr1 gene model seems correct
$
11302436
MEYLFYLLLISFCWACLHVLNASVLLRRKSGCTVLPPGPRQLPIIGNILA 11302585
11302586
LGDKPHRTLAKLSQTYGPLMTLKLGRITTIVISSPNIAKEALQKHDQALS 11302735
11302736 SRTVPDALHVQYYN 11302777
11302778
YHKNSMVWLPASTHWKFLRKLTATQMFTSQRLDASRALRGKKVQELLEYV 11302927
11302928
HEKCNNGHAVDVGRSVFTTVLNLISNTFFSLDVTNYNSDLSQEFSNLVVG 11303077
11303078
VLEQIGKPNIADYFPILRLVDPQGIRRKTNNYLKRLTQIFDSIINERTRL 11303227
11303228 RSSSVASKASHDVLDALLILAKENNTELSSTDIQVLLI
11303341 (0)
11303443
DFFIAGTDTTSSTVEWAMTELLLNPDKMVKAKNELQQVEGPVQESDISKC 11303592
11303593
PYLQAIVKETFRLHPPSPLLLPRKAVSEVEMQGFTVPKNAQILINIWAIG 11303742
11303743
RDPAIWPDPNSFKPERFLECQADVKGRDFELIPFGAGRRICPGLPLGHKM 11303892
11303893 VHLTLASLIHSFDWKIAGDLTPEDIDTSETFGLTLHKSEPLRAIPMKT*
11304039
>CYP76T3
LG_II (+) 11339047-11340633
76C like 52% to 76C4
95% to LG_II (+) 11302436-11304039
fgenesh1_pm.C_LG_II000640
[Poptr1:342710] gene model seems correct
$
11339047 MEYLFYLLLISFCWACLHVLNASVLLRRKSGCTILPPGPRQLPIIGNILA
11339196
11339197
LGDKPHRTLAKLSQTYGPLMTLKLGRITTIVISSPNIAKEALQKHDQALS 11339346
11339347
SRTVPDAVRGHHKNSILWLPASSHWKFLKKLTATQMFTSQRLDASRALRG 11339496
11339497
KKVQELLEYVHEKCNNGHAVDVGRSVFTTVLNLISNTFFSLDIANYNSDL 11339646
11339647 SQEFSYLVVGVMEQIGKANIADYFPILRLVDPQGIRRKTNNYLKRLTQIF
11339796
11339797
DSIINERTRLRSSSVASKASHDVLDALLILAKENNTELSSTDIQVLLL 11339940 (0)
11340037
DFFIAGTDTTSSTVEWAMTELLLNPDKMVKAKNELQQVEGPVQESDISKC 11340186
11340187
PYLQAIVKETFRLHPPAPLLLPRKAVSEVEMQGFTVPKNAQILINIWAIG 11340336
11340337
RDPTIWPDPNSFKPERFLECQADVKGRDFELIPFGAGRRICPGLPLGHKM 11340486
11340487
VHLALASLIHSFDWKIADDLTPEDIDTSETFGITLHKSEPLRAIPMKT* 11340633
>CYP76T4
LG_II (+) 11361815-11362516
76C like
93% to LG_II (+) 11371594
eugene3.00021397|Poptr1
(sequence gap)
$
11361815
RRKSGCTVLPPGPRQLQIIGNILALGDKPHRTLAKLSQTYGPLMTLKLGR 11361964
11361965
ITTIVISSPNIAKEALQKHDQALSSRTVPDALRVHHRNSILWLPASTHWK 11362114
11362115
FLRKLTATQMFTSQRLDASQALRGKKAQEMLEYVHENCNNGHAVDIRRSV 11362264
11362265 FTTSLNLISNTFFSLDIANYNSDLSQEFSDLVVGVTEQIGKPNIADYFPI
11362414
11362415 LRLVDPQGVRRKTNNYLKRLTQIFDSIINERTRP
11362516
(sequence gap)
>CYP76T5
LG_II (+) 11370278-11371869
76C like 52% to 76C4
96% to LG_II (+) 11302436
fgenesh1_pg.C_LG_II001397 [Poptr1:347891]
gene model seems correct
$
11370278
MEYLFFVLLISFTWACLHVPIASILLRRKSGCTVLPPGPRQLPIIGNILA 11370427
11370428
LGDKPHRTLANLSQTYGPLMTLKLGRITTIVISSPNIAKEALQKHDQALS 11370577
11370578
SRTVPDALRVHHKNSMIWLPASTHWKFLRKLTATQMFTSQRLDASRALRG 11370727
11370728 KKVQELLEYVHENCNNGHAVDVGRSVFTTVLNLISNTFFSLDVTNYNSDL
11370877
11370878
SQEFSDLVVGVMEQIGKPNIADYFPILRLVDPQGIRRKTNNYLKRLTQIF 11371027
11371028
DSIINERTRLRSSSVASKASHDVLDALLILAKENNTELSSTDIQVLLI 11371171 (0)
11371273
DFFIAGTDTTSSTVEWAMTELLLNPDKMVKAKNELQQVEGPVQESDISKC 11371422
11371423
PYLQAIVKETFRLHPPVPLLLPRKAVSEVEMQGFTVPKNAQILINIWAIG 11371572
11371573
RDPAIWPDPNSFKPERFLECQADVKGRDFELIPFGAGRRICPGLPLGHKM 11371722
11371723
VHLTLASLIHSFDWKIADDLTPEDIDTSETFGITLHKSEPLRAIPMKT* 11371869
>CYP76T6
scaffold_5854 (+) 673-2263
76 like 51% to 76C4 50% to 76F2 Vitis
vinifera
97% to LG_II (+) 11340358
eugene3.58540001|Poptr1 gene model short in middle 1
frameshift
$
673 MEYLFYLLLISFCWACLHVLNASVLLRRKSGCTILPPGPRQLPIIGNILA 822
823 LGDKPHRTLAKLSQTYGPLMTLKLGRITTIVISSPNIAKEALQKHDQALS 972
973 SRTVPDAVRGHHKNSILWLPASSHWKFLKKLTATQMFTSQRLDASRALRG 1122
1123 KKVQELLEYVHEKCN 1167
1167
QGHAVDVGRSVFTTVLNLISNTFFSLDIANYNSDLSQEFSYLVVGVM 1307
1308
EQIGKANIADYFPILRLVDPQGIRRKTNNYLKRLTQIFDSIINERTRLRS 1457
1458 SSVASKASHDVLDALLILAKENNTELSSTDIQILLL
1565 (0)
1667
DFFNAGTDTTSSTVEWAMTELLLNPDKMVKAKNELQEIEGPVQESDISKC 1816
1817
PYLQAIVKETFRLHPPAPLLLPRRAVSEVEMQGFTVPKNAQILINIWAIG 1966
1967
RDPAIWPDPNSFKPERFLECQADVKGRDFELIPFGAGRRICPGLPLAHKM 2116
2117
VHLTLASLIHSFDWKIADDLTPEDIDMSETFGLTLHKSEPLRAIPMKT* 2263
<CYP80 family 8
sequences
>CYP80C1
LG_XII (-) 13076542-13074912
76G like 45%
to 76C2 49% to 80B2
66% to LG_XV (-) 10045180
estExt_fgenesh1_pg_v1.C_LG_XII0135|Poptr1 gene model seems
correct
$
13076542
MDQRFLQRLFSLVSSAEILFLLLLPLTFIILKNIIRSCSESKYLPPGPKP 13076393
13076392
WPIIGNLLHVGNQPHVSLAEIAKIHGPLISLRLGTQLLVVGSSAKAAAEI 13076243
13076242
LKTHDRFLSARHVPQVIPRESHVLRRVALVWCPESIDTWKLLRGLCRTEL 13076093
13076092
FSAKAIESSATLREKKVGELMDFLVAREGKVVSIGEVVFSTVFNTISNLL 13075943
13075942
FSNDLAGLEEKGMSSGLKSHVRKLMLLVATPNIADFYPIFAGLDPQGLRR 13075793
13075792
KLSKLVEETFAIWAINIKERRNSYVHDSPKRDFLDVFLANGFDDDQINWLAA 13075637 (0)
13075517
ELFSAGTDTTATTIEWAVAEILKNKEVMKKVDEELEREITKNTISESDVS 13075368
13075367
GLPYLNACIKETLRLHPPVPLLVPHRATETCEVMKYTIPKDSQVLVNVWA 13075218
13075217 ISRDPSTWEDPLSFKPDRFLGSNLEFKGGNYEFLPFGAGRRICPGLPMAN
13075068
13075067
KLVPLILASLIRCFDWSLPNGEDLAKLDMKDKFGVVLQKEQPLVLVPKRRL* 13074912
>CYP80C-se1[1]
scaffold_12820 (-) 312-52
76C like N-term 50% to 76C3 no gene
model at JGI
probable pseudogene 65% to LG_XII (-) 13076542
$
312
ILKCVSSPSSKNQSLPPGPKPWPIIGNILHFGKKPHISTAYFAKTHGPLISLKLGKQLLIVGSSKR 115
114 AATXILKSHDRLLSARYVFKA 52
>CYP80C2
LG_XV (-) 10046207-10044971
76C like 43% to 76C2
68% to LG_XII (-) 13075301
fgenesh1_pg.C_LG_XV001071|Poptr1 Missing N-term in a
seq gap
(seq gap)
$
10046207
RISIVWATQCSDGWKSLRALCRNELFSAKAIESQAVLREKKMGEMVGFIG 10046058
10046057
RREGEVVGIGEVVLAIVFNTIANLLFSVDLIGLEDDGATTGLKSLMWRMM 10045908
10045907
KLGATPNIADFYPILGGIDPQGLKRKMAVCVNQMFDIWGKYINERREKHV 10045758
10045757 HDGPRSDFLDVFLANGFEDLQINWLAL
10045677 (0)
10045576
ELLSAGTDTTATTVEWAIAELLKNKEVLKKVSEEIKRETDTNSLKESHVS 10045427
10045426
QLPYLNACVKETLRLHPPVPFLIPRRALETCKVMDYTIPRDSEVIVNVWA 10045277
10045276
VGRDPSLWEDPLSFKPERFLGSDLDFKGQDFEFLPFGAGRRICPGLPMAA 10045127
10045126
KQVHLIIATLLYYFDWSLPNGEDPAMLDMSEKFGITLQKEQPLLVVPRRRI* 10044971
>CYP80D-se1[1]
LG_XII (-) 13112302-13111922
76C like pseudogene
80% to LG_XII (-) 13087099
eugene3.00121151|Poptr1
$
13112302
SFFLLAILFLLSIVLKHKSSLAIPPWPNSWPIIGNELQMGNKPHIFLQVD 13112153
13112152
GPLTSLRLGTQLVVVGSSRKAASEILKTHDRELSGRCVPNVPFAKDPKLN 13112003
13112002 GDSIAWTVECTDRWKFFRSRIINELFL
13111922
>CYP80D1 LG_XII (-)
13100933-13099325 76G
like 41% to 76C2
97% to LG_XII (-) 13087099
eugene3.00121149|Poptr1 gene model seems correct
$
13100933
MVSISVLANSYPSFPMLFLLAILLLLSLVLKHKSSKVPAIPPGPKSWPII 13100784
13100783
GNVLQMGNKPHISLTKLAQVYGPLMSLRLGTQLVVVGSSREAASEILKTH 13100634
13100633
DRELSGRCVPHASFAKDPKLNEDSIAWTFECTDRWRFFRSLMRNELFSSK 13100484
13100483 VVDGQSRTRETKAREMIDFLKKKEGEGVKIRDIVFVYTFNVLANIYLSKD
13100334
13100333
LIDYDQTGECQRVCGLVREMMELHTTLNISDLYPILGSLDLQGVSRKCNE 13100184
13100183 CESRIQELWGSVIKERREGRNDTGDDDDNSSKRKDFLDVLLDGEFSDEQISLFFV
13100019 (0)
13099933
QELLAAVSDSTSSTVEWAMAELMRNPQAMKQLREELAGETPEDLITESSL 13099784
13099783
AKFPYLHLCVKETLRLHPPAPFLIPHRATEDCQVLDCTIPKDTQVLVNVW 13099634
13099633
AIARDPASWEDPLCFKPERFLNSDLDYKGNHFEFLPFGSGRRICAGLPMA 13099484
13099483
VKKVQLALANLIHGFDWSLPNNMLPDELNMDEKYGITLMKEQPLKLIPKLRK* 13099325
>CYP80D2
LG_XII (-) 13087162-13085557
76G like 42% to 76C2
97% to LG_XII (-) 13100870
eugene3.00121147|Poptr1 gene model seems correct
$
13087162
MVSISVLANSYPSFPMLFLLAILLLLSLVLKHKSSKVPAIPPGPKSWPII 13087013
13087012
GNVLQMGNKPHISLTKLAQVYGPLMSLRLGTQLVVVGSSREAASEILKTH 13086863
13086862 DRELSGRCVPHASFAKDPKLNEDSIAWTFECTDRWRFFRSLMRNELFSSK
13086713
13086712
VVDGQSSTRETKAKEMIDFLKKKEGEGVKIRDIVFVYTFNVLANIYLSKD 13086563
13086562
LIDYDQTGECQRVCGLVREMMELHTTLNISDLYPILGSLDLQGLSRKTNE 13086413
13086412
CGSRIQELWRSIIKERREGRNDTGDDDNSSKRKDFLDVLLDGEFSDEQIS 13086263
13086262 SFFV 13086251 (0)
13086165
QELLAAVSDSSSSTIEWAMAELMRNPQAMKQLREELAGETPEDLITESSL 13086016
13086015
AKFPYLHLCVKETLRLHPPAPLLIPHRATEDCQVLDCTIPKDTQVLVNVW 13085866
13085865
AIARDPASWEDPLCFKPERFLNSDLDYKGNHFEFLPFGSGRRICAGLPMA 13085716
13085715 VKKVQLALANLIHGFDWSLPNNMLPDELDMAEKYGITLMKEQPLKLIPKLRK*
13085557
>CYP80E1
LG_I (+) 2171265-2172908
39% to 76C4
58% to LG_I (+) 8156063
eugene3.00010276 [Poptr1:547835] gene model
correct
$
2171265
MATIVTEISSNTLFTILFLLPLIYLIAKQLKALYSSRFAPLPPGPYSWPI 2171414
2171415
LGNALQIGNSPHITLASLAKTYGPLFSLRLGSQLVIVAASQEAATEILKT 2171564
2171565
QDRFLSGRFVPDVIPAKWLKLENLSLGWIGEVNNEFKFLRTVCQSKLFSN 2171714
2171715
KALLSQSCLREKKAADTVRFIRTMEGKVLKIKKVAFAAVFSMLTNILISS 2171864
2171865
DLISMEQESTEGEMTEIIRNIFEVGAAPNISDLFPILAPFDLQNLRKKSK 2172014
2172015
ELYLRFSTMFEAIIEERRERKMSSDNASGKEDFLDTLISNGSSNEHINVLLL 2172170 (0)
2172303
ELLVAGSDTSTSAIEWAMAELLRNPQCMKKAQAELASEINQDLIQE 2172440
2172441
SDLPRLKFLHACLKESMRLHPPGPLLLPHRAVNSCKVMGYTIPKNSQVLV 2172590
2172591 NAYAIGRDPKSWKDPLDYKPERFLTSNMDFRGSNIEFIPFGAGRRACPGQ
2172740
2172741
PMATKHVPLVLASLLHFFDWSLPTGHDPKDIDMSDKFHTSLQKKQPLLLI 2172890
2172891 PKIKN* 2172908
>CYP80E2
LG_I (+) 8155946-8158046
76G like 41% to
76G1
58% to LG_I (+) 2172267
estExt_fgenesh1_pg_v1.C_LG_I0934
[Poptr1:691155] gene model seems correct
$
8155946
MAQTSLTQAIDLFSPILLLLPLLLLIVLKHFRHNSSPPFPPGPYPWPILG 8156095
8156096
NILQLGDKPHITLTHFAKIHGPIFSLRLGTQLVVVGSSQAAAIAILKTHD 8156245
8156246
RILSGRHVPHMAPSKSSELNKLSLGWVVECNERWRYLRTICKSELFSLKA 8156395
8156396 LESQACIRERKAKEMIGFINKMEGKVVKIREVATATVFNMLSNILVSRDL
8156545
8156546
VSLEHESEDGGMSSVLKDIARLASTPNISDFYPILGPLDLQGLRKKTMEL 8156695
8156696
HRRSFNMCEAIIQERREGGEGKRDGPDASRRRDFLDALILNGSSDDQIDILLM 8156854 (0)
8157432
ELLSAGTDTSSSTIEWTMAELIKNPRCLKKVQEEIANVINMNRDTG 8157569
8157570 FKESHLPQLTYLQACVKETLRLHPPGPFLLPHRAIDSCQVMNYTIPKNTQ
8157719
8157720
VLVNYWAIGRDPKSWEEPVVFNPERFLSSNLDFKGNDFEFIPFGSGRRIC 8157869
8157870
PGLPMAAKHVALIIAYLILFFDWSLPCGKNPTDLDMSENYGLTLRKEQPL 8158019
8158020 LLVPTSKK* 8158046
<CYP77 family 4 sequences
>CYP77A10
LG_VIII (-) 1164925-1163381
no introns 72% to 77A4
fgenesh1_pg.C_LG_VIII000203 gene model
correct
Transcription
factor GT-2 related protein upstream, E3 ubiquitin ligase downstream
$
1164925
MSLLSFSSATLDPYYHLFFTILALFISGLIFLLSRKPKSKRSHLPPGPPG 1164776
1164775
WPIVGNLFQVAQSGKPFFEYVDDIRSKYGSIFTLKMGTRTMIIISDAKLA 1164626
1164625
HEALIERGACFASRPKENPTRTIFSCNKFSVNAAVYGSVWRSLRRNMVQN 1164476
1164475
MLSSSRIKEFRNVRDSAMDTLINRLRTEAEANSGDVWVIKNVRFAVFCIL 1164326
1164325
LAMCFGIEMDDETIEKMDQVMKSVLIVLDPRIDDFLPILSPFFSKQRKRA 1164176
1164175
SEVRKAQVNFMVSFIEKRRNAIRNPGSDKSAMSFSYLDTLFDLTFE GRKS 1164026
1164025 TPSNEELVTLCSEFLNGGTDTTA
TAVEWGIAQLIANPEVQTKLYNEIKST 1163876
1163875
VGDRKVDEKDVEKMEYLHAVVKELLRKHPPTYFVLSHAVTEPTTLAGYDI 1163726
1163725 PLDASVEFFSYGIGEDPKVWNNPEKFNPDRF ISDGEDADITGVTGVKMMP
1163576
1163575 FGVGRRI CPGLGLATVHLHLMIARMVQEFEWTA
YPPNSKLDFSGKLEFTV 1163426
1163425 SMKNSLRAMIKPRV* 1163381
>CYP77A11P
LG_X (-) 21016073-21015481
pseudogene 64% to 77A10
eugene3.00102536|Poptr1
eugene3.00102535|Poptr1
Transcription factor
GT-2 related protein upstream, E3 ubiquitin ligase downstream
$
21016073 KSSPSNEELVTLCSEFLNGGTDTTG 21015999
21015998 EIKSTAGDRKVDEKDVEK 21015945
21015827 DLPINANVEFYSHGTG*NTKV*TNPEKYNPDRFMSGREDADTTGVTGVKAMHFGVGRR
21015654
21015649 ICPGLWLATVHLHLMVAK 21015596
21015561 KLDISVKLEFTVVMKNSLSAMVKPRA* 21015481
>CYP77B3
LG_IV (-) 589294-587777
68% to 77B1 71% to CYP77B4
eugene3.00040066|Poptr1
tandem duplication not genome duplication, 16kb from 77B4
$
589294
MDLIDLLILCIALMFARLWWRHWSVTGGGPRNLPPGPPGWPIVGNLFQII 589145
589144
LQRRPFIYVVRDLRAKYGPIFTLQMGQRTLVIVTSSELIHEALVQRGPTF 588995
588994
ASRPADSPIRLVFSVGKCAINSAEYGPLWRSLRKNFVTQFINPVRIKQCS 588845
588844
WVRECASENHMKRLKTEALENGFVEVMSNCRLTICSILICLCFGARISEE 588695
588694
RIKSIEAILKEVMLMTTPKLPDFLPILAPLFRKKMEEAKELRRKQMECLV 588545
588544
PLIRNRRAFVEKGENPDLEMASPVGAAYIDSLFAMKPVNRGPLGEQEFVT 588395
588394
LCSEVISAGTDTSATTIEWALLNLVQNQEIQEKLYQEIIGCVGKHGVVKE 588245
588244
EDTEKMPYLGAIVKETFRRHPPSHFVLSHAATNETQLAGYTIPADVNVEF 588095
588094
YTAWLTEDPDLWKDPGEFRPERFLEGDGVDVDMTGTRGVKMMPFGVGRRI 587945
587944
CPAWSLGVLHVNMLLARMVHAFKWLPCPTAPPDPTETFAFTVVMKNPLKA 587795
587794 VILPR* 587777
>CYP77B4
LG_IV (+) 605695-607218
66% to 77B1 no introns 71% to CYP77B3
fgenesh1_pm.C_LG_IV000021|Poptr1 gene model has wrong
N-term
tandem duplication not genome duplication, 16kb from 77B3
$
605695
MELIDLLILGLTLFFLAIWWRSFSVVNGGGAKNLPPGPPGWPLVGNLFQI 605844
605845
ILERRHFIFVIRDLRKKYGPIFSMQMGQSTLVIVTSPDLIHEALVQKGPI 605994
605995
FASRPPDSPIRLVFSVGKCAVNSAEYGPLWRTLRRNFVTELISPVRIRQC 606144
606145
SWIREWALESHMKRLKSEALENGYVDVMDVCRFTVCSILVFICFGAKISE 606294
606295
HWIHDIDNVTKDVMLISIPQLPDFLPILTPLFRKQMKRAKDLRKTQIECL 606444
606445
VPLIRNRRAFVEKGENPKMEMLSPVGAAYVDSLFTLKAPGRGLLGEEELV 606594
606595
TVCSELFVAGIDTSTSVLQWVFLELVLNQDIQEKLYREIVESVGKDGVIN 606744
606745
EEDVEKMNYLNAVVKETLRVHSPAHFTLSHATTEETELGGYKIPSNVNVE 606894
606895
FYIEWMTEDPSLWKDPGIFRPERFIDGDGVNVDMTGTKGKVKMLPFGAGR 607044
607045
RTCPGLALGLLHVNLMLARMVQAFKWLPAPNAPPDPTEAFAFTVVMKNPL 607194
607195 KAVILPR* 607218
<CYP78
family 14 sequences
Note:
The CYP78 family had four subfamilies, three were added in rice, but
due
to early naming decisions, sequences in the logical B and C subfamilies
were
named as 78A sequences. Five of
these now have publications, so to
avoid
confusion. the B and C subfamilies are being included as part of a
larger
A subfamily. Renamed rice
sequences are CYP78B4P = 78A12P,
78B5
= 78A13, 78B6 = 78A14,78C5 = 78A15, 78C6 = 78A16, 78C7 = 78A17
(the
names 78B1 to B3 and C1 to C4 were never used. They were reserved in
case
name changes were made to some of the 78A sequences).
>CYP78A18
LG_Vb (-) 1989840-1988143
78A like 70%
to 78A7
estExt_fgenesh1_pg_v1.C_LG_V0235|Poptr1 gene model seems
correct
$
1989840
MEIDLATKDTSWWVYTLPAFLGSEILIDGYVLFSLVMAFVTLGILTWAFA 1989691
1989690 VGGVAWKNGRNRRGHRLIPGPRGLPVFGSLLTLCRGLAHRTLASMACSRD
1989541
1989540
NTQLMAFSLGSTPVVVASDPHTAREILTSIHFADRPIKLSAKSLMFSRAI 1989391
1989390
GFAPSGTYWRLLRRIASGHLFSPRRISAHESLRQLECSTMLRDMTNEQEL 1989241
1989240
NGFVSLRKHLQFASLNNIMGSVFGKRYDMVHDSQDLEELRGMVREGFELL 1989091
1989090
GAFNWCDYLPWLSYFYDPFRINERCLKLVPRVRKLVKGIIEEHRISKSRN 1988941
1988940 VGDSCDFVDVLLSLDGEEKLQDDDMVAVLW
1988851 (0)
1988754
EMIFRGTDTTALLTEWVMAELVLHPEIQEKLHSELDMAVKD 1988632
1988631
GSLAALTDADVEKLPYLQAVVKETLRVHPPGPLLSWARLSTSDVQLNNGM 1988482
1988481 VIPANTTAMVNMWAITHDPNVWEDPLEFKPERFIEADVDVRGNDLRLAPF
1988332
1988331
GAGRRVCPGKKLGLVTVTLWVAKLVHCFKWNRDVDHPVDLSEVLKLSCEM 1988182
1988181 KYPLHAVAVGRK* 1988143
>CYP78A19
LG_Va (-) 13684462-13682757
78A like 68%
to 78A7
fgenesh1_pm.C_LG_V000403|Poptr1 gene model correct
$
13684462
MDLFPTPVDSSWWMFALPAMLQIQKLSNPLILLFVLASFLVITVLNWAFS 13684313
13684312
TGGLAWKNGRNQKGNVPIPGPRGLPLFGSLFSLSRGLAHRTLACMASSQA 13684163
13684162
ATQLMAFSLGSTPAIVTSDPQIAREILTSPHFADRPIKLSAKSIMFSRAI 13684013
13684012 GFAPNGAYWRLLRRIASNHLFAPRRIAAYEPWRQLDCANMLSGIYNEQSL
13683863
13683862
RGIVCLRKHLQNASINNIMGTVFGKRYDLMHNNEEAKELQELVREGFELL 13683713
13683712
GAFNWSDYLPWLNYFYDPSRIKQRCCLLVPRVKKLVKKIIDEHRIMKPKN 13683563
13683562 EFQNADFVHVLLSLEGEEKLDEDDMVAVLW
13683473 (0)
13683371 EMIFRGTDTTALLTEWIMAELVLNPEIQAKLRNELNFIVGNRSV
13683240
13683239
KDADVAKLPYLQAVIKETLRVHPPGPLLSWARLSTSDVHLSNGMVVPTNT 13683090
13683089
TAMVNMWAITHDPRVWEDALVFKPERFLERQGGADVDVRGGDLRLAPFGA 13682940
13682939
GRRVCPGKNIGLVTVSLWVAKLVHHFEWVQDTHNPVDLSEVLRLSCEMKK 13682790
13682789 PLSAVAIPKE* 13682757
>CYP78A20
LG_II (+) 4123900-4125604
78A like 67% to 78A7
fgenesh1_pg.C_LG_II000566
[Poptr1:347060] gene model
correct
$
4123900
MELVPSSVDSSWWMFALPAMLQTENLSNPLILLFVLISFLVITLLTWAFS 4124049
4124050 TGGLAWKNGRNHKGSVSIPGPRGLPFFGSLFSLSRGLAHRTLACMASSQA
4124199
4124200
ATQLMAFSLGSTPAIVTSDPQIAREILTSPHFADRPIKLSAKSLMFSRAI 4124349
4124350
GFAPNGAYWRLMRRIASTHLFAPRRIAAHEPWRQLDCAKMLSGIYDDQSL 4124499
4124500
HGVVYLRKHLQDASLNNIMGTVFGKRYDLMQFNEEAKELQELVIEGFELL 4124649
4124650
GAFNWSDYLPWLNYFYDPFRIKERCCQLVPRVKKLVKQIIEEHRIKKPKN 4124799
4124800 VFDNADFVDVLLSLEGEEKLEEDDMVAVLW
4124889 (0)
4124990
EMIFRGTDTTALLTEWVMAELVLNQEIQAKLGKELNLVVGNRSVTDADVA 4125139
4125140
DLPYLQAVIKETLRVHPPGPLLSWARLSTSDVHLSNGMVVPVNTTAMVNM 4125289
4125290
WAITHDPRVWEDALVFKPERFMESQGGADVDVRGGDLRLAPFGAGRRVCP 4125439
4125440
GKNLGLVTVSLWVAKLVHHFEWVQDMHSPVDLSEMLKLSCEMKKPLSAVA 4125589
4125590 IPRN* 4125604
>CYP78A21v1
LG_VII (-) 5114877-5113187
78A like 69%
to 78A7
eugene3.00070646|Poptr1 gene model correct
$
5114877
MELDLVTKDTSWWVFTLPAFLGSKSLLDGFILFSLSMAFVSLAFLTWAFA 5114728
5114727
VGGIAWKNGRNRKGHRSIPGPRGLPIFGSLFTLSRGLAHRTLASMAWRRA 5114578
5114577
NTQLMAFSLGSTPVVVASDPHIAREILTSPYFADRPIKQSAKSLMFSRAI 5114428
5114427
GFAPSGAYWRLLRRIASTHLFSPRRILAHESLRQLESTTMLRNITNEQRR 5114278
5114277
NGFVTLRKHLQFASLNNIMGSVFGKTYDMSQDRQELEELRDMVSEGFELL 5114128
5114127
GAFNWCDYLTWLNYFYDPFRIQKRCSKLVPRVRKLVKDIIEEHRLGEPGK 5113978
5113977 VGDDGDFVDVLLSLEGEEKLQDDDMVAVLW
5113888 (0)
5113801 EMIFRGTDTTALLTEWVMAELVLHTEVQEKLRRELDMAVKDRSLSELT
5113658
5113657
DSEVSKLPYLQAVVKEALRVHPPGPLLSWARLCSSDVQLSNGMVIPADTT 5113508
5113507 AMVNMWAITHDPHVWEDPLEFKPERFIE
5113424
5113423
ADVDVRGGDLRLAPFGAGRRVCPGKNLGLVTVTLWVAKLVHHFKW 5113289
5113288 VHDGEHPVDLSEVLKLSCEMKYPLHAVALQMNN*
5113187
>CYP78A21v2
scaffold_2390 (-) 7318-5629
78A like 69%
to 78A7
eugene3.23900001|Poptr1 gene model missing some seq before
the PERF motif
$
7318
MELDLVTKDTSWWVFTLPAFLGSKSLLDGFILFSLSMAFVSLAFLTWAFA 7169
7168 VGGIAWKNGRNRKGHRSIPGPRGLPIFGSLFTLSRGLAHRTLASMAWRRA
7019
7018
NTQLMAFSLGSTPVVVASDPHIAREILTSPYFADRPIKQSAKSLMFSRAI 6869
6868
GFAPSGAYWRLLRRIASTHLFSPRRILAHESLRQLESTTMLRNITNEQRR 6719
6718
NGFVTLRKHLQFASLNNIMGSVFGKTYDMSQDRQELEELRDMVSEGFELL 6569
6568 GAFNWCDYLTWLNYFYDPFRIQKRCSKLVPRVRKLVKDIIEEHRLGEPGK
6419
6418 VGDDGDFVDVLLSLEGEEKLQDDDMVAVLW 6329
(0)
6242
EMIFRGTDTTALLTEWVMAELVLHTEVQEKLRRELDMAVKDRSLSELT 6099
6098
DSEVSKLPYLQAVVKEALRVHPPGPLLSXARLCSSDVQLSNGMVIPADTTAMV 5940
5940 HMWAITHDPHVWEDPLEFKPERFIE 5866
5865 ADVDVRGGDLRLAPFGAGRRVCPGKNLGLVTVTLWVAKLVHHFKW
5731
5730 VHDGEHPVDLSEVLKLSCEMKYPLHAVALQMNN*
5629
>CYP78A22
LG_Ib (+) 6124306-6126694
68% to 78A9
eugene3.00010701 [Poptr1:548260] gene model correct
$
6124306
METQIDSFWVLVLVSKCKAFSAQNPIFLLVSVFLAWLAMALCYWVYPGGPAWGNYLRKK 6124482
6124483 GISCSRAKMIPGPRGFPVIGSMNLMVNLAHHKLAAAAKAFKAERLMAFS
6124629
6124630
LGETKVIITCNPDVAKEILNSSVFADRPVKESAYQLMFNRAIGFAPYGVY 6124779
6124780
WRTLRRIAATHLFCPKQISSTESQRFNIASQMVSAIASQGGDYFCVRGIL 6124929
6124930
KKASLNNMMCSVFGRKYDLGSSNSETEELRRLVDEGYDLLGKLNWSDHLP 6125079
6125080
WLANLDLQRIRFRCSNLVPKVNRFVNRVIEEHREDQTGQRRNDFVDVLLS 6125229
6125230 LHGPDKLSHHDMIAVLW 6125280 (0)
6126071
EMIFRGTDTVAVLIEWILARMVLHRDIQSKVHDELDQVVGRSRP 6126202
6126203
LMEADIQSMVYLPAVVKEVLRLHPPGPLLSWARLAITDTNVDGYDVPAGT 6126352
6126353 TAMVNMWAITRDPQVWANPLRFLPERFLCKDATADVEFSVSGSDLKLAPF
6126502
6126503
GSGRRTCPGKALGLATVSFWVGVLLHEFEWVQCDHEPVDLSEVLRLSCEM 6126652
6126653 SNPLTIKVNPRRR* 6126694
>CYP78A23
LG_IIIb (-) 13439255-13437042
78A like 68%
to 78A9
eugene3.00031195|Poptr1 gene model correct
$
13439255
METQIDSFWVLALVSKCKAFSSQDPIFLLLSLFLAWLAIALCYWVYPGGP 13439106
13439105
AWGKYWLKRATCSKAKMIPGPRGFPVIGSMNLMVNLAHHKLAAAAKTLKA 13438956
13438955
KRLMAFSMGETRVIITCNPDVAKDILNSSVFADRPVKESAYQLMFNRAIG 13438806
13438805
FAPYGVYWRTLRRIAATHLLCPKQISSTEPQRLDIASQMVSVMACQGGDY 13438656
13438655
FRVRDILRKASLNNMMCSVFGRKYDLGTSNNEIEELGGLVDEGYDLLGKL 13438506
13438505
NWSDHLPWLANFDLQKIRFRCSNLVPKVNRFVNRVIQEHREDQSGQRRND 13438356
13438355 FVDVLLSLHGPDKLSDHDMVAVLW 13438284
(0)
13437665
EMIFRGTDTVAVLIEWILARMILHPDIQSKVHDELDQVAGRSRPLME 13437525
13437524
ADIRSMVYLPAVVKEVLRLHPPGPLLSWARLAITDTDVDGYDVPAGTTAM 13437375
13437374
VNMWAITRDPQVWVDPLKFSPERFLSKEVTADVEFSVSGSDLRLAPFGSG 13437225
13437224
RRTCPGRTLGLATVSFWVGSLLHEFEWARCGHEPVDLSEVLRLSCEMAKP 13437075
13437074 LTVKVNPRRR* 13437042
>CYP78A24
LG_IIc (+) 13511138-13512986
78A like 72% to 78A9
eugene3.00021624 [Poptr1:552309] gene
model seems correct
$
13511138
MRTDIDNFWIFALASKCRVFTQENIAWSLLIMGLAWIATTLIYWAYPGGP 13511287
13511288
AWGKYKLKNTSFTISKPIPGPRGLPLIGGMRLMTSLAHHKIAAAADACKA 13511437
13511438
RRLMAFSLGDTRVIVTCNPDVAKEILNSSVFADRPVKESAYSLMFNRAIG 13511587
13511588
FAPYGVYWRTLRKIASTHLFCPKQIKTAASQRRRIASETVSMFNDHEGSG 13511737
13511738
FTVRGILKRASLNNMMCSVFGREYELDSCNSEVEELRALVDEGYDLLGTL 13511887
13511888
NWTDHLPWLADFDPQKIRFRCSNLVPKVNRFVSRILAEHR 13512007
13512008 AQAGNETPDFVDVLLSLQGHDKLSDSDMIAVLW
13512106 (0)
13512321
EMIFRGTDTVAVLMEWILARMVLHPDVLSKVHDELDKVVGRSRAVAESDI 13512470
13512471
TAMVYLQAAVKEVLRLHPPGPLLSWARLAITDTTIDGYHVPKGTTAMVNM 13512620
13512621 WAISRDPDSWEDPLEFMPERFVTKKG
13512698
13512699
ELEFSVLGSDLRLAPFGSGRRTCPGKTLGLTTVTFWVASLLHEYEWLPCD 13512848
13512849 GNKVDLSEVLGLSCEMANPLTVKLRPRR
13512932
13512933 SHYKPTVLGVQNYVLDV* 13512986
>CYP78A25
LG_XIV (-) 3837010-3835172
78A like 72% to 78A9
estExt_fgenesh1_pg_v1.C_LG_XIV0445|Poptr1 gene model seems
correct
$
3837010
MRTDIDSFWIFALASKCRAFTQENIAWSLLIIGLAWIVVTLIYWAYPGGP 3836861
3836860
AWGKYKLKNTSLTISNPIPGPRGFPITGSMKLMTSLAHHKIAAAADACKA 3836711
3836710
RRLMAFSLGDTRVIVTCNPDVAKEILNSSVFADRPVKESAYSLMFNRAIG 3836561
3836560 FAPYGVYWRTLRKIASTHLFCPKQIKAAESQRLQIASQMVSTFNDREKSS
3836411
3836410
FSVREVLKRASLNNMMCSVFGREYKLDSFNNEVEELRALVEEGYDLLGTL 3836261
3836260
NWSDHLPWLADFDPQKIRFRCSNLVPKVNRFVSRIIAEHRALTRSENPDF 3836111
3836110 VDVLLSLQGHDKLSDSDMIAVLW 3836042 (0)
3835810 EMIFRGTDTVAVLIEWILARMVLHPDVQSKVHDELYKVVGRSRAVAESDI
3835661
3835660
TAMVYLQAVVKEVLRLHPPGPLLSWARLAITDTTIDGYHVPKGTTAMVNM 3835511
3835510
WAISRDPEFWEDPLEFMPERFVVTKEDVLEFSVLGSDLRLAPFGSGRRTC 3835361
3835360
PGKTLGITTVTFWVASLLHEYEWVPGEENNVDLSEVLRLSCEMANPLTVK 3835211
3835210 VRPRRSSQSPLY* 3835172
>CYP78A26P
scaffold_3645 (-) 4941-3651
78A like PSEUDOGENE 64% to 78A10
eugene3.36450001|Poptr1
$
4941
FLSKLGPSTGPGLELILGIVLFVFIFSFWLAPGGLAWDLSKTRTTIPGPS 4792
4791 GWPILGMVLAFTGSLTHRVLARISELLKAKPLMVFSVGFPRFIISSHPETAKEILNSSAFA 4609
4607
IKESAYELLFHKAMGFAHFGDCWRNLRRISATHLFSPNKRIAALGEFIRD 4458
4457
IGLKMVSEIKSLAERNGEVLEIRKVLHFGSLDNVMKRVFGRSYEFGDESK 4308
4307 VGVCELEGLVSERYELLGIFNWSDHF 4230
4231
FPILGWLDLQGVRKRCRNLAAKVNVFVEKIIDEHKMKRVESDKNEDIIKS 4082
4081 DESSSDFVDVLLDLQKENKL 4022
4019 CSDMIAVLW 3993
3870 EMIFR 3856
3854
GTDTVAILLEWILARMVLHPVIQAKVQAEIDNVVGSSRSVSDFVLPNLPY 3705
3704 LRAVVRETLRVLPPGPLL 3651
>CYP78D2
LG_IIIa (+) 15447061-15449577
78A like 50%
to 78A5
fgenesh1_pg.C_LG_III001462|Poptr1 gene model short at
N-term
$
15447061
MKSIPANLSSILFCLAVITHQTPWPVALLLFSLSSFFAFSLNYWLVPGGFAW 15447216
15447217
RNHHDNQNPSRFRGPIGWPIVGTLPQMGSLAHRKLASMAASLGATKLMAF 15447366
15447367
SLGSTRVIISSHPDTAREILCGCSFADRPIKESARLLMFERAIGFAPSGD 15447516
15447517 YWRHLRRIAANYMFSPRKISALEPLRQRLANEMVAEVREEMKERRVVVLR
15447666
15447667
DILQKGSLSNVLESVFGSDVSIEREELGFMVKEGFDLIAEFNLDDYFPLR 15447816
15447817
FLDFHGVKRRCCQLAGKVNSVVGQIVKERKGAGDSRSGSDFLSALLSLPE 15447966
15447967 EDQLNESDMVALLW 15448008 (0)
15448966 EMIFRGTDTVALLLEWIMARMVVHPEIQAKAQEELDTCIGGHREVQDSDI
15449115
15449116
PNLPYLRAIVKEVLRLHPPGPLLSWARLAIHDVHVDKTFIPAGTTVMVNM 15449265
15449266
WAITHDPSIWRDPWSFNPDRFIEEDVLIMGSDLRLAPFGAGRRVCPGKAL 15449415
15449416
GLATVHLWLARLLHEYRWLPAKPVDLSECLRLSLEMKRPLECHVVQRRSKVTQ* 15449577
>CYP78D3v1
LG_Ia (-) 3924905-3921817
51% to 78A5
fgenesh1_pg.C_LG_I000468 [Poptr1:63116]
gene model correct
$
3924905
MMSLLANLSFLLFFLAIITHQTPWPITVLLLSLFSS FALSLNYWLVPGGFA 3924753
3924752
WRNHHDNQNPSKFRGPIGWPVFGTLPQMGSLAHRKLASMATSLGATKLMA 3924603
3924602
FSLGTTRVIISSHPDTAREILWGSSFADRPVKESARLLMFERAIGFAPSG 3924453
3924452
DYWRHLRRIAANHMFSPKKISGLEPLRQRLANEMLAEVSGEMKERRAVVL 3924303
3924302
RGILQKSSLSNVLESVLGSDVHVKREELGFMAQEGFDLVSRFNLEDYFPL 3924153
3924152
RFLDFYGVKRRCYKLAGKVNSLVGQIVRERKRAGDFRSRTDFLSALLSLP 3924003
3924002 EQERLDESDMVPLLW 3923958 (0)
3922440
EMIFRGTDTVAILLEWIMARMVLHPEIQAKAQQELEKFIGNHRRVQDSDI 3922291
3922290
PNLPYLQAIVKEVLRLHPPGPLLSWARLAIHDVHVDKMSIPAGTTAMVNM 3922141
3922140
WAITHDPSIWRDPWAFNPDRFMEEDVLIMGSDLRLAPFGSGRRVCPGKAL 3921991
3921990 GLATVHLWLARLLHEYKWLPAKPVDLSECLRLSLEMKRPLECHVVPWSKV
3921841
3921840 ADFDQKT* 3921817
>CYP78D3v2
scaffold_1387 (-) 6082-5135
78A like 1 aa diff to LG_I 3922062
estExt_Genewise1_v1.C_13870002|Poptr1 runs off end in
intron seq
probable duplicate seq of LG_I 3922062
$
6082
MMSLLANLSFLLFFLAIITHQTPWPITVLLLSLFSSFALSINYWLVPGGFA 5930
5929
WRNHHDNQNPSKFRGPIGWPVFGTLPQMGSLAHRKLASMATSLGATKLMA 5780
5779
FSLGTTRVIISSHPDTAREILWGSSFADRPVKESARLLMFERAIGFAPSG 5630
5629
DYWRHLRRIAANHMFSPKKISGLEPLRQRLANEMLAEVSGEMKERRAVVL 5480
5479
RGILQKSSLSNVLESVLGSDVHVKREELGFMAQEGFDLVSRFNLEDYFPL 5330
5329
RFLDFYGVKRRCYKLAGKVNSLVGQIVRERKRAGDFRSRTDFLSALLSLP 5180
5179 EQERLDESDMVPLLW 5135 (0)
>CYP78D-se1[2]
LG_IX 340405-340566
CYP78 like pseudogene
69% to LG_I (-) 3924905
$
340405 PLLSWGRLAFHDTQVGPHVIPAGITAMVNMWSITHDERIWSDKNSISTARIYNQ
340566
<CYP79 family 6 sequences
>CYP79D5
LG_XIII (+) 12617173-12619030
79D like 58%
to 79D1
eugene3.00131268|Poptr1 gene model seems correct
12617173
MEYLSATSFTALLSFPTSLLVLAIILFYFIQSHRNVKKHPLPPGPKPWPI 12617322
12617323
VGCLPTMLRNKPVYRWIHNLMKEMNTEIACIRLGNVLVIPVICPDIACEF 12617472
12617473
LKAQDNTFASRPNTMTTDLISRGYLATILSPSGDQWNKMKKVLMTHVLSP 12617622
12617623
KKHQWLYSKRVEEADHLVHYVYNQCKKSVHQGGIVNLRTAARHYCANVTR 12617772
12617773
KMLFNKRFFGEGMKDGGPGFEEEEYMDALFSCLKHIYAFCISDFLPSLIG 12617922
12617923
LDLDGHEKVVMENHRIINKYHDPIIHERVQQWKDGAKKDTEDLLDILITL 12618072
12618073
KDRNGNPLLSKDEIKAQIT 12618129 (0)
12618407
EIMVAAVDNPSNACEWAFAEMLNQPEILEKATEELDRVVGKERLVQES 12618550
12618551
DFAHLNYVKACAREAFRLHPFAPFNVPHVSAADTTVANYFIPKGSYVLLS 12618700
12618701
RLGLGRNPKVWDEPLKFKPERHLNEMEKVVLTENNLRFISFSTGKRGCKG 12618850
12618851
VTLGTSMTTMLFARLLQAFTWSLPPRQSSIDLTIAEDSMALAKPLCALAK 12619000
12619001
PRLRPQVYPGY* 12619036
>CYP79D6v1
LG_XIII (+) 12759202-12761058
79B like 57%
to 79D1 95% to 79D5
fgenesh1_pg.C_LG_XIII001242|Poptr1 gene model seems
correct
$
12759202
MEYLAPTSFTTLLSFTASLLVLAIILFYFIQSHKNVKKHPLPPGPKRWPV 12759351
12759352
VGCLPTMLRNKPVYRWIHNLMKEMNTEIACIRLGNVHVIPVICPDIACEF 12759501
12759502
LKAQDNTFASRPHTMTTDLISRGYLTTALSPSGDQWNKMKKVLMTHVLSP 12759651
12759652
KKHQWLYSKRVEEADHLVHYVYNQCKKSVHQGGIVNLRTAAQHYCANLTR 12759801
12759802
KMLFNKRFFGEGMKDGGPGFEEEEYVDALFSCLNHIYAFCISDFLPSLIG 12759951
12759952 LDLDGHEKVVMENHRIINKYHDPIIHERVQQWKDGAKKDTEDLLDILITL
12760101
12760102 KDRHGNPLLSKDEIKAQIT 12760158 (0)
12760435
EIMVAAVDNPSNACEWAFAEMLNQPEILEKASEELDRVVGKERLVQES 12760578
12760579
DFAHLNYVKACAREAFRLHPVAPFNVPHVSAADTTVANYFIPKGSYVLLS 12760728
12760729
RLGLGRNPKVWDEPLKFKPERHLNEMEKVVLTENNLRFISFSTGKRGCIG 12760878
12760879
VTLGTSMTTMLFARLLQAFTWSLPPSQSSIDLTIAEDSMALAKPLCALAK 12761028
12761029 PRLPPQVYPGY* 12761064
>CYP79D6v2
scaffold_1585 (-) 9137-7275
79B like 58%
to 79D1 95% to 79D5
eugene3.15850001|Poptr1 gene model seems correct only 2aa
diffs to 79D6 possible duplicate
9137
MEYLAPTSFTTLLSFTASLLVLAIILFYFIQSHKNVKKHPLPPGPKRWPV 8988
8987
VGCLPTMLRNKPVYRWIHNLMKEMNTEIACIRLGNVHVIPVICPDIACEF 8838
8837
LKAQDNTFASRPHTMTTDLISRGYLTTALSPSGDQWNKMKKVLMTHVLSP 8688
8687 KKHQWLYSKRVEEADHLVHYVYNQCKKSVHQGGIVNLRTAAQHYCANLTR
8538
8537
KMLFNKRFFGEGMKDGGPGFEEEEYVDALFSCLNHIYAFCISDFLPSLIG 8388
8387
LDLDGHEKVVMENHRIINKYHDPIIHERVQQWKDGAKKDTEDLLDILITL 8238
8237
KDRHGNPLLSKDEIKAQIT 8181 (0)
7904
EIMVAAVDNPSNACEWAFAEMLNQPEILEKATEELDRVVGKERLVQES 7761
7760
DFAHLNYVKACAREAFRLHPVAPFNVPHVSAADTTVANYFIPKGSYVLLS 7611
7610
RLGLGRNPKVWDEPLKFKPERHLNEMEKVVLTENNLRFISFSTGKRGCIG 7461
7460
VTLGTSMTTMLFARLLQAFTWSLPPSQSSIDLTIAEDSMALAKPLSALAK 7311
7310
PRLPPQVYPGY* 7275
>CYP79D7
LG_XIII (+) 12783206-12785069
79B like 58%
to 79D1, 99% to 79D5
eugene3.00131275|Poptr1 gene model seems correct only 5aa
diffs to 79D5, seems odd
$
12783206
MEYLSATSFTALLSFPTSLLVLAIILFYFIQSHRNVKKHPLPPGPKPWPI 12783355
12783356
VGCLPTMLRNKPVYRWIHNLMKEMNTEIACIRLGNVLVIPVICPDIACEF 12783505
12783506
LKAQDNTFASRPNTMTTDLISRGYLATILSPSGDQWNKMKKVLMTHVLSP 12783655
12783656
KKHQWLYSKRVEEADHLVHYVYNQCKKSVHQGGIVNLRTAARHYCANVTR 12783805
12783806
KMLFNKRFFGEGMKDGGPGFEEEEYVDALFSCLNHIYAFCISDFLPSLIG 12783955
12783956 LDLDGHEKVVMENHRIINKYHDPIIHERVQQWKDGAKKDTEDLLDILITL
12784105
12784106 KDRNGNPLLSKDEIKAQIT 12784162 (0)
12784440
EIMVAAVDNPSNACEWAFAEMLNQPEILEKATEELDRVVGKERLVQES 12784583
12784584
DFAHLNYVKACAREAFRLHPLAPFNVPHVSAADTTVANYFIPKGSYVLLS 12784733
12784734
RLGLGRNPKVWDEPLKFKPERHLNEMEKVVLTENNLRFISFSTGKRGCIG 12784883
12784884
VTLGTSMTTMLFARLLQAFTWSLPPRQSSIDLTIAEDSMALAKPLCALAK 12785033
12785034 PRLPPQVYPGY* 12785069
>CYP79D8
LG_IV (+) 3871814-3873941
79A like 56%
to 79D1 73% to 79D5
eugene3.00040407|Poptr1 gene model seems correct
$
3871814 MDYFPSTSSFIILLSFPILFLVLAITLFSFIQSSKNVKQYSLPPGPRPWPLVGSL
3871978
3871979
PTMLRNKPVYQWIHNLMKEMNTEIACIRLGNIHVIPVTCPNIACEFLKEQ 3872128
3872129
DDVFSSRPETISSYLASNGYLATVVSPFGDQWKKMKSVMATQVLSPTRHQ 3872278
3872279 WLHKKRVEEGDNLVRLVYKQCQESD 3872353
3872354 QDGIVNLRFTSQHYCANVIRKLMFNKRYFGVGMENGGPGFEEEQHVDALF
3872503
3872504
TILSHLFSFCVSDFLSFLTWLDLDGHEKVMKEKDKIIKKYHDPIIDDRIQ 3872653
3872654
QWKDGKKKDIEDLLDVLITLKDDNGNPLLSKDEIKAQVE 3872770
3873315
DIILAAVDNPSNACEWAFAEMLNNPEILETAVEELDRVVGKQRLVQESDF 3873464
3873465 AQLNYVKACAREAFRLHPVAPFNVPHVSMADTVVAKHFIPKGSYVILSRL
3873614
3873615
GLGRNPKVWDEPLEFKPERHLKGTGNVVLAENGLRFISFSTGKRGCMAVT 3873764
3873765
LGSSMTNMLFARLLHGFSWSLPSNESSIDLSTAKDSMALAKPLLAVAKPR 3873914
3873915 LPAHLYPK* 3873941
LG_II (-) 16041237-16040849
79D like 78% to 79D8
fgenesh1_pg.C_LG_II001846|Poptr1
$
16041237
DIILATVDNPSNACEWAFAEVLNSPGILKMFVEELDRVVGKQQLVQESD 16041091
16041090
SALLNYVEVCAREAFRLHPVAPLNIPRVSMADTVVSNHFIPRGSYAILSRLG 16040935
16040935 DLKVWDEPLRFKPECHLTRTGHVVLAENG
16040849
<CYP81
family 50 sequences
>CYP81B3v1
LG_IIc (-) 9154259-9152268
81D like 47% to 81D8
59% to LG_II (-) 9149602
fgenesh1_pg.C_LG_II001121
[Poptr1:347615] gene model
short in middle, one frameshift
$
9154259
MEISFYSCFMLFLMFYFLSKHLCKISKNLPPSPGLSLPIIGHLYLIKKPLHQTLANLSNKYGPILFI 9154059
9154058
QFGSRPVILVSSPSVAEECLSKNDIIFANRPRLLAGKHLGYNYTTLTWASYGNHWRNLRRIAALEI 9153861
9153860
LSTNRLKMFYHIRADEVRLLVHKLFKGCRGGEFMSIDAKSTFFDLTLNVITRMIAGKRYYGEDLAEL 9153660
9153659
GEARQFKEIVRETFELSGATNIGDFVPALKWIGLNNIEKRLAILHRKRDEFVQDLILEHRKVKSEFA 9153459
9153458 SHQGSSKTMINVLLTLQETEPEYYTDEIIRGLMT
9153357 (0)
9152886
VILSAGTDTSAGTMEWALSLLLNNPQALMKAQIEIDTIIGPS 9152760
9152759 KLIEESDLLKLPYLQGIIKETLRMYPP 9152680
9152678 PHLPPHESSEECTVGGFRVPRGTMLLVNMRSVHNDPNLWE
9152559
9152558
EPTKFKPERFHGPEGKRDGFIYLPFGAGRRGCPGEGLATRIIGLALGSLIQCFEWERVCGELVDMS 9152361
9152360 EGTGLTMPKAQNLWAKCRPRPAMVNQLSQT*
9152268
>CYP81B3v2
scaffold_1047 (+) 16009-16578
81D like 46% to 81D6 N-term no model at JGI
100% to LG_II (-) 9152501 duplicate
seq
$
16009
MEISFYSCFMLFLMFYFLSKHLCKISKNLPPSPGLSLPIIGHLYLIKKPL 16158
16159
HQTLANLSNKYGPILFIQFGSRPVILVSSPSVAEECLSKNDIIFANRPRL 16308
16309
LAGKHLGYNYTTLTWASYGNHWRNLRRIAALEILSTNRLKMFYHIRADEV 16458
16459 RLLVHKLFKGCRGGEFMSIDAKSTFFDLTLNVITRM 16566
>CYP81B4
LG_IId (-) 9151207-9149369
81D like 49% to 81D2
95% to scaffold_1047 (+) 5087
fgenesh1_pm.C_LG_II000512
[Poptr1:342582] gene model seems correct
$
9151207 MFLHFLLLYLVLYVLTNHFRNKIQNLPPSPFPALPIIGHLHLLKKPLHRS
9151058
9151057
LSKISNRHGPVVLLQLGSRRVLVVSSPSAAEECFTKNDIVFANRPHLLAG 9150908
9150907
KHLGRNYTTLSWAPHGDLWRNLRKISSLEILSSNRLQLFSSIRTEEVKFL 9150758
9150757
IRRLFKNNDEIIDLKSSFFELMLNVMMRMIAGKRYYGENEAEVEEGRRFR 9150608
9150607
EIVTETFQVSGASAVGDFLHVLAVIGGTEKRLMKLQEKRDGFLQELVDEH 9150458
9150457
RRRMGNNKSCFSNERNYKTMIEVLLTLQESEPEYYKDETIKDLMV 9150323 (0)
9149989
VLLSAGTETTAGTMEWALSLLLNNPLILRKAQNEIDKVVGHDRLIDE 9149849
9149848
SDVVKLPYLHCVIKETMRMYPIGPLLVPHRSSEECGVGGFQIPSGTMLLV 9149699
9149698
NMWAIQNDPKIWDDAAKFKPERFEGSVGVRDGFKLMPFGSGRRRCPGEGL 9149549
9149548
AIRMVGLTLGSLLQCFEWDRVSQEMVDMTGGTGLTMPKAQPLLARCTSRP 9149399
9149398 SMANLLSQI* 9149369
>CYP81B5
LG_IIb (-) 9145798-9143865
81D like 50% to 81D8
78% to LG_XIV (-) 929000
eugene3.00021123
[Poptr1:551808] gene model
short on N-term
$
9145798
MATLFLYFPFFLALYMITRHLLDKIQNLPPSPFLSLPIIGHLYLFKKPIY 9145649
9145648
RTLSNISNRYGQLVVLLRLGSRRVLVVSSPSIAEECFTKNDVVFANRPRL 9145499
9145498
LIGKHLGYNCTNLFWASYGDHWRNLRKIVSIEVLSAYRLQMHSATHLEEV 9145349
9145348
KWMIGWLFRNQNQVVDMKKAFLELTLNIIMRMIAGKRYYGDDVSDVEQAQ 9145199
9145198
RFRAIHAEMYTLIGQTIIGDYVPWIKSKKMEKRLIECRVKRDSFMQCLIE 9145049
9145048
EQRRVLLESDCCGERKRTMIQVLLSLQETEPEYYTDDIIKGLML 9144914 (0)
9144482 VLLFAGTDTSSSIMEWALSLLLNHSEVLLKAQKEIDEYIGPDRLIDEADL
9144333
9144332
AQLPYLRSIINETLRMYPPAPLLVPHESSEECLVGGFRIPHGTMLFVNMW 9144183
9144182
AIHNDPKIWLDPRKFRPDRFNGLEGARDGFRLMPFGYGRRSCPGEGLALR 9144033
9144032
MVGLALGSLIQCFEWQRIDDKSVDMTERPGFTMAKAQPLKAICRPRLSMLKLFSQ* 9143865
>CYP81B6
scaffold_1047 (+) 3473-5320
81D like 51%
to 81D3
95% to LG_II (-) 9149602
eugene3.10470001|Poptr1 gene model seems correct
$
3473
MFLHFLLLYLVLYVLTNHFRNKIQNLPPSPFPALPIIGHLHLLKKPLHRS 3622
3623
LSKISNRHGPVVLLQLGSRRVLVVSSPSAAEECFTKNDIVFANRPHLLAG 3772
3773 KHLGRNYTTLPWAPHGDLWRNLRKISSLEILSSNRLQLLSSIRTEEVKLL
3922
3923
IRRLFKNNDQIIDLKSSFFELMLNVMMRMIAGKRYYGENEAEVEEGRRFR 4072
4073
EIVTETFQVSGASAVGDFLHVLAVIGGTEKRFMKLQEKRDGFMQELVDEP 4222
4223
RRRMGNNKSCFSNERNYKTMIEVLLTLQESEPEYYKDETIKDLMV 4357 (0)
4700 VLLSAGTDTTAGTVEWALSLLLNNPLILKKAQNEIDKVVGQDRLIDE
4840
4841
SDVAKLPYLHCVIKETMRMYPVGPLLVPHESSEECVVGGFQIPRGTMLLV 4990
4991
NIWAIQNDPKIWDDAAKFKPERFDGSEGVRDGFKLMPFGSGRRSCPGEGL 5140
5141
AMRMAGLTLGSLLQCFEWDRVSQEMVDLTEGTGLSMPKAQPLLARCTSRP 5290
5291 SMANLLSQI* 5320
>CYP81B6P
scaffold_1047 (+) 8471-9091
81D like duplicate seq
100% to scaffold_1047 (+) 3473-5320
eugene3.10470002|Poptr1 exon 2 only
(0)
$
8471
VLLSAGTDTTAGTVEWALSLLLNNPLILKKAQNEIDKVVGQDRLIDE 8611
8612
SDVAKLPYLHCVIKETMRMYPVGPLLVPHESSEECVVGGFQIPRGTMLLV 8761
8762 NIWAIQNDPKIWDDAAKFKPERFDGSEGVRDGFKLMPFGSGRRSCPGEGL
8911
8912
AMRMAGLTLGSLLQCFEWDRVSQEMVDLTEGTGLSMPKAQPLLARCTSRP 9061
9062 SMANLLSQI* 9091
>CYP81B7
LG_XIV (-) 930738-928767
81D like 51%
to 81D8
93% to scaffold_40 (+) 2691314
eugene3.00140107|Poptr1 gene model seems correct
$
930738
MATLILYFPVILALYIITSHFLDKIRNFPPGPFPSLPIIGHLYLLKKPIY 930589
930588
RTLSKISSKHGPVLLLQLGSRRLLVVSSPSIAEECFTKNDVVFANRPRLL 930439
930438
IAKHLAYNSTSLVWAPYGDHWRNLRRIVSIEVLSAYRLQMLSAIRLEEVK 930289
930288 SMVCVLFRNQKHTVDMKTVFFELTLNIMMRMIAGKRYYGENVSDVEEAKR
930139
930138
FRALHAESFLLGGKTIIGDYIPWIKSKKMEKRLIECNLKRDSFLQCLIEE 929989
929988
QRRKILEGDCCGEKKKNLIQVLLSLQETEPEYYTDDIIKGLVV 929860 (0)
929387
VILFAGTDTSSTTMEWALSLLLNHPEVLEKAKREIDEHIGHDRLMDEGDL 929238
929237
AQLPYLRSILNETLRMYPPAPLLVPHESSEECLVGGFRIPRGTMLSVNMW 929088
929087
AIQNDPKIWRDPTKFRPERFDNPEGGRYEFKLMPFGHGRRSCPGEGLALK 928938
928937
VVGLALGSLLQCFEWQKIGDKMVDMTESPGFTVPKAKQLEAICRARPRML 928788
928787 TLLSQI* 928767
>CYP81B8P1
scaffold_40 (+) 2662475-2663606
81D like pseudogene N-term
73% to LG_IIc (-) 9152501
$
2662475
YYYFLLFLMFRVLSKHLRKINKNLPPSPGLSLPITGHLYLIKKPLRQTLA 2662624
2662625 NLSN
QYGPILF
IKFGSRTVILV*SPSVAGEC
2662717
GIILANLPRLVGP
2662762
GKHLGYIYTTLAWASYGKHWRNLRRISALEILSTNRLQMFCHIRAH 2662899
RRLYKGSKGGEFMTN
2662947
DAKSTFFYLTLDVIMRMIAGKRYHGENPAELGESRKVKEIVTETFELSGA 2663096
2663097
TNTGDFVPVLKWFEMNHNEKRLAVLHSKRDKFLQDLIEAHRKVKDES 2663237
ASDQGSG*
2663262 TTIDILLALQETEPEFYTYEIIRGMMT 2663345
(0)
2663366 RRSCPEKGLALCMVGLTLES 2663425
2663428
FEWERVSEEMAGMTEGIGLSMPRAHPLLAKCRLCPSMVSLLSRI 2663559
>CYP81B9P
scaffold_40 (+) 2666003-2667947
81D like pseudogene, one frameshift
98% to scaffold_40 (+) 2687031
91% to LG_XIV (-) 929000 with one frameshift
fgenesh1_pg.C_scaffold_40000328
[Poptr1:94216] model short (2 genes fused?)
$
2666003
MATLFLYFPVFLALYIISTHFLNKIRNFPPSPFPSLPIIGHLYLLKKPLY 2666152
2666153
RTLSKISDKHGPVILLQLGSRRQLVVSSPSIAEECFTKNDVVFANRPRLL 2666302
2666303 IAKHLAYNSTSLVWAPYGDHWRNLRKIVSIEVLSAYRLQMLSSIRLEEVR
2666452
2666453 SMICVLFRNQNQ 2666488
2666488
VVDMRTVFFELTLNIMMRMIAGKRYYGENVSDVEEAKRFRAIHAESFLLG 2666637
2666638
GKTIIGDYIPWIKSKEMEKRLIECNLKRDSFLQCLIEEQRRKILEGDCCG 2666787
2666788 EKKKNLIQVLLSLQETEPEYYTDDIIKGLVV
2666880 (0)
2667327 VILLAGTHTSSSTMEWALSLLLNHPQVLEKAKREIDEHIGHDRLMDEADL
2667476
2667477
AQLPYLRSILNETLRMYPAAPLLVPHESSEECLVGGFRIPRGTMLSVNVW 2667626
2667627 AIQNDPKIWRDPTKFRPER 2667683
2667684
FDNLEGGRDEFKLMPFGHGRRSCPGEGLALRVVGLALGSLLQCFEW 2667821
2667822 QKIGDKMVDMTEASGSAISKAQPLKAICRARPSMLTHLSQI*
2667947
>CYP81B-se1[2]
scaffold_40 (+) 2671685-2672180
81D like pseudogene exon 2 only
$
2671685
VILLAGTHTSSSTMEWALSLLLNHPQVLEKAKREIDEHIGHDRLMDEADLAQ 2671840
2671842
LPYLRSILNETLRMYPAAPLLVPHESSEECLVGGFRIPRGTMLSVNV 2671982
2671983 WAIQNDPKIWRDPTKF 2672030
2672031
RPERFDNLEGGRDEFKLMPFGHGRRSCPGEGLALRVVGLALGSLLQCFEW 2672180
2672181
QKIGDKMVDMTEASGSAISKAQPLEAICRARPSMLTHLSQI 2672303
>CYP81B8P2
scaffold_40 (+) 2676659-2676886
81D like N-term frag.
100% match to scaffold_40 (-) 2662499
$
2676659
YYYFLLFLMFRVLSKHLRKINKNLPPSPGLSLPITGHLYLIKKPLRQTLA 2676808
2676809 NLSN
QYGPILF
IKFGSRTVILV*SPS 2676886
(sequence gap)
>CYP81B10
scaffold_40 (+) 2684818-2685009
81D like N-term frag.
scaffold_40
(+) 2685841-2687264
81D like
98% to scaffold_40 (+) 2667714
fgenesh1_pg.C_scaffold_40000329
[Poptr1:94217] model short
(N-term in a sequence gap)
$
2684818
MATLFLYFPVFLALYIISTHFLNRIRNFPPSPLPSLPIIGHLYLLKKPLY 2684967
2684968 RTLSKISDKHGPVI 2685009
(sequence gap)
2685841 LNIMMRMIAEKRYYGGNVSDVEEAKRFRAIHAESFLLGGKTIIGDYIPWI
2685990
2685991
KSKEMEKRLIECNLKRDSFLQCLIEEQRRKILEGDCCGEKKKNLIQVLLS 2686140
2686141 LQETEPEYYTDDIIKGLVV 2686197 (0)
2686644
VILLAGTHTSSSTMEWALSLLLNHPQVLEKAKREIDEHIGHDRLMDEADL 2686793
2686794 AQLPYLRSILNETLRMYPAAPLLVPHESSEECLVGGFRIPRGTMLSVNVW
2686943
2686944 AIQNDPKIWRDPTKFRPER 2687000
2687001
FDNLEGGRDEFKLMPFGHGRRSCPGEGLALRVVGLALGSLLQCFEW 2687138
2687139
QKIGDKMVDMTEASGSAISKAQPLEAICRARPSMLTHLSQI* 2687264
>CYP81B-se2[1]
scaffold_40 (+) 2689865-2690096
N-term
$
2689865 PPRPFPSPPVTGHLYLH*KSIYWT 2689936
2689938 LSKFACQHGPAILLTFGSRRALSDSSPSIAEQCFT
2690042
2690043 NLAWPPRGDCWRKLRKIL 2690096
>CYP81B11
scaffold_40.6 (+) 2691314-2693223
81D like 51% to 81D8
94% to LG_XIV (-) 929000
fgenesh1_pg.C_scaffold_40000330
[Poptr1:94218] gene model
short, 2 frameshifts
$
2691314 MATLILYFLVILALYIITRHFLT 2691382
2691382
KIRNFPPGPFPSLPIIGHLYLLKKPIYRTLSKISSKHGPVILLQLGSRR 2691528
2691529
LLVVSSPSIAEECFTKNDVVFANRPRLLIAKHLAYNSTSLVWAPYGDHWR 2691678
2691679 NLRRIVSIEVLSAYRLQMLSAIRLEEVKSMICVLFRNQKQIVDMKTVFFE
2691828
2691829
LTLNIMMRMIAGKRYYGESVSDVEEAKKFRAIHAETFLIGGKTIIGDYIP 2691978
2691979
WIKSKKMEKRMIECHIKRDSFMQYLIEEQRRKILESDCCGEKKTNLIQVL 2692128
2692129 LSLQETEPEYYTDDIIKGIML 2692191 (0)
2692602 VLLLAGTDTSSTTMEWALSLLLNHPEVLEKAQREIDEHIGHDRLMDEGDLAQ
2692757
2692759
LPYLRSILNETLRMYPPAPLLVPHESSEECLVGGFRIPRGTMLSVNV 2692899
2692900 WAIQNDPKIWRDPTKFRPER 2692959
2692960
FDNLEGGRYEFKLMPFGHGRRSCPGEGLALKVVGLALGSLLQCFEW 2693097
2693098
QKIGDKMVDMTESPGFTVPKAKQLEAICRARPRMLTLLSQI* 2693223
>CYP81B-se3[1]
scaffold_40 (+) 2695016-2695445
81D like N-term frag.
$
2695016 LLFSVFLGLYIITKHFLNEIQNLPPSPFPS
2695105
2695359 QPSFAAAKHLAYNCTNFAGPPYGDYW*NV
2695445
>CYP81B12
scaffold_40.7 (+) 2698671-2700987
81D like 51% to 81D8
88% to LG_XIV (-) 929000
fgenesh1_pg.C_scaffold_40000331
[Poptr1:94219] bad boundary,
gene model too long
$
2698671
MATFFLHFSVFLALYIITRHFLNKIRNFPPSPFPSLPIIGHLYLLKKPIY 2698820
2698821
RALSKISSKHGPVILLQLGSRRQLVVSSPSIAEECFTKNDVVFANRPGYL 2698970
2698971
IAKHLAYNTTGLLWAPYGDHWRNLRRIVSIEVLSAYRLQMLSSIRLEEVR 2699120
2699121
SMICVLFRNQNQIVDMKTVFFELTLNIMMRMIAGKRYYGEDVSDVEEAKR 2699270
2699271
FRAIHAETLLLGGKTIIGDYVPWIKSKKMLKRVIECHLKSDSFMQYLIEE 2699420
2699421
QRRKILESDCCGEKKRNLIQVLLSLQENEPGYYTDDIIKGIML 2699549 (0)
2700385
VLLLAGTDTSSATMEWALSLLLNHPRVLEKAQREIDEHIGHDRLMDEGDL 2700534
2700535
AQLPYLRSILNETLRMYPPAPLLIPHESSEECLVGGFRIPRGTMLSVNMW 2700684
2700685
AIQNDPKIWPDPTKFRPERFDNPEGARDGFKLMPFGHGRRSCPGEGLALK 2700834
2700835
VVGLALGSLLQCFKWQKISDKMVDMTEGPGFTSTKAQPLEAI*RPRPSMHT 2700987
>CYP81B12-de2b
scaffold_40 (+) 2701996-2702187
81D like 84% to 40.7
$
2701996
PGEGSALRVVGLALGSLLQCF*WQKIGDKMVDMTESPGFTALKAKPLEAI 2702145
2702146 CRPRPSMLGHISQI 2702187
scaffold_10588
(+) 2-1138
81D like runs off the end
89% to scaffold_40
(+) 2687031
eugene3.105880001|Poptr1
$
2
EPEYYTDDIIKGLVV (0) 46
518
VILFAGTHTSSTTMEWALSLLLNHPEVLEKAKREIDEQIG 637
638 HDRLMDEADLAQLPYLRSVLNETLRMYPAAPLLVPHESSEECLVGGFRIPR 790
791 GTMLSVNVWAIQNDPKIWRDPTKFRPERFDN 883
884 PEVARDGFKLMPFGYGRRSCPGESMALRVMGLALGSLLQCFEWQKIGDKMVDMTE
1048
1049 ASGFTIPKAKPLKVICRPRPDMLRHLS* 1138
>CYP81C3
scaffold_40.4 (-) 2639518-2637241
81D like 45% to 81A7
3 aa diffs to scaffold_40 (-) 2633172
fgenesh1_pg.C_scaffold_40000322
[Poptr1:94210] gene model
seems correct
$
2639518
MEFLYYHLALLFFLFIVVKNLFHRKRNLPPAPFALPVIGHLYLLKQPLYK 2639369
2639368
SLHALLSRYGPALSLRFGSRFVIVVSSPSVVEECFTKNDKIFANRPKSMA 2639219
2639218
GDRLTYNYSAFVWAPYGDLWRKLRRLAVAEIFSSKSLRKSSTVREEEVSC 2639069
2639068
LIRRLLKVSTSGTQNVELRLLFSILASNVVMIVSAGKRCVEEEHAGTKME 2638919
2638918
KQLFQDFKDKFFPSLAMNICDFIPILRVIGFKGLEKNMKKLHGIRDEFLQ 2638769
2638768
NLIDEIRLKLKKTTSLKTDEVTDGEERRSVAEILLCLQESEPEFYTDEVI 2638619
2638618 KSTVL 2638604 (0)
2637858
MMFIAGTETSAITLEWAMTLLLNHPKVMQKVKAEIDEHVGHGRLLNESDI 2637709
2637708 VKLPYLRCVINETLRLYPPAPLLLPHFSSEACTAGGFDIPQGTMLVVNAW
2637559
2637558
TMHRDPKLWEEPNEFKPERFEAGLGEGDGFKYIPFGIGRRVCPGASMGLQ 2637409
2637408
IVSLALGVLVQCFEWDKVGTVEDTSHGLGMILSKAKPLEALCSPRRDLIT 2637259
2637258 LLSHL* 2637241
>CYP81C4
scaffold_40.3 (-) 2633172-2630892
81D like 45% to 81A7
3 aa diffs to scaffold_40 (-) 2637726 duplicate seq
fgenesh1_pg.C_scaffold_40000321
[Poptr1:94209] gene model
seems correct
$
2633172
MEFLYYHLALLFFLFIVVKNLFHRKRNLPPAPFALPVIGHLYLLKQPLYK 2633023
2633022 SLHALLSRYGPALSLRFGSRFVIVVSSPSVVEECFTKNDKIFANRPKSMA
2632873
2632872
GDRLTYNYSAFVWAPYGDLWRKLRRLAVAEIFSSKSLRKSSTVREEEVSC 2632723
2632722
LIRRLLKVSTSGTQNVELRLLFSILASNVVMIVSAGKRCVEEEHAGTKME 2632573
2632572
KQLFQDFKDKFFPSLAMNICDFIPILRVIGFKGLEKNMKKLHGIRDEFLQ 2632423
2632422 NLIDEIRLKLKKTTSLKTDEVTDGEERRSVAEILLCLQESEPEFYTDEVI
2632273
2632272 KSTVL 2632258 (0)
2631509
MMFVAGTETSAITLEWALTLLLNHPKVMQKVKAEIDEHVGHGRLLNESDI 2631360
2631359
VKLPYLRCVINETLRLYPPAPLLLPHFSSEACTAGGFDIPQGTMLVVNAW 2631210
2631209
TMHRDPKLWEEPNEFKPERFEASLGEGDGFKYIPFGIGRRVCPGASMGLQ 2631060
2631059
IVSLALGVLVQCFEWDKVGTVEDTSHGLGMILSKAKPLEALCSPRRDLIT 2630910
2630909 LLSHL* 2630892
>CYP81C5
LG_IIa (+) 9173256-9175652
81K like 51% to 81K1
72% to scaffold_40 (-) 2637726
fgenesh1_pg.C_LG_II001124
[Poptr1:347618] gene model
seems correct
$
9173256
MESLYHHLALLFFLFLVVKILFRQKQNLPPSPFALPIIGHLHLFKHPQSL 9173405
9173406
QTLSSQYGPILFLKFGCRSTLVVSSPSAVEECFTKNDIIFANRPQSMAGD 9173555
9173556
HLTYNYTGFVWAPYGHLWRSLRRISVIEIFASKSLQKSSIIREEEVCSLL 9173705
9173706
RRLLKAKNGVTAKVDLKFLFSLLTCNVMMRLAAGKPCIDEEVAGTKVEKQ 9173855
9173856
LFQEFKERFSPGLGMNICDFIPILRLIGYKGLEKSTKKLQSTRDKYLQHL 9174005
9174006
IDEIRMRRTSSSSKTAEQWKREGKSSVIETFLSLQDLEPEFLTDTVIKSV 9174155
9174156 LS 9174161 (0)
9175035
MMFVAGTETSAVTLEWAMALLLNHPKAMQKLKAEIDEHVGHGRLLN 9175172
9175173 ESNIVKLPYLRCVIKETLRLYPPAPLLLPHFSSGACTVGGFDIPQGTTLV
9175322
9175323
VNAWAMHRDPKLWEESNEFKPERFEAGLGEQEGFKYIPFGTGRRVCPGAS 9175472
9175473
MGLQMVSIALGALVQCFEWDKVAPVEDMSHSPGISLSKVKPLEALCCPRG 9175622
9175623 DLTTLLYHP* 9175652
>CYP81R1P
scaffold_64 (-) 903647-903045
81K like pseudogene 37% to 81K2
aa107-293
86% to scaffold_64 (-) 863860
eugene3.00640126|Poptr1 gene model wrong
$
903647 SYSYTAFLFAP 903615
903607
YGHLWRTPRRFSVSELFSRGCLDWSTAITEEVRTLLRLILSKVSDDRAKK 903458
903457 VDLNYFFTITSLNVIMKMNAGKKRVEEEKAACIDSEKQCIEDVQKIFPSNPGTSL 903293
903292 LDFFPILKWIGYNGDIEESTVI 903227
903227 KERDEFLQGLIEEVKRKETSS 903165
903162 DTSNTEEVKDQTTVIG 903115
903113 SLLALQKSDPELFTDVVVKGTAI 903045
>CYP81R2
scaffold_64 (-) 881364-879605
81K like 43%
to 81K1
2 aa diffs to scaffold_64 (-) 863860 duplicate seq
92% to scaffold_279 (-)
94272
eugene3.00640125|Poptr1 gene model seems correct
$
881364
MNYMYYCLAFFLSSFLVFKLVFQRSRNLPPSPFGFPIIGHLHLVSKPPMH 881215
881214
KVLAILSNKCGPVFTLKLGSRNIVAVCSLSAAEECYIKNDIVFANRPQSI 881065
881064 FVHYWSYNYAAFLFAPYGHLWRTLRRFSVTELFSRSCLDRSAAISEEVRT
880915
880914
LVRLILSKVSDDGAKKVDLNYFFTITSLNVIMKMNAGKKWVEEEKAACID 880765
880764
SGKQCIEDVQKIFPSNPGTTVLDFFPFLKWFGYRGEEESVIKVYKERDEF 880615
880614
LQGLIEEVKRKETSSVTSNPAEGVKDQTTVIGSLLALQKSDPELYTDEVV 880465
880464 KGTMA 880450 (0)
880228
TLYLAGVDTVDFTTEWAMTFLLNHPERLERVKAEIDREVGHERLVQESDL 880079
880078
PKLRYVRCVVNETLRLYPPAPLLLPHAPSEDCIVGGYKIPRGTIVMVNAW 879929
879928
AIHRDPKLWEDPESFKPERFEGLNNEGEKQGFIPFGIGRRACPGNHMAMR 879779
879778 RVMLALAALIQCFEWERVGKELVDMSIVDALISVQKAKPLEAICTPRPFT
879629
879628 TTLISPP* 879605
>CYP81R1
scaffold_64a (-) 865374-863615
81K like 41%
to 81K1
2 aa diffs to scaffold_64 (-) 879850 duplicate seq
eugene3.00640122|Poptr1 gene model seems correct
$
865374 MNYMYYCLAFFLSSFLVFKLVFQRSRNLPPSPFGFPIIGHLHLVSKPPMH
865225
865224
KVLAILSNKCGPVFTLKLGSRNIVAVCSLSAAEECYIKNDIVFANRPQSI 865075
865074
FVHYWSYNYAAFLFAPYGHLWRTLRRFSVTELFSRSCLDRSTAISEEVRT 864925
864924
LVRLILSKVSDDGAKKVDLNYFFTITSLNVIMKMNAGKKWVEEEKAACID 864775
864774
SGKQCIEDVQKIFPSNPGTTVLDFFPFLKWFGYRGEEESVIKVYKERDEF 864625
864624
LQGLIEEVKRKETSSVTSNPAEGVKDQTTVIGSLLALQKSDPELYTDEVV 864475
864474 KGTMA 864460 (0)
864238
TLYLAGVDTVDFTAEWAMTFLLNHPERLERVKAEIDREVGHERLVQESDL 864089
864088 PKLRYVRCVVNETLRLYPPAPLLLPHAPSEDCIVGGYKIPRGTIVMVNAW
863939
863938
AIHRDPKLWEDPESFKPERFEGLNNEGEKQGFIPFGIGRRACPGNHMAMR 863789
863788
RVMLALAALIQCFEWERVGKELVDMSIVDALISVQKAKPLEAICTPRPFT 863639
863638 TTLISPP* 863615
>CYP81R6
scaffold_279 (-)
96174-94443
81D like 43% to
81K1 43% to 81A6
92% to scaffold_64 (-) 879850
eugene3.02790010|Poptr1 gene model seems correct, short at
XXXXXXX
$
96174
MDYMYYCLAFFLSSFLVFKLVFQRSRNLPPSPFRFPIIGHLHLVTKPPMH 96025
96024
KVLAILSNKCGPIFTLKLGSKNIVAVCSLSAAEECFLKNDIVFANRPQSI 95875
95874 FFHYWSYNYAAFLFAPYGHLWRTLRRFSVTELFSRSCLDRSTAITEEVRT
95725
95724
LLRLILSKVSDDGAKNVDLNYFFTITSLNVIMKMIAGKKWVEEEKAACID 95575
95574
SGKQCLEDVQKIFPSNTGTILKWVGYKVKEESVIKVFKERDEFLQGLIEEVKRKETSSVTSNPA 95383
95382 AEGVKDQKTVIGSLLALQKSDPELYTDEVVKGTMA
95278 (0)
95066 TFYLAGVDTVDFTTEWAMTFLLNHPERLERVKAEIDREVGHERLVQESDL
94917
94916
PKLRYLRCVVNETLRLYPPAPLLLPHAPSEDCTIGGYEIPRGTIVMVNVW 94767
94766
AIHRDPKLWEDPESFKPERFEGLNNEGEKQGFIPFGIGRRACPGNHMAMR 94617
94616
RVMLALAALIQCFEWERVGQELIDMSIVKALISVQKAKPLEATCTPRPFT 94467
94466 TSLISPP* 94443
>CYP81R6-de2b
scaffold_279 (-) 99748-99595
pseudogene PKG-PERF region of 81 like
78% to scaffold_64 (-)
879850
eugene3.02790011|Poptr1
$
99748 SPLLLPHAPSEDCIVGGYKIPRGMIVMVSVWA 99653
99651 VHRDPKSWEGSESFKLEKY 99595
>CYP81R4
scaffold_40.2 (-) 2629530-2627051
81D like 50% to 81K2
scaffold_40 (-)
2625145 81D like
56% to scaffold_64 (-) 879850
fgenesh1_pg.C_scaffold_40000320
[Poptr1:94208] 2 genes fused?
$
2629530
MLSYCCLAFFFLIFLVIKYVFHGNKNLPPSPPSLPIIGHLHLLKPPLHQT 2629381
2629380 LQTLLQQYGPVLSLKAGCRSMLVLSSPSAVEECFTKNDVVLSNRPTFLAG
2629231
2629230
DHLTYNYTTIIFSPYGHLWRTLRRFAVLEMFSQKGLNKFSAVRKEEVCSL 2629081
2629080
LRQLSKVSCSGNKKVDLHYFFSLLSFNVAMRMSAGKKCIEEEVACSDLGK 2628931
2628930
QDLTELKKIFHPPLSTGLCDFFPALKWIDYKGFEKSVIKVRDGRDGFSQD 2628781
2628780
LIDEIRQKKTSSCSSPDAGPEKTTMIETLLSLQEQEPDFYTDDIIKGLVV 2628631 (0)
2627671
AIFAAGTDTVAVTMEWAMSLLLNHPEILQKVREEIDSEVGHTRLVEELDL 2627522
2627521
PKLKYLRCVINETLRLYPVVPLLLPRCPSEDCTVAGYKVPKGTILLVNAF 2627372
2627371
AMHRDPKMWEQPDRFKPERFEVTEEEKEGIKFIPFGMGRRACPGSNMGMR 2627222
2627221
AIMLAMAALFQCFEWERTGPEMVDMTVAAAISMVKATPLEAFCKPYHSMA 2627072
2627071 NLFSQL* 2627051
>CYP81R5P
scaffold_40 (-) 2625277-2624657
81D like exon 2 94% to40.2
$
2625277
AMFSAGTDTVAVTMEWAMALLLNHPEILQKVRVEIDSQVGHTRLVEEVDL 2625128
2625127
PKLKYLRCVINETLRLYPVVPLLLPRCPSEDCTVAGYNVPKGTILLVNAF 2624978
2624977
AMHRDPKMWEQPDRFKPERFEATVEEKEGIKFIPFGMGRRACPGSNMGMR 2624828
262482