584 sequences - 18 from other species = 566
cottonwood sequences
Last modified April 1, 2005 D. Nelson
Some names revised on 8/24/2006, mostly minor
changes like v1 to P1 etc.
<CYP51 Clan, 2 sequences, both full length,
95% identical.
>CYP51G1
Scaff LG_I (-)4925909-4924104
84% to Arab. 51G1 95% to Scaff LG_III CYP51G5
seq.
fgenesh1_pm.C_LG_I000188|Poptr1 gene model correct
FKBP-type peptidyl-prolyl cis-trans isomerase
downstream
$
4925909 MTGDTDNKFLNVGLLIIATLLVAKLISALIMPRSQKRLPPVMKGWPLIGG
4925760
4925759
LIRFLKGPIVMLREEYPKLGSVFTVNLVNRKITFLIGPEVSAHFFKASEV 4925610
4925609
DLSQQEVYQFNVPTFGPGVVFDVEYSIRQEQFRFFTEALRVNKLKGYVDQ 4925460
4925459 MVVEAE 4925442 (0)
4925102
DYFLKWGDSGVVDLKYELEHLIILTASRCLLGREVRDKLFDDVAALFHD 4924956
4924955
LDNGMLPVSVLFPYLPIPAHRRRDRARKKLAEIFANIINSRKLASKSEND 4924806
4924805
MLQCFIDSKYKDGRPTTESEITGLLIAALFAGQHTSSITSTWTGAYLLRH 4924656
4924655
NEYLSAVLEEQKNLMKKHGNKVDHDILSEMDVLYRCIKEALRLHPPLIML 4924506
4924505
LRSSHSDFSVTTRDGKEYDIPKGHIVATSPAFANRLPHVFKDPDSYDPDR 4924356
4924355
FAYGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWSHLLRNFELE 4924206
4924205
LISPFPEIDWNAMVVGVKDKVMVRYKRRELSVN* 4924104
>CYP51G5
Scaff LG_III (+)14476000-14477787
83% to Arab. 51G1 95% to Scaff LG_I CYP51G1
seqF.
eugene3.00031308|Poptr1 gene model correct
FKBP-type peptidyl-prolyl cis-trans isomerase
downstream
$
14476000
MTKDTDNKFLNVGLLILATLLVAKLISALIMPRSQKRLPPVMKGWPLIGG 14476149
14476150
LIRFLKGPIVMLREEYPKLGSVFTVNLANWKITFLIGPEVSAHFFKASEA 14476299
14476300
DLSQQEVYQFNVPTFGPGVVFDVDYSIRQEQFRFFTESLRVSKLKGYVDQ 14476449
14476450
MVVEAE 14476467 (0)
14476789
DYFSKWGDSGVVDIKYELEHLIILTASRCLLGREVRDKLFDDVSALFHD 14476935
14476936
LDNGMLPISVLFPYLPIPAHRRRDRARKKLAEIFANIINSRKLAGKSEND 14477085
14477086
MLQCFIDSKYKDGRPTTESEITGLLIAALFAGQHTSSITSTWTGAYLLRH 14477235
14477236
NEYLSAVLEEQKNLMKKHGNKVDQDILSEMGVLHRCIKEALRLHPPLIML 14477385
14477386
LRSSHSDFSVTTRDGKEYDIPKGHIVATSPAFANRLPHVFKDPERYDPDR 14477535
14477536
FAAGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWSHLLRNFEFE 14477685
14477686
LISPFPETDWNAMVVGVKDKVMVRYKRRELSVN* 14477787
<CYP71
clan 22 families, 71, 73, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 89,
92,
93, 98, 701, 703, 705, 706, 712, 736
66
sequences
29 nearly
complete CYP71 like sequences and some related partial seqs.
<71B
subfamily sequences (10 seqs all named)
three full
length sequences 71B38, 71B40 and 71B41 are all about 97% identical
this is
too similar for the genome duplication date.
>CYP71B41-de1b
LG_VIII (-) 12482829-12482572
71B like 100% to LG_VIII.4 LG_VIII.10
LG_VIII.16
eugene3.00081725|Poptr1 N-term only
$
12482829
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12482680
12482679
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSA 12482572
>CYP71B44P
LG_I
(+) 19811612-19812590
71B
like pseudogene 51% to 71B36
53% to 71B41
fgenesh1_pg.C_LG_I001954
[Poptr1:64602] model short
exon 1
$
19811612
MACHDPLIMWSLPLVLFFSLLMFLLIRKKQNKQQIPPTPPRLPIIGNLHQLGDLS 19811776
19811777
QRSLWQLSKKYGPVILLKLGAVPAVVISSAEAAKEVLKTNDLHACSRPLL 19811926
19811927
AGTGRLSYNYSDVSFTYTYGDYWRKM*KICVLELCSARRVQSFLF 19812061
19812135 IREEEVALLIDTISAYSFSATPVDLSEKILSFTANITCRAAFGK
19812266
19812267
SFQEIKGFDGKRFEEVIREASAILASFSAADFFPKDGWIIERLTG 19812401
19812402 LLHSRLERSFRELDVLYRRVIDDHIKLEE
19812485
19812486
EEKEDIVGGPLKL*RDQTEFGTIQLTHDHIKAKLM 19812590 (0)
>CYP71B40v1
LG_VIII.4 (-) 12489221-12487004
71B like 53%
to 71B36 97%
to LG_VIII.16
eugene3.00081726|Poptr1 gene model short at N-term
$
12489221
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12489072
12489071
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDVAF 12488922
12488921 CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKILTLELFSLKRVQSFRF
12488772
12488771
IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12488622
12488621
FDRDKFHEVVHDTVAVVGSISADESIPYLGWIVDRLTGHRARTERVFHEV 12488472
12488471
DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKEQTELGASQFTKDNIKAILL 12488316 (0)
12487621
NLFMAGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 12487472
12487471
DQLEYLRMVIKETLRLHPPGPLLIPRETMSHCKVSGHNIYPKMLVQINVW 12487322
12487321
AIGRDPRYWKDPEEFFPERFLDRSIDYKGQSFEYLPFGSGRRICPGMHMG 12487172
12487171 SITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVP
12487022
12487021 VNYLP* 12487004
>CYP71B40v2
scaffold_994 (+) 9952-10569
CYP71B40 100% match exon 2
fgenesh1_pg.C_scaffold_994000003|Poptr1 duplicate seq
$
9952 NLFMAGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 10101
10102 DQLEYLRMVIKETLRLHPPGPLLIPRETMSHCKVSGHNIYPKMLVQINVW
10251
10252
AIGRDPRYWKDPEEFFPERFLDRSIDYKGQSFEYLPFGSGRRICPGMHMG 10401
10402
SITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVPVNYLP* 10569
>CYP71B41
LG_VIII.16 (-) 12479032-12476806
71B like 54%
to 71B36
eugene3.00081724|Poptr1 gene model short at N-term
$
12479032
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12478883
12478882
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDLAF 12478733
12478732
CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKIVTLELFSLKRVQSFRF 12478583
12478582
IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12478433
12478432
FDRDKFHEVVHDTEAVAGSISADESIPYLGWIVDRLTGHRARTERVFHEL 12478283
12478282
DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKEQTELGASQFTKDNIKAILL 12478127 (0)
12477423 NLFLGGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI
12477274
12477273
DQLEYLRMVIKETLRLHPPAPLLIPRETMSHCKVSGHNIYPKMLVQINVW 12477124
12477123
AIGRDPTYWKDPEEFFPERFLDSSIDYKGQSFEYLPFGSGRRICPGMHMG 12476974
12476973
FITMEIILANLLYCFDWVYPDGMKKEDINMEEKAGVSLTTSKKTPLILVP 12476824
12476823 VNYLQ* 12476806
>CYP71B38
LG_VIII.10 (-) 12547429-12545187
71B like 54%
to 71B36 97% to LG_VIII.4
fgenesh1_pg.C_LG_VIII001676|Poptr1 gene model short on
the N-term
$
12547429
MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12547280
12547279
QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDLAF 12547130
12547129
CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKVLTLELFSLKRVQSFRF 12546980
12546979
IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12546830
12546829
FDRDKFHEVVHDTEAVAGSISADESIPYLGWIVDRLTGHRARTERVFHEL 12546680
12546679
DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKDQTELGASQFTKDNIKAILL 12546524 (0)
12545804
NLFLGGVDTISLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 12545655
12545654
DQLEYLRMVIKETLRLHPPAPLLITRETMSHCKVSGHNIYPKMLVQINVW 12545505
12545504 AIGRDPTYWKDPEEFFPERFLDSSIDYKGQSFEYLPFGSGRRICPGMHMG
12545355
12545354
FITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVP 12545205
12545204 VNYLQ* 12545187
>CYP71B43P
LG_X.28 (-) 6738703-6736564
71B like 52%
to 71B36 86% to LG_VIII.4
fgenesh1_pg.C_LG_X000579|Poptr1 gene model wrong 2
frameshifts
possible pseudogene or two seq. errors.
$
6738703 MALYAVPLWLPLILLLPLLLLFMKRMKDAGQSEQLLPPGPP 6738581
6738581
KLPILGNLHQLSSLPHQSMWHLSKKYGPVMLLRLGQIPTVVISSAEAA 6738438
6738437 REVLKVHDLAFCSRPLLSGAGRLTYNYLDIAFSPYSDHWRNMRKIVTLEL
6738288
6738287
FSLKRVQSFRFIREEEVGFLVNSLSESSALAAPVDLTQKVYALVANITFR 6738138
6738137 VAYGFDYRGTTFDRDRFHEVVHDTEAVVGSISADEYVPYLG 6738015
6738015
MIVDWLTGHRARMERVFHELDTFFQHVIDNHLKPGRIKDHDDMIDV 6737878
6737877 LLRIEKEQTELGASQFTSDNIKAVLL 6737800
(0)
6737179
NLFLGGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGKKGRVTEGDV 6737030
6737029
DQLEYLRMVIKETLRLHPPAPLLLPRETMSHCIVSGYNIYPKTLVHVNVW 6736880
6736879 AIGRDPKYWRDPEEFFPE 6736826
6736827
RFLDSSCDFNGQSFEYLPFGSGRRICPGIHMGSITVEIILSNLLHCFDWI 6736678
6736677
LPHGMQKEDINMEEKAGVSLAPSKKTPVILVPVNYLQ* 6736564
>CYP71B42P
LG_VIII.30 (-) 12460686-12458958
71B like pseudogene 95% to LG_VIII.31
54% to 71B24
eugene3.00081722|Poptr1 gene model short, frameshifts
$
12460686 MAFYILPLALLLLLLFPLPLILKKKQQ 12460608
12460608 KLYVLELFSLK 12460576
12460577
RVQSFRFIREEVISLLMNSISQSSSPATPVNLTQMLYSVFASIVFGMALG 12460428
12460427 KSFQGSDFHNERCRKSIHEAE 12460365 (small deletion
here)
12460365
GHNRMLITGYDARIERVFLELDTLFQQVIDDHLKAERKERKEDIINVLLKMERDQT 12460198
12460197 ESSAPQLTKYNIKAVIL 12460147 (0)
12459572
NIFLGGVDTGAITVIWATAEIARNPIIMKKAQEEIRSSVGQKGRATEERT 12459423
12459422
DELQYLKMVIKETLRLHPPAPLLLPRETMSRCQINGYDIYLKTLIQVNAW 12459273
12459272
AIGRDPEYWRDSEEFFPERFVDSPIDYKGQRFEFLPFGSGRRVCPGILMG 12459123
12459122 VTMVELALANLLHCFDWKLPNAV 12459054
12459053 AINMEEAAGLTISKKNPLFLVRINYPQQAQPD 12458958 sequence gap
here
>CYP71B39P
LG_VIII.31 (-) 12534486-12532739
71B like pseudogene 95% to
LG_VIII.30
eugene3.00081733|Poptr1 gene model short, frameshifts
$
12534486 LLLLLLFPLPLILKKKQQ 12534433
12534433 KLYVLELFSLK 12534401
12534402
RVQSFRFIREEVISLLMNSISQSSSPATPVNLTQMLYSVFASIVFRMALG 12534253
12534252
KSFQGSDFHNERCRKAIHEAE 12534190 (small deletion here)
12534190
GHNRMLITGYDARIERVFLELDTLFQQVIDDHLKAERKERKEDIINVLLKMERDQT 12534023
12534022 ESSAPQLTKYNIKAVIL 12533972 (0)
12533397
NIFLGGVDTGAITVIWATAEIARNPIIMKKAQEEIRSSVGQKGRATEERT 12533248
12533247
DELQYLKMVIKETLRLHPPAPLLLPRETMSRCQINGYDIYLKTLIQV 12533107
12533107
NAWAIGRDPEYWRDSEEFFPERFVDSPIDYKGQRFEFFPFGSGRRVCPGI 12532958
12532957 VMGVTMVELALINLLYCFDWKLPNAV
12532880
12532879 DINMEEAAGLTISKKMPLFLVPINYPQRAQPDKMSRTSLLSKHTCS*
12532739
>CYP71B45P
LG_X (-) 6735316-6733923
71 like pseudogene new 75% to CYP71B42P
fgenesh1_pg.C_LG_X000578|Poptr1 71 like I-helix + C-term (pseudogene?)
downstream of fgenesh1_pg.C_LG_X000579
$
6735316
ILSLGRTPTLVVSSAEAARAVLKTHDLDCCCRPRLSGSGRLTYNHVYVAF 6735167
6735166
APYGDYW*EMRKLFVLEPFILKRVQSFRFITGEVARIMNSIPQSSS 6735029
6734867 PYAG*ILDKVTDHHARIERVLH 6734802
6734482 NIFLGGVHAGAITVIWALEELAWNPRTMKKAQDEIRNSVGKKGRLAEESI
6734333
6734332 DEL 6734324
6734322 TLVIKETLRWQ 6734290
6734288
PPAPLLLPRETMSHCKINGYHIYPKILIQINV*AIGSDPTYWNDS*EFF 6734142
6734141
PERFVDSSID*KGQHFEFLPFGSGRRGCPGILMGVTMVELALANLLYCLDW 6733989
6733988 KSAKAIDINMEEAAGLTISKKM 6733923
<71D
subfamily 40 sequences all named
>CYP71D38-de2b
scaffold_710 (-) 1341-893
pseudogene 89% to 726B1 J-helix to end
eugene3.07100001|Poptr1 gene model short
$
1341 VIKNPRVLEKAQKEGRQVFND 1279
1276
LGTIPDETSLHDSKFLKLIIKETLRLHPPAPLMIPIECRKRYNVNGYDTHVKSKVLINAWAIGRDP 1079
1078 NYWNELERFYPERFINVSTDFKGIDFEFIPFGAGKRMCPGKL 953
953
VKRIRDLKLIPV 918
916 SYRSLVG* 893
>CYP71D38
scaffold_710 (-) 19864-18128
726A like 50% to 726A1 euphorbia
eugene3.07100003|Poptr1
$
19864
MLISLPVFLTILLVISILWTWTKFIKSNKSSSNPPPGPWKLPFIGNLHQL 19715
19714
VHPLPHHRMRDLAKKFGPVMQLQVGEVSTVIISSSEAAKEVMKTHEINFV 19565
19564
ERPHLLAASVLFYNRKDIAFAPYGEYWRQLRKISILELLSAKRVRSFKSI 19415
19414
REEEVSNFIASIYSKEGSPINLSRMIFSLENGITARTSIGNKCKNHEGFL 19265
19264
PIVEELAEALGGLNMIDIFPSSKFLYMVSRVRSRLERMHREADEILESII 19115
19114
SERRANSALASKMGKNEEDDLLGVLLNLQDHGNLEFQLTTSTIKAVIL 18971 (0)
18748
EMFSGGGDTSSTALEWAMSELIKNPRVMEKAQKEVRQVFNDLGTIPDETS 18599
18598
LHDLKFLKLIIKETLRLHPPVPLIPRECRKRCDVNGYDIHVKSKVLINAW 18449
18448
AIGRDPNCWNEPERFYPERFINVSTDFKGSDFEFIPFGAGKRMCPGMLFA 18299
18298
TANTEFPLAQMLYHFDWKPAGGLKPENLDMTESFGGAVKRKQDLKLIPIS 18149
18148 YRSLVG* 18128
>CYP71D38-de2c
scaffold_710 (-) 21263-20928
pseudogene
79% to 726B1 C-term
$
21263 NDLGTILDETS 21231
21213 LKLITEETLRLHPSAPLIPREWRKRCQVNGYD
21118
21117 NIHVKSKVLINAWA 21076
21085 CMGMLFAIA 21059
21059
HHFDWKPIDGLKPENLDMTESLGGATKRKRDLKLIFISYRSLVG 20928
>CYP71D23P
LG_XV.1 (+) 7113294-7114921
71D like possible
pseudogene 46% to 71D10 57% to LG_VII.22
fgenesh1_pg.C_LG_XV000688|Poptr1 gene model wrong 2 frameshifts
and a stop codon
$
7113294
MEQHFPLFAIFLTFLLFIFMVLRMRKKSETNKYLTTNPPPGPWKLPLVGN 7113443
7113444
IHHVAGHQIHHRFTDLARKYGPVMQILLGEVRFVVISSRETAKEVMKTNE 7113593
7113594
NIIVDRPDGVIPRIVFYNGKAISFTPYGEYWKQLRKSCSSKLLSPQCVRS 7113743
7113744
LIRSTMEEEVSDFVTSISSKEGSPINLSKMLFTLTFGLISRVILGKKGKN 7113893
7113894
QALLSSIEEWKQGGAGFDVADIFPSFKLFHSLGWARSKFVRQHQEIGEML 7114043
7114044
ETVINERRASKIRTKTSEHEIEEDFLDVLVNMQHALR 7114154
7114156 NLEFTNDNIKAILL 7114197 (0)
7114292 EFFLAGSDSSSAVMEWAMSEMLKNPRHMKRAQKEVRVVFTKMGNDDETRL
7114441
7114442
HELKYLQLIIKETTRLHPPAPLILRACREACKINGHDIPDRSNVMINAWA 7114591
7114592 IGRDPTYWNEA* 7114627
7114628
KFNPERFLDSSIDYMGTNFEFIPFGAGKRKCPGMAFGLAIVEMALAKLLY 7114777
7114778
IFDWKLCDGVKNEDLNMKEDTALGSTVKRKHELYLIPIPYHPSSPAK* 7114921
LG_VII.22 (+) 5888964-5891028
71D like 50%
to 71D4, 75% to LG_VII.12
fgenesh1_pg.C_LG_VII000682|Poptr1 gene model correct
$
5888964
MEFPILLASLLFIFAVLRLWKKSKGNGSTLALPPGPWKLPLIGNIHQLAG 5889113
5889114 SLPHHCLTDLAKKYGPVMQLQIGEVSTVVVSSGEAAKEVMKTHEINFVER
5889263
5889264
PCLLVANIMFYNRKNIGFAPYGDYWRQMRKVCTLELFSAKRVRSFRSVRE 5889413
5889414
EEVSNFIRNIYAKAGSPINLSKMMLDLSNGVIARTSIGKKSKNQEAFLPI 5889563
5889564
IEDVAEALAGLNIVDVFPSAKFLYMISKLRSRLERSHIEADEILENIINE 5889713
5889714
RRASKEERKTDQDNEVEVLLDVLLNLQNQGNLEFPLTTDSIKAIIV 5889851 (0)
5890402
EMFGAGSETTSTLLEWSMSEMLKNPRVMKKAQEEVRQVFSDSENV 5890536
5890537
DETGLQNLKFLKLIIKETLRLHPPISLIPRECSKTCEINGYVIQAKSKVI 5890686
5890687
INAWAIGRDSNDWTEAEKFYPERFQDSSIDYKGTNFEFIPFGAGKRMCPG 5890836
5890837
MLFGIGNAELLLARLLYHFDWKLSSGAALEDLDMNEAFGGTVKKKHYLNL 5890986
5890987 IPIPYGPCPLPVE* 5891028
>CYP71D42
LG_VII.3 (-) 5804810-5802796
71D like 49%
to 71D4 99% to LG_VII.27
note these two gene names were both assigned to the same
sequence
eugene3.00070713|Poptr1 gene model seems correct
$
5804810 MDQVFQFIYILIVPFLLLIFPVLRLWKKSQGNNSSTPPPPPGPWKLPLIGNLHQLL 5804643
5804642
GSLPHQVLRDMANKYGPVMQLQIGEVPTVIISSPEAAKEAIKTHEINFVD 5804493
5804492
RPCLLVAKVMFYNSKDIAFAPYGDYWRQMKKVCVLELLSAKRVKSFRSIR 5804343
5804342
EEEVSNFMRTIYSKAGSPINLSKMMFDLLNGITARASVGKKYKHQEAFLP 5804193
5804192
IIEQVIEAMGGTNIADVFPSSKLLYMISRFRSRLERSHQDADVILENIIY 5804043
5804042
EHRVRREVAKTDEESEAEDLLDVLLNLQNHGDLGFPLTTDSIKATIL 5803902 (0)
5803416
ELFTAGSDSSSTLMEWTMSEMLRNPRVMRKAQEEVRQVFSNTEDVDETCL 5803267
5803266
HNLEFLKLIIKETLRLHPPAPFIPRECNKTCEINGYVIQAKSKVMINAWA 5803117
5803116
IGRDSDHWTEAEKFYPERFLDSSIDYMGTNFEFIPFGAGKRMCPGILFGI 5802967
5802966
ATVELPLAQLLYHFDWKLPNGDLSEDLDMNEVFVGTVRRKHQLNVIPIPF 5802817
5802816 YPSPLQ* 5802796
>CYP71D43
LG_VII.12 (-) 5753477-5751477
71D like 48% to 71D4
94% to LG_VII.3
estExt_fgenesh1_pg_v1.C_LG_VII0660|Poptr1 gene model seems
correct
$
5753477 MEQVFQFIQILVPFLLLIFTVLRLWKKSQGNNSSTPPPP
5753361
5753360
PPPPPGPWKLPLIGNLHQLLGSLPHQVLRDMANKYGPVMQLQIGEVPTVI 5753211
5753210
ISSPEAAKEAMKTQEINFVDRPCLLVAKVMYYNSKDIGFAPYGDYWRQMK 5753061
5753060
KVCVLELLSAKRVKSFRSIREEEVSNFIRAIYSRAGSPINLSKMMFDLLN 5752911
5752910
GITARASVGKKYKHQEAFLPIIEQVIEAVGGTNIADVFPSSKLLYMISRF 5752761
5752760
RSRLERSHQDADVILENIIYEHRVRREVAKTDEESEAEDLLDVLLNLQNH 5752611
5752610 GDLGFPLTTDSIKATIL 5752560 (0)
5752097
ELFAGGSDTSSTLMEWTMSEMFRNPRVMRKAQEEVRQVFSNTENVDETCL 5751948
5751947
HNLEFLKLIIKETLRLHPPVPFIPRECNKTCEINGYVIQAKSRVMINAWA 5751798
5751797
IGRDSDHWTEAEKFYPERFLDSSIDYKGTNFDFIPFGAGKRMCPGILFGI 5751648
5751647
ATVELPLAQLLYHFDWKLPNGDLLEDLDMNEVFGGTVRRKHQLNLIPIPF 5751498
5751497 YPSPLQ* 5751477
>CYP71D24P
scaffold_228 (-) 35436-34671
71D like 2 copies exon 1 pseudogene
eugene3.02280006|Poptr1
$
35436 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQLASSATMP 35266
35266 HYLCAHWARKYG
35231
35234 RTPPLGPWKLPLIGNIHQLASSATMP 35157
35180 LASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVIISSPDAAKEVLKT
35040
35039
QEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRKACIWGLFSATRK 34890
34889 LSFRSIREEEVSNLISSIRSKEGSPINLRELLLDLSNETITRTSIGKKCK
34740
34739 NKARFLHTIEQVSKSVGGVNIFL 34671
>CYP71D25Pv1
scaffold_228 (-) 39599-38903
71D like 62% to LG_VII.22 41% to 71D4
exon 1 pseudogene
eugene3.02280007|Poptr1 100% to scaffold_228 (-)
34668-35436
$
39559 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQ
39413
39412
LASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVIISSPDAAKEVLKT 39272
39271
QEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRKACIWGLFSATRK 39122
39121
LSFRSIREEEVSNLISSIRSKEGSPINLRELLLDLSNETITRTSIGKKCK 38972
38971 NKARFLHTIEQVSKSVGGVNIFL 38903
>CYP71D25Pv2
scaffold_1911 (+) 7175-8184
71V like 37% to 71V5
fgenesh1_pg.C_scaffold_1911000001|Poptr1 contains a
duplication from exon 1
identical in seq to each other in overlap region
almost identical to 71D25P 2 aa diffs probable duplicate sequence
$
7175 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQLASSATMPHYLCAHWAKKYG 7381
7381 TPPPGPWKLPLIGNIHQLASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVII 7536
7537
SSPDAAKEVLKTQEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRK 7686
7687 ACIWGLFSATRKLSFRSIREEEVSNLISSIRSKAGSPINLRELLLDLSNE
7836
7837
IITRTSIGKKCKNKARFLHTIEQVSKSVGGVNIVDLFPSARLVHMISNMT 7986
7987
SSLQRLHEETDQMLEDIINERRASRVEKKTGENKIEAGDDLLDVLLNLQD 8136
8137 DGNFKVKTDSIKSIIL 8184 (0)
>CYP71D36Pv1
LG_I (-) 29491328-29490406
71D like EXXR 61% TO CYP71D26
eugene3.00012614|Poptr1 mid
regioN to EXXR 71D like
29491328 DTVDVLLNL*GQADLEFTLTTKNIKAIIL 29491242 (0)
29490942 DMFVAGSETSSRTVEWAK 29490889
29490889 TELAKHPKVMEKAQAEARQVFANVDEAGLHKLDHLQLLIKETL 29490761
29490759 NIPPIPLLFPRESKEACKITGYDMPAQSKVTVNIWGIGREPINWTEPERFYP
29490604
29490603 ERFLDSSMDYKGIDFKFIPFGAG 29490535
29490534 ILFGMATYVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS*
29490406
>CYP71D36Pv2
scaffold_1517 (-) 9207-8703
3 aa diffs to 71D36P, duplicate seq.
eugene3.15170002|Poptr1
gene model wrong
$
9207 LNEMAKHPKVMEKAQAKARQVFANVDEAGLHKLDHLQLLIKETL 9076
9074 NIPPIPLLFPRESKKACKITGYDMPAQSKVTVNIWGIGREPINWTEPERFYP
8919
8918 ERFLDSSMDYKGIDFKFIPFGA 8853
8852 GILFGMATVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS*DEK*SLH
8703
>CYP71D35P
LG_I (-) 29497891-29497622
71B like pseudogene no model exists
98%
to scaffold 1517, 93% to scaffold 6967, 47% to 71B18
$
29497891 KVTVNIWRIGREPINWTEPER 29497829
29497826 FYPERFLDSSMDYKGIDFKFIPFGAGILFGMAT
29497728
29497726
TVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS 29497622
>CYP71D36Pv3
scaffold_6967 (-) 2023-1561
100% to 71D36Pv2
eugene3.69670001|Poptr1 gene model wrong
$
2023 LNEMAKHPKVMEKAQAKARQVFANVDEAGLHKLDHLQLLIKETL 1892
1890
NIPPIPLLFPRESKKACKITGYDMPAQSKVTVNIWGIGREPINW 1759
1758
TEPERFYPERFLDSSMDYKGIDFKFIPFGAGILFGMATVVLPLAQLLCFL 1609
1608 DWIPPNGLRSADLVTS 1561
>CYP71D34
LG_I.29 (-) 29516258-29514652
71D like new 52% to 71D10 93% to LG_I.2
eugene3.00012615|Poptr1 gene model short one frameshift
$
29516258
MEQLQTPPSLVLLPSLLFIFMVLRMLKKSKSKDLTPNLPPGPRKLPVIGN 29516109
29516108 LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMKVHD
29515959
29515958 INFAHRPHLPVGQIIFYNCTDIATA 29515884
29515882
AAYGDYWRQLRKVSILELLSPKRVQSFRSIREEEVSSLIGSISSSAGSII 29515733
29515732
NLSRMLFSVAYNITTRAAFSKLRKEEEIFVPLVQGIIQVGAGFNISDLFP 29515583
29515582 SIKLIPWITGMRSRMERLHQEADRILESIINDHRARKAEGNSSNESKADN
29515433
29515432 LVDVLLDLQEHGNLDFSLTTDNIKAVIL
29515349 (0)
29515263
DIFIAGTETSSTILQWAMSELLKHPEVMEKAQTEVREAFGKDGSVGELNY 29515114
29515113
LKMVIKETMRLHPPLPLLLPRECREECGINGYNIPIKSRVLVNVWAIGRD 29514964
29514963 SDYWVEAERFHPERFLDSSIDYKGVNFEFTPFGAGRR
29514853
29514852
MCPGILFGISNVDLLLANLLYHFDWKLPGDMEPESLDMSEAFGATVRR 29514709
29514708 KNALHLTPILHHPHPVRS* 29514652
scaffold_11610
(-) 1395-784
71D LIKE 96% to 71D34
eugene3.116100001|Poptr1 gene model wrong exon 2 only
$
1395
DLFIAGTETSSTILEWAMSELLKHPEVMEKAQTEVREVFGKDGSVGELNY 1246
1245
LKMVIRETMRLHPPLPLLLPRECREECGINGYNIPIKSRVLVNVWAIGRD 1096
1095
SNYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRRMCPGILFGISNVD 946
945 LLLANLLYHFDWKLPGDMKLESLDMSEAFGATVRRKNALHLTPILHQPHPVRS*
784
>CYP71D33P
LG_I
(-) 29524379-29524047
71B
like pseudogene
eugene3.00012616
[Poptr1:550175] model short
$
29524379
VWAIGRDSDYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRR 29524248
29524247
MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGATVRR 29524104
29524103 KNALHLTPILHHPHPVRS* 29524047
>CYP71D32P
LG_I
(-) 29530020-29529344
71B
like pseudogene
eugene3.00012617
[Poptr1:550176]
$
29530020
MEQPQIPSCLVLLPSLLFIFMVLRMLKKSKTKDLTPNLPPGPRKLPVIGN 29529871
29529870
LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMK 29529730
29529676 VWAIGRDSNYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRR
29529545
29529544
MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGATVRR 29529401
29529400 KKALHLTPILHHPHPVRS* 29529344
>CYP71D31P
LG_I (-) 29532948-29532496
71B like pseudogene C-term
eugene3.00012618|Poptr1
$
29532948
VIRETMRLHPPLPLLLPRECREECGINGYNIXIKSRVLVNAWAIGRDSNY 29532799
29532798 WVEAERFHPERFLDSSIDYKGVNFEFTPFGAGRR
29532697
29532696
MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGAAVRR 29532553
29532552 KNALHLTPILHHPHPVRS* 29532496
>CYP71D30P
LG_I
(-) 29540735-29539563
71B
like PSEUDOGENE 2 models are from same gene
eugene3.00012620
[Poptr1:550179] gene start
eugene3.00012619
[Poptr1:550178] gene end
$
29540735
MEQLQIPTSLVLLSSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGN 29540586
29540585 LHQLFCSLPHHRLR 29540544
(sequence gap)
29539973
LLPRECREECGINGYNIPIKSRVLVNVWAIGRDSNYWVEAERFQPERFLD 29539824
29539823 SSIDYKGVNFEFTPFGAGRR 29539764
29539763 MCPGILFG