584 sequences - 18 from other species = 566 cottonwood sequences

Last modified April 1, 2005  D. Nelson

Some names revised on 8/24/2006, mostly minor changes like v1 to P1 etc.

 

<CYP51 Clan, 2 sequences, both full length, 95% identical.

 

>CYP51G1

Scaff LG_I (-)4925909-4924104

84% to Arab. 51G1 95% to Scaff LG_III CYP51G5 seq.

fgenesh1_pm.C_LG_I000188|Poptr1 gene model correct

FKBP-type peptidyl-prolyl cis-trans isomerase downstream

$

4925909 MTGDTDNKFLNVGLLIIATLLVAKLISALIMPRSQKRLPPVMKGWPLIGG 4925760

4925759 LIRFLKGPIVMLREEYPKLGSVFTVNLVNRKITFLIGPEVSAHFFKASEV 4925610

4925609 DLSQQEVYQFNVPTFGPGVVFDVEYSIRQEQFRFFTEALRVNKLKGYVDQ 4925460

4925459 MVVEAE 4925442 (0)

4925102 DYFLKWGDSGVVDLKYELEHLIILTASRCLLGREVRDKLFDDVAALFHD 4924956

4924955 LDNGMLPVSVLFPYLPIPAHRRRDRARKKLAEIFANIINSRKLASKSEND 4924806

4924805 MLQCFIDSKYKDGRPTTESEITGLLIAALFAGQHTSSITSTWTGAYLLRH 4924656

4924655 NEYLSAVLEEQKNLMKKHGNKVDHDILSEMDVLYRCIKEALRLHPPLIML 4924506

4924505 LRSSHSDFSVTTRDGKEYDIPKGHIVATSPAFANRLPHVFKDPDSYDPDR 4924356

4924355 FAYGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWSHLLRNFELE 4924206

4924205 LISPFPEIDWNAMVVGVKDKVMVRYKRRELSVN* 4924104

>CYP51G5

Scaff LG_III (+)14476000-14477787

83% to Arab. 51G1 95% to Scaff LG_I CYP51G1 seqF.

eugene3.00031308|Poptr1 gene model correct

FKBP-type peptidyl-prolyl cis-trans isomerase downstream

$

14476000 MTKDTDNKFLNVGLLILATLLVAKLISALIMPRSQKRLPPVMKGWPLIGG 14476149

14476150 LIRFLKGPIVMLREEYPKLGSVFTVNLANWKITFLIGPEVSAHFFKASEA 14476299

14476300 DLSQQEVYQFNVPTFGPGVVFDVDYSIRQEQFRFFTESLRVSKLKGYVDQ 14476449

14476450 MVVEAE 14476467 (0)

14476789 DYFSKWGDSGVVDIKYELEHLIILTASRCLLGREVRDKLFDDVSALFHD 14476935

14476936 LDNGMLPISVLFPYLPIPAHRRRDRARKKLAEIFANIINSRKLAGKSEND 14477085

14477086 MLQCFIDSKYKDGRPTTESEITGLLIAALFAGQHTSSITSTWTGAYLLRH 14477235

14477236 NEYLSAVLEEQKNLMKKHGNKVDQDILSEMGVLHRCIKEALRLHPPLIML 14477385

14477386 LRSSHSDFSVTTRDGKEYDIPKGHIVATSPAFANRLPHVFKDPERYDPDR 14477535

14477536 FAAGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWSHLLRNFEFE 14477685

14477686 LISPFPETDWNAMVVGVKDKVMVRYKRRELSVN* 14477787

 

<CYP71 clan 22 families, 71, 73, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 89,

92, 93, 98, 701, 703, 705, 706, 712, 736

66 sequences

29 nearly complete CYP71 like sequences and some related partial seqs.

 

<71B subfamily sequences (10 seqs all named)

three full length sequences 71B38, 71B40 and 71B41 are all about 97% identical

this is too similar for the genome duplication date.

>CYP71B41-de1b

LG_VIII (-) 12482829-12482572

71B like 100% to LG_VIII.4 LG_VIII.10 LG_VIII.16

eugene3.00081725|Poptr1 N-term only

$

12482829 MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12482680

12482679 QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSA 12482572

>CYP71B44P

LG_I (+) 19811612-19812590

71B like pseudogene 51% to 71B36 53% to 71B41

fgenesh1_pg.C_LG_I001954 [Poptr1:64602] model short exon 1

$

19811612 MACHDPLIMWSLPLVLFFSLLMFLLIRKKQNKQQIPPTPPRLPIIGNLHQLGDLS 19811776

19811777 QRSLWQLSKKYGPVILLKLGAVPAVVISSAEAAKEVLKTNDLHACSRPLL 19811926

19811927 AGTGRLSYNYSDVSFTYTYGDYWRKM*KICVLELCSARRVQSFLF 19812061

19812135 IREEEVALLIDTISAYSFSATPVDLSEKILSFTANITCRAAFGK 19812266

19812267 SFQEIKGFDGKRFEEVIREASAILASFSAADFFPKDGWIIERLTG 19812401

19812402 LLHSRLERSFRELDVLYRRVIDDHIKLEE 19812485

19812486 EEKEDIVGGPLKL*RDQTEFGTIQLTHDHIKAKLM 19812590 (0)

>CYP71B40v1

LG_VIII.4 (-) 12489221-12487004

71B like 53% to 71B36 97% to LG_VIII.16

eugene3.00081726|Poptr1 gene model short at N-term

$

12489221 MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12489072

12489071 QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDVAF 12488922

12488921 CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKILTLELFSLKRVQSFRF 12488772

12488771 IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12488622

12488621 FDRDKFHEVVHDTVAVVGSISADESIPYLGWIVDRLTGHRARTERVFHEV 12488472

12488471 DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKEQTELGASQFTKDNIKAILL 12488316 (0)

12487621 NLFMAGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 12487472

12487471 DQLEYLRMVIKETLRLHPPGPLLIPRETMSHCKVSGHNIYPKMLVQINVW 12487322

12487321 AIGRDPRYWKDPEEFFPERFLDRSIDYKGQSFEYLPFGSGRRICPGMHMG 12487172

12487171 SITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVP 12487022

12487021 VNYLP* 12487004

 

>CYP71B40v2

scaffold_994 (+) 9952-10569

CYP71B40 100% match exon 2

fgenesh1_pg.C_scaffold_994000003|Poptr1 duplicate seq

$

 9952 NLFMAGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 10101

10102 DQLEYLRMVIKETLRLHPPGPLLIPRETMSHCKVSGHNIYPKMLVQINVW 10251

10252 AIGRDPRYWKDPEEFFPERFLDRSIDYKGQSFEYLPFGSGRRICPGMHMG 10401

10402 SITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVPVNYLP* 10569

>CYP71B41

LG_VIII.16 (-) 12479032-12476806

71B like 54% to 71B36

eugene3.00081724|Poptr1 gene model short at N-term

$

12479032 MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12478883

12478882 QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDLAF 12478733

12478732 CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKIVTLELFSLKRVQSFRF 12478583

12478582 IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12478433

12478432 FDRDKFHEVVHDTEAVAGSISADESIPYLGWIVDRLTGHRARTERVFHEL 12478283

12478282 DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKEQTELGASQFTKDNIKAILL 12478127 (0)

12477423 NLFLGGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 12477274

12477273 DQLEYLRMVIKETLRLHPPAPLLIPRETMSHCKVSGHNIYPKMLVQINVW 12477124

12477123 AIGRDPTYWKDPEEFFPERFLDSSIDYKGQSFEYLPFGSGRRICPGMHMG 12476974

12476973 FITMEIILANLLYCFDWVYPDGMKKEDINMEEKAGVSLTTSKKTPLILVP 12476824

12476823 VNYLQ* 12476806

>CYP71B38

LG_VIII.10 (-) 12547429-12545187

71B like 54% to 71B36 97% to LG_VIII.4

fgenesh1_pg.C_LG_VIII001676|Poptr1 gene model short on the N-term

$

12547429 MALYVVPLWLPLILLLALLLLFMKKMEVKRQSEQLLPPSPPKLPILGNLH 12547280

12547279 QLGSLPHQSLWQLSKKYGPVMLIRLGRIPTVVISSAEAAREVLKVHDLAF 12547130

12547129 CSRPLLAGTGRLTYNYLDIAFSPYSDHWRNMRKVLTLELFSLKRVQSFRF 12546980

12546979 IREEEVSLLVNFISESSALAAPVDLTQKLYALVANITFRMAYGFNYRGTS 12546830

12546829 FDRDKFHEVVHDTEAVAGSISADESIPYLGWIVDRLTGHRARTERVFHEL 12546680

12546679 DTFFQHLIDNHLKPGRIKEHDDMVDVLLRIEKDQTELGASQFTKDNIKAILL 12546524 (0)

12545804 NLFLGGVDTISLTVNWAMAELVRNPRVMKKVQDEVRKCVGNKGRVTESDI 12545655

12545654 DQLEYLRMVIKETLRLHPPAPLLITRETMSHCKVSGHNIYPKMLVQINVW 12545505

12545504 AIGRDPTYWKDPEEFFPERFLDSSIDYKGQSFEYLPFGSGRRICPGMHMG 12545355

12545354 FITMEIILANLLYCFDWVFPDGMKKEDINMEEKAGVSLTTSKKTPLILVP 12545205

12545204 VNYLQ* 12545187

>CYP71B43P

LG_X.28 (-)  6738703-6736564

71B like 52% to 71B36 86% to LG_VIII.4

fgenesh1_pg.C_LG_X000579|Poptr1 gene model wrong 2 frameshifts

possible pseudogene or two seq. errors.

$

6738703 MALYAVPLWLPLILLLPLLLLFMKRMKDAGQSEQLLPPGPP 6738581

6738581 KLPILGNLHQLSSLPHQSMWHLSKKYGPVMLLRLGQIPTVVISSAEAA 6738438

6738437 REVLKVHDLAFCSRPLLSGAGRLTYNYLDIAFSPYSDHWRNMRKIVTLEL 6738288

6738287 FSLKRVQSFRFIREEEVGFLVNSLSESSALAAPVDLTQKVYALVANITFR 6738138

6738137 VAYGFDYRGTTFDRDRFHEVVHDTEAVVGSISADEYVPYLG 6738015

6738015 MIVDWLTGHRARMERVFHELDTFFQHVIDNHLKPGRIKDHDDMIDV 6737878

6737877 LLRIEKEQTELGASQFTSDNIKAVLL 6737800 (0)

6737179 NLFLGGVDTSSLTVNWAMAELVRNPRVMKKVQDEVRKCVGKKGRVTEGDV 6737030

6737029 DQLEYLRMVIKETLRLHPPAPLLLPRETMSHCIVSGYNIYPKTLVHVNVW 6736880

6736879 AIGRDPKYWRDPEEFFPE 6736826

6736827 RFLDSSCDFNGQSFEYLPFGSGRRICPGIHMGSITVEIILSNLLHCFDWI 6736678

6736677 LPHGMQKEDINMEEKAGVSLAPSKKTPVILVPVNYLQ* 6736564

>CYP71B42P

LG_VIII.30 (-) 12460686-12458958

71B like pseudogene 95% to LG_VIII.31 54% to 71B24

eugene3.00081722|Poptr1 gene model short, frameshifts

$

12460686 MAFYILPLALLLLLLFPLPLILKKKQQ 12460608

12460608 KLYVLELFSLK 12460576

12460577 RVQSFRFIREEVISLLMNSISQSSSPATPVNLTQMLYSVFASIVFGMALG 12460428

12460427 KSFQGSDFHNERCRKSIHEAE 12460365 (small deletion here)

12460365 GHNRMLITGYDARIERVFLELDTLFQQVIDDHLKAERKERKEDIINVLLKMERDQT 12460198

12460197 ESSAPQLTKYNIKAVIL 12460147 (0)

12459572 NIFLGGVDTGAITVIWATAEIARNPIIMKKAQEEIRSSVGQKGRATEERT 12459423

12459422 DELQYLKMVIKETLRLHPPAPLLLPRETMSRCQINGYDIYLKTLIQVNAW 12459273

12459272 AIGRDPEYWRDSEEFFPERFVDSPIDYKGQRFEFLPFGSGRRVCPGILMG 12459123

12459122 VTMVELALANLLHCFDWKLPNAV 12459054

12459053 AINMEEAAGLTISKKNPLFLVRINYPQQAQPD 12458958 sequence gap here

>CYP71B39P

LG_VIII.31 (-) 12534486-12532739

71B like pseudogene 95% to LG_VIII.30 

eugene3.00081733|Poptr1 gene model short, frameshifts

$

12534486 LLLLLLFPLPLILKKKQQ 12534433

12534433 KLYVLELFSLK 12534401

12534402 RVQSFRFIREEVISLLMNSISQSSSPATPVNLTQMLYSVFASIVFRMALG 12534253

12534252 KSFQGSDFHNERCRKAIHEAE 12534190 (small deletion here)

12534190 GHNRMLITGYDARIERVFLELDTLFQQVIDDHLKAERKERKEDIINVLLKMERDQT 12534023

12534022 ESSAPQLTKYNIKAVIL 12533972 (0)

12533397 NIFLGGVDTGAITVIWATAEIARNPIIMKKAQEEIRSSVGQKGRATEERT 12533248

12533247 DELQYLKMVIKETLRLHPPAPLLLPRETMSRCQINGYDIYLKTLIQV 12533107

12533107 NAWAIGRDPEYWRDSEEFFPERFVDSPIDYKGQRFEFFPFGSGRRVCPGI 12532958

12532957 VMGVTMVELALINLLYCFDWKLPNAV 12532880

12532879 DINMEEAAGLTISKKMPLFLVPINYPQRAQPDKMSRTSLLSKHTCS* 12532739

>CYP71B45P

LG_X (-) 6735316-6733923

71 like pseudogene new 75% to CYP71B42P

fgenesh1_pg.C_LG_X000578|Poptr1 71 like I-helix + C-term (pseudogene?)

downstream of fgenesh1_pg.C_LG_X000579

$

6735316 ILSLGRTPTLVVSSAEAARAVLKTHDLDCCCRPRLSGSGRLTYNHVYVAF 6735167

6735166 APYGDYW*EMRKLFVLEPFILKRVQSFRFITGEVARIMNSIPQSSS 6735029

6734867 PYAG*ILDKVTDHHARIERVLH 6734802

6734482 NIFLGGVHAGAITVIWALEELAWNPRTMKKAQDEIRNSVGKKGRLAEESI 6734333

6734332 DEL 6734324

6734322 TLVIKETLRWQ 6734290

6734288 PPAPLLLPRETMSHCKINGYHIYPKILIQINV*AIGSDPTYWNDS*EFF 6734142

6734141 PERFVDSSID*KGQHFEFLPFGSGRRGCPGILMGVTMVELALANLLYCLDW 6733989

6733988 KSAKAIDINMEEAAGLTISKKM 6733923

 

<71D subfamily 40 sequences all named

 

>CYP71D38-de2b

scaffold_710 (-) 1341-893

pseudogene 89% to 726B1 J-helix to end

eugene3.07100001|Poptr1 gene model short

$

1341 VIKNPRVLEKAQKEGRQVFND 1279

1276 LGTIPDETSLHDSKFLKLIIKETLRLHPPAPLMIPIECRKRYNVNGYDTHVKSKVLINAWAIGRDP 1079

1078 NYWNELERFYPERFINVSTDFKGIDFEFIPFGAGKRMCPGKL 953

 953 VKRIRDLKLIPV 918

 916 SYRSLVG* 893

>CYP71D38

scaffold_710 (-) 19864-18128

726A like 50% to 726A1 euphorbia

eugene3.07100003|Poptr1

$

19864 MLISLPVFLTILLVISILWTWTKFIKSNKSSSNPPPGPWKLPFIGNLHQL 19715

19714 VHPLPHHRMRDLAKKFGPVMQLQVGEVSTVIISSSEAAKEVMKTHEINFV 19565

19564 ERPHLLAASVLFYNRKDIAFAPYGEYWRQLRKISILELLSAKRVRSFKSI 19415

19414 REEEVSNFIASIYSKEGSPINLSRMIFSLENGITARTSIGNKCKNHEGFL 19265

19264 PIVEELAEALGGLNMIDIFPSSKFLYMVSRVRSRLERMHREADEILESII 19115

19114 SERRANSALASKMGKNEEDDLLGVLLNLQDHGNLEFQLTTSTIKAVIL 18971 (0)

18748 EMFSGGGDTSSTALEWAMSELIKNPRVMEKAQKEVRQVFNDLGTIPDETS 18599

18598 LHDLKFLKLIIKETLRLHPPVPLIPRECRKRCDVNGYDIHVKSKVLINAW 18449

18448 AIGRDPNCWNEPERFYPERFINVSTDFKGSDFEFIPFGAGKRMCPGMLFA 18299

18298 TANTEFPLAQMLYHFDWKPAGGLKPENLDMTESFGGAVKRKQDLKLIPIS 18149

18148 YRSLVG* 18128

>CYP71D38-de2c

scaffold_710 (-) 21263-20928

pseudogene 79% to 726B1 C-term

$

21263 NDLGTILDETS 21231

21213 LKLITEETLRLHPSAPLIPREWRKRCQVNGYD 21118

21117 NIHVKSKVLINAWA 21076

21085 CMGMLFAIA 21059

21059 HHFDWKPIDGLKPENLDMTESLGGATKRKRDLKLIFISYRSLVG 20928

>CYP71D23P

LG_XV.1 (+) 7113294-7114921

71D like possible pseudogene 46% to 71D10 57% to LG_VII.22

fgenesh1_pg.C_LG_XV000688|Poptr1 gene model wrong 2 frameshifts and a stop codon

$

7113294 MEQHFPLFAIFLTFLLFIFMVLRMRKKSETNKYLTTNPPPGPWKLPLVGN 7113443

7113444 IHHVAGHQIHHRFTDLARKYGPVMQILLGEVRFVVISSRETAKEVMKTNE 7113593

7113594 NIIVDRPDGVIPRIVFYNGKAISFTPYGEYWKQLRKSCSSKLLSPQCVRS 7113743

7113744 LIRSTMEEEVSDFVTSISSKEGSPINLSKMLFTLTFGLISRVILGKKGKN 7113893

7113894 QALLSSIEEWKQGGAGFDVADIFPSFKLFHSLGWARSKFVRQHQEIGEML 7114043

7114044 ETVINERRASKIRTKTSEHEIEEDFLDVLVNMQHALR 7114154

7114156 NLEFTNDNIKAILL 7114197 (0)

7114292 EFFLAGSDSSSAVMEWAMSEMLKNPRHMKRAQKEVRVVFTKMGNDDETRL 7114441

7114442 HELKYLQLIIKETTRLHPPAPLILRACREACKINGHDIPDRSNVMINAWA 7114591

7114592 IGRDPTYWNEA* 7114627

7114628 KFNPERFLDSSIDYMGTNFEFIPFGAGKRKCPGMAFGLAIVEMALAKLLY 7114777

7114778 IFDWKLCDGVKNEDLNMKEDTALGSTVKRKHELYLIPIPYHPSSPAK* 7114921

>CYP71D41

LG_VII.22 (+) 5888964-5891028

71D like 50% to 71D4, 75% to LG_VII.12

fgenesh1_pg.C_LG_VII000682|Poptr1 gene model correct

$

5888964 MEFPILLASLLFIFAVLRLWKKSKGNGSTLALPPGPWKLPLIGNIHQLAG 5889113

5889114 SLPHHCLTDLAKKYGPVMQLQIGEVSTVVVSSGEAAKEVMKTHEINFVER 5889263

5889264 PCLLVANIMFYNRKNIGFAPYGDYWRQMRKVCTLELFSAKRVRSFRSVRE 5889413

5889414 EEVSNFIRNIYAKAGSPINLSKMMLDLSNGVIARTSIGKKSKNQEAFLPI 5889563

5889564 IEDVAEALAGLNIVDVFPSAKFLYMISKLRSRLERSHIEADEILENIINE 5889713

5889714 RRASKEERKTDQDNEVEVLLDVLLNLQNQGNLEFPLTTDSIKAIIV 5889851 (0)

5890402 EMFGAGSETTSTLLEWSMSEMLKNPRVMKKAQEEVRQVFSDSENV 5890536

5890537 DETGLQNLKFLKLIIKETLRLHPPISLIPRECSKTCEINGYVIQAKSKVI 5890686

5890687 INAWAIGRDSNDWTEAEKFYPERFQDSSIDYKGTNFEFIPFGAGKRMCPG 5890836

5890837 MLFGIGNAELLLARLLYHFDWKLSSGAALEDLDMNEAFGGTVKKKHYLNL 5890986

5890987 IPIPYGPCPLPVE* 5891028

 

>CYP71D42

LG_VII.3 (-) 5804810-5802796

71D like 49% to 71D4 99% to LG_VII.27

note these two gene names were both assigned to the same sequence

eugene3.00070713|Poptr1 gene model seems correct

$

5804810 MDQVFQFIYILIVPFLLLIFPVLRLWKKSQGNNSSTPPPPPGPWKLPLIGNLHQLL 5804643

5804642 GSLPHQVLRDMANKYGPVMQLQIGEVPTVIISSPEAAKEAIKTHEINFVD 5804493

5804492 RPCLLVAKVMFYNSKDIAFAPYGDYWRQMKKVCVLELLSAKRVKSFRSIR 5804343

5804342 EEEVSNFMRTIYSKAGSPINLSKMMFDLLNGITARASVGKKYKHQEAFLP 5804193

5804192 IIEQVIEAMGGTNIADVFPSSKLLYMISRFRSRLERSHQDADVILENIIY 5804043

5804042 EHRVRREVAKTDEESEAEDLLDVLLNLQNHGDLGFPLTTDSIKATIL 5803902 (0)

5803416 ELFTAGSDSSSTLMEWTMSEMLRNPRVMRKAQEEVRQVFSNTEDVDETCL 5803267

5803266 HNLEFLKLIIKETLRLHPPAPFIPRECNKTCEINGYVIQAKSKVMINAWA 5803117

5803116 IGRDSDHWTEAEKFYPERFLDSSIDYMGTNFEFIPFGAGKRMCPGILFGI 5802967

5802966 ATVELPLAQLLYHFDWKLPNGDLSEDLDMNEVFVGTVRRKHQLNVIPIPF 5802817

5802816 YPSPLQ* 5802796

 

>CYP71D43

LG_VII.12 (-) 5753477-5751477

  71D like 48% to 71D4 94% to LG_VII.3

estExt_fgenesh1_pg_v1.C_LG_VII0660|Poptr1 gene model seems correct

$

5753477 MEQVFQFIQILVPFLLLIFTVLRLWKKSQGNNSSTPPPP 5753361

5753360 PPPPPGPWKLPLIGNLHQLLGSLPHQVLRDMANKYGPVMQLQIGEVPTVI 5753211

5753210 ISSPEAAKEAMKTQEINFVDRPCLLVAKVMYYNSKDIGFAPYGDYWRQMK 5753061

5753060 KVCVLELLSAKRVKSFRSIREEEVSNFIRAIYSRAGSPINLSKMMFDLLN 5752911

5752910 GITARASVGKKYKHQEAFLPIIEQVIEAVGGTNIADVFPSSKLLYMISRF 5752761

5752760 RSRLERSHQDADVILENIIYEHRVRREVAKTDEESEAEDLLDVLLNLQNH 5752611

5752610 GDLGFPLTTDSIKATIL 5752560 (0)

5752097 ELFAGGSDTSSTLMEWTMSEMFRNPRVMRKAQEEVRQVFSNTENVDETCL 5751948

5751947 HNLEFLKLIIKETLRLHPPVPFIPRECNKTCEINGYVIQAKSRVMINAWA 5751798

5751797 IGRDSDHWTEAEKFYPERFLDSSIDYKGTNFDFIPFGAGKRMCPGILFGI 5751648

5751647 ATVELPLAQLLYHFDWKLPNGDLLEDLDMNEVFGGTVRRKHQLNLIPIPF 5751498

5751497 YPSPLQ* 5751477

>CYP71D24P

scaffold_228 (-) 35436-34671

71D like 2 copies exon 1 pseudogene

eugene3.02280006|Poptr1

$

35436 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQLASSATMP 35266

        35266 HYLCAHWARKYG 35231

                               35234 RTPPLGPWKLPLIGNIHQLASSATMP 35157

35180 LASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVIISSPDAAKEVLKT 35040

35039 QEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRKACIWGLFSATRK 34890

34889 LSFRSIREEEVSNLISSIRSKEGSPINLRELLLDLSNETITRTSIGKKCK 34740

34739 NKARFLHTIEQVSKSVGGVNIFL 34671

>CYP71D25Pv1

scaffold_228 (-) 39599-38903

71D like 62% to LG_VII.22 41% to 71D4

exon 1 pseudogene

eugene3.02280007|Poptr1 100% to scaffold_228  (-)    34668-35436

$

39559 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQ 39413

39412 LASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVIISSPDAAKEVLKT 39272

39271 QEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRKACIWGLFSATRK 39122

39121 LSFRSIREEEVSNLISSIRSKEGSPINLRELLLDLSNETITRTSIGKKCK 38972

38971 NKARFLHTIEQVSKSVGGVNIFL 38903

>CYP71D25Pv2

scaffold_1911 (+) 7175-8184

71V like 37% to 71V5

fgenesh1_pg.C_scaffold_1911000001|Poptr1 contains a duplication from exon 1

identical in seq to each other in overlap region

almost identical to 71D25P  2 aa diffs probable duplicate sequence

$

7175 MQQEIPLVLGFLLLVFAVLRLGKKSKGHDSTRTPPPGPWKLPLIGNIHQLASSATMPHYLCAHWAKKYG 7381

7381 TPPPGPWKLPLIGNIHQLASSATMPHYLCAHWAKKYGPIMQIQIGEVPTVII 7536

7537 SSPDAAKEVLKTQEINFAERPALLVSEIMLYNGQGMSFAKFGDHWKLMRK 7686

7687 ACIWGLFSATRKLSFRSIREEEVSNLISSIRSKAGSPINLRELLLDLSNE 7836

7837 IITRTSIGKKCKNKARFLHTIEQVSKSVGGVNIVDLFPSARLVHMISNMT 7986

7987 SSLQRLHEETDQMLEDIINERRASRVEKKTGENKIEAGDDLLDVLLNLQD 8136

8137 DGNFKVKTDSIKSIIL 8184 (0)

>CYP71D36Pv1

LG_I (-) 29491328-29490406

71D like EXXR 61% TO CYP71D26

eugene3.00012614|Poptr1 mid regioN to EXXR 71D like

$

29491328 DTVDVLLNL*GQADLEFTLTTKNIKAIIL 29491242 (0)

29490942 DMFVAGSETSSRTVEWAK 29490889

29490889 TELAKHPKVMEKAQAEARQVFANVDEAGLHKLDHLQLLIKETL 29490761

29490759 NIPPIPLLFPRESKEACKITGYDMPAQSKVTVNIWGIGREPINWTEPERFYP 29490604

29490603 ERFLDSSMDYKGIDFKFIPFGAG 29490535

29490534 ILFGMATYVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS* 29490406

>CYP71D36Pv2

scaffold_1517 (-) 9207-8703

3 aa diffs to 71D36P, duplicate seq.

eugene3.15170002|Poptr1 gene model wrong

$

9207 LNEMAKHPKVMEKAQAKARQVFANVDEAGLHKLDHLQLLIKETL  9076

9074 NIPPIPLLFPRESKKACKITGYDMPAQSKVTVNIWGIGREPINWTEPERFYP 8919

8918 ERFLDSSMDYKGIDFKFIPFGA 8853

8852 GILFGMATVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS*DEK*SLH 8703

>CYP71D35P

LG_I (-) 29497891-29497622

71B like pseudogene no model exists

98% to scaffold 1517, 93% to scaffold 6967, 47% to 71B18

$

29497891 KVTVNIWRIGREPINWTEPER 29497829

29497826 FYPERFLDSSMDYKGIDFKFIPFGAGILFGMAT 29497728

29497726 TVVLPLAQLLCFLDWIPPNGLRSADLVTSQVFGSS 29497622

>CYP71D36Pv3

scaffold_6967 (-) 2023-1561

100% to 71D36Pv2

eugene3.69670001|Poptr1 gene model wrong

$

2023 LNEMAKHPKVMEKAQAKARQVFANVDEAGLHKLDHLQLLIKETL 1892

1890 NIPPIPLLFPRESKKACKITGYDMPAQSKVTVNIWGIGREPINW 1759

1758 TEPERFYPERFLDSSMDYKGIDFKFIPFGAGILFGMATVVLPLAQLLCFL 1609

1608 DWIPPNGLRSADLVTS 1561

>CYP71D34

LG_I.29 (-) 29516258-29514652

71D like  new 52% to 71D10 93% to LG_I.2

eugene3.00012615|Poptr1 gene model short one frameshift

$

29516258 MEQLQTPPSLVLLPSLLFIFMVLRMLKKSKSKDLTPNLPPGPRKLPVIGN 29516109

29516108 LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMKVHD 29515959

29515958 INFAHRPHLPVGQIIFYNCTDIATA 29515884

29515882 AAYGDYWRQLRKVSILELLSPKRVQSFRSIREEEVSSLIGSISSSAGSII 29515733

29515732 NLSRMLFSVAYNITTRAAFSKLRKEEEIFVPLVQGIIQVGAGFNISDLFP 29515583

29515582 SIKLIPWITGMRSRMERLHQEADRILESIINDHRARKAEGNSSNESKADN 29515433

29515432 LVDVLLDLQEHGNLDFSLTTDNIKAVIL 29515349 (0)

29515263 DIFIAGTETSSTILQWAMSELLKHPEVMEKAQTEVREAFGKDGSVGELNY 29515114

29515113 LKMVIKETMRLHPPLPLLLPRECREECGINGYNIPIKSRVLVNVWAIGRD 29514964

29514963 SDYWVEAERFHPERFLDSSIDYKGVNFEFTPFGAGRR 29514853

29514852 MCPGILFGISNVDLLLANLLYHFDWKLPGDMEPESLDMSEAFGATVRR 29514709

29514708 KNALHLTPILHHPHPVRS* 29514652

>CYP71D44

scaffold_11610 (-) 1395-784

71D LIKE 96% to 71D34

eugene3.116100001|Poptr1 gene model wrong exon 2 only

$

1395 DLFIAGTETSSTILEWAMSELLKHPEVMEKAQTEVREVFGKDGSVGELNY 1246

1245 LKMVIRETMRLHPPLPLLLPRECREECGINGYNIPIKSRVLVNVWAIGRD 1096

1095 SNYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRRMCPGILFGISNVD 946

 945 LLLANLLYHFDWKLPGDMKLESLDMSEAFGATVRRKNALHLTPILHQPHPVRS* 784

>CYP71D33P

LG_I (-) 29524379-29524047

71B like pseudogene

eugene3.00012616 [Poptr1:550175] model short

$

29524379 VWAIGRDSDYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRR 29524248

29524247 MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGATVRR 29524104

29524103 KNALHLTPILHHPHPVRS* 29524047

>CYP71D32P

LG_I (-) 29530020-29529344

71B like pseudogene

eugene3.00012617 [Poptr1:550176]

$

29530020 MEQPQIPSCLVLLPSLLFIFMVLRMLKKSKTKDLTPNLPPGPRKLPVIGN 29529871

29529870 LHQLFGSLPHHRLRDLAEKHGPIMHLQLGQVQTIVISSPETAEQVMK 29529730

 

29529676 VWAIGRDSNYWVEAERFHPERFLDSAIDYKGVNFEFTPFGAGRR 29529545

29529544 MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGATVRR 29529401

29529400 KKALHLTPILHHPHPVRS* 29529344

>CYP71D31P

LG_I (-) 29532948-29532496

71B like pseudogene C-term

eugene3.00012618|Poptr1

$

29532948 VIRETMRLHPPLPLLLPRECREECGINGYNIXIKSRVLVNAWAIGRDSNY 29532799

29532798 WVEAERFHPERFLDSSIDYKGVNFEFTPFGAGRR 29532697

29532696 MCPGILFGISNVDLLLANLLYHFDWKLPGDMKPESLDMSEAFGAAVRR 29532553

29532552 KNALHLTPILHHPHPVRS* 29532496

>CYP71D30P

LG_I (-) 29540735-29539563

71B like PSEUDOGENE 2 models are from same gene

eugene3.00012620 [Poptr1:550179] gene start

eugene3.00012619 [Poptr1:550178] gene end

$

29540735 MEQLQIPTSLVLLSSLLFIFMVLRILKKSKTKDFTPNLPPGPRKLPVIGN 29540586

29540585 LHQLFCSLPHHRLR 29540544

(sequence gap)

29539973 LLPRECREECGINGYNIPIKSRVLVNVWAIGRDSNYWVEAERFQPERFLD 29539824

29539823 SSIDYKGVNFEFTPFGAGRR 29539764

29539763 MCPGILFG