This file still needs some work to make it look pretty and to get rid of confusing

older names that are present after making some name revisions.  The name to the left

after the > is the correct name.  All soybean P450 sequences are included here.

 

The names match the excel file of soybean P450s by name order.

I have not updated the chromosome order excel file, so be careful.

 

David Nelson

 

Last modified June 25, 2009

Last modified Oct. 9, 2009 added 12 pseudogenes

 

 

>CYP51G1     Glycine max (soybeans, Fabales)

            DQ340249

            Li,L.Y. and Yu,D.Y.

            Comprehensive analysis of putative P450 genes superfamily in

            Glycine max and Medicago truncatula

            Unpublished

                scaffold_72:3448996..3446386 (- strand)

chr7 13905939-13903329 (-) strand Glyma07g14460.1

3448996  MEIDSRFLNTGLLLVATILVAKLISAFIVPKSRKRVPPIVKGWP

LIGGLIRFLKGPIFMLRDEYPKLGSVFTLKLFHKNITFLIGPEVSAHFFKASETDLSQ

QEVYQFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRVNKLKGYVNQMVAEAE (0)

DYFSKWGPSGEVDLKYELEHLIILTASRCLLGREVRDKLFDDVSALFHDLDNGMLPISVLFPYL

PIPAHKRRDQARKKLAEIFASIITSRKSASKSEEDMLQCFIDSKYKDGRSTTEAEVTG

LLIAALFAGQHTSSITSTWTGAYLLSNNQYLSAVQEEQKMLIEKHGDRVDHDVLAEMD

VLYRCIKEALRLHPPLIMLMRSSHTDFSVTTREGKEYDIPKGHIIATSPAFANRLGHV

FKDPDRYDPDRFAVGREEDKVAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNF

ELELVSPFPEIDWNAMVVGVKGKVMVRYKRKELSVNQ* 3446386

 

>CYP51G8 Gm0073x00075 scaffold_73:1407411..1410201 (+ strand)

No good boundary at MLIE/KKHG but 5 more aa can be added to get a (2) boundary

7 other aa diffs, possibly a pseudogene.  One extra intron

chr3 34351408-34349121 (-) strand Glyma03g26820.1

1407651  MEIDGRFLNTGLLLVATILVAKLISAFIVPKSRKRVPPIVKGWPLIGGLIRFLKGPIFMLREEYPKLG

         SVFTLKLFHKNITFLVGPEVSAYFFKASETDL  1407950

1407951  SQQEVYQFNVPSFGPGVVFDVDYSVRQEQFRFFTEALRVNKLKGYVNQMVAEAE (0) 1408112

1408938  DYFSKWGPSGEVDLKYELEHLIILTASRCLLGREVRDKLFDDVSALFHDLDNGMLPIS

         VLFPYLPIPAHKRRDQARKKLAEIFASIITSRKSASKSEED  1409234

1409235  MLQCFIDSKYKDGRSTTEAEVTGLLIAALFAGQHTSSITSTWTGAYL

         LSDNQCLSAVQEEQKMLIESMGTE  (2) 1409432

1409429  KHGDRVDHDVLAEMDVLYRCIKEALRLHPPLIMLMRSSHTDFSVTTREGKEYDIPKG

         HIIATSPAFANRLGHVFKDPDRYDPDRFAVGREEDKVAGAFSY             1409731

1409732  ISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFELELVSPFPEI

         DWNAMVVGVKGKVMVRYKRKELSVNQ* 1409938

 

>CYP51G9P chr18 50% to 51G1

61387599    LPLKSILCIYQNSFELELVSPFPEVN*NIMVVGLKGKVLLKNMHRDYS                61387742

 

>CYP51G10P chr2 no gene model 53% to 51G8

7912267  LLLKPILCIYQNSFELELVSPFTEVN*NTMVAGVKGKVVLK  7912145

 

>CYP71A10    Glycine max (soybean)

            GenEMBL AF022157 (1838bp)

            Siminszky,B., Dewey,R.E. and Corbin,F.T.

            capable of catalyzing the metabolism of phenylurea herbicides

MALLSSVLKQLPHELSSTHYLTVFFCIFLILLQLIRRNKYNLPP

SPPKIPIIGNLHQLGTLPHRSFHALSHKYGPLMMLQLGQIPTLVVSSADVAREIIKTH

DVVFSNRRQPTAAKIFGYGCKDVAFVYYREEWRQKIKTCKVELMSLKKVRLFHSIRQE

VVTELVEAIGEACGSERPCVNLTEMLMAASNDIVSRCVLGRKCDDACGGSGSSSFAAL

GRKIMRLLSAFSVGDFFPSLGWVDYLTGLIPEMKTTFLAVDAFLDEVIAEHESSNKKN

DDFLGILLQLQECGRLDFQLDRDNLKAILVDMIIGGSDTTSTTLEWTFAEFLRNPNTM

KKAQEEVRRVVGINSKAVLDENCVNQMNYLKCVVKETLRLHPPLPLLIARETSSSVKL

RGYDIPAKTMVFINAWAIQRDPELWDDPEEFIPERFETSQVDLNGQDFQLIPFGIGRR

GCPAMSFGLASTEYVLANLLYWFNWNMSESGRILMHNIDMSETNGLTVSKKVPLHLEP

EPYKT

 

>CYP71A10 scaffold_157: 676518..679205 (+) strand

chr6 14848477-14851164 no gene model

676518  MALLSSVLKQLPHELSSTHYLTVFFCIFLILLQLIRRNKYNLPPSPPKIPIIGNLHQL

        GTLPHRSFHALSHKYGPLMMLQLGQIPTLVVSSADVAREI        676811

676812  IKTHDVVFSNRRQPTAAKIFGYGCKDVAFVYYGEEWRQKIKTCKVELMS

        LKKVRLFHSIRQEVVTELVEAIGEACGSERPCVNLTEMLMAASNDIV        677099

677100  SRCVLGRKCDDACGGSGSSSFAALGRKIMRLLSAFSVGDFFPSLGWVD

        YLTGLIPEMKTTFLAVDAFLDEVIAEHESSNKKNDDFLGILLQLQECGRLD            677396

677397  FQLDRDNLKAILV (0)  677435

678582  DMIIGGSDTTSTTLEWTFAEFLRNPNTMKKAQEEVRRVVGINSK

        AVLDENCVNQMNYLKCVVKETLRLHPPLPLLIARETSSSVKLRGYDIPAKTMVFIN  678881

678882  AWAIQRDPELWDDPEEFIPERFETSQVDLNGQDFQLIPFGIGRRGCPA

        MSFGLASTEYVLANLLYWFNWNMSESGRILMHNIDMSETNGLTVSKKVPLHLEPEPYKT* 679205

 

>CYP71A33    Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C20

            79% to 71A10, same as EST AI496547

MAFLSSVLKQLAYEPSSTHYLTAFFCFVSLLLMLKLTRRNKSNFPPSPPKLPIIGNLHQL

GTLPHRSFQALSRKYGPLMMLQLGQTPTLVVSSADVAREIIKTHDVVFSNRPQPTAAKIF

LYNCKDVGFAPYGEEWRQTKKTCVVELLSQRKVRSFRSIREEVVSELVEAVREACGGSER

ENRPCVNLSEMLIAASNNIVSRCVIGRKCDATVGDSVNCSFGELGRKIMRLFSAFCVGDF

FPSLGWVDYLTGLIPEMKATFLAVDAFLDEVIAERESSNRKNDHSFMGILLQLQECGRLD

FQLSRDNLKAILM (0)

 

>CYP71A33 Gm0157x00067:peptide scaffold_157:686782..689402 (+ strand)

Glyma06g18560.1 Gm06:14858479..14861103 (+ strand)

MAFLSSVLKQLAYEPSSTHYLTAFFCFVSLLLMLKLTRRNKSNFPPSPPKLPIIGNLHQLGTLPHRSFQALSRKYGPLMM

LQLGQTPTLVVSSADVAREIIKTHDVVFSNRPQPTAAKIFLYNCKDVGFAPYGEEWRQTKKTCVVELLSQRKVRSFRSIR

EEVVSELVEAVREACGGSERENRPCVNLSEMLIAASNNIVSRCVIGRKCDATVGDSVNCSFGELGRKIMRLFSAFCVGDF

FPSLGWVDYLTGLIPEMKATFLAVDAFLDEVIAERESSNRKNDHSFMGILLQLQECGRLDFQLSRDNLKAIL (0)

MDMIIGGSDTTSTTLEWAFAELLRKPNTMKKAQEEIRRVVGINSRVVLDENCVNQMNYLKCVVKETLRLHSPVPLLVARETSSSVKLR

GYDIPAKTMVFINAWAIQRDPELWDDPEEFIPERFETSQIDLNGQDFQLIPFGSGRRGCPAMSFGLASTEYVLANLLYWF

NWNMSESGMLMHNIDMNETNGLTVSKKIPLHLEPEPHIP*

 

>CYP71A34 CYP71A40

4583-4586k+ Gm0069x00326:peptide partial 54% to 71A33

chr4 11168538-11170517 (+) strand Glyma04g12180.1

MASIQWPYEQLKTAFSVSTFHQYLFLFLLVIV

VLKLTRRPKIKPSFNLPPSPRKLPIIGNLHQLSKLPYHSLRTLSQKHGSLMLLQL

GQTRALVVSSPDAVREIMKTHDITFSNRPKTTAAKTLLYGCNDIGFASYGESWKHKRKIC

VLELLSPKRVQSLSLIREEEVAELINKIREASLSDASSSVNLSELLIETTNNIICKCALG

KKYSTEDCHSRIKELAKRAMIQLGVVTVGDRFPFLGWVDFLTGQIQEFKATFGALDALFD

QVIAEHKKMQRVSDLCSTEKDFVDILIMPDSELTKDGIKSILL (0)

DMFVAGSETTASALEWAMAELMKNPMKLKKAQDEVRKFVGNKSKVEENDINQ

MDYMKCVIKETLRLHPPAPLLAPRETASSVKLGGYDIPAKTLVYVNAWAIQRDPEFWERP

EEFIPERHDNSRVHFNGQDLQFITFGFGRRACPGMTFGLASVEYILANLLYWFNWKLPAT

HTSGQDIDMSETYGLVTYKKEALHLKPIPFFL*

 

>CYP71A34-de1b CYP71A40-de1b 71% to 71A34

chr4 11166176-11166347 (+) strand no gene model

         LSYLAHCHSFITLSQKHGSMVL*QLGLTRALVVLSAGC  4580389

4580385  DVVREIMKTHDITFSKRPKIT     4580447

 

>CYP71A35P CYP71A43P scaffold_47 80% to 71A10 723-724k-

chr4 42894-42895k+  Glyma04g36340.1 gene model is wrong

42894308 GDFFPSLDWVDYLTDLILEMKTTFLAVDAFLDEIIVEHESNNKKNDDFLGILLQLQECGR

LDFQHVRDNLKTILM

MIIGGSDTTSTTLEWTFA*LLRNPNTMKKAQEE

SRRVVGTNSRVVLDENCVNQMNYLKCVVRETLRLHPPVPLLVA*E

TSSSVKLRGYHTTTKIMVFINASTIQRDTKLWDDPGEFIPKRFETNQVDFNGQDFQLISF

SIGRKGCPTMSFGLASAQYVLSNLLYWFN*KMPKFGILLMHDADMSETNGLTVNKKIQLH

LVP*PYKT 42895349

 

>CYP71A36P CYP71A42P scaffold_47 719-720k- 75% to 71A33

chr4    Gm04:42898450..42900611 (+ strand)

Glyma04g36350.1 model is wrong

42898361 MAPPPSVLKLLSHELSSTN*FLSVFFCFLSLLFLLKLAKRNKFNLPPSPPKLPIIGNLHQ

LGTLPHRSFHALSRKYGPLMLLQLGQIPTLVVSSAEVAREIIKKHDIAFSNRPQSTAAKI

NSNDVDFSNYDEEWRQKKNTCVVEPLSQKKVRSFRSIQEEVVAELVEGVREACG-SERE-

RPCVNLTEMLIAASNNIVSRCVHGRKCDDRIGGGGGSSCSFGVLGRKVMRLLSAFSVGDF

FP*LGWVDSLTGLIPEMKAMSVTIDAFFDEVIAEHE

NMKNDESDVEDFVGILLHQLQE

CGKLDFELTRDNLKGILV

DMIIGGSYTTSTTLEWVFADLI 42899986

 

>CYP71A37 CYP71A45

chr5 92% to 71A43 Glyma05g02730.1 Gm05:2090956..2093112 (- strand)

Glyma05g02730.1:peptide 89% to 71A43 Gm05:2090956..2093112 (- strand)

MALRSVFFYLLSISFFLHQTKPETNLKLPPSPPKIPIIGNIHQFGTLPHRSLRDLSLKYGEMMMLQLGQMQTPTLVVSSV

DVAMEIIKTYDLAFSDRPHNTAAKILLYGCADVGFASYGDKWRQKRKICVLELLSTKRVQSFRAIREEEVAELVNKLREA

SSSDASYVNLSEMLMSTSNNIVCKCALGRSFTRDGNNSVKNLAREAMIHLTAFTVRDYFPWLGWIDVLTGKIQKYKATAG

AMDALFDTAIAEHLAEKRKGQHSKRKDFVDILLQLQEDSMLSFELTKTDIKALLT (0)

DMFVGGTDTTAAALEWAMSELVRNP

IIMKKVQEEVRTVVGHKSKVEENDISQMQYLKCVVKETLRLHLPTPLLPPRVTMSNVKLKGFDIPAKTMVYINAWAMQRD

PRFWERPEEFLPERFENSQVDFKGQEYFQFIPFGFGRRGCPGMNFGIASIEYVLASLLYWFDWKLPDTLDVDMSEVFGLV

VSKKVPLLLKPKTFPF*

 

>CYP71A38P CYP71A44P

Glyma05g02720.1:peptide     Gm05:2070754..2074103 (- strand)

84% to 71A43 missing N and C-terms

KTNLNLPPSPPKLPIIGNLHQLGTLPHRSLRDLSLKYGDMMMLQLGQRQTPTLVVSSAEVAMEIMKT

HDLAFSNRPQNTAAKILLYGCTDVGFALYGEKWRQKRKICVLELLSMKRVQSFRVIREEEVAELVNKLREASSSDAYYVN

LSKMLISTANNIICKCAFGWKYTGDGYSSVKELARDTMIYLAAFTVRDYFPWLGWIDVLTGKIQKYKATAGAMDALFDQA

IAKHLTGKTEGEQSKRK &

DLVDILLQLQEDSMLSFELTKNDLKALIT (0)

DMFIGGTDTTSSTLEWAISELVRNPIIMRKVQEEVR &

SIVGHKSNVEENDVTQMHYLKCVVKETLRLHPPTPLLAPRETMSSVKLKGYD

IPAETMVYINAWAIQRDPEFWESPEEFLPERFENSQVHFKGQEYFQFIPFGCGRRECPGI

NFGIASIDYVLASLLDWFD

 

>CYP71A39P CYP71A35P Gm0157 55% to 71A33

Glyma06g18550.1             Gm06:14843220..14853611 (+ strand) error 3 gene span

14843130-14843883 (+) strand

671177  MALHLPSFMKQVQIFYLN

671225  YATLFLLLSFISMLVAFKLTRRRRRSKLNLPSSPPRLQIIGNYHQLRKLPHRS  671383

671385  FQTLSQKHNPLLMLQLGQLPVWVVSSANLAREVMQTHDP

        VLASRPHLPATEILLYECKDVGHSSNGETWREKRKLCVNELLSMKRVRSVQFIREEEVEAL  671684

671685  VSYIRKACSVINLSEMLVTAS  671747

        NNIVCRFTFGSKYDA 14843883

 

>CYP71A40P CYP71A41P

scaffold_157:

Gm0157x00064   scaffold_157:660596..660246 (- strand)

Pseudogene 53% to 71D96, 54% to 83E14

Glyma06g18520.1 Gm06:14832208..14832556 (- strand)

660596  DTAGTDTTFITL

DWTMTELLMNPQVMEKAQKEVRSILGERRIVTESDLHQLEYMRAVIKEIFWLHPPVPVLV

PRESMEDVVIEGYRAPAKTRVFVNAWAIGRDPESWEDPNAFNPES 660246

 

>CYP71A41P CYP71A34P pseudogene scaffold_157: 682263..683677 (+) strand

pseudogene 71% to 71A10

chr6 14853964- 14855378 (+) strand

682263 14853964 EACASQRERPCVNLSEMLIAASNNIVSRCVLGRKYDDKMGCARSSSCSFGVLGRKVMR

LLSAFCVGDFFPSLCWVDSLTGLIP*MKSTSVAIDASL &

DEEVIVEHESKNMKNDHSDVKDFLGILLHLQECRRLDFKLTRDNLKGILM (0)

682826  DMIVGGSDTTSTNLEWAFVDLFRKPNTMNKA*EEVRKAMGINSRLVDEKC

VNQMNYLKCVIKETMRLHTLIPLLIARETTSNVKLR & 683083

YDIPAKTRIFINAWAI &

PEEFILERFEISQVDLNG*DFQLILFGGGRRACPA &

ISFELASTKCVLANL &

YRFNWKMPKSCVMMHNINMIE*N &

GFSVS*KVPFHFK*ESYI 683677 14855378

 

>CYP71A42P CYP71A39P

Gm0171

Glyma17g13450.1              Gm17:10297791..10298212 (+ strand)

652427  LHQLGTLSHRTLQQLSNKHGPLMFLQLG      652510

652519  PTLVVSSTEMAREIFKNRDSVFSGRPSLHAANRLGYNGSTVSFA

PYGEYWREMRKIMILELLSPKRVQSFQAVRLEEVKLLL  652764

 

>CYP71A43 CYP71A38

Gm0171x10010:peptide 59% to 71A33 scaffold_171:641359..639377 (- strand)

Glyma17g13430.1 Gm17:10284540..10286991 (- strand)

MALLKQWPYEVFSSTFYISLSFFISVLLLFKLTKRTKPKTNLNLPPSLPKLPIIGNIHQ

FGTLPHRSLRDLSLKYGDMMMLQLGQMQTPTLVVSSVDVAMEIIKTHDLAFSDRPHNTAAKILLYGCTDVGFASYGEKWR

QKRKICVLELLSMKRVQSFRVIREEEAAKLVNKLREASSSDASYVNLSEMLMSTSNNIVCKCAIGRNFTRDGYNSGKVLA

REVMIHLTAFTVRDYFPWLGWMDVLTGKIQKYKATAGAMDALFDQAIAEHLAQKREGEHSKRKDFLDILLQLQEDSMLSF

ELTKTDIKALVTDMFVGGTDTTAAVLEWAMSELLRNPNIMKKVQEEVRTVVGHKSKVEENDISQMHYLKCVVKEILRLHI

PTPLLAPRVTMSDVKLKGYDIPAKTMVYINAWAMQRDPKFWERPEEFLPERFENSKVDFKGQEYFQFIPFGFGRRGCPGM

NFGIASVEYLLASLLYWFDWKLPETDTQDVDMSEIFGLVVSKKVPLLLKPKTFSF*

 

>CYP71A44 CYP71A37

74% to 71A34 no gene model, Gm0171

Glyma17g13420.1             Gm17:10265063..10269766 (- strand)

623989 10269539 MAFSTFYLSLFFF

ISVLYLFNLTRKTKSKTNLNLPPSPPKLPLIGNLHQLGSLPHRSLRDLSLKHGDIMLLQL

GQMQNPTVVVSSADVAMEIMKTHDMAFSNRPQNTAAKVLLYGGIDIVFGLYGERWSQKRK

ICARELLSTKRVQSFHQIRKEEVAILVNKLREVSSSEECYVNLSDMLMATANDVVCRCVL

GRKYPGVKELARDVMVQLTAFTVRDYFPLMGWIDVLTGKIQEHKATFRALDAVFDQAIAE

HMKEKMEGEKSKKKDFVDILLQLQENNMLSYELTKNDLKSLLL (0) 623102

620430 DMFVGGTDTSRATLEWTLSELVRNPTI

MKKVQEEVRKVVGHKSNVEENDIDQMYYLKCVVKETLRLHSPAPLMAPHETISSVKLKGY

DIPAKTVVYINIWAIQRDPAFWESPEQFLPERFENSQVDFKGQHFQFIPFGFGRRGCPGM

NFGLAFVEYVLASLLYWFDWKLPESDTLKQDIDMSEVFGLVVSKKTPLYLKPVTVSSLSEF* 10265354 619804

 

>CYP71A45P CYP71A36P

Gm0171x10009:peptide           scaffold_171:618246..616969 (- strand)

62% to 71A33 no gene model chr17 10262-10263k-

10263796 MGLESIQQEPCSASHLSCLNSQEEATPIHQHRQSSSATHVAPYSEEAEKKGL

618090 FVSTKSVRSFQFIIVEEVAEMIGVIHEACATSYKKLASVNLSELLIALTN  617941

617908 ELGRKLLCQFTAFWMGDFFPSLAWVDVLA

GQIPKFKVTLSSLDSFFDQVIAQHKEKMNKREDHEQSDTKDFVDILLQLEEAGMLGFELSHDNLKAMLV

617225 DMFIGASDTTSTTLEWTMAELMRHQNTMEKVQEEVRRVVGYKAEVDEND

VKQMNYLKCVVKETLRLHPPAPLLLPRENTSVVKLRGYEIQAKTRLM

LNAWAIQRDPEFCDGPDEFL  616878 10262428

 

>CYP71D8     Glycine max (soybean)

            GenEMBL Y10493 (1800bp)

            Schopfer,C.R. and Ebel,J.

            Identification of elicitor-induced cytochrome P450s of soybean

            (Glycine max L.) using differential display of mRNA

            Mol. Gen. Genet. 258, 315-322 (1998)

            clone CP7

            note: genomic sequence does not give intact sequence

            but the mRNA does so the genomic seq may have errors

Gm0053x00464:peptide              scaffold_53:3854820..3852677 (- strand)

Chr11 Glyma11g06690.1, Gm11:4727299..4729773 (+ strand)

3854820       4727347       MEYSPLSIVITFFVFLLLHWLVKTYKQKSSHKLPPGPWRLPIIG

NLHQLALAASLPDQALQKLVRKYGPLMHLQLGEISTLVVSSPKMAMEMMKTHDVHFVQ

RPQLLAPQFMVYGATDIAFAPYGDYWRQIRKICTLELLSAKRVQSFSHIRQDENKKLI

QSIHSSAGSPIDLSGKLFSLLGTTVSRAAFGK 3854245 &

3854243 ENDDQDEFMSLVRKA 3854199 &

3854197 ITMTGGFEVDD

MFPSLKPLHLLTRQKAKVEHVHQRADKILEDILRKHMEKRTRVKEGNGSEAEQEDLVD

VLLRLKESGSLEVPMTMENIKAVIW (0) 3853916

NIFAAGTDTSASTLEWAMSEMMKNPKVKEKAQA

ELRQIFKGKEIIRETDLEELSYLKSVIKETLRLHPPSQLIPRECIISTNIDGYEIPIK

TKVMINTWAIGRDPQYWSDADRFIPERFNDSSIDFKGNSFEYIPFGAGRRMCPGMTFG

LASITLPLALLLYHFNWELPNKMKPEDLDMDEHFGMTVARKNKLFLIPTVYEAS 4729485

 

>CYP71D9     Glycine max (soybean)

            GenEMBL Y10490 (1754bp)

            Schopfer,C.R. and Ebel,J.

            Identification of elicitor-induced cytochrome P450s of soybean

            (Glycine max L.) using differential display of mRNA

            Mol. Gen. Genet. 258, 315-322 (1998)

            clone CP3

Gm0004x00151:peptide

              scaffold_4:1805973..1809310 (+ strand)

Glyma18g08950.1 Gm18:7686955..7690536 (+ strand)

MDLQLLYFTSIFSIFIFMFMTHKIVTKKSNSTPSLPPGPWKLPI

IGNMHNLVGSPLPHHRLRDLSAKYGSLMHLKLGEVSTIVVSSPEYAKEVMKTHDHIFA

SRPYVLAAEIMDYDFKGVAFTPYGDYWRQLRKIFALELLSSKRVQSFQPIREEVLTSF

IKRMATIEGSQVNVTKEVISTVFTITARTALGSKSRHHQKLISVVTEAAKISGGFDLG

DLYPSVKFLQHMSGLKPKLEKLHQQADQIMQNIINEHREAKSSATGDQGEEEVLLDVL

LKKEFGLSDESIKAVIW (0)

DIFGGGSDTSSATITWAMAEMIKNPRTMEKVQTEVRRVFDK

EGRPNGSGTENLKYLKSVVSETLRLHPPAPLLLPRECGQACEINGYHIPAKSRVIVNA

WAIGRDPRLWTEAERFYPERFIERSIEYKSNSFEFIPFGAGRRMCPGLTFGLSNVEYV

LAMLMYHFDWKLPKGTKNEDLGMTEIFGITVARKDDLYLIPKTVHN

 

>CYP71D10    Glycine max (soybean)

            GenEMBL AF022459 (1691bp)

            Siminszky,B., Dewey,R.E. and Corbin,F.T.

            clone name 5/16

Gm0070x00483:peptide              scaffold_70:3975543..3973852 (- strand)

Chr15 Glyma15g05580.1 Gm15:3950984..3952757 (+ strand)

3975631       3950978 MVMELHNHTPFSIYFITSILFIFFVFFKLVQRSDSKTSSTCKLP

PGPRTLPLIGNIHQIVGSLPVHYYLKNLADKYGPLMHLKLGEVSNIIVTSPEMAQEIM

KTHDLNFSDRPDFVLSRIVSYNGSGIVFSQHGDYWRQLRKICTVELLTAKRVQSFRSI

REEEVAELVKKIAATASEEGGSIFNLTQSIYSMTFGIAARAAFGKKSRYQQVFISNMH

KQLMLLGGFSVADLYPSSRVFQMMGATGKLEKVHRVTDRVLQDIIDEHKNRNRSSEER

EAVEDLVDVLLKFQKESEFRLTDDNIKAVIQDIFIGGGETSSSVVEWGMSELIRNPRV

MEEAQAEVRRVYDSKGYVDETELHQLIYLKSIIKETMRLHPPVPLLVPRVSRERCQIN

GYEIPSKTRIIINAWAIGRNPKYWGETESFKPERFLNSSIDFRGTDFEFIPFGAGRRI

CPGITFAIPNIELPLAQLLYHFDWKLPNKMKNEELDMTESNGITLRRQNDLCLIPITR

LP* 3973855 3952757

 

>CYP71D96    Glycine max (soybeans, Fabales)

            DQ340243,

            ESTs BM892131, BI892766, BM094727,

            BE805102, BF595208, BM892923, BF324498,

            CA953308, BE347664

            Li,L.Y. and Yu,D.Y.

            Comprehensive analysis of putative P450 genes superfamily in

            Glycine max and Medicago truncatula

            Unpublished

            74% to 71D87, 71% to 71D86, Called CYP71D54

            cannot be certain about the ortholog

scaffold _13 no introns      scaffold_13:5879367..5876961 (- strand)

Glyma01g38590.1              Gm01:50645569..50648012 (- strand)

5879367         MEAQASFLFISLFFSLVLHLLAKHYYKPKTTLSHKLPPGPKKLPLIGNLH

             QLAMAGSLPHRTLRDLALKYGPLMHLQLGEISSVVVSSPNMAKEIMKTHD      5879068

5879067         LAFVQRPQFLPAQILTYGQNDIVFAPYGDYWRQMKKICVSELLSAKRVQSFSHIRED

             ETSKFIESIRISEGSPINLTSKIYSLVSSSVSRVAFGDKSKDQ      5878768

5878767         EEFLCVLEKMILAGGGFEPDDLFPSMKLHLINGRKAKLEK

MHEQVDKIADNILREHQEKRQRALREGKVDLEEEDLVDVLLRIQ    5878516

QSDNLEIKISTTNIKAVILDVFTAGTDTSASTLEWAMAEMMRNPRVREKAQAEVRQAF

RELKIIHETDVGKLTYLKLVIKETLRLHAPSPLLVPRECSELTIIDGYEIPVKTKVMI

NVWAIGRDPQYWTDAERFVPERFDGSSIDFKGNNFEYLPFGAGRRMCPGMTFGLANIM

LPLALLLYHFNWELPNEMKPEDMDMSENFGLTVTRKSELCLIPIVNDL*

 

>CYP71D99    Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C1

            83% to 71D100, 66% to 71D105, 58% to 71D101, 60% to 71D102,

            58% to 71D11, 54% to 71D104, 64% to 71D81 Medicago

            = patent seqs CS716170, CS716157, CS716151

Gm0186x00071:peptide scaffold_186:559955..557867 (- strand)

Glyma08g43900.1 Gm08:43688957..43691102 (- strand)

MALLFFYFLVLISFAFTTIIVQKIRKKPKKTDDTTCKIPHGPRKLPIIGNIYNLLCSQPH

RKLRDLAIKYGPVMHLQLGQVSTIVISSPECAREVMKTHDINFATRPKVLAIEIMSYNST

SIAFAGYGNYWRQLRKICTLELLSLKRVNSFQPIREDELFNLVKWIDSKKGSPINLTEAV

LTSIYTIASRAAFGKNCKDQEKFISVVKKTSKLAAGFGIEDLFPSVTWLQHVTGLRAKLE

RLHQQADQIMENIINEHKEANSKAKDDQSEAEEDLVDVLIQYEDGSKKDFSLTRNKIKAI

ILDIFAAGGETTATTIDWAMAEMVKNPTVMKKAQSEVREVCNMKARVDENCINELQYLKL

IVKETLRLHPPAPLLLPRECGQTCEIHGYHIPAKTKVIVNAWAIGRDPNYWTESERFYPE

RFIDSTIDYKGSNFEFIPFGAGRRICAGSTFALRAAELALAMLLYHFDWKLPSGMRSGEL

DMSEDFGVTTIRKDNLFLVPFPYHPLPVS

 

>CYP71D100   Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C16

            83% to 71D99, 65% to 71D105, 58% to 71D9,

            63% to 71D81 Medicago

Gm0186x00072:peptide scaffold_186:567184..565259 (- strand)

Model is short at N-term

Glyma08g43920.1             Gm08:43696374..43698299 (- strand)

MALLFLFFVALISFLFTILIVQKLGKKSKKTDDTTCDMHMPHGPRKLPIIGNIYNLICSQ

PHRKLRDLAIKYGPVMHLQLGEVSTIVISSPDCAKEVMTTHDINFATRPQILATEIMSYN

STSIAFSPYGNYWRQLRKICILELLSLKRVNSYQPVREEELFNLVKWIASEKGSPINLTQ

AVLSSVYTISSRATFGKKCKDQEKFISVLTKSIKVSAGFNMGDLFPSSTWLQHLTGLRPK

LERLHQQADQILENIINDHKEAKSKAKGDDSEAQDLVDVLIQYEDGSKQDFSLTKNNIKA

IIQDIFAAGGETSATTIDWAMAEMIKDPRVMKKAQAEVREVFGMNGRVDENCINELQYLK

LIVKETLRLHPPAPLLLPRECGQTCEIHGYHIPAKTKVIVNAWAIGRDPKYWTESERFYP

ERFIDSTIDYKGNSFEFIPFGAGRRICPGSTSALRTIDLALAMLLYHFDWNLPNGMRSGE

LDMSEEFGVTVRRKDDLILVPFPYHPLPVT

 

>CYP71D101   Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C2

            92% to 71D102, 75% TO 71D9, 61% TO C19, 60% TO 71D100, 58% TO 71D99,

            56% TO 71D104, 60% to 71D81 medicago

scaffold_4:1781316..1783621 (+ strand) missing N-term

Gm0004x00149:peptide

Glyma18g08930.1              Gm18:7662218..7664677 (+ strand)

1781204 MDLQTLYFTSILSIFIFMFLGHKIITKKPASTPNLPPGPWKIPIIGNIHNVVGSLPHHRL

RDLSAKYGPLMHLKLGEVSTIVVSSPEYAKEVLSTHDLIFSSRPPILASKIMSYDSMGMS

FAPYGDYWRRLRKICASELLSSKRVQSFQPIRGEELTNFIKRIASKEGSPINLTKEVLLT

VSTIVSRTALGNKCRDHKKFISAVREATEAAGGFDLGDLYPSAEWLQHISGLKPKLEKYH

QQADRIMQNIVNEHREAKSSATHGQGEEVADDLVDVLMKEEFGLSDNSIKAVIL (0)

DMFGGGTQTSSTTITWAMAEMIKNPRVMKKVHAEVREVFGGKVGHPDESDMENLKYLKSVVKETLR

LHPPGPLLLPRQCGQACEINGYYIPIKSKVIINAWAIGRDPNHWSEAERFYPERFIGSSV

DYQGNSFEYIPFGAGRRICPGLTFGLTNVEFPLALLMYYFDWKLPNEMKNEDLDMTEAFG

VSARRKDDLCLIPITFHL* 1783489

 

>CYP71D102   Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C18

            92% to 71D101, 74% to 71D9, 63% to 71D81 Medicago

Gm0186x00070:peptide scaffold_186:548163..545888 (- strand)

Glyma08g43890.1             Gm08:43676827..43679376 (- strand)

548163 MKKKSASTPNLPPGPWKLPIIGNILNIVGSLPHCRLRDLSAKYGPLMHLKLGEVSTIVVS

SPEYAKEVLNTHDLIFSSRPPILASKIMSYDSKGMSFAPYGDYWRWLRKICTSELLSSKC

VQSFQPIRGEELTNFIKRIASKEGSAINLTKEVLTTVSTIVSRTALGNKCRDHQKFISSV

REGTEAAGGFDLGDLYPSAEWLQHISGLKPKLEKYHQQADRIMQSIINEHREAKSSATQG

QGEEVADDLVDVLMKEEFGLSDNSIKAVIL (0) 547354

546502 DMFGGGTQTSSTTITWAMAEMIKNPRVTKK

IHAELRDVFGGKVGHPNESDMENLKYLKSVVKETLRLYPPGPLLLPRQCGQDCEINGYHI

PIKSKVIVNAWAIGRDPNHWSEAERFYPERFIGSSVDYKGNSFEYIPFGAGRRICPGLTF

GLTNVELPLAFLMYHFDWKLPNGMKNEDLDMTEALGVSARRKDDLCLIPITFHP* 545888

 

>CYP71D103P Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C3

            61% TO 71D105, C-HELIX TO I-HELIX, 49% TO 71D11

            54% TO 71D93 MEDICAGO middle region

            no ESTs

            scaffold 49 pseudogene, 64% to 71D105

              scaffold_49:5584638..5585264 (+ strand)

chr20 (-) strand Glyma20g01090.1 Gm20:761225..762863 (- strand)

5584572  762882     IYLQLGETTTIIVSSPECVKEI

MKTHDVVFASRPQSATFDILYYESTGIASAPYGNYWRVIRRMCTIELFTQKRVNYFQPIR

EEELSYLIIKIIDYSHKGSSSSPINVSQMVLSSIYSITSTVAFGKNYKD 762490

QEEFISLVKEE

VEIAGRDLYCSARWLQLVTGLRAKLEKLHRQMDRVLENIIIEHKEAKSGAKEGQCEQKKE

DLVDILLKFQDGSDKDICLTNGKFKGIIQ 5585264

5585991 DIFVGGGDTSAITIDWAMAEM & 5586053

5586053 VRDPRVMKKAQAEVRKVFNIKGRIDETCINELKYLKSVVKETLRLQPPFPLVPREC 5586220

 

>CYP71D103P

chr20 (-) strand Glyma20g01090.1 Gm20:761225..762863 (- strand)

762882  IYLQLGETTTIIVSSPECVKEIMKTHDVVFASRPQSATFDILYYESTGIASAP

        YGNYWRVIRRMCTIELFTQKRVNYFQPIREEELSYLIIKIIDYSHKG        762583

762582  SSSSPINVSQMVLSSIYSITSTVAFGKNYKDQEEFISLVKEEVEIAG

        RDLYCSARWLQLVTGLRAKLEKLHRQMDRVLENIIIEHKEAKSGAKEGQ     762295

762294  CEQKKEDLVDILLKFQDGSDKDICLTNGKFKGII       762193

 

>CYP71D104   Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C17

            56% to 71D105, 56% to 71D101, 55% to 71D10, 55%

            to 71D81 Medicago

              scaffold_4:1796671..1799722 (+ strand)

              Gm0004x00150:peptide

Glyma18g08940.1              Gm18:7677712..7680602 (+ strand)

MDLGHQNIPSLAILPFFLFMFTVFSLFWRTKTKPSNSKLPPGPPKLPLIGNLHQLGAMPH

HGLTKLSHQYGPLMHIKLGALSTIVVSSPEMAKEVLKTHDIIFANRPYLLAADVISYGSK

GMSFSPYGSYWRQMRKICTFELLTPKRVESFQAIREEEASNLVREIGLGEGSSINLTRMI

NSFSYGLTSRVAFGGKSKDQEAFIDVMKDVLKVIAGFSLADLYPIKGLQVLTGLRSKVEK

LHQEVDRILEKIVRDHRDTSSETKETLEKTGEDLVDVLLKLQRQNNLEHPLSDNVIKATI

LDIFSAGSGTSAKTSEWAMSELVKNPRVMEKAQAEVRRVFGEKGHVDEANLHELSYLKSV

IKETLRLHIPVPFLLPRECSERCEINGYEIPAKSKVIINGWAIGRDPNHWTDAKKFCPER

FLDSSVDYKGADFQFIPFGAGRRMCPGSAFGIANVELLLANLLFHFDWNMPNGKKPEELD

MSESFGLSVRRKHDLYLIPSICLSFGN

 

>CYP71D105   Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 10/15/2008

            Clone C19

            63% to 71D11, 65% to 71D100, 70% to 71D81

            Medicago

Gm0049x00336:peptide scaffold_49:5668495..5665897 (- strand)

Chr20 (+) strand Glyma20g00980.1 Gm20:678927..681649 (+ strand)

Unneeded extension

5668558 MATKLVIFIRTFRTPYWSLVT

MDSEVLNLLALILPFLLFVIVALKIGRRNLKKSESTPKI

PPGPWKLPIIGNILHLVTSTPHRKLRDLAKIYGPLMHLQLGELFIIVVSSAEYAKEIMKT

HDVIFAQRPHSLASDILSYESTNIISAPYGHYWRQLRKICTVELFTQKRVNSFKPIREEE

LGNLVKMIDSHGGSSSINLTEAVLLSIYNIISRAAFGMKCKDQEEFISVVKEAITIGAGF

HIGDLFPSAKWLQLVSGLRPKLDIIHEKIDRILGDIINEHKAAKSKAREGQDEAEEDLVD

VLLKFKDGNDRNQDICLTTNNIKAIIL

680249 DIFGAGGETSATTINWAMAEMIKNPRAMNKAQL

EVREVFDMKGMVDEICIDQLKYLKSVVKETLRLHPPAPLLLPRECGQTCEIHGYHIPGKS

KVIVNAWTIGRDPNYWTEAERFHPERFFDSSIDYKGTNFEYIPFGAGRRICPGITLGLIN

VELTLAFLLYHFDWKLPNGMKSEDLDMTEKFGVTVRRKDDLYLIPVTSRPFLVR 5666598 680869

 

>CYP71D105

chr20 (+) strand) Glyma20g00980.1 Gm20:678927..681649 (+ strand)

678972  MDSEVLNLLALILPFLLFVIVALKIGRRNLKKSESTPKIPPGPWKLPIIGNIL

        HLVTSTPHRKLRDLAKIYGPLMHLQLGELFIIVVSSAEYAKEIMKTH        679271

679272  DVIFAQRPHSLASDILSYESTNIISAPYGHYWRQLRKICTVELFTQKRVNSFKPIR

        EELGNLVKMIDSHGGSSSINLTEAVLLSIYNIISRAAFGMKCK    679571

679572  DQEEFISVVKEAITIGAGFHIGDLFPSAKWLQLVSGLRPKLDIIHEKIDRILGDII

        NEHKAAKSKAREGQDEAEEDLVDVLLKFKDGNDRNQDICLTTNN             679871

679872  IKAIIL  (0)  679889

680249  DIFGAGGETSATTINWAMAEMIKNPRAMNKAQL

EVREVFDMKGMVDEICIDQLKYLKSVVKETLRLHPPAPLLLPRECGQTCEIHGYHIPGKS

KVIVNAWTIGRDPNYWTEAERFHPERFFDSSIDYKGTNFEYIPFGAGRRICPGITLGLIN

VELTLAFLLYHFDWKLPNGMKSEDLDMTEKFGVTVRRKDDLYLIPVTSRPFLVR  680869

 

>CYP71D106 CYP71D159 Glyma01g38630.1:peptide 93% to 71D8 Gm01:50667387..50669162 (- strand)

model is short

50669226    MEYSPLSIVI

TFFVFLLLHWLVKIYKQKSRYKLPPSPWRLPIIGNLHQLALAASLPDQALQKLVRKYGPL

MHLQLGEISALVVSSPKMAMEVMKTHDVHFVQRPQLLAPQFMVYGATDIVFAPYGDYWRQIRKICTLELLSAKRVQSFSH

IRQDENRKLIQSIHSSAGSSIDLSGKLFSLLGTTVSRAAFGKENDDQDELMSLVRKAITMTGGFELDDMFPSLKPLHLLT

RQKAKVEHVHQRADKILEDILRKHMEKRTIGKEGSNEAEQEDLVDVLLRLKESGSLEVPMTMENIKAVIWNIFASGTDTP

ASTLEWAMSEMMKNPRVREKAQAELRQTFKGKEIIRETDLEELSYLKSVIKETLRLHPPSQLIPRECIKSTNIDGYDIPI

KTKVMINTWAIGRDPQYWSDAERFIPERFDDSSIDFKGNSFEYIPFGAGRRMCPGITFGLASITLPLALLLYHFNWELPN

KMKPADLDMDELFGLTVVRKNKLFLIPTIYEAS*

 

>CYP71D107P CYP71D167P Chr1 cypnew chr1 64% to 71D145 Glyma01g38620.1

50665300 TFFLFLLLHWLIKKYKSKSSHT

50665234    LSPGPRKLPLIGTCINLLTVAGSLQYHALRELAHKYEPLMHLQLC

EISAVINCILPKMVAKEIMKTHDLAFVQPQLLSPQTLAYGATNIAFAPYGGDY*  50664938

50664937    RQMRKKCT &

LELLSAERV*SFSYLLEDETKNYRLHSKIAGSPINLTSRIFS

LLIC*ALLAAFGNKSEDQDEFVSLVRE 50664707

 

>CYP71D108 CYP71D158 Glyma01g38610.1:peptide   Gm01:50657113..50660224 (- strand)

73% to 71D96

MEAQTYFLVIALSLFILLNWLAKYLKLKPNVAHKLPPGPKKLPLIGNMHQLAVAGSLPHRALQKLAHIYGPLMHLQLGEI

SAVVVSSPNMAKEITKTHDVAFVQRPQIISAQILSYGGLDVVFAPYGDYWRQMRKVFVSELLSAKRVQSFSFIREDETAK

FIDSIRASEGSPINLTRKVFSLVSASVSRAAIGNKSKDQDEFMYWLQKVIGSVGGFDLADLFPSMKSIHFITGSKAKLEK

LLNRVDKVLENIVREHLERQIRAKDGRVEVEDEDLVDVLLRIQQADTLDIKMTTRHVKALILDVFAAGIDTSASTLEWAM

TEMMKNSRVREKAQAELRKVFGEKKIIHESDIEQLTYLKLVIKETLRLHPPTPLLIPRECSEETIIGGYEIPVKTKVMIN

VWAICRDPKYWTDAERFVPERFEDSSIDFKGNNFEYLPFGAGRRICPGITFGLASIMLPLAQLLLHFNWELPDGMKPESI

DMTERFGLAIGRKHDLCLIPFVDNL*

 

>CYP71D109 CYP71D157 Glyma01g38600.1:peptide     Gm01:50652996..50654725 (- strand)

88% to 71D96

MEAQACFMFTTLFFFWVLHWLA

YYYKPKTTLSHKLPPGPKKLPLIGNLHQLAMAGSLPHRTLRDLALKYGPLMHLQLGEISSVVVSSPNMAKEIMKTHDLAF

VQRPQFLPAQILTYGQSDIAFAPYGDYWRQMKKICVSELLSAKRVQSFSDIREDETAKFIESVRTSEGSPVNLTNKIYSL

VSSAISRVAFGNKCKDQEEFVSLVKELVVVGAGFELDDLFPSMKLHLINGRKAKLEKMQEQVDKIVDNILKEHQEKRERA

RREGRVDLEEEDLVDVLLRIQQSDNLEIKITTTNIKAIILDVFTAGTDTSASTLEWAMAEMMRNPRVREKAQAEVRQAFR

ELKIINETDVEELIYLKLVIKETLRLHTPSPLLLPRECSKRTIIDGYEIPVKTKVMINAWAIARDPQYWTDAERFVPERF

DGSSIDFKGNNFEYLPFGAGRRMCPGMTLGLANIMLPLALLLYHFNWELPNEMKPEYMDMVENFGLTVGRKNELCLIPVVNDL*

 

>CYP71D110 CYP71D160 Glyma01g42600.1:peptide   Gm01:53822921..53824667 (- strand)

72% to 71D10 joint may need fixing

MVMELHSQNNPFSIYLITSFLFLLFLLFKLVKKSSSNNSTSKLPPGPKTLPLIGNLHQLVGSKSHHCFKKLADKYGPLMH

LKLGEVSNIIVTSKELAQEIMRTQDLNFADRPNLISTKVVSYDATSISFAPHGDYWRQLRKLCTVELLTSKRVQSFRSIR

EDEVSELVQKIRASASEEGSVFNLSQHIYPMTYAIAARASFGKKSKYQEMFISLIKEQLSLIGGFSIADLYPSIGLLQIM

AKAKVEKVHREVDRVLQDIIDQHKNRKSTDREAVEDLVDVLLKFRRHPGNLIEYINDMFIGGGETSSSTVEWSMSEMVRN

PRAMEKAQAEVRKVFDSKGYVNEAELHQLTYLKCIIREAMRLHPPVPMLIPRVNRERCQISGYEIPAKTRVFINAWAIGR

DPKYWTEAESFKPERFLNSSIDFKGTNYEFIPFGAGRRICPGITFATPNIELPLAHLLYHFDWKLPNNMKNEELDMTESY

GATARRAKDLCLIPITVRP*

 

>CYP71D111 CYP71D122

Gm0043x00075:peptide 92% to 71D112 4207-4209k+

Glyma02g17940.1 model short             Gm02:16101502..16103987 (- strand)

MEAQTFFLVIALFFLLHWLAKCYNSSVCHKLPPGPKKLPIIGNLHQL

AEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFG

QMISYGGLGIAFAPYGDHWRQMRKMCATELLSAKRVQSFASIREDEAAKFIDLIRESAGS

PINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYF

ITGKMARLKKLHKQVDKVLENIIKDHHEKNKSAKEDGAEVEDQDFIDLLLRIQQDDTLGI

EMTTNNIKALIL (0)

DIFAAGTDTSSSTLEWTMTEMMRNPTVREKAQAELRQTFREKDIIHESDL

EQLTYLKLVIKETLRVHPPTPLLLPRECSQLTIIDGYEIPAKTKVMVNAYAICKDPQYWT

HADRFIPERFEDSSIDFKGNNFEYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELP

NNMKPEDMDMAEHFGLAINRKNELHLVPFVYDL*

 

>CYP71D112 CYP71D118

Gm0043x00078:peptide 94% to CYP71D148

Glyma02g17720.1              Gm02:15932477..15934785 (- strand)

MEAQTYFLVIALFFLLHWLAKCYKSSVVSHKLPPGPKKL

PIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSF

LQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSAKRVQSFASIREDEAAKFI

NSIREAAGSPINLTSQIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADV

FPSIPFLYFITGKMAKLKKLHKQVDKVLENIIREHQEKKKIAKEDGAEVEDQDFIDLLLK

IQQDDTMDIEMTTNNIKALIL (0)

DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQTFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQ

PTIIDGYEIPTKTKVMVNAYAICKDPKYWTDAERFVPERFEDSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLAL

LLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLVPLVSDH*

 

>CYP71D113 CYP71D164

Glyma02g40150.1:peptide revised 54% to 71D124

           Gm02:45358150..45360601 (- strand)

MEHQLITFLSFLLYSLSFILFLFQILKVGKRSKVKTMNLPPGPWKLPIIGSIHHMIGFLPHHRLRELALKHGPLMHLKLG

EVPAIVVSSPEVAKEVMKTYDSIFAQRPHQVGADIMCYGSTDIATAPLGGYWKQLRRICSQELLSNKRVRSYQSIREEEV

LNLMRLVD ANTRSCVNLSEKVSCMTSAITARATFGEKCKDQED

FISLVKKLLKLVERLFVFDIFPSHKWLHVISGEISKLEELQREYDMIIGNIIRKAEKKTGE

VEVDSLLSVLLNIKNHDVLEYPLTIDNIKAVML (0)

NMFGAGTDTSSAVI

EWTMSEMLKNPRVMTKAQEEVRRVFGSKGYTNEAALEDLKFLKAVIKETLRLHPPFPLLLPRECRETCEVKGYTIPAGTK

VIVNAWAIARDPKYWSEAEKFYPERFMDSPIDYKGSNHELIPFGAGRRICPGISFGVSSVELCLAQLLYYFNWELPNGNK

ENDLEMTEALGASSRRKTDLTLKVLVTVKAVNLC*

 

>CYP71D114 CYP71D163

Glyma02g46840.1:peptide Gm02:50681153..50683602 (- strand)

87% to 71D150

MEMELHISLSTILPFFILVFMLIINIVWRSKTKNSNSKLPPGPRKLPLIGNIHHLGTLPHRSLARLANQYGPLMHMQLGE

LSCIMVSSPEMAKEVMKTHDIIFANRPYVLAADVITYGSKGMTFSPQGTYWRQMRKICTMELLAPKRVDSFRSIREQELS

IFVKEMSLSEGSPINLSEKISSLAYGLISRIAFGKKSKDQEAYIEFMKGVTDTVSGFSLADLYPSIGLLQVLTGIRPRVE

KIRRGMDRIIDNIVRDHRDKNSDTQPVVGEENGEDLVDVLLRLQKNGNLQHPLSDTVVKATIMDIFSAGSETTSTTMEWA

MSELVKNPRMMEKAQIEVRRVFDPKGYVDETSIHELKYLRSVIKETLRLHTPVPLLLPRECSERCEINGYEIPAKSKVIV

NAWAIGRDPNYWIEAEKFSPERFIDCSIDYKGGEFQFIPFGAGRRICPGINLGIVNVEFSLANLLFHFDWKMAPGNSPQE

LDMTESFGLSLKRKQDLQLIPITYHTAA*

 

>CYP71D115P CYP71D162P Glyma02g46830.1:peptide 85% to 71D150

pseudogene missing I-helix Gm02:50677600..50680052 (- strand)

MLELHIISLSTILPFFFLVFTIINTLWR

SKTKNSNSKLPQGPRKLPFIGSIQHLGTLPHRSLARLASQYGPLMHMQLGELCCIVVSSPQMAKE

VMNTHDIIFANRPYV

AADVITFGSKGMTFSPQGTYWRQMRKICTMELLAPKRVESFRSIRERELSFFVR

EISLIEGSPINLSEKITSLAYGL

LSRIVFGKKSKD

QEAYMVHMKGVVETIEGFSLADLYPSIGLLQVLTGIKTRVEKIQRGMDTILENI

VRDHRNKTLDTQAIGEENGEYLVDVLLRL

 

VKNPRVMEKVQIEVR

RVFNGKGYVDETSIHELKYLRSVIKETLRLHPPSPLMLSRECSKRCEINGYEIQIKSKVIVNAWAIGRDPKYWIEAEKFS

PERFIDCSIDYEGGEFQFIPYGAGRRICPGINFGIVNVEFSLANLLFHFDWKMAQGNGPEELDMTESFGFLNYLYHHLYF

SV*

 

>CYP71D116 CYP71D161

Glyma02g46820.1:peptide 94% to 71D110 Gm02:50670896..50672756 (- strand)

MVMELQSQNNPFSIHLITFFLFLLFLLFKLVKKSSSNNTSKLPPGPKTLPLIGNLHQLVGSKSHHCFKKLADKYGPLMHL

KLGEVSNIIVTSKELAQEIMRTQDLNFADRPNLVSTKIVSYNATSISFAPHGDYWRQLRKLCTVELLTSKRVQSFRSIRE

DEVSELVQKIRAGASEEGSVFNLSQHIYPMTYAIAARASFGKKSKYQEMFISLIKEQLSLIGGFSLADLYPSIGLLQIMA

KAKVEKVHREVDRVLQDIIDQHKNRKSTDREAVEDLVDVLLKFRSENELQYPLTDDNLKAVIQDMFIGGGETSSSTVEWS

MSEMVRNPWAMEKAQAEVRKVFDSKGYVNEAELHQLTYLKCIIREAMRLHPPVPLLIPRVNRERCKINGYEIPAKTRVFI

NAWAIGRDPKYWTEAESFKPERFLNSSIDFKGTNYEFIPFGAGRRICPGISFATPNIELPLAHLLYHFDWKLPNNMKNEE

LDMTESYGATARRAKDLCLIPITVRP*

 

>CYP71D117P CYP71D173P chr2 identical to CYP71D118P adjacent (tandem duplication)

no gene model

INMGRTLEDLWKD

50659704  EVRKEFDNKGYVDEADLHQLIYLKCTIRDAMRLHSPVP  50659591

 

>CYP71D118P CYP71D172P chr2 nearly identical to CYP71D152P on chr14, no gene model

INMGRTLEDLWKD

50644472  EVRKEFDNKGYVDEADLHQLIYLKCTIRDAMRLHSPVP  50644359

 

>CYP71D119P CYP71D129P 85% to 71D127

Gm0003x00281:peptide 2190-2193k+

Chr5 Glyma05g28540.1

34376706 MELLFPFSLLFTFACILLALFNTLNRSNSKNLPPG

WKLPLLGNIHQFLGPLPHQTLANLA

NQHGPLMHLQLGEKPHII

SSADIAKEIMKTHDAIFANRPHLLASKFFVYDSSDI

FSSYGRAWRQLK

KKFCISELLNAKHV*SLRHTREKEATKLVRNVYANEGSIINLTTKEIESV

TIAIIARAANGTKCKDQEAFVSTMEQMLVLLGGFSIADFYPSIKVLPLLT

TGMKTRVERAQRENDKILEHMVKDHQENRNKHGVTHEDFIDILLKTQKR

DDLEIPMTHNNIKALIW

DMFAGGTAAPTAVTVWAMSEHMKNPKVMEKAHTEIRKVFNVKGYVDETGLRQCQYLNSVI

KETKRLH

PPEALLVSRENSEACVINGYEIPAKSKVIINAWAIGRES

NSYDFSGTNFEYIPFGAGRRICPGAAFSMPYMLLSVANLLYHFVWELPNGAIHQELDM

THESFGLTVKRANDLCLIPIPYHPTS*LGH 34373704

 

>CYP71D120P CYP71D117P Scaffold_236 no gene model 59% to CYP71D103P

chr6 no gene model 555-556k-

9786  VILASRPLSVAAHIMYY*STATTGIVYAPFEKYWRVLQKMCTIE

       LFTQKCVNSFHSI*EQELANLIQMIDSHKGSPIN*PHSTMLSSIFSITSRVAFGKK      10085

10086  CKEHKEFITLVK*GVAAA*GYLFSSSRWLQFAT

       GLRPKFERLHVQVDRILEKIIIQHKEAKSRVKEG*KNEDLVDILLRFLDGNDINKDVCLI          10364

10365  NDNIKAIIL (0)

 

>CYP71D121P CYP71D165P

Glyma06g36270.1:peptide      Gm06:38392060..38393115 (- strand)

NILPGPWKLPIIGNIPHLVTSAPHKKLRDLAKKYGPLMHLKLDAKEVMKIHDLKFSSRPQVLA

IAFAPYGNYWRRLRKTCTLDC

 

>CYP71D122 CYP71D166 Glyma07g20080.1 79% to 71D105 Gm07:20353915..20355733 (- strand)

bad boundary

20355761    MDSQILNSLALILPFLLFMILALKIGRNLKKTESTP

NIPPGPWKLPIIGNVPHLVTSAPHRKLKDLAK (2)

(1) VYGPLMHLQLGEVFTVIVSSAE

YAKEIMKTHDVIFATRPHILAADIFSYGSTNTIGAPYGNYWRQLRKICTVELLTQKRVNSFKPIREEELTNLIKMIDSHK

GSPINLTEEVLVSIYNIISRAAFGMKCKDQEEFISAVKEGVTVAGGFNVADLFPSAKWLQPVTGLRPKIERLHRQIDRIL

LDIINEHKDAKAKAKEDQGEAEEDLVDVLLKFPDGHDSKQDICLTINNIKAIILDIFGAGGETAATAINWAMAEMIRDPR

VLKKAQAEVRAVYNMKGMVDEIFIDELQYLKLVVKETLRLHPPVPLLVPRVCGESCGIGGYHIPVKSMVIVNAWAIGRDP

NYWTQPERFYPERFIDSSIEYKGTNFEYIPFGAGRRLCPGITFGLKNVELALAFLLFHFDWKLPNGMKNEDLDMTQQFGVTVRRKADLFLIPITSRPILVRKLP* 20353843

 

>CYP71D123P CYP71D168P chr7 ~70% to CYP71D169 Glyma07g20440.1

20725943    HLCIKD*NSSCRCIML*IHKF

PLMHRQLGWSSQLLFPHHDSIFASRTKILVVDVLCYESTSL

IFAPYGNYWR*LRKICTVELFTQRHVNSFKPI  20725785

REGELINLVKMIDSHKG

MKCKD*KEFISVVKEGLLVGVGFNIVDLY

20714290    DVFGVGGESSATTIIWTRVEMVKNPSVMKKAQLKVREVFDMKRMV

DEICMVELKYLES

HIPVKSKVIVNAWEIGRDPNY

SIDYKGTNFEHIPFGARRRKCPGSSCGLINVELALAFLFYHFDWKLP  20713913

NGMKSEGLNMTKQSGMAVRRNELYLI 20713836

 

>CYP71D124 CYP71D149 scaffold_34 1685-1687k+ 81% to 71D105

chr7 Glyma07g20430.1   Gm07:20663400..20666558 (- strand)

20666469 MDSEVHNMLAVIMSFSLFIIVALKIGRNLKKTESSPNIPPGPWKLPIIGNIHHLVTCTPHRK

LRDLAKTYGPLMHLQLGEVFTIIVSSPEYAKEIMKTHDVIFASRPKILASDILCYESTNI

VFSPYGNYWRQLRKICTVELLTQRRVNSFKQIREEEFTNLVKMIDSHKGSPINLTEAVFL

SIYSIISRAAFGTKCKDQEEFISVVKEAVTIGSGFNIGDLFPSAKWLQLVTGLRPKLERL

HGKTDRILKEIINEHREAKSKAKEDQGEAEEDLVDVLLKFQDGDDRNQDISLTINNIKAIIL (0)

DVFAAGGETSATTINWAMAEIIKDP

RVMKKAQVEVREIFNMKGRVDEICINELKYLKSVVKETLRLHPPAPLLIPRECGQTCEIN

GYHIPVKSKVFVNAWAIGRDPKYWTEPERFYPERFIDSSIDYKGNNFEFTPFGSGRRICP

GITLGSVNVELALAFLLYHFHWKLPNGMKSEELDMTEKFGASVRRKEDLYLIPVICHPLQ

VRKTITFEFVFTPLILKESLYLLLF* 20664210

 

>CYP71D125 CYP71D128

Gm0068x00077:peptide 65% to 71D158 scaffold_68:561198..562897 (+ strand)

Chr7 Glyma07g39710.1 Gm07:44107519..44109530 (- strand)

MSFKLYSFIHT

44109473 MELRPSFLVLTSFLLLLLWLARIYKQKIKVRSVVHKLPPGPWKLPLIGNLHQLAGAGTLPHHTLQNLSRKYGPLMHLQLG

EISAVVVSSSDMAKEIMKTHDLNFVQRPELLCPKIMAYDSTDIAFAPYGDYWRQMRKICTLELLSAKRVQSFSFIREEEV

AKLIQSIQLCACAGSPVNVSKSVFFLLSTLISRAAFGKKSEYEDKLLALLKKAVELTGGFDLADLFPSMKPIHLITRMKA

KLEDMQKELDKILENIINQHQSNHGKGEAEENLVDVLLRVQKSGSLEIQVTINNIKAVIWDIFGAGTDTSATVLEWAMSE

LMKNPRVMKKAQAEIREAFRGKKTIRESDVYELSYLKSVIKETMRLHPPVPLLLPRECREPCKIGGYEIPIKTKVIVNAW

ALGRDPKHWYDAEKFIPERFDGTSNDFKGSNFEYIPFGAGRRMCPGILLGIANVELPLVALLYHFDWELPNGMKPEDLDM

TEGFGAAVGRKNNLYLMPSPYDHSLNHFIVN* 44107774

 

>CYP71D126P CYP71D148P

scaffold_68 84% to 71D158

chr7 Glyma07g39700.1 Gm07:44104581..44105960 (- strand)

564678 44105993 MEAQFFLAVIKFFLSLLVLLLAKNYKQKGLHKLPPGPWKLPIIGNLLQVEAASSLPHRAFRELAQK

YGPLMHLQLGEISAVIVSSPL

IAMEIMKTHDLAFAQRPKFLASDIIGYGLVDIFAPYGDY*RQMKKICTLE

SATKVQSFSPNREEVAKL

ERIQSSAGAPINLTGMINSFISTFV

FGNITTENCEGFLSIVKETIEVADGFDLADMFPSFKPMHFITGLKAKLDKMHNKVDKILD

KIIKENQANKGMGEEKNENLVE

DIFAAGTDTSAKVIEWAMSEMMRNPGGREKAQAEIRQTF*GKEAISESNMGELNYLK

ETLRLHPPAPLLLPRECREACRIYGYDIPIKTKVIVNAWAIGRDPEH*HDAESFIPERFH

GASIDFKGTDFEYIPFGAGRRMCPGISFGMASVEFALAKLLYH

QGMKPEELDMEEAFGAEAGRKNNLHLIPIPYNPSIHHDNCK*GTFI* 44104450  566218

 

>CYP71D127 CYP71D119

Gm0352x00002:peptide         scaffold_352:15162..20294 (+ strand)

54% to 71D104

Glyma08g11570.1             Gm08:8425769..8430957 (- strand)

MELLIPFSLLFTFACILLALFNTLNRSNSKILPPGPWKLPLLGNIHQFFGPLPHQTLTNLANQHGPLMHLQLGEKPHIIV

SSADIAKEIMKTHDAIFANRPHLLASKSFAYDSSDIAFSSYGKAWRQLKKICISELLNAKHVQSLRHIREEEVSKLVSHV

YANEGSIINLTKEIESVTIAIIARAANGKICKDQEAFMSTMEQMLVLLGGFSIADFYPSIKVLPLLTGMKSKLERAQREN

DKILENMVKDHKENENKNGVTHEDFIDILLKTQKRDDLEIPLTHNNVKALIWDMFVGGTAAPAAVTVWAMSELIKNPKAM

EKAQTEVRKVFNVKGYVDETELGQCQYLNSIIKETMRLHPPEALLLPRENSEACVVNGYKIPAKSKVIINAWAIGRESKY

WNEAERFVPERFVDDSYDFSGTNFEYIPFGAGRRICPGAAFSMPYMLLSLANLLYHFDWKLPNGATIQELDMSESFGLTV

KRVHDLCLIPIPYHPTSKLGHL*

 

>CYP71D128P CYP71D123P

scaffold_146:154194..152668 (- strand)

Gm0146x00016:peptide, bad boundary at beginning of exon 2

Missing 22 aa, 83% to 71D10

Glyma08g19410.1 Gm08:14674572..14676375 (- strand)

MVMEVHDHTSYLIYFISSIIVFALFKLVQRSDSKTSSTCCKLPPGPRTLPLIGN

MHQFVGSLPVHHCLKNLADNYGPLMHLKLGEVSNIIVTSQEMAQEIMKTRDLNFSDRPNLVSSRIVSYNGSNIVFSQHGE

YWRQLRKICTVELLTAKRVQSFRSIREEEVAELVKKIAATASEAEGSNIFNLTENIYSVTFGIAARAAFGKKSRYQQVFI

SNIDKQLKLMGGFSVADLYPSSRVLQMMGASGKLEKVHKVTDRVLQDIIDEHKNRTRSSSNEECEAVEDLVDVLLKFQKE

SSEFPLTDENIKAVIQ (0)

XXXXXXXXXXXXXXXXXXXXXX

RNPMVMEQAQAEVRRVYDRKGHVDETELHQLVYLKSIIKETLRLHPPVPLLVPRVSRER

CQINGYEIPSKTRVIINAWAIGRNPKYWAEAESFKPERFLNSSIDFRGTDFEFIPFGAGRRICPGITFAIPNIELPLAQL

LYHFDWKLPNKMNIEELDMKESNGITLRRENDLCLIPIARQP*

 

>CYP71D129 CYP71D114 Gm0186x00073:peptide scaffold_186:572228..570106 (- strand)

missing intron and about 36 aa at lower case region QSEAEEDLVDVLIQYEDGSKKDFSLTRNKIKAIIL

This may be a pseudogene

Glyma08g43930.1:peptide Gm08:43701221..43703343 (- strand)

MALLFLYFSALISFIFLTLIVQKIGRKPKKTDDTTFKIPDGPRKLPIIGNIYNLLSSQPHRKLRDMALKYGPLMYLQLGE

VSTIVISSPECAKEVMKTHDINFATRPKVLAIDIMSYNSTNIAFAPYGNYWRQLRKICTLELLSLKRVNSYQPIREEELS

NLVKWIDSHKGSSINLTQAVLSSIYTIASRAAFGKKCKDQEKFISVVKKTSKLAAGFGIEDLFPSVTWLQHVTGVRPKIE

RLHQQADQIMENIINEHKEAKSKAKEKD

fphnsssmqa

DIFGAGGETSATTIDWAMAEMVKNSGVMKKAQAEVREVFNMK

GRVDENCINELKYLKQVVKETLRLHPPIPLLLPRECGHTCEIQGYKIPAKSKVVINAWAIGRDPNYWTEPERFYPERFID

STIEYKGNDFEYIPFGAGRRICPGSTFASRIIELALAMLLYHFDWKLPSGIICEELDMSEEFGVAVRRKDDLFLVPFPYH

PLPFILTSQ*

 

>CYP71D130P CYP71D151P scaffold_186 chr8 43669-43671k-

539917 43671032 DLDKLTYLKCVIKETIKLHPPTPLLLPRESKEKCQINGYEILARTRVFINAW

        AIGRDPKYWINAETFKPERFLDSSIDYKGTNFEFIPFGAGRR

539635  PGIAFAIADIELPLAHLLYHFDWKLPNGIKLEELDMSESFGLSARRKN 43670607 539492

 

>CYP71D131P CYP71D150P scaffold_186

chr8 no gene model

538642  KYGPLIHLK & 538616

538616 43669731     NLKMGKLTNVVVSSHEVCREIIKAQDAIFLSKPFLLSAT 43669624 538500

537281 43668396     LVYHDASNITYSPYGSYWRQLRKICTRRL     43668310 537195

 

>CYP71D132P CYP71D130P  78% to 71D129

scaffold_5 7776-7777k+

chr9 no gene model

30596592 KIPKKTDDKTCKIPDCHPTNAPII

GNIYNVLSFQPHIKLKGMTLKYGP

LGELSTIMISYPESAKEVMKTHDINFATRPKVLAIDIMSYNSTNIAFDP*GNYWRQLRKF

FMLELLNLKCVKSY*PIREEEVSNVLKLINSHKGASLNLTQPVLSSIYTIASRASFGNKS

KDQQKFISVVKKISKLVVG

FGIEDLFPSAT*LQHVTGVRPMIDRLHQQVDQIMENIIN 30595899

 

>CYP71D133 CYP71D121 68% to 71D160

Gm0075x10021:peptide scaffold_75:622918..620679 (- strand)

Glyma09g41570.1 Gm09:46236154..46238801 (+ strand)

N-term in model is wrong

MTNIVAIISFSLILIVVL

MKIVRNHKKTKPTPNVPPGPWKLPVIGNVHQIITSAPHRKLRDLAKIYGPLMHLQLGEVTTIIVSSPECAKEIMKTHDVI

FASRPRGVVTNILSYESTGVASAPFGNYWRVLRKMCTIELLSQKRVDSFQPIREEELTTLIKMFDSQKGSPINLTQVVLS

SIYSIISRAAFGKKCKGQEEFISLVKEGLTILGDFFPSSRWLLLVTDLRPQLDRLHAQVDQILENIIIEHKEAKSKVREG

QDEEKEDLVDILLKLQDGDDSNKDFFLTNDNIKATILEIFSAGGEPSAITIDWAMSEMARDPRVMKKAQDEVRMVFNMKG

RVDETCINELKYLKSVVKETLRLHPPGPLLLPRESTQECKIHGYDIPIKSKVIVNAWAIGRDPNYWNEPERFYPERFIDS

SIDYKGNNFEYIPFGAGRRICPGSTFGLVNVEMALALFLYHFDWKLPNGIQNEDLDMTEEFK
VTIRRKNDLCLIPVSPPCSVVAMYSS*

 

>CYP71D134 CYP71D136 1 aa diff to 71D143, 71D144, 71D138, 71D135, 71D147, 71D138

Glyma10g12700.1 Gm10:14237260..14239000 (- strand)

14239000        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMH

LQLGEISAVVASSPKMAKEIVKTHDVSFLQ     14238701

14238700        RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPIN

LTSRIFSLICASISRVAFGGIYKEQDEFVV     14238401

14238400        SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQ

DFIDLLLRIQQDDTLDIQMTTNNIKALIL  (0) 14238104

14237868        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLL

PRECSQPTIIDGYEIPAKTKVMVNAY           14237569

14237568        AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMK

PEEMNMDEHFGLAIGRKNELHLIPNVNL*      14237260

 

>CYP71D135 CYP71D137 CYP71D143 100%

Glyma10g12710.1 Gm10:14264023..14265779 (+ strand)

14264023        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYG

PLMHLQLGEISAVIASSPKMAKEIVKTHDVSFLQ         14264322

14264323        RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINL

TSRIFSLICASISRVAFGGIYKEQDEFVV      14264622

14264623        SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQD

FIDLLLRIQQDDTLDIQMTTNNIKALIL  (0) 14264919

14265171        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPL

LLPRECSQPTIIDGYEIPAKTKVMVNAY        14265470

14265471        AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMK

PEEMNMDEHFGLAIGRKNELHLIPNVNL*      14265779

 

>CYP71D136P CYP71D138P CYP71D137P 100%

Glyma10g12780.1              Gm10:14303796..14304902 (+ strand)

14303723        SRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREH

QEKNKIAKEDGAELEDQDFIDLLLRIQQDD     14304022

14304023        TLDIQMTTNNIKALIL  (0) 14304070

14304306        DIFAAGTDTSASTLEWAMAEMMRNPRVWEKAQAELRQAFREKEIIHESDLEQLTYLK

LVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY      14304605

14304606        AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPL

ALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*  14304914

 

>CYP71D137 CYP71D139 CYP71D134 100%

Glyma10g12790.1 Gm10:14378072..14381581 (- strand)

14381581        MEAQTYFLVIALFFLLHLLAKYYKLKTNVSHTLPPGPKKLPIIGNLHQLAAAGSLPHHALKKLSKKYGP

LMHLQLGEISAVVASSPKMAKEIVKTHDVSF  14381282

14381281        LQRPYFVAGEIMTYGGLGIAFAQYGDHWRQMRKICVTEVLSVKRVQSFASIREDEAAKFINSIRESAGSTINL

TSRIFSLICASISRVAFGGIYKEQDEF          14380982

14380981        VVSLIRRIVEIGGGFDLADLFPSIPFLYFITGKMAKLKKLHKQVDKLLETIVKEHQEKHKRAKEDGAEIED

EDYIDVLLRIQQQSDTLNINMTTNNIKALIL  (0) 14380676

14378692        DIFAAGTDTSASTLEWAMTEVMRNPRVREKAQAELRQAFRGKEIIHESDLEQLTYLK

LVIKETFRVHPPTPLLLPRECSQLTIIDGYEIPAKTKVMVNVY      14378393

14378392        AVCKDPKYWVDAEMFVPERFEASSIDFKGNNFEYLPFGGGRRICPGMTFGLATIMLPLA

LLLYHFNWELPNKIKPENMDMAEQFGVAIGRKNELHLIPSVN       14378090

 

>CYP71D138 CYP71D146 4AA diffs to 71D140P

Glyma10g22120.1 Gm10:28105591..28107347 (- strand)

28107347        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYG

PLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ         28107048

28107047        RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAG

SPINLTSRIFSLICASISRVAFGGIYKEQDEFVV         28106748

28106747        SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNQIAKEDGAEL

EDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL            28106451

28106199        DIFAAGTDTSASTLEWAMAETTRNPTVREKAQAELRQAF*EKEIIHESDLEQLTYLKLVIKETFRVHPPT

PLLLPRECSQPTIIDGYEIPAKTKVMVNAY     28105900

28105899        AICKDSQYWIDADRFVPERFEVSSIDFKGNNFNYLLFGGGRRICPGMTFGLASIMLPLALLLYHFNWELPN

KMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*            28105591

 

>CYP71D139P CYP71D145P = CYP143P 100%

Glyma10g22100.1              Gm10:28092932..28094478 (- strand)

28094481        QYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQM

RKMCATELLSTKRVQSFASIREDEAAKFIDSIRE         28094182

28094181        SAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFL

TGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKED        28093882

28093881        GAELEDQDFIDLLRIQQDDTLDIQMTTNNIKALIL  (0) 28093777

28093525        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDQEQLTYLKLVIKETFKVHPPTPL

LLPRECSQPTIIDGYEIPAKTKVMVNAY        28093226

28093225        AICKDSQYWIDADRFVPERFEGSSIDFKGNKFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNK

MKPEEMNMDEHFGLAIGRKNELHLIPNVNL*  28092917

 

>CYP71D140P CYP71D144P pseudogene, second exon 100% to CYP71D138, 71D137,

71D140, 71D143, 71D147

Glyma10g22090.1              Gm10:28074448..28077153 (- strand)

28077153        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRD

LAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ          28076854

28076853        RPHLVFGQMISYGGLGIAFAPYGDHWRQTRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSP

INLTSRIFSLICASISRVAF  (insertion) 28076590

28075623        GGIYKDQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEK

NKIAKEDGAELEDQDFIDLLRIQQDDTL (small deletion of 14 aa)          28075334

28075056        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHP

PTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY            28074757

28074756        AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELP

NKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*           28074448

 

>CYP71D141 CYP71D143 1 aa diff to 71D138, 71D137, 71D144, 71D143, 71D138, 71D147

Glyma10g22080.1              Gm10:28059981..28061618 (- strand)

28061705        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHAL

RDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ       28061406

28061405        RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFID

SIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVV         28061106

28061105        SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIA

KEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL  (0) 28060809

28060577        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHP

PTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY            28060278

28060277        AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWE

LPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*        28059969

 

>CYP71D142 71D135, 71D138, 71D137, 100% match

Glyma10g22070.1 Gm10:28042082..28043822 (- strand)

28043822        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHAL

RDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ       28043523

28043522        RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFID

SIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVV         28043223

28043222        SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVNKVLENIIREHQEKNKIA

KEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL (0) 28042926

28042690        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRV

HPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY         28042391

28042390        AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNW

ELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*      28042082

 

>CYP71D143 CYP71D141 1 aa diff to CYP71D143, 71D144, 71D138, 71D137, 71D147, 71D138

Glyma10g22060.1              Gm10:28024992..28026748 (- strand)

28026748        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHA

LRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ      28026449

28026448        RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKF

IDSIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVV      28026149

28026148        SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKN

KIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL  (0) 28025852

28025600        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRV

HPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY         28025301

28025300        AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFN

WELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*    28024992

 

>CYP71D144 CYP71D140 CYP71D142 100%

Glyma10g22000.1              Gm10:27986164..27987920 (- strand)

27987920        MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSL

PHHALRDLAKKYGPLMHLQLGEISAVIASSPKMAKEIVKTHDVSFLQ          27987621

27987620        RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFID

SIRESAGSPINLTSRIFSLICASISRVSFGGIYKEQDEFVV         27987321

27987320        SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKI

AKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL  (0) 27987024

27986772        DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFR

VHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY        27986473

27986472        AICKDSQYWIDADRFVPERFQGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWE

LPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*  27986164

 

>CYP71D145 CYP71D111 87% to 71D8

Gm0053x00465:peptide scaffold_53:3861779..3858316 (- strand)

Chr11 4720-4723k+ Glyma11g06660.1 Gm11:4720371..4723868 (+ strand)

3861756 4720410 MEHSQLSIVITFFVFLLLLRLVKNHKPKSSHKLPPGPWKLPIIGNLHQVALAASLPHHALQKLARKYGPLMHLQLGEIST

LVVSSPKMAMEIMKTHDLAFVQRPQLLAPQYMAYGATDIAFAPYGEYWRQMRKICTLELLSAKRVQSFSHIRQDENRKLI

QSIQSSAGSPIDLSSKLFSLLGTTVSRAAFGNKNDDQDEFMSLVRKAVAMTGGFELDDMFPSLKPLHLLTGQKAKVEEIH

KRADRILEDILRKHVEKRTRAKEEGNNSEAQQEDLVDVLLRIQQSGSLEVQMTTGHVKAVIW (0)

DIFAAGTDTSASTLEWAM

AEMMKNPRVREKAQAVIRQAFKGKETIRETDLEELSYLKSVIKETLRLHPPSQLIPRECIKSTNIDGYEIPIKSKVMINT

WAIGRDPQYWSDAERFIPERFDGSYIDFKGNSYEYIPFGAGRRMCPGMTFGLASITLPLALLLYHFNWELPNKMKPEDLD

MNEHFGMTVGRKNKLCLIPTVYQAT* 4723689

 

>CYP71D146P CYP71D112P

Gm0053x00463:peptide           scaffold_53:3841398..3839144 (- strand)

Revised 70% to 71D96

Glyma11g06700.1

3841413 4740843 QSKDRMSYVLGQKVIGSVGGFELADLFPSMKFIHFITGTKAKLEKLLNRVDRVLENIVREHTRK &

RQIRAKEGRILVDEDEDLVDVLIRVQQADTLDIKMTTRHVKALIL (0)

DVFAGGIDTSASTLEWAMTEM

MKNPRVREKAQAELRQAFREKKIIHESDIEQLTYLKLVIKETLRLHPPTPLLIPRECSEE

TIIAGYEIPVKTKVMINVWAICRDPKYWTDAERFVPERFEDSSIDFKGNNFEYLPFGAGR

RICPGISFGLASIMLPLAQLLLYFNWELPNGMKPESIDMTERFGLAIGRKNDLCLIPFIY

DP* 4742981 3839275

 

>CYP71D147P CYP71D113P 80% TO 71D96

SCAFFOLD_53

Chr11 Glyma11g06710.1

3831459 4750796 NTVEGAQASFLLITLFFFLVLYWLATYFY*KPKTTITYKLPPGPKKLPLIGNLHQLAIAG

SLPYLALRDLALKYGPLMHLQLGEISILVVSSPNMAKEIMKTHDLAFVQRPQFLPAQILT

YGQNDIVFALYGDYWRQMKKM &

CVSELLSAKRVQSFSHIREDET

KTRQMQQQVDKIAYNILQEHQEKRDRALQESRVDLEEEDLVDVLLRIQQSDTIKIKITTT

NINAVTL (0)

VFTAGMDTSATTLEWAMAEIMRNPIVRKKAQTEVRQALGELKIIHETDVEELTYLKLVIK

ETLGLRTPSLLLLPRECSERTIIDGYEIPIKTKVMVNVWAIARDPQYWTDAERFVLERFD

DSFIDFKGNNFEYLSFEARRRMCPDMTFGLVNIMLPLYHFNWELPNELKPEDMDMS

ENFGLTIYIGRKSQLCLM 4752389 3829866

 

>CYP71D148P CYP71D127P 73% to CYP71D165P

scaffold_28 4156k-

chr12  Glyma12g21000.1 (-) strand

22688502 FNSNIPPGPWKLPIIGNIPHLVTSNPHRKLRDLDKKYGPLMHLRLDAKEHTK 22688657

 

>CYP71D149P CYP71D132P

1089-1092k+

Gm0090x00153:peptide scaffold_90:1090059..1091456 (+ strand)

Chr14 Glyma14g01870.1

1079117 MELHISLSTILPFLILVFMLIINLVWRSKIKNSNSKLPPGPRKLPLIGNIHQLGNLPHRS

RARLANQYSPL

MHMQLGELCCIMVSSPE

MAKEVMNTHDIIFSNRPYVLAADVITYGSKGMTFSPQGTYWRQMRKI

CTMELLAPKHVDSFRSIREQELTIFVKEISLSE

GSPINHSEKISSLAYVLISRIAFGIKSKDQQAYREFMKGV 

TDTGAGFSLADLYPSIGLLHVLTGIRTSVEKIHR

GMDRILENIVRDHREKNLDTKAVGEEN

GEDLVDVLLRLQRNGDHQHPMSDCCQSN (0)

DIFSAGSDTSSTIMIWVMSELVKNP

RVMEKVQIEVRRVFDRKGYVDETSIQE

VKYLRSVI*ETLRLHPPLPLLLPRECSERCEINGYEIPTKSKVIVNAWAMGRDPNYWIEAEKFNPERF

LDSSIDYKGAEFEFIPFGAGRRTFPGINLGIV

ANFLFHFDWKMAQGNSPQELDMTESFGLTVKRKQDLQLIPITYHSATS* 1081198

 

>CYP71D150 CYP71D133 70% to CYP71D104

Gm0090x00154:peptide         scaffold_90:1094732..1096763 (+ strand)

Chr14 Glyma14g01880.1

1083908 MGLELHISLSIILPFFLLVFILIITLWRSKTKNSNSKLPPGPRKLPLIG

SIHHLGTLPHRSLARLASQYGSLMHMQLGELYCIVVSSPEMAKEVMNTHDIIFANRPYVLAADVITYGSKGMTFSPQGTY

LRQMRKICTMELLAQKRVQSFRSIREQELSIFVKEISLSEGSPINISEKINSLAYGLLSRIAFGKKSKDQQAYIEHMKDV

IETVTGFSLADLYPSIGLLQVLTGIRTRVEKIHRGMDRILENIVRDHREKTLDTKAVGEDKGEDLVDVLLRLQKNGDLQH

PLSDTVVKATILDIFSAGSDTSSTIMVWVMSELVKNPRVMEKVQIEVRRVFDGKGYVDETSIHELKYLRSVIKETLRLHP

PSPFLLPRECSERCEINGYEIPTKSKVIVNAWAIGRDPNYWVEAEKFSPERFLDSPIDYKGGDFEFIPFGAGRRICPGIN

LGIVNVEFSLANLLFHFDWRMAQGNRPEELDMTESFGLSVKRKQDLQLIPITYHTARS* 1086121

 

>CYP71D151P CYP71D134P 77% to CYP71D152P

scaffold_90

chr14 no gene model

1115676 1104987    ELWKDGGETLSSTVEWSMSEMVRNP             1115750

1115753         KVMEKAQAELRKAFDIKGYVDEVD*LQLIYFK 1105159       1115848

 

>CYP71D152P CYP71D125P

scaffold_74 85-86k+ no gene model

chr14 8352k+ no gene model

8352186 NMGRTLEDLWKDGGETSSIEEWSMSQMVRKP

KVMEKAQAEVRKEFDNKGYVDEADLHQLIYLKCTIRDAMRLHSPVP 8352418

 

>CYP71D153P CYP71D126P

scaffold_74 2712-2713k- no gene model 80% to 71D129

chr14 10978k- Glyma14g12240.1

10978782 EKDFPHNFSSMQV

DIFATGGDTSTTTIDWEMA*MVKISRVMKNTQAEVKEVFNMKGRVDQNCINELN*YLKQV

VRETLRLHPPIPLLVPTECGQTCDIQGYKIRAKSKVVINTWAIGRNPNYWTKPYRFYPER

FIDSTIK 10978292

 

>CYP71D154P CYP71D169P chr14 Glyma14g14510.1 67% to 71D160

14771370 MDSQMLNSLALIVPFFLFMIVVLKLGRNLKKKTQSYLNITQR

14771244  PCKLPVIGNIHQVVTSTPHQKLRDLAKIYGPMMYLQLEEIFTIIVSL

VEYAK*IMKTHDVNLASRPKILAADMVSYEGTNIAFSPYGNY*K*VQKLCTME  14770945

14770944    LRSSQ  14770930

LEKKSSPIREEELANLVKMVGSHEGTVNVS 14770862

 

>CYP71D155 CYP71D131 80% to 71D160

Gm0014x10128:peptide scaffold_14:8981884..8984374 (+ strand)

Chr14 Glyma14g14520.1 model has wrong C-term

14831928    MDSQILNSLALILPLFLFMILILKLGRKLKRTELSLNIPRGPWKL

PIIGNLHQLVTSTPHRKLRDLAKIYGP

MMHLQLGEIFTIVVSSAEYAEEILKTHDVNFASRPKFLVSEITTYEHTSIAFAPYGEYWRQVRKICAMELLSPKRVNSFR

SIREEEFTNLVKMVGSHEGSPINLTEAVHSSVCNIISRAAFGMKCKDKEEFISIIKEGVKVAAGFNIGDLFPSAKWLQHV

TGLRSKLEKLFGQIDRILGDIINEHKEAKSKAKEGNGKAEEDLLAVLLKYEEGNASNQGFSLTINNIKA

VTSDIFAGGID

AVATAINWAMAEMIRDPRVMKKAQIEVREIFNMKGRVDESCMDELKYLKSVVKETLRLHPPAPLILPRECAQACEINGFH

IPVKTKVFINVWAIARDPNYWSEPERFYPERFIDSSIDFKGCNFEYIPFGAGRRICPGSTFGLASVELILAFLLYHFDWK

LPNGMKNEDFDMTEEFGVTVARKDDIYLIPVTYNPFLVR* 14829625

 

>CYP71D156P CYP71D124P

scaffold_156 564-565k- no gene model, 57% to 71D163P

chr14 (-) strand NO GENE MODEL

47494855 NMFGAGTDTSSAVIEWAMSEMMENPRVMTKAQDEVTINFASNNEI 47494721

 

>CYP71D157P CYP71D170P chr15 no gene model 82% to 71D129

47190069 PKKIDDKTCKIPDGHLTNLPIIRKYIQLSDMALKYGP*LIL

47189945    LGELSTIVISYLESAKEVMKTHDINFATRPKVLAIDIMYYNS

TNIVFDP*GNYWR*IRKICTLELLSLKRVKSY*PIKEEEL  47189700

SNVVKLIDSHKGPSFKLT*PVLSSIYTIASRAGFGNKC

KDQQKFISVVKKISKLAASFGIEDLFPSATWL*HVTGVRPMIYRLHQQVDQIMENIIN

DKDFPHNSSSIQAILF 47189275

 

>CYP71D158 CYP71D106 Gm0025x10028:peptide scaffold_25:621105..622673 (+ strand)

62% to 71D8

Glyma17g01110.1             Gm17:622315..624042 (+ strand)

621105  MAVLSSLAVITFFLSLLVLFLAK

NYKQKSLHKLPPGPWKLPIIGNLLQLAAASSLPHHAIRELAKKYGPLMHLQLGEISAVIVSSPNMAKEIMKTHDLAFAQR

PKFLASDIMGYGSVDIAFAPYGDYWRQMRKICTLELLSAKKVQSFSNIREQEIAKLIEKIQSSAGAPINLTSMINSFIST

FVSRTTFGNITDDHEEFLLITREAIEVADGFDLADMFPSFKPMHLITGLKAKMDKMHKKVDKILDKIIKENQANKGMGEE

KNENLVEVLLRVQHSGNLDTPITTNNIKAVIW (0)

DIFAAGTDTSAKVIDWAMSEMMRNPRVREKAQAEMRGKETIHESNLGE

LSYLKAVIKETMRLHPPLPLLLPRECIEACRIDGYDLPTKTKVIVNAWAIGRDPENWHDADSFIPERFHGASIDFKGIDF

EYIPFGAGRRMCPGISFGIANVEFALAKLLYHFNWELQQGTKPEEFDMDESFGAV

VGRKNNLHLIPIPYDPSIHDNGKGGTFI* 622763

 

>CYP71D159P CYP71D147P CHR17 72% to 71D126 20410k+

YKIRAKSKVIINAWEIGRDPKYWIEPDRFYSKRLIDSTIK

 

>CYP71D160 CYP71D115

Gm0019x10048:peptide 75% to 71D105          scaffold_19:8403106..8405210 (+ strand)

Glyma17g31560.1 Gm17:34696333..34698561 (+ strand)

8403052 MDSQILNSLALILPFFLF

MIVVLKLGRKLKKTEPSLNIPPGPWKLPIVGNLHQLVTSSPHKKFRDLAKIYGPMMHLQLGEIFTIVVSSAEYAKEILKT

HDVIFASRPHFLVSEIMSYESTNIAFSPYGNYWRQVRKICTLELLSQKRVNSFQPIREEELTNLVKMIGSQEGSSINLTE

AVHSSMYHIITRAAFGIRCKDQDEFISAIKQAVLVAAGFNIGDLFPSAKWLQLVTGLRPTLEALFQRTDQILEDIINEHR

EAKSKAKEGHGEAEEEGLLDVLLKFEDGNDSNQSICLTINNIKAVIADIFGGGVEPIATTINWAMAEMIRNPRVMKTAQV

EVREVFNIKGRVDETCINELKYLKSVVKETLRLHPPAPLILPRECQETCKINGYDIPVKTKVFINAWAIGRDPNYWSEPE

RFYPERFIDSSVDYKGGNFEYIPFGAGRRICPGITFGLVNVELTLAFLLYHLDWKLPNGMKNEDFDMTEKFGVTVARKDD

IYLIPATSRPFLVRFCY*   8405288

 

>CYP71D161P CYP71D116P

scaffold_19 chr17 34788k+ no gene model

8494773 34788219 MDSQILNSLALIIFPFFLSMIVVLKLGRKLMKTKPSLNIPPGPWNLPIIGNVHLLVTSTPH*KLTDL  8494973

8494978  MNLTDTVHSSMYNIISRAAFGMKYKDREKFISMVKEGVKTASDFN 34788558 8495112

 

>CYP71D162P CYP71D107P exon 2 only

80% to 71D99

Gm0004x00148:peptide scaffold_4:1770183..1771255 (+ strand)

Glyma18g08920.1              Gm18:7651666..7652442 (+ strand)

1770722 DIFGAGGETSATTIDWAMAEMMKNPKVMKKAEAEVREVFNMKVRVDENCINEIKYLKLVVKETL

RLLPPIPLLLPRECGQTCEIHGYLIPAKSKVIVNAWAIGRDPNYWTEPERIYPERFIDSTIDYKQSNFEYIPFGVGRRIC

PGSTFASRIIELALAKLLYHFDWKFPNGMISEE*DMSEEFGVAVRRKDDLFMVPFPVT* 1771330

 

>CYP71D163P CYP71D108P

60% to 71D10

pseudogene

Gm0004x00152:peptide scaffold_4:1825111..1828599 (+ strand)

Glyma18g08960.1 Gm18:7705852..7709650 (+ strand)

1824801 MDKVFILHVFLTFLLLLVLYKTMKISKSKSSTTNLLPPWPW ()

KLPLIGNLHQLFGSTLPHHVLRNLATKYGPLMHLKLGEVSNIIVSSPEMAKE

IMKTHDIIFSNRPQILVAKVAYNAKDIAFSPCGSYWRQLRKMCKEELLASKRVQCFRSIR

EEEVSALIKTISQSVGFVVNLSEKIYSLTYGITARAALGEKCIHQQEFICIIEEAVHLSG

GLCLADLYPSITWLQMFSVVKAKSEKLFRKIDGILDNIIEDHKNRRRLGQLFDTDQKDLV

DVLLGFQQPNKDIPLDPPLTDDNVKAVIL (0)

DVFSAGTET

SSAVVEWAMSEMVKNPKVMKKAQAEVRRVYNSKGHVDETDLDQLTYFRNNE &

ETMRLHPPAPMLLPRESKEKCEINGYEIPARTRVL &

INAWAIGREPKYWTNAETFK*ETFKSERFLDSSIEYKGTNFEFIPFGAGRRVCPGIAFA

IADIELPLAQLLYHFDWKLPNGSKLEEFDMRESFGLTARRKNGLCLIPIIYHQLNK* 1828599

 

>CYP71D164P CYP71D109P 10640127 74% to 71D99, scaffold 4

chr18 no gene model 16826k-

10640121 16826720 AKEVMKTHDINFATRPKILPIDIMSYNSINVAFDP*GNYWRQLRKICTLEVLSLKCVNSY*PIREEEQSNLVKVIN*

HKGPSFNLTQPVLTSIYTIASRAAFGNKCKDQEKFISVVKKI*KLAKGF 809 & 16826343

16826341    VIEDLFPSATWLQHATGVRPMIHRLHQQ 16826258 10639659

 

>CYP71D165P CYP71D110P 17904788 three pieces 65%-80% to 71D105, 71D100

scaffold 4

chr18 no gene model 24121k-

24121158    SNIPPGPWNLLIIGNIPHLLTSTPHQKL*DLAKKYGPFMHLKL

17904686 IPPGPWNLLIIGNIPHLLTSTPHQKL*DLAKKYGPFMHLKLD

AKEVMKMHDLKFSSRPQVLA  17904501

17904472  IAFAPYGNYWRQLRKTCTLDC  17904410

 

>CYP71D166P CYP71D120P 77% to 71D99

Gm0264x00007:peptide

Glyma18g38290.1 Gm18:45714880..45715314 (+ strand)

KLPIIGNIYDLLSSQPYRKFKRHDLKYGPLMHLQLGEVSTIVISS

PEYAKEVMKIDDINFATRPKVLAIDIMSYN

STSIAFAPYGNYWRQLRKICTLELLSLKRVNS*QPIREEDLSNLVKWIDSHKGSSINLTQ

EVLSSIYTIASRAAFGKKCKDQENFISVVKKTIKVA

DLFPYATWLQHFIGVTPKIERLHQQADQIME

NIIN*NKEGKSKDKGNQSEAEKDLVDVLTQYEDGSKPNFSLT*NNIKAIIL

 

>CYP71D167P CYP71D152P

chr20 (-) strand Glyma20g00940.1 Gm20:659638..661331 (- strand)

661348  LTHVFRVIVSSAEYTKEIMKTHDVT

661273  FASRPLILAADILSYGSTNIIGSPYGNYWRLLRKICTVELLT*KRINSFKPIREEELTNLIKMIDSHK  661070

661074  INLTEAVLLSIYNIISRAAFGMTCKDQEEFISAVKEGVTVAGGFNLGNLFPSAKW

        LQLVTGLRPKIERLHRQIDRILLDIINEHREAKAKAKEGQQGEAE  660775

660774  EDLVDVLLKFQDGNDSKLDICLTINNIKAML  (0) 660682

660186  DIFGAGGETAATAINWAMAKMIRDPRVLKKAQAEVREVYNMKGK

        VDEICIDELKYLKLVVKETLRLHPPAPLLLPRACEIDGYHISVKSMVI  659911

659910  VNAWAIGRDPKYWSEAERFYPERFIDSSIDYK

        GGNFEYIPFGAGRRICPGSTFGLKNVELALAFLLFHFDWKLPNGMKNEDLDMTEQSGVTVTRKADL  659617

        FLIHITFRPIMVRK  659575

 

>CYP71D168P CYP71D153P

chr20 (+) strand Glyma20g00960.1      Gm20:670200..671805 (+ strand)

670059  MDFQIFDMLAPISLFLFMIVALKLGRNLTKTKSIPTYPLAHGSYLT*ETYPIL

        LHLLHIEN*ET*PKNMDP*CI*NLGTSTIVVSSAEYAKE  670334

670335  VMKIHDL*FSSRPQVLAGKIIGYDKKTIAFAPYGNYWRQLRKNCTLELFTIKRINSFRPIREEEFNILIKRIA

        SANGSTCNLTMAVLSLSYGIISRAAFL  670634

670635  QRPREFILLTEQVVKTSGGFNIGEFFPSAPWIQIVAGF

        KPELERLFIRNDQILQDIINEHKDHAKPKGKEGQGEVAEDMVDVLLKFQ  670895

        DMGGENQDASLTDDNIKAVI (0) 670955

671222  KMFASGGETSANSINWTMAELMRNPRVMKKA

        QAEVREVFNMKGRVDETCINQMKYLKAVAKETMRLHPPVPLLFPRECGEACEIDGYHHIPVK  671500

671501  SKVIVSAWAIGRDPKYWSEAERLYLERFFASSIDYKGT

        SFEFISFGAGRRICPGGSFGLVNVEVALAFLLYHFDWKLPNRMKTEDLDMTEQFGLTV  671788

671789  KRKKDLYLIPSLAT* 671833

 

>CYP71D169 CYP71D154

chr20 (+) strand Glyma20g00970.1:peptide Gm20:674339..676944 (+ strand)

674303  MDSELLSILPPIMSFFLFMIVALKIGSNLKKTESSPNI

674417  PPGPWKLPIIGNIHHLVTSAPHRKLRDLAKMYGPLMHLQLGEVFTIIVS

        SPEYAKEIMKTHDVIFASRPKILASDILCYESTNIVFSPYGNYWRQLR  674707

674708  KICTLELFTQKRVNSFQPTREKELTNLVKMVDSHKGSPMNFTEAVLLSIY

        NIISRAAFGMECKDQEEFISVVKEAVTIGSGFNIGDLFPSAKWLQL  674995

674996  VTGLRPKLERLHRQIDRILEGIINEHKQANSKGYSEAKEDLVDVLLKFQ  675142

        DGNDSNQDICLSINNIKAIIL  (0) 675205

675705  DIFSAGGDTAASTINWAMAEMIRDSRVMEKVQIEVREVFNMKGR

        VDEICIDELKYLKSVVKETLRLHPPAPLLLPRECGQACEINGYH  675968

675969  IPVKSKVIVNAWAIGRDPKYWSEAERFYPERFIDSSIDYKGT

        NFEYIPFGAGRRICPGSTFGLINVEVALAFLLYHFDWKLPNGMKSEDLDMTEQF  676256

676257  GVTVRRKNDLYLIPVPSNPFQVR* 676328

 

>CYP71D170 CYP71D155

chr20 (+) strand Glyma20g00990.1      Gm20:682359..683943 (+ strand)

682096  MDSEVLNILALVVPFFLFMILALKIARNHTITESSPKVPPGPWKLPIIGNIHHLITSTPHRKLR  682287

682286  DLAKIYGPLMHLQLGEVFTIIVSSAEYAKEIMKTHDLIFASRPHTL

        VADILAYESTSIITAPYGRYWRQLLKICTVELFTQKRVNSFT  682549

        KFARWSFSPKNVSIHSHKGLSINLAEIVVLSIYNIISRAAFGMKSQNQEE

        FISAVKELVTVAAGFNIGDLFPSVKWL  682730

682731  QRVTGLRPKLVRLHLKMDPLLGNIISEHKEAKSK       682832

682832  AIEGKDETEEDLVDVLLKFLDVNDSNQDICLTINNMKAIIL  (0) 682954

683323  DIFAAGGETATTTINWVMAEIIRDPRVMKKAQVEVREVFNTKG

        RVDEICINELKYLKSVVKETLRLHPPAPLLLPRECGQTCEIDGYHIP  683592

683593  VKSKVIVNAWAIGRDPKYWSEAERFYPERFIDSSIDYKGTN

        FEYIPFVAGRRICPGSTFGLINVELALAFLLYHFDWKLPNEMKSEDLDMTEEFGL      683880

683881  TVTRKEDIYLIPVTSRPFS  683937

 

>CYP71D171P CYP71D156P

chr20 (+) strand 87% to 71D124 Glyma20g01000.1     Gm20:685380..686860 (+ strand)

685359  MDSEVLKMLAVIMSFSLFIFVALKIGSNLKKTDSSPKI

685473  PPGPWKIPIIGNIDHFVTSTPHRKLRDLAKIYGPLMHLQLGEIF

        TIIVLSPEYAKEIIKTHDVIFASRTKILLADIICYESTSIIFAPYGNYWRQ  685757

685758  LQKICTVELLTQRRVNSFKQIREEELTNLVKMIDSHKGS 685874

        PMNFTEAVF*LINNIISRAAFGMKCKDQ  685958

685959  EEFISVVKEAVTIGSGFNIGDLFPSAKWLKLVTGLRPKLERLHWQIDWILEDIINEHKEAKSKAKK

        AKVQQRKIWLMFS*NFRMTTIEISLTINNIEAIIL  686261

 

>CYP71D172P CYP71D135P

scaffold_1 12462k+

chr20 (-) strand Glyma20g16450.1 Gm20:22853217..22853634 (- strand)

NDTSSATITWTMAEMIKNPRIMEKAQAEVRLYFGNEGKPNKSGREYGQACEI

22853510 NRYHIPMKSRVIVNA*GIGRDPNLWTEAERFIESSVDYKGNNFQFI

PFGAGRRMCPGLTFGLSNVECVLAMLMYHFDWKLPNGMKHEDLDMTEIFGITVTRKDNL 22853196

YLIPKTFH

 

>CYP71D173P CYP71D171P chr20 no gene model 90% to 71D164P

30778178  AKEVMKTHDINFATRPKVLTIDIMSYNSTNVTFDP*GNYWRQLRK

ICTLEFLSLKHVNSY*PIREEELSNLVKVID*HKG  30777949

FVIEDLLPSATWLQHATRVKPMIHRLH*Q 30777717

KDLPHNSSSI

 

>CYP71D136X discontinued seq

Gm0020x00168:peptide 69% to 71D96           scaffold_20:7260028..7263537 (+ strand)

Chr10 Glyma10g12790.1

14381581 MEAQTYFLVIALFFLLHLLAKYYKLKTNVSHTLPPGPKKLPIIGNLHQLAAAGSLPHHALKKLSKKYGPLMHLQLGEISA

VVASSPKMAKEIVKTHDVSFLQRPYFVAGEIMTYGGLGIAFAQYGDHWRQMRKICVTEVLSVKRVQSFASIREDEAAKFI

NSIRESAGSTINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRRIVEIGGGFDLADLFPSIPFLYFITGKMAKLKKL

HKQVDKLLETIVKEHQEKHKRAKEDGAEIEDEDYIDVLLRIQQQSDTLNINMTTNNIKALILDIFAAGTDTSASTLEWAM

TEVMRNPRVREKAQAELRQAFRGKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQLTIIDGYEIPAKTKVMVN

VYAVCKDPKYWVDAEMFVPERFEASSIDFKGNNFEYLPFGGGRRICPGMTFGLATIMLPLALLLYHFNWELPNKIKPENM

DMAEQFGVAIGRKNELHLIPSVNDLCVH* 14378072

 

>CYP71D137PX discontinued seq

Gm0020x00169:peptide         scaffold_20:7338351..7336739 (- strand)

1 aa diff to CYP71D138, missing N-term

chr10

28106849 SPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLAD

VFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLL

RIQQDDTLDIQMTTNNIKALIL (0)

DIFAAGTDTSASTLEWAMAE

MMRNPRVWEKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQ

PTIIDGYEIPAKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGG

RRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28059969

 

>CYP71D138X discontinued seq

Gm0020x00171:peptide 68% to 71D96 scaffold_20:7363851..7362198 (- strand)

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQ

KLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQM

ISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSLICASISRVAFG

GIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVNKVLENIIREHQEKNKIAKEDGAELED

QDFIDLLLRIQQDDTLDIQMTTNNIKALIL

DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDL

EQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGN

NFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*

 

>CYP71D139X discontinued seq

Gm0020x00172:peptide 100% to 71D138 above

scaffold_20:7381829..7380089 (- strand)

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV

ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS

IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK

QVNKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTSASTLEWAMAEM

MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA

ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD

EHFGLAIGRKNELHLIPNVNL*

   

>CYP71D140X discontinued seq

Gm0020x00173:peptide           scaffold_20:7396143..7394663 (- strand)

2 aa diffs to CYP71D138

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQ

KLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQM

ISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSLICASISRVAFG

GIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNQIAKEDGAELED

QDFIDLLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDL

EQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGN

NFNYLPFGGGRRICPGMTLGLASIMLPL

ALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*

 

>CYP71D141X discontinued seq

Gm0020x00176:peptide         scaffold_20:7437789..7436033 (- strand)

2 aa diffs to CYP71D138

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVI

ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS

IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK

QVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL DIFAAGTDTSASTLEWAMAEM

MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA

ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD

EHFGLAIGRKNELHLIPNVNL*

 

>CYP71D142X discontinued seq

Gm0020x00177:peptide   scaffold_20:7444105..7442757 (- strand)

4 aa diffs to CYP71D138

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQ

KLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKK

YGPLMHLQLGEISAVIASSPKMAKEIVKTHDVSFLQRPHLVFGQM

ISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSLICASISR

VSFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVD

KVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL

DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDL

EQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVM

VNAYAICKDSQYWIDADRFVPERF

QGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNM

DEHFGLAIGRKNELHLIPNVNL*

 

>CYP71D143PX discontinued seq

Gm0020x00179:peptide         scaffold_20:7498285..7496736 (- strand)

missing the N-terminal, 5 aa diffs to CYP71D138

100% to Glyma10g22100.1 Gm10:28092932..28094478 (- strand)

YGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFAS

IREDEAAKFIDSIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFL

TGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTS

ASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDQEQLTYLKLVIKETFKVHPPTPLLLPRECSQPTIIDGYEIP

AKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGNKFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELP

NKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*

 

>CYP71D144PX discontinued seq

Gm0020x00180:peptide scaffold_20:7512153..7509432 (- strand)

9 aa diffs to CYP71D138

4 aa diffs to Glyma10g22120.1 Gm10:28105591..28107347 (- strand)

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV

ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQTRKMCATELLSTKRVQSFASIREDEAAKFIDS

IRESAGSPINLTSRIFSLICASISRVAF

GGIYKDQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGK

MTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLL

RIQQDDTLDIQMTTNNIKALILDIFAAGTDTSAST

LEWAMAE

TTRNPTVREKAQAELRQAF*EK

EIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYAICKDSQYW

IDADRFVPERFEVSSIDFKGNNFNYLLFGGGRRICPGMTFGLASIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIG

RKNELHLIPNVNL*

 

>CYP71D145PX discontinued seq

Gm0763x00001:peptide scaffold_763:554..59 (- strand) 100% to CYP71D138

duplicate contig

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHAL

RDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIA

FAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSL

ICASIS

 

>CYP71D146X discontinued seq 2 aa diffs to CYP71D138, 2 aa diffs to 71D147

Gm1021x00001:peptide 68% to 71D96   scaffold_1021:5083..3343 (- strand)

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV

ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS

IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK

QVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTSASTLEWAMAEM

MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLMLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA

ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD

EHFGLAIGRKNELHLIPNVNL*

 

>CYP71D147x discontinued seq 2 aa diffs to CYP71D138, 2 aa diffs to 71D138

Gm1090x00001:peptide 68% to 71D96   scaffold_1090:8646..6906 (- strand)

Exact match to new scaffold_265 but split into two genes and exon 1 runs off the end

Also 100% to CYP142

MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV

ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS

IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK

QVNKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL DIFAAGTDTSASTLEWAMAEM

MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA

ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD

EHFGLAIGRKNELHLIPNVNL*

 

>Scaffold_265 of Glyma1 100% to CYP71D147 exon 2 (-) strand

100% to 71D134, 71D135 71D143, 71D142, 71D141, 71D140P

Glyma0265s00200.1        scaffold_265:16254..16862 (- strand)

16862  DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHES

       DLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY  16563

16562  AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLA

       SIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*  16254

 

>Scaffold_265 of Glyma1 100% to CYP71D147 N-term (-) strand

Based on their orientation these pieces are from two different genes 16,034 apart.

This is nearly the same spacing and orientation between CYP142 Cterm and CYP141 N-term

16,147 bp.  These pieces are probably a part of the chr10 CYP71D cluster

no gene model

220  MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQL 2

 

>CYP71AH3    Glycine max (soybean)

            GenEMBL Y10489 (1603bp)

            Schopfer,C.R. and Ebel,J.

            Identification of elicitor-induced cytochrome P450s of soybean

            (Glycine max L.) using differential display of mRNA

            Mol. Gen. Genet. 258, 315-322 (1998)

            clone CP1

            formerly CYP71A9

scaffold_35 3592302..3588813 (-) strand

chr5 (+) strand 2116886-2120375  Glyma05g02760.1

MISFTVFVFLTLLFTLSLVKQLRKPTAEKRRLLPPGPRKLPFIG

NLHQLGTLPHQSLQYLSNKHGPLMFLQLGSIPTLVVSSAEMAREIFKNHDSVFSGRPS

LYAANRLGYGSTVSFAPYGEYWREMRKIMILELLSPKRVQSFEAVRFEEVKLLLQTIA

LSHGPVNLSELTLSLTNNIVCRIALGKRNRSGADDANKVSEMLKETQAMLGGFFPVDF

FPRLGWLNKFSGLENRLEKIFREMDNFYDQVIKEHIADNSSERSGAEHEDVVDVLLRV

QKDPNQAIAITDDQIKGVLV (0) 2117773

2119767 DIFVAGTDTASATIIWIMSELIRNPKAMKRAQEEVRDL

VTGKEMVEEIDLSKLLYIKSVVKEVLRLHPPAPLLVPREITENCTIKGFEIPAKTRVL

VNAKSIAMDPCCWENPNEFLPERFLVSPIDFKGQHFEMLPFGVGRRGCPGVNFAMPVV

ELALANLLFRFDWELPLGLGIQDLDMEEAIGITIHKKAHLWLKATPFCE 2120375

 

>CYP71AH7P

chr5 83% to 71AH3

Glyma05g02750.1 Gm05:2112489..2113935 (+ strand)

2112676  DIFVVGTSTASATIIWTMSELIRNPKAMKRAQ

         EEIRGVVKGKEMVEEIDLSRLLYLKSFVKEDLRLHPPVPL  2112891

         LMPRETTESCTIKGFEIPTKTTRVLVNAKSI  2112984

 

>CYP71AU9    Glycine max (soybean, Fabales)

            No accession number

            Muhammad Azam Chattha

            Submitted to nomenclature committee 9/26/2008

            Clone C59

            50% to 71A26, 42% to 71D81 medicago, 51% to

            71AR1v1

            67% to 71AU7 Medicago, 55% to 71AU3 Vitis

scaffold_230 488534..490780 (+) strand

chr16 Glyma16g32010.1 Gm16:35215946..35218490 (- strand)

488534 35218490  MWISQQENSSSWFFLPVVTFIILFLLRTFLNLLSNRNNDSKKPSPPSPPKLPIIGNLHQL

GTHIHRSLQSLAQTYGSLMLLHLGKVPVLVVSTAEAAREVLKTHDPVFSNKPHRKMFDIL

LYGSKDVASAPYGNYWRQTRSILVLHLLSAKKVQSFEAVREEEISIMMENIRKCCASLMP

VDLTGLFCIVANDIVCRAALGRRYSGEGGSKLRGPINEMAELMGTPVLGDYLPWLDWLGR

VNGMYGRAERAAKKVDEFFDEVVDEHVNKGGHDGHGDGVNDEDQNDLVDILLRIQKTNAM

GFEIDRTTIKALIL (0)

DMFGAGTETTSTILEWIMTELLRHPIVMQKLQGEVRNVVRDRTHIS

EEDLSNMHYLKAVIKETFRLHPPITILAPRESTQNTKVMGYDIAAGTQVMVNAWAIARDP

SYWDQPEEFQPERFLNSSIDVKGHDFQLLPFGAGRRACPGLTFSMVVVELVIANLVHQFN

WAIPKGVVGDQTMDITETTGLSIHRKFPLIAIASPHA* 35216244 490780

 

>CYP71AU10P CYP71AU12P 83% to 71AU12

Gm0003x00766:peptide 12800k+

Glyma05g19650.1

23651424 YDIAAGT*VLVNARVIARDLSWDQSLEFKLERFLSSSIDFKGLDFELIPFGAKRRGCPR

VTFATIIIEVVLANLVHQFDWSLPSGATGEDLDMSETTGLVVHKKSPLLVATVYQRN* 23651077

 

>CYP71AU11P CYP71AU23P Chr7 Glyma07g31370.1 80% to 71AU12

36396710    NLHQLGLFPHRTLQTLAKNYGPLMLLHFGKVPVHVVS

SSDAAREVMKTHDLVFSDRPQRKINDILLYGSKDLPSSNYGEH*RQLRSLSVLHLLST  36396994

36396995    KRVQSFRGVREEKTARMMENIWQCCCDSLHVNLSDLC

AALANDVACRAALGRR  36397153

YCGGEGREF

QHWLLEFRELLVAVSVGEDYVLWLDWMSKVNGLSQRAHGVAKNLDQF

IDEVISDHVRNGRDGHVDVDSEEQNDFVNVLLSIEKSKTTGSTIDRTPIK

36398323 DMLVAGTDTTYTTLEWTISELLKHP 36398397

 

>CYP71AU12 CYP71AU11 60% to 71AU27

Gm0191x10008:peptide scaffold_191:338028..333406 (- strand)

Glyma07g31380.1 Gm07:36400692..36405314 (- strand)

338018  36405304 MLFFTVFVLCLSLAFMIKWYSNAVTSKNSPPSPPRLPLLGNLHQLGLFPHRTLQTLAKKYGPLMLLHFGKVPVLVVSSAD

AAREVMRTHDLVFSDRPQRKINDILLYGSKDLASSKYGEYWRQIRSLSVSHLLSTKRVQSFRGVREEETARMMDNIRECC

SDSLHVNLTDMCAAITNDVACRVALGKRYRGGGEREFQSLLLEFGELLGAVSIGDYVPWLDWLMSKVSGLFDRAQEVAKH

LDQFIDEVIEDHVRNGRNGDVDVDSKQQNDFVDVLLSMEKNNTTGSPIDRTVIKALILDMFVAGTDTTHTALEWTMSELL

KHPMVMHKLQDEVRSVVGNRTHVTEDDLGQMNYLKAVIKESLRLHPPLPLIVPRKCMEDIKVKGYDIAAGTQVLVNAWVI

ARDPSSWNQPLEFKPERFLSSSVDFKGHDFELIPFGAGRRGCPGITFATNIIEVVLANLVHQFDWSLPGGAAGEDLDMSE

TAGLAVHRKSPLLAVATAYQRN* 36400833 333541

 

>CYP71AU12-de1b CYP71AU11-de1b

Gm0191

Chr7 no gene model, pseudogene 1 kb upstream of CYP71AU12

339459 36406745 REFQHLLLEFGELLGTVSIGDYVPWLDRLTNKVSGLFERAHRVAKL

LNQFINEVIEEHFRNGRGVDVDVDVDVDSEEQNE 339220

NDFVDALLSIE

339184  NNTTGSPIDRTAIKALI 36406420 339134

 

>CYP71AU13P CYP71AU24P