This file still needs some work to make it look pretty and to get rid of confusing
older names that are present after making some name revisions. The name to the left
after the > is the correct name. All soybean P450 sequences are included here.
The names match the Excel file of soybean P450s by name order.
I have not updated the chromosome order Excel file, so be careful.
David Nelson
Last modified June 25, 2009
Last modified Oct. 9, 2009 added 12 pseudogenes
>CYP51G1 Glycine max (soybeans, Fabales)
DQ340249
Li,L.Y. and Yu,D.Y.
Comprehensive analysis of putative P450 genes superfamily in
Glycine max and Medicago truncatula
Unpublished
scaffold_72:3448996..3446386 (- strand)
chr7 13905939-13903329 (-) strand Glyma07g14460.1
3448996 MEIDSRFLNTGLLLVATILVAKLISAFIVPKSRKRVPPIVKGWP
LIGGLIRFLKGPIFMLRDEYPKLGSVFTLKLFHKNITFLIGPEVSAHFFKASETDLSQ
QEVYQFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRVNKLKGYVNQMVAEAE
(0)
DYFSKWGPSGEVDLKYELEHLIILTASRCLLGREVRDKLFDDVSALFHDLDNGMLPISVLFPYL
PIPAHKRRDQARKKLAEIFASIITSRKSASKSEEDMLQCFIDSKYKDGRSTTEAEVTG
LLIAALFAGQHTSSITSTWTGAYLLSNNQYLSAVQEEQKMLIEKHGDRVDHDVLAEMD
VLYRCIKEALRLHPPLIMLMRSSHTDFSVTTREGKEYDIPKGHIIATSPAFANRLGHV
FKDPDRYDPDRFAVGREEDKVAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNF
ELELVSPFPEIDWNAMVVGVKGKVMVRYKRKELSVNQ* 3446386
>CYP51G8 Gm0073x00075 scaffold_73:1407411..1410201 (+ strand)
No good boundary at MLIE/KKHG but 5 more aa can be added to get a (2) boundary
7 other aa diffs, possibly a pseudogene. One extra intron
chr3 34351408-34349121 (-) strand Glyma03g26820.1
1407651
MEIDGRFLNTGLLLVATILVAKLISAFIVPKSRKRVPPIVKGWPLIGGLIRFLKGPIFMLREEYPKLG
SVFTLKLFHKNITFLVGPEVSAYFFKASETDL
1407950
1407951
SQQEVYQFNVPSFGPGVVFDVDYSVRQEQFRFFTEALRVNKLKGYVNQMVAEAE (0) 1408112
1408938
DYFSKWGPSGEVDLKYELEHLIILTASRCLLGREVRDKLFDDVSALFHDLDNGMLPIS
VLFPYLPIPAHKRRDQARKKLAEIFASIITSRKSASKSEED 1409234
1409235
MLQCFIDSKYKDGRSTTEAEVTGLLIAALFAGQHTSSITSTWTGAYL
LSDNQCLSAVQEEQKMLIESMGTE
(2) 1409432
1409429
KHGDRVDHDVLAEMDVLYRCIKEALRLHPPLIMLMRSSHTDFSVTTREGKEYDIPKG
HIIATSPAFANRLGHVFKDPDRYDPDRFAVGREEDKVAGAFSY 1409731
1409732
ISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFELELVSPFPEI
DWNAMVVGVKGKVMVRYKRKELSVNQ* 1409938
>CYP51G9P
chr18 50% to 51G1
61387599 LPLKSILCIYQNSFELELVSPFPEVN*NIMVVGLKGKVLLKNMHRDYS 61387742
>CYP51G10P
chr2 no gene model 53% to 51G8
7912267 LLLKPILCIYQNSFELELVSPFTEVN*NTMVAGVKGKVVLK 7912145
>CYP71A10 Glycine max (soybean)
GenEMBL AF022157 (1838bp)
Siminszky,B., Dewey,R.E. and Corbin,F.T.
capable of catalyzing the metabolism of phenylurea herbicides
MALLSSVLKQLPHELSSTHYLTVFFCIFLILLQLIRRNKYNLPP
SPPKIPIIGNLHQLGTLPHRSFHALSHKYGPLMMLQLGQIPTLVVSSADVAREIIKTH
DVVFSNRRQPTAAKIFGYGCKDVAFVYYREEWRQKIKTCKVELMSLKKVRLFHSIRQE
VVTELVEAIGEACGSERPCVNLTEMLMAASNDIVSRCVLGRKCDDACGGSGSSSFAAL
GRKIMRLLSAFSVGDFFPSLGWVDYLTGLIPEMKTTFLAVDAFLDEVIAEHESSNKKN
DDFLGILLQLQECGRLDFQLDRDNLKAILVDMIIGGSDTTSTTLEWTFAEFLRNPNTM
KKAQEEVRRVVGINSKAVLDENCVNQMNYLKCVVKETLRLHPPLPLLIARETSSSVKL
RGYDIPAKTMVFINAWAIQRDPELWDDPEEFIPERFETSQVDLNGQDFQLIPFGIGRR
GCPAMSFGLASTEYVLANLLYWFNWNMSESGRILMHNIDMSETNGLTVSKKVPLHLEP
EPYKT
>CYP71A10 scaffold_157: 676518..679205 (+) strand
chr6 14848477-14851164
no gene model
676518
MALLSSVLKQLPHELSSTHYLTVFFCIFLILLQLIRRNKYNLPPSPPKIPIIGNLHQL
GTLPHRSFHALSHKYGPLMMLQLGQIPTLVVSSADVAREI 676811
676812
IKTHDVVFSNRRQPTAAKIFGYGCKDVAFVYYGEEWRQKIKTCKVELMS
LKKVRLFHSIRQEVVTELVEAIGEACGSERPCVNLTEMLMAASNDIV 677099
677100
SRCVLGRKCDDACGGSGSSSFAALGRKIMRLLSAFSVGDFFPSLGWVD
YLTGLIPEMKTTFLAVDAFLDEVIAEHESSNKKNDDFLGILLQLQECGRLD 677396
677397 FQLDRDNLKAILV
(0) 677435
678582
DMIIGGSDTTSTTLEWTFAEFLRNPNTMKKAQEEVRRVVGINSK
AVLDENCVNQMNYLKCVVKETLRLHPPLPLLIARETSSSVKLRGYDIPAKTMVFIN 678881
678882
AWAIQRDPELWDDPEEFIPERFETSQVDLNGQDFQLIPFGIGRRGCPA
MSFGLASTEYVLANLLYWFNWNMSESGRILMHNIDMSETNGLTVSKKVPLHLEPEPYKT* 679205
>CYP71A33 Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C20
79% to 71A10, same as EST AI496547
MAFLSSVLKQLAYEPSSTHYLTAFFCFVSLLLMLKLTRRNKSNFPPSPPKLPIIGNLHQL
GTLPHRSFQALSRKYGPLMMLQLGQTPTLVVSSADVAREIIKTHDVVFSNRPQPTAAKIF
LYNCKDVGFAPYGEEWRQTKKTCVVELLSQRKVRSFRSIREEVVSELVEAVREACGGSER
ENRPCVNLSEMLIAASNNIVSRCVIGRKCDATVGDSVNCSFGELGRKIMRLFSAFCVGDF
FPSLGWVDYLTGLIPEMKATFLAVDAFLDEVIAERESSNRKNDHSFMGILLQLQECGRLD
FQLSRDNLKAILM (0)
>CYP71A33 Gm0157x00067:peptide scaffold_157:686782..689402 (+ strand)
Glyma06g18560.1
Gm06:14858479..14861103 (+ strand)
MAFLSSVLKQLAYEPSSTHYLTAFFCFVSLLLMLKLTRRNKSNFPPSPPKLPIIGNLHQLGTLPHRSFQALSRKYGPLMM
LQLGQTPTLVVSSADVAREIIKTHDVVFSNRPQPTAAKIFLYNCKDVGFAPYGEEWRQTKKTCVVELLSQRKVRSFRSIR
EEVVSELVEAVREACGGSERENRPCVNLSEMLIAASNNIVSRCVIGRKCDATVGDSVNCSFGELGRKIMRLFSAFCVGDF
FPSLGWVDYLTGLIPEMKATFLAVDAFLDEVIAERESSNRKNDHSFMGILLQLQECGRLDFQLSRDNLKAIL
(0)
MDMIIGGSDTTSTTLEWAFAELLRKPNTMKKAQEEIRRVVGINSRVVLDENCVNQMNYLKCVVKETLRLHSPVPLLVARETSSSVKLR
GYDIPAKTMVFINAWAIQRDPELWDDPEEFIPERFETSQIDLNGQDFQLIPFGSGRRGCPAMSFGLASTEYVLANLLYWF
NWNMSESGMLMHNIDMNETNGLTVSKKIPLHLEPEPHIP*
>CYP71A34 CYP71A40
4583-4586k+ Gm0069x00326:peptide
partial 54% to 71A33
chr4 11168538-11170517 (+) strand Glyma04g12180.1
MASIQWPYEQLKTAFSVSTFHQYLFLFLLVIV
VLKLTRRPKIKPSFNLPPSPRKLPIIGNLHQLSKLPYHSLRTLSQKHGSLMLLQL
GQTRALVVSSPDAVREIMKTHDITFSNRPKTTAAKTLLYGCNDIGFASYGESWKHKRKIC
VLELLSPKRVQSLSLIREEEVAELINKIREASLSDASSSVNLSELLIETTNNIICKCALG
KKYSTEDCHSRIKELAKRAMIQLGVVTVGDRFPFLGWVDFLTGQIQEFKATFGALDALFD
QVIAEHKKMQRVSDLCSTEKDFVDILIMPDSELTKDGIKSILL (0)
DMFVAGSETTASALEWAMAELMKNPMKLKKAQDEVRKFVGNKSKVEENDINQ
MDYMKCVIKETLRLHPPAPLLAPRETASSVKLGGYDIPAKTLVYVNAWAIQRDPEFWERP
EEFIPERHDNSRVHFNGQDLQFITFGFGRRACPGMTFGLASVEYILANLLYWFNWKLPAT
HTSGQDIDMSETYGLVTYKKEALHLKPIPFFL*
>CYP71A34-de1b CYP71A40-de1b
71% to 71A34
chr4 11166176-11166347 (+) strand
no gene model
LSYLAHCHSFITLSQKHGSMVL*QLGLTRALVVLSAGC 4580389
4580385 DVVREIMKTHDITFSKRPKIT 4580447
>CYP71A35P CYP71A43P
scaffold_47 80% to 71A10 723-724k-
chr4 42894-42895k+ Glyma04g36340.1 gene model is wrong
42894308 GDFFPSLDWVDYLTDLILEMKTTFLAVDAFLDEIIVEHESNNKKNDDFLGILLQLQECGR
LDFQHVRDNLKTILM
MIIGGSDTTSTTLEWTFA*LLRNPNTMKKAQEE
SRRVVGTNSRVVLDENCVNQMNYLKCVVRETLRLHPPVPLLVA*E
TSSSVKLRGYHTTTKIMVFINASTIQRDTKLWDDPGEFIPKRFETNQVDFNGQDFQLISF
SIGRKGCPTMSFGLASAQYVLSNLLYWFN*KMPKFGILLMHDADMSETNGLTVNKKIQLH
LVP*PYKT 42895349
>CYP71A36P CYP71A42P
scaffold_47 719-720k- 75% to 71A33
chr4 Gm04:42898450..42900611 (+ strand)
Glyma04g36350.1 model is wrong
42898361
MAPPPSVLKLLSHELSSTN*FLSVFFCFLSLLFLLKLAKRNKFNLPPSPPKLPIIGNLHQ
LGTLPHRSFHALSRKYGPLMLLQLGQIPTLVVSSAEVAREIIKKHDIAFSNRPQSTAAKI
NSNDVDFSNYDEEWRQKKNTCVVEPLSQKKVRSFRSIQEEVVAELVEGVREACG-SERE-
RPCVNLTEMLIAASNNIVSRCVHGRKCDDRIGGGGGSSCSFGVLGRKVMRLLSAFSVGDF
FP*LGWVDSLTGLIPEMKAMSVTIDAFFDEVIAEHE
NMKNDESDVEDFVGILLHQLQE
CGKLDFELTRDNLKGILV
DMIIGGSYTTSTTLEWVFADLI 42899986
>CYP71A37 CYP71A45
chr5 92% to 71A43 Glyma05g02730.1
Gm05:2090956..2093112 (- strand)
Glyma05g02730.1:peptide 89% to 71A43 Gm05:2090956..2093112 (- strand)
MALRSVFFYLLSISFFLHQTKPETNLKLPPSPPKIPIIGNIHQFGTLPHRSLRDLSLKYGEMMMLQLGQMQTPTLVVSSV
DVAMEIIKTYDLAFSDRPHNTAAKILLYGCADVGFASYGDKWRQKRKICVLELLSTKRVQSFRAIREEEVAELVNKLREA
SSSDASYVNLSEMLMSTSNNIVCKCALGRSFTRDGNNSVKNLAREAMIHLTAFTVRDYFPWLGWIDVLTGKIQKYKATAG
AMDALFDTAIAEHLAEKRKGQHSKRKDFVDILLQLQEDSMLSFELTKTDIKALLT (0)
DMFVGGTDTTAAALEWAMSELVRNP
IIMKKVQEEVRTVVGHKSKVEENDISQMQYLKCVVKETLRLHLPTPLLPPRVTMSNVKLKGFDIPAKTMVYINAWAMQRD
PRFWERPEEFLPERFENSQVDFKGQEYFQFIPFGFGRRGCPGMNFGIASIEYVLASLLYWFDWKLPDTLDVDMSEVFGLV
VSKKVPLLLKPKTFPF*
>CYP71A38P CYP71A44P
Glyma05g02720.1:peptide Gm05:2070754..2074103 (- strand)
84% to 71A43 missing N and C-terms
KTNLNLPPSPPKLPIIGNLHQLGTLPHRSLRDLSLKYGDMMMLQLGQRQTPTLVVSSAEVAMEIMKT
HDLAFSNRPQNTAAKILLYGCTDVGFALYGEKWRQKRKICVLELLSMKRVQSFRVIREEEVAELVNKLREASSSDAYYVN
LSKMLISTANNIICKCAFGWKYTGDGYSSVKELARDTMIYLAAFTVRDYFPWLGWIDVLTGKIQKYKATAGAMDALFDQA
IAKHLTGKTEGEQSKRK &
DLVDILLQLQEDSMLSFELTKNDLKALIT (0)
DMFIGGTDTTSSTLEWAISELVRNPIIMRKVQEEVR &
SIVGHKSNVEENDVTQMHYLKCVVKETLRLHPPTPLLAPRETMSSVKLKGYD
IPAETMVYINAWAIQRDPEFWESPEEFLPERFENSQVHFKGQEYFQFIPFGCGRRECPGI
NFGIASIDYVLASLLDWFD
>CYP71A39P CYP71A35P Gm0157 55% to 71A33
Glyma06g18550.1 Gm06:14843220..14853611 (+ strand) error 3 gene span
14843130-14843883 (+) strand
671177 MALHLPSFMKQVQIFYLN
671225
YATLFLLLSFISMLVAFKLTRRRRRSKLNLPSSPPRLQIIGNYHQLRKLPHRS 671383
671385
FQTLSQKHNPLLMLQLGQLPVWVVSSANLAREVMQTHDP
VLASRPHLPATEILLYECKDVGHSSNGETWREKRKLCVNELLSMKRVRSVQFIREEEVEAL 671684
671685
VSYIRKACSVINLSEMLVTAS
671747
NNIVCRFTFGSKYDA 14843883
>CYP71A40P CYP71A41P
scaffold_157:
Gm0157x00064
scaffold_157:660596..660246 (- strand)
Pseudogene 53% to 71D96, 54% to 83E14
Glyma06g18520.1
Gm06:14832208..14832556 (- strand)
660596
DTAGTDTTFITL
DWTMTELLMNPQVMEKAQKEVRSILGERRIVTESDLHQLEYMRAVIKEIFWLHPPVPVLV
PRESMEDVVIEGYRAPAKTRVFVNAWAIGRDPESWEDPNAFNPES 660246
>CYP71A41P CYP71A34P pseudogene scaffold_157: 682263..683677 (+) strand
pseudogene 71% to 71A10
chr6 14853964-14855378 (+) strand
682263 14853964 EACASQRERPCVNLSEMLIAASNNIVSRCVLGRKYDDKMGCARSSSCSFGVLGRKVMR
LLSAFCVGDFFPSLCWVDSLTGLIP*MKSTSVAIDASL &
DEEVIVEHESKNMKNDHSDVKDFLGILLHLQECRRLDFKLTRDNLKGILM (0)
682826
DMIVGGSDTTSTNLEWAFVDLFRKPNTMNKA*EEVRKAMGINSRLVDEKC
VNQMNYLKCVIKETMRLHTLIPLLIARETTSNVKLR & 683083
YDIPAKTRIFINAWAI &
PEEFILERFEISQVDLNG*DFQLILFGGGRRACPA &
ISFELASTKCVLANL &
YRFNWKMPKSCVMMHNINMIE*N &
GFSVS*KVPFHFK*ESYI 683677 14855378
>CYP71A42P CYP71A39P
Gm0171
Glyma17g13450.1
Gm17:10297791..10298212 (+ strand)
652427
LHQLGTLSHRTLQQLSNKHGPLMFLQLG
652510
652519
PTLVVSSTEMAREIFKNRDSVFSGRPSLHAANRLGYNGSTVSFA
PYGEYWREMRKIMILELLSPKRVQSFQAVRLEEVKLLL 652764
>CYP71A43 CYP71A38
Gm0171x10010:peptide 59% to 71A33 scaffold_171:641359..639377 (- strand)
Glyma17g13430.1
Gm17:10284540..10286991 (- strand)
MALLKQWPYEVFSSTFYISLSFFISVLLLFKLTKRTKPKTNLNLPPSLPKLPIIGNIHQ
FGTLPHRSLRDLSLKYGDMMMLQLGQMQTPTLVVSSVDVAMEIIKTHDLAFSDRPHNTAAKILLYGCTDVGFASYGEKWR
QKRKICVLELLSMKRVQSFRVIREEEAAKLVNKLREASSSDASYVNLSEMLMSTSNNIVCKCAIGRNFTRDGYNSGKVLA
REVMIHLTAFTVRDYFPWLGWMDVLTGKIQKYKATAGAMDALFDQAIAEHLAQKREGEHSKRKDFLDILLQLQEDSMLSF
ELTKTDIKALVTDMFVGGTDTTAAVLEWAMSELLRNPNIMKKVQEEVRTVVGHKSKVEENDISQMHYLKCVVKEILRLHI
PTPLLAPRVTMSDVKLKGYDIPAKTMVYINAWAMQRDPKFWERPEEFLPERFENSKVDFKGQEYFQFIPFGFGRRGCPGM
NFGIASVEYLLASLLYWFDWKLPETDTQDVDMSEIFGLVVSKKVPLLLKPKTFSF*
>CYP71A44 CYP71A37
74% to 71A34 no gene model, Gm0171
Glyma17g13420.1 Gm17:10265063..10269766 (- strand)
623989 10269539 MAFSTFYLSLFFF
ISVLYLFNLTRKTKSKTNLNLPPSPPKLPLIGNLHQLGSLPHRSLRDLSLKHGDIMLLQL
GQMQNPTVVVSSADVAMEIMKTHDMAFSNRPQNTAAKVLLYGGIDIVFGLYGERWSQKRK
ICARELLSTKRVQSFHQIRKEEVAILVNKLREVSSSEECYVNLSDMLMATANDVVCRCVL
GRKYPGVKELARDVMVQLTAFTVRDYFPLMGWIDVLTGKIQEHKATFRALDAVFDQAIAE
HMKEKMEGEKSKKKDFVDILLQLQENNMLSYELTKNDLKSLLL (0) 623102
620430 DMFVGGTDTSRATLEWTLSELVRNPTI
MKKVQEEVRKVVGHKSNVEENDIDQMYYLKCVVKETLRLHSPAPLMAPHETISSVKLKGY
DIPAKTVVYINIWAIQRDPAFWESPEQFLPERFENSQVDFKGQHFQFIPFGFGRRGCPGM
NFGLAFVEYVLASLLYWFDWKLPESDTLKQDIDMSEVFGLVVSKKTPLYLKPVTVSSLSEF*
10265354 619804
>CYP71A45P CYP71A36P
Gm0171x10009:peptide scaffold_171:618246..616969 (- strand)
62% to 71A33 no gene model chr17 10262-10263k-
10263796 MGLESIQQEPCSASHLSCLNSQEEATPIHQHRQSSSATHVAPYSEEAEKKGL
618090
FVSTKSVRSFQFIIVEEVAEMIGVIHEACATSYKKLASVNLSELLIALTN 617941
617908 ELGRKLLCQFTAFWMGDFFPSLAWVDVLA
GQIPKFKVTLSSLDSFFDQVIAQHKEKMNKREDHEQSDTKDFVDILLQLEEAGMLGFELSHDNLKAMLV
617225
DMFIGASDTTSTTLEWTMAELMRHQNTMEKVQEEVRRVVGYKAEVDEND
VKQMNYLKCVVKETLRLHPPAPLLLPRENTSVVKLRGYEIQAKTRLM
LNAWAIQRDPEFCDGPDEFL 616878 10262428
>CYP71D8 Glycine max (soybean)
GenEMBL Y10493 (1800bp)
Schopfer,C.R. and Ebel,J.
Identification of elicitor-induced cytochrome P450s of soybean
(Glycine max L.) using differential display of mRNA
Mol. Gen. Genet. 258, 315-322 (1998)
clone CP7
note: genomic sequence does not give intact sequence
but the mRNA does so the genomic seq may have errors
Gm0053x00464:peptide
scaffold_53:3854820..3852677 (- strand)
Chr11 Glyma11g06690.1, Gm11:4727299..4729773 (+ strand)
3854820
4727347 MEYSPLSIVITFFVFLLLHWLVKTYKQKSSHKLPPGPWRLPIIG
NLHQLALAASLPDQALQKLVRKYGPLMHLQLGEISTLVVSSPKMAMEMMKTHDVHFVQ
RPQLLAPQFMVYGATDIAFAPYGDYWRQIRKICTLELLSAKRVQSFSHIRQDENKKLI
QSIHSSAGSPIDLSGKLFSLLGTTVSRAAFGK
3854245 &
3854243 ENDDQDEFMSLVRKA 3854199 &
3854197 ITMTGGFEVDD
MFPSLKPLHLLTRQKAKVEHVHQRADKILEDILRKHMEKRTRVKEGNGSEAEQEDLVD
VLLRLKESGSLEVPMTMENIKAVIW
(0) 3853916
NIFAAGTDTSASTLEWAMSEMMKNPKVKEKAQA
ELRQIFKGKEIIRETDLEELSYLKSVIKETLRLHPPSQLIPRECIISTNIDGYEIPIK
TKVMINTWAIGRDPQYWSDADRFIPERFNDSSIDFKGNSFEYIPFGAGRRMCPGMTFG
LASITLPLALLLYHFNWELPNKMKPEDLDMDEHFGMTVARKNKLFLIPTVYEAS
4729485
>CYP71D9 Glycine max (soybean)
GenEMBL Y10490 (1754bp)
Schopfer,C.R. and Ebel,J.
Identification of elicitor-induced cytochrome P450s of soybean
(Glycine max L.) using differential display of mRNA
Mol. Gen. Genet. 258, 315-322 (1998)
clone CP3
Gm0004x00151:peptide
scaffold_4:1805973..1809310 (+ strand)
Glyma18g08950.1
Gm18:7686955..7690536 (+ strand)
MDLQLLYFTSIFSIFIFMFMTHKIVTKKSNSTPSLPPGPWKLPI
IGNMHNLVGSPLPHHRLRDLSAKYGSLMHLKLGEVSTIVVSSPEYAKEVMKTHDHIFA
SRPYVLAAEIMDYDFKGVAFTPYGDYWRQLRKIFALELLSSKRVQSFQPIREEVLTSF
IKRMATIEGSQVNVTKEVISTVFTITARTALGSKSRHHQKLISVVTEAAKISGGFDLG
DLYPSVKFLQHMSGLKPKLEKLHQQADQIMQNIINEHREAKSSATGDQGEEEVLLDVL
LKKEFGLSDESIKAVIW
(0)
DIFGGGSDTSSATITWAMAEMIKNPRTMEKVQTEVRRVFDK
EGRPNGSGTENLKYLKSVVSETLRLHPPAPLLLPRECGQACEINGYHIPAKSRVIVNA
WAIGRDPRLWTEAERFYPERFIERSIEYKSNSFEFIPFGAGRRMCPGLTFGLSNVEYV
LAMLMYHFDWKLPKGTKNEDLGMTEIFGITVARKDDLYLIPKTVHN
>CYP71D10 Glycine max (soybean)
GenEMBL AF022459 (1691bp)
Siminszky,B., Dewey,R.E. and Corbin,F.T.
clone name 5/16
Gm0070x00483:peptide
scaffold_70:3975543..3973852 (- strand)
Chr15 Glyma15g05580.1 Gm15:3950984..3952757 (+ strand)
3975631 3950978 MVMELHNHTPFSIYFITSILFIFFVFFKLVQRSDSKTSSTCKLP
PGPRTLPLIGNIHQIVGSLPVHYYLKNLADKYGPLMHLKLGEVSNIIVTSPEMAQEIM
KTHDLNFSDRPDFVLSRIVSYNGSGIVFSQHGDYWRQLRKICTVELLTAKRVQSFRSI
REEEVAELVKKIAATASEEGGSIFNLTQSIYSMTFGIAARAAFGKKSRYQQVFISNMH
KQLMLLGGFSVADLYPSSRVFQMMGATGKLEKVHRVTDRVLQDIIDEHKNRNRSSEER
EAVEDLVDVLLKFQKESEFRLTDDNIKAVIQDIFIGGGETSSSVVEWGMSELIRNPRV
MEEAQAEVRRVYDSKGYVDETELHQLIYLKSIIKETMRLHPPVPLLVPRVSRERCQIN
GYEIPSKTRIIINAWAIGRNPKYWGETESFKPERFLNSSIDFRGTDFEFIPFGAGRRI
CPGITFAIPNIELPLAQLLYHFDWKLPNKMKNEELDMTESNGITLRRQNDLCLIPITR
LP* 3973855 3952757
>CYP71D96 Glycine max (soybeans, Fabales)
DQ340243,
ESTs BM892131, BI892766, BM094727, BE805102, BF595208, BM892923, BF324498,
CA953308, BE347664
Li,L.Y. and Yu,D.Y.
Comprehensive analysis of putative P450 genes superfamily in
Glycine max and Medicago truncatula
Unpublished
74% to 71D87, 71% to 71D86, Called CYP71D54
cannot be certain about the ortholog
scaffold_13 no introns scaffold_13:5879367..5876961 (- strand)
Glyma01g38590.1
Gm01:50645569..50648012 (- strand)
5879367
MEAQASFLFISLFFSLVLHLLAKHYYKPKTTLSHKLPPGPKKLPLIGNLH
QLAMAGSLPHRTLRDLALKYGPLMHLQLGEISSVVVSSPNMAKEIMKTHD 5879068
5879067
LAFVQRPQFLPAQILTYGQNDIVFAPYGDYWRQMKKICVSELLSAKRVQSFSHIRED
ETSKFIESIRISEGSPINLTSKIYSLVSSSVSRVAFGDKSKDQ 5878768
5878767
EEFLCVLEKMILAGGGFEPDDLFPSMKLHLINGRKAKLEK
MHEQVDKIADNILREHQEKRQRALREGKVDLEEEDLVDVLLRIQ 5878516
QSDNLEIKISTTNIKAVILDVFTAGTDTSASTLEWAMAEMMRNPRVREKAQAEVRQAF
RELKIIHETDVGKLTYLKLVIKETLRLHAPSPLLVPRECSELTIIDGYEIPVKTKVMI
NVWAIGRDPQYWTDAERFVPERFDGSSIDFKGNNFEYLPFGAGRRMCPGMTFGLANIM
LPLALLLYHFNWELPNEMKPEDMDMSENFGLTVTRKSELCLIPIVNDL*
>CYP71D99 Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C1
83% to 71D100, 66% to 71D105, 58% to 71D101, 60% to 71D102,
58% to 71D11, 54% to 71D104, 64% to 71D81 Medicago
= patent seqs CS716170, CS716157, CS716151
Gm0186x00071:peptide
scaffold_186:559955..557867 (- strand)
Glyma08g43900.1
Gm08:43688957..43691102 (- strand)
MALLFFYFLVLISFAFTTIIVQKIRKKPKKTDDTTCKIPHGPRKLPIIGNIYNLLCSQPH
RKLRDLAIKYGPVMHLQLGQVSTIVISSPECAREVMKTHDINFATRPKVLAIEIMSYNST
SIAFAGYGNYWRQLRKICTLELLSLKRVNSFQPIREDELFNLVKWIDSKKGSPINLTEAV
LTSIYTIASRAAFGKNCKDQEKFISVVKKTSKLAAGFGIEDLFPSVTWLQHVTGLRAKLE
RLHQQADQIMENIINEHKEANSKAKDDQSEAEEDLVDVLIQYEDGSKKDFSLTRNKIKAI
ILDIFAAGGETTATTIDWAMAEMVKNPTVMKKAQSEVREVCNMKARVDENCINELQYLKL
IVKETLRLHPPAPLLLPRECGQTCEIHGYHIPAKTKVIVNAWAIGRDPNYWTESERFYPE
RFIDSTIDYKGSNFEFIPFGAGRRICAGSTFALRAAELALAMLLYHFDWKLPSGMRSGEL
DMSEDFGVTTIRKDNLFLVPFPYHPLPVS
>CYP71D100 Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C16
83% to 71D99, 65% to 71D105, 58% to 71D9,
63% to 71D81 Medicago
Gm0186x00072:peptide
scaffold_186:567184..565259 (- strand)
Model is short at N-term
Glyma08g43920.1 Gm08:43696374..43698299 (- strand)
MALLFLFFVALISFLFTILIVQKLGKKSKKTDDTTCDMHMPHGPRKLPIIGNIYNLICSQ
PHRKLRDLAIKYGPVMHLQLGEVSTIVISSPDCAKEVMTTHDINFATRPQILATEIMSYN
STSIAFSPYGNYWRQLRKICILELLSLKRVNSYQPVREEELFNLVKWIASEKGSPINLTQ
AVLSSVYTISSRATFGKKCKDQEKFISVLTKSIKVSAGFNMGDLFPSSTWLQHLTGLRPK
LERLHQQADQILENIINDHKEAKSKAKGDDSEAQDLVDVLIQYEDGSKQDFSLTKNNIKA
IIQDIFAAGGETSATTIDWAMAEMIKDPRVMKKAQAEVREVFGMNGRVDENCINELQYLK
LIVKETLRLHPPAPLLLPRECGQTCEIHGYHIPAKTKVIVNAWAIGRDPKYWTESERFYP
ERFIDSTIDYKGNSFEFIPFGAGRRICPGSTSALRTIDLALAMLLYHFDWNLPNGMRSGE
LDMSEEFGVTVRRKDDLILVPFPYHPLPVT
>CYP71D101 Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C2
92% to 71D102, 75% to 71D9, 61% to C19, 60% to 71D100, 58% to 71D99,
56% to 71D104, 60% to 71D81 Medicago
scaffold_4:1781316..1783621 (+ strand) missing N-term
Gm0004x00149:peptide
Glyma18g08930.1
Gm18:7662218..7664677 (+ strand)
1781204 MDLQTLYFTSILSIFIFMFLGHKIITKKPASTPNLPPGPWKIPIIGNIHNVVGSLPHHRL
RDLSAKYGPLMHLKLGEVSTIVVSSPEYAKEVLSTHDLIFSSRPPILASKIMSYDSMGMS
FAPYGDYWRRLRKICASELLSSKRVQSFQPIRGEELTNFIKRIASKEGSPINLTKEVLLT
VSTIVSRTALGNKCRDHKKFISAVREATEAAGGFDLGDLYPSAEWLQHISGLKPKLEKYH
QQADRIMQNIVNEHREAKSSATHGQGEEVADDLVDVLMKEEFGLSDNSIKAVIL
(0)
DMFGGGTQTSSTTITWAMAEMIKNPRVMKKVHAEVREVFGGKVGHPDESDMENLKYLKSVVKETLR
LHPPGPLLLPRQCGQACEINGYYIPIKSKVIINAWAIGRDPNHWSEAERFYPERFIGSSV
DYQGNSFEYIPFGAGRRICPGLTFGLTNVEFPLALLMYYFDWKLPNEMKNEDLDMTEAFG
VSARRKDDLCLIPITFHL* 1783489
>CYP71D102 Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C18
92% to 71D101, 74% to 71D9, 63% to 71D81 Medicago
Gm0186x00070:peptide
scaffold_186:548163..545888 (- strand)
Glyma08g43890.1 Gm08:43676827..43679376 (- strand)
548163 MKKKSASTPNLPPGPWKLPIIGNILNIVGSLPHCRLRDLSAKYGPLMHLKLGEVSTIVVS
SPEYAKEVLNTHDLIFSSRPPILASKIMSYDSKGMSFAPYGDYWRWLRKICTSELLSSKC
VQSFQPIRGEELTNFIKRIASKEGSAINLTKEVLTTVSTIVSRTALGNKCRDHQKFISSV
REGTEAAGGFDLGDLYPSAEWLQHISGLKPKLEKYHQQADRIMQSIINEHREAKSSATQG
QGEEVADDLVDVLMKEEFGLSDNSIKAVIL (0) 547354
546502 DMFGGGTQTSSTTITWAMAEMIKNPRVTKK
IHAELRDVFGGKVGHPNESDMENLKYLKSVVKETLRLYPPGPLLLPRQCGQDCEINGYHI
PIKSKVIVNAWAIGRDPNHWSEAERFYPERFIGSSVDYKGNSFEYIPFGAGRRICPGLTF
GLTNVELPLAFLMYHFDWKLPNGMKNEDLDMTEALGVSARRKDDLCLIPITFHP*
545888
>CYP71D103P Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C3
61% to 71D105, C-helix to I-helix, 49% to 71D11
54% to 71D93 Medicago middle region
no ESTs
scaffold 49 pseudogene, 64% to 71D105
scaffold_49:5584638..5585264 (+ strand)
chr20 (-) strand Glyma20g01090.1 Gm20:761225..762863 (- strand)
5584572 762882 IYLQLGETTTIIVSSPECVKEI
MKTHDVVFASRPQSATFDILYYESTGIASAPYGNYWRVIRRMCTIELFTQKRVNYFQPIR
EEELSYLIIKIIDYSHKGSSSSPINVSQMVLSSIYSITSTVAFGKNYKD
762490
QEEFISLVKEE
VEIAGRDLYCSARWLQLVTGLRAKLEKLHRQMDRVLENIIIEHKEAKSGAKEGQCEQKKE
DLVDILLKFQDGSDKDICLTNGKFKGIIQ 5585264
5585991 DIFVGGGDTSAITIDWAMAEM & 5586053
5586053 VRDPRVMKKAQAEVRKVFNIKGRIDETCINELKYLKSVVKETLRLQPPFPLVPREC
5586220
>CYP71D103P
chr20 (-) strand Glyma20g01090.1 Gm20:761225..762863 (- strand)
762882
IYLQLGETTTIIVSSPECVKEIMKTHDVVFASRPQSATFDILYYESTGIASAP
YGNYWRVIRRMCTIELFTQKRVNYFQPIREEELSYLIIKIIDYSHKG 762583
762582
SSSSPINVSQMVLSSIYSITSTVAFGKNYKDQEEFISLVKEEVEIAG
RDLYCSARWLQLVTGLRAKLEKLHRQMDRVLENIIIEHKEAKSGAKEGQ 762295
762294
CEQKKEDLVDILLKFQDGSDKDICLTNGKFKGII
762193
>CYP71D104 Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C17
56% to 71D105, 56% to 71D101, 55% to 71D10, 55% to 71D81 Medicago
scaffold_4:1796671..1799722 (+ strand)
Gm0004x00150:peptide
Glyma18g08940.1
Gm18:7677712..7680602 (+ strand)
MDLGHQNIPSLAILPFFLFMFTVFSLFWRTKTKPSNSKLPPGPPKLPLIGNLHQLGAMPH
HGLTKLSHQYGPLMHIKLGALSTIVVSSPEMAKEVLKTHDIIFANRPYLLAADVISYGSK
GMSFSPYGSYWRQMRKICTFELLTPKRVESFQAIREEEASNLVREIGLGEGSSINLTRMI
NSFSYGLTSRVAFGGKSKDQEAFIDVMKDVLKVIAGFSLADLYPIKGLQVLTGLRSKVEK
LHQEVDRILEKIVRDHRDTSSETKETLEKTGEDLVDVLLKLQRQNNLEHPLSDNVIKATI
LDIFSAGSGTSAKTSEWAMSELVKNPRVMEKAQAEVRRVFGEKGHVDEANLHELSYLKSV
IKETLRLHIPVPFLLPRECSERCEINGYEIPAKSKVIINGWAIGRDPNHWTDAKKFCPER
FLDSSVDYKGADFQFIPFGAGRRMCPGSAFGIANVELLLANLLFHFDWNMPNGKKPEELD
MSESFGLSVRRKHDLYLIPSICLSFGN
>CYP71D105 Glycine max (soybean, Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 10/15/2008
Clone C19
63% to 71D11, 65% to 71D100, 70% to 71D81 Medicago
Gm0049x00336:peptide
scaffold_49:5668495..5665897 (- strand)
Chr20 (+) strand Glyma20g00980.1 Gm20:678927..681649 (+ strand)
Unneeded extension
5668558 MATKLVIFIRTFRTPYWSLVT
MDSEVLNLLALILPFLLFVIVALKIGRRNLKKSESTPKI
PPGPWKLPIIGNILHLVTSTPHRKLRDLAKIYGPLMHLQLGELFIIVVSSAEYAKEIMKT
HDVIFAQRPHSLASDILSYESTNIISAPYGHYWRQLRKICTVELFTQKRVNSFKPIREEE
LGNLVKMIDSHGGSSSINLTEAVLLSIYNIISRAAFGMKCKDQEEFISVVKEAITIGAGF
HIGDLFPSAKWLQLVSGLRPKLDIIHEKIDRILGDIINEHKAAKSKAREGQDEAEEDLVD
VLLKFKDGNDRNQDICLTTNNIKAIIL
680249 DIFGAGGETSATTINWAMAEMIKNPRAMNKAQL
EVREVFDMKGMVDEICIDQLKYLKSVVKETLRLHPPAPLLLPRECGQTCEIHGYHIPGKS
KVIVNAWTIGRDPNYWTEAERFHPERFFDSSIDYKGTNFEYIPFGAGRRICPGITLGLIN
VELTLAFLLYHFDWKLPNGMKSEDLDMTEKFGVTVRRKDDLYLIPVTSRPFLVR
5666598 680869
>CYP71D105
chr20 (+) strand Glyma20g00980.1 Gm20:678927..681649 (+ strand)
678972
MDSEVLNLLALILPFLLFVIVALKIGRRNLKKSESTPKIPPGPWKLPIIGNIL
HLVTSTPHRKLRDLAKIYGPLMHLQLGELFIIVVSSAEYAKEIMKTH 679271
679272
DVIFAQRPHSLASDILSYESTNIISAPYGHYWRQLRKICTVELFTQKRVNSFKPIR
EELGNLVKMIDSHGGSSSINLTEAVLLSIYNIISRAAFGMKCK 679571
679572
DQEEFISVVKEAITIGAGFHIGDLFPSAKWLQLVSGLRPKLDIIHEKIDRILGDII
NEHKAAKSKAREGQDEAEEDLVDVLLKFKDGNDRNQDICLTTNN 679871
679872
IKAIIL (0) 679889
680249 DIFGAGGETSATTINWAMAEMIKNPRAMNKAQL
EVREVFDMKGMVDEICIDQLKYLKSVVKETLRLHPPAPLLLPRECGQTCEIHGYHIPGKS
KVIVNAWTIGRDPNYWTEAERFHPERFFDSSIDYKGTNFEYIPFGAGRRICPGITLGLIN
VELTLAFLLYHFDWKLPNGMKSEDLDMTEKFGVTVRRKDDLYLIPVTSRPFLVR 680869
>CYP71D106 CYP71D159 Glyma01g38630.1:peptide 93% to 71D8
Gm01:50667387..50669162 (- strand)
model is short
50669226 MEYSPLSIVI
TFFVFLLLHWLVKIYKQKSRYKLPPSPWRLPIIGNLHQLALAASLPDQALQKLVRKYGPL
MHLQLGEISALVVSSPKMAMEVMKTHDVHFVQRPQLLAPQFMVYGATDIVFAPYGDYWRQIRKICTLELLSAKRVQSFSH
IRQDENRKLIQSIHSSAGSSIDLSGKLFSLLGTTVSRAAFGKENDDQDELMSLVRKAITMTGGFELDDMFPSLKPLHLLT
RQKAKVEHVHQRADKILEDILRKHMEKRTIGKEGSNEAEQEDLVDVLLRLKESGSLEVPMTMENIKAVIWNIFASGTDTP
ASTLEWAMSEMMKNPRVREKAQAELRQTFKGKEIIRETDLEELSYLKSVIKETLRLHPPSQLIPRECIKSTNIDGYDIPI
KTKVMINTWAIGRDPQYWSDAERFIPERFDDSSIDFKGNSFEYIPFGAGRRMCPGITFGLASITLPLALLLYHFNWELPN
KMKPADLDMDELFGLTVVRKNKLFLIPTIYEAS*
>CYP71D107P CYP71D167P Chr1 cypnew chr1 64% to 71D145 Glyma01g38620.1
50665300
TFFLFLLLHWLIKKYKSKSSHT
50665234 LSPGPRKLPLIGTCINLLTVAGSLQYHALRELAHKYEPLMHLQLC
EISAVINCILPKMVAKEIMKTHDLAFVQPQLLSPQTLAYGATNIAFAPYGGDY* 50664938
50664937 RQMRKKCT
&
LELLSAERV*SFSYLLEDETKNYRLHSKIAGSPINLTSRIFS
LLIC*ALLAAFGNKSEDQDEFVSLVRE
50664707
>CYP71D108 CYP71D158 Glyma01g38610.1:peptide Gm01:50657113..50660224 (- strand)
73% to 71D96
MEAQTYFLVIALSLFILLNWLAKYLKLKPNVAHKLPPGPKKLPLIGNMHQLAVAGSLPHRALQKLAHIYGPLMHLQLGEI
SAVVVSSPNMAKEITKTHDVAFVQRPQIISAQILSYGGLDVVFAPYGDYWRQMRKVFVSELLSAKRVQSFSFIREDETAK
FIDSIRASEGSPINLTRKVFSLVSASVSRAAIGNKSKDQDEFMYWLQKVIGSVGGFDLADLFPSMKSIHFITGSKAKLEK
LLNRVDKVLENIVREHLERQIRAKDGRVEVEDEDLVDVLLRIQQADTLDIKMTTRHVKALILDVFAAGIDTSASTLEWAM
TEMMKNSRVREKAQAELRKVFGEKKIIHESDIEQLTYLKLVIKETLRLHPPTPLLIPRECSEETIIGGYEIPVKTKVMIN
VWAICRDPKYWTDAERFVPERFEDSSIDFKGNNFEYLPFGAGRRICPGITFGLASIMLPLAQLLLHFNWELPDGMKPESI
DMTERFGLAIGRKHDLCLIPFVDNL*
>CYP71D109 CYP71D157 Glyma01g38600.1:peptide Gm01:50652996..50654725 (- strand)
88% to 71D96
MEAQACFMFTTLFFFWVLHWLA
YYYKPKTTLSHKLPPGPKKLPLIGNLHQLAMAGSLPHRTLRDLALKYGPLMHLQLGEISSVVVSSPNMAKEIMKTHDLAF
VQRPQFLPAQILTYGQSDIAFAPYGDYWRQMKKICVSELLSAKRVQSFSDIREDETAKFIESVRTSEGSPVNLTNKIYSL
VSSAISRVAFGNKCKDQEEFVSLVKELVVVGAGFELDDLFPSMKLHLINGRKAKLEKMQEQVDKIVDNILKEHQEKRERA
RREGRVDLEEEDLVDVLLRIQQSDNLEIKITTTNIKAIILDVFTAGTDTSASTLEWAMAEMMRNPRVREKAQAEVRQAFR
ELKIINETDVEELIYLKLVIKETLRLHTPSPLLLPRECSKRTIIDGYEIPVKTKVMINAWAIARDPQYWTDAERFVPERF
DGSSIDFKGNNFEYLPFGAGRRMCPGMTLGLANIMLPLALLLYHFNWELPNEMKPEYMDMVENFGLTVGRKNELCLIPVVNDL*
>CYP71D110 CYP71D160 Glyma01g42600.1:peptide Gm01:53822921..53824667 (- strand)
72% to 71D10 joint may need fixing
MVMELHSQNNPFSIYLITSFLFLLFLLFKLVKKSSSNNSTSKLPPGPKTLPLIGNLHQLVGSKSHHCFKKLADKYGPLMH
LKLGEVSNIIVTSKELAQEIMRTQDLNFADRPNLISTKVVSYDATSISFAPHGDYWRQLRKLCTVELLTSKRVQSFRSIR
EDEVSELVQKIRASASEEGSVFNLSQHIYPMTYAIAARASFGKKSKYQEMFISLIKEQLSLIGGFSIADLYPSIGLLQIM
AKAKVEKVHREVDRVLQDIIDQHKNRKSTDREAVEDLVDVLLKFRRHPGNLIEYINDMFIGGGETSSSTVEWSMSEMVRN
PRAMEKAQAEVRKVFDSKGYVNEAELHQLTYLKCIIREAMRLHPPVPMLIPRVNRERCQISGYEIPAKTRVFINAWAIGR
DPKYWTEAESFKPERFLNSSIDFKGTNYEFIPFGAGRRICPGITFATPNIELPLAHLLYHFDWKLPNNMKNEELDMTESY
GATARRAKDLCLIPITVRP*
>CYP71D111 CYP71D122
Gm0043x00075:peptide 92% to 71D112 4207-4209k+
Glyma02g17940.1
model short Gm02:16101502..16103987 (- strand)
MEAQTFFLVIALFFLLHWLAKCYNSSVCHKLPPGPKKLPIIGNLHQL
AEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFG
QMISYGGLGIAFAPYGDHWRQMRKMCATELLSAKRVQSFASIREDEAAKFIDLIRESAGS
PINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYF
ITGKMARLKKLHKQVDKVLENIIKDHHEKNKSAKEDGAEVEDQDFIDLLLRIQQDDTLGI
EMTTNNIKALIL (0)
DIFAAGTDTSSSTLEWTMTEMMRNPTVREKAQAELRQTFREKDIIHESDL
EQLTYLKLVIKETLRVHPPTPLLLPRECSQLTIIDGYEIPAKTKVMVNAYAICKDPQYWT
HADRFIPERFEDSSIDFKGNNFEYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELP
NNMKPEDMDMAEHFGLAINRKNELHLVPFVYDL*
>CYP71D112 CYP71D118
Gm0043x00078:peptide 94% to CYP71D148
Glyma02g17720.1
Gm02:15932477..15934785 (- strand)
MEAQTYFLVIALFFLLHWLAKCYKSSVVSHKLPPGPKKL
PIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSF
LQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSAKRVQSFASIREDEAAKFI
NSIREAAGSPINLTSQIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADV
FPSIPFLYFITGKMAKLKKLHKQVDKVLENIIREHQEKKKIAKEDGAEVEDQDFIDLLLK
IQQDDTMDIEMTTNNIKALIL (0)
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQTFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQ
PTIIDGYEIPTKTKVMVNAYAICKDPKYWTDAERFVPERFEDSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLAL
LLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLVPLVSDH*
>CYP71D113 CYP71D164
Glyma02g40150.1:peptide revised 54% to 71D124
Gm02:45358150..45360601 (- strand)
MEHQLITFLSFLLYSLSFILFLFQILKVGKRSKVKTMNLPPGPWKLPIIGSIHHMIGFLPHHRLRELALKHGPLMHLKLG
EVPAIVVSSPEVAKEVMKTYDSIFAQRPHQVGADIMCYGSTDIATAPLGGYWKQLRRICSQELLSNKRVRSYQSIREEEV
LNLMRLVD ANTRSCVNLSEKVSCMTSAITARATFGEKCKDQED
FISLVKKLLKLVERLFVFDIFPSHKWLHVISGEISKLEELQREYDMIIGNIIRKAEKKTGE
VEVDSLLSVLLNIKNHDVLEYPLTIDNIKAVML (0)
NMFGAGTDTSSAVI
EWTMSEMLKNPRVMTKAQEEVRRVFGSKGYTNEAALEDLKFLKAVIKETLRLHPPFPLLLPRECRETCEVKGYTIPAGTK
VIVNAWAIARDPKYWSEAEKFYPERFMDSPIDYKGSNHELIPFGAGRRICPGISFGVSSVELCLAQLLYYFNWELPNGNK
ENDLEMTEALGASSRRKTDLTLKVLVTVKAVNLC*
>CYP71D114 CYP71D163
Glyma02g46840.1:peptide
Gm02:50681153..50683602 (- strand)
87% to 71D150
MEMELHISLSTILPFFILVFMLIINIVWRSKTKNSNSKLPPGPRKLPLIGNIHHLGTLPHRSLARLANQYGPLMHMQLGE
LSCIMVSSPEMAKEVMKTHDIIFANRPYVLAADVITYGSKGMTFSPQGTYWRQMRKICTMELLAPKRVDSFRSIREQELS
IFVKEMSLSEGSPINLSEKISSLAYGLISRIAFGKKSKDQEAYIEFMKGVTDTVSGFSLADLYPSIGLLQVLTGIRPRVE
KIRRGMDRIIDNIVRDHRDKNSDTQPVVGEENGEDLVDVLLRLQKNGNLQHPLSDTVVKATIMDIFSAGSETTSTTMEWA
MSELVKNPRMMEKAQIEVRRVFDPKGYVDETSIHELKYLRSVIKETLRLHTPVPLLLPRECSERCEINGYEIPAKSKVIV
NAWAIGRDPNYWIEAEKFSPERFIDCSIDYKGGEFQFIPFGAGRRICPGINLGIVNVEFSLANLLFHFDWKMAPGNSPQE
LDMTESFGLSLKRKQDLQLIPITYHTAA*
>CYP71D115P CYP71D162P Glyma02g46830.1:peptide 85% to 71D150
pseudogene missing I-helix
Gm02:50677600..50680052 (- strand)
MLELHIISLSTILPFFFLVFTIINTLWR
SKTKNSNSKLPQGPRKLPFIGSIQHLGTLPHRSLARLASQYGPLMHMQLGELCCIVVSSPQMAKE
VMNTHDIIFANRPYV
AADVITFGSKGMTFSPQGTYWRQMRKICTMELLAPKRVESFRSIRERELSFFVR
EISLIEGSPINLSEKITSLAYGL
LSRIVFGKKSKD
QEAYMVHMKGVVETIEGFSLADLYPSIGLLQVLTGIKTRVEKIQRGMDTILENI
VRDHRNKTLDTQAIGEENGEYLVDVLLRL
VKNPRVMEKVQIEVR
RVFNGKGYVDETSIHELKYLRSVIKETLRLHPPSPLMLSRECSKRCEINGYEIQIKSKVIVNAWAIGRDPKYWIEAEKFS
PERFIDCSIDYEGGEFQFIPYGAGRRICPGINFGIVNVEFSLANLLFHFDWKMAQGNGPEELDMTESFGFLNYLYHHLYF
SV*
>CYP71D116 CYP71D161
Glyma02g46820.1:peptide 94% to 71D110 Gm02:50670896..50672756 (- strand)
MVMELQSQNNPFSIHLITFFLFLLFLLFKLVKKSSSNNTSKLPPGPKTLPLIGNLHQLVGSKSHHCFKKLADKYGPLMHL
KLGEVSNIIVTSKELAQEIMRTQDLNFADRPNLVSTKIVSYNATSISFAPHGDYWRQLRKLCTVELLTSKRVQSFRSIRE
DEVSELVQKIRAGASEEGSVFNLSQHIYPMTYAIAARASFGKKSKYQEMFISLIKEQLSLIGGFSLADLYPSIGLLQIMA
KAKVEKVHREVDRVLQDIIDQHKNRKSTDREAVEDLVDVLLKFRSENELQYPLTDDNLKAVIQDMFIGGGETSSSTVEWS
MSEMVRNPWAMEKAQAEVRKVFDSKGYVNEAELHQLTYLKCIIREAMRLHPPVPLLIPRVNRERCKINGYEIPAKTRVFI
NAWAIGRDPKYWTEAESFKPERFLNSSIDFKGTNYEFIPFGAGRRICPGISFATPNIELPLAHLLYHFDWKLPNNMKNEE
LDMTESYGATARRAKDLCLIPITVRP*
>CYP71D117P CYP71D173P chr2 identical to CYP71D118P adjacent (tandem duplication)
no gene model
INMGRTLEDLWKD
50659704 EVRKEFDNKGYVDEADLHQLIYLKCTIRDAMRLHSPVP 50659591
>CYP71D118P CYP71D172P chr2 nearly identical to CYP71D152P on chr14, no gene model
INMGRTLEDLWKD
50644472 EVRKEFDNKGYVDEADLHQLIYLKCTIRDAMRLHSPVP 50644359
>CYP71D119P CYP71D129P 85% to 71D127
Gm0003x00281:peptide 2190-2193k+
Chr5 Glyma05g28540.1
34376706 MELLFPFSLLFTFACILLALFNTLNRSNSKNLPPG
WKLPLLGNIHQFLGPLPHQTLANLA
NQHGPLMHLQLGEKPHII
SSADIAKEIMKTHDAIFANRPHLLASKFFVYDSSDI
FSSYGRAWRQLK
KKFCISELLNAKHV*SLRHTREKEATKLVRNVYANEGSIINLTTKEIESV
TIAIIARAANGTKCKDQEAFVSTMEQMLVLLGGFSIADFYPSIKVLPLLT
TGMKTRVERAQRENDKILEHMVKDHQENRNKHGVTHEDFIDILLKTQKR
DDLEIPMTHNNIKALIW
DMFAGGTAAPTAVTVWAMSEHMKNPKVMEKAHTEIRKVFNVKGYVDETGLRQCQYLNSVI
KETKRLH
PPEALLVSRENSEACVINGYEIPAKSKVIINAWAIGRES
NSYDFSGTNFEYIPFGAGRRICPGAAFSMPYMLLSVANLLYHFVWELPNGAIHQELDM
THESFGLTVKRANDLCLIPIPYHPTS*LGH 34373704
>CYP71D120P CYP71D117P Scaffold_236 no gene model 59% to CYP71D103P
chr6 no gene model 555-556k-
9786
VILASRPLSVAAHIMYY*STATTGIVYAPFEKYWRVLQKMCTIE
LFTQKCVNSFHSI*EQELANLIQMIDSHKGSPIN*PHSTMLSSIFSITSRVAFGKK 10085
10086
CKEHKEFITLVK*GVAAA*GYLFSSSRWLQFAT
GLRPKFERLHVQVDRILEKIIIQHKEAKSRVKEG*KNEDLVDILLRFLDGNDINKDVCLI 10364
10365
NDNIKAIIL (0)
>CYP71D121P CYP71D165P
Glyma06g36270.1:peptide Gm06:38392060..38393115 (- strand)
NILPGPWKLPIIGNIPHLVTSAPHKKLRDLAKKYGPLMHLKLDAKEVMKIHDLKFSSRPQVLA
IAFAPYGNYWRRLRKTCTLDC
>CYP71D122 CYP71D166 Glyma07g20080.1 79% to 71D105 Gm07:20353915..20355733 (- strand)
bad boundary
20355761 MDSQILNSLALILPFLLFMILALKIGRNLKKTESTP
NIPPGPWKLPIIGNVPHLVTSAPHRKLKDLAK
(2)
(1) VYGPLMHLQLGEVFTVIVSSAE
YAKEIMKTHDVIFATRPHILAADIFSYGSTNTIGAPYGNYWRQLRKICTVELLTQKRVNSFKPIREEELTNLIKMIDSHK
GSPINLTEEVLVSIYNIISRAAFGMKCKDQEEFISAVKEGVTVAGGFNVADLFPSAKWLQPVTGLRPKIERLHRQIDRIL
LDIINEHKDAKAKAKEDQGEAEEDLVDVLLKFPDGHDSKQDICLTINNIKAIILDIFGAGGETAATAINWAMAEMIRDPR
VLKKAQAEVRAVYNMKGMVDEIFIDELQYLKLVVKETLRLHPPVPLLVPRVCGESCGIGGYHIPVKSMVIVNAWAIGRDP
NYWTQPERFYPERFIDSSIEYKGTNFEYIPFGAGRRLCPGITFGLKNVELALAFLLFHFDWKLPNGMKNEDLDMTQQFGVTVRRKADLFLIPITSRPILVRKLP*
20353843
>CYP71D123P CYP71D168P chr7 ~70% to CYP71D169 Glyma07g20440.1
20725943 HLCIKD*NSSCRCIML*IHKF
PLMHRQLGWSSQLLFPHHDSIFASRTKILVVDVLCYESTSL
IFAPYGNYWR*LRKICTVELFTQRHVNSFKPI 20725785
REGELINLVKMIDSHKG
MKCKD*KEFISVVKEGLLVGVGFNIVDLY
20714290 DVFGVGGESSATTIIWTRVEMVKNPSVMKKAQLKVREVFDMKRMV
DEICMVELKYLES
HIPVKSKVIVNAWEIGRDPNY
SIDYKGTNFEHIPFGARRRKCPGSSCGLINVELALAFLFYHFDWKLP 20713913
NGMKSEGLNMTKQSGMAVRRNELYLI 20713836
>CYP71D124 CYP71D149 scaffold_34 1685-1687k+
81% to 71D105
chr7 Glyma07g20430.1 Gm07:20663400..20666558 (- strand)
20666469 MDSEVHNMLAVIMSFSLFIIVALKIGRNLKKTESSPNIPPGPWKLPIIGNIHHLVTCTPHRK
LRDLAKTYGPLMHLQLGEVFTIIVSSPEYAKEIMKTHDVIFASRPKILASDILCYESTNI
VFSPYGNYWRQLRKICTVELLTQRRVNSFKQIREEEFTNLVKMIDSHKGSPINLTEAVFL
SIYSIISRAAFGTKCKDQEEFISVVKEAVTIGSGFNIGDLFPSAKWLQLVTGLRPKLERL
HGKTDRILKEIINEHREAKSKAKEDQGEAEEDLVDVLLKFQDGDDRNQDISLTINNIKAIIL (0)
DVFAAGGETSATTINWAMAEIIKDP
RVMKKAQVEVREIFNMKGRVDEICINELKYLKSVVKETLRLHPPAPLLIPRECGQTCEIN
GYHIPVKSKVFVNAWAIGRDPKYWTEPERFYPERFIDSSIDYKGNNFEFTPFGSGRRICP
GITLGSVNVELALAFLLYHFHWKLPNGMKSEELDMTEKFGASVRRKEDLYLIPVICHPLQ
VRKTITFEFVFTPLILKESLYLLLF* 20664210
>CYP71D125 CYP71D128
Gm0068x00077:peptide 65% to 71D158 scaffold_68:561198..562897 (+ strand)
Chr7 Glyma07g39710.1 Gm07:44107519..44109530 (- strand)
MSFKLYSFIHT
44109473 MELRPSFLVLTSFLLLLLWLARIYKQKIKVRSVVHKLPPGPWKLPLIGNLHQLAGAGTLPHHTLQNLSRKYGPLMHLQLG
EISAVVVSSSDMAKEIMKTHDLNFVQRPELLCPKIMAYDSTDIAFAPYGDYWRQMRKICTLELLSAKRVQSFSFIREEEV
AKLIQSIQLCACAGSPVNVSKSVFFLLSTLISRAAFGKKSEYEDKLLALLKKAVELTGGFDLADLFPSMKPIHLITRMKA
KLEDMQKELDKILENIINQHQSNHGKGEAEENLVDVLLRVQKSGSLEIQVTINNIKAVIWDIFGAGTDTSATVLEWAMSE
LMKNPRVMKKAQAEIREAFRGKKTIRESDVYELSYLKSVIKETMRLHPPVPLLLPRECREPCKIGGYEIPIKTKVIVNAW
ALGRDPKHWYDAEKFIPERFDGTSNDFKGSNFEYIPFGAGRRMCPGILLGIANVELPLVALLYHFDWELPNGMKPEDLDM
TEGFGAAVGRKNNLYLMPSPYDHSLNHFIVN* 44107774
>CYP71D126P CYP71D148P
scaffold_68 84% to 71D158
chr7 Glyma07g39700.1
Gm07:44104581..44105960 (- strand)
564678
44105993
MEAQFFLAVIKFFLSLLVLLLAKNYKQKGLHKLPPGPWKLPIIGNLLQVEAASSLPHRAFRELAQK
YGPLMHLQLGEISAVIVSSPL
IAMEIMKTHDLAFAQRPKFLASDIIGYGLVDIFAPYGDY*RQMKKICTLE
SATKVQSFSPNREEVAKL
ERIQSSAGAPINLTGMINSFISTFV
FGNITTENCEGFLSIVKETIEVADGFDLADMFPSFKPMHFITGLKAKLDKMHNKVDKILD
KIIKENQANKGMGEEKNENLVE
DIFAAGTDTSAKVIEWAMSEMMRNPGGREKAQAEIRQTF*GKEAISESNMGELNYLK
ETLRLHPPAPLLLPRECREACRIYGYDIPIKTKVIVNAWAIGRDPEH*HDAESFIPERFH
GASIDFKGTDFEYIPFGAGRRMCPGISFGMASVEFALAKLLYH
QGMKPEELDMEEAFGAEAGRKNNLHLIPIPYNPSIHHDNCK*GTFI*
44104450 566218
>CYP71D127 CYP71D119
Gm0352x00002:peptide scaffold_352:15162..20294
(+ strand)
54% to 71D104
Glyma08g11570.1 Gm08:8425769..8430957
(- strand)
MELLIPFSLLFTFACILLALFNTLNRSNSKILPPGPWKLPLLGNIHQFFGPLPHQTLTNLANQHGPLMHLQLGEKPHIIV
SSADIAKEIMKTHDAIFANRPHLLASKSFAYDSSDIAFSSYGKAWRQLKKICISELLNAKHVQSLRHIREEEVSKLVSHV
YANEGSIINLTKEIESVTIAIIARAANGKICKDQEAFMSTMEQMLVLLGGFSIADFYPSIKVLPLLTGMKSKLERAQREN
DKILENMVKDHKENENKNGVTHEDFIDILLKTQKRDDLEIPLTHNNVKALIWDMFVGGTAAPAAVTVWAMSELIKNPKAM
EKAQTEVRKVFNVKGYVDETELGQCQYLNSIIKETMRLHPPEALLLPRENSEACVVNGYKIPAKSKVIINAWAIGRESKY
WNEAERFVPERFVDDSYDFSGTNFEYIPFGAGRRICPGAAFSMPYMLLSLANLLYHFDWKLPNGATIQELDMSESFGLTV
KRVHDLCLIPIPYHPTSKLGHL*
>CYP71D128P CYP71D123P
scaffold_146:154194..152668 (- strand)
Gm0146x00016:peptide, bad boundary at beginning of exon 2
Missing 22 aa, 83% to 71D10
Glyma08g19410.1
Gm08:14674572..14676375 (- strand)
MVMEVHDHTSYLIYFISSIIVFALFKLVQRSDSKTSSTCCKLPPGPRTLPLIGN
MHQFVGSLPVHHCLKNLADNYGPLMHLKLGEVSNIIVTSQEMAQEIMKTRDLNFSDRPNLVSSRIVSYNGSNIVFSQHGE
YWRQLRKICTVELLTAKRVQSFRSIREEEVAELVKKIAATASEAEGSNIFNLTENIYSVTFGIAARAAFGKKSRYQQVFI
SNIDKQLKLMGGFSVADLYPSSRVLQMMGASGKLEKVHKVTDRVLQDIIDEHKNRTRSSSNEECEAVEDLVDVLLKFQKE
SSEFPLTDENIKAVIQ (0)
XXXXXXXXXXXXXXXXXXXXXX
RNPMVMEQAQAEVRRVYDRKGHVDETELHQLVYLKSIIKETLRLHPPVPLLVPRVSRER
CQINGYEIPSKTRVIINAWAIGRNPKYWAEAESFKPERFLNSSIDFRGTDFEFIPFGAGRRICPGITFAIPNIELPLAQL
LYHFDWKLPNKMNIEELDMKESNGITLRRENDLCLIPIARQP*
>CYP71D129 CYP71D114 Gm0186x00073:peptide scaffold_186:572228..570106 (- strand)
missing intron and about 36 aa at lower case region QSEAEEDLVDVLIQYEDGSKKDFSLTRNKIKAIIL
This may be a pseudogene
Glyma08g43930.1:peptide Gm08:43701221..43703343 (- strand)
MALLFLYFSALISFIFLTLIVQKIGRKPKKTDDTTFKIPDGPRKLPIIGNIYNLLSSQPHRKLRDMALKYGPLMYLQLGE
VSTIVISSPECAKEVMKTHDINFATRPKVLAIDIMSYNSTNIAFAPYGNYWRQLRKICTLELLSLKRVNSYQPIREEELS
NLVKWIDSHKGSSINLTQAVLSSIYTIASRAAFGKKCKDQEKFISVVKKTSKLAAGFGIEDLFPSVTWLQHVTGVRPKIE
RLHQQADQIMENIINEHKEAKSKAKEKD
fphnsssmqa
DIFGAGGETSATTIDWAMAEMVKNSGVMKKAQAEVREVFNMK
GRVDENCINELKYLKQVVKETLRLHPPIPLLLPRECGHTCEIQGYKIPAKSKVVINAWAIGRDPNYWTEPERFYPERFID
STIEYKGNDFEYIPFGAGRRICPGSTFASRIIELALAMLLYHFDWKLPSGIICEELDMSEEFGVAVRRKDDLFLVPFPYH
PLPFILTSQ*
>CYP71D130P CYP71D151P scaffold_186 chr8 43669-43671k-
539917 43671032 DLDKLTYLKCVIKETIKLHPPTPLLLPRESKEKCQINGYEILARTRVFINAW
AIGRDPKYWINAETFKPERFLDSSIDYKGTNFEFIPFGAGRR
539635
PGIAFAIADIELPLAHLLYHFDWKLPNGIKLEELDMSESFGLSARRKN 43670607 539492
>CYP71D131P CYP71D150P scaffold_186
chr8 no gene model
538642
KYGPLIHLK & 538616
538616 43669731 NLKMGKLTNVVVSSHEVCREIIKAQDAIFLSKPFLLSAT 43669624 538500
537281 43668396 LVYHDASNITYSPYGSYWRQLRKICTRRL 43668310 537195
>CYP71D132P CYP71D130P 78% to 71D129
scaffold_5 7776-7777k+
chr9 no gene model
30596592 KIPKKTDDKTCKIPDCHPTNAPII
GNIYNVLSFQPHIKLKGMTLKYGP
LGELSTIMISYPESAKEVMKTHDINFATRPKVLAIDIMSYNSTNIAFDP*GNYWRQLRKF
FMLELLNLKCVKSY*PIREEEVSNVLKLINSHKGASLNLTQPVLSSIYTIASRASFGNKS
KDQQKFISVVKKISKLVVG
FGIEDLFPSAT*LQHVTGVRPMIDRLHQQVDQIMENIIN 30595899
>CYP71D133 CYP71D121 68% to 71D160
Gm0075x10021:peptide scaffold_75:622918..620679
(- strand)
Glyma09g41570.1
Gm09:46236154..46238801 (+ strand)
N-term in model is wrong
MTNIVAIISFSLILIVVL
MKIVRNHKKTKPTPNVPPGPWKLPVIGNVHQIITSAPHRKLRDLAKIYGPLMHLQLGEVTTIIVSSPECAKEIMKTHDVI
FASRPRGVVTNILSYESTGVASAPFGNYWRVLRKMCTIELLSQKRVDSFQPIREEELTTLIKMFDSQKGSPINLTQVVLS
SIYSIISRAAFGKKCKGQEEFISLVKEGLTILGDFFPSSRWLLLVTDLRPQLDRLHAQVDQILENIIIEHKEAKSKVREG
QDEEKEDLVDILLKLQDGDDSNKDFFLTNDNIKATILEIFSAGGEPSAITIDWAMSEMARDPRVMKKAQDEVRMVFNMKG
RVDETCINELKYLKSVVKETLRLHPPGPLLLPRESTQECKIHGYDIPIKSKVIVNAWAIGRDPNYWNEPERFYPERFIDS
SIDYKGNNFEYIPFGAGRRICPGSTFGLVNVEMALALFLYHFDWKLPNGIQNEDLDMTEEFK
VTIRRKNDLCLIPVSPPCSVVAMYSS*
>CYP71D134 CYP71D136 1 aa diff to 71D143, 71D144, 71D138, 71D135, 71D147, 71D138
Glyma10g12700.1 Gm10:14237260..14239000 (- strand)
14239000
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMH
LQLGEISAVVASSPKMAKEIVKTHDVSFLQ 14238701
14238700
RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPIN
LTSRIFSLICASISRVAFGGIYKEQDEFVV 14238401
14238400
SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQ
DFIDLLLRIQQDDTLDIQMTTNNIKALIL (0) 14238104
14237868
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLL
PRECSQPTIIDGYEIPAKTKVMVNAY 14237569
14237568
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMK
PEEMNMDEHFGLAIGRKNELHLIPNVNL* 14237260
>CYP71D135 CYP71D137 CYP71D143 100%
Glyma10g12710.1 Gm10:14264023..14265779 (+ strand)
14264023
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYG
PLMHLQLGEISAVIASSPKMAKEIVKTHDVSFLQ 14264322
14264323
RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINL
TSRIFSLICASISRVAFGGIYKEQDEFVV 14264622
14264623
SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQD
FIDLLLRIQQDDTLDIQMTTNNIKALIL (0) 14264919
14265171
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPL
LLPRECSQPTIIDGYEIPAKTKVMVNAY 14265470
14265471
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMK
PEEMNMDEHFGLAIGRKNELHLIPNVNL* 14265779
>CYP71D136P CYP71D138P CYP71D137P 100%
Glyma10g12780.1
Gm10:14303796..14304902
(+ strand)
14303723
SRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREH
QEKNKIAKEDGAELEDQDFIDLLLRIQQDD 14304022
14304023
TLDIQMTTNNIKALIL (0) 14304070
14304306
DIFAAGTDTSASTLEWAMAEMMRNPRVWEKAQAELRQAFREKEIIHESDLEQLTYLK
LVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY 14304605
14304606
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPL
ALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 14304914
>CYP71D137 CYP71D139 CYP71D134 100%
Glyma10g12790.1
Gm10:14378072..14381581 (- strand)
14381581
MEAQTYFLVIALFFLLHLLAKYYKLKTNVSHTLPPGPKKLPIIGNLHQLAAAGSLPHHALKKLSKKYGP
LMHLQLGEISAVVASSPKMAKEIVKTHDVSF 14381282
14381281
LQRPYFVAGEIMTYGGLGIAFAQYGDHWRQMRKICVTEVLSVKRVQSFASIREDEAAKFINSIRESAGSTINL
TSRIFSLICASISRVAFGGIYKEQDEF 14380982
14380981
VVSLIRRIVEIGGGFDLADLFPSIPFLYFITGKMAKLKKLHKQVDKLLETIVKEHQEKHKRAKEDGAEIED
EDYIDVLLRIQQQSDTLNINMTTNNIKALIL (0) 14380676
14378692
DIFAAGTDTSASTLEWAMTEVMRNPRVREKAQAELRQAFRGKEIIHESDLEQLTYLK
LVIKETFRVHPPTPLLLPRECSQLTIIDGYEIPAKTKVMVNVY 14378393
14378392
AVCKDPKYWVDAEMFVPERFEASSIDFKGNNFEYLPFGGGRRICPGMTFGLATIMLPLA
LLLYHFNWELPNKIKPENMDMAEQFGVAIGRKNELHLIPSVN 14378090
>CYP71D138 CYP71D146 4AA diffs to 71D140P
Glyma10g22120.1
Gm10:28105591..28107347 (- strand)
28107347
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYG
PLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ 28107048
28107047
RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAG
SPINLTSRIFSLICASISRVAFGGIYKEQDEFVV 28106748
28106747
SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNQIAKEDGAEL
EDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL 28106451
28106199
DIFAAGTDTSASTLEWAMAETTRNPTVREKAQAELRQAF*EKEIIHESDLEQLTYLKLVIKETFRVHPPT
PLLLPRECSQPTIIDGYEIPAKTKVMVNAY 28105900
28105899
AICKDSQYWIDADRFVPERFEVSSIDFKGNNFNYLLFGGGRRICPGMTFGLASIMLPLALLLYHFNWELPN
KMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28105591
>CYP71D139P CYP71D145P = CYP71D143P 100%
Glyma10g22100.1
Gm10:28092932..28094478
(- strand)
28094481
QYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQM
RKMCATELLSTKRVQSFASIREDEAAKFIDSIRE 28094182
28094181
SAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFL
TGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKED 28093882
28093881
GAELEDQDFIDLLRIQQDDTLDIQMTTNNIKALIL (0) 28093777
28093525
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDQEQLTYLKLVIKETFKVHPPTPL
LLPRECSQPTIIDGYEIPAKTKVMVNAY 28093226
28093225
AICKDSQYWIDADRFVPERFEGSSIDFKGNKFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNK
MKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28092917
>CYP71D140P CYP71D144P pseudogene, second exon 100% to CYP71D138, 71D137, 71D140, 71D143, 71D147
Glyma10g22090.1
Gm10:28074448..28077153
(- strand)
28077153
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRD
LAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ 28076854
28076853
RPHLVFGQMISYGGLGIAFAPYGDHWRQTRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSP
INLTSRIFSLICASISRVAF (insertion) 28076590
28075623
GGIYKDQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEK
NKIAKEDGAELEDQDFIDLLRIQQDDTL (small deletion of 14 aa) 28075334
28075056
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHP
PTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY 28074757
28074756
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELP
NKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28074448
>CYP71D141 CYP71D143 1 aa diff to 71D138, 71D137, 71D144, 71D143, 71D138, 71D147
Glyma10g22080.1
Gm10:28059981..28061618
(- strand)
28061705
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHAL
RDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ 28061406
28061405
RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFID
SIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVV 28061106
28061105
SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIA
KEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL (0) 28060809
28060577
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHP
PTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY 28060278
28060277
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWE
LPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28059969
>CYP71D142 71D135, 71D138, 71D137, 100% match
Glyma10g22070.1
Gm10:28042082..28043822 (- strand)
28043822
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHAL
RDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ 28043523
28043522
RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFID
SIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVV 28043223
28043222
SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVNKVLENIIREHQEKNKIA
KEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL (0)
28042926
28042690
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRV
HPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY 28042391
28042390
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNW
ELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28042082
>CYP71D143 CYP71D141 1 aa diff to CYP71D143, 71D144, 71D138, 71D137, 71D147, 71D138
Glyma10g22060.1
Gm10:28024992..28026748
(- strand)
28026748
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHA
LRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQ 28026449
28026448
RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKF
IDSIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVV 28026149
28026148
SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKN
KIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL (0) 28025852
28025600
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRV
HPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY 28025301
28025300
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFN
WELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28024992
>CYP71D144 CYP71D140 CYP71D142 100%
Glyma10g22000.1
Gm10:27986164..27987920
(- strand)
27987920
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSL
PHHALRDLAKKYGPLMHLQLGEISAVIASSPKMAKEIVKTHDVSFLQ 27987621
27987620
RPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFID
SIRESAGSPINLTSRIFSLICASISRVSFGGIYKEQDEFVV 27987321
27987320
SLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKI
AKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL (0) 27987024
27986772
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFR
VHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY 27986473
27986472
AICKDSQYWIDADRFVPERFQGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWE
LPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 27986164
>CYP71D145 CYP71D111 87% to 71D8
Gm0053x00465:peptide scaffold_53:3861779..3858316
(- strand)
Chr11 4720-4723k+ Glyma11g06660.1
Gm11:4720371..4723868 (+ strand)
3861756 4720410
MEHSQLSIVITFFVFLLLLRLVKNHKPKSSHKLPPGPWKLPIIGNLHQVALAASLPHHALQKLARKYGPLMHLQLGEIST
LVVSSPKMAMEIMKTHDLAFVQRPQLLAPQYMAYGATDIAFAPYGEYWRQMRKICTLELLSAKRVQSFSHIRQDENRKLI
QSIQSSAGSPIDLSSKLFSLLGTTVSRAAFGNKNDDQDEFMSLVRKAVAMTGGFELDDMFPSLKPLHLLTGQKAKVEEIH
KRADRILEDILRKHVEKRTRAKEEGNNSEAQQEDLVDVLLRIQQSGSLEVQMTTGHVKAVIW (0)
DIFAAGTDTSASTLEWAM
AEMMKNPRVREKAQAVIRQAFKGKETIRETDLEELSYLKSVIKETLRLHPPSQLIPRECIKSTNIDGYEIPIKSKVMINT
WAIGRDPQYWSDAERFIPERFDGSYIDFKGNSYEYIPFGAGRRMCPGMTFGLASITLPLALLLYHFNWELPNKMKPEDLD
MNEHFGMTVGRKNKLCLIPTVYQAT* 4723689
>CYP71D146P CYP71D112P
Gm0053x00463:peptide scaffold_53:3841398..3839144
(- strand)
Revised 70% to 71D96
Glyma11g06700.1
3841413 4740843 QSKDRMSYVLGQKVIGSVGGFELADLFPSMKFIHFITGTKAKLEKLLNRVDRVLENIVREHTRK
&
RQIRAKEGRILVDEDEDLVDVLIRVQQADTLDIKMTTRHVKALIL (0)
DVFAGGIDTSASTLEWAMTEM
MKNPRVREKAQAELRQAFREKKIIHESDIEQLTYLKLVIKETLRLHPPTPLLIPRECSEE
TIIAGYEIPVKTKVMINVWAICRDPKYWTDAERFVPERFEDSSIDFKGNNFEYLPFGAGR
RICPGISFGLASIMLPLAQLLLYFNWELPNGMKPESIDMTERFGLAIGRKNDLCLIPFIY
DP* 4742981 3839275
>CYP71D147P CYP71D113P 80% to 71D96
scaffold_53
Chr11 Glyma11g06710.1
3831459 4750796 NTVEGAQASFLLITLFFFLVLYWLATYFY*KPKTTITYKLPPGPKKLPLIGNLHQLAIAG
SLPYLALRDLALKYGPLMHLQLGEISILVVSSPNMAKEIMKTHDLAFVQRPQFLPAQILT
YGQNDIVFALYGDYWRQMKKM &
CVSELLSAKRVQSFSHIREDET
KTRQMQQQVDKIAYNILQEHQEKRDRALQESRVDLEEEDLVDVLLRIQQSDTIKIKITTT
NINAVTL (0)
VFTAGMDTSATTLEWAMAEIMRNPIVRKKAQTEVRQALGELKIIHETDVEELTYLKLVIK
ETLGLRTPSLLLLPRECSERTIIDGYEIPIKTKVMVNVWAIARDPQYWTDAERFVLERFD
DSFIDFKGNNFEYLSFEARRRMCPDMTFGLVNIMLPLYHFNWELPNELKPEDMDMS
ENFGLTIYIGRKSQLCLM 4752389 3829866
>CYP71D148P CYP71D127P 73% to CYP71D165P
scaffold_28 4156k-
chr12 Glyma12g21000.1 (-) strand
22688502 FNSNIPPGPWKLPIIGNIPHLVTSNPHRKLRDLDKKYGPLMHLRLDAKEHTK
22688657
>CYP71D149P CYP71D132P
1089-1092k+
Gm0090x00153:peptide scaffold_90:1090059..1091456
(+ strand)
Chr14 Glyma14g01870.1
1079117 MELHISLSTILPFLILVFMLIINLVWRSKIKNSNSKLPPGPRKLPLIGNIHQLGNLPHRS
RARLANQYSPL
MHMQLGELCCIMVSSPE
MAKEVMNTHDIIFSNRPYVLAADVITYGSKGMTFSPQGTYWRQMRKI
CTMELLAPKHVDSFRSIREQELTIFVKEISLSE
GSPINHSEKISSLAYVLISRIAFGIKSKDQQAYREFMKGV
TDTGAGFSLADLYPSIGLLHVLTGIRTSVEKIHR
GMDRILENIVRDHREKNLDTKAVGEEN
GEDLVDVLLRLQRNGDHQHPMSDCCQSN (0)
DIFSAGSDTSSTIMIWVMSELVKNP
RVMEKVQIEVRRVFDRKGYVDETSIQE
VKYLRSVI*ETLRLHPPLPLLLPRECSERCEINGYEIPTKSKVIVNAWAMGRDPNYWIEAEKFNPERF
LDSSIDYKGAEFEFIPFGAGRRTFPGINLGIV
ANFLFHFDWKMAQGNSPQELDMTESFGLTVKRKQDLQLIPITYHSATS*
1081198
>CYP71D150 CYP71D133 70% to CYP71D104
Gm0090x00154:peptide scaffold_90:1094732..1096763
(+ strand)
Chr14 Glyma14g01880.1
1083908 MGLELHISLSIILPFFLLVFILIITLWRSKTKNSNSKLPPGPRKLPLIG
SIHHLGTLPHRSLARLASQYGSLMHMQLGELYCIVVSSPEMAKEVMNTHDIIFANRPYVLAADVITYGSKGMTFSPQGTY
LRQMRKICTMELLAQKRVQSFRSIREQELSIFVKEISLSEGSPINISEKINSLAYGLLSRIAFGKKSKDQQAYIEHMKDV
IETVTGFSLADLYPSIGLLQVLTGIRTRVEKIHRGMDRILENIVRDHREKTLDTKAVGEDKGEDLVDVLLRLQKNGDLQH
PLSDTVVKATILDIFSAGSDTSSTIMVWVMSELVKNPRVMEKVQIEVRRVFDGKGYVDETSIHELKYLRSVIKETLRLHP
PSPFLLPRECSERCEINGYEIPTKSKVIVNAWAIGRDPNYWVEAEKFSPERFLDSPIDYKGGDFEFIPFGAGRRICPGIN
LGIVNVEFSLANLLFHFDWRMAQGNRPEELDMTESFGLSVKRKQDLQLIPITYHTARS* 1086121
>CYP71D151P CYP71D134P 77% to CYP71D152P
scaffold_90
chr14 no gene model
1115676 1104987 ELWKDGGETLSSTVEWSMSEMVRNP 1115750
1115753
KVMEKAQAELRKAFDIKGYVDEVD*LQLIYFK
1105159 1115848
>CYP71D152P CYP71D125P
scaffold_74 85-86k+ no gene model
chr14 8352k+ no gene model
8352186 NMGRTLEDLWKDGGETSSIEEWSMSQMVRKP
KVMEKAQAEVRKEFDNKGYVDEADLHQLIYLKCTIRDAMRLHSPVP
8352418
>CYP71D153P CYP71D126P
scaffold_74 2712-2713k- no gene model 80% to 71D129
chr14 10978k- Glyma14g12240.1
10978782 EKDFPHNFSSMQV
DIFATGGDTSTTTIDWEMA*MVKISRVMKNTQAEVKEVFNMKGRVDQNCINELN*YLKQV
VRETLRLHPPIPLLVPTECGQTCDIQGYKIRAKSKVVINTWAIGRNPNYWTKPYRFYPER
FIDSTIK 10978292
>CYP71D154P CYP71D169P chr14 Glyma14g14510.1
67% to 71D160
14771370
MDSQMLNSLALIVPFFLFMIVVLKLGRNLKKKTQSYLNITQR
14771244 PCKLPVIGNIHQVVTSTPHQKLRDLAKIYGPMMYLQLEEIFTIIVSL
VEYAK*IMKTHDVNLASRPKILAADMVSYEGTNIAFSPYGNY*K*VQKLCTME 14770945
14770944 LRSSQ 14770930
LEKKSSPIREEELANLVKMVGSHEGTVNVS
14770862
>CYP71D155 CYP71D131 80% to 71D160
Gm0014x10128:peptide scaffold_14:8981884..8984374 (+ strand)
Chr14
Glyma14g14520.1 model has wrong C-term
14831928 MDSQILNSLALILPLFLFMILILKLGRKLKRTELSLNIPRGPWKL
PIIGNLHQLVTSTPHRKLRDLAKIYGP
MMHLQLGEIFTIVVSSAEYAEEILKTHDVNFASRPKFLVSEITTYEHTSIAFAPYGEYWRQVRKICAMELLSPKRVNSFR
SIREEEFTNLVKMVGSHEGSPINLTEAVHSSVCNIISRAAFGMKCKDKEEFISIIKEGVKVAAGFNIGDLFPSAKWLQHV
TGLRSKLEKLFGQIDRILGDIINEHKEAKSKAKEGNGKAEEDLLAVLLKYEEGNASNQGFSLTINNIKA
VTSDIFAGGID
AVATAINWAMAEMIRDPRVMKKAQIEVREIFNMKGRVDESCMDELKYLKSVVKETLRLHPPAPLILPRECAQACEINGFH
IPVKTKVFINVWAIARDPNYWSEPERFYPERFIDSSIDFKGCNFEYIPFGAGRRICPGSTFGLASVELILAFLLYHFDWK
LPNGMKNEDFDMTEEFGVTVARKDDIYLIPVTYNPFLVR* 14829625
>CYP71D156P CYP71D124P
scaffold_156 564-565k- no gene model, 57% to 71D163P
chr14 (-) strand no gene model
47494855 NMFGAGTDTSSAVIEWAMSEMMENPRVMTKAQDEVTINFASNNEI
47494721
>CYP71D157P CYP71D170P chr15 no gene model
82% to 71D129
47190069
PKKIDDKTCKIPDGHLTNLPIIRKYIQLSDMALKYGP*LIL
47189945 LGELSTIVISYLESAKEVMKTHDINFATRPKVLAIDIMYYNS
TNIVFDP*GNYWR*IRKICTLELLSLKRVKSY*PIKEEEL 47189700
SNVVKLIDSHKGPSFKLT*PVLSSIYTIASRAGFGNKC
KDQQKFISVVKKISKLAASFGIEDLFPSATWL*HVTGVRPMIYRLHQQVDQIMENIIN
DKDFPHNSSSIQAILF
47189275
>CYP71D158 CYP71D106 Gm0025x10028:peptide scaffold_25:621105..622673 (+ strand)
62% to 71D8
Glyma17g01110.1 Gm17:622315..624042
(+ strand)
621105
MAVLSSLAVITFFLSLLVLFLAK
NYKQKSLHKLPPGPWKLPIIGNLLQLAAASSLPHHAIRELAKKYGPLMHLQLGEISAVIVSSPNMAKEIMKTHDLAFAQR
PKFLASDIMGYGSVDIAFAPYGDYWRQMRKICTLELLSAKKVQSFSNIREQEIAKLIEKIQSSAGAPINLTSMINSFIST
FVSRTTFGNITDDHEEFLLITREAIEVADGFDLADMFPSFKPMHLITGLKAKMDKMHKKVDKILDKIIKENQANKGMGEE
KNENLVEVLLRVQHSGNLDTPITTNNIKAVIW (0)
DIFAAGTDTSAKVIDWAMSEMMRNPRVREKAQAEMRGKETIHESNLGE
LSYLKAVIKETMRLHPPLPLLLPRECIEACRIDGYDLPTKTKVIVNAWAIGRDPENWHDADSFIPERFHGASIDFKGIDF
EYIPFGAGRRMCPGISFGIANVEFALAKLLYHFNWELQQGTKPEEFDMDESFGAV
VGRKNNLHLIPIPYDPSIHDNGKGGTFI* 622763
>CYP71D159P CYP71D147P chr17 72% to 71D126
20410k+
YKIRAKSKVIINAWEIGRDPKYWIEPDRFYSKRLIDSTIK
>CYP71D160 CYP71D115
Gm0019x10048:peptide 75% to 71D105 scaffold_19:8403106..8405210
(+ strand)
Glyma17g31560.1
Gm17:34696333..34698561 (+ strand)
8403052
MDSQILNSLALILPFFLF
MIVVLKLGRKLKKTEPSLNIPPGPWKLPIVGNLHQLVTSSPHKKFRDLAKIYGPMMHLQLGEIFTIVVSSAEYAKEILKT
HDVIFASRPHFLVSEIMSYESTNIAFSPYGNYWRQVRKICTLELLSQKRVNSFQPIREEELTNLVKMIGSQEGSSINLTE
AVHSSMYHIITRAAFGIRCKDQDEFISAIKQAVLVAAGFNIGDLFPSAKWLQLVTGLRPTLEALFQRTDQILEDIINEHR
EAKSKAKEGHGEAEEEGLLDVLLKFEDGNDSNQSICLTINNIKAVIADIFGGGVEPIATTINWAMAEMIRNPRVMKTAQV
EVREVFNIKGRVDETCINELKYLKSVVKETLRLHPPAPLILPRECQETCKINGYDIPVKTKVFINAWAIGRDPNYWSEPE
RFYPERFIDSSVDYKGGNFEYIPFGAGRRICPGITFGLVNVELTLAFLLYHLDWKLPNGMKNEDFDMTEKFGVTVARKDD
IYLIPATSRPFLVRFCY* 8405288
>CYP71D161P CYP71D116P
scaffold_19 chr17 34788k+ no gene model
8494773 34788219
MDSQILNSLALIIFPFFLSMIVVLKLGRKLMKTKPSLNIPPGPWNLPIIGNVHLLVTSTPH*KLTDL 8494973
8494978
MNLTDTVHSSMYNIISRAAFGMKYKDREKFISMVKEGVKTASDFN 34788558 8495112
>CYP71D162P CYP71D107P exon 2 only
80% to 71D99
Gm0004x00148:peptide scaffold_4:1770183..1771255
(+ strand)
Glyma18g08920.1
Gm18:7651666..7652442
(+ strand)
1770722
DIFGAGGETSATTIDWAMAEMMKNPKVMKKAEAEVREVFNMKVRVDENCINEIKYLKLVVKETL
RLLPPIPLLLPRECGQTCEIHGYLIPAKSKVIVNAWAIGRDPNYWTEPERIYPERFIDSTIDYKQSNFEYIPFGVGRRIC
PGSTFASRIIELALAKLLYHFDWKFPNGMISEE*DMSEEFGVAVRRKDDLFMVPFPVT* 1771330
>CYP71D163P CYP71D108P
60% to 71D10
pseudogene
Gm0004x00152:peptide scaffold_4:1825111..1828599 (+ strand)
Glyma18g08960.1
Gm18:7705852..7709650 (+ strand)
1824801 MDKVFILHVFLTFLLLLVLYKTMKISKSKSSTTNLLPPWPW
()
KLPLIGNLHQLFGSTLPHHVLRNLATKYGPLMHLKLGEVSNIIVSSPEMAKE
IMKTHDIIFSNRPQILVAKVAYNAKDIAFSPCGSYWRQLRKMCKEELLASKRVQCFRSIR
EEEVSALIKTISQSVGFVVNLSEKIYSLTYGITARAALGEKCIHQQEFICIIEEAVHLSG
GLCLADLYPSITWLQMFSVVKAKSEKLFRKIDGILDNIIEDHKNRRRLGQLFDTDQKDLV
DVLLGFQQPNKDIPLDPPLTDDNVKAVIL (0)
DVFSAGTET
SSAVVEWAMSEMVKNPKVMKKAQAEVRRVYNSKGHVDETDLDQLTYFRNNE &
ETMRLHPPAPMLLPRESKEKCEINGYEIPARTRVL &
INAWAIGREPKYWTNAETFK*ETFKSERFLDSSIEYKGTNFEFIPFGAGRRVCPGIAFA
IADIELPLAQLLYHFDWKLPNGSKLEEFDMRESFGLTARRKNGLCLIPIIYHQLNK* 1828599
>CYP71D164P CYP71D109P 10640127 74% to 71D99, scaffold 4
chr18 no gene model 16826k-
10640121
16826720
AKEVMKTHDINFATRPKILPIDIMSYNSINVAFDP*GNYWRQLRKICTLEVLSLKCVNSY*PIREEEQSNLVKVIN*
HKGPSFNLTQPVLTSIYTIASRAAFGNKCKDQEKFISVVKKI*KLAKGF 809 & 16826343
16826341 VIEDLFPSATWLQHATGVRPMIHRLHQQ
16826258 10639659
>CYP71D165P CYP71D110P 17904788 three pieces 65%-80% to 71D105, 71D100
scaffold 4
chr18 no gene model 24121k-
24121158 SNIPPGPWNLLIIGNIPHLLTSTPHQKL*DLAKKYGPFMHLKL
17904686
IPPGPWNLLIIGNIPHLLTSTPHQKL*DLAKKYGPFMHLKLD
AKEVMKMHDLKFSSRPQVLA 17904501
17904472 IAFAPYGNYWRQLRKTCTLDC 17904410
>CYP71D166P CYP71D120P 77% to 71D99
Gm0264x00007:peptide
Glyma18g38290.1
Gm18:45714880..45715314 (+ strand)
KLPIIGNIYDLLSSQPYRKFKRHDLKYGPLMHLQLGEVSTIVISS
PEYAKEVMKIDDINFATRPKVLAIDIMSYN
STSIAFAPYGNYWRQLRKICTLELLSLKRVNS*QPIREEDLSNLVKWIDSHKGSSINLTQ
EVLSSIYTIASRAAFGKKCKDQENFISVVKKTIKVA
DLFPYATWLQHFIGVTPKIERLHQQADQIME
NIIN*NKEGKSKDKGNQSEAEKDLVDVLTQYEDGSKPNFSLT*NNIKAIIL
>CYP71D167P CYP71D152P
chr20 (-) strand Glyma20g00940.1 Gm20:659638..661331 (- strand)
661348
LTHVFRVIVSSAEYTKEIMKTHDVT
661273
FASRPLILAADILSYGSTNIIGSPYGNYWRLLRKICTVELLT*KRINSFKPIREEELTNLIKMIDSHK 661070
661074
INLTEAVLLSIYNIISRAAFGMTCKDQEEFISAVKEGVTVAGGFNLGNLFPSAKW
LQLVTGLRPKIERLHRQIDRILLDIINEHREAKAKAKEGQQGEAE 660775
660774
EDLVDVLLKFQDGNDSKLDICLTINNIKAML
(0) 660682
660186
DIFGAGGETAATAINWAMAKMIRDPRVLKKAQAEVREVYNMKGK
VDEICIDELKYLKLVVKETLRLHPPAPLLLPRACEIDGYHISVKSMVI 659911
659910
VNAWAIGRDPKYWSEAERFYPERFIDSSIDYK
GGNFEYIPFGAGRRICPGSTFGLKNVELALAFLLFHFDWKLPNGMKNEDLDMTEQSGVTVTRKADL 659617
FLIHITFRPIMVRK 659575
>CYP71D168P CYP71D153P
chr20 (+) strand Glyma20g00960.1 Gm20:670200..671805
(+ strand)
670059
MDFQIFDMLAPISLFLFMIVALKLGRNLTKTKSIPTYPLAHGSYLT*ETYPIL
LHLLHIEN*ET*PKNMDP*CI*NLGTSTIVVSSAEYAKE 670334
670335
VMKIHDL*FSSRPQVLAGKIIGYDKKTIAFAPYGNYWRQLRKNCTLELFTIKRINSFRPIREEEFNILIKRIA
SANGSTCNLTMAVLSLSYGIISRAAFL
670634
670635
QRPREFILLTEQVVKTSGGFNIGEFFPSAPWIQIVAGF
KPELERLFIRNDQILQDIINEHKDHAKPKGKEGQGEVAEDMVDVLLKFQ 670895
DMGGENQDASLTDDNIKAVI (0)
670955
671222
KMFASGGETSANSINWTMAELMRNPRVMKKA
QAEVREVFNMKGRVDETCINQMKYLKAVAKETMRLHPPVPLLFPRECGEACEIDGYHHIPVK 671500
671501
SKVIVSAWAIGRDPKYWSEAERLYLERFFASSIDYKGT
SFEFISFGAGRRICPGGSFGLVNVEVALAFLLYHFDWKLPNRMKTEDLDMTEQFGLTV 671788
671789
KRKKDLYLIPSLAT* 671833
>CYP71D169 CYP71D154
chr20 (+) strand Glyma20g00970.1:peptide Gm20:674339..676944 (+ strand)
674303
MDSELLSILPPIMSFFLFMIVALKIGSNLKKTESSPNI
674417
PPGPWKLPIIGNIHHLVTSAPHRKLRDLAKMYGPLMHLQLGEVFTIIVS
SPEYAKEIMKTHDVIFASRPKILASDILCYESTNIVFSPYGNYWRQLR 674707
674708
KICTLELFTQKRVNSFQPTREKELTNLVKMVDSHKGSPMNFTEAVLLSIY
NIISRAAFGMECKDQEEFISVVKEAVTIGSGFNIGDLFPSAKWLQL 674995
674996
VTGLRPKLERLHRQIDRILEGIINEHKQANSKGYSEAKEDLVDVLLKFQ 675142
DGNDSNQDICLSINNIKAIIL (0) 675205
675705
DIFSAGGDTAASTINWAMAEMIRDSRVMEKVQIEVREVFNMKGR
VDEICIDELKYLKSVVKETLRLHPPAPLLLPRECGQACEINGYH 675968
675969
IPVKSKVIVNAWAIGRDPKYWSEAERFYPERFIDSSIDYKGT
NFEYIPFGAGRRICPGSTFGLINVEVALAFLLYHFDWKLPNGMKSEDLDMTEQF 676256
676257
GVTVRRKNDLYLIPVPSNPFQVR* 676328
>CYP71D170 CYP71D155
chr20 (+) strand Glyma20g00990.1 Gm20:682359..683943
(+ strand)
682096
MDSEVLNILALVVPFFLFMILALKIARNHTITESSPKVPPGPWKLPIIGNIHHLITSTPHRKLR 682287
682286
DLAKIYGPLMHLQLGEVFTIIVSSAEYAKEIMKTHDLIFASRPHTL
VADILAYESTSIITAPYGRYWRQLLKICTVELFTQKRVNSFT 682549
KFARWSFSPKNVSIHSHKGLSINLAEIVVLSIYNIISRAAFGMKSQNQEE
FISAVKELVTVAAGFNIGDLFPSVKWL
682730
682731
QRVTGLRPKLVRLHLKMDPLLGNIISEHKEAKSK
682832
682832
AIEGKDETEEDLVDVLLKFLDVNDSNQDICLTINNMKAIIL (0) 682954
683323
DIFAAGGETATTTINWVMAEIIRDPRVMKKAQVEVREVFNTKG
RVDEICINELKYLKSVVKETLRLHPPAPLLLPRECGQTCEIDGYHIP 683592
683593
VKSKVIVNAWAIGRDPKYWSEAERFYPERFIDSSIDYKGTN
FEYIPFVAGRRICPGSTFGLINVELALAFLLYHFDWKLPNEMKSEDLDMTEEFGL 683880
683881
TVTRKEDIYLIPVTSRPFS 683937
>CYP71D171P CYP71D156P
chr20 (+) strand 87% to 71D124 Glyma20g01000.1 Gm20:685380..686860
(+ strand)
685359
MDSEVLKMLAVIMSFSLFIFVALKIGSNLKKTDSSPKI
685473
PPGPWKIPIIGNIDHFVTSTPHRKLRDLAKIYGPLMHLQLGEIF
TIIVLSPEYAKEIIKTHDVIFASRTKILLADIICYESTSIIFAPYGNYWRQ 685757
685758
LQKICTVELLTQRRVNSFKQIREEELTNLVKMIDSHKGS 685874
PMNFTEAVF*LINNIISRAAFGMKCKDQ
685958
685959
EEFISVVKEAVTIGSGFNIGDLFPSAKWLKLVTGLRPKLERLHWQIDWILEDIINEHKEAKSKAKK
AKVQQRKIWLMFS*NFRMTTIEISLTINNIEAIIL 686261
>CYP71D172P CYP71D135P
scaffold_1 12462k+
chr20 (-) strand Glyma20g16450.1 Gm20:22853217..22853634 (- strand)
NDTSSATITWTMAEMIKNPRIMEKAQAEVRLYFGNEGKPNKSGREYGQACEI
22853510 NRYHIPMKSRVIVNA*GIGRDPNLWTEAERFIESSVDYKGNNFQFI
PFGAGRRMCPGLTFGLSNVECVLAMLMYHFDWKLPNGMKHEDLDMTEIFGITVTRKDNL
22853196
YLIPKTFH
>CYP71D173P CYP71D171P chr20 no gene model
90% to 71D164P
30778178 AKEVMKTHDINFATRPKVLTIDIMSYNSTNVTFDP*GNYWRQLRK
ICTLEFLSLKHVNSY*PIREEELSNLVKVID*HKG 30777949
FVIEDLLPSATWLQHATRVKPMIHRLH*Q
30777717
KDLPHNSSSI
>CYP71D136X discontinued seq
Gm0020x00168:peptide 69% to 71D96 scaffold_20:7260028..7263537
(+ strand)
Chr10 Glyma10g12790.1
14381581 MEAQTYFLVIALFFLLHLLAKYYKLKTNVSHTLPPGPKKLPIIGNLHQLAAAGSLPHHALKKLSKKYGPLMHLQLGEISA
VVASSPKMAKEIVKTHDVSFLQRPYFVAGEIMTYGGLGIAFAQYGDHWRQMRKICVTEVLSVKRVQSFASIREDEAAKFI
NSIRESAGSTINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRRIVEIGGGFDLADLFPSIPFLYFITGKMAKLKKL
HKQVDKLLETIVKEHQEKHKRAKEDGAEIEDEDYIDVLLRIQQQSDTLNINMTTNNIKALILDIFAAGTDTSASTLEWAM
TEVMRNPRVREKAQAELRQAFRGKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQLTIIDGYEIPAKTKVMVN
VYAVCKDPKYWVDAEMFVPERFEASSIDFKGNNFEYLPFGGGRRICPGMTFGLATIMLPLALLLYHFNWELPNKIKPENM
DMAEQFGVAIGRKNELHLIPSVNDLCVH* 14378072
>CYP71D137PX discontinued seq
Gm0020x00169:peptide scaffold_20:7338351..7336739
(- strand)
1 aa diff to CYP71D138, missing N-term
chr10
28106849 SPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLAD
VFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLL
RIQQDDTLDIQMTTNNIKALIL (0)
DIFAAGTDTSASTLEWAMAE
MMRNPRVWEKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQ
PTIIDGYEIPAKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGG
RRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 28059969
>CYP71D138X discontinued seq
Gm0020x00171:peptide 68% to 71D96 scaffold_20:7363851..7362198 (- strand)
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQ
KLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQM
ISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSLICASISRVAFG
GIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVNKVLENIIREHQEKNKIAKEDGAELED
QDFIDLLLRIQQDDTLDIQMTTNNIKALIL
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDL
EQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGN
NFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*
>CYP71D139X discontinued seq
Gm0020x00172:peptide 100% to 71D138 above
scaffold_20:7381829..7380089 (- strand)
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV
ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS
IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK
QVNKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTSASTLEWAMAEM
MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA
ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD
EHFGLAIGRKNELHLIPNVNL*
>CYP71D140X discontinued seq
Gm0020x00173:peptide scaffold_20:7396143..7394663
(- strand)
2 aa diffs to CYP71D138
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQ
KLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQM
ISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSLICASISRVAFG
GIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVDKVLENIIREHQEKNQIAKEDGAELED
QDFIDLLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDL
EQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGN
NFNYLPFGGGRRICPGMTLGLASIMLPL
ALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*
>CYP71D141X discontinued seq
Gm0020x00176:peptide scaffold_20:7437789..7436033
(- strand)
2 aa diffs to CYP71D138
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVI
ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS
IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK
QVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL DIFAAGTDTSASTLEWAMAEM
MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA
ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD
EHFGLAIGRKNELHLIPNVNL*
>CYP71D142X discontinued seq
Gm0020x00177:peptide scaffold_20:7444105..7442757 (- strand)
4 aa diffs to CYP71D138
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQ
KLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKK
YGPLMHLQLGEISAVIASSPKMAKEIVKTHDVSFLQRPHLVFGQM
ISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSLICASISR
VSFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHKQVD
KVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDL
EQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVM
VNAYAICKDSQYWIDADRFVPERF
QGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNM
DEHFGLAIGRKNELHLIPNVNL*
>CYP71D143PX discontinued seq
Gm0020x00179:peptide scaffold_20:7498285..7496736
(- strand)
missing the N-terminal, 5 aa diffs to CYP71D138
100% to Glyma10g22100.1 Gm10:28092932..28094478 (- strand)
YGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFAS
IREDEAAKFIDSIRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFL
TGKMTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTS
ASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHESDQEQLTYLKLVIKETFKVHPPTPLLLPRECSQPTIIDGYEIP
AKTKVMVNAYAICKDSQYWIDADRFVPERFEGSSIDFKGNKFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELP
NKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL*
>CYP71D144PX discontinued seq
Gm0020x00180:peptide scaffold_20:7512153..7509432
(- strand)
9 aa diffs to CYP71D138
4 aa diffs to Glyma10g22120.1 Gm10:28105591..28107347 (- strand)
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV
ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQTRKMCATELLSTKRVQSFASIREDEAAKFIDS
IRESAGSPINLTSRIFSLICASISRVAF
GGIYKDQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGK
MTRLKKLHKQVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLL
RIQQDDTLDIQMTTNNIKALILDIFAAGTDTSAST
LEWAMAE
TTRNPTVREKAQAELRQAF*EK
EIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYAICKDSQYW
IDADRFVPERFEVSSIDFKGNNFNYLLFGGGRRICPGMTFGLASIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIG
RKNELHLIPNVNL*
>CYP71D145PX discontinued seq
Gm0763x00001:peptide scaffold_763:554..59
(- strand) 100% to CYP71D138
duplicate contig
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHAL
RDLAKKYGPLMHLQLGEISAVVASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIA
FAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDSIRESAGSPINLTSRIFSL
ICASIS
>CYP71D146X discontinued seq 2 aa diffs to CYP71D138, 2 aa diffs to 71D147
Gm1021x00001:peptide 68% to 71D96 scaffold_1021:5083..3343 (- strand)
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV
ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS
IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK
QVDKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALILDIFAAGTDTSASTLEWAMAEM
MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLMLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA
ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD
EHFGLAIGRKNELHLIPNVNL*
>CYP71D147X discontinued seq 2 aa diffs to CYP71D138, 2 aa diffs to 71D138
Gm1090x00001:peptide 68% to 71D96 scaffold_1090:8646..6906 (- strand)
Exact match to new scaffold_265 but split
into two genes and exon 1 runs off the end
Also 100% to CYP71D142
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQLGEISAVV
ASSPKMAKEIVKTHDVSFLQRPHLVFGQMISYGGLGIAFAPYGDHWRQMRKMCATELLSTKRVQSFASIREDEAAKFIDS
IRESAGSPINLTSRIFSLICASISRVAFGGIYKEQDEFVVSLIRKIVESGGGFDLADVFPSIPFLYFLTGKMTRLKKLHK
QVNKVLENIIREHQEKNKIAKEDGAELEDQDFIDLLLRIQQDDTLDIQMTTNNIKALIL DIFAAGTDTSASTLEWAMAEM
MRNPRVREKAQAELRQAFREKEIIHESDLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAYA
ICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLASIMLPLALLLYHFNWELPNKMKPEEMNMD
EHFGLAIGRKNELHLIPNVNL*
>Scaffold_265 of Glyma1 100% to CYP71D147 exon 2 (-) strand
100% to 71D134, 71D135, 71D143, 71D142, 71D141, 71D140P
Glyma0265s00200.1 scaffold_265:16254..16862
(- strand)
16862
DIFAAGTDTSASTLEWAMAEMMRNPRVREKAQAELRQAFREKEIIHES
DLEQLTYLKLVIKETFRVHPPTPLLLPRECSQPTIIDGYEIPAKTKVMVNAY 16563
16562
AICKDSQYWIDADRFVPERFEGSSIDFKGNNFNYLPFGGGRRICPGMTLGLA
SIMLPLALLLYHFNWELPNKMKPEEMNMDEHFGLAIGRKNELHLIPNVNL* 16254
>Scaffold_265 of Glyma1 100% to CYP71D147 N-term (-) strand
Based on their orientation, these pieces are from two different
genes 16,034 bp apart.
This is nearly the same spacing and orientation as between CYP71D142
C-term and CYP71D141 N-term
(16,147 bp). These
pieces are probably part of the chr10 CYP71D cluster.
No gene model
220
MEAQSYLLLIGLFFVLHWLAKCYKSSVSQKLPPGPKKLPIIGNLHQLAEAGSLPHHALRDLAKKYGPLMHLQL
2
>CYP71AH3 Glycine max (soybean)
GenEMBL Y10489 (1603bp)
Schopfer,C.R. and Ebel,J.
Identification of elicitor-induced cytochrome P450s of soybean
(Glycine max L.) using differential display of mRNA
Mol. Gen. Genet. 258, 315-322 (1998)
clone CP1
formerly CYP71A9
scaffold_35
3592302..3588813 (-) strand
chr5 (+) strand 2116886-2120375 Glyma05g02760.1
MISFTVFVFLTLLFTLSLVKQLRKPTAEKRRLLPPGPRKLPFIG
NLHQLGTLPHQSLQYLSNKHGPLMFLQLGSIPTLVVSSAEMAREIFKNHDSVFSGRPS
LYAANRLGYGSTVSFAPYGEYWREMRKIMILELLSPKRVQSFEAVRFEEVKLLLQTIA
LSHGPVNLSELTLSLTNNIVCRIALGKRNRSGADDANKVSEMLKETQAMLGGFFPVDF
FPRLGWLNKFSGLENRLEKIFREMDNFYDQVIKEHIADNSSERSGAEHEDVVDVLLRV
QKDPNQAIAITDDQIKGVLV
(0) 2117773
2119767 DIFVAGTDTASATIIWIMSELIRNPKAMKRAQEEVRDL
VTGKEMVEEIDLSKLLYIKSVVKEVLRLHPPAPLLVPREITENCTIKGFEIPAKTRVL
VNAKSIAMDPCCWENPNEFLPERFLVSPIDFKGQHFEMLPFGVGRRGCPGVNFAMPVV
ELALANLLFRFDWELPLGLGIQDLDMEEAIGITIHKKAHLWLKATPFCE
2120375
>CYP71AH7P
chr5 83% to 71AH3
Glyma05g02750.1
Gm05:2112489..2113935 (+ strand)
2112676
DIFVVGTSTASATIIWTMSELIRNPKAMKRAQ
EEIRGVVKGKEMVEEIDLSRLLYLKSFVKEDLRLHPPVPL 2112891
LMPRETTESCTIKGFEIPTKTTRVLVNAKSI 2112984
>CYP71AU9 Glycine max (soybean,
Fabales)
No accession number
Muhammad Azam Chattha
Submitted to nomenclature committee 9/26/2008
Clone C59
50% to 71A26, 42% to 71D81 medicago, 51% to
71AR1v1
67% to 71AU7 Medicago, 55% to 71AU3 Vitis
scaffold_230 488534..490780 (+) strand
chr16 Glyma16g32010.1 Gm16:35215946..35218490 (- strand)
488534 35218490 MWISQQENSSSWFFLPVVTFIILFLLRTFLNLLSNRNNDSKKPSPPSPPKLPIIGNLHQL
GTHIHRSLQSLAQTYGSLMLLHLGKVPVLVVSTAEAAREVLKTHDPVFSNKPHRKMFDIL
LYGSKDVASAPYGNYWRQTRSILVLHLLSAKKVQSFEAVREEEISIMMENIRKCCASLMP
VDLTGLFCIVANDIVCRAALGRRYSGEGGSKLRGPINEMAELMGTPVLGDYLPWLDWLGR
VNGMYGRAERAAKKVDEFFDEVVDEHVNKGGHDGHGDGVNDEDQNDLVDILLRIQKTNAM
GFEIDRTTIKALIL (0)
DMFGAGTETTSTILEWIMTELLRHPIVMQKLQGEVRNVVRDRTHIS
EEDLSNMHYLKAVIKETFRLHPPITILAPRESTQNTKVMGYDIAAGTQVMVNAWAIARDP
SYWDQPEEFQPERFLNSSIDVKGHDFQLLPFGAGRRACPGLTFSMVVVELVIANLVHQFN
WAIPKGVVGDQTMDITETTGLSIHRKFPLIAIASPHA* 35216244 490780
>CYP71AU10P CYP71AU12P 83%
to 71AU12
Gm0003x00766:peptide 12800k+
Glyma05g19650.1
23651424 YDIAAGT*VLVNARVIARDLSWDQSLEFKLERFLSSSIDFKGLDFELIPFGAKRRGCPR
VTFATIIIEVVLANLVHQFDWSLPSGATGEDLDMSETTGLVVHKKSPLLVATVYQRN* 23651077
>CYP71AU11P CYP71AU23P Chr7 Glyma07g31370.1
80% to 71AU12
36396710 NLHQLGLFPHRTLQTLAKNYGPLMLLHFGKVPVHVVS
SSDAAREVMKTHDLVFSDRPQRKINDILLYGSKDLPSSNYGEH*RQLRSLSVLHLLST 36396994
36396995 KRVQSFRGVREEKTARMMENIWQCCCDSLHVNLSDLC
AALANDVACRAALGRR 36397153
YCGGEGREF
QHWLLEFRELLVAVSVGEDYVLWLDWMSKVNGLSQRAHGVAKNLDQF
IDEVISDHVRNGRDGHVDVDSEEQNDFVNVLLSIEKSKTTGSTIDRTPIK
36398323
DMLVAGTDTTYTTLEWTISELLKHP 36398397
>CYP71AU12 CYP71AU11 60% to 71AU27
Gm0191x10008:peptide scaffold_191:338028..333406
(- strand)
Glyma07g31380.1
Gm07:36400692..36405314 (- strand)
338018
36405304 MLFFTVFVLCLSLAFMIKWYSNAVTSKNSPPSPPRLPLLGNLHQLGLFPHRTLQTLAKKYGPLMLLHFGKVPVLVVSSAD
AAREVMRTHDLVFSDRPQRKINDILLYGSKDLASSKYGEYWRQIRSLSVSHLLSTKRVQSFRGVREEETARMMDNIRECC
SDSLHVNLTDMCAAITNDVACRVALGKRYRGGGEREFQSLLLEFGELLGAVSIGDYVPWLDWLMSKVSGLFDRAQEVAKH
LDQFIDEVIEDHVRNGRNGDVDVDSKQQNDFVDVLLSMEKNNTTGSPIDRTVIKALILDMFVAGTDTTHTALEWTMSELL
KHPMVMHKLQDEVRSVVGNRTHVTEDDLGQMNYLKAVIKESLRLHPPLPLIVPRKCMEDIKVKGYDIAAGTQVLVNAWVI
ARDPSSWNQPLEFKPERFLSSSVDFKGHDFELIPFGAGRRGCPGITFATNIIEVVLANLVHQFDWSLPGGAAGEDLDMSE
TAGLAVHRKSPLLAVATAYQRN* 36400833 333541
>CYP71AU12-de1b CYP71AU11-de1b
Gm0191
Chr7 no gene model, pseudogene 1 kb upstream of CYP71AU12
339459
36406745
REFQHLLLEFGELLGTVSIGDYVPWLDRLTNKVSGLFERAHRVAKL
LNQFINEVIEEHFRNGRGVDVDVDVDVDSEEQNE 339220
NDFVDALLSIE
339184 NNTTGSPIDRTAIKALI 36406420 339134
>CYP71AU13P CYP71AU24P