Comparison
of Dictyoselium assemblies from genome annotation and my annotation.
Attempts
are made to reconcile the differences in the assemblies.
D. Nelson
May 13,
2003
I have 55
sequences identified
51 = seq 19
ng7190 use ng7190
508A1 = seq 1+2
ng3018 missing N-term exon use my seq
508A2 = seq 22+45+69+86+88 ng5262 N-term
incorrect use my seq
508A3 = seq 21+67+72+82+87 ng10949 correct
the two diffs and add my N-term
508A4 = seq 66+70 ng4360
remove K from mine at boundary then use mine
508B1 = seq 15+77 ng6400
missing most of N-term use my seq
508B2P = seq
60
508C1 = seq 4+23+24+83 ng1934 missing three
parts use my seq
508D1 = seq 59+81 ng1935
missing N-terminal exon use my seq
508E1 = seq 71+85 ng8897 use
my seq correct two other defects
513A1 = seq 8+53 ng6889
n-term missing use my seq
513A2P = seq
38
ng12344 correct 1 aa diff use my seq
513A3 = seq 7+50 ng4838
add missing intron back use my seq
513B1 = seq 49
ng627 2aa diffs use ng627 seq
513C1 = seq 5+56 ng6446
100% match
513D1 = seq 54+55+78 ng11186
513E1 = seq 51+52+90 ng3101
use N instead of D
513E2P = seq
37a complete except for missing N-term exon 55% to 513E1
(old name 513C1P) ng1289
missing first 13 aa use my seq
513E3P = seq 37b
upstream of 513E2P only 429 bp between them (old name 513C2P)
513F1 = seq 57
513G1P = seq 41
37% to CYP513A1
514A1 = seq 11
ng1282 add 7aa to exon 1 and change 1 aa L to F in ng1282
514A2 = seq 65
ng1006 joint at N-term exon does not agree, 2aa diffs
inside, use ng1006 seq
514A3P = seq 89
514A4 = new seq ng2792
7aa difs to 514A1
515A1 = seq 79b ng4137
some parts of both seqs have support
515A2P = seq
79a
515B1 = seq 6
ng2647 N-terms do not match use ng2647 but extend back
516A1 = seq 9+43 ng9107
use ng9107
516A2P = seq
64
516B1 = seq 10+42 ng9290 use
mine but add insertion
516B2P = seq 48
partial, might be a pseudogene
517A1 = seq 3+61 ng4501
use this seq with my N-terminal
517A2 = seq 74
ng10310 correct 3aa diffs and use my seq
517A3P = seq
36
ng682 missing some seq. Use my seq add new C-term piece
517A4 = new seq ng5440
most like 517A1, 13 aa diffs
518A1 = seq 20
518A2P = seq
25+27+29+30
518B1 = seq 26+28
519A1 = seq 12
ng8085 add N-term exon then use ng8085
519B1 = seq 35+44+75 ng6320
USE ng6320
519C1 = seq 31+32+33 ng5581
use my seq (N-term missing)
519C2P = seq
46
519D1 = seq 14+34 ng12432 use
my seq
519E1 = seq 17+84 ng6352 add
two short pieces use ng6352
520A1 = seq 47+68+80 ng8864
use this seq with my N-term
520B1 = seq 58
ng1210 correct stop codon. use my seq.
521A1 = seq 13
ng2405 100% match
522A1 = seq 18
519H1P = seq
39+73 ng11335 old
523A1 correct three aa diffs add my N-term
524A1 = seq 91
ng752 N-terms do not agree use my seq
525A1 = seq 62
554A1 = seq 40
ng1193 small diffs use ng1193 seq
555A1 = seq 76
ng10297 use ng10297
556A1 = seq 92
ng10943 use my seq
Dicty database did not
identify the following as P450s 508B2P, 513E2P, 513F1, 513G1P,
515A2P, 516A2P, 516B2,
518A1, 518A2P, 518B1, 519C2P, 522A1, 525A1,
note 518A1, 518B1 and 522A1
have non-standard Heme signatures
(13 seqs)
Dicty database found two
new genes I had missed 514A4 and 517A4 both very similar to existing genes.
>514A2
>aa-ng1006
Contig_0356
pred cdna pORF 5256
3526 strand r start y stop y ph 0
MNLIYTIILTIIILVLIISIKDLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCF
NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH
SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK
KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM
ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL
SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY
IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE
FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR
GDVGLSMQCKPYDALFIKRN
JC1c209h10.r1
has FL not CF
JC3a120c12.s1
has ?F not FL
Contig_0356 has CF not FL use CF
Query: 1
MNLIYTIILTIIILVLIISIKDLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCF 60
MNLIYTIILTIIIL
DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVAT
Sbjct: 1
MNLIYTIILTIIIL-------DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATFL 53
Query: 61
NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH 120
NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH
Sbjct: 54
NDFYKQYGKVYRLRLGSVETVVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILH 113
Query: 121
SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK 180
SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK
Sbjct: 114
SNGDYHFLLKGVLSSQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIK 173
Query: 181
KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM 240
KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM
Sbjct: 174
KLILKVVLNFTLGIEENDDINLSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNM 233
Query: 241
ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL 300
ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL
Sbjct: 234
ISSYVFVRDYLAKKLEEVKKKEYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILL 293
Query: 301
SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY 360
SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY
Sbjct: 294 SIVDIIIASVDTTSNSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPY
353
Query: 361
IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE 420
IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE
Sbjct: 354
IISIMNETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALE 413
Query: 421
FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR 480
FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR
Sbjct: 414
FKPERFKTQTLNQKGLLHFGAGPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDR 473
Query: 481 GDVGLSMQCKPYDALFIKRN 500
GDVGLSMQCKPYDALFIKRN
Sbjct: 474 GDVGLSMQCKPYDALFIKRN 493
>CYP513B1
>aa-ng627
Contig_0257
pred cdna pORF 5998
4427 strand r start y stop y ph 0
MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK
WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN
GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN
IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV
VQIETLVKSHKYKEDDECMLSKLMIEHDKGNIPWDAVISNCNTIITAGSDSTSSTALFFL
IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK
HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS
QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT
VVQIKKR
Note only
one seq has L C. The others all
have S W so correct my seq
>CYP513B1
seq 49 complete seq 45% to seq 7, 50
Length = 488
Score = 2504 (881.5 bits), Expect =
4.8e-264, P = 4.8e-264
Identities = 485/487 (99%), Positives =
485/487 (99%)
Query: 1
MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK 60
MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK
Sbjct: 1
MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQK 60
Query: 61
WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN 120
WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN
Sbjct: 61
WFKIYGPIYSVNMGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQEN 120
Query: 121
GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN 180
GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN
Sbjct: 121
GKNQSALKRIFLSELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSIN 180
Query: 181
IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV 240
IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV
Sbjct: 181
IISRFLFGIDFPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALV 240
Query: 241
VQIETLVKSHKYKEDDECMLSKLMIEHDKGNIPWDAVISNCNTIITAGSDSTSSTALFFL 300
VQIETLVKSHKYKEDDECML KLMIEHDKGNIP DAVISNCNTIITAGSDSTSSTALFFL
Sbjct: 241
VQIETLVKSHKYKEDDECMLLKLMIEHDKGNIPCDAVISNCNTIITAGSDSTSSTALFFL 300
Query: 301
IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK 360
IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK
Sbjct: 301 IEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYRK
360
Query: 361
HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS 420
HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS
Sbjct: 361
HPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENS 420
Query: 421
QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT 480
QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT
Sbjct: 421
QTLISFGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNT 480
Query: 481 VVQIKKR 487
VVQIKKR
Sbjct: 481 VVQIKKR 487
>517A3P
MEIVNV (frameshift)
FIILIILFLVKDF (0)
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHEL
(10 aa deletion)
IVRAWIGE
RLFLFVSNYDVKYFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS
VS**LRVHTSKKLMEEKAIEFIDSLEKISNNNEI (0)
FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFE
IFSPLYDWFFTRRL*DCDIV*QIINILTENH
This is a
pseudogene, so the sequences in green need to be added.
Ng682 has
an extension that I did not have that needs to be added to my seq.
The seq
similarity stops after LCSNG so I removed that last two aa.
>aa-ng682
Contig_0274
pred cdna pORF 17004 15801
strand r start y stop y ph 0
MEIVNVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVS
NYDVKYFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS
FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRL
KGCDIVRQIISSQNENHFKSIDPSKPRDLMDDLLIEYGLNEITKEDTMKINQICFDIFGPAIG
TVTITMNWVILQLCNRPEPQEIAYLEIKKAVKDDEYVNFNHKQNAPYIVAFIKETMRLCS
NGPN
>CYP517A3P
Seq 36 pseudogene 77% to seq 3 77% to seq 61
Length
= 242
Score = 546 (192.2 bits), Expect =
8.8e-99, Sum P(2) = 8.8e-99
Identities = 103/103 (100%), Positives =
103/103 (100%)
Query: 6
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVSNYDVK 65
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVSNYDVK
Sbjct: 20
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELIVRAWIGERLFLFVSNYDVK 79
Query: 66
YFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS 108
YFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS
Sbjct: 80
YFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS 122
Score = 418 (147.1 bits), Expect =
8.8e-99, Sum P(2) = 8.8e-99
Identities = 79/86 (91%), Positives =
80/86 (93%)
Query: 109
FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 168
FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL
Sbjct: 157
FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 216
Query: 169 YDWFFTRRLKGCDIVRQIISSQNENH
194
YDWFFTRRL CDIV QII+ ENH
Sbjct: 217 YDWFFTRRL*DCDIV*QIINILTENH
242
>CYP524A1
MKTPTKYFIIFILLAALAVF (0)
KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKF
VLFVTDAEINRQVFKEENAKLYLSLGAKKILTEK
AIPFIEGAP
HRQLRKQLLPLFTIRALSSYLPIQESIVDEHIAMWIKNGKADINARNNCRDLNMAISTGVFV
GNNTPESVRDDIAKNFFVMNEGFLCLPIDLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGE
KPQSLIDLWVEHFLNCPKEERDELSNDTIIFTLLSFMFASQDALTS
SLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTMRQATYTRMVVSEILRFRPPAVMVPHE
NIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYSDPYKFDPQRFDSVRKEDVTCAKNSLVFGAG
PHFCIGKELAKNQIEVFLTKLAMSTEWTHNKTPGGDEIIFGPTIFPKDGCNITIKARN*
Note: two
exon extensions occur in ng752. VSEAT to KLIVA extends exon 1 downstream
VGKFT to
ITQK extends exon 2 upstream. This
makes the N-term too long.
The N-terminal up to the PFFGML motif is usually 30-50 aa long, but ng752 is 96aa long. I include the sequence VLFV to LTEK, which ng752 removes as an intron.
I think my solution is better, since it preserves the length between the PFFGML motif and the C-helix motif (HRQLRK) as compared to the Arabidopsis CYP710A1 seq and
The
CYP26B1 FROM FUGU (see aligns below, deleted intron shown in red)
>aa-ng752
Contig_0287
pred cdna pORF 3086
1351 strand r start y stop y ph 0
MKTPTKYFIIFILLAALAVF
VSEATSKVGQQTTTTTQQVKCSGLQCTLNKLIVA
VGKFTIKQILVGIVIVLATIALHQQYVITQK
KGSLPGPSFVP
PFFGML
FQLIFTPFSFYEKQEKYGPISWTSIMNKF
AIPFIEGAP HRQLRKQLLPLFTIRALSSYLPIQESIVDEHIAMWIKNGK
ADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPIDLPGTTLRKA
INARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCPEEERDELSNDTIIFTLL
SFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTMRQATYTRMVV
SEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYSDPYKFDPQRF
DSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHNKTPGGDEIIF
GPTIFPKDGCNITIKARN
>CYP710A1
(subject) CYP524A1 = query
Length = 496
Score = 780 (274.6 bits), Expect =
1.3e-80, P = 1.3e-80
Identities = 167/469 (35%), Positives =
267/469 (56%)
Query: 8
FIIFILLAALA-VFKG-SLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTS---I
62
F++F+L+ L+ +FK ++PGP FVPP G
L+ P SF++KQ IS S +
Sbjct: 19 FLLFLLVEQLSYLFKKRNIPGPFFVPPIIGNAVALVRDPTSFWDKQSSTANISGLSANYL
78
Query: 63 MNKFVLFVTDAEINRQVFKEENAKLYLSLG---AKKILTEKAIPFIEGAPHRQLRKQLLP 119
+ KF++++ D E++ Q+F + +G
KK+ + + ++ G H+ +R+QL P
Sbjct: 79
IGKFIVYIRDTELSHQIFSNVRPDAFHLIGHPFGKKLFGDHNLIYMFGEDHKSVRRQLAP 138
Query: 120
LFTIRALSSYLPIQESIVDEHIAMW---IKNGKADINARNNCRDLNMAISTGVFVGNNTP 176
FT +ALS+Y +Q+ ++ H+ W G ++ R
R+LN+ S VFVG
Sbjct: 139
NFTPKALSTYSALQQLVILRHLRQWEGSTSGGSRPVSLRQLVRELNLETSQTVFVGPYLD 198
Query: 177
ESVRDDIAKNFFVMNEGFLCLPIDLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEK 236
+ ++ ++ + N G + LPIDLPG +A A
RL E
KS+ RM GE+
Sbjct: 199
KEAKNRFRTDYNLFNLGSMALPIDLPGFAFGEARRAVKRLGETLGICAGKSKARMAAGEE 258
Query: 237 PQSLIDLWVEHFLNCPKEERDELSNDTIIFTLL-SFMFASQDALTSSLVWTVQLMAEHPD
295
P LID W++ +
+ + S D I LL F+FA+QDA TSSL+W V L+ P+
Sbjct: 259
PACLIDFWMQAIV--AENPQPPHSGDEEIGGLLFDFLFAAQDASTSSLLWAVTLLDSEPE 316
Query: 296
ILAKVRAEQASL-RP-NNEKLDLDTMRQATYTRMVVSEILRFRPPAVMVPHENIEDIVIG 353
+L +VR E A + P +N + +D + + YTR V E++R+RPPA
MVPH D +
Sbjct: 317
VLNRVREEVAKIWSPESNALITVDQLAEMKYTRSVAREVIRYRPPATMVPHVAAIDFPLT 376
Query: 354
DNVHVPKGTMILPSIWSAHFQEGGYSDPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFC 413
+ +PKGT++ PS++ + FQ G+++P +FDP RF R+ED
+N L FG GPH C
Sbjct: 377
ETYTIPKGTIVFPSVFDSSFQ--GFTEPDRFDPDRFSETRQEDQVFKRNFLAFGWGPHQC 434
Query: 414
IGKELAKNQIEVFLTKLAMSTEWTHNKTPGGDEIIFGPTIFPKDGCNITIKAR 466
+G+ A N + +F+ + ++ ++ G DEI++ PTI PKDGC + + R
Sbjct: 435
VGQRYALNHLVLFIAMFSSLLDFKRLRSDGCDEIVYCPTISPKDGCTVFLSRR 487
Fugu
CYP26B1
>CYP26B1
Scaffold_4267 Scaffold_49 partial sequence 74% to 26B1
Length = 513
Score = 355 (125.0 bits), Expect =
3.4e-36, P = 3.4e-36
Identities = 122/440 (27%), Positives =
195/440 (44%)
Query: 24 LPGPSFVPPFFGMLFQLIFTPFSFY-EKQEKYGPISWTSIMNKFVLFVTDAEINRQVFKE 82
+P S PF G + F+
+++KYG + T ++ + ++ VT AE R+V
Sbjct: 49
MPKGSMGFPFIGETCHWLLQGSGFHASRRQKYGNVFKTHLLGRPLIRVTGAENIRKVLMG 108
Query: 83 ENAKLYLSL--GAKKILTEKAIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDEH
140
E+ + + +L ++ G
HR+ RK +F+ AL SYLP + ++ E
Sbjct: 109
EHTLVTVDWPQSTSTLLGPNSLANSIGDIHRKKRKVFAKVFSHEALESYLPKIQQVIQES 168
>CYP524A1
Seq 91 complete seq 25% to 10, 42 468 aa
Length = 468
Score = 1926 (678.0 bits), Expect =
1.1e-236, Sum P(3) = 1.1e-236
Identities = 371/388 (95%), Positives =
376/388 (96%)
Query: 1
MKTPTKYFIIFILLAALAVF 20
MKTPTKYFIIFILLAALAVF
Sbjct: 1
MKTPTKYFIIFILLAALAVF 20
Query: 86
KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKFAIPFIEGAPHRQLR 145
KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKF + + +RQ+
Sbjct: 21
KGSLPGPSFVPPFFGMLFQLIFTPFSFYEKQEKYGPISWTSIMNKFVLFVTDAEINRQVF 80
Query: 146 KQ 147
K+
Sbjct: 81 KE 82
Query: 112
FYEKQEK-YGPISWTSIMNKFAIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDE 170
F E+ K Y +
I+ + AIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDE
Sbjct: 80 FKEENAKLYLSLGAKKILTEKAIPFIEGAPHRQLRKQLLPLFTIRALSSYLPIQESIVDE
139
Query: 171
HIAMWIKNGKADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPI 230
HIAMWIKNGKADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPI
Sbjct: 140 HIAMWIKNGKADINARNNCRDLNMAISTGVFVGNNTPESVRDDIAKNFFVMNEGFLCLPI
199
Query: 231
DLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCPEEERDEL 290
DLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCP+EERDEL
Sbjct: 200
DLPGTTLRKAINARVRLVEIFTDIIAKSRKRMGDGEKPQSLIDLWVEHFLNCPKEERDEL 259
Query: 291
SNDTIIFTLLSFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTM 350
SNDTIIFTLLSFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTM
Sbjct: 260
SNDTIIFTLLSFMFASQDALTSSLVWTVQLMAEHPDILAKVRAEQASLRPNNEKLDLDTM 319
Query: 351
RQATYTRMVVSEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYS 410
RQATYTRMVVSEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYS
Sbjct: 320
RQATYTRMVVSEILRFRPPAVMVPHENIEDIVIGDNVHVPKGTMILPSIWSAHFQEGGYS 379
Query: 411 DPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHN
470
DPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHN
Sbjct: 380
DPYKFDPQRFDSVRKEDVTCAKNSLVFGAGPHFCIGKELAKNQIEVFLTKLAMSTEWTHN 439
Query: 471 KTPGGDEIIFGPTIFPKDGCNITIKARN
498
KTPGGDEIIFGPTIFPKDGCNITIKARN
Sbjct: 440 KTPGGDEIIFGPTIFPKDGCNITIKARN
467
>aa-ng1193
Contig_0430
pred cdna pORF 4315
2561 strand r start y stop y ph 0
MDLLLFIFFLILFYYSVKYYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY
HGIFRMWLGGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD
SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNEEEEDDDDDGNKSIIINNTR
FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP
ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK
QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN
TPFLNSFIKETNRLYPIAPLSLPRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK
DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP
ENLILSVSVRPTEYSLKLIKRV
>CYP554A1 SEQ 40 30% to
508 469 aa complete
MDLLLFIFFLILFYYS (0)
YYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKYHGLFRMWFGGTYYVVV
SDYKLIREMYIENFENFKNRIA
TFKTMTGDDSRGIIGCNGDIWDSNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNE
(INTRON)
IINNTRFYFQSLTLT
VMFKMIFNENKSFQLYSDST
EFKLIFKTILNLLNSLNVYNVI
YDFLGIFQPILLKFTKILDKNSFLSKIA
TEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDEN
GLNEKQIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELK
KLNKSEITANDKINTPFLNSFIKETNRLY
(0)
KSNNNFFFRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYKDPNEFKPDRFLNSKVSDTLNFGI
GPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLPENLF*
1. VK there are two
possible GTs for this exon boundary
2. IIACP2E4184 is only seq
with LFRMWF so change this to IFRMWL
3. EEEDDDDDGNKS no good GT AG pairs in frame so
leave this in
4. PIAPLSLP matches other
related P450s in this region so this is correct
5. ILSVSVRPTEYSLKLIKRV adds
correct C-terminal
use the
ng1193 seq as the correct seq.
>CYP554A1
SEQ 40 30% to 508 469 aa complete
Length = 470
Score = 2323 (817.7 bits), Expect =
7.3e-245, P = 7.3e-245
Identities = 458/483 (94%), Positives =
459/483 (95%)
Query: 1
MDLLLFIFFLILFYYSVKYYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY 60
MDLLLFIFFLILFYYS
YYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY
Sbjct: 1
MDLLLFIFFLILFYYS--YYKADNQNSLSLSGPTPVPILGNIHQVGKDAHLTIPIISKKY 58
Query: 61
HGIFRMWLGGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD 120
HG+FRMW GGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD
Sbjct: 59
HGLFRMWFGGTYYVVVSDYKLIREMYIENFENFKNRIATFKTMTGDDSRGIIGCNGDIWD 118
Query: 121
SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNEEEEDDDDDGNKSIIINNTR 180
SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNE
IINNTR
Sbjct: 119
SNKELIMKSYKKVLNKDMNDFILLKSKELFNFFEKNGIKNE-------------IINNTR 165
Query: 181
FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP 240
FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP
Sbjct: 166
FYFQSLTLTVMFKMIFNENKSFQLYSDSTEFKLIFKTILNLLNSLNVYNVIYDFLGIFQP 225
Query: 241
ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK 300
ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK
Sbjct: 226
ILLKFTKILDKNSFLSKIATEKFNSRIKEIDFTSDDFKANDLLDSLIMTINEDENGLNEK 285
Query: 301
QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN 360
QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN
Sbjct: 286
QIENIKSICIDFLMAGTDTTGSTIEWIILKLVNNPEFQELIFQELKKLNKSEITANDKIN 345
Query: 361
TPFLNSFIKETNRLYPIAPLSLPRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK 420
TPFLNSFIKETNRLY
RKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK
Sbjct: 346 TPFLNSFIKETNRLYKSNNNFFFRKSINEMIIGDNKYYIPANTNILMDVKGFSLDENNYK
405
Query: 421
DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP 480
DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP
Sbjct: 406
DPNEFKPDRFLNSKVSDTLNFGIGPRNCIGQTIAMNQIHIFLSNLILNYRMFSIDCLPLP 465
Query: 481 ENL 483
ENL
Sbjct: 466 ENL 468
>aa-ng1210
Contig_0437
pred cdna pORF 7346
5011 strand r start y stop y ph 0
MDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWV
KQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEK
KLNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIF
KMLSTNSPQYSFPYLKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSF
ISDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDC
KVLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFG
MGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLS
PAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKRFVRKRV
Note:
ng1210 missing N-terminal exon and some exon 2. compare to 520A1
One amino
acid diff. Y vs. *
Stop codon
in JC1c13f09.s1 Y in Contig_0437 correct seq to Y
Use my
sequence for N-terminal
>CYP520B1 Seq 58 47% to seq 47 317 aa
MNYLLIIICIIFFSLFFDF
KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSM
VVSDPEIVNEIWVKQHDNFINRPKNITHK
MFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSK
VTSVVEKKLNKLIETMEYHSINKLP
FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFP*LKF
FFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSF
ISDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINY
PEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCFRESLRIRPVTPLS
LPRVAKCDTYIKDDIFIPKG
ATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYF
NDLDKISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNE
KEVYSITIKPQPFKLFLEKRV*
>CYP520A1
seq (47+68+80) complete 47% to seq 58 517 aa
Length = 517
Score = 1247 (439.0 bits), Expect = 7.6e-131,
P = 7.6e-131
Identities = 248/516 (48%), Positives =
358/516 (69%)
Query:520B1
1 MNYLLIIICIIFFSLFFDFKIRK-NWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRA 59
M L II +I F + FDFK +K N + + K+
NGPWSLPIIGG++LI D PNR+
Sbjct:520A1
1 MEILTFIIYLITFFILFDFKKKKFKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRS 60
>CYP520B1
Seq 58 47% to seq 47 317 aa
Length = 507
Score = 2460 (866.0 bits), Expect =
2.2e-259, P = 2.2e-259
Identities = 469/470 (99%), Positives =
469/470 (99%)
Query: 2 DVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWVK
61
DVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWVK
Sbjct: 36
DVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSMVVSDPEIVNEIWVK 95
Query: 62
QHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEKK 121
QHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEKK
Sbjct: 96
QHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSKVTSVVEKK 155
Query: 122
LNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFK 181
LNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFK
Sbjct: 156
LNKLIETMEYHSINKLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFK 215
Query: 182
MLSTNSPQYSFPYLKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFI 241
MLSTNSPQYSFP LKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFI
Sbjct: 216
MLSTNSPQYSFP*LKFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFI 275
Query: 242
SDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCK 301
SDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCK
Sbjct: 276
SDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCK 335
Query: 302
VLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFGM 361
VLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFGM
Sbjct: 336
VLKYEHISKMPYLYGCFRESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGATIIQNIFGM 395
Query: 362
GNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLSP 421
GNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLSP
Sbjct: 396 GNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDKISIPFGVGKRQCLSP
455
Query: 422
AMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKR 471
AMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKR
Sbjct: 456
AMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKLFLEKR 505
>aa-ng1282
Contig_0470
pred cdna pORF 2724
4834 strand f start y stop y ph 0
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS
YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA
LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPID
VEGEVGLAMQCKPFDALFIKHTNRNDFSKINKGLNQQCFTSSSQFPTCYQTEFFEFGFNG
NFQLNDGIDCSSGVQQLSFFEKGFVFRVDHFLFEF
Note exon
1 longer in ng1282 by 7 aa better match to 514A2
1 aa diff
L vs. F
SSM757 has
F
IIAFP1D28194
has F
JAX4a185c11.s1
has F Note: there appear to be two
genes use L in this gene
>CYP514A1
Seq. 11 complete seq 35% to seq 7
Length = 496
Score = 2525 (888.8 bits), Expect =
2.9e-266, P = 2.9e-266
Identities = 492/500 (98%), Positives =
492/500 (98%)
Query: 1
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 60
MNTIFTIILTITIL
DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV
Sbjct: 1
MNTIFTIILTITIL-------DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 53
Query: 61
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS 120
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS
Sbjct: 54
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS 113
Query: 121 SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA
180
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA
Sbjct: 114
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA 173
Query: 181
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 240
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS
Sbjct: 174
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 233
Query: 241
YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI 300
YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI
Sbjct: 234
YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI 293
Query: 301
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 360
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY
Sbjct: 294
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 353
Query: 361
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 420
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA
Sbjct: 354
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 413
Query: 421
LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPID 480
LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTES TFTFLVIMLKNFKIVNPTDIPID
Sbjct: 414
LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID 473
Query: 481 VEGEVGLAMQCKPFDALFIK 500
VEGEVGLAMQCKPFDALFIK
Sbjct: 474 VEGEVGLAMQCKPFDALFIK 493
>aa-ng1289
Contig_0472
pred cdna pORF 124 1538 strand f
start n stop y ph 1
PIGLPIIGNLHQLKNNPNKVFKKWYKKYGPIYRIRMGNIETIILTGYPIIKKSFIDNSNV
FQKRYEYRSKIQFNNCSDLTFTNGETHSNLKKIILSEMTLTKLKKQENYIILECEKLCKH
LDNHCESGLPFKFKFYGKLFSLNIIFRFLFGLNYQYDNEKIIEIINLILELVEGAGDPII
SDFIPLVKYFENINQNKFLKIYKKLIILIENIINNLKLKFIENNYDDDNDENNITIFLKL
FKEFKNGKITWDNLVGTCVDLTVGGTDTVSDSLNFSLIALINNPNCQEKLYNEIKNELLK
NNNNNNIDDLIILKHSLYRLSIPYLSMVMKETYRNFPVALLGLPIITTQDMEINGYKISK
GTQVFKNIFLSHKSKDFFKSPNDFNPERFSDSGNNFMFGGGTTNLVQFGAGSRDCIGKNV
VDCELFTVLGTLINRYQFLNPVLSKPLNENGIIGLANQSPDNYFIIKKRK
>513E2P Seq 37a 45% to seq 5
KNRINKINSKIPGPIGLPIIGNLHQLKNNPNK
VFKKWYKKYGPIYRIRMGNIETIILTGYPIIKKSFIDNSNVFQKRYEYRSKIQFNNCSDL
TFTNGETHSNLKKIILSEMTLTKLKKQENYIILECEKLCKHLDNHCESGLPFKFKFYGKL
FSLNIIFRFLFGLNYQYDNEKIIEIINLILELVEGAGDPIISDFIPLVKYFENINQNKFL
KIYKKLIILIENIINNLKLKFIENNYDDDNDENNITIFLKLFKEFKNGKITWDNLVGTCV
DLTVGGTDTVSDSLNFSLIALINNPNCQEKLYNEIKNELLKNNNNNNIDDLIILKHSLYR
LSIPYLSMVMKETYRNFPVALLGLPIITTQDMEINGYKISKGTQVFKNIFLSHKSKDFFK
SPNDFNPERFSDSGNNFMFGGGTTNLVQFGAGSRDCIGKNVVDCELFTVLGTLINRYQFL
NPVLSKPLNENGIIGLANQSPDNYFIIKKRK*
Note ng1289 is missing
first part of exon 2
There is no exon 1 since
this is a pseudogene
Ng1679 = NADPH cytochrome
P450 reductase not a P450
>aa-ng1681
Contig_0635
pred cdna pORF 3022 992 strand r
start y stop y ph 0
MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG
RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD
FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY
ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF
NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAGTDTN
SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV
LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT
DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIPNYGITFRP
NIFEVKLENR
Two intron boundaries do
not agree use ng1681 as correct seq
>CYP518B1 Seq (26+28)
50% to seq 20
Length = 491
Score = 2574 (906.1 bits), Expect = 1.8e-271, P = 1.8e-271
Identities = 488/490 (99%), Positives = 489/490 (99%)
Query: 1 MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG
60
MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG
Sbjct: 1
MLTNIIILIILYLFYDFCYKNFKYRNYGSPWALPVIGHFIHVINQPHLVVHNDRMKYNNG 60
Query: 61
RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD 120
RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD
Sbjct: 61
RFVNYWFGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWD 120
Query: 121
FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY 180
FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY
Sbjct: 121
FHHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGENFDTRSNFLYFYSNILFDY 180
Query: 181
ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF 240
ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF
Sbjct: 181
ILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNYLILSYPFLSIYLRYFTYTTF 240
Query: 241
NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAGTDTN 300
NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAG DTN
Sbjct: 241
NLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAG-DTN 299
Query: 301
SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV 360
SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV
Sbjct: 300
SSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEV 359
Query: 361
LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT 420
LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT
Sbjct: 360 LRLIPATPFSVPRMSNEGFEVDGIKIPKGTYLFPSMYSIFRDEKYWGENANKFYPERFLT
419
Query: 421
DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIPN-YGITFR 479
DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIP+ YGITFR
Sbjct: 420
DSHSNNYFPYGVGKRMCLGSNFSQHELYICLTNIVLNFKIKSIDGKPLNEIPSDYGITFR 479
Query: 480 PNIFEVKLENR 490
PNIFEVKLENR
Sbjct: 480 PNIFEVKLENR 490
>CYP508C1
>aa-ng1934
Contig_0725
pred cdna pORF 555 2164 strand
f start y stop y ph 0
MIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFLSDIDLIKDIFINKFENFSSRPKS
PLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANTKTIYTLLDNQ
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLAN
SIEILETPYYYYLQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEY
AANTLEWIIIKLCNSPIHQEIAY
NELKKVVSSKVIIDDSIKREITLSDRPNTPYIQAIIKETMRMHPVVVFGLPRYCENDIFI
GDENYFIPKGCKVFINFHSIGYNEKYFKDPYKFEPNRFLENSNNSMDSFFPFGLGNRVCL
GRQLANDQLYLVIANLILKYKLKTIDENKINEDGIFGLTVSPNKYKINLESR
>CYP508C1 seq 4, 23, 24,
83 complete seq 42% to seq 66 486 aa
MGPWGYIKNKRIHKNEAKGPIGFPLIGN
MIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFLSD
IDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANTKTIYTLLDNQ
VNDLIKEISKFESQNKS (0)
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLAN
SIEILETPYYYYLQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEY
GTTDDDVLNITQVTLDMFMAGTDT (1)
TANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA
IIKETMRMHPVVVFGLPRYCENDIFIGDENYFIPKGCKVFINFHSIGYNEKYFKDPYKFE
PNRFLENSNNSMDSFFPFGLGNRVCLGRQLANDQLYLVIANLILKYKLKTIDENKINEDG
IFGLTVSPNKYKINLESR*
Ng1934 is
missing three pieces of sequence that need to be present
My N-term
may still be too short
Compare
with 508B1
>CYP508B1
Seq. 15, 77 50% identical to 508A1
Length = 485
Score = 1028 (361.9 bits), Expect =
1.2e-107, P = 1.2e-107
Identities = 207/476 (43%), Positives =
307/476 (64%)
Query: 5 GYIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVF
64
G K K+IHKNE GP PL+GN+ Q+GK +PH
L K+ +Y + I ++ GD Y+V
Sbjct: 18
GIKKYKKIHKNELSGPFPIPLLGNLHQLGK-EPHYTLTKMHNVYGE-IFRLHFGDVYTVV 75
Query: 65
LSDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRAN 124
+SD LI+++F++ ENF RP P + G G++
S E W + + ++ NAMKR +
Sbjct: 76
VSDPILIREMFVDNHENFKYRPLLPTFKFGAGGDHGLS-LSNERWERTRELVQNAMKRTS 134
Query: 125 TKTIYTLLDNQVNDLIKEISKFESQNKSFNPKYYFRKFVLSTMFKYIFNEDVPYDENLEN
184
K IY +LD+QV +LIK + +++ F Y ++F LS MFKYIFNED+ YDE++
Sbjct: 135
IKKIYDMLDSQVYELIKSMKQYQVTGNPFEIHLYAQRFTLSIMFKYIFNEDISYDEDITK 194
Query: 185
GKLSELTMEMENIFKTLKVGKLANSIEILETPYYYYLQKTDEVFKN----IKKLIIEDK- 239
GK++EL M+ IFK+L GKL + I I + YY +L+ +D+ F +KK I +
Sbjct: 195
GKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFSTVKKFIYKRYL 254
Query: 240 NHNLSINPEKPRDLLDILINEYGTTDDDVLNITQVTLDMFMAGTDTTANTLEWIIIKLCN
299
H +I+ + PRDL+D+LINE+ + + I Q +LDMF+AG DTTA ++ W ++++
Sbjct: 255
EHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAASIIWFVLRMQE 314
Query: 300
SPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQAIIKETMRMHPVVVFGLP 359
P Q AYNE+K V + D + LSDRP TPY+ AIIKE +R++PV FGLP
Sbjct: 315
HPEIQLKAYNEIKDAVGDR----D----RVLLSDRPKTPYLNAIIKEILRLNPVGPFGLP 366
Query: 360
RYCENDIFIGDENYFIPKGCKVFINFHSIGYNEKYFKDPYKFEPNRFLENSNNSMDSFFP 419
NDI IG+ YFIPK ++ +N+ +G+NEKYF++P +F+P+RFL N NN D++ P
Sbjct: 367 HRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFL-NKNN--DAYMP
423
Query: 420
FGLGNRVCLGRQLANDQLYLVIANLILKYKLKTIDENKINEDGIFGLTVSPNKYKINLES 479
FG+G+R C+G QLA D+ YL +N++L
+ LK I +++ FGLT+ PNK+K+ LES
Sbjct: 424
FGVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIG-TPVSDYEEFGLTLKPNKFKVLLES 482
Query: 480 R 480
R
Sbjct: 483 R 483
>Contig_0725,
D. discoideum, sequenced by the D. discoideum Sequencing
Consortium, assembled with Phusion
Length = 4136
Plus Strand HSPs:
Score = 764 (274.0 bits), Expect =
1.3e-172, Sum P(3) = 1.3e-172
Identities = 147/147 (100%), Positives =
147/147 (100%), Frame = +3
Query: 6
YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFL 65
YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFL
Sbjct: 486
YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFL 665
Query: 66
SDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANT 125
SDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANT
Sbjct: 666
SDIDLIKDIFINKFENFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANT 845
Query: 126 KTIYTLLDNQVNDLIKEISKFESQNKS
152
KTIYTLLDNQVNDLIKEISKFESQNKS
Sbjct: 846 KTIYTLLDNQVNDLIKEISKFESQNKS
926
Score = 681 (244.8 bits), Expect =
1.3e-172, Sum P(3) = 1.3e-172
Identities = 136/147 (92%), Positives =
140/147 (95%), Frame = +1
Query: 153
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI 212
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI
Sbjct: 1003 FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI
1182
Query: 213
LETPYYYYLQKTDEVFKNIKKLIIED-KNHNLSINPEKPRDLLDILINEYGTTDDDVLNI 271
LETPYYYYLQKTD+VFKNIKKLIIE
KNHNLSINPEKPRDLLDILINEYGTTDDDVLNI
Sbjct: 1183 LETPYYYYLQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEYGTTDDDVLNI
1362
Query: 272 TQVTLDMFMAGTDTTANTLEW-IIIKL
297
TQVTLDMFMAGTDT+ T+ + I IKL
Sbjct: 1363 TQVTLDMFMAGTDTSNYTILY*IKIKL 1443
sdic6B16c2.p1c
has DEV not DKV
JC2b120c09.r1 has DKV not DEV
Contig_0725 has DKV not DEV
Score = 290 (107.1 bits), Expect =
1.3e-172, Sum P(3) = 1.3e-172
Identities = 57/58 (98%), Positives =
57/58 (98%), Frame = +3
Query: 285
TTANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA 342
T ANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA
Sbjct: 1494
TAANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLSDRPNTPYIQA 1667
>aa-ng1935
Contig_0725
pred cdna pORF 4043
2373 strand r start n stop y ph 0
VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN
DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK
QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGE
EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI
DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ
DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI
PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY
LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR
CYP508D1 Seq (59+81) 42% to
seq 4 483 aa complete
MVYLNNIFIFFIIFFIYSFVKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMD
FYHKMYGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSS
(phase 2)
RPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES
FQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNR
YIEHRKTIDIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTL
NDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI
(intron)
PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCI
(intron)
GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR*
Ng1935 is
missing n-terminal exon
1 aa diff
inside change KKY to RKY then use my seq
>CYP508D1
Seq (59+81) 42% to seq 4 483 aa
complete
Length = 483
Score = 2443 (860.0 bits), Expect =
1.4e-257, P = 1.4e-257
Identities = 462/463 (99%), Positives =
463/463 (100%)
Query: 1
VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN 60
VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN
Sbjct: 20
VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKMYGGIYRLWFGDYFVVSLN 79
Query: 61 DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK
120
DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK
Sbjct: 80
DPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLK 139
Query: 121
QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGE 180
QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYL+KYALTTMFKYVFNETVSFENKIVQGE
Sbjct: 140
QTYDNLSDSVNSLINLMKEFQSSNESFQPDMYLKKYALTTMFKYVFNETVSFENKIVQGE 199
Query: 181
EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI 240
EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI
Sbjct: 200
EAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTI 259
Query: 241
DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ 300
DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ
Sbjct: 260
DIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQ 319
Query: 301
DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI 360
DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI
Sbjct: 320
DTVYNELKETVVGPVVTLNDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI 379
Query: 361
PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY 420
PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY
Sbjct: 380
PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCIGQALGMDQIY 439
Query: 421
LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 463
LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR
Sbjct: 440
LLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 482
.CYP521A1
100% match
>aa-ng2405
Contig_0941
pred cdna pORF 14409 12827
strand r start y stop y ph 0
MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRTDPYKTLAKASKK
TEHGILKCWNGEHLMVVVDNPSIIKQMYVNTNNFTDRPQTKVFEIISRNYKNSGFANGEK
WKHLRGLYAPSFTKIKSRPHENIILKYVNFEIKSLKNHAITNSIYNPFLIENINSFGTKV
ITEIIFGREFSENEVYSLIGPMNKLFGILDTPFPSESISFLKPFYRRSYKECDKQCEELF
KLVEKVYDDHLLNLDKDNPKDVMDVMIVETDFKEKDHVICICCDLLMGTKDTFNTIVLWF
FVLMINYQDVQLKGYQEIIKVLECTGRDHVTIEDIDKLPYIDGIIKEISRIHPAGPLSVP
RTAINDIMINGYFIPKGCHVFQNTYGAVYNYMKESDEPCKMKPERWIENEKLRKDGKLDP
TNDLALISLPFSSGIRNCPGVGFAEYELFLLFSNIILNFHLSSPNNLKLNESGHFGLTMK
PFPFLVDLKLR
>aa-ng2647
Contig_1035
pred cdna pORF 453 2253 strand
f start y stop y ph 0
MITFIIYKNFDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGS
YDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISL
TKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYE
DELIPSDILDFIHASEELLVVLSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK
VELLDQDNPKCFIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLS
NHESIQEKLYNEICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPV
PLGLPHLCSEDTEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVT
FSTGPRVCPGKNLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVS
LRS
>CYP515B1
Seq. 6 34% to seq 5
Length = 498
Score = 2437 (857.9 bits), Expect =
6.0e-257, P = 6.0e-257
Identities = 468/472 (99%), Positives =
470/472 (99%)
Query: 13
KRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG-KVFSMKFGSYDTVILNEPDV 71
KRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG KVFSMKFGSYDTVILNEPDV
Sbjct: 26
KRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGRKVFSMKFGSYDTVILNEPDV 85
Query: 72
IVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLEGL 131
IVEAFHLNSTSFMDRILLPSFEVVGK +NIGFTQTEYWKKIRGILNISLTKSKTRLLEGL
Sbjct: 86 IVEAFHLNSTSFMDRILLPSFEVVGKKKNIGFTQTEYWKKIRGILNISLTKSKTRLLEGL
145
Query: 132
FNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDF 191
FNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDF
Sbjct: 146
FNQEYFRFDQFIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDF 205
Query: 192
IHASEELLVVLSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKC 251
IHASEELLVVLSRMPSDYIKVLRPFESHKK+KEICDRMSKFIKPRVDKKVELLDQDNPKC
Sbjct: 206
IHASEELLVVLSRMPSDYIKVLRPFESHKKIKEICDRMSKFIKPRVDKKVELLDQDNPKC 265
Query: 252
FIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYN 311
FIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYN
Sbjct: 266
FIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYN 325
Query: 312 EICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSED
371
EICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSED
Sbjct: 326
EICKNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSED 385
Query: 372
TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGK 431
TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGK
Sbjct: 386
TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGK 445
Query: 432
NLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 483
NLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS
Sbjct: 446
NLSEDELFSFGTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 497
JC3e13g02.r1
has KLK not KIK
JC3a84h08.r1
has KLK not KIK fix this in my seq
JC2b306h10.s1
has KNQNIG not KKKNIG
JC3e86b07.r1
has KNQNIG not KKKNIG fix this in my seq
JAX4a105a03.r1
has KYGKV not KYGRKV
JC3a40f05.s1
has KYGKV not KYGRKV fix in my seq
CYP515B1 Seq. 6 34% to seq
5
MATLLLVIIFMITFIIYKN (0)
FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG
KVFSMKFGSYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFT
QTEYWKKIRGILNISLTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDT
(0)
MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVVLSRMPS
DYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQD
NPKCFIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEIC
KNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACV
KETLRRSPPVPLGLPHLCSED
TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTV
TFSTGPRVCPGKNLSEDELFSFGTKLFK
TFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS*
>IIAFP1D83520 45769
letters
Length = 45,769
Plus Strand HSPs:
Score = 754 (270.5 bits), Expect = 7.1e-118, Sum P(3) =
7.1e-118
Identities = 144/145 (99%), Positives = 145/145 (100%), Frame
= +1
Query: 10
FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGSYDTVILNEP 69
FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGSYDTVILNEP
Sbjct: 727
FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFGSYDTVILNEP 906
Query: 70
DVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLE 129
DVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLE
Sbjct: 907 DVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNISLTKSKTRLLE
1086
Query: 130 GLFNQEYFRFDQFIKKQLKSKSDTM 154
GLFNQEYFRFDQFIKKQLKSKSDT+
Sbjct: 1087 GLFNQEYFRFDQFIKKQLKSKSDTV 1161
Score = 450 (163.5 bits), Expect = 7.1e-118, Sum P(3) =
7.1e-118
Identities = 91/99 (91%), Positives = 92/99 (92%), Frame = +1
Query: 142
FIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV 201
FI +K K MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV
Sbjct: 1243 FIFLNIKKK---MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV
1413
Query: 202
LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK 240
LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK
Sbjct: 1414
LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKK 1530
Score = 46 (21.3 bits), Expect = 7.1e-118, Sum P(3) =
7.1e-118
Identities = 9/9 (100%), Positives = 9/9 (100%), Frame = +3
Query: 1 MITFIIYKN 9
MITFIIYKN
Sbjct: 462 MITFIIYKN 488
>514A4
>aa-ng2792
Contig_1093
pred cdna pORF 7469
9223 strand f start y stop y ph 0
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFS
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS
YFFIKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA
LEFKPERFKNQPLNQKGLFHFGAGARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID
VEGEVGLAMQCKPFDALFIKRN
Note only
7aa diffs to ng1282 are there two genes?
CYP514A1 Seq. 11 complete
seq 35% to seq 7
MNTIFTIILTITILVLSLILK (0)
DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVF
RLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISS
NGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKN
TPNQIRILAVKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYL
PFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEKENELKNETT
DETSSKLNNIPIIENYYKNYLDGSIHYDS
ILFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN
(1)
ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEF
WGEDALEFKPERFKNQPLYQKGLFHFGA
(1)
GPRGCPGGRFTESFTFTFLVIMLKNFKIVNPTDIPI
DVEGEVGLAMQCKPFDALFIKRN*
6aa diffs
JC2b365b08.s1 FGKDYNIIS
JC3c23c03.s1 FGKDYNIIS
JC3a164c10.s1 FGKDYNIIS
IIAGP1D6746 FGKDYNIIS
IIAGP1D4567 FGKDYNIIS
JAX4a225h05.s1
LGKDYNIFS
IIAGP1D1903 LGKDYNIFS
JAX4a225h05.r1
LGKDYNIFX
SFH636
LGKDYNIFS
Two
different genes present
JC3a164c10.s1
YFFLKD
JC3f62d07.s1 YFFLKD
JC1a54g04.r1 YFFLKD use
SSM757 PLN
and GARA
IIAFP1D28194 GARA
JC3e102f08.s1
PLY
JC3c13d09.r1 PLY
JC3c13f09.r1 PLY
GGTGAAGATGCTTTAGAATTTAAACCAGAAAGATTTAAAAATCAACCTTTATATCAAAAA
GGACTTTTTCATTTTGGTGCAGGTATGTTTTGATACCATTTTAAAAAGGAATTAAAATTT
F G A
G
TTTATTTAAACAGGAGCTAGAGCTTGTCCAGGAGGAAGATT
G A R
A
>CYP514A1
Seq. 11 complete seq 35% to seq 7
Length = 503
Score = 2557 (900.1 bits), Expect =
1.2e-269, P = 1.2e-269
Identities = 496/502 (98%), Positives =
497/502 (99%)
Query: 1
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 60
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV
Sbjct: 1
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV 60
Query: 61
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFS 120
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRF GKDYNI S
Sbjct: 61
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIIS 120
Query: 121
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA 180
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA
Sbjct: 121
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA 180
Query: 181
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 240
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS
Sbjct: 181
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS 240
Query: 241
YFFIKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI 300
YFF+KDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI
Sbjct: 241 YFFLKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI
300
Query: 301
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 360
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY
Sbjct: 301
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY 360
Query: 361
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 420
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA
Sbjct: 361
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA 420
Query: 421
LEFKPERFKNQPLNQKGLFHFGAGARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID 480
LEFKPERFKNQPL QKGLFHFGAG R CPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID
Sbjct: 421
LEFKPERFKNQPLYQKGLFHFGAGPRGCPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID 480
Query: 481 VEGEVGLAMQCKPFDALFIKRN 502
VEGEVGLAMQCKPFDALFIKRN
Sbjct: 481 VEGEVGLAMQCKPFDALFIKRN 502
>CYP508A1
missing N-terminal 100% match
>aa-ng3018
Contig_1182
pred cdna pORF 2352 535 strand r
start n stop y ph 0
ISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKKYGHILRVYMADMYTVVVSD
PLLIREMYVDNSDIFTDRVKKPSVEHGTFYHGTVTSYGEHWKNNREIVGKAMRKTNLKHI
YELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSAMFKFLFNHDIPEDEDINKGDTQ
KLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFKDIMKYHREKYNEHLKTFDP
DVERDLLDILIKEYGTDNDDKILSILATINDFFLAGVDTSSTALESMVLMLTNYPEIQEK
AFDEIKTVVNGRSKVNLSDRQSTPYLVAVIKETLRYKPMSPFGLPRSSSKDCMIGGHFIP
KNAQILINYQALGMNEEYYENPEQFDPSRFLKVESNVAFLPFSIGIRSCVGQSFAQDELY
ICISNILLNFKLKSIDGKKIDETEEYGLTLKTKNRFNVTLEKRII
>CYP508A1 complete (old
seqs. 1 and 2) 28% to seq 8
MALFEIIISLFVVYIIHNAISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKK
YGHILRVYMADMYTVVVSDPLLIREMYVDNSDIFTDRVKKPSVEHGTFYHGTVTSYGEHW
KNNREIVGKAMRKTNLKHIYELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSAMFK
FLFNHDIPEDEDINKGDTQKLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFK
DIMKYHREKYNEHLKTFDPDVERDLLDILIKEYGTDNDDKILSILATINDFFLAGVDTSS
TALESMVLMLTNYPEIQEKAFDEIKTVVNGRSKVNLSDRQSTPYLVAVIKETLRYKPMSP
FGLPRSSSKDCMIGGHFIPKNAQILINYQALGMNEEYYENPEQFDPSRFLKVESNVAFLP
FSIGIRSCVGQSFAQDELYICISNILLNFKLKSIDGKKIDETEEYGLTLKTKNRFNVTLE
KRII*
Add N-term
exon
>513E1
>aa-ng3101
Contig_1216
pred cdna pORF 338 1958 strand
f start y stop y ph 0
MDLYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK
YGVIYSIRLGNIETVVFTGYPIFKKAFIENSQIFAPRFQHLSRFEANGCKNLIGSNDEIH
STLKKLILTEITSNKIKKMENHIVLECENLCKQLDKHCQDGLPFSLNMYFKLFSLNIILR
FLFGTINNSYQDKSNQDIVDVIIEFLHYGGNPIMSDFIPILKPFYKQNKFFKFYPILCDH
LNKLIENYKNNKQLQKQQKQQQNEDDDDDDDGTIIGKLLKEYHNGKISWTSVVSTCVDVF
LAGVDSTSNSTIFTLIALVNNSNCQEKLFNEIKNNLKKSDDGHDEIVIRHSLYRSSIPYL
SLVMKEVYRLYSVILIGLPHITSEDVEIEGYKIAKGTQIIQNVFSTHLCEKTFPMSKSFI
PERFIETGSNNMFGGGQTNLVHFGTGVRDCVGKSLADCEIFTVLATLINRYQFINPTLEP
LNDIGSFGIAYQPPINNFIIKKRL
Contig
1216 shows D here
JC3e44c07.r1
shows N. N is more like other seqs
so use N
>513E1
seq 90+52+51 39% to seq 5+56; 42% to seq 8; 47% to seq 37 shorter match
Length =
505
Score = 2635 (927.6 bits), Expect =
6.4e-278, P = 6.4e-278
Identities = 503/504 (99%), Positives =
504/504 (100%)
Query: 1
MDLYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK 60
M+LYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK
Sbjct: 1
MNLYISILILIISLIIFFKNNNRISKINSKIPGPIRLPIFGNLLQINKDPHIQFQKWYEK 60
Note 1 aa
diff at aa2
>aa-ng4137
Contig_1639
pred cdna pORF 484 2497 strand
f start y stop y ph 0
MILGIILGLFIYIYLINIKFFNRAVPSSLLVGENKLKCKFRSGPLILPIIGSLYKLSLKY
PHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKY
QGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGPI
FKKILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNNY
KEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDLP
YICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSII
KETLRISPPAPFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHF
LSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRS
LSPFEFKSKLIIRK
>CYP515A1
Seq 79b 4 diffs
Length = 495
Score = 2569 (904.3 bits), Expect =
6.3e-271, P = 6.3e-271
Identities = 491/494 (99%), Positives =
492/494 (99%)
Query: 1
MILGIILGLFIYIYLINIKFFNRAV-PSSLLVGENKLKCKFRSGPLILPIIGSLYKLSLK 59
MILGIILGLFIYIYLINIKFFNRAV PSSLLVGENKLKCKF SGPLILPIIGSLYKLSLK
Sbjct: 1
MILGIILGLFIYIYLINIKFFNRAVLPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLK 60
Query: 60
YPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGK 119
YPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGK
Sbjct: 61
YPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGK 120
Query: 120
YQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGP 179
YQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGP
Sbjct: 121
YQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIDMGP 180
Query: 180
IFKKILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNN 239
IFK+ILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNN
Sbjct: 181 IFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNN
240
Query: 240
YKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDL 299
YKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDL
Sbjct: 241
YKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGNEPKCFLEYFISEIRKDTSNLIKITDL 300
Query: 300
PYICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSI 359
PYICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSI
Sbjct: 301
PYICFDIIVAGIVTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSI 360
Query: 360
IKETLRISPPAPFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDH 419
IKETLRISPP PFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDH
Sbjct: 361
IKETLRISPP-PFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDH 419
Query: 420 FLSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTR
479
FLSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTR
Sbjct: 420
FLSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTR 479
Query: 480 SLSPFEFKSKLIIRK 494
SLSPFEFKSKLIIRK
Sbjct: 480 SLSPFEFKSKLIIRK 494
>JC3a177b07.3764
183690 letters
Length = 183,692
Minus Strand HSPs:
Does not
have the L does have the P remove the L keep P
Query: 19
KFFNRAVLPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY 61
KFFNRAV PSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY
Sbjct:
49578 KFFNRAV-PSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY 49453
R present
in both RILLN 514A1 and 514A2P use
R here
Query: 67
NQDNNI-DMGPIFKRILLNILYRFLFGVSFEYDD 99
+
DMGPIFKRILLNILYRFLFGVSFEYDD
Sbjct:
46162 KKKKKKKDMGPIFKRILLNILYRFLFGVSFEYDD 46061
Score = 142 (55.0 bits), Expect =
1.6e-34, Sum P(4) = 1.6e-34
Identities = 27/27 (100%), Positives =
27/27 (100%), Frame = -1
Query: 73
DMGPIFKRILLNILYRFLFGVSFEYDD 99
DMGPIFKRILLNILYRFLFGVSFEYDD
Sbjct:
48929 DMGPIFKRILLNILYRFLFGVSFEYDD 48849
>JC2c172c05.r1
Clone JC2c172c05, reverse read, bases 57 through 651, from
2000-01-17
Length = 593
Minus Strand HSPs:
Query: 100 IKETLRISP 108
IKETLRISP
Sbjct: 474 IKETLRISPcgt 448 coud be
either PPP or PPAP use PPAP
Query: 110
PFALPHICTDDIVIDDIFIPKNTQVIPNIY 139
PFALPHICTDDIVIDDIFIPKNTQVIPNIY
agcagca 348 PFALPHICTDDIVIDDIFIPKNTQVIPNIY 259
>aa-ng4360
Contig_1738
pred cdna pORF 283 2061 strand
f start y stop y ph 0
MIMLIKVFVLLLVYKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTKDYGDIFR
VWYTVVISDPVLIRKIYVENHESFRDRPKIPSMKYGTYYHGTAASMGEDWVRNRGIVSSA
MRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAMFKYIFNEDIGED
EDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKNFPLLLKFFNGRY
EQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDTSSTTLEYLFLMM
ANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFKPVTPFGVPRSCV
NEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTTNTASNEESSFNS
NLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDETISYGVTLKPKTR
FKVLLEKRLI
>CYP508A4
seq (66+70) complete 55% to CYP508 3 indel diffs
Length = 502
Score = 2507 (882.5 bits), Expect =
2.4e-264, P = 2.4e-264
Identities = 490/502 (97%), Positives =
490/502 (97%)
Query: 1 MIMLIKVFVLLLV-------YKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTK
53
MIMLIKVFVLLLV
YKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTK
Sbjct: 1 MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTK 60
Query: 54
DYGDIFRVW----YTVVISDPVLIRKIYVENHESFRDRPKIP-SMKYGTYYHGTAASMGE 108
DYGDIFRVW
YTVVISDPVLIRKIYVENHESFRDRPKIP SMKYGTYYHGTAASMGE
Sbjct: 61
DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIPKSMKYGTYYHGTAASMGE 120
Query: 109
DWVRNRGIVSSAMRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAM 168
DWVRNRGIVSSAMRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAM
Sbjct: 121
DWVRNRGIVSSAMRKSNIKHIYEVINNQVDVLMSTMKKYEKRSEPFEPRYYMTKYTMAAM 180
Query: 169
FKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKN 228
FKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKN
Sbjct: 181
FKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKN 240
Query: 229
FPLLLKFFNGRYEQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDT 288
FPLLLKFFNGRYEQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDT
Sbjct: 241
FPLLLKFFNGRYEQHLETIKPESPRDLLDILINEYGTNTHDDYLNIASTVLDFFFAGTDT 300
Query: 289
SSTTLEYLFLMMANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFK 348
SSTTLEYLFLMMANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFK
Sbjct: 301
SSTTLEYLFLMMANYPEIQDKVHQEVKSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFK 360
Query: 349
PVTPFGVPRSCVNEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTT 408
PVTPFGVPRSCVNEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTT
Sbjct: 361 PVTPFGVPRSCVNEITIDEKYFIPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTT
420
Query: 409
NTASNEESSFNSNLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDET 468
NTASNEESSFNSNLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDET
Sbjct: 421
NTASNEESSFNSNLAFIPFSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDET 480
Query: 469 ISYGVTLKPKTRFKVLLEKRLI 490
ISYGVTLKPKTRFKVLLEKRLI
Sbjct: 481 ISYGVTLKPKTRFKVLLEKRLI 502
>VFK340
(VFK340Q) /pub/dna_csm/LIBRARY/VF/VFK3-B/VFK340Q.Seq.d/
Length = 681
Plus Strand HSPs:
Score = 310 (114.2 bits), Expect =
9.3e-27, P = 9.3e-27
Identities = 59/60 (98%), Positives =
59/60 (98%), Frame = +3
Query: 1
DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIPKSMKYGTYYHGTAASMGE 60
DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIP SMKYGTYYHGTAASMGE
Sbjct: 204
DYGDIFRVWFADLYTVVISDPVLIRKIYVENHESFRDRPKIP-SMKYGTYYHGTAASMGE 380
Remove K
but keep FADL
>aa-ng4501
Contig_1803
pred cdna pORF 4169
5254 strand f start y stop y ph 0
MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE
HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM
DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI
KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF
INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL
LLKYKLKSQNGEKIDDTIHFSVSLKAKDYGIKLEKRI
9 seqs
support the ng4501 ending
12 seqs
support my ending
clearly
there are two genes probably 517A1 and 517A4
my ending
seems to be from 517A4 swap ends
>CYP517A1
Seq. 3 complete 37% to 508 484 aa short, 3 diffs at end
Length = 484
Score = 1756 (618.1 bits), Expect =
9.0e-185, P = 9.0e-185
Identities = 334/337 (99%), Positives =
336/337 (99%)
Query: 1
MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE 60
MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE
Sbjct: 147
MEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVE 206
Query: 61
HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM 120
HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM
Sbjct: 207
HVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLM 266
Query: 121
DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI 180
DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI
Sbjct: 267
DDLLIEYGLNEITKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEI 326
Query: 181
KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF 240
KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF
Sbjct: 327 KKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIF
386
Query: 241
INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL 300
INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL
Sbjct: 387
INYLEISQNEEIFKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNL 446
Query: 301
LLKYKLKSQNGEKIDDTIHFSVSLKAKDYGIKLEKRI 337
LLKYKLK+QNGEKIDD+I FSVSLKAKDYGIKLEKRI
Sbjct: 447
LLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLEKRI 483
>aa-ng5440
Contig_2215
pred cdna pORF 8556
6953 strand r start y stop y ph 0
MEIVNVLLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELI
GNYSKKYNGVVRAWIGERLFLFVSNYDVVKYFQKDENFLYRPSLLVPGWRYASSNGLGVM
SSSDDEWKRAKSSVSQSLRVHTSKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS
MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD
IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT
VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN
GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNK
HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE
KRI
This is
517A4
>CYP517A1
Seq. 3 complete 37% to 508 484 aa 13 diffs second gene???
Length = 484
Score = 2460 (866.0 bits), Expect =
2.3e-259, P = 2.3e-259
Identities = 470/483 (97%), Positives =
474/483 (98%)
Query: 1 MEIVNVLLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELI
60
MEI+NV LFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHEL
Sbjct: 1
MEIINVFLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELF 60
Query: 61
GNYSKKYNGVVRAWIGERLFLFVSNYDVVKYFQKDENFLYRPSLLVPGWRYASSNGLGVM 120
G YSKKYNGVVRAW GERLF FVSNYDVVKYFQKDENF RPS+LVPGWRYASSNGLGVM
Sbjct: 61
GGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGLGVM 120
Query: 121
SSSDDEWKRAKSSVSQSLRVHTSKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS 180
SSSDD+WKRAKSSVSQSLRV T+KKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS
Sbjct: 121
SSSDDKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS 180
Query: 181
MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD 240
MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD
Sbjct: 181
MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD 240
Query: 241
IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT 300
IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT
Sbjct: 241
IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT 300
Query: 301
VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN 360
VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN
Sbjct: 301
VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN 360
Query: 361
GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNK 420
GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPN
Sbjct: 361 GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNI
420
Query: 421
HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE 480
HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE
Sbjct: 421 HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE
480
Query: 481 KRI 483
KRI
Sbjct: 481 KRI 483
>aa-ng4838
Contig_1954
pred cdna pORF 3150
1572 strand r start y stop y ph 0
MTSLTLYLIIFSIILYLFVNRNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG
SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL
LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMNVILRFL
LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSGVFFDSYKELEIETDKL
IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN
TLVMALIELINNQEIQSKAFSSIRSSsTFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYI
IQNIFSSHRSDKIFQSPNEFIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIAT
LLNRYQFINPNNSKKLNDIGTFGLAQVCPDTNIILKKRI
>CYP513A3
Seq. (7+50)complete seq 49% to seq 8 2 indels 2 aa diffs
Length = 491
Add the VN
to exon 1
This seq
missing part of the K-helix in cDNA SSI868
(removed
as intron?)
Restore
this seq
Change the
two aa diffs to match ng4838
>SSI868
(SSI868Q) /pub/dna_csm/LIBRARY/SS/SSI8-C/SSI868Q.Seq.d/
Length = 669
Plus Strand HSPs:
Score = 594 (214.2 bits), Expect =
7.6e-57, P = 7.6e-57
Identities = 118/118 (100%), Positives =
118/118 (100%), Frame = +3
Query: 9
DIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKYR 68
DIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKYR
Sbjct: 12
DIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKYR 191
Query: 69
SLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI 126
SLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI
Sbjct: 192 SLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI
365
Query: 1 MTSLTLYLIIFSIILYLFVNRNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG 60
MTSLTLYLIIFSIILYLF
RNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG
Sbjct: 1
MTSLTLYLIIFSIILYLF--RNKRKNLKIPGPNGIPIFGNLLSLSGEMHLTLQEWYKTYG 58
Query: 61
SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL 120
SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL
Sbjct: 59
SVFSIRMGNIDTVVLTEYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSL 118
Query: 121
LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMNVILRFL 180
LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMN ILRFL
Sbjct: 119
LKKVVLSEMTTTKIKRMEIHIIKETEKILKILDKHAERGEPFIINNYLNMFSMNGILRFL 178
Query: 181
LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSGVFFDSYKELEIETDKL 240
LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSG FFDSYKELEIETDKL
Sbjct: 179
LGIDYPYENVDETVGYVKSIKSFFAVAGLPILSDFIPIPLKKSGGFFDSYKELEIETDKL 238
Query: 241 IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN
300
IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN
Sbjct: 239
IEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITWECVSHTCIDIISAGTDTSAN 298
Query: 301 TLVMALIELINNQEIQSKAFSSIRSS
326
TLVMALIELINNQEIQSKAFSSIRSS
Sbjct: 299 TLVMALIELINNQEIQSKAFSSIRSS
324
Score = 691 (243.2 bits), Expect =
4.5e-240, Sum P(2) = 4.5e-240
Identities = 133/140 (95%), Positives =
135/140 (96%)
Query: 320
FSSIRSSSTFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKIFQSPNE 379
+ S+
TFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKIFQSPNE
Sbjct: 351
YISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKIFQSPNE 410
Query: 380
FIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNNSKKLNDI 439
FIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNNSKKLNDI
Sbjct: 411
FIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNNSKKLNDI 470
Query: 440 GTFGLAQVCPDTNIILKKRI 459
GTFGLAQVCPDTNIILKKRI
Sbjct: 471 GTFGLAQVCPDTNIILKKRI 490
>aa-ng5261
Contig_2128
pred cdna pORF 246 5321 strand
f start y stop y ph 0 not a
P450
MVISSPITKIKKLKEDNIDNSSSSNSNSNSNNNNNEINNFGNHHHNDDNISFSSNNNNNN
NNSNKGSNINNSSTTTTTTSTPLIGGDGGGDSCEVKRSRNILPHEVLLTRVTFKVETPIG
YVRENNVLAIIGDSAEIGNWKSKDAPQMVNSECTDTHLIWTIILSFPKATRINYKYIIKS
NSNNIEWEASKGLSRTLSIEGIDMFVNDGLFGVLESGSKPWIGRGWLPEDEYQLRIQLQC
DDNQSPVHLYDHLIEQNQLVLRLHDPFGLSQDFHLPMTGFTELVLHTHSLSSFSFSVTLL
LKDGLNLPITTTTNNTTTTTTTTTTATANSNIGSNGASSPTSYPSSPNGYKVLGKSYICS
NQVSTDLWGKLYTPIIGEKYFPIGEFRCNYLVVTPFSHPQNSLSNLWHSFVEQPQTLIGH
RGNGKNNFGINANAVTENTILSFLTAAQFGAKMVEFDLQLTFDGVPVIFHDYEIEIETNE
GFTMKETINRLTLEQFLKVKPTTNKSDLISQKLKRLKSLRISKSTSDLLSLSSCPSSTSF
DPSSFLENVTQCNSNLVVSDNNNNNNNNNNNNNNNNNNNNNKNNDNNNNNNNKNNNTIRP
SSIIHDRFSTLQDAFHLVPQEIGFMIEIKYPNLAMQNLRKFKAPERNEFIDIILNIVFNE
TNDRRIAFLTFDPDIAILLRTKQFRYPVLFLVCCDTPTFFEEFDPDVNINDNRGNSITNA
ISFVKTVNLDGIVCDSQTILNNHSFVNLIHSDNLLLFTYGSKNVDPVNVKIQKELGVDGI
IADNITKLNKTVIELHTCNCTTYCGFILDGLISAVEGDAFFNCSNSNNCVFSQNVINQML
PVIPMTCMAGECINNNSSGSQNNNNTTSDNGSRNGSENSHHNSDSSNNDNIKCPTCDCNK
STTHFMIAIIIFLVLKKKKKMAINYEINGNEIKNLQTITYNYKEPIEIDKELEYCLTNWS
IEKGQKGCPYCQQSFNIKSVSLPPKFVTNTLDSLLVYCNTDGCEFNSNSDRNQLIRRCEY
RNHFERICKVICIDCNQQFTREQLKLHSNECTSKSVKCNASSLMCSWVGPIKELQSHESN
CHYIALSPILNKLYDKIQQLDISNQKTTSFLLEKINYLESIISSSSALSTPNLSSPIYYQ
PIPTAPLPDSTTMYDNRNTTTTTTTTTTISLDKITPVEIKKVASIEIFKNNSNKYVIGKV
ESGVKSVSIDGDYADTWPIVKGNLPNSVDTLILLDGFKELVYSVPKSIKVLHIGDIKKSH
AFYIQCSSVNEIHFHDNCKIKFTKNVIPDSAKTVHFYRINEPLTTESIGKNVKCLHIHDG
FSKSLGYGVLPSSIQELHLYDIKRQALSIPSSIKILYLHNQFNNESIELPDQLETLYLDN
IIHPIKNFPSSLKTIHLINYKFEDKVPFPPKVKIIMGTK
>aa-ng5262
Contig_2128
pred cdna pORF 7224
5425 strand r start y stop y ph 0
MIFGIIGLNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVC
DPIIARELFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKH
IYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQL
AQLMKPMQKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYK
PDVQRDLMDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQE
KAYNEIKSALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRST
TSDIILNNGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPR
NCVGSNFAQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK
>CYP508A2
Seq (22+45+69+86+88) Complete sequence 62% to CYP508
Length = 493
5 diffs N-term missing
Score = 2429 (855.1 bits), Expect =
4.3e-256, P = 4.3e-256
Identities = 464/469 (98%), Positives =
467/469 (99%)
Query: 8
LNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARE 67
LNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARE
Sbjct: 25
LNENQLPGPFPIPILGNIYQLTNLPHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARE 84
Query: 68
LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD 127
LFVDKFDNFIDRPKIPSV +GTFYHGTVASMGDNWK +KEIVGKAMRKTNL+HIYQLLDD
Sbjct: 85
LFVDKFDNFIDRPKIPSVNNGTFYHGTVASMGDNWKKHKEIVGKAMRKTNLEHIYQLLDD 144
Query: 128
QVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQLAQLMKPM 187
QVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQLAQLMKPM
Sbjct: 145
QVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVHNGQLAQLMKPM 204
Query: 188 QKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDL
247
QKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDL
Sbjct: 205
QKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDL 264
Query: 248
MDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEKAYNEIK 307
MDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEKAYNEIK
Sbjct: 265
MDLLIKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEKAYNEIK 324
Query: 308
SALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILN 367
SALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILN
Sbjct: 325
SALSSNGGGGGGGLTQRNKVLLSDRQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILN 384
Query: 368
NGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCVGSNF 427
NGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCVGSNF
Sbjct: 385
NGQFIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCVGSNF 444
Query: 428
AQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK 476
AQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK
Sbjct: 445
AQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK 493
>VFH640
(VFH640Q) /pub/dna_csm/LIBRARY/VF/VFH6-B/VFH640Q.Seq.d/
Length = 1335
Plus Strand HSPs:
This cDNA
matches ng5262 so change these aa
Query: 1 LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD
60
LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD
Sbjct: 291
LFVDKFDNFIDRPKIPSVKHGTFYHGTVASMGDNWKNNKEIVGKAMRKTNLKHIYQLLDD 470
>aa-ng5581
Contig_2278
pred cdna pORF 5550
6959 strand f start y stop y ph 0
MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI
ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP
ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSASKVFKFMEEIYDQHLLKF
DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAATLEWVCLYLIN
NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI
ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP
IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANIILNFQIKSIDGMQLKDIEV
SGISIHPIPFSIKLISRN
>CYP519C1
Seq (31+32+33) complete sequence 39% to seq 12
Length = 533
4 diffs N-term missing
Change all
4 to ng5581 seq add my N-term
Score = 2258 (794.9 bits), Expect =
5.7e-238, P = 5.7e-238
Identities = 434/438 (99%), Positives =
434/438 (99%)
Query: 1 MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI
60
MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI
Sbjct: 96
MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTKIISNYKEVI 155
Query: 61
ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP 120
ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP
Sbjct: 156
ENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDP 215
Query: 121
ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSASKVFKFMEEIYDQHLLKF 180
ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNS SKVFK MEEIYDQHLLKF
Sbjct: 216
ITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSDSKVFKIMEEIYDQHLLKF 275
Query: 181
DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAATLEWVCLYLIN 240
DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIK CMSFVFAGDDTVAATLEWVCLYLIN
Sbjct: 276
DKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKFCMSFVFAGDDTVAATLEWVCLYLIN 335
Query: 241
NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI 300
NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI
Sbjct: 336
NPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRTPLPLSVPRI 395
Query: 301
ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP 360
ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP
Sbjct: 396
ATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQQQQQQQPQP 455
Query: 361
IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANIILNFQIKSIDGMQLKDIEV 420
IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLF VCANIILNFQIKSIDGMQLKDIEV
Sbjct: 456 IQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFRVCANIILNFQIKSIDGMQLKDIEV
515
Query: 421 SGISIHPIPFSIKLISRN 438
SGISIHPIPFSIKLISRN
Sbjct: 516 SGISIHPIPFSIKLISRN 533
>aa-ng6320
Contig_2573
pred cdna pORF 634 2249 strand
f start y stop y ph 0
MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV
YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFADYDQW
QKTRKLVSSALTKTKIKTFNNLIEKQTENLIESMNEFSNKNELFHPRKYLTKYSLNIILS
MLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI
YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLAGSDTQKATQEW
FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQQ
QVNNKNSIDSSESSNLEFFDDLEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS
SINNQIIDDNEVFGVSINPKEFSIKLTKR
>JC1b162b04.s1
Clone JC1b162b04, standard read, bases 71 through 684, from
2000-09-07
Length = 612
Plus Strand HSPs:
Supports
ng6320 seq
Query: 78
LTKYSLNIILSMLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFS
137
LTKYSLNIILSMLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFS
Sbjct: 1
LTKYSLNIILSMLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFS 180
Query: 138
NKQYKKYIFDIYYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLA 197
NKQYKKYIFDIYYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLA
Sbjct: 181
NKQYKKYIFDIYYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLA 360
Query: 198 GSDTQKATQEW 208
GSDTQKATQEW
Sbjct: 361 GSDTQKATQEW 393
>JAX4a216e09.r1
Clone JAX4a216e09, reverse read, bases 164 through 715, from
1999-02-08
Length = 550
Minus Strand HSPs:
Score = 446 (162.1 bits), Expect =
6.7e-57, Sum P(2) = 6.7e-57
Identities = 86/87 (98%), Positives =
86/87 (98%), Frame = -3
Query: 209 RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQQ
268
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLV EPDQFKPERWINYKNQQQQKQQQQQQ
Sbjct: 263
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVXEPDQFKPERWINYKNQQQQKQQQQQQ 84
Query: 269 QVNNKNSIDSSESSNLEFFDDLEKVSN
295
QVNNKNSIDSSESSNLEFFDDLEKVSN
Sbjct: 83
QVNNKNSIDSSESSNLEFFDDLEKVSN 3
>JAX4a216e09.r1
Clone JAX4a216e09, reverse read, bases 164 through 715, from 1999-02-08
translate frame +1 translate plus frames translate all frames
GGATTTGATACTTTTTCTAAATCATCAAAAAATTCTAAATTACTTGATTCACTACTATCA
ATTGAATTTTTATTATTTACTTGTTGTTGTTGTTGTTGTTGTTTTTGTTGTTGTTGATTT
TTATAATTTATCCATCTTTCTGGTTTAAATTGATCAGGTTCANAAACTAATAAATTGCCC
ATTCCAAATATATTTAAAAGTATTTGAGTACCTTTTGGTATGAACATACCATTACTTAAA
TAAGTATCATCAATTGAAATTCTTGGTAAACCAAGTGGTCCAGGTGAACGAATACGGAAA
ACTTNTTTAATAATTGAAATAAAGNATGGACAATTTTCAATATGATTTGANGTTACAAAT
TTACAATCTTTACCAACCACACCTTATTAATTCTTGATATGCTTTCTTTTGAACATCTGG
ATTATTAATTAAATAAAGGCAAAACCATTCTTGAGTTGCTTTTTGTGTATCTGAACCTGC
CAATAGGAAATCCATTCCGACCAATATTACTGGCTCTTTTTCTCTACCTTGACTTGCAAT
TATTAGTTGA
>JC3f115e15.s1
Clone JC3f115e15, standard read, bases 27 through 739, from
2002-12-10
Length = 711
Minus Strand HSPs:
Supports
ng6320 seq
Query: 1
NFANRPQPKSIRVITNFKGLAFADYDQWQKTRKLVSSALTKTKIKTFNNLIEKQTENLIE 60
NFANRPQPKSIRVITNFKGLAFADYDQWQKTRKLVSSALTKTKIKTFNNLIEKQTENLIE
Sbjct: 367
NFANRPQPKSIRVITNFKGLAFADYDQWQKTRKLVSSALTKTKIKTFNNLIEKQTENLIE 188
Query: 61
SMNEFSNKNELFHPRKYLTKY-SLNIIL 87
SMNEFSNKNEL + + Y +NI++
Sbjct: 187 SMNEFSNKNELVNNYYFYYYY*HINILI
104
>IIAFP1D53328
Length = 865
Minus Strand HSPs:
Supports
ng6320 seq
Query: 273
KNSIDSSESSNLEFFDDLEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS 328
K +
SESSNLEFFDDL+KVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS
Sbjct: 567
KIQLIGSESSNLEFFDDLKKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS 400
>CYP519B1
Seq (35+44+75) complete seq 51% to seq 12 496 aa
Length = 488
Score = 2147 (755.8 bits), Expect =
1.2e-257, Sum P(2) = 1.2e-257
Identities = 412/431 (95%), Positives =
420/431 (97%)
Query: 1
MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV 60
MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV
Sbjct: 1
MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV 60
Query: 61
YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFADYDQW 120
YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAF + DQW
Sbjct: 61
YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFGEGDQW 120
Query: 121 QKTRKLVSSALTKTKIKTFNNLIEKQTENLIESMNEFSNKNELFHPRKYLTKYSLNIILS
180
QKT+KLVSSALTKTKIKTFNNLIEKQ+ENLIESMNEFSNKNELFHPRKYLTKYSLNIILS
Sbjct: 121
QKTKKLVSSALTKTKIKTFNNLIEKQSENLIESMNEFSNKNELFHPRKYLTKYSLNIILS 180
Query: 181
MLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI 240
MLFSKEIGKNESINKG+MERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI
Sbjct: 181
MLFSKEIGKNESINKGSMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI 240
Query: 241
YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLAGSDTQKATQEW 300
YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKE VILVGMDFLLAGSDTQKATQEW
Sbjct: 241
YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKEPVILVGMDFLLAGSDTQKATQEW 300
Query: 301
FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP 360
FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP
Sbjct: 301
FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP 360
Query: 361
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQ- 419
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQ QKQQ Q
Sbjct: 361
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQHQKQQPQPT 420
Query: 420 --QQVNNKNSI 428
++V+N S+
Sbjct: 421 TLKKVSNPFSL 431
Score = 320 (112.6 bits), Expect =
1.2e-257, Sum P(2) = 1.2e-257
Identities = 65/68 (95%), Positives =
66/68 (97%)
Query: 442
LEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELSSINNQIIDDNEVFGVSINPKE 501
L+KVSNPFSLGPRNC GMAIAKSSIYSVCSNILLNFELSSINNQIIDDNEVFGVSINPKE
Sbjct: 422
LKKVSNPFSLGPRNC-GMAIAKSSIYSVCSNILLNFELSSINNQIIDDNEVFGVSINPKE 480
Query: 502 FSIKLTKR 509
FSIKL KR
Sbjct: 481 FSIKLAKR 488
>aa-ng6352
Contig_2589
pred cdna pORF 2441 615 strand r
start y stop y ph 0
MGIGLIILYLLIGLLAYDFTKKNKKISKNDPKQPLAIPVLGHLHLFGSQPHRSLTELAKK
FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES
WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM
SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT
KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST
LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL
SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM
EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG
KKIDETEVFGITIHPKDFSIQLKKRE
>CYP519E1
seq 84+17 41% to seq 14
Length = 467
Note add
to exon 2 as shown below
Check
between PERW and heme
EST
supports extra piece here add as above
Score = 2039 (717.8 bits), Expect =
4.5e-251, Sum P(2) = 4.5e-251
Identities = 401/434 (92%), Positives =
403/434 (92%)
Query: 1 MGIGLIILYLLIGLLAYDFTKKNKKISKNDPKQPLAIPVLGHLHLFGSQPHRSLTELAKK
60
MGIGLIILYLLIGLLAYDF
QPLAIPVLGHLHLFGSQPHRSLTELAKK
Sbjct: 1
MGIGLIILYLLIGLLAYDF-------------QPLAIPVLGHLHLFGSQPHRSLTELAKK 47
Query: 61
FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES 120
FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES
Sbjct: 48
FGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASSESIRIYSGNLVDISFSVGES 107
Query: 121
WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM 180
WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM
Sbjct: 108
WKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEPFFPHKYYNKYTMNIVM 167
Query: 181
SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT 240
SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT
Sbjct: 168
SIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGDYVWYTQPFFYFKNKKLEQDT 227
Query: 241
KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST 300
KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST
Sbjct: 228
KKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTGGKEKDMVIHVSTDFLLAGSDTNAST 287
Query: 301
LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL 360
LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL
Sbjct: 288
LEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLVGAIKETLRMRTPAPL 347
Query: 361
SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM 420
SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVE +K
Sbjct: 348 SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVENDLDKVVLPFS
407
Query: 421 EATTETKSNITTEI 434
NI +EI
Sbjct: 408 IGPRNCPGNIISEI 421
Score = 366 (128.8 bits), Expect =
4.5e-251, Sum P(2) = 4.5e-251
Identities = 70/70 (100%), Positives =
70/70 (100%)
Query: 437
NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEVFGITIHPK 496
NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEVFGITIHPK
Sbjct: 397
NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEVFGITIHPK 456
Query: 497 DFSIQLKKRE 506
DFSIQLKKRE
Sbjct: 457 DFSIQLKKRE 466
>aa-ng6400
Contig_2611
pred cdna pORF 90 1077
strand f start n stop y ph 0
DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST
VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS
IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG
LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF
GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK
>CYP508B1
Seq. 15, 77 50% identical to 508A1
Length = 485
missing N-term
Use my seq
Score = 1576 (554.8 bits), Expect =
1.1e-165, P = 1.1e-165
Identities = 300/300 (100%), Positives =
300/300 (100%)
Query: 1
DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST 60
DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST
Sbjct: 185
DISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFST 244
Query: 61
VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS 120
VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS
Sbjct: 245
VKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQTSLDMFLAGNDTTAAS 304
Query: 121
IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG 180
IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG
Sbjct: 305
IIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLNAIIKEILRLNPVGPFG 364
Query: 181
LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF 240
LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF
Sbjct: 365
LPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQFDPSRFLNKNNDAYMPF 424
Query: 241
GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK 300
GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK
Sbjct: 425 GVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFGLTLKPNKFKVLLESRK
484
>aa-ng6418
Contig_2617
pred cdna pORF 14697
9940 strand r start y stop y ph 0 not a P450
MKFQKHLKNVTVWNHHYLPYQKLKRKLRTFFSPPQSPSDSTPPPPTTTTTTASTTTVNNN
NRAISISDGGNGNLNFLDILENEIKKVNQFILDEKLTIVMNRFEDLKLKFKSIDFSSPFS
VYLSVGLMTPQQIEDNNNNYLNTIQSNISNNNNNNNNNSNNNNNNNNSSNNNNSNINNST
DNNKDLSTPLDQSSSSLYFFSEQQQQPPGQGSNDVDTVTAFQIAFEDILKQLAEIDQFSQ
LNIQAIQKLLNTYSKKKIKTLISDRKKRKVSQTETTLPKMFHLEINNNHNSNNSNNNNNN
NNNNNNNNNNNNNNNNNNNNSNNTLSPATNTISILSKEQLIEDVKKDIDNYFKEKVEVLE
LYHHTQIKKLTLEIELEYQTICQTNVHLLNNNYYDILNIKEKELSNKVESLCRNVVNNNI
SIVKSLIEDIKNLMRDELKLEQVNNTMWIETIHSCIHKSCYLGNVEMVKFLFNTFNDLKM
VNIKYCDEKNKQFIHLAAENGRLSVMEFLIENGADIECLDFLSRRPLHLSAKNGKYDCVK
LLLEKGAKQDVLDREGCTPLYLACRQPPSANNIINSSGDISLSVVDLLLLSGSPLTRGPY
GRHILHEASSKGNCNNINSILKIDPDSANSTDSFGRTPLFESIKHGQLQAISTLLEKNAQ
IVSVFDDDKRTPLHEAAANGQLDSMHLLINNILENLKEDQVSKVVNQQDDDGWSPLHDAC
YLNYIHCVKCLLENKANPNLLDKGEWSPIVHSLYRGNIQAAITIIEFIKKNPDLFQDTTT
TTTLSSSNTIPQTTLKDSKNLSCSSSGVPLITASTVGTTVLPPKLDIVDLNKSISNLNLS
SNIILKSPESELLSPTPSSSSNNSSSISVTPSTSPSTSKKILKSLMSTVTFRIQLVSTTI
KPGQKVGIVGNRKAIGSWNPSAALMFREVDNIGSGGWIGKVQVPVNVQLEYKYVIFEGSR
LDTWEALPENRKFTPQEEEEFVDDGVFGQITFDEDSVTAEQSKQSLFMERGWLVQDTQIR
IRFGETSPTDLQRKIKQPITLYNGKEQAGKIVVTLRDRSSNIQSGTGLSQVNLPLTKDHS
VQFQTNQLKPNSYHSIQFDIYRRGMSSVLIGRSIVLVSELTSTNPYKKVVPIINSSLLSV
GELYFFPLIVSPFTHSKISDVLSNTYWKSTLLIGHRGGGAENARNVGKYKRTHIKENTIL
SFVTAASLGAQYIEFDVQLSRDNIPVIYHDFEILGQDGIKIPINKINLEQFKSLQQEKKN
SSNNNNNNNNNNSNNNNNNNNKNNSNRINNNNSNQHDENNKNNISKTLPSLKTLRGRSKS
MQDIFNDFGTSSSNILANLRNSGIINVNNTPSSPLASSSSSANSSQPISDSFTTLEETFK
NVPIATGFNIEIKYPNQEKEMEIQMNNMNRNAYVDTILKVVFEHAGNRSVMFSSFDADIC
LLCSLKQPKYPVFFLNNAGFTQHADPRANSISEAIRFSKSAHLLGIVTNSKILCEAPPII
GQVKSAGLMLCSWGSENNDPALVDLQEQLGCDGVISDHIAYISKHYYTNNNTIINNQNNQ
>aa-ng6446
Contig_2629
pred cdna pORF
2447 885 strand r
start y stop y ph 0
MNYLVLILVSLVSIYFLFIKNQDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYLEW
FEKYGPVFRVSIGSLETVVLTGYPILREAFIDNSDTFTSRFQRENARSINGYKGLINSNG
DYHKNLKSVILSEMTATKIKKMESHINQESKRLCELLDQHAKQGTPFTMNKYLNLFSINI
ILRFLFGVNYPYTELDDGSSSIIQVIQQFLKLVSQPSITTYFPILSPFMNDRSKEFYDIH
KLLSNHIINLIERYKDSKQHQQQEQEPIDGATEPTVTILDKLLIEVENNRITQNALISIC
IDVLIAGTDTVGQTLSFAIVALVNNAEIQEKLSRNIIDSMEKGDNHYSYSKYRNGIPYLA
LVVKEVFRMYPAGILGLPHMTSEDCEIQGHKIAKGTQIIQNIYSTHRSESFWPNPNNFIP
ERHIQNDVSKSVHFAVGTRNCMGMSLSEAEVHTAMAELFGNFKFTNPSNIPLNDQGTFSV
ALNCPDFFVKIERRN
>CYP513C1
Seq 5+56 complete seq 43% to seq 7 486 aa
Length = 496
Score = 2559 (900.8 bits), Expect =
7.3e-270, P = 7.3e-270
Identities = 495/495 (100%), Positives =
495/495 (100%)
>aa-ng6889 Contig_2828 pred cdna pORF 197 2177 strand f start y stop y ph 0
MKNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEY
PIIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYE
TSFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASE
FIKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLITKFKNENEIHNN
INDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINN
PNIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFR
KYPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQ
SANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSC
PENKIIIKKRF
>CYP513A1
Seq. 8+53 complete same as 8b
Length = 510
1aa diff N-term missing
Most seqs
have T at this site so change seq to T add N-term
Score = 2551 (898.0 bits), Expect =
5.1e-269, P = 5.1e-269
Identities = 489/490 (99%), Positives =
490/490 (100%)
Query: 2
KNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEYP 61
KNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEYP
Sbjct: 20 KNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYGVIYRIKMGNVETVVLTEYP
79
Query: 62
IIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYET 121
IIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYET
Sbjct: 80
IIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNKLKTLVLSEMTNQRIKKYET 139
Query: 122
SFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASEF 181
SFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASEF
Sbjct: 140
SFIDNEIKKLFKVLDEHADTGKPIILNNHIKMFSMNIVLCFTFGLNYSYPYDEFEKASEF 199
Query: 182
IKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLITKFKNENEIHNNI 241
IKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLI+KFKNENEIHNNI
Sbjct: 200
IKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFNYTSDLISKFKNENEIHNNI 259
Query: 242 NDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINNP
301
NDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINNP
Sbjct: 260
NDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAGADTSANTILYCLLELINNP 319
Query: 302 NIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFRK
361
NIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFRK
Sbjct: 320
NIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFNKYRTLAPYLSMVVKETFRK 379
Query: 362
YPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQS 421
YPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQS
Sbjct: 380
YPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEKQFSEPDSFIPERFISQQQS 439
Query: 422
ANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSCP 481
ANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSCP
Sbjct: 440
ANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNPNPSTPLNEIGKFGITYSCP 499
Query: 482 ENKIIIKKRF 491
ENKIIIKKRF
Sbjct: 500 ENKIIIKKRF 509
>aa-ng7190
Contig_2949 pred cdna pORF 4620 3115 strand r
start y stop y ph 0
MIGTIAVLVIAILSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIFTLHLMGF
KMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLRFVKNGLV
LSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSINKSLGQSG
QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV
LYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEYFDETQKDI
NDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKGKTIPAGHI
LAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFGILQIKTIW
TVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK
>VFN761
(VFN761Q) /pub/dna_csm/LIBRARY/VF/VFN7-C/VFN761Q.Seq.d/
Length = 1049
Plus Strand HSPs:
EST
supports ng7190
Query: 1
QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV 60
QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV
Sbjct: 358
QLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDV 537
>CYP51
Seq 19 complete 26% to seq 8
Length =
466 mid and N-term diffs
Score = 2149 (756.5 bits), Expect =
2.0e-226, P = 2.0e-226
Identities = 428/465 (92%), Positives =
433/465 (93%)
Query: 1
MIGTIAVLVIAIL-------SPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF 53
MIGTIAVLVIAIL
SPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF
Sbjct: 1 MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF 60
Query: 54
TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR 113
TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR
Sbjct: 61
TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR 120
Query: 114
FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN 173
FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN
Sbjct: 121
FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN 180
Query: 174 KSLGQSGQLADLYHELEEGLNPISFF-FPNLPLPSFKKRDAARAKVAAIFHSIIQERRRS 232
KSLGQSGQLADLY ++ I F F
+
KR+
+ F SIIQERRRS
Sbjct: 181 KSLGQSGQLADLYPNWKKV--SIQFHSFSQISHYHHSKRETLPVLKSRHFPSIIQERRRS 238
Query: 233 TDDSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEY
292
TDDSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEY
Sbjct: 239
TDDSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEY 298
Query: 293
FDETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKG 352
FDETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKG
Sbjct: 299
FDETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKG 358
Query: 353
KTIPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFG 412
KTIPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFG
Sbjct: 359
KTIPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFG 418
Query: 413
ILQIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK 459
ILQIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK
Sbjct: 419
ILQIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK 465
>aa-ng7890
Contig_3262
pred cdna pORF 3080
5244 strand f start y stop y ph 0 reductase
MEILESIDFIEVLILDNLGAIIIVAVIVGTYLYMNKPPPPPPVFNKPNNKINKEAQKPKK
TITKNEDGKKVMKIFFGTQTRTAEDFSRIIEKECKKIGIPCEVVDLESYEHEQELHSESF
VMFLVATHGEGDPTDNAKEFYLWLTNDERPTDLLNGVPFTVFGLGNKTYEHYNAVARVID
RRMEELGGKRVFERGEGDDDATLEEDFNRWKKDMWPVVCKFLGYELKSTEDDKFVPRFRM
VTLNQDSKDINDPFIKIVSTPLKPKLSTDNKVIYDMKNPYYAEVLENRELHSNESDRSCR
HIEFKLGDEVSYTTGDHLGVFPINDSKLVEQLIKRLGVNGDDMIALVPIDQEGSVIKASF
GPMTIRRAFSEHLDITNPVRKSVLRALAESTTNEEEKKRLLYLATEEANEEYNKYIKNDF
RGVVDLLESFPGLQPLIAHFLEFTPRLPARMYSISSSPHNKNGVVSITSVVVNFTTGNQR
AHNGVASTWLSHLKVGDKVPLFVRESHFKLPSAATEQKPVIMVGPGTGLAPFRGFLQELQ
HRNHSQQQQSLLFFGCRSDTVDYIYREELEQYHQSSVLGDLVVAFSRKTSQKVYVQNKLL
EHKEKVWELLNKGAYFYVCGDGRNMSKAVQQALLSIIKEFGSKDDNSAQQFIDDMSSHGR
YLQDVWF
>aa-ng8085
Contig_3339
pred cdna pORF 1808 75
strand r start n stop y ph 0
LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT
DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK
TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA
MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL
DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA
YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT
IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYK
KQKDLLKEKEMQQLQEGADVVIDNKN
NIKNNNSSNKPNSKTNSIFDD
LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELK
SIDGKPIDDTEVYGLTIHTKVHPISLTLRP
Add large
insert and N-terminal seq
>CYP519A1
Seq. 12 32% to 508 complete sequence
Length = 484
joint at end differs N-term missing
Score = 2085 (734.0 bits), Expect =
8.8e-255, Sum P(2) = 8.8e-255
Identities = 395/397 (99%), Positives =
395/397 (99%)
Query: 1
LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT 60
LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT
Sbjct: 21
LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFILT 80
Query: 61
DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK 120
DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK
Sbjct: 81
DPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKLK 140
Query: 121
TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA 180
TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA
Sbjct: 141
TIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKGA 200
Query: 181
MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL 240
MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL
Sbjct: 201
MEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTNL 260
Query: 241
DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA 300
DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA
Sbjct: 261
DTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKKA 320
Query: 301
YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT 360
YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT
Sbjct: 321 YQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGYT
380
Query: 361
IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYKKQK 397
IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYK
K
Sbjct: 381
IPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYKLDK 417
Score = 355 (125.0 bits), Expect =
8.8e-255, Sum P(2) = 8.8e-255
Identities = 69/69 (100%), Positives =
69/69 (100%)
Query: 442
LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKV 501
LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKV
Sbjct: 415 LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKV
474
Query: 502 HPISLTLRP 510
HPISLTLRP
Sbjct: 475 HPISLTLRP 483
>aa-ng8864
Contig_3673
pred cdna pORF 2489 854 strand r
start n stop y ph 0
YKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI
VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK
LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN
DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ
QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF
MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR
VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN
QFGNR
DDSIDTTNTTTNTTLNGTTS
KYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLN
FTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS
>CYP520A1
seq (47+68+80) complete 47% to seq 58 517 aa
Length = 517
n_term missing Joint at end differs 6aa diffs
Score = 2164 (761.8 bits), Expect =
3.8e-265, Sum P(2) = 3.8e-265
Identities = 412/415 (99%), Positives =
414/415 (99%)
Query: 1
YKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI 60
+KKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI
Sbjct: 24
FKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI 83
Query: 61 VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK
120
VTDP+IIQDI IKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK
Sbjct: 84
VTDPKIIQDIGIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK 143
Query: 121
LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN 180
LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN
Sbjct: 144
LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN 203
Query: 181
DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ 240
DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ
Sbjct: 204
DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ 263
Query: 241
QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF 300
QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF
Sbjct: 264
QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF 323
Query: 301
MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR 360
MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR
Sbjct: 324
MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR 383
Query: 361
VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN 415
VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN
Sbjct: 384 VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN
438
Score = 374 (131.7 bits), Expect =
3.8e-265, Sum P(2) = 3.8e-265
Identities = 74/78 (94%), Positives =
74/78 (94%)
Query: 441
KYYNDLERV-SIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQY 499
KYYNDLERV
SIPFGVGKRRCMAPSMADHNV IAMANIVLNF MKSSDPKQMPLSEE QY
Sbjct: 440
KYYNDLERVISIPFGVGKRRCMAPSMADHNVFIAMANIVLNFAMKSSDPKQMPLSEEGQY 499
Query: 500 AITIKPKYPFKVLFEKRS 517
AITIKPKYPFKVLFEKRS
Sbjct: 500 AITIKPKYPFKVLFEKRS 517
>IIAFP1D30759
Length = 967
Plus Strand HSPs:
Score = 248 (92.4 bits), Expect =
2.4e-20, P = 2.4e-20
Identities = 51/62 (82%), Positives =
52/62 (83%), Frame = +3
Query: 1
VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWN-KVRPKMTCHFT-K 58
VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDE PKMT F
K
Sbjct: 600
VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEME*S*DPKMTLSFPLK 779
>JC1a198c01.s1
Clone JC1a198c01, standard read, bases 84 through 533, from
2000-05-12
Length = 448
Minus Strand HSPs:
Score = 729 (261.7 bits), Expect =
5.6e-71, P = 5.6e-71
Identities = 139/139 (100%), Positives =
139/139 (100%), Frame = -1
Query: 17
DTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLN 76
DTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLN
Sbjct: 448
DTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLN 269
Query: 77
GTTSKYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEE 136
GTTSKYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEE
Sbjct: 268
GTTSKYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEE 89
Query: 137 EQYAITIKPKYPFKVLFEK 155
EQYAITIKPKYPFKVLFEK
Sbjct: 88 EQYAITIKPKYPFKVLFEK 32
>JC2c167d08.r1
Clone JC2c167d08, reverse read, bases 38 through 462, from
2000-01-18
Length = 423
Plus Strand HSPs:
Score = 615 (221.5 bits), Expect =
7.2e-59, P = 7.2e-59
Identities = 117/117 (100%), Positives =
117/117 (100%), Frame = +2
Query: 41 NPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLNGTTSKYYNDLERVSIPFGVGKRRC
100
NPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLNGTTSKYYNDLERVSIPFGVGKRRC
Sbjct: 2
NPSRWLEYKKMKDLNQFGNRDDSIDTTNTTTNTTLNGTTSKYYNDLERVSIPFGVGKRRC 181
Query: 101
MAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS 157
MAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS
Sbjct: 182
MAPSMADHNVLIAMANIVLNFTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS 352
>aa-ng8897
Contig_3681
pred cdna pORF 9565
10869 strand f start y stop y ph 0
MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI
VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN
GTVFFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEENYDLLHTFIKEVGIK
DKESVKSIVSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVED
RIYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALN
HDESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITP
ITNEPIDTTENFGINIRPNSFKINVYKRN
N-term
missing
Deleted
one seq as an intron between GTV and FLKP
There is a
frameshift there
Two other
small diffs
>CYP508E1
seq (71+85) complete seq 38% to 508A1
Length = 481
Score = 1317 (463.6 bits), Expect =
3.0e-138, P = 3.0e-138
Identities = 258/267 (96%), Positives =
260/267 (97%)
Query: 123
VFFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEENYDLLHTFIKEVGIKDK 182
+ FLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNE DLLHTFIKEVGIKDK
Sbjct: 218
ISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNE-----DLLHTFIKEVGIKDK 272
Query: 183
ESVKSIV-SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVEDR 241
ESVKS+
SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVEDR
Sbjct: 273
ESVKSMFHSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGRVEDR 332
Query: 242
IYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALNH 301
IYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALNH
Sbjct: 333
IYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKGHFIPKGSHVVINYRALNH 392
Query: 302
DESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITPI 361
DESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITPI
Sbjct: 393
DESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANHELFLFVSNFILNYKITPI 452
Query: 362 TNEPIDTTENFGINIRPNSFKINVYKRN
389
TNEPIDTTENFGINIRPNSFKINVYKRN
Sbjct: 453 TNEPIDTTENFGINIRPNSFKINVYKRN
480
Score = 715 (251.7 bits), Expect =
1.8e-74, P = 1.8e-74
Identities = 156/204 (76%), Positives =
166/204 (81%)
Query: 1
MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI 60
MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI
Sbjct: 75
MVSDYKIIEELFIKNHNSFLERPITPSFSHCSDNQNGILLSNEKWVNNREMIQKAMKKGI 134
Query: 61
VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN 120
VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN
Sbjct: 135 VKSVYELLNKQINDLIQSVKPFSESGEPLNFRLFATRFTLSTMMTYIFNEPLSYNENLEN
194
Query: 121
GTVF--FLKPIYSL-----------FLK-LTDSNVPLARDYV-YKKFYEHLKTIDNEKE- 164
GTV FL+ I+ L
FLK + + L V + Y + K
++ K
Sbjct: 195
GTVLYMFLENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTI 254
Query: 165 ENYDLLHTFIKEVGIKDKESVKSI 188
+N DLLHTFIKEVGIKDKESVKS+
Sbjct: 255 DNEDLLHTFIKEVGIKDKESVKSM 278
>JC1c218c07.r1
Clone JC1c218c07, reverse read, bases 61 through 714, from
2001-01-11
Length = 652
Plus Strand HSPs:
Score = 311 (114.5 bits), Expect =
8.6e-79, Sum P(3) = 8.6e-79
Identities = 65/68 (95%), Positives =
66/68 (97%), Frame = +2
Query: 101
NYDLLHTFIKEVGIKDKESVKSMFHSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELI 160
NYDLLHTFIKEVGIKDKESVKS+
SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELI
Sbjct: 452
NYDLLHTFIKEVGIKDKESVKSIV-SNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELI 628
Query: 161 FKELKSIV 168
FKELKSIV
Sbjct: 629 FKELKSIV 652
Score = 307 (113.1 bits), Expect =
8.6e-79, Sum P(3) = 8.6e-79
Identities = 59/59 (100%), Positives =
59/59 (100%), Frame = +3
Query: 42
LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE 100
LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE
Sbjct: 201 LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE
377
Score = 215 (80.7 bits), Expect =
8.6e-79, Sum P(3) = 8.6e-79
Identities = 41/41 (100%), Positives =
41/41 (100%), Frame = +2
Query: 1
EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF 41
EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF
Sbjct: 80
EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF 202
>JC1c218c07.r1
Clone JC1c218c07, reverse read, bases 61 through 714, from 2001-01-11 translate
frame +1 translate plus frames translate all frames
TGTTAAAAGTGTTTATGAATTATTAAATAAACAAATCAATGATTTAATTCAATCAGTTAA
ACCATTTTCAGAAAGTGGTGAACCATTAAACTTTCGTTTATTTGCCACAAGATTTACATT
ATCAACTATGATGACTTATATTTTCAATGAACCATTATCATATAATGAAAATTTAGAAAA
TGGGACAGTTTTGTACATGTTTAGAAAATATTTTTGTCTTGGCTGATGTTGGTCATATTG
GTGATTATATTAGTTTCTTAAAACCAATTTATTCTTTATTTTTAAAATTAACAGATTCAA
ATGTTCCATTAGCAAGAGATTATGTATATAAAAAATTTTATGAACATTTAAAAACAATTG
ATAATGAAAAAGAAGAGTAAATATATATTTAAAAAATTTATTTTTTTATTATTATTATTC
TAATAATTTTTTTTTTTTAAAACTTTTTAGAAATTATGATTTATTACATACATTTATTAA
AGAAGTTGGAATCAAAGATAAAGAAAGTGTTAAATCAATTGTTTCAAATTCATTAGATTT
ATTAGTTGCAGGTACAGATACTGCTGCAAAAGGAATTGAATGGATTATATTAAGAATTGC
AAATAATCAAGATGTTCAAGAATTAATTTTTAAAGAATTAAAAAGTATTGTA
>aa-ng9107
Contig_3767
pred cdna pORF 15219 13452
strand r start y stop y ph 0
MIILLLSIIIFILYIVKIFKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF
KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY
WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI
MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV
WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE
TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP
VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS
TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY
NK
>CYP516A1
seq 9, 43 complete seq 39% to 10, 42; 30% to 508A1
Length = 485
first intron joint differs
Score = 2465 (867.7 bits), Expect =
6.6e-260, P = 6.6e-260
Identities = 478/481 (99%), Positives =
478/481 (99%)
Query: 1
MIILLLSIIIFILYIVKIFKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF 60
MIILLLSIIIFILYI
FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF
Sbjct: 1
MIILLLSIIIFILYI---FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMF 57
Query: 61
KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY 120
KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY
Sbjct: 58
KYGNVMTVYFANIKTVVISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQY 117
Query: 121
WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI 180
WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI
Sbjct: 118
WKNYRQILAQSFLKLRDHKGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVI 177
Query: 181
MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV 240
MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV
Sbjct: 178
MQLLYSHRSSYDLKGQHPIIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQV 237
Query: 241
WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE 300
WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE
Sbjct: 238 WSYSQDSIKEHREKLLINPEKIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTE
297
Query: 301
TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP 360
TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP
Sbjct: 298 TSSSTMSWLLLFLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRP
357
Query: 361
VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS 420
VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS
Sbjct: 358
VGALSLPRIASEDVTCGPYTIEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQS 417
Query: 421
TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY 480
TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY
Sbjct: 418
TYSFIPFGCGSRVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDY 477
Query: 481 N 481
N
Sbjct: 478 N 478
>aa-ng12432
Contig_5098
pred cdna pORF 64
307 strand f start n stop y ph 2 = 519D1
NSKHKYYNDLDKISIPFSTGRRACVGGQLGEAELYIVCANLVYNFKIESWDGKKINELED
FGIIIHPSSHNLKITKRNNK
>CYP519D1
Seq. (14+34) complete seq 38% to seq 12 492 aa
Length = 493
2 diffs N-term missing
Score = 406 (142.9 bits), Expect =
1.0e-41, P = 1.0e-41
Identities = 76/78 (97%), Positives =
76/78 (97%)
Only one
seq supports ng12432 = IIAFP1D52853, three seqs support 519D1 SSI404
IIAFP1D78046 JC3b13f10.s1 so keep seq as is
Query: 3
KHKYYNDLDKISIPFSTGRRACVGGQLGEAELYIVCANLVYNFKIESWDGKKINELEDFG 62
KHKYYNDLDKISIPFSTGRR CVG QLGEAELYIVCANLVYNFKIESWDGKKINELEDFG
Sbjct: 415
KHKYYNDLDKISIPFSTGRRGCVGVQLGEAELYIVCANLVYNFKIESWDGKKINELEDFG 474
Query: 63 IIIHPSSHNLKITKRNNK 80
IIIHPSSHNLKITKRNNK
Sbjct: 475 IIIHPSSHNLKITKRNNK 492
>aa-ng10297
Contig_4318
pred cdna pORF 525 2131 strand
f start y stop y ph 0
MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK
IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKSSENIVNSNGLIWKKLQ
SISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFLF
NFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYEY
IENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDTN
SLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRFT
VSPLSQPHRTNKDIILNGYFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLEK
KLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKIN
LIKR
>CYP555A1
seq 76 41% to seq 4
Length = 486 4 diffs at
two locations
Score = 2495 (878.3 bits), Expect =
4.4e-263, P = 4.4e-263
Identities = 481/485 (99%), Positives =
482/485 (99%)
Query: 1
MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK 60
MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK
Sbjct: 1
MIIIVIVVFLFYFSFLNLNLNPKKKRPPSPITLPVIGNLISLLNNQPQNILFNYYKKYGK 60
Query: 61
IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKS-SENIVNSNGLIWKKL 119
IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFK +ENIVNSNGLIWKKL
Sbjct: 61
IYQLQYGIVNTVVLSEFDILKEAFIENGEVFIERYNKITKKFKIIAENIVNSNGLIWKKL 120
Query: 120
QSISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFL 179
QSISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFL
Sbjct: 121
QSISIQELSPNIKIKKYEPMIINETNKLIDSFNEHIKSNESIDPTLNIKICFLNIIISFL 180
Query: 180
FNFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYE 239
FNFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYE
Sbjct: 181 FNFRYNDYKDEKVIQLVDYIHSIFRMGSHPIPQDYIPILNKFYINKTTKIHQKIFENIYE
240
Query: 240
YIENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDT 299
YIENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDT
Sbjct: 241
YIENQVQKRLEILNKNNNNNNNNINECFVDLLLLKFKSNLLTWNEVIKTTTDLMIAGSDT 300
Query: 300
NSLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRF 359
NSLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRF
Sbjct: 301
NSLFTIHLIIALTNRENIQNKVFNEILNFYILNENNKITFSNKSKTPYYNSVLKEVERRF 360
Query: 360
TVSPLSQPHRTNKDIILNGYFIPSGSQIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLE 419
TVSPLSQPHRTNKDIILNGYFIPSGS IIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLE
Sbjct: 361
TVSPLSQPHRTNKDIILNGYFIPSGSHIIQNVYSCHLNDKDWENPFQFNPDRFLNNNQLE 420
Query: 420 KKLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKI
479
KKLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKI
Sbjct: 421
KKLITFGMGPRNCLGFQFALMSIWIVNLILFKSIKFSSNKLIEEEIREGGTTLSPFPFKI 480
Query: 480 NLIKR 484
NLIKR
Sbjct: 481 NLIKR 485
>aa-ng10310
Contig_4323
pred cdna pORF 4996
3612 strand r start y stop y ph 0
MEENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIEKKSIE
FVNSIEKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNS
HLTVFDCFGILSPFYDLFFKFRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDL
LIEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQD
TIKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQ
ICANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGR
FVAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND
>CYP517A2
SEQ 74 66% to seq 3
Length = 511
3 aa diffs missing N-term
Score = 2186 (769.5 bits), Expect =
2.4e-230, P = 2.4e-230
Identities = 413/416 (99%), Positives =
415/416 (99%)
Query: 2 EENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIEKKSIEF
61
EENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIE+KSIEF
Sbjct: 95
EENFFDRPSTFVPTWRYMSSNGGGIMSSNDEKWKRAKTTFLKSLKIHGKKYLIERKSIEF 154
Query: 62
VNSIEKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSH 121
VNS EKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSH
Sbjct: 155
VNSSEKFSNSNQVFYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSH 214
Query: 122
LTVFDCFGILSPFYDLFFKFRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDLL 181
LTVFDCFGILSPFYDLFF+FRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDLL
Sbjct: 215
LTVFDCFGILSPFYDLFFRFRLRPIEILKKTIDKQLTNHLNSMDSKMGDHQSRDIMDDLL 274
Query: 182
IEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDT 241
IEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDT
Sbjct: 275
IEYGSLNEISNQDRIQINQICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDT 334
Query: 242
IKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQI 301
IKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQI
Sbjct: 335
IKIKRNNVINDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQI 394
Query: 302
CANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGRF 361
CANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGRF
Sbjct: 395
CANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHFGIGQRGCPGRF 454
Query: 362
VAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND 417
VAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND
Sbjct: 455 VAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKPKDFGILLEKRNKLFVND
510
Frameshift
gone in ng10310
Correct
the three aa diffs and use my N-term
Dict-IV-V896c06.plc
supports these changes
>aa-ng12344
Contig_5063
pred cdna pORF 5408
4653 strand r start y stop y ph 0
MIFFTSNSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASS
TDTIMFCLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV
IKETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEE
LPNSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSC
PQNKIIIKKRI
>CYP513A2
seq 38 54% to seq 8 n-term
missing
Length = 452
Score = 1273 (448.1 bits), Expect =
1.4e-133, P = 1.4e-133
Identities = 245/248 (98%), Positives =
245/248 (98%)
Query: 3
FFTSNSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTD 62
FF
NSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTD
Sbjct: 203
FFYINSNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTD 262
Query: 63
TIMFCLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIK 122
TIMFCLIEL NLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIK
Sbjct: 263
TIMFCLIELPNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIK 322
Query: 123
ETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELP 182
ETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELP
Sbjct: 323
ETFRKHPVTAHETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELP 382
Query: 183
NSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQ 242
NSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQ
Sbjct: 383
NSNLVHFGVGVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQ 442
Query: 243 NKIIIKKR 250
NKIIIKKR
Sbjct: 443 NKIIIKKR 450
T
supported by two sequences P by only one.
T is the consensus
>JC1a87f04.r1
Clone JC1a87f04, reverse read, bases 46 through 667, from
2000-04-13
Length = 620
Plus Strand HSPs:
Score = 580 (209.2 bits), Expect =
2.5e-55, P = 2.5e-55
Identities = 113/113 (100%), Positives =
113/113 (100%), Frame = +1
Query: 1 SNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFC
60
SNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFC
Sbjct: 229
SNKYIEKFKSGKNSKQSKTENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFC 408
Query: 61
LIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 113
LIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV
Sbjct: 409
LIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 567
>JAX4a21c04.r1
Clone JAX4a21c04, reverse read, bases 45 through 797, from
1998-11-11
Length = 751
Minus Strand HSPs:
Score = 449 (163.1 bits), Expect =
1.6e-41, P = 1.6e-41
Identities = 89/95 (93%), Positives =
89/95 (93%), Frame = -1
Query: 19
TENKPIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIELTNLKNIQNKVYNEI 78
T
PIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIEL NLKNIQNKV NEI
Sbjct: 730
TRK*PIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIELPNLKNIQNKVXNEI 551
Query: 79
KQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 113
KQEIK QNINNNNNEDEVLIQYNKYRNSLPYFSMV
Sbjct: 550 KQEIKXQNINNNNNEDEVLIQYNKYRNSLPYFSMV 446
>IIAFP2D52585
Length = 809
Plus Strand HSPs:
Score = 282 (104.3 bits), Expect =
7.3e-24, P = 7.3e-24
Identities = 54/54 (100%), Positives =
54/54 (100%), Frame = +2
Query: 60 CLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV
113
CLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV
Sbjct: 50
CLIELTNLKNIQNKVYNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMV 211
>aa-ng10949
Contig_4574
pred cdna pORF 2253 638 strand r
start n stop y ph 2
NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF
VNNGDYFLDRPKIPSIRHATHYHGIATSSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV
DNLIESMNKIESDGQVFEPRIYFKKYTMAAMYKFIFNEEINFNNEISELIGPIEQVFKDL
GSGSLFDVLLISRPLYYQWIEHTDKNYPKILNFLKKKYHQHLKTYNPEIQRDLLDLLIKE
YYSGSDDDILTIIATINDLFLAGTDTSSASLEYMVMMLVNYPEIQEKVYDEIKLTVNGRN
KVLLSDRQFTPYTVSFIKETLRYKPPSSVGVPRTTSQDIIIGDKFIPKDAQIFINYYGLS
RNQDYFENPEQFEPSRFMNPDTNIAFLPFSIGTRNCVGQNFALDEMFLAFSNIILNFKFS
SIDGKQIDETELYGVTLRCKNKFNVSIKKRI
>CYP508A3
seq 21+67+72+82+87 57% to 508A1 458aa
Length = 477
missing N-term two other small
diffs
Correct
the two diffs add my N-term
Score = 2319 (816.3 bits), Expect =
2.0e-244, P = 2.0e-244
Identities = 446/451 (98%), Positives =
446/451 (98%)
Query: 1 NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF
60
NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF
Sbjct: 29
NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLSDPILIREMF 88
Query: 61
VNNGDYFLDRPKIPSIRHATHYHGIATSSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQV 120