828 sequences from Dictyostelium discoideum 
in 55 contigs (by David Nelson).  These sequences were completely searched for 
new hits and revised by Jinchuan Xing (March 19, 2001) and again by D. Nelson 
(August 15, 2001).  Many sequences have been joined, so the number of contigs has 
dropped to 55.  There are 45 full-length sequences and one more that lacks only 
the N-terminal exon.
We have made best estimates of some N-terminal exons, though this is difficult 
to do without EST data since these exons are very short and not well conserved.
We estimate that 15 families are represented.
Last modified August 16, 2001; revised again on Feb. 25, 2002 and May 16, 2003.

45 complete (42 genes and three pseudogenes)
(1+2 = 508A1), (3+61 = 517A1), (4+23+24+83 = 508C1), (5+56 = 513C1), (6 = 515B1), (7+50 = 513A3), 
(8+53 = 513A1), (9+43 = 516A1), (10+42 = 516B1), (11 = 514A1), (12 = 519A1), (13 = 521A1),
(14+34 = 519D1), (15+77 = 508B1), (17+84 = 519E1), (18 = 522A1), (19 = CYP51), (20 = 518A1), 
(21+67+72+82+87 = 508A3), (22+45+69+86+88 = 508A2), (26+28 = 518B1), (31+32+33 = 519C1), (35+44+75 = 519B1),
(38 = 513A2P), (39+73 = 519H1P), (40 = 554A1), (41P = 513G1P), (47+68+80 = 519F1), (49 = 513B1), 
(51+52+90 = 513E1), (54+55+78 = 513D1), (57 = 513F1), (58 = 519G1), (59+81 = 508D1), (62 = 525A1), 
(65 = 514A2), (66+70 = 508A4), (71+85 = 508E1), (74 = 517A2), (76 = 555A1), (79b = 515A1), (91 = 524A1), 
(92 = 556A1), 514A4 (new seq), 517A4 (new seq)

6 C-terminal pseudogene fragments (not included in the 45 complete seqs above)
(25+27+29+30 = 518A2P), (37a = 513E2P), (37b = 513E3P), (60 = 508B2P), (64 = 516A2P), (79a = 515A2P)

complete except for N-term exon
(37a = 513E2P)

4 contigs that do not include a C-terminal fragment
(36 = 517A3P), (46 = 519C2P), (48 = 516B2P), (89 = 514A3P)
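
The parenthesized groupings above follow a regular "(old seq labels = CYP name)" pattern, so they can be read into a lookup table for checking counts. The following is a minimal Python sketch, not part of the original notes. The function name parse_groupings and the rule that a name ending in "P" marks a pseudogene are assumptions made for illustration; unparenthesized entries such as "514A4 (new seq)" are not handled.

```python
import re

# Matches one grouping such as "(4+23+24+83 = 508C1)" or "(59+81= 508D1)".
GROUP_RE = re.compile(r"\(([^()=]+?)\s*=\s*([^()]+?)\)")

def parse_groupings(text):
    """Return {CYP name: [old sequence labels]} for every "(a+b = name)" group."""
    mapping = {}
    for labels, name in GROUP_RE.findall(text):
        mapping[name.strip()] = [label.strip() for label in labels.split("+")]
    return mapping

# A few groupings copied from the lists above.
sample = "(1+2 = 508A1), (3+61 = 517A1), (59+81= 508D1), (38 = 513A2P)"
groups = parse_groupings(sample)

# Assumption: a trailing "P" on the name marks a pseudogene.
pseudogenes = [name for name in groups if name.endswith("P")]
```

Running parse_groupings over the full lists and comparing len(groups) with the stated totals is one way to catch a grouping that was dropped or entered twice.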

For intron locations mapped to an alignment, see the Intron alignment map.

51     = seq 19       (1 intron)
508A1  = seq 1+2      (4 introns)
508A2  = seq 22+45+69+86+88 (3 introns)
508A3  = seq 21+67+72+82+87 (4 introns)
508A4  = seq 66+70    (3 introns)
508B1  = seq 15+77    (2 introns)
508B2P = seq 60       (2 introns and a corrupted heme region)
508C1  = seq 4+23+24+83 (4 introns)
508D1  = seq 59+81    (4 introns)
508E1  = seq 71+85    (2 introns)
513A1  = seq 8+53     (1 intron)
513A2P = seq 38       (0 introns, processed pseudogene)
513A3  = seq 7+50     (1 intron)
513B1  = seq 49       (1 intron)
513C1  = seq 5+56     (1 intron)
513D1  = seq 54+55+78 (1 intron)
513E1  = seq 51+52+90 (1 intron)
513E2P = seq 37a      (0 introns) complete except for missing N-term exon; 55% to 513E1 
513E3P = seq 37b      (insertion, deletion, frameshifts) upstream of 513E2P; only 429 bp between them  
513F1  = seq 57       (1 intron)
513G1P = seq 41       (0 introns, processed pseudogene)  37% to CYP513A1 
514A1  = seq 11       (3 introns)
514A2  = seq 65       (3 introns)
514A3P = seq 89       (1 intron with bad boundary)
514A4  = new seq      (3 introns) only 7 aa diffs to 514A1
515A1  = seq 79b      (6 introns)
515A2P = seq 79a      (3 introns, partial sequence, exons 4-7)
515B1  = seq 6        (2 introns)
516A1  = seq 9+43     (2 introns)
516A2P = seq 64       (0 introns, partial sequence)
516B1  = seq 10+42    (2 introns)
516B2P = seq 48       (1 intron, bad boundary, partial sequence)
517A1  = seq 3+61     (2 introns)
517A2  = seq 74       (2 introns)
517A3P = seq 36       (2 introns)
517A4  = new sequence (2 introns) only 13 aa diffs to 517A1 
518A1  = seq 20       (1 intron)
518A2P = seq 25+27+29+30 (0 introns, partial sequence)
518B1  = seq 26+28    (6 introns)
519A1  = seq 12       (2 introns)
519B1  = seq 35+44+75 (1 intron)
519C1  = seq 31+32+33 (2 introns)
519C2P = seq 46       (0 introns, partial sequence)
519D1  = seq 14+34    (2 introns)
519E1  = seq 17+84    (3 introns)
519F1  = seq 47+68+80 (2 introns)
519G1  = seq 58       (5 introns)
519H1P = seq 39+73    (3 introns and 50 nuc. insertion)
520A1    renamed 519F1
520B1    renamed 519G1
521A1  = seq 13       (1 intron)
522A1  = seq 18       (3 introns)
523A1    renamed 519H1P
524A1  = seq 91       (1 intron)
525A1  = seq 62       (
554A1  = seq 40       (1 intron)
555A1  = seq 76       (2 introns)
556A1  = seq 92       (3 introns)
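
The name-to-sequence table above is regular enough to parse line by line, for example to tally intron counts per family. This is a hedged Python sketch, not part of the original notes. The function name parse_entry and the returned field names are hypothetical. Lines without "=" (the "renamed" entries) are skipped, and entries with no readable intron count are kept with introns set to None rather than guessed.

```python
import re

# Finds the first "N intron" or "N introns" in the annotation.
INTRON_RE = re.compile(r"(\d+)\s+introns?")

def parse_entry(line):
    """Parse one 'NAME = seq ... (N introns ...)' line; return None for other lines."""
    if "=" not in line:
        return None                      # e.g. "520A1    renamed 519F1"
    name, rest = (part.strip() for part in line.split("=", 1))
    source = rest.split("(", 1)[0].strip()
    match = INTRON_RE.search(rest)
    introns = int(match.group(1)) if match else None
    return {"name": name, "source": source, "introns": introns}

entry = parse_entry("508C1  = seq 4+23+24+83 (4 introns)")
renamed = parse_entry("520A1    renamed 519F1")
```

Leaving missing counts as None keeps incomplete entries visible in any later tally instead of silently treating them as zero.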

CYP51 Seq 19 complete 26% to seq 8 40% to rice CYP51
MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIF
TLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLR
FVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSIN
KSLGQSGQLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTD
DSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEYFD
ETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKGKT
IPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFGIL
QIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK*
AU033519
AU060691
Contig7113
c-JAX4a244b11.s1 
JAX4b08b04.r1
JC1a178c02.r2
JC2b25d02.r1
JC2b119a09.s1
JC2e67h09.s1
sdic2Bf6.p1t
sdic2BF6.q1t
sdic6A53a12.q1t
sdic6A53b7.p1t
SLB124

>SLB124 (SLB124Q) /pub/dna_csm/LIBRARY/SL/SLB1-A/SLB124Q.Seq.d/
        Length = 1550

  Plus Strand HSPs:

 Score = 222 (83.2 bits), Expect = 5.6e-17, P = 5.6e-17
 Identities = 44/44 (100%), Positives = 44/44 (100%), Frame = +1

Query:     1 MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKN 44
             MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKN
Sbjct:   115 MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKN 246
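
In a protein query against translated DNA, each aligned residue covers three subject bases, so an ungapped HSP can be sanity-checked from its coordinates alone. In the SLB124 hit above, query residues 1-44 (44 aa) align to subject bases 115-246 (132 bases = 3 x 44). A minimal Python sketch of that check follows; the function name is hypothetical, and the check assumes an ungapped HSP.

```python
def subject_span_matches(query_start, query_end, sbjct_start, sbjct_end):
    """True if the subject nucleotide span is exactly 3x the query residue span.

    Assumes an ungapped HSP from a protein-versus-translated-DNA search.
    Works for either strand because only the absolute span is used.
    """
    residues = query_end - query_start + 1
    bases = abs(sbjct_end - sbjct_start) + 1
    return bases == 3 * residues

# Plus-strand SLB124 hit: Query 1-44, Sbjct 115-246.
plus_ok = subject_span_matches(1, 44, 115, 246)

# Minus-strand example taken from the CYP508A3 section of these notes:
# Query 21-80, Sbjct 192760-192581.
minus_ok = subject_span_matches(21, 80, 192760, 192581)
```

A mismatch does not necessarily mean an error; it can also indicate a gapped alignment or a copied coordinate that needs rechecking.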

CYP508A1 complete (old seqs. 1 and 2) 28% to seq 8 
on single contig with seq 22 and 21 CHR2.0.28372 2447-4459
MALFEIIISLFVVYIIHNA (0)
ISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKK
YGHILRVYMADMYTVVVSDPLLIREMYVDNSDIFTDRVKKP (0)
SVEHGTFYHGTVTSYGEHW
KNNREIVGKAMRKTNLKHIYELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSA
MFKFLFNHDIPEDEDINKGDTQKLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFK
DIMKYHREKYNEHLKTFDPDVERDLLDILIKEYGTDNDDKILSILATINDFFLA (1)
GVDTSSTALESMVLMLTNYPEIQEKAFDEIKTVVNGRSKVNLSDRQSTPYLVAVIKETLRYKPMSP
FGLPRSSSKDCMIGGHFIPKNAQILINYQALGMNEEYYENPEQFDPSRFLKVESNVAFLP
FSIGIRSCV (2)
GQSFAQDELYICISNILLNFKLKSIDGKKIDETEEYGLTLKTKNRFNVTLEKRII*
AU033852
AU034252
AU034703
AU037735
AU037979
AU053778
AU060139
AU060437
AU060796
AU071927
AU074155
C24684
C25600
C89925
C90052
C91122
C92049
C92378
C94043
C94448
Contig13138 
IIAFP1D59967
IIAFP1D72549
JAX4a63h05.r1
JAX4a63h05.s1
JAX4a134e03.r1
JC2a57e02.s1
JC2b54f08.r1
JC2b54f08.s1 = 80% to 508A1; not the same seq, no exact match to any other seq.
TWGTLYYITKFTISAIFKFLFNHDIPQDEDINKGDALKLMGPMS*VFQNTGIGTLFDVIN
ITLPLYLLYLKMFDQSFKDIIKYHTEKYNEHLKTFYPHVQTYLLHILIK*YGTDNYNKIL 408
SILAAINDFFL 441
JC2b181d11.s1
JC2c128a08.r1   
JC2c128a08.s1 
JC2d13d01.r1 
c-JC2d95b07.s1
JC2e48a02.s1
JC2e111b07.r1
sdic2Af2.q1t
sdic2Ce4.p1t
sdic2Ce5.q1t (92%)
SLA411 N-term
SLB521
SLC329
SLE212
SLJ668
SLJ729
SSA192
SSA247
SSC117
SSC755
SSE231
SSE715
SSG143
SSG323
SSG630
SSH119
SSJ655
SSK816
AFK388
AFI677
AFO867
AFL432
AFF717
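
The CYP508A1 sequence above is written as exon-sized pieces with a parenthesized digit, such as "(0)" or "(2)", at each intron position. The number of markers agrees with the 4 introns listed for 508A1, and the digit presumably records intron phase, though the notes do not say so. The following Python sketch, not part of the original notes, joins such lines into one protein string and records where each marker fell. The function name is hypothetical. It does not handle coordinate-prefixed lines or text annotations such as "(frameshift)".

```python
import re

# A single digit in parentheses, e.g. "(0)"; assumed to mark an intron position.
MARKER_RE = re.compile(r"\((\d)\)")

def assemble_protein(lines):
    """Join annotated sequence lines into one protein string.

    Returns (protein, markers), where each marker is
    (number of residues before the marker, marker digit).
    """
    pieces = []
    markers = []
    seen = 0
    for line in lines:
        pos = 0
        for match in MARKER_RE.finditer(line):
            run = "".join(line[pos:match.start()].split())
            pieces.append(run)
            seen += len(run)
            markers.append((seen, int(match.group(1))))
            pos = match.end()
        run = "".join(line[pos:].split())
        pieces.append(run)
        seen += len(run)
    # Drop the trailing "*" that marks the stop codon.
    return "".join(pieces).rstrip("*"), markers

# First two lines of the CYP508A1 entry.
cyp508a1_start = [
    "MALFEIIISLFVVYIIHNA (0)",
    "ISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKK",
]
protein, markers = assemble_protein(cyp508a1_start)
```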

CYP508A2 Seq (22+45+69+86+88) Complete sequence from c-JC2d95b07.s1 
62% to CYP508A1
on single contig with seq 508A1 and 21 CHR2.0.28372 5025-6821
5025 MIFGIIGYLFLIYILHNA 5078 (0) 
5195 YSKYKRLNENQLPGPFPIPILGNIYQLTNL
     PHFDLTKMSEKYGKIFRIYLADLYTVIVCDPIIARELFVDKFDNFIDRPKIP 5440 (0)
5543 SVKHGTFYHGTVASMGDNW
KNNKEIVGKAMRKTNLKHIYQLLDDQVDVLIESMRTIESSGETFDPRYYLTKFTMSAMFKYIFNEDISKDEDVH
NGQLAQLMKPMQKVFKDFGTGSLFDVLEITRPLYFLYLEWFTSHYYQVINFGKMKIYKHLETYKPDVQRDLMDLL
IKEYGTETDDQILSISATVSDFFLAGVDTSATSLELIVMMLINYPEYQEK
AYNEIKSALSSNGGGGGGGLTQRNKVLLSD
RQSTPFVVSLFKETLRYKPISPFGLPRSTTSDIILNNGQ
FIPKNAQILINYHALSRNEEYFENPNQFDPTRFLNSDSNPAFMPFSIGPRNCV 6562 (2)
6663 GSNFAQDEIYIALSNMILNFKFKSIDGKPVDETQTYGLTLKPNPFKVILEKRK* 6824
c-JC2d95b07.s1 22366 letters 
CHR2.0.28372 5025-6821
Contig13205
IIAFP1D1875
IIAFP1D80337
IIAFP2D50103 
IIAFP2D52521
IIAGP1D0079
JAX4b61d06.s1
JAX4c01f11.s1
JC1a68e02.s2
JC2a57e02.r1
JC2a66h06.s1 (this may be a typo; check for .r1)
JC2a66h07.s1 N-term
JC2a81b01.r1
JC2a162c10.r1
JC2a162d02.r1
JC2a193f03.s1
JC2b21g02.r1
JC2b76b01.r1 
JC2b76b01.s1 
JC2b322g02.r1
JC2b363f02.r1
JC2c04g01.r1 formerly seq 86
JC2c11e11.s1
JC2c11e11.r1
JC2c123h07.r1
JC2c123h07.s1
JC2c157c02.r1
JC2c157c02.s1
c-JC2d95b07.s1
JC2e48a02.r1 N-term
SLD887
VFH640
VFL109
VFL314
VFG480

CYP508A3 seq 21+67+72+82+87 complete 57% to 508A1 408aa
on single contig with seq 22 and 508A1 CHR2.0.28372 --1065
The N-terminal region is badly frameshifted but was reconstructed 
based on seq 22, 45, 69. It is still missing 26 aa, but the reconstruction 
matches contig 5911. 
MEFLKLILFLIIFYIIHNT (0) 
YIKFKKINKNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLS
DPILIREMFVNNGDYFLDRPKIPSIRHATHYHGIA (1)
TSSGEYWLKIRDIINKAMRKTNLKLIYDSLDQQVDNLIESMNKIESDG
QVFEPRIYFKKYTMAAMYKFIFNEEINFNNEISELIGPIEQVFKDLGSGSLFDVLLISRPLYYQWIEHTDKNYPK
ILNFLKKKYHQHLKTYNPEIQRDLLDLLIKEYYSGSDDDILTIIATINDLFLA (1)
GTDTSSASLEYMVMMLVNYPEIQEK 
VYDEIKLTVNGRNKVLLSDRQFTPYTVSFIKETLRYKPPSSVGVPRTTSQDIIIGDKFIP
KDAQIFINYYGLSRNQDYFENPEQFEPSRFMNPDTNIAFLPFSIGTRNCV (2)
GQNFALDEMFLAFSNIILNFKFSSIDGKQIDETELYGVTLRCKNKFNVSIKKRI* 
chr2 28372 929-1605
Contig5911 chr 2
Contig13310 chr 2
IIAFP1D65605
JAX4a231g12.r1
JC1a148h05.s1
JC1b30e03.r1 
JC1b215d01.r1
JC2a31d03.s2
JC2a114d02.r1
JC2b191g09.r1
JC2b306d10.s1 
JC2b306d10.r1 
JC2c166b04.s1
JC2d13d01.s1 
JC2d95b07.s1 
c-JC2d95b07.s1
JC2e11e01.r1
JC2e73f10.s1
JC3a25h05.r1
IIAGP1D19240

>_4
                              NFFFFIFLLCPISPLFLYKN*LLLLFYLYYN*LLLLLLLLLLLLLLLLFFFFFFFD*YLT
                              IVKGYLLGFHLLIIY*DFFIGWYRILIKKKKKKMIKKKYFIFIKNMLKK*KRI*YKKTIC
                              LCVF*LYN*FFIINWFF**IKILIRTTD*LNKIEKKKKKKKKKKKKKKKKKKKRWHLIF*
                              D*PQDFFLKIN*FFYF*FL*FFFFYFLFFIYYIILKKKKKKIYCNKTTSKT*WSF*N*YC
                              F**FFI*SIILFVFFL*F**KKKKKKDNNNNYTFF*TQIW*NLNN*FFFFFFFFFFFFLE
                              IKSIFISILNLKK*IKMN*KDQFQFQF*EIYINLQVVYHTEI*LKYLKNMVEFIDFGLLI
                              >_5
                              FFFFYFSPLSHFPSFFI*KLIIIIILFIL*LIIIIIIIIIIIIIIIIIFFFFFF*LIFDN
                              CQGIFIGFSFVNYILRFFYWVV*NFN*KKKKKNDKKKIFYFYKKHVKEVKKNLIQKNNLF
                              MCILII*LIFYY*LVFLIDKDFNSYHRLIK*N*KKKKKKKKKKKKKKKKKKKKVAFNFLG
                              LTPRFFFENKLIFLFLIFVIFFFLFFIFYLLYNFKKKKKKNLLQ*NYK*NIMEFLKLILF
                              LIIFYIIHNTVCIFSLILIKKKKKKG****LYFFLNTDMVKFK*LIFFFFFFFFFFFFGN
                              *INFYQYIKFKKINKNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADX
                              >_6
                              IFFFLFFSSVPFPLFFYIKINYYYYFIYIIINYYYYYYYYYYYYYYYYFFFFFFLINI*Q
                              LSRDIYWVFIC*LYIKIFLLGGIEF*LKKKKKK**KKNILFL*KTC*RSKKEFNTKKQFV
                              YVYFNYIINFLLLIGFFNR*RF*FVPQIN*IKLKKKKKKKKKKKKKKKKKKKKGGI*FFR
                              IDPKIFF*K*INFFIFNFCNFFFFIFYFLSII*F*KKKKKKFIAIKLQVKHNGVFKINIV
                              SNNFLYNP*YCLYFFFNFNKKKKKKRIIIIIILFSKHRYGKI*IINFFFFFFFFFFFFWK
                              LNQFLSVY*I*KNK*K*IKRTNSNSNFRKFISTYKWFTTQRFN*NI*KIWWNL*ILVC*X

>JC2a86a02.3396 541398 letters
        Length = 541,404

  Minus Strand HSPs:

 Score = 484 (175.4 bits), Expect = 1.3e-45, P = 1.3e-45
 Identities = 90/92 (97%), Positives = 90/92 (97%), Frame = -3

Query:     21 YIKFKKINLNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVWS 80
              YIKFKKIN NELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVV S
Sbjct: 192760 YIKFKKINKNELKGPIPIPILGNLYQLTSGLPHRDLTKISEKYGGIYRFWFADLYTVVLS 192581

Query:     81 DPILIREMFVNNGDYFLDRPKIPSIRHATHYH 112
              DPILIREMFVNNGDYFLDRPKIPSIRHATHYH
Sbjct: 192580 DPILIREMFVNNGDYFLDRPKIPSIRHATHYH 192485
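
The "Identities = 90/92 (97%)" line in the HSP above gives the raw counts as well as a percentage, and 90/92 is about 97.8%. The percentages reported in the BLAST output in these notes are all consistent with rounding down rather than rounding to nearest. The Python sketch below, not part of the original notes, extracts the counts and recomputes the percentage that way; the function name is hypothetical.

```python
import re

IDENT_RE = re.compile(r"Identities = (\d+)/(\d+) \((\d+)%\)")

def identity_percent(line):
    """Return (matches, aligned length, recomputed percent, reported percent).

    The recomputed value uses integer division, i.e. it rounds down.
    """
    found = IDENT_RE.search(line)
    if found is None:
        raise ValueError("no Identities field in line")
    matches, length, reported = (int(group) for group in found.groups())
    return matches, length, 100 * matches // length, reported

hsp_line = " Identities = 90/92 (97%), Positives = 90/92 (97%), Frame = -3"
matches, length, recomputed, reported = identity_percent(hsp_line)
```

Comparing the recomputed and reported values is a quick way to spot a count that was mistyped when an alignment was pasted in by hand.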

CYP508A4 seq (66+70) complete 55% to CYP508
MIMLIKVFVLLLVVYILHNS (0)
YKKYKKLDKNELKGPTPIPVLGNLHQLSSLPHRDLSKMTKDYGDIFRVWFADL (2)
YTVVISDPVLIRKIYVENHESFRDRPKIP (0)
SMKYGTYYHGTAASMGEDWVRNRGIV SSAMRKSNIKHIYEVINNQVDVLMSTMKK
YEKRSEPFEPRYYMTKYTMAAMFKYIFNEDIGEDEDIHTGEIQKIMGPMNQVMEDF
GTGSLFDVLEISQTFYLKWLELTEKNFPLLLKFFNGRYEQHLETIKPESPRDLLDI
LINEYGTNTHDDYLNIASTVLDFFFAGTDTSSTTLEYLFLMMANYPEIQDKVHQEV
KSYLKQIGKDKVELNDRQSLPYVVAVIKETLRFKPVTPFGVPRSCVNEITIDEKYF
IPKGAQVIINYPSIFENEKYFKNANQFDPSRFLQTTTTNTASNEESSFNSNLAFIP
FSIGPRNCVGMQFAQDELFLAFANIVLNFTIKSVDGKKIDETISYGVTLKPKTRFK
VLLEKRLI
Contig11215  Chr 2
IIAFP1D53115
IIAFP1D83733 poor quality seq
IIAFP1D83741
IIAGP1D11636
JC1b242d09.s1
JC1c118b04.r1
JC1c167b01.s1
JC2b119f08.r1
JC2b119f08.s1 KYG region
JC2b220e03.r1 
VSG666
AFH195
AFK530
VFK340

>VFK340 (VFK340Q) /pub/dna_csm/LIBRARY/VF/VFK3-B/VFK340Q.Seq.d/
        Length = 681

  Plus Strand HSPs:

Supports N-term exon

Query:     1 MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLS 48
             MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLS
Sbjct:    24 MIMLIKVFVLLLVVYILHNSYKKYKKLDKNELKGPTPIPVLGNLHQLS 167

CYP508B1 Seq. 15, 77  complete 50% identical to 508A1
MLFNIFLYLFIFCIVSSG (0)
IKKYKKIHKNELSGPFPIPLLGNLHQLGKE
PHYTLTKMHNVYGEIFRLHFGDVYTVVVSDPILIREMFVDNHENFKYRPLLPTFKFGAGG
DHGLSLSNER WERTRELVQNAMKRTSIKKIYDMLDSQVYELIKSMKQYQVTGNPFEIHLY
AQRFTLSIMFKYIFNEDISYDEDITKGKIAELVQPMDQIFKSLGSGKLGDFISIAQPFYY
QWLKFSDKQFNGPFSTVKKFIYKRYLEHINTIDHDNPRDLMDLLINEFSNDKNLIPTILQ
TSLDMFLAGN (0)
DTTAASIIWFVLRMQEHPEIQLKAYNEIKDAVGDRDRVLLSDRPKTPYLN
AIIKEILRLNPVGPFGLPHRSSNDIVIGNGKYFIPKDSQILVNYRGLGFNEKYFENPSQF
DPSRFLNKNNDAYMPFGVGDRKCVGLQLAGDEQYLSFSNILLNFNLKNIGTPVSDYEEFG
LTLKPNKFKVLLESRK*
AU038895 *
AU074803 *
IIAAP1D5220 *
IIAFP1D67917 *
IICBP1D42314 
JC1b221g12.s1 *
JC2a236e12.r1 *
JC2b149h06.s1 *
JC3a263c07.s1 25685 letters
c-JC2b149h06.s1 *
JC2d41b07.s1 *
SSM113
FC-IC0188 EST
Length = 635

Query: 15236 LVTLKNIFFFF--FFFFFLVGKN 15298
             +V L NIF FF  FF +  V KN
Sbjct:     1 MVYLNNIFIFFIIFFIYSFVKKN 23

>_1
PILDYSIPFI**I***IHLFFFFYLFIYKDLWTEKKKKDNNEKKKKK**KKKKKKNIFIF
IFFFIFLFFNFSQRP*FPFFISF*FILPQVDFLKKK*KIKKKKIYLKNQDNKFFFNFFLG
LGFKKKNKKKK*KIKLKLLQY*KYPYPRIKLTIDTSE*KFKTLRILRWLKYSSNIKKIIN
*KSLDINFIN*SIIAHQEFGDVKKYFFFFFFFFFFGWKKLFLGFFFESF*DILNLNSPKK
KKKKMFINFYF*IFLFLFFFIVIFYFLFFIFLLLFFLII*KKKK*INIIKNVI*YIFIFI
YILYC*LRRMYFTFIFF*KKKNILIKKKKKKKNRLKNIRKFIKMNYQDHFQFHYLEIYIN
>_2
QFWIIVYHSFDEYDDESIFFFFFIYLFIRTYGLKKKKKIIMKKKKKNNKKKKKKKIFSFL
FFFLFFYFLIFPKDLNSLFLYLFSLFYPRSIS*KKNKK*KKKKFI*RTKIINFFLIFF*G
*GLKKKIKKKNKKLN*NFYNIKNIHTHA*N*QLTQVSRNLKH*EY*GG*NIQAISKR*SI
KKVWISILLINQL*PIKNLVTLKNIFFFFFFFFFLVGKNYFWVFSLNPFKIF*ISTHPKK
KKKKCSSIFIFKFFYFYFFLLLFFIFYFLFFYCYFF*SSKKKKNK*I**KMLFNIFLYLF
IFCIVSSGVCILHLFFFKKKKIY*LKKKKKKKID*KI*ENS*K*IIRTISNSITWKFTS
>_3
NFGL*YTIHLMNMMMNPSFFFFLFIYL*GPMD*KKKKR***KKKKKIIKKKKKKKYFHFY
FFFYFFIF*FFPKTLIPFFYIFLVYFTPGRFPKKKIKNKKKKNLFKEPR**IFF*FFFRA
RV*KKK*KKKIKN*IKTFTILKISIPTHKTNN*HK*VEI*NIKNIEVVKIFKQYQKDNQL
KKFGYQFY*LINYSPSRIW*R*KIFFFFFFFFFFWLEKIIFGFFL*ILLRYFKSQLTQKK
KKKNVHQFLFLNFFIFIFFYCYFLFFIFYFFIVIFFNHLKKKKINKYNKKCYLIYFYIYL
YFVLLAPAYVFYIYFFLKKKKYTN*KKKKKKK*IKKYKKIHKNELSGPFPIPLLGNLHQ

JC3a263c07.s1 25685 letters extends N-term
Query: 15642 IKKYKKIHKNELSGPFPIPLLGNLHQLGKEPHYTLTKMHNVYGEIFRLHFGDVYTVVVSD 15821
             I KYKKIH NEL GP PIP+LGNLHQ G+ PH  LTKM   YG I R++  D+YTVVVSD
Sbjct:    20 ISKYKKIHVNELCGPTPIPILGNLHQFGELPHRVLTKMTKKYGHILRVYMADMYTVVVSD 79

Query: 15822 PILIREMFVDNHENFKYRPLLPTFKFGAGGDHGLSLS-NERWERTRELVQNAMKRTSIKK 15998
             P+LIREM+VDN + F  R   P+ + G    HG   S  E W+  RE+V  AM++T++K 
Sbjct:    80 PLLIREMYVDNSDIFTDRVKKPSVEHGTFY-HGTVTSYGEHWKNNREIVGKAMRKTNLKH 138

Query: 15999 IYDMLDSQVYELIKSMKQYQVTGNPFEIHLYAQRFTLSIMFKYIFNEDISYDEDITKGKI 16178
             IY++LD QV  LI+SMK  + +G  F+   Y  +FT+S MFK++FN DI  DEDI KG  
Sbjct:   139 IYELLDKQVDVLIRSMKSIETSGKTFDTRYYITKFTMSAMFKFLFNHDIPEDEDINKGDT 198

Query: 16179 AELVQPMDQIFKSLGSGKLGDFISIAQPFYYQWLKFSDKQFNGPFSTVKKFIYKRYLEHI 16358
              +L+ PM ++F++ G G L D I+I QP Y  +L+  D+ F      + K+  ++Y EH+
Sbjct:   199 QKLMGPMSEVFQNAGRGSLFDVINITQPLYLLYLEMFDQSFKD----IMKYHREKYNEHL 254

Query: 16359 NTIDHDNPRDLMDLLINEFSNDKN-LIPTILQTSLDMFLAGNVCTS 16493
              T D D  RDL+D+LI E+  D +  I +IL T  D FLAG V TS
Sbjct:   255 KTFDPDVERDLLDILIKEYGTDNDDKILSILATINDFFLAG-VDTS 299
           
CYP508C1 Seq 4,23,24,83 (complete) 42% to seq 66; N-term is short, may be missing real 1st exon
MGPWG (0)
YIKNKRIHKNEAKGPIGFPLIGNMIQIGKTKPHIELMKLEKIYNQRILKIWLGDYYSVFLSDIDLIKDIFINKFE
NFSSRPKSPLTRLGTNDFRGINGSSGETWFKNKNIIVNAMKRANTKTIYTLLDNQVNDLIKEISKFESQNKS (0)
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEILETPYYYY
LQKTDKVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEYGTTDDDVLNITQVTLDMFMAGTDT (1)
TANTLEWIIIKLCNSPIHQEIAYNELKKVVSSKVIIDDSIKREITLS
DRPNTPYIQAIIKETMRMHPVVVFGLPRYCENDIFIGDENYFIPKG (0)
CKVFINFHSIGYNEKYFKDPYKFEPNRFLENSNNSMDSFFPFGLGNRVCLG
RQLANDQLYLVIANLILKYKLKTIDENKINEDGIFGLTVSPNKYKINLESR*
C84082
C90646
Contig8310 (3387-1715) minus strand
IIACP2E3219
IIAEP1D1344
IIAFP1D46071
IIAFP1D67346
IIAGP1D6549
JAX4a27b02.s1 
JC1c281f06.s1 
JC2b06a12.r1 no exact match to any other seq
RYYMTKYTMAAMFKYIFNEDIGEDEDIGKIMGPMNQVMEDFGTGSLFDVLEISQTFYLKWLELTEKNFPLLLK
FFNGRYEQHLETIKPESPRDLLDILINEY
JC2b120c09.r1 
JC2b120c09.s1 Seq 24, 46% to seq 21; is this from a cluster?
KTIDIENPRDLLDLLIIEYGDHSDENMILIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQD
sdic6A4b6.q1t
sdic6A87h12.p1c
sdic6B16c2.p1c possibly seq 4 with frame shift and bad seq
FNPKYYFRKFVLSTMFKYIFNEDVPYDENLENGKLSELTMEMENIFKTLKVGKLANSIEI 447
LETPYYYYLQKTDEVFKNIKKLIIEKYKNHNLSINPEKPRDLLDILINEYGTTDDDVL 621
SSC732
SSI265
SFG492

CYP508D1 Seq (59+81) 42% to seq 4  483 aa complete
MVYLNNIFIFFIIFFIYSF (0)
VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMD
FYHKMYGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSS (2)
RPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNAMKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES (0)
FQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFFKILQPLYYQYLLYRGGCFNRIRTLIRNR
YIEHRKTIDIENPRDLLDLLIIEYGDHSDENMISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTL
NDRPSTPYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFI
PKDSMVLINFYSLGRNPKDFPDPLKFDPNRFIGSTPDSFMPFGTGPRNCI (2)
GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR*
Contig8310  Chr 6
IIADP1D3779
IIADP2D3779 note: 90% to same clone, opposite read; error rate is about 10%
IIAFP1D13883
IIAFP1D54156
IIAFP1D67079
IIAFP2D44284
JC1a137g04.r1
JC1a231c03.r1
JC1b152d10.r1
JC1b152d10.s1
JC1b218a10.r1
JC1c260b08.r1
AFJ424 (N-term)
CFB272

Query:   164 ESFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFF 223
             + FQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFF
Sbjct:  3408 KKFQPDMYLRKYALTTMFKYVFNETVSFENKIVQGEEAELIDNINEAFNFMTLGNAGDFF 3229

Query:   224 KILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTIDIENPRDLLDLLIIEYGDHSDENM 283
             KILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTIDIENPRDLLDLLIIEYGDHSDENM
Sbjct:  3228 KILQPLYYQYLLYRGGCFNRIRTLIRNRYIEHRKTIDIENPRDLLDLLIIEYGDHSDENM 3049

Query:   284 ISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTLNDRPST 343
             ISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTLNDRPST
Sbjct:  3048 ISIVQVCFDVILAGVDTLASSLEWFLVMLCNNQQIQDTVYNELKETVVGPVVTLNDRPST 2869

Query:   344 PYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFIPKDSMVLINFYSLGRNPKDFPDPL 403
             PYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFIPKDSMVLINFYSLGRNPKDFPDPL
Sbjct:  2868 PYTMACIKETIRLKAPAPFGLPHTTDQDIIVKGHFIPKDSMVLINFYSLGRNPKDFPDPL 2689

Query:   404 KFDPNRFIGSTPDSFMPFGTGPRNCI 429
             KFDPNRFIGSTPDSFMPFGTGPRNC+
Sbjct:  2688 KFDPNRFIGSTPDSFMPFGTGPRNCM 2611

 Score = 426 (155.0 bits), Expect = 3.8e-214, Sum P(3) = 3.8e-214
 Identities = 87/108 (80%), Positives = 91/108 (84%), Frame = -1

Query:     3 YLNNIFIFFIIFFIYSF-VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKM 61
             + NN F + +I  I+   VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKM
Sbjct:  4094 FYNN-F*WILIDLIFLI*VKKNKKSTKENDLKGPIALPIIGNLFGLRNDTYSIMDFYHKM 3918

Query:    62 YGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSSRPFLPTITFGSF 109
             YGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSS  F   I F  F
Sbjct:  3917 YGGIYRLWFGDYFVVSLNDPEIIREIFIKNYSNFSSFVFKN*IIFFFF 3774

 Score = 361 (132.1 bits), Expect = 2.8e-207, Sum P(3) = 2.8e-207
 Identities = 74/93 (79%), Positives = 78/93 (83%), Frame = -3

Query:    73 YFVVSLNDPEIIREIFIKNYSNFSSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNA 132
             +F +  N    I  IF+     + SRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNA
Sbjct:  3777 FFFLKNNLLIFILLIFLIFLIIYHSRPFLPTITFGSFNYRGISGSNGDYWKRNRNLLLNA 3598

Query:   133 MKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES 165
             MKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES
Sbjct:  3597 MKKSNLKQTYDNLSDSVNSLINLMKEFQSSNES 3499

 Score = 272 (100.8 bits), Expect = 3.8e-214, Sum P(3) = 3.8e-214
 Identities = 53/53 (100%), Positives = 53/53 (100%), Frame = -1

Query:   430 GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 482
             GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR
Sbjct:  2534 GQALGMDQIYLLLSNIFLNFKITSENGKKLDDTDYVSGLNLKPAKYKVCLEKR 2376

>Contig_0725, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion
TTTTTTTTTTTTAAAATTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAGGCCCCATGGG
GCCCCTGGGGGGTTTTTTTTTTTTTTGGAAATTTGGAAATTTGGAAAAAATCCAATTTTT
TTTTTTAAAAAAATTTTTTTTTTAAAAAAAAAAAAAAAGGCCCCAAAAAAAAAAGGGGGA
AAATTTAAATTATTAAAACCCTTTTACCCCCTTTTTTAAAAAATTTTTTTTTTAAATTTT
TTTTTTAAAATTTTTTTTTTTTTTTTCCCCCCCAACCAATTAAAAAAAAAATTTTCTTTT
AAAACCCCCCCAGGGAATTTTTAATTTTTTCCCGGTTTTTTTTTTAAATTTTTTAAATCC
CTCCCTTGGGAAAAATTTAAAAACCTTTTTTTAAAAAAAAAATTTAAAACCAAAATTTTT
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT
ATTAGTATATTAAAAATAAAAGAATTCATAAAAATGAAGCAAAAGGTCCAATTGGGTTCC
CATTAATTGGTAATATGATTCAAATTGGTAAAACAAAGCCACATATTGAATTAATGAAAC
TTGAAAAAATTTATAATCAAAGAATTTTAAAAATTTGGCTTGGAGATTATTATTCAGTTT
TCTTATCAGATATTGATTTAATAAAAGATATTTTCATAAATAAATTTGAAAATTTTTCAT
CAAGACCAAAATCTCCATTAACTAGACTTGGTACAAATGATTTTAGAGGTATTAATGGTT
CATCAGGTGAAACATGGTTTAAAAATAAAAATATTATTGTAAATGCAATGAAAAGAGCAA
ATACAAAAACGATCTATACTTTATTGGATAATCAAGTAAATGATTTAATAAAAGAAATTT
CAAAATTTGAATCACAAAATAAATCAGTATGTTTAATTAATAATACAAATATTATTAAAA
AAACTAATTTTTTTTTTTTTTTTCTTTTTTTTTTATATATAGTTTAATCCAAAATATTAT
TTTAGAAAGTTTGTTTTATCAACAATGTTTAAATATATTTTTAATGAAGATGTACCATAT
GATGAAAATTTGGAAAATGGTAAATTATCAGAATTAACAATGGAAATGGAAAATATTTTT
AAAACTTTAAAAGTTGGGAAATTAGCTAATAGTATAGAAATTTTAGAAACTCCATATTAT
TACTATTTACAAAAAACTGATAAAGTATTTAAAAATATTAAAAAATTGATTATTGAAAAG
TATAAGAATCACAATTTATCTATAAATCCTGAAAAACCAAGAGATCTTTTAGATATTTTA
ATTAATGAATATGGTACCACTGATGATGATGTTTTAAATATTACTCAAGTAACTTTGGAT
ATGTTTATGGCAGGAACAGATACAAGTAACTATACTATACTATACTAAATAAAAATTAAA
TTATATTATTTATAATTTTCTAAAATTTTCTAAAATATTCTAAAATTATAATTACAGCTG
CCAATACTTTAGAATGGATAATCATTAAACTTTGCAATAGTCCAATTCATCAAGAAATAG
CATATAATGAATTAAAAAAGGTAGTATCTTCAAAAGTGATAATTGATGATTCAATTAAAA
GGGAAATAACATTATCAGATAGACCAAATACACCATATATTCAAGCAATCATTAAAGAAA
CCATGAGAATGCATCCTGTTGTTGTATTTGGTTTACCAAGATATTGTGAAAATGATATAT
TTATTGGTGATGAAAATTATTTCATTCCAAAAGGAGTATGTATAATAATAATTATATTTT
TAAATTATGTATGATAATAATAATAATAATAATAATTTTTTTTTTTTTTTTACAGTGTAA
AGTTTTCATTAATTTTCATTCAATTGGGTATAATGAAAAATATTTTAAAGATCCATATAA
ATTTGAACCAAATAGATTTTTAGAAAATTCAAATAATTCAATGGATTCTTTTTTTCCATT
TGGTTTAGGTAATAGAGTTTGTTTGGGTAGACAATTAGCAAATGATCAACTTTATTTGGT
AATTGCAAATTTAATTTTAAAATATAAATTAAAAACAATTGATGAAAATAAAATTAATGA
AGATGGTATTTTTGGTTTAACTGTTAGTCCAAATAAATATAAAATTAATTTAGAATCAAG
ATAAATAGTACACACACCCACACACACACACATATATATATATATCTATAAATTACACAA
AAGTAAAAATTAAAAAAATAAAAAAAAAATAAAAAAATAAAAAAATACAAATGTATAATA
ATTTAATAATTTATTTAATTGATTTTTTTATATTTAAATTTTTTTTTTTTTTTTTTTACA
ATAAAATTATAAAAAAATAGATAAATAATGATTTATCTTTTTTCAAGACAAACTTTATAT 2400
TTTGCTGGTTTGAGATTTAATCCTGAAACATAATCAGTATCATCAAGTTTTTTACCATTT
TCAGAAGTAATTTTAAAATTAAGGAAAATATTTGAAAGAAGTAAATAAATTTGATCCATA 
CCTAATGCTTGACCACTAAAAAAAAATTAAATTAAAATAATATTAATATTAATTATAAAA 2580
AAAAAAAAAAAAAATTAAAAAAAAAACTTACATGCAATTTCTAGGACCAGTACCAAAAGG
CATAAAGGAATCAGGTGTAGAACCAATGAATCTATTTGGATCAAATTTAAGAGGGTCAGG
GAAATCTTTTGGATTTCTACCTAATGAATAAAAGTTTATAAGTACCATTGAATCTTTTGG
AATAAAGTGACCCTTAACGATTATATCTTGATCAGTTGTATGAGGTAAACCAAATGGAGC
AGGTGCTTTTAATCTTATGGTTTCTTTAATACATGCCATTGTATATGGTGTTGATGGTCT
ATCATTTAATGTTACCACTGGTCCAACAACAGTCTCTTTCAATTCATTATAAACAGTGTC
TTGAATTTGTTGATTATTGCATAACATTACCAAGAACCATTCTAATGAACTTGCCAATGT
ATCAACACCGGCCAAGATTACGTCAAAACAAACTTGAACGATTGAAATCATATTCTCGTC
AGAATGGTCACCATATTCAATGATTAACAAGTCTAATAAGTCTCTTGGGTTTTCAATGTC
TATGGTTTTACGATGTTCAATATATCTATTTCTTATTAATGTTCTAATTCTATTGAAACA
GCCACCTCTATATAAAAGATATTGGTAATATAATGGTTGCAAAATTTTAAAGAAATCACC
TGCATTACCAAGTGTCATAAAATTAAATGCTTCATTTATATTATCAATTAATTCAGCTTC
TTCACCTTGTACAATTTTATTTTCAAATGATACAGTTTCATTAAATACATATTTAAACAT
AGTTGTTAATGCATACTTTCTTAAATACATATCAGGTTGAAACTTTTTAAAAAAAAAAAA 3420
AAAAAAAAAAAAAAAAAAAAAAAATTTAATTAAAGAATAAAAAAAAAAGGATTTAAAATT 3480
CTTGTTGTAAATACATACACTTTCATTTGAAGATTGAAATTCTTTCATTAAATTAATTAA
AGAATTAACAGAATCACTTAAATTATCATATGTTTGTTTTAAATTTGATTTTTTCATTGC 3600
ATTTAAAAGTAAATTTCTATTTCTTTTCCAATAATCACCATTTGATCCTGAAATTCCTCT
ATAATTAAATGATCCAAATGTTATTGTTGGTAAAAATGGACGACTATGGTAAATAATTAA
AAAAATTAAAAAAATTAATAATATAAATATTAGTAAATTATTTTTTAAAAAAAAAAAAAA 3780
AAAAATATAATTTAATTTTTAAATACAAACGATGAAAAATTTGAATAATTTTTAATAAAT
ATTTCTCTAATTATTTCTGGATCATTTAAAGAAACAACAAAATAATCACCAAACCATAAT
CTATAAATTCCACCATACATTTTATGATAAAAGTCCATAATTGAATATGTGTCATTTCTT
AAACCAAATAAATTACCAATAATTGGTAAGGCTATTGGCCCTTTTAAATCATTTTCTTTA
GTGCTTTTTTTATTTTTTTTTACCTAAATTAAAAAAATTAAATCAATTAATATCCATTAA
AAATTATTATAAAATTATAAAATTAATAATATTTTCCAATGAATAAATAAAAAAAA
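
The frame translations in these notes (labelled _1 to _6) can be reproduced from contig DNA such as the sequence above. The following is a minimal Python sketch using the standard genetic code, not part of the original notes. The function names are hypothetical. Numbering frames 1-3 for the plus strand and 4-6 for the reverse complement, and writing "X" for a partial or unrecognized codon, are assumptions chosen to resemble the output shown here.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def translate(dna):
    """Translate from the first base; unknown or partial codons become 'X'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna), 3))

def six_frames(dna):
    """Return translations keyed 1..3 (plus strand) and 4..6 (minus strand)."""
    rev = dna.upper().translate(COMPLEMENT)[::-1]
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[offset + 4] = translate(rev[offset:])
    return frames

# Bases 21-59 of the contig line beginning "ATTAGTATATTAAAAATAAA" above.
fragment = "AGAATTCATAAAAATGAAGCAAAAGGTCCAATTGGGTTC"
peptide = translate(fragment)  # "RIHKNEAKGPIGF"
```

The translated fragment matches the RIHKNEAKGPIGF stretch near the start of the CYP508C1 protein listed earlier in these notes.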



CYP508E1 seq (71+85) complete seq 38% to 508A1
MNIINIVIFLIIFYLFKSN (0)
YKKYIKKNKFEVNGPFPLPIIGFTSNYIKPHIKCHELVKQYGDIFRV
YLGDNKTIMVSDYKIIEELFIKNHNSFLERPITPSF
SHCSDNQNGILLSNEKWVNNREMIQKAMKKGIVKSVYELLNKQINDLIQSVKPFSESG 
EPLNFRLFATRFTLSTMMTYIFNEPLSYNENLENGTVLYMF (FRAME SHIFT)
LENIFVLADVGHIGDYISFLKPIYSLFLKLTDSNVPLARDYVYKKFYEHLKTIDNEKEE (2)
NYDLLHTFIKEVGIKDKESVKSIVSNSLDLLVAGTDTAAKGIEWIILRIANNQDVQELIFKELKSIVDSCGR
VEDRIYLSDRSSTPYLNATIKESMRITPITPYGIPRVVGKDLIIKG
HFIPKGSHVVINYRALNHDESIYKNPNQFNPNRFLDNNIESFIPFSIGNRNCVGQQIANH
ELFLFVSNFILNYKITPITNEPIDTTENFGINIRPNSFKINVYKRN*
CHR2.0.4511 
JAX4a63e05.s1
JAX4a63e05.r1
JAX4a246e10.s1
c-JAX4a246e10.r1
JC1a11b06.r1
JC1a65c04.s2
JC1a181f06.r1
JC1a195c01.s1 
JC1b01e02.s1
JC1b01e02.r1
JC1b117h11.r1 
JC1b117h11.s1 
JC1b128d04.r1 
JC1b231a04.r1 
JC1b231a04.s1 
JC1c88c05.s1  
JC1c110e07.s1
JC1c218c07.r1 
JC2d96a12.r1
sdic6A95c3.q1t

CYP513A1 Seq. 8+53 complete same as 8b 
MNYLVGLVLIFTIFYFFLQKNDKNMNSKIPGPKGIPILGNLLSMKGDLHLKLQEWYKQYG
VIYRIKMGNVETVVLTEYPIIREAFIGNSNSFVNRFQRKSRLKLNNGENLVIVNGDIHNK
LKTLVLSEMTNQRIKKYETSFIDNEIK KLFKVLDEHADTGKPIILNNHIKMFSMNIVLCF
TFGLNYSYPYDEFEKASEFIKLMVEFFNIAGQPIISDFIPSLEPFIDTSNYLNTYKRIFN
YTSDLISKFKNENEIHNNINDNNKSLADKPILSKLLQSFENGEISWDSVVSTCIDLQTAG
ADTSANTILYCLLELINNPNIQSKVYDDIKQAIIQSKENENQNDNENQEQTEEIITLSFN
KYRTLAPYLSMVVKETFRKYPSGTIGLPHVTSEDVELNGYKICAGTQIIQNIWATHRNEK
QFSEPDSFIPERFISQQQSANSNLIHFGCGVRDCIGKSLADSEIFTMLASLINRYEFTNP
NPSTPLNEIGKFGITYSCPENKIIIKKRF*
AU039107
AU039893
AU061936
C90256
C91045
Contig15093
IIAFP1D86342
IIAGP1D0885
JAX4a50g04.r1
JAX4a50g04.s1
JAX4a69d01.r1
JAX4a223f01.r1
JAX4b04a08.r1
JAX4b04a08.s1
JC2a07a03.r1
JC2a86a12.s1
JC2b18g07.r1
JC2b18g07.s1
JC2b46a06.r1 N-term seq 8b
JC2b46a06.s1 89% identical to seq 8
c-JC2e41d02.s1
SLG684
SSI504
SSJ415
SSM394
SFH684
AFK177

CYP513A2P seq 38 complete 54% to seq 513A1; probable pseudogene, no ESTs
Processed pseudogene: the only intron in 513A1 is removed in 513A2P
MNFLIILINIIIVLTTIIFLK (frameshift)
KIIKKKNKYIPGPIGLPILGNLLSLKG (frameshift)
ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQ 
KERNNCENVLLANGEMF (28aa deletion)
KLFKELDNLAEIGEPIILNRY (frameshift)
IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF 
SKNVNIEEWDDFFYINSNKYIEKFKSGKNSKQSKTENK
PIISKLLSLYENGEISWASVIGSCIDMQASSTDTIMFCLIELTNLKNIQNKV
YNEIKQEIKKQNINNNNNEDEVLIQYNKYRNSLPYFSMVIKETFRKHPVTAH
ETSNDVEFRGYKISKGTQIIKNIWATHRNEKIFQSPNSFIPERFLEELPNSNLVHFGV
GVRDCMGKSLAESQIFTILASFINRYEFLNPNPSIPLNDVGKFGLAFSCPQNKIIIKKRK*
Dict-IV-V477b10.p1c same seq as Dict-IV-V42f04.p1c including all frameshifts etc.
Dict-IV-V42f04.p1c 
Dict-IV-V831a03.q1c
Contig1006
IIAFP1D52879
IIAFP1D84459
JAX4a21c04.r1 73% identical to seq 8
JC1a87f04.r1
Contig_5063

>JC3a109h04.r1 Clone JC3a109h04, reverse read, bases 52 through 600, from
            2001-03-22
        Length = 547

  Minus Strand HSPs:

Query:     1 MNFLIILINIIIVLTTIIFLKKIIKKK 27
             MNFLIILINIIIVLTTIIFLK   KKK
Sbjct:   356 MNFLIILINIIIVLTTIIFLKNNKKKK 276 frameshift

Query:    18 IFLKKIIKKKNKYIPGPIGLPILGNLLSLKG 48
             +F  KIIKKKNKYIPGPIGLPILGNLLSLKG
Sbjct:   307 LFS*KIIKKKNKYIPGPIGLPILGNLLSLKG 215

>Contig_5063, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 10,471

  Minus Strand HSPs:

 Score = 355 (130.0 bits), Expect = 4.2e-67, Sum P(3) = 4.2e-67
 Identities = 69/70 (98%), Positives = 70/70 (100%), Frame = -3

Query:     1 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLA 60
             ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLA
Sbjct:  5864 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLA 5685

Query:    61 NGEMFKIIQR 70
             NGE+FKIIQR
Sbjct:  5684 NGELFKIIQR 5655

 Score = 294 (108.6 bits), Expect = 4.2e-67, Sum P(3) = 4.2e-67
 Identities = 57/57 (100%), Positives = 57/57 (100%), Frame = -2

Query:    92 IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF 148
             IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF
Sbjct:  5607 IKLISSNVMLTFTYGDKLLYKYNEIETFEEYLKNLSNYLKVTGRPILSDFIPFLKPF 5437

 Score = 106 (42.4 bits), Expect = 4.2e-67, Sum P(3) = 4.2e-67
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = -1

Query:    71 KLFKELDNLAEIGEPIILNRY 91
             KLFKELDNLAEIGEPIILNRY
Sbjct:  5668 KLFKELDNLAEIGEPIILNRY 5606

AAAATCATTACAATCATCAATATTCACATTTTTTGAAAATGGTTTTAAAAATGGAATAAA
ATCTGATAAAATTGGTCGACCTGTAACTTTTAAATAATTTGATAAATTTTTTAAATATTC 5520
TTCAAAAGTTTCAATTTCATTATATTTATATAATAATTTATCACCATATGTAAAAGTTAG
CATGACATTCGATGAAATCAATTTTATATCTATTTAAAATTATTGGTTCACCAATTTCAG 5640
CAAGGTTATCTAATTCTTTGAATAATTTTAAACAATTCTCCATTTGCCAATAACACATTC
TCACAATTATTTCTTTCTTTTTGAAATCTATTTTCAAAAATATAAGAATTTCCAATAAAT 5780
GCCTCCTTTAAAGTAGAGGATTCTGTTAAAACTATAGTTTCTATAGAACCCATTCTAATT
CTATAAATTGGTCCATATTGTTTATAAAATTCTTGAAATGAGATTAACCTTTTAATGATA 5900

>_4
                              YH*KVNLISRIL*TIWTNL*N*NGFYRNYSFNRILYFKGGIYWKFLYF*K*ISKRKK*L*
                              ECVIGKWRIV*NYSKN*ITLLKLVNQ*F*IDIKLISSNVMLTFTYGDKLLYKYNEIETFE
                              EYLKNLSNYLKVTGRPILSDFIPFLKPFSKNVNIDDCNDF
                              >_5
                              SLKG*SHFKNFINNMDQFIELEWVL*KL*F*QNPLL*RRHLLEILIFLKIDFKKKEIIVR
                              MCYWQMENCLKLFKELDNLAEIGEPIILNRYKIDFIECHANFYIW**III*I**N*NF*R
                              IFKKFIKLFKSYRSTNFIRFYSIFKTIFKKCEY**L**FX
                              >_6
                              IIKRLISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCE
                              NVLLANGELFKIIQRIR*PC*NW*TNNFK*I*N*FHRMSC*LLHMVINYYINIMKLKLLK
                              NI*KIYQII*KLQVDQFYQILFHF*NHFQKM*ILMIVMIX


>JC3a109h04.r1 Clone JC3a109h04, reverse read, bases 52 through 600, from
            2001-03-22
        Length = 547

  Minus Strand HSPs:

 Score = 251 (93.4 bits), Expect = 2.1e-20, P = 2.1e-20
 Identities = 50/57 (87%), Positives = 52/57 (91%), Frame = -3

Query:    19 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKLFKELDNL 75
             ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQK     +N+
Sbjct:   212 ISFQEFYKQYGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENV 42

>JC3a109h04.r1_4 Clone JC3a109h04, reverse read, bases 52 through 600, from 2001-03-22
                              DL*YLMIIKPPISF*NNYLIFLQTPNF*FNFIYLFKFLNYYFLIVILYLYFKKLIIIF*K
                              *IKNEFFNNIN*YYNSFNNNYFLKK**KKKINIYLVLLGYQF*EIYYH*KVNLISRIL*T
                              IWTNL*N*NGFYRNYSFNRILYFKGGIYWKFLYF*K*ISKRKK*L*ECVIGKWRNV*NYS
                              KK
                              >JC3a109h04.r1_5 Clone JC3a109h04, reverse read, bases 52 through 600, from 2001-03-22
                              *FIILNDNKTTNIFLK*LSYFFTNPQFLI*FYLFI*IFKLLFFNCYFIFIF*KINYYLLK
                              INKK*IF**Y*LIL**F*QQLFS*KIIKKKNKYIPGPIGLPILGNLLSLKG*SHFKNFIN
                              NMDQFIELEWVL*KL*F*QNPLL*RRHLLEILIFLKIDFKKKEIIVRMCYWQMEKCLKLF
                              KEX
                              >JC3a109h04.r1_6 Clone JC3a109h04, reverse read, bases 52 through 600, from 2001-03-22
                              IYNT****NHQYLFKIIILFFYKPPIFNLILFIYLNF*IIIF*LLFYIYILKN*LLSFKN
                              K*KMNFLIILINIIIVLTTIIFLKNNKKKK*IYTWSYWVTNFRKFIIIKRLISFQEFYKQ
                              YGPIYRIRMGSIETIVLTESSTLKEAFIGNSYIFENRFQKERNNCENVLLANGEMFKIIQ
                              RX
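
Note: frame translations like the ones above can be regenerated from the raw read. The Python sketch below is a minimal illustration, not the tool that produced the listing. It assumes the standard genetic code and writes X for any codon containing an ambiguous base. Function names are hypothetical, and frames are keyed BLAST-style (+1..+3, -1..-3) rather than _1.._6.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(dna: str) -> str:
    """Reverse-complement a DNA string (N stays N)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate from the first base; unknown codons become X."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frames(dna: str) -> dict:
    """Return translations keyed by frame: +1, +2, +3, -1, -2, -3."""
    rc = reverse_complement(dna)
    frames = {}
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(rc[offset:])
    return frames


# Start of the CYP514A1 first exon as it appears in Contig_0470 in this file.
exon1 = "ATGAATACAATATTTACAATTATTTTAACAATTACA"
print(translate(exon1))
print(six_frames(reverse_complement(exon1))[-1])
```

Both print statements give the same peptide, since frame -1 of the reverse-complemented fragment is frame +1 of the original.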

CYP513A3 Seq. (7+50) complete, 49% to seq 8, only one intron
MTSLTLYLIIFSIILYLFVN (0)
RNKRKNLK IPGPNGI PIFGNLLSLSGEMHLTLQEWYKTYGSVFSIRMGNIDTVVLT
EYPTIRKAFVDNSLAFASRYQLKSRVVLTGAKDLAIQNGEIHSLLKKVVLSEMTTTKIKRMEIHIIKE
TEKILKILDKHAERGEPFIINNYLNMFSMNVILRFLLGIDYPYENVDETVGYVKSIKSFFAVAGLPIL
SDFIPIPLKKSGVFFDSYKELEIETDKLIEKFKKSRNEKIENGTYNEEEDESILSKLLKEYEHGNITW
ECVSHTCIDIISAGTDTSANTLVMALIELINNQEIQSKAFSSIRSSCLNDSNDDDDDDEIVITHSKY
RSLLPYISMIIKETFRKHPIALLGLPHVTTEDVEIDGYKIEAGTYIIQNIFSSHRSDKI
FQSPNEFIPERFFESSQNQGLIHFGLGVRDCVGKSLAECEIFTLIATLLNRYQFINPNN
SKKLNDIGTFGLAQVCPDTNIILKKRI*
C90627
IIAFP1D53507
IIAFP1D59661
IIAFP1D61001
IIAFP1D72987
JAX4b12e11.s1 N-term
JC1a07f07.r1
JC1a25b04.r1
JC1a26e12.r1
JC1a68c10.s2
JC1a91f11.r1 N-term
JC1a105h02.s1 N-term
JC1a107g08.r1
JC1a111f06.s2
JC1a135d11.s1 mid region
JC1a150d03.r2
JC1a226f02.s1
JC1b144h08.r1
JC1b186b12.r1
JC1c102g01.r1
JC1c102g01.s1
JC1c131e03.s1
JC1c232h03.s1
JC1c235b07.s1
JC1c247d05.r1
JC1c247d05.s1
JC1c253h06.s1
JC1c279d04.r1
JC1c279d04.s1
JC2a11g11.r1 CALLED 7B
JC2b41a05.r1
JC2b257e01.s1
JC2b257e01.r1 mid region
JC2b257f06.s1
JC2c20g05.r1
JC2c20g05.s1
SSI868
AFC386
Seq 7b JC2a11g11.r1 C-term, 90% to seq 7, may be a different gene
DTSANTLVMALIELINNQEIQSKALPSIRSSCL
NDSNDDDDDDEIVIAHSKYRSLLPYISMIIKESFRKHPIALLGLPHVTTEDVEIDGYKIE 281
AGPFIIPNILSSHRSDKIFQSPNEFIPEIFFGSCQNX 173
NQGLIHFGLGVRDCVGKSLAECEIFPLIATLLNKYQFINPNNS*KLNDIGTFGL 14

>JC1b186b12.r1 Clone JC1b186b12, reverse read, bases 66 through 644, from
            2000-09-15
        Length = 577

  Minus Strand HSPs:

Query:     1 MTSLTLYLIIFSIILYLFRN 20
             MTSLTLYLIIFSIILYLF N
Sbjct:   501 MTSLTLYLIIFSIILYLFVN 442

Query:    19 RNKRKNLKIPGPNGIPIFGNLLSLSG 44
             RNKRKNLKIPGPNGIPIFGNLLSLSG
Sbjct:   340 RNKRKNLKIPGPNGIPIFGNLLSLSG 263

>JC1b186b12.r1 Clone JC1b186b12, reverse read, bases 66 through 644, from 2000-09-15
TTTGGTTGTAGTCATTTTCTGAAAAACTACTTTTTTCAATAATGAATGAATTTCACCATT
TTGAATAGCAAGATCTTTAGCACCAGTTAATACAACTCTACTTTTCAATTGATATCTACT
TGCAAATGCTAATGAATTATCAACAAATGCTTTTCTAATAGTTGGATATTCAGTTAAAAC
CACTGTATCAATATTACCCATTCTAATGGGAAAATACAGATCCATAAGTTTTATACCATT
CTTGTAGTGTTAAATGCATTTCACCACTTAAAGATAATAAATTACCGAAAATTGGAATTC
CATTAGGTCCTGGAATTTTTAAATTTTTTCTTTTATTTCTCTAGTTAATTAATTTAAAAT
                              R  N  K  R
AAATAAAAAAAAAATTAGTTTTAAGATAAAGATAAATCAATTTTTGAAAAAAAAAATAAA
AAAAAAAAAATAAAAAATTACATTTACAAACAAATATAAGATAATTGAAAATATAATCAA
                       N  V  F  L  Y
ATATAATGTTAAACTAGTCATTGTTTAAGATTTTGCTTTGGAAATTGAGTTTGAGGGTTT
TTTTTTTTTTTTTTTGTTTTTTTGGTTTTTTTTTTAA

CYP513B1 Seq 49 complete seq, 45% to seq 7, 50; only one intron
MNLLVLSVILAIIIYLIFKR (0)
NYKYSPSKINSKIPGPIGLPIFGNILSLDNKNGIHTTFQKWFKIYGPIYSVN
MGNKSAVVLTGFPIIKKAFIDNSEAFAPHYTFESRYKLNKCSDITQENGKNQSALKRIFL
SELTVTRIKKQESHIQNEIVKLMKVLDKHSEDGKPFLLNNYFSMFSINIISRFLFGID
FPYQDFEETSDLMVGIRDLLIASGEIVLSDFLPIPHSKRSKLYTSYQALVVQIETLV
KSHKYKEDDECMLSKLMIEHDKGNIPWDAVISNCNTIITAG
SDSTSSTALFFLIEMMNNPTIQTKVYNDIVVSFEQNQQADDYMNESMVILKYSKYRSLIPYLSLALKENYR
KHPAAPFGAPHETTQETVIEGYTIAKGTMIFQNIYATQRSDTFYSQPDEFIPERWNGDENSQTLIS
FGTGIRDCIGKSLAYNEIFTIIASVLNRYEFINPNPSIPFDDNGIPGLTTQCKNTVVQIKKR*
IIADP1D4990
JAX4a221a10.r1
JC1c244e11.r1
JC1c244e11.s1
CFG492 SUPPORTS FIRST INTRON BOUNDARY

>CFG492 (CFG492Q) /pub/dna_csm/LIBRARY/CF/CFG4-D/CFG492Q.Seq.d/
        Length = 1416

  Plus Strand HSPs:

 Score = 254 (94.5 bits), Expect = 1.5e-20, P = 1.5e-20
 Identities = 51/51 (100%), Positives = 51/51 (100%), Frame = +2

Query:     1 MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNK 51
             MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNK
Sbjct:    20 MNLLVLSVILAIIIYLIFKRNYKYSPSKINSKIPGPIGLPIFGNILSLDNK 172



CYP513C1 Seq 5+56 complete seq, 43% to seq 7, 495 aa, only one intron
MNYLVLILVSLVSIYFLFIKN (0?)
QDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKY
LEWFEKYGPVFRVSIGSLETVVLTGYPILREAFIDNSDTFTSRFQRENARS
INGYKGLINSNGDYHKNLKSVILSEMTATKIKKMESHINQESKRLCELLDQHAKQGTPFTMNKYLNLF
SINIILRFLFGVNYPYTELDDGSSSIIQVIQQFLKLVSQPSITTYFPILSPFMNDRSKEF
YDIHKLLSNHIINLIERYKDSKQHQQQEQEPIDGATEPTVTILDKLLIEVENNRITQNAL
ISICIDVLIAGTDTVGQTLSFAIVALVNNAEIQEKLSRNIIDSMEKGDNHYSYSKYRNGI
PYLALVVKEVFRMYPAGILGLPHMTSEDCEIQGHKIAKGTQIIQNIYSTHRSESFWPNPN
NFIPERHIQNDVSKSVHFAVGTRNCMGMSLSEAEVHTAMAELFGNFKFTNPSNIPLNDQG
TFSVALNCPDFFVKIERRN*
C91402
CHR2.0.12287
Contig2443  Chr 6
JAX4a56e09.r1
c-JAX4a56e09.r1
JAX4a73a11.r1
JC1a97h05.r1
JC1b31b11.s1
JC1b136b11.s1
JC1c226g09.r1 
JC1c226g09.s1
sdic6A2e2.q1t 41-92 KYG region
sdic6A74a10.p1c
sdic6Fh12.q1t
sdic6Rf10.q1t
SSK171 
AFH363 N-term EST supports first intron boundary
VFB834
SFK345 N-term EST supports first intron boundary

>CYP513C1 Seq 5+56 complete seq 43% to seq 7 486 aa
          Length = 486

 Score = 215 (75.7 bits), Expect = 1.0e-20, P = 1.0e-20
 Identities = 48/58 (82%), Positives = 48/58 (82%)

Query:     1 MNYLVLILVSLVSIYFLFIKNQDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYL 58
             MNYLVLILVSL          QDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYL
Sbjct:     1 MNYLVLILVSL----------QDRKKKINSKIPGPKGFLFGNLIELARTKKNLHLKYL 48


CYP513D1 Seq (54+55+78) complete seq, 38% to seq 8, only one intron
MGISSIIIILFIIVLLKKLIKK (0)
EDRIHRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWYEQYGSVYRIEFGSVSTVVLTEYATLKEAFV
DNGEIFQSRFQRKSRTTCNKGLNLANSNGEYFNHLKKTLSNEITNQKMKK
NEKIIKIQVGLLSEFFNEISGGGGGGSGGSGISKE
PINNIIKMYSLNVML
SLLFNIHFPYNNNSYQDELMSTITRYFKSTGLPYPSDFIPILYPFLKNK
PKEYFEDYESVKKLITRITNEYQLKHMTEISNKSTIEEIENYQPTNILESLLKQYRLNKI
PYDGVIGCLMDLILAGSDTTGNTCLFSLVALVNNSNIQEKLFNEISNAFNDDDGDEL
NGANDISNSLLKLSYFSDRIKTPYLVAFIKEVKRYYPCAPLSVPHL
LTEDCEIQGYKIAKGTQVIQNIYSTHLSQSFC
SNPLEFSPERFLDSTNEPKIITFGIGQRKCPGENIFEIEIYIFLVYLIKKFKFSHPI
DDNLQLNDRGQFGLSLQCPQLNIKVESR*
IIAFP1D25075
IIAFP1D36726
JAX4b58a08.r1
JC2a05e01.s1 
JC2c166f03.r1 
JC2e05a11.r1
JC2e05a11.s1
c-JC2e05a11.s1
SFE487

>SFE487 (SFE487Q) /pub/dna_csm/LIBRARY/SF/SFE4-D/SFE487Q.Seq.d/
        Length = 1079

  Plus Strand HSPs:

 Score = 163 (62.4 bits), Expect = 9.2e-11, P = 9.2e-11
 Identities = 39/50 (78%), Positives = 40/50 (80%), Frame = +1

Query:     1 MGISSIIIILFIIGLLKK---------KINKNIPGPKSKLLVGNLFDLKG 41
             MGISSIIIILFII LLKK         +INKNIPGPKSKLLVGNLFDLKG
Sbjct:    40 MGISSIIIILFIIVLLKKLIKKEDRIHRINKNIPGPKSKLLVGNLFDLKG 189

>JC2e05a11.r1_frame+1 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07, translation frame +1
GTFFFLXNITWKFSKMGISSIIIILFIXGLLKKGXKKVFSIFFLKKNNNNNNYNKLIFFK
IK*NKIRKIEFXE*IKIFQDQKVNYWLVIYLI*KDKXMKN*RNGMXNMEXFIVLNLVVL
>JC2e05a11.r1_frame+2 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07, translation frame +2
GPFFF*XISRGNFQKWEFLQLLLSYLLXDY*KRGXKRYFLFFF*KKIIIIIIIIN*FFLK
*NKIKSGR*NSQNK*KYSRTKK*IIGW*FI*FKRTSX*KIKGMV*XIWKXLSY*IW*C
>JC2e05a11.r1_frame+3 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07, translation frame +3
DLFFFX*YHVEIFKNGNFFNYYYPIYYXIIKKGG*KGIFYFFFKKK*****L**INFF*N
KIK*NQEDRIXRINKNIPGPKSKLLVGNLFDLKGQVHEKLKEWYXQYGXVYRIEFGSV

>JC2e05a11.r1 Clone JC2e05a11, reverse read, bases 59 through 417, from 1999-04-07
GGGACCTTTTTTTTTTTAANTAATATCACGTGGAAATTTTCAAAAATGGGAATTTCTTCA
ATTATTATTATCCTATTTATTATNGGATTATTAAAAAAGGGGGNTAAAAAGGTATTTTCT
ATTTTTTTTTTAAAAAAAAATAATAATAATAATAATTATAATAAATTAATTTTTTTTAAA
ATAAAATAAAATAAAATCAGGAAGATAGAATTCNCAGAATAAATAAAAATATTCCAGGAC
CAAAAAGTAAATTATTGGTTGGTAATTTATTTGATTTAAAAGGACAAGTNCATGAAAAAT
TAAAGGAATGGTATGANCAATATGGAAGNGTTTATCGTATTGAATTTGGTAGTGTTA

CYP514A1 Seq. 11 complete seq, 35% to seq 7

MNTIFTIILTITILVLSLILK (0)
DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVF
RLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISS
NGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKN
TPNQIRILAVKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYL
PFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEKENELKNETT
DETSSKLNNIPIIENYYKNYLDGSIHYDS
ILFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN (1)
ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEF
WGEDALEFKPERFKNQPLYQKGLFHFGA (1)
GPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPI
DVEGEVGLAMQCKPFDALFIKRN*
* means the sequence was verified by BLAST against seq 11
Note: 514A1 differs from 514A4 by only 7 aa, so some of these genomic seqs are
from 514A4
AU075025 *
C93191  *
Contig16233 Chr 6 whole gene *
IIAFP1D5994 *
IIAFP1D46081 *
IIAFP1D56636 probably same gene some errors
IIAGP1D1903  *
IIAGP1D4567 *
IIAGP1D6746 *
JAX4a85a04.r1 *
JAX4a165d01.r1 * 
JAX4a185c11.s1 *
JAX4a225h05.r1 *
JAX4a225h05.s1 *
JC1a54g04.r1  *
JC2b365b08.s1 *
JC2c130f08.r1 *
JC3c23c03.s1   FGKDYNIIS
JC3a164c10.s1  FGKDYNIIS
SSM757
SFH636
Contig_0470

>Contig_0470, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 26,456

  Plus Strand HSPs:

 Score = 1791 (635.5 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 349/352 (99%), Positives = 350/352 (99%), Frame = +3

Query:    22 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 81
             DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV
Sbjct:  2865 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 3044

Query:    82 VLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISSNGDYHYVLRGILTSEITTRK 141
             VLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISSNGDYHYVLRGILTSEITTRK
Sbjct:  3045 VLTGPEVIDECFNKKHREIFKERYIKFSRFFGKDYNIISSNGDYHYVLRGILTSEITTRK 3224

Query:   142 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 201
             LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII
Sbjct:  3225 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 3404

Query:   202 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEK 261
             VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEK
Sbjct:  3405 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFLKDFIGIKLDAIKIKYEK 3584

Query:   262 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 321
             ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL
Sbjct:  3585 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 3764

Query:   322 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNESYRY 373
             LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN +Y Y
Sbjct:  3765 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNGNYPY 3920

 Score = 410 (149.4 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 77/86 (89%), Positives = 81/86 (94%), Frame = +2

Query:   359 KYPYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 418
             K  +++  + ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE
Sbjct:  3965 KINFVLI*IIESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 4144

Query:   419 DALEFKPERFKNQPLYQKGLFHFGAG 444
             DALEFKPERFKNQPLYQKGLFHFGAG
Sbjct:  4145 DALEFKPERFKNQPLYQKGLFHFGAG 4222

 Score = 305 (112.4 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 58/65 (89%), Positives = 59/65 (90%), Frame = +3

Query:   438 LFHFGAGARACPGGRFTESLTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDAL 497
             +  F  G R CPGGRFTESLTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDAL
Sbjct:  4329 IIFFKTGPRGCPGGRFTESLTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDAL 4508

Query:   498 FIKRN 502
             FIKRN
Sbjct:  4509 FIKRN 4523

 Score = 94 (38.1 bits), Expect = 3.1e-257, Sum P(4) = 3.1e-257
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = +3

Query:     1 MNTIFTIILTITILVLSLILK 21
             MNTIFTIILTITILVLSLILK
Sbjct:  2724 MNTIFTIILTITILVLSLILK 2786

>Contig_0470, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion
ATTTTATAATTTAAATAGTTTTTTTTTTAATATAAAAACTTTAATATAAAAGATTCTCAA 2700
TTAAAATATTTTATAATATTACAATGAATACAATATTTACAATTATTTTAACAATTACAA
TATTAGTATTATCATTAATATTAAAAGTAAAATTTTAAAATATTATATATATTATTGATT
ATTAAAAATTAAACACTTACTTATACTTTTATTTTTTTAAATAGGATTTATTATTTGAAG
GTAGGATTAAAAAAATTAATAAATTAATACCTGGTCCTTCTACAATTCCAGTTTTTGGTA
ATTTATTACAAATTAACGCAAAAGATTTCCCAAAAAGTGTAAATGATTTCTATGAAAGAT
ATGGCAAAGTTTTTAGATTAAGATTGGGTAGTGTTGAAATTGTTGTTCTAACAGGGCCTG
AAGTTATTGATGAATGTTTTAATAAAAAACATAGAGAAATTTTCAAAGAAAGATATATTA
AATTCTCAAGATTTTTTGGTAAAGATTATAACATTATTTCTTCAAATGGTGATTATCATT
ATGTTCTAAGAGGAATTCTTACAAGTGAAATTACAACAAGAAAGTTAAATAATGGCAGAT
TAGAATCAAATAAATTTATTTTAGAAATGTTTAGTAATCTTTGTAAAGATAATAAAGAAA
CTCTTGTTAAAAATACACCAAATCAAATTAGGATTCTTGCTGTTAAATTAATATTAAATT
TCACATTAGGAATTGAAGAGAATGATGAAACCATTCTAATCATTGTAGAAAAGATTAAAT
GCATTTTTGAAGCTGCTGGATTATTAATTTACTCTGATTATTTACCATTTTTATTTCCAT
TAGATATAAAATCAATGTCAAAGAATGATATTATTTCTAGTTACTTTTTTTTAAAAGATT
TTATAGGTATAAAACTTGATGCTATTAAAATTAAATATGAAAAAGAAAATGAATTGAAAA
ATGAAACTACTGATGAAACATCTTCAAAACTAAACAATATTCCAATTATTGAGAATTATT
ATAAAAATTATTTAGATGGTTCAATTCATTATGATTCAATCTTGTTTTCAATTTCTGATA
TTATTTTTGCAGCAGTTGATTCAACATCCAATGGGTTTAGTCTTTTAATTGGTCAATTAA
TTAATAAACCTGAAATTCAAGATAAAATATATGAAGAAATCATGAGAAATGATGAAAATA
ATAATACAAATAATATATCATTTGCTGATCATACAAAATATCCATATATTATTTCTGTAA 3900
TGAATGGTAATTATCCATATATATATATATATTGGTATTAATATGTGTTGATATATTTTT
TCAAAAAATAAATTTTGTATTAATTTGAATAATAGAATCATATAGATATAATTCTTCAGT
ACCAATAACTGAACCGAATAAAACAACAGAAGATGTTGAAGTAAATGGATATAAAATTGC
AAAAGGTACAATGATAATTAAAAATCTTCGTGGTACACATTTATCAAAAGAGTTTTGGGG 4140
TGAAGATGCTTTAGAATTTAAACCAGAAAGATTTAAAAATCAACCTTTATATCAAAAAGG
ACTTTTTCATTTTGGTGCAGGTATGTTTTGATACCATTTTAAAAAGGAATTAAAATTTAC
CACTATTTTTTTTTTTTTTAATTTACAACTAATTTATAATAATAATAATAATAATAATAA 4320
TCATCATTATTATTTTTTTTAAAACAGGACCTAGAGGTTGTCCAGGAGGAAGATTTACAG 4380
AATCTTTGACTTTTACATTTTTGGTGATAATGTTAAAGAATTTTAAAATAGTCAATCCAA
CTGATATTCCAATTGATGTTGAAGGAGAAGTTGGTTTGGCAATGCAATGTAAACCTTTTG 4500
ATGCCTTATTCATAAAACGTAATTAATAAAAAAAATATTTATTTTTTTATTATTTTAATC
ATTGTTTTCATTGTTGCAAAAATATAATATTTAATAAATAAATTTAGATACAAATAGAAA

CYP514A2 Seq 65, similar to seq 11, 500 aa
MNLIYTIILTIIILVLIISIK (0)
DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVET
VVLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLS
SQVTVRKLNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGI
EENDDINLSLFQNGSN
IFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKKEYIING
DDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTS
NSISFIIARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMN (1)
ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLIS
KEFWGEDALEFKPERFKTQTLNQKGLLHFGA (1)
GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN*
AB050504 C-term only
IIADP1D1846   *
IIAFP1D29162  * 43/45 identities
JC1a251d12.r1 *
JC1c209h10.r1 *
JC1c220h10.s1 *
JC1c262g11.r1 *
JC1c290g01.r1 *
JC2a117h07.r1 *
Dict_IV1e09.p1c extends upstream
IICBP3D35368

>Contig_0356, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 17,230

  Minus Strand HSPs:

 Score = 1773 (629.2 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 346/349 (99%), Positives = 347/349 (99%), Frame = -3

Query:    22 DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVETV 81
             DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVETV
Sbjct:  5126 DLFFEDKIKKINKSIPSPPTIPIFGNLLQINSKDVATCFNDFYKQYGKVYRLRLGSVETV 4947

Query:    82 VLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLSSQVTVRK 141
             VLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLSSQVTVRK
Sbjct:  4946 VLTGGDIIDECFNKKYRDFLKARYVKFSRYLGKDTNILHSNGDYHFLLKGVLSSQVTVRK 4767

Query:   142 LNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGIEENDDIN 201
             LNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGIEENDDIN
Sbjct:  4766 LNNGRLEFNKYILQMFNNLNNNDEGSTMFLANDVPSQIKKLILKVVLNFTLGIEENDDIN 4587

Query:   202 LSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKK 261
             LSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKK
Sbjct:  4586 LSLFQNGSNIFKAAGLFIYSDYLPFLFPLDIKSMAKSNMISSYVFVRDYLAKKLEEVKKK 4407

Query:   262 EYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTSNSISFII 321
             EYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTSNSISFII
Sbjct:  4406 EYIINGDDDGGVDTSQTPLIESYYKLYLQGLIGYDSILLSIVDIIIASVDTTSNSISFII 4227

Query:   322 ARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMNETYR 370
             ARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMN  Y+
Sbjct:  4226 ARLTNHQEIQSKIYEEIMSNDINNNSNNISFSDHSKYPYIISIMNGNYK 4080

 Score = 405 (147.6 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 76/76 (100%), Positives = 76/76 (100%), Frame = -1

Query:   367 ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALEFKPERF 426
             ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALEFKPERF
Sbjct:  4018 ETYRYYASVPLPEPNMTTEDIEVDGYKIAKGTQIYKNIRGTLISKEFWGEDALEFKPERF 3839

Query:   427 KTQTLNQKGLLHFGAG 442
             KTQTLNQKGLLHFGAG
Sbjct:  3838 KTQTLNQKGLLHFGAG 3791

 Score = 329 (120.9 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 59/59 (100%), Positives = 59/59 (100%), Frame = -2

Query:   442 GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN 500
             GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN
Sbjct:  3705 GPRGCPGARFTECFFFTLMVLLFKNYKLQNPNDNPIDDRGDVGLSMQCKPYDALFIKRN 3529

 Score = 93 (37.8 bits), Expect = 3.6e-258, Sum P(4) = 3.6e-258
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = -2

Query:     1 MNLIYTIILTIIILVLIISIK 21
             MNLIYTIILTIIILVLIISIK
Sbjct:  5256 MNLIYTIILTIIILVLIISIK 5194

CYP514A3P Seq 89, 67% to 514A1, possible pseudogene, no ESTs
LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI
YEGIMRNDEVSNADDMSFADRAECPCVFSVMD (bad intron joint)
ESYRF (frame shift)
YSSVPITEPNVATGDVEVNGYRIAEGAMIIXNLRGAHFSRGFWG
IIAFP1D7832  87% to seq 11 
IIAFP2D7832


>IIAFP2D7832
        Length = 1248

  Minus Strand HSPs:

 Score = 484 (175.4 bits), Expect = 6.1e-67, Sum P(2) = 6.1e-67
 Identities = 93/98 (94%), Positives = 94/98 (95%), Frame = -3

Query:     1 LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI 60
             LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI
Sbjct:   685 LSGIPFVEDCYESYLDGSVRYDSFLFSVSDVVFAAVDSASNGFSLLIGQLFNRPGIQDGI 506

Query:    61 YEGIMRNDEVSNADDMSFADRAECPCVFSVMDESYRFY 98
             YEGIMRNDEVSNADDMSFADRAECPCVFSVMD  +  Y
Sbjct:   505 YEGIMRNDEVSNADDMSFADRAECPCVFSVMDVCFSIY 392

 Score = 221 (82.9 bits), Expect = 6.1e-67, Sum P(2) = 6.1e-67
 Identities = 42/44 (95%), Positives = 42/44 (95%), Frame = -2

Query:    98 YSSVPITEPNVATGDVEVNGYRIAEGAMIIXNLRGAHFSRGFWG 141
             YSSVPITEPNVATGDVEVNGYRIAEGAMII NLRGAHFS GFWG
Sbjct:   305 YSSVPITEPNVATGDVEVNGYRIAEGAMIIXNLRGAHFSXGFWG 174

>IIAFP2D7832
ACCACCCACAACAAAAAACCACAAAAACACCATCGGCCCNNCCAAATAAACNCCACCCCG
CTCACCCTCACACGCAACCCATACAACGCCCAGGTCAATTTTCTTTCNNNAAGGCAAGGG
GCGACCCACCNNATTGAGCCCCAGTCGAAGGCGCAACNAAATGAACCTCCCCCTCCCCAA
AACCCTNTNGAAAAATGNGCACCACGAAGATTNTTAATTATCATTGCACCTTCTGCAATT
CTATATCCATTCACTTCAACATCTCCTGTTGCTACATTCGGCTCAGTTATTGGTACTGAA
GAATAAATCTATATGATTCTACTATTCAAACTAACACAAAACTCATTCTTTGAAAAAACA
TACCAACACATATCAACACCAACATACATATATATATGGAGAAACACACATCCATTACAG
AAAAAACACATGGACATTCCGCACGATCAGCAAAAGACATATCATCCGCATTACTAACCT
CATCATTCCTCATGATTCCTTCATATATTCCATCCTGAATTCCAGGCCTATTAAACAACT
GACCAATCAAAAGACTAAACCCATTGGACGCTGAATCAACTGCCGCAAAAACAACATCAG
AAACCGAAAACAAGAACGAATCATAACGAACCGAACCATCCAAATAACTCTCATAACAAT
CCTCAACAAACGGAATACCGCTCAAGCCTCGAAAATGCCCCACCAACCAGGCCCCAATTC
CCAAATCCAATCTCCCTCTCCCAAGACTGAAAGCGCTAAACAACCAACCAAAGGCGCCCA
AAAACCCCAATAAAAAACACCTTTCCAAAAAAAAAACAACCTAACCCTAAGAACAAAACA
AACAACCCCATCCCCCCCTGGGAACAACACGGGAACCTTCTAAAAAACCCCAAAACGGGA
AAAAATAACAAAAAAAGGGGCAAAAAATAAAACCCACCAAGCAAAAACCAAAAAAAAATC
CCAAACCCAGCCCTCCAAAAAAAACGCCAAACCAAAAACCCCCTGCCAACAAAGGAACCA
AAAAAGGGGCCAAACAAACCCTCTTCAAAACCCAAAGGGGAAACTCAAAAACCAAAGTCA
CAAGCAAGAAACCCAAACGCGAACTGCGGCACCACCAAAAAGAAGCCTCCCACAACCCCA
AAAACAAAACCAAACATCTCAAAAAAAAGAAAGGAACCAAAACAGCCAAAATAAAACAAC
GGCGGACAACCACCCGAAAAAACACCCCAAAAACAACGGAACCCCCCC

>_4
                              LGRGRFIXLRLRLGLNXVGRPLPXXKKIDLGVVWVACEGERGGVYLXGPMVFLWFFVVGG
                              >_5
                              GEGEVHXVAPSTGAQXGGSPLAXXKEN*PGRCMGCV*G*AGWXLFGXADGVFVVFCCGWX
                              >_6
                              WGGGGSFXCAFDWGSXXWVAPCLXERKLTWALYGLRVRVSGVXFIWXGRWCFCGFLLWVV

CYP514A4 almost identical to 514A1 (7 aa diffs)
MNTIFTIILTITILVLSLILKDLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSV
NDFYERYGKVFRLRLGSVEIVVLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFS
SNGDYHYVLRGILTSEITTRKLNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILA
VKLILNFTLGIEENDETILIIVEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISS
YFFIKDFIGIKLDAIKIKYEKENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSI
LFSISDIIFAAVDSTSNGFSLLIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKY
PYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGEDA
LEFKPERFKNQPLNQKGLFHFGAGARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPID
VEGEVGLAMQCKPFDALFIKRN
ng2792        Contig_1093
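
Note: near-identical paralogs such as 514A1 and 514A4 can be compared position by position once both proteins are in hand. The Python sketch below is an illustration only. The function name aa_differences is hypothetical, and it assumes the two sequences are the same length with no gaps. The example uses only a short fragment of each protein, taken from the FSRF...NGDYH region of the two entries in this file.

```python
def aa_differences(seq_a: str, seq_b: str):
    """List (1-based position, residue_a, residue_b) where two proteins differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be the same length (no gaps handled)")
    return [(pos, a, b)
            for pos, (a, b) in enumerate(zip(seq_a, seq_b), start=1)
            if a != b]


# Matching fragments of 514A1 and 514A4 as listed in this file.
frag_514a1 = "FSRFFGKDYNIISS"
frag_514a4 = "FSRFLGKDYNIFSS"

for pos, a, b in aa_differences(frag_514a1, frag_514a4):
    print(f"fragment position {pos}: {a} -> {b}")
```

Positions printed here are relative to the fragment, not to the full protein. Running the same function on the two complete sequences would list every substitution.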

>Contig_1093, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 13,682

  Plus Strand HSPs:

 Score = 1791 (635.5 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 349/352 (99%), Positives = 350/352 (99%), Frame = +2

Query:    22 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 81
             DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV
Sbjct:  7610 DLLFEGRIKKINKLIPGPSTIPVFGNLLQINAKDFPKSVNDFYERYGKVFRLRLGSVEIV 7789

Query:    82 VLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFSSNGDYHYVLRGILTSEITTRK 141
             VLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFSSNGDYHYVLRGILTSEITTRK
Sbjct:  7790 VLTGPEVIDECFNKKHREIFKERYIKFSRFLGKDYNIFSSNGDYHYVLRGILTSEITTRK 7969

Query:   142 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 201
             LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII
Sbjct:  7970 LNNGRLESNKFILEMFSNLCKDNKETLVKNTPNQIRILAVKLILNFTLGIEENDETILII 8149

Query:   202 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFIKDFIGIKLDAIKIKYEK 261
             VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFIKDFIGIKLDAIKIKYEK
Sbjct:  8150 VEKIKCIFEAAGLLIYSDYLPFLFPLDIKSMSKNDIISSYFFIKDFIGIKLDAIKIKYEK 8329

Query:   262 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 321
             ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL
Sbjct:  8330 ENELKNETTDETSSKLNNIPIIENYYKNYLDGSIHYDSILFSISDIIFAAVDSTSNGFSL 8509

Query:   322 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNESYRY 373
             LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMN +Y Y
Sbjct:  8510 LIGQLINKPEIQDKIYEEIMRNDENNNTNNISFADHTKYPYIISVMNGNYPY 8665

 Score = 409 (149.0 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 77/86 (89%), Positives = 81/86 (94%), Frame = +3

Query:   359 KYPYIISVMNESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 418
             K  +++  + ESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE
Sbjct:  8712 KINFVLI*IIESYRYNSSVPITEPNKTTEDVEVNGYKIAKGTMIIKNLRGTHLSKEFWGE 8891

Query:   419 DALEFKPERFKNQPLNQKGLFHFGAG 444
             DALEFKPERFKNQPLNQKGLFHFGAG
Sbjct:  8892 DALEFKPERFKNQPLNQKGLFHFGAG 8969

 Score = 311 (114.5 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 59/59 (100%), Positives = 59/59 (100%), Frame = +2

Query:   444 GARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDALFIKRN 502
             GARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDALFIKRN
Sbjct:  9044 GARACPGGRFTESFTFTFLVIMLKNFKIVNPTDIPIDVEGEVGLAMQCKPFDALFIKRN 9220

 Score = 94 (38.1 bits), Expect = 2.2e-259, Sum P(4) = 2.2e-259
 Identities = 21/21 (100%), Positives = 21/21 (100%), Frame = +2

Query:     1 MNTIFTIILTITILVLSLILK 21
             MNTIFTIILTITILVLSLILK
Sbjct:  7469 MNTIFTIILTITILVLSLILK 7531

CYP515A1 Seq 79b complete
MILGIILGLFIYIYLINIK (0)
FFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSLK
MGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGII (2)
FGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNI(0)
DMGPIFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLF
EYLAKQPADFIPILKPFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDGN(2)
EPKCFLEYFISEIRKDTSNLIKITDLPYICFDIIVAGI(1)
VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISPP(1)
APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIGQ
CAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQL
NDENGHFTRSLSPFEFKSKLIIRK*
IIAFP1D72152
JC1a231d09.s1
JC1a254g04.s1
JC2a52d03.r1
JC2a204a05.s1
JC2b337a08.s1
JC2c56e03.r1
JC2c172c05.r1 
JC2c172c05.s1
JC2d33f05.s1
JC2d53c08.r1
JC2e77c08.r1
JC2e116h09.s1
c-JC2e13d10.s1
Dict-IV-V627e11.p1c
Dict-IV-V627e11.q1c
Dict-IV-V635h04.p1c

>Contig_1639, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 3429

  Plus Strand HSPs:

Query:    19 KFFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSL 78
             KFFNRAVPSSLLVGENKLKCKF SGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSL
Sbjct:   620 KFFNRAVPSSLLVGENKLKCKFRSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSL 799

Query:    79 KMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGI--IFGNGSHWRKLKD 136
             KMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGI  ++   + +  LK 
Sbjct:   800 KMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGIM*VYNFKNKYLVLKK 979

Query:   137 ILS 139
             I++
Sbjct:   980 IIN 988

Query:   109 FHLPSFYYMGKYQGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKK 168
             F +   Y + KY  I FGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKK
Sbjct:   964 FSIKKNY*LIKYI*IRFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKK 1143

Query:   169 INQDNNI 175
             INQDNNI
Sbjct:  1144 INQDNNI 1164

Query:   176 DMGPIFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILK 235
             DMGPIFK+ILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILK
Sbjct:  1269 DMGPIFKKILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILK 1448

Query:   236 PFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDG 272
             PFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDG
Sbjct:  1449 PFNNYKEIEKEYNNCLNFFQPLIDNILKNISDDDDDG 1559


Query:   312 VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISP 368
             VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISP
Sbjct:  1853 VTTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSIIKETLRISP 2023

Query:   370 APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIG 429
             APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIG
Sbjct:  2120 APFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHFLSKDELNIG 2299

Query:   430 QCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRSLSPFEFKSK 489
             QCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRSLSPFEFKSK
Sbjct:  2300 QCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRSLSPFEFKSK 2479

Query:   490 LIIRK 494
             LIIRK
Sbjct:  2480 LIIRK 2494


>Contig_1639, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion
TTTTTTAAAAAATTTTTTTAAAAAAAAGGTTTTTTTTGAAAAAAAAAAATGGGGGGAAAA
AAAATTTTTTTTTAAAAGGTTTAAAATTTTTAAAAAAAAAAAAAAAACCCAAAAAAAGGT
TTTGGGAAAAAAAAGGGAAGGAAAAGGGTTTAAACCCCCCCAAATTTTTTATTTTTTTTT
TCCAAAAAAATTTTTTTTAGGGGGGAAAAATTTTTATTTTTCCCCCCCCTTTTTTTTTTT
TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGGAAAAGGGGGGTTTTTTTTTGAAT
TAAACCCTTTTTTAAAAAGGGGAAACCCCTTTGGGCAAAAAAAAAAAATGTTTTTTTTTC
AAAAAGGAAGGGTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAGTTACTAATAAAATTTTAATAATTAAATTTTTAAAACTTAATTTAATTAATATTAA
AAAATGATATTAGGAATTATTTTAGGATTATTTATATATATATATTTAATAAATATAAAA
GTAAGCATTAATATAATATAATATAATATTAATAAAATTTTTAAATAAAAAATAATATTA
ATAAAATTTTTAATTTAAAAAGTTTTTTAATAGAGCAGTACCTTCATCATTATTAGTTGG
AGAAAATAAATTAAAATGTAAATTTCGAAGTGGTCCATTAATATTACCTATTATTGGTAG
CCTTTACAAATTAAGTTTAAAATATCCTCACTTATCATTCAAACAATTATCAGATAAATA
CGGAAAAGTATTTTCATTGAAAATGGGTTCAATTGATACAATTGTAATTAATGATATCAA
TTTTTTACAAAAATCATTTCGTGACAATCCAACCTTTTTTTCACAAAGATTTCATTTACC
ATCTTTTTATTACATGGGAAAATATCAAGGTATTATGTAAGTATATAACTTTAAAAATAA
ATATTTAGTATTAAAAAAAATTATTAATTAATTAAATATATATAAATTAGATTTGGTAAT
GGTTCTCATTGGAGAAAATTAAAAGATATTTTAAGTAGTTCAATTACTAAATCAAAATCT
CGTCAAATGGAAGAATTATTTTATAATGAATATTTTAAAGCTGAAGAATATTTATTAAAA 1140
AAAATAAATCAAGATAATAATATTGTAAGTTTTTATTATTTTTGAAAAAAATAAATAAAT
AAATAAATAAATATATATATATATAATTGATTATTAGTTTAATTTTTCACAAAAAAAAAA
AAAAAAAGGATATGGGTCCAATTTTTAAAAAAATTTTATTAAATATTTTATATAGATTTT
TATTTGGTGTATCATTTGAATATGATGATAATTTATTATCAAAAGAATTTTATTCATTTA
TCCAAAGCTATAATAAATTATTTGAATATTTAGCAAAACAACCAGCAGATTTTATCCCAA
TTTTAAAACCATTTAATAATTATAAAGAAATTGAAAAAGAGTATAATAATTGTTTAAATT 1500
TCTTTCAACCATTAATTGATAATATTTTAAAAAATATTAGTGATGATGATGATGATGGAA
AGTATGTATAAGTTATAATTCTAGTATGTGTTTAATTAAAATACTAAATTTTATTTTTTT
TTTTTTTTTTATTAAATAATAGTGAACCAAAATGTTTTTTAGAATATTTCATTAGTGAAA
TTCGAAAGGATACATCAAATTTAATTAAAATAACTGATCTTCCATATATATGTTTCGATA
TTATTGTTGCAGGTATAGGTAAATCTAATATTTATTTTTTCTTTTTTTCTTTTTTTTTTT 1800
TTTTTTTTTTTTTTCAAAAAATTAAAATTTATGATTCTAATTTTTTTTTAAAGTTACAAC
AAGTACAACTATGGATTGGATGTTATTATATTTAACAAATTATCCGAATATTCAAGAGAA
ATTATTTTTTGAAATAAACACACCAAACCATCCATTACATAAGGATAAATTACAATTCCC
ATATTTAAATTCAATTATTAAAGAAACTTTAAGAATTTCACCACGTAAATATTTATAATA 2040
CATTATTATAATTTATTATACATTTTGAAATAATTACTGAAAATTTTTATTTTTTTAAAA
AATCCATATATATATAGCAGCACCATTTGCATTACCACATATATGTACTGATGATATTGT
TATTGATGATATATTTATTCCAAAAAATACTCAAGTAATTCCAAACATATATGGATGTAA
TAGAAGTAATATTGAAAGCAGTGAATCTAACGTCTTCAATCCTGATCACTTTTTATCAAA
AGATGAATTAAATATTGGTCAATGCGCTTTTTCATTTGGTTCACGTCAATGTCCTGGTGC
CAATGTCGCTGATTCAATAATGTTTTTGGTATCAACTAAATTATATAAAACATTTAAATT
TGAGAGAACCACTACTCAATTAAATGATGAAAATGGTCATTTTACAAGATCATTATCCCC
ATTTGAATTCAAATCAAAATTAATTATTAGAAAATAATTTACTAATATTTTATTTAAAAA
AAAAAAAAAAAAAAAAAAAAATTGCTAAAAATACAATTAAAATAAAAGGGTTTTAAATTA
TCTATATTTGCCCCCAGATAGATATTTTATATTTTTTTAAATTATCTATTAAAGTTATAT
CTAAGATAAAATGATGTGTCGGATATCCAAAAAAAAAAAATAATAATAATAATAATAATA
ATCCGAAATAAATCGGGTTCACACAGATACTCTATTTCATTATTTGGATGAGGTTATTTG
AAATTTAAAATTAAGCACGCTATCAGTAATGTAATTGATCCTATTGGAGTTGTTGAAATG
GCAATAATTTCAATGTAGAGTAGAATTGAATGGAAACATAACAATAACTGGTTCACCTGT
ATTTGGACTATTTTCTTCAGTATTAACAATGACATATGTGTCTACTGAAAGACTTGGTAC
ATCATTACATGTTTCTTCACCGGTGATAAAAGATGGAGCAGTGTAGGTTTTTAGAATTAC
TGATGTTTGATTCAACCTCATGACCCTAACCAGTTGGAAGTGCTGTAATCAAATAGTTGC
AGGATTAAAGAATGAATATGAATCGGTTGTATCTGTAAGGGTGGGGACGAAAGGAGTCTT
ACTATCCGCATAGTAAATGAAAACTGGAATCTTTGCGAGTGGCATATCTTTTTGCTGTAC
CAAATTAACCACTGATTGTAATTGTACCGCTATTAATGATAATAACTTTATTGTTTAATT
GAATTACCTGGTGGAATTGAACCCATTTCCAAAAGCTATATTGTAAAACACCATGGTGTT
TGAAGGAATATCAGCTGATGTAATATTTAAGAGAAGTCTAGTTACATTTGGTAATTATTT
TTCATCATTGATATTATATCATCATTTTATTTAAACAAAAGTGTTTATTTTTAATTTTTA
AATTGATTT



>CYP515A1 dd_02444 chr2 genome assembly; one exon differs, this is probably correct
MILGIILGLFIYIYLINIKFFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKY
PHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKY
QGIIFGNGSHWRKLKDILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNI DMGPI
FKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLFEYLAKQPADFIPILKPFNNY
KEIEKEYNNCLNFFQPLIDNILKNISDDDDDG NEPKCFLEYFISEIRKDTSNLIKITDL P
YICFDIIVAGIV TTSTTMDWMLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSII
KETLRISPPAPFALPHICTDDIVIDDIFIPKNTQVIPNIYGCNRSNIESSESNVFNPDHF
LSKDELNIGQCAFSFGSRQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGHFTRS
LSPFEFKSKLIIRK
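
Note: the exon-by-exon listings in this file (for example the CYP515A1 entry) mark exon ends with a digit in parentheses. The Python sketch below assumes those digits are intron phases and that a trailing "?" marks an uncertain call. It joins the exon lines into one peptide and records where each intron falls. The function name parse_exon_listing is hypothetical. Lines carrying free-text comments or nucleotide coordinates would need to be removed before parsing.

```python
import re

# Matches "(0)", "(1)", "(2)" and uncertain calls such as "(0?)".
PHASE = re.compile(r"\((\d)\??\)")
NON_RESIDUE = re.compile(r"[^A-Za-z]")


def parse_exon_listing(lines):
    """Join exon lines into a peptide and collect intron positions.

    Returns (peptide, introns), where introns is a list of
    (number_of_residues_before_the_intron, phase) tuples.
    """
    pieces = []
    introns = []
    total = 0
    for line in lines:
        start = 0
        for marker in PHASE.finditer(line):
            # Residues before this marker; drop spaces and the '*' stop symbol.
            chunk = NON_RESIDUE.sub("", line[start:marker.start()])
            pieces.append(chunk)
            total += len(chunk)
            introns.append((total, int(marker.group(1))))
            start = marker.end()
        tail = NON_RESIDUE.sub("", line[start:])
        pieces.append(tail)
        total += len(tail)
    return "".join(pieces), introns


# First two exons of CYP515A1 as listed in this file.
listing = [
    "MILGIILGLFIYIYLINIK (0)",
    "FFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSLK",
    "MGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGII (2)",
]
peptide, introns = parse_exon_listing(listing)
print(len(peptide), introns)
```

The intron positions recovered this way (after residues 19 and 124) agree with the Query coordinates at which the Contig_1639 alignments break.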

>c-JC2e13d10.s1_1 3245 letters
                              FRIIYIYRFNKYKSIINII*YNINKIFK*KIILIKFLI*KVF**STTFIIISWRK*IKM*
                              ISKWSINITYYW*PLQIKFKISSLIIQTIIR*IRKSIFIENGFN*YNCN**YQFFTKIIS
                              *QSTFFSQRFHLPSFYYMGKYQGIM*VYNFKNKYLVLKKIIN*LNIYKLDLVMVLIGEN*
                              KIF*VVQLLNQNLVKWKNYFIMNILKLKNIY*KK*IKIIIL*VFIIFEKNK*INK*IYIY
                              IIDY*FNFSQKKKKKGYGSNF*KNFIKYFI*ISIWCII*I***FIIKRILFIYPKL**II
                              *IFSKTTSRFYPNFKTI**L*RN*KRV**LFKFLSTIN**YFKKY*X
                              >c-JC2e13d10.s1_2 3245 letters
                              LGLFIYIDLINIKVSLI*YNIILIKFLNKK*Y**NF*FKKFFNRALPSSLLVGENKLKCK
                              FPSGPLILPIIGSLYKLSLKYPHLSFKQLSDKYGKVFSLKMGSIDTIVINDINFLQKSFR
                              DNPPFFHKDFIYHLFITWENIKVLCKYITLKINI*Y*KKLLIN*IYIN*IW*WFSLEKIK
                              RYFK*FNY*IKISSNGRIIL**IF*S*RIFIKKNKSR**YCKFLLFLKKINK*INKYIYI
                              *LIISLIFHKKKKKKDMGPIFKRILLNILYRFLFGVSFEYDDNLLSKEFYSFIQSYNKLF
                              EYLAKQPADFIPILKPFNNYKEIEKEYNNCLNFFQPLIDNILKNISX
                              >c-JC2e13d10.s1_3 3245 letters
                              *DYLYI*I**I*KYH*YNII*Y**NF*IKNNINKIFNLKSFLIEHYLHHY*LEKIN*NVN
                              FQVVH*YYLLLVAFTN*V*NILTYHSNNYQINTEKYFH*KWVQLIQL*LMISIFYKNHFV
                              TIHLFFTKISFTIFLLHGKISRYYVSI*L*K*IFSIKKNY*LIKYI*IRFGNGSHWRKLK
                              DILSSSITKSKSRQMEELFYNEYFKAEEYLLKKINQDNNIVSFYYF*KK*INK*INIYIY
                              N*LLV*FFTKKKKKRIWVQFLKEFY*IFYIDFYLVYHLNMMIIYYQKNFIHLSKAIINYL
                              NI*QNNQQILSQF*NHLIIIKKLKKSIIIV*ISFNH*LIIF*KILV

Query:    11 IYIYLINIKFFNRAVPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSD 70
             IYIY  N K+  + +PSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSD
Sbjct:     4 IYIYRFN-KY--KTLPSSLLVGENKLKCKFPSGPLILPIIGSLYKLSLKYPHLSFKQLSD 60

Query:    71 KYGKVFSLKMGSIDTIVINDINFLQKSFRDNPTFFSQRFHLPSFYYMGKYQGII-FGNGS 129
             KYGKVFSLKMGSIDTIVINDINFLQKSFRDNP FF + F    F      + +  FGNGS
Sbjct:    61 KYGKVFSLKMGSIDTIVINDINFLQKSFRDNPPFFHKDFIYHLFITWENIKVLCKFGNGS 120


CYP515A2P Seq 79, 38% to 508A1. Appears to differ from seq 79b
at both the DNA and protein sequence levels; missing some parts, exons 4,5,6,7
DMGPIFKRILLNILYRFLFGVSFEYDDNLLSK*FYSFIQSYNKLFEYLAKQ 45989
PADFIPILKPFNNYKEIEKEYNNCLNFFQPLIDNILKNIIDDDDDGI atgt (2) 45851
agn EPKCFLEYFISEIRKDTSNLIKITDL 45690

(1)VTTSTTMDWLLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSFIKETLRISPP(1)
APFALPHICTDDIVIDDIFIPKNTQVIPNLYGCNRSNIESS
EPNIFNPDRFLSKDESNIGQCAFSSGLRQCPGANVADSIMFLVSTKLYKTFKFERTTTQL
NDENGYLTKTLCPLEFKSKLIIRK*
JAX4a108h12.r1
JC1a254g08.r1
JC2a190g04.r1
JC2b80a12.s1
JC2b234d12.s2
c-JC2c22g11.s1 12873 letters
JC3a177b07.3764 183690 letters

>CYP515A2 dd_00373 chr2 genome assembly
MDWLLLYLTNYPNIQEKLFFEINTPNHPLHKDKLQFPYLNSFIKETLRISPPAPFALPHI
CTDDIVIDDIFIPKNTQVIPNLYGCNRSNIESSEPNIFNPDRFLSKDESNIGQCAFSSGL
RQCPGANVADSIMFLVSTKLYKTFKFERTTTQLNDENGYLTKTLCPLEFKSKLIIRK


CYP515B1 Seq. 6 34% to seq 5
MATLLLVIIFMITFIIYKN (0)
FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG
KVFSMKFGSYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFT
QTEYWKKIRGILNISLTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDT (0)
MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVVLSRMPS
DYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQD
NPKCFIDYLILQIRSDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEIC
KNTNTNTTATTTENDNESYFPKLIEKNNYPLFNACV
KETLRRSPPVPLGLPHLCSED TEIGGYLIPKGTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTV
TFSTGPRVCPGKNLSEDELFSFGTKLFK
TFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS* 
AU037975
AU071546
C91073
C92981
Contig170 chr 2
JC2a64f12.r1
SSB769
SSF410
SSH115
SSJ454 
AFI690
AFK630 
SFE728 
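
The exon-by-exon peptide listings in these entries can be joined into one protein programmatically. The sketch below assumes that a parenthesized digit such as (0), (1) or (2) marks an intron and gives its phase; that reading is an interpretation of the notation, not something stated in these notes. The function names are hypothetical. It is meant for the clean entries only (uppercase peptide lines with phase marks), not for lines that also carry nucleotide fragments or coordinates.

```python
import re

# Assumption: "(0)", "(1)", "(2)" are intron phase marks between exons.
PHASE_MARK = re.compile(r"\((\d)\)")


def clean(segment):
    # Keep uppercase residue letters only; this drops spaces, digits,
    # lowercase text and '*' stop marks (including internal stops).
    return re.sub(r"[^A-Z]", "", segment)


def join_exons(lines):
    """Return (protein, introns) from exon-annotated peptide lines.

    introns is a list of (residues_before_intron, phase) tuples.
    """
    text = " ".join(lines)
    pieces = []
    introns = []
    pos = 0
    for mark in PHASE_MARK.finditer(text):
        pieces.append(clean(text[pos:mark.start()]))
        introns.append((sum(len(p) for p in pieces), int(mark.group(1))))
        pos = mark.end()
    pieces.append(clean(text[pos:]))
    return "".join(pieces), introns


if __name__ == "__main__":
    entry = [
        "MATLLLVIIFMITFIIYKN (0)",
        "FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYG",
    ]
    protein, introns = join_exons(entry)
    print(len(protein), introns)
```

Because internal '*' marks are dropped, pseudogene entries with in-frame stops would need a different cleaning rule.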

>IIAFP1D83520 45769 letters
        Length = 45,769

  Plus Strand HSPs:

 Score = 1725 (612.3 bits), Expect = 2.5e-257, Sum P(3) = 2.5e-257
 Identities = 334/342 (97%), Positives = 335/342 (97%), Frame = +1

Query:   152 FIKKQLKSKSDTMFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV 211
             FI   +K K   MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV
Sbjct:  1243 FIFLNIKKK---MFIRPYLKRLSFNIIYSYLFSETIPYEDELIPSDILDFIHASEELLVV 1413

Query:   212 LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKCFIDYLILQIR 271
             LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKCFIDYLILQIR
Sbjct:  1414 LSRMPSDYIKVLRPFESHKKLKEICDRMSKFIKPRVDKKVELLDQDNPKCFIDYLILQIR 1593

Query:   272 SDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEICKNTNTNT 331
             SDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEICKNTNTNT
Sbjct:  1594 SDPSQTIKIDAIQYIVLDLLVGGTDSTSSALEWMILFLSNHESIQEKLYNEICKNTNTNT 1773

Query:   332 TATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSEDTEIGGYLIPK 391
             TATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSEDTEIGGYLIPK
Sbjct:  1774 TATTTENDNESYFPKLIEKNNYPLFNACVKETLRRSPPVPLGLPHLCSEDTEIGGYLIPK 1953

Query:   392 GTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGKNLSEDELFSF 451
             GTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGKNLSEDELFSF
Sbjct:  1954 GTQIISNIYSASNCDKVFTDPLEFNPLRFIENSPPQTVTFSTGPRVCPGKNLSEDELFSF 2133

Query:   452 GTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 493
             GTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS
Sbjct:  2134 GTKLFKTFKFSRPNKNQLYDEVAILGITLEPKPFITKVSLRS 2259

 Score = 756 (271.2 bits), Expect = 2.5e-257, Sum P(3) = 2.5e-257
 Identities = 145/155 (93%), Positives = 148/155 (95%), Frame = +1

Query:    10 FMITFIIYKNFDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFG 69
             F   +  +  FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFG
Sbjct:   697 FFFFYFFF**FDSKRKNKYPPGPINLPFIGGLYKLKPGKQHLSLNELYQKYGKVFSMKFG 876

Query:    70 SYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNIS 129
             SYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNIS
Sbjct:   877 SYDTVILNEPDVIVEAFHLNSTSFMDRILLPSFEVVGKNQNIGFTQTEYWKKIRGILNIS 1056

Query:   130 LTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDTM 164
             LTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDT+
Sbjct:  1057 LTKSKTRLLEGLFNQEYFRFDQFIKKQLKSKSDTV 1161

 Score = 90 (36.7 bits), Expect = 2.5e-257, Sum P(3) = 2.5e-257
 Identities = 19/19 (100%), Positives = 19/19 (100%), Frame = +3

Query:     1 MATLLLVIIFMITFIIYKN 19
             MATLLLVIIFMITFIIYKN
Sbjct:   432 MATLLLVIIFMITFIIYKN 488
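
When a gene is split across several HSPs, as above, a rough overall identity can be obtained by pooling the "Identities = a/b" lines of the report. This is a minimal sketch, assuming the report is available as plain text; the function name is hypothetical. HSPs that overlap on the query (as the first two do here) are counted twice, so the pooled figure is only approximate.

```python
import re

IDENT = re.compile(r"Identities = (\d+)/(\d+)")


def pooled_identity(report):
    """Sum matches and aligned lengths over all HSPs in a BLAST text report.

    Returns (matched, aligned, percent).
    """
    pairs = [(int(a), int(b)) for a, b in IDENT.findall(report)]
    matched = sum(a for a, _ in pairs)
    aligned = sum(b for _, b in pairs)
    if aligned == 0:
        raise ValueError("no 'Identities = a/b' lines found")
    return matched, aligned, 100.0 * matched / aligned


if __name__ == "__main__":
    report = """
     Identities = 334/342 (97%), Positives = 335/342 (97%), Frame = +1
     Identities = 145/155 (93%), Positives = 148/155 (95%), Frame = +1
     Identities = 19/19 (100%), Positives = 19/19 (100%), Frame = +3
    """
    print(pooled_identity(report))
```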

CYP516A1 seq 9, 43 complete seq 39% to 10, 42; 30% to 508A1
MIILLLSIIIFILYIVKI (0)
FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVV
ISDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDHKGITEKI
SLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHPIIDA
LKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLINPEK (2)
IDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLL
FLINNPNFQDKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIAS
EDVTCGPYTIE KGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGS
RVCVGSSLARDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI*
C90082
Contig14304 Chr 6
IIAAP1D3115
IIAAP1D3581
IIAAP1E3107 N-term
LPPLPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIK
IIACP2D2392 95% to IIAAP1E3107
LPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFAHIK
IIADP1D2090
IIAFP1D1377
JAX4a92g02.s1
JAX4b47f08.r1 91% to IIAAP1E3107
PFIGNLHQLSIDPHLSIQKLMFKYGNVMTVYFPNIK
JC1b185h09.r1 
JC1c288g05.s1
sdic6A3g4.p1c
sdic6A46b12.p1c
sdic6A90a2.q1t
sdic6Ma11.p1ca
sdic6Td4.p1c
SSG513

>Contig_3767, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 15,392

  Minus Strand HSPs:

 Score = 1263 (449.7 bits), Expect = 4.7e-253, Sum P(3) = 4.7e-253
 Identities = 243/243 (100%), Positives = 243/243 (100%), Frame = -1

Query:    19 FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVVI 78
             FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVVI
Sbjct: 15077 FKNKSCCGNEIILPPGPISLPFIGNLHQLAIDPHLAIQKLMFKYGNVMTVYFANIKTVVI 14898

Query:    79 SDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDH 138
             SDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDH
Sbjct: 14897 SDPNYLKEVFVNQSHKTSDRYLMGTSRIIGNEKDILFSNGQYWKNYRQILAQSFLKLRDH 14718

Query:   139 KGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHP 198
             KGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHP
Sbjct: 14717 KGITEKISLESVKLSQAFEWYATSGQVVNPCSLFKMYTLNVIMQLLYSHRSSYDLKGQHP 14538

Query:   199 IIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLIN 258
             IIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLIN
Sbjct: 14537 IIDALKMVEESLAVGNVLDLFPILNIFLKKNKLKLVNNLQQVWSYSQDSIKEHREKLLIN 14358

Query:   259 PEK 261
             PEK
Sbjct: 14357 PEK 14349

 Score = 1178 (419.7 bits), Expect = 4.7e-253, Sum P(3) = 4.7e-253
 Identities = 226/227 (99%), Positives = 227/227 (100%), Frame = -1

Query:   261 KIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLLFLINNPNFQ 320
             +IDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLLFLINNPNFQ
Sbjct: 14252 RIDDLLDLFINEIKLSKNSEFFDDEGLYRVCSDLLLSGTETSSSTMSWLLLFLINNPNFQ 14073

Query:   321 DKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIASEDVTCGPYT 380
             DKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIASEDVTCGPYT
Sbjct: 14072 DKVRTELLEATGGKKTIGLTEKSKTPFFNACIKEALRIRPVGALSLPRIASEDVTCGPYT 13893

Query:   381 IEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGSRVCVGSSLA 440
             IEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGSRVCVGSSLA
Sbjct: 13892 IEKGSQIIMNVYGLAMDPTVWEDPETFNPYRWLSSDISQSTYSFIPFGCGSRVCVGSSLA 13713

Query:   441 RDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI 487
             RDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI
Sbjct: 13712 RDEIFLGIGNILLNYIFESQNGKPINEKGHFGIALQTVDYNVKLTKI 13572

 Score = 79 (32.9 bits), Expect = 4.7e-253, Sum P(3) = 4.7e-253
 Identities = 18/18 (100%), Positives = 18/18 (100%), Frame = -3

Query:     1 MIILLLSIIIFILYIVKI 18
             MIILLLSIIIFILYIVKI
Sbjct: 15219 MIILLLSIIIFILYIVKI 15166

>JC3f71h11.s1 Clone JC3f71h11, standard read, bases 110 through 740, from
            2002-11-15
        Length = 629

  Plus Strand HSPs:

Query:     1 MIILLLSIIIFILYIFK 17
             MIILLLSIIIFILYI K
Sbjct:    62 MIILLLSIIIFILYIVK 112

Query:    16 FKNKSCCGNEIILPPGPISLPFIGNLHQLAI 46
             FKNKSCCGNEIILPPGPISLPFIGNLHQLAI
Sbjct:   204 FKNKSCCGNEIILPPGPISLPFIGNLHQLAI 296

CYP516A2P Seq 64 70% to seq 9 probable pseudogene fragment
KMAREDGTWGPXXXRGGQIKKERVRGLGRGPTVWGGPETFLPXXXXXXXXXXXXX
SVIPCGWGSRGWGGSSLAREEILVGMGNV*LNYIWESQKGKPIKEKGHLGSALQTGDYNVKVTKI
IIAAP1E3541

CYP516B1 seq 10, 42 complete seq 39% to 9, 43
MYLILSLIIFLAYVA (0)
FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSF
SDKYGGLTTIFLGSVPTVLISEPNILREIIIKNNDSIIDRYISDSGLIIG
GERNLLFSKGSFWIKYRKIFSSAMTNARKFNIASRIEQQA
ISLNNYFGTYANSKQA (0)
INPHDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLKPF 868
YTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLDLILMEIEKSEE 1021
KQFYDDDSLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGN 1192
LPALVHRKDTSYLNACIQETMRIRTAAPLALPRIASEDIKVGG
YTIPKGTQVMMSVYGMASDERYWKDPHIFNPERWLSSNHSTENGGGGGGVVGNSSQSEV
FIPFGVGPRMCVGMGVAKDELYYCASQMFMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK*
C25607
Contig11546 Chr 2
JAX4a161d08.s1 N-term
c-JAX4a161d08.s1
JAX4a161d08.r1
JAX4b34h10.r1 90%
JAX4b38d02.s1 90%
JAX4d06b12.s1
JC1c262b03.s1
JC2b265h11.s1
JC2b383g02.s1
JC2b388b09.r1 
JC2d19a07.s1
JC2c86c05.r1
JC2c92b06.r1 poor quality
SSA260

>Contig_3853, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 8577

  Plus Strand HSPs:

 Score = 1759 (624.3 bits), Expect = 5.2e-260, Sum P(3) = 5.2e-260
 Identities = 334/337 (99%), Positives = 335/337 (99%), Frame = +2

Query:   156 QAIN-HDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLK 214
             Q IN HDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLK
Sbjct:   683 Q*INPHDYIRRYSLNGVIDYSFSDSVEYESDTHHIVIRAAEIMEEILATGNPHDYLPFLK 862

Query:   215 PFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLDLILMEIEKSEEKQFYDDD 274
             PFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLL+LILMEIEKSEEKQFYDDD
Sbjct:   863 PFYTKKRNTLAMAVGQVWDYCNDAITVHRKTLDHEKPRDLLNLILMEIEKSEEKQFYDDD 1042

Query:   275 SLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALVHRKDT 334
             SLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALVHRKDT
Sbjct:  1043 SLSKCLTDLIVAGHETVAITLGWMILFLSNHQDVQQKVYDELINVVGKGNLPALVHRKDT 1222

Query:   335 SYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERYWKDPH 394
             SYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERYWKDPH
Sbjct:  1223 SYLNACIQETMRIRTAAPLALPRIASEDIKVGGYTIPKGTQVMMSVYGMASDERYWKDPH 1402

Query:   395 IFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYYCASQM 454
             IFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYYCASQM
Sbjct:  1403 IFNPERWLSSNHSTENGGGGGGVVGNSSQSEVFIPFGVGPRMCVGMGVAKDELYYCASQM 1582

Query:   455 FMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK 491
             FMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK
Sbjct:  1583 FMNFKWSPVNDKPIDDEGVARIALEYKEYQVVLERRK 1693

 Score = 736 (264.1 bits), Expect = 5.2e-260, Sum P(3) = 5.2e-260
 Identities = 142/144 (98%), Positives = 144/144 (100%), Frame = +1

Query:    16 FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPTVLISEPN 75
             FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPTVLISEPN
Sbjct:   172 FHKKRTNGMPPGPFPLPIIGNLHQLGKSPYKSLKSFSDKYGGLTTIFLGSVPTVLISEPN 351

Query:    76 ILREIIIKNNDSIIDRYISDSGLIIGGERNLLFSKGSFWIKYRKIFSSAMTNARKFNIAS 135
             ILREIIIKNNDSIIDRYISDSGLIIGGERNLLFSKGSFWIKYRKIFSSAMTNARKFNIAS
Sbjct:   352 ILREIIIKNNDSIIDRYISDSGLIIGGERNLLFSKGSFWIKYRKIFSSAMTNARKFNIAS 531

Query:   136 RIEQQAISLNNYFGTYANSKQAIN 159
             RIEQQAISLNNYFGTYANSKQA++
Sbjct:   532 RIEQQAISLNNYFGTYANSKQAVS 603

 Score = 69 (29.3 bits), Expect = 5.2e-260, Sum P(3) = 5.2e-260
 Identities = 15/15 (100%), Positives = 15/15 (100%), Frame = +1

Query:     1 MYLILSLIIFLAYVA 15
             MYLILSLIIFLAYVA
Sbjct:    13 MYLILSLIIFLAYVA 57

CYP516B2P seq 48 72% to 10 no ESTs
MFLILSLIIFWAFVA bad AG boundary
FHKKRPKGLPPGPFPFPILGNLHQWGRSPFKSLKSFSAKFGGWPPIFWGG
VPPVLIGDPNFLREIFIKN
sdic6A62f2.p1c N-term
sdic6A62f3.p1c
FHKKGTQGFPPGPFPFPI

CYP517A1 Seq. 3 complete 37% to 508 483 aa
MEIINVFLFLIILFLVKDF (0)
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELF
GGYSKKYNGVVRAWFGE RLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGLGVM
SSSDDKWKRAKSSVSQSLRVRTTKKLMEEKAIEF
IDSLEKISNNNEI (0)
FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAF
DCFEIFSPLYDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEI
TKEDTMQINQICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNL
NHKQNAPYIVAFIKETMRLCSNG FGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEI
FKNAKEFNPTRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGE
KIDDTIHFSVSLKAKDYGIKLEKRI*
AU071503
AU071705
C84055
Contig14028 chr 2
IIAFP2D15468
JAX4a13e01.r1  92% to seq 3
VFKYMFNQDLSVESGMSRTIGNAAEHVFGNLSKLTAFDWFEIFSPLYDWLFTRRLKGCDIARQIIS
JAX4a15h01.s1
JAX4a27h12.s1
JAX4a82d03.r1
JAX4a82d03.s1
JAX4a150b03.s1
JAX4a207g03.s1
JAX4b33d01.r2 (formerly seq 61) actual translation
KLFLFLSNYXVXXYFQKXKNFLYKPSLLXPXWXYPSXNGLXVXTSSXDQW
KKPKSXVSQSLKLHTSKKLMEKKXIEFIDSLXKISNNNEI (intron)
Only 12 aa differences in 90 relative to seq 36, plus 14 Xs.
16 differences relative to seq 3, plus 14 Xs, but QS matches where ** is in seq 36,
and there is no 1 aa deletion as in seq 36, so this is more like seq 3.
This is probably a poor-quality sequence version of seq 3.
JAX4d06c02.r1
JAX4d06c03.s1
c-JAX4d09b12.s1
JC1a221g07.s1
JC2a129a01.s1
JC2a188c06.r1
JC2b115g01.r1
JC2b115g01.s1
JC2e14a08.s1
sdic6A83g9.p1c CHR 6
sdic6A76c7.q1t MEIINVFLFLIILFFGKRF N-term
SSB673 
SSC317
SSC561
SFF882
CFF730
CFI212
SFA813
Exact match to contig 1803

>Contig_1803, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 8010

  Plus Strand HSPs:

 Score = 1669 (592.6 bits), Expect = 2.7e-247, Sum P(3) = 2.7e-247
 Identities = 316/316 (100%), Positives = 316/316 (100%), Frame = +2

Query:   168 FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 227
             FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL
Sbjct:  4304 FYPKGHIQGYACSMLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPL 4483

Query:   228 YDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQIN 287
             YDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQIN
Sbjct:  4484 YDWFFTRRLKGCDIVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQIN 4663

Query:   288 QICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYI 347
             QICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYI
Sbjct:  4664 QICFDIFGPAVGTVTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYI 4843

Query:   348 VAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNP 407
             VAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNP
Sbjct:  4844 VAFIKETMRLCSNGFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNP 5023

Query:   408 TRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGEKIDDTIHFS 467
             TRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGEKIDDTIHFS
Sbjct:  5024 TRYLDESLPVPNIHFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKSQNGEKIDDTIHFS 5203

Query:   468 VSLKAKDYGIKLEKRI 483
             VSLKAKDYGIKLEKRI
Sbjct:  5204 VSLKAKDYGIKLEKRI 5251

 Score = 563 (203.2 bits), Expect = 2.7e-247, Sum P(3) = 2.7e-247
 Identities = 109/126 (86%), Positives = 114/126 (90%), Frame = +3

Query:     3 IINVFLFLIILFLVKDF-----VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEH 57
             I+  +  L+IL++ K       VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEH
Sbjct:  3723 ILENYYNLLILYIKKKKNKKKKVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEH 3902

Query:    58 ELFGGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGL 117
             ELFGGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGL
Sbjct:  3903 ELFGGYSKKYNGVVRAWFGERLFFFVSNYDVVKYFQKDENFHNRPSVLVPGWRYASSNGL 4082

Query:   118 GVMSSS 123
             GVMSSS
Sbjct:  4083 GVMSSS 4100

 Score = 210 (79.0 bits), Expect = 2.7e-247, Sum P(3) = 2.7e-247
 Identities = 43/43 (100%), Positives = 43/43 (100%), Frame = +2

Query:   125 DKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEI 167
             DKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEI
Sbjct:  4103 DKWKRAKSSVSQSLRVRTTKKLMEEKAIEFIDSLEKISNNNEI 4231

 Score = 95 (38.5 bits), Expect = 6.8e-198, Sum P(3) = 6.8e-198
 Identities = 20/20 (100%), Positives = 20/20 (100%), Frame = +1

Query:     1 MEIINVFLFLIILFLVKDFV 20
             MEIINVFLFLIILFLVKDFV
Sbjct:  3646 MEIINVFLFLIILFLVKDFV 3705

>Contig_1803, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion
AAAAAAAAAATATAGCTTTAAATTTAAAATTTTATATTAAGAAAAATGGAAATAATTAAT 3660
GTTTTTTTGTTTCTAATCATTCTTTTCTTGGTAAAAGATTTTGTATGTATTTAAATTTTT 3720
ATATTTTAGAAAATTATTACAATTTATTAATTCTTTACATAAAAAAAAAAAAAAATAAAA 3780
AAAAAAAGGTTAAAAAAAATAAGAAAATTCATACAAAATCTCCAAGTGGACCAATTGCAT
TTCCTATTCTTGGAAATGTTGTTCAAATTAGATTTTGGGAATTATTCAAAATTCAAGAAC
ATGAATTATTTGGTGGTTATAGTAAAAAATATAATGGTGTTGTAAGAGCATGGTTTGGAG
AAAGATTATTTTTTTTTGTTTCAAATTATGATGTTGTAAAGTATTTTCAAAAAGATGAAA 4020
ACTTCCATAACAGACCATCAGTTTTAGTTCCAGGTTGGAGATATGCTTCAAGTAATGGGC
TAGGTGTAATGAGCTCATCTATGACAAATGGAAAAGAGCAAAATCAAGTGTTTCGCAATC 4140
ATTAAGAGTTCGTACAACTAAAAAATTAATGGAAGAAAAAGCAATTGAATTTATTGATTC
ACTTGAAAAAATTTCAAATAATAATGAAATTGTAAGTTTTTAAAAATAATAATTACAATC 4260
AAAATATTGATAACATATTAATTTATTAAATTTTTATTTTTAGTTTTATCCAAAAGGACA 4320
TATTCAAGGATATGCTTGTTCAATGTTATTCAAATATATGTTTAATCAAGATTTATCAGT
TGAAAGTGGCATGTCAAGAACTATTGGTAATGCAGTTGAACATGTTTTTGGTAATCTTTC
AAAATTAACTGCATTTGATTGTTTTGAAATTTTCTCACCACTTTATGATTGGTTCTTTAC
AAGAAGATTAAAAGGTTGCGATATCGTTAGACAAATAATCAGTAGTCAAAATGAAAATCA
TTTAAAGTCAATTGATCCAAGTAAACCAAGAGATTTAATGGATGATTTGTTAATTGAATA
TGGATTAAATGAAATCACTAAAGAAGATACAATGCAAATCAATCAAATTTGTTTTGATAT
TTTTGGACCAGCCGTTGGTACAGTTACAATCACAATGAATTGGGTAATTTTACAATTATG
TAATCGTCCAGAACTTCAAGAGATTGCATATCAAGAAATTAAAAAAGCTGTCAAAGATGA
TGAATATGTCAATTTGAATCATAAACAAAATGCCCCTTATATCGTTGCCTTTATTAAAGA
AACAATGAGACTTTGTTCAAATGGATTTGGTTTACCAAGAACTGCTAAAAATGATCAAAT
TTGTGGTGATTTTTTCATTCCAAAAGATGCTATTATTTTTATTAATTATTTAGAAATTAG
TCAAAATGAAGAAATCTTTAAAAATGCCAAAGAATTTAACCCAACTCGTTATTTAGATGA
ATCACTTCCTGTACCAAATATTCACTTTGGTGTTGGTCAAAGAGCATGTCCTGGTCGTTT
TGTCGCAATCGATAAAATGTTTCTTGGAATTTCAAATTTACTTTTAAAGTATAAATTAAA
ATCTCAAAATGGTGAAAAAATTGATGATACAATTCATTTTAGTGTTTCTTTAAAAGCAAA
AGATTATGGAATAAAATTAGAAAAGAGAATTTAGAACAATAAAAATAATTGTATTTAATT
CAAATTATTATCCCAATTCATTGATTTAAATAATTATTATTAATAAAGACTTTTTTTAAA
AAAAAAATTATTTTTTTATTTATTTATTTATTTATTTATTTATTTATTTATTTATCTATT
TATTTATTTTTTAACCCTATTTTTTTTTTTTTTTTTTATATTTTCATTTTTGCTAACAAA
TGTGTAATCTATTAAAAAGAGATAATTTGAAATTATTATTTTTAATATTTATTATAGAAA
TATATAATAATAAAAATAATAATAAAAATTTCGCACAATCATTGTTATTATTATTTCTAT
AATAAATAGGGAATTTAATTCAAATAGCCTTTGTGGATAACTAAATATGTTATGTTGAAC
TTCATTAATTTATAAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAATAAAAACAAATTAA
GAATTTCCAACTTTATTAATAAAAAATATAATATCGGTAAAGTTGAAGGGGTATTTAAAA
AATTTCAACATTGCTCGACCCCTCTTCAACTTAATCGGTATTTAATTTTTTTTTTTTTAC
TCTATGTTGTTAGTGGTAAAATTAAAATAAAAATAAAAAAATAAAAATAAAAATAAAAAT
AAAAATAAAAATAAAAATAAAAACAATGGTGTTGTTTAAAGATTTAAGTTTTTTTTTGTT
TTTTTTTTTTTTTTTTAATACATATTTTTTAAAATTCTAATACAAATAAAATAAATGTCA
AATACTTCAAGTGATAATTATAAAGTTTGGTTCATAACTGGAGTTACAGGTGGAGCAGGT
AAAGCATTAGCATCTAGATTATTAGAAATTGGAGGTTATAAAGTTGCAGGTACATCCAGA
AGTAAAGAGAAATTAAATTCTTTAGATGGTAGTTTAATATCAAATAAGGATTTCCTTAGA
TTACAAGTAAATTTAATTGATGAGAAAAATGTAAAGGAATCAATTGAAGAAACCATAAAG
AAATTCGGTAAAATTGATATTATTGTAAATAATGCCGGACAATCAATATTTGGTAGTGTT
GAAGAGATTACTGATAAAGAACAAAGGTTATTAATGGATGTATTGTATTTCGGTCCTTGT
AATGTAATTCGTTCAGTTTTACCTCATTTTAGAAAGAGAAAATCCGGTTTAATCTTCAAC
ATCTCTTCAGTTATTGGTGGTGATCCTACCAATAATGTACCCTTCTTTTCTGGATATTGT
GCTGGTAAATCTGCAATTCACTCAATGACTCAATGTCTCAGGGAAGAGGTTAAACAATTT
AATATTAATGTGTGTTGCATCCTTTTAGGTTATTTAAAAACCAGTTTCAGTAGTCCATTA
ACAACTAATCAAATTAAAGACTATAATCTTAAAGAGAAAGCCAATGATATGAATAAACAA
TTCCAAAATAATACACCAGAATCACCAATTAATTTTGCTAATTTTATCATTGAAAATTCA
AAACATTCAAATATTCCCCAAACAATCCAATTTGGTAAAAATCCATACCATAATCCAAAT
AATTCAAATAATCCAAATAATAATAATAAACCATCTTATCCTCATCAAAAAGTAAGAGTT
AATCAAGATCTTCCAAAGCAATTTGATAAAGATATTGCAAATTTACCAGAATTAACTGAT
GGTAATATTCATATGCCTTATATTTTACATGAATAAATAAAATCTTATTTCATAAAGATA
TAAATTAAATAAATTGGTCCCCACAATTTCTATTTTTTTTTTTTTAATGTTTTTAATTAT
ATTCCTTTTTTATCAATTTAATGTAAAGAAACCATCATTTATTCATTTCAATGTATTATT
AAATATTTATTTAAATAGTATTATTTATTTATTTATTATTATAAATTAATTATTCTAAAT
ATAGTAAATTAGTTATTATTATATAATTATATTTCTTAAAAATAGTAATATAAAATTAAT
ATAATATAATATTAATATAAAATTATTTATTTCCAAAGATTATTATATTCAATTAAAAAA
TTAATAATCTTATTAATTTTTTAGTTAATTCTATCAACTCATTTTTAATAGTATTAATTA
AAATTTTTATTTTTATTTTTATAATATAAACAAATAAAAGAATAAAAAGATAAAATTCAA
TATTAAATTATATAAATTTCAAAGATAATAATTATAAAGGAAGACCACTTCTGTTAAAAT
AAGTAATATAAGTTATTATTTCTTTACAGTTATTATTTTTTTTTTTTTTTTTTTTTTTTT
AAAAAATTAATATTTATATAAAATATATTCCCTACTCTTTTTTCTTCAAAATAAAAAAAT
AATTACAAAAAATTAACCATAATTAAAAATAAAAGAAGGGATTTGAAATGAATGTGTCAT
ATGACGATCGATTTCTTTGGTTTTTATTTTTTATTTTTTTTATTTTTTATTTTTAATTTT
CAAATTGTTTTGGTTAATATGGTGGTGTTTTGTGAAAATTATTATTTGAAAAAAAAAAAA
TAGTCGCAACACGAAGAAAAAAAATTAAAAAAAAATTAAAAAAAAAATTAAAATTTAAAA
TTAAAAAAAAAAAATTATTAAAAAAAATTAAAAAAAATTAAAAAAAAAAAAAAATTTTGA
AAATAATTTTTTTTTAATTGCACATAAAATTTTAACCCCCCCCCCCCCAGATCCAATAAT
TTTCTTGTTTTTTAAATTTTTAAAAACAAATAATAAACAAATAATTTTTTTAAAAAAAAA
AAAAATAAAAATTTTAAAAGAGCTACTTTT


CYP517A2 SEQ 74 66% to seq 3
MRILIIIILIIIVFLVKDT (0)
IKKNKKVNSKSPCGPFAFPILGNIIQYFFYQILNIKEHSIIERYSRKYDG
ITRIWFGDIFILYVSNYEIVKCFQKEENFFDRPSTFVPTWRYMSSNGGGI
MSSNDEKWKRAKTTFLKSLKIHGKKYLIEKKSIEFVNSIEKFSNSNQV (0)
FYPKQYSQGFTSSIFFKYMFNEDISIDNKFLKEIGTAVGMVFTKNSHLT
VFDCFGILSPFYDLFFKFRLRPIEILK
KTIDKQLTNHLNSMDSKMGDHQSRDIMDDLLIEYGSLNEISNQDRIQINQ
ICFDVMSTDIGTVATTIDWVLLQLCNRQDLQEIICNEIQDTIKIKRNNVI
NDGGGGADTDNLFINLCDKQSIPYLIAFIKETMRVFSNGWSLPKTSKHDQ
ICANYFIPKGSILFINYFSIHLNEEFFKNPREFNPARYLDDSIPIPDLHF
GIGQRGCPGRFVAMDQVFLCIANTLLKYKIKSIDGKKIDDTIQFSVYLKP
KDFGILLEKRNKLFVND*
Contig16783  Chr 6
IIAFP1D11203
IIAFP1D19488
IIAFP1D53751
IIAFP1D67548 (may be a pseudogene)
IIAFP1D76983  
sdic6A65c10.p1c 
Dict-IV-V896c06.plc
Contig_4323

CYP517A3P Seq 36 pseudogene 77% to seq 3 77% to seq 61 
MEIVNV (frameshift)
FIILIILFLVKDF (0)
VKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHEL (10 aa deletion)
IVRAWIGERLFLFVSNYDVKYFQKDENFLYKPSLLVPGWRYASSNGLGVMSSSDDEWKRAKSS
VS**LRVHTSKKLMEEKAIEFIDSLEKISNNNEI (0)
FYPKGHIQGYACSMLFKYMFNQDLSVESGLSRTIGNAVEHVFGNLSKLTAFDCFE
IFSPLYDWFFTRRLKGCDIVRQIISSQNENHFKSIDPSKPRDLMDDLLIEYGLNEITKEDTMKINQICFDIFGPAIG
TVTITMNWVILQLCNRPEPQEIAYLEIKKAVKDDEYVNFNHKQNAPYIVAFIKETMRLCS
NG

JC2e10h11.s1 matches pseudogene seq
c-JC2e10h11.r2 matches pseudogene seq
JC1a286e01.r1 matches pseudogene seq
JC2d93e02.s1 matches pseudogene seq
JC2e75c06.s1 matches pseudogene seq
JC2a172b05.r1 matches pseudogene seq
JAX4a222c08.r1 matches pseudogene seq
JC2b182g05.s1 matches pseudogene seq
SFG734 seq matches pseudogene except it has the insert GNYSKKYNGV that 517A3P is missing. 4 diffs with 517A1
SFF555 seq matches pseudogene except it has the insert GNYSKKYNGV that 517A3P is missing. 4 diffs with 517A1

CYP517A4 13 aa diffs with 517A1 ng5440 exact match to Contig_2215
MEIVNVLLFLIILFLVKDFVKKNKKIHTKSPSGPIAFPILGNVVQIRFWELFKIQEHELI
GNYSKKYNGVVRAWIGERLFLFVSNYDVVKYFQKDENFLYRPSLLVPGWRYASSNGLGVM
SSSDDEWKRAKSSVSQSLRVHTSKKLMEEKAIEFIDSLEKISNNNEIFYPKGHIQGYACS
MLFKYMFNQDLSVESGMSRTIGNAVEHVFGNLSKLTAFDCFEIFSPLYDWFFTRRLKGCD
IVRQIISSQNENHLKSIDPSKPRDLMDDLLIEYGLNEITKEDTMQINQICFDIFGPAVGT
VTITMNWVILQLCNRPELQEIAYQEIKKAVKDDEYVNLNHKQNAPYIVAFIKETMRLCSN
GFGLPRTAKNDQICGDFFIPKDAIIFINYLEISQNEEIFKNAKEFNPTRYLDESLPVPNK
HFGVGQRACPGRFVAIDKMFLGISNLLLKYKLKTQNGEKIDDSIQFSVSLKAKDYGIKLE
KRI

CYP518A1 Seq. 20 38% to CYP508 complete no short first exon
MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDTHL
VFQNDSKLYKDGDNNGKIIKYWFCDQLTLAIYDTNTMKEIYLKNPESLNTRVKSPSTNVIGNRFRGIVTADENYWQF
HRDILMKSFTGRKVKSLSSSIEKETIDLITYMKFIEKSGQS (0)
FSPRSNFMNFYSNIIFDYVFSRRIENIYEGVNEEQGKVLLAIRELFDYLADTLIVNYLIFTK
PFYFLYLKMFGHPADSLKKILTKYYLEHFESIDLNNARDVLDSLI
IEYRKVGGKEEQSSIIPMVNELILAGTETNSSTAEWFILTMVNNLDYQDKIYNELKSTLE
TTTAMIKLSNRNQTPLFNAALKEVLRLYPPVPFGVPRQVNQSFEINGGSLKIPKGTQIIQ
SLYSIFRDENYWDSPDQFKPERFLDQDSHSNNYFPYGIGVRNCIGMGFSQDELYISLSNL
VLNFKLLPLIENSKICDKPIFGFSFKPNEFKINLEKRNN*
FC-BG13
IIAFP1D6284
IIAFP1D35357
IIAFP1D75459 
JC2b232e02.r1
JC2c81c10.r1
JC2c81c10.s1 
VFB214
VFJ528
JC3a13b12.s1

>JC3a13b12.s1 183648 letters
        Length = 183,648

Query:     1 MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDT 48
             MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDT
Sbjct: 66633 MSILIILIISIIFYLIFDFLYKNFSNRQFKGPLALPLVGSLLHLKDDT 66490

CYP518A2P Seq 25, 27, 29, 30 62% to seq 20 pseudogene; two in-frame stops and a frameshift; missing I-helix
PFYFIYSKIFGHPSDPLKNIINKYY*EHCETIYIDNPR
DVLDSIISEYRKVNGKEECS 211
XXXXXXXXXILAAVENNSATMKWFTL
(24aa gap)
LINLSHRPLTPLFNASLKEVLRLYPPV (frame shift)
SIGVSRLVIEEFTIDNGKYTFLKGAQIIQSLYSIFRDEKYWSSPNEFIP
ERFIYQNN*SNNWFPYSIGVRNCVGMGFSQDEL
YLLLTNLVLNFHILPPFENTKIDGTPIFGFSFKPKLR*
Contig6572   Chr 2
IIAFP1D12433
JAX4a48g04.s1
JAX4a81g11.r1
JAX4a81g11.s1 missing I helix exon
JC1c02f04.r2 
JC2a18h05.s1
JC2a64g05.s1 
JC2b88e02.r1
JC2b112a04.r1 90% to seq 27
JC2b155a02.r1
JC2b212a08.r1 short frag at C-term
JC2b254e11.s1 
JC2b329d01.r1 5 prime part shown below
JC2c48h11.r1
c-JC2c48h11.s1
JC2c145g10.r1 
JC2c145g10.s1
JC2e17e02.r2
JC2e17e05.r1 91% to seq 27
JC2e21b09.s1 5 prime part shown below
JC2e71b08.s1

CYP518B1 complete Seq (26+28) 50% to seq 20 482 aa
MLTNIIILIILYLFYDF (0)
CYKNFKYRNYGSPWALPVI (1)
GHFIHVINQPHLVVHNDRMKYNNGRFVNYWFGDYL
SIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDF
HHGILSKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN(0)
FDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVG
LIKFLNYLILSYPFLSIYLRYFTYTTFNLKKIL
KQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIECDKSFVAIAIELLAAGT (1)
DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATLKEVLRLIP
ATPFSVPRMSNEGFEVDGIKIPKG (0)
TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLG
SNFSQHELYICLTNIVLNFKIKSIDGKPLNEIP (1)
NYGITFRPNIFEVKLENR*

Contig5329 Chr 2
Contig13654 Chr 2
IIADP2D5718
IIAGP1D2186
JAX4a94e04.r1
JAX4a94e04.s1
JAX4a215e06.r1
JC1b80a09.s1

>JPCRa04a07.p1 413590 letters
        Length = 413,591

  Minus Strand HSPs:

Query:      5 IIILIILYLF-----YDFCYKNFKYRNYGSPWALPVIGHF 39
              +I L ++Y F     Y  CYKNFKYRNYGSPWALPVIG F
Sbjct: 147997 VIPLKLIYKFFY*KNYIKCYKNFKYRNYGSPWALPVIGKF 147878

Query:      8 LIILYLFYDFCYKNFKYRNYGSPWALPVI-GHFIHVINQPHLVVHNDRMKYNNGRFVNYW 66
              L++ + F  F +  F +  Y   +    I GHFIHVINQPHLVVHNDRMKYNNGRFVNYW
Sbjct: 147891 LLVNFFFIFFFFFLFFFN*YL**YFFNFIKGHFIHVINQPHLVVHNDRMKYNNGRFVNYW 147712

Query:     67 FGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDFHHGIL 126
              FGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDFHHGIL
Sbjct: 147711 FGDYLSIAITDPILYKKIYLNFPKQINSRLKSPTVLNISERFRGIISSNENNWDFHHGIL 147532

Query:    127 SKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN 162
              SKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN
Sbjct: 147531 SKLFNGHKAKINNFLFEKETKFIIEYMKKISKSGEN 147424

Query:    161 ENFDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNY 220
              + FDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNY
Sbjct: 147324 KKFDTRSNFLYFYSNILFDYILGKRVENIYENELRKDRKKFMVSIQEVMDSVGLIKFLNY 147145

Query:    221 LILSYPFLSIYLRYFTYTTFNLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIE 280
              LILSYPFLSIYLRYFTYTTFNLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIE
Sbjct: 147144 LILSYPFLSIYLRYFTYTTFNLKKILKQYYDEHLETIDLNKPRDVLDNLIMEYKNQNAIE 146965

Query:    281 CDKSFVAIAIELLAAGT 297
              CDKSFVAIAIELLAAGT
Sbjct: 146964 CDKSFVAIAIELLAAGT 146914

Query:    298 DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATL 357
              DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATL
Sbjct: 146822 DTNSSTSEWFLTYLVNNPIYQDKIYDELIGALKINKPLKGSDVLINLSHRPLTPLFNATL 146643

Query:    358 KEVLRLIPATPFSVPRMSNEGFEVDGIKIPKG 389
              KEVLRLIPATPFSVPRMSNEGFEVDGIKIPKG
Sbjct: 146642 KEVLRLIPATPFSVPRMSNEGFEVDGIKIPKG 146547

Query:    390 TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLGSNFSQHELYI 449
              TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLGSNFSQHELYI
Sbjct: 146458 TYLFPSMYSIFRDEKYWGENANKFYPERFLTDSHSNNYFPYGVGKRMCLGSNFSQHELYI 146279

Query:    450 CLTNIVLNFKIKSIDGKPLNEIPN 473
              CLTNIVLNFKIKSIDGKPLNEIP+
Sbjct: 146278 CLTNIVLNFKIKSIDGKPLNEIPS 146207


CYP519A1 Seq. 12 32% to 508 complete sequence
MESIINLIFYIIIFLILIDF (0)
LKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLA
MKKGGIYSVWLGDEKVFILTDPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEY
SQWKINSKWVSSAFTKTKLKTIGDLIEKESNYFIEHLKAYSNSGQP (0)
IFPKPYISKFGINVISGMMFSQVISKDESVDKGAMEKLTVPIQAVFKRLGADNLDDFISI 
LQPVFYFQNEKFKRQVQEIYDYLEG
IYNQHDTNLDTENPKDLMDLLIISTEG
KERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNP
DVQKKAYQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEET
TIMGYTIPKGSQIFQNVYGMSHLFV
SDPYKFKPERWIEYK 
KQKDLLKEKEMQQLQEGADVVIDNKNNIKNNNSSNKPNSKTNSIFDD
LDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFELKSIDGKPIDDTEVYGLTIHTKVHPISLTLRP*
AU037246
AU039172
AU071956
AU072353
AU072354
AU073173
AU074914
C84123
IIAFP1D41514
JC1a150h12.r1
c-JC1b185d08.r1
JC2a50a01.r1 93% to seq 12
YSVGNRKCPGAFFSELALFSLGSNILLNFTLKSIDGKPIDDTEVYGLTIHTKVHPISLTLRP
JC2a186g04.r1
JC2a186g04.s1
JC2a236b08.s1
JC2b07f01.s1
JC2b85e11.r1 ends match seq 12; middle does not
JC2b306d05.s1
JC2b332h01.s1
JC2c176f07.r1
JC2d23b01.r1
JC2e69g10.s1
JC2e116g02.s1
JC2e116g02.r1
QELINALKDEDNKKFIPISKKDHCPYMGAIFKQGLLLKPLGLLRIPPVALK*TTIMGYTI
PKGSQIFQNVYGMSHLFVSDPYKFKP*RWIEYKKQK
sdic6A22g4.p1c
sdic6A45h1.p1ca
SSB654
SSC825
SSE270
SSE271
SSG312
SSM444
SSM473
AFD538
AFK771

>JC2b07f01.s1_frame-1 Clone JC2b07f01, standard read, bases 149 through 555, from 1999-06-29, translation frame -1
YFFLFXVYFDN*NFLFFFILILINQWSLL*I*YFI*LFF*F*LILKXLYFENYIL**NII
IYIILK*FFFFFKYS*KRIYXLGKMSPXEEVXHFQYLEIYQN*ERTLTDI*QTWQ*KKEE
SIQFG*EMKKFSF*L
>JC2b07f01.s1_frame-2 Clone JC2b07f01, standard read, bases 149 through 555, from 1999-06-29, translation frame -2
IFFYSRFILIIKIFYFFLY*Y*LINGVYYKFNILYNYFFNFN*F*XIYILKTIYYNKI**
YILY*NNFFFFLNTVKKEYXF*EK*APXRRXCISNIWRFTKIRREPSQIFNKLGNEKRRN
LFSLXRR*KSFHFN
>JC2b07f01.s1_frame-3 Clone JC2b07f01, standard read, bases 149 through 555, from 1999-06-29, translation frame -3
FFFIXGLF**LKFSIFFYININ*SMESIINLIFYIIIFLILIDFEXFIF*KLYIIIKYNN
IYYIKIIFFFF*IQLKKNIXFRKNEPPRGGXAFPIFGDLPKLGENPHRYLTNLAMKKGGI
YSVWXGDEKVFILT

>sdic6A22g4.p1c_frame+1 translation frame +1
ILMIKIFYFFLY*Y*LINGVYYKFNRLYNYFCNFN*FCKYLYFENYIL**NIIIYIILK*
FFFFFKYR*KRIYLLGKMSHQEEVLHFQYLEIYQN*ERTLTDI*QTWQ*KKEESIQFG*E
MKKFSF*LTQKQLEMHGLNSLEILVTIQKQKVLEYFPVILMIWPSLNILNGK*IVNGYHL
PSQRQN*KLLGD
>sdic6A22g4.p1c_frame+2 translation frame +2
F**LKFSIFFYININ*SMESIINLIGYIIIFVILIDFVNIYILKTIYYNKI**YILY*NN
FFFFLNTDKKEYIF*EK*ATKRRCCISNIWRFTKIRREPSQIFNKLGNEKRRNLFSLVRR
*KSFHFN*PRSS*RCMG*TV*KF**PSKNKKC*NIFR*F**YGLR*IFSMENK**MGIIC
LHKDKIKNYWV
>sdic6A22g4.p1c_frame+3 translation frame +3
FDD*NFLFFFILILINQWSLL*I**VI*LFL*F*LIL*IFIF*KLYIIIKYNNIYYIKII
FFFF*IQIKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGD
EKVFILTDPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSEWVSSA
FTKTKLKTIG*
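
Frame translations like the ones listed above for the clone reads can be reproduced with a short script. The sketch below assumes the standard genetic code and uses hypothetical function names. Unknown or ambiguous codons are written as X. The frame labels (+1 to +3, -1 to -3) follow one simple convention, offsets 0 to 2 on each strand, and may not line up with the minus-frame numbering used by a given BLAST server.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    return dna.translate(COMPLEMENT)[::-1]


def translate(dna, offset=0):
    # Translate complete codons only, starting at the given offset.
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(offset, len(dna) - 2, 3))


def six_frames(dna):
    """Return a dict of the six translations keyed '+1'..'+3', '-1'..'-3'."""
    rc = reverse_complement(dna)
    frames = {}
    for k in range(3):
        frames[f"+{k + 1}"] = translate(dna, k)
        frames[f"-{k + 1}"] = translate(rc, k)
    return frames


if __name__ == "__main__":
    # First codons of the CYP517A1 N-terminal exon as they appear in Contig_1803.
    for name, pep in six_frames("ATGGAAATAATTAATGTTTTTTTG").items():
        print(name, pep)
```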

>CYP519A1 dd_02971 chr 2 genome assembly one extra exon missing N-term
MLKKNISFRKNEPPRGGVAFPIFGDLPKLGENPHRYLTNLAMKKGGIYSVWLGDEKVFIL
TDPEAVRDAWVKQFRNFSDHPKTKSVRIFSGNFNDMAFAEYSQWKINSKWVSSAFTKTKL
KTIGDLIEKESNYFIEHLKAYSNSGQPIFPKPYISKFGINVISGMMFSQVISKDESVDKG
AMEKLTVPIQAVFKRLGADNLDDFISILQPVFYFQNEKFKRQVQEIYDYLEGIYNQHDTN
LDTENPKDLMDLLIISTEGKERDMIIHIGMDCLLAGSDSTSATCEWFCLFMINNPDVQKK
AYQELINALKDEDNKKFIPISKKDNCPYMLSIFKEVLRLRPVGVLGIPRVALEETTIMGY
TIPKGSQIFQNVYGMSHLFVSDPYKFKPERWIEYKKQKDLLKEKEMQQLQEGADVVIDNK
NNIKNNNSSNKPNSKTNSIFDDLDKVSIPYSVGNRKCPGASLSELALFSLCSNILLNFEL
KSIDGKPIDDTEVYGLTIHTKVHPISLTLRP


CYP519B1 Seq (35+44+75) complete seq 51% to seq 12 496 aa
MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSLNPHRSLTELAKV
YGGVYSLHIGDSKTVVITDVSAFKDVTIKQFKNFANRPQPKSIRVITNFKGLAFADYDQW
QKTRKLVSSALTKTKIKTFNNLIEKQTENLIESMNEFSNKNEL (0)
FHPRKYLTKYSLNIILS
MLFSKEIGKNESINKGTMERLTIPFNEAFKKVGKVDDFLWFLSPFFYFSNKQYKKYIFDI
YYFMEEIYDQHLLDLDYNEPKDLLDQLIIASQGREKETVILVGMDFLLAGSDTQKATQEW
FCLYLINNPDVQKKAYQELISVVGKDCKFVTSNHIENCPYFISIIKEVFRIRSPGPLGLP
RISIDDTYLSNGMFIPKGTQILLNIFGMGNLLVSEPDQFKPERWINYKNQQQQKQQQQQQ
QVNNKNSIDSSESSNLEFFDDLEKVSNPFSLGPRNCVGMAIAKSSIYSVCSNILLNFELS
SINNQIIDDNEVFGVSINPKEFSIKLTKR
Contig6040
IIACP1D2091 N-term
IIAFP1D26250 
IIAFP1D53328
IIAGP1D2374
JAX4a216e09.r1 
JAX4a216e09.s1 91% to IIACP1D2091 45% to seq 12
JC1b162b04.s1
sdic6A33h1.p1c 81% TO N-TERMINAL OF SEQ (35, 44, 75) WITH NO INTRON
MNLINSILIFYFIWIVFDFIRKNRRISFNDPPSLWAFP

>JC3f115e15.s1 Clone JC3f115e15, standard read, bases 27 through 739, from
            2002-12-10
        Length = 711

  Minus Strand HSPs:

 Score = 259 (96.2 bits), Expect = 2.3e-21, P = 2.3e-21
 Identities = 48/48 (100%), Positives = 48/48 (100%), Frame = -3

Query:     1 MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSL 48
             MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSL
Sbjct:   643 MNLINLILYFILFWIVFDFIRKNRRISFNDPPSPWALPIIGHLHKLSL 500
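
The subject coordinates in the HSP above are nucleotide positions on the read, while the query coordinates count amino acids, so a gap-free translated HSP should span three subject bases per query residue. A minimal Python sketch of that check follows. The function name `hsp_codons` is hypothetical, and the check assumes no gaps or frameshifts inside the HSP.

```python
def hsp_codons(sbjct_start, sbjct_end):
    """Number of whole codons covered by a translated HSP on the subject.

    Works for either strand: minus-strand HSPs simply report start > end.
    Assumes the HSP contains no gaps or frameshifts.
    """
    span = abs(sbjct_start - sbjct_end) + 1
    if span % 3 != 0:
        raise ValueError("subject span is not a whole number of codons")
    return span // 3


# Query 1-48 against Sbjct 643-500 (minus strand, Frame = -3)
query_len = 48 - 1 + 1
assert hsp_codons(643, 500) == query_len
```

A mismatch between the two counts would suggest a gap, a frameshift, or a coordinate copied incorrectly.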

CYP519C1 Seq (31+32+33) complete sequence 39% to seq 12
MNILLLIFYFLVCFLIFDF (0) 
IKKNKVKKYDVPTLSYALPIIGHLYKLGVNPHRNLTKLVEKNGGIFSLWLGDIKTVIVTDP 
SINKEI MVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITK 
TKIISNYKEVIENQTKILINSMRTHSKINEP (0)
FKSKKYFGKFSISIVLGIMFKQDNDEKQININDNNIDNDPIT
KLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSAS
KVFKFMEEIYDQHLLKFDKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAAT
LEWVCLYLINNPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKE
VLRIRTPLPLSVPRIATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRW
INYYNQKQQQQQQQQQPQPIQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANI
ILNFQIKSIDGMQLKDIEVSGISIHPIPFSIKLISRN
CHR2.0.36204 
Contig14835 Chr 2
IIAFP1D21425 
IIAGP1D1935
JAX4a151f11.s1 92% to seq 33
JAX4b23a04.s1 91% to seq 33
c-JAX4a75h04.s1 
JC2a39h06.s1 
JC2a233c03.r1
JC2b147a08.s1
JC2b175d02.s1
JC2b175d02.r1
JC2b233a07.r1
JC2e04d01.s1 may extend to N-terminal exon (MLVLFINYLXXEXISXDF?)
JC2e22c10.r1 
JC2e115e03.s1 92% to seq 31
JC2f02e07.s1
JC2f02e07.r1
JC2f11b07.r1

>CYP519C1 dd_00538 chr2 genome assembly missing first exon
MNIKKNKVKKYDVPTLSYALPIIGHLYKLGVNPHRNLTKLVEKNGGIFSLWLGDIKTVIV
TDPSINKEIMVKQFTNFSDRPRLKSFESFTGGGVNLIFIDYNEKWPVIRKIVSSSITKTK
IISNYKEVIENQTKILINSMRTHSKINEPFKSKKYFGKFSISIVLGIMFKQDNDEKQINI
NDNNIDNDPITKLTEPIQQVFLLLGTGNISDFIKILRPFFKNEYKKLNNSASKVFKFMEE
IYDQHLLKFDKSNPRDLMDYFIEYEFTNSPNTTLEEKKISIIKGCMSFVFAGDDTVAATL
EWVCLYLINNPAIQEKCYNELISVLGDNNNESKIKFISLKERDNCQYLINVIKEVLRIRT
PLPLSVPRIATQNCEINGFFIEKGTQILSNAFGMSHLYVDEPNVFNPDRWINYYNQKQQQ
QQQQQQPQPIQNNNYFNDLDRVCLPFSTGPRNCVGISIAELNLFSVCANIILNFQIKSID
GMQLKDIEVSGISIHPIPFSIKLISRN

CYP519C2P seq 46 and related seqs 77% to seq 32
KIKKNDIPTLPFALPIIGHLYRPGSNPHRDLTKLVEKNGGIISLCLGDIKTVIFTDPSITKEL
JC2e83h08.r1 3 diffs with JC2e99d08.s1
JC2e99d08.s1 N-term
sdic6A2d5.p1c N-term 89% to JC2e99d08.s1
sdic6B24c11.p1c
FLFVNNKIIGYNNLLQIIYSLYLKKMRGKVKKG this might be an N-terminal exon

>sdic6B24c11.p1c
TTTTTTGTTTGTTAACAATAAAATAATTGGATATAATAATCTTTTACAAATTATATATTC
TTTATATCTCAAAAAAATGCGAGGAAAAGTAAAAAAAGGTTGATTTTAGTTTGTCAGAAT
TGGCCTTATATTATTATTTGGTGGGTTAAAAGTATAAATTTATTTGATGAATGTTTTTAA
ATTAAATTTATATTATACAAATAATTAAAAAAAAAGAAAAAAAAAAATTAAAGATAATAA
TTTTTTTTTTTTGATTAAAAATTAAAAAACATATTCTTTTTTTATAGATAAAGAAAAATA
AAAAAAAATGATATACCAACCTTACCATTTGCTTTACCAATAATTGGTCATCTTTACAGA
CCTGGAAGTAATCCACATAGAGATTTAACAAAACTGGTAGAAAAGAATGGTGGAATTATC
TCTTTGTGTCTTGGTGATATCAAAACTGTAATTTTTACAGATCCTTCTATAACCAAAGAA
TTATTAGGCCAATTTAAGTAATCCATAATAACTAGAAATTTGAAATAAAGAAATCCCTAC
TAAAATTTCTAATTTAAAAAATCTTGGGTAAATTAATAATTATAAAAAAAATTTAAAAAT
AAAAACTAATGAACTATCACTAGGT

>Contig_2045, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion
ATTGGAAGCAATATCAAGAATAAATAATTTAATTACTTGTTTAAGTAATGTTGGTATCAC
TTTTCAAAATAATACTACTAGTAATAAATTAGTTAATGACTATATCTATAATAGAGATAA
AAATGGTAATCAACTTTCAATTGAAGCTTCAGTTAGTTTATTAAATGAAATTATTAAAAA
AGAGAAAGAACAACAACAAGACCAACAAAATCAACAAGAACAACAAGAGCAACAAGATGA
TGATGATGATGAATACTTTGAAGATTATGAAGATGAAGATGAAGATTATCAATATGATGA
ATATGATGAAAATGATGAAAATGATTTATATGATTGTTGAGATGTGGAACAGCACTTGAA
TAATTGTTTTTTAAAAAAAAAGTTTTATTAAAAAAAATAATAATAAAGAAAATTCTTAAA
AAACTTAGACAAAAAACTATTACAAAAGATCCTTTGAAAATTTTTTTTTTTTTAAAAAAA
AAGGTACTCGAAATGATTCATTTATATGTTGACGAACCAAATGTATTTAATCCTGATAGA
TTCACAAAATCAAAAACTACCAACAACAACTGATCAACAACAATGATCTTGATAGGGTTT
GTTTACCATTTCCAATTGGTCCTAGAAATTGTGTAGTGTTGTAGTGAGTTTTTTCACCCC
GACTGTAGGGAAAAATAGCGCTATGTGGTCAAGAAAAAAAAAAAATGTAAAATTTCCCTA
CAACATGGGTTCCACAAAAAATAAAAAAATAAATAATAAAATTGAAAGTTGACTAACCAT
CAAATAAAAAAATAAAAAAAAAGTGGTTCATTTATTTTTATTATATATTTTTGTTTTTTA
TTTATTTTTTAAATTATCACCTTTTTTATTATCTCTTTTTTTAATTATAAATTATTTTTC
AAATTAATTCAAACACTAATTCTACTCAAAAAATCAAAACATTAATTCTGCACCTAAAAT
CAGCCATGGCTGAAATTATTTTTTATTTTCTTCTTATCAATTTAGTTTATATTAATTGTC
TAAAACAATAGATAAAAATTGATTTTGGTAAATTTATCTCTTCACCATCTATATTTGATA
CAGCTCTTAACAATGGCGAAAAATTACAACTATTATTAATCGAATTAGCTGTTGGTGTAT
TTATTAAAACTCTTTTTTTATTTTTTTTTATTGGTATATTATTAATATTGAAATGATAGT
AATACAAATTAATTATTCTTTGTTTTTTTTTTTTTTCAATTTTAACATTATATAATTTGT
ATAAATTATATATTTGTTCGCACATGTTTACGAAATCTTCAGTAGAGATTTTTTTTTTTT
CAAAAATAATAAATAATTGTTTATTCATACCCCATAATATAAACAATCTGTGAGAATCCA
AGGATAAATTAAGACTGTTGATATTAGTTAAAATAAAATGTATTGTTTTAACTGTTTATA
ATATTTATTATTTATTTAATTATTTTTATTTTTTTATAAATAATTTTTTTTTTTTTTTTT
TTAATTTTCACAGTTCCCAAAACAAATCTTATAATAAATTAAAGTAAATAGTTATTAATA
GTATATTATAAATAGATTTTGATAATAAATTATAAGTTTTTTAATTTAAAGATAATATAT
TCTTTAAATCGATTTTATCATACCACCTGCCTATTTTTTTTTTTTTTTTTTTTTTTTTTT
TTTTTTTTTTTTTTGTTAACAATAAAATAATTGGATATAATAATCTTTTACAAATTATAT
ATTCTTTATATCTCAAAAAAATACAAGGAAAAGTAGAAAAAAGTTTATTTTAGTTTGTCA
GAATTGGCCTTATATTATTATTTGGTGGGTTAAAAGTATAAATTTATTTGATGAATGTTT
TTAAATTAAATTTATATTATACAAATAATTAAAAAAAAAGAAAAAAAAAAATTAAAGATA 1920
ATAATTTTTTTTTTTTGATTAAAAATTAAAAAACATATTCTTTTTTTATAGATAAAGAAA
AATAAAAAAAAATGATATACCAACCTTACCATTTGCTTTACCAATAATTGGTCATCTTTA
CAGACCTGGAAGTAATCCACATAGAGATTTAACAAAACTGGTAGAAAAGAATGGTGGAAT
TATCTCTTTGTGTCTTGGTGATATCAAAACTGTAATTTTTACAGATCCTTCTATAACCAA
AGAATTATTAGGCCAATTTAAGTAATCCATAATAACTAGAAATTTGAAATAAAGAAATCC
CTACTAAAATTTCTAATTTAAAAAATCTTGGGTAAATTAATAATTATAAAAAAAATTTAA
AAATAAAAACTAATGAACTATCACTAGGTTTCAAAAGCTCATTGGGAGGAATATTATTTG
AGAATCGAACCATTGATCTTAAGTGTTAATTTTTATAAATTTCGCCCAAAGGGGGGCTCG
AACCCCCGACCACAAGGTTAAAAGCCTTGCGCTCTACCGACTGAGCTATCTGGGCCTTGA
ATGAAAATTGCATTTTTATTACAAAAAATCGATCAATTAAAATTATTTTTCAAAAAGTTA
TAAAATAAACATTCAAAAACAAAAAAGAGTATACTCTTAAGTGCAAATTATAAGGGGGAC
TGAAACAGAGGGACAAATATTGTATTATAGGATGCAATTGCGTCTTATAATTCATTTTAT

CYP519D1 Seq. (14+34) 38% to seq 12 566 aa
MNVFVLTVFICIIYLLFDL (0)
IKKNKKLKDEPPTPKLALPLIGHLYLLGDRPNRSFLELSKRYGG
IFKIWMGEYPTVVLTDPDHVNEVWCKQFLNFTNRPHFNSLDQFSSGFRNLSFSDYPLWSE
LRKLVSSSFTKSKVKGISNLLETQTNYLINTMNNYSINNKP (0)
FNPKKYIHKLTLNVVCMIA
FSKEIKNDEDVNEGDMARLTKPKEMILKHLGSSNFCDFVPLVRPLFYLKNKRFDQTLKQV
IEYIKEIYDDHLLNLDLNSSPKDIMDLLIMSTNDSKEDIIIHTCIDLLIAGSDTVGVTFR
VFRFIYPNNPIIQEKCFNELFNAFSNSNNTDNNNNNSTITAA
IGFGDEYSSKTPFLNACIKEVLRIKPVTSLGLPRIANDDTFVNGYRIPKGT
PIIENIYGLSNSDQLIDDPTTFNPYRWLEYQKLKS
FQNDLKQQQQQQQQQQQQQQQLQLQQEQQEQEQQKINLEFNNNNNNNNNNNNNNS
NNKHKYYNDLDKISIPFSTGRRGCVGVQLGEAELYIVCANLVYNFKIESWDGKKINELEDFG
IIIHPSSHNLKITKRNNK*
AU073802
C90413
IIAAP1D4848
IIADP1D0354
IIAFP1D73623
IIAFP1D78046
IIAFP1D97427
JAX4a136d08.s1
JAX4a136e05.r1
JAX4a169f04.r1
sdic6A3c3.q1t
SSI404 (deletion of mid region)

>IIAFP1D74108
        Length = 822

  Minus Strand HSPs:

 Score = 115 (45.5 bits), Expect = 3.9e-13, Sum P(2) = 3.9e-13
 Identities = 23/24 (95%), Positives = 23/24 (95%), Frame = -3

Query:    20 IKKNKKLKDEPPTPKLALPLIGHL 43
             IKKNKKLKDEPPTPKLALPLIG L
Sbjct:   109 IKKNKKLKDEPPTPKLALPLIGIL 38

 Score = 89 (36.4 bits), Expect = 3.9e-13, Sum P(2) = 3.9e-13
 Identities = 17/20 (85%), Positives = 18/20 (90%), Frame = -1

Query:     1 MNVFVLTVFICIIYXLFDLI 20
             MNVFVLT FICIIY LFDL+
Sbjct:   351 MNVFVLTFFICIIYLLFDLV 292
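
The "Identities" counts reported for the two HSPs above can be recomputed directly from the aligned Query and Sbjct strings. A minimal Python sketch follows. The function name `count_identities` is hypothetical, and the percentage is shown as a whole number rounded down, which is an assumption about presentation rather than a statement of how the search program rounds.

```python
def count_identities(query, sbjct):
    """Return (matching positions, alignment length) for two aligned strings."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have the same length")
    # A gap character never counts as an identity.
    matches = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return matches, len(query)


# First HSP above: Query 20-43 against Sbjct 109-38
matches, length = count_identities(
    "IKKNKKLKDEPPTPKLALPLIGHL",
    "IKKNKKLKDEPPTPKLALPLIGIL",
)
percent = 100 * matches // length
print(matches, length, percent)  # 23 24 95
```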

>IIAFP1D78046
TTTACCTATATAGNCNGCCCGTTTGAGTGAACTTCAGNGCCGAACTTTANAGGGATCCAC
CAACAACAACAATAGGATTTGGTGACGAATATTCTTGTAAAGCACCATTAATACATGCAT
GCATTAAAGAAGTTTTAAGAATTAAACCAGTTACATCATTAGGTTTACCACGTATTGCAA
ATGATGATACTTTTGTAAATGGTTATAGAATTCCAAAAGGAACTCAAATCATTGAAAATA
TTTATGGTCTTTCTATTCTGATCAATTAATAGATGATCCTACAACTTTTAATCCTTATCG
TTGGTTGGAATATCAAAAATTAAAATCATTTCAAAATGATTTAAAACAACAACAACAACA
ACAACAACAACAACAACAACAACAACAACAACTACAACTACAACAAGAACAACAAGAACA
AGAACAACAAAAAATTAATTTAGAATTTAATAATAATAATAATAATAATAATAATAATAA
TAATAATAATAGTAATAATAAACATAAATATTATAATGATTTAGATAAAATTTCAATTCC
ATTTTCAACTGGTAGAAGGGGATGTGTTGGTGTACAATTAGGTGAAGCAGAGTTATATAT
CGTTTGTGCAAATTTAGTTTATAATTTCAAAATTGAATCATGGGATGGTAAAAAAATAAA
TGAATTGGAAGATTTTGGTATTATTATTCACCCTTCTTCTCATACTTTAAAATTACAAAA
AGAAAATATTAATAANAGTANTAAAATAAAAAATAAAAANAAAAGATATTCTTTTTACCA
TATCACAATTTTTAGATATCTAAAAAAAA

>_1
FTYIXXPFE*TSXPNFXGIHQQQQ*DLVTNILVKHH*YMHALKKF*ELNQLHH*VYHVLQ
MMILL*MVIEFQKELKSLKIFMVFLF*SINR*SYNF*SLSLVGISKIKIISK*FKTTTTT
TTTTTTTTTTTTTTTRTTRTRTTKN*FRI******************T*IL**FR*NFNS
IFNW*KGMCWCTIR*SRVIYRLCKFSL*FQN*IMGW*KNK*IGRFWYYYSPFFSYFKITK
RKY**XX*NKK*KXKIFFLPYHNF*ISKKX
>_2
LPI*XARLSELQXRTLXGSTNNNNRIW*RIFL*STINTCMH*RSFKN*TSYIIRFTTYCK
**YFCKWL*NSKRNSNH*KYLWSFYSDQLIDDPTTFNPYRWLEYQKLKS FQNDLKQQQQQ
QQQQQQQQQQLQLQQEQQEQEQQKINLEFNNNNNNNNNNNNNNS NNKHKYYNDLDKISIP
FSTGRRGCVGVQLGEAELYIVCANLVYNFKIESWDGKKINELEDFGIIIHPSSHTLKLQK
ENINXSXKIKNKXKRYSFYHITIFRYLKKX
>_3
YLYXXPV*VNFXAELXRDPPTTTIGFGDEYSCKAPLIHACIKEVLRIKPVTSLGLPRIAN
DDTFVNGYRIPKGTQIIENIYGLSILIN**MILQLLILIVGWNIKN*NHFKMI*NNNNNN
NNNNNNNNNNYNYNKNNKNKNNKKLI*NLIIIIIIIIIIIIIIVIININIIMI*IKFQFH
FQLVEGDVLVYN*VKQSYISFVQI*FIISKLNHGMVKK*MNWKILVLLFTLLLIL*NYKK
KILIXVXK*KIKXKDILFTISQFLDI*KK
>IIAFP1D97427
        Length = 792

  Minus Strand HSPs:

Query:     1 MNVFVLTI 8
             MNVFVLT+
Sbjct:   698 MNVFVLTV 675

Query:     8 IKKNKKLKDEPPTPKLALPLIGHLYLLGD 36
             IKKNKKLKDEPPTPKLALPLIGHLYLLGD
Sbjct:   456 IKKNKKLKDEPPTPKLALPLIGHLYLLGD 370

>IIAFP1D97427
TTTAACTTTTNNAAGCACTGNGGTGAATGCCCCCCGACCTTAGAGTGATCCACTNATTAA
TTAAATAATTTGTTTGAGTTTCTAATATGTTTGAAATTCCTTTAACTTTTGATTTTGTGA
AGGATGAAGAAACTAATTTTCTAAGTTCTGACCAAAGTGGATAATCACTAAATGATAAAT
TTCTAAAACCTGATGAGAATTGATCTAAACTATTAAAGTGTGGTCTATTTGTAAAATTAG
AAACTGNTTACACCATACTTCATTAACATGATCAGGATCAGTCAAAACAACTGTTGGATA
TTCTCCCATCCAAATCTTAAAAATTCCACCGTATCGTTTTGATAATTCTAAAAATGATCT
ATTTGGTCTATCACCTAATAAATATAAATGTCCTATTAATGGTAATGCTAATTTTGGTGT
TGGTGGTTCATCTTTTAATTTTTTATTTTTTTTAATCTATTTTTAAATAAATAAAATAAA
TAAATAAAATTAATAAATAAATTAAAAAAGGGTAGTGGTGGTGCGTAAAAAAAAAAAGTT
TTGAAAAAAAAAAAGATATTTTTTTTTTTTTTTTTTTTTTTTGGGAATAAAAAATATTAA
AAAAATGAAAAAATTAAATAAAATTAATAAAAGATATTTACCAAATCGAATAATANATAA
ATAATGCAAATAAAAACAGTTAAAACAAATACATTCATTTTGAGTGTAAAATATTTTTTT
TTTAAACTCTAAAACTTTTTTTTTTTTTTTTTTCCTGATTTTTTTTTCTACCGCCCCATC
TTTATAATAAAA

>IIAFP1D97427_frame-1 translation frame -1
FYYKDGAVEKKIRKKKKKKVLEFKKKIFYTQNECICFNCFYLHYLXIIRFGKYLLLILFN
FFIFLIFFIPKKKKKKKKYLFFFQNFFFLRTTTTLF*FIY*FYLFILFI*K*IKKNKKLK
DEPPTPKLALPLIGHLYLLGDRPNRSFLELSKRYGGIFKIWMGEYPTVVLTDPDHVNEVW
CXQFLILQIDHTLIV*INSHQVLEIYHLVIIHFGQNLEN*FLHPSQNQKLKEFQTY*KLK
QII*LXSGSL*GRGAFTXVLXKVK
>IIAFP1D97427_frame-2 translation frame -2
FIIKMGR*KKKSGKKKKKKF*SLKKKYFTLKMNVFVLTVFICIIYXLFDLVNIFY*FYLI
FSFF*YFLFPKKKKKKKNIFFFFKTFFFYAPPLPFFNLFINFIYLFYLFKNRLKKIKN*K
MNHQHQN*HYH**DIYIY*VIDQIDHF*NYQNDTVEFLRFGWENIQQLF*LILIMLMKYG
VXSF*FYK*TTL**FRSILIRF*KFII**LSTLVRT*KISFFILHKIKS*RNFKHIRNSN
KLFN*XVDHSKVGGHSPQCXXKL
>IIAFP1D97427_frame-3 translation frame -3
LL*RWGGRKKNQEKKKKKSFRV*KKNILHSK*MYLF*LFLFALFXYYSIW*ISFINFI*F
FHFFNIFYSQKKKKKKKISFFFSKLFFFTHHHYPFLIYLLILFIYFIYLKID*KK*KIKR
*TTNTKISITINRTFIFIR**TK*IIFRIIKTIRWNF*DLDGRISNSCFD*S*SC**SMV
*XVSNFTNRPHFNSLDQFSSGFRNLSFSDYPLWSELRKLVSSSFTKSKVKGISNILETQT
NYLINXWITLRSGGIHXSAXKS*

>CYP519E1 seq 84+17 complete 48% to 519B1 (look upstream of QPLA for more)
MGIGLIILYLLIGLLAYDF (0)
TKKNKKISKNDPKQPLAIPVLGHLHLFGSQPHRSLTELAKKFGGIFTLWMGDERSMVITDPNILRELYVKNHLNFYNRASS
ESIRIYSGNLVDISFSVGESWKNRRRYVSAALTKTKVLNVITLIEEQANFLINSMQYYAKSGEP (0)
FFPHKYYNKYTMNIVMSIGFSKTISENESVEEGPISQLIIPFYNILENLGSGNLGD
YVWYTQPFFYFKNKKLEQDTKKVYTFLEEIYNEHIKNLDESNPRDLMDQLIISTG
GKEKDMVIHVST (0)  
DFLLAGSDTNASTLEWFCIFLANNPEIQKKAYEELISVVGKDCKAVTTKYRDDCPYLV
GAIKETLRMRTPAPLSLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYK
PERWVEYYKNKTPTREMEATTETKSNITTEILP
NDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGGKKIDETEV
FGITIHPKDFSIQLKKRE*
Dict-IV-V228a03.q1c
Dict-IV-V7e07.p1c
Dict-IV-V35d03.q1c
Dict-IV-V35d09.q1c
AU071937
IIBCP1D2282
SSC778
IICCP1D19875
IICBP1E43296
IICBP1E27480
JC1c295d05.s1
AU037557
IIAFP1D71814
SSD784 supports C-term 

>SSD784 (SSD784Q) /pub/dna_csm/LIBRARY/SS/SSD7-D/SSD784Q.Seq.d/
        Length = 600

  Plus Strand HSPs:

 Score = 633 (227.9 bits), Expect = 6.3e-61, P = 6.3e-61
 Identities = 120/120 (100%), Positives = 120/120 (100%), Frame = +3

Query:     1 SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM 60
             SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM
Sbjct:    39 SLIRVSEEDFMTSGGIFIPKGTQIVPNLYGIGQNFVDDPSSYKPERWVEYYKNKTPTREM 218

Query:    61 EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG 120
             EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG
Sbjct:   219 EATTETKSNITTEILPNDLDKVVLPFSIGPRNCPGNIISEINLFLACSNILLNFEFSNGG 398

>Contig_2589, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 2691

  Minus Strand HSPs:

 Score = 93 (37.8 bits), Expect = 1.9e-10, Sum P(2) = 1.9e-10
 Identities = 19/19 (100%), Positives = 19/19 (100%), Frame = -2

Query:     1 MGIGLIILYLLIGLLAYDF 19
             MGIGLIILYLLIGLLAYDF
Sbjct:  2441 MGIGLIILYLLIGLLAYDF 2385

 Score = 85 (35.0 bits), Expect = 1.9e-10, Sum P(2) = 1.9e-10
 Identities = 16/16 (100%), Positives = 16/16 (100%), Frame = -1

Query:    20 QPLAIPVLGHLHLFGS 35
             QPLAIPVLGHLHLFGS
Sbjct:  2208 QPLAIPVLGHLHLFGS 2161

>_4
ISL*FCKF*K*LLLYYYFYF*NLYLFVFLIKIFYFIFFLFFFFNLYFLN**TKKNKKISK
NDPKQPLAIPVLGHLHLFGSQPHRSLTELAKKFGGIFTLWMGDERSMVITDPNILRELYV
>_5
*PMIL*VLKIIITLLLFLFLKSIFICFFN*NILFYFFFIFFF*SLFFKLIDKKE*KN**K
*SKATIGNSSIRTFTFIWKSTTSFFNRISKEIWWNFYIMDG**KINGHNRPKYTS*IICX
>_6
LAYDFVSFKNNYYFIIIFIFKIYIYLFF*LKYFILFFFYFFFLIFIF*INRQKRIKKLVK
MIQSNHWQFQY*DIYIYLEVNHIVL*QN*QRNLVEFLHYGWVMKDQWS*QTQIYFVNYMX

CYP519F1 Seq 47+68+80 (complete) 47% to seq 58
MEILTFIIYLITFFILFDF(phase 0)
KKKKFKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYGGIYKIWLAERMLMI
VTDPEIIQDIWIKQHDKFVNRPHNITSQIFSLNHKSLVFGDVDEWNKVRPKMTCHFTKIK
LNSTKPKQIVNDQLKKMLKIMTTHSLDSKPFNQYVYLNTYSMNIILGLMLSIELPHSNSN
DKDGQFSKVLHSIDEIFKSIGTNGPEDIFPTLLPFFKNRISTFTNHLNVIKDFIRSIYKQ
QIKTFDINIEPRNIMDCLISEYYEDDDQEDEVAKQELIIQLCIDMLVAATDTSASTLEWF
MLFMINNPNLQEDLYEEVVNVVGKDCPYVTFDDVPKLALIKACYFEILRIRPVTSLSLPR
VSMEDTTTLNDIFIPKDTIIIQNIFGMGNSEKFVSNPTVFNPSRWLEYKKMKDLN QFGNR
DDSIDTTNTTTNTTLNGTTS KYYNDLERVSIPFGVGKRRCMAPSMADHNVLIAMANIVLN
FTMKSSDPKQMPLSEEEQYAITIKPKYPFKVLFEKRS
Contig4769 Chr 6
IIADP1D1565 
IIAFP1D30759 
IIAFP1D53731
IIAFP1D75367
JC1a198c01.s1
JC1a278b03.s1
JC2c167d08.r1
sdic6A29c1.q1t N-term
sdic6B9c1.p1c

CYP519G1 Seq 58 complete 47% to seq 47
MNYLLIIICIIFFSLFFDF (0)
KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYGGIYKIWLGESFSM (0)
VVSDPEIVNEIWVKQHDNFINRPKNITHK
MFSSNYRSLNFGDNPNWKFNRSMASSHFTKTKLLSSK
VTSVVEKKLNKLIETMEYHSINKLP (0)
FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYLKF
FFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSF
ISDLQSNDIDILLQICIDIVVAGT (1)
DTVANLLQWFVLFCINY
PEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCFRESLRIRPVTPLS
LPRVAKCDTYIKDDIFIPKG (0)
ATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYF
NDLDKISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNE
KEVYSITIKPQPFKLFLEKRV*
Dict-IV-V885f05.p1c
IIAAP1D3111
IIAAP1E3151 
IIAEP1D2263 
IIAFP1D59991 
JAX4a82a11.r1
JC1c13f09.s1
sdic6Ce8.q1t
Contig_0437
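
The exon-by-exon layout used for the sequence above, with an intron marker such as "(0)" or "(1)" at the end of an exon (written "(phase 0)" in the CYP519F1 entry), can be converted into a continuous protein sequence plus a list of intron positions. A minimal Python sketch follows. The function name `parse_annotated` is hypothetical, and it assumes every line holds only residues and markers, so free-text notes such as "frameshift" would need to be removed first.

```python
import re

# Matches "(0)", "(1)", "(2)" and the longer "(phase 0)" form
MARKER = re.compile(r"\((?:phase )?([012])\)")


def parse_annotated(lines):
    """Join exon lines into one sequence and record intron markers.

    Returns (sequence, introns), where introns is a list of
    (residues_before_intron, phase) tuples.
    """
    parts = []
    introns = []
    length = 0
    for line in lines:
        pos = 0
        for m in MARKER.finditer(line):
            # Keep letters only: drops spaces and a trailing '*' stop mark
            chunk = re.sub(r"[^A-Za-z]", "", line[pos:m.start()])
            parts.append(chunk)
            length += len(chunk)
            introns.append((length, int(m.group(1))))
            pos = m.end()
        chunk = re.sub(r"[^A-Za-z]", "", line[pos:])
        parts.append(chunk)
        length += len(chunk)
    return "".join(parts), introns


seq, introns = parse_annotated([
    "ISDLQSNDIDILLQICIDIVVAGT (1)",
    "DTVANLLQWFVLFCINY",
])
print(seq)      # ISDLQSNDIDILLQICIDIVVAGTDTVANLLQWFVLFCINY
print(introns)  # [(24, 1)]
```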

>Contig_0437, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 7555

  Minus Strand HSPs:


Query:    11 IFFSLFF-DF-KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG 68
             +FF + + +F KI+KNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG
Sbjct:  7114 LFFIINY*NF*KIKKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG 6935

Query:    69 GIYKIWLGESFSMVVSDPEIVNEIWVKQHDNFI 101
             GIYKIWLGESFSMV    +I+  I +K ++N +
Sbjct:  6934 GIYKIWLGESFSMV----KII--IIIKNNNNLL 6854

Query:    82 VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT 141
             VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT
Sbjct:  6830 VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT 6651

Query:   142 KLLSSKVTSVVEKKLNKLIETMEYHSINKLPFDSY 176
             KLLSSKVTSVVEKKLNKLIETMEYHSINKLP   Y
Sbjct:  6650 KLLSSKVTSVVEKKLNKLIETMEYHSINKLPVSIY 6546

Query:   170 KLPFDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYL 229
             KL FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYL
Sbjct:  6499 KL*FDSYVGFSEYSLNIILNMLVSMDIDECENSTQNVIYSINEIFKMLSTNSPQYSFPYL 6320

Query:   230 KFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFISDLQSNDIDILLQI 289
             KFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFISDLQSNDIDILLQI
Sbjct:  6319 KFFFKKDLNNFKFHLDKIKSFIHSIYLKQLESYDPSNPRNILDSFISDLQSNDIDILLQI 6140

Query:   290 CIDIVVAGTDTVANLLQWFVLFCI 313
             CIDIVVAGT  +  ++   ++  I
Sbjct:  6139 CIDIVVAGTGNIIIIIIIIIIIII 6068

Query:   293 IVVAGTDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCF 352
             +++   DTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCF
Sbjct:  5967 LLLFNIDTVANLLQWFVLFCINYPEIQEKLYNEIIEVVGKDCKVLKYEHISKMPYLYGCF 5788

Query:   353 RESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGAT 387
             RESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKG +
Sbjct:  5787 RESLRIRPVTPLSLPRVAKCDTYIKDDIFIPKGVS 5683

Query:   381 FIPKGATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDK 440
             F  K ATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDK
Sbjct:  5611 FFFKKATIIQNIFGMGNDEKYISEPNKFKPERWVEYIKNKKVNKNGNENSVNKYFNDLDK 5432

Query:   441 ISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKL 500
             ISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKL
Sbjct:  5431 ISIPFGVGKRQCLSPAMAEQESLLSIATVVLNYKLKSNGQKKLNEKEVYSITIKPQPFKL 5252

Query:   501 FLEKRV 506
             FLEKRV
Sbjct:  5251 FLEKRV 5234


MNYLLIIICIIFFSLFFDF JC1c13f09.r1 possible N-terminal of 520B1 
|| || |       | |||
MNILLLIFYFLVCFLIFDF N-term of seq 519C1
IIAFP1D75985 matches N-term seq shown above from JC1c13f09.r1

>JC3e96f03.r1 Clone JC3e96f03, reverse read, bases 60 through 572, from
            2002-09-12
        Length = 511

  Minus Strand HSPs:

 Score = 320 (117.7 bits), Expect = 2.3e-27, P = 2.3e-27
 Identities = 57/61 (93%), Positives = 59/61 (96%), Frame = -1

Query:    83 VVSDPEIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMASSHFTKT 142
             VVSDP+IVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSM + HFTKT
Sbjct:   184 VVSDPKIVNEIWVKQHDNFINRPKNITHKMFSSNYRSLNFGDNPNWKFNRSMVNGHFTKT 5

Query:   143 K 143
             K
Sbjct:     4 K 2

 Score = 199 (75.1 bits), Expect = 5.1e-14, P = 5.1e-14
 Identities = 42/93 (45%), Positives = 61/93 (65%), Frame = -2

Query:    11 IFFSL-FFDFKKKKFKKNKRYSKSPNKEANGPWSLPIIGGLHLIGDRPNRSFSELSKIYG 69
             +FF + + +F K +   N  + +   K+ NGPWSLPIIGG++LI D PNR+ ++LSK YG
Sbjct:   468 LFFIINYLNF*KIRKNWNLNFKRLFKKDVNGPWSLPIIGGIYLINDNPNRALTKLSKKYG 289

Query:    70 GIYKIWLAERMLMVVSDPEIVNEIWVKQHDNFI 102
             GIYKIWL E   MV    +I+  I +K ++N +
Sbjct:   288 GIYKIWLGESFSMV----KII--IIIKNNNNLL 208

CYP521A1 Seq. 13 complete seq 36% to seq 12
MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRTDPYKTLAKASKK
TEHGILKCWNGEHLMVVVDNPSIIKQMYVNTNNFTDRPQTKVFEIISRNYKNSGFANGEK
WKHLRGLYAPSFTKIKSRPHENIILKYVNFEIKSLKNHAITNSIYNPFLIENINSFGTKV
ITEIIFGREFSENEVYSLIG PMNKLFGILDTPFPSESISFLKPFYRRSYKECDKQCEELF
KLVEKVYDDHLLNLDKDNPKDVMDVMIVETDFKEKDHVICICCDLLMGTKDTFNTIVLWF
FVLMINYQDVQLKGYQEIIK
VLECTGRDHVTIEDIDKLPYIDGIIKEISRIH PAGPLSVPRTAINDIMINGYFIPK
GCHVFQNTYGAVYNYMK
ESDEPCKMKPERWIENEKLRK
DGKLDPTNDLALISLPFSSGIRNCPGVGFAEYELFLLFSNIILNFHLSSPNNLKLNESGH
FGLTMKPFPFLVDLN*
AU071731
AU074308
C84168
C90134
Contig15846
IIAAP1D2650
IIAFP1D39484
IIAFP1D63655
IIAFP1D67462 Length = 834 
IIAFP1D78046
IIAFP1D81808
IIAFP1D85856
JC1b64g05.s1
JC1b246f07.s1
JC2d102b06.r1
SSC405
SSI130
SSI404
SSK320

>SSK320 (SSK320Q) /pub/dna_csm/LIBRARY/SS/SSK3-A/SSK320Q.Seq.d/
        Length = 625

  Plus Strand HSPs:

 Score = 242 (90.2 bits), Expect = 1.6e-19, P = 1.6e-19
 Identities = 48/48 (100%), Positives = 48/48 (100%), Frame = +1

Query:     1 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 48
             MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT
Sbjct:    16 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 159

>IIAFP1D81808
        Length = 786

  Minus Strand HSPs: no first intron

 Score = 242 (90.2 bits), Expect = 1.3e-19, P = 1.3e-19
 Identities = 48/48 (100%), Positives = 48/48 (100%), Frame = -3

Query:     1 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 48
             MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT
Sbjct:   625 MILLTLLYLIIFYIIIDFIKKNYKTKNQLPSPLGIALPIIGHLHLLRT 482

CYP522A1 Seq. 18 29% to seq 85 489aa
MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKER
FHKSFDKFYDKYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQ
GKSILGCSPDEWKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNI (0)
VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMP
IFEIFTS (0)
YKDIDGVVKEMYALVKPFLEKYLKQH
DRNNPKCALDHMINCILDQDEPKLITYEHLPHFLMDMFIGGTESTARTM (0)
DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHR 
LRPIQPIIASRVVNDPIVLKHECSAKGES 
YTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEILHMTFDIGIRTCPF
MSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQVIERDHK*
AU036927
AU071581
Contig14543 Chr 2 (also listed in Excel as 14583; check for typo)
Contig 15712 
IIAEP1D0380
IIAFP1D28356
IIAFP1D73782
IIAFP1D77020
IIAFP1D85481
IIAGP1D3671
c-JAX4a56g08.s1
JAX4a66b12.s1
JAX4a86b01.r1
JAX4a86b01.s1
JAX4b25h02.r1
JC1a76h05.s1
JC2a62d07.r1
JC2b337b05.r1
JC2e96f05.r1
JC2e128b02.s1
SSB828
SFH475
SFH209
AFO254
AFJ405
SFK749
AFA531
AFH875
AFB727
AFO263
AFK692
SFH771
SFA776
SFD153

>Contig_2198, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 4993

  Minus Strand HSPs:

 Score = 993 (354.6 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 187/187 (100%), Positives = 187/187 (100%), Frame = -3

Query:   303 DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHRLRPIQPIIASR 362
             DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHRLRPIQPIIASR
Sbjct:  2438 DWFTLMMTNRKEMQDRIRTELLDVGIRLPVLVDKQKYPLLNASIKEIHRLRPIQPIIASR 2259

Query:   363 VVNDPIVLKHECSAKGESYTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEIL 422
             VVNDPIVLKHECSAKGESYTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEIL
Sbjct:  2258 VVNDPIVLKHECSAKGESYTIPVGTLIIPNAHSFNFDPQYHKDPLTFNPNRYIGDNPEIL 2079

Query:   423 HMTFDIGIRTCPFMSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQ 482
             HMTFDIGIRTCPFMSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQ
Sbjct:  2078 HMTFDIGIRTCPFMSFAIDELFIIFSRLFQSFEFQPIDNTPISEEAFTINSIRPKQWSCQ 1899

Query:   483 VIERDHK 489
             VIERDHK
Sbjct:  1898 VIERDHK 1878

 Score = 877 (313.8 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 168/168 (100%), Positives = 168/168 (100%), Frame = -2

Query:     1 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKERFHKSFDKFYD 60
             MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKERFHKSFDKFYD
Sbjct:  3570 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKERFHKSFDKFYD 3391

Query:    61 KYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQGKSILGCSPDE 120
             KYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQGKSILGCSPDE
Sbjct:  3390 KYKDFYFIKFGQHDCIVLNSPKLIKQVVIEQSDSVFERFHTPSIKRYAQGKSILGCSPDE 3211

Query:   121 WKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNIV 168
             WKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNIV
Sbjct:  3210 WKKLRSFIVISFSKNKMGQQVLDKIFHTQYLKFENHIKKLIKSNNNIV 3067

 Score = 409 (149.0 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 75/81 (92%), Positives = 77/81 (95%), Frame = -2

Query:   222 FEIFTSYKDIDGVVKEMYALVKPFLEKYLKQHDRNNPKCALDHMINCILDQDEPKLITYE 281
             + +   YKDIDGVVKEMYALVKPFLEKYLKQHDRNNPKCALDHMINCILDQDEPKLITYE
Sbjct:  2754 YRLTEQYKDIDGVVKEMYALVKPFLEKYLKQHDRNNPKCALDHMINCILDQDEPKLITYE 2575

Query:   282 HLPHFLMDMFIGGTESTARTM 302
             HLPHFLMDMFIGGTESTARTM
Sbjct:  2574 HLPHFLMDMFIGGTESTARTM 2512

 Score = 303 (111.7 bits), Expect = 8.3e-260, Sum P(4) = 8.3e-260
 Identities = 60/60 (100%), Positives = 60/60 (100%), Frame = -2

Query:   168 VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMPIFEIFTS 227
             VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMPIFEIFTS
Sbjct:  2997 VTLEPEFKRLTISIIFNFQFGTDLEFTDPLIDSLLVCTEKIIASCQKASDLMPIFEIFTS 2818

>Contig_2198, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion
GATGATGATGAAGATGAAGATGAAGATGAAGAACATGATGAAGATGAAGAAGAGGATGAT
GATAATATATTTTATGAATCATTTATTAATGGTGATATTAAAATTTGTAAATTAATTGAA
AAATTTTGTCCTAATCAATTTAAAATAACAAAGAAATCAATTTTAAATGCAATTAAAAAT
GGAAATATTCACATTGTTAAATATTATTATAATCATATTAAAAATAATGATCAATTATTA
TTTAATAATTTTAAAAGTTTATTTTCAAAATATTTTTTATAATAAACATTTTATTTATTT
ATTAATTAATTAATTTATTTATTTAATTGTTTTTTTTTTTAAATTTTTTTTTATTTTTTT
TTATTTATTTATTCTAATAAAATACACCAATCAATATTTGGTTTAATTGTTTTTGAATTA
TTTAAAGTTGAAACAGCCATATGACCAGCAAAGAATTCTGTATTATATTCAGTACCATTA
TCATCGATTTTAACAGGTACAAAGAGGAAACCTCTTGGGACGTCATTGGTATTGACAAAG
ATCCAATCAGTGTTGAATTTTCCTCCTCCAAATTCATGTCTTTCTTTATTTTTACCCATC
CATTTACCATCATTATCAAAAACACAGAAAGCTGTAATCCAACCACTGATATAACGTGGA
CCTGAGCCTGCACCTTCTTGACTGGCAATACGATTCCACCATTCAATATCTGGTTTACCA
TTGACGGAACTAATGATTTTATCAATGATTGGTGAGAGCATTTCAGACCAATCTTTCATA
ACTGAACCATCTTTATTATCTTTGAAATCAAATTCTTTTAATTTTTCAATTCTATTCTTT
ATATCGACCCAATCATCGAGGGTACCCTCTAATGTTACCTCTGGTAAACCACACATTAAA
CACATTTTAAAGTCAAAGTAACTTTTAACGGTTGCCATTAATGCAATTGATGCTGCCATT
GTATCGTTTGGAGTTGTAGTTGAAAATGGTTTGTTTGCCCATTCTCTAATTGATGGATCT
TTGATATTCTTTTCAATTTCTTGAGTCATTCTAATGGTTAATGATTGATAATCTGCTGTC
ATCAATTTTCCACCACCCCTTACTACCAATTGTTTTTTTCCTTGGAAATCAACAATCTTA
CTTCTTAATTGTTCAGCGTTGGCATTTAAATATGACGAGAATTGTACCAAAATTGCCATC
CAAATATCATCTGGACGAATGATTAAAGAATGGTGATTTGAATACGCAATGAACGAACTT
AAAACAAATGAATTACCGCCTACACTCTTTGAAATACCTTCTTTGATACTACTTCTAACT
ACTTTTCTTTCTGGTTCATTTGGTTTACCACCATACAATTTTTGGTGTGAAAAATCTAAT
CCTTTGAATTCACTCTCTCTTACATTTGCTACTTTAAATGTTATAGACATTGTTGTTATT
GTTCTGCTGTTGTTGAAAAAATGGAATTAATAAAAAAAAAAAAAAAAAAATTTATATAGT
TAATAAAAAACAAAAAAAACAAAAAAAAAAAACCCAAAAAAAACAAAAAAAAAAACCAAA
AAAAAAAATTTAAAATTTAATTTAAAAAAATAAAATAAAAAATAATTTAATTGAAAATTA
CTTTCTCGGGTGGGAAATTCAAAGTAGAATTTCGTAAGGTCCGAGCGTTGAAAATAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAATGAAAATAATAGTTTTTTTTGGAATTTGGACTC
ATTTGTTTTTGTTAATTAAAAACATTTAATGATTTTTTTTTAATTAAAAAAATAAAAAAA
AAAAAAAATCTTTTAATTTTTTTTTACAATTGTAAAAATACCTTATCTAATTTTAATGTT
GTATAATAATTTATTTATTTATGATCTCTTTCAATTACTTGACATGACCATTGTTTTGGT
CTAATTGAATTAATTGTAAAAGCTTCTTCTGATATTGGTGTATTGTCAATTGGTTGGAAT
TCAAAAGACTGGAATAATCTAGAGAATATAATAAACAATTCATCGATAGCAAATGACATG
AATGGACATGTTCTAATACCAATATCAAATGTCATATGAAGGATTTCAGGATTATCACCA
ATGTAACGATTTGGATTGAAAGTTAATGGGTCTTTGTGATATTGTGGATCAAAGTTGAAT
GAATGTGCATTTGGTATGATTAATGTACCAACTGGAATAGTATAACTTTCACCTTTGGCA
CTACATTCATGTTTTAAAACAATTGGATCATTAACAACACGTGAAGCAATAATAGGTTGA
ATTGGTCTTAATCTATGAATTTCTTTAATTGATGCATTTAACAATGGATACTTTTGTTTA
TCAACTAAAACTGGTAATCTAATACCAACATCTAATAATTCTGTACGTATTCTATCTTGC 2400
ATCTCTTTTCTATTTGTCATCATTAATGTAAACCAGTCCTGTATTTATATAAACAACAAA
ATAATTAGTAACTATTGTTTGGGTAGCACTTTCACATTAATGTATAATTACCATTGTTCT 2520
AGCAGTTGATTCAGTACCACCTATAAACATATCCATTAAGAAATGTGGCAAATGTTCATA
TGTAATTAATTTTGGTTCATCTTGATCTAATATACAATTTATCATATGATCTAATGCACA
TTTTGGATTATTTCTGTCGTGTTGTTTTAAATATTTCTCTAAAAATGGTTTAACCAATGC 2700
ATACATTTCTTTAACTACACCATCAATATCTTTATACTGTTCGGTCAATCGATAATTTAC
AAATTAATAAATTATACATACTACACTATTAATAATAATAATAAATAAAATACATACACT 2820
TGTAAATATTTCAAATATTGGCATTAAATCTGATGCTTTTTGACAACTTGCAATTATTTT
TTCAGTACACACTAATAAAGAATCAATTAATGGATCAGTAAATTCTAAATCTGTACCAAA 2940
TTGAAAATTGAAAATAATACTAATTGTAAGTCTTTTAAATTCAGGCTCCAATGTAACCTA 3000
TACAAATAAATAAATTAATATGTGTGCAGTTGTAATTTATATAATTATTATTGATTTGAT
GGTACATACGATATTATTATTTGATTTGATTAATTTTTTAATATGATTTTCAAATTTTAA
ATATTGAGTATGAAATATTTTATCCAATACTTGTTGACCCATTTTATTTTTTGAAAATGA
AATTACAATGAATGATCTTAACTTTTTCCATTCATCAGGACTACATCCTAAAATACTTTT
ACCTTGAGCGTATCTTTTAATTGATGGTGTATGAAATCTTTCGAAAACACTATCACTTTG
TTCTATTACAACTTGTTTAATTAATTTTGGTGAATTTAAAACAATACAATCATGTTGACC
AAATTTAATGAAATAAAAATCTTTATATTTGTCATAAAATTTATCAAAACTTTTATGAAA
TCTTTCTTTATCAAATGCATATAATGCACCTAATAATGGTAAATTGAATGGACCTTGTAC
TTTTCGTAAATCATTTTTACCACCATTATTTAATAAATATTTATTTACAAAAATTACTGT
TAAAATTATTATTACTATAGTTAATATCATTGTGTGTTATTATTAATTTATTGGTTTTTA
TGAATATGAAAAAAAAATAATAGTGGTCTGTGTGTGTTTAAATATAAAAAAATTTCATTT
TTAATTGAATAAAAAATAAAAAAAAAAATAAAAAAAAAAAATAAATAATTTTAAATAAAT
TTAATTTTTTAAATTATTATTATTATTGAATTATATTTATTTATTTTGAATGTTGTAGGG
AATTTTCAATAAAATTTCACTACACAGCGTTCATCATCATCCATTGGCATATTAATAATA
TTAACTGGTAATAGTAATGGCGAATCTAATTGTTGTTGTTGTGGTGATGTTGTTGTTTTT
GTTGTTGTTGAAGAAGATGAAGAAGATGAAGAAGATGAAGATGATCTTAATTGCAGTGGT
TGTAGTTGATTTATTACATTCTTCAATTGGTTCCTCTGGAATTGCATGCTTCAGTTTCTT
GTTTAGTTAATTCTTTACTTGGTTTAATTTCTTGTAAAATTTGACTAGCCGTTGTTACAT
TATGTTGATTTAAATCTTGTGAGGAAGATAAAGAATTCTCTGCATTAACTTTTAATTGTA
AAATATCAAATGGTGATAATACTTCAGGTGCTTCATCAATCATTTTGGATGATTGTTACT
ATTATGTTGTTGTTGTTGTGATGATGATGATATTGATAATGATCAAGATGAAGCTGATTG
ATGATTTTTTCTTTCTTTACCAAGGAAATCAGTTGATCAATTCAAAGAAGAATTTAATTT
TGGCATCATATTGCACCGACTGATGGTTTGCGTTGAATAAAACTTGTTGATCTGGTGATG
ATGGTATACCCCTAATTGTTGGGGATTATCCAACTGATACACTTGTAGTAGTATTACCAC
CAATATTATTACTACTAGTATTTAATGGAGTTGGAATTGGTGTCGATGCACTACTTAATG
ATGGTGAAGATTGAGCTGATGATATCACATTATTATTATTATTATTTATTTTTTATTATT
GATAACAACTGGAGTAACACTTAGAATTGTTTTGAAAGAAACCTTCACTATCTAATTTTG
CTGCTTTTGTTCTTTCATCTTTTGAAATTAATACCCAATTAGTATTTTAATTTTGGTAAT
TCATTAATAACAAATAATATAAACTCTTCACATTTCTTTAAATTAAATTATTTTCAAATG
ATAAATATTTGAAGTTTTAGATTTCTTTAATGTTTGTTATTTGTTGTCCTTAAATTGAAA
ATATGAAAATAAAAAAAAAATGAAAAACAAAAACGAAAAAAAAAATGAGAAAAAAAAAAA
AAAAAAAAAAAAACCCTTTAAAAAACCTTAATTAAACCTATTATTCAAATAAAAAATATT
TTAATAACCTTTTTTTGTTTTTTAAAAAAAAAAAAAATTAAGGGCCAGGATTTTTTTTTT
TTTCCCTCCCCTC


>AFB727 (AFB727Q) /pub/dna_csm/LIBRARY/AF/AFB7-B/AFB727Q.Seq.d/
        Length = 1042

  Plus Strand HSPs:

 Score = 246 (91.7 bits), Expect = 3.7e-20, P = 3.7e-20
 Identities = 49/49 (100%), Positives = 49/49 (100%), Frame = +2

Query:     1 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 49
             MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE
Sbjct:    14 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 160



>IIAEP1D0380
        Length = 907

  Plus Strand HSPs:

 Score = 246 (91.7 bits), Expect = 4.2e-20, P = 4.2e-20
 Identities = 49/49 (100%), Positives = 49/49 (100%), Frame = +1

Query:     1 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 49
             MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE
Sbjct:    88 MILTIVIIILTVIFVNKYLLNNGGKNDLRKVQGPFNLPLLGALYAFDKE 234

CYP519H1P = old CYP523A1 seq 39, 73 complete 41% to seq 14 487 aa
MFIIYFIFLFLLIISLFIDF (0)
IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGDHYSIVVSDP 
VIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNSNFTKTKLT
KTIYNYLEDQTNQLIENMGNYSKSGEPV (1)
FLSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN frameshift
FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD (bad GT boundary)
PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN (1)
DCQEKAYNEIVSVMGEDCNKISYADRP
KLPYLVACINECLRMRTEDPLGIPRGAVEDIEINGYFMPKGAKVHHYLYAFGMNETVFEN
VNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRSCFGKNLSELEVFVVCSNILLNFELSSY
NGKPVDDFEIFGIHPPEFPVKLIKRK*
Contig1006 Chr 2
IIAFP1D52879
JAX4a43c04.s1 N-TERM
JAX4a44c04.r1
JAX4a127e01.r1
JC2a56d05.r1
JC2a201c07.r1
JC2b169a08.r1 
c-JC2b169a08.r1
JC2d58c03.s1
JC2e17c09.r1 similar to seq 39; may be seq 39
Length = 501
KISLNXIWGLFFSKKILQNKIFXNXKIKKIPXPIQIFXKKXXPPNX
FXNFXXILSPLLYFTKKNYQKNXSTSTNFIXXINXXHLKNLXXXXX
PKNLMDILIINSTKGKDKNKKPIXHIXYNFLMVGSNX


>Contig_4699, D. discoideum, sequenced by the D. discoideum Sequencing
            Consortium, assembled with Phusion
        Length = 3784

  Plus Strand HSPs:

Query:    13 IISLFIDFIKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD 72
             II L+   IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD
Sbjct:  2076 IIILYYI*IKKNLKKSNNDPPGPISLPLLGNLHNLTKNPHRGLKNLSDKYGGIFRCYLGD 2255

Query:    73 HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS 132
             HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS
Sbjct:  2256 HYSIVVSDPVIINEIYIKKFEKVCTRPNNDTFKMFSSGFKDLAFSDNYNIWSKIRTIVNS 2435

Query:   133 NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPVF-LSTITIYMKISLNVICKLFFS 190
             NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPV  +  I IY+ I + ++ +  F+
Sbjct:  2436 NFTKTKLTKTIYNYLEDQTNQLIENMGNYSKSGEPVCKIFNIYIYIYIYI*ILIQF*FN 2612

Query:   170 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 222
             LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN
Sbjct:  2633 LSTITIYMKISLNVICKLFFSKEILQNESFDNGKMRRIAVPIQIVCKELGAGN 2791

Query:   218 LGAGNFGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD 266
             +G+  FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD
Sbjct:  2776 VGSR*FGDFVGILSPLLYFTKKKYQKNSSTSTDFIGEINDEHLKNLDHD 2922

Query:   267 PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN 309
             PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN
Sbjct:  2973 PKDLMDMLIIDSTKGKDEDKEPIVHIGYDFLMVDQIRHLVLWN 3101

Query:   310 DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDIEI 369
             DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDIEI
Sbjct:  3131 DCQEKAYNEIVSVMGEDCNKISYADRPKLPYLVACINECLRMRTEDPLGIPRGAVEDIEI 3310

Query:   370 NGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRS 429
             NGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRS
Sbjct:  3311 NGYFMPKGAKVHHYLYAFGMNETVFENVNKFQPDRWLTNDQVHLKQMLNHLVPFSVGPRS 3490

Query:   430 CFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 482
             CFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK
Sbjct:  3491 CFGKNLSELEVFVVCSNILLNFELSSYNGKPVDDFEIFGIHPPEFPVKLIKRK 3649
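
The Query/Sbjct coordinates in the alignments above can be cross-checked with a little arithmetic: in a protein-versus-translated-DNA hit, the subject span in nucleotides should be three times the number of aligned residues, and the plus-strand reading frame follows from the subject start. The sketch below assumes the usual convention that frame = (start - 1) mod 3 + 1 on the plus strand. The function names and the HSP tuples are illustrative; the coordinates are copied from the Contig_4699 alignment, with one query gap in the 133..190 segment.

```python
def plus_strand_frame(sbjct_start):
    """Reading frame (1, 2 or 3) implied by a 1-based plus-strand start."""
    return (sbjct_start - 1) % 3 + 1


def span_consistent(q_start, q_end, s_start, s_end, query_gaps=0):
    """True when the nucleotide span equals 3 x the aligned columns.

    Aligned columns = query residues + gaps inserted in the query line.
    Gaps in the subject line are not handled in this sketch.
    """
    columns = (q_end - q_start + 1) + query_gaps
    return (s_end - s_start + 1) == 3 * columns


# (query start, query end, sbjct start, sbjct end, gaps in query line)
CONTIG_4699_HSPS = [
    (13, 72, 2076, 2255, 0),
    (73, 132, 2256, 2435, 0),
    (133, 190, 2436, 2612, 1),
    (170, 222, 2633, 2791, 0),
    (218, 266, 2776, 2922, 0),
    (267, 309, 2973, 3101, 0),
    (310, 369, 3131, 3310, 0),
    (370, 429, 3311, 3490, 0),
    (430, 482, 3491, 3649, 0),
]

for q1, q2, s1, s2, gaps in CONTIG_4699_HSPS:
    print(s1, s2, "frame", plus_strand_frame(s1),
          "ok" if span_consistent(q1, q2, s1, s2, gaps) else "CHECK")
```

Under that convention the 2633..2791 and 2776..2922 segments fall in different frames while overlapping in contig coordinates, which is consistent with the frameshift noted in the annotated translation.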

>Contig_4699, D. discoideum, sequenced by the D. discoideum Sequencing Consortium, assembled with Phusion
TAAAATTTTTTTTTTTTTTTTTTTTTAAAAATAAGGTTTTTTTTGGAAAAAAAAAAAAAA
AATTTTTTAAAAAAAAAAAAAAATTTTAAATTTAAAAAAATTTGAAATTTTTTTTTTTTT
TTTTTTTTTTTTTTTTTTAAAGGTTTGAAAATTCTCCATTGGGGCTCGGTTTTTTTTTAA
TTTTAATTTTAATTTTTAATAAATTAATTTTTTTTTTTTGGGTTATTTTTAACTTTTAAA
AATTGGGAAAAAAAAATATTTTGGATTTTTAAAAAAAAAATGTTTTTTTTTTTTTAACAA
TTTTATTCCTTATTAAATTTCCAAAATGGGTTTTTTTTTTTTTTTTTTTTTTAATGGAAA
AAAATTGGTTTTTGGAAAATATAGATTTACATCCATAATATCAAAAATCGATATTTTTAG
TTAAAAAAAAAAAAGTATTTTGGGGGGTTTTTTTTTTTTTTTTTTTTATTTAATTAGTGG
TCATATGTTATAAAATAAAAAAAATTATTTAAATCATATGACCACTTAAAAAAATAGTCT
AAATGTGTATTAGTGATATAAGTAGATTATTTTAAAAAGTTTCTTTTC