>gnl|ti|647066038
1095898227332 34% to 17A1 35% to 2U1 fugu 33% to 2U1 human
74% to 1095901734433
1097567032902
1096761105127 1096123323522 1096088745900
Combined seq from CN769290 and CN769570 39% to CYP17A
EST = CN769570.1
mate pair of 1096088745900 had partial match to N-term.
Walked upstream to 1097672588127; N-term still missing, end of this exon seq not certain
cannot walk upstream any further
(1)
AFNRNTNSLINSDPGPRFKILRKLASSSLKIYAEGLLGMERIAISEYCELSKKLQSIKEKPVSVHKIM (1)
AGCATTCAACAGAAATACGAACAGCCTCATTAACAGTGATCCAGGCCCGCGTTTTAAAATTTTA
CGAAAGTTAGCATCATCTTCTTTGAAAATTTACGCTGAGGGTTTATTGGGAATGGAAAGA
ATAGCAATCAGTGAATATTGTGAACTGAGTAAAAAGTTACAATCAATAAAAGAAAAACCA
GTATCGGTTCATAAAATAATGGGT
(0)
QSTLNIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNETSYVSS
IPLLRYFPTATSRNIFEIIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGE
ELTEKITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRY
VSLKDRPMLHLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHH
DESYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLK
DYRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPRN*
AGCAAAGCACACTTAACATTA
TTTGTACCATTCTTTTTAATCATCGCTACGAGGATGACAACCAGGAGTTTCAGAATATCA
TAAAATACTCAAGTTTAATCGTTCAAACTTTTAATGAAACCAGTTACGTATCTTCCATTC
CATTGCTGCGCTATTTCCCAACGGCAACGTCGCGAAATATTTTTGAAATCATAAGGCTTC
GTGATCCGATTTTAAAACGAAAACTCCAAGAGCACAGAAAATCTTACGATAAGAATAATT
TACGTGACATAACCGATGCATTAATAAAAGTGTCTTTAGATTCAGAGATGGGTGAAGAAT
TAACTGAAAAGATTACTGATGATAATATTGAGTTTCTTTTAAACGATTTTATGATTGCTG
GATCCGAAACTTCATCAAGTACTATTCTTTGGTTTATTGTTTACATGTTACATTGGCCAG
AATACCAAAATAAACTTTATGATGAAATTACTAAAGTAGCATCAGATAACCGTTATGTAT
CTTTAAAGGATCGACCTATGCTTCATTTAATGCAAGCTGCAATTCATGAAACACTTAGAC
TGTCATCGGTGGTACCTCTTGGTTTGGTTCATAAAGCAATGGAGAACAGTAGCATTTGTG
GCAAGTTTGTTCCTAAGGGAGCTCTTATTTTAACAAATTTATGGAGTATGCATCACGATG
AAAGCTATTGGAAAAATGCAATGAGTTTTTACCCGGAACGTTGGCTGGAAAAATCTGGCG
AGTTCAATTATAAATTGGGGTACGCATATTTACCGTTTTCTAATGGACCTCGTAGTTGTT
TAGGAGAAACATTGGCAAAAACAGAGTTGTTTGTGTTTATTACACGATTACTTAAAGATT
ACCGATTTGAAATGCCAACTGGAAAAGAGTTACCTTGTTTAGATGGTCGTTCTGGAATCA
CCTCCCCTCCTAATGACTTTGAAGTCGTGATAATTCCAAGAAATTAA
>complete combined seq CN566859
CN566581 CYP2 clan member [gene 2]
1097326058990
32% to 2X9 aa 26-146 34% to CYP17A
MFLEVIGAVFIPPLIWTIWVYIKHLIDCLHYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYG
DVFSFSVGSVRYVIVNSLEGIQEVLVKKGWQFAGRPKGP
()
SWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAIEESFQLNKKLLETNGKPFSMQEIT
(1)
1097329249233
(1)
TLCVLNIICSILFNHRYKEDDLEFQDIIKYSNICFKERGVNNYIISIPWLRY
FPSASSRNLDEMIKIRDPLL
KKKVQEHKRSYDEYNLRDLTDALIKASNSETGQDPDEKVTDDNIVFILN
NFILAGSETSSNTILWFIVYILHWPEYQDKLYDEILKVTSGSRYPCLKDRPSLHLMQAAI
YETLRLSSVAPFGLHHKAMEKSSICGKSI
PKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL
395
394
AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227
AGATGCAACTGATAATAAATTCGCGATCCGCTATAAAGAAAAAA
GTCCAAGAGCACAAAAGATCGTATGACGAATATAATTTACGCGATCTAACAGATGCTTTA
ATAAAAGCATCAAACTCGGAGACGGGACAAGATCCGGATGAAAAAGTTACTGATGATAAT
ATTGTATTTATCTTAAATAATTTTATACTCGCAGGATCAGAGACTTCATCAAATACGATT
CTTTGGTTCATTGTTTATATTTTACATTGGCCGGAGTATCAAGATAAACTTTATGATGAA
ATTTTAAAAGTAACATCAGGTAGCCGTTACCCTTGCTTAAAAGATCGCCCGTCACTACAT
TTAATGCAAGCTGCAATTTATGAAACACTTAGGTTGTCATCGGTCGCACCTTTTGGTTTA
CATCATAAAGCCATGGAGAAAAGTAGCATTTGTGGAAAATCTATCCCTAAAGGCGCTCTT
ATAATAACCAATCTATGGAGTATACACCATGACGAAAGCTACTGGAAAAATGCAATGAGT
TTTTACCCTGAACGTTGGTTGGAAAGTTCTGGCGAATTTAACTCTAAACTAGGAAATGCG
TATTTACCATTCTCTAGTGGACCTCGTAGCTGTATTGGAGAAACATTAGCAAAAACTGAG
>1097329039870
1096703377333 1097664213870
MLFKVIGTILIPPLIWVVWIYIKHLVDCLSYPQGPFPLPFIGNAHLIRNRESYKVF
SEFQKIYGSVFGFSIGSTRYVVVNNLEGVQEVLIKKGSQFAGRPRRA
(1)
ATGCTCTTTAAAGTCATTGG
TACAATCTTGGTTCCACCTTTAATATGGGTTGTATGGATTTATATCAAACATCTTGTTGA
CTGCTTGTCCTATCCTCAAGGACCATTTCCTCTCCCATTTATAGGAAATGCTCATTTAAT
AAGAAATAGGGAGTCTTATAAAGTGTTTTCTGAATTTCAGAAGATTTATGGCAGCGTTTT
TGGATTTAGCATTGGCTCAACCAGATATGTGGTTGTAAATAACTTAGAAGGAGTTCAAGA
GGTTTTGATCAAAAAAGGTTCACAGTTTGCAGGCCGCCCAAGACGAGCAAGT
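The peptide lines in these records can be checked against the nucleotide lines by translating with the standard genetic code. The following is a minimal sketch, not the method used to produce these notes: the function name `translate` and its `offset` parameter are illustrative assumptions, and the example input is the first nucleotide line of the record above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate DNA starting at `offset` (hypothetical helper).

    Whitespace and line breaks are removed first, so a wrapped
    sequence block can be pasted in directly. A trailing partial
    codon is ignored; codons with unknown bases become 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(offset, len(dna) - 2, 3):
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)


# First nucleotide line of record 1097329039870 (starts at the ATG).
exon_start = "ATGCTCTTTAAAGTCATTGG"
print(translate(exon_start))  # MLFKVI, matching the start of the peptide line
```

Exons that do not begin on a codon boundary need a non-zero `offset`; the right value depends on how many leading bases belong to flanking sequence or to a codon split across exons, which has to be judged per record.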
>1096703827379
1095896863976
MFPEIVGAIMLPPLIWAAWIYIKHLVDCLVYPRGPFPLPFVGNAYLFSKGKPYKEFVKLG
103
KTYGDVFGFSIGSIRYVVVNSLEGIKKXXXXXXXXXXXXXXXX
ATGTTTCCTGAAATCGTTG
GCGCAATTATGCTTCCTCCCTTGATATGGGCAGCGTGGATTTACATAAAACATCTTGTTG
ACTGTTTAGTTTATCCCCGAGGACCATTTCCACTACCTTTTGTAGGAAATGCATATCTCT
TCAGTAAAGGCAAACCTTATAAAGAATTTGTTAAACTTGGAAAAACTTACGGCGATGTAT
TTGGCTTTAGCATTGGTTCAATACGATATGTAGTCGTGAACAGCTTGGAAGGTATCAAGA
AGT
>1095899160393
frameshifted
MFFEVIRAFFTPPLVWIIMVYIKNLIDYLYYPREPIPLPFIGNGDLIRKAEPFKEL
VNLEKKYGDVFSFRIGLVRFVVVSSLEVILEILVKKGWQANGRPKAP
(1)
ATGTTTTTTGAAGTTATTCGCGCCTTCTTTACTCCACCTTTGGTATGGATTATAATGGTTTATATAAAA
AATTTAATCGATTATTTGTATTATCCACGAG
AACCGATACCACTACCATTTATTGGAAATGGTGATTTGATAAGAAAAGCAGAACCGTTTA
AAGAGTTGGTTAACCTGGAAAAAAAATATGGCGATGTTTTTAGTTTTAGGATTGGTTTAG
TCAGATTTGTGGTTGTTTCA
AGTTTAGAAGTAATTTTAGAAATACTAGTAAAAAAAGGGTG
GCAGGCAAATGGTCGTCCAAAAGCTCCAAGT
>1097329360095
4 aa diffs to CN566859 from PKG
FYLNNFILAGSETSSNTILWFIVYILHWPEYQDKLYDEILKVTSGSRYPCLKDRPSLHLM
QAAIYETLRLSSVAPFGLHHKAMEKSSICGKSIPKGALIITNLWSIHHDESYWKNAMSFY
PERWLESSGEFNSKLGNAYLPFSSGPRSCIGETLAKTELFIFISRLINDFRFVKPISEEL
PRLDGSFGITCTPYDFKVEIVPRSKNLLF*
TTTTATCTTAATAATTTTATACTTGCAGGATCAGAGACTTCAT
CAAATACGATTCTTTGGTTCATTGTTTATATTTTACATTGGCCGGAGTATCAAGATAAAC
TTTATGATGAAATTTTAAAAGTAACATCAGGTAGCCGTTACCCTTGCTTAAAAGATCGCC
CGTCACTACATTTAATGCAAGCTGCAATTTATGAAACACTTAGGTTGTCATCGGTCGCAC
CTTTTGGTTTACATCATAAAGCCATGGAGAAAAGTAGCATTTGTGGAAAATCTATCCCTA
AAGGCGCTCTTATAATAACCAATCTATGGAGTATACACCATGACGAAAGCTACTGGAAAA
ATGCAATGAGTTTTTACCCTGAACGTTGGTTGGAAAGTTCTGGCGAATTTAACTCTAAAC
TAGGAAATGCGTATTTACCATTCTCTAGTGGACCTCGTAGCTGTATTGGAGAAACATTAG
CAAAAACTGAGTTGTTTATTTTTATATCCCGATTAATAAATGATTTCCGATTTGTAAAAC
CGATATCAGAGGAATTACCGCGTTTAGATGGTAGTTTTGGCATCACTTGTACTCCTTATG
ACTTTAAAGTTGAAATAGTTCCAAGGAGTAAAAATTTACTGTTTTAA
>1097509039345
92% identical to 1096064108200, probably joins with 1095898835518
1096625274183 1095900033599 1095896933215 100% match so this similar seq is real
1097206379175 1097678021634
MFLEVAFGVVTPLFLYVIATYLDHLFKCRFYPPGPFPLPIIGNLHLIGKKPHEKFVEYSK
538
KYGEVFSLSFGMHRVVIVSGKDSIREVLVQKSNIFAGRPKNYIANIVSRGYKNIGYGDIG
718
PKWKILRKIAHSSLKNYGESTAHLETLVVRESEELHKNLYKKSNRSTKLEHKF
(1)
>gnl|ti|649400787
1095898835518 93% identical to 1096064108200, 39% to 17A1 fugu, 35% to 2U1
gnl|ti|647175227
1095898288652 1096602038000
(1)
GVAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFTGVAGTNAISFIPWLRFLPLDGLR
KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDY
LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP
LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH
EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKP
GDSLPSLYGNCGLL*
AGGTGTTGCGGTATTAAATGT
CATTTGCTCTATTGTATTTGGAAAACGCTATGAGTACGAAAATTGTGAATTTAAAGAAAT
CCTAACCTACATGAATTATGTTTTTACTGGTGTAGCTGGTACAAACGCAATTTCTTTTAT
TCCGTGGCTTCGTTTCCTTCCATTAGATGGATTACGAAAATTAAAAAAAGGACTTTCAAT
TAGAGATCCGGTTCTTCGGAAGCAGTTGTTATATCACAGAGAGACCTACAATGAAAGTAA
CCTGCGTGACTATACAGACTATGTCATACAATTTTCAAGAGATGAGGCCATCTTGAAAAA
GTTTGGAGAACAGCTAACTGATGACTACTTAGAGCTTTTACTTAATGATATATTTATAGC
TGGAACTGAAACTGCATTGACAACTTTACTTTGGTCAATTATCTACCTTATTCACTGGCC
AAAGTTTCAAGACAAAATTTACAATGAAATTGTTTCAGCTATTGGTAAAAATAGATATCC
TTCTATGAAAGATCGTAATATGCTGCCTCTTGTTAACGCTGCGTTATCAGAAACATTGCG
GTTATCTTCTGTTACTCCATTAGGAGTACCTCACAAAGCTATGGAAGATACAACTCTCTT
GAATGATTTAAAGATTCCCAAAGGCACCACAATTTTAACGAACCTTTGGCAATTACATCA
CAATAAAAACTGTTGGGAAAATCCACATGAGTTTAATCCATATAGATGGTTTACTAATGA
TCAAACACTTGATTCTATAAAATCTATGAATTTTTTACCTTTTTCTGCTGGTACCAGAGT
GTGTTTAGGAAAGGGTATTGCTGAAGTTGAACTTTTTCTTTTTTACTCAAGGCTGGTTCG
TGATTTTAAGTTTGAAGTAAAACCCGGCGATAGTCTTCCAAGTTTATATGGAAATTGTGG
ATTACTCTAA
>gnl|ti|648017453
1095896110991
52 1e-05 35% to 17A1 fugu 34% to 2U1 fugu
gnl|ti|647987527
1095895119635
1096703762277
used this seq to walk upstream past a repeat; could not go further
71% to 1095898227332
(1)
ELTTLNIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSASNLLSSIPWLRYFPTTASKYIQ 707
706
EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527
526
DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347
346
HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167
166
LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYFEKSVEEDLPRLDSF 374
375
PGVTRSPYDFKVVVVSRS*
>gnl|ti|647193621
1095899233960 1096082123583 1097696262164 1096620040714
1097206342731
Combined sequences BP505786 and CB073123 and CB271974 40% to CYP17A [gene 3]
CN570733
same as CN570522 BP505786
50% to 1095898835518 37% to 17A1
(1)
VTGVMNVLCGIVFGTQYEENDKELEKVISFKQLILDGVADTFAISFLPWLRFFPSNGLKKVRK
GVLIRDKLLRFQLKKHRETYNPVQIRDYTDYVLKYSKEFETSRNIDEQLSEDNMEMM
LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES
AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY
RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFNLCLKPGASTP
SLNGVLRVTLTPDTSYIILKPRSNNLISQKIEA*
AGTTACTGGAGTGATGAACGTTCTTTG
TGGAATTGTTTTTGGTACACAATATGAAGAAAATGATAAAGAACTTGAAAAAGTCATATC
TTTTAAACAGTTAATATTAGATGGAGTAGCAGATACATTCGCAATATCTTTTTTGCCGTG
GTTAAGGTTTTTTCCTTCAAACGGATTAAAGAAAGTACGAAAAGGCGTGTTGATAAGAGA
TAAACTACTTAGGTTTCAATTAAAAAAACATCGAGAAACATACAATCCAGTTCAAATAAG
AGATTACACTGATTACGTACTTAAATACTCAAAAGAGTTCGAAACTTCAAGAAACATAGA
TGAGCAGTTAAGTGAAGATAATATGGAAATGATGCTTCAGGATATTTTCATTAGTGGTAG
CGAAACAACTATATCAACACTTCTTTGGTTTGCTGTTTATTTAGTTAACTGGCCAAAGTA
TCAAGATGATATCTATGATGAAACTATTAAAATAGTCGGTAATGATAGGTATCCTAGTCT
TTCAGATCGTCCAAAGCTTCATTTATTTGAAAGTGCTATGAAAGAAACTCTGCGTTTGTC
GTCTGTCATTCCATTAGGTTTACCTCACAGAAGTCTTGAAGAAACCAGCATAAAAAAATT
TAAAATTCCTAAAAATACAAACGTAATGATTAATCTGTGGCAGTTGCACCATGATAGTAA
ATCTTGGAGTGATCCTCATACATTTAATCCATATAGATGGTTAAATGACAAGAATATCTT
TGACAAAAGCAAAAACCCAAACTATCTTCCATTTTCAACCGGATTAAGAGCCTGCTTAGG
TTATCACACAACCGAATCCATCATTTTTTTGTTTTTTACCCGATTGATAAGAGATTTTAA
TCTTTGTTTGAAACCTGGCGCATCTACTCCAAGTTTAAACGGTGTTTTGCGAGTAACCTT
AACTCCTGATACGTCATACATTATTCTAAC
>gnl|ti|648033522
1095897342515 39% to 17A1 N-term
1095899118747
1095900033599 1096071090512 1096703396910
1096608233968
MFLEIAFGVTAPLLLYVIATYLDHLFKCRFYPP
GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK
SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK
ESEELHKRLFKNCNRSTELEDEF
(1)
ATGTTCTTAGAAATTGCTTTTGGAGTAACAGCTCCTCTGCTTTTGTATGTCATTGCAACTTATCTAG
ATCATTTGTTTAAATGCAGATTTTACCCGCCAGGCCCTTTTCCTTTACCGATTATTGGGA
ACTTACATTTGATTGGAAAAAAACCACATGAAAAGTTTGTAGAATATTCAAAAAAGTATG
GAGAAGTATTCAGTCTAAGTTTTGGAATGCATCGTGTTGTTATTGTTTCAGGAAAAGATT
CTATTAGAGAGGTTTTGGTTCAAAAATCAAACATTTTTGCAGGGCGTCCTAAAAACTACA
TTGCTAATATTGTATCTCGTGGTTATAAAAATATTGGCTACGGAGATATTGGACCTAAAT
GGAAAATTTTGAGGAAAATTGCTCACTCTTCTTTAAAAAACTATGGAGAGTCAACTAAAC
ATTTGGAAACGCTTGTCGTAAAAGAAAGCGAAGAGCTACACAAAAGACTTTTTAAAAATT
GTAACAGATCCACAGAGCTAGAAGATGAGTTTGGT
1096064108200
93% to 1095898835518
1097206931796(9 aa diffs)
1097206498632
walked up to 1096081234652, found mate pair 1096071090512
already known N-term seq matches 1095897342515 100%
1095897342515
38% to 17A1 fugu whole seq.
MFLEIAFGVTAPLLLYVIATYLDHLFKCRFYPP
GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK
SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK
ESEELHKRLFKNCNRSTELEDEF
(1)
(1)
GVAVLNVICFIVFAKRYENKDSEFKKILMYMNYVFSGVASTNFASFIPWLRFFPLDGLR
KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDYLEL
LLNDIFIAGTETALTTLLWSIIYLIHWPKFQDEIYNEIVSTIGKDRYPSMKDRNMLPLVN
AALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNENCWENPHEFNPYRWF
TNDQALDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKPGDSLPSLDG
NYGITLTPRIFTTFVVARNDSLVAQNHSL*
>gnl|ti|647182814
1095899213949 1095958075467 1095733042694
1097672545497
54% to 1095898835518, 36% to 17A1 36% to 2U1
walked upstream to 1097672406696 which mate pairs to exon 2 below
(1)
GVAVLNVICFIVFGERYQYSDPAFIEILTTINNIVSGLSNTTAVDFLPGLRYLQFSEIK 256
257
KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436
437
VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616
617
ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796
797
HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFE
GVPGCPLPSLIGKCSITLAPEEFNVHVTPRINSLMFSKNVLPE*
>combined seq CN774619 CN775634 CYP2 clan member [Gene 1]
32% to CYP1C1 aa 173-297 29% to 17A2
2 ESEELHKRLLMKSKTSVDLKTEFGAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV
175
176
DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343
344
TDSIIN 361
>1096526199166
frame3_ORF1 7aa diffs to CN774619 may be same gene
(1)
GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS
LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI
TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL
RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD
KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN
ITHAPKQFCAYLTPRINNLM*
AGGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAGATTCAGAAT
TTAAAGAAGTTCTTACAACAATAAATGATATAGTCGATGGGTTGTCAAATACAACTGCTG
TTGGATTTTTGCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTT
CACTTTCAAAATATGTTCGTTTTTTAAACGATAAGTTGAAAAAACATAAGGAAACATTTG
ATGAAAAGAAAATTCGAGATTTTACTGATTCTATTATAAATTTTTCTAATAACGAAGCTG
TCAAACAAAAATTTAAAAACGTTGATGAACATTTAGAGCCTGTGATTGGGGATTTATTTA
TAACGGGTAGTGAGACCACATTAACATCTTTATTGTGGTTAATTCTTTATATGATGCATT
ATCCCAAATATCAACAAGAAATTTTTAAAGAAATTACAACGGTTATTGGTGAAGACCGGT
ACCCATGTTTAAATGACCGTGATTCTTTGCATCTTGTTAAAGCCGCATTAAAAGAGTGTC
TGCGTTTATCTTCAATTGTTCCTCTTGGATTACCACACAAAACAACCAAAGAAACAGTTC
TTATGGGACATAGCATTCCTGGGAATGCAACAGTCATGATTAATCATTGGCAGATTCATA
ACGATACTAACTACTGGGAAAATCCTAACGAATTTAATCCTTATCGGTGGATTGGTAAAG
ATAAGAAATTTGATCCAAGTAAAGCAACAAGTTTTTTACCTTTTTCAGCCGGTACAAGAG
TTTGTTTAGGGAAAACAGTTGCTGAAAATGAACTATTTTTCTTCTTTTCTAGATTAATTC
GAGATTTTAACTTTGAGTGCATACCTGGTTGTCCACCTCCAAGTTTAATTGGTAAATGCA
ATATTACTCATGCTCCAAAACAGTTTTGCGCATACTTGACTCCAAGAATAAACAATCTAATGTAA
>whole gene 1095899272864 1096526199166
MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS
KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY
GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1)
GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS
LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI
TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL
RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD
KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN
ITHAPKQFCAYLTPRINNLM*
>gnl|ti|655005893
1095958068757
44 0.002 43% to 4V5 fugu 36% to 4T5
gnl|ti|651153924
1095901025079 N-term
gnl|ti|651153911
1095901025066
1097206604076
1097206339312
complete gene, no introns; ESTs = CV566433.1 CX054637.1 CV566166.1
MVSVFYILFSGLVFYVVSKILWKLWRNSYGLSSIVTPPNVPFFGTSLYLHSDA
RKFFFQLYDYTRRYGDVFCIWLGPKPVICSSSVKFSEAVLSSQKVITKGFSYDFLHDWLK
TGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQVP
IGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL
568
567 PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI
397
396
DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217
216
QSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40
39
PNDFIPERFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEK
ILLYSIMKNFHLKSMQNENEVFGTLDIIHKSINGINIKFTRR*
ATGGTATCAGTTTTTTATATATTATTTAGTGGACTT
GTTTTCTATGTTGTTAGTAAGATATTGTGGAAGTTATGGAGAAATTCATATGGTTTATCA
TCAATAGTTACACCTCCAAATGTACCATTTTTTGGAACATCTTTGTACTTGCATAGTGAT
GCCCGCAAATTTTTTTTCCAACTATATGACTACACAAGAAGATATGGCGATGTGTTTTGC
ATTTGGTTGGGGCCAAAACCAGTAATATGTTCTTCCTCTGTAAAATTCTCAGAAGCAGTA
TTAAGTAGTCAGAAAGTTATCACCAAAGGATTTTCTTATGATTTTTTGCATGACTGGTTA
AAAACTGGGTTACTTACAAGCACAGGATCAAAATGGAAAACACGTAGAAGGCTACTAACT
CCAAGTTTTCATTTTTCTATACTCAATAACTTTATTAAAATATTCGAAGAGCAAGCATCC
ATTCTGGTGGACAAACTAGCTGTAGCTGCTGACAACAAGGAAGTTGTAGATGTGCAAGTA
CCTATTGGTTTGGCAACCTTGGATATAATCTGCGAAACTTCAATGGGTGTAAAAGTAAAT
GCACAAAGTCATCCAGATTCTGAGTATGTTAAAGCT
ATCACAGTTTTAAATGAAGAAATTCAAATGCGTCAAAAGTTTCCTTGGCTTTG
GTTTGATGCCATTTACAAACTGTTGCCTTGTGGGAAAAGGTTTTATAAGGCTTTAGATGT
TGCTCATAAGCTATCTTTTGATGTAATAAATGAACGCATGCAAATGAAAATTCAAGAATC
TTATTGTGAGACTGCGTCAGATGAAAAGAAATTTTTTTTAGATTTATTGTTAGATATATA
TCGCAAAGGTAAAATTGACACTGAAGGTATTCAAGAAGAAGTTGATACTTTTATGTTTGA
AGGTCATGATACAACTTCAGCTGCACTAGGTTGGACTCTTTGGTTGTTAGGAAAAAATCC
AGATGTTCAAAAAAAGCTGCACAAAGAAATTGATGAGATAGAGTTAAATGGAGGTTCACT
TTATGATAAAGTCAGACAGTCTAAATACCTTGAAATTATTCTTAAAGAATCATTACGAAT
GCATCCTCCTGTCCCTATGTATGGAAGAACAGTTGAGGAAGATATGACTATTGATGGTCA
GTTTGTTCCCAAAGGAGCACAAATAGTTCTTTTAGTTTTAATCTTGCACTCAAACCCTGA
TTATTGGGAAAACCCAAATGATTTTATACCTGAACGT
TTTGAAGCTGATAGTTATGAAAAGCGCAACCC
ATACAGTTATGTACCTTTTTCTGCTGGACCAAGGAATTGCATTGGCCAAAAATTTGCCAT
GATTGAAGAGA
AAATATTACTGTATAGCATAATGAAAAACTTCCATCTTAAGTCAATGCAGAATGAAAATG
AGGTTTTTGGTACTCTTGATATAATTCATAAGTCAATTAATGGAATTAATATAAAGTTCA
CAAGAAGATAA
>1096064105622
very similar to 1095958068757 varies at N-term 86%
346
MIYASYLVLVGLFVFFVSKILWKLWKSSYGLETIATPPNIPVFGTSLYLHSDARKFFFQL 525
526
SEFTKKYGTVFCIWLGPKPMIISSSVKFSEAVLSSQKVITKGFSYDFLHDWLKTGLLTST 705
706
GSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQVPIGLATLD 885
886
IICETSMGVKVNAQSHPDSEYVKAITVLNEEFVMRIKYPWXLWFDVIYKLLPCGKR
34 aa gap between these two seqs
>CV564924.1 EST 93% to 1095958068757
EKKFFLDLLWDIYRKGEIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQRKLHKEIDEIE
LNGGSLYDKVRQSKYLENILKESLRMHPPVPMYGRTVEEDMTIDDQFIPKGAQIILLVLM
LHSNPEYWENPNDFMPERFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEKILLYSIM
KNFHLKSMQDENEVFGTVDVIHKSINGINIMFTRR
GAAAAGAAATTTTTTTTAGATTTGTTATGGGATATATATCGAAAAGGTGAAATTGACACTGAAGGTATTCAAGAAGAAGTTGATACTTTTATGTTTGAAGGTCATGATACTACTTCAGCTGCACTAGGTTGGACTCTTTGGTTGTTAGGAAAAAATCCAGACGTTCAAAGGAAGTTGCACAAAGAAATTGATGAAATAGAGTTAAATGGAGGTTCACTTTATGATAAAGTTAGACAGTCTAAATACCTTGAAAATATTCTTAAAGAATCATTACGAATGCATCCTCCTGTCCCTATGTATGGAAGAACAGTTGAGGAAGATATGACTATTGATGATCAGTTTATTCCCAAAGGAGCACAAATTATTCTTTTAGTCCTAATGTTGCATTCGAACCCAGAATATTGGGAAAATCCAAATGATTTCATGCCTGAACGTTTTGAAGCTGATAGTTATGAAAAGCGCAACCCATACAGTTATGTACCTTTTTCTGCTGGACCAAGGAATTGCATTGGCCAAAAATTTGCCATGATTGAAGAGAAAATATTACTGTACAGCATAATGAAAAACTTCCATCTTAAGTCAATGCAGGATGAAAATGAAGTATTTGGGACTGTTGATGTAATCCATAAATCAATTAATGGAATTAATATAATGTTCACCAGAAGAAAAGGAAAAACTTATCTTGTTTAGTTTAGTTCATTATTTATCAGTAATTTGAAATAAT
>1096064105622 90% to 1095958068757 varies at N-term
1096071088011 joins CV564924.1 EST
MIYASYLVLVGLFVFFVSKILWKLWKSSYGLETIATPPNIPVFGTSLYLHSDARKFFFQL
SEFTKKYGTVFCIWLGPKPMIISSSVKFSEAVLSSQ
KVITKGFSYDFLHDWLKTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILV
DKLAVAADNKEVVDVQVPIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEFVMR
IKYPWLWFDVIYKLLPCGKRFYKALDVAHKLSFDVINERMQMKIRESYCETASDEKKFFL
DLLLDIYQKGEIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQRKLHKEIDEI
ELNGGSLYDKVRQSKYLENILKESLRMHPPVPMYGRTVEEDMTIDNQFIPKGAQIILLVL
MLHSNPEYWENPNDFMPDRFEADSYEKRNPYSYVPFSAGPRNCIGQKFAMIEEKILLYSIM
KNFHLKSMQDENEVFGTVDVIHKSINGINIMFTRR
>gnl|ti|655009968
1095963046224
42 0.010 46% to CYP20
35% to 27B1
419
DGGIHKFLVENHKRLGPMFSFYWGKELAVSLACPILFKEVATLFNRP 559
>gnl|ti|646849327
1095897329284 1097672251908 mate pair = 1097672200068
has exon 2
40% to 2X2 N-term
MLLQITCGFLFPPLIWIVWTYIKHLYDCLSYPQGPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIRYVVVNNLEGIKEVLIKKGSQFAGRPRLKFTI
(1)
ATGCTTCTTGAAATTACTTGTGGGGTTCTGTTCCCACC
GTTAATATGGATTGTCTGGACATATATTAAACATCTTTATGATTGTTTGAGTTATCCACA
AGGACCAATACCACTGCCATTTATAGGAAATGCTCATCTTTTAAGAAAAGGTGAACCTTA
CAAGGAATTAGTTAATCTTGGAAAGATATATGGTGATGTTTTTGGATTTAGTATTGGTTC
AATTAGATATGTAGTTGTAAACAATTTAGAAGGTATTAAGGAAGTTTTGATTAAAAAAGG
TTCACAGTTTGCTGGTCGTCCAAGGCTAAAGTTTACTATTAGT
Exon 2
1097331043073 1097206900216 1097672200068 mate pair = 1097672251908
This mate pair has 2 aa diffs to 1095897329284
one nuc diff, same aa seq
1096124035195
1096041114543 1097329360644 1096625189581
1095958061778
1095898207031
(1)
ALSRGMNGLIMSDPSPHFRILRKLASSSLKIYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI (1)
AGCTTTGAGTAGGGGTATGAATGGCCTTATTATGAGTGATCCT
TCACCACATTTTAGAATTTTACGAAAATTAGCATCATCTTCGTTAAAAATTTATGCTGAA
GGATTAGACGGGATGGAAAAAAAAGCTATAAATGAGTACAGTTATTTGCATAAAAAATTA
TCAACAATGAATGGAAAGGCTGTATCTTTAAAAAGAATGATAGGT
>1096124019772
related exon 2 5 aa diffs to 1097331043073
1096123858905
1096123680637
(1)
ALTRAMNGLIISDPSPHFKILRKLASSSLKLYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI (1)
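Notes such as "5 aa diffs" can be reproduced by a position-by-position comparison when the two peptides are the same length and need no gaps. The sketch below makes that assumption explicitly; `aa_diffs` is a hypothetical helper name, and the two inputs are the exon 2 peptides listed under 1097331043073 and 1096124019772 with the phase markers removed.

```python
def aa_diffs(seq_a, seq_b):
    """Return (1-based position, residue_a, residue_b) for each mismatch.

    Assumes the sequences are already aligned without gaps, so they
    must have equal length. Gapped comparisons need a real aligner.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be the same length (ungapped)")
    return [
        (pos, a, b)
        for pos, (a, b) in enumerate(zip(seq_a, seq_b), start=1)
        if a != b
    ]


exon2_a = "ALSRGMNGLIMSDPSPHFRILRKLASSSLKIYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI"
exon2_b = "ALTRAMNGLIISDPSPHFKILRKLASSSLKLYAEGLDGMEKKAINEYSYLHKKLSTMNGKAVSLKRMI"

diffs = aa_diffs(exon2_a, exon2_b)
identity = 100.0 * (len(exon2_a) - len(diffs)) / len(exon2_a)

print(len(diffs))        # 5
print(diffs)             # positions 3, 5, 11, 19 and 31 differ
print(round(identity))   # percent identity over the 68 compared residues
```

The same function gives a percent identity for any ungapped pair, but the percentages quoted elsewhere in these notes most likely come from alignment searches and may not match a simple positional count.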
>1097265020030 new N-term, weak, with frameshifts and a stop codon
no exact matches exist so this may be poor quality sequence
TCGCTYPQKIWNVL
WTDIKHLSDSESYPQGPISLPI
XXXAHIERKGETYREIDRLR*IYGDDIGMCIGTLRYVDVNNLEGIRDVLIYTGTQFL
ACGTGTGGGTGTACGTACCCACAGAAAATATGGAATGTC
TGGACAGATATAAAACATCTCTCAGATAGTGAGAGTTATCCACAAGGACCAATATCACTGCCAATT
GCACATATAGAAAGAAAAGGTGAGACATACAGGGAGATAGATAGACTTAGATAG
ATATATGGTGATGATATAGGTATGTGTATCGGTACACTTAGATATGT
AGATGTAAACAATTTAGAAGGTATTAGGGACGTTTTGATTTACACAGGTACACAGTTTCT
CTGGT
>1096110062131 related exon 2 73% to 1095897329284
1097331675401 1097646001099 1096704247756
(1)
AWSRALNGLVACDPGPRFKVLRKLASSSLKIYAEGLDGMEKKAADEYSHLNKKLQTMNGKPVSLQNMI (1)
mate pair of 1097646001099 = 1097664041480, continues on 1096703402618
1097329754969
1097664053056
possible frameshift at NDRP_LHL
(1)
ELGTLNIICTILFNHRYEEDDKEFQDIIKYSNLTVKIFGGTSILSSIPWLRFLPSASSRSIYE
IVRIRDPLLKKKLQEHKSSFDENNLRDVTDVLIKVSLGSDIAKGSEEKITDENIEFLLND
FIIAGSETSSSTILWFIVYLLHWPEYQDKLYNEIIKVTSGKRYPCLNDRP
?
LHLTQATIHETLRLSSVGPLAIVHKAMENSSICGKPVPKGAFILTNLWSTH
HDESYWKNPMCFYPERWLEKSGEFNSKLGYAFLPFSGGPRSCLGEALARTELFVFFSRLV
TDYRFEKPNGEELPRLNGRFGLTCSPFDFKSVVVPRC*
AGAGTTAGGTACCCTCAACATCATTTGTACTATTTTGTTCAATCATCGATATGAAGAAGAT
GATAAAGAATTTCAGGATATCATCAAATACTCAAATCTGACTGTTAAAATTTTTGGTGGA
ACAAGCATTTTATCTTCTATTCCATGGCTGCGTTTTTTACCATCAGCTTCTTCAAGAAGC
ATATATGAGATAGTAAGAATACGTGATCCACTTTTGAAAAAAAAGCTACAAGAGCACAAG
AGCTCGTTTGATGAGAATAACTTACGTGATGTGACTGATGTATTAATTAAGGTTTCTTTG
GGTTCAGATATTGCAAAAGGTTCCGAAGAAAAAATTACTGACGAAAACATAGAGTTTCTT
TTAAACGATTTCATAATTGCCGGATCAGAAACTTCATCAAGTACAATTCTTTGGTTTATT
GTTTATCTTTTACATTGGCCAGAATACCAAGATAAACTTTATAACGAAATTATAAAAGTT
ACATCAGGTAAGCGTTACCCATGTTTAAACGATCGCCCc
CTTCATTTAACGCAAGCCACAATTCATGAAACACTTCGATTGTCATCAGTAGGTCCTC
TTGCTATAGTTCATAAAGCGATGGAAAACAGTTCCATATGTGGAAAACCAGTTCCCAAAG
GAGCTTTTATACTAACAAATTTATGGAGTACACATCATGATGAAAGTTATTGGAAAAATC
CAATGTGTTTTTATCCAGAACGTTGGTTAGAAAAATCTGGTGAGTTTAATTCTAAGTTAG
GGTATGCATTTTTGCCGTTTTCAGGCGGACCTCGTAGCTGTTTAGGAGAAGCACTTGCAA
GAACAGAGTTGTTTGTCTTTTTTTCACGATTAGTAACAGATTATCGGTTTGAAAAACCAA
ATGGTGAGGAGTTACCGCGTTTGAATGGTCGTTTTGGTCTCACTTGCTCTCCTTTTGACT
TTAAATCGGTGGTTGTTCCAAGATGTTAA
>1097206642797
related exon 2 61% to 1095897329284
1096761288099
1096082164704 1097567110690 1097672343044
(1)
DWSRTMNSLINNDLNATFKVLRKITSSSLKIYAEGLVGMEKRAIEEYTHLNKKLLSLKGQAVSIKNMI (1)
AGATTGGAGTAGAACAATGAACAGCCTCATCAATAACGACTTAAATGCAACCT
TTAAAGTTTTACGAAAAATAACATCCTCATCATTAAAGATTTATGCGGAAGGATTGGTGG
GAATGGAAAAAAGAGCTATTGAGGAATACACCCACTTAAATAAAAAGCTTTTATCATTGA
AAGGGCAAGCAGTATCTATTAAAAACATGATTGGT
>1097206059080
5 aa diffs to 1095898809307 might be the same gene
(1)
GPCKPSHIICTILFNHRYDENDQEFQDIIKYSNLSVRASSATSLISSIPWLRFFPSTASR
NIYEIIRLRDPILKRKLQEHRSSYDENNLRDVTDSLIKVSLDSALENNSHEKITDDNIEF
LLNDFIIAGSETSSNTVLWFIVYMLHWPEYQDKLYNEILKITSGNRYPCLSDRPMLHLMQ
AAIHETLRLSSVAPLGVGHKAMESSSICGKPVPKGAFILTNLWSIHHDETHWNNAMSFYP
ERWLEKSGEFNLKLGEAYLPFSSGPRSCLGETLAKIELFVFISRLVKDYRFEKPTEEDLP
NLKGESGITRTPSEFKVMAIPRN*
AGGGCCGTGCAAACCGTCTCACATAATTTGCACAATACTTTTTAATCATCGATATGATG
AAAATGATCAAGAATTTCAAGATATCATAAAATATTCAAATTTGTCTGTTAGAGCATCTA
GTGCAACCAGTCTTATATCTTCTATTCCATGGTTACGGTTTTTTCCTTCAACTGCTTCAA
GAAATATTTATGAAATAATAAGACTTCGTGATCCGATTTTGAAACGGAAACTTCAAGAAC
ACCGAAGTTCTTATGATGAAAATAATTTACGCGATGTGACTGATTCCTTAATAAAAGTCT
CTTTGGATTCAGCATTGGAAAACAATTCACATGAGAAAATCACAGATGATAACATTGAGT
TTCTTTTAAACGATTTTATAATTGCTGGATCAGAAACGTCGTCAAACACTGTTCTTTGGT
TTATTGTTTATATGTTGCATTGGCCAGAATATCAAGATAAACTTTATAATGAAATTTTAA
AGATAACATCCGGAAATCGTTATCCATGTTTAAGCGATCGCCCTATGCTTCATTTGATGC
AAGCTGCAATTCATGAAACACTTAGACTGTCGTCAGTAGCACCTTTGGGTGTAGGTCATA
AAGCAATGGAAAGCAGTAGCATCTGTGGTAAACCTGTTCCAAAGGGTGCTTTTATATTAA
CAAACTTGTGGAGCATACATCACGATGAGACTCATTGGAATAATGCCATGAGTTTTTATC
CAGAACGTTGGCTGGAAAAATCTGGTGAGTTTAATTTGAAACTTGGTGAAGCGTACTTAC
CATTTTCAAGTGGACCGCGTAGTTGTTTGGGAGAAACATTAGCTAAAATTGAATTGTTTG
TATTTATATCACGGTTAGTAAAAGATTATCGGTTTGAAAAACCAACTGAAGAAGACTTAC
CAAACTTAAAAGGTGAATCTGGCATAACTCGCACTCCTTCTGAATTTAAAGTTATGGCTA
TTCCAAGAAATTAA
>gnl|ti|649393684
1095898809307 45% to 17A1 C-term. No exact matches
(1)
VYLKLGEAYLPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEELPNLKGESGITRIPSEFKVMTIPRN*
AGTTTATTTGAAACTTGGTGAAGCGTACTTACCATTTTC
AAGTGGACCGCGTAGTTGTTTGGGAGAAGCATTAGCAAAAATAGAGTTGTTTATATTTAT
ATCACGGTTAGTAAAAGATTATCGGTTTGAAAAACCAACTGAAGAAGAGTTACCAAACTT
AAAAGGTGAATCTGGCATAACTCGCATTCCTTCTGAATTTAAAGTTATGACTATTCCAAGAAATTAA
>gnl|ti|646968536
1095898162561 83% to 1095897329284 37% to 2X2 N-term
1096041100060
1097672010393 1096602125478
MILKVIGSIFFPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFSIGSIRYVIVNNLEGIHEVLIKKGSQFSGRPRII
(1)
ATGATTCTTAAAGTCATTGGTAGCATTTTTTTC
CCGCCTCTTATTTGGTTTGTCTACAGTTACATCAAACATCTTATAGAATGTTTGTACTAT
CCGAAAGGACCAGTTCCTCTACCGTTCATAGGAAATACAAACTTATTAAGAAAAAAGGAA
ACTTGTAAAGAGTTTGTTAATCTTGGGAAGATATATGGTGATATTTTTGGATTCAGCATT
GGTTCTATTAGATATGTAATTGTTAACAACTTAGAAGGTATTCATGAAGTTTTAATTAAA
AAAGGCTCACAATTTTCTGGTCGACCAAGGATTATATGT
>1097509072583
new exon 3 boundary wrong
(0)
LWSYTCDKESGTNLTVLDDLSNLSFDIVGDVGFGYQFNTITSHSSNEFTSAVRNLTKMQI 694
NASVFSKVLITCFPFLVKFLLLFGKRRNLIQIVYKTLNK
(2)
AGCTTTGGTCATATACATGCGATAAAGA
AAGTGGGACAAACCTAACTGTTCTGGATGATTTGTCTAATCTGTCATTCGATATAGTTGG
TGATGTTGGTTTTGGGTACCAATTTAACACAATCACTTCTCATTCTAGTAATGAATTTAC
TTCAGCTGTTCGGAATTTGACTAAAATGCAAATCAATGCTAGTGTGTTCTCAAAAGTTTT
AATAACTTGTTTTCCATTTTTGGTCAAATTCTTGTTATTGTTTGGAAAGCGTAGAAATCT
TATACAGATTGTTTATAAAACTTTGAACAAGT
>gnl|ti|648014530
1095896049543 41% to CYP21
LKYLDCVVK
PANTILRTHVSSIHMNETIYPDPHSFKHERFMTG
AGTCCTGCAAATACAATCC
TACGAACTCATGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCAT
TTAAACATGAAAGGTTTATGACAGGT
>1096082202706
probably the same as 1095896049543, which has errors
1097664076692
1095994179331
(0)
NIEVQEKLREDIQKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPLLGRRTISATKF
GEYEVPANTILRTHVSSIHMNETIYPDPHSFKPERFMT
(1)
AGAACATAGAAGTTCAAGAGAAACTTAGAGAAGATATCCAGAA
AAACATATTGGATGTAAATAATATTTCTTTTGAGGAAGTTATGAGTTTAAAATATTTGGA
TTGTGTCGTTAAAGAAACCTTGCGCTTACATGGACCTGCACCACTTTTAGGCAGAAGGAC
CATTAGTGCAACAAAATTTGGTGAATATGAAGTTCCTGCAAATACAATCCTACGAACTCA
TGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCATTTAAACCTGA
AAGGTTTATGACAGGT
>1097309000937 1097206907008 1095901911044
MFLVCLALIVLFIGLFLLCYLLKRTFHPLRLLPSPKEQLITGHNRYFHGRDHTSTYLSFN
858
EKFKEEGLCTLDTLY
(1)
ATGTTTCTAGTATGTCTAGCACTCATAGTTTTATTTATTGGATTA
TTTTTACTGTGTTATTTATTAAAACGTACCTTTCACCCTCTTCGACTTTTACCATCACCA
AAAGAACAACTTATTACTGGTCATAATAGGTACTTTCACGGCCGCGACCATACTAGCACC
TATTTGAGTTTCAACGAAAAGTTTAAAGAAGAAGGTTTATGTACGCTAGATACATTATATGGT
>1096091465110
88% to 1097331817678
1096625274441
1096123742264 1097265046825 1095964362241
MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGF
701
NEKFKEEGLCTLDTLY
(1)
>1097331817678
1096526275245 1096124165677
1096110023112 1096761988512
1096701884902
walked down from end of 1096526275245
walked farther from end of 1097672563082, ran into a repeat region
MFVICLALITLFIGLFFLRCLLKRIFHPLRLLPSPKEHLITGHISHFQGRDHSNTFLSFNEKFKEE
GLCTLDTLY
(1)
ATGTTTGTGATATGTCTAGCACTCATAACTTTGTTTATTGGATTATTTTTCCTGC
GTTGTTTATTAAAACGTATCTTTCACCCTCTTCGATTATTACCATCACCAAAAGAACATC
TCATTACTGGTCATATTAGTCACTTTCAAGGCCGTGACCATTCTAACACCTTTTTGAGCT
TCAACGAAAAATTTAAAGAAGAAGGTTTATGCACGCTAGATACATTATATGGT
>1095899139433
1096703930092 1097509100606 1097675339850
new exon 2
VPRYVYLIAPEFIKKIFADGKLFQRPTTLKILAPLIGNSMLGSNYEDHHWQRKLFNGAFT
549
SQQLKNYFPAFLKHTNLLMK
AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATTCATTAAAAAGATATT
TGCAGATGGGAAACTTTTTCAAAGGCCTACTACATTAAAAATCTTGGCACCATTAATTGG
AAACAGCATGCTTGGTTCAAATTACGAAGACCATCATTGGCAAAGAAAGTTATTCAATGG
AGCATTTACTTCACAACAGCTGAAAAATTATTTTCCTGCATTTTTAAAGCATACTAATTT
GCTAATGAAAGT
>new exon 2 1095899339221 1097206043402 1097672369437 possible frameshift/insertion
(1)
GFKFIYLLMPEYIKTMVSNGKVFQKSTAMKVIFPLVGNGMLVSNYEHHHWQRKLFNEAFS
AQQLKKYFPAFKEHT
DLLIK (0)
AGGGTTCAAATTTATTTACCTTTTAATGCCAGAATATATTAAAACAA
TGGTTTCTAATGGCAAGGTTTTTCAAAAATCGACTGCAATGAAAGTTATATTTCCTCTAG
TTGGCAACGGTATGCTTGTGTCAAATTATGAACATCACCATTGGCAAAGAAAATTATTTA
ATGAAGCATTTTCTGCACAACAGTTAAAAAAATATTTTCCTGCATTTAAAGAGCATACTA
ATAAAAGATTTACTAATAAAAGT
>1095964240637
1097516021618 1096705343938 1095900018167
1096607016658 new exon 2
(1)
GFRFVDLLLPEFIKTIFSDGKVFHRSNVLKVLFPLVGNGMIVSNYEDHHWQRKVLNEAFT 854
SQQLKNYFPAFTLHTDLLMK (0)
AGGTTTCAGATTTGTTGATCTATTATTGCCAGAATTTATTAAAACAA
TATTTTCTGATGGTAAAGTTTTTCACAGATCGAATGTTTTGAAAGTTTTGTTTCCTCTAG
TTGGAAATGGTATGATTGTATCAAATTATGAAGATCATCATTGGCAAAGAAAAGTTTTAA
ATGAAGCTTTTACCTCCCAACAGCTAAAGAATTATTTTCCTGCTTTTACATTGCATACTG
ATTTGCTAATGAAAGT
>1097675832709 new exon 1 with one possible frameshift or there is another exon
MCMVYIAVLILLCLIVFF
ANVLKRFYHPLRNFPSPQENLITGHYSYFYRYDHVKTLLNFGKQFEKNGLYTLDTLN
(1)
ATGTGTATGGTTTATATAGCAGTATTGATTTTAT
TATGTTTAATAGTATTCTTTGCTAATGTTTTAAAGCGTTTTTATCATCCGCTTCGTAAT
TTTCCCTCACCTCAAGAAAATTTAATTACAGGCCATTATAGCTATTTTTATCGTTATGAT
CATGTCAAGACTTTGTTAAATTTTGGAAAGCAGTTTGAAAAGAATGGCTTATATACATTA
GATACATTAAATGGT
N-terminal EST sequences for hydra P450s
>DN812964.1
ACAC-aac48b12.g1 Hydra EST UCI 7..same as DN812371.1
MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFG
280
KQFKERGLYTLDTLN
>DN810769.1
ACAC-aac19b13.g1 Hydra EST UCI 7.. same as DN812371.1
MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFG
256
KQFKERGLYTLDTLN
>DN816152.1
ACAC-aac24b14.g1 Hydra EST UCI 7.. same as DN812371.1
IAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKE
199
RGLYTLDTLN
>CN775805.1
tae77f11.x1 Hydra EST Darmstadt .. same as DN812371.1
IAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKE
185
RGLYTLDTLN
>BP514308.1
BP514308 Hydra magnipapillata c...have this one
MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG
208
KEFKDYGLYTINTL
>BP514307.1
BP514307 Hydra magnipapillata c...same as BP514308.1
MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG
208
KEFKDYGLYTINTL
>BP505238.1
BP505238 Hydra magnipapillata c... same as BP514308.1
MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFG
209
KEFKDYGLYTINTL
>CO509836.1
tai58f02.y1 Hydra EST UCI 5 ALP .. same as BP514308.1
IYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEF
181
KDYGLYTINTL
>DN813094.1
ACAC-aab89g09.g1 Hydra EST UCI 7..= 1097675463974
MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN
303
EKFKEEGLCTLDTL
>DN603400.1
ACAC-aac10m18.g1 Hydra EST UCI 7..= 1097675463974
same as 1096091465110 DN813094.1 DN137655.1
MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN
283
EKFKEEGLCTLDTL
>DN137655.1
ACAE-aaa07c04.g1 Hydra EST UCI 5.. ..= 1097675463974
same as 1096091465110 DN813094.1 DN137655.1
LICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFNEK
192
FKEEGLCTLDTL
>CN567598.1
tag12b09.x1 Hydra EST -Kiel 1 Hy..we have this one
LLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH
>CX833403.1
ACAC-aaa40d06.g1 Hydra EST UCI 7..
91% to 1095899139433 exon 2
IIFTVLFW*RTFHPLQLLPSPKEQLITGHNMYFHGRDHTSTYLSFNKQFKK*GLCTQHTLX
VPRYVYLIAPQFITKIFAYGKLFQRPTTLKILAPLIGNSMLGSNYKDHHWQKKLFNGAFT
431
SQQLKNYFPAFLKHTT*LMKHWSYTCDKESGTNLTVLDDLSNLSFNIVGDVGFLGFGYQFTQ
ITSHASNEYTS
>1097675877620
new exon 1 with one possible frameshift or there is another exon
MYMICIAAIVILCFL
VLAVMLKRFYYPLCMLPSPKENLFTAHYRYFYGHDHINAFLNFQNQFKDYGLYTLDLLLG
(1)
ATGTATATGATTTGCATAGCAGCCATTGTTATATTATGTTTTCTC
GTCCTTGCTGTTATGTTAAAACGTTTTTATTATCCGCTTTGTATGCTTCCATCACCCAAA
GAAAATTTATTTACAGCTCATTATAGATATTTTTATGGTCATGATCATATCAACGCTTTT
TTAAATTTTCAAAACCAGTTTAAAGACTATGGCTTGTATACATTAGATTTATTACTTGGT
>1095901177607 new exon 1 only 5 aa diffs from 1097675877620
all three of these sequences seem to have a frameshift, so this is probably evidence for another upstream exon. They almost certainly don't have a frameshift
MYMICIAAIVILCFL
VLAVMLKRFYYPLCMLPSPKENLFTAHYRYIYGHDHINAFLYFQNQFKEYGLYTLDIFL
ATGTATATGATTTGCATAGCAGCCATTGTTATATTATGTTTTCTC
GTCCTTGCTGTTATGTTAAAACGTTTTTATTATCCGCT
TTGTATGCTTCCATCACCCAAAGAAAATTTATTTACAGCTCATTATAGATATATTTATGG
TCATGATCATATCAACGCTTTTTTATATTTTCAAAACCAGTTTAAAGAATATGGCTTGTA
TACATTAGATATATTTCTTGGT
>DN813094.1
ACAC-aab89g09.g1 Hydra EST UCI 7..= 1097675463974
MFLICLALLILSIGLFFLRYLLKRIFHPLQLLPSPKEQLITGHISHFQGRDHSNTFLGFN
303
EKFKEEGLCTLDTLY
(1)
ATGTTTCTGATTTGTCTAGCACTTTTAATTTTATCTATTGGATTATTTTTTTTGCGT
TATTTATTAAAACGTATCTTTCACCCTCTTCAACTTTTACCATCACCAAAAGAACAACTC
ATTACTGGTCATATTAGTCACTTTCAAGGCCGCGACCATTCCAACACCTTTTTGGGTTTC
AACGAAAAGTTTAAAGAAGAAGGTTTATGTACGCTAGATACATTATATGTGCCCAGGTAT
>1097675463974
mate pair to exon 2 1097675525814
also 1096123961892 1097329363407 1097509311387
1096064006710
1097509202292 1096703858851 1097675514139 1096607047135
walked up to 1096526789565, continued on 1097516017620
(1)
VPRYVYLIAPEFIKKIFADGKLFQRATSLKVLAPIIGNSMLTSNYEDHHWQRKLFNGAFT 565
SQQLKNYFPAFLTHTDFLMK (0)
(1)
AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATT
TATAAAAAAGATATTTGCAGATGGAAAATTTTTTCAAAGAGCTACTTCATTAAAGGTTTT
GGCACCTATAATTGGAAATAGCATGCTTACTTCAAATTACGAAGACCATCATTGGCAAAG
AAAGTTATTCAATGGAGCATTCACTTCACAACAGCTAAAAAACTACTTTCCTGCATTTTT
AACGCATACTGATTTTCTAATGAAAGTAAGTTTTGATAATATTTAGTTATAGTTTTGTTG
TTTTTATTATAATAATGCAAAACAAATTCTTTTAGCTTGTAAGTACATGGT
>gnl|ti|647058148
1095898198167 I-helix mate pair = 1095898261914
1095898261914 1097206705284 1096761285028
1096761249195 1097206596155
1097675525814 1097675035030 1097329367631
1097206911388 1097206828844
1097460197276
LWSYTSDKESGTNLTVLDDLSNLSFDIIGDVGFGYQFNTINSHSGNEFT
SAFRYLTELQHNASVFSKVLISCFPFLAQFLLLFGKRRKLIQVVHKTLNK
(0)
1097664219266
1097664219266 1096123852196 1097206273255
these have 2 aa diffs: 1097331817534 1097206329872 1096526011941
(0)
LIEKRKKEIDDGISTEEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLIAGHETTSTAMTWCLYMLGT (0)
AGCTTTGGTCTTATACAAGTGATAAAGAAAGTGGGACAA
ACTTAACTGTTTTGGATGATTTGTCTAATCTGTCCTTTGATATAATTGGTGATGTTGGTT
TTGGCTACCAATTTAACACAATCAACTCTCATTCTGGTAATGAATTTACATCAGCTTTTA
GATATTTGACTGAACTGCAACATAATGCTAGTGTGTTCTCAAAAGTTTTGATAAGTTGTT
TTCCGTTTTTGGCGCAATTTTTGTTATTGTTTGGAAAACGTAGAAAACTTATACAAGTTG
TCCATAAAACTTTGAATAAGGT
1096123183594
1097335028467 mate pair
1097335034435 = 100% match to
1095898198167
1097325031454 1097509311387
(0)
NLEVQEKLREEIQKNILDKKNITFEEILSLKYLDCVVKETLRLHGPAPILGRRNINATKF 957
GEYEVPANTVLRTH
VSSLHMNETIYPDPHSFKPERFMT (1)
AGAACTTAGAAGTTCAAGAAAAACTTAGAGAAGAGATCCAGAAAAATATATTGGATAAAAAAAAT
ATTACTTTTGAAGAAATCTTGAGTTTGAAATACTTAGATTGTGTCGTTAAAGAAACCTTG
CGCTTGCATGGACCAGCACCAATTTTAGGCAGAAGAAACATTAATGCAACAAAATTTGGC
GAATATGAAGTTCCTGCCAACACAGTACTACGAACTCAT
EST = DN137322.1 extends to end of gene = 1096071008743
1097672372091
1097509003243
1097329517617
LLFLIAGHETTSTAMTWCLYMLGT
NLEVQEKLREEIQKNILDKKNITFEEILSLKYLDCV 579
VKETLRLHGPAPILGRRNINATKFGEYEVPANTVLRTH
VSSLHMNETIYPDPHSFKPERFMT
1096625230620
1096705372268 1097675934526 1097622179423 1097265071715
1097675205779
GEIPATFYLTFGHGIYNCIGKNFALLEIKTFLVKALLQFEFSVDPEHISYKKFIWLTTITAEPLSIRVKPIAD*
GTTAGCAGTCTACACATGAATGAGACTATTTATCCAGATCCTCATTCGTT
TAAACCTGAAAGGTTTATGACAGGCGAAATACCAGCAACATTCTATCTTACTTTTGGGCA
CGGTATATATAACTGTATTGGAAAGAATTTTGCTTTGCTTGAAATCAAAACATTCTTGGT
CAAAGCGTTGTTGCAATTCGAATTTTCTGTTGACCCTGAACATATCAGTTACAAAAAGTT
TATTTGGTTAACTACGATAACAGCAGAACCATTGTCAATTAGAGTAAAACCTATTGCAGATTGA
>1096703991752
1096123270489 1096526190394 probably same as 1096625230620
GEIPATFYLTFGHGIYNCIGKNFALLEIKTFLVKALLQFEFSVDPEHISYTKFVWLTTXXXXXXIRVNLIAD*
AGGCGAAATACCAGCAACATTCTAT
CTTACTTTTGGGCACGGTATATATAACTGCATTGGAAAGAATTTTGCTTTGCTTGAAATC
AAAACATTCTTGGTCAAAGCGTTGTTGCAATTCGAATTtTCTGTTGACCCTGAACATATCA
GTTACACGAAGTTTGTTTGGTTAACTACGXXXXXXXXXXXXXXXXXXGTCATTAGAGTAAACC
TAaTTGCAGATTaa
>these
have 2 aa diffs from 1097675463974,
1097331817534 1097206329872 1096526011941
(0)
LIEKRKKEIDDGISTEEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLVAGHETTSNAMTWCLYMLGT (0)
>1096071008743
84% to CN567799 1096602116307 1096123983311 1096124057195
886
(0) NLEVQDKLREEILKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPILRRRTMNAIKF 707
706
GEYEVPANTVLQTHISSLHMNETIYADPHLFKPERFMT (1) 593
>gnl|ti|648470985
1095898761545 N-term mate pair = C-term 1095899295538
1097264057439
extends N-term down 1097325864056 joins N and C-terms
681
MFLVYSLLVVIFSYFLIKISWKLWIYSYGLSTVPTPPTIPFFGNCLQLESDSVKFNKQI 854
855 REWSKIYGNVFCVWIGLTPMIYSSSVNFSEAILSSQKVLKKASVYEFLYEWLQTGLLTSTGNK
WKLRRRLLTPSFHFSILNNFLKIFEEQGACLVDKLRIYAKSGGNFDIQVPIGLATLDIICETSM
GVKVNAQSHPDSEYAKAIGILSEEIPKRIKYPWLWPDIIYKHLACGKRYYKALDVAHKLSLDVI
KERVKTLIQNKSEVTSNKNKK
ATGTTTTTGGTGTACAGTCTATTGGTTGTTATTTTTTCATACT
TTTTAATTAAAATATCTTGGAAACTTTGGATTTATTCTTATGGTCTCTCGACTGTTCCAA
CACCTCCAACCATACCATTTTTTGGCAATTGTCTTCAGCTTGAAAGTGATTCTGTAAAGT
TTAACAAACAAATACGCGAGTGGAGCAAAATATACGGAAATGTTTTCTGCGTTTGGATAG
GCCTTACGCCAATGATATACTCATCTTCTGTAAATTTCTCGGAAGCAATCTTAAGCAGTC
AAAAAGTCCTCAAAAAAGCATCTGTTTATGAATTTTTGTATGAATGGCTTCAAACCGGG
TTACTGACAAGCACAGGAAATAAG
TGGAAACTGCGTCGTCGACTTCTTACACCAAGCTTTCATTTTTCTATACTCAATAATT
TTTTAAAAATTTTTGAAGAGCAAGGAGCTTGTTTAGTTGATAAATTACGTATTTATGCCA
AAAGTGGTGGAAATTTCGATATCCAGGTACCTATTGGATTAGCAACTTTAGATATAATAT
GCGAGACATCAATG
GGAGTAAAAGTAAATGCACAGAGTCACCCAGACTCAGA
GTATGCTAAAGCCATCGGTATATTAAGTGAAGAAATACCAAAAAGAATTAAGTACCCATG
GTTATGGCCAGATATTATTTATAAACATCTTGCTTGTGGAAAAAGATATTATAAAGCTCT
AGATGTTGCTCATAAATTATCTCTTGATGTAATAAAAGAAAGAGTTAAAACACTTATTCA
AAATAAAAGCGAGGTTACATCAAATAAAAACAAAAAA
GAATCAGGCTCTGAAAAAAAAAA
ATTTTTTTTAGACTTATTGTTAGATATGCATAAAAAAGGTGAAATTGATACTGAAGGGAT
TCAAGAAGAGGTTGATACTTTTATGTTTGAAGGTCATGATAGCACTTCATCAGCATTAAG
CTGGATGCTGTGGTTGCTAGGAAGATATCCACAAGTTCAACAGAAACTGCATTCAGAAAT
TGATGAAGTGgAA
1095899295538
1096703530556 1096705948493 1096526527227 1096625218937
1097191001062
ESGSEKKKFFLDLLLDMHKKGEIDTEGIQEEVDTFMFEGHDSTSSALSWMLWLLGRYPQVQQKLHSEIDEVE
LTGGSLYEKVRNFKYLENVVKESMRIHPPVPLIGRHIEEDMVIDGQFVPKSSEIVLLVMM
MQSSPEYWKDPYDFIPERFEQEDFVKRNPYIYIPFSAGPRNCIGQKFAMIEEKMLLYIIM
KNFYVQSIQNENEILLALNIIHKSSNGIIMKFTER*
TTAACTGGAGGTTCACTTTATGAAAAAGTAAGAAACT
TTAAATATCTTGAAAACGTTGTAAAAGAAAGTATGCGAATTCACCCACCTGTTCCTTTAA
TTGGCAGGCATATTGAAGAAGACATGGTAATTGATGGTCAGTTTGTTCCTAAAAGTTCTG
AAATTGTTTTACTTGTAATGATGATGCAATCAAGTCCTGAATACTGGAAAGATCCATATG
ATTTCATACCTGAAAGGTTTGAACAAGAAGATTTTGTTAAGCGCAATCCATATATCTATA
TTCCATTTTCAGCAGGTCCAAGAAACTGTATTGGTCAGAAGTTCGCAATGATTGAAGAGA
AAATGCTGTTATATATCATAATGAAAAACTTTTACGTCCAATCCATCCAGAATGAAAATG
AAATACTTCTTGCTCTAAATATTATACATAAATCGAGTAATGGTATCATAATGAAATTCA
CTGAAAGATGA
>1096083942127
1097329109827 clearly best match to 4V sequences
MAFILLIFFLLLITLFLIWIYWVRSYNLNFVPSPLRFPLFGCALFLKSESH
ELFKQVRWFFSEFGSAFCLWIGPKPVLMTGNIDHIQTVLKSQKIITKSSSYTFLNE
WLGTGLLTSTGAKWKSRRKVLTKAFHFSIINSYVDSFYQNSVSLSNHLENHSGVPINIQA
LMSLFTLDIICETAMGFKLNSMKNLNCDYVNAVEEVKILLIERQKSPWLWNKFVYKLFSS
GKKFYTQLQVLKSFTKKIVNKRIKNYSLSSNGCKSFLDLLIDAYNQGKIDLEGIYEEVDT
FMFAGHDTTAAALSYIFLMLGTHPKVQKKLHEEIDTNVNINSYENLSEKIRKMEYLDCVI
KESLRLHPPVSVFGRILEDDTIFSNHLVGKGADIVLCPETLHTDPLYWENHRSFIPERFS
NVEFAFCQPYLYIPFSAGPRNCIGQKFALMEIKIAIFVVMSKFIVTAVEQCLSPM
ATFIQRYENGVLMLFEDEKRFLYML*
>1097329374310
no introns very similar to 1095899295538 seq
1096608398403
1096761840588 1097460256370
67% to 1095958068757 88% to 1095898761545
MFIAYSLLVVVSLYFVIKLFWK
FWIYSYGLSTVPTPPTIPFFGNSLQLESDSVKFNKQLCEWSKIYGNVFCVWVGLR
PTIFSSSVNFSEAILSSQEVLKKASIYEFLHDWLKTGLLTSTGNK
WKLRRRLLTPSFHFSILNNFLKIFEEQGACLVDKLRTYAKSGENFDIQVPIGLATLDIICETSMGVKVNAQSHP
DSAYVKAINILSEEIPRRFKYPWLWPDIIYKHLACGKRYYKALDVAHKLSLDVINERIETLFQNE
NNVTTNKNKEVSSEKKKFFLDLLLDIHKKGEIDTEGIQEEVDTFMFEGHDTTSSALSWIL
WLLGRYPQVQQKLHSEIDEVELTGGSLYEKVRNFKYLENIIKESLRIHPPVPLIGRHIEK
DMVIDGQFIPKKSEIGVLVMMMHSSPEYWKDPYDFIPERFEQEDFVKRNPYIYIPFSAGPRNCIG
QKFAMIEEKMLLYSIMKNFYVQSMQNENEILPSLDLIRKSVNGIILKLTER*
ATGTTTATTGCGTACAGTTTGTTGGTTGTAGTTTCTTTATACTTTGTAATTAAATTATTTTGGAAG
TTTTGGATTTATTCTTATGGTCTCTCGACTGTTCCAACACCTC
CAACCATACCATTTTTTGGGAATTCTCTTCAACTTGAAAGTGATTCTGTTAAGTTTAATA
AACAACTATGCGAGTGGAGCAAAATATACGGAAATGTGTTCTGTGTTTGGGTAGGCCTTA
GGCCAACTATTTTCTCATCTTCTGTAAATTTCTCGGAAGCAATTTTAAGCAGTCAAGAAG
TCCTTAAAAAAGCATCAATTTATGAATTTTTGCATGACTGGCTTAAAACTGGATTACTAA
CAAGCACAGGAAATAAGTGGAAACTGCGTCGTCGACTC
CTTACACCAAGCTTTCATTTTTCTATACTCAATAATTTTTTAAAAATT
TTTGAAGAGCAAGGAGCTTGTTTAGTTGATAAACTACGTACTTATGCCAAAA
GTGGTGAAAATTTTGATATCCAGGTACCTATTGGATTAGCAACTTTAGATATAATATGTG
AGACATCAATGGGAGTAAAAGTAAATGCACAGAGTCACCCAGATTCAGCGTATGTTAAAG
CCATTAATATTTTAAGTGAGGAAATACCAAGGAGATTTAAATACCCATGGTTGTGGCCAG
ATATTATTTATAAACATCTTGCTTGTGGAAAGAGATATTATAAAGCACTAGATGTTGCTC
ACAAATTGTCTCTAGATGTAATAAATGAAAGAATTGAAACACTTTTTCAAAATGAAAACA
ATGTTACCACAAATAAGAACAAAGAAGTTAGCTCAGAAAAAAAAAAGTTTTTTTTAGACC
TACTGTTAGATATACATAAAAAAGGTGAAATTGATACTGAAGGGATTCAAGAAGAGGTTG
ATACTTTTATGTTTGAAGGTCATGATACCACCTCATCAGCATTAAGCTGGATACTTTGGT
TGCTAGGAAGATATCCACAAGTTCAACAGAAACTGCATTCAGAAATTGATGAAGTTGAAT
TAACCGGAGGTTCACTTTATGAAAAAGTAAGAAACTTTAAATATCTAGAAAACATCATAA
AAGAAAGCCTGCGAATTCATCCGCCTGTTCCTTTAATTGGCAGACATATTGAAAAAGATA
TGGTAATTGATGGTCAGTTTATTCCTAAAAAATCTGAAATTGGTGTTCTTGTCATGATGA
TGCATTCAAGTCCTGAATATTGGAAAGATCCATATGATTTCATTCCTGAAAGGTTTGAAC
AAGAAGATTTTGTTAAGCGCAATCCCTATATCTATATTCC
ATTTTCTGCAGGTCCGAGAAATTGTATTGGTCAGAAGTTCGCAATGATTGAAGAGAAAAT
GTTGTTATATAGCATAATGAAAAACTTTTACGTCCAATCCATGCAGAATGAAAATGAAAT
ACTTCCTTCTCTAGATCTTATACGTAAGTCGGTTAATGGTATCATATTAAAACTTACTGA
ACGATAA
>1095964281471
1097672357643 1097675038710 1096526281478 1097675573844
MFSNIKMIYTLCIIICGFYFLIKILWMCWKYSYGLTSIATPPNTPFLGTSFYFLSDS
RKSYFQLCNYTKQFGNVFCIWLGPKPMIVSSSVKFLKAVLSSEKITTKGFSYDWIHDWLK
TGLLTSSGPKWKARRKLL
TSSFHFSVFNRLKIIIEEQACILVDKISFAADNKKVVDVQTLIGLATLDVICETIMGVKINAQ
780
SYPDSEYVKAISVLHKEIVNRMKFPWLWFDVIYKLLPCGKRFYKALDVAHKFTFDIINKR
600
MEISVNESYIDTPLEEKSYFLDLLLNIHKKKEIDMEGIQEEVDTF
IFAGHDTISVALSWTLWLLGKYSEIQRKLHKSIDEIELNGGSLFEKVRNFKYLENII
KESMRIHPPVPMYGRTVEENMTIDGQFVPKGAQIILLVLMLHSDPNIWENPKEFIPERFE
TDDWKIKNSYSYLPFSAGSRNCLGQKFAMIEAKMLLYSIM
KKFSLKSMQDENEVYGTVDILHKSINGINILFTRR*
>gnl|ti|648478468
1095898788708 N-term EST = CV564880.1
1097672125473 1097509103730 1096123847153
1097664004740 1097329293298
1096092407854 1097325278081 1097675392269 1097672158546 1097206129107
1095899351259
76% to 1097329374310 similar to 4V5
MFLTFMFLFLIYFLIKVFWKLWIYSYGLSTVSTPPTLPLFGNCLQIKSDPVKASKQL
FEWSRVYGKVFCVWVGIRPTIFSSSVNFSEAILSSQKIIQKGFVYNFLHEWLKTGLLTST
GNKWKLRLRLLTPSFHFSILNNFLKIFEEQGNCLIDKFRVLAQNGKYFDIQVPIGLATLD
IICETSMGVKINAQYQPDSEYVTAINILSEEIVRRFKYPWLWPNIFYKHFSCGKRYFKAL
DIAHKLSLNVIHERIQTSLQNESENVLINKLDNKSVLNNEEELGVRKKRFFLDLLLDMHK
KGEIDVDGIQEEVDTFMFEGHDTTSS
AMCWTLWLLGRYPQIQQKLHAEVDEVELTSGSLYEKVRNFKYLE
NVLKESLRLHPPVPLISRYIEEDMMIDGQFIPKKSEIAILVMMIHLNPEYWKDPHSFIPE
RFDQDDFVKRNPYTYIPFSAGPRNCIGQKFAMIEEKMLLYNIMKHFYVESMQNENEILRT
QDLISKSANGIMMKFYER*
ATGTTTTTAACTTTTATGTTTTTGTTTCTTATTTATTTTCTAATTA
AAGTATTTTGGAAGCTTTGGATTTATTCTTATGGCCTGTCAACTGTTTCTACACCTCCCA
CATTACCATTATTTGGCAATTGTCTTCAAATCAAAAGTGATCCTGTAAAAGCCAGCAAAC
AACTATTCGAGTGGAGCAGAGTATACGGAAAAGTGTTTTGTGTTTGGGTTGGCATTCGGC
CAACTATATTCTCATCTTCTGTTAATTTTTCCGAAGCAATTTTAAGCAGTCAAAAAATAA
TTCAAAAAGGATTTGTGTACAATTTTTTGCATGAATGGCTTAAAACTGGTCTACTAACAA
GTACGGGAAATAAGTGGAAATTGCGTCTTCGACTTCTAACGCCAAGCTTTCATTTTTCTA
TACTCAATAACTTTTTAAAAATTTTTGAAGAGCAAGGAAATTGTTTAATTGATAAATTTC
GCGTTCTTGCCCAAAATGGAAAATATTTTGATATTCAGGTGCCTATTGGGTTAGCTACAT
TAGATATAATATGCGAGACGTCAATGGGAGTGAAAATAAACGCGCAGTATCAGCCAGATT
CCGAATATGTTACTGCCATTAACATCTTAAGTGAGGAAATAGTTAGACGGTTTAAGTACC
CGTGGTTGTGGCCAAATATTTTTTATAAGCATTTTTCTTGTGGAAAACGGTACTTTAAAG
CATTAGACATTGCTCATAAACTGTCTCTTAATGTAATTCATGAAAGAATTCAAACTAGTT
TACAAAACGAAAGTGAGAATGTGTTAATCAATAAACTTGACAATAAGAGCGTGTTGAACA
ATGAAGAGGAACTCGGTGTACGTAAAAAGAGGTTTTTCTTAGATTTATTGTTAGACATGC
ATAAAaAAGGTGAAATT
GATGTTGATGGGATTCAAGAGGAGGTGGATACATTTATGTTTGAAGGTCACGACACCACC
TCATCAGCAATGTGTTGGACATTATGGTTGCTGGGAAGATATCCACAAATTCAACAGAAA
CTGCATGCTGAAGTTGATGAAGTTGAACTAACTTCGGGTTCACTATATGAAAAAGTACGA
AACTTTAAATATCTTGAAAATGTTTTAAAAGAAAGCCTGAGACTTCATCCACCAGTTCCC
TTAATCAGTAGGTATATTGAAGAAGATATGATGATTGATGGTCAGTTTATTCCTAAAAAA
TCTGAAATCGCTATTCTTGTGATGATGATACACTTAAATCCTGAGTATTGGAAAGATCCT
CACAGCTTTATACCTGAAAGATTTGATCAAGATGATTTTGTAAAGCGTAATCCATACACT
TACATTCCATTCTCCGCTGGCCCTAGAAATTGCATTGGTCAAAAGTTTGCAATGATAGAA
GAAAAAATGCTGTTATATAACATAATGAAACATTTTTATGTAGAATCCATGCAGAATGAA
AATGAAATTTTAAGAACTCAAGATCTTATAAGTAAATCAGCTAATGGTATCATGATGAAGTTCTATGAAAGATGA
>Combined
CN627429 CN775805 27% to 4T5 [gene 6]
GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK
ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH
WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK
DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA
FFPFLMHLSFMYGKRKRAEQVICNTLNM
LINKRKKEIDHRIAADQKDFLTVVLK
DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK
>EST DN812371.1 joins with CN627429
CN775805 and 1095901729505 1097325001902
CN776982
and CN770283 1097206896815 1096110026952
1096110107596
MYTIGIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFKERGLYTLDTLN
GFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH
WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMK
LWSYSCDKDNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL
RFQLNAVHKALLAFFPFLMHLSFMYGKRKRAEQVICNTLNM
LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT
(0)
NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDV
PAGSFLRIPIDSAHMNESVYHDPYSFRPERFLTGEIPPLSFLTFGQGIYNCIGKNFALLE
IKTFLVKALLQFEFSVDLKHLNYKKLISITNKTVEPLWIRVKPI*
EST matches 1097325001902 1097265052814 for exon 1
1 tcggcatagc agtattaatt tttttgtgtt tttcactgtt ttttgctaat attttaaaac
61 gtttttatca tccgcttcgt aagttgccat cacctaaaga aaatttcttt actgctcatt
121 atggctactt taatggctat gatcaaataa atgctgtaat aaattttgga aaacagttta
181 aagagcgtgg cttgtataca ttagatacat taaat ggatt tagatttgtt aatcttttaa
241 tgccagaatt tattaaaaca gtgttttctg atggaaactc attccaaaga tcgaccgcta
301 caaaagttat atttcctcta gttggaaatg gtatttttgt gtcaaattat gaagatcatc
361 attggcaaag aaaagtgtta aatgaagctt ttactttaca acagctaaaa aattattttc
421 cagcttttac agtgcacatt gatttgctaa tgaaactttg gtcatattca tgtgacaagg
481 ataatggtac taacataatt gttttggatg acttatctaa tttatcattt gatataattg
541 gggatgttgg ttttggctat ca
>1096526100337
74% to CN776982
1096526100337 1097206278072 1096123494736
(0)
NLDVQNKLREEIKKNVFDIKSILREEVLSIKYLDCVVKETLRMHPPASFISRKNKTETKL 308
GDYDIPAGTFLRISINNVHMNESVYPDPYLFKPERFMT
(1)
1095898850029
same as 1096520314506 = mate pair match to 1096526207508
1097206730806
DEIPPSSFLSFGQGIYNCIGKNFALLEIKTFLVKALLHFEVSVDPSHVNYTKQILLTLNTVEPIWIRVKSIEE*
AGAATTTAGATGTTCAAAACAAACTAAGAGAAGAAATAAAGAAAAA
TGTCTTTGATATAAAAAGTATTTTACGGGAAGAAGTTTTAAGCATCAAGTACTTGGATTG
TGTAGTTAAAGAGACATTACGCATGCATCCACCTGCGTCATTTATATCTCGAAAAAACAA
AACTGAAACAAAGTTGGGTGATTATGATATACCTGCTGGCACGTTTTTAAGAATTTCAAT
TAACAACGTACATATGAATGAGTCTGTTTATCCTGATCCTTATTTATTTAAGCCGGAACG
ATTTATGACAGGT
AGATGAAATACCACCATCGTCTTTTCTCTCATTTGGGCAAGGTATTTATAATTGTATTGGAAAGAAT
TTTGCTTTGCTTGAAATTAAAACGTTTTTGGTTAAAGCATTATTACATTTTGAAGTTTCT
GTCGACCCAAGTCATGTGAATTATACAAAACAGATTTTGTTAACTTTAAATACCGTTGAA
CCCATTTGGATAAGAGTGAAATCTATTGAAGAATAA
>1097696222067
new exon 6 1097375001145 1097672638278
1096041191032
1097678083218
GEIQPYSYLTFGQGIFNCIGKNFALLEIKTFLVKALLQFEFSVDLEHMNYIKKIFISTKTVEPLWIRVKPI*
AGGTGAAATACAACCAT
ATTCCTACCTCACATTTGGGCAAGGTATTTTTAATTGTATTGGAAAGAATTTTGCTTTGC
TTGAAATTAAAACATTCTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTTGACCTTG
AGCATATGAATTATATAAAGAAAATTTTCATTTCTACTAAAACTGTTGAACCGTTATGGA
TAAGAGTGAAACCTATATAA
>1097331770349
new exon six with stop, no other exact matches
DEIPSSSYLTFGYGIYNCIGKNFALLEIKTFLIKAL*QFEFLVDPEQLSYKKQISIST
330
KTAEPLWIRVKSI*
AGATGAAATACCATCTTCATCCTAC
CTTACATTTGGGTATGGTATTTATAACTGTATTGGAAAGAATTTTGCTTTGCTTGAAATT
AAAACATTTTTGATTAAAGCGTTGTAACAATTTGAGTTTTTGGTTGACCCTGAGCAATTA
AGTTATAAAAAGCAGATTTCAATTTCTACTAAAACAGCTGAACCGTTATGGATAAGAGTA
AAGTCTATATAA
>1097329444796 new exon 6 no other exact matches, most like CN567799
GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFEFSVDPKNINYTKVIWLTTRTVEPLLIRVKPLQPV
AGGCGAAATACCAGCATCGTTCTATCTTCCTT
TTGGACACGGTGTTTATAACTGCATTGGAAAGAATTTTGCTTTGCTTGAAATTAAAACAT
TTTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTCGATCCTAAGAATATAAATTATA
CAAAGGTTATTTGGTTAACTACGAGAACAGTTGAACCATTGCTTATAAGAGTAAAGCCAT
TACAGCCCGTAC
>1097325113147 1097690001285 1097942838551
GEIPATFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFEFSVDPKHANYTKVIWLTAKT
290
TEPLSIRVKPIVD*
AGGCGAAATAC
CAGCAACATTCTATCTTCCTTTTGGGCATGGTGTTTATAACTGTATTGGAAAGAATTTTG
CTTTGCTTGAAATCAAAACATTTTTGGTCAAAGCGTTGTTGCAATTCGAATTTTCTGTTG
ACCCTAAGCATGCAAATTATACAAAGGTTATTTGGCTAACTGCAAAAACAACTGAACCAT
TGTCAATCAGAGTAAAGCCTATTGTAGATTGA
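The stop-terminated exon translations in these notes can be spot-checked against their DNA. The sketch below is a minimal Python example, assuming the standard genetic code and using only the last 42 nt of the 1097325113147 exon above (joined across the line break); translate is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, offset=0):
    """Translate dna from the given 0-based offset; unknown codons give X."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(offset, len(dna) - 2, 3)
    )


# 3' end of the 1097325113147 exon, already in frame.
tail = "ACTGAACCATTGTCAATCAGAGTAAAGCCTATTGTAGATTGA"
print(translate(tail))
```

The printed peptide should agree with the C-terminal residues listed for the entry, ending in the stop codon. Translating a whole exon this way requires the correct starting offset, which depends on the intron phase given in parentheses.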
>1097206250175
1095899118096
GEVPPFSFLTFGRSNYNCIGKNFVLLDIKAFLVKALLQFKFSVDP
360
MHLNYKKPISITNKAVDPLWIRVKTI*
AGGTGAAGTACCGCCATTTTCCTTTCTAACAT
TTGGGCGAAGTAATTATAATTGTATTGGAAAGAATTTTGTTCTGCTTGACATCAAAGCAT
TCTTGGTCAAAGCGTTATTGCAGTTTAAATTTTCAGTAGACCCTATGCATTTGAATTATA
AGAAGCCGATTTCTATTACTAATAAAGCCGTTGATCCCTTATGGATTAGAGTAAAGACTA
TATAA
>1096123749751
boundary is not right, no other exact matches
GETPASLYLPFGHGVYKVIGKNFSLLEIKTLSVKALLQLEKVVDPKNINYSKVIWLTSRT
211
VEPLFIRVKLIVD*
GGTGAAACACCAGCATCGCTTTATCT
TCCTTTTGGACACGGTGTTTATAAAGTCATTGGAAAGAATTTTTCTTTGCTTGAAATTAA
AACATTGTCGGTCAAAGCATTGTTGCAATTAGAAAAGGTTGTCGATCCTAAGAATATAAA
TTATTCAAAGGTTATTTGGTTAACTTCGAGAACAGTTGAACCATTGTTTATAAGAGTAAA
GCTTATTGTAGATTAA
>gnl|ti|654999901
1095901768752 87% to 1095901729505
1095901905311
mate pair links to 1095901795880 exon 5
1097331953492
1097664070304 1096761841205 1097675516783 1096602049536
1096761821875
(0)
LINKRKKEIEDGIETGEKDFLTIVLKDQQKEGSKMTNDLIRNNLVTLLIAGHETTSVAMQWCLYILGT (0)
AGCTTATCAACAAACGTAAAAAAGAAATAGAAGATGGAATAGAAACTGGTGAAAAAGATTTTTTAACA
ATTGTTTTAAAAGATCAACAAAAAGAGGGCAGCAAGATGACAAATGATTTGATTAGAAAT
AATCTAGTAACACTTTTAATTGCTGGTCATGAAACAACTTCTGTAGCAATGCAATGGTGC
TTATACATTCTTGGCACAGT
1095901795880
1097491021716
1096123686039 1097672412446
(0)
NSDVQNKLREDIKKNVFDIKSITCEEVLSIKYLDCVVKEVLRLHPPVSFIGRINTR
QTNFGEYNVPAGSYLRVPINSAHMNESVYPDPYSFKPERFLT
(1)
AGAATTCAGATGTTCAAAACAAGCTACGAGAAGACATAAAGAAAAATGTCTTTGATATAA
AAAGTATTACGTGTGAAGAAGTTTTAAGTATTAAGTATTTAGATTGTGTAGTTAAAGAAG
TGTTGCGCTTGCATCCGCCTGTATCATTTATAGGTAGAATCAACACTAGACAAACAAACT
TTGGTGAATATAATGTACCTGCTGGCTCTTATCTACGAGT
>1097206350025
1097675534489 1096602217388 all with frameshift
(0)
LIDKRKKEIEDGIATDEKDLLTIALKDQQKENSKMTX
NLIRDNLMTFLIAAHETTSTGMQWCLYMLGT (0)
AGCTTATCGACAAACG
AAAAAAAGAAATAGAAGACGGAATAGCAACTGATGAAAAAGATTTATTAACAATCGCTTT
AAAAGATCAGCAAAAAGAAAACAGCAAGATGACCTTNAATTTAATTAGAGATAATCTAATG
ACATTTTTAATTGCTGCTCATGAAACAACTTCTACGGGAATGCAATGGTGTTTGTATATG
CTTGGCACAGT
>1097331459342
frameshift and 2 aa short (pseudogene?)
LIDKRKKEIEDGIATDEKDLLTIALKDQQKENSKMT
NLIRDNLMTFLIAAHETTSTGMQWCLYML
AGCTTATCGACAAACGAAAAAAAGAAATAGAAGACGGAATAGCAACTGATGAA
AAAGATTTATTAACAATCGCTTTAAAAGATCAGCAAAAAGAAAACAGCAAGATGACCTTA
ATTTAATTAGAGATAATCTAATGACATTTTTAATTGCTGCTCATGAAACAACTTCTACGG
GAATGCAATGGTGTTTGTATATGCTG
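A frameshift like the one noted for this entry shows up when the peptide upstream of the break reads in one frame and the peptide downstream reads in another. The following is a minimal sketch in Python, assuming the standard genetic code and a 38 nt excerpt of the DNA above around the SKMT/NLIRDNL junction (joined across the line break); three_frames is a hypothetical helper for illustration only.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def three_frames(dna):
    """Return the translations of the three forward reading frames."""
    dna = dna.upper()
    return [
        "".join(
            CODON_TABLE.get(dna[i:i + 3], "X")
            for i in range(offset, len(dna) - 2, 3)
        )
        for offset in range(3)
    ]


# Excerpt spanning the suspected frameshift in 1097331459342.
excerpt = "AGCAAGATGACCTTAATTTAATTAGAGATAATCTAATG"
frames = three_frames(excerpt)
for offset, peptide in enumerate(frames):
    print(offset, peptide)
```

In this excerpt the upstream residues SKMT appear in frame 0, which then runs into a stop, while the downstream residues NLIRDNLM appear only in frame 2. That pattern is what the frameshift annotation describes; whether it reflects a sequencing error or a pseudogene cannot be decided from the translation alone.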
>1097263640455 new exon 5
(0)
NLDVQEKLREGIKKNVSDIKNISYEEVLSNKYLDCVVKEALRIHPPRS
AGAATTTAGACGTTCAAGAAAAACTAAGAGAAGGGATAAAGAAGAATGTA
TCTGATATAAAGAATATTTCATATGAAGAGGTTTTAAGTAACAAGTACTTAGATTGTGTA
GTTAAAGAAGCATTGCGCATCCATCCACCGCGCTCCAGCTA
>1096526374787
no 100% matches to this seq, best match is 1095901795880,
but intron boundaries do not match.
This may be a poor quality sequence or pseudogene.
(1)EKVINIKYLDCVVKEVLRLHPPVLFIGRINTRQTNLGKYIETAGSNQRVPINNAHMNESVYPDPYSFMPKRLLT
(1)
AGAAAAAGTTATAAATATTAAGTATTTAGATTGTGTAGTTAAAGAAGTGTTGCGCTTGCA
TCCGCCTGTATTATTTATAGGTAGAATCAACACTAGACAAACAAACTTAGGTAAATATAT
AGAAACTGCTGGCTCTAATCAACGAGTTCCTATTAACAATGCTCATATGAATGAGTCTGT
TTATCCTGATCCTTATTCATTTATGCCAAAGAGGTTGCTGACAGGT
>CN567598.1
tag12b09.x1 Hydra EST -Kiel 1 Hy..
LLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH
>Combined
N and C-terms CN567799 CN567598 tag12b09.x1 [gene 4]
N-term has an extension
1096761916754
probably 1095964418219 (poor quality seq)
RVRFLLRYLLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLDTLH
(1)
VPRFVYLIAPEFIKKIFADGKLFQRSKSIRTLAPLIGNSMVGSNYEHHHWQRKLFNG
AFTSQQLKNYFPAFLKHTNLLMK
(0)
LWSYTCDKESGTNLTVLDDLSNLSF
CN567598
part of three exons
1 cacgcgtccg atttttactg cgttatctat
taaagcgcat ctttcaccct cttcgatttt
61 taccatcacc aaaagaacaa ctcattactg
gtcatattaa tcactttcaa ggccgcgacc
121
attctagcac ctatttgagt ttcaacgaaa agtttaaaga agaaagttta tgcacgctag
181
atacattaca t
gtgcccagg tttgtttatc
taattgctcc agagtttatt aaaaagatat
241
ttgcagatgg aaaacttttt caaaggtcaa aatcaataag aactttggcc cctttaattg
301
gaaacagcat ggttggttca aattacgaac accatcattg gcaaagaaag ttattcaatg
361
gagctttcac ttcacaacaa ctgaaaaatt attttccagc atttttaaaa catactaatt
421
tgcttatgaa g
ctttggtca tatacatgtg
ataaagaaag tgggacaaat ttaactgttt
481
tggatgattt gtctaatctg tcatttg
DIVGDVGFGYHFNTITSHSGNEVTKAFQKY
CQLRHSLHPFYKALFAYFPFLMRLSFMFGKHKKAEQVISYTXXX
(0)
AGCTTTGGTCATATACATGCGATAAAGAAAGTGGTACCAACATAATTGTTTT
GGATGATTTGTCTAATCTATCATTTGATATAGTTGGTGATGTTGGTTTCGGCTATCATTT
TAACACCATAACTTCTCATTCCGGTAATGAAGTTACAAAAGCCTTCCAAAAGTATTGTCA
ACTACGACATAGCTTGCATCCCTTTTATAAAGCTTTATTTGCTTATTTTCCATTTTTAAT
GCGTCTATCATTCATGTTTGGAAAACATAAAAAAGCTGAGCAAGTTATAAGTTATACT
>1096081231152
new exon 3
IWSYTCDKENGTKIIVLDDLSNLSLDIIGDVGYGYQFNTLTSHSGNEFTKAFQSYCQLQY
135
NIKPIYKALSAFFPFLMGLSIMFGKRKKTEEILRNNLNM
AGATTTGGTCATATACATGTGATAAAGAAAATGGTACCAAAATAATTGTTTT
AGATGACTTGTCTAATTTATCACTTGATATAATTGGTGATGTTGGTTATGGCTATCAATT
TAACACCTTAACTTCTCATTCTGGTAATGAATTTACAAAGGCTTTTCAAAGTTATTGTCA
ACTACAATATAACATAAAGCCAATCTATAAAGCTCTATCAGCTTTTTTTCCTTTCCTAAT
GGGGCTGTCAATCATGTTTGGAAAACGAAAGAAAACAGAGGAAATTTTACGTAATAATCT
AAACATGGT
>1095898814465
new exon 2 mate pair to 1095899110069
1095993318769 1095963238168 1096704141526
1095899349182 1097664110168
1097622218011
(1)
VPRYVYLIAPEFIKKIFADGKLFQRTTSIRIMAPSIGNSMLSSNYEDHHWQRKLFNGAFT 471
SQQLKNYFPSFLTHTNLLMK
(0)
AGTGCCCAGGTATGTTTATCTAATTGCTCCAGAATTTATTAAAAAAATATTTGCTGATGGCAAACTTTTT
CAAAGAACTACTTCAATTAGAATTATGGCACCTTCAATTGGAAACAGCATGCTTAGTTCA
AATTACGAAGACCATCATTGGCAAAGAAAATTATTCAATGGAGCATTCACTTCACAACAG
CTAAAAAACTATTTTCCTTCATTTTTAACGCATACTAATTTACTGATGAAAGT
1097567103129 1097675494277
1095899110069
(0)
IWSYTCDKESGTNLTVLDDLSNLSFDIIGDVGFGYQFNTITSHSRNEFTSAIRYLAEIQL 657
NASVFLKVLISYFPFLIQLLVMFGKRRKFIQIVRKTLNK
(0)
AGATTTGGTCTTATACATGTGATAAAGAAAGTGGGACAAACT
TAACTGTTTTGGATGATTTGTCTAATCTGTCATTTGATATAATCGGTGATGTTGGTTTTG
GTTACCAATTTAACACAATTACATCTCATTCTCGTAATGAATTTACTTCAGCTATTCGGT
ATTTGGCTGAAATTCAACTCAATGCTAGTGTGTTCTTAAAAGTTTTAATAAGTTATTTTC
CATTTTTAATTCAATTGTTGGTAATGTTTGGGAAGCGTAGAAAATTTATACAGATTGTCC
GTAAAACATTGAACAAGGT
39% to 3A27 trout aa 307-472 58% to CN770283 [gene 4]
683
FFIAGYETISTTLTLCLYMLAINLEVQEKLREEIQKNKLDVNNISFEEVTSLKYLDCVVK 504
503
ETLRLHGLAPVLGRETINAIKFGEYEIPANTVLQTHVSNLHMNETIYRDPHSFKPERFMT
324
323
GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFKFSIDPMHINYTK
168
167
IIWLTMRTVEPLLIRVKPIAE* 102
TTTTTCATAGCTGGTTATGAAACAATTTCTACTACTTTGACTTTGTGTTTATATATGCTA
GCCATTAACTTAGAGGTTCAAGAGAAACTTAGAGAAGAGATTCAGAAAAATAAATTGGAT
GTAAATAATATTTCTTTTGAAGAAGTTACGAGTTTAAAATATTTGGATTGTGTCGTTAAA
GAAACCTTGCGCTTGCATGGACTTGCACCAGTTTTAGGCAGAGAGACCATTAATGCAATA
AAATTTGGCGAATATGAAATTCCTGCAAACACAGTACTTCAAACTCATGTTAGCAATCTA
CACATGAATGAGACTATTTATCGAGATCCTCATTCATTTAAACCTGAAAGGTTTATGACA
GGGGAAATACCAGCATCATTCTATCTTCCTTTTGGGCACGGTGTTTATAACTGTATTGGA
AAGAACTTTGCTTTGCTTGAAATTAAAACATTCTTGGTCAAAGCGTTGTTGCAATTCAAA
TTTTCTATTGACCCTATGCATATAAATTATACAAAGATTATTTGGTTAACTATGAGAACA
GTTGAACCATTGCTAATTAGAGTAAAACCTATTGCAGAATAA
>gnl|ti|648014530
1095896049543 41% to CYP21
LKYLDCVVKETLRLHGXXXXXXXXXXXXX
KFGEYEVPANTILRTHVSSIHMNETIYPDPHSFKHERFMTG
GTTTAAAATATTTGGATTGTGTCGTAAAG
GAAACCTTGCGCTTACATGGA
AAATTTGGTGAATATGAAGTCCCTGCAAATACAATCC
TACGAACTCATGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCAT
TTAAACATGAAAGGTTTATGACAGGT
>1096082202706
probably the same as 1095896049543
which has errors
1095994179331
(0)
NIEVQEKLREDIQKNILDVNNISFEEVMSLKYLDCVVKETLRLHGPAPLLGRRTISATKF
GEYEVPANTILRTHVSSIHMNETIYPDPHSFKPERFMT
(1)
AGAACATAGAAGTTCAAGAGAAACTTAGAGAAGATATCCAGAA
AAACATATTGGATGTAAATAATATTTCTTTTGAGGAAGTTATGAGTTTAAAATATTTGGA
TTGTGTCGTTAAAGAAACCTTGCGCTTACATGGACCTGCACCACTTTTAGGCAGAAGGAC
CATTAGTGCAACAAAATTTGGTGAATATGAAGTTCCTGCAAATACAATCCTACGAACTCA
TGTTAGCAGTATACACATGAATGAAACTATTTATCCAGATCCTCATTCATTTAAACCTGA
AAGGTTTATGACAGGT
>1096526207508
with frameshifts probably same seq as 1096526100337
mate pair = 1096520314506 same as 1095898850029
(0)
NLDVQNKLREEIKKNVFDIKSILR
EEVLSIKYL
DCVVKETLRMHPPASFISRKNKTETKLGDYDLPAGTFLRISINNVHMNESVLSWIPYLFKPER
AGAATTTAGATGTTCAAAACAAACTAAGAGAAGAAATAAAGAAAAATGTCTTTGATATAAAAAGTATT
TTACGG
GAAGAAGTTTTAAGCATCAAGTACTT
GATTGTGTAGTTAAAGAGACATT
ACGCATGCATCCACCTGCGTCATTTATATCTCGAAAAAACAAAACTGAAACAAAGTTGGG
TGATTATGATCTACCTGCTGGCACGTTTTTAAGAATTTCAATTAACAACGTACATATGAA
TGAGTCTGTTTTATCCTGGATCCCTTATTTATTTAAGCCCGAACGAA
>BP514308 N-term 25% to 46a [gene 9]
1096761991009
1096082187152 1096123591182 1095899045709 1097383004013
MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFKDYGLYTINTLI
(1)
ATGTATTCGATATACATAGCGATTATAATAGTTC
CTTTAGTTTTTTTCGTTGCTGTTTTTTTTAAACGTTTTTATCATCAATTTCGTTTGTTGC
CATCACCCAAAGAGAGTTTAATTACATGTCACTATAGTTATTTTGATGTTCATGACCATG
TTAACACTCTGTTAAACTTTGGTAAAGAGTTTAAAGATTACGGATTATATACAATTAATA
CTTTAATTGGT
1096124094276
1095964247544 1095901005745 1095899045709
GPRQVHLLLPHFIKTVIADGKFFQRSPVFKAVFPLVGNSMIVSNYEDHHWQRKLF
NQAFTSQQLKRYFLAFTLHTDLLMK
(0)
AGGACCCAGACAAGTTCATCTTTTATTGC
CACATTTCATTAAAACAGTAATTGCAGATGGAAAGTTTTTTCAAAGATCACCAGTTTTTA
AAGCCGTATTTCCTCTTGTTGGAAACAGTATGATCGTTTCTAATTATGAAGATCATCATT
GGCAAAGAAAATTATTTAATCAAGCCTTTACTTCGCAACAATTAAAAAGATATTTTTTAG
CTTTTACTCTGCATACTGATTTGCTAATGAAGGT
LWSCTCDKENGTNLNVWSDLSNLSFDIIGDVGFGYQFNTITSHSGNAFTKALRSY
INLRFNSSVVHNVLIAYFPFLMRFLSKFGNLNKAEQ
(0)
1095958061820
82% to 1095901729505
1095964290917
1096124019775 1096528662475 1096159548758 1097622041589, 1097672472615
1096064134288
mate pair = 1096041094868 exon 3,
(0)
LIDKRKKEIENGLVKEEKDFLSIVLKDQQQEKSKLTNDLIRDNLMTLLIAGHETTSTAMLWCLYTLGT (0)
AGCTTATCGATAAGCGTAAAAAAGAAATAGAAAATGGATTAGTAAAAGAAGA
GAAAGATTTTTTATCAATTGTTTTAAAAGATCAACAACAAGAAAAGAGCAAACTGACAAA
TGATTTGATTAGAGATAATTTAATGACGCTTTTAATTGCTGGTCATGAAACTACTTCTAC
TGCAATGCTGTGGTGTTTATACACATTAGGAACAGT
ran into seq gap downstream
>gnl|ti|655009968
1095963046224 KYG region 46% to CYP20 35% to 27B1
(2) LGNLGSLTFDGGIHKFLVENHKRLGPMFSFYWGKELAVSLACPILFKEVATLFNRP (1)
AGATTGGGAAATCTTGGCTCTCTAACATTTGA
TGGTGGAATTCACAAGTTTCTTGTTGAAAACCATAAAAGGCTTGGTCCAATGTTCAGCTT
TTATTGGGGCAAAGAACTGGCTGTTAGTCTAGCTTGTCCAATTCTTTTTAAGGAGGTT
GCCACTCTATTTAATCGACCAGGT
>1097263613070 mate pair to 1097206643989 I-helix/J helix boundary?
1097329235455 1097672289528 1096705876537 1096110072452
LTWLVYFLCKHPEVESKVYNEIKEFTEKDLDMELLTKFS
(2)
AGTATTGACATGGCTTGTTTATTTCTTATGTAAACATCCAGAAGTGGAATCTAAGGTATACA
ATGAGATAAAAGAATTTACAGAAAAAGATCTAGATATGGAATTACTTACAAAATTTAGGT
>BP508840
BP508840 Best match in Fugu, human and Ciona is CYP20
1096602177777
1097264059772 1096123966865 1097672368420
1097206643989 1096110028119
(2)
YTKQVIDEVMRIAVLAPYAARYSDYDIIVDGHLIPKK
(0)
(0)
TPIILALGTVFQDETIFPEPDR (2)
(2)
FDPDRFSDKQIEERSALAFQPFGFAGKRKCPGYRLAYAETLTYTFYIIKNFHISL
FDKQSVKMHYGFVTKPSEEIWIKVLRRKNI*
1096111030955
AGTTACACAAAGCAAGTTATTGATGAAGTTATGAGAATTGCTGTACTTGCTCCATATG
CAGCTAGATATTCGGATTATGATATAATCGTTGATGGACATCTAATACCAAAAAAGGT
AGATTTGACCCTGACCGTTTTAGTGATAAACAAATTGAAGAACG
TTCAGCGTTAGCTTTTCAGCCGTTTGGATTTGCGGGAAAAAGAAAGTGTCCTGGATATAG
ATTAGCATATGCTGAAACATTGACGTACACATTTTATATCATCAAGAATTTTCATATTTC
GCTATTCGATAAGCAATCTGTGAAAATGCATTATGGTTTTGTCACAAAACCGTCTGAAGA
AATTTGGATTAAAGTGTTACGACGTAAAAATATCTAG
>CYP20 amphioxus 39% to CYP20 Danio
MLDYAIFAITFVVFLIATVLYLYP (0)
(0) GANKITTIPGLEPSDPK (2)
(2) DGNLGDVGRAGSLHEFLLKLHTEYGDIASFWWGQQLVVSLGAPELWKQH
ERIFDRP (1)
(1)
ALLFKGFEPLIGAKSIQYANSVDGRTRRKLYDPSYGHNAMKHYYSIFQE (0)
(0)
LGQEMAKKWESMKGDQHIPLHAHIIALAMKAITRSSFGDAFKDEKECVQFGRNYDI (0)
(0) CWNDMEERIKGSHPTEGSPREKKFKE (1)
(1)
ALGKLHATIARVAKYRRENPSPPQEQLFIDVLIEGNLPEEQ (0)
(0) VLCDAMTFTVGGIHTSGN (1)
(1)
LLTWALYYIATHEEVEEKLHQELSDVLGKKGEVTPDNISQLV (2)
(2) YLRQVLDESLRCAVIAPWGARYMDLDAEVGGHIVPAK
(0)
(0) QTPVIHAFGVVLQDERIWPEPNK (2)
FDPDRFDAENSKGRHKLAFQPFGFAGGRKCP (1)
(1)
GYRFTYTWTSVFLSILCRQFKLHLVDGQVVKPCHGLVTRPVDEIWITVTKRD*
1096111030955
ATCTCCATCAGTTAATTGCATAGATGCGAGAATGCATGTTAAGTGCATGTAACTCTGAATATTAGCGACAAAGTTTAGTATCACTATCTATGATAGTATTTTTAGTATTTATATGATTCATATTTTTCAGCATAACTCTAATAATAAATATTAATCTAAATTTTAATGCTTTTTTTTTTAAATGATTGAATATTTTTAAAACATGTATTAAATAGTTTATTAACTAAATTTTTAAAGTATTTTTAAGTACTAATAAATTTAAAAAATAAAAAAAGATGTTTATGTTTCAAAGTTTATATTTCAGTTACACAAAGCAAGTTATTGATGAAGTTATGAGAATTGCTGTACTTGCTCCATATGCAGCTAGATATTCGGATTATGATATAATCGTTGATGGACATCTAATACCAAAAAAGGTTTATACTAATATTA
>gnl|ti|646862798
1095898098005
41 0.018 35% to 17A1
34% to 2P4
gnl|ti|647168675
1095899196297
1097622027233
1096703988838 1097329365089 1097329154279
42% to 1095898227332
MLVFQQLIFAVLVPAFLYFVFSYLQHLWICSKYPKGLFPLPLLGNIH
QLGKNSSQTFSSLTKIYGDIFSVSIGTQRLVILNSMESIHEALLTKGSTFGGRPTEF
TSNVFTKGYKNLSHTDYGPNLKALRKVIHLSVQKYAGGLTRQEQMITFERDELCKKLFN
TEKEIALRCEI
(1)
ATGCTTGTTTTTCAACAATTAATATTCGCCG
TACTTGTTCCGGCTTTTTTATATTTTGTTTTTTCTTATTTGCAACATTTATGGATTTGTA
GTAAGTACCCAAAGGGTCTGTTTCCATTACCGTTGTTAGGAAACATTCATCAATTAGGTA
AAAACTCTTCTCAAACATTTTCATCTTTAACAAAAATTTATGGAGATATATTTAGTGTGA
GTATTGGTACCCAGCGACTCGTTATACTCAATAGTATGGAAAGCATACATGAAGCTTTGTTAACCAAAGGTTCAACTT
TTGGTGGTAGACCAACTGAgGTTACGTCAAATGTTTTTACAAAAGGATATAAAAACTTATC
GCACACTGATTATGGACCGAATTTAAAAGCGTTGCGAAAAGTTATTCATCTTTCCGTTCA
AAAATATGCTGGCGGACTAACGAGACAAGAACAGATGATAACTTTTGAAAGAGACGAACT
TTGTAAAAAACTTTTTAATACTGAAAAGGAAATAGCTTTACGTTGTGAAATTGGT
(1)
DFCTVNVMSGYLFNERFLNQNSEFKDVVKSIQLLLDNSGITDKTTFIHWLRYLPLREWN
793
EIKQARLVLNPWVEKKVEDHWRKYNENEIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617
616 LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLL
446
445
QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266
265
PKRWINENGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89
88 LPSLEGQFGITFRPNSFKVL* 29
AGATTTTTGCACTGTAAATGTAATGTCGGGGTATTTATTCAATGAACGCTTTCTGAACCAAAATTCCG
AGTTTAAAGATGTCGTAAAAAGTATTCAACTTTTGCTAGATAACTCTGGAATTACAGATA
AAACCACGTTCATACATTGGCTTCGTTACTTGCCATTGCGGGAATGGAATGAAATAAAAC
AAGCGAGACTTGTCTTAAACCCGTGGGTCGAAAAAAAGGTTGAAGATCATTGGAGAAAGT
ATAATGAAAATGAAATCATTAATGTAACTGATAGCATGATTCAACATTTTTTAACAAAGT
ACGATGGTTTAGACACTGATTTTGCAAAGAAATACATTACCTTATTATTGATCGAATTAC
TTGTTGCCGGTACCGAAACGACAGCTATTACTATTTGCTGGATGGTTTTATATCTAATAC
ATAACCCTGAGTATCAAGAAGAAATTTATAAAGAAATTACATTAAATATTGGTTGTAGAT
TGCGAATAACATCTGTTGTGCCACTAAACTTGGCTCACAAAGCATTAAAAGATACCAGCA
TTTGTGGAAAAATTATTCCTAAAGACGCTATAGTAATTACAAATTTATGGAATCTTCATC
ACGACAACAGATACTTTAAAAATCCTAATGAATTTGATCCTAAACGCTGGATAAACGAAA
ATGGTCTATTTGACTCAATTTCTCAAAAATATTTTAAACCTTTTTCGGCTGGAGCGAGAG
TATGTCTTGGCGAGACATTAGCCAAAAATCAACTTTTTTTAATCATCTCCGGTCTAATTA
TGAATTTTATTTTCACATCTGCACCAGGAAAAGACTTACCTAGTCTTGAAGGACAATTTG
GAATCACATTCCGTCCCAATAGTTTTAAGGTTTTATAA
>gnl|ti|651477674
1095901303788
39 0.11 39% to CYP21 39% to
2R1 40% to 2P4
49% to 1095898227332
1096703646566
mate pair = 1096703498438 = N-terminal exon
1097675091467
MFLFVVFEVVFGLIIPVLLYVI
VVYIYHIWECQRYPPGPFPLPVIGNYNLLANDPVKALCDLEIIYGDVFSLSLGTVR
VVVVSSHESIYDVLVGDGSNFSGRPREYSSLLFTGGFENLSHMDNNPLTKKIRKVFYSKL
KTNGSILAHNENIVKHESELLHQRLLQNEGSVTNLRYEI
(1)
(1)
DLCIVNSICSIIFGNRLSDTCEVHEILKATRLLLKNLSNIEIMHYLPWM
RFFLLKKQNEISESRNICKFWIQTQLHKRKKSLKNENISDILLNLWDQQKQENP
NEEQYRMILVELVMAGSETTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILE
TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD
KNGELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGETPKLDGD
IGITLTPLPYNAVAKQRT*
ATGTTTCTTTTTGTAGTTTTTGAAGTTGTATTTGGGCTGATAATTCCCGTTTTACTTTACGTAATAGTT
GTTTATATTTATCATATTTGGGAATGTCAAAGATACCCACCAGGT
CCATTTCCTCTTCCGGTAATTGGAAACTACAACTTGTTAGCAAATGATCCTGTGAAGGCA
TTGTGCGATCTAGAAATTATTTACGGAGATGTTTTCAGTTTAAGTTTAGGAACCGTTCGG
GTGGTTGTTGTAAGCAGCCACGAGAGTATTTACGATGTTTTAGTTGGAGATGGATCAAAT
TTTTCCGGAAGACCCAGGGAGTATTCATCTTTACTTTTTACTGGAGGTTTTGAAAACCTT
TCCCATATGGATAACAACCCGTTGACTAAAAAAATCAGAAAAGTTTTTTATTCAAAACTT
AAAACAAACGGAAGTATTTTAGCACACAATGAAAATATTGTCAAACATGAAAGTGAACTT
TTACATCAAAGACTACTGCAAAACGAAGGAAGCGTCACCAATCTTCGTTATGAAATCGGT
AGATCTTTGTATTGTTAACAGCA
TATGCAGTATTATTTTTGGTAACCGGCTTAGTGATACTTGTGAAGTTCATGAAATTTTAA
AAGCGACCAGGTTACTTCTAAAAAACTTGTCAAACATTGAAATTATGCATTATTTACCAT
GGATGAGATTTTTTTTATTAAAAAAGCAAAACGAAATCAGCGAATCTAGAAACATTTGCA
AATTTTGGATTCAAACCCAGTTGCATAAACGAAAAAAAAGTTTAAAAAACGAAAATATCT
CAGATATTCTTTTGAACCTTTGGGACCAACAAAAACAAGAAAACCCTAATGAGGAACAAT
ACAGAATGATTTTAGTTGAGTTAGTTATGGCTGGTTCCGAAACAACAGCCGCAACGATAAC
TTGGCTAATCTTTTATCTTTTGCATTGGCCTCACTATCAAAGCATTCTTTACAAAGAAAT
CAAAAATGTTTGTGGTGATCAGTACCCTACGTTTAATGATATTAAATCAATGCCTATAAT
GCAAGCAACTATACTTGAAACTTTAAGGTTGTCTTCTGTCGTTCCTTTAAGCTTATCTCA
CAAAGCCGTAAATAACGCGAAAATTAATAAATTCACAATCCCTAAAGATACAATAATAAT
AACAAATTTATGGGGCGTACATCATAATGAAAAATACTGGGAAAAACCGTTTGAATTCAA
TCCTATGCGTTGGCTTGATAAAAATGGCGAACTTTCAACAGCAAAGCGTTTAGGATATTT
CCCTTTTTCAGCCGGCCCAAGAGGTTGCATTGGTGAGTCATTTGCAAGAATGCAAATGTT
TATTATATGCTCTCGACTGATAAAAGATTTCTCCTTTGAGTTGCCTCAAAGCGGAGAAAC
CCCAAAACTAGATGGTGATATTGGAATTACACTAACGCCCCTTCCTTATAATGCAGTAGC
TAAACAGCGAACCTAA
>gnl|ti|654998190
1095901734433 33% to CYP21 33% to
CYP17 33% to 2U1
gnl|ti|651148169
1095901003210
gnl|ti|651162328
1095901096755
74% to 1095898227332
possible pseudogene
870
FQDIIKTHNET
837
SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658
657
TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSG
IGYPSLNDRPRFHLIQAIIHETLRLLSVAPLGLCHKALENGSICGKFVPKG
(frameshift)
LLILTNLWSIHHDERYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS
147
146
GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKDSLDGRSGVTCLPYEFEIVMIPRS*
>gnl|ti|655009845
1095963045220 near C-helix region poor match
gnl|ti|648592188
1095595897239
NOT A P450, MATCHED FISH AND DROSOPHILA SEQUENCES
637
STSNCVTMNQSDYLQNVLQKFGFDNCKQRSSPCKQVPYSYHNQESIKSNGSMIKL 473
472
YRQMVGSLLYAMTCTRPDLSYVITKLSQHLSKPNSGDWIMIKHVFRYIKHTLN 314
313
YCLTFRK 293
>gnl|ti|648047811
1095899057643 I-helix 4 aa diffs to 1095898198167
(0)
LIEKRKKEIDDGISTKEKDIITIVLKDQQQESSKLTNDLIRDNLLLFLIAGHKTTSTTMTWCLYILGT (0)
AGCTTATTGAAAAACGTAAAAAAGAAATCGACGATGGAATATCAACAAAAGAGA
AGGATATTATCACAATTGTCTTAAAAGATCAACAGCAAGAAAGCAGCAAACTAACAAATG
ATTTGATTAGAGATAATTTATTATTATTTCTCATAGCTGGTCATAAAACAACTTCTACTA
CTATGACTTGGTGTTTATATATACTAGGCACTGT
CYP1-like (only one seq)
1095899272864
5 aa diffs in N-term overlap region
MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS
KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY
GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1)
>gnl|ti|648485307
1095899272864 57% to 1095897342515
ATGTGGTATGAAAT
TATCTGCGGACTGATCATTTCGATTTTGCTATATATTATTGGTTCTTACTTGATGCACTT
GCTGGAATGCAGGAAGTATCCTCTTGGACCTTTTCCAATACCAATCTTTGGTAACTTGCA
TTTATTAGGAACAGAGCCACATAAAATACTTGCTGCATACTCAAAAAAGTATGGAGCAGT
CTTTAGCATAAGTTTAGGATTGCAAAGAATTGTTATAATTTCTGACATTACTACAACTAG
AGAAGCACTAGTTCAAAAAGCATCCATATTTGCAGGTAGACCAAAATCTTATTTAATTCA
ATTAATTTCAAGTGGGTACAAAGGCATTGCATTTATGGACTATGGTTCCTTCTGGAAAGT
TTTGCGTAAAGTTAGTCATTCTTCATTAAAAATATATGGAGAAGGACATGAACGTTTTGA
AAAGATACTTACAAAAGAAAGTGAAGAGCTACATAAAAGACTTTTAAAGAAATCAAATAA
TTCCGTAGAGCTGAAATCTGAATTTGGT
>CN775634
tae83e09.y1
Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to
SW:CP11_OPSTA
Q92095 CYTOCHROME P450 1A1
AGAGAGTGAAGAGCTACATAAAAGACTTTTAATGAAATCAAAAACTTCCGTAGATCTGAAAACTGAATTT
GGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAAATTCAGAAT
TTAAAGAAGTTCTTACAACAATAAACAATATAGTCGATGGGTTGTCAAATACAACTGCTGTCGGTTTTTT
GCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTTCACTTTCAAAATATATTCGT
TTTTTAAACGATAAGTTGACCAAACATAAGGAAACATTTAATGAAAACAAAATTCGAGATTCTACTGATT
CTATTATAAAC
32% to CYP1C1 aa 173-297
2 ESEELHKRLLMKSKTSVDLKTEF (1)
GAAIINVICFIVFGERYQYSNSEFKEVLTTINNIV 175
176
DGLSNTTAVGFLPWLRFLPFSPIKKLSISLSKYIRFLNDKLTKHKETFNENKIRDS 343
344
TDSIIN 361
opposite end of clone =
>CN774619
tae83e09.x1
Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to
SW:CPT7_CHICK
P12394 CYTOCHROME P450 17
TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA
TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC
TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA
AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA
TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT
TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT
38% to CYP17A2 Fugu aa 383-485
391
TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275
273
PYRRIGKDKKFDPSKATSFLPFSAGTRVCL
(1)
GKTVAENELFFFFSRLIRDFKFECTPGCPP 94
93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7
1097672474909
AGGGAAA
GTTGCTGAAAATGAACTATTTTTC TTCTTTTCTAGATTAATTCGAGATTTTAAGTTTG
AGTGCACACCT
GGTTGTCCACCTCCAAGTTTAGTTGGAAAATGCAATATTACTCATGCT
CCAAAACAATTTTGCGCATACTTGATTCCAAGAATAAACAATCTTATGTAG
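The coordinate pairs in the entry above (for example "93 ... 7") run from high to low, which reads as a translation of the minus strand. A minimal Python sketch of how such a span could be re-translated for checking is given here. The function names (clean, reverse_complement, translate, translate_span) are illustrative and not from any tool used for these notes, and the standard genetic code is an assumption.

```python
from itertools import product

# Standard genetic code (NCBI table 1), listed in TCAG order. Assumed, not stated in the notes.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(dna):
    """Drop whitespace (wrapped lines, stray spaces) and upper-case the sequence."""
    return "".join(dna.split()).upper()


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return clean(dna).translate(COMPLEMENT)[::-1]


def translate(dna):
    """Translate codon by codon from the first base; unknown codons become 'X'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def translate_span(dna, first, last):
    """Translate 1-based coordinates first..last; first > last means minus strand."""
    dna = clean(dna)
    if first <= last:
        return translate(dna[first - 1:last])
    return translate(reverse_complement(dna[last - 1:first]))


# First 30 bases of CN774619, read on the minus strand from base 30 down to base 7.
est_start = "TTTAAACTACATAAGATTGTTTACTCTTGG"
print(translate_span(est_start, 30, 7))  # tail of the C-terminal peptide, ending at the stop
```

Whitespace is stripped before slicing, so coordinates refer to bases only; sequences pasted with internal spaces or line wraps can be passed in unchanged.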
>1096526199166
frame3_ORF1 7aa diffs to CN774619 may be same gene
(1)
GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS
LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI
TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL
RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD
KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN
ITHAPKQFCAYLTPRINNLM*
AGGTGCTGCAATTATAAACGTGATTTGTTTCATTGTTTTTGGGGAAAGATACCAGTATTCAGATTCAGAAT
TTAAAGAAGTTCTTACAACAATAAATGATATAGTCGATGGGTTGTCAAATACAACTGCTG
TTGGATTTTTGCCGTGGTTGAGATTTTTACCGTTTTCTCCAATAAAAAAACTGAGTATTT
CACTTTCAAAATATGTTCGTTTTTTAAACGATAAGTTGAAAAAACATAAGGAAACATTTG
ATGAAAAGAAAATTCGAGATTTTACTGATTCTATTATAAATTTTTCTAATAACGAAGCTG
TCAAACAAAAATTTAAAAACGTTGATGAACATTTAGAGCCTGTGATTGGGGATTTATTTA
TAACGGGTAGTGAGACCACATTAACATCTTTATTGTGGTTAATTCTTTATATGATGCATT
ATCCCAAATATCAACAAGAAATTTTTAAAGAAATTACAACGGTTATTGGTGAAGACCGGT
ACCCATGTTTAAATGACCGTGATTCTTTGCATCTTGTTAAAGCCGCATTAAAAGAGTGTC
TGCGTTTATCTTCAATTGTTCCTCTTGGATTACCACACAAAACAACCAAAGAAACAGTTC
TTATGGGACATAGCATTCCTGGGAATGCAACAGTCATGATTAATCATTGGCAGATTCATA
ACGATACTAACTACTGGGAAAATCCTAACGAATTTAATCCTTATCGGTGGATTGGTAAAG
ATAAGAAATTTGATCCAAGTAAAGCAACAAGTTTTTTACCTTTTTCAGCCGGTACAAGAG
TTTGTTTAGGGAAAACAGTTGCTGAAAATGAACTATTTTTCTTCTTTTCTAGATTAATTC
GAGATTTTAACTTTGAGTGCATACCTGGTTGTCCACCTCCAAGTTTAATTGGTAAATGCA
ATATTACTCATGCTCCAAAACAGTTTTGCGCATACTTGACTCCAAGAATAAACAATCTAATGTAA
>whole gene 1095899272864 1096526199166
MWYEIICGLIISILLYIIGSYLMHLLECRKYPLGPFPIPIFGNLHLLGTEPHKILAAYS
KKYGAVFSISLGLQRIVIISDITTTREALVQKASIFAGRPKSYLIQLISSGYKGIAFMDY
GSFWKVLRKVSHSSLKIYGEGHERFEKILTKESEELHKRLLKKSNNSVELKSEF (1)
GAAIINVICFIVFGERYQYSDSEFKEVLTTINDIVDGLSNTTAVGFLPWLRFLPFSPIKKLSIS
LSKYVRFLNDKLKKHKETFDEKKIRDFTDSIINFSNNEAVKQKFKNVDEHLEPVIGDLFI
TGSETTLTSLLWLILYMMHYPKYQQEIFKEITTVIGEDRYPCLNDRDSLHLVKAALKECL
RLSSIVPLGLPHKTTKETVLMGHSIPGNATVMINHWQIHNDTNYWENPNEFNPYRWIGKD
KKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFNFECIPGCPPPSLIGKCN
ITHAPKQFCAYLTPRINNLM*
CYP2 like (2 different sequences)
>CN566581
taf98h10.x1
Hydra EST -Kiel 1 Hydra magnipapillata cDNA 3' similar to SW:CPC8_HUMAN
P10632
CYTOCHROME
CACGCGTCCGCTTTTTAGTCCCTGTGTAATAAGTATTTCTTAGACATAATTTTAAAATGTTTCTTGAAGT
TATTGGCGCAGTCTTTATTCCACCTTTGATATGGACTATATGGGTTTACATTAAACATTTAATTGATTGT
TTGCATTATCCAAGAGGACCAATACCACTACCATTTATTGGAAATGGTTATTTGATAAGAAAAGCTGAAC
CATATAAAGAGTTGGTTAACTTAGGAAAAATATATGGCGATGTTTTTAGTTTTAGCGTTGGTTCAGTCAG
ATATGTAATTGTCAACAGTTTAGAAGGAATTCAAGAAGTACTAGTTAAAAAAGGGTGGCAATTTGCTGGT
CGTCCAAAAGGTCCAAGTTGGGATAGATCCATTCACGGTCTAATCCAACGTGATCCAAGTAAAAAATTTA
AAATATTACGGAAGCTAGCAACATCATCTTTGAAAATCTTTGCTGATGGATTGGCAGGGATGGAAAGTAA
AGCTATA
32% to 2X9 aa 26-146
57 MFLEVIGAVFIPPLIWTIWVYIKHLIDCL
144
HYPRGPIPLPFIGNGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVL 323
324
VKKGWQFAGRPKGPS
WDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI 497
EESFQLNKKLLETNGKPF
opposite end of clone =
>CN566859
taf98h10.y1
Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC
Q92113
CYTOCHROME P450 17
TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT
TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA
TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA
AAAAACATTAGCAAAAT
TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG
AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA
TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG
GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG
TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT
ATAAGAGCCCCTTT
GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC
44% to 17A1 fugu aa 378-485
607
RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395
394
AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227
CYP3 like (two different sequences)
>CN567799
opposite end = CN567598 tag12b09.x1
tag12b09.y1
Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to TR:Q9PVE8 Q9PVE8
CYTOCHROME
P450 3A30
TTACTTTGTATTTAAAAATCATCAAAAGAAAACCCCAAACAATCATTATTATAAATGTTAGGACTTAAAG
TTTTAAATAAAGTTTATTCTTCTGTTGAATATTATTCTGCAATAGGTTTTA
CTCTAATTAGCAATGGTTC
AACTGTTCTCATAGTTAACCAAATAATCTTTGTATAATTTATATGCATAGGGTCAATAGAAAATTTGAAT
TGCAACAACGCTTTGACCAAGAATGTTTTAATTTCAAGCAAAGCAAAGTTCTTTCCAATACAGTTATAAA
CACCGTGCCCAAAAGGAAGATAGAATGATGCTGGTATTTCCCCTGTCATAAACCTTTCAGGTTTAAATGA
ATGAGGATCTCGATAAATAGTCTCATTCATGTGTAGATTGCTAACATGAGTTTGAAGTACTGTGTTTGCA
GGAATTTCATATTCGCCAAATTTTATTGCATTAATGGTCTCTCTGCCTAAAACTGGTGCAAGTCCATGCA
AGCGCAAGGTTTCTTTAACGACACAATCCAAATATTTTAAACTCGTAACTTCTTCAAAAGAAATATTATT
TACATCCAATTTATTTTTCTGAATCTCTTCTCTAAGTTTCTCTTGAACCTCTAAGTT
AATGGCTAGCATA
TATAAACACAAAGTCAAAGTAGTAGAAATTGTTTCATAACCAGCTATGAAAAA
Combined N and C-terms
>CN567598
tag12b09.x1 [gene 4] N-term
RVRFLLRYLLKRIFHPLRFLPSPKEQLITGHINHFQGRDHSSTYLSFNEKFKEESLCTLD
TLHVPRFVYLIAPEFIKKIFADGKLFQRSKSIRTLAPLIGNSMVGSNYEHHHWQRKLFNG
AFTSQQLKNYFPAFLKHTNLLMKLWSYTCDKESGTNLTVLDDLSNLSF
39% to 3A27 trout aa 307-472 58% to CN770283 [gene 4]
683
FFIAGYETISTTLTLCLYMLAI
(0)
NLEVQEKLREEIQKNKLDVNNISFEEVTSLKYLDCVVK 504
503
ETLRLHGLAPVLGRETINAIKFGEYEIPANTVLQTHVSNLHMNETIYRDPHSFKPERFMT (1)
323
GEIPASFYLPFGHGVYNCIGKNFALLEIKTFLVKALLQFKFSIDPMHINYTK 168
167
IIWLTMRTVEPLLIRVKPIAE* 102
>CN770283
58% to CN567799
tad87b02.y2
Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to
SW:CP4C_BLADI
P29981 CYTOCHROME P450 4C1
AAAGAATGTATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGTT
GTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAATCAAACAAAGT
TTGGTGACTTTGATGTACCTGCTGGCTCTTTTTTACGAATTCCTATTGACAGTGCACATATGAACGAGTC
TGTTTATCATGATCCTCATTCATTTAGACCACAACGATTCTTGACAGGTGAAATACCACCATTATCCTTC
CTTACATTTGGGCAAGGTACATATAATTGTATCGGAAAGAATTTTGCTTTGCTTGAAATCAAAACATTCT
TGGTCAAAGCGTTGCTGCAATTCAAATTTTCAGTAGACCTTAAGCGTTTGGAAATTAACAAGCTGAAT
35% to 3A27 trout aa 343-472
2
KNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDVPAGSFLRI
PIDSAHMNESVYHDPHSFRPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKA
LLQFKFSVDLKRLEINKLN 418
>CN776982
taf28f06.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3'
similar to TR:Q9VXY0 Q9VXY0 CG9081 PROTEIN. ;.
Length = 316
Same as 1096041191032
Query: 79  RPQRFLTGEIPPLSFLTFGQGTYNCIGKNFALLEIKTFLVKALLQFKFSVDLKRLEINKL 138
           R +RFLTGEIPPLSFLTFGQG YNCIGKNFALLEIKTFLVKALLQF+FSVDLK L   KL
Sbjct: 307 RQERFLTGEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNYKKL 128
QRTRQERFLT
GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY
KKLISITNKTVEPLWIRVKPI*
AGGTGAAATACCACCATTATCCTTCCTTACATTTGGGCAAGGTATATATAATTGTATCGGAAAGAAT
TTTGCTTTGCTTGAAATCAAAACATTCTTGGTCAAAGCGTTGCTGCAATTCGAATTTTCA
GTAGACCTTAAGCATTTGAATTATAAGAAGCTGATTTCGATTACTAATAAAACCGTTGAA
CCGTTATGGATAAGAGTGAAGCCTATATAA
Combined CN776982 and CN770283 [gene 5]
(0)
NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTK
FGDFDVPAGSFLRIPIDSAHMNESVYHDPHSFRPQRFLT
()
GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALLQFEFSVDLKHLNY
KKLISITNKTVEPLWIRVKPI*
AGAACTTAGGTGTTCAAAACAAATTGAGAGAAGAGATAAAAAAGAATGT
ATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGT
TGTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAA
TCAAACAAAGTTTGTTGACTTTGATGTACCTGCTGGCTCTTTT
CYP4 like
>CN627429
tae92b11.y1
Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to
SW:CP51_CANGA
P50859 CYTOCHROME P450 51
GATAATGGTACTAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTG
GTTTTGGCTATCAATTTAACACAATTACTTCTCATTCTGGTAATGAGTTTACAAAAGCGCTTCAGAGTTA
TTGTCAACTACGATTTCAATTGAATGCCGTCCATAAAGCTCTACTAGCTTTCTTTCCATTTTTAATGCAT
CTGTCATTTATGTATGGAAAACGTAAACGAGCTGAGCAAGTCATCTGTAATACTTTAAACATGCTTATTA
ATAAACGCAAAAAAGAGATAGACCACCGAATAGCAGCTGATCAAAAAGATTTTTTAACAGTTGTTTTAAA
AGATCAACAAAAAGAAGGCAACAAGATGACAAATGACTTGATTAAAAATAATCTGATGACGCCTTTAATT
GCAGGTCACAAAACAACTTCCACTGACATGCCATGGTGTTTCAACGTGCTTGCGCCAAACCCAAGTGCTA
CCAAACACATGCAAAGAAACAGAAAGAAA
GAATACATCT CGACCACAAAA
31% to 4T5 aa 183-347
1
DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA 189
190
FFPFLMHLSFMYGKRKRAEQVICNTLNM
LINKRKKEIDHRIAADQKDFLTVVLK 351
352
DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540
CN770090
taf75f08.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5'
similar to TR:Q40411 Q40411 PUTATIVE CYTOCHROME P-450. ;.
Length = 299
Score = 113 bits (283), Expect = 1e-26
Identities = 54/56 (96%), Positives = 55/56 (98%)
Frame = -1
Query: 1   DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNA 56
           DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQL +
Sbjct: 170 DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLKS 3
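The "Identities = 54/56 (96%)" figure above can be re-derived directly from the two aligned rows. A minimal Python sketch follows, assuming the rows are already aligned to equal length with "-" for gaps. The name identity_stats is illustrative. "Positives" are not computed because that needs a substitution matrix, which this sketch leaves out.

```python
def identity_stats(query_row, sbjct_row):
    """Return (identical, aligned_length, percent) for two pre-aligned rows.

    Gap columns ('-') count toward the aligned length but never as identities.
    The percentage is rounded to the nearest integer.
    """
    if len(query_row) != len(sbjct_row):
        raise ValueError("rows must be aligned to the same length")
    identical = sum(1 for q, s in zip(query_row, sbjct_row) if q == s and q != "-")
    length = len(query_row)
    percent = int(100.0 * identical / length + 0.5)
    return identical, length, percent


# Rows copied from the CN627429 vs CN770090 hit above.
query = "DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNA"
sbjct = "DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLKS"
print(identity_stats(query, sbjct))
```

The same function can be used on the curated "N aa diffs" comparisons in these notes, provided the two peptides are first trimmed to the shared region.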
CN775805
tae77f11.x1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 3'
similar to TR:Q42700 Q42700 CYTOCHROME P450 ;.
Length = 562
Score = 58.5 bits (140), Expect = 5e-10
Identities = 27/27 (100%), Positives = 27/27 (100%)
Frame = +3
Query: 1   DNGTNIIVLDDLSNLSFDIIGDVGFGY 27
           DNGTNIIVLDDLSNLSFDIIGDVGFGY
Sbjct: 480 DNGTNIIVLDDLSNLSFDIIGDVGFGY 560
>Combined CN627429 CN775805 27% to 4T5 [gene 6]
GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK
ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH
WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK
1
DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKALLA 189
190
FFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLK 351
352
DQQKEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK 540
Combined CN776982 and CN770283 [gene 5] = DN812371.1
(0)
NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTK
FGDFDVPAGSFLRIPIDSAHMNESVYHDPHSFRPQRFLT
()
GEIPPLSFLTFGQGIYNCIGKNFALLEIKTFLVKALL
QFEFSVDLKHLNY
KKLISITNKTVEPLWIRVKPI*
AGAACTTAGGTGTTCAAAACAAATTGAGAGAAGAGATAAAAAAGAATGT
ATTTGATATAAAAAGTGTTTCATATGAAGAAGTTTTGAGTATCAAGTACTTAGATTGTGT
TGTTAAAGAAACATTGCGCATGCATACACCTGTAGCATTTATTGGTAGAATAAATAAGAA
TCAAACAAAGTTTGTTGACTTTGATGTACCTGCTGGCTCTTTT
EST DN812371.1 joins with CN627429 CN775805 and 1095901729505
GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK
ERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNGIFVSNYEDHH
WQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDK
DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQL
RFQLNAVHKALLAFFPFLMHLSFMYGKRKRAEQVICNTLNM
LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT
NLGVQNKLREEIKKNVFDIKSVSYEEVLSIKYLDCVVKETLRMHTPVAFIGRINKNQTKFGDFDV
PAGSFLRIPIDSAHMNESVYHDPYSFRPERFLTGEIPPLSFLTFGQGIYNCIGKNFALLE
IKTFLVKALLQFEFSVDLKHLNYKKLISITNKTVEPLWIRVKPI*
1095901729505 I-helix part of DN812371.1
1097675072038 1097672494604 1097567117390
(0)
LINKRKKEIEDGIAADQKDFLTVVLKDQQKEGSKMTNDLIKDNLMTLLIAGHETTSTAMQWCLYMLGT (0)
AGCTTATTAATAAACGCAAAAAAGAAATAGAAG
ATGGAATAGCAGCTGATCAAAAAGATTTTTTAACAGTTGTTTTAAAAGATCAACAAAAAG
AAGGCAGCAAGATGACAAATGACTTGATTAAAGATAATCTGATGACGCTTTTAATTGCTG
GTCACGAAACAACTTCTACTGCAATGCAATGGTGTTTATACATGCTTGGCACAGT
CYP17 like (4 different sequences)
>CN769570
taf31c10.y1
Hydra EST Darmstadt I Hydra magnipapillata cDNA 5' similar to
SW:CPT7_CHICK
P12394 CYTOCHROME P450 17
ACAGAAAATCTTACGATGAGAATAATTTACGTGACATAACCGATGCATTAATAAAAGTGTCTTTAGATTC
AGAGATGGGTGAAGAATTAACTGAAAAGATTACTGATGATAATATTGAGTTTCTTTTAAACGATTTTATG
ATTGCTGGATCCGAAACTTCATCAAGTACTATTCTTTGGTTTATTGTTTACATGTTACATTGGCCAGAAT
ACCAAAATAAACTTTATGATGAAATTACTAAAGTAGCATCAGATAACCGTTATGTATCTTTAAAGGATCG
ACCTATGCTTCATTTAATGCAAGCTACAATTCATGAAACACTTAGACTGTCATCGGTGGTACCTCTTGGT
TTGGTTCATAAAGCAATGGAGAACAGTAGCATTTGTGGCAAGTTTGTTCCTAAGGGAGCTCTTATTTTAA
CAAATTTATGGAGTATGCATCACGATGAAAGCTATTGGAAAAATGCAATGAGTTTTTACTCGGAACGTTG
GCTGGAAAAATCTGGCGAGTTCCATTATAAATTGGGGTACGCATAATTACCGTTTTCTATAGGG
35% to CYP17A zebrafish aa 266-449
3
RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF
IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQATIHETLRLSSVVPLGLVHK
AMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYSERWLEKSGEFHYKLGYA*LP
FSIG 554
>CN769290
opposite end = CN769570 taf31c10.y1
taf31c10.x1
Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to
SW:CPT7_RANDY
O57525 CYTOCHROME P450 17
TTATCTTGCTTTTCACGTTTCCTTGGTTATTCCATATTTTTGAAATTTTTAAATATTATATAAAGCAAAA
ATACAGAAAAGTGAAGCAAAAATAGTTAATTTCTTGGAATTATCACGACTTCAAAGTCATTAGGAGGGGA
GGTGATTCCAGAACGACCATCTAAACAAGGTAACTCTTTTCCAGTTGGCATTTCAAATCGGTAATCTTTA
AGTAATCGTGTAATAAACACAAACAACTCTGTTTTTGCCAATGTTTCTCCTAAACAACTACGAGGTCCAT
TAGAAAACGGTAAATATGCGTACCCCAATTTATAATTGAACTCGCCAGATTTTTCCAGCCAACGTTCCGG
GTAAAAACTCATTGCATTTTTCCAATAGCTTTCATCGTGATGCATACTCCATAAATTTGTTAAAATAAGA
GCTCCCTTAGGAACAAACTTGCCACAAATGCTACTGTTCTCCATTGCTTTATGAACCAAACCAAGAGGTA
CCACCGATGACAGTCTAAGTGTTTCATGAATTGTAGCTTGCATTAAATGAAGCATAGGTCGATCCTTTAA
AGATACATAACGGTTATCTGATGCTACTTTAGTAATTTCATCATAAAGTTTATTTTGGTATTCTGGCCAA
TGTAACATGTAAACAATAAACCAAAGAATAGTACTTGATGAAGTTTCGGATCCAGCAATCATAAAATCGT
TTACAAGAAACTCAATATTATCCTCAGT
42% to CYP17A aa 299-503
728
TEDNIEFLVNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR
PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK
NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM
PTGKELPCLDGRSGITSPPNDFEVVIIPRN*
>Combined seq from CN769290 and CN769570 39% to CYP17A [gene 7]
RKSYDENNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLNDFMIAGSETSSSTILWF
IVYMLHWPEYQNKLYDEITKVASDNRYVSLKDR
PMLHLMQATIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWK
NAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEM
PTGKELPCLDGRSGITSPPNDFEVVIIPRN*
>CN774619
tae83e09.x1
Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to
SW:CPT7_CHICK
P12394 CYTOCHROME P450 17
TTTAAACTACATAAGATTGTTTACTCTTGGAATCAAGTATGCGCAAAATTGTTTTGGAGCATGAGTAATA
TTGCATTTTCCAATTAAACTTGGAGGTGGACAACCAGGTGTGCACTCAAACTTAAAATCTCGAATTAATC
TAGAAAAGAAGAAAAATAGTTCATTTTCAGCAACTGTTTTCCCTAAACAAACTCTTGTACCGGCTGAAAA
AGGTAAGAAACTTGTTGCTTTACTTGGATCAAATTTCTTATCTTTGCCAATCCGCCGATAAGGAATTAAA
TTCATTAGGATTTTCCCAGGAGTTAGTAACGTTATGAATCTGCCAATGATTAATCATGACAGTAGCAATT
TTCAGGAATGCTATGTGACATGAGAACTCCCCCCCAGAGGT
38% to CYP17A2 Fugu aa 383-485
391
TSGGEFSCHIAFLKIATVMINHWQIHNVTNSWENPNEFN 275
273
PYRRIGKDKKFDPSKATSFLPFSAGTRVCLGKTVAENELFFFFSRLIRDFKFECTPGCPP 94
93 PSLIGKCNITHAPKQFCAYLIPRVNNLM* 7
>CN570733
same as CN570522 BP505786
tag42d11.y1
Hydra EST -Kiel 2 Hydra magnipapillata cDNA 5' similar to SW:CPT7_ORYLA
P70085
CYTOCHROME P450 17
AGCTGGTTTCTTCAAGACTTCTGTGAGGTAAACCTAATGGAATGACAGACGACAAACGCAGAGTTTCTTT
CATAGCACTTTCAAATAAATGAAGCTTTGGACGATCTGAAAGACTAGGATACCTATCATTACCGACTATT
TTAATAGTTTCATCATAGATATCATCTTGATACTTTGGCCAGTTAACTAAATAAAC
VYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS
>CN570522
same as CN570733
tag42d11.x1
Hydra EST -Kiel 2 Hydra magnipapillata cDNA 3' similar to SW:CPT7_ORYLA
P70085
CYTOCHROME P450 17
GGTGTTTATTTAGTTAACTGGCCAAAGTATCAAGATGATATCTATGATGAAACTATTAAAATAGTCGGTA
ATGATAGGTATCCTAGTCTTTCAGATCGTCCAAAGCTTCATTTATTTGAAAGTGCTATGAAAGAAACTCT
GCGTTTGTCGTCTGTCATTCCATTAGGTTTACCTCACAGAAGTCTTGAAGAAACCAGC
43% to CYP17 aa 326-389 same as BP505786
GVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFESAMKETLRLSSVIPLGLPHRSLEETS
>CN566859
opposite end = CN566581 taf98h10.x1
taf98h10.y1
Hydra EST -Kiel 1 Hydra magnipapillata cDNA 5' similar to SW:CPT7_SQUAC
Q92113
CYTOCHROME P450 17
TAATCATAAATGTGTATTTAAAATAAAAATTAAAACTCTTTTTGATAGACAATAGTTAGCTAATAGAAAT
TAGAAAAATTTTTAATTAACACATTTTGCCTAAAATTTAAAGGTTAAGCAAACATTTACTAATCGTTTCA
TTTTCAAATTATTTACTATTTATCTAGTATCTAATTTTTCAATCAATATAGTTTATATAAATTTAAAAAA
AAAAACATTAGCAAAAT
TAAAG CAATAAATTTTTACT CCTTGGAACTATTTCAACTTTAAAGTCATAAGG
AGTACAAGTGATGCCAAAACTACCTTCTAAGCGCGGTAATTCTTCTAAAATTGGTTTCACAAATCGGAAA
TCATTTATCAATCGTGATATAAAAATAAACAACTCAGTTTTTGCCAATGTTTCTCCAATACAGCTACGAG
GTCCACTAGAGAATGGTAAATAAGCGTTTCCTAGTTTAGAATTAAATTCGCCAGAATTTTCTAACCAACG
TTCAGGGTAAAAACTCATTGCATTTTTCCAGTAACTTTCGTCATGGTGTATACTCCATAGGTTGGTTATT
ATAAGAGCCCCTTT
GGGGAATAGGTTTTCCACAAATGCTACCTTTCTCC
Combined seq
Opposite end
MFLEVIGAVFIPPLIWTIWVYIKHLIDCLHYPRGPIPLPFIG
NGYLIRKAEPYKELVNLGKIYGDVFSFSVGSVRYVIVNSLEGIQEVLVKKGWQFAGRPKG
PSWDRSIHGLIQRDPSKKFKILRKLATSSLKIFADGLAGMESKAI
44% to 17A1 fugu aa 378-485 [gene 2]
607
RKVAFVENLFPKGALIITNLWSIHHDESYWKNAMSFYPERWLENSGEFNSKLGNAYLPFSSGPRSCIGETL 395
394
AKTELFIFISRLINDFRFVKPILEELPRLEGSFGITCTPYDFKVEIVPRSKNLLV* 227
CYP46 like (only one seq)
>CN775805
tae77f11.x1
Hydra EST Darmstadt I Hydra magnipapillata cDNA 3' similar to TR:Q42700
Q42700
CYTOCHROME P450
TCGGCATAGCAGTATTAATTTTTTTGTGTTTTTCACTGTTTTTTGCTAATATTTTAAAACGTTTTTATCA
TCCGCTTCGTAAGTTGCCATCACCTAAAGAAAATTTCTTTACTGCTCATTATGGCTACTTTAATGGCTAT
GATCAAATAAATGCTGTAATAAATTTTGGAAAACAGTTTAAAGAGCGTGGCTTGTATACATTAGATACAT
TAAATGGATTTAGATTTGTTAATCTTTTAATGCCAGAATTTATTAAAACAGTGTTTTCTGATGGAAACTC
ATTCCAAAGATCGACCGCTACAAAAGTTATATTTCCTCTAGTTGGAAATGGTATTTTTGTGTCAAATTAT
GAAGATCATCATTGGCAAAGAAAAGTGTTAAATGAAGCTTTTACTTTACAACAGCTAAAAAATTATTTTC
CAGCTTTTACAGTGCACATTGATTTGCTAATGAAACTTTGGTCATATTCATGTGACAAGGATAATGGTAC
TAACATAATTGTTTTGGATGACTTATCTAATTTATCATTTGATATAATTGGGGATGTTGGTTTTGGCTAT
CA
N-term 26% to CYP46a zebrafish aa 10-203
3
GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167
168
GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332
333
IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500
501
VLDDLSNLSFDIIGDVGFGY 560
>gi|47138506|gb|CN627429.1|CN627429
tae92b11.y1 Hydra EST Darmstadt I Hydra magnipapillata cDNA 5'
similar to SW:CP51_CANGA P50859 CYTOCHROME P450 51 ;.
Length = 540
Score = 58.5 bits (140), Expect = 5e-10
Identities = 27/27 (100%), Positives = 27/27 (100%)
Frame = +1
Query: 160 DNGTNIIVLDDLSNLSFDIIGDVGFGY 186
           DNGTNIIVLDDLSNLSFDIIGDVGFGY
Sbjct: 1   DNGTNIIVLDDLSNLSFDIIGDVGFGY 81
DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA
LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ
KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK
Combined CN775805 and CN627429 [gene 8]
3
GIAVLIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINF 167
168
GKQFKERGLYTLDTLNGFRFVNLLMPEFIKTVFSDGNSFQRSTATKVIFPLVGNG 332
333
IFVSNYEDHHWQRKVLNEAFTLQQLKNYFPAFTVHIDLLMKLWSYSCDKDNGTNII 500
501
VLDDLSNLSFDIIGDVGFGY 560
QFNTITSHSGNEFTKALQSYCQLRFQLNAVHKA
LLAFFPFLMHLSFMYGKRKRAEQVICNTLNMLINKRKKEIDHRIAADQKDFLTVVLKDQQ
KEGNKMTNDLIKNNLMTPLIAGHKTTSTDMPWCFNVLAPNPSATKHMQRNRKKEYISTTK
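The combined sequence above joins two EST translations through the 27 aa they share (DNGTNIIVLDDLSNLSFDIIGDVGFGY). A minimal Python sketch of that kind of join is shown here. The name merge_by_overlap and the min_overlap default are illustrative, and the sketch assumes the overlap is an exact match, which will not hold for pairs noted elsewhere in these notes as having a few aa differences.

```python
def merge_by_overlap(left, right, min_overlap=10):
    """Join two peptides where the end of `left` exactly matches the start of `right`.

    Returns (merged_sequence, overlap_length). The longest exact overlap wins.
    Raises ValueError if no overlap of at least `min_overlap` residues exists.
    """
    longest = min(len(left), len(right))
    for k in range(longest, min_overlap - 1, -1):
        if left.endswith(right[:k]):
            return left + right[k:], k
    raise ValueError("no exact overlap of the required length")


# End of the CN775805 translation and start of the CN627429 translation.
n_term_end = "WSYSCDKDNGTNIIVLDDLSNLSFDIIGDVGFGY"
c_term_start = "DNGTNIIVLDDLSNLSFDIIGDVGFGYQFNTITSHSG"
merged, overlap = merge_by_overlap(n_term_end, c_term_start)
print(overlap, merged)
```

Requiring a minimum overlap guards against joining fragments on a short chance match; the threshold of 10 residues is an arbitrary choice for this sketch.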
BP514308
BP514308 Hydra magnipapillata cDNA library Hydra magnipapillata cDNA clone hydmg002bw_87.
Length = 586
Score = 66.6 bits (161), Expect = 1e-12
Identities = 28/56 (50%), Positives = 37/56 (66%)
Frame = +2
Query: 5  LIFLCFSLFFANILKRFYHPLRKLPSPKENFFTAHYGYFNGYDQINAVINFGKQFK 60
          +I +    F A   KRFYH  R LPSPKE+  T HY YF+ +D +N ++NFGK+FK
Sbjct: 53 IIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFK 220
>BP514308 N-term 25% to 46a [gene 9]
1096761991009
1096082187152 1096123591182 1095899045709 1097383004013
MYSIYIAIIIVPLVFFVAVFFKRFYHQFRLLPSPKESLITCHYSYFDVHDHVNTLLNFGKEFKDYGLYTINTLI
(1)
GPRQVHLLLPHFIKTVIADGKFFQRSPVFKAVFPLVGNSMIVSNYEDHHWQRKLF
NQAFTSQQLKRYFLAFTLHTDLLMK
(0)
LWSCTCDKENGTNLNVWSDLSNLSFDIIGDVGFGYQFNTITSHSGNAFTKALRSY
INLRFNSSVVHNVLIAYFPFLMRFLSKFGNLNKAEQVIYNTLNM
(0)
ATGTATTCGATATACATAGCGATTATAATAGTTC
CTTTAGTTTTTTTCGTTGCTGTTTTTTTTAAACGTTTTTATCATCAATTTCGTTTGTTGC
CATCACCCAAAGAGAGTTTAATTACATGTCACTATAGTTATTTTGATGTTCATGACCATG
TTAACACTCTGTTAAACTTTGGTAAAGAGTTTAAAGATTACGGATTATATACAATTAATA
CTTTAATTGGT
1096041094868
1096013042315 1097383004013 1097675345181 1096602222013
AGCTCTGGTCATGCACATGTGATAAAGAAAATGGCACTAACTTAAACGTTTGGAGTGACTTGTCTAATCTTTC
ATTTGATATAATTGGTGACGTTGGTTTTGGCTATCAATTCAACACTATTACATCTCATTC
TGGAAATGCGTTTACAAAAGCACTTCGAAGTTATATTAACTTACGATTTAATTCTAGCGT
AGTGCACAATGTTCTAATAGCTTATTTTCCATTCTTAATGCGTTTTTTATCAAAGTTTGG
AAATCTTAATAAAGCTGAGCAAGTTATTTACAATACCCTGAACATGGT
>BP508840
BP508840 Hydra magnipapillata cDNA library Hydra magnipapillata
cDNA
clone hmp_03437.
Length = 452
Blast with CYP20 Fugu C-term
Query:
88
VDQHLIPKESLVIYALGVILQDSDTWNAPYRFDPDRFEEESVKK----SFHLLGFSGSQT 143
VD
HLIPK++ +I ALG + QD + P RFDPDRF ++ +++ +F GF+G +
Sbjct:
18
VDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGFAGKRK 197
Query:
144 CPELRFAYTVATVLLSVLVRQLKLHRLKDTLMEVRSELVSTPRDETWI 191
CP R AY
+++ + +++ V+ P +E WI
Sbjct:
198 CPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWI 341
Note: this sequence's best match in Fugu, human and Ciona is CYP20
DYDIIVDGHLIPKKTPIILALGTVFQDETIFPEPDRFDPDRFSDKQIEERSALAFQPFGF
AGKRKCPGYRLAYAETLTYTFYIIKNFHISLFDKQSVKMHYGFVTKPSEEIWIKVLRRKNI*
>gnl|ti|647066038
1095898227332
Length = 1123
Score = 59.7 bits (143), Expect = 5e-08
Identities = 66/274 (24%), Positives = 119/274 (43%), Gaps = 21/274 (7%)
Frame = -3
Query: 226
PEAGSKRETEFLKHRRVLEDIIRRIIQERKEGEDLQELPFI-DSMLQ-NYDSE------D 277
P A
S+ E ++ R + I++R +QE ++ D L I D++++
+ DSE +
Sbjct: 866
PTATSRNIFEIIRLR---DPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTE 696
Query: 278
KIIADAISF-----MVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXK 332
KI D I F M+ G TS W + Y+ PE Q+
K
Sbjct: 695
KITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLK 516
Query: 333
EYSLRADTFLRQVQDETIRLSTLAPWA-ARYSDKKVTVCGYTIPAKTPMIHALGVGLKNK 391
+ + ++
ET+RLS++ P
+ + ++CG +P ++
L ++
Sbjct: 515
DRPML--HLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDE 342
Query: 392
TVWENTDSWDPDRFSP-----NGRRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLS- 445
+
W+N S+ P+R+ N + G + PF + R C G + E+ VF + LL
Sbjct: 341
SYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFS-NGPRSCLGETLAKTELFVFITRLLKD 165
Query: 446
-RFEIVPVEGQTVIQVHGLVTEPKDDIKIYIRSR 478
RFE+ + + +T P +D ++ I R
Sbjct: 164
YRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPR 63
37% to 2U1 fugu
866
PTATSRNIFEIIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTE 696
695
KITDDNIEFLLNDFMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLK 516
515
DRPMLHLMQAAIHETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDE 342
341
SYWKNAMSFYPERWLEKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKD 165
164
YRFEMPTGKELPCLDGRSGITSPPNDFEVVIIPR 63
>gnl|ti|648017453
1095896110991
Length = 1042
Score = 52.0 bits (123), Expect = 1e-05
Identities = 58/226 (25%), Positives = 95/226 (42%), Gaps = 19/226 (8%)
Frame = -1
Query: 241
RVLEDIIRRIIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISFMVG--- 289
R+ +
I++R +QE ++ D L I L DS K+ D I F++
Sbjct: 697
RLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILNDLI 518
Query: 290
--GFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQD 347
G TS TW + Y+ +PE QD
+ L L+
Sbjct: 517
LAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLL--HLLQATIH 344
Query: 348
ETIRLSTLAPWAARY-SDKKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWDPDRF- 405
ET+RLS++AP R+ +
+ T+C + T
+I L ++
W+N S+ P+R+
Sbjct: 343
ETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWL 164
Query: 406
SPNG----RRGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRF 447
+ G + GN + PF
R C G + E+ V S L++ F
Sbjct: 163
NETGEFDYKLGNAYIPFS-GGPRACLGETLAKTELFVIISRLVTDF 29
38% to 17A1 fugu 37% to 2U1 fugu
697
RLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILNDLI 518
517
LAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATIH 344
343
ETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWL 164
163
NETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29
>gnl|ti|655005893
1095958068757
Length = 952
Score = 44.3 bits (103), Expect = 0.002
Identities = 33/145 (22%), Positives = 59/145 (40%), Gaps = 5/145 (3%)
Frame = -2
Query: 265
FIDSMLQNYDSEDKIIADAI-----SFMVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXX 319
F+D
+L Y + KI + I +FM G
T+ W LW L +P+ Q
Sbjct: 438
FLDLLLDIY-RKGKIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEI 262
Query: 320
XXXXXXXXXXXXKEYSLRADTFLRQVQDETIRLSTLAPWAARYSDKKVTVCGYTIPAKTP 379
K +R +L + E++R+ P
R ++ +T+ G +P
Sbjct: 261
DEIELNGGSLYDK---VRQSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQ 91
Query: 380
MIHALGVGLKNKTVWENTDSWDPDR 404
++ + + N WEN + + P+R
Sbjct:
90 IVLLVLILHSNPDYWENPNDFIPER 16
44% to 4V5 fugu 42% to 4T5
438
FLDLLLDIYRKGKIDTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEI 262
261
DEIELNGGSLYDKVRQSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQ 91
90 IVLLVLILHSNPDYWENPNDFIPER 16
>gnl|ti|655009968
1095963046224
Length = 1057
Score = 42.0 bits (97), Expect = 0.010
Identities = 21/47 (44%), Positives = 26/47 (55%)
Frame = +2
46% to CYP20 35% to 27B1
Query:
65
GSLHQFLLHLHDNGKTPVTSFWWGKTHVVSFCSPQAFKESAVFVNRP 111
G
+H+FL+ H P+ SF+WGK VS
P FKE A NRP
Sbjct: 422
GGIHKFLVENHKR-LGPMFSFYWGKELAVSLACPILFKEVATLFNRP 559
>gnl|ti|646862798
1095898098005
Length = 963
Score = 41.2 bits (95), Expect = 0.018
Identities = 56/246 (22%), Positives = 99/246 (40%), Gaps = 18/246 (7%)
Frame = -3
Query: 235
EFLKHRRVLEDIIRRIIQE--RKEGEDLQELPFIDSMLQN----YDSEDKIIADA----- 283
E + R VL + + +++
RK E+ + + DSM+Q+ YD
D A
Sbjct: 793
EIKQARLVLNPWVEKKVEDHWRKYNEN-EIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617
Query: 284
-ISFMVGGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFL 342
I +V G T+ WM+ YL +PE Q+
++ L
Sbjct: 616
LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEK---NLFPLL 446
Query: 343
RQVQDETIRLSTLAPW-AARYSDKKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWD 401
+ ET+R++++ P A + K ++CG IP +I
L + ++N + +D
Sbjct: 445
QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266
Query: 402
PDRF-SPNGR----RGNDFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEIVPVEGQT 456
P R+ +
NG F PF R C G + ++ + S L+ F G+
Sbjct: 265
PKRWINENGLFDSISQKYFKPFSA-GARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89
Query: 457
VIQVHG 462
+ + G
Sbjct:
88 LPSLEG 71
35% to 17A1 34% to 2P4
793
EIKQARLVLNPWVEKKVEDHWRKYNEN-EIINVTDSMIQHFLTKYDGLDTDFAKKYITLL 617
616
LIELLVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLL 446
445
QAFIQETLRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFD 266
265
PKRWINENGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD 89
88 LPSLEG 71
>gnl|ti|651477674
1095901303788
Length = 819
Score = 38.5 bits (88), Expect = 0.11
Identities = 40/175 (22%), Positives = 69/175 (39%), Gaps = 7/175 (4%)
Frame = +2
Query: 289
GGFHTSGYMFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQDE 348
G F
T+ TW+++YL P Q
+++ ++ E
Sbjct:
29
GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFN---DIKSMPIMQATILE 199
Query: 349
TIRLSTLAPWAARYSD-KKVTVCGYTIPAKTPMIHALGVGLKNKTVWENTDSWDPDRF-S 406
T+RLS++ P + + + +TIP T +I L N+ WE ++P R+
Sbjct: 200
TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD 379
Query: 407
PNGRRGN----DFCPFGVHSRRKCPGYLFSYFEVGVFASILLSRFEI-VPVEGQT 456
NG
+ PF R C
G F+ ++ + S L+ F +P G+T
Sbjct: 380
KNGELSTAKRLGYFPFSA-GPRGCIGESFARMQMFIICSRLIKDFSFELPQSGET 541
39% to CYP21 39% to 2R1 40% to 2P4
29
GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILE 199
200
TLRLSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLD 379
380
KNGELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGET 541
>gnl|ti|654998190
1095901734433
Length = 1030
Score = 38.1 bits (87), Expect = 0.15
Identities = 48/200 (24%), Positives = 74/200 (37%), Gaps = 21/200 (10%)
Frame = -2
Query: 197
FGDIFKDENELSKMAESYHVCWRTMEEGVPEAGSKRETEFLKHRRVLEDIIR-------R 249
F DI
K NE S ++ + W P A
S+
K++ + +IIR R
Sbjct: 870
FQDIIKTHNETSYISS---IPWLRY---FPTATSRNMIILNKNKYNIFEIIRLRDPILIR 709
Query: 250
IIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISF-----MVGGFHTSGY 296
+QE K D L + L DS
+KI D F M+ G
TS
Sbjct: 708
KLQEHKRTYD*SNLGDVTDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 529
Query: 297
MFTWMLWYLSSHPESQDXXXXXXXXXXXXXXXXXXKEYSLRADTFLRQVQDETIRLSTLA 356
M W + Y+ PE QD
+ ++ + ET+RL ++A
Sbjct: 528
MILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRP--RFHLIQAITHETLRLLSVA 355
Query: 357
PWA-ARYSDKKVTVCGYTIP 375
P + + ++CG +P
Sbjct: 354
PLGLCHKAMENGSICGKFVP 295
30% to CYP21 25% to 1C2
870
FQDIIKTHNETSYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIR 709
708
KLQEHKRTYD*SNLGDVTDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 529
528
MILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVA 355
354
PLGLCHKAMENGSICGKFVP 295
>gnl|ti|651148169
1095901003210
Length = 1130
Score = 37.4 bits (85), Expect = 0.25
Identities = 39/137 (28%), Positives = 54/137 (39%), Gaps = 20/137 (14%)
Frame = -2
Query: 197
FGDIFKDENELSKMAESYHVCWRTMEEGVPEAGSKRETEFLKHRRVLEDIIR-------R 249
F DI
K NE S ++ + W P A
S+
K++ + +IIR R
Sbjct: 544
FQDIIKTHNETSYISS---IPWLRY---FPTATSQNMIILNKNKYNIFEIIRLRDPILIR 383
Query: 250
IIQERKEGEDLQELPFIDSML--------QNYDSEDKIIADAISF-----MVGGFHTSGY 296
+QE K D L + +L DS
+KI D F M+ G
TS
Sbjct: 382
KLQEHKRTYD*SNLGDVTDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 203
Query: 297
MFTWMLWYLSSHPESQD 313
M W + Y+ PE QD
Sbjct: 202
MILWFIVYILHRPEYQD 152
Same as above
544
FQDIIKTHNETSYISSIPWLRYFPTATSQNMIILNKNKYNIFEIIRLRDPILIR 383
382
KLQEHKRTYD*SNLGDVTDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSN 203
202
MILWFIVYILHRPEYQD 152
Database: fasta.hydra_magnipapillata.001
Posted date: May 16, 2005 8:55 AM
Number of letters in database: 513,442,738
Number of sequences in database: 500,000
gnl|ti|647066038 1095898227332  60 5e-08
gnl|ti|648017453 1095896110991  52 1e-05
gnl|ti|655005893 1095958068757  44 0.002
gnl|ti|655009968 1095963046224  42 0.010
gnl|ti|646862798 1095898098005  41 0.018
gnl|ti|651477674 1095901303788  39 0.11
gnl|ti|654998190 1095901734433  38 0.15
gnl|ti|651148169 1095901003210  37 0.25
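The hit summary above lists, for each hit, a trace identifier, a read number, a bit score and an E-value. A minimal Python sketch for loading such a list is given here. It splits on any whitespace, so it works whether a hit sits on one line or is wrapped over several. The names Hit and parse_hits are illustrative and do not come from BLAST or any parser library.

```python
from collections import namedtuple

# One row of the hit summary: trace id, read number, bit score, E-value.
Hit = namedtuple("Hit", ["trace_id", "read_id", "score", "evalue"])


def parse_hits(text):
    """Parse a whitespace-separated hit summary into a list of Hit records.

    Assumes every hit contributes exactly four tokens in a fixed order.
    """
    tokens = text.split()
    if len(tokens) % 4 != 0:
        raise ValueError("expected four fields per hit")
    hits = []
    for i in range(0, len(tokens), 4):
        trace_id, read_id, score, evalue = tokens[i:i + 4]
        hits.append(Hit(trace_id, read_id, int(score), float(evalue)))
    return hits


# Two hits in two different wrappings.
sample = """gnl|ti|647066038
1095898227332
60 5e-08
gnl|ti|648017453 1095896110991 52 1e-05
"""
for hit in parse_hits(sample):
    print(hit.trace_id, hit.read_id, hit.score, hit.evalue)
```

Once parsed, hits can be filtered by E-value, for example keeping only those below a chosen cutoff before walking the reads by hand.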
CYP21danio search
gnl|ti|647066038 1095898227332 177 2e-43
gnl|ti|649400787 1095898835518 153 3e-36
gnl|ti|648017453 1095896110991 150 2e-35
gnl|ti|647182814 1095899213949 142 8e-33
gnl|ti|646862798 1095898098005 141 1e-32
gnl|ti|651477674 1095901303788 141 1e-32
gnl|ti|647193621 1095899233960 133 3e-30
gnl|ti|647987527 1095895119635  97 4e-19
gnl|ti|651162328 1095901096755  96 9e-19
gnl|ti|654998190 1095901734433  91 2e-17
gnl|ti|655006784 1095958075467  81 2e-14
gnl|ti|651148169 1095901003210  74 3e-12
gnl|ti|648033522 1095897342515  72 8e-12
gnl|ti|647134594 1095899118747  72 8e-12
gnl|ti|648485307 1095899272864  71 2e-11
gnl|ti|648026854 1095896933215  70 4e-11
gnl|ti|651118815 1095900033599  70 4e-11
gnl|ti|655005893 1095958068757  60 5e-08
gnl|ti|647175227 1095898288652  45 5e-07
gnl|ti|648589386 1095733042694  56 8e-07
gnl|ti|649393684 1095898809307  54 2e-06
gnl|ti|646849327 1095897329284  51 2e-05
gnl|ti|646968536 1095898162561  49 9e-05
gnl|ti|647168675 1095899196297  48 2e-04
gnl|ti|649448444 1095899351259  33 0.079
gnl|ti|648014530 1095896049543  35 1.9
gnl|ti|655009845 1095963045220  34 3.2
gnl|ti|648592188 1095595897239  34 3.2
gnl|ti|653058100 1095949490108  34 3.2
>gnl|ti|647066038
1095898227332
Length = 1123
Score = 177 bits (449), Expect = 2e-43
Identities = 109/320 (34%), Positives = 168/320 (52%), Gaps = 8/320 (2%)
Frame = -3
Query:
215
NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274
N+I T+ F+ Y+ + E Q + + + IV + S + S PLLR FP + +
Sbjct:
1010 NIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNET--SYVSSIPLLRYFPTATSRNIFE 837
Query:
275
EVARRDELIGKHIEEFKKSEHKEG-GTLTSSLLKC-LEPQQGAANHXXXXXXXXXXXXXX 332
+ RD ++ + ++E +KS K +T +L+K L+ + G
Sbjct:
836
IIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLND 657
Query: 333
XLIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLCALIS 391
+I G+ET ++ + W + ++LH PE Q+K+Y+E+
V D RY DR L
+ A I
Sbjct:
656
FMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQAAIH 477
Query:
392 EMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFL
451
E LRL V PL + H+A+ NSSI G
F+PK +I+ NL+ HHD W + SF
PER+L
Sbjct:
476
ETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYPERWL 297
Query:
452
EGGGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPEL 506
E G L +PF G R CLGE +AK E+F+F LL++++F +P KE LP L
Sbjct:
296
EKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEMPTGKE--LPCL 123
Query:
507 RGVASVVLKVKPYTVIAHPR 526
G + +
+ V+ PR
Sbjct:
122 DGRSGITSPPNDFEVVIIPR 63
34% to 17A1, 35% to 2U1 fugu, 33% to 2U1 human
1010 NIICTILFNHRYEDDNQEFQNIIKYSSLIVQTFNETSYVSSIPLLRYFPTATSRNIFE 837
836 IIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLND 657
656 FMIAGSETSSSTILWFIVYMLHWPEYQNKLYDEITKVASDNRYVSLKDRPMLHLMQAAIH 477
476 ETLRLSSVVPLGLVHKAMENSSICGKFVPKGALILTNLWSMHHDESYWKNAMSFYPERWL 297
296 EKSGEFNYKLGYAYLPFSNGPRSCLGETLAKTELFVFITRLLKDYRFEMPTGKELPCL 123
122 DGRSGITSPPNDFEVVIIPR 63
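A quick consistency check for the coordinate-flanked fragments: in a translated search each residue covers three nucleotides of the read, so the span between the two flanking coordinates should equal three times the number of residues. The sketch below assumes coordinates are inclusive nucleotide positions and that any `-` left in a fragment is an alignment gap rather than a residue. The helper name `span_matches` is hypothetical.

```python
def span_matches(start, end, residues):
    """Check that the nucleotide span equals 3 nt per residue.

    start/end are the inclusive read coordinates flanking the fragment;
    on a minus frame start is larger than end, so the absolute
    difference is used. Gap characters ('-') are not counted.
    """
    nucleotides = abs(end - start) + 1
    codons = len(residues) - residues.count("-")
    return nucleotides == 3 * codons


# Two minus-frame fragments from read 1095898227332.
row_a = span_matches(
    836, 657,
    "IIRLRDPILKRKLQEHRKSYDKNNLRDITDALIKVSLDSEMGEELTEKITDDNIEFLLND",
)
row_b = span_matches(122, 63, "DGRSGITSPPNDFEVVIIPR")
print(row_a, row_b)
```

A row that fails this check usually means residues were dropped or a gap was closed while copying the fragment out of the alignment.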
>gnl|ti|649400787
1095898835518
Length =
1120
Score = 153 bits (387), Expect = 3e-36
Identities = 97/309 (31%), Positives =
161/309 (52%), Gaps = 11/309 (3%)
Frame = +2
Query: 211
VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270
VA NVI ++ F K Y+ + E +++ +N + +
G +A+ P LR P
Sbjct:
44
VAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFT--GVAGTNAISFIPWLRFLPLDGLR 217
Query: 271
RLMKEVARRDELIGK----HIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXX 326
+L K
++ RD ++ K H E +
+S ++ T
+++ +
Sbjct: 218
KLKKGLSIRDPVLRKQLLYHRETYNESNLRD---YTDYVIQFSRDEAILKKFGEQLTDDY 388
Query: 327
XXXXXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDV-RYPQYSDRHKLP 384
+ I GTET L W++
+L+H P+ QDK+Y E+ + RYP DR+ LP
Sbjct: 389
LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP 568
Query: 385
YLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHF-IPKNTIIIPNLYGAHHDPEVWDDPY 443
+ A +SE LRL V PL VPH+A+
++++ IPK T I+ NL+ HH+ W++P+
Sbjct: 569
LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH 748
Query: 444
SFKPERFLEGGG--GSLRSL--IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASK 499
F P R+ S++S+ +PF G R+CLG+ +A++E+FLF + L+R+FKF
Sbjct: 749
EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKF-EVKP 925
Query: 500
EEPLPELRG 508
+ LP L G
Sbjct: 926
GDSLPSLYG 952
>gnl|ti|649400787
1095898835518 44% to 1095898227332, 39% to 17A1 fugu, 35% to 2U1
44 VAVLNVICSIVFGKRYEYENCEFKEILTYMNYVFTGVAGTNAISFIPWLRFLPLDGLR 217
218 KLKKGLSIRDPVLRKQLLYHRETYNESNLRDYTDYVIQFSRDEAILKKFGEQLTDDY 388
389 LELLLNDIFIAGTETALTTLLWSIIYLIHWPKFQDKIYNEIVSAIGKNRYPSMKDRNMLP 568
569 LVNAALSETLRLSSVTPLGVPHKAMEDTTLLNDLKIPKGTTILTNLWQLHHNKNCWENPH 748
749 EFNPYRWFTNDQTLDSIKSMNFLPFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKP 925
926 GDSLPSLYG 952
>gnl|ti|648017453
1095896110991
Length =
1042
Score = 150 bits (380), Expect = 2e-35
Identities = 98/286 (34%), Positives =
150/286 (52%), Gaps = 8/286 (2%)
Frame = -1
Query: 215
NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274
N+I T+
F++ Y++ E Q + + N + + + L S P LR FP S+ ++
Sbjct: 877
NIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSAS--NLLSSIPWLRYFPTTA-SKYIQ 707
Query: 275
EVAR-RDELIGKHIEEFKKSEHKEG-GTLTSSLLKCLEPQQGAANHXXXXXXXXXXXXXX 332
E+ R
RD ++ + ++E +KS + +T +L+K
+
Sbjct: 706
EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527
Query: 333
XLI-GGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDV-RYPQYSDRHKLPYLCALI 390
LI G+ET ++ + W + ++LH PE
QDK++ E+ V RYP +DR L L A I
Sbjct: 526
DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347
Query: 391
SEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERF 450
E LRL VAPL + H+A+ NS+I + K T+II NL+ HHD W +P SF PER+
Sbjct: 346
HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167
Query: 451
LEGGGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREF 492
L G L IPF GG R CLGE +AK E+F+ + L+ +F
Sbjct: 166
LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29
877 NIICTILFNQRYEQDDDEFQNIIKYSNLSFKAFSASNLLSSIPWLRYFPTTASKYIQ 707
706 EIERLRDPILKRKLQEHRKSYDENNLRDITDALIKASIHLNAEKDSLIKVTDDNIQFILN 527
526 DLILAGSETSSSTITWFIVYMLHYPEYQDKIFNEVIKVTSGNRYPCLNDRPLLHLLQATI 347
346 HETLRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERW 167
166 LNETGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDF 29
>gnl|ti|647182814
1095899213949
Length =
1074
Score = 142 bits (357), Expect = 8e-33
Identities = 89/291 (30%), Positives =
145/291 (49%), Gaps = 7/291 (2%)
Frame = +2
Query: 211
VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270
VA NVI + F + Y S
++ +N IVS G +A+D
P LR
Sbjct:
83
VAVLNVICFIVFGERYQYSDPAFIEILTTINNIVS--GLSNTTAVDFLPGLRYLQFSEIK 256
Query: 271
RLMKEVARRDELIGKHIEEFKKS-EHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXXXXX 329
+L + L+ +++ KK+ + T
S++K + +
Sbjct: 257
KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436
Query: 330
XXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLC 387
+ I G+ET L W +
+++H P+ Q++++EE+ V+ + RYPQ
SDR L +
Sbjct: 437
VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616
Query: 388
ALISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKP 447
A I E
LRL + PL VPH+ + ++++ G+ IPKNT +I N
+ H+D W +P F P
Sbjct: 617
ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796
Query: 448
ERFLEG----GGGSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF 494
R+++ S
+PF G R+CLG+ VA+ E+F F L+R+FKF
Sbjct: 797
HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKF 949
>gnl|ti|647182814
1095899213949 54% to 1095898835518, 36% to 17A1, 36% to 2U1
83 VAVLNVICFIVFGERYQYSDPAFIEILTTINNIVSGLSNTTAVDFLPGLRYLQFSEIK 256
257 KLKSSLVIYFRLLNDQLKKHKKTFDENNIRDFTDSIIKFSKDETMENKFEEELTDEHLEH 436
437 VIGDMFIAGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPQLSDRDSLHLVK 616
617 ASIKECLRLSSIIPLGVPHKTMSDTTLIGYNIPKNTTVIINHWQIHNDTNHWKNPNEFNP 796
797 HRWIDDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKF 949
>gnl|ti|646862798
1095898098005
Length = 963
Score = 141 bits (356), Expect = 1e-32
Identities = 71/193 (36%), Positives =
113/193 (58%), Gaps = 4/193 (2%)
Frame = -3
Query: 334
LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRHKLPYLCALISEM 393
L+
GTET A + W V +L+H PE Q+++Y+E+ + RYP ++++ P L A I E
Sbjct: 604
LVAGTETTAITICWMVLYLIHNPEYQEEIYKEITLNIGCRYPTLAEKNLFPLLQAFIQET 425
Query: 394
LRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453
LR+ V PL + H+A++++SI G IPK+ I+I NL+ HHD +
+P F P+R++
Sbjct: 424
LRITSVVPLNLAHKALKDTSICGKIIPKDAIVITNLWNLHHDNRYFKNPNEFDPKRWINE 245
Query: 454
GG----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGV 509
G S
+ PF GAR+CLGE +AK ++FL + L+ F F A ++ LP L G
Sbjct: 244
NGLFDSISQKYFKPFSAGARVCLGETLAKNQLFLIISGLIMNFIFTSAPGKD-LPSLEGQ 68
Query: 510
ASVVLKVKPYTVI 522
+ + + V+
Sbjct:
67 FGITFRPNSFKVL 29
>gnl|ti|651477674
1095901303788
Length = 819
Score = 141 bits (355), Expect = 1e-32
Identities = 75/196 (38%), Positives =
111/196 (56%), Gaps = 5/196 (2%)
Frame = +2
Query: 336
GGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRHKLPYLCALISEMLR 395
G T AA + W + +LLH P Q
+Y+E+ V +YP ++D +P + A I E LR
Sbjct:
29 GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILETLR
208
Query: 396
LRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEGGG 455
L V PL++ H+A+ N+ I IPK+TIII NL+G HH+ + W+ P+ F
P R+L+ G
Sbjct: 209
LSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLDKNG 388
Query: 456
----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPELRGVA 510
PF G R C+GE+ A+M+MF+ + L+++F F LP S E P+L G
Sbjct: 389
ELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGE--TPKLDGDI 562
Query: 511
SVVLKVKPYTVIAHPR 526
+ L PY +A R
Sbjct: 563
GITLTPLPYNAVAKQR 610
29 GWFRTTAATITWLIFYLLHWPHYQSILYKEIKNVCGDQYPTFNDIKSMPIMQATILETLR 208
209 LSSVVPLSLSHKAVNNAKINKFTIPKDTIIITNLWGVHHNEKYWEKPFEFNPMRWLDKNG 388
389 ELSTAKRLGYFPFSAGPRGCIGESFARMQMFIICSRLIKDFSFELPQSGETPKLDGDI 562
563 GITLTPLPYNAVAKQR 610
>gnl|ti|647193621
1095899233960
Length =
1050
Score = 133 bits (335), Expect = 3e-30
Identities = 96/310 (30%), Positives =
149/310 (48%), Gaps = 10/310 (3%)
Frame = +2
Query: 215
NVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFSRLMK 274
NV+ + F Y+++ EL+K+ I+ G A+ P LR FP+ ++ K
Sbjct: 110
NVLCGIVFGTQYEENDKELEKVISFKQLILD--GVADTFAISFLPWLRFFPSNGLKKVRK 283
Query: 275
EVARRDELIG----KHIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXXXXXX 330
V RD+L+ KH E + + ++ T
+LK + + + N
Sbjct: 284
GVLIRDKLLRFQLKKHRETYNPVQIRD---YTDYVLKYSKEFETSRNIDEQLSEDNMEMM 454
Query: 331
XXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLPYLCA 388
+ I G+ET + L W +L++ P+ QD +Y+E ++ + RYP SDR KL +
Sbjct: 455
LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES 634
Query: 389
LISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPE 448
+ E LRL V PL +PHR++ +SI IPKNT ++ NL+ HHD + W DP++F P
Sbjct: 635
AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY 814
Query: 449
RFLEGGG----GSLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLP 504
R+L
+ +PF G R CLG + +FLF L+R+F L P
Sbjct: 815
RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFN-LCLKPGASTP 991
Query: 505
ELRGVASVVL 514
L GV V L
Sbjct: 992
SLNGVLRVTL 1021
>gnl|ti|647193621
1095899233960 50% to 1095898835518, 37% to 17A1
110 NVLCGIVFGTQYEENDKELEKVISFKQLILDGVADTFAISFLPWLRFFPSNGLKKVRK 283
284 GVLIRDKLLRFQLKKHRETYNPVQIRDYTDYVLKYSKEFETSRNIDEQLSEDNMEMM 454
455 LQDIFISGSETTISTLLWFAVYLVNWPKYQDDIYDETIKIVGNDRYPSLSDRPKLHLFES 634
635 AMKETLRLSSVIPLGLPHRSLEETSIKKFKIPKNTNVMINLWQLHHDSKSWSDPHTFNPY 814
815 RWLNDKNIFDKSKNPNYLPFSTGLRACLGYHTTESIIFLFFTRLIRDFN-LCLKPGASTP 991
992 SLNGVLRVTL 1021
>gnl|ti|647987527
1095895119635
Length =
1003
Score = 96.7 bits (239), Expect = 4e-19
Identities = 57/137 (41%), Positives =
74/137 (54%), Gaps = 4/137 (2%)
Frame = +3
Query: 394
LRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEG 453
LRL VAPL + H+A+ NS+I + K T+II NL+ HHD W +P SF PER+L
Sbjct:
18
LRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWLNE 197
Query: 454
GGGSLRSL----IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGV 509
G L IPF GG R CLGE +AK E+F+ + L+ +F F S EE LP L
Sbjct: 198
TGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYF-EKSVEEDLPRLDSF 374
Query: 510
ASVVLKVKPYTVIAHPR 526
V + V+ R
Sbjct: 375
PGVTRSPYDFKVVVVSR 425
>gnl|ti|647987527
1095895119635 Same as 1095896110991
18 LRLSSVAPLGLRHKAMENSTICDKPVLKGTLIITNLWSIHHDERYWKNPMSFYPERWLNE 197
198 TGEFDYKLGNAYIPFSGGPRACLGETLAKTELFVIISRLVTDFYFEKSVEEDLPRLDSF 374
375 PGVTRSPYDFKVVVVSR 425
>gnl|ti|651162328
1095901096755
Length = 986
Score = 95.5 bits (236), Expect = 9e-19
Identities = 59/181 (32%), Positives =
94/181 (51%), Gaps = 5/181 (2%)
Frame = -1
Query: 334
LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLD-VRYPQYSDRHKLPYLCALISE 392
+I
G+ET + ++ W + ++LHRPE QDK+Y+E+
V + YP +DR + + A+I E
Sbjct: 752
MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHE 573
Query: 393
MLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLE 452
LRL VAPL + H+A+ N SI G F+PK L +
+ F
Sbjct: 572
TLRLLSVAPLGLCHKALENGSICGKFVPKGASYSD*LMEYTS**ALLEKCNKFFS*ALAR 393
Query: 453
GGGG----SLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRG 508
G + F GG R CLGE +AK E+ +F + L+++++F + ++ L G
Sbjct: 392
*FGQF*L*FRLCIFVFSGGPRSCLGETLAKTELLVFISRLVKDYRFEKPNGKDSLDGRSG 213
Query: 509
V 509
V
Sbjct: 212
V 210
>gnl|ti|651162328
1095901096755 2 aa diffs to 1095901734433
752 MIAGSETSSNMILWFIVYILHRPEYQDKLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHE 573
572 TLRLLSVAPLGLCHKALENGSICGKFVPKGASYSD*LMEYTS**ALLEKCNKFFS*ALAR 393
392 *FGQF*L*FRLCIFVFSGGPRSCLGETLAKTELLVFISRLVKDYRFEKPNGKDSLDGRSG 213
212 V 210
>gnl|ti|654998190
1095901734433
Length =
1030
Score = 90.9 bits (224), Expect = 2e-17
Identities = 57/182 (31%), Positives =
93/182 (51%), Gaps = 13/182 (7%)
Frame = -2
Query: 253
SALDSFPLLRKFP----------NPPFSRLMKEVARRDELIGKHIEEFKKS-EHKEGGTL 301
S + S
P LR FP N + + + RD ++ + ++E K++ + G +
Sbjct: 837
SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658
Query: 302
TSSLLKC-LEPQQGAANHXXXXXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQD 360
T
+L+K LE +H
+I G+ET + ++ W + ++LHRPE QD
Sbjct: 657
TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 478
Query: 361
KVYEELCCVLD-VRYPQYSDRHKLPYLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHFI 419
K+Y+E+ V + YP +DR + +
A+ E LRL VAPL + H+A+ N SI G F+
Sbjct: 477
KLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVAPLGLCHKAMENGSICGKFV 298
Query: 420
PK 421
PK
Sbjct: 297
PK 292
Score = 73.6 bits (179), Expect = 4e-12
Identities = 41/107 (38%), Positives =
61/107 (57%), Gaps = 5/107 (4%)
Frame = -3
Query: 410
RNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFLEGGGGSLRSL----IPFG 465
R G+ P+ +I+ NL
HHD W + +F PER+L+ G
+L +PF
Sbjct: 326
RTVVFVGNLFPRELLILTNL*SIHHDECYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147
Query: 466
GGARLCLGEAVAKMEMFLFTAYLLREFKF-LPASKEEPLPELRGVAS 511
GG R
CLGE +AK E+F+F + L+++++F P KE LP L G +S
Sbjct: 146
GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKE--LPSLDGRSS 12
837 SYISSIPWLRYFPTATSRNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 658
657 TDALIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 478
477 KLYDEIFKVFSGIGYPSLNDRPRFHLIQAITHETLRLLSVAPLGLCHKAMENGSICGKFV 298
297 PKELLILTNL*SIHHDECYWKNAINFFPERWLDNSGNFNYNLGYAYLPFS 147
146 GGPRSCLGETLAKTELFVFISRLVKDYRFEKPNGKELPSLDGRSS 12
>gnl|ti|655006784
1095958075467
Length = 931
Score = 80.9 bits (198), Expect = 2e-14
Identities = 46/141 (32%), Positives =
73/141 (51%), Gaps = 6/141 (4%)
Frame = -2
Query: 392
EMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSFKPERFL 451
E
LRL + PL VPH+ + ++++ + + +I N + H+D W
+P P R++
Sbjct: 744
ECLRLSSIIPLGVPHKTMSDTTLIAYTLNIYGTVIINHWQIHNDTNHWKNPNESTPHRWI 565
Query: 452
EGGGG----SLRSLIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKF--LPASKEEPLPE 505
+ S
+PF G R+CLG+ VA+ E+F F L+R+FKF +P
PLP
Sbjct: 564
DDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFEGVPGC---PLPS 394
Query: 506
LRGVASVVLKVKPYTVIAHPR 526
L
G S+ L + + V PR
Sbjct: 393
LIGKCSITLAPEEFNVHVTPR 331
>gnl|ti|655006784
1095958075467 Same as 1095899213949
744 ECLRLSSIIPLGVPHKTMSDTTLIAYTLNIYGTVIINHWQIHNDTNHWKNPNESTPHRWI 565
564 DDDSKFDATRATSYLPFSAGTRVCLGKTVAETELFFFFTRLIRDFKFEGVPGCPLPS 394
393 LIGKCSITLAPEEFNVHVTPR 331
>gnl|ti|651148169
1095901003210
Length =
1130
Score = 73.9 bits (180), Expect = 3e-12
Identities = 50/170 (29%), Positives =
83/170 (48%), Gaps = 13/170 (7%)
Frame = -2
Query: 253
SALDSFPLLRKFP----------NPPFSRLMKEVARRDELIGKHIEEFKKS-EHKEGGTL 301
S + S
P LR FP N + + + RD ++ + ++E K++ + G +
Sbjct: 511
SYISSIPWLRYFPTATSQNMIILNKNKYNIFEIIRLRDPILIRKLQEHKRTYD*SNLGDV 332
Query: 302
TSSLLKC-LEPQQGAANHXXXXXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQD 360
T L+K LE
+H
+I G+ET + ++ W + ++LHRPE QD
Sbjct: 331
TDVLIKISLESVTETDSHEKITDDNTEFLLNDFMIAGSETSSNMILWFIVYILHRPEYQD 152
Query: 361
KVYEELCCVLD-VRYPQYSDRHKLPYLCALISEMLRLRPVAPLAVPHRAI 409
K+Y+E+ V + YP +DR + +
A+I E LRL VAPL H+ +
Sbjct: 151
KLYDEIFKVFSGIGYPSLNDRPRFHLIQAIIHETLRLLSVAPLG*SHKPV 2
>gnl|ti|648033522
1095897342515
Length =
1108
Score = 72.4 bits (176), Expect = 8e-12
Identities = 46/143 (32%), Positives =
80/143 (55%), Gaps = 2/143 (1%)
Frame = +3
Query:
69
GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128
GP LP++GN L L
+K YG ++ L+ G
+V+++ + IRE LV+K
Sbjct: 153
GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 326
Query: 129
WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186
+ FAGRP +Y +IVS G +
I GD +WK R++
HS+L+ +T L +++ K
Sbjct: 327
SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 503
Query: 187
QAQHLCQVLRDYSGKAVDLSEDF 209
+++ L
+ L ++ +L ++F
Sbjct: 504
ESEELHKRLFKNCNRSTELEDEF 572
>gnl|ti|648033522
1095897342515 39% to 17A1 N-term
153 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 326
327 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 503
504 ESEELHKRLFKNCNRSTELEDEF 572
>gnl|ti|647134594
1095899118747
Length =
1050
Score = 72.4 bits (176), Expect = 8e-12
Identities = 46/143 (32%), Positives =
80/143 (55%), Gaps = 2/143 (1%)
Frame = -1
Query:
69
GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128
GP LP++GN L L
+K YG ++ L+ G
+V+++ + IRE LV+K
Sbjct: 612
GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 439
Query: 129
WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186
+ FAGRP +Y +IVS G +
I GD +WK R++
HS+L+ +T L +++ K
Sbjct: 438
SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 262
Query: 187
QAQHLCQVLRDYSGKAVDLSEDF 209
+++ L
+ L ++ +L ++F
Sbjct: 261
ESEELHKRLFKNCNRSTELEDEF 193
612 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 439
438 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTKHLETLVVK 262
261 ESEELHKRLFKNCNRSTELEDEF 193
>gnl|ti|648485307
1095899272864
Length = 944
Score = 70.9 bits (172), Expect = 2e-11
Identities = 46/143 (32%), Positives =
80/143 (55%), Gaps = 2/143 (1%)
Frame = -2
Query:
69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK
128
GP +P+ GN+ L
+ I L A +K YG ++ ++
G +V++++ REALV+K
Sbjct: 799
GPFPIPIFGNLHLLGTEPHKI-LAAYSKKYGAVFSISLG-LQRIVIISDITTTREALVQK 626
Query: 129
WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRCTT--DSLHSVIEK 186
S FAGRP SY ++S
G + I+ D+ WK R+V+HS+L+ + ++ K
Sbjct: 625
ASIFAGRPKSYL-IQLISSGYKGIAFMDYGSFWKVLRKVSHSSLKIYGEGHERFEKILTK 449
Query: 187
QAQHLCQVLRDYSGKAVDLSEDF 209
+++ L + L S +V+L +F
Sbjct: 448
ESEELHKRLLKKSNNSVELKSEF 380
>gnl|ti|648485307
1095899272864 57% to 1095897342515
799 GPFPIPIFGNLHLLGTEPHKILAAYSKKYGAVFSISLGLQRIVIISDITTTREALVQK 626
625 ASIFAGRPKSYLIQLISSGYKGIAFMDYGSFWKVLRKVSHSSLKIYGEGHERFEKILTK 449
448 ESEELHKRLLKKSNNSVELKSEF 380
>gnl|ti|648026854
1095896933215
Length =
1081
Score = 70.1 bits (170), Expect = 4e-11
Identities = 46/143 (32%), Positives =
78/143 (54%), Gaps = 2/143 (1%)
Frame = +1
Query:
69 GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK
128
GP LP++GN L L
+K YG ++ L+ G
+V+++ + IRE LV+K
Sbjct: 175
GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 348
Query: 129
WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186
+ FAGRP +Y +IVS G +
I GD +WK R++
HS+L+ +T L +++ +
Sbjct: 349
SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 525
Query: 187
QAQHLCQVLRDYSGKAVDLSEDF 209
+++ L
+ L S ++ L
F
Sbjct: 526
ESEELHKNLYKKSNRSTKLEHKF 594
175 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 348
349 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 525
526 ESEELHKNLYKKSNRSTKLEHKF 594
>gnl|ti|651118815
1095900033599
Length =
1071
Score = 70.1 bits (170), Expect = 4e-11
Identities = 46/143 (32%), Positives =
78/143 (54%), Gaps = 2/143 (1%)
Frame = -1
Query:
69
GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128
GP LP++GN L L
+K YG ++ L+ G
+V+++ + IRE LV+K
Sbjct: 516
GPFPLPIIGN-LHLIGKKPHEKFVEYSKKYGEVFSLSFGM-HRVVIVSGKDSIREVLVQK 343
Query: 129
WSDFAGRPYSYTGXDIVSGGGRTISLGDFSEEWKAHRRVTHSALQRC--TTDSLHSVIEK 186
+ FAGRP +Y +IVS G +
I GD +WK R++
HS+L+ +T L +++ +
Sbjct: 342
SNIFAGRPKNYIA-NIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 166
Query: 187
QAQHLCQVLRDYSGKAVDLSEDF 209
+++ L
+ L S ++ L
F
Sbjct: 165
ESEELHKNLYKKSNRSTKLEHKF 97
516 GPFPLPIIGNLHLIGKKPHEKFVEYSKKYGEVFSLSFGMHRVVIVSGKDSIREVLVQK 343
342 SNIFAGRPKNYIANIVSRGYKNIGYGDIGPKWKILRKIAHSSLKNYGESTAHLETLVVR 166
165 ESEELHKNLYKKSNRSTKLEHKF 97
>gnl|ti|655005893
1095958068757
Length = 952
Score = 59.7 bits (143), Expect = 5e-08
Identities = 70/308 (22%), Positives =
122/308 (39%), Gaps = 8/308 (2%)
Frame = -2
Query: 150
RTISLGDFSEEWKAHRRVTHSALQRCTTDSLHSVIEKQAQHLCQVLRDYSG--KAVDLSE 207
+T L +WK RR+ + ++ + E+QA L L + +
VD+
Sbjct: 921
KTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQV 742
Query: 208
DFTVASSNVITTLTFS---KAYDKSSAELQKLQECLNEIVSLWGS-PWISALDSFPLLRK 263
+A+ ++I + A +E K LNE + + PW+ + LL
Sbjct: 741
PIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL-- 568
Query: 264
FPNPPFSRLMKEVARRDELIGKHIEEFKKSEHKEGGTLTSSLLK--CLEPQQGAANHXXX 321
P R K + +L I E
+ + +E T+S K L+
Sbjct: 567
---PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397
Query: 322
XXXXXXXXXXXXLIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRH 381
+ G +T +A L WT+ L P+VQ K+++E+
+
Y
Sbjct: 396
DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217
Query: 382
KLPYLCALISEMLRLRPVAPLAVPHRAIRNSSIAGHFIPKNTIIIPNLYGAHHDPEVWDD 441
+ YL ++ E LR+ P
P+ + +I G F+PK I+ + H +P+
W++
Sbjct: 216
QSKYLEIILKESLRMHPPVPM-YGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40
Query: 442
PYSFKPER 449
P F PER
Sbjct:
39 PNDFIPER 16
921 KTGLLTSTGSKWKTRRRLLTPSFHFSILNNFIKIFEEQASILVDKLAVAADNKEVVDVQV 742
741 PIGLATLDIICETSMGVKVNAQSHPDSEYVKAITVLNEEIQMRQKFPWLWFDAIYKLL 568
567 PCGKRFYKALDVAHKLSFDVINERMQMKIQESYCETASDEKKFFLDLLLDIYRKGKI 397
396 DTEGIQEEVDTFMFEGHDTTSAALGWTLWLLGKNPDVQKKLHKEIDEIELNGGSLYDKVR 217
216 QSKYLEIILKESLRMHPPVPMYGRTVEEDMTIDGQFVPKGAQIVLLVLILHSNPDYWEN 40
39 PNDFIPER 16
>gnl|ti|647175227
1095898288652
Length =
1081
Score = 45.1 bits (105), Expect(2) =
5e-07
Identities = 21/54 (38%), Positives =
33/54 (61%)
Frame = -1
Query: 461
LIPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGVASVVL 514
+ F
G R+CLG+ +A++E+FLF + L+R+FKF + LP L G + L
Sbjct: 961
IFTFSAGTRVCLGKGIAEVELFLFYSRLVRDFKF-EVKPGDSLPSLYGNCGITL 803
Score = 31.2 bits (69), Expect(2) =
5e-07
Identities = 10/26 (38%), Positives =
17/26 (65%)
Frame = -3
Query:
425 IIPNLYGAHHDPEVWDDPYSFKPERF 450
I+ NL+ HH+ W++P+ F P R+
Sbjct:
1079 ILTNLWQLHHNKNCWENPHEFNPYRW 1002
ILTNLWQLHHNKNCWENPHEFNPYRWXXXXXXXXXX
IFTFSAGTRVCLGKGIAEVELFLFYSRLVRDFKFEVKPGDSLPSLYGNCGITL
>gnl|ti|648589386
1095733042694
Length =
1032
Score = 55.8 bits (133), Expect = 8e-07
Identities = 48/189 (25%), Positives =
84/189 (44%), Gaps = 6/189 (3%)
Frame = +1
Query: 211
VASSNVITTLTFSKAYDKSSAELQKLQECLNEIVSLWGSPWISALDSFPLLRKFPNPPFS 270
VA NVI + F + Y S
++ +N IV+ G +A+D
P LR
Sbjct: 454
VAILNVICFIVFGERYQYSDPAFIEILTTINNIVA--GLSNTTAVDFLPGLRYLQFSEIK 627
Query: 271
RLMKEVARRDELIG----KHIEEFKKSEHKEGGTLTSSLLKCLEPQQGAANHXXXXXXXX 326
+L + L+ KH E F ++ ++ T S++K
+ +
Sbjct: 628
KLKSSLVIYFRLLNDQLKKHKETFDENNIRD---FTDSIIKFSKDETMENKFEEELTDEH 798
Query: 327
XXXXXXXL-IGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVL-DVRYPQYSDRHKLP 384
+
IGG+ET L W + +++H P+
Q++++EE+ V+ + RYP+ SDR L
Sbjct: 799
LEHVIGDMFIGGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPKLSDRDSLH 978
Query: 385
YLCALISEM 393
+ A I +
Sbjct: 979
LVKASIKRV 1005
454 VAILNVICFIVFGERYQYSDPAFIEILTTINNIVAGLSNTTAVDFLPGLRYLQFSEIK 627
628 KLKSSLVIYFRLLNDQLKKHKETFDENNIRDFTDSIIKFSKDETMENKFEEELTDEH 798
799 LEHVIGDMFIGGSETTLTSLLWLIIYMIHYPKYQEEIFEEITRVIGENRYPKLSDRDSLH 978
979 LVKASIKRV 1005
>gnl|ti|649393684
1095898809307
Length = 1093
Score = 54.3 bits (129), Expect = 2e-06
Identities = 25/65 (38%), Positives =
43/65 (66%)
Frame = +2
Query: 462
IPFGGGARLCLGEAVAKMEMFLFTAYLLREFKFLPASKEEPLPELRGVASVVLKVKPYTV 521
+PF G R CLGEA+AK+E+F+F +
L+++++F ++EE LP L+G + + + V
Sbjct:
50
LPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEE-LPNLKGESGITRIPSEFKV 226
Query: 522
IAHPR 526
+ PR
Sbjct: 227
MTIPR 241
>gnl|ti|649393684
1095898809307 45% to 17A1 C-term
LPFSSGPRSCLGEALAKIELFIFISRLVKDYRFEKPTEEELPNLKGESGITRIPSEFKVMTIPR
>gnl|ti|646849327
1095897329284
Length = 980
Score = 51.2 bits (121), Expect = 2e-05
Identities = 30/68 (44%), Positives =
39/68 (57%)
Frame = +2
Query:
69
GPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLNCGSTSAMVVLNNSEIIREALVKK 128
GP LP +GN L + L L K YG+++ +
GS VV+NN E I+E L+KK
Sbjct: 302
GPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIR-YVVVNNLEGIKEVLIKK 478
Query: 129
WSDFAGRP 136
S FAGRP
Sbjct: 479
GSQFAGRP 502
>gnl|ti|646849327
1095897329284 40% to 2X2 N-term
GPIPLPFIGNAHLLRKGEPYKELVNLGKIYGDVFGFSIGSIRYVVVNNLEGIKEVLIKKGSQFAGRP
>gnl|ti|646968536
1095898162561
Length =
1074
Score = 48.9 bits (115), Expect = 9e-05
Identities = 32/91 (35%), Positives =
44/91 (48%), Gaps = 2/91 (2%)
Frame = +1
Query:
48
FPKLLHSLYKLFFSTVSPTI--SGPRSLPLLGNMLDLAQDHLPIHLTALAKCYGNIYRLN 105
FP
L+ +Y + GP LP +GN L
+
L K YG+I+ +
Sbjct: 598
FPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFS 777
Query: 106
CGSTSAMVVLNNSEIIREALVKKWSDFAGRP 136
GS V++NN E I E
L+KK S F+GRP
Sbjct: 778
IGSIR-YVIVNNLEGIHEVLIKKGSQFSGRP 867
>gnl|ti|646968536
1095898162561 83% to 1095897329284
FPPLIWFVYSYIKHLIECLYYPKGPVPLPFIGNTNLLRKKETCKEFVNLGKIYGDIFGFSIGSIRYVIVNNLEGIHEVLIKKGSQFSGRP
>gnl|ti|647168675
1095899196297
Length = 998
Score = 48.1 bits (113), Expect = 2e-04
Identities = 18/48 (37%), Positives =
32/48 (66%)
Frame = -2
Same as 1095898098005
Query: 334
LIGGTETIAALLNWTVAFLLHRPEVQDKVYEELCCVLDVRYPQYSDRH 381
L+
GTET A + W V +L+H PE Q+++Y+E+ + RYP ++++
Sbjct: 175
LVAGTETTAITICWMVLYLIHNPEYQEEIYKEITSNIGCRYPTLAEKN 32
>gnl|ti|649448444
1095899351259
Length =
1086
Score = 32.7 bits (73), Expect(2) =
0.079
Identities = 15/32 (46%), Positives =
19/32 (59%)
Frame = -3
Query:
414
IAGHFIPKNTIIIPNLYGAHHDPEVWDDPYSF 445
I G FIPK + I + H +PE W DP+SF
Sbjct:
1039 IDGQFIPKKSEIAILVMMIHLNPEYWKDPHSF 944
Score = 25.4 bits (54), Expect(2) =
0.079
Identities = 10/31 (32%), Positives = 17/31
(54%)
Frame = -2
Query: 462
IPFGGGARLCLGEAVAKMEMFLFTAYLLREF 492