87 contigs assembled from Ciona intestinalis genomic DNA sequence. 24 additional Ciona savignyi sequences are included, since these are used against Ciona intestinalis sequences to help assemble the genes. 793 accession numbers sorted by PROTEIN sequence from the JGI Ciona blast server. 79 P450s are assembled from the I-helix to the end of the gene. One seq (seq 91) is a pseudogene that breaks in the middle of an exon, has an extra aa in the heme signature and no upstream P450 sequence. 66 completed genes are 53 intestinalis P450s seq 2, 4, 7, 14, 15, 17, 18, 21, 23, 25, 26, 27, 34, 36, 41, 42, 46, 49, 51, 53, 57, 64, 65, 66, 96, 103, 109, 110, 115, 118, 119, 129, 125, 134, 136, 147, 148, 155, 158, 162, 164, 166, 167, 169, 180, 184, 186, 192, 198, 204, 208, 209, 228 and 13 savignyi orthologs to seq 2 (2 genes), 26, 41 (2 genes) and 110 (2 genes), 115, 134, 166, 167, 192, 204 there are gene clusters Nov. 6, 2002 D. Nelson The email for the Ciona EST blast server is chikako@develop.zool.kyoto-u.ac.jp >sequence 1, 50 The N-term is represented twice and is different from seq 2 or 4 or 27 ILTLICLLLFYTWYRRPSRFPPGPRGFPIVGVLPFLEKYSERTMHKWSKKYGPVMSVRMGNEDWVVMGNYE GCiWno533_e09.b1 - DEV31420.x1 MIELVLPDLSIAPTILTLICLLLFYTWYRRPSRFPPGPRGFPIVGVL PFLEKYSERTMHKWSKKYGPVMSVRMGNEDWVVMGNYEA >GCiWno533_e09.b1 TTCCTGTTGGGTTAAAATCTCTCATAACAAGTCCTTTGCACTCAGTTATTTCATTCGCCAATTTCATGTACGGACGGCCC GAGAATGCACTGCCTGCCTTCACAAAGCTCTAAAAGTGAGATTCAATCATTACAAGACTATATGATGTAAGACTGTAGTG TTTTATAATACTTCTTTATTCTTTAGGTATGCTTGGAGTTTATGAAATTCTTATTGTTGTTGCGTGTTATAAAACGATTG CAGTTTACACATCTAGTGTGGTGGGAGACCCTCTTAGCACATAATATCCAAATATCTTAATTGTGTTTTAAACAATTAGA GTCTTCAGAATACGGTTTTATAATTCTTTGAAATCATTTGTTTACCACCAAGTGCGTCAAGAAAAAAAATAAAACGGTGT CCCATCTCCCCCACCCTACTACATTACCTGATAAACAGCTTCGTAGTTTCCCATGACAACCCAGTCCTCATTTCCCATTC TCACCGACATCACCGGACCATATTTCTTGCTCCATTTGTGCATCGTTCGCTCTGAGTATTTTTCAAGGAACGGCAAAACT CCGACGATTGGGAAACCGCGAGGTCCAGGTGGAAATCGAGAGGGTCTTCGATACCAAGTATAAAACAATAATAAGCAAAT GAGCGTTAGTATTGTCGGNGCAATACTCAAATCGGGTAACACGAGTTCTATCATTCTGACACGTTGAGCACAGCACTAAA ACTGCTGCTTTGTATAGCGAACCTGGAACGTTATGCCCGCTTTAGTTTTTGCTGCGCGAAAACGCTTTCTCTTGCGAAGG AAATTGCGCTGTGCAACCCTACGAGCTATAAATCATGTATGGTCTTAAAACGTTGTGAAGACAATCTG >GCiWno533_e09.b1_4 this = N-term of seq 1 RLSSQRFKTIHDL*LVGLHSAISFARESVFAQQKLKRA*RSRFAIQSSSFSAVLNVSE** NSCYPI*VLXRQY*RSFAYYCFILGIEDPLDFHLDLAVSQSSEFCRSLKNTQSERCTNGA RNMVR*CR*EWEMRTGLSWETTKLFIR*CSRVGEMGHRFIFFLDALGGKQMISKNYKTVF *RL*LFKTQLRYLDIMC*EGLPPH*MCKLQSFYNTQQQ*EFHKLQAYLKNKEVL*NTTVL HHIVL**LNLTFRAL*RQAVHSRAVRT*NWRMK*LSAKDLL*EILTQQE >GCiWno533_e09.b1_5 QIVFTTF*DHT*FIARRVAQRNFLRKRKRFRAAKTKAGITFQVRYTKQQF*CCAQRVR MI ELVLPDLSIAPTILTLICLLLFYTWYRRPSRFPPGPRGFPIVGVLPFLEKYSERTMHKWS KKYGPVMSVRMGNEDWVVMGNYEA VYQVM**GGGDGTPFYFFS*RTWW*TNDFKEL*NRI LKTLIV*NTIKIFGYYVLRGSPTTLDV*TAIVL*HATTIRIS*TPSIPKE*RSIIKHYSL TSYSLVMIESHF*SFVKAGSAFSGRPYMKLANEITECKGLVMRDFNPTGX >GCiWno533_e09.b1_6 DCLHNVLRPYMIYSS*GCTAQFPSQEKAFSRSKN*SGHNVPGSLYKAAVLVLCSTCQNDR TRVTRFEYCXDNTNAHLLIIVLYLVSKTLSISTWTSRFPNRRSFAVP*KILRANDAQMEQ EIWSGDVGENGK*GLGCHGKLRSCLSGNVVGWGRWDTVLFFFLTHLVVNK*FQRIIKPYS EDSNCLKHN*DIWILCAKRVSHHTRCVNCNRFITRNNNKNFINSKHT*RIKKYYKTLQSY II*SCND*ISLLELCEGRQCILGPSVHEIGE*NN*VQRTCYERF*PNRX >GCiWno533_e09.g1_1 this seq = seq 27 NEFSSVPKSCVHV*FVP*LSCYLLLTQKLWVQTITHKVTYMVTRKLARI**CDLRHQF*K ENEKKIFFYKAVNFLQYIQLLNYIRDLFDGGTETTVSTSRWAILCMLHYPETQKKLRNEI MTIIGKRLNFYITNTYLIFLLKGPTRQRVCRISQICLTPVLLYKKYSGSGL*PH*VCPIK *MKMQRLTDTQFQKA*R*EISYIHLIVRLK*KP*ADGLIPKKLNDIFSSTL*CVVFHVYF *LGVTKPVGGAQRSRCVGRNQAVKPERHLDDKENLXSRIT*YLSRWVHVIAWENNLLERK SHLPDXMXXKVGVSSGPX >GCiWno533_e09.g1_2 TNSARYPNRVFTYNLCRSSVVTCF*PRSYGYKQLPTKLHTW*LVSWHEFNNVIFAINFKK KMKKKFFFIKL*TFCSTSSC*TTYVICSMAVPRPL*APVVGQYYVCCITRKLRKNYATK* *LLLVSV*ISI*PTHI*YSC*RAQHASEYVA*VRYALHLCFYTRSTQVPDFSPTECAP*S K*RCNG*RIHNSKRRNGKRFHTFI*SSD*NKNPEQMV*FPKN*TTFLAVHCDVSCFMSIF N*VSPNLWAVHNDPDVWDETKQLNLSVTSMTRKTXSVESRDTFLGGSTSLLGRTTCSNGN LIFLIXWXQRLEFLPDP >GCiWno533_e09.g1_3 RIQLGTQIVCSRIICAVAQLLPASNPEVMGTNNYPQSYIHGNS*AGTNLIM*SSPSILKR K*KKNFFL*SCELFAVHPAAKLHT*SVRWRYRDHCKHQSLGNTMYAALPGNSEKTTQRNN DYYW*AFKFLYNQHIFNIPVKGPNTPASMSHKSDMPYTCAFIQEVLRFRTLAPLSVPHKV NEDATVNGYTIPKGVTVRDFIHSFNRPIEIKTLSRWFNSQKIERHF*QYTVMCRVSCLFL IRCHQTCGRCTTIQMCGTKPSS*T*ASPR*QGKLXQSNHVIPFSVGPRHCLGEQLARTEI SSS*XHGXKGWSFFRT >GCiWno7_b18.b1 CHROMAT_FILE: GCiWno7_b18.b1 PHD_FILE: GCiWno7_b18.b1.phd.1 CHEM: term DYE: big TIME: Sat Sep 1 16:00:22 2001 TEMPLATE: GCiWno7_b18 DIRECTION: fwd Length = 905 Score = 132 bits (329), Expect = 1e-30 Identities = 65/86 (75%), Positives = 66/86 (76%) Frame = -3 Query: 1 MIELVLPDLSIAPTILTLICLLLFYTWYXXXXXXXXXXXXXXIVGVLPFLEKYSERTMHK 60 MIELVLPD SIA ILTL CL LFYTWY IVGV PFLEKYSERTMHK Sbjct: 810 MIELVLPDXSIARQILTLXCLYLFYTWYRRPSRFPPGPRGFPIVGVCPFLEKYSERTMHK 631 Query: 61 WSKKYGPVMSVRMGNEDWVVMGNYEA 86 WSKKYGPVMSVRMGN+DWVVMGNYEA Sbjct: 630 WSKKYGPVMSVRMGNDDWVVMGNYEA 553 PSVGVLPFLEKYSERTMHKWSKKYGPVMSVRMGNDDWVVMGNYE 286 not seq 2 or 4 Two diffs with seq above probably same gene QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG = seq 2 (1) QDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP = seq 4 NLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVVAFSVGPRHCLGEQLARMEYFIYLVSMVQKFEFF = seq 4 Gap KQIARVV = seq 4 DEV13827.y1 DEV13827.y1 DEV17948.x1 I-HELIX DEV19056.y1 DEV25829.x1 I-HELIX OPPOSITE END OF SEQ 2 DEV26097.x1 I-HELIX DEV31420.x1 N-TERM DEV36161.x1 DEV39712.x1 DEV46240.x1 DEV5183.x01 DEV9515.x1 LGK545.x1 FIRST AND LAST PART = SEQ 1 BUT DIFFERS IN MIDDLE LQW146181.x1 I-HELIX OPPOSITE END OF SEQ 1 LQW146181.y1 LQW183999.x1 I-HELIX OPPOSITE END OF SEQ 1 LQW183999.y1 LQW188756.y1 LQW198468.y1 LQW227602.y1 LQW258179.y1 LQW259311.x1 LQW259311.x2 LQW259311.y1 I-HELIX OPPOSITE END OF SEQ 1 LQW80689.x1 LQW80689.x1 LQW95486.y1 LQW198468.x1 use to compare joint LQW198468.x1.phd.1 LQW198468.y1 13:47:14 2001 TEMPLATE: LQW198468 DIRECTION: fwd Length = 1003 Score = 196 bits (493), Expect = 6e-50 Identities = 98/121 (80%), Positives = 102/121 (83%), Gaps = 14/121 (11%) Frame = +3 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG------------- 47 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG Sbjct: 186 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGENL*TIIDVM*LK 365 Query: 48 -KKIQARHRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTT 106 KK + RVPAMNDKAQMPYTCAFMQEVFRYRTLVPLS++HMTN+DVVLNGY IPKGTT Sbjct: 366 LKKNSGQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTT 545 Query: 107 V 107 V Sbjct: 546 V 548 LQW80689.x1 CHEM: term DYE: ET TIME: Fri May 4 11:53:29 2001 TEMPLATE: LQW80689 DIRECTION: fwd Length = 488 Score = 113 bits (281), Expect(2) = 4e-49 Identities = 52/56 (92%), Positives = 55/56 (97%) Frame = +2 Query: 48 KKIQARHRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPK 103 KKIQARHRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLS++HMTN+DVVLNGY IPK Sbjct: 320 KKIQARHRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPK 487 Score = 101 bits (249), Expect(2) = 4e-49 Identities = 47/50 (94%), Positives = 49/50 (98%) Frame = +1 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKI 50 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG+ + Sbjct: 136 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGENL 285 >LQW80689.x1 >lcl|LQW80689.x1 CHROMAT_FILE: LQW80689.x1 PHD_FILE: LQW80689.x1.phd.1 CHEM: term DYE: ET TIME: Fri May 4 11:53:29 2001 TEMPLATE: LQW80689 DIRECTION: fwd TGGTACTTTAAGAACACATTTAAGCATATATCCATATACATTTTTCATTTTTCGTTGGTATGCTCCTTAGAAAACTTCCG ATACTTTGNCGAACTTCTTTTAANACTTACAATACGTCCTTCTATTTAGGACCTACAACTGTTGCAATATGTTCGGGACT TGTTTGTTGCTGGAACCGAAACGACAACCAGCACACTAAGGTGGTCAATACTTTGCATGATTCATAATCCGGAAAAGCAA GAAAAATTAAGAAAAGAAATATGTGATGTCATTGGTGAGAATTTATAAACAATTATTGATGTTATGTAGTTAAAACTTAA AAAAAATTCAGGCCAGGCATAGGGTTCCAGCGATGAATGACAAAGCTCAGATGCCTTACACTTGCGCGTTCATGCAGGAA GTTTTCAGATACCGGACTCTGGTTCCCTTAAGCGTAGTGCATATGACTAATCAAGACGTTGTACTGAACGGTTATACAAT ACCCAAAG >LQW80689.x1_1 WYFKNTFKHISIYIFHFSLVCSLENFRYFXELLLXLTIRPSI*DLQLLQYVRDLFVAGTE TTTSTLRWSILCMIHNPEKQEKLRKEICDVIGENL*TIIDVM*LKLKKNSGQA*GSSDE* frameshift here QSSDALHLRVHAGSFQIPDSGSLKRSAYD*SRRCTERLYNTQX >LQW80689.x1_2 GTLRTHLSIYPYTFFIFRWYAP*KTSDTLXNFF*XLQYVLLFRTYNCCNMFGTCLLLEPK RQPAH*GGQYFA*FIIRKSKKN*EKKYVMSLVRIYKQLLMLCS*NLKKIQARHRVPAMND KAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKX >LQW80689.x1_3 VL*EHI*AYIHIHFSFFVGMLLRKLPILXRTSFXTYNTSFYLGPTTVAICSGLVCCWNRN DNQHTKVVNTLHDS*SGKARKIKKRNM*CHW*EFINNY*CYVVKT*KKFRPGIGFQR*MT KLRCLTLARSCRKFSDTGLWFP*A*CI*LIKTLY*TVIQYPK >rciad045g08 seq a Length = 682 Score = 201 bits (505), Expect = 4e-51 Identities = 97/109 (88%), Positives = 102/109 (92%) Frame = -3 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG+ RVPAMN Sbjct: 662 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGQD-----RVPAMN 498 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKAQMPYTCAFMQEVFRYRTLVPLS++HMTN+DVVLNGY IPKGTT+SP Sbjct: 497 DKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTTISP 351 >rciad045g08 ACCCCCGATGAATATTTTTATGGTTGAACCGCATCGTTAAATTAAACTATCTTTGCAATTTGCTTGAACCGCAAAGGAAC AAATACAACACCACTCGACCCGTCTTCAACATCTGGAAGATCTGGTTCATTCGGATCCGGAAAAAACTCAAACTTCTGAA CCATTGAAACTAAGTAGATGAAATATTCCATTCGAGCAAGTTGTTCTCCCAAGCAATGACGTGGACCAATCGAGAAAGGT ATCACGTGTTTAGACTGAACAAAGTTTCCTTTGTCATCGAGGTGACGCTCAGGTTTAAACTTGCTTGGTTCGTCCCACAC ATCTGGATTGTTGTGCACCGCCCACAGGTTTGGTGATATTGTTGTTCCTTTGGGTATTGTATAACCGTTCAGTACAACGT CTTGATTAGTCATATGCACTACGCTTAAGGGAACCAGAGTCCGGTATCTGAAAACTTCCTGCATGAACGCGCAAGTGTAA GGCATCTGAGCTTTGTCATTCATCGCTGGAACCCTATCCTGGCCAATGACATCACATATTTCTTTTCTTAATTTTTCTTG CTTTTCCGGATTATGAATCATGCAAAGTATTGACCACCTTAGTGTGCTGGTTGTCGTTTCGGTTCCAGCAACAAACAAGT CCCGAACATATTGCAACAGTTGTAGGTCTGTGTATGACGAGT >rciad045g08_6 identical to seq 2 SSYTDLQLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGQDRVPAM NDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTTISPNLWAVHNNPD VWDEPSKFKPERHLDDKGNFVQSKHVIPFSIGPRHCLGEQLARMEYFIYLVSMVQKFEFF PDPNEPDLPDVEDGSSGVVFVPLRFKQIAKIV* >cign065e15 seq a Length = 579 Score = 201 bits (505), Expect = 4e-51 Identities = 97/109 (88%), Positives = 102/109 (92%) Frame = +3 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG+ RVPAMN Sbjct: 192 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGQD-----RVPAMN 356 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKAQMPYTCAFMQEVFRYRTLVPLS++HMTN+DVVLNGY IPKGTT+SP Sbjct: 357 DKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTTISP 503 >cign065e15 TTCNGCACGAGGCACCGTTTAGCCGTATCAACAATCAACTAATGACGGATGTGAGAGTAATTTTGCAAATGTTAAGAGAA ATACTGTCCGAGCACAAGTCGACATTTAACAAAGATGACGTCCGAGATTTTATCGATGCTTTCATCGCTGAGCAAAATTC AGAAAGCAAACACTCGTCATACACAGACCTACAACTGTTGCAATATGTTCGGGACTTGTTTGTTGCTGGAACCGAAACGA CAACCAGCACACTAAGGTGGTCAATACTTTGCATGATTCATAATCCGGAAAAGCAAGAAAAATTAAGAAAAGAAATATGT GATGTCATTGGCCAGGATAGGGTTCCAGCGATGAATGACAAAGCTCAGATGCCTTACACTTGCGCGTTCATGCAGGAAGT TTTCAGATACCGGACTCTGGTTCCCTTAAGCGTAGTGCATATGACTAATCAAGACGTTGTACTGAACGGTTATACAATAC CCAAAGGAACAACAATATCACCAAACCTGTGGGCGGTGCACAACAATCCAGATGTGTGGGACGAACCAAGCAAGTTTAAA CCTGAGCGTCACCTCGATG XHEAPFSRINNQLMTDVRVILQMLREILSEHKSTFNKDDVRDFIDAFIAEQNSESKHSSY TDLQLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGQDRVPAMNDK AQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTTISPNLWAVHNNPDVWD EPSKFKPERHLD Combined = seq 2 HEAPFSRINNQLMTDVRVILQMLREILSEHKSTFNKDDVRDFIDAFIAEQNSESKH SSYTDLQLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGQDRVPAM NDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTTISPNLWAVHNNPD VWDEPSKFKPERHLDDKGNFVQSKHVIPFSIGPRHCLGEQLARMEYFIYLVSMVQKFEFF PDPNEPDLPDVEDGSSGVVFVPLRFKQIAKIV* >rcieg032b06 seq a Length = 764 Score = 201 bits (505), Expect = 4e-51 Identities = 97/109 (88%), Positives = 102/109 (92%) Frame = -2 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG+ RVPAMN Sbjct: 706 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGQD-----RVPAMN 542 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKAQMPYTCAFMQEVFRYRTLVPLS++HMTN+DVVLNGY IPKGTT+SP Sbjct: 541 DKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTTISP 395 >ciht001p11 seq a Length = 629 Score = 196 bits (493), Expect = 9e-50 Identities = 95/109 (87%), Positives = 100/109 (91%) Frame = +1 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIG+ RVPAMN Sbjct: 292 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGQD-----RVPAMN 456 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKA MPYTCAFMQEVF YRTLVPLS++HMTN+DVVLNGY IPKGTT+SP Sbjct: 457 DKAXMPYTCAFMQEVFXYRTLVPLSVVHMTNQDVVLNGYTIPKGTTISP 603 >cign008n15 seq b Length = 622 Score = 195 bits (490), Expect = 2e-49 Identities = 96/109 (88%), Positives = 98/109 (89%) Frame = +2 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLL YV DLF AGTETTTSTL WSILCMIHNPEKQEKLRKEIC V+G+ RVPAMN Sbjct: 125 QLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVVGQD-----RVPAMN 289 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP Sbjct: 290 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 436 >cign008n15 TCGGCACGAGGGAATTGTTTCTGAACACAACTCGACATTTAACAAAGATGACGCCCGGGATTTTATCGATGCTTTTATTG CTGAGAAAAACTCTCAAAACAAACACTCGTCATTTACTGATTCACAGCTGCTGCACTATGTGGTTGACTTATTCGAGGCT GGAACCGAAACGACAACCAGCACACTAATGTGGTCAATACTTTGCATGATTCATAATCCGGAAAAGCAAGAAAAACTAAG AAAAGAAATCTGTAGTGTTGTAGGCCAGGATAGGGTTCCAGCGATGAATGACAAAGCTCAGATGCCTTACACTTGCGCGT TCATGCAGGAAGTTTTCAGATACCGGACTCTGGTTCCCTTAAGCTTAATGCATATGACCAATGAAGATGTCGTACTTAAC GGTTACAATATTCCGAAGGGAACAACGGTGTCACCTAATCTGTGGGCGGTGCACAACGATCCAGATGTGTGGGACGAACC AAGCAAGTTTAAACCTGAGCGTCACCTCGATGACAAAGGAAACTTTGTTCAGTCTAAACACGTAGTCGCTTTCTCGGTGG GTCCACGTCATTGCTTGGGAGAACAACTTGCTCGAATGGAATATTTCATCTACTTAGTTTCA >cign008n15_2 identical to seq 4 GIVSEHNSTFNKDDARDFIDAFIAEKNSQNKHSSFTDSQLLHYVVDLFEAGTETTTS TLMWSILCMIHNPEKQEKLRKEICSVVGQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPL SLMHMTNEDVVLNGYNIPKGTTVSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKH VVAFSVGPRHCLGEQLARMEYFIYLVS >cign022n24 seq b Length = 675 Score = 195 bits (490), Expect = 2e-49 Identities = 96/109 (88%), Positives = 98/109 (89%) Frame = +2 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLL YV DLF AGTETTTSTL WSILCMIHNPEKQEKLRKEIC V+G+ RVPAMN Sbjct: 77 QLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVVGQD-----RVPAMN 241 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP Sbjct: 242 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 388 >cign022n24 TCGGCACGAGGATTTTATCGATGCTTTTATTGCTGAGAAAAACTCTCAAAACAAACACTCGTCATTTACTGATTCACAGC TGCTGCACTATGTGGTTGACTTATTCGAGGCTGGAACCGAAACGACAACCAGCACACTAATGTGGTCAATACTTTGCATG ATTCATAATCCGGAAAAGCAAGAAAAACTAAGAAAAGAAATCTGTAGTGTTGTAGGCCAGGATAGGGTTCCAGCGATGAA TGACAAAGCTCAGATGCCTTACACTTGCGCGTTCATGCAGGAAGTTTTCAGATACCGGACTCTGGTTCCCTTAAGCTTAA TGCATATGACCAATGAAGATGTCGTACTTAACGGTTACAATATTCCGAAGGGAACAACGGTGTCACCTAATCTGTGGGCG GTGCACAACGATCCAGATGTGTGGGACGAACCAAGCAAGTTTAAACCTGAGCGTCACCTCGATGACAAAGGAAACTTTGT TCAGTCTAAACACGTAGTCGCTTTCTCGGTGGGTCCACGTCATTGCTTGGGAGAACAACTTGCTCGAATGGAATATTTCA TCTACTTAGTTTCAATGGTTCAGAAGTTTGAGTTTCTACCGGATCCGAATGAACCGAACCTTCCAGATATTGAAAAAGGG TCGAATGGAGCTGCATTCGTTCCTCTGCCCTTTAA >cign022n24_2 DFIDAFIAEKNSQNKHSSFTDSQLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQ EKLRKEICSVVGQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYN IPKGTTVSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVVAFSVGPRHCLGEQL ARMEYFIYLVSMVQKFEFLPDPNEPNLPDIEKGSNGAAFVPLPF Combined identical to seq 4 GIVSEHNSTFNKDDARDFIDAFIAEKNSQNKHSSFTDSQLLHYVVDLFEAGTETTTS TLMWSILCMIHNPEKQEKLRKEICSVVGQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPL SLMHMTNEDVVLNGYNIPKGTTVSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKH VVAFSVGPRHCLGEQLARMEYFIYLVSMVQKFEFLPDPNEPNLPDIEKGSNGAAFVPLPF >ciad086a13 seq b Length = 549 Score = 195 bits (490), Expect = 2e-49 Identities = 96/109 (88%), Positives = 98/109 (89%) Frame = +2 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLL YV DLF AGTETTTSTL WSILCMIHNPEKQEKLRKEIC V+G+ RVPAMN Sbjct: 11 QLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVVGQD-----RVPAMN 175 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP Sbjct: 176 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 322 >cinc015m09 seq b Length = 658 Score = 195 bits (490), Expect = 2e-49 Identities = 96/109 (88%), Positives = 98/109 (89%) Frame = +2 Query: 1 QLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVIGKKIQARHRVPAMN 60 QLL YV DLF AGTETTTSTL WSILCMIHNPEKQEKLRKEIC V+G+ RVPAMN Sbjct: 275 QLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVVGQD-----RVPAMN 439 Query: 61 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 109 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP Sbjct: 440 DKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTTVSP 586 >best matches to CYP11A1 >scf/ciona01/G126/seq_dir/hrs/G126P69032R.T0/G126P69032RA7.T0.seq 722 (07) 722 ABI Length = 722 Plus Strand HSPs: Score = 221 (77.8 bits), Expect = 3.3e-17, P = 3.3e-17 Identities = 51/188 (27%), Positives = 103/188 (54%), Frame = +1 Query: 320 VTEMLAGGVDTTSMTLQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKA 379 +T++ G DT + T+ W L +A+N ++Q+ + AE++ +++ M++ +P +A Sbjct: 112 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVS-MMESMPYTRA 288 Query: 380 SIKETLRLHPI-SVTLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPT 438 + E+ R+ P+ ++L +D+ ++D +IP TLV ++A+ +P ++ +PE+F P Sbjct: 289 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 468 Query: 439 RWLSKDKNITYFRNL-GFGWGVRQCLGRRIAELEMTIFLINMLENFRV---EIQHLSDVG 494 R ++++ T + F G R CLG ++A IFL N++ F+ E Q D+ Sbjct: 469 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFKFRFHENQPEPDLT 648 Query: 495 TTFNLILMP 503 +L+P Sbjct: 649 GVMTSVLLP 675 44% to mouse 2j6 112 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVSMMESMPYTRA 288 289 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 468 469 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFKFRFHENQPEPDLT 648 649 GVMTSVLLP 675 >scf/ciona01/G126/seq_dir/hrs/G126P605845F.T0/G126P605845FG6.T0.seq 701 (27) 0 701 ABI Length = 701 Plus Strand HSPs: Score = 217 (76.4 bits), Expect = 8.9e-17, P = 8.9e-17 Identities = 46/166 (27%), Positives = 95/166 (57%), Frame = +2 Query: 320 VTEMLAGGVDTTSMTLQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKA 379 +T++ G DT + T+ W L +A+N ++Q+ + AE++ +++ M++ +P +A Sbjct: 149 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVS-MMESMPYTRA 325 Query: 380 SIKETLRLHPI-SVTLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPT 438 + E+ R+ P+ ++L +D+ ++D +IP TLV ++A+ +P ++ +PE+F P Sbjct: 326 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 505 Query: 439 RWLSKDKNITYFRNL-GFGWGVRQCLGRRIAELEMTIFLINMLENFR 484 R ++++ T + F G R CLG ++A IFL N++ F+ Sbjct: 506 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFK 646 149 ITDLFMAGTDTMATTVHWSLVFLAQNPEIQNKMAAEIIKVTGNDVINVSMMESMPYTRA 325 326 VMYESTRMRPVFPLSLGHQATSDVTVKDNVIPKGTLVVANLWAIQNDPKWWKNPESFRPE 505 506 RHITENGGFTKNEKIVPFSIGPRFCLGSQMATYHQFIFLANLVRTFK 646 >scf/ciona01/G126/seq_dir/hrs/G126P609638F.T0/G126P609638FH3.T0.seq 726 (23) 0 726 ABI Length = 726 Plus Strand HSPs: Score = 195 (68.6 bits), Expect = 9.0e-18, Sum P(2) = 9.0e-18 Identities = 56/199 (28%), Positives = 98/199 (49%), Frame = +1 Query: 311 MSFEDIKANVTEMLAGGVDTTSMTLQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATM 370 +S DI+ V + G DTT+ + W +Y + R+ +Q + E+ GDM T+ Sbjct: 109 LSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEEL---DDVFGGDMGTI 279 Query: 371 ----LQLVPLLKASIKETLRLHPISVTLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREP 426 LQ + LL+ +IKE+LR+ P + R + + + IPA T V + I +L P Sbjct: 280 TNSHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVAIFIDSLHHNP 459 Query: 427 TFFFDPENFDPTRWLSKDKNITY-FRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRV 485 + + D + FDP R+L ++ + + + F G R C+G++ A +E + L +L + + Sbjct: 460 SVWPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVLLTQILRKYSI 639 Query: 486 EIQ-HLSDVGTTFNLILMPEKP 506 D+ +LIL P Sbjct: 640 HSHDEEEDLRKQADLILRSSTP 705 50% to 4V2 human 52% to 4V5 fugu 75% to seq 115 NFLPPQDTRRMAFLDVLLRAESEDGRS 109 LSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDDVFGGDMGTI 279 280 TNSHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVAIFIDSLHHNP 459 460 SVWPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVLLTQILRKYSI 639 640 HSHDEEEDLRKQADLILRSSTP 705 Score = 54 (19.0 bits), Expect = 9.0e-18, Sum P(2) = 9.0e-18 Identities = 15/43 (34%), Positives = 20/43 (46%), Frame = +1 Query: 166 NFLPLLDAVSRDFVSVLHRRIKKAGSGNYSGDISDDLFRFAFE 208 NFLP D F+ VL R + G DI +++ F FE Sbjct: 28 NFLPPQDTRRMAFLDVLLRAESEDGRSLSLSDIQEEVDTFMFE 156 >best matches to 27A1 >scf/ciona01/G126/seq_dir/hrs/G126P601882R.T0/G126P601882RC8.T0.seq 734 0 734 ABI Length = 734 Minus Strand HSPs: Score = 247 (86.9 bits), Expect = 5.1e-20, P = 5.1e-20 Identities = 57/210 (27%), Positives = 105/210 (50%), Frame = -3 Query: 320 LSPREAMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPA--GQVPQH 377 LS + + + G DTT+ +TW +Y + + P IQ +HEE+ V G + + Sbjct: 696 LSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDDVFGGDMGTIT-N 520 Query: 378 KDFAHMPLLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTA 437 + LL+ +KE+LR++P VP R+ +E V P TQ + +P+ Sbjct: 519 SHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVAIFIDSLHHNPSV 340 Query: 438 FSEPESFQPHRWLRNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKY 497 + + + F P R+L P +HP+ +PF G R C+G++ A +E ++LL ++++KY Sbjct: 339 WPDVDRFDPDRFL----PENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVLLTQILRKY 172 Query: 498 KVVLAPETGELKSVARIVLVPNKKVGLQFLQR 529 + E +L+ A ++L + + + R Sbjct: 171 SIHSHDEEEDLRKQADLILRSSTPLNISLTPR 76 75% to seq 115 49% to 4V2 human NFLPPQDTRRMAFLDVLLRAESEDGRS 696 LSLSDIQEEVDTFMFEGHDTTAAAMTWTIYLIGRHPAIQARIHEELDDVFGGDMGTITN 520 519 SHLQKLSLLERTIKESLRMFPSVPFIGRVTTEECSVGSHSIPAGTQVAIFIDSLHHNPSV 340 339 WPDVDRFDPDRFLPENCVGRHPYSFIPFSAGPRNCIGQKFALMEEKVLLTQILRKY 172 171 SIHSHDEEEDLRKQADLILRSSTPLNISLTPR 76 >sequence 38, 65 2 accessions 50% to 2J2 VRDLFMAGTDTTSSTTCWIILFLCRYPEVQRKMQEEVDQVLGSNGVPKMALAEKMPYTRAVIQ EIARMRPTLPLSVPHCTTQDTMMMGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERH 149 LDSNGNFVKSSNIMQFNIGLRSCLGQQLAKMELFLLTTSLCRHFSFSVVGEVDMEGESMVTLRPCSMEVVATKRA* DEV34315.x1 LQW188368.y1 LQW143417.x1 LQW149539.y1 LQW220722.y1 >SEQUENCE 34, 52, 71, 217 74% TO SEQ 29, 60% to seq 65, 39% TO 2C9 MLDILLSCIRAWFSTITLSIIVFYVTWHYWNKRTPGSPPGPRGFPFIGAITSIRKHPEHVMTKWNKEYGPVCMVRLGFKDVLVIGS YEAAHEAYVKSQDFLDRPSPFGLEILGGGYGLFPIAYGSFHQEQRRFGLNTLREHGMGRRVLESTILQYAEELCDRLETKMSAPVL LDQEVYIAVSSTIAHIVFGHNTMQDSPEFRDMILWMLRPNTATVLAGILVFAPYLKHLPFFRGVHNDSRALRLKLESLNLKEVKKH DKTRDPSDPRDFIDSFLNEMDK (1) boundary not consistent with related sequences HFKGKIDPNWETYSSFSDQQLVSM (0) VRDLFLAGTDTTSATTCWIILFLCKYPDVQRRMQKEIDDVIGENGIPKLALAERLPFTRAVIQ (0) EMGRIRPNVPLAVPHCASRDSTLMGFNIPKDTIIMTNIWGIHHDEKTWKDPYKFNPDRHLDAEGNFVKSNHVMQFNIGLRSCLGQQ LARMELFLITVTLFRKFFIELEPGCDIDMEGESLVSLRPYPFKVLLTKRI* DEV10735.x1 exon 1 N-term to I-helix DEV17022.x1 exon 1 N-term to I-helix LQW166080.x1 exon 1 N-term to I-helix LQW267527.x1 exon 1 N-term to I-helix DEV41989.y1 exon 1 N-term to I-helix DEV39698.x1 exon 1 N-term to I-helix LQW44017.y1 exon 1 N-term to I-helix LQW64492.x1 exon 1 N-term to I-helix LQW33972.x1 exon 1 N-term to I-helix LQW201553.y1 exon 2 I-helix LQW222497.x1 exon 2 I-helix LQW142648.x1 exon 2 I-helix LWG3369.x1 exon 2 I-helix DEV54722.x1 exon 2 I-helix LQW151633.y1 exon 2 I-helix LQW73069.x2 exon 2 I-helix DEV39698.y1 exon 3 EXXR to end LQW68269.y1 exon 3 EXXR to end DEV41989.x1 exon 3 EXXR to end DEV41989.x2 exon 3 EXXR to end LQW73069.x2 exon 3 EXXR to end LQW151633.y1 exon 3 EXXR to end DEV49881.y1 exon 3 EXXR to end >student assembly of seq 34 ortholog needs checking missing C-term (2 different genes) 68% to seq 65 MMEEVSIVQTIRGTFDTILLLAIVFYLTWHIWNRRSPLSPPGPRGIPLLGAITQLGKHPEHA MMKWNQQYGPICMVRLGHKDV LVLGSYEAAHEALVKNTHIADRPTHNIEVFYGGKGILMINSGSFHQEQRRFGLNTLREY GMGRRALEPTILIYVNDLCDRIEKYGSDPFVIDGEVYLTISSTISHIVFGHDVIHDNPKFKEIIFKMIEPN KLNVLAGILAFAPFLKHFPFFSTVHKSSKEFRLKLQSLIGEEVQEHKKTRDPTQSRDFIDSFLQAMEK (1?) VQDGKLDPNWEPYSSFTIDQLTAM (0?) VRDLFMAATDTTSSTICWIILFLCKNTDVQIKMQEEIDEVLGQNGILKYELATKMHYVRAVIQ (0) LFKEIARLRPSVPLSLPHASNRDTSIMGYRIPKDTIVLTNIWGIHHDEKIWKNPYEFNPE RHLDANGNFVKSKKVMQFNIGLRSCLGQQLANMELFLVTVSLFHRFSFSVVNPKIDMGGE SMVSLRPYPYSVVATTRG* G126P617768R.T0/G126P617768RB11.T0.seq 688 (37) G126P65096R.T0/G126P65096RG8.T0.seq 738 (02) G126P617150F.T0/G126P617150FA9.T0.seq 677 (38) >POSSIBLE ORTH OF SEQ 38 and probable end of seq 34 orth. Minus Strand HSPs: 699 LFKEIARLRPSVPLSLPHASNRDTSIMGYRIPKDTIVLTNIWGIHHDEKIWKNPYEFNPE 520 519 RHLDANGNFVKSKKVMQFNIGLRSCLGQQLANMELFLVTVSLFHRFSFSVVNPKIDMGGE 340 339 SMVSLRPYPYSVVATTRG* 283 G126P64775F.T0/G126P64775FF5.T0.seq 724 (01) G126P602025R.T0/G126P602025RH2.T0.seq 720 (12) >scf/ciona01/G126/seq_dir/hrs/G126P610830R.T0/G126P610830RG2.T0.seq 745 >scf/ciona01/G126/seq_dir/hrs/G126P610830R.T0/G126P610830RG2.T0.seq_4 745 0 745 ABI SSPWFTGCKLVKITHNTLXIAPVIRQ*V*GDLATVRV*TPLVFV*TPPTCRM*TLISYTL FHFQYLCCLNKFLYIKEMAGKNTIHIRVTKITI*SCGI*PEFYFTILRCSRN*LLQYVGC L*TRFCLIRKLLG*DQAKEMPIYITLYTPI*PKY*V*QSDQVGSTLFFALFRYALN*LIN FQYGDV*KRFFSI*GNCSLRTKRPSFSSTCFKQRYFDHGL*NSERHHRSYQHLGDSPGGK NEFHHREL >scf/ciona01/G126/seq_dir/hrs/G126P610830R.T0/G126P610830RG2.T0.seq_5 745 0 745 ABI FFXMVYRL*AG*NNT*HTXNSSGNPTVSVRRSGDCPCVNPPGFCINTPDLSYVNTNFLYP IPFPIFVLFKQILVY*GNGW*KHYTHQGDQNHNLILWYLT*VLFYNTQV*PKLITPICRL FINAVLFNKEIARLRPS*GNAYIHNTIHPNITKILSLTV*PSGFHPVFCII*ICFKLIN* LPVWGCLKTLFFYLRKLLVEDQASLFLFHMLQTEILRSWVIEFRKTPSFLPTFGGFTRRK K*IPPQGAX >scf/ciona01/G126/seq_dir/hrs/G126P610830R.T0/G126P610830RG2.T0.seq_6 745 0 745 ABI LXHGLPAVSWLK*HITHXQ*LR*SDSECEAIWRLSVCKPPWFLYKHPRPVVCKH*FPIPY SISNICAV*TNSCILRKWLVKTLYTSG*PKSQSNLVVFNLSFILQYSGVAEINYSNM*AV YKRGFV**GNCSVETKLRKCLYT*HYTPQYNQNIEFNSLTKWVPPCFLHYLDML*IN*LT SSMGMFKNAFFLFKEIAR*GPSVPLSLPHASNRDTSIMGYRIPKDTIVLTNIWGIHQEEK MNSTTGSS >scf/ciona01/G126/seq_dir/hrs/G126P602025R.T0/G126P602025RH2.T0.seq_4 720 0 720 ABI KY*V*QFDQVGSTLFLHYLDML*IN*LTFRYG*CLKTPFFSI*GNCSFETKRPPFSSTCF KQRYFDHGL*GSERYHRSYQHLGDSP*RKNMEESLRIQS*KASRREWQLR*IQESYAVQY RTAELPWTATCKYGVILGHSFVVS*IFLFTCEP*D*HGRGKYGVITPISIRRRGHNTRVK NVDRVFNPIEKYTDLYIKRY*QRAHHGLEFKPQNTRVTNLKETWSSG*EFPPHWSFSVQD >scf/ciona01/G126/seq_dir/hrs/G126P602025R.T0/G126P602025RH2.T0.seq_5 720 0 720 ABI ILSLTV*PSGFHPVFALFRYALN*LINFQVWVMFKNALFFYLRKLLV*DQASPFLFHMLQ TEILRSWVIRFRKIPSFLPTFGGFTMTKKYGRILTNSILKGISTRMATSLNPRKLCSSIS DCGVALDSNLQIWSYSWSQFRCFIDFPFHL*TLRLTWEGKVWCHYAHIHTASWPQHEGKE RRQSV*PDRKIYRFIYKTILATSPSWVGI*APKYQGDKP*RNLVQRLGISTTLEFQRPRX >scf/ciona01/G126/seq_dir/hrs/G126P602025R.T0/G126P602025RH2.T0.seq_6 720 0 720 ABI NIEFNSLTKWVPPCFCII*ICFKLIN*LSGMGDV*KRPFFLFKEIARLRPSVPLSLPHAS NRDTSIMGYKVPKDTIVLTNIWGIHHDEKIWKNPYEFNPERHLDANGNFVKSKKVMQFNI GLRSCLGQQLANMELFLVTVSLFHRFSFSLVNPKIDMGGESMVSLRPYPYGVVATTRG*R TSTECLTRSKNIPIYI*NDISNEPIMGWNLSPKIPG*QTLKKLGPAVRNFHHIGVSASKX VMFKXAFFLFKEIARLRPSVPLSLPHASNRDTSIMGYRIPKDTIVLTNIWGIHHDEKIWK NPYEFNPERHLDANGNFVKSKKVMQFNIGLRSCLGQQLANMELFLVTVSLFHRFSFSVVN PKIDMGGESMVSLRPYPYSVVATTRG*RMSKECLTRSKHIPIYI*NDISNEPVMGSNLSP KIPGWQTFKETLVQRLARPI*KE*YKNYLNDKTRRFRNKTRKKTG*KRIXPQERRIHXRK S >scf/ciona01/G126/seq_dir/hrs/G126P64775F.T0/G126P64775FF5.T0.seq_5 724 0 724 ABI GDV*XRFFSI*GNCSFETKRPSFSSTCFKQRYFDHGL*NSERHHRSYQHLGDSP*RKNME ESLRIQPRKASRREWQLR*IQESYAV*YRTA*LPWTATCKYGAILGHSFVVS*IFLFSCE P*D*HGRGKYGVITPISIQRRGHNTRVKNVERVFNPIETYTDLYIKRY*QRARHGFEFKP QNTRVANL*RNFGPAVSATDLKRII*KLP**QDQAVSKQNS*KNRVKANXTTGAEDSXXQ KX >scf/ciona01/G126/seq_dir/hrs/G126P64775F.T0/G126P64775FF5.T0.seq_6 724 0 724 ABI *CLXTLFFYLRKLLV*DQASLFLFHMLQTEILRSWVIEFRKTPSFLPTFGGFTMTKKYGR ILTNSTPKGISTRMATSLNPRKLCSLISDCVVALDSNLQIWSYSWSQFRCFIDFPFQL*T LRLTWEGKVWCHYAHIHTASWPQHEGKECRKSV*PDRNIYRFIYKTILATSPSWVRI*AP KYQGGKPLKKLWSSG*RDRFKKNNIKTTLMTRPGGFETKLVKKQGKSESHHRSGGFIXAK V >scf/ciona01/G126/seq_dir/hrs/G126P617150F.T0/G126P617150FA9.T0.seq_4 677 0 677 ABI YYVIYHLLDHSFPLQEHGRPD*NARRNRRIVGTKRNPKIRARYQDALREGCDTGSCG*DV TSAWAQYAINRLPTRNRNTEPNT*TIKRMNDCVAEWLIRPIHLFRFQI*KRGHRCWHRFI SPLFLTVMRACKVSKHHIVFFPHGLPCCKLC*NNT*HTAIAPVIRQ*V*GDLATVRV*TP LVFV*TPPTVRM*TLISYTLFHFQYL*SLNKFLYIKGIPPQGEIQ >scf/ciona01/G126/seq_dir/hrs/G126P617150F.T0/G126P617150FA9.T0.seq_5 677 0 677 ABI ILRHLPFVGSFFSFARTRTSRLKCKKK*TNCWDKTES*NTSSLPRCIT*GL*YRFVWVGC DKCVGTIRY*QTSNSKPKHGTKYINNKTHERLCSGMVDSTDSPFPIPNIKTRP*VLASIH ITSVPYRYASV*SKQTSHSFLSPWFTVL*AVLK*HITHCNSSGNPTVSVRRSGDCPCVNP PGFCINTPDCPYVNTNFLYPIPFPIFVKFKQILVY*GNSTTGRDSX >scf/ciona01/G126/seq_dir/hrs/G126P617150F.T0/G126P617150FA9.T0.seq_6 677 0 677 ABI DTTSSTICWIILFLCKNTDVQIKMQEEIDELLGQNGILKYELATKMHYVRAVIQVRVGRM *QVRGHNTLLTDFQLETETRNQIHKQ*NA*TIV*RNG*FDRFTFSDSKYKNEAIGVGIDS YHLCSLPLCERVK*ANIT*FSFPMVYRAVSCVKITHNTLQ*LR*SDSECEAIWRLSVCKP PWFLYKHPRLSVCKH*FPIPYSISNICEV*TNSCILREFHHRARFX >scf/ciona01/G126/seq_dir/hrs/G126P617768R.T0/G126P617768RB11.T0.seq_4 688 0 688 ABI RTGRFERPAFAC**MNLYV*RFVIFSWRQLILRHLPFVGSFFSFARTRTSRLKCKKK*MK CWGRTES*NTSSPPRCITLGL*YRFVWIGCDKTSWA*YAINRLPTRNRNTEPKT*TIKRM NDCVAKWLIRPIHFPIPNIKTRPWVLASIHITSVLHRYAIV*SKQTSHSFFPPWFTVL*A VLK*HITHCYSSGNPTVGVRRSGDCPCVNPPGFCIYTPDCQCESTTWSS >scf/ciona01/G126/seq_dir/hrs/G126P617768R.T0/G126P617768RB11.T0.seq_5 688 0 688 ABI PYGAI*ATCFCVLINESLCLKVRDLFMAATDTTSSTICWIILFLCKNTDVQIKMQEEIDE VLGQNGILKYELATKMHYVRAVIQVRVDRM*QNVVGIIRY*QTSNSKPKHGTKDINNKTY ERLCSEMVDTTDSFSDSKYKNEAMGVGIDSYHLRSSPLCDRVK*ANIT*FFSXMVYSAVS CVKITHNTLL*LR*SDSGCEAIWRLSVCKPPWFLYIHPRLSV*IHHMELX >scf/ciona01/G126/seq_dir/hrs/G126P617768R.T0/G126P617768RB11.T0.seq_6 688 0 688 ABI VRGDLSDLLLRVNK*ISMFKGS*SFHGGN*YYVIYHLLDHSFPLQEHGRPD*NARRNR*S VGAERNPKIRARHQDALR*GCDTGSCG*DVTKRRGHNTLLTDFQLETETRNQRHKQ*NV* TIV*RNG*YDRFIFRFQI*KRGHGCWHRFISPPFFTVMRSCKVSKHHIVFFXHGLQCCKL C*NNT*HTAIAPVIRQWV*GDLATVRV*TPLVFVYTPPTVSVNPPHGAX >scf/ciona01/G126/seq_dir/hrs/G126P64855R.T0/G126P64855RH11.T0.seq 689 (01) >scf/ciona01/G126/seq_dir/hrs/G126P64855R.T0/G126P64855RH11.T0.seq_4 689 0 689 ABI GGPGLXGRRK*MKCGAERES*K*RLATKRHYVRAVIQVRGDGSDKTWGA*YAINRLPTRN RNTGPKT*TIKRRTMCSERVDTTDSFSDSKYKNEARVLGIDSYHLRSSPLADRVK*ANIT *FFSPWFTVL*AVLK*HITHCYSSGNPTVGVGRSGDCPCVNPPGFCIYTPDGRV*TLISY TLFHFQYLAGLNKFLYIKEMAE*KHYTHQGTKITI*SWGI*PECNSTTG >scf/ciona01/G126/seq_dir/hrs/G126P64855R.T0/G126P64855RH11.T0.seq_5 689 0 689 ABI GRSRIXRQKEIDEVWGRTGILKIEARHQEALR*GCDTGSWGWE*QNVGGIIRY*QTSNSK PKHGTKDINNKT*NDV*RKG*YDRFIFRFQI*KRGQGVGHRFISPPFFTVSGSCKVSKHH IVFFP MVYSAVSCVKITHNTLL*LR*SDSGCGAIWRLSVCKPPWFLYIHPR RSCVNINFL YTIPFPIFGWFKQILVY*GNGGIKTLYTSGDQNHNLILGYLTGM*FHHRX >scf/ciona01/G126/seq_dir/hrs/G126P64855R.T0/G126P64855RH11.T0.seq_6 689 0 689 ABI REVPDXKAEGNR*SVGQNGNPKNRGSPPRGITLGL*YRFVGMGVTKRGGHNTLLTDFQLE TETRDQRHKQ*NVERCVAKGLIRPIHFPIPNIKTRPGCWASIHITSVLHR*RIV*SKQTS HSFFPHGLQCCKLC*NNT*HTAIAPVIRQWVWGDLATVRV*TPLVFVYTP PTVVCKH*FP IHYSISNIWLV*TNSCILRKWRNKNTIHIRGPKSQSNLGVFNRNVIPPQG LXHGLPAVSWLK*HITHXQ*LR* SDSECEAIWRLSVCKPPWFLYKHPRPVVCKH*FPI PYSISNICAV*TNSCILRKWLVKTLYTSG*PKSQSNLVVFNLSFILQYSGVAEINYSNM*AV >scf/ciona01/G126/seq_dir/hrs/G126P65096R.T0/G126P65096RG8.T0.seq 738 0 738 ABI TTTGACGCTGAAACTCCATGGTGGNAATTCGATACGTATCATTCTAGTGT TGCCAAGAGGTCCAATCTAACGTATGATGAGCGCCAATAGGCAATTAATC GGATAGCCTTAATACTTGCTCGTTTAGCCGTGTGTACTTTGTGTTTGGTG GTGATATTACTAAATGTTTGAGCAAGCGTTACAAAGTTTCTTCTTTTACT GGACTTAGTTTCAAATCCATGATGGGCTTGTTGCTTACACCATTTTCTAT ATTTAAGTCGGTATATGTATTGGTGGCGTTATCCTCCTGCCACACGACCG TACGGGCGATTTGAGCGACCTGCTTTTGCGTGTTAATAAATGAATCTCTA TGTTTAAAGGTTCGTGATCTTTTCATGGCGGCAACTGATACTACGTCATC TACCATTTGTTGGATCATTCTTTTCCTTTGCAAGAACACGGACGTCCAGA TTAAAATGCAAGAAGAAATAGATGAAGTGTTGGGGCAGAACGGAATCCTA AAATACGAGCTCGCCACCAAGATGCATTACGTTAGGGCTGTGATACAGGT TCGTGTGGATAGGATGTGACAAAACGTCGTGGGCATAATACGCTATTAAC AGACTTTCAACTCGAAACCGAAACACGGAACCAAAGACATAACCAATAAA ACGTTTGAACCATTGTGTTTCGAAATGGTTGCAACCACCGATCCATTTCC CGACTCCAAAAATAAAAACAAGCCATGTGTGAAGGCAT >scf/ciona01/G126/seq_dir/hrs/G126P65096R.T0/G126P65096RG8.T0.seq_1 738 0 738 ABI FDAETPWWXFDTYHSSVAKRSNLTYDERQ*AINRIALILARLAVCTLCLVVILLNV*ASV TKFLLLLDLVSNP*WACCLHHFLYLSRYMYWWRYPPATRPYGRFERPAFAC**MNLYV*R FVIFSWRQLILRHLPFVGSFFSFARTRTSRLKCKKK*MKCWGRTES*NTSSPPRCITLGL *YRFVWIGCDKTSWA*YAINRLSTRNRNTEPKT*PIKRLNHCVSKWLQPPIHFPTPKIKT SHV*RH >scf/ciona01/G126/seq_dir/hrs/G126P65096R.T0/G126P65096RG8.T0.seq_2 738 0 738 ABI LTLKLHGGNSIRIILVLPRGPI*RMMSANRQLIG*P*YLLV*PCVLCVWW*YY*MFEQAL QSFFFYWT*FQIHDGLVAYTIFYI*VGICIGGVILLPHDRTGDLSDLLLRVNK*ISMFKG S*SFHGGN*YYVIYHLLDHSFPLQEHGRPD*NARRNR*SVGAERNPKIRARHQDALR*GC DTGSCG*DVTKRRGHNTLLTDFQLETETRNQRHNQ*NV*TIVFRNGCNHRSISRLQK*KQ AMCEG >scf/ciona01/G126/seq_dir/hrs/G126P65096R.T0/G126P65096RG8.T0.seq_3 738 0 738 ABI *R*NSMVXIRYVSF*CCQEVQSNV**APIGN*SDSLNTCSFSRVYFVFGGDITKCLSKRY KVSSFTGLSFKSMMGLLLTPFSIFKSVYVLVALSSCHTTVRAI*ATCFCVLINESLCLKV RDLFMAATDTTSSTICWIILFLCKNTDVQIKMQEEIDEVLGQNGILKYELATKMHYVRAV IQVRVDRM*QNVVGIIRY*QTFNSKPKHGTKDITNKTFEPLCFEMVATTDPFPDSKNKNK PCVKA STLTGFVAIAPFLRHLPVFGMLYRKTMKFKQDIHDVIEKEIKEHEDTRDPKEPRDYVDSFLQAMEK (0) >scf/ciona01/G126/seq_dir/hrs/G126P608952R.T0/G126P608952RB12.T0.seq 763 0 763 ABI Length = 763 Plus Strand HSPs: Score = 157 (55.3 bits), Expect = 3.4e-11, P = 3.4e-11 Identities = 32/52 (61%), Positives = 39/52 (75%), Frame = +2 Query: 1 VRDLFMAATDTTSSTICWIILFLCKNTDVQIXMQEEIDELLGENGIPKLALA 52 VRDLFMAATDTTSSTICWIILFLCK ++ +++ + G+NGI K LA Sbjct: 602 VRDLFMAATDTTSSTICWIILFLCKTRTSRLKCKKKXMKWWGQNGILKYELA 757 G126P64401F.T0/G126P64401FH9.T0.seq 724 >scf/ciona01/G126/seq_dir/hrs/G126P618763R.T0/G126P618763RF3.T0.seq 724 0 724 ABI Length = 724 Plus Strand HSPs: Score = 406 (142.9 bits), Expect = 4.2e-41, Sum P(2) = 4.2e-41 Identities = 82/83 (98%), Positives = 83/83 (100%), Frame = +3 Query: 1 YRTYVILIILQDQNINGIKFRYRLIMHTFILYFYASVTKYKL*KFCVNLIQIG*QILVYT 60 YRTYVILIILQDQNINGIKFRYRLIMHTFILYFYASVTKYKL*KFCVNLIQIG*QILVYT Sbjct: 117 YRTYVILIILQDQNINGIKFRYRLIMHTFILYFYASVTKYKL*KFCVNLIQIG*QILVYT 296 Query: 61 LF*QTSVLGISGYITGPLTQLRV 83 LF*QTSVLGISGYITGPLTQLR+ Sbjct: 297 LF*QTSVLGISGYITGPLTQLRL 365 Score = 49 (17.2 bits), Expect = 4.2e-41, Sum P(2) = 4.2e-41 Identities = 9/15 (60%), Positives = 11/15 (73%), Frame = +2 Query: 76 GPLTQLRVVFILCHI 90 G ++VVFILCHI Sbjct: 341 GTTNTVKVVFILCHI 385 >scf/ciona01/G126/seq_dir/hrs/G126P618763R.T0/G126P618763RF3.T0.seq_1 724 0 724 ABI LRAELHVVEFVIQPNHVTLSTSFFKQWKRQSEVKILCKLIEHMLS*LYYKTKI*TV*SLG IDL*CTHLFSIFMHQ*QNINCKNSA*TLYK*ANKFWSTHCFNKLQF*ELVVTLRDH*HS* GCLYPLPYLTGMVLIN*ITGFNYFTGGNKSKLNQDGFADVQDGKLDPNWEPYSSFTIDQL TAMVIYLNGSGCG*TTNRNLVK*ISCGSLNCIVI*FLLETVMPDKQQL*YKLENGNTKIP VX >scf/ciona01/G126/seq_dir/hrs/G126P618763R.T0/G126P618763RF3.T0.seq_2 724 0 724 ABI SELSSMWWNS*SNPIT*LYRHLSSSNGKDRAK*KFYVNL*NICYLNYTTRPKYKRYKV*V *TYNAHIYSLFLCISNKI*TVKILRKPYTNRLTNFGLHIVLTNFSFRN*WLHYGTTNTVK VVFILCHI*LAWS*SIKSLGLIILQVVISQNSTKTDSLTYKMGNWIQTGNRTPHLP*IS* PQW*FI*TGLAVVKRPTVI*LNELVVVV*IV*LFNFY*KR*CQINNNCDTN*KTAIQKFQ L >scf/ciona01/G126/seq_dir/hrs/G126P618763R.T0/G126P618763RF3.T0.seq_3 724 0 724 ABI QS*APCGGIRDPTQSRDFIDIFLQAMEKTERSKNSM*TYRTYVILIILQDQNINGIKFRY RLIMHTFILYFYASVTKYKL*KFCVNLIQIG*QILVYTLF*QTSVLGISGYITGPLTQLR LSLSFAIFNWHGLNQLNHWV*LFYRW**VKTQPRRIR*RTRWEIGSKLGTVLLIYHRSAN RNGNLFKRVWLWLNDQP*FS*MN*LW*FKLYSYLIFIRNGNAR*TTIVIQIRKRQYKNSS >scf/ciona01/G126/seq_dir/hrs/G126P617020F.T0/G126P617020FA11.T0.seq_4 699 0 699 ABI LIYHRSANRNGNLFKRVWLWLNDQP*X*LNELVVVV*IV*LFNFY*KR*CQINNNCDTN* KTAIQKFQLW*LTWEVFKRTV*VYGIPLLLNNWVEWSRL*HLVSFIINIDSEYFAEITS* R*LLLVSFVGICTDITSFHQVIQFGN*EEANTTSVIRIILVLPRRPNLTYDKRQ*AINRI ALLLARLAVCTLCLVVILPNV*VSVTKFFLLLLGLVSNP*WACCGIPPQGENR >scf/ciona01/G126/seq_dir/hrs/G126P617020F.T0/G126P617020FA11.T0.seq_5 699 0 699 ABI HLP*IS*PQW*FI*TGLAVVKRPTVXLVK*ISCGSLNCIVI*FLLETVMPDKQQL*YKLE NGNTKIPVMVVNMGSFQTDSVGVWNTAFVKQLGGMVPFITPSLVYYKHR*RILCGNYKLK IITVGIVCWYLYRYHIISPSYSIWKLRRSQYYFRDTYHSSVAKEAQSNV**APIGN*PDS LITCSFSRVYFVFGGDITKCLSKRYKVFSSFTGLSFKSMMGLLRNSTTGREQX >scf/ciona01/G126/seq_dir/hrs/G126P617020F.T0/G126P617020FA11.T0.seq_6 699 0 699 ABI SFTIDQLTAMVIYLNGSGCG*TTNRNXS*MN*LW*FKLYSYLIFIRNGNAR*TTIVIQIR KRQYKNSSYGS*HGKFSNGQCRCMEYRFC*TIGWNGPVYNT*SRLL*T*IANTLRKLQVK DNYCWYRLLVFVPISHHFTKLFNLETEKKPILLP*YVSF*CCQGGPI*RMISANRQLTG* PYYLLV*PCVLCVWW*YYQMFE*ALQSFFFFYWA*FQIHDGLVAEFHHRARTX >scf/ciona01/G126/seq_dir/hrs/G126P612515F.T0/G126P612515FG9.T0.seq_1 715 (29) DSRPVVEFFQTDSVGVWNTAFVNNWVEWSRL*HLVSFIINIDSEYFAEITS*R*LLLVSF VGICTDITSFHQVIQFGN*EEANTTSVIRIILVLPRRPNLTYDKRQ*AINRIALLLARLA VCTLCLVVILPNV*VSVTKFFLLLLGLVSNP*WACC*NYFLYLSRYMYWWR*PFCHTTHD RFEWTALRVNK*TSMFKGS*SFHGGN*YYVIYHLLDHSFPLQEHGRPDXNARRNRRIVX >scf/ciona01/G126/seq_dir/hrs/G126P612515F.T0/G126P612515FG9.T0.seq_2 715 0 715 ABI ILALWWNSFKRTV*VYGIPLLLTIGWNGPVYNT*SRLL*T*IANTLRKLQVKDNYCWYRL LVFVPISHHFTKLFNLETEKKPILLP*YVSF*CCQGGPI*RMISANRQLTG*PYYLLV*P CVLCVWW*YYQMFE*ALQSFFFFYWA*FQIHDGLVAKTIFYI*VGICIGGVNRSATRPTT DLSGLLCVLINEPLCLKVRDLFMAATDTTSSTICWIILFLCKNTDVQIXMQEEIDELL >scf/ciona01/G126/seq_dir/hrs/G126P612515F.T0/G126P612515FG9.T0.seq_3 715 0 715 ABI FSPCGGILSNGQCRCMEYRFC*QLGGMVPFITPSLVYYKHR*RILCGNYKLKIITVGIVC WYLYRYHIISPSYSIWKLRRSQYYFRDTYHSSVAKEAQSNV**APIGN*PDSLITCSFSR VYFVFGGDITKCLSKRYKVFSSFTGLSFKSMMGLLLKLFSIFKSVYVLVALTVLPHDPRP I*VDCFAC**MNLYV*RFVIFSWRQLILRHLPFVGSFFSFARTRTSRLXCKKK*TNC >SEQUENCE 65, 78 81% TO SEQ 29 35% TO 2A6 may be in a cluster with 29 MEISVQSISSIFDTILLVLIVSYLTWHFWNQRSAWA PPGPRGLPFIGPITSIRIHPEHAMMKWNQQYGPVCMVRFGFKDILLLGSYEAAHEAL exon 1 VKNMDLADRPSNGIAVFKGGKGILMTKFGSYHQEQRRFSLNKLREYGMGRRALEPTI exon 1 LLYSNELCERIEKFGSKPFYIDMEIYKAISSTICHIVFGHNVIEENEDFKEIINTLN exon 1 KKSKLNVLSGILAFAPFLRFLPVFSTIHAKSVNFQQTLHALVRQEISEHEKTRDPKE exon 1 PRDYIDSFLNEMDK (0) exon 1 JOINT PROBLEM gc not gt DFRGQVDPNWKQYSSFTYEQLVAM (0) EST SUPPORT exon 2 from PNW on CRDLFMAGTDTTSSTTSWIILFLCRYPEVQRKMQEEADQVLGSNGEPKMALAEKMPYTRAVIQ (0) exon 3 EIARMRPTLPLSVPHCTTQDTMVMGYKIPKDTIVLTNIWGIHHDEKLWKNPYDFNPERHLDSNGNFV exon 4 KSSNIIQFNIGLRSCLGQQLAKMELFLLTTSLCRHFSFSVVGEVDMEGESMVTLRPCSMEVVATKRA* exon 4 DEV2228.x1 exon 1 supported by EST sequence AL669345.1 exon 1 EST sequence LQW149539.y1 exon 4 heme to end DEV34315.x1 LQW229649.x1 DEV46839.x1 LQW229649.x1 exon 1 LQW149539.x1 exon 1 LQW224871.y1 exon 1 to mid DEV46839.x1 exon 1 LQW18242.y1 exon 2 LQW277754.x1 exon 2 DEV46839.x1 exon 2 LQW224871.y1 exon 2 1 diff >sequence 29 9 accessions 33% to 1A1 in a cluster with 65 MEEYGIINSIKVAVDTVILVGLAIYLTWHFWNKRTPGGAPGPRGLPFIGAITSIRKHPEHAMMKWNQQYG PVCMVRLGFKDILLLGSYEAAHEALVKSPDFANRPPAHSIEVIAGGKGIVMIPFGPFHQEQRRFGLNALREH GMGRRALEPTVHLCAQELCERIEEYGSKPFNLDNEVY QSVSSIISRLVFGHDVVQENLEFRKMIFQMIEPNKLNILAGIL AFAPYLKHLPFFSEIHRKNKEFRLKMESSIRKEIEEHMETRDRKEPRDYIDSFLNEMEK (?) DEPNTKNEEWKQYSSFTQNQLIAM (0) LQW220722.x1 exon 1 LQW220722.y1 C-term of seq 65 gene cluster LQW162545.y1 exon 1 LQW202945.x1 exon 1 DEV48427.x1 exon 1 LQW72018.y1 exon 1 DEV21292.y1 exons 1, 2 GCiWno655_k19.b1 GCiWno749_o22.b1 GCiWno178_l07.b1 GCiWno714_n01.g1 exon 1 GCiWno709_e10.b1 exon 1 GCiWno143_b11.g1 exon 1 GCiWno201_d17.g1 exon 1 GCiWno809_g22.g1 exon 1 >DEV21292.y1_5 RSPPTSNTSRSSAKFTEKTKNLD*KWSRRFVKK*KNIWRHVTERNHVTTSTVS*TKWKNQ IKYQRQKV*TRTQKFQSQ*YVSTCPLSIVFITV*LI*YHHVCVTKNIAIRKNTYLKSKSC QPKKHYVLSIFADFPTDISQLAISAMTPTDEDEPNTKNEEWKQYSSFTQNQLIAMVYFTY IDVVVCLYV**NERTYSLFGVDIKX >GCiWno655_k19.b1_1 LHRQFLKRNGKIKSSTKGRRYKRVHKNFSHNNMSVLAR*V*FLSLFN*YNTIMFL*QRTS RLEKISI*SLKVANRKNITYCQYLPISRRIFHNSLF*Q*RLPTRTNRIQKTKNGNNIHLL PRTNLLLWYTLLI*MLWYVCTCDEMKERIAYLVLT*SHPV*GMAKKCHQTTQNTALYLYN NRLYMFVSASRLI*LYVFLYRVDYYHCR*TISKGGGGKKGXTXNT*YXE*X*XVFL*IKX RVL*KVGRDXGFIN*FGMVFWFKXXMGKKNT*FEEKGGVIFXXKXIYKX >GCiWno655_k19.b1_2 YIDSFLNEMEKSNQVPKAEGINAYTKISVTTICQYLPVKYSFYHFLINIIPSCFCDKEHR D*KKYLFKV*KLPTEKTLRIVNICRFPDGYFTTRYFSNDAYRRGRTEYKKRRMETIFIFY PGPTYCYGILYLYRCCGMFVRVMK*KNV*LIWC*HKATRYKEWLKNATKQHKTPPCIYII TVYICLFQHRV*FNYTYFYTEWIITIVDRLLVKGGGERRGXX*IHNXXNRXDXCFYKLXK GFCERWGGIXVL*IDLEWFFGLKXKWERRILNLRKKVVLFFXKXLYIN >GCiWno655_k19.b1_3 TSTVS*TKWKNQIKYQRQKV*TRTQKFQSQQYVSTCPLSIVFITF*LI*YHHVFVTKNIA IRKNIYLKSKSCQPKKHYVLSIFADFPTDISQLAILAMTPTDEDEPNTKNEEWKQYSSFT QDQLIAMVYFTYIDVVVCLYV**NERTYSLFGVDIKPPGIRNG*KMPPNNTKHRPVSI** PSIYVCFSIAFDLTIRIFIQSGLLPLSIDY**RGGGKEGXXXKYIIXRIGXIXVFIN*XK GFVKGGEGXGFYKLIWNGFLV*KXNGKEEYLI*GKRWCYFXXKXYI* >GCiWno495_k23.b1 CHROMAT_FILE: GCiWno495_k23.b1 PHD_FILE: GCiWno495_k23.b1.phd.1 CHEM: term DYE: big TIME: Tue Oct 16 10:59:14 2001 TEMPLATE: GCiWno495_k23 DIRECTION: fwd Length = 869 Score = 121 bits (300), Expect = 2e-27 Identities = 62/65 (95%), Positives = 62/65 (95%), Gaps = 3/65 (4%) Frame = +2 Query: 1 NERTYSLFGVDIKPPGIRNG-KMPPNNTKHRPVSI--PSIYVCFSIAFDLTIRIFIQSGL 57 NERTYSLFGVDIKPPGIRNG KMPPNNTKHRPVSI PSIYVCFSIAFDLTIRIFIQSGL Sbjct: 113 NERTYSLFGVDIKPPGIRNG*KMPPNNTKHRPVSI**PSIYVCFSIAFDLTIRIFIQSGL 292 Query: 58 LPLSI 62 LPLSI Sbjct: 293 LPLSI 307 >GCiWno495_k23.b1_1 RIQKRKMETIFIFYPGPTYCYGILYLYRCCGMFVRVMK*KNV*LIWC*HKATRYKEWLKN ATKQHKTPPCIYIITVYICLFQHRV*FNYTYFYTEWIITIVDSTI**XVGXDGTP*IHNM RID*S*HKQLTTGCERTRGVGSITH*MSMCYYLM*IYSYSTDG*IIIFHTTIAVYMLPV* RLRGYCYNQRILNGLPNVLSFHKSVNTRKK**LSKKNT*KLLDSHEVQNTAVVITTHTSR PCLDIKSLKHGYTLMDATLSPGC*KTTLKTRLTMPLKKMGRLLGSIEYPP >GCiWno495_k23.b1_2 EYKNEKWKQYSSFTQDQLIAMVYFTYIDVVVCLYV**NERTYSLFGVDIKPPGIRNG*KM PPNNTKHRPVSI**PSIYVCFSIAFDLTIRIFIQSGLLPLSIVLYSXGWGXMGHLRYIIC E*TDRDINS*QRAVKELEESVL*PIECPCATI*CEYIHILLMGESLYSTLLLLSICYRSR GSEAIVTTNGFLMVCPMYYPSISLSIHARNNDYPKRIRENS*IPTRCRTPLLL*QPILPG HAWI*KVSNMDIL*WMQHYHLVVERLHLKPVLPCL*KKWGDYWDLLSIPX >GCiWno495_k23.b1_3 NTKTKNGNNIHLLPRTNLLLWYTLLI*MLWYVCTCDEMKERIAYLVLT*SHPV*GMAKKC HQTTQNTALYLYNNRLYMFVSASRLI*LYVFLYRVDYYHCR*YYIVXGGEXWDTLDT*YA NRLIVT*TVNNGL*KNSRSRFYNPLNVHVLLSDVNIFIFY*WVNHYIPHYYCCLYVTGLE AQRLLLQPTDS*WSAQCIILP*VCQYTQEIMTIQKEYVKTLRFPRGAEHRCCYNNPYFPA MLGYKKSQTWIYSDGCNIITWLLKDYT*NPSYHAFEKNGAIIGIY*VSP >GCiWno47_e18.g1_1 XVSLAHW*RIGXSVPMVKIKSSTKGRRYKRVHKNFSHNNMSVLAR*V*FLSLFN*YNTIM FV*QRTSRLEKIPI*SLKVANRKNITYCQYLPISRRIFHNSLFQQ*HLPTRTNGIQKTKN GNNIHLLPRTNLLLWYTLLI*MLWYVCTCDEMKERI*LIWC*HKATRYKEWLKNATKQHK TPPCIYNNRLYMFVSASRLI*LYVFLYRVDYYHCR*YYIVRXXHSPPLYHHLHLSNAPHL PSIISPFLTSPHPLSKPSHTPIHIPAHLLLPPAHPHHRTYSTTLPLPPPPTVTHPSHLVP LTTPLRALPPLPPTPVLSPQ >GCiWno47_e18.g1_2 L*V*HIGNALXXRYPW*KSNQVPKAEGINAYTKISVTIICQYLPVKYSFYHCLINIIPSC LCDKEHRD*KKYLFKV*KLPTEKTLRIVNICRFPDGYFTTRYFSSDTYRRGRTEYKKRRM ETIFIFYPGPTYCYGILYLYRCCGMFVRVMK*KNVYSLFGVDIKPPGIRNG*KMPPNNTK HRPVSIITVYICLFQHRV*FNYTYFYTEWIITIVDSTI*YAPXTPXPSITTYTFQTPPTF PLSSPPFLLPHIHSLSPPTHPSTFPPISYSPLHTRITALTQQHFLSRPPPPSPTHLTSYL LLHHSAHSPLSRLLPSSHL >GCiWno47_e18.g1_3 CKFSTLVTHWXLGTHGKNQIKYQRQKV*TRTQKFQSQ*YVSTCPLSIVFITV*LI*YHHV CVTKNIAIRKNTYLKSKSCQPKKHYVLSIFADFPTDISQLAISAVTPTDEDERNTKNEEW KQYSSFTQDQLIAMVYFTYIDVVVCLYV**NERTYIAYLVLT*SHPV*GMAKKCHQTTQN TALYL**PSIYVCFSIAFDLTIRIFIQSGLLPLSIVLYSTXPPLPXPLSPLTPFKRPPPS LYHLPLSYFPTSTL*ALPHTHPHSRPSPTPPCTPASPHLLNNTSSPAPPHRHPPISPRTS YYTTPRTPPSPAYSRPLTS >sequence 2,3,9,44, 45 COMPLETE 81 accessions 94% to sequence 1 40% to 2U1 MVLQLLSDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNYPERVMRKWSKKYGPVMSVRMG REDWVVLGDYETIQQ (0) SLVKQGQCFSGRPDVPVLNQITNGHGLITVDYNEDWKTQRRFGITTLRG (2) FGVGKRSMEDRIVEEVAYLNDAIRSHNEKPFDIL (0) SILSNAVSNNICSVVMGRRFDYDDKRFMEIMARLSRS (2) FNDPTANFALNVVMFMPILVKIPPFSRINNQLMTDVRVI (1) LQMLREILSEHKSTFNKDDVRDFIDAFIAEQNSESKHSSYT (0) DLQLLQYVRDLFVAGTETTTSTLRWSILCMIHNPEKQEKLRKEICDVI (1) GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSVVHMTNQDVVLNGYTIPKGTT (0) ISPNLWAVHNNPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSIGPRHCLGEQLARMEYFIYLVSMV QKFEFFPDPNEPDLPDVEDGSSGVVFVPLRFKQIAKIV* rciad81k12 EST rcieg32b06 EST = AV862083 rciad45g08 EST rciad44e10 EST rciad47d10 EST DEV16203.x1 DEV25023.x01 DEV25023.x1 = SEQ 44 DEV25395.x1 small deletion 8aa DEV36142.x1 DEV36161.y1 DEV36161.x1 = SEQ 1 DEV43341.y1 DEV43341.x1 = SEQ 9 THIS MAY BE N-TERM DEV46240.y1 DEV46240.x1 = SEQ 1 DEV6477.x1 DEV8789.x1 DEV8789.y1 = SEQ 9 DEV9515.y1 DEV9515.x1 = SEQ 1 LQW12658.y1 LQW132813.x1 LQW156838.x1 LQW164355.y1 LQW164355.x1 = SEQ 9 LQW176676.y1 LQW188497.y1 LQW189083.y1 LQW189083.x1 = SEQ 9 LQW193177.y1 LQW198468.x1 LQW198468.y1 = SEQ 1 LQW206936.y1 LQW210436.x1 LQW210436.y1 = SEQ 9 LQW211598.x1 LQW212168.x1 LQW21261.y1 LQW224972.x1 LQW227602.x1 LQW227602.y1 = SEQ 1 LQW228810.x1 LQW229022.y1 LQW231535.y1 LQW232410.y1 LQW234563.x1 LQW249118.y1 LQW249118.x1 = SEQ 9 LQW25924.x1 LQW265547.y1 LQW268100.x1 LQW272015.x1 LQW278692.y1 LQW35104.x1 WXXP TO HEME LQW9894.x1 DEV13827.x01 note: DEV13827.y1 = seq 1 may be opposite end of gene DEV13827.x1 DEV16357.y1 DEV1636.x1 DEV17399.x1 15 aa deletion after TIQQ DEV19053.y1 DEV24912.y1 DEV25110.x01 DEV25110.x9 DEV25829.y1 DEV30184.x1 DEV34783.x1 DELETION OF SHNEKPFDIL DEV35744.x1 DEV36429.x1 DEV39712.y1 DEV43341.x1 DEV48144.y1 DEV50104.y1 DEV8789.y1 LGK545.y1 LQW103513.y1 LQW151967.y1 LQW164355.x1 LQW182856.y1 LQW184127.x1 LQW188756.x2 OPPOSITE END IN SEQ 1 LQW189083.x1 LQW210436.y1 LQW214090.y1 LQW21815.y1 LQW221848.y1 LQW226594.x1 HEME LQW226690.y1 LQW236384.y1 LQW249118.x1 LQW258179.x1 LQW36428.x1 LQW41750.x1 LQW60970.x1 LQW65773.x1 LQW8464.x1 LQW89091.y1 LQW96601.y1 >CIONA SAVIGNYI SEQUENCE 71% TO CIONA INTESTINALIS seq 2 70% TO SEQ 4 This sequence carefully checked for contigiuity between exons This sequence is not hybrid between multiple genes MLQRMLNEINVSTSFIFLTVFLGLYYWYRRPKNFPPGPRGIPFLGVLPFLGNYPERKMRK WSNKYGPVMSVRMGRQDWVVLGDHETIQQ (0) GLVKHGNAFSGRPSIATIDQITEGHGVLFIDYNDHWKRQRRFGLSTLRG (2) FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFDIQ (0) SILSNAVSNNICSIVMGRRFDYDDERFKEIMGRLARG (2) FNDPEASFVLQILIFMPALINFPYFSRINGELMEDVRVI (1) PELLREIVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0) DTQLLQYVRDLFVAGTETTTSTVRWSLLCLIHNPETQEKLRKEIFEVL (1) GPEKIPAFENKSKMPYTSAFIQEIYRYRTLVPLSVTHMTNEDAHISGYTIPKGTT (0) IAPNLWAVHNDPEEWDEPNKFKPERHLDAGGNFVQLKHVIPFSVGPRHCLGEQLARMEVF IFLVSLVQKFEFLPDPKATVLPDIENGASGAAYVPLPFKIVAKVV* Accession length (datafile) exon # G126P600183F.T0/G126P600183FG2.T0.seq 711 (13) exon 1 G126P69914R.T0/G126P69914RA5.T0.seq 727 (10) exon 1 G126P600628R.T0/G126P600628RB7.T0.seq 710 (08) exon 1 walked to exon 2 G126P616258F.T0/G126P616258FG2.T0.seq 675 (37) exon 2 G126P606888F.T0/G126P606888FD6.T0.seq 758 (25) exon 2 G126P69517R.T0/G126P69517RE7.T0.seq 708 (12) exon 3 G126P64550R.T0/G126P64550RF9.T0.seq 795 (00) exon 4 G126P64980F.T0/G126P64980FE12.T0.seq 726 (03) exon 4, 5 G126P600445F.T0/G126P600445FB7.T0.seq 711 (08) exon 4 G126P601719R.T0/G126P601719RH9.T0.seq 739 (13) exon 4 G126P602293R.T0/G126P602293RE9.T0.seq 739 (13) exon 4 G126P611387R.T0/G126P611387RG7.T0.seq 717 (25) exon 4, 5 G126P67490R.T0/G126P67490RD8.T0.seq 731 (04) exon 4, 5, 6 G126P68083F.T0/G126P68083FC9.T0.seq 731 (07) exon 4, 5, 6 G126P64682R.T0/G126P64682RH9.T0.seq 733 (02) exon 6, 7 G126P69523R.T0/G126P69523RG9.T0.seq 733 (??) exon 6, 7 G126P69883F.T0/G126P69883FD3.T0.seq 738 (10) exon 8, 9 G126P609473F.T0/G126P609473FC2.T0.seq 761 (24) exon 8, 9 G126P606736F.T0/G126P606736FH9.T0.seq 692 (20) exon 8, 9 G126P611477F.T0/G126P611477FF12.T0.seq 950 (41) exon 9 >Second savignyi whole gene ortholog to seq 2/4 Checked for contiguity between exons not a hybrid gene 68% to seq 2 and seq 3, 78% to first savignyi ortholog of seq 2/4 MLQRVFNEINVSTGFVVLTVFLALYYWYRRPKNYPPGPRGIPFLGVLPFLGNYAERTMHK WSKKYGPVMSVRMGRQDWVFLGDHETIRQ (0) ALVNQGNSFSGRPLIATLDQITEGHGIVLLDYGDWWKRQRSFGFSTLRG (2) FGVGKRSMEDRITEEVAYLNDAIRSHDGKAFDIK (0) SILSNAVSNNISSIVMGRRFEYDDEHSKEIMARLAMG (2) FNDPDTSFFLQILIFMPACVHLPYFSRVNKKLMEDVHVTR (1) SELLREIIAEHKASYDQDNHRDFIDAFLGEHNAENGTDAFT (0) DKQLLHYVRDLFFAGSETSTSTLRWTLLCLIHHPEKQEKLRKEIFEVL (1) GQEKIPAVDNKAYMPYTCAFMQEVYRYRTLAPFGVAHMTNEDVNLNGYSIPNGTT (0) IYVNLWAVHNNPDVWDEPNKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIYL VSLVQKFEFLPDPDATELPDIENGSYGPICAPKPFKMVAREV* G126P66001F.T0/G126P66001FD9.T0.seq 747 (07) exon 1, 2 G126P600523F.T0/G126P600523FG11.T0.seq 708 (08) exon 2, 3 G126P600643F.T0/G126P600643FC9.T0.seq 725 (09) exon 2, 3 G126P69906R.T0/G126P69906RE4.T0.seq 969 (12) exon 3 walked from RE4 to FC8 G126P605356F.T0/G126P605356FF5.T0.seq 762 (20) exon 4 G126P69993F.T0/G126P69993FC8.T0.seq 717 (09) exon 4, 5 intron 3-4 = 1600bp G126P603029R.T0/G126P603029RF12.T0.seq 943 (15) exon 4, 5 G126P603996F.T0/G126P603996FA6.T0.seq 739 (15) exon 5, 6 G126P66969R.T0/G126P66969RF11.T0.seq 710 (03) exon 6 walked to exon 7 G126P66256F.T0/G126P66256FC11.T0.seq 964 (05) exon 7 G126P604570R.T0/G126P604570RD6.T0.seq 735 (14) exon 7, 8 G126P604249F.T0/G126P604249FC7.T0.seq 711 (16) exon 7, 8 G126P612840R.T0/G126P612840RC1.T0.seq 711 (29) exon 7, 8 G126P612199R.T0/G126P612199RD4.T0.seq 742 (32) exon 7, 8 walked to exon 9 G126P607416R.T0/G126P607416RF5.T0.seq 731 (21) exon 9 G126P608773F.T0/G126P608773FA7.T0.seq 728 (23) exon 9 G126P608264R.T0/G126P608264RF7.T0.seq 993 (24) exon 9 >related savignyi 2/4 like exon sequences (probably two more closely related genes) TLVKQGSIFSGRPSIPILEEMTKGHGILLLDYGEKWKSQRKFGLMTLRG (2) GLVKQGTAFSGRPSIDIIERVSKGNGMIFLNYDEHWKKQRRFGLSTLRG (2) = RB10 FGVGKRSMEDRITEEVAYLNDAIRTHDGKAFNIQ (0) = RB10 SILGNAISNNICSIVMGRRFDYDDERFKEIITRLARR (2) FNDPEVSLVRQILIFMPALVNAPYFSRINAELMENVRVI (1) FIDPEVSSALQIFVFMPALVNFPYFSRIYGELMKDVRVI (1) PESLREMVEEHKASYEQDNHRDFIDAFLGEQKAENGAKTFT (0) SELLREIVADHKALYDQDNHRDFIDAFLGEQKSENGSETFT (0) DMQLLHYVRDLFVAGTETTTSTLRWSLLCLIHDPEIQDKLRKEIFEVL (1) GQEKIPAFEDKSKMPYTRAFIQEIYRHRTLLPLSVTHMTNEDANICGYTIQKGTT (0) ISSNLWAVHNDPDVWNEPSKFKPERHLDDKGNFVQSSHVIPFSVGPRHCLGEQLARME VFIYLVSLVQKFEFLPDPDATELPDIKIGSNGPAYVPLPFNMVARVV* IAPNLWAVHNDPVVWDEPNKFKPERHLDADGNFVQLKHVIPFSVGPRHCLGEQLARMEVF IFLVSLVQKFEFLPDPKAKVLSDNENGASGAAYVPLPFKIVAKVV G126P61537F.T0/G126P61537FD1.T0.seq 721 (00) exon 1 G126P64657R.T0/G126P64657RH5.T0.seq 766 (01) exon 1, 2 G126P69683F.T0/G126P69683FC11.T0.seq 762 (08) exon 2 G126P65262R.T0/G126P65262RC7.T0.seq 736 (07) exon 2 G126P68944R.T0/G126P68944RB10.T0.seq 738 (09) exon 3 G126P64729R.T0/G126P64729RG12.T0.seq 725 (00) exon 3, 4 G126P608695R.T0/G126P608695RC1.T0.seq 731 (21) exon 5 partial G126P612819R.T0/G126P612819RB12.T0.seq 726 (30) exon 4, 5 G126P65190R.T0/G126P65190RF6.T0.seq 753 (03) exon 5, 6 G126P67778F.T0/G126P67778FG5.T0.seq 719 (05) exon 6 G126P607271F.T0/G126P607271FG11.T0.seq 775 (20) exon 6, 7 G126P609438F.T0/G126P609438FB4.T0.seq 709 (22) exon 7 G126P64695R.T0/G126P64695RH10.T0.seq 752 (01) exon 8, 9 G126P601373R.T0/G126P601373RB4.T0.seq 705 (09) exon 9 >sequence 4, 8, 92 (48 accessions) 83% to sequence 2, 38% to 2U1, 2R1, 2D6 faces away from seq 220 on same clone note the end of seq 4 is identical to the end of seq 1 MLQHLLGDINVSSLVIFFTAFLALYYWYTRPKNFPPGPRGVPFLGVIPFLGNY PERVMRKWSKKYGPVMSVRMGREDWVILNDYENIQK (0) ALVKQGHSFSGRPDIPVFTQINNGMGLLTVDYNDHWKMQRRFGITTLRG (2) FGVGKRSMEDRIVEEVAYLNDAIRSHNDKPFDIL (0) GVLSNAVSNNICSIVMGRRFDYDDERYKEIMFRLSRS (2) TKDPEANFYITVLMFVPFLVNFPPFSRINKEMMDDVKVILG (1) DHLDEIVSEHKSTFNKDDARDFIDAFIAEKNSQNKHSSFT (0) DSQLLHYVVDLFEAGTETTTSTLMWSILCMIHNPEKQEKLRKEICSVV (1) GQDRVPAMNDKAQMPYTCAFMQEVFRYRTLVPLSLMHMTNEDVVLNGYNIPKGTT (0) VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVVAFSVGPRHCLG EQLARMEYFIYLVSMVQKFEFLPDPNEPNLPDIEKGSNGAAFVPLPFKQIARVV* DEV19056.x1 exon 1 LQW246757.y1 exons 1, 2 LQW195694.x1 exons 1, 2 LQW41553.y1 exons 1, 2 DEV29656.x1 exons 1, 2 LQW242270.y1 exons 1, 2, 3, 4 OPPOSITE END = SEQ 220 (GENE CLUSTER?) LQW188756.x2 exons 3, 4 DEV55352.x1 exon 4 LQW252475.y1 exons 5, 6 opposite end .x1 = seq 220 (- strand) LQW36797.x1 exons 5, 6 LQW263538.y1 exons 5, 6, 7 LQW26095.x1 exons 5, 6, 7, 8 LQW269368.y1 exons 5, 6, 7, 8 DEV40081.y1 exons 6, 7, 8 DEV29656.y1 exons 7, 8 LQW221848.x1 exon 8 LQW174458.x2 exon 8 LQW111344.x01 exon 8 LQW60970.y1 exon 8 LQW8752.x1 exon 8 LQW246757.x1 exon 8 LQW145059.x1 exon 8 LQW145059.x2 exon 8 LQW187803.y1 exon 8 LQW103513.x1 exon 8 DEV37657.y1 exon 8 DEV37657.y2 exon 8 DEV39543.y1 exon 8 LQW224972.x1 exon 8 LQW212168.x1 exon 8 LQW46430.x1 exon 8 DEV9515.y1 exon 8 LQW259311.x2 exon 9 LQW259311.x1 exon 9 DEV36161.x1 exon 9 DEV46240.x1 exon 9 LQW95486.y1 exon 9 LQW258179.y1 exon 9 DEV9515.x1 exon 9 LQW146181.y1 exon 9 LQW227602.y1 exon 9 LQW198468.y1 exon 9 LQW183999.y1 exon 9 DEV13827.y1 exon 9 LQW188756.y1 exon 9 DEV39712.x1 exon 9 DEV19056.y1 exon 9 DEV5183.x01 exon 9 >SEQUENCE 211 94% to seq 4, 4 diffs probably = seq 4 opposite end = seq 2 689 PERHLDDQGTFVQSKHVVAFSVGPRHCLGEPLARMEYLIYLVSMVQKFEFLPDPNEPNLP 510 509 DIEKGSNGAAFVPLP 465 LQW211598.y1 HEME >SEQUENCE 27, 28, 220 SIMILAR TO SEQ 2 and seq 8 BUT NOT SAME SEQ Faces away from seq 8 on same clone There appear to be at least two and maybe three closely related sequences MLTNLIANPSISPVLAILSCVIAFYYWYKRPKNMPPGPRGIPFLGIIPFVGMN PEQAFMQWSKKYGPVITVRMGRKDWVVLCDHDTIHQ (0) VLVKQSTVCSGRPKIPIVSELSKGHGILFADYCEKWKSQRKFGMKTLRE (2) FGVGKKCTEDRVLEEVDFLCNEIRSKNGKPFDIQ (0) DIMCNAVSNIIMNIVIGRRCNYDENFFTDVISRFTKW (2) FNDPTAGAMFTGMMFLPQLKYVPPFSKYYKIFRDDIGALH (1) EFFEEVIKEHEKNFDGNNLRDFIDAFLLEMKKNESGSEFT (0) YIQLLNYIRDLFDGGTETTVSTSRWAILCMLHYPETQKKLRNEIMTII (1) GPNTPASMSHKSDMPYTCAFIQEVLRFRTLVPLSVPHKVNEDATVNGYTIPKGVT (0) VSPNLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIFL ISMVQKFEFLPDPNEPDLPEINHGTNGAAFVPLPYKIVANQI* LQW219274.y1 upstream of exon 6 opp end = exon 1 DEV10453.x1 exon 1 LQW219274.x1 exon 1 LQW242270.x1 exon 1 opposite end .y1 = seq 4 LQW252475.x1 exon 1 plus 711 bp upstream LQW275290.x1 exon 2 LQW195694.y1 exon 2 opposite end .x1 = seq 4 LQW212515.y1 exon 2, 3, 4 LQW211000.y1 exon 3, 4 LQW173295.y1 exon 3, 4 DEV3674.y1 exon 3, 4 LQW275290.y1 exon 6, 7 2 aa diffs I-HELIX DEV23276.x1 exon 6 2 DEV10453.y1 exon 6 2 LQW212515.x1 exon 7, 8 LQW268927.y1 exon 7, 8 LQW173295.x1 exon 8 DEV3674.x1 exon 9 LQW211000.x1 exon 8 DEV21239.y1 exon 9 one diff with 18, 23, 27, 55 DEV40662.y1 >sequence 42, 59, 218 36% TO 2U1 Fugu MLSLFFSSVSIFFRCFSFLFNSWTLTFATSVLLLLVIMDKTD KKMPGSPPGPRGLPILGMLPFMMSYPERLMADWSKKHGPVIM (0) VKMGPKNVVILGSSDAAHAAFVKNTHLANRPQDGLEIFAGGKGILFINS SMFHSEQRRFCLSALREFGMGRRTLEPKIVDCAALLCDQIDDVCGHDVST (1) GPIQIEQMIYVTMSNVISHLVFGHDSLNDNKGLCELLLRTIEPNKYNALAGILMFLP SLKNVPPFSTTTRRAIEFREDLHRYIKMEIDRHRASRDPNQPRDFIDKFLNEIEKIKVRHNSNNNNNNNLILNNNVK (1) GSKSRRKIAGASFSEDQLIPLVRDLFMAGTDTSSTTITWVVIFLASFPDVQTRLHEELDSVLGQNRTPSVSMEDKMPYMRAVIQ (0) ETFRMRPPLPLSVPHVAQCNTTLMGYRVPKDSIILTNTWGIQNNPKFWVDPDTFKPQRHIDDLGKFIKSPNVI PFNIGSRNCLGQQLAKMELFITIASLCRRFWFTLPTGETPDLKGESTFILRPFPFNVVATKRV* DEV45222.y1 exon 1 LQW242739.x1 exon 1 LQW242739.x2 exon 1 LQW131160.y1 exon 1 LQW33948.y1 exon 1 LQW135435.y1 exon 1 DEV15725.y2 exon 1 DEV15725.y1 exons 1, 2 LQW185195.y1 exons 1, 2 AV963814.1 EST exons 2, 3 AV963844.1 EST exons 2, 3 AV957999.1 EST exons 2, 3 LQW181338.x1 exon 3 LQW181338.x3 exon 3 LQW74897.y1 exon 3 LQW158751.y1 exon 3 DEV35314.x1 exon 3 DEV35314.x4 exon 3 AV852807.1 EST exon 4 DEV57124.y1 exon 4 LQW47584.x1 exon 4 LQW167366.x1 exon 4 2 diffs LQW32310.x1 exon 4 DEV29829.x1 exons 4, 5 LQW33948.x1 exons 4, 5 DEV50829.x1 exons 4, 5 LQW131920.x1 exon 5 LQW185195.x1 exon 5 LQW220705.x1 exon 5 DEV45222.x1 exon 5 >sequence 5 4 accessions to sequence 1 PKG to heme 42% TO 2R1 VLTNIWRVHNNQNVWKNPHEFRPDRHLDSNGNFVSSNNVIPFACGYRRCLGEQIAKAE IFLFIVNIVKRFHLVVDQVTGPPTLEKDPKGAVNTPAPF LQW185918.x1 DEV58821.x2 DEV58821.x1 LQW152115.x1 PERF TO END LQW228498.x1 HEME 2 DIFFS GCiWno174_g15.g1 >GCiWno174_g15.g1 NTTTATGCCTGGNCNCAGAGGATGCAAAATTGCGTGTCTACTGGAACACTGTGTATGATCATGGACTGATGCCTTCTTAT AATATTATCTGCTATGAGCTGTATTTATTTATTAAAAGCTGTATTTTTGTAGCAAAAGTGTGTAAGTAAAGCAGGATTAT ATATAGATATGAAACTGTTTAAAACGTGTGCTTAACAAATTGTGCTGTCAGTTTTTTCTCGCTCGAACGCTGAAACGAAA AGGTGCTGGCGTGTTCACGGCTCCCTTAGGATCTTTTTCCAACGTCGGCGGACCGGTTACTTGATCCACCACCAAGTGAA ACCGTTTGACAATATTTACTATAAAAAGAAATATCTCTGCTTTGGCTATTTGCTCGCCCAAACAACGCCTATAACCACAT GCAAAAGGAATGACGTTGTTAGATGAGACGAAATTGCCGTTGCTGTCAAGATGACGATCGGGTCGAAACTCGTGTGGATT TTTCCAAACATTTTGATTATTGTGGACCCTCCAAATGTTGGTAAATACCTATATGTTAGAATTAAGCACAAAACGTACAT GTTATACTTTATTATATATATCATTATTAATGTAACTTATTTATCCTAGCCTGACAGGGCAACGCCAGTCTCCAAAACAG AGGCGTTCTGTCCTACATATCATGTCCGCTTATACGAGTTACCACTTATATAACTTCATAGGCGATTTTTTTGGTTATTT TTGTTCAACGTCTGGCTGCCAATTTAGACGACCACATAAGGTAGACCCATATGGGCTGATGTGTGCTGATTGACCAATGT ACTTACTGGAGTATCTTTAGGAATGTCATAACCGGACCATGGCACGCTTCCAACAGTTCGATGAAGCAGATTCCCCGTGG CAATGGGATCGTGCCGGTGTTTTTCTTGTATAAACCAACG >GCiWno174_g15.g1_4 LVYTRKTPARSHCHGESASSNCWKRAMVRL*HS*RYSSKYIGQSAHISPYGSTLCGRLNW QPDVEQK*PKKSPMKLYKW*LV*ADMICRTERLCFGDWRCPVRLG*ISYINNDIYNKV*H VRFVLNSNI*VFTNIWRVHNNQNVWKNPHEFRPDRHLDSNGNFVSSNNVIPFACGYRRCL GEQIAKAEIFLFIVNIVKRFHLVVDQVTGPPTLEKDPKGAVNTPAPFRFSVRARKN*QHN LLSTRFKQFHIYI*SCFTYTLLLQKYSF**INTAHSR*YYKKASVHDHTQCSSRHAILHP LXPGIX >GCiWno174_g15.g1_5 VGLYKKNTGTIPLPRGICFIELLEACHGPVMTFLKILQ*VHWSISTHQPIWVYLMWSSKL AARR*TKITKKIAYEVI*VVTRISGHDM*DRTPLFWRLALPCQARINKLH***YI**SIT CTFCA*F*HIGIYQHLEGPQ*SKCLEKSTRVSTRSSS*QQRQFRLI*QRHSFCMWL*ALF GRANSQSRDISFYSKYCQTVSLGGGSSNRSADVGKRS*GSREHASTFSFQRSSEKKLTAQ FVKHTF*TVSYLYIILLYLHTFATKIQLLINKYSS*QIIL*EGISP*SYTVFQ*TRNFAS SXXRHKX >GCiWno174_g15.g1_6 RWFIQEKHRHDPIATGNLLHRTVGSVPWSGYDIPKDTPVSTLVNQHTSAHMGLPYVVV*I GSQTLNKNNQKNRL*SYISGNSYKRT*YVGQNASVLETGVALSG*DK*VTLIMIYIIKYN MYVLCLILTYRYLPTFGGSTIIKMFGKIHTSFDPIVILTATAISSHLTTSFLLHVVIGVV WASK*PKQRYFFL**ILSNGFTWWWIK*PVRRRWKKILREP*TRQHLFVSAFEREKTDST IC*AHVLNSFISIYNPALLTHFCYKNTAFNK*IQLIADNIIRRHQSMIIHSVPVDTQFCI LXXQA*X >sequence 7, 100, 101 complete 19 accessions 36% to 2J2, 36% to 2Y1 MLDYLTFTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLPFMETLAERTFTKWSKKYGPI ITVQLGDHRTVILNSYEAIEE (0) AFIKKGEKFNGRFKTYFTEFASEGLGMVFIDGEKWKEHRKFGTRALAG (2) AGMYGKTIEQRVLEEAENICHVIRSKNEEPFILE (0) DDLILSVANVINGITIRERCDEEGNENLLKYAAIVVEG (2) FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF (1) agt TLFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFS (0) KPQLRYFLKDLFAAGTETSSSTIRWALFYLIVNPDIQDKVHKEIDDVI (1) GHNGVVRYDTKLPYTKAVLQETYRIRTATPLGVPRRNTEDVTLMGYFIPKNTQ (0) IIPNLWWVHNDPEYWNEPDVFKPERHLDEEGNLIMSNRVIPFSIGARHCLGENLARTEIFL FLVSILQKFTVLPNPEDPNPPLECSPGIVNSPHKYPLIMKER* AV956864 EST exons 1, 2, 3, 4 and part of 5 LQW15076.y1 exons 4, 5 DEV13161.x01 exon 5 LQW241946.y2 exon 5 LQW148541.y1 exon 5 DEV38450.x1 C-helix LQW169782.x1 LQW189302.y1 LQW189302.x1 N-TERM LQW211779.y1 LQW243977.y1 LQW243977.x1 mid LQW247662.y1 LQW247662.x1 C-helix LQW155205.x1 C-helix LQW266480.y1 LQW70591.x1 LQW70591.y1 C-helix LQW70591.y2 C-helix LQW208910.x1 DEV43834.y1 N-TERM DEV8515.y1 LQW145047.y2 C-helix LQW25145.x1 mid GCiWno946_o03.g1 ex 7 GCiWno41_m16.g1 ex 6 DFCRKEILEHKKNLDENEPGDYIDAFLVEMKKH LQW201878.x1 from a similar gene but not same gene >cicl015f07_3 = seq 7 FTNLALFAIALLLFYIWLTPKYKNLPPGPMGVPFLGCLPFMETLAERTFTKWSKKYGPII TVQLGDHRTVILNSYEAIEEAFTKKGEKFNGRFKTYFTEFASEGLGIFFIDGEKWKEHRK FGTRALAGAGMYGKTIEQRVLEEAENICHVIRSKNEEPFILEDYLILSVANVINGITIKE RCDEEGNENLLKYVASVVEGFRKDGLIYSLLSVIPWCR >GCiWno946_o03.g1 TCGATTCGAGCTNCGGTACCCTTTAAGATTACTATATACAAAAGTTATTATTATAAAATAACATTTTAATATATTTATTA TTAACACCATAACGTTATTCAATTTCTTTACAGAAACCACAACTACGTTATTTTTTAAAAGATCTTTTCGCTGCTGGAAC CGAAACGAGCAGTAGCACAATACGTTGGGCGTTATTTTACCTCATTGTTAACCCTGATATTCAGGATAAAGTGCACAAGG AAATAGACGATGTTATAGGTAAAATGGTATGGGAGTCATGGGACTGTATATTTAAATTGATTTTCGTTTGCAATCGTCAT AACTTTGGGTTATTTCGTATTTACCAACATATATAGCAAATAAATATTAGTTGTTTTTTAATAACTTTTGATTATTTTTA TAAATTACTTCTACCCTCTTACTCTTTTGTAACCGATTTGTGTATACGTCTATGCACAGTTTCTGCTTGAAATACTTATA ACTTACCCCAATAAATACAACGTATTTCATTTACAAACAGGGCATAACGGAGTTGTTCGTTACGACACCAAACTACCATA CACTAAAGCGGTCCTTCAAGAGACTTATAGAATACGAACAGCGACGCCTCTTGGCGTGCCAAGACGAAATACTGAGGATG TTACATTGATGGGATACTTTATACCAAAGAACACTCAGGTATAATAATACATGATGCTATCCATATTTGAATACATGCAG GTTTTAACTAGTTTTGATGTGGAAATCAGGTCTAAAACATATAGTACCGTCAGGGNTAAATGTAAACCGCTATATAGCAC ATCAGTTCGTTAACAATTAAGTTCCATGTCGATTTAGATCATTCCNTAACCTTGGTGGGTCCACAACGACCCAGAGTATT GGAACGAACCCCGACGTTTC >_1 SIRAXVPFKITIYKSYYYKITF*YIYY*HHNVIQFLYRNHNYVIF*KIFSLLEPKRAVAQ YVGRYFTSLLTLIFRIKCTRK*TML*VKWYGSHGTVYLN*FSFAIVITLGYFVFTNIYSK *ILVVF**LLIIFINYFYPLTLL*PICVYVYAQFLLEILITYPNKYNVFHLQT GHNGVVR YDTKLPYTKAVLQETYRIRTATPLGVPRRNTEDVTLMGYFIPKNTQV**YMMLSIFEYMQ VLTSFDVEIRSKTYSTVRXKCKPLYSTSVR*QLSSMSI*IIP*PWWVHNDPEYWNEPRRF >_2 RFELRYPLRLLYTKVIIIK*HFNIFIINTITLFNFFTETTTTLFFKRSFRCWNRNEQ*HN TLGVILPHC*P*YSG*SAQGNRRCYR*NGMGVMGLYI*IDFRLQSS*LWVISYLPTYIAN KY*LFFNNF*LFL*ITSTLLLFCNRFVYTSMHSFCLKYL*LTPINTTYFIYKQGITELFV TTPNYHTLKRSFKRLIEYEQRRLLACQDEILRMLH*WDTLYQRTLRYNNT*CYPYLNTCR F*LVLMWKSGLKHIVPSGXNVNRYIAHQFVNN*VPCRFRSFXNLGGSTTTQSIGTNPDV >_3 DSSXGTL*DYYIQKLLL*NNILIYLLLTP*RYSISLQKPQLRYFLKDLFAAGTETSSSTI RWALFYLIVNPDIQDKVHKEIDDVIGKMVWESWDCIFKLIFVCNRHNFGLFRIYQHI*QI NISCFLITFDYFYKLLLPSYSFVTDLCIRLCTVSA*NTYNLPQ*IQRISFTNRA*RSCSL RHQTTIH*SGPSRDL*NTNSDASWRAKTKY*GCYIDGILYTKEHSGIIIHDAIHI*IHAG FN*F*CGNQV*NI*YRQG*M*TAI*HISSLTIKFHVDLDHSXTLVGPQRPRVLERTPTF 2N9 mid region 8235 GEPFDPVPLLNNAVANIICQIVFGRRFDYTDHMFQRMLHHLTEMAYLEGSIWAL 8074 (0) 7991 LYDSFPALMKHLPGPHNGIFSSSSSLQ GFIWREIQRHKSDLDPSNPRDYIDAFLIEEGNGN-NQLGFE ALFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFSVSQLYFKIT ERNLVLCCLDLFLAGSETTSKTLQWGLIYLIRKPHIQ 7606 (1) >GCiWno41_m16.g1 CHROMAT_FILE: GCiWno41_m16.g1 PHD_FILE: GCiWno41_m16.g1.phd.1 CHEM: term DYE: big TIME: Sat Sep 1 11:02:18 2001 TEMPLATE: GCiWno41_m16 DIRECTION: rev Length = 868 Score = 75.3 bits (182), Expect = 7e-14 Identities = 34/34 (100%), Positives = 34/34 (100%) Frame = -2 Query: 1 ALFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFS 34 ALFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFS Sbjct: 378 ALFDNLIEQHQQQHDRLHPRDLIDLYLNQMKDFS 277 >GCiWno41_m16.g1 TATTGTTATAATAATAAAGGTAATAAAATGTTATAATAAGAATAACTTTTGCATAGTAGGTTGGGGGAAGATGGGACACC TTTTTATTTGATTTTCTCGTTCCATTTGGTAGTAAATAAAGATCCTCACGACTCACATAAACCGCAGTTAATTGGATATT ATGTGCTAAAGGTGTCCCGATTTCCCCCACCCTACTATGTAATAACAAAGAAATTAAATGTTATAATAATAGTAATAACT TTTGTATATAGTAATCTTAAAATACAGTTGACTTACAGAAAAGTCTTTCATTTGATTCAAATACAAATCAATCAGATCTC TTGGGTGGAGACGGTCATGTTGTTGCTGGTGTTGTTCTATCAAGTTATCGAACAACGCTGCAATAATTATTACCATGAAA TGAACGAACGAATGATGTAACTTATTGGCTCATCCTCGTATGATGGGCAAAATAATTGTAACACCGCTTACGAGTTACCA CGCATGTGTCTTTGTAAGTAAAAACTTTTTTTATGTTTTTGTATTGTATTGCTAATTACTTAGAGTTTATAGACAACCCA TAAGCAACCACTGGGTTGGAGCAAATGCCGTTAAGTTTCTTGCCCAAGAATACGCCCACAGTAGTGTAGCAACCACGAGC CTTGAACCCGTAACATCTCGGTAAGAGGCAGGTTCAAATGCAGTTTAGCAAAAAGTATTGTATCCTATTTCATAAATCGC TTATTTAGTAATTTACATTTTTCTCTTTAACTAAATGTTNTACCGTTTCCCGCTTTGAGTCCATCTTTCAGGCTTTTTAT GTGTTTTTGAGTCTGCGGAAAGAATCGACACCAGGGAATAACCGACNNACACGAATAAATCAAACCAT >_4 WFDLFVXVGYSLVSILSADSKTHKKPERWTQSGKRXNI*LKRKM*ITK*AIYEIGYNTFC *TAFEPASYRDVTGSRLVVATLLWAYSWARNLTAFAPTQWLLMGCL*TLSN*QYNTKT*K KFLLTKTHAW*LVSGVTIILPIIRG*ANKLHHSFVHFMVIIIAALFDNLIEQHQQQHDRL HPRDLIDLYLNQMKDFSVSQLYFKITIYKSYYYYYNI*FLCYYIVGWGKSGHL*HIISN* LRFM*VVRIFIYYQMERENQIKRCPIFPQPTMQKLFLL*HFITFIIITI >_5 MV*FIRVXRLFPGVDSFRRLKNT*KA*KMDSKRETVXHLVKEKNVNY*ISDL*NRIQYFL LNCI*TCLLPRCYGFKARGCYTTVGVFLGKKLNGICSNPVVAYGLSINSK*LAIQYKNIK KVFTYKDTCVVTRKRCYNYFAHHTRMSQ*VTSFVRSFHGNNYCSVVR*LDRTTPATT*PS PPKRSD*FVFESNERLFCKSTVF*DYYIQKLLLLL*HLISLLLHSRVGEIGTPLAHNIQL TAVYVSREDLYLLPNGTRKSNKKVSHLPPTYYAKVILIITFYYLYYYNNX >_6 GLIYSCXSVIPWCRFFPQTQKHIKSLKDGLKAGNGXTFS*REKCKLLNKRFMK*DTILFA KLHLNLPLTEMLRVQGSWLLHYCGRILGQET*RHLLQPSGCLWVVYKL*VISNTIQKHKK SFYLQRHMRGNS*AVLQLFCPSYEDEPISYIIRSFISW**LLQRCSIT**NNTSNNMTVS TQEI*LICI*IK*KTFL*VNCILRLLYTKVITIIITFNFFVIT**GGGNRDTFST*YPIN CGLCES*GSLFTTKWNEKIK*KGVPSSPNLLCKSYSYYNILLPLLL*QX A related EST >cits048p05 CGGCCGCTACTGCCTAAACTTGCTAGAAAGTATGGACCCATTTTCACTATTACAGCTGGGTATAGACGCGTGGTGTTTCT TGTTGGCTATGACTTAATTAAAGAAGTTATCACTGACAGAGCAAAAGATTTTGCATCAAGATGTCCCAACATGCCAGGAA GAATTGTCCGTGGAGAAGGTTTAGATGGCATTGCTGCAGCCCCATATGGTCCCAAATGGATGGCAAACCGTAAGTTCTTT TACTCTGCCATGCGCACCATGGGGTTGGGAAAACGTGGGATAGAAAAGTGTGTGGTTGATGAGATTCCCTACATTGTTGA AGAACTTGAGAAGCTTTGCACCAAAAATGAGTTGTTTGAACCATCGAGTGTATTTGACTCCGCTGTACTGAACGTGCTTG CATATTTCACTTTTGGAAACAGATATTCCTATCAAGACGAAAAATTTAAAGAGCTGATTCACATCAATAATGATTTCTTT CAAAAAGCAAAGTTTCTGAATCAGCCACTATTCTTCCTCCTCAGTTTGGTACCTGGCCTGCATAAATACTGGCTTCCTCA ATGTGGAAAAGATCTAAAAGAATCAGTTGGGAAAATCAACAAATTTGTAAAAGCTGAGATTGAACAACATCGACAAAATT TTGACCGCAAAAATCCAAGAGATTATATTGACTGTTATCTCAAAGAGCTCGATCAGATGAATGATCAAAGCGAACTATCG G RPLLPKLARKYGPIFTITAGYRRVVFLVGYDLIKEVITDRAKDFASRCPNMPGRIVRGEG LDGIAAAPYGPKWMANRKFFYSAMRTMGLGKRGIEKCVVDEIPYIVEELEKLCTKNELFE PSSVFDSAVLNVLAYFTFGNRYSYQDEKFKELIHINNDFFQKAKFLNQPLFFLLSLVPGL HKYWLPQCGKDLKESVGKINKFVKAEIEQHRQNFDRKNPRDYIDCYLKELDQMNDQSELS >GCiWno434_a04.g1 CHROMAT_FILE: GCiWno434_a04.g1 PHD_FILE: GCiWno434_a04.g1.phd.1 CHEM: term DYE: big TIME: Thu Oct 11 12:43:00 2001 TEMPLATE: GCiWno434_a04 DIRECTION: rev Length = 940 Score = 91.7 bits (224), Expect = 8e-19 Identities = 41/42 (97%), Positives = 42/42 (99%) Frame = +3 Query: 1 FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF 42 FRKDGLIYSLLSVIPWCRFFPQTQKH+KSLKDGLKAGNGKTF Sbjct: 156 FRKDGLIYSLLSVIPWCRFFPQTQKHMKSLKDGLKAGNGKTF 281 >GCiWno434_a04.g1 TACGAATTCGAGCTCGTACCCTATTTTTTTAACGGTGTATTTATTTTCGTGCATGCGGTGTGAAAAATCGAATGTTAAAT TCACCGAAAATATAGCATATGCTATGCCATGTAAAAATAGGAAATATTAAAGCTATCAATAATTTCCCAAACAGGTTCCG GAAAGATGGTTTGATTTATTCGTTGCTGTCGGTTATTCCGTGGTGTCGATTCTTTCCGCAGACTCAAAAACACATGAAAA GCCTGAAAGATGGACTCAAAGCGGGAAACGGTAAAACATTTAGTTGAAGAGAAGAATGTAAATTACGAAATAAACGATTT ATGAAATAGGATACAATACTTTTTGCTAAACTGCATTTGAACCTGCCTCTTACCATGATGTTACGGGTTCAAGGCTCGTG GTTGCTACACTATACTGTGGGCGTATTCTTGGGCAAGAAACTTAACGGCATTTGCTCCAACCCAGTGGTTGCTTATAGGT TGTCTATAAACTCTAAGTAATTAGCAATACAATACAAAAACATAAAAAAAAGTTTTCACCTACAAAGACACATGTGTGGT AACTCGTAAGCGGTGTTACAATTATTTTGCCCATCATACGAGGATGAGCCAATAAGTTACATCATTCGTTCGTTCATTTC ATGGTAAAAATTATTGCAGCGTTGTTCGATAACTTGATAGAACAACACCAGCAACAACATGACCGTCTCCACCCAAGAGA TCTGATTGATTTGTATTTGAATCAAATGAAAGACTTTTCTGNTAGTCAACTGTATTTTAAGATTACTATATACAAAAGTT ATTACTATTATTATAATATTTAATTTCTTTGGTATACTGNANGGGTGGGGGGGGAAAATCGGGGAAACCTTTAGCACATA ATATCCCATTAACTGCGGTCTATGTGAGTCGTTGAGGATCTTTATTTACTACCCAATGGN >_1 YEFELVPYFFNGVFIFVHAV*KIEC*IHRKYSICYAM*K*EILKLSIISQTGSGKMV*FI RCCRLFRGVDSFRRLKNT*KA*KMDSKRETVKHLVEEKNVNYEINDL*NRIQYFLLNCI* TCLLP*CYGFKARGCYTILWAYSWARNLTAFAPTQWLLIGCL*TLSN*QYNTKT*KKVFT YKDTCVVTRKRCYNYFAHHTRMSQ*VTSFVRSFHGKNYCSVVR*LDRTTPATT*PSPPKR SD*FVFESNERLFX*STVF*DYYIQKLLLLL*YLISLVYXXGGGGKSGKPLAHNIPLTAV YVSR*GSLFTTQWX >_2 TNSSSYPIFLTVYLFSCMRCEKSNVKFTENIAYAMPCKNRKY*SYQ*FPKQVPERWFDLF VAVGYSVVSILSADSKTHEKPERWTQSGKR*NI*LKRRM*ITK*TIYEIGYNTFC*TAFE PASYHDVTGSRLVVATLYCGRILGQET*RHLLQPSGCL*VVYKL*VISNTIQKHKKKFSP TKTHVW*LVSGVTIILPIIRG*ANKLHHSFVHFMVKIIAALFDNLIEQHQQQHDRLHPRD LIDLYLNQMKDFSXSQLYFKITIYKSYYYYYNI*FLWYTXXVGGENRGNL*HIISH*LRS M*VVEDLYLLPNG >_3 RIRARTLFF*RCIYFRACGVKNRMLNSPKI*HMLCHVKIGNIKAINNFPNRFRKDGLIYS LLSVIPWCRFFPQTQKHMKSLKDGLKAGNGKTFS*REECKLRNKRFMK*DTILFAKLHLN LPLTMMLRVQGSWLLHYTVGVFLGKKLNGICSNPVVAYRLSINSK*LAIQYKNIKKSFHL QRHMCGNS*AVLQLFCPSYEDEPISYIIRSFISW*KLLQRCSIT**NNTSNNMTVSTQEI *LICI*IK*KTFLXVNCILRLLYTKVITIIIIFNFFGILXGWGGKIGETFST*YPINCGL CESLRIFIYYPM >LQW155205.y1 TAGCCTGGACCGCGCTGTCGTATCTATAAGTCTCTTGAAGGACGCTTTAGTGTATGGTAGTTTGGTGTCGTAACGAACAA CTCCGTTATGCCCTGTTTGTAAATAATATACGTTGTATTTATTGGGCTAAGTTTTAAATATTTCATGCAGAAGTTGTGTT TAGGAGTAAATAAATAGTCGGCTAAGAGCATGTGGTTACAAAAGACTTAAGACTAAGAGGTGGTATTTGCTTTATAAGTA TAAAAGTAATTAAAAAACTTTGTTTTTTGTGATAATAGTAAAAAGTGCTTACAATTATGACGGTTGCATGCGAAAACCCA TATACATATATACGGTCACAAATCTTTGAATCACCTATAACGTCGTCTATTTCTTTATGCACTTTATCCTGAATATCAGG GTTAACAATGAGGTAAAATAACGCCCAACGTATGGTGCTACTACTCGTTTCGGTTCCAGCAGCGAAAAGATCTTTTAAAA AATAACGTAGTTGTGGTTTCTGTGAAGAAATTGAGTATTGTTATAATAATAAAGGTAATAAAATGTTATAATAAGAATAA CTTTTGCATAGTAGGTTGGGGGGAGATGGAACACCTTTTTATTTGATTTTCTCGTTCCATTTGGTAGTAAATAAAGATCC TCACGACTCACATAGAACGCAGTTAATGGGATATTATGTGCTCAAGGTGTCCAGATTTTCCCACCCTAATATGTAATAAC AAGAAATTAAATGTATAATAAAGTAATACTTTGTATTAGTATCTTAAATACGTGATTCAGAAATCTTCATTGATCAATAC ATAATCAGACCTGGGGGAACGGCACTTGAGCGGGTGACATCAGTTCAAAAGTCAGACTACTGAAGAAACAAGAGTAACGG CCAGAGAGGAAATGGCGCAAGCAAAAATGAAAAAAAAAGAGGGAAAACACAAAACGACAGCCACAAAAACATGACTACCC >_4 G*SCFCGCRFVFSLFFFHFCLRHFLSGRYSCFFSSLTFELMSPAQVPFPQV*LCIDQ*RF LNHVFKILIQSITLLYI*FLVITY*GGKIWTP*AHNIPLTAFYVSREDLYLLPNGTRKSN KKVFHLPPTYYAKVILIITFYYLYYYNNTQFLHRNHNYVIF*KIFSLLEPKRVVAPYVGR YFTSLLTLIFRIKCIKK*TTL*VIQRFVTVYMYMGFRMQPS*L*ALFTIITKNKVF*LLL YL*SKYHLLVLSLL*PHALSRLFIYS*TQLLHEIFKT*PNKYNVYYLQTGHNGVVRYDTK LPYTKASFKRLIDTTARSRL >_5 VVMFLWLSFCVFPLFFSFLLAPFPLWPLLLFLQ*SDF*TDVTRSSAVPPGLIMY*SMKIS ESRI*DTNTKYYFIIHLISCYYILGWENLDTLST*YPINCVLCES*GSLFTTKWNEKIK* KGVPSPPNLLCKSYSYYNILLPLLL*QYSISSQKPQLRYFLKDLFAAGTETSSSTIRWAL FYLIVNPDIQDKVHKEIDDVIGDSKICDRIYVYGFSHATVIIVSTFYYYHKKQSFLITFI LIKQIPPLSLKSFVTTCS*PTIYLLLNTTSA*NI*NLAQ*IQRILFTNRA*RSCSLRHQT TIH*SVLQETYRYDSAVQAX >_6 GSHVFVAVVLCFPSFFFIFACAISSLAVTLVSSVV*LLN*CHPLKCRSPRSDYVLINEDF *ITYLRY*YKVLLYYTFNFLLLHIRVGKSGHLEHIISH*LRSM*VVRIFIYYQMERENQI KRCSISPQPTMQKLFLL*HFITFIIITILNFFTETTTTLFFKRSFRCWNRNE**HHTLGV ILPHC*P*YSG*SA*RNRRRYR*FKDL*PYICIWVFACNRHNCKHFLLLSQKTKFFNYFY TYKANTTS*S*VFCNHMLLADYLFTPKHNFCMKYLKLSPINTTYIIYKQGITELFVTTPN YHTLKRPSRDL*IRQRGPGX >GCiWno464_n10.g1 CHROMAT_FILE: GCiWno464_n10.g1 PHD_FILE: GCiWno464_n10.g1.phd.1 CHEM: term DYE: big TIME: Mon Oct 15 09:58:50 2001 TEMPLATE: GCiWno464_n10 DIRECTION: rev Length = 925 Score = 92.8 bits (227), Expect = 4e-19 Identities = 42/42 (100%), Positives = 42/42 (100%) Frame = -1 Query: 1 FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF 42 FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF Sbjct: 688 FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF 563 >GCiWno400_a20.b1 CHROMAT_FILE: GCiWno400_a20.b1 PHD_FILE: GCiWno400_a20.b1.phd.1 CHEM: term DYE: big TIME: Wed Oct 10 12:08:39 2001 TEMPLATE: GCiWno400_a20 DIRECTION: fwd Length = 871 Score = 92.8 bits (227), Expect = 4e-19 Identities = 42/42 (100%), Positives = 42/42 (100%) Frame = +3 Query: 1 FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF 42 FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF Sbjct: 123 FRKDGLIYSLLSVIPWCRFFPQTQKHIKSLKDGLKAGNGKTF 248 >sequence 209 A different C-term from seq 27 or 10 last exon has 7 diffs from seq 10 LQW203867.x1 exon 9 LQW35630.x1 exon 9 3 diffs with seq 10 GCiWno809_c05.g1 GCiWno49_j21.g1 GCiWno46_e07.g1 GCiWno973_a07.b1 k-helix exon has 5 diffs with seq 10 GCiWno520_h02.b1 MIEAFLRQWTDTTTVIIFLFTLLFYYWYRRPLRFPPGPRGLPLV GVLPFLRKYTARTMHKWSNKYGPVMSVRMGNEDWVVMGNYEAVHQ (0?) 6 diffs to seq 199 SFVKAGNVFSGRPVLKVANEIAEGKGIVMRDFNTTWKTHRKFGSITLRG (2?) FGVGKKSLEERIYEEVEVMNKEILSKEGKPFDIT (0?) EILTNAVMNVISIITTDQRFEYDDEHFRLLQQIFTKW (2?) FLEPENTAAFAQIIYVPLLCNIPPYRSKYLEVKRDMKRVSG (?) VFQQMVEQHRKTFDKNNLRDFIDAFICEGKKGTDESFT (0) DGQLVQYVREMFEAGTETMTGTVRWAMLCLIHYPDAQKKLREEIFEAI GNNNRPSISDRKAMPFTSAFIQEVFRFRTRVPLGVQHMTTETVNFANYVIPKGTT (0) ILANMWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARME IFIFLVSMVQKFEFLPDPNEPDLPEINDGVKGNGFFPYPFNFVASEI* Top of seq 27 for comp MLTNLIANPSISPVLAILSCVIAFYYWYKRPKNMPPGPRGIPFLGIIPFVGMN PEQAFMQWSKKYGPVITVRMGRKDWVVLCDHDTIHQ (0) >GCiWno799_o14.b1 CHROMAT_FILE: GCiWno799_o14.b1 PHD_FILE: GCiWno799_o14.b1.phd.1 CHEM: term DYE: big TIME: Tue Dec 4 12:05:33 2001 TEMPLATE: GCiWno799_o14 DIRECTION: fwd Length = 828 Score = 148 bits (370), Expect = 2e-35 Identities = 79/127 (62%), Positives = 80/127 (62%), Gaps = 39/127 (30%) Frame = -3 Query: 1 GVLPFLRKYTARTMHKWSNKYGPVMSVRMGNEDWLAMGN--------------------- 39 GVLPFL KYTARTMHKWS KYGPVMSVRMGNEDW+ MGN Sbjct: 397 GVLPFLGKYTARTMHKWSKKYGPVMSVRMGNEDWVVMGNYEAVHQVLFENFKTAV**TST 218 Query: 40 ------------------SFVKAGNVFSGRPVLKVANEIAEGKGIVMRDFNTTWKTHRKF 81 SFVKAGNVFSGRPVLKVANEIAEGKGIVMRDFN WKTH KF Sbjct: 217 HKQVD*IGTALLNVTIFQSFVKAGNVFSGRPVLKVANEIAEGKGIVMRDFNAPWKTHSKF 38 Query: 82 GSITLRG 88 SI LRG Sbjct: 37 RSIPLRG 17 >GCiWno673_d07.b1_1 ISKPQYSKPVHTNKLIR*ALRF*MLQYFRAL*KQATCSLDVLC*RWRMR*LRARELL*EI LTQHGKHTESLEALLFGGI*HDLC*R*PLVRKRLNLNNIVLAIFTFKQNSKSNFILFRFG VGKKSLEERIYEEVEVMNKEILSKEGKPFDITVRACIYYV*TSIIHFNMNMRFFAYFWYS CVGGKMGHL*HIISKHPDYVLNK**QSIRVVGIRLYISLKVK*MFFNFV*EILTNAVMNV ISIITTDQRFEYDDEHFRLLXXFSRNVKSHFAYILGTEIMRSYMMD*TVLM >GCiWno673_d07.b1_2 FQNRSIVNQYTQTS*LDRHCDSECYNISELCKSRQRVLWTSCVEGGE*DS*GQGNCYERF *HNMENTQKVWKHYSSGVYSMICVNVNR*CEND*I*II*Y*RSLRSNKILNLTLFCSGSG LVRRA*KSESMKRWR**TKKFFQKKENHLTSQYVPVYIMYKRALYTLT*I*DFSLTFGIA VWGERWDTFST*YPSILTMF*INNNSLLESWEYGYIFL*R*NKCFLTLFRKS*PTQS*TS SAS*LPIKGLNMMTSIFDYFXXFHEMLRAILRIY*VQKSCEVT*WIELC* >GCiWno673_d07.b1_3 FKTAV**TSTHKQVD*IGTAILNVTIFQSFVKAGNVFSGRPVLKVANEIAEGKGIVMRDF NTTWKTHRKFGSITLRGYIA*FVLTLTVSAKTIKFK*YSTSDLYVQTKF*I*LYFVQVRG W*EELRRANL*RGGGNEQRNSFKRRKTI*HHSTCLYILCINEHYTL*HEYEIFRLLLV*L CGGKDGTPLAHNIQAS*LCFK*IITVY*SRGNTVIYFFEGKINVF*LCLGNLDQRSHERH QHHNYRSKV*I**RAFSTTSXXFTKC*EPFCVYTRYRNHAKLHDGLNCAD >GCiWno673_d07.b1 ATTTCAAAACCGCAGTATAGTAAACCAGTACACACAAACAAGTTGATTAGATAGGCACTGCGATTCTGAATGTTACAATA TTTCAGAGCTTTGTAAAAGCAGGCAACGTGTTCTCTGGACGTCCTGTGTTGAAGGTGGCGAATGAGATAGCTGAGGGCAA GGGAATTGTTATGAGAGATTTTAACACAACATGGAAAACACACAGAAAGTTTGGAAGCATTACTCTTCGGGGGTATATAG CATGATTTGTGTTAACGTTAACCGTTAGTGCGAAAACGATTAAATTTAAATAATATAGTACTAGCGATCTTTACGTTCAA ACAAAATTCTAAATCTAACTTTATTTTGTTCAGGTTCGGGGTTGGTAAGAAGAGCTTAGAAGAGCGAATCTATGAAGAGG TGGAGGTAATGAACAAAGAAATTCTTTCAAAAGAAGGAAAACCATTTGACATCACAGTACGTGCCTGTATATATTATGTA TAAACGAGCATTATACACTTTAACATGAATATGAGATTTTTCGCTTACTTTTGGTATAGCTGTGTGGGGGGAAAGATGGG ACACCTTTAGCACATAATATCCAAGCATCCTGACTATGTTTTAAATAAATAATAACAGTCTATTAGAGTCGTGGGAATAC GGTTATATATTTCTTTGAAGGTAAAATAAATGTTTTTTAACTTTGTTTAGGAAATCTTGACCAACGCAGTCATGAACGTC ATCAGCATCATAACTACCGATCAAAGGTTTGAATATGATGACGAGCATTTTCGACTACTTCANCANTTTTCACGAAATGT TAAGAGCCATTTTGCGTATATACTAGGTACAGAAATCATGCGAAGTTACATGATGGATTGAACTGTGCTGATG >GCiWno673_d07.b1 CHROMAT_FILE: GCiWno673_d07.b1 PHD_FILE: GCiWno673_d07.b1.phd.1 CHEM: term DYE: big TIME: Fri Nov 2 12:35:19 2001 TEMPLATE: GCiWno673_d07 DIRECTION: fwd Length = 873 Score = 63.2 bits (151), Expect = 5e-10 Identities = 30/30 (100%), Positives = 30/30 (100%) Frame = +1 Query: 1 EILTNAVMNVISIITTDQRFEYDDEHFRLL 30 EILTNAVMNVISIITTDQRFEYDDEHFRLL Sbjct: 691 EILTNAVMNVISIITTDQRFEYDDEHFRLL 780 >GCiWno520_h02.b1_4 XEVXGYEQKKFLSKGRKTILTSQVRAXYILCINEHYTL*HEYEIFRLLLI*LCGGRWDTF ST*YPSILTMF*INNNSLLESWGYGYIFL*R*NKYFLTLFRKS*PTQS*TSSAS*LPIKG LNMMTSISDYFSKFSRNGKMPFCV*YYGTEIHAKFT*WI*TVLSDYPEMFNLPFAGSWNP RTRLLSHR*FMFLCFAIYPLTEASIWK*NGT*KGFQVCKTLLNIYFKIKMLFFYILQPFF NKW*SNIEKPLTKITFATLLMLLFVKGRRAQTKVLRYDTL*LI*ANSPSFSVII >GCiWno520_h02.b1_5 RGXGL*TKEIPFKREENHFDITGTCLXYIMYKRALYTLT*I*DFSLTFDIAVWGKMGHL* HIISEHPDYVLNK**QSIRVVGIRLYISLKVK*IFFNFV*EILTNAVMNVISIITTDQRF EYDDEHFRLLQQIFTKW*DAILRIILWYRNTCEVHMMDLNCAE*LS*NV*PSVRRFLEPE NTAAFAQIIYVPLLCNIPPYRSKYLEVKRDMKRVSGL*NTSKYIF*NKNVIFLHSTAVFQ QMVEQHRKTFDKNNLRDFIDAFICEGKKGTDESFTV*YFVVNLSQFPFF*RNNX >GCiWno520_h02.b1_6 KRXRVMNKRNSFQKGGKPF*HHRYVPXIYYV*TSIIHFNMNMRFFAYF*YSCVGEDGTPL AHNIRAS*LCFK*IITVY*SRGDTVIYFFEGKINIF*LCLGNLDQRSHERHQHHNYRSKV *I**RAFPTTSANFHEMVRCHFAYNTMVQKYMRSSHDGFELC*VIILKCLTFRSQVPGTR EHGCFRTDNLCSFALQYTPLPKQVFGSKTGHEKGFRFVKHF*IYILK*KCYFFTFYSRFS TNGRAT*KNL*QK*PSRLY*CFYL*REEGHRRKFYGMILCS*FKPIPLLLA**X >LQW203042.y1 >lcl|LQW203042.y1 CHROMAT_FILE: LQW203042.y1 PHD_FILE: LQW203042.y1.phd.1 CHEM: term DYE: ET TIME: Wed Aug 22 12:21:14 2001 TEMPLATE: LQW203042 DIRECTION: rev TAGAAGGATCTCGATCAATTCAGCTCGGTACAAGGGTATGATATAGAATTACGGGAGAATATGGGACACCTTTACACATA GTATCCAAATACCATTATCATGATGTTTTAAACACTAAAACGACGCGGCCTGTATATGGGAGTCGCGACGATACGGTTTT ATAATAATTTTGAAAATAATTTTGAAAATAATCGTTCTTATTTGTTTACTACCACATGGGACGAGGAAATAGAACGAAAA TTGTCTCATCTTTCCCCAGCCTGTTATAGGCTACTTTATAACATCCTGGCTGTAAACAAGGGGGTTTTCATCTAAGTTTT GCAGAATAAATAAAACTATTAATTATAAACTGACCATAAACCCTCTATATTATAACTTGCCTATTGCGGCAAATATTTCT TCCCGAAGTTTTGTTTGCGCATCTGGGTCATGGATGAGACACAGCATCGTCCATCTGACAGTTCCTGTCATTGTTTCTGT TCCAGCTTCGAACATTTCACGCACATATTGTACAAGCTGCCCATCCTGAGAAGAGCCACCGAACGAAAACATGTCTTATA TAGATAGAGATGCTCAAATGTGATCTAAGGTATCCCAAAGTTTCAAAAAAGCAGCAGCCTTCTTACCGATCGGGTAAATG CCAGGAAAAACATTTCAATTTTTCGACTGAACGAAGCTGTGTACACGTTTATCACTTTCTGACATACCTGCTGACGCAAA ATTTCCGCGCCGATTCAAATCAATTACATCATCTAAAAATTAAACATCATATGTCTTACTGATACCACGTGAGAAAGCGG CGTCTATCCAAAGATTCCGAATTAAAACCATTGCCCGCTAGTTTCGTAAACCTTCCGTAACAACCATCGTAGGTGTCCAA GAATCTTGGAGATCGAAAAGAACATTCATTAGCAAAGCGTAAACAGTTCCATTGGCCTCAAAAAACCACTCGTAGC >LQW203042.y1_4 YEWFFEANGTVYALLMNVLFDLQDSWTPTMVVTEGLRN*RAMVLIRNLWIDAAFSRGISK TYDV*FLDDVIDLNRRGNFASAGMSESDKRVHSFVQSKN*NVFPGIYPIGKKAAAFLKLW DTLDHI*ASLSI*DMFSFGGSSQDGQLVQYVREMFEAGTETMTGTVRWTMLCLIHDPDAQ TKLREEIFAAIGKL*YRGFMVSL*LIVLFILQNLDENPLVYSQDVIK*PITGWGKMRQFS FYFLVPCGSKQIRTIIFKIIFKIIIKPYRRDSHIQAASF*CLKHHDNGIWILCVKVSHIL P*FYIIPLYRAELIEILL >LQW203042.y1_5 LRVVF*GQWNCLRFANECSFRSPRFLDTYDGCYGRFTKLAGNGFNSESLDRRRFLTWYQ* DI*CLIFR*CN*FESARKFCVSRYVRK**TCTQLRSVEKLKCFSWHLPDR*EGCCFFETL GYLRSHLSISIYIRHVFVRWLFSGWAACTICA*NVRSWNRNNDRNCQMDDAVSHP*PRCA NKTSGRNICRNRQVII*RVYGQFIINSFIYSAKLR*KPPCLQPGCYKVAYNRLGKDETIF VLFPRPMW**TNKNDYFQNYFQNYYKTVSSRLPYTGRVVLVFKTS**WYLDTMCKGVPYS PVILYHTLVPS*IDRDPSX >LQW203042.y1_6 ATSGFLRPMELFTLC**MFFSISKILGHLRWLLRKVYETSGQWF*FGIFG*TPLSHVVSV RHMMFNF*MM*LI*IGAEILRQQVCQKVINVYTASFSRKIEMFFLAFTRSVRRLLLF*NF GIP*ITFEHLYLYKTCFRSVALLRMGSLYNMCVKCSKLEQKQ*QELSDGRCCVSSMTQMR KQNFGKKYLPQ*ASYNIEGLWSVYN**FYLFCKT*MKTPLFTARML*SSL*QAGER*DNF RSISSSHVVVNK*ERLFSKLFSKLL*NRIVATPIYRPRRFSV*NIMIMVFGYYV*RCPIF SRNSISYPCTELN*SRSFX DEV18495.x1 CHEM: term DYE: ET TIME: Wed Mar 21 15:33:43 2001 TEMPLATE: DEV18495 DIRECTION: fwd Length = 554 Score = 78.0 bits (189), Expect = 3e-14 Identities = 43/70 (61%), Positives = 54/70 (76%), Gaps = 3/70 (4%) Frame = -1 Query: 54 FSRGISKTYDV*FLDDVIDL---NRRGNFASAGMSESDKRVHSFVQSKN*NVFPGIYPIG 110 ++ GISKT + + +I + N +FASAG+S+ +KR+HSFVQSKN*NVFPGIYPI Sbjct: 350 YTVGISKT*KMF*IKKMIKIKFENCAKSFASAGISKINKRLHSFVQSKN*NVFPGIYPIF 171 Query: 111 KKAAAFLKLW 120 KKAAAFLKLW Sbjct: 170 KKAAAFLKLW 141 >DEV18495.x1_4 SVIIGYV*LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLETNAPNKVVVLSAGQSF CIEATALLHSGYQ*DIKNVLNKKNDKN*I*KLREKFCVSRYFKN**TFTQLRSVEKLKCF SWHLPDL*KGCCFFETLGYLRSHLSISIYIKHVFVRWLFSGWAACTICA*NVRSWNRNND RNCQ >DEV18495.x1_5 *RNNRIRVTTLSSRK*QRQT*FLFRIHLLHEISLG*IRKSVLP*NKCAEQSGGVKCRTIF LYRSDRFTTQWVSVRHKKCFK*KK**KLNLKTARKVLRQQVFQKLINVYTASFSRKIEMF FLAFTRSLKRLLLF*NFGIP*ITFEHLYLYKTCFRSVALLRMGSLYNMCVKCSKLEQKQ* QELSX >DEV18495.x1_6 LA***DTCNYVILA*IATTNVISVSYTPPTRNFAGLDP*KCLTLKQMRRTKWWC*VPDNL FV*KRPLYYTVGISKT*KMF*IKKMIKIKFENCAKSFASAGISKINKRLHSFVQSKN*NV FPGIYPIFKKAAAFLKLWDTLDHI*ASLSI*NMFSFGGSSQDGQLVQYVREMFEAGTETM TGTVX Next walk >rciht025j24 Length = 708 Score = 114 bits (283), Expect = 1e-25 Identities = 57/60 (95%), Positives = 58/60 (96%), Gaps = 1/60 (1%) Frame = -3 Query: 1 SVIIGYV-LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLETNAPNKVVVLSAGQSF 59 SVII YV LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYL+TNAPNKVVVLSAGQSF Sbjct: 226 SVIIEYV*LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLKTNAPNKVVVLSAGQSF 47 >rciht025j24_4 LQQIFTKW*DAILRIILWYRNTCEVHMMDLNCAE*LS*NV*PSVRRFLEPENTAAFAQII YVPLLCNIPPYRSKYLEVKRDMKRVSGM*NTSKYIF*NKNVIFLHSTAVFQQMVEQHRKT FDKNNLRDFIDAFICEGKKGTDESFTV*YFVVNLSQFPFLLA***NTCNYVILA*IATTN VISVSYTPPTRNFAGLDP*KCLTLKQMRRTKWWC*VPDNLFV*KRPLYYTVGISKT >rciht025j24_5 SANFHEMVRCHFAYNTMVQKYMRSSHDGFELC*VIILKCLTFRSQVPGTREHGCFRTDNL CSFALQYTPLPKQVFGSKTGHEKGFRYVKHF*IYILK*KCYFFTFYSRFSTNGRAT*KNL *QK*PSRLY*CFYL*REEGHRRKFYGMILCS*FKPIPLSFSVIIEYV*LRYPRVNSNDKR DFCFVYTSYTKFRWVRSVKVSYLKTNAPNKVVVLSAGQSFCIEATALLHSGYQ*DX >rciht025j24_6 FSKFSRNGKMPFCV*YYGTEIHAKFT*WI*TVLSDYPEMFNLPFAGSWNPRTRLLSHR*F MFLCFAIYPLTEASIWK*NGT*KGFQVCKTLLNIYFKIKMLFFYILQPFFNKW*SNIEKP LTKITFATLLMLLFVKGRRAQTKVLRYDTL*LI*ANSPFF*RNNRIRVTTLSSRK*QRQT *FLFRIHLLHEISLG*IRKSVLP*NKCAEQSGGVKCRTIFLYRSDRFTTQWVSVRX >rciht035c13 Length = 727 Score = 114 bits (283), Expect = 1e-25 Identities = 57/60 (95%), Positives = 58/60 (96%), Gaps = 1/60 (1%) Frame = -3 Query: 1 SVIIGYV-LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLETNAPNKVVVLSAGQSF 59 SVII YV LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYL+TNAPNKVVVLSAGQSF Sbjct: 212 SVIIEYV*LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLKTNAPNKVVVLSAGQSF 33 >rciht035c13_4 GLNMMTSISDYFSKFSRNGKMPFCV*YYGTEIHAKFT*WI*TVLSDYPEMFNLPFAGSWN PRTRLLSHR*FMFLCFAIYPLTEASIWK*NGT*KGFQVCKTLLNIYFKIKMLFFYILQPF FNKW*SNIEKPLTKITFATLLMLLFVKGRRAQTKVLRYDTL*LI*ANSPFF*RNNRIRVT TLSSRK*QRQT*FLFRIHLLHEISLG*IRKSVLP*NKCAEQSGGVKCRTIFLYRSDRFTT PV >rciht035c13_5 GFEYDDEHFRLLQQIFTKW*DAILRIILWYRNTCEVHMMDLNCAE*LS*NV*PSVRRFLE PENTAAFAQIIYVPLLCNIPPYRSKYLEVKRDMKRVSGM*NTSKYIF*NKNVIFLHSTAV FQQMVEQHRKTFDKNNLRDFIDAFICEGKKGTDESFTV*YFVVNLSQFPFLLA***NTCN YVILA*IATTNVISVSYTPPTRNFAGLDP*KCLTLKQMRRTKWWC*VPDNLFV*KRPLYY TSX >rciht035c13_6 V*I**RAFPTTSANFHEMVRCHFAYNTMVQKYMRSSHDGFELC*VIILKCLTFRSQVPGT REHGCFRTDNLCSFALQYTPLPKQVFGSKTGHEKGFRYVKHF*IYILK*KCYFFTFYSRF STNGRAT*KNL*QK*PSRLY*CFYL*REEGHRRKFYGMILCS*FKPIPLSFSVIIEYV*L RYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLKTNAPNKVVVLSAGQSFCIEATALLH QX >ciht025j24 Length = 671 Score = 98.7 bits (242), Expect = 1e-20 Identities = 48/51 (94%), Positives = 49/51 (95%), Gaps = 1/51 (1%) Frame = +2 Query: 1 SVIIGYV-LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLETNAPNKV 50 SVII YV LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYL+TNAPNKV Sbjct: 518 SVIIEYV*LRYPRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLKTNAPNKV 670 >LQW163322.x1 >lcl|LQW163322.x1 CHROMAT_FILE: LQW163322.x1 PHD_FILE: LQW163322.x1.phd.1 CHEM: term DYE: ET TIME: Wed Aug 22 11:17:42 2001 TEMPLATE: LQW163322 DIRECTION: fwd AACGGCAGTGCAGCTTTTGTATAGAAGCGACCGCTTTACTACACAGTGGGTATCAGTAAGACATAAAAAATGTTTTAAAT AAAAAAAATGATAAAAATTAAATTTGAAAACTGCGCGAAAAGTTTTGCGTCAGCAGGTATTTCAAAAATTAATAAACGTT TACACAGCTTCGTTCAGTCGAAAAATTGAAATGTTTTTCCTGGCATTTACCCGATCTTTAAAAAGGCTGCTGCTTTTTTG AAACTTTGGGATACCTTAGATCACATTTGAGCATCTCTATCTATATAAAACATGTTTTCGTTCGGTGGCTCTTCTCAGGA TGGGCAGCTTGTACAATATGTGCGTGAAATGTTCGAAGCTGGAACAGAAACAATGACAGGAACTGTCAGATGGGCGATGC TGTGTCTCATCCATTACCCAGATGGGCAAAAAAAACTTCGGGAAGAAATATTTGAAGCAATAGGCAAGTTATAATATAGA GGGTTTATGGTCAGTTTATAATTAATAGTTTTATTTATTCTGCAAAACTTAGATTAAACCCCCTTGTTTACAACCAGGAT GTTATAAAGTATAACAGGCTGGGGAAAGATGGGACAATTTTCGTTATATTTCCTCGTCCCATGTGGTAGTAAACAAATAA GAACGTTCAAAGAATTATTATAAAACCGTATCGTCGCGACTCCCATATACAGGCCGCGTCGATATTGTTTAAAACATCAT GATAATGGTACTTGGATACTATGTGCTAAAGGTGTCCCATATTCCCCCGTAATTCTATATCATACCCTTTTACAGATTTA TCATTTTACATTGTATTACTGTTGATCTATACATAAGTGGGATACGTAGGCACTGATCCAATTCGGGACATGTGTACCCA TCCCAGCTATCAAATGTAAAATCGGTAAAATTGGACTTCTGGCTTCCACGGAACACTCGAAAGGATACACAGATATTACA CGGCTTCAAAGTAGCGCATCAAATAAACCTATGACAGCTCACATGTATTAAGAGACGCAACTTA >GCiWno379_j24.b1_4 XL*LX*ANFPFF*RNNRIRVXTLSSRK*QRQT*FLFRIHLLHEISLG*IRKSILP*NKCA EQSGGVKCRTIFLYRSDRFTTQWVSVRHKKMF*IKKMIKIKFENCAKSFASAGISKINKR LHSFVQSKN*NVFPGIYPIFKKAAAFLKLWDTLDHI*ASLSI*NMFSFGGSSQDGQLVQY VREMFEAGTETMTGTVRWAMLCLIHYPDAQTKLREEIFAAIGKL*YRGFMVSL*LIVLFI LQNLDENPLVYNQDVIKYIRLGKDRTMFVRCFS >GCiWno379_j24.b1_5 FVVNXSQFPFLLA***NTCXYVIFA*IATTNVISVSYTPPTRNFAGLDP*KYLTLKQMRR TKWWC*VPDNLFV*KRPLYYTVGISKT*KNVLNKKNDKN*I*KLR*KFCVSRYFKN**TF TQLRSVEKLKCFSWHLPDL*KGCCFFETLGYLRSHLSISIYIKHVFVRWLFSGWAACTIC A*NVRSWNRNNDRNCQMGDAVSHPLPRCANKTSGRNICRNRQVII*RVYGQFIINSFIYS AKLR*KPPCLQPGCYKVYQAGQR*DHVCSLF*X >GCiWno379_j24.b1_6 FCS*XKPISLSFSVIIEYVXLRYLRVNSNDKRDFCFVYTSYTKFRWVRSVKVSYLKTNAP NKVVVLSAGQSFCIEATALLHSGYQ*DIKKCFK*KK**KLNLKTALKVLRQQVFQKLINV YTASFSRKIEMFFLAFTRSLKRLLLF*NFGIP*ITFEHLYLYKTCFRSVALLRMGSLYNM CVKCSKLEQKQ*QELSDGRCCVSSITQMRKQNFGKKYLPQ*ASYNIEGLWSVYN**FYLF CKT*MKTPLFTTRML*SISGWAKIGPCLFVVLV >GCiWno973_a07.b1_4 F*SVKDAVL*FCKYTCXYTKWEKKIMK*WHLTLRYFILTYAALFYKGXGPPLSSFIYINI LFDAERDVAHN*MW*LILGNNNRPSISDRKAMPFTSAFIQEVFRFRTRVPLGVQHMTTET VNFANYVIPKGTTVLL*GIL*CRCNIVYSILYFCEMLFRYFRFLPICGRCTTILMCGTNQ ASLNLSVTSMTKETLFSLNT*YLSRWVHVIAWENNLLEWKSSSSWFQWFRSLSFFRIRTN QIFLRLMTELRATVSFHTRSTLLLVRFNLM >GCiWno973_a07.b1_5 VLECERCGFIVL*IYLXIYQMGKENHEIVASYPALLHINLRGSILQRXRSASKLVYLY*H PF*R*TGCCT*LNVVTYFR*QQPPEHKRSQSNAIHLSFYTRGVSISHSRSFGRPAHDNRN RKLRKLCYSKRNHGTVIGYIVMSL*YSI*YFVFL*NVVSIF*ILANMWAVHNDPDVWDEP SKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMEIFIFLVSMVQKFEFLPDPNE PDLPEINDGVKGNGFFPYSFNFVASEI*SNX >GCiWno973_a07.b1_6 FRV*KMRFYSSVNILVXIPNGKRKS*NSGILPCATSY*PTRLYSTKV*VRL*ARLFILTS FLTLNGMLHITKCGDLF*VTTTARA*AIAKQCHSPQLLYKRCFDFALAFLWASST*QQKP *TSQTMLFQKEPRYCYRVYCNVVVI*YIVFCIFVKCCFDILDSCQYVGGAQRS*CVGRTK QV*T*ASPR*QRKLCSV*TRDTFLGGSTSLLGRTTCSNGNLHLPGFNGSEV*VSSGSERT RSS*D**RS*GQRFLSILVQLCC**DLI*X Walk 1 >GCiWno1017_g22.g1 CHROMAT_FILE: GCiWno1017_g22.g1 PHD_FILE: GCiWno1017_g22.g1.phd.1 CHEM: term DYE: big TIME: Thu Dec 27 11:52:37 2001 TEMPLATE: GCiWno1017_g22 DIRECTION: rev Length = 905 Score = 208 bits (523), Expect = 6e-53 Identities = 107/118 (90%), Positives = 110/118 (92%), Gaps = 4/118 (3%) Frame = -3 Query: 50 SVKDAVL-FCKYTCXYTKWEKKIMK-WHLTLRYFILTYAALFYKGXGPPLSSFIYINILF 107 SV+DAVL FCKYT YTKWEKKIMK WHLTLRYFILT AALFYKG GPPLSSFIYINILF Sbjct: 492 SVEDAVL*FCKYTLLYTKWEKKIMK*WHLTLRYFILTNAALFYKGLGPPLSSFIYINILF 313 Query: 108 DAERDVAHN-MW-LILGNNNRPSISDRKAMPFTSAFIQEVFRFRTRVPLGVQHMTTET 163 DAERDVAHN MW LILGNNNRPSISDRK+MPFTSAFIQEVFRFRTRVPLGVQHMTT+T Sbjct: 312 DAERDVAHN*MW*LILGNNNRPSISDRKSMPFTSAFIQEVFRFRTRVPLGVQHMTTQT 139 >GCiWno1017_g22.g1_4 NTPCCKPPWPRDVIKGITRGGEKDGNNCSVIISSFPMWVVKQIRKVQKDYV*NPYGRRLP IYRPASLLFKTS**WYLDTRX*RCPIFPPLILYHTLLQIYHFNIFIHVVLLD*MGYR*AR ESHIS*SWFKQLTTLF*SVEDAVL*FCKYTLLYTKWEKKIMK*WHLTLRYFILTNAALFY KGLGPPLSSFIYINILFDAERDVAHN*MW*LILGNNNRPSISDRKSMPFTSAFIQEVFRF RTRVPLGVQHMTTQTVNFANYVIPKGTTVLL*GIL*CRCNIVYSILYVCEMLFR*VPSSN R >GCiWno1017_g22.g1_5 KHPML*TPLAQGCYKRYNQGWGKGWEQLFGYNFLVPHVGSKTNKEGSEGLCIKPVWSATP HIQARVVIV*NIMIMVFGY*VXKVSHIPPVNSISYPFTDISL*YFYTCSTIRLDGIPLGT *ITYFVIVV*TINNAVLEC*RCGFIIL*IYFVIYQMGKENHEIVASYPALLHINQRGSIL QRFRSASKLVYLY*HPF*R*TGCCT*LNVVTYFR*QQPPEHKRSQINAIHLSFYTRGVSI SHSRSFGRPTHDNTNRKLRKLCYSKRNHGTVIGYIVMSL*YSI*YFVCL*NVVSIGTELE SX >GCiWno1017_g22.g1_6 QTPHVVNPPGPGML*KV*PGVGKRMGTIVRL*FPRSPCG**NK*GRFRRIMYKTRMVGDS PYTGPRRYCLKHHDNGIWILGXKGVPYSPR*FYIIPFYRYITLIFLYM*YY*IRWDTVRH VNHIFRDRGLNN*QRCFRVLKMRFYNSVNILCYIPNGKRKS*NSGILPCATSY*PTRLYS TKV*VRL*ARLFILTSFLTLNGMLHITKCGDLF*VTTTARA*AIANQCHSPQLLYKRCFD FALAFLWASNT*QHKP*TSQTMLFQKEPRYCYRVYCNVVVI*YIVFCMFVKCCFDRYRAR IX LQW194191.y2 LQW194191.y2.phd.1 LQW194191.x1 14:34:39 2001 TEMPLATE: LQW194191 DIRECTION: rev Length = 916 Score = 65.2 bits (156), Expect(2) = 2e-18 Identities = 30/34 (88%), Positives = 33/34 (96%) Frame = +2 Query: 27 FPPLILYHTLLQIYHFNIFIHVVLLD*MGYR*AR 60 + P+ILYHTLLQIYHFNIFIHVVLLD*+GYR*AR Sbjct: 437 YSPVILYHTLLQIYHFNIFIHVVLLD*VGYR*AR 538 Score = 46.1 bits (107), Expect(2) = 2e-18 Identities = 22/24 (91%), Positives = 22/24 (91%) Frame = +1 Query: 5 ASLLFKTS**WYLDTRX*RCPIFP 28 ASLLFKTS**WYLDT *RCPIFP Sbjct: 373 ASLLFKTS**WYLDTMC*RCPIFP 444 >LQW194191.y2_1 XTLSXYHYVRECSKLDRHMTGLSDGRCCVSSITRWQKNSGRTI*SNRQVII*RVYGQFII NSFIYSAKLRLNPLVYNQDVIKYNRLGKDGTIFVIFPRPMW**TNKNVQRIIIKPYRRDS HIQAASLLFKTS**WYLDTMC*RCPIFPRNSISYPFTDISL*YFYTCSTIRLGGIPLGT* ITYFVIVV*TITNAVSEC*RCGYIVLYIYFVIYPMGQDTHEIVASYVSYLYEQRGLSTSL RRSKRY**HL*VTDVLLLAFSNTRVQ*NAPVSRLSSSGP*TG*ISSVS*SIKFSKSGPEL LSRCFX >LQW194191.y2_2 VHSAXTTMCVNVRSWTDT*QDCQMGDAVSHPLPDGKKIREELFEAI GKL*YRGFMVSL*L IVLFILQNLD*TPLFTTRML*SITGWGKMGQFSLYFLVPCGSKQIRTFKELL*NRIVATP IYRPRRYCLKHHDNGIWILCAKGVPYSPVILYHTLLQIYHFNIFIHVVLLD*VGYR*ARE SHIS*SWFKQLPTLFQSVEDAVI*FCTYTLLYTQWDKTLMK*WHLT*ATYMNNAVYLHRY VDLSVINNIFE*RMCYSWLLVTREYNETLPFQGCPRLGHEQDEFQASVDQSSSANQDLNC YLVAL >LQW194191.y2_3 YTQPVPLCA*MFEAGQTHDRTVRWAMLCLIHYQMAKKFGKNYLKQ*ASYNIEGLWSVYN* *FYLFCKT*IKPPCLQPGCYKV*QAGERWDNFRYISSSHVVVNK*ERSKNYYKTVSSRLP YTGRVVIV*NIMIMVFGYYVLKVSHIPP*FYIIPFYRYITLIFLYM*YY*IRWDTVRHVN HIFRDRGLNNYQRCFRVLKMRLYSSVHILCYIPNGTRHS*NSGILRELLI*TTRSIYIVT SI*ALLITSLSNGCATLGF**HASTMKRSRFKVVLVWAMNRMNFKRQLINQVQQIRT*IA ISLL YVREMFEAGQTHDRT-VRWAMLCLIHYQMAKKKIREELFEAI YVREMFEAGTETMTGXVRWAMLCLIHYPDAQKKLREEIFEAI GCiWno874_b07.g1 Best hit LQW163322.x1 LQW163322.x1.phd.1 LQW163322.y1 11:17:42 2001 TEMPLATE: LQW163322 DIRECTION: fwd Length = 1024 Score = 63.6 bits (152), Expect = 2e-10 Identities = 31/42 (73%), Positives = 35/42 (82%), Gaps = 1/42 (2%) Frame = +1 Query: 7 YVREMFEAG-QTHDRTVRWAMLCLIHYQMAKKKIREELFEAI 47 YVREMFEAG +T TVRWAMLCLIHY +KK+REE+FEAI Sbjct: 337 YVREMFEAGTETMTGTVRWAMLCLIHYPDGQKKLREEIFEAI 462 5 diffs with seq 209 >GCiWno874_b07.g1 CHROMAT_FILE: GCiWno874_b07.g1 PHD_FILE: GCiWno874_b07.g1.phd.1 CHEM: term DYE: big TIME: Mon Dec 10 12:34:53 2001 TEMPLATE: GCiWno874_b07 DIRECTION: rev Length = 977 Score = 173 bits (433), Expect = 1e-42 Identities = 98/115 (85%), Positives = 102/115 (88%), Gaps = 6/115 (5%) Frame = -3 This is seq 209 Query: 1 YVREMFEAG-QTHDRTVRWAMLCLIHYQMAKKKIREELFEAIGKL-YRGFMVSL-LIVLF 57 YVREMFEAG +T VRWAMLCLIHY A+KK+REE+FEAIGKL YRGFMVSL LIVLF Sbjct: 627 YVREMFEAGTETMTGXVRWAMLCLIHYPDAQKKLREEIFEAIGKL*YRGFMVSL*LIVLF 448 Query: 58 ILQNLD-TPLFTTRML-SITGWGKMGQFSLYFLVPCGSKQIRTFKELL-NRIVAT 109 ILQNLD TPLFTTRML SITGWGKMGQFSLYFLVPCGSKQIRTFKELL N IVAT Sbjct: 447 ILQNLD*TPLFTTRML*SITGWGKMGQFSLYFLVPCGSKQIRTFKELL*NHIVAT 283 >sequence 10, 11, 22 28 accessions 45% to 2C8 might be two closely related genes this seq from NLWA is almost identical to seq 2 (may be part of a gene cluster) YFFIVLKYCTDVLINLLSADVFQELVDQHRISFDKNNIRDFIDAFICEVDNGKDDSFT (0) DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRNEILDVI 508 GNNNRPSISDRKSMPFTSAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTIVA NLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFEF LPDPNEPDLPEIDDGVIGNGFFPYPFNFVANEI* DEV12046.y1 I-HELIX FID not FTD in seq 11 DEV16908.x1 DEV3110.y1 DEV36061.y1 DEV39761.y1 DEV40662.x1 FID not FTD one more diff DEV49690.x1 I-HELIX * LQW134849.x01 LQW134849.x1 LQW141933.y1 LQW172858.y1 LQW197304.x1 LQW210937.y1 LQW214724.x2 LQW214724.y1 I-HELIX * OPPOSITE END OF SEQ 11 LQW238538.x1 I-HELIX * OPPOSITE END OF SEQ 22 LQW238538.y1 LQW251009.x1 FID not FTD LQW254973.x1 LQW254973.y1 LQW271884.y1 LQW29511.x1 FID not FTD one more diff I-HELIX LQW29511.y1 LQW51603.x1 I-HELIX * OPPOSITE END OF SEQ 11 LQW51603.y1 LQW63108.y1 FID not FTD one more diff I-HELIX LQW63573.x1 I-HELIX * LQW77027.y1 GCiWno973_a07.b1 CHROMAT_FILE: GCiWno973_a07.b1 PHD_FILE: GCiWno973_a07.b1.phd.1 CHEM: term DYE: big TIME: Tue Dec 25 11:42:59 2001 TEMPLATE: GCiWno973_a07 DIRECTION: fwd Length = 811 Score = 150 bits (374), Expect(2) = 3e-56 Identities = 67/69 (97%), Positives = 69/69 (99%) Frame = -1 Query: 166 NLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMKIFIFL 225 N+WAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARM+IFIFL Sbjct: 316 NMWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMEIFIFL 137 Query: 226 VSMVQKFEF 234 VSMVQKFEF Sbjct: 136 VSMVQKFEF 110 Score = 91.3 bits (223), Expect(2) = 3e-56 Identities = 45/58 (77%), Positives = 46/58 (78%) Frame = -2 Query: 102 ILGNNNRPSISDRKSMPFTSAFIQEXXXXXXXXXXGVPHKTTETVNFANYVIPKGTTV 159 ILGNNNRPSISDRK+MPFTSAFIQE GV H TTETVNFANYVIPKGTTV Sbjct: 582 ILGNNNRPSISDRKAMPFTSAFIQEVFRFRTRVPLGVQHMTTETVNFANYVIPKGTTV 409 >GCiWno809_c05.g1 CHROMAT_FILE: GCiWno809_c05.g1 PHD_FILE: GCiWno809_c05.g1.phd.1 CHEM: term DYE: big TIME: Wed Dec 5 19:20:32 2001 TEMPLATE: GCiWno809_c05 DIRECTION: rev Length = 890 Score = 148 bits (369), Expect(2) = 3e-55 Identities = 66/69 (95%), Positives = 68/69 (97%) Frame = +1 Query: 166 NLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKHVIPFSVGPRHCLGEQLARMKIFIFL 225 N+WAVHNDPDVWDEPSKFKPERHLDDKGNFVQS HVIPFSVGPRHCLGEQLARM+IFIFL Sbjct: 403 NMWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSNHVIPFSVGPRHCLGEQLARMEIFIFL 582 Query: 226 VSMVQKFEF 234 VSMVQKFEF Sbjct: 583 VSMVQKFEF 609 Score = 90.1 bits (220), Expect(2) = 3e-55 Identities = 45/58 (77%), Positives = 46/58 (78%) Frame = +2 Query: 102 ILGNNNRPSISDRKSMPFTSAFIQEXXXXXXXXXXGVPHKTTETVNFANYVIPKGTTV 159 ILGNNNRPSISDRK+MPFTSAFIQE GV H TTETVNFANYVIPKGTTV Sbjct: 137 ILGNNNRPSISDRKAMPFTSAFIQEVFRFRTRVP*GVQHMTTETVNFANYVIPKGTTV 310 >cinc016b21 Length = 698 Score = 197 bits (496), Expect = 1e-49 Identities = 100/121 (82%), Positives = 102/121 (83%), Gaps = 3/121 (2%) Frame = +2 Query: 59 DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRNEIL---GNNNRPSISDRK 115 DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRNEIL GNNNRPSISDRK Sbjct: 347 DRQLVQYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRNEILDVIGNNNRPSISDRK 526 Query: 116 SMPFTSAFIQEXXXXXXXXXXGVPHKTTETVNFANYVIPKGTTVSFIAMWNLWAVHNDPD 175 SMPFTSAFIQE GVPHKTTETVNFANYVIPKGTT+ + NLWAVH DPD Sbjct: 527 SMPFTSAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTI----VANLWAVHXDPD 694 Query: 176 V 176 V Sbjct: 695 V 697 Score = 85.0 bits (207), Expect = 1e-15 Identities = 39/40 (97%), Positives = 39/40 (97%) Frame = +1 Query: 19 ADVFQELVDQHRISFDKNNIRDFTDAFICEVDNGKDDSFT 58 ADVFQELVDQHRISFDKNNIRDF DAFICEVDNGKDDSFT Sbjct: 88 ADVFQELVDQHRISFDKNNIRDFIDAFICEVDNGKDDSFT 207 >cinc016b21 CGGCACGAGGCCTCGTGCCCTTTGCTTTGCAATATACCCCCTTACCGAAGCAAGTATTTGGAAGTAAAGCGGGACATGAT GCAAATGGCCGACGTTTTTCAAGAACTGGTAGACCAACACAGAATATCTTTCGACAAAAATAACATTCGAGACTTTATTG ATGCTTTCATTTGCGAGGTTGACAATGGAAAAGACGACAGTTTTACAGTAATAATAACTTATACAAATGATAAGTAAAAG GTTCATTTAATATAGGCTATAGCAAAAACAGTGTATTGCGACACGAAGTGTAAGCTAACTCGAACCAAAATGTACAACTT TCGTTAATATATCTCTCAAATCAAAGGATCGACAGCTTGTACAGTACCTGCGGGAAATATTTGAAGCTGGAACGCAAACC ACCACCGGTACACTTAATTGGGCAATACTATGTCTCATCCATTATCCACAAGCACAAAAGAAACTGAGAAATGAAATTCT TGACGTCATTGGTAACAACAACCGCCCGAGCATAAGCGATCGCAAATCAATGCCATTCACCTCAGCATTTATACAAGAGG TGTTTCGATTTCGTACACGTGTCCGAACAGGCGTTCCGCATAAAACAACAGAAACTGTAAACTTCGCAAACTATGTAATT CCTAAGGGAACAACGATCGTAGCCAATCTATGGGCGGTGCACNACGATCCTGATGTGT >cinc016b21_1 RHEASCPLLCNIPPYRSKYLEVKRDMMQMADVFQELVDQHRISFDKNNIRDFIDAFICEV DNGKDDSFTVIITYTNDK*KVHLI*AIAKTVYCDTKCKLTRTKMYNFR*YISQIKGSTAC TVPAGNI*SWNANHHRYT*LGNTMSHPLSTSTKETEK*NS*RHW*QQPPEHKRSQINAIH LSIYTRGVSISYTCPNRRSA*NNRNCKLRKLCNS*GNNDRSQSMGGAXRS*CV >cinc016b21_2 GTRPRALCFAIYPLTEASIWK*SGT*CKWPTFFKNW*TNTEYLSTKITFETLLMLSFARL TMEKTTVLQ***LIQMISKRFI*YRL*QKQCIATRSVS*LEPKCTTFVNISLKSKDRQLV QYLREIFEAGTQTTTGTLNWAILCLIHYPQAQKKLRNEILDVIGNNNRPSISDRKSMPFT SAFIQEVFRFRTRVRTGVPHKTTETVNFANYVIPKGTTIVANLWAVHXDPDVX >cinc016b21_3 ARGLVPFALQYTPLPKQVFGSKAGHDANGRRFSRTGRPTQNIFRQK*HSRLY*CFHLRG* QWKRRQFYSNNNLYK**VKGSFNIGYSKNSVLRHEV*ANSNQNVQLSLIYLSNQRIDSLY STCGKYLKLERKPPPVHLIGQYYVSSIIHKHKRN*EMKFLTSLVTTTARA*AIANQCHSP QHLYKRCFDFVHVSEQAFRIKQQKL*TSQTM*FLREQRS*PIYGRCTTILMC >cibd036p10 Length = 599 Score = 193 bits (485), Expect = 2e-48 Identities = 91/98 (92%), Positives = 93/98 (94%) Frame = +1 Query: 137 GVPHKTTETVNFANYVIPKGTTVSFIAMWNLWAVHNDPDVWDEPSKFKPERHLDDKGNFV 196 GVPHKTTETVNFANYVIPKGTT+ + NLWAVHNDPDVWDEPSKFKPERHLDDKGNFV Sbjct: 1 GVPHKTTETVNFANYVIPKGTTI----VANLWAVHNDPDVWDEPSKFKPERHLDDKGNFV 168 Query: 197 QSKHVIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFEF 234 QSKHVIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFEF Sbjct: 169 QSKHVIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFEF 282 >cibd036p10 GGCGTTCCGCATAAAACAACAGAAACTGTAAACTTCGCAAACTATGTAATTCCTAAGGGAACAACGATCGTAGCCAATCT ATGGGCGGTGCACAACGATCCTGATGTGTGGGACGAACCAAGCAAGTTTAAACCTGAGCGTCACCTCGATGACAAAGGAA ACTTTGTTCAGTCTAAACACGTGATACCTTTCTCGGTGGGTCCACGTCATTGCTTGGGAGAACAACTTGCTCGAATGAAA ATCTTCATCTTCCTGGTTTCAATGGTTCAGAAGTTTGAGTTTCTTCCGGATCCGAACGAACCAGATCTTCCTGAGATTGA TGACGGAGTTATCGGGAACGGTTTCTTTCCATATCCGTTTAACTTTGTTGCCAATGAGATTTAACACCAACACCTATGTC AGTAGCGCTTAATGCTTGATATATTTACTCTATACTTTTTGCTTTGCAAGTATATCTAGGGTTGTGACTTTCCATAAAAT GATCGAAGATAATTTACAAAGTCTCAACAAATATAAAAAGTTATTTTTTATACGTATACCTAACTTCGGCATTATATCGT ACATCCAGCATGCTTCATATCTCACTGTTCATATTGTAG GVPHKTTETVNFANYVIPKGTTIVANLWAVHNDPDVWDEPSKFKPERHLDDKGNFVQSKH VIPFSVGPRHCLGEQLARMKIFIFLVSMVQKFEF LPDPNEPDLPEIDDGVIGNGFFPYPFNFVANEI* >sequence 5, 12, 194 DEQLHHYTRDLLVAAINTTT AAFGWCVVSLLRHPECQDKIWQEVDKVIGDETPSCKHQENMPYMRSFIQEIHRH RSIATLNLPHRTVRSVRLCGYDIPKDTPVLTNIWRVHNNQNVWKNPH EFRPDRHLDSNGNFVSSNNVIPFACGYRRCLGEQIAKAEIFLFIVNIVKRF HLVVDQVTGPPTLEKDPKGAVNTPAPFRFSVRARK LQW173019.y2 LQW18289.x1 exxr to pkg DEV16259.x1 I-HELIX DEV52712.y1 I-HELIX 2 DIFFS LQW229033.y1 I-helix LQW1044.x2 I-helix GCiWno59_m08.g1 GCiWno329_m07.b1 GCiWno150_b22.g1 >GCiWno150_b22.g1 CCATATACCCCCACTACTATTTAGTAGGGCGGTGGAAAACGGGCACCTTTAACCATACTATCTAAATATCCTAGTCGCGT TTTAAACAAATAACAATGGCTTATGTATAAGTCGTAAGTAGGCCTATATACGGTTTATAACTCTATGAATGTTTTTTGTT TACTACCAAATGGGACGAGAAAATAGAATAAAAAGTGTCCCATCTTCCCCCACCCTTACTGTATAAAAAAAAGTTGCCAT GATAGGTTAAGCAAAAAGAGGTAATAATGCTATGTAGCGAAATAGAACATCAAGCGAATATTTTATGGTAAATGCCGTTA CGTTATTAAGTGTTTGGTCCTGGACGCTGTTCTATAACGGCCAATCGCTACTGACCAGACTGCTGCGCTATAGGCTTCCT TTATTAATAATTTATATTCACGTAAACAAGCATATAGTAGGGTGGGGGAAGGCGGGACACCTTTTCATTCCATTTTCTCC TCTCATTTGGTTTTAAACAAAGAACATTCAAAGAATTATAAAAACCGTATCCTCACGACCCCCATAGACCGTTGTTAATT GTTTAAAAACACAATTAGGATATTATGTGCTAAAGGCGCACCGTCTTCCCCAAGCCTACCATGCGTTTTTATTTAAATCA GGCGACGAAACTCCCAGTTGCAAACACCAAGAANACATGCCGTACATGCGCTCGTTTATACAAGAAATACACCGGCACCG ATCCATTGCAACGCTGAATCTGCCTCATCGAACTGTTCGAAGCGTGCGACTGTGCGGTTATGACATTCCTAAAGATACCT CCAGTAGTACATTGGCCAATCAGCACACATCAGACATATGCGNTTACCTTATGTGTNTTTCTAAATTCGCAGCCAGACGT TTGACAANAATA >GCiWno150_b22.g1_1 PYTPTTI**GGGKRAPLTILSKYPSRVLNK*QWLMYKS*VGLYTVYNSMNVFCLLPNGTR K*NKKCPIFPHPYCIKKSCHDRLSKKR**CYVAK*NIKRIFYGKCRYVIKCLVLDAVL*R PIATDQTAAL*ASFINNLYSRKQAYSRVGEGGTPFHSIFSSHLVLNKEHSKNYKNRILTT PIDRC*LFKNTIRILCAKGAPSSPSLPCVFI*IRRRNSQLQTPRXHAVHALVYTRNTPAP IHCNAESASSNCSKRATVRL*HS*RYLQ*YIGQSAHIRHMRLPYVXF*IRSQTFDXNX >GCiWno150_b22.g1_2 HIPPLLFSRAVENGHL*PYYLNILVAF*TNNNGLCISRK*AYIRFITL*MFFVYYQMGRE NRIKSVPSSPTLTV*KKVAMIG*AKRGNNAM*RNRTSSEYFMVNAVTLLSVWSWTLFYNG QSLLTRLLRYRLPLLIIYIHVNKHIVGWGKAGHLFIPFSPLIWF*TKNIQRIIKTVSSRP P*TVVNCLKTQLGYYVLKAHRLPQAYHAFLFKSGDETPSCKHQEXMPYMRSFIQEIHRHR SIATLNLPHRTVRSVRLCGYDIPKDTSSSTLANQHTSDICXYLMCXSKFAARRLTXI >GCiWno150_b22.g1_3 IYPHYYLVGRWKTGTFNHTI*IS*SRFKQITMAYV*VVSRPIYGL*LYECFLFTTKWDEK IE*KVSHLPPPLLYKKKLP**VKQKEVIMLCSEIEHQANILW*MPLRY*VFGPGRCSITA NRY*PDCCAIGFLY**FIFT*TSI**GGGRRDTFSFHFLLSFGFKQRTFKEL*KPYPHDP HRPLLIV*KHN*DIMC*RRTVFPKPTMRFYLNQATKLPVANTKXTCRTCARLYKKYTGTD PLQR*ICLIELFEACDCAVMTFLKIPPVVHWPISTHQTYAXTLCVFLNSQPDV*QX >GCiWno329_m07.b1 TTGTTTAAAACCAAATGAGAGGAGAAAATGGAATGAAAAGGTGTCCCGCCTTCCCCCACCCTACTATATGCTTGTTTACG TGAATATAAATTATTAATAAAGGAAGCCTATAGCGCAGCAGTCTGGTCAGTAGCGATTGGCCGTTATAGAACAGCGTCCA GGACCAAACACTTAATAACGTAACGGCATTTACCATAAAATATTCGCTTGATGTTCTATTTCGCTACATAGCATTATTAC CTCTTTTTGCTTAACCTATCATGGCAACTTTTTTTTTATACAGTAAGGGTGGGGGAAGATGGGACACTTTTTATTCTATT TTCTCGTCCCATTTGGTAGTAAACAAAAAACATTCATAGAGTTATAAACCGTATATAGGCCTACTTACGACTTATACATA AGCCATTGTTATTTGTTTAAAACGCGACTAGGATATTTAGATAGTATGTGTTAAAGGTGCCCGTTTTCAACCGCCCTACT AAATAGTAGTGTGGGGTTATATGGGACATATTTTCATTATGTTTTTCCCGTCCCATTTTTTACATAATTTTAACATAGGT GCAAAAACAATAAAGCTTGTATAAAACTCAGTAGTTTGTAACTTAGAAATACTGATCCTTGTAACCGTTCCAAATAAACT TACCGATAACCTGGTCCACCTCTTGCCATATTTTTGCCCTGACACTCTGGATGGCCGAACAAAGACACCACACACCAACC CAACGCTGGGAGTCGCTGTATTAATAGCCAGCCACCAATAAATCTTGAGTGGAATGAATGGAGCTGGTCATCCTATAACA CATTTATTATTTGCATTGGTGGCGGAGTGGGTTATTCCTTTCGGCGGGTAATAGTTGGGGCAGTTTTTAATGGGCACCGG GCGCGGGCCCTCT >GCiWno329_m07.b1_4 RARARCPLKTAPTITRRKE*PTPPPMQIINVL*DDQLHSFHSRFIGGWLLIQRLPALGWC VVSLFGHPECQGKNMARGGPGYR*VYLERLQGSVFLSYKLLSFIQALLFLHLC*NYVKNG TGKT**KYVPYNPTLLFSRAVENGHL*HILSKYPSRVLNK*QWLMYKS*VGLYTVYNSMN VFCLLPNGTRK*NKKCPIFPHPYCIKKKLP**VKQKEVIMLCSEIEHQANILW*MPLRY* VFGPGRCSITANRY*PDCCAIGFLY**FIFT*TSI**GGGRRDTFSFHFLLSFGFKQ >GCiWno329_m07.b1_5 EGPRPVPIKNCPNYYPPKGITHSATNANNKCVIG*PAPFIPLKIYWWLAINTATPSVGLV CGVFVRPSRVSGQKYGKRWTRLSVSLFGTVTRISISKLQTTEFYTSFIVFAPMLKLCKKW DGKNIMKICPI*PHTTI**GG*KRAPLTHTI*IS*SRFKQITMAYV*VVSRPIYGL*LYE CFLFTTKWDEKIE*KVSHLPPPLLYKKKVAMIG*AKRGNNAM*RNRTSSEYFMVNAVTLL SVWSWTLFYNGQSLLTRLLRYRLPLLIIYIHVNKHIVGWGKAGHLFIPFSPLIWF*TX >GCiWno329_m07.b1_6 RGPAPGAH*KLPQLLPAERNNPLRHQCK**MCYRMTSSIHSTQDLLVAGY*YSDSQRWVG VWCLCSAIQSVRAKIWQEVDQVIGKFIWNGYKDQYF*VTNY*VLYKLYCFCTYVKIM*KM GREKHNENMSHITPHYYLVGRLKTGTFNTYYLNILVAF*TNNNGLCISRK*AYIRFITL* MFFVYYQMGRENRIKSVPSSPTLTV*KKSCHDRLSKKR**CYVAK*NIKRIFYGKCRYVI KCLVLDAVL*RPIATDQTAAL*ASFINNLYSRKQAYSRVGEGGTPFHSIFSSHLVLNX >GCiWno59_m08.g1 CTGGCCTTCGCTTACTGTAATATTCTACTACTATCATTATTACCTCTTTTTGCTTAACCTATCATGGCAACTTTTTTTTT ATACAGTAAGGGTGGGGGAAGATGGGACACTGTTTATTCTATTTTCTCGTCCCATTTGGTAGTAAACAAAAAACATTCAT AGAGTTATAAACCGTATATAGGCCTACTTACGACTTACACATAAGCCATTGTTATTTGTTTAAAACGCGACTAGGATATT TAGATAGTATGTGTTAAAGGTGCCCGTTTTCAACCGCCCTACTAAATAGTAGTGTCGGGTTATATGGGACATATTTTCAT TATGTTTTTCCCGTCCCATTTTTTACATAATTTTAACATAGGTGCAAAAACAATAAAGCCTGTATAAAACTCAGTAGTTT GTAACTTAGAAATACTGATCTTTGTAACCGTTCAAAATAAACTTACCGATAACCTTGTCCACCTCTTGCCATATTTTGTC CTGACACTCTGGATGCCGAAGCAAAGACACAACACACCAACCAAACGCTGCAGTCGTTGTATTAATAGCAGCTACCAATA AATCTCGAGTGTAATGGTGGAGTTGCTCATCCTATAACACATTTATTTTTGTATTGTGGCGTATTGCTTATTCTTTTGCT GNTATATGTTTGTAGTTTTTTATGTGTAGTTTGCTGTGCCGTCCTGTCGTATTGGTTAAATTGAACGAATGAACACAATG TAAGTTGTTTACCACGCGTGGCGAGGCCACAACAGTCGTTATAACGCGGGGCTTTCTTGATTCAAATATCTCCGTGCCAG CGGCCGCGAACTATTGTGCCCGCGGG >GCiWno59_m08.g1_4 PRAQ*FAAAGTEIFESRKPRVITTVVASPRVVNNLHCVHSFNLTNTTGRHSKLHIKNYKH IXAKE*AIRHNTKINVL*DEQLHHYTRDLLVAAINTTTAAFGWCVVSLLRHPECQDKIWQ EVDKVIGKFILNGYKDQYF*VTNY*VLYRLYCFCTYVKIM*KMGREKHNENMSHITRHYY LVGRLKTGTFNTYYLNILVAF*TNNNGLCVSRK*AYIRFITL*MFFVYYQMGRENRINSV PSSPTLTV*KKSCHDRLSKKR******NITVSEGQ >GCiWno59_m08.g1_5 PAGTIVRGRWHGDI*IKKAPRYNDCCGLATRGKQLTLCSFVQFNQYDRTAQQTTHKKLQT YXSKRISNTPQYKNKCVIG*ATPPLHSRFIGSCY*YNDCSVWLVCCVFASASRVSGQNMA RGGQGYR*VYFERLQRSVFLSYKLLSFIQALLFLHLC*NYVKNGTGKT**KYVPYNPTLL FSRAVENGHL*HILSKYPSRVLNK*QWLMCKS*VGLYTVYNSMNVFCLLPNGTRK*NKQC PIFPHPYCIKKKLP**VKQKEVIMIVVEYYSKRRPX >GCiWno59_m08.g1_6 RGHNSSRPLARRYLNQESPAL*RLLWPRHAW*TTYIVFIRSI*PIRQDGTANYT*KTTNI XQQKNKQYATIQK*MCYRMSNSTITLEIYW*LLLIQRLQRLVGVLCLCFGIQSVRTKYGK RWTRLSVSLF*TVTKISISKLQTTEFYTGFIVFAPMLKLCKKWDGKNIMKICPI*PDTTI **GG*KRAPLTHTI*IS*SRFKQITMAYV*VVSRPIYGL*LYECFLFTTKWDEKIE*TVS HLPPPLLYKKKVAMIG*AKRGNNDSSRILQ*AKAX >ciht008o06 3 aa diffs Length = 696 Score = 115 bits (284), Expect = 1e-25 Identities = 53/56 (94%), Positives = 54/56 (95%) Frame = +1 Query: 1 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDMW 56 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLA HTSD+W Sbjct: 76 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLAILHTSDIW 243 >ciht008o06 CNGCACGAGGCCCAAGCCTACCATGCGTTTTTATTTAAATCAGGCAACGAAACTCCCAGTTGCAAGCACCAAGAAAACAT GCCGTACATGCGCTCGTTTATACAAGAAATACACCGGCACCGATCCATTGCAACGCTGAATCTGCCTCATCGAACTGTTC GAAGCGTGCGACTGTGCGGTTATGACATTCCTAAAGATACTCCAGTAAGTACATTGGCCATTCTGCACACATCAGACATA TGGGTTTACCTTATGTGTTTTTCTAAATTCGCAGCCAGACCTTCATGGGCGAGAAAAATCGCCCATGAAGGTATATAAGT GGTAACTCGTATAAGCGGACATGATATGTAGGACAGAACGCCTCTGTTTTGGAGACTGGCGTTGCCCTGTCAGGCTAGGA TAAATAAGTTACATTAATAATGATATATATAATAAAGTATAACATGTACGTTTTGTGCTTAATTCTAACATAGGTACTTA CCAACATTTGGAGGGTCCACAATAATCAAAATGTTTGGAAAAATCCACACGAGTTTCGACCCGATCGTCATCTTGACGGC AACGGCAATTTCGTCTCATCTAACAACGTCATTCCTTTTGCATGTGGTTATAGGCGTTGTTTGGGCGAGCAAATAGCCAA AGCAGAGATATTTCTTTTTATAGTAAATATTGTCAAACGGTTTCACTTGGTAGTGG VLTNIWRVHNNQNVWKNPHEFRPDRHLDSNGNFVSSNNVIPFACGYRRCLGEQIAKAEIFLFIVNIVKRFHLVV >GCiWno582_d02.b1 CHROMAT_FILE: GCiWno582_d02.b1 PHD_FILE: GCiWno582_d02.b1.phd.1 CHEM: term DYE: big TIME: Wed Oct 24 12:56:49 2001 TEMPLATE: GCiWno582_d02 DIRECTION: fwd Length = 880 Score = 121 bits (300), Expect = 2e-27 Identities = 55/56 (98%), Positives = 56/56 (99%) Frame = -1 Query: 1 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDMW 56 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSD+W Sbjct: 310 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDIW 143 >GCiWno582_d02.b1 CATCATATGTATGGCNNCTGCAGGAATCGACTTCTAGAGGGATCCCCATACGAGTTACCACTTATATAACTTCATGGGCG ATTTTTTTGGTTATTTTTGTTCAACGTCTGGCTGCGAATTTAGAAAAACACATAAGGTAAACCCATATGTCTGATGTGTG CTGATTGGCCAGTGTACTTACTGGAGTATCTTTAGGAATGTCATAACCGCACAGTCGCACGCTTCGAACAGTTCGATGAG GCAGATTCAGCGTTGCAATGGATCGGTGCCGGTGTATTTCTTGTATAAACGAGCGCATGTACGGCATGTTTTCTTGGTGC TTGCAACTGGGAGTTTCGTCGCCTGATTTAAAAAATAAAACTTACAGTAGGCTTGGGGAAGACGGTGCGCCTTTAGCACA TAATATCCTAATTGTGTTTTTAAACAATTAACAACGGTCTATGGGGGTCGTGAGGATACGGTTTTTATAATTCTTTGAAT GTTCTTTGTTTAAAACCAAATGAGAGGAGAAAATGGAATGAAAAGGTGTCCCGCCTTCCTCTACTATACTTGTTTACGTG ATTAAATTATTAATAAAGTAAGCCTATACGCCTACAGCAGTCTGGTCAGTGGCGATGGGCCGTTAAAACAGCGTTTAGGA CCAAACACTTAAAGTAAAGGTATTTACCATAAAATATTCGCTTGATGTTCTATGCTACTACATAGCATTATTACCTCTTT TTGCTTAACCTATCATGGCAACTTTTTTTTATACAGTAAAGGTTGGGGGAACATGGGAACACTGTTTAATTCTATTTTTC TCGGCCCATTTGGGCAGTAACACAAAAAACTTTTCTTAAGAGTTATAAAACCCGTATTATAGGCCTCACTTTACCACTTT >GCiWno582_d02.g1 TACGAATTCGAGCTCGTACCCAGAGTTCTTTTTATACCAAATGGGTCGAGAAAATAGAACGAAAACATGTCCCATATTAC CCGACTCTATATCTTGTATTTTCAAATGAATCTACGATTTATTTACATCAGTATATAGCGTTACCCTACTGTTCTACTGT GTTAAGGCAATCTAAATAAAATAATGTTTATACAGACTCTCTTTCGGAAAGCAGTAACTAATGTCACATGCTCCGTTCTA ATGGGAAAACGATACGACTACGACGATCCAGTTTTGAACAATATTACAGAACATATAAGGTAACTGAACCACTTTTTATA CGCGATGGACATGCTTTACCCAATTGATGAAAGGATAAACACATACATTTCCAACGTATGCTTGAATATAATATAGTACT GTGGGGTACCGTACCTCCTAAATCCCATATTTCATGATCGTGTTTTACCTATTAACAACGTTATTAAAGTCGTCAAGACA CGGTTATGTAATTTTGTAACACATTGTGTTTACTACCAAATAATACATTGTGTTTACTACCAAATAATACATTGTGGTTG CTACCAAATGAGACGAGAAAAGAGAATGAAAACTTGTCCCGTCTTACCCAACTGTACATACTAACTTTTTCATGTATGTC ATAATACGCAGACCGAGTAAAACAGTCGTCATGTACTCACCGATTCTTCGGTTTATCCCGCCTTTCCGTTGGACATACAA GAAGATAGTAAACAGCATTGAACAGGTTACAGGTATAACTGATATAATACATGTTAATGCTGTCTATAATTTGGTTTAAT CCTGTACTGTATGCTCACGTGTTAAAAATCGTTCGAAAAATATATATCCAAATATATTAATAATAGTCTACATATGAATA TANTACCATAAGGACCACTAGTGCAGTAATACCCTGTAGTGCTGGACTG LQW173019.y2 LQW173019.y2.phd.1 LQW173019.x1 11:47:54 2001 TEMPLATE: LQW173019 DIRECTION: rev Length = 992 Score = 122 bits (304), Expect = 3e-28 Identities = 56/56 (100%), Positives = 56/56 (100%) Frame = -3 Query: 1 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDMW 56 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDMW Sbjct: 273 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDMW 106 >LQW173019.y2 >lcl|LQW173019.y2 CHROMAT_FILE: LQW173019.y2 PHD_FILE: LQW173019.y2.phd.1 CHEM: term DYE: ET TIME: Wed Aug 22 11:47:54 2001 TEMPLATE: LQW173019 DIRECTION: rev ACTAAGCCTGTCCATACGAGTACACTTATATACTTCATGGGCGATTTTTTTGGTTATTTTTGTTCAACGTCTGGCTGCGA ATTTAGAAAAACACACAAGGTAAACCCACATGTCTGATGTGTGCTGATTGGCCAATGTACTTACTGGAGTATCTTTAGGA ATGTCATAACCGCACAGTCGCACGCTTCGAACAGTTCGATGAGGCAGATTCAGCGTTGCAATGGATCGGTGCCGGTGTAT TTCTTGTATAAACGAGCGCATGTACGGCATGTTTTCTTGGTGTTTGCAACTGGGGGTTTCGTTGCCTGATTTAAATAAAA ACGCATAGTAGGCTTGGGGAAGACGGTTGCGCCTTTAGCACATAATATCCTAATTGTGTTTTTAAACAATTAACAACGGT CTATGGGGGTCGTGAGGATACGGTTTTTATAATTCTTTGAATGTTCTTTGTTTAAAACCCAATGAGAGGAGAAAATGGAA TGAAAAGGTGTCCCGCCTTCCCCTACTACATACTTGTTTACGTGATTAAATTATGAATAAAGTAAGCCTACGCCTACAGC AGTCTGGTCAGTAGCGATTGGCCGTTATAGAACAGCGTCCAGGACCAAACACTTAATAACGTAACGGCATTTACCATAAA ATATTCGCTTGATGTTCTATTTCGCTACATAGCATTATTACCTCTTTTTGCTTAACCTATCATGGCAACTTTTTTTATAC AGTAAGGGTGGGGGAAGATGGACACTTTTATTCTATTTTCTGGTCCCATTGTAGTAACAAAAGCTCATGAGTATAACCGT ATTAGCCTATAGAATAATAAAAGCATGGTATTGTTAAAAGGATGGCATAGGATGAGTGTAGAGCGTTTCAGGCTAAAAAA AGGGGGTATGACATTGGAGGTACGCAATAGAACTTCTGGAAACAACGATACGGTGTAAGGTGAAAAAAAAGTCAGTTTGA GTGAAACTCTTCTGGTTTTTTGATTGTTNNGN >LQW173019.x1 >lcl|LQW173019.x1 CHROMAT_FILE: LQW173019.x1 PHD_FILE: LQW173019.x1.phd.1 CHEM: term DYE: ET TIME: Wed Aug 22 11:24:41 2001 TEMPLATE: LQW173019 DIRECTION: fwd GGCACACGTTGCCATACTTAAGGTGGTAGCCACTACAAGGAGTGCCCCGTGTATAGTGCTGAACACGAACGTATATCAGC TAGTTCTCTATCATAACCACCACTGTGGTCGCGGCAAACATCAGAATGCACAACAAATGGTTCCCATCATTCAACCCACA CGTCTCATATCCTATGGTCATATAAATATACCAAACTGGACATCCTACAGCATATCAATTTCACCATACCAACAATCATC ACCATATTAAACCCCTCAATATAGATTCATCAATATGAATGCTAAACAAATGCCACAGTACCTAGCACAATCATACACAC ACCACACTCGATCCATCCTCAACACAAGCAACATACCAATCCTAATGACCTCAGACACACCAACCACCAACGTCACAACA ATACATAATATAGATCACAACACAATAAGTCATTACCGCCTGCACAATACACCTCACCAACTTGTTGTAGCCACAATACT ACATAGCAGATCACACTACGCTACCNATACCAACTACACAAACATAGCGCAATTCCACGCTACAGGTCCCAACTTATAGG CGACCACCACACACAATTCCACTTATTACAACACCATCAGCAACAAACACAAATACAAATACACTATCGATGATACATAG AGCACTATAGCGACAAATACACCAACCATACATATGATCATCCAACTACAACCATGACACAGACAATATAGATAGCCACA CACAATATAGGCACATATACTACACCACTCACACACCAAATGACTAGACAGTCTGCCACCACCTAAGTGGCGATTACCAC ATCACACAACACACACCAATACCTAACTCAACTCATACATACTACCCATACATANCACGCACTTACAGATCTGAGACGCT CACTGCGTACCACCAAATCATACTCATAACCCATCACAACTTAACAATCGATAAACACACAATTGACCAGATCGACGTAA AGATATTACAACACCAGTAATTAAAACACACTACTACAACCGATCTATACACACAACATTACACCACCAACATTTACAAC ATATTATTATCCTACATATGAGAACACACTACCCGAGCCATAGTCAGCTTCATAATCATCATCAACCAGCCAATTAACAA TTACTGTCATCTGACACCAGACACATACGGCACTATCACTCACGCTACTCTGACACACACACATCAAACTACATGGACTT ACAACACAAAAGTGCACAACACATAACCAGCGAGACATCATTACACATTAGAGTTACACACACAATACTAATAGTTATGT GTTACACAAATCTAAGACCAACACTACATGAAATATGATCTATACAAACACCAATATACCTACCAACCACTAGACCTATT ACACAATATCAACACAGACTAAATGCGCTAGACAATAATATACAGCAACTAAACCACACAAACAAACAACAACAAACTAC ACACAACAACAAGTACATAAAGACATAATAACCACTAACCTCCACACACACACATGTAATGCACACCCAGTACAAGACAA CCAAACACACAACCATACACATTAGACGACACAACACACACATACTAAATATCACACCACAGAACAAATATTAACAACAC ACACAACACAACACAATAGAATAACACACATACACACTCTCAAGACCAACTATCATTAACTCAACATAGGAATTACATTG GTCTACCACTCAGCACATTCACAAGACAACTACGAACACCATCACCGCTATACACAACCAATCACAGCATACAGGACACT ACCGTACTTCACCATCCACACGCACTACATCATTCCTTCATCCATGAGAATCATACTACACGACTAGCCCACTTCATACA AAGTCTAACTGCAACAATACACAACCAGCCAACAACTCACTATAACAACACATCAACATATATCCAACCAACAATCAAAC ACCAGTACTCATACACACTCACCACAAACACTATACCCTCTATAGACAGCAACACCATAACTACACATATCAAGACAATC AAAATCCCCATACACCCACAACACAAGACCAACCAAGCTAATTCAACACATCCGCACGCACCACACCACCAAATACATAA ATAACCACAATAAAATAATACTAATAAACAACAATCAACCATACATATCATCAACAACCACAAGCACCCACAC LQW18289.x1 LQW18289.x1.phd.1 LQW18289.y1 15:50:36 2001 TEMPLATE: LQW18289 DIRECTION: fwd Length = 292 Score = 121 bits (300), Expect = 9e-28 Identities = 55/56 (98%), Positives = 56/56 (99%) Frame = +3 Query: 1 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDMW 56 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSD+W Sbjct: 36 NMPYMRSFIQEIHRHRSIATLNLPHRTVRSVRLCGYDIPKDTPVSTLANQHTSDIW 203 >LQW18289.x1 >lcl|LQW18289.x1 CHROMAT_FILE: LQW18289.x1 PHD_FILE: LQW18289.x1.phd.1 CHEM: term DYE: ET TIME: Wed Mar 21 15:50:36 2001 TEMPLATE: LQW18289 DIRECTION: fwd GCAGTGCCGAAACTCCCAGTTGCAAGCACCAAGAAAACATGCCGTACATGCGCTCGTTTATACAAGAAATACACCGACAC CGATCCATTGCAACGCTGAATCTGCCTCATCGAACTGTTCGAAGCGTGCGACTGTGCGGTTATGACATTCCTAAAGATAC TCCAGTAAGTACATTGGCCAATCAGCACACATCAGACATATGGGTTTACCTTATGTGTTTTTCTAAATTCGCAGCCAGAC GTTGAACAATAATAATCAAAAAAATCGCCCATGAAGTTATATAAGTGGTAAC >LQW18289.y1 >lcl|LQW18289.y1 CHROMAT_FILE: LQW18289.y1 PHD_FILE: LQW18289.y1.phd.1 CHEM: term DYE: ET TIME: Wed Mar 21 15:50:54 2001 TEMPLATE: LQW18289 DIRECTION: rev GGCCACACACATAAATTGAAGCATACGAACATGAGAGTCACTTTTATATCAACCCAGGCAGCATAACAGGTGCCTTTTCA CCAACTGCAAGGTAAGACATTAGCCTATTGATTTTTATTGAGCATATTGATACTAGCACTTTTATTGACAAAGTCCTGTT TACTTTATTGCAAGTTAAAATACTAGCATACTTATGAATGAATGTAACTTACTTTATCCTTGCTTGGTAGGAGCACGACA GTTGTTATAACACAGGTGTTTATACACCTCGTACCACCTTATGAGTTACCATGTATGTTACTTTGTGTTTTTTGGGGGGG ATTTCGTTTATGTTTAACTGATAATTTGTACACCACTTTAGTAACCACTGGGTTGGAGCAATTGTCGTATGTGTCTTACC CAGGACACATACGCCCACAATGGTAGCAGCGACGAGGCCTGAACCCAATAAACTTAACGGCAATTGCTTCAACCCAGTGG TCACAAATGGGTTTTTCAAATTATCAGCCATACATAAAAAAATAAAAAAAAATATTCTCCCATGAAGTAACATACATGGT AACTCGTAAGCTTGCATGAGATGTATGAAACAGAACACCCGTGTTACAACGACTGTTGTTTTCCGGCCACGCAAAGATAA AGTAATTATATTCATTATACATTCATATTTTCAGTG >sequence 13, 55, 80 46% to 2C8 The C-terminal parts match for 13 and 55 but not the I-helix region. This is a different gene ITAGQRFEYDDQFFIQIRRHLWFMDPDRTAAFSAMVFVPLLCHIPPYRKKYEQIKQDTKE MMGLFQQLVDQHRKTFDKNNLRDFIDAFILENENGTDESFT DSLICKDRQLVHYVRELFKAGTETSTGTLRWAMLCLIHYPGAQEK IRKEIFDVLGNSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKG TTILSNLWAVHNDPTVWNNPRQFKPERHIDDKGKYVQSNHVIPFSVGPRHCLGEQLARMEIFIFLV SMVQKFEFLPDPNKTDLPELDDGVNGVAFVPYPFKLVAKEI* DEV1691.y1 LGJ489.y1 LQW217248.x2 LQW193647.x1 >cibd060b13 Length = 512 Score = 131 bits (325), Expect = 4e-30 Identities = 64/69 (92%), Positives = 65/69 (93%), Gaps = 3/69 (4%) Frame = +3 Query: 7 DRQLVHYVRELFKAGTETSTGTLRWAMLCLIHYPGAQEKIRKEIFDVLGK---PSMSDRN 63 DRQLVHYVRELFKAGTETSTGTLRWAMLCLIHYPGAQEK+RKEIFDVLG PSMSDRN Sbjct: 306 DRQLVHYVRELFKAGTETSTGTLRWAMLCLIHYPGAQEKMRKEIFDVLGNSTFPSMSDRN 485 Query: 64 AMPYTSAFI 72 AMPYTSAFI Sbjct: 486 AMPYTSAFI 512 >cibd060b13 ACATTACGGCTGGACAACGGTTTGAATACGACGATCAGTTCTTTATACAAATTCGACGACATTTATGGTTTATGGATCCG GACCGAACCGCTGCGTTTTCTGCTATGGTGTTTGTCCCGTTACTGTGTCACATTCCACCTTACCGTAAAAAGTACGAGCA GATAAAGCAAGATACGAAAGAAATGATGGGCCTGTTTCAGCAACTGGTCGACCAGCATAGGAAAACCTTCGACAAAAATA ATCTACGGGATTTCATCGACGCTTTTATTCTGGAGAATGAAAATGGGACTGACGAAAGTTTTACGGACCGACAGCTTGTA CATTACGTACGTGAACTCTTCAAAGCTGGTACTGAAACTTCTACGGGTACTCTTAGATGGGCGATGTTGTGTCTCATTCA CTATCCTGGAGCACAGGAGAAAATGAGAAAGGAAATATTTGACGTCTTAGGCAATAGCACTTTTCCCAGCATGAGTGATC GTAACGCGATGCCATACACTTCCGCGTTCATA >cibd060b13_3 ITAGQRFEYDDQFFIQIRRHLWFMDPDRTAAFSAMVFVPLLCHIPPYRKKYEQIKQDTKE MMGLFQQLVDQHRKTFDKNNLRDFIDAFILENENGTDESFT DRQLVHYVRELFKAGTETS TGTLRWAMLCLIHYPGAQEKMRKEIFDVLGNSTFPSMSDRNAMPYTSAFI QEVFRFRTLAPLGVPHKTTDTVNFANYIIPKGTTILSN LWAVHNDPTVWNNPRQFKPERHIDDKGKYVQSNHVIPFSVGPRHCLGEQLARMEIFIFLV SMVQKFEFLPDPNKTDLPELDDGVNGVAFVPYPFKLVAKEI* >rcibd060a13 Length = 765 Score = 112 bits (277), Expect = 7e-25 Identities = 53/54 (98%), Positives = 53/54 (98%) Frame = -2 Query: 9 LGNSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKG 62 L NSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKG Sbjct: 764 LXNSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKG 603 >rcibd060a13 TGTGTTGTATTTAAAACTGCGACTTGTATATATTTATTAGACTCACAGCAAAACACGGCAATAGAAAAAGACAGTAACAC TTATATAGAATAGAACGTATTGCATTTGGAAATATGCGACTTAAAGGGGTATACCAACGAAAATCCAAAATTTGTAAATG TGTAGAGATGCTCAAATGTGGTAAAAACTACCACCTAAGTTTCAAAAAACAAAATACTTCAAAAAAATAAAAATTTTGAT TTTTCGTTGTTATGCCATTTGAAGAACAAGAAATTTCGCTAAATTTCTTTAGCGACCAATTTAAACGGATAAGGCACAAA CGCTACACCATTAACCCCATCATCCAATTCTGGAAGATCTGTTTTGTTCGGATCGGGAAGAAACTCAAACTTCTGAACCA TTGAAACCAGAAAGATGAAAATTTCCATTCTAGCTAGTTGTTCTCCCAAGCAATGACGTGGACCCACTGAGAAAGGTATT ACGTGATTCGATTGAACATATTTTCCTTTATCGTCAATGTGGCGCTCAGGTTTAAATTGACGTGGGTTGTTCCATACAGT AGGATCGTTGTGCACTGCCCATAAGTTTGATAAAATCGTGGTTCCTTTTGGAATAATATAGTTAGCAAAGTTGACAGTGT CTGTTGTCTTGTGTGGAACCCCTAACGGAGCTAACGTTCGAAACCGAAACACTTCTTGTATGAACGCGGAAGTGTATGGC ATCGCGTTACGATCACTCATGCTGGGAAAAGTGCTATTGNCTAAG NSTFPSMSDRNAMPYTSAFIQEVFRFRTLAPLGVPHKTTDTVNFANYIIPKGTTILSN LWAVHNDPTVWNNPRQFKPERHIDDKGKYVQSNHVIPFSVGPRHCLGEQLARMEIFIFLV SMVQKFEFLPDPNKTDLPELDDGVNGVAFVPYPFKLVAKEI* >sequence 14, 37, 69 25 accessions 50% to 2C8 MLPSGYIFVFVFLCIYYLLQWRRRPKNFPPGPLGIPLFGIA PFAGVDMHKYLATYYAKYGGVMSFRLATKDWIVLNDIEAITQ (0) ALLKQGESFSGRPQSYLMNQLTEGCGIVFSTGPRWQAQRRFVLTALKT (2) (C-helix) LGMGKRSTMVGIINEENQNFLSVIQSSGGKVNIL (0) SEFRKLLSNIVSNLIMGKRFNYEDEKLQAIVDHR (2) PTSSVVMFIPFLRFIPPFKQGYQRLIASTQRVL (1) DIISEITEEHKNSFDENNLRDFIDIFLLEMKRRNSEYFT (0) ELQLLHLVRDLFVGAIDTTTATLGWGIICLLHYPECQVRIQEEIDDVI (1) GCAEPDMSHHESMPYLRAFIQEVHRFQTIAPLNIPHCVTEDCVLFGYHIPKS TPVMSNIWRVHNDPKYWENPEKFSPERHLDSEGRFVPSNRVLSFAVGHRSCLG VQLARVELFLFYASTLKKYEFQIDPEYGLPDWSNDRSGTVKTPKKFSVLLKSR* LQW4880.y1 exon 1 opposite = LQW4880.x1 (exon 6) DEV1604.y1 exon 1 opposite = DEV1604.x1 ? LQW232493.y1 exon 1 opposite = LQW232493.x1 (exon 7) LGJ1425.x1 exon 1 opposite = LGJ1425.y1 (exon 7) LQW269127.y1 exon 2 opposite = LQW269127.x1 (exon 7) LQW48810.x1 exon 2 opposite = LQW48810.y1 (exon 2) LQW48810.y1 exon 2 opposite = LQW48810.x1 (exon 2) DEV27943.y1 exon 2 opposite = DEV27943.x1 (exons 8,9 fused) LQW265377.x1 exons 2 and 3 opposite = LQW265377.y1 (exons 8,9) exons 4 and 5 are missing LQW37658.y1 exon 6 opposite LQW37658.x1 ? LQW98270.x1 exon 6 opposite = LQW98270.y1 ? LQW160072.x1 exon 6 LQW160072.y1 (C-terminal and 3 prime UTR) LQW160072.x2 exon 6 LQW160072.y1 (C-terminal and 3 prime UTR) DEV47237.x1 exon 6 no opposite LQW4880.x1 exon 6 opposite = LQW4880.y1 (exon 1) DEV55412.x1 exon 7 opposite = DEV55412.y1 ? LQW232493.x1 exon 7 opposite = LQW232493.y1 (exon 2) LQW103809.x01 exon 7 opposite = LQW103809.y1 ? LQW103809.x1 exon 7 opposite = LQW103809.y1 ? LQW269127.x1 exon 7 opposite = LQW269127.y1 (exon 2) LGJ1425.y1 exon 7 seq 14 opposite = LGJ1425.x1 (exon 1) LQW231417.x1 exons 8,9 fused opposite = LQW231417.y1 LQW265377.y2 exons 8,9 fused opposite = LQW265377.x1 (exons 2, 3) DEV27943.x1 exons 8,9 fused opposite = DEV27943.y1 (exon 2) LQW160072.y2 exons 8,9 fused opposite = LQW160072.x1 (exon 6) GCiWno663_c10.b1 ex 3, 4 Probable savignyi ortholog to exon 4 DEIRRLQANNTCNFLSGKHFDYKDKRLQAIIDHR (2) Two candidates for exon 4 in intestinalis XEIRRLEANIISKVLIGKRFDYTDERLNSIIDCR GCiWno350_k07.b1 related seq SEFRKLLSNIVSNLIMGKRFNYEDEKLQAIVDHR GCiWno663_c10.b1 this is it >GCiWno663_c10.b1 TTCTCGTCTATTTTTTTCTTTCATTCTATTATTCATTTTGGTAGTAAACAAAGAACATTCAAGGAACTATGAAACCTTGT CCTCGCGACTTTCATAGACCGTTGTTACTTGTTTAAAACACGACCAGGATATTTATATATTTTATGCTAAAGATGTCCCA TCTTCCCCCACCCCACTATATATATTTACCATTTTTTAGTCTAGGAATGGGAAAAAGAACCATGGTCGGAATTATAAACG AAGAAAACCAGAATTTTTTATCCGTAATACAATCTTCTGGAGGAAAGGTTAACATTTTGGTAAGCCATATTTTTTTATTG GTACACGTCCTTTTATTTTTAACGTTATATGCATTTCTTACTACAGTCTGAATTTCGGAAACTCCTGTCGAATATTGTTA GTAATCTGATAATGGGAAAGCGTTTTAACTACGAAGACGAGAAGTTGCAAGCTATTGTGGATCATAGGTAATAAATGATA TAACTTAACCATGCAAGGTATAAAACTGCAAAGTGCCATTATATATAGCTGGGGGTGGGGGAAGATGGGACCCCNNNNGC NCANNNNATCCANNNNTCCCTGACCGCCGCCCGAACCAANCCCACAACCGCCCCATGGGAGCCCGCGACAACACCGGCGC ACCACCCACTCGACCCNCGCGCCCCACCACGAGACCACACCACACGCAAAGGGGCCCCACCCTTCCCTTCCCTCCTTTCT ATCACTCTCCCTCTTTTCCCCACTTGGTCTACTCATTTTTATCTCTTCTTTCTCTCCCCTCATTTCTTACCCCCCCTGTC CTGCCCGTCCACACGCCCCCCCCGTTTGCCTAATTTTTGTCTCGCTCTTCTTCCACTCTCTTACCGCGCTTCCCACTCAA TCCCN >_1 FSSIFFFHSIIHFGSKQRTFKEL*NLVLATFIDRCYLFKTRPGYLYILC*RCPIFPHPTI YIYHFLV*EWEKEPWSEL*TKKTRIFYP*YNLLEERLTFW*AIFFYWYTSFYF*RYMHFL LQSEFRKLLSNIVSNLIMGKRFNYEDEKLQAIVDHR**MI*LNHARYKTAKCHYI*LGVG exon 4 EDGTPXAXXIXXSLTAARTXPTTAPWEPATTPAHHPLDPRAPPRDHTTRKGAPPFPSLLS ITLPLFPTWSTHFYLFFLSPHFLPPLSCPSTRPPRLPNFCLALLPLSYRASHSIP >_2 SRLFFSFILLFILVVNKEHSRNYETLSSRLS*TVVTCLKHDQDIYIFYAKDVPSSPTPLY IFTIF*SRNGKKNHGRNYKRRKPEFFIRNTIFWRKG*HFGKPYFFIGTRPFIFNVICISY YSLNFGNSCRILLVI**WESVLTTKTRSCKLLWIIGNK*YNLTMQGIKLQSAIIYSWGWG KMGPXXXXXSXXP*PPPEPXPQPPHGSPRQHRRTTHSTXAPHHETTPHAKGPHPSLPSFL SLSLFSPLGLLIFISSFSPLISYPPCPARPHAPPVCLIFVSLFFHSLTALPTQS >_3 LVYFFLSFYYSFW**TKNIQGTMKPCPRDFHRPLLLV*NTTRIFIYFMLKMSHLPPPHYI YLPFFSLGMGKRTMVGIINEENQNFLSVIQSSGGKVNILVSHIFLLVHVLLFLTLYAFLT exon 3 TV*ISETPVEYC**SDNGKAF*LRRREVASYCGS*VINDIT*PCKV*NCKVPLYIAGGGG RWDPXXXXXPXXPDRRPNQXHNRPMGARDNTGAPPTRPXRPTTRPHHTQRGPTLPFPPFY HSPSFPHLVYSFLSLLSLPSFLTPPVLPVHTPPPFA*FLSRSSSTLLPRFPLNP Try to use savignyi to close gap and find exon 4 >scf/ciona01/G126/seq_dir/hrs/G126P69589F.T0/G126P69589FF10.T0.seq 724 0 724 ABI Length = 724 Minus Strand HSPs: Score = 175 (61.6 bits), Expect = 3.7e-13, P = 3.7e-13 Identities = 34/34 (100%), Positives = 34/34 (100%), Frame = -1 Query: 34 RPTGTIVVFLPFLRFIPPFKQKFQKMVDSTEQVI 67 RPTGTIVVFLPFLRFIPPFKQKFQKMVDSTEQVI Sbjct: 427 RPTGTIVVFLPFLRFIPPFKQKFQKMVDSTEQVI 326 >scf/ciona01/G126/seq_dir/hrs/G126P69589F.T0/G126P69589FF10.T0.seq 724 0 724 ABI TTGAATATCGCCCTGTGGTGGAATTTTCAATTACATTTTGTAAAGAACCG TATTTTATCATATGTTAGTAAAAAATCACCGCGGACAGTGCTGGTCTGTG GACCTGCGTTTGAGAATCACCAGTTTAGAAAGCCAAGGCTCGCTTTTCTA GCTTAAGAGCAAATCTCATCCCAACAAATCCCTTTCATTGTACATTTGTA CAACTACCACGAAATTCATATTATTAGATCGGGCATATTTACAGACAATA GCATAAAGAGGGTATAAAGCTTGGCAGGAGAAATAGATGACTTGCCACGG TTTTAATAACAATTTTAACAATACCTATGACTTGCTCTGTGCTGTCCACC ATTTTTTGAAATTTTTGTTTGAAGGGTGGTATGAATCGAAGAAACGGCAA GAAAACAACAATGGTACCGGTAGGCCTGGGAACACATTGGCGAATTTGTA TAAAGTAGTAATATTCCTGTAACGTTAAATGTCATGTCTGTCATTTAAAG GTAATTATTAGGAATATTAAATAAGACTTGTATGGCGAATTTGTATAAAG TAGTAATATTCCTGTAACGTTAAATGTCATGTCTGTCATTTAAAGGTAAT TATTAGAAATATTAAATAAGACTTGTATATTGTATCGCCTTTAGCAATAT ATANAACGGAATGTTAGTGTATGCTTCCATACCTGTGGTCTATAATCGCT TGTAAATCTCTTATCTTGTAGTCA >_1 LENYDLLVLWWEFFTIFTRFCVTFCLT*CYMYLVC*NKKLNVEYFYSLGMGKRTMDAIIN EETNRFIASVQLAGGTVNILV*KVSLGYYDKIFMLYPQNGNVFLITYD**HR*KNHNLH* K*DTNLHPREREISRGVNTFHTLRVLIDFTIE*VKRLCNAQQNTYRTRFDGFRPIIHAIF YRGSTLTTKIRDYKRL*TTGMKAYTNIPVYTSLKAIQYTSLI*YS**X >_2 LKTMISSSCGGNSLPSLPDSVLRFASLNVICTLCVKIKN*MSNIFTVWEWESVPWMQLLT KKQIALSLQSN*PAELSTYWFEKFHWDIMIRYLCFIPKMEMSF**PMISNTVKRIITCIK NKILIFIRVNGKSQGASTHSTH*GF**ILR*NKLNACVMHSKILTGRDSTASGQ*YMQFS IGEAL*LQR*EITSDYRPQV*KHTLTFRFIHR*RRYNIQVLFNIPNN >_3 *KL*SPRPVVGILYHLYPILCYVLPHLMLYVPCVLK*KTKCRIFLQFGNGKAYHGCNY*R RNKSLYRFSPISRRNCQHTGLKSFIGIL**DIYALSPKWKCLFDNL*LVTPLKES*LALK IRY*SSSA*TGNLKGRQHIPHIKGFDRFYDRIS*TLV*CTAKYLQDEIRRLQANNTCNFL SGKHFDYKDKRLQAIIDHRYESIH*HSGLYIAKGDTIYKSYLIFLII Do these overlap? >_4 DYKIRDLQAIIDHRYGSIH*HSVXYIAKGDTIYKSYLIFLIITFK*QT*HLTLQEYYYFI QIRHTSLI*YS**LPLNDRHDI*RYRNITTLYKFANVFPGLPVPLLFSCRFFDSYHPSNK NFKKWWTAQSKS*VLLKLLLKPWQVIYFSCQALYPLYAIVCKYARSNNMNFVVVVQMYNE RDLLG*DLLLS*KSEPWLSKLVILKRRSTDQHCPR*FFTNI**NTVLYKM*LKIPPQGDI Q >_5 *LQDKRFTSDYRPQVWKHTLTFRXIYC*RRYNIQVLFNISNNYL*MTDMTFNVTGILLLY TNSPYKSYLIFLIITFK*QT*HLTLQEYYYFIQIRQCVPRPTGTIVVFLPFLRFIPPFKQ KFQKMVDSTEQVIGIVKIVIKTVASHLFLLPSFIPSLCYCL*ICPI**YEFRGSCTNVQ* KGFVGMRFALKLEKRALAF*TGDSQTQVHRPALSAVIFY*HMIKYGSLQNVIENSTTGRY SX >_6 TTR*EIYKRL*TTGMEAYTNIPXYILLKAIQYTSLI*YF**LPLNDRHDI*RYRNITTLY repeat KFAIQVLFNIPNNYL*MTDMTFNVTGILLLYTNSPMCSQAYRYHCCFLAVSSIHTTLQTK ISKNGGQHRASHRYC*NCY*NRGKSSISPAKLYTLFMLLSVNMPDLII*ISW*LYKCTMK GICWDEICS*ARKASLGFLNW*FSNAGPQTSTVRGDFLLTYDKIRFFTKCN*KFHHRAIF X >scf/ciona01/G126/seq_dir/hrs/G126P68275F.T0/G126P68275FC12.T0.seq 684 0 684 ABI Length = 684 Plus Strand HSPs: Score = 101 (35.6 bits), Expect = 7.3e-05, P = 7.3e-05 Identities = 20/33 (60%), Positives = 25/33 (75%), Frame = +1 Query: 1 LGMGKRSMVGIINEENQNFLSVIQSSGGKVNIL 33 LGMGKR+M IINEE F++ +Q +GG VNIL Sbjct: 142 LGMGKRTMDAIINEETNRFIASVQLAGGTVNIL 240 >scf/ciona01/G126/seq_dir/hrs/G126P68275F.T0/G126P68275FC12.T0.seq 684 0 684 ABI CTTGAAAACTATGATCTCCTCGTCCTGTGGTGGGAATTCTTTACCATCTT TACCCGATTCTGTGTTACGTTTTGCCTCACTTAATGTTATATGTACCTTG TGTGTTAAAATAAAAAACTAAATGTCGAATATTTTTACAGTTTGGGAATG GGAAAGCGTACCATGGATGCAATTATTAACGAAGAAACAAATCGCTTTAT CGCTTCAGTCCAATTAGCCGGCGGAACTGTCAACATACTGGTTTGAAAAG TTTCATTGGGATATTATGATAAGATATTTATGCTTTATCCCCAAAATGGA AATGTCTTTTTGATAACCTATGATTAGTAACACCGTTAAAAGAATCATAA CTTGCATTAAAAATAAGATACTAATCTTCATCCGCGTGAACGGGAAATCT CAAGGGGCGTCAACACATTCCACACATTAAGGGTTTTGATAGATTTTACG ATAGAATAAGTTAAACGCTTGTGTAATGCACAGCAAAATACTTACAGGAC GAGATTCGACGGCTTCAGGCCAATAATACATGCAATTTTCTATCGGGGAA GCACTTTGACTACAAAGATAAGAGATTACAAGCGATTATAGACCACAGGT ATGAAAGCATACACTAACATTCCGGTTTATACATCGCTAAAGGCGATACA ATATACAAGTCTTATTTAATATTCCTAATAATTN >_1 LENYDLLVLWWEFFTIFTRFCVTFCLT*CYMYLVC*NKKLNVEYFYSLGMGKRTMDAIIN EETNRFIASVQLAGGTVNILV*KVSLGYYDKIFMLYPQNGNVFLITYD**HR*KNHNLH* K*DTNLHPREREISRGVNTFHTLRVLIDFTIE*VKRLCNAQQNTYRTRFDGFRPIIHAIF YRGSTLTTKIRDYKRL*TTGMKAYTNIPVYTSLKAIQYTSLI*YS**X >_2 LKTMISSSCGGNSLPSLPDSVLRFASLNVICTLCVKIKN*MSNIFTVWEWESVPWMQLLT KKQIALSLQSN*PAELSTYWFEKFHWDIMIRYLCFIPKMEMSF**PMISNTVKRIITCIK NKILIFIRVNGKSQGASTHSTH*GF**ILR*NKLNACVMHSKILTGRDSTASGQ*YMQFS IGEAL*LQR*EITSDYRPQV*KHTLTFRFIHR*RRYNIQVLFNIPNN >_3 *KL*SPRPVVGILYHLYPILCYVLPHLMLYVPCVLK*KTKCRIFLQFGNGKAYHGCNY*R RNKSLYRFSPISRRNCQHTGLKSFIGIL**DIYALSPKWKCLFDNL*LVTPLKES*LALK IRY*SSSA*TGNLKGRQHIPHIKGFDRFYDRIS*TLV*CTAKYLQDEIRRLQANNTCNFL SGKHFDYKDKRLQAIIDHRYESIH*HSGLYIAKGDTIYKSYLIFLII LQW69473.x1 CHEM: term DYE: ET TIME: Fri May 4 12:32:48 2001 TEMPLATE: LQW69473 DIRECTION: fwd Length = 478 Score = 66.0 bits (158), Expect = 5e-11 Identities = 32/33 (96%), Positives = 33/33 (99%) Frame = -1 Query: 1 LGMGKRTMVGIINEENQNFLSVIQSSGGKVNIL 33 LGMGKR+MVGIINEENQNFLSVIQSSGGKVNIL Sbjct: 379 LGMGKRSMVGIINEENQNFLSVIQSSGGKVNIL 281 >_4 MPTCRL*EIDT*ILCANDVPSSPTPLYIYHFLA*EWEKEAWSEL*TKKTRIFYP*YNLLE ERLTFW*AIFFYWYTSFYF*RYMHFLLQSEFRKLLSNIVSNLIMGKRFNYEDEKLQAIVD HR**MI*LNHARYKTAKCHYI*YRVGEDGTPLAHNIQWD >_5 HAYLSTIGDRHLDIMC*RCPIFPHPTIYLPFFSLGMGKRSMVGIINEENQNFLSVIQSSG GKVNILVSHIFLLVHVLLFLTLYAFLTTV*ISETPVEYC**SDNGKAF*LRRREAASYCG S*VINDIT*PCKV*NCKVPLYIV*GGGRWDTVST*YTVGX >_6 CLLVDYRRSTLRYYVLTMSHLPPPHYIFTIF*PRNGKKKHGRNYKRRKPEFFIRNTIFWR KG*HFGKPYFFIGTRPFIFNVICISYYSLNFGNSCRILLVI**WESVLTTKTRSCKLLWI IGNK*YNLTMQGIKLQSAIIYSIGWGKMGHR*HIIYSGX >GCiWno663_c10.b1 CHROMAT_FILE: GCiWno663_c10.b1 PHD_FILE: GCiWno663_c10.b1.phd.1 CHEM: term DYE: big TIME: Fri Nov 2 11:02:35 2001 TEMPLATE: GCiWno663_c10 DIRECTION: fwd Length = 885 Score = 59.7 bits (142), Expect = 4e-09 Identities = 29/33 (87%), Positives = 31/33 (93%) Frame = +3 Query: 1 LGMGTRSMVGIINEENQNFSSVIQSSGGQVNIL 33 LGMG R+MVGIINEENQNF SVIQSSGG+VNIL Sbjct: 201 LGMGKRTMVGIINEENQNFLSVIQSSGGKVNIL 299 >GCiWno709_i09.b1 CHROMAT_FILE: GCiWno709_i09.b1 PHD_FILE: GCiWno709_i09.b1.phd.1 CHEM: term DYE: big TIME: Tue Nov 6 11:40:56 2001 TEMPLATE: GCiWno709_i09 DIRECTION: fwd Length = 892 Score = 71.4 bits (172), Expect = 1e-12 Identities = 34/34 (100%), Positives = 34/34 (100%) Frame = -2 Query: 1 RPTSSVVMFIPFLRFIPPFKQGYQRLIASTQRVL 34 RPTSSVVMFIPFLRFIPPFKQGYQRLIASTQRVL Sbjct: 264 RPTSSVVMFIPFLRFIPPFKQGYQRLIASTQRVL 163 >GCiWno709_i09.b1 TCTCCACAGAAATATATCGATAAAATCTCTAAGGTTGTTTTCGTCGAAAGAATTTTTGTGCTCTTCAGTTATTTCAGAGA TAATATCTGTACGGCGTATGGCACAAAGTTAATATGTTATAAATATAAGGGAATAAACTACACTAGTATTAAGCCATGTA CCTAAAACCCGTTGCGTGCTGGCGATTAATCTCTGATAGCCTTGTTTGAATGGTGGTATAAATCGAAGAAACGGAATAAA CATAACGACGCTGCTTGTCGGCCTGCATTACAGTTTATTGTAAAATTAACGCATTATAAACAAAATTAAGCACTTGTATA CACTTTGTGTAGATAAATGGGTATGATATATAGTGGGGGTAGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCNNNNNNNNNNNNNNNNNN NNNNNNNNNNNNNNNNNNCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCNNCCCCCCCCCCCCCCCCC CCCCCCCCCCCC >GCiWno709_i09.g1 CAAGCTTAAAACAGTGGTTATCTATATATAATATACCTTTGGTGGGAGGCGTTTCATAGATAACTGGAAATGTAAAAACT AAAACTAGCACCGATACTGTGGCAAAAACGCGTAGTATACCCAGAAACTAAGTGTAGTAGGGTGGGAGAAGATGGACCAC CTTTTCAATATATTTTCTCGTCCCATTTGGTAGGAAACAAGATCACCTATGATATAATATAAGAATAAAAACGTAGACCA TCTTACCCCGCCCAATAGACCGTCATCCAATATCTATATTTAAAGTGTGCGAGAAGCCCATGCGGTGCTGCGTGTTGTGG CTGCTGTATAGGTTTCTGTAACATTGCATTTGCTCTACACGGAAATGGCAAACTTTTAACCGACTTCCGTGTTACTCTAA CTTTGATTACTTATGTCTGTTCTATTTTGTATGCTGTTGGCTTATGTGCATATTTTTTGTCTTGATAGTCTTATGAAAGT TACCTAAACCGGGATTGCAAAACTCAAAGTCTCACAGTAACAACAATTTTACATTTTTTACCTAT