11/17/2000 David Nelson
Sequence numbers correspond to positions in the alignment of 263 carriers.
A and B suffixes are used to avoid renumbering as more sequences are added.
The intron-exon structure is given for all 46 genes.  Seq. 94 has no genomic sequence
available, so its structure is assumed to match that of seq. 95.  The introns are marked
with question marks in that sequence.  Two genes show alternative splicing (phosphate
carrier seqs. 165 and 167, and seqs. 106, 106A and 106B).
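
The gene records in this file end each exon with a marker of the form
(PHASE n INTRON).  By the usual convention a phase 0 intron lies between two codons,
a phase 1 intron after the first base of a codon, and a phase 2 intron after the
second base.  The following is only a minimal sketch of how such a record could be
split into exons and intron phases.  The function name parse_record is hypothetical,
and the sketch assumes lines that carry no flanking nucleotide coordinates (as in
seq. 26 ANT1); records that include coordinates would need extra handling.

```python
import re

# Matches the trailing "(PHASE n INTRON)" marker used in the gene records.
PHASE_RE = re.compile(r"\(PHASE\s+([012])\s+INTRON\)\s*$")


def parse_record(lines):
    """Split annotated sequence lines into exon strings and intron phases.

    Assumption: each line holds only residues, optionally followed by a
    "(PHASE n INTRON)" marker; no nucleotide coordinates are present.
    """
    exons, phases, current = [], [], []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        marker = PHASE_RE.search(line)
        if marker:
            # Keep only the residues that precede the marker.
            line = line[:marker.start()].strip()
        # A trailing "*" marks the stop codon, not a residue.
        current.append(line.rstrip("*"))
        if marker:
            exons.append("".join(current))
            phases.append(int(marker.group(1)))
            current = []
    if current:
        exons.append("".join(current))
    return exons, phases


# Seq. 26 ANT1, copied from the record below.
ant1 = [
    "MGDHAWSFLKDFLAGGVAAAVSKTAVAPIERVKLLLQ                        (PHASE 0 INTRON)",
    "VQHASKQISAEKQYKGIIDCVVRIPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQ",
    "LFLGGVDRHKQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKGAAQREFHGL",
    "GDCIIKIFKSDGLRGLYQGFNVSVQGIIIYRAAYFGVYDTAKG                  (PHASE 1 INTRON)",
    "MLPDPKNVHIFVSWMIAQSVTAVAGLVSYPFDTVRRRMMMQSGRKGA              (PHASE 1 INTRON)",
    "DIMYTGTVDCWRKIAKDEGAKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYV* ",
]
exons, phases = parse_record(ant1)
print(len(exons), phases)  # 4 [0, 1, 1]
```

Joining the exon strings gives back the full protein, so the same function could be
used to compare intron positions between related carriers such as ANT1, ANT2 and ANT3.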

26 ANT1 HUMAN  T1         

MGDHAWSFLKDFLAGGVAAAVSKTAVAPIERVKLLLQ                        (PHASE 0 INTRON)
VQHASKQISAEKQYKGIIDCVVRIPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQ
LFLGGVDRHKQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKGAAQREFHGL
GDCIIKIFKSDGLRGLYQGFNVSVQGIIIYRAAYFGVYDTAKG                  (PHASE 1 INTRON)
MLPDPKNVHIFVSWMIAQSVTAVAGLVSYPFDTVRRRMMMQSGRKGA              (PHASE 1 INTRON)
DIMYTGTVDCWRKIAKDEGAKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYV* 

30 ANT3 HUMAN  T2         

MTEQAISFAKDFLAGGIAAAISKTAVAPIERVKLLLQ                        (PHASE 0 INTRON)
VQHASKQIAADKQYKGIVDCIVRIPKEQGVLSFWRGNLANVIRYFPTQALNFAFKDKYKQ
IFLGGVDKHTQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKSGTEREFRGL
GDCLVKITKSDGIRGLYQGFSVSVQGIIIYRAAYFGVYDTAKG                  (PHASE 1 INTRON)
MLPDPKNTHIVVSWMIAQTVTAVAGVVSYPFDTVRRRMMMQSGRKG               (PHASE 1 INTRON)
ADIMYTGTVDCWRKIFRDEGGKAFFKGAWSNVLRGMGGAFVLVLYDELKKVI*  

AC036184.3 chromosome 2 FRAGMENT 72% TO ANT3
Sbjct: 62267 KEADLEYTGTLDCWRTIFREERGKAFFKGMWSS 62169

AP001522.2 chromosome 18 ANT LIKE FRAGMENTS

109963 LGGMDRHTRLWRYLVGSLASGG
109964 AAGATFLCFLYPLDFARTCLAADVGK 110100
110144 EGHQVRRAPGPPGLQHLHAGHHHLPAADPSMKNSAKGELPDAKSPWW 110284
110285 EMDDHRGCSSHPFHAVWQTMVQPGLRAAHITYTAPSAVGERPLKMRRQGLLPGRWA 110452
110453 GVLRPGG 110473

AL354854.5 chromosome 9 clone RP11-572H4, ANT3 pseudo 85%

MTEQAISFAKDFLAGGITAAISKTAVASIKRVQLLLQMQHASMPMAAAKQCKGIVDCIVR 103474
103475 IPKDQGVLSFWRGNLANVIRYSPTQALNFAFKDKYKQIFLAGVDKHTQFCRYFAGNLASG 103654
103655 GTAVVYPLDFTRTRLAADVGKSGTEREFRGLGDCLVKISKSDGIRGLYQGFSVS 103816
103817 VQAIIIYQAAYFRVYDTANGMFPDPKNTHILVSWMTAQTVTAVAGVLS*PFDTVRRRTMM 103996
103997 QSRRKGADIMYTGTVDCWRKIFRDERGEAFFKGVWSNALKGMGVGAGFVLVLYDELK 104167

AL353573.10 chromosome 9 clone RP11-550E18 ANT2 pseudo 84%

146646 SFAEDFLAGGVAVAIPKMAVVPIEGVKLLLQVQHASKQITTDKQYKGIIDCVVRIPKEQG 146467
146466 VLSFWRGNLANVIR*FPTQALNFTFKDKNKQIFLGGVDKKTQFWHDFAGNLASGSAAGAT 146287
146286 SLCFVYPFGFAHTFLAADVGRPGGERGFRDLDDCLVKIYKSDGIKGLYQGFNVPVQGIVI 146107
146106 YQAVYFSIYGTVKGMLPDPKNTRIVISWMMAQSITSI 145996
145994 AGLTSYPFDTVCRCMMMQSELKATDIMYTGTLDCWRKIARDEGDKAFFKGAWSNVLRGMG 145815
145814 GAFVLVLCDEIKKY 145773

AL161783.5 chromosome 9 clone RP11-334P12 ANT2 pseudo 95% to AL353573.10 

83203 QVQHASKQIASDNQYKGIIDCVVRIHKEQGVLSFWRGTLANVIR*FPTQALIFTFKDKNR 83024
83023 QILLGGVDKKTQFWHDFAGNLASDSVAGATSLCFVYPFGFAHTFLAADVGRPAGERGFRD 82844
82843 LDDCLVKIYKSDGIKGLYQGFNVPVQGIVIYQAVYFSIYGTVKGMLPDPKNTRIVISWMM 82664
82663 AQSITSI 82643
82641 AGLTSYPFDTVCRCMMMQSELKATDIMYTGTLDCWRKIARDEGDKAFFKGAWSNVLRGMG 82462
82461 GAFVLVLCDEIKKY 82420

AC008488.6 chromosome 5 pseudogene 67% to ANT2

157875 MTDATVTFSKDFLAGGVAAAISKNTAVPIKWVKLLLQVQHASKQITADKLCKD*TDSVVH 157696
157695 TPKEQGVLSFCHGNLASVIRYLPT*AVNFAQGKQKQIF*GGVDERPQLWCYIAGNLAPS 157519
157518 DATGPHPCVWCTLLIFAAT*LCTCLAANVGKARAERESRSLSNCLVKIYKSDGIKRLNQG 157339
157338 FNVSVWGIIIYQAPYFGIRDTAKGIFPDPKNANIFIS 157228
157225 MIAQSVTGIAGIKSHSYPFDTVCHHKMTGSGFKGTDILHTGIPDCWRKIAHDEGDKAFLK 157046
157045 GAWSSILRGMGGVSMLFLYDKIKKY 156971

AC025170.4 chromosome 5 99% to AC008488.6, one more stop codon, may be the same pseudogene

MTDATVTFSKDFLAGGVAAAISKNTAVPIKWVKLLLQVQHASKQITADKLCKD*TDSVVH 46221
46222 TPKEQGVLSFCHGNLASVIRYLPT*AVNFAQGKQKQIF*GGVDERPQLWCYIAGNLAPS 46398
46399 DATGPHPCVWCTLLIFAAT*LCTCLAANVGKARAERESRSLSNCLVKIYKSDGIKRLNQG 46578
46579 FNVSVWGIIIYQAPYFGIRDTAKGIFPDPKNANIFIS 46689
46692 MIAQSVTGIAGIKSHSYPFDTVCHHKMTGSGFKGTDILHTGIPDCWRKIAHDEGDKAFLK 46871
46872 GA*SSILRGMGGVSMLFLYDKIKKY 46946

AC023504.12 chromosome 3 ANT2 pseudo 78%
 
106148 MIDVTVSFAKDFLGDGVATAIFKIAVAPTVQVKLLMQVQHASKHITADKQYKGIIDCVVR 106327
106328 IPKEQGSPVLLAQ*PGHCLQILPHPGSQLRLQR*IQADLPGWGGQEDPVLALLCRESDIR 106507
106508 LCRWGHILVFCVPS*FCPYSSSSRCG*S*SEREFRGLSDCLVKICKSNGIKGPYQGFNVS 106687
106688 VQVIIIYRAAYFGIYDTRNGMLPDPKNTHIVISWMIAQTVRAVAGLASYPLDTVRPRMMM 106867
106868 QSGCKGTDLMYTGMLDCWRKIASDEGGKAFLKGAWSSVLRGTGGAFVLVLYD*IKNYI 107041

AC012525.6 chromosome 4 ANT2 pseudo 56% 

33550 KDFLAGGVTAAISKMAVAPTEGSSCCCRCRVPASRSPQISNTRAL*TAWSAFPRSRSPVP 33729
33730 LAR*PGQCHQILPYPRSQLCLQR*KQADLPGGCDKRIQFWHKFAGSLASGGAPGAT 33897
33898 SLCFVYPLDFDRTHLAADVGKAGAEREFQGLGDRLVKIYKSDGIKGLYQGSNRSVQGIII 34077
34078 YRAACFGVYDTARRMLPDSRNTHMSSAV*SRSPSLPLLG*LPIHLTLFATE**CSQGA 34251
34252 ADIMYTGRLHCWRKIAPDEGGRAFFKGAWSNVLRGMGGAFVLVLYDEIR 34398

AL354943.8 chromosome 6 ANT PSEUDOGENE = AC073199.3 chromosome 6

23256 MTQQAISFAKDHLAGGIAAAVAKIVVALIEQVKLL*QMQGARKQVTAD 23113
23098 IVDYFLKQALNITFKDKQTQLFLG 23027
23030 SMDEHTQF*SCFAHNLASNKAARATCLCFVYLLGFARTHLVVNVGKSYTELGFRFKDLG 22851
22850 DCLVKISKSDGIWGLYQGFSISVQGIILFWAA*RPKACSLTPRPPHHGEPDDCP 22689
22688 DCGGAANMTFYPLGPVQTQRSQHTAHGNPQLLEEDF*GGKGKAFFKSAWSNV 22533
22532 LRDMGGAFRPVLYVELKK 22479

AC073199.3 chromosome 6 = AL354943.8 chromosome 6 ANT PSEUDOGENE 

40076 MTQQAISFAKDHLAGGIAAAVAKIVVALIEQVKLL*QMQGARKQVTAD 39933
39918 IVDYFLKQALNITFKDKQTQLFLG 39847
39850 SMDEHTQF*SCFAHNLASNKAARATCLCFVYLLGFARTHLVVNVGKSYTELGFRFKDLG 39671
39670 DCLVKISKSDGIWGLYQGFSISVQGIILFWAA*RPKACSLTPRPPHHGEPDDCP 39509
39508 DCGGAANMTFYPLGPVQTQRSQHTAHGNPQLLEEDF*GGKGKAFFKSAWSNV 39353
39352 LRDMGGAFRPVPYVELKK 39299

AL392084.3 chromosome 9 ANT PSEUDO
150374 AVAPSEWVKLLLQVQRASEQIMPDKWVNGIMDCMVRIPKEQGVLFFWVGNLANAIRYFPT 150195
150194 *VLNFAFRYKYK*VFLGWRGRGGVWTSTCSSEGIFQEPGLWR
150100 VFFRNLDSGGAAQASSHCLIYPLDFSRTHLAA--RKLGTTWELEGLGDFLGKITKSDGIWGLYQ 149913
149947 GASIKA*CLRPGTIIYLVTSLSVYNTAKGALPDLKNTCIVVNWMIAQT*WP 149774
149773 *SSVAS*PFNTVWRWMMMQCWHKGADVRYMGILDCWTKILKDEESKAFFKGAWSNILRGM 149594
149593 E*AFILVL 149570

AC015501.4 ANT1 LIKE FRAGMENTS 79%

173111 ASSFGKDLLAGGVAAAVSKTAVAPIERVKLLLQVQASSKQISPEARYKGMVDCLVRIPRE 172932
172931 QG 172926
68112 GLLPKPKKTPFLVSFFIAQVVTTCSGILSYPFDTVRRRMMMQ 68237

AC021286.3 72% TO ANT1 probable pseudogene lacks part of exon at RRRMMMQ joint
no start codon at the right location
8724 ASSFGKDLLAGGVAAAVSKTAVAPIERVKLLLQVQASSKQISPEARYKGMVDCLVRIPREQG 8539
176036 FFSFWRGNLANVIRYFPTQALNFAFKDKYKQLFMSGVNKEK
156441 QFWRWFLANLASGGAAGATSLCVVYPLDFARTRLGVDIGKG 156319
153616 PEERQFKGLGDCIMKIAKSDGIAGLYQGFGVSVQGIIVYRASYFGAYDTVK 153461
151929 GLLPKPKKTPFLVSFFIAQVVTTCSGILSYPFDTVRRRMMMQVFYVI 151789
147274 YKGTLDCFVKIYQHEGISSFFRGAFSNVLRGTGGALVLVLYDKIKEFFHIDIGGR* 147007

AC022220.7 chromosome X ANT-LIKE FRAGMENTS
230087 IPRGLTMPEVQCVCKNIASDKQYKGIEDYIVCFPKEK*ILFFWRGSLGS 229941
229944 FWRYFADNLASGGTSGVIFSFFYSLGFARTHWATNIGKPVKGKKFKGIEDCV 229789
229665 WGEDFKDECGKAFNKDA*SNVLRGIGGMLILYDKLKK 229555

AC073269.4 chromosome 7 clone RP11-436F9, 88% to ANT2 probable pseudogene
frameshift at RRRMMM region

MTDAAVSFAKDFLAGGVAEAISKTAVAPIEQVKLLLQVQHASKQITSAKQYKGIIDCVV 54712
54658 RIPKKQGVLSWRGNLANAIRYFPTQAFNFAFKDKYK 54816
54817 QIFLGGVDK 54843
54828 KTQFGRYFAGNLASGGAAGATYLCFVYPLDFARTCLAANVGKAEAEREFRGLCDC 55007
55008 LVKIYKSDGIKGLYQGFNVSMQGIIRAAYFGIYDTAKGMLPDPKDTHIVISWMTTQTV 55181
55182 TAFSGLTSYSFDIVR 55226 frameshift
55232 VMIQSGRKVTDIMYTGTLDCWRKIAGDEGGKAFFKGSWSSVLRGMGGAFVLVLFDEIKKY 55411

33 ANT2 HUMAN  T3         

MTDAAVSFAKDFLAGGVAAAISKTAVAPIERVKLLLQ                        (PHASE 0 INTRON)
VQHASKQITADKQYKGIIDCVVRIPKEQGVLSFWRGNLANVIRYFPTQALNFAFKDKYKQ
IFLGGVDKRTQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKSGAEREFRGL
GDCLVKIYKSDGIKGLYQGFNVSVQGIIIYRAAYFGIYDTAKG                  (PHASE 1 INTRON)
MLPDPKNTHIVISWMIAQTVTAVAGLTSYPFDTVRRRMMMQSGRKGT              (PHASE 1 INTRON)
DIMYTGTLDCWRKIARDEGGKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYT*  

AL353716.9 chromosome 6 ANT2-LIKE N-TERM FRAG
186096 MTDAAVSFAKDFLAGGVTTAISKMAVVPIKLLKFIFVI 186209

AL390965.10 chromosome 13 ANT2 pseudo 63%

LAGGVVTAISKAAGAPIEWLKLLLQVQHATKEITTDKQYKHIIDCVVCIPKEERVLSFWR 160213
160212 GNLANVIRYFPTQ 160165
160171 ALNFAFFFKGSWSIVFSGMSGAFVLVLYHEIKKYA-CAGATSLCFVYPLDFVCTNLPVDV 159995
159994 GKAGAKREFRSLGDCLVKIYKSNRIKGLHQGFTMSVQGISIYPAAYFSICDTAKGMLPDS 159815
159814 KS 159809
159777 QPVIVIAGLNSYPFDTVHC*MSMQSG 159700
159411 AGLEYTGIFDC*RNIARDEGGKDF 159340

AC069303.2 chromosome 2 clone RP11-332N21, Query: 1 ANT2 pseudo 88%
MTDATVSFAKDFLAGGVAAAISKMAVVPIQRVKLLLQVQHASKQVTADKQYKGIIDCVVC 17144
ISKEQGVLSFWRGNLANVIRYFPTQAFNFAFKDKYKQIFLGDVDKRTQFWRYFEGNLTSG 16964
SAAGATSLCFVYPLDFALTRLAANVGKAGAEREFRSLGDCLVKIYKSDGIKGLYQGFNMS 16784
VQGIIIYRAAYFSIYDTAKGMLPDPKNTHILIS*MITQTVTAVAGLTSYPFDTIRHHMMM 16604
QSGCKGTDIMYTGMLNCWRKIARDVGGKASFKGAWSSVFRGTGGAFVLVLHNEIKKY 16433

44 AC026309.15 chromosome 3 ESTs T07400 T34496 R19000
FRAMESHIFT IN THIS SEQ AT VTQ/PAD BUT NOT IN PSEUDOGENE AC025032.10
THIS IS PROBABLY A SEQUENCING ERROR.

6014 MIQNSRPSLLQPQDVGDTVETLM 5946                            (PHASE 0 INTRON)
241  LHPVIKAFLCGSISGTCSTLLFQPLDLLKTRLQTLQPSDHG               (PHASE 2 INTRON)
SRRVGMLAVLLKVVRTESLLGLWKGMSP                                 (PHASE 0 INTRON)
SIVRCVPGVGIYFGTLYSLKQYFLRGHPPTALESVMLGVGSRSVAGVCMSPITVIKTRYE (PHASE 0 INTRON)
SGKYGYESIYAALRSIYHSEGHRGLFSGLTATLLRDAPFSGIYLMFYNQTKNIVPHD    (PHASE 1 INTRON)
QVDATLIPITNFSCGIFAGILASLVTQPADVIKTHMQLYPLKFQWVGQAVTLIFK      (PHASE 0 INTRON)
DYGLRGFFQGGIPRALRRTLMAAMAWTVYEEMMAKMGLKS*  

AC025032.10 chromosome 3 clone 92% TO SEQ 44 (PSEUDOGENE)

42799 MIQNSHSSLLQPQDVRDTVETLM 42731 
      LHLVIKAFLCGSISGTCSTLLFQPLDLLKTRLQTLQPSDHGSRR 42571
42570 VGMLAVLLKVVHTESLLGLWKGMSPSVVRCVPGVGIYFGTL*SLKRYFLRGHPPTALES 42394
42393 IMLGVGSLSVAGVCMLPITVIKMCYESGKYGYESIYAAVRSIYRSEGHRGLFSG 42232
42231 LTATLLRDMPFSGNYLMFYNQTKNIV 42154
      PHDQVDATFIPITNFSCGI 42097
42096 FAGILASLVTQPADVIKTHMQLYPLKFQWIGQAVTLTFKDYGLRGFFQG 41950
      GIPRALCRTLMAAMA*MVYEEMMAKMGLKS 41860

AC073880.4 9 diffs to ORNT1 partial seq 

84806 MKSNPPIQAAIDLMAGAAVTVLCLIS 84729
80769 LGPVPLMLSGG 80737
80735 GVGGICLWLAVYPVDCNKSRIQVLSISGKQAGFIRTFINVVKNEG 80601
79972 GIMALYSGLKPTMIRAFPDNGALFLAYEYTRKLMMSQLEAY 79850

AF254982.1 chromosome 21 clone probable ORNT1 pseudogene

194913 LGPVPLMLSGG 194945
194947 GVGGICLWLAVYLVDCIKSRIQVLSMS*KQAGFIRTFINVVKNEG 195081
195716 GIMALHSGLKPTMIRALPDDLALFLADEYTRKLMMSQLEAY 195838

AC027723.2 chromosome 10 probable ORNT1 pseudogene

  3368 LGPVPLMLSGG 3400
  3402 GVGGICLWLAVYLVDCIKSRIQVLSMS*KQAGFIRTFINVVKNEG 3536
148951 GIMALHSGLKPTMIRALPEDLALFLADEYTRKLMMSQLEAY 148829

AL354798.6 chromosome 13 clone probable ORNT1 pseudogene also AC018739.4 

64232 GVGGICLWLAVYPVDCIK*RIQVLSMSGKQAGFIRTFINVVKNEG 64098
63466 GIMALYSGLKPTMIRAFPDNGALFLAYEYSRNLMMSQLEAY 63344

AL139386.7 chromosome 13 ORNT1 pseudogene
27713 GVGGICLWLAVYLVDCIKSRTQVLSMS*KQAGFIRTFINVVKNEG 27847
28481 GIMALHSGLKPTMIRALPDDLALFLADEYTRKLMMSQLEAY 28603

AL356259.3 chromosome 13 partial seq like ORNT1

55181 MKSNPPIQAAIDLMAGAASTVLCLIT 55258
59218 LGPVPLMLSGGGSWRDLLWLAVYPVDCIKSRIQVLSMSGKQAEFIRTFINVVKNE 59382
60019 GIMALYSGLKPT 60054
60054 MIRAFPDNGALFLAYEYSRNLMMSQLEAY 60140

50,53 AL360268 chr 9 BE269761.1 BE297822.1 AA368408 AA325835 = seq 53 THC126756
intron structure shown for carrier part only

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQ
KIVQAGDKDLDGQLDFEEFVHYLQDHEKKLRLVFKSLDKKNDG
RIDAQEIMQSLRDLGVKISEQQAEKILK
SMDKNGTMTIDWNEWRDYHLLHPVENIPEIILYWKHST                         (PHASE 0 INTRON)
IFDVGENLTVPDEFTVEERQTGMWWRHLVAGGGAGAVSRTCTAPLDRLKVLMQ          (PHASE 0 INTRON)
VHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIKFMAYEQ            (PHASE 0 INTRON)
IKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPME                           (PHASE 0 INTRON)
VLKTRMALRKTGQYSGMLDCARRILAREGVAAFYKGYVPNMLGIIPYAGIDLAVYE       (PHASE 0 INTRON)
TLKNAWLQHYAVNSADPGVFVLLACGTMSSTCGQLASYPLALVRTRMQAQA            (PHASE 1 INTRON)
SIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISYVVYENLKITLGVQSR*

AL390038.5 chromosome 1 56% to seq 50/53 one short fragment
WKYLLAGGIVGTYPQTCTMPLDHLKILLQV

52 HUMAN W24006 AC016331.2 = 83K exon 8 + 9 AI814230 wj70e02.x1
intron structure shown for carrier part only
MRGSPGDAERRQRWGRLFEELDSNKDGRVDVHELRQGLARLGGGNPDPGAQHG
ISSEGDADPDGGLDLEEFSRYLQEREQRLLLMFHSLDRNQDG
HIDVSEIQQSFRALGISISLEQAEKILH
SMDRDGTMTIDWQEWRDHFLLHSLENVEDVLYFWKHST                         (PHASE 0 INTRON)
VLDIGECLTVPDEFSKQEKLTGMWWKQLVAGAVAGAVSRTGTAPLDRLKVFMQ          (PHASE 0 INTRON)
VHASKTNRLNILGGLRSMVLEGGIRSLWRGNGINVLKIAPESAIKFMAYEQ            (PHASE 0 INTRON)
IKRAILGQQETLHVQERFVAGSLAGATAQTIIYPME                           (PHASE 0 INTRON)
VLKTRLTLRRTGQYKGLLDCARRILEREGPRAFYRGYLPNVLGIIPYAGIDLAVYE       (PHASE 0 INTRON)
TLKNWWLQQYSHDSADPGILVLLACGTISSTCGQIASYPLALVRTRMQAQA            (PHASE 1 INTRON)
SIEGGPQLSMLGLLRHILSQEGMRGLYRGIAPNFMKVIPAVSISYVVYENMKQALGVTSR*

52A AC016331.2 2nd gene missing N-term extension Also on AC011539.5 chr 19
AA909324 AC016331.2 AC011539.5 chromosome 19  AI678642          
52A EST 204-300 AI678642.1 tu58b01.x1 (gene is expressed)

VLDTGEQLMVPVEVLEVDNKEVLWKFLLSGAMAGAVSRTGTAPLDRAKVYMQ           (PHASE 0 INTRON)
VYSSKTNFTNLLGGLQSMVQEGGFRSLWRGNGINVLKIAPEYAIKFSVFEQ            (PHASE 0 INTRON)
CKNYFCGIQGSPPFERLLAGSLAVAISQTLINPME                            (PHASE 0 INTRON)
VLKTRLTLRRTGQYKGLLDCARQILQREGTRALYRGYLPNMLGIIPYACTDLAVYE       (PHASE 0 INTRON)
MLQCFWVKSGRDMGDPSGLVSLSSVTLSTTCGQMASYPLTLVRTRMQAQD             (PHASE 1 INTRON)
TVEGSNPTMRGVLQRILAQQGWLGLYRGMTPTLLKVLPAGGISYVVYEAMKKTLGI*


MOUSE GENE = VLDTGEQLMVPVDVLEEENKGTLWKFLLSGAMAGAVSRTGTAPLDRARVYMQ

52B human B40956 gss frag. AL359258.4 first gene on clone   AL390038.5 chr 1 H02381
intron structure shown for carrier part only

MLLWMQGFVLEAVACQDNDDYLRYGILFEDLDCNGDGVVDIIELQEGLRNWSSAFDPNSEE 117837
SMDSDGSMTVDWDEWKYYFLLHPATNITEMIHFWKHST                         (PHASE 0 INTRON)
LIDIGEISAIPDEFTEQEKQSGDWWKRLVSAGIASAVARTCTAPLDRLKVMMQ          (PHASE 0 INTRON)
VHSLKSRKMRLISGLEQLVKEGGIFSLW*GNGVNVLKIAPETALKVGAYEQ            (PHASE 0 INTRON)
YKKLLSFDGVHLGILERFIFGSLAGVTAQTCIYPME                           (PHASE 0 INTRON)
VLKTRLAIGKTGEYSGIIDCGKKLLKQEGVRSFFKGYTPNLLGIVPYAGIDLAVYE       (PHASE 0 INTRON)
QILKNFWLENYAGNSVNPGIMILVGCSTLSNTCGQLASFSVNLIRTRMQAS            (PHASE 1 INTRON)
APVEKGKTTSMIQLIQEIYTKEGKLGFYRGFTSNIIKVLPAVGVGCVAYEKVKPLFGLTWK*

52C AL390038.5 exon 4 35503 35619 = AL392088 chromosome 1 only exon on AL392088

STDIDGSMTVDWVEWRKNFFLNLQKKDVEEVAHYWKHVT

55 B40956 gss frag. = AL359258.4 chromosome 1 second gene on clone exon 5 = 1625-1458
AF123303 = seq 55 human peroxisomal calcium dependent transporter BE885075 has N-term
AA001086 ze47c08.r1 AL356110.1 chromosome 1 AC013627.3 AL390036.6 AA001086 BE731210.1
intron structure shown for carrier part only
MLRWLRDFVLPTAACQDAEQPTRYETLFQALDRNGDGVVDIGELQEGLRNL
GIPLGQDAEEKIFTTGDVNKDGKLDFEEFMKYLKDHEKKMKLAFKSLDKNNDGKIEAS
EIVQSLQTLGLTISEQQAELILQSIDVDGTMTVDWNEWRDYFLFNPVTDIEEIIRFWKHST  (PHASE 0 INTRON)
GIDIGDSLTIPDEFTEDEKKSGQWWRQLLAGGIAGAVSRTSTAPLDRLKIMMQ          (PHASE 0 INTRON)
VHGSKSDKMNIFGGFRQMVKEGGIRSLWRGNGTNVIKIAPETAVKFWAYEQ            (PHASE 0 INTRON)
YKKLLTEEGQKIGTFERFISGSMAGATAQTFIYPME                           (PHASE 0 INTRON)
VMKTRLAVGKTGQYSGIYDCAKKILKHEGLGAFYKGYVPNLLGIIPYAGIDLAVYE       (PHASE 0 INTRON)
LLKSYWLDNFAKDSVNPGVMVLLGCGALSSTCGQLASYPLALVRTRMQAQA            (PHASE 1 INTRON)
MLEGSPQLNMVGLFRRIISKEGIPGLYRGITPNFMKVLPAVGISYVVYENMKQTLGVTQK*

57 GRAVES' DISEASE ANTIGEN AC037456.6 chromosome 10 also AL360177.15

MAAATAAAALAAADPPPAMPGAAGAGGPTTRRDFYWLRSFLAGG                   (PHASE 1 INTRON)
IAGCCAKTTVAPLDRVKVLLQAHNHHYKHLG                                (PHASE 1 INTRON)
VFSALRAVPQKEGFLGLYKGNGAMMIRIFPYGAIQFMAFEHYKT                   (PHASE 0 INTRON)
LITTKLGISGHVHRLMAGSMAG                                         (PHASE 1 INTRON)
MTAVICTDPVDMVRVRLAFQVKGEHRYTGIIHAFKTIYAK                       (PHASE 0 INTRON)
EGGFFGFYRGLMPTILGMAPYAG                                        (PHASE 1 INTRON)
VSFFTFGTLKSVGLSHAPTLLGSPSSDNPNVLVLKTHVNLLCGGVAGAIAQTIS         (PHASE 2 INTRON)
YPFDVTRRRMQLGTVLPEFEKCL                                        (PHASE 2 INTRON)
TMRDTMKYDYGHHGIRKGLYRGLSLNYIRCIPSQAVAFTTYELMKQFFHLN*

AC074273.4 chromosome 4 59% to Graves' antigen
143825 LGLYKDNETTMIQIFSYSATQFVAFEYYKMFTTMELRISGHMHILTRSTEGITAVTCTY 143649
143648 PVAMVRVPVVQVKGEHNYAGII 143583

62 THC160221 AC004143.1 AC002126.1 TIGR CONTIG THC160221

MRMLRLSCPRPSHQRQ                                          (PHASE 0 INTRON)
RDHRQVLSSLLSGALAGALAKTAVAPLDRTKIIFQV                      (PHASE 1 INTRON)
SSKRFSAK                                                  (PHASE 0 INTRON)
EAFRVLYYTYLNEGFLSLWRGNSATMVRVVPYAAIQFSAHEEYKRILGSYYGFRGE  (PHASE 2 INTRON)
ALPPWPRLFAGALAGTTAASLTYPLDLVRARMAVTPKEM                   (PHASE 2 INTRON)
YSNIFHVFIRISREEGLKTLYHGFMPTVLGVIPYAGLSFFTYETLKSLHRE       (PHASE 1 INTRON)
YSGRRQPYPFERMIFGACAGLIGQSASYPLDVVRRRMQTAGVTGYPRASIA   
RTLRTIVREEGAVRGLYKGLSMNWVKGPIAVGISFTTFDLMQILLRHLQS*   

70C NM_012140 DIC GENE = AJ131612.1

MAAEARVSRWYFGGLASCGAACCTHPLDLLK                        (PHASE 0 INTRON)
VHLQTQQEVKLRMTGMALRVVRTDGILALYSGLSASLCRQ               (PHASE 0 INTRON)
MTYSLTRFAIYETVRDRVAKGSQGPLPFHEKVLLGSVSG                (PHASE 1 INTRON)
LAGGFVGTPADLVNVR                                       (PHASE 2 INTRON)
MQNDVKLPQGQRRN                                         (PHASE 2 INTRON)
YAHALDGLYRVAREE                                        (PHASE 1 INTRON)
GLRRLFSGATMASSRGALVTVGQ                                (PHASE 0 INTRON)
LSCYDQAKQRVLSTGYLSDNIFTHFVASFIA                        (PHASE 0 INTRON)
GGCATFLCQPLDVLKTRLMNSKGEYQ                             (PHASE 0 INTRON)
GVFHCAVETAKLGPLAFYK                                    (PHASE 0 INTRON)
GLVPAGIRLIPHTVLTFVFLEQLRKNFGIKVPS*

73B  AC008579.4 chromosome 5 clone UNKNOWN SEQ 87% to AF177333.1
also AC005618.1 chromosome 5 This sequence has no introns (processed gene).
140638 MKSGPGIQAAIDLTAGAAGGTACVLTGQPFDTIKVKMQTFPDLYKGLTDCFLKTYAQVGLRGF 140826
140827 YKGTGPALMAYVAENSVLFMCYGFCQQFVRKVAGMDKQAKLSDLQTAAAGSFASAFAALA 141006
141007 LCPTELVKCRLQTMYEMEMSGKIAKSHNTIWSVVKGILKKDGPLGFYHGLSSTLLQEVPG 141186
141187 YFFFFGGYELSRSFFASGRSKDELGPVHLMLSGGVAGICLWLVVFPVDCIKSRIQVL 141357
141358 SMYGKQAGFIGTLLSVVRNEGIVALYSGLKATMIRAIPANGALFVAYEYSRKMMMKQLEAY 141543

73A   AF177333.1 AL161614.5 chromosome 13 87% to AC008579.4
also NM_014252.1 solute carrier family 25 (mitochondrial carrier;
ornithine transporter) member 15 (SLC25A15)
also AF112968.1 ornithine transporter (ORNT1) mRNA
MKSNPAIQAAIDLTAGAAG                                       (PHASE 1 INTRON)
GTACVLTGQPFDTMKVKMQTFPDLYRGLTDCCLKTYSQVGFRG
FYKGTSPALIANIAENSVLFMCYGFCQQVVRKVAGLDKQAKLS               (PHASE 2 INTRON)
DLQNAAAGSFASAFAALVLCPTELVKCRLQTMYEMETSGKIAKSQN            (PHASE 2 INTRON)
TVWSVIKSILRKDGPLGFYHGLSSTLLREVPGYFFFFGGYELSRSFFASGRSKDELG (PHASE 1 INTRON)
PVPLMLSGGVGGICLWLAVYPVDCIKSRIQVLSMSGKQAGFIRTFLNVVKNEG     (PHASE 1 INTRON)
ITALYSGLKPTMIRAFPANGALFLAYEYSRKLMMNQLEAY

74 Z28872 HUMAN CARNITINE 

MADQPKPISPLKNLLAGGFGGVCLVFVGHPLDTVK                       (PHASE 0 INTRON)
VRLQTQPPSLPGQPPMYSGTFDCFRKTLFRE                           (PHASE 0 INTRON)
GITGLYRGMAAPIIGVTPMFAVCFFGFGLGKKLQQKHPEDVLS               (PHASE 2 INTRON)
YPQLFAAGMLSGVFTTGIMTPGERIKCLLQ                            (PHASE 0 INTRON)
IQASSGESKYTGTLDCAKKLYQEFGIRGIYKGTVLTLMRD                  (PHASE 1 INTRON)
VPASGMYFMTYEWLKNIFTPEGKR                                  (PHASE 2 INTRON)
VSELSAPRILVAGGIAGIFNWAVAIPPDVLKSRFQTA                     (PHASE 1 INTRON)
PPGKYPNGFRDVLRELIRDEGVTSLYKGFNAVMIRAFPANA                 (PHASE 0 INTRON)
ACFLGFEVAMKFLNWATPNL*

AL391221.6 chromosome 6 PSEUDOGENE 84% TO 74 CARNITINE CARRIER ALSO = AL355513.4
158968 IADQPKPI
158944 SLLKNLLASGFGGMCLVFMGHPLDTVKVRLQTQPPSLPRQPPMYSGTFDSLPKTLRRD 158771
158770 ITGLYKGMAAPIIGVTPIFAVCFFGFGLRKKL*QKHPEDVLSYPQLFAAGMLSGTF 158603
158602 TTGIATPGEPIKSLLHFQPSSGETKYTGTLDCAKKLYQEFQIRGIYKGTVLTLM*DVP 158429
158428 ASGTYFMTNEWLKNIFTPEGKRVCELSVP*ILVAGCIAGIFNWAMAVPQDVLKYPFQT 158255
APPGKYPNSFGDVLRELIWDEGITSLSKGSDAVMTRAFPANAACFLGLEVAMKFLNWATPNL*

78 EST T23648 = Z58487.1 genomic frag. AC011427.2 chr 5 AI625818.1 ty65a06.x1
the seq LKSRAVAPAEQ is not in the alignment

MGSFQLEDFAAGWIGG                                         (PHASE 1 INTRON)
AASVIVGHPLDTVK                                           (PHASE 0 INTRON)
TRLQAGVGYGNTLSCIRVVYRRES                                 (PHASE 0 INTRON)
MFGFFKGMSFPLASIAVYNSVVFGVFSNTQRFLSQHRCGEPEA
SPPRTLSDLLLASMVAGVVSVGLGGPVDLIKIRLQMQTQPFRDA             (PHASE 1 INTRON)
NLGLKSRAVAPAEQPAYQGPVHCITTIVRNEGLAGLYRGASA
MLLRDVPGYCLYFIPYVFLSEWITPEACTGPSPCAVWLAGGMAG             (PHASE 1 INTRON)
AISWGTATPMDVVKSRLQADGVYLNKYKGVLDCISQSYQKEGLK             (PHASE 0 INTRON)
VFFRGITVNAVRGFPMSAAMFLGYELSLQAIRGDHAVTSP*

78A AL157871.2 chromosome 14 UNKNOWN SEQUENCE = AL135838.3 BE791151 EST
TWO GENES ON THIS BAC. 2 INTRONS ARE MISSING COMPARED TO GENE 2
gene 1 

MALDFLAGCAGG                 (PHASE 1 INTRON)
VAGVLVGHPFDTVK               (PHASE 0 INTRON)
VRLQVQSVEKPQYRGTLHCFKSIIKQES (PHASE 0 INTRON)
100992 VLGLYKGLGSPLMGLTFINALVFGVQGNTLRALGHDSP
100884 LNQFLAGAAAGAIQCVICCPMELAKTRLQLQDAGPARTYKGSLDCLAQIYGHEGLR 100717
100716 GVNRGMVSTLLRETPSFGVYFLTYDALTRALGCEPGDRLLVPKLLLAGGTSGIVSW 100549
100548 LSTYPVDVVKSRLQADGLRGAPRYRGILDCVHQSYRAEGWRVFTRGLASTLLRAFPVNAATFA 100363
TVTVVLTYARGEEAGPEGEAVPAAPAGPALAQPSSL*

78B AL157871.2 gene 2 78B Also on AL135838.3 TQAQKQQRRLSASGPLAVPP not in alignment
131377 MDFVAGAIGG 131403                                       (PHASE 1 INTRON)
VCGVAVGYPLDTVK                                                 (PHASE 0 INTRON)
VRIQTEPKYTGIWHCVRDTYHRER                                       (PHASE 0 INTRON)
VWGFYRGLSLPVCTVSLVSSVSFGTYRHCLAHICRLRYGNPDAKPTKADITLSGCASGLVR  (PHASE 0 INTRON)
VFLTSPTEVAKVRLQTQTQAQKQQRRLSASGPLAVPPMCPVPPACPEPKYRGPLHCLATV 136867
136868 AREEGLCGLYKGSSALVLRDGHSFATYFLSYAVLCEWLSPAGHSRPD 137020  (PHASE 1 INTRON)
137326 VPGVLVAGGCAGVLAWAVATPMDVIKSRLQADGQGQRRYRGLLHCMVTSVREEGPRVLF 137505
137506 KGLVLNCCRAFPVNMVVFVAYEAVLRLARGLLT* 137607

Mouse ESTs of N-term
>gb|AI662981.1|AI662981 uj64c08.y1 Sugano mouse liver mlia Mus musculus cDNA 

MDFVAGAIGGVCGVAVGYPLDTVKVRIQTEPKYTGIWHCVRDTYHRERVWGFYRGL 69
MDFVAGAIGGVCGVAVGYPLDTVKVRIQTE KY GIWHC+RDTY +ERVWGFYRGL
MDFVAGAIGGVCGVAVGYPLDTVKVRIQTEAKYAGIWHCIRDTYRQERVWGFYRGL 222

Query: 70  SLPVCTVSLVSSVSFGTYRHCLAHICRLRYGNPDAKPTKADITLSGCASGL-QVFLTSPT 128
           SLPVCTVSLVSSVSFGTY HCLAHICR RYG+ DAKPTKADITLSGCASGL  VF  SPT
Sbjct: 223 SLPVCTVSLVSSVSFGTYHHCLAHICRFRYGSTDAKPTKADITLSGCASGLVPVFPASPT 402

Query: 129 EVA 131
            VA
Sbjct: 403 WVA 411

>gb|AA119501.1|AA119501 mo30h08.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus
           cDNA clone IMAGE:555135 5' similar to WP:C54G10.4
           CE05519 MITOCHRONDRIAL CARRIER PROTEIN ;.
          Length = 452

 Score =  154 bits (385), Expect(2) = 9e-43
 Identities = 71/88 (80%), Positives = 74/88 (83%)
 Frame = +1

Query: 21  GVCGVAVGYPLDTVKVRIQTEPKYTGIWHCVRDTYHRERVWGFYRGLSLPVCTVSLVS 80
           GVCGVAVGYPLDTVKVRIQTE KY GIWHC+RDTY +ERVWGFYRGLSLPVCTVSLVS
Sbjct: 151 GVCGVAVGYPLDTVKVRIQTEAKYAGIWHCIRDTYRQERVWGFYRGLSLPVCTVSLVS 324

Query: 81  SVSFGTYRHCLAHICRLRYGNPDAKPTK 108
           SVSFGTY HCLAHICR RY      P +
Sbjct: 325 SVSFGTYHHCLAHICRFRYAARTPSPPR 408

 Score = 40.2 bits (92), Expect(2) = 9e-43
 Identities = 18/18 (100%), Positives = 18/18 (100%)
 Frame = +3

Query: 103 DAKPTKADITLSGCASGL 120
           DAKPTKADITLSGCASGL
Sbjct: 390 DAKPTKADITLSGCASGL 443

>gb|AA119500.1|AA119500 mo30h07.r1 Life Tech mouse embryo 13 5dpc 10666014 Mus musculus
           cDNA clone IMAGE:555133 5' similar to WP:C54G10.4
           CE05519 MITOCHRONDRIAL CARRIER PROTEIN ;.
          Length = 349

 Score =  132 bits (328), Expect = 2e-30
 Identities = 61/69 (88%), Positives = 63/69 (90%)
 Frame = +2

Query: 21  GVCGVAVGYPLDTVKVRIQTEPKYTGIWHCVRDTYHRERVWGFYRGLSLPVCTVSLVS 80
           GVCGVAVGYPLDTVKVRIQTE KY GIWHC+RDTY +ERVWGFYRGLSLPVCTVSLVS
Sbjct: 149 GVCGVAVGYPLDTVKVRIQTEAKYAGIWHCIRDTYRQERVWGFYRGLSLPVCTVSLVS 322

Query: 81  SVSFGTYRH 89
           SVSFGTY H
Sbjct: 323 SVSFGTYHH 349

mouse ortholog
>gb|AI048215.1|AI048215 ud67g09.y1 Sugano mouse liver mlia Mus musculus cDNA clone
           IMAGE:1451008 5' similar to gb:M26880 UBIQUITIN (HUMAN);
           gb:X51703 Mouse mRNA for ubiquitin (MOUSE);.
          Length = 381

Query: 1   PSLLQVFLTSPTEVAKVRLQTQTQAQKQQRRLSASGPLAVPPMCPVPPAC--PEPKYRGP 38
           P L++VFLTSPTEVAKVRLQTQTQAQ QQRR SAS     P +CP P AC  P PKY GP
Sbjct: 174 PGLVRVFLTSPTEVAKVRLQTQTQAQTQQRRSSASWTSGAPALCPTPTACLEPRPKYSGP 353

Query: 39  LHCLATVAR 47
           LHCL TVAR
Sbjct: 354 LHCLVTVAR 380

DHHPGGRAQ*HH*ECQGKDPGQGGHPP*PTEADLCRQAAGRWPHPVRLQHPERVHPAPGLVRVFLTSPTEVAKVRLQTQTQAQTQQRRSSASWTSGAPALCPTPTACLEPRPKYSGPLHCLVTVAR

RPSPWRSSPVTPLRMSRQRSRTRRASPLTNRG*SLQASSWKMAAPCQTTTSRKSPPCTWSCPGVPDVTH*GGQSPPADTDPSSDTAAAVLGLLDIWGSRFVSHTHCLLGAQA*VQWATALFSHSGS

KTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLSGCS*RHPLRWPKSACRHRPKLRHSSGGPRPPGHLGLPLCVPHPLLAWSPGLSTVGHCTV*SQWLE

AP000944.3 chromosome 11 57% TO AL157871.2

182018 PFDLIKVRLQNQTEPRAQPGSPPPRYQGPVHCAASIFREEGPRGLFRGAWALTLR 182182
182421 LVAGGFAGIASWVAATPLDMIKSRMQMDGLRRRVYQGMLDCMVSSIRQEGLGVFFR 182585
182586 GVTINSARAFPVNAVTFLSYE 182636

AP002358.1 chromosome 11 = AP002780.1 68% to AL157871.2 gene 2
34970 ILSLPSPPLQVFLASPTEVAKVCLQTQMQQLRPLASGPLSVPLVCPVSPVCSVSKYHGLL 35149
35150 HCLAMVAHEEGLHSLYKSSSALLFWDSHSFATYFFSYAALCGWLGPTGHSQP 35305

89 NM_003705 (mito carrier, Aralar), member 12 (SLC25A12) = seq 89 homolog of U00052
AC068039.3 chromosome 2 = SEQ 89 SLC25A12 GENE STRUCTURE SHOWN FOR CARRIER PART ONLY

MAVKVQTTKRGDPHELRNIFLQYASTEVDGERYMTPEDFVQRYLGLYNDPNSNPKIVQ
LLAGVADQTKDGLISYQEFLAFESVLCAPDSMFIVAFQLFDKSGNGEVTFENVKEIFG
QTIIHHHIPFNWDCEFIRLHFGHNRKKHLNYTEFTQFLQELQLEHARQAFALKDKSKS
GMISGLDFSDIMVTIRSHMLTPFVEENLVSAAGGSISHQVSFSYFNAFNSLLNNMELV
RKIYSTLAGTRKDVEVTKEEFAQSAIRYGQVTPLEIDILYQLADLYNASGRLT
LADIERIAPLAEGALPYNLAELQRQ                                      (PHASE 0 INTRON)
QSPGLGRPIWLQIAESAYRFTLGSVAGA                                   (PHASE 1 INTRON)
VGATAVYPIDLVKTRMQNQRGSGSVVGELMYKNSFDCFKKVLRYEGFFGLYRG          (PHASE 1 INTRON)
LIPQLIGVAPEKAIKLT                                              (PHASE 0 INTRON)
VNDFVRDKFTRRDGSVPLPAEVLAGGC                                    (PHASE 0 INTRON)
AGGSQVIFTNPLEIVKIRLQVAGEITTGPRVSALNVLRDLGIFGLYK                (PHASE 0 INTRON)
GAKACFLRDIPFSAIYFPVYAHCKLLLADENGHVGGLNLLAAGAMAG                (PHASE 1 INTRON)
VPAASLVTPADVIKTRLQVAARAGQTTYSGVIDCFRKILREEGPSAFWKGTAA          (PHASE 1 INTRON)
RVFRSSPQFGVTLVTYELLQRWFYIDFGGL                                 (PHASE 2 INTRON)
KPAGSEPTPKSRIADLPPANPDHIGGYRLATATFAGIENKFGLYLPKFKSPSVAVVQPKAAVAATQ*

90B ortholog Xenopus EST 
MPSPTHTSFWTEASQQVLAGGSAGLVEICLMHPLDVVKTRFQIQRSKSDPTSYKSLGDCFKKIYRSEGLLGFYKGILPPILAETPKRAVKFFTFEQYKKLLVPLSLAPAWVFAIA

90B ortholog porcine EST Z81153.1 N-term
MSAKPNAGFVNEASRQILAGGSAGLVEICLMHPLDVVKTRFQIQRCATXPNSYKSL

90B ortholog Bovine EST AW356735.1  N-terminal 
MSAKPSVGFVNEASRQILAGGSAGLVEICLMHPLDVVKTRFQIQRCTTD

90B ortholog Rat EST AW919948 N-terminal
MSASNVSLLHETCRQVAAGGCAGLVEICLMHPLDVVKTRFQVQRSVTDPQSYKSLRDSFQVIFRTEGLFGFYKGIIPPI

90B ortholog assembled mouse seq from AC079959.6 (298 amino acids)
MSASNVSLLHETSRQVAAGGSAGLVEICLMHPLDVVKTR (intron)
FQVQRSVTDPQSYRTVRGSFQMIFRTEG (intron)
LFGFYKGIIPPILAETPKRAVK (intron)
FSTFELYKKFLGYMSLSPGL (intron)
TFLIAGLGSGLTEAVVVNPFEVVKVGLQVNRNLFKE (intron)
QPSTFAYARQIIKKEGLGFQGLNKGLTATLGRHGIFNMVYFGFYHNVKNIIPSSK (intron)
DPTLEFLRKFGIGFVSGTMGSVFNIPFDVAKSRIQGPQPVPGEIKYRSCFKTMEMIYREEG (intron)
ILALYKGLVPKVMRLGP (intron)
GGGVMLLVYEYTYAWLQENW*

90B Assembled human seq from several sources complete seq AL079303, AL079304
Sbjct: 91966 MSAKPEVSLVREASRQIVAGGSAG 91898 AL079304.3        (PHASE 1 INTRON)
LVEICLMHPLDVVKTRFQIQRCATDPNSYKSLVDSFRMIFQMEG                  (PHASE 2 INTRON)
LFGFYKGILPPILAETPKRAVK                                        (PHASE 0 INTRON)
FFTFEQYKKLLGYVSLSPAL                                          (PHASE 0 INTRON)
TFAIAGLGSGLTEAIVVNPFEVVKVGLQANRNTFAE                          (PHASE 0 INTRON)
QPSTVGYARQIIKKEGWGLQGLNKGLTATLGRHGVFNMVYFGFYYNVKNMIPVNK       (PHASE 0 INTRON)
DPILEFWRKFGIGLLSGTIASVINIPFDVAKSRIQGPQPVPGEIKYRTCFKTMATVYQEEG (PHASE 2 INTRON)
ILALYKGLLPKIMRLGPG                                            (PHASE 1 INTRON)
GAVMLLVYEYTYSWLQENW*

91 AK000766.1 AC002540 AND AC002540.1 very similar to citrin AF118838 also on 
AX004413.1 patent seq. GENE STRUCTURE SHOWN FOR CARRIER PART ONLY

MAAAKVALTKRADPAELRTIFLKYASIEKNGEFFMSPNDFVTRYLNIFGESQPNPKTVEL
LSGVVDQTKDGLISFQEFVAFESVLCAPDALFMVAFQLFDKAGKGEVTFEDVKQVFGQTT
IHQHIPFNWDSEFVQLHFGKERKRHLTYAEFTQFLLEIQLEHAKQAFVQRDNARTGRVTA
IDFRDIMVTIRPHVLTPFVEECLVAAAGGTTSHQVSFSYFNGFNSLLNNMELIRKIYSTL
AGTRKDVEVTKEEFVLAAQKFGQVTPMEVDILFQLADLYEPRGRMT
LADIERIAPLEEGTLPFNLAEAQRQ                                       (PHASE 0 INTRON)
KASGDSARPVLLQVAESAYRFGLGSVAGA                                   (PHASE 1 INTRON)
VGATAVYPIDLVKTRMQNQRSTGSFVGELMYKNSFDCFKKVLRYEGFFGLYRG           (PHASE 1 INTRON)
LLPQLLGVAPEKAIKLT                                               (PHASE 0 INTRON)
VNDFVRDKFMHKDGSVPLAAEILAGGC                                     (PHASE 0 INTRON)
AGGSQVIFTNPLEIVKIRLQVAGEITTGPRVSALSVVRDLGFFGIYK                 (PHASE 0 INTRON)
GAKACFLRDIPFSAIYFPCYAHVKASFANEDGQVSPGSLLLAGAIAG                 (PHASE 1 INTRON)
MPAASLVTPADVIKTRLQVAARAGQTTYSGVIDCFRKILREEGPKALWKGAGA           (PHASE 1 INTRON)
RVFRSSPQFGVTLLTYELLQRWFYIDFGGV                                  (PHASE 2 INTRON)
KPMGSEPVPKSRINLPAPNPDHVGGYKLAVATFAGIENKFGLYLPLFKPSVSTSKAIGGGP*

94 AK023106.1 H46077/T66746/H22920 THERE IS NO GENOMIC SEQ FOR THIS CARRIER
POSSIBLE INTRON LOCATIONS ARE BASED ON SEQ 95 and several Tetraodon nigroviridis 
genomic clones in the GSS section of the database AL267404.1 (exon 2)
AL188177.1 (exons 3, 4), AL242394.1 (exons 4, 5), AL272975.1 (exon 10)
Human STS G43092 codes for N-terminal 28 amino acids with one frameshift.

MADKQIS This intron not seen in STS G43092.1
LPAKLINGGIAGLIGVTCVFPIDLAKTRLQNQQNGQRVYTSM
SDCLIKTVRSEGYFGMYRG
AAVNLTLVTPEKAIKLAANDFFRHQLSKDG
QKLTLLKEMLAGCGAGTCQ    (this intron not seen in human seq 95, but seen in fish) 
VIVTTPMEMLKIQLQDAGRIA
AQRKILAAQGQLSAQGGAQPSVEAPAAPRPTATQLTRDLLRSRGIAGLYKGLGATLLR
DVPFSVVYFPLFANLNQLGRPASEEKSPFYVSFLAGCVAGSAAAVAVNPCDV
VKTRLQSLQRGVNEDTYSGILDCAR
KILRHEGPSAFLKGAYCRALVIAPLFGIAQVVYFLGIAESLLGLLQDPQA*

95 AC027760.1 AA663839 ae70a03.s1 AV661919 R14486 yf83d07.r1 THC95322
 
MTHQDLS                                                  (PHASE 2 INTRON)
ITAKLINGGVAGLVGVTCVFPIDLAKIRLQNQHGKAMYKGM                (PHASE 2 INTRON)
IDCLMKTVRAEGFFGMYRG                                      (PHASE 1 INTRON)
AAVNLTLVTPEKAIKLAANDFFRRLLMEDG                           (PHASE 2 INTRON)
MQRNLKMEMLAGCGAGMCQVVVTCPMEMLKIQLQDAGRLA                 (PHASE 1 INTRON)
VHHQGSASAPSTSRSYTTGSASTHRRPSATLIAWELLRTQGLAGLYRGLGATLLR  (PHASE 2 INTRON)
DIPFSIIYFPLFANLNNLGFNELAGKASFAHSFVSGCVAGSIAAVAVTPLDV     (PHASE 1 INTRON)
LKTRIQTLKKGLGEDMYSGITDCAR                                (PHASE 2 INTRON)
KLWIQEGPSAFMKGAGCRALVIAPLFGIAQGVYFIGIGERILKCFD*

AC067895.2 chromosome 6 78% to seq 95 pseudogene fragment
148990 GATLLRDIPFSSIYFPPFANFNHPGFNELTGKASSTHSFRSGCAAGS 148850

>96C AB007915.2 mRNA for KIAA0446 protein, partial CDS <3481..4587 39% to 96B
= AC007227.3 chromosome 1 clone (only 2 introns)
= AL135927.14 clone RP11-54H19 on chromosome 1

MEDKRNIQIIEWEHLDKKKFYVFGVAMTMMIRVSVYPFTLIRTRLQVQKGKSLYHGTFDA
FIKILRADGITGLYRGFLVNTFTLISGQCYVTTYELTRKFVADYSQSNTVKSLVAGGSAS
LVAQSITVPIDVVSQHLMMQRKGEKMGRFQVRGNPEGQGVVAFGQTKDIIRQILQADGLR
GFYRGYVASLLTYIPNSAVWWPFYHFYAE                                   (PHASE 1 INTRON)
QLSYLCPKECPHIVFQAVSGPLAAATASILTNPMDVIRTRVQ                      (PHASE 0 INTRON)
VEGKNSIILTFRQLMAEEGPWGLMKGLSARIISATPSTIVIVVGYESLKKLSLRPELVDSRHW*

101 AF182404.1, AC011933.4 = seq 101  est T31632

MVGYDPKPDGRNNTKFQVAVAGSVSGLVTRALISPFDVIKIRFQ                    (PHASE 0 INTRON)
LQHERLSRSDPSAKYHGILQASRQILQEEGPTAFWKGHVPAQILSIGYGAVQ            (PHASE 0 INTRON)
FLSFEMLTELVHRGSVYDAREFSVHFVCGGLAACMATLTVAPVDVLRTRFAAQGEPK       (PHASE 0 INTRON)
VYNTLRHAVGTMYRSEGPQVFYKGLAPTLIAIFPYAGLQFSRYSSLKHLYKWAIPAEGKKNE  (PHASE 1 INTRON)
NLQNLLCGSGAGVISKTLTYPLDLFKKRLQVGGFEHARAAFGQ                     (PHASE 0 INTRON)
VRRYKGLMDCAKQVLQKEGALGFFKGLSPSLLKAALSTGFMFFSYEFFCNVFHCMNRTASQR*

103 AC008053.5  THC144154

MDFLMSGLAACGACVFTNPLEVVKTRMQLQGELQAPGTYQRHYRNVFHAFITIGKVDGLA
ALQKGLAPALLYQFLMNGIRLGTYGLAEAGGYLHTAEGTHSPARSAAAGAMAGVMGAYLG
SPIYM 68005                                                       (PHASE 0 INTRON)
VKTHLQAQAASEIAVGHQYKHQ                                            (PHASE 0 INTRON)
GMFQALTEIGQKHGLVGLWRGALGGLPRVIVGSSTQLCTFSSTKDLLSQWE               (PHASE 0 INTRON)
IFPPQSWKLALVAAMMSGIAVVLAMAPFDVACTRLYNQPTDAQGK                     (PHASE 0 INTRON)
64254 GLMYRGILDALLQTARTEGIFGMYKGIGASYFRLGPHTILSLFFWDQLRSLYYTDTK*

AC026546.3 chromosome 1  52% to seq 103
94187 METVPPAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQARGTYPRP 94041
94040 YHGFIASVAAVARADGLWGLQKGLAAGLLYQGLMNGVRFYCYSLACQAGLTQQPGGTV 93867
93866 VAGAVAGALGAFVGSPAYL 93819                                   (PHASE 0 INTRON)
      IKTQLQAQTVAAVAVGHQHNHQ                                      (PHASE 0 INTRON)
      TVLGALETIWRQQGLLGLWQGVGGAVPRVMVGSAAQLATFASAKAWVQKQQ 92428   (PHASE 0 INTRON)
      WLPEDSWLVALAGGMISSIAVVVVMTPFDVVSTRLYNQPVDTAGR               (PHASE 0 INTRON)
91449 GQLYGGLTDCMVKIWRQEGPLALYKGLGPAYLRLGPHTILSMLFWDELRKLAGRAQHKGT* 91267

AC079572.1 Mus musculus mouse genomic seq for ortholog CANNOT EXTEND UPSTREAM

      MKPTQAQMAPAMDSREMVSPA
52038 VDLVLGASACCLACVFTNPLEVVKTRLQLQGELQAPGTYPRPYRGFVSSVAAVARADGLW 51859
51858 GLQKGLAAGLLYQGLMNGVRFYCYSLACQAGLTQQPGGTVVAGAAAGALGAFVGSPAYL 51682
      EST CONTINUES SEQ.    VKTQLQAQTVATMAVGHQHQHX
50760 GVLSALETIWRQQGMLGLWRGVGGAVPRVTVGSAAQLATFTSAKAWVQDRQ 50608      
      WFLEDSWLVTLAGGMISSIAVVAVMTPLDVVSTRLYNQPVDRAGR 49783
 5291 GQLYGGLADCLVKTCQQEGPLALYKGLGPAYLRLGPHTILSMFFWDELRKLVARAQHQGT* 5476

Query:   162 QTVLGALETIWRQQGLLGLWQGVGGAVPRVMVGSAAQLATFASAKAWVQKQQ 213
             Q VL ALETIWRQQG+LGLW+GVGGAVPRV VGSAAQLATF SAKAWVQ +Q
Sbjct: 50763 QGVLSALETIWRQQGMLGLWRGVGGAVPRVTVGSAAQLATFTSAKAWVQDRQ 50608

mouse homolog ESTs
gb|AI430537.1|AI430537 mc47h05.y1 Soares mouse p3NMF19.5 Mus musculus cDNA clone

Query: 1   ATGPEAMETVPPAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQARGTYPRPYHGFI 60
           A   ++ E V PAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQA GTYPRPY GF+
Sbjct: 64  APAMDSREMVSPAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQAPGTYPRPYRGFV 243

Query: 61  ASVAAVARADGLWGLQKGLAAGLLYQGLMNGVRFYCYSLACQAGLTQQPGGTVVAGAVAG 120
           +SVAAVARADGLWGLQKGLAAGLLYQGLMNGVRFYCYSLACQAGLTQQPGGTVVAGA AG
Sbjct: 244 SSVAAVARADGLWGLQKGLAAGLLYQGLMNGVRFYCYSLACQAGLTQQPGGTVVAGAAAG 423

RLPSIACAMKPTQAQMAPAMDSREMVSPAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQAPGTYPRPYRGFVSSVAAVARAD
GLWGLQKGLAAGLLYQGLMNGVRFYCYSLACQAGLTQQPGGTVVAGAAAGALGAFVGSPAYLVKTQLQAQTVATMAVGHQHQH

gb|W41322.1|W41322 mc47h05.r1 Soares mouse p3NMF19.5 Mus musculus cDNA clone

Query: 5   EAMETVPPAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQARGTYPRPYHGFIASVA 64
           ++ E V PAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQA GTYPRPY GF++SVA
Sbjct: 75  DSREMVSPAVDLVLGASACCLACVFTNPLEVVKTRLQLQGELQAPGTYPRPYRGFVSSVA 254

Query: 65  AVARADGLWGLQKGLAAGLLYQGLMNGVR 93
           AVARADGLWGLQKGLAAGLLYQGLMNGVR
Sbjct: 255 AVARADGLWGLQKGLAAGLLYQGLMNGVR 341

105 HUMAN THC188274  AF125531.1 AC004958.1 105 does not seem to have alternative 
splicing

MDPETRGQEIIKVTPLQQMLASCTGAILTSVIV                                  (PHASE 1 INTRON)
TPLDVVKIRLQAQNNPLPKG                                               (PHASE 1 INTRON)
22071 KCFVYSNGLMDHLCVCEEGGNKLWYKKPGNFQGTL 21967                    (PHASE 0 INTRON)
DAFFKIIRNEGIKSLWSGLPPTL                                            (PHASE 2 INTRON)
VMAVPATVIYFTCYDQLSALLRSKLGENETCIPIVAGIVARF                         (PHASE 1 INTRON)
GAVTVISPLELIRTKMQSKKFSYVELHRFVSKKVSEDGWISLWRGWAPTVLRDVPFSA         (PHASE 1 INTRON)
MYWYNYEILKKWLCEKSGLYEPTFMINFTSGALSGS                               (PHASE 0 INTRON)
FAAVATLPFDVVKTQKQTQLWTYESHKI                                       (PHASE 1 INTRON)
SMPLHMSTWIIMKNIVAKNGFSGLFSG                                        (PHASE 1 INTRON)
LIPRLIKIAPACAIMISTYEFGKAFFQKQNVRRQQY*

106 AC019152.5 chr 17 THC130388 TIGR AC025326.3 chr 2, AC019151.2

MADQDPAGISPLQQMVASGTGAVVTSLFM                                      (PHASE 1 INTRON)
TPLDVVKVRLQSQRPSMASE                                               (PHASE 1 INTRON)
LMPSSRLWSLSYTKL                                                    (PHASE 1 INTRON)
PSSLQSTGKCLLYCNGVLEPYLCPNGARCATWFQDPTRFTGTM                        (PHASE 0 INTRON)
DAFVKIVRHEGTRTLWSGLPATL                                            (PHASE 2 INTRON)
VMTVPATAIYFTAYDQLKAFLCGRALTSDLYAPMVAGALARL                         (PHASE 1 INTRON)
GTVTVISPLELMRTKLQAQHVSYRELGACVRTAVAQGGWRSLWLGWAPTALRDVPFSA         (PHASE 1 INTRON)
LYWFNYELVKSWLNGFRPKTKTSVGMSFVAGGISGT                               (PHASE 0 INTRON)
VAAVLTLPFDVVKTQRQVALGAMEAVRV                                       (PHASE 1 INTRON)
NPLHVDSTLLLRRIRAESGTKGLFAG                                         (PHASE 1 INTRON)
FLPRIIKAAPSCAIMISTYEFGKSFFQRLNQDRLLGG*

Gene 106 has alternative splicing, as shown below by three different mRNA sequences.
Exon 4 has two alternative splice sites.  If the first site is used, the product
is called exon 4a and it is 8 amino acids longer than exon 4b.  Seq. 106 includes
exon 3 and uses the first splice site in exon 4.  This is found in 8 ESTs.
Seq. 106A skips exon 3 and uses the second site in exon 4.  This is seen in 1 EST.
Seq. 106B includes exon 3 but uses the second site in exon 4.  This is the most
abundant form, seen in 25 ESTs.  No product is seen that skips exon 3 and uses the
first splice site in exon 4.

   EXON 2   *     EXON 3    *EXON 4a *    EXON4b
106  RPSMASE1LMPSSRLWSLSYTKL1PSSLQSTG1KCLLYCNGVLEPYLCPNGARCATWFQDP  8 ESTs
106A RPSMASG1-------------------------KCLLYCNGVLEPYLCPNGARCATWFQDP  1 EST
106B RPSMASE1LMPSSRLWSLSYTKW1---------KCLLYCNGVLEPYLCPNGARCATWFQDP 25 ESTs
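
The three splice forms in the table above can be reassembled from their segments.
The following is a minimal Python sketch, not part of the original annotation.  The
names EXON4A_EXTRA, EXON4B, ISOFORMS and assemble are hypothetical, and the segment
strings are copied from the table with the "1" phase markers and "-" gap characters
removed.  The residue at each phase 1 junction is stored as it appears in each row,
because it differs between forms (E/G at the end of exon 2, L/W at the end of exon 3).

```python
# Segments copied from the alignment table for seqs. 106, 106A and 106B.
EXON4A_EXTRA = "PSSLQSTG"                  # extra residues when the first site in exon 4 is used
EXON4B = "KCLLYCNGVLEPYLCPNGARCATWFQDP"    # shared part of exon 4 shown in the table

# Each form is the ordered list of segments shown in its row of the table.
# Junction residues are kept per row rather than derived from nucleotides.
ISOFORMS = {
    "106":  ["RPSMASE", "LMPSSRLWSLSYTKL", EXON4A_EXTRA, EXON4B],
    "106A": ["RPSMASG", EXON4B],
    "106B": ["RPSMASE", "LMPSSRLWSLSYTKW", EXON4B],
}


def assemble(name):
    """Join the segments of one splice form into a single peptide string."""
    return "".join(ISOFORMS[name])


if __name__ == "__main__":
    for name in ISOFORMS:
        print(f"{name:5s} {assemble(name)}")
    # Exon 4a is longer than exon 4b by the length of EXON4A_EXTRA.
    print(len(assemble("106")) - len(assemble("106B")))
```

The length difference between forms 106 and 106B equals the 8 residues that the
note attributes to exon 4a.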

106A AC019152.5 chr 17 THC130388 TIGR AC025326.3 chr 2, AC019151.2

MADQDPAGISPLQQMVASGTGAVVTSLFM                                      (PHASE 1 INTRON)
TPLDVVKVRLQSQRPSMASG                                               (PHASE 1 INTRON)
KCLLYCNGVLEPYLCPNGARCATWFQDPTRFTGTM                                (PHASE 0 INTRON)
DAFVKIVRHEGTRTLWSGLPATL                                            (PHASE 2 INTRON)
VMTVPATAIYFTAYDQLKAFLCGRALTSDLYAPMVAGALARL                         (PHASE 1 INTRON)
GTVTVISPLELMRTKLQAQHVSYRELGACVRTAVAQGGWRSLWLGWAPTALRDVPFSA         (PHASE 1 INTRON)
LYWFNYELVKSWLNGFRPKTKTSVGMSFVAGGISGT                               (PHASE 0 INTRON)
VAAVLTLPFDVVKTQRQVALGAMEAVRV                                       (PHASE 1 INTRON)
NPLHVDSTLLLRRIRAESGTKGLFAG                                         (PHASE 1 INTRON)
FLPRIIKAAPSCAIMISTYEFGKSFFQRLNQDRLLGG*

106B AC019152.5 chr 17 THC130388 TIGR AC025326.3 chr 2, AC019151.2

MADQDPAGISPLQQMVASGTGAVVTSLFM                                      (PHASE 1 INTRON)
TPLDVVKVRLQSQRPSMASE                                               (PHASE 1 INTRON)
LMPSSRLWSLSYTKW                                                    (PHASE 1 INTRON)
KCLLYCNGVLEPYLCPNGARCATWFQDPTRFTGTM                                (PHASE 0 INTRON)
DAFVKIVRHEGTRTLWSGLPATL                                            (PHASE 2 INTRON)
VMTVPATAIYFTAYDQLKAFLCGRALTSDLYAPMVAGALARL                         (PHASE 1 INTRON)
GTVTVISPLELMRTKLQAQHVSYRELGACVRTAVAQGGWRSLWLGWAPTALRDVPFSA         (PHASE 1 INTRON)
LYWFNYELVKSWLNGFRPKTKTSVGMSFVAGGISGT                               (PHASE 0 INTRON)
VAAVLTLPFDVVKTQRQVALGAMEAVRV                                       (PHASE 1 INTRON)
NPLHVDSTLLLRRIRAESGTKGLFAG                                         (PHASE 1 INTRON)
FLPRIIKAAPSCAIMISTYEFGKSFFQRLNQDRLLGG*


AL390963.4 chromosome 1 59% to 106 pseudogene
59443 LAPSNKWWSHVSRAVVTSFLVTPLDMVEGPFAASVPAGGQ*ADTFLQTSEPLLCQAA 59273
59272 LLSPILYYNGVLEPLYL*PNGPRCATWF*DPTSFTGTLDASVKILRHKGTRTLWSGLPAT 59093
59092 LVMTVPATTNYFTAYNQLKACLCG*ALTST*PW*LVCWSTLALSLQSSVPWSLCRQ 58925
58924 SCRFSICYTMSWYLCRAPVAYSGWSSLRLCRGPTLLRNVPFSALYWFNYELVKSWLNGLR 58745
58744 PKELHGPMTTIGVSFAAGSITMMLATVLTLTFVVMKTQHQATLGVMEAVEVTSLDADSTW 58565
58564 LLRGGSVPSRASEG 58523

111 R70272/R70235, NM_018155.1, AK001480.1, AL049246.1, AC009896.3 AC068429.2

MSQRDTLVHLFAGG                                                    (PHASE 2 INTRON)
CGGTVGAILTCPLEVVKTRLQSSSVTLYISEVQLNTMAGASVNRVVSPGPLHCLK           (PHASE 2 INTRON)
VILEKEGPRSLFRGLGPNLVGVAPSR                                        (PHASE 2 INTRON)
AIYFAAYSNCKEKLNDVFDPDSTQVHMISAAMAG                                (PHASE 1 INTRON)
FTAITATNPIWLIKTRLQLDAR                                            (PHASE 2 INTRON)
NRGERRMGAFECVRKVYQTDGLKGFYRGMSASYAGISETVIHFVIYES
IKQKLLEYKTASTMENDEESVKEASDFVGMMLAAATSKTCATTIAYPHE                 (PHASE 1 INTRON)
VVRTRLREEGTKYRSFFQTLSLLVQEEGYGSLYRGLTTHLVRQIPNTAIMMATYELVVYLLNG*

AC011457.1 chromosome 19 possible pseudogene to 111
110558 GTVAAILMCPLEVRMTWLLSSSVKLFISEV*LNTMVFFISEV*LNAMAEAHYQLNDIS*I 110737
110738 S*MSCIYLNLFIS*R*PWKK-KGFFPCLED*AS*LRVGPSRATYFADYSNCMKKLNAIMD 110914
110915 CDSTW-YIISATVAGFTVLIATNPS*LKETQI*LDYR 111022

113 AC073194.4   T66303/T72707 also AC026078.3 chromosome 11   
This gene has no introns.  It was probably derived as a processed gene from seq. 111.
RLALRTVYYPQVHLG is not in the alignment.  Several positions in this seq. may be
polymorphic (FQEEG, FREEG; LIRRIP, LIRQIP).

MATGGQQKENTLLHLFAGGCGGTVGAIFTCPLEVIKTRLQSS
RLALRTVYYPQVHLGTISGAGMVRPTSVTPGLFQVLKSILEKEGPK   
SLFRGLGPNLVGVAPSRAVYFACYSKAKEQFNGIFVPNSNIVHIFSAGSAAFITNSLMNPIWMV   
KTRMQLEQKVRGSKQMNTLQCARYVYQTEGIRGFYRGLTASYAGISETIICFAIYESLKKYL
KEAPLASSANGTEKNSTSFFGLMAAAALSKGCASCIAYPHEVIRTRLREEGTKYKSFV
QTARLVFQEEGYLAFYRGLFAQLIRRIPNTAIVLSTYELIVYLLEDRTQ*

123 AC012213.3 H80950 FLX1 form2 AL046268.1 FIRST 50 aa missing from gene
not sure if there are introns in this region

MTGQGHSASGSSAWSTVFRHVRYENLVAGVSGGVLSNLALHPLDLVKIRFAV         (PHASE 1 INTRON)
SDGLELRPKYNGILHCLTTIWKLDGLRGLYQGVTPNIWGAGLSWGLYFFF           (PHASE 2 INTRON)
YNAIKSYKTEGRAEYLEATEYLVSAAEAG                                (PHASE 1 INTRON)
AMTLCITNPLWVTKTRLMLQYDAVVNSPHRQYKGMFDTLVKIYKYEGVRGLYK        (PHASE 0 INTRON)
GFVPGLFGTSHGALQFMAYELLKLKYNQHINRLPEAQL                       (PHASE 0 INTRON)
STVEYISVAALSKIFAVAATYPYQVVRARLQDQHMFYSGVIDVITKTWR            (PHASE 2 INTRON)
KEGVGGFYKGIAPNLIRVTPACCITFVVYENVSHFLLDLREKRK*

125 HUMAN FLX1 FORM1 Z98048.1 N-term part, AL049764.4 C-term part      

MASVLSYESLVHAVAGAV                                           (PHASE 0 INTRON)
GSVTAMTVFFPLDTARLRLQV                                        (PHASE 1 INTRON)
DEKRKSKTTHMVLLEIIKEEGL                                       (PHASE 2 INTRON)
LAPYRGWFPVISSLCCSNFVYFYTFNSLKALWVKGQHSTTGKDLVVGFVAG          (PHASE 1 INTRON)
VVNVLLTTPLWVVNTRLKLQGAKFRNEDIVPTNYKGIID                      (PHASE 1 INTRON)
AFHQIIRDEGISALWNGTFPSLLLVFNPAIQFMFYEGLKRQLLKKRMK             (PHASE 0 INTRON)
LSSLDVFIIGAVAKAIATTVTYPLQTVQSILR                             (PHASE 0 INTRON)
FGRHRLNPENRTLGSLRNILYLLHQRVR                                 (PHASE 2 INTRON)
RFGIMGLYKGLEAKLLQTVLTAALMFLVYEKLTAATFTVMGLKRAHQH*   

132 AC051642.2 AC018410.3, AC022470.6 chromosome 6, AC024045.4 chromosome 11 67% to 125 
28269 RCFGIMGLYTGLEAKLL*TVLTAASIYEKLMVAIFTVMRLKSTSKH 28406
132, 136 AF223466.1 HT015 132 (N71105 H60352) and 136 (N91093 N90124) end wrong (f.s.?)

MELRSGSVGSQAVARRMDGDSRDGGGGKDATGSE
DYENLPTSASVSTHMTAGAMAGILEHSVMYPVDSVK                         (PHASE 0 INTRON) 
TRMQSLSPDPKAQYTSIYGALKKIMRTEGFWRPLRGV
NVMIMGAGPAHAMYFACYENMKRTLNDVFHHQGNSHLANG                     (PHASE 1 INTRON)
IAGSMATLLHDAVMNPAEV                                          (PHASE 1 INTRON)
VKQRLQMYNSQHRSAISCIRTVWRTEGLGAFYRSYTTQLTMNIPFQSIHFIT
YEFLQEQVNPHRTYNPQSHIISGGLAGALAAAATTPLDVCKTLLNTQENVALSLANIS
GRLSGMANAFRTVYQLNGLAGYFKGIQARVIYQMPSTAISWSVYEFFKYFLTKRQLENRAPY*

AF216674.2 chromosome 8 132, 136 3 DIFFS
64977 QHRSAISCIRTVWRTEGLGAFYRSYTTQLTMNIPFQSIHFITYEFLQEQVNPHRTY 64810
64809 NPHSHIISGGLAGALAAAATTPLDVCKTLLNTQENVALSLANISGRL*GMANAFRTVY 64636
64635 QLNGLAGYFKGIQARVIYQMPSTAISWSVYEFFKYFL 64525

AC051642.2 chromosome 8 132, 136 1 DIFF
104718 QHRSAISCIRTVWRTEGLGAFYRSYTTQLTMNIPFQSIHFITYEFLQEQVNPHRTY 104885
104886 NPQSHIISGGLAGALAAAATTPLDVCKTLLNTQENVALSLANISGRLSGMANAFRTVY 105059
105060 QLNGLAGYFKGIQARVIYQMPSTAISWSVYEFFKYFL 105170

134 human R01184 AL353719.6 chromosome 10 N-TERM is missing AC007643.3          

GEAGACRPPVRQDPDSGPDYEALPAGATVTTHMVAGAVAGILEHCVMYPIDCVK           (PHASE 0 INTRON)
QTRMQSLQPDPAARYRNVLEALWRIIRTEGLWRPMRGL
NVTATGAGPAHALYFACYEKLKKTLSDVIHPGGNSHIANG                         (PHASE 1 INTRON)
AAGCVATLLHDAAMNPAEV                                              (PHASE 1 INTRON)
VKQRMQMYNSPYHRVTDCVRAVWQNEGAGAFYRSYTTQLTMNVPFQAIHFMTYEFLQEHF
NPQRRYNPSSHVLSGACAGAVAAAATTPLDVCKTLLNTQESLALNSHITGHITGMASAFR
TVYQVGGVTAYFRGVQARVIYQIPSTAIAWSVYEFFKYLITKRQEEWRAGK*

137 PET8 HOMOL AC022894.2, AC022293.14 chromosome 3, AC069395.6 EST H08058

MDRPGFVALLVAGGVAGVSVDLILFPLDTIK
TRLQSPQGFSKAGGFHGIYAGVPSAAIGSFPNA                                (PHASE 1 INTRON)
AAFFITYEYVKWFLHADSSSYLTPMKHMLAASAGEV                             (PHASE 0 INTRON)
VACLIRVPSEVVKQRAQVSASTRTFQIFSNILYEE                              (PHASE 0 INTRON)
GIQGLYRGYKSTVLRE                                                 (PHASE 0 INTRON)
IPFSLVQFPLWESLK                                                  (PHASE 0 INTRON)
ALWSWRQDHVVDSWQSAVCGAFAG                                         (PHASE 1 INTRON)
GFAAAVTTPLDVAKTRIMLAK                                            (PHASE 0 INTRON)
AGSSTADGNVLSVLHGVWRSQGLAG                                        (PHASE 2 INTRON)
LFAGVFPRMAAISLGGFIFLGAYDRTHSLLLEVGRKSP*

151 citrate car. hum U25147

MPAPRAPRALAAAAPASGKAKLTHPEKAILAGGLAGGIEICITFPTEYVKTQLQLDERSHPPRYRGIG (PHASE 1 INTRON)
DCVRQTVRSHGVLGLYRGLSSLLYGSIPKAAVR                                    (PHASE 2 INTRON)
FGMFEFLSNHMRDAQGRLDSTRGLLCGLGAGVAEAVVVVCPMETIK                       (PHASE 0 INTRON)
VKFIHDQTSPNPKYRGFFHGVREIVREQG                                        (PHASE 1 INTRON)
LKGTYQGLTATVLKQGSNQAIRFFVMTSLRNWYRG                                  (PHASE 1 INTRON)
DNPNKPMNPLITGVFGAIAGAASVFGNTPLDVIKTRMQ                               (PHASE 0 INTRON)
GLEAHKYRNTWDCGLQILKKEGLKA                                            (PHASE 2 INTRON)
FYKGTVPRLGRVCLDVAIVFVIYDEVVKLLNKVWKTD*

AC022860.3 chromosome 11 59% to citrate (pseudogene)

102887 MPGVTPPAHHGSAAFWKAEGTLPGKAITAGGLDGSIHTCIPFPTEHVRTNERSP 102726
102725 PPGTRHGGLWGTVTAT 102678
102684 SHGVLGLNRGLSFLLCGSIPEA 102619
102628 RVRFGKLEFLCTQMWNAQGLLNSRRGLL*GLDTGMAEAMVVVRPMETIKVKFIHYQT 102449
102448 SPNPR 102434
102446 KNR*FFGVREIMREQGLRGLTRGLKAAGLKQGWNQAIVSLS*PPCSNWYQTDK 102279
102278 PMNPLITGVF*AIAGTASVFGNTPLDGIPRGSRPSQGHC 102162
102161 PMPVTDGVCLDVAITFIIYNEVVKLLNKVWKKD 102063

AC055828.2  58% to citrate (pseudogene)
153138 PGKATLAGGQARDIEIGVRCPRSAGGRTCSPTSARTRGTRRMDSASTLSRPRHPGPAPRP 153317
153318 RLPALPLYPQGGFGF*MFLFLSSQTRDAQGRPYNNAGCCAAGAPA 153452
153453 WPGPWWSYFPWTPSR*SSSTARPPQTSSAEESPTGVREIVREQGDPAGPHGPRA 153614
153615 EAGLEPGHPLLLLTPLHSWYRGDNPSKPMNPLVAGAFGAIAGAASVLGNAPLHGIETRMR 153794
153795 GLKSTNAEHTGLRLQILRKEGLKAFYNGIVPHLGRVCLDVTTVFILCHEVGKLLNSV 153965

AC073188.3 chromosome 7 54% to citrate (pseudogene)
120147 PTERGRS*LQPDERWHQP
120221 RAHCRGHGVPGRHRGLGSLLCGSCLQGGVGF*MFQFHSSQTRDAQGRPDGSAGCGA 120388
       GRWRAEAMVVVCSMDTIKVKFIHGQTFPD 120479
120480 LQCRGVSHQVREIVREQGDSPGPHGTTLKQGWNQAVCF 120593
       LLLTSLHSWYRGDNPSKPMDPLVAGVFGAIVGAASVLGNAPLDG 120724
120725 IETGCGAWSSTNAEHTGLWLQILRKEGLKDL*NGTVPAWPLDASPWM*PGIRVT 120886
120887 VFILYQEVEKLHNSV 120931

AC023824.2 chromosome 16 citrate carrier pseudogene

49641 SPLPPTMAAQRSGGAEGTLPGKAITARWPGRQHPHLYSVSHRAREDQRALAPAGHPAWG 49465
49464 PVGHSHSHGVLGLNRGLSFLLCGSIPEAESGSGSWSSSAP 49345
49344 RCGMPRDC*TAGAGCCEV*TPAWPRPWWSYVPWRPSK*SSSTTRPPQTQEQIILRG*GDY 49165
49164 AGTRAEGTYQGLRAAGLKQGWNQAIVSLS*PPCSNWYQTDKPMNPLITGV 49015
49014 F*AIAGTASVFGNTPLDGIPRGSRPSQGHCPMPVTDG 48904
48903 VCLDVAITFIIYNEVVKLLNKVWKKD 48826

AC007506.1 chromosome 4, two citrate related exons
487 QVKFIHDQTSPNPKYRGFFHGVREIVREQG 398
231 GLKGTFHGLTATVPETGLRTQAIPFFVMNSLRNWFRGVFCNPKAP 97

AC022285.10 chromosome 17 citrate carrier related C-term fragment
129238 LNKLLQRTISHYGRVCLHIAFVFALYDEIIKHINKI 129131

AP000767.3 chromosome 11 = AP000645.2 80% to CITRATE (pseudogene)
25945 LITGGFGAIVCTASVFGNTPLDVIKTRR*GLEAHKYRNLMDCGWQILRKEELKAFYKGIV 26124
26125 P 26127

AC015695.3 chromosome 11 64% to citrate (pseudogene)
79735 RAHDPAVTGCLVAIVCTASVFGHTLLXVIKTRR*GLEAHKYRNLMDCGWQIMRKEELKAF 79556
79555 YKGIVP 79538

AC055755.7 chromosome 3 PSEUDO TO SEQ 151 CITRATE
71988 ILAGGLAGGIEIYITFPTQYVKTQWQLD*PSNPPRYQSIGEWVWQTVRSHGVLGLYRS 71815
71814 LSSLLYGSIPKAAASFGMF*FLGNHLRDSQGRLDSTRGLLCCLGAGVAEAVVIVC 71650
71649 PMETIKVRFI 71620

AC016486.4 chromosome 11 SAME AS AC055755.7 BUT DIFF CHROMOSOME
139737 ILAGGLAGGIEIYITFPTQYVKTQWQLD*PSNPPRYQSIGEWVWQTVRSHGVLGLYRS 139910
139911 LSSLLYGSIPKAAASFGMF*FLGNHLRDSQGRLDSTRGLLCCLGAGVAEAVVIVC 140075
140076 PMETIKVRFI 140105

46B AC022956.2 chromosome 1 = AC022960.2 chromosome 18 95% to AC022598.3
NO INTRONS
17063 MIDSEAHEKRPPILTSSKQDISPHITNVGEMKHYLCGCCAAFNNVAITYPIQKVLFRQQL 16884
16883 YGIKTRDAVLQLRRDGFRNLYRGILPPLMQKTTTLALMFGLYEDLSCLLRKHVRAPEFAT 16704
16703 HGVAAVLAGTAEAIFTPLERVQTLLQNHKHHDKFTNTYQAFKALKCHGIGEYYRGLVPIL 16524
16523 FRNGLSNVLFFGLRGPIKEHLPTATTHSAHLVNDFIGGGLLGAMLGFLCFPINVVKTRLQ 16344
16343 SQIGGEFQSFPKVFQKIWLERDRKLINLFRGAHLNYHRSLISWGIINATYEFLLKFI 16173

AC022598.3 chromosome 8 = AC009871.5 87% to AC022960 pseudogene
37705 MMDSEAHEKRLPILISSKQDISPHITNVGEMKHYLCGCCAAFNNITITFPIQKVLF*QQL 37884
37885 YSIKTRDAILQLRRVGFQNLYRGILPPLMQKTTTLALMFGLYKDLSYLLHKHVSAPEFAA 38064
38065 RGVAAVLSGTTEAIFTPLERVQTLLQDHKHHDTFTNTYQAFKALKCHRIGEYY*GLVPIL 38244
38245 FWNGLSNVLFFGLRGPIKEHLPTEMTHSAHLVNDFICGGLLGAMLGFLFFPINVVKTHIW 38424
38425 SQIGGEFQSFPQVFQKVWLEWDRKLINLFRGAYLNYHRSLISWGIINATYEFLLKII 38595

46A AL138752.5 CHR 9p12 EST FOR N-TERM = AA149221 = AC073598.1 chromosome 10
NO INTRONS mouse GSS clone for C-terminal AZ026662.1
56629 MMDSEAHEKRPPILTSSKQDISPHITNVGEMKHYLCGCCAAFNNVAITFPIQKVLFRQQLYGIKTRDAILQ
56852 LRRDGFRNLYRGILPPLMQKTTTLALMFGLYEDLSCLLHKHVSAPEFATSGVAAVLAGTT 57031
57032 EAIFTPLERVQTLLQDHKHHDKFTNTYQAFKALKCHGIGEYYRGLVPILFRNGLSNVLFF 57211
57212 GLRGPIKEHLPTATTHSAHLVNDFICGGLLGAMLGFLFFPINVVKTRIQSQIGGEFQSFP 57391
57392 KVFQKIWLERDRKLINLFRGAHLNYHRSLISWGIINATYEFLLKVI* 57532

AL391500.2 chromosome 6 89% to AL138752.5 pseudogene
22866 MDSEAHEKRPPTLTSSEQDISPRITNVGEMKHYLCGCCAAFNNIAITYPIQTVLFRQQLY 22687
22686 GIKTRDAIF*LRRDGFRNLYPGILPPLMQKTATLALTFGLYEDLSCLLHKHVSAPEFATC 22507
22506 CMAAVLAGTTEAIFTPPEIVQTLIQDHKHHDRFTNTYQAFKALKCHGIGEYYRGLVPILF 22327
22326 QNGLSNVLFCGL*GPIKEHLPTETTHSAHLVNNFICGGPLGAMLGFLFF 22180
22182 PINVKTRIQSQIGGEFQSFPKVFQKIWLERDRKLINLFRGAHLNYHQSLIS*SIINA 22008
22007 TYE 21999

AC040947.2 chromosome 17 pseudogene = AL354765.2 chromosome 13 87% to AL138752.5
44259 MMDSEAHEKRPPMLTSSKQDISPHIINVGEMKHDLCGCCAAFN
44140 HVEITFPIQKVLFQQQLYGIKTRDAILQLRRDGF*NLYPGILP*LMQKTTTLALMFG 43961
43960 LYEDLSYLLHKHVSAPEFATRVVVAVLARTAEAIFTPLQRVQTLLQDHKHHDKFTSTYQA 43781
43780 FKALKCHRIGERYRGLLPILFQNGLNNVLFFGLRGPIKEHLPTSTTHSAHLVNDFLCGGQ 43601
43600 LGAMLGFLFFLINVVKTSIQSQPSGEFQSFPKVFQKIWLERDRKLINLFRGAHLNYHRSL 43421
43420 ISWGIISATYEFLLKL 43373

AC068848.2 very similar to AL138752.5 with internal frame shift pseudogene
104256 MMDSEAHGKRPPILTSSKQDMSPHITDL
104342 EMKHYLCGCCAAFNNVAITFPIQKVLFPQQLYGIKTGDAILQLRTDGFRNLYRGIFPRLM 104521
104522 QKTTTLALTFGLYEDLSYLLHKHVSAPEFATCGVAAVLAGTTEAIFTXXXX    
       LQTLLQDHKHHDKFANIYQAFKALKCHGIGEFYRGLVPILFQNGLSNVFFFGLRGHVKEHLPTAT 104857
104858 THNSHLVNDFIGGGLLGAMLGFLFSPINVVKTRIQSQIGGEFQSFPKVFQKIWLERDRKL 105037
105038 INLFRGSHLNYHLSLISWGIINA 105106

AC020568.4 chromosome 20 pseudogene similar to AL138752.5
91876 MMDSEAHGKRPPILTSSKQDMSPHITDLVK*SITCVAAAFNNVAITFPIQKVLFPQQL 92049
92050 YGIKTGDAILQ
92015 LRTDGFRNLYRGIFPRLMQK 92143
24716 SYQAFKALKCHGIGEFYRGLVPILFQNGLSNVFFF
      PINVVKTRIQSQIGGEFQSFPKVFQKIWLERDRKLINLFRG 25066
25067 SHLNYHLSLISWGIINA 25117

165 PHOSPHATE CARRIER HUMAN AC013283.8 1 and 2 are the same gene except for exon 2
MFSSVAHLARANPFNTPHLQLVHDGLGDLRSSSPGPTGQPRRPRNLAAAAVE               (PHASE 1 INTRON)
EYSCEFGSAKYYALCGFGGVLSCGLTHTAVVPLDLVKCRMQ                          (PHASE 0 INTRON)
VDPQKYKGIFNGFSVTLKEDGVRGLAKGWAPTFLGYSMQGLCKFGFYEVFKVLYSNMLGE       (PHASE 0 INTRON)
ENTYLWRTSLYLAASASAEFFADIALAPMEAAKVRIQTQPGYANTLRDAAPKMYKEEGLKA      (PHASE 2 INTRON)
FYKGVAPLWMRQIPYTMMKFACFERTVEALYKFVVPKPRSECSKPEQLVVTFVAGYIA         (PHASE 1 INTRON)
GVFCAIVSHPADSVVSVLNKEKGSSASLVLKRLGFKG                              (PHASE 1 INTRON)
VWKGLFARIIMIGTLTALQWFIYDSVKVYFRLPRPPPPEMPESLKKKLGLTQ*

167 PO4 CARRIER HUMAN 2   1 and 2 are the same gene except for exon 2
MFSSVAHLARANPFNTPHLQLVHDGLGDLRSSSPGPTGQPRRPRNLAAAAVE              (PHASE 1 INTRON)
EQYSCDYGSGRFFILCGLGGIISCGTTHTALVPLDLVKCRMQ                        (PHASE 0 INTRON)
VDPQKYKGIFNGFSVTLKEDGVRGLAKGWAPTFLGYSMQGLCKFGFYEVFKVLYSNMLGE      (PHASE 0 INTRON)
ENTYLWRTSLYLAASASAEFFADIALAPMEAAKVRIQTQPGYANTLRDAAPKMYKEEGLKA     (PHASE 2 INTRON)
FYKGVAPLWMRQIPYTMMKFACFERTVEALYKFVVPKPRSECSKPEQLVVTFVAGYIA        (PHASE 1 INTRON)
GVFCAIVSHPADSVVSVLNKEKGSSASLVLKRLGFKG                             (PHASE 1 INTRON)
VWKGLFARIIMIGTLTALQWFIYDSVKVYFRLPRPPPPEMPESLKKKLGLTQ*
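
The two phosphate carrier records (seqs. 165 and 167) can be compared exon by exon
to locate the alternatively spliced exon.  The following is a minimal Python sketch,
not part of the original annotation.  The names PHASE_RE, parse_exons and
differing_exons are hypothetical.  It assumes each line holds one exon followed by an
optional "(PHASE n INTRON)" marker, which holds for these two records but not for
records where an exon wraps over several lines.  Only the first three exons of each
record are copied here.

```python
import re

# Matches the trailing intron marker used throughout this file.
PHASE_RE = re.compile(r"\(PHASE ([012]) INTRON\)\s*$")


def parse_exons(lines):
    """Return (sequence, phase) pairs; phase is None when no marker follows."""
    exons = []
    for line in lines:
        line = line.rstrip()
        if not line:
            continue
        match = PHASE_RE.search(line)
        phase = int(match.group(1)) if match else None
        seq = PHASE_RE.sub("", line).strip().rstrip("*")
        exons.append((seq, phase))
    return exons


def differing_exons(rec_a, rec_b):
    """Return 1-based positions of exons whose sequences differ."""
    pairs = zip(parse_exons(rec_a), parse_exons(rec_b))
    return [i for i, (a, b) in enumerate(pairs, start=1) if a[0] != b[0]]


# First three exons of seq. 165 and seq. 167, copied from the records above.
SEQ165 = [
    "MFSSVAHLARANPFNTPHLQLVHDGLGDLRSSSPGPTGQPRRPRNLAAAAVE   (PHASE 1 INTRON)",
    "EYSCEFGSAKYYALCGFGGVLSCGLTHTAVVPLDLVKCRMQ   (PHASE 0 INTRON)",
    "VDPQKYKGIFNGFSVTLKEDGVRGLAKGWAPTFLGYSMQGLCKFGFYEVFKVLYSNMLGE   (PHASE 0 INTRON)",
]
SEQ167 = [
    "MFSSVAHLARANPFNTPHLQLVHDGLGDLRSSSPGPTGQPRRPRNLAAAAVE   (PHASE 1 INTRON)",
    "EQYSCDYGSGRFFILCGLGGIISCGTTHTALVPLDLVKCRMQ   (PHASE 0 INTRON)",
    "VDPQKYKGIFNGFSVTLKEDGVRGLAKGWAPTFLGYSMQGLCKFGFYEVFKVLYSNMLGE   (PHASE 0 INTRON)",
]

if __name__ == "__main__":
    print(differing_exons(SEQ165, SEQ167))
```

For the exons copied here, the only position reported as different is the second
line of each record, which matches the alternative exon 2 listed under AC013283.8.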

AC027433.2 chromosome 18 phosphate carrier pseudogene
139285 ENVYL*CT*LYFTPSLSVEEFYFTDFALAPMKAADVEFQTKPGHANF*GILLPKGINKER 139464
139465 WSLMNETGNYTPQEKSIALDIPLKYGMHLLLFKPRNKMSKDRTDGCSI*SRL 139620
139621 RSWGCYCLSPADSVASVVNKGSATQLPQRIGCRLWE 139728

AC013283.8 Alternative exon 2 for first domain of phosphate carrier = PO4 carrier 2
142382 EQYSCDYGSGRFFILCGLGGIISCGTTHTALVPLDLVKCRMQV 142510

AL356977.7 chromosome 1 phosphate carrier related 68% pseudogene
64828 TAIVSLHLVKCRMQVDPGWYKGVLSGFGVTVHSDGLCGLARGCAQTFFGYSLQGFF 64661
64660 KFGLYEVFKIRSAELLGPEKAYGWRAGLYLFAWASAEFFPDVSLAPMEAVKV 64505
64504 RVQTRPGYASTLRATASRM*GEEGLWAFYKGVAPLWLRQIPYTMMKFACFERTV 64343

178 OXOGLUTARATE/MALATE HUM AC032038.2

MAATASAGAGGMDGKPRTSPKSVKFLFGGLAG                                      (PHASE 2 INTRON)
MGATVFVQPLDLVKNRMQLSGEGAKTREYKTSFHALTSILKAEGLRGIYTG                   (PHASE 2 INTRON)
LSAGLLRQATYTTTRLGIYTVLFERLTGADGTPPGFLLKALIGMTAGATGAFVGTPAEVALIRMTADGR (PHASE 1 INTRON)
LPADQRRGYKNVFNALIRITREEGVLTLWR                                        (PHASE 0 INTRON)
GCIPTMARAVVVNAAQLASYSQSKQFLLDSGYFSDNILCHFCASMISGLVTTAASMPVDIAKTR      (PHASE 1 INTRON)
IQNMRMIDGKPEYKNGL                                                     (PHASE 0 INTRON)
DVLFKVVRYEGFFSLWKGFTPYYARLGPHTVLTFIFLEQMNKAYKRLFLSG*

184 F11430 = UCP4 AC008104.3 chromosome 18 
EST AV652060.1 covers missing intron joint at MG(DLT)TYDTVKHYL

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELA                                 (PHASE 1 INTRON)
TFPLDLTKTRLQMQGEAALARLGDGARESAPYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVV     (PHASE 1 INTRON)
YSGGRMVTYEHLREVVFGKSEDEHYPLW                                         (PHASE 2 INTRON)
KSVIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPLR                            (PHASE 2 INTRON)
FRGVHHAFAKILAEGGIRGLWAGWVPNIQRAALVNMGD                               (PHASE 1 INTRON)
LTTYDTVKHYLVLNTPLEDNIMTHGLSS                                         (PHASE 2 INTRON)
LCSGLVASILGTPADVIKSRIMNQPRDKQGR                                      (PHASE 2 INTRON)
GLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRM                                   (PHASE 0 INTRON)
TPWSMVFWLTYEKIREMSGVSPF

188 UCP1 U28480     

MGGLTASDVHPTLGVQLFSAGIAACLADVITFPLDTAKVRLQ                           (PHASE 0 INTRON)
VQGECPTSSVIRYKGVLGTITAVVKTEGRMKLYSGLPAG
LQRQISSASLRIGLYDTVQEFLTAGKET                                         (PHASE 1 INTRON)
APSLGSKILAGLTTGGVAVFIGQPTEVVKVRLQAQSHLHGIKPRYTGTYNAYRIIATTEGLTGLWKG  (PHASE 1 INTRON)
TTPNLMRSVIINCTELVTYDLMKEAFVKNNILAD                                   (PHASE 1 INTRON)
DVPCHLVSALIAGFCATAMSSPVDVVKTRFINSPPGQYKSVPNCAMKVFTNEGPTAFFKG         (PHASE 2 INTRON)
LVPSFLRLGSWNVIMFVCFEQLKRELSKSRQTMDCAT*

193 UCP3 ISOLOG THC182634 AF050113.1 UCP3, AC073645.2 chromosome 11

MVGLKPSDVPPTMAVKFLGAGTAACFADLVTFPLDTAKVRLQ                           (PHASE 0 INTRON)
IQGENQAVQTARLVQYRGVLGTILTMVRTEGPCSPYNGLVAG
LQRQMSFASIRIGLYDSVKQVYTPKGADN                                        (PHASE 1 INTRON)
SSLTTRILAGCTTGAMAVTCAQPTDVVKVRFQASIHLGPSRSDRKYSGTMDAYRTIAREEGVRGLWKG (PHASE 1 INTRON)
TLPNIMRNAIVNCAEVVTYDILKEKLLDYHLLTD                                   (PHASE 1 INTRON)
NFPCHFVSAFGAGFCATVVASPVDVVKTRYMNSPPGQYFSPLDCMIKMVAQEGPTAFYKG         (PHASE 2 INTRON)
FTPSFLRLGSWNVVMFVTYEQLKRALMKVQMLRESPF

194 UCP2  HUMAN ISOLOG U82819 AC024029.3 chromosome 11 

MVGFKATDVPPTATVKFLGAGTVACIADLITFPLDTAKVRLQ                           (PHASE 0 INTRON)
IQGESQGPVRATASAQYRGVMGTILTMVRTEGPRSLYNGLVAG
LQRQMSFASVRIGLYDSVKQFYTKGSEH                                         (PHASE 1 INTRON)
ASIGSRLLAGSTTGALAVAVAQPTDVVKVRFQAQARAGGGRRYQSTVNAYKTIAREEGFRGLWKG    (PHASE 1 INTRON)
TSPNVARNAIVNCAELVTYDLIKDALLKANLMTD                                   (PHASE 1 INTRON)
DLPCHFTSAFGAGFCTTVIASPVDVVKTRYMNSALGQYSSAGHCALTMLQKEGPRAFYKG         (PHASE 2 INTRON)
FMPSFLRLGSWNVVMFVTYEQLKRALMAACTSREAPF* 

198 AL035423.4 BMCP1 brain mito carrier protein 1
MGIFPGIILIFLRVKFATAAVIVSGHQKSTTVSHE
MSGLNWKPFVYGGLASIVAEFGTFPVDLTKTRLQVQGQSIDARFKEIKYRGMFHALFRICKEEGVL  
ALYSGIAPALLRQASYGTIKIGIYQSLKRLFVERLEDETLLINMICGVVSGVISSTIANPTDVL  
KIRMQAQGSLFQGSMIGSFIDIYQQEGTRGLWRGVVPTAQRAAIVVGVELPVYDITKKHL 
ILSGMMGDTILTHFVSSFTCGLAGALASNPVDVVRTRMMNQRAIVGHVDLYKGTV   
DGILKMWKHEGFFALYKGFWPNWLRLGPWNIIFFITYEQLKRLQI* 

AC025517.2 chromosome 4 89% TO SEQ 198 (pseudogene)
      MSRLNWKPFVYDRLASITAEFGTFPMDLAKTRLQVQGQSIDVRFK
72444 ETKYRRMFHALFWIYKAEGGLALYSGIAPVLQRQASYGTIKIGIYQSLKQLSVERLE 72274
72273 DETLLINMICGVVSGVIFSTIANPTDVLKIRMQAQGSLFQGSMIGSFIDIY 72121
72120 QQEGTRGLWRSVVPTAQHAAIVVGVELPVYDFTKKHLILSGMMEDTTLTHFVSSFT 71953
71952 YGLAGALASNPGDVAGTHVMNQRAIVGHVDLYKGTLDGILKMWKHEGFF 71806
      AFLYSKGFWPNWLWRGPWNIIL*ITYE*LKRL*I*