>From dnelson@XXXX Mon Apr 10 14:53:50 2000 ... I have been teaching a bioinformatics course here at UT and one task we took on as a course was the discovery and assembly of all the mitochondrial carriers in Drosophila. We found 44, and there is a report in Genbank of another very distant member, but it is very weak. I sent this information to Marian and Mark Adams, but it was not updated in the paper, which claims only 38 mitochondrial carriers. I attach these sequences for you. ... [Note the correspondence between these and CG's was established by MA April 15 2000 by matches with aa_gadfly.dros0321] =============================================================================== March 7, 2000 This is the completed Drosophila mitochondrial carrier set. There are 44 mito carriers in Drosophila. The mito carrier motif is P(hyd)(E,D)(hyd)(hyd)(K,R)X(K,R)(hyd)Q where (hyd) = a hydrophobic amino acid. Some variation is allowed as in the second motif of seq. 3 below. Mito carrier motifs in the sequences are underlined. There should be three, spaced about 100 amino acids apart. 1. AC014984 comp(11159-11678, 9307-9415, 9050-8759) two introns ESTs AI107181, AA698455, AI388998, AA696852, AA441566 Sun Young Moon >CG1628|FBan0001628|CT4364|FBan0001628 last_updated:000321 MHGGGTGNNINFVEGLIDFLAGSLGGAAQVYVSQPLDTVKVKLQTFPEAYRGMLDCFLST YRKDGVLRGLYAGSVPAVFANVAENSVLFAAYGGCQKFVAFCVGKETAGDLTTVQNACAG SLAACFSTLTLCPTELIKCKLQALREMKNFVEPAHPQDIRTPWTLTRYIWRTEGIRGFYR GLSSTFLREMPGYFFFFGSYEGTRELLRRDDQSKDDIGPLRTMIAGAIGGVCLWTSTFPA DVIKSRIQVKNLNESMFAVGADIVRREGVLALYRGLLPSVLRTIPATATLFVVYEYTKRA LSATL* 2. AC013977 comp(58168-59166) = AC008140, AC009219, AC009984, AC009741 no introns ESTs AI258994, AI258107 David Nelson >CG6608|FBan0006608|CT20552|FBan0006608 last_updated:000321 MAVTTGSTSEATTTTTPVPRRKHSTREQLHQMLAGGLSAAITRSTCQPLDVLKIRFQLQV EPLGKNAAKEGPGALTSKYTSIGQAVKTIYREEGMLAFWKGHNPAQVLSIMYGICQFWTY EQLSLMAKQTSYLADHQHLSNFLCGAAAGGAAVIISTPLDVIRTRLIAQDTSKGYRNATR AVSAIVRQEGPRGMYRGLSSALLQITPLMGTNFMAYRLFSDWACAFLEVSDRSQLPTWTL LGLGASSGMLSKTIVYPFDLIKKRLQIQGFESNRQTFGQTLQCHGVWDCLRLTVRQEGVR GLYKGVAPTLLKSSMTTALYFSIYDKLKQVRF* 3. AC019974 comp(31626-32546) = AC009392 no introns ESTs AI513226, AI258372, AI135368, AI106672, AI514125, AA803450, AI064545, AI405779, AI388401, AI133912, AI063819, AI106827, AI064563, AI135996, AI108894, AI107113 David Nelson colt gene Y12495 = AC019974, AC009392 Hartenstein,K., Sinha,P., Mishra,A., Schenkel,H., Torok,I. and Mechler,B.M. The congested-like tracheae gene of Drosophila melanogaster encodes a member of the mitochondrial carrier family required for gas-filling of the tracheal system and expansion of the wings after eclosion Genetics 147 (4), 1755-1768 (1997) MTTTENVSTERKANPVKSFLTGGFGGICNVLSGHPLDTIKVRLQTMPRPAPGEQPLYRGT FDCAAKTIKNEGVRGLYKGMSAPLTGVAPIFAMCFAGYALGKRLQQRGEDAKLTYPQIFV AGSFSGLFSTLIMAPGERIKVLLQTQQGQGGERKYNGMIDCAGKLYKEGGLRSVFKGSCA TMLRDLPANGLYFLVYEALQDVAKSKSETGQISTASTIFAGGVAGMAYWILGMPADVLKS RLQSAPEGTYKHGIRSVFKDLIVKDGPLALYRGVTPIMLRAFPANAACFFGIELANKFFN IVAPNF* 4. AC020113 comp(29610-29643, 29538-29076, 28424-29000) = AC009849 two introns ESTs AI135358, AI135651, AI404820, AI513234, AI944743, AI945049 SUN YOUNG MOON >CG4995|FBan0004995|CT16036|FBan0004995 last_updated:000321 >CG18626|FBan0018626|CT41430|FBan0018626 last_updated:000321 MVVDFVAGLLGGAAGVLVGHPFDTVKVHLQTDDPRNPKYKGTFHCFRTIVQRDKFIGLYRG ISSPMGGIGLVNAIVFGVYGNVQRLSNDPNSLTSHFFAGSIAGVAQGFVCAPMELAKTRL QLSTQVDSGIKFTGPIHCLKYIVKTEGIRGAFKGLTATILRDIPGDYFVSFEYLMRQVET PGVAYTLMAGGCAGMSSWLACYPIDVVKTHMQADALGANAKYNGFIDCAMKGFRNEGPQY FFRGLNSTLIRAFPMNAACFFVVSWVLDICNAKGGMDSVMHSDQPLTLVNLDNKSQADLE ATAPTVEEVVRKIITDNAMSHQYVSTPKDVVHSHYTSSTINIPKESKARLASDCNLK* 5. AC010580 COMP (45288-45333, 45386-45652, 45716-46131, 46244-46409) THREE introns = AC017540, AC010579. ESTs AA141259, AA141260 Jianning Tao >CG4743|FBan0004743|CT15283|FBan0004743 last_updated:000321 MAAELGLESAAGSVAIKMQEPVNKLKFFHALVAGGVAGMVVDIALFPIDTVKTRLQSELG FWRAGGFRGIYKGLAPAAAGSAPTAALFFCTYECGKQFLSSVTQTKDSPYVHMAAASAAE VLACLIRVPVEAKQRSQTLQGNKQSGLQILLRAYRTEGLKRGLYRGFGSTIMREIPFSLI QFPLWEYFKLQWTPLTGFDSTPFSVALCGAVAGGISAGLTTPLDVVKTRIMLAERESLNR RRSARRILHGIYLERGFSGLFAGFVPRVLWITLGGAFFFGFYDLTTRILGATSTDH* 6. AC013100 20082-20482, 20573-21011 = AC008360 one intron Note: this gene is adjacent to AC012807 (2 genes on fragment) ESTs AI238956, AI404603, AI063021, AA695373, AI404222, AI401861, AA141396 Yuan Gao >CG8790|FBan0008790|CT25340|FBan0008790 last_updated:000321 MPHQERKSMWFFGGLASVGAAMVTHPLDLIKVTLQTQQGHLSVAQLIPKLAREQGVLVFY NGLSASVLRQLTYSTARFGVYEAGKKYVNTDSFGGKVALAGASGLVGGIVGTPADMVNVR MQNDVKLPPQQRRNYNNAFDGLVRVYRQEGFKRLFSGATAATARGILMTIGQIAFYDQTK IYLLATPYFQDNLVTHFTASLVAGTIATTLTQPLDVLKTRSMNAKPGEFNGLWDIVKHTA KLGPLGFFKGYVPAFVRLGPHTIITFVFLEQLRLKFGTLN* 7. AC017347 comp(23830-24078, 23632-23768, 22686-23213) = AC007588, AC007852 two introns no ESTs David Nelson >CG8323|FBan0008323|CT24577|FBan0008323 last_updated:000321 MATSDFVLGGLASVGATFFTNPIEVIKTRIQLQGELAARGTYVEPYKGIVNAFITVAKND GITGLQKGLAPALYFQFIINSFRLSIYSEAMERRWMHNRKGEVSYGMGLLWGAIGGVVGC YFSSPFFLIKTQLQSQAAKQIAVGYQHAHTSMTDALRQIYSRNGVRGLWRGSVAALPRAA LGSGAQIATFGKTKALLVQYDLVTQPTLNSFSAGLIAGSIMSVAITPPDVITTRLYNQGV DAEGRGLLYRGWLDCFVKILRSEGVYGMYKGFWANYLRIAPHSTLVLLFFDELVAVRTKY SNQ* 8. AC017347 comp(22017-22264, 21802-21937, 21164-21694 = AC007588, AC007852 two introns no ESTs David Nelson >CG18327|FBan0018327|CT41605|FBan0018327 last_updated:000321 MATSDFVLGGVAAMGAGVFTNPVEVIKTRIQLQGELAARGSHAQPYKSVFQAFVTVAKND GILGLQKGLAPALCFQFVINSFRLSIYTHAVEKGWVHNNKGEISFAKGMLWGALGGVVGS YCASPFFLIKTQLQAQAAKQIAVGYQHQHASMSDAIRKIYRKNGVFGLWRGSLANVSRAT VASAVQIAVFGQAKSLLKENGVVTHPTILSFCSGLAAGSFVSLAITPLDVVTTRLYNQGV DAQGRGIYYRGWLDCVLTILRSEGVYGLYKGFWPIYLRSAPYSTLVLLFFDELIALREKY DLHY* 9. AC017347 comp(20255-20502, 19249-19788, 19836-19971) = AC007588, AC007852 two introns ESTs AI530949, AI530942, AI259350 first two ESTs have retained an intron David Nelson >CG18324|FBan0018324|CT41583|FBan0018324 last_updated:000321 MTKSDFVLGGTAAMGAVVFTNPIDVVKTRMQLQGELAARGTYVKPYRHLPQAMLQIVLND GLLALEKGLAPALCYQFVLNSVRLSVYSNALELGYLQNADGSISFYRGMFFGALGGCTGT YFASPFYMIKAQQHAQAVQSIAVGFQHKHTSMMDALLHIYRTNGISGFWRAALPSLNRTL VASSVQIGTFPKAKSLLKDKGWITHPVLLSFCAGLSSGTLVAVANSPFDVLTTRMYNQPV DEKGRGLMYKGLVDCFTKIWRTEGIHGMYKGFWPIYFRSAPHTTLTFVFFEKLLHLRDRY VFSQRRN* 10. AC017377 comp(130619-131276, 130194-130549) one intron no ESTs David Nelson >CG18340|FBan0018340|CT41647|FBan0018340 last_updated:000321 MGKGVNTVFRPAEWDNSEEKERPKLEYLVTNKKTPPVELYLTAFASACSAEIVGYPFDMC KTRMQIQGEIASRVGQKAKYRGLLATAMGIVREEGLLKLYGGISAMLFRHSLFSGIKMLT YDYMREKMIVPDEDGRPQLSFLGSCISGVLAGATASVLTNPTELIKIQMQMEGQRRLRGE PPRIHNVLQALTSIYRTGGVVGLWKGTVPNTWRSALVTIGDVSCYDFCKRFLIAEFDLVD NREVQFVAAMTAGVADAILSLPADVVKSRIMNQPTDEQGRGIHYKGSLDCLSRLVREEGF LAMYKGFIPYWMRVGPASVVFWMTFEQIRRFRGSEGY* 11. AC017377 comp(131488-132495) no introns EST AI947102 David Nelson >CG9064|FBan0009064|CT26034|FBan0009064 last_updated:000321 MDKAERDYWHLRSLEIEEEPRFPPTNVADPLTARNLFQLYVNTFIGANLAESCVFPLDVA KTRMQVDGEQAKKTGKAMPTFRATLTNMIRVEGFKSLYAGFSAMVTRNFIFNSLRVVLYD VFRRPFLYQNERNEEVLKIYMALGCSFTAGCIAQALANPFDIVKVRMQTEGRRRQLGYDV RVNSMVQAFVDIYRRGGLPSMWKGVGPSCMRACLMTTGDVGSYDISKRTFKRLLDLEEGL PLRFVSSMCAGLTASVLSTPADVIKSRMMNQPVDESGKNLYYKNSLDCVRKLVREEGVLT LYKGLMPTWFRLGPFSVLFWLSVEQLRQWEGQSGF* 12. AC020205 8898-9190, 9260-10024, 10091-10257, = AC006496 two introns 33 ESTs AI517048, AI517037, AI404051, AI238330, AI389263, AI063177 AI292677, AI387048, AI404547, AI294348, AI389567, AI388110 AI293132, AI403941, AI389245, AI114161, AI387833, AI293024 AI386902, AI064405, AI063393, AI403869, AI388005, AI292968 AI514217, AI402309, AI134761, AA949677, AA817529, AA948886 AI515546, AI533469, AI401873 Robert Moxley >CG9090|FBan0009090|CT25968|FBan0009090 last_updated:000321 MFKSLFDAAQNSTFKSPFTSVNCQSATPTSAPTSTAVVTPTLKDVAPRQLTRNHNIAAAA VAEGDSCEFGSNHYFLLCGLGGIISCGSTHTMVVPLDLVKCRLQVDPAKYKSVFTGFRIS LAEEGVRGLAKGWAPTFIGYSMQGLCKFGLYEVFKKVYGDAIGEENAFLYRTGLYLAASA SAEFFADIALAPMEAAKVKIQTTPGFAKTLREALPKMTAQEGVTAFYKGLVPLWMRQIPY TMMKFACFERTVELLYKYVVPKPRADCTKGEQLVVTFAAGYIAGVFCAIVSHPADTVVSK LNQAKGASALDVAKQLGWSGLWGGLVPRIVMIGTLTAAQWFIYDAVKVFLRMPRPPPPEM PESLKKKLGVTGEQ* 13. AC018265 gene 1 8643-8701, 9520-9654, 9713-10198, 10259-10402, 10456-10600 = AC007889 4 introns ESTs AI108449, AI531531, AA804104, AI404803 these cover up to the third motif at PFDVVK, and the 3 prime UTR. The end of this gene matches a C.elegans EST T01651 at 78% identity, thus confirming the end of this gene. David Nelson >CG18347|FBan0018347|CT41675|FBan0018347 last_updated:000321 MSSSATIATPLPQPQHQQFALLPKIINGGIAGIIGVTCVFPLDLVKTRLQNQQIGPNGER MYNSMFDCFRKTYKAEGYFGMYRGSGVNILLITPEKAIKLTANDYFRHKLTTKDGKLPLT SQMVAGGLAGAFQIIVTTPMELLKIQMQDAGRVAAAAKLAGKTVEKVSATQLASQLIKDK GIFGLYKGIGATGLRDVTFSIIYFPLFATLNDLGPRRNDGSGEAVFWCSFLAGLAAGSTA ALAVNPFDVVKTRLQAIKKADGEKEFKGISDCIT*KTLKHEGPTAFFKGGLCRMIVIAPL FGIAQTVYYLGVAEGLLGYQKK* 14. AC018265 11201-11256, 11319-11453, 11525-12293 gene 2 = AC007889, AF145637 ESTs AI945223, AI946852 David Nelson >CG12201|FBan0012201|CT10663|FBan0012201 last_updated:000321 MLEQVEQKNQEQKKPQKFNVFPKIINGGVAGIIGVACVYPLDMVKTRLQNQTIGPNGERM YTSIADCFRKTIASEGYFGMYRGSAVNIVLITPEKAIKLTANDFFRYHLASDDGVIPLSR ATLAGGLAGLFQIVVTTPMELLKIQMQDAGRVAAADRAAGREVKTITALGLTKTLLRERG IFGLYKGVGATGVRDITFSMVYFPLMAWINDQGPRKSDGSGEAVFYWSLIAGLLSGMTSA FMVTPFDVVKTRLQADGEKKFKGIMDCVNRTLKEEGISAFFKGGLCRIMVLAPLFGIAQM FYFLGVGEKILGIERTKSV* 15. AC009385 97280-97481, 97544-97787, 97852-97984, 98056-98388 = AC012807 three introns, no ESTs Robert Moxley >CG18363|FBan0018363|CT41706|FBan0018363 last_updated:000321 MPVFDDHFLGDCHDEPEGLLPRWWFGGFASMCVAFAVAPIDIVKTHMQIQRQKRSILGTV KRIHSLKGYLGFYDGFSAAILRQMTSTNIHFIVYDTGKKMEYVDRDSYLGKIILGCVAGA CGSAFGIPTDLINVRMQTDMKEPPYKRRNYKHVFDGLIRIPKEEGWKALYKGGSVAVFKS SLSTCSQIAFYDIIKTEVRKNISVNDGLPLHFLTSLGTSIISSAITHPLDVVRTIMMNSR PGEFRTVFQASVHMMRFGVMGPYRGFVPTIVRKAPATTLLFVLYEQLRLHFGICSLGGEK YN* 16. AC012788, AC008286, AC008299 mRNA seq in Genbank Y18197 ARALAR1 = AC012788, AC008286, AC008299, AC008317, AC008336 >CG2139|FBan0002139|CT6974|FBan0002139 last_updated:000321 MPLTKSLPNSPSLLKRAGTEKLREVFLKYASIQKNGEHYMTSEDFVRKFLGLFSESAFND ESVRLLANIADTSKDGLISFSEFQAFEGLLCTPDALYRTAFQLFDRKGNGTVSYADFADV VQKTELHSKIPFSLDGPFIKRYFGDKKQRLINYAEFTQLLHDFHEEHAMEAFRSKDPAGT GFISPLDFQDIIVNVKRHLLTPGVRDNLVSVTEGHKVSFPYFIAFTSLLNNMELIKQVYL HATEGSRTDMITKDQILLAAQTMSQITPLEIDILFHLAGAVHQAGRIDYSDLSNIAPEHY TKHMTHRLAEIKAVESPADRSAFIQVLESSYRFTLGSFAGAVAPTVVYPIDLVKTRMQNQ RAGSYIGEVAYRNSWDCFKKVVRHEGFMGLYRGLLPQLMGVAPEKAIKLTVNDLVRDKLT DKKGNIPTWAEVLAGGCAGASQVVFTNPLEIVKIRLQVAGEIASGSKIRAWSVVRELGLF GLYKGARACLLRDVPFSAIYFPTYAHTKAMMADKDGYNHPLTLLAAGAIAGVPAASLVTP ADAIKTRLQVVARSGQTTYTGVWDATKKIMAEEGPRAFWKGTAARVFRSSPQFGVTLVTY ELLQRLFYVDFGGTQPKGSEAHKITTPLEQAAASVTTENLDHIGGYRAAVPLLAGVESKF GLYLPRFGRGVTAASPSTATGS* 17. AC018185 = 81% identical to AC020205 mRNA = AF137371 >CG4994|FBan0004994|CT15958|FBan0004994 last_updated:000321 MFSSFFETARNSPFRTPMSMARCDAAAPVVEPQPVEGRQIAAAATPVANQQDSCEFGSTK YFALCGIGGILSCGTTHTFVVPLDLVKCRLQVDQAKYKNLVHGFKVTVAEEGARGLAKGW FPTLLGYSAQGLCKFGLYELFKVKYAEIIGEENAYLYRTSLYLAASASAEFFADIVLAPF EAAKVKIQTIPGYANNFREAVPKMLKEEGVNAFYKGLVPLWMRQIPYTMMKFACFERTVE LLYKYVVPKPRADCTKGEQLIVTFAAGYIAGVFCAVVSHPADVVVSKLNQAKGASAISVA KSLGFSGMWNGLTPRIIMIGTLTALQWFIYDGVKVALGIPRPPPPEMPASLKAKQH* 18. AC012929 = Y10618 join(5191..5487,5563..5690,5768..6242) alternative splicing of 5 prime UTR annotated gene in Genbank >CG16944|FBan0016944|CT37582|FBan0016944 last_updated:000321 MGKDFDAVGFVKDFAAGGISAAVSKTAVAPIERVKLLLQVQHISKQISPDKQYKGMVDCF IRIPKEQGFSSFWRGNLANVIRYFPTQALNFAFKDKYKQVFLGGVDKNTQFWRYFAGNLA SGGAAGATSLCFVYPLDFARTRLAADTGKGGQREFTGLGNCLTKIFKSDGIVGLYRGFGV SVQGIIIYRAAYFGFYDTARGMLPDPKNTPIYISWAIAQVVTTVAGIVSYPFDTVRRRMM MQSGRKATEVIYKNTLHCWATIAKQEGTGAFFKGAFSNILRGTGGAFVLVLYDEIKKVL* 19. AC012929 = Y10618 join(7347..7667,7744..7871,8169..8643) alternative splicing of 5 prime UTR annotated gene in Genbank >CG1683|FBan0001683|CT4708|FBan0001683 last_updated:000321 MGDEGGGGGHGKGDLKSFLMDFMMGGVSAAIAKTAVAPIERVKLILQVQEVSKQIAADQR YKGIVDCFIRIPKEQGFSSFWRGNLANVIRYFPTQALNFAFKDVYKSVFLGGVDKHKQFW RHFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGKGGNREFNGLIDCLMKVIKSDGPI GLYRGFIVSVQGIVIYRAAYFGFYDTCRDFLPNPKSTPFYVSWAIAQVVTTVAGIASYPF DTVRRRMMMQSGLKKSEMVYKNTAHCWLVIAKQEGIGAFFKGALSNIIRGTGGALVLALY DEMKKYF* 20. AC018177 COMP(38745-38861, 38252-38684, 38006-38182, 37773-37945) three introns ESTs AI110167, AA979538, AA951938 Ying Shen >CG3476|FBan0003476|CT11691|FBan0003476 last_updated:000321 MEEVEISTEKKSNPVKSFIAGGVGGMCNVLVGHPLDTIKVRLQTMPTPPPGQPPRYKGVI DCAARTFRYEGFRGFYRGISAPLVGVTPIYAVDFAVYAAGKRLFQTDDHIRLTYPQIFAA GALAGVCSALVTVPTDRIKVLLQTQTVSNGPLLYNGTIDTAAKLYRQGGIRSLFKGTCAC ILRDSPTGFYFVTYEFLQELARKKSANGKISTTSTILSGGTAGIVFWTLAVPFDVLKSRL QSAPEGTYKHGIRSVFRNLMATEGPKALFRGILPILLRAFPSTAAVFFGVELTNDLLKA* 21. AC005709 = AC006243, AC009183, AC009354, AC007884, AC018192 Muthiah Kumaraswami no introns no ESTs >CG2857|FBan0002857|CT9770|FBan0002857 last_updated:000321 MPENSVVVQLMQAVGGGIAGAATRTITQPLDVLKIRFQMQVEPVTNHKGSKYRGVIHAFK SVYAEEGMRGMFRGHNSGQVLSISYALVQFWSYEQLRSMAHQFDYWRERPFLMFFICGGI AGCLGAVAAQPFDVVRTQMVAADPSSRRSQMNTFTGLRKVYKMEGWMGLSRGLPFTLVQV FPLVGANFLFYKYLNAAVLMAKPPDQRQEIHGAFLFLNGALSGVLAKMIVYPADLLKKRI QLMAFKQERKTFGRNPECPTILGCITTTFREEGIGGFYKGMLPTLLKAGLMSAVYFSIYD MFKRHYIAPMKEAEKNRQKLGKH* 22.AC020327 13228-13367, 13433-14314 one intron Swapna Menon >CG4392|FBan0004392|CT14260|FBan0004392 last_updated:000321 MNVPDDFTQKEMQTGLWWRHLVAGGIAGAVSRTCTAPLDRIKVYLQVQTQRMGISECMHI MLNEGGSRSMWRGNGINVLKIAPETAFKFAAYEQMKRLIRGDDGSRQMSIVERFYAGAAA GGISQTIIYPMEVLKTRLALRRTGQYAGIADAAVKIYKQEGVRSFYRGYVPNILGILPYA GIDLAVYETLKRRYIANHDNNEQPSFLVLLACGSTSSTLGQLCSYPLALVRTRLQAQGKT IIQGHNWFYRNAIIILPFSLAAAETIANQKRKTQIPLKSSDAHSGEETMTGLFRKIVRQE GLTGLYRGITPNFLKVLPAVSISYVVYEYTSRALGIKMS* 23. AC020102 one intron 2 ESTs comp(33652-34401, 35642-35870) Ajit Kulkarni >CG5254|FBan0005254|CT16777|FBan0005254 last_updated:000321 MAGQQHDISHAKRAAFQVLAGGSAGFLEVCIMQPLDVVKTRIQIQATPAPNAAALGEVHY NGVFDCFAKMYRHEGISSYWKGIMPPILAETPKRAIKFLVFEQTKPLFQFGSPTPTPLTF SLAGLTAGTLEAIAVNPFEVVKVAQQADRQKKMLSTFAVAKGIIQQDGLGFSGLNKGITA TMGRNGVFNMVYFGFYHSVKNVVPEYKESHLEFLRKVTIGFLAGTLACFVNIPFDVAKSR IQGPQPVPGQIKYRGTLSSMGIVYREEGFRALYKGLVPKIMRLGPGGAILLLVFEYYYDY LLHNYS* 24. AC017981 17489-17642, 19697-19850, 19909-20263, 20652- 20814, 21800-21942 four introns no ESTs Allen Sickmier >CG8026|FBan0008026|CT6130|FBan0008026 last_updated:000321 MNPIKAQSTGSPKKFNVFAHVKYEHLVAGVSGGVVSTLILHPLDLIKIRFA*VNDGRTAT VPQYRGLSSAFTTIFRQEGFRGLYKGVTPNVWGSGSSWGLYFML*YNTIKTFIQGGNTTM PLGPTMNMLAAAESGILTLLLTNPIWVVKTRLCLQCDAASSAEYRGMIHALGQIYKEEGI RGLYRGFVPGMLGVSHGAIQFMTYEELKNAYNEYRKLPIDTKL*ATTEYLAFAAVSKLIA AAATYPYQVVRARLQDHHHRYNGTWDCIKQTWRYERMRG*FYKGLKASLTRVVPACMVTF LVYENVSHFLLARRKRIETKEDASDV* 25. AC020252 comp(11793-12137, 12455-12941) one intron EST AI294955 Ji Ma >CG4323|FBan0004323|CT14141|FBan0004323 last_updated:000321 MSYNQRRIARWYFGGLASSMAAMVTHPIDLIKVLIQTQAEKLSVFQTTRKIVKEQGPLAM YNGISASMLRQYTYTLARFGIYSVGSGAMDTSTMAGKTCLAAIAGGIGGFVGAPADLINV RLQNDVKLPPEKRRNYKHAIDGLVRITREEGWKNLFNGSSMIALRGAFMTVGQIAFYEQS KSQMIKLGMPDYMGTYILASMISSVVATTLTQPIDVVKTRRMNAAPGEYSGLGDVFVKTS KEGPLAFFKGYVPSLSRLLPHTVLLFLGLEYLRTHFGYLPEPKQTGAFYRYDDVDD* 26. AC008182 comp(95417-96317) AC017552 AC009523 no intron Ji Ma >CG9582|FBan0009582|CT27052|FBan0009582 last_updated:000321 MATRSEETVRSLAHWQFLAGGLSGFIEIICFHPLDVVKTRMQIQGAHPFGGEVVYTCPLD AIVKIYRYEGLSSLWKGIVPPICVETPKRGGKFLMYESLKPYFQFGAPQPTPLTHAMSGS MAAILESFLVNPFEVVKITQQAHRGKRLKTLSVVKYIIKHDGYGIKGLYRGITALVARNA VFHFGFFGFYNALKDIVPSPEDKTYNILRKVIIAGLASSLACVMSVTLDMAKCRIQGPQP VKGEVKYQWTISTIKSTFKEEGFRSLFKGLGAMILRVGPGGAMLLVTYEYLFEFLKSQNI \* 27. AC018172 comp(9696-10092, 10536-11242) one intron EST AI946682 Yongkai Mo >CG2616|FBan0002616|CT8885|FBan0002616 last_updated:000321 MGRGPRPRSFTGGGGGGGTGGSGGSGGTSGGNNLKPERSAREDDAINRLTDSKSSHRKLL SDPRFQIRPLQQVISACTGAMITACFMTPLDVIKTRMQSQQSPAHKCFFYSNGLMDHLFA SGPNGSELASLRQRPQFSSSWDALMKISRHEGLAALWSGLGPTLVSALPSTIIYFVAYEQ FKARYLQIYESHYNKRFTGLNVFHTSRDTKKSLPSVVPMMSGVTARICAVTVVSPIELVR TKMQAQRQTYAQMLQFVRSVVALQGVWGLWRGLRPTILRDVPFSGIYWPIYESLKQNLGH GSQPSFSLSFLAGVMAGTVAAIVTTPFDVVKTHEQIEFGERVIFTDSPARDFGKKSTFSR LTGIYRTHGVRGLFAGCGPRLLKVAPACAIMISTFEYSKSFFFHYNVRHHNEALLLDNPK DTTVEDDDIE* 28. AC015137 = AC009740 comp(10280-10858,10969-11292) one intron ESTs: AA978681, AA104694, AI389649, AI294952, AI294952, AI514635, AI239104, AI402446 Ryan Kendall >CG6782|FBan0006782|CT21037|FBan0006782 last_updated:000321 MDRSAFASLVSPYRRRPWMTEHGAAAADSGQVGLKGIVAGGITGGIEICITYPTEYVKTQ LQLDEKGAAKKYNGIFDCVKKTVGERGFLGLYRGLSVLVYGSIPKSAARFGAFEFLKSNA VDSRGQLSNSGKLLCGLGAGVCEAIVAVTPMETIKVKFINDQRSGNPKFRGFAHGVGQII KSEGISGIYKGLTPTILKQGSNQAIRFFVLESLKDLYKGDDHTKPVPKLVVGVFGAIAGA ASVFGNTPLDVVKTRMQGLEASKYKNTAHCAVEILKNEGPAAFYKGTVPRLGRVCLDVAI TFMIYDSFMDLFN* 29. AC010671 gene 3 introns shown by \* EST AI389787 >CG14208|FBan0014208|CT33821|FBan0014208 last_updated:000321 MDPLRLTTLILSADPRYRIKPMQQVVSALVGGLITTFV*VTPLEVVKTRVQTQHAIRQRP TVSKLCYVYHNGLMTHVCRSSDICVPKPGRDPQNLRPLRGAM*DAFVKIVCTSGFSGLWA GLSPTLVSALPSTIIYFLTYEYIKNSLSHIYLVSQYYVPMASGICSRTIVVTAITPIEMV RIKMQSEYMTYAELWRVLRSLIRQHGILGLWRGWPPTVMRDAPFSGTYWAVYEAIKRAFS VTEPTFLFSFLTGAISGAVATFVTMPFDLITTHTQIELGQDVL*PSVLSRMRQIYRLQGV RGLYVGVMPRMLRVVPACAIMISTFEYSKSFFFHYNLDLQEAGTYYVKCQ* 30. AC010671 gene 3 introns shown by \* EST AI402433 >CG14209|FBan0014209|CT33822|FBan0014209 last_updated:000321 MAAASSQNPSKATMTDPRFRIRPLQQVASACTGAMVTACF*MTPLDVIKTRLQAQQQALL SNKCFLYCNGLMDHICPCGPDTPNPAAAKPAPRFSGTI*DAFIKISRTEGIGSLWSGLSP TLISALPSTIIYFVAYEQFKARFTDIHHPIPFLVPLLAGVSGRILAVTCVSPVELIRTKM QSQRMTHAEMFGTIRQVVQSQGVLGLWRGLPPTILRDVPFSGIYWTCYEYLKSSFGVVEP TFSFSFAAGAISGSVAATITTPFDVVKTHEQIEFGEKFIFS*DNPPKQVATKSVAMRLAS IYRMGGVPAIFSGLGPRLFKVAPACAIMISSFEYGKSFFYHYNIDQHNRSNQATKGPGS* 31. AC019585 = AC009388, AC009249 introns shown by \* ESTS AI064276, AA439218, AI386586, AA698575, AI946686, AI945569 David Nelson >CG4963|FBan0004963|CT15898|FBan0004963 last_updated:000321 MNIDDYESLPTTSVGVNMTAGAIAGVLEHVVMYPLDSVKVR*MQSLSPPTKNMNIVSTLR TMITREGLL*RPIRGASAVVLGAGPAHSLYFAAYEMTKELTAKFTSVRNLNYVI*SGAVA TLIHDAISSPTDVIKQRMQMYNSPYTSVVSCVRDIYKREGFKAFYRAYGTQLVMNLPYQT IHFTTYEFFQNKMNLERKYNPPVHMAAGAAAGACAAAVTTPLDVIKTLLNTQETGLTRGM IEASRK*IYHMAGPLGFFRGTTARVLYSMPATAICWSTYEFFKFYLCGLDADQYKSSITG SSEPRKADYVLPRTTDEEQIDQEREAAKEKDTTATLHSAPTSVNASGAIKTVCELSTRPA GPTINLHTRHTDVKSPYERGFST* 32. AC012986 30851-31404, 31468-31597, 31665-31979 = AC007886, AC009733 two introns ESTs AI404722, AI516330, AI292585, AI542291 David Nelson >CG7943|FBan0007943|CT5716|FBan0007943 last_updated:000321 MKDDGAPNVAKTTDPPPPKRFPSGRAHSPHGDGEAGKLLHGSVFSKRFFGSFQWEEFACG CGAAFVNIAVTYPIYKMIFRQMLHGVPITSAFAQLRHEGLGFLYRGMLPPLAQKTISLSI MFGVVDGTRRYLVEDYRLNDYGAKVLAAVVAGSAESILLPFERVQTLLADSKFHQHFSNT QNAFRYVVSHHGYRELYRGLEPVFWRNGLSNALFFVLREEASVRLPKRKSVSTRTVQEF IAGAVIGASISTIFYPLNVIKVSLQSEMGQRSEGSWQACKRIYVERDRRIGNFYRGCPFN TGRSFISWGIMNTAYENLKKLMQQQPPLPLASE* 33. AC017153 = AC004442 EST AI259170 5 introns David Nelson >CG18317|FBan0018317|CT41569|FBan0018317 last_updated:000321 MAQNTADTLIHLIAGG*SAGTVGAVVTCPLEVVKTRLQSSTAFMTPSRLAENAGGGPANG GQSELLRPEQRRKLSTTILRNRSQPQ*IMAISHCGISSTTPKSMSIVQCL*RHIVQNEGP RALFKGLGPNLVGVAPSRAIYFCTYSQTKNTLNSLG*FVERDSPLVHIMSAASAGFVSST ATNPIWFVKTRMQLDYNSKVQMTVRQCIERVYAQGGVAAFYKGITASYFGICETMVHFVI YEFIKSKL*LEQRNQRHTDTKGSRDFLEFMMAGAVSKTIASCIPYPHEVARTRLREEGNK YNSFWQTLHTVWKEEGRAGLYR*GLATQLVRQIPNTAIMMATYEAVVYVLTRRFNNKSNE FYDF* 34. AC005445 COMP(19104-19517, 18486-18939) one intron EST AI944701 Brian Bothner >CG11196|FBan0011196|CT31268|FBan0011196 last_updated:000321 MGEDSSRRLPRWWFGGVCAAIAVTGTHPIDLIKVQLQTQSQADRKTVGEILKGIHERSGI LGFYNGISASWFRQLTYTTTRFALYEAGKDYVDTQKVSSKMALATFAGIVGGIVGVPGDV VTVRLQNDVKLPEEKRRSYKHVFDGLFRIYKEEGVSSLFRGTVPAVSRAVLLTIGTNAAY DQVKQMLKIATGAGEGVPLHFATSTIAGCIAVVITQPLDVIKTTFMNAQPGEFSGIGGAF LSTAKQGPLAFYKGFIPALIRVSPNTIITFVLYEQARMRFGYLPPDK* 35. AC019775 comp(52852-53040, 53240-53367, 54125-53426) = AC008205 2 introns ESTs AA697685, AA697686, AI402846, AI402855, AI403013 David Nelson >CG5805|FBan0005805|CT18202|FBan0005805 last_updated:000321 MTESSSSTKSRLAVAPTGVGGAAEGATYIRTIEWDMMNKTKFFPLSMLSSFSVRCCLFPL TVIKTQLQVQHKSDVYKGMVDCAMKIYRSEGVPGLYRGFWISSVQIVSGVFYISTYEGVR HVLNDLGAGHRMKALAGGGCASLVGQTIIVPFDVISQHAMVLGMSAHAGSKGDINPLGIK SWPGRSRLHISMDIGREIMRRDGFRGFYRGYTASLMAYVPNSAMWWAFYHLYQQELFRIC PVWSHLFIQCVAGSLGGFTTTILTNPLDIVRARLQVHRLDSMSVAFRELWQEEKLNCFFK GLSARLVQSAAFSFSIILGYETIKRIAVDEQYKHQIRW 36. AC017782 comp(21596-21725, 20691-21524) = AC007856 AC008218 one intron shown by \* ESTs AI512269, AI405068, AI260461, AI518659, AA697477, AI403621, AI297023, AI455634, AA803073, AA697231 David Nelson >CG1907|FBan0001907|CT2867|FBan0001907 last_updated:000321 MSATSVQEAPKKAVATNAIKFLFGGLSGMGATMVVQPLDL*VKTRMQISGAGSGKKEYRS SLHCIQTIVSKEGPLALYQGIGAALLRQATYTTGRLGMYTYLNDLFREKFQRSPGITDSM AMGTIAGACGAFIGTPAEVALVRMTSDGRLPVAERRNYTNVANALARITREEGLTALWRG SLPTVGRAMVVNMTQLASYSQFKTYFRHGPLQMEEGIKLHFCASMLSGLLTTITSMPLDI AKTRIQNMKMVDGKPEYRGTADVLLRVARQEGVFALWKGFTPYYCRLGPHTVLTFIILEQ LNQGYNKYVLGSNKSTGL* 37. AC020238 = AC007756 AC007757 three introns shown by \* no ESTs David Nelson >CG4241|FBan0004241|CT13930|FBan0004241 last_updated:000321 MSLRLFIHSRAFGYECILRFSIHFLPFTISVRQKIDQVVISLISGAAAGALAKTVIAPLD RTKINFQIRNDVPFSFRASLRYLQNTYANEGVLALWRGNSATMARIVPYAAIQFTAHEQW RRILHVDKDGT*NTKGRRFLAGSLAGITSQSLTYPLDLARARMAVTDRYTGYRTLRQVFT NWIKGPIAVGISFSTYDLIKAWLTELANLRRVEK* 38. AC014153 GENE 1 comp(17446-18381) = AC010665 ESTs AI405292, AI134015, AI389947 >CG18418|FBan0018418|CT41906|FBan0018418 last_updated:000321 MALVYGVEKKTVPTHMKFVMGGTSGMLATCIVQPLDLLKTRMQISGTLGTREYKNSFEVL SKVLKNEGILSLYNGLSAGLLRQATYTSAKMGVYQMELDWYRKNFGNYPSMVASMTMGIV AGAFGAMCGNPAEVALIRMMSDNRLIPEDRRNYKNVGDAFVRIVKDEGVVALWRGCLPTV GRAMVVNMVQLASYSLMKNQLHGYLSEGIPLHLTAALVSGLLTSVTSMPLDMAKTRIQQM KVIDGKPEYSGTIDVLKKVLKNEGAFAVWKGFTPYLMRMGPHTIFSFVFLEQMNKAYSKH MLSDSLSDSVP* 39. AC014153 GENE = 2 comp(16188-17093) no ESTs >CG7514|FBan0007514|CT2166|FBan0007514 last_updated:000321 MAYSIEKKSIPGYMMYINGGLAGMLGTCIVQPLDLVKTRMQISATTGEYKSSFDCLLKVF KNEGILALYNGLSAGLMRQATYTTARMGFYQMEIDAYRKQFNAPPTVLASMGMGILAGAF GAMFGNPAEVALIRMMSDNRLPPAERRNYTGVLNAFVRIVKDEGVITLWKGCMPTVGRAM IVNMVQLASYSQLKAAFSEYFSGLSLHIAAAMMSGLLTTIASMPLDMAKTRIQQQKTAEY KGTMDVLMKVSKNEGIASLWKGFTPYLCRLGPHTVFAFIFLEQLTKAYKHIVLGDDSESN I* AC009216 120925-121173, 122098-122207, 122345-122834, 122899-123071 = AC012693 45% TO CE5 no ESTs 3 introns shown by \* >CG6492|FBan0006492|CT20203|FBan0006492 last_updated:000321 MAAKTDESSPAVASSTSSNPAPSSGRHQLRPVKFDYADSFACTYIVSVVAASIAELATYP LDLTKTRLQIQGEGAAHSAGKSN*MQYRGMVATAFGIAREEGALKLWQGVTPALYRHVVY \*SGVRICSYDLMRKEFTQNGTQALPVWKSALCGVTAGAVAQWLASPADIVKVQIQMEGRR RLMGEPPRVHSAGHAFRQIVQRGGIKGLWKGSIPNVQRAALVNLGDLTTYDTIKHLIMNR LQMPDCHTVHVLASVCAGFVAAIMGTPADVVKTRIMNQPTDENGR*GLLYRGSVDCLRQT VSKEGFVALYKGFLPCWIRMAPWSLTFWLSFEQIRKMIGASGY* AC019525 200827-200898, 200975-201035, 201914-202725 2 introns shown by \* = AC010700 ESTs AI512559, AA697689, AI259003, AI532563 >CG1326|FBan0001326|CT2942|FBan0001326 last_updated:000321 KEIVLGEGFQSLYRGLGPVLQSLCISNFVYFYTFHALKAVASGGSPSQHSALKDLLLGSI AGIINVLTTTPFWVVNTRLRMRNVAGTSDEVNKHYKNLLEGLKYVAEKEGIAGLWSGTIP SLMLVSNPALQFMMYEMLKRNIMRFTGGEMGSLSFFFIGAIAKAFATVLTYPLQLVQTKQ RHRSKESDSKPSTSAGSTPRTESTLELMISILQHQGIRGLFRGLEAKILQTVLTAALMFM AYEKIAGTVGMLLKRN* AC014955 = AC006492 4 introns shown by \* ESTs AA949318, AA142277, AI405082, AI114037, AA695733, AI402226, >CG7314|FBan0007314|CT22571|FBan0007314 last_updated:000321 MGEVKDWRPFVYGGVASITAEF*GTFPIDTTKTRLQIQGQKIDQSFSQLRYRGMTDAFVK ISREEGLRALYSG*IWPAVLRQATYGTIKFGTYYTLKKLANERGLLINEDGSERVWSNIL CAAAAGAISSAIANPTDVLKVRMQVHGKGQHKGLLGCFGEIYKYEGVRGLWRGVGPTAQR AVVIASVELPVYDFCKLQLMNAFGDHVGNHFI*SSFIASLGSAIASTPIDVIR*TRLMNQ RPVSITMNGVVTAAATPKLYSGSLDCAVQTIRNEGLPALYKGFIPTWVRMGPWNIIFFIT YEQLKKY* AC007817 = AC019578 4 introns shown by \* ESTs AA440846, AI542106 >CG5646|FBan0005646|CT17838|FBan0005646 last_updated:000321 MKWENYCDFVAGCFG*GACGVLVAHPLDTIKVWQQASNSSVVTAIQQIYSRNNG*VNGFY RGMFFPFISTGAINSLLFGIYGNHLRQLRKVCHSDYQREQLEYHNMFLAGSVAGFVQSFI ACPMELIKVRLQTAT*YYSDYLYGQRRTAFGTFKRILKTDGISGLYRGLLPM*IDVLPYG IYMLAYRQGVDYMDRRDFVRRRRSQSDGSSVNLLVTTLAGAWAGVISWVCVIPFDVVKTL MQADENHKYRGIFHCVRVQYRAYGWRSIFRGSWMLVARAVPFNAATFLGYEYALEWCQRW NGTVY* AC014256 21309-21745, 21866-22307 = AC008356, AC008234 one intron shown by \* no ESTs >CG16736|FBan0016736|CT32113|FBan0016736 last_updated:000321 MNNSDAGSSIEQFIKMPVYVGLLIKTTAQLLSHPMELVRVNMQANVIHHSRLSINHMFRL MARHGLPGFYYGIVAACLRCTVHTMSTYTLFYNLQDNKYVLMLQPYNTSMVLGITGFWGG VLATPFAKLAVIRQADLTRGSYERRK*NYRNFWRGLKCMYAKGGFTYLFTGWKINSISST AVAVLYTPISDKVHTVISWFHRLDEPWLSDLITMALTGSIITVIMTPVDALATLTLNESS HYGRTSYPYLYRKIIRKHGYKGFFFGWKPALMALIPHTVLATFVYRFLLDRYIT*