專利名稱:脫氧-d-木酮糖磷酸生物合成途徑的基因在改變類異戊二烯濃度中的應(yīng)用的制作方法
技術(shù)領(lǐng)域:
本發(fā)明涉及來自于細(xì)菌或寄生蟲的DNA序列(SEQ1,3,5,7)的應(yīng)用,該基因編碼gcpE或yfgB蛋白質(zhì),當(dāng)將該基因摻入到病毒,真核細(xì)胞和原核細(xì)胞的基因組時所述的DNA序列改變類異戊二烯的含量,并且涉及測量類異戊二烯合成中g(shù)cpE基因的活性的方法。此外本發(fā)明還涉及鑒定在植物中具有除草劑,抗寄生蟲,抗病毒,殺真菌的作用和在人和動物中具有抗真菌,抗寄生蟲,抗病毒的作用的物質(zhì)的方法。
已經(jīng)知道借助于經(jīng)典的乙酸/甲羥戊酸途徑和一種可替代的甲羥戊酸-依賴性生物合成途徑,脫氧-D-木酮糖磷酸途徑形成類異戊二烯的生物合成途徑(Rohmer,M.,Knani,M.,Simonin,p.,Sutter,B.和Sahm,H.(1993)生物化學(xué)雜志295517-524)。
在美國專利US5858367中描述了aarC-寡聚核苷酸在識別抗細(xì)菌物質(zhì)中的應(yīng)用。
令人驚奇的是,已經(jīng)發(fā)現(xiàn)gcpE蛋白質(zhì)在類異戊二烯生物合成的另一種代謝途徑中還具有激酶功能和催化類異戊二烯生物合成的糖或前體的磷酸化作用,特別是2-C-甲基-D-赤蘚糖醇,2-C-甲基-D-赤蘚糖醇磷酸的磷酸化,特別是2-C-甲基-D-赤蘚糖醇-4-磷酸的磷酸化,2-C-甲基-D-赤蘚糖,2-C-甲基-D-赤蘚糖磷酸的磷酸化,特別是2-C-甲基-D-赤蘚糖-4-磷酸,CH2(OH)-C(CH3)=C(OH)-CH2-O-PO(OH)2,
CH2(OH)-C(CH3)=C(OH)-CH2-OH,CH2(OH)-C(CH3)-CO-CH2-O-PO(OH)2,CH2(OH)-C(CH3)-CO-CH2OH,CH2=C(CH3)-CO-CH2-O-PO(OH)2,CH2=C(CH3)-CO-CH2-OH,CH2=C(CH3)-CH(OH)-CH2-O-PO(OH)2,CH2=C(CH3)-CH(OH)-CH2-OH,CH2(OH)-C(=CH2)-C(OH)-CH2-O-PO(OH)2,CH2(OH)-C(=CH2)-C(OH)-CH2-OH,CHO-CH(CH3)-CH(OH)-CH2-O-PO(OH)2,CHO-CH(CH3)-CH(OH)-CH2-OH,CH2(OH)-C(OH)(CH3)-CH=CH-O-PO(OH)2,CH2(OH)-C(OH)CH3-CH=CH-OH,CH(OH)=C(CH3)-CH(OH)-CH2-O-PO(OH)2,CH(OH)=C(CH3)-CH(OH)-CH2-OH,CH3-C(CH3)=CH-CH2-O-PO(OH)2,CH3-C(CH3)=CH-CH2-OH,CH2=C(CH3)-CH2-CH2-O-PO(OH)2,CH2=C(CH3)-CH2-CH2-OH的磷酸化。
因此本發(fā)明涉及來自于細(xì)菌或寄生蟲的DNA序列的應(yīng)用,所述DNA序列編碼gcpE或yfgB的蛋白質(zhì)[來自于細(xì)菌或來自寄生蟲的gcpE或yfgB]或所述的DNA序列編碼該蛋白質(zhì)的類似物或衍生物,其中該蛋白質(zhì)的一個或多個氨基酸被缺失,疊加或被其它氨基酸替代,并且基本上沒有降低該多肽的酶促活性。特別是本發(fā)明涉及SEQ1,3,5,7的DNA序列的應(yīng)用。列舉的SEQ 1和5序列以及蛋白質(zhì)2和6的起始來源是微生物大腸桿菌K12菌株。SEQ 3和7列舉的序列以及蛋白質(zhì)4和8的原始來源是微生物惡性瘧原蟲株3D7。
這樣的DNA序列描述于US5858367并且也可以通過下面的國際互聯(lián)網(wǎng)地址和登記號找到http//www3.ncbi.nlm.nih.gov/Entrez/protein.html AAD07695,AAD18517,AAC75568,AAC67648,AAC65433,P36979,CAA15530,CAA98356,CAA98355,AAC24056,AAC07467,P54482,P44667,P27434,P27433,BAA17717,BAA20919,BAA16402,S23058,AAB51469,1819264A,CAA45783,CAA45782,AAA21360,AAA21359,BAA02549,I39486,2113330A。
本發(fā)明的序列適用于在病毒、真核生物和原核生物中表達(dá)基因,他們負(fù)責(zé)1-脫氧-D-木酮糖途徑的類異戊二烯的合成。
根據(jù)本發(fā)明真核生物或真核生物細(xì)胞包括動物細(xì)胞,植物細(xì)胞,藻類,酵母,真菌,而原核生物或原核生物細(xì)胞包括細(xì)菌,古細(xì)菌和真細(xì)菌。
當(dāng)將DNA序列插入到定位了上面所述DNA序列的基因組時,上面所說的基因能夠在病毒、真核生物和原核生物中表達(dá)。以本領(lǐng)域內(nèi)技術(shù)人員已知的方式培養(yǎng)本發(fā)明轉(zhuǎn)化的病毒、真核生物和原核生物并且分離在這樣的培養(yǎng)期間形成的類異戊二烯,非強(qiáng)制性地進(jìn)行純化。不是所有的類異戊二烯都需要分離,在一些情況下,類異戊二烯可以直接釋放到室溫。
借助于下面的步驟可以進(jìn)行所使用的轉(zhuǎn)基因病毒、真核生物和原核生物的制備以便修飾類異戊二烯的含量a)生產(chǎn)具有下面亞序列的DNA序列i)在病毒、真核生物和原核生物中具有活性并且在預(yù)定的靶組織或靶細(xì)胞中確保形成RNA,ii)編碼具有來自于細(xì)菌或寄生蟲的gcpe或yfgB蛋白質(zhì)的氨基酸序列的多肽或者該多肽的類似物或者衍生物的DNA序列,iii)導(dǎo)致將聚-A殘基加到病毒、真核生物和原核生物的RNA的3’末端的未翻譯的序列,
b)利用或不利用載體(例如質(zhì)粒,病毒DNA)將該DNA序列轉(zhuǎn)移到和摻入到病毒細(xì)胞,原核生物細(xì)胞或真核生物細(xì)胞的基因組。
從這種方式轉(zhuǎn)化的植物細(xì)胞再生完整的整個植株。
給編碼gcpE或yfgB蛋白質(zhì)或者其類似物或者衍生物的序列提供確保在一定的器官或細(xì)胞中轉(zhuǎn)錄的啟動子,該啟動子以有義方向偶合(啟動子的3’末端到編碼序列的5’末端)到編碼待形成的蛋白質(zhì)的序列。將決定mRNA合成的終止的終止信號結(jié)合到編碼序列的3’末端。以便指導(dǎo)將表達(dá)的蛋白質(zhì)到一定的亞細(xì)胞室,例如葉綠體,淀粉體,線粒體,液泡,胞液或者細(xì)胞間隔,將編碼所謂的信號序列或轉(zhuǎn)移肽的其他的序列插入到啟動子和該編碼序列之間。該序列必須是在與該蛋白質(zhì)的編碼序列相同的閱讀框架中。為了將本發(fā)明的DNA序列導(dǎo)入到高等植物中需要獲得大量的克隆載體,所述的載體含有大腸桿菌中的復(fù)制信號和用以選擇轉(zhuǎn)化細(xì)胞的標(biāo)記物。載體的例子是pBR322,pUC-系列,M13mp-系列,pACYC184,EMBL3等等。根據(jù)將需要的基因?qū)氲街参镏械姆椒ǎ枰渌腄NA序列。例如如果將Ti或Ri質(zhì)粒用于轉(zhuǎn)化植物細(xì)胞,必須插入Ti和Ri質(zhì)粒T-DNA的至少一個右邊界,但是通常是右邊界和左邊界,作為待導(dǎo)入基因的側(cè)面區(qū)域。在歐洲專利120516;Hoekama在“雙性植物載體系統(tǒng)”,Offset-drukkerij KantersB.V.Alblasserdam(1985),第5章;Fraley等人,植物科學(xué)的關(guān)鍵和綜述4,1-46和An等人(1985)EMBO J.4,277-287廣泛地調(diào)查和描述的T-DNA在轉(zhuǎn)化植物細(xì)胞中的應(yīng)用。一旦將導(dǎo)入的DNA摻入到基因組,通常是穩(wěn)定的并且還保留在原始轉(zhuǎn)化的細(xì)胞的子代中。通常它含有選擇性標(biāo)記,該標(biāo)記授予轉(zhuǎn)化植物細(xì)胞對殺生劑或抗生素,例如卡那霉素,G418,bleomycin,潮霉素或磷蘇菌素和其他的抗性。所使用的特定的標(biāo)記物允許選擇出失去插入的DNA的轉(zhuǎn)化細(xì)胞。
許多技術(shù)可用于將DNA導(dǎo)入到植物。這些技術(shù)包括用農(nóng)桿菌的輔助轉(zhuǎn)化,原生質(zhì)體的融合,DNA的微注射,電擊穿,以及微彈轟擊方法和病毒注射。然后將轉(zhuǎn)化植物材料在合適的培養(yǎng)基上再生為完整植株,所述的培養(yǎng)基含有用于選擇目的的抗生素或殺生劑。對于用于注射和電擊穿的質(zhì)粒沒有特別的需求。但是,如果完整植株從這樣的轉(zhuǎn)化細(xì)胞再生,必須存在選擇性標(biāo)記物。以常規(guī)的方式在植物中生長轉(zhuǎn)化細(xì)胞(McCormick等人(1986),植物細(xì)胞報告5,81-84)。通??梢耘囵B(yǎng)該植物并且與具有相同的轉(zhuǎn)化基因組或其他基因組的植物進(jìn)行雜交。獲得的個體具有相應(yīng)的表現(xiàn)型特性。
本發(fā)明還提供了表達(dá)載體,它含有一個或多個本發(fā)明的DNA序列??梢酝ㄟ^給本發(fā)明的DNA提供合適的功能調(diào)節(jié)信號獲得這樣的表達(dá)載體。這樣的復(fù)制信號是負(fù)責(zé)表達(dá)的DNA序列,例如啟動子,加強(qiáng)子,核糖體結(jié)合位點,并且由宿主生物體識別。
非強(qiáng)制性地其他的調(diào)節(jié)信號例如控制重組DNA在宿主生物體中復(fù)制或重組的,也可以是表達(dá)載體的組成部份。
用于表達(dá)本發(fā)明的酶的合適的宿主細(xì)胞和生物體是那些不包括具有DOXP合成酶功能的固有酶,DOXP還原異構(gòu)酶或gcpE激酶的微生物。這樣的例子是古細(xì)菌,動物,真菌,絲狀毛霉和一些真細(xì)菌。這樣的固有酶活性的缺失本質(zhì)上促進(jìn)了重組酶的檢測和純化。因此,也可能是利用各種化學(xué)品和藥物第一次直接從宿主細(xì)胞的粗提物中檢測到本發(fā)明的重組酶的活性和特別是活性的抑制。
如果想要獲得多肽鏈的翻譯后修飾和天然折疊本發(fā)明的酶優(yōu)選地在真核細(xì)胞中表達(dá)。但是根據(jù)表達(dá)系統(tǒng),當(dāng)表達(dá)通過DNA拼接去除內(nèi)含子的基因組DNA序列時,確保產(chǎn)生的酶的多肽序列對寄生蟲具抗性。利用重組DNA技術(shù),將編碼內(nèi)含子的序列去除或為了試驗的目的將該序列插入到待表達(dá)的DNA序列。
利用本領(lǐng)域內(nèi)技術(shù)人員已知的方法從宿主細(xì)胞或宿主細(xì)胞的上清液可以分離蛋白質(zhì)。也可以需要酶的體外重新活化。
為了有助于純化,以具有不同肽鏈的融合蛋白的形式表達(dá)本發(fā)明的酶活該酶的亞序列。寡聚一組氨酸序列和來源于谷胱甘肽S-轉(zhuǎn)移酶,硫氧還蛋白或鈣調(diào)素結(jié)合肽的序列特別適用于該目的。
進(jìn)一步以具有本領(lǐng)域內(nèi)技術(shù)人員已知的這樣的肽鏈的融合蛋白的形式表達(dá)本發(fā)明的酶或該酶的亞序列,該重組酶被運(yùn)輸?shù)郊?xì)胞外或運(yùn)輸?shù)剿拗骷?xì)胞的腔室。因此有助于該酶的思維活性的純化和調(diào)查。
當(dāng)表達(dá)本發(fā)明的酶時,證明修飾個體密碼子是方便的。如果在寄生蟲中使用的密碼子不同于在異源表達(dá)系統(tǒng)中的密碼子的使用,該編碼序列中堿基的有目的替代是可取的,以便確保蛋白質(zhì)的最佳合成。此外非翻譯5’和3’區(qū)域的分別缺失經(jīng)常可調(diào)節(jié),例如如果存在DNA的3’序列的幾個脫穩(wěn)定的序列基序ATTTA。然后在真核生物中優(yōu)選表達(dá)中應(yīng)該缺失這些。堿基的缺失,疊加或替代時這類變異并且是本發(fā)明的主題。
進(jìn)一步可以在標(biāo)準(zhǔn)條件下,采用本領(lǐng)域內(nèi)技術(shù)人員已知的方法進(jìn)行體外翻譯獲得本發(fā)明的酶。適用于本發(fā)明的目的的系統(tǒng)是兔子網(wǎng)織紅血球和小麥胚提取物和細(xì)菌裂解液。也可以將體外轉(zhuǎn)錄的mRNA翻譯到Xenopus卵子。
其序列來源于本發(fā)明的酶的肽序列的寡聚-和多肽的序列可以通過化學(xué)合成方法獲得。如果合適選擇該序列,這樣的肽具有本發(fā)明特征的特性。這樣的肽可以大量地產(chǎn)生并且特別適合于酶活性動力學(xué),酶活性的調(diào)節(jié),酶的三維空間結(jié)構(gòu),各種化學(xué)劑和藥物對酶活性的抑制,各種配體的結(jié)合幾何形狀和結(jié)合親和性的調(diào)查。
本發(fā)明也提供了用于測定gcpE蛋白質(zhì)的酶促活性的方法。利用已知的方法可以測定所述的活性。通過測定糖或磷糖類或類異戊二烯生物合成的前體的磷酸化,特別是2-C-甲基-D-赤蘚糖醇,2-C-甲基-D-赤蘚糖醇磷酸,特別是2-C-甲基-D-赤蘚糖醇4-磷酸的磷酸化,2-C-甲基-D-赤蘚糖,2-C-甲基-D-赤蘚糖磷酸的磷酸化,特別是2-C-甲基-D-赤蘚糖4-磷酸,CH2(OH)-C(CH3)=C(OH)-CH2-O-PO(OH)2,CH2(OH)-C(CH3)=C(OH)-CH2-OH,CH2(OH)-C(CH3)-CO-CH2-O-PO(OH)2,CH2(OH)-C(CH3)-CO-CH2OH,CH2=C(CH3)-CO-CH2-O-PO(OH)2,CH2=C(CH3)-CO-CH2-OH,CH2=C(CH3)-CH(OH)-CH2-O-PO(OH)2,CH2=C(CH3)-CH(OH)-CH2-OH,CH2(OH)-C(=CH2)-C(OH)-CH2-O-PO(OH)2,CH2(OH)-C(=CH2)-C(OH)-CH2-OH,CHO-CH(CH3)-CH(OH)-CH2-O-PO(OH)2,CHO-CH(CH3)-CH(OH)-CH2-OH,CH2(OH)-C(OH)(CH3)-CH=CH-O-PO(OH)2,CH2(OH)-C(OH)CH3-CH=CH-OH,CH(OH)=C(CH3)-CH(OH)-CH2-O-PO(OH)2,CH(OH)=C(CH3)-CH(OH)-CH2-OH,CH3-C(CH3)=CH-CH2-O-PO(OH)2,CH3-C(CH3)=CH-CH2-OH,CH2=C(CH3)-CH2-CH2-O-PO(OH)2,CH2=C(CH3)-CH2-CH2-OH的磷酸化進(jìn)行測定。本發(fā)明還提供了將該測定方法用于識別抑制特定的酶的活性的物質(zhì)。
已經(jīng)發(fā)現(xiàn)脫氧-1-木酮糖磷酸代謝途徑也存在于許多寄生蟲,病毒和真菌。
因此本發(fā)明還涉及用于篩選抑制脫氧-D-木酮糖磷酸代謝途徑的化合物的方法。根據(jù)本發(fā)明,提供了含有重組表達(dá)載體的宿主生物體,其中載體包括至少SEQ ID NO1,SEQ ID NO3或SEQID NO5的寡聚核苷酸序列的一部分或其變異體或其同系物,和提供了被懷疑對人和動物具有抗微生物,抗寄生蟲,抗細(xì)菌,抗病毒和抗真菌的作用或?qū)χ参锞哂锌刮⑸铮共《?,殺菌劑,除草劑或殺真菌劑的作用的化合物。然后將宿主生物體與該化合物和待測定的化合物接觸。
序列表<110>哈?!ぶ祚R<120>脫氧-D-木酮糖磷酸生物合成途徑的基因在改變異戊二烯類的濃度中的應(yīng)用<130>15904<140><141><150>19923567.8<151>1999-05-21<150>19923568.6<151>1999-05-21<160>8<170>PatentIn Ver.2.1<210>1<211>1119<212>DNA<213>大腸桿菌<220><221>CDS<222>(1)..(1119)<400>1atg cat aac cag gct cca att caa cgt aga aaa tca aca cgt att tac 48Met His Asn Gln Ala Pro Ile Gln Arg Arg Lys Ser Thr Arg Ile Tyr1 5 10 15gtt ggg aat gtg ccg att ggc gat ggt gct ccc atc gcc gta cag tcc 96Val Gly Asn Val Pro Ile Gly Asp Gly Ala Pro Ile Ala Val Gln Ser20 25 30atg acc aat acg cgt acg aca gac gtc gaa gca acg gtc aat caa atc 144Met Thr Asn Thr Arg Thr Thr Asp Val Glu Ala Thr Val Asn Gln Ile35 40 45aag gcg ctg gaa cgc gtt ggc gct gat atc gtc cgt gta tcc gta ccg 192Lys Ala Leu Glu Arg Val Gly Ala Asp Ile Val Arg Val Ser Val Pro50 55 60acg atg gac gcg gca gaa gcg ttc aaa ctc atc aaa cag cag gtt aac 240Thr Met Asp Ala Ala Glu Ala Phe Lys Leu Ile Lys Gln Gln Val Asn65 70 75 80gtg ccg ctg gtg gct gac atc cac ttc gac tat cgc att gcg ctg aaa288Val Pro Leu Val Ala Asp Ile His Phe Asp Tyr Arg Ile Ala Leu Lys85 90 95gta gcg gaa tac ggc gtc gat tgt ctg cgt att aac cct ggc aat atc 336Val Ala Glu Tyr Gly Val Asp Cys Leu Arg Ile Asn Pro Gly Asn Ile100 105 110ggt aat gaa gag cgt att cgc atg gtg gtt gac tgt gcg cgc gat aaa 384Gly Asn Glu Glu Arg Ile Arg Met Val Val Asp Cys Ala Arg Asp Lys115 120 125aac att ccg atc cgt att ggc gtt aac gcc gga tcg ctg gaa aaa gat 432Asn Ile Pro Ile Arg Ile Gly Val Asn Ala Gly Ser Leu Glu Lys Asp130 135 140ctg caa gaa aag tat ggc gaa ccg acg ccg cag gcg ttg ctg gaa tct 480Leu Gln Glu Lys Tyr Gly Glu Pro Thr Pro Gln Ala Leu Leu Glu Ser145 150 155 160gcc atg cgt cat gtt gat cat ctc gat cgc ctg aac ttc gat cag ttc 528Ala Met Arg His Val Asp His Leu Asp Arg Leu Asn Phe Asp Gln Phe165 170 175aaa gtc agc gtg aaa gcg tct gac gtc ttc ctc gct gtt gag tct tat 576Lys Val Ser Val Lys Ala Ser Asp Val Phe Leu Ala Val Glu Ser Tyr180 185 190cgt ttg ctg gca aaa cag atc gat cag ccg ttg cat ctg ggg atc acc 624Arg Leu Leu Ala Lys Gln Ile Asp Gln Pro Leu His Leu Gly Ile Thr195 200 205gaa gcc ggt ggt gcg cgc agc ggg gca gta aaa tcc gcc att ggt tta 672Glu Ala Gly Gly Ala Arg Ser Gly Ala Val Lys Ser Ala Ile Gly Leu210 215 220ggt ctg ctg ctg tct gaa ggc atc ggc gac acg ctg cgc gta tcg ctg 720Gly Leu Leu Leu Ser Glu Gly Ile Gly Asp Thr Leu Arg Val Ser Leu225 230 235 240gcg gcc gat ccg gtc gaa gag atc aaa gtc ggt ttc gat att ttg aaa 768Ala Ala Asp Pro Val Glu Glu Ile Lys Val Gly Phe Asp Ile Leu Lys245 250 255tcg ctg cgt atc cgt tcg cga ggg atc aac ttc atc gcc tgc ccg acc 816Ser Leu Arg Ile Arg Ser Arg Gly Ile Asn Phe Ile Ala Cys Pro Thr260 265 270tgt tcg cgt cag gaa ttt gat gtt atc ggt acg gtt aac gcg ctg gag 864Cys Ser Arg Gln Glu Phe Asp Val Ile Gly Thr Val Asn Ala Leu Glu275 280 285caa cgc ctg gaa gat atc atc act ccg atg gac gtt tcg att atc ggc 912Gln Arg Leu Glu Asp Ile Ile Thr Pro Met Asp Val Ser Ile Ile Gly290 295 300tgc gtg gtg aat ggc cca ggt gag gcg ctg gtt tct aca ctc ggc gtc 960Cys Val Val Asn Gly Pro Gly Glu Ala Leu Val Ser Thr Leu Gly Val305 310 315 320acc ggc ggc aac aag aaa agc ggc ctc tat gaa gat ggc gtg cgc aaa 1008Thr Gly Gly Asn Lys Lys Ser Gly Leu Tyr Glu Asp Gly Val Arg Lys325 330 335gac cgt ctg gac aac aac gat atg atc gac cag ctg gaa gca cgc att 1056Asp Arg Leu Asp Asn Asn Asp Met Ile Asp Gln Leu Glu Ala Arg Ile340 345 350cgt gcg aaa gcc agt cag ctg gac gaa gcg cgt cga att gac gtt cag 1104Arg Ala Lys Ala Ser Gln Leu Asp Glu Ala Arg Arg Ile Asp Val Gln355 360 365cag gtt gaa aaa taa 1119Gln Val Glu Lys370<210>2<211>372<212>PRT<213>大腸桿菌<400>2Met His Asn Gln Ala Pro Ile Gln Arg Arg Lys Ser Thr Arg Ile Tyr1 5 10 15Val Gly Asn Val Pro Ile Gly Asp Gly Ala Pro Ile Ala Val Gln Ser20 25 30Met Thr Asn Thr Arg Thr Thr Asp Val Glu Ala Thr Val Asn Gln Ile35 40 45Lys Ala Leu Glu Arg Val Gly Ala Asp Ile Val Arg Val Ser Val Pro50 55 60Thr Met Asp Ala Ala Glu Ala Phe Lys Leu Ile Lys Gln Gln Val Asn65 70 75 80Val Pro Leu Val Ala Asp Ile His Phe Asp Tyr Arg Ile Ala Leu Lys85 90 95Val Ala Glu Tyr Gly Val Asp Cys Leu Arg Ile Asn Pro Gly Asn Ile100 105 110Gly Asn Glu Glu Arg Ile Arg Met Val Val Asp Cys Ala Arg Asp Lys115 120 125Asn Ile Pro Ile Arg Ile Gly Val Asn Ala Gly Ser Leu Glu Lys Asp130 135 140Leu Gln Glu Lys Tyr Gly Glu Pro Thr Pro Gln Ala Leu Leu Glu Ser145 150 155 160Ala Met Arg His Val Asp His Leu Asp Arg Leu Asn Phe Asp Gln Phe165 170 175Lys Val Ser Val Lys Ala Ser Asp Val Phe Leu Ala Val Glu Ser Tyr180 185 190Arg Leu Leu Ala Lys Gln Ile Asp Gln Pro Leu His Leu Gly Ile Thr195 200 205Glu Ala Gly Gly Ala Arg Ser Gly Ala Val Lys Ser Ala Ile Gly Leu210 215 220Gly Leu Leu Leu Ser Glu Gly Ile Gly Asp Thr Leu Arg Val Ser Leu225 230 235 240Ala Ala Asp Pro Val Glu Glu Ile Lys Val Gly Phe Asp Ile Leu Lys245 250 255Ser Leu Arg Ile Arg Ser Arg Gly Ile Asn Phe Ile Ala Cys Pro Thr260 265 270Cys Ser Arg Gln Glu Phe Asp Val Ile Gly Thr Val Asn Ala Leu Glu275 280 285Gln Arg Leu Glu Asp Ile Ile Thr Pro Met Asp Val Ser Ile Ile Gly290 295 300Cys Val Val Asn Gly Pro Gly Glu Ala Leu Val Ser Thr Leu Gly Val305 310 315 320Thr Gly Gly Asn Lys Lys Ser Gly Leu Tyr Glu Asp Gly Val Arg Lys325 330 335Asp Arg Leu Asp Asn Asn Asp Met Ile Asp Gln Leu Glu Ala Arg Ile340 345 350Arg Ala Lys Ala Ser Gln Leu Asp Glu Ala Arg Arg Ile Asp Val Gln355 360 365Gln Val Glu Lys370<210>3<211>2109<212>DNA<213>Plasmodium falciparum<220><221>CDS<222>(70)..(2109)<400>3cagcctataa atattattat ttattattat tttttttttt ttttttcata atgcctgaat 60aaccacaaa atg agt tat ata aaa aga ctg att ctt ttt atg tta ctg ttt 111Met Ser Tyr Ile Lys Arg Leu Ile Leu Phe Met Leu Leu Phe1 5 10tat tct cat gta aaa att aaa aaa tta ttt att aaa att tct aat gta 159Tyr Ser His Val Lys Ile Lys Lys Leu Phe Ile Lys Ile Ser Asn Val15 20 25 30aac ata ttt ttt gca gaa gca aag aaa aat gga aaa aag gaa ttc ttt 207Asn Ile Phe Phe Ala Glu Ala Lys Lys Asn Gly Lys Lys Glu Phe Phe35 40 45ctt ttt tta cta aat ata aaa aaa aat agc caa cag aaa aaa act tat 255Leu Phe Leu Leu Asn Ile Lys Lys Asn Ser Gln Gln Lys Lys Thr Tyr50 55 60cat att acc aaa agg aat acc ata aat aaa agt gat ttt tta tat tct 303His Ile Thr Lys Arg Asn Thr Ile Asn Lys Ser Asp Phe Leu Tyr Ser65 70 75tta cta aat gaa gaa ggg aat tct tca aaa aag gaa tat aaa aat tta 351Leu Leu Asn Glu Glu Gly Asn Ser Ser Lys Lys Glu Tyr Lys Asn Leu80 85 90aaa gat gaa gaa aaa tat aat atc ata caa aat ata aaa aaa tat tgt 399Lys Asp Glu Glu Lys Tyr Asn Ile Ile Gln Asn Ile Lys Lys Tyr Cys95 100 105 110gaa tgt act aaa aaa tat aaa agg ctc cca aca cga gaa gta gtt att 447Glu Cys Thr Lys Lys Tyr Lys Arg Leu Pro Thr Arg Glu Val Val Ile115 120 125gga aat gtt aaa att gga gga aat aat aaa ata gct att caa act atg 495Gly Asn Val Lys Ile Gly Gly Asn Asn Lys Ile Ala Ile Gln Thr Met130 135 140gct agc tgt gat aca aga aat gta gaa gaa tgt gta tat caa att aga 543Ala Ser Cys Asp Thr Arg Asn Val Glu Glu Cys Val Tyr Gln Ile Arg145 150 155aaa tgt aaa gat ttg ggt gct gac att gta agg ttg act gtt caa gga 591Lys Cys Lys Asp Leu Gly Ala Asp Ile Val Arg Leu Thr Val Gln Gly160 165 170gtt caa gaa gca caa gct agt tat cat att aaa gaa aaa tta tta tct 639Val Gln Glu Ala Gln Ala Ser Tyr His Ile Lys Glu Lys Leu Leu Ser175 180 185 190gaa aat gta aat atc cca tta gta aca gat att cat ttt aat cct aaa 687Glu Asn Val Asn Ile Pro Leu Val Thr Asp Ile His Phe Asn Pro Lys195 200 205ata gct tta atg gca gct gat gtg ttt gaa aaa att cga gtg aat cca 735Ile Ala Leu Met Ala Ala Asp Val Phe Glu Lys Ile Arg Val Asn Pro210 215 220gga aat tat gtt gat gga aga aaa aaa tgg ata gat aaa gtt tat aaa 783Gly Asn Tyr Val Asp Gly Arg Lys Lys Trp Ile Asp Lys Val Tyr Lys225 230 235act aaa gaa gaa ttt gat gaa ggg aaa tta ttt ata aaa gaa aaa ttt 831Thr Lys Glu Glu Phe Asp Glu Gly Lys Leu Phe Ile Lys Glu Lys Phe240 245 250gta cca tta att gaa aaa tgt aaa aga tta aat aga gca ata aga att 879Val Pro Leu Ile Glu Lys Cys Lys Arg Leu Asn Arg Ala Ile Arg Ile255 260 265 270ggt aca aat cat gga ttc ctt tca tct cga gta tta tca tat tat gga 927Gly Thr Asn His Gly Phe Leu Ser Ser Arg Val Leu Ser Tyr Tyr Gly275 280 285gat aca cea tta gca tta gta gaa agt gct atg aga ttt tct gat tta 975Asp Thr Pro Leu Ala Leu Val Glu Ser Ala Met Arg Phe Ser Asp Leu290 295 300tgt aat gaa aac aat ttt aac aat ctt gtt ttt tct atg aaa gct tct 1023Cys Asn Glu Asn Asn Phe Asn Asn Leu Val Phe Ser Met Lys Ala Ser305 310 315aat gct tat gtt atg ata caa tct tat aga tta tta gta tct aaa caa 1071Asn Ala Tyr Val Met Ile Gln Ser Tyr Arg Leu Leu Val Ser Lys Gln320 325 330tat gaa aga aat atg atg ttc cct ata cat tta gga gtt aca gaa gca 1119Tyr Glu Arg Asn Met Met Phe Pro Ile His Leu Gly Val Thr Glu Ala335 340 345 350gga ttt ggt gat aat gga aga ata aaa tct tat tta ggt ata gga tct 1167Gly Phe Gly Asp Asn Gly Arg Ile Lys Ser Tyr Leu Gly Ile Gly Ser355 360 365tta tta tat gat ggt ata gga gat acc att cgt ata tcc tta aca gaa 1215Leu Leu Tyr Asp Gly Ile Gly Asp Thr Ile Arg Ile Ser Leu Thr Glu370 375 380gat cct tgg gaa gag tta act cct tgt aaa aaa tta gtt gaa aat tta 1263Asp Pro Trp Glu Glu Leu Thr Pro Cys Lys Lys Leu Val Glu Asn Leu385 390 395aag aaa aga ata ttt tat aat gaa aat ttt aaa gaa gat aat gaa tta 1311Lys Lys Arg Ile Phe Tyr Asn Glu Asn Phe Lys Glu Asp Asn Glu Leu400 405 410aaa aat aat gaa atg gat acc aaa aat cta tta aat ttt gaa gaa aat 1359Lys Asn Asn Glu Met Asp Thr Lys Asn Leu Leu Asn Phe Glu Glu Asn415 420 425 430tat cga aat ttt aat aat ata aaa aaa aga aat gta gaa aaa aat aat 1407Tyr Arg Asn Phe Asn Asn Ile Lys Lys Arg Asn Val Glu Lys Asn Asn435 440 445aat gta tta cat gaa gag tgc act ata ggt aat gta gta acc ata aaa 1455Asn Val Leu His Glu Glu Cys Thr Ile Gly Asn Val Val Thr Ile Lys450 455 460gag tta gaa gat tct ctg caa att ttt aaa gat tta aat tta gaa gta 1503Glu Leu Glu Asp Ser Leu Gln Ile Phe Lys Asp Leu Asn Leu Glu Val465 470 475gat tca aat gga aat ttg aaa aag gga gcc aaa aca act gat atg gtt 1551Asp Ser Asn Gly Asn Leu Lys Lys Gly Ala Lys Thr Thr Asp Met Val480 485 490att ata aat gat ttt cat aat ata aca aat tta gga aaa aaa act gtg 1599Ile Ile Asn Asp Phe His Asn Ile Thr Asn Leu Gly Lys Lys Thr Val495 500 505 510gat aaa tta atg caa gtg gga att aat ata gta gtt caa tat gaa cca 1647Asp Lys Leu Met Gln Val Gly Ile Asn Ile Val Val Gln Tyr Glu Pro515 520 525cat aat ata gaa ttt ata gaa aaa atg gaa cca aat aat gat aat aat 1695His Asn Ile Glu Phe Ile Glu Lys Met Glu Pro Asn Asn Asp Asn Asn530 535 540aat aat aat aat aat aat aat ata tta ttt tat gtg gat ata aaa aat 1743Asn Asn Asn Asn Asn Asn Asn Ile Leu Phe Tyr Val Asp Ile Lys Asn545 550 555att atg aac agt tca gaa aaa aat att aaa tta agt aat tct aaa gga 1791Ile Met Asn Ser Ser Glu Lys Asn Ile Lys Leu Ser Asn Ser Lys Gly560 565 570tat gga tta att tta aac gga aaa gaa gat ata caa acc ata aaa aaa 1839Tyr Gly Leu Ile Leu Asn Gly Lys Glu Asp Ile Gln Thr Ile Lys Lys575 580 585 590ata aaa gaa tta aat cgt cgt cct tta ttc att cta tta aaa tca gat 1887Ile Lys Glu Leu Asn Arg Arg Pro Leu Phe Ile Leu Leu Lys Ser Asp595 600 605aac ata tat gaa cat gta tta ata acc aga aga att aat gaa ctt tta 1935Asn Ile Tyr Glu His Val Leu Ile Thr Arg Arg Ile Asn Glu Leu Leu610 615 620caa tcc tta aat ata aat ata cct tat ata cat tat gtt gat att aat 1983Gln Ser Leu Asn Ile Asn Ile Pro Tyr Ile His Tyr Val Asp Ile Asn625 630 635tca aat aat tat gat gat ata tta gtt aat tca aca tta tat gca gga 2031Ser Asn Asn Tyr Asp Asp Ile Leu Val Asn Ser Thr Leu Tyr Ala Gly640 645 650agt tgt ttg atg gat tta atg ggg gat ggt ctt att gtt aac gta act 2079Ser Cys Leu Met Asp Leu Met Gly Asp Gly Leu Ile Val Asn Val Thr655 660 665 670aat gat gtt ctt aca aat aaa aaa ggg tag 2109Asn Asp Val Leu Thr Asn Lys Lys Gly675 680<210>4<211>679<212>PRT<213>Plasmodium falciparum<400>4Met Ser Tyr Ile Lys Arg Leu Ile Leu Phe Met Leu Leu Phe Tyr Ser1 5 10 15His Val Lys Ile Lys Lys Leu Phe Ile Lys Ile Ser Asn Val Asn Ile20 25 30Phe Phe Ala Glu Ala Lys Lys Asn Gly Lys Lys Glu Phe Phe Leu Phe35 40 45Leu Leu Asn Ile Lys Lys Asn Ser Gln Gln Lys Lys Thr Tyr His Ile50 55 60Thr Lys Arg Asn Thr Ile Asn Lys Ser Asp Phe Leu Tyr Ser Leu Leu65 70 75 80Asn Glu Glu Gly Asn Ser Ser Lys Lys Glu Tyr Lys Asn Leu Lys Asp85 90 95Glu Glu Lys Tyr Asn Ile Ile Gln Asn Ile Lys Lys Tyr Cys Glu Cys100 105 110Thr Lys Lys Tyr Lys Arg Leu Pro Thr Arg Glu Val Val Ile Gly Asn115 120 125Val Lys Ile Gly Gly Asn Asn Lys Ile Ala Ile Gln Thr Met Ala Ser130 135 140Cys Asp Thr Arg Asn Val Glu Glu Cys Val Tyr Gln Ile Arg Lys Cys145 150 155 160Lys Asp Leu Gly Ala Asp Ile Val Arg Leu Thr Val Gln Gly Val Gln165 170 175Glu Ala Gln Ala Ser Tyr His Ile Lys Glu Lys Leu Leu Ser Glu Asn180 185 190Val Asn Ile Pro Leu Val Thr Asp Ile His Phe Asn Pro Lys Ile Ala
195 200 205Leu Met Ala Ala Asp Val Phe Glu Lys Ile Arg Val Asn Pro Gly Asn210 215 220Tyr Val Asp Gly Arg Lys Lys Trp Ile Asp Lys Val Tyr Lys Thr Lys225 230 235 240Glu Glu Phe Asp Glu Gly Lys Leu Phe Ile Lys Glu Lys Phe Val Pro245 250 255Leu Ile Glu Lys Cys Lys Arg Leu Asn Arg Ala Ile Arg Ile Gly Thr260 265 270Asn His Gly Phe Leu Ser Ser Arg Val Leu Ser Tyr Tyr Gly Asp Thr275 280 285Pro Leu Ala Leu Val Glu Ser Ala Met Arg Phe Ser Asp Leu Cys Asn290 295 300Glu Asn Asn Phe Asn Asn Leu Val Phe Ser Met Lys Ala Ser Asn Ala305 310 315 320Tyr Val Met Ile Gln Ser Tyr Arg Leu Leu Val Ser Lys Gln Tyr Glu325 330 335Arg Asn Met Met Phe Pro Ile His Leu Gly Val Thr Glu Ala Gly Phe340 345 350Gly Asp Asn Gly Arg Ile Lys Ser Tyr Leu Gly Ile Gly Ser Leu Leu355 360 365Tyr Asp Gly Ile Gly Asp Thr Ile Arg Ile Ser Leu Thr Glu Asp Pro370 375 380Trp Glu Glu Leu Thr Pro Cys Lys Lys Leu Val Glu Asn Leu Lys Lys385 390 395 400Arg Ile Phe Tyr Asn Glu Asn Phe Lys Glu Asp Asn Glu Leu Lys Asn405 410 415Asn Glu Met Asp Thr Lys Asn Leu Leu Asn Phe Glu Glu Asn Tyr Arg420 425 430Asn Phe Asn Asn Ile Lys Lys Arg Asn Val Glu Lys Asn Asn Asn Val435 440 445Leu His Glu Glu Cys Thr Ile Gly Asn Val Val Thr Ile Lys Glu Leu450 455 460Glu Asp Ser Leu Gln Ile Phe Lys Asp Leu Asn Leu Glu Val Asp Ser465 470 475 480Asn Gly Asn Leu Lys Lys Gly Ala Lys Thr Thr Asp Met Val Ile Ile485 490 495Asn Asp Phe His Asn Ile Thr Asn Leu Gly Lys Lys Thr Val Asp Lys500 505 510Leu Met Gln Val Gly Ile Asn Ile Val Val Gln Tyr Glu Pro His Asn515 520 525Ile Glu Phe Ile Glu Lys Met Glu Pro Asn Asn Asp Asn Asn Asn Asn530 535 540Asn Asn Asn Asn Asn Ile Leu Phe Tyr Val Asp Ile Lys Asn Ile Met545 550 555 560Asn Ser Ser Glu Lys Asn Ile Lys Leu Ser Asn Ser Lys Gly Tyr Gly565 570 575Leu Ile Leu Asn Gly Lys Glu Asp Ile Gln Thr Ile Lys Lys Ile Lys580 585 590Glu Leu Asn Arg Arg Pro Leu Phe Ile Leu Leu Lys Ser Asp Asn Ile595 600 605Tyr Glu His Val Leu Ile Thr Arg Arg Ile Asn Glu Leu Leu Gln Ser610 615 620Leu Asn Ile Asn Ile Pro Tyr Ile His Tyr Val Asp Ile Asn Ser Asn625 630 635 640Asn Tyr Asp Asp Ile Leu Val Asn Ser Thr Leu Tyr Ala Gly Ser Cys645 650 655Leu Met Asp Leu Met Gly Asp Gly Leu Ile Val Asn Val Thr Asn Asp660 665 670Val Leu Thr Asn Lys Lys Gly675<210>5<211>1200<212>DNA<213>大腸桿菌<400>5aatggctatc agaccgcttt aatgtcgatg gcttcaccct gcatccgttt acgcagggta 60cgtttcgtac ggtcgataac atcgcccgcc aactgaccac aggcagcatc gatatcatca 120ccacgagttt tacgcacaat agtggtgaaa ccgtagctca tcagcacttt tgagaaacgg 180tcgatacggc tgttcgagct gcgtccatac ggcgcacccg ggaacgggtt ccacgggatc 240aggttgatct tacacggcgt atctttcagc agttccgcca gttggtgcgc gtgttcagtg 300ccgtcgttaa cgtggtcaag catcacgtat tcaatagtga ctcggccctg attggcgttg 360gatttctcca gataacggcg caccgcagca aggaacgttt cgatattgta ctttttgttg 420atcggcacaa tttcgtcacg aatttcgtcg ttcggcgcgt gcagggaaat tgccagtgca 480acgtcgatca tatcgcccag tttatccagc gccggaacta caccggaagt ggaaagcgtg 540acgcgacgtt tagacaggcc aaaaccgaaa tcatcaagca tgatttccat cgccggaacg 600acgttgttca ggttgagcag cggctcgccc atgcccatca tcactacgtt agtgatcgga 660cgctgaccgg tgacttttgc tgcgccgacg attttcgccg cacgccacac ctggccgata 720atttccgaca cccgcaggtt gcggttaaag ccctgctggg cggtggaaca gaatttacac 780tccagcgcac accccacctg cgaagagacg cagagcgtgg cacggtcgtc ttccgggata 840tacaccgttt cgacgcgctg atcgccaacg gcgatcgccc atttaatggt gccgtcagat 900gaacgctgtt cttcaaccac ttccggtgcg cggatttccg ccacctcttt cagtttgccg 960cgcaacactt tgttgatgtc ggtcatctca tcaaagttgt cgcagcaata gtgatacatc 1020cacttcatca cctgatcggc gcggaagggt ttttcaccta aatctttaaa aaactcccgc 1080atctgctgac ggttgagatc cagcaggttg atttttccat ctttcgtggt gacgttttca 1140ggtgtgacta attgttcaga catatgctat tccggcctcg ttattacacg ttatggcccc 1200<210>6<211>384<212>PRT<213>大腸桿菌<400>6Met Ser Glu Gln Leu Val Thr Pro Glu Asn Val Thr Thr Lys Asp Gly1 5 10 15Lys Ile Asn Leu Leu Asp Leu Asn Arg Gln Gln Met Arg Glu Phe Phe
20 25 30Lys Asp Leu Gly Glu Lys Pro Phe Arg Ala Asp Gln Val Met Lys Trp35 40 45Met Tyr His Tyr Cys Cys Asp Asn Phe Asp Glu Met Thr Asp Ile Asn50 55 60Lys Val Leu Arg Gly Lys Leu Lys Glu Val Ala Glu Ile Arg Ala Pro65 70 75 80Glu Val Val Glu Glu Gln Arg Ser Ser Asp Gly Thr Ile Lys Trp Ala85 90 95Ile Ala Val Gly Asp Gln Arg Val Glu Thr Val Tyr Ile Pro Glu Asp100 105 110Asp Arg Ala Thr Leu Cys Val Ser Ser Gln Val Gly Cys Ala Leu Glu115 120 125Cys Lys Phe Cys Ser Thr Ala Gln Gln Gly Phe Asn Arg Asn Leu Arg130 135 140Val Ser Glu Ile Ile Gly Gln Val Trp Arg Ala Ala Lys Ile Val Gly145 150 155 160Ala Ala Lys Val Thr Gly Gln Arg Pro Ile Thr Asn Val Val Met Met165 170 175Gly Met Gly Glu Pro Leu Leu Asn Leu Asn Asn Val Val Pro Ala Met180 185 190Glu Ile Met Leu Asp Asp Phe Gly Phe Gly Leu Ser Lys Arg Arg Val195 200 205Thr Leu Ser Thr Ser Gly Val Val Pro Ala Leu Asp Lys Leu Gly Asp210 215 220Met Ile Asp Val Ala Leu Ala Ile Ser Leu His Ala Pro Asn Asp Glu225 230 235 240Ile Arg Asp Glu Ile Val Pro Ile Asn Lys Lys Tyr Asn Ile Glu Thr245 250 255Phe Leu Ala Ala Val Arg Arg Tyr Leu Glu Lys Ser Asn Ala Asn Gln260 265 270Gly Arg Val Thr Ile Glu Tyr Val Met Leu Asp His Val Asn Asp Gly275 280 285Thr Glu His Ala His Gln Leu Ala Glu Leu Leu Lys Asp Thr Pro Cys290 295 300Lys Ile Asn Leu Ile Pro Trp Asn Pro Phe Pro Gly Ala Pro Tyr Gly305 310 315 320Arg Ser Ser Asn Ser Arg Ile Asp Arg Phe Ser Lys Val Leu Met Ser325 330 335Tyr Gly Phe Thr Thr Ile Val Arg Lys Thr Arg Gly Asp Asp Ile Asp340 345 350Ala Ala Cys Gly Gln Leu Ala Gly Asp Val Ile Asp Arg Thr Lys Arg355 360 365Thr Leu Arg Lys Arg Met Gln Gly Glu Ala Ile Asp Ile Lys Ala Val370 375 380<210>7<211>1320<212>DNA<213>Plasmodium falciparum<220><221>CDS<222>(163)..(1173)<400>7taaataaata aattataaat ctttcaagaa tatatttttt ataaaaacat aaaatataaa 60atatacatat atatatatat atatatttta tattactttt aaaattattt atttatacaa 120atggaaattt aatgtgaaga atagaaaaaa cattttgtca at atg gaa aag tca174
Met Glu Lys Ser1aaa agg tac ata age ctg att aag atg atg gaa agg aaa aaa ttt gag 222Lys Arg Tyr Ile Ser Leu Ile Lys Met Met Glu Arg Lys Lys Phe Glu5 10 15 20aag tat aga tta aaa caa ata atg gat aat ata tat aaa gga aaa ata 270Lys Tyr Arg Leu Lys Gln Ile Met Asp Asn Ile Tyr Lys Gly Lys Ile25 30 35att gaa ata aat aaa atg aaa aat att cca act gaa ata aga aga gaa 318Ile Glu Ile Asn Lys Met Lys Asn Ile Pro Thr Glu Ile Arg Arg Glu40 45 50tta aaa aat ata ttt cat aat aat att tta agt ata aaa ccg atc aaa 366Leu Lys Asn Ile Phe His Asn Asn Ile Leu Ser Ile Lys Pro Ile Lys55 60 65gaa tta aaa tat gat aga gca tat aaa gta tta ttt cag tgt aaa gat 414Glu Leu Lys Tyr Asp Arg Ala Tyr Lys Val Leu Phe Gln Cys Lys Asp70 75 80aat gaa aag att gaa gca aca tca tta gat ttt ggt tcg cat aaa tct 462Asn Glu Lys Ile Glu Ala Thr Ser Leu Asp Phe Gly Ser His Lys Ser85 90 95 100tta tgt ata tct agc caa ata ggt tgt tct ttt gga tgt aag ttt tgt 510Leu Cys Ile Ser Ser Gln Ile Gly Cys Ser Phe Gly Cys Lys Phe Cys105 110 115gct act ggt caa att ggt ata aaa aga caa tta gat ata gat gaa ata 558Ala Thr Gly Gln Ile Gly Ile Lys Arg Gln Leu Asp Ile Asp Glu Ile120 125 130act gat caa ctt tta tat ttt caa tca aaa gga gtt gat ata aaa aat 606Thr Asp Gln Leu Leu Tyr Phe Gln Ser Lys Gly Val Asp Ile Lys Asn135 140 145ata tct ttt atg ggt atg gga gaa cct tta gct aat cca tat gtt ttt 654Ile Ser Phe Met Gly Met Gly Glu Pro Leu Ala Asn Pro Tyr Val Phe150 155 160gat tct ata caa ttt ttt aat gat aat aat tta ttt tct ata tct aat 702Asp Ser Ile Gln Phe Phe Asn Asp Asn Asn Leu Phe Ser Ile Ser Asn165 170 175 180aga cgt att aat ata tct act gtt ggt ctt tta cca gga att aaa aaa 750Arg Arg Ile Asn Ile Ser Thr Val Gly Leu Leu Pro Gly Ile Lys Lys185 190 195tta aat aac atc ttt cct caa gtt aat tta gct ttc tca tta cat tct 798Leu Asn Asn Ile Phe Pro Gln Val Asn Leu Ala Phe Ser Leu His Ser200 205 210cca ttt act gaa gaa agg gat caa ctt gta cca att aat aaa ttg ttt 846Pro Phe Thr Glu Glu Arg Asp Gln Leu Val Pro Ile Asn Lys Leu Phe215 220 225ccg ttt aat gaa gtt ttt gat tta tta gat gaa aga ata gca aaa act 894Pro Phe Asn Glu Val Phe Asp Leu Leu Asp Glu Arg Ile Ala Lys Thr230 235 240ggt aga aga gtt tgg ata agt tat att tta att aaa aat ctt aat gac 942Gly Arg Arg Val Trp Ile Ser Tyr Ile Leu Ile Lys Asn Leu Asn Asp245 250 255 260tcc aaa gat cat gca gaa gct ttg tct gat cat ata tgt aaa aga cca 990Ser Lys Asp His Ala Glu Ala Leu Ser Asp His Ile Cys Lys Arg Pro265 270 275aat aac ata aga tac tta tat aat gta tgt tta ata cct tat aat aaa 1038Asn Asn Ile Arg Tyr Leu Tyr Asn Val Cys Leu Ile Pro Tyr Asn Lys280 285 290ggt aat aga att tat aat ata tca ttt gaa tat ata tat ata tat ata 1086Gly Asn Arg Ile Tyr Asn Ile Ser Phe Glu Tyr Ile Tyr Ile Tyr Ile295 300 305tat tta cta ata ata aaa aaa aag ata tta tgt aaa tat att atg ttt 1134Tyr Leu Leu Ile Ile Lys Lys Lys Ile Leu Cys Lys Tyr Ile Met Phe310 315 320cac aca tta tat aaa tat ata ggc ata gag gac atg tta taaaaaagtg1183His Thr Leu Tyr Lys Tyr Ile Gly Ile Glu Asp Met Leu325 330 335caacatatat atatatatat atatatatat atatatatat acattttttt tatatttata 1243ttatcttttt aatacattta ttccattaca ttgcagccaa aaatgttgac gaaaattttc 1303atcgtttgga cgatgct1320<210>8<211>337<212>PRT<213>Plasmodium falciparum<400>8Met Glu Lys Ser Lys Arg Tyr Ile Ser Leu Ile Lys Met Met Glu Arg1 5 10 15Lys Lys Phe Glu Lys Tyr Arg Leu Lys Gln Ile Met Asp Asn Ile Tyr20 25 30Lys Gly Lys Ile Ile Glu Ile Asn Lys Met Lys Asn Ile Pro Thr Glu35 40 45Ile Arg Arg Glu Leu Lys Asn Ile Phe His Asn Asn Ile Leu Ser Ile50 55 60Lys Pro Ile Lys Glu Leu Lys Tyr Asp Arg Ala Tyr Lys Val Leu Phe65 70 75 80Gln Cys Lys Asp Asn Glu Lys Ile Glu Ala Thr Ser Leu Asp Phe Gly85 90 95Ser His Lys Ser Leu Cys Ile Ser Ser Gln Ile Gly Cys Ser Phe Gly100 105 110Cys Lys Phe Cys Ala Thr Gly Gln Ile Gly Ile Lys Arg Gln Leu Asp115 120 125Ile Asp Glu Ile Thr Asp Gln Leu Leu Tyr Phe Gln Ser Lys Gly Val130 135 140Asp Ile Lys Asn Ile Ser Phe Met Gly Met Gly Glu Pro Leu Ala Asn145 150 155 160Pro Tyr Val Phe Asp Ser Ile Gln Phe Phe Asn Asp Asn Asn Leu Phe165 170 175Ser Ile Ser Asn Arg Arg Ile Asn Ile Ser Thr Val Gly Leu Leu Pro180 185 190Gly Ile Lys Lys Leu Asn Asn Ile Phe Pro Gln Val Asn Leu Ala Phe195 200 205Ser Leu His Ser Pro Phe Thr Glu Glu Arg Asp Gln Leu Val Pro Ile210 215 220Asn Lys Leu Phe Pro Phe Asn Glu Val Phe Asp Leu Leu Asp Glu Arg225 230 235 240Ile Ala Lys Thr Gly Arg Arg Val Trp Ile Ser Tyr Ile Leu Ile Lys245 250 255Asn Leu Asn Asp Ser Lys Asp His Ala Glu Ala Leu Ser Asp His Ile260 265 270Cys Lys Arg Pro Asn Asn Ile Arg Tyr Leu Tyr Asn Val Cys Leu Ile275 280 285Pro Tyr Asn Lys Gly Asn Arg Ile Tyr Asn Ile Ser Phe Glu Tyr Ile290 295 300Tyr Ile Tyr Ile Tyr Leu Leu Ile Ile Lys Lys Lys Ile Leu Cys Lys305 310 315 320Tyr Ile Met Phe His Thr Leu Tyr Lys Tyr Ile Gly Ile Glu Asp Met325 330 335Leu
權(quán)利要求
1.細(xì)菌或寄生蟲的gcpE或yfgB基因的DNA序列用于摻入到病毒、真核生物和原核生物細(xì)胞的基因組中的應(yīng)用。
2.DNA序列用于摻入到病毒、真核生物和原核生物細(xì)胞的基因組中的應(yīng)用,所述DNA序列與細(xì)菌或寄生蟲的gcpE或yfgB的蛋白質(zhì)的DNA序列或來源于通過插入、缺失或替代的序列的其類似物或衍生物雜交,并且所述的DNA序列編碼具有g(shù)cpE或yfgB基因的生物學(xué)活性的質(zhì)體蛋白質(zhì)。
3.根據(jù)權(quán)利要求1或2之一的DNA序列SEQ 1,3,5或7的應(yīng)用。
4.根據(jù)權(quán)利要求1,2或3的應(yīng)用,其特征在于將這些序列與控制元件連接,該原件確保在細(xì)胞中的轉(zhuǎn)錄和翻譯的元件并且導(dǎo)致可翻譯的mRNA的表達(dá),引起gcpE或yfgB基因的合成。
5.植物細(xì)胞,其包括DNA序列,所述DNA序列與細(xì)菌或寄生蟲的gcpE或yfgB基因的DNA序列或來源于通過插入、缺失或替代的序列的其類似物或衍生物雜交,并且所述的DNA序列編碼具有g(shù)cpE或yfgB蛋白質(zhì)的生物學(xué)活性的質(zhì)體蛋白質(zhì)。
6.轉(zhuǎn)化的植物細(xì)胞和來自于所述的植物細(xì)胞的轉(zhuǎn)基因植物,包括DNA序列或與細(xì)菌或寄生蟲的gcpE或yfgB基因的DNA序列或來源于通過插入、缺失或替代的序列的其類似物或衍生物雜交的DNA序列,并且所述的DNA序列編碼具有g(shù)cpE或yfgB蛋白質(zhì)的生物學(xué)活性的質(zhì)體蛋白質(zhì)。
7.根據(jù)權(quán)利要求1-4任一項所述的應(yīng)用,特別是用于增加病毒,真核生物或原核生物細(xì)胞中類異戊二烯的含量。
8.根據(jù)權(quán)利要求1-4任一項所述的應(yīng)用,用于測定gcpE或yfgB蛋白質(zhì)的酶促活性。
9.根據(jù)權(quán)利要求1-4任一項所述的應(yīng)用,用于識別對gcpE蛋白質(zhì)的酶促活性具有抑制作用的物質(zhì)。
10.用于測定來自于細(xì)菌或寄生蟲的gcpE蛋白質(zhì)的酶促活性的方法,其特征在于缺失了類異戊二烯的生物合成的糖或磷糖或前體的磷酸化,特別是2-D-甲基-D-赤蘚糖醇,2-C-甲基-D-磷酸赤蘚糖醇,特別是2-C-甲基-D-赤蘚糖醇-4-磷酸的磷酸化,2-C-甲基-D-赤蘚糖,2-C-D-赤蘚糖磷酸,特別是2-C-甲基-赤蘚糖-4-磷酸,CH2(OH)-C(CH3)=C(OH)-CH2-O-PO(OH)2,CH2(OH)-C(CH3)=C(OH)-CH2-OH,CH2(OH)-C(CH3)-CO-CH2-O-PO(OH)2,CH2(OH)-C(CH3)-CO-CH2OH,CH2=C(CH3)-CO-CH2-O-PO(OH)2,CH2=C(CH3)-CO-CH2-OH,CH2=C(CH3)-CH(OH)-CH2-O-PO(OH)2,CH2=C(CH3)-CH(OH)-CH2-OH,CH2(OH)-C(=CH2)-C(OH)-CH2-O-PO(OH)2,CH2(OH)-C(=CH2)-C(OH)-CH2-OH,CHO-CH(CH3)-CH(OH)-CH2-O-PO(OH)2,CHO-CH(CH3)-CH(OH)-CH2-OH,CH2(OH)-C(OH)(CH3)-CH=CH-O-PO(OH)2,CH2(OH)-C(OH)(CH3)-CH=CH-OHCH(OH)=C(CH3)-CH(OH)-CH2-O-PO(OH)2,CH(OH)=C(CH3)-CH(OH)-CH2-OHCH3-C(CH3)=CH-CH2-O-PO(OH)2,CH3-C(CH3)=CH-CH2-OH,CH2=C(CH3)-CH2-CH2-O-PO(OH)2,CH2=C(CH3)-CH2-CH2-OH的磷酸化。
11.篩選化合物的方法,該方法包括a)提供了含有重組表達(dá)載體的宿主細(xì)胞,其中載體包括編碼來自于細(xì)菌或寄生蟲的gcpE或yfgB蛋白質(zhì)或該多肽的類似物或衍生物的DNA序列的至少一個片段,其中一個或多個氨基酸被缺失,疊加或被另一個氨基酸替代,并且基本上沒有降低該多肽的酶促活性,而且提供了被懷疑對人和動物具有抗真菌、抗寄生蟲,或抗病毒作用的化合物,b)將宿主細(xì)胞與該化合物接觸和c)測定該化合物的抗真菌,抗寄生蟲或抗病毒作用。
12.篩選化合物的方法,該方法包括a)提供了含有重組表達(dá)載體的宿主細(xì)胞,其中載體包括編碼來自于細(xì)菌或寄生蟲的gcpE或yfgB蛋白質(zhì)或該多肽的類似物或衍生物的DNA序列的至少一個片段,其中多肽一個或多個氨基酸被缺失,疊加或被其它氨基酸替代,并且基本上沒有降低該多肽的酶促活性,而且提供了被懷疑對植物具有抗病毒,抗寄生蟲,殺真菌或除草劑作用的化合物,b)將宿主細(xì)胞與該化合物接觸和c)測定該化合物的抗病毒,抗寄生蟲,殺真菌或除草劑作用。
全文摘要
本發(fā)明涉及來自于細(xì)菌或寄生蟲的命名為gcpE和yfgB基因的DNA序列用于摻入到病毒,真核細(xì)胞和原核細(xì)胞的基因組,從而改變類異戊二烯的濃度。本發(fā)明還涉及識別在植物中具有除草劑,抗寄生蟲,抗病毒劑,殺真菌劑作用和在人體和動物中具有抗真菌,抗寄生蟲,抗病毒劑作用的物質(zhì)的方法。
文檔編號G01NGK1351715SQ00807856
公開日2002年5月29日 申請日期2000年5月20日 優(yōu)先權(quán)日1999年5月21日
發(fā)明者哈?!ぶ祚R 申請人:朱馬制藥有限公司