Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.
The first embodiment is as follows:
a vector pHT01-dcas 9-omega for fusion expression of dcas9 and RNA polymerase omega subunit is constructed, which can express dcas 9-omega fusion protein. Specifically, a vector pHT01-dcas9 for expressing dcas9 is constructed, then the omega subunit gene of RNA polymerase is amplified, and after enzyme digestion, the omega subunit gene is enzymatically linked with the vector pHT01-dcas9 after the same enzyme digestion, so that a vector pHT01-dcas 9-omega is obtained.
Step 1, constructing pHT01-dcas 9-omega vector for expressing dcas 9-omega protein, and the vector map is shown in figure 2.
1) Designing a primer for pET-dcas9-VP64-6XHis aiming at an amplification template of the dcas9 gene, and using the primer
The DNA Polymerase was used for PCR to clone the dcas9 gene into pHT01 vector to obtain pHT01-dcas 9. Primers for amplification of dcas9 were:
dcas9-F:CGCGGATCCATGGACAAGAAGTACAGCATCGGCCTGGCCA
dcas9-R:gctctagaTCCTGATGAGCCTGAGCCGTGGTGGTGGTGGTGGTGGTC
and (3) PCR reaction system: 2 × PrimeSTAR Mix, 25 μ l; forward primer (10mM), 2.5. mu.l; reverse primer (10mM), 2.5. mu.l; template, 0.5 μ l; add ddH2O to 50. mu.l. The amplification conditions were: 10s at 98 ℃; 5s at 55 ℃; 72 ℃ for 20 s; 28 x; 72 ℃ for 5 min; 4 ℃ and infinity.
2) The dcas9 gene fragment and pHT01 vector were digested with XbaI and BamHI. The enzyme cutting system is as follows: xba I, 0.5. mu.l; BamHI, 1.5. mu.l; 10 × buffer, 5 μ l; the product was recovered by PCR in 30. mu.l. After treatment at 37 ℃ for 2h, the cells were recovered on a 1% agarose gel.
3) The dcas9 gene treated in 2) and pHT01 carrier are connected by enzyme with T4 ligase, the enzyme is: 10 XT 4DNA ligase buffer, 1. mu.l; t4DNA ligase, 0.5. mu.l; the molar ratio of the enzyme digestion fragment to the enzyme digestion vector is 1: 3; add ddH2O to 10. mu.l, and reacted at 25 ℃ for 2 h.
4) And (3) transforming the enzyme-linked product of the step (3) into escherichia coli DH5 alpha, verifying positive clones by colony PCR, and obtaining a recombinant vector pHT01-dcas9 after sequencing verification.
Step 2, pHT01-dcas 9-omega is constructed, and the vector is used for expressing the dcas 9-omega fusion protein.
5) And (3) amplifying the omega subunit of the RNA polymerase by using the genome of the bacillus subtilis SCK6 as a template, and recovering a PCR product through agarose gel. Primers for the ω subunit were:
ω-F:GCTCTAGAATGTTAGATCCGTCAATTGATTCTTTAATGAA
ω-R:TCCCCCCGGGCTATTCGCGGTCTTCCTTTTCAAACG
and (3) PCR reaction system: 10 Xpfu buffer, 5. mu.l; dNTP (2.5mM), 2. mu.l; forward primer (10 μ M), 2 μ l; reverse primer (10. mu.M), 2. mu.l; pfu polymerase, 2. mu.l; template, 1. mu.l; add ddH2O to 50. mu.l. The amplification conditions were: 95 ℃ for 5 min; 30 cycles of 95 ℃, 30s, 55 ℃, 30s, 72 ℃, 30 s; 72 ℃ for 5 min; 4 ℃ and infinity.
6) The fragment obtained in step 5) and the vector pHT01-dcas9 obtained in step 1 were digested with XbaI/XmaI in the following manner: xba I, 0.5. mu.l; xma i, 1.5 μ l; 10 × buffer, 5 μ l; the product was recovered by PCR in 30. mu.l. After treatment at 37 ℃ for 2h, the cells were recovered on a 1% agarose gel.
7) And (3) carrying out enzyme ligation on the vector and the fragment subjected to enzyme digestion by using T4 ligase, wherein an enzyme ligation system is shown as 3), transforming an enzyme ligation product into escherichia coli DH5 alpha, carrying out colony PCR (polymerase chain reaction) verification on positive clone, and carrying out sequencing verification to obtain a recombinant vector pHT01-dcas 9-omega.
The recombinant vector pHT01-dcas 9-omega inducible expression vector induces the expression of the dcas 9-omega by IPTG.
Example two:
a vector pHY-P242-BsmBI-BsmBI-sgRNA capable of expressing sgRNA was constructed, and the vector map is shown in FIG. 3.
1) Artificially synthesizing (Shanghai Biotechnology engineering Co., Ltd.) gene sequence P242-BsmBI-BsmBI-sgRNA, wherein the gene sequence comprises a P242 promoter, two BsmBI enzyme cutting sites and a sgRNA framework (shown in SEQ ID NO. 3).
2) The synthesized gene sequence and pHY300PLK vector are subjected to enzyme digestion treatment by EcoRI and HindIII, and the enzyme digestion system is as follows: EcoRI, 0.5. mu.l; hind III, 1.5. mu.l; 10 × buffer, 5 μ l; synthesis of the sequence, 30. mu.l. After treatment at 37 ℃ for 2h, the cleaved product was recovered on a 1% agarose gel.
3) Enzyme-linking the enzyme-digested vector with the fragment, wherein the enzyme-linking system is as follows: the molar ratio of the enzyme digestion vector to the enzyme digestion fragment is 1: 3; 10 XT 4DNA ligase buffer, 1. mu.l; t4DNA ligase, 0.5. mu.l; add ddH2O to 10 mul, reaction at 25 ℃ for 2 h.
4) And transforming the enzyme-linked product into an escherichia coli DH5 alpha strain, and carrying out PCR (polymerase chain reaction) and sequencing verification on a colony of the transformant to obtain a recombinant vector pHY-P242-BsmBI-BsmBI-sgRNA. The promoter of this vector is a constitutive promoter, and it can transcribe a downstream gene without an inducer, and since there is no ribosome binding site (RBS sequence) behind the promoter P242, this vector can transcribe only a gene downstream of the promoter P242 into RNA, but cannot translate it into a polypeptide, thereby obtaining a sgRNA sequence.
5) Designing a guide RNA sequence targeting a specific site, synthesizing two complementary primers according to the guide RNA sequence, annealing the two primers, and simultaneously using BsmB I to enzyme-cut the vector pHY-P242-BsmB I-BsmBI-sgRNA. The annealing conditions are as follows: 95 ℃ for 5 min; 95 deg.C, -1 deg.C/cycle, 40 cycles; at 55 ℃ for 30 min; 55 deg.C, -1 deg.C/cycle, 30 cycles; 4 ℃ and infinity. The enzyme cutting system is as follows: BsmBI, 0.5. mu.l; 10 × buffer, 5 μ l; pHY-P242-BsmB I-BsmBI-sgRNA plasmid, 30. mu.l, plus ddH2O to 50. mu.l. After treatment at 55 ℃ for 2 hours, the digested vector was recovered on a 1% agarose gel.
6) And (3) connecting the annealed primer with the enzyme-digested pHY-P242-BsmBI-BsmBI-sgRNA plasmid to construct a targeting vector: pHY-P242-sgR1, pHY-P242-sgR2, pHY-P242-sgR3, pHY-P242-sgR4 and pHY-P242-sgR5, which are targeted to different sites upstream of the P43-GFP expression unit, and are named sgR1, sgR2, sgR3, sgR4 and sgR 5. The enzyme connecting system is as follows: annealed fragment (50mM), 1. mu.l; enzyme digestion vector, 1 μ l; 10 XT 4DNA ligase buffer, 1. mu.l; t4DNA ligase, 0.5. mu.l; add ddH2O to 10. mu.l, and reacted at 25 ℃ for 2 h.
The sgrnas targeting different positions of the GFP promoter have the targeting sequences respectively (see fig. 1):
sgR 1: GATGGCCTTTTACGCGTCGA, 267bp from the Transcription Start Site (TSS) of GFP;
sgR 2: TAAGCAGAAGGCCATCCTGA, 283bp from the Transcription Start Site (TSS) of GFP;
sgR 3: CGGTGAACGCTCTCCTGAGT, 323bp from the Transcription Start Site (TSS) of GFP;
sgR 4: TCGTTTTATCTGTTGTTTGT, 343bp from the Transcription Start Site (TSS) of GFP;
sgR 5: ACTGGTCTGATCGGATCTTC, 410bp from the Transcription Start Site (TSS) of GFP.
Example three:
pHT01-dcas 9-omega and sgRs (including sgR1, sgR2, sgR3, sgR4 and sgR5) were transformed with Bacillus subtilis SG, respectively.
1) The bacillus subtilis competence is prepared by the following specific steps: SG was cultured overnight in YN medium (nutrient broth 18g/L, yeast extract 8g/L), and the overnight culture was transferred to fresh YN medium to OD600Equal to 1, induced with 1.5% xylose for 2h, resulting in competent cells.
2) SG/pHT01-dcas 9-omega was obtained by transforming SG competent cells with pHT01-dcas 9-omega and plating them on LB medium containing chloramphenicol. SG/pHT01-dcas 9-omega competent cells were prepared according to 1), and plasmids sgRs were transformed, respectively, and plated in LB medium containing chloramphenicol and tetracycline to obtain strain SG/pHT01-dcas 9-omega/sgRs.
Example four:
the effect of different targeting sgrnas on the expression level of GFP of the targeting gene was analyzed, and the results are shown in fig. 4.
1) Bacillus subtilis SG containing sgR1/pHT01-dcas 9-omega, sgR2/pHT01-dcas 9-omega, sgR3/pHT01-dcas 9-omega, sgR4/pHT01-dcas 9-omega, sgR5/pHT01-dcas 9-omega vectors was inoculated into chloramphenicol-resistant and tetracycline-resistant 3ml LB liquid medium and cultured overnight at 37 ℃.
2) Mu.l of the overnight culture was transferred to 3ml of fresh LB medium containing chloramphenicol and tetracycline resistance, and 0.5mM IPTG was added to induce expression of dcas 9-. omega.for another 10 hours.
3) 100. mu.l of each bacterial solution was mixed with 100. mu.l of water on a 96-well black-wall transparent bottom fluorescent plate, and the cell density (OD) was measured600) And fluorescence intensity (488nm/511nm), and the relative fluorescence intensity is obtained by dividing the fluorescence intensity by the cell density.
As can be seen from fig. 4, when sgRNA is targeted upstream of the promoter (P43) of the gene of interest (GFP), e.g., sgR1, sgR2, sgR3, sgR4, sgR5, dcas9- ω fusion protein can promote expression of GFP. The expression level (relative fluorescence intensity) of GFP is improved by 1.5 times to the maximum.
Example five:
the effect of different targeted sgrnas on the level of GFP transcription of the targeted gene was analyzed, see fig. 5.
1) Strains containing sgRs (sgR1, sgR2, sgR3, sgR4, sgR5) vector promoting GFP expression were cultured at 37 ℃ in 3ml of LB medium containing chloramphenicol and tetracycline resistance, 200. mu.l of overnight culture was transferred to 3ml of fresh LB medium containing chloramphenicol and tetracycline resistance after overnight culture, IPTG was added to induce expression of dcas 9-. omega., and further culture was carried out for 10 hours to collect cells.
2) The specific steps of RNA extraction refer to a kit: bacterial RNA Kit (Omega Co.).
3) The specific steps of carrying out DNase treatment on the sample are as follows: to an RNase-free EP tube was added 3. mu.l of RNA, 10 × Reaction buffer with MgCl2Mu.l of DNase I (RNase-free) was added to 10. mu.l of water. Incubate at 37 ℃ for 1h, add 1. mu.l of 50mM EDTA, and incubate at 65 ℃ for 10 min.
4) See kit for specific procedures for reverse transcription: RevertAId First Strand cDNA Synthesis Kit (ThermoFisher Co., Ltd.)
5) The specific steps of the fluorescent quantitative PCR refer to a kit: iTaq Universal SYBR Green Supermix (Axygen Co., Ltd.)
The internal reference gene is as follows: rpsj
The primers are as follows: RPSJ-F: GATGGAAGCGTTCAACTAGCAGAC
RPSJ-R:GCAGATTGTGTGGACAGGTAATGG
The reaction conditions are as follows: at 95 ℃ for 3 min; 95 ℃ for 10 s; 30s at 55 ℃; 39 cycles were performed; 95 ℃ for 10 s; 65 ℃, 5s, 95 ℃, 0.5 ℃/s.
The target gene is as follows: GFP (green fluorescent protein)
The primers are as follows: GFP-F: ACAAATACAAAGATTCTCGTGAGC
GFP-R:CTAATCGCATAAGAGCATCAACAG
The reaction conditions are as follows: at 95 ℃ for 3 min; 95 ℃ for 10 s; 30s at 55 ℃; 39 cycles were performed; 95 ℃ for 10 s; 65 ℃, 5s, 95 ℃, 0.5 ℃/s.
As shown in FIG. 5, the relative fluorescence quantitative analysis showed that the GFP expression level of the strain was increased, and the maximum transcription level was increased by 2.3 times, indicating that dcas 9-omega indeed increased the GFP expression level at the transcription level, and the principle of the present invention was satisfied.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.
Sequence listing
<110> university of Hubei
<120> dcas 9-omega fusion protein for specifically regulating expression of bacillus subtilis exogenous gene and application thereof
<160> 4
<170> SIPOSequenceListing 1.0
<210> 1
<211> 1441
<212> PRT
<213> Artificial sequence (artificial sequence)
<400> 1
Met Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val
1 5 10 15
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe
20 25 30
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile
35 40 45
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu
50 55 60
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys
65 70 75 80
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser
85 90 95
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys
100 105 110
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr
115 120 125
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp
130 135 140
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His
145 150 155 160
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro
165 170 175
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr
180 185 190
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala
195 200 205
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn
210 215 220
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn
225 230 235 240
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe
245 250 255
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp
260 265 270
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp
275 280 285
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp
290 295 300
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser
305 310 315 320
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys
325 330 335
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe
340 345 350
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser
355 360 365
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp
370 375 380
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg
385 390 395 400
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu
405 410 415
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe
420 425 430
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile
435 440 445
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp
450 455 460
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu
465 470 475 480
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr
485 490 495
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser
500 505 510
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys
515 520 525
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln
530 535 540
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr
545 550 555 560
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp
565 570 575
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly
580 585 590
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp
595 600 605
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr
610 615 620
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala
625 630 635 640
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr
645 650 655
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp
660 665 670
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe
675 680 685
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe
690 695 700
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu
705 710 715 720
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly
725 730 735
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly
740 745 750
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln
755 760 765
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile
770 775 780
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro
785 790 795 800
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu
805 810 815
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg
820 825 830
Leu Ser Asp Tyr Asp Val Asp Ala Ile Val Pro Gln Ser Phe Leu Lys
835 840 845
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg
850 855 860
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys
865 870 875 880
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys
885 890 895
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp
900 905 910
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr
915 920 925
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp
930 935 940
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser
945 950 955 960
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg
965 970 975
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val
980 985 990
Val Gly Thr Ala Leu Ile Lys Lys Tyr Pro Lys Leu Glu Ser Glu Phe
995 1000 1005
Val Tyr Gly Asp Tyr Lys Val Tyr Asp Val Arg Lys Met Ile Ala Lys
1010 1015 1020
Ser Glu Gln Glu Ile Gly Lys Ala Thr Ala Lys Tyr Phe Phe Tyr Ser
1025 1030 1035 1040
Asn Ile Met Asn Phe Phe Lys Thr Glu Ile Thr Leu Ala Asn Gly Glu
1045 1050 1055
Ile Arg Lys Arg Pro Leu Ile Glu Thr Asn Gly Glu Thr Gly Glu Ile
1060 1065 1070
Val Trp Asp Lys Gly Arg Asp Phe Ala Thr Val Arg Lys Val Leu Ser
1075 1080 1085
Met Pro Gln Val Asn Ile Val Lys Lys Thr Glu Val Gln Thr Gly Gly
1090 1095 1100
Phe Ser Lys Glu Ser Ile Leu Pro Lys Arg Asn Ser Asp Lys Leu Ile
1105 1110 1115 1120
Ala Arg Lys Lys Asp Trp Asp Pro Lys Lys Tyr Gly Gly Phe Asp Ser
1125 1130 1135
Pro Thr Val Ala Tyr Ser Val Leu Val Val Ala Lys Val Glu Lys Gly
1140 1145 1150
Lys Ser Lys Lys Leu Lys Ser Val Lys Glu Leu Leu Gly Ile Thr Ile
1155 1160 1165
Met Glu Arg Ser Ser Phe Glu Lys Asn Pro Ile Asp Phe Leu Glu Ala
1170 1175 1180
Lys Gly Tyr Lys Glu Val Lys Lys Asp Leu Ile Ile Lys Leu Pro Lys
1185 1190 1195 1200
Tyr Ser Leu Phe Glu Leu Glu Asn Gly Arg Lys Arg Met Leu Ala Ser
1205 1210 1215
Ala Gly Glu Leu Gln Lys Gly Asn Glu Leu Ala Leu Pro Ser Lys Tyr
1220 1225 1230
Val Asn Phe Leu Tyr Leu Ala Ser His Tyr Glu Lys Leu Lys Gly Ser
1235 1240 1245
Pro Glu Asp Asn Glu Gln Lys Gln Leu Phe Val Glu Gln His Lys His
1250 1255 1260
Tyr Leu Asp Glu Ile Ile Glu Gln Ile Ser Glu Phe Ser Lys Arg Val
1265 1270 1275 1280
Ile Leu Ala Asp Ala Asn Leu Asp Lys Val Leu Ser Ala Tyr Asn Lys
1285 1290 1295
His Arg Asp Lys Pro Ile Arg Glu Gln Ala Glu Asn Ile Ile His Leu
1300 1305 1310
Phe Thr Leu Thr Asn Leu Gly Ala Pro Ala Ala Phe Lys Tyr Phe Asp
1315 1320 1325
Thr Thr Ile Asp Arg Lys Arg Tyr Thr Ser Thr Lys Glu Val Leu Asp
1330 1335 1340
Ala Thr Leu Ile His Gln Ser Ile Thr Gly Leu Tyr Glu Thr Arg Ile
1345 1350 1355 1360
Asp Leu Ser Gln Leu Gly Gly Asp Gly Ser Ala Ala Ser Arg Met Leu
1365 1370 1375
Asp Pro Ser Ile Asp Ser Leu Met Asn Lys Leu Asp Ser Lys Tyr Thr
1380 1385 1390
Leu Val Thr Val Ser Ala Arg Arg Ala Arg Glu Met Gln Ile Lys Lys
1395 1400 1405
Asp Gln Met Ile Glu His Thr Ile Ser His Lys Tyr Val Gly Lys Ala
1410 1415 1420
Leu Glu Glu Ile Asp Ala Gly Leu Leu Ser Phe Glu Lys Glu Asp Arg
1425 1430 1435 1440
Glu
<210> 2
<211> 4326
<212> DNA
<213> Artificial sequence (artificial sequence)
<400> 2
atggataaga aatactcaat aggcttagct atcggcacaa atagcgtcgg atgggcggtg 60
atcactgatg aatataaggt tccgtctaaa aagttcaagg ttctgggaaa tacagaccgc 120
cacagtatca aaaaaaatct tataggggct cttttatttg acagtggaga gacagcggaa 180
gcgactcgtc tcaaacggac agctcgtaga aggtatacac gtcggaagaa tcgtatttgt 240
tatctacagg agattttttc aaatgagatg gcgaaagtag atgatagttt ctttcatcga 300
cttgaagagt cttttttggt ggaagaagac aagaagcatg aacgtcatcc tatttttgga 360
aatatagtag atgaagttgc ttatcatgag aaatatccaa ctatctatca tctgcgaaaa 420
aaattggtag attctactga taaagcggat ttgcgcttaa tctatttggc cttagcgcat 480
atgattaagt ttcgtggtca ttttttgatt gagggagatt taaatcctga taatagtgat 540
gtggacaaac tatttatcca gttggtacaa acctacaatc aattatttga agaaaaccct 600
attaacgcaa gtggagtaga tgctaaagcg attctttctg cacgattgag taaatcaaga 660
cgattagaaa atctcattgc tcagctcccc ggtgagaaga aaaatggctt atttgggaat 720
ctcattgctt tgtcattggg tttgacccct aattttaaat caaattttga tttggcagaa 780
gatgctaaat tacagctttc aaaagatact tacgatgatg atttagataa tttattggcg 840
caaattggag atcaatatgc tgatttgttt ttggcagcta agaatttatc agatgctatt 900
ttactttcag atatcctaag agtaaatact gaaataacta aggctcccct atcagcttca 960
atgattaaac gctacgatga acatcatcaa gacttgactc ttttaaaagc tttagttcga 1020
caacaacttc cagaaaagta taaagaaatc ttttttgatc aatcaaaaaa cggatatgca 1080
ggttatattg atgggggagc tagccaagaa gaattttata aatttatcaa accaatttta 1140
gaaaaaatgg atggtactga ggaattattg gtgaaactaa atcgtgaaga tttgctgcgc 1200
aagcaacgga cctttgacaa cggctctatt ccccatcaaa ttcacttggg tgagctgcat 1260
gctattttga gaagacaaga agacttttat ccatttttaa aagacaatcg tgagaagatt 1320
gaaaaaatct tgacttttcg aattccttat tatgttggtc cattggcgcg tggcaatagt 1380
cgttttgcat ggatgactcg gaagtctgaa gaaacaatta ccccatggaa ttttgaagaa 1440
gttgtcgata aaggtgcttc agctcaatca tttattgaac gcatgacaaa ctttgataaa 1500
aatcttccaa atgaaaaagt actaccaaaa catagtttgc tttatgagta ttttacggtt 1560
tataacgaat tgacaaaggt caaatatgtt actgaaggaa tgcgaaaacc agcatttctt 1620
tcaggtgaac agaagaaagc cattgttgat ttactcttca aaacaaatcg aaaagtaacc 1680
gttaagcaat taaaagaaga ttatttcaaa aaaatagaat gttttgatag tgttgaaatt 1740
tcaggagttg aagatagatt taatgcttca ttaggtacct accatgattt gctaaaaatt 1800
attaaagata aagatttttt ggataatgaa gaaaatgaag atatcttaga ggatattgtt 1860
ttaacattga ccttatttga agatagggag atgattgagg aaagacttaa aacatatgct 1920
cacctctttg atgataaggt gatgaaacag cttaaacgtc gccgttatac tggttgggga 1980
cgtttgtctc gaaaattgat taatggtatt agggataagc aatctggcaa aacaatatta 2040
gattttttga aatcagatgg ttttgccaat cgcaatttta tgcagctgat ccatgatgat 2100
agtttgacat ttaaagaaga cattcaaaaa gcacaagtgt ctggacaagg cgatagttta 2160
catgaacata ttgcaaattt agctggtagc cctgctatta aaaaaggtat tttacagact 2220
gtaaaagttg ttgatgaatt ggtcaaagta atggggcggc ataagccaga aaatatcgtt 2280
attgaaatgg cacgtgaaaa tcagacaact caaaagggcc agaaaaattc gcgagagcgt 2340
atgaaacgaa tcgaagaagg tatcaaagaa ttaggaagtc agattcttaa agagcatcct 2400
gttgaaaata ctcaattgca aaatgaaaag ctctatctct attatctcca aaatggaaga 2460
gacatgtatg tggaccaaga attagatatt aatcgtttaa gtgattatga tgtcgatgcc 2520
attgttccac aaagtttcct taaagacgat tcaatagaca ataaggtctt aacgcgttct 2580
gataaaaatc gtggtaaatc ggataacgtt ccaagtgaag aagtagtcaa aaagatgaaa 2640
aactattgga gacaacttct aaacgccaag ttaatcactc aacgtaagtt tgataattta 2700
acgaaagctg aacgtggagg tttgagtgaa cttgataaag ctggttttat caaacgccaa 2760
ttggttgaaa ctcgccaaat cactaagcat gtggcacaaa ttttggatag tcgcatgaat 2820
actaaatacg atgaaaatga taaacttatt cgagaggtta aagtgattac cttaaaatct 2880
aaattagttt ctgacttccg aaaagatttc caattctata aagtacgtga gattaacaat 2940
taccatcatg cccatgatgc gtatctaaat gccgtcgttg gaactgcttt gattaagaaa 3000
tatccaaaac ttgaatcgga gtttgtctat ggtgattata aagtttatga tgttcgtaaa 3060
atgattgcta agtctgagca agaaataggc aaagcaaccg caaaatattt cttttactct 3120
aatatcatga acttcttcaa aacagaaatt acacttgcaa atggagagat tcgcaaacgc 3180
cctctaatcg aaactaatgg ggaaactgga gaaattgtct gggataaagg gcgagatttt 3240
gccacagtgc gcaaagtatt gtccatgccc caagtcaata ttgtcaagaa aacagaagta 3300
cagacaggcg gattctccaa ggagtcaatt ttaccaaaaa gaaattcgga caagcttatt 3360
gctcgtaaaa aagactggga tccaaaaaaa tatggtggtt ttgatagtcc aacggtagct 3420
tattcagtcc tagtggttgc taaggtggaa aaagggaaat cgaagaagtt aaaatccgtt 3480
aaagagttac tagggatcac aattatggaa agaagttcct ttgaaaaaaa tccgattgac 3540
tttttagaag ctaaaggata taaggaagtt aaaaaagact taatcattaa actacctaaa 3600
tatagtcttt ttgagttaga aaacggtcgt aaacggatgc tggctagtgc cggagaatta 3660
caaaaaggaa atgagctggc tctgccaagc aaatatgtga attttttata tttagctagt 3720
cattatgaaa agttgaaggg tagtccagaa gataacgaac aaaaacaatt gtttgtggag 3780
cagcataagc attatttaga tgagattatt gagcaaatca gtgaattttc taagcgtgtt 3840
attttagcag atgccaattt agataaagtt cttagtgcat ataacaaaca tagagacaaa 3900
ccaatacgtg aacaagcaga aaatattatt catttattta cgttgacgaa tcttggagct 3960
cccgctgctt ttaaatattt tgatacaaca attgatcgta aacgatatac gtctacaaaa 4020
gaagttttag atgccactct tatccatcaa tccatcactg gtctttatga aacacgcatt 4080
gatttgagtc agctaggagg tgacggttct gcagcttcta gaatgttaga tccgtcaatt 4140
gattctttaa tgaataaatt agattcaaaa tatacgctgg tgactgtttc tgcgagacgt 4200
gcccgtgaaa tgcaaatcaa aaaagaccaa atgattgaac atacgatttc acacaaatat 4260
gtaggcaaag ctttagaaga aattgatgca ggcctgcttt cgtttgaaaa ggaagaccgc 4320
gaatag 4326
<210> 3
<211> 404
<212> DNA
<213> Artificial sequence (artificial sequence)
<400> 3
aagcttgctc tagatatttt tttgccaaag ctgtaattga ggaatcatag aatttttaat 60
ttaaatttta tttgaatctt tacaatccta ttgatataat aaatgtagtg aggtggagga 120
gacgcgcgtc tccgttttag agctagaaat agcaagttaa aataaggcta gtccgttatc 180
aacttgaaaa agtggcaccg agtcggtgct tttttcttca gcacaattcc aagaaaaaca 240
cgatttagaa cctaaaaaga acgaatttga actaactcat aaccgagagg taaaaaaaga 300
acgaagtcga gatcagggaa tgagtttata aaataaaaaa agcacctgaa aaggtgtctt 360
tttttgatgg ttttgaactt ggactagtcc gggatcccga attc 404
<210> 4
<211> 1323
<212> DNA
<213> Artificial sequence (artificial sequence)
<400> 4
ggaaccacaa attaaaactg gtctgatcgg atcttcaggc atcaaataaa acgaaaggct 60
cagtcgaaag actgggcctt tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt 120
aggacaaatc cgccgctcta gctaagcaga aggccatcct gacggatggc cttttacgcg 180
tcgacgggat cctgataggt ggtatgtttt cgcttgaact tttaaataca gccattgaac 240
atacggttga tttaataact gacaaacatc accctcttgc taaagcggcc aaggacgctg 300
ccgccggggc tgtttgcgtt tttgccgtga tttcgtgtat cattggttta cttatttttt 360
tgccaaagct gtaatggctg aaaattctta catttatttt acatttttag aaatgggcgt 420
gaaaaaaagc gcgcgattat gtaaaatata aagtgatagc ggtaccatta tagaaaggag 480
gtgataaaaa tgcgtaaagg agaagaactt ttcactggag ttgtcccaat tcttgttgaa 540
ttagatggtg atgttaatgg gcacaaattt tctgtcagtg gagagggtga aggtgatgca 600
acatacggaa aacttaccct taaatttatt tgcactactg gaaaactacc tgttccatgg 660
ccaacacttg tcactacttt cgggtatggt gttcaatgct ttgcgagata cccagatcat 720
atgaaacggc atgacttttt caagagtgcc atgcccgaag gttatgtaca ggaaagaact 780
atatttttca aagatgacgg gaactacaag acacgtgctg aagtcaagtt tgaaggtgat 840
acccttgtta atagaatcga gttaaaaggt attgatttta aagaagatgg aaacattctt 900
ggacacaaat tggaatacaa ctataactca cacaatgtat acatcatggc agacaaacaa 960
aagaatggaa tcaaagttaa cttcaaaatt agacacaaca ttgaagatgg aagcgttcaa 1020
ctagcagacc attatcaaca aaatactcca attggcgatg gccctgtcct tttaccagac 1080
aaccattacc tgtccacaca atctgccctt tcgaaagatc ccaacgaaaa gagagaccac 1140
atggtccttc ttgagtttgt aacagctgct gggattacac atggcatgga tgaactatac 1200
aaataaaagc ttgggcttaa ttaattaaga ctcctgttga tagatccagt aatgacctca 1260
gaactccatc tggatttgtt cagaacgctc ggttgccgcc gggcgttttt tattggtgag 1320
aat 1323