Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a method for obtaining a common genic-sterile line by site-specific mutation of a common genic-sterile gene by using a genome editing technology, the aim of rapidly cultivating the common genic-sterile line is fulfilled by completely inactivating the common genic-sterile gene, the breeding period is short, the cost is low, the practicability is high, and the method has important significance for promoting the wide application of third-generation hybrid rice.
Therefore, the invention provides a method for cultivating a common genic male sterile line of rice, which comprises the following steps:
s1, designing a target site sequence according to the rice common genic male sterile gene by using a CRISPR/Cas9 system;
s2, constructing a pCRISPR/Cas9 recombinant vector containing the target site sequence fragment;
s3, introducing the obtained pCRISPR/Cas9 recombinant vector into embryonic callus of a maintainer line to obtain a transgenic seedling;
s4, screening transgenic positive plants in the transgenic seedlings;
s5, screening mutant plants in the transgenic positive plants;
s6, breeding the mutant plant, and separating the common genic male sterile line without transgenic components from the progeny plant.
In the above method, preferably, one strand of the target site sequence designed in step S1 has one of the following structures: 5' - (N)n-NGG-3’、5’-(N)n-NAG-3’、5’-(N)n-NGA-3', said N representing any one of A, T, C and G, 14. ltoreq. n.ltoreq.30.
The method preferably further comprises the step of selecting the general genic male sterile gene as the genemsp1、pair1、pair2、zep1、mel1、pss1、tdr、udt1、gamyb4、ptc1、api5、wda1、cyp704B2、dpw、mads3、osc6、rip1、csaAndaid1one kind of (1).
The method preferably further comprises the step of selecting the general genic male sterile gene as the geneptc1The Target site sequence comprises PTC1-Target1 and PTC1-Target2, the DNA sequence of the PTC1-Target1 is a sequence shown in SEQ ID NO.1, and the DNA sequence of the PTC1-Target1 is a sequence shown in SEQ ID NO. 2.
In the above method, preferably, the step S2 specifically includes the following steps:
s2-1, designing a joint primer with a sticky end according to the target site sequence and the information of the enzyme cutting site;
s2-2, enzyme cutting of the original vector;
s2-3, annealing the adaptor primer with the sticky end and then connecting the adaptor primer to an original vector subjected to enzyme digestion to obtain a recombinant gRNA expression cassette;
s2-4, carrying out PCR amplification on the recombinant gRNA expression cassette to obtain an amplification product;
s2-5, enzyme cutting the amplification product, connecting the amplification product after enzyme cutting to the pCRISPR/Cas9 carrier after enzyme cutting to obtain the recombinant carrier.
In the above method, preferably, the linker primer in step S2-1 includes PTC1-Target1-F, PTC1-Target1-R, PTC1-Target2-F and PTC1-Target 2-R; the DNA sequence of the PTC1-Target1-F is a sequence shown by SEQ ID NO.3, the DNA sequence of the PTC1-Target1-R is a sequence shown by SEQ ID NO.4, and the DNA sequence of the PTC1-Target2-F is a sequence shown by SEQ ID NO. 5; the DNA sequence of the PTC1-Target2-R is a sequence shown in SEQ ID NO. 6.
In the method, the original vector is preferably pU3-gRNA or pU6 a-gRNA.
In the above method, preferably, in step S2-2, BsaI is used to enzyme-cut the original vector; in the step S2-3, the adapter primer with sticky end is located between two Bsa I restriction sites of the original vector to form a recombinant gRNA expression cassette; in the step S2-5, BsaI is used to enzyme-cut the amplification product.
Preferably, in the above method, the step S5 specifically includes: extracting DNA of the transgenic positive plant, and carrying out PCR amplification to obtain an amplification product; sequencing the amplification product, and selecting T with the loss-of-function mutation at any target site0The generation homozygous mutant is used as a mutant plant.
Preferably, in the above method, step S6 specifically includes: and backcrossing the mutant plant and the maintainer line, and separating the common genic male sterile line without transgenic ingredients from the progeny plant.
Compared with the prior art, the invention has the advantages that:
(1) the invention provides a method for obtaining a common genic-sterile line by target mutation of a common genic-sterile gene based on a CRISPR/Cas9 system, the cultured common genic-sterile line does not contain transgenic components, and the method is not essentially different from the common genic-sterile line obtained by a physical-chemical mutagenesis method and a backcross breeding method, so that potential risks brought by transgenosis are avoided.
(2) The invention provides a method for culturing a common genic male sterile line of rice, which realizes the rapid culture of the common genic male sterile line, has higher efficiency and less genome damage than a physicochemical mutagenesis method; compared with the traditional backcross breeding method, the time is saved (the traditional backcross breeding method generally needs 4-6 years, but the method for the common genic male sterile line of the rice provided by the invention only needs two years), and the defects that the traditional backcross breeding method causes loss of some original excellent characters and the plant types of the common genic male sterile line bred by using the engineering maintainer line are inconsistent can be overcome.
Example 1:
the method for rapidly breeding the common genic-sterile line is specifically applied to obtain the common genic-sterile line from the PTC1 gene (the cDNA sequence of the PTC1 gene is shown in SEQ ID No.7, and the qDNA sequence of the PTC1 gene is shown in SEQ ID No. 8) in the CRISPR/Cas9 system site-specific mutagenesis maintainer line 832B, and specifically comprises the following steps (the process flow can be seen in figure 1):
(1) designing double target sites according to the exon sequence of the PTC1 gene:
PTC1-Target1(SEQ ID NO.1):CATGGTGGTCACCAAGTACC;
PTC1-Target2(SEQ ID NO.2):GGCCACAAGCTGCTCAGCCT。
the two sites are respectively positioned at 886bp-905bp and 917bp-936bp of the coding region of the PTC1 gene.
PTC1-Target1 has the structure of 5 '- (N) N-NGG-3'; PTC1-Target2 has a 5 '- (N) N-NGA-3' structure.
(2) Construction of CRISPR/Cas9 recombinant vector (pCRISPR/Cas 9-PTC 1) containing PTC1-Target1 and PTC1-Target2 double targets:
2.1 synthesizing two complementary paired nucleotide single strands of the target site according to the target site sequence:
PTC1-Target1-F(SEQ ID NO.3): ggcaCATGGTGGTCACCAAGTACC;
PTC1-Target1-R(SEQ ID NO.4): aaacGGTACTTGGTGACCACCATG;
PTC1-Target2-F(SEQ ID NO.5): gccGGCCACAAGCTGCTCAGCCT;
PTC1-Target2-R(SEQ ID NO.6): aaacAGGCTGAGCAGCTTGTGGC。
2.2 construction of CRISPR/Cas9-PTC1 recombinant vector:
2.2.1 the two complementary paired single nucleotide strands in step 2.1 are mixed in equal amount, and then denatured at 90 ℃ for 3min and annealed at 20 ℃ for 5min to form a double-stranded linker with sticky ends.
2.2.2 enzyme digestion of circular pU3-gRNA and pU6a-gRNA vectors to obtain linearized pU3-gRNA and pU6a-gRNA vectors: the pU3-gRNA and pU6a-gRNA vectors were digested with 10U BsaI in a 20. mu.L reaction system for 20min, and the vector was inactivated by leaving the vector at 70 ℃ for 5min to obtain linearized pU3-gRNA and pU6agRNA vectors.
2.2.3 ligation of the double-stranded linker with sticky ends obtained in step 2.2.1 to linearized pU3-gRNA, pU6a-gRNA vectors. The connection steps are as follows: taking 1. mu.l of 10x T4 DNA ligase buffer, 0.5. mu.l of pU3-gRNA/pU6a-gRNA vector (12 ng), 1. mu.l of PTC1-Target 1/PTC 1-Target1, 1. mu. l T4 DNA ligase, and finally adding ddH2And O to the total volume of 10 mu l, connecting for 30min at the temperature of 20-25 ℃ to obtain a recombinant guide-RNA expression cassette intermediate vector: pU3-PTC1-Target1-gRNA, pU6 a-PTC 1-Target 2-gRNA.
2.2.4 PCR amplification of expression cassettes: amplifying the pU3-PTC1-Target1-gRNA obtained in the step 2.2.3 by using primers U3-F and U3-R; the pU6 a-PTC 1-Target2-gRNA obtained in step 2.2.3 was amplified with primers U6a-F and U6 a-R. Wherein the sequences of the primers are respectively as follows:
U3-F:TTCAGAGGTCTCTCTCGCACTGGAATCGGCAGCAAAGG;
U3-R:AGCGTGGGTCTCGTCAGGGTCCATCCACTCCAAGCTC;
U6a-F:TTCAGAGGTCTCTCTGACACTGGAATCGGCAGCAAAGG;
U6a-R:AGCGTGGGTCTCGACCGGGTCCATCCACTCCAAGCTC;
the reaction system is as follows: 1 ul of pU3-PTC1-Target 1-gRNA/pU6 a-PTC 1-Target2-gRNA vector, 1.5 ul of each primer, 10 ul of dNTP, 25 ul of 2xbuffer, 1 ul of KOD enzyme, and ddH2O to a total volume of 50. mu.L.
The reaction procedure is as follows: pre-denaturation at 95 ℃ for 3min, 35 cycles: denaturation at 95 ℃ for 10s, annealing at 60 ℃ for 15s, and extension at 68 ℃ for 20 s.
The amplified PTC1-Target1-gRNA and PTC1-Target2-gRNA were purified using a PCR purification kit.
2.2.5 PTC1-Target1-gRNA and PTC1-Target2-gRNA are connected to pCRISPR/Cas9 vector to construct pCRISPR/Cas9-PTC1 recombinant vector. The specific construction method comprises the following steps: 1. mu.l of PTC1-Target1-gRNA and 1. mu.l of PTC1-Target2-gRNA, 1. mu.L of pCRISPR/Cas9 vector, 1. mu.L of BsaI, 1.5. mu.L of 10xSmart Buffer, 1.5. mu.L of ATP (1 mM), 1. mu. L T4 DNA ligand, and finally ddH2O to a total volume of 15. mu.L.
The reaction system is reacted on PCR, and the reaction program is 5min at 37 ℃, 5min at 10 ℃ and 5min at 20 ℃.
(3) Adopting agrobacterium-mediated technology to integrate pCRISPR/Cas9-PTC1 recombinant vector into embryogenic callus of maintainer line 832B and obtaining 16 strains of T through callus differentiation0Transgenic plants are generated. The method comprises the following specific steps:
3.1 the constructed pCRISPR/Cas9-PTC1 recombinant vector is introduced into Agrobacterium EHA105 to obtain EHA105 containing the pCRISPR/Cas9-PTC1 recombinant vector; and (3) inducing the callus on an induction culture medium by using the mature seeds of 832B to obtain the rice callus.
3.2 the EHA105 containing the recombinant vector pCRISPR/Cas9-PTC1 was inoculated on YM agar medium and cultured at 28 ℃ for 2 days to obtain a culture solution.
3.3 adding the collected culture solution into NB liquid medium containing 100mol/L acetosyringone to obtain bacterial solution with OD600 of 0.5.
3.4 soaking the rice callus obtained in the step 3.1 in the bacterial liquid obtained in the step 3.3 for 30min, washing with sterile water for 3 times, drying, and transferring the rice callus to an NB culture medium for dark culture at 28 ℃ for 3 days. Then transferring the rice callus to a culture medium containing 50mg/L hygromycin, placing the culture medium at 28 ℃ for dark culture, and subculturing once every 15 days for 2 times. After resistance screening, transferring the transgenic seedlings into a regeneration culture medium, and differentiating 16 transgenic seedlings. 5 transgenic positive plants are obtained by hygromycin PCR detection.
(4) Identification of mutation sites:
4.1 extracting DNA of 5 positive plants.
4.2 design detection primers according to the positions of two target sites:
PT 6-F: CCTCCGACATGATGCCCCGGTAGTCCAT and
PT6-R:GAGCTCCCCATGGTGGTCACCAAGTACCAG。
4.3 PT6-F is upstream of Target site PTC1-Target1, PT6-R is downstream of Target site PTC1-Target 2.5 positive strains of DNA are taken as a template, PT6-F and PT6-R are taken as primers to carry out PCR amplification to obtain a PCR amplification product. The reaction system of PCR amplification is as follows: mu.l of each of PCR Mix, PT6-F and PT 6-R0.2. mu.l, 1. mu.l of template, ddH2Make up to 10. mu.l of O. The reaction procedure is as follows: pre-denaturation at 95 ℃ for 3min, 35 cycles: denaturation at 94 ℃ for 10sAnnealing at 60 ℃ for 30s and extension at 72 ℃ for 30 s.
4.4 the PCR amplification product was sent to sequencing company for sequencing. The sequencing results are shown in FIG. 2, which represents base deletions. From FIG. 2 it is shown that two of the 5 positive plants are homozygous mutant lines, T0Representing the sterile character, are respectively named as PTC-M1 and PTC-M2. PTC-M1 lacks 26 bases between the two Target sites, PTC-M2 lacks 1 base A at the PTC1-Target1 Target site; the other three are heterozygous fertile mutant plants which are respectively named as PTC-M3, PTC-M4 and PTC-M5; PTC-M3 lacks a base C near the PTC1-Target2 Target site; the PTC-M4 lacks two base GC near the PTC1-Target2 Target site; PTC-M5 has a deletion of one base T near the Target site of PTC1-Target 2.
(5) Selecting PTC-M1 strain from two homozygous mutant strains, backcrossing the PTC-M1 strain with 832B for 1 generation to obtain BC1F1 progeny; the progeny of BC1F1 is selfed for 1 generation to obtain 378 strains of BC1F2 progeny. Because 26 bases are deleted from the PTC1 gene of the PTC-M1 strain, when the primers PT6-F and PT6-R are used for amplifying DNA of the 832B strain and the PTC-M1 strain, the sizes of PCR products of the two strains are different by 26 bases, and the PCR product of a heterozygous single strain is different by 26 bases. Therefore, the PTC1 gene of the backcross progeny can be genotyped according to the size and the number of the amplified bands of the PT6-F and PT6-R primers, and the sterile plants, hybrid plants and fertile plants in the backcross progeny can be identified through genotyping.
The method comprises the following specific steps:
5.1 taking 832B (electrophoresis band of PCR product is larger) and sterile mutant PTC-M1 (electrophoresis band of PCR product is smaller) as contrast, respectively amplifying DNA of 378 BC1F2 offspring by PT6-F and PT6-R primers, carrying out polyacrylamide gel electrophoresis on the amplified product, and identifying sterile single plants, heterozygous fertile single plants and homozygous fertile single plants according to the size and number of electrophoresis bands. See figure 3 for results: m is 50bp marker; p1 is 832B band type, and is homozygous fertile wild type; p2 is PTC-M1 banding pattern, is homozygous sterile mutation banding pattern, and double banding is fertile heterozygosis. Therefore, among individuals No. 1-24, individuals No.1, 2, 6, 7, 15, 16 are fertile individuals,individuals 8, 9, 10, 11, 14, 18, 21, 22, 23, 24 are mutated sterile individuals, and other double-banded plants are heterozygous fertile plants.
By the method, sterile single strains 89, heterozygous fertile strains 183 and homozygous fertile strains 106 are identified from 378 strains BC1F2 progeny.
Performing field fertility character investigation at the heading and flowering stage, wherein the heterozygous single plant and the common genic male sterile line single plant have glume flower forms in thegraph 4, wherein the heterozygous single plant anther is light yellow, contains normal pollen and shows fertility characters; the anther of the common genic male sterile line is white, normal pollen does not exist in the anther, and the anther shows sterile character. The results of the fertility investigation were completely consistent with the above genotyping results.
Carrying out hygromycin gene PCR detection on 89 sterile single plants, wherein hygromycin gene PCR detection is carried out in the figure 5, and the heterozygous plants contain hygromycin gene and therefore contain transgenic components; the common genic male sterile line does not contain hygromycin gene, so that the common genic male sterile line does not contain transgenic components. As a result, 2 ordinary genic male sterile lines containing no transgenic component were selected from 89 sterile individuals.
The hygromycin gene PCR detection primer is HPT-F: GACGTCTGTCGAGAAGTTTC and HPT-R: GCTGTTATGCGGCCATTGTC, the reaction sequence is: pre-denaturation at 95 ℃ for 3min, 35 cycles: denaturation at 95 ℃ for 10s, annealing at 58 ℃ for 15s, and extension at 72 ℃ for 30 s.
The foregoing is merely a preferred embodiment of the invention and is not intended to limit the invention in any manner. Although the present invention has been described with reference to the preferred embodiments, it is not intended to be limited thereto. Those skilled in the art can make many possible variations and modifications to the disclosed embodiments, or equivalent modifications, without departing from the spirit and scope of the invention, using the methods and techniques disclosed above. Therefore, any simple modification, equivalent replacement, equivalent change and modification made to the above embodiments according to the technical essence of the present invention are still within the scope of the protection of the technical solution of the present invention.
SEQUENCE LISTING
<110> research center for hybrid rice in Hunan province
<120> method for cultivating common genic male sterile line of rice
<130> do not
<160>8
<170>PatentIn version 3.5
<210>1
<211>20
<212>DNA
<213> Artificial sequence
<400>1
catggtggtc accaagtacc 20
<210>2
<211>20
<212>DNA
<213> Artificial sequence
<400>2
ggccacaagc tgctcagcct 20
<210>3
<211>24
<212>DNA
<213> Artificial sequence
<400>3
ggcacatggt ggtcaccaag tacc 24
<210>4
<211>24
<212>DNA
<213> Artificial sequence
<400>4
aaacggtact tggtgaccac catg 24
<210>5
<211>23
<212>DNA
<213> Artificial sequence
<400>5
gccggccaca agctgctcag cct 23
<210>6
<211>23
<212>DNA
<213> Artificial sequence
<400>6
aaacaggctg agcagcttgt ggc23
<210>7
<211>2232
<212>DNA
<213> Rice
<400>7
cgttgattgg cagcaactag ctagctcgcc gtccggccgg ccggccatgg cgcctaagat 60
ggtgatcagc ctggggagct cgcggcggcg gaagcgcggc gagatgctgt tccggttcga 120
ggccttctgc cagcccggct acccggcgaa cttcgccggc gccggcggct tcagggacaa 180
cgtgaggacg ctgctcggct tcgcgcacct ggaggccggc gtccacggcg agaccaagtg 240
ctggtcgttc cagctcgagc tgcaccgcca cccccccacc gtcgtgaggc tcttcgtcgt 300
cgaggaggag gtcgccgcct cgccgcaccg ccagtgccac ctctgccgcc atattgggtg 360
ggggaggcat ctgatatgca gcaagaggta tcacttcttg ctgccgagga gggaatcggc 420
ggcggaagcc gacggcctgt gcttcgcgat caaccacggc ggcggcggtg gcgcggagaa 480
agcgtcgtcg aaagggacga cgacgacggc ctccagcaga ggccacctgc tacacggcgt 540
cgtgcacctc aacggctacg gccacctcgt cgccctccac ggcctcgagg gcggctccga 600
cttcgtctcc ggccaccaga tcatggacct ctgggaccgc atttgctcag ccttgcacgt 660
aaggacggtg agcctggtgg acacggcgag gaagggccac atggagctga ggctgctgca 720
cggcgtcgcg tacggcgaga cgtggttcgg gcggtggggg tacaggtacg gccggccgag 780
ctacggcgtc gcgctgccgt cgtaccggca gtcgctgcac gtgctcggct ccatgccgct 840
ctgcgtgctg gtgccgcacc tgtcgtgctt cagccaggag ctccccatgg tggtcaccaa 900
gtaccaggcc atcagcggcc acaagctgct cagcctcggc gacctcctcc gcttcatgct 960
cgagctgcgc gcccgcctgc cggccacctc cgtcacggcc atggactacc ggggcatcat 1020
gtcggaggcc tcgtgccggt ggtcggcgaa gcgcgtcgac atggcggcgc gcgccgtcgt 1080
ggacgcgctc cgccgcgccg agccggcggc gcggtgggtc acgcggcagg aggtgcgcga 1140
cgcggcgcgc gcctacatcg gcgacacggg cctcctcgac ttcgtgctca agtccctcgg 1200
caaccacatc gtcggcaact acgtcgtgcg ccgcaccatg aacccggtga ccaaggtgct 1260
cgagtactgc ctcgaggacg tctccagcgt gctcccggcg gtcgccgccg gcggcggcgt 1320
gccggcgcag ggcaagatga gggtgaggtt ccagctcacg cgtgcgcagc tcatgaggga 1380
cctggtgcac ctgtaccggc acgtgctcaa ggagcccagc caggcgctca ccggcggcgc 1440
gttcggcgcg atcccggtgg cggtgcggat ggtcctggac atcaagcact tcgtcaaaga 1500
ttaccacgaa ggacaagccg cggcgagcag caatggcggt ggcggattcg ggcatcccca 1560
catcaacctg tgctgcacgc tgctcgtgag caacgggagc ccggagctag ctccaccgta 1620
cgagacggtg accctgccgg cgcacgcgac ggtgggcgag ctgaagtggg aggcgcagag 1680
ggtgttcagc gagatgtacc tcggcctgag gagcttcgcg gcggactccg tcgtcggggt 1740
cggcgccgac caggagggcc tcccggtgct cgggctggtc gacgtcggaa gcgccgtcgt 1800
ggtgcaaggg agcgtgggcg agcagataaa cggggaggac cacgagagga aggaggaggc 1860
ggcggcggcg gccgtgtgcg aggggagcgg cggcggcgag cgcgtcgtgg actgcgcgtg 1920
cggcgcggtg gacgacgacg gcgagcgcat ggcgtgctgc gacatctgcg aggcgtggca 1980
gcacacgcgg tgcgccggga tcgcggacac cgaggacgcg ccgcacgtct tcctctgcag 2040
ccggtgcgac aacgacgtcg tgtcgttccc gtccttcaac tgttagatgt gatgctgctg 2100
ctgctactgc tactactact gcctctgctg ctatatatga tgctacctag tacaagtgat 2160
cgagaattca atttgttttc tcggcaaaac caaaatgaaa acgaaggtaa aaccaagtga 2220
acttcagatc aa 2232
<210>8
<211>2407
<212>DNA
<213> Rice
<400>8
cgttgattgg cagcaactag ctagctcgcc gtccggccgg ccggccatgg cgcctaagat 60
ggtgatcagc ctggggagct cgcggcggcg gaagcgcggc gagatgctgt tccggttcga 120
ggccttctgc cagcccggct acccggcgaa cttcgccggc gccggcggct tcagggacaa 180
cgtgaggacg ctgctcggct tcgcgcacct ggaggccggc gtccacggcg agaccaagtg 240
ctggtcgttc cagctcgagc tgcaccgcca cccccccacc gtcgtgaggc tcttcgtcgt 300
cgaggaggag gtcgccgcct cgccgcaccg ccagtgccac ctctgccgcc atattggtcc 360
gtcgaacaaa ctacaattaa tcaatcaacc tttacatagg attgatccga tcgatgccat 420
ggtgttgtag ggtgggggag gcatctgata tgcagcaaga ggtatcactt cttgctgccg 480
aggagggaat cggcggcgga agccgacggc ctgtgcttcg cgatcaacca cggcggcggc 540
ggtggcgcgg agaaagcgtc gtcgaaaggg acgacgacga cggcctccag cagaggccac 600
ctgctacacg gcgtcgtgca cctcaacggc tacggccacc tcgtcgccct ccacggcctc 660
gagggcggct ccgacttcgt ctccggccac cagatcatgg acctctggga ccgcatttgc 720
tcagccttgc acgtaaggta gtagtagtat acatgtgcgt gtgcatgcat gcaagcaatg 780
caacgatgtc gggctgcgtg tgagaacatt tgcttgggca tggtgtggtg tatgcaagga 840
cggtgagcct ggtggacacg gcgaggaagg gccacatgga gctgaggctg ctgcacggcg 900
tcgcgtacgg cgagacgtgg ttcgggcggt gggggtacag gtacggccgg ccgagctacg 960
gcgtcgcgct gccgtcgtac cggcagtcgc tgcacgtgct cggctccatg ccgctctgcg 1020
tgctggtgcc gcacctgtcg tgcttcagcc aggagctccc catggtggtc accaagtacc 1080
aggccatcag cggccacaag ctgctcagcc tcggcgacct cctccgcttc atgctcgagc 1140
tgcgcgcccg cctgccggcc acctccgtca cggccatgga ctaccggggc atcatgtcgg 1200
aggcctcgtg ccggtggtcg gcgaagcgcg tcgacatggc ggcgcgcgcc gtcgtggacg 1260
cgctccgccg cgccgagccg gcggcgcggt gggtcacgcg gcaggaggtg cgcgacgcgg 1320
cgcgcgccta catcggcgac acgggcctcc tcgacttcgt gctcaagtcc ctcggcaacc 1380
acatcgtcgg caactacgtc gtgcgccgca ccatgaaccc ggtgaccaag gtgctcgagt 1440
actgcctcga ggacgtctcc agcgtgctcc cggcggtcgc cgccggcggc ggcgtgccgg 1500
cgcagggcaa gatgagggtg aggttccagc tcacgcgtgc gcagctcatg agggacctgg 1560
tgcacctgta ccggcacgtg ctcaaggagc ccagccaggc gctcaccggc ggcgcgttcg 1620
gcgcgatccc ggtggcggtg cggatggtcc tggacatcaa gcacttcgtc aaagattacc 1680
acgaaggaca agccgcggcg agcagcaatg gcggtggcgg attcgggcat ccccacatca 1740
acctgtgctg cacgctgctc gtgagcaacg ggagcccgga gctagctcca ccgtacgaga 1800
cggtgaccct gccggcgcac gcgacggtgg gcgagctgaa gtgggaggcg cagagggtgt 1860
tcagcgagat gtacctcggc ctgaggagct tcgcggcgga ctccgtcgtc ggggtcggcg 1920
ccgaccagga gggcctcccg gtgctcgggc tggtcgacgt cggaagcgcc gtcgtggtgc 1980
aagggagcgt gggcgagcag ataaacgggg aggaccacga gaggaaggag gaggcggcgg 2040
cggcggccgt gtgcgagggg agcggcggcg gcgagcgcgt cgtggactgc gcgtgcggcg 2100
cggtggacga cgacggcgag cgcatggcgt gctgcgacat ctgcgaggcg tggcagcaca 2160
cgcggtgcgc cgggatcgcg gacaccgagg acgcgccgca cgtcttcctc tgcagccggt 2220
gcgacaacga cgtcgtgtcg ttcccgtcct tcaactgtta gatgtgatgc tgctgctgct 2280
actgctacta ctactgcctc tgctgctata tatgatgcta cctagtacaa gtgatcgaga 2340
attcaatttg ttttctcggc aaaaccaaaa tgaaaacgaa ggtaaaacca agtgaacttc 2400
agatcaa 2407