It will be apparent to those skilled in the art that various modifications and variations can be made in the specific embodiments of the present disclosure without departing from the scope or spirit of the disclosure. Other embodiments will be apparent to those skilled in the art from consideration of the specification. The specification and examples are exemplary only.

Sequence listing

<110> Yuan code Gene technology (Beijing) Ltd

<120> methods, compositions and uses for detecting microsatellite instability in a control-free sample based on next-generation sequencing technology

<130> BH2110329

<141> 2021-06-28

<160> 10

<170> SIPOSequenceListing 1.0

<210> 1

<211> 21

<212> DNA

<213> Artificial sequence ()

<400> 1

tctgcatttt aactatggct c 21

<210> 2

<211> 21

<212> DNA

<213> Artificial sequence ()

<400> 2

ctcgcctcca agaatgtaag t 21

<210> 3

<211> 21

<212> DNA

<213> Artificial sequence ()

<400> 3

ctgcggtaat caagttttta g 21

<210> 4

<211> 22

<212> DNA

<213> Artificial sequence ()

<400> 4

aaccattcaa catttttaac cc 22

<210> 5

<211> 18

<212> DNA

<213> Artificial sequence ()

<400> 5

gaaatggtgg gaacccag 18

<210> 6

<211> 21

<212> DNA

<213> Artificial sequence ()

<400> 6

ggtggatcaa atttcacttg g 21

<210> 7

<211> 20

<212> DNA

<213> Artificial sequence ()

<400> 7

gagtcgctgg cacagttcta 20

<210> 8

<211> 20

<212> DNA

<213> Artificial sequence ()

<400> 8

ctggtcactc gcgtttacaa 20

<210> 9

<211> 20

<212> DNA

<213> Artificial sequence ()

<400> 9

attgtgccat tgcattccaa 20

<210> 10

<211> 27

<212> DNA

<213> Artificial sequence ()

<400> 10

gtgtcttgct gaattttacc tcctgac 27
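
For readers who wish to handle the listed sequences programmatically, the following is a minimal sketch that transcribes SEQ ID NO. 1-10 and checks each against its declared <211> length. The names `PRIMERS`, `clean` and `length_mismatches` are illustrative assumptions and do not appear in the specification. The listing does not state which primer corresponds to which biomarker, so no such mapping is assumed here.

```python
# SEQ ID NO -> (declared <211> length, sequence exactly as printed in the listing)
# The dictionary name and helpers are hypothetical, used only for illustration.
PRIMERS = {
    1: (21, "tctgcatttt aactatggct c"),
    2: (21, "ctcgcctcca agaatgtaag t"),
    3: (21, "ctgcggtaat caagttttta g"),
    4: (22, "aaccattcaa catttttaac cc"),
    5: (18, "gaaatggtgg gaacccag"),
    6: (21, "ggtggatcaa atttcacttg g"),
    7: (20, "gagtcgctgg cacagttcta"),
    8: (20, "ctggtcactc gcgtttacaa"),
    9: (20, "attgtgccat tgcattccaa"),
    10: (27, "gtgtcttgct gaattttacc tcctgac"),
}


def clean(sequence: str) -> str:
    """Remove the spaces that group the listing into blocks of ten bases."""
    return sequence.replace(" ", "")


def length_mismatches(primers: dict) -> list:
    """Return the SEQ ID NOs whose actual length differs from the declared one."""
    return [
        seq_id
        for seq_id, (declared, sequence) in primers.items()
        if len(clean(sequence)) != declared
    ]


if __name__ == "__main__":
    print("Mismatched SEQ ID NOs:", length_mismatches(PRIMERS))
```

An empty result indicates that every transcribed sequence has the length declared in its <211> field.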

Claims

1. A method for detecting microsatellite instability in a sample without a control, based on next-generation sequencing technology, characterized by comprising the following steps:

(1) Using an amplification primer composition having the sequences shown in SEQ ID NO. 1-10 to construct a library from gDNA (genomic DNA) of a tissue sample to obtain a sample library, and sequencing the sample library to obtain sequencing data, wherein no control sample is included in this step, the tissue sample is a potential cancer tissue and does not include blood or components thereof, and a specific tag sequence is added to the obtained sample library after the library is constructed in step (1);

(2) Extracting sample data from the sequencing data by using the specific tag sequence of the sample, and extracting a sequencing sequence corresponding to each MSI biomarker from the sample data according to the amplification primer composition, wherein the biomarkers are BAT25, BAT26, MONO27, NR21 and NR24, and each primer in the amplification primer composition specifically binds to at least a partial sequence of the MSI biomarker;

(4) Calculating, according to the amplification primer composition having the sequences shown in SEQ ID NO. 1-10, the mean and standard deviation of the sequence length of the maximum sequencing sequence corresponding to the amplification primers for each biomarker in a human blood database, wherein for BAT25 the mean is 123.9038462 and the standard deviation is 0.533564241; for BAT26 the mean is 179.0666667 and the standard deviation is 0.393122697; for MONO27 the mean is 171.5357143 and the standard deviation is 2.8620995; for NR21 the mean is 110.9642857 and the standard deviation is 0.631427984; and for NR24 the mean is 133.8653846 and the standard deviation is 0.595039829;

(5) Calculating a Z value using formula (I), wherein the biomarker is considered unstable if |Z value| >= 3 and stable if |Z value| < 3, and wherein formula (I) is: Z value = (calculated value - mean) / standard deviation;

considering the tissue sample to be high-frequency unstable, i.e., of MSI-H type, if more than 2 biomarkers are unstable, and considering the tissue sample to be stable, i.e., of MSS type, if fewer than 1 biomarker is unstable;

wherein the sequencing in step (1) is second-generation sequencing.

2. The method for detecting microsatellite instability in a sample without a control based on next-generation sequencing technology according to claim 1, wherein the amplification primer composition comprises a first amplification primer pair, a second amplification primer pair, a third amplification primer pair, a fourth amplification primer pair and a fifth amplification primer pair, and wherein the amplification primer pairs specifically bind to at least partial sequences of 5 different MSI biomarkers, respectively.

3. A composition for detecting microsatellite instability in a sample without a control, based on next-generation sequencing technology, characterized by comprising an amplification primer composition for constructing a library from a tissue sample to obtain a sample library, a specific tag sequence for the sample, and reagents for primer extension and amplification reactions, wherein the sequences of the amplification primer composition are as shown in SEQ ID NO. 1-10.

4. Use of the composition according to claim 3 in the preparation of a detector for detecting microsatellite instability in a sample without a control based on next-generation sequencing technology.
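
The Z-value calculation and classification recited in claim 1 may be illustrated by the following minimal sketch. It uses the means and standard deviations recited in step (4) and formula (I) of step (5). Several points are assumptions rather than part of the claims: the function and variable names are hypothetical; the "calculated value" is assumed to be the sequence length observed for each biomarker in the tissue sample, since the claim text does not define it further; and the thresholds "more than 2" and "fewer than 1" are applied literally, so a sample with 1 or 2 unstable biomarkers receives no label in this sketch.

```python
# Means and standard deviations per biomarker, as recited in step (4) of claim 1.
REFERENCE = {
    "BAT25": (123.9038462, 0.533564241),
    "BAT26": (179.0666667, 0.393122697),
    "MONO27": (171.5357143, 2.8620995),
    "NR21": (110.9642857, 0.631427984),
    "NR24": (133.8653846, 0.595039829),
}


def z_value(marker: str, calculated: float) -> float:
    """Formula (I): Z value = (calculated value - mean) / standard deviation."""
    mean, std_dev = REFERENCE[marker]
    return (calculated - mean) / std_dev


def is_unstable(marker: str, calculated: float) -> bool:
    """A biomarker is considered unstable if |Z value| >= 3, otherwise stable."""
    return abs(z_value(marker, calculated)) >= 3


def classify(calculated_lengths: dict):
    """Return 'MSI-H', 'MSS', or None for a sample.

    Assumption: thresholds are read literally from the claim
    (more than 2 unstable -> MSI-H, fewer than 1 unstable -> MSS).
    Counts of 1 or 2 are not labeled by the claim as worded, so None is returned.
    """
    unstable_count = sum(
        is_unstable(marker, length)
        for marker, length in calculated_lengths.items()
    )
    if unstable_count > 2:
        return "MSI-H"
    if unstable_count < 1:
        return "MSS"
    return None


if __name__ == "__main__":
    # Hypothetical observed lengths, chosen only to exercise the function.
    sample = {"BAT25": 118, "BAT26": 170, "MONO27": 172, "NR21": 105, "NR24": 134}
    print(classify(sample))
```

The observed lengths in the usage example are invented for illustration and are not data from the disclosure.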