WO2025085782A1

Movatterモバイル変換

Info

Publication number: WO2025085782A1
Application number: PCT/US2024/052024
Authority: WO
Inventors: Samuel Henry Sternberg; George Davis LAMPE
Original assignee: Columbia University in the City of New York
Current assignee: Columbia University in the City of New York
Priority date: 2023-10-20
Filing date: 2024-10-18
Publication date: 2025-04-24
Anticipated expiration: 2026-04-20

Abstract

The present disclosure provides Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated transposon (CAST) systems and components thereof which are fully or partially derived from orthogonal CAST systems. More particularly, the present disclosure provides engineered CAST systems comprising one or more Cas proteins selected from: Cas5, Cas6, Cas7, Cas8, Cas12, and combinations thereof; and one or more transposon-associated proteins selected from TnsA, TnsB, TnsC, TnsD, TniQ, and combinations thereof, wherein at least one of TnsB, TnsC, or TniQ is a chimeric protein comprising amino acid sequences derived from respective TnsB, TnsC, or TniQ proteins, or homolog thereof, of at least two CAST systems.

Description

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Table 2. Chimeric TniQ dimer sequences

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Table 3. Chimeric TnsC protein sequences Description Protein Sequence SEQ ID NO

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Table 4. Plasmid Sequences

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119

The scope of the present invention is not limited by what has been specifically shown and described hereinabove. Those skilled in the art will recognize that there are suitable alternatives to the depicted examples of materials, configurations, constructions, and dimensions. Variations, modifications, and other implementations of what is described herein will occur to those of ordinary skill in the art without departing from the spirit and scope of the invention. Numerous references, including patents and various publications, are cited and discussed in the description of this invention. The citation and discussion of such references is provided merely to clarify the description of the present invention and is not an admission that Attny Docket No. COLUM-42515.601 Client Ref No. CU24119 any reference is prior art to the invention described herein. All references cited and discussed in this specification are incorporated herein by reference in their entirety.

Claims

Attny Docket No. COLUM-42515.601 Client Ref No. CU24119 C^LAIMS What is claimed is: 1. A system comprising an engineered Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)- associated transposon (CAST) system or one or more nucleic acids encoding the engineered CAST system, wherein the engineered CAST system comprises: a) one or more Cas proteins selected from: Cas5, Cas6, Cas7, Cas8, Cas12, and combinations thereof; and b) one or more transposon-associated proteins selected from TnsA, TnsB, TnsC, TnsD, TniQ, and combinations thereof, wherein at least one of TnsB, TnsC, or TniQ is a chimeric protein comprising amino acid sequences derived from respective TnsB, TnsC, or TniQ proteins, or homolog thereof, of at least two CAST systems, and, optionally c) at least one guide RNA (gRNA) complementary to at least a portion of a target nucleic acid sequence, or at least one nucleic acid encoding thereof. 2. The system of claim 1, comprising a chimeric TnsB, wherein the chimeric TnsB has a C- terminal domain comprising an amino acid sequence derived from a TnsB protein, or homolog thereof, of a first CAST system and an N-terminal domain comprising an amino acid sequence derived from a TnsB protein, or homolog thereof, of a second at least one second CAST system. 3. The system of claim 2, wherein the C-terminal domain comprises the C-terminal hook and at least a portion of C-terminal linker region of TnsB. 4. The system of claim 2 or 3, further comprising: a TnsA protein derived from the second CAST system; a TnsC protein having an amino acid sequence fully or partially derived a TnsC protein or homolog thereof from the first CAST system; and/or a TniQ protein derived from the first CAST system. 5. The system of claim 4, wherein the TnsC protein is a chimeric protein comprising a C- terminal region having an amino acid sequence derived from the first CAST system. Attny Docket No. COLUM-42515.601 Client Ref No. CU24119 6. The system of claim 1, comprising a chimeric TnsC, wherein the chimeric TnsC has a C- terminal domain comprising an amino acid sequence derived from a TnsC protein, or homolog thereof, of a second CAST system and N-terminal domain derived from a TnsC protein, or homolog thereof, of a first CAST system. 7. The system of claim 6, further comprising: a TnsA and a TnsB protein derived from the second CAST system or a chimeric TnsB having a C-terminal domain comprising an amino acid sequence derived from a TnsB protein, or homolog thereof, of the second CAST system and an N-terminal domain comprising an amino acid sequence derived from a TnsB protein, or homolog thereof, from the first CAST system or a third CAST system, and a TnsA protein derived from same CAST system as the N-terminal domain of the TnsB protein; and/or a TniQ is derived from the CAST first system. 8. The system of claim 1, comprising a chimeric TniQ having an N-terminal domain comprising an amino acid sequence derived from a TniQ protein, or homolog thereof, of a second CAST system and C-terminal domain comprising an amino acid sequence derived from a TniQ protein, or homolog thereof, derived from the first CAST system. 9. The system of claim 8, wherein the chimeric TniQ is fused to a second TniQ derived from the first CAST system. 10. The system of claim 8 or 9, further comprising: a TnsC protein fully or partially derived from the second CAST system, and, optionally, a TnsA and TnsB protein derived from the second CAST system. 11. The system of claim 10, wherein the TnsC protein is a chimeric protein comprising a C- terminal region having an amino acid sequence derived from a TnsC protein, or homolog thereof, of a third CAST system, and the system further comprises: a TnsA and TnsB protein derived from the third CAST system; or Attny Docket No. COLUM-42515.601 Client Ref No. CU24119 a chimeric TnsB protein having a C-terminal domain comprising an amino acid sequence derived from a TnsB protein, or a homolog thereof, of the third CAST system, and a TnsA protein derived from same CAST system as the N-terminal domain of the chimeric TnsB protein. 12. The system of any of claims 2-11, wherein the one or more Cas proteins are derived from the first CAST system. 13. The system of any of claims 1-12, wherein TnsB is a chimeric protein comprising an amino acid sequence having at least 70% identity to any of SEQ ID NOs: 33-35 and 257-288; TnsC is a chimeric protein comprising an amino acid sequence having at least 70% identity to any of SEQ ID NOs: 48-53; and/or TniQ is a chimeric protein comprising comprises an amino acid sequence having at least 70% identity to any of SEQ ID NOs: 36-47. 14. A method for modifying a nucleic acid comprising contacting a target nucleic acid sequence with a system of any of claims 1-13 or one or more components thereof. 15. A cell comprising a system of any of claims 1-13 or one or more components thereof.