Movatterモバイル変換


[0]ホーム

URL:


WO2025168806A1 - Polypeptides having carbonic anhydrase activity and polynucleotides encoding same - Google Patents

Polypeptides having carbonic anhydrase activity and polynucleotides encoding same

Info

Publication number
WO2025168806A1
WO2025168806A1PCT/EP2025/053303EP2025053303WWO2025168806A1WO 2025168806 A1WO2025168806 A1WO 2025168806A1EP 2025053303 WEP2025053303 WEP 2025053303WWO 2025168806 A1WO2025168806 A1WO 2025168806A1
Authority
WO
WIPO (PCT)
Prior art keywords
seq
polypeptide
carbonic anhydrase
amino acids
correspondning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/EP2025/053303
Other languages
French (fr)
Inventor
Johanna Katarina Sofie TIEMANN
Esben Peter Friis
Lars Anderson
Stefanie NEUN
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Novozymes AS
Original Assignee
Novozymes AS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Novozymes ASfiledCriticalNovozymes AS
Publication of WO2025168806A1publicationCriticalpatent/WO2025168806A1/en
Pendinglegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Definitions

Landscapes

Abstract

The present invention relates to polypeptides having carbonic anhydrase activity and polynucleotides encoding the polypeptides. The invention also relates to nucleic acid constructs, vectors, and host cells comprising the polynucleotides as well as methods of producing and using the polypeptides.

Description

POLYPEPTIDES HAVING CARBONIC ANHYDRASE ACTIVITY AND POLYNUCLEOTIDES ENCODING SAME
Reference to a Sequence Listing
This application contains a Sequence Listing in computer readable form, which is incorporated herein by reference.
Background of the Invention Field of the Invention
The present invention relates to polypeptides having carbonic anhydrase activity, polynucleotides encoding the polypeptides, nucleic acid constructs, vectors, and host cells comprising the polynucleotides as well as methods of producing and using the polypeptides. The invention also relates to using the carbonic anhydrases in bioreactors for extracting carbon dioxide and compositions useful for such extraction processes.
Description of the Related Art
Carbon dioxide (CO2) emissions are a major contributor to the phenomenon of global warming. CO2 is a by-product of combustion and it creates operational, economic, and environmental problems. CO2 emissions may be controlled by capturing CO2 gas before emitted into the atmosphere. There are several chemical approaches to control the CO2 emissions (A. Kohl and R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997). However, many of these approaches have drawbacks such as high energy consumption, slow processes, and use of ecologically questionable or toxic compounds.
An enzyme-based approach using the capability of carbonic anhydrase to catalyse the conversion of CO2 to bicarbonate at a very high rate (turnover is up to 106 molecules of CO2 per second), overcomes the reaction rates and environmental issues in relation to CO2 capture. Technical solutions for extracting CO2 from gases, such as combustion gases or respiration gases, using carbonic anhydrases have been described in WO 2006/089423, US 6,524,842, WO 2004/007058, WO 2004/028667, US 2004/0029257, US 7,132,090, WO 2005/114417, US 6,143,556, WO 2004/104160, US 2005/0214936, WO 2008/095057. Generally, these techniques operate by bringing a soluble or immobilized carbonic anhydrase into contact with CO2 which either may be in a gas phase or a liquid phase. In the presence of water, carbonic anhydrase catalyses the conversion of CO2 into bicarbonate ions which may be further protonated or deprotonated to carbonic acid and/or carbonate ions depending on the pH of the medium. The ions may either be utilized to facilitate growth of algae or microorganisms that utilize bicarbonate/carbonate as a carbon source, to induce a pH change in a surrounding medium or supply buffering capacity, to provide bicarbonate/carbonate as an active agent for subsequent chemical processes, or precipitated as a carbonate salt, or converted back into pure CO2, which can then be used (for example in enhanced oil recovery, for production of urea, for food and beverage processing, or to supply CO2 to greenhouses or cultivation ponds), released (for example from a contained life support environment such as a submarine, spacecraft, or artificial lung), compressed (for example for transportation through pipelines), or stored (such as in geological or deep oceanic formations or saline aquifers).
Mammalian, plant and prokaryotic carbonic anhydrases (alpha- and beta-class CAs) generally function at physiological temperatures (37 °C) or lower temperatures. The temperature of combustion gasses or the liquids into which they are dissolved may, however, easily exceed the temperature optimum for the carbonic anhydrase used to capture the CO2. One of the drawbacks of using enzyme-based solutions is that extensive cooling may be needed in CO2 extraction processes prior to contacting the CO2-containing gas/liquid with the carbonic anhydrase, and cooling is an energy consuming process. Consequently, there is a need for more heat-stable carbonic anhydrases when the enzyme is to be used under industrially relevant conditions.
A carbonic anhydrase obtainable from Persephonella marina is described in WO2012/025577. A metagenomic carbonic anhydrase isolated from the Logatchev hydrothermal is described in WO2018/017792. Variants of these enzymes with improved thermostability are described in PCT/US2023/081794.
The present invention provides new carbonic anhydrase enzymes useful for CO2 capture.
Summary of the Invention
The present invention provides polypeptides having carbonic anhydrase activity and polynucleotides encoding the polypeptides.
Accordingly, the present invention relates to polypeptides having carbonic anhydrase activity selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of any one of SEQ ID NOs: 1 , 3, 5, 7, 9, 11 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31 , 33, 35, 37, 39, 41 , 43, 45, 47, 49, 51 , 53, 55, 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75, 78, 80, 82, and 84;
(c) a polypeptide derived from any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85 by having 1-30 alterations (e.g., substitutions, deletions and/or insertions at one or more positions, e.g., 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 or 10 or 11 or 12 or 13 or 14 or 15 or 16 or 17 or 18 or 19 or 20 or 21 or 22 or 23 or 24 or 25 or 26 or 27 or 28 or 29 or 30 alterations, in particular substitutions; (d) a polypeptide having a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1.0, to the three-dimensional structure of the polypeptide of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85, wherein the three-dimensional structure is calculated by Alphafold.
(e) a polypeptide derived from the polypeptide of (a), (b), (c) or (d) wherein the N- and/or C-terminal end has been extended by addition of one or more amino acids; and
(f) a fragment of the polypeptide of (a), (b), (c) or (d); wherein the polypeptide has carbonic anhydrase activity.
The present invention also relates to polynucleotides encoding the polypeptides of the present invention; nucleic acid constructs; recombinant expression vectors; recombinant host cells comprising the polynucleotides; and methods of producing the polypeptides.
The present invention also relates to methods of of using the carbonic anhydrases for extraction of carbon dioxide from a carbon dioxide-containing medium.
Brief Description of the Figures
Figure 1 shows a schematic diagram of a counter-current gas-liquid contactor. In a typical counter-current gas-liquid contactor, the CCh-rich inlet gas (e.g., a mixed gas comprising CO2) enters the bottom of the contactor and travels upwards, while the CCh-lean inlet liquid enters the top of the contactor and flows downwards. The liquid flow can be in the form of droplets, such as a spray, or as a continuous stream or sheet of liquid, such as flows along a surface, or a combination of these. Inside the gas-liquid contactor, CO2 from the CCh-rich inlet gas is absorbed by the liquid and carried out from the contactor as a CCh-rich outlet liquid. CCh-rich liquid means the liquid comprises an increased amount of CO2 in its dissolved gaseous form or any ionic form of CO2, such as bicarbonate, or a reaction product of CO2 and a compound in the inlet absorption liquid, relative to the CCh-lean liquid. CCh-lean gas, from which CO2 has been partially or completely removed, exits the contactor as the outlet gas. A counter-current gas-liquid contactor is the most common type of gas-liquid contactor used industrially. The internals of the contactor can be largely empty with liquid sprayed down from the top and gas flowing up from the bottom, or, more commonly, the contactor contains different types of packing materials to increase the residence time of gas and liquid inside the contactor and promote a large surface area of interaction between the gas and liquid. The liquid is typically delivered onto the packing materials from spray nozzles or other types of openings designed to deliver the liquid in a uniform fashion over the packing material. The liquid is comprised of absorption compounds, water, and other components that may be needed to optimize the process. For example, the liquid can comprise CA in soluble form or as small particles of suspended biocatalyst that flow through the contactor. The packing material may be coated with or have CA attached to it.
Figure 2 shows a schematic diagram of a co-current gas-liquid contactor. In a typical cocurrent gas-liquid contactor, the CCh-rich inlet gas and CCh-lean inlet liquid enter the contactor at the same end (e.g., the top) and exit at the same end. The outlet gas preferably exits the contactor at a location above any sump present for collection of the outlet liquid. This type of contactor can provide a lower pressure drop compared to a counter-current contactor, because the gas and liquid flows are both moving in the same direction, however the efficiency of gasliquid interaction may be lower compared to a counter-current design. The internals and function of the co-current contactor are as described for Figure 1 , with the exception that the gas and liquid are flowing in the same direction.
Figure 3 shows a schematic diagram of a perpendicular-flow gas-liquid contactor. In a perpendicular gas-liquid contactor, the gas typically travels in an overall horizontal direction from the inlet gas to the outlet gas while the liquid overall travels vertically, from inlet liquid at the top to outlet liquid at the bottom to take advantage of gravity. This contactor design can take advantage of specialized liquid delivery systems, such as those generating flat sheets of liquid, to create high gas-liquid contact in a compact design that does not require tall vertical structures. The contactor can have internal baffles or packing materials to enhance gas-liquid contact and control gas and liquid flows. The internals and function of the co-current contactor are as described for Figure 1 , with the exception that the overall gas and liquid flows are perpendicular to each other.
Figure 4 shows a schematic diagram of a membrane gas-liquid contactor. A membrane contactor utilizes a gas permeable membrane (dotted line) to separate the gas flow from the liquid flow. The diagram shows counter-current flow of gas and liquid, though co-current and perpendicular-type flows are also possible. The CCh-rich inlet gas comes in contact with the membrane and CO2, preferably selectively, passes across the membrane into the CO2 absorption liquid. Membranes used in these contactors can be microporous, allowing the surface of the liquid to be exposed to the gas through pores in the membrane that are small enough to prevent the liquid from passing through due to physical phenomena, such as surface tension. Alternatively, the membranes can be non-porous, yet made from CCh-gas permeable materials. Microporous membranes may provide faster CO2 absorption rates, while non-porous membranes may minimize liquid losses to evaporation in the gas stream. While passing through the contactor, the inlet liquid becomes enriched in CO2, such that the outlet liquid is CCh-rich by comparison, and the inlet gas becomes depleted in CO2, such that the outlet gas is CCh-lean. The diagram only shows a representation of the basic functional unit of a membrane contactor, which in operational form contains many layers of stacked membranes or bundles of tubular, or hollow-fiber, membranes arranged in suitable housing with dividers and control mechanisms to optimally direct the gas and liquid flows. Membrane contactors are used for large industrial gas scrubbing applications as well as for small units, such as for CO2 removal during dialysis in which the semi- permeable membrane may separate two liquids, such as CCh-rich blood and a CCh-lean buffer solution capable of absorbing excess CO2 from the blood. CA can be present in soluble or suspended particulate form in the liquid and can be immobilized on or in the membrane.
Figure 5 shows a schematic diagram of a bubble-tank gas-liquid contactor. The system shown is a batch-mode bubble-tank gas-liquid contactor, in which a stream of inlet gas comprising CO2 is bubbled (or sparged) through a fixed amount of absorption liquid. CO2 is absorbed into the liquid such that the outlet gas is depleted in CO2 compared to the inlet gas. In batch mode, eventually the liquid will reach a maximum CO2 absorption capacity, and the CCh-rich inlet gas can be directed to another batch reactor containing CCh-lean absorption liquid, or the batch reactor can be emptied and filled with fresh CCh-lean absorption liquid, or the inlet gas flow can be stopped while the batch reactor is changed from absorption mode to CO2 desorption mode, such as by applying heat, sweep gas or vacuum to the batch reactor, to release absorbed CO2 from the CCh-rich liquid. This type of contactor can, for example, be used for producing solid precipitated forms of CO2, such as carbonates, like calcium, magnesium, and manganese carbonate. Gas delivery nozzles that produce very small gas bubbles can enhance the gas-liquid contact and improve CO2 absorption efficiency. A bubble-tank contactor can be enclosed equipment or can operate in open environments, such as bubbling CCh-rich gas streams into algae ponds. CA can be present in soluble or suspended particulate form in the liquid and can be immobilized on the surfaces of structures or packing immersed in or exposed to the liquid.
Figure 6 shows a schematic diagram of a batch-mode stirred-tank gas-liquid contactor. In this type of contactor, the CCh-rich gas is exposed to the surface of a liquid leading to CO2 gas absorption into the liquid. The liquid may be quiescent or may be mixed by some means to cause movement of the liquid and liquid components. The principle of a stirred-tank contactor can apply to controlled enclosed equipment or can apply to open environments, such the absorption of CO2 from air into a body of water, like an ocean. CA can be present in soluble or suspended particulate form in the liquid and can be immobilized on the surfaces of structures, mixers or packing immersed in or exposed to the liquid.
Figure 7 shows a schematic diagram of an integrated CO2 scrubbing system. In the system shown, CCh-rich feed gas (1) enters near the bottom of the absorber (2) and flows upwards where it comes in contact with CCh-lean absorption liquid (3) that enters the absorber near the top. Scrubbed gas (4), from which CO2 has been removed, exits the absorber at the top. CC>2-rich absorption liquid (5) exits at the bottom of the absorber and (optionally) passes through a biocatalyst recovery unit (6) which separates the catalyst for recycling (7) and redelivery into the absorber along with the CCh-lean absorption liquid (3). The main amount of CCh-rich absorption liquid (5) exits the (optional) biocatalyst recovery unit (6) and travels to an optional temperature regulator (e.g., heat exchanger) (8) where the CO2-rich absorption liquid is pre- heated before traveling to the desorber (9). The CCh-rich absorption liquid enters near the top of the desorber and flows downwards. Heat is supplied to the desorber by any suitable means, e.g., a re-boiler, and optionally another desorption driving force such as a sweep gas (10) or vacuum (13), or a combination of these applied to the desorber causing extracted CO2 to be released from the absorption liquid and exit the desorber, (optionally) passing through an absorption liquid condenser (11) to remove absorption liquid vapor from the gas stream, and (optionally) passing through a sweep gas condenser (12) to remove sweep gas compounds from the CO2 gas stream prior to release, compression and/or use of the purified CO2 gas (14). Sweep gas compounds separated in the sweep gas condenser (12) are optionally recycled and fed back to the desorber along with the provision of fresh sweep gas (10). CO2- lean absorption liquid (15) exits the desorber, and (optionally) passes through a second biocatalyst recovery unit (16), which separates the biocatalyst for recycling (17) and redelivery into the desorber along with the CO2- rich absorption liquid (5), before (optionally) passing through a temperature regulator (8), and (optionally) passing through a secondary CO2 desorber (18) before returning to the absorber (3). Although the secondary CO2 desorber may function by any of the known means of desorption, a preferred mode of operation for the secondary CO2 desorber is as a secondary air sweep desorber utilizing a membrane-based design, in which CCh-lean sweep gas (22), such as air is contacted with the CCh-lean liquid (15) to further remove residual CO2 remaining in the CCh-lean liquid (15) and provide a very CCh-lean liquid (3) for re-entry to the absorber. The secondary sweep gas (23) exiting the secondary CO2 desorber (18) can be released to the atmosphere or can be used for a purpose, such as to supply air for combustion, e.g., when the CO2 scrubber is installed at a power plant. Depleted biocatalyst and/or other depleted components of the absorption liquid can be added at various points in the process, such as at the locations indicated (20 and 21). Removal of samples for process monitoring and control of liquid levels as well as removal of insoluble contaminants can be carried out at various points in the process, such as at the locations indicated (24 and 25). Although not depicted in the diagram, it is understood that pumps to provide and control liquid flow, blowers to provide and control gas flow, and all relevant valves, meters, instrumentation and equipment for process control and monitoring can be installed and utilized at the needed locations. CA can be present in soluble or suspended particulate form in the liquid as it flows through the system and can be immobilized on the surfaces of structures, mixers or packing material immersed in or exposed to the liquid.
Figure 8 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Endozoicomonas arenosclerae carbonic anhydrase of SEQ ID NO: 2 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 9 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Achromatium sp. carbonic anhydrase of SEQ ID NO: 4 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol. Figure 10 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Endozoicomonas numazuensis carbonic anhydrase of SEQ ID NO: 6 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 11 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Thiorhodococcus drewsii carbonic anhydrase of SEQ ID NO: 8 (bottom). Key residues described herein that may be important for activity are indicated with the
* symbol.
Figure 12 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 10 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 13 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 12 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 14 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Aquificales bacterium carbonic anhydrase of SEQ ID NO: 14 (bottom). Key residues described herein that may be important for activity are indicated with the
* symbol.
Figure 15 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 16 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 16 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 18 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 17 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 20 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 18 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 22 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 19 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 26 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 20 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 28 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol. Figure 21 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 30 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 22 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 32 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 23 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 34 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 24 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 36 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 25 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 38 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 26 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 40 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 27 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 42 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 28 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 44 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 29 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 46 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 30 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 48 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 31 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 50 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 32 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 52 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol. Figure 33 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 54 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 34 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 56 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 35 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 58 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 36 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 60 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 37 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 62 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 38 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 64 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 39 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 66 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 40 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 68 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 41 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 70 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 42 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Trichophaea saccata carbonic anhydrase of SEQ ID NO: 74 (bottom). Key residues described herein that may be important for activity are indicated with the
* symbol.
Figure 43 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Eleutherascus tuberculatus carbonic anhydrase of SEQ ID NO: 76 (bottom). Key residues described herein that may be important for activity are indicated with the
* symbol.
Figure 44 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the Lactobacillus sp. carbonic anhydrase of SEQ ID NO: 79 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol. Figure 45 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 81 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 46 is a sequence alignment of the Persephonella marina carbonic anhydrase of SEQ ID NO: 77 (top) with the carbonic anhydrase of SEQ ID NO: 83 (bottom). Key residues described herein that may be important for activity are indicated with the * symbol.
Figure 47 shows the normalized WAU activity of carbonic anhydrases relative to the Persophenella marina carbonic anhydrase as described in Example 3.
Figure 48 shows the normalized WAU activity of carbonic anhydrases relative to the Persophenella marina carbonic anhydrase as described in Example 4.
Definitions
In accordance with this detailed description, the following definitions apply. Note that the singular forms "a," "an," and "the" include plural references unless the context clearly dictates otherwise.
Unless defined otherwise or clearly indicated by context, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
Carbonic anhydrase: The term “carbonic anhydrase” means an enzyme of EC 4.2.1.1 having carbonic anhydrase activity.
Carbonic anhydrase Activity: The term “carbonic anhydrase activity” or “CA activity” is defined herein as an EC 4.2.1.1 activity which catalyzes the conversion between carbon dioxide and bicarbonate [CO2 + H2O HCOa' + H+], For purposes of the present invention, CA activity may be determined according to the procedure described in WQ2018/017792 (the content of which is incorporated herein by reference). One unit of CA activity is defined after Wilbur [1 U = (1/tc)-(1/tu) x 1000] where U is units and tc and tu represent the time in seconds for the catalyzed and uncatalyzed reaction, respectively (Wilbur, 1948, J. Biol. Chem. 176: 147-154). cDNA: The term "cDNA" means a DNA molecule that can be prepared by reverse transcription from a mature, spliced, mRNA molecule obtained from a eukaryotic or prokaryotic cell. cDNA lacks intron sequences that may be present in the corresponding genomic DNA. The initial, primary RNA transcript is a precursor to mRNA that is processed through a series of steps, including splicing, before appearing as mature spliced mRNA.
CO2-lean/CO2-rich: The terms “CO2-lean” and “CO2-rich” carrier liquid are terms used in the present invention to describe the relative amount of carbon (e.g., in the form of dissolved CO2, chemically reacted CO2, bicarbonate, carbonic acid and/or carbonate salt) present in the carrier liquid as it circulates through the process. As used herein, the term “CO2-lean carrier liquid” generally refers to carrier liquid entering an absorption module. The term “CO2-rich carrier liquid” generally refers to a carrier liquid entering a desorption module. It is understood that the term “CC>2-lean carrier liquid” can also be applied to carrier liquid exiting a desorption module, and the term “CCh-rich carrier liquid” can also be applied to carrier liquid exiting an absorption module. CC>2-rich carrier liquid contains more carbon compared to CCh-lean carrier liquid within a system at a given point in time.
CO2-containing medium: The term “CCh-containing medium” is used to describe any material which contains at least 0.001% CO2, preferably at least 0.01%, more preferably at least 0.1 %, more preferably at least 1 %, more preferably at least 5%, most preferably 10%, even more preferred at least 20%, and even most preferably at least 50% CO2. Preferably the CO2- containing medium has a temperature between 5 °C and 110 °C, more preferably between 10 °C and 100 °C, more preferably between 20 °C and 95 °C, more preferably between 30 °C and 90 °C, more preferably between 40 °C and 85 °C, more preferably between 50 °C and 80 °C, more preferably between 55 °C and 75 °C and most preferably between 60 °C and 70 °C at any pressure. CCh-containing media are in particular gaseous phases (including gas mixtures), liquids or multiphase mixtures, but may also be solid. A CCh-containing gaseous phase is for example raw natural gas obtainable from oil wells, gas wells, and condensate wells, syngas generated by the gasification of a carbon containing fuel (e.g., methane) to a gaseous product comprising CO and H2, or emission streams from combustion processes, e.g., from carbon based electric generation power plants, or from flue gas stacks from such plants, industrial furnaces, stoves, ovens, or fireplaces or from airplane or car exhausts. A CO2-containing gaseous phase may alternatively be ambient air (including hot (above 40°C) air, e.g., desert air), or from respiratory processes in mammals (such as the CCh-containing gas phase in an artificial lung), living plants and other CO2 emitting species, in particular from green-houses. A CCh-containing gas phase may also be off-gas, from aerobic or anaerobic fermentation, such as brewing, fermentation to produce useful products such as ethanol, or the production of biogas. Such fermentation processes can occur at elevated temperatures if they are facilitated by thermophilic microorganisms, which are for example encountered in the production of biogas. A CO2- containing gaseous phase may alternatively be a gaseous phase enriched in CO2 for the purpose of use or storage. The above-described gaseous phases may also occur as multiphase mixtures, where the gas co-exists with a certain degree of fluids (e.g., water or other solvents) and/or solid materials (e.g., ash or other particles). CCh-containing liquids are any solution or fluid, in particular aqueous liquids, containing measurable amounts of CO2, preferably at one of the levels mentioned above at any pressure. CCh-containing liquids may be obtained by passing a CO2- containing gas or solid (e.g., dry ice or soluble carbonate containing salt) into the liquid. CO2- containing fluids may also be compressed CO2 liquid (that contains contaminants, such as drycleaning fluid), supercritical CO2, or CC>2 solvent liquids, like ionic liquids. A CCh-containing liquid may also be referred to as a “carrier liquid”. A CCh-containing liquid may also include compounds capable of improving the CCh-containing capacity of the liquid, such as HCO3' (KHCO3 or NaHCO3), CO32’ (Na2CO3 or K2CO3), HPO42’ (K2HPO4 or Na2HPO4) or MDEA or Tris. C02 extraction: The term “CO2 extraction” is to be understood as a reduction of carbon from a CCh-containing medium. Such an extraction may be performed from one medium to another, e.g., gas to liquid, liquid to gas, gas to liquid to gas, liquid to liquid or liquid to solid, but the extraction may also be the conversion of CO2 to bicarbonate, carbonate or carbonic acid within the same medium or the conversion of bicarbonate to CO2 within the same medium. The term CO2 capture is also used to indicate extraction of CO2 from one medium to another or conversion of CO2 to bicarbonate/ carbonate or conversion of bicarbonate/carbonate to CO2.
Coding sequence: The term “coding sequence” means a polynucleotide, which directly specifies the amino acid sequence of a polypeptide. The boundaries of the coding sequence are generally determined by an open reading frame, which begins with a start codon, such as ATG, GTG, or TTG, and ends with a stop codon, such as TAA, TAG, or TGA. The coding sequence may be a genomic DNA, cDNA, synthetic DNA, or a combination thereof.
Control sequences: The term “control sequences” means nucleic acid sequences involved in regulation of expression of a polynucleotide in a specific organism or in vitro. Each control sequence may be native (/.e., from the same gene) or heterologous (/.e., from a different gene) to the polynucleotide encoding the polypeptide, and native or heterologous to each other. Such control sequences include, but are not limited to leader, polyadenylation, prepropeptide, propeptide, signal peptide, promoter, terminator, enhancer, and transcription or translation initiator and terminator sequences. At a minimum, the control sequences include a promoter, and transcriptional and translational stop signals. The control sequences may be provided with linkers for the purpose of introducing specific restriction sites facilitating ligation of the control sequences with the coding region of the polynucleotide encoding a polypeptide.
Expression: The term “expression” means any step involved in the production of a polypeptide including, but not limited to, transcription, post-transcriptional modification, translation, post-translational modification, and secretion.
Expression vector: An "expression vector" refers to a linear or circular DNA construct comprising a DNA sequence encoding a polypeptide, which coding sequence is operably linked to a suitable control sequence capable of effecting expression of the DNA in a suitable host. Such control sequences may include a promoter to effect transcription, an optional operator sequence to control transcription, a sequence encoding suitable ribosome binding sites on the mRNA, enhancers and sequences which control termination of transcription and translation.
Extension: The term “extension” means an addition of one or more amino acids to the amino and/or carboxyl terminus of a polypeptide, wherein the “extended” polypeptide has carbonic anhydrase activity.
Fragment: The term “fragment” means a polypeptide having one or more amino acids absent from the amino and/or carboxyl terminus of the mature polypeptide, wherein the fragment has carbonic anhydrase activity. Fusion polypeptide: The term “fusion polypeptide” is a polypeptide in which one polypeptide is fused at the N-terminus and/or the C-terminus of a polypeptide of the present invention. A fusion polypeptide is produced by fusing a polynucleotide encoding another polypeptide to a polynucleotide of the present invention, or by fusing two or more polynucleotides of the present invention together. Techniques for producing fusion polypeptides are known in the art, and include ligating the coding sequences encoding the polypeptides so that they are in frame and that expression of the fusion polypeptide is under control of the same promoter(s) and terminator. Fusion polypeptides may also be constructed using intein technology in which fusion polypeptides are created post-translationally (Cooper et al., 1993, EMBO J. 12: 2575-2583; Dawson et al., 1994, Science 266: 776-779). A fusion polypeptide can further comprise a cleavage site between the two polypeptides. Upon secretion of the fusion protein, the site is cleaved releasing the two polypeptides. Examples of cleavage sites include, but are not limited to, the sites disclosed in Martin et al., 2003, J. Ind. Microbiol. Biotechnol. 3: 568-576; Svetina et al., 2000, J. Biotechnol. 7Q: 245-251 ; Rasmussen-Wilson et al., 1997, Appl. Environ. Microbiol. 63: 3488-3493; Ward et al., 1995, Biotechnology 13: 498-503; and Contreras et al., 1991 , Biotechnology 9: 378-381 ; Eaton et al., 1986, Biochemistry 25: 505-512; Collins-Racie et al., 1995, Biotechnology 13: 982-987; Carter eta/., 1989, Proteins: Structure, Function, and Genetics 6: 240-248; and Stevens, 2003, Drug Discovery World 4: 35-48.
Heat-stable: The term “heat-stable” or “thermostable” as used in reference to a carbonic anhydrase indicates that the enzyme is functional or active (/.e., can perform catalysis) at an elevated temperature, i.e., above 45°C, preferably above 50°C, more preferably above 55°C, more preferably above 60°C, even more preferably above 65°C, most preferably above 70°C, most preferably above 75°C, most preferably above 80°C, most preferably above 85°C most preferably above 90°C, and even most preferably above 100°C. In a preferred embodiment the carbonic anhydrase displays optimum activity at one of the temperatures indicated above, i.e., the enzyme’s temperature optimum is at one of the temperatures indicated above. The temperature stability of the carbonic anhydrase can be increased to some extent by way of formulation, e.g., by combination with stabilizing chemicals or by immobilization of the enzyme or by chemical modification, e.g., cross-linking, to preserve the enzyme in its active three- dimensional shape. In order for an enzyme to be considered as heat-stable it remains active after at least 15 minutes, preferably for at least 2 hours, more preferably for at least 24 hours, more preferably for at least 7 days, more preferably for at least 10 days, even more preferably for at least 14 days, most preferably for at least 30 days, even most preferably for at least 50 days at the elevated temperature. The level of activity may be measured using an assay described in WO2018/017792, e.g., after incubation for the given time in 1 M NaHCCh buffer at pH 8 at the given elevated temperature. The activity may be compared with the enzyme activity prior to the temperature elevation, thereby obtaining the residual activity of the enzyme after the heat treatment. Preferably, the residual activity is at least 30% after the given time at the elevated temperature, more preferably at least 40%, more preferably at least 50%, more at least 60%, even more preferably at least 70%, most preferably at least 80%, even most preferably the residual activity is at least 90%, and absolutely most preferred the level of residual activity is at least equal to or unchanged after the given time at the elevated temperature.
Heterologous: The term "heterologous" means, with respect to a host cell, that a polypeptide or nucleic acid does not naturally occur in the host cell. The term "heterologous" means, with respect to a polypeptide or nucleic acid, that a control sequence, e.g., promoter, of a polypeptide or nucleic acid is not naturally associated with the polypeptide or nucleic acid, i.e., the control sequence is from a gene other than the gene encoding the mature polypeptide.
Host Strain or Host Cell: A "host strain" or "host cell" is an organism into which an expression vector, phage, virus, or other DNA construct, including a polynucleotide encoding a polypeptide of interest (e.g., an amylase) has been introduced. Exemplary host strains are microorganism cells (e.g., bacteria, filamentous fungi, and yeast) capable of expressing the polypeptide of interest and/or fermenting saccharides. The term "host cell" includes protoplasts created from cells.
Introduced: The term "introduced" in the context of inserting a nucleic acid sequence into a cell, means "transfection", "transformation" or "transduction," as known in the art.
Isolated: The term “isolated” means a polypeptide, nucleic acid, cell, or other specified material or component that has been separated from at least one other material or component, including but not limited to, other proteins, nucleic acids, cells, etc. An isolated polypeptide, nucleic acid, cell or other material is thus in a form that does not occur in nature. An isolated polypeptide includes, but is not limited to, a culture broth containing the secreted polypeptide expressed in a host cell.
Mature polypeptide: The term “mature polypeptide” means a polypeptide in its mature form following N-terminal processing and/or C-terminal processing (e.g., removal of signal peptide).
Mature polypeptide coding sequence: The term “mature polypeptide coding sequence” means a polynucleotide that encodes a mature polypeptide having carbonic anhydrase activity.
Native: The term "native" means a nucleic acid or polypeptide naturally occurring in a host cell.
Nucleic acid: The term "nucleic acid" encompasses DNA, RNA, heteroduplexes, and synthetic molecules capable of encoding a polypeptide. Nucleic acids may be single stranded or double stranded and may be chemical modifications. The terms "nucleic acid" and "polynucleotide" are used interchangeably. Because the genetic code is degenerate, more than one codon may be used to encode a particular amino acid, and the present compositions and methods encompass nucleotide sequences that encode a particular amino acid sequence. Unless otherwise indicated, nucleic acid sequences are presented in 5'-to-3' orientation. Nucleic acid construct: The term "nucleic acid construct" means a nucleic acid molecule, either single- or double-stranded, which is isolated from a naturally occurring gene or is modified to contain segments of nucleic acids in a manner that would not otherwise exist in nature, or which is synthetic, and which comprises one or more control sequences operably linked to the nucleic acid sequence.
Operably linked: The term "operably linked" means that specified components are in a relationship (including but not limited to juxtaposition) permitting them to function in an intended manner. For example, a regulatory sequence is operably linked to a coding sequence such that expression of the coding sequence is under control of the regulatory sequence.
Purified: The term “purified” means a nucleic acid, polypeptide or cell that is substantially free from other components as determined by analytical techniques well known in the art (e.g., a purified polypeptide or nucleic acid may form a discrete band in an electrophoretic gel, chromatographic eluate, and/or a media subjected to density gradient centrifugation). A purified nucleic acid or polypeptide is at least about 50% pure, usually at least about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, about 99.5%, about 99.6%, about 99.7%, about 99.8% or more pure (e.g., percent by weight or on a molar basis). In a related sense, a composition is enriched for a molecule when there is a substantial increase in the concentration of the molecule after application of a purification or enrichment technique. The term "enriched" refers to a compound, polypeptide, cell, nucleic acid, amino acid, or other specified material or component that is present in a composition at a relative or absolute concentration that is higher than a starting composition.
In one aspect, the term "purified" as used herein refers to the polypeptide or cell being essentially free from components (especially insoluble components) from the production organism. In other aspects, the term "purified" refers to the polypeptide being essentially free of insoluble components (especially insoluble components) from the native organism from which it is obtained. In one aspect, the polypeptide is separated from some of the soluble components of the organism and culture medium from which it is recovered. The polypeptide may be purified (/.e., separated) by one or more of the unit operations filtration, precipitation, or chromatography.
Accordingly, the polypeptide may be purified such that only minor amounts of other proteins, in particular, other polypeptides, are present. The term "purified" as used herein may refer to removal of other components, particularly other proteins and most particularly other enzymes present in the cell of origin of the polypeptide. The polypeptide may be "substantially pure", i.e., free from other components from the organism in which it is produced, e.g., a host organism for recombinantly produced polypeptide. In one aspect, the polypeptide is at least 40% pure by weight of the total polypeptide material present in the preparation. In one aspect, the polypeptide is at least 50%, 60%, 70%, 80% or 90% pure by weight of the total polypeptide material present in the preparation. As used herein, a "substantially pure polypeptide" may denote a polypeptide preparation that contains at most 10%, preferably at most 8%, more preferably at most 6%, more preferably at most 5%, more preferably at most 4%, more preferably at most 3%, even more preferably at most 2%, most preferably at most 1%, and even most preferably at most 0.5% by weight of other polypeptide material with which the polypeptide is natively or recombinantly associated.
It is, therefore, preferred that the substantially pure polypeptide is at least 92% pure, preferably at least 94% pure, more preferably at least 95% pure, more preferably at least 96% pure, more preferably at least 97% pure, more preferably at least 98% pure, even more preferably at least 99% pure, most preferably at least 99.5% pure by weight of the total polypeptide material present in the preparation. The polypeptide of the present invention is preferably in a substantially pure form (/.e., the preparation is essentially free of other polypeptide material with which it is natively or recombinantly associated). This can be accomplished, for example by preparing the polypeptide by well-known recombinant methods or by classical purification methods.
Recombinant: The term "recombinant" is used in its conventional meaning to refer to the manipulation, e.g., cutting and rejoining, of nucleic acid sequences to form constellations different from those found in nature. The term recombinant refers to a cell, nucleic acid, polypeptide or vector that has been modified from its native state. Thus, for example, recombinant cells express genes that are not found within the native (non-recombinant) form of the cell, or express native genes at different levels or under different conditions than found in nature. The term “recombinant” is synonymous with “genetically modified” and “transgenic”.
Recover: The terms "recover" or “recovery” means the removal of a polypeptide from at least one fermentation broth component selected from the list of a cell, a nucleic acid, or other specified material, e.g., recovery of the polypeptide from the whole fermentation broth, or from the cell-free fermentation broth, by polypeptide crystal harvest, by filtration, e.g. depth filtration (by use of filter aids or packed filter medias, cloth filtration in chamber filters, rotary-drum filtration, drum filtration, rotary vacuum-drum filters, candle filters, horizontal leaf filters or similar, using sheed or pad filtration in framed or modular setups) or membrane filtration (using sheet filtration, module filtration, candle filtration, microfiltration, ultrafiltration in either cross flow, dynamic cross flow or dead end operation), or by centrifugation (using decanter centrifuges, disc stack centrifuges, hyrdo cyclones or similar), or by precipitating the polypeptide and using relevant solid-liquid separation methods to harvest the polypeptide from the broth media by use of classification separation by particle sizes. Recovery encompasses isolation and/or purification of the polypeptide.
Secreted: The term “secreted” as used herein is to be understood as a polypeptide which after expression in a cell is either transported to and released to the surrounding extracellular medium or is associated/embedded in the cellular membrane so that at least a part of the polypeptide is exposed to the surrounding extracellular medium. Sequence identity: The relatedness between two amino acid sequences or between two nucleotide sequences is described by the parameter “sequence identity”.
For purposes of the present invention, the sequence identity between two amino acid sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, J. Mol. Biol. 48: 443-453) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, Trends Genet. 16: 276-277), preferably version 6.6.0 or later. The parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EBLOSUM62 (EMBOSS version of BLOSUM62) substitution matrix. In order for the Needle program to report the longest identity, the -nobrief option must be specified in the command line. The output of Needle labeled “longest identity” is calculated as follows:
(Identical Residues x 100)/(Length of Alignment - Total Number of Gaps in Alignment)
For purposes of the present invention, the sequence identity between two polynucleotide sequences is determined as the output of “longest identity” using the Needleman-Wunsch algorithm (Needleman and Wunsch, 1970, supra) as implemented in the Needle program of the EMBOSS package (EMBOSS: The European Molecular Biology Open Software Suite, Rice et al., 2000, supra), preferably version 6.6.0 or later. The parameters used are a gap open penalty of 10, a gap extension penalty of 0.5, and the EDNAFULL (EMBOSS version of NCBI NLIC4.4) substitution matrix. In order for the Needle program to report the longest identity, the nobrief option must be specified in the command line. The output of Needle labeled “longest identity” is calculated as follows:
(Identical Deoxyribonucleotides x 100)/(Length of Alignment - Total Number of Gaps in Alignment)
Signal Peptide: A "signal peptide" is a sequence of amino acids attached to the N- terminal portion of a protein, which facilitates the secretion of the protein outside the cell. The mature form of an extracellular protein lacks the signal peptide, which is cleaved off during the secretion process.
Subsequence: The term “subsequence” means a polynucleotide having one or more nucleotides absent from the 5' and/or 3' end of a mature polypeptide coding sequence; wherein the subsequence encodes a fragment having carbonic anhydrase activity.
Syngas: The term “syngas” or “synthesis gas” is used to describe a gas mixture that contains varying amounts of carbon monoxide and hydrogen generated by the gasification of a carbon containing fuel (e.g., methane or natural gas) to a gaseous product with a heating value. CO2 is produced in the syngas reaction and must be removed to increase the heating value.
Thermophilic: The term “thermophilic” in relation to an organism, describes an organism which thrives at relatively high temperatures, i.e., above 45 °C. Hyperthermophilic organisms thrive in extremely hot environments, that is, hotter than around 60 °C with an optimal temperature above 80 °C. Variant: The term “variant” means a polypeptide having carbonic anhydrase activity comprising a man-made mutation, i.e., a substitution, insertion (including extension), and/or deletion (e.g., truncation), at one or more positions. A substitution means replacement of the amino acid occupying a position with a different amino acid; a deletion means removal of the amino acid occupying a position; and an insertion means adding 1-5 amino acids (e.g., 1-3 amino acids, in particular, 1 amino acid) adjacent to and immediately following the amino acid occupying a position.
Wild-type: The term "wild-type" in reference to an amino acid sequence or nucleic acid sequence means that the amino acid sequence or nucleic acid sequence is a native or naturally- occurring sequence. As used herein, the term "naturally-occurring" refers to anything (e.g., proteins, amino acids, or nucleic acid sequences) that is found in nature. Conversely, the term "non-naturally occurring" refers to anything that is not found in nature (e.g., recombinant nucleic acids and protein sequences produced in the laboratory or modification of the wild-type sequence).
Detailed Description of the Invention
Polypeptides Having Carbonic Anhydrase Activity
The present invention relates to polypeptides having carbonic anhydrase activity. In one aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 2;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 1 ;
(c) a polypeptide derived from or SEQ ID NO: 2 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 2.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 2. The polypeptide of any of the embodiments above related to SEQ ID NO: 2 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 2.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 1.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 1.
In another embodiment, the polypeptide is derived from SEQ ID NO: 2 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 2 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 2 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 4;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 3;
(c) a polypeptide derived from or SEQ ID NO: 4 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 4.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 4.
The polypeptide of any of the embodiments above related to SEQ ID NO: 4 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids. In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 4.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 3.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 3.
In another embodiment, the polypeptide is derived from SEQ ID NO: 4 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 4 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 4 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 6;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 5;
(c) a polypeptide derived from or SEQ ID NO: 6 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 6.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 6.
The polypeptide of any of the embodiments above related to SEQ ID NO: 6 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 6. In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 5.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 5.
In another embodiment, the polypeptide is derived from SEQ ID NO: 6 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 6 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 6 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 8;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 7;
(c) a polypeptide derived from or SEQ ID NO: 8 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81 %, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 8.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 8.
The polypeptide of any of the embodiments above related to SEQ ID NO: 8 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 8.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 7.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 7.
In another embodiment, the polypeptide is derived from SEQ ID NO: 8 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 8 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 8 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 10;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 9;
(c) a polypeptide derived from or SEQ ID NO: 10 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 10.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 10.
The polypeptide of any of the embodiments above related to SEQ ID NO: 10 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ I D NO: 10.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 9.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 9.
In another embodiment, the polypeptide is derived from SEQ ID NO: 10 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 10 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 10 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 12;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ I D NO: 11 ;
(c) a polypeptide derived from or SEQ ID NO: 12 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 12.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 12.
The polypeptide of any of the embodiments above related to SEQ ID NO: 12 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 12.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 11. The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 11 .
In another embodiment, the polypeptide is derived from SEQ ID NO: 12 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 12 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 12 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 14;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ I D NO: 13;
(c) a polypeptide derived from or SEQ ID NO: 14 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 14.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 14.
The polypeptide of any of the embodiments above related to SEQ ID NO: 14 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 14.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 13.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 13. In another embodiment, the polypeptide is derived from SEQ ID NO: 14 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 14 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 14 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 16;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ I D NO: 15;
(c) a polypeptide derived from or SEQ ID NO: 16 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 16.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 16.
The polypeptide of any of the embodiments above related to SEQ ID NO: 16 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 16.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 15.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 15.
In another embodiment, the polypeptide is derived from SEQ ID NO: 16 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 16 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 16 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 18;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 17;
(c) a polypeptide derived from or SEQ ID NO: 18 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 18.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 18.
The polypeptide of any of the embodiments above related to SEQ ID NO: 18 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 18.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 17.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 17.
In another embodiment, the polypeptide is derived from SEQ ID NO: 18 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 18 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 18 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 20;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 19;
(c) a polypeptide derived from or SEQ ID NO: 20 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 20.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 20.
The polypeptide of any of the embodiments above related to SEQ ID NO: 20 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 20.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 19.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 19.
In another embodiment, the polypeptide is derived from SEQ ID NO: 20 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 20 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 20 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15. In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 22;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 21 ;
(c) a polypeptide derived from or SEQ ID NO: 22 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 22.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 22.
The polypeptide of any of the embodiments above related to SEQ ID NO: 22 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 22.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 21.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 21 .
In another embodiment, the polypeptide is derived from SEQ ID NO: 22 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 22 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 22 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of: (a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 24;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 23;
(c) a polypeptide derived from or SEQ ID NO: 24 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 24.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 24.
The polypeptide of any of the embodiments above related to SEQ ID NO: 24 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 24.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 23.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 23.
In another embodiment, the polypeptide is derived from SEQ ID NO: 24 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 24 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 24 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 26; (b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 25;
(c) a polypeptide derived from or SEQ ID NO: 26 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 26.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 26.
The polypeptide of any of the embodiments above related to SEQ ID NO: 26 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 26.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 25.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 25.
In another embodiment, the polypeptide is derived from SEQ ID NO: 26 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 26 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 26 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 28;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 27; (c) a polypeptide derived from or SEQ ID NO: 28 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 28.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 28.
The polypeptide of any of the embodiments above related to SEQ ID NO: 28 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 28.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 27.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 27.
In another embodiment, the polypeptide is derived from SEQ ID NO: 28 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 28 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 28 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 30;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 29;
(c) a polypeptide derived from or SEQ ID NO: 30 by substitution, deletion or addition of one or several amino acids; (d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81 %, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 30.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 30.
The polypeptide of any of the embodiments above related to SEQ ID NO: 30 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 30.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 29.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 29.
In another embodiment, the polypeptide is derived from SEQ ID NO: 30 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 30 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 30 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 32;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 31 ;
(c) a polypeptide derived from or SEQ ID NO: 32 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and (e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 32.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 32.
The polypeptide of any of the embodiments above related to SEQ ID NO: 32 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 32.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 31.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 31 .
In another embodiment, the polypeptide is derived from SEQ ID NO: 32 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 32 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 32 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 34;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 33;
(c) a polypeptide derived from or SEQ ID NO: 34 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity. In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81 %, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 34.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 34.
The polypeptide of any of the embodiments above related to SEQ ID NO: 34 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 34.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 33.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 33.
In another embodiment, the polypeptide is derived from SEQ ID NO: 34 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 34 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 34 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 36;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 35;
(c) a polypeptide derived from or SEQ ID NO: 36 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 36.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 36.
The polypeptide of any of the embodiments above related to SEQ ID NO: 36 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 36.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 35.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 35.
In another embodiment, the polypeptide is derived from SEQ ID NO: 36 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 36 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 36 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 38;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 37;
(c) a polypeptide derived from or SEQ ID NO: 38 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 38.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 38.
The polypeptide of any of the embodiments above related to SEQ ID NO: 38 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 38.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 37.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 37.
In another embodiment, the polypeptide is derived from SEQ ID NO: 38 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 38 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 38 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 40;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 39;
(c) a polypeptide derived from or SEQ ID NO: 40 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 40. In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 40.
The polypeptide of any of the embodiments above related to SEQ ID NO: 40 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 40.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 39.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 39.
In another embodiment, the polypeptide is derived from SEQ ID NO: 40 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 40 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 40 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 42;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 41 ;
(c) a polypeptide derived from or SEQ ID NO: 42 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 42.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 42. The polypeptide of any of the embodiments above related to SEQ ID NO: 42 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 42.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 41.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 41 .
In another embodiment, the polypeptide is derived from SEQ ID NO: 42 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 42 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 42 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 44;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 43;
(c) a polypeptide derived from or SEQ ID NO: 44 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81 %, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 44.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 44.
The polypeptide of any of the embodiments above related to SEQ ID NO: 44 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids. In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 44.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 43.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 43.
In another embodiment, the polypeptide is derived from SEQ ID NO: 44 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 44 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 44 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 46;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 45;
(c) a polypeptide derived from or SEQ ID NO: 46 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 46.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 46.
The polypeptide of any of the embodiments above related to SEQ ID NO: 46 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 46. In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 45.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 45.
In another embodiment, the polypeptide is derived from SEQ ID NO: 46 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 46 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 46 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 48;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 47;
(c) a polypeptide derived from or SEQ ID NO: 48 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 48.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 48.
The polypeptide of any of the embodiments above related to SEQ ID NO: 48 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 48.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 47.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 47.
In another embodiment, the polypeptide is derived from SEQ ID NO: 48 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 48 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 48 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 50;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 49;
(c) a polypeptide derived from or SEQ ID NO: 50 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81 %, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 50.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 50.
The polypeptide of any of the embodiments above related to SEQ ID NO: 50 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 50.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 49.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 49.
In another embodiment, the polypeptide is derived from SEQ ID NO: 50 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 50 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 50 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 52;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 51 ;
(c) a polypeptide derived from or SEQ ID NO: 52 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 52.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 52.
The polypeptide of any of the embodiments above related to SEQ ID NO: 52 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 52.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 51. The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 51 .
In another embodiment, the polypeptide is derived from SEQ ID NO: 52 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 52 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 52 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 54;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 53;
(c) a polypeptide derived from or SEQ ID NO: 54 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 54.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 54.
The polypeptide of any of the embodiments above related to SEQ ID NO: 54 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 54.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 53.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 53. In another embodiment, the polypeptide is derived from SEQ ID NO: 54 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 54 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 54 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 56;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 55;
(c) a polypeptide derived from or SEQ ID NO: 56 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 56.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 56.
The polypeptide of any of the embodiments above related to SEQ ID NO: 56 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 56.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 55.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 55.
In another embodiment, the polypeptide is derived from SEQ ID NO: 56 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 56 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 56 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 58;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 57;
(c) a polypeptide derived from or SEQ ID NO: 58 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 58.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 58.
The polypeptide of any of the embodiments above related to SEQ ID NO: 58 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 58.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 57.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 57.
In another embodiment, the polypeptide is derived from SEQ ID NO: 58 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 58 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 58 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 60;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 59;
(c) a polypeptide derived from or SEQ ID NO: 60 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 60.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 60.
The polypeptide of any of the embodiments above related to SEQ ID NO: 60 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 60.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 59.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 59.
In another embodiment, the polypeptide is derived from SEQ ID NO: 60 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 60 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 60 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15. In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 62;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 61 ;
(c) a polypeptide derived from or SEQ ID NO: 62 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 62.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 62.
The polypeptide of any of the embodiments above related to SEQ ID NO: 62 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 62.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 61.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 61.
In another embodiment, the polypeptide is derived from SEQ ID NO: 62 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 62 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 62 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of: (a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 64;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 63;
(c) a polypeptide derived from or SEQ ID NO: 64 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 64.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 64.
The polypeptide of any of the embodiments above related to SEQ ID NO: 64 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 64.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 63.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 63.
In another embodiment, the polypeptide is derived from SEQ ID NO: 64 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 64 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 64 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 66; (b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 65;
(c) a polypeptide derived from or SEQ ID NO: 66 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 66.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 66.
The polypeptide of any of the embodiments above related to SEQ ID NO: 66 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 66.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 65.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 65.
In another embodiment, the polypeptide is derived from SEQ ID NO: 66 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 66 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 66 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 68;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 67; (c) a polypeptide derived from or SEQ ID NO: 68 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 68.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 68.
The polypeptide of any of the embodiments above related to SEQ ID NO: 68 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 68.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 67.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 67.
In another embodiment, the polypeptide is derived from SEQ ID NO: 68 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 68 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 68 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 70;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 69;
(c) a polypeptide derived from or SEQ ID NO: 70 by substitution, deletion or addition of one or several amino acids; (d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 70.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 70.
The polypeptide of any of the embodiments above related to SEQ ID NO: 70 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 70.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 69.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 69.
In another embodiment, the polypeptide is derived from SEQ ID NO: 70 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 70 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 70 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 72;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ I D NO: 71 ;
(c) a polypeptide derived from or SEQ ID NO: 72 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and (e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 72.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 72.
The polypeptide of any of the embodiments above related to SEQ ID NO: 72 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 72.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 71.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 71 .
In another embodiment, the polypeptide is derived from SEQ ID NO: 72 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 72 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 72 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 74;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 73;
(c) a polypeptide derived from or SEQ ID NO: 74 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity. In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 74.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 74.
The polypeptide of any of the embodiments above related to SEQ ID NO: 74 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 74.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 73.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 73.
In another embodiment, the polypeptide is derived from SEQ ID NO: 74 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 74 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 74 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 76;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 75;
(c) a polypeptide derived from or SEQ ID NO: 76 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 76.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 76.
The polypeptide of any of the embodiments above related to SEQ ID NO: 76 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 76.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ ID NO: 75.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 75.
In another embodiment, the polypeptide is derived from SEQ ID NO: 76 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 76 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 76 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 79;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 78;
(c) a polypeptide derived from or SEQ ID NO: 79 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to SEQ ID NO: 79.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 79.
The polypeptide of any of the embodiments above related to SEQ ID NO: 79 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 79.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 78.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 78.
In another embodiment, the polypeptide is derived from SEQ ID NO: 79 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 79 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 79 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 81 ;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 80;
(c) a polypeptide derived from or SEQ ID NO: 81 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 81. In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 81 .
The polypeptide of any of the embodiments above related to SEQ ID NO: 81 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 81 .
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 80.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 80.
In another embodiment, the polypeptide is derived from SEQ ID NO: 81 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 81 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 81 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 83;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 82;
(c) a polypeptide derived from or SEQ ID NO: 83 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 83.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 83. The polypeptide of any of the embodiments above related to SEQ ID NO: 83 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids.
In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 83.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 82.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 82.
In another embodiment, the polypeptide is derived from SEQ ID NO: 83 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 83 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 83 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
In another aspect, the invention relates to polypeptides having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to SEQ ID NO: 85;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of SEQ ID NO: 84;
(c) a polypeptide derived from or SEQ ID NO: 85 by substitution, deletion or addition of one or several amino acids;
(d) a polypeptide derived from the polypeptide of (a), (b), or (c) wherein the N- and/or C-terminal end has been extended by the addition of one or more amino acids; and
(e) a fragment of the polypeptide of (a), (b), or (c); wherein the polypeptide has carbonic anhydrase activity.
In one embodiment, the polypeptide has a sequence identity of at least 60%, e.g., at least
65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least
84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least
91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least
98%, at least 99%, or 100% to SEQ ID NO: 85.
In one embodiment, the polypeptide comprises, consists essentially of, or consists of the amino acid sequence of SEQ ID NO: 85.
The polypeptide of any of the embodiments above related to SEQ ID NO: 85 may have an N-terminal and/or C-terminal extension of one or more amino acids, e.g., 1-5 amino acids. In another embodiment, the polypeptide is a fragment comprising at least 85%, 90%, or 95% of the number of amino acids of SEQ ID NO: 85.
In some embodiments, the polypeptide is encoded by a polynucleotide having a sequence identity of at least 60%, e.g., at least 65%, at least 70%, at least 75%, at least 80%, at least 81%, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% to the coding sequence of SEQ
ID NO: 84.
The polynucleotide encoding the polypeptide preferably comprises, consists essentially of, or consists of SEQ ID NO: 84.
In another embodiment, the polypeptide is derived from SEQ ID NO: 85 by substitution, deletion or addition of one or more amino acids. In some embodiments, the polypeptide is a variant of SEQ ID NO: 85 comprising a substitution, deletion, and/or insertion at one or more positions. In one embodiment, the number of amino acid substitutions, deletions and/or insertions introduced into the polypeptide of SEQ ID NO: 85 is up to 15, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, or 15.
The amino acid changes may be of a minor nature, that is conservative amino acid substitutions or insertions that do not significantly affect the folding and/or activity of the protein; small deletions, typically of 1-30 amino acids; small amino- or carboxyl-terminal extensions, such as an amino-terminal methionine residue; a small linker peptide of up to 20-25 residues; or a small extension that facilitates purification by changing net charge or another function, such as a poly-histidine tract, an antigenic epitope or a binding module.
Examples of conservative substitutions are within the groups of basic amino acids (arginine, lysine and histidine), acidic amino acids (glutamic acid and aspartic acid), polar amino acids (glutamine and asparagine), hydrophobic amino acids (leucine, isoleucine and valine), aromatic amino acids (phenylalanine, tryptophan and tyrosine), and small amino acids (glycine, alanine, serine, threonine and methionine). Amino acid substitutions that do not generally alter specific activity are known in the art and are described, for example, by H. Neurath and R.L. Hill, 1979, In, The Proteins, Academic Press, New York. Common substitutions are Ala/Ser, Val/lle, Asp/Glu, Thr/Ser, Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, Lys/Arg, Asp/Asn, Leu/lle, Leu/Val, Ala/Glu, and Asp/Gly.
Alternatively, the amino acid changes are of such a nature that the physico-chemical properties of the polypeptides are altered. For example, amino acid changes may improve the thermal stability of the polypeptide, alter the substrate specificity, change the pH optimum, and the like.
Essential amino acids in a polypeptide can be identified according to procedures known in the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and Wells, 1989, Science 244: 1081-1085). In the latter technique, single alanine mutations are introduced at every residue in the molecule, and the resultant molecules are tested for carbonic anhydrase activity to identify amino acid residues that are critical to the activity of the molecule. See also, Hilton et al., 1996, J. Biol. Chem. 271 : 4699-4708. The active site of the enzyme or other biological interaction can also be determined by physical analysis of structure, as determined by such techniques as nuclear magnetic resonance, crystallography, electron diffraction, or photoaffinity labeling, in conjunction with mutation of putative contact site amino acids. See, for example, de Vos et al., 1992, Science 255: 306-312; Smith et al., 1992, J. Mol. Biol. 224: 899-904; Wlodaver et al., 1992, FEBS Lett. 309: 59-64. The identity of essential amino acids can also be inferred from an alignment with a related polypeptide, and/or be inferred from sequence homology and conserved catalytic machinery with a related polypeptide or within a polypeptide or protein family with polypeptides/proteins descending from a common ancestor, typically having similar three-dimensional structures, functions, and significant sequence similarity. Additionally or alternatively, protein structure prediction tools can be used for protein structure modelling to identify essential amino acids and/or active sites of polypeptides. See, for example, Jumper et al., 2021 , “Highly accurate protein structure prediction with AlphaFold”, Nature 596: 583-589.
Alpha-carbonic anhydrases may be identified by the consensus sequence motif: S-E- [HN]-x-[LIVM]-x(4)-[FYH]-x(2)-E-[LIVMGA]-H-[LIVMFA](2). In one embodiment, the carbonic anhydrase comprises the consensus sequence motif S-E-[HN]-x-[LIVM]-x(4)-[FYH]-x(2)-E- [LIVMGA]-H-[LIVMFA](2).
Additional guidance on the structure-activity relationship of the carbonic anhydrases herein can be determined using multiple sequence alignment (MSA) techniques well-known in the art. Based on the teachings herein, the skilled artisan could make similar alignments with any number of carbonic anhydrases described herein or known in the art. Such alignments aid the skilled artisan to determine potentially relevant domains (e.g., binding domains or catalytic domains), as well as which amino acid residues are conserved and not conserved among the different carbonic anhydrase sequences. It is appreciated in the art that changing an amino acid that is conserved at a particular position between disclosed polypeptides will more likely result in a change in biological activity (Bowie et al., 1990, Science 247: 1306-1310: “Residues that are directly involved in protein functions such as binding or catalysis will certainly be among the most conserved”). In contrast, substituting an amino acid that is not highly conserved among the polypeptides will not likely or significantly alter the biological activity.
Even further guidance on the structure-activity relationship for the skilled artisan can be found in published x-ray crystallography studies known in the art. A crystal structure of the Persephonella marina a-carbonic anhydrase identified active site residues, the calcium binding site, and disulfide residues important for thermostability (Kim et al., 2019, Mol. Cells. 42(6): 460- 469). Amino acid residues H107, H109, and H126 are predicted to form a histidine triad which is important for catalysis (corresponding to positions H86, H88, and H105 for the P. marina mature polypeptide sequence of SEQ ID NO: 77). Amino acid residues C44 and C197 are predicted to form a disulfide bridge to provide conformational and thermal stability by increasing structural rigidity (corresponding to positions C23 and C176 for the P. marina mature polypeptide sequence of SEQ ID NO: 77). Amino acid residues Y25, H82, E113 and T193 are predicted to participate in a proton shuttle mechanism, which also is relevant for the catalytic activity of the enzyme (corresponding to positions Y4, H61 , E92 and T172 for the P. marina mature polypeptide sequence of SEQ ID NO: 77). Amino acid residues V128, V138, L136, L192, V201 and W203 are predicted to be responsible for CO2 binding (corresponding to positions V107, V117, L115, L171 , V180 and W182 for the P. marina mature polypeptide sequence of SEQ ID NO: 77).
The Endozoicomonas arenosclerae carbonic anhydrase of SEQ ID NO: 2 contains key amino acid residues that may be important for activity (See alignment in Figure 8). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 2. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 26 and/or 179 of SEQ ID NO: 2. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 7 of SEQ ID NO: 2, a histidine at a position corresponding to position 64 of SEQ ID NO: 2, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 2 and/or a threonine at a position correspondning to position 175 of SEQ ID NO: 2. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 110 of SEQ ID NO: 2, a leucine at a position correspondning to position 118 of SEQ ID NO: 2, a valine at a position corresponding to position 120 of SEQ ID NO: 2, a leucine at a position correspondning to position 174 of SEQ ID NO: 2, a valine at a position correspondning to position 183 of SEQ ID NO: 2 and/or a tryptophan at a position correspondning to position 185 of SEQ ID NO: 2. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The Achromatium sp. carbonic anhydrase of SEQ ID NO: 4 contains key amino acid residues that may be important for activity (See alignment in Figure 9). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 92, 94, and/or 111 of SEQ ID NO: 4. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 28 and/or 184 of SEQ ID NO: 4. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 9 of SEQ ID NO: 4, a histidine at a position corresponding to position 67 of SEQ ID NO: 4, a glutamic acid at a position correspondning to position 98 of SEQ ID NO: 4 and/or a threonine at a position correspondning to position 180 of SEQ ID NO: 4. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 113 of SEQ ID NO: 4, a leucine at a position correspondning to position 121 of SEQ ID NO: 4, a valine at a position corresponding to position 123 of SEQ ID NO: 4, a leucine at a position correspondning to position 171 of SEQ ID NO: 4, a valine at a position correspondning to position 188 of SEQ ID NO: 4 and/or a tryptophan at a position correspondning to position 190 of SEQ ID NO: 4. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The Endozoicomonas numazuensis carbonic anhydrase of SEQ ID NO: 6 contains key amino acid residues that may be important for activity (See alignment in Figure 10). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 6. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 26 and/or 179 of SEQ ID NO: 6. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 7 of SEQ ID NO: 6, a histidine at a position corresponding to position 64 of SEQ ID NO: 6, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 6 and/or a threonine at a position correspondning to position 175 of SEQ ID NO: 6. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 110 of SEQ ID NO: 6, a leucine at a position correspondning to position 118 of SEQ ID NO: 6, a valine at a position corresponding to position 120 of SEQ ID NO: 6, a leucine at a position correspondning to position 174 of SEQ ID NO: 6, a valine at a position correspondning to position 183 of SEQ ID NO: 6 and/or a tryptophan at a position correspondning to position 185 of SEQ ID NO: 6. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The Thiorhodococcus drewsii carbonic anhydrase of SEQ ID NO: 8 contains key amino acid residues that may be important for activity (See alignment in Figure 11). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 88, 90, and/or 107 of SEQ ID NO: 8. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 25 and/or 178 of SEQ ID NO: 8. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 8, a histidine at a position corresponding to position 63 of SEQ ID NO: 8, a glutamic acid at a position correspondning to position 94 of SEQ ID NO: 8 and/or a threonine at a position correspondning to position 174 of SEQ ID NO: 8. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 109 of SEQ ID NO: 8, a leucine at a position correspondning to position 117 of SEQ ID NO: 8, a valine at a position corresponding to position 119 of SEQ ID NO: 8, a leucine at a position correspondning to position 173 of SEQ ID NO: 8, a valine at a position correspondning to position 182 of SEQ ID NO: 8 and/or a tryptophan at a position correspondning to position 184 of SEQ ID NO: 8. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 10 contains key amino acid residues that may be important for activity (See alignment in Figure 12). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 95, 97, and/or 114 of SEQ ID NO: 10. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 32 and/or 186 of SEQ ID NO: 10. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 4 of SEQ ID NO: 10, a histidine at a position corresponding to position 70 of SEQ ID NO: 10, a glutamic acid at a position correspondning to position 101 of SEQ ID NO: 10 and/or a threonine at a position correspondning to position 182 of SEQ ID NO: 10. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 116 of SEQ ID NO: 10, a leucine at a position correspondning to position 124 of SEQ ID NO: 10, a valine at a position corresponding to position 126 of SEQ ID NO: 10, a leucine at a position correspondning to position 181 of SEQ ID NO: 10, a valine at a position correspondning to position 190 of SEQ ID NO: 10 and/or a tryptophan at a position correspondning to position 192 of SEQ ID NO: 10. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 12 contains key amino acid residues that may be important for activity (See alignment in Figure 13). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 90, 92, and/or 109 of SEQ ID NO: 12. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 27 and/or 180 of SEQ ID NO: 12. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 8 of SEQ ID NO: 12, a histidine at a position corresponding to position 65 of SEQ ID NO: 12, a glutamic acid at a position correspondning to position 962 of SEQ ID NO: 12 and/or a threonine at a position correspondning to position 176 of SEQ ID NO: 12. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 111 of SEQ ID NO: 12, a leucine at a position correspondning to position 119 of SEQ ID NO: 12, a valine at a position corresponding to position 121 of SEQ ID NO: 12, a leucine at a position correspondning to position 175 of SEQ ID NO: 12, a valine at a position correspondning to position 184 of SEQ ID NO: 12 and/or a tryptophan at a position correspondning to position 186 of SEQ ID NO: 12. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The Aquificales bacterium carbonic anhydrase of SEQ ID NO: 14 contains key amino acid residues that may be important for activity (See alignment in Figure 14). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 97, 99, and/or 116 of SEQ ID NO: 14. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 32 and/or 187 of SEQ ID NO: 14. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 13 of SEQ ID NO: 14, a histidine at a position corresponding to position 72 of SEQ ID NO: 14, a glutamic acid at a position correspondning to position 103 of SEQ ID NO: 14 and/or a threonine at a position correspondning to position 183 of SEQ ID NO: 14. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 118 of SEQ ID NO: 14, a valine at a position corresponding to position 128 of SEQ ID NO: 14, a leucine at a position correspondning to position 182 of SEQ ID NO: 14, a valine at a position correspondning to position 191 of SEQ ID NO: 14 and/or a tryptophan at a position correspondning to position 193 of SEQ ID NO: 14. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, or at least five residues.
The carbonic anhydrase of SEQ ID NO: 16 contains key amino acid residues that may be important for activity (See alignment in Figure 15). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 16. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 16 and/or 177 of SEQ ID NO: 16. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 16, a histidine at a position corresponding to position 62 of SEQ ID NO: 16, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 16 and/or a threonine at a position correspondning to position 173 of SEQ ID NO: 16. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 108 of SEQ ID NO: 16, a leucine at a position correspondning to position 116 of SEQ ID NO: 16, a valine at a position corresponding to position 118 of SEQ ID NO: 16, a leucine at a position correspondning to position 172 of SEQ ID NO: 16, a valine at a position correspondning to position 181 of SEQ ID NO: 16 and/or a tryptophan at a position correspondning to position 183 of SEQ ID NO: 16. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 18 contains key amino acid residues that may be important for activity (See alignment in Figure 16). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 90, 92, and/or 109 of SEQ ID NO: 18. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 27 and/or 181 of SEQ ID NO: 18. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 8 of SEQ ID NO: 18, a histidine at a position corresponding to position 65 of SEQ ID NO: 18, a glutamic acid at a position correspondning to position 96 of SEQ ID NO: 18 and/or a threonine at a position correspondning to position 177 of SEQ ID NO: 18. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 111 of SEQ I D NO: 18, a leucine at a position correspondning to position 119 of SEQ I D NO: 18, a valine at a position corresponding to position 121 of SEQ ID NO: 18, a leucine at a position correspondning to position 185 of SEQ ID NO: 18, a valine at a position correspondning to position 180 of SEQ ID NO: 18 and/or a tryptophan at a position correspondning to position 187 of SEQ ID NO: 18. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 20 contains key amino acid residues that may be important for activity (See alignment in Figure 17). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 20. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 176 of SEQ ID NO: 20. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 20, a histidine at a position corresponding to position 62 of SEQ ID NO: 20, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 20 and/or a threonine at a position correspondning to position 172 of SEQ ID NO: 20. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 108 of SEQ ID NO: 20, a leucine at a position correspondning to position 116 of SEQ ID NO: 20, a valine at a position corresponding to position 118 of SEQ ID NO: 20, a leucine at a position correspondning to position 171 of SEQ ID NO: 20, a valine at a position correspondning to position 180 of SEQ ID NO: 20 and/or a tryptophan at a position correspondning to position 182 of SEQ ID NO: 20. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 22 contains key amino acid residues that may be important for activity (See alignment in Figure 18). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 98, 100, and/or 117 of SEQ ID NO: 22. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 35 and/or 188 of SEQ ID NO: 22. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 16 of SEQ ID NO: 22, a histidine at a position corresponding to position 73 of SEQ ID NO: 22, a glutamic acid at a position correspondning to position 104 of SEQ ID NO: 22 and/or a threonine at a position correspondning to position 184 of SEQ ID NO: 22. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 119 of SEQ ID NO: 22, a leucine at a position correspondning to position 127 of SEQ ID NO: 22, a valine at a position corresponding to position 129 of SEQ ID NO: 22, a leucine at a position correspondning to position 183 of SEQ ID NO: 22, a valine at a position correspondning to position 192 of SEQ ID NO: 22 and/or a tryptophan at a position correspondning to position 194 of SEQ ID NO: 22. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 26 contains key amino acid residues that may be important for activity (See alignment in Figure 19). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 26. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 177 of SEQ ID NO: 26. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 26, a histidine at a position corresponding to position 62 of SEQ ID NO: 26, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 26 and/or a threonine at a position correspondning to position 173 of SEQ ID NO: 26. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 108 of SEQ ID NO: 26, a leucine at a position correspondning to position 116 of SEQ ID NO: 26, a valine at a position corresponding to position 118 of SEQ ID NO: 26, a leucine at a position correspondning to position 172 of SEQ ID NO: 26, a valine at a position correspondning to position 181 of SEQ ID NO: 26 and/or a tryptophan at a position correspondning to position 183 of SEQ ID NO: 26. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 28 contains key amino acid residues that may be important for activity (See alignment in Figure 20). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 28. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 177 of SEQ ID NO: 28. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 28, a histidine at a position corresponding to position 62 of SEQ ID NO: 28, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 28 and/or a threonine at a position correspondning to position 173 of SEQ ID NO: 28. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 108 of SEQ ID NO: 28, a leucine at a position correspondning to position 116 of SEQ ID NO: 28, a valine at a position corresponding to position 118 of SEQ ID NO: 28, a leucine at a position correspondning to position 172 of SEQ ID NO: 28, a valine at a position correspondning to position 181 of SEQ ID NO: 28 and/or a tryptophan at a position correspondning to position 183 of SEQ ID NO: 28. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 30 contains key amino acid residues that may be important for activity (See alignment in Figure 21). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 86, 88, and/or 105 of SEQ ID NO: 30. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 175 of SEQ ID NO: 30. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 30, a histidine at a position corresponding to position
61 of SEQ ID NO: 30, a glutamic acid at a position correspondning to position 92 of SEQ ID NO: 30 and/or a threonine at a position correspondning to position 171 of SEQ ID NO: 30. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position
107 of SEQ ID NO: 30, a leucine at a position correspondning to position 115 of SEQ ID NO: 30, a valine at a position corresponding to position 117 of SEQ ID NO: 30, a leucine at a position correspondning to position 171 of SEQ ID NO: 30, a valine at a position correspondning to position 179 of SEQ ID NO: 30 and/or a tryptophan at a position correspondning to position 181 of SEQ ID NO: 30. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 32 contains key amino acid residues that may be important for activity (See alignment in Figure 22). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 32. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 177 of SEQ ID NO: 32. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 32, a histidine at a position corresponding to position
62 of SEQ ID NO: 32, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 32 and/or a threonine at a position correspondning to position 173 of SEQ ID NO: 32. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position
108 of SEQ ID NO: 32, a leucine at a position correspondning to position 116 of SEQ ID NO: 32, a valine at a position corresponding to position 118 of SEQ ID NO: 32, a leucine at a position correspondning to position 172 of SEQ ID NO: 32, a valine at a position correspondning to position 181 of SEQ ID NO: 32 and/or a tryptophan at a position correspondning to position 183 of SEQ ID NO: 32. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 34 contains key amino acid residues that may be important for activity (See alignment in Figure 23). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 34. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 25 and/or 177 of SEQ ID NO: 34. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 34, a histidine at a position corresponding to position 62 of SEQ ID NO: 34, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 34 and/or a threonine at a position correspondning to position 173 of SEQ ID NO: 34. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 108 of SEQ ID NO: 34, a leucine at a position correspondning to position 116 of SEQ ID NO: 34, a valine at a position corresponding to position 118 of SEQ ID NO: 34, a leucine at a position correspondning to position 172 of SEQ ID NO: 34, a valine at a position correspondning to position 181 of SEQ ID NO: 34 and/or a tryptophan at a position correspondning to position 183 of SEQ ID NO: 34. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 36 contains key amino acid residues that may be important for activity (See alignment in Figure 24). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 91 , 93, and/or 110 of SEQ ID NO: 36. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 25 and/or 182 of SEQ ID NO: 36. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 36, a histidine at a position corresponding to position 66 of SEQ ID NO: 36, a glutamic acid at a position correspondning to position 97 of SEQ ID NO: 36 and/or a threonine at a position correspondning to position 178 of SEQ ID NO: 36. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 112 of SEQ ID NO: 36, a leucine at a position correspondning to position 120 of SEQ ID NO: 36, a valine at a position corresponding to position 122 of SEQ ID NO: 36, a leucine at a position correspondning to position 177 of SEQ ID NO: 36, a valine at a position correspondning to position 186 of SEQ ID NO: 36 and/or a tryptophan at a position correspondning to position 188 of SEQ ID NO: 36. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 38 contains key amino acid residues that may be important for activity (See alignment in Figure 25). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 38. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 26 and/or 179 of SEQ ID NO: 38. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 7 of SEQ ID NO: 38, a histidine at a position corresponding to position 64 of SEQ ID NO: 38, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 38 and/or a threonine at a position correspondning to position 175 of SEQ ID NO: 38. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 110 of SEQ ID NO: 38, a leucine at a position correspondning to position 118 of SEQ ID NO: 38, a valine at a position corresponding to position 120 of SEQ ID NO: 38, a leucine at a position correspondning to position 174 of SEQ ID NO: 38, a valine at a position correspondning to position 183 of SEQ ID NO: 38 and/or a tryptophan at a position correspondning to position 185 of SEQ ID NO: 38. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 40 contains key amino acid residues that may be important for activity (See alignment in Figure 26). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 40. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 27 and/or 179 of SEQ ID NO: 40. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 8 of SEQ ID NO: 40, a histidine at a position corresponding to position 64 of SEQ ID NO: 40, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 40 and/or a threonine at a position correspondning to position 175 of SEQ ID NO: 40. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 110 of SEQ ID NO: 40, a leucine at a position correspondning to position 118 of SEQ ID NO: 40, a valine at a position corresponding to position 120 of SEQ ID NO: 40, a leucine at a position correspondning to position 174 of SEQ ID NO: 40, a valine at a position correspondning to position 183 of SEQ ID NO: 40 and/or a tryptophan at a position correspondning to position 185 of SEQ ID NO: 40. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 42 contains key amino acid residues that may be important for activity (See alignment in Figure 27). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 42. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 25 and/or 176 of SEQ ID NO: 42. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 42, a histidine at a position corresponding to position 42 of SEQ ID NO: 42, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 42 and/or a threonine at a position correspondning to position 172 of SEQ ID NO: 42. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 108 of SEQ ID NO: 42, a leucine at a position correspondning to position 116 of SEQ ID NO: 42, a valine at a position corresponding to position 118 of SEQ ID NO: 42, a leucine at a position correspondning to position 171 of SEQ ID NO: 42, a valine at a position correspondning to position 180 of SEQ ID NO: 42 and/or a tryptophan at a position correspondning to position 182 of SEQ ID NO: 42. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 44 contains key amino acid residues that may be important for activity (See alignment in Figure 28). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 87, 89, and/or 106 of SEQ ID NO: 44. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 177 of SEQ ID NO: 44. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 44, a histidine at a position corresponding to position 62 of SEQ ID NO: 44, a glutamic acid at a position correspondning to position 93 of SEQ ID NO: 44 and/or a threonine at a position correspondning to position 173 of SEQ ID NO: 44. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 108 of SEQ ID NO: 44, a leucine at a position correspondning to position 116 of SEQ ID NO: 44, a valine at a position corresponding to position 118 of SEQ ID NO: 44, a leucine at a position correspondning to position 172 of SEQ ID NO: 44, a valine at a position correspondning to position 181 of SEQ ID NO: 44 and/or a tryptophan at a position correspondning to position 183 of SEQ ID NO: 44. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 46 contains key amino acid residues that may be important for activity (See alignment in Figure 29). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 90, 92, and/or 109 of SEQ ID NO: 46. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 27 and/or 180 of SEQ ID NO: 46. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 8 of SEQ ID NO: 46, a histidine at a position corresponding to position 65 of SEQ ID NO: 46, a glutamic acid at a position correspondning to position 96 of SEQ ID NO: 46 and/or a threonine at a position correspondning to position 176 of SEQ ID NO: 46. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 111 of SEQ ID NO: 46, a leucine at a position correspondning to position 119 of SEQ ID NO: 46, a valine at a position corresponding to position 121 of SEQ ID NO: 46, a leucine at a position correspondning to position 175 of SEQ ID NO: 46, a valine at a position correspondning to position 184 of SEQ ID NO: 46 and/or a tryptophan at a position correspondning to position 186 of SEQ ID NO: 46. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 48 contains key amino acid residues that may be important for activity (See alignment in Figure 30). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 48. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 26 and/or 180 of SEQ ID NO: 48. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 48, a histidine at a position corresponding to position 64 of SEQ ID NO: 48, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 48 and/or a threonine at a position correspondning to position 176 of SEQ ID NO: 48. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 109 of SEQ ID NO: 48, a leucine at a position correspondning to position 117 of SEQ ID NO: 48, a valine at a position corresponding to position 119 of SEQ ID NO: 48, a leucine at a position correspondning to position 175 of SEQ ID NO: 48, a valine at a position correspondning to position 184 of SEQ ID NO: 48 and/or a tryptophan at a position correspondning to position 186 of SEQ ID NO: 48. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 50 contains key amino acid residues that may be important for activity (See alignment in Figure 31). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 50. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 26 and/or 179 of SEQ ID NO: 50. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 7 of SEQ ID NO: 50, a histidine at a position corresponding to position 64 of SEQ ID NO: 50, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 50 and/or a threonine at a position correspondning to position 175 of SEQ ID NO: 50. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 110 of SEQ ID NO: 50, a leucine at a position correspondning to position 118 of SEQ ID NO: 50, a valine at a position corresponding to position 120 of SEQ ID NO: 50, a leucine at a position correspondning to position 174 of SEQ ID NO: 50, a valine at a position correspondning to position 183 of SEQ ID NO: 50 and/or a tryptophan at a position correspondning to position 185 of SEQ ID NO: 50. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 52 contains key amino acid residues that may be important for activity (See alignment in Figure 32). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 86, 88, and/or 105 of SEQ ID NO: 52. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 23 and/or 176 of SEQ ID NO: 52. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 4 of SEQ ID NO: 52, a histidine at a position corresponding to position 61 of SEQ ID NO: 52, a glutamic acid at a position correspondning to position 92 of SEQ ID NO: 52 and/or a threonine at a position correspondning to position 172 of SEQ ID NO: 52. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 107 of SEQ ID NO: 52, a leucine at a position correspondning to position 115 of SEQ ID NO: 52, a valine at a position corresponding to position 117 of SEQ ID NO: 52, a leucine at a position correspondning to position 171 of SEQ ID NO: 52, a valine at a position correspondning to position 180 of SEQ ID NO: 52 and/or a tryptophan at a position correspondning to position 182 of SEQ ID NO: 52. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 54 contains key amino acid residues that may be important for activity (See alignment in Figure 33). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 85, 87, and/or 104 of SEQ ID NO: 54. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 22 and/or 175 of SEQ ID NO: 54. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 3 of SEQ ID NO: 54, a histidine at a position corresponding to position 60 of SEQ ID NO: 54, a glutamic acid at a position correspondning to position 91 of SEQ ID NO: 54 and/or a threonine at a position correspondning to position 171 of SEQ ID NO: 54. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 106 of SEQ ID NO: 54, a leucine at a position correspondning to position 114 of SEQ ID NO: 54, a valine at a position corresponding to position 116 of SEQ ID NO: 54, a leucine at a position correspondning to position 170 of SEQ ID NO: 54, a valine at a position correspondning to position 179 of SEQ ID NO: 54 and/or a tryptophan at a position correspondning to position 181 of SEQ ID NO: 54. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 56 contains key amino acid residues that may be important for activity (See alignment in Figure 34). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 90, 92, and/or 109 of SEQ ID NO: 56. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 28 and/or 180 of SEQ ID NO: 56. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 9 of SEQ ID NO: 56, a histidine at a position corresponding to position 65 of SEQ ID NO: 56, a glutamic acid at a position correspondning to position 96 of SEQ ID NO: 56 and/or a threonine at a position correspondning to position 176 of SEQ ID NO: 56. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 111 of SEQ ID NO: 56, a leucine at a position correspondning to position 119 of SEQ ID NO: 56, a valine at a position corresponding to position 121 of SEQ ID NO: 56, a leucine at a position correspondning to position 175 of SEQ ID NO: 56, a valine at a position correspondning to position 184 of SEQ ID NO: 56 and/or a tryptophan at a position correspondning to position 186 of SEQ ID NO: 56. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 58 contains key amino acid residues that may be important for activity (See alignment in Figure 35). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 92, 94, and/or 111 of SEQ ID NO: 58. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 28 and/or 182 of SEQ ID NO: 58. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 9 of SEQ ID NO: 58, a histidine at a position corresponding to position 66 of SEQ ID NO: 58, a glutamic acid at a position correspondning to position 98 of SEQ ID NO: 58 and/or a threonine at a position correspondning to position 178 of SEQ ID NO: 58. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 113 of SEQ ID NO: 58, a leucine at a position correspondning to position 121 of SEQ ID NO: 58, a valine at a position corresponding to position 123 of SEQ ID NO: 58, a leucine at a position correspondning to position 177 of SEQ ID NO: 58, a valine at a position correspondning to position 186 of SEQ ID NO: 58 and/or a tryptophan at a position correspondning to position 188 of SEQ ID NO: 58. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 60 contains key amino acid residues that may be important for activity (See alignment in Figure 36). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 85, 87, and/or 104 of SEQ ID NO: 60. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 20 and/or 175 of SEQ ID NO: 60. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 1 of SEQ ID NO: 60, a histidine at a position corresponding to position 58 of SEQ ID NO: 60, a glutamic acid at a position correspondning to position 91 of SEQ ID NO: 60 and/or a threonine at a position correspondning to position 171 of SEQ ID NO: 60. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 106 of SEQ ID NO: 60, a valine at a position corresponding to position 116 of SEQ ID NO: 60, a leucine at a position correspondning to position 170 of SEQ ID NO: 60, a valine at a position correspondning to position 179 of SEQ ID NO: 60 and/or a tryptophan at a position correspondning to position 181 of SEQ ID NO: 60. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, or at least five residues.
The carbonic anhydrase of SEQ ID NO: 62 contains key amino acid residues that may be important for activity (See alignment in Figure 37). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 62. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 27 and/or 179 of SEQ ID NO: 62. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 8 of SEQ ID NO: 62, a histidine at a position corresponding to position 64 of SEQ ID NO: 62, a glutamic acid at a position correspondning to position 95 of SEQ ID NO:
62 and/or a threonine at a position correspondning to position 175 of SEQ ID NO: 62. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 110 of SEQ ID NO: 62, a leucine at a position correspondning to position 118 of SEQ ID NO: 62, a valine at a position corresponding to position 120 of SEQ ID NO: 62, a leucine at a position correspondning to position 174 of SEQ ID NO: 62, a valine at a position correspondning to position 183 of SEQ ID NO: 62 and/or a tryptophan at a position correspondning to position 185 of SEQ ID NO: 62. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 64 contains key amino acid residues that may be important for activity (See alignment in Figure 38). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 91 , 93, and/or 110 of SEQ ID NO: 64. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 25 and/or 112 of SEQ ID NO: 64. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 64, a histidine at a position corresponding to position
63 of SEQ ID NO: 64, a glutamic acid at a position correspondning to position 97 of SEQ ID NO:
64 and/or a threonine at a position correspondning to position 177 of SEQ ID NO: 64. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 112 of SEQ ID NO: 64, a leucine at a position correspondning to position 120 of SEQ ID NO: 64, a valine at a position corresponding to position 122 of SEQ ID NO: 64, a leucine at a position correspondning to position 176 of SEQ ID NO: 64, a valine at a position correspondning to position 185 of SEQ ID NO: 64 and/or a tryptophan at a position correspondning to position 187 of SEQ ID NO: 64. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 66 contains key amino acid residues that may be important for activity (See alignment in Figure 39). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 88, 90, and/or 107 of SEQ ID NO: 66. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 25 and/or 178 of SEQ ID NO: 66. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 66, a histidine at a position corresponding to position
63 of SEQ ID NO: 66, a glutamic acid at a position correspondning to position 94 of SEQ ID NO: 66 and/or a threonine at a position correspondning to position 174 of SEQ ID NO: 66. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position
109 of SEQ ID NO: 66, a leucine at a position correspondning to position 117 of SEQ ID NO: 66, a valine at a position corresponding to position 119 of SEQ ID NO: 66, a leucine at a position correspondning to position 173 of SEQ ID NO: 66, a valine at a position correspondning to position 182 of SEQ ID NO: 66 and/or a tryptophan at a position correspondning to position 184 of SEQ ID NO: 66. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 68 contains key amino acid residues that may be important for activity (See alignment in Figure 40). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 68. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 178 of SEQ ID NO: 68. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 5 of SEQ ID NO: 68, a histidine at a position corresponding to position
64 of SEQ ID NO: 68, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 68 and/or a threonine at a position correspondning to position 174 of SEQ ID NO: 68. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position
110 of SEQ ID NO: 68, a leucine at a position correspondning to position 118 of SEQ ID NO: 68, a valine at a position corresponding to position 120 of SEQ ID NO: 68, a leucine at a position correspondning to position 173 of SEQ ID NO: 68, a valine at a position correspondning to position 182 of SEQ ID NO: 68 and/or a tryptophan at a position correspondning to position 184 of SEQ ID NO: 68. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 70 contains key amino acid residues that may be important for activity (See alignment in Figure 41). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 89, 91 , and/or 108 of SEQ ID NO: 70. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 24 and/or 179 of SEQ ID NO: 70. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 4 of SEQ ID NO: 70, a histidine at a position corresponding to position 62 of SEQ ID NO: 70, a glutamic acid at a position correspondning to position 95 of SEQ ID NO: 70 and/or a threonine at a position correspondning to position 175 of SEQ ID NO: 70. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 110 of SEQ ID NO: 70, a leucine at a position correspondning to position 118 of SEQ ID NO: 70, a valine at a position corresponding to position 120 of SEQ ID NO: 70, a leucine at a position correspondning to position 174 of SEQ ID NO: 70, a valine at a position correspondning to position 183 of SEQ ID NO: 70 and/or a tryptophan at a position correspondning to position 185 of SEQ ID NO: 70. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The Trichophaea saccata carbonic anhydrase of SEQ ID NO: 74 contains key amino acid residues that may be important for activity (See alignment in Figure 42). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 102, 104, and/or 121 of SEQ ID NO: 74. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 34 and/or 195 of SEQ ID NO: 74. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 14 of SEQ ID NO: 74, a histidine at a position corresponding to position 72 of SEQ ID NO: 74, a glutamic acid at a position correspondning to position 108 of SEQ ID NO: 74 and/or a threonine at a position correspondning to position 191 of SEQ ID NO: 74. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 123 of SEQ ID NO: 74, a leucine at a position correspondning to position 131 of SEQ ID NO: 74, a valine at a position corresponding to position 133 of SEQ ID NO: 74, a leucine at a position correspondning to position 190 of SEQ ID NO: 74, a valine at a position correspondning to position 199 of SEQ ID NO: 74 and/or a tryptophan at a position correspondning to position 201 of SEQ ID NO: 74. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The Eleutherascus tuberculatus carbonic anhydrase of SEQ ID NO: 76 contains key amino acid residues that may be important for activity (See alignment in Figure 43). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 106, 108, and/or 125 of SEQ ID NO: 76. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 34 and/or 202 of SEQ ID NO: 76. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 15 of SEQ ID NO: 76, a histidine at a position corresponding to position 72 of SEQ ID NO: 76, a glutamic acid at a position correspondning to position 112 of SEQ ID NO: 76 and/or a threonine at a position correspondning to position 198 of SEQ ID NO: 76. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 127 of SEQ ID NO: 76, a leucine at a position correspondning to position 136 of SEQ ID NO: 76, a valine at a position corresponding to position 138 of SEQ ID NO: 76, a leucine at a position correspondning to position 197 of SEQ ID NO: 76, a valine at a position correspondning to position 206 of SEQ ID NO: 76 and/or a tryptophan at a position correspondning to position 208 of SEQ ID NO: 76. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The Lactobacillus sp. carbonic anhydrase of SEQ ID NO: 79 contains key amino acid residues that may be important for activity (See alignment in Figure 44). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 79, 81 , and/or 98 of SEQ ID NO: 79. In another embodiment, the carbonic anhydrase contains a glutamic acid at a position correspondning to position 85 of SEQ ID NO: 79 and/or a threonine at a position correspondning to position 163 of SEQ ID NO: 79. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 100 of SEQ ID NO: 79, a valine at a position corresponding to position 110 of SEQ ID NO: 79, a leucine at a position correspondning to position 162 of SEQ ID NO: 79, a valine at a position correspondning to position 183 of SEQ I D NO: 2 and/or a tryptophan at a position correspondning to position 171 of SEQ ID NO: 79. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 81 contains key amino acid residues that may be important for activity (See alignment in Figure 45). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 95, 97, and/or 114 of SEQ ID NO: 81. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 30 and/or 185 of SEQ ID NO: 81. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 11 of SEQ ID NO: 81 , a histidine at a position corresponding to position 68 of SEQ ID NO: 81 , a glutamic acid at a position correspondning to position 101 of SEQ ID NO: 81 and/or a threonine at a position correspondning to position 181 of SEQ ID NO: 81. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 116 of SEQ ID NO: 81 , a leucine at a position correspondning to position 124 of SEQ ID NO: 81 , a valine at a position corresponding to position 126 of SEQ ID NO: 81 , a leucine at a position correspondning to position 180 of SEQ ID NO: 81 , a valine at a position correspondning to position 189 of SEQ ID NO: 81 and/or a tryptophan at a position correspondning to position 191 of SEQ ID NO: 81 . Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
The carbonic anhydrase of SEQ ID NO: 83 contains key amino acid residues that may be important for activity (See alignment in Figure 46). Accordingly, in one embodiment of the present invention, the carbonic anhydrase contains a histidine residue at position(s) corresponsding to position 88, 90, and/or 107 of SEQ ID NO: 83. In another embodiment the carbonic anhydrase contains a cysteine residue at position(s) corresponding to position 25 and/or 180 of SEQ ID NO: 83. In another embodiment, the carbonic anhydrase contains a tyrosine at a position corresponding to position 6 of SEQ ID NO: 83, a histidine at a position corresponding to position 63 of SEQ ID NO: 83, a glutamic acid at a position correspondning to position 94 of SEQ ID NO: 83 and/or a threonine at a position correspondning to position 176 of SEQ ID NO: 83. Preferably, at least one of the proton shuttle positions are present, more preferably at least two proton shuttle positions are present, more preferably at least three proton shuttle positions are present and most preferably all the proton shuttle positions are present in the carbonic anhydrase. In another embodiment, the carbonic anhydrase contains a valine at a position corresponding to position 109 of SEQ ID NO: 83, a leucine at a position correspondning to position 117 of SEQ ID NO: 83, a valine at a position corresponding to position 119 of SEQ ID NO: 83, a leucine at a position correspondning to position 175 of SEQ ID NO: 83, a valine at a position correspondning to position 184 of SEQ ID NO: 83 and/or a tryptophan at a position correspondning to position 186 of SEQ ID NO: 83. Preferably, at least one of the residues responsible for CO2 binding is present, more preferably at least two, at least three, at least four, at least five, or all six residues.
Additional guidance on the structure-activity relationship for the skilled artisan can be found in other published x-ray crystallography studies known in the art, e.g., the carbonic anhydrase of SEQ ID NO: 2 of PCT/US2023/081794 (Fredslund et al., 2018, Enzyme Microb. Technol. 114: 48-54).
Additionally, structure-activity can be deciphered with the aid of the highly accurate neural network-based modelling program of AlphaFold (Jumper et al., Nature, 596: 583-589 (2021)). AlphaFold is a computational method for predicting the three-dimensional structure of a polypeptide from its amino acid sequence. Predicted structures for millions of polypeptides deposited in the UniProt database have been deposited in the AlphaFold Protein Structure Database, using the AlphaFold Monomer v2.0 model (Varadi et al., Nucleic Acids Research, 50: D439-D444 (2021)). In the AlphaFold Protein Structure Database, the three-dimensional structure of a polypeptide can be obtained by searching for the UniProt accession number of the polypeptide. In addition to the many three-dimensional structures that are already publicly available, code is available for reproducing and predicting structures of new polypeptides at source code repositories.
For the purposes of the present invention, the relatedness between the three-dimensional structure of two polypeptides is described by the parameter “structural similarity”. A three- dimensional structure of any polypeptide may be obtained experimentally via, e.g., X-ray crystallography or using in silico methods such as AlphaFold (vide supra). The structural similarity between three-dimensional structures may then be determined by the TM-score, which is calculated using the following general formula (Zhang & Skolnick, Proteins, 57:702-710 (2004)):
TM-score where LN is the length of the native structure, LT is the length of the aligned residues to the template structure, dj is the distance between the ith pair of aligned residues and do is a scale to normalize the match difference. ‘Max’ denotes the maximum value after optimal spatial superposition.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85, wherein the three- dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 2, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least
0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 4, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 6, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 8, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 10, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 12, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 14, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 16, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 18, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 20, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 22, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 24, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 26, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 28, wherein the three-dimensional structure is calculated by Alphafold. In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 30, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 32, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 34, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 36, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 38, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 40, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least
0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 42, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 44, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 46, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 48, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 50, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 52, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 54, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 56, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 58, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 60, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 62, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 64, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 66, wherein the three-dimensional structure is calculated by Alphafold. In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 68, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 70, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 72, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 74, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 76, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 79, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least
0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 81 , wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 83, wherein the three-dimensional structure is calculated by Alphafold.
In one embodiment, the carbonic anhydrase has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1 .0, to the three-dimensional structure of the polypeptide of SEQ ID NO: 85, wherein the three-dimensional structure is calculated by Alphafold.
Variants can be prepared using any mutagenesis procedure known in the art, such as site-directed mutagenesis, synthetic gene construction, semi-synthetic gene construction, random mutagenesis, shuffling, etc.
Site-directed mutagenesis is a technique in which one or more mutations are introduced at one or more defined sites in a polynucleotide encoding the parent polypeptide.
Site-directed mutagenesis can be accomplished in vitro by PCR involving the use of oligonucleotide primers containing the desired mutation. Site-directed mutagenesis can also be performed in vitro by cassette mutagenesis involving the cleavage by a restriction enzyme at a site in the plasmid comprising a polynucleotide encoding the parent and subsequent ligation of an oligonucleotide containing the mutation in the polynucleotide. Usually the restriction enzyme that digests the plasmid and the oligonucleotide is the same, permitting sticky ends of the plasmid and the insert to ligate to one another. See, e.g., Scherer and Davis, 1979, Proc. Natl. Acad. Sci. USA 7Q: 4949-4955; and Barton et al., 1990, Nucleic Acids Res. 18: 7349-4966.
Site-directed mutagenesis can also be accomplished in vivo by methods known in the art. See, e.g., US 2004/0171154; Storici et al., 2001 , Nature Biotechnol. 19: 773-776; Kren et a/., 1998, Nat. Med. 4: 285-290; and Calissano and Macino, 1996, Fungal Genet. Newslett. 43: 15- 16.
Any site-directed mutagenesis procedure can be used in the present invention. There are many commercial kits available that can be used to prepare variants.
Synthetic gene construction entails in vitro synthesis of a designed polynucleotide molecule to encode a polypeptide of interest. Gene synthesis can be performed utilizing a number of techniques, such as the multiplex microchip-based technology described by Tian et al., 2004, Nature 432: 1050-1054, and similar technologies wherein oligonucleotides are synthesized and assembled upon photo-programmable microfluidic chips.
Single or multiple amino acid substitutions, deletions, and/or insertions can be made and tested using known methods of mutagenesis, recombination, and/or shuffling, followed by a relevant screening procedure, such as those disclosed by Reidhaar-Olson and Sauer, 1988, Science 241 : 53-57; Bowie and Sauer, 1989, Proc. Natl. Acad. Sci. USA 86: 2152-2156; WO 95/17413; or WO 95/22625. Other methods that can be used include error-prone PCR, phage display (e.g., Lowman etal., 1991 , Biochemistry 30: 10832-10837; US 5,223,409; WO 92/06204), and region-directed mutagenesis (Derbyshire et al., 1986, Gene 46: 145; Ner et al., 1988, DNA 7: 127).
Mutagenesis/shuffling methods can be combined with high-throughput, automated screening methods to detect activity of cloned, mutagenized polypeptides expressed by host cells (Ness et al., 1999, Nature Biotechnology 17: 893-896). Mutagenized DNA molecules that encode active polypeptides can be recovered from the host cells and rapidly sequenced using standard methods in the art. These methods allow the rapid determination of the importance of individual amino acid residues in a polypeptide.
Semi-synthetic gene construction is accomplished by combining aspects of synthetic gene construction, and/or site-directed mutagenesis, and/or random mutagenesis, and/or shuffling. Semi-synthetic construction is typified by a process utilizing polynucleotide fragments that are synthesized, in combination with PCR techniques. Defined regions of genes may thus be synthesized de novo, while other regions may be amplified using site-specific mutagenic primers, while yet other regions may be subjected to error-prone PCR or non-error prone PCR amplification. Polynucleotide subsequences may then be shuffled.
In one embodiment, the carbonic anhydrase has improved thermostability compared to the compared to carbonic anhyrases known in the art when tested under the same conditions (e.g., using an assay described in WO2018/017792, e.g., after incubation for the given time in 1 M NaHCOs buffer at pH 8 at the given elevated temperature). In one embodiment, the carbonic anhydrases are cablable of maintaining a residual activity of at least 30%, preferably above 40%, more preferably above 50%, more preferably above 60%, even more preferably above 70%, most preferably above 80%, most preferably above 85%, most preferably above 90%, most preferably above 95%, and even most preferably the residual activity is unchanged after incubation in 1 M NaHCOs buffer pH 8 at temperatures above 45 °C, preferably above 50 °C, above 55 °C, above 60 °C, above 65 °C, more preferably above 70 °C, most preferably above 80 °C, most preferably above 90 °C, most preferably above 100 °C, most preferably above 105 °C and even most preferably above 110 °C for at least 15 minutes, preferably for at least 2 hours, more preferably for at least 24 hours, more preferably for at least 7 days, more preferably for at least 10 days, even more preferably for at least 14 days, most preferably for at least 30 days, even most preferably for at least 50 days at the elevated temperature. In one embodiment, the carbonic anhydrase maintains at least 50% residual activity in 30% methyldiethanolamine (MDEA) at 85°C after five days. In one embodiment, the carbonic anhydrase maintains at least 85% activity when incubated for 15 minutes in 1M NaHCCh solution (approximately pH 8-10) in the temperature range 25-90°C. In one embodiment, the carbonic anhydrase maintains full activity at 50°C over the pH range 4-11 for one day. In one embodiment, the carbonic anhydrase maintains more than 50% activity over the pH range 4-11 after 10 days at 50°C.
The polypeptide may be a fusion polypeptide.
In an aspect, the polypeptide is isolated.
In another aspect, the polypeptide is purified.
Sources of Polypeptides Having Carbonic Anhydrase Activity
A polypeptide having carbonic anhydrase activity of the present invention may be obtained from microorganisms of any genus. For purposes of the present invention, the term “obtained from” as used herein in connection with a given source shall mean that the polypeptide encoded by a polynucleotide is produced by the source or by a strain in which the polynucleotide of the invention has been inserted. In one aspect, the polypeptide obtained from a given source is secreted extracellularly.
In one aspect, the carboninc anhydrase is a Endozoicomonas carbonic anhydrase, e.g., a carbonic anhydrase from Endozoicomonas arenosclerae or Endozoicomonas numazuensis, such as the carbonic anhydrase of SEQ ID NO: 2 or 6.
In another aspect, the carboninc anhydrase is a Achromatium carbonic anhydrase, e.g., the Achromatium carbonic anhydrase of SEQ ID NO: 4.
In another aspect, the carboninc anhydrase is a Thiorhodococcus carbonic anhydrase, e.g., a carbonic anhydrase from Thiorhodococcus drewsii, such as the carbonic anhydrase of SEQ ID NO: 8.
In another aspect, the carboninc anhydrase is a Aquificales carbonic anhydrase, e.g., a carbonic anhydrase from Aquificales bacterium, such as the carbonic anhydrase of SEQ ID NO: 14.
In another aspect, the carboninc anhydrase is a Trichophaea carbonic anhydrase, e.g., a carbonic anhydrase from Trichophaea saccata, such as the carbonic anhydrase of SEQ ID NO: 74.
In another aspect, the carboninc anhydrase is a Eleutherascus carbonic anhydrase, e.g., a carbonic anhydrase from Eleutherascus tuberculatus, such as the carbonic anhydrase of SEQ ID NO: 76.
In another aspect, the carboninc anhydrase is a Lactobacillus sp. carbonic anhydrase, such as the carbonic anhydrase of SEQ ID NO: 79. It will be understood that for the aforementioned species, the invention encompasses both the perfect and imperfect states, and other taxonomic equivalents, e.g., anamorphs, regardless of the species name by which they are known. Those skilled in the art will readily recognize the identity of appropriate equivalents.
The polypeptides may be identified and obtained from other sources including microorganisms isolated from nature (e.g., soil, composts, water, etc.) or DNA samples obtained directly from natural materials (e.g., soil, composts, water, etc.) using the above-mentioned probes. Techniques for isolating microorganisms and DNA directly from natural habitats are well known in the art. A polynucleotide encoding the polypeptide may then be obtained by similarly screening a genomic DNA, metagenomic DNA or cDNA library of another microorganism or mixed DNA sample. Once a polynucleotide encoding a polypeptide has been detected with the probe(s), the polynucleotide can be isolated or cloned by utilizing techniques that are known to those of ordinary skill in the art (see, e.g., Davis etal., 2012, Basic Methods in Molecular Biology, Elsevier).
Polynucleotides
The present invention also relates to polynucleotides encoding a polypeptide of the present invention, as described herein.
The polynucleotide may be a genomic DNA, a cDNA, a synthetic DNA, a synthetic RNA, a mRNA, or a combination thereof. The polynucleotide may be cloned from a strain of Endozoicomonas, Achromatium, Thiorhodococcus, Aquificales, Trichophaea, Eleutherascus or a related organism and thus, for example, may be a polynucleotide sequence encoding a variant of the carbonic anhydrase of the invention.
In an embodiment, the polynucleotide is a subsequence encoding a fragment having carbonic anhydrase of the present invention. In an aspect, the subsequence contains at least 85%, 90%, or 95% nucleotides of any one of SEQ ID NOs: 1 , 3, 5, 7, 9, 11 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31 , 33, 35, 37, 39, 41 , 43, 45, 47, 49, 51 , 53, 55, 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75, 78, 80, 82, and 84.
In one embodiment the polynucleotide encoding the polypeptide of the present invention is isolated from a Endozoicomonas, Achromatium, Thiorhodococcus, Aquificales, Trichophaea, Eleutherascus or Lactobacillus cell.
The polynucleotide may also be mutated by introduction of nucleotide substitutions that do not result in a change in the amino acid sequence of the polypeptide, but which correspond to the codon usage of the host organism intended for production of the enzyme, or by introduction of nucleotide substitutions that may give rise to a different amino acid sequence. For a general description of nucleotide substitution, see, e.g., Ford et al., 1991 , Protein Expression and Purification 2: 95-107.
In an aspect, the polynucleotide is isolated. In another aspect, the polynucleotide is purified.
Nucleic Acid Constructs
The present invention also relates to nucleic acid constructs comprising a polynucleotide of the present invention, wherein the polynucleotide is operably linked to one or more control sequences that direct the expression of the coding sequence in a suitable host cell under conditions compatible with the control sequences.
The polynucleotide may be manipulated in a variety of ways to provide for expression of the polypeptide. Manipulation of the polynucleotide prior to its insertion into a vector may be desirable or necessary depending on the expression vector. Techniques for modifying polynucleotides utilizing recombinant DNA methods are well known in the art.
Promoters
The control sequence may be a promoter, a polynucleotide that is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention. The promoter contains transcriptional control sequences that mediate the expression of the polypeptide. The promoter may be any polynucleotide that shows transcriptional activity in the host cell including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell.
Examples of suitable promoters for directing transcription of the polynucleotide of the present invention in a bacterial host cell are described in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Lab., NY, Davis et al., 2012, supra, and Song et a!., 2016, PLOS One 11(7): e0158447.
Examples of suitable promoters for directing transcription of the polynucleotide of the present invention in a filamentous fungal host cell are promoters obtained from Aspergillus, Fusarium, Rhizomucor and Trichoderma cells, such as the promoters described in Mukherjee et al., 2013, “Trichoderma: Biology and Applications”, and by Schmoll and Dattenbdck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
For expression in a yeast host, examples of useful promoters are described by Smolke et al., 2018, “Synthetic Biology: Parts, Devices and Applications” (Chapter 6: Constitutive and Regulated Promoters in Yeast: How to Design and Make Use of Promoters in S. cerevisiae), and by Schmoll and Dattenbdck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
Terminators
The control sequence may also be a transcription terminator, which is recognized by a host cell to terminate transcription. The terminator is operably linked to the 3’-terminus of the polynucleotide encoding the polypeptide. Any terminator that is functional in the host cell may be used in the present invention.
Preferred terminators for bacterial host cells may be obtained from the genes for Bacillus clausii alkaline protease (aprH), Bacillus licheniformis alpha-amylase (amyL), and Escherichia coli ribosomal RNA (rrnB).
Preferred terminators for filamentous fungal host cells may be obtained from Aspergillus or Trichoderma species, such as obtained from the genes for Aspergillus niger glucoamylase, Trichoderma reesei beta-glucosidase, Trichoderma reesei cellobiohydrolase I, and Trichoderma reesei endoglucanase I, such as the terminators described in Mukherjee et al., 2013, “Trichoderma: Biology and Applications”, and by Schmoll and Dattenbdck, 2016, “Gene Expression Systems in Fungi: Advancements and Applications”, Fungal Biology.
Preferred terminators for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase, Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other useful terminators for yeast host cells are described by Romanos et al., 1992, Yeast 8: 423-488. mRNA Stabilizers
The control sequence may also be an mRNA stabilizer region downstream of a promoter and upstream of the coding sequence of a gene which increases expression of the gene.
Examples of suitable mRNA stabilizer regions are obtained from a Bacillus thuringiensis crylllA gene (WO 94/25612) and a Bacillus subtilis SP82 gene (Hue et al., 1995, J. Bacteriol. 177: 3465-3471).
Examples of mRNA stabilizer regions for fungal cells are described in Geisberg et al., 2014, Cell 156(4): 812-824, and in Morozov et al., 2006, Eukaryotic Ce// 5(11): 1838-1846.
Leader Sequences
The control sequence may also be a leader, a non-translated region of an mRNA that is important for translation by the host cell. The leader is operably linked to the 5’-terminus of the polynucleotide encoding the polypeptide. Any leader that is functional in the host cell may be used.
Suitable leaders for bacterial host cells are described by Hambraeus et al., 2000, Microbiology 146(12): 3051-3059, and by Kaberdin and Blasi, 2006, FEMS Microbiol. Rev. 30(6): 967-979.
Preferred leaders for filamentous fungal host cells may be obtained from the genes for Aspergillus oryzae TAKA amylase and Aspergillus nidulans triose phosphate isomerase.
Suitable leaders for yeast host cells may be obtained from the genes for Saccharomyces cerevisiae enolase (ENO-1), Saccharomyces cerevisiae 3-phosphoglycerate kinase, Saccharomyces cerevisiae alpha-factor, and Saccharomyces cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate dehydrogenase (ADH2/GAP).
Polyadenylation Sequences
The control sequence may also be a polyadenylation sequence, a sequence operably linked to the 3’-terminus of the polynucleotide which, when transcribed, is recognized by the host cell as a signal to add polyadenosine residues to transcribed mRNA. Any polyadenylation sequence that is functional in the host cell may be used.
Preferred polyadenylation sequences for filamentous fungal host cells are obtained from the genes for Aspergillus nidulans anthranilate synthase, Aspergillus niger glucoamylase, Aspergillus niger alpha-glucosidase, Aspergillus oryzae TAKA amylase, and Fusarium oxysporum trypsin-like protease.
Useful polyadenylation sequences for yeast host cells are described by Guo and Sherman, 1995, Mol. Cellular Biol. 15: 5983-5990.
Signal Peptides
The control sequence may also be a signal peptide coding region that encodes a signal peptide linked to the N-terminus of a polypeptide and directs the polypeptide into the cell’s secretory pathway. The 5’-end of the coding sequence of the polynucleotide may inherently contain a signal peptide coding sequence naturally linked in translation reading frame with the segment of the coding sequence that encodes the polypeptide. Alternatively, the 5’-end of the coding sequence may contain a signal peptide coding sequence that is heterologous to the coding sequence. A heterologous signal peptide coding sequence may be required where the coding sequence does not naturally contain a signal peptide coding sequence. Alternatively, a heterologous signal peptide coding sequence may simply replace the natural signal peptide coding sequence to enhance secretion of the polypeptide. Any signal peptide coding sequence that directs the expressed polypeptide into the secretory pathway of a host cell may be used.
Effective signal peptide coding sequences for bacterial host cells are the signal peptide coding sequences obtained from the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus licheniformis subtilisin, Bacillus licheniformis beta-lactamase, Bacillus stearothermophilus alphaamylase, Bacillus stearothermophilus neutral proteases (nprT, nprS, npr/VT), and Bacillus subtilis prsA. Further signal peptides are described by Freudl, 2018, Microbial Cell Factories 17: 52.
Effective signal peptide coding sequences for filamentous fungal host cells are the signal peptide coding sequences obtained from the genes for Aspergillus niger neutral amylase, Aspergillus niger glucoamylase, Aspergillus oryzae TAKA amylase, Humicola insolens cellulase, Humicola insolens endoglucanase V, Humicola lanuginosa lipase, and Rhizomucor miehei aspartic proteinase, such as the signal peptide described by Xu et al., 2018, Biotechnology Letters 40: 949-955 Useful signal peptides for yeast host cells are obtained from the genes for Saccharomyces cerevisiae alpha-factor and Saccharomyces cerevisiae invertase. Other useful signal peptide coding sequences are described by Romanos et al., 1992, supra.
Propeptides
The control sequence may also be a propeptide coding sequence that encodes a propeptide positioned at the N-terminus of a polypeptide. The resultant polypeptide is known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is generally inactive and can be converted to an active polypeptide by catalytic or autocatalytic cleavage of the propeptide from the propolypeptide. The propeptide coding sequence may be obtained from the genes for Bacillus subtilis alkaline protease (aprE), Bacillus subtilis neutral protease (nprT), Myceliophthora thermophila laccase (WO 95/33836), Rhizomucor miehei aspartic proteinase, and Saccharomyces cerevisiae alpha-factor.
Where both signal peptide and propeptide sequences are present, the propeptide sequence is positioned next to the N-terminus of a polypeptide and the signal peptide sequence is positioned next to the N-terminus of the propeptide sequence. Additionally or alternatively, when both signal peptide and propeptide sequences are present, the polypeptide may comprise only a part of the signal peptide sequence and/or only a part of the propeptide sequence. Alternatively, the final or isolated polypeptide may comprise a mixture of mature polypeptides and polypeptides which comprise, either partly or in full length, a propeptide sequence and/or a signal peptide sequence.
Regulatory Sequences
It may also be desirable to add regulatory sequences that regulate expression of the polypeptide relative to the growth of the host cell. Examples of regulatory sequences are those that cause expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Regulatory sequences in prokaryotic systems include the lac, tac, and trp operator systems. In yeast, the ADH2 system or GAL1 system may be used. In filamentous fungi, the Aspergillus niger glucoamylase promoter, Aspergillus oryzae TAKA alpha-amylase promoter, and Aspergillus oryzae glucoamylase promoter, Trichoderma reesei cellobiohydrolase I promoter, and Trichoderma reesei cellobiohydrolase II promoter may be used. Other examples of regulatory sequences are those that allow for gene amplification. In fungal systems, these regulatory sequences include the dihydrofolate reductase gene that is amplified in the presence of methotrexate, and the metallothionein genes that are amplified with heavy metals. Transcription Factors
The control sequence may also be a transcription factor, a polynucleotide encoding a polynucleotide-specific DNA-binding polypeptide that controls the rate of the transcription of genetic information from DNA to mRNA by binding to a specific polynucleotide sequence. The transcription factor may function alone and/or together with one or more other polypeptides or transcription factors in a complex by promoting or blocking the recruitment of RNA polymerase. Transcription factors are characterized by comprising at least one DNA-binding domain which often attaches to a specific DNA sequence adjacent to the genetic elements which are regulated by the transcription factor. The transcription factor may regulate the expression of a protein of interest either directly, i.e., by activating the transcription of the gene encoding the protein of interest by binding to its promoter, or indirectly, i.e., by activating the transcription of a further transcription factor which regulates the transcription of the gene encoding the protein of interest, such as by binding to the promoter of the further transcription factor. Suitable transcription factors for fungal host cells are described in WO 2017/144177. Suitable transcription factors for prokaryotic host cells are described in Seshasayee et al., 2011 , Subcellular Biochemistry 52: 7- 23, as well in Balleza et al., 2009, FEMS Microbiol. Rev. 33(1): 133-151.
Expression Vectors
The present invention also relates to recombinant expression vectors comprising a polynucleotide of the present invention, a promoter, and transcriptional and translational stop signals. The various nucleotide and control sequences may be joined together to produce a recombinant expression vector that may include one or more convenient restriction sites to allow for insertion or substitution of the polynucleotide encoding the polypeptide at such sites. Alternatively, the polynucleotide may be expressed by inserting the polynucleotide or a nucleic acid construct comprising the polynucleotide into an appropriate vector for expression. In creating the expression vector, the coding sequence is located in the vector so that the coding sequence is operably linked with the appropriate control sequences for expression.
The recombinant expression vector may be any vector (e.g., a plasmid or virus) that can be conveniently subjected to recombinant DNA procedures and can bring about expression of the polynucleotide. The choice of the vector will typically depend on the compatibility of the vector with the host cell into which the vector is to be introduced. The vector may be a linear or closed circular plasmid.
The vector may be an autonomously replicating vector, i.e., a vector that exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication, e.g., a plasmid, an extrachromosomal element, a minichromosome, or an artificial chromosome. The vector may contain any means for assuring self-replication. Alternatively, the vector may be one that, when introduced into the host cell, is integrated into the genome and replicated together with the chromosome(s) into which it has been integrated. Furthermore, a single vector or plasmid or two or more vectors or plasmids that together contain the total DNA to be introduced into the genome of the host cell, or a transposon, may be used.
The vector preferably contains one or more selectable markers that permit easy selection of transformed, transfected, transduced, or the like cells. A selectable marker is a gene the product of which provides for biocide or viral resistance, resistance to heavy metals, prototrophy to auxotrophs, and the like.
The vector preferably contains at least one element that permits integration of the vector into the host cell's genome or autonomous replication of the vector in the cell independent of the genome.
For integration into the host cell genome, the vector may rely on the polynucleotide’s sequence encoding the polypeptide or any other element of the vector for integration into the genome by homologous recombination, such as homology-directed repair (HDR), or non- homologous recombination, such as non-homologous end-joining (NHEJ).
For autonomous replication, the vector may further comprise an origin of replication enabling the vector to replicate autonomously in the host cell in question. The origin of replication may be any plasmid replicator mediating autonomous replication that functions in a cell. The term “origin of replication” or “plasmid replicator” means a polynucleotide that enables a plasmid or vector to replicate in vivo.
More than one copy of a polynucleotide of the present invention may be inserted into a host cell to increase production of a polypeptide. For example, 2 or 3 or 4 or 5 or more copies are inserted into a host cell. An increase in the copy number of the polynucleotide can be obtained by integrating at least one additional copy of the sequence into the host cell genome or by including an amplifiable selectable marker gene with the polynucleotide where cells containing amplified copies of the selectable marker gene, and thereby additional copies of the polynucleotide, can be selected for by cultivating the cells in the presence of the appropriate selectable agent.
Host Cells
The present invention also relates to recombinant host cells, comprising a polynucleotide of the present invention operably linked to one or more control sequences that direct the production of a polypeptide of the present invention.
A construct or vector comprising a polynucleotide is introduced into a host cell so that the construct or vector is maintained as a chromosomal integrant or as a self-replicating extra- chromosomal vector as described earlier. The choice of a host cell will to a large extent depend upon the gene encoding the polypeptide and its source. The polypeptide can be native or heterologous to the recombinant host cell. Also, at least one of the one or more control sequences can be heterologous to the polynucleotide encoding the polypeptide. The recombinant host cell may comprise a single copy, or at least two copies, e.g., three, four, five, or more copies of the polynucleotide of the present invention.
The host cell may be any microbial cell useful in the recombinant production of a polypeptide of the present invention, e.g., a prokaryotic cell or a fungal cell.
The prokaryotic host cell may be any Gram-positive or Gram-negative bacterium. Grampositive bacteria include, but are not limited to, Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, and Streptomyces. Gram-negative bacteria include, but are not limited to, Campylobacter, E. coli, Flavobacterium, Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma.
The bacterial host cell may be any Bacillus cell including, but not limited to, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, and Bacillus thuringiensis cells. In an embodiment, the Bacillus cell is a Bacillus amyloliquefaciens, Bacillus licheniformis and Bacillus subtilis cell.
For purposes of this invention, Bacillus classes/genera/species shall be defined as described in Patel and Gupta, 2020, Int. J. Syst. Evol. Microbiol. 70: 406-438.
The bacterial host cell may also be any Streptococcus cell including, but not limited to, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus cells.
The bacterial host cell may also be any Streptomyces cell including, but not limited to, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells.
Methods for introducing DNA into prokaryotic host cells are well-known in the art, and any suitable method can be used including but not limited to protoplast transformation, competent cell transformation, electroporation, conjugation, transduction, with DNA introduced as linearized or as circular polynucleotide. Persons skilled in the art will be readily able to identify a suitable method for introducing DNA into a given prokaryotic cell depending, e.g., on the genus. Methods for introducing DNA into prokaryotic host cells are for example described in Heinze et al., 2018, BMC Microbiology 18:56, Burke et al., 2001 , Proc. Natl. Acad. Sci. USA 98: 6289-6294, Choi et al., 2006, J. Microbiol. Methods Q4: 391-397, and Donald et al., 2013, J. Bacteriol. 195(11): 2612- 2620.
The host cell may be a fungal cell. “Fungi” as used herein includes the phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota as well as the Oomycota and all mitosporic fungi (as defined by Hawksworth et al., In, Ainsworth and Bisby’s Dictionary of The Fungi, 8th edition, 1995, CAB International, University Press, Cambridge, UK).
Fungal cells may be transformed by a process involving protoplast-mediated transformation, Agrobacterium-mediated transformation, electroporation, biolistic method and shock-wave-mediated transformation as reviewed by Li et al., 2017, Microbial Cell Factories 16: 168 and procedures described in EP 238023, Yelton et al., 1984, Proc. Natl. Acad. Sci. USA 81 : 1470-1474, Christensen etal., 1988, Bio/TechnologyQ: 1419-1422, and Lubertozzi and Keasling, 2009, Biotechn. Advances 27: 53-75. However, any method known in the art for introducing DNA into a fungal host cell can be used, and the DNA can be introduced as linearized or as circular polynucleotide.
The fungal host cell may be a yeast cell. “Yeast” as used herein includes ascosporogenous yeast (Endomycetales), basidiosporogenous yeast, and yeast belonging to the Fungi Imperfecti (Blastomycetes). For purposes of this invention, yeast shall be defined as described in Biology and Activities of Yeast (Skinner, Passmore, and Davenport, editors, Soc. App. Bacteriol. Symposium Series No. 9, 1980).
The yeast host cell may be a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell. In a preferred embodiment, the yeast host cell is a Pichia or Komagataella cell, e.g., a Pichia pastoris cell (Komagataella phaffii).
The fungal host cell may be a filamentous fungal cell. “Filamentous fungi” include all filamentous forms of the subdivision Eumycota and Oomycota (as defined by Hawksworth et al., 1995, supra). The filamentous fungi are generally characterized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cerevisiae is by budding of a unicellular thallus and carbon catabolism may be fermentative.
The filamentous fungal host cell may be an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Fili basidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell. In a preferred embodiment, the filamentous fungal host cell is an Aspergillus, Trichoderma or Fusarium cell. In a further preferred embodiment, the filamentous fungal host cell is an Aspergillus niger, Aspergillus oryzae, Trichoderma reesei, or Fusarium venenatum cell.
For example, the filamentous fungal host cell may be an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Tra metes villosa, Tra metes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
In an aspect, the host cell is isolated.
In another aspect, the host cell is purified.
Methods of Production
The present invention also relates to methods of producing a carbonic anhydrase of the present invention, comprising (a) cultivating a cell, which in its wild-type form produces the carbonic anhydrase, under conditions conducive for production of the carbonic anhydrase; and optionally, (b) recovering the carbonic anhydrase.
The present invention also relates to methods of producing a carbonic anhydrase of the present invention, comprising (a) cultivating a recombinant host cell of the present invention under conditions conducive for production of the carbonic anhydrase; and optionally, (b) recovering the carbonic anhydrase.
The host cell is cultivated in a nutrient medium suitable for production of the carbonic anhydrase using methods known in the art. For example, the cell may be cultivated by shake flask cultivation, or small-scale or large-scale fermentation (including continuous, batch, fed- batch, or solid-state, and/or microcarrier-based fermentations) in laboratory or industrial fermentors in a suitable medium and under conditions allowing the polypeptide to be expressed and/or isolated. Suitable media are available from commercial suppliers or may be prepared according to published compositions (e.g., in catalogues of the American Type Culture Collection). If the carbonic anhydrase is secreted into the nutrient medium, the polypeptide can be recovered directly from the medium. If the polypeptide is not secreted, it can be recovered from cell lysates.
The carbonic anhydrase may be detected using methods known in the art that are specific for the polypeptide, including, but not limited to, the use of specific antibodies, formation of an enzyme product, disappearance of an enzyme substrate, or an assay determining the relative or specific activity of the polypeptide.
The carbonic anhydrase may be recovered from the medium using methods known in the art, including, but not limited to, collection, centrifugation, filtration, extraction, spray-drying, evaporation, or precipitation. In one aspect, a whole fermentation broth comprising the polypeptide is recovered. In another aspect, a cell-free fermentation broth comprising the polypeptide is recovered.
The carbonic anhydrase may be purified by a variety of procedures known in the art to obtain substantially pure polypeptides and/or polypeptide fragments (see, e.g., Wingfield, 2015, Current Protocols in Protein Science’, 80(1): 6.1.1-6.1.35; Labrou, 2014, Protein Downstream Processing, 1129: 3-10).
In an alternative aspect, the polypeptide is not recovered.
Compositions Comprising Carbonic anhydrases and Methods of Preparation
The invention provides a composition comprising the carbonic anhydrases of the present invention and preferably an excipient and a method for preparing such a composition comprising admixing the carbonic anhydraseof the invention with an excipient.
In a particular embodiment, a carbonic anhydrase of the invention is the major (polypeptide) component of the composition, e.g., a mono-component composition. In a monocomponent composition the carbonic anhydrase of the invention preferably constitures at least 80% of the cabonic anhydrase activity, more preferably at least 90%, even more preferably at leat 95% and most preferably 100% of the carbonic anhydase activity. The excipient in this context is to be understood as any auxilliary agent or compound used to formulate the composition and includes solvent (e.g., water, inorganic salts, fillers, pigments, waxes), carriers, stabilizers, cross-linking agents, adhesives, preservatives, buffers and the like.
The composition may further comprise one or more additional enzymes, such as one or more additional carbonic anhydrases, a decarboxylase, laccase, or oxidase.
The compositions may be prepared in accordance with methods known in the art and may be in the form of a liquid or a solid composition. For instance, the enzyme composition may be formulated using methods known to the art of formulating technical enzymes and/or pharmaceutical products, e.g., into coated or uncoated granules or micro-granules. The polypeptide of the invention may thus be provided in the form of a granule, preferably a nondusting granule, a liquid, in particular a stabilized liquid, a slurry or a protected polypeptide.
For certain applications, immobilization of the carbonic anhydrase may be preferred. An immobilized enzyme comprises two essential functions, namely the non-catalytic functions that are designed to aid separation (e.g., isolation of catalysts from the application environment, reuse of the catalysts and control of the process) and the catalytic functions that are designed to convert the target compounds (or substrates) within the time and space desired (Cao, Carrier-bound Immobilized Enzymes: Principles, Applications and Design, Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim, Germany, 2005). When an enzyme is immobilized, it is made insoluble to the target compounds (e.g., substrates) it aids converting and to the solvents used. An immobilized enzyme product can be separated from the application environment in order to facilitate its reuse, or to reduce the amount of enzyme needed, or to use the enzyme in a process where substrate is continuously delivered and product is continuously removed from proximity to the enzyme, which, e.g., reduces enzyme cost. Furthermore, enzymes are often stabilized by immobilization. A process involving immobilized enzymes is often continuous, which facilitates easy process control. The immobilized enzyme can be retained as a heterogeneous catalyst by mechanical means, or by inclusion in a definite space. The latter can be done by microencapsulation, e.g., in semi permeable membranes or by inclusion in UF systems using, e.g., hollow fiber modules, etc. Immobilization on porous carriers is also commonly used. This includes binding of the enzyme to the carrier, e.g., by adsorption, complex/ionic/covalent binding, or just simple absorption of soluble enzyme on the carrier and subsequent removal of solvent. Cross-linking of the enzyme can also be used as a means of immobilization. Immobilization of enzyme by inclusion into a carrier is also industrially applied. (Buchholz et al., Biocatalysts and Enzyme Technology, Wiley- VCH Verlag GmbH & Co. KGaA, Weinheim, Germany, 2005). Specific methods of immobilizing enzymes such as carbonic anhydrase include, but are not limited to, spraying of the enzyme together with a liquid medium comprising a polyfunctional amine and a liquid medium comprising a cross-linking agent onto a particulate porous carrier as described in W02007/036235 (hereby incorporated by reference), linking of carbonic anhydrase with a cross-linking agent (e.g., glutaraldehyde) to an ovalbumin layer which in turn adhere to an adhesive layer on a polymeric support as described in W02005/114417 (hereby incorporated by reference), or coupling of carbonic anhydrase to a silica carrier as described in U.S. Patent No. 5,776,741 or to a silane, or a CNBr activated carrier surface such as glass, co-polymerization of carbonic anhydrase with methacrylate on polymer beads as described in Bhattacharya et al., 2003, Biotechnol. Appl. Biochem. 38: 111-117 (hereby incorporated by reference) or using globular protein and adhesive as described in US 2010/068784. The carbonic anhydrase may also be immobilized using tags such as histidine-like tags (e.g., 6x His tag or HQ tag) or a cellulose binding module (CBM) (Liu et al, 2008, Biotechnol. Prog. 25: 68-74).
An embodiment of the present invention is a composition comprising a matrix suitable for immobilization and a carbonic anhydrase described herein. In a further embodiment of the present invention the carbonic anhydrase is immobilized on a matrix. The matrix may for example be selected from the group beads, fabrics, fibers, hollow fibers, membranes, particulates, porous surfaces, rods, structured packing, and tubes. Specific examples of suitable matrices include alumina, bentonite, biopolymers, calcium carbonate, calcium phosphate gel, carbon, cellulose, ceramic supports, clay, collagen, glass, hydroxyapatite, ion-exchange resins, kaolin, nylon, phenolic polymers, polyaminostyrene, polyacrylamide, polypropylene, polymerhydrogels, sephadex, sepharose, silica gel, precipitated silica, and TEFLON-brand PTFE. In an embodiment of the present invention carbonic anhydrase is immobilized on a nylon matrix according to the techniques described in Methods in Enzymology volume XLIV (section in the chapter: Immobilized Enzymes, pages 118-134, edited by Klaus Mosbach, Academic Press, New York, 1976), hereby incorporated by reference. The polypeptide to be included in the composition may be stabilized in accordance with methods known in the art e.g., by stabilizing the polypeptide in the composition by adding an antioxidant or reducing agent to limit oxidation of the polypeptide or it may be stabilized by adding polymers such as PVP, PVA, PEG, sugars, oligomers, polysaccharides or other suitable polymers known to be beneficial to the stability of polypeptides in solid or liquid compositions or it may be stabilized by adding stabilizing ions, such as zinc (e.g. zinc chloride or zinc sulphate) which is present in the enzyme active site. A preservative, such as Proxel, or penicillin, can be added to extend shelf life or performance in application.
In embodiments of the present invention, the carbonic anhydrase is immobilized by adsorption onto a matrix, surface or substrate. Non-limiting examples of a matrix, surface or substrate include those from the group: beads, fabrics, fibers, hollow fibers, membranes, particulates, porous surfaces, rods, structured packing, and tubes. Specific examples of suitable matrices, surfaces or substrates include alumina, bentonite, biopolymers, calcium carbonate, calcium phosphate gel, carbon, cellulose, ceramic supports, clay, collagen, glass, hydroxyapatite, ion-exchange resins, kaolin, nylon, phenolic polymers, polyaminostyrene, polyacrylamide, polyacrylonitrile (acrylic), polyethylene, polypropylene, polyester, polyurethane, polymerhydrogels, sephadex, sepharose, silica gel, precipitated silica, and TEFLON-brand PTFE. In embodiments, the matrices, surfaces or substrates may be dried after adsorption of the enzyme.
In a further embodiment, the composition of the invention is a composition applicable in the capture of carbon dioxide.
Uses
Management and control of carbon dioxide (CO2) concentration is important for a broad range of industrial, agricultural, medical and other technical processes. Conversion of CO2 to alternative chemical forms, such as bicarbonate and carbonate, is one way to remove, separate, or extract CO2 from a CCh-containing medium, thereby controlling the CO2 concentration in that medium. Conversion of CO2 to an alternative chemical form can also function to retain the CO2 in a CC>2-containing medium for a period of time, which can, for example, be useful for transporting CO2 and for utilizing CO2 in chemical and biochemical conversions that depend on the presence of CO2, bicarbonate, or carbonate. Conversion of CO2 to an alternative chemical form, such as bicarbonate, and reconversion from that alternative chemical form back to CO2 is another way to control and manage the CO2 concentration in processes, and provides a means to separate and transport CO2 from one location to another location, and provides a means to separate CO2 from a medium in one time period, and release the CO2 from the medium in a later time period.
Carbon dioxide emissions are a major contributor to the phenomenon of global warming. CO2 is a by-product of combustion, and it creates operational, economic, and environmental problems. CO2 emissions may be controlled by capturing CO2 gas before emitted into the atmosphere. There are several chemical approaches to control the CO2 emissions (A. Kohl and
R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997). However, many of these approaches have drawbacks such as high energy consumption, slow processes, and use of ecologically questionable or toxic compounds.
Technical solutions for extracting CO2 from gases, such as combustion gases, fuel gases, atmospheric gases or respiration gases, using carbonic anhydrases have been described in, for example, WG2006/089423, US 6,524,842, WG2004/007058, WG2004/028667,
US2004/0029257, US 7,132,090, WG2005/114417, US 6,143,556, WG2004/104160, US2005/0214936, WG2008/072979, WG2008/095057, WO2012/003336, WO2012/025577, WO2012/092984, WO 2012/154735, US 7,998,714, WO 2013/151757, US 8,871 ,008, WO2015/126925, US2015/0099289, and in the literature (e.g., Russo et al., 2013, Postcombustion carbon capture mediated by carbonic anhydrase, Sep. Purif. Technol. 107: 331-339;
S. Salmon and A. House, "Enzyme-catalyzed solvents for CO2 separation," in Novel Materials for Carbon Dioxide Mitigation Technology, F. Shi and B. Morreale, Eds., Amsterdam, Elsevier B.V., 2015, pp. 23-86; and, S. Salmon and A. House, “Low-energy solvents for carbon dioxide capture enabled by a combination of enzymes and vacuum regeneration,” Final Scientific/Technical Report for DE-FE0007741 , U.S. Department of Energy, National Energy Technology Laboratory, 2015, DOI: 10.2172/1222645), which are herein incorporated by reference.
Generally, CO2 scrubbing techniques operate by bringing a soluble or immobilized carbonic anhydrase into contact with CO2 which either may be in a gas phase or a liquid phase. In the presence of water, carbonic anhydrase catalyses the conversion of CO2 into bicarbonate ions which may be further protonated or deprotonated to carbonic acid and/or carbonate ions depending on the pH of the medium. The ions may either be utilized to facilitate growth of algae or microorganisms that utilize bicarbonate/carbonate as a carbon source, to induce a pH change in a surrounding medium or supply buffering capacity, to provide bicarbonate/carbonate as an active agent for subsequent chemical processes, or precipitated as a carbonate salt, or converted back into pure CO2, which can then be used (for example in enhanced oil recovery, for production of urea, for food and beverage processing, or to supply CO2 to greenhouses or cultivation ponds), released (for example from a contained life support environment such as a submarine, spacecraft, or artificial lung), compressed (for example for transportation through pipelines), or stored (such as in geological or deep oceanic formations or saline aquifers).
Furthermore, the use of carbonic anhydrase to catalyse the conversion of CO2 to bicarbonate in the presence of cations, such as sodium and potassium, can help accelerate subsequent processes, such as conversion of bicarbonates to carbonates, resulting in more rapid conversion of CO2 into useful alternative chemical forms. CO2 conversion to bicarbonate can improve the productivity of biological systems that utilize the carbon from CO2 to produce chemical compounds (e.g., Atsumi et al., 2009, Direct photosynthetic recycling of carbon dioxide to isobutyraldehyde, Nature Biotechnology 27(12): 1177-1180). CO2 conversion to carbonate, and the combination of carbonate with divalent cations such as calcium and magnesium, for example, is useful for sequestering, or storing, CO2 in large quantities (e.g., Favre et al., 2009, Biocatalytic capture of CO2 with carbonic anhydrase and its transformation to solid carbonate, J. Molec. Cat. B: Enzymatic, 60(3-4): 163-170). Production of carbonates is useful forthe production of concrete, or for above-ground or below-ground CO2 mineralization inside or as part of rocks or geologic formations, e.g., sub-surface limestone or basalt formations, also called “mineral sequestration,” which help mitigate the negative impacts of CO2 release to the atmosphere (e.g., US 2014/0234193). Preferably, sufficient cations and alkalinity to enable the carbonation reaction would be available at the mineralization site, however, these materials could also be delivered to the site, e.g., in the form of seawater, industrial brines, or aqueous waste streams comprising cations and alkalinity.
The presence of carbonic anhydrase in the systems and processes for CO2 utilization can improve reaction efficiencies. For example, by rapidly converting CO2, which has low water solubility, into bicarbonate, which has high water solubility, the presence of CA can help increase the carbon concentration in water systems used for growing algae, especially saline water systems comprising sodium or potassium cations, thereby improving algae growth and productivity. Delivery of CA into CO2 sequestration sites, such as underground storage locations rich in divalent cations, can accelerate the mineralization process by accelerating the first step of the CO2 to bicarbonate to carbonate reaction sequence, and ultimately resulting in precipitated carbonate-based solids. Similarly, in the presence of CO2, CA can accelerate the removal or recovery of cations, especially divalent cations, from aqueous liquids, by accelerating the rate at which CO2 is converted to its ionic forms and made available for ionic complexation with the cations to form solid bicarbonate or carbonate-based products that precipitate in the liquid or can be recovered by filtration or other solid-liquid separation techniques. Conversion of CO2 into its ionic forms, such as bicarbonate, helps dissolve CO2 into water-containing liquids and therefore increases the amount of CO2 that can be loaded into and carried by these liquids. By accelerating the rate of CO2 conversion to bicarbonate, CA can improve processes, such as the CarbFix process (Oelkers et al., 2008, Mineral carbonation of CO2, Elements 4: 333-337), which requires large amounts of water to dissolve CO2, by reducing the amount of water needed to transport a certain amount of CO2 and by promoting the mineralization of CO2.
The carbonic anhydrases described above are useful in a series of applications which are described in more detail below. When referring to carbonic anhydrases herein it is intended to include all the carbonic anhydrases described in the present invention in particular if they fall within the claimed identities.
In particular, carbonic anhydrases may be used for carbon dioxide extraction from CO2 emission streams, e.g., from carbon-based or hydrocarbon-based combustion in electric generation power plants, or from flue gas stacks from such plants, industrial furnaces, stoves, ovens, or fireplaces or from airplane or car exhausts. Carbonic anhydrases may also be used to remove CO2 in the preparation of industrial gases such as acetylene (C2H2), carbon monoxide (CO), chlorine (CI2), hydrogen (H2), methane (CH4), nitrous oxide (N2O), propane (CsHs), sulfur dioxide (SO2), argon (Ar), nitrogen (N2), and oxygen (O2). Carbonic anhydrases can also be used to remove CO2 from a raw natural gas during the processing to natural gas. Removal of CO2 from the raw natural gas will serve to enrich the methane (CH4) content in the natural gas, thereby increasing the thermal units/m3. Raw natural gas is generally obtained from oil wells, gas wells, and condensate wells. Natural gas contains between 1 % to 10% CO2 when obtained from geological natural gas reservoirs by conventional methods, but depending on the natural source or recovery method used may contain up to 50% CO2 or even higher. Carbonic anhydrase can also be used to purify the natural gas such that it is substantially free of CO2, e.g., such that the CO2 content is below 1%, preferably below 0.5%, 0.2%, 0.1 %, 0.05% and most preferably below 0.02%. In resemblance to the methane enrichment of natural gases, carbonic anhydrases can also be used to enrich the methane content in biogases. Biogases will always contain a considerable degree of CO2, since the bacteria used in the fermentation process produce methane (60-70%) and CO2 (30-40%). Biogas production may be performed using mesophilic or thermophilic microorganisms. Thermophilic strains allow the fermentation to occur at elevated temperatures, e.g., from 40°C to 80°C, or from 50°C to 70°C, or from 55°C to 60°C. In such processes a heat-stable carbonic anhydrase is particularly useful to remove CO2 from the methane. The present invention provides for the use of a carbonic anhydrase to reduce the carbon dioxide content in a biogas, preferably the CO2 content is reduced such that it constitutes less than 25%, more preferably less than 20%, 15%, 10%, 5%, 2%, 1%, 0.5% and most preferably less than 0.1 %. In a preferred embodiment the carbonic anhydrase is heat stable. Furthermore, carbonic anhydrase may be applied in the production of syngas by removing the CO2 generated by the gasification of a carbon containing fuel (e.g., methane or natural gas) thereby enriching the CO, H2 content of the syngas. Where syngas production occurs at elevated temperatures the use of a heat-stable carbonic anhydrase is an advantage. The present invention provides for the use of a carbonic anhydrase to reduce the carbon dioxide content in a syngas production. Preferably, the CO2 content is reduced such that it constitutes less than 25%, more preferably less than 20%, 15%, 10%, 5%, 2%, 1%, 0.5% and most preferably less than 0.1%. In a preferred embodiment the carbonic anhydrase is heat stable.
In an aspect of the present invention the CO2 extraction from a CCh-containing medium is performed in enzyme-based bioreactors. Before the carbon dioxide-containing medium is processed in a bioreactor, it may be purified to free it from contaminants which may disturb the enzymatic reaction or interfere with bioreactor functionality in other ways, e.g., by clotting outlets or membranes. Gasses multiphase mixtures emitted from combustion processes, e.g., flue gases or exhausts, are preferably cleared of ash, particles, NOX and/or SO2, before the gas/ multiphase mixture is passed into the bioreactor. Alternatively, SO2 separation and CO2 extraction can occur in the same reactor to improve system efficiency, because both separations are typically operated at alkaline process conditions. The raw natural gas from different regions may have different compositions and separation requirements. Preferably, oil, condensate, water and natural gas liquids, if present in the raw natural gas, are removed prior to the extraction of CO2 in an enzymebased bioreactor. The CO2 emitted from combustion processes or present in the raw natural gas may be extracted in the same process as the sulfur removal, or it may be extracted in a separate process. If the gas at this point exceeds the temperature tolerance of the carbonic anhydrase of the present invention, some degree of cooling may be needed. Preferably, the maximum temperature to which carbonic anhydrase is exposed during CO2 extraction process whether it is the process temperature in the bioreactor or the feed gas temperature may be between 0°C and 120°C. Preferably the maximum process temperature is between 40°C and 120°C, more preferably between 45°C and 110°C, more preferably between 50°C and 100°C, more preferably between 55°C and 90°C even more preferably between 60°C and 80°C, and most preferably between 65°C and 75°C.
Reactors, systems and processes for gas separation, including CO2 extraction, are well known in the art and are used commercially for various purposes (A. Kohl and R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997). By selecting CO2 absorption liquids (also called CO2 absorption solvents) and operational conditions that are compatible with enzyme limitations, CAs of the present invention can be used in any solventbased CO2 extraction reactor to generate a bioreactor (a reactor comprising biological material such as an enzyme) for extracting CO2 from gases, such as combustion gases, atmospheric gases, fuel gases, or respiration gases. CA can be present in these systems in an aqueous- soluble form, can be present as a suspended protein-based solid, and can be present in a form that is chemically or biochemically modified or combined with other materials. These different forms are collectively called biocatalysts.
Schematic diagrams for several common reactors (or gas-liquid contactors) used for CO2 separation systems and processes are shown in Figures 1 to 6. These reactors can be used for CO2 absorption from a gas into a liquid and for CO2 desorption from a liquid to a gas. The reactors can be used for a single once-through unit operation or the gas or liquid streams or both can be recirculated from the outlet to the inlet of a reactor to provide for multiple passes of gas and liquid streams through a reactor. Multiple reactors can be arranged in CO2 gas scrubbing systems in sequential or parallel arrangements, or both, to enable handling of large volumes of gas and liquid or provide high efficiency of CO2 removal. Different types of reactors can be used to form gas scrubbing systems with many different configurations. For example, a counter-current packed reactor (illustrated by Figure 1) can be used for CO2 gas absorption into a liquid and can be arranged in a recirculating system together with the same type or another type of reactor, e.g., a membrane-based reactor (illustrated by Figure 4), used for removing (desorbing) CO2 from the liquid, and the process is repeated when the CCh-lean liquid returns to the counter-current reactor. Many combinations and variations are possible. For continuous flow, the oulet liquid from one reactor travels to the inlet liquid of another reactor in the schematics shown in Figures 1 to 6. The reactors can be large or small.
Figure 7 shows an integrated CO2 scrubbing system utilizing recirculation of the CO2- absorption liquid between the absorber (2) and desorber (9) as well as optional units for biocatalyst separation (6 and 16) and recycling, optional utilization of sweep gas (10) in the desorption stage and optional utilization of secondary air sweep (18) prior to CCh-lean liquid entry to the absorber. Because carbonic anhydrase improves the rate of CO2 extraction, combining carbonic anhydrase with CO2 extraction reactors enables reactor and process improvements such as smaller size and less expensive gas-liquid contactors {e.g, shorter absorption column), the use of process intensification approaches (e.g., horizontal spray reactors and rotating packed bed reactors), and use of low energy consuming and low volatility CCh-absorption liquids, as well as overall lower operating temperatures compared to the conventional approaches.
One type of reactor uses liquid membranes. This may for example be reactors including hollow fiber membranes containing a liquid film as described in Majumdar ef a/., 1988, AIChE 34: 1135-1145; US 4,750,918; US 6,156,096; W004/104160. Such hollow fiber membrane-based designs are also sometimes termed hollow fiber liquid membranes (HFLM) and the CO2 separation devices based on these have been termed hollow fiber contained liquid membrane (HFCLM) permeators. A common feature of HFCLM permeators is that the hollow fibers enclosing the feed and sweep gas streams are near (/.e., “tightly packed” or “immediately adjacent”) to one another and they are enclosed in a single rigid treatment chamber to form one complete permeator. In such a design, a liquid surrounds the shell side of the tightly packed feed and sweep hollow fibers. Because the distance between the outside wall of one hollow fiber is very close to adjacent hollow fibers the thickness of the liquid layer between them is thin, like a membrane, and the composition of the liquid only allows certain components to pass, hence the term “liquid membrane” has been used to describe the liquid surrounding the hollow fibers. Contained liquid membrane permeators where the liquid film is sandwiched between two structural support membranes have also been described in the art (Cowan et al., 2003, Ann. NY Acad. Sci. 984: 453-469); this design essentially functions in the same way as the HFCLM. Contained liquid membrane permeators have also been used in combination with carbonic anhydrase as described in US 6,143,556, WO 2004/104160, Cowan et al., 2003, Ann. NY Acad. Sci. 984: 453-469; and Trachtenberg et al., 2003, SAE international Conference on Environmental Systems Docket number 2003-01-2499. In these cases, the CO2 desorption step takes place in the same enclosed treatment chamber as the absorption step. Another example describes an amine-based CO2 capture reactor based on absorber/desorber hollow fiber membrane modules (Kosaraju et al., 2005, Ind. Eng. Chem. Res. 44:1250-1258). Another type of reactor uses direct gas-liquid contact. This may for example be conventional solvent-based CO2 capture reactors that are based on absorber/desorber column reactors (US 2008/0056972, Reddy etal., Second National Conference on Carbon Sequestration, NETUDOE, Alexandria, VA, May 5-8, 2003). Example flow schemes for commercial direct gasliquid contactor reactors that use alkanolamines (such as monoethanolamine, diethanolamine, and methyldiethanolamine) for CO2 extraction are shown in A. Kohl and R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997: 57-62. Example flow schemes for commercial direct gas-liquid contactor reactors that use alkaline salt solutions (such as potassium carbonate) for CO2 extraction are shown in A. Kohl and R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997: 334-340.
Direct gas-liquid contact reactors using carbonic anhydrase have been described in US 6,524,843; WG2004/007058, WG2004/056455, US 7,176,017, and US2004/0059231. In these types of reactors, the gas phase or multiphase mixture, is contacted with a liquid phase under conditions where the CO2 in the gas phase is absorbed by the liquid phase where it is converted into bicarbonate by carbonic anhydrase. The bicarbonate enriched liquid is removed from the reactor by a continuous flow, to ensure that the equilibrium between CO2 and bicarbonate is shifted towards continuous conversion of CO2. The gas phase dissolution into the liquid phase is dependent on the surface contact area between the gas and liquid. A large contact area can for example be achieved by passing liquid and CO2-containing gas through a high surface area packed column, tray or plate column or tower, by spraying small droplets of liquid through the CO2-containing gas (/.e., a spray contactor), or by bubbling the CO2-containing gas through the liquid (/.e., bubble tank or pond), or by combinations of these techniques. Packed columns can comprise packings such as raschig rings, berl saddles, lessing rings, intalox metal, intalox saddles, pall rings or engineered packings such as Q-PAC (Lantec Products, Inc., Agoura Hills, CA 91301). The packing materials may be comprised of a polymer such as nylon, polyester, polyethylene, polyetheretherketone, polypropylene, polystyrene or fluoropolymer (e.g., polytetrafluoroethylene), a ceramic such as silica, or a metal such as aluminium, carbon steel, or stainless steel, or a cellulose-based material such as wood or cotton fiber.
In reactor types where the liquid is continuously exchanged or when it is desirable to restrain carbonic anhydrase to one or more locations in the reactor, carbonic anhydrase may be retained in the reactor by various means. In packed columns the carbonic anhydrase can be immobilized on the packing material (for methods of immobilizing CA, see for example in WG2005/114417, WO2013/151757) or can be immobilized as particles (e.g., US2015/0099289 and Yan et al., 2007, Fabrication of single carbonic anhydrase nanogel against denaturation and aggregation at high temperature, Biomacromolecules 8: 560-565) that recirculate from the outlet to the inlet of the packed column along with at least a portion of the liquid flow and avoiding travel through other parts of the system, or can circulate along with the CO2-absorption liquid throughout the process, both absorption and desorption, due to enzyme stabilization imparted by the combination of enzyme with the immobilization matrix. The use of particles in gas-liquid contactors containing packing materials, baffles, and other internals is possible when the particles have the size and physical properties to flow along with the liquid. Small particles comprising CA are known to enhance CO2 absorption, and is explained by the ability of carbonic anhydrase to rapidly catalyse the CO2 hydration reaction together with the ability of small particles to be located and move around in the thin liquid film interface between the gas and liquid phases, allowing the substrate (CO2) to rapidly come in contact with the CA catalyst (E. Alper and W.D. Deckwer, Some aspects of gas absorption mechanism in slurry reactors, in “Mass Transfer with Chemical Reaction in Multiphase Systems,” E. Alper (ed.), Springer Science & Business Media, Dordrecht, 1983, pp. 199-224).
Different sized screens, filters or solid-liquid separation techniques, can be used to restrain CA enzyme, chemically or physically modified CA enzyme, or immobilized CA enzyme to particular operational units, regions, or locations in the CO2 scrubbing system. Such techniques can also be used to restrain one type or mixture of CA biocatalysts in one reactor and another type or mixture of CA biocatalysts in another reactor zone. For example, this technique can be used to localize different CAs in the absorber and in the desorber. In “bubbling” reactors the carbonic anhydrase can be entrapped in a porous substrate, for example, an insoluble gel particle such as silica, silicone, urethane, alginate, alginate/chitosan, alginate/ carboxymethylcellulose, or the carbonic anhydrase can be immobilized (by covalent bonds, ionic charges, entrapment or encapsulation) on a fixed solid packing, or can be immobilized on or in particles in suspension in the liquid, or the carbonic anhydrase can be chemically linked in an albumin or PEG network. Carbonic anhydrase can also be restrained to a particular location in the reactor by entrapment in a polymeric immobilization material which may comprise a micellar or inverted micellar material, such as described in WO2010/037109, and may include chemical modification of the enzyme as part of the entrapment or immobilization technique (e.g., WO2012/122404). CAs of the present invention can be immobilized by these and other techniques known in the art.
Spray contactors may include vertical or horizontal spray chambers, countercurrent spray columns, venturi scrubbers, ejectors or jet scrubbers, cyclone scrubbers, and spray dryers (A. Kohl and R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997: 418-427 and 604-616). Use of spray contactors is desirable for avoiding pressure drop and improving tolerance to solid particulates in the gas, such as may be important for atmospheric pressure post-combustion exhaust gas applications. However, to be most effective, the rate of CO2 absorption in spray contactors must be fast, and carbonic anhydrase can provide the needed catalysis to achieve these fast rates.
CO2 extraction in a direct gas-liquid contact reactor may involve a first absorption stage followed by optionally a subsequent desorption, precipitation, utilization, collection, regeneration or release stage. A general description of the absorption stage is as follows. When the absorption reactor is in operation, a water-containing liquid enters the reactor at one end, preferably the top, and flows to the other end, preferably the bottom, and the CCh-containing gas stream (feed gas) enters the reactor at one end, preferably at the opposite end (the bottom) (“countercurrent”) from the liquid and the gas passes through the liquid and exits, minus the CO2 extracted into the liquid, through a gas outlet at the opposite end (preferably, the top of the reactor). The liquid that exits the absorption reactor is enriched in bicarbonate/carbonate (CCh-rich liquid) and the exit gas is reduced in the CO2 content compared to the feed gas. The CCh-rich liquid may be processed in subsequent reactions, for example to generate pure CO2 by passing through a desorption reactor or produce carbonate precipitates such as CaCCh. The CCh-rich liquid from the absorption reactor can also be utilized, e.g., to enhance algae growth, collected, e.g., by pumping the CO2- rich liquid into a contained geological formation, released, e.g., by pumping the CCh-rich liquid into the environment, such as release of bicarbonate liquid into seawater from a submarine life support system, evaporated or desalinated. The CCh-rich liquid containing bicarbonate anion can be used in industrial processes, such as in the manufacturing processes for ammonium carbonate and ammonium bicarbonate, which are useful as fertilizer, or in processes for the removal and neutralization of acid gases such as sulfur dioxide.
The reactors described herein may involve an absorption stage, a desorption stage or a sequence of absorption and desorption stages in which carbonic anhydrase may catalyze either the hydration of CO2 to bicarbonate or the dehydration of bicarbonate to CO2 or both. The reactors can be combined with each other where each reactor constitutes a module. For example, a liquid membrane reactor can function as absorption module and the direct gas-liquid contact reactor can function as a desorption module or vice versa.
The terms “CCh-lean” and “CCh-rich” absorption liquid are terms used in the present invention to describe the relative amount of carbon (e.g., in the form of dissolved CO2, chemically reacted CO2, bicarbonate, carbonic acid and/or carbonate salt) present in the absorption liquid as it circulates through the process. As used herein, the term “CCh-lean liquid” generally refers to absorption liquid entering an absorption unit. The term “CC>2-rich liquid” generally refers to an absorption liquid entering a desorption unit. It is understood that the term “CC>2-lean liquid” can also be applied to absorption liquid exiting a desorption module, and the term “CCh-rich liquid” can also be applied to absorption liquid exiting an absorption unit. CCh-rich liquid contains more carbon compared to CCh-lean liquid within a given system at a given point in time. As used herein, the term “CCh-rich gas” generally refers to a gas mixture with a relatively high CO2 content, or it can be a pure stream of CO2 gas. A CCh-rich gas can be a feed gas. The term “CCh-lean gas” generally refers to a gas mixture that is depleted in CO2 content compared to the CCh-rich gas from which at least a portion of CO2 was removed. A CCh-lean gas can be a gas that does not comprise CO2, e.g., a pure stream of nitrogen gas. A CCh-lean gas can be used as a sweep gas to help remove CO2 from a CCh-rich liquid.
Without limiting the scope of the present invention, Figure 7 is provided to illustrate a general schematic of a CO2 extraction system comprising both absorption and desorption units through which the CO2 absorption liquid circulates as it removes CO2 from a CCh-containing gaseous phase (feed gas, 1) in the absorber (2), releases purified CO2 gas (14) from the desorber (9), and recirculates back to the absorber. The term “feed gas” is often used in relation to CO2 extraction reactors where it implies that CO2 is removed from the CO2 containing gaseous phase by contact with a CCh-lean absorption liquid in the reactor. The feed gas may be at atmospheric pressure, or at pressures above or below atmospheric pressure. Selective solubility of CO2 in the absorption liquid causes extraction of CO2 from the feed gas into the absorption liquid in the absorber. In the desorber, CO2 is released from the CCh-rich absorption liquid by introducing a pressure difference (for example, a lower partial pressure of CO2 in the desorber gas phase compared to that in the feed gas, such as can be achieved by applying vacuum in the desorber, or can be achieved by passing a sweep gas through the desorber, such as air or a condensable sweep gas) that lowers the solubility of CO2 in the carrier liquid and/or applying heat, e.g., via a reboiler, steam or a sweep gas to drive CO2 into the gas phase in the desorber. Heat for desorption can also be applied by inducing cavitation, e.g., through application of ultrasonic or other acoustic or vibrational energy, and by applying microwave or infrared energy to the CO2- rich liquid. More than one desorption stage can be used to optimize the efficiency of CO2 release. For example, heat can be applied in one stage of the desorber to remove and capture the bulk of the CO2, followed by one or more secondary desorption stages, e.g., using air sweep, to remove additional CO2 and ‘polish’ the liquid to a more CCh-lean loading. Heat energy alone can be used to drive desorption such as is commonly used in monoethanol amine-based CO2 extraction processes. For example, the temperature in the desorber of a typical monoethanol amine-based CO2 extraction is greater than 100°C (e.g., 120°C). Alternatively heat energy can be combined with pressure reduction to drive desorption. In this case the temperature in the desorber can be lowered. For example, together with a reduced pressure (e.g., vacuum) compared to the pressure in the absorber (e.g., atmospheric pressure), the desorber can be operated at 70°C. A difference in pH can be used to facilitate absorption and desorption, wherein CO2 absorption into an aqueous medium is favored at more alkaline pH whereas CO2 desorption from an aqueous medium is favored at a less alkaline (more acidic) pH. The range of relevant pH difference (“swing”) between absorption and desorption depends on the particular process. For example, for the sake of illustration, CO2 absorption into a bicarbonate-based carrier liquid can occur at pH 9 or above resulting in a decrease in the pH of that carrier liquid to below pH 9. Desorption of CO2 from that carrier liquid can then occur at pH below pH 9.
A pressure difference between the absorber and the desorber can be established/occur when the pressure of the feed gas passing through the absorber is higher than the pressure of the gas phase in the desorber. In some cases, such as for natural gas upgrading, the gas pressure in the absorber is higher than in the desorber and the gas pressures in both the absorber and the desorber may be above atmospheric pressure. In other cases, the gas pressure in the absorber is above atmospheric pressure and the gas pressure in the desorber is at or below atmospheric pressure (i.e., equal to or less than 100 kPa). Alternatively, a pressure difference between the absorber and the desorber can be established/occur when the pressure of the feed gas (such as a coal-fired post-combustion flue gas) passing through the absorber is approximately at atmospheric pressure and the pressure of the gas phase in the desorber is below atmospheric pressure. In one embodiment of the present invention, the total gas pressure difference between the absorber and the desorber is at least about 35 kPa. Alternatively, a sweep gas comprising no or low concentration of CO2, such as air, can be used to provide the driving force needed to release CO2 from the CCh-rich liquid as it passes through the desorber. CO2 absorption and desorption can be operated in batch mode in a single reactor by first exposing the lean liquid in the reactor to CCh-rich gas for a period of time, then applying desorption driving force options, such as heat, vacuum, acoustic or cavitation effects, or sweep gas, or combinations of these, to release CO2 from the CCh-rich liquid for a period of time to regenerate the CCh-lean liquid. The sweep gas can be heated to provide both thermal and partial pressure driving forces for CO2 release. The cycle can be repeated. Air sweep desorption can be desirable for applications where CO2 separation is needed and the CO2 being separated can be released to the atmosphere, for example when the released CO2 is regarded as CCh-neutral emissions, such as the CO2 separated from methane during biogas upgrading.
One embodiment of the invention encompasses the use of condensable sweep gas compounds in the desorption stage of a CO2 gas separation reactor. A typical solvent-based CO2 gas separation reactor has two main stages: 1) an absorption stage, and 2) a desorption stage. In the absorption stage, CO2 from the gas phase is absorbed by the liquid phase. In the desorption stage, CO2 in the liquid phase is released to the gas phase. In order for CO2 to be released from the liquid to the gas phase, a concentration gradient must exist, wherein the concentration (partial pressure) of CO2 in the gas phase is less than the concentration (partial pressure) of CO2 in the liquid phase. A “sweep” stream of non-CCh gas applied in the desorption stage can assist the mass transfer of CO2 from the liquid to the sweep gas phase because the incoming sweep gas has no or low concentration of CO2, causing CO2 from the liquid phase to move in the direction of the low partial pressure of CO2. One type of sweep gas is water vapour (water in the gas phase), such as is produced when aqueous liquids are boiled. However, considerable energy is required to boil water. Air or nitrogen gas can also be used as sweep gases, however there is no easy way to separate these gases from CO2 after the desorption step, which may be necessary to obtain an isolated or purified CO2 product. It is therefore desirable to utilize compounds that can perform as a sweep gas without the energy cost of boiling water and also have properties that allow these compounds to be easily separated from CO2 after the desorption step to provide a purified stream of CO2 gas and recovered sweep gas compounds that can be used for subsequent desorption operations.
The use of certain non-water-miscible volatile carriers together with CO2 absorption solvents, such as aqueous MEA, in CO2 stripping processes has been described (R.A. Frimpong, J.E. Remias, J.K. Neathery, M. Liu and K. Liu, Enhancing solvent regeneration with a high volatility liquid as a stripping carrier, Tenth Annual Conference on Carbon Capture & Sequestration, May 2-5, 2011 , Pittsburgh, Pennsylvania; Proceedings on CD-ROM, Exchange Monitor Publications & Forums, 4455 Connecticut Ave NW, Suite A700, Washington, DC 20008). These systems can include CA to enhance desorption efficiency by overcoming rate limitations that may exist in the conversion of bicarbonate to CO2 during these processes, and CAs can be selected and engineered to be compatible with and withstand these processes whether or not they perform a specific catalytic role in these processes.
Condensable sweep gas compounds have the properties of a boiling temperature (equivalent to condensation temperature) that is substantially higher than the boiling temperature of CO2, while at the same time, preferably, being lower than the boiling temperature of water. One such compound is dimethyl ether, which has a boiling point (b.p.) of -24°C at 1 atmosphere pressure. This low boiling point facilitates the removal of dimethyl ether from reaction mixtures. Dimethyl ether is a gas at temperatures below the boiling point of water, and dimethyl ether will condense to a liquid at temperatures significantly higher than the sublimation temperature of CO2 (-78.5°C, 1 atm). Therefore, a sweep gas containing a mixture of CO2 and dimethyl ether can be passed through a chilled condenser at temperatures between -78°C to -24°C to cause the dimethyl ether to condense to a liquid and become trapped by the condenser while allowing the CO2 to pass through the condenser as a purified gas stream. The dimethyl ether liquid can be recycled for reuse as a sweep gas. Dimethyl ether is a non-toxic, inexpensive compound, resistant to auto-oxidation compared to other alkyl ethers, and is considered as an alternative fuel or a renewable fuel (BioDME) from gasification of lignocellulosic biomass. Therefore, dimethyl ether is readily available for sweep gas applications. Dimethyl ether is combustible, therefore engineering controls are needed to prevent combustion during use. Such controls can be optimized to take advantage of the feature that the CO2 compound being removed is itself non-combustible. Other examples of condensable sweep gas compounds are ethanol (b.p. 78.4°C), methanol (b.p. 64.7°C), acetone (b.p. 56°C), propanol (b.p. 97-98°C), isopropanol (b.p. 82.6°C), tertiary butanol (b.p. 82.2°C), and diethyl ether (b.p. 34.6°C). In each case, appropriate engineering controls are needed to prevent unwanted combustion or decomposition, e.g., a risk associated with diethyl ether is decomposition to explosive peroxides. In a preferred embodiment, the condensable sweep gas compound is used in an amount that does not significantly diminish the CC>2-loading capacity achievable with the CO2 absorption liquid.
In one preferred embodiment, dimethyl ether is the condensable sweep gas because the CO2 desorption can be carried out at low temperatures, such as at ambient temperature, or in the range 0-90°C or in the range 40-60°C, to enable a near-isothermal process between the absorber temperature (typically 40°C for a post-flue gas desulfurization combustion flue gas, or a biogas) and the desorber temperature, which can be held at 40°C or raised to a moderately higher temperature, for example to ensure solubility of the CCh-rich liquid until desorption has occurred.
The condensable sweep gas can be contacted directly with the CC>2-containing liquid, or can be separated from the CCh-containing liquid, such as by a CCh-permeable membrane, analogous to an industrial process called “pervaporation,” where the liquid-phase feed is separated from the vapour-phase permeate by a membrane that is selective for the desired components. Vacuum is optionally applied on the permeate side to provide a low partial pressure of the desired component and drive the mass transfer of that component across the membrane. In one embodiment, the condensable sweep gas compound directly contacts the liquid phase comprising the CO2 to be removed. In a preferred embodiment, the presence of the condensable sweep has no or minimal impact on the amount of CO2 that can be absorbed by the absorption solution (also called “loading capacity”).
In one embodiment, the condensable sweep gas compound is a volatile compound that is soluble in or miscible with water and has a lower boiling point than water, but does not form an azeotrope with water, e.g., methanol (b.p. 64.7°C). The liquid phase comprising the CO2 to be removed can comprise the water-soluble condensable sweep gas compound. In the reaction zone where desorption of CO2 is carried out, the temperature is raised above the boiling point of the water-soluble condensable sweep gas compound, and the sweep gas compound evaporates from the water-based liquid, carrying CO2 along in the vapour phase. Alternatively, the water- soluble condensable sweep gas compound can be raised to a temperature above its boiling point to convert it to a gaseous form, and the gaseous form can be contacted with the liquid phase comprising the CO2 to be removed, preferably maintaining the temperature above the boiling point of the water-soluble condensable sweep gas.
In one embodiment, the condensable sweep gas compound is a volatile compound that may form an azeotropic mixture together with water and boils at a lower temperature than the boiling temperature of water, preferably in the presence of CO2 absorption compounds. Upon exposure to heat, the volatile compound performs as a sweep gas as it vaporizes from the mixture. CO2 dissolved in the liquid is carried along with the sweep gas at lower temperatures than if only water was present. This means less energy is required to remove CO2 from the liquid. For example, water-miscible compositions, such as ethanol/water, when heated will volatilize towards a low-boiling azeotropic composition. In the case of ethanol/water, the azeotropic composition is 95.629 wt% ethanol, which can, for example, be prepared by mixing 95.629 g dry ethanol with pure water to make a total of 100 g solution (NIST Standard Reference Material 1828). At the azeotropic composition, the composition (of, e.g., ethanol and water) in the vapor phase is the same as the composition in liquid phase. Hence, boiling an azeotropic liquid at its azeotropic composition does not result in a change in the liquid composition. Ethanol boils at 78.4°C and water boils at 100°C, whereas the azeotrope boils at 78.2°C, which is lower than either of its constituents. Upon heating (or distilling), compositions of ethanol and water, wherein the proportion of ethanol is less than the azeotropic composition, will release relatively more ethanol into the gas phase than water, and the composition of the boiling liquid phase will become less concentrated in ethanol and more concentrated in water. Meanwhile, the proportion of ethanol in the gas phase will increase, and the collected condensate will have a higher proportion of ethanol compared to the initial boiling liquid composition. Therefore, ethanol can perform the function of a sweep gas by vaporizing at a lower temperature compared to water.
In one embodiment, the condensable sweep gas has low solubility in water or is easily separated from water to avoid the sweep gas component being retained by the CCh-lean liquid as it exits the desorption stage. For example, tertiary butanol is not miscible with water and has a boiling point of 82°C, which is lower than the boiling point of water. Therefore, tertiary butanol can function as a sweep gas and can be subsequently easily separated from water, e.g., in a condensation tank that provides for liquid-liquid separation. Additives, such as surfactants, can be present to enhance the miscibility of the sweep gas compound with water during the CO2 extraction stage to enhance the effectiveness of the sweep gas in removing CO2. Also, additives, such as surfactants, can be present to enhance the interaction of CO2 with the sweep gas, or reduce the interaction of CO2 with the CO2 absorption solution, in either case leading to enhanced release of CO2 from the CO2 absorption solution.
The ability to conduct low temperature desorption with a condensable sweep gas is especially beneficial when combined with a catalyst, such as a carbonic anhydrase, to minimize rate limitations in the conversion of bicarbonate to CO2 when CO2 desorption is carried out at low temperature.
In one embodiment, the presence of carbonic anhydrase in the liquid or in contact with the liquid means that a constant supply of dissolved CO2 will be available in the solution as the result of the CA catalyzed conversion of bicarbonate to CO2 in aqueous solutions comprising bicarbonate. The resulting vapour phase contains CO2, H2O and sweep gas compound (e.g., ethanol). Water and ethanol are recovered from the vapour phase in a cold trap, resulting in a pure CO2 gas at the exit.
The absorber and desorber shown schematically in Figure 7 can be at essentially the same (“isothermal”) temperature or at different temperatures. A carbonic anhydrase may be present in only the absorber or the desorber or both. Regeneration of CO2 using vacuum (low pressure) at low temperatures, e.g., 70°C in the desorber where a high temperature carbonic anhydrase such as carbonic anhydrase is present is a further embodiment of the present invention. Carbonic anhydrase in such a process catalysis both absorption and desorption of CO2 to and from the absorption solvent. When the absorber and desorber are at different temperatures, a temperature regulator (e.g., heat exchanger) can be used to conserve energy in the process.
In a further illustration, a modification of the vacuum carbonate process for H2S absorption (A. Kohl and R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997: 383-388) has been described for CO2 extraction (US 2007/0256559) and disclosed in combination with carbonic anhydrase (Lu et al., DOE Project No. DE-FC26-08NT0005498, NETL CO2 Capture Technology for Existing Plants R&D Meeting, March 24-26, 2009, Pittsburgh, PA). In this illustration, atmospheric pressure power plant flue gas contacts aqueous potassium carbonate and carbonic anhydrase in the absorber column at temperatures in the range 40 to 60°C, where carbonic anhydrase improves the rate of CO2 hydration to bicarbonate in the carrier liquid. The CO2-rich absorption liquid is pumped to a desorber column (or “stripper”) where CO2 is released from the absorption liquid by a combination of low pressure (e.g., 14- 55 KPa) and the application of heat (e.g., 50-70°C) obtained by directly injecting low-pressure, low-quality exhaust steam from a low pressure steam turbine of the power plant. Carbonic anhydrases of the present invention are especially suitable for use in the described modified vacuum carbonate process because they can tolerate temperatures both in the absorber and the desorber, meaning that, carbonic anhydrases can recirculate along with the absorption liquid through both absorption and desorption stages of the process.
A further type of reactor uses membranes in combination with CO2 hydration catalysis by CA followed by precipitation. In one case, CO2 is removed from a gaseous stream by passing the gaseous stream through a gas diffusion membrane into solution where conversion to bicarbonate and, subsequently, to carbonate is accelerated by passing the CO2 solution over a matrix that contains CA and adding a mineral ion to cause precipitation of the carbonic acid salt (US 7,132,090). It has been shown that CA can not only catalyse the CO2 hydration/dehydration reaction but can also promote the precipitation of calcium carbonate (Mirjafari et al., 2007, Ind. Eng. Che. Res., 46: 921-926).
A further type of reactor removes CO2 from ambient air. A reactor designed to remove CO2 from ambient air have been reported (Stolaroff et al. 2008 Environ. Sci. Technol., 42: 2728- 2735), however this reactor does not utilize carbonic anhydrase. Without being bound by the design of the reported ambient air reactor, a CA combined with suitable absorption liquids as disclosed in the present invention, could be used in such a reactor or in other reactor designs as described herein. A heat stable carbonic anhydrase is especially useful because exposure of the reactor to environmental conditions, such as sunlight, may increase the liquid temperature requiring the CA to have good thermostability, therebyavoiding the need to cool the reactor. This illustrates a situation where the process of extracting CO2 from the CCh-containing medium requires CA to function at or tolerate higher temperatures than the initial temperature of the CO2- containing medium, such as ambient air, which may be cold at night (below 10°C) and hot during the day (above 45°C).
The different membrane reactors and direct gas-liquid contact reactors described herein as well as other alternatives may be applied in a carbon dioxide extraction process, where the absorption process and desorption process occur in at least two steps. Such reactors generally comprise the following elements: a) at least one absorption unit, which may comprise a gas inlet zone and/or a gas outlet zone; b) at least one desorption unit comprising a gas outlet zone; c) a C02 absorption liquid; and d) means for connecting the absorption unit(s) and the desorption unit(s) such that the absorption liquid can pass from the absorption unit(s) to the desorption unit(s). Optionally the means for connecting the absorption and desorption units is a circuit, allowing the absorption liquid to be returned to the absorption unit once it has passed through the desorption unit. One or both of the units may comprise at least one C02-permeable membrane which separates a gas phase from a liquid phase, such as described in WO 2010/014773 and WO 2010/014774. This type of membrane unit is also termed a gas-liquid membrane (GLM) unit. The GLM unit may, e.g., be in the form of a hollow fiber membrane, a flat sheet membrane or a spiralwound membrane. The GLM unit may either function as an absorber unit and/or a desorber unit. Alternatively, one of the units may be a GLM unit and the other unit may be composed such that the gas and liquid phases are in direct contact or in other words the gas-liquid interface is not separated by a membrane. This type of unit is also termed a direct gas-liquid contact (DGLC) unit or just a direct contact (DC) unit. The DGLC unit may, e.g., be in the form of a column filled with packing material that allows for gas-liquid contact, and/or a liquid-containing vessel equipped with an inlet for exposing gas to the liquid (such as a bubble column), and/or a liquid-spray (such as a spray tower) and/or an aerator unit and/or a falling film. The DGLC unit may either function as an absorber unit or a desorber unit. Bubble cap system, sieve plate system, disk-and-doughnut column and packed column are examples of the internals found in DGLC units.
The reactor types described above may be operated at any desired temperature. In one embodiment, the reactor is operated with a temperature of the liquid in contact with and/or comprising carbonic anhydrase between 0°C and 120°C or 5°C and 110°C, more preferably between 10°C and 100°C, more preferably between 20°C and 95°C, more preferably between 30°C and 90°C, more preferably between 40°C and 85°C, more preferably between 40°C and 80°C, more preferably between 40°C and 75°C, and more preferably between 40°C and 70°C, and most preferably between 40°C and 60°C.
The absorption and desorption rates of CO2 are dependent on the pH in the absorption liquid. In the reactor types described in relation to the present invention the pH of the CO2-lean absorption liquid is between pH 4 to12, preferably above pH 7 (as measured at room temperature, e.g., 20-25°C), more preferably above pH 8, more preferably between 8 and 12, more preferably between 8 and 10.5, more preferably between 8.5 and 10, even more preferably between 9 and 9.5. The hydration of CO2 to to bicarbonate during absorption results in release of a proton causing the pH of the absorption liquid to decrease as the carbon content of the CO2-rich absorption liquid increases. The extent of pH decrease depends on the buffering capacity of the absorption liquid and the amount of CO2 absorbed. In a preferred embodiment of the present invention the absorption liquid is a bicarbonate-based buffer or a carbonate-based buffer, such as lithium bicarbonate, sodium bicarbonate, potassium bicarbonate, cesium bicarbonate, ammonium bicarbonate or another suitable salt of the bicarbonate, or lithium carbonate, sodium carbonate, potassium carbonate, cesium carbonate, ammonium carbonate or another suitable salt of the carbonate, or combinations of bicarbonate and carbonate compounds, where, depending on the pH, greater or lesser amount of carbonate and/or carbonic acid will exist together with bicarbonate.
In one embodiment of the present invention, the CCh-rich absorption liquid passes through a desorption stage where the pH of the CCh-rich absorption liquid will increase as the CO2 is released. In order to recirculate absorption liquid through such an absorption-desorption system, it is preferred that the pH of the absorption liquid returns to the pH of the CCh-lean absorption liquid before again passing through the absorption stage.
In a preferred embodiment of the present invention the reactor is equipped with means for regulating pH in the absorption liquid. This can be performed in several ways. One way is to add an alkaline substance to the absorption liquid, e.g., at one of the auxiliary component’s addition points (20) indicated in Figure 7, using automatic pH adjustment equipment such as an automatic titrator. The alkaline substance preferably has a similar composition (e.g., concentration of solvent, ionic strength, amount of carbonic anhydrase, etc.) as the absorption liquid circulating in the system and can be added at any time before absorption for adjustment of pH. Similarly, a neutral to acidic substance can be added to the absorption liquid any time before desorption, e.g., at one of the auxiliary component’s addition points (21) indicated in Figure 7. Extra absorption liquid can be removed from the system if needed, e.g., at one of the removal points (24 and 25) indicated in Figure 7.
In the CO2 capture processes described herein the carbonic anhydrases of the present invention may be combined with one or more other CAs. Different process steps in the overall CO2 scrubbing process may require optimization of operating conditions, e.g., temperature, pH, carrier liquid compositions, pressure and so forth. The CAs of the present invention may be combined with other CAs operating at different optimal conditions and which are suitable for use in the CO2 scrubbing process. For example, one CA can circulate throughout the system along with the absorption liquid and a different CA can be immobilized at one or more locations in the system.
The carbonic anhydrases of the present invention and bicatalyst based bioreactors described herein comprising a CA of the present invention also find more unconventional applications, such as in pilot cockpits, submarine vessels, aquatic gear, safety and firefighting gear, astronaut space suits and artificial lung devices to keep breathing air free of toxic CO2 levels. Other applications are to remove CO2 from confined spaces, such as to reduce hazardous CO2 levels from inside breweries and enclosed buildings carrying out fermentation, and from CO2 sensitive environments like museums and libraries, to prevent excessive CO2 from causing acid damage to books and artwork. Another application is to remove CO2 from hot ambient air, e.g., in a desert. In this case the carbonic anhydrase could for example be comprised in a reactor suitable for extracting CO2 from ambient air as described in Stolaroff et al. 2008 Environ. Sei. Techno , 42, 2728-2735, such a reactor can, for example, take the form of an “artificial tree” or a windmill as described in W02008/041920.
The carbonic anhydrases described herein can be used alone as a CO2 extraction biocatalyst together with a water-based absorption liquid or it may optionally be combined with conventional CO2 extraction technologies such as chemical absorption via amine-based solvents or aqueous ammonia or physical solvents such as Selexol™ (Union Carbide) or polyethylene glycol ethers. In a further embodiment of the present invention, a carbonic anhydrase is combined with one or more CO2 absorbing compounds, such as amine-based compounds, for example, aqueous alkanolamines including monoethanolamine (MEA), diethanolamine (DEA), N- methyldiethanolamine (MDEA), 2-amino-2-hydroxymethyl-1 ,3-propanediol (Tris or AHPD), diglycolamine (DGA), 2-amino-2-methyl-1 -propanol (AMP), , Methylmonoethanolamine (MMEA), Dimethylmonoethanolamine (DMMEA), diethylmonoethanolamine (DEMEA), diisopropanol amine (DI PA), triisopropanolamine (TIPA), aqueous soluble salts (e.g., sodium or potassium salts) of N-methylaminopropionic acid or N,N-dimethylaminoacetic acid or N-methylalanine, N- methylglycine, N,N-dimethylglycine, beta-alanine (3-aminopropanoic acid) or other natural or modified amino acids (e.g., N-substituted amino acid derivatives), 2-(2-aminoethylamino)ethanol (AEE), triethanolamine (TEA) or other primary, secondary, tertiary or hindered amine-based solvents including those described on pages 7 to 9 of US 4,112,052 (hereby incorporated by reference) , or aqueous soluble salts of glycine (e.g., sodium or potassium glycinate) and taurine or other liquid CO2 absorption compositions such as aqueous solutions comprising NaOH, KOH, LiOH, alkali-metal carbonate salts (e.g., lithium, sodium, potassium, or ammonium), alkali-metal bicarbonate salts, alkali-metal phosphate salts, or borate salts, such as sodium tetraborate decahydrate (borax), at different ionic strengths, molar concentrations (ranging from dilute solutions to highly concentrated solutions, up to the solubility limit of the salts, which may vary based on the temperature) or aqueous electrolyte solutions and promoters such as piperazine, or polyethylene glycol ethers, or a blend of them or analogs or blends thereof. The aqueous soluble salts and solvents may be used in combinations with each other and may be combined with pH buffering and mineral sequestering compounds, such as phosphate salts, polyphosphate salts and borate salts, to provide mixed salt solutions, such as potassium or sodium bicarbonate with potassium or sodium phosphate. The aqueous soluble salts and solvents may be combined with simple electrolytes (e.g., alkali halides, such as NaCI, KCI, and metal halides, such as ZnCI) and sulfate salts, such as sodium sulfate and potassium sulfate. Preferably the CO2 absorption composition comprises sufficient concentration of aqueous-soluble salts, e.g., NaCI, to stabilize and optimize the catalytic activity of the carbonic anhydrase. The combination of a carbonic anhydrase with CO2 absorption components may be applied in the bioreactors described herein andmay be applied to already existing CO2 scrubbing facilities based on conventional techniques. In conventional bioreactors, the concentration of alkanolamines is typically 15-30 weight percent. In an embodiment of the present invention the concentration of alkanolamines can be in the conventional range or preferably at a lower concentration, such as preferably below 15% (V/V), more preferably below 12%, 10%, 8%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.2% and most preferably below 0.1 % (V/V).
Certain simple amino acids and amines are known to activate a-CAs (Akdemir et al. , 2013, The extremo-a-carbonic anhydrase (CA) from Sulfurihydrogenibium azorense, the fastest CA known, is highly activated by amino acids and amines, Bioorg. Med. Chem. Lett. 23: 1087-1090) and are herein incorporated by reference. They include D-Phe, L-DOPA, L- and D-Trp, dopamine, serotonin, L- and D-His, L-Phe, L-Tyr, 2-pyridyl-methylamine, L-adrenaline, D-DOPA, D-Tyr, and deveral heterocyclic amines.
In conventional processes, corrosion and oxidation inhibitors, such as contained in Fluor Daniel's proprietary EconAmine FG solvent, are added to provide for increasing the amine concentration while reducing the risk of corrosion. Inorganic corrosion inhibitors include vanadium (e.g., sodium metavanadate), antimony, copper, cobalt, tin, and sufur compounds. Organic corrosion inhibitors include thiourea and salicylic acid.
Other auxiliary absorption liquid components can include wetting agents, chelating agents (e.g., ethylenediamine tetraacetic acid, polyphosphate salts), antifoams, viscosity reducers, and other compounds capable of increasing the flux of CO2 into or out of the carrier liquid.
In conventional processes, techniques to reduce and/or avoid foam formation are commonly employed. These include removal of foam-causing impurities prior to CO2 extraction and use of antifoaming agents and foam inhibitors such as silicone compounds or high-boiling alcohols such as oleyl alcohol or octylphenoxyethanol (A. Kohl and R. Nielsen, Gas Purification, 5th ed., Gulf Professional Publishing, Houston, TX, 1997: 224-230).
Another aspect of the present invention relates to biogas production where the CO2 extraction is performed directly in the biogas fermentation broth, as an alternative to passing the biogas through a bioreactor as described above. By adding a carbonic anhydrase to the anaerobic broth, as an additive in a biogas fermentation medium, more CO2 from the gas phase can be converted into bicarbonate, which is the substrate for methane production by the methanogenic Archaea. Particularly, the genus Methanosarcina is frequently present in thermophilic biogas digesters (Mladenovska and Ahring, 2000, FEMS Microbiol. Ecol. 3: 225- 229). It has been shown for Methanosarcina thermophila TM-1 that bicarbonate may be a limiting factor for the methane production, for example cultures of M. thermophila TM-1 grown in low bicarbonate solution (0.6 mM) showed a considerable lag phase (/.e., methane production began later) when compared with cultures containing ten times higher bicarbonate dosages (6 mM). Additionally, the total yield of methane was 25 times less at the lower bicarbonate dosage (Murray and Zinder, 1985, Appl. Environ. Microbiol. 50: 49-55). Consequently, a heat-stable carbonic anhydrase is particularly useful when the biogas production is performed at elevated temperatures using one or more thermophilic microorganisms, for example methanogens like Methanosarcina sp. that can use CCh/biocarbonate as carbon source for growth and methanogenesis.
A further embodiment of the present invention is use of a carbonic anhydrase described herein to enhance growth of algae and other aquatic plants that utilize bicarbonate as a carbon source by catalyzing the conversion of CO2 to bicarbonate in or for delivery to the aquatic plant environment. This approach can, for example, be used to simultaneously remove CO2 from a combustion exhaust gas, such as a flue gas, and provide CO2 for conversion to bicarbonate by contacting the exhaust gas with liquid from a cultivation pond. Certain approaches to cultivating algae and aquatic plants involve use of enclosed tubes or shallow troughs or ponds in which heat from sunlight raises the water temperature. Hence a heat stable carbonic anhydrase is particularly useful at the elevated cultivation temperatures.
The present invention is further described by the following examples that should not be construed as limiting the scope of the invention.
Examples
Example 1 : Cloning and Expression of Carbonic Anhydrases
Cloning of codon-optimized carbonic anhydrase genes into suitable expression hosts, either Bacillus subtilis or Aspergillus oryzae, was performed by standard laboratory methods, well-known in the field. For gene expression of constructs in B. subtilis, pre-cultures in TB-Gly medium (1.3% Tryptone, 2.7% yeast extract, 0.6% glycerol) under antibiotic selection were inoculated and cultivated at 37 °C for 24 h while shaking (700 rpm, Incubator Hood TH 15, Edmund Buhler). Main cultures in either TB-Gly medium or starch-based medium suitable for B. subtilis fermentation were inoculated in a ratio of 1 :30 from their respective pre-cultures and incubated for 72 h at 37 °C and 700 rpm. Fermentations were harvested by centrifugation (Thermo Scientific Heraeus Megafuge 40 centrifuge) at 2500 rpm for 30 min and cleared supernatants were transferred to a 96-well PCR plate. For gene expression of constructs in A. oryzae minimal MDLI2 medium (131 mM maltose, 4 mM MgSO4, 17 mM NaCI, 8 mM urea, 11 mM K2SO4, 88 mM KH2PO4, 0.01% non-ionic surfactant and trace metals; pH 5.0) was inoculated with spores and fermented for 96 h at 37 °C. Fermentations were sterile filtered (0.2 pm filter) and transferred to a 96-well PCR plate.
Example 2: Detection of Carbonic Anydrase Activity
Expression supernatants from Example 1 were diluted 20-fold with assay buffer (0.188 M TAPS, 0.30 M NaCI, 50 mg/L Cresol red, pH 9.0) and 25 pL sample was transferred into a transparent, flat-bottom 96-well assay plate. The substrate solution was prepared by carbonating a cold 0.15 M NaCI solution using a standard household sparkling water maker (SodaStream). To determine carbonic anhydrase activity 200 pL cold substrate solution was auto injected (300 pL/s) into the sample and the change in absorbance at 573 nm was recorded for 30 s using a ClarioStar microtiter plate spectrophotometer. Each sample was measured in triplicate. Persophenella marina carbonic anhydrase (SEQ ID NO: 77) supernatant expressed from a B. subtilis host served as positive control and empty medium as a negative control.
To calculate the activity for each sample, absorbance curves were normalized by subtracting the sample’s absorbance at reaction completion (flat line in the end), so that all samples had a corrected absorbance value of 0.00 at reaction completion. Reaction times were determined as the times passed from sample injection until the normalized absorbance value reached 0.15. The average times for three replicates (tSamPie average and t negative control average, respectively) were used to calculate the WAU score with the following formula: 1000
WAU scores above 60 s-1 indicated high activity. Table 1 shows the determined WAU score for the discovered wildtype carbonic anhydrases. Table 1
Reaction faster than time between substrate injection and start of absorbance reading.
Example 3: Normalized WAU activity of carbonic anhydrases at 0.2 mg/mL protein relative to Persophenella marina carbonic anhydrase
Purified carbonic anhydrases (in 50 mM HEPES, 0.3 M NaCI, pH 7) were diluted to 0.2 mg/mL with 0.15 M NaCI. To generate a standard curve Persophenella marina carbonic anhydrase (SEQ ID NO: 77; in 50 mM HEPES, 0.3 M NaCI, pH 7) was diluted with 0.15 M NaCI to 0.2 mg/mL, 0.1 mg/mL, 0.05 mg/mL and 0.025 mg/mL, respectively. 16 pL diluted enzyme sample was combined with 224 pL assay buffer (0.188 M TAPS, 0.30 M NaCI, 50 mg/L Cresol red, pH 9.0) and 25 pL sample was transferred into a transparent, flat-bottom 96-well assay plate. The substrate solution was prepared by carbonating a 0.15 M NaCI solution using a standard household sparkling water maker (SodaStream). To determine carbonic anhydrase activity 200 pL substrate solution was auto-injected (100 pL/s) into the sample and the change in absorbance at 573 nm was recorded for 30 s using a TECAN microtiter plate spectrophotometer. Each sample was measured in triplicate.
The time in seconds to reach the absorbance of 0.2 (to.2) was determined for each sample. A calibration curve was constructed by plotting 1/to.2 vs the concentration of the corresponding standard. The normalized activity of the samples was calculated using linear regression with the standard curve. One normalized WAU activity unit corresponds to the activity of 1.0 mg/mL Persophenella marina carbonic anhydrase (SEQ ID NO: 77) in this assay. An average of two determinations was calculated and results shown in Figure 47. Bars marked with a star were higher than 0.22 in normalized WAU units.
Example 4: Normalized WAU activity of carbonic anhydrases at 1.0 mg/mL protein relative to Persophenella marina carbonic anhydrase
Purified carbonic anhydrases (in 50 mM HEPES, 0.3 M NaCI, pH 7) were diluted to 1.0 mg/mL with 0.15 M NaCI. To generate a standard curve Persophenella marina carbonic anhydrase (SEQ ID NO: 77; in 50 mM HEPES, 0.3 M NaCI, pH 7) was diluted with 0.15 M NaCI to 0.2 mg/mL, 0.1 mg/mL, 0.05 mg/mL and 0.025 mg/mL, respectively. 16 pL diluted enzyme sample was combined with 224 pL assay buffer (0.188 M TAPS, 0.30 M NaCI, 50 mg/L Cresol red, pH 9.0) and 25 pL sample was transferred into a transparent, flat-bottom 96-well assay plate. The substrate solution was prepared by carbonating a 0.15 M NaCI solution using a standard household sparkling water maker (SodaStream). To determine carbonic anhydrase activity 200 pL substrate solution was auto-injected (100 pL/s) into the sample and the change in absorbance at 573 nm was recorded for 30 s using a TECAN microtiter plate spectrophotometer. Each sample was measured in triplicate.
The time in seconds to reach the absorbance of 0.2 (to.2) was determined for each sample. A calibration curve was constructed by plotting 1/to.2 vs the concentration of the corresponding standard. The normalized activity of the samples was calculated using linear regression with the standard curve. So, one normalized WAU activity unit corresponds to the activity of 1.0 mg/mL Persophenella marina carbonic anhydrase (SEQ ID NO: 77) in this assay. An average of two determinations was calculated and results shown in Figure 48. Bars marked with a star were higher than 0.3 in normalized WAU units.
The invention described and claimed herein is not to be limited in scope by the specific aspects herein disclosed, since these aspects are intended as illustrations of several aspects of the invention. Any equivalent aspects are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description. Such modifications are also intended to fall within the scope of the appended claims. In the case of conflict, the present disclosure including definitions will control.
The invention is further defined by the following numbered paragraphs:
Paragraph [1]: A polypeptide having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of any one of SEQ ID NOs: 1 , 3, 5, 7, 9, 11 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31 , 33, 35, 37, 39, 41 , 43, 45, 47, 49, 51 , 53, 55, 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75, 78, 80, 82, and 84;
(c) a polypeptide derived from any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85 by having 1-30 alterations (e.g., substitutions, deletions and/or insertions at one or more positions, e.g., 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 or 10 or 11 or 12 or 13 or 14 or 15 or 16 or 17 or 18 or 19 or 20 or 21 or 22 or 23 or 24 or 25 or 26 or 27 or 28 or 29 or 30 alterations, in particular substitutions; (d) a polypeptide having a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1.0, to the three-dimensional structure of the polypeptide of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85, wherein the three-dimensional structure is calculated by Alphafold.
(e) a polypeptide derived from the polypeptide of (a), (b), (c) or (d) wherein the N- and/or C-terminal end has been extended by addition of one or more amino acids; and
(f) a fragment of the polypeptide of (a), (b), (c) or (d); wherein the polypeptide has carbonic anhydrase activity.
Paragraph [2]: A polypeptide having carbonic anhydrase activity, which is:
(a) a polypeptide having at least 60% sequence identity to any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85; or
(b) a fragment of the polypeptide of (a); wherein the polypeptide has carbonic anhydrase activity.
Paragraph [3]: A polypeptide having carbonic anhydrase activity, which is a fragment having at least 75%, 80%, 85%, 90%, or 95% of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85; wherein the fragment has carbonic anhydrase activity.
Paragraph [4]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 2.
Paragraph [5]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 4.
Paragraph [6]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 6.
Paragraph [7]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 8.
Paragraph [8]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 10.
Paragraph [9]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 12.
Paragraph [10]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 14.
Paragraph [11]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 16.
Paragraph [12]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 18.
Paragraph [13]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 20. Paragraph [14]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 22.
Paragraph [15]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 24.
Paragraph [16]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 26.
Paragraph [17]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 28.
Paragraph [18]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 30.
Paragraph [19]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 32.
Paragraph [20]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 34. Paragraph [21]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 36.
Paragraph [22]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 38.
Paragraph [23]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 40.
Paragraph [24]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 42.
Paragraph [25]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 44.
Paragraph [26]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 46.
Paragraph [27]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 48.
Paragraph [28]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 50.
Paragraph [29]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 52.
Paragraph [30]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 54.
Paragraph [31]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 56.
Paragraph [32]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 58.
Paragraph [33]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 60.
Paragraph [34]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 62.
Paragraph [35]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 64. Paragraph [36]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 66.
Paragraph [37]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 68.
Paragraph [38]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 70.
Paragraph [39]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 72.
Paragraph [40]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 74.
Paragraph [41]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 76.
Paragraph [42]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 79. Paragraph [43]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 81.
Paragraph [44]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 83.
Paragraph [45]: The polypeptide of any one of the preceding paragraphs, having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 85.
Paragraph [46]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 1 .
Paragraph [47]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 3.
Paragraph [48]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 5.
Paragraph [49]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 7. Paragraph [50]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 9.
Paragraph [51]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 11.
Paragraph [52]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 13.
Paragraph [53]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 15.
Paragraph [54]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 17.
Paragraph [55]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 19. Paragraph [56]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 21.
Paragraph [57]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 23.
Paragraph [58]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 25.
Paragraph [59]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 27.
Paragraph [60]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 29.
Paragraph [61]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 31.
Paragraph [62]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 33.
Paragraph [63]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 35.
Paragraph [64]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 37.
Paragraph [65]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 39.
Paragraph [66]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 41.
Paragraph [67]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 43.
Paragraph [68]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 45.
Paragraph [69]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 47.
Paragraph [70]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 49.
Paragraph [71]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 51.
Paragraph [72]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 53.
Paragraph [73]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 55.
Paragraph [74]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 57. Paragraph [75]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 59.
Paragraph [76]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 61.
Paragraph [77]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 63.
Paragraph [78]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 65.
Paragraph [79]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 67.
Paragraph [80]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 69. Paragraph [81]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 71.
Paragraph [82]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 73.
Paragraph [83]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 75.
Paragraph [84]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 78.
Paragraph [85]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 80.
Paragraph [86]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 82.
Paragraph [87]: The polypeptide of any one of the preceding paragraphs, which is encoded by a polynucleotide having at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to the coding sequence of SEQ ID NO: 84.
Paragraph [88]: The polypeptide of any one of the preceding paragraphs, which is a variant of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85; comprising a substitution (e.g., conservative substitution), deletion, and/or insertion at one or more positions.
Paragraph [89]: The polypeptide of any one of the preceding paragraphs, comprising, consisting essentially of, or consisting of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85.
Paragraph [90]: The polypeptide of any one of the preceding paragraphs, comprising an N- terminal extension and/or C-terminal extension of 1-10 amino acids, e.g., 1 , 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acids, and wherein the extended polypeptide has carbonic anhydrase activity.
Paragraph [91]: The polypeptide of any one of the preceding paragraphs having at most 10%, at most 9%, at most 8%, at most 7%, at most 6%, at most 5%, at most 4%, at most 3%, at most 2% or at most 1% sequence differences to the polypeptide of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85.
Paragraph [92]: The polypeptide of any one of the preceding paragraphs, which differs from the polypeptide of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85 by at 1 , 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14 or 15 amino acids.
Paragraph [93]: The polypeptide of any one of the preceding paragraphs, which has a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1.0, to the three-dimensional structure of the polypeptide of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85, wherein the three-dimensional structure is calculated by Alphafold. Paragraph [94]: The polypeptide of any one of the preceding paragraphs, which has improved thermostabilty.
Paragraph [95]: The polypeptide of any one of the preceding paragraphs, which is isolated.
Paragraph [96]: The polypeptide of any one of the preceding paragraphs, which is purified.
Paragraph [97]: A fusion polypeptide comprising the polypeptide of any one of the preceding paragraphs and a second polypeptide.
Paragraph [98]: A composition comprising the polypeptide of any one of paragraphs 1-96 or the fusion polypeptide of paragraph 97.
Paragraph [99]: The composition of paragraph 98, which is a liquid composition, solid composition, solution, dispersion, paste, powder, granule, granulate, coated granulate, tablet, cake, crystal, crystal slurry, gel or pellet.
Paragraph [100]: A polynucleotide encoding the polypeptide of any one of paragraphs 1-96 or the fusion polypeptide paragraph 97.
Paragraph [101]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 1.
Paragraph [102]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 3.
Paragraph [103]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 5.
Paragraph [104]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 7.
Paragraph [105]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 9.
Paragraph [106]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 11.
Paragraph [107]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 13.
Paragraph [108]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 15.
Paragraph [109]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 17.
Paragraph [110]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 19.
Paragraph [111]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 21. Paragraph [112]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 23.
Paragraph [113]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 25.
Paragraph [114]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 27.
Paragraph [115]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 29.
Paragraph [116]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 31.
Paragraph [117]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 33.
Paragraph [118]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 35.
Paragraph [119]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 37.
Paragraph [120]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 39.
Paragraph [121]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 41.
Paragraph [122]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 43.
Paragraph [123]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 45.
Paragraph [124]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 47.
Paragraph [125]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 49.
Paragraph [126]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 51. Paragraph [127]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 53.
Paragraph [128]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 55.
Paragraph [129]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 57.
Paragraph [130]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 59.
Paragraph [131]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 61.
Paragraph [132]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 63.
Paragraph [133]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 65.
Paragraph [134]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 67.
Paragraph [135]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 69.
Paragraph [136]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 71.
Paragraph [137]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 73.
Paragraph [138]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 75.
Paragraph [139]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 78.
Paragraph [140]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 80.
Paragraph [141]: The polynucleotide of paragraph 100, which comprises at least 60%, at least
65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least
92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least
99% or 100% sequence identity to SEQ ID NO: 82. Paragraph [142]: The polynucleotide of paragraph 100, which comprises at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99% or 100% sequence identity to SEQ ID NO: 84.
Paragraph [143]: The polynucleotide of any one of paragraphs 100-142, which is isolated.
Paragraph [144]: The polynucleotide of any one of paragraphs 100-143, which is purified.
Paragraph [145]: A nucleic acid construct or expression vector comprising the polynucleotide of any one of paragraphs 100-144, wherein the polynucleotide is operably linked to one or more control sequences that direct the production of the polypeptide in an expression host.
Paragraph [145]: A recombinant host cell comprising the polynucleotide of any one of paragraphs 100-144 operably linked to one or more control sequences that direct the production of the polypeptide or the fusion polypeptide.
Paragraph [146]: The recombinant host cell of paragraph 145, wherein the polypeptide is heterologous to the recombinant host cell.
Paragraph [147]: The recombinant host cell of paragraph 145 or 146, wherein at least one of the one or more control sequences is heterologous to the polynucleotide encoding the polypeptide.
Paragraph [148]: The recombinant host cell of any one of paragraphs 145-147, which comprises at least two copies, e.g., three, four, five, or more copies of the polynucleotide of any one of paragraphs 100-144.
Paragraph [149]: The recombinant host cell of any one of paragraphs 145-148, which is a yeast recombinant host cell, e.g., a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces, Schizosaccharomyces, or Yarrowia cell, such as a Kluyveromyces lactis, Saccharomyces carlsbergensis, Saccharomyces cerevisiae, Saccharomyces diastaticus, Saccharomyces douglasii, Saccharomyces kluyveri, Saccharomyces norbensis, Saccharomyces oviformis, or Yarrowia lipolytica cell.
Paragraph [150]: The recombinant host cell of any one of paragraphs 145-148, which is a filamentous fungal recombinant host cell, e.g., an Acremonium, Aspergillus, Aureobasidium, Bjerkandera, Ceriporiopsis, Chrysosporium, Coprinus, Coriolus, Cryptococcus, Fili basidium, Fusarium, Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, Neurospora, Paecilomyces, Penicillium, Phanerochaete, Phlebia, Piromyces, Pleurotus, Schizophyllum, Talaromyces, Thermoascus, Thielavia, Tolypocladium, Trametes, or Trichoderma cell, in particular, an Aspergillus awamori, Aspergillus foetidus, Aspergillus fumigatus, Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger, Aspergillus oryzae, Bjerkandera adusta, Ceriporiopsis aneirina, Ceriporiopsis caregiea, Ceriporiopsis gilvescens, Ceriporiopsis pannocinta, Ceriporiopsis rivulosa, Ceriporiopsis subrufa, Ceriporiopsis subvermispora, Chrysosporium inops, Chrysosporium keratinophilum, Chrysosporium lucknowense, Chrysosporium merdarium, Chrysosporium pannicola, Chrysosporium queenslandicum, Chrysosporium tropicum, Chrysosporium zonatum, Coprinus cinereus, Coriolus hirsutus, Fusarium bactridioides, Fusarium cerealis, Fusarium crookwellense, Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium sulphureum, Fusarium torulosum, Fusarium trichothecioides, Fusarium venenatum, Humicola insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora thermophila, Neurospora crassa, Penicillium purpurogenum, Phanerochaete chrysosporium, Phlebia radiata, Pleurotus eryngii, Talaromyces emersonii, Thielavia terrestris, Trametes villosa, Trametes versicolor, Trichoderma harzianum, Trichoderma koningii, Trichoderma longibrachiatum, Trichoderma reesei, or Trichoderma viride cell.
Paragraph [151]: The recombinant host cell of any one of paragraphs 145-148, which is a prokaryotic recombinant host cell, e.g., a Gram-positive cell selected from the group consisting of Bacillus, Clostridium, Enterococcus, Geobacillus, Lactobacillus, Lactococcus, Oceanobacillus, Staphylococcus, Streptococcus, or Streptomyces cells, or a Gram-negative bacteria selected from the group consisting of Campylobacter, E. coli, Flavobacterium, Fusobacterium, Helicobacter, llyobacter, Neisseria, Pseudomonas, Salmonella, and Ureaplasma cells, such as Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus coagulans, Bacillus firmus, Bacillus lautus, Bacillus lentus, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, Bacillus stearothermophilus, Bacillus subtilis, Bacillus thuringiensis, Streptococcus equisimilis, Streptococcus pyogenes, Streptococcus uberis, and Streptococcus equi subsp. Zooepidemicus, Streptomyces achromogenes, Streptomyces avermitilis, Streptomyces coelicolor, Streptomyces griseus, and Streptomyces lividans cells.
Paragraph [152]: The recombinant host cell of any one of paragraphs 145-151 , which is isolated.
Paragraph [153]: The recombinant host cell of any one of paragraphs 145-152, which is purified. Paragraph [154]: A method of producing the polypeptide of any one of paragraphs 1-96 or the fusion polypeptide of paragraph 97, the method comprising cultivating a cell, which in its wildtype form produces the polypeptide, under conditions conducive for production of the polypeptide.
Paragraph [155]: The method of paragraph 154, further comprising recovering the polypeptide or the fusion polypeptide.
Paragraph [156]: A method of producing a polypeptide having carbonic anhydrase activity, comprising cultivating the recombinant host cell of any one of paragraphs 145-153 under conditions conducive for production of the polypeptide or the fusion polypeptide.
Paragraph [157]: The method of paragraph 156, further comprising recovering the polypeptide.
Paragraph [158]: A method of extracting carbon dioxide, the method comprising treating a carbon dioxide-containing medium with a polypeptide of any one of paragraphs 1-96 or the fusion polypeptide of paragraph 97.
Paragraph [159]: The method of paragraph 158, where carbon dioxide-containing gas or multiphase mixture is emitted from combustion.
Paragraph [160]: The method of paragraph 158, wherein the carbon dioxide-containing gas is a flue gas.
Paragraph [161]: The method of paragraph 158, wherein the carbon dioxide-containing gas or multiphase mixture is a raw natural gas or a syngas.
Paragraph [162]: The method of paragraph 158, wherein the carbon dioxide-containing gas or multiphase mixture is a biogas.
Paragraph [163]: The method of any one of paragraphs 158-162, wherein the carbon dioxide- containing medium is a bicarbonate-containing liquid, and the carbon dioxide extraction is the conversion of bicarbonate to carbon dioxide.
Paragraph [164]: The method of paragraph 163, wherein the carbon dioxide-containing medium further comprises an amine-based compound. Paragraph [165]: The method of paragraph 164, wherein the amine-based compound is Tris or MDEA.
Paragraph [166]: The method of any one of paragraphs 158-165, wherein the carbon dioxidecontaining medium further comprises HPO42; such as K2HPO4 or Na2HPC>4.
Paragraph [167]: The method of any one of paragraphs 158-166, wherein the extraction of carbon dioxide is performed at temperatures between 55°C and 120°C.
Paragraph [168]: A bioreactor for extracting carbon dioxide, where said bioreactor comprises a carbonic anhydrase of any one of paragraphs 1-96 or fusion protein of paragraph 97.
Paragraph [169]: The bioreactor according to paragraph 168, where said bioreactor comprises the following elements: a) at least one absorption module; b) at least one precipitation stage; c) a carrier liquid; and d) means for connecting the absorption module(s) to the precipitation stage(s) such that the carrier liquid can be passed from the absorption module(s) to the precipitation stage(s).
Paragraph [170]: The bioreactor according to paragraph 168 or 169, where said reactor comprises the following elements: a) at least one absorption module; b) at least one desorption module; c) a carrier liquid; and d) means for connecting the absorption module(s) and the desorption module(s) such that the carrier liquid can be passed from the absorption module(s) to the desorption module(s).
Paragraph [171]: The bioreactor according to any one of paragraphs 168-170, wherein the carbonic anhydrase is entrapped in the absorption module or in the desorption module or in both modules, or in the precipitation stage.
Paragraph [172]: The bioreactor according to any one of paragraphs 168-171 , wherein the reactor further comprises a bicarbonate-based carrier liquid.
Paragraph [173]: The bioreactor according to any one of paragraphs 168-172, wherein the reactor further comprises an amine-based carrier liquid. Paragraph [174]: The bioreactor according to paragraph 173, where the amine-based carrier liquid includes Tris or MDEA. Paragraph [175]: The bioreactor according to any one of paragraphs 168-174, wherein the reactor further comprises HPO42; such as K2HPO4 or Na2HPC>4.
Paragraph [176]: A composition comprising the carbonic anhydrase of any one of paragraphs 1- 96 or the fusion polypeptide of paragraph 97.
Paragraph [177]: The composition of paragraph 176, where the composition comprises a matrix suitable for immobilization.
Paragraph [178]: The composition of paragraph 177, wherein the matrix is selected from the group consisting of beads, fabrics, fibers, hollow fibers, membranes, particulates, porous surfaces, rods, and tubes.

Claims

Claims What is claimed is:
1 . A polypeptide having carbonic anhydrase activity, selected from the group consisting of:
(a) a polypeptide having at least 60% sequence identity to any one of SEQ ID NOs:
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85;
(b) a polypeptide encoded by a polynucleotide having at least 60% sequence identity to the coding sequence of any one of SEQ ID NOs: 1 , 3, 5, 7, 9, 11 , 13, 15, 17, 19, 21 , 23, 25, 27, 29, 31 , 33, 35, 37, 39, 41 , 43, 45, 47, 49, 51 , 53, 55, 57, 59, 61 , 63, 65, 67, 69, 71 , 73, 75, 78, 80, 82, and 84;
(c) a polypeptide derived from any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85 by having 1-30 alterations (e.g., substitutions, deletions and/or insertions at one or more positions, e.g., 1 or 2 or 3 or 4 or 5 or 6 or 7 or 8 or 9 or 10 or 11 or 12 or 13 or 14 or 15 or 16 or 17 or 18 or 19 or 20 or 21 or 22 or 23 or 24 or 25 or 26 or 27 or 28 or 29 or 30 alterations, in particular substitutions;
(d) a polypeptide having a TM-score of at least 0.80, e.g., at least 0.85, at least 0.90, at least 0.905, at least 0.910, at least 0.915, at least 0.920, at least 0.925, at least 0.930, at least 0.935, at least 0.940, at least 0.945, at least 0.950, at least 0.955, at least 0.960, at least 0.965, at least 0.970, at least 0.975, at least 0.980, at least 0.985, at least 0.990, at least 0.995, or even 1.0, to the three-dimensional structure of the polypeptide of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85, wherein the three-dimensional structure is calculated by Alphafold.
(e) a polypeptide derived from the polypeptide of (a), (b), (c) or (d) wherein the N- and/or C-terminal end has been extended by addition of one or more amino acids; and
(f) a fragment of the polypeptide of (a), (b), (c) or (d); wherein the polypeptide has carbonic anhydrase activity.
2. The polypeptide of claim 1 , comprising, consisting essentially of, or consisting of any one of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 79, 81 , 83, and 85.
3. A composition comprising the polypeptide of claim 1 or 2.
4. A method of extracting carbon dioxide, the method comprising treating a carbon dioxidecontaining medium with a polypeptide of claim 1 or 2.
5. A bioreactor for extracting carbon dioxide, where said bioreactor comprises a polypeptide of claim 1 or 2.
6. A polynucleotide encoding the polypeptide of claim 1 or 2.
7. A nucleic acid construct or expression vector comprising the polynucleotide of claim 6, operably linked to one or more control sequences that direct the production of the polypeptide in an expression host.
8. A recombinant host cell comprising the nucleic acid construct or expression vector of claim 7.
9. A method of producing a polypeptide having carbonic anhydrase activity, comprising cultivating the recombinant host cell of claim 8 under conditions conducive for production of the polypeptide.
10. The method of claim 9 further comprising recovering the carbonic anhydrase.
PCT/EP2025/0533032024-02-082025-02-07Polypeptides having carbonic anhydrase activity and polynucleotides encoding samePendingWO2025168806A1 (en)

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US202463551239P2024-02-082024-02-08
US63/551,2392024-02-08

Publications (1)

Publication NumberPublication Date
WO2025168806A1true WO2025168806A1 (en)2025-08-14

Family

ID=94605802

Family Applications (1)

Application NumberTitlePriority DateFiling Date
PCT/EP2025/053303PendingWO2025168806A1 (en)2024-02-082025-02-07Polypeptides having carbonic anhydrase activity and polynucleotides encoding same

Country Status (1)

CountryLink
WO (1)WO2025168806A1 (en)

Citations (51)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US4112052A (en)1975-06-261978-09-05Exxon Research & Engineering Co.Process for removing carbon dioxide containing acidic gases from gaseous mixtures using aqueous amine scrubbing solutions
EP0238023A2 (en)1986-03-171987-09-23Novo Nordisk A/SProcess for the production of protein products in Aspergillus oryzae and a promoter for use in Aspergillus
US4750918A (en)1985-05-281988-06-14The Trustees Of The Stevens Institute Of TechnologySelective-permeation gas-separation process and apparatus
WO1992006204A1 (en)1990-09-281992-04-16Ixsys, Inc.Surface expression libraries of heteromeric receptors
US5223409A (en)1988-09-021993-06-29Protein Engineering Corp.Directed evolution of novel binding proteins
WO1994025612A2 (en)1993-05-051994-11-10Institut PasteurNucleotide sequences for the control of the expression of dna sequences in a cellular host
WO1995017413A1 (en)1993-12-211995-06-29Evotec Biosystems GmbhProcess for the evolutive design and synthesis of functional polymers based on designer elements and codes
WO1995022625A1 (en)1994-02-171995-08-24Affymax Technologies N.V.Dna mutagenesis by random fragmentation and reassembly
WO1995033836A1 (en)1994-06-031995-12-14Novo Nordisk Biotech, Inc.Phosphonyldipeptides useful in the treatment of cardiovascular diseases
US5776741A (en)1994-02-211998-07-07Novo Nordisk A/SMethod of enzyme immobilization on a particulate silica carrier for synthesis inorganic media
US6143556A (en)1995-06-072000-11-07Trachtenberg; Michael C.Enzyme systems for gas processing
US6156096A (en)1994-03-232000-12-05Applied Membrane Technology, Inc.Gas separation using hollow fiber contained liquid membrane
US6524842B1 (en)2000-06-302003-02-25Envirogen, Inc.Biodegradation of gasoline oxygenates
US6524843B1 (en)1997-06-042003-02-25Co2 Solution Inc.Process and apparatus for the treatment of carbon dioxide with carbonic anhydrase
WO2004007058A1 (en)2002-07-112004-01-22Co2 Solution Inc.Triphasic bioreactor and process for gas effluent treatment
US20040029257A1 (en)2002-01-282004-02-12Co2 SolutionProcess for purifying energetic gases such as biogas and natural gas
US20040059231A1 (en)2002-09-192004-03-25Colin CorporationArteriostenosis inspecting apparatus
WO2004028667A1 (en)2002-09-272004-04-08Co2 Solution Inc.A process and a plant for recycling carbon dioxide emissions from power plants into useful carbonated species
WO2004056455A1 (en)2002-12-192004-07-08Co2 Solution Inc.Process and apparatus using a spray absorber bioreactor for the biocatalytic treatment of gases
US20040171154A1 (en)2001-07-272004-09-02Francesca StoriciSystems for in vivo site-directed mutagenesis using oligonucleotides
WO2004104160A1 (en)2003-05-192004-12-02Michael TrachtenbergMethods, apparatuses, and reactors for gas separation
US20050214936A1 (en)2003-06-182005-09-29Bhattacharya Sanjoy KEnzyme facilitated solubilization of carbon dioxide from emission streams in novel attachable reactors/devices
WO2005114417A2 (en)2004-05-072005-12-01General Motors CorporationProcess for immobilization of protein catalysts, product, and use
WO2006089423A1 (en)2005-02-242006-08-31Co2 Solution Inc.An improved co2 absorption solution
US7132090B2 (en)2003-05-022006-11-07General Motors CorporationSequestration of carbon dioxide
US7176017B2 (en)2001-07-132007-02-13Co2 Solution Inc.Triphasic bioreactor and process for gas effluent treatment
WO2007036235A1 (en)2005-09-302007-04-05Novozymes A/SImmobilization of enzymes
US20070256559A1 (en)2006-05-082007-11-08The Board Of Trustees Of The University Of IllinoisIntegrated vacuum absorption steam cycle gas separation
US20080056972A1 (en)2006-09-062008-03-06Mitsubishi Heavy Industries, Ltd.Co2 recovery system and co2 recovery method
WO2008041920A1 (en)2006-10-062008-04-10Morphic Technologies Aktiebolag (Publ.)A method and an arrangement for extracting carbon dioxide from air
WO2008072979A1 (en)2006-12-152008-06-19Sinvent AsMethod for capturing co2 from exhaust gas
WO2008095057A2 (en)2007-01-312008-08-07Novozymes A/SHeat-stable carbonic anhydrases and their use
WO2010014774A2 (en)2008-07-312010-02-04Novozymes A/SModular membrane reactor and process for carbon dioxide extraction
WO2010014773A1 (en)2008-07-312010-02-04Novozymes A/SModular reactor and process for carbon dioxide extraction
WO2010037109A2 (en)2008-09-292010-04-01Akermin, Inc.Process for accelerated capture of carbon dioxide
WO2012003336A2 (en)2010-06-302012-01-05Codexis, Inc.Chemically modified carbonic anhydrases useful in carbon capture systems
WO2012025577A1 (en)2010-08-242012-03-01Novozymes A/SHeat-stable persephonella carbonic anhydrases and their use
WO2012092984A1 (en)2011-01-072012-07-12Statoil Petroleum AsRotating vacuum stripper
WO2012122404A2 (en)2011-03-082012-09-13Akermin, Inc.Micellar polysiloxane enzyme immobilization materials
WO2012154735A2 (en)2011-05-102012-11-15Danisco Us Inc.Thermostable carbonic anhydrases and methods of use thereof
WO2013151757A1 (en)2012-04-062013-10-10Akermin, Inc.Polysilicate-polysilicone enzyme immobilization materials
US20140234193A9 (en)2007-08-092014-08-21Lawrence Livermore National Security, LlcElectrochemical Production of Metal Hydroxide Using Metal Silicates
US8871008B2 (en)2011-09-072014-10-28Carbon Engineering Limited PartnershipTarget gas capture
US20150099289A1 (en)2013-10-092015-04-09Akermin, Inc.Carbonic anhydrase xerogel particles
WO2015126925A1 (en)2014-02-182015-08-27Akermin, Inc.Processes and methods for low energy carbon dioxide capture
WO2017144177A1 (en)2016-02-262017-08-31Keskin HüseyinDriving and/or flight simulator
WO2018017792A1 (en)2016-07-202018-01-25Novozymes A/SHeat-stable metagenomic carbonic anhydrases and their use
WO2020194124A1 (en)*2019-03-262020-10-01Saipem S.P.A.Carbonic anhydrase variants for improved co2 capture
US10920213B2 (en)*2018-08-022021-02-16Sk Innovation Co., Ltd.Heat-resistant carbonic anhydrase mutants and composition for capturing carbon dioxide containing the same
WO2022178376A2 (en)*2021-02-192022-08-25Alliance For Sustainable Energy, LlcEngineered thermostable carbonic anhydrase enzymes
WO2024118901A2 (en)*2022-11-302024-06-06Novozymes A/SCarbonic anhydrase variants and polynucleotides encoding same

Patent Citations (53)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US4112052A (en)1975-06-261978-09-05Exxon Research & Engineering Co.Process for removing carbon dioxide containing acidic gases from gaseous mixtures using aqueous amine scrubbing solutions
US4750918A (en)1985-05-281988-06-14The Trustees Of The Stevens Institute Of TechnologySelective-permeation gas-separation process and apparatus
EP0238023A2 (en)1986-03-171987-09-23Novo Nordisk A/SProcess for the production of protein products in Aspergillus oryzae and a promoter for use in Aspergillus
US5223409A (en)1988-09-021993-06-29Protein Engineering Corp.Directed evolution of novel binding proteins
WO1992006204A1 (en)1990-09-281992-04-16Ixsys, Inc.Surface expression libraries of heteromeric receptors
WO1994025612A2 (en)1993-05-051994-11-10Institut PasteurNucleotide sequences for the control of the expression of dna sequences in a cellular host
WO1995017413A1 (en)1993-12-211995-06-29Evotec Biosystems GmbhProcess for the evolutive design and synthesis of functional polymers based on designer elements and codes
WO1995022625A1 (en)1994-02-171995-08-24Affymax Technologies N.V.Dna mutagenesis by random fragmentation and reassembly
US5776741A (en)1994-02-211998-07-07Novo Nordisk A/SMethod of enzyme immobilization on a particulate silica carrier for synthesis inorganic media
US6156096A (en)1994-03-232000-12-05Applied Membrane Technology, Inc.Gas separation using hollow fiber contained liquid membrane
WO1995033836A1 (en)1994-06-031995-12-14Novo Nordisk Biotech, Inc.Phosphonyldipeptides useful in the treatment of cardiovascular diseases
US6143556A (en)1995-06-072000-11-07Trachtenberg; Michael C.Enzyme systems for gas processing
US6524843B1 (en)1997-06-042003-02-25Co2 Solution Inc.Process and apparatus for the treatment of carbon dioxide with carbonic anhydrase
US6524842B1 (en)2000-06-302003-02-25Envirogen, Inc.Biodegradation of gasoline oxygenates
US7176017B2 (en)2001-07-132007-02-13Co2 Solution Inc.Triphasic bioreactor and process for gas effluent treatment
US20040171154A1 (en)2001-07-272004-09-02Francesca StoriciSystems for in vivo site-directed mutagenesis using oligonucleotides
US20040029257A1 (en)2002-01-282004-02-12Co2 SolutionProcess for purifying energetic gases such as biogas and natural gas
WO2004007058A1 (en)2002-07-112004-01-22Co2 Solution Inc.Triphasic bioreactor and process for gas effluent treatment
US20040059231A1 (en)2002-09-192004-03-25Colin CorporationArteriostenosis inspecting apparatus
WO2004028667A1 (en)2002-09-272004-04-08Co2 Solution Inc.A process and a plant for recycling carbon dioxide emissions from power plants into useful carbonated species
WO2004056455A1 (en)2002-12-192004-07-08Co2 Solution Inc.Process and apparatus using a spray absorber bioreactor for the biocatalytic treatment of gases
US7132090B2 (en)2003-05-022006-11-07General Motors CorporationSequestration of carbon dioxide
WO2004104160A1 (en)2003-05-192004-12-02Michael TrachtenbergMethods, apparatuses, and reactors for gas separation
US20050214936A1 (en)2003-06-182005-09-29Bhattacharya Sanjoy KEnzyme facilitated solubilization of carbon dioxide from emission streams in novel attachable reactors/devices
WO2005114417A2 (en)2004-05-072005-12-01General Motors CorporationProcess for immobilization of protein catalysts, product, and use
US20100068784A1 (en)2004-05-072010-03-18Gm Global Technology Operations, Inc.Process for immobilization of protein catalysts, product, and use
WO2006089423A1 (en)2005-02-242006-08-31Co2 Solution Inc.An improved co2 absorption solution
WO2007036235A1 (en)2005-09-302007-04-05Novozymes A/SImmobilization of enzymes
US20070256559A1 (en)2006-05-082007-11-08The Board Of Trustees Of The University Of IllinoisIntegrated vacuum absorption steam cycle gas separation
US20080056972A1 (en)2006-09-062008-03-06Mitsubishi Heavy Industries, Ltd.Co2 recovery system and co2 recovery method
WO2008041920A1 (en)2006-10-062008-04-10Morphic Technologies Aktiebolag (Publ.)A method and an arrangement for extracting carbon dioxide from air
WO2008072979A1 (en)2006-12-152008-06-19Sinvent AsMethod for capturing co2 from exhaust gas
WO2008095057A2 (en)2007-01-312008-08-07Novozymes A/SHeat-stable carbonic anhydrases and their use
US20140234193A9 (en)2007-08-092014-08-21Lawrence Livermore National Security, LlcElectrochemical Production of Metal Hydroxide Using Metal Silicates
WO2010014773A1 (en)2008-07-312010-02-04Novozymes A/SModular reactor and process for carbon dioxide extraction
WO2010014774A2 (en)2008-07-312010-02-04Novozymes A/SModular membrane reactor and process for carbon dioxide extraction
WO2010037109A2 (en)2008-09-292010-04-01Akermin, Inc.Process for accelerated capture of carbon dioxide
US7998714B2 (en)2008-09-292011-08-16Akermin, Inc.Process for accelerated capture of carbon dioxide
WO2012003336A2 (en)2010-06-302012-01-05Codexis, Inc.Chemically modified carbonic anhydrases useful in carbon capture systems
WO2012025577A1 (en)2010-08-242012-03-01Novozymes A/SHeat-stable persephonella carbonic anhydrases and their use
WO2012092984A1 (en)2011-01-072012-07-12Statoil Petroleum AsRotating vacuum stripper
WO2012122404A2 (en)2011-03-082012-09-13Akermin, Inc.Micellar polysiloxane enzyme immobilization materials
WO2012154735A2 (en)2011-05-102012-11-15Danisco Us Inc.Thermostable carbonic anhydrases and methods of use thereof
US8871008B2 (en)2011-09-072014-10-28Carbon Engineering Limited PartnershipTarget gas capture
WO2013151757A1 (en)2012-04-062013-10-10Akermin, Inc.Polysilicate-polysilicone enzyme immobilization materials
US20150099289A1 (en)2013-10-092015-04-09Akermin, Inc.Carbonic anhydrase xerogel particles
WO2015126925A1 (en)2014-02-182015-08-27Akermin, Inc.Processes and methods for low energy carbon dioxide capture
WO2017144177A1 (en)2016-02-262017-08-31Keskin HüseyinDriving and/or flight simulator
WO2018017792A1 (en)2016-07-202018-01-25Novozymes A/SHeat-stable metagenomic carbonic anhydrases and their use
US10920213B2 (en)*2018-08-022021-02-16Sk Innovation Co., Ltd.Heat-resistant carbonic anhydrase mutants and composition for capturing carbon dioxide containing the same
WO2020194124A1 (en)*2019-03-262020-10-01Saipem S.P.A.Carbonic anhydrase variants for improved co2 capture
WO2022178376A2 (en)*2021-02-192022-08-25Alliance For Sustainable Energy, LlcEngineered thermostable carbonic anhydrase enzymes
WO2024118901A2 (en)*2022-11-302024-06-06Novozymes A/SCarbonic anhydrase variants and polynucleotides encoding same

Non-Patent Citations (97)

* Cited by examiner, † Cited by third party
Title
"Soc. App. Bacteriol", 1980
AKDEMIR ET AL.: "The extremo-α-carbonic anhydrase (CA) from Sulfurihydrogenibium azorense, the fastest CA known, is highly activated by amino acids and amines", BIOORG. MED. CHEM. LETT, vol. 23, 2013, pages 1087 - 1090
ANNA DI FIORE ET AL: "Thermostable Carbonic Anhydrases in Biotechnological Applications", INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, vol. 16, no. 7, 8 July 2015 (2015-07-08), pages 15456 - 15480, XP055340373, DOI: 10.3390/ijms160715456*
ATSUMI ET AL.: "Direct photosynthetic recycling of carbon dioxide to isobutyraldehyde", NATURE BIOTECHNOLOGY, vol. 27, no. 12, 2009, pages 1177 - 1180, XP008153202, DOI: 10.1038/nbt.1586
BALLEZA ET AL., FEMS MICROBIOL. REV, vol. 33, no. 1, 2009, pages 133 - 151
BARTON ET AL., NUCLEIC ACIDS RES., vol. 18, 1990, pages 7349 - 4966
BHATTACHARYA ET AL., BIOTECHNOL. APPL. BIOCHEM, vol. 38, 2003, pages 111 - 117
BOWIE ET AL., SCIENCE, vol. 247, 1990, pages 1306 - 1310
BOWIESAUER, PROC. NATL. ACAD. SCI. USA, vol. 86, 1989, pages 2152 - 2156
BURKE ET AL., PROC. NATL. ACAD. SCI. USA, vol. 98, 2001, pages 6289 - 6294
BYUNG HOON JO ET AL: "Engineering de novo disulfide bond in bacterial [alpha]-type carbonic anhydrase for thermostable carbon sequestration", SCIENTIFIC REPORTS, vol. 6, no. 1, 1 July 2016 (2016-07-01), XP055698957, DOI: 10.1038/srep29322*
CALISSANOMACINO: "Fungal Genet", NEWSLETT, vol. 43, 1996, pages 15 - 16
CARTER ET AL., PROTEINS: STRUCTURE, FUNCTION, AND GENETICS, vol. 6, 1989, pages 240 - 248
CHOI ET AL., J. MICROBIOL. METHODS, vol. 64, 2006, pages 391 - 397
CHRISTENSEN ET AL., BIOLTECHNOLOGY, vol. 6, 1988, pages 1419 - 1422
COLLINS-RACIE ET AL., BIOTECHNOLOGY, vol. 13, 1995, pages 982 - 987
CONTRERAS ET AL., BIOTECHNOLOGY, vol. 9, 1991, pages 378 - 381
COOPER ET AL., EMBO J, vol. 12, 1993, pages 2575 - 2583
COWAN ET AL., ANN. NY ACAD. SCI, vol. 984, 2003, pages 453 - 469
COWAN ET AL., ANN. NYACAD. SCI, vol. 984, 2003, pages 453 - 469
CUNNINGHAMWELLS, SCIENCE, vol. 244, 1989, pages 1081 - 1085
DATABASE Uniparc [online] 24 July 2014 (2014-07-24), ANONYMOUS: "UPI0004D9B68F | UniParc | UniProt", XP093268431, retrieved from https://www.uniprot.org/uniparc/UPI0004D9B68F/entry Database accession no. UPI0004D9B68F*
DAVIS ET AL.: "Basic Methods in Molecular Biology", 2012, ELSEVIER
DAWSON ET AL., SCIENCE, vol. 266, 1994, pages 776 - 779
DERBYSHIRE ET AL., GENE, vol. 46, 1986, pages 145
DONALD ET AL., J. BACTERIOL, vol. 195, no. 11, 2013, pages 2612 - 2620
E. ALPERW.D. DECKWER: "Mass Transfer with Chemical Reaction in Multiphase Systems", 1983, SPRINGER SCIENCE & BUSINESS MEDIA, article "Some aspects of gas absorption mechanism in slurry reactors", pages: 199 - 224
EATON ET AL., BIOCHEMISTRY, vol. 25, 1986, pages 505 - 512
FAVRE ET AL.: "Biocatalytic capture of CO with carbonic anhydrase and its transformation to solid carbonate", ENZYMATIC, vol. 60, no. 3-4, 2009, pages 163 - 170, XP026439361
FAVRE N ET AL: "Biocatalytic capture of CO"2 with carbonic anhydrase and its transformation to solid carbonate", JOURNAL OF MOLECULAR CATALYSIS B : ENZYMATIC,, vol. 60, no. 3-4, 1 October 2009 (2009-10-01), pages 163 - 170, XP026439361, ISSN: 1381-1177, [retrieved on 20090506]*
FORD ET AL., PROTEIN EXPRESSION AND PURIFICATION, vol. 2, 1991, pages 95 - 107
FREDSLUND ET AL., ENZYME MICROB. TECHNOL, vol. 114, 2018, pages 48 - 54
FREUDL, MICROBIAL CELL FACTORIES, vol. 17, 2018, pages 52
GEISBERG ET AL., CELL, vol. 156, no. 4, 2014, pages 812 - 824
GUOSHERMAN, MOL. CELLULAR BIOL, vol. 15, 1995, pages 5983 - 5990
HAMBRAEUS ET AL., MICROBIOLOGY, vol. 146, no. 12, 2000, pages 3051 - 3059
HAWKSWORTH ET AL.: "Ainsworth and Bisby's Dictionary of The Fungi", 1995, UNIVERSITY PRESS
HEINZE ET AL., BMC MICROBIOLOGY, vol. 18, 2018, pages 56
HILTON ET AL., J. BIOL. CHEM., vol. 271, 1996, pages 4699 - 4708
HUE ET AL., J. BACTERIOL, vol. 177, 1995, pages 3465 - 3471
J.E. REMIASJ.K. NEATHERYM. LIUK. LIU: "Tenth Annual Conference on Carbon Capture & Sequestration", 2 May 2011, article "Enhancing solvent regeneration with a high volatility liquid as a stripping carrier"
JUMPER ET AL., NATURE, vol. 596, 2021, pages 583 - 589
JUMPER ET AL.: "Highly accurate protein structure prediction with AlphaFold", NATURE, vol. 596, 2021, pages 583 - 589, XP037990370, DOI: 10.1038/s41586-021-03819-2
KABERDINBLÄSI, FEMS MICROBIOL. REV, vol. 30, no. 6, 2006, pages 967 - 979
KIM ET AL., MOL. CELLS, vol. 42, no. 6, 2019, pages 460 - 469
KOSARAJU ET AL., IND. ENG. CHEM. RES., vol. 44, 2005, pages 1250 - 1258
KREN ET AL., NAT. MED, vol. 4, 1998, pages 285 - 290
LABROU, PROTEIN DOWNSTREAM PROCESSING, vol. 1129, 2014, pages 3 - 10
LI ET AL., MICROBIAL CELL FACTORIES, vol. 16, 2017, pages 168
LIU ET AL., BIOTECHNOL. PROG., vol. 25, 2008, pages 68 - 74
LOWMAN ET AL., BIOCHEMISTRY, vol. 30, 1991, pages 10832 - 10837
LU ET AL.: "DOE Project No. DE-FC26-08NT0005498", NETL CO2 CAPTURE TECHNOLOGY FOR EXISTING PLANTS R&D MEETING, vol. 1997, 24 March 2009 (2009-03-24), pages 383 - 388
LUBERTOZZIKEASLING, BIOTECHN. ADVANCES, vol. 27, 2009, pages 53 - 75
MAJUMDAR ET AL., ALCHE, vol. 34, 1988, pages 1135 - 1145
MARTIN ET AL., J. IND. MICROBIOL. BIOTECHNOL., vol. 3, 2003, pages 568 - 576
MIRJAFARI ET AL., IND. ENG. CHE. RES., vol. 46, 2007, pages 921 - 926
MLADENOVSKAAHRING, FEMS MICROBIOL. ECOL, vol. 1997, 2000, pages 225 - 229
MOROZOV ET AL., EUKARYOTIC CELL, vol. 5, no. 11, 2006, pages 1838 - 1846
MUKHERJEE ET AL., TRICHODERMA: BIOLOGY AND APPLICATIONS, 2013
MURRAYZINDER, APPL. ENVIRON. MICROBIOL, vol. 50, 1985, pages 49 - 55
NEEDLEMANWUNSCH, J. MOL. BIOL, vol. 48, 1970, pages 443 - 453
NER ET AL., DNA, vol. 7, 1988, pages 127
NESS ET AL., NATURE BIOTECHNOLOGY, vol. 17, 1999, pages 893 - 896
OELKERS ET AL.: "Mineral carbonation of CO", ELEMENTS, vol. 4, 2008, pages 333 - 337
PATELGUPTA, INT. J. SYST. EVOL. MICROBIOL., vol. 70, 2020, pages 406 - 438
RASMUSSEN-WILSON ET AL., APPL. ENVIRON. MICROBIOL, vol. 63, 1997, pages 3488 - 3493
REDDY ET AL.: "NETLIDOE", 5 April 2003, SECOND NATIONAL CONFERENCE ON CARBON SEQUESTRATION
REIDHAAR-OLSONSAUER, SCIENCE, vol. 241, 1988, pages 53 - 57
RICE ET AL.: "EMBOSS: The European Molecular Biology Open Software Suite", TRENDS GENET, vol. 16, 2000, pages 276 - 277, XP004200114, DOI: 10.1016/S0168-9525(00)02024-2
ROMANOS ET AL., YEAST, vol. 8, 1992, pages 423 - 488
RUSSO ET AL.: "Post-combustion carbon capture mediated by carbonic anhydrase", SEP. PURIF. TECHNOL, vol. 107, 2013, pages 331 - 339, XP028997572, DOI: 10.1016/j.seppur.2012.06.022
S. SALMONA. HOUSE: "Final Scientific/Technical Report for DE-FE0007741", 2015, NATIONAL ENERGY TECHNOLOGY LABORATORY, article "Low-energy solvents for carbon dioxide capture enabled by a combination of enzymes and vacuum regeneration"
S. SALMONA. HOUSE: "Novel Materials for Carbon Dioxide Mitigation Technology", 2015, ELSEVIER B.V., article "Enzyme-catalyzed solvents for CO2 separation", pages: 23 - 86
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOR LAB., NY
SCHERERDAVIS, PROC. NATL. ACAD. SCI. USA, vol. 76, 1979, pages 4949 - 4955
SCHMOLLDATTENBÖCK: "Gene Expression Systems in Fungi: Advancements and Applications", FUNGAL BIOLOGY, 2016
SESHASAYEE ET AL., SUBCELLULAR BIOCHEMISTRY, vol. 52, 2011, pages 7 - 23
SMITH ET AL., J. MOL. BIOL., vol. 224, 1992, pages 899 - 904
SMOLKE ET AL.: "Synthetic Biology: Parts, Devices and Applications", REGULATED PROMOTERS IN YEAST: HOW TO DESIGN AND MAKE USE OF PROMOTERS IN S, 2018
SONG ET AL., PLOS ONE, vol. 11, no. 7, 2016, pages e0158447
STEVENS, DRUG DISCOVERY WORLD, vol. 4, 2003, pages 35 - 48
STOLAROFF ET AL., ENVIRON. SCI, 2008
STOLAROFF ET AL., ENVIRON. SCI. TECHNOL., vol. 42, 2008, pages 2728 - 2735
STORICI ET AL., NATURE BIOTECHNOL., vol. 19, 2001, pages 773 - 776
SVETINA ET AL., J. BIOTECHNOL, vol. 76, 2000, pages 245 - 251
TECHNOL., vol. 42, pages 2728 - 2735
TIAN, NATURE, vol. 432, 2004, pages 1050 - 1054
TRACHTENBERG ET AL., SAE INTERNATIONAL CONFERENCE ON ENVIRONMENTAL SYSTEMS DOCKET NUMBER, 24 January 2003 (2003-01-24), pages 99
VARADI ET AL., NUCLEIC ACIDS RESEARCH, vol. 50, 2021, pages 439 - 444
VOS ET AL., SCIENCE, vol. 255, 1992, pages 306 - 312
WILBUR, J. BIOL. CHEM., vol. 176, 1948, pages 147 - 154
WINGFIELD, CURRENT PROTOCOLS IN PROTEIN SCIENCE, vol. 80, no. 1, 2015, pages 1 - 35
WLODAVER ET AL., FEBS LETT, vol. 309, 1992, pages 59 - 64
XU ET AL., BIOTECHNOLOGY LETTERS, vol. 40, 2018, pages 949 - 955
YAN ET AL.: "Fabrication of single carbonic anhydrase nanogel against denaturation and aggregation at high temperature", BIOMACROMOLECULES, vol. 1997, 2007, pages 560 - 565, XP055270789, DOI: 10.1021/bm060746a
YELTON ET AL., PROC. NATL. ACAD. SCI. USA, vol. 81, 1984, pages 1470 - 1474
ZHANGSKOLNICK, PROTEINS, vol. 57, 2004, pages 702 - 710

Similar Documents

PublicationPublication DateTitle
US11773386B2 (en)Heat-stable metagenomic carbonic anhydrases and their use
US9909115B2 (en)Heat-stable persephonella carbonic anhydrases and their use
EP2446029B1 (en)Heat-stable carbonic anhydrases and their use
JP6068523B2 (en) Thermostable carbonic anhydrase and its use
AU2023400641A1 (en)Carbonic anhydrase variants and polynucleotides encoding same
WO2024244350A1 (en)Carbonic anhydrase variants and polynucleotides encoding same
WO2025168806A1 (en)Polypeptides having carbonic anhydrase activity and polynucleotides encoding same

Legal Events

DateCodeTitleDescription
121Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number:25705170

Country of ref document:EP

Kind code of ref document:A1


[8]ページ先頭

©2009-2025 Movatter.jp