Movatterモバイル変換


[0]ホーム

URL:


AU2014354631B2 - Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase - Google Patents

Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase
Download PDF

Info

Publication number
AU2014354631B2
AU2014354631B2AU2014354631AAU2014354631AAU2014354631B2AU 2014354631 B2AU2014354631 B2AU 2014354631B2AU 2014354631 AAU2014354631 AAU 2014354631AAU 2014354631 AAU2014354631 AAU 2014354631AAU 2014354631 B2AU2014354631 B2AU 2014354631B2
Authority
AU
Australia
Prior art keywords
seq
plant
pufa synthase
pufa
genetically modified
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
AU2014354631A
Other versions
AU2014354631A1 (en
Inventor
Scott A. Bevan
Daniel J. Gachotte
Cory M. LARSEN
P. Ann Owens MERLO
James G. Metz
Terence A. Walsh
Ross Zirkle
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
DSM IP Assets BV
Corteva Agriscience LLC
Original Assignee
DSM IP Assets BV
Corteva Agriscience LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by DSM IP Assets BV, Corteva Agriscience LLCfiledCriticalDSM IP Assets BV
Publication of AU2014354631A1publicationCriticalpatent/AU2014354631A1/en
Priority to AU2017272160ApriorityCriticalpatent/AU2017272160B2/en
Application grantedgrantedCritical
Publication of AU2014354631B2publicationCriticalpatent/AU2014354631B2/en
Priority to AU2019250139Aprioritypatent/AU2019250139A1/en
Assigned to DSM IP ASSETS B.V., CORTEVA AGRISCIENCE LLCreassignmentDSM IP ASSETS B.V.Request to Amend Deed and RegisterAssignors: DOW AGROSCIENCES, LLC, DSM IP ASSETS B.V.
Ceasedlegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

This disclosure concerns recombinant host organisms genetically modified with a polyunsaturated fatty acid (PUFA) synthase system and one or more accessory proteins that allow for and/or improve the production of PUFAs in the host organism. The disclosure also concerns methods of making and using such organisms as well as products obtained from such organisms.

Description

PRODUCTION OF OMEGA-3 LONG-CHAIN
POLYUNSATURATED FATTY ACIDS IN OILSEED CROPS BY A TIIRAUSTOCHYTRID PUFA SYNTHASE
PRIORITY CLAIM
This application claims the benefit of the filing date of United States Provisional Patent Application Serial Number 61/909,289, filed November 26, 2013, for “PRODUCTION OF OMEGA-3 LONG-CHAIN POLYUNSATURATED FATTY ACIDS IN OILSEED CROPS BY A TIIRAUSTOCHYTRID PUFA SYNTHASE.”
TECHNICAL FIELD
The present disclosure generally relates to recombinant host organisms (e.g., plants) genetically modified with a polyunsaturated fatly acid (PUFA) synthase system and one or more accessory proteins that allow for and/or improve the production of PUFAs in the host organism. The present invention also relates to methods of making and using such organisms (e.g., to obtain PUFAs) as well as products obtained from such organisms (e.g., oil and seed).
BACKGROUND
Reference to any prior art in the specification is not, and should not be taken as, an acknowledgment or any form of suggestion that this prior art forms part of the common general knowledge in Australia or any other jurisdiction or that this prior art could reasonably he expected to be ascertained, understood and regarded as relevant by a person skilled in the art.
Polyunsaturated fatty acids (PUFAs) are considered to be useful for nutritional, pharmaceutical, and industrial applications, as well as other purposes. However, the current supply of PUFAs from natural sources (e.g., fish oils and algal oils) and from chemical synthesis is not sufficient or cost-effective for many long-term commercial needs.
Vegetable oils derived from plants (e.g., oil seed crops) are relatively inexpensive, do not have the contamination issues associated with fish oils, and are considered sustainable. However, the PUFAs found in commercially-developed plants and plant oils do not typically include more saturated or longer-chain PUFAs, and only typically include fatty acids such as linoleie acid (eighteen carbons with 2 double bonds, in the delta 9 and 12 positions — 18:2 delta 9,12) and linolenic acid (18:3 delta 9,12,15).
The production of more unsaturated or longer-chain PUFAs in plants by the modification of the fatty acids endogenously produced by plants has been described.
For example, the genetic modification of plants with various individual genes encoding fatty acid elongases and/or desaturases has been described as resulting in the generation of leaves or seeds containing significant levels of longer-chain and more unsaturated PUFAs, such as docosahexaenoic acid (DHA) and eicosapentaenoic acid (EPA), but also containing significant levels of mixed shorter-chain and less unsaturated PUFAs. Qi et al. (2004) Nature Biotech. 22:739; PCT International Patent Publication No. WO 04/071467; Abbadi et al. (2004) Plant Cell 16:1; Napier and Sayanova (2005) Proc. Nutr. Soc. 64:387-93; Robert et al. (2005) Functional Plant Biol. 32:473-79; U.S. Patent Publication No. 2004/0172682; Petrie et al. (2012) PLOS One 7:e49165; and U.S. Provisional Application No. 61/345,537 (filed May 17,2010).
DISCLOSURE
Described herein are methods and compositions that may be utilized to produce LC-PUFAs in transgenic host organisms (e.g., plant cells, plant parts, and plants), as well as quantities of non-natural plant lipids, for example, triacylglycerol (TAG) and phospholipids (PL) that are enriched in such PUFAs, and to produce them with longer backbones and with a greater degree of unsaturation than was previously available. Also described herein are systems for producing PUFAs in a host plant by providing a host organism that is genetically modified with a functional PUFA synthase system.
Some embodiments herein include a genetically modified plant cell (e.g., a canola, soy, and/or Arabidopsis plant cell) that comprises polynucleotide(s) encoding at least one polypeptide of polyunsaturated fatty acid (PUFA) synthase (PFA1, PFA2, and PFA3) from a Thraustochytrid Schizochytrium alga (a representative example of which is the Schizochytrium sp. as deposited under ATCC Accession No. PTA-9695) and a phosphopantetheinyl transferase (Hetl) from the cyanobacteria genus, Nostoc, the expression products of which reconstitute a functional PUFA synthase system. For example, a plant cell may comprise polynucleotide(s) encoding Schizochytrium PFA1, PFA2, and PFA3; and Nostoc Hetl. In some embodiments, the plant cell also comprises at least one polynucleotide encoding the acyl-CoA synthetase isozyme 2 (SzACS2). In particular embodiments, polynucleotide(s) encoding at least one DHA-preferring accessory protein are also expressed in a host plant cell to promote incorporation of LC-PUFA into seed oil (e.g., ACS, DGAT, LPAT, LPCAT, and PDAT).
Particular embodiments herein include a genetically modified plant (e.g., a Brassica, Glycine, Arabidopsis, and oilseed crop plant) comprising such a plant cell, i as well as descendants, seeds, tissues, or parts thereof. In particular embodiments, the genetically modified plant cell is an oilseed crop plant cell; for example and without limitation, safflower, sunflower, and palm). A genetically modified oilseed crop plant cell expressing a functional PUFA synthase system may generate a characteristic fatty acid profile, for example, including DHA-containing vegetable oils. ) Some embodiments provide polynucleotide(s) encoding components of the functional PUFA synthase system (e.g., Schizochytrium PFA1, PFA2, PFA3, and Nostoc Hetl), and optionally, at least one accessory protein (e.g., SzACS2). In some examples, these polynucleotides are comprised in a single recombinant expression vector. In other examples, the polynucleotides are contained in different i recombinant expression vectors.
In particular embodiments, polynucleotide(s) encoding the components of the functional PUFA synthase system, and optionally, at least one accessory protein, are operably linked to a seed-specific promoter. For example, the polynucleotide(s) may be operably linked in particular examples to a promoter selected from the group ) consisting of PvDlec2; PvPhas; LfKCS3; FAE1; BoACP; BnaNapinC; SSPR02745.1; and SSPR02743.1, which promoter elements are exemplified herein. In some embodiments, the polynucleotide(s) are operably linked to a constitutive promoter (e.g., ubiquitin and CsVMV promoters), or a leaf-specific promoter. Additional promoters may be employed in particular embodiments to drive expression i of the functional PUFA synthase system during different stages of growth and/or at higher levels during seed development, for example, to provide increased accumulation of LC-PUFAs.
In other embodiments, the polynucleotide(s) encoding the components of the functional PUFA synthase encode a polypeptide comprising an amino acid sequence ) that is at least 80% identical to SEQ ID NO:l; a polypeptide comprising an amino acid sequence that is at least 80% identical to SEQ ID NO:4; and/or a polypeptide comprising an amino acid sequence that is at least 80% identical to SEQ ID NO:7 or SEQ ID NO: 14, wherein the polynucleotide(s) also comprise a Hetl gene (e.g., a polynucleotide encoding a polypeptide having at least 80% identity to the encoded product of SEQ ID NO: 10). In some embodiments, the polynucleotide(s) also comprise a SzACS2 gene (e.g., a polynucleotide encoding a polypeptide having at least 80% identity to the encoded product of SEQ ID NO: 11).
In particular examples, the polynucleotide(s) encoding the components of the functional PUFA synthase comprise a polynucleotide that is at least 70% identical to SEQ ID NO:2 and/or SEQ ID NO:3; a polynucleotide that is at least 70% identical to SEQ ID NO:5 and/or SEQ ID NO:6; a polynucleotide that is at least 70% identical to SEQ ID NO:8, SEQ ID NO:9, and/or SEQ ID NO: 13; and/or a 1 polynucleotide that encodes a polypeptide having at least 80% identity to SEQ ID NO: 10. In certain examples, the polynucleotide(s) also comprise a polynucleotide that encodes a polypeptide having at least 80% identity to SEQ ID NO: 11.
In particular examples, the polynucleotide(s) encoding the components of the functional PUFA synthase hybridize under stringent conditions (e.g., very stringent conditions) to SEQ ID NO:2 and/or SEQ ID NO:3; SEQ ID NO:5 and/or SEQ ID NO:6; SEQ ID NO:8, SEQ ID NO:9, and/or SEQ ID NO:13; SEQ ID NO: 10; and/or SEQ ID NO:ll. In certain examples, the polynucleotide(s) hybridize under stringent conditions to all of SEQ ID NO:2; SEQ ID NO:5; SEQ ID NO:8; SEQ ID NO:10; and SEQ IDNO:ll. i Also described herein is a method for producing a transgenic plant, wherein the method comprises introducing polynucleotide(s) encoding the components of the functional PUFA synthase into a plant cell; and regenerating a plant from the plant cell. In some embodiments, the polynucleotide(s) are transformed into a crop plant cell in a single vector. In other embodiments, the polynucieotide(s) are transformed into a crop plant cell via multiple vectors to generate a plant containing constituent genes in different events. In some embodiments, the polynucleotide(s) are introduced into the crop plant cell by introgression via conventional breeding. A functional PUFA synthase system is reconstituted in particular examples by breeding crosses to optimize the PUFA profile of the progeny plants (e.g., by selecting for plants producing DHA i (C22:6, n-3) and/or EPA (C20:5, n-3)).
In some embodiments, a genetically modified plant, a descendant, cell, tissue, seed, or part thereof, or a non-natural oil (e.g., a raw seed oil) obtained from the genetically modified plant, descendant, seed, cell, tissue, or part thereof comprises detectable amounts of DHA, DPA(n-6) (C22:5, n-6), and/or EPA. In particular embodiments, the genetically modified plant, descendant, cell, tissue, seed, or part or oil thereof comprises a fatty acid profile having between 0.01% and 15% DHA by weight (e.g., between 0.05% and 10% DHA; or 0.05% to 5% DHA by weight). In particular embodiments, the genetically modified plant, descendant, cell, tissue, seed, or part or oil thereof comprises a fatty acid profile having between 0.01% and 10% EPA by weight (e g., between 0.05% and 5% EPA; or 0.05% to 1%EPA by weight). In particular embodiments, the genetically modified plant, descendant, cell, tissue, seed, or part or oil thereof comprises a fatty acid profile having between 0.01% and 10% DPA(n-6) by weight (e.g., between 0.01% and 5% DPA(n-6); or 0.01% to 1% DPA(n-6) by weight). In particular embodiments, the genetically modified plant, descendant, cell, tissue, seed, or part or oil thereof comprises a fatty acid profile having a ratio of EPArDHA of from 10:1 to 1:30 (e.g., from about 2:1 to about 1:10, from about 1:1 to about 1:12, from about 2:1 to about 1:11, from about 1:1.5 to about 1:5, from about 6:1 to about 1:6.5, and about 1:1.25) by weight of total fatty acids. In particular embodiments, the genetically modified plant, descendant, cell, tissue, seed, or part or oil thereof comprises a fatty acid profile having a ratio of DPA(n-6):DHA of from 1:1 to 1:10 (e.g., from 1:2 to 1:5, from about 1:3 to about 1:5, about 1:3 to about 1:6, and about 1:5) by weight of total fatty acids. In particular embodiments, the genetically modified plant, descendant, cell, tissue, seed, or part or oil thereof comprises a fatty acid profile having from 70% to 99% triglycerides by weight of the oil.
In some embodiments, detectable amounts of DHA, DPA(n-6) and/or EPA are also found in a plant commodity product (e.g., oil product, specialty oil product, grain, and meal) obtained from the genetically modified plant. LC-PUFA-containing vegetable oils that are obtained from genetically modified plants described herein may be used as low cost sources of DHA/EPA for food ingredients toward improved human nutrition. Oils and oilseeds that are obtained from genetically modified plants described herein may be used as low cost, high quality co-3 LC-PUFA sources for animal feed and aquaculture, or as feedstock for inter-esterification to build structured lipids enriched in co-3 LC-PUFAs for pharmaceutical and nutraceutical uses.
Also described herein are methods for producing an oil comprising at least one LC-PUFA, wherein the method comprises growing a genetically modified plant (,e.g., an oilseed plant), descendant, cell, tissue, or part thereof described herein, and/or recovering oil (e.g., a seed oil) from a genetically modified plant described herein.
Also described herein are methods for providing a supplement or therapeutic product containing at least one LC-PUFA to an individual, wherein the method comprises providing to the individual a genetically modified plant, descendant, cell, tissue, or part thereof described herein, an oil described herein, a seed described herein, a food product described herein, a functional food described herein, or a pharmaceutical product described herein. In some embodiments, a LC-PUFA 1 contained in such embodiments is DHA, DPA(n-6), and/or EPA.
The foregoing and other features will become more apparent from the following detailed description of several embodiments, which proceeds with reference to the accompanying figures. BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 includes a summary of exemplary PUFA synthase constructs utilized in some embodiments. The direction of transcription is shown above each boxed PTU, the coding sequence is noted above each PTU, and the promoter / terminator combinations used are indicated. i FIG. la includes PUFA synthase constructs arranged in “Orientation 1.” FIG. lb includes constructs arranged in “Orientation 2.” FIG. lc includes constructs arranged in “Orientation 3.” FIG. 2 includes a IIPLC radiometric trace of 14C-labeled EPA, DHA, and DPA standards. Retention times for EPA and DHA are labeled at ~5.3 and ~6.1 minutes, I respectively. FIG. 3 includes a summary of LC-PUFA content in T2 seeds of Arabidopsis events transformed with the PUFA synthase and Hetl transgenes. Each vertical bar represents the LC-PUFA content from one Arabidopsis event; black = DHA, dark gray = EPA, light gray = DPA (n-6). I FIG. 4 includes a graphical representation of LC-PUFA content of T3 seed progeny from selected transgenic Arabidopsis DHA-producing T? lines. Each circle represents T3 seed from one homozygous T2 plant. The gray bar represents the average LC-PUFA content for each T3 seed line. FIG. 5 includes a graphical representation of LC-PUFA content of individual T2 seeds from four homozygous and five hemizygous Ti canola plants. 48 seeds were analyzed per line. “PUFAs” represent the total of DUA + EPA + DPA (n = 6). The ordinate axis corresponds to the LC-PUFA content as % of total FAMEs. 5
SEQUENCE LISTING
The nucleic acid sequences listed in the accompanying sequence listing are shown using standard letter abbreviations for nucleotide bases, as defined in 37 C.F.R. § 1.822. Only one strand of each nucleic acid sequence is shown, but the ) complementary strand is understood to be included by any reference to the displayed strand. In the accompanying sequence listing: SEQ ID NO: 1 shows the amino acid sequence of an exemplary PFA1 protein:
MDTRIAIVGMSAILPSGENVRESWEAIRDGLDCLSDLPADRVDVTAYYNPEKTTKDK IYCKRGGFIPEYDFDAREFGLNMFQMEDSDANQTISLLKVKEALTDANIPAFSSGKK 5 NIGCVLGIGGGQKASHEFY SRLNYWVDKVLRKMGL PEE DVAAAVDKYKAS FPEWRL
DSFPGFLGNVTAGRCCNTFNMEGMNCWDAACASSLIAVKVAIEELLYGDCDAMIAG ATCTDNSIGMYMAFSKTPVFSTDPSVKAYDAATKGMLIGEGSAMLVLKRYADAVRDG DTVHAVIKGCASSSDGKAAGIYTPTISGQEEALRRAYARANVDPATVTLVEGHGTGT PVGDKIELTALSNLFSKAFSANGGGAEEAEQVAVGSIKSQIGHLKAVAGLAGLVKVV 3 LALKHKTLPQTINVDKPPSLVDGTPIQQSPLYVNTMNRPWFTPVGVPRRAGVSSFGF GGANYHAVLEEFEPEHESAYRYNNLPQVALLHAGDVATLAATVRAKLALATAEQEEA RVVKNADYIAYHRFLDECKLRGAVPQAHARVGLLVRDLSSLIAVLEAAAAKLAGEES ATEWTVSVATGEAAFRVRGVATEANVAALFSGQGAQYTHMFSDVAMNWPPFRESVAA MDRAQRERFGRPAKRVSSVLYPRKPYGDEPRQDHKEISQTRYSQPATLACSVGAFDI 5 FKAAARAPSFAAGHSLGEFAALYAAGSLDRDAVFDLVCARAKAMSDFTAQASSSGGA MAAVIGAKADQLSLGGAPDVWLANSNSPSQTVITGTAEAVAAASDKLRCSGNFRWP LACEAAFH S PHMRGAEQT FASALAQAPVSAPAAARFYSNVT GGAAVT S PADVKTNLG KHMTSPVQFVQQVRAMHAAGARVFVEFGPKQVLSRLVKETLGEAGDVVTVAVNPDSA KDSDTQLRQAALTLAVAGVPLKDFDRWQLPDATRLEPVKKKKTTLRLSAATYVSAKT ) LRQREAVLNDGYTVSGATAWKEVDTANEERLVRQAQDLQRQLAEASTAAQAAQSKV AELERTIQDLERKVQQQQQEKGENSDSNAAAEVLRRHKELLQRMLQDCDEQAVPVAT VVPTPTSSPTPTSSPVSGNSKSTRGSADLQALLAKAETVVMAVLAAKTGYEADMVEA DMDLEAELGIDSIKRVEILSEVQGQLGVEAKDVDALSRTRTVGEVVDAMKAEIVAAS GGSAPAVPSAPAASAAPTPAASTAPSADLQALLSKAETVVMAVLAAKTGYEADMVEA ) DMDLEAELGIDSIKRVEILSEVQGQLGVEAKDVDALSRTRTVGEVVDAMKAEIVAAS AGSAPAPAVPSAPAASAAPTPAASTAPSADLQALLSKAETVVMAVLAAKTGYEADMV EADMDLEAELGIDSIKRVEILSEVQGQLGVEAKDVDALSRTRTVGEWDAMKAEIVA AS GGSAPAPAVP SAPAASAAP T PAAATAP SADLQALLAKAETVVMAVLAAKTGYEAD MVEADMDLEAELGIDSIKRVEILSEVQGQLGVEAKDVDALSRTRTVGEVVDAMKAEI ) VAASAGSAPAPAVPSAPAASAAPTPAASTAPSADLQALLSKAETVVMAVLAAKTGYE ADMVEADMDLEAELGIDSIKRVEILSEVQGQLGVEAKDVDALSRTRTVGEWDAMKA EIVAASGGSAPAAAVPSAPAASAAPTPATAPSADLQALLSKAETWMAVLAAKTGYE ADMVEADMDLEAELGIDSIKRVEILSEVQGQLGVEAKDVDALSRTRTVGEVVDAMKA
EIVAASGGSAPAAPSAPALLPTLFGSECEDLSLTFPVITTLPLPAELVLAEGGARPV WVDDGSALTSSLVSSLGDRAVLLQVQSSSACSPRSTTHKLVTVADRSEAALQAALT SVEAQFGKVGGFVFQFGDDDVQAQLGWALLAAKHLKTSLSEQIEGGRTFFVAVARLD GQLGLSGKSTTATVDLSRAQQGSVFGLCKTLDLEWPAVFCRGIDLAADLDAAQAARC ) LLGELSDPDVAVRE SGYSASGQRCTTTTKSLTTGKPHQPISSSDLFLVSGGARGITP LCVRELAQRVGGGTYVLIGRSELPTTEPAWAVGVESGKPLEKAALAFLKAEFAAGRG AKPTPMLHKKLV GAW GAREVRAS LAEITAQGATAVYESCDVSSAAKVREMVERVQQ QGGRRVSGVFHASGVLRDKLVENKSLADFSAVYDTKVGGLINLLACVDLAQLRHLVL FS S LAGFHGNVGQSDYAMANEALNKLAAHLSAVHPQLCARSIC FGPWDGGMVTPALK ) ANFIRMGIQIIPRQGGAQTVANMLVSSSPGQLLVGNWGVPPVVPSATEHTVLQTLRQ SDNPFLDSHVIQGRRVLPMTLAVGYMAHQAQSIYAGHQLWAVE DAQLFKGIAIDNGA DVPVRVELSRRKEEQEDAGKVKVKVQVLLKSQVNGKSVPAYKATVVLSPAPRPSVIT RDFDLTPDPACTEHDLYDGKTLFHGKAFQGIEQVLSATPKQLTAKCRNLPLTPEQRG QFVVNLSQQDPFQADIAFQAMLVWARMLRQSAALPNNCERFDFYKPMAPGATYYTSV i KLASASPLV
DSVCKCTVAMHDEQGEVYFSARASWLNKTLTY SEQ ID N0:2 shows the nucleotide sequence of an exemplary PFA1 gene, referred to herein as PFA1 vl, isolated from Thraustochytrid Schizochytrium sp. (as represented by ATCC Accession No. PTA-9695):
) ATGGATACTCGCATCGCGATCGTGGGGATGTCGGCGATCCTGCCGAGCGGGGAGAAC GTGCGCGAGAGCTGGGAGGCGATCCGCGATGGGCTGGATTGCCTGAGCGATCTGCCG GCGGACCGCGTGGACGTGACGGCCTACTACAACCCGGAGAAGACGACCAAGGACAAG ATCTACTGCAAGCGCGGCGGGTTCATCCCGGAGTACGACTTCGACGCGCGTGAGTTC GGGCTCAACATGTTCCAGATGGAGGACTCGGACGCCAACCAGACGATCTCGCTGCTC ! AAGGTGAAGGAGGCGCTGACGGACGCCAACATCCCGGCGTTCTCGAGCGGTAAGAAG AACATCGGCTGCGTGCTGGGCATCGGCGGCGGCCAGAAGGCGAGCCACGAGTTCTAC TCGCGGCTCAACTACGTGGTCGTGGACAAGGTGCTGCGCAAGATGGGCCTGCCGGAG GAAGACGTGGCGGCGGCGGTGGACAAGTACAAGGCGAGTTTCCCCGAGTGGCGCCTC GACTCTTTCCCCGGGTTCCTGGGCAACGTCACGGCGGGGCGCTGCTGCAATACCTTC i AACATGGAGGGCATGAACTGCGTCGTGGACGCGGCCTGCGCGTCGTCGCTGATCGCG GTCAAAGTGGCGATCGAGGAGCTGCTCTACGGCGACTGCGATGCGATGATCGCGGGT GCCACCTGCACGGACAACTCGATCGGGATGTACATGGCCTTCTCCAAGACGCCCGTG TTTTCCACGGACCCGAGCGTCAAGGCGTACGACGCCGCCACCAAAGGCATGCTCATC GGCGAGGGCTCGGCGATGCTCGTGCTGAAGCGCTACGCGGACGCCGTGCGCGACGGC GACACCGTGCACGCCGTCATCAAGGGGTGCGCGTCCTCGAGCGACGGCAAGGCGGCG GGCATCTACACGCCGACAATCTCGGGCCAGGAGGAGGCCCTGCGCCGCGCCTACGCC ' CGCGCCAATGTCGACCCGGCCACTGTGACGCTGGTGGAGGGCCACGGCACGGGTACG CCGGTGGGCGACAAGATCGAGCTGACGGCGCTGAGCAACCTCTTCTCCAAGGCGTTT TCTGCCAACGGTGGCGGCGCGGAGGAAGCAGAGCAGGTGGCGGTGGGCAGCATCAAG TCGCAGATCGGGCACCTCAAGGCGGTGGCCGGGCTGGCCGGGCTGGTCAAGGTGGTG CTGGCGCTCAAGCACAAGACGCTGCCGCAGACGATCAACGTCGACAAGCCGCCGTCG CTGGTGGACGGGACCCCGATCCAGCAGTCGCCGCTGTACGTCAACACGATGAACCGC CCCTGGTTCACGCCCGTAGGGGTGCCGCGCCGCGCCGGCGTGTCGTCGTTTGGGTTT GGCGGTGCCAACTACCACGCCGTGCTGGAGGAGTTTGAGCCCGAGCACGAGAGCGCG TACCGGTACAACAACCTGCCGCAGGTGGCGCTGCTGCACGCGGGGGACGTCGCGACC TTGGCGGCGACGGTTCGCGCCAAGCTGGCGCTGGCCACCGCCGAGCAGGAAGAGGCG CGTGTGGTGAAGAACGCGGACTACATCGCGTACCACCGGTTCCTGGACGAGTGCAAG
TTGCGCGGCGCTGTGCCGCAGGCGCACGCGCGGGTGGGACTGCTCGTACGGGACCTG AGCTCGCTCATCGCCGTGCTCGAGGCCGCTGCCGCCAAGCTCGCGGGCGAAGAGAGC GCGACGGAGTGGACGGTCAGCGTTGCTACGGGCGAGGCGGCCTTCCGCGTGCGCGGT GTGGCTACGGAGGCCAACGTGGCGGCGCTGTTCTCGGGCCAGGGCGCGCAGTACACG 5 CACATGTTCAGCGACGTGGCGATGAACTGGCCCCCGTTCCGCGAGAGCGTCGCCGCC ATGGACCGCGCCCAGCGCGAGCGCTTCGGGCGGCCTGCCAAGCGCGTGAGCAGCGTG CTGTACCCGCGCAAGCCGTACGGCGACGAACCGCGGCAGGACCACAAGGAGATCTCG CAAACGCGCTACTCGCAGCCCGCAACGCTCGCGTGCTCGGTCGGCGCCTTTGACATC TTCAAAGCGGCGGGACTGGCGCCGAGCTTTGCGGCGGGCCACTCGCTGGGCGAGTTT 3 GCGGCGCTCTACGCGGCCGGGTCGCTCGATCGCGACGCCGTCTTCGACCTGGTCTGC GCGCGCGCCAAGGCCATGAGCGACTTCACGGCCCAGGCCAGCAGCAGCGGTGGCGCC ATGGCGGCGGTGATTGGCGCCAAGGCGGACCAGCTCTCGCTGGGTGGCGCGCCCGAC GTGTGGCTCGCCAACAGCAACTCGCCCTCGCAGACCGTGATCACGGGAACCGCCGAA GCAGTGGCTGCGGCCTCTGACAAGTTGCGCTGCAGCGGCAACTTCCGCGTCGTGCCT 5 CTGGCCTGCGAGGCGGCCTTCCACTCGCCGCACATGCGCGGCGCGGAGCAGACGTTT GCGTCGGCGCTCGCGCAGGCGCCCGTGTCGGCACCGGCGGCTGCTCGGTTCTACTCT AACGTGACGGGGGGCGCCGCGGTAACCTCGCCCGCGGACGTCAAAACGAACCTGGGC AAGCACATGACGAGCCCTGTGCAGTTCGTGCAGCAGGTGCGAGCCATGCACGCGGCG GGCGCGCGTGTGTTTGTGGAGTTTGGGCCCAAGCAGGTCCTGTCGCGCCTCGTCAAG ) GAGACCCTTGGCGAGGCCGGCGACGTGGTCACGGTCGCCGTCAACCCAGACTCGGCC AAGGACAGCGACACGCAGCTGCGCCAGGCGGCGCTCACGTTGGCGGTCGCCGGCGTG CCGCTCAAGGACTTTGACCGCTGGCAGCTGCCGGATGCCACGCGCCTCGAGCCTGTC AAGAAGAAGAAGACCACGTTGCGGCTCTCGGCAGCCACCTACGTCTCCGCCAAGACG TTGCGCCAGCGCGAGGCCGTGCTCAACGACGGCTACACTGTCAGTGGTGCCACGGCG ) GTAGTCAAGGAAGTGGACACGGCCAACGAGGAGCGTCTCGTCCGCCAAGCCCAGGAT CTCCAGCGCCAGCTCGCGGAGGCCTCGACGGCAGCCCAGGCGGCGCAGTCCAAGGTC GCGGAGCTCGAGCGCACGATCCAGGACTTGGAGCGCAAGGTGCAGCAGCAGCAGCAA GAGAAGGGTGAGAACTCAGACAGCAACGCTGCCGCCGAAGTGCTGCGGCGCCACAAG GAGCTGCTCCAGCGCATGCTGCAGGACTGTGACGAGCAGGCAGTGCCCGTAGCCACG ) GTGGTTCCGACACCTACGTCCTCCCCGACGCCTACATCCTCACCCGTATCCGGCAAC AGCAAGAGCACTCGTGGCAGTGCTGATCTGCAAGCGCTGCTGGCCAAGGCGGAGACT GTGGTGATGGCTGTGCTGGCTGCCAAGACTGGCTACGAGGCCGACATGGTTGAGGCG GACATGGACCTGGAGGCCGAGCTCGGCATCGACTCGATCAAGCGCGTGGAGATCCTT TCCGAGGTGCAGGGCCAGCTGGGCGTCGAGGCCAAGGACGTGGATGCGCTGAGCCGC > ACGCGCACGGTCGGTGAGGTTGTGGACGCCATGAAGGCGGAGATCGTGGCTGCCTCT GGTGGTAGTGCTCCTGCGGTTCCTTCGGCGCCCGCTGCTTCTGCAGCTCCGACTCCC GCTGCTTCGACTGCGCCTTCTGCTGATCTGCAAGCGCTGCTGTCCAAGGCGGAGACT GTGGTGATGGCTGTGCTGGCGGCCAAGACTGGCTACGAGGCCGACATGGTCGAGGCG GACATGGACCTGGAGGCCGAGCTCGGCATCGACTCGATCAAGCGCGTGGAGATCCTC ) TCGGAGGTGCAGGGCCAGCTGGGCGTCGAGGCCAAGGACGTGGATGCGCTGAGCCGC ACGCGCACGGTCGGTGAGGTTGTGGATGCCATGAAGGCGGAAATCGTGGCTGCCTCT GCTGGTAGTGCTCCTGCTCCTGCTGTTCCTTCGGCGCCCGCTGCTTCTGCAGCTCCG ACTCCCGCTGCTTCGACTGCGCCTTCTGCTGATCTGCAAGCGCTGCTGTCCAAGGCG GAGACGGTGGTGATGGCTGTGCTGGCGGCCAAGACTGGCTACGAGGCCGACATGGTC i GAGGCGGACATGGACCTGGAGGCCGAGCTCGGCATCGACTCGATCAAGCGCGTGGAG ATCCTCTCGGAGGTGCAGGGCCAGCTGGGCGTCGAGGCCAAGGACGTGGATGCGCTG AGCCGCACGCGCACGGTCGGTGAGGTTGTGGATGCCATGAAGGCGGAAATCGTGGCT GCCTCTGGTGGTAGTGCTCCTGCTCCTGCGGTTCCTTCGGCGCCCGCTGCTTCTGCA GCTCCGACTCCCGCGGCTGCGACAGCGCCTTCTGCTGATCTGCAAGCGCTGCTGGCC ) AAGGCGGAGACTGTGGTGATGGCTGTGCTGGCGGCCAAGACTGGCTACGAGGCCGAC
ATGGTCGAGGCGGACATGGACCTGGAGGCCGAGCTCGGCATCGACTCGATCAAGCGC GTGGAGATCCTTTCCGAGGTGCAGGGCCAGCTGGGCGTCGAGGCCAAGGACGTAGAT GCGCTGAGCCGCACGCGCACGGTCGGTGAGGTTGTGGATGCCATGAAGGCGGAGATC GTGGCTGCCTCTGCTGGTAGTGCTCCTGCTCCTGCTGTTCCTTCGGCGCCCGCTGCT ί TCTGCAGCTCCGACTCCCGCTGCTTCGACTGCGCCTTCTGCTGATCTGCAAGCGCTG CTGTCCAAGGCGGAGACTGTGGTGATGGCTGTGCTGGCGGCCAAGACTGGCTACGAG GCCGACATGGTCGAGGCGGACATGGACCTGGAGGCCGAGCTCGGCATCGACTCGATC AAGCGCGTGGAGATCCTCTCGGAGGTGCAGGGCCAGCTGGGCGTCGAGGCCAAGGAC GTGGATGCGCTGAGCCGCACGCGCACGGTCGGTGAGGTTGTGGATGCCATGAAGGCG I GAAATCGTGGCTGCCTCTGGTGGTAGTGCTCCTGCTGCTGCTGTTCCTTCGGCGCCC GCTGCTTCTGCAGCTCCGACTCCTGCGACTGCGCCTTCTGCTGATCTGCAAGCGCTG CTGTCCAAGGCGGAGACTGTGGTGATGGCTGTGCTGGCGGCCAAGACTGGCTACGAG GCCGACATGGTCGAGGCGGACATGGACCTGGAGGCCGAGCTCGGCATCGACTCGATC AAGCGCGTGGAGATCCTTTCCGAGGTGCAGGGCCAGCTGGGCGTCGAGGCCAAGGAC I GTAGATGCGCTGAGCCGCACGCGCACGGTCGGTGAAGTGGTGGACGCCATGAAGGCG GAGATCGTGGCTGCCTCTGGTGGTAGTGCTCCTGCTGCTCCTTCGGCGCCCGCGCTT CTTCCAACGCTGTTTGGTTCCGAGTGCGAGGACCTGTCTCTGACCTTTCCCGTGATA ACGACCCTGCCGCTTCCTGCAGAGCTTGTGCTGGCCGAGGGCGGCGCTCGCCCTGTA GTCGTGGTGGATGATGGATCTGCACTCACCTCGTCGCTGGTGTCCTCGCTCGGCGAT I CGTGCGGTGCTGCTGCAGGTGCAGTCTTCCTCTGCCTGCTCGCCGCGCTCGACCACG CACAAGTTGGTGACCGTAGCAGACCGCTCTGAAGCGGCGCTACAGGCGGCGCTCACG TCCGTCGAGGCGCAGTTCGGCAAGGTGGGTGGCTTTGTGTTCCAGTTCGGCGACGAC GACGTGCAAGCGCAGCTCGGCTGGGCGCTGCTCGCGGCCAAGCACCTCAAAACTTCG CTGTCAGAACAGATCGAGGGCGGTCGCACCTTTTTCGTGGCCGTCGCGCGGCTCGAC GGCCAGCTGGGGCTCTCCGGCAAGTCGACGACCGCTACCGTTGATCTCTCCCGCGCG CAGCAGGGCAGCGTGTTCGGCCTGTGCAAGACACTCGACCTGGAGTGGCCCGCTGTC TTCTGCCGCGGAATCGACCTGGCCGCCGACCTCGACGCCGCACAGGCCGCGCGGTGC CTGCTGGGCGAGCTGTCAGACCCCGACGTGGCCGTGCGCGAGTCTGGTTACTCCGCC TCGGGCCAGCGCTGCACGACAACTACGAAGTCGCTGACTACGGGCAAGCCGCACCAG CCGATCTCCTCGTCGGACCTCTTTCTGGTGTCGGGCGGCGCGCGCGGCATCACCCCG CTGTGCGTGCGCGAGCTGGCGCAGCGCGTGGGCGGCGGCACGTACGTGCTCATCGGC CGCTCGGAGCTGCCCACGACGGAGCCTGCCTGGGCGGTCGGCGTGGAGTCTGGCAAG CCGCTGGAGAAGGCCGCGCTGGCGTTCCTGAAGGCGGAGTTTGCAGCGGGCCGCGGG GCCAAGCCGACGCCGATGCTGCACAAGAAGCTCGTGGGCGCCGTGGTCGGAGCGCGC GAGGTGCGAGCCTCGCTCGCCGAGATCACTGCACAGGGCGCCACGGCTGTGTACGAG TCGTGCGACGTGAGCTCTGCCGCCAAGGTGCGTGAGATGGTAGAGCGCGTGCAGCAG CAGGGCGGGCGGCGCGTGTCGGGCGTGTTCCACGCGTCGGGCGTGCTGCGCGACAAG CTCGTGGAGAACAAGTCGCTGGCGGACTTCAGCGCCGTGTACGACACCAAGGTGGGC GGCCTCATCAACCTGCTGGCCTGCGTGGACCTGGCGCAGCTGCGTCACCTCGTGCTC TTCAGCTCGCTCGCGGGCTTCCACGGCAACGTCGGGCAGTCGGACTACGCAATGGCC AACGAGGCGCTCAACAAGCTGGCGGCGCACCTGTCGGCGGTGCACCCGCAGCTGTGC GCGCGCTCGATCTGCTTCGGACCGTGGGACGGCGGCATGGTGACCCCCGCGCTCAAG GCCAACTTCATCCGCATGGGCATCCAGATCATCCCGCGCCAAGGCGGCGCGCAGACC GTCGCCAACATGCTCGTCAGTAGCTCCCCCGGTCAGCTGCTCGTGGGCAACTGGGGC GTGCCACCCGTCGTGCCGAGTGCCACCGAGCACACCGTGCTGCAGACGCTCCGCCAG AGCGACAACCCCTTCCTCGACTCGCACGTGATCCAGGGCCGCCGCGTGCTGCCCATG ACCCTGGCCGTGGGCTACATGGCGCACCAGGCGCAGAGCATCTACGCGGGCCACCAG CTGTGGGCCGTCGAGGACGCCCAGCTCTTCAAGGGCATCGCCATCGACAATGGCGCC GACGTGCCCGTGCGCGTGGAGCTGTCGCGCCGCAAGGAGGAGCAGGAGGACGCCGGC AAGGTCAAGGTCAAGGTGCAGGTGCTGCTCAAATCGCAGGTCAACGGCAAGTCGGTG
CCCGCGTACAAGGCGACCGTCGTGCTGTCCCCTGCGCCGCGCCCCAGCGTCATCACG
CGTGACTTCGACCTCACCCCGGACCCGGCCTGCACGGAGCACGACCTCTACGACGGC
AAGACGCTCTTCCACGGCAAGGCCTTCCAGGGCATCGAGCAGGTGCTCTCGGCGACG
CCCAAGCAGCTCACCGCCAAGTGCCGCAATTTGCCCCTCACGCCCGAGCAGCGCGGC
5 CAGTTCGTCGTTAACCTCAGCCAGCAGGACCCGTTCCAGGCGGACATTGCGTTCCAG GCGATGCTCGTCTGGGCGCGCATGCTGCGCCAATCGGCGGCCCTGCCCAACAACTGC GAGCGCTTCGACTTTTACAAGCCGATGGCCCCGGGCGCCACCTACTACACGTCGGTC AAGCTGGCCTCGGCCTCACCCTTGGTGGACTCTGTGTGCAAGTGCACCGTGGCGATG CACGATGAGCAAGGTGAGGTGTACTTTTCTGCTCGT
) GCCAGCGTCGTCCTCAACAAGACCCTCACGTACTAA SEQ ID N0:3 shows the nucleotide sequence of an exemplary plant-optimized PFA1 gene, referred to herein as PFA1 v2:
ATGGATACCAGAATTGCCATTGTGGGAATGAGTGCGATCCTTCCGAGTGGTGAGAAT
GTTAGAGAGAGCTGGGAGGCCATCAGAGATGGCTTGGATTGTCTGTCTGATCTGCCT
> GCGGATCGTGTGGATGTGACTGCCTATTACAATCCAGAGAAAACGACCAAGGACAAA ATCTACTGCAAAAGAGGTGGGTTCATCCCTGAGTATGACTTTGATGCTCGTGAGTTT GGCCTCAACATGTTCCAGATGGAAGATTCTGATGCGAACCAGACCATCTCATTGCTC AAGGTGAAGGAAGCTCTCACCGATGCCAACATACCTGCTTTCTCAAGTGGCAAAAAG AACATTGGTTGTGTTCTTGGCATAGGTGGAGGTCAGAAGGCGTCACATGAGTTCTAC
) TCCAGACTCAACTATGTTGTGGTTGACAAAGTGCTCAGAAAGATGGGTTTGCCAGAG GAAGATGTGGCAGCTGCGGTGGACAAGTACAAGGCGAGCTTCCCAGAGTGGAGGCTT GATTCTTTTCCTGGTTTCTTGGGCAATGTTACCGCTGGCAGATGTTGCAACACCTTC AACATGGAGGGCATGAACTGTGTCGTTGACGCTGCCTGTGCTTCAAGCCTGATTGCG GTCAAGGTGGCAATAGAAGAGCTTCTCTATGGTGACTGTGATGCCATGATTGCTGGT
> GCCACCTGCACAGACAATTCAATAGGGATGTACATGGCCTTCTCCAAGACGCCTGTT TTCTCTACGGACCCGAGTGTCAAAGCGTATGATGCTGCCACCAAAGGCATGTTGATT GGTGAAGGATCTGCGATGCTTGTTCTGAAGAGATATGCGGATGCTGTCAGAGATGGT GACACTGTTCATGCTGTCATCAAGGGCTGTGCTTCCTCAAGTGATGGAAAAGCAGCT GGAATCTACACACCGACAATCAGCGGACAAGAAGAGGCTCTCCGTAGAGCCTATGCA
) CGTGCCAATGTGGACCCAGCCACTGTCACTCTTGTTGAAGGACATGGAACTGGCACT CCGGTTGGGGACAAGATTGAACTCACAGCTCTGAGCAATCTCTTCTCCAAAGCGTTT TCTGCGAATGGAGGTGGAGCTGAGGAAGCTGAGCAAGTTGCTGTTGGCAGCATCAAG AGCCAGATAGGGCACCTCAAAGCGGTTGCTGGATTGGCTGGATTGGTCAAAGTGGTC CTTGCTCTCAAGCACAAGACATTGCCTCAGACGATCAATGTGGACAAGCCACCTTCA
> CTGGTGGATGGGACACCGATTCAACAGTCCCCTTTGTACGTCAACACCATGAACCGT CCCTGGTTCACTCCGGTTGGGGTTCCGAGGAGAGCTGGCGTTTCCTCATTTGGTTTT GGAGGTGCGAACTACCATGCTGTGCTTGAAGAGTTTGAACCTGAACATGAGAGTGCT TACCGTTACAACAATCTTCCCCAAGTTGCTCTCCTTCATGCTGGGGATGTTGCAACT CTTGCTGCCACAGTTAGGGCAAAACTGGCATTGGCCACTGCTGAGCAAGAAGAGGCT
I AGAGTTGTGAAGAACGCTGATTACATTGCATACCATAGGTTCCTTGATGAATGTAAG TTGAGAGGAGCTGTTCCCCAAGCCCACGCAAGGGTTGGACTTCTGGTGAGGGACCTG TCCTCTCTCATTGCGGTTTTGGAAGCAGCTGCAGCCAAACTTGCTGGAGAAGAGTCA GCAACGGAATGGACGGTCTCAGTTGCCACTGGTGAGGCTGCATTCAGAGTTAGGGGT GTTGCCACAGAGGCCAATGTTGCTGCACTTTTCTCTGGCCAAGGAGCGCAGTACACT
i CACATGTTCTCAGATGTTGCCATGAACTGGCCTCCGTTCAGAGAGAGTGTTGCTGCG ATGGACAGAGCGCAGAGAGAACGTTTTGGGAGGCCAGCCAAAAGAGTCTCCAGTGTT CTCTATCC GAGAAAAC C T TAT G GAGAT GAGC CAAG G CAAGAT CACAAAGAGAT T TC T CAGACGCGTTACTCTCAGCCAGCAACCCTCGCTTGCTCTGTCGGTGCCTTTGACATC
TTCAAAGCAGCTGGATTGGCTCCTTCTTTTGCAGCTGGACATTCCCTGGGAGAGTTT GCAGCTCTCTATGCAGCTGGTTCATTGGATCGTGATGCTGTGTTTGACTTGGTTTGC GCTAGGGCAAAGGCCATGTCTGATTTCACTGCTCAAGCCAGCTCCAGTGGAGGTGCT ATGGCAGCGGTCATAGGAGCCAAGGCTGATCAGCTCAGCCTTGGTGGAGCACCTGAT ) GTTTGGCTGGCCAATAGCAACAGTCCATCACAGACGGTGATCACGGGAACTGCTGAA GCAGTGGCAGCTGCATCTGACAAACTTCGTTGTAGTGGAAACTTCAGAGTGGTTCCT CTTGCTTGTGAAGCTGCCTTCCATTCACCACACATGCGTGGAGCAGAGCAGACATTT GCGTCTGCGCTTGCTCAAGCTCCAGTGTCCGCACCTGCAGCTGCCAGATTCTACAGC AACGTCACTGGTGGAGCTGCAGTCACCTCTCCTGCTGATGTCAAAACGAACCTTGGG ) AAACACATGACTTCTCCTGTGCAGTTTGTGCAGCAAGTCCGTGCCATGCACGCAGCT GGAGCAAGGGTGTTTGTTGAGTTCGGTCCCAAGCAAGTCCTTTCTCGTTTGGTCAAA GAGACCCTTGGGGAAGCTGGAGACGTGGTCACGGTGGCTGTCAACCCAGACTCAGCC AAGGATTCAGACACCCAGCTGAGACAAGCAGCTCTCACCTTGGCTGTGGCTGGTGTT CCACTCAAAGACTTTGACAGATGGCAGCTTCCCGATGCCACTCGTCTTGAGCCTGTC i AAGAAAAAGAAAACAACCTTGAGGTTGAGTGCTGCCACCTATGTCTCTGCCAAGACC TTGAGGCAGAGGGAGGCTGTGCTCAATGATGGTTACACTGTGAGTGGTGCCACAGCG GTTGTCAAAGAAGTGGACACTGCAAACGAAGAGAGACTTGTCAGACAAGCACAAGAC CTCCAGCGTCAGCTTGCTGAAGCAAGCACTGCAGCCCAAGCAGCTCAATCCAAGGTC GCTGAATTGGAGAGGACAATCCAAGACTTGGAGAGGAAGGTTCAACAGCAACAGCAA ) GAGAAAGGTGAGAACTCTGACTCCAATGCAGCTGCGGAAGTGCTTAGGAGACACAAG GAACTGCTCCAGAGGATGCTCCAAGATTGTGATGAGCAAGCAGTTCCCGTGGCAACA GTCGTTCCAACACCCACTTCTTCCCCTACACCAACATCCTCACCAGTTAGCGGAAAC AGCAAGTCCACCAGAGGATCAGCCGACCTCCAAGCACTCCTGGCGAAAGCTGAGACG GTCGTGATGGCAGTTTTGGCTGCAAAGACTGGCTACGAGGCAGACATGGTGGAAGCA ! GATATGGATTTGGAGGCTGAGCTTGGGATTGATTCCATCAAAAGGGTGGAGATCCTG AGTGAAGTCCAAGGGCAGCTCGGAGTTGAAGCGAAGGATGTTGATGCCCTTTCACGT ACAAGGACCGTCGGAGAGGTTGTGGATGCCATGAAGGCTGAGATTGTTGCTGCATCT GGTGGGTCAGCACCTGCTGTCCCCTCTGCACCAGCTGCATCAGCGGCTCCGACACCT GCTGCGAGTACCGCTCCGAGTGCTGATCTTCAGGCTCTCCTGTCTAAAGCCGAGACG I GTTGTGATGGCTGTGCTCGCAGCGAAAACTGGTTACGAGGCTGACATGGTGGAAGCT GACATGGACCTTGAAGCGGAGTTGGGAATAGATAGCATCAAACGTGTTGAAATCTTG TCTGAGGTCCAAGGACAGTTGGGTGTGGAAGCCAAAGATGTCGATGCGCTTTCAAGA ACCAGAACCGTCGGTGAGGTCGTGGACGCCATGAAGGCTGAGATTGTGGCTGCCTCT GCTGGCTCCGCTCCTGCTCCAGCAGTTCCTTCTGCACCTGCAGCGTCAGCGGCTCCA ; ACTCCAGCTGCATCCACGGCTCCTTCTGCAGACCTCCAAGCCTTGCTGTCCAAAGCC GAAACAGTTGTGATGGCTGTCCTTGCTGCAAAGACTGGTTACGAAGCCGACATGGTT GAAGCTGACATGGATTTGGAAGCCGAACTTGGAATAGATTCCATCAAAAGAGTGGAG ATACTCTCTGAGGTGCAAGGTCAGCTCGGAGTTGAAGCGAAAGACGTTGATGCCCTC AGTAGGACCAGAACTGTTGGGGAAGTTGTCGATGCGATGAAGGCTGAGATTGTCGCT
i gccagcggtggatctgcacctgcacctGcggtcccgtcagctccagcagccagcgca GCTCCGACTCCTGCAGCTGCCACAGCACCGAGTGCGGATCTGCAGGCATTGCTTGCG AAGGCTGAAACAGTTGTCATGGCTGTCCTGGCTGCGAAAACTGGCTATGAGGCTGAT ATGGTGGAAGCCGACATGGACCTTGAGGCTGAATTGGGCATTGACAGCATCAAGCGT GTTGAGATTCTCAGTGAAGTCCAAGGACAGCTCGGAGTGGAGGCGAAGGATGTGGAT GCCCTCTCAAGGACCAGAACAGTTGGTGAGGTCGTTGATGCGATGAAGGCAGAGATT GTTGCTGCCAGTGCTGGTTCTGCTCCCGCACCCGCTGTCCCAAGCGCACCAGCTGCC TCCGCCGCTCCCACACCAGCTGCCTCTACTGCACCAAGTGCGGACCTTCAAGCTCTC CTGAGCAAGGCTGAGACAGTTGTGATGGCAGTCCTTGCTGCGAAAACTGGCTATGAG GCAGACATGGTGGAAGCGGACATGGATCTGGAAGCTGAACTTGGAATTGACTCCATC AAACGTGTTGAAATCCTCTCTGAGGTTCAAGGTCAGCTTGGGGTGGAGGCCAAAGAT
GTTGATGCTCTTTCCAGAACAAGGACGGTGGGAGAGGTGGTTGATGCCATGAAGGCT GAGATAGTGGCAGCGTCAGGAGGGTCAGCACCTGCAGCTGCCGTTCCGTCCGCACCA GCAGCCTCTGCAGCTCCCACGCCAGCCACCGCTCCTAGTGCTGATTTGCAAGCCCTC CTTTCAAAAGCTGAAACTGTTGTCATGGCTGTTTTGGCTGCCAAGACTGGCTACGAG 5 GCTGACATGGTTGAGGCTGACATGGACTTGGAAGCCGAGCTTGGGATTGATAGCATC AAGCGTGTGGAAATCCTTTCTGAGGTTCAAGGTCAGCTGGGTGTTGAGGCCAAAGAT GTCGATGCGTTGTCAAGGACCAGAACGGTTGGAGAAGTGGTCGATGCCATGAAGGCT GAGATAGTTGCTGCCTCTGGAGGTTCAGCTCCTGCAGCTCCGTCAGCACCTGCCCTC CTTCCAACTTTGTTTGGTTCTGAGTGTGAAGATTTGAGCTTGACTTTCCCAGTCATC [) ACAACCCTGCCTCTTCCTGCTGAACTTGTGCTGGCTGAAGGTGGAGCACGTCCTGTG GTTGTGGTTGACGATGGCTCTGCACTCACCAGTTCTCTTGTGTCCTCACTTGGTGAT CGTGCTGTGCTCTTGCAAGTTCAGTCCAGCTCTGCCTGTTCACCCAGAAGCACCACG CACAAGTTGGTCACTGTTGCAGACCGTTCTGAAGCTGCATTGCAAGCTGCGCTCACA TCAGTTGAAGCACAGTTTGGAAAAGTGGGAGGTTTTGTGTTCCAGTTTGGTGATGAC 5 GATGTCCAAGCGCAGCTTGGTTGGGCACTGCTTGCTGCCAAACATCTCAAAACGTCC TTGTCAGAACAGATAGAAGGTGGGAGGACCTTCTTTGTTGCCGTTGCGAGGTTGGAT GGTCAGTTGGGGTTGTCTGGAAAGTCCACGACTGCCACTGTTGATCTCTCCAGAGCG CAGCAAGGCTCAGTCTTTGGACTCTGCAAAACCCTTGACTTGGAATGGCCTGCTGTT TTCTGCAGAGGAATCGACCTTGCAGCTGACTTGGATGCTGCACAAGCTGCCAGATGT ) CTTTTGGGTGAGCTTTCAGACCCAGATGTGGCAGTGAGGGAGTCTGGTTACTCCGCA TCTGGGCAAAGATGCACCACAACCACAAAGTCTCTCACCACGGGAAAACCACATCAA CCGATCTCTTCCAGTGATTTGTTCCTGGTCTCTGGAGGTGCTCGTGGAATCACACCT CTTTGTGTGAGAGAATTGGCACAGAGGGTGGGAGGTGGAACCTATGTCCTCATTGGG AGAAGTGAGCTGCCCACCACGGAACCTGCCTGGGCTGTTGGTGTTGAGTCAGGGAAA 5 CCTCTTGAGAAGGCTGCGCTGGCGTTCCTCAAAGCTGAGTTTGCAGCTGGAAGGGGA GCGAAGCCGACACCGATGCTCCACAAGAAACTTGTTGGAGCTGTTGTGGGAGCTAGA GAGGTCCGTGCGAGCCTGGCAGAGATAACTGCTCAAGGTGCCACAGCTGTCTATGAG TCCTGTGATGTCAGCTCTGCAGCCAAGGTTCGTGAAATGGTTGAGAGGGTTCAACAG CAAGGAGGGAGAAGGGTCAGCGGTGTGTTTCATGCAAGTGGTGTTTTGAGAGACAAG ) TTGGTTGAGAACAAGTCACTGGCTGATTTCAGTGCTGTGTATGACACAAAGGTTGGT GGACTCATCAACCTCCTTGCCTGTGTGGATCTTGCACAGCTTAGGCACCTGGTGCTC TTCAGCTCCCTTGCTGGGTTCCACGGCAATGTTGGTCAGAGTGACTATGCAATGGCC AATGAGGCTCTCAACAAGCTGGCTGCACATCTGTCTGCTGTGCATCCCCAACTTTGT GCGAGATCCATTTGCTTTGGTCCGTGGGATGGAGGGATGGTGACGCCTGCACTCAAG
> GCCAACTTCATCAGAATGGGCATTCAGATTATCCCTCGTCAAGGTGGAGCACAGACA GTTGCGAACATGCTTGTCAGCTCCAGCCCTGGTCAGCTCCTTGTTGGGAACTGGGGA GTGCCACCTGTGGTTCCAAGTGCCACTGAGCACACTGTTTTGCAGACTCTTCGTCAG AGCGACAACCCCTTCTTGGATTCACATGTCATTCAAGGGAGAAGGGTTTTGCCGATG ACACTGGCTGTCGGCTACATGGCTCACCAAGCTCAGAGCATCTACGCTGGACATCAG
) CTTTGGGCAGTTGAGGATGCCCAGCTTTTCAAAGGCATAGCCATTGACAATGGAGCT GATGTTCCGGTTAGGGTTGAGTTGTCAAGGAGAAAGGAGGAACAAGAGGATGCTGGC AAGGTCAAGGTCAAGGTTCAAGTGCTTCTCAAATCTCAAGTCAATGGCAAGTCAGTC CCTGCTTACAAGGCGACTGTCGTGCTTTCCCCTGCTCCACGTCCCAGTGTCATCACC CGTGACTTTGATCTCACTCCTGACCCAGCCTGCACCGAACATGACCTCTATGATGGC
> AAGACGCTCTTCCACGGCAAAGCCTTCCAAGGAATAGAACAAGTTCTTTCTGCGACG CCAAAACAGCTCACTGCCAAATGCAGAAACCTTCCACTCACACCGGAGCAGCGTGGC CAGTTTGTGGTCAATCTCAGCCAGCAAGACCCATTCCAAGCTGACATTGCTTTCCAA GCCATGCTTGTTTGGGCTAGGATGTTGAGACAGTCTGCTGCGCTGCCCAATAACTGT GAAAGGTTTGATTTCTACAAACCGATGGCTCCTGGAGCAACTTACTATACCAGTGTC
AAACTGGCTTCAGCTTCACCATTGGTGGATTCTGTGTGCAAATGCACTGTTGCCATG
CACGATGAGCAAGGTGAAGTGTACTTCTCTGCGAGA
GCCAGTGTTGTCCTCAACAAGACACTCACATACTGA SEQ ID N0:4 shows the amino acid sequence of an exemplary PFA2 protein:
i MPCDNIAWGMAVQYAGCKNQDEFWDTLMRKEINSSPISAERLGTRYRDLHFHPQRS KYADTFCNDRYGCVDASVDNEHDLLADLARRALLDAGINLDDASTTANLRDFGIVSG CLSFPMDNLQGELLNLYQVHVENRVGAQRFRDSRPWSERPRAVSPEASDPRVYSDPA SFVANQLGLGPVRYSLDAACASALYCLKLASDHLLSRSADVMLCGATCFPDPFFILS GFSTFQAMPLGGPDDNPLSVPLRQGSQGLTPGEGGAIMVLKRLEDAVRDGDRIYGTL I LGTSLSNAGCGLPLSPHLPSEKSCMEDLYTSVGIDPSEVQYVECHATGTPQGDVVEV EALRHCFRGNTDHPPRMGSTKGNFGHTLVAAGFAGMAKVLLSMQHGTIPPTPGVDRS NCIDPLVVDEAIPWPYSSAQARAGKPGDELKCASLSAFGFGGTNAHCVFREHRQIAA TATASPVLPEVTPGPIAIIGMDATFGTLKGLDAFEQAIYKGTDGASDLPSKRWRFLG ADTDFLTAMGLDAVPRGCYVRDVDVDYKRLRSPMIPEDVLRPQQLLAVATMDRALQD i AGMATGGKVAVLVGLGTDTELYRHRARVTLKERLDPAAFSPEQVQEMMDYINDCGTS TSYTSYIGNLVATRVSSQWGFTGPSFTVTEGANSVYRCLELGKFLLDTHQVDAVWA GVDLCATAENLYLKARRSAISRQDHPRANFEASADGYFAGEGSGALVLKRQADVGSD DKVYASVAGLTCAAQPAEAVSPLLLQVHNDDNEKRVVEMVELAADSGRHAPHLANSP LSAESQLEQVSKLLAHQVPGSVAIGSVRANVGDVGYASGAASLIKTALCLHNRYLPA I NPQWERPVAPVSEALFTCPRSRAWLKNPGESRLAAVASASESGSCFGVLLTDEYATH ESSNRLSLDDAAPKLIAIRGDTVDDIMAKVNAELALLRAHAETGSATDDDPAAAVAF TAHRLRFLRLVGETVASHGATATLCLALLTTPEKLEKELELAAKGVPRSAKAGRNWM SPSGSAFAPTPVTSDRVAFMYGEGRSPYYGVGLDLHRLWPALHERINDKTAALWENG DSWLM PRAVDADSQRAVQTAFDADQIEMFRTGIFVSICLTDYARDVLGVQPKAC FGL SLGEISMLFALS RRNCGLS DQLTQRLRTS PVWSTQLAVE FQALRKLWNVPADAPVE S FWQGYLVRASRAEIEKAIGPDNRFVRLLIVNDSSSALIAGKPAECLRVLERLGGRLP PMPVKQGMIGHCPEVAPYTPGIAHIHEILEIPDSPVKMYTSVTNAELRGGSNSSITE FVQKLYTRIADFPGIVDKVSRDGHDVFVEVGPNNMRSAAVSDILGKAATPHVSVALD RPSESAWTQTLKSLALLTAHRVPLHNPTLFADLYHPTFLTAIDSAMQEPPPKPNRFL I RSVEVNGYFCPDGIS KQ VAAAS AK P S THCM VRLH P AKAWV AAAGAV VADS T P WKA KQT S S SL L VG DDAFLRC YDVDW PL YMGAM AEG IS S VDLWAAAE ARMLAS FGAARL P MDQVELQIREIQQRTSNAFAVNLMPGPDEAATVDALLRTGVSIVEASGYTGALSADL VRYRVTGLRRTSCGASVSATHRWAKVSRTEVAEHFLRPAPAAVLEALVAAKQITPE QAALASRVAMADDVAVEADSGGHTDNRPIHVLLPLVVAQRNRWRHLVDTPVRVGAGG GIACPRAALLAFSLGAAFVVTGSVNQLAREAGTSDAVRLLLATATYSDVAMAPGGVQ VLKKQTMFAARATMLAQLQAKFGSFDAVPEPQLRKLERSVFKQSVADVWAAAREKFG VDATAASPQERMALCVRWYMSQSSRWATEATSARKADYQIWCGPAIGSFNDFVRGTK L DATAGT GE F PRVVDINQHILLGASHYRRVQQQ QQDDDVEYIIV SEQ ID NO:5 shows the nucleotide sequence of an exemplary PFA2 gene, referred to herein as PFA2 vl, isolated from Thraustochytrid Schizochytrium sp. (as represented by ATCC Accession No. PTA-9695):
ATGCCGTGCGATAACATTGCGGTCGTGGGCATGGCGGTGCAGTATGCCGGATGCAAG
AACCAGGACGAGTTCTGGGATACGCTGATGCGTAAGGAGATCAACTCGAGCCCGATC
TCGGCGGAGCGCCTCGGTACGCGCTACCGCGACCTCCACTTCCACCCGCAGCGCAGC
AAGTACGCCGACACCTTCTGCAACGATCGCTACGGCTGCGTCGATGCCAGCGTCGAC
AACGAGCACGACCTCCTCGCCGACCTGGCCCGGCGCGCCCTGCTCGACGCCGGAATT AACCTCGACGACGCCAGCACCACCGCCAACCTACGCGACTTCGGCATCGTGAGCGGC TGCCTGTCGTTCCCCATGGACAATCTGCAGGGCGAGCTGCTCAATCTGTACCAAGTG CATGTGGAGAACCGCGTGGGCGCCCAGCGCTTCCGCGACTCGCGCCCCTGGTCGGAG 5 CGCCCGCGCGCTGTCTCGCCCGAGGCCAGCGACCCGCGCGTGTACTCCGACCCGGCG TCCTTCGTGGCCAACCAGCTCGGCCTGGGGCCCGTGCGCTACAGCCTCGATGCAGCC TGCGCGTCGGCGCTGTACTGCCTCAAGCTGGCGTCCGACCACTTGCTCTCGCGCAGC GCGGACGTGATGCTGTGCGGCGCCACATGCTTTCCGGACCCGTTCTTCATTCTCTCG GGGTTCTCCACCTTCCAGGCGATGCCGCTGGGCGGACCGGACGATAACCCACTGTCC ) GTGCCGCTGCGGCAGGGCAGCCAGGGCCTGACGCCCGGAGAGGGCGGCGCCATCATG GTGCTGAAGCGCCTCGAGGACGCCGTGCGCGACGGCGACCGCATCTACGGCACCTTG CTCGGCACGAGTCTGAGCAACGCCGGGTGCGGCCTGCCGCTGAGCCCGCACCTGCCG AGCGAGAAGTCGTGCATGGAGGACCTGTACACGAGCGTCGGCATCGACCCAAGCGAG GTGCAGTACGTGGAGTGCCACGCCACGGGCACTCCGCAGGGCGACGTCGTGGAGGTA 5 GAGGCGCTGCGCCACTGCTTTCGAGGTAACACGGACCACCCGCCGCGCATGGGCTCC ACCAAGGGCAACTTTGGCCACACTCTCGTGGCGGCCGGGTTCGCAGGCATGGCCAAG GTGCTGCTGTCGATGCAGCACGGCACGATCCCGCCCACGCCCGGTGTCGACCGCTCC AACTGCATCGACCCGCTCGTCGTGGACGAGGCCATCCCTTGGCCGTACTCGTCGGCG CAGGCGCGGGCAGGCAAACCAGGCGATGAGCTCAAGTGCGCCTCGCTCTCCGCCTTT ) GGCTTTGGTGGAACCAACGCGCACTGTGTCTTCCGTGAGCACCGCCAAATTGCTGCT ACTGCGACAGCCTCGCCGGTGCTTCCCGAGGTGACTCCTGGACCGATTGCCATCATC GGGATGGACGCGACGTTTGGTACCCTCAAGGGCCTGGACGCGTTTGAGCAGGCCATC TACAAGGGCACGGACGGCGCCAGCGACCTGCCGAGCAAGCGCTGGCGGTTCCTGGGC GCCGACACGGACTTCTTGACCGCCATGGGCCTCGACGCCGTGCCGCGCGGGTGCTAC > GTGCGCGACGTGGACGTGGACTACAAGCGGCTGCGGTCGCCGATGATCCCTGAGGAC GTCCTGCGCCCGCAACAGCTGCTGGCGGTGGCTACGATGGACCGCGCGCTGCAGGAC GCTGGAATGGCGACGGGAGGCAAGGTGGCGGTGCTGGTGGGGCTCGGCACGGACACC GAGCTGTACCGGCACCGCGCGCGCGTGACACTCAAGGAGCGGCTCGACCCGGCCGCG TTCTCGCCCGAGCAGGTGCAGGAGATGATGGACTACATCAACGACTGCGGCACCTCG ) ACGTCGTACACGTCGTACATCGGCAACCTCGTGGCCACGCGCGTGTCCTCGCAGTGG GGCTTTACGGGCCCGTC.CTTCACCGTCACCGAAGGCGCAAACTCGGTCTACCGCTGC CTCGAGCTGGGCAAGTTCCTGCTCGACACGCACCAGGTGGACGCCGTCGTGGTGGCC GGCGTCGACCTCTGTGCCACCGCCGAGAACCTTTACCTCAAGGCGCGCCGCTCCGCC ATCAGCCGACAGGACCACCCTCGCGCCAACTTTGAGGCCAGCGCCGACGGGTACTTT ί GCCGGCGAGGGCAGCGGCGCCCTGGTCCTCAAGCGCCAGGCCGACGTTGGCTCAGAC GACAAGGTCTACGCCAGTGTCGCGGGCCTCACGTGCGCCGCGCAGCCCGCTGAAGCC GTGTCGCCGCTACTACTCCAAGTCCACAACGACGACAACGAGAAGAGGGTGGTGGAG ATGGTGGAGCTCGCCGCCGACTCGGGTCGCCATGCGCCGCACTTGGCCAACTCGCCG CTGAGCGCCGAGTCGCAGCTGGAGCAAGTGTCCAAGTTGCTCGCGCACCAGGTGCCG I GGCTCGGTGGCCATCGGCAGCGTGCGCGCCAACGTGGGAGACGTCGGGTACGCCTCG GGCGCCGCGAGCCTCATCAAGACGGCGCTGTGCCTCCACAACCGCTACCTCCCGGCC AACCCGCAGTGGGAGCGGCCGGTGGCGCCGGTCTCCGAGGCGCTGTTTACTTGCCCG CGCTCGCGTGCCTGGCTGAAGAACCCGGGCGAGTCGCGACTGGCGGCTGTCGCCAGT GCCTCCGAGAGCGGGTCCTGCTTTGGCGTGCTCCTCACAGACGAGTACGCCACTCAT i GAGAGCAGCAACCGCCTCTCGCTGGATGACGCCGCCCCCAAGCTCATCGCGATCCGT GGCGACACCGTTGACGATATCATGGCCAAGGTCAACGCCGAGCTGGCGCTCCTCCGA GCGCACGCCGAAACCGGGTCTGCTACTGACGACGACCCAGCTGCTGCTGTCGCTTTC ACTGCTCATCGCTTGCGCTTTTTGCGGCTCGTAGGGGAGACGGTGGCTAGTCACGGT GCCACGGCGACCTTGTGTTTGGCCCTGCTGACAACGCCGGAGAAGCTGGAGAAGGAG I TTGGAGCTGGCAGCCAAGGGTGTACCGCGAAGCGCCAAGGCCGGGCGCAACTGGATG
TCGCCATCGGGCAGCGCCTTTGCGCCGACACCTGTGACCAGCGACCGCGTCGCGTTC ATGTACGGCGAGGGCCGCAGCCCCTACTACGGCGTCGGGCTCGACCTGCACCGCCTG TGGCCGGCTTTGCACGAGCGCATCAACGACAAGACCGCGGCGCTGTGGGAGAACGGC GACTCGTGGCTCATGCCGCGCGCGGTGGATGCCGACTCGCAGCGCGCCGTGCAGACG i GCCTTTGACGCGGACCAGATCGAGATGTTCCGCACGGGCATCTTCGTGTCCATCTGC CTCACCGACTACGCGCGCGACGTGCTCGGGGTGCAGCCCAAGGCGTGCTTCGGCCTC AGCCTCGGCGAGATCTCCATGCTCTTTGCGCTGTCGCGACGCAACTGCGGCCTGTCG GACCAGCTCACGCAGCGCCTACGCACCTCGCCGGTGTGGTCGACACAGCTGGCGGTG GAGTTCCAGGCCTTGCGCAAGCTATGGAACGTGCCGGCGGACGCCCCCGTGGAGTCC I TTCTGGCAGGGCTACTTGGTTCGCGCCAGCCGCGCCGAAATCGAGAAGGCGATCGGG CCCGACAACCGCTTCGTGCGCCTGCTGATCGTCAACGACTCGAGCAGCGCGCTGATC GCCGGCAAACCTGCCGAGTGTCTGCGCGTGCTGGAGCGCCTGGGCGGGCGGTTGCCG CCGATGCCCGTCAAGCAAGGCATGATTGGGCACTGCCCCGAAGTGGCGCCCTACACG CCGGGCATCGCGCACATCCACGAGATTTTGGAGATTCCGGACAGCCCCGTCAAGATG i TACACCTCGGTCACCAACGCCGAGCTGCGCGGGGGCAGCAACAGCAGCATCACCGAG TTCGTGCAGAAGTTGTACACGCGCATCGCCGACTTTCCGGGCATCGTCGACAAGGTC AGCCGTGACGGCCACGATGTCTTCGTCGAGGTGGGGCCGAACAACATGCGCTCCGCC GCGGTCAGTGACATTCTTGGCAAGGCTGCCACCCCGCATGTCTCCGTGGCGCTGGAC CGCCCCAGTGAGTCGGCGTGGACGCAGACCCTCAAGTCGCTGGCGCTGCTGACCGCC I CACCGCGTGCCCCTGCACAACCCGACTCTGTTTGCGGACCTGTACCACCCCACGTTC CTGACGGCTATCGACTCTGCGATGCAGGAGCCCCCGCCCAAGCCCAACCGCTTCCTT CGCAGCGTAGAGGTCAACGGGTACTTTTGCCCCGACGGCATCAGCAAGCAGGTTGCT GCTGCAAGTGCCAAACCCTCGACGCATTGCATGGTTCGTTTGCACCCAGCCAAGGCA GTTGTGGTTGCTGCTGCTGGTGCTGTGGTTGCTGATTCGACGCCCGTGGTCAAGGCC AAGCAGACGTCGTCGTCGTTGTTGGTTGGGGATGACGCCTTTCTGCGCTGCTACGAC GTGGACTGGCCGCTCTACATGGGCGCCATGGCGGAAGGCATCTCGTCGGTAGACCTG GTGGTCGCTGCCGCCGAGGCCCGCATGCTGGCATCATTCGGAGCGGCCCGCTTGCCT ATGGACCAGGTGGAACTCCAGATCCGTGAGATCCAGCAACGCACCTCCAACGCCTTT GCTGTCAACCTGATGCCGGGTCCTGACGAGGCCGCGACGGTGGACGCGCTGCTGCGC ' ACGGGCGTCTCAATCGTCGAGGCATCGGGCTACACCGGCGCGCTCTCTGCAGACCTG GTGCGCTACCGTGTCACGGGTCTGCGACGAACTAGTTGCGGTGCTTCTGTGTCGGCG ACTCACCGTGTGGTCGCCAAGGTGTCGCGCACCGAGGTGGCCGAGCACTTTCTGCGC CCGGCGCCGGCCGCCGTACTAGAGGCTTTGGTCGCCGCCAAACAGATTACGCCCGAG CAGGCCGCGCTGGCCAGCCGCGTCGCCATGGCCGACGACGTCGCGGTGGAGGCCGAC TCGGGCGGGCACACCGACAACCGACCGATCCACGTGCTGCTGCCGCTCGTGGTGGCG CAGCGCAACCGCTGGCGCCACCTGGTGGACACGCCAGTGCGCGTCGGCGCCGGCGGC GGGATCGCCTGTCCGCGCGCCGCGCTGCTCGCCTTTTCCCTGGGCGCCGCCTTTGTG GTCACCGGGTCCGTCAACCAACTGGCCCGCGAGGCTGGCACCAGCGACGCGGTCCGA CTACTGCTGGCGACGGCCACCTACTCGGACGTGGCCATGGCGCCGGGCGGCGTCCAG GTGCTCAAGAAGCAGACCATGTTCGCCGCGCGGGCCACGATGCTCGCCCAGCTGCAG GCCAAGTTCGGCTCCTTTGACGCCGTGCCGGAGCCGCAGCTGCGCAAGCTCGAGCGC TCCGTGTTCAAGCAGTCCGTGGCGGACGTGTGGGCTGCTGCACGCGAAAAGTTTGGT GTCGACGCTACCGCTGCAAGTCCGCAGGAGAGGATGGCGCTCTGTGTGCGCTGGTAC . ATGTCGCAGTCGTCGCGATGGGCTACCGAGGCGACGTCCGCGCGCAAGGCGGACTAC CAGATCTGGTGCGGCCCCGCCATCGGCAGCTTCAACGACTTCGTTCGCGGCACCAAG CTGGACGCGACCGCTGGCACCGGCGAGTTTCCGCGCGTCGTGGACATCAACCAGCAC ATCCTCCTCGGAGCCTCGCACTACCGCCGCGTGCAGCAACAA CAACAGGACGACGACGTAGAATACATCATCGTATAA SEQ ID NO:6 shows the nucleotide sequence of an exemplary plant-optimized PFA2 gene, referred to herein as PFA2 v2:
ATGCCGTGTGACAACATTGCTGTGGTTGGAATGGCAGTTCAGTATGCTGGATGCAAG AACCAGGACGAGTTCTGGGACACACTGATGAGGAAGGAGATCAACAGCTCACCGATC ) TCAGCGGAGAGGCTTGGGACAAGATACAGAGACCTCCACTTCCATCCTCAGAGGAGC AAGTATGCAGACACCTTCTGCAATGACAGATATGGTTGTGTTGATGCTTCTGTTGAC AATGAGCATGACTTGCTTGCTGACCTTGCCAGACGTGCTTTGCTTGATGCTGGGATC AACTTGGATGACGCCAGCACCACTGCCAACCTTCGTGACTTTGGGATTGTGAGTGGA TGCCTCTCCTTCCCGATGGACAATCTGCAAGGTGAGCTTTTGAATCTCTATCAAGTC ) CACGTTGAGAACCGTGTGGGTGCCCAGAGGTTCAGAGATTCAAGACCCTGGTCAGAA AGACCAAGAGCTGTGTCCCCTGAAGCCAGTGACCCGAGGGTCTACAGCGACCCTGCT TCCTTTGTGGCCAACCAGCTTGGTCTTGGTCCTGTCAGATACAGCCTTGATGCAGCT TGTGCGAGTGCGCTGTACTGCCTCAAGTTGGCTTCTGATCACTTGCTCTCCCGTTCT GCAGATGTCATGCTGTGTGGTGCCACATGCTTCCCAGACCCGTTTTTCATTCTCTCT > GGGTTCTCCACATTCCAAGCGATGCCATTGGGTGGACCAGATGACAACCCACTCTCT GTGCCACTCCGTCAAGGCAGCCAAGGACTCACACCTGGAGAAGGTGGAGCCATCATG GTTCTGAAGCGTTTGGAAGATGCTGTGAGGGATGGTGATAGGATCTATGGCACCTTG CTTGGGACAAGTCTCAGCAATGCTGGTTGTGGTTTGCCACTTTCACCTCACCTGCCG TCTGAGAAAAGCTGCATGGAGGATTTGTACACGTCAGTTGGCATAGATCCATCTGAG ) GTTCAGTATGTCGAGTGTCATGCCACCGGAACTCCGCAAGGAGATGTGGTTGAAGTT GAGGCTCTGAGACATTGCTTCAGAGGCAACACTGACCACCCACCGAGGATGGGTTCC ACCAAAGGAAACTTTGGTCACACCTTGGTTGCAGCTGGGTTTGCTGGAATGGCCAAA GTGTTGCTTTCCATGCAGCATGGCACGATCCCACCCACGCCTGGTGTTGATAGGAGC AACTGCATAGATCCGCTGGTCGTTGATGAGGCCATACCCTGGCCTTACAGCTCAGCT i CAAGCGAGAGCTGGCAAACCTGGAGATGAATTGAAGTGTGCTTCCCTCTCAGCCTTT GGATTTGGTGGAACAAATGCTCATTGTGTGTTCAGAGAACACAGACAGATTGCTGCC ACTGCGACAGCGTCTCCGGTCCTTCCTGAAGTCACCCCTGGACCCATTGCAATCATT GGGATGGATGCGACGTTTGGCACCCTCAAAGGACTTGATGCGTTTGAACAAGCGATC TACAAAGGCACGGATGGAGCATCTGATCTGCCATCCAAGAGATGGAGGTTCCTTGGT I GCTGACACAGATTTCTTGACTGCAATGGGTCTGGATGCAGTCCCGAGAGGGTGCTAT GTGAGGGATGTTGATGTGGACTACAAAAGACTCAGAAGTCCCATGATCCCTGAAGAT GTCCTCAGACCCCAACAGCTTCTGGCAGTTGCCACGATGGATAGGGCACTTCAAGAT GCTGGCATGGCCACGGGTGGAAAAGTTGCTGTCCTGGTGGGGTTGGGCACTGACACT GAGCTTTACAGACACCGTGCAAGGGTGACACTCAAGGAAAGGCTTGACCCAGCAGCT i TTCTCCCCTGAACAAGTTCAAGAAATGATGGATTACATCAATGATTGTGGAACCTCA ACCAGCTACACTTCTTACATTGGGAATCTTGTGGCCACCAGAGTTTCCTCACAGTGG GGATTCACTGGTCCTTCTTTCACGGTCACTGAAGGTGCAAACTCAGTCTATCGTTGC CTTGAGCTGGGAAAGTTCCTTTTGGACACCCACCAAGTGGATGCAGTTGTGGTTGCT GGAGTTGATCTCTGTGCAACTGCTGAGAACCTTTACCTCAAGGCAAGAAGGTCTGCC I ATAAGCAGACAAGACCATCCACGTGCCAACTTTGAGGCTTCTGCTGATGGATACTTT GCTGGAGAGGGCAGTGGTGCTCTGGTCTTGAAGAGGCAAGCTGATGTTGGCTCAGAT GACAAGGTCTATGCCAGTGTTGCTGGCCTCACATGTGCAGCGCAGCCTGCTGAAGCA GTTTCTCCTCTTCTCCTTCAAGTTCACAATGATGACAATGAGAAAAGGGTTGTGGAG ATGGTGGAACTCGCAGCTGACTCTGGTCGTCATGCTCCCCACTTGGCCAACTCTCCT TTGAGTGCTGAATCACAGCTTGAGCAAGTGTCTAAACTCTTGGCTCATCAAGTCCCT GGTTCAGTCGCGATTGGAAGTGTTCGTGCCAATGTTGGAGATGTTGGATATGCGAGT GGTGCAGCTTCTCTCATAAAGACTGCGCTTTGCCTCCACAACCGTTACTTGCCTGCA AACCCACAGTGGGAAAGACCTGTGGCTCCAGTCTCAGAGGCTCTTTTCACCTGTCCA
AGGTCCCGTGCTTGGCTCAAGAACCCTGGTGAGTCCAGACTTGCTGCAGTGGCCAGT GCTTCTGAGAGTGGGTCTTGCTTTGGAGTGCTTCTCACAGATGAGTATGCCACACAT GAGTCCAGCAACAGATTGTCATTGGATGACGCTGCACCCAAACTCATAGCGATTCGT GGAGACACTGTTGATGACATCATGGCAAAAGTCAATGCTGAACTTGCGTTGCTCCGT 5 GCTCATGCAGAAACTGGGTCTGCCACTGACGATGACCCAGCTGCAGCTGTTGCTTTC ACTGCTCATCGTTTGAGGTTCTTGAGGCTTGTTGGTGAAACAGTTGCCAGTCACGGT GCCACAGCGACCTTGTGTTTGGCTCTGCTCACAACTCCAGAAAAGCTGGAGAAAGAA TTGGAGTTGGCAGCCAAGGGTGTTCCAAGATCAGCCAAGGCTGGCAGAAACTGGATG TCACCATCTGGTTCTGCTTTTGCACCAACACCTGTCACCAGTGATCGTGTTGCGTTC ) ATGTATGGTGAAGGGAGGTCTCCCTACTATGGTGTTGGGTTGGACCTTCACAGACTC TGGCCTGCTTTGCATGAGAGGATCAATGACAAGACAGCTGCACTTTGGGAGAATGGA GACTCCTGGCTCATGCCCAGAGCGGTTGATGCTGACTCTCAGAGGGCTGTCCAGACG GCTTTTGATGCTGACCAGATAGAGATGTTTAGGACGGGAATCTTTGTTTCCATTTGC CTCACAGACTATGCTCGTGATGTCCTTGGAGTCCAACCCAAGGCTTGCTTTGGACTC ) TCCCTTGGAGAAATCTCCATGCTCTTTGCACTTTCAAGGAGAAACTGTGGACTTTCT GACCAGCTCACTCAGAGGCTCAGAACCTCTCCGGTCTGGAGCACACAGCTTGCTGTG GAGTTCCAAGCCTTGAGGAAACTTTGGAATGTCCCTGCTGATGCTCCAGTTGAGTCC TTCTGGCAAGGCTACTTGGTTCGTGCCAGCAGAGCAGAGATTGAAAAGGCCATTGGA CCGGACAACAGATTTGTTCGTTTGCTCATTGTCAACGACTCCAGCAGTGCCCTCATT ) GCTGGCAAACCTGCTGAGTGTCTGAGGGTGCTTGAGCGTCTTGGAGGTCGTTTGCCA CCCATGCCAGTCAAGCAAGGCATGATTGGGCACTGCCCAGAAGTGGCTCCCTATACT CCTGGAATAGCTCACATCCACGAAATCTTGGAGATTCCTGACAGCCCTGTCAAGATG TATACCTCAGTCACCAATGCTGAGCTGAGAGGAGGCAGCAACTCTTCCATCACAGAG TTCGTTCAGAAGTTGTACACCAGAATAGCGGATTTCCCTGGCATTGTTGACAAGGTC i AGCCGTGATGGCCATGATGTTTTCGTGGAAGTTGGTCCGAATAACATGAGGTCAGCA GCTGTCAGTGACATTCTTGGGAAGGCTGCAACTCCTCATGTCAGTGTGGCTCTTGAT CGTCCAAGTGAGTCAGCTTGGACACAGACACTCAAATCTCTTGCCCTGCTCACTGCC CACAGAGTGCCTCTTCACAACCCGACTCTCTTTGCGGATCTTTACCACCCAACCTTC CTCACAGCCATAGATTCTGCAATGCAAGAACCACCTCCCAAGCCCAACAGATTCCTG I AGGTCTGTTGAAGTCAATGGTTACTTCTGCCCTGATGGCATAAGCAAACAAGTTGCA GCTGCAAGTGCCAAACCCAGCACACATTGCATGGTTCGTCTCCATCCAGCCAAAGCT GTTGTGGTTGCAGCTGCCGGAGCTGTGGTTGCTGATTCAACACCGGTTGTCAAAGCC AAGCAGACTTCCTCATCTTTGCTTGTTGGAGACGATGCCTTCCTCAGATGCTATGAT GTGGATTGGCCTCTCTACATGGGAGCGATGGCTGAAGGAATCTCCTCTGTTGACCTT ' GTGGTTGCAGCTGCAGAAGCTAGGATGCTTGCATCATTTGGAGCAGCGAGGCTTCCG ATGGATCAAGTTGAACTCCAGATCCGTGAGATCCAACAGAGAACCTCCAATGCCTTT GCTGTCAACCTCATGCCTGGTCCTGATGAAGCTGCAACGGTGGATGCCCTTCTGAGA ACGGGAGTCAGCATTGTGGAGGCGTCTGGTTACACGGGTGCGCTCTCTGCGGATCTG GTGAGATACCGTGTGACCGGTCTCAGAAGGACCTCCTGTGGTGCTTCTGTGTCAGCG I ACTCACCGTGTTGTGGCCAAAGTTTCAAGAACTGAGGTGGCTGAACATTTCCTGAGA CCAGCACCTGCAGCTGTTCTTGAGGCTTTGGTGGCAGCCAAACAAATCACTCCTGAG CAAGCTGCGCTTGCCAGCAGAGTCGCGATGGCTGACGATGTCGCGGTGGAGGCAGAT TCTGGAGGGCACACTGACAACCGTCCAATCCATGTGCTCCTTCCTTTGGTTGTGGCT CAGAGGAACAGATGGAGGCATCTGGTTGACACGCCAGTGCGTGTGGGAGCTGGAGGT GGGATAGCATGTCCGAGAGCAGCGTTGCTTGCCTTCTCCTTGGGTGCAGCCTTTGTG GTCACTGGAAGTGTCAACCAGCTTGCTCGTGAAGCTGGGACCTCTGATGCAGTCAGA CTCCTTTTGGCGACTGCCACCTATAGTGATGTGGCGATGGCTCCTGGTGGAGTCCAA GTGTTGAAGAAACAAACCATGTTCGCTGCGAGAGCAACGATGTTGGCTCAGCTCCAA GCCAAGTTTGGTTCCTTTGATGCTGTGCCAGAACCGCAACTGAGAAAACTGGAGAGA TCAGTGTTCAAGCAGAGTGTTGCTGATGTTTGGGCAGCTGCAAGGGAAAAGTTTGGG
GTTGATGCCACGGCTGCAAGTCCGCAAGAGAGGATGGCTCTCTGTGTCAGATGGTAC ATGTCTCAAAGCTCACGTTGGGCAACAGAGGCCACTTCAGCAAGGAAAGCGGACTAT CAGATTTGGTGTGGTCCTGCAATAGGCAGCTTCAATGACTTCGTCAGAGGCACCAAA CTTGATGCCACGGCTGGGACTGGTGAGTTCCCGAGAGTTGTGGACATCAACCAGCAC i ATCTTGCTGGGAGCCTCTCATTACAGAAGGGTTCAACAGCAA CAGCAAGACGATGACGTTGAGTACATCATTGTTTGA SEQ ID NO:7 shows the amino acid sequence of an exemplary PFA3 protein:
MTSSKKTPVWEMSKEELLDGKTWFDYNELLEFAEGDVGQVFGPEFDIIDKYRRRVR LPAREYLLVSRVTLMDAEVNNFRVGSRMVTEYDVPVNGELSEGGDVPWAVLVESGQC I DLMLISYMGIDFQCKGDRVYRLLNTSLTFFGVAHEGETLVYDIRVTGFAKGAGGEIS MFFFEYDCFVDGRLLIEMRDGCAGFFTDAELAAGKGVLKTKAELAARAQIQKQDIAP FAPAPCSHKTSLDAREMRLLVDRQWARVFGSGMAGIDYKLCARKMLMIDRVTHLDPR GGAHGLGLLIGEKVLERDHWYFPCHFVRDEVMAGSLVSDGCSQLLKVYMLWLGLHTT VGAFDFRPVSGHANKVRCRGQISPHKGKLVYVMEIKEMGFDAKTGDPFAIADVDIID VNFEEGQAFAGVEDLHSYGQGDLRKKIWDFKGIALSLQKRKEQQKESMTVTTTTTT TSRVIAPPSGCLKGDPTAPTSVTWHPMAEGNGGPGPTPSFSPSAYPPRAVCFSPFPN NPLDNDHTPGQMPLTWFNMSEFMCGKVSNCLGPEFARFDASKTSRSPAFDLALVTRV TSVADMEHGPFYNVDVNPGQGTMVGEFDCPADAWFFGASSRDDHMPYSILMEIALQT SGVLTSVLKAPLTMDKDDILFRNLDADAELVGDAMPDVRGKTIRNFTKCTGYSMLGK ' MGIHRFTFELSVDGAVFYKGSTSFGWFVPEVFESQTGLDNGKPRLPWYRENNVAVDT LSAPASASSAQGQLQLQRRGSQAQFLDTIHLAGSGAGVHGQGYAHGEKAVNKQDWFF SCHFWFDPVMPGSLGIESMFQLVEAWCVKQGLAARHGIAHPVFAHAPGATSWKYRGQ LTPKNDRMDSEVHIKSVAAFSSWVDVVADGFLFVDGLRVYSADNLRVRIQTGAGHVE EQEVAAKATTKNSSIADVDVADLQALKQALLTLERPLQLDAGSEVPACAVSDLGDRG FMETYGVVAPLYSGAMAKGIASADLVIAMGQRKMLGSFGAGGLPMHVVRAGIEKIQA ALPAGPYAVNLIHSPFDANLEKGNVDLFLEKGVRWEASAFMELTPQWRYRATGLS RDARGGSVRTAHKIIGKVSRTELAEMFIRPAPQAILDKLVASGEITPEQAALALEVP MADDIAVEADSGGHTDNRPIHVILPLILSLRNRLQRELKYPARHRVRVGAGGGIGCP QAALGAFHMGAAFWTGTVNQLSRQAGTCDNVRRQLSRATYSDITMAPAADMFEQGV ELQVLKKGTMFPSRAKKLFELFHKYDSFEAMPADELARVEKRIFSKSLAEVWAETKD FYITRLNNPEKIRKAENEDPKLKMSLCFRWYLGLSSFWANNGIADRTMDYQIWCGPA IGAFNDFIADSYLDVAVS
GEFPDWQINLQILSGAAYLQRLLSVKLAPRIDVDTEDDLFTYRPDHAL SEQ ID NO:8 shows the nucleotide sequence of an exemplary PFA3 gene, referred to herein as PFA3 vl, isolated from Thraustochytrid Schizochytrium sp. (as represented by ATCC Accession No. PTA-9695):
ATGACATCATCGAAGAAGACTCCCGTGTGGGAGATGAGCAAGGAGGAGCTGCTGGAC
GGCAAGACGGTGGTCTTCGACTACAACGAGCTGCTCGAATTCGCCGAGGGCGACGTG
GGCCAAGTGTTCGGACCCGAGTTCGACATCATCGACAAGTACCGGCGTCGCGTGCGG
CTGCCGGCGCGCGAGTACCTGCTCGTGTCGCGCGTGACGCTGATGGACGCCGAGGTG
AACAACTTCCGCGTCGGGTCGCGCATGGTGACCGAGTACGACGTGCCCGTGAACGGG
GAGCTGTCGGAGGGCGGGGACGTGCCGTGGGCGGTGCTGGTGGAGTCGGGGCAGTGC
GACCTGATGCTCATCTCGTACATGGGCATCGACTTCCAGTGCAAGGGCGACCGCGTG
TACCGCCTGCTCAACACATCGCTCACCTTCTTCGGGGTGGCGCACGAGGGCGAGACG
CTGGTGTACGACATCCGCGTCACGGGGTTCGCCAAGGGCGCGGGCGGGGAGATCTCG
ATGTTCTTCTTCGAGTACGACTGCTTCGTGGACGGCCGCCTGCTGATCGAGATGCGC
GACGGGTGCGCCGGGTTCTTCACGGACGCCGAGCTGGCCGCCGGCAAGGGCGTGCTT AAGACCAAGGCGGAGCTGGCGGCGCGCGCGCAGATCCAGAAGCAGGACATCGCGCCC TTTGCGCCGGCGCCGTGCTCGCACAAGACCTCGCTGGACGCGCGCGAGATGCGGCTG CTCGTGGACCGCCAGTGGGCGCGCGTCTTCGGCAGCGGCATGGCGGGCATCGACTAC 5 AAGTTGTGCGCTCGCAAGATGCTCATGATCGACCGCGTCACGCACCTCGACCCGCGC GGCGGCGCGCACGGCCTCGGGCTGCTGATCGGGGAGAAGGTGCTGGAGCGCGACCAC TGGTACTTCCCCTGCCACTTTGTGCGCGACGAGGTGATGGCCGGGTCGCTGGTCAGC GACGGCTGCTCGCAGCTCCTCAAGGTGTACATGCTGTGGCTCGGCCTGCACACGACC GTGGGCGCGTTCGACTTTCGTCCCGTGAGCGGGCACGCCAACAAGGTGCGGTGCCGC ) GGGCAGATCTCACCGCACAAGGGCAAGCTCGTGTACGTGATGGAGATCAAGGAAATG GGCTTTGACGCGAAGACGGGCGATCCGTTTGCGATCGCGGACGTGGACATCATCGAC GTCAACTTCGAGGAGGGACAGGCGTTTGCGGGAGTGGAAGACCTGCACAGCTACGGC CAGGGCGACCTCCGCAAGAAGATCGTCGTCGACTTCAAGGGCATCGCGCTCTCCCTG CAGAAGCGGAAGGAGCAGCAGAAGGAAAGCATGACCGTGACTACGACGACGACGACG Ϊ ACGAGCCGGGTGATTGCGCCGCCCAGCGGGTGCCTCAAGGGCGACCCGACGGCGCCG ACGAGCGTGACGTGGCACCCGATGGCGGAGGGCAACGGCGGGCCCGGACCGACGCCG TCGTTCTCGCCGTCCGCGTACCCGCCGCGGGCGGTGTGCTTCTCGCCGTTCCCCAAC AACCCGCTTGACAACGACCACACGCCGGGCCAGATGCCGTTGACCTGGTTCAACATG TCCGAATTCATGTGCGGCAAAGTGTCCAACTGCCTGGGCCCCGAGTTTGCGCGCTTC ) GACGCGAGCAAGACGAGCCGCAGCCCGGCCTTTGACCTGGCGCTCGTGACGCGGGTG ACGAGCGTGGCGGACATGGAGCACGGGCCGTTCTACAACGTGGACGTCAACCCGGGC CAGGGCACGATGGTGGGCGAGTTCGACTGTCCCGCGGACGCGTGGTTCTTCGGCGCC TCGAGCCGCGACGACCACATGCCGTACTCGATCCTGATGGAGATCGCGCTGCAGACG TCGGGCGTCCTCACCTCGGTGCTCAAGGCGCCGCTGACGATGGACAAGGACGACATC i CTCTTCCGCAACCTCGACGCAGACGCCGAGCTCGTGGGCGACGCCATGCCGGACGTG CGCGGCAAGACGATCCGCAACTTCACCAAGTGCACAGGCTACAGCATGCTCGGCAAG ATGGGCATCCACCGCTTCACCTTTGAGCTCAGCGTCGACGGCGCCGTCTTCTACAAG GGCAGCACCTCGTTTGGCTGGTTCGTCCCCGAGGTCTTCGAGTCGCAGACCGGTCTC GACAACGGCAAGCCGCGCCTGCCTTGGTACCGCGAGAACAACGTCGCCGTCGACACG I CTCTCCGCGCCCGCCTCCGCTTCCTCCGCGCAAGGTCAGCTGCAGCTGCAGCGACGC GGGTCGCAGGCGCAGTTCCTGGACACAATCCACCTGGCGGGCAGCGGCGCCGGCGTG CACGGCCAGGGCTACGCGCACGGGGAGAAGGCCGTGAACAAGCAAGATTGGTTCTTC TCGTGCCACTTCTGGTTCGACCCCGTGATGCCCGGGTCCCTGGGCATCGAGTCGATG TTCCAGCTCGTCGAGGCGTGGTGCGTGAAGCAGGGACTCGCGGCGCGGCACGGCATC i GCTCACCCAGTGTTCGCGCACGCGCCCGGGGCCACGAGCTGGAAGTACCGCGGGCAG CTAACCCCCAAGAACGACCGCATGGACAGCGAGGTGCACATCAAGTCGGTGGCGGCC TTCTCCTCCTGGGTCGACGTCGTCGCGGACGGGTTCCTCTTCGTCGACGGCCTCCGC GTCTACTCGGCAGACAACCTCCGCGTCCGCATCCAGACCGGCGCCGGCCACGTTGAA GAGCAAGAGGTTGCTGCCAAGGCCACAACCAAGAACAGCAGTATTGCTGATGTGGAC I GTGGCGGACCTGCAAGCGCTCAAGCAGGCGTTGCTGACGCTGGAGCGACCGCTGCAG CTGGACGCGGGGAGCGAGGTGCCCGCCTGCGCGGTGAGCGACCTGGGCGATAGGGGC TTCATGGAGACGTACGGGGTGGTGGCGCCGCTGTACAGCGGGGCGATGGCCAAGGGC ATCGCGTCGGCGGACCTGGTGATCGCGATGGGCCAGCGCAAGATGCTGGGGTCGTTT GGCGCGGGCGGGCTCCCGATGCACGTCGTGCGCGCGGGGATTGAGAAGATCCAGGCA : GCGCTGCCAGCGGGGCCATACGCGGTCAACCTGATTCACTCGCCTTTTGACGCCAAC
CTGGAGAAGGGCAACGTGGACCTCTTCCTGGAGAAGGGCGTGCGCGTCGTGGAGGCG TCGGCCTTCATGGAGCTCACGCCCCAGGTGGTGCGCTACCGCGCGACGGGCCTCTCT CGCGACGCGCGCGGCGGCTCCGTGCGCACGGCCCACAAGATCATCGGCAAGGTCAGC CGCACCGAGCTGGCCGAGATGTTTATCCGGCCCGCGCCGCAAGCCATTCTCGACAAG i CTTGTGGCGTCCGGCGAGATCACCCCCGAGCAGGCGGCGCTGGCGCTCGAGGTGCCC
ATGGCGGACGACATCGCCGTCGAGGCCGATTCGGGCGGGCACACCGACAACCGCCCC ATCCACGTCATCCTGCCCCTCATCCTCAGCCTGCGCAACCGCCTCCAGCGCGAGCTC AAGTACCCTGCGCGACACCGCGTGCGCGTCGGCGCCGGGGGCGGCATCGGGTGCCCG CAAGCGGCTCTGGGCGCCTTCCACATGGGCGCCGCGTTTGTGGTGACGGGCACGGTC i AACCAGCTGAGCCGGCAGGCCGGGACATGCGACAATGTGCGGCGGCAGCTGTCGCGC GCGACGTACTCGGACATCACGATGGCGCCGGCGGCGGACATGTTCGAGCAGGGCGTC GAGCTGCAGGTGCTCAAGAAGGGCACGATGTTTCCCTCGCGCGCCAAGAAGCTGTTC GAGCTGTTTCACAAGTACGACTCGTTCGAGGCGATGCCGGCGGACGAGCTGGCGCGC GTCGAGAAGCGCATCTTCAGCAAGTCACTCGCCGAGGTGTGGGCCGAGACCAAGGAC I TTCTACATCACGCGGCTCAACAACCCGGAGAAGATCCGCAAGGCGGAGAACGAGGAC CCCAAGCTCAAGATGTCACTCTGCTTCCGCTGGTACCTCGGGCTCAGCTCGTTCTGG GCCAACAACGGCATCGCGGACCGCACGATGGACTACCAGATCTGGTGCGGCCCTGCC ATCGGCGCCTTCAACGACTTCATCGCCGACTCGTACCTCGACGTGGCCGTCTCGGGC GAGTTCCCCGACGTCGTGCAGATCAACCTGCAGATCCTGTCGGGCGCAGCCTACCTC ! CAGCGCCTCCTCTCCGTCAAGCTCGCACCGCGGATCGACGTCGACACCGAGGACGAC CTCTTCACCTACCGC CCCGACCACGCACTCTAA SEQ ID NO:9 shows the nucleotide sequence of an exemplary plant-optimized PFA3 gene, referred to herein as PFA3 v2:
I ATGACATCTTCAAAGAAAACTCCTGTTTGGGAAATGAGCAAGGAAGAGCTGTTGGAT GGCAAGACGGTTGTCTTTGACTACAACGAGCTGTTGGAGTTTGCGGAGGGTGATGTT GGTCAAGTGTTTGGACCAGAGTTTGACATCATTGACAAGTACAGAAGGCGTGTGAGG CTTCCAGCCAGAGAATACTTGCTTGTTTCAAGAGTGACTCTCATGGATGCCGAGGTG AATAACTTCAGAGTTGGCTCCAGAATGGTCACTGAGTATGATGTTCCAGTCAATGGT GAGTTGTCAGAGGGAGGTGATGTTCCCTGGGCAGTTCTTGTTGAAAGTGGGCAGTGT GACTTGATGCTCATCTCCTACATGGGGATTGACTTCCAGTGCAAAGGGGACCGTGTT TACAGATTGCTCAACACATCTCTCACCTTCTTTGGGGTTGCCCATGAAGGAGAAACC CTTGTGTATGACATCAGAGTCACTGGTTTCGCCAAGGGTGCTGGTGGGGAAATCTCA ATGTTCTTTTTCGAGTATGACTGCTTTGTTGATGGCAGATTGCTCATAGAGATGAGA GATGGTTGTGCTGGCTTCTTTACTGATGCCGAACTTGCCGCTGGAAAAGGTGTGCTC AAAACGAAGGCTGAGCTTGCTGCACGTGCTCAGATCCAGAAACAAGACATTGCACCC TTTGCACCTGCACCGTGCAGTCACAAAACCAGCTTGGATGCCAGAGAAATGAGACTG CTTGTTGATAGGCAATGGGCAAGGGTCTTTGGTTCTGGAATGGCTGGCATAGACTAC AAGTTGTGTGCGAGAAAGATGCTGATGATTGACAGAGTCACACACCTTGATCCGCGT GGAGGTGCTCACGGTCTTGGGCTTCTCATTGGGGAGAAAGTGCTTGAGAGGGACCAC TGGTACTTCCCCTGCCACTTTGTGAGGGATGAGGTCATGGCTGGTTCTCTTGTCTCA GATGGATGCTCTCAGCTTCTCAAGGTTTACATGTTGTGGCTTGGCCTTCACACCACT GTTGGTGCGTTCGACTTTCGTCCAGTCAGTGGTCATGCCAACAAAGTGAGGTGTCGT GGACAGATTTCACCGCACAAGGGGAAACTTGTTTATGTCATGGAGATCAAAGAAATG GGCTTTGATGCCAAAACTGGAGATCCATTTGCCATAGCTGATGTTGACATCATTGAT GTCAACTTTGAAGAGGGACAAGCGTTTGCTGGAGTTGAGGATCTTCACAGCTATGGC CAAGGAGATTTGAGGAAAAAGATAGTTGTGGATTTCAAGGGAATTGCGTTGTCACTG CAGAAAAGGAAGGAGCAACAGAAAGAGAGCATGACTGTCACCACTACGACCACGACA ACCAGCAGAGTGATTGCTCCTCCAAGTGGATGCCTCAAAGGTGATCCCACTGCTCCC ACCTCTGTCACGTGGCATCCAATGGCTGAGGGAAATGGAGGTCCTGGACCCACTCCG TCCTTCTCTCCTTCAGCGTATCCTCCCAGAGCTGTTTGCTTCTCTCCTTTCCCCAAC AATCCGCTTGACAATGATCATACACCTGGCCAAATGCCGTTGACCTGGTTCAACATG TCTGAGTTCATGTGTGGAAAAGTGAGCAACTGCTTGGGTCCTGAGTTTGCCAGATTT
GATGCTTCCAAGACATCCAGATCACCAGCTTTTGACCTGGCTCTTGTGACAAGGGTG ACGAGTGTGGCTGACATGGAACATGGTCCTTTCTACAATGTGGATGTCAACCCTGGC CAAGGCACGATGGTGGGTGAGTTTGATTGTCCTGCAGATGCTTGGTTCTTTGGAGCC TCAAGCAGAGACGATCACATGCCGTACAGCATCTTGATGGAGATTGCTCTTCAGACT 5 TC T GGAG T C C T CACAT CTG T GC TCAAAGC T C C GC T CACAAT G GACAAAGAT GACAT C CTTTTCAGAAACCTTGATGCAGATGCAGAACTTGTGGGTGATGCCATGCCTGATGTC AGAGGGAAAACCATAAGGAACTTCACCAAATGCACGGGATACTCCATGCTTGGCAAG ATGGGAATCCATCGTTTCACCTTCGAACTCTCTGTTGACGGAGCAGTTTTCTACAAA GGGAGCACCTCTTTTGGTTGGTTTGTTCCTGAGGTCTTTGAGAGCCAGACTGGATTG ) GACAATGGCAAGCCGAGGTTGCCTTGGTATAGGGAAAACAATGTGGCAGTGGACACA CTCTCAGCACCTGCGTCAGCTTCTAGTGCCCAAGGTCAGCTTCAGCTTCAGAGGAGA GGGTCACAAGCGCAGTTCCTGGACACAATTCATCTTGCTGGGAGTGGAGCTGGAGTG CATGGCCAAGGTTATGCTCATGGGGAGAAAGCTGTGAACAAGCAAGATTGGTTCTTT TCTTGCCATTTCTGGTTTGACCCAGTGATGCCTGGGTCTTTGGGAATTGAGTCCATG 5 TTCCAGCTTGTGGAAGCGTGGTGTGTCAAACAAGGCTTGGCTGCAAGGCATGGAATT GCTCATCCAGTCTTTGCACATGCACCTGGTGCCACCAGCTGGAAGTACAGAGGTCAG TTGACCCCAAAGAATGACAGAATGGACAGTGAAGTTCACATCAAGAGTGTTGCTGCC TTCTCCTCATGGGTTGATGTGGTTGCTGATGGGTTCCTCTTCGTTGATGGCCTCAGA GTCTATTCAGCAGACAACCTGAGGGTCAGAATCCAGACTGGAGCTGGCCATGTTGAA ) GAGCAAGAAGTTGCTGCCAAAGCCACCACAAAGAACTCCAGCATTGCTGATGTGGAT GTGGCTGATCTTCAAGCTCTCAAACAAGCGTTGCTGACACTGGAGAGACCATTGCAG TTGGATGCTGGAAGTGAGGTGCCAGCCTGTGCTGTCAGCGATTTGGGAGACCGTGGA TTCATGGAGACTTATGGGGTGGTTGCTCCGTTGTACAGTGGTGCGATGGCCAAGGGA ATAGCCTCTGCGGATCTGGTCATAGCAATGGGTCAGAGGAAGATGTTGGGGAGCTTT ) GGAGCTGGTGGGTTGCCAATGCACGTTGTCCGTGCTGGGATTGAAAAGATCCAAGCT GCACTTCCCGCTGGTCCGTATGCTGTCAACCTCATCCACTCACCGTTCGATGCCAAC CTGGAAAAGGGCAATGTTGATCTTTTCCTGGAAAAGGGAGTTCGTGTGGTTGAGGCG TCTGCCTTCATGGAACTCACACCACAAGTGGTCCGTTACAGAGCCACGGGACTCTCC AGAGATGCGAGAGGTGGCTCAGTGAGGACAGCACACAAGATCATAGGAAAGGTTTCC ) AGAACAGAGCTTGCGGAGATGTTCATCAGACCTGCACCTCAAGCAATTCTGGACAAA CTTGTTGCGTCTGGTGAAATCACCCCTGAGCAAGCTGCGTTGGCTCTTGAAGTTCCA ATGGCTGATGACATTGCAGTTGAGGCTGACAGTGGAGGGCACACTGACAACCGTCCC ATTCATGTCATTCTGCCGTTGATCCTCAGTCTGAGGAATAGGCTCCAGAGGGAACTC AAGTACCCTGCCAGACACCGTGTTAGGGTTGGTGCTGGTGGAGGCATAGGTTGTCCT ) CAAGCTGCACTTGGAGCCTTCCACATGGGAGCTGCGTTTGTTGTGACTGGCACTGTC AACCAGCTGTCCCGTCAAGCTGGAACATGTGACAACGTGAGGCGTCAGCTCTCTCGT GCCACTTACTCTGACATCACGATGGCACCAGCTGCAGACATGTTTGAACAAGGAGTT GAACTGCAAGTTCTCAAGAAAGGAACGATGTTCCCATCTCGTGCCAAGAAACTCTTT GAACTGTTCCACAAGTATGATTCCTTTGAAGCAATGCCTGCGGATGAATTGGCTCGT ) GTTGAGAAGAGGATCTTCTCCAAGTCCCTTGCAGAAGTTTGGGCAGAGACCAAAGAT TTCTACATCACTCGTCTCAACAATCCTGAAAAGATCAGAAAGGCTGAGAATGAGGAC CCCAAGCTCAAGATGTCCCTCTGCTTCCGTTGGTACTTGGGTCTCAGCTCATTCTGG GCAAACAATGGCATAGCTGACCGTACGATGGATTACCAGATTTGGTGTGGACCTGCC ATAGGAGCCTTCAACGATTTCATTGCAGACAGCTATCTTGATGTTGCAGTCTCTGGT I GAGTTCCCTGATGTTGTGCAGATCAACCTTCAAATCCTGTCTGGTGCTGCGTATCTC CAGAGATTGCTCAGTGTCAAACTTGCACCAAGGATAGATGTGGACACTGAAGATGAC CTCTTCACCTACAGA CCAGATCATGCACTCTGA SEQ ID NO: 10 shows the nucleotide sequence of an exemplary Hetl gene:
ATGCTTCAGCACACTTGGCTTCCGAAGCCTCCCAATCTGACCCTCTTGTCAGATGAG GTTCATCTCTGGAGGATTCCTCTTGACCAGCCTGAGTCACAACTTCAAGACCTTGCT GCCACCCTGAGCAGTGATGAATTGGCGAGGGCAAACAGATTCTACTTTCCAGAACAC j AGAAGGCGTTTCACTGCTGGGAGAGGCATCCTCAGATCCATCTTGGGTGGATACTTG GGAGTGGAACCGGGTCAAGTCAAGTTTGATTATGAGTCCCGTGGGAAACCGATCCTT GGTGACAGATTTGCTGAGAGTGGACTCCTGTTCAACTTGTCTCACAGCCAGAACCTT GCCTTGTGTGCTGTCAACTACACGCGTCAAATAGGCATTGATCTTGAATATCTGCGT CCAACATCTGACTTGGAGTCTCTTGCAAAGAGGTTCTTTCTCCCAAGAGAATATGAA ) CTCTTGAGGTCACTCCCTGATGAGCAGAAACAGAAGATTTTCTTTCGTTACTGGACT TGCAAAGAGGCTTATCTCAAAGCAACGGGAGATGGAATAGCCAAACTTGAAGAGATC GAGATAGCACTCACCCCAACAGAACCTGCCAAGCTCCAAACAGCTCCTGCGTGGTCT CTGTTGGAGCTTGTGCCAGATGACAATTGTGTT GCAGCTGTGGCTGTTGCGGGTTTTGGTTGGCAGCCCAAGTTCTGGCATTACTGA > SEQ ID NO: 11 shows the nucleotide sequence of an exemplary SzACS2 gene, isolated from Schizochytrium ATCC Accession No. 20888:
ATGGCTCCCACTCCCGACGCCACCGCGCCTCTGAACAAGCCGAGCGACTATGCCGTC TACCACGAGGAAGACGGCCCCTTCTGGACCGCCGATTCCAGCGGCGTCTCGCGCGTG AACTTTAGCGAGACCGGCGTGGGATCCGAGGGCGTCATCCCTGCGCTCACGCTCATC ) GACGTCTTCGAGAGGGCCGTCAAGCGCGGCGGAAACAGGATCGCCTTCCGCACGGAA AACATGCCCACGCTCCGCCGCGGCGAAGAGGCCCCGGACGCGCTGCCGCTCAAGGAC TGGAAGTCCTGGTCCTGGAAACAGTACAAGGCCGACGTCCACCGCATCGCCAGGGCT CTCATGGACCTCGGCGTTGAGCAGCATGACGCCGTCTCCATTTTTGGCTTTAACTCG CCCGAATGGTTTCTCAGTGCCGTCGGCGCCGTGCACGCAGGTGCCAAGATTGCCGGC i ATTTACCCCTCAGACACGCCCGCCCAGGTCCAGTACAAGGCCTTCCACAGTGACACC GCTGTTGCCGTTGTCGAAAACGAGCAGTGCTTCAAGAAGTTCGCCGAGGTCGTCGAG GACCTTCCTTACCTCAAGGCCATTGTTTGCTGGGACTATGAAGCCACAGACATCACG CGCGAGGACGGCTCCGTCGTCGAGGTCCTCACCTTTGCCGAGTTCCTCAAGCGCGGC GACACCGTCGAGGCGGCCGCCCTTGACGAGCGCATCTCCAAGATCGAGCCCACCATG I TGCGCTGCCCTTATTTACACCAGCGGTACTACCGGCCGCCCCAAGGCCGTTATGATT TCGCACGACAACCTTGTTTTCGAGGCCAGCGCCGTCGTCCCCAACCTCGGAGGAGCC TGTACGACCACTGCTGAGGAGCGCATTCTCTCGTACCTGCCTCTCTCGCACGTCGCT GGTATGATGGTTGATATTATTGCCCCCATCATTGCCACCGCCTTCCACAAGGGCCGC ATCTGCGTCTGCTTTGCTCGCCCGTACGATTTGCGCACCGGCACGCTGGGCCAGCGC 1 CTCAACGCCGTGGAGCCCACCATCTTCCTTGGCGTGCCCCGTGTGTGGGAAAAGATT CAGGAAAAGCTCATGGCCGTCGGTGCCAAAACCACCGGCCTCAAGAAGAAGCTCTCT ACGGCCGCCAAGAAGCGTGGTCTTGAATTCCAGGAGGAGCAGCAAATCGGCCGCTCC GGTGCCAACCCTGGCTTTGGCCCCCTTGGCATCTACAAGAAGCTCCTCGGCCTCATC AAGGGCAAGCTGGGCCTCACCAAGTGCAAGTTTGCCTTTGCTGGTGCCGCGCCCATG I ACCCGTGAGACCCTTCAGTACTTTGGCGCGCTGAACATCAACATTAACGAGGTCTAC GGCATGTCCGAGTGCTCCGGTGCCGCCACCTGGTCCACGGACAAGGCCCACGAGTGG GGCACTGTTGGCTACGAGATGCCCAGTTGCGAGGTCCGCGTCTTCAAGATTGCCGAG GACGGTACCAAGACCGAGTGCCCGCGCGCCGCCGACATTATGCATGCTACCGAGGAG GAGCAGGGCGAAGTTTGCTTCCGCGGCCGTAACATCATGATGGGCTACCTTGCCAAC ! CCCAAGCTTGGCGACGACCACGTTGCCGAGATCGAGGAGAAGAACGCTGCCGCTATC GACTCCGAGGGCTGGCTCCACAGTGGTGATAAGGGCGCCATTTCTACCCGCGGCATG CTCAAGATCACGGGCCGCTACAAGGAGCTCATCATCGGCGCCGGTGGCGAGAACGTG
GCGCCCGTCCCTATTGAGGACGCCATCAAGGCGCGCATGCCTTTTGTTTCCAACGCC ATGATGGTCGGAGATAAGCGCAAGTTCATGGCTGTCCTCCTTACCCTCAAGACGGTT GGCGCCACGGGCGAGCTTCCCGGTACGAACAAGCTCATGGGCGCTGCCGCCGACTAT GGTGAGACCATCGAGGACGCCTGCGACAACGAGGCGCTCATTGAGGAGATCACGCAG 5 CAGCTCAAGGAGATCGGTGATGATGGCGATGTCACGCCCTCGAACGCGGCTCGCATC CAAAAGTTCACCATTCTCCCGCTCGACTTTTCCGTCTCCACGGACGAGCTCACGGCC ACGCTCAAGCTCAAGCGCTCCGTGGTCGCAGACAAGTACGAAGACATCATCGAGGCC TTTTACGAGTCCAAGAGCGTTTTTGTGCCGTACTCGACCGTTGGCGCCTACGCCACG GGCGGCCCGGTCGACGACTCCGTTGTCGATGGCTCCTTCAAGGGCGACTTTAGCATG ) ATTGGCGACGATGATCCGGATCTTCAAAAC
GTCGATGTCCTCGAGGCGATTGACGAGGACAATTAA SEQ ID NO: 12 shows the nucleotide sequence of a random “spacer” polynucleotide that is utilized in some embodiments. SEQ ID NO: 13 shows the nucleotide sequence of an exemplary 5’ truncated i PFA3 gene, referred to herein as PFA3 v3:
ATGAGCAAGGAGGAGCTGCTGGACGGCAAGACGGTGGTCTTCGACTACAACGAGCTG CTCGAATTCGCCGAGGGCGACGTGGGCCAAGTGTTCGGACCCGAGTTCGACATCATC GACAAGTACCGGCGTCGCGTGCGGCTGCCGGCGCGCGAGTACCTGCTCGTGTCGCGC GTGACGCTGATGGACGCCGAGGTGAACAACTTCCGCGTCGGGTCGCGCATGGTGACC ) GAGTACGACGTGCCCGTGAACGGGGAGCTGTCGGAGGGCGGGGACGTGCCGTGGGCG GTGCTGGTGGAGTCGGGGCAGTGCGACCTGATGCTCATCTCGTACATGGGCATCGAC TTCCAGTGCAAGGGCGACCGCGTGTACCGCCTGCTCAACACATCGCTCACCTTCTTC GGGGTGGCGCACGAGGGCGAGACGCTGGTGTACGACATCCGCGTCACGGGGTTCGCC AAGGGCGCGGGCGGGGAGATCTCGATGTTCTTCTTCGAGTACGACTGCTTCGTGGAC i GGCCGCCTGCTGATCGAGATGCGCGACGGGTGCGCCGGGTTCTTCACGGACGCCGAG CTGGCCGCCGGCAAGGGCGTGCTTAAGACCAAGGCGGAGCTGGCGGCGCGCGCGCAG ATCCAGAAGCAGGACATCGCGCCCTTTGCGCCGGCGCCGTGCTCGCACAAGACCTCG CTGGACGCGCGCGAGATGCGGCTGCTCGTGGACCGCCAGTGGGCGCGCGTCTTCGGC AGCGGCATGGCGGGCATCGACTACAAGTTGTGCGCTCGCAAGATGCTCATGATCGAC I CGCGTCACGCACCTCGACCCGCGCGGCGGCGCGCACGGCCTCGGGCTGCTGATCGGG GAGAAGGTGCTGGAGCGCGACCACTGGTACTTCCCCTGCCACTTTGTGCGCGACGAG GTGATGGCCGGGTCGCTGGTCAGCGACGGCTGCTCGCAGCTCCTCAAGGTGTACATG CTGTGGCTCGGCCTGCACACGACCGTGGGCGCGTTCGACTTTCGTCCCGTGAGCGGG CACGCCAACAAGGTGCGGTGCCGCGGGCAGATCTCACCGCACAAGGGCAAGCTCGTG : TACGTGATGGAGATCAAGGAAATGGGCTTTGACGCGAAGACGGGCGATCCGTTTGCG
ATCGCGGACGTGGACATCATCGACGTCAACTTCGAGGAGGGACAGGCGTTTGCGGGA GTGGAAGACCTGCACAGCTACGGCCAGGGCGACCTCCGCAAGAAGATCGTCGTCGAC TTCAAGGGCATCGCGCTCTCCCTGCAGAAGCGGAAGGAGCAGCAGAAGGAAAGCATG ACCGTGACTACGACGACGACGACGACGAGCCGGGTGATTGCGCCGCCCAGCGGGTGC I CTCAAGGGCGACCCGACGGCGCCGACGAGCGTGACGTGGCACCCGATGGCGGAGGGC AACGGCGGGCCCGGACCGACGCCGTCGTTCTCGCCGTCCGCGTACCCGCCGCGGGCG GTGTGCTTCTCGCCGTTCCCCAACAACCCGCTTGACAACGACCACACGCCGGGCCAG ATGCCGTTGACCTGGTTCAACATGTCCGAATTCATGTGCGGCAAAGTGTCCAACTGC CTGGGCCCCGAGTTTGCGCGCTTCGACGCGAGCAAGACGAGCCGCAGCCCGGCCTTT GACCTGGCGCTCGTGACGCGGGTGACGAGCGTGGCGGACATGGAGCACGGGCCGTTC TACAACGTGGACGTCAACCCGGGCCAGGGCACGATGGTGGGCGAGTTCGACTGTCCC GCGGACGCGTGGTTCTTCGGCGCCTCGAGCCGCGACGACCACATGCCGTACTCGATC
CTGATGGAGATCGCGCTGCAGACGTCGGGCGTCCTCACCTCGGTGCTCAAGGCGCCG CTGACGATGGACAAGGACGACATCCTCTTCCGCAACCTCGACGCAGACGCCGAGCTC GTGGGCGACGCCATGCCGGACGTGCGCGGCAAGACGATCCGCAACTTCACCAAGTGC ACAGGCTACAGCATGCTCGGCAAGATGGGCATCCACCGCTTCACCTTTGAGCTCAGC i GTCGACGGCGCCGTCTTCTACAAGGGCAGCACCTCGTTTGGCTGGTTCGTCCCCGAG GTCTTCGAGTCGCAGACCGGTCTCGACAACGGCAAGCCGCGCCTGCCTTGGTACCGC GAGAACAACGTCGCCGTCGACACGCTCTCCGCGCCCGCCTCCGCTTCCTCCGCGCAA GGTCAGCTGCAGCTGCAGCGACGCGGGTCGCAGGCGCAGTTCCTGGACACAATCCAC CTGGCGGGCAGCGGCGCCGGCGTGCACGGCCAGGGCTACGCGCACGGGGAGAAGGCC I GTGAACAAGCAAGATTGGTTCTTCTCGTGCCACTTCTGGTTCGACCCCGTGATGCCC GGGTCCCTGGGCATCGAGTCGATGTTCCAGCTCGTCGAGGCGTGGTGCGTGAAGCAG GGACTCGCGGCGCGGCACGGCATCGCTCACCCAGTGTTCGCGCACGCGCCCGGGGCC ACGAGCTGGAAGTACCGCGGGCAGCTAACCCCCAAGAACGACCGCATGGACAGCGAG GTGCACATCAAGTCGGTGGCGGCCTTCTCCTCCTGGGTCGACGTCGTCGCGGACGGG i TTCCTCTTCGTCGACGGCCTCCGCGTCTACTCGGCAGACAACCTCCGCGTCCGCATC CAGACCGGCGCCGGCCACGTTGAAGAGCAAGAGGTTGCTGCCAAGGCCACAACCAAG AACAGCAGTATTGCTGATGTGGACGTGGCGGACCTGCAAGCGCTCAAGCAGGCGTTG CTGACGCTGGAGCGACCGCTGCAGCTGGACGCGGGGAGCGAGGTGCCCGCCTGCGCG GTGAGCGACCTGGGCGATAGGGGCTTCATGGAGACGTACGGGGTGGTGGCGCCGCTG I TACAGCGGGGCGATGGCCAAGGGCATCGCGTCGGCGGACCTGGTGATCGCGATGGGC cagcgcaagatgctggggtcgtttggcgcgggcgggctcccgatgcacgtcgtgcgc
GCGGGGATTGAGAAGATCCAGGCAGCGCTGCCAGCGGGGCCATACGCGGTCAACCTG ATTCACTCGCCTTTTGACGCCAACCTGGAGAAGGGCAACGTGGACCTCTTCCTGGAG AAGGGCGTGCGCGTCGTGGAGGCGTCGGCCTTCATGGAGCTCACGCCCCAGGTGGTG CGCTACCGCGCGACGGGCCTCTCTCGCGACGCGCGCGGCGGCTCCGTGCGCACGGCC CACAAGATCATCGGCAAGGTCAGCCGCACCGAGCTGGCCGAGATGTTTATCCGGCCC GCGCCGCAAGCCATTCTCGACAAGCTTGTGGCGTCCGGCGAGATCACCCCCGAGCAG GCGGCGCTGGCGCTCGAGGTGCCCATGGCGGACGACATCGCCGTCGAGGCCGATTCG GGCGGGCACACCGACAACCGCCCCATCCACGTCATCCTGCCCCTCATCCTCAGCCTG ' CGCAACCGCCTCCAGCGCGAGCTCAAGTACCCTGCGCGACACCGCGTGCGCGTCGGC GCCGGGGGCGGCATCGGGTGCCCGCAAGCGGCTCTGGGCGCCTTCCACATGGGCGCC GCGTTTGTGGTGACGGGCACGGTCAACCAGCTGAGCCGGCAGGCCGGGACATGCGAC AATGTGCGGCGGCAGCTGTCGCGCGCGACGTACTCGGACATCACGATGGCGCCGGCG GCGGACATGTTCGAGCAGGGCGTCGAGCTGCAGGTGCTCAAGAAGGGCACGATGTTT CCCTCGCGCGCCAAGAAGCTGTTCGAGCTGTTTCACAAGTACGACTCGTTCGAGGCG ATGCCGGCGGACGAGCTGGCGCGCGTCGAGAAGCGCATCTTCAGCAAGTCACTCGCC GAGGTGTGGGCCGAGACCAAGGACTTCTACATCACGCGGCTCAACAACCCGGAGAAG ATCCGCAAGGCGGAGAACGAGGACCCCAAGCTCAAGATGTCACTCTGCTTCCGCTGG TACCTCGGGCTCAGCTCGTTCTGGGCCAACAACGGCATCGCGGACCGCACGATGGAC TACCAGATCTGGTGCGGCCCTGCCATCGGCGCCTTCAACGACTTCATCGCCGACTCG TACCTCGACGTGGCCGTCTCGGGCGAGTTCCCCGACGTCGTGCAGATCAACCTGCAG ATCCTGTCGGGCGCAGCCTACCTCCAGCGCCTCCTCTCCGTCAAGCTCGCACCGCGG ATCGAC
GTCGACACCGAGGACGACCTCTTCACCTACCGCCCCGACCACGCACTCTAA SEQ ID NO: 14 shows the amino acid sequence of an exemplary 5’ truncated PFA3 protein (PFA3v3):
MSKEELLDGKTVVFDYNELLEFAEGDVGQVFGPEFDIIDKYRRRVRLPAREYLLVSR
VTLMDAEVNNFRVGSRMVTEYDVPVNGELSEGGDVPWAVLVESGQCDLMLISYMGID
FQCKGDRVYRLLNTSLTFFGVAHEGETLVYDIRVTGFAKGAGGEISMFFFEYDCFVD GRLLIEMRDGCAGFFTDAELAAGKGVLKTKAELAARAQIQKQDIAPFAPAPCSHKTS LDAREMRLLVDRQWARVFGSGMAGIDYKLCARKMLMIDRVTHLDPRGGAHGLGLLIG EKVLERDHWYFPCHFVRDEVMAGSLVSDGCSQLLKVYMLWLGLHTTVGAFDFRPVSG i HANKVRCRGQISPHKGKLVYVMEIKEMGFDAKTGDPFAIADVDIIDVNFEEGQAFAG VEDLHSYGQGDLRKKIWDFKGIALSLQKRKEQQKESMTVTTTTTTTSRVIAPPSGC LKGDPTAPTSVTWHPMAEGNGGPGPTPSFSPSAYPPRAVCFSPFPNNPLDNDHTPGQ MPLTWFNMSEFMCGKVSNCLGPEFARFDASKTSRSPAFDLALVTRVTSVADMEHGPF YNVDVNPGQGTMVGEFDCPADAWFFGASSRDDHMPYSILMEIALQTSGVLTSVLKAP I LTMDKDDILFRNLDADAELVGDAMPDVRGKTIRNFTKCTGYSMLGKMGIHRFTFELS VDGAVFYKGSTSFGWFVPEVFESQTGLDNGKPRLPWYRENNVAVDTLSAPASASSAQ GQLQLQRRGSQAQFLDTIHLAGSGAGVHGQGYAHGEKAVNKQDWFFSCHFWFDPVMP GSLGIESMFQLVEAWCVKQGLAARHGIAHPVFAHAPGATSWKYRGQLTPKNDRMDSE VHIKSVAAFSSWVDWADGFLFVDGLRVYSADNLRVRIQTGAGHVEEQEVAAKATTK : NSSIADVDVADLQALKQALLTLERPLQLDAGSEVPACAVSDLGDRGFMETYGWAPL
YSGAMAKGIASADLVIAMGQRKMLGSFGAGGLPMHWRAGIEKIQAALPAGPYAVNL IHSPFDANLEKGNVDLFLEKGVRVVEA3AFMELTPQWRYRATGLSRDARGGSVRTA HKIIGKVSRTELAEMFIRPAPQAILDKLVASGEITPEQAALALEVPMADDIAVEADS GGHTDNRPIHVILPLILSLRNRLQRELKYPARHRVRVGAGGGIGCPQAALGAFHMGA i AFVVTGTVNQLSRQAGTC DNVRRQLSRATYS DITMAPAADMFEQGVELQVLKKGTMF
PSRAKKLFELFHKYDSFEAMPADELARVEKRIFSKSLAEVWAETKDFYITRLNNPEK IRKAENEDPKLKMSLCFRWYLGLSSFWANNGIADRTMDYQIWCGPAIGAFNDFIADS YLDVAVSGEFPDWQINL
QILSGAAYLQRLLSVKLAPRIDVDTEDDLFTYRPDHAL SEQ ID NOs: 15-38 show the nucleotide sequences of several plasmids utilized in certain examples herein. MODE(S) FOR CARRYING OUT THE INVENTION I. Overview of several embodiments 1 The classical pathway for synthesis of LC-PUEAs in eukaryotic organisms involves the elongation and desaturation of saturated or mono-unsaturated fatty acids. The pathway for synthesis of LC-PUFAs via a PUFA synthase is very different from the classical pathway. Specifically, PUFA synthases utilize malonyl-CoA as a carbon source, and produce the final PUFA(s) without releasing intermediates in any significant amount. Also, with PUFA synthases, the appropriate cis double bonds are added during the synthesis using a mechanism that does not require oxygen. For example, NADPH may be used as a reductant during the synthesis cycle.
Described herein are relatively inexpensive compositions and methods for the efficient and effective production of longer-chain or more unsaturated PUFAs (as well 1 as quantities of lipids, e g., TAG and PLs, enriched in such PUFAs) in plants, plant seed, or plant oil. Such fatty acids and methods of their production are useful in a variety of contexts, including dietary and industrial applications. A system for providing and improving PUFA production in host organisms (e.g., plants) by providing recombinant host organisms genetically modified with a PUFA synthase system (for example, containing PUFA synthase components identified from ; Thraustochytrid Schizochytrium alga and a phosphopantetheinyl transferase (Hetl) from the cyanobacteria genus, Nostoc, for example, also containing a Schizochytrium acyl-CoA synthetase isozyme 2), as described herein, provides significant benefits over conventional approaches for obtaining these fatty acids.
The marine Thraustochytrid Schizochytrium alga (as represented by ATCC i Accession No. PTA-9695) produces oil with a high ω-3/ω-6 ratio that may also be used as sources of PUFA synthase genes for crop transformations. Additionally, the Schizochytrium can produce oil that contains significant levels of EPA in addition to DHA. The ability to produce significant amounts of EPA is in contrast to some other Thraustochytrium strains (for example, Schizochytrium sp. ATCC Accession No. i 20888). U.S. Patent Publication No. US2013/0150599A1; PCT International Patent Publication No. WO2013/016546. The Schizochytrium PUFA synthase system may function when heterologously expressed in a wide variety of crop plants, as demonstrated herein by example in canola, soybean, and the model plant, Arabidopsis, to make commercially significant levels of ω-3 LC-PUFAs (e.g., DFIA and EPA). I Accordingly, this gene set may in some embodiments lead to the production of significantly more DHA and EPA than other PUFA synthase gene sets in plants.
Also described herein is the utility of various construct designs, including diversification of different seed-specific promoters and terminators, the use of spacer elements, altered transcriptional orientations, different relative positioning of genes i within the T-DNA, and the use of native and modified gene sequences. These construct designs may be used to further improve the number of LC-PUFA-producing events recovered, and the heritability of the ω-3 LC-PUFA trait in subsequent generations.
In examples herein, canola, soy, and Arabidopsis plants were transformed with I vectors harboring genes encoding three component polypeptides of PUFA synthase (/. e., PFA1, PFA2, and PFA3) from Thraustochytrid Schizochytrium alga, together with a phosphopantetheinyl transferase (Hetl) from Nostoc. In some examples, all four genes were contained within a single construct under the control of a seed-specific promoter, and were driven by a variety of seed-specific promoters in various configurations. Plant transformation experiments generated events that contained all four transgenes and expressed all four polypeptides in the seed. The co-3 LC-PUFAs DHA and EPA were produced in seed lipids from the resulting transgenic events. The co-6 LC-PUFA DPA was also detected. Canola events were recovered containing up to 2.9% DHA and 1.0% EPA (3.9% total o>-3 LC-PUFA) and 1.1% DHA + 2.0% EPA in bulk analyses of Ti seed. Up to 4.6% DHA and 3.7% EPA was detected in single-seed analyses of Ti canola seed. Soybean events were recovered containing up to 1.9% DHA and 2.2% EPA in single seed analyses of Ti seed.
I II. Abbreviations ACS acetyl-CoA synthetase DGAT diacylglycerol acyltransferase DHA docosahexaenoic acid DPA docosapentaenoic acid EPA eicosapentaenoic acid FAME fatty acid methyl ester HPLC high-performance liquid chromatography LC-PUFA long chain polyunsaturated fatty acid LPAT lysophosphatidyl acyltransferase LPCAT lysophosphatidylcholine acyltransferase P1P2P3H PFA1, PFA2, and PFA3 genes expressed with Hetl P1P2P3H-ACS PFA1, PFA2, and PFA3 genes expressed with Hetl and SzACS2 PDAT phospholipidrdiacylglycerol acyltransferase PL phospholipid PPTase phosphopantetheine transferase PTU plant transcription unit PUFA polyunsaturated fatty acid
SzACS2 Schizochytrium acyl-CoA synthetase isozyme 2 TAG triacylglycerol
Ill Terms
As used herein, except where the context requires otherwise the term "comprise" and variations of the term, such as "comprising", "comprises" and "comprised", are not intended to exclude other additives, components, integers or steps.
Baekcrossing: Backcrossing methods may be used to introduce a nucleic acid sequence into plants. The backcrossing technique has been widely used for decades to introduce new traits into plants. Jensen, N., Ed. Plant Breeding Methodology, John Wiley & Sons, Inc., 1988. In a typical backcross protocol, the original variety of interest (recurrent parent) is crossed to a second variety (non-recurrent parent) that carries a gene of interest to be transferred. The resulting progeny from this cross are then crossed again to the recurrent parent, and the process is repeated until a plant is obtained wherein essentially all of the desired morphological and physiological characteristics of Hie recurrent plant are recovered in the converted plant, in addition to the transferred gene from the non-recurrent parent.
Isolated: An “isolated” biological component (such as a nucleic acid or protein) has been substantially separated, produced apart from, or purified away from other biological components in the cell of the organism in which the component naturally occurs (/. e., other chromosomal and extra-chromosomal DNA and RNA, and proteins), while effecting a chemical or functional change in the component (e.g., a nucleic acid may be isolated from a chromosome by breaking chemical bonds connecting the nucleic acid to the remaining DNA in the chromosome). Nucleic acid molecules and proteins that have been “isolated” include nucleic acid molecules and proteins purified by standard purification methods. The term also embraces nucleic acids and proteins prepared by recombinant expression in a host cell, as well as chemically-synthesized nucleic acid molecules, proteins, and peptides.
Nucleic acid molecule: As used herein, the term “nucleic acid molecule” may refer to a polymeric form of nucleotides, which may include both sense and anti-sense strands of RNA, cDNA, genomic DNA, and synthetic forms and mixed polymers of the above. A nucleotide may refer to a ribonucleotide, deoxyribonucleotide, or a modified form of either type of nucleotide. A “nucleic acid molecule,” as used herein, is synonymous with “nucleic acid” and “polynucleotide.” A nucleic acid molecule is usually at least 10 bases in length, unless otherwise specified. The term includes single- and double-stranded forms of DNA. A nucleic acid molecule can include either or both naturally occurring and modified nucleotides linked together by naturally occurring and/or non-naturally occurring nucleotide linkages.
Nucleic acid molecules may be modified chemically or biochemically, or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, intemucleotide modifications (e.g., uncharged linkages: for example, methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.; charged linkages: for example, phosphorothioates, phosphorodithioates, etc.; pendent moieties: for example, peptides; intercalators: for example, acridine, psoralen, etc.; chelators; alkylators; and modified linkages: for example, alpha anomeric nucleic acids, etc.). The term “nucleic acid molecule” also includes any topological confonnation, including single-stranded, double-stranded, partially duplexed, triplexed, hairpinned, circular, and padlocked conformations.
Operably linked: A first nucleotide sequence is operably linked with a second nucleic acid sequence when the first nucleic acid sequence is in a functional relationship with the second nucleic acid sequence. When recombinantly produced, operably linked nucleic acid sequences are generally contiguous, and, where necessary to join two protein-coding regions, in the same reading frame {e.g., in a polycistronic ORF). However, nucleic acids need not be contiguous to be operably linked.
The term, “operably linked,” when used in reference to a regulatory sequence and a coding sequence, means that the regulatory sequence affects the expression of the linked coding sequence. “Regulatory sequences,” or “control elements,” refer to nucleotide sequences that influence the timing and level/amount of transcription, RNA processing or stability, or translation of the associated coding sequence. Regulatory sequences may include promoters; translation leader sequences; introns; enhancers; stem-loop structures; repressor binding sequences; termination sequences; and polyadenylation recognition sequences. Particular regulatory sequences may be located upstream and/or downstream of a coding sequence operably linked thereto. Also, particular regulatory sequences operably linked to a coding sequence may be located on the associated complementary strand of a double-stranded nucleic acid molecule.
Promoter: As used herein, the term “promoter” refers to a region of DNA that may be upstream from the start of transcription, and that may be involved in recognition and binding of RNA polymerase and other proteins to initiate transcription. A promoter may be operably linked to a coding sequence for expression in a cell, or a promoter may be operably linked to a nucleotide sequence encoding a signal sequence which may be operably linked to a coding sequence for expression in a cell. A “plant promoter” may be a promoter capable of initiating transcription in plant cells. Examples of promoters under developmental control include promoters that preferentially initiate transcription in certain tissues, such as leaves, roots, seeds, fibers, xylem vessels, tracheids, or sclerenchyma. Such promoters are referred to as “tissue-preferred.” Promoters which initiate transcription only in certain tissues are referred to as “tissue-specific.” A “cell type-specific” promoter primarily drives i expression in certain cell types in one or more organs, for example, vascular cells in roots or leaves. An “inducible” promoter may be a promoter which may be under environmental control. Examples of environmental conditions that may initiate transcription by inducible promoters include anaerobic conditions and the presence of light. Tissue-specific, tissue-preferred, cell type specific, and inducible promoters ! constitute the class of “non-constitutive” promoters. A “constitutive” promoter is a promoter which may be active in most cells of the organism under most environmental conditions.
Any inducible promoter can be used in some embodiments of the invention. See Ward et al. (1993) Plant Mol. Biol. 22:361-366. With an inducible promoter, the I rate of transcription increases in response to an inducing agent. Exemplary inducible promoters include, but are not limited to: Promoters from the ACEI system that responds to copper; In2 gene from maize that responds to benzenesulfonamide herbicide safeners; Tet repressor from TnlO; and the inducible promoter from a steroid hormone gene, the transcriptional activity of which may be induced by a i glucocorticosteroid hormone (Schena et al. (1991) Proc. Natl. Acad. Sci. USA 88:0421).
Exemplary constitutive promoters include, but are not limited to: Promoters from plant viruses, such as the 35S promoter from CaMV; promoters from rice actin genes; ubiquitin promoters; pEMU; MAS; maize H3 histone promoter; and the ALS ) promoter, Xbal/Ncol fragment 5' to the Brassica naprn ALS3 structural gene (or a nucleotide sequence similarity to said Xbal/Ncol fragment) (International Patent Publication No. WO 96/30530).
Additionally, any tissue-specific or tissue-preferred promoter may be utilized in some embodiments of the invention. Plants transformed with a nucleic acid molecule comprising a coding sequence operably linked to a tissue-specific promoter may produce the product of the coding sequence exclusively, or preferentially, in a specific tissue. Exemplary tissue-specific or tissue-preferred promoters include, but are not limited to: a root-preferred promoter, such as that from the phaseolin gene; a leaf-specific and light-induced promoter such as that from cab or rubisco; an anther-specific promoter such as that from LAT52; a pollen-specific promoter such as that from Zml3\ a microspore-preferred promoter such as that from apg, and a seed specific promoter (e.g., a promoter from PvDlec2, LJKCS3, FAE1, BoACP, or BnaNapinC).
Heterologous: The term “heterologous,” as applied to nucleic acids (e.g., polynucleotides, DNA, RNA, and genes) herein, means of different origin. For example, if a host cell is transformed with a nucleic acid that does not occur in the untransformed host cell in nature, then that nucleic acid is heterologous (and exogenous) to the host cell. Furthermore, different elements (e.g., promoter, enhancer, coding sequence, terminator, etc.) of a transforming nucleic acid may be heterologous to one another and/or to the transformed host. The term heterologous, as used herein, may also be applied to one or more nucleic acid(s) that are identical in sequence to a nucleic acid already present in a host cell, but that are now linked to different additional sequences and/or are present at a different copy number, etc.
Native: As used herein, the term “native” refers to the form of a polynucleotide or gene in its natural location in the organism or in the genome of an organism as found in nature, with its own regulatory sequences, if present.
Endogenous: As used herein, the term “endogenous” refers to a polynucleotide, gene, or polypeptide that is located in the organism or genome that normally comprises the molecule in nature.
Transformation: As used herein, the term “transformation” or “transduction” refers to the transfer of one or more nucleic acid molecule(s) into a cell. A cell is “transformed” by a nucleic acid molecule transduced into the cell when the nucleic acid molecule becomes stably replicated by the cell, either by incorporation of the nucleic acid molecule into the cellular genome, or by episomal replication. As used herein, the term “transformation” encompasses all techniques by which a nucleic acid molecule can be introduced into such a cell. Examples include, but are not limited to: transfection with viral vectors; transformation with plasmid vectors; electroporation (Fromm et al. (1986) Nature 319:791-3); lipofection (Feigner et al. (1987) Proc. Natl. Acad. Sci. USA 84:7413-7); microinjection (Mueller et al. (1978) Cell 15:579-85); Agrobacterium-mediated transfer (Fraley et al. (1983) Proc. Natl. Acad. Sci. USA 80:4803-7); direct DNA uptake; and microprojectile bombardment (Klein et al. (1987) Nature 327:70).
Transgene: An exogenous nucleic acid sequence that is integrated into the genome of the host. In some examples, a transgene may contain regulatory sequences operably linked to a coding sequence of the transgene (e.g., a promoter).
Vector: A nucleic acid molecule as introduced into a cell, for example, to produce a transformed cell. A vector may include nucleic acid sequences that permit it to replicate in the host cell, such as an origin of replication. Examples of vectors include, but are not limited to: a plasmid; cosmid; bacteriophage; or virus that carries exogenous DNA into a cell. A vector may also include one or more genes, antisense molecules, and/or selectable marker genes and other genetic elements known in the art. A vector may transduce, transform, or infect a cell, thereby causing the cell to express the nucleic acid molecules and/or proteins encoded by the vector. A vector optionally includes materials to aid in achieving entry of the nucleic acid molecule into the cell (e.g., a liposome, and protein coating).
Expression: As used herein, the term “expression” may refer to the transcription and stable accumulation of mRNA encoded by a polynucleotide, or to the translation of such an mRNA into a polypeptide. The term “over-expression,” as used herein, refers to expression that is higher than endogenous expression of the same or a closely related gene. A heterologous gene is over-expressed if its expression is higher than that of a closely-related endogenous gene (e.g., a homolog).
Exogenous: The term “exogenous,” as applied to nucleic acids (e.g., polynucleotides, DNA, RNA, and genes) herein, refers to one or more nucleic acid(s) that are not normally present within their specific environment or context. For example, if a host cell is transformed with a nucleic acid that does not occur in the untransformed host cell in nature, then that nucleic acid is exogenous to the host cell. The term exogenous, as used herein, also refers to one or more nucleic acid(s) that are identical in sequence to a nucleic acid already present in a host cell, but that are located in a different cellular or genomic context than the nucleic acid with the same sequence already present in the host cell. For example, a nucleic acid that is integrated in the genome of the host cell in a different location than a nucleic acid with the same sequence is normally integrated in the genome of the host cell is exogenous to the host cell. Furthermore, a nucleic acid (e.g., a DNA molecule) that is present in a plasmid or vector in the host cell is exogenous to the host cell when a nucleic acid with the same sequence is only normally present in the genome of the host cell.
Sequence identity: The term “sequence identity” or “identity,” as used herein, in the context of two nucleic acid or polypeptide sequences, may refer to the residues in the two sequences that are the same when aligned for maximum correspondence over a specified comparison window.
As used herein, the term “percentage of sequence identity” may refer to the value determined by comparing two optimally aligned sequences (e.g., nucleic acid sequences, and amino acid sequences) over a comparison window, wherein the portion of the sequence in the comparison window may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which does not comprise additions or deletions) for optimal alignment of the two sequences. The percentage is calculated by determining the number of positions at which the identical nucleotide or amino acid residue occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of positions in the comparison window, and multiplying the result by 100 to yield the percentage of sequence identity.
Methods for aligning sequences for comparison are well-known in the art. Various programs and alignment algorithms are described in, for example: Smith and Waterman (1981) Adv. Appl. Math. 2:482; Needleman and Wunsch (1970) J. Mol. Biol. 48:443; Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A. 85:2444; Higgins and Sharp (1988) Gene 73:237-44; Higgins and Sharp (1989) CABIOS 5:151-3; Corpet et al. (1988) Nucleic Acids Res. 16:10881-90; Huang etal. (1992) Comp. Appl. Biosci. 8:155-65; Pearson et al. (1994) Methods Mol. Biol. 24:307-31; Tatiana et al. (1999) FEMS Microbiol. Lett. 174:247-50. A detailed consideration of sequence alignment methods and homology calculations can be found in, e.g., Altschul etal. (1990) J. Mol. Biol. 215:403-10.
The National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST™; Altschul et al. (1990)) is available from several sources, including the National Center for Biotechnology Information (Bethesda, MD), and on the internet, for use in connection with several sequence analysis programs. A description of how to determine sequence identity using this program is available on the internet under the “help” section for BLAST™. For comparisons of nucleic acid sequences, the “Blast 2 sequences” function of the BLAST™ (Blastn) program may be employed using the default parameters. Nucleic acid sequences with even greater similarity to the reference sequences will show increasing percentage identity when assessed by this method.
As used herein, the term “substantially identical” may refer to nucleotide i sequences that are more than 85% identical. For example, a substantially identical nucleotide sequence may be at least 85.5%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; at least 99%; or at least 99.5% identical to the reference sequence. i In some embodiments, the presence of a heterologous nucleic acid in a plant may be detected through the use of a nucleic acid probe. A probe may be a DNA molecule or an RNA molecule. RNA probes can be synthesized by means known in the art, for example, using a DNA molecule template. A probe may contain all or a portion of the nucleotide sequence of the heterologous nucleic acid and additional, I contiguous nucleotide sequence from the plant genome. This is referred to herein as a “contiguous probe.” The additional, contiguous nucleotide sequence is referred to as “upstream” or “downstream” of the heterologous nucleic acid, depending on whether the contiguous nucleotide sequence from the plant chromosome is on the 5’ or the 3’ side of the heterologous nucleic acid, as conventionally understood. As is recognized i by those of ordinary skill in the art, the process of obtaining additional, contiguous nucleotide sequence for inclusion in a probe may be repeated nearly indefinitely (limited only by the length of the chromosome), thereby identifying additional nucleic acid along the chromosome. Any and all of the above-described varieties of probes may be used in some embodiments of the present invention. • A probe may contain a nucleotide sequence that is not contiguous to that of the heterologous nucleic acid; this probe is referred to herein as a “noncontiguous probe.” The sequence of the noncontiguous probe is located sufficiently close to the sequence of the heterologous nucleic acid on the chromosome so that the noncontiguous probe is genetically linked to the heterologous nucleic acid. A probe may also be an exact copy of a heterologous nucleic acid to be detected. A probe may also be a nucleic acid molecule comprising, or consisting of, a nucleotide sequence that is substantially identical to a cloned segment of chromosomal DNA comprising a heterologous nucleic acid to be detected.
An oligonucleotide probe sequence may be prepared synthetically or by cloning. Suitable cloning vectors are well-known to those of skill in the art. An oligonucleotide probe may be labeled or unlabeled. A wide variety of techniques exist for labeling nucleic acid molecules, including, for example and without limitation: Radiolabeling by nick translation; random priming; tailing with terminal deoxytransferase; etc., where the nucleotides employed are labeled, for example, with radioactive 32P. Other labels which may be used include, for example and without limitation: Fluorophores; enzymes; enzyme substrates; enzyme cofactors; enzyme inhibitors; etc. Alternatively, the use of a label that provides a detectable signal, by itself or in conjunction with other reactive agents, may be replaced by ligands to which receptors bind, where the receptors are labeled (for example, by the above-indicated labels) to provide detectable signals, either by themselves, or in conjunction with other reagents. See, e.g., Leary et al. (1983) Proc. Natl. Acad. Sci. USA 80:4045-9. A probe may also be a nucleic acid molecule that is “specifically hybridizable” or “specifically complementary” to an exact copy of the nucleic acid to be detected (“DNA target”). “Specifically hybridizable” and “specifically complementary” are terms that indicate a sufficient degree of complementarity such that stable and specific binding occurs between the nucleic acid molecule and the DNA target A nucleic acid molecule need not be 100% complementary to its target sequence to be specifically hybridizable. A nucleic acid molecule is specifically hybridizable when there is a sufficient degree of complementarity to avoid non-specific binding of the nucleic acid to non-target sequences under conditions where specific binding is desired, for example, under stringent hybridization conditions.
Hybridization conditions resulting in particular degrees of stringency will vary depending upon the nature of the hybridization method of choice and the composition and length of the hybridizing nucleic acid sequences. Generally, the temperature of hybridization and the ionic strength (especially the Na+ and/or Mg^ concentration) of the hybridization buffer will determine the stringency of hybridization, though wash times also influence stringency. Calculations regarding hybridization conditions required for attaining particular degrees of stringency are known to those of ordinary skill in the art, and are discussed, for example, in Sambrook et al. (ed.) Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989, chapters 9 and 11; and Hames and Higgins (eds.) Nucleic Acid Hybridization, IRL Press, Oxford, 1985. Further detailed instruction and guidance with regard to the hybridization of nucleic acids may be found, for example, in Tijssen, “Overview of principles of hybridization and the strategy of nucleic acid probe assays,” in Laboratory Techniques in Biochemistry and Molecular Biology- Hybridization with Nucleic Acid Probes, Part I, Chapter 2, Elsevier, NY, 1993; and Ausubel et al., Eds., Current Protocols in Molecular Biology, Chapter 2, Greene Publishing and Wiley-Interscience, NY, 1995.
As used herein, “stringent conditions” encompass conditions under which hybridization will only occur if there is less than 25% mismatch between the hybridization molecule and the DNA target. “Stringent conditions” include further particular levels of stringency. Thus, as used herein, “moderate stringency” conditions are those under which molecules with more than 25% sequence mismatch will not hybridize; conditions of “medium stringency” are those under which molecules with more than 15% mismatch will not hybridize; and conditions of “high stringency” are those under which sequences with more than 10% mismatch will not hybridize. Conditions of “very high stringency” are those under which sequences with more than 6% mismatch will not hybridize.
In particular embodiments, stringent conditions are hybridization at 65°C in 6x saline-sodium citrate (SSC) buffer, 5x Denhardt’s solution, 0.5% SDS, and 100 pg sheared salmon testes DNA, followed by 15-30 minutes sequential washes at 65°C in 2x SSC buffer and 0.5% SDS, followed by lx SSC buffer and 0.5% SDS, and finally 0.2x SSC buffer and 0.5% SDS.
With respect to all probes discussed, supra, the probe may comprise additional nucleic acid sequences, for example, promoters; transcription signals; and/or vector sequences.
Optimized: As used herein, in the context of a nucleic acid encoding a protein, the term “optimized” refers to a nucleic acid wherein a heterologous nucleotide sequence has been changed to reflect the codon bias of a target host organism. In some embodiments, the nucleotide sequence may be further changed to remove genetic elements that may interfere with gene expression.
It will be understood that, due to the redundancy of the genetic code, multiple DNA sequences may be designed to encode a single amino acid sequence. Thus, optimized DNA sequences may be designed, for example, to remove superfluous restriction sites and undesirable RNA secondary structures, while optimizing the nucleotide sequence of the coding region so that the codon composition resembles the overall codon composition of the host in which the DNA is to be expressed. Guidance regarding the design and production of synthetic DNA sequences can be found in, for example, International Patent Application Nos. WO2013016546, WO2011146524, and WO1997013402; and U.S. Patent Nos. 6,166,302 and 5,380,831.
Conservative substitution: As used herein, the term “conservative substitution” refers to a substitution where an amino acid residue is substituted for another amino acid in the same class. A non-conservative amino acid substitution is one where the residues do not fall into the same class, for example, substitution of a basic amino acid for a neutral or non-polar amino acid. Classes of amino acids that may be defined for the purpose of performing a conservative substitution are known in the art.
In some embodiments, a conservative substitution includes the substitution of a first aliphatic amino acid for a second, different aliphatic amino acid. For example, if a first amino acid is one of Gly; Ala; Pro; lie; Leu; Val; and Met, the first amino acid may be replaced by a second, different amino acid selected from Gly; Ala; Pro; lie; Leu; Val; and Met. In particular examples, if a first amino acid is one of Gly; Ala; Pro; lie; Leu; and Val, the first amino acid may be replaced by a second, different amino acid selected from Gly; Ala; Pro; He; Leu; and Val. In particular examples involving the substitution of hydrophobic aliphatic amino acids, if a first amino acid is one of Ala; Pro; He; Leu; and Val, the first amino acid may be replaced by a second, different amino acid selected from Ala; Pro; He; Leu; and Val.
In some embodiments, a conservative substitution includes the substitution of a first aromatic amino acid for a second, different aromatic amino acid. For example, if a i first amino acid is one of His; Phe; Trp; and Tyr, the first amino acid may be replaced by a second, different amino acid selected from His; Phe; Trp; and Tyr. In particular examples involving the substitution of uncharged aromatic amino acids, if a first amino acid is one of Phe; Trp; and Tyr, the first amino acid may be replaced by a second, different amino acid selected from Phe; Trp; and Tyr.
In some embodiments, a conservative substitution includes the substitution of a first hydrophobic amino acid for a second, different hydrophobic amino acid. For example, if a first amino acid is one of Ala; Val; lie; Leu; Met; Phe; Tyr, and Trp, the first amino acid may be replaced by a second, different amino acid selected from Ala; Val; lie; Leu; Met; Phe; Tyr; and Trp. In particular examples involving the substitution of non-aromatic, hydrophobic amino acids, if a first amino acid is one of Ala; Val; lie; Leu; and Met, the first amino acid may be replaced by a second, different amino acid selected from Ala; Val; He; Leu; and Met
In some embodiments, a conservative substitution includes the substitution of a first polar amino acid for a second, different polar amino acid. For example, if a first amino acid is one of Ser; Thr; Asn; Gin; Cys; Gly; Pro; Arg; His; Lys; Asp; and Glu, the first amino acid may be replaced by a second, different amino acid selected from Ser; Thr; Asn; Gin; Cys; Gly; Pro; Arg; His; Lys; Asp; and Glu. In particular examples involving the substitution of uncharged, polar amino acids, if a first amino acid is one of Ser; Thr; Asn; Gin; Cys; Gly; and Pro, the first amino acid may be replaced by a second, different amino acid selected from Ser; Thr; Asn; Gin; Cys; Gly; and Pro. In particular examples involving the substitution of charged, polar amino acids, if a first amino acid is one of His; Arg; Lys; Asp; and Glu, the first amino acid may be replaced by a second, different amino acid selected from His; Arg; Lys; Asp; and Glu. In further examples involving the substitution of charged, polar amino acids, if a first amino acid is one of Arg; Lys; Asp; and Glu, the first amino acid may be replaced by a second, different amino acid selected from Arg; Lys; Asp; and Glu. In particular examples involving the substitution of positively charged (basic), polar amino acids, if a first amino acid is one of His; Arg; and Lys, the first amino acid may be replaced by a second, different amino acid selected from His; Arg; and Lys. In further examples involving the substitution of positively charged, polar amino acids, if a first amino acid is Arg or Lys, the first amino acid may be replaced by the other amino acid of Arg and Lys. In particular examples involving the substitution of negatively charged (acidic), polar amino acids, if a first amino acid is Asp or Glu, the first amino acid may be replaced by the other amino acid of Asp and Glu.
In some embodiments, a conservative substitution includes the substitution of a first electrically neutral amino acid for a second, different electrically neutral amino acid. For example, if a first amino acid is one of Gly; Ser; Thr; Cys; Asn; Gin; and Tyr, the first amino acid may be replaced by a second, different amino acid selected from Gly; Ser; Thr; Cys; Asn; Gin; and Tyr.
In some embodiments, a conservative substitution includes the substitution of a first non-polar amino acid for a second, different non-polar amino acid. For example, if a first amino acid is one of Ala; Val; Leu; lie; Phe; Trp; Pro; and Met, the first amino acid may be replaced by a second, different amino acid selected from Ala; Val; Leu; lie; Phe; Trp; Pro; and Met.
In many examples, the selection of a particular second amino acid to be used in a conservative substitution to replace a first amino acid may be made in order to maximize the number of the foregoing classes to which the first and second amino acids both belong. Thus, if the first amino acid is Ser (a polar, non-aromatic, and electrically neutral amino acid), the second amino acid may be another polar amino acid (i.e., Thr; Asn; Gin; Cys; Gly; Pro; Arg; His; Lys; Asp; or Glu); another non-aromatic amino acid (i.e., Thr; Asn; Gin; Cys; Gly; Pro; Arg; His; Lys; Asp; Glu; Ala; lie; Leu; Val; or Met); or another electrically-neutral amino acid (i.e., Gly; Thr; Cys; Asn; Gin; or Tyr). However, it may be preferred that the second amino acid in this case be one of Thr; Asn; Gin; Cys; and Gly, because these amino acids share all the classifications according to polarity, non-aromaticity, and electrical neutrality. Additional criteria that may optionally be used to select a particular second amino acid to be used in a conservative substitution are known in the art. For example, when Thr; Asn; Gin; Cys; and Gly are available to be used in a conservative substitution for Ser, Cys may be eliminated from selection in order to avoid the formation of undesirable cross-linkages and/or disulfide bonds. Likewise, Gly may be eliminated from selection, because it lacks an alkyl side chain. In this case, Thr may be selected, e.g., in order to retain the functionality of a side chain hydroxyl group. The selection of the particular second amino acid to be used in a conservative substitution is ultimately, however, within the discretion of the skilled practitioner. PUFA: As used herein, the term “polyunsaturated fatty acid” or “PUFA” refers to a fatty acid with a carbon chain length of at least 16 carbons (e.g., at least 18 carbons, at least 20 carbons, and 22 or more carbons), with at least 3 or more double bonds (e g., 4 or more double bonds, 5 or more double bonds, and 6 or more double bonds), wherein all double bonds are in the cis configuration.
As used herein, the term “long chain polyunsaturated fatty acid” or “LC-PUFA” refers to a fatty acid with a carbon chain length of 20 or more carbons containing 3 or more double bonds, or 22 or more carbons containing 3 or more double bonds (e.g., 4 or more double bonds, 5 or more double bonds, and 6 or more double bonds). LC-PUFAs of the ω-6 series include, for example, and without limitation, di-homo-gamma-linolenic acid (C20:3 n-6), arachidonic acid (C20:4 n-6), adrenic acid (also called docosatetraenoic acid or DTA; C22:4 n-6), and docosapentaenoic acid (C22:5 n-6). LC-PUFAs of the ω-3 series include, for example and without limitation, eicosatrienoic acid (C20:3 n-3), eicosatetraenoic acid (C20:4 n-3), eicosapentaenoic acid (C20:5 n-3), docosapentaenoic acid (C22:5 n-3), and docosahexaenoic acid (C22:6 n-3). LC-PUFAs also include fatty acids with greater than 22 carbons and 4 or more double bonds, for example and without limitation, C28:8 (n-3).
The term “PUFA synthase” or “PFA,” as used herein, refers to an enzyme that produces PUFAs (e.g., LC-PUFAs), as well as a domain of such an enzyme in a system or complex. The term PUFA synthase includes, for example and without limitation, PUFA PKS systems or PKS-like systems for the production of PUFAs. Some specific PUFA synthases are designated herein by an additional notation ^Schizochytrium PUFA synthase,” PFA1, PFA2, and PFA3; e.g., from Schizochytrium sp. ATCC Accession No. PTA-9695). The term “PUFA synthase system” refers to one or more PUFA synthase(s) and any heterologous accessory enzymes that can affect the function of the PUFA synthase (e.g., a PPTase or ACS). PPTase: The terms “phosphopantetheinyl transferase” or “PPTase,” as used herein, refer to an enzyme that activates a PUFA synthase by transferring a cofactor (e.g., 4-phosphopantetheine) from coenzyme A (CoA) to one or more ACP domains present in the PUFA synthase. One example of a PPTase which can activate one or more ACP domains of a PUFA synthase utilized in embodiments herein is the Hetl protein of a Nostoc sp. (e.g., Hetl from PCC 7120; formerly called Anctbaena sp. PCC 7120), designated herein as “NoHetl.” ACS: As used herein, the term “acyl-CoA synthetase,” “ACoAS,” or “ACS” refer to an enzyme that catalyzes the conversion of long chain polyunsaturated free fatty acids (FFA) to acyl-CoA. Specific acyl-CoA synthetases utilized in particular embodiments herein, derived from Schizochytrium ATCC Accession No. 20888, are referred to by an additional notation; e.g., “SzACS2.”
Plant: The term “plant,” as used herein, includes any descendant, cell, tissue, seed, seed oil, or part thereof.
Trait or phenotype: The terms “trait” and “phenotype” are used interchangeably herein. For the purposes of the present disclosure, traits of particular interest include the ω-3 LC-PUFA trait, as may be expressed, for example, in an oilseed crop plant.
Functional food: As used herein, the term “functional food” refers to a food that is similar in appearance to a conventional food that is consumed as part of a usual diet and has enhanced nutritional value and/or specific dietary benefits based on a modification in the proportion of components that typically exist in the unmodified source material of the conventional food.
Unless specifically indicated or implied, the terms “a,” “an,” and “the” signify “at least one,” as used herein.
Unless otherwise specifically explained, all technical and scientific terms used herein have the same meaning as commonly understood by those of ordinary skill in the art to which this disclosure belongs. Definitions of common terms in molecular biology can be found in, for example, Lewin B., Genes V, Oxford University Press, 1994 (ISBN 0-19-854287-9); Kendrew et al. (eds.), The Encyclopedia of Molecular Biology, Blackwell Science Ltd., 1994 (ISBN 0-632-02182-9); and Meyers R.A. (ed.), Molecular Biology and Biotechnology: A Comprehensive Desk Reference, VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8). All percentages are by weight and all solvent mixture proportions are by volume unless otherwise noted. All temperatures are in degrees Celsius. TV. Heterologous PUFA synthase system Schizochytrium PUFA synthases
Embodiments herein include host organisms {e.g., plants) that are genetically modified to express a PUFA synthase. In some embodiments, an organism that has been genetically modified to express a heterologous PUFA synthase system, for example, a functional heterologous protein system comprising a PUFA synthase and at least one accessory protein thereof. The genetic modifications herein may also be used in some embodiments to improve PUFA production in a host organism that endogenously expresses a PUFA synthase. A PUFA synthase system may comprise several multifunctional proteins (and can include single function proteins) that can act together to conduct both iterative processing of the fatty acid chain, as well as non-iterative processing, including trans-cis isomerization and enoyl reduction reactions in selected cycles. These proteins are referred to herein as the core PUFA synthase enzyme system or the core PUFA synthase. General information and details about the domains and motifs contained within these proteins may be found in, for example: U.S. Patent Nos. 6,140,486 and 6,566,583; U.S. Patent Publication Nos. 2002/0194641, 2004/0235127, and 2005/0100995; International Patent Publication No. WO 2006/135866; and Metz et al. (2001) Science 293:290-3. Functional PUFA synthase domains may be found as a single protein (e.g., the domain and protein are synonymous) or as one of two or more domains in a single protein.
Numerous examples of polypeptides having PUFA synthase activity (and polynucleotides and genes encoding the same) are known in the art and may be combined in a genetically modified host comprising a heterologous PUFA synthase that is disclosed herein. Such PUFA synthase proteins (or domains) include both bacterial and non-bacterial PUFA synthases. A non-bacterial PUFA synthase may be a eukaryotic PFA. Certain bacterial PUFA synthases are described, for example, in U.S. Patent Publication No. 2008/0050505. Genetically modified plants of the invention can be produced which incorporate non-bacterial PUFA synthase functional domains with bacterial PUFA synthase functional domains, as well as PUFA synthase functional domains or proteins from other PKS systems (e.g., Type I iterative or modular, Type II, and Type III) and/or FAS systems.
In some embodiments, a heterologous PUFA synthase comprises biologically active domains that are typically contained on three, four, or more proteins selected from the group consisting of: at least one enoyl-ACP reductase (ER) domain; multiple acyl carrier protein (ACP) domain(s) (e.g., at least from one to four, or at least five ACP domains, and in some embodiments up to six, seven, eight, nine, ten, or more than ten ACP domains); at least two β-ketoacyl-ACP synthase (KS) domains; at least one acyltransferase (AT) domain; at least one β-ketoacyl-ACP reductase (KR) domain; at least two FabA-like β-hydroxyacyl-ACP dehydrase (DH) domains; at least one chain length factor (CLF) domain; and at least one malonyl-CoArACP acyltransferase (MAT) domain. In particular embodiments, a heterologous PUFA synthase also comprises at least one region containing a dehydratase conserved active site motif.
In some embodiments, a heterologous PUFA synthase system comprises a PUFA synthase (eg., PFA1, PFA2, and PFA3) from the Thraustochytrid Schizochytrium alga. For example, a heterologous PUFA synthase system according to embodiments herein may include, for example and without limitation, at least one protein comprising an amino acid sequence having at least 80% (e.g., at least 81%; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; and at least 99%) identity to SEQ ID NO:l; SEQ ID NO:4; SEQ ID NO:7; and/or SEQ ID NO: 14. In particular examples, a heterologous PUFA synthase system includes at least one protein comprising SEQ ID NO:l; SEQ ID NO:4; SEQ ID NO:7; and/or SEQ ID NO: 14. In particular examples, a heterologous PUFA synthase system comprises at least one protein having an amino acid sequence selected from the group consisting of SEQ ID NO:l; SEQ ID NO:4; SEQ IDNO:7; and SEQ ID NO: 14.
Some embodiments include a heterologous PUFA synthase system that comprises at least one functional equivalent of SEQ ID NO:l; SEQ ID NO:4; SEQ ID NO:7; and/or SEQ ID NO:14. For example, the system may comprise a variant, portion, fragment, or derivative of SEQ ID NO:l; SEQ ID NO:4; SEQ ID NO:7; and/or SEQ ID NO: 14, wherein such a polypeptide has PUFA synthase activity. For example, the sequences of other PUFA synthase polypeptides (and genes encoding the same) can be identified in the literature and in bioinformatics databases available in the art. Such sequences may be identified, for example, through BLAST searching of publicly available databases with known PUFA synthase gene or polypeptide sequences. In such a method, identities can be based on the ClustalW method of alignment using the default parameters of GAP PENALTY=10, GAP LENGTH PENALTY=0.1, and Gonnet 250 series of protein weight matrix.
Additionally, the PUFA synthase gene or polypeptide sequences disclosed herein can be used to identify other PUFA synthase homologs in nature. For example, each of the PUFA synthase nucleic acid fragments disclosed herein can be used to isolate genes encoding homologous proteins. Isolation of homologous genes using sequence-dependent protocols is well known in the art. Examples of sequence-dependent protocols include, for example and without limitation: methods of nucleic acid hybridization; methods of DNA and RNA amplification, as exemplified by various uses of nucleic acid amplification technologies (e.g., polymerase chain reaction (PCR), ligase chain reaction (LCR), and strand displacement amplification (SDA); and methods of library construction and screening by complementation.
In some embodiments, a heterologous PUFA synthase comprises a Schizochytrium PUFA synthase domain (e.g., ER domains, ACP domains, KS domains, AT domains, KR domains, DH domains, CLF domains, MAT domains, and dehydratase conserved active site motifs), wherein the domain is combined with one or more domains from a different PUFA synthase to form a complete PUFA synthase having PUFA synthase activity.
In some embodiments, a genetically modified organism comprising a heterologous PUFA synthase can be further modified with at least one domain or biologically active fragment thereof of another PUFA synthase. In particular embodiments, any of the domains of a PUFA synthase may be modified from their natural structure to modify or enhance the function of that domain in the PUFA synthase system (e.g., to modify the PUFA types or ratios thereof produced by the system).
Phosphopantetheinyl Transferases
The phosphopantetheinyl transferases (PPTases) are a family of enzymes involved in fatty acid synthesis, polyketide synthesis, and non-ribosomal peptide synthesis. In particular, the ACP domains present in PUFA synthase enzymes require activation by attachment of a cofactor (4-phosphopantetheine) from coenzyme A to the acyl carrier protein (ACP). Attachment of this cofactor is carried out by PPTases. If the endogenous PPTases of the host organism are incapable of activating the PUFA synthase ACP domains, then it is necessary to provide a PPTase that is capable of carrying out that function.
One example of a PPTase, which has been demonstrated to recognize ACP domains as substrates, is the Hetl protein of a No.stoc sp. Hetl is present in a cluster of genes in Nostoc known to be responsible for the synthesis of certain fatty acids in that organism. Black and Wolk (1994) J. Bacteriol. 176:2282-92; Campbell et al. (1997)
Arch. Microbiol. 167:251-8. Hetl is likely to activate the ACP domains of a protein, HglE, present in that cluster.
In embodiments, a PUFA synthase system Includes at least one PPTase or 4'-phosphopantetheinyl transferase domain as an accessory domain or protein to the PUFA synthase. Numerous examples of polypeptides having PPTase activity are known in the art and may be used in a genetically modified organism herein, if they are capable of activating the ACP domains of the particular PUFA synthase being used. Examples of polypeptides that may be included in such a heterologous PUFA synthase system include, for example and without limitation, at least one protein comprising an i amino acid sequence having at least 80% (e.g., at least 81%; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; and at least 99%) identity to the polypeptide encoded by SEQ ID NO: 10 (NoHetl protein). In particular examples, a heterologous PUFA synthase system includes the polypeptide encoded by SEQ ID NO: 10.
Some embodiments include a heterologous PUFA synthase system that comprises a functional equivalent of the polypeptide encoded by SEQ ID NO: 10. For example, the system may comprise a variant, portion, fragment, or derivative of the polypeptide encoded by SEQ ID NO: 10, wherein such a polypeptide has l phosphopantetheinyl transferase activity. For example, the sequences of other PPTases (and genes encoding the same) can be identified in the literature and in bioinformatics databases available in the art. Such sequences may be identified, for example, through BLAST searching of publicly available databases with known PPTase gene or polypeptide sequences. In such a method, identities can be based on the ClustalW i method of alignment using the default parameters of GAP PENALTY=10, GAP LENGTH PENALTY=0.1, and Gonnet 250 series of protein weight matrix. The PPTase sequences disclosed herein can be used to identify other PPTase homologs in nature. For example, PPTase nucleic acids herein (e g., SEQ ID NO: 10) can be used to isolate genes encoding homologous proteins. I According to the foregoing, in some embodiments a genetically modified organism (e.g., a plant) and/or a descendant, cell, tissue, or part thereof comprises a heterologous PUFA synthase (e.g., a PUFA synthase from the Thraustochytrid Schizochytrium alga) and a heterologous PPTase (e.g., a NoHetl PPTase).
Acyl-CoA Synthetases
Acyl-CoA synthetase (ACS, or alternatively, ACoAS) proteins catalyze the conversion of long chain PUFA free fatty acids (FFAs) to acyl-CoA. Numerous examples of polypeptides having ACoAS activity are known in the art and may be used in embodiments herein. For example, Schizochytrium sp. ATCC Accession No. 20888 possesses one or more ACoASs that are capable of converting the free fatty acid products of its PUFA synthase into acyl-CoA, including the polypeptide encoded by SEQ IDNO:l 1 (SzACS2 protein).
In some embodiments, a heterologous PUFA synthase system includes, for example and without limitation, at least one protein comprising an amino acid sequence having at least 80% (e.g., at least 81%; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; and at least 99%) identity to the polypeptide encoded by SEQ ID NO:l 1. In particular examples, a heterologous PUFA synthase system includes the polypeptide encoded by SEQ ID NO: 11.
Some embodiments include a heterologous PUFA synthase system that comprises a functional equivalent of the polypeptide encoded by SEQ ID NO:ll. For example, the system may comprise a variant, portion, fragment, or derivative of the polypeptide encoded by SEQ ID NO:ll, wherein such a polypeptide has acyl-CoA synthetase activity. For example, the sequences of other ACoAS (and genes encoding the same) can be identified in the literature and in bioinformatics databases available in the art. Such sequences may be identified, for example, through BLAST searching of publicly available databases with known ACoAS gene or polypeptide sequences. In such a method, identities can be based on the ClustalW method of alignment using the default parameters of GAP PENALTY=10, GAP LENGTH PENALTY=0.1, and Gonnet 250 series of protein weight matrix. The ACoAS sequences disclosed herein can be used to identify other ACoAS homologs in nature. For example, ACoAS nucleic acids herein (e.g., SEQ ID NO:ll) can be used to isolate genes encoding homologous proteins.
According to the foregoing, in some embodiments a genetically modified organism (e.g., a plant) and/or a descendant, cell, tissue, or part thereof comprises a heterologous PUFA synthase (e.g., a PUFA synthase from the Thraustochytrid Schizochytrium sp.); a heterologous PPTase (e.g., a NoHetl PPTase); and a heterologous ACoAS (e.g., a Schizochytrium ACoAS from ATCC Accession No. 20888).
Functional equivalents include but are not limited to additions or substitutions of amino acid residues within the reference amino acid sequence (i.e., SEQ ID NO:l; SEQ ID NO:4; SEQ ID NO:7; the polypeptide encoded by SEQ ID NO:10; the polypeptide encoded by SEQ ID NO:ll; or SEQ ID NO: 14), but which result in a silent change, thus producing a functionally equivalent gene product. For example, conservative amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved.
Site-directed mutations can be made to PUFA synthase, PPTase, and/or ACoAS (using random mutagenesis techniques well known to those skilled in the art) and the resulting mutant enzymes may be assayed to confirm expected activity. For example, SEQ ID NO:l; SEQ ID NO:4; SEQ ID NO:7; the polypeptide encoded by SEQ ID NO: 10; the polypeptide encoded by SEQ ID NO:l 1; or SEQ ID NO: 14 may be aligned with homologs and other related proteins, wherein identical amino acid residues and conserved residues are indicated. Conservative alterations at the variable positions can be engineered in order to produce a polypeptide that retains function; e.g., PUFA synthase activity, phosphopantetheinyl transferase activity, and acyl-CoA synthetase activity.
Embodiments herein effect expression of a heterologous PUFA synthase system, for example, by providing a transgenic organism (e.g., a plant) comprising one or more polynucleotides encoding at least one component of a heterologous PUFA synthase system.
In some embodiments, a heterologous polynucleotide encoding at least one component of a heterologous PUFA synthase system comprises at least one polynucleotide encoding a PUFA synthase from the Thraustochytrid Schizochytrium sp. For example, a heterologous polynucleotide of embodiments herein may encode, for example and without limitation, at least one protein comprising an amino acid sequence having at least 80% (e.g., at least 81%; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; at least 99%; and 100%) identity to SEQ ID NO:l; SEQ ID NO:4; SEQ ID NO:7; and/or SEQ ID NO:14.
In some examples, a polynucleotide encoding a PUFA synthase from the Thraustochytrid Schizochytrium sp. comprises a nucleotide sequence having at least 70% (e.g., at least 71%; at least 72%; at least 73%; at least 74%; at least 75%; at least 76%; at least 77%; at least 78%; at least 79%; at least 80%; at least 81%; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least i 95%; at least 96%; at least 97%; at least 98%; at least 99%; and 100%) identity to SEQ ID NO:2; SEQ ID NO:3; SEQ ID NO:5; SEQ ID NO:6; SEQ ID NO:8; SEQ ID NO:9; and/or SEQ ID NO: 13.
In particular examples, the heterologous polynucleotide encoding a PUFA synthase from the Thraustochytrid Schizochytrium sp. hybridizes under stringent conditions (e.g., very stringent conditions) to SEQ ID NO:2 and/or SEQ ID NO:3; SEQ ID NO:5 and/or SEQ ID NO:6; SEQ ID NO:8, SEQ ID NO:9, and/or SEQ ID NO: 13.
In some embodiments, a heterologous polynucleotide encoding at least one component of a heterologous PUFA synthase system comprises a polynucleotide i encoding a phosphopantetheinyl transferase (Hetl) from the cyanobacteria genus, Nostoc. For example, a heterologous polynucleotide of embodiments herein may encode, for example and without limitation, at least one protein comprising an amino acid sequence having at least 80% (e.g., at least 81%; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; at least 99%; and 100%) identity to the polypeptide encoded by SEQ ID NO: 10 (i.e., NoHetl).
In some examples, a polynucleotide encoding a phosphopantetheinyl transferase (Hetl) from the cyanobacteria genus, Nostoc comprises a nucleotide i sequence having at least 70% {e.g., at least 71%; at least 72%; at least 73%; at least 74%; at least 75%; at least 76%; at least 77%; at least 78%; at least 79%; at least 80%; at least 81%; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; at least 99%; and 100%) identity to SEQ ID NO: 10.
In particular examples, the polynucleotide encoding a Nostoc phosphopantetheinyl transferase (NoHetl) hybridizes under stringent conditions (e.g., very stringent conditions) to SEQ ID NO: 10.
In some embodiments, a heterologous polynucleotide encoding at least one component of a heterologous PUFA synthase system comprises a polynucleotide encoding an ACoAS from Schizochytrium. For example, a heterologous polynucleotide of embodiments herein may encode, for example and without i limitation, at least one protein comprising an amino acid sequence having at least 80% (e.g., at least 81 %; at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; at least 99%; and 100%) identity to the polypeptide encoded by SEQ ID NO: 11 (/. e., SzACS2).
In some examples, a polynucleotide encoding an heterologous ACoAS from Schizochytrium (e.g., ATCC Accession No. 20888) comprises a nucleotide sequence having at least 70% (e.g., at least 71%; at least 72%; at least 73%; at least 74%; at least 75%; at least 76%; at least 77%; at least 78%; at least 79%; at least 80%; at least 81%; ' at least 82%; at least 83%; at least 84%; at least 85%; at least 86%; at least 87%; at least 88%; at least 89%; at least 90%; at least 91%; at least 92%; at least 93%; at least 94%; at least 95%; at least 96%; at least 97%; at least 98%; at least 99%; and 100%) identity to SEQ ID NO:l 1.
In particular examples, the polynucleotide encoding the heterologous ACoAS hybridizes under stringent conditions (e.g., very stringent conditions) to SEQ ID NO:l 1.
In embodiments, one or more polynucleotides encoding at least one component of a heterologous PUFA synthase may include at least one polynucleotide encoding a PUFA synthase from the Thraustochytrid Schizochytrium sp., either with or without a i polynucleotide encoding a phosphopantetheinyl transferase (Hetl) from the cyanobacteria genus, Nostoc and/or a heterologous polynucleotide encoding an ACoAS from Schizochytrium. In some examples, the at least one polynucleotides encoding the foregoing components are present in a single nucleic acid molecule. In some examples, the at least one polynucleotides are present in multiple nucleic acid molecules.
Some embodiments include vectors (e.g., plasmids) comprising one or more polynucleotides encoding at least one component of a heterologous PUFA synthase. In examples, such vectors include regulatory sequences that are operably linked to the polynucleotides, so as to effect expression of tire polynucleotides in a target host organism. Particular examples of such vectors include recombinant expression vectors, such as pDAB101429 (SEQ ID NO: 15); pDAB101454 (SEQ ID NO: 16); pDAB101496 (SEQ ID NO: 17); pDAB109525 (SEQ ID NO:18); pDAB109584 (SEQ ID NO: 19); pDAB109588 (SEQ ID NO:20); pDABl 12210 (SEQ ID NO:21); pDABl 12206 (SEQ ID NO:22); pDAB107962 (SEQ ID NO:23); pDAB109591 (SEQ ID NO:24); pDAB109592 (SEQ ID NO:25); pDAB107960 (SEQ ID NO:26); pDABl 10132 (SEQ ID NO:27); pDAB107961 (SEQ ID NO:28); pDAB110151 (SEQ ID NO:29); pDABl 12285 (SEQ ID NO:30); pDABl 17501 (SEQ ID NO:31); pDABl 17502 (SEQ ID NO:32); pDABl 12200 (SEQ ID NO:33); pDABl 12201 (SEQ ID NO:34); pDABl 12203 (SEQ ID NO:35); pDABl 12205 (SEQ ID NO:36); pDABl 12208 (SEQ ID NO:37); and pDABl 12209 (SEQ ID NO:38).
Known techniques in recombinant DNA technology may be used in certain embodiments to improve control of expression of heterologous polynucleotides, for example and without limitation, by manipulating the number of copies of the polynucleotides within the host cell; by manipulating the efficiency with which those polynucleotides are transcribed; by manipulating the efficiency with which the resultant transcripts are translated; and by manipulating the efficiency of post-translational modifications. By way of further example, promoter sequences may be genetically engineered to improve the level of expression in the host, as compared to the reference promoter. Thus, techniques useful for controlling the expression of nucleic acid molecules include, for example and without limitation, integration of the nucleic acid molecules into one or more host cell chromosomes; addition of vector stability sequences to plasmids; substitutions or modifications of transcription control signals (e.g., promoters, operators, and enhancers); substitutions or modifications of translational control signals (e.g., ribosome binding sites and Shine-Dalgamo sequences); modification of nucleic acid molecules to correspond to the codon usage of the host cell; and deletion of sequences that destabilize transcripts. V. Methods of Making Genetically Modified Organisms
To produce significantly high yields of one or more desired polyunsaturated fatty acids, a host organism (e.g., a plant) can be genetically modified to introduce a heterologous PUFA synthase system into the organism. In some embodiments, this process is employed to produce a genetically modified plant that comprises the heterologous PUFA synthase system. Some examples also include methods to improve or enhance the effectiveness of such genetic modification, for example, to improve or enhance the production and/or accumulation of the end-product of a PUFA synthase system; e.g., LC-PUFAs, such as DHA and EPA. Particular embodiments herein result in the expression of one or more Schizochytrium PUFA synthases and PPTases, as described above, to increase PUFA production and/or accumulation in a heterologous host. Particular embodiments also result in the expression of an ACS in the host.
Methods for gene expression in a genetically modified organism, including, for example and without limitation, a plant, are known in the art. In some embodiments, the coding regions for the heterologous polynucleotides encoding a component of a PUFA synthase system to be expressed are codon-optimized for a target host cell. Expression of genes in recombinant host cells, including, for example and without limitation, plant cells, can require a promoter operably linked to a coding region of interest, and/or a transcriptional terminator. In some embodiments, a heterologous polynucleotides encoding a component of a PUFA synthase system is operably linked to a seed-specific promoter (e.g., PvDlec2, LfKCS3, FAE1, BoACP, and BnaNapinC). In some embodiments, heterologous polynucleotides encoding a component of a PUFA synthase system is operably linked to a leaf-specific promoter (e.g., ubiquitin and CsVMV). Other non-limiting examples of promoters that may be used in certain embodiments include the acyl carrier protein promoter (International Patent Publication No. WO 1992/18634) and the Phaseolus vulgaris beta-phaseolin promoter (and truncated versions). See, e.g., Slightom et al. (1983) Proc. Natl. Acad. Sci. U.S.A. 80:1897-1901; Sengupta-Gopalan etal. (1985) Proc. Nat. Acad. Sci. U.S.A. 82:3320-4; van der Geest et al. (1997) Plant Mol. Biol. 33:553-7; and Bustos et al. (1991) EMBO J. 10:1469-79.
Some embodiments include a recombinant vector (e.g., a plasmid) comprising one or more heterologous polynucleotides encoding a component of a PUFA synthase system. A recombinant vector is an engineered (e.g., artificially produced) nucleic acid molecule that is used as a tool for manipulating a nucleic acid sequence of choice, and/or for introducing such a nucleic acid sequence into a host cell. The recombinant vector may, therefore, be suitable for use in cloning, sequencing, and/or otherwise manipulating a polynucleotide therein, such as by expressing and/or delivering the polynucleotide into a host cell to form a recombinant cell. A vector may contain nucleotide sequences that are not naturally found adjacent to the polynucleotide to be 1 cloned or delivered. A vector may also contain regulatory nucleic acid sequences (e.g., promoters, untranslated regions) that are naturally found adjacent to the polynucleotide or that are useful for expression of the polynucleotide. An integrated polynucleotide may be under chromosomal promoter control, under native or plasmid promoter control, or under a combination of several promoter controls. A vector may be either RNA or DNA, and may be either prokaryotic or eukaryotic. A vector may be maintained as an extrachromosomal element (e.g., a plasmid) or it may be integrated into a chromosome of a recombinant organism (e.g., a microbe, and plant cell). The entire vector may remain in place within a host cell, or under certain conditions, extraneous DNA (e.g., unnecessary plasmid sequences) may be deleted, leaving behind i one or more heterologous polynucleotides encoding a component of a PUFA synthase system. Single or multiple copies of the heterologous polynucleotides may be integrated into the host genome. A recombinant vector of the present invention may contain at least one selectable marker.
In some embodiments, a recombinant vector comprising one or more ! heterologous polynucleotides encoding a component of a PUFA synthase system is an expression vector, for example, a plant expression vector. In such embodiments, at least one polynucleotide encoding the product to be produced (e.g., a Schizochytrium PUFA synthase, NoHetl, and SzACS2) may be inserted into the recombinant vector in a manner that operably links the polynucleotide(s) to regulatory sequences in the vector I that enable the transcription and translation of the nucleic acid sequence within the recombinant host cell. Vectors useful for the transformation of a variety of host organisms and cells are known in the art. Typically, a vector contains a selectable marker, and sequences allowing autonomous replication or chromosomal integration in the desired host
Suitable methods for transformation of host cells include any method by which DNA can be introduced into a cell, such as by transformation of protoplasts {See, e.g., U.S. Patent 5,508,184), by desiccation/inhibition-mediated DNA uptake {See, e.g., Potiykus et al. (1985) Mol. Gen. Genet. 199:183-8), by electroporation {See, e.g., U.S. Patent 5,384,253), by agitation with silicon carbide fibers {See, e.g., U.S. Patents 5,302,523 and 5,464,765), by Agrobacterium-mediated transformation {See, e.g., U.S. Patents 5,563,055; 5,591,616; 5,693,512; 5,824,877; 5,981,840; and 6,384,301), i and by acceleration of DNA-coated particles {See, e.g., U.S. Patents 5,015,580; 5,550,318; 5,538,880; 6,160,208; 6,399,861; and 6,403,865). Through the application of techniques such as these, the cells of virtually any species may be stably transformed, including both monocotyledonous and dicotyledonous plants. In some embodiments, transforming DNA is integrated into the genome of the host cell. In the case of multicellular species, transgenic cells may be regenerated into a transgenic organism. Any of these techniques may be used to produce a transgenic monocotyledonous or dicotyledonous plant, for example, comprising one or more heterologous polynucleotides encoding a component of a PUFA synthase system in the genome of the transgenic plant. i The most widely utilized method for introducing an expression vector into plants is based on the natural transformation system of Agrobacterium. A. tumefaciens and A. rhizogenes are plant pathogenic soil bacteria which genetically transform plant cells. The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectively, carry genes responsible for genetic transformation of the plant. The Ti : (tumor-inducing)-plasmids contain a large segment, known as T-DNA, which is transferred to transformed plants. Another segment of the Ti plasmid, the vir region, is responsible for T-DNA transfer. The T-DNA region is bordered by tenninal repeats. In modified binary vectors, the tumor-inducing genes have been deleted, and the functions of the vir region are utilized to transfer foreign DNA bordered by the T-DNA i border sequences. The T-region may also contain a selectable marker for efficient recovery of transgenic plants and cells, and a multiple cloning site for inserting sequences for transfer such as a dsRNA encoding nucleic acid.
Thus, in some embodiments, a plant transformation vector is derived from a Ti plasmid of A. tumefaciens (See, e.g., U.S. Patent Nos. 4,536,475, 4,693,977, 4,886,937, and 5,501,967; and European Patent EP 0 122 791) or a Ri plasmid of A. rhizogenes. Additional plant transformation vectors include, for example and without limitation, i those described by Herrera-Estrella et al. (1983) Nature 303:209-13; Bevan et al. (1983) Nature 304:184-7; Klee et al. (1985) Bio/Technol. 3:637-42; and in European Patent EP 0 120 516, and those derived from any of the foregoing. Other bacteria such as Sinorhizobium, Rhizobium, and Mesorhizobium that interact with plants naturally can be modified to mediate gene transfer to a number of diverse plants. These i plant-associated symbiotic bacteria can be made competent for gene transfer by acquisition of both a disarmed Ti plasmid and a suitable binary vector.
After providing exogenous DNA to recipient cells, transformed cells are generally identified for further culturing and plant regeneration. In order to improve the ability to identify transformed cells, one may desire to employ a selectable or screenable marker gene, as previously set forth, with the transformation vector used to generate the transformant. In the case where a selectable marker is used, transformed cells are identified within the potentially transformed cell population by exposing the cells to a selective agent or agents. In the case where a screenable marker is used, cells may be screened for the desired marker gene trait. 1 Cells that survive the exposure to the selective agent, or cells that have been scored positive in a screening assay, may be cultured in media that supports regeneration of plants. In some embodiments, any suitable plant tissue culture media {e.g., MS and N6 media) may be modified by including further substances, such as growth regulators. Tissue may be maintained on a basic media with growth regulators until sufficient tissue is available to begin plant regeneration efforts, or following repeated rounds of manual selection, until the morphology of the tissue is suitable for regeneration (e.g., at least 2 weeks), then transferred to media conducive to shoot fonnation. Cultures are transferred periodically until sufficient shoot formation has occurred. Once shoots are formed, they are transferred to media conducive to root formation. Once sufficient roots are formed, plants can be transferred to soil for further growth and maturity.
To confirm the presence of a nucleic acid molecule of interest (for example, a heterologous polynucleotide encoding a component of a PUFA synthase system) in the regenerating plants, a variety of assays may be performed. Such assays include, for example: molecular biological assays, such as Southern and Northern blotting, PCR, and nucleic acid sequencing; biochemical assays, such as detecting the presence of a protein product, eg., by immunological means (ELISA and/or Western blots) or by enzymatic function; plant part assays, such as leaf or root assays; and analysis of the phenotype of the whole regenerated plant.
Integration events may be analyzed, for example, by PCR amplification using, e.g., oligonucleotide primers specific for a nucleic acid molecule of interest. PCR genotyping is understood to include, but not be limited to, polymerase-chain reaction (PCR) amplification of genomic DNA derived from isolated host plant callus tissue predicted to contain a nucleic acid molecule of interest integrated into the genome, followed by standard cloning and sequence analysis of PCR amplification products. Methods of PCR genotyping have been well described (for example, Rios, G. et al. (2002) Plant J. 32:243-53) and may be applied to genomic DNA derived from any plant species (e.g., Z. mays or G. max) or tissue type, including cell cultures. A transgenic plant formed using Agrobacterium-dependent transformation methods typically contains a single recombinant DNA sequence inserted into one chromosome. The single recombinant DNA sequence is referred to as a “transgenic event” or “integration event.” Such transgenic plants are heterozygous for the inserted exogenous sequence. In some embodiments, a transgenic plant homozygous with respect to a transgene may be obtained by sexually mating (selfing) an independent segregant transgenic plant that contains a single exogenous gene sequence to itself, for example, an To plant, to produce Ti seed. One fourth of the T] seed produced will be homozygous with respect to the transgene. Germinating T] seed results in plants that can be tested for heterozygosity, typically using a SNP assay or a thermal amplification assay that allows for the distinction between heterozygotes and homozygotes (i.e., a zygosity assay).
In addition to direct transformation of a plant with a recombinant nucleic acid molecule, transgenic plants can be prepared by crossing a first plant having at least one ' transgenic event with a second plant lacking such an event. For example, a recombinant nucleic acid molecule comprising one or more heterologous polynucleotide(s) encoding a component of a PUFA synthase system may be introduced into a first plant line that is amenable to transformation to produce a transgenic plant, which transgenic plant may be crossed with a second plant line to introgress the polynucleotide(s) into the second plant line.
Some embodiments include the targeting of expression of the heterologous PUFA synthase system polypeptides to one or more organelles of the host. For example, in some embodiments, expression of the heterologous PUFA synthase system is targeted to the plastid of a plant. Varieties of plastid targeting sequences are known in the art and can be used in embodiments where the heterologous host is a plant or plant cell, and wherein targeting to the plastid is desired. In some embodiments, expression of the heterologous PUFA synthase system is targeted to the cytosol. In some embodiments, acyl-CoA synthetases (ACoAS) are expressed in the cytosol to convert LC-PUFA free fatty acids to acyl-CoAs, which in turn can be utilized by the acyltransferases. In some embodiments, expression of the heterologous PUFA synthase system is targeted to both the plastid and the cytosol of a plant.
Particular embodiments include the use of organelle targeting (e.g., to the plastid or chloroplast in plants) of at least one Schizochytrium PUFA synthase with a NoHetl PPTase. The targeting of gene products to the plastid or chloroplast is controlled by a signal sequence found at the amino terminal end of various proteins, which is cleaved during import yielding the mature protein. See, e.g., Comai et al. (1988) J. Biol. Chem. 263:15104-9. These signal sequences can be fused to heterologous gene products to effect the import of heterologous products into the chloroplast. van den Broeck et al. (1985) Nature 313:358-63. DNA encoding appropriate signal sequences can be isolated, for example, from cDNAs encoding the RUBISCO protein, the CAB protein, the EPSP synthase enzyme, the GS2 protein, and many other proteins that are known to be chloroplast localized.
An alternative means for localizing genes to chloroplasts or plastids utilized in particular embodiments include chloroplast or plastid transformation. Recombinant plants can be produced in which only the chloroplast DNA has been altered to incorporate heterologous PUFA synthase system polypeptides. Promoters which function in chloroplasts are known in the art. Hanley-Bowden et al. (1987) Trends in Biochem. Sci. 12:67-70. Methods and compositions for obtaining cells containing chloroplasts into which heterologous DNA has been inserted are described, for example, in U.S. Patent Nos. 5,693,507 and 5,451,513.
The foregoing genetic manipulations of a recombinant host can be performed using standard genetic techniques and screening, and can be made in any host cell that is suitable to genetic manipulation. In some embodiments, the recombinant host is a higher plant, including both dicotyledonous and monocotyledonous plants, and consumable plants, including crop plants and plants used for their oils. Thus, any plant species or plant cell can be selected as described further below. VI. Transgenic plants
Any plant or plant cell expressing a heterologous PUFA synthase system, for example, a functional heterologous protein system comprising a PUFA synthase and at least one accessory protein thereof is included in particular embodiments herein. Particular embodiments include a plant cell comprising a heterologous polynucleotide encoding a Schizochytrium PUFA synthase and a polynucleotide encoding a NoHetl PPTase, which plant cell may also comprise a polynucleotide encoding a Schizochytrium ACoAS. In some examples, such a transgenic plant has been further genetically modified to express a further polypeptide (e.g., ACoAS, GPAT, LPAAT, DAGAT, and acetyl CoA carboxylase (ACCase)) for the improvement of the production and/or accumulation of PUFAs or other bioactive products of the PUFA synthase by the host. i In some embodiments, a genetically modified plant (and/or plant cell thereof) is, for example and without limitation, selected from the group consisting of: higher plants; dicotyledonous plants; monocotyledonous plants; consumable plants (e.g., crop plants and plants used for their oils); soybean; rapeseed; linseed; com; safflowers; sunflowers; tobacco; a plant of the family Fabaceae (Leguminosae, legume family, pea family, bean family, or pulse family); a plant of the genus Glycine (e.g., G. albicans, G. aphyonota, G. arenari, G. argyrea, G. canescens, G. clandestine, G. curvata, G. cyrtoloba, G. falcate, G. gracei, G. hirticaulis, G. hirticaulis subsp. leptosa, G. lactovirens, G. latifolia, G. latrobeana, G. microphylla, G. montis-douglas, G. peratosa, G. pescadrensis, G. pindanica, G. pullenii, G. rubiginosa, G. stenophita, G. i syndetika, G. tabacina, G. tomentella, G. soja, and G. max (soybean)); peanut; Phaseolus vulgaris, Vicia faba\ and Pisum sativum.
In some embodiments, a genetically modified plant is a plant that is known to produce compounds used as pharmaceutical agents, flavoring agents, nutraceutical agents, functional food ingredients or cosmetically active agents, or a plant that is genetically engineered to produce these compounds/agents.
In some embodiments, the genetically modified plant is an oilseed plant, wherein the oilseeds, and/or the oil therefrom, contain LC-PUFAs produced by the heterologous PUFA synthase system. In particular embodiments, such oils contain a detectable amount of at least one target or primary LC-PUFA that is the product of the PUFA synthase (e.g., DHA and EPA). In some embodiments, such oils can be substantially free of intermediate or side products that are not the target or primary PUFA products, and that are not naturally produced by the endogenous FAS system in the wild-type plants (e.g., wild-type plants produce some shorter or medium chain PUFAs, such as 18 carbon PUFAs, via the FAS system, but there will be new, or additional, fatty acids produced in the plant as a result of genetic modification with the heterologous PUFA synthase).
In some embodiments, a transgenic plant or seed expressing a heterologous PUFA synthase system described herein also may comprise at least one other transgenic event in its genome, including without limitation: a gene encoding an insecticidal protein (e.g., an Bacillus thuringiensis insecticidal protein); an herbicide tolerance gene (e.g., a gene providing tolerance to glyphosate); and a gene contributing to a desirable phenotype in the transgenic plant, such as increased yield, altered fatty acid metabolism, or restoration of cytoplasmic male sterility. In particular embodiments, polynucleotides encoding at least one Schizochytrium PUFA synthase are combined with such additional transgenes, either by recombinant DNA techniques or conventional breeding with a plant already comprising the additional transgene(s).
Also included in some embodiments are parts of a plant expressing a heterologous PUFA synthase system as described herein. Such plant parts include any parts of a plant, including, for example and without limitation, seeds (including mature seeds and immature seeds); tissues; pollen; embryos; flowers; fruits; shoots; leaves; roots; stems; and explants. Particular embodiments include descendants of a plant expressing a heterologous PUFA synthase system as described herein. VII. Commodity products
Embodiments herein include products produced by or from the plants, descendants, plant parts, or cells described herein, including, for example and without limitation, oils produced therefrom. Thus, some embodiments include commodity products containing one or more of the polypeptides and/or polynucleotides of the heterologous PUFA synthase system described herein, which commodity products are produced from a recombinant plant or seed expressing the heterologous PUFA synthase system. A commodity product containing one or more of the polypeptides and/or polynucleotides of the heterologous PUFA synthase system described herein includes, for example and without limitation: meals; oils; crushed or whole grains or seeds; and any food product comprising meal; oil; or crushed or whole grain of a recombinant plant or seed containing one or more of the polypeptides and/or polynucleotides. The detection of the polypeptides and/or polynucleotides of the heterologous PUFA synthase systems described herein in one or more plant commodity or plant commodity products contemplated herein is de facto evidence that the commodity or commodity product is composed of a transgenic plant expressing the heterologous PUFA synthase system. For example, the detection of polypeptides and/or polynucleotides of the heterologous PUFA synthase systems described herein as a contaminant in an oil is de facto evidence that the oil was produced from a transgenic plant expressing the heterologous PUFA synthase system.
Embodiments herein make possible for the production of commercially valuable lipids enriched in one or more desired (target or primary) PUFAs by the development of genetically modified plants through the use of a heterologous PUFA synthase system that produces PUFAs that are otherwise not produced by the plant species. In some embodiments, a genetically modified organism of the invention produces one or more polyunsaturated fatty acids including, but not limited to, EPA (C20:5, n-3), DHA (C22:6, n-3), DPA (022:5, n-6 or n-3), and any combination thereof. Some embodiments herein specifically include oilseed and oils obtained from the genetically modified plants described herein comprising these PUFAs.
Plants are not known to endogenously contain a PUFA synthase and, therefore, embodiments herein provide an opportunity to produce plants with unique fatty acid production capabilities. Some embodiments offer the ability to create any one of a number of “designer oils,” comprising novel combinations of fatty acids from plants in various ratios and forms. In some embodiments, the use of a heterologous PUFA synthase system described herein can extend the range of PUFA production, and successfully produce such PUFAs within temperature ranges used to grow most crop plants.
In some embodiments, a plant commodity product is “substantially free” of intermediate or side products of the system for synthesizing PUFAs. As used herein in this context, the term “substantially free” means that any intermediate or side product fatty acids (non-target PUFAs) that are produced in the genetically modified plant (and/or parts of plants and/or seed oil fraction) as a result of the introduction or presence of the heterologous PUFA system (e.g., that are not produced by the wild-type plant or the parent plant used as a recipient for the indicated genetic i modification), are present in a quantity that is, for example and without limitation: less than 10% by weight of total fatty acids; less than 9% by weight of total fatty acids; less than 8% by weight of total fatty acids; less than 7% by weight of total fatty acids; less than 6% by weight of total fatty acids; less than 5% by weight of total fatty acids; less than 4% by weight of total fatty acids; less than 3% by weight of total fatty acids; less than 2% by weight of total fatty acids; less than 1 % by weight of total fatty acids; and less than 0.5% by weight of total fatty acids.
In some embodiments, a genetically modified plant, descendant, cell, tissue, or part thereof that expresses a heterologous PUFA synthase system, or an oil or seed obtained from the genetically modified plant, descendant, cell, tissue, or part thereof, I comprises a detectable amount of DHA (docosahexaenoic acid (C22:6, n-3)), DPA(n-6) (docosapentaenoic acid (C22:5 n-6)), and/or EPA (eicosapentaenoic acid (C20:5, n-3)).
In particular embodiments, a genetically modified plant, descendant, cell, tissue, or part thereof that expresses a heterologous PUFA synthase system, or an oil or : seed obtained from the genetically modified plant, descendant, cell, tissue, or part thereof, comprises, for example and without limitation; at least 0.01%, at least 0.02%, at least 0.03%, at least 0.04%, at least 0.05%, at least 0.06%, at least 0.07%, at least 0.08%, at least 0.09%, at least 0.1%, at least 0.2%, at least 0.3%, at least 0.4%, at least 0.5%, at least 0.6%, at least 0.7%, at least 0.8%, at least 0.9%, at least 1%, at least i 1.5%, at least 2%, at least 2.5%, at least 3%, at least 3.5%, at least 4%, at least 4.5%, at least 5%, at least 5.5%, at least 6%, at least 6.5%, at least 7%, at least 7.5%, at least 8%, at least 8.5%, at least 9%, at least 9.5%, at least 10%, at least 10.5%, at least 11%, at least 11.5%, at least 12%, at least 12.5%, at least 13%, at least 13.5%, at least 14%, at least 14.5% or at least 15% DHA by weight of total fatty acids. Useful ranges can be selected between any of these values, for example, 0.01% to 15%, 0.05% to 10% and 1% to 5% DHA by weight of total fatty acids.
In particular embodiments, a genetically modified plant, descendant, cell, ! tissue, or part thereof that expresses a heterologous PUPA synthase system, or an oil or seed obtained from the genetically modified plant, descendant, cell, tissue, or part thereof, comprises, for example and without limitation; at least 0.01%, at least 0.02%, at least 0.03%, at least 0.04%, at least 0.05%, at least 0.06%, at least 0.07%, at least 0.08%, at least 0.09%, at least 0.1%, at least 0.2%, at least 0.3%, at least 0.4%, at least I 0.5%, at least 0.6%, at least 0.7%, at least 0.8%, at least 0.9%, at least 1%, at least 1.5%, at least 2%, at least 2.5%, at least 3%, at least 3.5%, at least 4%, at least 4.5%, at least 5%, at least 5.5%, at least 6%, at least 6.5%, at least 7%, at least 7.5%, at least 8%, at least 8.5%, at least 9%, at least 9.5%, and/or at least 10% EPA by weight of total fatty acids. Useful ranges can be selected between any of these values, for : example, 0.01% to 10%, 0.05% to 5% and 0.1% to 5% EPA by weight of total fatty acids.
In particular embodiments, a genetically modified plant, descendant, cell, tissue, or part thereof that expresses a heterologous PUFA synthase system, or an oil or seed obtained from the genetically modified plant, descendant, cell, tissue, or part I thereof, comprises, for example and without limitation; at least 0.01%, at least 0.02%, at least 0.03%, at least 0.04%, at least 0.05%, at least 0.06%, at least 0.07%, at least 0.08%, at least 0.09%, at least 0.1%, at least 0.2%, at least 0.3%, at least 0.4%, at least 0.5%, at least 0.6%, at least 0.7%, at least 0.8%, at least 0.9%, at least 1%, at least 1.5%, at least 2%, at least 2.5%, at least 3%, at least 3.5%, at least 4%, at least 4.5%, at ' least 5%, at least 5.5%, at least 6%, at least 6.5%, at least 7%, at least 7.5%, at least 8%, at least 8.5%, at least 9%, at least 9.5%, and/or at least 10% DPA(n-6) by weight of total fatty acids. Useful ranges can be selected between any of these values, for example, 0.01% to 10%, 0.01% to 5%, 0.01% to 1%, 0.01% to 0.05%, 0.05% to 5% and 0.1% to 5% DPA(n-6) by weight of total fatty acids, i The percentage amount of PUFA is the percentage by weight of total fatty acids extracted, unless otherwise stated. In some embodiments, total fatty acids are determined by gas chromatography (GC) analysis of a fatty acid methyl ester (FAME) preparation, although determination of total fatty acids is not limited to this method.
In particular embodiments, a genetically modified plant, descendant, cell, tissue, or part thereof that expresses a heterologous PUFA synthase system, or an oil or seed obtained from the genetically modified plant, descendant, cell, tissue, or part thereof, comprises a ratio of EPA:DHA of, for example and without limitation; at least 10:1, at least 9.5:1, at least 9:1, at least 8.5:1, at least 8:1, at least 7.5:1, at least 7:1, at least 6.5:1, at least 6:1, at least 5.5:1, at least 5:1, at least 4.5:1, at least 4:1, at least 3.5:1, at least 3:1, at least 2.5:1, at least 2:1, at least 1.5:1, at least 1:1, at least 1:1.5, at least 1:2, at least 1:2.5, at least 1:3, at least 1:3.5, at least 1:4, at least 1:4.5, at least 1:5, at least 1:5.5, at least 1:6, at least 1:6.5, at least 1:7, at least 1:7.5, at least 1:8, at least 1:8.5, at least 1:9, at least 1:10, at least 1:11, at least 1:12, at least 1:13, at least 1:14, at least 1:15, at least 1:16, at least 1:17, at least 1:18, at least 1:19, at least 1:20, at least 1:21, at least 1:22, at least 1:23, at least 1:24, at least 1:25, at least 1:26, at least 1:27, at least 1:28, at least 1:29, or at least 1:30 by weight of total fatty acids. Useful ranges can be selected between any of these values, for example, a ratio of EPA:DHA of 10:1, 5:1 to 1:1,2:1 to 1:1, 1 to 1:30, 1:1 to 1:25, 1:1 to 1:20, 1:1 to 1:15, 1:1 to 1:10, 1:1 to 1:5,1:1 to 1:3, and 1:1 to 1:2 by weight of total fatty acids.
In particular embodiments, a genetically modified plant, descendant, cell, tissue, or part thereof that expresses a heterologous PUFA synthase system, or an oil or seed obtained from the genetically modified plant, descendant, cell, tissue, or part thereof, comprises a ratio of DPA(n-6):DHA of, for example and without limitation; at least 1:1, at least 1:1.5, at least 1:2, at least 1:2.5, at least 1:3, at least 1:3.5, at least 1:4, at least 1:4.5, at least 1:5, at least 1:5.5, at least 1:6, at least 1:6.5, at least 1:7, at least 1:7.5, at least 1:8, at least 1:8.5, at least 1:9, or at least 1:10 by weight of total fatty acids. Useful ranges can be selected between any of these values, for example, a ratio of DPA(n-6):DHA of 1:1 to 1:10, 1:1 to 1:5, 1:1 to 1:3 and 1:1 to 1:2 by weight of total fatty acids.
In particular embodiments, a genetically modified plant, descendant, cell, tissue, or part thereof that expresses a heterologous PUFA synthase system, or an oil or seed obtained from the genetically modified plant, descendant, cell, tissue, or part 1 thereof, comprises, for example and without limitation, at least 70%, at least 75%, at least 80%o, at least 85%, at least 90%>, at least 95%, or at least 99%> triglycerides by weight of the oil. In some embodiments, an oil obtained from a genetically modified plant, descendant, cell, tissue, or part thereof or seed of the invention comprises from 70% to 99% triglycerides by weight of the oil, from 75% to 99% triglycerides by weight of the oil, from 80% to 99% triglycerides by weight of the oil, from 85% to 99% triglycerides by weight of the oil, or from 90% to 99% triglycerides by weight of the oil. Such triglycerides may incorporate LC-PUFAs that are produced by the heterologous PUFA synthase system.
In particular embodiments, when the target product of a heterologous PUFA synthase system is a LC- PUFA, such as DFIA, DPA (n-6 or n-3), or EPA, intermediate products and side products that are not present in substantial amounts in the total lipids of the genetically modified plant expressing the system include, for example and without limitation: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid (ETA; 20:3, n-9), and various other intermediate or side products, such as 20:0; 20:1 (Δ5); 20:1 (Δ11); 20:2 (Δ8,11); 20:2 (Δ11,14); 20:3 (Δ5,11,14); 20:3 (Δ11,14,17); mead acid (20:3; Δ5,8,11); or 20:4 (Δ5,1,14,17).
In some embodiments, PUFAs produced by a heterologous PUFA synthase system are recovered from a genetically modified plant expressing the system through purification processes that extract the compounds from the plant, descendant, cell, tissue, or part thereof. In some embodiments, the PUFAs are recovered by harvesting the plant, descendant, cell, tissue, or part thereof. In some embodiments, the PUFAs are recovered by harvesting the oil from the plant, descendant, cell, tissue, or part thereof (e.g., from the oil seeds). In some embodiments, the plant, descendant, cell, tissue, or part thereof is consumed in its natural state, or is further processed into consumable products.
In some embodiments herein, the oils from a genetically modified plant expressing a heterologous PUFA synthase system may be used in non-culinary or non-dietary processes and compositions. Some of these uses are industrial, cosmetic, or medical (e.g., the oils can be used in a protective barrier against infection and to enhance transplant graft survival (U.S. Patent 6,210,700)). Oils from a genetically modified plant expressing a heterologous PUFA synthase system can also be used in any application for which the oils of the present invention are suited. In general, the oils can be used to replace, e.g., mineral oils, esters, fatty acids, or animal fats in a variety of applications, such as lubricants, lubricant additives, metal working fluids, hydraulic fluids and fire resistant hydraulic fluids. The oils can also be used as materials in a process of producing modified oils. Examples of techniques for modifying oils include fractionation, hydrogenation, alteration of the oil's oleic acid or linolenic acid content, and other modification techniques known to those of skill in the art.
Examples of cosmetic uses for oils from a genetically modified plant expressing a heterologous PUFA synthase system include use as an emollient in a cosmetic composition; as a petroleum jelly replacement; as comprising part of a soap, as a material in a process for producing soap; as comprising part of an oral treatment 1 solution; as comprising part of an ageing treatment composition; and as comprising part of a skin or hair aerosol foam preparation.
The following examples are provided to illustrate certain particular features and/or embodiments. The examples should not be construed to limit the disclosure to the particular features or embodiments exemplified.
EXAMPLES
Example 1: Materials and Methods
Unless otherwise indicated, molecular biological and biochemical manipulations described in the subsequent Examples were performed by standard methodologies as disclosed in, for example, Ausubel et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons; Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press; and the like.
Plant-optimized polynucleotides.
Multiple DNA sequences having a canola codon bias were designed and synthesized to produce PUFA synthase enzymes in transgenic plants. A codon usage table for canola (Brassica napus L.) was calculated from protein coding sequences obtained from sequences deposited in GenBank (available on the internet at ncbi.nlm.nih.gov). A Rescaled canola codon set was calculated after omitting any synonymous codon used less than about 10% of total codon uses for that amino acid. The Rescaled representation for each codon was calculated using the formula:
Rescaled % of Cl = 1/(%C1 + %C2 + %C3...) x %C1 x 100, where Cl is the codon in question and %C1, %C2, and %C3... represent the original % usage values of the remaining synonymous codons.
To derive a canola-codon-optimized DNA sequence encoding the amino acid PFA1 protein of SEQ ID NO:l, PFA2 protein of SEQ ID NO:4, and PFA3 protein of SEQ ID NO:7, codon substitutions to the experimentally determined (native) PFA1 DNA sequence (SEQ ID NO:2), PFA2 DNA sequence (SEQ ID NO:5), and PFA3 DNA sequence (SEQ ID NO: 8) were made such that the resulting DNA sequences had the overall codon composition of the canola-optimized codon bias table.
Further refinements of the sequences were made to eliminate undesirable restriction enzyme recognition sites, potential plant intron splice sites, long runs of A/T or C/G residues, and other motifs that might interfere with mRNA stability, transcription, or translation of the coding region in plant cells. Other changes were made to introduce desired restriction enzyme recognition sites, and to eliminate long internal Open Reading Frames (frames other than +1). These changes were all made within the constraints of retaining the canola-biased rescaled codon composition. Polynucleotides comprising the resulting nucleotide canola codon-optimized DNA sequences were synthesized. Table 1.
Table 1. Codon-optimized sequences
Lipid Extraction and Analysis.
Segregating seeds were analyzed via a FAME analysis method to identify transgenic plant events (obtained from soybean, canola, and Arabidopsis) which comprised LC-PUFAs as compared to control plants grown in the same conditions. The LC-PUFA content (% FAMEs by weight) were quantitated and compared to a negative control plant. The FAME analysis was completed on single seeds or performed on bulked seeds from each individual transgenic event that were assayed using the protocol described below.
Arabidopsis and canola seed analysis. Transgenic seed samples (either canola single seeds or bulked Arabidopsis seed samples) were homogenized in heptane containing triheptadecanoin (Nu-Chek™ Prep, Elysian, MN) as a triacylglycerol internal standard, using a steel ball mill. Prior to homogenization, a solution of 0.25 M freshly-prepared sodium methoxide (Sigma-Aldrich, St. Louis, MO) in methanol was added. Extraction and derivation were conducted at 40°C with constant shaking. FAME extraction was repeated three times and the heptane layers were pooled prior to analysis. Bulked Arabidopsis and canola seed analysis consisted of a 10 mg aliquot for Arabidopsis or 8-12 canola seeds, respectively. In order to drive the derivatization reaction to completeness, the oil from bulk seed canola and soybean single seed was first extracted three times with heptane. Then, an aliquot of the combined oil extract was derivatized in FAMEs. The completeness of the reaction was verified by checking for the presence of endogenous FAMEs in a fourth extraction/derivatization. The resulting FAMEs were analyzed by GC-FID using an Agilent 6890 Gas Chromatograph (Agilent Technologies, Santa Clara, CA) and a capillary column BPX 70™ (15 m x 0.25 mm x 0.25 pm) from SGE (Austin, TX). Each FAME was identified by retention time and quantified by the injection of a rapeseed oil reference mix from Matreya LLC (Pleasant Gap, PA) as a calibration standard with addition of appropriate long chain fatty acids (Nu-Chek Prep, Elysian MN). The results of the production of DHA and other LC-PUFAs in Arabidopsis, soybean, and canola are described below.
Example 2: Expression of PUFA Synthase Genes in Plants
Binary vectors were constructed that contained plant transcription units (PTUs) comprising the native and codon-optimized PUFA synthase system transgenes (PFA1, PFA2, and PFA3) operably linked to a promoter and 3’-UTR. The resulting binary vectors also contained a PTU comprising the Hetl transgene operably linked to a promoter and 3’ UTR. Only one binary vector included the SzACS2 transgene operably linked to a promoter and 3’ UTR (pDAB101429). Different promoter and 3’-UTR sequence combinations were incorporated in the binary vectors to drive expression of the PUFA synthase system and Hell transgenes. Use of these different regulatory gene elements was incorporated into the design of the PTUs to alter and vary the expression levels of the transgene. Likewise, the PTUs were positioned within the binary vector in different orientations to test whether the orientation of the PTUs altered or varied the expression levels of the transgenes.
Three different orientations of PTUs were tested. The binary vectors comprising PTUs arranged in the first orientation were situated in a head to tail i configuration.
The binary vectors comprising PTUs arranged in the second orientation were constructed to utilize the bi-directionality of the 3’ UTRs. The PFA1 and Hell PTUs shared a single 3’ UTR, and were oriented in the following configuration; promoter :: gene of interest:: 3’UTR :: gene of interest:: promoter. Likewise, the PFA3 and PFA2 I PTUs shared a single 3’ UTR, and were oriented in the following configuration; promoter:: gene of interest:: 3’UTR:: gene of interest:: promoter.
Finally, the third orientation incorporated a random DNA spacer sequence (SEQ ID NO: 12). The random DNA spacer is positioned between two PTUs oriented upstream of the random DNA spacer, and two PTUs oriented downstream of the 1 random DNA spacer. Both sets of PTUs are constructed in a head to head orientation. Accordingly, the orientation is as follows; <— PFA1 PTU :: Hetl PTU —> :: random DNA spacer:: <— PFA3 PTU :: PFA 2 PTU —».
FIRST ORIENTATION ) pDABl01429. The pDAB101429 plasmid (SEQ ID NO: 15) contains three PUFA synthase PTUs, one acyl-CoA synthetase PTU, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains a truncated Phaseolus vulgaris phytohemagglutinin-L gene promoter (PvDlec2 promoter v2; GenBank Accession Number X06336), i Arabidopsis thaliana AT2S3 gene 5' untranslated region (2S 5' UTR; GenBank Accession Number NM_118850), Schizochytrium sp. PolyUnsaturated Fatty Acid synthase PFA1 v2 and Arabidopsis thaliana 2S albumin gene 3' untranslated region terminator vl (At2S SSP terminator vl; GenBank Accession Number M22035). The second PUFA synthase PTU contains the PvDlec2 promoter v2, 2S 5' UTR, ) Schizochytrium sp. PolyUnsaturated Fatty Acid synthase PFA2 v2, and At2S SSP terminator vl. The third PUFA synthase PTU contains the PvDlec2 promoter v2, 2S 5' UTR, Schizochytrium sp. PolyUnsaturated Fatty Acid synthase PFA3 v2 and At2S SSP terminator vl. The acyl-CoA synthetase PTU contains the PvDlec2 promoter v2, 2S 5' UTR, Schizochytrium sp. acyl-CoA synthetase (SzACS-2 v3) and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains the PvDlec2 promoter v2, 2S 5' UTR, Nostoc sp. 4' phosphopantetheinyl transferase Hetl (NoHetl v3) and At2S SSP terminator vl. These five PTUs were placed in a head-to-tail orientation within the T-strand DNA border regions of a plant transformation binary vector (pDAB7333). The order of the genes is: PFA1 v2, PFA2 v2, PFA3 v2, SzACS-2 v3, NoHetl v3. The plant transformation binary vector also contains the phosphinothricin acetyl transferase PTU: Cassava vein Mosaic Virus Promoter (CsVMV promoter v2; Verdaguer et al., Plant Molecular Biology 31:1129-1139; 1996), phosphinothricin acetyl transferase (PAT v5; Wohlleben et al., Gene 70:25-37; 1988) and Agrobacterium tumefaciens ORF1 3’ untranslated region (AtuORFl 3' UTR v4; Huang et al., J. Bacteriol. 172:1814-1822; 1990), in addition to other regulatory elements such as Overdrive (Toro et al., PNAS 85(22): 8558-8562; 1988) and T-stand border sequences (T-DNA Border A and T-DNA Border B; Gardner et al., Science 231:725-727; 1986 and International Publication No. WO 2001/025459 Al). Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB101454. The pDAB101454 plasmid (SEQ ID NO:16) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains the PvDlec2 promoter v2; the 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The second PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA2 v2; and At2S SSP terminator vl. The third PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v2; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; NoHetl v3; and At2S SSP terminator vl. These four PTUs described above were placed in a head-to-tail orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, PFA2 v2, PFA3 v2, NoHetl v3. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB101496. The pDAB101496 plasmid (SEQ ID NO: 17) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The second PUFA synthase PTU contains the PvPhas promoter v4; PvPhas 5’ UTR; Schizochytrium sp. PFA2 v2; and Agrobacterium tumefaciens Ti plasmid pTil5955 open reading frame 23/24 3' untranslated region (AtuORF23 3' UTR vl GenBank Accession Number AF242881.1). The third PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3 (SEQ ID NO: 13, which encodes the polypeptide of SEQ ID NO: 14); and At2S SSP terminator vl. Tie phosphopantetheinyl transferase PTU contains the PvPhas promoter v5; PvPhas 5’ UTR; NoHetl v3; and AtuORF23 3' UTR vl. These four PTUs were placed in a head-to-tail orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, PFA2 v2, PFA3 v3, NoHetl v3. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB109525. The pDAB 109525 plasmid (SEQ ID NO: 18) contains three i PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 vl; and At2S SSP terminator vl. The second PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA2 vl; and At2S SSP terminator vl. The third ' PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; NoHetl v3; and At2S SSP terminator vl. These four PTUs were placed in a head-to-tail orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order I of the genes is: PFA1 vl, PFA2 vl, PFA3 v3, NoHetl v3. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids containing the four PTUs were then isolated and tested for incorporation of the four PTUs with restriction enzyme digestion and DNA sequencing. pDAB109584. The pDAB109584 plasmid (SEQ ID NO:19) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains a Brassica napus napin gene promoter (BnaNapinC Promoter vl; GenBank Accession Number M64633.1), Brassica napus napin gene 5' untranslated region (BnaNapinC 5' UTR vl; GenBank Accession Number M64633.1), Schizochytrium sp. PFA1 v2; and Brassica napus napin gene 5' untranslated region (BnaNapinC 3' UTR vl; GenBank Accession Number M64633.1). The second PUFA synthase PTU contains a truncated Phaseolus vulgaris beta-phaseolin promoter (PvPhas promoter v4; GenBank Accession Number JO 1263.1); Phaseolus vulgaris beta-phaseolin 5' untranslated region (PvPhas 5’ UTR; GenBank Accession Number J01263.1); Schizochytrium sp. PFA2 v2, Phaseolus vulgaris beta-phaseolin 3’ untranslated region (PvPhas 3' UTR vl; GenBank Accession Number J01263.1); and Phaseolus vulgaris beta-phaseolin 3’ MAR (PvPhas 3' MAR v2; GenBank Accession Number J01263.1). The third PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains a Brassica oleracea acyl carrier protein gene promoter (BoACP promoter vl; International Publ. No. WO 1992/18634); Brassica oleracea acyl carrier protein gene 5' untranslated region (BoACP 5' UTR v2; International Publ. No. WO 1992/18634); NoHetl v3; and Brassica napus acyl carrier protein gene 3’ untranslated region (BnACP05 3' UTR vl; GenBank Accession Number X64114.1). These four PTUs were placed in a head-to-tail orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, PFA2 v2, PFA3 v3, NoHetl v3. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl09588. The pDABl09588 plasmid (SEQ ID NO:20) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains a Phaseolus vulgaris beta-phaseolin promoter (PvPhas promoter v3; GenBank Accession Number J01263.1); PvPhas 5’ UTR; Schizochytrium sp. PFA1 v2; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The second PUFA synthase PTU contains
BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; Schizochytrium sp. PFA2 v2; and BnaNapinC 3' UTR vl. The third PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v2; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains BoACP promoter vl; Brassica oleracea acyl carrier protein gene 5' untranslated region (BoACP 5' UTR vl; International Publ. No. WO 1992/18634); NoHetl v3; and AtuORF23 3' UTR vl. These four PTUs were placed in a head-to-tail orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, PFA2 v2, PFA3 v2, NoHetl v3. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB112210. The pDABl 12210 plasmid (SEQ ID NO:21) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 vl; and At2S SSP terminator vl. The second PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA2 vl; and At2S SSP terminator vl. The third PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 vl; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; NoHetl v3; and At2S SSP terminator vl. These four PTUs were placed in a head-to-tail orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 vl, PFA2 vl, PFA3 vl, NoHetl vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the four PTUs with restriction enzyme digestion and DNA sequencing. pDABl 12206. The pDABl 12206 plasmid (SEQ ID NO:22) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The second PUFA synthase PTU contains PvPhas promoter v4; PvPhas 5’ UTR; Schizochytrium sp. PFA2 v2; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The third PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains a truncated Phaseolus vulgaris beta-phaseolin promoter (PvPhas promoter v6; GenBank Accession Number JO1263.1); PvPhas 5’ UTR; ! NoHetl v3; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. These four PTUs described above were placed in a head-to-tail orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, PFA2 v2, PFA3 v3, NoHetl v3. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant l plasmids were isolated and tested for incorporation of the four PTUs with restriction enzyme digestion and DNA sequencing.
SECOND ORIENTATION pDAB107962. The pDAB107962 plasmid (SEQ ID NO:23) contains three > PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains PvPhas promoter v6; PvPhas 5’ UTR; NoHetl v3; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and I AtuORF23 3' UTR vl. The second PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v2; and At2S SSP terminator vl. The third PUFA synthase PTU contains the PvPhas promoter v6; PvPhas 5’ UTR; Schizochytrium sp. PFA2 vl; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and AtuORF23 3' UTR vl. PFA1 v2 and NoHetl v3 were placed in a tail-to-tail orientation, and an ^ AtuORF23 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 v2 were placed in a head-to-head orientation; PFA3 v2 and PFA2 vl were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 vl. pDAB7333 also I contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB109591. The pDAB 109591 plasmid (SEQ ID NO:24) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains PvPhas promoter v6; PvPhas promoter PvPhas 5’ UTR; NoHetl v3; PvPhas 3’ UTR vl; PvPhas 3' MAR v2; and AtuORF23 3' UTR vl. The second PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3; and At2S SSP terminator vl. The third PUFA synthase PTU contains the PvPhas promoter v6; 1 PvPhas 5’ UTR; Schizochytrium sp. PFA2 v2; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and AtuORF23 3' UTR vl. PFA1 v2 and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 v3 were placed in a head-to-head orientation; PFA3 v3 and PFA2 v2 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v3, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB109592. The pDAB 109592 plasmid (SEQ ID NO:25) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; Schizochytrium sp. PFA1 v2; and BnaNapinC 3' UTR vl. The phosphopantetheinyl transferase PTU contains BoACP promoter vl; Brassica oleracea acyl carrier protein gene 5' untranslated region (BoACP 5' UTR v2; International Publ. No. WO 1992/18634); NoHetl v3; BnACP05 3' UTR vl; and AtuORF23 3’ UTR vl. The second PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3; and At2S SSP terminator vl. The third PUFA synthase PTU contains PvPhas promoter v6; Phaseolus vulgaris beta-phaseolin 5' untranslated region (PvPhas promoter PvPhas 5’ UTR; GenBank Accession Number J01263.1); Schizochytrium sp. PFA2 v2; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and AtuORF23 3' UTR vl. PFA1 v2 and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 v3 were placed in a head-to-head orientation; PFA3 v3 and PFA2 v2 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v3, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids containing the four PTUs were then isolated and tested for incorporation of the four PTUs with restriction enzyme digestion and DNA sequencing. pDAB107960. The pDAB107960 plasmid (SEQ ID NO:26) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acelyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 vl; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains PvPhas promoter v6; PvPhas promoter PvPhas 5’ UTR; NoHetl v3; PvPhas 3' UTR vl; PvPhas 3' MAR v2; AtuORF23 3' UTR vl. The second PUFA synthase PTU contains the PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3; and At2S SSP terminator vl. The third PUFA synthase PTU contains the PvPhas promoter v6; PvPhas 5’ UTR; Schizochytrium sp. PFA2 vl; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and AtuORF23 3' UTR vl. PFA1 vl and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR is placed between the two PTUs; NoHetl v3 and PFA3 v3 were placed in a head-to-head orientation; PFA3 v3 and PFA2 vl were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR is placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 vl, NoHetl v3, PFA3 v3, PFA2 vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl 10132. The pDABl 10132 plasmid (SEQ ID NO:27) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains a Phaseolus vulgaris beta-phaseolin promoter (PvPhas promoter v3; GenBank Accession Number J01263.1); Phaseolus vulgaris beta-phaseolin 5' untranslated region (PvPhas promoter PvPhas 5’ UTR; GenBank Accession Number JO 1263.1);
Schizochytrium sp. PFA1 v2; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The phosphopantetheinyl transferase PTU contains BoACP promoter vl; BoACP 5' UTR v2; NoHetl v3; BnACP05 3' UTR vl; and AtuORF23 3' UTR vl. The second PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v2; and At2S SSP terminator vl. The third PUFA synthase PTU contains BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; Schizochytrium sp. PFA2 vl; BnaNapinC 3' UTR vl; and AtuORF23 3' UTR vl. PFA1 v2 and NoFIetl v3 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 v2 were placed in a head-to-head orientation; PFA3 v2 and PFA2 i vl were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB107961. The pDAB107961 plasmid (SEQ ID NO:28) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU 1 contains PvPhas promoter v3; PvPhas promoter PvPhas 5’ UTR; Schizochytrium sp. PFA1 vl; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The phosphopantetheinyl transferase PTU contains BoACP promoter vl; BoACP 5' UTR v2; NoHetl v3; BnACP05 3' UTR vl; and AtuORF23 3' UTR vl. The second PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v3; and At2S SSP terminator vl. The third PUFA synthase PTU contains BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; Schizochytrium sp. PFA2 vl; BnaNapinC 3' UTR vl; and AtuORF23 3' UTR vl. PFA1 vl and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 v3 were placed in a head-to-head orientation; PFA3 v3 and PFA2 vl were i placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 vl, NoHetl v3, PFA3 v3, PFA2 vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABllOlSl. The pDABl 10151 plasmid (SEQ ID NO:29) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; Schizochytrium sp. PFA1 v2; and BnaNapinC 3' UTR vl. The phosphopantetheinyl transferase PTU contains PvPhas promoter v6; PvPhas promoter PvPhas 5’ UTR; NoHetl v3; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and AtuORF23 3' UTR vl. The second PUFA synthase PTU contains the BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; Schizochytrium sp. PFA3 v2; and At2S SSP terminator vl. The third PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA2 vl; and At2S SSP terminator vl. PFA1 v2 and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 v2 were placed in a head-to-head orientation; PFA3 v2 and PFA2 vl were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl 12285. The pDABl 12285 plasmid (SEQ ID NO:30) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains PvPhas promoter v6; PvPhas promoter PvPhas 5’ UTR; NoHetl v3; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and AtuORF25/26 3' UTR v3. The second PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v2; and At2S SSP terminator vl. The third PUFA synthase PTU contains PvPhas promoter v6; PvPhas 5’ UTR; Schizochytrium sp. PFA2 vl; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and AtuORF23 3' UTR vl. PFA1 v2 and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF25/26 3 ’UTR was placed between the two PTUs; NoHetl v3 and PFA3 v2 were placed in a head-to-head orientation; PFA3 v2 and PFA2 vl were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were then isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl 17501. The pDABl 17501 plasmid (SEQ ID NO:31) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 vl; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains PvPhas promoter v6; PvPhas promoter PvPhas 5’ UTR; NoHetl v3; PvPhas 3' UTR vl; PvPhas 3' MAR v2; GenBank Accession Number J01263.1; and AtuORF25/26 3’ UTR v3. The second PUFA synthase PTU contains a Glycine max beta-conglycinin alpha prime subunit gene promoter and 5' untranslated region (SSPR02745.1; GenBank Accession Number GU723691.1); Schizochytrium sp. PFA3 vl; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The third PUFA synthase PTU contains BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; Schizochytrium sp. PFA2 vl; BnaNapinC 3' UTR vl. PFA1 vl and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF25/26 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 vl were placed in a head-to-head orientation; PFA3 vl and PFA2 vl were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 vl, NoHetl v3, PFA3 vl, PFA2 vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl 17502. The pDABl 17502 plasmid (SEQ ID NO:32) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvPhas promoter v3; PvPhas promoter PvPhas 5’ UTR; Schizochytrium sp. PFA1 vl; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The phosphopantetheinyl transferase PTU contains BnaNapinC Promoter vl; BnaNapinC 5' UTR vl; NoHetl v3; BnaNapinC 3' UTR vl; and AtuORF25/26 3' UTR vl. The second PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 vl; and At2S SSP terminator vl. The third PUFA synthase PTU contains a SSPR02745.1; Schizochytrium sp. PFA2 vl; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. PFA1 vl and NoHetl v3 were placed in a tail-to-tail orientation, and an AtuORF25/26 3’UTR was placed between the two PTUs; NoHetl v3 and PFA3 vl were placed in a head-to-head orientation; PFA3 vl and PFA2 vl were placed in a tail-to-tail orientation, and an AtuORF23 3’UTR was placed between the two PTUs within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 vl, NoHetl v3, PFA3 vl, PFA2 vl. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing.
THIRD ORIENTATION pDABl 12200. The pDABl 12200 plasmid (SEQ ID NO:33) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains PvPhas promoter v4; PvPhas 5’ UTR; NoHetl v3; AtuORF23 3' UTR vl; and a random DNA spacer. The second PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA3 v2; and At2S SSP terminator vl. The third PUFA synthase PTU contains PvPhas promoter v5; PvPhas 5’ UTR; Schizochytrium sp. PFA2 v2; and AtuORF23 3' UTR vl. PFA1 v2 and NoHetl v3 were placed in a head-to-head orientation; NoHetl v3 and PFA3 v2 were placed in a tail-to-tail orientation with a random DNA spacer placed between the two PTUs; PFA3 v2 and PFA2 v2 were placed in a head-to-head orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl 12201. The pDABl 12201 plasmid (SEQ ID NO:34) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains a Lesquerella fendleri 3-ketoacyl-CoA synthase gene promoter and 5' untranslated region (LfKCS3 promoter v2; GenBank Accession Number AF367052.1); Schizochytrium sp. PFA1 v2; and 3-ketoacyl-CoA synthase gene 3' untranslated region (SSTER2742.1; GenBank Accession Number AF367052.1). The phosphopantetheinyl transferase PTU contains BoACP promoter vl; BoACP 5' UTR v2; NoHetl v3; BnACP05 3' UTR vl; and a random DNA spacer. The second PUFA synthase PTU contains the LfKCS3 promoter v2; Schizochytrium sp. PFA3 v2; and SSTER2742.1. The third PUFA synthase PTU contains BoACP promoter vl; BoACP 5' UTR v2; Schizochytrium sp. PFA2 v2; and BnACP05 3' UTR vl. PFA1 v2 and NoHetl v3 were placed in a head-to-head orientation; NoHetl v3 and PFA3 v2 were placed in a tail-to-tail orientation with a random DNA spacer placed between the two PTUs; PFA3 v2 and PFA2 v2 were placed in a head-to-head orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDAB112203. The pDABl 12203 plasmid (SEQ ID NO:35) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains SSPR02745.1; Schizochytrium sp. PFA1 v2; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The phosphopantetheinyl transferase PTU contains a Glycine max kunitz trypsin inhibitor 3 gene promoter and 5’ untranslated region (SSPR02743.1; GenBank Accession Number AF233296.1); NoHetl v3; Glycine max kunitz trypsin inhibitor 3 gene 3' untranslated region (SSTER2744.1; GenBank Accession Number AF233296.1); and a random DNA spacer. The second PUFA synthase PTU contains SSPR02745.1; Schizochytrium sp. PFA3 v2; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The third PUFA synthase PTU contains SSPR02743.1; Schizochytrium sp. PFA2 v2; and SSTER2744.1. PFA1 v2 and NoHetl v3 were placed in a head-to-head orientation; NoHetl v3 and PFA3 v2 were placed in a tail-to-tail orientation with a random DNA spacer placed between the two PTUs; PFA3 v2 and PFA2 v2 were placed in a head-to-head orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl12205. The pDABl 12205 plasmid (SEQ ID NO:36) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA1 v2; and At2S SSP terminator vl. The phosphopantetheinyl transferase PTU contains BoACP promoter vl; BoACP 5' UTR v2; NoHetl v3; BnACP05 3' UTR vl; and a random DNA spacer. The second PUFA synthase PTU contains SSPR02743.1; Schizochytrium sp. PFA3 v2; and SSTER2744.1. The third PUFA synthase PTU contains SSPR02745.1; Schizochytrium sp. PFA2 v2; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. PFA1 v2 and NoHetl v3 were placed in a head-to-head orientation; NoHetl v3 and PFA3 v2 were placed in a tail-to-tail orientation with a random DNA spacer placed between the two PTUs; PFA3 v2 and PFA2 v2 were placed in a head-to-head orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were then isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl 12208. The pDABl 12208 plasmid (SEQ ID NO:37) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains SSPR02743.1; Schizochytrium sp. PFA1 v2; and SSTER2744.1. The phosphopantetheinyl transferase PTU contains SSPR02745.1; NoHetl v3; PvPhas 3' UTR vl; PvPhas 3' MAR v2; and a random DNA spacer. The second PUFA synthase PTU contains SSPR02743.1; Schizochytrium sp. PFA3 v2; and SSTER2744.1. The third PUFA synthase PTU contains SSPR02745.1; Schizochytrium sp. PFA2 v2; and PvPhas 3' MAR v2. PFA1 v2 and NoHetl v3 were placed in a head-to-head orientation; NoHetl v3 and PFA3 v2 were placed in a tail-to-tail orientation with a random DNA spacer placed between the two PTUs; PFA3 v2 and PFA2 v2 were placed in a head-to-head orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing. pDABl 12209. The pDABl 12209 plasmid (SEQ ID NO:38) contains three PUFA synthase PTUs, one phosphopantetheinyl transferase PTU, and a phosphinothricin acetyl transferase PTU. Specifically, the first PUFA synthase PTU contains SSPR02743.1; Schizochytrium sp. PFA1 v2; and SSTER2744.1. The phosphopantetheinyl transferase PTU contains BoACP promoter vl; BoACP 5' UTR v2; NoHetl v3; BnACP05 3' UTR vl; and random DNA spacer. The second PUFA synthase PTU contains SSPR02745.1; Schizochytrium sp. PFA3 v2; PvPhas 3' UTR vl; and PvPhas 3' MAR v2. The third PUFA synthase PTU contains PvDlec2 promoter v2; 2S 5' UTR; Schizochytrium sp. PFA2 v2; and At2S SSP terminator vl. PFA1 v2 and NoHetl v3 were placed in a head-to-head orientation; NoHetl v3 and PFA3 v2 were placed in a tail-to-tail orientation with a random DNA spacer placed between the two PTUs; PFA3 v2 and PFA2 v2 were placed in a head-to-head orientation within the T-strand DNA border regions of the plant transformation binary vector, pDAB7333. The order of the genes is: PFA1 v2, NoHetl v3, PFA3 v2, PFA2 v2. pDAB7333 also contains the phosphinothricin acetyl transferase PTU, as previously described. Recombinant plasmids were isolated and tested for incorporation of the PTUs with restriction enzyme digestion and DNA sequencing.
Agrobacterium tumefaciens Transformation.
Selected binary constructs were transformed into Agrobaclerium strains for plant transformations. The strains that were selected were for transformation were derivatives of A. tumefaciens strain EHA 105. The following two A. tumefaciens strains, AGL1 and DA2552 {See International Patent Publication No. WO2012016222) were transformed with the binary constructs, and confirmed via restriction enzyme digestion and sequencing.
Arabidopsis To events transformed with Agrobacterium tumefaciens harboring binary plasmids encoding PUFA synthase genes and Iletl (and in some cases SzACS2) under the control of plant expression elements as described above were generated using the floral dip method essentially as described in Clough and Bent (1998) Plant J. 16(6):735-43. The Arabidopsis To events were obtained and selected to grow to maturity and self-fertilized. The resulting Ti seed was harvested and sown. Transformed T] plants were selected by spraying with phosphinothricin to select for those plants containing a functional pat gene as a selectable marker. Leaf tissue from the surviving Ti plants was sampled and analyzed by quantitative PCR reactions specific for the pat gene to identify those plants containing a single copy of the selectable marker (and associated transgenes). These plants were grown to maturity, and the T2 seed was harvested and analyzed for LC-PUFA content (as % of total extractable FAMEs).
Canola Transformation.
Seed germination. Wild-type canola seeds (variety DH12075; ncbi.nlm.nih.gov/pmc/articles/PMC1456120/) were surface-sterilized in 10% Clorox for 10 minutes and rinsed three times with sterile distilled water (seeds are contained in steel strainers during this process). Seeds were planted for germination on % MS Canola medium (1/2X MS, 2% sucrose, 0.8% agar) contained in phytatrays, 25 seeds per phytatray and placed in a Percival Growth Chamber™ with growth regime set at 25°C, a photoperiod of 16 hours light, 8 hours dark; and germinated for 5 days.
Pre-treatment. On day 5, hypocotyl segments of about 3 mm in length were aseptically excised, discarding the root and shoot sections (drying of hypocotyls is prevented by placing hypocotyls segments into 10 ml of sterile milliQ® water during excision process). Hypocotyl segments were placed horizontally on sterile filter paper on callus induction medium MSK1D1 (IX MS, 1 mg/L kinetin, 1 mg/L 2,4-D, 3% sucrose, 0.7% PHYTAGAR®) for 3 days pre-treatment in a Percival Growth Chamber™ with a growth regime set at 22-23°C, and a photoperiod of 16 hours light, 8 hours dark.
Co-cultivation with Agrobacterium. The day before Agrobacterium treatment, flasks of YEP medium containing the appropriate antibiotics were inoculated. Hypocotyl segments were transferred from filter paper to empty 100 x 25 mm petri dishes containing 10 mL liquid M medium to prevent the hypocotyl segments from drying. A spatula was used at this stage to scoop the segments and transfer. The liquid M medium was removed with a pipette, and 40 mL Agrobacterium suspension was added to the petri dish (500 segments with 40 mL Agrobacterium solution). The segments were treated for 30 minutes with periodic swirling of the petri dish, so that the hypocotyls could stay immersed in the Agrobacterium solution. i At the end of the treatment period, the Agrobacterium solution was pipetted into a waste beaker, autoclaved and discarded (the Agrobacterium solution was completely removed to prevent Agrobacterium overgrowth). The treated hypocotyls were transferred with forceps back to the original plates containing MSK1D1 with filter paper (care was taken to ensure that the segments did not dry). The hypocotyl segments along with control segments were returned to the Percival Growth Chamber™ under reduced light intensity (by covering the plates with aluminum foil), and the treated hypocotyls were co-cultivated with Agrobacterium for 3 days.
Callus induction on selection medium. After 3 days of co-cultivation, the hypocotyl segments were transferred individually with forceps onto callus induction medium MSK1D1H1 (IX MS, 1 mg/L Kinetin, 1 mg/L 2,4-D, 0.5 gm/L MES, 5 mg/L AgNo3, 300 mg/L TIMENTIN®, 200 mg/L Carbenicillin™, 1 mg/L Herbiace™, 3% sucrose, 0.7% PHYTAGAR®). The hypocotyl segments were anchored on the medium, but were not embedded in the medium.
Selection and shoot regeneration. After 7 days on callus induction medium, the callusing hypocotyl segments were transferred to Shoot Regeneration Medium 1 with selection MSB3Z1II1 (IX MS, 3 mg/L BAP, 1 mg/L zeatin, 0.5 gm/L MES, 5 mg/L AgNC>3, 300 mg/L TIMENTIN®, 200 mg/L Carbenicillin™, 1 mg/L Herbiace™, 3% sucrose, 0.7% PHYTAGAR®). After 14 days, the hypocotyls with shoots were transferred to Regeneration Medium 2 with increased selection MSB3Z1H3 (IX MS, 3 mg/L BAP, 1 mg/L zeatin, 0.5 gm/L MES, 5 mg/L AgN03, 300 mg/L TIMENTIN®, 200 mg/L Carbenicillin™, 3 mg/L Herbiace™, 3% sucrose, 0.7% PHYTAGAR®).
Shoot elongation. After 14 days, the segments with shoots were transferred to shoot elongation medium MSMESH5 (IX MS, 300 mg/L ΉΜΕΝΊΠΝ®, 5 mg/L Herbiace™, 2% sucrose, 0.7% TC Agar™). Shoots that were already elongated were isolated and transferred to MSMESH5. After 14 days, the remaining shoots which did elongate in the first round were placed on MSMESH5, and transferred to fresh selection medium of the same composition. At this stage, all remaining hypocotyl segments were discarded. Shoots that elongate on MSB3Z1H3 medium after 2 weeks were isolated and transferred to MSMESH5 medium. Remaining shoots that did elongate in the first round on MSMESH5 were isolated and transferred to fresh selection medium of the same composition. At this stage, all remaining hypocotyl segments were discarded.
Root induction. After 14 days, the shoots were transferred to MSMEST medium (IX MS, 0.5 g/L MES, 300 mg/L ΤΙΜΕΝΊΊΝ®, 2% sucrose, 0.7% TC Agar™) for root induction. The shoots that did not root in the first transfer on MSMEST medium were transferred for a second or third cycle on MSMEST medium until rooted plants were obtained. PCR analysis. Samples for PCR were isolated after the shoots were cultured on MSMESH5 medium for at least 14 days. Leaf tissue from the green shoots was tested by PCR for the presence of the pat selectable marker gene. All chlorotic shoots were discarded and not subjected to the PCR assay. Samples that were positive for the PCR reaction were kept and the shoots were left on the MSMEST medium to elongate and develop roots. The shoots that were negative according to the PCR assay were discarded. Plants that rooted on MSMESH5 or MSMEST and were PCR-positive were sent for transplanting into soil. After hardening, the To canola plants were further analyzed for events which contain all of the transgene PTU cassettes, and these plants were transferred to the greenhouse, grown to maturity, and the Tj seed was harvested for fatty acid composition analysis.
Soybean Transformation. ' Cotyledonary node soybean. Agrobacterium-modi&amp;iei transformation of soybean (Glycine max c.v., Maverick) was performed using an Agrobacterium-strain harboring a binary vector via a modified procedure of Zeng et al. (2004) Plant Cell Rep. 22(7): 478-82. The protocol was modified to include the herbicide glufosinate as a selective agent. In addition, another modification included the germination of sterilized soybean seeds on B5 basal medium (Gamborg etal. (1968) Exp Cell Res. 50(1):151-8) solidified with 3 g/L Phytagel™ (Sigma-Aldrich, St. Louis, Mo.). The final modification to the protocol deployed the use of cotyledonaiy node explants that were prepared from 5-6 days old seedlings and infected with Agrobacterium as described by Zhang et al. (1999) Plant Cell Tiss. Org. 56: 37-46. As described in Zeng et al. (2004), co-cultivation was carried out for 5 days on the co-cultivation medium. Shoot initiation, shoot elongation, and rooting media were supplemented with 50 mg/L Cefotaxime™, 50 mg/L TIMENTIN®, and 50 mg/L Vancomycin™, and solidified with 3 g/L Phytagel™.
Split seed soybean transformation method. Agrobacterium-mediated transformation of soybean (Glycine max c.v., Maverick) was performed using an Agrobacterium-strain harboring a binary vector via a modification of the procedure of Paz et al. (2005) Plant Cell Rep. 25:206-13. Briefly, soybean seeds were cut in half by a longitudinal cut along the hilum to separate the seed and remove the seed coat. The embryonic axis was excised and any axial shoots/buds were removed from the cotyledonary node. The resulting half seed explants were infected with Agrobacterium. Shoot initiation, shoot elongation, and rooting media were supplemented with 50 mg/L Cefotaxime™, 50 mg/L TIMENTIN®, and 50 mg/L Vancomycin™, and solidified with 3 g/L Phytagel™. Glufosinate selection was employed to inhibit the growth of non-transformed shoots.
Split seed with partial embryo axis soybean transformation method. Agrobacterium-mediated transformation of soybean (Glycine max c.v., Maverick) was performed using an Agrobacterium-strain harboring a binary vector via the split-seed explant with partial embryo axis soybean transformation protocol described in U.S. Provisional Application No. 61/739,349. After transformation, the soybean tissues were cultured using the tissue culture methods described in U.S. Provisional Application No. 61/739,349. Glufosinate selection was employed to inhibit the growth of non-transformed shoots. Selected shoots were transferred to rooting medium for root development and then transferred to soil mix for acclimatization of plantlets.
Terminal leaflets of selected plantlets were treated topically (leaf paint technique) with glufosinate to screen for putative transformants. The screened plantlets were transferred to the greenhouse, allowed to acclimate, and then leaf-painted with glufosinate to reconfirm tolerance. These putative transformed To plants were sampled, and molecular analyses was used to confirm the presence of transgenes within the PTUs. Identified To plants were allowed to self-fertilize in the greenhouse to produce Ti seed for fatty acid composition analysis.
Lipid Analysis of Mature Ti seed from Transgenic Soybean Events.
To plants from 3 constructs, pDAB101454, pDAB101496 and pDAB107960, were grown to maturity in the greenhouse. Plants were selected that contained copies of PAT v5 and the accompanying four genes for DHA production. These plants were self-fertilized and the resulting Ti seed harvested at maturity. Single seeds were analyzed via FAMEs GC-FID to determine the LC-PUFAs and DHA content in the Ti soybean seed. Twelve whole mature seeds per plant were individually analyzed by crushing the seed with a press and homogenization using a steel ball and ball mill (Spex SamplePrep, LLC). The tissue was defatted three times with hexane, the pooled hexane fractions were evaporated to dryness and the residue weighed and reconstituted in heptane for FAME analysis. A known amount of oil residue was transmethylated with 0.25 M of freshly prepared sodium methoxide (Sigma-Aldrich, St. Louis, MO) in methanol in the presence of the surrogate, triheptadecanoin (Nu-Chek Prep, Elysian, MN). The reaction was conducted under mild heat (40°C) and constant shaking and the resulting FAMEs extracted with heptane. Completion of the reaction was verified by recovery of the reacted heptadecanoate methyl-ester surrogate. The FAMEs extracts were analyzed by GC-FID using an Agilent 6890 Gas Chromatograph (Agilent Technologies, Santa Clara, CA) and a 15m x 0.25mm x 0.25pm BPX 70 capillary column from SGE (Austin, TX). Each FAME peak was identified by its retention time and quantified by the injection of a rapeseed oil reference mix from Matreya LLC (Pleasant Gap, PA). The calibration standard contained individually added standards of DHA (C22:6), EPA (C20:5), DPA(n-6) (C22:5), γ-linolenoate (Cl8:3) and arachidonate methyl esters from Nu-Chek. Data analysis was performed using ChemStation4 software (Agilent). DHA comprised 60% of the total LC-PUFA content in those T2 seeds that contained LC-PUFA. Only the two novel LC-PUFAs, DHA and DPA (n-6), were detected in the T2 soybean seeds. The fatty acids which are expected to be found in soybean seeds were detected at normal levels, except that total C18 fatty acids were proportionally lower due to the presence of LC-PUFAs. Two other fatty acids (γ-linolenic and arachidonic acids) were detected in these transgenic soybean seeds at a low level (less than 1% combined).
Molecular Confirmation of Plant Transformants.
Copy number analysis and detection of coding region. To plants were identified and selected from the transformations described above. These transformants were further analyzed to identify plants which contained each of the transgene PTU expression cassettes. Hydrolysis probe assays, analogous to TAQMAN®, were performed to initially screen for and confirm the presence of the PFA1, PFA2, PFA3, Hetl, SzACS2, and pat transgenes and the absence of the VirD2 Agrobacterium gene. Assays were deployed as previously described in International Patent Publication Nos. WO2013016546 and WO2011146524. The data generated from these quantitative PCR studies was used to determine the transgene presence and copy number. Events containing all of the PTUs were selected for advancement to Ti plants.
Detection of PUFA synthase proteins in canola and Arabidopsis seed. Quantitative Western blot methods were developed to detect PUFA synthase polypeptides from seed samples of canola and Arabidopsis. Antigens for full length PFA1 and PFA3 were recombinantly expressed with an N-terminal HIS tag and partially purified via Cobalt affinity chromatography. Antigens for the full length PFA2 did not contain a HIS tag, and were isolated from inclusion bodies. An N-terminal PFA2 fragment, and a PFA3 fragment that overlapped with a predicted ER domain, were also recombinantly expressed as antigens. All of these fragments were submitted as gel slices for polyclonal antibody production in rabbits. Antigens for full-length Hetl were recombinantly expressed with an N-terminal 6X His tag in BL21(DE3) Escherichia coli cells (Invitrogen; Carlsbad, CA), and were highly purified via Cobalt affinity chromatography. Antigens were submitted as TBS buffered soluble proteins at a concentration of about 2 mg/mL for polyclonal production in rabbits. All antisera were purified by Protein G antibody affinity chromatography.
Recombinant reference standards for PFA1, PFA2, PFA3, and Hetl were heterologously expressed and produced in Arctic Express(DE3) RIL (Invitrogen; Carlsbad, CA), and purified via His-ComAC purification. Protein concentrations were determined by densitometry using a BSA standard curve for in-gel quantitation with denaturing SDS-PAGE and Coomassie blue staining.
Seed samples were prepared for analysis by cracking dry Arabidopsis seed with two stainless steel beads in a Kleco Bead Beater™ (Garcia Machine, Visalia, CA), or by processing the delipidated cake that resulted from a bulk canola seed FAME analysis, as described above. Extraction buffer was added (50 mM Tris, 10 mM EDTA, 2% SDS) to the seed samples, and the tubes containing the samples and extraction buffer were rocked gently for 15-30 minutes. The samples were centrifuged for 30 minutes at 3,000 x g. The supernatant was collected and used for analysis.
The amount of total soluble protein in the seed extract was determined using a Pierce 660 nm Protein Assay™ (Thermo Scientific, Rockford, 1L). Samples were normalized to 1.55 mg/mL total soluble protein, and prepared in LDS sample buffer (Invitrogen, Carlsbad, CA) with 40 mM DTT for a normalized load of 20 pg total soluble protein per lane. Samples were electrophoresed in 3-8% Tris acetate gels (Invitrogen, Carlsbad, CA), and transferred to nitrocellulose membranes. Blots were blocked in blocking buffer and probed with antibodies against the different PUFA synthase polypeptides (PFA1, PFA2, and PFA3). An anti-rabbit fluorescent labeled secondary antibody, Goat Anti-Rabbit AF 633™ (Invitrogen, Carlsbad, CA) was used for detection. Blots were visualized on a Typhoon Trio Plus Fluorescent Imager™ (GE Healthcare, New Brunswick NJ). The resulting SDS-PAGE Western blots of extracts from mature seed for Arabidopsis T2 seed, soybean Ti seed, and canola Tj seed produced bands at the appropriate size when probed with PFA1, PFA2, PFA3, and Hetl specific antisera.
The SDS-PAGE Western blot detection of PUFA synthase polypeptides PFA1, PFA2, PFA3, and Hetl in Ti soybean seed extracts for event 101454[267]-26702.001 produced bands of the expected molecular weight.
Also, the SDS-PAGE Western blot detection of PUFA synthase polypeptides PFA1, PFA2, PFA3, and Hetl in T] canola seed extracts for the following events: 6580[2]-016.Sx001; 6580[2]-017.Sx001; 6580[2]-017.Sx002; 6580[2]-018.Sx001; 6580[2]-019.Sx001; 6580[2]-020.Sx001; 6580[2]-021.Sx001; 6580[2]-021.Sx002; 6580[2]-024.Sx001; 6580[2]-039.Sx001; and 6580[2]-039.Sx002, produced bands of the expected molecular weight.
Next, the PUFA synthase specific proteins were quantified via SDS-PAGE Western blots with 5 point standard curves (100 ng, 50 ng, 25 ng, 12.5 ng and 6.25 ng) for each polypeptide. Table 2 and Table 3 summarize results for Arabidopsis and canola, respectively.
Table 2. Summary of PUFA Synthase Polypeptide content in bulk T2
Arabidopsis seed.
Table 3. Summary of PUFA Synthase Polypeptide content in bulk Ti seed of canola events.
Example 3: LC-PUFA Production in Arabidopsis Seeds Transformed with PUFA Synthase. A summary of DHA and other LC-PUFA oil content obtained from the seed of transgenic Arabidopsis Tj events generated with constructs encoding PUFA synthase genes is shown in Table 4. DHA and EPA HPLC retention times were identified using authentic 14C-labeled standards. FIG. 2. Quantitation of PUFA’s was performed using a l-14C-labeled DHA standard curve. The LC-PUFA content of each event analyzed is shown in FIG. 3. These data indicate that the type of construct PTU configuration and the usage of specific combinations of regulatory elements for expressing the PUFA synthase and Hetl genes can be used to change the number of transgenic events obtained that produce LC-PUFA in T2 Arabidopsis seed.
Table 4. Summary of LC-PUFA content in T2 seeds of Arabidopsis events transformed with the PUFA synthase and Hell transgenes and containing a single copy of the pat transgene.
1 Number of events with LC-PUFA content >1 % of total seed FAMEs with percentage of total events in parentheses 2 Average total LC-PUFA content (DHA (n-3) + EPA(n-3) + DPA (n-6)) as % of total seed FAMEs 3 Maximum total LC-PUFA content of all T2 seed samples analyzed as % of total FAMEs 4 Average n-3 LC-PUFA (DHA + EPA)/Total LC-PUFA content across all LC-PUFA-producing events
For example, only 22% of all single copy events for pDAB101454 produced DHA, whereas 65% of pDAB101496 events and 79% of pDAB112206 events produced DHA. As compared to pDABl01454, the pDAB101496 binary vector contains diversified regulatory elements. Likewise, as compared to pDAB101454, the pDABl 12206 construct contains both diversified regulatory elements and an altered PTU configuration. Another construct, pDABl09584, contains additional regulatory element diversification, and also contains the native coding sequence version of the PFA3 gene, rather than a “plant-optimized” version. In this case, 82% of all single copy events produced LC-PUFAs.
Further modifications of regulatoiy elements, construct configuration, and use of native gene sequences in the second orientation PTU configuration resulted in constructs that produced transgenic Arahidopsis plants with 61-80% of all single copy events producing LC-PUFAs. The second orientation PTU configuration constructs also produced transgenic plants with a higher proportion of events (41-62%) with LC-PUFA contents of >1% in T2 Arabidopsis seed. The presence of native (vs. plant-optimized) PUFA synthase gene sequences also improved the proportion of >1% LC-PUFA events. For example, pDAB109525 and pDABl 12210 containing native gene sequences had 60% and 54%, respectively, of single copy events making >1% LC-PUFA. In contrast, pDAB101454 that comprises all plant-optimized genes (in the same format and using the same regulatory elements) had only 5% of events making >1% LC-PUFA.
The maximum LC-PUFA content of T2 seed from events using the various constructs ranged from 0.71-2.14%. The maximum DHA content ranged from 0.39-1.59%. The highest levels of DHA were obtained with constructs in the second orientation PTU configuration, for example, pDAB109591, pDAB107962 and pDABl07960. These constructs contained two different promoter/terminator combinations to drive the four transgenes, and either one or three native PUFA synthase genes.
The maximum EPA content ranged from 0-1.17% across all the constructs and events generated. Constructs pDABl 12203 (third orientation PTU configuration), pDABl 12200 (third orientation PTU configuration), pDABl 12201 (third orientation PTU configuration) and pDAB101496 (first orientation PTU configuration) were effective at making relatively high levels of EPA compared to other constructs. The LC-PUFA-producing events from these constructs contained a relatively high proportion of EPA (39-56% of LC-PUFAs) in comparison to DHA (37-45% of LC-PUFAs), whereas other constructs typically contained a lower proportion of EPA (9-27% of LC-PUFAs), and a higher proportion of DHA (60-76% of LC-PUFAs).
These data show that the selection of construct configuration and gene regulatory elements results in improved efficiency for producing both co-3 LC-PUFAs, DHA and EPA, in Arabidopsis seeds, and that the selection of construct configuration and gene regulatory elements can be used to increase the efficiency for producing both co-3 LC-PUFAs, DHA and EPA, in crop plants.
Arabidopsis T3 seed.
The T2 seed from high LC-PUFA producing Arabidopsis events were planted and leaf tissue from the T2 plants sampled via quantitative PCR reactions for the pat gene and other transgenes. Plants containing two copies of the transgenes (i.e., homozygotes) were identified and grown to maturity. The resulting T3 seed was harvested and analyzed for LC-PUFA content. Constructs pDAB 101454 and pDAB101429 that contained repeated promoter/terminator expression elements and used all “plant-optimized” PUFA synthase gene sequences exhibited very poor stability of the LC-PUFA trait in the subsequent T3 seed generation, with little or no LC-PUFAs detected in the T3 seed progeny. Other events transformed with constructs having different PTU configurations and/or diversified expression elements (pDAB109588, pDAB 101496) produced transgenic Arabidopsis lines with LC-PUFA detectable to varying levels in the T3 seed generation (Table 5, FIG. 4). Lines from a construct with completely diversified promoter/terminator combinations (pDAB 109584) or with entirely native PUFA synthase gene sequences (pDAB 109525) in the first orientation PTU format showed very good stability into the T3 seed generation. Combining the use of diversified promoter /terminator combinations and/or use of one or three native PUFA synthase sequences in the second orientation or third orientation PTU formats also led to consistent stability into the T3 seed generation (for example, for constructs pDAB 107960 and pDAB 107961). This resulted in individual T3 homozygous seed lines containing up to 1.77% DHA, up to 1.1% EPA, and up to 2.57% total LC-PUFAs.
Table 5. LC-PUFA analysis of T3 seed progeny from selected transgenic Arabidopsis DHA-producing T2 lines.
Total LC-PUFA, DHA and EPA contents are % of total FAMEs 1. T3 seed bulks from 5-20 individual homozygous plants were analyzed
The complete seed fatty acid profile for exemplary individual Arabidopsis T3 lines that are homozygous for the DHA-producing transgenes is shown in Table 6, in comparison with the average lipid profile for the T3 sibling nulls. The production of LC-PUFAs driven by PUFA synthase was associated with reduction in the content of native elongated fatty acids, particularly eicosenoic acid (22:1), and slight increases in oleic (18:1) and linoleic acid (18:2) content. There was no significant change in the content of saturated fatty acids, palmitic (16:0) and stearic acid (18:0).
Table 6. Fatty acid profiles of homozygous T3 seeds of Arabidopsis events transformed with the PUFA synthase and Hetl transgenes.
Selected high DHA-producing homozygous T3 lines were planted, and the T4 seed from these plants was analyzed. Ten lines from pDAB109591, pDAB109584, pDAB109525, pDAB109592, pDAB107960, and pDAB107961 transformations were analyzed. These lines continued to produce DHA (up to 1.85%) and EPA (up to 1.00%) in the T4 seed generation, indicating the stable transmission of the ω-3 LC-PUFA trait through three selfed seed generations.
Example 4: LC-PUFA Production in Canola Seeds Transformed with Schizochytrium PUFA Synthase.
Transgenic canola were produced with the binary constructs pDABl01496, pDAB109584, pDAB109592, pDAB107960, pDAB107961, pDAB107962 and pDABl 17501, (all containing the PUFA synthase genes PFAl, PFA2, PFA3, and the Nostoc PPTase NoHetl), and confirmed to contain a copy of the T-strand transgene via molecular confirmation. Τι seed was harvested from individual To transgenic canola plants, and a bulk seed sample of approximately 10 seeds from each Ti seed sample was analyzed for LC-PUFA content as previously described.
Of the Ti samples analyzed for each construct, a high proportion (81-93%) contained LC-PUFAs. Table 7. The maximum DHA content observed in the canola Ti seed samples was 3.04% for pDAB107960. The maximum EPA content observed was 1.97% for pDABl01496. The maximum combined ω-3 LC-PUFA content (DHA ΙΕΡΑ) was 4.20% for pDABl07960.
Table 7. Summary of LC-PUFA content in Ti seeds of canola events transformed with the PUFA synthase and Hetl transgenes.
To obtain an early indication of transgenic trait segregation, 48 seeds from selected Ti seed samples were analyzed for LC-PUFA content. Events with T-DNA insertion(s) at a single locus would be expected to give approximately 25% null seeds by Mendelian segregation. Of the 42 T| seed samples that were analyzed from the pDAB101496 events, 24 were found to have between 12 and 35% of seeds with no LC-PUFA (the average proportion of null seeds was 24% across all 24 samples. Table 8. Single canola Ti seeds were detected with up to 5.41% DHA, up to 3.72% EPA, and with up to 7.33% combined co-3 LC-PUFA (DHA + EPA).
Table 8. LC-PUFA content of homozygous T2 canola seed from transgenic events transformed with pDAB 101496.
Selected Τι canola seed samples were planted in the greenhouse to generate about 60-75 Ti plants. Leaf samples were taken from the seedlings at the 4-5 leaf stage for DNA analysis to determine the transgene copy number in each Ti segregant plant. The copy number analysis was performed by hydrolysis probe assays of the transgenes using the protocol described above. From these analyses, plants that were homozygous, heterozygous, and null for the transgenes were identified. Southern analysis was performed on genomic DNA extracted from the leaf samples of homozygous plants from 9 lines, probing for the presence of the PFA1 transgene, the pat transgene (at each end of the T-DNA), and the SpecR gene from the plasmid backbone. The Southern banding patterns of Ti plants from 6580[l]-035.Sx001 and 6580[l]-035.Sx002 were similar, indicating that the events were likely of clonal origin, as were 6580[l]-052.Sx001 and 6580[l]-057.Sx001. The results of the Southern analyses are shown in Table 9.
Table 9. Summary of Southern analyses from twelve Ti canola DIIA-producing lines transformed with pDAB101496.
Canola plants that were homozygous for the pDAB 101496 transgenes (and including some heterozygous and null plants) were grown to maturity, and the T2 seed was harvested and bulk seed samples analyzed for LC-PUFA content. Table 10. All ten selected pDAB101496 canola lines produced DHA and EPA in the T2 seed. The maximum LC-PUFA contents were 3.26% DHA (line 6580[l]-035. Sx002) and 1.42% EPA (line 6580[l]-035.Sx001). The amount of co-3 LC-PUFAs (DHA and EPA) in the lines was 80-88% (average 85%) of the total LC-PUFAs, the residual 12-20% being ω-6 DPA. Hemizygous Ti plants produced less LC-PUFA due to the expected segregation of the PUFA synthase trait in T2 seed. FIG. 5. The complete FAME profiles of homozygous and null seed from line canola line 6580[l]-035.Sx002 are shown in Table 10. This line produced 3.3% DHA and 1.3% EPA. There was a slight increase in alpha-linolenic (Cl8:3) and linoleic (Cl8:2) acids, and a decrease in oleic acid (Cl 8:1) content. In addition to the expected LC-PUFAs, DHA, EPA, and DP A, low levels of new ω-6 fatty acids, such as gamma-linolenic and arachidonic acid were also detectable (0.4 and 0.7%, respectively).
Table 10. FAME profile of bulk T2 canola seed from null and homozygous plants from canola event 6580[l]-035.Sx002. The lipid content for each fatly acid is shown as % of total FAMEs.
Single seed LC-PUFA analysis was performed on T2 seed batches from sibling homozygous and hemizygous plants from canola line 6580[l]-035.Sx002 that exhibited higher levels of DHA and EPA production in bulk seed analysis. FIG. 5. The seeds from the homozygous plants were quite uniform in DHA content with coefficients of variation (CVs) below 14%. The four samples of 48 individual seeds analyzed from plants that were homozygous for the transgenes had mean DHA contents of 3.70% (SD=0.44, CV=14%), 3.67% (SD=0.31, CV=8%), 3.11% (SD=0.36, CV=12%), and 3.11% (SD=0.35, CV=11%). The seed from the hemizygous plants contained an average of 15 null seeds per 48-seed sample, close to the value of 12 seeds predicted by Mendelian segregation of a single locus. The DHA content of individual seeds from these hemizygous plants varied up to a maximum value of 5.81%. FIG. 5. T2 seeds from homozygous lines derived from four canola events transformed with pDAB101496 were planted and grown in the greenhouse to produce T3 seed. All the lines continued to produce DHA and EPA in the harvested T3 seeds. Lines derived from events [6]-274.Sx001 and 6580[l]-035.Sx002 were particularly stable for LC-PUFA production, producing in T3 bulk seed measurements from individual plants an average of 3.16% DHA (range 2.73 - 3.61%, across 13 plants from three T2 lines) and 0.78% EPA (range 0.48 - 1.13%) and 3.34% DHA (range 2.85 - 3.89% across 53 plants from 8 T2 lines) and 1.12% EPA (range 0.75 - 1.71%), respectively.
Example 5: LC-PUFA Production in Soybean Seeds Transformed with PUFA Synthase.
Transgenic To soybean events which were produced via plant transformation using the binary constructs pDAB101454 (101454[16]-341.001 and 101454 [267]26702.001), pDAB101496 (101454[330]33007.001, 101454[333]33308.001, and 101454[334]33402.001), and pDAB10796 (107960[12]-626.001, 107960[12]-641.001, 107960[12]-644.001, 107960[26]-655.001, and 107960[26]-733.001) were grown to maturity in the greenhouse. Soybean events were selected that contained copies of pat transgene and the accompanying PUFA synthase and Hetl transgenes. The selected transgenic plants were self-fertilized, and the resulting Ti seed was harvested at maturity. Single seeds were obtained and analyzed via FAMEs GC-FID to determine the LC-PUFA and DHA content. Twelve whole mature seeds per plant were individually analyzed by crushing the seed with a press and homogenization using a steel ball and ball mill. The tissue was defatted three times with hexane, the pooled hexane fractions were evaporated to dryness and the residue weighed and reconstituted in heptane for FAME analysis performed as described in the previous example. The oil content (sum of the masses of the individual FAMEs divided by seed mass) of the transgenic seeds and the number of seeds produced by the transgenic Tj lines was not significantly different from that of the non-transgenic Maverick control cultivar grown in the greenhouse at the same time under the same conditions. The average and maximum level (%) of single T| seed LC-PUFAs content for selected events is summarized in Table 11. The DHA content is up to 2.0% and total PUFAs to 5.1%.
Furthermore, 3 novel, non-endogenous LC-PUFAs; DHA, EPA, and DPA (n-6), were detected in the Ti soybean seeds. Table 11.
Table 11. Ti seed FAMEs analysis from 10 events expressing 3 constructs harboring the PFA1, PFA2, PFA3 and NoHetl genes. The individual seed analysis result is expressed as a maximum content for LC-PUFA, EPA, DHA and DPA and an average of all Ti seeds for each event is also added. The ratio of n3 LC-PUFA (EPA and DHA) / Total LC-PUFAs was calculated.
DHA and EPA comprised 90% to 99% of the total LC-PUFA content in those Ti seeds that contained LC-PUFA. The highest LCPUFAs content (5.1%) was achieved with construct 107960 in event 107960[12]-626.001.
The complete lipid profiles of individual Tl seeds from soybean events 101454[16]-341 .SxOOl, pDAB101496{330}33007.001, 107960[12]-626.001, and 107960[12]-641.001 are shown in Table 12. Two individual seeds from Maverick control were added for comparison. All detected FAMEs were listed.
Table 12. Individual seed T] Fatty Acids Methyl Ester analysis from 3 constructs and control WT Maverick. All compositions are given in weight % of detected fatty acids. Not all fatty acids were quantified at the time were the analysis was conducted (NA). A value of 0 correspond to a level below the limit of quantitation.
Example 6: Lipid Analysis of Mature T2 seed from Transgenic Soybean Events.
To assess the inheritance of the LC-PUFA trait and its stability across generations, several representative events from the three constructs tested, 101454[16]-341.Sx001, 107960[12]-641.001, 107960[12]-644.001 and pDAB101496{334}33402.001 were selected based on their LCPUFAs content for growout in the green house. T] seeds were germinated in the greenhouse and T] plantlets were assayed for the presence of the PAT, PFA1 and NoHetl transgenes. Tj plants with the transgenes were grown to maturity and T2 seeds resulting from selflng were harvested for further oil analysis. From each T2 plant, 5 to 11 T2 seeds were analyzed individually for FAME by the method described in Example 1. The Ti seed result was added in bold as a reference to Table 13. The Ti lines for events 107960[12]-644.001 and pDAB101496{334}33402.001 were not homozygous, therefore, some T2 segregating seeds did not contain LC-PUFAs (the minimum LC-PUFAs is 0). The T2 seeds from event 101454[16]-341.Sx001 show a stable LC-PUFA content (2.2 to 5.6%) comparable to the Tj (3%). The slight increase in mean PUFAs observed is due to the selection of homozygous plants for the transgene locus, thereby eliminating sib null seeds in the T2 progeny. A majority of LC-PUFAs are n3 (ratio=0.9) and split between EPA (0.6 to 2.4%) and DHA (1.3 to 2.7%). The T2 seeds from event, 107960[12]-641.001, show a similar trend compared to the parent seeds. The LC-PUFA content (1.5 to 2.1%) is comparable to the T) seed (3.2%). All lines selected from event pDAB101496{334}33402.001 have T2 seeds with no LC-PUFA indicating that the Ti plants were not fixed for the transgene locus. The average LC-PUFA content of T2 seeds (0.1 to 1.1%) is comparable to the Ti (0.6%). For all those constructs the LC-PUFAs trait is inherited in the next generation without a significant difference in the amount of LC-PUFAs accumulated.
Table 13. T2 seed oil analysis from three events selected from three constructs. The individual seed analysis results is shown as a minimum and maximum content for total LC-PUFA, EPA, DHA, and DPA, and averaged across all seeds analyzed. A ratio of n3 LC-PUFA (EPA and DHA) / total LC-PUFA was calculated.
Example 7: LC-PUFA Production in Canola Seeds from pDAB101496 Grown in the Field. T2 canola seed batches from several homozygous T1 plants derived from six pDAB101496 (SEQ ID NO: 17) canola events producing moderate to high levels of DHA (described above) were independently pooled. This seed was planted in the field in Minnesota and North Dakota in 2013. Canola seed entries from untransformed plants were used as controls and commercially available canola lines were used as checks. Four replicate plots (1.2 x 6 m) were planted for each seed entry and at maturity the resulting seed (T3 seed in the case of the transgenic samples) was harvested from each plot and analyzed for LC-PUFA content. The LC-PUFA content of the harvested canola grain from each experimental plot was determined from FAME extractions of three 10-seed aliquots sampled from bulk grain from each of the four replicate field plots. Table 14.
Across the four field plots per entry, the maximum DHA and EPA contents were 4.27% and 0.65%, respectively, for Event 6580[l]-035.Sx002. The maximum DHA content from an individual plot was 4.54%. The same T2 seed lines that were bulked for each event for planting in the field were also grown in the greenhouse to compare the LC-PUFA contents of the resulting T3 seed. The DHA contents of the top four field grown pDAB101496 lines were on average 22% higher than the equivalent lines grown in the greenhouse. Table 14.
Table 14. LC-PUFA content (% Total FAMEs) of T3 canola seed from transgenic events transformed with pDAB 101496 grown in the field and greenhouse.
* Numbers are bulk seed from field plots for field samples or bulk seed from individual plants for greenhouse samples
There were no significant differences in grain yield for the transgenic canola lines producing DHA relative to the non-transgenic DH12075 plants. Table 15. All lines grown in the field produced seed with average oil contents of >40% g oil/g seed. There were no significant differences between transgenic and non-transgenic controls in chlorophyll content of the seed, or in % seed protein per gram of meal after oil extraction. Flowering time and time to maturity was unchanged between the transgenic lines and the control plants.
Table 15. T3 seed yield of from homozygous T2 transgenic DHA-producing canola plants transformed with pDAB101496 grown in the field, compared with non-transgenic DH12075 canola. Yields are the average of four plots per location (SD = standard deviation of the mean).
Batches of grain from the field trial were pooled and crushed to extract LC-PUFA-containing oil, and the oil processed through refining, bleaching and deodorizing using standard methods. Two batches of seed were processed to yield 1.2 kg RBD oil containing 3.02% DHA and 1.0% EPA and 1.0 kg RBD oil containing 4.1% DHA and 0.7% EPA. This demonstrates that the grain from the transgenic canola plants expressing PUFA synthase and Hetl can be processed to produce canola oil highly enriched in DHA and EPA.
Example 8: LC-PUFA trait stability across multiple seed generations for pDAB 107960 canola events.
For plasmid pDAB 107960, when tested in Arabidopsis, combining the use of diversified promoter /terminator combinations and use of three native PUFA synthase sequences in the second orientation PTU format led to consistent LC-PUFA trait stability into the T3 seed generation. Table 5. Canola events generated with pDAB 107960 were similarly tested for LC-PUFA trait stability over three selfed crop seed generations.
Ti seeds from nine pDABl07960 canola events selected by single seed analysis of T1 seed as previously described were planted in the greenhouse. Plants homozygous for the pDAB 107960 transgenes and segregating as a single Mendelian locus were grown to maturity, and the T2 seed was harvested from plants and the seed samples analyzed for LC-PUFA content. All plants from the nine selected pDAB 107960 canola events produced DHA and EPA in the T2 seed. Table 16. T2 seeds from seven events (113 plants) were planted in the greenhouse, grown to maturity and T3 seed was harvested from these plants. Seed samples were again analyzed from each progeny plant for LC-PUFA content. All 113 T2 canola plants from the seven selected pDAB 107960 events produced DHA and EPA in the T3 seed. Table 16. T3 seeds derived from six events (137 plants) were planted in the greenhouse, grown to maturity and T4 seed harvested from these plants. All 137 canola T3 plants from the six selected pDAB107960 events produced DHA and EPA in the T4 seed. Table 16. Five of the six events tested maintained similar high levels of DHA in each seed generation, demonstrating DHA trait stability across multiple events.
Table 16. LC-PUFA content of T2, T3, and T4 canola seed from homozygous lines from pDAB 107960 canola events. The LC-PUFA contents are shown as % of total FAMEs.
The transgene copy numbers in the six single locus pDAB 107960 canola events taken to the T4 seed generation were determined as described previously. Events 107960[6]-106, 107960[6]-107, 107960[7]-111, and 107960[6]-353 contained two copies of all the transgenes, whereas events 107960[7]-085 and 107960[6]-352 contained one copy of all the transgenes (PFA1, PFA2, PFA3, NoHetl). Thus, the LC-PUFA trait was delivered with either one or two copies of the transgene set and remained stable across three seed generations.
The complete FAME profiles of bulk T4 seed samples from two T3 plants derived from two different events (107960[7]-085 and 107960[6]-353) are shown in Table 17. These seeds contained 4.4% and 4.6% DHA, and 0.6% and 0.7% EPA, respectively. There was a concomitant decrease (-8%) in oleic acid (Cl8:1) content, and slight increase (+2%) in linoleic acid (18:2) content, and otherwise the profile was not significantly altered by the presence of the new LC-PUFAs. In addition to the expected LC-PUFAs, DHA, EPA, and DPA(n-6), low levels of new ω-6 fatty acids, gamma-linolenic (GLA, 18:3) and arachidonic acid (ARA, 20:4), were also detectable (totaling around 1 %).
Table 17. FAME profile of T4 canola seed derived from two pDAB 107960 events compared with control seed from non-transgenic DH12075. The fatty acid content is shown as % of total FAMEs.
*No plasmid
Example 9: LC-PUFA Production in T4 Canola Seeds from pDABl 07960 Grown in the Field. T3 canola seed batches from homozygous T2 plants derived from six different pDABl07960 canola events producing around 3% DHA (described above) were independently pooled. This seed was planted in the field at two locations in North Dakota in 2014. Canola seed entries from untransformed DH12075 plants were used as controls, and commercially available canola lines were used as checks. Four replicate plots (1.2 x 6 m) were planted for each seed entry, and the resulting seed (T4 seed for transgenic samples) was harvested from each plot and analyzed for LC-PUFA content by FAMEs analysis, as described previously. FAME extractions were performed on three 10-seed technical replicates sampled from bulk grain from each of the four replicated field plots. Table 18.
Table 18: LC-PUFA content (as % of total FAMEs) of T4 canola seed from transgenic events transformed with pDABl07960 grown in the field in North Dakota. Values are the average of four replicated plots at each site.
The maximum DHA and EPA contents (average of four plots per entry) were 3.98% and 0.88% for Event 107960[6]-353 and 107960(6)-106, respectively, at Site 1. The maximum DHA content from an individual plot was 4.69% at Site 1 for Event 107960[6]-353. The same T3 seed lines that were bulked for each event for planting in the field were also grown in the greenhouse to compare the LC-PUFA contents of the resulting T4 seed. Table 16. The average DHA content across all six field grown pDAB 107960 events at both sites was 3.24%, whereas the average DHA content across all the equivalent events grown in the greenhouse was 3.06%. Thus, the DHA content of the field-grown canola was on average +6% higher than the equivalent greenhouse-grown material.

Claims (38)

  1. 1. A genetically modified oilseed plant that produces an oil comprising a detectable amount of eicosapentaenoic acid (C20:5, n-3) (EPA), the plant comprising: a first polynucleotide encoding a first polypeptide of a polyunsaturated fatty acid (PUFA) synthase system comprising an amino acid sequence that is at least 80% identical to SEQ ID NO: 1; a second polynucleotide encoding a second polypeptide of the PUFA synthase system comprising an amino acid sequence that is at least 80% identical to SEQ ID NO:4; a third polynucleotide encoding a third, truncated polypeptide of the PUFA synthase System that is 1424 amino acids in length, consisting of an amino acid sequence that is at least 80% identical to SEQ ID NO:14; a fourth polynucleotide encoding a phosphopantetheinyl transferase (PPTase) consisting of an amino acid sequence that is at least 80% identical to the PPTase encoded by SEQ ID NO: 10; and a fifth polynucleotide encoding an acyl-CoA synthetase (ACS) that is at least 80% identical to the ACS encoded by SEQ ID NO:l 1, wherein each of the polynucleotides is operably finked to a promoter functional in the plant.
AU2014354631A2013-11-262014-11-26Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthaseCeasedAU2014354631B2 (en)

Priority Applications (2)

Application NumberPriority DateFiling DateTitle
AU2017272160AAU2017272160B2 (en)2013-11-262017-12-05Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase
AU2019250139AAU2019250139A1 (en)2013-11-262019-10-16Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase

Applications Claiming Priority (3)

Application NumberPriority DateFiling DateTitle
US201361909289P2013-11-262013-11-26
US61/909,2892013-11-26
PCT/US2014/067729WO2015081270A1 (en)2013-11-262014-11-26Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a thraustochytrid pufa synthase

Related Child Applications (1)

Application NumberTitlePriority DateFiling Date
AU2017272160ADivisionAU2017272160B2 (en)2013-11-262017-12-05Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase

Publications (2)

Publication NumberPublication Date
AU2014354631A1 AU2014354631A1 (en)2016-05-19
AU2014354631B2true AU2014354631B2 (en)2017-12-21

Family

ID=53199655

Family Applications (3)

Application NumberTitlePriority DateFiling Date
AU2014354631ACeasedAU2014354631B2 (en)2013-11-262014-11-26Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase
AU2017272160ACeasedAU2017272160B2 (en)2013-11-262017-12-05Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase
AU2019250139AAbandonedAU2019250139A1 (en)2013-11-262019-10-16Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase

Family Applications After (2)

Application NumberTitlePriority DateFiling Date
AU2017272160ACeasedAU2017272160B2 (en)2013-11-262017-12-05Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase
AU2019250139AAbandonedAU2019250139A1 (en)2013-11-262019-10-16Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a Thraustochytrid PUFA synthase

Country Status (17)

CountryLink
US (3)US9994828B2 (en)
EP (2)EP3073817A4 (en)
JP (1)JP2016537985A (en)
KR (1)KR20160111915A (en)
CN (1)CN106413389A (en)
AR (1)AR098539A1 (en)
AU (3)AU2014354631B2 (en)
BR (1)BR102014029437A2 (en)
CA (1)CA2928870A1 (en)
CL (1)CL2016001278A1 (en)
IL (1)IL245787B (en)
MX (1)MX2016006675A (en)
RU (1)RU2728854C2 (en)
TW (1)TW201525136A (en)
UA (1)UA120172C2 (en)
UY (1)UY35856A (en)
WO (1)WO2015081270A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
EP2408797B1 (en)2009-03-192017-03-15DSM IP Assets B.V.Polyunsaturated fatty acid synthase nucleic acid molecules and polypeptides, compositions, and methods of making and uses thereof
TW201525136A (en)*2013-11-262015-07-01Dow Agrosciences LlcProduction of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a thraustochytrid PUFA synthase
CN107532179B (en)2014-11-142021-10-15巴斯夫植物科学有限公司 Materials and methods for producing PUFAs and compositions containing PUFAs
US11987819B2 (en)2016-05-122024-05-21Dsm Ip Assets B.V.Method of increasing omega-3 polyunsaturated fatty acids production in microalgae
US10982226B2 (en)*2016-06-202021-04-20Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical CollegeGreen alga bicarbonate transporter and uses thereof
BR112021000115A2 (en)2018-07-132021-04-06Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College BESTROFINA DAS ALGAS VERDES BICARBONATE CARRIERS

Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2007106903A2 (en)*2006-03-152007-09-20Martek Biosciences CorporationPolyunsaturated fatty acid production in heterologous organisms using pufa polyketide synthase systems
WO2010108114A2 (en)*2009-03-192010-09-23Martek Biosciences CorporationPolyunsaturated fatty acid synthase nucleic acid molecules and polypeptides, compositions, and methods of making and uses thereof

Family Cites Families (45)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US4693977A (en)1982-08-231987-09-15Queen's University At KingstonEnzyme immobilization for producing cephalosporin antibiotics
US4536475A (en)1982-10-051985-08-20PhytogenPlant vector
NL8300698A (en)1983-02-241984-09-17Univ Leiden METHOD FOR BUILDING FOREIGN DNA INTO THE NAME OF DIABIC LOBAL PLANTS; AGROBACTERIUM TUMEFACIENS BACTERIA AND METHOD FOR PRODUCTION THEREOF; PLANTS AND PLANT CELLS WITH CHANGED GENETIC PROPERTIES; PROCESS FOR PREPARING CHEMICAL AND / OR PHARMACEUTICAL PRODUCTS.
NZ207765A (en)1983-04-151987-03-06Lubrizol Genetics IncPlant expression of transferred dna(t-dna)from plasmids associated with agrobacterium sp
US5380831A (en)1986-04-041995-01-10Mycogen Plant Science, Inc.Synthetic insecticidal crystal protein gene
US4886937A (en)1985-05-201989-12-12North Carolina State UniversityMethod for transforming pine
US5015580A (en)1987-07-291991-05-14AgracetusParticle-mediated transformation of soybean plants and lines
ATE87032T1 (en)1986-12-051993-04-15Ciba Geigy Ag IMPROVED METHOD OF TRANSFORMING PLANT PROTOPLASTS.
US5416011A (en)1988-07-221995-05-16Monsanto CompanyMethod for soybean transformation and regeneration
US5693507A (en)1988-09-261997-12-02Auburn UniversityGenetic engineering of plant chloroplasts
US5302523A (en)1989-06-211994-04-12Zeneca LimitedTransformation of plant cells
US5501967A (en)1989-07-261996-03-26Mogen International, N.V./Rijksuniversiteit Te LeidenProcess for the site-directed integration of DNA into the genome of plants
US5550318A (en)1990-04-171996-08-27Dekalb Genetics CorporationMethods and compositions for the production of stably transformed, fertile monocot plants and cells thereof
US7705215B1 (en)1990-04-172010-04-27Dekalb Genetics CorporationMethods and compositions for the production of stably transformed, fertile monocot plants and cells thereof
HU220773B1 (en)1990-01-222002-05-28Dekalb Genetics CorporationProcess for producing productive transgenic maize plants
US5484956A (en)1990-01-221996-01-16Dekalb Genetics CorporationFertile transgenic Zea mays plant comprising heterologous DNA encoding Bacillus thuringiensis endotoxin
US5451513A (en)1990-05-011995-09-19The State University of New Jersey RutgersMethod for stably transforming plastids of multicellular plants
US6403865B1 (en)1990-08-242002-06-11Syngenta Investment Corp.Method of producing transgenic maize using direct transformation of commercially important genotypes
US5384253A (en)1990-12-281995-01-24Dekalb Genetics CorporationGenetic transformation of maize cells by electroporation of cells pretreated with pectin degrading enzymes
JP3730658B2 (en)1991-04-092006-01-05ユニリーバー・ナームローゼ・ベンノートシヤープ Plant promoters involved in regulation of lipid biosynthesis in seeds
EP0604662B1 (en)1992-07-072008-06-18Japan Tobacco Inc.Method of transforming monocotyledon
AU670316B2 (en)1992-07-271996-07-11Pioneer Hi-Bred International, Inc.An improved method of (agrobacterium)-mediated transformation of cultured soybean cells
US5659026A (en)1995-03-241997-08-19Pioneer Hi-Bred InternationalALS3 promoter
WO1997013402A1 (en)1995-10-131997-04-17Dow Agrosciences LlcModified bacillus thuringiensis gene for lepidopteran control in plants
US5693512A (en)1996-03-011997-12-02The Ohio State Research FoundationMethod for transforming plant tissue by sonication
US6210700B1 (en)1997-01-142001-04-03Novartis Nutrition AgEnhancement of transplant graft survival through nutritional immunomodulation with omega-9 fatty acid dietary supplement therapy
US5981840A (en)1997-01-241999-11-09Pioneer Hi-Bred International, Inc.Methods for agrobacterium-mediated transformation
JP2002510205A (en)1997-06-042002-04-02カルジーン エルエルシー Production of polyunsaturated fatty acids by expressing polyketide-like synthetic genes in plants
US6566583B1 (en)1997-06-042003-05-20Daniel FacciottiSchizochytrium PKS genes
US7217856B2 (en)1999-01-142007-05-15Martek Biosciences CorporationPUFA polyketide synthase systems and uses thereof
AU2848800A (en)1999-01-142000-08-01Monsanto Technology LlcSoybean transformation method
US7211418B2 (en)1999-01-142007-05-01Martek Biosciences CorporationPUFA polyketide synthase systems and uses thereof
US8003772B2 (en)*1999-01-142011-08-23Martek Biosciences CorporationChimeric PUFA polyketide synthase systems and uses thereof
US7247461B2 (en)1999-01-142007-07-24Martek Biosciences CorporationNucleic acid molecule encoding ORFA of a PUFA polyketide synthase system and uses thereof
KR100785946B1 (en)1999-09-302007-12-14니뽄 다바코 산교 가부시키가이샤 Plant transformation vector
US20040172682A1 (en)2003-02-122004-09-02Kinney Anthony J.Production of very long chain polyunsaturated fatty acids in oilseed plants
RU2337530C2 (en)*2003-11-142008-11-10Консехо Супериор Де Инвестигасионес СьентификасOil, seeds, and plants of sunflower with modified distribution of fatty acids in triacylglycerine molecule
ATE528403T1 (en)*2004-02-272011-10-15Basf Plant Science Gmbh METHOD FOR PRODUCING UNSATURATED OMEGA-3 FATTY ACIDS IN TRANSGENIC ORGANISMS
WO2006135866A2 (en)2005-06-102006-12-21Martek Biosciences CorporationPufa polyketide synthase systems and uses thereof
US7868228B2 (en)2006-01-312011-01-11Monsanto Technology LlcPhosphopantetheinyl transferases from bacteria
CN201389213Y (en)*2009-04-032010-01-27陈春芬Novel sputum aspirator
TW201144442A (en)*2010-05-172011-12-16Dow Agrosciences LlcProduction of DHA and other LC-PUFAs in plants
EP2597943B1 (en)2010-07-292018-02-07Dow AgroSciences LLCStrains of agrobacterium modified to increase plant transformation frequency
TW201307553A (en)2011-07-262013-02-16Dow Agrosciences LlcProduction of DHA and other LC-PUFAs in plants
TW201525136A (en)*2013-11-262015-07-01Dow Agrosciences LlcProduction of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a thraustochytrid PUFA synthase

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
WO2007106903A2 (en)*2006-03-152007-09-20Martek Biosciences CorporationPolyunsaturated fatty acid production in heterologous organisms using pufa polyketide synthase systems
WO2010108114A2 (en)*2009-03-192010-09-23Martek Biosciences CorporationPolyunsaturated fatty acid synthase nucleic acid molecules and polypeptides, compositions, and methods of making and uses thereof

Also Published As

Publication numberPublication date
EP3666066A1 (en)2020-06-17
IL245787A0 (en)2016-07-31
AU2014354631A1 (en)2016-05-19
US20150299676A1 (en)2015-10-22
KR20160111915A (en)2016-09-27
EP3073817A4 (en)2017-06-21
TW201525136A (en)2015-07-01
RU2728854C2 (en)2020-07-31
AU2017272160A1 (en)2017-12-21
US20210395765A1 (en)2021-12-23
RU2016125219A (en)2018-01-09
JP2016537985A (en)2016-12-08
US20180282711A1 (en)2018-10-04
MX2016006675A (en)2017-01-16
AU2017272160B2 (en)2019-08-08
US11046968B2 (en)2021-06-29
BR102014029437A2 (en)2015-07-07
CL2016001278A1 (en)2017-01-20
US9994828B2 (en)2018-06-12
UA120172C2 (en)2019-10-25
UY35856A (en)2015-06-30
EP3073817A1 (en)2016-10-05
CN106413389A (en)2017-02-15
WO2015081270A1 (en)2015-06-04
AR098539A1 (en)2016-06-01
CA2928870A1 (en)2015-06-04
IL245787B (en)2019-09-26
AU2019250139A1 (en)2019-10-31

Similar Documents

PublicationPublication DateTitle
US11168332B2 (en)Production of DHA and other LC PUFAs in plants
US20210395765A1 (en)Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a thraustochytrid pufa synthase
US11053511B2 (en)Production of DHA and other LC PUFAs in plants
JP2009529891A (en) Plant seed oil containing polyunsaturated fatty acids
US20220042028A1 (en)Production of dha and other lc pufas in plants
CA2480724A1 (en)Expression of phospholipid:diacylglycerine acyltransferase (pdat) for the production of plant storage lipids with polyunsaturated fatty acids

Legal Events

DateCodeTitleDescription
FGALetters patent sealed or granted (standard patent)
HBAlteration of name in register

Owner name:CORTEVA AGRISCIENCE LLC

Free format text:FORMER NAME(S): DSM IP ASSETS B.V.; DOW AGROSCIENCES, LLC

Owner name:DSM IP ASSETS B.V.

Free format text:FORMER NAME(S): DSM IP ASSETS B.V.; DOW AGROSCIENCES, LLC

MK14Patent ceased section 143(a) (annual fees not paid) or expired

[8]ページ先頭

©2009-2025 Movatter.jp