|
|
Title:
Method for generating amplified RNA
United States Patent: 7,354,742
Issued: April 8, 2008
Inventors: Kamme; Fredrik
(San Diego, CA), Zhu; Jessica Y. (San Diego, CA)
Assignee: Ortho-McNeil
Pharmaceutical, Inc. (Raritan, NJ)
Appl. No.: 10/080,795
Filed: February 22, 2002
|
|
|
Pharm/Biotech Jobs
|
Abstract
The present invention provides a new
method for producing amplified RNA from a selected set of cells. This
method combines and utilizes known methods in a novel way to produce the
gene expression profiles. The methods of the present invention utilize
thermostable DNA polymerases and RNase H to produce high fidelity second
strand synthesis of nucleic acids from selected cells in a highly
efficient mariner and in substantially reduced time.
Description of the
Invention
SUMMARY OF THE INVENTION
The present invention relates to methods for amplification of nucleic
acids using new improved methods for amplification of RNA derived from
cells, preferably eukaryotic cells and most preferably mammalian cells,
which methods are reproducible and enable amplification of extremely small
amounts of mRNA, such as might be obtained from as few as tens to hundreds
of mammalian cells. In typical embodiments of the invention, a
thermostable DNA polymerase is used in the second strand synthesis step.
The use of a thermostable DNA polymerase enables the generation of a
relatively large amount of RNA from a small starting number of cells with
high efficiency, and in a substantially reduced time period compared to
the known methods for performing RNA amplification. RNAs generated using
this method may be labeled and employed to profile gene expression in
different populations of cells.
In one embodiment, the invention is directed to a method for amplifying at
least one mRNA in a sample containing a plurality of different mRNAs
comprising: a) synthesizing first strand nucleic acid molecules by
contacting under conditions conducive to reverse transcription activity at
least one mRNA in said sample with: (i) reverse transcriptase, and (ii)
(ii) a first primer that is sufficiently complementary to a sequence in
the mRNA so as to prime synthesis in a direction toward the 5' end of the
mRNA; b) synthesizing a double-stranded nucleic acid molecule by
contacting under conditions conducive to thermosilable DNA polymerase
activity the first strand synthesis step with: (i) a first thermostable
DNA polymerase, and (ii) a thermostable RNAse H; c) transcribing resultant
double stranded nucleic acid molecules into cRNA by contacting the double
stranded nucleic acid molecules with an RNA polymerase under conditions
conducive to RNA polymerase activity, such that cRNA is produced.
In another embodiment, the invention is directed to the above amplifying
method, further comprising, after the transcribing step, determining the
presence or absence of a pre-selected target mRNA in said sample.
In a specific embodiment, the invention is directed to an amplifying
method, wherein the mRNA is extracted from at least one cell of interest,
and further comprising contacting the cRNA produced with an array
containing one or more species of polynucleotide positioned at
pre-selected sites on the array, under conditions conducive to
hybridization; and detecting any hybridization that occurs between the one
or more species of polynucleotide and the cRNA.
In another embodiment, the invention is directed to a method for comparing
the presence or amount of at least one mRNA of interest in a first sample
and in a second sample, said first sample and said second sample each
containing a plurality of different mRNAs from one or more cells,
comprising, for the first sample, (a) synthesizing first strand nucleic
acid molecules by contacting under conditions conducive to reverse
transcription activity at least one mRNA in said sample with (i) reverse
transcriptase, and (ii) a first primer that is sufficiently complementary
to a sequence in the mRNA so as to prime synthesis in a direction toward
the 5' end of the mRNA; (b) synthesizing a double-stranded nucleic acid
molecule by contacting under conditions conducive to thermostable DNA
polymerase activity the first strand synthesis step with (i) a first
thermostable DNA polymerase, and (ii) a thermostable RNase H and (c)
transcribing resultant double stranded nucleic acid molecules into cRNA by
contacting the double stranded nucleic acid molecules with an RNA
polymerase under conditions conducive to RNA polymerase activity, such
that cRNA is produced; (d) labeling the cRNA produced in step c) with a
first label; (e) repeating steps (a)-(c) with said second sample; (f)
labeling the cRNA produced in step (e) with a second label distinguishable
from said first label; (f) detecting or measuring the mRNA of interest in
the first sample by contacting the cRNA labeled with said first label with
a polynucleotide probe capable of hybridizing to said cRNA of the mRNA of
interest under conditions conducive to hybridization; and detecting any
hybridization that occurs between said probe and said cRNA; (g) detecting
or measuring the mRNA of interest in the second sample by contacting the
cRNA labeled with said second label with said polynucleotide probe capable
of hybridizing to said cRNA of the mRNA of interest under conditions
conducive to hybridization; and detecting any hybridization that occurs
between said probe and said cRNA; and (h) comparing the mRNA of interest
detected or measured in said first sample with the mRNA of interest
detected or measured in said second sample.
In a specific embodiment of the above method, in steps (f) and (g), the
steps of contacting the cRNA labeled with said first label with said
polynucleotide probe, and contacting the cRNA labeled with said second
label with said polynucleotide probe, are carried out concurrently.
In another specific embodiment of the method described above, the first
sample contains mRNAs from cells that are pathologically aberrant, and the
second sample contains mRNAs from cells that are not pathologically
aberrant.
In another embodiment, the invention is directed to a kit comprising in
one or more containers (a) a mixture of first primers; (b) a thermostable
DNA polymerase; and (c) a thermostable RNase H.
In another embodiment of the invention, a label or labels may be
introduced into the RNA during the transcription step to facilitate gene
expression profiling in different populations of cells.
DETAILED DESCRIPTION OF THE INVENTION
The present invention provides methods for the amplification of nucleic
acid sequences in a population of cells, which methods are reproducible
and enable the amplification of extremely small amounts of mRNA, such as
might be obtained from as few as a single mammalian cell.
It is to be understood that the invention is not limited to the particular
embodiments described below, as variations of the described embodiments
may fall within the enclosed claims. Also, the terminology used to
describe the enclosed embodiments is not intended to be limiting. The
scope of the present invention is established by the claims.
As described above, this invention provides method for amplifying nucleic
acids. Nucleic acids are polymers consisting of ribonucleotides or
deoxyribonycleotides, or a mixture of both. `RNA` is a polymer, consisting
of ribonucleotides, and at least two nucleotides long, but generally
longer, usually 100-10000 nucleotides. `DNA` is a polymer, consisting of
deoxyribonucleotides, and at least two nucleotides long, but generally
longer, usually 100-10000 nucleotides. cDNA is a DNA polymer synthesized
using an RNA polymer as a template, therefore containing the complementary
sequence of the RNA polymer. An oligonucleotide is a short DNA or RNA
polymer, or a mixture of both. Oligonucleotides are at least two
nucleotides long, more usually 10-150 nucleotides. The oligonucleotide
primer for use in the methods of the present invention can be of any
suitable size, and are preferably 40-100 nucleotides in length. The
oligonucleotide primers can be DNA, chimeric mixtures of RNA and DNA or
derivatives or modified versions thereof, so long as it is still capable
of priming the desired reaction. The oligonucleotide primer can be
modified at the base moiety, sugar moiety, or phosphate backbone, and may
include other appending groups or labels, so long as it is still capable
of priming the desired amplification reaction. The oligonucleotide primers
may be derived by cleavage of a larger nucleic acid fragment using
non-specific nucleic acid cleaving chemicals or enzymes or site-specific
restriction endonucleases; or by synthesis by standard methods known in
the art, e.g. by use of an automated DNA synthesizer (such as are
commercially available from Biosearch, Applied Biosystems, etc.) and
standard phosphoramidite chemistry.
The methods of the invention are applicable to nucleic acid sequences
derived from both eukaryotic and prokaryotic cells, although they are
preferably used with eukaryotic cells, and most preferably, with mammalian
cells. Among cells that may serve as sources of DNA or RNA are nucleated
blood cells, established cell lines, tumor cells, and tissue biopsy
specimens, normal tissues, organs, as well as pathologically aberrant
cells, among others that will be readily apparent to those of skill in the
art. Pathologically aberrant cells include cells involved directly or
indirectly with a disease or condition, such as infected cells, cells that
have become abnormal, cancerous cells, cells producing a substance that
causes a disease or condition, or cells that produce or fail to produce a
particular protein or proteins inconsistently with normal cells. Cells
from which nucleic acid molecules are to be derived may be collected using
a wide variety of techniques well known in the art, including, but not
limited to, laser capture using instrumentation, reagents and instructions
supplied by, for example Arcturus, PALM, or Leica manufacturers,
aspiration of cells or the intracellular contents of living or preserved
cells using micropipettes, needle or surgical biopsies, cell sorting, such
as fluorescence activated cell sorting, or by antibody binding sorting.
Cells of interest also include wild-type cells, drug-exposed wild-type
cells, modified cells, and drug-exposed modified cells.
Although the amplification methods of the invention can be adapted to
amplify DNA, it is preferred to utilize the methods to amplify RNA from a
population of cells. Total cellular RNA, cytoplasmic RNA, or poly(A)+ RNA
may be used. Methods for preparing total and poly(A)+ RNA are well known
and are described generally in Sambrook et al., Molecular Cloning--A
Laboratory Manual, 2.sup.nd Edition, (Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y., 1989, Vol. 1-3) and Current Protocols in Molecular
Biology, (Ausubel et al. (Eds.), Current Protocols Publishing, NY, 1994,
Vol. 2).
RNA may be isolated from eukaryotic cells by a wide variety of procedures
known in the art that involve lysis of the cells and denaturation of the
proteins contained therein. Additional steps may be employed to remove
DNA. Cell lysis may be accomplished with a nonionic detergent, followed by
microcentrifugation to remove the nuclei and hence the bulk of the
cellular DNA. RNA may be extracted from cells of the various types of
interest using guanidinium thiocyanate lysis followed by CsCl
centrifugation to separate the RNA from DNA (Chirgwin et al., Biochem.
(1979) 18:5294-5299). Poly(A)+ RNA is selected by selection with oligo-dT
cellulose (see Sambrook et al., Molecular Cloning--A Laboratory Manual,
2.sup.nd Edition, (Cold Spring Harbor Laboratory, Cold Spring Harbor,
N.Y., 1989, Vol. 1-3). Alternatively, separation of RNA from DNA can be
accomplished by organic extraction, for example, with acid phenol or
phenol/chloroform/isoamyl alcohol. Alternatively, RNA may be extracted
from samples based on binding of RNA to silica under chaotropic
conditions. If desired, RNase inhibitors may be added to the lysis buffer.
Likewise, for certain cell types, it may be desirable to add a protein
denaturation/digestion step to the protocol.
The sample of RNA can comprise a plurality of different mRNA molecules,
each different mRNA molecule having a different nucleotide sequence. In a
specific embodiment, the mRNA molecules in the RNA sample comprise 100 or
fewer different nucleotide sequences. More typically, the mRNA molecules
of the RNA sample comprise less than 500, or up to 100,000 or more
different nucleotide sequences. The mRNA molecules of a mammalian RNA
sample may possibly contain about 20,000 to 30,000 or more different
nucleotide sequences.
Total RNA or mRNA from cells is used in the methods of the present
invention. The source of the RNA can be from a wide variety of cells,
including, but not limited to, cells of a plant or animal, human, mammal,
primate, non-human animal, dog, cat, mouse, rat, bird, yeast, eukaryote,
or prokaryote.
In the method of this invention, cDNA is enzymatically synthesized from
mRNA, primed by an oligonucleotide that carries in the 5' end the sequence
for an RNA polymerase promoter, such as T7 RNA polymerase. The resulting
single-stranded cDNA is converted into double-stranded cDNA using a mix of
a DNA polymerase and an RNase H. The double-stranded cDNA is then
transcribed using T7 RNA polymerase, or other RNA polymerases, producing
RNA. Several RNA molecules will be transcribed for each template cDNA
molecule, thus constituting an amplification step. The produced RNA is
called amplified RNA. Using high-yield transcription kits, such as the
Ampliscribe kit from Epicentre (Madison, Wis.), it is possible to achieve
500-3000 fold amplification judged by mass.
In the method of this invention, the first-strand cDNA is preferably
produced by reverse transcription, wherein DNA is made from RNA using the
enzyme reverse transcriptase. Reverse transcriptase is found in all
retroviruses and is commonly obtained from avian myeloblastoma virus or
Moloney murine leukemia virus; enzyme from these sources is commercially
available from Invitrogen and Roche.
In general, it is preferred to use the poly(A) tail of mRNA for reverse
transcription, using an oligo(dT)-containing primer. The primer will
preferably contain about twelve to thirty T nucleotides, most preferably
twenty-one T nucleotides. In a specific embodiment, the primer for reverse
transcription contains between 35-100 T nucleotides.
The primer for first strand cDNA synthesis is an oligo dT primer with the
RNA polymerase promoter sequence in the 5' end that primes synthesis in
the direction towards the 5' end of substantially all the mRNAs in the
sample. The primer may be prepared by any suitable methods, such as
phosphotriester and phosphodiester methods of synthesis, or automated
embodiments thereof. It is also possible to use a primer that has been
isolated from a biological source, such as a restriction endonuclease
digest, although synthetic primers are preferred. A wide variety of RNA
polymerase promoter sequences are suitable for use in the methods of the
present invention and are well known to those skilled in the art,
including, but not limited to, T7, T3 and Sp6 promoters.
The primer for first strand cDNA synthesis is annealed to the RNA in
water. 5 ng to 5000 ng, preferably 10 ng to 500 ng, more preferably 25 ng
to 200 ng of primer is mixed with RNA. The mixture is added to a
temperature between 60.degree. C. and 90.degree. C., preferably about
70.degree. C., for about five minutes and then cooled on ice. cDNA is
synthesized using a reverse transcriptase, preferably RNase H- such as
Superscript II (Invitrogen), in a buffer containing 50 mM Tris-HCl, 75 mM
KCl, 3 mM MgCl.sub.2, 20 mM dithiothreitol, 500 .mu.M deoxynucleotide
triphosphates (dNTPs) in a volume of approximately 20 .mu.L. Forty units
RNasin (Promega) may be added to protect RNA from RNase contamination.
Typical amounts of Superscript II reverse transcriptase is 200
units/reaction. The mix is incubated preferably between 37.degree. C. and
42.degree. C. for one to two hours.
In the method of the present invention, a substantial and unexpected
improvement has been made to the second strand cDNA synthesis step by the
usage if thermostable enzymes. In the methods of the present invention,
enzymes are considered thermostable and suitable for use herein if they
maintain enzymatic activity above 50.degree. C., and are referred to
herein as "thermostable DNA polymerases". Previous to the discovery of the
improvements of the present invention, second strand synthesis methods are
carried out by a method known in the art. In the previous methods used in
the art, the DNA polymerase used for second strand cDNA synthesis were
typically E. coli DNA polymerase I, Klenow fragment of E. coli DNA
polymerase I, or T4 DNA polymerase. The most frequently used method to
date is the Gubler-Hoffman method originally described in 1983 (Gubler and
Hoffman (1983) supra), in which E. coli DNA polymerase I, RNase H and E.
coli DNA ligase I is used.
By way of example, but not limitation, second strand cDNA synthesis is
generally performed in the current invention by adding an aliquot of the
first strand cDNA reverse transcribed above into a mix containing 20 mM
Tris-HCl pH 8.8, 10 mM KCl, 10 mM (NH.sub.4).sub.2SO.sub.4, 2 mM
MgSO.sub.4, 0.1% Triton X-0.100, 375 .mu.M dNTPs, 24 U of Bst DNA
polymerase large fragment (New England Biolabs), five units of
thermostable RNase H (`Hybridase`, Epicentre, Wis.) and water to a final
volume of 80 .mu.l and incubated at temperature of 65.degree. C. for ten
minutes. The temperature of incubation is preferably 45.degree. C. to
80.degree. C., more preferably 55.degree. C. to 70.degree. C., and most
preferably 60.degree. C. to 65.degree. C. The time of incubation is one to
sixty minutes, more preferably five to thirty minutes, and most preferably
ten to twenty minutes. The concentration of Bst DNA polymerase large
fragment is preferably 0.012 to 1.3 units/.mu.l, more preferably 0.12 to
0.63 units per/.mu.l, and most preferably 0.18 to 0.44 units/.mu.l. The
concentration of thermostable RNase H is preferably 0.0031 units/.mu.l to
1.3 units/.mu.l, more preferably 0.0062 units/.mu.l to 0.63 units/.mu.l,
and most preferably 0.012 to 0.32 units/.mu.l.
Other thermostable DNA polymerases than Bst DNA polymerase large fragment
may be employed and are suitable for use in the methods of the present
invention for second strand cDNA synthesis at an elevated temperature
above 45.degree. C. Suitable thermostable DNA polymerases include, but not
limited to, Bst DNA polymerase native enzyme, Taq DNA polymerase, Pfu DNA
polymerase, Tgo DNA polymerase, Phi29 DNA polymerase plus a
thermostabilizing agent such as trehalose, T7 DNA polymerase plus a
thermostabilizing agent such as trehalose, or the Klenow fragment of E.
coli polymerase 1 plus a thermostabilizing agent such as trehalose, native
E. coli DNA polymerase I plus a thermostabilizing agent such as trehalose.
After second strand cDNA synthesis, the cDNA may optionally be purified.
Several options exist for purification. A preferred method is purifying
the DNA on a silica column in chaotropic conditions, e.g. the PCRquick
column (Qiagen). It is preferable to add 100 ng of polyinosinic acid to
the sample to be purified to minimize adsorptive losses of materials
during purification. The double-stranded cDNA carries a functional T7 RNA
polymerase promoter and is transcribed using a high yield T7 RNA
transcription kit such as the Ampliscribe kit (Epicentre) or the
Megascribe kit (Ambion).
One cycle of first strand cDNA synthesis, second strand cDNA synthesis and
RNA transcription as described above will yield an amplification of
500-3000 fold as judged by RNA mass. For some applications, in particular
single cell samples, further amplification may be required. This can be
done performing a second cycle of first strand cDNA synthesis, second
strand cDNA synthesis and RNA transcription. Starting from amplified RNA,
the RNA is heat denatured at 70.degree. C. for ten minutes in the presence
of 1 .mu.g of random hexamers in water in a total volume of 10 .mu.l. The
mix is quench cooled on ice, and first strand cDNA synthesis mix, as
described above, is added. The mix is incubated at room temperature for
ten minutes before incubating at 37.degree. C. for two hours. After
incubation, 1 .mu.l E. coli RNase H is added and the mix is incubated a
further thirty minutes at 37.degree. C. to digest the RNA strand in the
RNA-cDNA hybrid. The reaction is then heated to 95.degree. C. for five
minutes, and then cooled on ice. As the amplified RNA used as a template
for the cDNA synthesis reaction contains three to six bases derived from
the T7 RNA polymerase promoter followed by a stretch of uridines, the cDNA
will end at the 3' end with a stretch of deoxyadenines plus the T7 RNA
polymerase promoter derived bases. This sequence in the cDNA is exploited
to prime the second strand synthesis by hybridizing the cDNA with the
T7dT21 oligonucleotide used above. The heat-denatured cDNA is hybridized
with 11 g T7dT21 oligonucleotide at 42.degree. C. for ten minutes, and
then cooled on ice. This serves as the template for second strand cDNA
synthesis. Second strand cDNA synthesis can be performed using the Gubler-Hoffman
method outlined above. Alternatively, Bst DNA polymerase large fragment
can be used as described above. The temperature of incubation is
preferably reduced to 60.degree. C. The double-stranded cDNA is purified
and transcribed using T7 RNA polymerase as described above.
We have observed that free T7dT.sub.21 primer will result in transcription
and the production of RNA if put into a T7 RNA transcription mix. This may
be due to partial hybridization between two T7dT.sub.21 oligonucleotides
as part of the central promoter is canonical. If Bst DNA polymerase large
fragment is used for second strand cDNA synthesis in the first cycle of
first strand cDNA synthesis, second strand cDNA synthesis and T7 RNA
transcription, and, all of the product is used for a second cycle of first
strand cDNA synthesis, second strand cDNA synthesis and T7 RNA
transcription, an RNA product will appear in negative controls. This
effect is probably due to the lack of 5'-3' exonuclease activity in Bst
DNA polymerase large fragment. This will result in T7dT.sub.21
oligonucleotide remaining in solution after second strand cDNA synthesis,
which then will carry over into the transcription reaction. The importance
of this observation may be particularly relevant when very small samples
of RNA are employed, such as single cell RNA samples. This effect can be
greatly reduced by incubating the double-stranded cDNA after second strand
cDNA synthesis with ten to fifteen units of E. coli Exonuclease I for
fifteen minutes prior to purification of the cDNA. Alternatively, the
effect can be prevented by designing the T7dT.sub.21 primer in such a way
that the tendency to from oligonucleotide dimers is reduced. Such a design
may be that the 5' end of the T7dT.sub.21 oligonucleotide forms a hairpin
structure, with a mismatched terminal five to ten bases that sterically
prevent the canonical part of the central promoter to hybridize in a
T7dT.sub.21 dimer. The steric effect for blocking dimers forming can be
enhanced by conjugating a bulky group at the 5'-end of the oligonucleotide.
By way of example, but not limitation, the bulky group may be a
fluorophore such as Cy3.
The method of the present of the invention may be used to perform
amplification in situ, on samples of preserved or fresh cells or tissues
(see, e.g., PCR--In Situ Hybridization: Protocols and Applications,
3.sup.rd Edition, (Nuovo (Ed.) Lippincott-Raven Press, NY, 1997).
Nucleic acid amplification products such as amplified RNA, or cDNA
synthesized from amplified RNA, may be labeled with any art-known
detectable marker, including radioactive labels such as .sup.32P,
.sup.35S, .sup.3H, and the like; fluorophores; chemiluminescers; or
enzymatic markers, with fluorescent labels preferred such as fluorescein
isothiocyanate, lissamine, Cy3, Cy5, and rhodamine 110, with Cy3 and Cy5
particularly preferred. Suitable fluorophore moieties that can be selected
as labels include, but are not limited to,
4-acetamido-4'-isothiocyanatostilbene-2,2'disulfonic acid acridine and
derivatives: acridines, acridine isothiocyanate,
5-(2'-aminoethyl)aminonaphthalene-1-sulfonic acid (EDANS),
4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate (Lucifer
Yellow VS), -(4-anilino-1-naphthyl)maleimide, anthranilimide, Brilliant
Yellow, coumarin and derivatives: coumarin, 7-amino-4-methylcoumarin (AMC,
Coumarin 120), 7-amino-4-trifluoromethylcoumarin (Coumarin 151), Cy3, Cy5,
cyanosine, 4',6-diaminidino-2-phenylindole (DAPI),
5',5''-dibromopyrogallol-sulfonephthalein (Bromopyrogallol Red),
7-diethylamino-3-(4'-isothiocyanatophenyl)-4-methylcoumarin,
diethylenetriamine pentaacetate,
4,4'-diisothiocyanatodihydro-stilbene-2,2'-disulfonic acid,
4,4'-diisothiocyanatostilbene-2,2'-disulfonic acid,
5-[dimethylamino]naphthalene-1-sulfonyl chloride (DNS, dansyl chloride),
4-(4'-dimethylaminophenylazo)benzoic acid (DABCYL),
4-dimethylaminophenylazophenyl-4'-isothiocyanate (DABITC), eosin and
derivatives: eosin, eosin isothiocyanate, erythrosin and derivatives:
erythrosin B, erythrosin isothiocyanate, ethidium, fluorescein and
derivatives: 5-carboxyfluorescein (FAM),
5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF),
2'7'-dimethoxy-4'5'-dichloro-6-carboxyfluorescein (JOE), fluorescein,
fluorescein isothiocyanate, QFITC (XRITC), fluorescamine, IR144, IR1446,
Malachite Green isothiocyanate, 4-methylumbelliferone, ortho
cresolphthalein, nitrotyrosine, pararosaniline, Phenol Red, B-phycoerythrin,
o-phthaldialdehyde, pyrene and derivatives: pyrene, pyrene butyrate,
succinimidyl 1-pyrene butyrate, Reactive Red 4 (Cibacron.RTM. Brilliant
Red 3B-A), rhodamine and derivatives: 6-carboxy-X-rhodamine (ROX),
6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride,
rhodamine (Rhod), rhodamine B, rhodamine 110, rhodamine 123, rhodamine X
isothiocyanate, sulforhodamine B, sulforhodamine 101, sulfonyl chloride
derivative of sulforhodamine 101 (Texas Red), N,N,N'N'-tetramethyl-6-carboxyrhodamine
(TAMRA), tetramethyl rhodamines, tetramethyl rhodamine isothiocyanate (TRITC),
riboflavin, rosolic acid, terbium chelate derivatives.
Not only fluorophores, but also chemiluminescers and enzymes, among
others, may be used as labels. In yet another embodiment, the RNA or cDNA
may be labeled with an enzymatic marker that produces a detectable signal
when a particular chemical reaction is conducted, such as alkaline
phosphatase or horseradish peroxidase. Such enzymatic markers are
preferably heat stable, so as to survive the second strand synthesis and
denaturing steps of the amplification process of the present invention.
RNA amplified by the methods of this invention is suitable for
quantitative comparisons of gene expression between different populations
of cells or between populations of cells exposed to different stimuli. The
invention utilizes the ability to measure the responses of a biological
system to a large variety of perturbations. One of ordinary skill in the
art would appreciate that this invention is not limited to the following
specific methods for measuring the responses of a biological system. In
particular, the presence of cRNA(s) of interest (and thus mRNA(s) of
interest in the sample) can be detected or measured by procedures
including, but not limited to, Northern blotting or using bead-bound
oligonucleotides as probes, or the use of polynucleotide microarrays.
One or more labels is introduced into the RNA during the transcription
step to facilitate gene expression profiling. Gene expression can be
profiled in any of several ways, among which the preferred method is to
probe a DNA microarray with the labeled RNA transcripts generated above. A
nucleic acid microarray, or chip, is a microscopic array of nucleic acids,
such as DNA fragments or synthetic oligonucleotides, disposed in a defined
pattern on a solid support, wherein they are amenable to analysis by
standard hybridization methods (Schena et al., BioEssays (1996) 18: pg.
427).
The nucleic acid in a microarray may be derived from genomic or cDNA
libraries, from fully sequenced clones, from partially sequenced cDNAs
known as expressed sequence tags (ESTs), or synthetically made on the
microarray surface or substrate. Methods for obtaining such DNA molecules
are generally known in the art (see, e.g., Current Protocols in Molecular
Biology, (Ausubel et al. (Eds.), Current Protocols Publishing, NY, 1994,
Vol. 2). Alternatively, oligonucleotides may be synthesized by
conventional methods, such as phosphoramidite-based synthesis.
Gene expression profiling can be done for a wide variety of purposes,
including but not limited to, screening, diagnosis, staging a disease, and
monitoring response to therapy, as well as for identifying genetic targets
of drugs and of pathogens.
This invention is particularly useful to produce aRNA for the analysis of
gene expression profiles, in particular for diagnostic applications. For
expression profiling, nucleic acid microarrays are typically probed using
nucleic acid molecules, extracted and amplified from the cells whose gene
expression profile it is desired to analyze, using the amplification
method of the present invention. To facilitate comparison between any two
samples of interest, the nucleic acid molecules are typically labeled
separately with fluorescent dyes that emit at different wavelengths, as
described herein. Some embodiments of this invention are based on
measuring the transcriptional rate of genes.
The transcriptional rate can be measured by techniques of hybridization to
arrays of nucleic acid or nucleic acid mimic probes, or by other gene
expression technologies. However measured, the result is either the
absolute, relative amounts of transcripts or response data including
values representing RNA abundance ratios, which usually reflect DNA
expression ratios (in the absence of differences in RNA degradation
rates). In various alternative embodiments of the present invention,
aspects of the biological state other than the transcriptional state, such
as the translational state, the activity state, or mixed aspects can be
measured. Preferably, measurement of the transcriptional state is made by
hybridization to transcript arrays, which are described in this
subsection. Certain other methods of transcriptional state measurement are
described later in this subsection.
Nucleic acid molecules made using the method of the present invention may
be used in "transcript arrays" (also called herein "microarrays").
Transcript arrays can be employed for analyzing the transcriptional state
in a biological sample and especially for measuring the transcriptional
states of a biological sample exposed to graded levels of a drug of
interest or to graded perturbations to a biological pathway of interest.
Transcript arrays are produced by hybridizing detectably labeled
polynucleotides representing the mRNA transcripts present in a cell (e.g.,
fluorescently labeled cRNA that is amplified by the methods of the present
invention) to a microarray. A microarray is a surface with an ordered
array of binding (e.g., hybridization) sites for products of many of the
genes in the genome of a cell or organism, preferably most or almost all
of the genes. Microarrays can be made in a number of ways, of which
several are described below. However produced, microarrays share certain
preferred characteristics: The arrays are reproducible, allowing multiple
copies of a given array to be produced and easily compared with each
other. Preferably the microarrays are small, usually smaller than 5
cm.sup.2, and they are made from materials that are stable under binding
(e.g., nucleic acid hybridization) conditions. A given binding site or
unique set of binding sites in the microarray will typically specifically
bind the product of a single gene in the cell.
A microarray is typically an array of polynucleotides, the array
comprising a support with at least one surface and multiple different
polynucleotides, each different polynucleotide comprising a different
nucleotide sequence and being attached to the surface of the support in a
different location on the surface. Preferably, the nucleotide sequence of
each of the different polynucleotide probes is in the range of about 20 to
1000 or more nucleotides in length. The array typically comprises
polynucleotides of more than one different sequence, but can have many
more, such as 2,000, 4,000, 10,000, 15,000, 20,000, 50,000, 80,000,
100,000 or more different nucleotide sequences.
The nucleotide sequence of each polynucleotide in the array may be
specific for a particular counterpart or complementary polynucleotide
sequence. The target polynucleotide sequences may also comprise expressed
polynucleotide sequences of a cell or organism. The nucleotide sequences
of the different polynucleotides of the array may be specific for one or
more of the genes in the entire genome of the cells to be analyzed.
Nucleic acid arrays may have more than one, or many thousands of nucleic
acid molecules per 1 cm.sup.2, and are typically arranged as a
positionally addressable array, in that the sequence of the nucleic acid
molecules at each position is known. The nucleic acid molecule in the
array may be DNA or RNA.
When cRNA complementary to the RNA of a cell is made and hybridized to the
nucleic acid molecules in a microarray under suitable hybridization
conditions, the level of hybridization to the site in the array
corresponding to any particular nucleotide sequence will reflect the
prevalence in the cell of mRNA transcribed from the gene corresponding to
that sequence. For example, when detectably labeled (e.g., with a
fluorophore) cRNA complementary to the total cellular mRNA is hybridized
to the nucleic acid molecules of a microarray, the site on the array
corresponding to a particular gene that is not transcribed in the cell
will have little or no signal (e.g., fluorescent signal), and a gene for
which the encoded mRNA is prevalent will have a more detectable signal.
Additionally, cRNAs from two different cells are hybridized to the nucleic
acid molecules of the microarray. In the case of drug responses one
biological sample is exposed to a drug and another biological sample of
the same type is not exposed to the drug. In the case of pathway responses
one cell is exposed to a pathway perturbation and another cell of the same
type is not exposed to the pathway perturbation. The cRNA derived from
each of the two cell types are differently labeled so that they can be
distinguished, for example, cRNA from a cell treated with a drug (or
exposed to a pathway perturbation) is synthesized using a fluorescein-labeled
NTP, and cRNA from a second cell, not drug-exposed, is synthesized using a
rhodamine-labeled NTP. When the two cRNAs are mixed and hybridized to the
microarray nucleic acids, the relative intensity of signal from each cRNA
set is determined for each site on the array, and any relative difference
in abundance of a particular mRNA detected.
In the example described above, the cRNA from the drug-treated (or pathway
perturbed) cell will fluoresce green when the fluorophore is stimulated
and the cRNA from the untreated cell will fluoresce red. As a result, when
the drug treatment has no effect, either directly or indirectly, on the
relative abundance of a particular mRNA in a cell, the mRNA will be
equally prevalent in both cells and, upon reverse transcription,
red-labeled and green-labeled cRNA will be equally prevalent. When
hybridized to the microarray, the binding site(s) for that species of RNA
will emit wavelengths characteristic of both fluorophores. In contrast,
when the drug-exposed cell is treated with a drug that, directly or
indirectly, increases the prevalence of the mRNA in the cell, the ratio of
green to red fluorescence will increase. When the drug decreases the mRNA
prevalence, the ratio will decrease.
The use of a two-color fluorescence labeling and detection scheme to
define alterations in gene expression has been described, e.g., in Schena
et al., Science (1995) 270:467-470, which is incorporated by reference in
its entirety for all purposes. An advantage of using cRNA labeled with two
different fluorophores is that a direct and internally controlled
comparison of the mRNA levels corresponding to each arrayed gene in two
cell states can be made, and variations due to minor differences in
experimental conditions (e.g., hybridization conditions) will not affect
subsequent analyses. However, it will be recognized that it is also
possible to use cRNA from a single cell, and compare, for example, the
absolute amount of a particular mRNA in, e.g., a drug-treated or
pathway-perturbed cell and an untreated cell.
Microarrays are known in the art and consist of a surface to which probes
that correspond in sequence to gene products (e.g., cDNAs, mRNAs, cRNAs,
polypeptides, and fragments thereof), can be specifically hybridized or
bound at a known position. The microarray may be an array (i.e., a matrix)
in which each position represents a discrete binding site for a product
encoded by a gene (e.g., a protein or RNA), and in which binding sites are
present for products of most or almost all of the genes in the organism's
genome. The "binding site" (hereinafter, "site") is a nucleic acid or
nucleic acid analogue to which a particular cognate cRNA can specifically
hybridize. The nucleic acid or analogue of the binding site can be, e.g.,
a synthetic oligomer, a full-length cRNA, a less-than full length cRNA, or
a gene fragment.
Nucleic acid hybridization and wash conditions are optimally chosen so
that the probe "specifically binds" or "specifically hybridizes" to a
specific array site, i.e., the probe hybridizes, duplexes or binds to a
sequence array site with a complementary nucleic acid sequence but does
not hybridize to a site with a non-complementary nucleic acid sequence. It
can easily be demonstrated that specific hybridization conditions result
in specific hybridization by carrying out a hybridization assay including
negative controls (see, e.g., Shalon et al., Genome Research (1996)
6:639-645 and Chee et al., Science (1996) 274:610-614). Optimal
hybridization conditions will depend on the length (e.g., oligomer versus
polynucleotide greater than 200 bases) and type (e.g., RNA, DNA, PNA) of
labeled probe and immobilized polynucleotide or oligonucleotide. General
parameters for specific (i.e., stringent) hybridization conditions for
nucleic acids are described in Sambrook et al., Molecular Cloning--A
Laboratory Manual, 2.sub.nd Edition, (Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y., 1989, Vol. 1-3) and in Current Protocols in Molecular
Biology, (Ausubel et al. (Eds.), Greene Publishing, Media, P A and Wiley-Interscience,
NY, 1987). When the cDNA microarrays of Schena et al. (P.N.A.S. USA (1996)
93:10614-10619) are used, typical hybridization conditions are
hybridization in 5.times.SSC plus 0.2% SDS at 65.degree. C. for four hours
followed by washes at 25.degree. C. in low stringency wash buffer
(1.times.SSC plus 0.2% SDS) followed by ten minutes at 25.degree. C. in
high stringency wash buffer (0.1.times.SSC plus 0.2% SDS) (Schena et al.,
(1996) supra). Useful hybridization conditions are also provided in, e.g.,
Hybridization With Nucleic Acid Probes, (Tijssen et al., (Ed.) Elsevier
Science Publishers B.V., The Netherlands and NY, USA, 1993) and
Nonisotopic DNA Probe Techniques, (Kricka et al., (Ed.) Academic Press,
San Diego, Calif., 1992).
Although simultaneous hybridization of differentially labeled mRNA samples
is preferred, it is also possible to use a single label and to perform
hybridizations sequentially rather than simultaneously.
When fluorescent labeled probes are used, the fluorescence emissions at
each site of a transcript array can be, preferably, detected by scanning
confocal laser microscopy. In one embodiment, a separate scan, using the
appropriate excitation line, is carried out for each of the fluorophores
used. Alternatively, a laser can be used that allows simultaneous specimen
illumination at wavelengths specific to the fluorophores and emissions
from the fluorophores can be analyzed simultaneously (see Shalon et al.
(1996) supra, which is incorporated by reference in its entirety for all
purposes). The arrays may be scanned with a laser fluorescent scanner with
a computer controlled X-Y stage and a microscope objective. Sequential
excitation of the fluorophores is achieved with a multi-line, mixed gas
laser and the emitted light is split by wavelength and detected with
multiple photomultiplier tubes. Fluorescence laser scanning devices are
described in Shalon et al. (1996) supra and in other references cited
herein. Alternatively, the fiber-optic bundle described by Ferguson et
al., Nature Biotechnol. (1996) 14:1681-1684, may be used to monitor mRNA
abundance levels at a large number of sites simultaneously.
Signals are recorded and analyzed by computer, e.g., using a twelve-bit
analog to digital board. The scanned image may be despeckled using a
graphics program (e.g., Hijaak Graphics Suite) and then analyzed using an
image gridding program that creates a spreadsheet of the average
hybridization at each wavelength at each site. If necessary, an
experimentally determined correction for "cross talk" (or overlap) between
the channels for the two fluors may be made. For any particular
hybridization site on the transcript array, a ratio of the emission of the
fluorophores can be calculated. The ratio is independent of the absolute
expression level of the cognate gene, but is useful for genes whose
expression is significantly modulated by drug administration, gene
deletion, or any other tested event or cause.
Relative abundance of mRNA in the biological samples is scored as a
perturbation and its magnitude determined (i.e., the abundance is
different in the two sources of mRNA tested), or as not perturbed (i.e.,
the relative abundance is the same). A difference between the sources of
RNA of at least a detectable level, such as a factor of about 25% (RNA
from one source is 25% more abundant in one source than the other source),
more usually about 50%, even more often by a factor of about two (twice as
abundant), three (three times as abundant) or five (five times as
abundant) is scored as a perturbation.
In addition to identifying a perturbation as positive or negative, it is
advantageous to determine the magnitude of the perturbation. This can be
carried out, as noted above, by calculating the ratio of the emission of
the fluorophores used for differential labeling, or by analogous methods
that will be readily apparent to those of skill in the art.
As an example, two samples, each labeled with a different fluor, are
hybridized simultaneously to permit differential expression measurements.
If neither sample hybridizes to a given spot in the array, no fluorescence
will be seen. If only one hybridizes to a given spot, the color of the
resulting fluorescence will correspond to that of the fluor used to label
the hybridizing sample (for example, green if the sample was labeled with
Cy3, or red, if the sample was labeled with Cy5). If both samples
hybridize to the same spot, an intermediate color is produced (for
example, yellow if the samples were labeled with fluorescein and rhodamine).
Then, applying methods of pattern recognition and data analysis known in
the art, it is possible to quantify differences in gene expression between
the samples. Methods of pattern recognition and data analysis are well
known to those skilled in the art.
The amplified RNA, or cDNA synthesized there from, may be used for
expression analysis using other platforms than DNA microarrays. Arrays of
polynucleotides can be made using addressable microspheres [Han et al.,
Nat Biotechnol. (2001) 19:631-635], where addresses may be created by a
combination of fluorescing materials, such as fluorescent compounds or
fluorescent nanoparticles. One set of microspheres carrying a unique
address is created by loading that set of microspheres with a unique
combination of fluorescent materials. The address is decoded by imaging
the microspheres, e.g. with a color CCD camera, and quantifying the
fluorescence at different wavelengths. The combination of colors and
intensity of each color can be modulated to create a detectable address.
By creating 10-100,000 optical addresses in such a way, a fluid array can
be created, where each specific polynucleotide is immobilized on one type
of microspheres carrying one particular optical address. If 100,000
optical addresses were created, it would be possible to analyze 100,000
different specific polynucleotides by hybridization analysis. The
microspheres carrying polynucleotides representing sequences of interest
are mixed and hybridized with labeled amplified RNA, or cDNA synthesized
thereof. The label used for labeling amplified RNA, or cDNA synthesized
thereof, may be fluorescent, radioactive, based on light scattering, using
fluorescent nanoparticles or enzymatic. A preferable label is fluorescent
or nanoparticle labeling. One benefit of hybridizing an array based on
microspheres instead of a planar microarray is that the volume of
hybridization can be dramatically reduced. A planar microarray containing
10,000 spots, representing potentially 10,000 individual sequences,
requires normally 50-100 microliters of hybridization buffer to cover the
surface. If we assume ten microspheres are utilized per address, 10,000
sequences would require 100,000 microspheres, which could be hybridized in
less than 1 microliter, assuming that the size of each microsphere is 10
micrometer in diameter or less. The reduction in hybridization volume will
likely enhance hybridization efficiency dramatically.
An additional aspect of the present invention relates to kits for the
detection or measurement of nucleic acid amplification products and for
determining the responses or state of a biological sample. In specific
embodiments, the kits comprise one or more thermostable DNA polymerases as
used in the method of the present invention, such as Taq DNA polymerase,
Bst DNA polymerase large fragment, Bst DNA polymerase native enzyme, Pfu
DNA polymerase, Tgo DNA polymerase, Phi29 DNA polymerase plus a
thermostabilizing agent such as trehalose, T7 DNA polymerase plus a
thermostabilizing agent such as trehalose, or the Klenow fragment of E.
coli polymerase 1 plus a thermostabilizing agent such as trehalose, in one
or more containers. The kit can further comprise additional components for
carrying out the amplification reactions of the invention. Where the
target nucleic acid sequence being amplified is one implicated in disease
or disorder, the kits can be used for diagnosis or prognosis. In a
specific embodiment, a kit is provided that comprises, in one or more
containers, an oligonucleotide primer for carrying out amplification, a
reverse transcriptase and optionally, a DNA polymerase, a thermostable
RNase H.
Oligonucleotides in containers can be in any form, e.g., lyophilized, or
in solution (e.g., a distilled water or buffered solution), etc.
Oligonucleotides ready for use in the same amplification reaction can be
combined in a single container or can be in separate containers. The kit
optionally further comprises a set of directions for carrying out
transcribing of amplified double-stranded cDNA into cRNA. The kit
optionally further comprises a control nucleic acid, and/or a microarray,
and/or means for stimulating and detecting fluorescent light emissions
from fluorescently labeled RNA, and/or expression profile projection and
analysis software capable of being loaded into the memory of a computer
system.
Another example is a kit for the detection of a selected target DNA target
sequence comprises in one or more containers (a) primers, and optionally:
(b) a control DNA target sequence; (c) an optimized buffer for
amplification; (d) appropriate enzymes for the method of amplification
contemplated, e.g., a thermostable DNA polymerase for second strand
synthesis; (d) a set of directions for carrying out amplification, e.g.,
describing the optimal conditions, e.g., temperature and time. Such kits
may further contain expression profile projection and analysis software
capable of being loaded into the memory of a computer system.
Claim 1 of 21 Claims
1. A method for amplifying at least one
mRNA in a sample containing a plurality of different mRNAs comprising: a)
synthesizing first strand cDNA by contacting under conditions conducive to
reverse transcriptase activity at least one mRNA in said sample with (i)
reverse transcriptase, and (ii) a first primer that is sufficiently
complementary to a sequence in the mRNA so as to prime synthesis in a
direction toward the 5' end of the mRNA; b) synthesizing a second strand
of cDNA by contacting under conditions conducive to a thermostable DNA
polymerase activity, said conditions comprising an incubation temperature
of from 55.degree. C. to 70.degree. C., the first strand cDNA with (i) a
thermostable DNA polymerase, which is Bst DNA polymerase large fragment,
and (ii) a thermostable RNase H, thereby forming double stranded cDNA; and
c) transcribing the double stranded cDNA into cRNA by contacting the
double stranded cDNA with an RNA polymerase under conditions conducive to
RNA polymerase activity, such that cRNA is produced. ____________________________________________
If you want to learn more
about this patent, please go directly to the U.S.
Patent and Trademark Office Web site to access the full
patent.
|