TECHNIQUES

The progress in understanding genomes has been possible because of powerful new techniques that have been developed to analyze genes and chromosomes and, in particular, to determine the sequence of extremely large molecules of DNA.

I. CLONING
II. DNA SEQUENCING
III. GENE TRANSFER
IV. DETERMINATION OF KARYOTYPES

 
I. CLONING  

Related Current Themes

Related Basic Sciences

CLONING OF LARGE ANIMALS

FROM THE ORGANISM TO THE GENE


What is encompassed by the term "cloning"?

The term "cloning" is used by biologists to describe forms of copying and amplification that can be applied at the level of genes, cells, or entire organisms. Therefore, care is needed to keep clearly in mind which type of cloning is involved for any application under discussion. Whereas cloning of genes is now routine and innocuous, cloning of vertebrates (entire organisms) is, depending on the species, difficult or unimaginable (at the very least, for ethical reasons, as in the case of human beings). The three levels of cloning can be further characterized in terms of their specific features:

  • Cloning of genes. The revolution in biotechnology began with the development of methods to isolate genes and transfer them to various organisms. Initial applications involved cloning human genes for expression in bacteria, for example the production of human insulin in the bacteria E. coli. Gene cloning is also at the heart of other forms of cloning, since part of the interest in cloning cells or organisms is to reproduce rapidly animals whose genes have been modified for particular medical or industrial purposes.

  • Cloning of cells. A complex organism is composed of numerous different types of cells. Most classes of cells can be isolated and grown in the laboratory, and these cells retain their specific properties. However, for a small class of cells known as "stem cells," when grown in the laboratory in the presence of specific growth factors, they can be converted to defined cell types (muscle cells, neurons, blood cells, etc.). Such cells could be re-injected into patients to overcome various pathological conditions and this procedure is called "therapeutic cloning".

  • Cloning of animals. An identical copy of an organism can be made by transferring the nucleus of body cells into a fertilized egg from which the original nucleus has been removed. The process has long been successful for plants and certain relatively simple animals, but only recently it has been extended to mammals -- initially to sheep (the famous Dolly, reported in February, 1997), but subsequently also to mice, cows, pigs, and goats. This procedure is called "reproductive cloning."

What methods are involved in cloning?

Each type of cloning involves very different methods.

  • Gene cloning makes use of gene transfer involving vectors, with specific operations that are based on the use of restriction enzymes to "cut and paste" portions of DNA at well-defined sequences.

  • Cloning of cells revolves around isolation of specific cell types and use of the methods of cell culture, i.e. controlled growth of cells in a well-defined chemical growth media. The growth occurs in small, covered glass or plastic dishes in an incubator that maintains a fixed temperature and appropriate CO2 levels. Some types of cells (neurons) are more difficult to grow in culture than other types (lymphocytes), and considerable success has been achieved recently in isolating human stem cells (relatively undifferentiated cells that can be induced by specific growth factors to develop into specialized cells, such as neurons, blood cells, or muscle cells). Ultimately, such cells may be used to treat medical problems (such as Parkinson's disease) that arise from loss of specific cells in the brain, although so far such treatments have been limited to disease models studied in mice.

  • Cloning of animals requires precise micro-methods to remove the nucleus of a fertilized egg and replace it with the nucleus of an appropriate somatic cell. The cell is then placed within a surrogate female for development of the organism, along the lines used for embryo transfer to produce chimeric mice. In principle, this method, were it to achieve a high level of efficiency (which is not yet the case) could make unlimited copies ("clones") of a single adult organism. Cloning on a small scale can also be achieved by imitating the procedure that produces human twins and artificially dividing a very early embryo (at the 2, 4, or 8 cell stage) into individual cells and re-implanting them to achieve separate embryos. However, this cloning by scission can only produce a small number of identical animals and therefore does not have the vast industrial potential of true cloning, which could in principle produce hundreds of identical animals in a relatively short time.

II. DNA SEQUENCING

BACK TO TOP OF PAGE

Related Current Themes

Related Basic Sciences

SEQUENCING THE HUMAN GENOME

FROM THE ORGANISM TO THE GENE
GENES

What is sequencing?

Sequencing refers to the process of determining the order of bases in the DNA. It can involve the sequence of a gene or the complete DNA molecules of an organism. For a bacteria, a single, circular DNA molecule must be sequenced, for example with about 4.6 million base pairs in the case of the bacteria found in the human gut,
E. coli. For a higher organism such a human, sequencing of the entire DNA involves 3,500 million bases distributed over 23 chromosomes. In all cases, very similar methods are used -- breaking the DNA into units of several hundred bases and then reconstructing the sequence of the larger segments of DNA by stringing together the sequences of the smaller units in the correct order.


What is the value of sequence information?

The sequence of bases in the DNA can be used to identify the genes and ultimately to define the proteins that occur in an organism. For human beings, mutations in genes result in modifications of proteins that can cause disease. Identifying such modifications opens the door to possible therapeutic approaches through the design of pharmaceutical agents or through gene therapy. On a broader scale, knowledge of the full sequence of the genome can provide insights into deeper questions of biology and the extent to which distantly related organisms such as the fruit fly and humans, share common proteins.


Which organisms have been sequenced?

Sequencing on the scale of the entire DNA of a small viruses (5000-50,000 base pairs) was first achieved in the late 1970's and early 1980's. For bacteria, with more than 1 million base pairs, the first full sequences were reported in the 1990's, as described in « La barque de Delphes » d'Antoine Danchin. Editions Odile Jacob. By October, 2000, over 30 bacteria were sequenced, including the following examples (in order of decreasing genome size):

Species

Size (million bp)

Date completed

Pseudomonas aeruginosa

6.26

May 16, 2000

Escherichia coli

4.64

Oct. 13, 1998

Mycobacterium tuberculosis

4.41

June 11, 1998

Bacillus subtilis

4.22

Nov. 20, 2000

Vibrio cholerae

2.96

June 14, 2000

Neisseria meningitidis MC58

2.27

Feb. 25, 2000

Haemophilus influenzae Rd

1.83

July 25, 1995

Rickettsia prowazekii

1.11

Nov. 12, 1998

Mycoplasma pneumoniae

0.81

Nov. 15, 1996

Mycoplasma genitalium

0.58

Oct. 30, 1995


For these organisms, the genes within the DNA sequence were readily identified (in bacteria, most of the DNA encodes genes), although many are of unknown function. Nevertheless, this information is providing insights into the evolution of bacteria and how the genes are organized in the these smallest of living cells. Ultimately, such information may permit novel organisms to be designed that provide new tools in biotechnology.

For larger organisms, sequencing has also been progressing rapidly and by the end of the 1990's, yeast (12 million base pairs), the nematode worm (97 million base pairs), and the fruit fly (160 million base pairs) were completed. In June of 2000, virtual completion (95%) of the full sequence of the human genome was announced, although the "annotation" (identification of all of the tens of thousands of genes within the sequences) required additional months. Since the mouse is the major animal of interest for research purposes, sequencing its genome also has a high priority. Early sequencing efforts involved international cooperation of many research groups, particularly for the yeast project. More recent efforts have concentrated on fewer, highly specialized laboratories and the fly and human sequences were obtained in large part by a private industrial company, Celera Genomics in Rockville, Maryland. Finally, the first complete genome of a small flowering plant of importance for research,
Arabidopsis thaliana, was completed in December, 2000. It contains118.7 million base pairs and about 26,000 genes, but astonishingly at least 70% of the genome has been duplicated, so there are only about 15,000 different genes.


What is the basic strategy for sequencing large organisms?

Initial sequencing activities with viruses and bacteria, as well as the international cooperation that completed the yeast sequence, were designed to break up the work into well defined fragments, whose locations on the overall chromosomes were known, and to assign the detailed sequencing to various collaborators. This approach had the advantage of maintaining strict accounting of all of the DNA and as each fragment was sequenced it could be placed precisely in relation to other sequences.

A new strategy was initiated by the Craig Venter, founder of Celera Genomics, and applied initially to several bacterial sequence projects. This approach involved sequencing fragments of DNA generated at random and then using a computer to identify overlapping sequences from different fragments to assign the fragments to their proper order in the chromosomes. This strategy permitted the fruit fly sequence to be obtained in a relatively short time and contributed to accelerating the completion of the human sequence. Nevertheless, the approach may leave some small gaps that to be filled will require a conventional analysis based on isolating particular DNA fragments.


How are the details of sequence determined?


All sequence strategies rely ultimately on the determination of the exact order of the bases in a fragment of DNA. The modern sequencing techniques are based on the principle of copying one of the strands of the DNA fragment, but under conditions that lead to not only complete copying, but also to a series of incomplete copies. The copying reaction involves the enzyme DNA polymerase, a primer sequence upon which chain elongation occurs, and sufficient amounts of the four bases (G, T, A, and C) in the form of deoxynucleotides. At each step in the synthesis of a new strand of DNA, the next base is added by formation of an ester between the phosphate of the incoming nucleotide and the hydroxyl group at the 3' position of the sugar portion of the last base in the newly synthesized chain. For example, the DNA sequence on one strand:

-G-T-G-C-A-C-C-T-G-A-C-T-C-C-T-G-A-G-


will lead to the synthesis of a strand (in bold) with the complementary sequence:

-G-T-G-C-A-C-C-T-G-A-C-T-C-C-T-G-A-G-
-C-A-C-G-T-G-G-A-C-T-G-A-G-G-A-C-T-C-

The key to sequencing involves adding dideoxynucleotides to the reaction mixture. Since these molecules do not contain a 3' hydroxyl group, when they are incorporated into the growing chain, no additional chain elongation can occur. When the dideoxynucleotides are also coupled to a colored dye (with a different color for A, T, G, and C), the incomplete chains are colored differently, depending on which base in the dideoxy form interrupted the elongation reaction. In this case, some complete new strands will be produced, but a series of incomplete strands will also be produced ending with a distinctive color, including:

-C-A-C-G-T-G-G yellow
-C-A-C-G-T-G-G-A green
-C-A-C-G-T-G-G-A-C blue
-C-A-C-G-T-G-G-A-C-T red

and so on, creating a set of incomplete strands that vary in length by one base and carry a distinctive color that indicates the nature of the last base. The mixture of strands is then fractionated by passage through a capillary, such that the incomplete strands emerge in order of size. With the appropriate equipment, the colors that appear at the exit of the capillary provide a direct readout of the sequence.


By obtaining the sequence of both strands of the double stranded-DNA separately, as well as the sequence of overlapping fragments, this method provides sequence data of high reliability. The only "complication" for sequencing the human genome was to obtain extremely large amounts of such sequence data, necessitating the use of hundreds of highly automated machines.


III. GENE TRANSFER

BACK TO TOP OF PAGE

Related Current Themes

Related Basic Sciences

ADVANCEMENTS IN HEALTH CARE
GENETIC ENGINEERING
UNDERSTANDING LIFE

GENES

What is gene transfer?

An individual gene carries the information that encodes the structure of a protein. Therefore, transfer of the gene from one organism to another can enable the protein to be produced in an organisms where the gene had not been initially present. The transfer can ultimately lead to the insertion of a gene from a plant or animal or bacteria into a different plant, animal or bacterial species. However, in all cases two distinct steps are required to carry out the transfer:

Step 1, isolation: Since genes occur as tiny fractions of the entire length of unwieldy chromosomes, the various operations for utilizing a gene start by incorporating the gene into a smaller structure that is easier to isolate and to handle. What is needed is a length of DNA just sufficient to encompass the gene.

Step 2, transfer: In order to complete the transfer, the DNA of the gene must be associated with additional DNA elements that permit the ensemble to replicate in cells that are easy to grow (such as a bacteria). The ensemble generally also includes the presence of some "markers" on the DNA can facilitate screening for its presence. Various DNA constructions of this type have been developed to carry a gene from one type of cell to another and such constructions are called "vectors".


How is isolation of a gene (step 1) achieved?


All gene transfers begin with the identification of the gene. Some genetic diseases were first identified by observing a modified protein (as in the case of sickle cell anemia) and the gene was later identified on the basis of the structure of the protein. More commonly at the present time, a genetic disease is recognized by clinical symptoms and the gene is then identified through analysis of afflicted families. The identification of genes associated with a disease will be increasingly facilitated as all of the genes in the human genome are identified in the completed DNA sequence.

Once identified, the gene must be isolated in a suitable form and amplified to make it available in sufficient quantities for subsequent steps. Isolating a gene from a large genome, such as the human genome with tens of thousands of genes, requires particularly specific methods to find the gene of interest among such a large number of other genes. Two approaches can be used for isolating a gene. One approach involves screening for a fragment of DNA that contains the gene of interest using a gene bank within bacterial viruses called phage. In this case, once a phage containing the gene is isolated, amplification can be achieved by growing the phage in sufficient quantities to provide a source of the DNA for completing the transfer, as summarized below. The other approach involves isolating and amplifying the gene in a single procedure directly from a preparation of DNA using specific probes in conjunction with a powerful method called PCR (for polymerase chain reaction) in order to provide sufficient quantities.

PHAGE GENE BANK

LEGEND: Phage Gene Bank: When bacteriophage are grown on a continuous lawn of bacteria on a culture plate, small clear spots (called "plaques") will appear where one virus has infected one cell and then, after reproducing, has infected the neighboring cells, with the process repeating until a visible plaque is formed. If a complete sample of human DNA has been cut into small pieces of DNA and inserted into the DNA of an appropriate phage vector, when the phage are plated on a bacterial lawn, each plaque will contain the DNA of just one or two human genes. (Pieces of genes will also be present, so following isolation in this way, the DNA must be tested to insure that it contains the entire gene.) Then, if a suitable probe of the gene is available (for example a chemically synthesized strand of DNA whose composition is determined -- using the genetic code -- from the amino acid sequence of the protein specified by the gene), the probe can be added in radioactive form. It will hybridize to the DNA only of phage particles that carry the gene. Then the plates are covered with photographic film and a spot will appear only where the radioactivity darkens the film. In this way phage particles from the plaque can be isolated, grown in large quantities, and the DNA from the phages can treated with an appropriate restriction enzyme to isolate the gene.


What is the basis for isolating a gene by PCR?


With PCR, a gene or any fragment of DNA can be amplified, by using two specific "primers" that bind to the DNA on each side of the region to be amplified. The method works by repeating a cycle of reactions many times that involves heating the DNA to separate the strands, lowering the temperature to bind the primers, and extending the primers to synthesize a new strand of DNA. The extension step is carried out in the presence of a supply of the 4 bases in the appropriate form and a special polymerase that is able to operate at relatively high temperature. The number of copies of the gene defined by the two primers doubles with each reaction cycle. Typically about 30 cycles are used, so beginning with just one copy of the gene to be amplified, the number of copies at the end of the 30 cycles would be 2
30 or about one billion.


How is transfer of the isolated gene (step 2) achieved?


In order for transfer to be carried out, the gene must be incorporated into an appropriated vector, one that is designed to take into account the final destination of the gene, as well as the host in which the vector will be grown. The vector is often a bacterial plasmid, but specific vectors exist for yeast and other cells used in the cloning process. Bacterial plasmids are the generally used because of the rapidity of growth of bacteria and the ease of handling. Plasmids are small circular molecules of DNA (usually containing about 100,000 base pairs) that are capable of replicating freely within a bacterial cell, generally the common gut bacteria of humans,
Escherichia coli. Their replication requires a specific DNA sequence called an "origin of replication" and plasmids are generally constructed with several genes that can serve as selectable markers to test whether the gene of interest has been correctly inserted and whether the bacterial cells have incorporated the plasmid. In the example presented below the plasmid contains a gene for resistance to the antibiotic tetracycline plus a gene for resistance to another antibiotic, ampicillin. By inserting the gene to be transferred into the plasmid at a position that interrupts the ampicillin gene, the plasmid can no longer confer resistance to ampicillin. Therefore, when the plasmid containing the new gene is added back to bacterial cells, only cells containing the plasmid will grow on tetracycline. However, when these same cells are tested on ampicillin, they will be killed, so only tetracycline-resistant cells that are ampicillin-sensitive will be retained for the further cloning steps.

BACTERIAL PLASMID USED AS A VECTOR


What specific tools are used to incorporate a gene into a plasmid?


The actual transfer of a gene into a vector is carried out with the aid of specialized enzymes, called restriction enzymes, that cut the DNA only at certain specific sequences. In the example presented below, the enzyme BamHI cuts DNA between the adjacent G's on both strands (indicated by the vertical arrows):


Restriction enzymes typically cut DNA at a sequence that is "palindromic," that is the sequence on one strand is identical to the sequence on the other strand read in the opposite direction. As a result each strand is left with identical "sticky ends" (....GATCC in this example) that can be used to attach an added fragment of DNA (with the gene of interest) that also contains these sticky ends. When the DNA containing the new gene is added, the sticky ends are stabilized in chemical bonds by the use of another enzyme called DNA ligase. By choosing a restriction enzyme that cuts the DNA within the ampicillin gene, as shown in the figure for BamHI, the capacity of the plasmid to confer resistance to ampicillin will be lost when the gene to be transferred is inserted.

Depending on the objectives, the gene may be transferred to other vectors designed for specialized purposes. For example, if the objective is production of the protein produced by the gene, a vector will be used that carries a powerful promoter in front of the gene, to insure that the protein is expressed in large quantities. If the protein is to used to transform mouse cells, a different type of vector is needed, as described in detail in the section for introducing genes into mice. For transformation of human cells, the most commonly used vectors are viruses that have been inactivated with respect to any harmful activities, but in which new genes have been added for delivery to the target cells.


What are the specific applications of gene transfer?


Gene transfer refers to a range of modifications in the genetic make-up of an organism. The broad categories include:

  • Genetic reprogramming of bacteria or other "production" organisms in order to obtain proteins for industrial or medical purposes. Examples of this type of application include introducing the gene for a human protein such as insulin that can be used to produce insulin from bacteria for medical purposes. While bacteria are the organisms that are the most conveniently modified (and easy to grow rapidly in large quantities), for certain applications it is preferable to use yeast cells, plant or animal cells in culture, or entire plants or animals. Considerable success has been achieved with all types of cells in culture, and with entire plants. In one striking example, human hemoglobin was produced in tobacco plants. For large animals, preliminary results have been achieved in some cases for the production of proteins, for example production of small quantities of human clotting factor IX in the milk of goats.
  • Genetic reprogramming of plants for crop production. Methods have been developed that permit extensive genetic modification of plants, including those that are now at the center of the controversy involving genetically-modified crops that are opposed by some consumer and ecology groups. Genes can be conveniently transferred to plant cells and production of complete plants from single plant cells is readily achieved. In this respect the cloning of plants is much simpler than the cloning of animals. Crops that produce pesticides have been available for some time and recently a yellow form of rice has been produced that synthesizes the precursor of vitamin A.
  • Genetic reprogramming of mice for experimental purposes. Among mammals, research on mice has progressed to an advanced stage, permitting selected genes within the germ cells to be eliminated (gene knock-out) or introduced (gene knock-in), thereby leading to the production of genetically-modified mice in which all cells of the animal have been modified. Moreover, by combining gene transfer with the insertion of certain DNA sequences (called promoters) that control the expression of genes, it has been possible to produce animals in which the desired gene is expressed only in certain tissues. A striking application of these methods was the recent report concerning production of "smart" mice with extra neurotransmitter receptors in their brains. The mice are considered "smart" because they exhibit superior ability in learning and memory for various behavioral tasks.
  • Genetic reprogramming of humans. Compared to the achievements with mice, the status of genetic modifications of humans is much more limited, for both technical and practical reasons (as well as obvious ethical reasons). The modification of a gene in all the cells of a human being is far beyond current technical abilities and ethical considerations also impose restrictions. To date, limited success has been achieved that involved introducing a correct gene into certain cells of individuals with a deficiency in that gene due to a mutation. For example, a rare immunological genetic disease (SCID) has been treated by introducing a gene into lymphocytes. In addition, encouraging preliminary results were obtaining by introducing a gene for a coagulation factor into muscle cells of hemophilic individuals.



Why is gene transfer so important?


Gene transfer permits genetic modifications to be accomplished that would be extremely laborious at best, but often completely impossible to achieve, by classical genetic methods -- mutagenesis, many generations of mating, and screening of mice to determine if only one gene had been affected. Therefore, where the goal is to develop an organism with a specific genetic modification, gene transfer represents a method that provides a dramatic short cut compared to classical methods.


How is a gene introduced into a complex organism, such as mouse?


Many of the methods currently used on animals were developed in mice, since mice are easy to handle and reproduce rapidly. Mouse embryo-derived stem (ES) cells can be grown in culture and genes introduced with the appropriate vectors. Stem cells have the capacity to differentiate into any of the specific types of cell (nerve, muscle, skin, bone, liver, etc.) found in an adult organism. Although the genes are not always introduced correctly or efficiently into all stem cells treated, clever methods have been developed that use special markers to test whether the gene has been properly inserted. A "positive marker" allows only those cells to grow that have inserted the new genetic construction and a "negative marker" kills cells that have inserted the genetic construction incorrectly. Then the transformed cells can be inserted into an early embryo. The transformed cells will then give rise to certain tissues and organs and in some cases the gene will be incorporated into sperm. When mice that produce sperm containing the modified gene are mated with normal mice, mice that are heterozygous for the mutation in all of their cells will be produced. One-quarter of the progeny of such heterozygous mice will be homozygous for the transformed gene. The overall process thus involves several complicated steps, but it is routinely practiced in many laboratories throughout the world. The various steps are examined in more detail below.

The initial phase of the methods involves the exchange of DNA sequences between the newly added sequences introduced into an appropriate vector and sequences present in the ES cells. This process developed by Mario Capecchi in Utah is called "gene-targeting." It takes advantage of a natural cellular phenomenon called homologous recombination by which strands of nucleotides from an added gene align with matching sequences already present in the cell. When the newly inserted gene finds its target, it will line up with it and replace it in a small fraction of the cells treated. The process, summarized in the figure below, occurs even if the new gene carries inserted sequences that turn off the original gene or modify its activity, since the larger gene will loop out, as depicted in the figure, allowing the sequences at the extremes to find their homologous partners. When this process does occur, it will also introduce the "positive" marker in the cell (for example resistance to neomysin) since the gene for the resistance is used to interrupt the gene that is homologous to the targeted gene. Hence, only the transformed cells will survive when grown in the presence of neomysin. A "negative" marker, for example the gene for DTA, (the A fragment of diphtheria toxin that kills cells by interfering with protein synthesis) is also used, since the gene will be eliminated in cells that correctly incorporate the interrupted gene in place of the target gene by homologous recombination. However, in cells that have incorrectly incorporated the interrupted gene, homologous recombination is not involved and the entire construct including the DTA gene will be inserted at random. These latter cells will express the DTA gene and be killed by the toxin. Thus when cells are grown in the presence of neomysin only those that have correctly inserted the interrupted gene will survive.

HOMOLOGOUS RECOMBINATION

The correctly modified ES cells are then used to produce chimeric mice, and ultimately mice with a specific gene knockout, as summarized in the figure below:

PRODUCTION OF CHIMERIC MICE FOR GENE KNOCK-OUT

A powerful application of the method has been to insert DNA sequences that correspond to a gene with a large interruption, so that when homozygous mice are produced the gene of interest is effectively eliminated. Hundreds of different lines of mice with such a gene knock-out have been produced in laboratories around the world and have provided numerous insights into the function of the genes in question. Similar methods have been used to insert specific genes (gene knock-in).

Gene transfer methods can be applied to large animals, but each step is more laborious, much longer to carry out, more expensive, and therefore difficult to maintain for a large enough number of animals to insure success. In principle, once a genetically modified mouse or large animal is produced, exact copies could be reproduced by cloning, that is taking the nucleus from a somatic cell and transferring it to an egg cell. However, although much excitement was generated by the cloning of sheep, notably Dolly, and other animals subsequently, the methods for cloning are extremely inefficient and unreliable, and as a result are not yet of practical use. Much more basic knowledge in biology is required before such approaches can be considered well established.


Do the same principles apply to plants?


Genes can also be readily introduced into plants using methods similar to those employed for animals. Indeed, plants possess a number of advantages that enabled gene transfer to be applied on a large scale for plants earlier than for animals. Some special steps are needed, however, because of the cell wall of plants that makes the introduction of foreign DNA more difficult. One common solution is the "gene gun" that literally shoots genes into plant cells. However, once a genetically modified plant cell is produced, obtaining plants and seeds is relatively straightforward. Compared to the difficulties of Dolly-type cloning of animals, cloning of plants is based on methods that have been solidly established for decades. Plant cells are generally totipotent, that is any cell can be isolated and when properly cultivated will grow into a full plant, complete with many genetically modified seeds. Therefore, genetically modified plants can be produced more readily than genetically modified animals, especially mammals. As a result the practical issues of genetically modified organisms concern almost exclusively plants that have been modified plants to produce their own pesticides or yield improved nutrients, such a rice modified to produce provitamin A.


IV. DETERMINATION OF KARYOTYPES

BACK TO TOP OF PAGE

Related Current Themes

Related Basic Sciences

IDENTIFICATION OF INDIVIDUALS

FROM THE ORGANISM TO THE GENE

What is a karyotype?

The genes of plants and animals correspond to segments along the DNA molecules that constitute the active portion of chromosomes. The cells of a particular species of a plant or animal each contain a characteristic number of chromosomes (generally an even number, since for organisms with sexual reproduction, half are provided by the egg and half by the sperm, or the equivalent gametes of plants). In addition, each chromosome has a characteristic size. Therefore, by examining the karyotype of a fetus, it is possible to detect abnormalities that have altered the size or number of chromosomes. The normal human karyotype involves 22 pairs of autosomal (non-sex) chromosomes, plus a pair of sex chromosomes (X or Y), for a total of 46, the "diploid" number, whereas each gamete possesses the haploid number (23). In a diploid cell, the presence of 2 X chromosomes identifies a female, whereas one X chromosome and one Y chromosome correspond to a male.


What information can be obtained from a karyotype?


The karyotype provides information on larger scale modifications, compared to the small scale changes in bases that constitute mutations in genes. Thus, one of the first major conditions for which karyotypes provided critical information (and for which karyotyping is still widely used for prenatal diagnosis) is trisomy 21. In this case a third compete chromosome 21 is present. Other conditions that are revealed by karyotyping during prenatal diagnosis include the fragile X syndrome (that can be identified by a section of the X chromosome that appears somewhat detached from the main portion), syndromes related to extra sex chromosomes (XXY and XYY), and other transformations that can lead to severe abnormalities (for example, the Robertson translocation that involves an end to end fusion of chromosomes 13 and 14). In addition, numerous other rearrangements and translocations have been reported that are characteristic of acquired disease states, including several related to cancer, such as the Philadelphia chromosome (named after the city in which it was discovered) which is seen in about 90% of patients with chronic myelogenous leukemia (CML). In this translocation a part of chromosome 22 is attached to chromosome 9.

Karyotypes can also be used to compare relatedness of species. For example, chimpanzees possess 48 chromosomes, but the chromosomes are extremely similar to those of humans, with the large human chromosome 2 appearing as a fusion of two smaller chimpanzee chromosomes. For more distantly related species, the numbers of chromosomes vary considerably. The numbers of chromosomes (diploid) for a selection of animal species are, cow: 60; dog: 78; cat: 38; mouse: 40; rat: 42; carp: 104; fruit fly: 8; nematode: 11 or 12 (male versus female). Similar principles apply to plants and examples for the numbers of chromosomes are: potato: 48; tomato: 24; corn: 20.


How is a karyotype obtained?


A karyotype can be prepared by chemical treatment of isolated cells, commonly lymphocytes, to visualize the chromosomes (in the condensed form that occurs during mitosis), with appropriate staining, by microscopy. While the individual chromosomes can be recognized by size, identification is greatly facilitated by staining with the dye Giemsa that introduces characteristic patterns of coloration (called G bands) in each chromosome; in this way different chromosomes that are nearly the same size can be readily distinguished. A representative karyotype is presented below, as a typical chromosome spread (for a male) on the left and the alignment of the observed chromosomes by size on the right.

The banding patterns are summarized in the idealized representation of the chromosomes below:

.