Thursday, March 26, 2009

R.A. Fisher


Sir Ronald Aylmer Fisher was an English statistician, evolutionary biologist, and geneticist. He was described by Anders Hald as "a genius who almost single-handedly created the foundations for modern statistical science" and Richard Dawkins described him as "the greatest of Darwin's successors".

Fisher was born in East Finchley in London, England, to George and Katie Fisher. His father was a successful fine arts dealer. He had a happy childhood, being doted on by three older sisters, an older brother, and his mother, who died when Fisher was 14. His father lost his business in several ill-considered transactions only 18 months later.

Although Fisher had very poor eyesight he was a precocious student, winning the Neeld Medal (a competitive essay in Mathematics) at Harrow School at the age of 16. Because of his poor eyesight, he was tutored in mathematics without the aid of paper and pen, which developed his ability to visualize problems in geometrical terms, as opposed to using algebraic manipulations. He was legendary in being able to produce mathematical results without setting down the intermediate steps. He also developed a strong interest in biology, and, especially, evolution.

In 1909 he won a scholarship to Gonville and Caius College, Cambridge. There he formed many friendships and became enthralled with the heady intellectual atmosphere. At Cambridge, Fisher learned of the newly rediscovered theory of Mendelian genetics; he saw biometry—and its growing corpus of statistical methods—as a potential way to reconcile the discontinuous nature of Mendelian inheritance with continuous variation and gradual evolution. However, his foremost concern was eugenics, which he saw as a pressing social as well as scientific issue that encompassed both genetics and statistics. In 1911 he was involved in forming the Cambridge University Eugenics Society with such luminaries as John Maynard Keynes, R. C. Punnett and Horace Darwin (Charles Darwin's son). The group was active, and held monthly meetings, often featuring addresses by leaders of mainstream eugenics organizations, such as the Eugenics Education Society of London, founded by Charles Darwin's half-cousin, Francis Galton in 1909.

After graduating in 1912, Fisher was eager to join the army in anticipation of Great Britain's entry into World War I; however, he repeatedly failed the medical examinations because of his eyesight. Over the next six years, he worked as a statistician in the City of London. For his war work, he took up teaching physics and mathematics at a series of public schools, including Bradfield College in Berkshire, as well as aboard H.M. Training Ship Worcester. Major Leonard Darwin (another of Charles Darwin's sons) and an unconventional and vivacious friend he called Gudruna were almost his only contacts with his Cambridge circle. They sustained him through this difficult period. A bright spot in his life was that Gudruna matched him to her sister Eileen Guinness; they married in 1917 when she was only 17. With the sisters' help, he set up a subsistence farming operation on the Bradfield estate, where they had a large garden and raised animals, learning to make do on very little. They lived through the war without ever using their food coupons.

During this period, Fisher started writing book reviews for the Eugenic Review and gradually increased his interest in genetic and statistical work. He volunteered to undertake all such reviews for the journal, and was hired to a part-time position by Major Darwin. He published several articles on biometry during this period, including the ground-breaking "The Correlation Between Relatives on the Supposition of Mendelian Inheritance", written in 1916 and published in 1918. This paper laid the foundation for what came to be known as biometrical genetics, and introduced the very important methodology of the analysis of variance, which was a considerable advance over the correlation methods used previously. The paper showed very convincingly that the inheritance of continuously varying traits, those measured by real-valued variables, is consistent with Mendelian principles.

With the end of the war he went looking for a new job, and was offered one at the famed Galton Laboratory by Karl Pearson. Because he saw the developing rivalry with Pearson as a professional obstacle, however, he accepted instead a temporary job as a statistician with a small agricultural station in the country in 1919.

In 1919 Fisher started work at Rothamsted Experimental Station located at Harpenden, Hertfordshire, England. Here he started a major study of the extensive collections of data recorded over many years. This resulted in a series of reports under the general title Studies in Crop Variation. This began a period of great productivity. Over the next seven years, he pioneered the principles of the design of experiments and elaborated his studies of "analysis of variance". He furthered his studies of the statistics of small samples. Perhaps even more important, he began his systematic approach to the analysis of real data as the springboard for the development of new statistical methods. He began to pay particular attention to the labour involved in the necessary computations, and developed methods that were as practical as they were founded in rigour. In 1925, this work culminated in the publication of his first book, Statistical Methods for Research Workers. This went into many editions and translations in later years, and became a standard reference work for scientists in many disciplines. In 1935, this was followed by The Design of Experiments, which also became a standard.

In addition to "analysis of variance", Fisher invented the technique of maximum likelihood and originated the concepts of sufficiency, ancillarity, Fisher's linear discriminant and Fisher information. His 1924 article "On a distribution yielding the error functions of several well known statistics" presented Karl Pearson's chi-squared and Student's t in the same framework as the Gaussian distribution, and his own "analysis of variance" distribution z (more commonly used today in the form of the F distribution). These contributions easily made him a major figure in 20th century statistics.

In defending the use of the z distribution when the data were not Gaussian, Fisher introduced the "randomization test". According to biographers Yates and Mather, "Fisher introduced the randomization test, comparing the value of t or z actually obtained with the distribution of the t or z values when all possible random arrangements were imposed on the experimental data."

However, Fisher wrote that randomization tests were "in no sense put forward to supersede the common and expeditious tests based on the Gaussian theory of errors." Fisher thus effectively began the field of non-parametric statistics, even though he didn't believe it was a necessary move.
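The randomization test described by Yates and Mather can be sketched in a few lines. The version below is only an illustration under stated assumptions: it uses the absolute difference in group means as the test statistic (a simplification, not the t or z statistic Fisher worked with), the function name is hypothetical, and the yield figures are made-up numbers, not data from any of Fisher's experiments.

```python
from itertools import combinations
from statistics import mean


def randomization_test(group_a, group_b):
    """Exact two-sided randomization test on the difference in group means.

    Returns the fraction of all possible re-assignments of the pooled
    values whose absolute mean difference is at least as large as the
    one actually observed.
    """
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    observed = abs(mean(group_a) - mean(group_b))

    extreme = 0
    total = 0
    # Enumerate every way of assigning n_a of the pooled values to group A.
    for idx in combinations(range(len(pooled)), n_a):
        in_a = set(idx)
        a = [pooled[i] for i in idx]
        b = [pooled[i] for i in range(len(pooled)) if i not in in_a]
        # Small tolerance guards against floating-point ties.
        if abs(mean(a) - mean(b)) >= observed - 1e-12:
            extreme += 1
        total += 1
    return extreme / total


# Hypothetical yields from two treatments (illustrative numbers only).
treated = [12.1, 11.8, 13.0, 12.6]
control = [10.9, 11.2, 10.5, 11.0]
p_value = randomization_test(treated, control)
print(p_value)
```

Full enumeration is only practical for small samples, since the number of arrangements grows combinatorially; larger problems are usually handled by sampling random arrangements instead.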

His work on the theory of population genetics also made him one of the three great figures of that field, together with Sewall Wright and J. B. S. Haldane, and as such he was one of the founders of the neo-Darwinian modern evolutionary synthesis. In addition to founding modern quantitative genetics with his 1918 paper, he was the first to use diffusion equations to attempt to calculate the distribution of gene frequencies among populations. He pioneered the estimation of genetic linkage and gene frequencies by maximum likelihood methods, and wrote early papers on the wave of advance of advantageous genes and on clines of gene frequency. His 1950 paper on gene frequency clines is notable as the first application of computers to biology.

Fisher had a long and successful collaboration with E.B. Ford in the field of ecological genetics. The outcome of this work was the general recognition that the force of natural selection was often much stronger than had been appreciated before, and that many ecogenetic situations (such as polymorphism) were not selectively neutral but were maintained by the force of selection. Fisher was the original author of the idea of heterozygote advantage, which was later found to play a frequent role in genetic polymorphism. The discovery of indisputable cases of natural selection in nature was one of the main strands in the modern evolutionary synthesis.

Fisher introduced the concept of Fisher information in 1925, some years before Shannon's notions of information and entropy. Fisher information has been the subject of renewed interest in the last few years, due to B. Roy Frieden's book Physics from Fisher Information, which attempts to derive the laws of physics from a Fisherian starting point.
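For readers who have not met the quantity, a minimal sketch may help. It assumes the standard textbook definition (the expected squared derivative of the log-likelihood with respect to the parameter) and applies it to a single Bernoulli trial with success probability p; neither the definition nor the example comes from this post, and the function name is hypothetical.

```python
def bernoulli_fisher_information(p):
    """Fisher information of one Bernoulli(p) draw: E[(d/dp log f(X; p))^2]."""
    # Score (derivative of the log-likelihood) for each possible outcome.
    score_if_one = 1.0 / p             # d/dp log(p)
    score_if_zero = -1.0 / (1.0 - p)   # d/dp log(1 - p)
    # Expectation of the squared score over X = 1 (prob. p) and X = 0 (prob. 1 - p).
    return p * score_if_one ** 2 + (1.0 - p) * score_if_zero ** 2


info = bernoulli_fisher_information(0.25)
# Agrees with the closed form 1 / (p * (1 - p)).
print(info, 1.0 / (0.25 * 0.75))
```

The information is smallest at p = 0.5 and grows as p approaches 0 or 1, which matches the intuition that a single trial pins down an extreme probability more sharply than a middling one.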

Fisher was an ardent promoter of eugenics, which also stimulated and guided much of his work in the genetics of humans. His book The Genetical Theory of Natural Selection was started in 1928 and published in 1930. It contained a summary of what was already known in the literature. He developed ideas on sexual selection, mimicry and the evolution of dominance. He famously showed that the probability of a mutation increasing the fitness of an organism decreases proportionately with the magnitude of the mutation. He also proved that larger populations carry more variation so that they have a larger chance of survival. He set forth the foundations of what was to become known as population genetics.

About a third of the book concerned the applications of these ideas to humans, and presented what data were available at the time. He presented a theory that attributed the decline and fall of civilizations to their arrival at a state where the fertility of the upper classes is forced down. Using the census data of 1911 for Britain, he showed that there was an inverse relationship between fertility and social class. This was partly due, he believed, to the rise in social status of families who were not capable of producing many children but who rose because of the financial advantage of having a small number of children. Therefore he proposed the abolition of the economic advantage of small families by instituting subsidies (he called them allowances) to families with larger numbers of children, with the allowances proportional to the earnings of the father. He himself had two sons and six daughters. According to Yates and Mather, "His large family, in particular, reared in conditions of great financial stringency, was a personal expression of his genetic and evolutionary convictions."

The book was reviewed, among others, by physicist Charles Galton Darwin, a grandson of Charles Darwin's, and following publication of his review, C. G. Darwin sent Fisher his copy of the book, with notes in the margin. The marginal notes became the food for a correspondence running at least three years. Fisher's book The Genetical Theory of Natural Selection also had a major influence on the evolutionary biologist W. D. Hamilton and the development of his later theories on the genetic basis for the existence of kin selection.

Between 1929 and 1934 the Eugenics Society also campaigned hard for a law permitting sterilization on eugenic grounds. They believed that it should be entirely voluntary, and a right, not a punishment. They published a draft of a proposed bill, and it was submitted to Parliament. Although it was defeated by a 2:1 ratio, this was viewed as progress, and the campaign continued. Fisher played a major role in this movement, and served on several official committees to promote it.

In 1934, Fisher moved to increase the power of scientists within the Eugenics Society, but was ultimately thwarted by members with an environmentalist point of view, and he, along with many other scientists, resigned.

His interest in eugenics, and his experiences working on a Canadian farm, made Fisher interested in starting a farm of his own. In these plans he was encouraged by Gudruna, the wife of a college friend, and this led to his meeting Ruth Eileen Gratton Guinness, Gudruna's younger sister. Their father, Dr Henry Gratton Guinness, had died when they were young. Ruth Eileen was only sixteen years of age when she met Fisher. She knew that her mother would not approve of her marrying so young. As a result Fisher married Ruth Eileen at a secret wedding ceremony without her mother's knowledge, on 26 April 1917, only days after Ruth Eileen's 17th birthday. They had two sons and seven daughters, one of whom died in infancy. His daughter Joan married George E. P. Box and wrote a well-received biography of her father.

As an adult, Fisher was noted for his loyalty to his friends. Once he had formed a favourable opinion of any man, he was loyal to a fault. A similar sense of loyalty bound him to his culture. He was a patriot, a member of the Church of England, politically conservative, and a scientific rationalist. Much sought after as a brilliant conversationalist and dinner companion, he very early on developed a reputation for carelessness in his dress and, sometimes, his manners. In later years he was the archetype of the absent-minded professor.

He knew the scriptures well, and H. Allen Orr, writing in the Boston Review, describes him as a "deeply devout Anglican who, between founding modern statistics and population genetics, penned articles for church magazines". But he was not dogmatic in his religious beliefs. In a 1955 broadcast on Science and Christianity, he said, "The custom of making abstract dogmatic assertions is not, certainly, derived from the teaching of Jesus, but has been a widespread weakness among religious teachers in subsequent centuries. I do not think that the word for the Christian virtue of faith should be prostituted to mean the credulous acceptance of all such piously intended assertions. Much self-deception in the young believer is needed to convince himself that he knows that of which in reality he knows himself to be ignorant. That surely is hypocrisy, against which we have been most conspicuously warned."


It was Fisher who referred to the growth rate r (used in equations such as the logistic function) as the Malthusian parameter, as a criticism of the writings of Thomas Robert Malthus. Fisher referred to "...a relic of creationist philosophy..." in observing the fecundity of nature and deducing (as Darwin did) that this therefore drove natural selection.
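To make the role of r concrete, here is a minimal sketch of logistic growth. It assumes the usual closed-form solution of dN/dt = rN(1 - N/K); the carrying capacity K, the starting population and the numerical values are illustrative assumptions and do not come from Fisher's writings.

```python
import math


def logistic_population(t, n0, r, k):
    """Population at time t under logistic growth.

    n0 -- initial population (assumed, illustrative)
    r  -- growth rate, the "Malthusian parameter"
    k  -- carrying capacity (assumed, illustrative)
    """
    return k / (1.0 + ((k - n0) / n0) * math.exp(-r * t))


# While the population is far below k, growth is close to exponential at rate r;
# as it approaches k, growth levels off.
trajectory = [logistic_population(t, n0=10.0, r=0.5, k=1000.0) for t in range(0, 31, 5)]
print(trajectory)
```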

He received the recognition of his peers in 1929 when he was elected to the Royal Society. His fame grew and he began to travel more and lecture to wider circles. In 1931 he spent six weeks at the Statistical Laboratory at Iowa State College in Ames, Iowa. He gave three lectures a week on his work, and met many of the active American statisticians, including George W. Snedecor. He returned for another visit in 1936.

In 1933 he left Rothamsted to become a Professor of Eugenics at University College London. In 1937 he visited the Indian Statistical Institute (in Calcutta), which at the time consisted of one part-time employee, Professor P. C. Mahalanobis. He revisited it often in later years, encouraging its development. He was the guest of honour at its 25th anniversary in 1957 when it had grown to 2000 employees. In 1939, when World War II broke out, the University tried to dissolve the eugenics department, and ordered all of the animals destroyed. Fisher fought back, but he was then exiled back to Rothamsted with a much reduced staff and resources. He was unable to find any suitable war work, and though he kept very busy with various small projects, he became discouraged about making any real progress. His marriage disintegrated. His oldest son, George, an aeroplane pilot, was killed in the war.

In 1943 he was offered the Balfour Chair of Genetics at Cambridge University, his alma mater. During the war, this department was almost entirely destroyed, but the University promised him that he would be charged with rebuilding it after the war. He accepted the offer, but the promises were largely unfulfilled, and the department grew very slowly. A notable exception was the recruitment in 1948 of the Italian researcher Cavalli-Sforza, who established a one-man unit of bacterial genetics. Fisher continued his work on mouse chromosome mapping and other projects. They culminated in the publication in 1949 of The Theory of Inbreeding. In 1947 he co-founded with Cyril Darlington the journal Heredity: An International Journal of Genetics.

Ronald Fisher was opposed to the UNESCO Statement of Race. He believed that evidence and everyday experience showed that human groups differ profoundly “in their innate capacity for intellectual and emotional development” and concluded that the “practical international problem is that of learning to share the resources of this planet amicably with persons of materially different nature,” and that “this problem is being obscured by entirely well-intentioned efforts to minimize the real differences that exist.” The revised 1951 statement titled "The Race Concept: Results of an Inquiry" was accompanied by Fisher's dissenting commentary.

He eventually received many awards for his work and was dubbed a Knight Bachelor by Queen Elizabeth II in 1952.

Fisher was opposed to the conclusions of Richard Doll and A.B. Hill that smoking caused lung cancer. He compared the correlations in their papers to a correlation between the import of apples and the rise of divorce in order to show that correlation does not imply causation.

To quote Yates and Mather again, "It has been suggested that the fact that Fisher was employed as consultant by the tobacco firms in this controversy casts doubt on the value of his arguments. This is to misjudge the man. He was not above accepting financial reward for his labours, but the reason for his interest was undoubtedly his dislike and mistrust of puritanical tendencies of all kinds; and perhaps also the personal solace he had always found in tobacco."

After retiring from Cambridge University in 1957 he spent some time as a senior research fellow at the CSIRO in Adelaide, Australia. He died of colon cancer there in 1962.