Chi-Square Test

The Chi-Squared test is used to compare what you have measured (observed) against what may be anticipated (expected).

Last Updated: 31.10.2022

We establish a hypothesis for the feature under investigation and then convert it to a null hypothesis. The null hypothesis states that no relationship between the two population parameters exists. We use it because it helps us see if our hypothesis has validity. It is impossible to prove something with absolute certainty. However, we can reject a null hypothesis, which supports our own hypothesis, and we use ‘confidence levels’ and ‘critical values’ to do this.

Null hypothesis: There is no significant difference between specified populations, any observed difference being due to sampling or experimental error.

Imagine you’re investigating the size of trees in a forest and noticed differences as you moved from the outside of the forest to the centre. You saw the trees got denser closer to the centre. You want to know if the variations in the number of trees per 5m^2 are significant or random, so you carry out an investigation to find out whether the variation is statistically significant. Below are the hypotheses you would use:

Hypothesis: The density of trees per 5m^2 increases as you move towards the centre of the forest.
Null hypothesis: There is no significant variation in tree density in the forest.

The difference between expected and observed results in experiments can be described in two ways:

Statistically significant
Statistically insignificant (happened by chance)

When results are significant, this suggests that something is happening that wasn’t accounted for.

The chi-squared test is a statistical test commonly used for biological hypotheses to determine if the results are statistically significant.

We can also define our hypothesis as one-tailed or two-tailed. One-tailed hypotheses are based on uni-directional hypotheses and two-tailed on bi-directional hypotheses.

In terms of our earlier hypotheses, this would be:

One-tailed: The density of trees per 5m^2 increases towards the forest’s centre.
Two-tailed: The density of trees per 5m^2 changes towards the forest’s centre.

Fig. 1 - One-tailed and two-tailed hypotheses

Chi-squared tests should only be done using categorical data and if specific criteria are met.

Categorical data: values that can be sorted into groups or categories. It can be further divided into nominal (values you can count but not order, e.g., eye colour) and ordinal (values you can count and order e.g., house numbers).

The test described here compares observed counts against expected counts. This is different from the chi-squared contingency test, which tests for the association between two categorical variables.

What are the criteria for performing a chi-squared test?

The sample size has to be large (>20).
The data must be categorical.

Only raw counts can be used – not ratios, rates, fractions or percentages.
A comparison is being made between theoretical (expected) and experimental (observed) results.

What are the assumptions of the chi-squared test?

Additionally, the chi-squared test makes several assumptions:

The comparisons are made on random samples.
The expected count of each cell is greater than one (>1).
No more than 20% of the cells have expected counts less than 5 (<5).
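
The two expected-count assumptions above can be checked mechanically before running the test. The following is a minimal sketch in Python; the function name `check_expected_counts` and the example counts are illustrative assumptions, not part of any library.

```python
def check_expected_counts(expected):
    """Check the expected-count assumptions listed above.

    Returns (ok, problems), where problems is a list of plain-text
    descriptions of any assumption that is not met.
    """
    problems = []

    # Every expected count should be greater than 1.
    if any(e <= 1 for e in expected):
        problems.append("an expected count is not greater than 1")

    # No more than 20% of the cells may have an expected count below 5.
    small_cells = sum(1 for e in expected if e < 5)
    if small_cells / len(expected) > 0.20:
        problems.append("more than 20% of cells have expected counts below 5")

    return (not problems, problems)


# Illustrative expected counts for four categories (hypothetical values).
ok, problems = check_expected_counts([240, 80, 80, 27])
print(ok, problems)
```

This only covers the numeric assumptions; whether the samples are random has to be judged from how the data was collected.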

How do you calculate chi-squared?

The formula looks very scary - but don’t panic! We can break it down into steps.

What is the formula for chi-squared?

$X^{2} = \sum \frac{{(O - E)}^{2}}{E}$

where:

$X^{2}$ = the test statistic
$\sum$ = the sum of
O = observed frequency
E = expected frequency

In other words, chi-squared $X^{2}$ is found by taking, for each category, the square of the difference between the observed and expected values ${(O - E)}^{2}$, dividing it by the expected value (E), and then adding up the results for all categories.

To help you understand how we would calculate the chi-squared, we will use flower phenotype as an example.

To calculate:

Obtain the expected and observed results for the experiment (as shown in the table below)
Calculate the difference between each set of results
Square each difference
Divide each squared difference by the expected value
Use the sum of these answers to obtain the chi-squared value

Table 1. Example of a table to find values for Chi-Squared calculation.

Flower Phenotype	Observed number (O)	Expected ratio	Expected number (E) (total number x ratio/16)	O-E	(O-E)^2/E
Pink/Round	296	9	240	56	13.067
Pink/Long	19	3	80	-61	46.513
Purple/Round	27	3	80	-53	35.113
Purple/Long	85	1	27	58	124.593
Total	427			X^2	219.29
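
The calculation in Table 1 can be reproduced with a few lines of Python. This is a sketch rather than a definitive implementation: the helper name `chi_squared` is an assumption, and the expected numbers are the rounded values from the table.

```python
def chi_squared(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Observed and (rounded) expected numbers from Table 1, in the order
# Pink/Round, Pink/Long, Purple/Round, Purple/Long.
observed = [296, 19, 27, 85]
expected = [240, 80, 80, 27]

statistic = chi_squared(observed, expected)
print(round(statistic, 2))
```

The result agrees with the table to within rounding. The table adds up terms that have already been rounded to three decimal places, so its total can differ from the code's output in the last digit.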

How do you calculate degrees of freedom and use a chi-squared distribution table?

The chi-squared value has little meaning on its own – it needs to be compared to ‘critical values’, which are found in tables or on graphs as calculated by statistical experts.

First, you must decide the confidence level you want to use. The most common are 95% and 99%. At these levels, if the null hypothesis were true and you carried out the test 100 times, you would expect to get a significant result purely by chance on about five occasions or one occasion, respectively.

Table 2. Confidence, uncertainty and probability levels.

	Highly confident	Very confident	Extremely confident
Confidence level	95%	99%	99.9%
Uncertainty level	5%	1%	0.1%
Probability level (p-value)	0.05	0.01	0.001

We then use the value we have obtained in the Chi-Squared test to see if the data is statistically significant. A distribution table is used for this. The distribution table relates the chi-squared value with probabilities. We also use degrees of freedom to determine the number of comparisons made.

For a chi-squared test, the degrees of freedom equal the number of categories minus one (n-1). You will also need to determine your p-value.

For the chi-squared test described here, the degrees of freedom are always n-1, where n is the number of categories.

Here is an example of a standard chi-squared table. You read the table by looking at the row corresponding to the degrees of freedom used in your experiment and the column corresponding to your p-value. You will find your critical value at the intersection of this row and column.

Table 3. Standard distribution table.

	The probability that the difference between observed and expected is due to chance
Degrees of freedom	0.1	0.05	0.01	0.001
1	2.71	3.84	6.63	10.83
2	4.60	5.99	9.21	13.82
3	6.25	7.82	11.34	16.27
4	7.78	9.49	13.28	18.47

If your chi-squared test value is greater than the critical value (the value found from the table), then the deviation between your expected and observed results is statistically significant. If it is not greater than the critical value, the difference is not significant.
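
The lookup and comparison described above can be sketched in Python as follows. The dictionary holds only the p = 0.05 column of Table 3, and the names `CRITICAL_0_05` and `is_significant` are assumptions made for this illustration.

```python
# Critical values at p = 0.05, copied from Table 3.
# Keys are degrees of freedom.
CRITICAL_0_05 = {1: 3.84, 2: 5.99, 3: 7.82, 4: 9.49}


def is_significant(chi_squared_value, n_categories):
    """Compare a chi-squared value with the critical value at p = 0.05."""
    degrees_of_freedom = n_categories - 1
    critical_value = CRITICAL_0_05[degrees_of_freedom]
    return chi_squared_value > critical_value


# The flower example: four categories and a chi-squared value of 219.29.
print(is_significant(219.29, 4))
```

With four categories there are three degrees of freedom, so the value is compared with 7.82 and the difference is reported as significant.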

How is the chi-squared test used in genetics?

Chi-squared tests are used across biology. For instance, they can be very useful for determining whether the results of a genetic cross are significantly different from the theoretical predictions.

Genetic cross: The deliberate breeding of two different individuals that results in offspring that carry part of the genetic material of each parent.

Let’s take, for example, the actual results that Gregor Mendel obtained during his pea experiments on the inheritance of seed type. Mendel performed experiments on pea plants to determine patterns of inheritance for some of the plants’ observable traits. For more information on his experiments, check out our articles on Inheritance!

A single gene determines seed type with a dominant allele that produces smooth seeds and a recessive allele that produces wrinkled seeds. Mendel’s experiment resulted in 5474 smooth and 1850 wrinkled seeds. Allowing for some statistical error, how can we tell if this result fits our expected ratio?

Statistical error: The difference between a measured value and the actual value of the collected data. The larger the error, the less reliable the data is considered to be.

When following these steps, it is useful to summarise your calculations in a table like so:

Table 4. Another example of how to obtain the values for the equation.

Category	Observed	Expected	O-E	(O-E)^2	(O-E)^2/E
Smooth	5474	5493	-19	361	0.0657
Wrinkled	1850	1831	19	361	0.1972
	Total = 7324				0.2629

1. First, let’s calculate the expected values. In this case, we would find the total number of offspring (5474+1850 = 7324) and divide it according to the 3:1 ratio. This gives us expected values of (7324 x ¾ =) 5493 smooth seeds and (7324 x ¼ =) 1831 wrinkled seeds.
2. We now need to know the difference between the observed and expected values. For the smooth category, the difference is (5474-5493 =) -19, while for the wrinkled category, the difference is (1850-1831 =) 19.
3. When we square these differences, we get 361 for smooth seeds and 361 for wrinkled seeds. Squaring removes the negative sign, so both values are positive.
4. Finally, when we divide these values by their respective expected values, we get 0.0657 and 0.1972. Added together, this gives us 0.2629.
5. Let’s check the chi-squared distribution table above. We have two categories, which means our degree of freedom is 2-1=1.
6. To find our critical value, use the standard table from before, find the row corresponding to our degree of freedom (1) and the column corresponding to our p-value (0.05). These intersect at the critical value of 3.84.
7. 0.2629<3.84. Therefore the observed and expected values are not significantly different from one another. The slight differences in value are due to chance.
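
The whole worked example can also be followed in code. The sketch below assumes the 3:1 ratio and the critical value of 3.84 (one degree of freedom, p = 0.05) used in the steps above; the variable names are illustrative.

```python
# Mendel's observed seed counts and the expected 3:1 ratio.
observed = {"smooth": 5474, "wrinkled": 1850}
ratio = {"smooth": 3, "wrinkled": 1}

# Split the total number of offspring according to the ratio.
total = sum(observed.values())
ratio_sum = sum(ratio.values())
expected = {name: total * part / ratio_sum for name, part in ratio.items()}

# Chi-squared: sum of (O - E)^2 / E over both categories.
statistic = sum(
    (observed[name] - expected[name]) ** 2 / expected[name]
    for name in observed
)

# Critical value for 2 - 1 = 1 degree of freedom at p = 0.05.
critical_value = 3.84

print(expected)
print(round(statistic, 4))
print(statistic > critical_value)
```

The last line prints `False`, which matches the conclusion that the observed and expected values are not significantly different.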

What if the observed values are significantly different from the expected values?

Suppose the observed values are significantly different from the expected values, that is, the chi-squared value is greater than the critical value for the chosen p-value. In that case, there are some things we may want to consider. As discussed in the article on sex-linkage, autosomal linkage, and epistasis, there are several reasons why we might observe patterns of inheritance that do not fit Mendelian ratios.

The trait might be sex-linked, meaning the sex of the individual affects whether they can inherit the trait.
Two genes might be found on the same chromosome and thus exhibit linkage.
Epistasis might also affect the phenotypes expressed by the individual.

Sex linkage: Sex linkage is the phenotypic expression of an allele that is dependent on the sex of the individual and is directly tied to the sex chromosomes.

Autosomal linkage: Autosomal linkage occurs if two or more genes are located on the same autosome (non-sex chromosome). The two genes are less likely to be separated during crossing over, resulting in the alleles of the linked genes being inherited together.

Epistasis: Epistasis is a circumstance where the expression of one gene is affected by the expression of one or more independently inherited genes.

Chi-Square Test - Key takeaways

The chi-squared (χ2) test assesses the null hypothesis that there is no statistically significant difference between the observed and expected results of an experiment.
It can be performed on large sample sizes (>20), using raw counts of categorical data.
Chi-squared is the sum, over all categories, of the squared difference between the observed and expected values, each divided by the expected value.
A chi-squared distribution table is used to determine the correct critical value for the given degrees of freedom and p-value.
When chi-squared is higher than the critical value, the difference between the expected and observed results is significant.
Degrees of freedom are calculated by subtracting one from the number of categories.


Frequently Asked Questions about Chi-Square Test

What is chi-square in genetics?

A chi-squared test is used to see if the difference between the observed and expected results of an experiment is statistically significant. Chi-squared tests are often used on biological data.

How do you determine the degrees of freedom?

The degrees of freedom are equivalent to the number of categories (n) minus one. df = n-1

How do you calculate the chi square test?

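The chi-squared test is calculated with the equation $X^{2} = \sum \frac{{(O - E)}^{2}}{E}$, where O is the observed frequency and E is the expected frequency.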

What is a chi-square test used for in biology?

To identify if results are statistically significant when comparing observed and expected results.

When would you do a chi-square test?

When we are comparing observed and expected results.

How do you interpret a chi-square test?

We use the value we have obtained in the Chi-Squared test to see if the data is statistically significant. A distribution table is used for this. The distribution table relates the chi-squared value with probabilities. We also use degrees of freedom to determine the number of comparisons made.


How do we ensure our content is accurate and trustworthy?

At StudySmarter, we have created a learning platform that serves millions of students. Meet the people who work hard to deliver fact-based content and make sure it is verified.

Content Creation Process:

Lily Hulatt is a Digital Content Specialist with over three years of experience in content strategy and curriculum design. She gained her PhD in English Literature from Durham University in 2022, taught in Durham University’s English Studies Department, and has contributed to a number of publications. Lily specialises in English Literature, English Language, History, and Philosophy.


Content Quality Monitored by:

Gabriel Freitas is an AI Engineer with a solid experience in software development, machine learning algorithms, and generative AI, including large language models’ (LLMs) applications. Graduated in Electrical Engineering at the University of São Paulo, he is currently pursuing an MSc in Computer Engineering at the University of Campinas, specializing in machine learning topics. Gabriel has a strong background in software engineering and has worked on projects involving computer vision, embedded AI, and LLM applications.



Vaia Editorial Team

Team Biology Teachers