"> Inferential Statistics in Research: Guide & Examples
Home > Library > Statistics > Inferential Statistics in Research: A Guide With Examples

Published by at August 25th, 2021 , Revised On June 16, 2026

Inferential statistics is the branch of statistics that uses data from a sample to draw conclusions, or inferences, about the wider population that the sample came from. Because it is rarely practical to measure an entire population, researchers study a representative sample and then use two core tools — estimation (such as confidence intervals) and hypothesis testing — to generalise their findings, while quantifying how much uncertainty that generalisation carries.

This is exactly what separates inferential statistics from descriptive statistics. Descriptive statistics simply summarise the data you actually have; inferential statistics use that data to reason about data you do not have. In short: descriptive statistics describe, inferential statistics infer.

This guide explains what inferential statistics is, how it differs from descriptive statistics, the two pillars of estimation and hypothesis testing, which tests to choose for which data, and a fully worked example you can follow step by step.

“Inferential statistics… allow you to make inferences about a population based on data gathered from a sample.” — Australian Bureau of Statistics (ABS), Statistical Language

Inferential Statistics: An Introduction

To understand inferential statistics, it helps to place it within the wider field of statistics — the branch of mathematics concerned with collecting, organising, analysing, interpreting and presenting data.

Statistics is broadly split into two areas:

  • Descriptive statistics — methods for summarising and describing the features of a dataset (for example, the mean, median, standard deviation or a bar chart).
  • Inferential statistics — methods for using sample data to draw conclusions about a population.

Inferential statistics rests on two main activities:

  1. Estimating parameters: using a statistic calculated from a sample (such as the sample mean) to estimate the corresponding, unknown value in the population (such as the population mean).
  2. Hypothesis testing: using sample data to test a specific claim about the population, so you can answer research questions and decide whether an observed effect is likely to be real or simply due to chance.

The bridge between the two is probability. Because a sample is only a slice of the population, every inference carries a degree of uncertainty, and probability is what lets us measure that uncertainty rather than ignore it.

Descriptive vs inferential statistics

Descriptive

  • Summarise the data you have
  • Mean, SD, charts
Inferential

  • Generalise to a population
  • Estimation & hypothesis tests

Inferential Statistics vs. Descriptive Statistics

Descriptive statistics let you summarise and present the data you have collected; inferential statistics let you go beyond that data to make generalisations and predictions about a population. The key differences are set out below.

Feature Descriptive Statistics Inferential Statistics
Purpose Describe and summarise the data you actually collected Draw conclusions about a population from a sample
Scope Limited to the dataset in hand Generalises beyond the dataset to the population
Typical tools Mean, median, mode, range, standard deviation, charts and tables Confidence intervals, hypothesis tests, regression, p-values
Output Summary values and visual displays Estimates, probabilities and decisions about hypotheses
Uncertainty None — it reports exactly what was measured Built in — every inference is reported with a margin of error or significance level

A simple way to remember it: if you measure the exam marks of 30 students and report their average, that is descriptive. If you then use that average to estimate the average mark of all students on the course, that is inferential.

Why should you use a Plagiarism Detector for your Paper?

It ensures:

  • Original work
  • Structure and Clarity
  • Zero Spelling Errors
  • No Punctuation Faults
Calculate your Plagiarism Score - ResearchProspect

Sampling Error and Inferential Statistics

Before going further, here is a quick recap of four terms that sit at the centre of inferential statistics.

Sample: a smaller group selected from the population, which you actually observe and measure.
Population: the entire group you want to draw conclusions about.
Statistic: a number that describes a sample — for example, the sample mean (x̄) or sample proportion.
Parameter: a number that describes the whole population — for example, the population mean (μ) or population proportion.

Because a sample is smaller than the population, it will almost never capture the population perfectly. The difference between the true population value (the parameter) and the value calculated from your sample (the statistic) is called sampling error.

Sampling error can occur with any sampling method, whether random or systematic, which is why some uncertainty is unavoidable in inferential statistics. You cannot eliminate it, but you can reduce and quantify it: larger samples and well-designed probability (random) sampling both shrink sampling error and make your estimates more reliable. The standard error is the statistic that measures how much a sample estimate is expected to vary from sample to sample.

When you estimate a population parameter, you can do so in two ways:

  • Point estimate — a single value that is your best guess for the parameter. For example, using a sample mean of 68 kg as the estimate of the population mean.
  • Interval estimate — a range of values within which the parameter is likely to lie. The most widely used interval estimate is the confidence interval.

Confidence Intervals

A confidence interval (CI) uses the variability around a sample statistic to produce a range of plausible values for the population parameter. Where a point estimate gives a single number, a confidence interval also tells you how precise that number is by accounting for sampling error.

Every confidence interval has an associated confidence level, such as 95%. The correct interpretation is about the procedure, not a single interval: if you repeated the study many times and built a 95% confidence interval each time, about 95% of those intervals would contain the true population parameter. A common 95% confidence interval for a mean takes the form:

CI = sample mean ± (critical value × standard error)

You cannot be 100% certain that any single interval contains the true parameter, because the only way to know a parameter exactly is to measure the entire population. But with an adequate sample size and random sampling, the confidence-interval method is reliable over the long run — and a narrower interval indicates a more precise estimate.

Hypothesis Testing and Inferential Statistics

Hypothesis testing is the second pillar of inferential statistics. It is a formal procedure for using sample data to decide between two competing claims about a population, so you can judge whether an observed effect is likely to be real or simply the result of chance.

A hypothesis test usually involves these steps:

  1. State the hypotheses. The null hypothesis (H₀) says there is no effect or no difference; the alternative hypothesis (H₁) says there is.
  2. Set the significance level (α), commonly 0.05. This is the risk you are willing to take of rejecting a true null hypothesis.
  3. Choose and run a suitable statistical test, which produces a test statistic.
  4. Find the p-value, the probability of obtaining a result at least as extreme as the observed one if the null hypothesis were true.
  5. Make a decision. If the p-value is less than or equal to α, you reject the null hypothesis and call the result statistically significant; otherwise you fail to reject it.

The statistical test you choose accounts for sampling error so that your inference is valid. Tests fall into two broad families:

  • Parametric tests assume the data meet certain conditions (see below). When those assumptions hold, parametric tests are generally more powerful — that is, more likely to detect a real effect.
  • Non-parametric tests make fewer assumptions about the shape of the population distribution and are used when parametric assumptions are violated, for example with skewed data or ordinal data. They are sometimes called “distribution-free” tests.

Common assumptions for parametric tests include:

  • The data are drawn from a population that follows a normal distribution
  • The sample is large enough and randomly drawn to represent the population
  • The variances of the groups being compared are roughly equal (homogeneity of variance)

Forms of Statistical Tests

Inferential statistical tests generally serve one of three goals: comparing groups, measuring association, or predicting an outcome.

Comparison Tests

Comparison tests examine whether the means, medians or rankings of two or more groups differ. Examples include the t-test (two groups) and ANOVA (three or more groups).

Correlation Tests

Correlation tests measure the strength and direction of the association between two or more variables — for example, Pearson’s r for linear relationships.

Regression Tests

Regression tests model how changes in one or more predictor variables relate to changes in an outcome variable, and can be used for prediction. The right regression model depends on the number and type of predictor and outcome variables.

Common Inferential Statistics Tests

The table below summarises widely used inferential tests and when to apply each. Use it alongside our guide on which statistical test you should use.

Test Purpose When to use Type
Independent t-test Compare the means of two independent groups One continuous outcome, two unrelated groups, roughly normal data Parametric
Paired t-test Compare two related means (e.g. before vs after) Paired/repeated measurements on the same subjects Parametric
One-way ANOVA Compare the means of three or more groups One continuous outcome, three+ independent groups Parametric
Mann–Whitney U Compare two groups when data are not normal/ordinal Non-normal or ordinal data, two independent groups Non-parametric
Kruskal–Wallis Compare three or more groups on ranks Non-normal or ordinal data, three+ groups Non-parametric
Pearson’s r Measure linear association between two variables Two continuous, normally distributed variables Parametric
Spearman’s rho Measure monotonic association Ordinal data or non-linear monotonic relationships Non-parametric
Chi-square test Test association between categorical variables Two categorical variables, frequency/count data Non-parametric
Linear regression Predict a continuous outcome from predictors Continuous outcome, one or more predictors Parametric
Logistic regression Predict a binary outcome from predictors Binary (yes/no) outcome, one or more predictors Parametric

Choosing an inferential test

Identify your goal: compare means, test a relationship, or test association
Check number & type of variables
Check parametric assumptions
Pick the matching test

Inferential Statistics: A Worked Example

The example below shows how estimation and hypothesis testing work in practice using a one-sample test.

Example: A university claims that students sleep an average of 7 hours per night. A researcher suspects the true average is lower and surveys a random sample of n = 100 students, finding a sample mean of x̄ = 6.6 hours with a sample standard deviation of s = 1.5 hours.

Step 1 — State the hypotheses.
H₀: μ = 7 (the population mean is 7 hours).
H₁: μ < 7 (the population mean is less than 7 hours). This is a one-tailed test.

Step 2 — Set the significance level. Use α = 0.05.

Step 3 — Compute the standard error.
SE = s ÷ √n = 1.5 ÷ √100 = 1.5 ÷ 10 = 0.15.

Step 4 — Compute the test statistic.
t = (x̄ − μ₀) ÷ SE = (6.6 − 7) ÷ 0.15 = −0.4 ÷ 0.15 ≈ −2.67.

Step 5 — Make a decision. With 99 degrees of freedom, the one-tailed critical value at α = 0.05 is about −1.66. Because −2.67 is more extreme than −1.66 (the corresponding p-value is roughly 0.004, well below 0.05), we reject H₀. There is statistically significant evidence that students sleep fewer than 7 hours on average.

Step 6 — Estimate with a confidence interval. A 95% CI for the mean is x̄ ± 1.984 × SE = 6.6 ± 1.984 × 0.15 = 6.6 ± 0.30, i.e. about 6.30 to 6.90 hours. Note that 7 hours falls outside this interval, which agrees with the test result.

This single example shows the two pillars working together: the hypothesis test answers “is there an effect?”, while the confidence interval answers “how large is it, and how precisely have we measured it?”

Struggling to choose or run the right test?

ResearchProspect to the rescue!

Our statisticians can select the correct method, run the analysis and interpret your results — explore our statistical analysis service.

Frequently Asked Questions

What is inferential statistics in research?

Inferential statistics is the use of sample data to draw conclusions about a larger population. In research, it lets you estimate population parameters (using confidence intervals) and test hypotheses, so you can generalise findings from a manageable sample to the whole group while measuring the uncertainty involved.

Descriptive statistics summarise the data you have collected — for example, the mean or a bar chart — without going beyond it. Inferential statistics use that sample data to make predictions and draw conclusions about a wider population. In short, descriptive statistics describe, while inferential statistics infer.

The two main branches are estimation and hypothesis testing. Estimation uses a sample statistic to estimate an unknown population parameter, often with a confidence interval. Hypothesis testing uses sample data to decide whether a specific claim about the population is supported by the evidence.

A confidence interval is a range of values that is likely to contain the true population parameter. A 95% confidence interval means that if the study were repeated many times, about 95% of the intervals produced would contain the true parameter. A narrower interval indicates a more precise estimate.

It depends on your aim and your data. To compare two groups use a t-test; for three or more groups use ANOVA; to measure association use a correlation test such as Pearson’s r; and to predict an outcome use regression. If your data are not normally distributed or are ordinal, use a non-parametric equivalent such as Mann–Whitney U or Spearman’s rho. See our guide on which statistical test you should use.

Sampling error is the difference between a value calculated from a sample (a statistic) and the true value in the population (a parameter). It arises because a sample cannot perfectly represent the whole population. It cannot be eliminated, but larger samples and random sampling reduce it, and the standard error quantifies it.

About Jamie Walker

Avatar for Jamie WalkerJamie is a content specialist holding a master's degree from Stanford University. His research focuses on the Internet of Things, as well as areas such as politics, medicine, sociology, and other academic writing. Jamie is a member of the content management team at ResearchProspect.

WhatsApp Live Chat