"> Basic Methods of Statistical Analysis (Beginner Guide)
Home > Library > Statistics > Basic Methods of Statistical Analysis: A Beginner’s Guide

Published by at August 26th, 2021 , Revised On June 16, 2026

The basic methods of statistical analysis fall into two families: descriptive statistics, which summarise and describe your data (using the mean, standard deviation, frequencies and skewness), and inferential statistics, which use a sample to draw conclusions about a wider population (using hypothesis testing, correlation, regression and ANOVA). For most student research projects, you will start by describing your data, then test whether the patterns you see are statistically significant.

When carrying out dissertation statistical analysis, many beginners feel they have opened a Pandora’s box. The frustration usually comes from a poorly developed methodology or an inadequately designed research framework rather than the maths itself. If the foundation of your research is sound, the analysis becomes far more manageable. This guide walks you through the core statistical analysis techniques every beginner should know, what each one does, and exactly when to use it, so you can confidently analyse your research data for the first time.

While most students find it easy to collect data through primary and secondary research techniques, they struggle with the analysis. It is worth remembering that the success of a dissertation often rests on the analysis and findings chapter: strong, meaningful analysis is closely associated with higher academic grades. Before you analyse anything, it helps to understand the types of variables and levels of measurement in your dataset, because they determine which technique is valid.

“Statistics is the science of learning from data, and of measuring, controlling, and communicating uncertainty.” — American Statistical Association

The Basic Methods of Statistical Analysis at a Glance

The table below summarises the core techniques covered in this guide, what each measures, and the typical situation in which you would use it. Use it as a quick reference, then read the detailed sections that follow.

Technique What it tells you When to use it
Arithmetic mean The central or “average” value of a dataset To summarise a typical value for interval/ratio data
Standard deviation How spread out values are around the mean To describe variability or consistency in data
Skewness Whether a distribution is symmetric or lopsided To check distribution shape before choosing a test
Hypothesis testing Whether an observed effect is likely real or chance To compare groups or test a claim about a population
Correlation Strength and direction of a linear relationship To see if two numeric variables move together
Regression How one variable predicts or explains another To model and forecast a dependent variable
ANOVA Whether the means of 3+ groups differ To compare more than two group averages at once

Part 1: Descriptive Statistics

Descriptive statistics summarise the main features of a dataset. They do not let you draw conclusions beyond your sample, but they are the essential first step that tells you what your data look like. The three most common descriptive techniques for beginners are the mean, the standard deviation and skewness.

1. Arithmetic Mean (Average)

The arithmetic mean, commonly called the “average”, is the sum of all values divided by the number of values. It belongs to the family of measures of central tendency, which show the extent to which observations cluster around a central point. The mean is quick to calculate by hand or in SPSS, Excel and MATLAB, and gives a fast snapshot of the overall trend in a dataset.

When to use it: for interval or ratio data that are roughly symmetric. Be cautious when the data contain extreme values (outliers), because the mean is easily pulled towards them; in that case the median is often a better summary.

Example: Suppose five students score 60, 65, 70, 75 and 80 in an exam. The mean is (60 + 65 + 70 + 75 + 80) ÷ 5 = 350 ÷ 5 = 70. Now add a sixth student who scored 10. The mean drops to (350 + 10) ÷ 6 = 360 ÷ 6 = 60, even though only one value changed — a clear illustration of how a single outlier can distort the mean.

2. Standard Deviation

The standard deviation, represented by the Greek symbol σ (sigma), is a measure of variability or dispersion of data around the mean. It measures how far individual data points typically lie from the average. A higher standard deviation means values are spread more widely from the mean; a low standard deviation means values cluster tightly around it. It is calculated by squaring each value’s difference from the mean (so positives and negatives don’t cancel out), averaging those squared differences to get the variance, and taking the square root.

When to use it: alongside the mean, to describe how consistent or volatile your data are. Two datasets can share the same mean but have very different standard deviations — and therefore tell very different stories.

Example: Take the values 2, 4, 4, 4, 5, 5, 7, 9. The mean is 5. The squared differences from the mean are 9, 1, 1, 1, 0, 0, 4, 16, which sum to 32. The variance is 32 ÷ 8 = 4, so the standard deviation is √4 = 2. On average, the values lie 2 units from the mean.

What data collection method best suits your research?

  • Find out by hiring an expert from ResearchProspect today!
  • Despite how challenging the subject may be, we are here to help you.

3. Skewness

The shape of a distribution matters. Some distributions are symmetric, like the familiar bell curve, but real data are often lopsided, leaning to the left or right. Skewness measures how asymmetric a distribution is. Because the mean, median and mode all describe the centre of a dataset, the way they relate to one another reveals the skew. In a right-skewed (positively skewed) distribution the mean is typically greater than the median; in a left-skewed (negatively skewed) distribution the mean is typically less than the median.

When to use it: to check distribution shape before choosing an inferential test, since many tests assume an approximately normal distribution. Judging skew from a graph alone is subjective, so it is best calculated numerically — for example, with Pearson’s coefficient of skewness.

Symmetric: mean = median = mode
A symmetric distribution; skew shifts the mean away from the median.

Skewness statistical analysis technique showing left-skewed, symmetric and right-skewed distributions

Part 2: Inferential Statistics

Inferential statistics let you use data from a sample to draw conclusions about a larger population. This is where you test whether the patterns you described are likely to be real rather than a product of chance. The four techniques below — hypothesis testing, correlation, regression and ANOVA — cover the vast majority of beginner dissertation analyses.

4. Hypothesis Testing

Hypothesis testing assesses whether a specific claim about a population is supported by your data. You state a null hypothesis (usually “no effect” or “no difference”) and an alternative hypothesis, then use a test such as the t-test or chi-square test to decide which is more credible. The result is judged statistically significant if the observed effect would be very unlikely to occur by random chance alone, conventionally when the p-value is below 0.05.

When to use it: to compare groups or test a claim — for example, whether a new teaching method produces higher scores than the standard one. Tests can be run easily in SPSS or Minitab.

Example: You want to know whether a study app improves test scores. Your null hypothesis is “the app makes no difference”. You run a two-sample t-test comparing app users and non-users and obtain p = 0.02. Because 0.02 is below the 0.05 threshold, you reject the null hypothesis and conclude there is statistically significant evidence the app improves scores.

5. Correlation

Correlation analysis studies the strength and direction of a relationship between two continuous variables. It is summarised by a correlation coefficient (commonly Pearson’s r) ranging from -1 to +1: a value near +1 indicates a strong positive relationship, near -1 a strong negative relationship, and near 0 little or no linear relationship. Importantly, correlation does not imply causation — two variables can move together without one causing the other.

When to use it: to see whether two numeric variables move together, such as hours studied and exam marks. For ordinal data, Spearman’s rank correlation is the appropriate alternative.

6. Regression

Regression goes a step beyond correlation by modelling how one variable predicts or explains another. The simplest form, simple linear regression, fits the “best” straight line between an independent (explanatory) variable and a dependent variable, usually shown on a scatterplot. The fitted line takes the form y = a + bx, where b is the slope (the predicted change in y for each one-unit increase in x) and a is the intercept.

When to use it: when you want to predict or quantify the effect of one variable on another — for example, predicting sales from advertising spend. Multiple regression extends this to several predictors at once.

Regression statistical analysis technique showing a best-fit line on a scatterplot

7. ANOVA (Analysis of Variance)

A t-test compares the means of two groups, but when you have three or more groups, running multiple t-tests inflates the chance of a false positive. Analysis of variance (ANOVA) solves this by testing, in a single test, whether the means of three or more groups differ significantly. A significant result tells you that at least one group mean differs from the others; follow-up (post-hoc) tests then identify which.

When to use it: to compare averages across three or more groups — for example, comparing mean exam scores across four different schools. One-way ANOVA handles a single grouping factor; two-way ANOVA handles two.

Example: A researcher compares the mean recovery time of patients on three different diets. Rather than running three separate t-tests, a one-way ANOVA returns p = 0.01, indicating at least one diet’s mean recovery time differs. A post-hoc Tukey test then reveals that diet C differs significantly from diets A and B.

How to Choose the Right Statistical Test

Choosing a test comes down to three questions: what type of data do you have, how many groups or variables are involved, and what are you trying to find out (describe, compare or predict)? The flowchart below summarises the decision for the most common beginner scenarios. For a fuller treatment, see our guide on which statistical test you should use.

Choosing a statistical test

Identify your goal: compare means, test a relationship, or test association
Check number & type of variables and level of measurement
Check parametric assumptions (normality, equal variance)
Pick the matching test (t-test, ANOVA, correlation, chi-square)
  • Describe one variable? Use the mean and standard deviation (plus skewness to check shape).
  • Relationship between two numeric variables? Use correlation, then regression if you want to predict.
  • Compare two group means? Use a t-test.
  • Compare three or more group means? Use ANOVA.
  • Test a claim about a population? Use hypothesis testing with an appropriate significance level.

Struggling with your dissertation statistics?

ResearchProspect to the rescue!

Our expert statisticians can run, interpret and write up your analysis correctly — explore our statistical analysis service to get matched with a specialist.

How ResearchProspect Can Help

Whether you are an undergraduate, Master’s or PhD student, our expert statisticians can help with every part of your data analysis, from choosing the right test to interpreting the output and writing up the findings chapter. However urgent or complex your needs, our large team of statistical consultants means we can assign a suitable expert to your order. You can read genuine client feedback on our reviews page before you decide.

Frequently Asked Questions

What are the 5 basic methods of statistical analysis?

The five most widely taught basic methods of statistical analysis are: (1) the mean, to describe a typical value; (2) standard deviation, to describe spread; (3) hypothesis testing, to test claims; (4) correlation, to measure relationships between variables; and (5) regression, to predict one variable from another. ANOVA is often added as a sixth for comparing three or more groups.

In quantitative research, analysis usually begins with descriptive statistics (mean, standard deviation, frequencies) to summarise the data, followed by inferential statistics (hypothesis testing, correlation, regression and ANOVA) to test relationships and differences and generalise from the sample to the population.

Descriptive statistics summarise and describe the data you actually collected, using measures such as the mean and standard deviation. Inferential statistics use that sample data to draw conclusions about a wider population, using techniques such as hypothesis testing and regression.

It depends on your data and question: use a t-test to compare two group means, ANOVA for three or more groups, correlation to measure a relationship between two numeric variables, and regression to predict one variable from another. Our guide to choosing a statistical test walks through each scenario.

No. A correlation shows that two variables move together, but it does not prove that one causes the other. The relationship could be coincidental or driven by a third, unmeasured variable. Establishing causation requires a controlled experimental design.

About Carmen Troy

Avatar for Carmen TroyTroy has been the leading content creator for ResearchProspect since 2017. He loves to write about the different types of data collection and data analysis methods used in research.

WhatsApp Live Chat