"> Quantitative Research Questionnaire: Examples & Design - ResearchProspect
Home > Library > Research Methodology > Quantitative Research Questionnaire: Examples & Design

Published by at August 20th, 2024 , Revised On June 17, 2026

A questionnaire in quantitative research is a structured, standardised data-collection instrument made up mostly of closed questions that convert respondents’ answers into numerical data for statistical analysis. Every participant receives the same items in the same order and selects from a fixed set of response options, so the resulting data can be counted, scored and compared across a sample. You use one when your research questions are about how many, how much, how often or how strongly — and when you need data from enough people to test hypotheses or generalise to a wider population.

This guide explains what a quantitative questionnaire is, when to use it, the main closed question types and the level of measurement each yields, a seven-step design process, a worked set of example items, how to build in reliability and validity, and how the data is analysed.

What a quantitative research questionnaire is

A quantitative research questionnaire is a standardised measurement instrument. Its defining feature is structure: the questions, the wording and the response options are all fixed in advance, so that differences in the data reflect differences between respondents rather than differences in how each person was asked. Because almost every item is closed — the respondent picks from pre-set options rather than writing freely — the answers can be coded as numbers and analysed with descriptive and inferential statistics.

This is the central contrast with a qualitative research questionnaire, which relies on open-ended questions to elicit rich, narrative accounts in the respondent’s own words. Quantitative questionnaires sacrifice some of that depth in exchange for breadth, comparability and statistical power. In Saunders, Lewis and Thornhill’s terms, the questionnaire is a data-collection technique that sits most comfortably within a deductive, positivist or post-positivist research design, where the aim is to measure constructs and test relationships between variables.

A well-built questionnaire does three things at once: it operationalises abstract variables (such as job satisfaction or anxiety) into observable, answerable items; it ensures every respondent interprets each item the same way; and it produces data at a known level of measurement so the right statistical tests can be applied.

When to use a quantitative questionnaire

A quantitative questionnaire is the right tool when several of the following are true:

  • Your research questions or hypotheses concern frequencies, magnitudes, differences or relationships (e.g. “Does remote working predict higher reported wellbeing?”).
  • You need data from a large sample and want to generalise findings to a wider population.
  • The constructs you care about can be operationalised into measurable indicators — attitudes, behaviours, frequencies, demographics.
  • You want comparable, replicable data that another researcher could collect again in the same way.
  • You intend to run statistical tests — correlations, group comparisons, regression — on the results.

It is the wrong tool when you are exploring a poorly understood phenomenon, building theory, or need to understand why people feel as they do in their own words; there, open interviews or a qualitative questionnaire serve better. Many dissertations combine both in a mixed-methods design, as Creswell describes, using the questionnaire for breadth and interviews for depth.

Closed question types and the data they produce

The power of a quantitative questionnaire lives in its closed question types. Each type yields data at a particular level of measurement — nominal, ordinal, interval or ratio — and that level determines which statistics you can legitimately run. Choose the type that matches the construct and the analysis you plan.

Question type Example item Level of measurement
Dichotomous (two options) “Have you used our app in the past 7 days?” — Yes / No Nominal (binary)
Multiple-choice (one of several categories) “Which department do you work in?” — HR / Finance / Sales / Operations Nominal
Likert scale (agreement) “I feel confident using statistical software.” — Strongly disagree … Strongly agree (1–5) Ordinal (often treated as interval when summed across items)
Rating / semantic differential “Rate the lecture quality:” Poor 1—2—3—4—5 Excellent; or Boring —— Engaging Ordinal (commonly treated as interval)
Ranking “Rank these five benefits from most (1) to least (5) important.” Ordinal
Numeric / open numeric “How many hours did you sleep last night?” ___ hours; “What is your age?” ___ Ratio (a true zero exists)

Two practical notes. First, single Likert items are strictly ordinal, but a Likert scale — several items summed or averaged into a composite score — is widely treated as interval-level data, which is what allows means, t-tests and correlations. Second, only ratio (and interval) data support the full range of arithmetic; nominal and ordinal data restrict you to frequencies, modes, medians and non-parametric tests. Matching question type to planned analysis at the design stage saves you from collecting unusable data.

5-point Likert scale: “I feel confident using statistical software.”1Stronglydisagree2Disagree3Neither4 (selected)Agree5StronglyagreeHow 200 responses aggregate8%14%19%37%22%
A single Likert item offers five labelled, ordered response points; across a sample, the chosen points aggregate into a distribution (here a mean of about 3.5) that descriptive statistics summarise.

How to design a quantitative questionnaire: a 7-step process

Designing a sound instrument is a disciplined, sequential process. Work through these steps in order — skipping the early conceptual steps is the most common cause of a questionnaire that collects the wrong data.

  1. Define your constructs. List the variables your research questions require — for example job satisfaction, perceived workload, intention to leave. Be explicit about what each construct does and does not include.
  2. Operationalise each construct. Translate every abstract construct into concrete, observable indicators that can be asked about directly. Where a validated scale already exists (e.g. an established satisfaction or wellbeing scale), adopt or adapt it rather than inventing your own — it brings ready-made evidence of reliability and validity.
  3. Choose item types. For each indicator, select the closed question type that yields the level of measurement your analysis needs (see the table above). Decide deliberately between dichotomous, multiple-choice, Likert, rating, ranking and numeric formats.
  4. Write the item wording. Keep each item short, specific and neutral. Use plain language, avoid jargon and double negatives, and ensure every item asks about exactly one thing.
  5. Build the response scales. Decide on the number of scale points (5- and 7-point are common), label the anchors clearly, keep the scale direction consistent, and make sure options are exhaustive and mutually exclusive. Decide in advance whether to offer a neutral midpoint or a “not applicable” option.
  6. Sequence and format. Open with easy, non-threatening questions; group items by topic; place sensitive or demographic items near the end; and keep the questionnaire as short as the research allows to protect response rates.
  7. Pilot and refine. Test the draft on a small group resembling your sample. Use the pilot to catch ambiguous wording, floor/ceiling effects and items that everyone answers identically, and to compute preliminary reliability before full data collection.
Example: A business student studies whether flexible working predicts employee wellbeing among UK office staff (n = 220). After operationalising “wellbeing” using an established five-item scale, the questionnaire includes:

1. How many days per week do you currently work from home? ___ days (numeric — ratio)
2. I am able to balance my work and personal life. Strongly disagree 1—2—3—4—5 Strongly agree (Likert — ordinal/interval)
3. Over the last two weeks I have felt relaxed. Never / Rarely / Sometimes / Often / Always (Likert frequency — ordinal)
4. Which best describes your role? Junior / Mid-level / Senior / Manager (multiple-choice — nominal)
5. Rank these factors by how much they affect your wellbeing (1 = most): pay, autonomy, commute, workload, recognition (ranking — ordinal)
6. Do you have caring responsibilities at home? Yes / No (dichotomous — nominal)

Items 2–3 plus three further statements are summed into a single wellbeing score, which is then correlated with days worked from home.

Avoid leading and double-barrelled items

Two wording faults quietly destroy data quality. A leading question nudges the respondent toward a particular answer; a double-barrelled question asks about two things at once, so a single answer cannot be interpreted. Compare:

Don’t write Do write Why
“How much do you love our excellent new service?” “How satisfied are you with our new service?” (1–5) Removes the leading, loaded wording
“Is the software fast and easy to use?” Two items: “The software is fast.” / “The software is easy to use.” Splits a double-barrelled item
“You wouldn’t disagree that training was useful, would you?” “The training was useful.” Strongly disagree–Strongly agree Removes the double negative and the nudge

Other faults to police: ambiguous terms (“regularly”, “often”), assumptions the respondent may not meet, overlapping response ranges, and unbalanced scales with more positive than negative options.

Ensuring reliability and validity of the instrument

A questionnaire is only as good as its measurement properties. Reliability is consistency — would the instrument give the same result on repeated use? Validity is accuracy — does it measure the construct it claims to? These are distinct: a scale can be reliable (consistent) yet invalid (consistently measuring the wrong thing). Both are covered in depth in our guide to reliability and validity.

The main levers you control at the design stage are:

  • Piloting — the single most valuable safeguard; it surfaces ambiguity, mistimed length and weak items before they reach your real sample.
  • Internal-consistency reliability — for multi-item scales, compute Cronbach’s alpha; values of roughly 0.70 or above are conventionally treated as acceptable, indicating the items hang together as a coherent scale.
  • Test–retest reliability — administer the questionnaire twice to the same people and correlate the scores to check stability over time.
  • Content validity — have subject experts judge whether the items cover the full construct.
  • Construct validity — check that the scale correlates as theory predicts with related and unrelated measures; using a previously validated scale gives you much of this for free.
Worked example: scoring a 4-item Likert wellbeing scale

Five respondents answer a four-item wellbeing scale (items W1–W4), each scored 1 (Strongly disagree) to 5 (Strongly agree). We compute each respondent’s scale score, then the item mean and SD for W1, and finally check internal consistency.

Respondent W1 W2 W3 W4 Sum Mean
R1 4 5 4 4 17 4.25
R2 2 3 2 3 10 2.50
R3 5 4 5 5 19 4.75
R4 3 3 4 2 12 3.00
R5 1 2 2 2 7 1.75

1. Scale score (sum & mean) per respondent. Sum the four items, then divide by 4. For R1: 4 + 5 + 4 + 4 = 17, so mean = 17 ÷ 4 = 4.25. R2 sums to 10 (mean 2.50), R3 to 19 (4.75), R4 to 12 (3.00), R5 to 7 (1.75). Higher scores indicate higher reported wellbeing.

2. Item mean & SD for W1. W1 values are 4, 2, 5, 3, 1.
• Mean = (4 + 2 + 5 + 3 + 1) ÷ 5 = 15 ÷ 5 = 3.00.
• Deviations from the mean: +1, −1, +2, 0, −2. Squared: 1, 1, 4, 0, 4; sum = 10.
• Using the sample SD (divide by n − 1 = 4): variance = 10 ÷ 4 = 2.50, so SD = √2.50 ≈ 1.58.

3. Check internal consistency. Because W1–W4 are meant to measure one construct, you confirm they “hang together” before summing them. Compute Cronbach’s alpha across the four items in SPSS, R or by hand from the item and total variances. A value of α ≥ 0.70 is the conventional threshold for acceptable reliability; if alpha falls below it, inspect the “alpha if item deleted” output and consider revising or dropping the weakest item rather than reporting an unreliable score.

Note: the five-respondent table is illustrative — a real reliability estimate needs an adequate sample (commonly n ≥ 100+).

“Reliability refers to the extent to which your data collection techniques or analysis procedures will yield consistent findings.” (Source: Saunders, Lewis & Thornhill, Research Methods for Business Students)

How quantitative questionnaire data is analysed

Because responses are numeric, analysis follows the standard two-stage logic of quantitative research. You begin with descriptive statistics to summarise the data, then move to inferential statistics to test hypotheses and generalise from sample to population.

  • Descriptive: frequencies and percentages for nominal items; medians and modes for ordinal items; means and standard deviations for interval/ratio scores; plus charts such as bar charts and histograms.
  • Inferential: correlations (e.g. Pearson’s r between two scale scores), group comparisons (t-tests, ANOVA), associations between categories (chi-square), and prediction (regression). The level of measurement of each variable dictates which test is valid.

This is exactly why matching question type to level of measurement during design matters so much: an ordinal item cannot support a mean, and a nominal item cannot enter a correlation. For a fuller treatment of the second stage, see our guide to inferential statistics.

Quantitative vs qualitative questionnaire

It helps to see the two side by side. The choice is driven by your research question, not by preference.

Dimension Quantitative questionnaire Qualitative questionnaire
Question type Mostly closed (fixed options) Mostly open-ended
Data produced Numerical scores and categories Text, narratives, meanings
Aim Measure, compare, test, generalise Explore, understand, interpret
Sample size Large (for statistical power) Small to moderate (for depth)
Analysis Descriptive & inferential statistics Thematic / content analysis
Strength Breadth, comparability, replicability Depth, context, richness
Reasoning Deductive (test theory) Inductive (build theory)

For a broader comparison of the two paradigms beyond the instrument itself, see quantitative vs qualitative research.

Turn your questionnaire data into findings

Our statisticians clean, test and interpret your survey data — from Cronbach’s alpha to regression — with clear, reportable results.

Strengths and limitations

Understanding where a quantitative questionnaire is strong and where it is weak helps you defend your method choice in the methodology chapter and interpret your findings honestly.

Strengths. Questionnaires are efficient: once designed, they can be distributed to hundreds or thousands of respondents at low cost, especially online. Because every respondent receives an identical instrument, the data is highly comparable and the study is straightforward to replicate — a key marker of rigour in deductive research. Closed responses are quick to code and feed directly into statistical software, and anonymity often encourages more honest answers on sensitive topics than a face-to-face interview would. With an adequate, representative sample drawn through sound sampling methods, findings can be generalised to the wider population.

Limitations. The same standardisation that gives breadth removes depth: you learn what respondents think but not the nuanced why. Fixed options can force respondents into answers that do not quite fit, and poorly written items introduce measurement error that no amount of clever analysis can repair. Self-report data is vulnerable to social-desirability bias, acquiescence bias (a tendency to agree) and careless responding. Low response rates threaten representativeness, and the questionnaire cannot probe or clarify in the moment as an interviewer can. These trade-offs are precisely why mixed-methods designs are popular: the questionnaire supplies the numbers, and a smaller qualitative strand supplies the interpretation.

Common mistakes to avoid

Most weak questionnaires fail for a small, recurring set of reasons. Watch for these:

  • Skipping operationalisation — writing items before clearly defining the construct, so the scale measures something fuzzy.
  • Mismatching item type and analysis — collecting ordinal data and then trying to compute a mean, or nominal data and trying to correlate it.
  • Leading, loaded or double-barrelled wording — biasing responses or making them uninterpretable.
  • Unbalanced or overlapping scales — more positive than negative options, or response ranges such as “0–5” and “5–10” that share a boundary.
  • Over-long instruments — fatigue lowers data quality and response rates; cut every item that does not map to a research question.
  • No pilot — launching to the full sample before testing wording, length and reliability.
  • Ignoring missing data — failing to plan in advance how partial responses will be handled in analysis.

Bringing it together

A strong quantitative questionnaire is engineered, not improvised. Define your constructs, operationalise them into the right closed question types, write clean unambiguous items, build sensible response scales, pilot rigorously, and confirm reliability and validity before you collect a single row of real data. Do that, and the instrument hands you clean, well-structured numbers that descriptive and inferential statistics can turn into defensible answers to your research questions.

Frequently Asked Questions

What is a questionnaire in quantitative research?

It is a structured, standardised instrument made up mostly of closed questions that convert respondents’ answers into numerical data. Every participant answers the same items in the same order, so the data can be counted, scored and analysed statistically to test hypotheses or describe a population.

The main types are dichotomous (yes/no), multiple-choice, Likert agreement scales, rating and semantic-differential scales, ranking questions, and numeric (open-numeric) items. Each yields a particular level of measurement — nominal, ordinal, interval or ratio — which determines the statistics you can run on it.

A single Likert item is strictly ordinal, because the gaps between points are not guaranteed to be equal. However, a Likert scale made of several items summed or averaged into a composite score is commonly treated as interval-level data, which allows means, t-tests and correlations.

Use validated scales where possible, write clear single-barrelled items, and pilot the draft on a similar group. For multi-item scales compute Cronbach’s alpha (around 0.70 or above is acceptable) for internal consistency, consider test–retest for stability, and use expert review and theory-based correlations to check validity.

A double-barrelled question asks about two things at once, such as “Is the software fast and easy to use?” A single answer cannot tell you which part the respondent meant, so the data is uninterpretable. Always split it into two separate items, one per attribute.

Analysis runs in two stages. Descriptive statistics (frequencies, percentages, means, standard deviations) summarise the data, then inferential statistics (correlations, t-tests, ANOVA, chi-square, regression) test hypotheses and generalise to the population. The valid test depends on each variable’s level of measurement.

About Aadam Mae

Avatar for Aadam MaeAadam Mae, an academic researcher and author with a PhD in NLP (Natural Language Processing) at ResearchProspect. Mae's work delves into the intricacies of language and technology, delivering profound insights in concise prose. Pioneering the future of communication through scholarship.

WhatsApp Live Chat