Trustworthiness in qualitative research is the degree to which a qualitative study’s findings can be trusted as accurate, well-grounded and defensible — it is the qualitative parallel to the concepts of reliability and validity used in quantitative work. The term was formalised by Egon Guba and Yvonna Lincoln in their 1985 book Naturalistic Inquiry, where they argued that the positivist yardsticks of internal validity, external validity, reliability and objectivity do not transfer cleanly to interpretive research, and proposed four parallel criteria instead: credibility, transferability, dependability and confirmability, later joined by a fifth criterion, authenticity.
In short: you demonstrate trustworthiness not by claiming your interview study is “valid and reliable” in the statistical sense, but by showing — through specific, documented strategies such as triangulation, member checking, thick description and an audit trail — that your interpretation of the data is faithful to participants’ realities and could be followed and confirmed by someone else.
What “trustworthiness” means — and why qualitative research needs its own word
Quantitative research rests on a shared vocabulary of quality: a measure is reliable if it produces consistent results, and valid if it measures what it claims to. These ideas assume a single, stable, measurable reality that exists independently of the researcher. Most qualitative research does not assume that. It works from an interpretivist or constructivist position in which meaning is co-constructed between participant and researcher, contexts are particular rather than generalisable, and the researcher is the primary “instrument” of data collection and analysis.
Lincoln and Guba’s insight was that quality still matters enormously — a sloppy or biased qualitative study is just as worthless as a flawed survey — but it has to be judged on its own terms. Borrowing the positivist labels wholesale either misdescribes what qualitative work does or sets it up to fail against criteria it was never designed to meet. So they reframed the underlying question. Instead of asking “Is this study reliable and valid?”, they asked: “How can an inquirer persuade their audience that the findings of an inquiry are worth paying attention to, worth taking account of?” That persuasion is what trustworthiness delivers.
“The basic issue in relation to trustworthiness is simple. How can an inquirer persuade his or her audiences that the findings of an inquiry are worth paying attention to, worth taking account of?” (Source: Lincoln & Guba, 1985)
This is not a soft, anything-goes standard. Each of the four criteria maps onto a precise positivist concern, and each comes with concrete techniques you are expected to apply and report. Examiners and reviewers look for exactly these. The sections below define each criterion, pair it with its quantitative parallel, and give you the strategies that satisfy it.
The four criteria mapped to their quantitative parallels
The architecture of trustworthiness is a one-to-one mapping. Each qualitative criterion answers the same fundamental question as a quantitative concept, but in a way that fits interpretive inquiry. The table sets out the correspondence; the figure visualises it.
| Trustworthiness criterion | Quantitative parallel | The question it answers | Headline strategies |
|---|---|---|---|
| Credibility | Internal validity | Are the findings a faithful, believable account of the participants’ realities? | Prolonged engagement, triangulation, member checking, peer debriefing, negative-case analysis |
| Transferability | External validity / generalisability | Could these findings apply to other contexts, and can the reader judge that? | Thick description, purposive sampling |
| Dependability | Reliability | Is the research process consistent, logical and documented enough to be followed? | Audit trail, code–recode, external audit |
| Confirmability | Objectivity / neutrality | Are the findings shaped by the data and participants rather than the researcher’s bias? | Reflexivity, audit trail, confirmability audit |
| Authenticity | (no direct parallel) | Does the research fairly represent different voices and have a positive impact on those studied? | Fairness, ontological & educative authenticity |
Credibility — the qualitative parallel to internal validity
Credibility asks whether your findings are a believable, faithful representation of the participants’ constructed realities. It is the most important of the four criteria and the one examiners scrutinise most closely, because it is the qualitative answer to “are these conclusions actually warranted by the data?” You build credibility through a combination of the following strategies — you do not need all of them, but you should be able to justify the ones you chose.
- Prolonged engagement: spending enough time in the field or with participants to understand the context, build rapport and move past surface or “performed” responses. A two-week ethnography rarely earns the credibility a six-month immersion does.
- Triangulation: using multiple data sources, methods, investigators or theories so that findings are corroborated from more than one angle. If interviews, observations and documents all point the same way, your interpretation is far more defensible. (See our guide on triangulation for the four types.)
- Member checking (respondent validation): taking your transcripts, summaries or emerging themes back to participants to confirm they recognise themselves in your account. If participants say “yes, that captures what I meant,” credibility rises.
- Peer debriefing: exposing your analysis to a disinterested colleague or supervisor who probes your assumptions, plays devil’s advocate and surfaces biases you cannot see yourself.
- Negative-case analysis: actively searching for cases that contradict your emerging pattern and either revising the interpretation to accommodate them or explaining why they differ. This guards against cherry-picking confirming quotes.
Transferability — the parallel to external validity
Quantitative generalisation relies on probability sampling so results extend to a defined population. Qualitative research usually cannot — and does not try to — generalise statistically from a handful of purposively chosen participants. Instead it offers transferability: the responsibility for judging whether findings apply to another setting is shifted to the reader. Your job is to give the reader enough information to make that judgement.
- Thick description: a rich, detailed account of the context, participants, setting and the phenomenon itself — the term comes from Clifford Geertz. A reader in a similar context can then assess “how like my situation is this?”
- Purposive sampling: deliberately selecting information-rich participants who illuminate the phenomenon, and describing the selection logic clearly so readers understand the boundaries of the case.
Dependability — the parallel to reliability
Reliability in quantitative terms means a measure repeated under the same conditions yields the same result. Qualitative contexts shift and the researcher learns as they go, so exact replication is neither expected nor desirable. Dependability instead asks whether the research process is logical, traceable and documented, so that someone could follow it and understand how you reached your conclusions.
- Audit trail: keeping a complete, dated record of raw data, field notes, coding decisions, analytic memos and the evolution of your themes, so the path from data to findings is fully transparent.
- Code–recode strategy: coding a portion of the data, leaving it for a period (commonly two weeks), then recoding it and comparing. High consistency between the two passes signals a dependable coding process.
- External audit: inviting an outside researcher to examine the process and product of the study and judge whether the findings are supported by the data.
Confirmability — the parallel to objectivity
Positivism prizes objectivity — findings free of the investigator’s influence. Interpretive research accepts the researcher is part of the meaning-making, so it cannot claim pure objectivity. Confirmability is the realistic substitute: it asks whether the findings are clearly grounded in the data and participants rather than in the researcher’s preferences, motivations or assumptions.
- Reflexivity: explicitly examining and reporting your own position, assumptions, prior experience and potential biases — often through a reflexive journal — so readers can see where you stand and how it might shape interpretation.
- Audit trail: the same documented chain of evidence that supports dependability also supports confirmability, by letting an auditor trace every conclusion back to its data source.
- Confirmability audit: a colleague checks that interpretations and conclusions are traceable to the data and not imposed on it.
Authenticity — the fifth criterion
Lincoln and Guba later added authenticity, which has no neat positivist parallel. It concerns the ethics and impact of the research: fairness (representing all stakeholder voices and constructions even-handedly), ontological and educative authenticity (participants and others develop a more sophisticated understanding through the research), and catalytic and tactical authenticity (the research prompts and enables action). It is most relevant in participatory, action and emancipatory traditions. Many dissertations focus on the original four and treat authenticity briefly, which is acceptable — but knowing it exists shows methodological maturity.
Credibility. Priya spent a full term in two schools before interviewing (prolonged engagement). She triangulated 14 semi-structured interviews with classroom-observation notes and the schools’ internal policy documents — all three sources pointed to “workload intensification” as a dominant theme. She emailed each participant a one-page summary of their themes (member checking); 12 of 14 confirmed it captured their experience, and the two who pushed back prompted a refinement. She presented her coding frame to her supervisor and a fellow student (peer debriefing), and she deliberately examined three teachers who reported the change as positive rather than burdensome (negative-case analysis), revising her theme to distinguish early-career from experienced staff.
Transferability. She wrote a thick description of both schools — intake, size, region, technology provision — and justified her purposive sample (teachers across subjects and experience levels), so a reader in a comparable UK comprehensive can judge fit.
Dependability. She kept a dated audit trail of every coding decision and analytic memo, and recoded four transcripts two weeks after the first pass, finding 89% agreement (code–recode).
Confirmability. She kept a reflexive journal noting that, as a former teacher herself, she expected to find frustration — and used the audit trail so her supervisor could trace each theme back to specific data extracts rather than to her prior expectations.
In her methodology chapter, Priya devotes one paragraph to each criterion, names the strategy, and states exactly what she did — not what the textbook says could be done. That is what an examiner wants to see.
How to write trustworthiness into your methodology
Trustworthiness should appear as a dedicated subsection of your methodology chapter, usually after you describe data collection and analysis. Follow these steps.
- State the framework. Open by naming Lincoln and Guba (1985) and the four (or five) criteria you will address. This signals you know the qualitative quality vocabulary rather than misapplying the quantitative one.
- Take each criterion in turn. Give it its own short paragraph: define it in one sentence, then describe the specific strategies you actually used.
- Be concrete and past-tense. Write “I triangulated interview data with observation notes and member-checked summaries with all participants,” not “triangulation can be used to enhance credibility.” Examiners reward evidence of action.
- Link strategies to your analysis method. If you used thematic analysis or another approach to qualitative data analysis, show how the audit trail and code–recode fit your coding workflow.
- Acknowledge limits honestly. If time prevented prolonged engagement, say so and explain how other strategies compensated. Honesty strengthens rather than weakens trustworthiness.
Because the researcher is the instrument, your choices about methods of data collection directly shape how credible and confirmable the study can be — so describe them precisely.
Common mistakes to avoid
- Claiming “validity and reliability” unadapted. Writing “this qualitative study is valid and reliable” without reframing the terms signals you have not grasped the epistemological difference. Use the trustworthiness vocabulary instead, or explicitly explain the parallel.
- Listing strategies you did not actually use. Reciting “triangulation, member checking, peer debriefing” as a generic shopping list, when your study did none of them, is easily exposed in a viva. Only claim what you did.
- Confusing the criteria. Putting thick description under credibility, or member checking under dependability, shows shaky understanding. Keep the mapping straight (the table and figure above help).
- Treating member checking as a formality. Sending a transcript and getting silence is not validation. Genuine member checking engages participants with your interpretation, not just the raw text.
- Ignoring reflexivity. Pretending you had no influence on the data contradicts the very philosophy underpinning qualitative work. State your position openly.
- Forgetting the audit trail until the end. You cannot reconstruct a dated record of decisions retrospectively. Start it on day one.
Strengths and limitations of the trustworthiness framework
The framework’s strength is that it gives qualitative researchers a shared, recognised standard that does justice to interpretive work while still demanding rigour — it is the most widely cited quality framework in qualitative methodology, and most UK examiners expect to see it. Its limitations are debated: some scholars (such as Tracy with her “big-tent” criteria) argue it leans too heavily on a positivist scaffolding, and the criteria can become a box-ticking ritual if applied mechanically rather than thoughtfully. The remedy is to treat the strategies as ways to genuinely improve and evidence your inquiry, not as labels to scatter through a chapter.
Need your methodology to convince the examiner?
Our qualified academics can help you design, justify and write up a trustworthy qualitative study — from sampling to credibility strategies.
Get the framework right and trustworthiness stops being a chapter you dread and becomes the section that most clearly proves your study deserves to be believed.