Thematic Analysis: Steps & Examples

Home > Library > Research Methodology > Thematic Analysis: Steps & Examples

Published by Aadam Mae at August 16th, 2021 , Revised On June 17, 2026

Thematic analysis is a qualitative method for identifying, analysing, and reporting patterns of meaning (“themes”) across a dataset such as interview transcripts, focus-group notes, or open-ended survey responses. In short: you read your data closely, attach short labels (codes) to meaningful segments, cluster related codes into candidate themes, then refine and name those themes to tell a coherent, evidence-based story that answers your research question. It is flexible, accessible to first-time qualitative researchers, and compatible with most epistemologies, which is why it is one of the most widely used approaches in dissertations across psychology, education, business, health, and the social sciences.

Use thematic analysis when your data are textual or transcribed, your aim is to understand experiences, perceptions, or meanings rather than to count and measure them, and your research question is exploratory (for example, “How do final-year nursing students experience their first clinical placement?”). The canonical framework is the six-phase approach set out by Virginia Braun and Victoria Clarke in their landmark 2006 paper, which this guide follows in detail below.

What is thematic analysis?

Thematic analysis (TA) is a method, not a methodology tied to a single theory. That distinction matters: because it is theoretically flexible, you can apply TA within a realist framework (treating accounts as a fairly direct window onto experience) or a constructionist one (treating accounts as shaped by language and context). What stays constant is the goal — identifying patterned meaning across a dataset and reporting it rigorously. A theme captures something important about the data in relation to your research question and represents a level of patterned response or meaning. Crucially, a theme is not simply a topic or a domain summary; it is an analytic claim with a central organising concept, supported by coded data extracts.

TA is a form of qualitative analysis applied to the kind of data produced by interviews, focus groups, diaries, and open-ended questionnaire items. If you are still designing the instruments that will generate that data, our guides on methods of data collection and the qualitative research questionnaire will help you produce transcripts rich enough to analyse thematically.

When should you use thematic analysis?

Thematic analysis is a strong choice when:

Your research question is exploratory and concerns experiences, views, perceptions, or meanings (“What”, “How”, and “Why” questions).
Your data are textual or transcribed — interviews, focus groups, free-text survey answers, reflective journals, or documents.
You want a method that is accessible and does not require you to commit to the full theoretical machinery of grounded theory, IPA, or discourse analysis.
You need flexibility to work inductively (themes from the data) or deductively (themes shaped by existing theory), and at a semantic or a latent level.

It is a weaker choice when you want to test causal hypotheses, quantify the prevalence of categories with statistical rigour, or analyse the fine mechanics of language use — for the last of these, a more linguistically focused approach is appropriate. If you mainly want to count the frequency of predefined categories, you are closer to content analysis than thematic analysis (see the comparison below).

Key choices before you start: inductive vs deductive, semantic vs latent

Before coding a single transcript, make two explicit decisions. First, will your themes be driven by the data (inductive) or by an existing theory or coding frame (deductive)? Second, will you analyse what participants explicitly say (semantic themes) or interpret the underlying assumptions and ideas beneath the surface (latent themes)? These are not strict either/or boxes — most real analyses sit somewhere on a continuum — but stating your position makes your analysis transparent and defensible.

Dimension	Option	What it means	When to choose it
Coding orientation	Inductive (“bottom-up”)	Codes and themes are derived from the data itself, without trying to fit them into a pre-existing frame.	Under-researched topics; you want participants’ own framing to lead.
Coding orientation	Deductive (“top-down”)	Coding is guided by existing theory, prior research, or a predefined codebook.	You are testing or extending an established framework or comparing against theory.
Level of meaning	Semantic	Themes describe the explicit, surface meaning of what participants said.	You want to summarise and organise the explicit content.
Level of meaning	Latent	Themes interpret underlying ideas, assumptions, and ideologies beneath the surface.	You are working interpretively/constructionist and want analytic depth.

A psychology dissertation exploring an unfamiliar coping strategy might run inductive + semantic to stay close to participants’ words; a sociology study examining how gendered assumptions shape workplace talk might run deductive + latent, reading beneath the surface through a theoretical lens.

The six phases of thematic analysis (Braun & Clarke, 2006)

Braun and Clarke describe TA as a recursive — not linear — process. You move back and forth between phases rather than marching through them once. The six phases below are the spine of almost every credible TA dissertation, and examiners expect to see them evidenced in your methods chapter.

“Thematic analysis is a method for identifying, analysing, and reporting patterns (themes) within data.” (Source: Braun & Clarke, 2006)

The six phases of thematic analysis (Braun & Clarke, 2006). The dashed orange back-arrow signals that the process is recursive: you revisit earlier phases — re-coding, re-grouping, and redefining — rather than moving through them once in a straight line.

Phase 1 — Familiarising yourself with the data

What you do: Transcribe your recordings (if you have not already), then read and re-read the entire dataset actively, noting initial ideas. Immersion is the point — you cannot identify patterns in data you do not know intimately. Keep a notebook or memo file of first impressions; these are not codes yet, just analytic hunches.

Mini example: While transcribing interviews with new graduates about remote onboarding, you repeatedly notice people mentioning “not knowing who to ask” — you jot it down as an early observation.

Phase 2 — Generating initial codes

What you do: Work systematically through the whole dataset, attaching concise codes to features of the data that are interesting and relevant to your question. A code is a short label (a word or short phrase) summarising the essence of a data segment. Code inclusively, code for as many potential patterns as you can, and keep a little surrounding context with each extract. You can code by hand, in a spreadsheet, or in software such as NVivo or ATLAS.ti.

Mini example: The sentence “I didn’t want to bother my manager with what felt like a stupid question” might be coded as fear of appearing incompetent and reluctance to ask for help.

Phase 3 — Searching for themes

What you do: Shift from codes to themes. Sort your many codes into potential themes and collate the relevant coded extracts under each. A useful technique is a thematic map or even physical cards — grouping codes that seem to share a central idea, and identifying relationships between themes and sub-themes. Some codes will form main themes, some sub-themes, and some may be set aside as a “miscellaneous” pile for now.

Mini example: Codes such as reluctance to ask for help, fear of appearing incompetent, and no informal channels cluster into a candidate theme: “The high social cost of asking questions remotely.”

Phase 4 — Reviewing themes

What you do: Refine your candidate themes against two levels. Level one: re-read the collated extracts for each theme — do they form a coherent pattern? Level two: re-read the entire dataset — do the themes accurately reflect the data as a whole, and have you missed anything? Themes that lack enough supporting data collapse; themes that are too broad split; some merge. By the end, the boundaries of each theme should be clear.

Mini example: You realise “The high social cost of asking questions” overlaps heavily with a separate “isolation” theme, so you merge and rework them into a sharper theme about the erosion of informal learning channels.

Phase 5 — Defining and naming themes

What you do: For each theme, write a short definition (a couple of sentences) capturing its central organising concept — what is unique and central about it, and the story it tells in relation to your question and the other themes. Give each theme a concise, informative, ideally engaging name. If you cannot define a theme in a sentence or two, it probably is not yet a coherent theme.

Mini example: You name a theme “Learning by osmosis — and its quiet collapse online,” defined as the loss of the incidental, overheard learning that physical offices provide and that remote onboarding fails to replace.

Phase 6 — Producing the report

What you do: Write up the analysis. Select vivid, compelling extract examples, relate the analysis back to your research question and the literature, and build an argument — do not just describe. Good TA writing weaves data extracts and your interpretation together so the reader can see why your themes are warranted, not merely that you have them.

Mini example: Under each theme you present two or three quotations, interpret them, and connect the pattern to existing onboarding and organisational-socialisation literature.

A worked example: from codes to a theme

The clearest way to understand TA is to watch a short interview extract turn into coded segments and then into a theme.

Example: Study: A health dissertation asks, “How do patients experience self-managing type 2 diabetes?” One participant says:

“Honestly, every meal feels like a maths exam now. I check the labels, I count the carbs, and even then I’m never sure I’ve got it right — so I just feel guilty most of the time, like I’m failing at something I should be able to do.”

Coding (Phase 2):
• “every meal feels like a maths exam” → cognitive burden of monitoring
• “never sure I’ve got it right” → uncertainty about correct self-management
• “I just feel guilty… like I’m failing” → self-blame / moralised illness

Clustering (Phase 3–5): Across the dataset, cognitive burden, uncertainty, and self-blame recur together. They are clustered, reviewed, and defined into one latent theme: “Managing diabetes as a moral test, not just a medical task” — capturing how participants frame everyday self-care as a question of personal virtue and failure, not only clinical behaviour. Notice the theme is an interpretive claim, not a topic label like “diet.”

Worked example — from one interview extract to a named theme

Study: An education dissertation asks, “How do first-year university students experience moving from school to independent study?” Below is one short extract from a recorded interview, coded segment by segment, then grouped into a single named theme with a supporting quotation.

Interview extract (Participant 4)

“At school the teachers basically told you exactly what to revise and chased you for it. Here, nobody checks. I had three weeks of reading and I just… didn’t do it, and no one noticed until the essay was due. I felt like I’d been thrown in and left to sink or swim on my own.”

Coding the segments (Phase 2)

Text segment	Code
“the teachers basically told you exactly what to revise and chased you for it”	loss of external structure
“Here, nobody checks… no one noticed until the essay was due”	absence of monitoring / accountability
“thrown in and left to sink or swim on my own”	feeling unsupported / sudden independence

Grouping the codes into one theme (Phases 3–5)

loss of external structure+absence of monitoring+feeling unsupported→Theme: “Sink or swim” — independence as abandonment

Analytic claim (Phase 6)

Students did not experience new-found freedom as liberating; they experienced the withdrawal of school-style structure as a form of abandonment, framing self-direction as being left to cope alone rather than as empowerment.

Supporting quote: “I felt like I’d been thrown in and left to sink or swim on my own.” (Participant 4)

Notice the theme is an interpretive claim (“independence as abandonment”), not a topic label like “independent study” — and it is built entirely from qualitative meaning, with no counts or percentages.

Reflexive TA vs codebook and coding-reliability approaches

Braun and Clarke have since clarified that “thematic analysis” is really a family of methods, and they distinguish three broad types. Knowing where your approach sits prevents you from mixing incompatible procedures — a common examiner criticism.

Reflexive TA — the approach their 2006 paper grew into. Coding is organic and flexible, themes are actively generated by the researcher as analytic outputs (not “found”), and the researcher’s subjectivity is treated as a resource. There is no codebook fixed in advance and no expectation of multiple coders agreeing.
Codebook TA — uses a structured coding frame (often developed partly in advance) to map the data, blending some structure with qualitative interpretation. Useful in applied or team research with predefined questions.
Coding-reliability TA — emphasises a fixed codebook and inter-rater reliability (e.g., multiple coders, agreement statistics), reflecting a more (post)positivist stance.

An important practical point: in reflexive TA, seeking inter-coder “reliability” is not a virtue and can even be conceptually inconsistent — themes are interpretations, not objective facts to be agreed upon. So decide your variant first, then justify your quality criteria accordingly rather than importing reliability checks that belong to a different tradition.

Thematic analysis vs content analysis

Students frequently confuse these two. Both work with textual data and both involve coding, but their logic differs. Content analysis is often (though not always) more quantitative: it tends to count the frequency of predefined categories and can report numerical prevalence. Thematic analysis is qualitative throughout: it seeks patterns of meaning and builds interpretive themes, and prevalence (“how many participants”) is not what makes something a theme. As a rule of thumb, if your output is mainly a frequency table, you are doing content analysis; if it is a set of argued, interpreted themes illustrated with extracts, you are doing TA. Our dedicated guide to content analysis sets out that method in full.

Common mistakes (and how to avoid them)

Themes that are really topic summaries. “Barriers,” “Benefits,” or “Diet” are domain labels, not themes. A genuine theme has a central organising concept and makes a point. Rename “Barriers” into the specific claim your data actually support.
Too many themes (or too many sub-themes). A typical undergraduate or master’s study reports roughly three to six well-developed themes. Fifteen thin themes signal under-analysis — you have organised the data but not interpreted it.
Using interview questions as themes. If your themes mirror your topic guide one-to-one, you have summarised the interview structure, not analysed the data.
Thin evidence. Each theme needs enough coded extracts from across (ideally) multiple participants. One striking quote is an anecdote, not a theme.
Anecdotalism — cherry-picking. Choosing only quotes that fit your story, while ignoring disconfirming data, undermines trustworthiness. Actively look for and report variation.
Mismatched quality criteria. Applying inter-rater reliability to a reflexive TA, or claiming themes “emerged” (which implies they were passively found rather than actively constructed).

How to do thematic analysis well: a checklist

State your position upfront: which TA variant (reflexive / codebook / coding-reliability), inductive or deductive, semantic or latent.
Transcribe carefully and immerse yourself before coding.
Code the whole dataset systematically and keep an audit trail of how codes became themes.
Build, review, and refine a thematic map; check themes against both the extracts and the full dataset.
Define and name each theme with a one- or two-sentence central organising concept.
Write an analytic report — interpret extracts, link to literature, and address disconfirming cases.
Keep reflexive notes on how your perspective shaped the analysis.

Struggling to turn transcripts into credible themes?

Our qualified academics can help you design, analyse, and write up a methodologically sound thematic analysis chapter.

Get Dissertation Help

In summary

Thematic analysis is a flexible, rigorous way to make sense of qualitative data by building interpreted themes from systematic coding. Anchor your work in Braun and Clarke’s six recursive phases, decide your variant and your inductive/deductive and semantic/latent positions before you start, support every theme with evidence, and treat themes as analytic claims rather than topic headings. Do that, and your analysis will be both transparent and convincing — exactly what examiners reward.

Related methodology guides

Qualitative Data Analysis
Grounded Theory
Narrative Analysis

Frequently Asked Questions

What is thematic analysis in simple terms?▾

Thematic analysis is a qualitative method for finding, organising, and reporting patterns of meaning (themes) across data such as interview transcripts or open-ended survey answers. You code meaningful segments, group related codes into themes, then refine and name those themes to answer your research question. It is widely used because it is flexible and accessible to first-time qualitative researchers.

What are the six steps of thematic analysis?▾

Braun and Clarke’s (2006) six phases are: (1) familiarising yourself with the data, (2) generating initial codes, (3) searching for themes, (4) reviewing themes, (5) defining and naming themes, and (6) producing the report. The process is recursive, meaning you move back and forth between phases rather than completing them strictly in order.

What is the difference between thematic analysis and content analysis?▾

Both analyse textual data through coding, but content analysis is often more quantitative — it counts the frequency of predefined categories and can report numerical prevalence. Thematic analysis is qualitative throughout: it builds interpreted themes around patterns of meaning, and how often something appears does not determine whether it is a theme.

How many themes should a thematic analysis have?▾

There is no fixed rule, but most undergraduate and master’s dissertations report roughly three to six well-developed themes, sometimes with sub-themes. A long list of thin themes usually signals under-analysis — you have organised the data but not yet interpreted it into coherent, evidenced patterns.

What is the difference between semantic and latent themes?▾

Semantic themes describe the explicit, surface meaning of what participants actually said. Latent themes interpret the underlying ideas, assumptions, or ideologies beneath the surface. Semantic analysis stays close to the data; latent analysis is more interpretive and is often associated with a constructionist theoretical position.

What is reflexive thematic analysis?▾

Reflexive thematic analysis is the approach Braun and Clarke developed from their 2006 paper. Coding is flexible and organic, themes are actively generated by the researcher as analytic outputs rather than passively ‘found’, and the researcher’s subjectivity is treated as a resource. Unlike coding-reliability approaches, it does not rely on a fixed codebook or inter-rater agreement statistics.

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

Thematic Analysis: Steps & Examples

What is thematic analysis?

When should you use thematic analysis?

Key choices before you start: inductive vs deductive, semantic vs latent

The six phases of thematic analysis (Braun & Clarke, 2006)

Phase 1 — Familiarising yourself with the data

Phase 2 — Generating initial codes

Phase 3 — Searching for themes

Phase 4 — Reviewing themes

Phase 5 — Defining and naming themes

Phase 6 — Producing the report

A worked example: from codes to a theme

Reflexive TA vs codebook and coding-reliability approaches

Thematic analysis vs content analysis

Common mistakes (and how to avoid them)

How to do thematic analysis well: a checklist

Struggling to turn transcripts into credible themes?

In summary

Related methodology guides

Frequently Asked Questions

You May Also Like

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

OUR EXPERTS CAN HELP WITH YOUR:

MORE AI TOOLS

Thematic Analysis: Steps & Examples

What is thematic analysis?

When should you use thematic analysis?

Key choices before you start: inductive vs deductive, semantic vs latent

The six phases of thematic analysis (Braun & Clarke, 2006)

Phase 1 — Familiarising yourself with the data

Phase 2 — Generating initial codes

Phase 3 — Searching for themes

Phase 4 — Reviewing themes

Phase 5 — Defining and naming themes

Phase 6 — Producing the report

A worked example: from codes to a theme

Reflexive TA vs codebook and coding-reliability approaches

Thematic analysis vs content analysis

Common mistakes (and how to avoid them)

How to do thematic analysis well: a checklist

Struggling to turn transcripts into credible themes?

In summary

Related methodology guides

Frequently Asked Questions

You May Also Like