"> How Does ChatGPT Work? A Student's Plain-English Guide - ResearchProspect
Home > Library > Using AI Tools > How Does ChatGPT Work? A Student’s Plain-English Guide

Published by at June 22nd, 2026 , Revised On June 22, 2026

How does ChatGPT work? ChatGPT works by predicting the most likely next chunk of text (a “token”) one step at a time, using a large language model (an LLM) that has learned statistical patterns from an enormous amount of human writing. It does not look anything up in a database, and it does not “understand” your essay the way a tutor does; it produces fluent, plausible text by repeatedly answering the question “given everything so far, what word probably comes next?” Knowing that one fact changes how you should treat its output as a student.

This guide explains, in plain English, what generative AI actually is, how a large language model is trained, what really happens when you type a prompt, why ChatGPT confidently invents facts and fake references, and where it fits ethically into legitimate university work. No maths degree required.

What ChatGPT actually is

ChatGPT is a chatbot built on top of a large language model (LLM) made by OpenAI. The model behind it belongs to the GPT family — “GPT” stands for Generative Pre-trained Transformer, which neatly names the three ideas you need to understand how it works:

  • Generative — it produces new text rather than retrieving a stored answer.
  • Pre-trained — it was trained once, in advance, on a vast collection of text before you ever used it.
  • Transformer — the specific type of neural network architecture (introduced by Google researchers in 2017) that makes the whole thing possible.

It is best thought of as an extraordinarily sophisticated autocomplete. Your phone’s keyboard suggests the next word; ChatGPT does the same thing, but with a model so large and so well-trained that it can draft an essay outline, explain photosynthesis, write code, or hold a conversation. The underlying mechanism — predict the next piece of text, then the next, then the next — is identical. For a fuller tour of those capabilities, see our guide to what ChatGPT can do, and if you are curious about the company behind it, we cover who owns ChatGPT separately. If you would rather see what genuine, human-written academic work looks like, browse our free academic samples.

The four-step journey: how does ChatGPT work end to end

Whenever someone asks how does ChatGPT work, the honest answer has four moving parts. Understanding each one demystifies both why it is so impressive and why it is so unreliable for facts.

Stage What happens Plain-English analogy
1. Tokenising Your prompt is chopped into small units called tokens (whole words or word-fragments) and turned into numbers. Translating English into the model’s internal “number language”.
2. Training (done once, in advance) The model has already adjusted billions of internal weights by reading huge volumes of text and learning which tokens tend to follow which. Years of reading, compressed into a giant pattern-spotting brain.
3. Prediction For your specific prompt, the model calculates a probability for every possible next token and picks one. Advanced autocomplete choosing the most likely next word.
4. Generation It adds that token, feeds the whole thing back in, and repeats — word by word — until the answer is complete. Writing a sentence one word at a time, never seeing the full ending in advance.

Step 1 — Tokens: how ChatGPT reads your prompt

ChatGPT does not see letters or even whole words the way you do. The first thing it does is split your text into tokens. A token is roughly four characters of English — sometimes a whole short word like “the”, sometimes a fragment like “ing” or “un”. The word “unbelievable” might become three tokens (“un”, “believ”, “able”).

Each token is converted into a long list of numbers (a “vector”) that captures something about its meaning and how it relates to other tokens. This is why ChatGPT can tell that “bank” in “river bank” differs from “bank” in “savings bank”: the surrounding tokens shift the numbers. Everything the model does from here on is arithmetic on these numbers — there is no dictionary, no fact file, and no little librarian fetching answers.

Example: Type the prompt “The capital of France is”. ChatGPT tokenises it, then predicts the next token. “Paris” has an overwhelmingly high probability because, across billions of sentences it trained on, “The capital of France is” was followed by “Paris” almost every time. It outputs “Paris” — not because it knows a fact, but because that token won the probability contest. Now try “The capital of the fictional country Zarbland is”. There is no real answer, yet ChatGPT will still confidently invent a city, because its job is to produce a likely-sounding next token, not to check whether the thing exists.

Step 2 — Training: how the model learned its patterns

The intelligence in ChatGPT comes from training, which happened before the product was ever released. Training has two broad stages.

Pre-training: reading at scale

The model was shown an enormous quantity of text — books, articles, websites, code, conversations — and given one simple game over and over: hide the next token and try to predict it. When it guessed wrong, an algorithm nudged billions of internal numbers (called parameters or weights) very slightly so it would do better next time. Repeat this trillions of times and the model gradually encodes the statistical structure of human language: grammar, facts, writing styles, reasoning patterns, even some biases present in the source text.

Two important consequences for students follow from this. First, the model has a knowledge cut-off — it only “knows” what was in its training data up to a certain date, so it can be out of date. Second, it does not store the original sources as retrievable documents; it stores blurry statistical patterns. That is precisely why it cannot reliably give you a real, checkable citation — a point we return to below.

Fine-tuning and human feedback

A raw pre-trained model is a brilliant but unruly text predictor. To turn it into the helpful, polite assistant you actually chat with, OpenAI added a second stage: fine-tuning, including a technique called Reinforcement Learning from Human Feedback (RLHF). Human reviewers ranked sample answers from best to worst, and the model was adjusted to favour the kinds of responses people preferred — clear, helpful, safe and on-topic. This is the stage that gives ChatGPT its characteristic tone and its refusal to help with harmful requests.

“ChatGPT is a sophisticated next-word prediction engine. It is not connected to a database of facts, and it does not have beliefs. It generates the statistically most plausible continuation of your text.” — a fair summary of how researchers describe modern large language models.

Step 3 and 4 — Prediction and generation: building the answer

When you press enter, the model takes your tokenised prompt and, for the very next position, computes a probability score for every token it knows — tens of thousands of options. It then selects one. Crucially, it does not always pick the single highest-probability token; a setting often called temperature introduces controlled randomness so the output feels natural and varied rather than robotic. This is also why asking the same question twice can give slightly different wording.

It then appends that token to the conversation and runs the whole process again to choose the next one, and the next, generating the reply one token at a time. The model never plans the full sentence in advance; each word is chosen in the moment, conditioned on everything written so far. The reason a long answer stays coherent is the transformer’s attention mechanism, which lets the model weigh how relevant every earlier token is to the token it is about to produce — effectively keeping the thread of the argument in view.

How a large language model generates textYour prompt“Explain photosynthesis”Split into tokensturned into numbersModel predictsprobability per tokenPick next tokenadd to the answerRepeat: feed the answer back in, one token at a timeNo database lookup — it never “checks” a fact.It outputs the most statistically plausible text.
Figure: ChatGPT generates text one token at a time, looping each new token back into the prompt — with no fact-checking step in between.

Why ChatGPT confidently makes things up

Because the model optimises for plausible text, not true text, it sometimes produces fluent, authoritative-sounding statements that are simply wrong. Researchers call these hallucinations. The most dangerous version for students is the fabricated reference: ask for sources and ChatGPT will happily generate a real-looking citation — plausible author, plausible journal, plausible year, even a DOI — for a paper that does not exist. It is not lying; it is doing exactly what it was built to do, which is to assemble text that looks like a citation.

Example — a fabricated citation: A student asks ChatGPT for “three peer-reviewed studies on caffeine and exam performance”. It returns: “Hargreaves, J. & Lin, M. (2019). Caffeine intake and cognitive performance in undergraduates. Journal of Applied Cognitive Research, 14(2), 233–251.” The formatting is flawless — and the paper, the authors, the volume and the journal are all invented. A marker who checks the reference list finds nothing, and the student faces an academic-misconduct enquiry. The lesson: every fact and every source from ChatGPT must be independently verified against the real literature.

This is also why ChatGPT is poor at maths it has not seen, at very recent events (the knowledge cut-off), and at anything requiring genuine reasoning over precise facts. It is brilliant at form and unreliable at truth — which is exactly why, if you do use it as a study aid, you should treat verification as your job, not the model’s. When you need work that is researched and written from scratch instead, our academic writing services are produced entirely by human experts.

How ChatGPT differs from a search engine

Students often treat ChatGPT as a smarter Google. Mechanically, they could not be more different, and confusing them is where most trouble starts.

Question A search engine ChatGPT
Where does the answer come from? Real, existing web pages it has indexed. Statistical patterns learned during training.
Can you click through to the source? Yes — it gives you links. No — the source does not exist as a retrievable document.
Is the information current? Updated continuously. Limited by the training cut-off (unless connected to live browsing).
Will it invent a plausible falsehood? Rarely — it surfaces what is already published. Yes — hallucination is a known failure mode.
Best used for… Finding and verifying real sources. Drafting, explaining, brainstorming and rephrasing.

Using ChatGPT ethically in your university work

Now the part that matters most for your degree. Understanding how ChatGPT works tells you exactly where it belongs in legitimate academic work — and where it does not. The golden rule is simple: ChatGPT can support your thinking, but the words and ideas you submit must be your own, and your university’s rules come first.

Generative AI is a fast-moving area of academic policy. Most UK universities now treat submitting AI-generated text as your own as a form of plagiarism or contract-cheating, which can carry the same penalties as buying an essay. Before you use it for any assessment, read your institution’s specific guidance and your module handbook, because rules differ by university, department and even individual assignment.

Legitimate, low-risk uses

  • Explaining a difficult concept in simpler terms so you can then write about it in your own words.
  • Brainstorming angles or generating questions to investigate before you do the real reading.
  • Suggesting how to structure an argument or an essay plan that you then research and write yourself.
  • Acting as a study partner — quizzing you, or checking your understanding of a topic.
  • Tidying the grammar of text you have already written, where your institution permits it.

High-risk uses to avoid

  • Submitting ChatGPT’s text as your own work — this is academic misconduct at virtually every UK university.
  • Trusting its “facts” or citations without checking them against the real literature.
  • Using it to evade plagiarism or AI-detection tools — this is dishonest and against the rules.
  • Letting it replace the reading, critical thinking and analysis that the assessment is actually testing.

If you want to explore where AI genuinely helps the academic process — from literature searching to organising notes — we have a dedicated guide on how ChatGPT can help you with your research that stays firmly inside academic-integrity boundaries. For original, ethically produced academic support, our human experts can help with everything from an essay writing service to full dissertation services, all written from scratch and never AI-generated. For longer projects, our dissertation writing services pair you with a subject specialist who produces every chapter from original research.

Example — the same task done two ways: Suppose your assignment is a 1,500-word essay on the causes of the 2008 financial crisis. The wrong way: prompt ChatGPT to “write a 1,500-word essay on the 2008 crisis”, paste the output, and submit it — unverified, un-cited, and not your work. The right way: ask ChatGPT to explain a confusing term like “collateralised debt obligation” in plain English, use that understanding to read real sources, build your own argument, write every sentence yourself, and cite the actual books and papers you read. Same tool, completely different outcome for your integrity — and your mark.

Checking whether text reads as AI-generated

Because so many institutions now screen for it, students and academics alike want to know whether a piece of writing reads as machine-generated. AI-detection tools estimate this by looking at statistical signatures — such as how predictable and uniform the word choices are — although no detector is perfect and false positives do happen. The honest, integrity-safe use of a detector is as a self-check on your own drafting habits and to understand your university’s screening, never as a way to “beat” detection.

Check your writing with our free AI detector

See how your draft reads against AI-detection signals — and keep your work authentically your own.

The bigger picture: generative AI in a nutshell

ChatGPT is one example of generative AI — systems that create new content (text, images, audio, code) by learning patterns from existing examples and then producing fresh combinations. The same next-token-prediction idea powers many text-based AI assistants; image generators use a related but different technique. What unites them is that they generate likely output based on training data, which is exactly why they are powerful, creative and fundamentally unreliable as sources of verified fact.

So the next time someone asks how does ChatGPT work, you can give the one-sentence answer with confidence: it is a transformer-based large language model that has learned the statistical patterns of human language, and it writes by predicting one likely token after another — fluent, fast, occasionally wrong, and only ever as trustworthy as the human who checks its work. Used that way, it is a genuinely useful study aid. Treated as an oracle, it is a fast route to a misconduct hearing.

Frequently Asked Questions

Does ChatGPT understand what it is saying?

Not in any human sense. ChatGPT has no beliefs, intentions or comprehension. It predicts the most statistically likely next token based on patterns learned during training. The result can read as if it understands, but it is performing very advanced pattern-matching, not genuine understanding — which is why it can produce fluent answers that are confidently wrong.

Because it is built to generate plausible text, not verified truth. When you ask for a source, it assembles something that looks like a citation — realistic author, journal and year — even when no such paper exists. These errors are called hallucinations. Always verify every fact and every reference from ChatGPT against the real literature before relying on it.

No. A search engine retrieves real, existing web pages and gives you clickable links. ChatGPT generates new text from statistical patterns and cannot point you to a retrievable source, because the source does not exist as a document. Use a search engine to find and verify real sources, and use ChatGPT for drafting, explaining and brainstorming.

A token is a small chunk of text — a whole short word or a word-fragment — that the model reads as a number; ChatGPT processes everything in tokens rather than letters or words. Parameters (or weights) are the billions of internal numbers the model adjusted during training to capture language patterns. More parameters generally means a more capable model.

Only within your institution’s rules, and never by submitting its output as your own work. Most UK universities treat passing off AI-generated text as your own as academic misconduct. Legitimate uses include explaining concepts, brainstorming and planning — always followed by your own reading, thinking and writing. Check your module handbook and university policy first.

Possibly. Many universities now screen submissions with AI-detection tools that flag statistically machine-like writing, and experienced markers often notice generic, source-free, error-prone text. Detectors are not perfect and false positives occur, so the safe approach is simple: do your own authentic work and use AI only as a permitted study aid, not a ghost-writer.

About Aadam Mae

Avatar for Aadam MaeAadam Mae, an academic researcher and author with a PhD in NLP (Natural Language Processing) at ResearchProspect. Mae's work delves into the intricacies of language and technology, delivering profound insights in concise prose. Pioneering the future of communication through scholarship.

WhatsApp Live Chat