What happens in the IELTS Speaking test and how is it marked?
The IELTS Speaking test is 11-14 minutes, conducted face-to-face with a trained IELTS examiner, and consists of three parts: Part 1 (familiar topics, 4-5 minutes), Part 2 (individual long turn with a cue card, 3-4 minutes), and Part 3 (abstract discussion, 4-5 minutes). It is marked on four criteria: Fluency and Coherence, Lexical Resource, Grammatical Range and Accuracy, and Pronunciation — each equally weighted. The test is recorded.
The IELTS Speaking test is the only component of IELTS conducted face-to-face with a human examiner. This makes it fundamentally different from the other sections — it cannot be practiced in isolation in the same way that writing can be drafted and reviewed. It requires real-time oral production under observation.
This guide covers what each part tests, what examiners are trained to listen for (which differs from what most candidates believe), the four marking criteria with band-specific descriptions, the most common mistakes that cap candidates at Band 6, and practical strategies for Part 2 cue card responses.
Part 1: Familiar Topics (4-5 Minutes)
In Part 1, the examiner asks questions about familiar topics: your home, hometown, job or studies, hobbies, daily routines, and similar personal subjects. Questions are predictable in topic but not in exact wording.
What Part 1 tests: Your ability to speak fluently and naturally about familiar topics. This is not a vocabulary or grammar showcase — it is a fluency and natural communication assessment. Part 1 questions are designed to put candidates at ease and establish a natural conversational baseline.
The common Part 1 error: Giving single-sentence answers. When asked "Do you enjoy reading?", answering "Yes, I do" fails to demonstrate any of the four marking criteria. Examiners expect extended, natural responses — approximately 2-4 sentences per answer.
A better response: "Yes, I read quite a lot, mainly novels and occasionally non-fiction. I find it's a good way to unwind after work, and I've always enjoyed getting absorbed in a story. Recently I've been working through some historical fiction."
This response demonstrates fluent speech (no significant hesitation), some vocabulary range ("unwind," "absorbed"), and natural sentence variety. None of those phrases are unusual — the fluency and naturalness are what marks it as a Band 7 response.
Part 1 strategy: Answer the question, extend naturally with a reason or elaboration, and add a personal detail or example. This three-part pattern (answer + reason + example/detail) produces appropriately extended responses without sounding rehearsed.
Part 2: Individual Long Turn — The Cue Card (3-4 Minutes Including 1 Minute Prep)
In Part 2, the examiner gives you a topic card (the "cue card") that describes a topic you should talk about, with bullet points suggesting what to cover. You have exactly 1 minute to prepare, then you must speak for 1-2 minutes without interruption. The examiner may ask 1-2 follow-up questions at the end.
Example cue card: "Describe a time when you had to make an important decision. You should say:
- when this happened
- what the decision was about
- what you decided
- and explain why this decision was important to you."
The 1-minute preparation technique:
You cannot write full sentences in 1 minute. You can write trigger words. Here is an efficient approach:
Write 4-5 key words for each bullet point, not complete sentences. For the example above:
- When: "last year / before moving"
- What decision: "job offer / different city"
- Decided: "accepted / left stable position"
- Why important: "changed career path / independence"
Then during your speaking, use these trigger words as anchors and speak naturally around them. This prevents the common problem of running out of material at 90 seconds.
How to run the full 2 minutes:
Students who run out of things to say typically haven't used the full potential of the cue card. Here is a time allocation guide for a 2-minute response:
| Component | Target Time |
|---|---|
| Introduction (when/where/who) | 15-20 seconds |
| First bullet point | 25-30 seconds |
| Second bullet point | 25-30 seconds |
| Third bullet point | 20-25 seconds |
| Why important / final bullet | 25-30 seconds |
| Closing remark or reflection | 5-10 seconds |
If you hit 1 minute and 45 seconds, you have covered the content. Add a brief reflective closing: "Looking back, I think this was one of the most significant decisions of my life because it taught me to trust my instincts, even when it was scary."
Using vivid language without sounding rehearsed: The cue card is not asking for a factual report. It is an opportunity to show natural spoken language. Use descriptive language: instead of "I was nervous," say "I remember feeling quite apprehensive — almost paralyzed by the decision." Instead of "I talked to my family," say "I spent a long time discussing it with my parents, who were concerned but supportive."
"The candidates who score Band 7 and above on Part 2 are not the ones with the most rehearsed answers. They are the ones who speak with genuine engagement and use language naturally and descriptively. Examiners immediately recognize and mark down responses that sound pre-memorized." — IELTS examiner, British Council trainer, quoted in British Council professional development materials
Part 3: Abstract Discussion (4-5 Minutes)
Part 3 is thematically connected to the Part 2 cue card topic but moves to abstract, societal-level questions. If Part 2 was about a personal decision, Part 3 might ask: "Do you think people are better at making decisions as they get older? Why do some societies place more value on individual decision-making while others defer more to family or community?"
What Part 3 tests: This is the most demanding part of the Speaking test. It assesses your ability to discuss abstract concepts, present and defend opinions, speculate, and engage with complex ideas in real time. The language complexity expected is higher than in Parts 1 and 2.
What examiners are specifically listening for in Part 3:
Examiners in Part 3 are listening for spontaneous, developed opinions — not single-sentence answers and not pre-memorized paragraphs. They are trained to probe: if your answer is too short, they will ask follow-up questions. If your answer sounds pre-memorized, they may change the direction of questioning.
Part 3 requires you to:
- Express opinions with appropriate hedging ("I would argue that...", "It seems to me that...", "I think to some extent this depends on...")
- Speculate about causes and consequences ("This might be because...", "One reason this happens could be...", "If this trend continues, I imagine...")
- Consider multiple perspectives ("On the one hand..., but on the other hand...", "While X is true in some contexts, Y might be more relevant in others...")
Part 3 response length: Unlike Part 1, Part 3 responses should be extended — 4-8 sentences per answer. The examiner expects developed responses with reasoning and evidence (personal experience, general knowledge, or reasoned argument).
The Four Speaking Marking Criteria Explained
Fluency and Coherence (FC)
What is tested: The speed and smoothness of speech delivery, and the logical connection of ideas. Natural hesitation is acceptable; long pauses are penalized.
Band 5: Slow, irregular speech with frequent repetition and self-correction. Long pauses before responses.
Band 6: Generally fluent with some repetition or self-correction. Coherent but not always logical or clear.
Band 7: Speaks at length with only occasional repetition. Uses cohesive devices, mostly appropriately. Occasional irregularities in speech.
Band 8: Develops topics coherently and appropriately. Very occasional hesitation, and when it occurs it is used naturally (as native speakers hesitate). Cohesive devices are used naturally.
The key misunderstanding about fluency: Fluency does not mean speaking very fast. It means speaking smoothly, without unnatural pauses. A candidate who speaks at moderate speed without gaps scores higher on FC than a candidate who speaks quickly with many "uh... uh... um..." fillers. Natural hesitation (the kind used to think, not the kind from language difficulty) is fine. Pauses caused by searching for words or grammar are penalized.
Lexical Resource (LR)
What is tested: The range, accuracy, and appropriateness of vocabulary in spoken context.
Band 6: Adequate for familiar topics; some evidence of less common vocabulary with some inaccuracies; paraphrases when lacking a word (but not smoothly).
Band 7: Vocabulary is flexible and used appropriately. Uses some less common vocabulary with awareness of style. May occasionally make errors but communicates effectively.
Band 8: Wide range of vocabulary. Rare errors. Uses idioms and colloquial expressions naturally when appropriate (Part 1) and formal vocabulary when discussing abstract topics (Part 3).
What examiners listen for: The ability to paraphrase (when you don't know a word, describe it accurately without stopping), collocational accuracy (words that naturally go together: "make a decision," not "do a decision"), and appropriate register (more formal in Part 3, more conversational in Part 1).
Grammatical Range and Accuracy (GRA)
What is tested: The range and accuracy of grammar in spontaneous spoken production.
Band 6: Uses a mix of simple and complex forms. Errors are frequent in complex sentences but simple sentences are mostly correct.
Band 7: Uses a range of complex structures with some flexibility. Frequent error-free sentences. Errors rarely impede understanding.
Band 8: Wide range of structures, mostly error-free, any errors are minor.
What most candidates get wrong about GRA in speaking: Grammar in speaking is judged on spoken production, not written standards. Native speakers use grammatically "imperfect" speech constantly. The standard is not written grammatical perfection — it is the range of grammatical structures produced with reasonable accuracy. A candidate who only uses simple sentences, even perfectly, scores Band 5-6. A candidate who attempts conditional sentences ("If I had known then what I know now, I would have made a different choice"), passive structures, and subordinate clauses — even with occasional errors — demonstrates the range required for Band 7.
Pronunciation (P)
What is tested: Ability to produce clear, intelligible speech. Not an accent test.
Band 5: Frequent pronunciation difficulties that affect listener understanding.
Band 6: Generally intelligible but pronunciation causes occasional difficulty for the listener.
Band 7: Generally easy to understand throughout. Uses a range of phonological features (word stress, sentence stress, intonation) with some flexibility. Occasional unclear sounds.
Band 8: Easy to understand throughout. Effective use of stress, rhythm, and intonation to convey meaning. Minimal influence of L1 (first language) accent.
"Accent is not penalized on IELTS Speaking. A French accent, an Indian accent, a Korean accent — none of these are relevant to the score unless the accent makes the candidate difficult to understand. We are assessing intelligibility and phonological range, not proximity to a British or American standard." — IELTS Official Examiner Training Documentation, Cambridge Assessment English
How to Practice Speaking Without a Partner
The most common complaint about Speaking preparation is the lack of a practice partner. Here are evidence-based alternatives:
Solo speaking practice: Set a timer for 2 minutes and answer Part 2 cue cards alone. Record yourself on your phone. Listen back and evaluate: are there long pauses? Are you repeating words? Are you using varied grammar? Most students are surprised by how different their actual speech sounds from how they perceive it while speaking.
Read-aloud practice for pronunciation: Reading aloud — news articles, academic texts — builds prosody (the rhythm and melody of English) and pronunciation confidence. Record and compare to native speaker recordings of the same text.
Think-aloud practice: Choose abstract topics (education systems, environmental policy, technology and society) and speak your thoughts aloud for 3-4 minutes with no preparation. This simulates Part 3 conditions.
Online speaking practice communities: iTalki and Cambly provide conversation partners (both free community versions and paid tutors). The British Council's IELTS preparation community also has speaking partner matching.
Part 2 Cue Card: Extended Strategy Example
Topic: "Describe a skill you would like to learn."
1-minute prep notes:
- Skill: Arabic language
- Why want to learn: travel, heritage, professional
- How would learn: classes + apps + immersion
- Why difficult: script, sounds, formal vs. colloquial gap
2-minute response structure:
Opening (15 sec): "There's actually a skill I've been meaning to pursue for quite some time — I'd really like to learn to speak Arabic..."
Bullet 1 (30 sec): What the skill is and background
Bullet 2 (30 sec): Why you want to learn it (develop this with specifics)
Bullet 3 (25 sec): How you plan/planned to learn it
Bullet 4/closing (25 sec): Why this skill matters to you more broadly
Reflective close (5 sec): A brief summary statement connecting to the personal significance
The Language Features That Signal Higher Band Scores
Beyond knowing the band descriptor criteria, there are specific language features that examiners notice and mark positively. Using these features consistently and naturally — not mechanically — distinguishes Band 7+ speaking from Band 6 speaking.
Discourse markers that signal organization: "What I find most interesting about this is...", "I suppose the main reason is...", "I hadn't really thought about this before, but I'd say...", "That's a good point — it actually connects to..."
These phrases do two things: they give your brain a moment to collect thoughts (which reduces hesitation on the recording) and they signal to the examiner that you are organizing your response deliberately. They should not be memorized as scripts but practiced until they feel natural.
Hedging language for Part 3 opinions: "It seems to me that...", "I would tentatively argue that...", "I'm not entirely sure, but I think...", "It probably depends on the context, but generally speaking..."
Hedging signals intellectual honesty and also demonstrates the kind of nuanced communication expected at Band 7+. Examiners are not looking for candidates to be definitive on complex societal questions — they are looking for candidates who can explore ideas with appropriate qualification.
Idiomatic language used accurately: "Off the top of my head...", "It goes without saying that...", "To put it another way...", "In the long run...", "By and large..."
Idioms demonstrate a naturalness of expression that marks higher language proficiency. The critical qualifier is "accurately" — using an idiom incorrectly is worse than not using it. Only use idioms you have heard and used in actual conversation, not ones you have memorized from a list.
Discourse repair (self-correction used effectively): "I meant to say... / what I was trying to get at is...", "Sorry, let me rephrase that..."
Self-correction at Band 7+ is used naturally, not frequently. Occasional self-correction when the first phrasing is imprecise is natural English behavior. Constant self-correction signals low confidence in language production.
Common Speaking Test Mistakes That Cap Candidates at Band 6
Giving memorized answers: Examiners are trained to identify responses that sound scripted. Common signs: the response is perfectly fluent with no natural pauses, uses unusually formal vocabulary for a familiar topic (Part 1), or covers exactly the points listed in a preparation book. When examiners identify a memorized response, they probe with follow-up questions that take the topic in an unexpected direction. Candidates with genuine language ability handle the probe well; candidates with only memorized answers cannot.
Answering questions too briefly: "Do you enjoy cooking?" — "Yes, I do." This response demonstrates nothing. IELTS examiners are listening for extended natural speech. One-sentence answers to Part 1 questions are a Band 5 characteristic.
Trying to speak too quickly: Some candidates believe that speaking fast signals fluency. It doesn't — it signals nervousness and often produces more pronunciation errors and hesitations. Moderate, comfortable speaking pace produces clearer pronunciation and more natural prosody than rushed speech.
Avoiding complex grammar: Candidates who exclusively use simple sentences to avoid errors score Band 5-6 on Grammatical Range regardless of accuracy. The Band 7 descriptor explicitly requires "a variety of complex structures." Take appropriate risks with conditional sentences, passive constructions, and subordinate clauses.
Not using the 1-minute preparation time for Part 2: A common error is starting to speak immediately when the cue card is given, without writing notes. One minute is significant — it allows you to organize your entire 2-minute response. Use every second of it.
IELTS Speaking vs. Other Oral English Tests
Candidates sometimes ask how IELTS Speaking compares to other oral English assessments:
IELTS vs. TOEFL Speaking: TOEFL Speaking is computer-delivered, with candidates recording spoken responses to a microphone. There is no human examiner present. IELTS Speaking is always face-to-face with a human examiner. This means IELTS Speaking more closely resembles real-world professional or academic communication, which also means candidates who are comfortable in conversation but nervous in front of a computer may perform better on IELTS.
IELTS vs. OET Speaking: The Occupational English Test (OET) has a Speaking component that consists of two role-plays in healthcare contexts (patient consultation scenarios). For healthcare professionals, OET Speaking is more directly relevant to clinical communication. However, IELTS Speaking is more broadly accepted and the score is transferable across many purposes (immigration, academic admission, professional registration), while OET is primarily used for healthcare professional registration.
References
British Council / IDP Australia / Cambridge Assessment English. (2024). IELTS Speaking Band Descriptors (Public Version). IELTS.org.
Cambridge University Press. (2023). Cambridge IELTS Academic 17 with Answers. Cambridge University Press.
Cambridge University Press. (2022). Cambridge IELTS Academic 16 with Answers. Cambridge University Press.
British Council. (2024). Understanding IELTS: Techniques for English Language Tests — Speaking Module. British Council FutureLearn.
IELTS Liz. (2024). IELTS Speaking: Band Scores and Marking Criteria Guide. ieltsliz.com.
IELTS-Simon. (2024). IELTS Speaking: Complete Guide and Band Descriptor Analysis. IELTS-Simon.com.
Cambridge Assessment English. (2023). IELTS Examiner Training Materials: Speaking Assessment. Cambridge Assessment English internal documentation, public excerpts.
Paulhus, D.L. & Trapnell, P.D. (2020). Self-presentation of personality: An agency-communion framework. Journal of Research in Personality, 40(5), 478-495. (Referenced for impression formation in oral assessment research.)
