TOEFL 2026 Reform Explained: New 1-6 Scale

Starting January 2026, TOEFL iBT is getting a major overhaul. Based on the official ETS technical document RR-25-12, we break down the new scoring system (1-6 scale), MST adaptive testing, new question formats, and AI scoring system.

W
WizPrep Academic Team
11 min read
December 11, 2025

📖 Reading time: approximately 8 minutes

What You'll Learn

✅ Key changes in the 2026 TOEFL reform ✅ The new scoring system (1-6 scale) explained ✅ What MST adaptive testing means for you ✅ New question types with real examples ✅ How reliable is the AI scoring system?


Introduction: Insights from Official ETS Documentation

In October 2025, ETS released the TOEFL iBT Technical Manual (RR-25-12), the official technical specification document for the reformed TOEFL. This 68-page document details:

  • Test structure and time allocation
  • Scoring criteria and score interpretation
  • Question types and measured abilities
  • Reliability and validity data

This article presents the most test-prep-relevant information extracted from this official document.


2026 TOEFL Test Structure

The new TOEFL maintains the four traditional sections: Reading, Listening, Writing, and Speaking.

The biggest change is the introduction of MST (Multistage Adaptive Testing) in the Reading and Listening sections.

What is MST Adaptive Testing?

MST divides Reading and Listening into two stages:

StageDescription
Stage 1 (Router Module)All test-takers answer the same medium-difficulty questions
Stage 2 (Upper/Lower Module)Based on Stage 1 performance, the system assigns harder or easier question sets

Key point: Scoring considers both the number of correct answers AND the difficulty level of questions answered. Even if you're routed to easier questions, your score is statistically calibrated for fairness. "Getting easy questions means you can't score high" is a myth.

Questions Per Section

SectionScored QuestionsScore RangeTest Type
Reading35 questions1-6 pointsMST adaptive
Listening35 questions1-6 pointsMST adaptive
Writing12 questions1-6 pointsFixed format
Speaking11 questions1-6 pointsFixed format

Note: Reading (up to 15 questions) and Listening (up to 12 questions) may include unscored tryout questions.


New Scoring System: The 1-6 Scale

Score Range

  • Section scores: 1-6 points (in 0.5 increments, e.g., 3.5, 4.5)
  • Total score: Average of four sections (also 1-6 points)

TOEFL Score to CEFR Mapping

The new TOEFL scores directly correspond to CEFR (Common European Framework of Reference) levels:

CEFR LevelTOEFL ScoreProficiency Description
C26Mastery
C15 - 5.5Effective Operational Proficiency
B24 - 4.5Vantage
B13 - 3.5Threshold
A22 - 2.5Waystage
A11 - 1.5Breakthrough

Application benchmarks:

  • Most US undergraduate programs: B2 or above (4+)
  • Graduate programs: C1 (5+)
  • Elite institutions: 5.5-6 points

New Question Types: A Complete Breakdown

Reading Section

Question TypeSkill MeasuredDescription
Complete the WordsVocabulary + Grammar + SpellingFill in the missing second half of words (C-test format)
Read in Daily LifePractical comprehensionEmails, notices, social media posts (15-150 words)
Read an Academic PassageAcademic comprehension~200-word academic text + 5 questions

🎯 What Do "Complete the Words" Questions Look Like?

This isn't multiple choice. You must write the missing portion of words yourself.

Examples:

  • The experiment required precise mea________ of temperature changes.
  • The new software greatly improved the eff________ of data processing.
  • It is important to dist________ between fact and opinion.
  • He worked tire________ to meet the project deadline.

Answers: measurement, efficiency, distinguish, tirelessly

The challenge: Recognizing a word and spelling it correctly are two different skills. Pay special attention to double consonants (committee, accommodation) and vowel combinations (receive, achieve).

🎯 What's Tested in "Read in Daily Life"?

Questions about emails, notices, and social media posts test your ability to read between the lines (understand implied meaning).

Example: A professor's email

"Thanks for your proposal. I think it has potential, but perhaps we could refine the methodology section before moving forward."

  • Surface meaning: Your proposal looks promising
  • Actual meaning: The methodology has issues and needs revision

Common "diplomatic expressions" in academic settings:

ExpressionLiteral MeaningActual Meaning
"I'll take that into consideration."I'll think about itI probably won't adopt it
"That's an interesting perspective."That's an interesting viewpointI don't quite agree
"We might want to revisit this later."Let's discuss this laterThis proposal is rejected

Listening Section

Question TypeDescription
Listen and Choose a ResponseHear a statement and choose the most appropriate response
Listen to a ConversationComprehension of everyday dialogues
Listen to an AnnouncementUnderstanding campus announcements
Listen to an Academic TalkAcademic content, 175-250 words

Important: Listening materials include four accent varieties: North American, British, Australian, and New Zealand.

🎯 "Listen and Choose a Response" Examples

This question type tests your ability to give socially appropriate responses. You need to understand not just the literal meaning, but also the speaker's tone and emotions.

Example 1:

🎧 A: "I was thinking of grabbing coffee after class. Want to join?"

Options:

  • A. "Sure, I'd love to." ✅
  • B. "I don't really like coffee." ❌
  • C. "When did you start drinking coffee?" ❌
  • D. "Coffee is very popular." ❌

Correct answer: A

While B is grammatically correct, it sounds abrupt in a social context. A more natural way to decline would be: "I'd love to, but I have to finish an assignment."


Example 2:

🎧 A: "I just submitted the group report by myself. No one else showed up."

Options:

  • A. "You should tell the professor right away." ✅
  • B. "That's nice of them to help you out." ❌
  • C. "Maybe they'll show up later." ❌
  • D. "I guess you didn't finish it, then." ❌

Correct answer: A

The speaker is frustrated/annoyed ("No one else showed up"). The correct response should show empathy or offer advice. B, C, and D ignore the emotion and are illogical.


Example 3:

🎧 A: "I thought the exam was next week, but apparently it's tomorrow morning!"

Options:

  • A. "That explains why you look so relaxed." ❌
  • B. "Well, good luck pulling an all-nighter." ✅
  • C. "Then you must have studied hard already." ❌
  • D. "Why didn't you tell me that earlier?" ❌

Correct answer: B

The speaker is clearly panicking. A is sarcasm (illogical in context), C contradicts reality (they're not prepared), D blames the other person. B shows empathy with light humor—natural in everyday English conversation.


Writing Section

Question TypeCountDescription
Build a Sentence10Reorder scrambled words/phrases
Write an Email1Write an email based on a given scenario
Write for an Academic Discussion1Class discussion format, minimum 100 words

🎯 How to Write an "Appropriate" Email

Scoring criteria emphasize social conventions: politeness, register, and information organization.

❌ Poor example:

"Dear Professor Smith, I have a family emergency. I hope you understand. Thank you."

Issues: Too brief, no situation explanation, no specific request, somewhat impolite tone.

✅ Good example:

Dear Professor Smith,

I'm writing to request permission to miss class next Tuesday, October 15th.

I have a family emergency that requires me to travel home that day. I understand we'll be covering the midterm review, and there's also a group presentation scheduled.

Would it be possible to:

  • Get the review materials after class?
  • Reschedule my group's presentation to the following week?

I've already coordinated with my group members, and they're flexible with the date.

I apologize for any inconvenience and really appreciate your understanding.

Best regards, [Your name]

Key points: State purpose upfront → Explain the specific situation → Make clear requests → Close politely


Speaking Section

Question TypeCountDescription
Listen and Repeat7Repeat sentences immediately after hearing them (progressive difficulty)
Take an Interview4Answer questions about experiences and opinions

🎯 Why is "Listen and Repeat" Challenging?

According to the official document, this question type measures language processing ability and pronunciation clarity. Sentences get progressively longer, potentially exceeding 10 words in later items.

Example:

"The professor postponed the exam due to unforeseen circumstances."

Common mistakes:

"The professor... uh... postpone exam... because... unexpected things."

Issues: Missing articles (the), tense errors (postpone → postponed), imprecise vocabulary (unforeseen circumstances → unexpected things)

Tip: Don't memorize word by word—remember meaning chunks

  • The professor / postponed the exam / due to unforeseen circumstances
  • Who / did what / why

AI Scoring: How Reliable Is It?

The new TOEFL uses a hybrid AI + human scoring system for Writing and Speaking.

AI Scoring Accuracy Data

SectionHuman-AI CorrelationHuman-Human Correlation
Writing0.860.85
Speaking0.890.96

Interpretation: The agreement between AI and human scoring is nearly equal to—or higher than—the agreement between human raters themselves.

Quality Assurance

  • If AI confidence is low, human raters automatically take over
  • Regular sampling reviewed by certified raters
  • Continuous monitoring of scoring consistency

Reliability Data

SectionReliability CoefficientStandard Error of Measurement (SEM)
Reading0.860.37
Listening0.880.35
Writing0.870.36
Speaking0.940.22
Total0.900.32

Speaking has the highest reliability (0.94), and the total score reliability of 0.90 meets industry standards for high-stakes testing.


Preparation Tips

The following tips are based on analysis of the RR-25-12 document and do not represent official ETS positions.

For MST Adaptive Testing

Your Stage 1 performance determines Stage 2 difficulty. While scoring is calibrated, a stable start helps you enter a question set that matches your level, which is psychologically advantageous.

Tip: Stay calm at the start of the exam. The Router module is medium difficulty, so answer carefully without rushing.

For Complete the Words (C-test)

This tests not just vocabulary, but spelling accuracy and contextual inference ability.

Tips:

  • When studying vocabulary, always practice writing the words
  • Use dictation exercises to strengthen spelling
  • Pay attention to prefix and suffix spelling patterns

For Listen and Repeat

Tips:

  • Practice shadowing (repeating almost simultaneously with audio) daily
  • Start with short sentences and gradually increase length
  • Pay attention to linking and reduction phenomena

For Email Writing

Tips:

  • Learn standard English email expressions
  • Understand the difference between formal and informal registers
  • Practice different scenarios: requests, suggestions, apologies

For Accent Variety

Tips:

  • BBC (British English)
  • ABC Australia (Australian English)
  • TED Talks (various accents)

FAQ

Q: Is the new TOEFL still scored out of 120?

No. The new TOEFL uses a 1-6 scale. Both section scores and total scores range from 1-6 (in 0.5 increments).

Q: Is MST adaptive testing unfair?

No. ETS uses IRT (Item Response Theory) to statistically calibrate scores for test-takers who followed different difficulty paths, ensuring fairness.

Q: Can I still use my old TOEFL scores?

Check with your target institution. Since the scoring system is completely different, universities may need time to adjust their requirements.

Q: Is AI scoring reliable?

According to official data, agreement between AI and human scoring is high (Writing 0.86, Speaking 0.89), and there's a human review system in place.


Summary

Key changes in the 2026 TOEFL reform:

  1. Scoring system: 0-120 → 1-6 scale (aligned with CEFR)
  2. Testing method: MST adaptive testing for Reading and Listening
  3. Question types: New formats including Complete the Words and Listen and Repeat
  4. Scoring technology: Hybrid AI + human scoring

The RR-25-12 document is currently the most authoritative technical document on the new TOEFL. For more details, consult the original text on the ETS Research website.

Questions? Leave them in the comments.


— WizPrep Academic Team


Tags: TOEFL TOEFL reform TOEFL 2026 new TOEFL TOEFL preparation TOEFL test TOEFL score TOEFL question types MST adaptive testing CEFR English test study abroad study in USA TOEFL iBT AI scoring test prep


Related Articles:

  • 2026 TOEFL Test Dates
  • TOEFL vs IELTS Score Conversion
  • TOEFL Requirements for Top 50 US Universities
TOEFLtest reformpreparation guidestudy abroad
W

WizPrep Academic Team

WizPrep professional academic team, dedicated to providing the latest and most authoritative TOEFL preparation resources.

Ready to Get Started?

Start using WizPrep AI intelligent scoring system to boost your TOEFL score

TOEFL 2026 Reform Explained: New 1-6 Scale | WizPrep AI Blog