Psychological Measurement and Testing

What the WAIS-IV Measures: Subtests Explained

what does the WAIS-IV measure?
Published: April 2, 2010 · Last reviewed:
📖1,925 words⏱8 min read📚5 references cited
The question in the title was first posed in print by Benson, Hulac, and Kranzler (2010), who ran the earliest independent factor analysis of the Wechsler Adult Intelligence Scale-Fourth Edition against Cattell-Horn-Carroll (CHC) theory. Sixteen years and a published successor later, the answer has matured into something the original publisher’s manual did not say: the WAIS-IV measures fewer cleanly separable broad abilities than CHC nomenclature suggests, and a single general factor accounts for most of the systematic variance among its subtests. The same pattern has now reappeared in the WAIS-5.

What Benson, Hulac, and Kranzler actually found

Working from the published WAIS-IV standardization correlation matrices, Benson and colleagues fit a series of confirmatory factor models that mapped the ten core and five supplemental subtests onto CHC broad abilities. They concluded that the WAIS-IV does deliver markers of crystallized ability (Gc), visual processing (Gv), short-term memory (Gsm), and processing speed (Gs), but that fluid reasoning (Gf) and quantitative reasoning (Gq) were not cleanly separable as distinct factors with the available subtests; they recommended treating Arithmetic and Figure Weights as joint Gf/Gq markers and offered formulas for CHC-aligned composite scores.

The more striking part of the 2010 result was negative: configural and metric invariance held across age, but stronger forms of measurement invariance did not. Factor loadings shifted systematically across the 16-69 age bands of the standardization sample, meaning that “the WAIS-IV does not measure the same constructs in the same way” across the adult life span (Benson et al., 2010). For practitioners, that finding undercuts a routine assumption: that an older client’s index scores can be interpreted on the same metric as a younger client’s.

The four-vs-five-factor debate

Pearson’s response came from Weiss, Keith, Zhu, and Chen (2013), who used the full 15-subtest standardization sample (N = 1,800) plus a clinical sample (N = 411) to compare the published four-factor model (Verbal Comprehension, Perceptual Reasoning, Working Memory, Processing Speed) against a five-factor CHC-aligned alternative that splits the Perceptual Reasoning Index into a visual processing factor (Gv: Block Design, Visual Puzzles, Picture Completion) and a fluid reasoning factor (Gf: Matrix Reasoning, Arithmetic, Figure Weights). Both models fit acceptably; the five-factor solution fit slightly better and was endorsed for CHC interpretation when supplemental subtests are administered.

This was the first official acknowledgment from the test’s publisher that the printed four-factor structure was a practical compromise rather than the best representation of the data. Independent reanalyses by Canivez and Kush (2013) flagged that the Weiss et al. comparison did not fully address bifactor alternatives in which a strong general factor absorbs most subtest variance, leaving the group factors with little unique interpretive weight—a critique that proved prophetic for the WAIS-5 a decade later.

Extending CHC interpretation to the elderly

The Benson et al. cross-age problem was substantially resolved by Niileksela, Reynolds, and Kaufman (2013), who restricted the analysis to the subtest battery actually administered to ages 70-90 (which omits Picture Completion, Letter-Number Sequencing, Cancellation, and Figure Weights). Within that reduced set, an alternative five-factor CHC model—Gc, Gv, Gf, Gsm, Gs—achieved partial strict measurement invariance across the full 16-90 age range, with only the Similarities subtest showing noninvariance. The takeaway for clinicians: when the WAIS-IV is interpreted using only the subtests common to all ages, CHC index scores are comparable across the adult lifespan; when the full 15-subtest battery is used in younger adults, that comparability is partially lost.

International replications (van Aken et al., 2017, on the Dutch WAIS-IV, N = 1,668) reproduced the five-factor CHC structure but again found that adding more group factors did not substantially reduce the dominance of the general factor in shared variance.

The WAIS-5 keeps the same question alive

When Pearson released the WAIS-5 in 2024, the index structure was revised explicitly to align with CHC theory: the old Perceptual Reasoning Index was split into a Visual Spatial Index (VSI) and a Fluid Reasoning Index (FRI), yielding five primary indices (VCI, VSI, FRI, WMI, PSI). The revision codified what Weiss et al. had argued for the WAIS-IV.

Independent factor analysis by Canivez, Watkins, McGill, and Dombrowski (2026), using the WAIS-5 standardization correlation matrix (N = 2,020) and 20 primary plus secondary subtests, did not vindicate the five-factor split. Their EFA with Schmid-Leiman extraction did not support separate Visual Spatial and Fluid Reasoning factors; a four-factor solution that re-merged VSI and FRI into a single perceptual factor—structurally similar to the WAIS-IV PRI—provided the better empirical fit. CFA results converged on a bifactor model with a dominant general factor and four group factors, with most reliable variance attributable to g rather than to the group factors. The authors note that the WAIS-5 Technical and Interpretive Manual did not report bifactor models, model-based reliability (omega hierarchical), or dimensionality estimates—an evidential gap their independent analysis was designed to fill.

Subtest-level CHC mapping

The CHC interpretive frameworks built by Benson et al. (2010), Weiss et al. (2013), and Niileksela et al. (2013) converge on the following primary and secondary CHC ability assignments for the WAIS-IV core and supplemental subtests. Single-factor purity is rare; most subtests load on a primary CHC factor with non-trivial secondary loadings reflecting the auxiliary cognitive demands of the task.

  • Similarities, Vocabulary, Information, Comprehension → primary marker of Gc (crystallized knowledge); Vocabulary additionally loads on language development (VL) as a narrow stratum.
  • Block Design, Visual Puzzles, Picture Completion → primary marker of Gv (visual processing); Block Design carries secondary Gf variance from the spatial reasoning demands of construction problems.
  • Matrix Reasoning, Figure Weights → primary marker of Gf (fluid reasoning); Figure Weights additionally taps quantitative reasoning (RQ).
  • Arithmetic → joint Gf/Gq/Gsm loadings; the Benson et al. analysis treats it as the WAIS-IV’s primary quantitative knowledge marker but its working memory demands inflate Gsm loadings.
  • Digit Span, Letter-Number Sequencing → primary marker of Gsm (short-term memory); the backward and sequencing components extend into working memory capacity (WM).
  • Coding, Symbol Search, Cancellation → primary marker of Gs (processing speed); Cancellation additionally carries visual scanning (P) and selective attention demands.

The mapping clarifies why the WAIS-IV’s named indices do not translate one-to-one into CHC abilities: the Working Memory Index combines a Gsm task (Digit Span) with a Gf/Gq/Gsm composite (Arithmetic), and the Perceptual Reasoning Index combines two distinct CHC abilities (Gv and Gf) into a single score. The WAIS-5 corrected the second of these by separating VSI from FRI, but the first—an index labeled Working Memory that is partly a Gf measure—persists.

What the WAIS-IV (and WAIS-5) actually measures

Synthesizing fifteen years of independent factor work, three claims are now supportable:

  • The general factor is doing most of the work. Whether the model has four or five group factors, bifactor decompositions consistently show that g accounts for the bulk of common variance among Wechsler subtests; group factors retain limited unique reliable variance after g is partialled.
  • The CHC labels on the indices are reasonable but not pure. VCI maps reasonably well to Gc; PSI to Gs; WMI to Gsm; the WAIS-IV PRI / WAIS-5 VSI+FRI carries Gv and Gf variance that is harder to disentangle than the index names imply.
  • Cross-age comparability requires the elderly-eligible subtest set. Niileksela et al. (2013) gave practitioners a defensible CHC interpretive scheme that survives the lifespan; full-battery interpretations in younger adults trade that comparability for finer ability differentiation.

Practical interpretive guidance

For clinical and educational reports, the contemporary literature implies a hierarchy of confidence. The Full Scale IQ remains the most robustly supported number—it is the proxy for the general factor that bifactor analyses recover as dominant. Index scores are interpretable but their incremental information beyond FSIQ is modest, especially for VSI and FRI considered separately on the WAIS-5. Subtest profile interpretation—the longstanding clinical tradition of comparing Block Design to Matrix Reasoning, or Arithmetic to Digit Span—has the weakest empirical foundation and should be undertaken only when corroborated by external evidence.

The original Benson et al. (2010) paper is now seventeen years old and addresses a discontinued instrument, but the question its title posed has outlasted the test. The WAIS-5 inherits the same answer: a strong g, four broad ability domains that are reliably measured, and a fifth (the VSI/FRI split) that the publisher’s manual asserts and independent factor analysis still does not support.

Frequently asked questions

What does the WAIS-IV measure?

The WAIS-IV measures four published indices—Verbal Comprehension, Perceptual Reasoning, Working Memory, and Processing Speed—plus a Full Scale IQ that aggregates them. Independent factor analysis maps these onto five Cattell-Horn-Carroll broad abilities: crystallized knowledge (Gc), visual processing (Gv), fluid reasoning (Gf), short-term memory (Gsm), and processing speed (Gs). A general intelligence factor (g) accounts for the bulk of common variance among subtests in bifactor decompositions.

Is the WAIS-IV still used in 2026?

Yes, but it has been formally superseded by the WAIS-5 (Pearson, 2024). Many clinicians and researchers continue to use the WAIS-IV during the multi-year transition period, partly because of accumulated normative data and partly because comparison with prior WAIS-IV results is often clinically useful.

What is the difference between the four-factor and five-factor models of the WAIS-IV?

The four-factor model is the published index structure (VCI, PRI, WMI, PSI). The five-factor model splits the Perceptual Reasoning Index into a visual-processing factor (Block Design, Visual Puzzles, Picture Completion) and a fluid-reasoning factor (Matrix Reasoning, Arithmetic, Figure Weights). Weiss et al. (2013) found the five-factor model fits slightly better when all 15 subtests are administered.

How well does Full Scale IQ on the WAIS-IV measure general intelligence?

Bifactor analyses consistently show that the general factor accounts for the majority of reliable variance among WAIS-IV subtests. The Full Scale IQ is the most defensible single score from the test for representing general cognitive ability; index-level differentiation beyond FSIQ adds modest incremental information.

Did the WAIS-5 fix the structural issues identified in the WAIS-IV?

The WAIS-5 was redesigned with five primary indices (VCI, VSI, FRI, WMI, PSI) explicitly aligned with CHC theory. Independent factor analysis of the WAIS-5 standardization data by Canivez and colleagues (2026), however, did not support the separate VSI and FRI factors—a four-factor structure resembling the WAIS-IV PRI was the better empirical fit. The general factor remains dominant in either model.

What does it mean when a WAIS-IV score is interpreted across age groups?

Benson, Hulac, and Kranzler (2010) showed that strict measurement invariance does not hold across the WAIS-IV’s adult age range when all 15 subtests are used. Niileksela, Reynolds, and Kaufman (2013) showed that partial strict invariance does hold across the lifespan when only the subtests common to the elderly battery are used. Cross-age comparisons are most defensible using the elderly-eligible subtest set.

References

  • Benson, N., Hulac, D. M., & Kranzler, J. H. (2010). Independent examination of the Wechsler Adult Intelligence Scale-Fourth Edition (WAIS-IV): What does the WAIS-IV measure? Psychological Assessment, 22(1), 121-130. https://doi.org/10.1037/a0017767
  • Canivez, G. L., Watkins, M. W., McGill, R. J., & Dombrowski, S. C. (2026). Construct validity of the WAIS-5: Complementary exploratory and confirmatory factor analyses of the 20 primary and secondary subtests. Assessment. Advance online publication. https://doi.org/10.1177/10731911251412219
  • Niileksela, C. R., Reynolds, M. R., & Kaufman, A. S. (2013). An alternative Cattell-Horn-Carroll (CHC) factor structure of the WAIS-IV: Age invariance of an alternative model for ages 70-90. Psychological Assessment, 25(2), 391-404. https://doi.org/10.1037/a0031175
  • van Aken, L., van der Heijden, P. T., van der Veld, W. M., Hermans, L., Kessels, R. P. C., & Egger, J. I. M. (2017). Representation of the Cattell-Horn-Carroll theory of cognitive abilities in the factor structure of the Dutch-language version of the WAIS-IV. Assessment, 24(4), 458-466. https://doi.org/10.1177/1073191115607973
  • Weiss, L. G., Keith, T. Z., Zhu, J., & Chen, H. (2013). WAIS-IV and clinical validation of the four- and five-factor interpretative approaches. Journal of Psychoeducational Assessment, 31(2), 94-113. https://doi.org/10.1177/0734282913478030

Related Research

Intelligence Research and Cognitive Abilities

The G Factor: What General Intelligence Means

The g factor — Charles Spearman's name for the common variance that runs through all cognitive tests — is the most replicated and the most…

Apr 10, 2026
Psychometric Testing and IQ Assessment

Raven's Progressive Matrices: Culture-Fair IQ Test

Among the hundreds of cognitive tests developed over the past century, few have achieved the global reach of Raven's Progressive Matrices. Administered in settings from…

Mar 19, 2026
Psychological Measurement and Testing

WAIS-IV vs. WAIS-V: What Changed

Pearson released the Wechsler Adult Intelligence Scale, Fifth Edition (WAIS-5) in late 2024 — the first major revision since the WAIS-IV appeared in 2008. For…

Aug 7, 2025
Educational Psychology and Interventions

Growth Mindset: What the Meta-Analyses Show

Few ideas in education have spread as widely as Carol Dweck's growth mindset — the proposal that believing intelligence is malleable rather than fixed produces…

Jul 23, 2025
Cognitive Abilities and Intelligence

Fluid vs. Crystallized Intelligence

Intelligence is not a single ability. One of the most important distinctions in cognitive science — and one that affects everything from how IQ tests…

Jul 3, 2025

People Also Ask

What are decoding high intelligence: interdisciplinary insights?

Research into high intelligence provides valuable insights into human cognitive abilities and their impact on individual and societal progress. By exploring the historical development of intelligence studies, the challenges of measuring exceptional cognitive abilities, and recent advancements in neuroscience and psychometrics, this article highlights the ongoing importance of understanding high-IQ individuals.

Read more →
What are the complex journey of the wais: insights and transformations?

The Wechsler Adult Intelligence Scale (WAIS), developed in 1955 by David Wechsler, introduced a broader and more dynamic approach to assessing cognitive abilities. Over the years, it has been refined through several editions, becoming one of the most widely used tools in psychological and neurocognitive evaluations. This post reviews its historical development, structure, and contributions to cognitive science.

Read more →
What are tracing the sat's intellectual legacy and its ties to iq?

The Scholastic Assessment Test (SAT) has been a central element of academic assessment in the United States for nearly a century. Initially designed to provide an equitable way to evaluate academic potential, its evolution reflects shifts in societal values, educational theories, and cognitive research. This post examines the SAT’s historical roots, its relationship with intelligence testing, and its continued impact on education.

Read more →
What is assessing verbal intelligence with the iaw test?

The I Am a Word (IAW) test represents a distinct method for assessing verbal abilities, offering an open-ended and untimed format designed to accommodate a diverse range of examinees. This approach promotes genuine responses while fostering inclusivity and engagement in testing environments.

Read more →
Why is background important?

The WAIS-IV, a widely used tool for measuring adult intelligence, has undergone several revisions to improve its theoretical and practical utility. This study focuses on whether its structure accurately reflects CHC theory—a prominent framework for understanding cognitive abilities—and whether the test produces consistent results across ages. Understanding these aspects is essential for practitioners who rely on the WAIS-IV in clinical and educational settings.

How does key insights work in practice?

Alignment with CHC Theory: The WAIS-IV assesses several broad CHC abilities, including crystallized ability (Gc), fluid reasoning (Gf), visual processing (Gv), short-term memory (Gsm), and processing speed (Gs). It also provides a measure of quantitative reasoning (QR), though some abilities are more comprehensively covered than others. Cross-Age Invariance: Results indicate a

📋 Cite This Article

Jouve, X. (2010, April 2). What the WAIS-IV Measures: Subtests Explained. PsychoLogic. https://www.psychologic.online/wais-iv-measures/

Leave a Reply