Most cognitive ability tests in widespread use are either verbal-heavy (vocabulary, comprehension, knowledge) or rely on a fixed sequence of items that takes everyone a similar amount of time. The Jouve Cerebrals Figurative Sequences (JCFS), developed by Xavier Jouve, takes a different approach on both fronts. It is purely nonverbal — every item is a 3×3 grid of figural elements that follow abstract transformation rules — and it administers items adaptively, drawing from a bank of 50 tasks but typically reaching a precise ability estimate after only 6–16 items. The two design choices have specific psychometric consequences worth understanding for anyone choosing among nonverbal cognitive measures.
What the JCFS measures
The JCFS targets fluid-inductive reasoning: the ability to discover, hold in mind, and integrate abstract relations among visual elements. Each item presents a 3×3 grid in which a pattern of figures (typically crosses or similar geometric forms) evolves according to one or more transformation rules — for example, “rotate 90° clockwise across rows,” combined with “increment the number of elements down columns.” The examinee must infer the operating rules and produce the figure that completes the sequence.
This places the JCFS in the same broad family as the Raven Progressive Matrices, the most widely used nonverbal cognitive measure in the world. Carpenter, Just, and Shell’s classic 1990 cognitive analysis in Psychological Review showed that what makes a Raven-style item difficult is precisely the number and type of abstract rules to be inferred and integrated. Items requiring inference of one rule are easy; items requiring inference of two non-trivial rules and their joint application are substantially harder. The JCFS items are constructed within this framework.
In Carroll’s (1993) factor-analytic taxonomy of human cognitive abilities — the empirical foundation of the modern Cattell-Horn-Carroll (CHC) framework — abilities of this type load primarily on fluid reasoning (Gf), the broad cognitive ability most strongly associated with general intelligence (g). Fluid-reasoning tests are the strongest single contributors to Full-Scale IQ in most major intelligence batteries. The JCFS’s nonverbal, content-light design isolates this construct without contamination by verbal knowledge or culturally specific content.
What makes the JCFS structurally distinctive
Three design features differentiate the JCFS from most existing nonverbal measures:
- Open-ended response format. Where Raven-style tests typically present 6 or 8 candidate completions and ask the examinee to select the correct one, the JCFS requires the examinee to produce the figure that completes the sequence. This eliminates the cueing effects (process of elimination, partial recognition of distractors) that affect multiple-choice formats.
- Adaptive administration. Drawing from a 50-item bank, the JCFS administers items in real time based on previous responses, selecting items whose difficulty is most informative about the examinee’s current ability estimate. This is the same general approach used in modern computerized adaptive testing systems for educational assessment.
- Untimed completion. The JCFS does not impose time limits on items or the test as a whole. This separates the construct of fluid-reasoning capability from fluid-reasoning speed — two related but distinguishable abilities that timed nonverbal tests blend.
The adaptive engine: 2PL IRT with EAP estimation
The JCFS uses a two-parameter logistic (2PL) item response theory model. In this framework, each item has two characteristic parameters:
- Difficulty (b): the latent-trait level at which an examinee has a 50% probability of answering correctly.
- Discrimination (a): how sharply the probability of a correct response increases as ability increases — high-discrimination items distinguish more cleanly between examinees of similar ability.
The 2PL model is more flexible than the simpler one-parameter Rasch model (which constrains all items to a common discrimination) and more parsimonious than the three-parameter model (which adds a lower-asymptote guessing parameter, less relevant for open-ended formats where a correct response is unlikely to occur by chance).
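The 2PL response function described above can be sketched in a few lines. The item parameters used here are hypothetical illustrations, not actual JCFS bank values:

```python
import math

def p_correct(theta: float, a: float, b: float) -> float:
    """2PL item response function: probability of a correct response
    at ability theta, for an item with discrimination a and difficulty b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# At theta == b the probability is exactly 0.5, whatever the discrimination:
print(p_correct(0.8, a=1.5, b=0.8))   # 0.5

# Higher discrimination makes the curve steeper around b, so the item
# separates nearby ability levels more sharply:
print(p_correct(1.0, a=0.5, b=0.0))   # modestly above 0.5
print(p_correct(1.0, a=2.5, b=0.0))   # much closer to 1.0
```

The first print illustrates the definition of difficulty (50% point); the second pair illustrates what discrimination adds over the Rasch model.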
After each response, the JCFS updates its estimate of the examinee’s ability using Expected A Posteriori (EAP) estimation — a Bayesian approach that combines the response data with a prior distribution to produce a stable posterior estimate. EAP is generally preferred over maximum-likelihood estimation in adaptive testing because it produces stable estimates from the start of the test, before enough responses have been collected to make likelihood-based estimates well-defined.
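A minimal EAP update can be sketched with numerical integration on a grid, assuming a standard-normal prior; the manual does not publish the engine's exact prior or quadrature scheme, so treat this as a generic illustration of the method:

```python
import math

def p_correct(theta, a, b):
    """2PL item response function."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def eap_estimate(responses, grid_points=81, prior_sd=1.0):
    """EAP ability estimate: the posterior mean of theta over a quadrature
    grid, combining a N(0, prior_sd^2) prior with the 2PL likelihood of
    the observed (a, b, correct) response triples."""
    grid = [-4.0 + 8.0 * i / (grid_points - 1) for i in range(grid_points)]
    post = []
    for theta in grid:
        w = math.exp(-0.5 * (theta / prior_sd) ** 2)  # unnormalized normal prior
        for a, b, correct in responses:
            p = p_correct(theta, a, b)
            w *= p if correct else (1.0 - p)
        post.append(w)
    total = sum(post)
    return sum(t * w for t, w in zip(grid, post)) / total

# With no responses yet, the estimate is simply the prior mean (0) —
# this is why EAP is well-defined from the very first item:
print(eap_estimate([]))
# Two correct answers on moderately hard items pull the estimate upward:
print(eap_estimate([(1.5, 0.5, True), (1.2, 1.0, True)]))
```

The empty-response case shows the practical advantage over maximum likelihood, which is undefined until the response pattern contains both correct and incorrect answers.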
The combination of an adaptive item-selection algorithm with EAP estimation typically produces precise ability estimates in 6–16 items (average ~10). Wainer, Dorans, Flaugher, Green, and Mislevy’s foundational 2000 text on computerized adaptive testing established the efficiency gains of this approach: an adaptive test typically achieves the same precision as a fixed-form test using roughly half the items.
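One standard CAT selection rule consistent with the description above is maximum-information selection: administer the unused item with the highest Fisher information at the current ability estimate. A sketch with a hypothetical mini-bank (these are not JCFS item parameters, and the manual does not specify that this exact rule is used):

```python
import math

def p_correct(theta, a, b):
    """2PL item response function."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def fisher_information(theta, a, b):
    """2PL item information at theta: I(theta) = a^2 * P * (1 - P)."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)

def select_next_item(theta_hat, bank, administered):
    """Return the index of the unused item carrying the most
    information at the current ability estimate."""
    candidates = [i for i in range(len(bank)) if i not in administered]
    return max(candidates, key=lambda i: fisher_information(theta_hat, *bank[i]))

# Hypothetical bank of (a, b) pairs. For an examinee near theta = 1.0, the
# well-discriminating item with b close to 1.0 is the most informative:
bank = [(1.0, -1.0), (1.8, 1.1), (0.7, 1.0), (1.5, -0.2)]
print(select_next_item(1.0, bank, administered=set()))   # 1
print(select_next_item(-1.0, bank, administered=set()))  # 3
```

Note that at theta = -1.0 the engine prefers item 3 (b = -0.2, a = 1.5) over item 0 (b = -1.0, a = 1.0): a nearer difficulty does not always win, because discrimination enters the information function quadratically.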
Psychometric properties
The technical manual (Cogn-IQ, 2025) reports a marginal reliability of approximately ρ = 0.91 across the score range, computed from an N = 892 operational sample using standard IRT marginal-reliability methods (Green et al., 1984). Marginal reliability is the IRT analog of classical-test-theory reliability, computed as the average conditional precision integrated across the ability distribution. A value of 0.91 is comparable to that of major commercial cognitive batteries.
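One common formulation in the Green et al. (1984) tradition expresses marginal reliability as one minus the average conditional error variance over the trait variance. A sketch, assuming theta is scaled to unit population variance (the hypothetical SEs below are illustrative, not the JCFS's actual conditional SEs):

```python
def marginal_reliability(se_list, theta_variance=1.0):
    """Marginal reliability: rho = 1 - E[SE^2] / Var(theta), i.e. the
    average conditional error variance expressed against the population
    trait variance. Assumes theta is scaled to theta_variance."""
    mean_error_var = sum(se ** 2 for se in se_list) / len(se_list)
    return 1.0 - mean_error_var / theta_variance

# Conditional SEs of about 0.30 on a unit-variance theta scale correspond
# to a marginal reliability of about 0.91 (1 - 0.30^2):
print(marginal_reliability([0.30] * 10))
```

The example makes the reported figure concrete: a short adaptive test that holds conditional SEs near 0.30 theta units across examinees would land at roughly the 0.91 the manual reports.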
Reliability is not uniform across the score range. By design, IRT-based adaptive tests produce their highest precision near the center of the distribution where the most items are concentrated, with somewhat lower precision in the tails. The JCFS’s expected conditional precision gradient is reasonable for a test of this length: lower at very high and very low ability levels, but adequate across the range that covers most examinees.
The Visual Sequencing Index (VSI) is reported on a reference scale with mean 100 and SD 15, the same scale used for IQ scores in major commercial batteries. This allows direct comparison between JCFS results and other cognitive ability scores expressed on the same metric.
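If theta is on a standard-normal metric, the mapping to the VSI scale is a simple linear rescaling. The linear form below is an assumption for illustration; the manual's exact norming transformation may differ:

```python
def vsi_from_theta(theta: float) -> float:
    """Map a theta on a standard-normal metric to the VSI reference
    scale (mean 100, SD 15). Assumed linear for illustration."""
    return 100.0 + 15.0 * theta

print(vsi_from_theta(0.0))   # 100.0 (population mean)
print(vsi_from_theta(2.0))   # 130.0 (two SDs above the mean)
```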
Convergent validity against established nonverbal measures
The JCFS Technical Manual reports zero-order Pearson correlations between the JCFS CAT ability estimate (EAP θ) and external nonverbal-reasoning criteria, with confidence intervals from Fisher’s z-transform and disattenuated coefficients computed via the Spearman correction (Cogn-IQ, 2025):
- WAIS Matrix Reasoning: r = 0.83, 95% CI [0.69, 0.91] (N = 36); disattenuated r* = 0.89–0.93 (range reflects published WAIS-IV Matrix Reasoning reliability band of .88–.95).
- JCTI Inductive Reasoning Index: r = 0.80, 95% CI [0.71, 0.86] (N = 95); disattenuated r* = 0.90 using IRI reliability ρyy = .87 from the JCTI manual.
The .83 correlation with WAIS Matrix Reasoning — the most widely used clinical nonverbal-reasoning subtest — places the JCFS in the same construct neighborhood as the established Wechsler nonverbal cluster. The disattenuated coefficient of .89–.93 indicates the two measures are essentially measuring the same underlying construct after correcting for measurement error in both. The .80 correlation with the JCTI IRI (a separately developed Cogn-IQ inductive-reasoning measure) provides additional convergent evidence within the broader Gf nomological network. The manual also reports concurrent validity for the legacy fixed-form raw scoring (r = 0.87 with WAIS Matrix Reasoning, N = 36) alongside the CAT θ scoring’s r = .83; the small difference (Δ = −0.04) is attributable to the different scoring frameworks rather than to a substantive construct difference.
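The manual's confidence intervals and disattenuated coefficients can be reproduced directly from the reported figures. The helper names below are generic, not the manual's own code:

```python
import math

def fisher_ci(r, n, z_crit=1.96):
    """95% CI for a Pearson r via Fisher's z-transform:
    z = atanh(r), SE = 1/sqrt(n - 3), back-transformed with tanh."""
    z = math.atanh(r)
    se = 1.0 / math.sqrt(n - 3)
    return math.tanh(z - z_crit * se), math.tanh(z + z_crit * se)

def disattenuate(r, rel_x, rel_y):
    """Spearman correction for attenuation: r* = r / sqrt(rel_x * rel_y)."""
    return r / math.sqrt(rel_x * rel_y)

# WAIS Matrix Reasoning figures (r = .83, N = 36):
lo, hi = fisher_ci(0.83, 36)
print(round(lo, 2), round(hi, 2))                # 0.69 0.91

# Disattenuation with JCFS rho = .91 and the WAIS reliability band .88-.95:
print(round(disattenuate(0.83, 0.91, 0.95), 2))  # 0.89
print(round(disattenuate(0.83, 0.91, 0.88), 2))  # 0.93

# JCTI IRI figures (r = .80, rho_yy = .87):
print(round(disattenuate(0.80, 0.91, 0.87), 2))  # 0.90
```

Running the sketch recovers every reported value — the CI endpoints [0.69, 0.91], the .89–.93 disattenuated band for WAIS, and the .90 for the JCTI IRI — which confirms the manual's arithmetic is internally consistent.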
How the JCFS compares to other nonverbal measures
The choice between the JCFS and other nonverbal cognitive measures depends on what the assessment is being used for:
- vs. Raven’s Progressive Matrices. The JCFS uses the same broad item type (figural pattern completion) but production rather than recognition responses, and adaptive rather than fixed-form administration. The Raven, with a longer history and broader normative data, remains the standard for many clinical applications. The JCFS may produce somewhat different ability estimates because of the format change (open-ended vs. multiple-choice).
- vs. WAIS Matrix Reasoning. The Wechsler Adult Intelligence Scale’s Matrix Reasoning subtest is a multiple-choice nonverbal subtest within a broader IQ battery. It serves a different purpose — contributing to a Full-Scale IQ estimate — than a stand-alone nonverbal measure like the JCFS.
- vs. Cattell Culture Fair Intelligence Test. Like the JCFS, the Cattell CFIT was designed to minimize cultural and language influences. It uses fixed-form administration. The JCFS’s adaptive design produces shorter testing times for similar precision.
- vs. other Jouve tests (JCTI). The JCTI is a related Jouve test focused on inductive reasoning. The JCFS is more specifically figural-spatial.
What the JCFS is and is not appropriate for
Several use cases follow from the test’s design:
- Research where nonverbal cognitive ability is a covariate or matching variable. The JCFS produces a score on the standard mean-100/SD-15 scale that can be used as a matching or covariate variable.
- Self-administered assessment in non-English-speaking populations. The purely figural content is not language-bound. The same test can be administered in any language with no translation effects on item content.
- Cognitive assessment when verbal assessment is impractical. Individuals with hearing or speech impairments, or with limited literacy in the test language, can complete the JCFS with minimal accommodation.
- Tracking changes in fluid reasoning over time. The same test bank can be administered repeatedly, with the adaptive engine producing comparable scores from the same item pool.
The test is not appropriate for some applications:
- Stand-alone diagnostic IQ assessment for clinical decisions. A full intelligence battery (WAIS, Stanford-Binet, RIAS) covering verbal and nonverbal abilities is the standard for clinical IQ assessment. The JCFS is a nonverbal-only measure.
- Assessment of crystallized abilities. The JCFS does not measure verbal knowledge, vocabulary, or accumulated factual knowledge. Individuals with high crystallized intelligence and lower fluid reasoning will not show their full cognitive profile on a JCFS-only assessment.
- Severely cognitively impaired examinees. The 2PL adaptive engine is calibrated for ability levels in the typical range. Examinees with very low cognitive functioning may produce unstable estimates.
Open research directions
The JCFS evidence base now spans the 2023 psychometric evaluation paper and the current technical manual (Cogn-IQ, 2025), with reliability documented at N = 892 and convergent validity against both WAIS Matrix Reasoning (N = 36) and the JCTI Inductive Reasoning Index (N = 95). Useful extensions of the existing base include:
- Independent replication outside the Cogn-IQ ecosystem. Convergent-validity correlations and the CAT calibration would benefit from replication by independent research groups.
- Differential item functioning analyses. Demographic-subgroup DIF analyses on the 50-item bank would clarify the bounds of fair use across age, sex, and cultural-background groups.
- Test-retest stability across longer intervals. Internal precision (marginal reliability) is well-characterized; test-retest reliability across months or years would complement the existing evidence.
- Age-stratified norms. Single-reference norms are less informative than age-stratified norms for individual interpretation, particularly for adult samples spanning broad age ranges.
- Convergent validity against additional nonverbal batteries. The Raven’s Progressive Matrices and the NNAT are the most widely used nonverbal alternatives; correlation studies against these batteries would refine the JCFS’s positioning within the nonverbal-Gf measurement landscape.
Frequently Asked Questions
What is the JCFS test?
A nonverbal, computerized adaptive cognitive test consisting of 3×3 grid items that follow abstract transformation rules. Examinees produce the figure that completes the sequence. The test is untimed and uses adaptive item selection from a 50-item bank.
How is the JCFS scored?
The test produces a Visual Sequencing Index (VSI) on a reference scale with mean 100 and standard deviation 15, the same scale as IQ scores in major commercial batteries.
How long does the JCFS take?
Because it is adaptive, administration time varies. Most examinees complete the test in under 30 minutes, with the adaptive engine typically reaching a precise ability estimate after 6–16 items.
Is the JCFS an IQ test?
The JCFS is a measure of nonverbal cognitive ability, specifically fluid-inductive reasoning. It produces a score on the IQ scale but does not assess verbal abilities, working memory, or other components of a full intelligence battery. It is best understood as a nonverbal-only cognitive measure rather than a full IQ test.
What does “marginal reliability” mean?
The IRT analog of classical reliability, computed as the average precision of ability estimates integrated across the ability distribution. The JCFS’s value of approximately 0.91 is comparable to commercial cognitive batteries.
What’s the difference between the JCFS and Raven’s matrices?
Both are nonverbal figural-pattern tests. The Raven uses fixed-form multiple-choice; the JCFS uses adaptive open-ended (production) responses. The Raven has a longer history and broader norming; the JCFS is shorter to administer for comparable precision and avoids the cueing effects of multiple-choice formats.
Can the JCFS be administered cross-culturally?
The figural-only content avoids most language and culture-specific elements. Like all cognitive tests, however, performance can still be influenced by familiarity with abstract test-taking conventions, which vary across cultural contexts. Cross-cultural use is more defensible than for verbal tests but is not free of all cultural effects.
References
- Cogn-IQ. (2025). JCFS Technical Manual. Cogn-IQ. https://www.cogn-iq.org/methods/jcfs-manual/
- Jouve, X. (2023). Psychometric Evaluation of the Jouve Cerebrals Figurative Sequences as a Measure of Nonverbal Cognitive Ability. Cogn-IQ Research Papers. https://pubscience.org/ps-1mSQY-6dcead-cugU
- Carpenter, P. A., Just, M. A., & Shell, P. (1990). What one intelligence test measures: A theoretical account of the processing in the Raven Progressive Matrices Test. Psychological Review, 97(3), 404–431. https://doi.org/10.1037/0033-295X.97.3.404
- Raven, J., & Raven, J. (2003). Raven Progressive Matrices. In R. S. McCallum (Ed.), Handbook of Nonverbal Assessment (pp. 223–237). Springer. https://doi.org/10.1007/978-1-4615-0153-4_11
- Carroll, J. B. (1993). Human Cognitive Abilities: A Survey of Factor-Analytic Studies. Cambridge University Press. https://doi.org/10.1017/CBO9780511571312
- Green, B. F., Bock, R. D., Humphreys, L. G., Linn, R. L., & Reckase, M. D. (1984). Technical guidelines for assessing computerized adaptive tests. Journal of Educational Measurement, 21(4), 347–360.
- Wainer, H., Dorans, N. J., Flaugher, R., Green, B. F., & Mislevy, R. J. (2000). Computerized Adaptive Testing: A Primer (2nd ed.). Routledge. https://doi.org/10.4324/9781410605931