A concept test answers one of the most important questions in product and marketing: does this idea work for the people we are building it for, before we invest in building it?
Done well, concept testing prevents expensive launches of things the market does not want and reveals which variant of an idea is most compelling. Done badly, it gives you a false read that sends a development team in the wrong direction for six months.
This guide covers how to design a concept test that gives you a reliable, decision-useful result.
What Concept Testing Is and Is Not
A concept test exposes a target audience to a product, feature, service, or campaign concept and measures their reaction: appeal, purchase intent, clarity, and differentiation.
Concept testing is not a demand forecast. A high purchase intent score in a concept test does not reliably predict sales. Stated intent is always higher than actual behaviour because concept tests lack the friction, competition, and decision context of a real purchase. Use concept test data to compare concepts and identify weaknesses, not to project revenue.
Concept testing is also not a substitute for in-depth qualitative research. Quantitative concept test scores tell you what performed better. They do not tell you why. Pair quantitative concept testing with qualitative interviews for a complete picture.
Writing a Concept Statement
The concept statement is the foundation of the test. If it is poorly written, the test results are uninterpretable.
A good concept statement is:
- Clear: a target consumer who has no knowledge of your internal development work can understand it in one read
- Specific: it describes what the product or feature does, not what you hope it will become
- Benefit-led: it communicates the benefit to the consumer, not the feature or the technical mechanism
- Realistic: it does not oversell or use marketing hyperbole that makes the concept sound implausibly good
The standard format for a concept statement is: a one-paragraph description of the concept, a “how it works” section of two to three bullets, and a closing benefit statement.
Test your concept statement with five people before running the full study. Ask them to read it and tell you in their own words what the product does. If they cannot do that accurately, the statement needs rewriting.
Monadic vs Sequential Testing
There are two main designs for concept tests: monadic and sequential (also called comparative).
Monadic Testing
In a monadic test, each respondent sees only one concept. Different respondents see different concepts. Results are compared across groups.
Monadic testing is the gold standard because it measures each concept in isolation, without the contaminating influence of comparison. When respondents see multiple concepts, they anchor on the first and evaluate all subsequent concepts relative to it. Monadic testing removes this anchor effect.
The tradeoff is cost: monadic testing requires a larger total sample because each concept needs its own cell. For three concepts at 150 respondents per cell, you need 450 respondents total.
Sequential (Comparative) Testing
In a sequential test, each respondent sees all concepts, one after another. This is cheaper because all respondents evaluate all concepts.
Sequential testing is appropriate when you have a clear brief that requires direct comparison. For example, choosing between two ad campaign routes and when the concepts are distinct enough that order effects can be managed by randomising the presentation order across respondents.
Sequential testing is not appropriate when the concepts are similar enough that seeing the first one will meaningfully change how respondents evaluate the second.
Core Metrics for a Concept Test
Appeal
“Overall, how appealing do you find this concept?” Five-point scale from Very appealing to Not at all appealing. Report the “Top 2 Box” score (Very + Somewhat appealing). A Top 2 Box appeal score above 60% is generally considered a strong result in consumer categories.
Purchase Intent
“How likely would you be to purchase/use this if it were available?” Five-point scale. Report Top 2 Box. Purchase intent will almost always be lower than appeal. The gap between appeal and purchase intent is diagnostic: a large gap suggests the concept is interesting but not compelling enough to drive action.
Uniqueness
“How different is this from other options currently available to you?” Five-point scale. Uniqueness is a key predictor of whether a concept will create its own demand or fight for share in an existing category.
Clarity
“How clear is this concept to you?” Five-point scale. If clarity is low, your concept statement is the problem, not the concept itself. Low-clarity results are not interpretable as real concept performance.
Adding Qualitative Depth
Quantitative scores tell you which concept performed better. They do not tell you why, or what would make a weaker concept stronger.
An open-ended question after the scaled ratings: “What would make you more likely to use this product?” or “What concerns, if any, do you have about this concept?” generates verbatim data that can be analysed for themes.
For a more thorough exploration of why a concept did or did not resonate, AI-moderated in-depth interviews give you the kind of conversational depth that a survey cannot. What is an AI Moderator? covers how these work and when to use them alongside quantitative concept testing.
Setting a Decision Rule Before You Start
Define your decision criteria before you see the results. This is a principle borrowed from clinical research but applies directly to concept testing.
What Top 2 Box appeal score will you require to move a concept to the next development stage? What purchase intent score? If you do not set these thresholds in advance, you will find yourself adjusting them after seeing the data to justify the decision you wanted to make anyway.
Pre-specified decision rules also make it easier to defend your recommendation to stakeholders who are emotionally attached to a concept that did not test well.
Interpreting Concept Test Results
A few interpretation principles:
- Do not over-interpret small differences. A Top 2 Box of 61% versus 58% across cells of 150 respondents is within margin of error. Report direction, not false precision.
- Look at the full score profile, not just purchase intent. A concept with modest purchase intent but very high uniqueness may be creating a new category rather than competing in an existing one.
- Segment your results. Aggregate appeal scores mask important variation. Check whether the concept appeals differentially to the core target versus the broader population.
- Combine the quant scores with the open-ended verbatims. Often the verbatims explain an unexpected score pattern that the scales alone cannot.
For guidance on presenting concept test findings to a product or marketing leadership team, see How to Present Research Findings to Stakeholders.
Related Reading
- How to Conduct a Brand Tracking Study
- What is an AI Moderator? How It Works vs. Human Moderators
- In-Depth Interview (IDI) Guide: Templates, Tips & Best Practices
- How to Measure Brand Awareness: Methods, Metrics & Benchmarks
- How to Present Research Findings to Stakeholders
- How to Do a Customer Segmentation Study