Evidence is the foundation of research claims. A critical researcher does not merely note what evidence is presented — they evaluate whether it is strong enough to support the conclusions drawn. This is one of the most demanding and highest-value skills in Extended Investigation.
For every piece of evidence offered in support of a finding, ask: “Is this evidence sufficient, reliable and valid to warrant the conclusion drawn from it?”
If the answer is no, the conclusion may still be true — but it is not adequately supported by this particular evidence.
KEY TAKEAWAY: A conclusion is only as strong as the evidence and reasoning that support it. Your job is to assess the warrant — the degree to which the evidence justifies the conclusion. A weak link here undermines the whole argument.
A useful acronym for evaluating the evidence offered by sources is RAVEN:
| Letter | Criterion | Question to Ask |
|---|---|---|
| R | Reputation | Does this source/researcher have credibility in this field? |
| A | Ability to see | Did they have direct access to what they claim to have observed? |
| V | Vested interest | Do they have anything to gain from a particular conclusion? |
| E | Expertise | Are they qualified to make this judgement? |
| N | Neutrality | Is there a political, commercial or ideological motive? |
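If you keep a digital Journal, the RAVEN criteria can double as a reusable checklist. Here is a minimal Python sketch, assuming you record a simple pass/fail for each criterion; the class name, field names and example source are invented for illustration:

```python
from dataclasses import dataclass, fields

@dataclass
class RavenCheck:
    """One RAVEN evaluation for a single source.
    True means the source passes that criterion."""
    reputation: bool          # credible in this field?
    ability_to_see: bool      # direct access to what they observed?
    no_vested_interest: bool  # nothing to gain from one conclusion?
    expertise: bool           # qualified to make this judgement?
    neutrality: bool          # free of political/commercial/ideological motive?

    def weaknesses(self):
        """List the criteria this source fails, for your limitations row."""
        return [f.name for f in fields(self) if not getattr(self, f.name)]

# Hypothetical source: industry-funded report by a credentialed author.
check = RavenCheck(reputation=True, ability_to_see=True,
                   no_vested_interest=False, expertise=True, neutrality=True)
print(check.weaknesses())  # ['no_vested_interest']
```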
Observational studies can establish correlation, but they generally cannot establish causation (the simulation sketch after this list shows how a confounder creates exactly this trap). Look for:
- Random assignment (experiments) as the gold standard for causal claims
- Confounding variables that might explain the association
- Language that overstates causal claims from correlational data
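To make the confounding point concrete, here is a minimal Python simulation; the scenario, variable names and effect sizes are all invented for illustration. A third variable drives both x and y, so they correlate strongly even though neither causes the other:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# Hypothetical confounder: family income (z) drives both
# private tutoring hours (x) and exam scores (y).
z = rng.normal(size=n)             # confounder
x = 0.8 * z + rng.normal(size=n)   # x is caused by z, not by y
y = 0.8 * z + rng.normal(size=n)   # y is caused by z, not by x

# x and y are strongly correlated despite no causal link between them.
print(f"corr(x, y) = {np.corrcoef(x, y)[0, 1]:.2f}")   # ~0.39

# Holding the confounder (roughly) fixed removes the association.
band = np.abs(z) < 0.1
print(f"corr(x, y | z fixed) = {np.corrcoef(x[band], y[band])[0, 1]:.2f}")  # ~0.0
```

A researcher who saw only x and y could easily report that tutoring causes higher scores; the sketch shows why that causal language would overstate what the data can support.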
EXAM TIP: A very common exam question presents a study with a methodological flaw and asks: “How does this weakness affect the conclusion?” Structure your answer: (1) name the flaw, (2) explain its specific effect on validity, (3) state what the researcher should have done.
Even perfectly valid evidence may not support a particular conclusion if it addresses a different question. Ask:
- Is this evidence directly relevant to the claim being made?
- Is there a logical gap between what was measured and what was concluded?
- Does the evidence address the specific population, context or timeframe of the claim?
| Weakness | Description |
|---|---|
| Small sample | Findings may not be generalisable (see the sketch after this table) |
| Non-representative sample | Findings may reflect the sample rather than the population |
| Self-report bias | Participants may respond inaccurately due to social desirability |
| Correlation ≠ causation | Association does not establish cause |
| Insufficient controls | Other explanations not ruled out |
| Publication bias | Only positive results published; effect may be overstated |
| Outdated data | Findings may not apply to current context |
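To see why small samples generalise poorly (the first row of the table above), here is a minimal sketch; the population mean and standard deviation are invented for illustration. Repeated small samples from the same population produce wildly different estimates, while large samples cluster near the true value:

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical population: true mean 50, standard deviation 15.
population_mean, population_sd = 50, 15

def sample_mean_range(n, trials=1000):
    """Draw `trials` samples of size n; return the spread of their means."""
    means = rng.normal(population_mean, population_sd, size=(trials, n)).mean(axis=1)
    return round(means.min(), 1), round(means.max(), 1)

# Small samples: estimates swing widely around the true mean of 50.
print("n=10:  ", sample_mean_range(10))    # roughly (35, 65)
print("n=1000:", sample_mean_range(1000))  # roughly (48.5, 51.5)
```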
APPLICATION: When summarising a source in your Journal, add “strengths of evidence” and “limitations of evidence” rows to your analysis table. This habit ensures you are always thinking evaluatively, not just descriptively.
COMMON MISTAKE: Citing the statistical significance of a finding as proof of its importance. Statistical significance (p < 0.05) means only that a result at least as extreme as the one observed would be unlikely if there were no real effect; it says nothing about effect size, practical importance, or generalisability. Always look for effect sizes alongside significance values.
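A short sketch of this distinction, assuming numpy and scipy are available; the group sizes and the tiny true difference (0.03 standard deviations) are invented for illustration. With a large enough sample, even a trivial difference passes p < 0.05:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)

# Two hypothetical groups whose true means differ by a trivial amount,
# measured with a very large sample.
n = 100_000
group_a = rng.normal(0.00, 1.0, n)
group_b = rng.normal(0.03, 1.0, n)

t, p = stats.ttest_ind(group_a, group_b)

# Cohen's d: a standardised effect size (difference in SD units).
pooled_sd = np.sqrt((group_a.var(ddof=1) + group_b.var(ddof=1)) / 2)
d = (group_b.mean() - group_a.mean()) / pooled_sd

print(f"p = {p:.1e}")          # far below 0.05: "statistically significant"
print(f"Cohen's d = {d:.3f}")  # ~0.03: negligible in practical terms
```

The p-value screams “significant” while the effect size shows the difference is too small to matter, which is exactly why the two must be reported together.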