解題
Introduction: Validity refers to the accuracy of measurement in psychological research—whether a tool actually measures the psychological construct it claims to measure. Both Saavedra and Silverman (evaluative learning and disgust/fear in button phobia) and Hölzel et al. (neuroplasticity and mindfulness) employ a combination of subjective and objective measurements, balancing internal and ecological validity.
Strength 1: High construct and internal validity of objective measurements.
In Hölzel et al., the use of high-resolution MRI scans (Voxel-Based Morphometry) provides a highly objective, physical measurement of gray matter density changes. Unlike self-reports, brain scans cannot be consciously manipulated by participants, eliminating demand characteristics and social desirability bias. This ensures high internal validity regarding structural brain changes.
In Saavedra and Silverman, direct behavioral observations of the boy's ability to handle buttons (e.g., successfully completing exposure tasks such as putting buttons in his mouth or hugging his mother while she wore buttons) provided objective evidence of behavioral desensitization, complementing his subjective reports.
Weakness 1: Threats to validity in subjective self-reports.
In Saavedra and Silverman, the 9-point Feelings Thermometer was used to measure distress. Although this allowed researchers to track changes in cognitive evaluations over time, it is highly subjective. The boy may have displayed demand characteristics, reporting lower distress scores toward the end of treatment to please the researchers or his mother.
In Hölzel et al., the Five Facet Mindfulness Questionnaire (FFMQ) was used to measure subjective mindfulness levels. Since this relies on self-assessment, participants' ratings might be affected by social desirability (wanting to appear more mindful) or response bias, which limits the construct validity of the mindfulness measure.
Strength 2: Triangulation of measurements.
Both studies enhanced validity by combining qualitative/quantitative measurements or subjective/objective assessments. Hölzel et al. correlated physical changes in gray matter density (MRI scan data) with psychological shifts in mindfulness (FFMQ scores), validating that self-reported changes corresponded with physiological changes. Saavedra and Silverman paired subjective distress ratings (Feelings Thermometer) with qualitative clinical interviews and overt behavioral indicators of fear, ensuring a comprehensive and valid assessment of the boy's phobic symptoms.
Weakness 2: Ecological validity of measurement environments.
In Hölzel et al., the physical measurement of brain structure requires participants to remain perfectly still inside a noisy, cramped, and artificial MRI scanner environment. This stressful environment is low in ecological validity and might affect baseline physiological states, potentially confounding measures of brain activity.
In Saavedra and Silverman, while the therapeutic measurements took place in a controlled clinical environment which allowed for standardized tracking, this setting may not represent the child's true response to buttons in natural, chaotic environments such as a school cafeteria, meaning the clinical measurements may lack ecological validity.
評分準則
Marks are awarded using a level-based marking grid (0–12 marks):
Level 4 (10–12 marks):
- Evaluation is detailed and shows a thorough understanding of the validity of measurements.
- Excellent use of examples from both Saavedra and Silverman and Hölzel et al.
- The argument is balanced, presenting both strengths and weaknesses of objective and subjective measurements.
- Structure is logical and psychological terminology is used accurately throughout.
Level 3 (7–9 marks):
- Evaluation is good, but may lack a complete balance of strengths and weaknesses.
- Examples from both studies are present, though one may be described in more detail than the other.
- Some psychological terminology is used appropriately.
Level 2 (4–6 marks):
- Basic evaluation of the validity of measurements.
- Discussion may focus heavily on description of the studies rather than analyzing their measurement validity.
- Limited or unbalanced use of examples from the studies.
Level 1 (1–3 marks):
- Little or no evaluation of measurement validity.
- Explanations are highly descriptive, superficial, or contain significant inaccuracies.
- Minimally relevant to the question asked.
Level 0 (0 marks):
- No response worthy of credit.