
24
Panko, Curtis, Gorrall & Little
Actualidades en Psicología, 29(119), 2015, 19-27
Regression Discontinuity design is a useful alternative
to and can rival the performance of RCT designs.
Propensity Score Matching.
Propensity Score Matching (PSM) is a quasi-
experimental technique first published by Rosenbaum
and Rubin (1983). Propensity score matching attempts
to rectify selection bias that can occur when random
assignment is not possible by creating two groups that
are statistically equivalent based on a set of important
characteristics (e.g., age, gender, ethnicity, personality,
health status, IQ, experience, etc) that are relevant to
the study at hand. Here, each participant gets a score
on their likelihood (propensity) to be assigned to the
treatment group based on the characteristics that drive
selection (termed, covariates). A treatment participant
is matched to a corresponding control participant
based on the similarity of their respective propensity
score. That is, the control participants included in the
analysis are those who match treatment participants
on the potential confounding selection variables; in
this way, selection bias is controlled.
Before propensity scores can be estimated, the
likely selection covariates must be identified. Most
researchers include all variables that could potentially
correlate with the selection influences impacting
treatment and outcome (Coffman, 2012; Cuong,
2013; Lanza, Coffman, & Xu, 2013; Stuart et al.,
2013), regardless of the magnitude of correlation
(Rubin, 1997).
In practice, propensity scores are typically
estimated using logistic (e.g., Lanza, Moore, &
Butera, 2013), probit (e.g., Lalani et al., 2010), or
multiple binomial logistic regression models (e.g.,
Slade et al., 2008) in which the group membership
is the dependent variable predicted by the selection
variables in the dataset (Caliendo & Kopeinig, 2008;
Lanza et al., 2013). The logistic regression model, as
proposed by Cox (1970), has been the most commonly
employed technique in propensity score calculations
(Rosenbaum & Rubin, 1985). The probability score,
a decimal value ranging from 0 to 1, is retained and
used to match participants from the treatment and
control groups.
Once the propensity scores have been estimated,
each participant from the treatment condition is
matched with a participant from the control condition.
As mentioned, the matching of these participants is
based upon the similarity of their propensity scores.
Matching participants from the treatment condition
with similar participants from the control condition
can be completed utilizing the nearest neighbor, caliper,
stratification, and kernaling techniques (e.g., Austin,
2011). Of these methods, differences exist in the
number of participants from the control group who
are matched to treatment participants and whether or
not control participants can be matched more than
once (Coca-Perraillon, 2006).
The nearest neighbor and caliper techniques are
among the most popular (Coca-Perraillon, 2006).
The treatment and control groups are randomly
sorted for both methods. Then, the first treatment
participant is matched without replacement with the
control participant who has the closest propensity
score. The algorithm moves down the list of all the
treatment participants and repeats the process until
all the treatment participants are matched with a
control counterpart. If any control participants are
left over, they are discarded (Coca-Perraillon, 2006).
The difference in the techniques is that with caliper
matching, treatment participants are only used if there
is a control participant within a specified range. Thus,
in this technique, unlikely matches are avoided (Coca-
Perraillon, 2006).
The optimal full matching technique (Hansen,
2004) improves on these popular techniques in two
ways. First, it creates closer matches than the previous
techniques – with caliper and nearest neighbor, a
match is made independently of the other pairs. On
the other hand, optimal full matching always creates
matches with the smallest possible average propensity
score differences between matched treatment and
control participants by taking into account all the other
matches. Second, optimal full matching allows for all
control participants to be used (Hansen, 2004). After
matching, the participants in the treatment and control