SPSS COMPUTE command sets the data values for (possibly new) numeric variables and string variables. These values are usually a function (such as MEAN, SUM or something more advanced) of other variables.
This tutorial walks you through doing just that. We'll use hospital.sav,a screenshot of which is shown below.
Before proceeding, we'll first set 6 as a user missing value for the last 5 variables. We'll do so by running the syntax below. missing values doctor_rating to facilities_rating (6).
SPSS COMPUTE Existing Numeric Variable
The simplest COMPUTE example is probably computing an already existing numeric variable. Say we'll add one point to facilities_rating because we feel our respondents were overly negative about it. The syntax below does just that. Note that COMPUTE is a transformation so we also run EXECUTE (“exe.”) in order to see the result.
compute facilities_rating = facilities_rating + 1.
Note that 6 does not result in 7. This is because 6 is a user missing value and it's used in a basic numeric function.
SPSS COMPUTE New Numeric Variable
If the variable that's computed doesn't exit yet, SPSS will create it as a numeric variable having an f format. One of the implications is that we can't directly COMPUTE new string variables but we'll get to that in a minute. We first compute the mean over our 5 ratings but only for cases having at least 3 valid values. Note how this is easily accomplished by using the dot operator.
compute mean_score = mean.3(doctor_rating to facilities_rating).
SPSS COMPUTE Existing String Variable
In normal language, COMPUTE usually refers to operations on numbers. In SPSS, however, COMPUTE is used for setting the values of string variables as well. Keep in mind here that you can't use numeric functions on string variables or vice versa. The example below converts surname_prefix to lower case.
compute surname_prefix = lower(surname_prefix).
SPSS COMPUTE New String Variable
SPSS can compute only existing string variables. For new string variables, we must first create new (empty) variables with the STRING command. After doing so, we can set their values with COMPUTE. Like so, the syntax below creates full_name by concatenating the respondents' name components.
*2. COMPUTE full_name with CONCAT and RTRIM.
compute full_name = concat(rtrim(first_name),' ',rtrim(surname_prefix),' ',rtrim(last_name)).
SPSS COMPUTE Date, Time and Datetime Variables
COMPUTE can be used for creating new date variables, time variables and datetime variables. This is because these are all numeric variables. However, new numeric variables always have an f format, which is usually not suitable for the aforementioned variables. The way to go here is to first computing the variables by using date functions or basic numeric functions. After doing so, use FORMATS for displaying their values appropriately. The syntax below gives an example.
compute entry_moment = entry_date + entry_time.
*2. Show datetime values (in seconds) as normal dates with times.
THIS TUTORIAL HAS 9 COMMENTS:
By kings on March 1st, 2016
By Nicola on October 19th, 2018
Hi Ruben. I have a question. I wish I would create a new variable where its values are the same of another variable. For example, I have to create a new variable Z. If variable X=1, variable Z=variable Y. If variable1, variable Z=variable W.
Is it clear? Could you tell me how to do it?
Thank you very much!
By Ruben Geert van den Berg on October 20th, 2018
There's several ways but the easiest is probably using IF:
if(vara = 1) varb = varx.
if(vara = 2) varb = vary.
if(vara = 3) varb = varz.
and so on. An alternative is DO IF which is slower for simple conditions but faster for complex conditions like
do if(gender = 1).
if(age = 1) ...
if(age = 2) ...
else if (gender = 2).
if(age = 1) ...
if(age = 2) ...
We don't have to repeat the gender condition for each statement here.
In some cases, you can speed things up with DO REPEAT:
do repeat #vars = v1 to v5 / #nums = 1 to 5.
if(testvar = #nums) newvar = #vars.
This trick is often used for creating dummy variables for regression analysis.
Hope that helps!
By Nicola on October 20th, 2018
Thank you so much, Ruben! Very useful!
By Erik on January 12th, 2019
I have 4 categorical variables, representing 4 different cholesterol lowering mediactions; each coded 0 if patiënt does not use it And 1 if he uses. I want to "calculate" a new categorical variable " chol_med" coded 1 if one or more of 4 medications are 1 and coded 0 if all are 0. What syntax should i use?