SPSS tutorials website header logo SPSS TUTORIALS FULL COURSE BASICS ANOVA REGRESSION FACTOR

SPSS VECTOR – Quick Tutorial

In SPSS, a VECTOR is a list of (new or existing) variables that can be referenced by their indices in this list. VECTOR is often combined with LOOP.

SPSS Vector with Loop for Dummy Variables Creating Dummy Variables with VECTOR and LOOP

SPSS Vector - Basic Example

Suppose we'd like to Creating Dummy Variables in SPSS for a variable holding values 1 through 4. We'll call the four dummy variables d1 through d4. Now runningVECTOR d(4).first creates these new (empty) variables. As long as the VECTOR is in effect, d1 can be addressed by d(1) and so on. Let's first run the syntax below to verify this.

SPSS Vector Syntax Example

*1. Create mini dataset.

data list free/original.
begin data
1 2 3 4
end data.

*2. Define vector for new variables d1 to d4.

vector d(4).

*3. As long as the vector exists, d1 can be addressed as d(1) and so on.

compute d(1) = original = 1.
compute d(2) = original = 2.
compute d(3) = original = 3.
compute d(4) = original = 4.
exe.

SPSS Vector with Loop

Our first VECTOR syntax didn't save us any effort. So why use VECTOR here? The point is that being able to address variables by their indices enables us to LOOP over them. The syntax below, building upon its predecessor, demonstrates the simplest possible example of this. It deletes the new variables and then recreates them in a more efficient way. Note that we use a scratch variable as our loop index.This COMPUTE command may strike you as odd. It's explained in Compute A = B = C.

SPSS Vector with Loop Syntax

*1. Delete new variables.

delete variables d1 to d4.

*2. Vector.

vector d(4).

*3. Use vector with loop.

loop #i = 1 to 4.
compute d(#i) = original = #i.
end loop.
exe.

Fastest Dummification

A little known use of VECTOR is addressing variables by a non constant (over cases). So if we have a variable original,COMPUTE d(original) = 1.generates different COMPUTE commands for different cases. So for a case holding 3 on original, it implies COMPUTE d3 = 1. The other new variables, d1, d2 and d4 are not affected (and thus hold only system missing values but we'll fix that with RECODE).
The syntax below may be the fastest way to create unlabelled dummy variables. However, since it's highly recommended to label new variables and their values, we recommend our Create Dummy Variables tool for practical purposes.

SPSS Vector Syntax Example

*1. Delete new variables.

delete variables d1 to d4.

*2. Vector.

vector d(4).

*3. Use non constant over cases (variable "original") in vector.

compute d(original) = 1.
exe.

*4. Correct system missings in new variables.

recode d1 to d4 (sysmis = 0).
exe.

SPSS Vector of Existing Variables

Thus far we used VECTOR for creating new variables. Alternatively, we can use it for addressing existing variables with a slightly different syntax: for addressing the (existing) variables d1 through d4, we'll useVECTOR d = d1 TO d4.We can use this for reversing the aforementioned dummification. The next syntax example demonstrates this by looping over an IF command.

SPSS Vector Syntax Example

*1. Vector of existing rather than new variables.

vector d = d1 to d4.

*2. Reconstruct multinomial variable from dummy variables.

loop #v = 1 to 4.
if d(#v) = 1 reconstructed = #v.
end loop.
exe.

SPSS Vector - Final Notes

SPSS Vector Bonus Examples

1. Shift Values Forward

“I have missing values in my data. I'd like to shift the valid values forward within cases so they become adjacent. How can I accomplish that?

SPSS Vector Shift Values Data Before and After Shifting Valid Values Forward

SPSS Vector Syntax Example

*1. Create dataset.

data list free/x1 to x5.
begin data
'' '' '' 0 1 1 0 '' 0 1 '' '' 0 '' 1 '' 0 1 0 '' 1 1 1 '' '' '' '' 1 '' '' '' '' 0 0 1 1 1 '' '' 1 '' 1 '' 0 '' '' 1 1 '' 0
end data.

*2. Shift values forward.

compute #new = 1.

vector x = x1 to x5 / v(5).
loop #old = 1 to 5.
if not(sysmis(x(#old))) v(#new) = x(#old).
if not(sysmis(x(#old))) #new = #new + 1.
end loop.
exe.

Explanation

We'll use one vector for the existing variables and one for new variables. We'll loop through the old variables and every time a valid value is encountered, #new increases by 1. Like so, v(#new) may refer to different variables for different cases. However, it always refers to the first new variable that doesn't have a valid value yet. It's this variable that'll take the next valid value we encounter on the old variables.

2. Unrank Data

“I asked respondents to rank 5 products. The first variable contains their first choice, the second variable their second and so on. However, I'd like to have a variable per product that has a 1 if it was the first choice, a 2 if it was the second product and so on.”

SPSS Vector Unrank Values Data Before and After Unranking Values

SPSS Vector Syntax Example

*1. Create rank data.

data list free/c1 to c5. /*c1 is the first choice, c2 the second and so on.
begin data.
3 4 2 5 1 2 4 5 1 3 1 3 4 5 2 2 5 3 4 1
end data.

*2. Unrank data.

vector o(5)/old = c1 to c5./*o1 is the first option chosen, o2 the second and so on.
loop #value = 1 to 7.
compute o(old(#value)) = #value.
end loop.
exe.

Explanation

We basically use a vector within a vector within the loop. So the value held by the first variable refers to the vector index of the new variable, which gets value 1. Next, value 2 is passed into the new variable whose index is in c2 and so on.

Tell us what you think!

*Required field. Your comment will show up after approval from a moderator.

THIS TUTORIAL HAS 10 COMMENTS: