In SPSS, a VECTOR is a list of (new or existing) variables that can be referenced by their indices in this list. VECTOR is often combined with LOOP.

Creating Dummy Variables with VECTOR and LOOP## SPSS Vector - Basic Example

Suppose we'd like to Creating Dummy Variables in SPSS for a variable holding values 1 through 4. We'll call the four dummy variables d1 through d4. Now runningVECTOR d(4).first creates these new (empty) variables. As long as the VECTOR is in effect, d1 can be addressed by d(1) and so on. Let's first run the syntax below to verify this.

## SPSS Vector Syntax Example

***1. Create mini dataset.**

data list free/original.

begin data

1 2 3 4

end data.

***2. Define vector for new variables d1 to d4.**

vector d(4).

***3. As long as the vector exists, d1 can be addressed as d(1) and so on.**

compute d(1) = original = 1.

compute d(2) = original = 2.

compute d(3) = original = 3.

compute d(4) = original = 4.

exe.

## SPSS Vector with Loop

Our first VECTOR syntax didn't save us any effort. So why use VECTOR here? The point is that *being able to address variables by their indices enables us to LOOP over them*. The syntax below, building upon its predecessor, demonstrates the simplest possible example of this. It deletes the new variables and then recreates them in a more efficient way. Note that we use a scratch variable as our loop index.This COMPUTE command may strike you as odd. It's explained in Compute A = B = C.

## SPSS Vector with Loop Syntax

***1. Delete new variables.**

delete variables d1 to d4.

***2. Vector.**

vector d(4).

***3. Use vector with loop.**

loop #i = 1 to 4.

compute d(#i) = original = #i.

end loop.

exe.

## Fastest Dummification

A little known use of VECTOR is addressing variables by a non constant (over cases). So if we have a variable original,COMPUTE d(original) = 1.generates different COMPUTE commands for different cases. So for a case holding 3 on original, it implies COMPUTE d3 = 1. The other new variables, d1, d2 and d4 are not affected (and thus hold only system missing values but we'll fix that with RECODE).

The syntax below may be the fastest way to create unlabelled dummy variables. However, since it's highly recommended to label new variables and their values, we recommend our Create Dummy Variables tool for practical purposes.

## SPSS Vector Syntax Example

***1. Delete new variables.**

delete variables d1 to d4.

***2. Vector.**

vector d(4).

***3. Use non constant over cases (variable "original") in vector.**

compute d(original) = 1.

exe.

***4. Correct system missings in new variables.**

recode d1 to d4 (sysmis = 0).

exe.

## SPSS Vector of Existing Variables

Thus far we used VECTOR for creating new variables. Alternatively, we can use it for addressing existing variables with a slightly different syntax: for addressing the (existing) variables d1 through d4, we'll useVECTOR d = d1 TO d4.We can use this for reversing the aforementioned dummification. The next syntax example demonstrates this by looping over an IF command.

## SPSS Vector Syntax Example

***1. Vector of existing rather than new variables.**

vector d = d1 to d4.

***2. Reconstruct multinomial variable from dummy variables.**

loop #v = 1 to 4.

if d(#v) = 1 reconstructed = #v.

end loop.

exe.

## SPSS Vector - Final Notes

- Several vectors can be defined at once in a single VECTOR command if desired.
- Those familiar with DO REPEAT will notice that the combination of VECTOR and LOOP offers reasonably similar functionality. A comparison is in place but beyond the scope of this tutorial.
- As a minor technical point, any VECTOR definition stays in effect only until any transformations are run.

# SPSS Vector Bonus Examples

## 1. Shift Values Forward

*“I have missing values in my data. I'd like to shift the valid values forward within cases so they become adjacent. How can I accomplish that?*

## SPSS Vector Syntax Example

***1. Create dataset.**

data list free/x1 to x5.

begin data

'' '' '' 0 1 1 0 '' 0 1 '' '' 0 '' 1 '' 0 1 0 '' 1 1 1 '' '' '' '' 1 '' '' '' '' 0 0 1 1 1 '' '' 1 '' 1 '' 0 '' '' 1 1 '' 0

end data.

***2. Shift values forward.**

compute #new = 1.

vector x = x1 to x5 / v(5).

loop #old = 1 to 5.

if not(sysmis(x(#old))) v(#new) = x(#old).

if not(sysmis(x(#old))) #new = #new + 1.

end loop.

exe.

## Explanation

We'll use one vector for the existing variables and one for new variables. We'll loop through the old variables and every time a valid value is encountered, #new increases by 1. Like so, v(#new) may refer to different variables for different cases. However, it always refers to the first new variable that doesn't have a valid value yet. It's this variable that'll take the next valid value we encounter on the old variables.

## 2. Unrank Data

*“I asked respondents to rank 5 products. The first variable contains their first choice, the second variable their second and so on. However, I'd like to have a variable per product that has a 1 if it was the first choice, a 2 if it was the second product and so on.”*

## SPSS Vector Syntax Example

***1. Create rank data.**

data list free/c1 to c5. /*c1 is the first choice, c2 the second and so on.

begin data.

3 4 2 5 1 2 4 5 1 3 1 3 4 5 2 2 5 3 4 1

end data.

***2. Unrank data.**

vector o(5)/old = c1 to c5./*o1 is the first option chosen, o2 the second and so on.

loop #value = 1 to 7.

compute o(old(#value)) = #value.

end loop.

exe.

## Explanation

We basically use a vector within a vector within the loop. So the value held by the first variable refers to the vector index of the new variable, which gets value 1. Next, value 2 is passed into the new variable whose index is in c2 and so on.

## THIS TUTORIAL HAS 10 COMMENTS:

## By Tam on January 5th, 2016

Good!!

## By Björn on March 25th, 2016

Not very good according to me

## By marian on April 16th, 2019

It is possible to define the vector length using a scratch variable previously defined?

e.g.:

compute #c=5.

vector v(#c).

## By Ruben Geert van den Berg on April 16th, 2019

Nope. This VECTOR specification only takes a positive integer, not a (scratch) variable to define its length.

## By Jerry Lee on December 21st, 2020

I have several variables that all need to have the same operation performed on them but they are not adjacent to each other in the file. For example, if the variables were D1, D2, D3, D4 but I want to preform the operation on just D1, D2 and D4. As I understand Vector the variables have to be adjacent I can do

Vector X= D1 to D4 but not

Vector X= D1 to D2, D4

Is there someway to perform loop type operations on non-adjacent variables?