Getting Started With SPSS Statistics

I am using IBM SPSS Statistics 27 for this tutorial. This post serves two main objectives:

  • 1. Acquainting you with the SPSS Statistics Data Editor
  • 2. Guiding you through the steps of creating a basic SPSS Statistics data file

Data Set Overview

In the Variable View tab, you define characteristics for each variable in your data file. These include the variable’s:

  • Name (no spaces or special characters, can’t start with a number)
  • Type (usually Numeric)
  • Width (default is “8”)
  • Decimals (decimal points in Data View)
  • Label (detailed description not limited like Names)
  • Values (for Value Labels in categorical variables)
  • Missing (numeric codes for missing data)
  • Columns (characters/numbers displayed in Data View)
  • Align (Left, Right, or Center)
  • Measure (Scale, Ordinal, or Nominal, with Scale covering interval and ratio data)
  • Role (predefined roles for sorting in SPSS dialogues, default is Input)

I am going to use a student satisfaction survey that includes:

  • Gender: Male or Female.
  • Age
  • Course of Study: Physiotherapy, Psychology, Speech Therapy, Nursing, Public Health, Occupational Therapy, Biomedical Sciences.
  • How much do they enjoy the course?
  • How easy is the course to get good grades?
  1. This study data is exclusively consists of numerical values. Numbers will be employed to denote various gender and course of study categories. You can utilize labels to offer more comprehensive descriptions of variables than what the Name field allows. If you include a variable label, it will be utilized consistently in your SPSS Statistics output.
  2. Here, you’ll find the definitions for the values (or codes) we employ to depict the levels of each categorical variable. More detailed discussions can be found on the next page.
  3. Occasionally, participants in research may skip or choose not to respond to specific questions, as seen with our second participant who opted not to disclose their age. Alternatively, recording equipment may malfunction, leading to some data becoming unintelligible. In such situations, we can utilize Missing Values Codes, which are explained in greater detail on the following page. Simply tap the Measure cell and choose the right measurement scale from the drop-down menu.

Value Labels

In SPSS Statistics, numeric values are commonly used to signify different levels of categorical variables such as gender or course of study. These codes are defined in the Value Labels dialogue. To access it, select a Values cell in the Variable View, then click . Enter the number for the first category and its name, then click Add. Repeat for other categories and click OK to close the dialogue. In this example, values “1” through “7” represent the seven courses our student participants are enrolled in.

Missing Values

In research, encountering missing data is inevitable due to various reasons like participants skipping questions or equipment failures. To address this, Missing Values codes are used to signify such instances. One approach is using a single numeric code for all types of missing data, or up to three unique codes to differentiate reasons (e.g., refusal to answer versus equipment failure). Any numeric code outside the actual data range can be used, and “9” is chosen here to avoid confusion with real data, ensuring it doesn’t overlap with participant responses or variables like gender in a 5-point rating scale.

Appendix

Data Set Summary

Strongly Disagree1
Strongly Agree5
Scale used in this study for Question 4 and 5
IDQuestion 1Question 2 Question 3 Question 4Question 5
1Male17Speech Therapy55
2MaleN/APhysiotherapy43
3Female18Occupational Therapy54
4Male17Physio44
5Female20Physiotherapy44
6Female21Psychology33
7Female19Speech Therapy32
8Male18Speech Therapy21
9Male19Nursing33
10Female21Public Health54
11Female24Occupational Therapy43
12Female17Occupational Therapy11
13Female38Nursing33
14Male19Occupational Therapy45
15Female18Occupational Therapy54
Summary of entries