PSPP for Beginners: Fundamental Data Analysis Concepts

How to get started with data analysis

A key difference between spreadsheets and statistical software is how data analysis commands are issued. Spreadsheets use formulas entered by the user into cells. An example is =sum(A1:A5) to add the cells from A1 to A5. In constrast, we don't enter formulas in PSPP. The method is to choose the desired analysis from a menu, then choose the data for the analysis.

The analysis menu

PSPP has a drop-down menu called "analysis." This has all of the possible analyses that you can do. Some of the analyses are grouped into related operations, such as descriptive statistics or comparing means. Here is a screenshot.

The analysis menu of PSPP is shown. The compare means subfolder is open with ANOVA being highlighted

The first step in an analysis is to choose an analysis from this menu.

Choosing variables and customization

Selecting the analysis command is only the first step. At this point, PSPP doesn't know which variables you want to use for the analysis. The user must select the variables for the analysis. This is also appoint where details about the analysis can be set.

A dialog box is raised after selecting the analysis. This has a list of variables on the left based upon the variable labels. Users select the desired variables by moving them from the left to a field in the center or right of the dialog box that is often called Variables.

The following example is from the sort data command. Note the list of varialbles in our file on the left and the blank "Sort By:" field.

The dialog boxes often list all the variables on the left. Choose the variable that you want by moving it to the right field.

The desired variables can be chosen in two ways. One way is to click on the variable name on the left, then drag it to the empty variable field on the right. Another method is to highlight the variable that you want, then use the little arrow button between the fields to move the variable.

While all variables might be listed on the left, not all of these might potential candidates for analysis. For example, we cannot calculate a mean from someone's name. PSPP helps the selection process by showing small icons at the left of the variable names. Note how the student name variables are forbidden. The sex variable is categorical (nominal scale), which is represented by a small Venn diagram. The scale variables of exam scores are represented by small ruler icons.This scale feature depends on configuring the variables correcly in the variable view of the data editor. Skipping that step means that you won't have guidance at this step.

To recap, this procedure is used for a wide range of analyses. Select the desired operation from the analysis or data menu. Next, use the dialog box to instruct PSPP about the details: Variables and analysis features.


Home | Start | Variables | Data | Descriptive | Relationship | Inferential | Effect size | Advanced | Video

This work is licensed under a Creative Commons Attribution 4.0 International License that allows sharing, adapting, and remixing.