SAS Missing Values: Everything You Need to Know

SAS Missing Values: Everything You Need to Know

  • Post author:
  • Post category:Base SAS
  • Post comments:2 Comments
  • Reading time:14 mins read

Working with missing values in SAS is one of the most common tasks for a SAS programmer. There are many techniques and tools associated with using missing values. Knowing these techniques will help you work more efficiently using values.

The two fundamental missing value types are numeric and character missing values. A character missing values are represented using a blank (” “) while missing numeric values are represented with a dot ( . ).

Note: For numeric variables, the dot (.) should not be quoted in an assignment statement for the variable. This will create a new character variable or character to numeric conversion.

/*Character variable*/
age='.'
/*Numeric missing value*/
age= .;

Numeric missing values are essentially minus infinity and are smaller than any non-missing value.

If you perform any arithmetic operations on a missing value, it will result in a missing value. However, missing values will be ignored during calculations performed using numeric functions such as SUM, MEAN, etc.

Special Missing Values in SAS

A period, or dot, is commonly used to represent a missing value for a numerical variable. Apart from the dot, SAS can store 27 special missing values in numerical variables.

They are the dot-underscore (._), and dot-letter (.A through.Z). Note that these special values are case insensitive. That is, .A=.a .B=.b .C=.c etc.

SAS identifies it as a variable name if you do not begin a special numeric missing value with a period. 

Therefore, to use a special numeric missing value in a SAS expression or assignment statement, you must begin the value with a period, followed by the letter or underscore, as in the following example:

x=.d;

While printing the special missing value, SAS only prints the letter. When data values contain characters in numeric fields that you want SAS to interpret as special missing values, use the MISSING statement to specify those characters.

Order of Missing Values

The numeric missing value dot (.) is sorted first, followed by an underscore (_), then the special numeric missing value .A, and then the special missing value .Z.

Order of Missing Values
Sort order Symbol Description
smallest _ underscore
  . period
  A-Z special missing values A (smallest) through Z (largest)
  -n negative numbers
  0 zero
largest +n positive numbers

Detecting Missing Values

There are several techniques and functions available in SAS to detect missing values. See the example input datasets below that have missing numeric, special and character values.

data test;
	input a b c d $;
	infile datalines truncover;
	datalines;
1 2 3 A
. 4 . 
3 4 .a c
. . . 
4 6 8 E
.s .v .z F
5 1 .f 
;
run;
input data
A sample dataset with missing values

To check for Numeric Missing values, you can use the If statement below.

if a=. then put "Missing";

To check for all 28 numeric missing values (.  , ._ , .A through .Z) including numeric missing values, use the following code.

If a <=.z then PUT "Missing";

To check for character Missing values, you can use the If statement below.

if a= ' ' then put "Missing"

MISSING= System Option

With the MISSING=, you can specify the character to print for missing numeric values. You can specify only one character you want to replace with the default missing values in SAS. Single or double quotation marks are optional. The MISSING= system option does not apply to special missing values such as .A and .Z
In the below example, we have specified M instead of the default dot(.).

options missing='M;
data test3;
set test;
run
missing option

If you replace the default missing value, you can still use .(dot) and ‘ ‘to filter or perform any operations on missing values.

Functions that handle MISSING values

Missing function

It accepts either a character or numeric variable as the argument and returns one if the argument contains a missing value; else, it returns zero. The Missing function detects numeric, character, and even special missing values.

data test2;
    set test;
    missing=missing(a);
run;
SAS Missing Values: Everything You Need to Know

NMISS Function in SAS

It returns the number of missing values in the specified list of numeric variables. Character values will be converted automatically if the argument value is missing.

data test2;
	set test;
	nmiss=nmiss(a, b, c);
run;
nmiss

CMISS Function in SAS

It counts the number of missing arguments in its argument list and works for the character and numeric variables without requiring character values to be converted to numeric.

data test2;
    set test;
    nmiss=nmiss(a,b,c);
    cmiss=cmiss(a,b,c);
run;
cmiss

N function

This function returns the number of non-missing values in a list of numeric variables.

data test2;
    set test;
    nmiss=nmiss(a,b,c);
    cmiss=cmiss(a,b,c);
    non_missing = n(a,b,c);
run;
non-missing

COALESCE Function

This function selects the first non-missing value in a list of variables.

data test2;
    set test;
    nmiss=nmiss(a,b,c);
    cmiss=cmiss(a,b,c);
    non_missing = n(a,b,c);
    first_non_miss = coalesce(a,b,c);
run;
coalesce

CALL MISSING

With this function, you can explicitly initialise or set a variable value to be missing.

data test3;
set test;
if _n_ = 4 then call missing(a,b,c,d);
run;
Call Missing

Missing values in SAS procedures

PROC FREQIn PROC FREQ, percentages are calculated, excluding the missing values. If you need to include the total observations that are both missing and non-missing, use the “/MISSING” option on the table statement.

proc freq data= test;
tables a / MISSING;
run;
Proc Freq

PROC MEANS: The PROC MEANS procedure only generates statistical data on non-missing values. Use the NMISS option to calculate the number of missing values.

Proc Means Data = test N NMISS;
Var a -- c ;
Run;
proc means

Use the MISSING option in PROC MEANS to see the number of observations having a missing value for the classification variable.

data class;
set sashelp.class;
if age < 14 then call missing(age);
run;

Proc Means data = class N NMISS MISSING;
Class Age ;
Var age -- weight;
Run;
Proc Means Missing

Deleting Missing Values

Once you have found the missing values, you may want to remove them as a part of your data cleaning tasks.

How to delete numeric values?

data test3;
set test;
if a = . then delete;
run;
Delete Missing Values

If you want to remove ALL rows with ANY missing values, you can use the NMISS functions below.

data test3;
set test;
if nmiss(of _numeric_) > 0 then delete;
run;
Delete all Missing

How to delete character values?

data test3;
set test;
if d= '' then delete;
run;
Delete Missing Character Values

To delete all character values, you can use the below codes.

if cmiss(of _character_) > 0 then delete;

You can use the code below to delete all character and numeric values.

if cmiss(of _all_) > 0 then delete;

Also Read: Count missing and non-missing values for each variable in a SAS data set.

Every week we'll send you SAS tips and in-depth tutorials

JOIN OUR COMMUNITY OF SAS Programmers!

Subhro

Subhro provides valuable and informative content on SAS, offering a comprehensive understanding of SAS concepts. We have been creating SAS tutorials since 2019, and 9to5sas has become one of the leading free SAS resources available on the internet.

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

This Post Has 2 Comments

  1. Arlen Vandel

    Appreciate spreading these details within this astounding website. i’ll share this information in my small zynga be the reason for my local freinds <a href=’http://bit.ly/2lLC8VB’>comanda mancare Chitila</a>.

    1. Subhro

      Thanks