SAS EG 5.1 – Data Exploration Feature

This post was kindly contributed by Business Intelligence Notes for SAS® BI Users - go there to comment and to read the full post.

I really dig the SAS Enterprise Guide 5.1 – Data Explorer Tool.  How many times have you received a new data set that you know nothing about? While it’s not like it takes forever to run some PROC FREQ/MEANS – but what if it’s already done for you or can be done quickly?  Did that get your attention?

Here’s how to use this new feature with your data.  

Open Data Exploration from File Menu

From the File menu, select Open > Data Exploration.  Then navigate to the dataset you want to open.  I am using SASHELP.SHOES since many of you may already be familiar with it.

The dataset is open and a new window appears to the right side of the dataset. At the bottom of the window, you can view some general information about the dataset (i.e. 395 rows and 7 columns).

EG51 data explorer 1 1024x530 SAS EG 5.1   Data Exploration Feature

All the Statistics You Can Handle

This is what I really like – I can run all statistics for all variables at once.  From the new window, click the Charts icon and then Click to run all statistics. Notice when I selected the Charts icon, the window changed to list each variable type (character, numeric, currency) with the count. After a few moments, you will see each variable now has statistics and chart icons. The Product variable for instance shows the Distinct values and the total count of non missing values.  When you click the icon you can see an enlarged view of the chart.  Pretty cool?EG51 data explorer 02 SAS EG 5.1   Data Exploration Feature  

 

Tailor Your Statistics

You can customize what information is shown for all statistics. Click the Setting icon and the Select Statistics window appears.  Select what you want to display – lots of choices!!!  For instance, for Date/Time you may not find the Average that interesting.

EG51 data explorer 04 SAS EG 5.1   Data Exploration Feature

Get Column Statistics

You can also get statistics for the individual columns by clicking on them.  In the screenshot – you can see how the differences for the character and numeric values.  For the Product column (shown with the 1), I selected the Distinct Values choice. I can easily see the result.  This is similar to the Excel Auto Filter feature. With the numeric variable, Sales (shown with a 2), you can see some statistics, such as the mean, mode, and lower CLM.

EG51 data explorer 03 SAS EG 5.1   Data Exploration Feature

 

Other Thoughts

I’m running SAS EG 5.1 in a virtual environment on a fast machine with small datasets. The statistics were returned almost immediately. I did see some options where you can further filter results and control how many observations are considered.  So someone else will have to tell me if the larger datasets cause issues with this feature.

What do you think?

This post was kindly contributed by Business Intelligence Notes for SAS® BI Users - go there to comment and to read the full post.