In some predictive modelling projects, we may have variables that most of the observations have the same value, while the small percentage rest ones are populated with meaningful values. For example, 90% observations have values=0 but the rest 10% ha…
This page is slow!
We received several comments noting extremley slow loading pages, especially in the search and browse areas. I checked on the pages and the slowness seemed random and impossible to reproduce. Then I remembered that the world just found out that S…
Implementing Gap statistic for clustering number estimation
Gap statistic is a method used to estimate the most possible number of clusters in a partition clustering, noticeablly k-means clustering. This measurement was originated by Trevor Hastie, Robert Tibshirani, and Guenther Walther, all from Standford U…
R AnalyticFlow
R AnalyticFlow seems to be a nice tool to have a good overview over the analysis. The same kind of mode is available in Orange and SAS Enterprise Guide. I have not tried it yet, though. Does anyone have any experiences with it?Update on 2010-07-31: the…
Something to be excited about
You have probably heard the news by now. SAS is ranked as the #1 best place to work by Fortune Magazine. You can read more about why SAS is ranked #1.
But here’s my story
I have worked at SAS for a long time. Some of my best friends work here…