In this video, M2010 Data Mining Conference keynote speaker, Dr. Simon Sheather compares classification trees and logistic regression and discusses situations in which logistic regression is expected to outperform classification trees and vice ver…
New Friends and A Growing Community: The CSUG Experience
How do I sum up the experiences I’ve just had succinctly? Well, those of you who are regular readers know that I am rarely succinct. So I suppose I’ll just let it all out, as I normally do.I’m sitting at the Calgary airport waiting to board a flight …
Play Matrix within SAS(1): basic files processing
Recently I read Rick Wicklin’s IML blog with great interests(and anticipation for his fore-coming IML book, Statistical Programming with SAS/IML Software). SAS programmers have the following programming tools to facilitate their daily work:
SAS data step: the basic SAS; a generation IV programming language, similar with other procedural languages such as C.
SAS Proc SQL: SAS’s […]
Summary Numerical Data in a Rolling Window
Obtain summary statistics over a rolling window for a given data, usually on a time dimension, is not quit easy in SAS, especially the rolling window may contain different number of records and the maximum number is unknown without pass the data once…
Create format from dataset
data fmt(keep=fmtname start label);      set temp;      fmtname=”TRTFMT”;      start=trtgrp;      label=trtlbl;run;proc format library=work cntlin=fmt;run;=================TEMP:trtgrp      trt…
NOTE: DATA Step’s _N_ is PROC SQL’s Monotonic
I’ve made a few posts over the last few months about SQL (particularly its use within Enterprise Guide). One of my unstated issues with SQL is the inability to do the equivalent of DATA step’s _N_ to count unique rows. The ability to add a sequence num…