Tag: data mining

SAS analysis of the #1 most popular baby names

I’m sure that most of you have seen the really cool interactive Baby Name Voyager, right?  If not, go have a look and try it out (be sure to plug in your name, and the names in your family, and so on). That’s a pretty cool application!  … But it m…

Kaggle Digit Recoginizer: SAS k-Nearest Neighbor solution

Kaggle is hosting an educational data mining competition: Kaggle Digit Recognizer, using MNIST data. Handwritten digit recognition is one of the few applications that kNN classifier performs well. Of course, the benchmark kNN classifier provided …

Weekend Clip: Data Scientist

Two tweets:

One blog post:
Statisticians aren’t the problem for data science. The real problem is too many posers
One job advertisement:
SAS Data Scientist ?(!)
A joke:
The biggest joke about data scientist is that the Google query “data scientis…

KNN Classification and Regression in SAS

kNN stands for k Nearest Neighbor. In data mining and predictive modeling, it refers to a memory-based (or instance-based) algorithm for classification and regression problems. It is a widely used algorithm with many successfully applications in medi…

Calculate your survival chance on the Titanic

On the 10th of April, 1912, the RMS Titanic set out on its maiden voyage across the Atlantic Ocean carrying 2,223 passengers. On the 14th of April, it hit an iceberg and sank. There were 1,517 fatalities. Identifying information was not available for…

6 questions with data mining expert, John Elder

John F. Elder IV, Ph.D., is President of Elder Research Inc. (ERI), a data mining consulting team.  He has authored innovative data mining tools, is a frequent keynote speaker and was co-chair of the 2009 Knowledge Discovery and Data Mining conferenc…