Principal Component Analysis
If where and , and let , then where is the ith principal component and
Factor Analysis
If where , then and the common factors f are independent of .
For the two methods, the proportion of variance of explained…
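For the principal component side, the proportion of variance explained can be illustrated with a small numerical sketch. It assumes the components are taken from the eigendecomposition of the sample covariance matrix; the function name `pca_explained_variance` and the toy data in the usage note are made up for illustration and are not from the original post.

```python
import numpy as np


def pca_explained_variance(X):
    """Return (scores, eigenvalues, proportion of variance) for data matrix X.

    Assumption: rows are observations, columns are variables, and the
    components come from the sample covariance matrix.
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                 # center each variable
    cov = np.cov(Xc, rowvar=False)          # sample covariance matrix
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigh: for symmetric matrices
    order = np.argsort(eigvals)[::-1]       # largest eigenvalue first
    eigvals = eigvals[order]
    eigvecs = eigvecs[:, order]
    scores = Xc @ eigvecs                   # principal component scores
    proportion = eigvals / eigvals.sum()    # share of total variance
    return scores, eigvals, proportion
```

Calling `pca_explained_variance(X)` on any numeric matrix with at least two columns returns proportions that sum to 1, and the sample variance of the ith score column equals the ith eigenvalue.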
I’m sure that most of you have seen the really cool interactive Baby Name Voyager, right? If not, go have a look and try it out (be sure to plug in your name, and the names in your family, and so on). That’s a pretty cool application! … But it m…
1. What is A Data Scientist Anyway?
2. You Just Can’t Be Replaced by Yourselves!
3. We are all Data Scientists!
Kaggle is hosting an educational data mining competition: Kaggle Digit Recognizer, using MNIST data. Handwritten digit recognition is one of the few applications where the kNN classifier performs well. Of course, the benchmark kNN classifier provided …
Two tweets:
One blog post:
Statisticians aren’t the problem for data science. The real problem is too many posers
One job advertisement:
SAS Data Scientist ?(!)
A joke:
The biggest joke about data scientist is that the Google query “data scientis…
kNN stands for k Nearest Neighbor. In data mining and predictive modeling, it refers to a memory-based (or instance-based) algorithm for classification and regression problems. It is a widely used algorithm with many successful applications in medi…
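To make the "memory-based" idea concrete, here is a minimal sketch of kNN classification. It assumes Euclidean distance and a simple majority vote, which is one common setup rather than necessarily the one used in the post; the function name `knn_predict` is hypothetical.

```python
from collections import Counter

import numpy as np


def knn_predict(X_train, y_train, x_query, k=3):
    """Classify x_query by majority vote among its k nearest training points.

    Assumptions: Euclidean distance, unweighted votes. There is no training
    step; the "model" is just the stored training data.
    """
    X_train = np.asarray(X_train, dtype=float)
    x_query = np.asarray(x_query, dtype=float)
    dists = np.linalg.norm(X_train - x_query, axis=1)  # distance to every stored point
    nearest = np.argsort(dists)[:k]                    # indices of the k closest points
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]
```

For regression, the same neighbor search applies, with the vote replaced by the average of the neighbors' target values.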
On the 10th of April, 1912, the RMS Titanic set out on its maiden voyage across the Atlantic Ocean carrying 2,223 passengers. On the 14th of April, it hit an iceberg and sank. There were 1,517 fatalities. Identifying information was not available for…
John F. Elder IV, Ph.D., is President of Elder Research Inc. (ERI), a data mining consulting team. He has authored innovative data mining tools, is a frequent keynote speaker and was co-chair of the 2009 Knowledge Discovery and Data Mining conferenc…