I didn’t blog for a while in this first half March and there are bunches of new stuff to catch up: I had a new baby! He was delivered on time (and on budget!), lions tigers and bears, oh my… His brother is Tiger so I named him, Leo. And I got the latest SAS […]
Tag: Industry Review
Github for Clinical/Statistical Programmers
PhUSE-FDA Working Group 5 (Development of Standard Scripts for Analysis and Programming) just adopted Google Code as collaborative programming platform. Google Code is one of the most popular and respected open source software hosting sites in the world and it is definitely a good choice for PhUSE-FDA WG5. But after viewing one of WG5’s working […]
Weekend Clips: Data Scientist Episode II
1. What is A Data Scientist Anyway?
2. You Just Can’t Be Replaced by Yourselves!
3. We are all Data Scientists!
Weekend Clip: Data Scientist
Two tweets:
One blog post:
Statisticians aren’t the problem for data science. The real problem is too many posers
One job advertisement:
SAS Data Scientist ?(!)
A joke:
The biggest joke about data scientist is that the Google query “data scientis…
Is There Any Better Way? Publishing Process For CDISC Standards
1. The Pain I read from Lex Jansen (@LexJansen) that CDISC SDTM v1.3 and SDTMIG v3.1.3 were newly released. It’s pretty nice since CDISC SDTM was supposed to be released semiannually in the new publishing cycle. We can see the team put great efforts on this new version, but frankly speaking, this delivery (the […]
Blogging is Awesome: CDISC Bloggers
I remember when blogging was cool. Before the specializing and monetizing and Twitter-izing. —Peter Dewolf Well I think blogging is still cool (and awesome and awesome …). The most appealing personal reason is, blog posts are Google searchable and suitable for archive while Tweets NOT. Admittedly I hold some sort of Existentialism 2.0: if […]
An Analytical Valley: Big Data and Data Scientists (and SAS Programmers)
Tom Davenport reported an observation that Silicon Valley is becoming more analytical since companies in the Valley such as Google, Facebook, eBay, LinkedLn all have strong presences in analytics. Besides such predominant companies, I’d also like to add Yahoo to the list although Yahoo is no longer in its peak. Yahoo is the largest sponsor […]
Decision Trees in SAS Enterprise Miner and SPSS Clementine
Decision trees are included in SAS Enterprise Miner(EM). The counterpart is SPSS Clementine, which should be called IBM SPSS Modeler for precision after IBM’s acquisition of SPSS.
Recently I read a paper on the comparisons of SAS EM, SPSS Clementine and IBM Intelligent Miner on their decision tree and cluster technology:
Decision Tree Induction & Clustering Techniques […]