Support the Blog Author to Bike MS

My dear readers, I’m asking you to support me in Bike MS (Multiple Sclerosis) and I will ride 60 miles to support the campaign in New Bern, NC, September 6 – 7, 2014. As my understanding, one of the purposes of this fundraising, is to let the message spread. MS is a very rare disease. […]

Translate SAS’s sas7bdat format to SQLite and Pandas

Thanks Jared Hobbs’ sas7bdat package, Python can read SAS’s data sets quickly and precisely. And it will be great to have a few extension functions to enhance this package with SQLite and Pandas.
The good things to transfer SAS libraries to SQLite:
  1. Size reduction:
    SAS’s sas7bdat format is verbose. So far successfully loaded 40GB SAS data to SQLite with 85% reduction of disk usage.
  2. Save the cost to buy SAS/ACCESS
    SAS/ACCESS costs around $8,000 a year for a server, while SQLite is accessible for most common softwares.
The good things to transfer SAS data set to Pandas:
  1. Pandas’ powerful Excel interface:
    Write very large Excel file quickly as long as memory can hold data.
  2. Validation of statistics
    Pandas works well with statsmodels and scikit-learn. Easy to validate  SAS’s outputs. 

How I Got Started with Computers

This is not your typical tech story. I trace my life as a programmer back to ninth grade, not to a computer class or even a math class, but to English.  In ninth grade, I was fortunate to have English with Miss Burke who, along with the standard Shakespeare and expository writing, slipped in material […]