Tag: SAS

SAS Download Manager in Console Mode

SAS® 9.4 M3 is now available (as discussed by Andy Ratcliffe in his recent NOTE: blog post), so I’m downloading a new SAS depot using the SAS Download Manager. I’m downloading it onto a Linux server that happens to have X11 available, but I’m choosing to use console (text) mode, rather than the default X11 […]

NOTE: SAS v9.4 M3 is available

Further to my post on flavours of SAS v9.4 (indeed, flavours of SAS v9.4 m2), this week sees the release of SAS v9.4 m3 (otherwise known as 15w29). I’ve not had a chance to use it yet(!), but the documented features that caught my eye include:The …

It’s Not Too Late to Volunteer for the DataDive

I wrote in a post in June about the good that DataKind does by using teams of volunteers with data science knowledge. It’s not too late to join the DataDive this coming weekend in London (July 17-19). If you’d like to contribute your time and knowledge…

Hadoop is the New Black

It feels like any SAS-related project in 2015 not using Hadoop is simply not ambitious enough. The key question seems to be “how big should our Hadoop cluster be” rather than “do we need a Hadoop cluster”.

Of course, I’m exaggerating, not every project needs to use Hadoop, but there is an element of new thinking required when you consider what data sources are available to your next project and what value would they add to your end goal. Internal and external data sources are easier to acquire, and volume is less and less of an issue (or, stated another way, you can realistically aim to acquire large and larger data sources if they will add value to your enterprise).

Whilst SAS is busy moving clients from PC to web, there’s a lot of work being done by SAS to move the capabilities of the SAS server inside of Hadoop. And that’s to minimise “data miles” by moving the code to the data rather than vice-versa. It surely won’t be long before we see SAS Grid and LASR running inside of Hadoop. It’s almost like Hadoop has become a new operating system on which all of our server-side capabilities must be available.

We tend to think of Hadoop as being a central destination for data but it doesn’t always start its presence in an organisation in that way. Hadoop may enter an organisation for a specific use case, but data attracts data, and so once in the door Hadoop tends to become a centre of gravity. This effect is caused in no small part by the appeal of big data being not just about the data size, but the agility it brings to an organisation.

SAS’s Senior Director of the EMEA and AP Analytical Platform Centre of Excellence, Mark Torr (that’s one heck of a title Mark!) recently wrote a well-founded article on the four levels of Hadoop adoption maturity based upon his experiences with many SAS customers. His experiences chime with my far more limited observations. Mark lists the four levels as:

  1. Monitoring – enterprises that don’t yet see a use for Hadoop within their organisation, or are focused on other priorities
  2. Investigating – those at this level have no clear, focused use for Hadoop but they are open to the idea that it could bring value and hence they are experimenting to see where and how it can deliver benefit(s)
  3. Implementing – the first one or two Hadoop projects are the riskiest because there’s little or no in-house experience, and maybe even some negative political undercurrents too. As Mark notes, the exit from Investigating into Implementing often marks the point where enterprises choose to move from the Apache distribution to a commercial distribution that offers more industrial-strength capabilities such as Hortonworks, Cloudera or MapR
  4. Established – At this level, Hadoop has become a strategic architectural tool for organisations and, given the relative immaturity of Hadoop, the organisations are working with their vendors to influence development towards full production-strength capabilities
Hadoop is (or will be) a journey for all of us. Many organisations are just starting to kick the tyres. Of those who are using Hadoop, most are in the early stages of this process in level 2, with a few front-runners living at level 3. Those organisations at leve 3 are typically big enough to face and invest in solutions to the challenges that the vendors haven’t yet stepped up to, such as managing provenance, data discovery and fine-grained security.

Does anybody live the dream fully yet? Arguably, yes, the internal infrastructures developed at Google and Facebook certainly provide their developers with the advantages and agility of the data lake dream. For most us, we must be content to continue our journey…


Follow me on Twitter: @aratcliffeuk

Summer of Coding

I’m always keen to encourage an awareness and uptake of coding in my kids. I think that coding brings a lot more than the simple ability to write programs. Coding requires a set of disciplines and an approach that are of great benefit in all walks of life.

As the summer holidays are upon us, with weeks upon weeks for kids to idle away their time, now is a good moment to revisit some of the online opportunities to give kids an insight into the joys of coding.

I’ve previously mentioned Scratch and App Inventor 2 (AI2) as two very accessible means for getting kids (and adults!) started, and producing a useful app that they can share with their friends very quickly. Both sites are free and use a clever building blocks interface to allow budding programmers to quickly understand the requirements of syntax. Scratch builds web-based apps and AI2 builds apps for Android devices (phones and tablets) with surprisingly powerful blocks for accessing web-based resources.

Scratch has always encouraged its users to share their work. Earlier this year App Inventor added its own gallery for showing and sharing.

Whilst it’s not free, I’ve heard good things about Tynker. Tynker also takes the building blocks approach to syntax, and offers structured courses to help guide its students to exciting results.

Another means of getting your kids inspired is Lightbot. This is a series of programming-related puzzles featuring a cute robot character in a games app – available for Apple iOS, Android and other platforms. Great fun, and challenging too when you get to some of the higher levels.

As technology becomes more pervasive, traditional trades disappear, and the world of work becomes more globalised, the skills that newer members of the workforce need are changing: problem solving, team working, and communication are but three “21st century skills”. Digital literacy (ability to find and use internet-based resources and information) and creativity— and the latter’s close relative, entrepreneurship—are close behind. And, the young have become more comfortable learning on their own, especially on topics of interest. They just need to be pointed in the right direction!


Follow me on Twitter: @aratcliffeuk

More Flash in Chrome for Less Power … and the HTML5 Migration

If you use one of SAS’s web interfaces you’ll be a great fan of the flexibility and usability of the user interface. And those capabilities are probably provided by Adobe Flash. Your browser is running the Flash plug-in.

But Flash has one or two downsides, principally its tendency to use lots of CPU cycles which in-turn uses lots of battery power. Not a problem maybe if you’re hooked to the mains, but not good on a laptop of mobile phone/tablet.

If you use the Chrome browser you’ll be pleased to hear that Google are improving Chrome’s power consumption when Flash is running. When you’re on a webpage that runs Flash, Chrome will intelligently pause content, e.g. Flash animations, that aren’t central to the webpage, while keeping central content (like a video) playing without interruption. If Chrome accidentally pause something you were interested in, you can just click it to resume playback. This update significantly reduces power consumption, allowing us to do analytics on-the-go for longer before having to hunt for a power outlet.

This feature was enabled by default on Chrome’s desktop Beta channel in June, and will be rolling out soon to everyone else on Chrome desktop.

Looking longer-term, SAS are replacing their use of Flash with HTML5. Whilst the use of Flash requires a plug-in from Adobe, HTML5 is supported by all modern browsers out-of-the-box, with no need for any plug-in. The majority of web sites and vendors are migrating to HTML5 due to its net neutrality and power-consumption benefits. SAS Studio already uses HTML5; Visual Analytics and Visual Statistics currently use Flash. We can expect a migration to HTML5, perhaps starting with the VA hub this summer, which will probably be complete next year.


Follow me on Twitter: @aratcliffeuk

SAS Combinatorial Functions: Snippets

1. Permutation and Combination data _null_;     n = 5;     r = 2;    *the factorial of a number;    fact=fact(n);    *for positive integers, fact(n) = gamma(n+1);    gamm=gamma(n + 1);    *C(n,r): number of combinations of n objects selected r ;    *n! / [r!(n-r)!];    comb1 = comb(n,r);    comb2 = […]

The (Mostly True) History of Computing

I’m half way through a book that I simply must recommend to you before I even finish it. If you have a sense of humour or  if you have half an interest in the earliest evolution of computers, you will enjoy The Thrilling Adventures of Lovelace and Babbage by Sydney Padua as much as I am doing.

On the face of it, it is a graphic novel that accurately describes the Victorian 1830s relationship between the eccentric polymath Charles Babbage and his accomplice, Ada, Countess of Lovelace (the daughter of famed poet Lord Byron). When Ada translated her friend Babbage’s plans for the “Difference Engine,” her lengthy footnotes contained the first appearance of the general computing theory—one hundred years before an actual computer was built. Whilst Padua’s cartoon telling of the story is thickly laced with humour, her copious footnotes provide an unexpected level of detail to the story.

So far so good, but the book really gets into its stride when it moves into a parallel universe. In the real world, Lovelace died of cancer soon after her publication, and Babbage never built any of his machines. In Padua’s parallel universe, Lovelace survives, Babbage does build his Difference Engine, Lovelace smokes a pipe, and they both get into many madcap adventures where their analytical minds and the Difference Engine can save the day.

Yes, it is all a bit surreal! But I’m loving it, and I think it provides a good read for people of all ages. It may just even encourage one or two non-technologists to turn their imagination to computing.

Available from all good book stores including Amazon UK and Amazon USA. I bought my copy as a Kindle edition and I’ve found it very readable on my 7″ Nexus tablet (be sure to read the pop-up instructions in Kindle which tell you how to scroll one panel at a time through the graphics).


Follow me on Twitter: @aratcliffeuk