13. Try, try again: getting started with some software and coding

In this blog I will share what I have learned from sitting at my desk trying to install and use new software and code. The aim is to help you get started and, more importantly, keep going. First up, an overview. • Installing software and loading data rarely worked perfectly and takes some time. Think […]

11. Jumping through Hadoops

Week one of the course introduced Apache Hadoop software, used for distributed processing of massive unstructured data, week two took us deeper into accessing and operating on big data and took me far, far away from my comfort zone. The least geeky explanations of Hadoop and MapReduce that I could find are here and here […]