If you’re interested in using machine learning (ML) on image and video datasets, then you might want to have a look at a relatively new project called OpenFace (first released in October 2015), authored by Brandon Amos, Bartosz Ludwiczuk and Mahadev Satyanarayanan. TL;DR: For the impatient Pitch me: Open source project (aka free for you … More Check out OpenFace
If you are using AWS to host your applications, you have probably heard that IAM Roles can also be applied to EC2 instances. In a lot of cases this can be a really cool way to avoid passing AWS credentials to your applications, and to spare yourself the pain of managing key distribution among servers, as … More Spark – Redshift: AWS Roles to the rescue
Distcp (distributed copy) is a fairly old tool used to move large quantities of files, usually within HDFS. It does so with a MapReduce job in which the source file list is split among map tasks that do the copy heavy lifting. Another useful integration is that it can also deal with file migrations between HDFS and AWS … More Distcp S3-hdfs for eu-central-1 (AWS Frankfurt) – details “behind the trenches”
Update: this tutorial has been updated mainly up to Spark 1.6.2 (with a minor detail regarding Spark 2.0), which is not the most recent version of Spark at the time of this update. Nonetheless, for the operations shown here you can pretty much rest assured that the API has not changed substantially. I will try to … More Getting started with the Spark (part 2) – SparkSQL
This is part of a series of introductory posts about Spark, meant to help beginners get started with it. Hope it helps! So what’s that funky business people call Spark? Essentially, Apache Spark is a framework for distributing parallel computational (inherently iterative) work across many nodes in a cluster of servers, maintaining high performance and … More Getting started with Spark in Python/Scala
In an effort to come down to earth and cover a topic that can be useful for the majority of today’s enterprises that have HP gear, I will cover resilience features one can/should use in an HP Networking environment along with IRF. Though I don’t argue that this is a Best Practice for all cases, … More Resilience measures with HP IRF: ISSU, GR and MAD (Part I)
A colleague just sent me this really interesting talk with the Magic Networking Nerd Pack (slash some of the best networking bloggers nowadays) discussing Brocade VCS versus TRILL.