I investigate how fast Spark 2.2 can query 1.1 billion taxi journeys using a cluster of three Raspberry Pis.
I demonstrate how to extract analytical data from the petabytes' worth of web pages collected by Common Crawl.
I investigate how fast an 11-node Spark 2.1.0 cluster can query over a billion records.
I investigate how fast a small AWS EMR cluster can query over a billion records using Spark.
An end-to-end guide to building a film recommendation engine.
Copyright © 2014 - 2021 Mark Litwintschik. This site's template is based on a template by Giulio Fidente.