Thu 18 April 2024
Umbra's Open Satellite Feed
Mon 01 April 2024
Global EV Charging Points with Open Charge Map
Sat 23 March 2024
Aircraft Route Analysis
Tue 19 March 2024
1.1 Billion Taxi Rides using ClickHouse on Intel's Core i9-14900K
Fri 15 March 2024
1.1 Billion Taxi Rides using DuckDB
Mon 22 January 2024
Tokyo Walking Tour Guide
Thu 30 November 2023
Extracting OSM Features
Tue 21 November 2023
Global Flight Tracking
Fri 17 November 2023
Mapping Estonia with LiDAR
Wed 15 November 2023
Natural Earth's Global Geospatial Datasets
Sun 12 November 2023
Maxar's Open Satellite Feed
Sun 05 November 2023
Overture's Global Geospatial Datasets
Mon 30 October 2023
A Review of Esri's Imagery in Action MOOC
Sat 28 October 2023
Versatile Video Coding
Wed 11 October 2023
Segmenting Satellite Images
Thu 05 October 2023
A Review of Esri's Spatial Data Science MOOC
Sun 17 September 2023
Enhancing ClickHouse's Geospatial Support
Fri 08 September 2023
Asking a Large Language Model How YouTube Works
Fri 01 September 2023
Geospatial Clustering with Uber's H3 in DuckDB & QGIS
Wed 19 July 2023
Popular Airline Passenger Routes Refresh
Sat 03 June 2023
Streaming Video
Thu 25 May 2023
IPinfo's Free IP Address Location Database
Sun 09 April 2023
DuckDB's Spatial Extension
Fri 03 March 2023
Geospatial DuckDB
Thu 23 February 2023
European Route Planning
Tue 10 January 2023
Faster PostgreSQL To BigQuery Transfers
Thu 22 September 2022
1.1 Billion Taxi Rides in ClickHouse on DoubleCloud
Fri 19 August 2022
Awesome Isochrones
Wed 03 August 2022
ECharts for Python
Mon 01 August 2022
Python Data Visualisation
Sun 24 July 2022
Hardening SSH
Wed 20 July 2022
Pretty Maps in Python
Thu 14 July 2022
Making Heatmaps
Thu 07 July 2022
Minimalist Guide to Poem
Sun 03 July 2022
Minimalist Guide to Axum
Fri 17 June 2022
File Sharing with Caddy & MinIO
Thu 26 May 2022
Deploying 5G Around Trees
Tue 24 May 2022
The Streets of Monaco
Wed 04 May 2022
Install ClickHouse Faster
Wed 20 April 2022
Faster Geospatial Enrichment
Sat 08 January 2022
Where is every IP Address?
Mon 29 November 2021
Faster Top Level Domain Name Extraction with Go
Thu 25 November 2021
The Fastest FizzBuzz Implementation
Thu 07 October 2021
ROAPI: An API Server for Static Datasets
Wed 06 October 2021
Actix: A Web Framework for Rust
Thu 16 September 2021
Rocket: A Web Framework for Rust
Tue 31 August 2021
Building PostgreSQL Extensions with Rust
Wed 25 August 2021
Faster Top Level Domain Name Extraction with Rust
Tue 24 August 2021
Track changes in Excel, Word, PowerPoint, PDFs, Images & Videos with Git
Fri 20 August 2021
Faster Compression with Snappy's S2 Extension
Sat 14 August 2021
MeiliSearch: A Minimalist Full-Text Search Engine
Tue 10 August 2021
MinIO: A Bare Metal Drop-In for AWS S3
Thu 22 July 2021
Monitor ClickHouse with Prometheus & Grafana
Sun 11 July 2021
Data Fluent for PostgreSQL
Thu 04 February 2021
1.1 Billion Taxi Rides using Hydrolix on AWS
Tue 21 July 2020
1.1 Billion Taxi Rides using OmniSciDB and a MacBook Pro
Mon 13 April 2020
Python Web Scraping with Virtual Private Networks
Thu 02 January 2020
Fast IPv4 to Host Lookups
Mon 28 October 2019
Faster ZIP Decompression
Sun 20 October 2019
Faster ClickHouse Imports
Tue 17 September 2019
YouTube's Database "Procella"
Fri 28 June 2019
Is Hadoop Dead?
Tue 18 June 2019
Minimalist Guide to Lossless Compression
Wed 20 March 2019
Faster File Distribution with HDFS and S3
Fri 22 February 2019
A Minimalist Guide to Flume
Mon 18 February 2019
A Minimalist Guide to FoundationDB
Mon 28 January 2019
"Architecting Modern Data Platforms" Book Review
Wed 16 January 2019
1.1 Billion Taxi Rides: 108-core ClickHouse Cluster
Wed 09 January 2019
Convert CSVs to ORC Faster
Wed 02 January 2019
1.1 Billion Taxi Rides: Spark 2.4.0 versus Presto 0.214
Tue 16 October 2018
Working with the Hadoop Distributed File System
Mon 08 October 2018
Systems Monitoring: top vs Htop vs Glances
Mon 24 September 2018
Working with Data Feeds
Mon 13 August 2018
A Minimalist Guide to Microsoft SQL Server 2017 on Ubuntu Linux
Thu 28 June 2018
1.1 Billion Taxi Rides with SQLite, Parquet & HDFS
Mon 25 June 2018
Customising Airflow: Beyond Boilerplate Settings
Thu 26 April 2018
Using SQL to query Kafka, MongoDB, MySQL, PostgreSQL and Redis with Presto
Tue 03 April 2018
Python & Big Data: Airflow & Jupyter Notebook with Hadoop 3, Spark & Presto
Tue 27 March 2018
1.1 Billion Taxi Rides: EC2 versus EMR
Mon 19 March 2018
Hadoop 3 Single-Node Install Guide
Mon 11 December 2017
1.1 Billion Taxi Rides with BrytlytDB 2.1 & a 5-node IBM Minsky Cluster
Mon 13 November 2017
1.1 Billion Taxi Rides with BrytlytDB 2.0 & 2 GPU-Powered p2.16xlarge EC2 Instances
Wed 01 November 2017
A Minimalist Guide to SQLite
Sun 17 September 2017
1.1 Billion Taxi Rides with Spark 2.2 & 3 Raspberry Pi 3 Model Bs
Fri 28 July 2017
1.1 Billion Taxi Rides with BrytlytDB & 2 GPU-Powered p2.16xlarge EC2 Instances
Mon 08 May 2017
Compiling MapD's Source Code
Wed 26 April 2017
1.1 Billion Taxi Rides with MapD 3.0 & 2 GPU-Powered p2.8xlarge EC2 Instances
Fri 10 March 2017
Detecting Bots in Apache & Nginx Logs
Sun 05 March 2017
Doom Bots in TensorFlow
Sun 26 February 2017
Analysing Petabytes of Websites
Wed 15 February 2017
A Review of "Designing Data-Intensive Applications"
Thu 09 February 2017
1.1 Billion Taxi Rides on ClickHouse & an Intel Core i5
Tue 07 February 2017
1.1 Billion Taxi Rides on Vertica & an Intel Core i5
Mon 30 January 2017
1.1 Billion Taxi Rides on AWS EMR 5.3.0 & Spark 2.1.0
Wed 25 January 2017
1.1 Billion Taxi Rides on kdb+/q & 4 Xeon Phi CPUs
Thu 15 December 2016
1.1 Billion Taxi Rides on Amazon Athena
Sat 26 November 2016
Alenka: A GPU-Driven, Open Source Database
Fri 12 August 2016
1.1 Billion Taxi Rides with MapD & 8 Nvidia Pascal Titan Xs
Thu 04 August 2016
TensorFlow on a GTX 1080
Mon 01 August 2016
Building a Data Pipeline with Airflow
Sat 16 July 2016
1.1 Billion Taxi Rides with MapD & AWS EC2
Tue 05 July 2016
1.1 Billion Taxi Rides with MapD & 4 Nvidia Titan Xs
Mon 27 June 2016
1.1 Billion Taxi Rides with MapD & 8 Nvidia Tesla K80s
Wed 22 June 2016
1.2 Billion Taxi Rides on AWS RDS running PostgreSQL
Wed 01 June 2016
1.1 Billion Taxi Rides on a Large Redshift Cluster
Fri 13 May 2016
All 1.1 Billion Taxi Rides on Redshift
Fri 06 May 2016
All 1.1 Billion Taxi Rides in Elasticsearch
Thu 05 May 2016
50-node Presto Cluster on Google Cloud's Dataproc
Tue 03 May 2016
Performance Impact of File Sizes on Presto Query Times
Sat 30 April 2016
Faster IPv4 WHOIS Crawling
Fri 29 April 2016
33x Faster Queries on Google Cloud's Dataproc
Tue 26 April 2016
Mass IP Address WHOIS Collection with Django & Kafka
Thu 14 April 2016
A Billion Taxi Rides: AWS S3 versus HDFS
Mon 11 April 2016
A Billion Taxi Rides on Google's Dataproc running Presto
Fri 08 April 2016
50-node Presto Cluster on Amazon EMR
Wed 06 April 2016
A Billion Taxi Rides on Google's BigQuery
Thu 31 March 2016
Bulk IP Address WHOIS Collection with Python and Hadoop
Sun 27 March 2016
A Billion Taxi Rides in PostgreSQL
Fri 25 March 2016
A Billion Taxi Rides in Elasticsearch
Mon 21 March 2016
A Billion Taxi Rides on Amazon EMR running Spark
Fri 11 March 2016
A Billion Taxi Rides on Amazon EMR running Presto
Thu 25 February 2016
Kafka Producer Latency with Large Topic Counts
Mon 22 February 2016
A Billion Taxi Rides in Hive & Presto
Sat 13 February 2016
A Billion Taxi Rides in Redshift
Sun 31 January 2016
Presto, Parquet & Airpal
Sat 23 January 2016
A Million Songs on AWS Redshift
Fri 01 January 2016
Hadoop Up and Running
Tue 10 November 2015
Faster Testing with RAM Drives
Thu 13 August 2015
Popular Airline Passenger Routes
Tue 24 March 2015
Recommendation Engine built using Spark and Python
Wed 11 March 2015
Tightening Django Admin Logins
Mon 09 March 2015
Linting UK Postcodes
Sat 24 January 2015
Passwords in Django
Tue 18 November 2014
Faster Python
Fri 14 November 2014
Crushing, caching and CDN deployment in Django
Mon 10 November 2014
Better Python Package Management
Sat 08 November 2014
Load balancing Django
Wed 05 November 2014
Faster Django Testing
Tue 04 November 2014
Django exception archaeology
Sat 01 November 2014
Python's killer apps for blogging: Pelican and S3cmd
Fri 31 October 2014
Collecting all IPv4 WHOIS records in Python
Thu 30 October 2014
Former PHP developer
Thu 30 October 2014
File uploads to Amazon S3 in Django
Wed 29 October 2014
IP Address lookups using Python
Tue 28 October 2014
Django speaking JSON
Tue 28 October 2014
Querying Elasticsearch from Google App Engine

Copyright © 2014 - 2024 Mark Litwintschik. This site's template is based on a template by Giulio Fidente.