I explore the task of bot detection in web traffic logs.
I examine the performance and reliability gains from using Redis across a 51-node IPv4 WHOIS crawling cluster.
I investigate how fast a cluster of EC2 instances can collect WHOIS records of IPv4 addresses.
I investigate how fast a 40-node Hadoop cluster on AWS EMR can collect WHOIS records of IPv4 addresses.
An exploratory effort to see how hard it is to collect the WHOIS records of all IPv4 addresses.
A comparison of four methods used to find the country of an IP address.
Copyright © 2014 - 2019 Mark Litwintschik. This site's template is based on a template by Giulio Fidente.