One Billion Row Challenge in Python

This post documents my attempt at the One Billion Row Challenge in pure Python, focusing on optimizing file reading, parallel processing, and data parsing. Through successive improvements, I reduced execution time from 1733 seconds to 144 seconds, roughly a 12x speedup, without external dependencies.
4 minutes to read
Ryan Peel

Setting up a new VPS

My home internet connection has no easy way to accept public connections. I am behind a double NAT, and I am not authorized to make changes to the second router. There are many solutions to this problem, each with its own pros and cons. I chose to use a VPS running reverse proxy software.
6 minutes to read
Ryan Peel