<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Forem: Doogal Simpson</title>
    <description>The latest articles on Forem by Doogal Simpson (@doogal).</description>
    <link>https://forem.com/doogal</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3657111%2Fac69c96a-33e1-4023-99ef-ee3059b3ccb6.jpeg</url>
      <title>Forem: Doogal Simpson</title>
      <link>https://forem.com/doogal</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://forem.com/feed/doogal"/>
    <language>en</language>
    <item>
      <title>Networking Logic: How Your Data Navigates the Internet</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:21:23 +0000</pubDate>
      <link>https://forem.com/doogal/networking-logic-how-your-data-navigates-the-internet-524c</link>
      <guid>https://forem.com/doogal/networking-logic-how-your-data-navigates-the-internet-524c</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: Downloading a file is an asynchronous process where data is fragmented into packets that navigate a mesh of routers independently. By using Layer 3 protocols to prioritize low-latency paths over physical distance, the network ensures resilient delivery even when specific nodes fail or become congested.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;When you download a file, you aren't opening a single pipe; you're solving a distributed logistics problem. I look at this process as thousands of independent packets of information converging on a destination. They don't follow a pre-set, static path. Instead, they navigate a web of hardware, jumping from server to server until they reassemble on your machine. To understand how this works, you have to look past the file itself and focus on the routing logic that moves it.&lt;/p&gt;

&lt;h2&gt;
  
  
  How does data actually move across the internet?
&lt;/h2&gt;

&lt;p&gt;Data moves by being fragmented into discrete packets, typically capped by Ethernet's 1500-byte Maximum Transmission Unit (MTU). These packets are handed to the network via the TCP/IP stack and routed independently, which lets a single transfer spread across every available physical path and its bandwidth.&lt;/p&gt;

&lt;p&gt;I think of it as a group of a hundred friends going on a road trip. You can’t fit them all in one car, so I put them in individual vehicles. Because every vehicle has the final address, they don't need to stay in a bumper-to-bumper convoy. One car might take the highway, another might take a toll road, and a third might take a bypass. As long as they reach the final coordinate, the specific path each car takes is secondary to the goal of arriving at the destination.&lt;/p&gt;
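&lt;p&gt;A minimal sketch of that fragmentation step. The 40-byte header figure and the dictionary "packet" are illustrative, not the real IP/TCP wire layout:&lt;/p&gt;

```python
MTU = 1500
HEADER_BYTES = 40   # illustrative: 20-byte IP header plus 20-byte TCP header
PAYLOAD_BYTES = MTU - HEADER_BYTES

def packetize(data: bytes, dest: str) -> list[dict]:
    """Chop data into independently routable packets."""
    packets = []
    for seq, offset in enumerate(range(0, len(data), PAYLOAD_BYTES)):
        packets.append({
            "dest": dest,   # every "car" carries the final address
            "seq": seq,     # lets the receiver reassemble in order
            "payload": data[offset:offset + PAYLOAD_BYTES],
        })
    return packets

packets = packetize(b"x" * 4000, "203.0.113.7")
print(len(packets))   # 3 packets: 1460 + 1460 + 1080 bytes
```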

&lt;h2&gt;
  
  
  What is the role of a router in packet switching?
&lt;/h2&gt;

&lt;p&gt;I look at routers as Layer 3 decision engines that use routing tables and protocols like BGP or OSPF to forward packets toward their destination. A router evaluates the header of every incoming packet and determines the next hop based on current link costs, congestion, and path availability.&lt;/p&gt;

&lt;p&gt;In my view, the router is the sat-nav at every intersection. When a packet hits a router, the hardware looks up the best available next hop. It isn't looking for the shortest physical distance; it’s looking for the path with the lowest effective latency. If a primary fiber link is saturated, the router will push the packet onto a secondary path that is physically longer but currently has more spare capacity. It’s a series of local hop decisions that add up to global delivery.&lt;/p&gt;
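&lt;p&gt;The per-hop decision can be sketched as cost minimisation. The port names and metrics below are invented; real OSPF and BGP derive costs from bandwidth and policy rather than a latency-plus-queue sum:&lt;/p&gt;

```python
def next_hop(links: dict) -> str:
    """Pick the live neighbour with the lowest current cost (latency plus queue depth)."""
    up_links = {port: m for port, m in links.items() if m["up"]}
    return min(up_links, key=lambda p: up_links[p]["latency_ms"] + up_links[p]["queue"])

links = {
    "fiber-1":  {"up": True,  "latency_ms": 5,  "queue": 40},   # short but saturated
    "fiber-2":  {"up": True,  "latency_ms": 12, "queue": 2},    # longer but clear
    "backup-3": {"up": False, "latency_ms": 30, "queue": 0},    # link is down
}
print(next_hop(links))   # fiber-2: physically longer, currently faster
```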

&lt;h2&gt;
  
  
  How does the network handle path failure and congestion?
&lt;/h2&gt;

&lt;p&gt;Network resilience is managed through dynamic re-routing and Time to Live (TTL) values that prevent packets from looping indefinitely. When a node fails or a path congests, the network reaches "convergence," where routers update their internal tables to reflect the new state of the network and bypass the failure.&lt;/p&gt;

&lt;p&gt;If I’m on a road trip and hit a "Road Closed" sign, my sat-nav recalculates the route based on real-time telemetry. The internet functions the same way. If a data center goes offline or an undersea cable is cut, your packets don't just stop. They take a detour through different nodes. I find this design elegant because it assumes the underlying infrastructure is unreliable and builds the reliability into the endpoint logic instead.&lt;/p&gt;
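&lt;p&gt;A toy model of that detour, over an invented topology. Real convergence happens through routing-protocol updates (BGP withdrawals, OSPF link-state floods) rather than a global search, but the outcome, a new loop-free path, is the same:&lt;/p&gt;

```python
from collections import deque

def shortest_path(graph, src, dst):
    """Breadth-first search for a hop-minimal path through live nodes."""
    queue = deque([[src]])
    seen = {src}
    while queue:
        path = queue.popleft()
        if path[-1] == dst:
            return path
        for nxt in graph.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None   # no route: the destination is unreachable

graph = {
    "client": ["isp"],
    "isp": ["core-1", "core-2"],
    "core-1": ["server"],
    "core-2": ["server"],
}
print(shortest_path(graph, "client", "server"))   # via core-1

graph["isp"] = ["core-2"]   # core-1 link withdrawn: the "Road Closed" sign
print(shortest_path(graph, "client", "server"))   # detours via core-2
```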

&lt;h2&gt;
  
  
  Technical Specs of the Data Trip
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Parameter&lt;/th&gt;
&lt;th&gt;Technical Reality&lt;/th&gt;
&lt;th&gt;Analogy Equivalent&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;MTU&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;1500-byte packet limit&lt;/td&gt;
&lt;td&gt;Car seating capacity.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;TTL&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Hop-limit counter&lt;/td&gt;
&lt;td&gt;A fuel gauge that drops at every intersection.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Router&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Layer 3 Forwarding&lt;/td&gt;
&lt;td&gt;The sat-nav at every crossroads.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;TCP&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Sequence Reassembly&lt;/td&gt;
&lt;td&gt;The manifest used to check friends in at the goal.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Latency&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;RTT (Round Trip Time)&lt;/td&gt;
&lt;td&gt;Total travel time for one vehicle.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Do packets always arrive in the correct order?&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
No. Since packets take different paths, Packet #100 might arrive before Packet #1. I rely on the TCP layer on the receiving machine to act as a buffer, holding early arrivals and releasing data to the application in sequence-number order as the gaps are filled in.&lt;/p&gt;
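&lt;p&gt;A sketch of that buffering logic, using an illustrative packet format rather than real TCP segments:&lt;/p&gt;

```python
def reassemble(arrivals: list[dict]) -> bytes:
    """Deliver payloads in sequence order, buffering early arrivals."""
    buffer = {}
    delivered = b""
    expected = 0
    for pkt in arrivals:
        buffer[pkt["seq"]] = pkt["payload"]
        while expected in buffer:   # a gap just closed: flush everything in order
            delivered += buffer.pop(expected)
            expected += 1
    return delivered

# Packet #2 arrives before #1; the delivered stream is still in order.
arrivals = [
    {"seq": 0, "payload": b"AA"},
    {"seq": 2, "payload": b"CC"},   # early: buffered, not delivered yet
    {"seq": 1, "payload": b"BB"},   # fills the gap: BB and CC both flush
]
print(reassemble(arrivals))   # b'AABBCC'
```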

&lt;p&gt;&lt;strong&gt;What happens if a packet is lost in transit?&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
If a packet is dropped due to hardware failure or congestion, the receiver identifies the gap in the sequence. It sends a request back to the source to re-transmit that specific missing packet. This ensures the integrity of the file without needing to restart the entire transfer from the beginning.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How does a router determine the 'best' path?&lt;/strong&gt;&lt;br&gt;&lt;br&gt;
Routers maintain routing tables that are updated via neighbor exchange protocols. They use metrics like hop count and bandwidth to determine the path of least resistance. When the topology changes, routers exchange updates until they all share a consistent view of the network again; that settling process is called convergence, and it keeps traffic as efficient as possible.&lt;/p&gt;

</description>
      <category>networking</category>
      <category>internetinfrastructure</category>
      <category>computerscience</category>
    </item>
    <item>
      <title>What About Second HTTP? Solving the 100-File Connection Bottleneck</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:19:22 +0000</pubDate>
      <link>https://forem.com/doogal/what-about-second-http-solving-the-100-file-connection-bottleneck-2c9k</link>
      <guid>https://forem.com/doogal/what-about-second-http-solving-the-100-file-connection-bottleneck-2c9k</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: HTTP/2 replaces the inefficient six-connection limit of HTTP/1.1 with a single, multiplexed stream. By breaking assets into small, interleaved chunks, it eliminates head-of-line blocking and prevents multiple connections from fighting for bandwidth, allowing browsers to request and download hundreds of files simultaneously without the overhead of repeated TCP handshakes.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;We’ve had first HTTP, but what about second HTTP? Back in the "olden days," websites were tiny. You had an HTML file, a CSS file, and maybe a couple of GIFs. For that scale, HTTP/1.1 worked fine. But modern web engineering has moved toward shipping hundreds of small files—JavaScript modules, fragmented CSS, and optimized assets. &lt;/p&gt;

&lt;p&gt;When you try to shove 100 files through a protocol designed for five, things get slow. It isn't just about the raw size of the data; it’s about how the protocol manages the "wire" itself. Let's look at why we had to move on to HTTP/2 to build the websites we actually want to build today.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why is the 6-connection limit a problem for modern sites?
&lt;/h2&gt;

&lt;p&gt;HTTP/1.1 browsers typically limit themselves to six concurrent TCP connections per domain. If your site requires 100 files to render, the browser is forced to download them in batches of six, leaving the remaining 94 files stuck in a queue until a slot opens up.&lt;/p&gt;

&lt;p&gt;This isn't just a queuing issue; it's a resource management disaster. Each of those six connections takes time to establish via a TCP handshake. Once they are open, these connections don't cooperate; they actively fight each other for available bandwidth. Instead of one smooth stream of data, you have six competing processes creating noise and congestion on the network. For a site with 100+ assets, this "batching" adds massive latency that purely sequential processing can't overcome.&lt;/p&gt;
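&lt;p&gt;A back-of-the-envelope model of that batching penalty. The timings are invented round numbers and ignore bandwidth, TCP slow start, and contention, so only the shape of the math matters:&lt;/p&gt;

```python
import math

FILES = 100
RTT_MS = 50      # one request/response round trip (invented)
SETUP_MS = 150   # TCP + TLS connection setup (invented; connections open in parallel)

# HTTP/1.1: six pipes, each strictly sequential, so ceil(100/6) serial batches.
http1_ms = SETUP_MS + math.ceil(FILES / 6) * RTT_MS
# HTTP/2: one pipe, every request fired at once and interleaved on the wire.
http2_ms = SETUP_MS + RTT_MS

print(http1_ms)   # 1000 ms: most of it spent queueing behind the six slots
print(http2_ms)   # 200 ms
```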

&lt;h2&gt;
  
  
  What happens when a file gets stuck in HTTP/1.1?
&lt;/h2&gt;

&lt;p&gt;In HTTP/1.1, if one file in a connection downloads slowly or gets "stuck," that entire connection is blocked until the transfer completes. This is known as Head-of-Line (HOL) blocking, where a single heavy asset prevents every subsequent file in the queue from moving forward.&lt;/p&gt;

&lt;p&gt;If you're down to five active connections because one is hung up on a large image or a slow server response, your throughput drops immediately. There is no way for the browser to say, "Hey, skip that big file and send me the tiny JS snippet instead." The protocol is strictly sequential within those six pipes. If the "head" of the line is blocked, everything behind it stays put.&lt;/p&gt;

&lt;h2&gt;
  
  
  How does HTTP/2 multiplexing eliminate the queue?
&lt;/h2&gt;

&lt;p&gt;HTTP/2 ignores the six-connection rule and uses one single, high-performance connection to request everything at once. It does this by chopping every file into little chunks and interleaving them, so data for all 100 files starts moving down the wire simultaneously.&lt;/p&gt;

&lt;p&gt;Because it’s one connection, we avoid the overhead of multiple handshakes and the bandwidth contention issues where separate connections fight for priority. If one file gets stuck or arrives slowly, it doesn't matter. The browser is already busy receiving the chunks for the other 99 files. Everything gets sent down the wire in parallel and is reconstructed by the browser on the other end.&lt;/p&gt;
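&lt;p&gt;A sketch of that interleaving. Real HTTP/2 frames carry a 9-byte binary header and priority weights; this only shows chunks from several streams sharing one wire and being rebuilt by stream ID on the far side:&lt;/p&gt;

```python
from itertools import zip_longest

def interleave(files: dict) -> list:
    """Chop each file into chunks and emit (stream_id, chunk) frames round-robin."""
    chunked = {sid: [data[i:i+4] for i in range(0, len(data), 4)]
               for sid, data in files.items()}
    streams = [[(sid, c) for c in chunks] for sid, chunks in chunked.items()]
    frames = []
    for row in zip_longest(*streams):
        frames.extend(f for f in row if f is not None)
    return frames

frames = interleave({1: b"AAAAAAAA", 3: b"BBBB", 5: b"CCCCCCCC"})
print(frames[0], frames[1], frames[2])   # streams 1, 3 and 5 all start at once

# The receiver groups chunks by stream ID to rebuild each file.
reassembled = {}
for sid, chunk in frames:
    reassembled[sid] = reassembled.get(sid, b"") + chunk
print(reassembled[5])   # b'CCCCCCCC' rebuilt from interleaved frames
```

A slow stream never blocks the others here: even if stream 1 stalls, frames for streams 3 and 5 are already moving down the same wire.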

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Feature&lt;/th&gt;
&lt;th&gt;HTTP/1.1 (The Old Way)&lt;/th&gt;
&lt;th&gt;HTTP/2 (The Modern Way)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Concurrency&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;6 connections (Max)&lt;/td&gt;
&lt;td&gt;1 connection (Multiplexed)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;File Handling&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Sequential (one at a time per pipe)&lt;/td&gt;
&lt;td&gt;Parallel (all at once via chunks)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Network Efficiency&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;High contention, multiple handshakes&lt;/td&gt;
&lt;td&gt;Low overhead, optimized bandwidth&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Failure Mode&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;HOL blocking stalls the queue&lt;/td&gt;
&lt;td&gt;Interleaved frames prevent stalls&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Is setting up one connection really faster than six?
&lt;/h2&gt;

&lt;p&gt;You might think more pipes equal more speed, but in networking, the opposite is often true because of the "slow-start" algorithm in TCP. A single HTTP/2 connection can optimize its throughput faster and more accurately than six competing connections that are constantly triggering congestion control mechanisms.&lt;/p&gt;

&lt;p&gt;By moving to HTTP/2, we've stopped trying to hack around the protocol and started using one that understands we’re building sites made of hundreds of files. It’s about getting everything down the wire as fast as possible so the user isn't left staring at a loading spinner while the browser tries to manage a congested queue. Cheers!&lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Do I still need to bundle my files into one giant 'bundle.js' with HTTP/2?&lt;/strong&gt;&lt;br&gt;
While bundling isn't as critical for bypassing connection limits as it was in HTTP/1.1, it’s still useful for compression efficiency. However, HTTP/2 makes it much more performant to ship unbundled modules, which can lead to better caching strategies.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Does HTTP/2 work over unencrypted connections?&lt;/strong&gt;&lt;br&gt;
While the spec doesn't strictly require encryption, all major browser implementations (Chrome, Firefox, Safari) only support HTTP/2 over TLS (HTTPS). If you want the speed of Second HTTP, you need a TLS certificate.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How does the browser know how to put the chunks back together?&lt;/strong&gt;&lt;br&gt;
HTTP/2 uses a framing layer. Each chunk of data is wrapped in a 'frame' that contains a stream identifier. The browser sees these IDs and knows exactly which file each chunk belongs to, allowing it to reassemble the assets perfectly even though they arrived interleaved.&lt;/p&gt;

</description>
      <category>webdev</category>
      <category>http2</category>
      <category>networking</category>
    </item>
    <item>
      <title>The 600ms Tax: Why Every TCP Connection Starts with a State Negotiation</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:17:32 +0000</pubDate>
      <link>https://forem.com/doogal/the-600ms-tax-why-every-tcp-connection-starts-with-a-state-negotiation-554f</link>
      <guid>https://forem.com/doogal/the-600ms-tax-why-every-tcp-connection-starts-with-a-state-negotiation-554f</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: A TCP handshake is a mandatory three-step negotiation—SYN, SYN-ACK, and ACK—required to synchronize sequence numbers and reserve memory buffers before data transfer. This protocol overhead adds a minimum of 1.5 round-trips of latency, making persistent connection reuse a critical optimization for reducing time-to-first-byte (TTFB).&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Fetching a file from a remote server isn't just a matter of bandwidth; it’s a battle against the physics of the network. If each one-way leg to a server takes 200ms, you aren't waiting 200ms for your data. You’re waiting 600ms before the first byte can even be requested. This delay isn't a limitation of your fiber line—it’s the intentional cost of establishing a reliable state between two machines over an unreliable infrastructure.&lt;/p&gt;

&lt;p&gt;I look at this as a "resource reservation tax." Before a single packet of your HTML or JSON is sent, both the client and the server have to agree on exactly how they will track the data. This negotiation is what we call the TCP handshake.&lt;/p&gt;

&lt;h2&gt;
  
  
  What is a TCP handshake and why is it necessary?
&lt;/h2&gt;

&lt;p&gt;A TCP handshake is a three-way exchange used to initialize a reliable logical connection by synchronizing sequence numbers and allocating memory buffers on both the client and server. This process ensures that both parties have the state necessary to track packet delivery, handle retransmissions, and reassemble data in the correct order.&lt;/p&gt;

&lt;p&gt;Think of it as initializing a shared state machine. Both computers need to know where to start counting bytes—the sequence number—and they need to set aside specific memory for that specific connection. Without this synchronization, the receiving end would have no way to distinguish between a new packet and a delayed packet from a previous session. It’s about creating a predictable environment out of the chaos of the internet.&lt;/p&gt;

&lt;h2&gt;
  
  
  How does the three-way handshake impact network latency?
&lt;/h2&gt;

&lt;p&gt;The TCP handshake impacts latency by requiring three sequential legs of communication—client to server, server to client, and client back to server—before the application data can be requested. This creates a latency floor where the initial connection setup time is directly proportional to the physical distance between the two machines.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Phase&lt;/th&gt;
&lt;th&gt;Direction&lt;/th&gt;
&lt;th&gt;Technical Purpose&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;SYN&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Client -&amp;gt; Server&lt;/td&gt;
&lt;td&gt;Client proposes an initial sequence number and requests synchronization.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;SYN-ACK&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Server -&amp;gt; Client&lt;/td&gt;
&lt;td&gt;Server acknowledges client's sequence, proposes its own, and allocates memory buffers.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;ACK&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Client -&amp;gt; Server&lt;/td&gt;
&lt;td&gt;Client confirms the server's state; usually piggybacks the first actual data request (e.g., HTTP GET).&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;If each leg of this journey takes 200ms, you’ve spent 600ms just establishing that the two computers can "hear" each other and have enough memory allocated to handle the session. This is why a small 1KB file can often feel as slow to load as a much larger one; the overhead of the handshake is the dominant factor.&lt;/p&gt;
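&lt;p&gt;The arithmetic of those three legs can be sketched directly; the 200ms figure is the example value from this post:&lt;/p&gt;

```python
LEG_MS = 200   # one-way latency between client and server (example value)

handshake_legs = [
    ("SYN", LEG_MS),       # client to server: propose a sequence number
    ("SYN-ACK", LEG_MS),   # server to client: acknowledge, allocate buffers
    ("ACK", LEG_MS),       # client to server: confirm (can carry the HTTP GET)
]
setup_ms = sum(ms for _, ms in handshake_legs)
first_byte_ms = setup_ms + LEG_MS   # plus the server's first data leg back

print(setup_ms)        # 600: spent before any application data moves
print(first_byte_ms)   # 800: earliest the first response byte can land
```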

&lt;h2&gt;
  
  
  Why is connection reuse essential for high-performance apps?
&lt;/h2&gt;

&lt;p&gt;Connection reuse, or persistence, allows multiple requests to be sent over a single established TCP connection, bypassing the 1.5 RTT handshake penalty for subsequent data transfers. By maintaining the synchronized state and allocated memory, the client and server can communicate with zero additional setup overhead after the initial connection.&lt;/p&gt;

&lt;p&gt;Imagine you’re building a microservice that needs to fetch twenty different assets. If you opened a new TCP connection for every single asset, you’d be paying that 600ms tax twenty times over. That is 12 seconds of just "saying hello." Instead, modern protocols like HTTP/1.1 and HTTP/2 establish a small pool of persistent connections. We pay the handshake price once, keep the buffers warm, and then stream the data through the existing pipe. This is the single most effective way to mitigate the impact of physical distance on application performance.&lt;/p&gt;
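&lt;p&gt;The cumulative tax is easy to model. The asset count and per-request time below are invented; the 600ms setup cost is the example figure from this post:&lt;/p&gt;

```python
HANDSHAKE_MS = 600   # three 200ms legs, as in the example above
REQUEST_MS = 100     # invented time to serve one asset on a warm connection
ASSETS = 20

naive = ASSETS * (HANDSHAKE_MS + REQUEST_MS)   # open a fresh connection per asset
pooled = HANDSHAKE_MS + ASSETS * REQUEST_MS    # pay the tax once, then reuse

print(naive)    # 14000 ms total, 12000 ms of which is pure "saying hello"
print(pooled)   # 2600 ms
```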

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Why can't we just start sending data with the first SYN packet?&lt;/strong&gt;&lt;br&gt;
Standard TCP requires the three-way handshake to prevent "Sequence Number Guessing" attacks and to ensure the server doesn't allocate resources for spoofed IP addresses. However, an optimization called TCP Fast Open (TFO) does allow data to be included in the SYN packet for subsequent connections if the client has connected to that server before.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Does a TCP handshake happen for every HTTP request?&lt;/strong&gt;&lt;br&gt;
In modern web development, no. Thanks to the &lt;code&gt;Keep-Alive&lt;/code&gt; header in HTTP/1.1 and the multiplexing capabilities of HTTP/2 and HTTP/3, a single TCP (or QUIC) connection is typically kept open and reused for hundreds of requests to the same origin.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What happens if the ACK packet in the handshake is lost?&lt;/strong&gt;&lt;br&gt;
If the final ACK from the client is lost, the server will eventually time out the half-open connection and deallocate the memory it reserved. The client, assuming the connection is open, will attempt to send data, which the server will reject or ignore, forcing the client to re-establish the connection.&lt;/p&gt;

&lt;p&gt;Cheers.&lt;/p&gt;

</description>
      <category>networking</category>
      <category>tcp</category>
      <category>webperf</category>
    </item>
    <item>
      <title>TCP Exponential Backoff: Why Your Retries are Doubling</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:16:15 +0000</pubDate>
      <link>https://forem.com/doogal/tcp-exponential-backoff-why-your-retries-are-doubling-342f</link>
      <guid>https://forem.com/doogal/tcp-exponential-backoff-why-your-retries-are-doubling-342f</guid>
      <description>&lt;p&gt;&lt;strong&gt;TCP prevents network meltdowns by doubling its wait time (Exponential Backoff) every time a packet fails to acknowledge. Instead of spamming a congested link, I look at how the protocol calculates a dynamic Retransmission Timeout (RTO) and then backs off to allow hardware buffers to clear and avoid total congestion collapse.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I find it wild that we can just download a file from a physical computer on another continent through a chaotic web of underwater cables and intermediary servers. When I think about TCP, I'm looking at the protocol responsible for taking that file, chopping it into chunks, and ensuring it arrives mostly reliably despite the physical insanity of the global internet infrastructure. The genius isn't just in the delivery, but in how the protocol knows when to stop talking so the network doesn't cave in on itself.&lt;/p&gt;

&lt;h2&gt;
  
  
  How does TCP handle packet loss?
&lt;/h2&gt;

&lt;p&gt;I see TCP ensuring reliability by requiring a specific acknowledgment (ACK) for every data segment sent. If the sender transmits a chunk and doesn't receive an ACK within a set window, it assumes the packet was lost and initiates a retransmission.&lt;/p&gt;

&lt;p&gt;I usually think of this like a microservice health check or a database heartbeat. If I send a request and don't get a response, I have to decide when that request has officially failed. If I retry after a single millisecond, I'm going to overwhelm a service that might just be slightly lagged. If I wait five seconds, I'm killing my application's throughput. I need TCP to find that precise "wait" window for every unique connection to keep the data moving as fast as possible without causing a bottleneck.&lt;/p&gt;

&lt;h2&gt;
  
  
  What determines the Retransmission Timeout (RTO)?
&lt;/h2&gt;

&lt;p&gt;The RTO is a dynamic timer that I use to judge when a packet is officially "lost" based on previous Round Trip Time (RTT) measurements. It isn't a static value because the latency I see to a server in London is vastly different from the latency to a server in my own rack.&lt;/p&gt;

&lt;p&gt;If my previous packets have been successfully acknowledged in 500ms, my TCP stack might set an RTO of 700ms to provide a buffer for minor jitter. But once that 700ms timer expires without an ACK, the logic has to change. If I just kept hitting the network at 700ms intervals during a failure, I'd be making a bad situation worse. This is why I rely on exponential backoff to handle the silence.&lt;/p&gt;
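&lt;p&gt;A sketch of an RFC 6298-style estimator behind that logic. Real stacks clamp the RTO to a minimum and work in coarser clock ticks, and the sample values here are invented (a link jittery enough would land near the 700ms figure above):&lt;/p&gt;

```python
class RtoEstimator:
    """RFC 6298-style smoothed RTO; sample values are illustrative."""
    ALPHA = 1 / 8   # gain for the smoothed RTT
    BETA = 1 / 4    # gain for the variance estimate

    def __init__(self, first_rtt_ms: float):
        self.srtt = first_rtt_ms         # smoothed round-trip time
        self.rttvar = first_rtt_ms / 2   # round-trip time variance
        self.rto = self.srtt + 4 * self.rttvar

    def sample(self, rtt_ms: float) -> float:
        """Fold a new RTT measurement into the estimate (RFC 6298 update order)."""
        self.rttvar = (1 - self.BETA) * self.rttvar + self.BETA * abs(self.srtt - rtt_ms)
        self.srtt = (1 - self.ALPHA) * self.srtt + self.ALPHA * rtt_ms
        self.rto = self.srtt + 4 * self.rttvar
        return self.rto

est = RtoEstimator(500)   # first measured RTT: 500ms
print(est.rto)            # 1500.0: a wide buffer until the variance settles
for _ in range(20):
    est.sample(500)       # a steady link: variance decays, the RTO tightens
print(round(est.rto))     # 503: hugging the measured RTT plus a small margin
```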

&lt;h2&gt;
  
  
  Why does TCP use exponential backoff?
&lt;/h2&gt;

&lt;p&gt;I use exponential backoff to prevent "congestion collapse," a state where a network is so saturated with retransmissions that no actual work can get through. By doubling the RTO after every failure, I'm effectively using a circuit breaker to reduce the load on the network until the bottleneck clears.&lt;/p&gt;

&lt;p&gt;To understand why I need this, we have to look at the hardware buffers. Every router between my machine and the destination has a finite amount of memory to queue incoming packets. When network traffic spikes and those buffers reach capacity, the router performs a "tail drop"—it simply discards any new incoming packets because there is nowhere to put them. &lt;/p&gt;

&lt;p&gt;If every device on that segment responded to a drop by immediately resending data at high frequency, they would create a retransmission storm. The buffers would stay at 100% utilization, and the router would spend all its resources dropping packets rather than routing them. By exponentially increasing the wait time, I'm giving those hardware buffers the space they need to drain and recover. It's about being a good neighbor to the rest of the traffic on the wire.&lt;/p&gt;

&lt;h2&gt;
  
  
  How does backoff scale across retries?
&lt;/h2&gt;

&lt;p&gt;With every consecutive failure to receive an ACK, I double the previous RTO. This binary exponential backoff continues until I hit a maximum threshold or the operating system finally decides the connection is dead and kills the socket.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Retry Count&lt;/th&gt;
&lt;th&gt;Backoff Multiplier&lt;/th&gt;
&lt;th&gt;Example RTO (ms)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Initial Transmission&lt;/td&gt;
&lt;td&gt;1x&lt;/td&gt;
&lt;td&gt;700&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;2x&lt;/td&gt;
&lt;td&gt;1,400&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;4x&lt;/td&gt;
&lt;td&gt;2,800&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;8x&lt;/td&gt;
&lt;td&gt;5,600&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;16x&lt;/td&gt;
&lt;td&gt;11,200&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;5&lt;/td&gt;
&lt;td&gt;32x&lt;/td&gt;
&lt;td&gt;22,400&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Eventually, the network does time out. There's a limit to how long I'll wait, but this aggressive backing off is what keeps a local network failure from cascading into a total blackout for every other user on that same infrastructure. It’s the difference between a minor lag and a total network shutdown.&lt;/p&gt;
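&lt;p&gt;That doubling schedule takes a few lines to reproduce. The 700ms base RTO is the example value from this post; Linux additionally caps each individual RTO (commonly at 120 seconds) and the retry count via &lt;code&gt;net.ipv4.tcp_retries2&lt;/code&gt;:&lt;/p&gt;

```python
def backoff_schedule(base_rto_ms: int, max_retries: int) -> list:
    """RTO doubles after every unacknowledged (re)transmission."""
    schedule = []
    rto = base_rto_ms
    for _ in range(max_retries + 1):   # initial transmission plus retries
        schedule.append(rto)
        rto = rto * 2                  # binary exponential backoff
    return schedule

print(backoff_schedule(700, 5))
# [700, 1400, 2800, 5600, 11200, 22400]
```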

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;h3&gt;
  
  
  What is the difference between RTT and RTO?
&lt;/h3&gt;

&lt;p&gt;RTT (Round Trip Time) is the actual measured time it takes for a packet to travel to the destination and back. RTO (Retransmission Timeout) is the calculated duration the sender will wait for an acknowledgment before assuming the packet was lost, typically derived from RTT plus a variance buffer.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why not just use a fixed retry interval?
&lt;/h3&gt;

&lt;p&gt;Fixed intervals lead to congestion collapse. If thousands of devices all retry at the same static interval, they can synchronize their retransmissions, creating massive spikes in traffic that keep router buffers full and prevent the network from ever recovering.&lt;/p&gt;

&lt;h3&gt;
  
  
  Can I tune the maximum number of TCP retries?
&lt;/h3&gt;

&lt;p&gt;In Linux, I can tune this via &lt;code&gt;sysctl&lt;/code&gt; using the &lt;code&gt;net.ipv4.tcp_retries2&lt;/code&gt; parameter. This setting dictates how many times the kernel will retry before giving up on an established connection. While I can lower this to fail faster, increasing it too much can lead to stale sockets hanging around for over half an hour on a dead link.&lt;/p&gt;

</description>
      <category>networking</category>
      <category>tcp</category>
      <category>softwareengineering</category>
    </item>
    <item>
      <title>TCP: Why the Internet Works Even When It's Broken</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:12:58 +0000</pubDate>
      <link>https://forem.com/doogal/tcp-why-the-internet-works-even-when-its-broken-661</link>
      <guid>https://forem.com/doogal/tcp-why-the-internet-works-even-when-its-broken-661</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: TCP is how we send big files over a mess of unreliable cables. It chops data into numbered chunks and won't stop nagging the receiver until every single piece is accounted for. If a packet gets dropped or a router chokes, TCP just resends it until the job is done.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The internet is a series of physical cables and aging hardware that is constantly failing. Between your computer and a server, there are dozens of points of failure where data can be lost, corrupted, or just dropped because a router got too hot. TCP (Transmission Control Protocol) is the protocol that keeps your data from becoming a corrupted mess by assuming the network is going to fail.&lt;/p&gt;

&lt;h2&gt;
  
  
  How does TCP handle data loss on an unreliable network?
&lt;/h2&gt;

&lt;p&gt;TCP handles data loss by breaking large files into small chunks and requiring a formal acknowledgment for every single one. Instead of hoping a 1GB file arrives in one piece, it treats the network as a "best-effort" medium and takes full responsibility for verifying that every byte landed safely.&lt;/p&gt;

&lt;p&gt;Sending a massive file over a physical wire in one piece would be a gamble: one flipped bit fails the checksum, and one congested router silently drops the packet. Without a recovery mechanism, the whole transmission would be ruined. TCP doesn't gamble. It puts every page of your data into its own envelope, labels it with a page number, and sends it out. Then, it waits for a phone call. If the recipient says they got pages 1, 2, and 4, the sender knows page 3 was lost in the mail. The sender doesn't have to guess; they just grab a copy of page 3 and send it again until the recipient confirms they have it.&lt;/p&gt;
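&lt;p&gt;Gap detection on the receiving side can be sketched like this; real TCP reports gaps through cumulative ACKs and SACK blocks rather than a page list, but the idea is the same:&lt;/p&gt;

```python
def missing_pages(received: set, total: int) -> list:
    """Return the sequence numbers the sender must retransmit."""
    return [seq for seq in range(1, total + 1) if seq not in received]

received = {1, 2, 4, 5}             # page 3 was lost in the mail
print(missing_pages(received, 5))   # [3]: resend exactly this one page
```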

&lt;h2&gt;
  
  
  What is the step-by-step process of TCP data transmission?
&lt;/h2&gt;

&lt;p&gt;TCP transmission follows a strict loop of segmentation, sequencing, and verification. It transforms a raw, unreliable stream of bits into a structured conversation between two machines to ensure the final payload is identical to the source.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Phase&lt;/th&gt;
&lt;th&gt;What Happens&lt;/th&gt;
&lt;th&gt;Why it Matters&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Segmentation&lt;/td&gt;
&lt;td&gt;Chop the payload into MTU-sized segments.&lt;/td&gt;
&lt;td&gt;Keeps chunks small enough for hardware to handle without choking.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Sequencing&lt;/td&gt;
&lt;td&gt;Stamp every packet with a sequence ID.&lt;/td&gt;
&lt;td&gt;Lets the receiver rebuild the file in order, even if packets arrive late.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Transmission&lt;/td&gt;
&lt;td&gt;Push segments onto the physical wire.&lt;/td&gt;
&lt;td&gt;This is the "unreliable" part where cables and routers take over.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;ACK Loop&lt;/td&gt;
&lt;td&gt;Wait for Acknowledgment (ACK) signals.&lt;/td&gt;
&lt;td&gt;The only way the sender knows the data actually arrived.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Retransmit&lt;/td&gt;
&lt;td&gt;Resend segments if an ACK times out.&lt;/td&gt;
&lt;td&gt;Fixes network errors automatically without the user ever noticing.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;
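&lt;p&gt;The phases in the table above can be sketched as a toy loop. This is a deliberately simplified model, not real TCP: the function name, the fake lossy channel, and the per-segment ACKs are my own stand-ins for the actual protocol machinery.&lt;/p&gt;

```python
import random

MTU = 4  # toy segment size in bytes (real MTUs are closer to 1500)

def tcp_send(payload, loss_rate=0.3, rng=random.Random(42)):
    """Toy model of the segment / sequence / ACK / retransmit loop."""
    # 1. Segmentation + 2. Sequencing: chop the payload, stamp each chunk.
    segments = {seq: payload[i:i + MTU]
                for seq, i in enumerate(range(0, len(payload), MTU))}
    received = {}
    unacked = set(segments)
    while unacked:
        for seq in sorted(unacked):
            # 3. Transmission: the "unreliable" part; some segments vanish.
            if rng.random() >= loss_rate:
                received[seq] = segments[seq]
        # 4. ACK loop: the sender clears only what the receiver confirmed.
        unacked -= set(received)
        # 5. Retransmit: anything still in `unacked` goes out again next pass.
    # The receiver rebuilds the payload in sequence order.
    return b"".join(received[seq] for seq in sorted(received))

assert tcp_send(b"hello, reliable world") == b"hello, reliable world"
```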

&lt;h2&gt;
  
  
  Why does TCP use sequence numbers for every packet?
&lt;/h2&gt;

&lt;p&gt;Sequence numbers act as the index that allows the receiver to reassemble data in the correct order and identify gaps. Without these numbers, the receiver would have no way of knowing if a packet was missing or if the data arrived out of sequence.&lt;/p&gt;

&lt;p&gt;Think about a high-res image being sent across the country. Packet #50 might take a faster route through the network and arrive before packet #49. Without sequence numbers, your computer would just stick the bits together in the order they arrived, and the image would look like static. The sequence number tells the OS exactly where that chunk belongs in the final file, allowing it to buffer early arrivals until the missing gaps are filled.&lt;/p&gt;
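&lt;p&gt;That buffering behaviour can be sketched as a tiny receiver. The names here are illustrative, and a real TCP stack tracks byte offsets rather than packet counts.&lt;/p&gt;

```python
def reassemble(packets, total):
    """Buffer out-of-order packets and emit data only in sequence order.

    `packets` is an iterable of (sequence_number, chunk) pairs, possibly
    arriving out of order; `total` is the expected number of chunks.
    """
    buffer = {}          # early arrivals parked by sequence number
    next_needed = 0      # the gap we are currently waiting to fill
    output = []
    for seq, chunk in packets:
        buffer[seq] = chunk
        # Flush everything that is now contiguous from the front.
        while next_needed in buffer:
            output.append(buffer.pop(next_needed))
            next_needed += 1
    assert next_needed == total, "gaps remain; the sender must retransmit"
    return b"".join(output)

# The packet #50 scenario: #2 takes a faster route and arrives before #1.
assert reassemble([(0, b"he"), (2, b"lo"), (1, b"l")], total=3) == b"hello"
```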

&lt;h2&gt;
  
  
  What happens when a TCP acknowledgment is never received?
&lt;/h2&gt;

&lt;p&gt;When an acknowledgment (ACK) doesn't return within the retransmission timeout, the sender assumes the packet died in transit and triggers a retransmission. It keeps the data in a local buffer and refuses to clear it until it's 100% sure the other side has it.&lt;/p&gt;

&lt;p&gt;This is the core of TCP's reliability. If you are pushing code to a server and the connection flutters, TCP doesn't just let that chunk of data vanish. It will keep retrying that specific sequence ID until the server finally responds with a green light. It’s a persistent, nagging mechanism that ensures the integrity of the data at the cost of some overhead.&lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;What is the cost of TCP reliability?&lt;/strong&gt;&lt;br&gt;
The primary cost is latency overhead. Because every segment must be acknowledged (the ACK), and a "handshake" is required to start the connection, TCP is naturally slower than protocols that just fire data into the void without checking if it landed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why use TCP over UDP?&lt;/strong&gt;&lt;br&gt;
You use TCP when accuracy is non-negotiable, like loading a website, sending an email, or downloading software. You use UDP when speed is more important than a few dropped packets, like in a Zoom call or a competitive multiplayer game where a momentary glitch is better than the whole stream pausing to wait for a retransmission.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Does TCP ever give up on resending data?&lt;/strong&gt;&lt;br&gt;
Yes. While TCP is persistent, it isn't infinite. If it fails to get an ACK after a set number of retries or a specific timeout period, it will eventually "reset" the connection and signal to the application that the network path is dead.&lt;/p&gt;

</description>
      <category>networking</category>
      <category>tcp</category>
      <category>softwareengineering</category>
    </item>
    <item>
      <title>Fixing Biased Entropy: The Von Neumann Unbiasing Trick</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:11:38 +0000</pubDate>
      <link>https://forem.com/doogal/fixing-biased-entropy-the-von-neumann-unbiasing-trick-22ah</link>
      <guid>https://forem.com/doogal/fixing-biased-entropy-the-von-neumann-unbiasing-trick-22ah</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: I've found that hardware entropy sources are rarely uniform. To solve this, I use Von Neumann Unbiasing, which pairs bits and discards identical results (00, 11). By mapping 01 to 0 and 10 to 1, I can extract a perfectly fair 50/50 distribution from any biased source, provided the bias is constant and bits are independent.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I’ve found that hardware is always noisier than you’d expect—and rarely in the way you want. When I pull entropy from thermal jitter or diode noise, I'm dealing with the messy physical world, which doesn't care about my requirement for a perfect distribution. A sensor might lean toward a logic high or low due to temperature fluctuations or voltage drops, and in practice, achieving a perfect 0.5 probability out of a physical component is almost impossible.&lt;/p&gt;

&lt;p&gt;If I see someone using biased entropy for key generation, I know they're shrinking their effective keyspace and making their system vulnerable to brute-force attacks. A cryptographic key is only as strong as the randomness used to create it. If your bits are weighted 60/40, you’ve introduced a pattern that an attacker can exploit. I need to process that raw, physical noise into a mathematically balanced stream before it is used in a production environment.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why are hardware random number generators often biased?
&lt;/h2&gt;

&lt;p&gt;Physical sensors are influenced by environmental conditions and internal circuit resistance that favor one electrical state over another. Unlike a mathematical pseudo-random number generator (PRNG), a hardware source is a physical device subject to manufacturing defects and external interference.&lt;/p&gt;

&lt;p&gt;I want you to imagine you’re building a microservice that relies on an internal hardware RNG. If that hardware is even slightly more sensitive to a certain voltage threshold, it will produce more ones than zeros. This bias can be subtle—perhaps only a 1% shift—but in the world of security, that shift is enough to weaken the randomness of every session key your service generates.&lt;/p&gt;

&lt;h2&gt;
  
  
  How do you turn a weighted coin flip into a 50/50 result?
&lt;/h2&gt;

&lt;p&gt;I group incoming bits into pairs and discard any pair where the bits match. I only output a single bit when I see a transition—either a zero followed by a one, or a one followed by a zero.&lt;/p&gt;

&lt;p&gt;This logic ensures that the output remains fair even if the source is heavily biased. Here is the mapping logic I use to clean the stream:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;First Bit&lt;/th&gt;
&lt;th&gt;Second Bit&lt;/th&gt;
&lt;th&gt;Action&lt;/th&gt;
&lt;th&gt;Result&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;Discard&lt;/td&gt;
&lt;td&gt;(None)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;Discard&lt;/td&gt;
&lt;td&gt;(None)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;Accept&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;Accept&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Why does this trick guarantee a perfect probability split?
&lt;/h2&gt;

&lt;p&gt;The reason I love this trick is that the math is elegantly simple: p * q is always equal to q * p. Even if your source favors one side, the probability of seeing two different bits in one order is mathematically identical to seeing them in the reverse order.&lt;/p&gt;

&lt;p&gt;Let’s say I am looking at a broken hardware sensor that lands on heads (1) 75% of the time and tails (0) 25% of the time. &lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The probability of (1,1) is 0.75 * 0.75 (0.5625) -&amp;gt; Discarded.&lt;/li&gt;
&lt;li&gt;The probability of (0,0) is 0.25 * 0.25 (0.0625) -&amp;gt; Discarded.&lt;/li&gt;
&lt;li&gt;The probability of (0,1) is 0.25 * 0.75 (0.1875) -&amp;gt; Result: 0.&lt;/li&gt;
&lt;li&gt;The probability of (1,0) is 0.75 * 0.25 (0.1875) -&amp;gt; Result: 1.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Since 0.1875 is exactly equal to 0.1875, I get an exactly 50% chance of getting a 0 or a 1. The original bias doesn't change the fact that the two mixed outcomes are equally likely.&lt;/p&gt;
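&lt;p&gt;This claim is easy to check empirically. The sketch below uses a seeded PRNG as a stand-in for the biased hardware source; the function names are mine.&lt;/p&gt;

```python
import random

def biased_bits(p_one, rng):
    # Stand-in for a skewed hardware source: emits 1 with probability p_one.
    while True:
        yield int(rng.random() >= 1.0 - p_one)

def von_neumann(bits):
    # Pair the bits; discard 00 and 11; keep the first bit of 01 or 10.
    while True:
        x, y = next(bits), next(bits)
        if x != y:
            yield x

rng = random.Random(0)
extractor = von_neumann(biased_bits(0.75, rng))
sample = [next(extractor) for _ in range(20_000)]
ones = sum(sample) / len(sample)
# A 75/25 source comes out within a fraction of a percent of 50/50.
assert round(ones, 1) == 0.5
```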

&lt;h2&gt;
  
  
  What are the performance trade-offs of unbiasing?
&lt;/h2&gt;

&lt;p&gt;The primary trade-off I see is throughput; I am forced to throw away a massive amount of raw data, which can lead to entropy starvation in systems like Linux. When the entropy pool in &lt;code&gt;/dev/random&lt;/code&gt; runs dry, the OS blocks, which can halt a deployment or stall a cryptographic handshake.&lt;/p&gt;

&lt;p&gt;In that 75/25 bias scenario, I am discarding 62.5% of the raw bits. If I have a system generating long-term SSH host keys during a cloud instance boot-up, this discarding logic can cause a visible hang. I've seen setup scripts stop and deployment pipelines stall because the system was waiting on a hardware-accelerated source that was too biased to keep up with the demand. When I implement this in firmware, I keep the logic as lean as possible:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="k"&gt;def&lt;/span&gt; &lt;span class="nf"&gt;unbias&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;bit_stream&lt;/span&gt;&lt;span class="p"&gt;):&lt;/span&gt;
    &lt;span class="k"&gt;while&lt;/span&gt; &lt;span class="bp"&gt;True&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
        &lt;span class="n"&gt;x&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;y&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nf"&gt;next&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;bit_stream&lt;/span&gt;&lt;span class="p"&gt;),&lt;/span&gt; &lt;span class="nf"&gt;next&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;bit_stream&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
        &lt;span class="k"&gt;if&lt;/span&gt; &lt;span class="n"&gt;x&lt;/span&gt; &lt;span class="o"&gt;!=&lt;/span&gt; &lt;span class="n"&gt;y&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
            &lt;span class="k"&gt;return&lt;/span&gt; &lt;span class="n"&gt;x&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Does this work if the hardware bias changes over time?
&lt;/h3&gt;

&lt;p&gt;No. This algorithm relies on the probability (p) remaining constant across both bits in the pair. If the bias is drifting rapidly—for instance, if I am looking at a sensor that is overheating and its 1/0 ratio is swinging wildly every millisecond—the unbiasing effect breaks down and I may still end up with skewed output.&lt;/p&gt;

&lt;h3&gt;
  
  
  What happens if the bits are correlated?
&lt;/h3&gt;

&lt;p&gt;If the bits are not independent—meaning a 1 is more likely to be followed by another 1 (autocorrelation)—this trick fails. In those cases, I would typically use a cryptographic hash function like SHA-256 as an entropy extractor to flatten the distribution and remove the patterns.&lt;/p&gt;

&lt;h3&gt;
  
  
  Is there a more efficient way to extract bits?
&lt;/h3&gt;

&lt;p&gt;Yes, algorithms like the Peres or Elias strategies extract more entropy by looking at longer bit sequences. However, I rarely use them because they require complex state management and larger memory buffers. Von Neumann is my go-to for low-level work because it requires zero memory and can be implemented with a simple loop or a few logic gates.&lt;/p&gt;

</description>
      <category>computerscience</category>
      <category>algorithms</category>
      <category>softwareengineering</category>
    </item>
    <item>
      <title>Why Your Computer Can't Just Pick a Number: Navigating the Spectrum of Randomness</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:09:24 +0000</pubDate>
      <link>https://forem.com/doogal/why-your-computer-cant-just-pick-a-number-navigating-the-spectrum-of-randomness-d5a</link>
      <guid>https://forem.com/doogal/why-your-computer-cant-just-pick-a-number-navigating-the-spectrum-of-randomness-d5a</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: Computers are deterministic, meaning they struggle to create "true" randomness. I solve this using a spectrum of techniques: Pseudo-Random Number Generators (PRNGs) for logic like gaming, hardware-based True Random Number Generators (TRNGs) for standard security, and quantum systems for absolute cryptographic unpredictability where physics guarantees the result.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;One of the best things about computers is that they do exactly what you tell them. And one of the worst things about computers is that they will do exactly what you tell them. If I want a random number, I’m immediately running into a wall because machines are deterministic by design. They don't "guess"; they calculate.&lt;/p&gt;

&lt;p&gt;When I need a random result, I can just flip a coin or roll a die. It’s messy, physical, and easy. But for a computer, providing a random value requires breaking its own internal logic to find a source of chaos. Depending on the stakes—whether I'm building a loot drop for an RPG or a high-level encryption layer—I have to choose the right level of randomness.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why is generating a random number so hard for a computer?
&lt;/h2&gt;

&lt;p&gt;Computers are deterministic systems, meaning if I give a machine the same input and state, it will produce the exact same output every single time. Because a computer lacks the natural "messiness" of a human, it cannot generate a truly random value without an external source of noise.&lt;/p&gt;

&lt;p&gt;I’ve found that many engineers overlook how rigid our hardware really is. Every operation is the result of a defined instruction set. If I ask a function to return a random value, that function has to execute logic. And if that logic is based on math, it’s reproducible. To get something that feels random, I have to point the computer at something it can't control.&lt;/p&gt;

&lt;h2&gt;
  
  
  What is a pseudo random number generator and when should I use one?
&lt;/h2&gt;

&lt;p&gt;A Pseudo-Random Number Generator (PRNG) is a deterministic algorithm that takes a starting "seed" and runs it through a formula to produce a sequence of numbers that appear random. While the output looks chaotic to a user, the entire sequence is actually fixed and will repeat perfectly if I use the same seed twice.&lt;/p&gt;

&lt;p&gt;I use PRNGs for the vast majority of my work—specifically in areas like video games or UI testing. If I’m building a game like Minecraft, I actually want this determinism; it’s what allows players to share a "world seed" and see the exact same terrain. For standard tasks like calling &lt;code&gt;Math.random()&lt;/code&gt;, a PRNG is plenty, but I have to remember that if an attacker knows my seed, they can predict every "random" number that follows.&lt;/p&gt;
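&lt;p&gt;Python's &lt;code&gt;random&lt;/code&gt; module makes this determinism easy to demonstrate; the "world seed" idea is just a seeded generator.&lt;/p&gt;

```python
import random

# Two generators seeded identically produce identical "random" streams.
world_a = random.Random("my-world-seed")
world_b = random.Random("my-world-seed")

terrain_a = [world_a.randint(0, 255) for _ in range(10)]
terrain_b = [world_b.randint(0, 255) for _ in range(10)]
assert terrain_a == terrain_b  # same seed, same world

# A different seed yields an unrelated sequence.
other = random.Random("other-seed")
assert [other.randint(0, 255) for _ in range(10)] != terrain_a
```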

&lt;h2&gt;
  
  
  How does a true random number generator harvest physical entropy?
&lt;/h2&gt;

&lt;p&gt;True Random Number Generators (TRNGs) move beyond algorithms by harvesting entropy from physical chaos within the hardware, such as CPU temperature fluctuations or the nanosecond timing of hardware interrupts. Instead of calculating a number, the system is essentially "measuring" the noise of the physical world.&lt;/p&gt;

&lt;p&gt;I’ve seen people point to simple system time as a source of truth, but let’s be clear: system time is usually just a seed for a PRNG. To get to the TRNG level, I’m looking for hardware "jitter." These are the tiny, unpredictable micro-fluctuations in thermal noise or the exact moment a packet hits a network card. This is the standard for things like gambling websites, where I need to ensure that no amount of reverse-engineering can reveal a pattern in the deck shuffle.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Level of Randomness&lt;/th&gt;
&lt;th&gt;Source&lt;/th&gt;
&lt;th&gt;Predictability&lt;/th&gt;
&lt;th&gt;Best Use Case&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Pseudo (PRNG)&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Seeded Algorithms&lt;/td&gt;
&lt;td&gt;High (if seed is known)&lt;/td&gt;
&lt;td&gt;Games, UI, Simulations&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;True (TRNG)&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Hardware Entropy (Heat/Jitter)&lt;/td&gt;
&lt;td&gt;Very Low&lt;/td&gt;
&lt;td&gt;Gambling, SSL Certificates&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Quantum (QRNG)&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Subatomic Particles&lt;/td&gt;
&lt;td&gt;Zero&lt;/td&gt;
&lt;td&gt;High-Stakes Cryptography&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Do I need quantum randomness for secure cryptography?
&lt;/h2&gt;

&lt;p&gt;Quantum randomness is the gold standard used when the stakes are high enough that I need unpredictability guaranteed by the laws of physics. This involves measuring events at the subatomic level—like sending photons through a half-silvered mirror—where the outcome is a literal 50/50 probability.&lt;/p&gt;

&lt;p&gt;In the world of cryptography, "good enough" usually isn't enough. If there is even a slight statistical bias in my random number source, a sophisticated attacker can exploit it to break an encryption key. By reaching into the realm of quantum mechanics, I ensure that the randomness is genuine and absolute. It moves the security of the system from a software challenge to a physical certainty.&lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Is the random number generator in my programming language secure?&lt;/strong&gt;&lt;br&gt;
Generally, no. Most default functions like &lt;code&gt;Math.random()&lt;/code&gt; or &lt;code&gt;rand()&lt;/code&gt; are PRNGs designed for speed, not security. If I’m generating a password or a session token, I always reach for a cryptographically secure library like the Web Crypto API or &lt;code&gt;crypto/rand&lt;/code&gt; in Go.&lt;/p&gt;
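&lt;p&gt;In Python, the analogous reach is the &lt;code&gt;secrets&lt;/code&gt; module, which draws from the operating system's CSPRNG instead of the seeded Mersenne Twister behind &lt;code&gt;random&lt;/code&gt;:&lt;/p&gt;

```python
import secrets

# Session token: 32 bytes of OS-sourced randomness, hex-encoded.
token = secrets.token_hex(32)
assert len(token) == 64  # two hex characters per byte

# Unbiased integer from a range, suitable for nonces or key material.
nonce = secrets.randbelow(2**128)

# Constant-time comparison avoids timing side channels when checking tokens.
assert secrets.compare_digest(token, token)
```

&lt;p&gt;&lt;code&gt;compare_digest&lt;/code&gt; matters when verifying tokens: it compares in constant time, so an attacker can't use response timing to guess a secret byte by byte.&lt;/p&gt;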

&lt;p&gt;&lt;strong&gt;What happens if I use the same seed in a PRNG?&lt;/strong&gt;&lt;br&gt;
I will get the exact same sequence of "random" numbers every time. This is a common pitfall in testing; if I don't vary my seed (often by using the current timestamp), my "random" tests will actually be testing the exact same path over and over.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Where does a headless server get its entropy if there’s no user input?&lt;/strong&gt;&lt;br&gt;
Modern servers gather entropy from hardware sources like the RDRAND instruction on Intel CPUs or interrupt timings from the disk and network. If a system runs out of this entropy, it can actually "starve," causing processes that require high-quality randomness to hang until more chaos is harvested.&lt;/p&gt;

</description>
      <category>computerscience</category>
      <category>cryptography</category>
      <category>algorithms</category>
    </item>
    <item>
      <title>The 'Top 1%' Hiring Myth: It’s a Ratio, Not a Talent Rank</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:07:51 +0000</pubDate>
      <link>https://forem.com/doogal/the-top-1-hiring-myth-its-a-ratio-not-a-talent-rank-22el</link>
      <guid>https://forem.com/doogal/the-top-1-hiring-myth-its-a-ratio-not-a-talent-rank-22el</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: When a company claims to hire the "top 1%," they are describing a recruitment ratio—one hire for every 100 CVs—not an objective ranking of talent. Engineering skill is context-dependent, meaning the "elite" candidate for a kernel-heavy infrastructure team is often a poor fit for a product-focused startup.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;When a company says they only hire the top 1%, it’s a marketing flex: it justifies premium prices to VCs and hands out ego boosts to candidates. It suggests there’s a master leaderboard of engineers where every candidate is boiled down to a single "talent" attribute. This is a statistical sleight of hand. They want you to think they’ve found the best human in the pile, but the reality is much more mundane.&lt;/p&gt;

&lt;h2&gt;
  
  
  What does it actually mean to hire the top 1%?
&lt;/h2&gt;

&lt;p&gt;It means the company received 100 CVs and hired one person. This is a measure of recruitment volume and filtering intensity, not a scientific ranking of engineering capability.&lt;/p&gt;

&lt;p&gt;In a hypothetical lineup of 100 developers, the "top 1%" pitch implies a linear ranking where candidate #1 is objectively superior to #2. But engineering talent isn't a scalar value. If a team needs a Rust specialist to write memory-safe kernels, a world-class React developer who can ship a feature in two hours is effectively useless to them. Both are elite in their domains, but they aren't interchangeable on a single scale. The "1%" label is just the result of a specific filter applied to a specific pile of resumes.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why is technical talent subjective across different companies?
&lt;/h2&gt;

&lt;p&gt;Every engineering team over-indexes on specific pain points, meaning one firm’s "perfect hire" is another firm’s "hard pass." Talent is context-dependent, shifting based on whether a team needs product intuition, deep infrastructure knowledge, or client-facing communication.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Company Need&lt;/th&gt;
&lt;th&gt;Priority Trait&lt;/th&gt;
&lt;th&gt;Engineering Profile&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Rapid Prototyping&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Product Sense&lt;/td&gt;
&lt;td&gt;High-velocity delivery over perfect abstractions.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;High-Scale Infra&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Technical Depth&lt;/td&gt;
&lt;td&gt;Focus on latency, concurrency, and low-level optimization.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Technical Consulting&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Communication&lt;/td&gt;
&lt;td&gt;Translating technical debt into stakeholder risk.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Early-Stage Growth&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Generalist&lt;/td&gt;
&lt;td&gt;Polyglot capable of jumping from CSS to DB migrations.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  How does random chance impact the "Top 1%" claim?
&lt;/h2&gt;

&lt;p&gt;Significantly. Hiring is as much about the interviewer’s mood and niche biases as it is about your GitHub streak.&lt;/p&gt;

&lt;p&gt;It is common for the same cohort of 100 candidates to produce completely different "winners" at different companies. One developer might land the offer because they happen to be using the exact library the lead dev is currently struggling with. Another might get rejected because the interviewer has an irrational vendetta against a specific framework. Because different companies prioritize different signals, the "top 1%" label is just a byproduct of whoever happened to fit that week's specific requirements and the interviewers' individual preferences.&lt;/p&gt;

&lt;h2&gt;
  
  
  Is the recruitment process an objective talent filter?
&lt;/h2&gt;

&lt;p&gt;No, it is a matching process that frequently mistakes coincidence for quality. Different companies have different priorities, and the "top 1%" at one company might be at the bottom of the list for another based solely on the tech stack or the product philosophy.&lt;/p&gt;

&lt;p&gt;You could take the same 100 candidates, send them to ten different companies, and walk away with ten different "top 1%" hires. All ten companies would claim they found the elite tier, but in reality, they just found the person who best matched their specific, biased requirements at that exact moment in time. &lt;/p&gt;

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Why do big tech companies use the 1% metric?&lt;/strong&gt;&lt;br&gt;
It manufactures scarcity and maintains a "premium" brand image. This attracts a high volume of applicants, which ironically makes the ratio even smaller and reinforces the claim.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Is there such a thing as an objectively "elite" engineer?&lt;/strong&gt;&lt;br&gt;
There are high-impact engineers, but their status is usually a result of being in an environment where their specific skills act as force multipliers. An elite systems architect is just another dev in a team that only needs basic CRUD apps.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Should I tailor my profile for "top tier" companies?&lt;/strong&gt;&lt;br&gt;
Yes, because they aren't looking for "talent" in the abstract; they are looking for a specific set of attributes—like product sense or specific language depth—that solve their immediate technical hurdles.&lt;/p&gt;

</description>
      <category>softwareengineering</category>
      <category>techhiring</category>
      <category>careeradvice</category>
    </item>
    <item>
      <title>Git Internals: Why Your Commits Aren't Actually Diffs</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:06:31 +0000</pubDate>
      <link>https://forem.com/doogal/git-internals-why-your-commits-arent-actually-diffs-1dhj</link>
      <guid>https://forem.com/doogal/git-internals-why-your-commits-arent-actually-diffs-1dhj</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: Git is a content-addressable filesystem that stores project states as full snapshots rather than incremental deltas. Every object—blobs, trees, and commits—is identified by a unique SHA-1 hash of its content, creating an immutable chain where any change to a single byte results in entirely new objects.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;You see green and red lines in a pull request and assume Git stores diffs. It doesn't. I view Git as a persistent key-value store where the key is a hash and the value is your data. When I commit code, I am not saving a list of changes; I am saving a snapshot of the entire project state at that exact moment.&lt;/p&gt;

&lt;h2&gt;
  
  
  How does Git store actual file content?
&lt;/h2&gt;

&lt;p&gt;Git ignores filenames and stores raw data as 'blobs' named after their own SHA-1 hashes.&lt;/p&gt;

&lt;p&gt;When I create a file called &lt;code&gt;hello.txt&lt;/code&gt; with the content "hello world" and add it to a repo, Git hashes that string (prefixed with a small &lt;code&gt;blob&lt;/code&gt; header) and creates a blob. If I look inside the &lt;code&gt;.git/objects&lt;/code&gt; directory, I can see exactly how this is stored. Git takes the first two characters of the hash to create a directory and uses the remaining 38 characters as the filename. For example, the well-known empty-blob hash, which starts with &lt;code&gt;e69de2&lt;/code&gt;, lives at &lt;code&gt;.git/objects/e6/9de29...&lt;/code&gt;. This is the core of content-addressable storage: the address of the data is derived from the data itself. If I change a single character in that file, the hash changes, and Git writes an entirely new blob file.&lt;/p&gt;
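&lt;p&gt;The blob-naming scheme can be reproduced with nothing but a hash library; the object id is the SHA-1 of a short header plus the raw content. (Real Git also zlib-compresses the object before writing it to disk.) The helper names below are mine.&lt;/p&gt;

```python
import hashlib

def git_blob_id(content: bytes) -> str:
    # Git hashes "blob {size}\0" followed by the raw bytes, not the bytes alone.
    header = f"blob {len(content)}\0".encode()
    return hashlib.sha1(header + content).hexdigest()

def object_path(oid: str) -> str:
    # First two hex chars become the directory, the remaining 38 the filename.
    return f".git/objects/{oid[:2]}/{oid[2:]}"

# The empty blob famously hashes to e69de2..., matching `git hash-object`
# on an empty file.
assert git_blob_id(b"") == "e69de29bb2d1d6434b8b29ae775ad8c2e48c5391"
assert object_path(git_blob_id(b"")).startswith(".git/objects/e6/9de29")

# Changing a single byte yields a completely different id, and thus a new path.
assert git_blob_id(b"hello world") != git_blob_id(b"hello world!")
```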

&lt;h2&gt;
  
  
  What role does a tree object play?
&lt;/h2&gt;

&lt;p&gt;A tree object defines the project structure by mapping human-readable filenames to their specific blob or sub-tree hashes.&lt;/p&gt;

&lt;p&gt;I think of a tree as a simple directory listing. Each line records file permissions, the object type, the SHA-1 hash, and the filename. This architecture is why I can rename a file without Git needing to copy the actual file data. The blob hash remains identical because the content "hello world" hasn't changed; I have only updated the tree object to point that same hash to a new filename. Because trees are also named after the hash of their content, any change to a filename or a sub-directory hash results in a brand-new tree hash.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Object&lt;/th&gt;
&lt;th&gt;Data Responsibility&lt;/th&gt;
&lt;th&gt;Identity Hash Source&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Blob&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Stores raw file bytes&lt;/td&gt;
&lt;td&gt;The literal file content&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Tree&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Maps names to hashes&lt;/td&gt;
&lt;td&gt;The directory list content&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Commit&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Links trees to parents&lt;/td&gt;
&lt;td&gt;The metadata and tree pointer&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  What happens when a file is modified?
&lt;/h2&gt;

&lt;p&gt;Modifying a single byte triggers a cascade where a new blob, a new tree, and a new commit are all created with unique hashes.&lt;/p&gt;

&lt;p&gt;If I update &lt;code&gt;hello.txt&lt;/code&gt; from "hello world" to "hello world, how are you?", the system rebuilds the state of the world. Git writes a new blob for the updated string. Because the hash for &lt;code&gt;hello.txt&lt;/code&gt; is now different, the tree containing it must be updated, resulting in a new tree hash. Finally, I create a new commit pointing to that new root tree. This commit also stores the hash of its parent commit. This pointer is what creates the chain we call history. Because the parent hash is part of the commit's content, if I change one bit in an old commit, its hash changes, breaking every subsequent hash in the chain. This immutability is why Git history is so mathematically consistent.&lt;/p&gt;
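&lt;p&gt;The cascade can be modelled as a toy hash chain. The format below is illustrative, not Git's actual commit layout, but the chaining property is the same.&lt;/p&gt;

```python
import hashlib

def toy_commit_id(tree_id: str, parent_id: str, message: str) -> str:
    # The parent id is part of the hashed content, so history is a hash chain.
    body = f"tree {tree_id}\nparent {parent_id}\n\n{message}".encode()
    return hashlib.sha1(body).hexdigest()

root = toy_commit_id("tree-a", "none", "initial commit")
child = toy_commit_id("tree-b", root, "add feature")

# Tampering with the root commit changes its id...
tampered_root = toy_commit_id("tree-a", "none", "initial commit (edited)")
assert tampered_root != root
# ...which changes every descendant id, breaking the rest of the chain.
assert toy_commit_id("tree-b", tampered_root, "add feature") != child
```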

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Does Git's snapshot model waste a lot of disk space?
&lt;/h3&gt;

&lt;p&gt;Not as much as you might think. Git periodically runs a garbage collection process (&lt;code&gt;git gc&lt;/code&gt;) that packs objects into compressed files. While it uses delta compression for physical storage, Git maintains the snapshot model at the logical level, ensuring data retrieval is fast and consistent.&lt;/p&gt;

&lt;h3&gt;
  
  
  How does Git know if a file hasn't changed?
&lt;/h3&gt;

&lt;p&gt;When I run a commit, Git compares the current hash of a file's content to the hash stored in the previous tree. If they match, Git simply reuses the existing hash in the new tree object rather than creating a redundant blob.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why are commit hashes unique across different machines?
&lt;/h3&gt;

&lt;p&gt;Since the hash is derived from the content—including the tree hash, author, timestamp, and parent hash—the identity is unique to that specific snapshot. This allows developers to work asynchronously without a central server assigning version numbers.&lt;/p&gt;

</description>
      <category>git</category>
      <category>gitinternals</category>
      <category>versioncontrol</category>
    </item>
    <item>
      <title>Your JavaScript Array is a Hash Map in Disguise</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:04:58 +0000</pubDate>
      <link>https://forem.com/doogal/your-javascript-array-is-a-hash-map-in-disguise-353d</link>
      <guid>https://forem.com/doogal/your-javascript-array-is-a-hash-map-in-disguise-353d</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: JavaScript arrays are fundamentally objects where integer keys are treated as strings. To save performance, engines like V8 attempt to optimize these into contiguous memory blocks (Elements Kinds). However, mixing types or creating sparse "holes" triggers a de-optimization to Dictionary Mode, moving data from the 1ns CPU cache to 100ns RAM lookups.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If you have ever wondered why &lt;code&gt;typeof []&lt;/code&gt; returns &lt;code&gt;"object"&lt;/code&gt;, the answer isn't just "JavaScript is weird." It is an architectural warning. In the underlying C++ of the V8 engine, arrays are not fixed-size contiguous memory buffers by default. They are hash maps—specifically, exotic objects that map integer keys to values. While this makes JS incredibly flexible, it creates a massive performance hurdle that the engine has to work overtime to solve.&lt;/p&gt;

&lt;h3&gt;
  
  
  Why does typeof return "object" for a JavaScript array?
&lt;/h3&gt;

&lt;p&gt;JavaScript arrays are keyed collections that inherit from the &lt;code&gt;Object&lt;/code&gt; prototype, meaning they are essentially specialized objects where the keys are property names. Even though we access elements using &lt;code&gt;arr[0]&lt;/code&gt;, the engine internally treats that index as a string key &lt;code&gt;"0"&lt;/code&gt; to maintain compliance with the language specification.&lt;/p&gt;

&lt;p&gt;Under the hood, this means a standard array doesn't have a guaranteed memory layout. In a language like C, an array of four integers is a single 16-byte block of memory. In JavaScript, a naive array is a collection of pointers scattered across the heap. To find an element, the engine has to perform a hash lookup, which is computationally expensive compared to a simple memory offset. This architectural choice is why V8 spends so much effort trying to "guess" when it can treat your array like a real, contiguous block of memory.&lt;/p&gt;
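
&lt;p&gt;You can observe the string-key behavior directly in any JavaScript console; the numeric index and its string form address the same property:&lt;/p&gt;

```javascript
const arr = [10, 20, 30];

console.log(typeof arr);          // "object" — arrays are specialized objects
console.log(arr[0] === arr["0"]); // true — the index is normalized to a string key
console.log(Object.keys(arr));    // ["0", "1", "2"] — property names, not offsets
```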

&lt;h3&gt;
  
  
  How does V8 use "Elements Kinds" to optimize performance?
&lt;/h3&gt;

&lt;p&gt;V8 uses a system called Elements Kinds to track the internal structure of an array, attempting to store data in the most efficient C++ representation possible. If you create an array of small integers, V8 labels it &lt;code&gt;PACKED_SMI_ELEMENTS&lt;/code&gt; and stores it as a contiguous block of memory, allowing the CPU to access it with near-zero overhead.&lt;/p&gt;

&lt;p&gt;This optimization is all about hardware efficiency. When data is contiguous, it lives in the CPU's L1 or L2 cache. The CPU can use "prefetching" to load the next few elements into the cache before your code even asks for them. Retrieval from the cache takes about 1 nanosecond. However, if the array becomes a hash map (Dictionary Mode), the CPU has to engage in "pointer chasing." It must go all the way to the system RAM—which can take 100 nanoseconds or more—to find the memory address of the next bucket in the hash map. That 100x latency jump is the hidden tax of unoptimized JavaScript.&lt;/p&gt;

&lt;h3&gt;
  
  
  What triggers the transition to Dictionary Mode?
&lt;/h3&gt;

&lt;p&gt;The transition from a fast, packed array to a slow hash map is often a one-way street. V8 starts with the most optimized state and "downgrades" the array as you introduce complexity, such as mixing data types or creating large gaps between indices.&lt;/p&gt;

&lt;p&gt;If you have a &lt;code&gt;PACKED_SMI_ELEMENTS&lt;/code&gt; array and you push a floating-point number into it, the engine transitions it to &lt;code&gt;PACKED_DOUBLE_ELEMENTS&lt;/code&gt;. If you then push a string, it becomes &lt;code&gt;PACKED_ELEMENTS&lt;/code&gt; (a generic array of pointers). The most destructive action, however, is creating a "holey" array. If you define &lt;code&gt;let a = [1, 2, 3]&lt;/code&gt; and then suddenly set &lt;code&gt;a[1000] = 4&lt;/code&gt;, V8 refuses to allocate 997 empty memory slots. Instead, it panics and converts the entire structure into &lt;code&gt;DICTIONARY_ELEMENTS&lt;/code&gt;. Once an array is downgraded to a dictionary, it rarely—if ever—gets promoted back to a packed state.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight javascript"&gt;&lt;code&gt;&lt;span class="c1"&gt;// Starts as PACKED_SMI_ELEMENTS (Fastest)&lt;/span&gt;
&lt;span class="kd"&gt;const&lt;/span&gt; &lt;span class="nx"&gt;arr&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;1&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;2&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;3&lt;/span&gt;&lt;span class="p"&gt;];&lt;/span&gt; 

&lt;span class="c1"&gt;// Transitions to PACKED_DOUBLE_ELEMENTS&lt;/span&gt;
&lt;span class="nx"&gt;arr&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;push&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="mf"&gt;1.5&lt;/span&gt;&lt;span class="p"&gt;);&lt;/span&gt; 

&lt;span class="c1"&gt;// Transitions to DICTIONARY_ELEMENTS (Hash Map)&lt;/span&gt;
&lt;span class="c1"&gt;// This creates a 'hole', triggering a full de-optimization&lt;/span&gt;
&lt;span class="nx"&gt;arr&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;1000&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="mi"&gt;42&lt;/span&gt;&lt;span class="p"&gt;;&lt;/span&gt; 
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h3&gt;
  
  
  Why do these 100ns delays matter in intensive tasks?
&lt;/h3&gt;

&lt;p&gt;In standard UI development, a 100ns delay is invisible. However, in high-throughput backend processing or 60fps graphical programming, these delays are catastrophic. In a &lt;code&gt;requestAnimationFrame&lt;/code&gt; loop, you have a hard limit of 16.6ms to finish all calculations. If you are iterating over thousands of "arrays" that are actually hash maps, the constant round-trips to RAM will eat your frame budget and cause visible stuttering.&lt;/p&gt;

&lt;p&gt;Similarly, if you are building a data-intensive microservice that processes millions of JSON objects, the cumulative cost of hash map lookups instead of direct memory access can result in a 10x or 100x decrease in total throughput. This is why tools like TensorFlow.js or high-performance game engines use &lt;code&gt;TypedArrays&lt;/code&gt; (like &lt;code&gt;Int32Array&lt;/code&gt;), which bypass this "Elements Kind" guessing game entirely and force the engine to use contiguous memory.&lt;/p&gt;
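
&lt;p&gt;A quick sketch of why &lt;code&gt;TypedArrays&lt;/code&gt; sidestep the guessing game: the element type and length are fixed at construction, so values are coerced on write and the backing store stays contiguous no matter what you do:&lt;/p&gt;

```javascript
const ints = new Int32Array(4); // fixed length, fixed element type

ints[0] = 7;
ints[1] = 1.9;   // silently truncated to 1 — the store only holds 32-bit ints
ints[100] = 42;  // out-of-bounds writes are ignored; no "holes" can ever form

console.log(ints[1]);     // 1
console.log(ints.length); // 4
```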

&lt;h3&gt;
  
  
  V8 Array State Transitions
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;State&lt;/th&gt;
&lt;th&gt;Description&lt;/th&gt;
&lt;th&gt;Latency&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;PACKED_SMI&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Contiguous small integers&lt;/td&gt;
&lt;td&gt;~1ns (Cache)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;PACKED_DOUBLE&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Contiguous floats&lt;/td&gt;
&lt;td&gt;~1ns (Cache)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;HOLEY_ELEMENTS&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Array with missing indices&lt;/td&gt;
&lt;td&gt;Variable (Slower)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;DICTIONARY_ELEMENTS&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Pure Hash Map (De-optimized)&lt;/td&gt;
&lt;td&gt;~100ns (RAM)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  FAQ
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;How can I prevent my arrays from becoming hash maps?&lt;/strong&gt;&lt;br&gt;
Initialize your arrays with their final size if possible and avoid "holey" assignments. Most importantly, keep your arrays monomorphic—meaning, don't mix integers, strings, and objects in the same collection.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Are TypedArrays immune to this de-optimization?&lt;/strong&gt;&lt;br&gt;
Yes. &lt;code&gt;Int32Array&lt;/code&gt;, &lt;code&gt;Float64Array&lt;/code&gt;, and others are backed by an &lt;code&gt;ArrayBuffer&lt;/code&gt;. They have a fixed length and a fixed type, which guarantees they stay as contiguous blocks of memory regardless of what you do with the data.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Does deleting an element make an array a hash map?&lt;/strong&gt;&lt;br&gt;
Using the &lt;code&gt;delete&lt;/code&gt; keyword on an array index (e.g., &lt;code&gt;delete arr[2]&lt;/code&gt;) creates a hole, which transitions the array to a &lt;code&gt;HOLEY&lt;/code&gt; state. While it might not immediately hit Dictionary Mode, it significantly slows down access because the engine must now check the prototype chain for that missing index.&lt;/p&gt;

</description>
      <category>javascript</category>
      <category>v8engine</category>
      <category>webdev</category>
    </item>
    <item>
      <title>Scalable Proximity Search: Why Geohashing Beats Radius Queries</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:02:40 +0000</pubDate>
      <link>https://forem.com/doogal/scalable-proximity-search-why-geohashing-beats-radius-queries-2d5h</link>
      <guid>https://forem.com/doogal/scalable-proximity-search-why-geohashing-beats-radius-queries-2d5h</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: Geohashing maps 2D coordinates to a 1D string using recursive binary partitioning and bit interleaving. By encoding these bits into a Base-32 string, we leverage B-Tree prefix matching for efficient spatial lookups, bypassing the high CPU costs of Haversine distance calculations at scale.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Calculating Haversine distances for 10,000 moving objects per tick is a disaster for database performance. Standard SQL radius queries aren't built for high-concurrency spatial updates; they are computationally heavy and fail to scale because they require calculating the distance between the query point and every potential candidate in the dataset. To keep latency low, you need to stop thinking in floating-point coordinates and start thinking in B-Tree friendly strings. This is where geohashing comes in, moving the heavy lifting from the CPU to the database index.&lt;/p&gt;

&lt;h3&gt;
  
  
  How does binary partitioning resolve geographic coordinates?
&lt;/h3&gt;

&lt;p&gt;Binary partitioning recursively divides the map into smaller quadrants, assigning a 0 or 1 based on whether a point falls in the lower/left or upper/right half of the current bounding box. This creates a deterministic bitstream representing a specific geographic area rather than a precise point.&lt;/p&gt;

&lt;p&gt;I recently posted a video about geohashing because it is one of the most efficient ways to handle real-time location data. We start with latitude. If a coordinate is in the Northern hemisphere, we assign a 1; if it is in the Southern hemisphere, a 0. We then take that hemisphere and split it again: is the point in the upper or lower half of the new section? This recursive halving continues until we reach the desired resolution. We perform the exact same operation for longitude (left/right splits), resulting in two separate binary streams that describe the point's location with increasing precision.&lt;/p&gt;
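
&lt;p&gt;The recursive halving fits in a few lines. This toy function (the name is mine, not from any library) extracts the first few latitude bits by repeatedly bisecting the [-90, 90] range:&lt;/p&gt;

```javascript
// Illustrative sketch: each bit records whether the point sits in the
// upper (1) or lower (0) half of the current latitude range
function latitudeBits(lat, count) {
  let lo = -90, hi = 90;
  let bits = "";
  for (let i = 0; i < count; i++) {
    const mid = (lo + hi) / 2;
    if (lat >= mid) { bits += "1"; lo = mid; }
    else            { bits += "0"; hi = mid; }
  }
  return bits;
}

console.log(latitudeBits(51.5, 4)); // "1100" — Northern hemisphere, then finer halves
```

&lt;p&gt;Running the same loop over the [-180, 180] range produces the longitude stream.&lt;/p&gt;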

&lt;h3&gt;
  
  
  Why do we interleave bitstreams for 1D spatial indexing?
&lt;/h3&gt;

&lt;p&gt;Interleaving, also known as bit-zipping, alternates bits from the latitude and longitude streams to create a single sequence that preserves 2D proximity within a 1D format. This process follows a Morton order curve (or Z-order curve), which maps multi-dimensional data to one dimension while maintaining the locality of the data points.&lt;/p&gt;

&lt;p&gt;In a standard implementation, the first bit of the latitude stream is followed by the first bit of the longitude stream, then the second bit of each, and so on. This ensures that the resulting combined bitstream—and the subsequent geohash string—represents a specific "tile" on the map. Because the bits are interleaved, points that are close together in a 2D space are highly likely to share the same prefix in their 1D bitstream.&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Bit Index (i)&lt;/th&gt;
&lt;th&gt;Latitude Bit (V_i)&lt;/th&gt;
&lt;th&gt;Longitude Bit (H_i)&lt;/th&gt;
&lt;th&gt;Interleaved Result (V_0 H_0 ... V_i H_i)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;10&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;1001&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;100111&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;0&lt;/td&gt;
&lt;td&gt;10011100&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;1001110011&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;
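
&lt;p&gt;The zipping step itself is mechanical. This sketch reproduces the table above from its two input streams, emitting the latitude bit first to match the V_0 H_0 ordering used here:&lt;/p&gt;

```javascript
// Alternate one bit from each stream: V0 H0 V1 H1 ...
function interleave(latBits, lonBits) {
  let out = "";
  for (let i = 0; i < latBits.length; i++) {
    out += latBits[i] + lonBits[i];
  }
  return out;
}

console.log(interleave("10101", "01101")); // "1001110011" — the table's final row
```

&lt;p&gt;Note that many reference geohash implementations emit the longitude bit first; the ordering above simply follows the table's convention.&lt;/p&gt;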

&lt;h3&gt;
  
  
  How does Base-32 encoding optimize database lookups?
&lt;/h3&gt;

&lt;p&gt;Base-32 encoding converts the final interleaved bitstream into a compact, human-readable string using a specific 32-character alphabet (0-9, b-z) that deliberately excludes ambiguous characters like a, i, l, and o. This representation is highly efficient for database indexing because strings are stored lexicographically in B-Trees, allowing for lightning-fast range scans.&lt;/p&gt;

&lt;p&gt;When a taxi app needs to find drivers near you, it does not calculate the distance to every driver in the city. It identifies your geohash—for example, &lt;code&gt;gcpvj0&lt;/code&gt;—and queries the database for any driver whose geohash starts with the same prefix. This turns a complex spatial intersection into a simple string comparison. Since the database index is sorted, the system can find all records within the same geographic tile by performing a single index seek followed by a range scan, which is significantly faster than any geometric calculation.&lt;/p&gt;
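
&lt;p&gt;In application code, the query reduces to a prefix match. A minimal in-memory sketch (the driver hashes are invented) of what the database's index range scan computes:&lt;/p&gt;

```javascript
// Illustrative data — in production these live in an indexed column
const driverHashes = ["gcpvj0du", "gcpvj0e5", "gcpvj1q2", "u10hb521"];

const userCell = "gcpvj0"; // six characters ≈ a neighborhood-sized tile
const nearby = driverHashes.filter(h => h.startsWith(userCell));

console.log(nearby); // ["gcpvj0du", "gcpvj0e5"]
```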

&lt;h3&gt;
  
  
  FAQ
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What is a Z-order curve and how does it relate to geohashing?&lt;/strong&gt;&lt;br&gt;
A Z-order curve is a space-filling curve that visits every point in a multi-dimensional grid while preserving the locality of points. Geohashing uses this curve by interleaving bits; the path the bits follow as you increase precision looks like a repeating 'Z' shape. This is what allows us to represent 2D data in a 1D string index without losing spatial context.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why does the geohash alphabet exclude certain letters?&lt;/strong&gt;&lt;br&gt;
The standard geohash alphabet (the Crockford-inspired Base-32) excludes the letters 'a', 'i', 'l', and 'o' to prevent human transcription errors and character confusion. This makes the hashes more robust when being passed through URLs, logged in debugging consoles, or manually entered by engineers.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How do you handle the 'boundary problem' where nearby points have different hashes?&lt;/strong&gt;&lt;br&gt;
Because the Z-order curve occasionally 'jumps' (for example, at the equator or the prime meridian), two points can be centimeters apart but have entirely different geohash prefixes. To mitigate this, production proximity services do not just search the user's current geohash tile; they calculate and query the eight immediate neighboring tiles as well.&lt;/p&gt;

&lt;p&gt;Cheers.&lt;/p&gt;

</description>
      <category>geohashing</category>
      <category>algorithms</category>
      <category>systemdesign</category>
    </item>
    <item>
      <title>Beyond the Haversine Formula: Why I Use Geohashing for Spatial Search</title>
      <dc:creator>Doogal Simpson</dc:creator>
      <pubDate>Sat, 28 Mar 2026 12:01:18 +0000</pubDate>
      <link>https://forem.com/doogal/beyond-the-haversine-formula-why-i-use-geohashing-for-spatial-search-3p4f</link>
      <guid>https://forem.com/doogal/beyond-the-haversine-formula-why-i-use-geohashing-for-spatial-search-3p4f</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR: Geohashing encodes 2D coordinates into hierarchical string prefixes, transforming expensive O(n) geometric calculations into efficient indexed lookups. By mapping geographic areas to unique string identifiers, databases can execute high-speed prefix scans rather than performing floating-point evaluations on every record. This allows applications to scale proximity searches to millions of concurrent users without hitting CPU bottlenecks.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I have seen too many backends crawl to a halt because they are trying to solve geometry problems at the query layer. The naive approach to finding a nearby taxi or a local restaurant is to store latitude and longitude as floats and then run a radius query. While this works in a development environment with a hundred rows, it fails at production scale because it forces the database to perform an unindexed geometric predicate on every single record.&lt;/p&gt;

&lt;p&gt;When I am architecting a system that needs to handle thousands of concurrent spatial queries, I stop asking the database to do trigonometry. Instead, I treat location as a string matching problem. By moving the complexity from the CPU to the index, I can keep response times low even as the dataset grows into the millions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why is calculating distance in a database expensive?
&lt;/h2&gt;

&lt;p&gt;Proximity is not a property that standard B-Tree indexes can efficiently filter without specialized spatial extensions. This usually forces the database engine to perform an expensive floating-point evaluation on every record in the table to determine if a point falls within the search radius.&lt;/p&gt;

&lt;p&gt;In a standard relational database, an index is great for finding an exact match or a range of numbers. But a radius is a circle, and latitude/longitude are two independent variables. To find objects in that circle, the database has to calculate the distance between your center point and every potential candidate. If I have 100,000 drivers and 1,000 users searching at once, the CPU is effectively pinned just trying to solve high-school geometry millions of times per second. It simply does not scale.&lt;/p&gt;
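
&lt;p&gt;For scale, this is the per-row work a naive radius query repeats for every candidate: a full Haversine evaluation with several trigonometric calls per row.&lt;/p&gt;

```javascript
// Great-circle distance in kilometres via the Haversine formula
function haversineKm(lat1, lon1, lat2, lon2) {
  const R = 6371; // mean Earth radius in km
  const toRad = d => (d * Math.PI) / 180;
  const dLat = toRad(lat2 - lat1);
  const dLon = toRad(lon2 - lon1);
  const a =
    Math.sin(dLat / 2) ** 2 +
    Math.cos(toRad(lat1)) * Math.cos(toRad(lat2)) * Math.sin(dLon / 2) ** 2;
  return 2 * R * Math.asin(Math.sqrt(a));
}

// London to Paris, roughly 343 km
console.log(haversineKm(51.5074, -0.1278, 48.8566, 2.3522));
```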

&lt;h2&gt;
  
  
  How does Geohashing solve the spatial indexing problem?
&lt;/h2&gt;

&lt;p&gt;Geohashing encodes latitude and longitude into a single string by recursively partitioning the map into a grid. This maps 2D spatial data onto a 1D string, where points that are physically close share the same character prefix, allowing the database to bucket locations together.&lt;/p&gt;

&lt;p&gt;I think of it as a recursive subdivision of the world. If I split the map into a grid and label a large square 'B', any point inside that square starts with the letter 'B'. If I divide 'B' into smaller squares and label one 'C', any point in that smaller square now has the prefix 'BC'. As I continue this process, I build a hierarchical prefix tree. A six-character geohash represents a specific neighborhood, while an eight-character hash points to a specific street corner. &lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Geohash Length&lt;/th&gt;
&lt;th&gt;Approximate Area Coverage&lt;/th&gt;
&lt;th&gt;Engineering Use Case&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;5,000km x 5,000km&lt;/td&gt;
&lt;td&gt;Global data sharding&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;4&lt;/td&gt;
&lt;td&gt;39km x 19km&lt;/td&gt;
&lt;td&gt;Regional search / Weather&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;6&lt;/td&gt;
&lt;td&gt;1.2km x 0.6km&lt;/td&gt;
&lt;td&gt;Local delivery / Dispatching&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;8&lt;/td&gt;
&lt;td&gt;38m x 19m&lt;/td&gt;
&lt;td&gt;Precise asset tracking&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Why is prefix matching better than radius math?
&lt;/h2&gt;

&lt;p&gt;Prefix matching turns a complex spatial calculation into a simple index range scan. By querying for a specific string prefix, I am leveraging the database’s primary or secondary index to find nearby points in logarithmic time rather than linear time.&lt;/p&gt;

&lt;p&gt;When I use geohashes, I am no longer asking the database to calculate distances. I am asking it to find every record where the &lt;code&gt;location_hash&lt;/code&gt; starts with a specific string, like &lt;code&gt;bcde&lt;/code&gt;. This is an operation that every modern database, from PostgreSQL to DynamoDB, is built to do at high speed. It essentially turns a spatial query into a standard B-Tree lookup, which is significantly cheaper on the CPU than executing the Haversine formula across the entire dataset.&lt;/p&gt;
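
&lt;p&gt;The prefix query maps directly onto a lexicographic range over the sorted index. A sketch of what &lt;code&gt;location_hash LIKE 'bcde%'&lt;/code&gt; resolves to (the data and the upper-bound sentinel are illustrative):&lt;/p&gt;

```javascript
// A prefix scan is a range scan: everything >= the prefix and < its upper bound
function prefixScan(sortedHashes, prefix) {
  const upper = prefix + "\uffff"; // sentinel above any geohash character
  return sortedHashes.filter(h => h >= prefix && h < upper);
}

const index = ["bcdd9x", "bcde1k", "bcdezz", "bcdf00"]; // kept sorted, as a B-Tree would
console.log(prefixScan(index, "bcde")); // ["bcde1k", "bcdezz"]
```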

&lt;h2&gt;
  
  
  How do you handle points on the edge of a grid?
&lt;/h2&gt;

&lt;p&gt;I handle the "edge case"—where two people are physically close but separated by an arbitrary grid line—by querying the user's current square plus its eight immediate neighbors. This ensures complete coverage without sacrificing the performance gains of the indexed lookup.&lt;/p&gt;

&lt;p&gt;While querying nine squares sounds like more work than querying one, it is still orders of magnitude faster than a full table scan. Most geohashing libraries provide a simple function to calculate these "neighboring" hashes. By fetching these nine prefixes in a single batch, I can guarantee that I never miss a nearby taxi just because it happens to be across the street in a different grid cell.&lt;/p&gt;
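
&lt;p&gt;The batching logic is simple once a neighbor function is available. In this sketch, &lt;code&gt;neighborsOf&lt;/code&gt; is a stand-in for the helper a geohash library provides; the cell names and data are invented:&lt;/p&gt;

```javascript
// Query the centre cell plus its neighbors in one pass.
// `neighborsOf` is a placeholder for a library-provided function.
function nearby(allHashes, centerCell, neighborsOf) {
  const cells = [centerCell, ...neighborsOf(centerCell)];
  return allHashes.filter(h => cells.some(c => h.startsWith(c)));
}

// Toy stand-in: pretend these two cells border the centre cell
const stubNeighbors = () => ["bcdf", "bcdg"];
const hashes = ["bcde12", "bcdf99", "bcdh00"];

console.log(nearby(hashes, "bcde", stubNeighbors)); // ["bcde12", "bcdf99"]
```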

&lt;h2&gt;
  
  
  FAQ
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Can I use Geohashing with NoSQL databases like DynamoDB?&lt;/strong&gt;&lt;br&gt;
Yes, this is a primary use case for Geohashing. Since DynamoDB doesn't have native spatial types, you can store the geohash as a Sort Key to perform efficient prefix scans, which is the only way to do performant "nearby" searches in most NoSQL environments.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How do I decide the length of the geohash to store?&lt;/strong&gt;&lt;br&gt;
I usually store at a high precision (10-12 characters) but query at a lower precision. For a ride-sharing app, I might query at length 6 to get a 1km search area, then do a quick client-side sort to find the absolute closest driver from that filtered subset.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Is Geohashing better than PostGIS?&lt;/strong&gt;&lt;br&gt;
PostGIS is excellent for complex polygons and precise geographic analysis. However, if your only goal is to find "points near me" at massive scale, Geohashing is often easier to implement, cheaper to run, and more portable across different database technologies.&lt;/p&gt;

</description>
      <category>geohashing</category>
      <category>architecture</category>
      <category>databaseoptimization</category>
    </item>
  </channel>
</rss>
