Forem

# data

Posts

All Data and AI Weekly #242–18 May 2026

4
Comments
8 min read
Backup Photos from Google Photos: A 2026 Guide

Comments 1
10 min read
I scraped every **Show HN** post from May 2025 to May 2026 that crossed **200 points** and ran a quick analysis. There were 334 of them. Here is what landed.

Comments
3 min read
Exporting CRM data is messier than you think: migration scripts for Honeybook, Dubsado, and 17hats

Comments
6 min read
The Scrabble Dictionary Analyzed: 267,751 Words by Length, Letter Frequency & High Scores

Comments
1 min read
Computed business rules in Okyline: what JSON Schema cannot validate

Comments
5 min read
What I learned scraping Bulk URL Status Checker: schema, gotchas and the tooling that worked

Comments
3 min read
I Built a Company Relationship Search Engine (Competitors, Partners, Acquisitions, and More)

Comments
2 min read
Sample dataset analysis: a 100-row snapshot of Sitemap

Comments
3 min read
ETL vs. ELT: Which Approach Should You Use and Why?

1
Comments
2 min read
Palantir: The Company That Turns Data into Operational Decisions

Comments
6 min read
The State of Agentic Commerce — May 2026

Comments
19 min read
Apache Data Lakehouse Weekly: May 7–13, 2026

Comments
17 min read
My 10-Minute Airflow Pitch Approach

Comments
4 min read
Using FastFileLink to Deliver a 3.8GB, 158-File U.S. Department of War UFO Public Archive

Comments
8 min read