Forem

Scraping

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Track YC Demo Day Companies in Real Time (with code)

Track YC Demo Day Companies in Real Time (with code)

Comments
5 min read
A Self-Hosted Web Content Extraction API

A Self-Hosted Web Content Extraction API

9
Comments 1
5 min read
Scraping dynamic pages with Python, Playwright and AWS Lambda
Cover image for Scraping dynamic pages with Python, Playwright and AWS Lambda

Scraping dynamic pages with Python, Playwright and AWS Lambda

Comments
4 min read
Scrape vs Crawl vs Map: Picking the Right Anakin API for the Job

Scrape vs Crawl vs Map: Picking the Right Anakin API for the Job

Comments
4 min read
Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources

Architecture of a Rental Aggregator: Scraping and Normalizing 90+ Sources

Comments
4 min read
Browser Sessions: Stateful Web Automation Behind a CDP Connection

Browser Sessions: Stateful Web Automation Behind a CDP Connection

1
Comments
4 min read
How I scraped 50k YouTube subtitles in 2 weeks for $7 (and the legal gray zones)

How I scraped 50k YouTube subtitles in 2 weeks for $7 (and the legal gray zones)

Comments
4 min read
API or browser agent? We picked yes.
Cover image for API or browser agent? We picked yes.

API or browser agent? We picked yes.

Comments
7 min read
ISP proxies, AI crawlers, and the slow death of datacenter IPs: 2026 in numbers

ISP proxies, AI crawlers, and the slow death of datacenter IPs: 2026 in numbers

Comments
8 min read
I Tested 15 LLMs for Web Scraping and Built Heuristics Instead

I Tested 15 LLMs for Web Scraping and Built Heuristics Instead

Comments
3 min read
How I Sniffed Xiaohongshu's Collection API in 90 Seconds — and Why CORS Made Me Rewrite the Whole Approach

How I Sniffed Xiaohongshu's Collection API in 90 Seconds — and Why CORS Made Me Rewrite the Whole Approach

Comments
6 min read
6 Apify actors I actually use myself

6 Apify actors I actually use myself

Comments
3 min read
Anti-bot without the arms race: what Camoufox does differently

Anti-bot without the arms race: what Camoufox does differently

1
Comments
4 min read
Web Crawling e Web Scraping
Cover image for Web Crawling e Web Scraping

Web Crawling e Web Scraping

Comments
3 min read
YouTube Transcript Scraper: 提取视频字幕的免费工具

YouTube Transcript Scraper: 提取视频字幕的免费工具

Comments
1 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.