How to Scrape Bluesky Posts: AT Protocol Public Data Extraction with Python
Bluesky is the decentralized social network built on the AT Protocol. Unlike traditional platforms, the AT Protocol is designed to be open — making public data freely accessible without authenticat...

Source: DEV Community
Bluesky is the decentralized social network built on the AT Protocol. Unlike traditional platforms, the AT Protocol is designed to be open — making public data freely accessible without authentication. This is a game-changer for data extraction. Here's how to scrape Bluesky posts and profiles using Python. Why Bluesky Data is Special Open by design: The AT Protocol makes public data accessible via standard APIs No auth required: Public posts, profiles, and feeds are openly available Growing fast: Millions of users migrating from Twitter/X Rich data: Posts, replies, likes, reposts, follows — all accessible Decentralized: Data is portable and not locked behind one corporation Understanding the AT Protocol Each user has a data repository identified by their DID (Decentralized Identifier). The public API endpoints let you read this data directly. import requests import json import time BSKY_PUBLIC_API = "https://public.api.bsky.app" def resolve_handle(handle): """Convert a Bluesky handle t