Site2MD
Production crawler for Markdown exports

Turn any sitemap into clean Markdown files.

Site2MD recursively parses sitemap indexes, fetches only listed URLs, extracts meaningful content, and creates frontmatter-rich Markdown plus a ZIP archive.

robots.txt-aware, rate-limited, ZIP output

Start a sitemap crawl

Paste a top-level sitemap XML URL. Nested sitemap indexes are followed automatically.

1. Recursive sitemap parsing

Sitemap indexes are followed up to a configurable maximum depth, and duplicate URLs are dropped.
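
To make the idea concrete, here is a minimal Python sketch of depth-limited sitemap traversal with deduplication. It is not Site2MD's actual code: the function name `collect_urls`, the `fetch` callback, and the default `max_depth` of 3 are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from typing import Callable

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def collect_urls(
    sitemap_url: str,
    fetch: Callable[[str], str],  # hypothetical: returns the XML text for a URL
    max_depth: int = 3,           # hypothetical default, not a Site2MD setting
) -> list[str]:
    """Return page URLs from a sitemap, following nested indexes."""
    seen_sitemaps: set[str] = set()
    seen_pages: set[str] = set()
    pages: list[str] = []

    def visit(url: str, depth: int) -> None:
        # Stop at the depth limit and never parse the same sitemap twice.
        if depth > max_depth or url in seen_sitemaps:
            return
        seen_sitemaps.add(url)
        root = ET.fromstring(fetch(url))
        locs = [el.text.strip() for el in root.iter(f"{SITEMAP_NS}loc") if el.text]
        if root.tag == f"{SITEMAP_NS}sitemapindex":
            # An index lists further sitemaps: recurse one level deeper.
            for child in locs:
                visit(child, depth + 1)
        else:
            # A urlset lists pages: keep first occurrences, in order.
            for loc in locs:
                if loc not in seen_pages:
                    seen_pages.add(loc)
                    pages.append(loc)

    visit(sitemap_url, 0)
    return pages
```

Passing `fetch` as a parameter keeps the traversal separate from HTTP concerns such as rate limiting and robots.txt checks, which a real crawler would handle inside that callback.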

2. Clean Markdown conversion

Scripts, navigation, ads, forms, and tracking elements are removed before Turndown conversion.
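
The cleaning step can be sketched as follows. This is an illustrative stand-in written with Python's standard `html.parser`, not the tool's real pipeline: the `STRIP_TAGS` set and the `clean_html` helper are assumptions, it removes elements by tag name only (how ads and trackers are detected is not specified here), and the Turndown conversion itself is not shown.

```python
from html.parser import HTMLParser

# Hypothetical list of elements to drop together with their contents.
STRIP_TAGS = {"script", "style", "noscript", "nav", "form", "aside", "iframe"}


class Cleaner(HTMLParser):
    """Re-emit HTML while skipping everything inside STRIP_TAGS elements."""

    def __init__(self) -> None:
        super().__init__(convert_charrefs=False)
        self.out: list[str] = []
        self.skip_stack: list[str] = []  # currently open stripped elements

    def handle_starttag(self, tag, attrs):
        if tag in STRIP_TAGS:
            self.skip_stack.append(tag)
        elif not self.skip_stack:
            self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> never open a skipped region.
        if not self.skip_stack and tag not in STRIP_TAGS:
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.skip_stack:
            if tag == self.skip_stack[-1]:
                self.skip_stack.pop()
        elif tag not in STRIP_TAGS:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_stack:
            self.out.append(data)

    def handle_entityref(self, name):
        if not self.skip_stack:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_stack:
            self.out.append(f"&#{name};")


def clean_html(html: str) -> str:
    cleaner = Cleaner()
    cleaner.feed(html)
    cleaner.close()
    return "".join(cleaner.out)
```

The cleaned HTML string is what would then be handed to the Markdown converter.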

3. Useful partial results

The result page shows converted and failed pages together with per-file downloads.
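
One way to package partial results is sketched below, under stated assumptions: the `PageResult` type, the `page-0001.md` naming scheme, the `title` and `source` frontmatter fields, and the `failed.json` report are all hypothetical and are not taken from Site2MD itself.

```python
import io
import json
import zipfile
from dataclasses import dataclass


@dataclass
class PageResult:
    """Hypothetical per-page outcome: either Markdown or an error message."""
    url: str
    title: str = ""
    markdown: str = ""
    error: str = ""


def build_archive(results: list[PageResult]) -> bytes:
    """Write converted pages as Markdown files and list failures alongside."""
    buf = io.BytesIO()
    failed = []
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        for i, r in enumerate(results, 1):
            if r.error:
                failed.append({"url": r.url, "error": r.error})
                continue
            # JSON-quoted strings are used here as safely quoted YAML scalars.
            front = (
                "---\n"
                f"title: {json.dumps(r.title)}\n"
                f"source: {json.dumps(r.url)}\n"
                "---\n\n"
            )
            zf.writestr(f"page-{i:04d}.md", front + r.markdown)
        # Failed pages do not block the archive; they are reported in one file.
        zf.writestr("failed.json", json.dumps(failed, indent=2))
    return buf.getvalue()
```

Keeping failures in the same archive means a crawl with a few broken pages still yields a usable download.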