🎯Industry Use Cases

Extract Blog Post Content via API

Blogs are text. You want the text. Not the navigation, not the sidebar, not the cookie consent modal. denkbot.dog fetches the whole page and returns the text field with HTML stripped. For most blog posts, that's your article. The dog fetches the post. You get the words.

What you'd use this for

Content syndication, blog aggregators, newsletter digests, knowledge bases populated from blog content, and research tools that read the web.

How it works

example
# Extract blog content from WordPress, Ghost, Substack, etc.
curl -X POST https://api.denkbot.dog/scrape \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://blog.example.com/the-state-of-web-scraping-2025",
    "format": "json"
  }' | jq '{ title: .title, content: .text, author: .metadata.ogAuthor }'

Questions & Answers

Does it work with Substack?+

Yes for public posts. Paywalled posts require a subscription.

Can it extract the author name?+

Sometimes — it may appear in meta tags. Check metadata.ogAuthor.

Does it strip the navigation and sidebar?+

The text field strips HTML tags but doesn't do smart content extraction. Some nav text may be included.

Ready to start fetching?

€19/year. Unlimited requests. API key ready in 30 seconds.