Playground

Configure extraction settings, choose output formats, and generate ready-to-use commands for CLI, Docker, and Apify. Open the homepage to start.

Extract from a URL

  • Paste a URL into the input field on the homepage
  • Press the "Extract" button
  • The extracted content appears below

Extract from HTML

  • Paste raw HTML into the input field instead of a URL
  • Press the "Extract" button
  • Contextractor processes the HTML directly without fetching anything

Upload an HTML file

  • Go to the "Upload" page
  • Select an HTML file from your computer
  • The content is extracted immediately

Output formats

Choose your preferred output format:

  • Plain text — clean text with no markup
  • Markdown — preserves headings, lists, links, and basic formatting
  • JSON — structured output with metadata (title, author, date, etc.)
  • XML — standard XML output
  • XML TEI — TEI-compliant XML for academic and archival use
  • JSONL — newline-delimited JSON, one object per page (CLI only, for batch/pipeline use)

Download or copy

After extraction, you can:

  • Copy the result to your clipboard
  • Download the extracted content as a file

Updated: March 23, 2026