Playground
Configure extraction settings, choose output formats, and generate ready-to-use commands for CLI, Docker, and Apify. Open the homepage to start.
Extract from a URL
- Paste a URL into the input field on the homepage
- Press the "Extract" button
- The extracted content appears below
Extract from HTML
- Paste raw HTML into the input field instead of a URL
- Press the "Extract" button
- Contextractor processes the HTML directly without fetching anything
Upload an HTML file
- Go to the "Upload" page
- Select an HTML file from your computer
- The content is extracted immediately
Output formats
Choose your preferred output format:
- Plain text — clean text with no markup
- Markdown — preserves headings, lists, links, and basic formatting
- JSON — structured output with metadata (title, author, date, etc.)
- XML — standard XML output
- XML TEI — TEI-compliant XML for academic and archival use
- JSONL — newline-delimited JSON, one object per page (CLI only, for batch/pipeline use)
Download or copy
After extraction, you can:
- Copy the result to your clipboard
- Download the extracted content as a file
Updated: March 23, 2026