## Features
- **Organize and index** your scanned documents with tags, correspondents, types, and more.
-- _Your_ data is stored locally on _your_ server and is never transmitted or shared in any way.
+- _Your_ data is stored locally on _your_ server and is never transmitted or shared in any way, unless you explicitly choose to do so.
- Performs **OCR** on your documents, adding searchable and selectable text, even to documents scanned with only images.
-- Utilizes the open-source Tesseract engine to recognize more than 100 languages.
+ - Utilizes the open-source Tesseract engine to recognize more than 100 languages.
+ - _New!_ Supports remote OCR with Azure AI (opt-in).
- Documents are saved as PDF/A format which is designed for long term storage, alongside the unaltered originals.
- Uses machine-learning to automatically add tags, correspondents and document types to your documents.
- - Supports PDF documents, images, plain text files, Office documents (Word, Excel, Powerpoint, and LibreOffice equivalents)[^1] and more.
+ - Supports PDF documents, images, plain text files, Office documents (Word, Excel, PowerPoint, and LibreOffice equivalents)[^1] and more.
- Paperless stores your documents plain on disk. Filenames and folders are managed by paperless and their format can be configured freely with different configurations assigned to different documents.
- **Beautiful, modern web application** that features:
- Customizable dashboard with statistics.
"src/paperless_mail/tests/",
"src/paperless_tesseract/tests/",
"src/paperless_tika/tests",
+ "src/paperless_text/tests/",
+ "src/paperless_remote/tests/",
]
addopts = [
"--pythonwarnings=all",