Change8

0.16.21

📦 unstructuredView on GitHub →
3 features🐛 1 fixes🔧 1 symbols

Summary

This release introduces password support for PDF loading and new configuration options for PDF Miner, alongside performance improvements in layout merging and a fix for NDJSON file detection.

✨ New Features

  • Added support for using a password to load PDFs in all modes.
  • Refactored layout merging logic using vectorized operations and the new LayoutElements data structure for improved performance and clarity.
  • Introduced configuration options for PDF Miner via `pdfminer_line_overlap`, `pdfminer_word_margin`, `pdfminer_line_margin`, and `pdfminer_char_margin` parameters in the partition method.

🐛 Bug Fixes

  • Corrected file type detection for NDJSON files, which were previously misidentified as

🔧 Affected Symbols

partition method (for PDF Miner configuration)