v0.1.16
This release contains support in the SycamorePartitioner for extracting table structure and images, as well as a new transform for summarizing images. It also includes a number of bug fixes and enhancements.
What's Changed
- fix ui error when no title is extracted and we're not in ntsb setting by @HenryL27 in #352
- Fix almost all the pyproject.toml and poetry.lock files to have consistent requirements on python dependencies. by @eric-anderson in #345
- Bind mount to convey SSL cert/key to Jupyter & UI by @alexaryn in #349
- Use real SSL certificate for OpenSearch HTTP. by @alexaryn in #353
- copy lib/poetry-lock into containers to make poetry happy by @HenryL27 in #354
- copy lib/poetry-lock into remote-processor-service too. by @HenryL27 in #355
- copy in all of poetry-lock, not just the pyproject files by @HenryL27 in #356
- Update data model for table structure recognition. by @bsowell in #357
- Put token-protected HTTPS proxy in front of UI proxy. by @alexaryn in #359
- Arxiv switched to HTTP for these PDFs; make it work. by @alexaryn in #360
- Add apt update to UI Dockerfiles. by @alexaryn in #361
- Use chown in our copy commands to make sure all files are owned by app by @eric-anderson in #362
- Add TableStructureExtractor interface and TableTransformer impl. by @bsowell in #358
- fix zsh path by @eric-anderson in #367
- Jupyter container improvements by @eric-anderson in #369
- Don't say localhost if it's not going to work. by @alexaryn in #366
- bump deploy timeout for reranking model from 60 to 120 by @HenryL27 in #363
- ingest all ntsb docs, automatically detect docker v not, spread path … by @HenryL27 in #368
- Fix typos in README by @hsm207 in #370
- Fix default prep script when given an empty directory to import by @HenryL27 in #371
- fix typo by @HenryL27 in #372
- Add the ability to summarize images to partitioned docsets. by @bsowell in #365
- Store element bbox as a tuple rather than BoundingBox. by @bsowell in #374
- Jonfritz patch 1 partition update by @jonfritz in #376
- FIX: Error on initiate conversation without a conversation id by @sohamkasar19 in #375
- Add API docs for the SycamorePartitioner and table extraction. by @bsowell in #373
- Fix malformed text from beautiful soup. by @bohou-aryn in #351
- Handle deserializing JSON documents when elements is None. by @bsowell in #377
- Bump sycamore version to 0.1.16 by @bsowell in #378
New Contributors
- @hsm207 made their first contribution in #370
- @sohamkasar19 made their first contribution in #375
Full Changelog: v0.1.15...v0.1.16