v0.1.26
This release includes several stability and reliability improvements.
What's Changed
- skip flaky test by @HenryL27 in #956
- Fix mypy warnings. by @mdwelsh in #947
- Work around hang observed during vcrpy recording. by @alexaryn in #950
- Postprocessing to modify plans returned by llm planner; minor issues with query-ui by @amolvdeshpande in #882
- bump sdk to 0.1.7 by @HenryL27 in #961
- Add HeaderAugmenterMerger by @dhruvkaliraman7 in #946
- Update docs to reflect OpenAIPropertyExtractor->LLMPropertyextractor by @bsowell in #964
- Couple of minor fixes and tweaks to the table merger. by @bsowell in #963
- Enable use_elements in query.summarize_data by @baitsguy in #966
- Fix typo in syntax in docstring for Summarize Images by @jonfritz in #967
- Add missing `tokenizer` argument in `MarkBreakByTokens` docstring by @MarkLindblad in #969
- Add Lots of Connector Unit Tests by @karanataryn in #957
- Add OCR Evaluation Code by @karanataryn in #685
- Fixed query tag check by @baitsguy in #968
- Fix SDK Threshold Bug by @karanataryn in #970
- Add score to each document in OpenSearch query result. by @bsowell in #971
- Fix HeaderAugmenterMerger by @MarkLindblad in #973
- Refactor `mark_bbox_preset` to expose function outside `DocSet` by @MarkLindblad in #972
- Fix `mark_bbox_preset`'s `MarkDropHeaderFooter` parameter by @MarkLindblad in #975
- OpenSearch improvements by @baitsguy in #974
- Adding a separate installation instructions page by @AbhijitP-009 in #977
- Union OCR / PDFMiner Tokens with Table Outputs by @karanataryn in #976
- Make Table Code More Robust by @karanataryn in #979
- fix divide by zero in align_headers by @HenryL27 in #978
- Allow for returning query traces on cached query executions. by @mdwelsh in #959
- Add Enhance Table Option to SDK by @karanataryn in #980
- Bump SDK Version by @karanataryn in #981
- Update Lockfiles by @karanataryn in #920
- Add query planning strategy objects by @baitsguy in #982
- Move tokenized data to device by @baitsguy in #983
- Update vectorsearch query test by @baitsguy in #984
- Integration test for Sycamore Query demo. by @mdwelsh in #985
- Add Closure of Client Connections for Connectors by @karanataryn in #989
- Work around lack of resource module on Windows. by @alexaryn in #962
- Update README.md by @karanataryn in #990
- Merge in Fixes from Luna Demo Deployment by @karanataryn in #992
- Add table-chunker by @dhruvkaliraman7 in #993
- chore: Added back to top, contributors section and star history graph by @samarth29jc in #987
- Return the list of documents referenced in a Luna query. by @mdwelsh in #995
- Sync Locks across all Directories by @karanataryn in #988
- Remove unused code (`_batchify`) by @MarkLindblad in #887
- Don't try to put footers in columns by @HenryL27 in #998
- Docprep notebook testing by @sohamkasar19 in #996
- Add expected documents in query-eval tool by @baitsguy in #997
- Move Aryn DocParse Docs Out of Sycamore by @karanataryn in #994
- Remove seed from rewrite prompt by @baitsguy in #1000
- Fix OpenAI reduce methods to handle Azure deployment names. by @bsowell in #1002
- Add support for custom source parameter for remote Aryn Partitioner by @MarkLindblad in #1003
- Fix mixed samples for schema extraction. by @mdwelsh in #1004
- updating extract table prop by @Soeb-aryn in #1005
- Update Opensearch domain in docprep notebook testing (GHA) by @sohamkasar19 in #1006
- Improve suggested install command by @HenryL27 in #1007
- Fix augment_text docstring by @HenryL27 in #1008
- Add support for using Aryn DocParse chunking from `aryn-sdk` by @MarkLindblad in #1010
- Update sycamore to 0.1.26 by @HenryL27 in #1009
New Contributors
- @amolvdeshpande made their first contribution in #882
- @samarth29jc made their first contribution in #987
Full Changelog: v0.1.25...v0.1.26