Skip to content

Releases: aryn-ai/sycamore

v0.1.10

21 Dec 21:55
Compare
Choose a tag to compare

This Sycamore release adds support for near duplicate detection via shingling. It also includes documentation improvements and incremental bug fixes.

What's Changed

  • Render schema extraction documentation by @mkyl in #194
  • Additional documentation for Schema extraction by @mkyl in #195
  • Add async-timeout dependency by @eric-anderson in #198
  • Add docstrings to all public document methods so they show up on sycamore.readthedocs.io by @eric-anderson in #197
  • Near-Duplicate Detection in Sycamore: Document Tagging and Document Dropping by @alexaryn in #199
  • Bump version to v0.1.10. by @bsowell in #200

Full Changelog: v0.1.9...v0.1.10

v0.1.9

08 Dec 01:04
Compare
Choose a tag to compare

This Sycamore release adds improved heuristics for partitioning documents. It also includes a new method of automatically inferring entities to extract from unstructured documents, as well as incremental features and bug fixes.

What's Changed

  • Change the default merge size to 256. by @eric-anderson in #178
  • Simplify running the http crawler. by @eric-anderson in #180
  • Fix text chunking for html importing to improve result quality. by @eric-anderson in #185
  • Remove docker_compose and opensearch files. They were moved to quickstart. by @eric-anderson in #183
  • Change simple_ingest and s3_ingest to use GTE-small embedding model. by @alexaryn in #169
  • Remove unneeded mapping in OpenSearch index settings. by @alexaryn in #186
  • Added HTML ingest example. Fixed order in S3 ingester. by @alexaryn in #188
  • Simple transform to perform regex replacement on Elements. by @alexaryn in #187
  • Update README.md by @jonfritz in #179
  • Entity Extraction by @mkyl in #161
  • Merging/breaking elements based on heuristics including bbox by @alexaryn in #171
  • Update aiohttp and cryptography to address dependabot alerts. by @bsowell in #192
  • Bump version to v0.1.9. by @bsowell in #191

New Contributors

Full Changelog: v0.1.8...v0.1.9

v0.1.8

18 Nov 17:50
Compare
Choose a tag to compare

This Sycamore release contains code to build Docker containers as well as small improvements and bug fixes.

What's Changed

New Contributors

Full Changelog: v0.1.7...v0.1.8

v0.1.7

03 Nov 00:04
Compare
Choose a tag to compare

This Sycamore release adds support for reading JSON and using Azure OpenAI to enrich data and generate vector embeddings. It also includes documentation improvements, improvements to merging and partitioning, new incremental features, and bug fixes.

What's Changed

Full Changelog: v0.1.6...v0.1.7

v0.1.6

20 Oct 18:37
Compare
Choose a tag to compare

This Sycamore release adds basic support for ingesting PPTX files and support for OpenAI and Amazon Bedrock embedding models. It also contains small improvements and bug fixes.

What's Changed

New Contributors

Full Changelog: v0.1.5...v0.1.6

v0.1.5

12 Oct 23:51
Compare
Choose a tag to compare

This Sycamore release contains small improvements and bug fixes.

What's Changed

  • Update end_to_end_tutorials.md by @HenryL27 in #97
  • Add notebook tests and utilities to strip output from notebooks. by @bsowell in #95
  • Examples to show simple ingestion using Sycamore. by @alexaryn in #98
  • Assume a reasonable default; passing None yielded tiny elements. by @alexaryn in #99
  • Allow bbox into index. by @alexaryn in #101
  • Increase nbmake timeout to address integ test issues. by @bsowell in #104
  • Fix example so it makes a proper KNN index. by @alexaryn in #102
  • Transform to denormalize specified properties from parents to children by @alexaryn in #103
  • Bump version to 0.1.5 by @bsowell in #106

Full Changelog: v0.1.4...v0.1.5

Release version 0.1.4

06 Oct 22:18
25081e6
Compare
Choose a tag to compare

What's Changed

This Sycamore release has a variety of small improvements and bug fixes:

New Contributors

Full Changelog: v0.1.3...v0.1.4

v0.1.3

28 Sep 15:26
e796de4
Compare
Choose a tag to compare

What's Changed

Full Changelog: v0.1.2...v0.1.3

v0.1.2

28 Sep 07:52
Compare
Choose a tag to compare

What's Changed

  • Update for document deployment and fix a few typos. by @bsowell in #66
  • Update pyproject.toml by @bsowell in #67

Full Changelog: v0.1.1...v0.1.2

v0.1.1

28 Sep 07:15
Compare
Choose a tag to compare

Minor release to get PyPI in order.

What's Changed

Full Changelog: v0.1.0...v0.1.1