Upgrade Storage Backends to V2 Storage API #6458

mahadzaryab1 · 2025-01-01T20:29:40Z

This is a project proposed as part of LFX Mentorship term #6470.

Background

Jaeger is an open-source, distributed tracing platform designed to monitor and troubleshoot microservices-based systems. A critical component of Jaeger is its storage backends, where traces captured by Jaeger are persisted for querying.

Currently, Jaeger uses a v1 Storage API, which operates on a data model specific to Jaeger. Each storage backend implements this API, requiring transformations between Jaeger's proprietary model and the OpenTelemetry Protocol (OTLP) data model, which is now the industry standard.

As part of #5079, Jaeger has introduced the more efficient v2 Storage API, which natively supports the OpenTelemetry data model (OTLP), allows batching of writes and streaming of resultes. This effort is part of a broader alignment with the OpenTelemetry Collector framework, tracked under #4843.

Objective

Upgrade Jaeger storage backends to natively implement the v2 Storage API.

The chosen storage backend should be upgraded to fully implement the v2 Storage API in place. For a rough idea of how to upgrade from the v1 model to the OTLP data model, take a look at the PRs in the following issues that do a similar upgrade for other components of Jaeger:

Desired Outcomes

Upgrade Memory and Elasticsearch backends

We prioritize these two backends as they are the mostly frequently used with Jaeger and upgrading them paves a path for upgrading other backends.

Testing

The storage implementations should be fully covered by unit tests
There are already integration tests for the storage backend so all of them should pass without needing to be modified

Bonus: Upgrade Other Backends

If time permits, upgrade Badger and Cassandra storage backends.

Risks / Open Questions

The v2 storage API doesn't have a distinction between primary and archive storage but v1 does. The ultimate plan is to remove the archive storage from the v1 implementation as well. That work effort is being tracked in Phase out the distinction between primary and archive storage #6065. We may want to think about how to handle the upgrades for the storages that implement the archive storage in v1 while we work on removing it. We may want to simply ignore the archive part of the storage while we resolve the aforementioned issue if that is possible.

…o v1 (#6485) ## Which problem is this PR solving? - Resolves #6480 ## Description of the changes - This PR implements a reverse adapter (`SpanReader`) that wraps a native v2 storage interface (`tracestore.Reader`) and downgrades it to implement the v1 storage interface (`spanstore.Reader`). - The reverse adapter was integrated with the v1 query service. This code path will only get executed once we start upgrading the existing storage implementations to implement the new `tracestore.Reader` interface as a part of #6458 ## How was this change tested? - CI - Added new unit tests ## Checklist - [x] I have read https://github.com/jaegertracing/jaeger/blob/master/CONTRIBUTING_GUIDELINES.md - [x] I have signed all commits - [x] I have added unit tests for the new functionality - [x] I have run lint and test steps successfully - for `jaeger`: `make lint test` - for `jaeger-ui`: `npm run lint` and `npm run test` --------- Signed-off-by: Mahad Zaryab <[email protected]>

Devaansh-Kumar · 2025-01-07T19:35:14Z

Hello @mahadzaryab1. I am Devaansh Kumar and I am interested in applying for this project under LFX'25. I have some experience with open source earlier as I had participated in GSoC'24 under Kubernetes.

I have a few questions:

What was the reason behind removing the distinction between primary and archive storage for v2 API? It seems intuitive to me to have both in v2.
For contributors new to Jaeger’s codebase, are there specific areas or smaller tasks you’d recommend starting with to get familiarized?

Thank you

yurishkuro · 2025-01-07T20:09:37Z

@Devaansh-Kumar

…o v1 (jaegertracing#6485) ## Which problem is this PR solving? - Resolves jaegertracing#6480 ## Description of the changes - This PR implements a reverse adapter (`SpanReader`) that wraps a native v2 storage interface (`tracestore.Reader`) and downgrades it to implement the v1 storage interface (`spanstore.Reader`). - The reverse adapter was integrated with the v1 query service. This code path will only get executed once we start upgrading the existing storage implementations to implement the new `tracestore.Reader` interface as a part of jaegertracing#6458 ## How was this change tested? - CI - Added new unit tests ## Checklist - [x] I have read https://github.com/jaegertracing/jaeger/blob/master/CONTRIBUTING_GUIDELINES.md - [x] I have signed all commits - [x] I have added unit tests for the new functionality - [x] I have run lint and test steps successfully - for `jaeger`: `make lint test` - for `jaeger-ui`: `npm run lint` and `npm run test` --------- Signed-off-by: Mahad Zaryab <[email protected]> Signed-off-by: adityachopra29 <[email protected]>

mahadzaryab1 added the enhancement label Jan 1, 2025

dosubot bot added area/storage v2 labels Jan 1, 2025

yurishkuro mentioned this issue Jan 2, 2025

🎓 Jaeger's LFX Mentorship - Term 01 - 2025 March - May #6470

Open

5 tasks

mahadzaryab1 mentioned this issue Jan 5, 2025

[v2][storage] Implement reverse adapter to translate v2 storage api to v1 #6485

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upgrade Storage Backends to V2 Storage API #6458

Upgrade Storage Backends to V2 Storage API #6458

mahadzaryab1 commented Jan 1, 2025 •

edited by yurishkuro

Loading

Devaansh-Kumar commented Jan 7, 2025

yurishkuro commented Jan 7, 2025

Upgrade Storage Backends to V2 Storage API #6458

Upgrade Storage Backends to V2 Storage API #6458

Comments

mahadzaryab1 commented Jan 1, 2025 • edited by yurishkuro Loading

Background

Objective

Desired Outcomes

Upgrade Memory and Elasticsearch backends

Testing

Bonus: Upgrade Other Backends

Risks / Open Questions

Devaansh-Kumar commented Jan 7, 2025

yurishkuro commented Jan 7, 2025

mahadzaryab1 commented Jan 1, 2025 •

edited by yurishkuro

Loading