Enable incremental sync for gitlab jobs stream #50847
patygold
started this conversation in
Connector Ideas and Features
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Currently, the GitLab
jobs
stream is only available for full-refresh sync.This is problematic, as it's taking hours to finish the sync.
Even though the Jobs endpoint in GitLab doesn't accept a date filter, the parent endpoint (
pipelines
) does, andjobs
is already a child stream in the current implementation of the GitLab connector.I propose activating the
incremental_dependency
in thejobs
stream, and using thepipelines
filter.The code in the yml file for the jobs stream would look like this:
And we also would need to add the field
updated_at
to thejobs.json
file.I've tested this on Airbyte 0.50.30 and 1.3.1, both works. What I don't know, if this will work for bigger companies, as the state file can be quite big, since it saves the state for each pipeline id.
Eg, this is what the state file can look like for the stream
jobs
with a couple of pipelines in the same project.Beta Was this translation helpful? Give feedback.
All reactions