-
Notifications
You must be signed in to change notification settings - Fork 494
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Extension of API {id}/versions
and {id}/versions/{versionId}
with an optional excludeMetadataBlocks
parameter
#10778
Extension of API {id}/versions
and {id}/versions/{versionId}
with an optional excludeMetadataBlocks
parameter
#10778
Conversation
…cks from API response.
…cks from API response.
…sponse # Conflicts: # src/main/java/edu/harvard/iq/dataverse/util/json/JsonPrinter.java
@GPortas we are experiencing some performance issues with our SPA when a user requests information about dataset versions, particularly those with many versions. Are you experiencing similar problems? We believe that reducing the payload by omitting the metadata would solve our problem. As we can load the metadata with another query if needed. What do you think? |
I'm not sure if we've experienced issues with the metadata blocks, and if we have, they may have been minor, possibly because we don't tend to add complex metadata block configurations in our test datasets. It's reasonable to think it will improve performance, as additional queries are omitted. This is somewhat similar to what we did with the files, where we added the optional query parameter called This makes me wonder if it might be interesting to create a 'reduced information' endpoint instead of continuing to include parameters for excluding properties in the general endpoint. |
Here are the docs for |
Our problem is only partly caused by the large complex metadata block, the other cause is the amount of versions (we have a dataset where an update is published every day, the file changes but the metadata is the same). So the payload of this API becomes huge and by omitting the metadata blocks we can mitigate the problem while still getting the necessary information about versions without introducing paging. I'm not a fan of having different endpoints for more or less the same information. It is more code to maintain and more complicated for the user. This PR is inspired by the |
@GPortas at some point we should probably test the SPA against a dataset with lots of versions. We should have datasets like this on the performance cluster. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@johannes-darms overall, looks good. I left some comments. Thanks.
@johannes-darms can you please merge the latest from develop? We need this anyway and it will trigger a Jenkins run, which is failing. Also, please consider the suggestions I made in my review. Thanks. |
b8d0f3d
to
29b30c2
Compare
Sorry for the delay. I've merged, adapted the documentation and wrote a simple test. If you need or want more tests I'm happy to write them. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@johannes-darms thanks for adding tests and addressing all my questions! I didn't run the code myself but tests are passing and code and docs look good. Approved!
@johannes-darms, sorry to do this to you again, but we've recently released version 6.5, so we need you to get the latest from Dev into this branch. Thanks for your patience. |
No worries, merged without manual effort :) |
Sadly, the Jenkins build fails and I cannot access the output. Could you forward the information @pdurbin ? |
@johannes-darms Thanks again for your patience. |
@johannes-darms I know this PR has already been merged but to answer your question, the latest API test run passed: https://jenkins.dataverse.org/job/IQSS-Dataverse-Develop-PR/job/PR-10778/8/testReport/ |
Thanks for sharing but I cannot access the jenkins instance. |
Yes, I know, sorry, that link was a "note to self". Related: |
What this PR does / why we need it:
Extension of API
{id}/versions
and{id}/versions/{versionId}
with an optionalexcludeMetadataBlocks
parameter,that specifies whether the metadataBlocks should be listed in the output. It defaults to
false
, preserving backwardcompatibility. (Note that for a dataset with a large number of versions and/or metadataBlocks having the metadata blocks
included can dramatically increase the volume of the output).
We have slow response from
api/datasets/%s/versions
due to the large response body. Most of the information included (metadatablocks) is not needed as we just want to display a dropdown list with all available versions.Which issue(s) this PR closes:
Suggestions on how to test this: Call the API once with the flag and once without.
Does this PR introduce a user interface change?: No
Is there a release notes update needed for this change?: Maybe, it is a new optional property.
Preview docs at https://dataverse-guide--10778.org.readthedocs.build/en/10778/api/native-api.html#list-versions-of-a-dataset