-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PARQUET-2422: Prevent unwrapping of Hadoop filestreams #1256
Conversation
Could you please create a JIRA? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks trivial to me. Thanks!
It would be good if @steveloughran @Fokko can double check as I'm not that familiar with its history.
Not sure why there's a need to return a function rather than do the work -is there some other code which needs this? In an ideal world parquet would be hadoop 3.3+ only and life would be simpler, not just here but with openFile(), ByteBufferPositionedReadable and more |
@shangxinli @gszadovszky @ConeyLiu Do you have any comment? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks good to me.
@@ -77,16 +78,17 @@ public static SeekableInputStream wrap(FSDataInputStream stream) { | |||
* @param stream stream to probe | |||
* @return A H2SeekableInputStream to access, or H1SeekableInputStream if the stream is not seekable |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Documentation should also be updated
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@rathinb-db Could you resolve this comment? Thanks!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, just left a minor comment about doc.
Make sure you have checked all steps below.
Jira
them in the PR title. For example, "PARQUET-1234: My Parquet PR"
the ASF 3rd Party License Policy.
Refactor Hadoop filestreams to prevent filestream unwrapping (and keeping the original filestream type).
Tests
Tests not necessary since this is a refactor.
Commits
from "How to write a good git commit message":
Style
mvn spotless:apply -Pvector-plugins
Documentation