Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP - RPP Exclusive-Or on HOST and HIP #464

Draft
wants to merge 67 commits into
base: develop
Choose a base branch
from
Draft
Changes from 1 commit
Commits
Show all changes
67 commits
Select commit Hold shift + click to select a range
0a09a27
Update the initial CODE for HIP Implementation of Exclusive OR
Srihari-mcw Oct 21, 2024
4f5d6f0
Add exclusive_or.hpp hip file
Srihari-mcw Sep 9, 2024
885e808
Update the code for initial HOST Code
Srihari-mcw Sep 10, 2024
e130368
Make SSE based updatess for exclusive or
Srihari-mcw Sep 10, 2024
169c59c
Update the code for AVX2 implementation of U8 code
Srihari-mcw Sep 10, 2024
cce836d
Uncomment pragma
Srihari-mcw Sep 10, 2024
5b06d48
Initial commit for I8
Srihari-mcw Sep 11, 2024
7f5df55
Add I8 case
Srihari-mcw Sep 11, 2024
9fe0d11
Fix issues with PKD3 to PLN3 i8 implementation
Srihari-mcw Sep 17, 2024
a83f3fa
Initial updates based on self review
Srihari-mcw Sep 17, 2024
30bd007
More updates
Srihari-mcw Sep 17, 2024
c782fd2
More cleanup
Srihari-mcw Sep 20, 2024
5672802
Update separate code for PLN3 to PLN3 U8
Srihari-mcw Sep 24, 2024
d52e53d
Update separate code for PLN3 to PLN3 I8
Srihari-mcw Sep 24, 2024
10eddc3
Update separate code for PLN3 to PLN3 F32
Srihari-mcw Sep 24, 2024
8937164
Fix compilation issues
Srihari-mcw Sep 24, 2024
b52cef4
Fix accuracy issues for PLN3 to PLN3
Srihari-mcw Sep 24, 2024
f547030
Add comments and formatting
Srihari-mcw Sep 24, 2024
069165d
Rearrange the function declarations
Srihari-mcw Sep 24, 2024
a420d14
Add golden outputs for exclusive or
Srihari-mcw Sep 24, 2024
fa6100d
Add AVX2 flags wherever necessary
Srihari-mcw Sep 24, 2024
700c507
Update the code to have updated F16 load functions
Srihari-mcw Sep 24, 2024
2823e4b
HIP Code Updates
Srihari-mcw Sep 24, 2024
f0732b0
F16 PLN3 to PLN3 Updates
Srihari-mcw Sep 24, 2024
91311c3
Update outputs
Srihari-mcw Oct 21, 2024
13b315f
Rearrange XOR GPU function header
Srihari-mcw Sep 27, 2024
b348dc6
Add empty line
Srihari-mcw Sep 27, 2024
0e18365
Update aligned length
Srihari-mcw Sep 30, 2024
ad5036d
Updates to make F16 outputs consistent with other bit depths
Srihari-mcw Sep 30, 2024
a86e2b0
Add std::nearbyintf in exclusive or hip code
Srihari-mcw Oct 1, 2024
bb3a55a
Update the code to use predefined zero vectors
Srihari-mcw Oct 8, 2024
6f79652
Update to use existing rpp_load96_u8_avx instead of rpp_load96_u8pln3…
Srihari-mcw Oct 9, 2024
db5a2ac
Update the version
Srihari-mcw Oct 23, 2024
a9363ce
Update changelog
Srihari-mcw Oct 23, 2024
bdd306a
Merge branch 'ar/opt_bitwise_xor' into opt_exclusive_or_hip
r-abishek Oct 30, 2024
6c394f0
Merge pull request #338 from Srihari-mcw/opt_exclusive_or_hip
r-abishek Oct 30, 2024
19885c0
Merge branch 'develop' into ar/opt_bitwise_xor
r-abishek Nov 2, 2024
756ba4b
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Nov 5, 2024
1da35eb
Merge branch 'develop' into ar/opt_bitwise_xor
r-abishek Nov 7, 2024
22623a6
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Nov 8, 2024
43852eb
Merge branch 'develop' into ar/opt_bitwise_xor
r-abishek Nov 27, 2024
97653cd
Update CHANGELOG.md
r-abishek Nov 27, 2024
6cfadf8
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Nov 27, 2024
bd312ca
Merge branch 'develop' into ar/opt_bitwise_xor
r-abishek Nov 30, 2024
c981ad2
Merge branch 'develop' into ar/opt_bitwise_xor
Srihari-mcw Dec 6, 2024
0b2be7e
Updates to fix more merge conflicts
Srihari-mcw Dec 6, 2024
2d9f0f2
Update version to 1.9.10 including exclusive or
Srihari-mcw Dec 9, 2024
311f265
Merge pull request #365 from Srihari-mcw/opt_bitwise_xor_rebased
r-abishek Dec 9, 2024
8e5b2a7
Merge branch 'develop' into ar/opt_bitwise_xor
r-abishek Dec 9, 2024
e40d481
Merge branch 'develop' into ar/opt_bitwise_xor
Srihari-mcw Dec 11, 2024
333b811
Merge pull request #370 from Srihari-mcw/opt_bitwise_xor_rebased
r-abishek Dec 11, 2024
3bfb8b7
Merge branch 'develop' into ar/opt_bitwise_xor
Srihari-mcw Dec 13, 2024
40252a1
Remove duplicate definitions of functions
Srihari-mcw Dec 13, 2024
83df6c4
Merge branch 'develop' into ar/opt_bitwise_xor
Srihari-mcw Dec 16, 2024
0ecbc06
Merge branch 'develop' into ar/opt_bitwise_xor
Srihari-mcw Dec 17, 2024
90299a6
Merge branch 'develop' into ar/opt_bitwise_xor
Srihari-mcw Dec 23, 2024
1b616f3
Merge pull request #373 from Srihari-mcw/opt_bitwise_xor_rebased
r-abishek Dec 24, 2024
c7f766a
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Jan 6, 2025
3969611
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Jan 6, 2025
d0f20ac
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Jan 7, 2025
4c8a1f1
Merge branch 'develop' into ar/opt_bitwise_xor
r-abishek Jan 7, 2025
39bed49
Merge branch 'develop' into ar/opt_bitwise_xor
r-abishek Jan 8, 2025
420cdc8
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Jan 8, 2025
0d334f2
Merge branch 'develop' of https://github.com/ROCm/rpp into ar/opt_bit…
r-abishek Jan 15, 2025
6f74447
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Jan 15, 2025
57b2abd
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Jan 15, 2025
590da40
Merge branch 'develop' into ar/opt_bitwise_xor
kiritigowda Jan 16, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Add I8 case
Srihari-mcw committed Oct 23, 2024

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
commit 7f5df55e7388399e7b89b05e37d07ae2ff136098
12 changes: 12 additions & 0 deletions src/modules/rppt_tensor_logical_operations.cpp
Original file line number Diff line number Diff line change
@@ -146,6 +146,18 @@ RppStatus rppt_exclusive_or_host(RppPtr_t srcPtr1,
layoutParams,
rpp::deref(rppHandle));
}
else if ((srcDescPtr->dataType == RpptDataType::I8) && (dstDescPtr->dataType == RpptDataType::I8))
{
exclusive_or_i8_i8_host_tensor(static_cast<Rpp8s*>(srcPtr1) + srcDescPtr->offsetInBytes,
static_cast<Rpp8s*>(srcPtr2) + srcDescPtr->offsetInBytes,
srcDescPtr,
static_cast<Rpp8s*>(dstPtr) + dstDescPtr->offsetInBytes,
dstDescPtr,
roiTensorPtrSrc,
roiType,
layoutParams,
rpp::deref(rppHandle));
}

return RPP_SUCCESS;
}