Pass Discrete Sequence to GPU #652
… sequences, moved spatial gradient calculation to own function.
Codecov Report
✅ All modified and coverable lines are covered by tests.

Coverage Diff

| Metric | master | #652 | +/- |
|---|---|---|---|
| Coverage | 90.14% | 90.09% | -0.06% |
| Files | 57 | 58 | +1 |
| Lines | 3228 | 3230 | +2 |
| Hits | 2910 | 2910 | |
| Misses | 318 | 320 | +2 |
KomaMRI Benchmarks
| Benchmark suite | Current: 2035faa | Previous: b5c63a9 | Ratio |
|---|---|---|---|
| MRI Lab/Bloch/CPU/2 thread(s) | 365549150.5 ns | 340960290 ns | 1.07 |
| MRI Lab/Bloch/CPU/4 thread(s) | 284430115 ns | 272489189 ns | 1.04 |
| MRI Lab/Bloch/CPU/8 thread(s) | 260188629 ns | 212102146.5 ns | 1.23 |
| MRI Lab/Bloch/CPU/1 thread(s) | 565921915 ns | 553123722 ns | 1.02 |
| MRI Lab/Bloch/GPU/CUDA | 20263918 ns | 21705981 ns | 0.93 |
| MRI Lab/Bloch/GPU/oneAPI | 77678338 ns | 79470326.5 ns | 0.98 |
| MRI Lab/Bloch/GPU/Metal | 103679834 ns | 95738917 ns | 1.08 |
| MRI Lab/Bloch/GPU/AMDGPU | 26217619 ns | 26055313 ns | 1.01 |
| Slice Selection 3D/Bloch/CPU/2 thread(s) | 1622368544 ns | 1588926486.5 ns | 1.02 |
| Slice Selection 3D/Bloch/CPU/4 thread(s) | 902336276 ns | 886294756.5 ns | 1.02 |
| Slice Selection 3D/Bloch/CPU/8 thread(s) | 590207573.5 ns | 561392306 ns | 1.05 |
| Slice Selection 3D/Bloch/CPU/1 thread(s) | 3098118101 ns | 3027587378 ns | 1.02 |
| Slice Selection 3D/Bloch/GPU/CUDA | 31261767.5 ns | 32637155 ns | 0.96 |
| Slice Selection 3D/Bloch/GPU/oneAPI | 127028296.5 ns | 124598595.5 ns | 1.02 |
| Slice Selection 3D/Bloch/GPU/Metal | 113151145.5 ns | 112846167 ns | 1.00 |
| Slice Selection 3D/Bloch/GPU/AMDGPU | 34108195 ns | 34082841 ns | 1.00 |

This comment was automatically generated by workflow using github-action-benchmark.

The macOS failures are due to actions/runner-images#13341.

Compare benchmarks with #653. Update: the macOS cache problem is fixed.

cncastillo left a comment
Looks very good. There are no significant performance regressions, and the minor ones that do appear could be explained by some of the suggestions below.
In preparation for higher order sequences, this adds: