Further improve performance #83

tim-hoffman · 2024-01-02T17:16:04Z

No description provided.

0xddom

I don't fully understand what's going on, but I trust you that this improves performance. If I understood it correctly, splitting between compute and execute avoids copying the env unnecessarily, correct? Do you have any run-time numbers now?

Also, I was wondering: do we still clone the whole IR every time we run a pass? Maybe we could improve performance further if we move the IR and only create new copies of things when we actually modify them. I don't know if it will work, since we need the IDs for each bucket, but it may be worth a try.
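To make the "move the IR and only copy what changes" idea concrete, here is a minimal sketch. Everything in it is hypothetical: `Bucket`, its `id` and `value` fields, and `clamp_pass` are illustration-only stand-ins, not types or passes from this repository, and whether the real IR can be shared this way is an open question.

```rust
use std::rc::Rc;

// Hypothetical stand-in for an IR node; the real bucket types are richer.
#[derive(Clone, Debug, PartialEq)]
struct Bucket {
    id: usize,
    value: i64,
}

/// Hypothetical pass: replaces negative values with zero.
/// The IR is taken by value, so unchanged buckets are moved through
/// without copying; a new allocation is made only for a modified bucket,
/// and that bucket keeps its original ID.
fn clamp_pass(ir: Vec<Rc<Bucket>>) -> Vec<Rc<Bucket>> {
    ir.into_iter()
        .map(|bucket| {
            if bucket.value < 0 {
                // Modified: build a fresh node with the same ID.
                Rc::new(Bucket { id: bucket.id, value: 0 })
            } else {
                // Unmodified: reuse the existing allocation.
                bucket
            }
        })
        .collect()
}
```

With this shape, a pass that changes few buckets allocates only for those buckets, which would address memory usage as well as run time.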

0xddom · 2024-01-02T18:24:22Z

circuit_passes/src/bucket_interpreter/mod.rs

+            _compute_instruction,
+            _execute_instruction
+        );
+        error::add_loc_if_err(result, inst.as_ref())
    }

    /****************************************************************************************************
     * Private implemenation


tim-hoffman · 2024-01-02T18:51:51Z

You got it. Profiling showed that Env clones and destructors were taking the most time by far.
I've replaced "execute" with "compute" in many cases where the bucket cannot update the Env. The compute_or_execute macro covers cases where the Env might be updated but the updated Env is never used afterwards: it is safe to try the "compute" approach first and, if an Env-modifying instruction is encountered, fall back to the "execute" approach.
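As a rough illustration of that fallback, the pattern can be sketched with plain functions. This is a simplified sketch under assumptions, not the macro from this PR: `Env`, `Inst`, `ModifiesEnv`, `compute`, `execute`, and `run_with_clone` here are hypothetical stand-ins, and `compute_or_execute` is written as a function rather than a macro.

```rust
use std::collections::HashMap;

// Hypothetical stand-ins; the real Env and bucket types are richer.
#[derive(Clone, Debug, PartialEq)]
struct Env {
    vars: HashMap<String, i64>,
}

enum Inst {
    Load(String),       // only reads the Env
    Store(String, i64), // modifies the Env
}

// Signals that an instruction cannot be handled on the read-only path.
#[derive(Debug, PartialEq)]
struct ModifiesEnv;

/// Read-only path: borrows the Env, never clones it.
fn compute(inst: &Inst, env: &Env) -> Result<Option<i64>, ModifiesEnv> {
    match inst {
        Inst::Load(name) => Ok(env.vars.get(name).copied()),
        Inst::Store(..) => Err(ModifiesEnv),
    }
}

/// Mutating path: owns the Env and returns the updated one.
fn execute(inst: &Inst, mut env: Env) -> (Option<i64>, Env) {
    match inst {
        Inst::Load(name) => {
            let value = env.vars.get(name).copied();
            (value, env)
        }
        Inst::Store(name, value) => {
            env.vars.insert(name.clone(), *value);
            (Some(*value), env)
        }
    }
}

/// Runs every instruction on the mutating path, threading the Env through.
fn run_with_clone(insts: &[Inst], mut env: Env) -> Option<i64> {
    let mut last = None;
    for inst in insts {
        let (value, next) = execute(inst, env);
        last = value;
        env = next;
    }
    last
}

/// Tries the read-only path first. Only if an Env-modifying instruction
/// appears does it clone the Env once and rerun on the mutating path.
/// The caller never sees the updated Env, only the resulting value.
fn compute_or_execute(insts: &[Inst], env: &Env) -> Option<i64> {
    let mut last = None;
    for inst in insts {
        match compute(inst, env) {
            Ok(value) => last = value,
            Err(ModifiesEnv) => return run_with_clone(insts, env.clone()),
        }
    }
    last
}
```

In this sketch a sequence made only of `Load` instructions never clones the `Env`; the clone is paid once, and only when a `Store` is actually encountered.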

tim-hoffman · 2024-01-02T18:54:00Z

Each pass does still reconstruct the IR via the transform methods but I haven't seen that as a bottleneck in profiling so far.

0xddom · 2024-01-02T18:58:37Z

> Each pass does still reconstruct the IR via the transform methods but I haven't seen that as a bottleneck in profiling so far.

It's not only about performance but also memory usage. Still, we can leave it for future work if it's not that pressing. This PR has enough in it already.

tim-hoffman added 5 commits January 2, 2024 11:13

Remove more Env clones

faf7718

Move subcmp name into the StandardEnvData

d93605b

Reduces String clones and overall memory usage of String instances

fix a mixup in the debug info

19341a1

fix for bucket structure that was overlooked

98d4dbe

add a helpful comment

f50c47b

tim-hoffman mentioned this pull request Jan 2, 2024

implement missed case in CallBucket #84

Merged

tim-hoffman changed the title ~~Further improve perforamance~~ Further improve performance Jan 2, 2024

tim-hoffman marked this pull request as ready for review January 2, 2024 18:17

tim-hoffman requested review from iangneal and 0xddom January 2, 2024 18:17

0xddom reviewed Jan 2, 2024

fix typo

5e36dfe

0xddom approved these changes Jan 2, 2024

tim-hoffman merged commit 560117e into llvm Jan 2, 2024
2 checks passed

tim-hoffman deleted the th/van-929 branch January 2, 2024 20:07
