skip h2d and d2h copies between forward functions in gemma4-31b#20286
skip h2d and d2h copies between forward functions in gemma4-31b#20286Gasoonjia wants to merge 1 commit into
Conversation
Summary: This diff updates gemma4-31b export and runtime pipeline to skip the h2d and d2h copies between prefill and decode, and between previous round next round of decode as well. Differential Revision: D108661628
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20286
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New Failures, 5 Pending, 2 Unrelated FailuresAs of commit aefe4cb with merge base 78e0384 ( NEW FAILURES - The following jobs have failed:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
@Gasoonjia has exported this pull request. If you are a Meta employee, you can view the originating Diff in D108661628. |
This PR needs a
|
Summary: This diff updates gemma4-31b export and runtime pipeline to skip the h2d and d2h copies between prefill and decode, and between previous round next round of decode as well.
Differential Revision: D108661628