Commit Graph

4 Commits

Author SHA1 Message Date
Gareth Jones
ccb208bcac Cache output stride parameters in registers to reduce global loads 2025-02-23 18:44:25 -08:00
Gareth Jones
5fb94d668f Stage accumulator fragment to shared memory using tiled copy 2025-02-23 18:38:15 -08:00
Gareth Jones
9f361aa02e Stage accumulator fragment to shared memory using tiled copy 2025-02-23 18:23:07 -08:00
Jiashi Li
414a2f3eed Initial commit
i
2025-02-24 09:20:23 +08:00