This commit is contained in:
Brooke Vibber 2024-12-30 06:48:04 -08:00
parent c4b98c7be2
commit e51aa91e4e

View file

@ -4,11 +4,7 @@ things to try:
* patch the entire expanded-ram imul8xe on top of imul8 to avoid the 3-cycle thunk penalty :D
* square-root special case of multiplication for zx*zx and zy*zy
* the hi1*hi2 and lo1*lo2 8-bit muls can be optimized into a 512-byte lookup table
* jamey on mastodon tried this but had some problems. see what happens on our version!
* double-check rounding behavior is correct
* optimize out a store/load with mul8_add16 and mul8_add24
* try 3.13 fixed point instead of 4.12 for more precision
* can we get away without the extra bit?