Brooke Vibber
582ddf497f
rather than saving 0 into the high bytes, then adding the high-byte multiplication later, write it directly in place. this saves a few cycles on every iteration, and it adds up nicely. View 1 overview render times: 130XE: 10.050 ms/px - 4m56s 800XL: 10.906 ms/px - 5m21s
9 lines
236 B
Markdown
9 lines
236 B
Markdown
things to try:
|
|
|
|
* patch the entire expanded-ram imul8xe on top of imul8 to avoid the 3-cycle thunk penalty :D
|
|
|
|
* y-axis mirror optimization
|
|
|
|
* extract viewport for display & re-input via keyboard
|
|
|
|
* fujinet screenshot/viewport uploader
|