Please note that warp image (e.g. persective transformation, rotation, scaling) is done by CPU on all STM32 targets. This is very slow compared to any hardware accelerated copy operation done by DMA2D on STM32 targets.
If possible, avoid the usage of WarpImage in order to improve the performance.
Additinaly hints can be provided, if you can share more details / some images of the resulting GUI.