optimise luma copy part a bit