provide optimized a_ctz_32 for arm