DeepSeek FP8 UE8M0 optimization