Test is failing on H100, H200 and B200. Currently passing in CI. Test is commented out for now. results are not the same: ref_result={'test': 'pure_text', 'logprobs ...