model	scenario	result	duration_s	notes
glm-4.5-air	aggressive-client	FAIL	110	07:15:34 - LiteLLM:WARNING: common_utils.py:979 - litellm: could not pre-load bedrock-runtime response stream s
glm-4.5-air	callback-quick-yes	PASS	43	
glm-4.5-air	callback-recorded	FAIL	80	07:18:07 - LiteLLM:WARNING: common_utils.py:979 - litellm: could not pre-load bedrock-runtime response stream s
glm-4.5-air	callback-refused	FAIL	72	07:19:27 - LiteLLM:WARNING: common_utils.py:979 - litellm: could not pre-load bedrock-runtime response stream s
glm-4.5-air	chat-callback-recorded-via-http	SKIP	0	no golden_pass
glm-4.5-air	chat-callback-recorded	PASS	55	
glm-4.5-air	correction-mid-call	PASS	94	
glm-4.5-air	off-topic	PASS	32	
glm-4.5-air	unclear-request	PASS	47	
glm-4.5-air	wrong-number	PASS	37	
