What Happened
- This pull request fixes structured-output chat in run-llama/llama_index by preserving provider-native raw responses and metadata for `StructuredLLM.chat()` and `StructuredLLM.achat()`. Previously these paths rebuilt `ChatResponse` from only the parsed Pydantic payload, which dropped details like token usage and the original provider response object; the update adds an internal `StructuredPredictionResult` wrapper so `structured_predict()` and `astructured_predict()` keep returning parsed models while raw details are retained in the response path for OpenAI chat-completions and responses.parse structured outputs.
- This pull request fixes structured-output chat in run-llama/llama_index by preserving provider-native raw responses and metadata for `StructuredLLM.chat()` and `StructuredLLM.achat()`. Previously these paths rebuilt `ChatResponse` from only the parsed Pydantic payload, which dropped details like token usage and the original provider response object; the update adds an internal `StructuredPredictionResult` wrapper so `structured_predict()` and `astructured_predict()` keep returning parsed models while raw details are retained in the response path for OpenAI chat-completions and responses.parse structured outputs.
- 1 evidence item attached for review.