guidellm.schemas.response
Backend response models for request and response handling.
Provides standardized response models for generation operations that capture output text, usage metrics, and compilation of request statistics. Ensures consistent data handling and statistics aggregation across different backend implementations.
GenerationResponse
Bases: StandardBaseModel
Response model for backend generation operations.
Captures the output and metrics from a generation request, providing structured data for text output, tool call payloads, token usage statistics, and compilation of detailed request statistics for analysis and monitoring purposes.
Example: :: response = GenerationResponse( request_id="req-123", text="Generated response text", input_metrics=UsageMetrics(token_count=50), output_metrics=UsageMetrics(token_count=25) ) stats = response.compile_stats(request, info)
Source code in src/guidellm/schemas/response.py
27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 | |
compile_stats(request, info, prefer_response=True)
Compile and return comprehensive request statistics.
Merges metrics from the request and response objects to create a complete statistical record, with preference given to response-level metrics when available to ensure accuracy of actual execution data.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
request | GenerationRequest | The original generation request containing input data | required |
info | RequestInfo | Metadata and timing information for the request execution | required |
prefer_response | bool | Whether to prefer response metrics over request metrics when both are available | True |
Returns:
| Type | Description |
|---|---|
GenerativeRequestStats | A GenerativeRequestStats object containing detailed statistics |
Raises:
| Type | Description |
|---|---|
ValueError | When request IDs don't match between objects |
Source code in src/guidellm/schemas/response.py
ToolCall
Bases: BaseModel
A single tool call from an OpenAI-compatible API response.