com.databricks.sdk.service.serving
Interfaces
ServingEndpointsDataPlaneService
ServingEndpointsService
Classes
Ai21LabsConfig
AiGatewayConfig
AiGatewayGuardrailParameters
AiGatewayGuardrailPiiBehavior
AiGatewayGuardrails
AiGatewayInferenceTableConfig
AiGatewayRateLimit
AiGatewayUsageTrackingConfig
AmazonBedrockConfig
AnthropicConfig
AutoCaptureConfigInput
AutoCaptureConfigOutput
AutoCaptureState
BuildLogsRequest
BuildLogsResponse
ChatMessage
CohereConfig
CreateServingEndpoint
DatabricksModelServingConfig
DataframeSplitInput
DeleteResponse
DeleteServingEndpointRequest
EmbeddingsV1ResponseEmbeddingElement
EndpointCoreConfigInput
EndpointCoreConfigOutput
EndpointCoreConfigSummary
EndpointPendingConfig
EndpointState
EndpointTag
ExportMetricsRequest
ExportMetricsResponse
ExternalModel
ExternalModelUsageElement
FoundationModel
GetOpenApiRequest
GetOpenApiResponse
GetServingEndpointPermissionLevelsRequest
GetServingEndpointPermissionLevelsResponse
GetServingEndpointPermissionsRequest
GetServingEndpointRequest
GoogleCloudVertexAiConfig
ListEndpointsResponse
LogsRequest
ModelDataPlaneInfo
OpenAiConfig
PaLmConfig
PatchServingEndpointTags
PayloadTable
PutAiGatewayRequest
PutAiGatewayResponse
PutRequest
PutResponse
QueryEndpointInput
QueryEndpointResponse
RateLimit
Route
ServedEntityInput
ServedEntityOutput
ServedEntitySpec
ServedModelInput
ServedModelOutput
ServedModelSpec
ServedModelState
ServerLogsResponse
ServingEndpoint
ServingEndpointAccessControlRequest
ServingEndpointAccessControlResponse
ServingEndpointDetailed
ServingEndpointPermission
ServingEndpointPermissions
ServingEndpointPermissionsDescription
ServingEndpointPermissionsRequest
ServingEndpointsAPI
ServingEndpointsDataPlaneAPI
TrafficConfig
V1ResponseChoiceElement
Enums
AiGatewayGuardrailPiiBehaviorBehavior
AiGatewayRateLimitKey
AiGatewayRateLimitRenewalPeriod
AmazonBedrockConfigBedrockProvider
ChatMessageRole
EmbeddingsV1ResponseEmbeddingElementObject
EndpointStateConfigUpdate
EndpointStateReady
ExternalModelProvider
QueryEndpointResponseObject
RateLimitKey
RateLimitRenewalPeriod
ServedModelInputWorkloadSize
ServedModelInputWorkloadType
ServedModelStateDeployment
ServingEndpointDetailedPermissionLevel
ServingEndpointPermissionLevel