Class ExperimentEvent

    • Constructor Detail

    • Method Detail

      • id

         final String id()

        A unique identifier for the experiment event. If you don't provide one, Braintrust will generate one for you

      • _xactId

         final String _xactId()

        The transaction id of an event is unique to the network operation that processed the event insertion. Transaction ids are monotonically increasing over time and can be used to retrieve a versioned snapshot of the experiment (see the version parameter)
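
        Because transaction ids increase monotonically, a snapshot at a given version can be thought of as "everything at or below that id". The following is a minimal sketch of that idea only, not the service's implementation. It assumes transaction ids are decimal integer strings; SnapshotFilter and upToVersion are hypothetical names that are not part of the SDK.

```java
import java.math.BigInteger;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the SDK.
// Assumption: transaction ids are decimal integers encoded as strings.
final class SnapshotFilter {
    private SnapshotFilter() {}

    /** Keeps the transaction ids that are at or below the requested version. */
    static List<String> upToVersion(List<String> xactIds, String version) {
        BigInteger limit = new BigInteger(version);
        List<String> kept = new ArrayList<>();
        for (String id : xactIds) {
            // Compare numerically: as plain strings, "90" would sort after "100".
            if (new BigInteger(id).compareTo(limit) <= 0) {
                kept.add(id);
            }
        }
        return kept;
    }
}
```

        The comparison is numeric on purpose, since lexicographic ordering of the raw strings would not preserve the monotonic order when ids differ in length.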

      • projectId

         final String projectId()

        Unique identifier for the project that the experiment belongs to

      • rootSpanId

         final String rootSpanId()

        A unique identifier for the trace this experiment event belongs to

      • spanId

         final String spanId()

        A unique identifier used to link different experiment events together as part of a full trace. See the tracing guide for full details on tracing

      • context

         final Optional<ExperimentEvent.Context> context()

        Context is additional information about the code that produced the experiment event. It is essentially the textual counterpart to metrics. Use the caller_* attributes to track the location in code which produced the experiment event

      • _expected

         final JsonValue _expected()

        The ground truth value (an arbitrary, JSON serializable object) that you'd compare to output to determine if your output value is correct or not. Braintrust currently does not compare output to expected for you, since there are so many different ways to do that correctly. Instead, these values are just used to help you navigate your experiments while digging into analyses. However, we may later use these values to re-score outputs or fine-tune your models

      • _input

         final JsonValue _input()

        The arguments that uniquely define a test case (an arbitrary, JSON serializable object). Later on, Braintrust will use the input to know whether two test cases are the same between experiments, so they should not contain experiment-specific state. A simple rule of thumb is that if you run the same experiment twice, the input should be identical
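
        To illustrate why the input should be free of experiment-specific state, the sketch below pairs up test cases from two runs by structural equality of their inputs. It uses plain java.util.Map values as stand-ins for JSON objects. InputMatcher and pairByInput are hypothetical names, and this is not a description of how Braintrust itself matches test cases.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of the SDK.
// Each map goes from a test case's input to the output recorded for it.
final class InputMatcher {
    private InputMatcher() {}

    /** Pairs outputs from two runs whose inputs are structurally equal. */
    static Map<Object, Object[]> pairByInput(
            Map<Object, Object> baseline, Map<Object, Object> candidate) {
        Map<Object, Object[]> pairs = new LinkedHashMap<>();
        for (Map.Entry<Object, Object> entry : baseline.entrySet()) {
            Object input = entry.getKey();
            // java.util.Map and List implement structural equals/hashCode,
            // so two separately built but identical inputs match here.
            if (candidate.containsKey(input)) {
                pairs.put(input, new Object[] {entry.getValue(), candidate.get(input)});
            }
        }
        return pairs;
    }
}
```

        If an input carried a timestamp or a run id, no two runs would ever produce equal keys and nothing would pair up, which is the failure mode the rule of thumb above guards against.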

      • metadata

         final Optional<ExperimentEvent.Metadata> metadata()

        A dictionary with additional data about the test example, model outputs, or just about anything else that's relevant, that you can use to help find and analyze examples later. For example, you could log the prompt, example's id, or anything else that would be useful to slice/dice later. The values in metadata can be any JSON-serializable type, but its keys must be strings

      • metrics

         final Optional<ExperimentEvent.Metrics> metrics()

        Metrics are numerical measurements tracking the execution of the code that produced the experiment event. Use "start" and "end" to track the time span over which the experiment event was produced

      • _output

         final JsonValue _output()

        The output of your application, including post-processing (an arbitrary, JSON serializable object), that allows you to determine whether the result is correct or not. For example, in an app that generates SQL queries, the output should be the result of the SQL query generated by the model, not the query itself, because there may be multiple valid queries that answer a single question

      • scores

         final Optional<ExperimentEvent.Scores> scores()

        A dictionary of numeric values (between 0 and 1) to log. The scores should give you a variety of signals that help you determine how accurate the outputs are compared to what you expect and diagnose failures. For example, a summarization app might have one score that tells you how accurate the summary is, and another that measures the word similarity between the generated and ground truth summary. The word similarity score could help you determine whether the summarization was covering similar concepts or not. You can use these scores to help you sort, filter, and compare experiments
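
        As one possible way to produce a word similarity score in the 0 to 1 range, the sketch below computes the Jaccard overlap between the word sets of two texts. The choice of Jaccard similarity is an assumption made for illustration, and WordSimilarity is a hypothetical class that is not part of the SDK.

```java
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical scorer, not part of the SDK.
// Technique: Jaccard similarity, |A ∩ B| / |A ∪ B|, over lowercase word sets.
final class WordSimilarity {
    private WordSimilarity() {}

    /** Splits text on non-word characters and returns the distinct lowercase words. */
    private static Set<String> words(String text) {
        Set<String> result = new HashSet<>();
        for (String word : text.toLowerCase(Locale.ROOT).split("\\W+")) {
            if (!word.isEmpty()) {
                result.add(word);
            }
        }
        return result;
    }

    /** Returns a value in [0, 1]; 1.0 means both texts use the same set of words. */
    static double score(String generated, String expected) {
        Set<String> a = words(generated);
        Set<String> b = words(expected);
        if (a.isEmpty() && b.isEmpty()) {
            return 1.0; // two empty texts are treated as identical
        }
        Set<String> intersection = new HashSet<>(a);
        intersection.retainAll(b);
        Set<String> union = new HashSet<>(a);
        union.addAll(b);
        return (double) intersection.size() / union.size();
    }
}
```

        The result could then be logged under a key such as "word_similarity" alongside other scores. That key name is illustrative only.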

      • spanParents

         final Optional<List<String>> spanParents()

        An array of the parent span_ids of this experiment event. This should be empty for the root span of a trace, and should most often contain just one parent element for subspans
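
        The parent links described above are enough to rebuild the shape of a trace. The sketch below does this with a small stand-in type instead of ExperimentEvent itself. TraceTree and Span are hypothetical names that only mirror the span id and span parents fields.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of the SDK.
final class TraceTree {
    private TraceTree() {}

    /** Stand-in for the span-related fields of an experiment event. */
    static final class Span {
        final String spanId;
        final List<String> spanParents;

        Span(String spanId, List<String> spanParents) {
            this.spanId = spanId;
            this.spanParents = spanParents;
        }
    }

    /** Maps each parent span id to the ids of its direct children, in input order. */
    static Map<String, List<String>> childrenByParent(List<Span> spans) {
        Map<String, List<String>> children = new LinkedHashMap<>();
        for (Span span : spans) {
            for (String parent : span.spanParents) {
                children.computeIfAbsent(parent, key -> new ArrayList<>()).add(span.spanId);
            }
        }
        return children;
    }

    /** Returns the ids of spans that have no parents, i.e. the roots of their traces. */
    static List<String> roots(List<Span> spans) {
        List<String> result = new ArrayList<>();
        for (Span span : spans) {
            if (span.spanParents.isEmpty()) {
                result.add(span.spanId);
            }
        }
        return result;
    }
}
```

        With real events, the same logic would read the values returned by spanId() and spanParents(), treating an absent Optional as an empty parent list.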

      • _id

         final JsonField<String> _id()

        Returns the raw JSON value of id.

        Unlike id, this method doesn't throw if the JSON field has an unexpected type.

      • _tags

         final JsonField<List<String>> _tags()

        Returns the raw JSON value of tags.

        Unlike tags, this method doesn't throw if the JSON field has an unexpected type.

      • builder

         final static ExperimentEvent.Builder builder()

        Returns a mutable builder for constructing an instance of ExperimentEvent.

        The following fields are required:

        .id()
        ._xactId()
        .created()
        .experimentId()
        .projectId()
        .rootSpanId()
        .spanId()