public class CombinePerKeyExamples extends Object
Concepts: the Combine.perKey transform, which lets you combine the values in a key-grouped PCollection, and how to use an Aggregator to track information in the Monitoring UI.
Note: Before running this example, you must create a BigQuery dataset to contain your output table.
To execute this pipeline locally, specify general pipeline configuration:
--project=YOUR_PROJECT_ID
and the BigQuery table for the output:
--output=YOUR_PROJECT_ID:DATASET_ID.TABLE_ID
To execute this pipeline using the Dataflow service, specify pipeline configuration:
--project=YOUR_PROJECT_ID
--stagingLocation=gs://<STAGING DIRECTORY>
--runner=BlockingDataflowPipelineRunner
and the BigQuery table for the output:
--output=YOUR_PROJECT_ID:DATASET_ID.TABLE_ID
The BigQuery input table defaults to publicdata:samples.shakespeare and can be overridden with --input.
| Modifier and Type | Class and Description |
|---|---|
| static class | CombinePerKeyExamples.ConcatWords: A 'combine function' used with the Combine.perKey transform. |
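
To make the idea of a per-key combine function concrete, here is a minimal sketch in plain Java. It does not use the Dataflow SDK. The names PerKeyCombineSketch, concatWords, and combinePerKey are hypothetical, and the comma separator and the skipping of empty values are assumptions about how a function such as ConcatWords might behave, not a copy of its implementation.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

public class PerKeyCombineSketch {

    // Hypothetical combine function: joins the non-empty values with commas.
    // The separator and the empty-value handling are assumptions for illustration.
    public static String concatWords(Iterable<String> values) {
        StringBuilder all = new StringBuilder();
        for (String value : values) {
            if (value.isEmpty()) {
                continue;
            }
            if (all.length() > 0) {
                all.append(',');
            }
            all.append(value);
        }
        return all.toString();
    }

    // Local stand-in for a per-key combine: group the values by key,
    // then reduce each group to a single value with the combiner.
    public static Map<String, String> combinePerKey(
            List<Map.Entry<String, String>> pairs,
            Function<Iterable<String>, String> combiner) {
        Map<String, List<String>> grouped = new LinkedHashMap<>();
        for (Map.Entry<String, String> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>())
                   .add(pair.getValue());
        }
        Map<String, String> combined = new LinkedHashMap<>();
        for (Map.Entry<String, List<String>> group : grouped.entrySet()) {
            combined.put(group.getKey(), combiner.apply(group.getValue()));
        }
        return combined;
    }

    public static void main(String[] args) {
        // (word, play name) pairs, made up for this example.
        List<Map.Entry<String, String>> pairs = List.of(
                Map.entry("brave", "hamlet"),
                Map.entry("noble", "othello"),
                Map.entry("brave", "macbeth"));

        combinePerKey(pairs, PerKeyCombineSketch::concatWords)
                .forEach((word, plays) -> System.out.println(word + " -> " + plays));
        // prints:
        // brave -> hamlet,macbeth
        // noble -> othello
    }
}
```

In a real pipeline the grouping and the application of the combine function are handled by the Combine.perKey transform across workers; the sketch only mirrors the end result on an in-memory list.
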
| Constructor and Description |
|---|
| CombinePerKeyExamples() |