public class DeDupExample extends Object
Concepts: the RemoveDuplicates transform, and how to wire transforms together.
Demonstrates TextIO.Read, RemoveDuplicates, and TextIO.Write.
To execute this pipeline locally, specify the general pipeline configuration, --project=YOUR_PROJECT_ID, and either a local output file or an output prefix on GCS: --output=[YOUR_LOCAL_FILE | gs://YOUR_OUTPUT_PREFIX]
To execute this pipeline using the Dataflow service, specify the pipeline configuration, --project=YOUR_PROJECT_ID --stagingLocation=gs://YOUR_STAGING_DIRECTORY --runner=BlockingDataflowPipelineRunner, and an output prefix on GCS: --output=gs://YOUR_OUTPUT_PREFIX
The input defaults to gs://dataflow-samples/shakespeare/* and can be
overridden with --input.
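The read, de-duplicate, write chain described above can be approximated in plain Java to show what the pipeline computes. This is only a rough sketch using the standard library, not the Dataflow SDK: the class name DeDupSketch, the helper methods, and the temporary files are all hypothetical. It also keeps lines in first-seen order, whereas Dataflow makes no ordering guarantee for the elements of a PCollection.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;

// Hypothetical stand-in for the example pipeline; does not use the Dataflow SDK.
public class DeDupSketch {

    // Middle stage: keep one copy of each distinct line (first-seen order).
    static List<String> removeDuplicates(List<String> lines) {
        return new ArrayList<>(new LinkedHashSet<>(lines));
    }

    // Wires the three stages together: read lines, drop duplicates, write lines.
    static void run(Path input, Path output) throws IOException {
        List<String> lines = Files.readAllLines(input);   // stands in for TextIO.Read
        List<String> unique = removeDuplicates(lines);    // stands in for RemoveDuplicates
        Files.write(output, unique);                      // stands in for TextIO.Write
    }

    public static void main(String[] args) throws IOException {
        // Assumed sample data; the real example reads the Shakespeare corpus by default.
        Path input = Files.createTempFile("dedup-in", ".txt");
        Path output = Files.createTempFile("dedup-out", ".txt");
        Files.write(input, Arrays.asList("to be", "or not", "to be"));

        run(input, output);
        System.out.println(Files.readAllLines(output)); // prints [to be, or not]
    }
}
```

In the actual example, each stage is a transform applied to the output of the previous one, so the distributed runner, rather than a single in-memory set, decides how duplicates are detected.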
| Constructor and Description |
|---|
| DeDupExample() |