deduper / org.bradfordmiller.deduper

Package org.bradfordmiller.deduper

Types

Deduper

dedupes data based on config settings

class Deduper

DedupeReport

summary of a dedupe operation, with total recordCount, hashColumns used, columnsFound in the actual source query, dupeCount, distinctDupeCount, and dupes found in the dedupe process.

data class DedupeReport

DeduperProducer

class DeduperProducer : Runnable

SampleRow

reprsentation of a sample of data showing the comma-delimited sampleString and the associated sampleHash for that sample string

data class SampleRow