deduper / org.bradfordmiller.deduper / Deduper

Deduper

class Deduper

dedupes data based on config settings

Types

ControlQueue

enum class ControlQueue

Persistors

data class Persistors

Constructors

<init>

dedupes data based on config settings

Deduper(config: Config)

Functions

dedupe

runs a dedupe process and returns a dedupe report

fun dedupe(commitSize: Long = 500, outputReportCommitSize: Long = 1000000): DedupeReport

getSampleHash

returns a sample row representing all values in that row plus the associated hash of the row

fun getSampleHash(): SampleRow

Companion Object Properties

logger

val logger: Logger!