deduper / org.bradfordmiller.deduper / Deduper / dedupe
dedupe
fun dedupe(commitSize: Long = 500, outputReportCommitSize: Long = 1000000): DedupeReport
runs a dedupe process and returns a dedupe report
Parameters
commitSize
-
- the number of rows to write in a single transation
outputReportCommitSize
-
- the number of total rows committed before logging the result