deduper / org.bradfordmiller.deduper.persistors / HashPersistor

HashPersistor

interface HashPersistor : WritePersistor<HashRow>

definition for writing out hash values of rows found in source data

Functions

createHashTable

creates the hash file or sql table. deleteIfDupeExists determines if the hash file/table is deleted/dropped before creation.

abstract fun createHashTable(deleteIfHashTableExists: Boolean): Unit

writeRows

writes out list of hash values contained in rows

abstract fun writeRows(rows: MutableList<HashRow>): Long

Inheritors

CsvHashPersistor

creates and writes out hash values found in a deduper process to a csv defined in config

class CsvHashPersistor : CsvPersistor, HashPersistor

SqlHashPersistor

creates a sql table for persisting hashed data rows. This is configured using the hashJndi contained in the associated context

class SqlHashPersistor : HashPersistor