diyar: Record Linkage and Epidemiological Case Definitions in 'R'

An R package for iterative and batched record linkage, and applying epidemiological case definitions. 'diyar' can be used for deterministic and probabilistic record linkage, or multistage record linkage combining both approaches. It features the implementation of nested match criteria, and mechanisms to address missing data and conflicting matches during stepwise record linkage. Case definitions are implemented by assigning records to groups based on match criteria such as person or place, and overlapping time or duration of events e.g. sample collection dates or periods of hospital stays. Matching records are assigned a unique group ID. Index and duplicate records are removed or further analyses as required.

Version: 0.5.1
Depends: R (≥ 3.5.0)
Imports: methods, utils, ggplot2, rlang
Suggests: knitr, rmarkdown, testthat, covr
Published: 2023-11-12
Author: Olisaeloka Nsonwu
Maintainer: Olisaeloka Nsonwu <olisa.nsonwu at>
License: GPL-3
NeedsCompilation: no
Language: en-GB
Materials: README NEWS
In views: MissingData
CRAN checks: diyar results


Reference manual: diyar.pdf
Vignettes: Introduction to epidemiological case definitions with diyar
Introduction to record linkage with diyar
number_line and overlaps
Divvy up events with partitions


Package source: diyar_0.5.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release (arm64): diyar_0.5.1.tgz, r-oldrel (arm64): diyar_0.5.1.tgz, r-release (x86_64): diyar_0.5.1.tgz, r-oldrel (x86_64): diyar_0.4.3.tgz
Old sources: diyar archive


Please use the canonical form to link to this page.