Empowers users to fuzzily merge data frames with millions or tens of millions of rows in minutes with low memory usage. The package uses the locality-sensitive hashing algorithms developed by Datar, Immorlica, Indyk and Mirrokni (2004) <doi:10.1145/997817.997857> and Broder (1998) <doi:10.1109/SEQUEN.1997.666900> to avoid comparing every pair of records across the two datasets, so that fuzzy merges finish in linear time.
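To make the idea of avoiding all-pairs comparison concrete, here is a minimal Python sketch of MinHash with banding, the style of locality-sensitive hashing associated with Broder's work for Jaccard similarity. It is an illustration only and not the package's Rust implementation: every function name, the parameters `n_bands` and `band_width`, and the default values are assumptions chosen for this example.

```python
import hashlib
from collections import defaultdict


def shingles(text, width=2):
    """Return the set of character n-grams of a lowercased string."""
    text = text.lower()
    return {text[i:i + width] for i in range(max(len(text) - width + 1, 1))}


def _hash(seed, token):
    """Deterministic 64-bit hash of a token under a given seed."""
    digest = hashlib.blake2b(f"{seed}:{token}".encode(), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def minhash_signature(tokens, n_hashes):
    """Keep the minimum hash per seed; two sets agree on one entry
    with probability equal to their Jaccard similarity."""
    return tuple(min(_hash(seed, t) for t in tokens) for seed in range(n_hashes))


def bands(signature, n_bands, band_width):
    """Split a signature into (band index, band contents) bucket keys."""
    for b in range(n_bands):
        yield b, signature[b * band_width:(b + 1) * band_width]


def candidate_pairs(left, right, n_bands=20, band_width=2):
    """Return index pairs (i, j) whose signatures match on at least one band.

    Each record is hashed once, so this step scales with the number of
    records rather than the number of pairs, provided buckets stay small.
    """
    n_hashes = n_bands * band_width
    buckets = defaultdict(list)
    for i, s in enumerate(left):
        sig = minhash_signature(shingles(s), n_hashes)
        for key in bands(sig, n_bands, band_width):
            buckets[key].append(i)
    pairs = set()
    for j, s in enumerate(right):
        sig = minhash_signature(shingles(s), n_hashes)
        for key in bands(sig, n_bands, band_width):
            for i in buckets.get(key, ()):
                pairs.add((i, j))
    return pairs


def jaccard(a, b):
    """Exact Jaccard similarity of two sets."""
    return len(a & b) / len(a | b)


def fuzzy_join(left, right, threshold=0.6, **lsh_args):
    """Verify only the LSH candidates against the exact similarity."""
    return sorted(
        (i, j)
        for i, j in candidate_pairs(left, right, **lsh_args)
        if jaccard(shingles(left[i]), shingles(right[j])) >= threshold
    )


left = ["Beniamino Green", "Etienne Bacher"]
right = ["Etienne Bacher", "beniamino green"]
matches = fuzzy_join(left, right)
```

Records that share no bucket are never compared, which is where the savings over an exhaustive join come from. More bands make a true match more likely to be caught, while wider bands reduce spurious candidates, so the two parameters trade recall against the amount of verification work.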
Version: 0.2.1
Depends: R (≥ 4.2)
Imports: collapse, dplyr, tibble, tidyr
Suggests: babynames, covr, fuzzyjoin, igraph, knitr, microbenchmark, profmem, purrr, rmarkdown, stringdist, testthat (≥ 3.0.0), tidyverse, vdiffr
Published: 2025-04-13
DOI: 10.32614/CRAN.package.zoomerjoin
Author: Beniamino Green [aut, cre, cph], Etienne Bacher [ctb], The authors of the dependency Rust crates [ctb, cph] (see inst/AUTHORS file for details)