Skip to content

Commit

Permalink
better warning about dupes in sample, closes dedupeio#642
Browse files Browse the repository at this point in the history
  • Loading branch information
fgregg committed Apr 30, 2018
1 parent 422f865 commit 5b91967
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion dedupe/core.py
Original file line number Diff line number Diff line change
Expand Up @@ -83,7 +83,7 @@ def randomPairsMatch(n_records_A, n_records_B, sample_size):
def randomPairsWithReplacement(n_records, sample_size):
# If the population is very large relative to the sample
# size than we'll get very few duplicates by chance
warnings.warn("There may be duplicates in the sample")
warnings.warn("The same record pair may appear more than once in the sample")

try:
random_indices = numpy.random.randint(n_records,
Expand Down

0 comments on commit 5b91967

Please sign in to comment.