Question 1

What counts as a duplicate row?

Accepted Answer

A row is considered a duplicate if every cell value is identical to a row that appeared earlier in the file. All columns are compared — there's no concept of a "key column" in this tool. If you need to dedupe based on a subset of columns (say, only on `email` while ignoring the `last_seen_at` column), use the SheetCompare app's single-file mode where you can pick the key columns explicitly.

Question 2

Which copy of the duplicate is kept?

Accepted Answer

The first occurrence. Rows are processed top-to-bottom; the first time a row appears it's kept, and any later identical rows are dropped. So if your file is sorted by date with newest at the top, you'll keep the newest copy. If newest is at the bottom, you'll keep the oldest. Sort the file before deduping if which copy survives matters.

Question 3

Is the comparison case-sensitive?

Accepted Answer

Yes. "jane@example.com" and "Jane@Example.com" are treated as different rows. Same for whitespace — " Jane " and "Jane" are different. If you need case-insensitive or whitespace-trimmed matching, normalize the values in Excel or your script first (lowercase + TRIM), then run them through this tool.

Question 4

Is row order preserved?

Accepted Answer

Yes. We don't sort the output — surviving rows come out in the same order they appeared in the input. This matters when your file is already in a meaningful order (chronological, by priority, etc.) and you want dedupe to be a passive cleanup pass rather than a reshuffle.

Question 5

How do I see how many rows were removed?

Accepted Answer

The preview shows the row count after dedupe. Compare that against your original file's row count to get the number removed — for example, if your input had 5,200 rows and the preview shows 4,873, you removed 327 duplicates. We don't surface the diff number directly in the UI today; that's on the roadmap.

Question 6

Does it work on Excel files with multiple sheets?

Accepted Answer

Only the first sheet is read and deduped. The output preserves whatever format you uploaded — drop an .xlsx and get an .xlsx back, but with a single sheet containing the cleaned data. If you need to dedupe each sheet of a multi-sheet workbook, run them separately.

Question 7

How big a file can I dedupe?

Accepted Answer

Because everything runs in your browser, memory is the bottleneck. A few hundred thousand rows works comfortably on a modern laptop; multi-million-row files may stall or fail. For very large datasets, use a command-line tool like `awk '!seen[$0]++'` (for full-line dedupe) or a database `SELECT DISTINCT`.

Remove Duplicate Rows

Three steps, no signup

Drop your file

We strip exact-match duplicates

Download the deduplicated file

Frequently asked questions

Need to compare two files?