TSV File Format
Like CSV, but values are separated by tab characters instead of commas. Preferred when data already contains commas — common in scientific, bioinformatics, and academic datasets.
What a Tab-Separated Values file is
Each line is a record and fields are separated by a tab character (U+0009). The first line is typically a header row.
Because tabs almost never appear inside real values, TSV usually needs no quoting at all, which makes it simpler and less ambiguous to parse than CSV.
A short history
Tab-separated values are as old as CSV and share the same idea: plain-text rows with a single-character delimiter — except the delimiter is a tab.
TSV became the default in domains where values routinely contain commas, especially scientific computing, bioinformatics, and natural-language datasets, where comma-delimited files would need heavy quoting.
Strengths
- No quoting needed for values containing commas
- Plain text, streamable, and editor-friendly
- Standard in bioinformatics (BED, VCF derivatives) and NLP corpora
- Easier to parse than CSV — tab is rare inside values
Weaknesses
- Tab characters inside values still need escaping (uncommon but real)
- Less recognized by consumer tools than CSV
- No types, formulas, or formatting — same limits as CSV
When TSV is the right choice
- Your values contain commas and you want to avoid quoting
- You're working with scientific, bioinformatics, or NLP datasets
- You're consuming database COPY/dump output that defaults to tabs
- You want a plain-text, streamable format that's easy to parse
When to reach for something else
- You're handing a file to consumer tools that expect CSV (CSV is more widely recognized)
- Your values can contain literal tab characters (then you still need escaping)
- You need types, formulas, or formatting (use XLSX)
TSV pitfalls that cause silent data corruption
These are the traps that turn a clean file into wrong data, usually without an error message.
Tabs inside values still break things
It's rare, but a value containing a literal tab will split into extra columns unless it's escaped. Free-text fields pasted from elsewhere are the usual culprit.
Tabs and spaces look identical
A file that uses runs of spaces instead of real tabs looks like TSV but won't parse as it. Confirm the delimiter is an actual tab character, not aligned spaces.
Same type limits as CSV
TSV has no notion of numbers, dates, or booleans, so the same leading-zero and long-number corruption that hits CSV in spreadsheet apps hits TSV too.
Convert TSV to another format
Frequently asked questions
TSV vs CSV, which is better?▾
Neither is universally better. TSV avoids quoting when values contain commas and is simpler to parse, which is why scientific and database tools prefer it. CSV is more widely recognized by consumer apps. Pick TSV when your data has commas; pick CSV for broad compatibility.
Why does my TSV open as one column?▾
The receiving tool is probably splitting on commas, not tabs. Tell it the delimiter is a tab, or convert the file to CSV first.
Do TSV files need quoting?▾
Usually not, because tabs rarely appear inside values. The exception is free-text fields that may contain a literal tab, which still need escaping.