| Document Type | Estimate of Actual Text in the Document |
|---|---|
| Plain text formats (such as TXT, CSV, log files, or source code) | 100% |
| Text markup formats (such as XML, HTML, or Markdown) | 100% |
| Word processing formats (such as DOCX, ODT, RTF) | 10%–50% (lower for image‑heavy documents, higher for text‑heavy documents) |
| Spreadsheets (such as XLSX, ODS) | 10% |
| Presentations (such as PPTX, ODP) | 10% |
| PDF documents | 10% |


| Document Type | Total File Size (GB) | Estimated Text Percentage | Estimated Text Data (GB) |
|---|---|---|---|
| Plain text documents | 17 | 100% | 17 |
| Text markup documents | 24 | 100% | 24 |
| Word processing documents | 13 | 25% (moderate image usage) | 3.25 |
| Spreadsheet documents | 2 | 10% | 0.2 |
| Presentation documents | 3 | 10% | 0.3 |
| PDF documents | 15 | 10% | 1.5 |
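
The arithmetic behind the second table can be reproduced with a short script. This is a minimal sketch, not an official sizing tool: the function name `estimate_text_gb` and the `corpus` dictionary are hypothetical names chosen for illustration, and the sizes and percentages are copied from the example rows above. Percentages are kept as whole numbers so the per-row results match the table values.

```python
# Hypothetical helper: estimated text data = total file size * text percentage.
def estimate_text_gb(size_gb: float, text_pct: float) -> float:
    """Return the estimated text volume in GB for one document type."""
    return size_gb * text_pct / 100


# Example corpus from the table above: type -> (total size in GB, text %).
corpus = {
    "Plain text documents": (17, 100),
    "Text markup documents": (24, 100),
    "Word processing documents": (13, 25),  # moderate image usage
    "Spreadsheet documents": (2, 10),
    "Presentation documents": (3, 10),
    "PDF documents": (15, 10),
}

# Per-type estimates, then the totals across all document types.
estimates = {name: estimate_text_gb(size, pct) for name, (size, pct) in corpus.items()}
total_file_gb = sum(size for size, _ in corpus.values())
total_text_gb = sum(estimates.values())

for name, text_gb in estimates.items():
    print(f"{name}: {text_gb:g} GB of text")
print(f"Total: {total_text_gb:.2f} GB of text out of {total_file_gb} GB of files")
```

For the example figures, the per-type estimates add up to roughly 46.25 GB of text out of 74 GB of files. To size a different corpus, replace the entries in `corpus` with your own file sizes and pick a percentage from the first table.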