Document Indexing Specifications
The Windchill AI Assistant supports a wide range of document and business object types for both on‑premises and SaaS environments.
Document Types
The following table provides a list of document formats supported by the Windchill AI Assistant plugin:
Document Type
Document Name
MIME Type
CSV
Comma Separated List
text/csv
DOCX/DOC/DOCM
Microsoft Word
application/msword
application/vnd.ms-word.document.macroenabled.12
application/vnd.openxmlformats-officedocument.wordprocessingml.document
EML
Email
message/rfc822
EPUB
Electronic Publication
application/epub+zip
GZ
GNU Zip compressed file
application/gzip
application/x-gzip
HTML
Hypertext Markup Language
text/html
JSON
JavaScript Object Notation
application/json
KML
Keyhole Markup Language
application/vnd.google-earth.kml+xm
MSG
Outlook Email
application/vnd.ms-outlook
application/msoutlook
ODP
OpenDocument Presentation
application/vnd.oasis.opendocument.presentation
ODS
OpenDocument Spreadsheet
application/vnd.oasis.opendocument.spreadsheet
ODT
OpenDocument Text
application/vnd.oasis.opendocument.text
PDF
Acrobat Document
application/pdf
PPTX/PPT/PPTM
Microsoft PowerPoint
application/vnd.ms-powerpoint
application/vnd.ms-powerpoint.presentation.macroenabled.12
application/vnd.openxmlformats-officedocument.presentationml.presentation
RTF
Rich Text Format
application/rtf
TXT/LOG (examples)
Plain text
text/plain
XLSX/XLS/XLSM
Microsoft Excel
application/vnd.ms-excel
application/vnd.ms-excel.sheet.macroenabled.12
application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
XML
Generic XML, and special XML – such as Word XML or DITA XML
application/xml
text/xml
application/vnd.ms-wordml
application/vnd.ms-word2006ml
application/dita+xml
ZIP
ZIP compressed file
application/zip
application/x-zip-compressed
Business Object Types
The following table provides a list of business objects and their associated documents supported by the Windchill AI Assistant plugin.
Business Object
Primary Content
Representation(s)
Attachment(s)
Annotation(s)
WTDocument
EPM Documents (CAD)
WTPart
* 
WTPart reference documents are indexed as WTDocuments.
Arbortext Dynamic Document
Change Request
Change Notice
Change Task
Problem Report
CAPA
QMS (Quality Management System) Object Types
QMS object types are used to manage quality-related processes across a product lifecycle and during manufacturing. For more information, see Windchill Quality Management Solutions.
The following table provides a list of QMS object types supported by the Windchill AI Assistant plugin.
Object Type
Purpose
Key Attributes
Supported by Windchill AI Assistant
QMS
QMS Documents
Documents pertaining to quality management
Identical to WTDocuments in all aspects
Yes
Same level of support as WTDocuments (including attachments).
CAPA
CAPA Request
Initiates the corrective and preventive action process
Lot/Serial Number, Batch Number, UDI
Yes (attachments only)
Regulatory Submissions and Regulatory Compliance Objects
Regulatory Master
Tracks regulatory submissions and compliance
Agency, Submission ID, Status
Yes
Same level of support as Change Management objects.
Indexing Limits
The indexing process extracts and processes only textual information from the supported document types listed above. This includes continuous text sections, titles, notes, and tables and similar text‑based information. Images, charts, and other non‑textual content are not indexed.
The indexing limit for an individual document is a maximum file size of 128 MB or 4 million characters.
* 
Documents that exceed the supported limits may cause indexing failures. Use one of the following approaches to avoid such issues:
Split large documents into smaller files.
Prevent indexing of large documents by removing the indexing user’s access to those documents. For more information, see Configuring the Indexing User.
Indexing Performance
The indexing process depends on several factors, including performance of your Windchill system, the network connectivity between Windchill and Microsoft Azure, and the capacity of the configured cloud resources.
Cloud resource capacity depends on your deployment model:
For on‑premises environment, capacity depends on the Azure resources you configured.
For SaaS environment, indexing uses PTC‑managed, scalable cloud resources.
Additionally, Microsoft Azure services may experience variations in performance KPIs based on the region, time of day, or temporary service impacts.
As a reference, indexing typically takes approximately two days for the following dataset:
150,000 documents
Average document size: 500 KB
Indexable content: 100% full text
* 
In some cases, indexing may appear to be stalled due to a known Microsoft Azure indexing issue. When this occurs, indexing automatically resumes after approximately six hours. Microsoft plans to deploy a permanent fix for this issue in a future update.
Was this helpful?