Penn State Mark Search Engine documentation Information Technology Services
Documentation Home | User Help | File Types | FAQ | Info. for Web Content Providers | Index Helper | Custom Style Manager | Crawler Info | End-of-Life Scheduled, 2010

The table below provides a list of file types that are crawled and indexed by the Google Search Appliance.

File Type Extension
Access .mdb
AMI/AMI Professional .sam
ANSI Text (7 & 8 bit) .ans, .txt
ASCII Text (7 & 8 bit) .asc, .txt
Corel Presentations .shw
DataEase .dba, .dbm
dBASE .dbf
dBXL .dbf
DEC WPS Plus .dx
DisplayWrite .txt, .doc
Enable .300, .wpf, .ssf
First Choice .ss, .fol
Flash  
FoxBase .dbf
Framework fw3
Freelance .prz, .pre
Harvard Graphics .cht, .ch3
HTML .html, .htm, .shtml
**Special note**
IBM FFT .fft
IBM Revisable Form Text .rft
IBM Writing Assistant .iwa
Ichitaro .jtd
JustWrite .jw
Legacy .chp
Lotus 1-2-3 .wku, .wk1, .wk3, .wk4, .wk5, .wk6
Lotus Manuscript .doc
Lotus Symphony .wr1
Lotus WordPro .lwp
MacWrite II .mcw
MASS11 .aa4, .aa5,.aa6,.aa7,.aa8
Microsoft Multiplan .cod, .col
Microsoft PowerPoint .ppt
Microsoft Project .mpp
Microsoft Rich Text Format .rtf
Microsoft Windows Works .dbf
Microsoft Windows Write .wri
Microsoft Word .doc
Microsoft Works .wps, .wks, .wdb, .wcm
Microsoft WordPad .doc
Mosaic Twin .wku
MSG .msg
Microsoft Excel .xla, .xlc, .xlm, .xls, .xlt, .xlw
MultiMate .doc, .dox, .fnt, .fnx
Navy DIF .dif
Nota Bene .nb
Novell Perfect Works  
Novell Presentations  
Novell WordPerfect .wpd, .wpg, .wpf, .wp5
Office Writer .ow4
Paradox .db, .db3
PC-File+ Letter .ltr
PDF .pdf
Personal R:BASE .rbf
PFS:Professional Plan .tid
PFS:Write .pfb
PostScript .ps
Professional Write .pw1, .pwp
Q&A .qa, .qw, .dtf
QuattroPro .wq1, .wb1, .wb2
Reflex .r2d
Samna Word .sam
SmartWare II .doc, .db, .ws
Sprint .spr
SuperCalc .cal
Total Word .tw
Unicode Text .txt
vCard Electronic Business Card .vcf
Volkswriter 3 & 4 .vw4
VP Planner 3D .wks
Wang PC (IWP) .iwp
WordMARC .wmc
WordPerfect .wp, .wp5, .wpd, .pln, .shw, .wbk, .wkb, .wpf
WordStar .ws, .ws2, .wsd, .ws4, .ws6
XyWrite .xy, .xy3, .xyw

Special Note:
Server-side dynamically generated content such as Active Server Pages (.asp), PHP Hypertext Preprocessor (.php), Cold Fusion (.cfm), Java Server Pages (.jsp), Lotus Notes database files (.nsf), Common Gateway Interface (.cgi), CGI in Perl (.pl), Windows Server Executables (.exe, .com), and others can return content as any of the above, including HTML format. If the format returned is supported, it can be indexed by the search engine. The file type does not always match a specific file name suffix. The suffixes noted above are examples of the most commonly used suffixes for a given type supported by the search engine.


The Pennsylvania State University ©2006. All rights reserved.
Alternative Media - Nondiscrimination Statement
This site maintained by Academic Services and Emerging Technologies, a unit of Information Technology Services.

Comments and suggestions may be directed to The Penn State Search Engine Support Team.

Last revised: Thursday, March 2, 2006.