Penn State Mark Search Engine documentation Information Technology Services
Documentation Home | User Help | File Types | FAQ | Info. for Web Content Providers | Index Helper | Custom Style Manager | Crawler Info

Information About Search Engine Crawlers

The Penn State Search Engine http://search.psu.edu/ uses the following IP addresses and User-Agent HTTP header for crawling new Web sites:

Search ApplianceHostname / IP addressUser-Agent string
Production Google Search Appliance #1, 5.x series search-crawler1.aset.psu.edu / 128.118.142.21 PennStateSpider (http://aset.its.psu.edu/googledocs/crawler_info.html)
Production Google Search Appliance #2, 5.x series search-crawler2.aset.psu.edu / 128.118.142.22 PennStateSpider (http://aset.its.psu.edu/googledocs/crawler_info.html)

The Penn State Search Engine is now supported by two load-balanced machines, which both answer to http://search.psu.edu. For more information on the Penn State Search Engine, please review the documentation.


The Pennsylvania State University ©2009. All rights reserved.
Alternative Media - Nondiscrimination Statement
This site maintained by Academic Services and Emerging Technologies, a unit of Information Technology Services.

Comments and suggestions may be directed to The Penn State Search Engine Support Team.

Last revised: Friday, February 13, 2009.