
Google Search Appliance: Administrative API Developer’s Guide: Protocol 30
The response result is as follows:
<?xml version='1.0' encoding='UTF-8'?>
<entry xmlns='http://www.w3.org/2005/Atom'
       xmlns:gsa='http://schemas.google.com/gsa/2007'>
  <id>http://gsa:8000/feeds/command/pauseCrawl</id>
  <updated>2008-12-11T08:55:57.824Z</updated>
  <link rel='self' type='application/atom+xml'
        href='http://gsa:8000/feeds/command/pauseCrawl'/>
  <link rel='edit' type='application/atom+xml'
        href='http://gsa:8000/feeds/command/pauseCrawl'/>
  <gsa:content name='entryID'>pauseCrawl</gsa:content>
  <gsa:content name='pauseCrawl'>0</gsa:content>
</entry>
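As a rough illustration, a client could read the crawl state out of a response like the one above. The sketch below uses only the Python standard library. The function name is_crawl_paused is hypothetical, and treating any value other than 0 as "paused" is an assumption, since the response shown here only demonstrates the value 0. The sample entry is abridged (the link elements are omitted).

```python
import xml.etree.ElementTree as ET

GSA_NS = "http://schemas.google.com/gsa/2007"

# Abridged copy of the response entry shown above, using straight quotes
# so that it parses as XML.
SAMPLE_RESPONSE = b"""<?xml version='1.0' encoding='UTF-8'?>
<entry xmlns='http://www.w3.org/2005/Atom'
       xmlns:gsa='http://schemas.google.com/gsa/2007'>
  <id>http://gsa:8000/feeds/command/pauseCrawl</id>
  <updated>2008-12-11T08:55:57.824Z</updated>
  <gsa:content name='entryID'>pauseCrawl</gsa:content>
  <gsa:content name='pauseCrawl'>0</gsa:content>
</entry>"""


def is_crawl_paused(entry_xml: bytes) -> bool:
    """Return True if the entry's pauseCrawl property is anything but 0.

    Assumption: 0 means the crawl is running; any other value means paused.
    """
    root = ET.fromstring(entry_xml)
    for content in root.iter(f"{{{GSA_NS}}}content"):
        if content.get("name") == "pauseCrawl":
            return (content.text or "").strip() != "0"
    raise ValueError("no pauseCrawl property in entry")
```

Calling is_crawl_paused(SAMPLE_RESPONSE) on the entry above reports that the crawl is not paused.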
Pausing or Resuming Crawl
To pause or resume crawl, send an authenticated PUT request to the following URL:
http://Search_Appliance:8000/feeds/command/pauseCrawl
The following is an example of a request to resume crawl:
<?xml version='1.0' encoding='UTF-8'?>
<entry xmlns='http://www.w3.org/2005/Atom'
       xmlns:gsa='http://schemas.google.com/gsa/2007'>
  <gsa:content name='entryID'>pauseCrawl</gsa:content>
  <gsa:content name='pauseCrawl'>0</gsa:content>
</entry>
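The PUT request described above might be assembled as in the following Python sketch, which uses only the standard library. Several details are assumptions rather than facts stated in this section: the function name build_pause_crawl_request is hypothetical, the value 1 is assumed to pause the crawl (only 0, for resume, is shown here), and the Content-Type and Authorization header formats are assumed and should be checked against the guide's authentication section.

```python
import urllib.request

# Atom entry body modeled on the resume example above.
ENTRY_TEMPLATE = (
    "<?xml version='1.0' encoding='UTF-8'?>\n"
    "<entry xmlns='http://www.w3.org/2005/Atom'\n"
    "       xmlns:gsa='http://schemas.google.com/gsa/2007'>\n"
    "  <gsa:content name='entryID'>pauseCrawl</gsa:content>\n"
    "  <gsa:content name='pauseCrawl'>{value}</gsa:content>\n"
    "</entry>\n"
)


def build_pause_crawl_request(host: str, auth_token: str, paused: bool):
    """Build (but do not send) the PUT request for the pauseCrawl entry."""
    url = f"http://{host}:8000/feeds/command/pauseCrawl"
    # Assumption: 1 pauses the crawl; 0 resumes it, as in the example above.
    body = ENTRY_TEMPLATE.format(value=1 if paused else 0).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="PUT",
        headers={
            # Assumed header formats; verify against the authentication docs.
            "Content-Type": "application/atom+xml",
            "Authorization": f"GoogleLogin auth={auth_token}",
        },
    )
```

To send the request, pass the returned object to urllib.request.urlopen. Building the request separately from sending it makes the body and headers easy to inspect before anything reaches the appliance.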
Document Status
Retrieve the status of the documents that have been crawled and served using the documentStatus entry of the status feed. The properties for the document status are:

Property            Description
crawledURLsToday    The number of documents crawled since midnight. (Midnight pertains to the time that is set on the search appliance.)
crawlPagePerSecond  Current crawling rate measured in pages per second.
errorURLsToday      Document errors that occurred since midnight on the search appliance.
filteredBytes       Document bytes that have been filtered by domain, language, file type, or metadata.
foundURLs           The number of URLs found that match crawl patterns.
servedURLs          The total number of documents that have been served.

Retrieving Document Status
To retrieve document status, send an authenticated GET request to the following URL:
http://Search_Appliance:8000/feeds/status/documentStatus
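The documentStatus request and properties described in this section could be handled along the lines of the Python sketch below (standard library only). It assumes the response is an Atom entry with one gsa:content element per property, in the same shape as the pauseCrawl entry; this section does not show a documentStatus response, so that shape is an assumption. The function names and the Authorization header format are likewise hypothetical.

```python
import urllib.request
import xml.etree.ElementTree as ET

GSA_NS = "http://schemas.google.com/gsa/2007"

# Property names taken from the document status table.
DOCUMENT_STATUS_PROPERTIES = (
    "crawledURLsToday",
    "crawlPagePerSecond",
    "errorURLsToday",
    "filteredBytes",
    "foundURLs",
    "servedURLs",
)


def build_document_status_request(host: str, auth_token: str):
    """Build (but do not send) the GET request for the documentStatus entry."""
    url = f"http://{host}:8000/feeds/status/documentStatus"
    return urllib.request.Request(
        url,
        method="GET",
        # Assumed header format; verify against the authentication docs.
        headers={"Authorization": f"GoogleLogin auth={auth_token}"},
    )


def extract_document_status(entry_xml: bytes) -> dict:
    """Collect the known status properties from an entry as raw strings.

    Assumption: each property arrives as <gsa:content name='...'>.
    Values are left as strings because their exact format is not
    documented in this section.
    """
    root = ET.fromstring(entry_xml)
    found = {
        el.get("name"): (el.text or "").strip()
        for el in root.iter(f"{{{GSA_NS}}}content")
    }
    return {n: found[n] for n in DOCUMENT_STATUS_PROPERTIES if n in found}
```

A caller would send the request with urllib.request.urlopen and pass the response bytes to extract_document_status. Properties missing from the response are simply omitted from the result.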