MACROMEDIA COLDFUSION 4.5-ADMINISTRING COLDFUSION SERVER Technical Information Page 134

  • Download
  • Add to my manuals
  • Print
  • Page
    / 174
  • Table of contents
  • TROUBLESHOOTING
  • BOOKMARKS
  • Rated. / 5. Based on customer reviews
Page view 133
129
CONFIGURING AND ADMINISTERING COLDFUSION 9
Indexing Collections with Verity Spider
Last updated 2/21/2012
Note: Although larger values mean more efficient processing by the indexer, smaller values allow more parallelism on
multi-CPU systems. If an outage occurs during indexing, a smaller value means that fewer documents are lost.
If an outage occurs during indexing, the chunk of documents specified by the -submitsize option is lost because no
transactional rollback occurs for indexing. The documents are no longer in the queue for indexing. When you rerun
the indexing task, Verity Spider can only continue with URLs and documents that are enqueued.
-temp
Syntax
-temp path
Specifies the directory for temporary files (disk cache). By default, the temp directory is under the job directory
(optionally specified with the
-jobpath option).
If you do not specify a value for this option, Verity Spider creates a /spider/temp directory within the collection. For
multiple-collection tasks, the first collection specified is used.
Note: Make sure that the location you specify contains enough disk space to handle the documents that are downloaded
and held before indexing. The documents are deleted from the hard disk after they are indexed.
See also
-jobpath” on page 123, for specifying the location of all indexing job directories and files, one of which is the temp
directory.
Networking options
The Verity Spider networking options are listed here.
-agentname
Type
Web crawling only
Syntax
-agentname string
Specifies the value for the agent name field that is part of the HTTP request. You can use the -agentname option to
impersonate a browser client because web servers can be configured to return different versions of the same page
depending on the requesting agent.
Use double-quotation marks if the name contains a space. Use the -cmdfile option if the agent name you want to use
contains forbidden characters, such as slashes or backslashes.
-connections
Syntax
-connections num_connections
Page view 133
1 2 ... 129 130 131 132 133 134 135 136 137 138 139 ... 173 174

Comments to this Manuals

No comments