+

Search Tips   |   Advanced Search


Configure a crawler to search your local portal site

Configure and run a search crawler on your local portal site to gather information and create a search collection that enables your users to search the portal site. Portal Search provides a default portal site search collection that enables your users to search the portal site. Before your users can search the portal site collection, perform the following tasks.

  1. Optional: Set the crawler user ID. To use a dedicated crawler user ID for crawling the portal site content source, define and set that crawler user ID:

    1. Define the crawler user ID by using the Manage Users and Groups portlet.

    2. Set the preferred language of the portal site crawler user ID to match the language of the portal site search collection that it crawls. (If you do this after you started a crawl on the portal site search collection, reset the portal site collection. Refer to Creating or resetting the portal site collection.)

    3. Edit the portal site collection content source and fill in the crawler user ID and its password. To do this...

      1. Click Administration > Manage Search > Search Collections.

      2. In the search collection list, click the Portal Content search collection.

      3. Click the Edit icon next to the Portal Content Source collection name.

      4. Under the Security tab, type the crawler user ID into the appropriate field.

      5. Under the Security tab, type the crawler password into the appropriate field.

      6. Click Save.

    4. Optional: Configure the crawler to follow external links. If you want the crawler to follow external links from inside the Portal, you can modify the value in the Levels of links to follow field under the General Parameters tab. Set the level to a value higher than 1. In addition, you can configure filters for those external links from the Filters tab. The default filter suppresses any links that point back to Portal pages.

      The default filter is displayed only after saving the configuration of the content source.

  2. Start the initial crawl. Start the initial crawl on the portal site content source:

    1. Click Administration > Manage Search > Search Collections.

    2. In the search collection list, click the Portal Content search collection.

    3. Click the Start Crawler icon (right-pointing arrow) next to the Portal content source name.

  3. Configure regular crawls. If you want regular crawls on the portal site content source, perform either of the following tasks:

    • Enable the default scheduler. To do this...

      1. Click the View Content Source Schedulers icon next to the collection name.

      2. In the Manage Schedulers page, click Disabled.

        This changes the status of the scheduler to Enabled and displays a confirmation message.

    • Set up your own scheduler. To do this...

      1. Click the Edit icon for the content source.

      2. Select the Schedulers tab.

      3. Configure your own scheduler as required. For more details about how to do this, refer to the Manage Search portlet help.

For more detailed information about how to work with content sources refer to Manage the content sources of a search collection and to the Manage Search portlet help.Notes:

  1. The local portal site is exposed through a service that requires SSL. Therefore, if the portal is configured with a Web server and you configure the content source root URL through the Web server, configure the Web server for SSL.

  2. By default, items in the result lists from portal site searches provide no summary information. If end users are using the Search and Browse portlet they can refer to the information given under Description:

    for information about the search result list item. To have the summary information added, configure the portlet with the summary parameter enabled as follows: PortalCollectionSummarizer=on.

  3. When you crawl a portal site, be aware of the Memory required for crawls and the Time required for crawls and imports and availability of documents.

  4. Set the preferred language of the crawler user ID to match the language of the search collection that it crawls.

  5. The portal site search collection is created when an administrator navigates to the Manage Search portlet. However, start the crawl for users to be able to search the portal site. Depending on the portal configuration and environment and possible customization, you might need to reset the portal site search collection that was created. For details about such scenarios and the necessary tasks to perform refer to Creating or resetting the portal site collection.

  6. If your users search the portal site search collection on a secured portal site, refer to the additional information under Enable search on a secured portal site with the default configuration.

When users search a portal site, they can access portal pages of two types:

If you customize search on the portal site, you might find useful information under Configure the default location for search collections and Creating or resetting the portal site collection.

If the portal site is multilingual and your users use different languages to search the portal, refer to Crawl a multilingual portal site.


Parent topic:

Search your local portal site


Related tasks


Crawl a multilingual portal site
Configure search on a secured portal site
Creating or resetting the portal site collection
Manage the content sources of a search collection
Enable search on a secured portal site with the default configuration
Enable anonymous users to search public pages of the portal
Configure the default location for search collections


Related reference


Hints and tips for using Portal Search


Related information


Configure the Web server plug-in for Secure Sockets Layer (WAS information center)