OmniFind Edition IBM DB2 Information Integrator OmniFind Edition Installation Requirements for Enterprise Search Version 8.2.2 Before using this information and the product it supports, be sure to read the general information under "Notices." This document contains proprietary information of IBM. It is provided under a license agreement and Copyright law protects it. The information contained in this publication does not include any product warranties, and any statements provided in this manual should not be interpreted as such. You can order IBM publications online or through your local IBM representative: * To order publications online, go to the IBM Publications Center at www.ibm.com/shop/publications/order. * To find your local IBM representative, go to the IBM Directory of Worldwide Contacts at www.ibm.com/planetwide. When you send information to IBM, you grant IBM a nonexclusive right to use or distribute the information in any way it believes appropriate without incurring any obligation to you. Copyright International Business Machines Corporation 2004, 2005. All rights reserved. Contents 1.0 About the installation requirements 2.0 Required software and supported data sources 3.0 Hardware and disk space requirements for DB2 II OmniFind Edition 4.0 Contacting IBM 4.1 Obtaining product information 4.2 Providing comments on the documentation 5.0 Notices 5.1 Trademarks 1.0 About the installation requirements The Installation Requirements document describes supported operating system levels, prerequisite software, hardware requirements, and supported data sources for DB2(R) Information Integrator OmniFind(TM) Edition (enterprise search). DB2 Information Integrator OmniFind Edition provides a technology called enterprise search. To install the enterprise search solution, ensure that you have the correct prerequisite software. The installation program for enterprise search will help you install the prerequisite software except for the WebSphere(R) Application Server fix packs. 2.0 Required software and supported data sources Before you install DB2 Information Integrator OmniFind Edition (DB2 II OmniFind Edition), ensure that you have the required software and a supported operating system. Supported operating systems DB2 Information Integrator OmniFind Edition (enterprise search) is supported on the following operating systems: * AIX(R) 5L * 5.2 (requires Maintenance Level 4 and the August 2004 C++ Runtime for AIX PTF) * 5.3 (requires Maintenance Level 1 and the August 2004 C++ Runtime for AIX PTF) * Linux(TM) * Red Hat Enterprise Linux Advanced Server Version 3.0, Update 2, kernel 2.4.21-9.ELsmp (requires libstdc++3.2.3-34) * SuSE Linux Enterprise Server 8 with Service Pack 3 (UnitedLinux SP3) (requires libstdc++3.2.2-38) * SuSE Linux Enterprise Server 9 with Service Pack 1 (UnitedLinux SP1) (requires libstdc++-3.3.3-43.28) * Microsoft(R) Windows * Microsoft Windows(R) 2000 Advanced Server * Microsoft Windows 2003 Enterprise Edition To download the AIX PTF: 1. Go to the IBM(R) Software Support site for the August 2004 C++ Runtime for AIX PTF. 2. Download the xlc.rte.60.aug2004.ptf.tar.Z file. Follow the instructions on the Web page to uncompress, untar, and install the PTF. 3. Apply the appropriate maintenance levels for your version of AIX. Go to the following Web site to download AIX fixes: www.ibm.com/servers/eserver/support/pseries/aixfixes.html. Follow the instructions on the Web page to uncompress, untar, and install the PTF. Required software for the DB2 II OmniFind Edition (enterprise search) Enterprise search requires the following software: IBM DB2 Universal Database(TM) Enterprise Server Edition, Version 8.2 DB2 UDB Enterprise Server Edition serves as a repository for collected data. Optional: IBM DB2 Universal Database Information Center, Version 8.2 The DB2 Information Center provides information for DB2 II OmniFind Edition (enterprise search), DB2 Information Integrator, and DB2 Universal Database. If you do not install the information center, when you click a help topic, you will be connected to an IBM Web site that hosts the information center. The information center does not include PDF files. IBM DB2 Universal Database Run-time Client, Version 8.2 The DB2 Run-time Client is required only if you install DB2 II OmniFind Edition on multiple servers. IBM WebSphere Application Server, Version 5.1 with Fix Pack 1 (5.1.1) This software includes a Web application server and the IBM HTTP server. The fix pack is not included with DB2 II OmniFind Edition. IBM WebSphere Application Server Deployment Manager, Version 5.1 with Fix Pack 1 (5.1.1) Deployment Manager is required to allow WebSphere to run multiple copies of itself in the same system. The fix pack is not included with DB2 II OmniFind Edition. Required levels of Java IBM Software Development Kit for Java(TM) 1.3.x or 1.4.x. (SDK for Java 1.5 is not supported) The SDK for Java is required to compile the Java search applications that are created with the enterprise search application programming interfaces (APIs). These SDKs for Java are not required to install DB2 II OmniFind Edition (enterprise search). The enterprise search ESSearchApplication sample and the data listener samples should be compiled with SDK for Java 1.4.x. The SIAPI samples can be compiled with either 1.3.x or 1.4.x. The ESSearchApplication in the ES_INSTALL_ROOT/samples directory must be compiled with IBM SDK for Java, Version 1.4.x and must execute in a JRE Version 1.4 environment. WebSphere Application Server and WebSphere Portal both provide the JRE Version 1.4. Supported data sources You can use enterprise search to create searchable collections from the following data sources. Some of these data sources require additional software. See "Required software for data sources" for more information: DB2 Content Manager, Version 8.2 or 8.3 Accessed with the Content Manager crawler. DB2 Universal Database for Linux, UNIX, and Windows, Version 8.1 and 8.2 Accessed with the DB2 crawler. DB2 Universal Database for z/OS, Version 7 or later Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler. Documentum 4.3 or 5.2.5 Accessed with the VeniceBridge crawler (WebSphere Information Integrator Content Edition, Version 8.2). FileNet Paragon CS 5.3 Accessed with the VeniceBridge crawler (WebSphere Information Integrator Content Edition, Version 8.2). Hummingbird DM 5.1 Accessed with the VeniceBridge crawler (WebSphere Information Integrator Content Edition, Version 8.2). Informix IDS, Version 9 or later Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler. Lotus Domino, Version 5.0 or later, Version 6.0 or later Lotus Domino Server 5.0.9a or later is supported. Accessed with the Notes crawler. Microsoft SQL Server 2000 Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler. Microsoft Exchange Server 2000 or 2003 Accessed with the Exchange Server crawler. Oracle 9i and Oracle 10g Accessed through DB2 Information Integrator, Version 8.2 or later with the DB2 crawler. Required software for data sources To crawl Lotus(R) Domino(R) or Notes databases, DB2 Content Manager databases, federated relational databases, or VeniceBridge sources, install the following versions of these products: IBM Lotus Domino Server 6.0.2 or later for Linux and AIX or Lotus Notes 6.0.2 or later for Windows This software is required if you plan to collect data from Lotus Notes or Domino sources. The Notes crawler for NRPC uses Domino libraries as a Lotus Notes client. You install these libraries by installing Lotus Domino Server on the enterprise search crawler server. To ensure that the Notes crawler can work with the Domino libraries, you run a setup script that DB2 II OmniFind Edition provides on the crawler server after you install the Domino libraries. IBM DB2 Information Integrator for Content, Version 8.2 for Windows and AIX or IBM DB2 Content Manager Toolkit, Version 8.2 for Linux For enterprise search on AIX and Windows, the Content Manager crawler uses the Java(TM) connector for Content Manager, Version 8 to access DB2 Content Manager servers. You install this connector by installing IBM DB2 Information Integrator for Content, Version 8.2 for Windows and AIX on the crawler server. To ensure that the Content Manager crawler can work with DB2 Content Manager, you run a setup script that DB2 II OmniFind Edition provides on the crawler server after you install the connector. For enterprise search on Linux, the Content Manager crawler uses the Java connector for Content Manager, Version 8 to access DB2 Content Manager servers. You install this connector by installing the IBM DB2 Content Manager Linux Toolkit, Version 8.2 on the crawler server. To ensure that the Content Manager crawler can work with DB2 Content Manager, you run a setup script that DB2 II OmniFind Edition provides on the crawler server after you install the connector. IBM WebSphere Information Integrator Content Edition, Version 8.2.1 with hot fixes The VeniceBridge crawler uses Java libraries of WebSphere Information Integrator Content Edition as a Java client. You install these libraries by installing WebSphere Information Integrator Content Edition on the crawler server. To ensure that the VeniceBridge crawler can work with the Java libraries, you run a setup script that DB2 II OmniFind Edition provides on the crawler server after you install the WebSphere Information Integrator Content Edition libraries. If you plan to search FileNet CS or Hummingbird data sources, you must download and install a WebSphere Information Integrator Content Edition hot fix for each. For FileNet CS, install APAR JR21417. For Hummingbird, install APAR JR21708. See the WebSphere Information Integrator Content Edition Support Web site for information about installing the hot fixes. The VeniceBridge product was renamed to WebSphere Information Integrator Content Edition. IBM DB2 Information Integrator, Version 8.2 or later DB2 Information Integrator, Version 8.2 is shipped with DB2 II OmniFind Edition. You can use DB2 Information Integrator to crawl relational databases from DB2 Universal Database for z/OS, Informix IDS, and Oracle 9i and Oracle 10g. 3.0 Hardware and disk space requirements for DB2 II OmniFind Edition Hardware and disk space requirements depend on your operating system and your intended use for DB2 II OmniFind Edition (enterprise search). Hardware requirements Disk space requirements can vary depending on the number of documents that you want to crawl and the types of data sources that you crawl. These requirements assume that you build indexes regularly, which means that new documents are added, removed, or updated in the index. For a multiple server configuration, the space requirements affect the index server. The ES_NODE_ROOT directory requires the most disk space on your system. The following list describes the minimum hardware requirements and minimum disk space requirements for a single server configuration and a multiple server configuration: Small solutions Single server configuration: * 2 or more processors: 2.0 GHz or more for Intel and AMD; 1.5 GHz or more for RISC * 4 - 6 GB of RAM (Add 1 to 2 GB of RAM for each additional active collection.) * 200 GB of disk space based on 1 000 000 documents in one or more collections with an average document size of 20 KB Medium solutions Four-server configuration: * 2 or more processors: 2.0 GHz or more for Intel and AMD; 1.5 GHz or more for RISC * 4 - 6 GB of RAM on each server * 2 TB total disk space based on 7 000 000 documents in one or more collections with an average document size of 20 KB Large solutions Four-server configuration: * 4 or more processors: 2.0 GHz or more for Intel and AMD; 1.5 GHz or more for RISC * 8 GB of RAM on each server (4 GB of RAM for Windows) * 6 TB total disk space based on 10 000 000 documents in one or more collections with an average document size of 20 KB 4.0 Contacting IBM To contact IBM customer service in the United States or Canada, call 1-800-IBM-SERV (1-800-426-7378). To learn about available service options, call one of the following numbers: * In the United States: 1-888-426-4343 * In Canada: 1-800-465-9600 To locate an IBM office in your country or region, see the IBM Directory of Worldwide Contacts on the Web at www.ibm.com/planetwide. 4.1 Obtaining product information Information about DB2 Information Integrator products is available by telephone or on the Web. Information about DB2 Information Integrator products is available by telephone or on the Web. The phone numbers provided here are valid in the United States. 1. To order products or to obtain general information: 1-800-IBM-CALL (1-800-426-2255) 2. To order publications: 1-800-879-2755 3. Visit the Web at www.ibm.com/software/data/integration/db2ii/support.html. This site contains the latest information about: * The technical library * Ordering books * Client downloads * Newsgroups * Fix packs * News * Links to Web resources 4.2 Providing comments on the documentation Please send any comments that you have about this book or other DB2 Information Integrator documentation. Your feedback helps IBM to provide quality information. Please send any comments that you have about this book or other DB2 Information Integrator documentation.You can use any of the following methods to provide comments: 1. Send your comments using the online readers' comment form at www.ibm.com/software/data/rcf. 2. Send your comments by e-mail to comments@us.ibm.com. Include the name of the product, the version number of the product, and the name and part number of the book (if applicable). If you are commenting on specific text, please include the location of the text (for example, a title, a table number, or a page number). 5.0 Notices This information was developed for products and services offered in the U.S.A. IBM may not offer the products, services, or features discussed in this document in all countries. Consult your local IBM representative for information on the products and services currently available in your area. Any reference to an IBM product, program, or service is not intended to state or imply that only that IBM product, program, or service may be used. Any functionally equivalent product, program, or service that does not infringe any IBM intellectual property right may be used instead. However, it is the user's responsibility to evaluate and verify the operation of any non-IBM product, program, or service. IBM may have patents or pending patent applications covering subject matter described in this document. The furnishing of this document does not give you any license to these patents. You can send license inquiries, in writing, to: IBM Director of Licensing IBM Corporation North Castle Drive Armonk, NY 10504-1785 U.S.A. For license inquiries regarding double-byte (DBCS) information, contact the IBM Intellectual Property Department in your country/region or send inquiries, in writing, to:IBM World Trade Asia Corporation Licensing 2-31 Roppongi 3-chome, Minato-ku Tokyo 106-0032, Japan The following paragraph does not apply to the United Kingdom or any other country/region where such provisions are inconsistent with local law: INTERNATIONAL BUSINESS MACHINES CORPORATION PROVIDES THIS PUBLICATION "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF NON-INFRINGEMENT, MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. Some states do not allow disclaimer of express or implied warranties in certain transactions; therefore, this statement may not apply to you. This information could include technical inaccuracies or typographical errors. Changes are periodically made to the information herein; these changes will be incorporated in new editions of the publication. IBM may make improvements and/or changes in the product(s) and/or the program(s) described in this publication at any time without notice. Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product, and use of those Web sites is at your own risk. IBM may use or distribute any of the information you supply in any way it believes appropriate without incurring any obligation to you. Licensees of this program who wish to have information about it for the purpose of enabling: (i) the exchange of information between independently created programs and other programs (including this one) and (ii) the mutual use of the information that has been exchanged, should contact: IBM Corporation J46A/G4 555 Bailey Avenue San Jose, CA 95141-1003 U.S.A. Such information may be available, subject to appropriate terms and conditions, including in some cases payment of a fee. The licensed program described in this document and all licensed material available for it are provided by IBM under terms of the IBM Customer Agreement, IBM International Program License Agreement, or any equivalent agreement between us. Any performance data contained herein was determined in a controlled environment. Therefore, the results obtained in other operating environments may vary significantly. Some measurements may have been made on development-level systems, and there is no guarantee that these measurements will be the same on generally available systems. Furthermore, some measurements may have been estimated through extrapolation. Actual results may vary. Users of this document should verify the applicable data for their specific environment. Information concerning non-IBM products was obtained from the suppliers of those products, their published announcements, or other publicly available sources. IBM has not tested those products and cannot confirm the accuracy of performance, compatibility, or any other claims related to non-IBM products. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products. All statements regarding IBM's future direction or intent are subject to change or withdrawal without notice, and represent goals and objectives only. This information contains examples of data and reports used in daily business operations. To illustrate them as completely as possible, the examples include the names of individuals, companies, brands, and products. All of these names are fictitious, and any similarity to the names and addresses used by an actual business enterprise is entirely coincidental. COPYRIGHT LICENSE: This information contains sample application programs, in source language, which illustrate programming techniques on various operating platforms. You may copy, modify, and distribute these sample programs in any form without payment to IBM for the purposes of developing, using, marketing, or distributing application programs conforming to the application programming interface for the operating platform for which the sample programs are written. These examples have not been thoroughly tested under all conditions. IBM, therefore, cannot guarantee or imply reliability, serviceability, or function of these programs. You may copy, modify, and distribute these sample programs in any form without payment to IBM for the purposes of developing, using, marketing, or distributing application programs conforming to IBM's application programming interfaces. Each copy or any portion of these sample programs or any derivative work must include a copyright notice as follows: Outside In ((R)) Viewer Technology, (C)1992-2004 Stellent, Chicago, IL., Inc. All Rights Reserved. IBM XSLT Processor Licensed Materials - Property of IBM (C)Copyright IBM Corp., 1999-2004. All Rights Reserved. 5.1 Trademarks This topic lists IBM trademarks and certain non-IBM trademarks. The following terms are trademarks of International Business Machines Corporation in the United States, other countries, or both: IBM AIX AIX 5L DB2 DB2 Universal Database Domino Domino.doc Hummingbird Informix Lotus Lotus Notes Notes OmniFind POWER4 POWER5 RISC System/6000 Tivoli WebSphere Workplace xSeries z/OS The following terms are trademarks or registered trademarks of other companies: Java and all Java-based trademarks and logos are trademarks or registered trademarks of Sun Microsystems, Inc. in the United States, other countries, or both. Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. Intel, Intel Inside (logos), MMX and Pentium are trademarks of Intel Corporation in the United States, other countries, or both. UNIX is a registered trademark of The Open Group in the United States and other countries. Linux is a trademark of Linus Torvalds in the United States, other countries, or both. Other company, product or service names may be trademarks or service marks of others.