IFilterShop StarOffice/OpenOffice IFilter Server Edition Release 1.6 README

CONTENT OF README FILE


General Information

OpenOffice IFilter is a plug-in that allows Microsoft Search products and services to index StarOffice, OpenOffice and OASIS OpenDocument files, enabling customers to search and organize their content. OpenOffice IFilter extracts metadata and content from Writer, Math, Impress, Draw and Calc files.

OpenOffice IFilter supports Indexing Service, SharePoint Portal Server, SQL Server Full-Text Search, Window Search Service and all other products based on Microsoft Search technology.

OpenOffice IFilter indexes the following file types:
Writer 1.x.sxw, .stw
Writer 2.x and later.odt, .ott
Calc 1.x.sxc, .stc
Calc 2.x and later.ods, .ots
Draw 1.x.sxd, .std
Draw 2.x and later.odg, .otg
Impress 1.x.sxi, .sti
Impress 2.x and later.odp, .otp
Math 1.x.sxm, .stm
Math 2.x and later.odf, .otf
Writer Global Document 1.x.sxg, .stg
Writer Global Document 2.x and later.odm, .otm


System Requirements

OpenOffice IFilter supports the following Microsoft server operating systems:

OpenOffice IFilter supports the following Microsoft desktop operating systems:

OpenOffice IFilter supports the following Microsoft Search products


Information Retrieval

OpenOffice IFilter indexes text and the following metadata in StarOffice/OpenOffice suite files:

OpenOffice Metadata
(with namespace prefix*)
Property NameProperty TypeDescription
meta:generatorgeneratorVT_LPWSTRapplication used to create or last time modify the document
dc:titletitleVT_LPWSTRtitle of the document
dc:descriptiondescriptionVT_LPWSTRa textual outline of the content
dc:subjectsubjectVT_LPWSTRsubject of the document
meta:keywordkeywordsVT_LPWSTRkeywords of the document
meta:initial-creatorinitial-creatorVT_LPWSTRthe person who created the document initially
dc:creatorcreatorVT_LPWSTRthe person who last modified the document
meta:printed-byprinted-byVT_LPWSTRthe last person who printed the document
meta:creation-datecreation-dateVT_FILETIMEwhen the document was created initially
dc:datedateVT_FILETIMEwhen the document was last modified
meta:print-dateprint-dateVT_LPWSTRwhen the document was last printed
dc:languagelanguageVT_LPWSTRlanguage of the document
meta:user-defineduser-defined-<name>**VT_LPWSTRadditional user defined metadata

* All OpenOffice metadata with "meta:" prefix belong to OpenOffice.org XML namespace. All OpenOffice metadata with "dc:" prefix belong to Dublin Core namespace.

In accordance to Microsoft IFilter specification, OpenOffice IFilter defines each metadata as combination of Property Set and Property Name.
All Dublin Core properties belong to {DC099694-64F5-4371-9AA9-868846A5657E} Property Set GUID.
All OpenOffice.org XML properties belong to {00FF8547-08D2-4b02-99A1-F3A1E13EB3F8} Property Set GUID.

** For user-defined metadata Property Name is set as user-defined-<meta:name attribute value>. For example, user-defined metadata element <meta:user-defined meta:name="Info1">Value1</meta:user-defined> OpenOffice IFilter will output as "00FF8547-08D2-4b02-99A1-F3A1E13EB3F8/user-defined-Info1" property.

OpenOffice IFilter also outputs values of OpenOffice metadata as text content of StarOffice/OpenOffice file.


Office Document Properties

OpenOffice IFilter outputs the following standard Indexing Service properties as duplicates of certain OpenOffice.org XML and Dublin Core properties.

Property Friendly NameProperty Set GUIDProperty NameDescriptionOpenOffice element
DocAuthor{F29F85E0-4FF9-1068-AB91-08002B27B3D9}4Author of the documentmeta:initial-creator
DocComments{F29F85E0-4FF9-1068-AB91-08002B27B3D9}6Comments about the documentdc:description
DocCreatedTm{F29F85E0-4FF9-1068-AB91-08002B27B3D9}12Time document was createdmeta:creation-date
DocKeywords{F29F85E0-4FF9-1068-AB91-08002B27B3D9}5Keywords for the documentmeta:keyword
DocLastPrinted{F29F85E0-4FF9-1068-AB91-08002B27B3D9}11Time document was last printeddc:print-date
DocLastSavedTm{F29F85E0-4FF9-1068-AB91-08002B27B3D9}13 Time document was last saveddc:date
DocSubject{F29F85E0-4FF9-1068-AB91-08002B27B3D9}3Subject of the documentdc:subject
DocTitle{F29F85E0-4FF9-1068-AB91-08002B27B3D9}2Title of the documentdc:title


Installation Instructions

Setup file is a self-extracting archive that must be downloaded and opened on the machine where you wish to use StarOffice/OpenOffice IFilter.

  1. Stop all appropriate Search services.
  2. Uninstall any previous version of StarOffice/OpenOffice IFilter.
  3. Start setup file and follow the on-screen instructions.
  4. Start all appropriate Search services.
  5. Re-index catalogs containing OpenOffice Writer, Math, Impress, Draw and Calc files.


Additional Setup Steps

Some Microsoft Search products require additional setup steps as described below:

SharePoint Portal Server 2003:

  1. Open "Site Settings" web page
  2. In the "Search Settings and Indexed Content" section click on "Configure search and indexing"
  3. Click on "Include file types"
  4. Make sure that ".odt", ".odp" and all other StarOffice/OpenOffice file types you are planning to index are included

Office SharePoint Server 2007:

  1. Open Shared Services Provider Admin Site
  2. In the "Search" section click on "Search settings"
  3. Click on "File type inclusions"
  4. Make sure that ".odt", ".odp" and all other StarOffice/OpenOffice file types you are planning to index are included

Windows SharePoint Services 3.0:

  1. Open registry key "HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Applications\<WSS Server Name>\Gather\Search\Extensions\ExtensionList"
  2. Add ".odt", ".odp" and all other StarOffice/OpenOffice extensions to the list of indexable file types
  3. Restart Windows SharePoint Services Search

SharePoint Server 2010:

  1. In SharePoint Central Administration go to "General Application Settings" page
  2. In the "Search" section click on "Farm-Wide Search Administration"
  3. Click on " Search Service Application" link
  4. On the left side menu select "File Types"
  5. Make sure that ".odt", ".odp" and all other StarOffice/OpenOffice file types you are planning to index are included

Windows Desktop Search:

When integrated with Windows Desktop Search, OpenOffice IFilter uses temporary directory to process StarOffice/OpenOffice files. By default it uses system temporary directory. For Window Desktop Search versions 3.x and higher OpenOffice IFilter must be set to work with user specified temporary directory. To change temporary directory setting:

  1. Stop Windows Search service.
  2. Open registry key "HKEY_LOCAL_MACHINE\SOFTWARE\IFilterShop\SOFilter"
  3. Add new String value named "TempPath" and enter the full path to the new temporary directory. If this value is missing, empty or the path is not pointing to a valid directory, system temporary directory will be used. Please make sure that "Users" or "Authenticated Users" Group has "Full Control" permissions to the custom temporary directory.
  4. Start Windows Search service.
When using custom temporary directory, we recommend that you set it as "not indexable" in all your indexing products. Otherwise temporary files may be indexed. This will pollute the index and also can prevent temporary files from proper removal by OpenOffice IFilter.

Vista Search, Windows 7 Search:

Configure custom temporary directory as described in "Additional Setup Steps" for Windows Desktop Search section above.


How to Uninstall

If you ever have to uninstall StarOffice/OpenOffice IFilter application you can easily do it using any of the following methods:


Known Issues

StarOffice/OpenOffice files are not indexed in Windows Vista, Windows 7 or Windows Desktop Search

Custom temporary directory has to be configured as described in "Additional Setup Steps" for Windows Desktop Search section above.


Additional Information

What is Dublin Core?

Dublin Core is an initiative to create digital library metadata for the Web. Dublin Core is made up of 15 metadata (data that describes data) elements that offer expanded cataloging information and improved document indexing for search engine programs. Two forms of Dublin Core exist: Simple Dublin Core and Qualified Dublin Core. Simple Dublin Core expresses elements as attribute-value pairs using just the 15 metadata elements from the Dublin Core Metadata Element Set. Qualified Dublin Core increases the specificity of metadata by adding information about encoding schemes, enumerated lists of values, or other processing clues. While enabling searches to be more specific, qualifiers are also more complex and can pose challenges to interoperability. More information about Dublin Core may be found at http://dublincore.org.

Acknowledgements

This product includes software developed by the Apache Software Foundation (http://www.apache.org).


What's new in this version

Version 1.6

Version 1.5

Version 1.4

Version 1.3

Version 1.2:

Version 1.0:


Contact Information

WWW:
http://www.ifiltershop.com
E-mail:
support@ifiltershop.com