Content Component

Content Component is an ACI server. For details of changes that affect all ACI servers, see ACI Server Framework.

23.3.0

New in this Release

  • The handling of reasons has been improved to merge overlapping reasons. For example, the query text James Watt" DNEAR Jr previously gave the reasons James Watt and Watt Jr. It now returns the single reason James Watt Jr.

  • The efficiency of suggesting spelling corrections has been improved. This change gives particular improvements when UnstemmedMinDocOccs is configured to a value less than the current SpellCheckCorrectMinDocOccs setting.

  • Several updates and improvements have been made to the BIAS FieldText operators:

    • The new BIASRANGE operator has been added. This operator allows you to bias the score of results that fall within a particular date range. It also allows you to reduce the score bias for values within a specified range outside this optimum range. For example:

      BIASRANGE{21/08/2011,25/08/2011,172800,86400,10}:DATE

      This example boosts the score by 10% for documents with a DATE field value in the range 21/08/2011 to 25/08/2011 (inclusive). It gives a smaller boost (on a linear scale) for documents within 172800s (two days) before 21/08/2011, and 86400s (one day) after.

    • The new BIASNRANGE operator has been added. this operator allows you to bias the score of results that contain a value within a specified range in a specified field, and to reduce the score bias linearly for values within a specified range outside this optimum range. For example:

      FieldText=BIASNRANGE{100,150,20,40,10}:*/PRICE

      A document whose PRICE field value is between 100 and 150 has its weight increased by 10%. This boost decreases linearly to 0% at 80 and lower, and 190 and higher.

    • The BIASVAL operator now supports an empty value for its first argument. For example, BIASVAL{,10}:COLOUR applies a score boost to any result document that does not have a COLOUR field, or has a COLOUR field with an empty value.

      NOTE: BIASVAL still requires two arguments, so BIASVAL{10}:COLOUR is not valid syntax.

    • You can now use all BIAS field specifiers in FieldTextField fields for use with AgentBoolean queries (that is, BIAS, BIASDATE, BIASDISTCARTESIAN, BIASDISTSPHERICAL, BIASVAL, BIASRANGE, and BIASNRANGE are now supported for AgentBoolean queries).

  • You can now use an open-ended range in the NRANGE field operator by setting one of the values to a period (.). For example NRANGE{.,5}:NUM means that the NUM field must contain a value of 5 or less.

  • The GetQueryTagValues value response when DocumentCount is set to True now includes the total number of occurrences for each value in the server.

Resolved Issues

  • When used in conjunction with the WHEN operator in XML full-structure mode, the TERM and TERMEXACT FieldText specifiers failed to return some documents that should have matched.

  • The indexer thread could be blocked for an extended time when attempting to delete a file, if the target had been removed in the meantime by an external process.

  • When rebuilding the unstemmed index with RegenerateUnstemmedIndex, numeric/alphanumeric terms were sometimes excluded, regardless of the configured IndexNumbers value.

  • The Content component NiFi processor, ContentServiceImpl was unable to obtain a license correctly.

23.2.0

New in this Release

  • Loading has been optimized for ACLType fields that have also been configured as MemcachedType (see NodetableCacheFields).

    NOTE: This change is only relevant to security models where the DLL load is required for evaluation.

  • The QueryCacheMaxMemKB configuration parameter has been added to the [Security] section. Set this parameter to a value in KB to enable a per-query cache that speeds up security checks for cases where there are many non-unique ACLs in the system (for example, where security is inherited from a top level folder). If the same ACL has already been evaluated during the query, Content does not need to call the security DLL again. You can set QueryCacheMaxMemKB to -1 for an unlimited cache size, or 0 to disable the cache.

    NOTE: This change is only relevant to security models where the DLL load is required for evaluation (for example, there is no need to use this parameter with NT_V4 security).

Resolved Issues

  • In some cases, Content failed to return hits for terms that existed only in the index cache and not in any indexed documents when SearchUncommittedDocuments was set to True.

  • Content could spuriously log an error "Dynterm list is NULL for term". This error tended to happen for terms with a large number (millions) of occurrences, in servers where documents were regularly deleted and the index compacted.

  • When the Active Directory contained a group name that ends with a space character, the Content security index could become invalid after the component was restarted.

  • When the saved best terms cache file was non-valid, the Content application could shut down during a DRECOMPACT operation. Content now automatically rebuilds the cache if it cannot load the saved file.