Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

 

The Publish to Azure Search publisher will post documents to an Azure Search index through httpsthrough 

https://<server>/indexes/<index>/docs/index?api-version=<apiVersion>

as described by Azure Search in the Bulk API.

Panel
titleOn this page

Table of Contents

Features

Some

Features of

the features of

the Publish to Azure Search publisher include:

Customization of the feed for the Azure Search
  • You can customize the Azure Search feed by editing the Groovy script.
Is
  • It is connector independent.
Runs
  • It runs from any machine with access to the Azure Search Cloud Service.


Limitations 


The publisher

itself

is

subjected

subject to the following Azure Search Service limitations.

:

  • Index schema - Certain properties of your index schema can
only
  • be set only once
and
  • .
    • They cannot be updated in the future.
Because of this
    • Therefore, any schema updates that
would
    • require re-indexing (such as changing field types) are not currently possible after the initial configuration.
So be sure that
    •  
      Important:  Make sure your index has all of the required fields with the correct configuration before saving it.
Documents Keys

  • Document keys - can only contain letters, digits, underscore ( _ ), dash ( - ), or equal sign ( = ).
Documents
  • Document field names - must
only
  • start with a
letters
  • letter and contain only
letter
  • letters, digits, or underscore.
The maximum size of a batch

  • Batch size - the maximum is 16 MB
, since you
  • . You can pass a batch of multiple documents to the Index API all at once, so the size limit per document
actually
  • depends on how many documents are in the batch. 
    • If the batch size exceeds 16 MB, the publisher will
try
    • attempt to split the batch and truncate the content field of the document (if exists)
in order
    • to reduce the size
,
    • The maximum document size is 16 MB of JSON for a batch
with
    • containing a single document, 
      Note:  If the
maximum
    • document size
is 16 MB of JSON, if the document size
    • still exceeds 16 MB
even after
    •  after truncating the content field,
it will impossible to publish it without doing modifications to
    • publishing will not be possible without modifying the Groovy Transformation file. 
Maximum

  • Maximums
    • 1000 documents per batch of index uploads, merges, or deletes.
Maximum
    • 32 fields in the $orderby clause in the schema.
Maximum field size for Filterable, Sortable, Facetable or Searchable fields is
    • 32,766
bytes
    •  bytes (32 KB minus 2 bytes) of UTF-8 encoded text for the field size for the Filterable, Sortable, Facetable, and Searchable fields.

  • Is not possible to
clean
  • delete all documents of
a
  • an index directly
, so if the index needs to be cleaned, deleting
  • .
    • If necessary, delete and re-
creating the index is the only way. The publisher will never do this
    • create the index. 
    • If multiple full crawls are executed, data of previous crawls will still exist inthe index.
      Note:  The publisher is unable to delete all documents of an index directly (since it does not "know" the how the schema is configured in order to recreate it
, so if multiple full crawls are executed, data of previous crawls will still exists on the index
    • ).

More information

at https://docs.microsoft.com/en-us/azure/search/search-limits-quotas-capacity and https://docs.microsoft.com/en-us/azure/search/search-create-index-portal