The Publish to SolrCloud CDH publisher will post documents to a SolrCloud index using a SolrJ Library. SolrJ has a CloudSolrClient class to communicate with SolrCloud. Instances of this class communicate with Zookeeper to discover Solr endpoints for SolrCloud collections and then perform requests.
This version of the publisher uses two approaches of connection:
For the Kerberos Authentication, this publisher uses a UserGroupInformation, which is a privileged action and the keytab, principal and hadoop core-site.xml to authenticate with Kerberos. It also uses a custom KerberizedHttpClient to perform the request.
Some of the features of the Publish to SolrCloud publisher include:
Customizable feed to the Solr index by editing the XSLT file
Specify the Zookeeper server and port
Is connector independent
XSL transformations