Migrating a site from Solr to Elasticsearch
When upgrading to CrafterCMS 4.0, if your site(s) were built to use Solr, you need to update their code to use Elasticsearch.
Updating to Elasticsearch
To update your site to use Elasticsearch instead of Solr you can follow these steps:
Overwrite the target in the Deployer to use Elasticsearch instead of Solr
Index all existing content in Elasticsearch
Find all references to searchService in your FreeMarker templates and replace them with the Elasticsearch client
Find all references to searchService in your Groovy scripts and replace them with the Elasticsearch client
Delete the unused Solr core if needed (can be done using the Solr Admin UI or the data/indexes folder)
Update craftercms-plugin.yaml to use Elasticsearch as the search engine
Overwrite the target
For authoring environments:
curl --request POST \
  --url http://DEPLOYER_HOST:DEPLOYER_PORT/api/1/target/create \
  --header 'content-type: application/json' \
  --data '{
    "env": "preview",
    "site_name": "SITE_NAME",
    "template_name": "local",
    "repo_url": "INSTALL_DIR/data/repos/sites/SITE_NAME/sandbox",
    "disable_deploy_cron": true,
    "replace": true
  }'
For delivery environments:
curl --request POST \
  --url http://DEPLOYER_HOST:DEPLOYER_PORT/api/1/target/create \
  --header 'content-type: application/json' \
  --data '{
    "env": "default",
    "site_name": "SITE_NAME",
    "template_name": "remote",
    "repo_url": "INSTALL_DIR/data/repos/sites/SITE_NAME/published",
    "repo_branch": "live",

    ... any additional settings like git credentials ...

    "replace": true
  }'
Note
For a detailed list of parameters see Create Target
The create target operation will also create the new index in Elasticsearch.
Index all site content
To reindex all existing content execute the following command:
curl --request POST \
  --url http://DEPLOYER_HOST:DEPLOYER_PORT/api/1/target/deploy/ENVIRONMENT/SITE_NAME \
  --header 'content-type: application/json' \
  --data '{
    "reprocess_all_files": true
  }'
Update the site code
Because both Solr and Elasticsearch are based on Lucene, you will be able to keep most of your queries unchanged; however, features like sorting, facets, and highlighting will require code changes.
Note
To take full advantage of Elasticsearch features, it is recommended to replace query strings with other types of queries provided by the Elasticsearch DSL.
Warning
If you are using any customizations or advanced features from Solr, you will need to find an alternative in Elasticsearch.
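For example, a Solr field facet is typically replaced with an Elasticsearch terms aggregation; the following is a minimal sketch, assuming the elasticsearchClient variable used in the examples below and a hypothetical content-type keyword field:

// Hypothetical sketch: a terms aggregation as the Elasticsearch alternative
// to a Solr field facet (field name and aggregation name are examples only)
def result = elasticsearchClient.search(r -> r
    .query(q -> q.matchAll(m -> m))
    .size(0)                                  // only the facet counts are needed
    .aggregations('content_types', a -> a
        .terms(t -> t
            .field('content-type')
        )
    ),
Map)

// Each bucket holds a facet value and its document count
def buckets = result.aggregations().get('content_types').sterms().buckets().array()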
To update your code there are two possible approaches:
Examples
This is a basic example of replacing the Crafter Search service with the Elasticsearch client:
1def q = "${userTerm}~1 OR *${userTerm}*"
2
3def query = searchService.createQuery()
4query.setQuery(q)
5query.setStart(start)
6query.setRows(rows)
7query.setParam("sort", "createdDate_dt asc")
8query.setHighlight(true)
9query.setHighlightFields(HIGHLIGHT_FIELDS)
10
11def result = searchService.search(query)
12
13def documents = result.response.documents
14def highlighting = result.highlighting
Using the Elasticsearch client, the code will look like this:
import co.elastic.clients.elasticsearch._types.SortOrder

def q = "${userTerm}~1 OR *${userTerm}*"

// Execute the query
def result = elasticsearchClient.search(r -> r
    .query(qb -> qb
        .queryString(s -> s
            .query(q as String)
        )
    )
    .from(start)
    .size(rows)
    .sort(s -> s
        .field(f -> f
            .field('createdDate_dt')
            .order(SortOrder.Asc)
        )
    )
    .highlight(h -> {
        HIGHLIGHT_FIELDS.each { field ->
            h.fields(field, f -> f)
        }
        return h
    }),
Map)

// Elasticsearch response (highlight results are part of each hit object)
def documents = result.hits().hits()
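To build something equivalent to the old highlighting map, the fragments can be collected from the individual hits; a minimal sketch, assuming the documents list returned above:

// Each hit exposes both the document source and its highlight fragments
def sources = documents.collect { it.source() }
def highlighting = documents.collectEntries { hit -> [(hit.id()): hit.highlight()] }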
For additional information you can read the official Java Client documentation and DSL documentation.
Notice that in the given example the query string didn't change; you only need to update the code that builds and executes the query. However, Elasticsearch provides new query types and features that you can use directly from your Groovy scripts.
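For instance, the fuzzy/wildcard query string above could be replaced with a structured match query; a minimal sketch, assuming the same userTerm variable and a hypothetical content_html field:

// Hypothetical sketch: a match query with automatic fuzziness instead of
// the "term~1 OR *term*" query string (field name is an example only)
def result = elasticsearchClient.search(r -> r
    .query(q -> q
        .match(m -> m
            .field('content_html')
            .query(userTerm as String)
            .fuzziness('AUTO')
        )
    ),
Map)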
If any of your queries include date math for range queries, you will also need to update them to use the Elasticsearch date math syntax.
Example
Solr:
createdDate_dt: [ NOW-1MONTH/DAY TO NOW-2DAYS/DAY ]
Elasticsearch:
createdDate_dt: [ now-1M/d TO now-2d/d ]
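The same filter can also be expressed as a range query in the DSL instead of a query string; a minimal sketch, assuming the Java API Client bundled with CrafterCMS 4.0:

import co.elastic.clients.json.JsonData

// Sketch: the date-math filter above expressed as a range query
def result = elasticsearchClient.search(r -> r
    .query(q -> q
        .range(rq -> rq
            .field('createdDate_dt')
            .gte(JsonData.of('now-1M/d'))
            .lte(JsonData.of('now-2d/d'))
        )
    ),
Map)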
In Solr there were two special fields, _text_ and _text_main_: during indexing, the values of other fields were copied into them to provide a simple way to run generic queries across all relevant text. Elasticsearch provides a different feature that replaces those fields, the Multi-match query.
Example
Solr:
_text_: some keywords
Elasticsearch:
.multiMatch(m -> m
    .query('some keywords')
)
Elasticsearch also makes it possible to query fields by suffix using wildcards:
.multiMatch(m -> m
    .query('some keywords')
    .fields('*_t', '*_txt', '*_html')
)
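Putting it together, a complete search call using such a multi-match query could look like this (a sketch, reusing the elasticsearchClient variable from the earlier example):

// Sketch: complete search request using a multi-match query across the
// common CrafterCMS text field suffixes
def result = elasticsearchClient.search(r -> r
    .query(q -> q
        .multiMatch(m -> m
            .query('some keywords')
            .fields('*_t', '*_txt', '*_html')
        )
    )
    .size(10),
Map)

def documents = result.hits().hits()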
Update “craftercms-plugin.yaml” to use Elasticsearch
Your site has a craftercms-plugin.yaml file that contains information for use by CrafterCMS. We'll have to update the file to use Elasticsearch as the search engine.
Edit your craftercms-plugin.yaml and remove the following property:
searchEngine: CrafterSearch
Make sure to commit your changes to craftercms-plugin.yaml.
Migrating a site from the previous Elasticsearch client
Since 4.0.0
CrafterCMS 4.0 provides two different Elasticsearch clients. This is because Elasticsearch has released a new Java API Client to replace the Rest High Level Client, and during the transition period both will work. If you are upgrading from CrafterCMS 3.1 and your site already uses Elasticsearch, it will continue to work with some small changes, but it is highly recommended to migrate to the new client to avoid issues in future releases, when the Rest High Level Client is completely removed.
Migrating to the new Elasticsearch client should not require too much effort:
If the existing code uses the builder classes, you will need to replace them with the equivalents in the new client.
Example:
import org.elasticsearch.action.search.SearchRequest

import static org.elasticsearch.index.query.QueryBuilders.boolQuery
import static org.elasticsearch.index.query.QueryBuilders.matchQuery
import static org.elasticsearch.search.builder.SearchSourceBuilder.searchSource

def query = boolQuery()
query.should(matchQuery('content-type', '/component/article'))

def builder = searchSource().query(query)
def searchResponse = elasticsearch.search(new SearchRequest().source(builder))
If the existing code uses the map-based DSL, it only needs to be replaced with the new lambda structure.
Example:
def searchResponse = elasticsearch.search([
query: [
bool: [
should: [
[ match: [ 'content-type': '/component/article' ] ],
]
]
]
])
For both cases, the equivalent code using the new Elasticsearch Java API Client is the same:
def searchResponse = elasticsearchClient.search(r -> r
    .query(q -> q
        .bool(b -> b
            .should(s -> s
                .match(m -> m
                    .field('content-type')
                    .query(v -> v
                        .stringValue('/component/article')
                    )
                )
            )
        )
    ),
Map.class)
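Reading the results is done through the hits; a minimal sketch, assuming documents are returned as Map as in the call above:

// Sketch: extract the matching documents and the total hit count
def hits = searchResponse.hits().hits()
def articles = hits.collect { it.source() }
def total = searchResponse.hits().total().value()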
For additional information about the new client, you can read the official documentation.