🚨 Disclaimer: This post will guide you through the steps to set up Solr Search as the backbone of a Drupal Search API. This setup is specifically for scenarios where Drupal is running locally on your machine using Docker through DDEV. I might mention some concept during this post, but since I am relatively new to this topic, they are likely just rip-off from Google or Perplexity, and might nto be correct


Intuition of Solr

To begin with, why Apache Solr ? why not use other options such as “Database Search (MySQL)” or “Elastic Search” ?

According to the offcial website for Apache Solr, it says:

Apache Solr is an open-source enterprise search platform built on Apache Lucene, designed for high-performance full-text search and indexing. It supports distributed indexing, replication, load balancing, and advanced search features like faceting, filtering, and spell checking. Solr is highly customizable and fault-tolerant, making it suitable for large-scale applications like e-commerce and content management systems

And I have used perplexity to compare between them, here are the results:

Feature or AspectApache SolrElastic SearchMySQL (Database)
PurposeFull-text Search EngineFull-text Search EngineRelational Structured Database
ArchitectureBuilt on Apache Lucene; Support dustribtued or non-dustributed modes via SolrCloud (standalone vs cloud mode)Real-time indexing with JSON-based queriesNo native full-text indexing; uses indexes for structured queries
Query LanguageSolr Query Parser; REST-like APIsJSON-based DSL for complex queriesSQL
ScalabilityRequires manual configuration (e.g., SolrCloud with ZooKeeper)Seamless horizontal scaling out-of-the-boxLimited horizontal scaling; better suited for vertical scaling
ProcessingBatch ProcessingReal-time ProcessingBatch Processing
PerformanceOptimized for read-heavy applications; good for complex queries but requires setup effortReal-time performance; excels in analytics and dynamic environmentsStrong for transactional operations but slower for complex searches
Use CaseBigher Website; E-commerce, CMS, large-scale text searchBigger Website; Real-time analytics, monitoring dashboardsSmall Website; CRUD operations, structured data storage

The reason solr search is more preferred than mysql (storing the index in database) is because:

  1. Performance and Scalability: Efficient Querying, Scalability

  2. Advanced Search Features: Full-text Search (stemming tokenization, phrase matching, wildcard) Faceted Navigation, Auto-complete and Spell Check

  3. Better Customization: Solr’s extensive configuration options allows the developer to tailor search behavior more precisely to the website’s need without requiring deep PHP/JAVA knowledge

  4. Decoupled, Reduced Database Load: By off-loading search operation to an external Solr server, Drupal website reduce the load on their primary db, improving site performance.

The reason solr search is more preferred than elastic search:

  1. Mature Ecosystem and Community Support in Drupal: Solr has a long standing presenceof being used by Drupal, and hence it is more well-integrated and offers out-of-the-box features with drupal, and the community overall does a much better job maintaining Search API Solr module comparing that of Elastic Search. (In the meanwhile some of its legacy features such as XML-based configruation align well with older system/drupal or infrastructure)
  2. Ease of Integration with Drupal: generally speaking, configuring Solr in Drupal is much easier comparing to that of Elastic search (elastic search may require additioanl effort to adapt its JSON-based query DSL), Solr’s integration with drupal is straightforward due to its compatibility with existing tools and workflows, for instance faceted search, highlighting, custom field weighting (out-of-the-box supported by Search API module)
  3. Static Data Handling : SOlr excels at handling static or less frequently updated data due to its ability to use an univerted reader for sorting and faceting, which improves performance, many goverment organisation uses solr with relatively static datasets, making solr a better fit

Demonstration

Step-1: Example Drupal Website using DDEV

To begin with let’s have an example drupal website up and running using ddev on our local computer:

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
# download drupal core (for my instance I'll use version 10.4.4)
composer create-project drupal/recommended-project:10.4.4 "install-dir" 

# rename the directory fron "install-dir" to "ddev_solr_example", and navigate into dir
mv install-dir ddev_solr_example && cd ddev_solr_example

# initialize ddev configuration (note that we are using web as the root)
ddev config --project-type=drupal --project-name="ddev-solr-example" --docroot=web --php-version=8.2

        Creating a new DDEV project config in the current directory (/Users/suowei_hu/Sites/ddev_solr_example) 
        Once completed, your configuration will be written to /Users/suowei_hu/Sites/ddev_solr_example/.ddev/config.yaml

        Configuring a 'drupal' project named 'ddev-solr-example' with docroot 'web' at '/Users/suowei_hu/Sites/ddev_solr_example/web'.
        For full details use 'ddev describe'. 
        Unable to set permissions inside container on settings files: '' 
        No settings.php file exists, creating one 
        Configuration complete. You may now run 'ddev start'. 
        
# start the project environment
ddev start 

# reveal the running services to show the details, and directly open it in browser
ddev describe && ddev launch

In the meanwhile I might also install and enable some modules that are of the common use:

1
2
3
4
5
ddev composer require 'drush/drush'
ddev composer require 'drupal/paragraphs'
ddev composer require 'drupal/admin_toolbar'
ddev composer require 'drupal/field_group'
ddev composer require 'drupal/twig_tweak'
1
2
3
4
5
6
7
8
ddev drush cr            #TEST DRUSH
ddev drush en paragraphs -y
ddev drush en admin_toolbar -y
ddev drush en admin_toolbar_search -y
ddev drush en media -y
ddev drush en media_library -y
ddev drush en field_group -y
ddev drush en twig_tweak -y

After that let’s initialize the example website using the guided interface to install drupal:

2025-03-14T200142

Then you should have your example drupal website ready:

2025-03-14T200527

Notice that it has already gotten a search bar at the top that are functionting, this is built using drupal core “Search” module;

2025-03-14T200639

image-20250314200649772The “Search” modules by itself is actually already functional to most if not all small sites, it provide full-text search functionality across content (node, taxonomy, users) have basic ranking of result based on relevant, and index the content using site’s database (almost alike “Search API” + “Database” Server option) it is simple to setup but have no support towards the advanced features like faceted navigation and auto-complete, and relies on the Drupal’s DB for indexing, it does not support real-time indexing (index is only established when the CRON job is ran), its relatively limited in customization ability for search behavior or result ranking as well… (And that is exactly why we need Solr)

Step-2: Installation of Search API

Next you’ll need to install search api module to the Drupal website, simply run the following:

1
2
ddev composer require 'drupal/search_api:^1.38'  # obviously feel free use a different version if you need to
ddev drush pm:install search_api -y

Now you might ask: what is the Search API Module? (Here goes perplexity again…)

  • Framework for Search: The Search API module acts as a framework to define how content is indexed and searched. It decouples the search logic from the backend, allowing you to use different search engines (e.g., database, Solr, Elasticsearch).

  • Customizable Indexing: You can define what fields to index, apply preprocessors (e.g., stemming, stop words), and control field weighting to influence search result rankings.

  • Integration with Views: The module integrates seamlessly with Drupal’s Views module, enabling you to create custom search result pages and filters. β€’ Advanced Features: β€’ Faceted navigation. β€’ Autocomplete suggestions. β€’ Highlighting search terms in results. β€’ Spell-checking and β€œDid you mean?” functionality.

So Solr search needs the interfaces and models provided by the Search API module, this includes how the content are indexed into Solr, what is the back abstraction, and connect the preprocessors for search behaviour (such as removing stop words from the query) with the same functions provided by Solr, etc.

Now, notice that when we go to the configuration page for the Search API, there’s an error complaining there’s no backend avaible for Search API, this is because:

  1. we have not got the solr search related plug-in/extension installed
  2. we have not got the solr server/service running in DDEV

2025-03-14T201906

Step-3: Configuration of (Solr) Server

So without further or due, let’s get the Solr Sever running and search api server configured.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
# add the ddev add-on for solr search (use `ddev get --list` to show all add-ons)
ddev get ddev/ddev-solr

# restart the ddev environment so that solr related service gets ran
ddev restart 

# reveal the details relating to the solr service
ddev descrive 

        β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
        β”‚ Project: ddev-solr-example ~/Sites/ddev_solr_example https://ddev-solr-example.ddev.site     β”‚
        β”‚ Docker platform: docker-desktop                                                              β”‚
        β”‚ Router: traefik                                                                              β”‚
        β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
        β”‚ SERVICE      β”‚ STAT β”‚ URL/PORT                                          β”‚ INFO               β”‚
        β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
        β”‚ web          β”‚ OK   β”‚ https://ddev-solr-example.ddev.site               β”‚ drupal11 PHP 8.2   β”‚
        β”‚              β”‚      β”‚ InDocker -> Host:                                 β”‚ Server: nginx-fpm  β”‚
        β”‚              β”‚      β”‚  - web:80 -> 127.0.0.1:54928                      β”‚ Docroot: 'web'     β”‚
        β”‚              β”‚      β”‚  - web:443 -> 127.0.0.1:54927                     β”‚ Perf mode: mutagen β”‚
        β”‚              β”‚      β”‚  - web:8025 -> 127.0.0.1:54926                    β”‚ Node.js: 22        β”‚
        β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
        β”‚ db           β”‚ OK   β”‚ InDocker -> Host:                                 β”‚ mariadb:10.11      β”‚
        β”‚              β”‚      β”‚  - db:3306 -> 127.0.0.1:54925                     β”‚ User/Pass: 'db/db' β”‚
        β”‚              β”‚      β”‚                                                   β”‚ or 'root/root'     β”‚
        β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
        β”‚ solr         β”‚ OK   β”‚ https://ddev-solr-example.ddev.site:8943          β”‚                    β”‚
        β”‚              β”‚      β”‚ InDocker:                                         β”‚                    β”‚
        β”‚              β”‚      β”‚  - solr:8983                                      β”‚                    β”‚
        β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
        β”‚ Mailpit      β”‚      β”‚ Mailpit: https://ddev-solr-example.ddev.site:8026 β”‚                    β”‚
        β”‚              β”‚      β”‚ Launch: ddev mailpit                              β”‚                    β”‚
        β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”Όβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
        β”‚ Project URLs β”‚      β”‚ https://ddev-solr-example.ddev.site,              β”‚                    β”‚
        β”‚              β”‚      β”‚ https://127.0.0.1:54927,                          β”‚                    β”‚
        β”‚              β”‚      β”‚ http://ddev-solr-example.ddev.site,               β”‚                    β”‚
        β”‚              β”‚      β”‚ http://127.0.0.1:54928                            β”‚                    β”‚
        β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”΄β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
     

For more information on how to configure this ddev/solr-search you might also wanna have a look on the add-on’s readme file: https://github.com/ddev/ddev-solr

Whenever we have the solr service running, we can install the “Search API Solr” (and the Search API Solr Admin module) and use that service as the backend server for our search API:

1
2
3
ddev composer require 'drupal/search_api_solr:^4.3'
ddev drush pm:install search_api_solr
ddev drush pm:install search_api_solr_admin #(this module helps us to config solr from drupal

Here we want to choose (example configuration screenshot: link):

  1. Solr Connector: Solr Cloud with Basic Auth
  2. Solr node: solr (found in ddev describe’s url/port column
  3. Solr port: 8983 (found in ddev describe’s url/port column
  4. Default Solr collection: example-solr-collection (choose however you like, the collection will be created later when you run the drush command
  5. HTTP Basic Auth (default username/password by ddev/solr-search
    • Username: solr
    • Password: SolrRocks

When complete and click on “Save”, this is what you’ll see

2025-03-14T203828

This is because the collection have not been created, to do that you will need to either click on the “Upload Configset” button, that uses the admin module we just installed to created the collections OR run the following drush command:

1
2
ddev drush --numShards=1 search-api-solr:upload-configset SEARCH_API_SERVER_ID
# SEARCH_API_SERVER_ID = MACHINE NAME OR THE SEARCH SERVER YOU JUST CREATED IN DRUPAL

(more instruction to be found at: https://github.com/ddev/ddev-solr?tab=readme-ov-file#drupal-and-search-api-solr

2025-03-14T204311

2025-03-14T204655

Step-4: Configuration of Index

Now we have our search “server” ready to go, we need to configure the “index”, here I might introduce some explanations for these terminologies to help you with the understanding:

A β€œserver” in Search API refers to the backend system or search engine where data is stored, indexed, and queried. It acts as the engine powering search functionality.

An β€œindex” in Search API defines what content from your Drupal site should be indexed (stored) in the server and how it is structured for search purposes.

Here we’ll just index the “recipe” content type that have some example instances by default:

2025-03-14T205202

Here we have chosen to index the fields: Title, Summary, Tags (taxonomy)

2025-03-14T205440

However, when you finish the setup by clicking “Save changes”, this is what you might be seeing: “0/xx indexed status”, this is because the indexing is triggered only when the CRON job is ran unless configured otherwise:

2025-03-14T205607

Now all the “article” nodes will be indexed and stored into the solr server:

2025-03-14T210122

Step-3: Create Search Page using View

Lastly, I would want to consume these indexes using a page, notice instead of regular drupal entities/node such as “content”, “user” or “taxonomy” I’ve chosen “Index Article (Solr Index)”, that is the “index” we just created:

image-20250314210912365

(note that when you click on save, you might see a warning message that says “The selected caching mechanism does not work with views on Search API indexes. Use one of the Search API-specific caching options. The selected caching mechanism was changed accordingly for the view *Search Recipe (Solr)*.” this is because caching for search is different then that of regular drupal search pages, you can find the relavtn settings under “Advanced > Caching”

link

I’ll have “full-text search” as the exposed filter for user to enter their search query; Title, summary, relavnce score (generated by search api or solr when you perform the search) as the fields to display; And order the items by their relevancy score:

2025-03-14T211728

Now as you can see below, when I search for the word delicious two result comes up, one with the exact spelling delicious one is the adjective version of it deliviously, and they relevancy score differs slightly because of this:

2025-03-14T212059

As we come to an end, I’ll show off some super powers of having solr search in the equation, for instance you can do:

Excerpt

  • by ticking “Highlight” in preprocessor tab of the index’s configuration (screenshot)
  • and adding the “Excerpt” field to the view (screenshot)

2025-03-14T212447

Auto-Complete

  • install the “Search API Auto-complete” and “Search API Solr Auto-complete” modules (screenshot)
  • enable it in the “auto-complete” tab in “search api” configuration(screenshot)

2025-03-14T213203

Facets / Laywerd Filtering

  • install and enable the “Facets” module (screenshot)
  • create new facets using index field"Tag" (screenshot)
  • place the facet block component in sidebar (screenshot)
  • turn on replacing “value/key” with label (screenshot)

2025-03-14T214416

(optionally you can also have the filter inside the view body via “Facets Better Exposed” module that are dependent on the “Better Exposed Filter” module

Searching within Attachment or Document (such as PDF, DOCX, TXT, CSV)

  • install and enable the “Search API Attachment” Module (link)
  • configure the module to use the “Solr extractor” (screenshot)
  • enable “file attachment option” in your index processors options (screenshot)
  • configure the search api index to include field “Search api attachments: YOUR-FILE-FIELD-NAME” (screenshot)
  • change the type for “search api attachment: YOUR-FILE-FILELD-NAME” from “string” to “full-text” (screenshot)
  • reindex via running cron and searc in your view (no special view configuration required)

image-20250317102639393

(for more information you can refer to the ReadMe.md file of the project: https://git.drupalcode.org/project/search_api_attachments


Reference