New versions of this module and support for Omeka S version 3.0 and above are available on GitLab, which seems to respect users and privacy better than the previous repository.
Advanced Search is a module for Omeka S that improves the standard search (visibility, thumbnails, starts with, resources without templates, search in multiple properties at a time, etc.) and that adds search capabilities to the public interface of Omeka S, in particular auto-completion, filters, facets, and aggregated fields querying.
These features are progressively integrated in the Omeka core.
Furthermore, it provides a common interface for other modules to extend it (forms, indexers, queriers). It can be displayed as a block on any page too. Besides, it adds some features to the standard advanced search form.
Here is a live example:
It can be extended in two ways:
The default form answers to most of the common needs. It can be configured in the admin interface to make it a basic form à la Google, or to build a complex form with or without auto-suggestion, advanced filters, sort fields, facets, collection selector, resource class selector, resource template selector, properties filters with various input elements, like numbers or date ranges.
An internal adapter is provided too. It uses the internal sql api of Omeka to search resources. So the search engine is the sql one, without indexer, so it is limited strictly to the request like the standard Omeka S search engine (no wildcards, no management of singular/plural, etc.). Nevertheless, it provides the facets to improve the results (requires the module Reference).
An adapter is available for Solr, one of the most common search engine.
For the standard advanced form, it adds some fields to the advanced search form to make search more precise.
Added fields are:
Moreover, it adds new search query operator for properties (some are available only via api, not in the advanced search form for now):
eq
/neq
: is or is not exactly (default Omeka)in
/nin
: contains or does not contains (default Omeka)sw
/nsw
: starts or does not start withew
/new
: ends or does not end withnear
/nnear
: is or is not similar to (algorithm Soundex, based on British English phonetic)ma
/nma
: matches or does not matchlist
/nlist
: is or is not in list (api only)res
/nres
: has resource or has no resource (default Omeka)resq
/nresq
: is or is not resource matching querylex
/nlex
: is or is not a linked resourcelres
/nlres
: is or is not linked with resource #idlkq
/nlkq
: is or is not linked with resources matching queryex
/nex
: has any value or has no value (default Omeka)exs
/nexs
: has or has not a single valueexm
/nexm
: has or has not multiple valuestp
/ntp
: has or has not main type (literal-like, resource-like, uri-like)tpl
/ntpl
: has or has not type literal-liketpr
/ntpr
: has or has not type resource-liketpu
/ntpu
: has or has not type uri-likedtp
/ndtp
: has or has not data typelt
: lower thanlte
: lower than or equalgte
: greater than or equalgt
: greater than<
: lower than≤
: lower than or equal≥
: greater than or equal>
: greater thandup
and variants: has duplicate values, linked resources, uris, types and languages
The variants allows to check duplicate for simple values only, linked
resources only, uris only, including or not types or languages.Warning: With the internal sql engine, comparisons are mysql comparisons, so alphabetic ones. They works for string and four digit years and standard dates, not for numbers nor variable dates.
Furthermore:
dcterms:creator or dcterms:contributor are equal to value "Anonymous"
.not
that can be use to invert the query. For example,
"and property dcterms:title not equals 'search text'" is the same than "not property dcterms:title equals 'search text'".
It avoids to display half of the complex query types to the user.datatype
to filter a property query by datatype. For example,
"and property dcterms:subject equals 'subject' with datatype 'customvocab:1'".0
, for example item_set_id=0
.sort_by=ids
. The list of ids can be set in keys
id
or sort_ids
as an array or as a comma-separated list.Finally, an option allows to display only the used properties and classes in the advanced search form, with chosen-select.
Warning: the improvements done on query argument "property" were moved to "filter" and will be removed in a future version. So use "filter" instead of "property".
This module is dependant of module Common, that should be installed first.
The module uses an external library jQuery-Autocomplete, so use the release zip to install it, or use and init the source.
Download the last release [AdvancedSearch.zip] from the list of releases (the
master does not contain the dependency), and uncompress it in the modules
directory.
If the module was installed from the source, rename the name of the folder of
the module to AdvancedSearch
, and go to the root module, and run:
composer install --no-dev
See general end user documentation for installing a module.
The default search engine is automatically added to the sites.
The main admin menu Search
allows to manage the search engines and the search
configs: an instance of Omeka can contain multiple engines, for example to hide
some fields in the public front-end, and multiple configs or pages, for example
a single field search and an advanced search with filters, or different
parameters for different sites or different resource types (items or item sets).
An engine and a page for the internal adapter are automatically prepared during install. This search engine can be enabled in main settings and site settings. It can be removed too.
To create a new config for a page with a search engine, follow these steps.
Create an engine
Internal
or whatever you want, using the
Internal
adapter. The engine can be set for items and/or item sets.Create a config for a page
Internal search
or whatever you want, a path to
access it, for example search
or find
, the engine that was created in
the previous step (Internal
here), and a form adapter (Main
) that
will do the mapping between the form and the engine. Forms added by
modules can manage an advanced input field and/or filters.item_set_id
, resource_class_id
,
and resource_template_id
display a select by default. You may have to
use Omeka/Select
, Omeka/MultiCheckbox
, Thesaurus
, or variants to
get option values automatically.
Note that some indexers may have fields that seem duplicated, but they
aren’t: some of them allow to prepare search engines and some other
facets or sort indexes. Some of them may be used for all uses. This is
not the case for the internal indexer, that is a simpler search engine
based on the omeka sql database.
For example, you can use dcterms:type
, dcterms:subject
, dcterms:creator
,
dcterms:date
, dcterms:spatial
, dcterms:language
and dcterms:rights
as facets, and dcterms:title
, dcterms:date
, and dcterms:creator
as
sort fields.In admin and site settings
https://example.com/s/my-site/search
or https://example.com/admin/search
in this example.The search form should appear. Type some text then submit the form to display the results as grid or as list. The page can be themed.
IMPORTANT
The Search module does not replace the default search page neither the default search engine. So the theme should be updated.
A default configuration is provided. It includes all the current features, so you generally only have to remove the one you don't need or the one useless with your data.
A search form may have many parameters. They don't need to be all filled.
Some features are complex, so they have their own config form (autosuggestion for now, and, in a future version, advanced form and facets).
The search engine of a config should not be changed, because the keys may be different.
Currently, two search engines are supported: the default sql and Solr through the module Search Solr.
Example of a direct url for Solr (should be configured first): http://example.com:8983/solr/omeka/suggest?suggest=true&suggest.build=true&suggest.dictionary=mainSuggester&suggest.count=100
.
The query param should be suggest.q
.
Filters are used before the querying. Any field can be added.
In the text area, each line is a filter, with a name, a label, a type and
options, separated with a =
.
For advanced filters, similar to the Omeka ones, use "advanced" as field name and type.
The format to fill facets is "ini", so set a section name between "[]", then each param of the facet. For example:
[subjects]
field = "dcterms:subject"
label = Subjects
sort_by = "total"
display_list = "available"
display_count = true
[template]
field = "resource_template_id"
type = Select
label = Resource types
sort_by= "values"
values = "Text|Image|Audio|Vidéo"
display_count = true
[date]
field = "dcterms:date"
type = Range
label = Year
min = 1789
max = 1804
The section is a unique name.
Keys are: field
, label
, type
, order
, limit
, state
, more
, languages
, data_types
,
main_types
, values
, display_count
, and specific options, like thesaurus
.
Multi-valued keys can be set as ini (key ending with [] or .xxx) or as a string with multiple values separated with a "|".
Only the key "field" is required.
min = 1789
and max = 1804
. For Range, the option "step" can be set too.
With Solr, it works only with date and numbers.thesaurus
,
with the id.languages[] = fr
or as a string separated with |
, like
languages[] = fr|en|
. The use of a comma to separate values is deprecated
and will be removed in a future version.
The module is provided with an adapter for mysql. In order to get facets working, you need the module Reference.
The fields that are added to the advanced search form are available in the api and some other ones are available too.
datetime
, that is a list of arrays with keys field
("created" or "modified"),
joiner
("and" or "or), a type
("lt", "lte", "eq", "gte", "gt", "neq", "ex", "nex")
and a value ("2021-08-23 12:34:56"), partial or not.resource_class_term
, the term can be a single class term or a list. It is case sensitive for now.has_media
for items.has_original
for items and medias.has_thumbnails
for items and medias.item_set_id
for medias.media_types
for items.0
, for example, in a url, resource_template_id=0
.To exclude properties to search in, use key except
. For example, to search
anywhere except in "bibo:content", that may contain ocr or full text, use this
api query https://example.org/api/items?property[0][except]=bibo:content&property[0][type]=in&property[0][text]=text to search
, or in internal api:
$query['filter'][] = [
'join' => 'and',
'field' => '',
'except' => $excludedFields,
'type' => 'in',
'val' => "text to search",
];
// With property (deprecated).
$query['property'][] = [
'joiner' => 'and',
'property' => '',
'except' => $excludedFields,
'type' => 'in',
'text' => "text to search",
];
The excluded fields may be one or multiple property ids or terms.
The title cannot be excluded currently, because it is automatically added by the core.
The visibility check may not working if the api url contains &is_public=&
:
is_public
must not be a empty string. See the patch in https://github.com/omeka/omeka-s/pull/1671.
This patch is integrated in module only for url, and for call to internal api.
This fix has been integrated in Omeka v3.1.
The search engine can be used not only for the classical search page, but for any other views too where you the search of items need to be done quickly, for example the block layouts with a big database (more than 10000 to 100000 items, according to your server and your collections).
To use this feature, a config should be created with the form Api
. This form
is not a true form, but it allows to map the Omeka metadata and properties with
the fields indexed by the search engine. It allows to define a max number of
results too, that is used when no paginator is enable. The argument limit
cannot go further.
When ready, the api search is available via multiple means.
index=1
as query in the block layouts that use it, like Browse preview.$this->api()->search()
with the value 'index' => true
appended to the argument $data
or $options
(recommended when possible to
avoid to mix the query and the parameters).$this->apiSearch()
,
and $this->apiSearchOne()
, that have the same arguments than $this->api()->search()
and $this->api()->searchOne()
. The result is an Omeka Response.$this->apiSearch()
and $this->apiSearchOne()
./s/mysite/api_search_page
.Note that some features may be not available in the external search engine. In particular, some events are not triggered.
The indexation of items and item sets is automatic and all new metadata can be searched in the admin board. Note that there may be a cache somewhere, and they may be not searchable in the public sites.
So when the item pool of a site or the item sets attached to it are modified, a manual reindexation should be done in the Search board. This job can be done via a cron too (see your system cron).
Furthermore, there may be an indexation delay between the moment when a resource is saved and the moment when it is fully available in the search engine (it may be some minutes with Solr, according to your configuration).
'index' => false
).Use it at your own risk.
It’s always recommended to backup your files and your databases and to check your archives regularly so you can roll back if needed.
See online issues on the module issues page on GitLab.
This module is published under the CeCILL v2.1 license, compatible with GNU/GPL and approved by FSF and OSI.
This software is governed by the CeCILL license under French law and abiding by the rules of distribution of free software. You can use, modify and/ or redistribute the software under the terms of the CeCILL license as circulated by CEA, CNRS and INRIA at the following URL "http://www.cecill.info".
As a counterpart to the access to the source code and rights to copy, modify and redistribute granted by the license, users are provided only with a limited warranty and the software’s author, the holder of the economic rights, and the successive licensors have only limited liability.
In this respect, the user’s attention is drawn to the risks associated with loading, using, modifying and/or developing or reproducing the software by the user in light of its specific status of free software, that may mean that it is complicated to manipulate, and that also therefore means that it is reserved for developers and experienced professionals having in-depth computer knowledge. Users are therefore encouraged to load and test the software’s suitability as regards their requirements in conditions enabling the security of their systems and/or data to be ensured and, more generally, to use and operate it in the same conditions as regards security.
The fact that you are presently reading this means that you have had knowledge of the CeCILL license and that you accept its terms.
See commits for full list of contributors.
This module is a merge of features from the deprecated modules Advanced Search Plus, Search and Psl Search Form and derivative ones.
The Psl search form and the Solr modules were initially built by BibLibre and were used by the digital library of PSL, a French university. Next improvements were done for various projects. The auto-completion was built for the future digital library of Campus Condorcet. The aggregated fields feature was built for the digital library Corpus du Louvre.