Customizing automatic resource generation

YAML-formatted options to customize solrconfig.xml and schema.xml generation.

You can customize solrconfig.xml and schema.xml generation by providing a YAML-formatted file of options:
auto_soft_commit_max_time

The maximum auto soft commit time in milliseconds.

default_query_field

The schema field to use when no field is specified in queries.

type_mapping_version

The Solr/Cassandra type mapping version.

directory_factory_class

The class name of the directory factory.

enable_string_copy_fields

Specify to enable generation of non-stored string copy fields for non-key text fields. Text data can be tokenized or non tokenized. The enable_string_copy fields is false by default. True creates a non-stored, non-tokenized copy field, so that you can have text both ways.

exclude_columns

A comma-separated (CSV) list of columns to exclude.

generate_docvalues_for_fields
Define the fields to automatically configure doc values in the generated schema. Specify '*' to add all possible fields:
generate_docvalues_for_fields: '*' ## You can omit this parameter or not specify a value
or specify a comma-separated list of fields, for example:
generate_docvalues_for_fields: uuidfield, bigintfield
include_columns
A comma-separated (CSV) list of columns to include. Empty = includes all columns.
index_merge_factor

The index merge factor.

index_ram_buffer_size

The index ram buffer size in MB.

lenient 
Ignore non-supported type columns and continue to generate resources, instead of erroring out when non-supported type columns are encountered. Default: false
resource_generation_profiles 
To minimize index size, specify a CSV list of profiles to apply while generating resources.
  • spaceSavingAll - Applies all of the following options.
  • spaceSavingNoTextfield - No TextFields. Use StrField instead.
  • spaceSavingNoJoin - Do not index a hidden primary key field. Prevents joins across cores.
  • spaceSavingSlowTriePrecision - Sets trie fields precisionStep to '0', allowing for greater space saving but slower querying.
For example:
resource_generation_profiles: spaceSavingNoTextfield, spaceSavingSlowTriePrecision
rt
Enable live indexing to increase indexing throughput. Enable live indexing on only one Solr core per cluster.
rt=true

Example to customize the solrconfig.xml and YAML files

For example, create a YAML file that lists the following options to customize the solrconfig.xml and YAML files:
default_query_field: name
auto_soft_commit_max_time: 1000
generate_docvalues_for_fields: '*'
enable_string_copy_fields: false

Use the dsetool command to create the core and customize the solrconfig and schema generation. Use coreOptions to specify the YAML file, for example:

dsetool create_core nhanes_ks.nhanes generateResources=true coreOptions=config.yaml

Example to create a core with live indexing on

To create the core with live indexing (also known as RT), use the dsetool command to create the core and use coreOptions to specify a YAML file that includes the rt: true setting, for example:
dsetool create_core udt_ks.users generateResources=true reindex=true coreOptions=rt.yaml

where the contents of the rt.yaml are rt: true

You can verify that DSE Search created the solrconfig and schema by reading core resources using dsetool.

Example to create a core with encryption

You can enable encryption for a new core.