Rebuild nozebra.pl: відмінності між версіями

Матеріал з Koha Ukraine Wiki
Перейти до навігації Перейти до пошуку
Немає опису редагування
Рядок 1: Рядок 1:
== Назва ==
rebuild_nozebra.pl — основний сценарій створення пошукових покажчиків для режиму NoZebra
rebuild_nozebra.pl — сценарій переіндексування бібліографічних та віторитетних MARC-записів у БД Zebra.

Використовуйте це пакетне завдання, щоб переіндексувати усі бібліотечні або авторитетні записи у Вашій базі даних Koha.

== Опис ==
Параметри:

-b index bibliographic records

-a index authority records

-daemon Run in daemon mode. The program will loop checking
for entries on the zebraqueue table, processing
them incrementally if present, and then sleep
for a few seconds before repeating the process
Checking the zebraqueue table is done with a cheap
SQL query. This allows for near realtime update of
the zebra search index with low system overhead.
Use -sleep to control the checking interval.

Daemon mode implies -z, -a, -b. The program will
refuse to start if options are present that do not
make sense while running as an incremental update
daemon (e.g. -r or -offset).

-sleep 10 Seconds to sleep between checks of the zebraqueue
table in daemon mode. The default is 5 seconds.

-z select only updated and deleted
records marked in the zebraqueue
table. Cannot be used with -r
or -s.

--skip-deletes only select record updates, not record
deletions, to avoid potential excessive
I/O when zebraidx processes deletions.
If this option is used for normal indexing,
a cronjob should be set up to run
rebuild_zebra.pl -z without --skip-deletes
during off hours.
Only effective with -z.

-r clear Zebra index before
adding records to index. Implies -w.

-d Temporary directory for indexing.
If not specified, one is automatically
created. The export directory
is automatically deleted unless
you supply the -k switch.

-k Do not delete export directory.

-s Skip export. Used if you have
already exported the records
in a previous run.

-nosanitize export biblio/authority records directly from DB marcxml
field without sanitizing records. It speed up
dump process but could fail if DB contains badly
encoded records. Works only with -x,

-w skip shadow indexing for this batch

-y do NOT clear zebraqueue after indexing; normally,
after doing batch indexing, zebraqueue should be
marked done for the affected record type(s) so that
a running zebraqueue_daemon doesn't try to reindex
the same records - specify -y to override this.
Cannot be used with -z.

-v increase the amount of logging. Normally only
warnings and errors from the indexing are shown.
Use log level 2 (-v -v) to include all Zebra logs.

--length 1234 how many biblio you want to export
--offset 1243 offset you want to start to
example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
)
note that the numbers are NOT related to biblionumber, that's the intended behaviour.
--where let you specify a WHERE query, like itemtype='BOOK'
or something like that

--run-as-root explicitily allow script to run as 'root' user

--wait-for-lock when not running in daemon mode, the default
behavior is to abort a rebuild if the rebuild
lock is busy. This option will cause the program
to wait for the lock to free and then continue
processing the rebuild request,

--table specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
biblioitems is the default value.

--help or -h show this message.


== Джерела ==
== Джерела ==

Версія за 16:19, 14 листопада 2022

Назва

rebuild_nozebra.pl — сценарій переіндексування бібліографічних та віторитетних MARC-записів у БД Zebra.

Використовуйте це пакетне завдання, щоб переіндексувати усі бібліотечні або авторитетні записи у Вашій базі даних Koha.

Опис

Параметри:

   -b                      index bibliographic records
   -a                      index authority records
   -daemon                 Run in daemon mode.  The program will loop checking
                           for entries on the zebraqueue table, processing
                           them incrementally if present, and then sleep
                           for a few seconds before repeating the process
                           Checking the zebraqueue table is done with a cheap
                           SQL query.  This allows for near realtime update of
                           the zebra search index with low system overhead.
                           Use -sleep to control the checking interval.
                           Daemon mode implies -z, -a, -b.  The program will
                           refuse to start if options are present that do not
                           make sense while running as an incremental update
                           daemon (e.g. -r or -offset).
   -sleep 10               Seconds to sleep between checks of the zebraqueue
                           table in daemon mode.  The default is 5 seconds.
   -z                      select only updated and deleted
                           records marked in the zebraqueue
                           table.  Cannot be used with -r
                           or -s.
   --skip-deletes          only select record updates, not record
                           deletions, to avoid potential excessive
                           I/O when zebraidx processes deletions.
                           If this option is used for normal indexing,
                           a cronjob should be set up to run
                           rebuild_zebra.pl -z without --skip-deletes
                           during off hours.
                           Only effective with -z.
   -r                      clear Zebra index before
                           adding records to index. Implies -w.
   -d                      Temporary directory for indexing.
                           If not specified, one is automatically
                           created.  The export directory
                           is automatically deleted unless
                           you supply the -k switch.
   -k                      Do not delete export directory.
   -s                      Skip export.  Used if you have
                           already exported the records
                           in a previous run.
   -nosanitize             export biblio/authority records directly from DB marcxml
                           field without sanitizing records. It speed up
                           dump process but could fail if DB contains badly
                           encoded records. Works only with -x,
   -w                      skip shadow indexing for this batch
   -y                      do NOT clear zebraqueue after indexing; normally,
                           after doing batch indexing, zebraqueue should be
                           marked done for the affected record type(s) so that
                           a running zebraqueue_daemon doesn't try to reindex
                           the same records - specify -y to override this.
                           Cannot be used with -z.
   -v                      increase the amount of logging.  Normally only
                           warnings and errors from the indexing are shown.
                           Use log level 2 (-v -v) to include all Zebra logs.
   --length   1234         how many biblio you want to export
   --offset 1243           offset you want to start to
                               example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one

)

                               note that the numbers are NOT related to biblionumber, that's the intended behaviour.
   --where                 let you specify a WHERE query, like itemtype='BOOK'
                           or something like that
   --run-as-root           explicitily allow script to run as 'root' user
  --wait-for-lock         when not running in daemon mode, the default
                           behavior is to abort a rebuild if the rebuild
                           lock is busy.  This option will cause the program
                           to wait for the lock to free and then continue
                           processing the rebuild request,
   --table                 specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
                           biblioitems is the default value.
   --help or -h            show this message.

Джерела