Rebuild nozebra.pl: відмінності між версіями

Матеріал з Koha Ukraine Wiki
Перейти до навігації Перейти до пошуку
Рядок 7: Рядок 7:
Параметри:
Параметри:


'''-b'''
'''-b''' index bibliographic records
index bibliographic records


'''-a''' index authority records
'''-a''' index authority records


''' -daemon'''
-daemon Run in daemon mode. The program will loop checking
Run in daemon mode. The program will loop checking
for entries on the zebraqueue table, processing
for entries on the zebraqueue table, processing
them incrementally if present, and then sleep
them incrementally if present, and then sleep
Рядок 25: Рядок 27:
daemon (e.g. -r or -offset).
daemon (e.g. -r or -offset).


''' -sleep 10'''
-sleep 10 Seconds to sleep between checks of the zebraqueue
Seconds to sleep between checks of the zebraqueue
table in daemon mode. The default is 5 seconds.
table in daemon mode. The default is 5 seconds.


''' -z'''
-z select only updated and deleted
select only updated and deleted
records marked in the zebraqueue
records marked in the zebraqueue
table. Cannot be used with -r
table. Cannot be used with -r
or -s.
or -s.


--skip-deletes only select record updates, not record
''' --skip-deletes'''
— only select record updates, not record
deletions, to avoid potential excessive
deletions, to avoid potential excessive
I/O when zebraidx processes deletions.
I/O when zebraidx processes deletions.
Рядок 42: Рядок 47:
Only effective with -z.
Only effective with -z.


''' -r '''
-r clear Zebra index before
clear Zebra index before
adding records to index. Implies -w.
adding records to index. Implies -w.


''' -d '''
-d Temporary directory for indexing.
Temporary directory for indexing.
If not specified, one is automatically
If not specified, one is automatically
created. The export directory
created. The export directory
Рядок 51: Рядок 58:
you supply the -k switch.
you supply the -k switch.


''' -k'''
-k Do not delete export directory.
Do not delete export directory.


''' -s'''
-s Skip export. Used if you have
Skip export. Used if you have
already exported the records
already exported the records
in a previous run.
in a previous run.


''' -nosanitize'''
-nosanitize export biblio/authority records directly from DB marcxml
export biblio/authority records directly from DB marcxml
field without sanitizing records. It speed up
field without sanitizing records. It speed up
dump process but could fail if DB contains badly
dump process but could fail if DB contains badly
encoded records. Works only with -x,
encoded records. Works only with -x,


''' -w'''
-w skip shadow indexing for this batch
skip shadow indexing for this batch


''' -y'''
-y do NOT clear zebraqueue after indexing; normally,
do NOT clear zebraqueue after indexing; normally,
after doing batch indexing, zebraqueue should be
after doing batch indexing, zebraqueue should be
marked done for the affected record type(s) so that
marked done for the affected record type(s) so that
Рядок 71: Рядок 83:
Cannot be used with -z.
Cannot be used with -z.


''' -v'''
-v increase the amount of logging. Normally only
increase the amount of logging. Normally only
warnings and errors from the indexing are shown.
warnings and errors from the indexing are shown.
Use log level 2 (-v -v) to include all Zebra logs.
Use log level 2 (-v -v) to include all Zebra logs.


--length 1234 how many biblio you want to export
''' --length 1234'''
--offset 1243 offset you want to start to
how many biblio you want to export

''' --offset 1243'''
— offset you want to start to
example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
)
)
note that the numbers are NOT related to biblionumber, that's the intended behaviour.
note that the numbers are NOT related to biblionumber, that's the intended behaviour.

--where let you specify a WHERE query, like itemtype='BOOK'
''' --where'''
let you specify a WHERE query, like itemtype='BOOK'
or something like that
or something like that


--run-as-root explicitily allow script to run as 'root' user
''' --run-as-root'''
— explicitily allow script to run as 'root' user


--wait-for-lock when not running in daemon mode, the default
''' --wait-for-lock'''
— when not running in daemon mode, the default
behavior is to abort a rebuild if the rebuild
behavior is to abort a rebuild if the rebuild
lock is busy. This option will cause the program
lock is busy. This option will cause the program
Рядок 91: Рядок 111:
processing the rebuild request,
processing the rebuild request,


''' --table'''
--table specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
biblioitems is the default value.
biblioitems is the default value.


--help or -h show this message.
''' --help or -h'''
— show this message.


== Джерела ==
== Джерела ==

Версія за 16:21, 14 листопада 2022

Назва

rebuild_nozebra.pl — сценарій переіндексування бібліографічних та віторитетних MARC-записів у БД Zebra.

Використовуйте це пакетне завдання, щоб переіндексувати усі бібліотечні або авторитетні записи у Вашій базі даних Koha.

Опис

Параметри:

-b — index bibliographic records

-a index authority records

-daemon — Run in daemon mode. The program will loop checking

                           for entries on the zebraqueue table, processing
                           them incrementally if present, and then sleep
                           for a few seconds before repeating the process
                           Checking the zebraqueue table is done with a cheap
                           SQL query.  This allows for near realtime update of
                           the zebra search index with low system overhead.
                           Use -sleep to control the checking interval.
                           Daemon mode implies -z, -a, -b.  The program will
                           refuse to start if options are present that do not
                           make sense while running as an incremental update
                           daemon (e.g. -r or -offset).

-sleep 10 — Seconds to sleep between checks of the zebraqueue

                           table in daemon mode.  The default is 5 seconds.

-z — select only updated and deleted

                           records marked in the zebraqueue
                           table.  Cannot be used with -r
                           or -s.

--skip-deletes — only select record updates, not record

                           deletions, to avoid potential excessive
                           I/O when zebraidx processes deletions.
                           If this option is used for normal indexing,
                           a cronjob should be set up to run
                           rebuild_zebra.pl -z without --skip-deletes
                           during off hours.
                           Only effective with -z.

-r — clear Zebra index before

                           adding records to index. Implies -w.

-d — Temporary directory for indexing.

                           If not specified, one is automatically
                           created.  The export directory
                           is automatically deleted unless
                           you supply the -k switch.

-k — Do not delete export directory.

-s — Skip export. Used if you have

                           already exported the records
                           in a previous run.

-nosanitize — export biblio/authority records directly from DB marcxml

                           field without sanitizing records. It speed up
                           dump process but could fail if DB contains badly
                           encoded records. Works only with -x,

-w — skip shadow indexing for this batch

-y — do NOT clear zebraqueue after indexing; normally,

                           after doing batch indexing, zebraqueue should be
                           marked done for the affected record type(s) so that
                           a running zebraqueue_daemon doesn't try to reindex
                           the same records - specify -y to override this.
                           Cannot be used with -z.

-v — increase the amount of logging. Normally only

                           warnings and errors from the indexing are shown.
                           Use log level 2 (-v -v) to include all Zebra logs.

--length 1234 — how many biblio you want to export

--offset 1243 — offset you want to start to

                               example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one

)

                               note that the numbers are NOT related to biblionumber, that's the intended behaviour.

--where — let you specify a WHERE query, like itemtype='BOOK'

                           or something like that

--run-as-root — explicitily allow script to run as 'root' user

--wait-for-lock — when not running in daemon mode, the default

                           behavior is to abort a rebuild if the rebuild
                           lock is busy.  This option will cause the program
                           to wait for the lock to free and then continue
                           processing the rebuild request,

--table — specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.

                           biblioitems is the default value.

--help or -h — show this message.

Джерела