Rebuild nozebra.pl: відмінності між версіями
Dubyk (обговорення | внесок) (→Опис) |
Dubyk (обговорення | внесок) (→Опис) |
||
Рядок 7: | Рядок 7: | ||
Параметри: |
Параметри: |
||
'''-b''' |
|||
— index bibliographic records |
|||
'''-a''' index authority records |
'''-a''' index authority records |
||
''' -daemon''' |
|||
— Run in daemon mode. The program will loop checking |
|||
for entries on the zebraqueue table, processing |
for entries on the zebraqueue table, processing |
||
them incrementally if present, and then sleep |
them incrementally if present, and then sleep |
||
Рядок 25: | Рядок 27: | ||
daemon (e.g. -r or -offset). |
daemon (e.g. -r or -offset). |
||
''' -sleep 10''' |
|||
— Seconds to sleep between checks of the zebraqueue |
|||
table in daemon mode. The default is 5 seconds. |
table in daemon mode. The default is 5 seconds. |
||
''' -z''' |
|||
— select only updated and deleted |
|||
records marked in the zebraqueue |
records marked in the zebraqueue |
||
table. Cannot be used with -r |
table. Cannot be used with -r |
||
or -s. |
or -s. |
||
--skip-deletes |
''' --skip-deletes''' |
||
— only select record updates, not record |
|||
deletions, to avoid potential excessive |
deletions, to avoid potential excessive |
||
I/O when zebraidx processes deletions. |
I/O when zebraidx processes deletions. |
||
Рядок 42: | Рядок 47: | ||
Only effective with -z. |
Only effective with -z. |
||
''' -r ''' |
|||
— clear Zebra index before |
|||
adding records to index. Implies -w. |
adding records to index. Implies -w. |
||
''' -d ''' |
|||
— Temporary directory for indexing. |
|||
If not specified, one is automatically |
If not specified, one is automatically |
||
created. The export directory |
created. The export directory |
||
Рядок 51: | Рядок 58: | ||
you supply the -k switch. |
you supply the -k switch. |
||
''' -k''' |
|||
— Do not delete export directory. |
|||
''' -s''' |
|||
— Skip export. Used if you have |
|||
already exported the records |
already exported the records |
||
in a previous run. |
in a previous run. |
||
''' -nosanitize''' |
|||
— export biblio/authority records directly from DB marcxml |
|||
field without sanitizing records. It speed up |
field without sanitizing records. It speed up |
||
dump process but could fail if DB contains badly |
dump process but could fail if DB contains badly |
||
encoded records. Works only with -x, |
encoded records. Works only with -x, |
||
''' -w''' |
|||
— skip shadow indexing for this batch |
|||
''' -y''' |
|||
— do NOT clear zebraqueue after indexing; normally, |
|||
after doing batch indexing, zebraqueue should be |
after doing batch indexing, zebraqueue should be |
||
marked done for the affected record type(s) so that |
marked done for the affected record type(s) so that |
||
Рядок 71: | Рядок 83: | ||
Cannot be used with -z. |
Cannot be used with -z. |
||
''' -v''' |
|||
— increase the amount of logging. Normally only |
|||
warnings and errors from the indexing are shown. |
warnings and errors from the indexing are shown. |
||
Use log level 2 (-v -v) to include all Zebra logs. |
Use log level 2 (-v -v) to include all Zebra logs. |
||
--length 1234 |
''' --length 1234''' |
||
— how many biblio you want to export |
|||
''' --offset 1243''' |
|||
— offset you want to start to |
|||
example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one |
example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one |
||
) |
) |
||
note that the numbers are NOT related to biblionumber, that's the intended behaviour. |
note that the numbers are NOT related to biblionumber, that's the intended behaviour. |
||
⚫ | |||
''' --where''' |
|||
⚫ | |||
or something like that |
or something like that |
||
--run-as-root |
''' --run-as-root''' |
||
— explicitily allow script to run as 'root' user |
|||
--wait-for-lock |
''' --wait-for-lock''' |
||
— when not running in daemon mode, the default |
|||
behavior is to abort a rebuild if the rebuild |
behavior is to abort a rebuild if the rebuild |
||
lock is busy. This option will cause the program |
lock is busy. This option will cause the program |
||
Рядок 91: | Рядок 111: | ||
processing the rebuild request, |
processing the rebuild request, |
||
''' --table''' |
|||
— specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index. |
|||
biblioitems is the default value. |
biblioitems is the default value. |
||
--help or -h |
''' --help or -h''' |
||
— show this message. |
|||
== Джерела == |
== Джерела == |
Версія за 16:21, 14 листопада 2022
Назва
rebuild_nozebra.pl — сценарій переіндексування бібліографічних та віторитетних MARC-записів у БД Zebra.
Використовуйте це пакетне завдання, щоб переіндексувати усі бібліотечні або авторитетні записи у Вашій базі даних Koha.
Опис
Параметри:
-b — index bibliographic records
-a index authority records
-daemon — Run in daemon mode. The program will loop checking
for entries on the zebraqueue table, processing them incrementally if present, and then sleep for a few seconds before repeating the process Checking the zebraqueue table is done with a cheap SQL query. This allows for near realtime update of the zebra search index with low system overhead. Use -sleep to control the checking interval.
Daemon mode implies -z, -a, -b. The program will refuse to start if options are present that do not make sense while running as an incremental update daemon (e.g. -r or -offset).
-sleep 10 — Seconds to sleep between checks of the zebraqueue
table in daemon mode. The default is 5 seconds.
-z — select only updated and deleted
records marked in the zebraqueue table. Cannot be used with -r or -s.
--skip-deletes — only select record updates, not record
deletions, to avoid potential excessive I/O when zebraidx processes deletions. If this option is used for normal indexing, a cronjob should be set up to run rebuild_zebra.pl -z without --skip-deletes during off hours. Only effective with -z.
-r — clear Zebra index before
adding records to index. Implies -w.
-d — Temporary directory for indexing.
If not specified, one is automatically created. The export directory is automatically deleted unless you supply the -k switch.
-k — Do not delete export directory.
-s — Skip export. Used if you have
already exported the records in a previous run.
-nosanitize — export biblio/authority records directly from DB marcxml
field without sanitizing records. It speed up dump process but could fail if DB contains badly encoded records. Works only with -x,
-w — skip shadow indexing for this batch
-y — do NOT clear zebraqueue after indexing; normally,
after doing batch indexing, zebraqueue should be marked done for the affected record type(s) so that a running zebraqueue_daemon doesn't try to reindex the same records - specify -y to override this. Cannot be used with -z.
-v — increase the amount of logging. Normally only
warnings and errors from the indexing are shown. Use log level 2 (-v -v) to include all Zebra logs.
--length 1234 — how many biblio you want to export
--offset 1243 — offset you want to start to
example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
)
note that the numbers are NOT related to biblionumber, that's the intended behaviour.
--where — let you specify a WHERE query, like itemtype='BOOK'
or something like that
--run-as-root — explicitily allow script to run as 'root' user
--wait-for-lock — when not running in daemon mode, the default
behavior is to abort a rebuild if the rebuild lock is busy. This option will cause the program to wait for the lock to free and then continue processing the rebuild request,
--table — specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
biblioitems is the default value.
--help or -h — show this message.