Rebuild nozebra.pl: відмінності між версіями
Перейти до навігації
Перейти до пошуку
Dubyk (обговорення | внесок) (→Опис) |
Dubyk (обговорення | внесок) (→Опис) |
||
(Не показані 2 проміжні версії цього користувача) | |||
Рядок 8: | Рядок 8: | ||
'''-b''' |
'''-b''' |
||
: — індексувати бібліографічні записи |
|||
: — index bibliographic records |
|||
'''-a''' |
'''-a''' |
||
: — індексувати авторитетні записи |
|||
: — index authority records |
|||
''' -daemon''' |
''' -daemon''' |
||
: — Run in daemon mode. The program will loop checking |
: — Run in daemon mode. The program will loop checking for entries on the zebraqueue table, processing them incrementally if present, and then sleep for a few seconds before repeating the process |
||
⚫ | |||
: for entries on the zebraqueue table, processing |
|||
⚫ | |||
: them incrementally if present, and then sleep |
|||
: Daemon mode implies -z, -a, -b. The program will refuse to start if options are present that do not make sense while running as an incremental update daemon (e.g. -r or -offset). |
|||
: for a few seconds before repeating the process |
|||
: Checking the zebraqueue table is done with a cheap |
|||
: SQL query. This allows for near realtime update of |
|||
⚫ | |||
⚫ | |||
: Daemon mode implies -z, -a, -b. The program will |
|||
: refuse to start if options are present that do not |
|||
: make sense while running as an incremental update |
|||
: daemon (e.g. -r or -offset). |
|||
''' -sleep 10''' |
''' -sleep 10''' |
||
: — Seconds to sleep between checks of the zebraqueue |
: — Seconds to sleep between checks of the zebraqueue table in daemon mode. The default is 5 seconds. |
||
: table in daemon mode. The default is 5 seconds. |
|||
''' -z''' |
''' -z''' |
||
: — select only updated and deleted |
: — select only updated and deleted records marked in the zebraqueue table. Cannot be used with -r or -s. |
||
: records marked in the zebraqueue |
|||
: table. Cannot be used with -r |
|||
: or -s. |
|||
''' --skip-deletes''' |
''' --skip-deletes''' |
||
: — only select record updates, not record |
: — only select record updates, not record deletions, to avoid potential excessive I/O when zebraidx processes deletions. |
||
: |
: If this option is used for normal indexing, a cronjob should be set up to run |
||
: I/O when zebraidx processes deletions. |
|||
: If this option is used for normal indexing, |
|||
: a cronjob should be set up to run |
|||
: rebuild_zebra.pl -z without --skip-deletes |
: rebuild_zebra.pl -z without --skip-deletes |
||
: during off hours. |
: during off hours. |
||
Рядок 49: | Рядок 33: | ||
''' -r ''' |
''' -r ''' |
||
: — clear Zebra index before |
: — clear Zebra index before adding records to index. Implies -w. |
||
adding records to index. Implies -w. |
|||
''' -d ''' |
''' -d ''' |
||
: — Temporary directory for indexing. |
: — Temporary directory for indexing. |
||
: If not specified, one is automatically |
: If not specified, one is automatically created. The export directory is automatically deleted unless you supply the -k switch. |
||
: created. The export directory |
|||
: is automatically deleted unless |
|||
: you supply the -k switch. |
|||
''' -k''' |
''' -k''' |
||
Рядок 63: | Рядок 43: | ||
''' -s''' |
''' -s''' |
||
: — Skip export. Used if you have |
: — Skip export. Used if you have already exported the records in a previous run. |
||
: already exported the records |
|||
: in a previous run. |
|||
''' -nosanitize''' |
''' -nosanitize''' |
||
: — export biblio/authority records directly from DB marcxml |
: — export biblio/authority records directly from DB marcxml field without sanitizing records. It speed up dump process but could fail if DB contains badly encoded records. Works only with -x, |
||
: field without sanitizing records. It speed up |
|||
: dump process but could fail if DB contains badly |
|||
: encoded records. Works only with -x, |
|||
''' -w''' |
''' -w''' |
||
Рядок 77: | Рядок 52: | ||
''' -y''' |
''' -y''' |
||
: — do NOT clear zebraqueue after indexing; normally, after doing batch indexing, zebraqueue should be marked done for the affected record type(s) so that a running zebraqueue_daemon doesn't try to reindex the same records - specify -y to override this. |
|||
: — do NOT clear zebraqueue after indexing; normally, |
|||
: after doing batch indexing, zebraqueue should be |
|||
: marked done for the affected record type(s) so that |
|||
: a running zebraqueue_daemon doesn't try to reindex |
|||
: the same records - specify -y to override this. |
|||
: Cannot be used with -z. |
: Cannot be used with -z. |
||
''' -v''' |
''' -v''' |
||
: — increase the amount of logging. Normally only |
: — increase the amount of logging. Normally only warnings and errors from the indexing are shown. |
||
: warnings and errors from the indexing are shown. |
|||
: Use log level 2 (-v -v) to include all Zebra logs. |
: Use log level 2 (-v -v) to include all Zebra logs. |
||
Рядок 99: | Рядок 69: | ||
''' --where''' |
''' --where''' |
||
: — let you specify a WHERE query, like itemtype='BOOK' |
: — let you specify a WHERE query, like itemtype='BOOK' or something like that |
||
or something like that |
|||
''' --run-as-root''' |
''' --run-as-root''' |
||
Рядок 106: | Рядок 75: | ||
''' --wait-for-lock''' |
''' --wait-for-lock''' |
||
: — when not running in daemon mode, the default behavior is to abort a rebuild if the rebuild lock is busy. This option will cause the program to wait for the lock to free and then continue processing the rebuild request, |
|||
: — when not running in daemon mode, the default |
|||
: behavior is to abort a rebuild if the rebuild |
|||
: lock is busy. This option will cause the program |
|||
: to wait for the lock to free and then continue |
|||
: processing the rebuild request, |
|||
''' --table''' |
''' --table''' |
Поточна версія на 16:31, 14 листопада 2022
Назва
rebuild_nozebra.pl — сценарій переіндексування бібліографічних та віторитетних MARC-записів у БД Zebra.
Використовуйте це пакетне завдання, щоб переіндексувати усі бібліотечні або авторитетні записи у Вашій базі даних Koha.
Опис
Параметри:
-b
- — індексувати бібліографічні записи
-a
- — індексувати авторитетні записи
-daemon
- — Run in daemon mode. The program will loop checking for entries on the zebraqueue table, processing them incrementally if present, and then sleep for a few seconds before repeating the process
- Checking the zebraqueue table is done with a cheap SQL query. This allows for near realtime update of the zebra search index with low system overhead.
- Use -sleep to control the checking interval.
- Daemon mode implies -z, -a, -b. The program will refuse to start if options are present that do not make sense while running as an incremental update daemon (e.g. -r or -offset).
-sleep 10
- — Seconds to sleep between checks of the zebraqueue table in daemon mode. The default is 5 seconds.
-z
- — select only updated and deleted records marked in the zebraqueue table. Cannot be used with -r or -s.
--skip-deletes
- — only select record updates, not record deletions, to avoid potential excessive I/O when zebraidx processes deletions.
- If this option is used for normal indexing, a cronjob should be set up to run
- rebuild_zebra.pl -z without --skip-deletes
- during off hours.
- Only effective with -z.
-r
- — clear Zebra index before adding records to index. Implies -w.
-d
- — Temporary directory for indexing.
- If not specified, one is automatically created. The export directory is automatically deleted unless you supply the -k switch.
-k
- — Do not delete export directory.
-s
- — Skip export. Used if you have already exported the records in a previous run.
-nosanitize
- — export biblio/authority records directly from DB marcxml field without sanitizing records. It speed up dump process but could fail if DB contains badly encoded records. Works only with -x,
-w
- — skip shadow indexing for this batch
-y
- — do NOT clear zebraqueue after indexing; normally, after doing batch indexing, zebraqueue should be marked done for the affected record type(s) so that a running zebraqueue_daemon doesn't try to reindex the same records - specify -y to override this.
- Cannot be used with -z.
-v
- — increase the amount of logging. Normally only warnings and errors from the indexing are shown.
- Use log level 2 (-v -v) to include all Zebra logs.
--length 1234
- — how many biblio you want to export
--offset 1243
- — offset you want to start to
- example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
- )
- note that the numbers are NOT related to biblionumber, that's the intended behaviour.
--where
- — let you specify a WHERE query, like itemtype='BOOK' or something like that
--run-as-root
- — explicitily allow script to run as 'root' user
--wait-for-lock
- — when not running in daemon mode, the default behavior is to abort a rebuild if the rebuild lock is busy. This option will cause the program to wait for the lock to free and then continue processing the rebuild request,
--table
- — specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
- biblioitems is the default value.
--help or -h
- — show this message.