Відмінності між версіями «Rebuild nozebra.pl»
Dubyk (обговорення • внесок) (→Опис) |
Dubyk (обговорення • внесок) (→Опис) |
||
Рядок 29: | Рядок 29: | ||
''' -sleep 10''' | ''' -sleep 10''' | ||
− | — Seconds to sleep between checks of the zebraqueue | + | : — Seconds to sleep between checks of the zebraqueue |
− | table in daemon mode. The default is 5 seconds. | + | : table in daemon mode. The default is 5 seconds. |
''' -z''' | ''' -z''' | ||
− | — select only updated and deleted | + | : — select only updated and deleted |
− | records marked in the zebraqueue | + | : records marked in the zebraqueue |
− | table. Cannot be used with -r | + | : table. Cannot be used with -r |
or -s. | or -s. | ||
''' --skip-deletes''' | ''' --skip-deletes''' | ||
− | — only select record updates, not record | + | : — only select record updates, not record |
− | deletions, to avoid potential excessive | + | : deletions, to avoid potential excessive |
− | I/O when zebraidx processes deletions. | + | : I/O when zebraidx processes deletions. |
− | If this option is used for normal indexing, | + | : If this option is used for normal indexing, |
− | a cronjob should be set up to run | + | : a cronjob should be set up to run |
− | rebuild_zebra.pl -z without --skip-deletes | + | : rebuild_zebra.pl -z without --skip-deletes |
− | during off hours. | + | : during off hours. |
− | Only effective with -z. | + | : Only effective with -z. |
''' -r ''' | ''' -r ''' | ||
− | — clear Zebra index before | + | : — clear Zebra index before |
adding records to index. Implies -w. | adding records to index. Implies -w. | ||
''' -d ''' | ''' -d ''' | ||
− | — Temporary directory for indexing. | + | : — Temporary directory for indexing. |
− | If not specified, one is automatically | + | : If not specified, one is automatically |
− | created. The export directory | + | : created. The export directory |
− | is automatically deleted unless | + | : is automatically deleted unless |
− | you supply the -k switch. | + | : you supply the -k switch. |
''' -k''' | ''' -k''' | ||
− | — Do not delete export directory. | + | : — Do not delete export directory. |
''' -s''' | ''' -s''' | ||
− | — Skip export. Used if you have | + | : — Skip export. Used if you have |
− | already exported the records | + | : already exported the records |
− | in a previous run. | + | : in a previous run. |
''' -nosanitize''' | ''' -nosanitize''' | ||
− | — export biblio/authority records directly from DB marcxml | + | : — export biblio/authority records directly from DB marcxml |
− | field without sanitizing records. It speed up | + | : field without sanitizing records. It speed up |
− | dump process but could fail if DB contains badly | + | : dump process but could fail if DB contains badly |
− | encoded records. Works only with -x, | + | : encoded records. Works only with -x, |
''' -w''' | ''' -w''' | ||
− | — skip shadow indexing for this batch | + | : — skip shadow indexing for this batch |
''' -y''' | ''' -y''' | ||
− | — do NOT clear zebraqueue after indexing; normally, | + | : — do NOT clear zebraqueue after indexing; normally, |
− | after doing batch indexing, zebraqueue should be | + | : after doing batch indexing, zebraqueue should be |
− | marked done for the affected record type(s) so that | + | : marked done for the affected record type(s) so that |
− | a running zebraqueue_daemon doesn't try to reindex | + | : a running zebraqueue_daemon doesn't try to reindex |
− | the same records - specify -y to override this. | + | : the same records - specify -y to override this. |
− | Cannot be used with -z. | + | : Cannot be used with -z. |
''' -v''' | ''' -v''' | ||
− | — increase the amount of logging. Normally only | + | : — increase the amount of logging. Normally only |
− | warnings and errors from the indexing are shown. | + | : warnings and errors from the indexing are shown. |
− | Use log level 2 (-v -v) to include all Zebra logs. | + | : Use log level 2 (-v -v) to include all Zebra logs. |
''' --length 1234''' | ''' --length 1234''' | ||
− | — how many biblio you want to export | + | : — how many biblio you want to export |
''' --offset 1243''' | ''' --offset 1243''' | ||
− | — offset you want to start to | + | : — offset you want to start to |
− | example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one | + | : example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one |
− | ) | + | : ) |
− | note that the numbers are NOT related to biblionumber, that's the intended behaviour. | + | : note that the numbers are NOT related to biblionumber, that's the intended behaviour. |
''' --where''' | ''' --where''' | ||
− | — let you specify a WHERE query, like itemtype='BOOK' | + | : — let you specify a WHERE query, like itemtype='BOOK' |
or something like that | or something like that | ||
''' --run-as-root''' | ''' --run-as-root''' | ||
− | — explicitily allow script to run as 'root' user | + | : — explicitily allow script to run as 'root' user |
''' --wait-for-lock''' | ''' --wait-for-lock''' | ||
− | — when not running in daemon mode, the default | + | : — when not running in daemon mode, the default |
− | behavior is to abort a rebuild if the rebuild | + | : behavior is to abort a rebuild if the rebuild |
− | lock is busy. This option will cause the program | + | : lock is busy. This option will cause the program |
− | to wait for the lock to free and then continue | + | : to wait for the lock to free and then continue |
− | processing the rebuild request, | + | : processing the rebuild request, |
''' --table''' | ''' --table''' | ||
− | — specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index. | + | : — specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index. |
− | biblioitems is the default value. | + | : biblioitems is the default value. |
''' --help or -h''' | ''' --help or -h''' | ||
− | — show this message. | + | : — show this message. |
== Джерела == | == Джерела == | ||
* https://git.koha-community.org/gitweb/?p=koha.git;a=blob;f=misc/migration_tools/rebuild_zebra.pl;h=ec19ce1f5c40445dcdb7c033414fde02dcb06c2b;hb=HEAD | * https://git.koha-community.org/gitweb/?p=koha.git;a=blob;f=misc/migration_tools/rebuild_zebra.pl;h=ec19ce1f5c40445dcdb7c033414fde02dcb06c2b;hb=HEAD | ||
* https://github.com/Koha-Community/Koha/blob/master/misc/migration_tools/rebuild_zebra.pl | * https://github.com/Koha-Community/Koha/blob/master/misc/migration_tools/rebuild_zebra.pl |
Версія за 16:24, 14 листопада 2022
Назва
rebuild_nozebra.pl — сценарій переіндексування бібліографічних та віторитетних MARC-записів у БД Zebra.
Використовуйте це пакетне завдання, щоб переіндексувати усі бібліотечні або авторитетні записи у Вашій базі даних Koha.
Опис
Параметри:
-b
- — index bibliographic records
-a
- — index authority records
-daemon
- — Run in daemon mode. The program will loop checking
- for entries on the zebraqueue table, processing
- them incrementally if present, and then sleep
- for a few seconds before repeating the process
- Checking the zebraqueue table is done with a cheap
- SQL query. This allows for near realtime update of
- the zebra search index with low system overhead.
- Use -sleep to control the checking interval.
- Daemon mode implies -z, -a, -b. The program will
- refuse to start if options are present that do not
- make sense while running as an incremental update
- daemon (e.g. -r or -offset).
-sleep 10
- — Seconds to sleep between checks of the zebraqueue
- table in daemon mode. The default is 5 seconds.
-z
- — select only updated and deleted
- records marked in the zebraqueue
- table. Cannot be used with -r
or -s.
--skip-deletes
- — only select record updates, not record
- deletions, to avoid potential excessive
- I/O when zebraidx processes deletions.
- If this option is used for normal indexing,
- a cronjob should be set up to run
- rebuild_zebra.pl -z without --skip-deletes
- during off hours.
- Only effective with -z.
-r
- — clear Zebra index before
adding records to index. Implies -w.
-d
- — Temporary directory for indexing.
- If not specified, one is automatically
- created. The export directory
- is automatically deleted unless
- you supply the -k switch.
-k
- — Do not delete export directory.
-s
- — Skip export. Used if you have
- already exported the records
- in a previous run.
-nosanitize
- — export biblio/authority records directly from DB marcxml
- field without sanitizing records. It speed up
- dump process but could fail if DB contains badly
- encoded records. Works only with -x,
-w
- — skip shadow indexing for this batch
-y
- — do NOT clear zebraqueue after indexing; normally,
- after doing batch indexing, zebraqueue should be
- marked done for the affected record type(s) so that
- a running zebraqueue_daemon doesn't try to reindex
- the same records - specify -y to override this.
- Cannot be used with -z.
-v
- — increase the amount of logging. Normally only
- warnings and errors from the indexing are shown.
- Use log level 2 (-v -v) to include all Zebra logs.
--length 1234
- — how many biblio you want to export
--offset 1243
- — offset you want to start to
- example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
- )
- note that the numbers are NOT related to biblionumber, that's the intended behaviour.
--where
- — let you specify a WHERE query, like itemtype='BOOK'
or something like that
--run-as-root
- — explicitily allow script to run as 'root' user
--wait-for-lock
- — when not running in daemon mode, the default
- behavior is to abort a rebuild if the rebuild
- lock is busy. This option will cause the program
- to wait for the lock to free and then continue
- processing the rebuild request,
--table
- — specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
- biblioitems is the default value.
--help or -h
- — show this message.