Відмінності між версіями «Rebuild nozebra.pl»

Матеріал з Koha Ukraine Wiki
Перейти до: навігація, пошук
(Опис)
(Опис)
Рядок 7: Рядок 7:
 
Параметри:
 
Параметри:
  
'''-b'''                     index bibliographic records
+
'''-b'''  
 +
—                    index bibliographic records
  
 
'''-a'''                      index authority records
 
'''-a'''                      index authority records
  
    -daemon                Run in daemon mode.  The program will loop checking
+
'''    -daemon'''
 +
                Run in daemon mode.  The program will loop checking
 
                             for entries on the zebraqueue table, processing
 
                             for entries on the zebraqueue table, processing
 
                             them incrementally if present, and then sleep
 
                             them incrementally if present, and then sleep
Рядок 25: Рядок 27:
 
                             daemon (e.g. -r or -offset).
 
                             daemon (e.g. -r or -offset).
  
    -sleep 10              Seconds to sleep between checks of the zebraqueue
+
'''    -sleep 10'''
 +
              Seconds to sleep between checks of the zebraqueue
 
                             table in daemon mode.  The default is 5 seconds.
 
                             table in daemon mode.  The default is 5 seconds.
  
    -z                      select only updated and deleted
+
'''    -z'''
 +
                    select only updated and deleted
 
                             records marked in the zebraqueue
 
                             records marked in the zebraqueue
 
                             table.  Cannot be used with -r
 
                             table.  Cannot be used with -r
 
                             or -s.
 
                             or -s.
  
    --skip-deletes          only select record updates, not record
+
'''    --skip-deletes'''
 +
        only select record updates, not record
 
                             deletions, to avoid potential excessive
 
                             deletions, to avoid potential excessive
 
                             I/O when zebraidx processes deletions.
 
                             I/O when zebraidx processes deletions.
Рядок 42: Рядок 47:
 
                             Only effective with -z.
 
                             Only effective with -z.
  
    -r                     clear Zebra index before
+
'''    -r '''
 +
—                    clear Zebra index before
 
                             adding records to index. Implies -w.
 
                             adding records to index. Implies -w.
  
    -d                     Temporary directory for indexing.
+
'''    -d '''
 +
—                    Temporary directory for indexing.
 
                             If not specified, one is automatically
 
                             If not specified, one is automatically
 
                             created.  The export directory
 
                             created.  The export directory
Рядок 51: Рядок 58:
 
                             you supply the -k switch.
 
                             you supply the -k switch.
  
    -k                      Do not delete export directory.
+
'''    -k'''
 +
                    Do not delete export directory.
  
    -s                      Skip export.  Used if you have
+
'''    -s'''
 +
                    Skip export.  Used if you have
 
                             already exported the records
 
                             already exported the records
 
                             in a previous run.
 
                             in a previous run.
  
    -nosanitize            export biblio/authority records directly from DB marcxml
+
'''    -nosanitize'''
 +
            export biblio/authority records directly from DB marcxml
 
                             field without sanitizing records. It speed up
 
                             field without sanitizing records. It speed up
 
                             dump process but could fail if DB contains badly
 
                             dump process but could fail if DB contains badly
 
                             encoded records. Works only with -x,
 
                             encoded records. Works only with -x,
  
    -w                      skip shadow indexing for this batch
+
'''    -w'''
 +
                    skip shadow indexing for this batch
  
    -y                      do NOT clear zebraqueue after indexing; normally,
+
'''    -y'''
 +
                    do NOT clear zebraqueue after indexing; normally,
 
                             after doing batch indexing, zebraqueue should be
 
                             after doing batch indexing, zebraqueue should be
 
                             marked done for the affected record type(s) so that
 
                             marked done for the affected record type(s) so that
Рядок 71: Рядок 83:
 
                             Cannot be used with -z.
 
                             Cannot be used with -z.
  
    -v                      increase the amount of logging.  Normally only
+
'''    -v'''
 +
                    increase the amount of logging.  Normally only
 
                             warnings and errors from the indexing are shown.
 
                             warnings and errors from the indexing are shown.
 
                             Use log level 2 (-v -v) to include all Zebra logs.
 
                             Use log level 2 (-v -v) to include all Zebra logs.
  
    --length  1234        how many biblio you want to export
+
'''    --length  1234'''
    --offset 1243          offset you want to start to
+
        how many biblio you want to export
 +
 
 +
'''    --offset 1243'''
 +
          offset you want to start to
 
                                 example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
 
                                 example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one
 
)
 
)
 
                                 note that the numbers are NOT related to biblionumber, that's the intended behaviour.
 
                                 note that the numbers are NOT related to biblionumber, that's the intended behaviour.
    --where                let you specify a WHERE query, like itemtype='BOOK'
+
 
 +
'''    --where'''
 +
                let you specify a WHERE query, like itemtype='BOOK'
 
                             or something like that
 
                             or something like that
  
    --run-as-root          explicitily allow script to run as 'root' user
+
'''    --run-as-root'''
 +
          explicitily allow script to run as 'root' user
  
  --wait-for-lock        when not running in daemon mode, the default
+
'''  --wait-for-lock'''
 +
        when not running in daemon mode, the default
 
                             behavior is to abort a rebuild if the rebuild
 
                             behavior is to abort a rebuild if the rebuild
 
                             lock is busy.  This option will cause the program
 
                             lock is busy.  This option will cause the program
Рядок 91: Рядок 111:
 
                             processing the rebuild request,
 
                             processing the rebuild request,
  
    --table                specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
+
'''    --table'''
 +
                specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.
 
                             biblioitems is the default value.
 
                             biblioitems is the default value.
  
    --help or -h            show this message.
+
'''    --help or -h'''
 +
          show this message.
  
 
== Джерела ==
 
== Джерела ==
 
* https://git.koha-community.org/gitweb/?p=koha.git;a=blob;f=misc/migration_tools/rebuild_zebra.pl;h=ec19ce1f5c40445dcdb7c033414fde02dcb06c2b;hb=HEAD
 
* https://git.koha-community.org/gitweb/?p=koha.git;a=blob;f=misc/migration_tools/rebuild_zebra.pl;h=ec19ce1f5c40445dcdb7c033414fde02dcb06c2b;hb=HEAD
 
* https://github.com/Koha-Community/Koha/blob/master/misc/migration_tools/rebuild_zebra.pl
 
* https://github.com/Koha-Community/Koha/blob/master/misc/migration_tools/rebuild_zebra.pl

Версія за 17:21, 14 листопада 2022

Назва

rebuild_nozebra.pl — сценарій переіндексування бібліографічних та віторитетних MARC-записів у БД Zebra.

Використовуйте це пакетне завдання, щоб переіндексувати усі бібліотечні або авторитетні записи у Вашій базі даних Koha.

Опис

Параметри:

-b — index bibliographic records

-a index authority records

-daemon — Run in daemon mode. The program will loop checking

                           for entries on the zebraqueue table, processing
                           them incrementally if present, and then sleep
                           for a few seconds before repeating the process
                           Checking the zebraqueue table is done with a cheap
                           SQL query.  This allows for near realtime update of
                           the zebra search index with low system overhead.
                           Use -sleep to control the checking interval.
                           Daemon mode implies -z, -a, -b.  The program will
                           refuse to start if options are present that do not
                           make sense while running as an incremental update
                           daemon (e.g. -r or -offset).

-sleep 10 — Seconds to sleep between checks of the zebraqueue

                           table in daemon mode.  The default is 5 seconds.

-z — select only updated and deleted

                           records marked in the zebraqueue
                           table.  Cannot be used with -r
                           or -s.

--skip-deletes — only select record updates, not record

                           deletions, to avoid potential excessive
                           I/O when zebraidx processes deletions.
                           If this option is used for normal indexing,
                           a cronjob should be set up to run
                           rebuild_zebra.pl -z without --skip-deletes
                           during off hours.
                           Only effective with -z.

-r — clear Zebra index before

                           adding records to index. Implies -w.

-d — Temporary directory for indexing.

                           If not specified, one is automatically
                           created.  The export directory
                           is automatically deleted unless
                           you supply the -k switch.

-k — Do not delete export directory.

-s — Skip export. Used if you have

                           already exported the records
                           in a previous run.

-nosanitize — export biblio/authority records directly from DB marcxml

                           field without sanitizing records. It speed up
                           dump process but could fail if DB contains badly
                           encoded records. Works only with -x,

-w — skip shadow indexing for this batch

-y — do NOT clear zebraqueue after indexing; normally,

                           after doing batch indexing, zebraqueue should be
                           marked done for the affected record type(s) so that
                           a running zebraqueue_daemon doesn't try to reindex
                           the same records - specify -y to override this.
                           Cannot be used with -z.

-v — increase the amount of logging. Normally only

                           warnings and errors from the indexing are shown.
                           Use log level 2 (-v -v) to include all Zebra logs.

--length 1234 — how many biblio you want to export

--offset 1243 — offset you want to start to

                               example: --offset 500 --length=500 will result in a LIMIT 500,1000 (exporting 1000 records, starting by the 500th one

)

                               note that the numbers are NOT related to biblionumber, that's the intended behaviour.

--where — let you specify a WHERE query, like itemtype='BOOK'

                           or something like that

--run-as-root — explicitily allow script to run as 'root' user

--wait-for-lock — when not running in daemon mode, the default

                           behavior is to abort a rebuild if the rebuild
                           lock is busy.  This option will cause the program
                           to wait for the lock to free and then continue
                           processing the rebuild request,

--table — specify a table (can be items, biblioitems, biblio, biblio_metadata) to retrieve biblionumber to index.

                           biblioitems is the default value.

--help or -h — show this message.

Джерела