
QA catalogue
a metadata quality assessment tool for library catalogue records (MARC, PICA)


QA catalogue is a set of software packages for bibliographical record quality assessment. It reads MARC or PICA files (in different formats), analyses some quality dimensions, and saves the results into CSV files. These CSV files can be used in different contexts; we provide a lightweight, web-based user interface for exploring them. Some of the functionality is also available as a web service, so the validation can be built into a cataloguing/quality assessment workflow.

Output sample: screenshot from the web UI of QA catalogue


Quick start guide

Installation

See INSTALL.md for dependencies.

  1. wget https://github.com/pkiraly/metadata-qa-marc/releases/download/v0.6.0/metadata-qa-marc-0.6.0-release.zip
  2. unzip metadata-qa-marc-0.6.0-release.zip
  3. cd metadata-qa-marc-0.6.0/

Configuration

Either use the script qa-catalogue or create configuration files:

  1. cp setdir.sh.template setdir.sh

Change the input and output base directories in setdir.sh. The local directories input/ and output/ will be used by default. The files of each catalogue are in a subdirectory of these base directories:

  1. Create configuration based on some existing config files:
    • cp catalogues/loc.sh catalogues/[abbreviation-of-your-library].sh
    • edit catalogues/[abbreviation-of-your-library].sh according to configuration guide
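
For orientation, here is a minimal sketch of what the directory layout could look like, assuming the default base directories and a hypothetical catalogue abbreviated as `loc`:

```bash
# a sketch only; adjust paths to your setdir.sh settings
mkdir -p input/loc output
cp /path/to/your/records/*.mrc input/loc/   # source files of this catalogue
# analysis results will later appear under output/loc/ (i.e. $BASE_OUTPUT_DIR/$NAME)
```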

With Docker

More detailed instructions on how to use qa-catalogue with Docker can be found in the wiki.

A Docker image bundling qa-catalogue with all of its dependencies and the web interface qa-catalogue-web is made available:

To download, configure and start the image in a new container, the file docker-compose.yml is needed in the current directory. It can be configured with the following environment variables:

Environment variables can be set on the command line or put in a local file .env, e.g.:

WEBPORT=9000 docker compose up -d

or

docker compose --env-file config.env up -d
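
For illustration, a minimal `.env` file might contain nothing more than the web port; any further variables (such as a WEBCONFIG directory, mentioned below) are assumptions to adapt to your setup:

```bash
# hypothetical .env for docker compose
WEBPORT=9000
# WEBCONFIG=./web-config   # assumed: directory with custom templates for qa-catalogue-web
```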

When the application has been started this way, run analyses with the script ./docker/qa-catalogue the same way as the script ./qa-catalogue is called when not using Docker (see usage for details). The following example uses parameters for the Gent university library catalogue:

./docker/qa-catalogue \
  --params "--marcVersion GENT --alephseq" \
  --mask "rug01.export" \
  --catalogue gent \
  all

Now you can reach the web interface (qa-catalogue-web) at http://localhost:80/ (or at another port if configured with the environment variable WEBPORT). To further modify the appearance of the interface, create templates in your WEBCONFIG directory and/or create a file configuration.cnf in this directory to extend the UI configuration without having to restart the Docker container.

This example works under Linux. Windows users should consult the Docker on Windows wiki page. Other useful Docker commands at QA catalogue's wiki.

Everything else should work the same way as in other environments, so follow the next sections.

Use

catalogues/[abbreviation-of-your-library].sh all-analyses
catalogues/[abbreviation-of-your-library].sh all-solr

For a catalogue with around 1 million records the first command takes 5-10 minutes, the latter 1-2 hours.

build

Prerequisites: Java 11 (I use OpenJDK), and Maven 3

  1. Optional step: clone and build the parent library, metadata-qa-api project:
git clone https://github.com/pkiraly/metadata-qa-api.git
cd metadata-qa-api
mvn clean install
cd ..
  2. Mandatory step: clone and build the current metadata-qa-marc project
git clone https://github.com/pkiraly/metadata-qa-marc.git
cd metadata-qa-marc
mvn clean install

... or download

The released versions of the software are available from the Maven Central repository. The stable release (currently 0.6.0) is available from all Maven repos, while the developer version (*-SNAPSHOT) is available only from the [Sonatype Maven repository](https://oss.sonatype.org/content/repositories/snapshots/de/gwdg/metadataqa/metadata-qa-marc/0.5.0/). What you need to select is the file metadata-qa-marc-0.6.0-jar-with-dependencies.jar.

Be aware that no automation exists for creating a current developer version as a nightly build, so there is a chance that the latest features are not available in this version. If you want to use the latest version, build it yourself.

Since the jar file doesn't contain the helper scripts, you might also consider downloading them from this GitHub repository:

wget https://raw.githubusercontent.com/pkiraly/metadata-qa-marc/master/common-script
wget https://raw.githubusercontent.com/pkiraly/metadata-qa-marc/master/validator
wget https://raw.githubusercontent.com/pkiraly/metadata-qa-marc/master/formatter
wget https://raw.githubusercontent.com/pkiraly/metadata-qa-marc/master/tt-completeness

You should adjust common-script to point to the jar file you just downloaded.
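
A possible way to wire these pieces together (a sketch only; the exact variable to adjust inside common-script depends on the version you downloaded):

```bash
chmod +x common-script validator formatter tt-completeness
# edit common-script so that its jar path points to the downloaded
# metadata-qa-marc-*-jar-with-dependencies.jar, then run a helper, e.g.:
./validator --marcVersion MARC21 /path/to/records/*.mrc
```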

Usage

Helper scripts

The tool comes with some bash helper scripts that run all these analyses with default values. The generic scripts are located in the root directory, and library-specific configuration scripts are in the catalogues directory. You can find predefined scripts for several library catalogues (if you want to run one, you first have to configure it). These scripts mainly contain configuration and then call the central common-script, which contains the functions.

To run an analysis, use either a library-specific script

catalogues/[your script] [command(s)]

or, if you do not want to create one, the generic script

./qa-catalogue --params="[options]" [command(s)]

The following commands are supported:

You can find information about these functionalities below in this document.

configuration

  1. create the configuration file (setdir.sh)

    cp setdir.sh.template setdir.sh
  2. edit the configuration file. Two lines are important here:

BASE_INPUT_DIR=your/path
BASE_OUTPUT_DIR=your/path
  3. edit the library-specific file

Here is an example file for analysing Library of Congress' MARC records

#!/usr/bin/env bash

. ./setdir.sh

NAME=loc
MARC_DIR=${BASE_INPUT_DIR}/loc/marc
MASK=*.mrc

. ./common-script

Three variables are important here:

  1. NAME is a name for the output directory. The analysis results will land in the $BASE_OUTPUT_DIR/$NAME directory
  2. MARC_DIR is the location of MARC files. All the files should be in the same directory
  3. MASK is a file mask, such as *.mrc, *.marc or *.dat.gz. Files ending with .gz are uncompressed automatically.

You can add here any other parameters mentioned in this document at the description of the individual commands, wrapped in the TYPE_PARAMS variable. E.g. in the Deutsche Nationalbibliothek's config file one can find this:

TYPE_PARAMS="--marcVersion DNB --marcxml"

This line sets DNB's MARC version (to cover fields defined within DNB's MARC version) and XML as the input format.
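
Putting the pieces together, a library-specific configuration script might look like the following sketch (the dnb abbreviation, paths and mask are illustrative only):

```bash
#!/usr/bin/env bash
# catalogues/dnb.sh -- illustrative sketch; adjust NAME, paths and parameters

. ./setdir.sh

NAME=dnb
MARC_DIR=${BASE_INPUT_DIR}/dnb/marc
MASK=*.mrc.xml.gz
TYPE_PARAMS="--marcVersion DNB --marcxml"

. ./common-script
```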

The following table summarizes the configuration variables. The script qa-catalogue can be used to set variables and execute analysis without a library specific configuration file:

| variable | qa-catalogue | description | default |
|----------|--------------|-------------|---------|
| ANALYSES | -a/--analyses | which tasks to run with all-analyses | validate, validate_sqlite, completeness, completeness_sqlite, classifications, authorities, tt_completeness, shelf_ready_completeness, serial_score, functional_analysis, pareto, marc_history |
| | -c/--catalogue | display name of the catalogue | $NAME |
| NAME | -n/--name | name of the catalogue | qa-catalogue |
| BASE_INPUT_DIR | -d/--input | parent directory of input file directories | ./input |
| INPUT_DIR | -d/--input-dir | subdirectory of input directory to read files from | |
| BASE_OUTPUT_DIR | -o/--output | parent output directory | ./output |
| MASK | -m/--mask | a file mask which input files to process, e.g. *.mrc | * |
| TYPE_PARAMS | -p/--params | parameters to pass to individual tasks (see below) | |
| SCHEMA | -s/--schema | record schema | MARC21 |
| UPDATE | -u/--update | optional date of input files | |
| VERSION | -v/--version | optional version number/date of the catalogue to compare changes | |
| WEB_CONFIG | -w/--web-config | update the specified configuration file of qa-catalogue-web | |
| | -f/--env-file | configuration file to load environment variables from | .env |
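
As an illustration, the same settings could be passed directly to qa-catalogue on the command line (a sketch; the values are made up and should be adapted):

```bash
./qa-catalogue \
  --name dnb \
  --catalogue "Deutsche Nationalbibliothek" \
  --input ./input \
  --output ./output \
  --mask "*.mrc.xml.gz" \
  --schema MARC21 \
  --params "--marcVersion DNB --marcxml" \
  all-analyses
```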

Detailed instructions

We will use the same jar file in every command, so we save its path into a variable.

export JAR=target/metadata-qa-marc-0.7.0-jar-with-dependencies.jar

General parameters

Most of the analyses use the following general parameters.

The last argument of the commands is a list of files. It might contain any wildcard the operating system supports ('*', '?', etc.).

Validating MARC records

It validates each record against the MARC21 standard, including the locally defined fields selected by the MARC version parameter.

The issues are classified into the following categories: record, control field, data field, indicator, subfield and their subtypes.

There is some uncertainty in issue detection. Almost all library catalogues contain fields that are neither part of the MARC standard nor covered by their documentation of locally defined fields (these documents are rarely available publicly, and even when they are, they do not always cover all fields). So if the tool encounters an undefined field, it is impossible to decide whether the field is valid or invalid in that particular context. In some places the tool reflects this uncertainty by providing two calculations: one that treats these fields as errors, and another that treats them as valid fields.

The tool detects the following issues:

| machine name | explanation |
|--------------|-------------|
| **record level issues** | |
| undetectableType | the document type is not detectable |
| invalidLinkage | the linkage in field 880 is invalid |
| ambiguousLinkage | the linkage in field 880 is ambiguous |
| **control field position issues** | |
| obsoleteControlPosition | the code in the position is obsolete (it was valid in a previous version of MARC, but it is not valid now) |
| controlValueContainsInvalidCode | the code in the position is invalid |
| invalidValue | the position value is invalid |
| **data field issues** | |
| missingSubfield | missing reference subfield (880$6) |
| nonrepeatableField | repetition of a non-repeatable field |
| undefinedField | the field is not defined in the specified MARC version(s) |
| **indicator issues** | |
| obsoleteIndicator | the indicator value is obsolete (it was valid in a previous version of MARC, but not in the current version) |
| nonEmptyIndicator | an indicator that should be empty is non-empty |
| invalidValue | the indicator value is invalid |
| **subfield issues** | |
| undefinedSubfield | the subfield is undefined in the specified MARC version(s) |
| invalidLength | the length of the value is invalid |
| invalidReference | the reference to the classification vocabulary is invalid |
| patternMismatch | the content does not match the patterns specified by the standard |
| nonrepeatableSubfield | repetition of a non-repeatable subfield |
| invalidISBN | invalid ISBN value |
| invalidISSN | invalid ISSN value |
| unparsableContent | the value of the subfield is not well-formed according to its specification |
| nullCode | null subfield code |
| invalidValue | invalid subfield value |

Usage:

java -cp $JAR de.gwdg.metadataqa.marc.cli.Validator [options] <file>

or with a bash script

./validator [options] <file>

or

catalogues/<catalogue>.sh validate

or

./qa-catalogue --params="[options]" validate

options:

Outputs:

id,category,instances,records
2,control field,994241,313960
3,data field,12,12
4,indicator,5990,5041
5,subfield,571,555
id,categoryId,category,type,instances,records
5,2,control field,"invalid code",951,541
6,2,control field,"invalid value",993290,313733
8,3,data field,"repetition of non-repeatable field",12,12
10,4,indicator,"obsolete value",1,1
11,4,indicator,"non-empty indicator",33,32
12,4,indicator,"invalid value",5956,5018
13,5,subfield,"undefined subfield",48,48
14,5,subfield,"invalid length",2,2
15,5,subfield,"invalid classification reference",2,2
16,5,subfield,"content does not match any patterns",286,275
17,5,subfield,"repetition of non-repeatable subfield",123,120
18,5,subfield,"invalid ISBN",5,3
19,5,subfield,"invalid ISSN",105,105
id,MarcPath,categoryId,typeId,type,message,url,instances,records
53,008/33-34 (008map33),2,5,invalid code,'b' in 'b ',https://www.loc.gov/marc/bibliographic/bd008p.html,1,1
70,008/00-05 (008all00),2,5,invalid code,Invalid content: '2023  '. Text '2023  ' could not be parsed at index 4,https://www.loc.gov/marc/bibliographic/bd008a.html,1,1
28,008/22-23 (008map22),2,6,invalid value,| ,https://www.loc.gov/marc/bibliographic/bd008p.html,12,12
19,008/31 (008book31),2,6,invalid value, ,https://www.loc.gov/marc/bibliographic/bd008b.html,1,1
17,008/29 (008book29),2,6,invalid value, ,https://www.loc.gov/marc/bibliographic/bd008b.html,1,1
recordId,errors
99117335059205508,1:2;2:1;3:1
99117335059305508,1:1
99117335059405508,2:2
99117335059505508,3:1

1:2;2:1;3:1 means that 3 different types of issues occurred in the record: the first issue, which has issue ID 1, occurred twice; issue ID 2 occurred once; and issue ID 3 occurred once. The issue IDs can be resolved from the first column of the issue-summary.csv file.
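
If you need the packed errors column in a long format, a small script can expand it; the following sketch (a hypothetical one-liner reading a single line from standard input) illustrates the encoding:

```bash
echo "99117335059205508,1:2;2:1;3:1" | awk -F',' '{
  n = split($2, pairs, ";")                 # issue entries are separated by ";"
  for (i = 1; i <= n; i++) {
    split(pairs[i], kv, ":")                # each entry is issueId:count
    printf "record=%s issueId=%s instances=%s\n", $1, kv[1], kv[2]
  }
}'
```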

id,errorId,instances
99117335059205508,1,2
99117335059205508,2,1
99117335059205508,3,1
99117335059305508,1,1
99117335059405508,2,2
99117335059505508,3,1
type,instances,records
0,0,251
1,1711,848
2,413,275

where types are

errorId,recordIds
1,99117329355705508;99117328948305508;99117334968905508;99117335067705508;99117335176005508;...

An example of the parameters used for analysing a PICA dataset. When an input parameter is a complex expression, it is displayed here in a parsed format. The output also contains some metadata, such as the versions of the MQAF API and QA catalogue.

{
  "args":["/path/to/input.dat"],
  "marcVersion":"MARC21",
  "marcFormat":"PICA_NORMALIZED",
  "dataSource":"FILE",
  "limit":-1,
  "offset":-1,
  "id":null,
  "defaultRecordType":"BOOKS",
  "alephseq":false,
  "marcxml":false,
  "lineSeparated":false,
  "trimId":true,
  "outputDir":"/path/to/_output/k10plus_pica",
  "recordIgnorator":{
    "criteria":[],
    "booleanCriteria":null,
    "empty":true
  },
  "recordFilter":{
    "criteria":[],
    "booleanCriteria":{
      "op":"AND",
      "children":[
        {
          "op":null,
          "children":[],
          "value":{
            "path":{
              "path":"002@.0",
              "tag":"002@",
              "xtag":null,
              "occurrence":null,
              "subfields":{"type":"SINGLE","input":"0","codes":["0"]},
              "subfieldCodes":["0"]
            },
            "operator":"NOT_MATCH",
            "value":"^L"
          }
        },
        {"op":null,"children":[],"value":{"path":{"path":"002@.0","tag":"002@","xtag":null,"occurrence":null,"subfields":{"type":"SINGLE","input":"0","codes":["0"]},"subfieldCodes":["0"]},"operator":"NOT_MATCH","value":"^..[iktN]"}},
        {"op":"OR","children":[{"op":null,"children":[],"value":{"path":{"path":"002@.0","tag":"002@","xtag":null,"occurrence":null,"subfields":{"type":"SINGLE","input":"0","codes":["0"]},"subfieldCodes":["0"]},"operator":"NOT_MATCH","value":"^.v"}},{"op":null,"children":[],"value":{"path":{"path":"021A.a","tag":"021A","xtag":null,"occurrence":null,"subfields":{"type":"SINGLE","input":"a","codes":["a"]},"subfieldCodes":["a"]},"operator":"EXIST","value":null}}],"value":null}
      ],
      "value":null
    },
    "empty":false
  },
  "ignorableFields":{
    "fields":["001@","001E","001L","001U","001U","001X","001X","002V","003C","003G","003Z","008G","017N","020F","027D","031B","037I","039V","042@","046G","046T","101@","101E","101U","102D","201E","201U","202D"],
    "empty":false
  },
  "stream":null,
  "defaultEncoding":null,
  "alephseqLineType":null,
  "picaIdField":"003@$0",
  "picaSubfieldSeparator":"$",
  "picaSchemaFile":null,
  "picaRecordTypeField":"002@$0",
  "schemaType":"PICA",
  "groupBy":null,
  "detailsFileName":"issue-details.csv",
  "summaryFileName":"issue-summary.csv",
  "format":"COMMA_SEPARATED",
  "ignorableIssueTypes":["FIELD_UNDEFINED"],
  "pica":true,
  "replacementInControlFields":null,
  "marc21":false,
  "mqaf.version":"0.9.2",
  "qa-catalogue.version":"0.7.0-SNAPSHOT"
}
id,groupId
010000011,0
010000011,77
010000011,2035
010000011,70
010000011,20

Currently, validation detects the following errors:

Leader specific errors:

Control field specific errors:

Data field specific errors

Errors of specific fields:

An example:

Error in '   00000034 ': 
  110$ind1 has invalid code: '2'
Error in '   00000056 ': 
  110$ind1 has invalid code: '2'
Error in '   00000057 ': 
  082$ind1 has invalid code: ' '
Error in '   00000086 ': 
  110$ind1 has invalid code: '2'
Error in '   00000119 ': 
  700$ind1 has invalid code: '2'
Error in '   00000234 ': 
  082$ind1 has invalid code: ' '
Errors in '   00000294 ': 
  050$ind2 has invalid code: ' '
  260$ind1 has invalid code: '0'
  710$ind2 has invalid code: '0'
  710$ind2 has invalid code: '0'
  710$ind2 has invalid code: '0'
  740$ind2 has invalid code: '1'
Error in '   00000322 ': 
  110$ind1 has invalid code: '2'
Error in '   00000328 ': 
  082$ind1 has invalid code: ' '
Error in '   00000374 ': 
  082$ind1 has invalid code: ' '
Error in '   00000395 ': 
  082$ind1 has invalid code: ' '
Error in '   00000514 ': 
  082$ind1 has invalid code: ' '
Errors in '   00000547 ': 
  100$ind2 should be empty, it has '0'
  260$ind1 has invalid code: '0'
Errors in '   00000571 ': 
  050$ind2 has invalid code: ' '
  100$ind2 should be empty, it has '0'
  260$ind1 has invalid code: '0'
...

post processing validation result (validate-sqlite)

Usage:

catalogues/<catalogue>.sh validate-sqlite

or

./qa-catalogue --params="[options]" validate-sqlite

or

./common-script [options] validate-sqlite

[options] are the same as for validation

Catalogue for a single library

If the data is not grouped by libraries (there is no --groupBy <path> parameter), it creates the following SQLite3 database structure and imports some of the CSV files into it:

issue_summary table for the issue-summary.csv:

Each row represents a particular type of error.

id         INTEGER,  -- identifier of the error
MarcPath   TEXT,     -- the location of the error in the bibliographic record
categoryId INTEGER,  -- the identifier of the category of the error
typeId     INTEGER,  -- the identifier of the type of the error
type       TEXT,     -- the description of the type
message    TEXT,     -- extra contextual information 
url        TEXT,     -- the url of the definition of the data element
instances  INTEGER,  -- the number of instances this error occurred
records    INTEGER   -- the number of records this error occurred in

issue_details table for the issue-details.csv:

Each row represents how many instances of an error occur in a particular bibliographic record

id         TEXT,    -- the record identifier
errorId    INTEGER, -- the error identifier (-> issue_summary.id)
instances  INTEGER  -- the number of instances of an error in the record
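
These two tables can be joined on the error identifier. A sketch of such a query (the database file name is a hypothetical placeholder; the actual file lives in your output directory):

```bash
sqlite3 qa_catalogue.sqlite "
  SELECT s.MarcPath, s.type,
         SUM(d.instances)     AS total_instances,
         COUNT(DISTINCT d.id) AS affected_records
  FROM issue_details AS d
  JOIN issue_summary AS s ON s.id = d.errorId
  GROUP BY s.MarcPath, s.type
  ORDER BY total_instances DESC
  LIMIT 10;"
```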
Union catalogue for multiple libraries

If the dataset is a union catalogue, and the records contain a subfield for the libraries holding the item (there is a --groupBy <path> parameter), it creates the following SQLite3 database structure and imports some of the CSV files into it:

issue_summary table for the issue-summary.csv (it is similar to the other issue_summary table, but it has an extra groupId column)

groupId    INTEGER,
id         INTEGER,
MarcPath   TEXT,
categoryId INTEGER,
typeId     INTEGER,
type       TEXT,
message    TEXT,
url        TEXT,
instances  INTEGER,
records    INTEGER

issue_details table (same as the other issue_details table)

id         TEXT,
errorId    INTEGER,
instances  INTEGER

id_groupid table for id-groupid.csv:

id         TEXT,
groupId    INTEGER

issue_group_types table contains statistics for the error types per group.

groupId    INTEGER,
typeId     INTEGER,
records    INTEGER,
instances  INTEGER

issue_group_categories table contains statistics for the error categories per group.

groupId    INTEGER,
categoryId INTEGER,
records    INTEGER,
instances  INTEGER

issue_group_paths table contains statistics for the error types per path per group.

groupId    INTEGER,
typeId     INTEGER,
path       TEXT,
records    INTEGER,
instances  INTEGER

For union catalogues it also creates an extra Solr index with the suffix _validation. It contains one Solr document for each bibliographic record with three fields: the record identifier, the list of group identifiers and the list of error identifiers (if any). This Solr index is needed for populating the issue_group_types, issue_group_categories and issue_group_paths tables. This index will be ingested into the main Solr index.

Display one MARC record, or extract data elements from MARC records

java -cp $JAR de.gwdg.metadataqa.marc.cli.Formatter [options] <file>

or with a bash script

./formatter [options] <file>

options:

The output of displaying a single MARC record is something like this one:

LEADER 01697pam a2200433 c 4500
001 1023012219
003 DE-101
005 20160912065830.0
007 tu
008 120604s2012    gw ||||| |||| 00||||ger  
015   $a14,B04$z12,N24$2dnb
016 7 $2DE-101$a1023012219
020   $a9783860124352$cPp. : EUR 19.50 (DE), EUR 20.10 (AT)$9978-3-86012-435-2
024 3 $a9783860124352
035   $a(DE-599)DNB1023012219
035   $a(OCoLC)864553265
035   $a(OCoLC)864553328
040   $a1145$bger$cDE-101$d1140
041   $ager
044   $cXA-DE-SN
082 04$81\u$a622.0943216$qDE-101$222/ger
083 7 $a620$a660$qDE-101$222sdnb
084   $a620$a660$qDE-101$2sdnb
085   $81\u$b622
085   $81\u$z2$s43216
090   $ab
110 1 $0(DE-588)4665669-8$0http://d-nb.info/gnd/4665669-8$0(DE-101)963486896$aHalsbrücke$4aut
245 00$aHalsbrücke$bzur Geschichte von Gemeinde, Bergbau und Hütten$chrsg. von der Gemeinde Halsbrücke anlässlich des Jubliäums "400 Jahre Hüttenstandort Halsbrücke". [Hrsg.: Ulrich Thiel]
264  1$a[Freiberg]$b[Techn. Univ. Bergakad.]$c2012
300   $a151 S.$bIll., Kt.$c31 cm, 1000 g
653   $a(Produktform)Hardback
653   $aGemeinde Halsbrücke
653   $aHüttengeschichte
653   $aFreiberger Bergbau
653   $a(VLB-WN)1943: Hardcover, Softcover / Sachbücher/Geschichte/Regionalgeschichte, Ländergeschichte
700 1 $0(DE-588)1113208554$0http://d-nb.info/gnd/1113208554$0(DE-101)1113208554$aThiel, Ulrich$d1955-$4edt$eHrsg.
850   $aDE-101a$aDE-101b
856 42$mB:DE-101$qapplication/pdf$uhttp://d-nb.info/1023012219/04$3Inhaltsverzeichnis
925 r $arb

An example for extracting values:

./formatter --selector "008~7-10;008~0-5" \
            --defaultRecordType BOOKS \
            --separator "," \
            --outputDir ${OUTPUT_DIR} \
            --fileName marc-history.csv \
             ${MARC_DIR}/*.mrc

It will put the output into ${OUTPUT_DIR}/marc-history.csv.

Calculating data element completeness

Counts basic statistics about the data elements available in the catalogue.

Usage:

java -cp $JAR de.gwdg.metadataqa.marc.cli.Completeness [options] <file>

or with a bash script

./completeness [options] <file>

or

catalogues/<catalogue>.sh completeness

or

./qa-catalogue --params="[options]" completeness

options:

Output files:

| documenttype | path | packageid | package | tag | subfield | number-of-record | number-of-instances | min | max | mean | stddev | histogram |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| all | leader23 | 0 | Control Fields | Leader | Undefined | 1099 | 1099 | 1 | 1 | 1.0 | 0.0 | 1=1099 |
| all | leader22 | 0 | Control Fields | Leader | Length of the implementation-defined portion | 1099 | 1099 | 1 | 1 | 1.0 | 0.0 | 1=1099 |
| all | leader21 | 0 | Control Fields | Leader | Length of the starting-character-position portion | 1099 | 1099 | 1 | 1 | 1.0 | 0.0 | 1=1099 |
| all | 110$a | 2 | Main Entry | Main Entry - Corporate Name | Corporate name or jurisdiction name as entry element | 4 | 4 | 1 | 1 | 1.0 | 0.0 | 1=4 |
| all | 340$b | 5 | Physical Description | Physical Medium | Dimensions | 2 | 3 | 1 | 2 | 1.5 | 0.3535533905932738 | 1=1; 2=1 |
| all | 363$a | 5 | Physical Description | Normalized Date and Sequential Designation | First level of enumeration | 1 | 1 | 1 | 1 | 1.0 | 0.0 | 1=1 |
| all | 340$a | 5 | Physical Description | Physical Medium | Material base and configuration | 2 | 3 | 1 | 2 | 1.5 | 0.3535533905932738 | 1=1; 2=1 |

| documenttype | packageid | name | label | iscoretag | count |
|---|---|---|---|---|---|
| all | 1 | 01X-09X | Numbers and Code | true | 1099 |
| all | 2 | 1XX | Main Entry | true | 816 |
| all | 6 | 4XX | Series Statement | true | 358 |
| all | 5 | 3XX | Physical Description | true | 715 |
| all | 8 | 6XX | Subject Access | true | 514 |
| all | 4 | 25X-28X | Edition, Imprint | true | 1096 |
| all | 7 | 5XX | Note | true | 354 |
| all | 0 | 00X | Control Fields | true | 1099 |
| all | 99 | unknown | unknown origin | false | 778 |

| library | count |
|---|---|
| "00Mf" | 713 |
| "British Library" | 525 |
| "Inserted article about the fires from the Courant after the title page." | 1 |
| "National Library of Scotland" | 310 |
| "StEdNL" | 1 |
| "UkOxU" | 33 |

| library | count |
|---|---|
| "103861" | 1 |
| "BA-SaUP" | 143 |
| "BoCbLA" | 25 |
| "CStRLIN" | 110 |
| "DLC" | 3 |

An example of the parameters used for analysing a MARC dataset. When an input parameter is a complex expression, it is displayed here in a parsed format. The output also contains some metadata, such as the versions of the MQAF API and QA catalogue.

{
  "args":["/path/to/input.xml.gz"],
  "marcVersion":"MARC21",
  "marcFormat":"XML",
  "dataSource":"FILE",
  "limit":-1,
  "offset":-1,
  "id":null,
  "defaultRecordType":"BOOKS",
  "alephseq":false,
  "marcxml":true,
  "lineSeparated":false,
  "trimId":false,
  "outputDir":"/path/to/_output/",
  "recordIgnorator":{
    "conditions":null,
    "empty":true
  },
  "recordFilter":{
    "conditions":null,
    "empty":true
  },
  "ignorableFields":{
    "fields":null,
    "empty":true
  },
  "stream":null,
  "defaultEncoding":null,
  "alephseqLineType":null,
  "picaIdField":"003@$0",
  "picaSubfieldSeparator":"$",
  "picaSchemaFile":null,
  "picaRecordTypeField":"002@$0",
  "schemaType":"MARC21",
  "groupBy":null,
  "groupListFile":null,
  "format":"COMMA_SEPARATED",
  "advanced":false,
  "onlyPackages":false,
  "replacementInControlFields":"#",
  "marc21":true,
  "pica":false,
  "mqaf.version":"0.9.2",
  "qa-catalogue.version":"0.7.0"
}

For union catalogues the marc-elements.csv and packages.csv have a special version:

| groupId | documenttype | path | packageid | package | tag | subfield | number-of-record | number-of-instances | min | max | mean | stddev | histogram |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 350 | all | 044K$9 | 50 | PICA+ bibliographic description | "Schlagwortfolgen (GBV, SWB, K10plus)" | PPN | 1 | 1 | 1 | 1 | 1.0 | 0.0 | 1=1 |
| 350 | all | 044K$7 | 50 | PICA+ bibliographic description | "Schlagwortfolgen (GBV, SWB, K10plus)" | Vorläufiger Link | 1 | 1 | 1 | 1 | 1.0 | 0.0 | 1=1 |

| group | documenttype | packageid | name | label | iscoretag | count |
|---|---|---|---|---|---|---|
| 0 | Druckschriften (einschließlich Bildbänden) | 50 | 0... | PICA+ bibliographic description | false | 987 |
| 0 | Druckschriften (einschließlich Bildbänden) | 99 | unknown | unknown origin | false | 3 |
| 0 | Medienkombination | 50 | 0... | PICA+ bibliographic description | false | 1 |
| 0 | Mikroform | 50 | 0... | PICA+ bibliographic description | false | 11 |
| 0 | Tonträger, Videodatenträger, Bildliche Darstellungen | 50 | 0... | PICA+ bibliographic description | false | 1 |
| 0 | all | 50 | 0... | PICA+ bibliographic description | false | 1000 |
| 0 | all | 99 | unknown | unknown origin | false | 3 |
| 100 | Druckschriften (einschließlich Bildbänden) | 50 | 0... | PICA+ bibliographic description | false | 20 |
| 100 | Medienkombination | 50 | 0... | PICA+ bibliographic description | false | 1 |

| id | group | count |
|---|---|---|
| 0 | all | 1000 |
| 100 | Otto-von-Guericke-Universität, Universitätsbibliothek Magdeburg [DE-Ma9] | 21 |
| 1003 | Kreisarchäologie Rotenburg [DE-MUS-125322...] | 1 |
| 101 | Otto-von-Guericke-Universität, Universitätsbibliothek, Medizinische Zentralbibliothek (MZB), Magdeburg [DE-Ma14...] | 6 |
| 1012 | Mariengymnasium Jever [DE-Je1] | 19 |

post processing completeness result (completeness-sqlite)

The completeness-sqlite step (which is launched by the completeness step, but can also be launched independently) imports the marc-elements.csv or completeness-grouped-marc-elements.csv file into the marc_elements table. For catalogues without the --groupBy parameter the groupId column is filled with 0.

groupId             INTEGER,
documenttype        TEXT,
path                TEXT,
packageid           INTEGER,
package             TEXT,
tag                 TEXT,
subfield            TEXT,
number-of-record    INTEGER,
number-of-instances INTEGER,
min                 INTEGER,
max                 INTEGER,
mean                REAL,
stddev              REAL,
histogram           TEXT
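
Once imported, this table can be queried directly; for example, the most widely used data elements could be listed like this (a sketch, assuming a hypothetical database file name; note that column names containing '-' must be quoted in SQL):

```bash
sqlite3 qa_catalogue.sqlite "
  SELECT path, \"number-of-record\" AS records, \"number-of-instances\" AS instances
  FROM marc_elements
  WHERE groupId = 0 AND documenttype = 'all'
  ORDER BY records DESC
  LIMIT 10;"
```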

Calculating Thompson-Traill completeness

Kelly Thompson and Stacie Traill published an approach to calculate the quality of ebook records coming from different data sources. This is an implementation of the scoring algorithm described in their article: Leveraging Python to improve ebook metadata selection, ingest, and management. In: Code4Lib Journal, Issue 38, 2017-10-18. http://journal.code4lib.org/articles/12828

java -cp $JAR de.gwdg.metadataqa.marc.cli.ThompsonTraillCompleteness [options] <file>

or with a bash script

./tt-completeness [options] <file>

or

catalogues/[catalogue].sh tt-completeness

or

./qa-catalogue --params="[options]" tt-completeness

options:

It produces a CSV file like this:

id,ISBN,Authors,Alternative Titles,Edition,Contributors,Series,TOC,Date 008,Date 26X,LC/NLM, \
LoC,Mesh,Fast,GND,Other,Online,Language of Resource,Country of Publication,noLanguageOrEnglish, \
RDA,total
"010002197",0,0,0,0,0,0,0,1,2,0,0,0,0,0,0,0,1,0,0,0,4
"01000288X",0,0,1,0,0,1,0,1,2,0,0,0,0,0,0,0,0,0,0,0,5
"010004483",0,0,1,0,0,0,0,1,2,0,0,0,0,0,0,0,1,0,0,0,5
"010018883",0,0,0,0,1,0,0,1,2,0,0,0,0,0,0,0,1,1,0,0,6
"010023623",0,0,3,0,0,0,0,1,2,0,0,0,0,0,0,0,1,0,0,0,7
"010027734",0,0,3,0,1,2,0,1,2,0,0,0,0,0,0,0,1,0,0,0,10

Shelf-ready completeness analysis

This analysis is the implementation of the following paper:

Emma Booth (2020) Quality of Shelf-Ready Metadata. Analysis of survey responses and recommendations for suppliers Pontefract (UK): National Acquisitions Group, 2020. p 31. https://nag.org.uk/wp-content/uploads/2020/06/NAG-Quality-of-Shelf-Ready-Metadata-Survey-Analysis-and-Recommendations_FINAL_June2020.pdf

The main purpose of the report is to highlight which fields of printed and electronic book records are important when the records come from different suppliers. 50 libraries participated in the survey, and each selected the fields that are important to them. The report lists the fields that got the highest scores.

The current calculation is based on this list of essential fields. If all the specified data elements are available in the record, it gets the full score; if only some of them are, it gets a proportional score. E.g. under 250 (edition statement) there are two subfields. If both are available, the record gets a score of 44; if only one of them is, it gets half of that, 22; and if none, it gets 0. For 1XX, 6XX, 7XX and 8XX the record gets the full score if at least one of those fields (with subfield $a) is available. The total score is the average of these scores. The theoretical maximum is 28.44, which could be reached if all the data elements were available in the record.

java -cp $JAR de.gwdg.metadataqa.marc.cli.ShelfReadyCompleteness [options] <file>

with a bash script

./shelf-ready-completeness [options] <file>

or

catalogues/[catalogue].sh shelf-ready-completeness

or

./qa-catalogue --params="[options]" shelf-ready-completeness

options:

Serial score analysis

These scores are calculated for each continuing resource (type of record (LDR/6) is language material ('a') and bibliographic level (LDR/7) is serial component part ('b'), integrating resource ('i') or serial ('s')).

The calculation is based on a slightly modified version of the method published by Jamie Carlstone in the following paper:

Jamie Carlstone (2017) Scoring the Quality of E-Serials MARC Records Using Java, Serials Review, 43:3-4, pp. 271-277, DOI: 10.1080/00987913.2017.1350525 URL: https://www.tandfonline.com/doi/full/10.1080/00987913.2017.1350525

java -cp $JAR de.gwdg.metadataqa.marc.cli.SerialScore [options] <file>

with a bash script

./serial-score [options] <file>

or

catalogues/[catalogue].sh serial-score

or

./qa-catalogue --params="[options]" serial-score

options:

FRBR functional requirement analysis

The main part of the Functional Requirements for Bibliographic Records (FRBR) document defines the primary and secondary entities that became known as the FRBR model. Years later Tom Delsey created a mapping between 12 functions and the individual MARC elements.

Tom Delsey (2002) Functional analysis of the MARC 21 bibliographic and holdings formats. Tech. report. Library of Congress, 2002. Prepared for the Network Development and MARC Standards Office Library of Congress. Second Revision: September 17, 2003. https://www.loc.gov/marc/marc-functional-analysis/original_source/analysis.pdf.

This analysis shows how these functions are supported by the records. Low support means that only a small portion of the fields supporting a function are available in the records; strong support, on the contrary, means that lots of such fields are available. The analysis calculates the support of the 12 functions for each record and returns summary statistics.

It is an experimental feature: it turned out that the mapping covers about 2000 elements (fields, subfields, indicators, etc.), while an average record contains at most several hundred elements, so even the best records contain only about 10-15% of the elements supporting a given function. Therefore the tool does not show exact numbers, and the scale is not 0-100 but 0-[best score], which is different for every catalogue.

The 12 functions: Discovery functions

Usage functions

Management functions

java -cp $JAR de.gwdg.metadataqa.marc.cli.FunctionalAnalysis [options] <file>

with a bash script

./functional-analysis [options] <file>

or

catalogues/<catalogue>.sh functional-analysis

or

./qa-catalogue --params="[options]" functional-analysis

options:

Output files:

Classification analysis

It analyses the coverage of subject indexing/classification in the catalogue. It checks the specific fields that might contain subject indexing information, and provides details about how, and which, subject indexing schemes have been applied.

java -cp $JAR de.gwdg.metadataqa.marc.cli.ClassificationAnalysis [options] <file>
Rscript scripts/classifications/classifications-type.R <output directory>

with a bash script

./classifications [options] <file>
Rscript scripts/classifications/classifications-type.R <output directory>

or

catalogues/[catalogue].sh classifications

or

./qa-catalogue --params="[options]" classifications

options:

The output is a set of files:

Authority name analysis

It analyses the coverage of authority names (persons, organisations, events, uniform titles) in the catalogue. It checks the specific fields that might contain authority names, and provides details about how, and which, schemes have been applied.

java -cp $JAR de.gwdg.metadataqa.marc.cli.AuthorityAnalysis [options] <file>

with a bash script

./authorities [options] <file>

or

catalogues/<catalogue>.sh authorities

or

./qa-catalogue --params="[options]" authorities

options:

The output is a set of files:

Field frequency distribution

This analysis reveals the relative importance of fields. The Pareto distribution is a kind of power-law distribution, and the 80-20 Pareto rule states that 80% of outcomes are due to 20% of causes. In a catalogue the outcome is the total number of occurrences of data elements, and the causes are the individual data elements. Some data elements occur much more frequently than others. This analysis highlights the distribution of the data elements: whether or not it is similar to a Pareto distribution.

It produces charts for each document type and one for the whole catalogue showing the field frequency patterns. Each chart shows a line which is a function of field frequency: on the X-axis the subfields are ordered by frequency (how many times a given subfield occurs in the whole catalogue), from the most frequent top 1% to the least frequent 1%. The Y-axis represents the cumulative share of occurrences (from 0% to 100%).

Before running it you should first run the completeness calculation.
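
For example, using the loc configuration from above, the two steps run in sequence:

```bash
catalogues/loc.sh completeness
catalogues/loc.sh pareto
```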

With a bash script

catalogues/[catalogue].sh pareto

or

./qa-catalogue --params="[options]" pareto

options:

Generating cataloguing history chart

This analysis is based on Benjamin Schmidt's blog post A brief visual history of MARC cataloging at the Library of Congress. (Tuesday, May 16, 2017).

It produces a chart where the Y-axis is based on the "date entered on file" data element that indicates the date the MARC record was created (008/00-05), the X-axis is based on "Date 1" element (008/07-10).

Usage:

catalogues/[catalogue].sh marc-history

or

./qa-catalogue --params="[options]" marc-history

options:

Import tables to SQLite

This is just a helper function which imports the results of validation into an SQLite3 database.

The prerequisite of this step is to run validation first, since it uses the files produced there. If you run validation with catalogues/<catalogue>.sh or ./qa-catalogue scripts, this importing step is already covered there.

Usage:

catalogues/[catalogue].sh sqlite

or

./qa-catalogue --params="[options]" sqlite

options:

Output:

Indexing bibliographic records with Solr

Run indexer:

java -cp $JAR de.gwdg.metadataqa.marc.cli.MarcToSolr [options] [file]

With script:

catalogues/[catalogue].sh all-solr

or

./qa-catalogue --params="[options]" all-solr

options:

The ./index file (which is used by catalogues/[catalogue].sh and ./qa-catalogue scripts) has additional parameters:

Solr field names

QA catalogue builds a Solr index which contains a) a set of fixed Solr fields that are the same for all bibliographic input, and b) Solr fields that depend on the field names of the metadata schema (MARC, PICA, UNIMARC etc.) - these fields should be mapped from metadata schema to dynamic Solr fields by an algorithm.

Fixed fields
Mapped fields

The mapped fields are Solr fields that depend on the field names of the metadata schema. The final Solr field name is assembled from a field prefix, a mapped value, and a field suffix:

Field prefix: With `--fieldPrefix` parameter you can set a prefix that is applied to the variable fields. This might be needed because Solr has a limitation: field names start with a number can not be used in some Solr parameter, such as `fl` (field list selected to be retrieved from the index). Unfortunately bibliographic schemas use field names start with numbers. You can change a mapping parameter that produces a mapped value that resembles the BIBFRAME mapping of the MARC21 field, but not all field has such a human readable association. Field suffixes: * `*_sni`: not indexed, stored string fields -- good for storing fields used for displaying information * `*_ss`: not parsed, stored, indexed string fields -- good for display and facets * `*_tt`: parsed, not stored, indexed string fields -- good for term searches (these fields will be availabe if `--indexWithTokenizedField` parameter is applied) * `*_is`: parsed, not stored, indexed integer fields -- good for searching for numbers, such as error or group identifiers (these fields will be availabe if `--indexFieldCounts` parameter is applied) The mapped value With `--solrFieldType` you can select the algorithm that generates the mapped value. Right now there are three formats: * `marc-tags` - the field names are MARC codes (`245$a` → `245a`) * `human-readable` - the field names are [Self Descriptive MARC code](http://pkiraly.github.io/2017/09/24/mapping/) (`245$a` → `Title_mainTitle`) * `mixed` - the field names are mixed of the above (e.g. `245a_Title_mainTitle`) ###### "marc-tags" format ``` "100a_ss":["Jung-Baek, Myong Ja"], "100ind1_ss":["Surname"], "245c_ss":["Vorgelegt von Myong Ja Jung-Baek."], "245ind2_ss":["No nonfiling characters"], "245a_ss":["S. Tret'jakov und China /"], "245ind1_ss":["Added entry"], "260c_ss":["1987."], "260b_ss":["Georg-August-Universität Göttingen,"], "260a_ss":["Göttingen :"], "260ind1_ss":["Not applicable/No information provided/Earliest available publisher"], "300a_ss":["141 p."], ``` ###### "human-readable" format ``` "MainPersonalName_type_ss":["Surname"], "MainPersonalName_personalName_ss":["Jung-Baek, Myong Ja"], "Title_responsibilityStatement_ss":["Vorgelegt von Myong Ja Jung-Baek."], "Title_mainTitle_ss":["S. Tret'jakov und China /"], "Title_titleAddedEntry_ss":["Added entry"], "Title_nonfilingCharacters_ss":["No nonfiling characters"], "Publication_sequenceOfPublishingStatements_ss":["Not applicable/No information provided/Earliest available publisher"], "Publication_agent_ss":["Georg-August-Universität Göttingen,"], "Publication_place_ss":["Göttingen :"], "Publication_date_ss":["1987."], "PhysicalDescription_extent_ss":["141 p."], ``` ###### "mixed" format ``` "100a_MainPersonalName_personalName_ss":["Jung-Baek, Myong Ja"], "100ind1_MainPersonalName_type_ss":["Surname"], "245a_Title_mainTitle_ss":["S. 
Tret'jakov und China /"], "245ind1_Title_titleAddedEntry_ss":["Added entry"], "245ind2_Title_nonfilingCharacters_ss":["No nonfiling characters"], "245c_Title_responsibilityStatement_ss":["Vorgelegt von Myong Ja Jung-Baek."], "260b_Publication_agent_ss":["Georg-August-Universität Göttingen,"], "260a_Publication_place_ss":["Göttingen :"], "260ind1_Publication_sequenceOfPublishingStatements_ss":["Not applicable/No information provided/Earliest available publisher"], "260c_Publication_date_ss":["1987."], "300a_PhysicalDescription_extent_ss":["141 p."], ``` A distinct project [metadata-qa-marc-web](https://github.com/pkiraly/qa-catalogue-web), provides a web application that utilizes to build this type of Solr index in number of ways (a facetted search interface, term lists, search for validation errors etc.) #### Index preparation The tool uses different Solr indices (aka cores) to store information. In the following example we use `loc` as the name of our catalogue. There are two main indices: `loc` and `loc_dev`. `loc_dev` is the target of the index process, it will create it from scratch. During the proess `loc` is available and searchable. When the indexing has been successfully finished these two indices will be swaped, so the previous `loc` will become `loc_dev`, and the new index will be `loc`. The web user interface will always use the latest version (not the dev). Besides these two indices there is a third index that contains different kind of results of the analyses. At the time of writing it contains only the results of validation, but later it will cover other information as well. It can be set by the following parameter: * `-4`, `--solrForScoresUrl `: the URL of the Solr server used to store scores (it is populated in the `validate-sqlite` process which runs after validation) During the indexing process the content of this index is meged into the `_dev` index, so after a successfull end of the process this index is not needed anymore. In order to make the automation easier and still flexible there are some an auxilary commands: * `./qa-catalogue prepare-solr`: created these two indices, makes sure that their schemas contain the necessary fields * `./qa-catalogue index`: runs the indexing process * `./qa-catalogue postprocess-solr`: swap the two Solr cores ( and _dev) * `./qa-catalogue all-solr`: runs all the three steps If you would like to maintain the Solr index yourself (e.g. because the Solr instance wuns in a cloud environment), you should skip `prepare-solr` and `postprocess-solr`, and run only `index`. For maintaining the schema you can find a minimal viable schema among the [test resources](https://github.com/pkiraly/qa-catalogue/blob/main/src/test/resources/solr-test/configset/defaultConfigSet/conf/schema.xml) You can set autocommit the following way in `solrconfig.xml` (inside Solr): ```XML ${solr.autoCommit.maxTime:15000} 5000 true ... ${solr.autoSoftCommit.maxTime:-1} ``` It needs if you choose to disable QA catalogue to issue commit messages (see `--commitAt` parameter), which makes indexing faster. 
In schema.xml (or in Solr web interface) you should be sure that you have the following dynamic fields: ```XML ``` or use Solr API: ```bash NAME=dnb SOLR=http://localhost:8983/solr/$NAME/schema // add copy field curl -X POST -H 'Content-type:application/json' --data-binary '{ "add-dynamic-field":{ "name":"*_sni", "type":"string", "indexed":false, "stored":true} }' $SOLR curl -X POST -H 'Content-type:application/json' --data-binary '{ "add-copy-field":{ "source":"*_ss", "dest":["_text_"]} }' $SOLR ... ``` See the [solr-functions](https://github.com/pkiraly/qa-catalogue/blob/main/solr-functions) file for full code. QA catalogue has a helper scipt to get information about the status of Solr index (Solr URL, location, the list of cores, number of documents, size in the disk, and last modification): ```bash $ ./index --status Solr index status at http://localhost:8983 Solr directory: /opt/solr-9.3.0/server/solr core | location | nr of docs | size | last modified .................... | ............... | .......... | .......... | ................... nls | nls_1 | 403946 | 1002.22 MB | 2023-11-25 21:59:39 nls_dev | nls_2 | 403943 | 987.22 MB | 2023-11-11 15:59:49 nls_validation | nls_validation | 403946 | 17.89 MB | 2023-11-25 21:35:44 yale | yale_2 | 2346976 | 9.51 GB | 2023-11-11 13:12:35 yale_dev | yale_1 | 2346976 | 9.27 GB | 2023-11-11 10:58:08 ``` ### Indexing MARC JSON records with Solr ```bash java -cp $JAR de.gwdg.metadataqa.marc.cli.utils.MarcJsonToSolr ``` The MARC JSON file is a JSON serialization of binary MARC file. See more the [MARC Pipeline](https://github.com/pkiraly/marc-pipeline/) project. ## Export mapping table ### to Avram JSON Some background info: [MARC21 structure in JSON](http://pkiraly.github.io/2018/01/28/marc21-in-json/). Usage: ```bash java -cp $JAR de.gwdg.metadataqa.marc.cli.utils.MappingToJson [options] > avram-schema.json ``` or ```bash ./qa-catalogue --params="[options]" export-schema-files ``` options: * [general parameters](#general-parameters) * `-c`, `--withSubfieldCodelists`: with subfield codelists * `-s`, `--withSelfDescriptiveCode`: with self-descriptive codes * `-t `, `--solrFieldType `: type of Solr fields, could be one of `marc-tags`, `human-readable`, or `mixed` * `-f`, `--withFrbrFunctions`: with FRBR functions (see Tom Delsey: [Functional analysis of the MARC 21 bibliographic and holdings formats.](https://www.loc.gov/marc/marc-functional-analysis/original_source/analysis.pdf) Tech. report, 2nd revision. Library of Congress, 2003.) * `-l`, `--withComplianceLevel`: with compliance levels (national, minimal) (see [National Level Full and Minimal Requirements.](https://www.loc.gov/marc/bibliographic/nlr/nlr.html) Library of Congress, 1999.) An example output: ```json ... "010":{ "tag":"010", "label":"Library of Congress Control Number", "url":"https:\/\/www.loc.gov\/marc\/bibliographic\/bd010.html", "repeatable":false, "compilance-level":{ "national":"Mandatory if applicable", "minimal":"Mandatory if applicable" }, "indicator1":null, "indicator2":null, "subfields":{ "a":{ "label":"LC control number", "repeatable":false, "frbr-functions":[ "Data Management\/Identify", "Data Management\/Process" ], "compilance-level":{ "national":"Mandatory if applicable", "minimal":"Mandatory if applicable" } }, ... } }, "013":{ "tag":"013", "label":"Patent Control Information", "url":"https:\/\/www.loc.gov\/marc\/bibliographic\/bd013.html", "repeatable":true, "compilance-level":{"national":"Optional"}, "indicator1":null, "indicator2":null, "subfields":{ ... 
"b":{ "label":"Country", "repeatable":false, "codelist":{ "name":"MARC Code List for Countries", "url":"http:\/\/www.loc.gov\/marc\/countries\/countries_code.html", "codes":{ "aa":{"label":"Albania"}, "abc":{"label":"Alberta"}, "-ac":{"label":"Ashmore and Cartier Islands"}, "aca":{"label":"Australian Capital Territory"}, ... }, ... }, }, ... } }, ... ``` The script version generates 3 files, with different details: * `avram-schemas/marc-schema.json` * `avram-schemas/marc-schema-with-solr.json` * `avram-schemas/marc-schema-with-solr-and-extensions.json` To validate these files install the Avram reference implementation in Node with `npm ci` and run: ./avram-schemas/validate-schemas ### to HTML To export the HTML table described at [Self Descriptive MARC code](http://pkiraly.github.io/2017/09/24/mapping/) ```bash java -cp $JAR de.gwdg.metadataqa.marc.cli.utils.MappingToHtml > mapping.html ``` ### Shacl4Bib since v0.7.0. Note: This is an experimental feature. The Shapes Constraint Language (SHACL) is a formal language for validating Resource Description Framework (RDF) graphs against a set of conditions (expressed also in RDF). Following this idea and implementing a subset of the language, the Metadata Quality Assessment Framework provides a mechanism to define SHACL-like rules for data sources in non-RDF based formats, such as XML, CSV and JSON (SHACL validates only RDF graphs). Shacl4Bib is the extension enabling the validation of bibliographic records. The rules can be defined either with YAML or JSON configuration files or with Java code. SCHACL uses RDF notation to specify or "address" the data element about which the constraints are set. Shacl4Bib supports Carsten Klee's MARCspec for MARC records, and PICApath for PICA. You can find more information and full definition of the implemented subset of SHACL here: https://github.com/pkiraly/metadata-qa-api#defining-schema-with-a-configuration-file Parameters: * `-C `, `--shaclConfigurationFile `: specify the SHACL like configuration file * `-O `, `--shaclOutputFile `: output file (default: `shacl4bib.csv`) * `-P `, `--shaclOutputType `: specify what the output files should contain. Possible values: * `STATUS`: status only, where the following values appear: * `1` the criteria met, * `0` the criteria have not met, * `NA`: the data element is not available in the record), * `SCORE`: score only. Its value is calculated the following way: * if the criteria met it returns the value of `successScore` property (or 0 if no such property) * if the criteria have not met, it returns the value of `failureScore` property (or 0 if no such property) * `BOTH`: both status and score Here is a simple example for setting up rules against a MARC subfield: ```yaml format: MARC fields: - name: 040$a path: 040$a rules: - id: 040$a.minCount minCount: 1 - id: 040$a.pattern pattern: ^BE-KBR00 ``` * `format` represents the format of the input data. It can be either `MARC` or `PICA` * `fields`: the list of fields we would like to investigate. Since it is a YAMPL example, the `-` and indentation denotes child elements. Here there is only one child, so we analyse here a single subfield. * `name` is how the data element is called within the rule set. It could be a machine or a human readable string. * `path` is the "address" of the metadata element. It should be expressed in an addressing language such as MARCSpec or PICAPath (040$a contains the original cataloging agency) * `rules`: the parent element of the set of rules. Here we have two rules. 
* `id`: the identifier of the rule. This will be the header of the column in CSV, and it could be references elsewhere in the SHACL configuration file. * `mintCount`: this specify the minimum number of instances of the data element in the record * `pattern`: a regular expression which should match the values of all instances of the data element The output contains an extra column, the record identifier, so it looks like something like this: ```csv id,040$a.minCount,040$a.pattern 17529680,1,1 18212975,1,1 18216050,1,1 18184955,1,1 18184431,1,1 9550740,NA,NA 19551181,NA,NA 118592844,1,1 18592704,1,1 18592557,1,1 ``` ## Extending the functionalities The project is available from Maven Central, the central repository of open source Java projects as jar files. If you want to use it in your Java or Scala application, put this code snippet into the list of dependencies: `pom.xml` ```XML de.gwdg.metadataqa metadata-qa-marc 0.6.0 ``` `build.sbt` ```bash libraryDependencies += "de.gwdg.metadataqa" % "metadata-qa-marc" % "0.6.0" ``` or you can directly download the jars from [http://repo1.maven.org](http://repo1.maven.org/maven2/de/gwdg/metadataqa/metadata-qa-marc/0.6.0/) ## User interface There is a web application for displaying and navigation through the output of the tool (written in PHP): https://github.com/pkiraly/metadata-qa-marc-web/ ## Appendices ### Appendix I: Where can I get MARC records? Here is a list of data sources I am aware of so far: #### United States of America * Library of Congress — https://www.loc.gov/cds/products/marcDist.php. MARC21 (UTF-8 and MARC8 encoding), MARCXML formats, open access. Alternative access point: https://www.loc.gov/collections/selected-datasets/?fa=contributor:library+of+congress.+cataloging+distribution+service. * Harvard University Library — https://library.harvard.edu/open-metadata. MARC21 format, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). Institution specific features are documented [here](http://library.harvard.edu/sites/default/files/news_uploaded/Harvard_Library_Bibliographic_Dataset_Documentation.pdf) * Columbia University Library — https://library.columbia.edu/bts/clio-data.html. 10M records, MARC21 and MARCXML format, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). * University of Michigan Library — https://www.lib.umich.edu/open-access-bibliographic-records. 1,3M records, MARC21 and MARCXML formats, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). * University of Pennsylvania Libraries — https://www.library.upenn.edu/collections/digital-projects/open-data-penn-libraries. Two datasets are available: 1. [Catalog records created by Penn Libraries](http://ered.library.upenn.edu/data/opendata/pau.zip) 572K records, MARCXML format, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/), 2. [Catalog records derived from other sources](http://ered.library.upenn.edu/data/opendata/npau.zip), 6.5M records, MARCXML format, Open Data Commons [ODC-BY](https://opendatacommons.org/licenses/by/), use in accordance with the OCLC [community norms](https://www.oclc.org/worldcat/community/record-use/policy/community-norms.en.html). * Yale University — https://guides.library.yale.edu/c.php?g=923429. Three datasets are available: 1. Yale-originated records: 1.47M records, MARC21 format, [CC0](https://creativecommons.org/publicdomain/zero/1.0/) 2. WorldCat-derived records: 6.16M records, MARC21 format, [ODC-BY](https://www.opendatacommons.org/licenses/by/1.0/) 3. 
Other records (MARC21), independent of Yale and WorldCat, where sharing is permitted. 404K records, MARC21 format. * National Library of Medicine (NLM) catalogue records — https://www.nlm.nih.gov/databases/download/catalog.html. 4.2 million records, NLMXML, MARCXML and MARC21 formats. [NLM Terms and Conditions](https://www.nlm.nih.gov/databases/download/terms_and_conditions.html) #### Germany * Deutsche Nationalbibliothek — https://www.dnb.de/DE/Professionell/Metadatendienste/Datenbezug/Gesamtabzuege/gesamtabzuege_node.html (note: the records are provided in utf-8 decomposed). 23.9M records, MARC21 and MARCXML format, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). * Bibliotheksverbundes Bayern — https://www.bib-bvb.de/web/b3kat/open-data. 27M records, MARCXML format, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). * Leibniz-Informationszentrum Technik und Naturwissenschaften Universitätsbibliothek (TIB) — https://www.tib.eu/de/forschung-entwicklung/entwicklung/open-data. (no download link, use OAI-PMH instead) Dublin Core, MARC21, MARCXML, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). * K10plus-Verbunddatenbank (K10plus union catalogue of Bibliotheksservice-Zentrum Baden Würtemberg (BSZ) and Gemensamer Bibliotheksverbund (GBV)) — https://swblod.bsz-bw.de/od/. 87M records, MARCXML format, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). #### Others * Universiteitsbibliotheek Gent — https://lib.ugent.be/info/exports. Weekly data dump in Aleph Sequential format. It contains some Aleph fields above the standard MARC21 fields. [ODC ODbL](https://opendatacommons.org/licenses/odbl/). * Toronto Public Library — https://opendata.tplcs.ca/. 2.5 million MARC21 records, [Open Data Policy](http://www.torontopubliclibrary.ca/terms-of-use/library-policies/open-data.jsp) * Répertoire International des Sources Musicales — https://opac.rism.info/index.php?id=8&id=8&L=1. 800K records, MARCXML, RDF/XML, [CC-BY](https://creativecommons.org/licenses/by/4.0/). * ETH-Bibliothek (Swiss Federal Institute of Technology in Zurich) — http://www.library.ethz.ch/ms/Open-Data-an-der-ETH-Bibliothek/Downloads. 2.5M records, MARCXML format. * British library — http://www.bl.uk/bibliographic/datafree.html#m21z3950 (no download link, use z39.50 instead after asking for permission). MARC21, usage will be strictly for non-commercial purposes. * Talis — https://archive.org/details/talis_openlibrary_contribution. 5.5 million MARC21 records contributed by Talis to Open Library under the [ODC PDDL](https://opendatacommons.org/licenses/pddl/). * Oxford Medicine Online (the catalogue of medicine books published by Oxford University Press) — https://oxfordmedicine.com/page/67/. 1790 MARC21 records. * Fennica — the Finnish National Bibliography provided by the Finnish National Library — http://data.nationallibrary.fi/download/. 1 million records, MARCXML, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/). * Biblioteka Narodawa (Polish National Library) — https://data.bn.org.pl/databases. 6.5 million MARC21 records. 
* Magyar Nemzeti Múzeum (Hungarian National Museum) — https://mnm.hu/hu/kozponti-konyvtar/nyilt-bibliografiai-adatok. 67K records, MARC21, HUNMARC and BIBFRAME formats, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/).
* University of Amsterdam Library — https://uba.uva.nl/en/support/open-data/data-sets-and-publication-channels/data-sets-and-publication-channels.html. 2.7 million records, MARCXML format, [PDDL](https://opendatacommons.org/licenses/pddl/)/[ODC-BY](https://opendatacommons.org/licenses/by/). Note: the records for books are not downloadable, only those of other document types; book records have to be requested via the website.
* Portugal National Library — https://opendata.bnportugal.gov.pt/downloads.htm. 1.13 million UNIMARC records in MARCXML, RDF/XML, JSON, Turtle and CSV formats, [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/).
* National Library of Latvia, national bibliography (2017–2020) — https://dati.lnb.lv/. 11K MARCXML records.

Thanks to [Johann Rolschewski](https://github.com/jorol/), [Phú](https://twitter.com/herr_tu), and [Hugh Paterson III](https://twitter.com/thejourneyler) for their help in collecting this list! Do you know of more data sources? Please let me know.

Two more data sources are worth mentioning, although they do not provide MARC records, only derivatives:

* [Linked Open British National Bibliography](https://data.bl.uk/lodbnb/) — 3.2M book records in N-Triples and RDF/XML formats, CC0 license
* [Linked data of Bibliothèque nationale de France](http://data.bnf.fr/semanticweb) — N3, NT and RDF/XML formats, [Licence Ouverte/Open Licence](http://data.bnf.fr/docs/Licence-Ouverte-Open-Licence-ENG.pdf)

### Appendix II: handling MARC versions

The tool provides two levels of customization:

* project-specific tags can be defined in their own Java package, such as these classes for the Gent data: https://github.com/pkiraly/metadata-qa-marc/tree/master/src/main/java/de/gwdg/metadataqa/marc/definition/tags/genttags
* for existing tags one can use the API described below

Each MARC version has an identifier, defined in the code as an enumeration:

```Java
public enum MarcVersion {
  MARC21("MARC21", "MARC21"),
  DNB("DNB", "Deutsche Nationalbibliothek"),
  OCLC("OCLC", "OCLC"),
  GENT("GENT", "Universiteitsbibliotheek Gent"),
  SZTE("SZTE", "Szegedi Tudományegyetem"),
  FENNICA("FENNICA", "National Library of Finland")
  ;
  ...
}
```

When you add a version-specific modification, you have to use one of these values.

1. Defining version-specific indicator codes:

```Java
Indicator::putVersionSpecificCodes(MarcVersion, List);
```

`Code` is a simple object with two properties: a code and a label. Example:

```Java
public class Tag024 extends DataFieldDefinition {
  ...
  ind1 = new Indicator("Type of standard number or code")
    .setCodes(...)
    .putVersionSpecificCodes(
      MarcVersion.SZTE,
      Arrays.asList(
        new Code(" ", "Not specified")
      )
    )
  ...
}
```

2. Defining version-specific subfields:

```Java
DataFieldDefinition::putVersionSpecificSubfields(MarcVersion, List)
```

`SubfieldDefinition` contains the definition of a subfield. You can construct it with three String parameters: a code, a label, and a cardinality code which denotes whether the subfield is repeatable (`"R"`) or not (`"NR"`). Example:

```Java
public class Tag024 extends DataFieldDefinition {
  ...
  putVersionSpecificSubfields(
    MarcVersion.DNB,
    Arrays.asList(
      new SubfieldDefinition("9", "Standardnummer (mit Bindestrichen)", "NR")
    )
  );
}
```
3. Marking indicator codes as obsolete:

```Java
Indicator::setHistoricalCodes(List)
```

The list should be pairs of code and description.

```Java
public class Tag082 extends DataFieldDefinition {
  ...
  ind1 = new Indicator("Type of edition")
    .setCodes(...)
    .setHistoricalCodes(
      " ", "No edition information recorded (BK, MU, VM, SE) [OBSOLETE]",
      "2", "Abridged NST version (BK, MU, VM, SE) [OBSOLETE]"
    )
  ...
}
```

4. Marking subfields as obsolete:

```Java
DataFieldDefinition::setHistoricalSubfields(List);
```

The list should be pairs of code and description.

```Java
public class Tag020 extends DataFieldDefinition {
  ...
  setHistoricalSubfields(
    "b", "Binding information (BK, MP, MU) [OBSOLETE]"
  );
}
```

If you create a new package for a new MARC version, you have to register it in several places (a minimal sketch of a tag definition class for such a package follows these steps):

a. add a case to `src/main/java/de/gwdg/metadataqa/marc/Utils.java`:

```Java
case "zbtags": version = MarcVersion.ZB; break;
```

b. add an item to the enumeration in `src/main/java/de/gwdg/metadataqa/marc/definition/tags/TagCategory.java`:

```Java
ZB(23, "zbtags", "ZB", "Locally defined tags of the Zentralbibliothek Zürich", false),
```

c. adjust the expected number of data elements in `src/test/java/de/gwdg/metadataqa/marc/utils/DataElementsStaticticsTest.java`:

```Java
assertEquals( 215, statistics.get(DataElementType.localFields));
```

d. ... and the expected counts in `src/test/java/de/gwdg/metadataqa/marc/utils/MarcTagListerTest.java`:

```Java
assertEquals( 2, (int) versionCounter2.get(MarcVersion.ZB));
assertEquals( 2, (int) versionCounter.get("zbtags"));
```
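These registration steps assume that the new package (here the hypothetical `zbtags`) already contains at least one tag definition class. The sketch below shows roughly what such a class can look like; it is modelled on the existing classes in the `genttags` package, and the tag number, labels and helper calls are illustrative assumptions, so check an existing definition class in the repository for the exact structure before copying it.

```Java
// Illustrative sketch of a locally defined field for a hypothetical zbtags package.
// The field names (tag, label, cardinality, ind1, ind2) and helper calls mirror the
// existing tag classes, but may differ in detail from the current codebase.
public class Tag591 extends DataFieldDefinition {

  private static Tag591 uniqueInstance;

  private Tag591() {
    initialize();
    postCreation();
  }

  public static Tag591 getInstance() {
    if (uniqueInstance == null)
      uniqueInstance = new Tag591();
    return uniqueInstance;
  }

  private void initialize() {
    tag = "591";                          // the MARC tag this class defines
    label = "Local note";                 // human readable label
    cardinality = Cardinality.Repeatable; // the field may occur more than once

    ind1 = new Indicator("Undefined");    // both indicators are undefined here
    ind2 = new Indicator("Undefined");

    // subfield code, label and cardinality ("R" = repeatable, "NR" = not repeatable)
    setSubfieldsWithCardinality(
      "a", "Local note", "NR"
    );
  }
}
```

Once at least one such class exists and the package is registered (steps a-d above), the new version identifier can be used in the version-specific API calls described earlier.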
### Appendix III: Institutions which reportedly use this tool

* [Universiteitsbibliotheek Gent](https://lib.ugent.be/), Gent, Belgium
* [Biblioteksentralen](https://www.bibsent.no/), Oslo, Norway
* [Deutsche Digitale Bibliothek](https://www.deutsche-digitale-bibliothek.de/), Frankfurt am Main/Berlin, Germany
* [British Library](https://www.bl.uk/), London/Boston Spa, United Kingdom
* [Országgyűlési Könyvtár](https://www.ogyk.hu/en), Budapest, Hungary
* [Studijní a vědecká knihovna Plzeňského kraje](https://svkpk.cz/), Plzeň, Czech Republic
* [Royal Library of Belgium (KBR)](https://kbr.be/), Brussels, Belgium
* [Gemeinsamer Bibliotheksverbund (GBV)](https://www.gbv.de/informationen/Verbund/), Göttingen, Germany
* [Binghamton University Libraries](https://www.binghamton.edu/libraries/), Binghamton, NY, USA
* [Zentralbibliothek Zürich](https://www.zb.uzh.ch/de), Zürich, Switzerland

If you use this tool as well, please contact me: pkiraly (at) gwdg (dot) de. I would really like to hear about your use cases and ideas.

### Appendix IV: Supporters and Sponsors

* [Gesellschaft für wissenschaftliche Datenverarbeitung mbH Göttingen (GWDG)](https://gwdg.de): hardware, time for research
* [Gemeinsamer Bibliotheksverbund (GBV)](https://www.gbv.de/informationen/Verbund/): contracting for feature development
* [Royal Library of Belgium (KBR)](https://kbr.be/): contracting for feature development
* [JetBrains s.r.o.](https://www.jetbrains.com/idea/): [IntelliJ IDEA](https://www.jetbrains.com/idea/) development tool community licence

### Appendix V: Special build process

The "deployment" build is used when deploying artifacts to Maven Central:

```bash
mvn clean deploy -Pdeploy
```

### Appendix VI: Build Docker image

Build and test:

```bash
# create the Java library
mvn clean install
# create the docker base image
docker compose -f docker/build.yml build app
```

The `docker compose build` command accepts several `--build-arg` arguments to override defaults:

- `QA_CATALOGUE_VERSION`: the QA catalogue version (default: `0.7.0`; the current development version is `0.8.0-SNAPSHOT`)
- `QA_CATALOGUE_WEB_VERSION`: either a released version such as `0.7.0`, or `main` (default) to use the main branch, or `develop` to use the develop branch
- `SOLR_VERSION`: the Apache Solr version you would like to use (default: `8.11.1`)
- `SOLR_INSTALL_SOURCE`: if its value is `remote`, Docker downloads Solr from http://archive.apache.org/. If its value is a local path pointing to a previously downloaded package (named `solr-${SOLR_VERSION}.zip` up to version 8.x.x, or `solr-${SOLR_VERSION}.tgz` from version 9.x.x), the build copies it from the host into the image. Depending on the internet connection the download might take a long time, so using a previously downloaded package speeds up the build. (Note: files outside the current directory cannot be specified, not even via symbolic links, but you can create hard links; see the example below.)

Using the current development version:

```bash
docker compose -f docker/build.yml build app \
  --build-arg QA_CATALOGUE_VERSION=0.8.0-SNAPSHOT \
  --build-arg QA_CATALOGUE_WEB_VERSION=develop \
  --build-arg SOLR_VERSION=8.11.3
```

Using a downloaded Solr package:

```bash
# create a temporary hard link
mkdir download
ln ~/Downloads/solr/solr-8.11.3.zip download/solr-8.11.3.zip
# run docker
docker compose -f docker/build.yml build app \
  --build-arg QA_CATALOGUE_VERSION=0.8.0-SNAPSHOT \
  --build-arg QA_CATALOGUE_WEB_VERSION=develop \
  --build-arg SOLR_VERSION=8.11.3 \
  --build-arg SOLR_INSTALL_SOURCE=download/solr-8.11.3.zip
# delete the temporary link
rm -rf download
```

Then start the container with the environment variable `IMAGE` set to `metadata-qa-marc` and run analyses [as described above](#with-docker).

For maintainers only: upload to Docker Hub:

```bash
docker tag metadata-qa-marc:latest pkiraly/metadata-qa-marc:latest
docker login
docker push pkiraly/metadata-qa-marc:latest
```

Cleaning up before and after:

```bash
# stop running container
docker stop $(docker ps --filter name=metadata-qa-marc -q)
# remove container
docker rm $(docker ps -a --filter name=metadata-qa-marc -q)
# remove image
docker rmi $(docker images metadata-qa-marc -q)
# clear build cache
docker builder prune -a -f
```

Feedback is welcome!
[![Build Status](https://travis-ci.org/pkiraly/metadata-qa-marc.svg?branch=main)](https://travis-ci.org/pkiraly/metadata-qa-marc) [![Coverage Status](https://coveralls.io/repos/github/pkiraly/metadata-qa-marc/badge.svg?branch=main)](https://coveralls.io/github/pkiraly/metadata-qa-marc?branch=main) [![codecov](https://codecov.io/gh/pkiraly/metadata-qa-marc/branch/main/graph/badge.svg?token=dBPIoFd0bz)](https://codecov.io/gh/pkiraly/metadata-qa-marc) [![javadoc](https://javadoc.io/badge2/de.gwdg.metadataqa/metadata-qa-marc/javadoc.svg)](https://javadoc.io/doc/de.gwdg.metadataqa/metadata-qa-marc) [![Maven Central](https://img.shields.io/maven-central/v/de.gwdg.metadataqa/metadata-qa-marc.svg?label=Maven%20Central)](https://search.maven.org/search?q=g:%22de.gwdg.metadataqa%22%20AND%20a:%22metadata-qa-marc%22) [![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.6394934.svg)](https://doi.org/10.5281/zenodo.6394934) [![SonarCloud](https://sonarcloud.io/images/project_badges/sonarcloud-orange.svg)](https://sonarcloud.io/summary/overall?id=pkiraly_metadata-qa-marc)