| To the reader: The following
article originally appeared in Registrar (A
publication of the Registrars Committee of the American
Association of Museums), Vol. 4, No. 2 (Fall 1987), 2-6.
While several of its examples are now clearly outdated,
the issues addressed are still relevant and should prove
valuable to museum workers embarking on their first
automated collection management project. |
[Home Page]
COMPUTERIZED
COLLECTION MANAGEMENT:
A PRIMER
Robert A. Baron
Museum Computer Consultant
For registrars of small museums and large, the
prospect of using computerized collection management
systems is becoming ever more attractive, particularly as
resources become more scarce, and additional
responsibilities draw upon their time. But, as many who
have tried to use computers have found out, not all
computerization projects fulfill their promise. This
article discusses some of the benefits and limitations of
computerized collection control, and surveys some of the
techniques and programs accessible to registrars.
While computers have more readily served museum needs
as diverse as membership, development and fund
accounting, other tasks, such as collection management,
including accession control and collection description,
have not been so easily assigned to the computer.
Recent database management systems have become more
responsive to the complex data types required by
historical artifacts. Adequate collection management
programs are now available, aided by advances in
video-disk technology and digital imaging techniques.
SELECTED PROBLEMS
Object Tracking
The task of "Collection-Management" should
be understood as a fusion of two separate, but
overlapping functions: Object Description, and Object
Tracking.
Although operationally complex, "Object
Tracking" is but a particular implementation of what
database managers call "inventory control."
These programs record the status of objects, track loans,
monitor acquisitions and exhibitions. They record object
disposition, storage locations, keep tabs on lenders,
borrowers, donors, and the expenses and activities
associated with object management. Loan agreements,
receipts, insurance lists, and tabulations of works to be
prepared for shipment, and objects due for return, are
generated by tracking modules.
Many object management systems developed from a need
to establish an inventory. Even the simplest computer
inventory, composed of little more than accession numbers
and object classifications, may have incalculable
value--indexing old filing schemes, comparing object
lists to card lists, identifying missing cards and
objects.
To facilitate monitoring object transactions, simple
inventory databases may be augmented by files containing
related data, such as borrowers and lenders. Were
tracking functions the only components needed, almost any
competent database developer would be able to construct
an object management system--many have tried. But when
standard accessioning forms, worksheets, lists of
collection categories, sorted by artist, style, subject,
material or culture, are added to make collection control
more manageable, intelligible, and provide public access,
inventory control systems must be enlarged by adding
object classifications and descriptions. No longer an
administrative machine, the database now must cope with
the scholarly vocabulary of the humanities. This is why
it been so difficult to create credible museum databases.
Object Description
The most severe obstacles to this endeavor lie in the
process of describing man-made objects. To make a
computer successfully sort and select data, one must obey
conventions specified by the database program, while
imposing standard vocabularies--these conventions do not
necessarily correspond to the way historians and curators
traditionally classify objects.
Therefore, in the past, successful object management
has had to impose its own conventions. This required 1)
defining the kinds of data to collect, 2) deciding how to
control the nomenclature used, and 3) determining what to
do with repeating and alternate values, including
conflicting and former opinions--usually ignored in the
face of the difficulty of the first two tasks.
Furthermore, each of these issues has been made crucial
by the limited database management systems available for
use in small computer environments.
Because each museum, collection, period, culture and
object type, ideally requires its own unique set of data
fields, it has been impossible to set uniform standards.
To fit countless varieties of objects into one system,
designers often must choose between establishing a set of
common, but inadequate number of data fields, or placing
diverse collections into disconnected databases.
The problem of turning an object's critical history
into retrievable data is rarely considered. Although
dedicated to the business of the day, the registrar
certainly must have authentic records reflecting current,
and even historical opinion. Information is dynamic--each
object has its own history. Because current conceptions
become past opinions, museum databases cannot ideally
assume otherwise.
Authority Lists and Nomenclature
As all who are concerned with museum computerization
projects are aware, success or failure of data entry will
often depend upon establishing conventionalized
terminologies. Unfortunately, the more each system
strives for internal consistency, the more obvious
becomes the difficulty of regulating systematic data
elements within each institution. Intra-institutional
vocabulary control is expensive, parochial, and rarely
solves the problem of communicating with outsiders.
Moreover, although some museums develop style sheets
and lexicons in preparation for current or future
computer projects, the terms and practices defined
therein are often determined by a mix of institutional
traditions and personal preferences. Even so, not all
museums have the resources to follow their own style
sheets and conventions.
To attain cross-institutional consistency in
geographical, cultural, material, biographical,
historical, stylistic, and other terminologies,
nomenclature projects such as Chenhall, Icon Class, the
Getty Art and Architecture Thesaurus (AAT), and others
were pursued. In spite of their undeniable value, these
efforts do not fully address the workday problems of how
extant, often old records are to be reconciled to the new
conventions without extensive revision.
Determining Needs
Of course, for each institution considering adopting a
computerized collection management system, the importance
given to the above questions largely depends upon the
nature and the size of each collection--upon those
functions the database must serve, who is to use it and
care for it, and upon how it might serve the public.
Small specialized collections will need different types
of systems than large diversified ones. New collections
have different needs than old collections. The
requirements of academic institutions differ from private
and public galleries and museums.
Each need demands its own solution. For example, if
object management databases are to be used to produce
publishable lists of objects suitable for catalogues,
copy for wall labels, and official credit lines, custom
data fields may have to be fashioned to accommodate an
official terminology which differs from the nomenclature
used for queries.
Video systems merge object images with the database,
but not all video linkages are the same. Some systems
work well to document the collection and defend it
against potentially harmful handling, while significantly
reducing the need for staff to retrieve items for
verification and identification, thereby saving time and
personnel. Others serve these purposes less well, but
offer high quality manipulatable images of great value to
conservators and curators. Some query fast, some slow,
some offer high quality, some low, some use vast data
storage reserves, others do not interfere with normal
data files.
Therefore, when selecting an imaging system, many
questions must be answered: How are new images added; how
much data storage will be used; will production be too
costly? Is the system too advanced for the intended
function (read too expensive)? Can the entire collection
be recorded in reasonable time. Are images linked to the
query apparatus? How swiftly can groups of images appear
during queries? Will one user's call for images prevent
other users from using the system effectively? Can query
results be saved with images; can images be reproduced on
paper?
SELECTED SOLUTIONS
Choosing a packaged computer system to manage a
collection is a more complex endeavor than just locating
a program which meets functional specifications and
provides necessary forms and fields. One must be aware of
the mechanism each system uses to achieve its ends; know
whether the program and the equipment on which it runs
can comfortably support the intended number of users
without serious degradation; whether it is wiser for the
machinery to be shared with other applications, or to
remain independent of word-processing, membership, and
financial management. One must determine if the machinery
offers proper storage capacity, and how quickly it
answers queries, among other technical matters.
Some systems use outdated operating systems or old
technologies which offer little opportunity for
enhancement, and others tie the user to specific brands
of equipment. On occasion, municipal and collegiate
collections have been urged to share time on the mother
main-frame, where they may be placed in a chain of access
priorities, and isolated from all developments in the
field. Whether these solutions are workable or not is not
always obvious.
One benchmark of a program's sophistication may be
seen in the way it handles repeating data. For example,
if a program supports two artists for each object, it is
vital to know how it works. Can a query in the first
artist field find that artist when listed in the second
artist field? Whereas some programs are severely limited
in these matters, others are more flexible, permitting
all makers ascribed to an object to be entered--each
conditioned by attribution parameters, e.g. "In the
manner of Rosso Fiorentino". Sophisticated programs
create master lists of authoritative field values (such
as artist names), validating every new entry by searching
these lists for approval. Others are even more efficient,
and maintain a lexicon of valid entries, linking each to
the object record, forcing approval of new values.
Obviously, any decision to use an object management
system, simple or sophisticated, must be made with full
knowledge of each package's limitations and capabilities.
The correct system for one museum may be wrong for
another. Some institutions may best be served by not
computerizing.
Because choosing or developing a database system is
always complex, it may be wise for any museum considering
computerizing to experiment with simple solutions first.
This process will help staff more fully understand and
assess its needs and database requirements. Perhaps a low
level of computerization will prove adequate. Impulsive
overkill may be as inefficient as underkill, but a lot
more expensive.
With the above in mind, surveyed below are the most
popular methods of using computers to manage objects in
the registrar's office.
Word-processor Lists
Many museums successfully rely solely upon
word-processors for project and list management. The
word-processor's ability to edit lists simplifies many
management tasks--itemizing new acquisitions, enumerating
objects in exhibitions, or sending
"mail-merged" letters. As long as these
assignments are finite in scope and duration, of defined
purpose and limited size, and restricted to one person or
one office, word-processor lists have value. Their
disadvantages lie in their uncontrolled structure, and in
their specialized nature. These lists are always
difficult to query, impossible to sort by specified
criteria, and offer no ability to tally classifications.
Rigorous conventions, strictly imposed, might make
word-processing lists work; but in the end, the effort
may not be useful for any project which must provide
lasting data to be used for any other than its primary
purpose.
Text-Bases
Text-bases index free-form word-processing documents,
providing very quick access to masses of unstructured
materials.
Typically, all words, or designated key-words are
registered by these programs. When a document is needed,
the user forms a query by stringing key words together,
linking them with AND, OR, and NOT. For example, one may
ask for all references to "Rembrandt,"
excluding all entries with "Peale". Even though
conventionalized vocabulary is rarely imposed on
text-bases, some of the problems associated with the lack
of defined nomenclature may be averted with clever
queries. Text-bases may be useful for research, but no
administrative system should rely upon instinct and
ingenuity as a means of finding data.
Still, text-bases are intriguing as quick means of
turning chaotic curatorial and object files into machine
readable form--as long as one is willing to accept the
clutter found in these files. Text-scanning readers are
recommended; today many can read common mono-spaced
fonts.
The best text-base systems for the IBM PC family are ZyIndex,
FYI-3000, and the academic word-processor Nota
Bene. For refined applications Star, by Cuadra
is well respected. Star allows field definitions and
cross-field indexed queries, and accommodates dynamically
changing data, unlike the others. None permit the kind of
transactional programming registrars need to maintain
object histories and record changing conditions. Star,
however, does provide elaborate vocabulary checking and
mimics some relational features of advanced database
systems, but its forté lies in object description and
bibliographic citation, not in object management.
Flat File Databases
Any user may appreciate the complexities of art
historical and curatorial databases by trying to create
one himself. Simple flat-file database management systems
are the closest computer equivalent to the traditional
index-card file. For the IBM PC and compatibles, PC-FILE
is a popular favorite flat-file system. Reviews of others
regularly appear in the pages of PC Magazine and PC
World, and other journals.
Flat-file databases can be used to create computer
lists of all kinds: names and addresses, telephone
numbers, mailing labels, and simple inventory management
utilities. Their querying, sorting and reporting
functions are typically slow, however, and their file
sizes may be limited.
The prime disadvantage of such systems, however, lies
in their fragmented approach to database design. Data is
not entered economically, repeating data elements must be
entered into each record. Few controls are available to
manage data. These programs waste valuable computer
storage space, and tend to accept errors easily.
Overlapping flat-file applications do not interact with
each other, requiring time-consuming maintenance to keep
them current.
Relational Databases
Most relational databases are composed of flat-file
components linked by common data types. Joining files to
each other fabricates powerful "one-to-many" or
"many-to-many" relationships. Thus, a single
borrower, cited once, may be tied to all works he
borrowed, or one donor linked to all his gifts, or one
artist record attached to all objects attributed to him.
To be most effective, relational database systems,
such as Dbase III Plus, R:Base System V,
and Informix, must usually be constructed and
programmed by specialists in database design--but some
allow knowledgeable users to create simple applications
with relative ease. Unfortunately, most professional
database programmers are not sufficiently familiar with
the unique properties of object data and with scholarly
issues to design respectable museum systems--and curators
and registrars are not yet adequately versed in the
intricacies of database management systems to insure that
their needs are being provided for correctly and
efficiently. For complex applications, museums would be
best advised to turn to someone familiar with both museum
practices and database systems.
In each database management program there are
limitations which may affect the utility of complex
applications. Some put severe restraints on the number of
files which may be opened on a screen at once--limiting
the number of definable relationships. When the limit is
reached, complex needs soon turn fine relational
databases into series of related flat-file systems. Some
packages, R:Base, for example, keep all data in
one file--prohibiting distribution and separation of data
among storage devices. Some systems have inflexible
sorting rules, others limit field size, and so on.
Most personal computer database applications suffer
from two fundamental data limitations: fields cannot
accept multiple values and must have fixed length.
Fixed-length fields require strict economies of database
design and exact countless compromises. Fixed-length
fields demand their defined space, whether they are
filled or not. Obviously, one must avoid defining
specialized fields for object descriptions if they will
rarely be used.
When fields are single-valued, meaning that they can
be filled only once--to the limit of available space--the
user must decide what to enter and what to omit, and must
decide which of several elements is most important.
Sorting and reporting act on the first value listed,
searches are often slower. "Plaster and wood"
is different than "wood and plaster." On the
other hand, multi-valued fields, often defined for artist
names, for stylistic or cultural groupings, or for media
and/or supports, treat each entity equally.
Since the storage needed by variable-length fields
depends upon the number of characters actually used,
there may be no practical limit to the text entered. When
citing the title of a work, correct use of multi-valued
and variable-length fields permits one to record as many
titles or subjects as required to identify an
object--each may be independently queried and sorted.
Some database management systems designed for small
computers (e.g. Oracle and Revelation) are
beginning to offer multi-valued and variable-length
fields. The Pick operating system (which Revelation
copies) is well known for these features--now
incorporated into the newest generation of commercial
object management databases.
Dedicated Object Management Systems
Museum databases and their vendors are abundant at AAM
meetings. The most visible of these are ARTIS, Willoughby
Associates, and Questor Systems. Each provides
respectable services, and has its own position regarding
how to computerize collections.
When choosing a prepared package for object management
over an expensive customized system, the consumer usually
must learn to adapt his operations to the system
provided, but obtains the benefits of collected
experience. Some packages are fixed in shape, while
others permit customizations. Some firms, such as
Willoughby, have developed paper based report systems to
simplify fixing erroneous data. Another, Questor, avoids
paper reports, providing, instead, internal means of
validating and structuring data elements.
Differences are apparent in approaches to vocabulary
control. In the ARTIS package, the artist file employs a
list of artist names which must be manually queried for
validation. In its more advanced (promised) packages,
Willoughby Associates expects to provide automatic
referencing of predefined vocabulary lists for artists'
biographies and other data fields, as well as hierarchic
query structures for searching sub-terms automatically
when parent or equivalent terms have been specified.
Hierarchically controlled lexicons were pioneered by
Questor Systems of Los Angeles. Questor's, ARGUS, created
for ethnographic collections, is being adapted to the
needs of fine arts museums. Pick-based ARGUS contains
multi-valued and variable-length fielding, supports
multiple users, security levels, and offers an optional
video-disk add-on.
Hierarchic querying may be the single most important
achievement in museum database technology, resolving many
of the authority control problems responsible for earlier
failures. For example, a query for "Florentine
Renaissance paintings," should find works listed as
"Florentine," as well as those cited as made in
"Florence" and "Fierenza." If
"Florentine" proves too fine, the user may
choose a more general term, perhaps "Tuscan,"
which may include works catalogued as made in Tuscany,
Toscana, and the towns therein. Furthermore, now only one
field is needed to identify the geographical origin of a
work--fields labeled "country,"
"region," and "city" are unnecessary.
Fewer fields serve where more had been required.
Although hierarchic fields offer distinct advantages
over those defined for rigid nomenclatures, using such
hierarchies confers its own complications. To invest
these queries with power, terminology must be structured,
though not at the field level, below it. Because, terms
are input as found in extant records, little revision is
necessary during data entry, but the relationships and
authorities must still be supplied by structuring real
records, and by inputting defined ordered
vocabularies--no small task.
CONCLUSION
Choosing an object management system must not be taken
lightly. The expenses incurred in purchasing a system,
and entering data, may be small compared to the
investment of will-power needed to maintain it. Good
functional systems have been abandoned for little reason,
save lack of resolve to service them. Museums beginning
the process of computerization might determine their
needs, consider the financial investment required, assay
their determination by experimenting, and then choose a
system of adequate sophistication to accomplish their
goals.
* * *
For further reading:
1. Peter H. Welsh and Steven A. LeBlanc,
"Computer Literacy and Collection
Management," Museum News, June 1987, 42.
2. Richard B. Light, D. Andrew Roberts and
Jennifer D. Stewart, eds., Museum Documentation
Systems: Developments and Applications, London,
1986.
3. Lenore Sarasan, "Why Museum Computer
Projects Fail," Museum News, Jan/Feb,
1981, 40.
[Go to Top]
[Home Page]
|