HOW DO WE INDEX?: A REPORT OF SOME ASLIB INFORMATICS GROUP ACTIVITY

Date	01 January 1983
Published date	01 January 1983
Pages	1-23
DOI	https://doi.org/10.1108/eb026736
Author	KEVIN P. JONES
Subject Matter	Information & knowledge management,Library & information science

THE Journal of Documentation

VOLUME 39 NUMBER 1 MARCH 1983

HOW DO WE INDEX?: A REPORT OF SOME ASLIB INFORMATICS GROUP ACTIVITY

KEVIN P. JONES

Malaysian Rubber Producers' Research Association

The Aslib Informatics Group and its predecessor the Co-ordinate Indexing Group have made several attempts to understand the indexing process. This has been sought through seminars and indexing projects. The seminars produced some data on an ad hoc basis and although most have been assembled they have not been reported previously. More recently a formal project, involving sixteen volunteer indexers, has been organized around five short New Scientist articles and the data from this exercise form the major component in the present study. An attempt has been made to correlate indexer performance with the original texts. There appears to be evidence to support the assertion that the selection of index entries is related to the structure of the original texts, especially the frequency of individual words.

THE ASLIB INFORMATICS GROUP and its predecessor the Co-ordinate Indexing Group (CIG) have made several attempts to understand the indexing process more fully. In part, this quest has been associated with the design of thesauri, as it was considered that it was impossible to design successful thesauri without understanding the indexing process. There have now been five projects: both the earliest and most recent were conducted on an ordinary indexing basis as an individual activity; the remainder involved a degree of group participation. Only the results of the first project have been reported.1-2

The activity of indexing, as typified by Collison,3 Knight,4 and to an extent by Borko and Bernier,5 tends to be concerned with the mechanics of alphabetization, cross-indexing and the form of name, subject or 'idea' (Collison) index entries. The relationship between texts and index entries is rarely examined. Knight avoided this entirely, but Collison incorporated two relatively long textual extracts together with what he regarded as suitable sets of index entries. Nevertheless, Collison fails to establish an explicit algorithm of how one is transformed into the other. Moreover, his strictures on an example by Holmstrom6 which did attempt to link text with index entries are illuminating: 'Examples of this kind are always (present author's italics) misleading unless related to the complete work...' One would expect a book about the preparation of indexes to contain material on alphabetization and other techniques; it is the lack of any bridge between text and index entries which is strange. Interestingly, Borko and Bernier bridge this gap in a chapter on computer-aided indexing.

THE TIMES INDEXING PROJECT

The earliest project, partially surveyed by Dammers1 and Gilchrist,2 was a large venture as it involved indexing seventy-seven second leaders from The Times published between April and June 1966. Eighteen volunteers indexed all this material, a further one indexed all bar three leaders, and a further twenty-seven indexed only some of the material. Unfortunately, the amount of data produced exceeded the energy available for analysis, even with the aid of computer assistance.

A total of approximately 22,000 keywords was generated. 5,500 different keywords were produced and 56% of these were used uniquely. It must be emphasized, however, that no attempt was made to reduce dissimilarities in word morphologies; thus, governmental, government and governments were treated as three distinct keywords. Neither was any attempt made to group synonyms. Therefore, it is not surprising that only thirty-nine keywords were used more than fifty-one times. The most commonly used were Britain, government, China and politics and these were followed by Rhodesia, Indonesia, Russia, Vietnam, United Kingdom, Malaysia, education, United Nations, United States and opposition. With the exception of the Saturday issues which tended to venture far, most of the second leaders kept a close eye on the then current political scene.
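The counts reported for the Times project (total keywords, different keywords, and the share used uniquely) can be illustrated with a minimal Python sketch. It is not the analysis actually run on the project data. It reads "used uniquely" as "assigned exactly once", which is an assumption, and both the sample keywords and the `crude_stem` normaliser are hypothetical, included only to show how merging word forms such as governmental, government and governments changes the figures.

```python
from collections import Counter


def crude_stem(keyword):
    """Hypothetical, deliberately crude normaliser (not from the report).

    Lower-cases the keyword and strips one trailing "al" or "s" so that
    governmental, government and governments share a single form.
    """
    word = keyword.strip().lower()
    for suffix in ("al", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 4:
            return word[: -len(suffix)]
    return word


def keyword_stats(assignments, normalise=str.lower):
    """Summarise a flat list of keyword assignments.

    assignments: one string per keyword assigned by any indexer.
    normalise:   function applied to each keyword before counting.
    """
    counts = Counter(normalise(k.strip()) for k in assignments)
    distinct = len(counts)
    used_once = sum(1 for c in counts.values() if c == 1)
    return {
        "total": sum(counts.values()),
        "distinct": distinct,
        "used_once": used_once,
        "share_used_once": used_once / distinct if distinct else 0.0,
    }


# Hypothetical sample, far smaller than the roughly 22,000 assignments reported.
sample = ["government", "governmental", "governments",
          "Britain", "Britain", "China"]

print(keyword_stats(sample))                        # word forms kept apart
print(keyword_stats(sample, normalise=crude_stem))  # word forms merged
```

With word forms kept apart, most of the sample keywords occur only once; after merging, the three government forms count as a single keyword, so the number of different keywords and the share used uniquely both fall.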

Dammers asserted that indexer proficiency was inversely proportional to the number of unique keywords selected: CIG members used a mean of approximately seventy unique keywords per 100 documents, whereas the overall mean was approximately 130 and the non-CIG members were grouped around a geometric mean of around 220.
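The two summary figures used here, unique keywords per 100 documents and a geometric mean across indexers, can be sketched as follows. The per-indexer numbers are hypothetical and are not taken from the project; the sketch only shows how the rate is normalised and why a geometric mean, which is less influenced by a few very large values than the arithmetic mean, might be quoted for a widely spread group.

```python
from statistics import geometric_mean, mean


def unique_per_100(unique_keywords, documents_indexed):
    """Unique keywords scaled to a rate per 100 documents indexed."""
    if documents_indexed <= 0:
        raise ValueError("documents_indexed must be positive")
    return 100.0 * unique_keywords / documents_indexed


def summarise(rates):
    """Return (arithmetic mean, geometric mean) of per-indexer rates."""
    rates = list(rates)
    return mean(rates), geometric_mean(rates)


# Hypothetical indexers: (unique keywords used, documents indexed).
hypothetical_indexers = [(54, 77), (77, 77), (154, 77), (231, 77)]

rates = [unique_per_100(u, d) for u, d in hypothetical_indexers]
arithmetic, geometric = summarise(rates)
print(f"arithmetic mean: {arithmetic:.1f}, geometric mean: {geometric:.1f}")
```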

The CIG indexers were regarded as the expert group. This was probably a fair assessment, but at that time these members were pursuing a policy of minimizing indexing vocabularies (as typified in the work of Boyd,7 Rostron8 and Snel9). This approach has since been questioned:10-11 therefore, this assertion may also be of questionable validity. Unfortunately, inter-indexer consistency within the CIG indexer set was not studied. This would have been more interesting than the result that a desirable vocabulary size of about 300 was required to index the document set. It must be stressed, however, that many indexers would accept this figure as not being unreasonable for a set of seventy-seven short items.
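Inter-indexer consistency was not studied in the project, so the following is only a sketch of one way such a comparison could be made. It assumes consistency between two indexers is measured as the ratio of shared keywords to all keywords used by either (the Jaccard coefficient), averaged over every pair in the group. The choice of measure, the function names and the sample keyword sets are all assumptions, not part of the reported work.

```python
from itertools import combinations


def pair_consistency(keywords_a, keywords_b):
    """Shared keywords divided by all keywords used by either indexer."""
    a, b = set(keywords_a), set(keywords_b)
    union = a | b
    # Two empty sets are treated as fully consistent (an arbitrary convention).
    return len(a & b) / len(union) if union else 1.0


def mean_pairwise_consistency(keyword_sets):
    """Average pair_consistency over every pair of indexers for one document."""
    pairs = list(combinations(keyword_sets, 2))
    if not pairs:
        raise ValueError("need at least two indexers")
    return sum(pair_consistency(a, b) for a, b in pairs) / len(pairs)


# Hypothetical keyword sets from three indexers for a single leader.
indexer_sets = [
    {"china", "politics"},
    {"china", "government"},
    {"china", "politics"},
]

print(round(mean_pairwise_consistency(indexer_sets), 3))
```

A value near 1 would indicate that the indexers chose largely the same keywords for the document; a value near 0 would indicate little overlap.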

INDEXING SEMINAR 1977

The second project organized after a long lapse in 1977 took the form of an indexing seminar. The seventeen participants were circulated with copies of the texts prior to the meeting and were expected to come armed with appropriate index entries; the seminar was devoted to discussing the participants' sets of entries. Data capture was limited to recording by show of hands techniques. The texts (reproduced in Appendices 1 and 2) were relatively short extracts from books, consisting of three and two paragraphs respectively. One was a section on lava taken from Holmes's Principles of Physical Geology, a well-known textbook. The other was an extract concerning holistic theory from Arthur Koestler's Beyond Atomism and Holism. This probably remains the most difficult text to be tackled, and some participants questioned the value of such a text for a practical seminar. A
