Supriya Ghosh (Editor)

Index (publishing)

Updated on
Edit
Like
Comment
Share on FacebookTweet on TwitterShare on LinkedInShare on Reddit
Index (publishing)

An index (plural: usually indexes, more rarely indices; see below) is a list of words or phrases ('headings') and associated pointers ('locators') to where useful material relating to that heading can be found in a document or collection of documents. Examples are an index in the back matter of a book and an index that serves as a library catalog. In a traditional back-of-the-book index, the headings will include names of people, places, events, and concepts selected by a person (the indexer) as being relevant and of interest to a possible reader of the book. (The indexer may be the author, the editor, or a professional indexer working as a third party.) The pointers are typically page numbers, paragraph numbers or section numbers. In a library catalog the words are authors, titles, subject headings, etc., and the pointers are call numbers. Internet search engines, such as Google, and full text searching help provide access to information but are not as selective as an index, as they provide non-relevant links, and may miss relevant information if it is not phrased in exactly the way they expect.

Contents

Perhaps the most advanced investigation of problems related to book indexes is made in the development of topic maps, which started as a way of representing the knowledge structures inherent in traditional back-of-the-book indexes. The concept embodied by book indexes lent its name to database indexes, which similarly provide an abridged way to look up information in a larger collection, albeit one for computer use rather than human use.

Earliest examples in English

In the English language, indexes have been referred to as early as 1593, as can be seen from lines in Christopher Marlowe's Hero and Leander of that year:

Therefore, even as an index to a book
So to his mind was young Leander's look.

A similar reference to indexes is in Shakespeare's lines from Troilus and Cressida (I.3.344), written nine years later:

And in such indexes, although small pricks
To their subsequent volumes, there is seen
The baby figure of the giant mass
Of things to come at large.

But according to G. Norman Knight, "at that period, as often as not, by an 'index to a book' was meant what we should now call a table of contents."

Among the first indexes – in the modern sense – to a book in the English language was one in Plutarch's Parallel Lives, in Sir Thomas North's 1595 translation. A section entitled "An Alphabetical Table of the most material contents of the whole book" may be found in Henry Scobell's Acts and Ordinances of Parliament of 1658. This section comes after "An index of the general titles comprised in the ensuing Table". Both of these indexes predate the index to Alexander Cruden's Concordance (1737), which is erroneously held to be the earliest index found in an English book.

Etymology and plural

The word is derived from Latin, in which index means "one who points out", an "indication", or a "forefinger".

In Latin, the plural form of the word is indices. In English, the plural "indices" is commonly used in mathematical and computing contexts, and sometimes in bibliographical contexts – for example, in the 17-volume Women in World History: A Biographical Encyclopedia (1999–2002). However, this form is now seen as an archaism by many writers and commentators, who prefer the anglicised plural "indexes". "Indexes" is widely used in the publishing industry; in the International Standard ISO 999, Information and documentation – Guidelines for the content, organization and presentation of indexes; and is preferred by the Oxford Style Manual. The Chicago Manual of Style allows both forms.

G. Norman Knight quotes Shakespeare's lines from Troilus and Cressida (I.3.344) – "And in such indexes ..." – and comments:

"But the real importance of this passage is that it establishes for all time the correct literary plural; we can leave the Latin form "indices" to the mathematicians (and similarly "appendices" to the anatomists)."

Conventional indexing

The indexer reads through the text, identifying indexable concepts (those for which the text provides useful information and which will be of relevance for the text's readership). The indexer creates index headings to represent those concepts, which are phrased such that they can be found when in alphabetical order (so, for example, one would write 'indexing process' rather than 'how to create an index'). These headings and their associated locators (indicators to position in the text) are entered into specialist indexing software which handles the formatting of the index and facilitates the editing phase. The index is then edited to impose consistency throughout the index.

Indexers must analyze the text to enable presentation of concepts and ideas in the index that may not be named within the text. The index is intended to help the reader, researcher, or information professional, rather than the author, find information, so the professional indexer must act as a liaison between the text and its ultimate user.

In the United States, according to tradition, the index for a non-fiction book is the responsibility of the author, but most authors don't actually do it. Most indexing is done by freelancers hired by authors, publishers or an independent business which manages the production of a book, publishers or book packagers. Some publishers and database companies employ indexers.

Indexing software

Commercial software packets are available for aiding an indexer in building a book index. There are several dedicated, indexing software programs available to assist with the special sorting and copying needs involved in index preparation. The most widely known include Cindex, Macrex and SkyIndex. TExtract is a hybrid semi-automatic program combining conventional manual indexing with automated indexing features and text linking.

Embedded indexing

Embedded indexing involves including the index headings in the midst of the text itself, but surrounded by codes so that they are not normally displayed. A usable index is then generated automatically from the embedded text using the position of the embedded headings to determine the locators. Thus, when the pagination is changed the index can be regenerated with the new locators.

LaTeX documents support embedded indexes primarily through the MakeIndex package. Several widely used XML DTDs, including DocBook and TEI, have elements that allow index creation directly in the XML files. Most word processing software, such as StarWriter/OpenOffice.org Writer, Microsoft Word, and WordPerfect, as well as some desktop publishing software (for example, FrameMaker and InDesign), as well as other tools (MadCap Software's Flare)) have some facility for embedded indexing as well. TExtract supports embedded indexing of Microsoft Word documents.

An embedded index requires more time to create than a conventional static index; however, an embedded index can save time in the long run when the material is updated or repaginated. This is because, with a static index, if even a few pages change, the entire index must be recreated, while with an embedded index, only the pages that changed need updating or indexing.

Purpose

Indexes are designed to help the reader find information quickly and easily. A complete and truly useful index is not simply a list of the words and phrases used in a publication (which is properly called a concordance), but an organized map of its contents, including cross-references, grouping of like concepts, and other useful intellectual analysis.

Sample back-of-the-book index excerpt:

sage, 41-42. See also Herbs ← directing the reader to related terms Scarlet Sages. See Salvia coccinea ← redirecting the reader to term used in the text shade plants ← grouping term (may not appear in the text; may be generated by indexer) sunflower, 47 ← regular entry

In books, indexes are usually placed near the end (this is commonly known as "BoB" or back-of-book indexing). They complement the table of contents by enabling access to information by specific subject, whereas contents listings enable access through broad divisions of the text arranged in the order they occur. It has been remarked that, while "[a]t first glance the driest part of the book, on closer inspection the index may provide both interest and amusement from time to time."

Index quality

Some principles of good indexing include:

  • Ensuring each topic/section includes a variety of relevant index entries; use two or three entries per topic
  • Understanding the audience and understand what kind of index entries they're likely to look for
  • Use the same form throughout (singular vs. plural, capitalisation, etc.), using standard indexing conventions
  • Indexing pitfalls:

  • Significant topics with no index entries at all
  • Indexing 'mere mentions' --- "But John Major was no Winston Churchill..." indexed under 'Churchill, Winston'
  • Circular cross-references: 'Felidae. See Cats' --- 'Cats. See Felidae'
  • References to discussions of a single topic scattered among several main headings: 'Cats, 50-62' --- 'Felidae, 175-183'
  • Inconsistently indexing similar topics
  • Confusing similar names: Henry V of England, Henri V of France
  • Incorrect alphabetization: 'α-Linolenic acid' under 'A' instead of 'L'
  • Inappropriate inversions: 'processors, word' for 'word processors'
  • Inappropriate subheadings: 'processors: food, 213-6; word, 33-7'
  • Computer indexing from section headings: e.g. 'Getting to know your printer' under 'G'
  • Indexer roles

    Some indexers specialize in specific formats, such as scholarly books, microforms, web indexing (the application of a back-of-book-style index to a website or intranet), search engine indexing, database indexing (the application of a pre-defined controlled vocabulary such as MeSH to articles for inclusion in a database), and periodical indexing (indexing of newspapers, journals, magazines).

    Some indexers with expertise in controlled vocabularies also work as taxonomists and ontologists.

    Some indexers specialize in particular subject areas, such as anthropology, business, computers, economics, education, government documents, history, law, mathematics, medicine, psychology, and technology. An indexer can be found for any subject.

    Standards

  • ISO 999:1996 Guidelines for the Content, Organization, and Presentation of Indexes (this is also the national standard in the UK, Australia, and New Zealand)
  • Societies

    The American Society for Indexing, Inc. (ASI) is a national association founded in 1968 to promote excellence in indexing and increase awareness of the value of well-designed indexes. ASI serves indexers, librarians, abstractors, editors, publishers, database producers, data searchers, product developers, technical writers, academic professionals, researchers and readers, and others concerned with indexing. It is the only professional organization in the United States devoted solely to the advancement of indexing, abstracting and related methods of information retrieval.

    Other similar societies include:

  • Association of Southern African Indexers and Bibliographers (ASAIB)
  • Australian and New Zealand Society of Indexers
  • British Record Society
  • China Society of Indexers
  • German Network of Indexers/Deutsches Netwerk der Indexer (DNI)
  • Indexing Society of Canada/Société canadienne d'indexation (ISC/SCI)
  • Nederlands Indexers Netwerk (NIN)
  • Society of Indexers
  • References

    Index (publishing) Wikipedia